Query         lcl|Aclame:protein:vir:1025|NCBI_annot:capsid protein|genbank:acc:NP_076679;genbank:gi:13095788;genbank:GeneID:920362
Match_columns 408
No_of_seqs    121 out of 1069
Neff          9.7
Searched_HMMs 1612
Date          Sat Nov 30 04:20:39 2013
Command       /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_45 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_45_vs_rec_db.hhr

 No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
  1 protein:vir:1025 Length: 408 # 100.0 4.1E-91 2.6E-94 516.1 41.5 408 1-408 1-408 (408)
  2 protein:vir:7409 Length: 408 # 100.0 8.3E-90 5.1E-93 509.0 42.5 408 1-408 1-408 (408)
  3 protein:vir:3991 Length: 404 # 100.0 6E-86 3.7E-89 487.8 42.1 404 1-404 1-404 (404)
  4 protein:vir:4953 Length: 397 # 100.0 2.7E-81 1.7E-84 462.3 40.4 397 1-405 1-397 (397)
  5 protein:vir:4997 Length: 397 # 100.0 1.8E-80 1.1E-83 457.7 40.2 397 1-405 1-397 (397)
  6 protein:vir:4830 Length: 397 # 100.0 1E-78 6.4E-82 448.1 40.6 397 1-405 1-397 (397)
  7 protein:vir:3845 Length: 395 # 100.0 3E-75 1.8E-78 429.2 40.0 390 5-406 1-395 (395)
  8 protein:vir:1268 Length: 397 # 100.0 4.6E-73 2.8E-76 417.2 39.1 388 1-393 1-397 (397)
  9 protein:vir:4456 Length: 401 # 100.0 3.2E-72 2E-75 412.5 39.3 373 1-393 1-401 (401)
 10 protein:vir:485 Length: 407 # 100.0 4.7E-72 2.9E-75 411.6 39.2 379 4-400 1-407 (407)
 11 protein:vir:102873 Length: 392 100.0 5E-72 3.1E-75 411.5 39.2 382 1-401 1-392 (392)
 12 protein:vir:107593 Length: 392 100.0 5E-72 3.1E-75 411.5 39.2 382 1-401 1-392 (392)
 13 protein:vir:102082 Length: 392 100.0 5E-72 3.1E-75 411.5 39.2 382 1-401 1-392 (392)
 14 protein:vir:105004 Length: 392 100.0 5E-72 3.1E-75 411.5 39.2 382 1-401 1-392 (392)
 15 protein:vir:81160 Length: 371 100.0 5E-72 3.1E-75 411.5 37.4 366 1-393 1-371 (371)
 16 protein:vir:102119 Length: 404 100.0 5.9E-71 3.6E-74 405.6 38.7 385 1-397 1-404 (404)
 17 protein:vir:100884 Length: 389 100.0 4.8E-70 3E-73 400.6 38.9 379 7-400 1-389 (389)
 18
protein:vir:3870 Length: 400 # 100.0 8E-70 5E-73 399.4 39.7 382 1-394 1-400 (400) 19 protein:vir:100172 Length: 394 100.0 1.2E-69 7.7E-73 398.4 39.1 383 7-406 1-394 (394) 20 protein:vir:1383 Length: 421 # 100.0 1E-69 6.3E-73 398.8 38.1 393 1-408 1-403 (421) 21 protein:vir:1084 Length: 437 # 100.0 1E-69 6.5E-73 398.8 35.9 392 1-408 1-437 (437) 22 protein:vir:100247 Length: 425 100.0 5.2E-69 3.2E-72 395.0 38.1 373 1-394 1-425 (425) 23 protein:vir:9704 Length: 394 # 100.0 9.2E-68 5.7E-71 388.1 38.3 376 1-397 1-394 (394) 24 protein:vir:4511 Length: 409 # 100.0 5.3E-68 3.3E-71 389.4 36.7 380 5-396 1-409 (409) 25 protein:vir:4700 Length: 415 # 100.0 2.7E-67 1.6E-70 385.6 39.9 383 1-404 1-415 (415) 26 protein:vir:4600 Length: 415 # 100.0 2.7E-67 1.6E-70 385.6 39.9 383 1-404 1-415 (415) 27 protein:vir:81100 Length: 415 100.0 3.4E-67 2.1E-70 385.0 39.4 383 1-404 1-415 (415) 28 protein:vir:98339 Length: 415 100.0 3.4E-67 2.1E-70 385.0 39.4 383 1-404 1-415 (415) 29 protein:vir:79987 Length: 415 100.0 3.4E-67 2.1E-70 385.0 39.4 383 1-404 1-415 (415) 30 protein:vir:9410 Length: 415 # 100.0 1E-66 6.2E-70 382.4 40.1 383 1-404 1-415 (415) 31 protein:vir:6212 Length: 434 # 100.0 2.4E-66 1.5E-69 380.4 36.0 380 1-398 1-434 (434) 32 protein:vir:4856 Length: 293 # 100.0 1.1E-67 6.8E-71 387.7 26.8 293 112-405 1-293 (293) 33 protein:vir:962 Length: 397 # 100.0 2E-65 1.3E-68 375.3 36.2 380 1-393 1-397 (397) 34 protein:vir:4339 Length: 395 # 100.0 8.8E-64 5.4E-67 366.3 37.6 374 1-393 1-395 (395) 35 protein:vir:100135 Length: 418 100.0 3.3E-63 2E-66 363.1 37.8 383 1-396 4-418 (418) 36 protein:vir:1328 Length: 392 # 100.0 3.8E-63 2.4E-66 362.8 37.6 369 1-394 1-392 (392) 37 protein:vir:191 Length: 385 # 100.0 6.1E-63 3.8E-66 361.7 37.1 369 1-394 1-385 (385) 38 protein:vir:1886 Length: 385 # 100.0 6.1E-63 3.8E-66 361.7 37.1 369 1-394 1-385 (385) 39 protein:vir:6242 Length: 390 # 100.0 7.1E-63 4.4E-66 361.3 36.8 369 1-394 1-390 (390) 40 protein:vir:101650 Length: 497 100.0 5.5E-63 3.4E-66 361.9 
34.7 384 1-397 1-497 (497) 41 protein:vir:7855 Length: 497 # 100.0 5.5E-63 3.4E-66 361.9 34.7 384 1-397 1-497 (497) 42 protein:vir:10364 Length: 390 100.0 2.1E-62 1.3E-65 358.7 35.9 368 3-391 1-390 (390) 43 protein:vir:81070 Length: 390 100.0 8.4E-62 5.2E-65 355.4 36.1 368 1-391 1-390 (390) 44 protein:vir:94424 Length: 387 100.0 4.6E-62 2.8E-65 356.9 33.8 367 1-399 1-387 (387) 45 protein:vir:96978 Length: 387 100.0 4.6E-62 2.8E-65 356.9 33.8 367 1-399 1-387 (387) 46 protein:vir:2685 Length: 387 # 100.0 4.6E-62 2.8E-65 356.9 33.8 367 1-399 1-387 (387) 47 protein:vir:93881 Length: 387 100.0 1.1E-61 7.1E-65 354.7 34.9 366 1-399 1-387 (387) 48 protein:vir:97053 Length: 390 100.0 2.9E-61 1.8E-64 352.5 36.1 368 3-391 1-390 (390) 49 protein:vir:9361 Length: 402 # 100.0 1E-61 6.5E-65 354.9 33.5 370 1-399 13-402 (402) 50 protein:vir:101607 Length: 379 100.0 4.5E-61 2.8E-64 351.4 36.8 371 1-393 1-379 (379) 51 protein:vir:81227 Length: 413 100.0 5.8E-61 3.6E-64 350.8 35.7 378 6-396 1-413 (413) 52 protein:vir:95376 Length: 425 100.0 1.7E-60 1.1E-63 348.2 37.8 380 1-397 1-425 (425) 53 protein:vir:1433 Length: 435 # 100.0 1.9E-61 1.2E-64 353.4 32.4 376 5-395 1-435 (435) 54 protein:vir:8102 Length: 543 # 100.0 3.2E-60 2E-63 346.7 37.4 370 1-394 143-543 (543) 55 protein:vir:94673 Length: 419 100.0 4.6E-60 2.8E-63 345.9 38.0 382 1-395 1-419 (419) 56 protein:vir:80376 Length: 435 100.0 1E-60 6.4E-64 349.5 32.3 376 5-395 1-435 (435) 57 protein:vir:104256 Length: 458 100.0 3.5E-59 2.2E-62 341.1 36.5 382 1-393 1-458 (458) 58 protein:vir:105038 Length: 428 100.0 2.6E-59 1.6E-62 341.8 34.2 374 1-393 1-428 (428) 59 protein:vir:8420 Length: 477 # 100.0 5.4E-58 3.3E-61 334.6 35.4 390 1-399 1-477 (477) 60 protein:vir:78640 Length: 352 100.0 2.1E-57 1.3E-60 331.3 30.0 332 39-399 1-352 (352) 61 protein:vir:98635 Length: 377 100.0 1.9E-57 1.2E-60 331.6 28.6 342 1-393 1-377 (377) 62 protein:vir:4092 Length: 390 # 100.0 3.3E-56 2.1E-59 324.7 33.9 354 7-408 1-385 (390) 63 protein:vir:93616 Length: 
645 100.0 1.1E-55 6.7E-59 321.9 34.7 384 1-399 193-645 (645) 64 protein:vir:95963 Length: 395 100.0 7.3E-55 4.6E-58 317.4 33.0 358 1-408 1-391 (395) 65 protein:vir:80128 Length: 466 100.0 5.1E-54 3.1E-57 312.8 33.6 389 1-408 1-463 (466) 66 protein:vir:7771 Length: 330 # 100.0 2.4E-55 1.5E-58 320.1 25.2 285 108-400 1-330 (330) 67 protein:vir:41 Length: 299 # N 100.0 1.9E-55 1.2E-58 320.6 24.1 273 111-394 1-299 (299) 68 protein:vir:97148 Length: 324 100.0 2.5E-54 1.6E-57 314.4 27.9 298 84-402 1-324 (324) 69 protein:vir:9574 Length: 300 # 100.0 8.1E-55 5E-58 317.1 25.0 270 116-393 1-300 (300) 70 protein:vir:1638 Length: 298 # 100.0 9.5E-55 5.9E-58 316.8 25.3 266 120-392 1-298 (298) 71 protein:vir:9643 Length: 377 # 100.0 1.8E-53 1.1E-56 309.8 32.1 334 1-393 1-377 (377) 72 protein:vir:9309 Length: 324 # 100.0 8E-54 5E-57 311.7 27.9 298 84-408 1-324 (324) 73 protein:vir:8187 Length: 311 # 100.0 2.8E-54 1.7E-57 314.2 25.2 270 118-394 1-311 (311) 74 protein:vir:9759 Length: 303 # 100.0 2.9E-54 1.8E-57 314.1 25.2 269 118-393 1-303 (303) 75 protein:vir:78830 Length: 324 100.0 1.1E-53 6.6E-57 311.0 27.7 298 84-408 1-324 (324) 76 protein:vir:96392 Length: 324 100.0 1.1E-53 6.6E-57 311.0 27.7 298 84-408 1-324 (324) 77 protein:vir:105905 Length: 304 100.0 3E-54 1.9E-57 314.0 24.1 271 108-392 1-304 (304) 78 protein:vir:94142 Length: 304 100.0 3E-54 1.9E-57 314.0 24.1 271 108-392 1-304 (304) 79 protein:vir:4226 Length: 326 # 100.0 3.2E-54 2E-57 313.9 23.9 289 82-396 1-326 (326) 80 protein:vir:80684 Length: 315 100.0 5E-54 3.1E-57 312.8 24.9 279 116-402 1-315 (315) 81 protein:vir:99749 Length: 324 100.0 1.9E-53 1.2E-56 309.6 27.8 298 84-408 1-324 (324) 82 protein:vir:2430 Length: 318 # 100.0 1.3E-53 8E-57 310.5 26.3 286 101-398 1-318 (318) 83 protein:vir:103955 Length: 324 100.0 2.3E-53 1.4E-56 309.2 27.5 298 84-408 1-324 (324) 84 protein:vir:96223 Length: 324 100.0 4E-53 2.5E-56 307.8 27.5 298 84-408 1-324 (324) 85 protein:vir:94771 Length: 298 100.0 1.5E-53 9E-57 310.3 24.9 266 
120-392 1-298 (298) 86 protein:vir:2344 Length: 397 # 100.0 1.5E-53 9E-57 310.3 24.6 291 105-408 1-322 (397) 87 protein:vir:100632 Length: 381 100.0 1.2E-52 7.5E-56 305.2 28.8 343 1-406 1-381 (381) 88 protein:vir:78523 Length: 338 100.0 6E-53 3.7E-56 306.9 26.3 286 107-396 1-338 (338) 89 protein:vir:101291 Length: 381 100.0 2.5E-52 1.5E-55 303.5 28.8 339 1-404 1-381 (381) 90 protein:vir:9509 Length: 381 # 100.0 2.5E-52 1.5E-55 303.5 28.8 339 1-404 1-381 (381) 91 protein:vir:78350 Length: 383 100.0 6.4E-52 3.9E-55 301.3 29.1 353 1-403 1-383 (383) 92 protein:vir:2504 Length: 305 # 100.0 8.7E-53 5.4E-56 306.0 24.2 271 116-400 1-305 (305) 93 protein:vir:5739 Length: 366 # 100.0 1.1E-52 6.8E-56 305.5 23.9 317 63-393 1-366 (366) 94 protein:vir:95763 Length: 297 100.0 1.1E-52 6.6E-56 305.5 23.7 272 108-394 1-297 (297) 95 protein:vir:104085 Length: 320 100.0 4.3E-52 2.7E-55 302.2 25.8 284 101-396 1-320 (320) 96 protein:vir:78223 Length: 333 100.0 6.4E-52 4E-55 301.2 25.7 283 105-395 1-333 (333) 97 protein:vir:99920 Length: 311 100.0 5.2E-52 3.2E-55 301.7 23.8 272 116-393 1-311 (311) 98 protein:vir:96762 Length: 632 100.0 2.6E-50 1.6E-53 292.4 29.6 368 1-392 207-632 (632) 99 protein:vir:97397 Length: 517 100.0 6E-41 3.7E-44 241.1 27.6 369 1-396 124-517 (517) 100 protein:vir:4159 Length: 315 # 100.0 1.6E-41 1E-44 244.2 21.1 283 91-392 1-315 (315) 101 protein:vir:4197 Length: 314 # 100.0 2.4E-41 1.5E-44 243.3 21.5 283 98-396 1-314 (314) 102 protein:vir:4074 Length: 480 # 100.0 3.6E-38 2.2E-41 225.8 22.8 352 1-396 111-480 (480) 103 protein:vir:3158 Length: 321 # 100.0 9.3E-35 5.8E-38 207.2 22.6 292 92-403 1-321 (321) 104 protein:vir:3033 Length: 272 # 99.9 1.1E-28 6.9E-32 173.8 22.2 266 116-396 1-272 (272) 105 protein:vir:9820 Length: 272 # 99.9 1.1E-28 6.9E-32 173.8 22.2 266 116-396 1-272 (272) 106 protein:vir:3613 Length: 272 # 99.8 2E-21 1.2E-24 134.0 18.4 267 116-393 1-272 (272) 107 protein:vir:93742 Length: 274 99.8 1.2E-19 7.3E-23 124.4 20.9 265 116-397 1-274 (274) 108 
protein:vir:79928 Length: 393 99.8 1.2E-19 7.5E-23 124.3 20.7 348 21-403 1-393 (393) 109 protein:vir:94933 Length: 330 99.7 1.6E-18 9.8E-22 118.2 18.1 294 77-394 1-330 (330) 110 protein:vir:105334 Length: 276 99.7 6.4E-18 4E-21 114.8 20.3 267 116-399 1-276 (276) 111 protein:vir:96123 Length: 274 99.7 1.1E-17 6.9E-21 113.5 19.8 265 116-397 1-274 (274) 112 protein:vir:80930 Length: 278 99.7 1.3E-17 8.2E-21 113.1 19.6 264 116-394 1-278 (278) 113 protein:vir:97433 Length: 274 99.6 6.9E-17 4.3E-20 109.2 20.9 265 116-397 1-274 (274) 114 protein:vir:94494 Length: 274 99.6 6.9E-17 4.3E-20 109.2 20.9 265 116-397 1-274 (274) 115 protein:vir:96833 Length: 275 99.6 3.5E-17 2.2E-20 110.8 19.2 266 114-397 1-275 (275) 116 protein:vir:95107 Length: 270 99.6 1.6E-16 9.6E-20 107.3 19.4 266 116-398 1-270 (270) 117 protein:vir:1239 Length: 274 # 99.6 6.6E-16 4.1E-19 103.8 19.4 265 116-397 1-274 (274) 118 protein:vir:93858 Length: 400 99.5 5.5E-15 3.4E-18 98.7 22.9 363 1-391 8-400 (400) 119 protein:vir:96262 Length: 274 99.5 2.6E-15 1.6E-18 100.5 20.2 265 116-397 1-274 (274) 120 protein:vir:95898 Length: 274 99.5 2.6E-15 1.6E-18 100.5 20.2 265 116-397 1-274 (274) 121 protein:vir:8324 Length: 410 # 99.5 4.4E-15 2.7E-18 99.3 20.7 364 1-391 1-410 (410) 122 protein:vir:739 Length: 231 # 99.4 1E-14 6.5E-18 97.2 15.3 228 150-393 1-231 (231) 123 protein:vir:97255 Length: 310 99.4 1.4E-13 8.5E-17 91.1 21.0 270 116-392 1-310 (310) 124 protein:vir:99424 Length: 360 99.3 2.9E-12 1.8E-15 83.9 21.5 290 84-396 1-360 (360) 125 protein:vir:7990 Length: 273 # 99.1 3.3E-11 2E-14 78.1 16.8 258 116-393 1-273 (273) 126 protein:vir:105822 Length: 273 99.0 5.9E-11 3.7E-14 76.7 17.9 258 116-393 1-273 (273) 127 protein:vir:102605 Length: 273 99.0 5.9E-11 3.7E-14 76.7 17.9 258 116-393 1-273 (273) 128 protein:vir:8885 Length: 347 # 99.0 4.4E-11 2.7E-14 77.4 15.1 282 105-394 1-347 (347) 129 protein:vir:108211 Length: 318 99.0 2.7E-11 1.7E-14 78.6 13.8 274 101-394 1-318 (318) 130 protein:vir:80213 Length: 334 99.0 
5.5E-11 3.4E-14 76.8 14.9 280 98-395 1-334 (334) 131 protein:vir:94576 Length: 347 98.9 1.4E-10 8.9E-14 74.5 14.5 278 98-393 1-347 (347) 132 protein:vir:5974 Length: 324 # 98.8 2.1E-09 1.3E-12 68.1 18.5 277 116-408 1-304 (324) 133 protein:vir:100057 Length: 375 98.8 3.9E-09 2.4E-12 66.7 18.8 284 97-400 1-375 (375) 134 protein:vir:2201 Length: 345 # 98.7 1.5E-09 9.5E-13 68.9 15.5 277 105-393 1-345 (345) 135 protein:vir:3364 Length: 347 # 98.7 1.1E-09 6.7E-13 69.8 14.3 281 98-395 1-347 (347) 136 protein:vir:94622 Length: 341 98.7 1.6E-09 9.7E-13 68.9 14.2 281 105-397 1-341 (341) 137 protein:vir:1541 Length: 347 # 98.7 4.7E-09 2.9E-12 66.2 16.5 282 98-395 1-347 (347) 138 protein:vir:80180 Length: 381 98.7 2E-09 1.2E-12 68.3 14.4 290 98-408 1-327 (381) 139 protein:vir:10450 Length: 344 98.7 3.6E-09 2.2E-12 66.9 15.7 272 105-393 1-344 (344) 140 protein:vir:94711 Length: 347 98.7 1.4E-09 8.4E-13 69.2 12.8 277 101-394 1-347 (347) 141 protein:vir:95318 Length: 328 98.7 1.4E-09 8.7E-13 69.1 12.8 216 105-334 1-328 (328) 142 protein:vir:78739 Length: 332 98.6 2E-09 1.3E-12 68.3 12.7 278 98-391 1-332 (332) 143 protein:vir:6324 Length: 335 # 98.6 1.1E-08 7E-12 64.2 15.8 288 105-400 1-335 (335) 144 protein:vir:78935 Length: 335 98.6 1.2E-08 7.7E-12 63.9 15.8 284 105-400 1-335 (335) 145 protein:vir:1583 Length: 351 # 98.6 3.5E-08 2.2E-11 61.4 18.1 278 116-408 1-342 (351) 146 protein:vir:103759 Length: 330 98.5 5E-09 3.1E-12 66.1 12.3 216 100-334 1-330 (330) 147 protein:vir:103323 Length: 364 98.5 1.5E-07 9.2E-11 58.0 19.9 289 105-408 1-354 (364) 148 protein:vir:3136 Length: 322 # 98.4 3E-08 1.8E-11 61.9 14.6 266 116-400 1-322 (322) 149 protein:vir:97031 Length: 402 98.4 6.2E-08 3.8E-11 60.1 14.9 293 105-408 1-354 (402) 150 protein:vir:102944 Length: 330 98.4 2.5E-07 1.6E-10 56.8 18.1 275 116-408 1-310 (330) 151 protein:vir:107826 Length: 331 98.3 1.6E-07 1E-10 57.8 15.6 216 100-334 1-331 (331) 152 protein:vir:98525 Length: 331 98.3 1.6E-07 1E-10 57.8 15.6 216 100-334 1-331 (331) 
153 protein:vir:107388 Length: 331 98.3 1.6E-07 1E-10 57.8 15.6 216 100-334 1-331 (331) 154 protein:vir:7324 Length: 335 # 98.2 9E-08 5.6E-11 59.2 12.3 218 100-336 1-335 (335) 155 protein:vir:7019 Length: 401 # 98.2 1.9E-07 1.1E-10 57.5 13.9 291 105-408 1-350 (401) 156 protein:vir:99675 Length: 324 98.2 3.5E-07 2.2E-10 56.0 15.3 246 149-408 1-317 (324) 157 protein:vir:9927 Length: 295 # 98.2 2E-07 1.2E-10 57.3 13.4 262 116-400 1-295 (295) 158 protein:vir:105645 Length: 400 98.1 4.8E-07 3E-10 55.2 14.1 293 105-408 1-349 (400) 159 protein:vir:102655 Length: 322 98.0 1.9E-06 1.2E-09 51.9 16.6 272 111-394 1-322 (322) 160 protein:vir:9875 Length: 296 # 97.9 7.7E-07 4.8E-10 54.1 12.9 264 100-394 1-296 (296) 161 protein:vir:103285 Length: 296 97.9 1.7E-06 1E-09 52.2 14.0 266 116-394 1-296 (296) 162 protein:vir:107687 Length: 319 97.7 8.2E-06 5.1E-09 48.5 14.4 280 84-391 1-319 (319) 163 protein:vir:80068 Length: 301 97.6 8.5E-06 5.3E-09 48.4 14.2 259 118-391 1-301 (301) 164 protein:vir:106647 Length: 303 97.6 3.9E-06 2.4E-09 50.2 12.0 264 105-402 1-303 (303) 165 protein:vir:93966 Length: 400 97.6 2.6E-05 1.6E-08 45.7 16.0 360 1-391 8-400 (400) 166 protein:vir:8843 Length: 317 # 97.5 1.6E-05 9.7E-09 46.9 14.6 265 116-395 1-317 (317) 167 protein:vir:1663 Length: 393 # 97.4 3.7E-05 2.3E-08 44.9 15.3 356 1-391 1-393 (393) 168 protein:vir:99075 Length: 392 97.3 8.5E-05 5.3E-08 42.9 16.1 280 116-408 1-344 (392) 169 protein:vir:95131 Length: 325 97.0 0.00021 1.3E-07 40.7 18.6 286 82-408 1-307 (325) 170 protein:vir:97331 Length: 319 97.0 0.00021 1.3E-07 40.7 18.8 295 97-408 1-317 (319) 171 protein:vir:94800 Length: 319 97.0 0.00021 1.3E-07 40.7 18.8 295 97-408 1-317 (319) 172 protein:vir:104342 Length: 314 96.9 8.3E-05 5.1E-08 43.0 12.6 285 84-394 1-314 (314) 173 protein:vir:98566 Length: 355 96.9 0.00028 1.8E-07 40.0 17.3 293 89-405 1-355 (355) 174 protein:vir:80446 Length: 367 96.8 0.00036 2.2E-07 39.5 19.1 281 105-408 1-343 (367) 175 protein:vir:1829 Length: 355 # 96.7 0.00037 
2.3E-07 39.4 17.2 294 89-406 1-355 (355) 176 protein:vir:1153 Length: 338 # 96.7 0.00039 2.4E-07 39.3 16.9 281 89-395 1-338 (338) 177 protein:vir:5694 Length: 357 # 96.4 0.00062 3.8E-07 38.2 16.3 295 89-405 1-357 (357) 178 protein:vir:104011 Length: 337 96.4 0.00067 4.1E-07 38.0 16.8 284 89-396 1-337 (337) 179 protein:vir:6061 Length: 357 # 96.4 0.00069 4.3E-07 37.9 16.7 294 89-405 1-357 (357) 180 protein:vir:1383 Length: 421 # 96.4 0.00071 4.4E-07 37.8 19.9 367 1-408 8-409 (421) 181 protein:vir:79171 Length: 337 96.3 0.00079 4.9E-07 37.6 16.8 284 89-396 1-337 (337) 182 protein:vir:79642 Length: 329 96.2 0.00084 5.2E-07 37.4 14.2 290 84-394 1-329 (329) 183 protein:vir:2016 Length: 357 # 96.2 0.00091 5.6E-07 37.3 16.3 295 89-405 1-357 (357) 184 protein:vir:107120 Length: 329 96.0 0.0012 7.2E-07 36.7 19.1 306 74-408 1-319 (329) 185 protein:vir:108303 Length: 418 96.0 0.0012 7.4E-07 36.6 17.6 270 119-408 1-329 (418) 186 protein:vir:5255 Length: 304 # 96.0 0.00057 3.6E-07 38.4 11.8 261 121-390 1-304 (304) 187 protein:vir:79157 Length: 339 95.8 0.0015 9.2E-07 36.1 16.3 284 89-397 1-339 (339) 188 protein:vir:100331 Length: 342 95.7 0.0016 9.8E-07 35.9 16.1 283 89-397 1-342 (342) 189 protein:vir:78186 Length: 337 95.7 0.0016 1E-06 35.9 16.2 282 89-396 1-337 (337) 190 protein:vir:6212 Length: 434 # 93.9 0.0062 3.8E-06 32.7 17.2 377 5-407 1-434 (434) 191 protein:vir:94989 Length: 349 93.7 0.0065 4.1E-06 32.6 19.6 276 116-408 1-329 (349) 192 protein:vir:95875 Length: 401 93.7 0.0066 4.1E-06 32.6 13.4 286 97-396 1-401 (401) 193 protein:vir:3525 Length: 423 # 93.5 0.0073 4.5E-06 32.3 17.0 277 116-408 1-339 (423) 194 protein:vir:78777 Length: 358 93.0 0.0092 5.7E-06 31.7 19.4 296 82-408 1-356 (358) 195 protein:vir:99311 Length: 463 92.5 0.011 6.9E-06 31.3 16.5 305 80-408 1-355 (463) 196 protein:vir:95603 Length: 463 92.5 0.011 6.9E-06 31.3 16.5 305 80-408 1-355 (463) 197 protein:vir:80128 Length: 466 92.3 0.012 7.4E-06 31.1 16.8 375 5-408 1-460 (466) 198 protein:vir:98856 
Length: 343 92.1 0.013 7.8E-06 31.0 16.7 294 89-403 1-343 (343) 199 protein:vir:79548 Length: 652 92.1 0.013 7.9E-06 31.0 24.1 359 1-390 216-652 (652) 200 protein:vir:861 Length: 318 # 91.2 0.017 1.1E-05 30.3 13.5 287 84-391 1-318 (318) 201 protein:vir:270 Length: 341 # 91.2 0.017 1.1E-05 30.3 16.7 291 82-407 1-341 (341) 202 protein:vir:3643 Length: 336 # 90.9 0.017 1E-05 30.3 10.3 296 56-391 1-336 (336) 203 protein:vir:1781 Length: 221 # 90.6 0.02 1.2E-05 29.9 12.5 182 201-408 1-215 (221) 204 protein:vir:78387 Length: 349 89.8 0.024 1.5E-05 29.5 19.7 278 116-408 1-329 (349) 205 protein:vir:94870 Length: 318 89.1 0.029 1.8E-05 29.1 13.5 287 84-391 1-318 (318) 206 protein:vir:174 Length: 423 # 88.4 0.033 2E-05 28.7 17.1 283 116-408 1-339 (423) 207 protein:vir:95512 Length: 693 88.3 0.033 2.1E-05 28.7 26.9 377 1-391 258-693 (693) 208 protein:vir:105522 Length: 423 87.8 0.037 2.3E-05 28.5 17.5 277 116-408 1-339 (423) 209 protein:vir:79008 Length: 299 87.3 0.04 2.5E-05 28.3 20.2 264 116-395 1-299 (299) 210 protein:vir:96666 Length: 462 87.1 0.041 2.6E-05 28.2 15.9 315 76-408 1-352 (462) 211 protein:vir:96792 Length: 315 85.8 0.05 3.1E-05 27.7 16.9 270 116-408 1-294 (315) 212 protein:vir:105374 Length: 423 83.6 0.067 4.2E-05 27.0 17.0 283 116-408 1-339 (423) 213 protein:vir:94070 Length: 339 81.3 0.087 5.4E-05 26.4 13.2 289 80-391 1-339 (339) 214 protein:vir:3746 Length: 336 # 80.0 0.099 6.2E-05 26.1 15.7 285 92-399 1-336 (336) 215 protein:vir:78558 Length: 336 79.2 0.11 6.7E-05 25.9 11.8 292 63-391 1-336 (336) 216 protein:vir:3783 Length: 336 # 78.8 0.11 6.9E-05 25.8 15.6 285 92-399 1-336 (336) 217 protein:vir:95451 Length: 313 75.2 0.15 9.3E-05 25.1 14.0 271 116-395 1-313 (313) 218 protein:vir:80835 Length: 464 75.0 0.15 9.4E-05 25.1 11.9 312 75-408 1-357 (464) 219 protein:vir:1084 Length: 437 # 72.7 0.18 0.00011 24.7 19.0 361 1-396 4-437 (437) 220 protein:vir:79712 Length: 285 72.0 0.19 0.00012 24.6 16.7 260 116-393 1-285 (285) 221 protein:vir:106734 Length: 336 70.6 
0.21 0.00013 24.3 10.8 295 63-391 1-336 (336) 222 protein:vir:10364 Length: 390 68.1 0.24 0.00015 24.0 17.8 367 7-404 1-390 (390) 223 protein:vir:3870 Length: 400 # 64.7 0.3 0.00018 23.5 16.9 364 5-407 1-400 (400) 224 protein:vir:98143 Length: 524 63.5 0.32 0.0002 23.3 13.8 341 1-408 1-514 (524) 225 protein:vir:100851 Length: 514 63.1 0.32 0.0002 23.3 10.8 319 83-408 1-387 (514) 226 protein:vir:101557 Length: 336 59.0 0.4 0.00025 22.8 12.6 299 56-391 1-336 (336) 227 protein:vir:81070 Length: 390 52.0 0.57 0.00035 21.9 16.4 359 7-404 1-390 (390) 228 protein:vir:9410 Length: 415 # 47.8 0.69 0.00043 21.5 17.2 378 7-408 1-411 (415) 229 protein:vir:80986 Length: 528 47.0 0.72 0.00044 21.4 14.7 340 18-408 1-518 (528) 230 protein:vir:107732 Length: 379 45.4 0.77 0.00048 21.2 8.5 314 39-391 1-379 (379) 231 protein:vir:78920 Length: 290 44.4 0.81 0.0005 21.1 20.3 260 116-393 1-290 (290) 232 protein:vir:103463 Length: 521 43.5 0.84 0.00052 21.0 13.2 338 1-408 1-509 (521) 233 protein:vir:4339 Length: 395 # 40.8 0.95 0.00059 20.7 16.4 373 7-406 1-395 (395) 234 protein:vir:63741 Length: 468 39.6 1 0.00063 20.6 14.0 308 80-408 1-360 (468) 235 protein:vir:102823 Length: 470 39.2 0.81 0.0005 21.1 5.4 286 80-408 1-313 (470) 236 protein:vir:4830 Length: 397 # 38.1 1.1 0.00067 20.4 19.4 356 7-408 1-397 (397) 237 protein:vir:1328 Length: 392 # 33.7 1.3 0.00083 19.9 16.7 360 5-407 1-392 (392) 238 protein:vir:6901 Length: 522 # 30.8 1.5 0.00096 19.5 15.2 346 1-408 1-512 (522) 239 protein:vir:4600 Length: 415 # 30.8 1.5 0.00096 19.5 16.1 372 7-408 1-411 (415) 240 protein:vir:4700 Length: 415 # 30.8 1.5 0.00096 19.5 16.1 372 7-408 1-411 (415) 241 protein:vir:80491 Length: 467 29.6 1.6 0.001 19.4 13.7 308 80-408 1-359 (467) 242 protein:vir:1268 Length: 397 # 28.8 1.7 0.0011 19.3 19.3 361 5-406 1-397 (397) 243 protein:vir:103886 Length: 302 28.4 1.8 0.0011 19.2 14.3 257 116-397 1-302 (302) 244 protein:vir:97053 Length: 390 27.5 1.8 0.0011 19.1 19.6 373 7-404 1-390 (390) 245 
protein:vir:107947 Length: 519 26.9 1.9 0.0012 19.1 15.2 338 5-408 1-509 (519) 246 protein:vir:7409 Length: 408 # 25.3 2.1 0.0013 18.8 21.3 356 1-405 5-408 (408) 247 protein:vir:1025 Length: 408 # 24.3 2.2 0.0014 18.7 22.4 365 1-405 5-408 (408) 248 protein:vir:108295 Length: 711 22.0 2.5 0.0016 18.4 7.2 92 1-103 616-711 (711) 249 protein:vir:7214 Length: 521 # 21.1 2.7 0.0016 18.3 13.4 332 1-408 1-496 (521) No 1 >protein:vir:1025 Length: 408 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076679;genbank:gi:13095788;genbank:GeneID:920362 Probab=100.00 E-value=4.1e-91 Score=516.11 Aligned_cols=408 Identities=100% Similarity=1.342 Sum_probs=389.0 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccc Q lcl|Aclame:pro 1 MGVKLTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKG 80 (408) Q Consensus 1 M~~~~~i~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (408) |+++|+|+||++++.++.++++++++++++++++++.+.+++++++.+++.++++++.++.++.+.+............. 
T Consensus 1 m~~~m~l~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (408) T protein:vir:10 1 MGVKLTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKG 80 (408) T ss_pred CCccccHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccc Confidence 99999999999999999999999999999998899999999999999999999999999999999888877777777777 Q ss_pred ccccchhhhHHHHHHHHHHHhhcchhhHHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeecccCcc Q lcl|Aclame:pro 81 PLNKSENELKDKFVKDFVNMVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNG 160 (408) Q Consensus 81 ~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g 160 (408) ............+.++|.++++++.......+.++...+++++||++||++++++|++.+++.++|+++|+++++++.++ T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~ 160 (408) T protein:vir:10 81 PLNKSENELKDKFVKDFVNMVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNG 160 (408) T ss_pred ccccchhhhHHHHHHHHHHHhhcchhhhhhhhhhhhhcccccCCceeccHhHHHHHHHHHHhhchhhhhcceeeccCCcc Confidence 77777777888899999999999998888889999999999999999999999999999999999999999999999999 Q ss_pred ceEEeeccCCccccchhcccccccccccccceeeeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 161 SRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAII 240 (408) Q Consensus 161 ~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~ 240 (408) +++++...+..+.+.|++|++++|+++.++|++|++++++++++++||+||++|+.++|.+||.++|+++++++++.+|+ T Consensus 161 ~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il 240 (408) T protein:vir:10 161 SRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAII 240 (408) T ss_pred eEEEeeccccccceeeecCccccccccCcceeeEEeeeeeEEeeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHh Confidence 99999887777888999999999988889999999999999999999999999999999999999999999999999999 Q ss_pred 
hccccccchhhhhhHHHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHhhhcccCceeeccccccCCcccccccceEeecc Q lcl|Aclame:pro 241 EVMKAAPKKPTIAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNSYLIKGKQVIVVAD 320 (408) Q Consensus 241 ~g~g~~~~~~~~~~~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~l~G~pv~~~~~ 320 (408) +|+|++++..+..++++++++++..+++.|+.++.|+|||++|..|+++||++|+|+|++++.++.+++|+|+||+++++ T Consensus 241 ~g~g~~~~~~~~~~~~~l~~~~~~~~~~~~~~~a~~v~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~l~G~PV~~~~~ 320 (408) T protein:vir:10 241 EVMKAAPKKPTIAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNSYLIKGKQVIVVAD 320 (408) T ss_pred hcccccccccccccHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCceEeccCcCCCCCceecceeeEEecc Confidence 99999999999999999999998889999999999999999999999999999999999999999999999999999988 Q ss_pred ccccccccCcceEEEEehhcceEeeeccceEEEEeccchhhhhhceeeEEEEeeeCcEEecccceEEEEeeccccCCCCc Q lcl|Aclame:pro 321 RWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFSAIADQVGNF 400 (408) Q Consensus 321 ~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~~~ 400 (408) ..+|+.+++...++||||+++|.+++|++++++++++.+..|.+|++.||++.|+||++++|+||+++++++++++++.+ T Consensus 321 ~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~~v~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~~~~~~~~~~~~~~ 400 (408) T protein:vir:10 321 RWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFSAIADQVGNF 400 (408) T ss_pred cccCccCCCceEEEEEehhccEEEEEecceEEEEcccccchhhcCceEEEEEEeeccEEeccccEEEEEeeccccCCCCC Confidence 88999888889999999999999999999999999999899999999999999999999999999999999999999999 Q ss_pred cCCCcccC Q lcl|Aclame:pro 401 KTTTSTAV 408 (408) Q Consensus 401 ~~~~~~~~ 408 (408) +++|++|| T Consensus 401 ~~~~~~~~ 408 (408) T protein:vir:10 401 KTTTSTAV 408 (408) T ss_pred CCCCcccC Confidence 99999999 No 2 >protein:vir:7409 Length: 408 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:146 # MgeName: P335 # Cross-refs: 
genbank:acc:NP_839926;genbank:gi:30089896;genbank:GeneID:1260683 Probab=100.00 E-value=8.3e-90 Score=508.96 Aligned_cols=408 Identities=96% Similarity=1.311 Sum_probs=386.7 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccc Q lcl|Aclame:pro 1 MGVKLTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKG 80 (408) Q Consensus 1 M~~~~~i~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (408) ||++|+|+||++++.++.+++++++++++++.++.+...+++++++.+++.++++++.++.++.+.+............. T Consensus 1 m~~~m~i~el~~~~~~~~~~~~~~~~e~~~~~~~~~~~~e~i~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (408) T protein:vir:74 1 MGVKLTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKG 80 (408) T ss_pred CChhhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999888887777776666666 Q ss_pred ccccchhhhHHHHHHHHHHHhhcchhhHHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeecccCcc Q lcl|Aclame:pro 81 PLNKSENELKDKFVKDFVNMVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNG 160 (408) Q Consensus 81 ~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g 160 (408) ............+.++|.++++++.......+.+++..+++++||++||+++.+.|++.+++.++|+++|+++++++.++ T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~ 160 (408) T protein:vir:74 81 PLNKSENELKDKFVKDFVNMVRNPMAFLNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSSG 160 (408) T ss_pred cccchhhhhHHHHHHHHHHHHhcchhhhhhhhhhhhcccccCCCceeechhHhhHHHHHHhhhcchhhhcceeeccCCcc Confidence 66677777778889999999999988888899999999999999999999999999999999999999999999999999 Q ss_pred ceEEeeccCCccccchhcccccccccccccceeeeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 161 SRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAII 240 (408) Q Consensus 161 
~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~ 240 (408) .++++...+..+.+.|++|++++++++.++|++|++++++++++++||+|+++|+.++|++||.++|++++++++|.+|+ T Consensus 161 ~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~il 240 (408) T protein:vir:74 161 SRVYEKWTDVTPLKAMDEEDGKIPDLDNPRLTIIKYLIKRYAGIITATNTLLKDTAENILAWLSSWIAKKVVVTRNQAII 240 (408) T ss_pred eEEEEeecCCcccccccccccccccccccceeeEEeeeeeEEeeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHh Confidence 99999887777778899999999988889999999999999999999999999999999999999999999999999999 Q ss_pred hccccccchhhhhhHHHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHhhhcccCceeeccccccCCcccccccceEeecc Q lcl|Aclame:pro 241 EVMKAAPKKPTIAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNSYLIKGKQVIVVAD 320 (408) Q Consensus 241 ~g~g~~~~~~~~~~~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~l~G~pv~~~~~ 320 (408) +|+|++++..+..++++++++++..+++.|+.+++|+|||++|.+|+++||++|+|+|++++.++.+++|+|+||+++++ T Consensus 241 ~G~G~~~~~~~~~~~~~i~~~~~~~l~~~~~~~a~~v~n~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~ 320 (408) T protein:vir:74 241 AAMGTVPKKPTIANFDDVITMINTSVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNSYLIKGKQVIVVAD 320 (408) T ss_pred hcccccccccccccHHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHHhhcCCCceEeccCcCCCCCceecceeeEEecC Confidence 99999999999999999999998899999999999999999999999999999999999999999999999999999988 Q ss_pred ccccccccCcceEEEEehhcceEeeeccceEEEEeccchhhhhhceeeEEEEeeeCcEEecccceEEEEeeccccCCCCc Q lcl|Aclame:pro 321 RWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFSAIADQVGNF 400 (408) Q Consensus 321 ~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~~~ 400 (408) ..+|+.++++.+++||||+++|.+++|++++++++++.+..|.+|++.+|++.|+||++++|+||++++++++++++++| T Consensus 321 ~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~~~~~~~ 400 (408) T protein:vir:74 321 RWLPNSGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFTAIADQVGNF 400 (408) T ss_pred 
cccccccCCcceEEEEehhccEEEEEecceEEEEeccccchhhcceeeEEEEEeeCcEEecccceEEEEeecccCCCCCC Confidence 88999889999999999999999999999999999999889999999999999999999999999999999999999999 Q ss_pred cCCCcccC Q lcl|Aclame:pro 401 KTTTSTAV 408 (408) Q Consensus 401 ~~~~~~~~ 408 (408) ++++|+|| T Consensus 401 ~~~~~~~~ 408 (408) T protein:vir:74 401 KTTTSTAV 408 (408) T ss_pred CCCccccC Confidence 99999999 No 3 >protein:vir:3991 Length: 404 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116499;genbank:gi:14251132;genbank:GeneID:921252 Probab=100.00 E-value=6e-86 Score=487.81 Aligned_cols=404 Identities=95% Similarity=1.309 Sum_probs=379.7 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccc Q lcl|Aclame:pro 1 MGVKLTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKG 80 (408) Q Consensus 1 M~~~~~i~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (408) |.+||+|+||++++.++.++++++.++++++..+.+...+++++++++++.++.+++.+++++.+.+............. 
T Consensus 1 ~~~~m~l~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (404) T protein:vir:39 1 MGVKLTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKG 80 (404) T ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccc Confidence 99999999999999999999999999999999998999999999999999999999999999998887777776666666 Q ss_pred ccccchhhhHHHHHHHHHHHhhcchhhHHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeecccCcc Q lcl|Aclame:pro 81 PLNKSENELKDKFVKDFVNMVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNG 160 (408) Q Consensus 81 ~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g 160 (408) ............++++|.++++.+.......+.++...+++++||++||+++...|++.+++.++|+++|+++++++..+ T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~a~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~ 160 (404) T protein:vir:39 81 PLNKSEYELKDKFVKEFVNMVRNPMAFLNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNG 160 (404) T ss_pred ccccchhhhHHHHHHHHHHHHhcchhhhhhhhhhhhhcccccCCceeccHHHHHHHHHHHHhhhhHHhhcceeeccCCcc Confidence 66666666778889999999999988888889999999999999999999999999999999999999999999999999 Q ss_pred ceEEeeccCCccccchhcccccccccccccceeeeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 161 SRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAII 240 (408) Q Consensus 161 ~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~ 240 (408) ++++++..+..+.+.|++|++++|+++.++|++|++++++++++++||+|+++|+.++|++||.++|++++++++|.+++ T Consensus 161 ~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~il 240 (404) T protein:vir:39 161 SRVYEKWTDVTPLTVMDAEDGKIPDLDNPRLTIIKYLIKRYAGIITATNTLLKDTAENILAWLSSWIAKKVVVTRNQAII 240 (404) T ss_pred eEEEEeecCCccceeeecCccccccccccceeeEEeeeeeEEeeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHH Confidence 99999887777888999999999988889999999999999999999999999999999999999999999999999999 Q ss_pred 
hccccccchhhhhhHHHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHhhhcccCceeeccccccCCcccccccceEeecc Q lcl|Aclame:pro 241 EVMKAAPKKPTIAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNSYLIKGKQVIVVAD 320 (408) Q Consensus 241 ~g~g~~~~~~~~~~~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~l~G~pv~~~~~ 320 (408) +|+|++++..+..++++++++++..+++.|..+++|+|||++|..|+++||++|+|+|++++.++.+++|+|+||+++++ T Consensus 241 ~g~g~~~~~~~~~~~~~i~~~~~~~~~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~ 320 (404) T protein:vir:39 241 AAMGTVPKKPTIAKFDDVITMINTSVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNSYLIKGKKVIVVAD 320 (404) T ss_pred hcccccccccccccHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCceeeccCcCCCCcceecceeEEEecc Confidence 99999999999999999999998889999999999999999999999999999999999999999999999999999988 Q ss_pred ccccccccCcceEEEEehhcceEeeeccceEEEEeccchhhhhhceeeEEEEeeeCcEEecccceEEEEeeccccCCCCc Q lcl|Aclame:pro 321 RWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFSAIADQVGNF 400 (408) Q Consensus 321 ~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~~~ 400 (408) ..+|+...+..+++||||+++|.+++|++++++++++.++.|++|++.+|++.|+|+++.+|+||+++++++++++.+++ T Consensus 321 ~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~a~~~~~~ 400 (404) T protein:vir:39 321 RWLPNSGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKTTDSEALVAGSFTAIADQVGNF 400 (404) T ss_pred cccCccCCCccEEEEEeccccEEEEeecceEEEEeccchhhhhhceeeEEEEeeeccEEecccceEEEEeeccccCCCCC Confidence 88998888889999999999999999999999999999889999999999999999999999999999999999988877 Q ss_pred cCCC Q lcl|Aclame:pro 401 KTTT 404 (408) Q Consensus 401 ~~~~ 404 (408) ++.- T Consensus 401 ~~~~ 404 (404) T protein:vir:39 401 TAGK 404 (404) T ss_pred CCCC Confidence 7665 No 4 >protein:vir:4953 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049929;genbank:gi:9632900;genbank:GeneID:1262076 Probab=100.00 E-value=2.7e-81 
Score=462.28 Aligned_cols=397 Identities=59% Similarity=0.897 Sum_probs=358.1 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccc Q lcl|Aclame:pro 1 MGVKLTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKG 80 (408) Q Consensus 1 M~~~~~i~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (408) |. +++||++++.++.+++++++++++.+..++..+.+++++++++++.++++++.+++++.+.............+. T Consensus 1 Mk---~~~el~~~~~~~~~~~~~l~~~~~~~~~~~~~~~ee~~~~~~~i~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~ 77 (397) T protein:vir:49 1 MK---TSNELHDLWVAQGDKVENLNEKLNVAMLDDSVSAEELQAIKNERDTAKMKRDMFKEQYTEARANEVANMSEEEKK 77 (397) T ss_pred Cc---hHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccc Confidence 54 467888889999999999999998888888888999999999999999999988888777666655555555555 Q ss_pred ccccchhhhHHHHHHHHHHHhhcchhhHHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeecccCcc Q lcl|Aclame:pro 81 PLNKSENELKDKFVKDFVNMVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNG 160 (408) Q Consensus 81 ~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g 160 (408) ............++++|.++++++... 
...+...+++++||++||+++.+.|++.+++.++|+++|+++++++.++ T Consensus 78 ~~~~~~~~~~~~~~~~~~~~l~~~~~~----~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~ 153 (397) T protein:vir:49 78 PLTKSEEEVKAGFVKDFKNLVRGRYQN----LLDSKTDASGSDAGLTIPQDIQTAIHTLVSQYDSLQEYVNVENVTTLTG 153 (397) T ss_pred ccccchhHHHHHHHHHHHHHHhcchhH----HHHHhhccccccCcccccHhHHHHHHHHHHhhhhHHhhhceeecccCcc Confidence 555556666778889999999886532 3345567788899999999999999999999999999999999999999 Q ss_pred ceEEeeccCCccccchhcccccccccccccceeeeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 161 SRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAII 240 (408) Q Consensus 161 ~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~ 240 (408) +++++...+..+.+.|++|++++++++.++|++|++++++++++++||+||++|+.++|++||.++|++++++++|.+|+ T Consensus 154 ~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~ai~ 233 (397) T protein:vir:49 154 SRVYEKWTDITGLANIDDEAGKIADVDDPKLSLIKYTIKRYAGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAIL 233 (397) T ss_pred ceEEEeeccCCcceeeecCccccccccccceeeEEeeeeeEEeeehhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 99999887777888999999999987889999999999999999999999999999999999999999999999999999 Q ss_pred hccccccchhhhhhHHHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHhhhcccCceeeccccccCCcccccccceEeecc Q lcl|Aclame:pro 241 EVMKAAPKKPTIAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNSYLIKGKQVIVVAD 320 (408) Q Consensus 241 ~g~g~~~~~~~~~~~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~l~G~pv~~~~~ 320 (408) +|+|++++..+.+++|++++++. 
.+++.|..+++|+|||++|..|++|||++|+|+|++++.++.+++|+|+||+++++ T Consensus 234 ~G~g~~~~~~~~~~~d~i~~~~~-~l~~~~~~~a~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~~~~ 312 (397) T protein:vir:49 234 EAIAALPTKPTLTKWDDIIDLEA-KVDPAIKQTSFFLTNTSGFTALKKVKNALGDYLMERDVKSPTGYSIDGFAVKEVAD 312 (397) T ss_pred hhccccccccccccHHHHHHHHH-hhhhhhcCCCEEEEcHHHHHHHHHhhcCCCceeeccCcCCCCCceecceeeEEecc Confidence 99999999999999999999875 58899999999999999999999999999999999999999999999999999988 Q ss_pred ccccccccCcceEEEEehhcceEeeeccceEEEEeccchhhhhhceeeEEEEeeeCcEEecccceEEEEeeccccCCCCc Q lcl|Aclame:pro 321 RWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFSAIADQVGNF 400 (408) Q Consensus 321 ~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~~~ 400 (408) .++|+...+...++||||+++|.+++|++++++++++.++.|.+|++.+|++.|+|+++++|+||+++++++++.+++++ T Consensus 313 ~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~~~~~~~ 392 (397) T protein:vir:49 313 RWLANGTGGAMPLYFGDLKQAVTLFDRQHMSLLSTNIGGGAFETDTTKVRVIDRFDVVATDTEAFVPASFKAIADQKGNL 392 (397) T ss_pred cccccccCCceeEEEeeccceEEEEeecceEEEEeccccchhhcCceeEEEEeeeCcEEecccceEEEEeecccCCCCCc Confidence 88898888889999999999999999999999999998889999999999999999999999999999999999999888 Q ss_pred cCCCc Q lcl|Aclame:pro 401 KTTTS 405 (408) Q Consensus 401 ~~~~~ 405 (408) ++++- T Consensus 393 ~~~~~ 397 (397) T protein:vir:49 393 GSTAV 397 (397) T ss_pred ccccC Confidence 87766 No 5 >protein:vir:4997 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049971;genbank:gi:9632943;genbank:GeneID:1262106 Probab=100.00 E-value=1.8e-80 Score=457.75 Aligned_cols=397 Identities=57% Similarity=0.886 Sum_probs=358.0 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccc Q lcl|Aclame:pro 1 MGVKLTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKG 80 (408) Q Consensus 1 
M~~~~~i~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (408) |. +++||+++++++.++++++++++++...+.+...+++++++++++.++++++.+++++.+.+...........+. T Consensus 1 Mk---~~~eL~~~~~~~~~~~~~l~~~~~~~~~~~~~~~ee~~~l~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (397) T protein:vir:49 1 MK---TSNELHDLWIAQGDKVENLNEKLNVAMLDDSVSAEELQAIKNERDTAKMKRDLFKEQYTEARANEVANMSEEEKK 77 (397) T ss_pred Cc---hHHHHHHHHHHHHHHHHHHHHHHHHHHhcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccc Confidence 44 367788888888899999999888888888888899999999999999988888888777666665555555555 Q ss_pred ccccchhhhHHHHHHHHHHHhhcchhhHHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeecccCcc Q lcl|Aclame:pro 81 PLNKSENELKDKFVKDFVNMVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNG 160 (408) Q Consensus 81 ~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g 160 (408) ............++++|.++++++... ..++...+++++||++||+++...|++.+++.++|+++++++++++.++ T Consensus 78 ~~~~~~~~~~~~~~~~~~~~l~~~~~~----~~~~~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~ 153 (397) T protein:vir:49 78 PLTKNEEEVKANFVKDFKNLVRGRYQN----LLDSKTDGSGSDAGLTIPQDIRTAINTLVRQFDSLQEYVNVENVTTLTG 153 (397) T ss_pred cccchhhHHHHHHHHHHHHHhhcchhh----HHHhhhccCCccCcceecHHHHHHHHHHHHhhhhHhhhcceeeccCCcc Confidence 555556666778889999999876432 3455667788899999999999999999999999999999999999999 Q ss_pred ceEEeeccCCccccchhcccccccccccccceeeeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 161 SRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAII 240 (408) Q Consensus 161 ~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~ 240 (408) +++++...+..+.+.|++|++++|+++.++|++|++++++++++++||+|+++|+.++|++||.++|++++++++|.+|+ T Consensus 154 ~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~ail 233 (397) T protein:vir:49 154 SRVYEKWADITGLAKLDDEGGQIGQNDDPKLSLIRYAIKRYAGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAIL 233 (397) T ss_pred 
eEEEEeeccCCcceeeeccccccccccccceeeeEeeeeeeEeehhhHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHH Confidence 99999887777888999999999987778999999999999999999999999999999999999999999999999999 Q ss_pred hccccccchhhhhhHHHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHhhhcccCceeeccccccCCcccccccceEeecc Q lcl|Aclame:pro 241 EVMKAAPKKPTIAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNSYLIKGKQVIVVAD 320 (408) Q Consensus 241 ~g~g~~~~~~~~~~~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~l~G~pv~~~~~ 320 (408) +|+|++++..+.+++|++++++. .+++.|..+++|+|||++|.+|++|||++|+|+|.+++.++.+++|+|+||+++++ T Consensus 234 ~G~g~~~~~~~~~~~d~i~~~~~-~l~~~~~~~a~~v~n~~~~~~l~~lkd~~g~~l~~~~~~~g~~~~l~G~pV~~~~~ 312 (397) T protein:vir:49 234 EAIGTLPNKPTLAKWDDIIDLQA-KVDPAIKQTSLFLTNTSGFTALKKVKNAMGDYLMERDVKSPTGYSIDGFVVKEISD 312 (397) T ss_pred hccccccccccccCHHHHHHHHH-hhhhhhcCCCEEEEcHHHHHHHHHhhccCCceeecccccCCCCceecceeeEEecc Confidence 99999999999999999998874 58899999999999999999999999999999999999999999999999999988 Q ss_pred ccccccccCcceEEEEehhcceEeeeccceEEEEeccchhhhhhceeeEEEEeeeCcEEecccceEEEEeeccccCCCCc Q lcl|Aclame:pro 321 RWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFSAIADQVGNF 400 (408) Q Consensus 321 ~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~~~ 400 (408) ..+|...+++.+++||||+++|++++|++++++++++.+..|.+|++.||++.|+|+++++|+||+++++++++.+++.| T Consensus 313 ~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~d~~~~~~~a~~~~~~~~~~~~~~~~ 392 (397) T protein:vir:49 313 RFLPNGTGGAMPLYFGDLKQAVTLFDRQHLSLLSTNIGGGAFETDTTKVRVIDRFDVVSTDTEAFVPASFKAIADQKAKL 392 (397) T ss_pred cccccccCCceeEEEeeccceEEEEeecccEEEEeccccchhhcCeeeEEEEEeeccEEecccceEEEEecccccccCcc Confidence 88998888899999999999999999999999999998888999999999999999999999999999999999999999 Q ss_pred cCCCc Q lcl|Aclame:pro 401 KTTTS 405 (408) Q Consensus 401 ~~~~~ 405 (408) .+++| T Consensus 393 ~~~~~ 397 (397) T protein:vir:49 393 STAGA 397 (397) T ss_pred cccCC Confidence 99999 No 6 >protein:vir:4830 Length: 397 # NCBI 
annotation: MPL-7201 # Family: family:all:21 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038327;genbank:gi:9634653;genbank:GeneID:1262632 Probab=100.00 E-value=1e-78 Score=448.13 Aligned_cols=397 Identities=58% Similarity=0.885 Sum_probs=353.0 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccc Q lcl|Aclame:pro 1 MGVKLTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKG 80 (408) Q Consensus 1 M~~~~~i~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (408) |. +++||++.+.++.+++++++++++.+..++....+++++++.+++.+..+++.++++.................. T Consensus 1 Mk---~~~el~~~~~~~~~~i~~~~~~~~~~~~~~~~~~ee~~~l~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (397) T protein:vir:48 1 MK---TSNELHDLWVAQGDKVENLNEKLNVAMLDDSVTAEELQAIKNERDTAKMKRDMFKEQYTEARANEVVNMSEEEKK 77 (397) T ss_pred Cc---hHHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhccc Confidence 44 456677778888888999988888887777778889999999999998888888887777666555555555555 Q ss_pred ccccchhhhHHHHHHHHHHHhhcchhhHHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeecccCcc Q lcl|Aclame:pro 81 PLNKSENELKDKFVKDFVNMVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNG 160 (408) Q Consensus 81 ~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g 160 (408) ............+.++|.++++.+... 
...+...+++++||++||+++++.|++.+++.++|+++|+++++++.++ T Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~ 153 (397) T protein:vir:48 78 PLTKSEEEVKAGFVKDFKNLVRGRYQN----LLDSKTDASGSDAGLTIPQDIQTAIHTLVRQYDSLQEYVNVENVTTLTG 153 (397) T ss_pred cccchhhHHHHHHHHHHHHHHhhhhhH----HHHHhhccCCccccccccHHHHHHHHHHHHHHHHHHhhhceeeccCCcc Confidence 555556666778889999998875322 2334456677889999999999999999999999999999999999999 Q ss_pred ceEEeeccCCccccchhcccccccccccccceeeeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 161 SRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAII 240 (408) Q Consensus 161 ~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~ 240 (408) .++++...+..+.++|++|++.+++++.++|++|++++++++++++||+|+++|+.++|.+||.++|++++++++|.+|+ T Consensus 154 ~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~v~~~l~~~~~~~~d~~il 233 (397) T protein:vir:48 154 SRVYEKWADITGLAKLDDEAGSIGTNDDPKLYPIRYAIKRYAGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAIL 233 (397) T ss_pred eEEEEeecCCCcceeeeccccccccccccceeeEEeeheeeeeehhhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHh Confidence 99998887777888999999999987789999999999999999999999999999999999999999999999999999 Q ss_pred hccccccchhhhhhHHHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHhhhcccCceeeccccccCCcccccccceEeecc Q lcl|Aclame:pro 241 EVMKAAPKKPTIAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNSYLIKGKQVIVVAD 320 (408) Q Consensus 241 ~g~g~~~~~~~~~~~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~l~G~pv~~~~~ 320 (408) +|+|++++..+..+++++++++ ..+++.|..+++|+|||++|..|++|||++|+|+|++++.++.+++|+|+||+++++ T Consensus 234 ~G~g~~~~~~~~~~~d~i~~~~-~~l~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~l~G~PV~~~~~ 312 (397) T protein:vir:48 234 EAIATLPTKPTLTKWDDIIDLQ-AKVDPAIKQTSFFLTNTSGFTALKKVKNAFGDYLMERDVKSPTGYSIDGFAVKEVAD 312 (397) T ss_pred hcccccccccccccHHHHHHHH-HHhhhhhcCCCEEEECHHHHHHHHHhhcCCCceeeccCcCCCCCceeccceeEEecc Confidence 9999999999999999999876 
458899999999999999999999999999999999999999999999999999988 Q ss_pred ccccccccCcceEEEEehhcceEeeeccceEEEEeccchhhhhhceeeEEEEeeeCcEEecccceEEEEeeccccCCCCc Q lcl|Aclame:pro 321 RWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFSAIADQVGNF 400 (408) Q Consensus 321 ~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~~~ 400 (408) .+++....++.+++||||+++|.+++|++++++++++.+..|.+|++.||++.|+|+++++|+||+++++++++++++++ T Consensus 313 ~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~~~~~~~ 392 (397) T protein:vir:48 313 RWLANASSGAMPLYFGDLKQAVTLFDRQQMSLLSTNIGGGAFETDTTKIRVIDRFDVVATDTESFVPASFKAIADQKGNL 392 (397) T ss_pred cccCCcCCCceEEEEEeccceEEEEeecceEEEEeccchhhhhcCceeEEEEeeeccEEecccceEEEEecccccCCCCc Confidence 88888888899999999999999999999999999988888999999999999999999999999999999999999998 Q ss_pred cCCCc Q lcl|Aclame:pro 401 KTTTS 405 (408) Q Consensus 401 ~~~~~ 405 (408) ++++- T Consensus 393 ~~~~~ 397 (397) T protein:vir:48 393 GSTAV 397 (397) T ss_pred cccCC Confidence 88776 No 7 >protein:vir:3845 Length: 395 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050151;swissprot:trembl:q9t1f6;genbank:gi:9633043;uniprot:Q9T1F6;genbank:GeneID:1262163 Probab=100.00 E-value=3e-75 Score=429.17 Aligned_cols=390 Identities=45% Similarity=0.693 Sum_probs=308.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccc-----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccc Q lcl|Aclame:pro 5 LTVNQLNEAWIASGDKVTDFNDQINMALNDDNFS-----AEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEK 79 (408) Q Consensus 5 ~~i~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (408) |+++||++++.++.++++++.++++++..+.... .++++.++++++.+++..+..+....+...... .... 
T Consensus 1 M~~~eL~~~~~~~~~~~~~l~e~~~~~~~~~~~~~~~~~~ee~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~----~~~~ 76 (395) T protein:vir:38 1 MNINQLKDAFDMAGQKVQDLEDKRAQFAIDLGNDASSHSVDDINKLNASLKNAKMAQELAKSAYEDARANLN----AEPV 76 (395) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh----hccc Confidence 6677788888888888988888877665443322 223344444444443333332222222111111 1111 Q ss_pred cccccchhhhHHHHHHHHHHHhhcchhhHHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeecccCc Q lcl|Aclame:pro 80 GPLNKSENELKDKFVKDFVNMVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSN 159 (408) Q Consensus 80 ~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~ 159 (408) ....... .......+++...+++.. .......+.++++||++||+++++.|++.+++.++|+++|+++++++.. T Consensus 77 ~~~~~~~-~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~ 150 (395) T protein:vir:38 77 NKKPLPV-KDGKPDAQAMKNQFVKDF-----KNLVTSGTTGTGNAGLTIPEDIQLQIRTLTRSFTSLESLANVENVTTSH 150 (395) T ss_pred cccccch-hhhhHHHHHHHHHHHHHH-----HHHHhhccCccCCCceecchhHhhHHHHHHHhhcchhhhcceeeccCCc Confidence 1111111 111112222222222211 0112223445677999999999999999999999999999999999999 Q ss_pred cceEEeeccCCccccchhcccccccccccccceeeeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 160 GSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAI 239 (408) Q Consensus 160 g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~ 239 (408) +.++++...+..+.+.|++|++.+++++.++|++|++++++++++++||+||++|+.++|++||.++|+++++++++.+| T Consensus 151 ~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~f~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~i 230 (395) T protein:vir:38 151 GSRVYEKLADITPLKDLDDESALIGDNDDPELTVVKYLIHRYAGITTVTNTLLKDTVDNIIQWLVNWAAKKDVVTRNAKI 230 (395) T ss_pred ceEEEEeeccCCccccccccccccccccccceeeEEeeeeeeEeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHH Confidence 99999888777778899999999998788999999999999999999999999999999999999999999999999999 Q ss_pred 
hhccccccchhhhhhHHHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHhhhcccCceeeccccccCCcccccccceEeec Q lcl|Aclame:pro 240 IEVMKAAPKKPTIAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNSYLIKGKQVIVVA 319 (408) Q Consensus 240 ~~g~g~~~~~~~~~~~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~l~G~pv~~~~ 319 (408) ++|+|++.+..+..++++++++++..++..|+.+++|+|||++|..|+++||++|+|+|++++.++.+++|+|+||++++ T Consensus 231 l~g~g~~~~~~~~~~~~~i~~~~~~~l~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~ 310 (395) T protein:vir:38 231 LEVMGKAPKKPTISQFDNIKDLENNTLDPAIESTSSFITNQSGYNILSKVKDADGRYLMQPDVTSPDKYLIDGKPVIRIA 310 (395) T ss_pred hhcccccccccccccHHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHHhhccCCceeeccCcCCCCcceeccceeEEec Confidence 99999999999999999999999888999999999999999999999999999999999999999999999999999998 Q ss_pred cccccccccCcceEEEEehhcceEeeeccceEEEEeccchhhhhhceeeEEEEeeeCcEEecccceEEEEeeccccCCCC Q lcl|Aclame:pro 320 DRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFSAIADQVGN 399 (408) Q Consensus 320 ~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~~ 399 (408) +..+|.. .+..+++||||+++|++++|++++++++++.+..|.+|++.||++.|+|+++.+|+||+++++++++++++. T Consensus 311 ~~~~~~~-~~~~~i~~gd~~~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~~~~~~ 389 (395) T protein:vir:38 311 DKWLPDV-SGSHPLYFGDLKQGITLFDRQQMQIDTTNVGAGSFEHDTTKLRFIDRFDVQLIDDGAFAAASFKTVANQAQG 389 (395) T ss_pred ccccCcC-CCcceEEEEeccccEEEEEecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEeecccCCCCC Confidence 8767764 567789999999999999999999999998888899999999999999999999999999999998877666 Q ss_pred ccCCCcc Q lcl|Aclame:pro 400 FKTTTST 406 (408) Q Consensus 400 ~~~~~~~ 406 (408) |+.+. 
+ T Consensus 390 ~~~~~-~ 395 (395) T protein:vir:38 390 TAGTG-K 395 (395) T ss_pred ccCCC-C Confidence 54433 3 No 8 >protein:vir:1268 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690760;genbank:gi:22855000;genbank:GeneID:955203 Probab=100.00 E-value=4.6e-73 Score=417.17 Aligned_cols=388 Identities=30% Similarity=0.489 Sum_probs=308.2 Q ss_pred CChH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhc-ccc Q lcl|Aclame:pro 1 MGVK--LTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNM-REE 77 (408) Q Consensus 1 M~~~--~~i~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~ 77 (408) |.++ ..+++|+++++++.++++++.++.+ .++.+...++++++.++++.+....+................. ... T Consensus 1 ~~~~m~k~l~el~~~~~~~~~~~~~~~~~~~--~ee~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 78 (397) T protein:vir:12 1 MPMQMSKKEIALRQQFTEKKQQADKALQEGN--TDEARALLDEVKQLKNQIELMTEGRSLDVPDLPGGVNFVPEQERNPE 78 (397) T ss_pred CCCcHHHHHHHHHHHHHHHHHHHHHHhhhhh--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhc Confidence 5444 4566787777777777666544321 1122223345555555555444333332222222221111111 111 Q ss_pred cccccccchhhhHHHHHHHHHHHhhcchh------hHHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhc Q lcl|Aclame:pro 78 EKGPLNKSENELKDKFVKDFVNMVRNPMA------FMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVR 151 (408) Q Consensus 78 ~~~~~~~~~~~~~~~~~~a~~~~~~~~~~------~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~ 151 (408) ..............++.++|.++++++.. 
.....+.+++.++++++||++||+++.+.|++.+++.++|+++|+ T Consensus 79 ~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~lvP~~~~~~ii~~~~~~~~l~~~~~ 158 (397) T protein:vir:12 79 GQRSQGQGNEERQQQYSKAFLKGLRGKRLTDEERDLLDSPEFRAMSGINDEDGGILIPEDIGRQIHEFKRQFEPLEQYVT 158 (397) T ss_pred ccccccchhhHHHHHHHHHHHHHHhccCCcHHHHHHHhhhhhhhccccccccCcccCchhHHHHHHHhhhhhhhHHhhcc Confidence 11122222333445678888888876542 123335677778888999999999999999999999999999999 Q ss_pred eeecccCccceEEeeccCCccccchhcccccccccccccceeeeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHH Q lcl|Aclame:pro 152 VESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKV 231 (408) Q Consensus 152 ~~~~~~~~g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~ 231 (408) ++++++.++.++++...+. +.++|++|++++|+++.++|++|++++++++++++||+|+++|+.++|++||.+.|++++ T Consensus 159 ~~~~~~~~~~~~~~~~~~~-~~a~~v~Eg~~~~~~~~~~~~~v~~~~~k~~~~~~is~e~l~ds~~~l~~~i~~~l~~~~ 237 (397) T protein:vir:12 159 VEPVTTRSGTRLLEKNADM-VPFSPVEELGNLPEIDQPRFTKVSYSIIDYGGIMTLSNSMLNDSDQAIMTYVAKWFAKKS 237 (397) T ss_pred eeeccCCceeEEEEEecCC-cceeeecccccccccccccceeEEeeheeeEeeehhhHHHHhhchHHHHHHHHHHHHHHH Confidence 9999998899988876544 567999999999987889999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhhccccccchhhhhhHHHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHhhhcccCceeeccccccCCccccc Q lcl|Aclame:pro 232 VVTRNQAIIEVMKAAPKKPTIAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNSYLIK 311 (408) Q Consensus 232 ~~~~~~~~~~g~g~~~~~~~~~~~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~l~ 311 (408) ++++|.+|++|+|++++. 
+..++++++++++..+++.|+.+++|+|||++|.+|+++||++|+|+|++++.++.+++|+ T Consensus 238 ~~~~d~~il~G~g~~~~~-g~~~~~~i~~~~~~~l~~~~~~~a~~~~n~~~~~~L~~lkd~~G~~l~~~~~~~g~~~~l~ 316 (397) T protein:vir:12 238 VVTRNNLILAAIASLKKV-DIDGLDGIKKALNVTLDPMVAPGSIVLTNQDGYDWLDTLKDGTGRYLLQPDPTNPTKKLLD 316 (397) T ss_pred HHHHHHHHHhcccccccc-ccccHHHHHHHHhhccchhhhCCCEEEEcHHHHHHHHHhhccCCceeecccccCCCCcccc Confidence 999999999999998875 5677999999998889999999999999999999999999999999999999999999999 Q ss_pred ccceEeeccccccccccCcceEEEEehhcceEeeeccceEEEEeccchhhhhhceeeEEEEeeeCcEEecccceEEEEee Q lcl|Aclame:pro 312 GKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFS 391 (408) Q Consensus 312 G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~ 391 (408) |+||+++++ .++....++..++||||+++|.+++|++++++++++.+..|.+|++.||++.|+|+++++|+||++++++ T Consensus 317 G~pv~~~~~-~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~t 395 (397) T protein:vir:12 317 GRPVVPFTN-RVLKTQKGKAPLIIGNLKEAIVLFDREQQSIASTDTGAGAFETNSTKVRGIEREDVRKWDEDAVVFGQIT 395 (397) T ss_pred ceeeEEecc-cccccCCCccEEEEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEEe Confidence 999998865 3555567788899999999999999999999999998989999999999999999999999999999998 Q ss_pred cc Q lcl|Aclame:pro 392 AI 393 (408) Q Consensus 392 ~~ 393 (408) +- T Consensus 396 ~~ 397 (397) T protein:vir:12 396 VE 397 (397) T ss_pred eC Confidence 87 No 9 >protein:vir:4456 Length: 401 # NCBI annotation: Major capsid protein precursor # Family: family:all:21 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700379;genbank:gi:23505451;genbank:GeneID:955658 Probab=100.00 E-value=3.2e-72 Score=412.51 Aligned_cols=373 Identities=18% Similarity=0.225 Sum_probs=297.2 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccc Q lcl|Aclame:pro 1 MGVKLTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKG 80 (408) Q Consensus 1 
M~~~~~i~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (408) |+++ +++|++.+.++.++.++++++.++..++. +++..++.++++.++.++++++..+.+.+........ T Consensus 1 m~~~--lk~l~~~~~el~~~~~~~k~~~~~~~~~~---e~~~~~l~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~----- 70 (401) T protein:vir:44 1 MAVD--IKDVEQVAQELQQKFDDFKAKNDKRVEAI---EQEKGKLAGQVETLNGKLSELENLKSDLEKELLELKR----- 70 (401) T ss_pred CCcc--HHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc----- Confidence 7776 56676666666666666655544433321 2344455555566666665555555544333222111 Q ss_pred ccccchhhhHHHHHHHHHHHhhcchhh-HHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeecccCc Q lcl|Aclame:pro 81 PLNKSENELKDKFVKDFVNMVRNPMAF-MNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSN 159 (408) Q Consensus 81 ~~~~~~~~~~~~~~~a~~~~~~~~~~~-~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~ 159 (408) +..........+++++|.++++++... ....+.+++..+++++||++||+++.++|++.+++.++|+++|+++++.+.. T Consensus 71 ~~~~~~~~~~~e~~~a~~~~lr~~~~~~~~~~e~~a~~~~~~~~GG~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~ 150 (401) T protein:vir:44 71 PARGAQNKVAAEHKDAFVGFLRKGREDGLRDLERKALQVGTDEDGGYAVPEELDRSILSLLKDEVVMRQEATVITVGGSD 150 (401) T ss_pred cccccccchhHHHHHHHHHHHhhhhhhhhHHHHHHHhhcCCCCCCceeccHhHHHHHHHHHHhhhhhhhhceeeecCCCc Confidence 111222334456788999999876543 3456778899999999999999999999999999999999999999987655 Q ss_pred cceEEeeccCCccccchhcccccccccccccceeeeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 160 GSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAI 239 (408) Q Consensus 160 g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~ 239 (408) .. ++... 
..+.+.|++|++.+|+++.++|++|++++++++++++||+|+++|+.++|++||.++|+++++++++.++ T Consensus 151 ~~--~~~~~-~~~~a~wv~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~ai~~~~~~~~ 227 (401) T protein:vir:44 151 YK--KLVNL-GGTASGWVGETDTRSQTATSRLGLIEPFMGEIYGNPQATQKMLDDAFFNVEAWINSELATEFAEQEEIAF 227 (401) T ss_pred eE--EEEec-CCccceeeccccccCccccccceeeeeehhheeeehhhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhh Confidence 44 44433 3355689999999998787899999999999999999999999999999999999999999999999999 Q ss_pred hhccccccch---------------------------hhhhhHHHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHhhhcc Q lcl|Aclame:pro 240 IEVMKAAPKK---------------------------PTIAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTA 292 (408) Q Consensus 240 ~~g~g~~~~~---------------------------~~~~~~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~lkd~ 292 (408) ++|+|++.|. .+..+++++++++. .|++.|+.+++|+||+++|.+|+++||+ T Consensus 228 l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~d~i~~~~~-~l~~~~~~~a~~v~n~~~~~~L~~lkd~ 306 (401) T protein:vir:44 228 TTGDGTKKPKGFLAYESTEESDKARAFGKLQHIVSGEATAVTADAIIKLIY-TLRKAHRTGAKFMMNNNSLFAIRLLKDT 306 (401) T ss_pred hccCCCCccceeeccccccccccccccccccccccccccccCHHHHHHHHH-hcchhhhcCCEEEEcHHHHHHHHHhhcc Confidence 9999986542 11234788887775 5889999999999999999999999999 Q ss_pred cCceeeccccccCCcccccccceEeeccccccccccCcceEEEEehhcceEeeeccceEEEEeccchhhhhhceeeEEEE Q lcl|Aclame:pro 293 EGKYLLEPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVI 372 (408) Q Consensus 293 ~G~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~ 372 (408) +|||+|++++..+.+++|+|+||+++++ +|..+.+..+++||||+++|.+++|.++++.++++ |.+|++.||++ T Consensus 307 ~G~~l~~~~~~~g~~~~l~G~PVv~~~~--~p~~~~~~~~i~~Gd~~~~~~i~~~~~~~~~~~~~----~~~~~v~~~a~ 380 (401) T protein:vir:44 307 EGNYLWRPGLELGQPSSLAGYGIAENEQ--MPDIAADAKAIAFGNFKRGYTIVDRIGTRILRDPY----TNKPFVGFYTT 380 (401) T ss_pred CCceeecCCcCCCCCceecceeeEEecC--cCCccCCccEEEEeehhccEEEEEecceEEeeecc----ccCCcEEEEEE Confidence 9999999999999999999999998765 67777788889999999999999999999987653 66899999999 Q 
ss_pred eeeCcEEecccceEEEEeecc Q lcl|Aclame:pro 373 DRFDVKATDSEALVAGSFSAI 393 (408) Q Consensus 373 ~r~d~~v~~~~a~~~l~~~~~ 393 (408) .|+|+++++|+||++++++++ T Consensus 381 ~r~d~~~~~~~a~~~l~~~aa 401 (401) T protein:vir:44 381 KRTGGMLVDSQAIKLLKIAAA 401 (401) T ss_pred EEeccEEecccceEEEEeecC Confidence 999999999999999999998 No 10 >protein:vir:485 Length: 407 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543092;swissprot:trembl:q8w627;genbank:gi:18249904;uniprot:Q8W627;genbank:GeneID:929693 Probab=100.00 E-value=4.7e-72 Score=411.61 Aligned_cols=379 Identities=17% Similarity=0.222 Sum_probs=304.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccc Q lcl|Aclame:pro 4 KLTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKGPLN 83 (408) Q Consensus 4 ~~~i~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 83 (408) -..+++|++.+.++.+.+++++++.++..+.. +++..++..+++.+++++++++++..+.+.......+ +.. T Consensus 1 l~~~k~l~~~i~e~~~~~~~~k~~~~~~~~~~---e~~~~~l~~~~e~~~~~~~~~e~~~~~~~~~~~~~~~-----~~~ 72 (407) T protein:vir:48 1 MADVKDVEQVAQELQRKFDDFKEKNDKRIDAI---EQEKGKLAGEVETLNGKLAELENLKSDLEAELAEVKR-----PAG 72 (407) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc-----ccc Confidence 22467777777777777777766554444332 3455566666666666666665555544433222111 112 Q ss_pred cchhhhHHHHHHHHHHHhhcchhh-HHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeecccCccce Q lcl|Aclame:pro 84 KSENELKDKFVKDFVNMVRNPMAF-MNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSR 162 (408) Q Consensus 84 ~~~~~~~~~~~~a~~~~~~~~~~~-~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~~ 162 (408) ........+++++|.++++++... ....+.+++..+++++||++||++++++|++.+++.++|+++|+++++.+.. 
+ T Consensus 73 ~~~~~~~~e~~~a~~~~l~~g~~~~~~~~e~~a~~~~t~~~gG~~iP~~~~~~I~~~~~~~~~l~~~~~~~~~~~~~--~ 150 (407) T protein:vir:48 73 GTQNKVASEHKEAFIGFMRKGREDGLRELERKALQVGNDEDGGYAIPEELDRTILTLLKDEVVMRQEATVITLGGSD--Y 150 (407) T ss_pred ccccchhhHHHHHHHHHHhccchhhhhHHHHHhhhcccCCCCcccccHhHHHHHHHHHHhhhhhhhhceeeecCCCc--e Confidence 223344566888999999876543 3456788999999999999999999999999999999999999999887654 4 Q ss_pred EEeeccCCccccchhcccccccccccccceeeeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhc Q lcl|Aclame:pro 163 VYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEV 242 (408) Q Consensus 163 ~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~~g 242 (408) .+|...+ .+.+.|++|++.+|+++.++|+++++++++++++++||+|+++|+.++|++||.++|+++++.+++.+|++| T Consensus 151 ~~~~~~~-~~~a~~v~E~~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~~~a~l~G 229 (407) T protein:vir:48 151 KKLVNLG-GTTSGWVGETDARPETATSKLGLIEPFMGEIYGNPQATQKMLDDAFFNVEDWINSELALEFAEQEEIAFTSG 229 (407) T ss_pred EEEEecC-CcceeeecccccccccccccceeEEeeeeeeEeehhhHHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhhcc Confidence 4444433 356799999999998887899999999999999999999999999999999999999999999999999999 Q ss_pred cccccchh---------------------------hhhhHHHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHhhhcccCc Q lcl|Aclame:pro 243 MKAAPKKP---------------------------TIAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEGK 295 (408) Q Consensus 243 ~g~~~~~~---------------------------~~~~~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~lkd~~G~ 295 (408) +|++.|.+ +...++++++++. 
.|++.|+.+++|+||+++|..|++|||++|| T Consensus 230 ~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~i~~l~~-~l~~~~~~~a~~v~n~~~~~~L~~lkD~~Gr 308 (407) T protein:vir:48 230 DGSKKPKGFLAYESTDEDDKTRAFGKLQHIASGAASGVTADAIIKLIY-TLRKAHRSGAKFMMNNSSLFAIRLLKDNDGN 308 (407) T ss_pred CCCCccceeeecccccccccccccccccccccccccccChHHHHHHHH-hhchhhhcCCEEEEcHHHHHHHHHhhccCCc Confidence 99865421 1224778887775 5899999999999999999999999999999 Q ss_pred eeeccccccCCcccccccceEeeccccccccccCcceEEEEehhcceEeeeccceEEEEeccchhhhhhceeeEEEEeee Q lcl|Aclame:pro 296 YLLEPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRF 375 (408) Q Consensus 296 ~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~ 375 (408) |+|+|++..+.+++|+|+||+++++ +|..+.+..+++||||+++|.+++|.++++..+++ |.+|++.||++.|+ T Consensus 309 ~l~~~~~~~g~~~~l~G~PV~~~~~--~p~~~~~~~~i~~Gd~~~~~~i~~~~~~~i~~d~~----~~~~~~~~~~~~r~ 382 (407) T protein:vir:48 309 YLWRPGIELGQPSSLAGYGIVENEQ--MPDIAADAKAIAFGNFKRGYTIVDRIGTRILRDPY----TNKPFVGFYTTKRT 382 (407) T ss_pred eeeccCcCCCCCceecceeeEEecC--cCCccCCccEEEEEeccccEEEEEeeceEEEeecc----ccCCcEEEEEEEEe Confidence 9999999999999999999998764 67777788889999999989999999999987654 56899999999999 Q ss_pred CcEEecccceEEEEeeccccCCCCc Q lcl|Aclame:pro 376 DVKATDSEALVAGSFSAIADQVGNF 400 (408) Q Consensus 376 d~~v~~~~a~~~l~~~~~~~~~~~~ 400 (408) |+++++|+||+++++++++++.+.- T Consensus 383 d~~v~~~~a~~~l~~~aa~~~~~~~ 407 (407) T protein:vir:48 383 GGMLVDSQAIKLMKIGAATRQKAAA 407 (407) T ss_pred ccEEecccceEEEEeeccCCCCCCC Confidence 9999999999999999998876554 No 11 >protein:vir:102873 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338137;genbank:gi:77020198;genbank:GeneID:3703782 Probab=100.00 E-value=5e-72 Score=411.48 Aligned_cols=382 Identities=34% Similarity=0.508 Sum_probs=306.1 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccc Q lcl|Aclame:pro 1 
MGVKLTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKG 80 (408) Q Consensus 1 M~~~~~i~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (408) |+.+ |+||++++.++.++++.+.++.+ .++++++.+++++++++++..++.. +.+......... T Consensus 1 M~k~--l~el~~~~~~~~~e~~~~~~~~~---------~~e~~~~~~e~~~l~~~i~~~~~~~-~~~~~~~~~~~~---- 64 (392) T protein:vir:10 1 MSKE--LRELLAKLEGKKEEVRSLMGEDK---------VAEAEQMMEEVRSLQKKIDLQRSLD-EAETEERNNGRE---- 64 (392) T ss_pred CcHH--HHHHHHHHHHHHHHHHHHhhHHH---------HHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHhhcccc---- Confidence 8866 78888888887777766644311 1345555556666666665543322 222221111111 Q ss_pred ccccchhhhHHHHHHHHHHHhhcchhh-------HHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhcee Q lcl|Aclame:pro 81 PLNKSENELKDKFVKDFVNMVRNPMAF-------MNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVE 153 (408) Q Consensus 81 ~~~~~~~~~~~~~~~a~~~~~~~~~~~-------~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~ 153 (408) ..........+++++|.++++.+... ....+.+.+..+++++||++||+++.+.|++.+++.++|+++|+++ T Consensus 65 -~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~ 143 (392) T protein:vir:10 65 -VETRNVDGEMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVE 143 (392) T ss_pred -ccccCccchHHHHHHHHHHHhcccccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHHHhhhhhhhhceee Confidence 11112234456778888888765421 1223455667778889999999999999999999999999999999 Q ss_pred ecccCccceEEeeccCCccccchhcccccccccccccceeeeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 154 SVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVV 233 (408) Q Consensus 154 ~~~~~~g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~ 233 (408) ++++.++.++++...+ .+.+.|++|++++++++.++|++|++++++++++++||+|+|+|+.++|++||.+.|++++++ T Consensus 144 ~~~~~~~~~~~~~~~~-~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~ 222 (392) T protein:vir:10 144 
PVRTRSGSRVLEKNSD-MIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKV 222 (392) T ss_pred eccCCceeEEEEeecC-CccceeecccccccccccccceeEEeeeeeEEEeehhhHHHHhhhHHHHHHHHHHHHHHHHHH Confidence 9999999988887654 466799999999998777999999999999999999999999999999999999999999999 Q ss_pred HHHHHHhhccccccchhhhhhHHHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHhhhcccCceeeccccccCCccccccc Q lcl|Aclame:pro 234 TRNQAIIEVMKAAPKKPTIAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNSYLIKGK 313 (408) Q Consensus 234 ~~~~~~~~g~g~~~~~~~~~~~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~l~G~ 313 (408) +++.+|++|+|++++. +..++++++++++..+++.|+.+++|+|||++|..|+++||++|+|+|++++.++.+++|+|+ T Consensus 223 ~~d~~~~~g~g~~~~~-~~~~~d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~tllG~ 301 (392) T protein:vir:10 223 TRNVLILGVIEKLTKQ-AIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGT 301 (392) T ss_pred HHHHHHhhcccccccc-CccCHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCCeEeecCccCCccccccCc Confidence 9999999999998875 557799999999888999999999999999999999999999999999999999999999998 Q ss_pred ceEeecc-cccc--ccccCcceEEEEehhcceEeeeccceEEEEeccchhhhhhceeeEEEEeeeCcEEecccceEEEEe Q lcl|Aclame:pro 314 QVIVVAD-RWLP--NTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSF 390 (408) Q Consensus 314 pv~~~~~-~~~~--~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~ 390 (408) |++++.+ ..++ ....+..+++||||+++|.+++|.+++++++++....|++|++.||++.|+|+++++|+||+++++ T Consensus 302 ~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~ 381 (392) T protein:vir:10 302 NPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEI 381 (392) T ss_pred ccEEEecccccCCCcccCCceEEEEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEe Confidence 7666543 3332 335577789999999999999999999999998888899999999999999999999999999999 Q ss_pred eccccCCCCcc Q lcl|Aclame:pro 391 SAIADQVGNFK 401 (408) Q Consensus 391 ~~~~~~~~~~~ 401 (408) ++.+|+...-+ T Consensus 382 
~~~a~~~~~~~ 392 (392) T protein:vir:10 382 DLSAPVEQPQG 392 (392) T ss_pred cccccccCCCC Confidence 88776543322 No 12 >protein:vir:107593 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338188;genbank:gi:77020144;genbank:GeneID:3703724 Probab=100.00 E-value=5e-72 Score=411.48 Aligned_cols=382 Identities=34% Similarity=0.508 Sum_probs=306.1 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccc Q lcl|Aclame:pro 1 MGVKLTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKG 80 (408) Q Consensus 1 M~~~~~i~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (408) |+.+ |+||++++.++.++++.+.++.+ .++++++.+++++++++++..++.. +.+......... T Consensus 1 M~k~--l~el~~~~~~~~~e~~~~~~~~~---------~~e~~~~~~e~~~l~~~i~~~~~~~-~~~~~~~~~~~~---- 64 (392) T protein:vir:10 1 MSKE--LRELLAKLEGKKEEVRSLMGEDK---------VAEAEQMMEEVRSLQKKIDLQRSLD-EAETEERNNGRE---- 64 (392) T ss_pred CcHH--HHHHHHHHHHHHHHHHHHhhHHH---------HHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHhhcccc---- Confidence 8866 78888888887777766644311 1345555556666666665543322 222221111111 Q ss_pred ccccchhhhHHHHHHHHHHHhhcchhh-------HHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhcee Q lcl|Aclame:pro 81 PLNKSENELKDKFVKDFVNMVRNPMAF-------MNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVE 153 (408) Q Consensus 81 ~~~~~~~~~~~~~~~a~~~~~~~~~~~-------~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~ 153 (408) ..........+++++|.++++.+... 
....+.+.+..+++++||++||+++.+.|++.+++.++|+++|+++ T Consensus 65 -~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~ 143 (392) T protein:vir:10 65 -VETRNVDGEMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVE 143 (392) T ss_pred -ccccCccchHHHHHHHHHHHhcccccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHHHhhhhhhhhceee Confidence 11112234456778888888765421 1223455667778889999999999999999999999999999999 Q ss_pred ecccCccceEEeeccCCccccchhcccccccccccccceeeeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 154 SVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVV 233 (408) Q Consensus 154 ~~~~~~g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~ 233 (408) ++++.++.++++...+ .+.+.|++|++++++++.++|++|++++++++++++||+|+|+|+.++|++||.+.|++++++ T Consensus 144 ~~~~~~~~~~~~~~~~-~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~ 222 (392) T protein:vir:10 144 PVRTRSGSRVLEKNSD-MIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKV 222 (392) T ss_pred eccCCceeEEEEeecC-CccceeecccccccccccccceeEEeeeeeEEEeehhhHHHHhhhHHHHHHHHHHHHHHHHHH Confidence 9999999988887654 466799999999998777999999999999999999999999999999999999999999999 Q ss_pred HHHHHHhhccccccchhhhhhHHHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHhhhcccCceeeccccccCCccccccc Q lcl|Aclame:pro 234 TRNQAIIEVMKAAPKKPTIAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNSYLIKGK 313 (408) Q Consensus 234 ~~~~~~~~g~g~~~~~~~~~~~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~l~G~ 313 (408) +++.+|++|+|++++. 
+..++++++++++..+++.|+.+++|+|||++|..|+++||++|+|+|++++.++.+++|+|+ T Consensus 223 ~~d~~~~~g~g~~~~~-~~~~~d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~tllG~ 301 (392) T protein:vir:10 223 TRNVLILGVIEKLTKQ-AIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGT 301 (392) T ss_pred HHHHHHhhcccccccc-CccCHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCCeEeecCccCCccccccCc Confidence 9999999999998875 557799999999888999999999999999999999999999999999999999999999998 Q ss_pred ceEeecc-cccc--ccccCcceEEEEehhcceEeeeccceEEEEeccchhhhhhceeeEEEEeeeCcEEecccceEEEEe Q lcl|Aclame:pro 314 QVIVVAD-RWLP--NTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSF 390 (408) Q Consensus 314 pv~~~~~-~~~~--~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~ 390 (408) |++++.+ ..++ ....+..+++||||+++|.+++|.+++++++++....|++|++.||++.|+|+++++|+||+++++ T Consensus 302 ~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~ 381 (392) T protein:vir:10 302 NPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEI 381 (392) T ss_pred ccEEEecccccCCCcccCCceEEEEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEe Confidence 7666543 3332 335577789999999999999999999999998888899999999999999999999999999999 Q ss_pred eccccCCCCcc Q lcl|Aclame:pro 391 SAIADQVGNFK 401 (408) Q Consensus 391 ~~~~~~~~~~~ 401 (408) ++.+|+...-+ T Consensus 382 ~~~a~~~~~~~ 392 (392) T protein:vir:10 382 DLSAPVEQPQG 392 (392) T ss_pred cccccccCCCC Confidence 88776543322 No 13 >protein:vir:102082 Length: 392 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512315;genbank:gi:89152484;genbank:GeneID:3953075 Probab=100.00 E-value=5e-72 Score=411.48 Aligned_cols=382 Identities=34% Similarity=0.508 Sum_probs=306.1 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccc Q lcl|Aclame:pro 1 
MGVKLTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKG 80 (408) Q Consensus 1 M~~~~~i~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (408) |+.+ |+||++++.++.++++.+.++.+ .++++++.+++++++++++..++.. +.+......... T Consensus 1 M~k~--l~el~~~~~~~~~e~~~~~~~~~---------~~e~~~~~~e~~~l~~~i~~~~~~~-~~~~~~~~~~~~---- 64 (392) T protein:vir:10 1 MSKE--LRELLAKLEGKKEEVRSLMGEDK---------VAEAEQMMEEVRSLQKKIDLQRSLD-EAETEERNNGRE---- 64 (392) T ss_pred CcHH--HHHHHHHHHHHHHHHHHHhhHHH---------HHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHhhcccc---- Confidence 8866 78888888887777766644311 1345555556666666665543322 222221111111 Q ss_pred ccccchhhhHHHHHHHHHHHhhcchhh-------HHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhcee Q lcl|Aclame:pro 81 PLNKSENELKDKFVKDFVNMVRNPMAF-------MNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVE 153 (408) Q Consensus 81 ~~~~~~~~~~~~~~~a~~~~~~~~~~~-------~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~ 153 (408) ..........+++++|.++++.+... ....+.+.+..+++++||++||+++.+.|++.+++.++|+++|+++ T Consensus 65 -~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~ 143 (392) T protein:vir:10 65 -VETRNVDGEMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVE 143 (392) T ss_pred -ccccCccchHHHHHHHHHHHhcccccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHHHhhhhhhhhceee Confidence 11112234456778888888765421 1223455667778889999999999999999999999999999999 Q ss_pred ecccCccceEEeeccCCccccchhcccccccccccccceeeeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 154 SVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVV 233 (408) Q Consensus 154 ~~~~~~g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~ 233 (408) ++++.++.++++...+ .+.+.|++|++++++++.++|++|++++++++++++||+|+|+|+.++|++||.+.|++++++ T Consensus 144 ~~~~~~~~~~~~~~~~-~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~ 222 (392) T protein:vir:10 144 
PVRTRSGSRVLEKNSD-MIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKV 222 (392) T ss_pred eccCCceeEEEEeecC-CccceeecccccccccccccceeEEeeeeeEEEeehhhHHHHhhhHHHHHHHHHHHHHHHHHH Confidence 9999999988887654 466799999999998777999999999999999999999999999999999999999999999 Q ss_pred HHHHHHhhccccccchhhhhhHHHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHhhhcccCceeeccccccCCccccccc Q lcl|Aclame:pro 234 TRNQAIIEVMKAAPKKPTIAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNSYLIKGK 313 (408) Q Consensus 234 ~~~~~~~~g~g~~~~~~~~~~~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~l~G~ 313 (408) +++.+|++|+|++++. +..++++++++++..+++.|+.+++|+|||++|..|+++||++|+|+|++++.++.+++|+|+ T Consensus 223 ~~d~~~~~g~g~~~~~-~~~~~d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~tllG~ 301 (392) T protein:vir:10 223 TRNVLILGVIEKLTKQ-AIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGT 301 (392) T ss_pred HHHHHHhhcccccccc-CccCHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCCeEeecCccCCccccccCc Confidence 9999999999998875 557799999999888999999999999999999999999999999999999999999999998 Q ss_pred ceEeecc-cccc--ccccCcceEEEEehhcceEeeeccceEEEEeccchhhhhhceeeEEEEeeeCcEEecccceEEEEe Q lcl|Aclame:pro 314 QVIVVAD-RWLP--NTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSF 390 (408) Q Consensus 314 pv~~~~~-~~~~--~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~ 390 (408) |++++.+ ..++ ....+..+++||||+++|.+++|.+++++++++....|++|++.||++.|+|+++++|+||+++++ T Consensus 302 ~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~ 381 (392) T protein:vir:10 302 NPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEI 381 (392) T ss_pred ccEEEecccccCCCcccCCceEEEEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEe Confidence 7666543 3332 335577789999999999999999999999998888899999999999999999999999999999 Q ss_pred eccccCCCCcc Q lcl|Aclame:pro 391 SAIADQVGNFK 401 (408) Q Consensus 391 ~~~~~~~~~~~ 401 (408) ++.+|+...-+ T Consensus 382 
~~~a~~~~~~~ 392 (392) T protein:vir:10 382 DLSAPVEQPQG 392 (392) T ss_pred cccccccCCCC Confidence 88776543322 No 14 >protein:vir:105004 Length: 392 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459969;genbank:gi:85701384;genbank:GeneID:3882145 Probab=100.00 E-value=5e-72 Score=411.48 Aligned_cols=382 Identities=34% Similarity=0.508 Sum_probs=306.1 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccc Q lcl|Aclame:pro 1 MGVKLTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKG 80 (408) Q Consensus 1 M~~~~~i~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (408) |+.+ |+||++++.++.++++.+.++.+ .++++++.+++++++++++..++.. +.+......... T Consensus 1 M~k~--l~el~~~~~~~~~e~~~~~~~~~---------~~e~~~~~~e~~~l~~~i~~~~~~~-~~~~~~~~~~~~---- 64 (392) T protein:vir:10 1 MSKE--LRELLAKLEGKKEEVRSLMGEDK---------VAEAEQMMEEVRSLQKKIDLQRSLD-EAETEERNNGRE---- 64 (392) T ss_pred CcHH--HHHHHHHHHHHHHHHHHHhhHHH---------HHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHhhcccc---- Confidence 8866 78888888887777766644311 1345555556666666665543322 222221111111 Q ss_pred ccccchhhhHHHHHHHHHHHhhcchhh-------HHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhcee Q lcl|Aclame:pro 81 PLNKSENELKDKFVKDFVNMVRNPMAF-------MNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVE 153 (408) Q Consensus 81 ~~~~~~~~~~~~~~~a~~~~~~~~~~~-------~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~ 153 (408) ..........+++++|.++++.+... 
....+.+.+..+++++||++||+++.+.|++.+++.++|+++|+++ T Consensus 65 -~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~ 143 (392) T protein:vir:10 65 -VETRNVDGEMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVE 143 (392) T ss_pred -ccccCccchHHHHHHHHHHHhcccccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHHHhhhhhhhhceee Confidence 11112234456778888888765421 1223455667778889999999999999999999999999999999 Q ss_pred ecccCccceEEeeccCCccccchhcccccccccccccceeeeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 154 SVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVV 233 (408) Q Consensus 154 ~~~~~~g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~ 233 (408) ++++.++.++++...+ .+.+.|++|++++++++.++|++|++++++++++++||+|+|+|+.++|++||.+.|++++++ T Consensus 144 ~~~~~~~~~~~~~~~~-~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~ 222 (392) T protein:vir:10 144 PVRTRSGSRVLEKNSD-MIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKV 222 (392) T ss_pred eccCCceeEEEEeecC-CccceeecccccccccccccceeEEeeeeeEEEeehhhHHHHhhhHHHHHHHHHHHHHHHHHH Confidence 9999999988887654 466799999999998777999999999999999999999999999999999999999999999 Q ss_pred HHHHHHhhccccccchhhhhhHHHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHhhhcccCceeeccccccCCccccccc Q lcl|Aclame:pro 234 TRNQAIIEVMKAAPKKPTIAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNSYLIKGK 313 (408) Q Consensus 234 ~~~~~~~~g~g~~~~~~~~~~~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~l~G~ 313 (408) +++.+|++|+|++++. 
+..++++++++++..+++.|+.+++|+|||++|..|+++||++|+|+|++++.++.+++|+|+ T Consensus 223 ~~d~~~~~g~g~~~~~-~~~~~d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~tllG~ 301 (392) T protein:vir:10 223 TRNVLILGVIEKLTKQ-AIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGT 301 (392) T ss_pred HHHHHHhhcccccccc-CccCHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCCeEeecCccCCccccccCc Confidence 9999999999998875 557799999999888999999999999999999999999999999999999999999999998 Q ss_pred ceEeecc-cccc--ccccCcceEEEEehhcceEeeeccceEEEEeccchhhhhhceeeEEEEeeeCcEEecccceEEEEe Q lcl|Aclame:pro 314 QVIVVAD-RWLP--NTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSF 390 (408) Q Consensus 314 pv~~~~~-~~~~--~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~ 390 (408) |++++.+ ..++ ....+..+++||||+++|.+++|.+++++++++....|++|++.||++.|+|+++++|+||+++++ T Consensus 302 ~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~ 381 (392) T protein:vir:10 302 NPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEI 381 (392) T ss_pred ccEEEecccccCCCcccCCceEEEEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEe Confidence 7666543 3332 335577789999999999999999999999998888899999999999999999999999999999 Q ss_pred eccccCCCCcc Q lcl|Aclame:pro 391 SAIADQVGNFK 401 (408) Q Consensus 391 ~~~~~~~~~~~ 401 (408) ++.+|+...-+ T Consensus 382 ~~~a~~~~~~~ 392 (392) T protein:vir:10 382 DLSAPVEQPQG 392 (392) T ss_pred cccccccCCCC Confidence 88776543322 No 15 >protein:vir:81160 Length: 371 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285811;genbank:gi:148747732;genbank:GeneID:5247203 Probab=100.00 E-value=5e-72 Score=411.47 Aligned_cols=366 Identities=33% Similarity=0.504 Sum_probs=306.6 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccc Q lcl|Aclame:pro 1 
MGVKLTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKG 80 (408) Q Consensus 1 M~~~~~i~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (408) |+.+ |+++.+++.++.++++.+.++ + ..++++++..+++.++++++.+++..+........ . T Consensus 1 M~k~--l~~l~e~~~~~~~e~~~~~~~-------~--~~e~~~~~~~ei~~l~~~i~~~~~~~~~~~~~~~~-------~ 62 (371) T protein:vir:81 1 MPKE--LRELLEQINNKKEEARKLLAE-------N--KIEEAKKLKEEIVALQEKFDVAKELYEEQKQTIED-------K 62 (371) T ss_pred CcHH--HHHHHHHHHHHHHHHHHHhhH-------H--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc-------c Confidence 8864 677777776666665554432 1 22456667777777777776665544443222111 1 Q ss_pred ccccchhhhHHHHHHHHHHHhhcchhhHHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeecccCcc Q lcl|Aclame:pro 81 PLNKSENELKDKFVKDFVNMVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNG 160 (408) Q Consensus 81 ~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g 160 (408) ............++++|.++++.+ +.++++.+++++||++||+++..+|++.+++.++|+++++++++++.++ T Consensus 63 ~~~~~~~~~~~~~~~~~~~~l~~~-------~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~ 135 (371) T protein:vir:81 63 EPLKPTVQVKENEVEAFVNHIRTR-------FRNAMSEGSNQDGGYTVPQDIQTRINELRESKDALQNLITVEPVTTLSG 135 (371) T ss_pred cccccchhhHHHHHHHHHHHHHHH-------HHHhhccCCCccCceeecHhHHHHHHHHHHhhhhhhhhceeeeccCCce Confidence 112223334566788888887754 4567788899999999999999999999999999999999999998888 Q ss_pred ceEEeeccCCccccchhcccccccccccccceeeeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 161 SRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAII 240 (408) Q Consensus 161 ~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~ 240 (408) .++++...+. 
+.+.|++|++++++++.++|++|++++++++++++||+|+++|+.++|++||.+.|++++++++|.+|+ T Consensus 136 ~~~~~~~~~~-~~a~~v~Eg~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~a~~~~~~~~i~ 214 (371) T protein:vir:81 136 SRVFKKRSQQ-TGFVEVAEGAAIGEKATPQFTLLQYQVKKYAGFFRVTNELLNDSTEAIVNTLVRWIGDESRVTRNGLII 214 (371) T ss_pred eEEEEeecCC-cceeeeccccccccccccceeeEEeeeeEEEEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHH Confidence 8888876543 667999999999987889999999999999999999999999999999999999999999999999999 Q ss_pred hccccccchhhhhhHHHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHhhhcccCceeeccccccCCcccccccceEeecc Q lcl|Aclame:pro 241 EVMKAAPKKPTIAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNSYLIKGKQVIVVAD 320 (408) Q Consensus 241 ~g~g~~~~~~~~~~~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~l~G~pv~~~~~ 320 (408) +|+|++++. +..++++++.++...+++.|+.+++|+|||++|.+|+++||++|+|+|++++.++.+++|+|+||+++++ T Consensus 215 ~g~g~~~~~-~~~~~~~i~~~~~~~l~~~~~~~a~~vmn~~~~~~L~~lkd~~g~~l~~~~~~~~~~~~l~G~pV~~~~~ 293 (371) T protein:vir:81 215 NVLNTKAKT-AIADLDGLKQIINVQLDPVFRSTSSVIVNQDAFNWLDTLKDQNGQYLLQPSISSPTGRQLLGLPVVIVSN 293 (371) T ss_pred hhccccccc-ccccHHHHHHHHHhhcchhhhcCCEEEEcHHHHHHHHHhhccCCCeeeecccCCCCCceecceeEEEecc Confidence 999998875 5577899999888889999999999999999999999999999999999999999999999999999876 Q ss_pred cccc-----ccccCcceEEEEehhcceEeeeccceEEEEeccchhhhhhceeeEEEEeeeCcEEecccceEEEEeecc Q lcl|Aclame:pro 321 RWLP-----NTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFSAI 393 (408) Q Consensus 321 ~~~~-----~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~ 393 (408) .+.+ +...+...++||||+++|.+++|.+++++++++.++.|++|++.||++.|+|+++++|+||++++++++ T Consensus 294 ~~~~~~~~~~~~~~~~~i~~Gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~v~~~~~~r~d~~~~~~~a~~~~~~~~A 371 (371) T protein:vir:81 294 KVLANRVDGGTGAQFAPIIVGDLKEAVVMFDRQRTEIMSSNVAMDAFETDATLWRAIERMDVKMRDDEAFVFGEVQLA 371 (371) T ss_pred cccCccccccccCCcceEEEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEEecC Confidence 5433 
334567789999999999999999999999999988999999999999999999999999999999988 No 16 >protein:vir:102119 Length: 404 # NCBI annotation: phage major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699941;genbank:gi:110804052;genbank:GeneID:4206662 Probab=100.00 E-value=5.9e-71 Score=405.62 Aligned_cols=385 Identities=27% Similarity=0.370 Sum_probs=302.5 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccc Q lcl|Aclame:pro 1 MGVKLTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKG 80 (408) Q Consensus 1 M~~~~~i~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (408) |+.+ |++|+++++++.++++ .+.++.+.+.++++.+.+++++++++++..++..+............... T Consensus 1 M~k~--l~el~~~~~~~~~e~~-------~~~~~~~~~~ee~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~- 70 (404) T protein:vir:10 1 MSKE--LRELLNQLDSKNKELN-------SLLNKDGVTAEELNKTSNEIDILQAKIEAQKRKENIENNFNEDNVKSLNT- 70 (404) T ss_pred CcHH--HHHHHHHHHHHHHHHH-------HHHhhcCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccc- Confidence 8864 6777776666555444 34444556667788888888888777765544333322222111111111 Q ss_pred ccccchhhhHHHHHHHHH-HHhhcc---hhhHHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeecc Q lcl|Aclame:pro 81 PLNKSENELKDKFVKDFV-NMVRNP---MAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVS 156 (408) Q Consensus 81 ~~~~~~~~~~~~~~~a~~-~~~~~~---~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~ 156 (408) ..............+.+. .+++.. 
.......+.+++..+++++||++||+++.+.|++.+++.++|++++++++++ T Consensus 71 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~a~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~l~~~~~~~ 150 (404) T protein:vir:10 71 GKEENVIYNGALFVRAIADNLLKQKNQRGLNLSEKEINAISENIDEDGGYAVPEDIQTKINTRLKDTTDLYNMVDYEPVF 150 (404) T ss_pred ccchhhHHHHHHHHHHHHHHHHHHHHhhhhcchhhHHhhhccccCCCCceeechhHHHHHHHHHhhhhhHhhhhceeecc Confidence 111111111111112221 122221 1123344678888889999999999999999999999999999999999999 Q ss_pred cCccceEEeeccCCccccchhccccccccc-ccccceeeeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 157 TSNGSRVYEKWTDVTPLTVMDAEDGKIPDL-DNPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTR 235 (408) Q Consensus 157 ~~~g~~~~~~~~~~~~~~~~~~E~~~~~~~-~~~~f~~v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~ 235 (408) +.+|.+.+++..+ .+.++|++|++.++.+ ..++|++|++++++++++++||+|+++|+.++|++||.+.|++++++++ T Consensus 151 ~~~g~~~~~~~~~-~~~~~~v~e~~~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~ 229 (404) T protein:vir:10 151 TRSGSRTYEKRSK-QKPMKPLSENQQIPTNGDNGKLERFNFKLKDLADFMSIPNDLLKFADKSLEDWIINWFVDKVRITR 229 (404) T ss_pred CCccceEEEEecC-CcceeeccccccccccccccceeeeEeeheeeEeeehhhHHHHhhcHHHHHHHHHHHHHHHHHHHH Confidence 9999999887654 4677999999999865 3588999999999999999999999999999999999999999999999 Q ss_pred HHHHhhccccccch--------------hhhhhHHHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHhhhcccCceeeccc Q lcl|Aclame:pro 236 NQAIIEVMKAAPKK--------------PTIAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPD 301 (408) Q Consensus 236 ~~~~~~g~g~~~~~--------------~~~~~~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~lkd~~G~~~~~~~ 301 (408) |.+|++|+|++.+. 
.+...++++..++...+++.|..+++|+|||++|..|+++||++|+|+|.|+ T Consensus 230 ~~~il~G~g~~~~~~gi~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~ 309 (404) T protein:vir:10 230 NAEILYGAGGDEHATGIMTANKFKKITLPKSPALKDFKKCKNVELLNVFKATSSWIVNQDGFNYLDSLEDKTGRPYLQPD 309 (404) T ss_pred HHHHhhcCCCCCcccceeeccccceeeccccccHHHHHHHHHhhhhccccCCCEEEEcHHHHHHHHHhhccCCceeeccC Confidence 99999999976532 2334578888888878999999999999999999999999999999999999 Q ss_pred cccCCcccccccceEeeccccccccccCcceEEEEehhcceEeeeccceEEEEeccchhhhhhceeeEEEEeeeCcEEec Q lcl|Aclame:pro 302 PTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKATD 381 (408) Q Consensus 302 ~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~ 381 (408) +.++.+++|+|+||+++++. ++....+..+++||||+++|.++.|++++++++++.+..|.+|++.||++.|+|+++.+ T Consensus 310 ~~~~~~~~l~G~PV~~~~~~-~~~~~~~~~~~~~gd~s~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~d~~v~~ 388 (404) T protein:vir:10 310 PKDPTQYRFLGLPVIELPND-LLLSTESAIPVLLGDTKEAYKYVSDGAYELATTNIGAGAFETNTTKARIIMRIDGNVKD 388 (404) T ss_pred cCCCCCccccceeeEEeccc-ccCCCCCccEEEEEeccccEEEEEecceEEEEeccccchhhcCceEEEEEEeeccEEec Confidence 99999999999999987654 45556778889999999999999999999999998888999999999999999999999 Q ss_pred ccceEEEEeeccccCC Q lcl|Aclame:pro 382 SEALVAGSFSAIADQV 397 (408) Q Consensus 382 ~~a~~~l~~~~~~~~~ 397 (408) |+||+++++++++.+. 
T Consensus 389 ~~a~~~~~~~~aa~~~ 404 (404) T protein:vir:10 389 SEALLIAEIPVESVQA 404 (404) T ss_pred ccceEEEEeecccCCC Confidence 9999999999988877 No 17 >protein:vir:100884 Length: 389 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358764;genbank:gi:78000028;genbank:GeneID:3726155 Probab=100.00 E-value=4.8e-70 Score=400.63 Aligned_cols=379 Identities=23% Similarity=0.303 Sum_probs=312.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccc----cc Q lcl|Aclame:pro 7 VNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKG----PL 82 (408) Q Consensus 7 i~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~ 82 (408) |++|++.++++.++++++++++++...+++...+++++++++++++.++++++++++...+............. .. T Consensus 1 meeL~~~~~~~~~~~~e~~~~l~~~~~~~~~~~e~~~~l~~ei~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (389) T protein:vir:10 1 MDKLQTLFNDVSAKCADLNAQLNAKLQDENASVDDFQKIKDDLTAAKARRDAINDQIKALEAEKPAEPKTEPKDDGSKKG 80 (389) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHhHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccc Confidence 78888888888899999999998888888888899999999999999999988888887665443332222111 11 Q ss_pred ccchhhhHHHHHHHHHHHhhcchhhHHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeecccCccce Q lcl|Aclame:pro 83 NKSENELKDKFVKDFVNMVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSR 162 (408) Q Consensus 83 ~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~~ 162 (408) ..........++++|.++++++.. 
..+++..+++++||++||+++...|++.+++.++|+++|+++++++.++.+ T Consensus 81 ~~~~~~~~~~~~~~~~~~lr~~~~-----~~~~~~~~t~~~gg~~vP~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~ 155 (389) T protein:vir:10 81 TDLSKKPIDAKKKAINDFIHSHGK-----VIDATSKVTSTEAGVLIPEEIIYDPTAEVNSVVDLSTLVTKTPVTTPKGTY 155 (389) T ss_pred cccchhHHHHHHHHHHHHhhcchh-----hhhhhcccccCCcceeehHHHHHHHHHHHHhhhhHHhhcceeeccCCeeEE Confidence 122222334456788888887542 345667788899999999999999999999999999999999999888888 Q ss_pred EEeeccCCccccchhcccccccccccccceeeeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhc Q lcl|Aclame:pro 163 VYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEV 242 (408) Q Consensus 163 ~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~~g 242 (408) ++.... ...++|++|++++++.+.++|++|++.++++++++++|+|+++||.++|++||.++|+++++++++.+|++| T Consensus 156 ~~~~~~--~~~~~~~~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~i~~g 233 (389) T protein:vir:10 156 PILKRA--TDRFSSVAELAENPKLAEPEFNKVDWSVATYRGAIPLSEEAIADSAVDLTALVGQSIKEKSVNTYNAMIAPV 233 (389) T ss_pred EEEecC--CCccccccccccccccccccceeeeeeheeeEeeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhhh Confidence 777654 345578999999998788999999999999999999999999999999999999999999999999999999 Q ss_pred cccccch--hhhhhHHHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHhhhcccCceeecccccc----CCcccccccceE Q lcl|Aclame:pro 243 MKAAPKK--PTIAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTK----PNSYLIKGKQVI 316 (408) Q Consensus 243 ~g~~~~~--~~~~~~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~----~~~~~l~G~pv~ 316 (408) ++++.+. 
.+...+++++++++..+++.+ ++.|+|||++|..|+++||++|+|+|++++.+ +.+++|+|+||+ T Consensus 234 ~~~~~~~~~~~~~~~d~l~~~~~~~~~~~~--~a~~~~n~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~~~~~l~G~pV~ 311 (389) T protein:vir:10 234 LQSFTAKKTTTDTLVDSLKHILNVDLDPAY--SRALVVTQSLFNTLDTLKDKNGRYLLHDASDSITDGTAKGTILGVPVY 311 (389) T ss_pred hcccccccccccccHHHHHHHHHhhhhhhh--CcEEEecHHHHHHHHHhhccCCCeeeecCcccccccccccccccceeE Confidence 9887554 455678999888877777776 57899999999999999999999999887644 445799999999 Q ss_pred eeccccccccccCcceEEEEehhcceEeeeccceEEEEeccchhhhhhceeeEEEEeeeCcEEecccceEEEEeeccccC Q lcl|Aclame:pro 317 VVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFSAIADQ 396 (408) Q Consensus 317 ~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~~ 396 (408) ++++..++.. +++.+++||||+++|.+++|+++++.++++.+ | ...+|++.|+|+++++|+||+++++++++.. T Consensus 312 ~~~~~~~~~~-~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~--~---~~~~~~~~r~d~~~~~~~a~~~~~~~~~~~~ 385 (389) T protein:vir:10 312 VVGDTLLGSL-AGDQKAFVGDLKRGVLFTDRQQVTLAWEDSKI--Y---GKYLGAAFRFGVQKADSKAGYFVTNTDVPGS 385 (389) T ss_pred EecccccCCC-CCceEEEEeeccccEEEEeecceEEEeecccc--c---cceEEEEEEeccEEecccceEEEEeeccCCC Confidence 9887666654 56678999999999999999999999987543 3 4578999999999999999999998876654 Q ss_pred CCCc Q lcl|Aclame:pro 397 VGNF 400 (408) Q Consensus 397 ~~~~ 400 (408) .++= T Consensus 386 ~~~~ 389 (389) T protein:vir:10 386 ALGK 389 (389) T ss_pred CCCC Confidence 4433 No 18 >protein:vir:3870 Length: 400 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680487;swissprot:trembl:q8ltc0;genbank:gi:22296527;interpro:IPR006444;uniprot:Q8LTC0;genbank:GeneID:951713 Probab=100.00 E-value=8e-70 Score=399.40 Aligned_cols=382 Identities=20% Similarity=0.234 Sum_probs=298.6 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccc--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccc Q lcl|Aclame:pro 1 MGVKLTVNQLNEAWIASGDKVTDFNDQINMALNDDNFS--AEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEE 
78 (408) Q Consensus 1 M~~~~~i~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 78 (408) ||+++.|+++++++.++.+++.++.++++++++..+.. ..+..+++.+++.++.+++++++++...+........... T Consensus 1 ~~l~e~i~e~~~~l~el~~~~~~~~~e~r~~~e~~~~~~~~~~~~e~~~~~~~l~~ei~~l~e~~~~~~~~~~~~~~~~~ 80 (400) T protein:vir:38 1 MTLDEKLAAVKKQLDEKRSALPAMKTELRSLLEGEDSEENLKKAEGVRAKYDKAGKEIKDLEEKRDLYEAALKGNEQSSG 80 (400) T ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccc Confidence 99999999999999999999999999998877655432 2345666777777888888777776665444333222222 Q ss_pred ccccccchhhhHHHHHHHH------HHHhhc---------chhhHHHHHHHHh-hccccccCceecchhhhhhhhhhhhh Q lcl|Aclame:pro 79 KGPLNKSENELKDKFVKDF------VNMVRN---------PMAFMNTVSSKTE-TSGSDSAAGLTIPQDIRTMINTLVRQ 142 (408) Q Consensus 79 ~~~~~~~~~~~~~~~~~a~------~~~~~~---------~~~~~~~~~~~a~-~~~t~~~gg~~vP~~~~~~ii~~~~~ 142 (408) ................... ...... ...........+. ...++++||++||+++.+.|++.+++ T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vP~~~~~~ii~~~~~ 160 (400) T protein:vir:38 81 KKPDHPEEHSYRDALNAYLHTRGRNTDGVNFEKTDVGTFAVLRAVPTDASDAVNAGVKAADAASTIPETISNTPQRELQT 160 (400) T ss_pred ccccchhhhhHHHHHHHHHhhHHHHHHHHHHHHHHHHHHhhhhhhhHHHHHHHhhcccccCCcccccHHHHHHHHHHHHh Confidence 2111111111111100000 000000 0000011111222 23467789999999999999999999 Q ss_pred hhhhhhhhceeecccCccceEEeeccCCccccchhcccccccccccccceeeeechheeeeehHHHHHHHhcchHHHHHH Q lcl|Aclame:pro 143 YDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAENILAW 222 (408) Q Consensus 143 ~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~~~~~~~ 222 (408) .++|+++++++++++.++.++++... 
.+.++|++|+++.++.+.++|++|++++++++++++||+|||+||.++|++| T Consensus 161 ~~~l~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~E~~~~~~~~~~~f~~i~~~~~k~~~~~~is~ell~ds~~~~~~~ 238 (400) T protein:vir:38 161 VVDLKPFTNVFQASTQKGTYPTVANA--TTKMVTVAELEKNPAMAKPEFKPVNWSVETYRQALPVSQESIDDSAIDLVGL 238 (400) T ss_pred hhhhhhcceeEeccCcceEEEEEecC--CCccccccccccccccccccceeeEeehhheeeehhhHHHHHhhhHHHHHHH Confidence 99999999999999888888777643 4566899999999987889999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHhhccccccchhhhhhHHHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHhhhcccCceeecccc Q lcl|Aclame:pro 223 LSSWIAKKVVVTRNQAIIEVMKAAPKKPTIAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDP 302 (408) Q Consensus 223 v~~~l~~~~~~~~~~~~~~g~g~~~~~~~~~~~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~lkd~~G~~~~~~~~ 302 (408) |.+.|+++++.+++.++++|+|++++. +..+++++++++...++..+ +++|+|||++|.+|++|||++|+|+|+|++ T Consensus 239 i~~~l~~~~~~~~~~~i~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~--~a~~v~~~~~~~~l~~lkd~~G~~i~~~~~ 315 (400) T protein:vir:38 239 IAQNGQQIKVNTTNGAVATLLKGFTAK-TISSVDDLKHINNVDLDPAY--SRVIIASQSFYNFLDTVKDGNGRYLLQDSI 315 (400) T ss_pred HHHHHHHHHHHHHHHhhhhcccccccc-ccccHHHHHHHHHhhhhhhh--CcEEEEcHHHHHHHHHhhccCCCeeeecCc Confidence 999999999999999999999988765 55678999988877666554 679999999999999999999999999999 Q ss_pred ccCCcccccccceEeeccccccccccCcceEEEEehhcceEeeeccceEEEEeccchhhhhhceeeEEEEeeeCcEEecc Q lcl|Aclame:pro 303 TKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDS 382 (408) Q Consensus 303 ~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~ 382 (408) .++.+++|+|+||+++++. |....++..++||||+++|.+++|++++++++++. 
.+...+|+++|+|+++.+| T Consensus 316 ~~~~~~~l~G~pv~~~~~~--~~~~~g~~~~~~gd~s~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~r~d~~~~~~ 388 (400) T protein:vir:38 316 LTPSGKSVLGMPIAVVSDD--TLGAAGEAHAFLGDIKRAILFANRADFMVRWVDDQ-----IYGQFLQAGMRFGVSVADE 388 (400) T ss_pred CCCCccccccceeEEeccc--ccCCCCceEEEEEeccccEEEEeecceEEEEeccc-----ccceeEEEEEEeccEEecc Confidence 9999999999999998764 44456677899999999999999999999998754 3456899999999999999 Q ss_pred cceEEEEeeccc Q lcl|Aclame:pro 383 EALVAGSFSAIA 394 (408) Q Consensus 383 ~a~~~l~~~~~~ 394 (408) +||+++++++.+ T Consensus 389 ~a~~~l~~~~~a 400 (400) T protein:vir:38 389 KAGYFLTYTPKA 400 (400) T ss_pred cceEEEEeecCC Confidence 999999999988 No 19 >protein:vir:100172 Length: 394 # NCBI annotation: putative major head protein # Family: family:all:21 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025031;genbank:gi:48697264;genbank:GeneID:2948270 Probab=100.00 E-value=1.2e-69 Score=398.36 Aligned_cols=383 Identities=23% Similarity=0.304 Sum_probs=308.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccc-----cc Q lcl|Aclame:pro 7 VNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEK-----GP 81 (408) Q Consensus 7 i~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~ 81 (408) |++|++.++++.+.+.+++++++++..+++...+++++++++++.+..+++.++++++.++............ .. 
T Consensus 1 M~~l~~l~~~~~~~~~e~~~~~~~~~~~~~~~~ee~~~~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~ 80 (394) T protein:vir:10 1 MDKLQTLFNEVSAKCADLNAQLNAKLQDENASVDDFQKIKDDLTAAKARRDAINDQIKDLEAENKANSDPDKPVDNAQPN 80 (394) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHhhhhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhhhhhccc Confidence 5556666666667777777777777777778888999999999999999999888887765543322211111 11 Q ss_pred cccchhhhHHHHHHHHHHHhhcchhhHHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeecccCccc Q lcl|Aclame:pro 82 LNKSENELKDKFVKDFVNMVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGS 161 (408) Q Consensus 82 ~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~ 161 (408) ............+++|.++++++... ...+...+++++||++||++++..|++.+++.++|+++|+++++++.+++ T Consensus 81 ~~~~~~~~~~~~~~~~~~~l~~~~~~----~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ 156 (394) T protein:vir:10 81 GTDLKKKPIDAKKKAINDFIHSHGKV----IDNAAGHVTSTEAGVLIPEEIIYDPTAEVNSVVDLSTLVTKTPVTTPKGT 156 (394) T ss_pred ccchhhhHHHHHHHHHHHHHhccchh----hhhhhcccccccCceeccHHHHHHHHHHHHhhhhhhhhceeeeccCCceE Confidence 11222223345667899998876533 23456677888999999999999999999999999999999999988888 Q ss_pred eEEeeccCCccccchhcccccccccccccceeeeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|Aclame:pro 162 RVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIE 241 (408) Q Consensus 162 ~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~~ 241 (408) ++++... 
.+.+.|++|++++++++.++|++|++++++++++++||+|||+|+.++|++||.++|++++++++|.+|++ T Consensus 157 ~~~~~~~--~~~~~~~~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~il~ 234 (394) T protein:vir:10 157 YPILKRA--TDRFSSVAELAENPALAEPEFEQVDWSVSTYRGAIPLSEEAIADSAVDLTSLVGQSINEKSVNTYNAMIAP 234 (394) T ss_pred EEEEecC--CCccccccccccccccccccceeEEeeeeeeEeeehhHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 8877653 35568999999999888899999999999999999999999999999999999999999999999999999 Q ss_pred ccccccch--hhhhhHHHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHhhhcccCceeecccccc----CCcccccccce Q lcl|Aclame:pro 242 VMKAAPKK--PTIAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTK----PNSYLIKGKQV 315 (408) Q Consensus 242 g~g~~~~~--~~~~~~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~----~~~~~l~G~pv 315 (408) |+|++.+. .+...+|++++++...++..| ++.|+|||++|.+|++|||++|||+|++++.+ +.+++|+|+|| T Consensus 235 g~g~~~~~~~~~~~~~d~l~~~~~~~~~~~~--~a~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~L~G~PV 312 (394) T protein:vir:10 235 VLQSFTAKATTTDTLVDSLKHILNVDLDPAY--SRALVVTQSLFNTLDTLKDKNGRYLLHDASDSITDGTAKGTVLGVPV 312 (394) T ss_pred cccccccccccccccHHHHHHHHHhhhhhhc--cCEEEecHHHHHHHHHhhccCCCeeeeccccccccCCccccccccee Confidence 99987654 455668889888877777766 57899999999999999999999999887644 44579999999 Q ss_pred EeeccccccccccCcceEEEEehhcceEeeeccceEEEEeccchhhhhhceeeEEEEeeeCcEEecccceEEEEeecccc Q lcl|Aclame:pro 316 IVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFSAIAD 395 (408) Q Consensus 316 ~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~ 395 (408) +++++..++. 
..++.+++||||+++|+++++++++++++++.+ | ...+|++.|+|+++++|+||++++++++++ T Consensus 313 ~~~~~~~~~~-~~~~~~i~~gd~s~~~~~~~~~~~~v~~~~~~~--~---~~~~~~~~r~d~~~~~~~ai~~~~~~~~~~ 386 (394) T protein:vir:10 313 YVVGDALLGS-AAGDQKAFVGDLKRGVLFADRQQVTLAWEDSKI--Y---GRYLGAAFRFGVKQADSNAGYFVTNTDAAS 386 (394) T ss_pred EEecccccCC-CCCceEEEEeeccccEEEEeecceEEEEecccc--c---ceeEEEEEEeccEEeccccEEEEEeecccC Confidence 9988766655 466778999999999999999999999887543 3 356899999999999999999998877654 Q ss_pred CCCCccCCCcc Q lcl|Aclame:pro 396 QVGNFKTTTST 406 (408) Q Consensus 396 ~~~~~~~~~~~ 406 (408) +++..+++ T Consensus 387 ---~~~~~~~~ 394 (394) T protein:vir:10 387 ---GSTSGTGK 394 (394) T ss_pred ---CCCCCCCC Confidence 33444444 No 20 >protein:vir:1383 Length: 421 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612835;genbank:gi:20065969;genbank:GeneID:935826 Probab=100.00 E-value=1e-69 Score=398.84 Aligned_cols=393 Identities=18% Similarity=0.209 Sum_probs=310.7 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccc---- Q lcl|Aclame:pro 1 MGVKLTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMRE---- 76 (408) Q Consensus 1 M~~~~~i~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---- 76 (408) ||+++.|++|++++.++.++++.+.++++....+.. .++..++.+++++++.+++.++++++..........+. 
T Consensus 1 Mn~~e~lkel~~~~~el~~~~~~~~~~~~~~~~e~~--~~e~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 78 (421) T protein:vir:13 1 MNLFERLKELRAKKKELEEKRCGIVEEIRSLAKEKK--EEEARSKALEREKIEARMEIIEEEIESVMTAIDEERKNTNFT 78 (421) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccc--hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccc Confidence 999999999999999999999999998877665543 35566666777777777776666555544332221111 Q ss_pred -ccccccccchhhhHHHHHHHHHHHhhcchhhHHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeec Q lcl|Aclame:pro 77 -EEKGPLNKSENELKDKFVKDFVNMVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESV 155 (408) Q Consensus 77 -~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~ 155 (408) ................+.++|.++++... ...+.++ ..++++||++||++++..|++.+++.++|+++|+++++ T Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~ra--~~t~~~gg~liP~~~~~~Ii~~~~~~~~l~~l~~~~~~ 153 (421) T protein:vir:13 79 GGRVIINGDSKEEKRSLQLSAMSKTIRGIQ---LSEEERD--IMSSTNNGAVIPQEFVNEFEKLKEGYPSLKEHCHVIPV 153 (421) T ss_pred ccccccccchhHHHHHHHHHHHHHhhhccc---hhHHHhh--ccccCCcceecchhhHHHHHHHHHhhhhhhhhceeeec Confidence 11122222233334556667777776542 2223333 34556789999999999999999999999999999999 Q ss_pred ccCccceEEeeccCCccccchhcccccccccccccceeeeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 156 STSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTR 235 (408) Q Consensus 156 ~~~~g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~ 235 (408) ++.++.++++..... 
..+.|++|++.+++ +.++|++|++++++++++++||+|+|+|+.++|++||.++|++++..++ T Consensus 154 ~~~~~~~~~~~~~~~-~~~~~~~E~~~~~~-s~~~f~~i~~~~~k~~~~v~iS~ell~ds~~~l~~~i~~~la~~~~~~~ 231 (421) T protein:vir:13 154 NRNAGKMPVRAGASV-DKLANLAKDTELVK-AMLKTQPMAYDIDDYGLLAPIDNSLLEDSEINFLEFVNEEFAEFAVNTE 231 (421) T ss_pred cCCceEEEEeecCCc-cceeeccccccccc-cccceeEEEeeeeeeEeehhhhHHHHhhhHHHHHHHHHHHHHHHHHHHh Confidence 988888888766543 44578999999986 5689999999999999999999999999999999999999999999999 Q ss_pred HHHHhhccccccchhhhhhHHHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHhhhcccCceeeccccccCCcccccccce Q lcl|Aclame:pro 236 NQAIIEVMKAAPKKPTIAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNSYLIKGKQV 315 (408) Q Consensus 236 ~~~~~~g~g~~~~~~~~~~~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~l~G~pv 315 (408) |..+++...+.....+..+++++++++.. +..+|+.+++|+|||++|.+|++|||++|+|+|++ +..+.+++|+|+|| T Consensus 232 ~~~i~~~~~g~~~~~~~~~~d~i~~~~~~-l~~~~~~~a~~v~n~~~~~~l~~lkd~~G~~i~~~-~~~~~~~tl~G~pV 309 (421) T protein:vir:13 232 NAEIVKQAKAVLAEETINDYAGLVKTINS-LVPNARKRAIIVTNSDGRAYLDGLMDKQGRPLLKE-LSDGGDLVFKGRPV 309 (421) T ss_pred hhhHhhhhhhccccccccchHHHHHHHHH-hhhhhcCCCEEEEcHHHHHHHHHhhcCCCceeecC-cCCCCCceecceee Confidence 99999887666666777889999998865 78889999999999999999999999999999965 77777889999999 Q ss_pred EeeccccccccccCcceEEEEehhcceEeeeccceEEEEeccchhhhhhceeeEEEEeeeCcEEecccceEEEEeecccc Q lcl|Aclame:pro 316 IVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFSAIAD 395 (408) Q Consensus 316 ~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~ 395 (408) +++++. |....+...++||||+++|++++|++++++++++. .|.+|++.||++.|+|+++++|+||..+.+...+. 
T Consensus 310 ~~~~~~--~~~~~~~~~~~~gd~~~~~~~~~~~~~~v~~~~~~--~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~a 385 (421) T protein:vir:13 310 IELEES--IFDVGDETKFIVSDFKTLIKFMDRKQYLIDQSKEA--GYTKNETIARIIERFDVNSPLDKSSDAEKIRKFGV 385 (421) T ss_pred EEeccc--cccCCCceEEEEEeccccEEEEEecceEEEeeccc--ccccCeeEEEEEeeecceeecchhhheeeecccce Confidence 998764 44445778899999999999999999999998875 49999999999999999999999998887765554 Q ss_pred CCCCccC-----CCcccC Q lcl|Aclame:pro 396 QVGNFKT-----TTSTAV 408 (408) Q Consensus 396 ~~~~~~~-----~~~~~~ 408 (408) -+..+.+ ++++++ T Consensus 386 ~v~~~~~~~~~~~~~~~~ 403 (421) T protein:vir:13 386 IVKLQEVLKSSPRSGKNK 403 (421) T ss_pred eeccccccCCCCcCCCCc Confidence 3322222 222332 No 21 >protein:vir:1084 Length: 437 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076738;genbank:gi:13095848;genbank:GeneID:920418 Probab=100.00 E-value=1e-69 Score=398.77 Aligned_cols=392 Identities=19% Similarity=0.236 Sum_probs=279.5 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccc---- Q lcl|Aclame:pro 1 MGVKLTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMRE---- 76 (408) Q Consensus 1 M~~~~~i~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---- 76 (408) |. |+||++++.++.+++.+..++++++..+.....+++++.+.+++++..+++++++++.+........... 
T Consensus 1 Mk----i~elk~el~~~~~el~~~~~elr~~~~~~~~~~~el~~~~~e~~~~~~ei~el~~~l~~~~~~~~~~~e~~~~~ 76 (437) T protein:vir:10 1 MK----IEKLKKDLATKTAELNTKKAEIRSFTESEDKTIDEVKAGMTEIKEKEDEIKEIRSNIEVLEQASALKVEEKRDD 76 (437) T ss_pred CC----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 44 5555555555555555555555555444444444444444444444444443333333221110000000 Q ss_pred ----------ccccccccchhh----hH-----------HHHHHHHHH--Hhh-------------cchhhHHHHHHHHh Q lcl|Aclame:pro 77 ----------EEKGPLNKSENE----LK-----------DKFVKDFVN--MVR-------------NPMAFMNTVSSKTE 116 (408) Q Consensus 77 ----------~~~~~~~~~~~~----~~-----------~~~~~a~~~--~~~-------------~~~~~~~~~~~~a~ 116 (408) ............ .. ....+.... ... .-.......+.++. T Consensus 77 ~~~~~~e~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~ 156 (437) T protein:vir:10 77 SDLVAPELEENSADNEEDDPEKLKTETKSEAEKDKKTVKDEEKRDAGGLQDMKLKVGGEIADKKVTAFADYLKTGEVRDV 156 (437) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHhHHHHHHHHHHHHhhhhhhHHHHHhhhhhhh Confidence 000000000000 00 000000000 000 00001122234566 Q ss_pred hccccccCceecchhhhhhhhhhhhhhhhhhhhhceeecccCccceEEeeccCCccccchhcccccccccccccceeeee Q lcl|Aclame:pro 117 TSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKY 196 (408) Q Consensus 117 ~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~ 196 (408) ..+++++||++||+++...|.. +++.++|+++++++++++.++.++++.. 
..+.++|++|++..++.+.++|++|++ T Consensus 157 ~~~~~~~~g~lvp~~~~~~i~~-~~~~~~l~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~e~~~~~e~~~~~~~~v~~ 233 (437) T protein:vir:10 157 TGIALKDGKVIIPETILTPEKE-VHQFPRLGSLVRTESVTTTTGKLPIFNN--STDLLTAHTEYGQTTKNATPVITPILW 233 (437) T ss_pred hhcccccccccchHHHHHHHHH-hhhhhhhhhcceeEeeccCceeeEEeec--cccccccccccccccccccccceeeee Confidence 6778889999999999877655 6788899999999999888777776643 345679999999999888899999999 Q ss_pred chheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhccccccchhhh-hhHHHHHHHHHHhhhhhccCCCE Q lcl|Aclame:pro 197 LIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVMKAAPKKPTI-AKFDDVITMINTAVDPAIIATSS 275 (408) Q Consensus 197 ~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~~g~g~~~~~~~~-~~~d~i~~~~~~~l~~~~~~~a~ 275 (408) .+++++++++||+|+|+|+.++|.+||.++|+++++.+++.+|++|+|++.+.... ..+++++++++..+++.|+.+++ T Consensus 234 ~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~~~~~~~~~~i~~g~g~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~ 313 (437) T protein:vir:10 234 DLKTYTGGYVFSQELISDSSYDWQAELQSRLIELRDNTDDSLIITALTDGIKKTTSTYLLGDLKKVLNVTLKPQDSAAAS 313 (437) T ss_pred ehhheeeehhhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccccchhhHHHHHHhhhhhhhhcCCE Confidence 99999999999999999999999999999999999999999999999988765443 45778888888889999999999 Q ss_pred EEEcHHHHHHHHhhhcccCceeeccccccCCcccccccceEeeccccccccccCcceEEEEehhcceEeeeccceEEEEe Q lcl|Aclame:pro 276 LLTNQSGLNKLALVKTAEGKYLLEPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPT 355 (408) Q Consensus 276 ~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~ 355 (408) |+|||++|..|++|||++|+|+|+++++++.+++|+|+||+++++..+|....++.+++||||+++|.+++|+++++.++ T Consensus 314 ~~~~~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~r~~~~~~~~ 393 (437) T protein:vir:10 314 IVMSQSAYNLFDMATDAMGRPLLQPNVTAATGYTLLGKTVVIVDDKLFPSASAGDVNIVVAPLKKAVINFKLTEITGQFQ 393 (437) T ss_pred EEEcHHHHHHHHHhhccCCCeeeccCccCCCCcccccceeEEecccccCCcCCCceEEEEeeccccEEEEeeeceEEEEe Confidence 
99999999999999999999999999999999999999999998888898888899999999999999999999999886 Q ss_pred ccchhhhhhceeeEEEEeeeCcEEecccceEEEEeeccccCCCCccCCCcccC Q lcl|Aclame:pro 356 NIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFSAIADQVGNFKTTTSTAV 408 (408) Q Consensus 356 ~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~~~~~~~~~~~ 408 (408) ++ |..+.+.+++..|+||++++|+||++|+++..+.++ +.+.+| T Consensus 394 ~~----~~~~~~~~~~~~r~d~~~~~~~a~~~l~~~~~~~~~-----~~~~~~ 437 (437) T protein:vir:10 394 DT----YDIWYKQLGIFLRQNVVQASKDLIVNLTGKLKAVTV-----VQSTAV 437 (437) T ss_pred cc----cccccceeeEEEEEccEEecccceEEEEeecccccc-----CCCCCC Confidence 53 566778999999999999999999999876433221 111222 No 22 >protein:vir:100247 Length: 425 # NCBI annotation: gp76 # Family: family:all:21 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355412;genbank:gi:77864702;genbank:GeneID:3725969 Probab=100.00 E-value=5.2e-69 Score=394.95 Aligned_cols=373 Identities=17% Similarity=0.229 Sum_probs=277.1 Q ss_pred CChHHHHHH---------------HHHHHHHHHHHHHHHHHHHHHHHhhhcc-cHHHHHHHH---------HHHHHHHHH Q lcl|Aclame:pro 1 MGVKLTVNQ---------------LNEAWIASGDKVTDFNDQINMALNDDNF-SAEAMSELK---------NKRDNEKVR 55 (408) Q Consensus 1 M~~~~~i~e---------------l~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~---------~~~~~~~~~ 55 (408) |++|..|-- +.+++.+..++++++.+++.....+-.. 
.++.+.+++ ++++.++.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~l~e~ra~~~~e~~~l~~~~~~~~~~~k~~~~~~~~~~~~~~~~~e~~~~~~~~~~e 80 (425) T protein:vir:10 1 MSKKLLIAVLTAALTGPVGAVPRGIISVRAEGPTEVKALIENLQKAFHDFKAEHTKQLDAVKAGLPTSDALAKVDKVSAD 80 (425) T ss_pred CchhHHHHhhHHHhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccHHHHHHHHHHHHH Confidence 666654422 2222222223333333332222111000 001111111 112222333 Q ss_pred HHHHHHHHHHHHHHHhhhcccccccccccchhhhHHHHHHHHHHHhhcchhhHHHHHHHHhhccccccCceecchhhhhh Q lcl|Aclame:pro 56 RDALREQLVEAQAEQVVNMREEEKGPLNKSENELKDKFVKDFVNMVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTM 135 (408) Q Consensus 56 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ 135 (408) ++.++..+++....... ..............+++++|..+++++. ..++++.+++++||++||+++++. T Consensus 81 i~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~af~~~l~~~e------~~~al~~~t~~~gG~lvP~~~~~~ 149 (425) T protein:vir:10 81 LEALQAAVDEANIKIAA-----AQMGANGVKPLRDPEYTEAFKAHVKRGD------VQAALNKGEDSEGGYLTPIEWDRT 149 (425) T ss_pred HHHHHHHHHHHHHHHHh-----hhcccccccccccHHHHHHHHHHhhhhh------hHHHhhcCcCCCCceeccHhHHHH Confidence 33333322222111111 1111122233344567888998887653 456778889999999999999999 Q ss_pred hhhhhhhhhhhhhhhceeecccCccceEEeeccCCccccchhcccccccccccccceeeeechheeeeehHHHHHHHhcc Q lcl|Aclame:pro 136 INTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDT 215 (408) Q Consensus 136 ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds 215 (408) |++.+++.++|+++|+++++++..+.+++ .. 
..+.+.|++|++.+|+++.++|++|++++++++++++||+|+++|+ T Consensus 150 ii~~~~~~s~l~~l~~~~~~~~~~~~~~~--~~-~~~~a~wv~E~~~~~~~~~~~f~~v~~~~~k~~~~i~iS~ell~ds 226 (425) T protein:vir:10 150 ITNKLVLISPMRQLCRVQPVSKAGFSKLF--NM-GGTTSGWVGEASQRPQTNAATFQPLSFASGEIYANPAATQQILDDA 226 (425) T ss_pred HHHHHHhhhhhhhhceeeeccCCceEEEE--Ec-CCcceeeeccccccccccccccceeeeeheeeEeehHhHHHHHhcc Confidence 99999999999999999998876655554 33 3456799999999998777899999999999999999999999999 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHhhccccccch---------------------------hhhhhHHHHHHHHHHhhhh Q lcl|Aclame:pro 216 AENILAWLSSWIAKKVVVTRNQAIIEVMKAAPKK---------------------------PTIAKFDDVITMINTAVDP 268 (408) Q Consensus 216 ~~~~~~~v~~~l~~~~~~~~~~~~~~g~g~~~~~---------------------------~~~~~~d~i~~~~~~~l~~ 268 (408) .++|++||.++|++++++++|.+|++|+|++.|. .+...++++++++. .+++ T Consensus 227 ~~~l~~~i~~~la~ai~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~l~~l~~-~l~~ 305 (425) T protein:vir:10 227 EIDLESWLATEVQTEFAKQEGKAFLAGDGTNKPNGLLTYIAGGANAAKHPFGAIEVVNSGAAADITSDGIIDLVY-DLPS 305 (425) T ss_pred hhHHHHHHHHHHHHHHHHHHHhhhhcccCCCCcceeeeccccccccccccccccccccccccccccHHHHHHHHh-hhhh Confidence 9999999999999999999999999999876541 12235778887765 5899 Q ss_pred hccCCCEEEEcHHHHHHHHhhhcccCceeeccccccCCcccccccceEeeccccccccccCcceEEEEehhcceEeeecc Q lcl|Aclame:pro 269 AIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRE 348 (408) Q Consensus 269 ~~~~~a~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~ 348 (408) .|+.+++|+|||++|.+|+++||++|+|+|++++..+.+++|+|+||+++++ +|..+.+..+++||||+++|++++|. 
T Consensus 306 ~~~~~a~~vmn~~~~~~L~~lkD~~G~~l~~~~~~~g~~~~l~G~PV~~~~~--~p~~~~~~~~i~~Gd~~~~~~i~~~~ 383 (425) T protein:vir:10 306 AFTGNARFAMNRNTQRQVRKLKDGQGNYLWQPSYVAGQPATLAGYPVTEVPD--MPDVAANSTPILFGDFQQTYLIIDRI 383 (425) T ss_pred hhccCCEEEEchHHHHHHHHhhcCCCceeeccCccCCCCceecceeeEEecC--cCCccCCccEEEEEehhccEEEEEec Confidence 9999999999999999999999999999999999999999999999998765 67777788889999999999999999 Q ss_pred ceEEEEeccchhhhhhceeeEEEEeeeCcEEecccceEEEEeeccc Q lcl|Aclame:pro 349 NMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFSAIA 394 (408) Q Consensus 349 ~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~ 394 (408) ++++..+++ |.+|++.||++.|+|+++++|+||+++++++.. T Consensus 384 ~~~v~~d~~----~~~~~~~~~~~~r~d~~v~~~~A~~~l~~~as~ 425 (425) T protein:vir:10 384 GVRVLRDPY----TAKPYVLFYTTKRVGGGLLNPEPMRAMKVAASE 425 (425) T ss_pred ceEEEeccc----ccCCcEEEEEEEEeccEeecccceEEEEeeccC Confidence 999877654 568999999999999999999999999999888 No 23 >protein:vir:9704 Length: 394 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795466;genbank:gi:28876225;genbank:GeneID:1257769 Probab=100.00 E-value=9.2e-68 Score=388.11 Aligned_cols=376 Identities=21% Similarity=0.263 Sum_probs=294.6 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccc Q lcl|Aclame:pro 1 MGVKLTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKG 80 (408) Q Consensus 1 M~~~~~i~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (408) |-.+ .|+||++++.++.+++.+..++.+.+..++ ..+++.+++.++++++++++++++++.+.+............. 
T Consensus 1 M~~~-~l~el~~~l~e~~~~i~~~~~e~~~~~~~~--~~~~~~~l~~eie~l~~ei~~l~~~~~~~e~~~e~~~~~~~~~ 77 (394) T protein:vir:97 1 MFEE-KIKEIKATIADLNNTIVTKTAQVKNALESD--DLEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAENIGG 77 (394) T ss_pred CcHH-HHHHHHHHHHHHHHHHHHHHHHHHHhhchh--hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccc Confidence 6555 488899999998888888887776665543 3456777888888888888888877776655433222222111 Q ss_pred ccccchhhhHHHHHHHHHHHhhcchhh------------------HHHHHHHHhhccccccCceecchhhhhhhhhhhhh Q lcl|Aclame:pro 81 PLNKSENELKDKFVKDFVNMVRNPMAF------------------MNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQ 142 (408) Q Consensus 81 ~~~~~~~~~~~~~~~a~~~~~~~~~~~------------------~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~ 142 (408) .... .....+++.+.++++..... ............+.++||++||+++++.|++.+++ T Consensus 78 ~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~liP~~~~~~ii~~~~~ 154 (394) T protein:vir:97 78 KEVT---QEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKT 154 (394) T ss_pred cccc---hhhHHHHHHHHHHHHHHHHHhhhhhhhhhHHHHHHHHHhhhhhhhhccccccccccccChHHHHHHHHHHhhh Confidence 1111 11222333333333321110 00011122234567789999999999999999999 Q ss_pred hhhhhhhhceeecccCccceEEeeccCCccccchhcccccccccccccceeeeechheeeeehHHHHHHHhcchHHHHHH Q lcl|Aclame:pro 143 YDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAENILAW 222 (408) Q Consensus 143 ~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~~~~~~~ 222 (408) .++|+++|++++++++++.+++... 
..+.++|++|++++|+++.++|++|++++++++++++||+||++|+.++|++| T Consensus 155 ~~~l~~~~~~~~~~~~~~~~~~~~~--~~~~~~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~i~is~ell~ds~~~~~~~ 232 (394) T protein:vir:97 155 VVDLKPFTTVYQAKKASGKYPVLQR--ATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGI 232 (394) T ss_pred hhhhhhhceeeeccCcceEEEEEec--CCCccceecccccccccccccceeEEeehhheeeehhhHHHHHhhhhHHHHHH Confidence 9999999999999888777776654 34566899999999987889999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHhhccccccchhhhhhHHHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHhhhcccCceeecccc Q lcl|Aclame:pro 223 LSSWIAKKVVVTRNQAIIEVMKAAPKKPTIAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDP 302 (408) Q Consensus 223 v~~~l~~~~~~~~~~~~~~g~g~~~~~~~~~~~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~lkd~~G~~~~~~~~ 302 (408) |.+.|++++++++|.+|++|.+++++. +...++++++++...+++.+ ++.|+|||++|..|++|||++|+|+|++++ T Consensus 233 i~~~la~~~~~~~~~~i~~g~~~~~~~-~~~~~~~~~~~~~~~~~~~~--~a~~v~n~~~~~~l~~lkd~~G~~i~~~~~ 309 (394) T protein:vir:97 233 VSESISQIKVNTTNDAIAKVLKSFTTK-TVKNLDEIKALLNGGFDPAY--NVSLIVSQSFYQTLDTLKDGNGRYLLQDDI 309 (394) T ss_pred HHHHHHHHHHHHHHHHHhhcccccccc-ccccHHHHHHHHHhhhhhhh--CCEEEEcHHHHHHHHHhhccCCCeeeecCc Confidence 999999999999999999999888765 55678999999887665544 678999999999999999999999999999 Q ss_pred ccCCcccccccceEeeccccccccccCcceEEEEehhcceEeeeccceEEEEeccchhhhhhceeeEEEEeeeCcEEecc Q lcl|Aclame:pro 303 TKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDS 382 (408) Q Consensus 303 ~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~ 382 (408) .++.+++|+|+||+++++. ..++.+++||||+++|.+++|++++++++++. 
.+...+|++.|+|+++.+| T Consensus 310 ~~~~~~~l~G~pv~~~~~~-----~~~~~~~~~gd~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~r~d~~v~~~ 379 (394) T protein:vir:97 310 TAVSGKVLLGKPVFVLSDE-----VLGANKAFIGDFKRGVLFADRKDLGLRWADNE-----IYGQYLQAVLRFGVSKVDD 379 (394) T ss_pred CCCCCceeccceeEEeccc-----ccCCccEEEeeccccEEEEEecceEEEEeccc-----ccceeEEEEEEEccEEecc Confidence 9999999999999987643 34456689999999999999999999987643 4456899999999999999 Q ss_pred cceEEEEeeccccCC Q lcl|Aclame:pro 383 EALVAGSFSAIADQV 397 (408) Q Consensus 383 ~a~~~l~~~~~~~~~ 397 (408) +||+++++++++.+. T Consensus 380 ~a~~~~~~~~~~~p~ 394 (394) T protein:vir:97 380 KAGYYVTFTPEPLPL 394 (394) T ss_pred cceEEEEecccccCC Confidence 999999998766655 No 24 >protein:vir:4511 Length: 409 # NCBI annotation: capsid # Family: family:all:21 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599037;genbank:gi:19548995;genbank:GeneID:935211 Probab=100.00 E-value=5.3e-68 Score=389.40 Aligned_cols=380 Identities=16% Similarity=0.216 Sum_probs=288.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccc Q lcl|Aclame:pro 5 LTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAE---AMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKGP 81 (408) Q Consensus 5 ~~i~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 81 (408) |+|+||++++.++.++++++.++.+ +...+++ ++++++.+++.++++++..++.....+.............. 
T Consensus 1 M~l~eL~e~r~~l~~e~~~l~~k~~----~~~~t~e~~~~~~~~~~e~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~~~ 76 (409) T protein:vir:45 1 MKLHELKQKRNTIATDMRALNEKIG----DNAWTEEQRTEWNKAKSELEALDERIAREEELRRQDQAYIESNEEEQRQNL 76 (409) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHhh----cCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhcccC Confidence 6666788888888888888776532 2222333 34455555666655555444333322222111111111111 Q ss_pred cccchhhhHHHHHHHHHHHhhcchhhHHH------HHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeec Q lcl|Aclame:pro 82 LNKSENELKDKFVKDFVNMVRNPMAFMNT------VSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESV 155 (408) Q Consensus 82 ~~~~~~~~~~~~~~a~~~~~~~~~~~~~~------~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~ 155 (408) .........+.+.++|.++++++...... .+.++..++++++||++||+++.++|++.+++.++|+++|+++++ T Consensus 77 ~~~~~~~~~~~~~~a~~~~l~~~~~~~~~~e~~~~~~~~a~~~~~~~~gg~liP~~~~~~ii~~~~~~~~l~~~~~~~~~ 156 (409) T protein:vir:45 77 DPENNSQQDEKRAQVFDKWMRHGASELTSEERKALRELRAQGVAQDEKGGYTVPETFLAKVVEKMKSYGGIASVAQILTT 156 (409) T ss_pred CCCCcchhhHHHHHHHHHHHHhhhhhccHHHHHHHHHHhhccCccCcCCceeccHhHHHHHHHHHHhhhhhhhhceeeec Confidence 22222233445667888888876433222 244566778888999999999999999999999999999999998 Q ss_pred ccCccceEEeeccCCccccchhcccccccccccccceeeeechheee-eehHHHHHHHhcchHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 156 STSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYA-GIITATNTSLKDTAENILAWLSSWIAKKVVVT 234 (408) Q Consensus 156 ~~~~g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~-~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~ 234 (408) ++.. 
.+.++...+....+.|++|++.+|++ .++|+++++.++|++ ++++||+|+++|+.++|++||.++|+++++.+ T Consensus 157 ~~~~-~~~~~~~~~~~~~~~~v~E~~~~~~~-~~~f~~~~l~~~k~~~~~i~is~ell~ds~~~l~~~i~~~la~a~~~~ 234 (409) T protein:vir:45 157 SDGR-TMEWATADGTSEVGVLLGENEEAGEE-DTDFGMGSLGALKMTSKIIRVSNELLQDSAIDMEAYLARRIAERIGRG 234 (409) T ss_pred CCCc-eEEEEeeccCcccccccccccccccc-ccccceeeeeeeeeeeeehhhhHHHHhccHHHHHHHHHHHHHHHHHHH Confidence 6543 44555555555667899999999974 578999999999986 67999999999999999999999999999999 Q ss_pred HHHHHhhccccccc-----------------hhhhhhHHHHHHHHHHhhhhhccCCCEE--EEcHHHHHHHHhhhcccCc Q lcl|Aclame:pro 235 RNQAIIEVMKAAPK-----------------KPTIAKFDDVITMINTAVDPAIIATSSL--LTNQSGLNKLALVKTAEGK 295 (408) Q Consensus 235 ~~~~~~~g~g~~~~-----------------~~~~~~~d~i~~~~~~~l~~~~~~~a~~--~~n~~~~~~l~~lkd~~G~ 295 (408) ++.+|++|+|++.+ ..+..+++++++++. .+++.|+.++.| +||+++|++|++|||++|+ T Consensus 235 ~~~a~l~G~G~~~~~~p~Gil~~~~~~~~~~~~~~~~~d~i~~l~~-~l~~~~~~~a~~~~~~n~~~~~~l~~lkd~~G~ 313 (409) T protein:vir:45 235 EARYLIQGTGAGTPKQPKGLAASVTGTTQTAAANAVKWQEILALKH-SIDPAYRRGPKFRLAFNDNTLKLISEMEDGQGR 313 (409) T ss_pred HHHHhhccCCCCCccccceeeeccccccccccccccchHHHHHHHH-hhhhhhccCCeEEEEECHHHHHHHHHhhcCCCc Confidence 99999999987522 122345788888775 588999988876 6799999999999999999 Q ss_pred eeeccccccCCcccccccceEeeccccccccccCcceEEEEehhcceEeeeccceEEEEeccchhhhhhceeeEEEEeee Q lcl|Aclame:pro 296 YLLEPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRF 375 (408) Q Consensus 296 ~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~ 375 (408) |+|++++.++.+.+|+|+||+++++ +|..+.+..+++||||++|+ +..+++++++++.+.+ |.+|++.||++.|+ T Consensus 314 ~i~~~~~~~~~~~~l~G~PV~~~~~--~p~~~~~~~~i~~Gd~~~~~-i~~~~~~~~~~~~d~~--~~~~~~~~~~~~r~ 388 (409) T protein:vir:45 314 PLWLPDIVGVAPASVLNVPYVIDQE--IDDIGAGKKFMFCGDFDRFI-IRRVRYMILKRLVERY--AEYDQTGFLAFHRF 388 (409) T ss_pred eeeccCcCCCCCceecceeeEEecC--cCCccCCccEEEEeehhhhh-eeeccceEEEEeeccc--ccCCcEEEEEEEEe Confidence 9999999999999999999998654 67667777889999999865 
5678999998876554 78899999999999 Q ss_pred CcEEecccceEEEEeeccccC Q lcl|Aclame:pro 376 DVKATDSEALVAGSFSAIADQ 396 (408) Q Consensus 376 d~~v~~~~a~~~l~~~~~~~~ 396 (408) |+++++|+||+++++++.+.. T Consensus 389 d~~~~~~~A~~~l~~k~s~~~ 409 (409) T protein:vir:45 389 DCILEDTSAIKALVGKGSVGG 409 (409) T ss_pred ccEeechhheEEEEeccCCCC Confidence 999999999999999887666 No 25 >protein:vir:4700 Length: 415 # NCBI annotation: phi PVL ORF 7 homologue # Family: family:all:21 # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061632;genbank:gi:9635719;genbank:GeneID:1262976 Probab=100.00 E-value=2.7e-67 Score=385.58 Aligned_cols=383 Identities=20% Similarity=0.258 Sum_probs=292.6 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccc Q lcl|Aclame:pro 1 MGVKLTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKG 80 (408) Q Consensus 1 M~~~~~i~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (408) |.. ++||++++.++++++.+..++++.+..+++ .++.+++.+++++++++++.+++++++.+............. T Consensus 1 mk~---~~em~~~l~el~~~~~~~~~e~~~~~~~~~--~e~~~~~~~ev~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 75 (415) T protein:vir:47 1 MKT---KEELQSEISDIKRQIDLKVKYATRALNNDE--LEKAEKLEQEITDLRSQIQEKQEELDKLKEKDRTSENNQQSV 75 (415) T ss_pred Cch---HHHHHHHHHHHHHHHHHHHHHHHHHhchhh--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccc Confidence 443 466777777777777766666666555443 234556666666666666666555554443322211111111 Q ss_pred ccccchhh-----------------hHHHHHHHHHHHhhcchhhHHHHHHHHhhccccccCceecchhhhhhhhhhhhhh Q lcl|Aclame:pro 81 PLNKSENE-----------------LKDKFVKDFVNMVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQY 143 (408) Q Consensus 81 ~~~~~~~~-----------------~~~~~~~a~~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~ 143 (408) ........ ....++++|..+.+.+ ........++++||++||+++.+.|++.+++. 
T Consensus 76 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~t~~g~~~iP~~~~~~ii~~~~~~ 148 (415) T protein:vir:47 76 EVNEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETR-------NDIQGGSLKTDSGFVVIPEEIVTDILKLKEVE 148 (415) T ss_pred ccchhhhhHHHHHHHHHHHhhhhhhhhHHHHHHHHHHHhhh-------hhhhhccccccCCcccccHHHHHHHHHHHHhh Confidence 11111111 1111222222222221 11122333456789999999999999999999 Q ss_pred hhhhhhhceeecccCccceEEeeccCCccccchhcccccccccccccceeeeechheeeeehHHHHHHHhcchHHHHHHH Q lcl|Aclame:pro 144 DSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWL 223 (408) Q Consensus 144 ~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~~~~~~~v 223 (408) ++|+++|+++++++..++++++...+. +.++|++|++++|+++.++|++|++++++++++++||+|+++|+.++|++|| T Consensus 149 ~~l~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~v~Eg~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i 227 (415) T protein:vir:47 149 FNLDKYVTVKRVTNGSGKYPVVRQSEV-AALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQEL 227 (415) T ss_pred hhhhhhcceeeccCCceeEEEEEecCC-cceeecccccccccccccceeeEEeeeeeeEeeehhhHHHHhhchHHHHHHH Confidence 999999999999999899988876544 5678999999999888889999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHhhccccccc---------------hhhhhhHHHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHh Q lcl|Aclame:pro 224 SSWIAKKVVVTRNQAIIEVMKAAPK---------------KPTIAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLAL 288 (408) Q Consensus 224 ~~~l~~~~~~~~~~~~~~g~g~~~~---------------~~~~~~~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~ 288 (408) .+.|++++++++|.+|++|+|++.+ ..+...++++++++.. 
+...+..++.|+|||++|..|++ T Consensus 228 ~~~l~~~i~~~~d~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~-~~~~~~~~~~~v~n~~~~~~L~~ 306 (415) T protein:vir:47 228 KLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINL-NVKPNYEHNVAIVSQTMFAKLDK 306 (415) T ss_pred HHHHHHHHHHHHHHHHhhccccCCccccccccccccceeccccccchHHHHHHHHh-hhhhccCCCEEEEcHHHHHHHHH Confidence 9999999999999999999987543 2233467888888765 55677788999999999999999 Q ss_pred hhcccCceeeccccccCCcccccccceEeeccccccccccCcceEEEEehhcceEeeeccceEEEEeccchhhhhhceee Q lcl|Aclame:pro 289 VKTAEGKYLLEPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTK 368 (408) Q Consensus 289 lkd~~G~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~ 368 (408) +||++|+|+|++++.++.+++|+|+||+++++. |....++..++||||+++|.+++|+++++++++ |.++++. T Consensus 307 lkd~~G~~i~~~~~~~~~~~~l~G~pV~~~~~~--~~~~~~~~~~~~gd~~~~~~~~~~~~~~v~~~~-----~~~~~~~ 379 (415) T protein:vir:47 307 MKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDE--VLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTD-----YMHFGEC 379 (415) T ss_pred hhccCCCeeeccCcCCCCCccccceeeEEeccc--cccCCCccEEEEEehhccEEEEeecceEEEeec-----cccCceE Confidence 999999999999999999999999999998764 444456678999999999999999999999875 5677889 Q ss_pred EEEEeeeCcEEecccceEEEEeeccccCCCCccCCC Q lcl|Aclame:pro 369 IRVIDRFDVKATDSEALVAGSFSAIADQVGNFKTTT 404 (408) Q Consensus 369 ~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~~~~~~~ 404 (408) +|+++|+|+++++|+||+++++++++..+|.-.--+ T Consensus 380 ~~~~~r~d~~v~~~~a~~~~~~~~~~~~~~~~~~~~ 415 (415) T protein:vir:47 380 LMIAVRQDCRILDYKSAIVIEYDDSERGEGDLGLEA 415 (415) T ss_pred EEEEEEeccEEeccccEEEEEeeccCCCCCCccCCC Confidence 999999999999999999999999999888877666 No 26 >protein:vir:4600 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058445;genbank:gi:9635171;genbank:GeneID:1262708 Probab=100.00 E-value=2.7e-67 Score=385.58 Aligned_cols=383 Identities=20% Similarity=0.258 Sum_probs=292.6 Q ss_pred 
CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccc Q lcl|Aclame:pro 1 MGVKLTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKG 80 (408) Q Consensus 1 M~~~~~i~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (408) |.. ++||++++.++++++.+..++++.+..+++ .++.+++.+++++++++++.+++++++.+............. T Consensus 1 mk~---~~em~~~l~el~~~~~~~~~e~~~~~~~~~--~e~~~~~~~ev~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 75 (415) T protein:vir:46 1 MKT---KEELQSEISDIKRQIDLKVKYATRALNNDE--LEKAEKLEQEITDLRSQIQEKQEELDKLKEKDRTSENNQQSV 75 (415) T ss_pred Cch---HHHHHHHHHHHHHHHHHHHHHHHHHhchhh--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccc Confidence 443 466777777777777766666666555443 234556666666666666666555554443322211111111 Q ss_pred ccccchhh-----------------hHHHHHHHHHHHhhcchhhHHHHHHHHhhccccccCceecchhhhhhhhhhhhhh Q lcl|Aclame:pro 81 PLNKSENE-----------------LKDKFVKDFVNMVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQY 143 (408) Q Consensus 81 ~~~~~~~~-----------------~~~~~~~a~~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~ 143 (408) ........ ....++++|..+.+.+ ........++++||++||+++.+.|++.+++. T Consensus 76 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~t~~g~~~iP~~~~~~ii~~~~~~ 148 (415) T protein:vir:46 76 EVNEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETR-------NDIQGGSLKTDSGFVVIPEEIVTDILKLKEVE 148 (415) T ss_pred ccchhhhhHHHHHHHHHHHhhhhhhhhHHHHHHHHHHHhhh-------hhhhhccccccCCcccccHHHHHHHHHHHHhh Confidence 11111111 1111222222222221 11122333456789999999999999999999 Q ss_pred hhhhhhhceeecccCccceEEeeccCCccccchhcccccccccccccceeeeechheeeeehHHHHHHHhcchHHHHHHH Q lcl|Aclame:pro 144 DSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWL 223 (408) Q Consensus 144 ~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~~~~~~~v 223 (408) ++|+++|+++++++..++++++...+. 
+.++|++|++++|+++.++|++|++++++++++++||+|+++|+.++|++|| T Consensus 149 ~~l~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~v~Eg~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i 227 (415) T protein:vir:46 149 FNLDKYVTVKRVTNGSGKYPVVRQSEV-AALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQEL 227 (415) T ss_pred hhhhhhcceeeccCCceeEEEEEecCC-cceeecccccccccccccceeeEEeeeeeeEeeehhhHHHHhhchHHHHHHH Confidence 999999999999999899988876544 5678999999999888889999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHhhccccccc---------------hhhhhhHHHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHh Q lcl|Aclame:pro 224 SSWIAKKVVVTRNQAIIEVMKAAPK---------------KPTIAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLAL 288 (408) Q Consensus 224 ~~~l~~~~~~~~~~~~~~g~g~~~~---------------~~~~~~~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~ 288 (408) .+.|++++++++|.+|++|+|++.+ ..+...++++++++.. +...+..++.|+|||++|..|++ T Consensus 228 ~~~l~~~i~~~~d~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~-~~~~~~~~~~~v~n~~~~~~L~~ 306 (415) T protein:vir:46 228 KLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINL-NVKPNYEHNVAIVSQTMFAKLDK 306 (415) T ss_pred HHHHHHHHHHHHHHHHhhccccCCccccccccccccceeccccccchHHHHHHHHh-hhhhccCCCEEEEcHHHHHHHHH Confidence 9999999999999999999987543 2233467888888765 55677788999999999999999 Q ss_pred hhcccCceeeccccccCCcccccccceEeeccccccccccCcceEEEEehhcceEeeeccceEEEEeccchhhhhhceee Q lcl|Aclame:pro 289 VKTAEGKYLLEPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTK 368 (408) Q Consensus 289 lkd~~G~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~ 368 (408) +||++|+|+|++++.++.+++|+|+||+++++. |....++..++||||+++|.+++|+++++++++ |.++++. 
T Consensus 307 lkd~~G~~i~~~~~~~~~~~~l~G~pV~~~~~~--~~~~~~~~~~~~gd~~~~~~~~~~~~~~v~~~~-----~~~~~~~ 379 (415) T protein:vir:46 307 MKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDE--VLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTD-----YMHFGEC 379 (415) T ss_pred hhccCCCeeeccCcCCCCCccccceeeEEeccc--cccCCCccEEEEEehhccEEEEeecceEEEeec-----cccCceE Confidence 999999999999999999999999999998764 444456678999999999999999999999875 5677889 Q ss_pred EEEEeeeCcEEecccceEEEEeeccccCCCCccCCC Q lcl|Aclame:pro 369 IRVIDRFDVKATDSEALVAGSFSAIADQVGNFKTTT 404 (408) Q Consensus 369 ~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~~~~~~~ 404 (408) +|+++|+|+++++|+||+++++++++..+|.-.--+ T Consensus 380 ~~~~~r~d~~v~~~~a~~~~~~~~~~~~~~~~~~~~ 415 (415) T protein:vir:46 380 LMIAVRQDCRILDYKSAIVIEYDDSERGEGDLGLEA 415 (415) T ss_pred EEEEEEeccEEeccccEEEEEeeccCCCCCCccCCC Confidence 999999999999999999999999999888877666 No 27 >protein:vir:81100 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429874;genbank:gi:156603927;genbank:GeneID:5525320 Probab=100.00 E-value=3.4e-67 Score=385.02 Aligned_cols=383 Identities=20% Similarity=0.254 Sum_probs=292.2 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccc Q lcl|Aclame:pro 1 MGVKLTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKG 80 (408) Q Consensus 1 M~~~~~i~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (408) |+. ++||++++.++++++.+..++.+.++.++.. ++..++..++++++++++.+++.+.++.............. 
T Consensus 1 mk~---~~el~~~l~el~~~~~~~~~e~~~~l~~~~~--~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 75 (415) T protein:vir:81 1 MKT---KEELQSEISDIKRQIDLKVKYATRALNNDEL--EKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSV 75 (415) T ss_pred Cch---HHHHHHHHHHHHHHHHHHHHHHHHHhchHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccc Confidence 554 4566777777777777776666665544332 34555666666666666666555554433222211111111 Q ss_pred ccccchhhh-----------------HHHHHHHHHHHhhcchhhHHHHHHHHhhccccccCceecchhhhhhhhhhhhhh Q lcl|Aclame:pro 81 PLNKSENEL-----------------KDKFVKDFVNMVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQY 143 (408) Q Consensus 81 ~~~~~~~~~-----------------~~~~~~a~~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~ 143 (408) ......... ...++++|.++++.+. .......++++||++||+++.+.|++.+++. T Consensus 76 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~gg~~iP~~~~~~ii~~~~~~ 148 (415) T protein:vir:81 76 EVNEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRN-------DIQGGSLKTDSGFVVIPEEIVTDILKLKEVE 148 (415) T ss_pred ccchhhhHHHHHHHHHHhhhhhhhhhHHHHHHHHHHHHhhhh-------hhhhccccccccccccchHHHHHHHHHHHhh Confidence 111111111 1112222222222211 1111233456789999999999999999999 Q ss_pred hhhhhhhceeecccCccceEEeeccCCccccchhcccccccccccccceeeeechheeeeehHHHHHHHhcchHHHHHHH Q lcl|Aclame:pro 144 DSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWL 223 (408) Q Consensus 144 ~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~~~~~~~v 223 (408) ++|+++|+++++++.+++++++...+. 
+.++|++|++++|+.+.++|++|++++++++++++||+||++|+.++|++|| T Consensus 149 ~~l~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i 227 (415) T protein:vir:81 149 FNLDKYVTVKRVTNGSGKYPVVRQSEV-AALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQEL 227 (415) T ss_pred hhhhhheeeeeccCCceeEEEEeecCC-ccceeeccccccCcccccceeeEEeeeeeeEeeehhhHHHHhhchHHHHHHH Confidence 999999999999999999999876544 5678999999999878889999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHhhccccccch---------------hhhhhHHHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHh Q lcl|Aclame:pro 224 SSWIAKKVVVTRNQAIIEVMKAAPKK---------------PTIAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLAL 288 (408) Q Consensus 224 ~~~l~~~~~~~~~~~~~~g~g~~~~~---------------~~~~~~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~ 288 (408) .+.|+++++++++.++++|+|++.+. .+...++++++++.. +...+..+++|+|||++|..|++ T Consensus 228 ~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~-~~~~~~~~~~~v~n~~~~~~l~~ 306 (415) T protein:vir:81 228 KLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINL-NVKPNYEHNVAIVSQTMFAKLDK 306 (415) T ss_pred HHHHHHHHHHHHHHHHhhccccCccccccccccccccccccccccchhHHHHHHHh-hhhhccCCCEEEEcHHHHHHHHH Confidence 99999999999999999999876432 234568889888855 66778889999999999999999 Q ss_pred hhcccCceeeccccccCCcccccccceEeeccccccccccCcceEEEEehhcceEeeeccceEEEEeccchhhhhhceee Q lcl|Aclame:pro 289 VKTAEGKYLLEPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTK 368 (408) Q Consensus 289 lkd~~G~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~ 368 (408) +||++|+|+|.+++.++.+++|+|+||+++++. |....++.+++||||+++|++++|.++++++++ |..+++. 
T Consensus 307 lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~--~~~~~~~~~~~~Gd~~~~~~~~~~~~~~v~~~~-----~~~~~~~ 379 (415) T protein:vir:81 307 MKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDE--VLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTD-----YMHFGEC 379 (415) T ss_pred hhccCCceeeccCcCCCCCceecceeeEEeccc--ccCCCCccEEEEEehhccEEEEeecceEEEEec-----cccCceE Confidence 999999999999999999999999999998764 444456778999999999989999999999875 4567788 Q ss_pred EEEEeeeCcEEecccceEEEEeeccccCCCCccCCC Q lcl|Aclame:pro 369 IRVIDRFDVKATDSEALVAGSFSAIADQVGNFKTTT 404 (408) Q Consensus 369 ~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~~~~~~~ 404 (408) +|+++|+|+++++|+||+++++++++...|.-.--+ T Consensus 380 ~~~~~r~d~~v~~~~a~~~~~~~~~~~~~~~~~~~~ 415 (415) T protein:vir:81 380 LMIAVRQDCRILDYKSAIVIEYDDSERGEGDLGLEA 415 (415) T ss_pred EEEEEEeccEEeccccEEEEEEeccCCCCCccccCC Confidence 999999999999999999999999999888877666 No 28 >protein:vir:98339 Length: 415 # NCBI annotation: putative capsid protein # Family: family:all:21 # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918931;genbank:gi:119443693;genbank:GeneID:4594501 Probab=100.00 E-value=3.4e-67 Score=385.02 Aligned_cols=383 Identities=20% Similarity=0.254 Sum_probs=292.2 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccc Q lcl|Aclame:pro 1 MGVKLTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKG 80 (408) Q Consensus 1 M~~~~~i~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (408) |+. ++||++++.++++++.+..++.+.++.++.. ++..++..++++++++++.+++.+.++.............. 
T Consensus 1 mk~---~~el~~~l~el~~~~~~~~~e~~~~l~~~~~--~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 75 (415) T protein:vir:98 1 MKT---KEELQSEISDIKRQIDLKVKYATRALNNDEL--EKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSV 75 (415) T ss_pred Cch---HHHHHHHHHHHHHHHHHHHHHHHHHhchHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccc Confidence 554 4566777777777777776666665544332 34555666666666666666555554433222211111111 Q ss_pred ccccchhhh-----------------HHHHHHHHHHHhhcchhhHHHHHHHHhhccccccCceecchhhhhhhhhhhhhh Q lcl|Aclame:pro 81 PLNKSENEL-----------------KDKFVKDFVNMVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQY 143 (408) Q Consensus 81 ~~~~~~~~~-----------------~~~~~~a~~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~ 143 (408) ......... ...++++|.++++.+. .......++++||++||+++.+.|++.+++. T Consensus 76 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~gg~~iP~~~~~~ii~~~~~~ 148 (415) T protein:vir:98 76 EVNEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRN-------DIQGGSLKTDSGFVVIPEEIVTDILKLKEVE 148 (415) T ss_pred ccchhhhHHHHHHHHHHhhhhhhhhhHHHHHHHHHHHHhhhh-------hhhhccccccccccccchHHHHHHHHHHHhh Confidence 111111111 1112222222222211 1111233456789999999999999999999 Q ss_pred hhhhhhhceeecccCccceEEeeccCCccccchhcccccccccccccceeeeechheeeeehHHHHHHHhcchHHHHHHH Q lcl|Aclame:pro 144 DSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWL 223 (408) Q Consensus 144 ~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~~~~~~~v 223 (408) ++|+++|+++++++.+++++++...+. 
+.++|++|++++|+.+.++|++|++++++++++++||+||++|+.++|++|| T Consensus 149 ~~l~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i 227 (415) T protein:vir:98 149 FNLDKYVTVKRVTNGSGKYPVVRQSEV-AALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQEL 227 (415) T ss_pred hhhhhheeeeeccCCceeEEEEeecCC-ccceeeccccccCcccccceeeEEeeeeeeEeeehhhHHHHhhchHHHHHHH Confidence 999999999999999999999876544 5678999999999878889999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHhhccccccch---------------hhhhhHHHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHh Q lcl|Aclame:pro 224 SSWIAKKVVVTRNQAIIEVMKAAPKK---------------PTIAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLAL 288 (408) Q Consensus 224 ~~~l~~~~~~~~~~~~~~g~g~~~~~---------------~~~~~~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~ 288 (408) .+.|+++++++++.++++|+|++.+. .+...++++++++.. +...+..+++|+|||++|..|++ T Consensus 228 ~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~-~~~~~~~~~~~v~n~~~~~~l~~ 306 (415) T protein:vir:98 228 KLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINL-NVKPNYEHNVAIVSQTMFAKLDK 306 (415) T ss_pred HHHHHHHHHHHHHHHHhhccccCccccccccccccccccccccccchhHHHHHHHh-hhhhccCCCEEEEcHHHHHHHHH Confidence 99999999999999999999876432 234568889888855 66778889999999999999999 Q ss_pred hhcccCceeeccccccCCcccccccceEeeccccccccccCcceEEEEehhcceEeeeccceEEEEeccchhhhhhceee Q lcl|Aclame:pro 289 VKTAEGKYLLEPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTK 368 (408) Q Consensus 289 lkd~~G~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~ 368 (408) +||++|+|+|.+++.++.+++|+|+||+++++. |....++.+++||||+++|++++|.++++++++ |..+++. 
T Consensus 307 lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~--~~~~~~~~~~~~Gd~~~~~~~~~~~~~~v~~~~-----~~~~~~~ 379 (415) T protein:vir:98 307 MKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDE--VLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTD-----YMHFGEC 379 (415) T ss_pred hhccCCceeeccCcCCCCCceecceeeEEeccc--ccCCCCccEEEEEehhccEEEEeecceEEEEec-----cccCceE Confidence 999999999999999999999999999998764 444456778999999999989999999999875 4567788 Q ss_pred EEEEeeeCcEEecccceEEEEeeccccCCCCccCCC Q lcl|Aclame:pro 369 IRVIDRFDVKATDSEALVAGSFSAIADQVGNFKTTT 404 (408) Q Consensus 369 ~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~~~~~~~ 404 (408) +|+++|+|+++++|+||+++++++++...|.-.--+ T Consensus 380 ~~~~~r~d~~v~~~~a~~~~~~~~~~~~~~~~~~~~ 415 (415) T protein:vir:98 380 LMIAVRQDCRILDYKSAIVIEYDDSERGEGDLGLEA 415 (415) T ss_pred EEEEEEeccEEeccccEEEEEEeccCCCCCccccCC Confidence 999999999999999999999999999888877666 No 29 >protein:vir:79987 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430002;genbank:gi:156604057;genbank:GeneID:5525447 Probab=100.00 E-value=3.4e-67 Score=385.02 Aligned_cols=383 Identities=20% Similarity=0.254 Sum_probs=292.2 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccc Q lcl|Aclame:pro 1 MGVKLTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKG 80 (408) Q Consensus 1 M~~~~~i~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (408) |+. ++||++++.++++++.+..++.+.++.++.. ++..++..++++++++++.+++.+.++.............. 
T Consensus 1 mk~---~~el~~~l~el~~~~~~~~~e~~~~l~~~~~--~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 75 (415) T protein:vir:79 1 MKT---KEELQSEISDIKRQIDLKVKYATRALNNDEL--EKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSV 75 (415) T ss_pred Cch---HHHHHHHHHHHHHHHHHHHHHHHHHhchHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccc Confidence 554 4566777777777777776666665544332 34555666666666666666555554433222211111111 Q ss_pred ccccchhhh-----------------HHHHHHHHHHHhhcchhhHHHHHHHHhhccccccCceecchhhhhhhhhhhhhh Q lcl|Aclame:pro 81 PLNKSENEL-----------------KDKFVKDFVNMVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQY 143 (408) Q Consensus 81 ~~~~~~~~~-----------------~~~~~~a~~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~ 143 (408) ......... ...++++|.++++.+. .......++++||++||+++.+.|++.+++. T Consensus 76 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~gg~~iP~~~~~~ii~~~~~~ 148 (415) T protein:vir:79 76 EVNEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRN-------DIQGGSLKTDSGFVVIPEEIVTDILKLKEVE 148 (415) T ss_pred ccchhhhHHHHHHHHHHhhhhhhhhhHHHHHHHHHHHHhhhh-------hhhhccccccccccccchHHHHHHHHHHHhh Confidence 111111111 1112222222222211 1111233456789999999999999999999 Q ss_pred hhhhhhhceeecccCccceEEeeccCCccccchhcccccccccccccceeeeechheeeeehHHHHHHHhcchHHHHHHH Q lcl|Aclame:pro 144 DSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWL 223 (408) Q Consensus 144 ~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~~~~~~~v 223 (408) ++|+++|+++++++.+++++++...+. 
+.++|++|++++|+.+.++|++|++++++++++++||+||++|+.++|++|| T Consensus 149 ~~l~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i 227 (415) T protein:vir:79 149 FNLDKYVTVKRVTNGSGKYPVVRQSEV-AALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQEL 227 (415) T ss_pred hhhhhheeeeeccCCceeEEEEeecCC-ccceeeccccccCcccccceeeEEeeeeeeEeeehhhHHHHhhchHHHHHHH Confidence 999999999999999999999876544 5678999999999878889999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHhhccccccch---------------hhhhhHHHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHh Q lcl|Aclame:pro 224 SSWIAKKVVVTRNQAIIEVMKAAPKK---------------PTIAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLAL 288 (408) Q Consensus 224 ~~~l~~~~~~~~~~~~~~g~g~~~~~---------------~~~~~~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~ 288 (408) .+.|+++++++++.++++|+|++.+. .+...++++++++.. +...+..+++|+|||++|..|++ T Consensus 228 ~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~-~~~~~~~~~~~v~n~~~~~~l~~ 306 (415) T protein:vir:79 228 KLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINL-NVKPNYEHNVAIVSQTMFAKLDK 306 (415) T ss_pred HHHHHHHHHHHHHHHHhhccccCccccccccccccccccccccccchhHHHHHHHh-hhhhccCCCEEEEcHHHHHHHHH Confidence 99999999999999999999876432 234568889888855 66778889999999999999999 Q ss_pred hhcccCceeeccccccCCcccccccceEeeccccccccccCcceEEEEehhcceEeeeccceEEEEeccchhhhhhceee Q lcl|Aclame:pro 289 VKTAEGKYLLEPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTK 368 (408) Q Consensus 289 lkd~~G~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~ 368 (408) +||++|+|+|.+++.++.+++|+|+||+++++. |....++.+++||||+++|++++|.++++++++ |..+++. 
T Consensus 307 lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~--~~~~~~~~~~~~Gd~~~~~~~~~~~~~~v~~~~-----~~~~~~~ 379 (415) T protein:vir:79 307 MKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDE--VLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTD-----YMHFGEC 379 (415) T ss_pred hhccCCceeeccCcCCCCCceecceeeEEeccc--ccCCCCccEEEEEehhccEEEEeecceEEEEec-----cccCceE Confidence 999999999999999999999999999998764 444456778999999999989999999999875 4567788 Q ss_pred EEEEeeeCcEEecccceEEEEeeccccCCCCccCCC Q lcl|Aclame:pro 369 IRVIDRFDVKATDSEALVAGSFSAIADQVGNFKTTT 404 (408) Q Consensus 369 ~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~~~~~~~ 404 (408) +|+++|+|+++++|+||+++++++++...|.-.--+ T Consensus 380 ~~~~~r~d~~v~~~~a~~~~~~~~~~~~~~~~~~~~ 415 (415) T protein:vir:79 380 LMIAVRQDCRILDYKSAIVIEYDDSERGEGDLGLEA 415 (415) T ss_pred EEEEEEeccEEeccccEEEEEEeccCCCCCccccCC Confidence 999999999999999999999999999888877666 No 30 >protein:vir:9410 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803388;genbank:gi:29028700;genbank:GeneID:1258136 Probab=100.00 E-value=1e-66 Score=382.43 Aligned_cols=383 Identities=20% Similarity=0.255 Sum_probs=292.5 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccc Q lcl|Aclame:pro 1 MGVKLTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKG 80 (408) Q Consensus 1 M~~~~~i~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (408) |.. +++|++++.++.+++.+..++.+.++.++. .++.+++.+++++++++++.+++.+++............... 
T Consensus 1 mk~---~~el~~~l~el~~~~~~~~~~~~~~~~~~~--~e~~~~~~~ei~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 75 (415) T protein:vir:94 1 MKT---KEELQSEISDIKRQIDLKVKYATRALNNDE--LEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSV 75 (415) T ss_pred CCh---HHHHHHHHHHHHHHHHHHHHHHHHHhchhH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccc Confidence 544 455666777776666666666655554433 244556666666666666665555444433222211111111 Q ss_pred ccccchhh-----------------hHHHHHHHHHHHhhcchhhHHHHHHHHhhccccccCceecchhhhhhhhhhhhhh Q lcl|Aclame:pro 81 PLNKSENE-----------------LKDKFVKDFVNMVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQY 143 (408) Q Consensus 81 ~~~~~~~~-----------------~~~~~~~a~~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~ 143 (408) ........ ....++++|.++++... .......+.++||++||+++.+.|++.+++. T Consensus 76 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~-------~~~~~~~~~~~g~~~iP~~~~~~ii~~~~~~ 148 (415) T protein:vir:94 76 EVNEASTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRN-------DIQGGSLKTDSGFVVIPEEIVTDILKLKEVE 148 (415) T ss_pred cccchhhHHHHHHHHHHHhhhhhhhhhHHHHHHHHHHhhhhh-------hhhhhccccccccccCcHHHHHHHHHHHHhh Confidence 11111111 11122333333332211 1122334466789999999999999999999 Q ss_pred hhhhhhhceeecccCccceEEeeccCCccccchhcccccccccccccceeeeechheeeeehHHHHHHHhcchHHHHHHH Q lcl|Aclame:pro 144 DSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWL 223 (408) Q Consensus 144 ~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~~~~~~~v 223 (408) ++|+++|+++++++.+++++++...+. 
+.+.|++|++++|+.+.++|++|++++++++++++||+|+++|+.++|++|| T Consensus 149 ~~l~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~v~Eg~~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~ds~~~~~~~i 227 (415) T protein:vir:94 149 FNLDKYVTVKRVTNGSGKYPVVRQSEV-AALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQEL 227 (415) T ss_pred hhhhhhcceeeccCCceeEEEEeecCC-ccceeccccccccccccccceeeEeeheeeeeechhhHHHHhhchHHHHHHH Confidence 999999999999988899998876544 5678999999999878889999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHhhccccccch---------------hhhhhHHHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHh Q lcl|Aclame:pro 224 SSWIAKKVVVTRNQAIIEVMKAAPKK---------------PTIAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLAL 288 (408) Q Consensus 224 ~~~l~~~~~~~~~~~~~~g~g~~~~~---------------~~~~~~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~ 288 (408) .++|+++++++++.+|++|+|++.+. .+...++++++++.. +...+..++.|+|||++|..|++ T Consensus 228 ~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~-~~~~~~~~~~~vmn~~~~~~l~~ 306 (415) T protein:vir:94 228 KLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINL-NVKPNYEHNVAIVSQTMFAKLDK 306 (415) T ss_pred HHHHHHHHHHHHHHHHhhccccCccccccccccccccccccccccchHHHHHHHHh-hhhhccCCCEEEEcHHHHHHHHH Confidence 99999999999999999999876432 233568889888865 55677788899999999999999 Q ss_pred hhcccCceeeccccccCCcccccccceEeeccccccccccCcceEEEEehhcceEeeeccceEEEEeccchhhhhhceee Q lcl|Aclame:pro 289 VKTAEGKYLLEPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTK 368 (408) Q Consensus 289 lkd~~G~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~ 368 (408) +||++|+|+|.+++.++.+++|+|+||+++++. |....++.+++||||+++|++++|+++++++++ |..+++. 
T Consensus 307 lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~--~~~~~~~~~i~~gd~~~~~~~~~~~~~~v~~~~-----~~~~~~~ 379 (415) T protein:vir:94 307 MKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDE--VLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTD-----YMHFGEC 379 (415) T ss_pred hhccCCCeeeccCcCCCCCceecceeeEEeccc--ccCCCCccEEEEEehhccEEEEeecceEEEEec-----cccCceE Confidence 999999999999999999999999999998764 444456778999999999999999999999875 5677889 Q ss_pred EEEEeeeCcEEecccceEEEEeeccccCCCCccCCC Q lcl|Aclame:pro 369 IRVIDRFDVKATDSEALVAGSFSAIADQVGNFKTTT 404 (408) Q Consensus 369 ~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~~~~~~~ 404 (408) +|++.|+|+++++|+||+++++++++...|.-.--+ T Consensus 380 ~r~~~r~d~~~~~~~a~~~~~~~~~~~~~~~~~~~~ 415 (415) T protein:vir:94 380 LMIAVRQDCRILDYKSAIVIEYDDSERGEGDLGLEA 415 (415) T ss_pred EEEEEEeccEEeccccEEEEEEeccCCCCCccccCC Confidence 999999999999999999999999998888777665 No 31 >protein:vir:6212 Length: 434 # NCBI annotation: prohead protease # Family: family:all:21 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852592;genbank:gi:31415852;genbank:GeneID:1489210 Probab=100.00 E-value=2.4e-66 Score=380.38 Aligned_cols=380 Identities=15% Similarity=0.139 Sum_probs=267.0 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcc----- Q lcl|Aclame:pro 1 MGVKLTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMR----- 75 (408) Q Consensus 1 M~~~~~i~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----- 75 (408) ||+++.++++++++++...+++. ..++.+...+++.+++++++++.++++.+.++++.++........ 
T Consensus 1 M~l~el~~~~~~~~~~~~a~l~~-------~~~~~~~~~ee~~~~~~e~~~l~~~~~~l~~~i~~le~~~~~~~~~~~~~ 73 (434) T protein:vir:62 1 MNLKEILNASLTRTKSRLAELQG-------KVEKNEVRSEELAAVKAEVEQLTKEIQTISEELAKLEEKEKEEDPAKKKD 73 (434) T ss_pred CCHHHHHHHHHHHHHHHHHHHHH-------HHhccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhc Confidence 88877777777666554443333 233334445556666666666666666665555443321111000 Q ss_pred --ccccc-cc---c-c-------------------------chhhhHHHHHHHHHHHhhcchhhHHHHHHHHhhcccccc Q lcl|Aclame:pro 76 --EEEKG-PL---N-K-------------------------SENELKDKFVKDFVNMVRNPMAFMNTVSSKTETSGSDSA 123 (408) Q Consensus 76 --~~~~~-~~---~-~-------------------------~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~a~~~~t~~~ 123 (408) ..... .. . . .......+++++|.++++.+.. ..+.++.+.+ +++ T Consensus 74 ~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~e~r~a~~~~l~~~~~---~~e~~a~~~~-t~~ 149 (434) T protein:vir:62 74 DDPEKKEDPTAKENPNEKTELSEEQRSAISASIAAALSTKGHRTNKETEIRSVFANYIVGNID---EKEARALGLV-TGN 149 (434) T ss_pred chhhhhcchhhhcchhhhHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHHHHHHhccccc---hhhhhhhccc-ccc Confidence 00000 00 0 0 0001112334444444443321 1233444433 467 Q ss_pred CceecchhhhhhhhhhhhhhhhhhhhhceeecccCccceEEeeccCCccccchhcccccccccccccceeeeechheeee Q lcl|Aclame:pro 124 AGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAG 203 (408) Q Consensus 124 gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~ 203 (408) ||++||+++++.|++.+++.++|+++|+++++.+ +.++++........+..|.+|++.++. 
+.++|++|+++++++++ T Consensus 150 GG~lvP~~~~~~Ii~~l~~~~~i~~~~~~~~~~~-~~~~p~~~~~~~a~~~~~~~e~~~~~~-~~~~f~~v~~~~~k~~~ 227 (434) T protein:vir:62 150 GSVTIPDFLSKEIITYAQEENFLRRLGTGVKTKE-NIKYPVLVKKAEAQGHKNERTNNEMPE-TDIEFDEIELSPTEFDA 227 (434) T ss_pred cceecchhhHHHHHHhhhhhhhhhhhcceeccCC-ceEEEEEecCCcccceecccccccccc-cccceeeEEeeheeeEe Confidence 8999999999999999999999999999988753 345555444444444455677778875 56899999999999999 Q ss_pred ehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhccccccchh-------------hhhhHHHHHHHHHHhhhhhc Q lcl|Aclame:pro 204 IITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVMKAAPKKP-------------TIAKFDDVITMINTAVDPAI 270 (408) Q Consensus 204 ~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~~g~g~~~~~~-------------~~~~~d~i~~~~~~~l~~~~ 270 (408) +++||+|||+|+.++|++||.++|+++++++++.+|++|+|++.+.. +...++++++++. .+++.| T Consensus 228 ~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~~~~g~~~~~~~~~~~~~~~~~d~l~~l~~-~l~~~~ 306 (434) T protein:vir:62 228 LATVTKKLLARTGLPIEQIVMDELKKAYVRKETQYMVNGDEANNINDGALAKKAVEFKTDEKNLYDALVKMKN-TPVKEV 306 (434) T ss_pred ehhhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccceeecccccccccccchhhHHHHHHh-hcchhh Confidence 99999999999999999999999999999999999999999765432 2234778887765 689999 Q ss_pred cCCCEEEEcHHHHHHHHhhhcccCceeeccc--cccCCcccccccceEeeccccccccccCcceEEEEehhcceEeeecc Q lcl|Aclame:pro 271 IATSSLLTNQSGLNKLALVKTAEGKYLLEPD--PTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRE 348 (408) Q Consensus 271 ~~~a~~~~n~~~~~~l~~lkd~~G~~~~~~~--~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~ 348 (408) +.+++|+|||++|..|++|||++|+|+|+|. ..++.+++|+|+||+++++...+. .++...++||||++|+ +++|. 
T Consensus 307 ~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~g~~~tl~G~pV~~~~~~~~~~-~~~~~~i~~Gdfs~~~-i~~~~ 384 (434) T protein:vir:62 307 RKKARWVLNTAALTKIETMKTDDGFPLLRPFNQAEGGIGYTLLGFPVEEEDAIDIPD-SPDTPVFYFGDFSKFY-IQDVI 384 (434) T ss_pred hcCCEEEEcHHHHHHHHHhhccCCCEeeccCCCccCCCCceecceeeEEecCccCcc-CCCceEEEEeeccceE-EEEee Confidence 9999999999999999999999999999874 345667899999999987644333 2344558899999865 45665 Q ss_pred -ceEEEEeccchhhhhhceeeEEEEeeeCcEEec-ccceEEEEeeccccCCC Q lcl|Aclame:pro 349 -NMSLLPTNIGAGAFETDTTKIRVIDRFDVKATD-SEALVAGSFSAIADQVG 398 (408) Q Consensus 349 -~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~-~~a~~~l~~~~~~~~~~ 398 (408) .++++++++. .|.++++.||++.|+|+++++ |.++++++++-..++.+ T Consensus 385 g~~~i~~~~~~--~~~~~~v~~~~~~r~Dgk~i~~~~~~~~~~~~~~~~~~~ 434 (434) T protein:vir:62 385 GSLEVQKLVEL--FSRTNRVGFRIWNLLDAQLIHSPFEVPVYKYVLKAPTGA 434 (434) T ss_pred ceeEEEeehhh--hcccCceEEEEEeeecceeecCcccceEEEEEeccCCCC Confidence 5888887655 478999999999999999876 99888887765444444 No 32 >protein:vir:4856 Length: 293 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049396;genbank:gi:9632424;genbank:GeneID:1258532 Probab=100.00 E-value=1.1e-67 Score=387.68 Aligned_cols=293 Identities=63% Similarity=0.955 Sum_probs=274.5 Q ss_pred HHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeecccCccceEEeeccCCccccchhcccccccccccccc Q lcl|Aclame:pro 112 SSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQL 191 (408) Q Consensus 112 ~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f 191 (408) -.++++.+++++||++||+++.++|++.+++.++|+++|+++++++.+++++++...+..+.+.|++|++++++++.++| T Consensus 1 ~l~~~~~~t~~~gg~liP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~ 80 (293) T protein:vir:48 1 MLDSKTDHSGSDAGLTIPQDIRTAINTLVRQYDSLQEYVNVENVTTLTGSRVYEKWTDITGLANIDDEAGKIADIDDPKL 80 (293) T ss_pred CceeecccccCcCceEechhHHHHHHHHHHhhhhhhhhceeeeccCCcceEEEEeecCCCcceeeecCCcccccccccce Confidence 
45667788889999999999999999999999999999999999999999999988777788899999999998788999 Q ss_pred eeeeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhccccccchhhhhhHHHHHHHHHHhhhhhcc Q lcl|Aclame:pro 192 TIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVMKAAPKKPTIAKFDDVITMINTAVDPAII 271 (408) Q Consensus 192 ~~v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~~g~g~~~~~~~~~~~d~i~~~~~~~l~~~~~ 271 (408) ++++++++|++++++||+|+++|+.++|++||.++|++++++++|.+|++|++++++..+..+++++++++. .++++|+ T Consensus 81 ~~i~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~i~~g~~~~~~~~~~~~~d~i~~~~~-~l~~~~~ 159 (293) T protein:vir:48 81 SLIKYTIKRYAGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILGVVDKLPTKPTLTKWDDIIDLEA-KVDPAIK 159 (293) T ss_pred eEEEEeeeEEEEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHhHHhhccccccccccccCHHHHHHHHH-hhhhhhc Confidence 999999999999999999999999999999999999999999999999999999999999999999999774 5888999 Q ss_pred CCCEEEEcHHHHHHHHhhhcccCceeeccccccCCcccccccceEeeccccccccccCcceEEEEehhcceEeeeccceE Q lcl|Aclame:pro 272 ATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMS 351 (408) Q Consensus 272 ~~a~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~ 351 (408) .+++|+||+++|.+|+++||++|||+|++++.++.+++|+|+||+++++..+|....++..++||||+++|++++|++++ T Consensus 160 ~~a~~vmn~~~~~~L~~lkd~~g~~l~~~~~~~~~~~~l~G~Pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~ 239 (293) T protein:vir:48 160 QTSFFLTNTSGFTALKKVKNALGDYLMERDVKSPTGYSIAGFAVKEISDRWLPNASSGVMPLYFGDLKQAVTLFDRQQMS 239 (293) T ss_pred CCCEEEEcHHHHHHHHHhhccCCceEeecCcCCCCCceecceeeEEecccccCCccCCceEEEEEeccceEEEEEecceE Confidence 99999999999999999999999999999999999999999999999888888888888899999999999999999999 Q ss_pred EEEeccchhhhhhceeeEEEEeeeCcEEecccceEEEEeeccccCCCCccCCCc Q lcl|Aclame:pro 352 LLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFSAIADQVGNFKTTTS 405 (408) Q Consensus 352 i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~~~~~~~~ 405 (408) ++++++..+.|++|++.||++.|+|+++++|+||+++++++++.++++..+.+- T Consensus 240 
i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~~~~~~~~~~~~~~ 293 (293) T protein:vir:48 240 LLSTNIGGGAFETDTTKVRVIDRFDVVATDTEAFVPASFKAIADQKGNIGSTAV 293 (293) T ss_pred EEEecccchhhhcCeEEEEEEEeeCcEEecccceEEEEeeccccCCccccccCC Confidence 999998878899999999999999999999999999999999988887766555 No 33 >protein:vir:962 Length: 397 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076616;genbank:gi:13095724;genbank:GeneID:920264 Probab=100.00 E-value=2e-65 Score=375.25 Aligned_cols=380 Identities=21% Similarity=0.255 Sum_probs=279.4 Q ss_pred CChHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHHhh--hcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh Q lcl|Aclame:pro 1 MGVKL-----TVNQLNEAWIASGDKVTDFNDQINMALND--DNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVN 73 (408) Q Consensus 1 M~~~~-----~i~el~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 73 (408) |..++ .+++++++++++.+..+++.++.+++... +...+++..++..+++++..+++.+++++.+++...... T Consensus 1 m~~k~~~l~~~~~el~~~l~eL~e~~~~l~~~~~el~~~~ee~~~~e~~~~~~~~~~~l~~~i~~l~~~i~~~~~~~~~l 80 (397) T protein:vir:96 1 MALKQLILNKQIKERSSEIDKLLSQRSDLEKQENDLERALEEAKTDEEISTVSDSADDLEKQVKDLDEKIAELQKEKQDL 80 (397) T ss_pred CcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 77664 35556666666665555555554443221 112334555566666666666666666555544322211 Q ss_pred cccccccccccchhhhHHHHHHHHHHHhh----------cchhhHHHHHHHHhhccccccCceecchhhhhhhhhhhhhh Q lcl|Aclame:pro 74 MREEEKGPLNKSENELKDKFVKDFVNMVR----------NPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQY 143 (408) Q Consensus 74 ~~~~~~~~~~~~~~~~~~~~~~a~~~~~~----------~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~ 143 (408) ............... .....+....... .................+..+||++||+++...|++ +++. 
T Consensus 81 ~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~-~~~~ 158 (397) T protein:vir:96 81 EDELAKAADPTDQKP-KDGEKRKMKKFKVTEEELAEKRSAINAFVKSKGAEKRDGFTSVEGGALIPQELLQPQLE-PKDI 158 (397) T ss_pred HHHHHhhhhhhhhhh-HHHHHHHHHHHhhhhHHHHHHHHHHHHHHHhhhhhhhhcccccccccchhHHHHHHHHH-hhhh Confidence 111100000000000 0001111100000 000111112233445567788999999999999998 5778 Q ss_pred hhhhhhhceeecccCccceEEeeccCCccccchhcccccccccccccceeeeechheeeeehHHHHHHHhcchHHHHHHH Q lcl|Aclame:pro 144 DSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWL 223 (408) Q Consensus 144 ~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~~~~~~~v 223 (408) .+++++|+++++++.++.++++... ....+|++|+++.++.+.++|++|+++++++++++++|+++++|+.+++++|| T Consensus 159 ~~l~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~E~~~~~~~~~~~~~~i~~~~~~~~~~~~~s~ell~ds~~~l~~~i 236 (397) T protein:vir:96 159 VDLSKYVRSVPVNSASGKFPVISKS--GSKMATVQQLEKNPQLANPKMVEIDYSVATRRGYIPISQEMIDDASYDVTGLI 236 (397) T ss_pred hhHHHhhhhccccccceeEEEEecc--CCccccccccccccccccccccceeecHhHhhcchhhHHHHHhhhHHHHHHHH Confidence 8899999999999888888877653 34568999999999888899999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHhhccccccchhhhhhHHHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHhhhcccCceeeccccc Q lcl|Aclame:pro 224 SSWIAKKVVVTRNQAIIEVMKAAPKKPTIAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDPT 303 (408) Q Consensus 224 ~~~l~~~~~~~~~~~~~~g~g~~~~~~~~~~~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~lkd~~G~~~~~~~~~ 303 (408) .+.|+++++.+++.+|++|+|.+++. +..++|++++++...++..+ +++|+|||++|..|++|||++|+|+|++++. 
T Consensus 237 ~~~l~~~~~~~~~~~i~~g~g~~~~~-~~~~~d~~~~~~~~~~~~~~--~a~~v~n~~~~~~l~~lkd~~G~~~~~~~~~ 313 (397) T protein:vir:96 237 ADEIQDQSLNTKNADIAAVLKTATAK-SVVGVDGLKDLINKEIKKVY--DVKLFISASMYSELDKLKDKNGRYLLQDSIT 313 (397) T ss_pred HHHHHHHHHHHHHHHHhhcccccccc-cccchHHHHHHHHHhhhhhc--CcEEEEcHHHHHHHHHhhccCCCeEeccCcc Confidence 99999999999999999999988775 56779999999877665544 6799999999999999999999999999999 Q ss_pred cCCcccccccceEeeccccccccccCcceEEEEehhcceEeeeccceEEEEeccchhhhhhceeeEEEEeeeCcEEeccc Q lcl|Aclame:pro 304 KPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSE 383 (408) Q Consensus 304 ~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~ 383 (408) ++.+++|+|+||+++++. .+....++.+++||||+++|++++|+++++.++++. .+.+.+|+++|+||++++|+ T Consensus 314 ~~~~~~l~G~pv~~~~~~-~~~~~~~~~~~~~gd~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~r~d~~~~~~~ 387 (397) T protein:vir:96 314 AASGKQLLGKEVVVLDDD-VIGKSVGNVVGFIGDAKAFASFFDRKQVSVSWVDNN-----IYGQLLAGIIRYDVKATDKK 387 (397) T ss_pred CCCcccccccceEEeccc-ccCCCCCceEEEEeehhcceEeEeecceEEEEeccc-----ccceeEEEEEEEccEEeccc Confidence 999999999999988764 444567788899999999999999999999987653 34568999999999999999 Q ss_pred ceEEEEeecc Q lcl|Aclame:pro 384 ALVAGSFSAI 393 (408) Q Consensus 384 a~~~l~~~~~ 393 (408) ||++++++++ T Consensus 388 a~~~~~~~~a 397 (397) T protein:vir:96 388 AGFYVTFTIG 397 (397) T ss_pred ceEEEEeecC Confidence 9999999988 No 34 >protein:vir:4339 Length: 395 # NCBI annotation: major head protein # Family: family:all:585 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061502;genbank:gi:9635591;genbank:GeneID:1262860 Probab=100.00 E-value=8.8e-64 Score=366.29 Aligned_cols=374 Identities=16% Similarity=0.231 Sum_probs=284.0 Q ss_pred CCh-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccc Q lcl|Aclame:pro 1 MGV-KLTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEK 79 (408) Q Consensus 1 
M~~-~~~i~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (408) |.- +..|+||+++++++.++++++.++.++.... ..+...+++.+++++..+++.++.++.+.+............ T Consensus 1 m~~~~k~l~el~~~~~~~~~~~~~~~e~~~~~~~~---~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (395) T protein:vir:43 1 MSDFEKQIGELNASLKQVGDQIKSQAEQVNTQIAN---FGEMNKETRAKVDELLTAQGELQARLSAAEQAMLANEKRDGG 77 (395) T ss_pred ChhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccc Confidence 553 3578888888888888777766655433222 223444555555666666666665555544433322221111 Q ss_pred c--ccccchhhhHHHHHHHHHHHhhcchhhHHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeeccc Q lcl|Aclame:pro 80 G--PLNKSENELKDKFVKDFVNMVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVST 157 (408) Q Consensus 80 ~--~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~ 157 (408) . ............+.++|...++.+... ...+....++++++|++||++++..|++.+++.++|+++|+++++.+ T Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~g~~vp~~~~~~ii~~~~~~~~l~~l~~~~~~~~ 154 (395) T protein:vir:43 78 EEAPKTAGQMVAESLKEQGVTSSLRGSHRV---SMPRSAITSIDGSGGALVAPDRRPGVVAAPQRRLTIRDLVAPGTTES 154 (395) T ss_pred cchhhhHHHHHHHHHHHHHHHHHhhhhhhh---hhhhhhhcccCCCCccccchhhHHHHHHHHHhhhhHHhhccceecCC Confidence 1 111222223344556666666654322 12233445566778899999999999999999999999999999876 Q ss_pred CccceEEeeccCCccccchhcccccccccccccceeeeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 158 SNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQ 237 (408) Q Consensus 158 ~~g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~ 237 (408) .. +.++...+..+.+.|++|++++|+. .++|+++++++++++++++||+++|+|++ ++++||.+.|+++++.++|. 
T Consensus 155 ~~--~~~~~~~~~~~~a~~v~E~~~~~~~-~~~~~~i~~~~~k~~~~~~is~ell~d~~-~l~~~v~~~la~a~~~~~d~ 230 (395) T protein:vir:43 155 NS--VEYVRETGFVNNAAPVSEGTQKPYS-DLTFELENAPVRTIAHLFKASRQILDDAS-ALQSYIDARARYGLMLVEEC 230 (395) T ss_pred Cc--eEEEEEecCCCceeeecCCcccccc-ccceeEEEEeeeeEEEeehhhHHHHHhHH-HHHHHHHHHHHHHHHHHHHH Confidence 54 4455555556677999999999965 58999999999999999999999999976 69999999999999999999 Q ss_pred HHhhccccccchhhh------------------hhHHHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHhhhcccCceeec Q lcl|Aclame:pro 238 AIIEVMKAAPKKPTI------------------AKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLE 299 (408) Q Consensus 238 ~~~~g~g~~~~~~~~------------------~~~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~lkd~~G~~~~~ 299 (408) ++++|+|++.+..+. ..++++++++ ..+.+.+..+++|+|||++|..|++++|++|+|+|. T Consensus 231 ~~l~G~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~~~i~~~~-~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~i~~ 309 (395) T protein:vir:43 231 QLLYGNGTGANLHGIIPQAQAYAPPSGVVVTAEQRIDRIRLAI-LQAQLAEFPASGIVLNPIDWALIELNKDAENRYIIG 309 (395) T ss_pred HHHhccCCCCccccccccccccccccccccccchhHHHHHHHH-HhhccccCCCcEEEEcHHHHHHHHHhhccCCceecc Confidence 999999877653221 2355666665 557888999999999999999999999999999996 Q ss_pred cccccCCcccccccceEeeccccccccccCcceEEEEehhcceEeeeccceEEEEeccchhhhhhceeeEEEEeeeCcEE Q lcl|Aclame:pro 300 PDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKA 379 (408) Q Consensus 300 ~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v 379 (408) + +.++.+++|+|+||++++. +|. 
+.++||||+++|.+++|++++|+++++....|++|++.||++.|+||++ T Consensus 310 ~-~~~~~~~~l~G~pVv~~~~--~~~-----~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v 381 (395) T protein:vir:43 310 S-PQNGTTPTLWRLPVVETQA--ITQ-----DEFLTGAFSLGAQIFDRMDIEVLVSTENDKDFENNMVTIRAEERLAFAV 381 (395) T ss_pred c-cccCCCceecceeeEEcCC--CCC-----CcEEEEeccceEEEEEecceEEEEeccccchhhcCcEEEEEEEeeccEE Confidence 4 6677778999999988653 443 4589999999888899999999999988888999999999999999999 Q ss_pred ecccceEEEEeecc Q lcl|Aclame:pro 380 TDSEALVAGSFSAI 393 (408) Q Consensus 380 ~~~~a~~~l~~~~~ 393 (408) ++|+||++++++++ T Consensus 382 ~~~~a~~~~~~taa 395 (395) T protein:vir:43 382 YRPEAFVTGSLTAS 395 (395) T ss_pred ecccceEEEEeccC Confidence 99999999999988 No 35 >protein:vir:100135 Length: 418 # NCBI annotation: gp5 # Family: family:all:585 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945035;genbank:gi:38707895;genbank:GeneID:2744182 Probab=100.00 E-value=3.3e-63 Score=363.13 Aligned_cols=383 Identities=15% Similarity=0.189 Sum_probs=276.8 Q ss_pred CChHHHHHH-------HHHHHHHHHHHHHHHHHHHHHHHhh----hcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MGVKLTVNQ-------LNEAWIASGDKVTDFNDQINMALND----DNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAE 69 (408) Q Consensus 1 M~~~~~i~e-------l~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 69 (408) |+..-.+.+ ++++++++.++++++.++++++.++ .....+...+++++++++.++.+++++++.+++.. 
T Consensus 4 ~~~~~~~~~~~~~~~el~~~~~e~~~~l~~~~~e~~~~~e~~~~e~~~~~~~~~e~~~~~~~l~~~~~~l~~~~~~~e~~ 83 (418) T protein:vir:10 4 MNEPRQFGRKSGGDSHPEQVLETVTKELKRIGDEVKSAGEKALAEAKRAGDLGVETKATVDELLIKQGELQARLLEAEQK 83 (418) T ss_pred chhHHHHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 443333322 3333333333333333333333222 11122334455566666777777777766665554 Q ss_pred Hhhhcccccc-cccccchhhhHHHHHHHHHHHhhcchhhH----HHHHHHHhhccccccCceecchhhhhhhhhhhhhhh Q lcl|Aclame:pro 70 QVVNMREEEK-GPLNKSENELKDKFVKDFVNMVRNPMAFM----NTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYD 144 (408) Q Consensus 70 ~~~~~~~~~~-~~~~~~~~~~~~~~~~a~~~~~~~~~~~~----~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~ 144 (408) .......... ...............++|..+++.+.... .....+.....+.++||++||++++..|++.+++.+ T Consensus 84 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~lvp~~~~~~ii~~~~~~~ 163 (418) T protein:vir:10 84 LARGGGSAELETPKTLGQLVTESEEMKGMDGSARKSVRVRVDRKSIMNVPATVGSGVSGSNSLVVADRQAGIIAPPQRKM 163 (418) T ss_pred HhhcccccccchhhhhhHHhhhHHHHHHHHHHHhhhhhhhhHHHHHHHhhhhccCCCCCCccccchhHHHHHHHHHhhhh Confidence 3332222111 11111222223345556666655443221 111223444556778899999999999999999999 Q ss_pred hhhhhhceeecccCccceEEeeccCCccccchhcccccccccccccceeeeechheeeeehHHHHHHHhcchHHHHHHHH Q lcl|Aclame:pro 145 SLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWLS 224 (408) Q Consensus 145 ~l~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~ 224 (408) +|+++++++++++.+. .++...+..+.+.|++|+++++++ .++|++|++.+++++++++||+++++|+. ++++||. 
T Consensus 164 ~l~~~~~~~~~~~~~~--~~~~~~~~~~~a~~v~E~~~~~~~-~~~f~~v~~~~~k~~~~~~is~ell~ds~-~l~~~i~ 239 (418) T protein:vir:10 164 TIRDLLMPGQTSSSSI--EYTVETGFTNNAAAVAEGAQKPTS-DLKFNLKNQPVRTIAHLFKASRQILDDAP-ALQSYID 239 (418) T ss_pred hHHhhcceeeccCCce--eEEEEecCCCceeeeccCcccccc-ccceeeEEEeeeeEEEeehhhHHHHHhHH-HHHHHHH Confidence 9999999999876554 455555555677999999999865 58999999999999999999999999986 7999999 Q ss_pred HHHHHHHHHHHHHHHhhccccccchh----------------hhhhHHHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHh Q lcl|Aclame:pro 225 SWIAKKVVVTRNQAIIEVMKAAPKKP----------------TIAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLAL 288 (408) Q Consensus 225 ~~l~~~~~~~~~~~~~~g~g~~~~~~----------------~~~~~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~ 288 (408) +.|++++++++|.+|++|+|++..+. +...++++++++.. +...+..+++|+|||++|..|++ T Consensus 240 ~~l~~a~~~~~d~a~l~G~g~~~~p~Gi~~~~~~~~~~~~~~~~~~~~~i~~~~~~-~~~~~~~~~~~v~n~~~~~~L~~ 318 (418) T protein:vir:10 240 GRARYGLQLTEEGQILKGDGTGANILGILPQASAFMPSITLANATPIDKIRLALLQ-AVLAEFPATGIVLNPIDWASIEL 318 (418) T ss_pred HHHHHHHHHHHHHHHhccCCCCccccccccccccccccccccccccHHHHHHHHHh-hccccCCCCEEEEcHHHHHHHHH Confidence 99999999999999999998764221 22346777777754 67788888899999999999999 Q ss_pred hhcccCceeeccccccCCcccccccceEeeccccccccccCcceEEEEehhcceEeeeccceEEEEeccchhhhhhceee Q lcl|Aclame:pro 289 VKTAEGKYLLEPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTK 368 (408) Q Consensus 289 lkd~~G~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~ 368 (408) ++|++|+|+|. ++.++.+++|+|+||++++. +|. +.++||||+++|++++++++++.++++....|.+|++. 
T Consensus 319 lkd~~G~~i~~-~~~~~~~~~l~G~pV~~~~~--~p~-----~~~~~gd~s~~~~~~~~~~~~i~~~~~~~~~f~~~~~~ 390 (418) T protein:vir:10 319 TKDSQGRYIVG-NPVNGTTPRLWNLPVVETQA--MTA-----NEFLVGAFSMAAQIFDRMEIEVLLSTENVDDFEKNMVS 390 (418) T ss_pred hhcCCCceecc-ccccCCCceecceeeEEcCC--CCC-----CcEEEeeccceEEEEEecceEEEEecccchhhhcCceE Confidence 99999999995 56777788999999998643 453 45899999998888999999999999988889999999 Q ss_pred EEEEeeeCcEEecccceEEEEeeccccC Q lcl|Aclame:pro 369 IRVIDRFDVKATDSEALVAGSFSAIADQ 396 (408) Q Consensus 369 ~r~~~r~d~~v~~~~a~~~l~~~~~~~~ 396 (408) ||++.|+||++++|+||+++++++++.= T Consensus 391 ~r~~~~~d~~~~~~~a~~~~~~~~~~~g 418 (418) T protein:vir:10 391 IRAEERLALAVYRPESFVTGALVEQAGG 418 (418) T ss_pred EEEEEeeccEEecccceEEEEeccCCCC Confidence 9999999999999999999988754433 No 36 >protein:vir:1328 Length: 392 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047927;swissprot:trembl:q9zwv6;genbank:gi:9631145;uniprot:Q9ZWV6;genbank:GeneID:2715889 Probab=100.00 E-value=3.8e-63 Score=362.78 Aligned_cols=369 Identities=14% Similarity=0.096 Sum_probs=264.0 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccc Q lcl|Aclame:pro 1 MGVKLTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAE---AMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREE 77 (408) Q Consensus 1 M~~~~~i~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (408) |+.. +|++|++++.++.++++.+.++... .+.+.+ ++.+++.+++.++.++++..+.+.............. 
T Consensus 1 m~~~-~l~~l~e~r~~~~~e~~~l~~~~~~----~~~~~e~~~~~~~l~~e~~~l~~~i~~~~e~~~~~~~~~~~~~~~~ 75 (392) T protein:vir:13 1 MDAT-TLSANFEARERATAELRSLTDEFAG----KEMTAEAREKEERLLTAVADFDGRIKRGIDAIKATDAVTSLLSGLQ 75 (392) T ss_pred CCHH-HHHHHHHHHHHHHHHHHHHHHHhhc----ccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccC Confidence 7754 5788988888888888888775432 222222 2334444555554444332222222111111111111 Q ss_pred cccccccchhhhHHHHHHHHHHHhhcchhhH-HHHHH-HHhhccccccCceecchhhhhhhhhh-hhhhhhhhhhhceee Q lcl|Aclame:pro 78 EKGPLNKSENELKDKFVKDFVNMVRNPMAFM-NTVSS-KTETSGSDSAAGLTIPQDIRTMINTL-VRQYDSLQQYVRVES 154 (408) Q Consensus 78 ~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~-~~~~~-~a~~~~t~~~gg~~vP~~~~~~ii~~-~~~~~~l~~~~~~~~ 154 (408) .... .........+..+++++.... ...+. .....++.+++|.++|+.+...+|.. +...++++.++++++ T Consensus 76 ---~~~~---~~~~~~~~~~~~~~r~g~~~~~~~~~~~~~~~~~t~~~~g~~~~~~~~~~~i~~~~~~~~~l~~~~~~~~ 149 (392) T protein:vir:13 76 ---GSGS---GAQRSADHDDDAVLRAGNLGEARSFEFAPEKRDGTKAGNPNVLSRTLYGQLIAQAVERSAIMRGGASTFT 149 (392) T ss_pred ---Cccc---chhhhhhHHHHHHHhccchhhhHHHHhhhhhhcccccCCCccccccchHHHHHHHHhhhhhhhhcceeee Confidence 0111 111122223344444443211 11111 11223444555666677766776655 455667788888887 Q ss_pred cccCccceEEeeccCCccccchhcccccccccccccceeeeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 155 VSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVT 234 (408) Q Consensus 155 ~~~~~g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~ 234 (408) +.+. +.+.+|... 
..+.++|++|++++|++ .++|++++++++|++++++||+|+|+|+.++|++||.++|+++++++ T Consensus 150 ~~~~-~~~~~~~~~-~~~~a~~v~E~~~~~~~-~~~f~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~ 226 (392) T protein:vir:13 150 TSDA-NPMDFTVIT-GRATAGIVGETAEIPES-YPATTQRSMGGFKYGFASVVSYEFATDQVLDLVGFLVSDAGPAIGDA 226 (392) T ss_pred cCCC-ceeEEEEEc-CCcceeeeccccccccc-ccceeeEEeeeeeEEeeehhHHHHHhcchHHHHHHHHHHHHHHHHHH Confidence 6543 456666654 45678999999999975 58999999999999999999999999999999999999999999999 Q ss_pred HHHHHhhccccccch-----------------hhhhhHHHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHhhhcccCcee Q lcl|Aclame:pro 235 RNQAIIEVMKAAPKK-----------------PTIAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEGKYL 297 (408) Q Consensus 235 ~~~~~~~g~g~~~~~-----------------~~~~~~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~lkd~~G~~~ 297 (408) ++.+|++|+|++.|. .+...++++++++. .+++.|+.+++|+||++++..|++|||++|+|+ T Consensus 227 ~d~~~l~G~Gt~~p~Gil~~~~~~~~~~~~~~~~~~~~d~l~~~~~-~l~~~~~~~a~~v~n~~~~~~l~~lkd~~G~~l 305 (392) T protein:vir:13 227 MGRHFLTGTGTGQPRGILTDATGANAAFGEADADSKVSDALIDLFH-EVPSAYRKNAKFVVNDLRAAQMRKLKDANGQYL 305 (392) T ss_pred HHHHHhcccCCccccccccccccccccccccccccccHHHHHHHHH-hhhhhhhcCCEEEEcHHHHHHHHHhhccCCcee Confidence 999999999976542 12234778887764 588999999999999999999999999999999 Q ss_pred eccccccCCcccccccceEeeccccccccccCcceEEEEehhcceEeeeccceEEEEeccchhhhhhceeeEEEEeeeCc Q lcl|Aclame:pro 298 LEPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDV 377 (408) Q Consensus 298 ~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~ 377 (408) |+++++.+.+++|+|+||+++++ +| .++++||||++ |.+.++++++++++.+. 
.|.+|++.||++.|+|| T Consensus 306 ~~~~~~~g~~~~l~G~Pv~~~~~--~~-----~~~i~~Gdf~~-~~i~~~~~~~i~~~~~~--~~~~~~~~~r~~~r~d~ 375 (392) T protein:vir:13 306 WQSALTVGAPDTFNGKVVETDDG--MP-----ADKVLFADLSK-YRVRFAGSLRVDRSVDA--KFSTDQIVYRFLQRADG 375 (392) T ss_pred ecCCcCCCCCceecceeeEEcCC--CC-----CCcEEEeeccc-eeEEeecceEEEeeccc--cccCCcEEEEEEEEecc Confidence 99999999999999999998653 44 34689999997 56678999999988765 48999999999999999 Q ss_pred EEecccceEEEEeeccc Q lcl|Aclame:pro 378 KATDSEALVAGSFSAIA 394 (408) Q Consensus 378 ~v~~~~a~~~l~~~~~~ 394 (408) ++.+|+||+++++++++ T Consensus 376 ~~~~~~A~~~~~~~~aa 392 (392) T protein:vir:13 376 LLVDARGAKVLTVTPAA 392 (392) T ss_pred EEecccceEEEEeeccC Confidence 99999999999999988 No 37 >protein:vir:191 Length: 385 # NCBI annotation: major head subunit precursor # Family: family:all:585 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037701;genbank:gi:9634158;genbank:GeneID:1262530 Probab=100.00 E-value=6.1e-63 Score=361.68 Aligned_cols=369 Identities=13% Similarity=0.160 Sum_probs=279.6 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccc Q lcl|Aclame:pro 1 MGVKLTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKG 80 (408) Q Consensus 1 M~~~~~i~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (408) |+ +|++|+++++++.++++++.++.++..+ ...++.++++++++.+.++++.+++.+.+.+......... 
T Consensus 1 M~---~l~el~~~~~~~~~e~~~l~~~~~~e~~---~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---- 70 (385) T protein:vir:19 1 MS---ELALIQKAIEESQQKMTQLFDAQKAEIE---STGQVSKQLQSDLMKVQEELTKSGTRLFDLEQKLASGAEN---- 70 (385) T ss_pred Ch---HHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccc---- Confidence 66 3788988888888888888766543322 2334556666666666666666555555444332211111 Q ss_pred ccccchhhhHHHHHHHHHHHhhcchhhHHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeecccCcc Q lcl|Aclame:pro 81 PLNKSENELKDKFVKDFVNMVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNG 160 (408) Q Consensus 81 ~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g 160 (408) ..............+.+.++.........+.+.....++.++|.+||+++...|++.+++.++|+++|+++++.+.. T Consensus 71 --~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~- 147 (385) T protein:vir:19 71 --PGEKKSFSERAAEELIKSWDGKQGTFGAKTFNKSLGSDADSAGSLIQPMQIPGIIMPGLRRLTIRDLLAQGRTSSNA- 147 (385) T ss_pred --cchhhhhHHHHHHHHHHHHHHhhccchhhHHHhhhccccccCCceecchhhhHHHHHhhhccchhhhcceecccCcc- Confidence 11111111122233333333333222333334344444555677789999999999999999999999999987654 Q ss_pred ceEEeeccCCccccchhcccccccccccccceeeeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 161 SRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAII 240 (408) Q Consensus 161 ~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~ 240 (408) +.++...+..+.+.|++|++++|+. 
+++|+++++++++++++++||+|+++|++ ++++||.++|+++++.++|.+|+ T Consensus 148 -~~~~~~~~~~~~a~~v~E~~~~~~~-~~~~~~~~~~~~k~~~~~~is~ell~d~~-~l~~~i~~~la~a~~~~~d~~~l 224 (385) T protein:vir:19 148 -LEYVREEVFTNNADVVAEKALKPES-DITFSKQTANVKTIAHWVQASRQVMDDAP-MLQSYINNRLMYGLALKEEGQLL 224 (385) T ss_pred -eEEEEEecCCcceeeeccCcccccc-ccceeEEEEeeeeEEEeehhhHHHHhhHH-HHHHHHHHHHHHHHHHHHHHHHH Confidence 4455555556677999999999865 58999999999999999999999999986 69999999999999999999999 Q ss_pred hccccccchh----------------hhhhHHHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHhhhcccCceeecccccc Q lcl|Aclame:pro 241 EVMKAAPKKP----------------TIAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTK 304 (408) Q Consensus 241 ~g~g~~~~~~----------------~~~~~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~ 304 (408) +|+|++.+.. +...++++++++ ..+...+..+++|+|||++|..|+++||++|+|+|.+ +.. T Consensus 225 ~G~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~d~i~~~~-~~l~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~l~~~-~~~ 302 (385) T protein:vir:19 225 NGDGTGDNLEGLNKVATAYDTSLNATGDTRADIIAHAI-YQVTESEFSASGIVLNPRDWHNIALLKDNEGRYIFGG-PQA 302 (385) T ss_pred hccCCCCcccccccccccccccccccccchHHHHHHHH-HhhccccCCCCEEEEcHHHHHHHHHhhcCCCceeccC-ccc Confidence 9998875522 223466677776 4578889999999999999999999999999999964 667 Q ss_pred CCcccccccceEeeccccccccccCcceEEEEehhcceEeeeccceEEEEeccchhhhhhceeeEEEEeeeCcEEecccc Q lcl|Aclame:pro 305 PNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEA 384 (408) Q Consensus 305 ~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a 384 (408) +.+++|+|+||++++. +|. 
+.++||||+++|.++++++++++++++..+.|.+|++.||++.|+|+++.+|+| T Consensus 303 ~~~~~l~G~pV~~~~~--~p~-----~~~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a 375 (385) T protein:vir:19 303 FTSNIMWGLPVVPTKA--QAA-----GTFTVGGFDMASQVWDRMDATVEVSREDRDNFVKNMLTILCEERLALAHYRPTA 375 (385) T ss_pred CCCceecceeeEEcCc--CCC-----CcEEEeecccEEEEEEecceEEEEeccccchhhcCcEEEEEEEeeccEEecccc Confidence 7788999999998653 443 458999999989999999999999988888899999999999999999999999 Q ss_pred eEEEEeeccc Q lcl|Aclame:pro 385 LVAGSFSAIA 394 (408) Q Consensus 385 ~~~l~~~~~~ 394 (408) |+++++++++ T Consensus 376 ~~~~~~~aa~ 385 (385) T protein:vir:19 376 IIKGTFSSGS 385 (385) T ss_pred eEEEEeccCC Confidence 9999999998 No 38 >protein:vir:1886 Length: 385 # NCBI annotation: major capsid subunit precursor # Family: family:all:585 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037666;genbank:gi:9634124;genbank:GeneID:1262513 Probab=100.00 E-value=6.1e-63 Score=361.68 Aligned_cols=369 Identities=13% Similarity=0.160 Sum_probs=279.6 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccc Q lcl|Aclame:pro 1 MGVKLTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKG 80 (408) Q Consensus 1 M~~~~~i~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (408) |+ +|++|+++++++.++++++.++.++..+ ...++.++++++++.+.++++.+++.+.+.+......... 
T Consensus 1 M~---~l~el~~~~~~~~~e~~~l~~~~~~e~~---~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---- 70 (385) T protein:vir:18 1 MS---ELALIQKAIEESQQKMTQLFDAQKAEIE---STGQVSKQLQSDLMKVQEELTKSGTRLFDLEQKLASGAEN---- 70 (385) T ss_pred Ch---HHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccc---- Confidence 66 3788988888888888888766543322 2334556666666666666666555555444332211111 Q ss_pred ccccchhhhHHHHHHHHHHHhhcchhhHHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeecccCcc Q lcl|Aclame:pro 81 PLNKSENELKDKFVKDFVNMVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNG 160 (408) Q Consensus 81 ~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g 160 (408) ..............+.+.++.........+.+.....++.++|.+||+++...|++.+++.++|+++|+++++.+.. T Consensus 71 --~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~- 147 (385) T protein:vir:18 71 --PGEKKSFSERAAEELIKSWDGKQGTFGAKTFNKSLGSDADSAGSLIQPMQIPGIIMPGLRRLTIRDLLAQGRTSSNA- 147 (385) T ss_pred --cchhhhhHHHHHHHHHHHHHHhhccchhhHHHhhhccccccCCceecchhhhHHHHHhhhccchhhhcceecccCcc- Confidence 11111111122233333333333222333334344444555677789999999999999999999999999987654 Q ss_pred ceEEeeccCCccccchhcccccccccccccceeeeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 161 SRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAII 240 (408) Q Consensus 161 ~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~ 240 (408) +.++...+..+.+.|++|++++|+. 
+++|+++++++++++++++||+|+++|++ ++++||.++|+++++.++|.+|+ T Consensus 148 -~~~~~~~~~~~~a~~v~E~~~~~~~-~~~~~~~~~~~~k~~~~~~is~ell~d~~-~l~~~i~~~la~a~~~~~d~~~l 224 (385) T protein:vir:18 148 -LEYVREEVFTNNADVVAEKALKPES-DITFSKQTANVKTIAHWVQASRQVMDDAP-MLQSYINNRLMYGLALKEEGQLL 224 (385) T ss_pred -eEEEEEecCCcceeeeccCcccccc-ccceeEEEEeeeeEEEeehhhHHHHhhHH-HHHHHHHHHHHHHHHHHHHHHHH Confidence 4455555556677999999999865 58999999999999999999999999986 69999999999999999999999 Q ss_pred hccccccchh----------------hhhhHHHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHhhhcccCceeecccccc Q lcl|Aclame:pro 241 EVMKAAPKKP----------------TIAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTK 304 (408) Q Consensus 241 ~g~g~~~~~~----------------~~~~~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~ 304 (408) +|+|++.+.. +...++++++++ ..+...+..+++|+|||++|..|+++||++|+|+|.+ +.. T Consensus 225 ~G~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~d~i~~~~-~~l~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~l~~~-~~~ 302 (385) T protein:vir:18 225 NGDGTGDNLEGLNKVATAYDTSLNATGDTRADIIAHAI-YQVTESEFSASGIVLNPRDWHNIALLKDNEGRYIFGG-PQA 302 (385) T ss_pred hccCCCCcccccccccccccccccccccchHHHHHHHH-HhhccccCCCCEEEEcHHHHHHHHHhhcCCCceeccC-ccc Confidence 9998875522 223466677776 4578889999999999999999999999999999964 667 Q ss_pred CCcccccccceEeeccccccccccCcceEEEEehhcceEeeeccceEEEEeccchhhhhhceeeEEEEeeeCcEEecccc Q lcl|Aclame:pro 305 PNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEA 384 (408) Q Consensus 305 ~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a 384 (408) +.+++|+|+||++++. +|. 
+.++||||+++|.++++++++++++++..+.|.+|++.||++.|+|+++.+|+| T Consensus 303 ~~~~~l~G~pV~~~~~--~p~-----~~~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a 375 (385) T protein:vir:18 303 FTSNIMWGLPVVPTKA--QAA-----GTFTVGGFDMASQVWDRMDATVEVSREDRDNFVKNMLTILCEERLALAHYRPTA 375 (385) T ss_pred CCCceecceeeEEcCc--CCC-----CcEEEeecccEEEEEEecceEEEEeccccchhhcCcEEEEEEEeeccEEecccc Confidence 7788999999998653 443 458999999989999999999999988888899999999999999999999999 Q ss_pred eEEEEeeccc Q lcl|Aclame:pro 385 LVAGSFSAIA 394 (408) Q Consensus 385 ~~~l~~~~~~ 394 (408) |+++++++++ T Consensus 376 ~~~~~~~aa~ 385 (385) T protein:vir:18 376 IIKGTFSSGS 385 (385) T ss_pred eEEEEeccCC Confidence 9999999998 No 39 >protein:vir:6242 Length: 390 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813696;swissprot:trembl:q859c1;genbank:gi:29366756;interpro:IPR006444;uniprot:Q859C1;genbank:GeneID:1258897 Probab=100.00 E-value=7.1e-63 Score=361.31 Aligned_cols=369 Identities=14% Similarity=0.099 Sum_probs=258.7 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccc Q lcl|Aclame:pro 1 MGVKLTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAE---AMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREE 77 (408) Q Consensus 1 M~~~~~i~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (408) |+. |+|++|++++.++.++++.+.++.. +...+.+ ++++++.++++++++++...+................ 
T Consensus 1 m~~-~~l~~l~e~r~~~~~e~~~L~~~~~----~~~lt~e~~~~~~~l~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~ 75 (390) T protein:vir:62 1 MDA-TTLSANFEARERATAELRTLTDEFA----GKEMTDEAREKEERLITAVSDYDARIKRGIEAIKAIDPVTSLLSGLQ 75 (390) T ss_pred CCh-hHHHHHHHHHHHHHHHHHHHHHHhh----cccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc Confidence 664 4567777777777777777766432 1222222 2334444455554444433332222211111110000 Q ss_pred cccccccchhhhHHHHHHHHHHHhhcchhhH---HHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhceee Q lcl|Aclame:pro 78 EKGPLNKSENELKDKFVKDFVNMVRNPMAFM---NTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVES 154 (408) Q Consensus 78 ~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~---~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~ 154 (408) .... ...... ...+..++|++.... ...........+.++|++++|+.+...|++.++..++++++|++++ T Consensus 76 ~~~~--~~~~~~----~~~~~~~~r~~~~~~~r~~~~~~~~~~~t~~~~g~~~~~~~~~~~i~~~~~~~~~l~~~~~~~~ 149 (390) T protein:vir:62 76 GSGS--GAQRSA----DVDDDATLRAGNLGEARSFEFAPEKRDGTKAGNPNVLSRTLYGQLIAQAVERSAIMRGGATTFT 149 (390) T ss_pred cccc--cchhhc----chHHHHHHhhhhhhhhHHHHhhhhhhcccccCCCccccccchHHHHHHHHhhhhhhhhcceeee Confidence 0000 000000 011222333332111 1111112223334445555555555556677778888899999988 Q ss_pred cccCccceEEeeccCCccccchhcccccccccccccceeeeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 155 VSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVT 234 (408) Q Consensus 155 ~~~~~g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~ 234 (408) +.+. 
+.+.+|...+ .+.+.|++|++.+|++ .++|+++++++++++++++||+|+|+|+.++|++||.+.|+++++.+ T Consensus 150 ~~~~-~~~~~p~~~~-~~~a~wv~E~~~~~~~-~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~ 226 (390) T protein:vir:62 150 TSDA-NPLDFTVITG-RSSASIVGETAEIPES-YPATAQRSMGGFKYGFASVVSYEFATDQVLDLVGFLVSDAGPAIGDA 226 (390) T ss_pred cCCC-ceeEEEEEcC-Ccceeeeccccccccc-ccceeeeEeeeeeEEeehHHHHHHHhhhhHHHHHHHHHHHHHHHHHH Confidence 7643 3466776543 4677999999999975 58999999999999999999999999999999999999999999999 Q ss_pred HHHHHhhccccccc---------------hhhhhhHHHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHhhhcccCceeec Q lcl|Aclame:pro 235 RNQAIIEVMKAAPK---------------KPTIAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLE 299 (408) Q Consensus 235 ~~~~~~~g~g~~~~---------------~~~~~~~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~lkd~~G~~~~~ 299 (408) +|.+|++|+|.... ..+...++++++++. .+++.|+.+++|+||++++.+|++|||++|+|+|+ T Consensus 227 ~d~~~l~G~G~p~Gi~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-~l~~~~~~~a~~vmn~~~~~~L~~lkd~~g~~l~~ 305 (390) T protein:vir:62 227 MGRHFITGTGQPRGILTDASPATATFLATDTDSKVSDALIDLFH-EVPSAYRANAKYVVNDLRAAQMRKLKDANGQYLWQ 305 (390) T ss_pred HHhhhhccCCccccccccccccccceecccccccchHHHHHHHH-hhhhhhhcCCEEEEchHHHHHHHHhhccCCCeeec Confidence 99999999874211 122345788887775 58899999999999999999999999999999999 Q ss_pred cccccCCcccccccceEeeccccccccccCcceEEEEehhcceEeeeccceEEEEeccchhhhhhceeeEEEEeeeCcEE Q lcl|Aclame:pro 300 PDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKA 379 (408) Q Consensus 300 ~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v 379 (408) |++..+.+.+|+|+||+++++ +|. ..++||||++ |.+.++++++++++.+. 
.|.+|++.||++.|+||++ T Consensus 306 ~~~~~g~~~~l~G~Pv~~~~~--~p~-----~~i~~gd~s~-~~i~~~~~~~v~~~~~~--~~~~~~~~~~~~~r~d~~~ 375 (390) T protein:vir:62 306 SGLTVGAPSLFNGKVVETDDG--MPA-----DKILFADLSK-YRVRFAGSLRVDRSVDA--KFSTDQIVYRFLQRADGLL 375 (390) T ss_pred CCcCCCccceecccceEEecC--CCC-----ccEEEeeccc-eeEEeecceEEEeeccc--cccCCcEEEEEEEEeCcEe Confidence 999999999999999998654 443 4589999997 45678999999998765 4999999999999999999 Q ss_pred ecccceEEEEeeccc Q lcl|Aclame:pro 380 TDSEALVAGSFSAIA 394 (408) Q Consensus 380 ~~~~a~~~l~~~~~~ 394 (408) ++|+||++|++++.+ T Consensus 376 ~~~~A~~~l~~~~~a 390 (390) T protein:vir:62 376 VDARGAKVLTVTPGA 390 (390) T ss_pred echhheEEEEeecCC Confidence 999999999999998 No 40 >protein:vir:101650 Length: 497 # NCBI annotation: gp13 # Family: family:all:585 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654768;genbank:gi:109302766;genbank:GeneID:4156084 Probab=100.00 E-value=5.5e-63 Score=361.92 Aligned_cols=384 Identities=13% Similarity=0.116 Sum_probs=245.6 Q ss_pred CChH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhc----ccHHH------HHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MGVK----LTVNQLNEAWIASGDKVTDFNDQINMALNDDN----FSAEA------MSELKNKRDNEKVRRDALREQLVEA 66 (408) Q Consensus 1 M~~~----~~i~el~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~ 66 (408) |.-. ...+++.++++++..+..++.+|++++..... ...++ ..++..+++.+.++++.++..+.+. 
T Consensus 1 ~~~~~~l~~~~~~~~~~~~~~~~~~~~~~aE~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~ 80 (497) T protein:vir:10 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEV 80 (497) T ss_pred CCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 4322 22233333333333333333333222222110 00011 1111111222222222222222222 Q ss_pred HHHHhhhcccc-cccc---------cccch--hh--------hHH-HHHH--HHHHHhhcchhhHHHHHHHHhhcccccc Q lcl|Aclame:pro 67 QAEQVVNMREE-EKGP---------LNKSE--NE--------LKD-KFVK--DFVNMVRNPMAFMNTVSSKTETSGSDSA 123 (408) Q Consensus 67 ~~~~~~~~~~~-~~~~---------~~~~~--~~--------~~~-~~~~--a~~~~~~~~~~~~~~~~~~a~~~~t~~~ 123 (408) +.......... .... ..... .. ... .... .+...... ......+.+....+++++ T Consensus 81 e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~ 158 (497) T protein:vir:10 81 EVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFAD--GETAPAAIGQNPFGSTGT 158 (497) T ss_pred HhhhhhhHHHHHHHHHhhhHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhh--hhhhHHHHHhhhcccCcc Confidence 11100000000 0000 00000 00 000 0000 00000000 111122345556778889 Q ss_pred CceecchhhhhhhhhhhhhhhhhhhhhceeecccCccceEEeeccCCccccchhcccccccccccccceeeeechheeee Q lcl|Aclame:pro 124 AGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAG 203 (408) Q Consensus 124 gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~ 203 (408) ||++||+++...|++.+++.++|++++++++++++ .+.||...+..+.++|++|++.+|+ ++++|++|++.++++++ T Consensus 159 gg~~vp~~~~~~ii~~~~~~~~i~~l~~~~~~~~~--~~~~~~~~~~~~~a~wv~E~~~~~~-s~~~f~~i~~~~~k~a~ 235 (497) T protein:vir:10 159 FAPGILPTFLPGIVEQLFYELSLADLISSRPVTSP--NLSYLTESAAHNNAAAVAEAGTYPF-SSEEFARVYEQVGKVAN 235 (497) T ss_pred cccccchhhhHHHHHHHHhhhhHHhhccccccCCC--ceEEEEEcCCCCcceeeccCccccc-ccccceeeEeeeeeeEe Confidence 99999999999999999999999999999998765 4566666666677899999999997 55899999999999999 Q ss_pred 
ehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhccccccchhhhh------------------------------ Q lcl|Aclame:pro 204 IITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVMKAAPKKPTIA------------------------------ 253 (408) Q Consensus 204 ~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~~g~g~~~~~~~~~------------------------------ 253 (408) +++||+|||+|++ ++++||.++|++++++++|.+|++|+|++.|.+-.. T Consensus 236 ~~~iS~ell~d~~-~l~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 314 (497) T protein:vir:10 236 ALTITDEGLRDAP-ELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADG 314 (497) T ss_pred ecHhHHHHHHhHH-HHHHHHHHHHHHHHHHHHHHHhhcCCCcccccccccccccccccccccchhhhhhhhhhhhhhccc Confidence 9999999999986 599999999999999999999999999765321100 Q ss_pred ---------------------------------------hHHHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHhhhcccC Q lcl|Aclame:pro 254 ---------------------------------------KFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEG 294 (408) Q Consensus 254 ---------------------------------------~~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~lkd~~G 294 (408) ..+.+..++.......+.....|+|||.+|..|+++||++| T Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G 394 (497) T protein:vir:10 315 TNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANG 394 (497) T ss_pred ccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhhcccCCCeEEEchHHHHHHHHhhcCCC Confidence 00011112222223344556689999999999999999999 Q ss_pred ceeecccccc------CCcccccccceEeeccccccccccCcceEEEEehhc-ceEeeeccceEEEEeccchhhhhhcee Q lcl|Aclame:pro 295 KYLLEPDPTK------PNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQ-AITLFDRENMSLLPTNIGAGAFETDTT 367 (408) Q Consensus 295 ~~~~~~~~~~------~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~-~~~~~~~~~~~i~~~~~~~~~f~~~~~ 367 (408) +|+|++.... ..+++|+|+||++++. +|. 
+.++||||++ +|.+++|++++|+++++....|++|++ T Consensus 395 ~~i~~~~~~~~~~~~~~~~~~l~G~pV~~t~~--~~~-----~~~~~Gd~~~~~~~i~~r~~~~v~~~~~~~~~f~~n~v 467 (497) T protein:vir:10 395 QYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPL--IPL-----GTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKV 467 (497) T ss_pred ceeccCcccccccccccCCceeeceeeEecCC--CCC-----CceEEeecccceEEEEEecccEEEeecccchhhhcCcE Confidence 9999875422 2346899999998654 342 4579999998 466789999999999998888999999 Q ss_pred eEEEEeeeCcEEecccceEEEEeeccccCC Q lcl|Aclame:pro 368 KIRVIDRFDVKATDSEALVAGSFSAIADQV 397 (408) Q Consensus 368 ~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~ 397 (408) .||++.|+|+.|++|+||+++++++++... T Consensus 468 ~~r~~~r~~~~v~~p~A~~~l~~~~~~~~~ 497 (497) T protein:vir:10 468 TVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) T ss_pred EEEEEEeecceeeccccEEEEEecCCccCC Confidence 999999999999999999999998887776 No 41 >protein:vir:7855 Length: 497 # NCBI annotation: gp12 # Family: family:all:585 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817462;genbank:gi:29565891;genbank:GeneID:1259081 Probab=100.00 E-value=5.5e-63 Score=361.92 Aligned_cols=384 Identities=13% Similarity=0.116 Sum_probs=245.6 Q ss_pred CChH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhc----ccHHH------HHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MGVK----LTVNQLNEAWIASGDKVTDFNDQINMALNDDN----FSAEA------MSELKNKRDNEKVRRDALREQLVEA 66 (408) Q Consensus 1 M~~~----~~i~el~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~ 66 (408) |.-. ...+++.++++++..+..++.+|++++..... ...++ ..++..+++.+.++++.++..+.+. 
T Consensus 1 ~~~~~~l~~~~~~~~~~~~~~~~~~~~~~aE~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~ 80 (497) T protein:vir:78 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEV 80 (497) T ss_pred CCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 4322 22233333333333333333333222222110 00011 1111111222222222222222222 Q ss_pred HHHHhhhcccc-cccc---------cccch--hh--------hHH-HHHH--HHHHHhhcchhhHHHHHHHHhhcccccc Q lcl|Aclame:pro 67 QAEQVVNMREE-EKGP---------LNKSE--NE--------LKD-KFVK--DFVNMVRNPMAFMNTVSSKTETSGSDSA 123 (408) Q Consensus 67 ~~~~~~~~~~~-~~~~---------~~~~~--~~--------~~~-~~~~--a~~~~~~~~~~~~~~~~~~a~~~~t~~~ 123 (408) +.......... .... ..... .. ... .... .+...... ......+.+....+++++ T Consensus 81 e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~ 158 (497) T protein:vir:78 81 EVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFAD--GETAPAAIGQNPFGSTGT 158 (497) T ss_pred HhhhhhhHHHHHHHHHhhhHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhh--hhhhHHHHHhhhcccCcc Confidence 11100000000 0000 00000 00 000 0000 00000000 111122345556778889 Q ss_pred CceecchhhhhhhhhhhhhhhhhhhhhceeecccCccceEEeeccCCccccchhcccccccccccccceeeeechheeee Q lcl|Aclame:pro 124 AGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAG 203 (408) Q Consensus 124 gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~ 203 (408) ||++||+++...|++.+++.++|++++++++++++ .+.||...+..+.++|++|++.+|+ ++++|++|++.++++++ T Consensus 159 gg~~vp~~~~~~ii~~~~~~~~i~~l~~~~~~~~~--~~~~~~~~~~~~~a~wv~E~~~~~~-s~~~f~~i~~~~~k~a~ 235 (497) T protein:vir:78 159 FAPGILPTFLPGIVEQLFYELSLADLISSRPVTSP--NLSYLTESAAHNNAAAVAEAGTYPF-SSEEFARVYEQVGKVAN 235 (497) T ss_pred cccccchhhhHHHHHHHHhhhhHHhhccccccCCC--ceEEEEEcCCCCcceeeccCccccc-ccccceeeEeeeeeeEe Confidence 99999999999999999999999999999998765 4566666666677899999999997 55899999999999999 Q ss_pred 
ehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhccccccchhhhh------------------------------ Q lcl|Aclame:pro 204 IITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVMKAAPKKPTIA------------------------------ 253 (408) Q Consensus 204 ~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~~g~g~~~~~~~~~------------------------------ 253 (408) +++||+|||+|++ ++++||.++|++++++++|.+|++|+|++.|.+-.. T Consensus 236 ~~~iS~ell~d~~-~l~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 314 (497) T protein:vir:78 236 ALTITDEGLRDAP-ELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADG 314 (497) T ss_pred ecHhHHHHHHhHH-HHHHHHHHHHHHHHHHHHHHHhhcCCCcccccccccccccccccccccchhhhhhhhhhhhhhccc Confidence 9999999999986 599999999999999999999999999765321100 Q ss_pred ---------------------------------------hHHHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHhhhcccC Q lcl|Aclame:pro 254 ---------------------------------------KFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEG 294 (408) Q Consensus 254 ---------------------------------------~~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~lkd~~G 294 (408) ..+.+..++.......+.....|+|||.+|..|+++||++| T Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G 394 (497) T protein:vir:78 315 TNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANG 394 (497) T ss_pred ccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhhcccCCCeEEEchHHHHHHHHhhcCCC Confidence 00011112222223344556689999999999999999999 Q ss_pred ceeecccccc------CCcccccccceEeeccccccccccCcceEEEEehhc-ceEeeeccceEEEEeccchhhhhhcee Q lcl|Aclame:pro 295 KYLLEPDPTK------PNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQ-AITLFDRENMSLLPTNIGAGAFETDTT 367 (408) Q Consensus 295 ~~~~~~~~~~------~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~-~~~~~~~~~~~i~~~~~~~~~f~~~~~ 367 (408) +|+|++.... ..+++|+|+||++++. +|. 
+.++||||++ +|.+++|++++|+++++....|++|++ T Consensus 395 ~~i~~~~~~~~~~~~~~~~~~l~G~pV~~t~~--~~~-----~~~~~Gd~~~~~~~i~~r~~~~v~~~~~~~~~f~~n~v 467 (497) T protein:vir:78 395 QYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPL--IPL-----GTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKV 467 (497) T ss_pred ceeccCcccccccccccCCceeeceeeEecCC--CCC-----CceEEeecccceEEEEEecccEEEeecccchhhhcCcE Confidence 9999875422 2346899999998654 342 4579999998 466789999999999998888999999 Q ss_pred eEEEEeeeCcEEecccceEEEEeeccccCC Q lcl|Aclame:pro 368 KIRVIDRFDVKATDSEALVAGSFSAIADQV 397 (408) Q Consensus 368 ~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~ 397 (408) .||++.|+|+.|++|+||+++++++++... T Consensus 468 ~~r~~~r~~~~v~~p~A~~~l~~~~~~~~~ 497 (497) T protein:vir:78 468 TVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) T ss_pred EEEEEEeecceeeccccEEEEEecCCccCC Confidence 999999999999999999999998887776 No 42 >protein:vir:10364 Length: 390 # NCBI annotation: head protein; major capsid subunit precursor # Family: family:all:585 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858956;genbank:gi:32128421;genbank:GeneID:2648357 Probab=100.00 E-value=2.1e-62 Score=358.72 Aligned_cols=368 Identities=14% Similarity=0.154 Sum_probs=266.0 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccc Q lcl|Aclame:pro 3 VKLTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAE---AMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEK 79 (408) Q Consensus 3 ~~~~i~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (408) +++.+++|+++++++.++++++.++... +.+.+.+ .+.++..+++.+.++++++++++.+.+....... . 
T Consensus 1 m~e~~~~l~~~~~~~~~~~~~~~e~~~~---~~~~~~e~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~----~ 73 (390) T protein:vir:10 1 MTDITSKLEATLANVTDSLRAFGERAVR---DGELNASARSKVDELFATVGNLSAEVQAARQRVAELEGNGAGGD----V 73 (390) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHh---hcccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccc----c Confidence 5555777888888888777776655332 2222222 3344555555566666655555444333222111 1 Q ss_pred cccccchhhhHHHHHHHHHHHhhcchhhH---HHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeecc Q lcl|Aclame:pro 80 GPLNKSENELKDKFVKDFVNMVRNPMAFM---NTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVS 156 (408) Q Consensus 80 ~~~~~~~~~~~~~~~~a~~~~~~~~~~~~---~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~ 156 (408) ............+..+++......+.... ..........++++++|.++|+++...|++.+++.++|+++|+++++. T Consensus 74 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ii~~~~~~~~l~~~~~~~~~~ 153 (390) T protein:vir:10 74 QHVSVGDLFVASEQFQASAGRWNDRSARATMNIKAALNTASTDAAGSAGALTTPNRLPGFITQPDARLTVRDLIGSGRTD 153 (390) T ss_pred cccchhhhhhhhHHHHHHHHhhhhhhhhhhhHHHHHHHhhhcccccccccccchhHHHHHHHHHHhhchhhhhcceeecc Confidence 11111122222233344443333222111 111112233344455566788888899999999999999999999987 Q ss_pred cCccceEEeeccCCccccchhcccccccccccccceeeeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 157 TSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRN 236 (408) Q Consensus 157 ~~~g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~ 236 (408) +.+ +.++...+..+.+.|++|++++|+. 
.++|+++++++++++++++||++|++|++ ++++||.++|+++++++++ T Consensus 154 ~~~--~~~~~~~~~~~~a~~v~Eg~~~~~~-~~~~~~i~~~~~k~~~~~~is~ell~d~~-~l~~~i~~~l~~~~~~~~~ 229 (390) T protein:vir:10 154 SAL--IEYVQETGFVNNAAIVAEGALKPES-SLKFAKKTDTTHVIAHTMKATRQILSDAP-QLASYMNNRLIRGLKVKED 229 (390) T ss_pred CCc--eEEEEEecCCcceeeecCCcccccc-ccceeEEEEeeEEEEEeehhhHHHHHhHH-HHHHHHHHHHHHHHHHHHH Confidence 654 4555555556678999999999975 58999999999999999999999999986 7999999999999999999 Q ss_pred HHHhhccccccchh----------------hhhhHHHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHhhhcccCceeecc Q lcl|Aclame:pro 237 QAIIEVMKAAPKKP----------------TIAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEP 300 (408) Q Consensus 237 ~~~~~g~g~~~~~~----------------~~~~~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~lkd~~G~~~~~~ 300 (408) .++++|+|++..+. +...++++++++ ..+.+.+..+++|+|||++|.+|+++||++|+|+|++ T Consensus 230 ~~il~G~G~~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~-~~l~~~~~~~~~~v~n~~~~~~L~~lkd~~g~~l~~~ 308 (390) T protein:vir:10 230 AEILRGTGANDGLLGLIPQATTYAAPTTIAGATRVDQLRLAM-LQASLAEYPASGIVINPIDWAAIELAKDANNQYLIGN 308 (390) T ss_pred HHHhhcCCCCccccccccccccccccccccccchHHHHHHHH-HhhccccCCCCEEEEcHHHHHHHHHhhcCCCceeecC Confidence 99999998765321 122355666665 5578889999999999999999999999999999987 Q ss_pred ccccCCcccccccceEeeccccccccccCcceEEEEehhcceEeeeccceEEEEeccchhhhhhceeeEEEEeeeCcEEe Q lcl|Aclame:pro 301 DPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKAT 380 (408) Q Consensus 301 ~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~ 380 (408) ... +.+++|+|+||++++. +|. +.++||||+++|.+++|++++++++++. 
..|.+|++.||++.|+||+++ T Consensus 309 ~~~-~~~~~l~G~pv~~~~~--~p~-----~~~~~gdf~~~~~~~~~~~~~i~~~~~~-~~~~~~~~~~r~~~r~d~~v~ 379 (390) T protein:vir:10 309 ARG-TLTPTLWGLPVVATQA--MAP-----GEFLVGAFDLAAQIFDQWDARVEIGYVN-DDFQRNMVTVLAEERLALVVY 379 (390) T ss_pred CcC-cCCceecceeeEEcCC--CCC-----CcEEEEeccceEEEEEecceEEEEeecc-cccccCcEEEEEEEeeccEEe Confidence 554 4456999999998653 443 4589999999898899999999987643 459999999999999999999 Q ss_pred cccceEEEEee Q lcl|Aclame:pro 381 DSEALVAGSFS 391 (408) Q Consensus 381 ~~~a~~~l~~~ 391 (408) +|+||+++++. T Consensus 380 ~~~a~~~~~~a 390 (390) T protein:vir:10 380 RPEALISGSFA 390 (390) T ss_pred ccccEEEEEeC Confidence 99999999998 No 43 >protein:vir:81070 Length: 390 # NCBI annotation: p09 # Family: family:all:585 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285679;genbank:gi:148727187;genbank:GeneID:5247115 Probab=100.00 E-value=8.4e-62 Score=355.41 Aligned_cols=368 Identities=15% Similarity=0.160 Sum_probs=263.3 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccc Q lcl|Aclame:pro 1 MGVKLTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSA---EAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREE 77 (408) Q Consensus 1 M~~~~~i~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (408) |. ..+++|++++.++.++++.+.++... +...+. +.+.++..+++.+.++++++++.+.+.+....... 
T Consensus 1 m~--~l~~~l~~~~~~~~~~~~~~~e~~~~---~~~~~~e~~~~~~~l~~e~~~l~~~i~~~e~~~~~~~~~~~~~~--- 72 (390) T protein:vir:81 1 MT--DITSKLEATLANVTDSLRAFGERAVR---DGELNASARSKVDELFATVGNLSAEVQAARQRVAELEGNGAGGD--- 72 (390) T ss_pred Ch--HHHHHHHHHHHHHHHHHHHHHHHHHh---hcCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccc--- Confidence 43 33566777777776666665544222 222222 23344444555555555555444443322211111 Q ss_pred cccccccchhhhHHHHHHHHHHHhhcchhhH---HHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhceee Q lcl|Aclame:pro 78 EKGPLNKSENELKDKFVKDFVNMVRNPMAFM---NTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVES 154 (408) Q Consensus 78 ~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~---~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~ 154 (408) ................+++........... ..........++++++|+++|+++...|++.+++.++|++++++++ T Consensus 73 -~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ii~~~~~~~~l~~~~~~~~ 151 (390) T protein:vir:81 73 -VQHVSVGDMFVASEQFQASAGRWNDRSARATMNIKAALNTASTDAAGSAGALTTPNRLPGFITPPDARLTVRDLIGSGR 151 (390) T ss_pred -cccccchhhhhhhHHHHHHHHHHhhhhhhhhhHHHHHHHhhccccccCCcceechhhhHHHHHHHhhhhhhhhhcceee Confidence 111111111112222333333222221111 1111223344566778889999999999999999999999999998 Q ss_pred cccCccceEEeeccCCccccchhcccccccccccccceeeeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 155 VSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVT 234 (408) Q Consensus 155 ~~~~~g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~ 234 (408) +.+. 
.+.++...+..+.+.|++|++++|++ .++|+++++++++++++++||+|+++|++ ++++||.+.|+++++++ T Consensus 152 ~~~~--~~~~~~~~~~~~~a~~v~Eg~~~~~~-~~~~~~i~~~~~k~~~~~~is~ell~d~~-~~~~~i~~~l~~~~~~~ 227 (390) T protein:vir:81 152 TDSA--LIEYVQETGFVNNAAIVAEGALKPES-SLKFAKKTDTTHVIAHTMKATRQILSDAP-QLASYMNNRLIRGLKVK 227 (390) T ss_pred ccCC--ceEEEEEecCCcceeeecCCcccccc-cceeeEEEEeeeEEEEeehhhHHHHHhHH-HHHHHHHHHHHHHHHHH Confidence 8754 45566666666678999999999975 58999999999999999999999999986 79999999999999999 Q ss_pred HHHHHhhccccccchh----------------hhhhHHHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHhhhcccCceee Q lcl|Aclame:pro 235 RNQAIIEVMKAAPKKP----------------TIAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLL 298 (408) Q Consensus 235 ~~~~~~~g~g~~~~~~----------------~~~~~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~lkd~~G~~~~ 298 (408) +|.+|++|+|++..+. +...++++++++ ..+.+.+..+++|+|||++|..|+++||++|+|+| T Consensus 228 ~d~a~l~G~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~ 306 (390) T protein:vir:81 228 EDAEILRGTGANDGLLGLIPQATTYAAPTTIAGATRVDQLRLAM-LQASLAEYNPSGIVINPIDWAAIELAKDANNQYLI 306 (390) T ss_pred HHHHHHhcCCCCCcccceeecccccccccccccchhHHHHHHHH-HhhccccCCCCEEEEcHHHHHHHHHhhcCCCceee Confidence 9999999998765321 223456666666 45778888999999999999999999999999999 Q ss_pred ccccccCCcccccccceEeeccccccccccCcceEEEEehhcceEeeeccceEEEEeccchhhhhhceeeEEEEeeeCcE Q lcl|Aclame:pro 299 EPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVK 378 (408) Q Consensus 299 ~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~ 378 (408) ++ ...+.+++|+|+||+++++ +| .+.++||||+++|.+++|++++++++++. 
..|.+|++.||++.|+|++ T Consensus 307 ~~-~~~~~~~~l~G~pv~~~~~--~p-----~~~~~~gd~~~~~~~~~~~~~~v~~~~~~-~~~~~~~v~~r~~~r~d~~ 377 (390) T protein:vir:81 307 GN-ARGTLTPTLWGLPVVATQA--MA-----PGEFLVGAFDLAAQIFDQWDARVEIGYVG-EDFQRNMITVLAEERLALV 377 (390) T ss_pred cC-cccccCceecceeeEEcCC--CC-----CCcEEEEehhceEEEEEecceEEEEeccc-chhhcCcEEEEEEEeeccE Confidence 76 4455667999999998653 44 34589999999888899999999988653 4699999999999999999 Q ss_pred EecccceEEEEee Q lcl|Aclame:pro 379 ATDSEALVAGSFS 391 (408) Q Consensus 379 v~~~~a~~~l~~~ 391 (408) +++|+||+++++. T Consensus 378 v~~~~a~v~~t~a 390 (390) T protein:vir:81 378 VYRPEALISGSFA 390 (390) T ss_pred EecccceEEEEeC Confidence 9999999999998 No 44 >protein:vir:94424 Length: 387 # NCBI annotation: ORF010 # Family: family:all:658 # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240005;genbank:gi:66395666;genbank:GeneID:5133084 Probab=100.00 E-value=4.6e-62 Score=356.87 Aligned_cols=367 Identities=11% Similarity=0.174 Sum_probs=279.7 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccc Q lcl|Aclame:pro 1 MGVKLTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKG 80 (408) Q Consensus 1 M~~~~~i~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (408) |. +|+||++++.++.++++++++++.++..+...+.+++.+++.++++++++++.++++++..+............. 
T Consensus 1 Mk---~l~el~~~~~~~~~~~~~~~~el~e~~~~~~~~~eei~~~~~~~~~l~~~~~~l~~~~~~~e~~~~~~~~~~~~~ 77 (387) T protein:vir:94 1 MP---TLYELKQSLGMIGQQLKNKNDELSQKATDPNIDMEDIKQLETEKAGLQQRFNIVERQVQDIEEKEKAKVKDKGEA 77 (387) T ss_pred Cc---hHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccc Confidence 33 367888889999999999999998888888888899999999999999999999888887665443332222211 Q ss_pred ccccchhhhHHHHHHHHHHHhhcch--------hhHHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhce Q lcl|Aclame:pro 81 PLNKSENELKDKFVKDFVNMVRNPM--------AFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRV 152 (408) Q Consensus 81 ~~~~~~~~~~~~~~~a~~~~~~~~~--------~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~ 152 (408) .... .......++|.+++++.. ........+++..+++++||++||+++.++|++.+++.++|++++++ T Consensus 78 ~~~~---~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~ 154 (387) T protein:vir:94 78 YQSL---SDNEKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARL 154 (387) T ss_pred CCCC---chhHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhhhccCCCCCCceeechhHHHHHHHHHHhhchhhhhcee Confidence 1111 111222334444444321 11122344667788889999999999999999999999999999999 Q ss_pred eecccCccceEEeeccCCccccchhcccccccccccccceeeeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 153 ESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVV 232 (408) Q Consensus 153 ~~~~~~~g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~ 232 (408) +++.+. . 
+|+.....+.+.|++|++..+++ .++|++|++.+++++++++||+|||+||.++|++||.++|+++++ T Consensus 155 ~~~~~~--~--~p~~~~~~~~a~~v~Eg~~~~~~-~~~f~~v~l~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~~~~ 229 (387) T protein:vir:94 155 TNIKGL--E--IPRVSYTLDDDDFITDVETAKEL-KAKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLA 229 (387) T ss_pred eecCCc--e--eeeeeccCCcccccccccccccc-ccccceeeechheeeeechhhHHHHhhhHHHHHHHHHHHHHHHHH Confidence 887542 3 34444445667999999999875 589999999999999999999999999999999999999999999 Q ss_pred HHHHH-HHhhccccccc-----------hhhhhhHHHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHhhhcccCceeecc Q lcl|Aclame:pro 233 VTRNQ-AIIEVMKAAPK-----------KPTIAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEP 300 (408) Q Consensus 233 ~~~~~-~~~~g~g~~~~-----------~~~~~~~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~lkd~~G~~~~~~ 300 (408) ++++. .|.+|+|++.+ .++...+|+++++++ .+++.|+.+++|+||+.+|..+.++++.+|+|+|.+ T Consensus 230 ~~e~~~~~~~g~g~g~~~g~~~~~~~~~~~~~~~~d~i~~~~~-~l~~~y~~na~~imn~~t~~~~~~~~~~~~~~~~~~ 308 (387) T protein:vir:94 230 AKERKDALAVSPKSGLEHMSFYNGSVKEVEGADMYDAIINALA-DLHEDYRDNATIYMRYADYVKIISVLSNGTTNFFDT 308 (387) T ss_pred HHHHHhHhhcCCCccccceeeeccccccccccchHHHHHHHHh-ccChhhhcCCEEEEechHHHHHHHHHhcCCCccccc Confidence 99765 45566665432 223345888988886 589999999999999999888776666677777753 Q ss_pred ccccCCcccccccceEeeccccccccccCcceEEEEehhcceEeeeccceEEEEeccchhhhhhceeeEEEEeeeCcEEe Q lcl|Aclame:pro 301 DPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKAT 380 (408) Q Consensus 301 ~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~ 380 (408) .+.+|+|+||+++++ ..+++||||++||..+ .++.+..+.+ ..++++.|++..|+|++++ T Consensus 309 -----~~~~llG~PV~~~~~---------~~~~~~GDf~~~~~~~--~~~~~~~~~~----~~~~~~~~~~~~r~Dg~v~ 368 (387) T protein:vir:94 309 -----PAEKVFGKPVVFTDA---------AVKPIVGDFNYFGINY--DGTTYDTDKD----VKKGEYLFVLTAWYDQQRT 368 (387) T ss_pred -----CCccccccceEEecC---------CCceeeechhhhhhhh--hhhhheeccc----ccCCceEEEEEEEeCcEee Confidence 456899999999764 2347999999987654 4555555443 
3368999999999999999 Q ss_pred cccceEEEEeeccccCCCC Q lcl|Aclame:pro 381 DSEALVAGSFSAIADQVGN 399 (408) Q Consensus 381 ~~~a~~~l~~~~~~~~~~~ 399 (408) +|+||+++++++.+.+.+. T Consensus 369 ~~~A~~~l~~ka~~~~~~~ 387 (387) T protein:vir:94 369 LDSAFRIAKAKENTGPLPS 387 (387) T ss_pred chhheEEEEeecCCCCCCC Confidence 9999999999887655444 No 45 >protein:vir:96978 Length: 387 # NCBI annotation: ORF009 # Family: family:all:658 # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239859;genbank:gi:66395517;genbank:GeneID:5133011 Probab=100.00 E-value=4.6e-62 Score=356.87 Aligned_cols=367 Identities=11% Similarity=0.174 Sum_probs=279.7 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccc Q lcl|Aclame:pro 1 MGVKLTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKG 80 (408) Q Consensus 1 M~~~~~i~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (408) |. +|+||++++.++.++++++++++.++..+...+.+++.+++.++++++++++.++++++..+............. T Consensus 1 Mk---~l~el~~~~~~~~~~~~~~~~el~e~~~~~~~~~eei~~~~~~~~~l~~~~~~l~~~~~~~e~~~~~~~~~~~~~ 77 (387) T protein:vir:96 1 MP---TLYELKQSLGMIGQQLKNKNDELSQKATDPNIDMEDIKQLETEKAGLQQRFNIVERQVQDIEEKEKAKVKDKGEA 77 (387) T ss_pred Cc---hHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccc Confidence 33 367888889999999999999998888888888899999999999999999999888887665443332222211 Q ss_pred ccccchhhhHHHHHHHHHHHhhcch--------hhHHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhce Q lcl|Aclame:pro 81 PLNKSENELKDKFVKDFVNMVRNPM--------AFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRV 152 (408) Q Consensus 81 ~~~~~~~~~~~~~~~a~~~~~~~~~--------~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~ 152 (408) .... .......++|.+++++.. 
........+++..+++++||++||+++.++|++.+++.++|++++++ T Consensus 78 ~~~~---~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~ 154 (387) T protein:vir:96 78 YQSL---SDNEKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARL 154 (387) T ss_pred CCCC---chhHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhhhccCCCCCCceeechhHHHHHHHHHHhhchhhhhcee Confidence 1111 111222334444444321 11122344667788889999999999999999999999999999999 Q ss_pred eecccCccceEEeeccCCccccchhcccccccccccccceeeeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 153 ESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVV 232 (408) Q Consensus 153 ~~~~~~~g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~ 232 (408) +++.+. . +|+.....+.+.|++|++..+++ .++|++|++.+++++++++||+|||+||.++|++||.++|+++++ T Consensus 155 ~~~~~~--~--~p~~~~~~~~a~~v~Eg~~~~~~-~~~f~~v~l~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~~~~ 229 (387) T protein:vir:96 155 TNIKGL--E--IPRVSYTLDDDDFITDVETAKEL-KAKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLA 229 (387) T ss_pred eecCCc--e--eeeeeccCCcccccccccccccc-ccccceeeechheeeeechhhHHHHhhhHHHHHHHHHHHHHHHHH Confidence 887542 3 34444445667999999999875 589999999999999999999999999999999999999999999 Q ss_pred HHHHH-HHhhccccccc-----------hhhhhhHHHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHhhhcccCceeecc Q lcl|Aclame:pro 233 VTRNQ-AIIEVMKAAPK-----------KPTIAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEP 300 (408) Q Consensus 233 ~~~~~-~~~~g~g~~~~-----------~~~~~~~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~lkd~~G~~~~~~ 300 (408) ++++. 
.|.+|+|++.+ .++...+|+++++++ .+++.|+.+++|+||+.+|..+.++++.+|+|+|.+ T Consensus 230 ~~e~~~~~~~g~g~g~~~g~~~~~~~~~~~~~~~~d~i~~~~~-~l~~~y~~na~~imn~~t~~~~~~~~~~~~~~~~~~ 308 (387) T protein:vir:96 230 AKERKDALAVSPKSGLEHMSFYNGSVKEVEGADMYDAIINALA-DLHEDYRDNATIYMRYADYVKIISVLSNGTTNFFDT 308 (387) T ss_pred HHHHHhHhhcCCCccccceeeeccccccccccchHHHHHHHHh-ccChhhhcCCEEEEechHHHHHHHHHhcCCCccccc Confidence 99765 45566665432 223345888988886 589999999999999999888776666677777753 Q ss_pred ccccCCcccccccceEeeccccccccccCcceEEEEehhcceEeeeccceEEEEeccchhhhhhceeeEEEEeeeCcEEe Q lcl|Aclame:pro 301 DPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKAT 380 (408) Q Consensus 301 ~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~ 380 (408) .+.+|+|+||+++++ ..+++||||++||..+ .++.+..+.+ ..++++.|++..|+|++++ T Consensus 309 -----~~~~llG~PV~~~~~---------~~~~~~GDf~~~~~~~--~~~~~~~~~~----~~~~~~~~~~~~r~Dg~v~ 368 (387) T protein:vir:96 309 -----PAEKVFGKPVVFTDA---------AVKPIVGDFNYFGINY--DGTTYDTDKD----VKKGEYLFVLTAWYDQQRT 368 (387) T ss_pred -----CCccccccceEEecC---------CCceeeechhhhhhhh--hhhhheeccc----ccCCceEEEEEEEeCcEee Confidence 456899999999764 2347999999987654 4555555443 3368999999999999999 Q ss_pred cccceEEEEeeccccCCCC Q lcl|Aclame:pro 381 DSEALVAGSFSAIADQVGN 399 (408) Q Consensus 381 ~~~a~~~l~~~~~~~~~~~ 399 (408) +|+||+++++++.+.+.+. 
T Consensus 369 ~~~A~~~l~~ka~~~~~~~ 387 (387) T protein:vir:96 369 LDSAFRIAKAKENTGPLPS 387 (387) T ss_pred chhheEEEEeecCCCCCCC Confidence 9999999999887655444 No 46 >protein:vir:2685 Length: 387 # NCBI annotation: hypothetical protein # Family: family:all:658 # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075504;genbank:gi:12719433;genbank:GeneID:920169 Probab=100.00 E-value=4.6e-62 Score=356.87 Aligned_cols=367 Identities=11% Similarity=0.174 Sum_probs=279.7 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccc Q lcl|Aclame:pro 1 MGVKLTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKG 80 (408) Q Consensus 1 M~~~~~i~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (408) |. +|+||++++.++.++++++++++.++..+...+.+++.+++.++++++++++.++++++..+............. T Consensus 1 Mk---~l~el~~~~~~~~~~~~~~~~el~e~~~~~~~~~eei~~~~~~~~~l~~~~~~l~~~~~~~e~~~~~~~~~~~~~ 77 (387) T protein:vir:26 1 MP---TLYELKQSLGMIGQQLKNKNDELSQKATDPNIDMEDIKQLETEKAGLQQRFNIVERQVQDIEEKEKAKVKDKGEA 77 (387) T ss_pred Cc---hHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccc Confidence 33 367888889999999999999998888888888899999999999999999999888887665443332222211 Q ss_pred ccccchhhhHHHHHHHHHHHhhcch--------hhHHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhce Q lcl|Aclame:pro 81 PLNKSENELKDKFVKDFVNMVRNPM--------AFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRV 152 (408) Q Consensus 81 ~~~~~~~~~~~~~~~a~~~~~~~~~--------~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~ 152 (408) .... .......++|.+++++.. 
........+++..+++++||++||+++.++|++.+++.++|++++++ T Consensus 78 ~~~~---~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~ 154 (387) T protein:vir:26 78 YQSL---SDNEKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARL 154 (387) T ss_pred CCCC---chhHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhhhccCCCCCCceeechhHHHHHHHHHHhhchhhhhcee Confidence 1111 111222334444444321 11122344667788889999999999999999999999999999999 Q ss_pred eecccCccceEEeeccCCccccchhcccccccccccccceeeeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 153 ESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVV 232 (408) Q Consensus 153 ~~~~~~~g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~ 232 (408) +++.+. . +|+.....+.+.|++|++..+++ .++|++|++.+++++++++||+|||+||.++|++||.++|+++++ T Consensus 155 ~~~~~~--~--~p~~~~~~~~a~~v~Eg~~~~~~-~~~f~~v~l~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~~~~ 229 (387) T protein:vir:26 155 TNIKGL--E--IPRVSYTLDDDDFITDVETAKEL-KAKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLA 229 (387) T ss_pred eecCCc--e--eeeeeccCCcccccccccccccc-ccccceeeechheeeeechhhHHHHhhhHHHHHHHHHHHHHHHHH Confidence 887542 3 34444445667999999999875 589999999999999999999999999999999999999999999 Q ss_pred HHHHH-HHhhccccccc-----------hhhhhhHHHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHhhhcccCceeecc Q lcl|Aclame:pro 233 VTRNQ-AIIEVMKAAPK-----------KPTIAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEP 300 (408) Q Consensus 233 ~~~~~-~~~~g~g~~~~-----------~~~~~~~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~lkd~~G~~~~~~ 300 (408) ++++. 
.|.+|+|++.+ .++...+|+++++++ .+++.|+.+++|+||+.+|..+.++++.+|+|+|.+ T Consensus 230 ~~e~~~~~~~g~g~g~~~g~~~~~~~~~~~~~~~~d~i~~~~~-~l~~~y~~na~~imn~~t~~~~~~~~~~~~~~~~~~ 308 (387) T protein:vir:26 230 AKERKDALAVSPKSGLEHMSFYNGSVKEVEGADMYDAIINALA-DLHEDYRDNATIYMRYADYVKIISVLSNGTTNFFDT 308 (387) T ss_pred HHHHHhHhhcCCCccccceeeeccccccccccchHHHHHHHHh-ccChhhhcCCEEEEechHHHHHHHHHhcCCCccccc Confidence 99765 45566665432 223345888988886 589999999999999999888776666677777753 Q ss_pred ccccCCcccccccceEeeccccccccccCcceEEEEehhcceEeeeccceEEEEeccchhhhhhceeeEEEEeeeCcEEe Q lcl|Aclame:pro 301 DPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKAT 380 (408) Q Consensus 301 ~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~ 380 (408) .+.+|+|+||+++++ ..+++||||++||..+ .++.+..+.+ ..++++.|++..|+|++++ T Consensus 309 -----~~~~llG~PV~~~~~---------~~~~~~GDf~~~~~~~--~~~~~~~~~~----~~~~~~~~~~~~r~Dg~v~ 368 (387) T protein:vir:26 309 -----PAEKVFGKPVVFTDA---------AVKPIVGDFNYFGINY--DGTTYDTDKD----VKKGEYLFVLTAWYDQQRT 368 (387) T ss_pred -----CCccccccceEEecC---------CCceeeechhhhhhhh--hhhhheeccc----ccCCceEEEEEEEeCcEee Confidence 456899999999764 2347999999987654 4555555443 3368999999999999999 Q ss_pred cccceEEEEeeccccCCCC Q lcl|Aclame:pro 381 DSEALVAGSFSAIADQVGN 399 (408) Q Consensus 381 ~~~a~~~l~~~~~~~~~~~ 399 (408) +|+||+++++++.+.+.+. 
T Consensus 369 ~~~A~~~l~~ka~~~~~~~ 387 (387) T protein:vir:26 369 LDSAFRIAKAKENTGPLPS 387 (387) T ss_pred chhheEEEEeecCCCCCCC Confidence 9999999999887655444 No 47 >protein:vir:93881 Length: 387 # NCBI annotation: ORF011 # Family: family:all:658 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239938;genbank:gi:66395599;genbank:GeneID:5130947 Probab=100.00 E-value=1.1e-61 Score=354.69 Aligned_cols=366 Identities=11% Similarity=0.171 Sum_probs=276.5 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccc Q lcl|Aclame:pro 1 MGVKLTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKG 80 (408) Q Consensus 1 M~~~~~i~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (408) |. ++.||++++.++.+++++++++++++..+.+.+.+++++++.+++.++++++.+++++++++............. T Consensus 1 Mk---~l~el~~~~~e~~~~~~~~~~~~~~~~~~~~~~~ee~~~~~~~~~~l~~~~~~l~~~~~~~e~~~~~~~~~~~~~ 77 (387) T protein:vir:93 1 MP---TLYELKQSLGMIGQQLKNKNDELSQKATDPNIDMEDIKQLETEKAGLQQRFNIVERQVKDIEEKEKAKVKDTGEA 77 (387) T ss_pred Cc---hHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccc Confidence 33 456788888888999999999999988888888888999999999999999998888777654332222111111 Q ss_pred ccccchhhhHHHHHHHHHHHhhcch--------hhHHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhce Q lcl|Aclame:pro 81 PLNKSENELKDKFVKDFVNMVRNPM--------AFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRV 152 (408) Q Consensus 81 ~~~~~~~~~~~~~~~a~~~~~~~~~--------~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~ 152 (408) .... .......++|.+++++.. 
......+.++++.+++++||++||+++.++|++.+++.++|+++|++ T Consensus 78 ~~~~---~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~al~~~t~s~gG~~IP~~~~~~Ii~~~~~~~~l~~~~~v 154 (387) T protein:vir:93 78 YQSL---NDHEKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARL 154 (387) T ss_pred CCCc---chhhHHHHHHHHHHHHHhhhhhhhhhhhhhHHHHHhhccCcCCCCceeechhHHHHHHHHHHhhchhhhheee Confidence 1111 111222334444443321 11223356778888999999999999999999999999999999999 Q ss_pred eecccCccceEEeeccCCccccchhcccccccccccccceeeeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 153 ESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVV 232 (408) Q Consensus 153 ~~~~~~~g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~ 232 (408) +++.+. . +|......+.+.|++|++..+++ .++|++|++++++++++++||+|||+||.++|++||.++|+++++ T Consensus 155 ~~~~~~--~--~p~~~~~~~~a~~v~E~~~~~~~-~~~f~~v~~~~~k~~~~~~iS~ell~Ds~~~l~~~i~~~la~~~~ 229 (387) T protein:vir:93 155 TNIKGL--E--IPRVSYTLDDDDFITDVETAKEL-KLKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLA 229 (387) T ss_pred eecCCc--e--EEEEeecCCccccccCccccccc-ccccceeeeeheeeeeechhhHHHHhhhHHHHHHHHHHHHHHHHH Confidence 887542 2 44444445667899999999874 589999999999999999999999999999999999999999999 Q ss_pred HHHHH-HHhhccccccc-----------hhhhhhHHHHHHHHHHhhhhhccCCCEEEEcHHHHHHH-HhhhcccCceeec Q lcl|Aclame:pro 233 VTRNQ-AIIEVMKAAPK-----------KPTIAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKL-ALVKTAEGKYLLE 299 (408) Q Consensus 233 ~~~~~-~~~~g~g~~~~-----------~~~~~~~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l-~~lkd~~G~~~~~ 299 (408) ++++. .|.+|+|++.+ ..+...+|+|++++. .+++.|+.+++|+||+.+|..+ ++++|.+| |+|. 
T Consensus 230 ~~e~~~~~~~g~g~g~p~g~l~~~~~~~v~~~~~~d~i~~~~~-~l~~~~~~~a~~~mn~~t~~~~~~~~~d~~~-~~~~ 307 (387) T protein:vir:93 230 AKERKDALAVSPKSGLDHMSFYNGSVKEVEGADMYDAIINALA-DLHEDYRDNATIYMRYADYVKIISVLSNGTT-NFFD 307 (387) T ss_pred HHHHHhHhhcCCCccccceeeeccccccccccchHHHHHHHHh-ccChhhhcCCEEEEechHHHHHHHHHhcCCC-cccc Confidence 99776 45667765543 223334788888875 5899999999999999987665 56666655 4443 Q ss_pred cccccCCcccccccceEeeccccccccccCcceEEEEehhcceEeeeccceEEEEeccchhhhhhceeeEEEEeeeCcEE Q lcl|Aclame:pro 300 PDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKA 379 (408) Q Consensus 300 ~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v 379 (408) +.|.+|+|+||+++++ ..+++||||++||.. +.++.+..+.+ +.++++.|++..|+|+++ T Consensus 308 -----~~~~~llG~PV~~~~~---------~~~~~~GDf~~~~~~--~~~~~~~~~~~----~~~~~~~~~~~~r~d~~v 367 (387) T protein:vir:93 308 -----TPAEKVFGKPVVFTDA---------AVKPIVGDFNYFGIN--YDGTTYDTDKD----VKKGEYLFVLTAWYDQQR 367 (387) T ss_pred -----cCCccccccceEEecC---------CCceeeeehhhhhee--hhhheeeeccc----ccCCceeEEEEeeeCcee Confidence 3456899999999764 234799999998764 45666655443 457899999999999999 Q ss_pred ecccceEEEEeeccccCCCC Q lcl|Aclame:pro 380 TDSEALVAGSFSAIADQVGN 399 (408) Q Consensus 380 ~~~~a~~~l~~~~~~~~~~~ 399 (408) ++|+||+++++++.+.+.+. 
T Consensus 368 ~~~eA~~~l~~k~~~~~~~~ 387 (387) T protein:vir:93 368 TLDSAFRIAKAKENTGSLPS 387 (387) T ss_pred echhheEEEEeecCCCCCCC Confidence 99999999999887765554 No 48 >protein:vir:97053 Length: 390 # NCBI annotation: putative head protein # Family: family:all:585 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453565;genbank:gi:84662600;genbank:GeneID:5142468 Probab=100.00 E-value=2.9e-61 Score=352.50 Aligned_cols=368 Identities=14% Similarity=0.160 Sum_probs=267.2 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccc Q lcl|Aclame:pro 3 VKLTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEA---MSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEK 79 (408) Q Consensus 3 ~~~~i~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (408) +.+.+++|+++++++.++++.+.++... +...+.++ +.+++.+++.+.++++++++++.+........ .. T Consensus 1 m~~~~~~l~~~~~~~~~~~~~~~e~~~~---~~~~~~e~~~~~~~~~~e~~~l~~~i~~~e~~~~~~~~~~~~~----~~ 73 (390) T protein:vir:97 1 MTDITAKLEATLANVTDSLKAFGERAVR---DGELNASARSKVDELFATVGNLSAEVQAARQRVAELEGNGAGG----DV 73 (390) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHh---hcCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccc----cc Confidence 3333566777777777777766555322 22233233 34444455555555555444433322221111 11 Q ss_pred cccccchhhhHHHHHHHHHHHhhcchhh---HHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeecc Q lcl|Aclame:pro 80 GPLNKSENELKDKFVKDFVNMVRNPMAF---MNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVS 156 (408) Q Consensus 80 ~~~~~~~~~~~~~~~~a~~~~~~~~~~~---~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~ 156 (408) ............+..++|......+... ......++...+++.++|++||+++...|++.+++.++|+++++++++. 
T Consensus 74 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~lip~~~~~~ii~~~~~~~~i~~~~~~~~~~ 153 (390) T protein:vir:97 74 QHVSVGDMFVASEQFQASTGRWNDRSARATMNIKAALNTASTDAAGSAGALTTPNRLPGFITPPDARLTVRDLIGSGRTD 153 (390) T ss_pred ccccchhhhhhhHHHHHHHHHhhhhhhhhhhHHHHHHHhhhcccccccccccchhhhHHHHHHHhhhhhhHhhcceeecc Confidence 1111122222233334444333222211 1122234445566788899999999999999999999999999999987 Q ss_pred cCccceEEeeccCCccccchhcccccccccccccceeeeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 157 TSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRN 236 (408) Q Consensus 157 ~~~g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~ 236 (408) +. .+.++...+..+.+.|++|++++|++ .++|+++++++++++++++||+|+++|++ ++++||.++|++++++++| T Consensus 154 ~~--~~~~~~~~~~~~~a~~v~Eg~~~~~~-~~~~~~i~~~~~k~~~~~~is~ell~ds~-~l~~~i~~~la~a~~~~~d 229 (390) T protein:vir:97 154 SA--LIEYVQETGFVNNAAIVAEGALKPES-SLKFAKKTDTTHVIAHTMKATRQILSDAP-QLASYMNNRLIRGLKVKED 229 (390) T ss_pred CC--ceEEEEEecCCcceeeecCCcccccc-ccceeEEEEeeeeEEEeehhhHHHHHhHH-HHHHHHHHHHHHHHHHHHH Confidence 55 45556555556678999999999964 58999999999999999999999999985 7999999999999999999 Q ss_pred HHHhhccccccchh----------------hhhhHHHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHhhhcccCceeecc Q lcl|Aclame:pro 237 QAIIEVMKAAPKKP----------------TIAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEP 300 (408) Q Consensus 237 ~~~~~g~g~~~~~~----------------~~~~~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~lkd~~G~~~~~~ 300 (408) .+|++|+|++..+. 
+...++++.+++ ..+...+..+++|+|||++|..|+++||++|+|+|.+ T Consensus 230 ~a~l~G~g~~~~p~Gi~~~~~~~~~~~~~~~~~~~d~~~~~~-~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~ 308 (390) T protein:vir:97 230 AEILRGTGANDGLLGLIPQATTYAAPTTIAGATRVDQLRLAM-LQASLAEYPASGIVINPIDWAAIELAKDANNQYLIGN 308 (390) T ss_pred HHHhhcCCCCccccceeeccccccccccccccchHHHHHHHH-HhhccccCCCCEEEEcHHHHHHHHHhhcCCCceeecC Confidence 99999998765321 223355666665 4578888999999999999999999999999999986 Q ss_pred ccccCCcccccccceEeeccccccccccCcceEEEEehhcceEeeeccceEEEEeccchhhhhhceeeEEEEeeeCcEEe Q lcl|Aclame:pro 301 DPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKAT 380 (408) Q Consensus 301 ~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~ 380 (408) . ..+.+++|+|+||++++. +| .++++||||+++|.+++++++++.++++. ..|.+|++.||++.|+|++++ T Consensus 309 ~-~~~~~~~l~G~pV~~~~~--~~-----~~~~~~gd~~~~~~~~~~~~~~i~~~~~~-~~f~~~~~~~r~~~r~d~~v~ 379 (390) T protein:vir:97 309 A-RGTLTPTLWGLPVVATQA--MA-----PGEFLVGAFDLAAQIFDQWDARVEIGYVN-DDFQRNMVTVLAEERLALVVY 379 (390) T ss_pred c-cCCCCceecceeeEEcCC--CC-----CCcEEEEeccceEEEEEecceEEEEeecc-cccccCcEEEEEEEeeccEEe Confidence 4 456678999999998653 44 34689999999888899999999997543 359999999999999999999 Q ss_pred cccceEEEEee Q lcl|Aclame:pro 381 DSEALVAGSFS 391 (408) Q Consensus 381 ~~~a~~~l~~~ 391 (408) +|+||+++++. 
T Consensus 380 ~~~a~v~~~~a 390 (390) T protein:vir:97 380 RPEALITGSFA 390 (390) T ss_pred ccccEEEEEeC Confidence 99999999998 No 49 >protein:vir:9361 Length: 402 # NCBI annotation: SLT orf 37-like protein # Family: family:all:658 # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803339;genbank:gi:29028650;genbank:GeneID:1258088 Probab=100.00 E-value=1e-61 Score=354.91 Aligned_cols=370 Identities=11% Similarity=0.164 Sum_probs=277.7 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccc Q lcl|Aclame:pro 1 MGVKLTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKG 80 (408) Q Consensus 1 M~~~~~i~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (408) =|..-+|.||++++.++.+++++++++++++..+...+.+++.+++.+++.++++++.++++++..+............. T Consensus 13 g~~mk~l~el~~~~~e~~~~~~~~~~el~~~~~~~~~~~ee~~~~~~~~~~l~~~~~~l~~~~~~~e~~~~~~~~~~~~~ 92 (402) T protein:vir:93 13 GNEMPTLYELKQSLGMIGQQLKNKNDELSQKATDPNIDMEDIKQLETEKAGLQQRFNIVERQVQDIEEKEKAKVKDKGEA 92 (402) T ss_pred CCCChHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccc Confidence 01112467788888999999999999998888888888899999999999999999999888887665443322222211 Q ss_pred ccccchhhhHHHHHHHHHHHhhcch--------hhHHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhce Q lcl|Aclame:pro 81 PLNKSENELKDKFVKDFVNMVRNPM--------AFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRV 152 (408) Q Consensus 81 ~~~~~~~~~~~~~~~a~~~~~~~~~--------~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~ 152 (408) .... .......++|.++++... 
........+++..+++++||++||++++.+|++.+++.++|+++|++ T Consensus 93 ~~~~---~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~a~~~~t~~~GG~lIP~~~~~~Ii~~~~~~~~l~~~~~v 169 (402) T protein:vir:93 93 YQSL---SDNEKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARL 169 (402) T ss_pred CCCC---chhHHHHHHHHHHHHHHHhhhhHHHHHHhHHHHHhhhccCCCcCCccccchhHHHHHHHhHHhhhhhhhhcee Confidence 1111 111223334444443321 11122345667788899999999999999999999999999999999 Q ss_pred eecccCccceEEeeccCCccccchhcccccccccccccceeeeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 153 ESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVV 232 (408) Q Consensus 153 ~~~~~~~g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~ 232 (408) +++.+. . +|........+.|++|++..+++ .++|++|++.+++++++++||+|||+||.++|++||.++|+++++ T Consensus 170 ~~~~~~--~--~p~~~~~~~~a~~v~Eg~~~~~~-~~~f~~i~~~~~k~~~~i~iS~ell~Ds~~~l~~~i~~~la~~~~ 244 (402) T protein:vir:93 170 TNIKGL--E--IPRVSYTLDDDDFITDVETAKEL-KAKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLA 244 (402) T ss_pred eecCCc--e--eeeeeccCCcccccccccccccc-ccccceeeecceeeeeechhhHHHHhhhHHHHHHHHHHHHHHHHH Confidence 887543 3 34433344567899999999875 589999999999999999999999999999999999999999999 Q ss_pred HHHHH-HHhhccccccc-----------hhhhhhHHHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHhhhcccCceeecc Q lcl|Aclame:pro 233 VTRNQ-AIIEVMKAAPK-----------KPTIAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEP 300 (408) Q Consensus 233 ~~~~~-~~~~g~g~~~~-----------~~~~~~~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~lkd~~G~~~~~~ 300 (408) ++++. .|.+|+|++.+ ..+...+|+|+++++ .+++.|+.+++|+||+.++..+.++++.+|+|+|. 
T Consensus 245 ~~e~~~~~~~g~g~g~p~g~~~~~~~~~~~~~~~~d~l~~~~~-~l~~~y~~na~~imn~~t~~~~~~~~~d~~~~~~~- 322 (402) T protein:vir:93 245 AKERKDALAVSPKSGLEHMSFYNGSVKEVEGADMYDAIINALA-DLHEDYRDNATIYMRYADYVKIISVLSNGTTNFFD- 322 (402) T ss_pred HHHHHhHhhcCCCccccceeeeccccccccccchHHHHHHHHh-ccChhhhcCCEEEEechHHHHHHHHHhcCCCcccc- Confidence 99765 46667665543 223344788888875 58999999999999999988776666666777774 Q ss_pred ccccCCcccccccceEeeccccccccccCcceEEEEehhcceEeeeccceEEEEeccchhhhhhceeeEEEEeeeCcEEe Q lcl|Aclame:pro 301 DPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKAT 380 (408) Q Consensus 301 ~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~ 380 (408) +.|.+|+|+||+++++ ..+++||||++||.+++ ++.+..+.+ ..++++.|++..|+|++++ T Consensus 323 ----~~~~~llG~PV~~t~~---------~~~i~~GDf~~~~~~~~--~~~~~~~~~----~~~~~~~~~~~~r~Dg~v~ 383 (402) T protein:vir:93 323 ----TPAEKVFGKPVVFTDA---------AVKPIVGDFNYFGINYD--GTTYDTDKD----VKKGEYLFVLTAWYDQQRT 383 (402) T ss_pred ----cCCccccccceEEecC---------CCceeeechhhhhhhhh--hhhhhhhhc----ccCCceEEEEEEEeCcEEe Confidence 3456899999998764 23479999999877654 444444332 2368999999999999999 Q ss_pred cccceEEEEeeccccCCCC Q lcl|Aclame:pro 381 DSEALVAGSFSAIADQVGN 399 (408) Q Consensus 381 ~~~a~~~l~~~~~~~~~~~ 399 (408) +|+||+++++++.+.+.++ T Consensus 384 ~~~A~~~l~ik~~~~~~~~ 402 (402) T protein:vir:93 384 LDSAFRIAKAKENTGPLPS 402 (402) T ss_pred chhheEEEEeecCCCCCCC Confidence 9999999999887654444 No 50 >protein:vir:101607 Length: 379 # NCBI annotation: major capsid protein precursor # Family: family:all:585 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112497;genbank:gi:53793597;uniprot:Q5ZGF6;genbank:GeneID:3101715 Probab=100.00 E-value=4.5e-61 Score=351.44 Aligned_cols=371 Identities=13% Similarity=0.094 Sum_probs=258.0 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccc Q lcl|Aclame:pro 1 MGVKLTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSE-LKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEK 79 
(408) Q Consensus 1 M~~~~~i~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (408) |++++..+++++..++++++.....++.+...++.. +++.. .+...++++.+...+.+++++.+............ T Consensus 1 m~~~e~~~~~~~~~~~l~~~~~~~~~e~~~~~e~~~---~~~~~~~~~~~~e~~~~~~~l~~~~~~~e~~~~~~~~~~~~ 77 (379) T protein:vir:10 1 MEALEIKVALEAIKGQVDSKSSAQALEVKGLIEALE---AKMTSEKDLAVNELKSDMAALQAHADKLDVKLKEKAKSEDK 77 (379) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---hHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccc Confidence 766655555555555554444444444333322211 11211 12223444444555555555444333222111111 Q ss_pred cccccchhhhHHHHHHHHHHHhhcchhhHHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeecccCc Q lcl|Aclame:pro 80 GPLNKSENELKDKFVKDFVNMVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSN 159 (408) Q Consensus 80 ~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~ 159 (408) .. ...........+....+..... ...+ +...+++++++.+||+++...|++.++..++|+++|+++++++++ T Consensus 78 ~~----~~~~~~~~~~~~~~~~~~~~~~-~~~~--~~~~~~~~~~~~~ip~~~~~~ii~~~~~~~~i~~~~~~~~~~~~~ 150 (379) T protein:vir:10 78 SD----SLVKSITENFNDIKEVRNGKSI-QVKA--VGDMTLPVNLTGAQPKDYNFDVVLNPSQMLNVSDIVGAVSISGGT 150 (379) T ss_pred ch----hHHHHHHHHHHhHHHHHhhhhh-hhhh--hcccccCCCCccccchhhhhHHHHhHHhhhhHHhhceeeeccCCc Confidence 00 0000000111111111211100 0111 122244556666899999999999999999999999999987665 Q ss_pred cceEEeeccCCccccchhcccccccccccccceeeeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 160 GSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAI 239 (408) Q Consensus 160 g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~ 239 (408) ..++..... 
..+.+.|++|++.+|+ ++++|++|++++++++++++||+|||+|++ ++.+||.++|+++++++++.+| T Consensus 151 ~~~~~~~~~-~~~~~~~v~Eg~~~~~-~~~~f~~i~~~~~k~~~~~~iS~ell~D~~-~l~~~i~~~la~~~~~~~~~~~ 227 (379) T protein:vir:10 151 YTFVRENGA-GEGAIGAQVEGATKGQ-KDYDISMIDVNTDFIAGFTRYSKKMANNLP-FLTSFIPNALRRDYAKAENAAF 227 (379) T ss_pred eEEEEeecC-CCcccccccCCccccc-cccceeeeEeeeeeEEeeehhhHHHHhhHH-HHHHHHHHHHHHHHHHHHHHHH Confidence 555544332 3356689999999996 458999999999999999999999999987 5999999999999999999999 Q ss_pred hhccccccc-----hhhhhhHHHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHhhhcccCceeeccccc--cCCcccccc Q lcl|Aclame:pro 240 IEVMKAAPK-----KPTIAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDPT--KPNSYLIKG 312 (408) Q Consensus 240 ~~g~g~~~~-----~~~~~~~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~lkd~~G~~~~~~~~~--~~~~~~l~G 312 (408) +.|+|+++. ..+...++++.+++.. +...+..++.|+|||++|.+|+++||++|+|+|+|++. .+.+.+|+| T Consensus 228 ~~g~~~~~~~~~~~~~~~~~~d~i~~~~~~-~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~l~G 306 (379) T protein:vir:10 228 NAVLAANATASTEIITNKNKVEMLINEIAK-QENLDFPVTAIVLRPTDYYDILVTQKSVGAGYGLPGVVTQDNGVLRING 306 (379) T ss_pred hcccccccccccccccCcccHHHHHHHHHh-hhhccCCCCEEEEcHHHHHHHHHhhccCCceeccCCccCCCCCcceecc Confidence 999986543 2333457788877754 66778888899999999999999999999999998764 456679999 Q ss_pred cceEeeccccccccccCcceEEEEehhcceEeeeccceEEEEeccchhhhhhceeeEEEEeeeCcEEecccceEEEEeec Q lcl|Aclame:pro 313 KQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFSA 392 (408) Q Consensus 313 ~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~ 392 (408) +||++++. +| .+.++||||++++. 
..|+++++.++++....|.+|++.||++.|+|+++++|+||+++++++ T Consensus 307 ~pvv~s~~--~~-----ag~~~~gdf~~~~~-~~~~~~~i~~~~~~~~~f~~~~~~~r~~~R~~~~v~~p~a~v~~~~~~ 378 (379) T protein:vir:10 307 IPLFRATW--LA-----ANKYYVGDWTRVTK-VTTEGLSLEFSEVEGTNFVKNNITARIEAQVALAVEQPAALIFGDFTA 378 (379) T ss_pred eeeEecCC--CC-----CCceEEeecccEEE-EEEeceEEEEeecccccccCCcEEEEEEEEeccEEecCccEEEEEecC Confidence 99988543 33 34589999999665 468899999998888889999999999999999999999999999999 Q ss_pred c Q lcl|Aclame:pro 393 I 393 (408) Q Consensus 393 ~ 393 (408) + T Consensus 379 ~ 379 (379) T protein:vir:10 379 V 379 (379) T ss_pred C Confidence 9 No 51 >protein:vir:81227 Length: 413 # NCBI annotation: gp6, major capsid protein # Family: family:all:585 # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456736;genbank:gi:157168379;hssp:P49861;interpro:IPR006444;uniprot:Q9MBJ9;genbank:GeneID:5580350 Probab=100.00 E-value=5.8e-61 Score=350.83 Aligned_cols=378 Identities=13% Similarity=0.120 Sum_probs=249.8 Q ss_pred HHHHHHHH-HHHHHHHHHHHHHHHHHHHhhhcccHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHhhhcccccccc Q lcl|Aclame:pro 6 TVNQLNEA-WIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLV---EAQAEQVVNMREEEKGP 81 (408) Q Consensus 6 ~i~el~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~ 81 (408) .++|.++. .++...+.++++ .+.++.+...++.+++.++++.+...++.+++... +.+.............. 
T Consensus 1 ~~ke~~~~~~~~~~~~~~e~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 76 (413) T protein:vir:81 1 MVKEAGDAPTNAQVAEIAEVK----SMVEQFKADEDAKRERAKSVKANQDFLRELQEATAGSVDSEKSGELTRKGEGYKS 76 (413) T ss_pred ChhhHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHhHHHhhhHhhhhhhhhh Confidence 12332222 222222222222 22222222223333333333333333333222211 11111110000000000 Q ss_pred ccc---chhhhHHHHHHHHHHHhhc----chhhHHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhceee Q lcl|Aclame:pro 82 LNK---SENELKDKFVKDFVNMVRN----PMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVES 154 (408) Q Consensus 82 ~~~---~~~~~~~~~~~a~~~~~~~----~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~ 154 (408) ... ............+...-.. ..........+....++++++|++||++++..|++.+++.++|++++++++ T Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~ii~~~~~~~~l~~~~~~~~ 156 (413) T protein:vir:81 77 IGEFFAKRAGDQIKQQAGGAQLNYSVGEYVAPRVKAASDPASTATLTDEFQGGYGTTWNRNIIYRRREKLVVADLMDNLT 156 (413) T ss_pred hhhhhhhhhhhHHHHHHHHHHhhhhhhhhhhhHHHhhhhhhhhcccccccccccchhhHHHHHHHHhhhhhHHhhcceee Confidence 000 0000000000111110000 011112223344556677889999999999999999999999999999999 Q ss_pred cccCccceEEeeccC-CccccchhcccccccccccccceeeeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 155 VSTSNGSRVYEKWTD-VTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVV 233 (408) Q Consensus 155 ~~~~~g~~~~~~~~~-~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~ 233 (408) +++.+..+++..... ....+.|++|++.+|+++.++|++|++++++++++++||+|||+|++. 
|++||.+.|++++++ T Consensus 157 ~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~-l~~~i~~~la~~~~~ 235 (413) T protein:vir:81 157 MTNTTIKYLMEKANRVVEGGFKTVAEGGKKPYMRFADFDIVTESLSKIAGLTKITDEMIEDYDF-LVSYINARLLEELAI 235 (413) T ss_pred ccCCceeEEEeccccccccccceecCcccccccCcccceeeEeeeeeEEEeehhhHHHHHHHHH-HHHHHHHHHHHHHHH Confidence 988777776655432 335678999999999877789999999999999999999999999875 999999999999999 Q ss_pred HHHHHHhhccccccchhhh---------------hhHHHHHHHHHHhh-hhhccCCCEEEEcHHHHHHHHhhhcccCcee Q lcl|Aclame:pro 234 TRNQAIIEVMKAAPKKPTI---------------AKFDDVITMINTAV-DPAIIATSSLLTNQSGLNKLALVKTAEGKYL 297 (408) Q Consensus 234 ~~~~~~~~g~g~~~~~~~~---------------~~~d~i~~~~~~~l-~~~~~~~a~~~~n~~~~~~l~~lkd~~G~~~ 297 (408) ++|.+|++|+|++.+..+. ..++++..++.... ...+..+ .|+|||++|.+|++|||++|+|+ T Consensus 236 ~~d~~~l~G~G~~~~~~Gi~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~-~~vmn~~~~~~l~~lkd~~G~~l 314 (413) T protein:vir:81 236 EEERQLLLGDGTGNNLTGLLKRDGIQTLAVSNKDELADSIYKAMTNISLATPFQAD-ALVINPLDYQELRLAKDANGQYY 314 (413) T ss_pred HHHHHHhccCCCCCcccccccccccccccccccchhHHHHHHHHHHhhhhccCCCc-EEEEcHHHHHHHHHhhccCCcee Confidence 9999999999987653222 22444444543322 2344444 59999999999999999999999 Q ss_pred eccccccC-------CcccccccceEeeccccccccccCcceEEEEehhcceEeeeccceEEEEeccchhhhhhceeeEE Q lcl|Aclame:pro 298 LEPDPTKP-------NSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIR 370 (408) Q Consensus 298 ~~~~~~~~-------~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r 370 (408) |.+..... 
.+.+|+|+||+++++ +| .+.++||||+++|++++|++++++++++..+.|.+|++.|| T Consensus 315 ~~~~~~~~~~~~~~~~~~~l~G~pv~~s~~--~~-----~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~r 387 (413) T protein:vir:81 315 GGGVFQGQYGSGGIMLDPAPWGLRTVQSQV--VP-----VGKPVVGAFRSAASVLRKGGVRIDSTNTNVDDFENNLITVR 387 (413) T ss_pred ccccccccccccccccCceecceeeEEcCC--CC-----cccEEEEecccEEEEEEecceEEEEeccccchhhcCcEEEE Confidence 98754432 345899999998653 34 34689999999899999999999999998888999999999 Q ss_pred EEeeeCcEEecccceEEEEeeccccC Q lcl|Aclame:pro 371 VIDRFDVKATDSEALVAGSFSAIADQ 396 (408) Q Consensus 371 ~~~r~d~~v~~~~a~~~l~~~~~~~~ 396 (408) +++|+|+++.+|+||+++++++++.+ T Consensus 388 ~~~r~d~~~~~~~a~~~l~~~~~~~p 413 (413) T protein:vir:81 388 AEERVGLMVTFPEAIVQLDVAEVVTP 413 (413) T ss_pred EEEeeccEEecccceEEEEecCCCCC Confidence 99999999999999999998776655 No 52 >protein:vir:95376 Length: 425 # NCBI annotation: phage major capsid protein # Family: family:all:635 # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764476;genbank:gi:115334630;genbank:GeneID:5179263 Probab=100.00 E-value=1.7e-60 Score=348.25 Aligned_cols=380 Identities=17% Similarity=0.198 Sum_probs=254.6 Q ss_pred CChHHHH-----HHHHHHHHHHHHHHHHHHHHHHHHHh--hhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------ Q lcl|Aclame:pro 1 MGVKLTV-----NQLNEAWIASGDKVTDFNDQINMALN--DDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQ------ 67 (408) Q Consensus 1 M~~~~~i-----~el~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------ 67 (408) |.++..+ ++++.++.++.++.+++.++..++.. 
+...+++++..+.++++.++++.+.+.+....++ T Consensus 1 ~~~~~~~~~~el~~~~~~l~el~~~~~el~~~~~el~~~~e~ak~eee~~~l~~ei~~le~e~~~l~~~~~~le~~~~~~ 80 (425) T protein:vir:95 1 MALRQLMLTKKIEQRKAALDELVKREQELQAKAAELEQAIEEAQTEEEVSAVEEEVAKLEDERNELNEKKSKLEGEIAQL 80 (425) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 7777432 44444555555555444433322211 1112344555555555544444443333322222 Q ss_pred -HHHhh-hccccc---ccccccchhhhHHHHHHHHHHHhhcchhh----HHHHHHHHhhccccccCceecchhhhhhhhh Q lcl|Aclame:pro 68 -AEQVV-NMREEE---KGPLNKSENELKDKFVKDFVNMVRNPMAF----MNTVSSKTETSGSDSAAGLTIPQDIRTMINT 138 (408) Q Consensus 68 -~~~~~-~~~~~~---~~~~~~~~~~~~~~~~~a~~~~~~~~~~~----~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~ 138 (408) ..... +..... ..................+.+.++.+... ............+.++||++||+++.+.|++ T Consensus 81 ~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vP~~~~~~Ii~ 160 (425) T protein:vir:95 81 EDELEQINSKQPSNQSRQKMQGSKGDVVEMNRLQVREMLKTGEYYKRSEVVEFYEKFRNLRAVAGGELTIPEVVVNRIMD 160 (425) T ss_pred HHHHHHhhhhccchhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhHHHHHHHHHHhhcccccCceeccHHHHHHHHH Confidence 11100 000000 00000000111111122222333222211 1111222333455678999999999999999 Q ss_pred hhhhhhhhhhhhceeecccCccceEEeeccCCccccchhcccccccccccccceeeeechheeeeehHHHHHHHhcchHH Q lcl|Aclame:pro 139 LVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAEN 218 (408) Q Consensus 139 ~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~~~ 218 (408) .+++.++|+++|++++++ |.+.+|+.. 
..+.+.|++|++++|+.+.++|++|++++++++++++||+|+++|+.++ T Consensus 161 ~l~~~~~i~~~~~~~~~~---g~~~ip~~~-~~~~a~~v~E~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~ 236 (425) T protein:vir:95 161 IMGDYTTLYPLVDKIRVK---GTTRILVDT-DTSPATWIEQSGALPTGDVGTIASIDFDGFKVGKVTFVDNYLLQDSIIN 236 (425) T ss_pred HHHhhhhHHHhhceeecC---ceeEEEEec-CCccccccccccccccccccccceeeeeheeeeeeehhhHHHHhccHHH Confidence 999999999999999875 345566554 4577899999999998777899999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhccccccch-----------------hhhhhHHHHHHHHHHhhhhhc--cCCCEEEEc Q lcl|Aclame:pro 219 ILAWLSSWIAKKVVVTRNQAIIEVMKAAPKK-----------------PTIAKFDDVITMINTAVDPAI--IATSSLLTN 279 (408) Q Consensus 219 ~~~~v~~~l~~~~~~~~~~~~~~g~g~~~~~-----------------~~~~~~d~i~~~~~~~l~~~~--~~~a~~~~n 279 (408) |++||.++|++++++++|.++++|+|+++.. .....++++++++.. +..++ ..+++|+|| T Consensus 237 l~~~i~~~l~~~i~~~~d~~il~G~G~~~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~v~~ 315 (425) T protein:vir:95 237 LDDYVTKKIARAIAKALDLAIVKGTGAANKQPLGIIPSLPPENQVTVEADNNLLKNLVKQIGL-IDTGDDSVGEIVAVMK 315 (425) T ss_pred HHHHHHHHHHHHHHHHHHHHhhccCCCCccccceeecccccccccccccccchHHHHHHHHHh-hhhhccccCceEEEEe Confidence 9999999999999999999999999875321 122335667666543 55554 367789999 Q ss_pred HHHH----HHHHhhhcccCceeeccccccCCcccccccceEeeccccccccccCcceEEEEehhcceEeeeccceEEEEe Q lcl|Aclame:pro 280 QSGL----NKLALVKTAEGKYLLEPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPT 355 (408) Q Consensus 280 ~~~~----~~l~~lkd~~G~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~ 355 (408) +.++ ..|+++||++|+|+|++. .+..++|+|+||++++. 
+| .+.++||||++ |.+.+|+++++.++ T Consensus 316 ~~~~~~~l~~l~~~kd~~g~~i~~~~--~~~~~~l~G~pvv~~~~--~~-----~~~i~~Gd~~~-~~~~~~~~~~i~~~ 385 (425) T protein:vir:95 316 RSTYYNRLVEFSIQVDSNGNVVGKLP--NLRTPDLLGLRVVFNNF--LD-----DDTVLFGEFEQ-YTLVERENITIDSS 385 (425) T ss_pred ChHHHHHHHHHHhhcCCCCceeeccC--CCCCccccceeeEEcCc--CC-----CccEEEEeccc-EEEEeecceEEEee Confidence 9874 356788999999999743 44456899999998653 44 34689999998 56678999999999 Q ss_pred ccchhhhhhceeeEEEEeeeCcEEecccceEEEEeeccccCC Q lcl|Aclame:pro 356 NIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFSAIADQV 397 (408) Q Consensus 356 ~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~ 397 (408) ++. .|.+|++.||++.|+||++++|+||+++++++...-. T Consensus 386 ~~~--~f~~~~~~~~~~~r~d~~~~~~~a~~~~~i~~~~~g~ 425 (425) T protein:vir:95 386 THV--KFTEDQTAFRGKGRFDGKPVKPEAFVLVTITDPVQGA 425 (425) T ss_pred ccc--ccccCceEEEEEEeeCcEeecccceEEEEecCcCCCC Confidence 876 4999999999999999999999999999887744333 No 53 >protein:vir:1433 Length: 435 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536362;genbank:gi:17975167;genbank:GeneID:929171 Probab=100.00 E-value=1.9e-61 Score=353.43 Aligned_cols=376 Identities=16% Similarity=0.199 Sum_probs=257.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccc----c Q lcl|Aclame:pro 5 LTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSA---EAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMRE----E 77 (408) Q Consensus 5 ~~i~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~ 77 (408) |+|+||++++.++.++++++.+... +....++ +++.+++.+++++..+++.+++..+............ . 
T Consensus 1 M~i~eL~e~r~~~~~~~~~l~~~~~---e~~~lt~ee~~~~~~l~~ei~~l~~~I~~~e~~~~~~~~~~~~~~~~~~~~~ 77 (435) T protein:vir:14 1 MNVNELRRERAAVNQRVQALAQIEV---GGTALSVEQQAEFDQLSSKFSELTAQIERAEAAERMAAAAAVPVDPNPTAVA 77 (435) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHh---ccCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccchhhhhh Confidence 7777888888888888887765322 1222222 2345555566666555555443322211111000000 0 Q ss_pred c-ccccccchhhhHHHHHHHHHHHhh---cchh------------hHHHHHHHHhhccccccCceecchhhhhhhhhhhh Q lcl|Aclame:pro 78 E-KGPLNKSENELKDKFVKDFVNMVR---NPMA------------FMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVR 141 (408) Q Consensus 78 ~-~~~~~~~~~~~~~~~~~a~~~~~~---~~~~------------~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~ 141 (408) . ..................|..+++ .... ........+.+.+++++||++||+++..+|++.++ T Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~l~ 157 (435) T protein:vir:14 78 APAAAPVHAQPKALEVKGAKMARMVRALAAARGDAQLASKLAIERGFGEEVAMSLNTLSPGAGGVLVPENLSSEVIELLR 157 (435) T ss_pred hccccccccccchhhhhHHHHHHHHHHHHhhcchhhHHHHHHHhhhhhhhhhhhcccCCcCCCccccchhHHHHHHHHHh Confidence 0 000000000011111112222221 1100 01111224456777888999999999999999999 Q ss_pred hhhhhhhh-hceeecccCccceEEeeccCCccccchhcccccccccccccceeeeechheeeeehHHHHHHHhcchH--H Q lcl|Aclame:pro 142 QYDSLQQY-VRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAE--N 218 (408) Q Consensus 142 ~~~~l~~~-~~~~~~~~~~g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~~--~ 218 (408) +.++++++ ++++++. 
++.+.+|...+ .+.++|++|++.+|+ +.++|++|++.+++++++++||+||++|+.+ + T Consensus 158 ~~~~i~~~~~~~~~~~--~~~~~~p~~~~-~~~a~~v~E~~~~~~-~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~~~ 233 (435) T protein:vir:14 158 PKSVVRKLGARTLPLS--NGNITIPRLKG-GAIVGYIGADTDIPT-TQQQFDDLKLTAKKMAALVPIANDLIKYAGVNPN 233 (435) T ss_pred hhchhhhhcceeeecC--CCceEEEEEeC-CcceeeeccCccccc-cccceeEEEeeeEEEEEeehhhHHHHHhhccCHH Confidence 99999987 6676654 45566776654 467799999999996 5589999999999999999999999999854 6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhccccccchhhh---------------hhHH----HHHHHHHHhhhh--hccCCCEEE Q lcl|Aclame:pro 219 ILAWLSSWIAKKVVVTRNQAIIEVMKAAPKKPTI---------------AKFD----DVITMINTAVDP--AIIATSSLL 277 (408) Q Consensus 219 ~~~~v~~~l~~~~~~~~~~~~~~g~g~~~~~~~~---------------~~~d----~i~~~~~~~l~~--~~~~~a~~~ 277 (408) |++||.++|++++++++|.+|++|+|++..+.+. .+++ ++.+++ ..+.. .+..+++|+ T Consensus 234 l~~~i~~~l~~ai~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~l~-~~~~~~~~~~~~~~~v 312 (435) T protein:vir:14 234 VDQIVVGDLTAAIGAREDKAFIRDDGTANTPKGLRFWALPSNVITASDASTLQKIETDLGKVI-LALENADANLTQPGWI 312 (435) T ss_pred HHHHHHHHHHHHHHHHHHHHhhccCCCCccccceeecccccceeccccccchhhHHHHHHHHH-HHhhhccccccCCEEE Confidence 9999999999999999999999999976422111 1122 233333 22332 355688999 Q ss_pred EcHHHHHHHHhhhcccCceeeccccccCCcccccccceEeecccccccc---ccCcceEEEEehhcceEeeeccceEEEE Q lcl|Aclame:pro 278 TNQSGLNKLALVKTAEGKYLLEPDPTKPNSYLIKGKQVIVVADRWLPNT---GSTVYPLYYGDMSQAITLFDRENMSLLP 354 (408) Q Consensus 278 ~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~---~~~~~~~~~gd~~~~~~~~~~~~~~i~~ 354 (408) |||++|..|+++||++|+|+|. +.. +++|+|+||++++. +|.. 
..+...++||||++++ +++|+++++.+ T Consensus 313 ~n~~~~~~L~~lkd~~G~~l~~-~~~---~g~l~G~Pv~~~~~--~p~~~~~~~~~~~i~~gd~s~~~-i~~~~~~~~~~ 385 (435) T protein:vir:14 313 MAPRTFRFLEGLRDGNGNKVYP-ELA---NGMLKGYPVGKTTQ--VPINLGETGKESEIYFTDFGDVF-IGEEETLEIDY 385 (435) T ss_pred EcHHHHHHHHHhhccCCceecc-CCC---CCeeecceeEeecc--ccccccCCCccceEEEeecccEE-EEEecccEEEE Confidence 9999999999999999999994 332 34899999999654 4442 3455679999999855 67899999999 Q ss_pred eccch---------hhhhhceeeEEEEeeeCcEEecccceEEEEeecccc Q lcl|Aclame:pro 355 TNIGA---------GAFETDTTKIRVIDRFDVKATDSEALVAGSFSAIAD 395 (408) Q Consensus 355 ~~~~~---------~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~ 395 (408) +++.. ..|.+|++.||+++|+||++++|+||++++..+... T Consensus 386 ~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~~~~ 435 (435) T protein:vir:14 386 SKEATYKDADGHMVSAFQRDQTLIRVIAKNDFGPRHVESIAVLAGVAWGA 435 (435) T ss_pred eccccccccccchhhhhhcChhheeeeeeeCceeecccceEEEecCCCCC Confidence 98754 569999999999999999999999999998777766 No 54 >protein:vir:8102 Length: 543 # NCBI annotation: gp6 # Family: family:all:21 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817683;genbank:gi:29566114;genbank:GeneID:1259308 Probab=100.00 E-value=3.2e-60 Score=346.72 Aligned_cols=370 Identities=12% Similarity=0.122 Sum_probs=250.9 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh-cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccc Q lcl|Aclame:pro 1 MGVKLTVNQLNEAWIASGDKVTDFNDQINMALNDD-NFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEK 79 (408) Q Consensus 1 M~~~~~i~el~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (408) |.++.. +.++.....+++.+.++.+....+. ....+++.+++.+.+.+...+....+++...+..... 
T Consensus 143 ~~l~e~----~~~~~~~~~e~k~~~e~~~~e~~e~~~~~~~~~e~l~~~~e~~~~~~~~~~~~~d~~e~~~~~------- 211 (543) T protein:vir:81 143 DSIEDC----RFRDPWNLSEMRTFGRDAEEVKGELRARALSAIEKMQGASDNVRAAATKIIERFDDEDSTLAR------- 211 (543) T ss_pred ccHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh------- Confidence 333222 2222222222222222111110000 0011122333333333333333222222222211110 Q ss_pred cccccchhhhHHHHHHHHHHHhhcchhhH------HHHHHHHhhccccccCceecchhhhhhhh-hhhhhhhhhhhhhce Q lcl|Aclame:pro 80 GPLNKSENELKDKFVKDFVNMVRNPMAFM------NTVSSKTETSGSDSAAGLTIPQDIRTMIN-TLVRQYDSLQQYVRV 152 (408) Q Consensus 80 ~~~~~~~~~~~~~~~~a~~~~~~~~~~~~------~~~~~~a~~~~t~~~gg~~vP~~~~~~ii-~~~~~~~~l~~~~~~ 152 (408) .........+.++|.++++...... ...........++++||++||++++..|+ ..+++.++|+.++++ T Consensus 212 ----~~~~~~~~~~~~a~~~~~~~~~~~~l~~~e~~~~~~~~~~~~t~~~gg~lip~~~~~~ii~~~~~~~~~l~~~~~~ 287 (543) T protein:vir:81 212 ----QCLATSSPAYLRAWSKMARNPHAAILTEEEKRAINEVRAMGLTKADGGYLVPFQLDPTVIITSNGSLNDIRRFARQ 287 (543) T ss_pred ----hhhhhhhhhhhhHHHHHHHhhHHHHhhhhhhhhhhhhhhcccccccCcccCchhhhhHHHHHHHhhhchhhhhccc Confidence 0111122334445544444322211 11122223345678899999999998865 667888999999987 Q ss_pred eecccCccceEEeeccCCccccchhcccccccccccccceeeeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 153 ESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVV 232 (408) Q Consensus 153 ~~~~~~~g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~ 232 (408) .++ +|.+.++... 
..+.+.|++|++.+|+ +.++|++|++++++++++++||+++++|+ ++|.+||.+.|+++++ T Consensus 288 ~~~---~g~~~~~~~~-~~~~a~~v~Eg~~~~~-~~~~~~~i~~~~~k~~~~~~is~ell~d~-~~~~~~i~~~l~~~~~ 361 (543) T protein:vir:81 288 VVA---TGDVWHGVSS-AAVQWSWDAEFEEVSD-DSPEFGQPEIPVKKAQGFVPISIEALQDE-ANVTETVALLFAEGKD 361 (543) T ss_pred ccC---CcceEEEEec-CCcceeecccCccccc-cccccceeeeeeeeeEeeehhhHHHHhcc-HHHHHHHHHHHHHHHH Confidence 665 3556666654 4577899999999986 56899999999999999999999999998 5899999999999999 Q ss_pred HHHHHHHhhccccccchh------------------hhhhHHHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHhhhcccC Q lcl|Aclame:pro 233 VTRNQAIIEVMKAAPKKP------------------TIAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEG 294 (408) Q Consensus 233 ~~~~~~~~~g~g~~~~~~------------------~~~~~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~lkd~~G 294 (408) ++++.+|++|+|++..+. +...++++++++ ..+++.|..+++|+|||++|..|+++||++| T Consensus 362 ~~~d~ail~G~Gt~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~l~~~~~~~~~~v~n~~~~~~l~~lkd~~G 440 (543) T protein:vir:81 362 ELEAVTLTTGTGQGNQPTGIVTALAGTAAEIAPVTAETFALADVYAVY-EQLAARHRRQGAWLANNLIYNKIRQFDTQGG 440 (543) T ss_pred HHHHHHHhccCCCCcccccchhhcccccccccccccccccHHHHHHHH-HhhhccccCCcEEEEcHHHHHHHHHhhcCCC Confidence 999999999998753211 122356666665 4588999999999999999999999999999 Q ss_pred ceeeccccccCCcccccccceEeecccccc---ccccCcceEEEEehhcceEeeeccceEEEEeccch--hhhhhceeeE Q lcl|Aclame:pro 295 KYLLEPDPTKPNSYLIKGKQVIVVADRWLP---NTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGA--GAFETDTTKI 369 (408) Q Consensus 295 ~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~---~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~--~~f~~~~~~~ 369 (408) +|+|.+ +..+.+++|+|+||+++++.+.. ....+..+++||||+. 
|.++++++++|.++++.+ ..|.+|++.| T Consensus 441 ~~l~~~-~~~g~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~i~~gd~~~-~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~ 518 (543) T protein:vir:81 441 AGLWTT-IGNGEPSQLLGRPVGEAEAMDANWNTSASADNFVLLYGNFQN-YVIADRIGMTVEFIPHLFGTNRRPNGSRGW 518 (543) T ss_pred ceeccC-cCCCCCccccceeeEEeccccccccccccCCcceEEEeeccc-eeEEeecccEEEEeccccccchhhcCceEE Confidence 999976 55566789999999998763322 2345677899999986 566789999999988764 3567899999 Q ss_pred EEEeeeCcEEecccceEEEEeeccc Q lcl|Aclame:pro 370 RVIDRFDVKATDSEALVAGSFSAIA 394 (408) Q Consensus 370 r~~~r~d~~v~~~~a~~~l~~~~~~ 394 (408) +++.|+|+++++|+||+++++++++ T Consensus 519 ~~~~r~d~~v~~~~A~~~l~~~~~a 543 (543) T protein:vir:81 519 FAYYRMGADVVNPNAFRLLNVETAS 543 (543) T ss_pred EEEEeeccEeecccceEEEEecccC Confidence 9999999999999999999999998 No 55 >protein:vir:94673 Length: 419 # NCBI annotation: major capsid protein # Family: family:all:585 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579208;genbank:gi:93007444;genbank:GeneID:5076792 Probab=100.00 E-value=4.6e-60 Score=345.90 Aligned_cols=382 Identities=12% Similarity=0.114 Sum_probs=269.5 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccc- Q lcl|Aclame:pro 1 MGVKLTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEK- 79 (408) Q Consensus 1 M~~~~~i~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~- 79 (408) |+..+.++|+++++.+..+.+++..++.++..++.+.. ..+++.+++.+..+++.++......+............ 
T Consensus 1 m~~~~~lee~~a~l~~~~~~~~~~~~~~~~~~~e~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (419) T protein:vir:94 1 MPPTPTLEEQRAALLARLDDTSLTTEQVQEIVAEARGL---ADALQAESDRAAARAALLRTAPPAPKGPADGGTPLTPAE 77 (419) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccc Confidence 99999999999999998888877777766654443322 23333444444444444443333322221111111100 Q ss_pred --cccccchhhhHHHHHHHHHHHhhcchhhHHH--H-----HHHHhhcc-ccccCceecchhhhhhhhhhhhhhhhhhhh Q lcl|Aclame:pro 80 --GPLNKSENELKDKFVKDFVNMVRNPMAFMNT--V-----SSKTETSG-SDSAAGLTIPQDIRTMINTLVRQYDSLQQY 149 (408) Q Consensus 80 --~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~--~-----~~~a~~~~-t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~ 149 (408) ...............+.+....+.+...... . .......+ +..++++++|..+...|+..++....++++ T Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~i~~~ 157 (419) T protein:vir:94 78 AGTFRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADL 157 (419) T ss_pred cccccchhhhhhhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhccccccccccCCcccccchhhhHHHHHHHhhhhhhhhc Confidence 0001111111111222222222222111100 0 11111222 234445667777777788888888899999 Q ss_pred hceeecccCccceEEeec-----cCCccccchhcccccccccccccceeeeechheeeeehHHHHHHHhcchHHHHHHHH Q lcl|Aclame:pro 150 VRVESVSTSNGSRVYEKW-----TDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWLS 224 (408) Q Consensus 150 ~~~~~~~~~~g~~~~~~~-----~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~ 224 (408) |+++++.+....++.... ....+.+.|++|++++++ ++++|+++++++++++++++||+|+++|+. +|++||. 
T Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~-~~~~~~~i~~~~~k~~~~~~is~ell~d~~-~l~~~i~ 235 (419) T protein:vir:94 158 LDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQ-STLSFDTITTTLKTVAHWLPITRQAADDNS-QLMGYIQ 235 (419) T ss_pred ceeeeccCCceeeeeeccccccccccCcccceecCCccccc-cccceeeEEeeeeeEEEeehhhHHHHHhHH-HHHHHHH Confidence 999998765544443211 112345789999999986 558999999999999999999999999975 7999999 Q ss_pred HHHHHHHHHHHHHHHhhccccccchh--------------------hhhhHHHHHHHHHHhhhhhccCCCEEEEcHHHHH Q lcl|Aclame:pro 225 SWIAKKVVVTRNQAIIEVMKAAPKKP--------------------TIAKFDDVITMINTAVDPAIIATSSLLTNQSGLN 284 (408) Q Consensus 225 ~~l~~~~~~~~~~~~~~g~g~~~~~~--------------------~~~~~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~ 284 (408) ++|++++++++|.+|++|+|++.|.+ ....++++++++.. +...+..+++|+|||++|. T Consensus 236 ~~la~a~~~~~d~aii~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~t~~~~~~~l~~~~~~-~~~~~~~~~~~v~n~~~~~ 314 (419) T protein:vir:94 236 GRLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTV-AEIAGFPPDGVVVHPQDWE 314 (419) T ss_pred HHHHHHHHHHHHHHHHhccCcccccceecccccccccccccccccccchhHHHHHHHHHh-hhhccCCCCEEEEcHHHHH Confidence 99999999999999999999875422 11235677777755 6677778889999999999 Q ss_pred HHHhhhcccCc-eeeccccccCCcccccccceEeeccccccccccCcceEEEEehhcceEeeeccceEEEEeccchhhhh Q lcl|Aclame:pro 285 KLALVKTAEGK-YLLEPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFE 363 (408) Q Consensus 285 ~l~~lkd~~G~-~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~ 363 (408) .|++++|++|+ |++++++.++.+++|+|+||+++++ +| .+.++||||+++|.+++|++++++++++..+.|. 
T Consensus 315 ~l~~~k~~~~~~~~~~~~~~~~~~~~l~G~pV~~~~~--~~-----~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~ 387 (419) T protein:vir:94 315 SIELDQAPGSGVFRVIANVQGEATPRIWGLNVVSTVA--IA-----QGTALVGGFRQGATLWSRQGITVLMTDSHADFFT 387 (419) T ss_pred HHHHHhhcCCCceeecCCcccCCCccccceeeEEcCC--CC-----CccEEEeeccceEEEEEecceEEEEeccccchhh Confidence 99999998665 5678888888899999999998654 44 3458999999988899999999999999888899 Q ss_pred hceeeEEEEeeeCcEEecccceEEEEeecccc Q lcl|Aclame:pro 364 TDTTKIRVIDRFDVKATDSEALVAGSFSAIAD 395 (408) Q Consensus 364 ~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~ 395 (408) +|++.||++.|+|+++++|+||+++++++++. T Consensus 388 ~~~~~~r~~~r~d~~v~~~~a~~~~~~~aa~~ 419 (419) T protein:vir:94 388 ANTLVILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) T ss_pred cCcEEEEEEEeeccEEeccccEEEEEeccCCC Confidence 99999999999999999999999999998877 No 56 >protein:vir:80376 Length: 435 # NCBI annotation: gp6, major capsid head protein # Family: family:all:21 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111085;genbank:gi:134288639;genbank:GeneID:4960624 Probab=100.00 E-value=1e-60 Score=349.46 Aligned_cols=376 Identities=16% Similarity=0.198 Sum_probs=256.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccc--c-- Q lcl|Aclame:pro 5 LTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAE---AMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMRE--E-- 77 (408) Q Consensus 5 ~~i~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~-- 77 (408) |+|+||++++.++.++++++.+... +....+++ ++.+++.++++++.+++++++............... . 
T Consensus 1 M~l~eL~~~r~~~~~~~~~l~~~~~---e~~~l~~ee~~~~~~l~~ei~~l~~~i~~~e~~e~~~~~~~~~~~~~~~~~~ 77 (435) T protein:vir:80 1 MNVNELRRERAAVNQRVQALAQIEV---GGTALSVEQQAEFDQLSSKFNELTAQIERAEAAERMAAAAAVPVDPNPAAVT 77 (435) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHh---ccCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccchhhhhc Confidence 7777888888888888887765322 12222333 344555566666666655443222111111000000 0 Q ss_pred c-ccccccchhhhHHHHHHHHHHHhh---cchh------------hHHHHHHHHhhccccccCceecchhhhhhhhhhhh Q lcl|Aclame:pro 78 E-KGPLNKSENELKDKFVKDFVNMVR---NPMA------------FMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVR 141 (408) Q Consensus 78 ~-~~~~~~~~~~~~~~~~~a~~~~~~---~~~~------------~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~ 141 (408) . ..................|.++++ .... ........+.+.++++.||++||+++.++|++.++ T Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~lvP~~~~~~ii~~l~ 157 (435) T protein:vir:80 78 ASAAAPVYAQPKAPEVKGAKMARMVRALAAARGDAQLASKLAIERGFGEEVAMSLNTLSPGAGGVLVPENLSSEVIELLR 157 (435) T ss_pred cccccccccccchhhhhHHHHHHHHHHHHhccchhHHHHHHHHhhhhhhhhhhhhcccCCCCCccccchhHHHHHHHHHh Confidence 0 000000000000111111222111 1000 01111223355677788999999999999999999 Q ss_pred hhhhhhhh-hceeecccCccceEEeeccCCccccchhcccccccccccccceeeeechheeeeehHHHHHHHhcchH--H Q lcl|Aclame:pro 142 QYDSLQQY-VRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAE--N 218 (408) Q Consensus 142 ~~~~l~~~-~~~~~~~~~~g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~~--~ 218 (408) +.++++++ ++++++.. 
+.+.+|...+ .+.+.|++|++.+|+ +.++|++|++.+++++++++||+|+|+|+.+ + T Consensus 158 ~~~~i~~~~~~~v~~~~--~~~~~p~~~~-~~~a~~v~E~~~~~~-~~~~f~~i~~~~~k~~~~~~is~ell~ds~~~~~ 233 (435) T protein:vir:80 158 PKSVVRKLGARTLPLSN--GNITIPRLKG-GAIVGYIGADTDIPT-TQQQFDDLKLTAKKMAALVPIANDLIKYAGVNPN 233 (435) T ss_pred hhchhhhccceeeecCC--CceEEEEEeC-CcceeeeccCccccc-cccceeeEEEeeEEEEEeehhhHHHHHhhcccHH Confidence 99999997 67766654 4566666543 466799999999997 5589999999999999999999999999854 7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhccccccchhhh---------------hhHH----HHHHHHHHhhhh--hccCCCEEE Q lcl|Aclame:pro 219 ILAWLSSWIAKKVVVTRNQAIIEVMKAAPKKPTI---------------AKFD----DVITMINTAVDP--AIIATSSLL 277 (408) Q Consensus 219 ~~~~v~~~l~~~~~~~~~~~~~~g~g~~~~~~~~---------------~~~d----~i~~~~~~~l~~--~~~~~a~~~ 277 (408) +++||.++|+++++++++.+|++|+|++..+.+. ..++ ++.+++. .+.. .++.+++|+ T Consensus 234 l~~~i~~~l~~a~~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~-~~~~~~~~~~~~~~v 312 (435) T protein:vir:80 234 VDQIVVGDLTAAIGAREDKAFIRDDGTANTPKGLRFWALPGNVITASDGSTLQKIETDLGKAIL-ALENADANLTQPGWI 312 (435) T ss_pred HHHHHHHHHHHHHHHHHHHHhhccCCCCCcccceeecccccceeecccccchhhHHHHHHHHHH-HhhccccccccCEEE Confidence 9999999999999999999999999875322111 1122 3333332 2222 356788999 Q ss_pred EcHHHHHHHHhhhcccCceeeccccccCCcccccccceEeeccccccc---cccCcceEEEEehhcceEeeeccceEEEE Q lcl|Aclame:pro 278 TNQSGLNKLALVKTAEGKYLLEPDPTKPNSYLIKGKQVIVVADRWLPN---TGSTVYPLYYGDMSQAITLFDRENMSLLP 354 (408) Q Consensus 278 ~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~---~~~~~~~~~~gd~~~~~~~~~~~~~~i~~ 354 (408) |||.++..|+++||++|+|+|. +.. +++|+|+||++++. +|. 
...+..+++||||++++ +.+|+++++++ T Consensus 313 mn~~~~~~L~~lkd~~G~~l~~-~~~---~~~l~G~pv~~~~~--~p~~~~~~~~~~~i~~gd~s~~~-i~~~~~~~i~~ 385 (435) T protein:vir:80 313 MAPRTFRFLEGLRDGNGNKVYP-ELA---NGMLKGYPVGKTTQ--VPINLGEAGKESEIYFTDFGDVF-IGEEETLEIDY 385 (435) T ss_pred EcHHHHHHHHhhhccCCceecc-CCC---CCeEeeeeeEEecc--ccccccCCCCcceEEEEEcccEE-EEeecceEEEE Confidence 9999999999999999999994 332 34899999998754 443 23456679999999855 67899999999 Q ss_pred eccch---------hhhhhceeeEEEEeeeCcEEecccceEEEEeecccc Q lcl|Aclame:pro 355 TNIGA---------GAFETDTTKIRVIDRFDVKATDSEALVAGSFSAIAD 395 (408) Q Consensus 355 ~~~~~---------~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~ 395 (408) +++.. ..|.+|++.||++.|+||++.+|+||++++..+... T Consensus 386 ~~~~~~~~~~~~~~~~f~~n~~~~r~~~r~d~~~~~~~a~~~l~~~~~~~ 435 (435) T protein:vir:80 386 SKEATYKDADGHMVSAFQRDQTLIRVIAKNDFGPRHVESIAVLSGVAWGA 435 (435) T ss_pred eccccccccccchhhhhhcCcceeeeeeeeCcEeecccceEEEeccCCCC Confidence 98764 459999999999999999999999999999777766 No 57 >protein:vir:104256 Length: 458 # NCBI annotation: major head protein precursor # Family: family:all:27070 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006977;genbank:gi:46401878;genbank:GeneID:2777673 Probab=100.00 E-value=3.5e-59 Score=341.05 Aligned_cols=382 Identities=13% Similarity=0.054 Sum_probs=243.7 Q ss_pred CChHH-HHHH-H-----HHHHHH--HHHH---HHHHHHHHHH-HH-hhhcccHHHHHHHHHHHHHHHHHHHHHHHHH--- Q lcl|Aclame:pro 1 MGVKL-TVNQ-L-----NEAWIA--SGDK---VTDFNDQINM-AL-NDDNFSAEAMSELKNKRDNEKVRRDALREQL--- 63 (408) Q Consensus 1 M~~~~-~i~e-l-----~~~~~~--~~~~---~~~~~~~~~~-~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--- 63 (408) |.++| +++| + .+.+.. ..++ .+.++++.++ .+ +..+...+.+.+.+.++++...+++.+.++. 
T Consensus 1 ~~~~~~~~~~e~~~~e~a~~~~~~~~~~k~~e~~~~~ke~~~~~l~~~~e~~~k~~~E~~~~le~~~ee~k~l~ee~~~~ 80 (458) T protein:vir:10 1 MTIDINKLKEELGLGDLAKSLEGLTAAQKAQEAERMRKEQEEKELARMNDLVSKAVGEDRKRLEEALELVKSLDEKSKKS 80 (458) T ss_pred CccchhhhhhhhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 55553 1111 1 011100 0000 0000000000 00 0000000011111111111111111111110 Q ss_pred ---------------HHHHHHHhhh----------cccccccccccchhhhHHHHHHHHHHHhhcchhhHHHH---HHHH Q lcl|Aclame:pro 64 ---------------VEAQAEQVVN----------MREEEKGPLNKSENELKDKFVKDFVNMVRNPMAFMNTV---SSKT 115 (408) Q Consensus 64 ---------------~~~~~~~~~~----------~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~---~~~a 115 (408) .+........ .....................++|.+.+.......... ...+ T Consensus 81 ~~~~a~~~e~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~a 160 (458) T protein:vir:10 81 NELFAQTVEKQQETIVGLQDEIKSLLTAREGRSFVGDSVAKALYGTQENFEDEVEKLVLLSYVMEKGVFETEHGQRHLKA 160 (458) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhccchhhhhhHHHHHHHHHHHHHHHhhccchhhhhhhhhhh Confidence 0000000000 00000000011111111222334444333322111111 1122 Q ss_pred h-hccccccCceecchhhhhhhhhhhhhhhhhhhhhceeecccCccceEEeeccCCccccchhcccccccccc-----cc Q lcl|Aclame:pro 116 E-TSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLD-----NP 189 (408) Q Consensus 116 ~-~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~E~~~~~~~~-----~~ 189 (408) . ..++.++||++||+++++.|++.+++.++|+++|+++++++... .++... ..+.+.|++|++..++.. 
.+ T Consensus 161 ~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~--~~~~~~-~~~~a~~v~e~~~~~~~~~~~~~~~ 237 (458) T protein:vir:10 161 VNQSSSVEVSSESYETIFSQRIIRDLQKELVVGALFEELPMSSKIL--TMLVEP-DAGKATWVAASTYGTDTTTGEEVKG 237 (458) T ss_pred hhhcccCccccceehhhHhHHHHHHHHhhhhHHhhcceeecCCcce--EEEEec-CCcceeecccccccccccccccccc Confidence 2 23455678999999999999999999999999999999876543 344433 346679999998887643 56 Q ss_pred cceeeeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhccccccchh------------------- Q lcl|Aclame:pro 190 QLTIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVMKAAPKKP------------------- 250 (408) Q Consensus 190 ~f~~v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~~g~g~~~~~~------------------- 250 (408) +|+++++++++++++++||+|+++|+.++|.+||.++|++++++++|.+|++|+|++.|.+ T Consensus 238 ~~~~i~~~~~k~~~~v~is~ell~ds~~~~~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~ 317 (458) T protein:vir:10 238 ALKEIHFSTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMTGDGSGKPKGLLTLASEDSAKVVTEAKAD 317 (458) T ss_pred cceeeEeeeeeEEeeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCccceeeecccccccceeeccccc Confidence 8999999999999999999999999999999999999999999999999999999865421 Q ss_pred --hhhhHHHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHhhhcccCceeecccc----ccCCcccccccceEeecccccc Q lcl|Aclame:pro 251 --TIAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDP----TKPNSYLIKGKQVIVVADRWLP 324 (408) Q Consensus 251 --~~~~~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~lkd~~G~~~~~~~~----~~~~~~~l~G~pv~~~~~~~~~ 324 (408) +..+++++++++. .+...|..+++|+|||++|..|+++||++|+|+|.+.. ..+.+.+|+|+||++++. 
+| T Consensus 318 ~~~~~~~~~i~~~~~-~l~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~l~G~pv~~~~~--~p 394 (458) T protein:vir:10 318 GSVLVTAKTISKLRR-KLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDSVKLQGQVGRIYGLPVVVSEY--FP 394 (458) T ss_pred ccccccHHHHHHHHH-hhhhhhcCCCEEEEcHHHHHHHHhhcccCCceeeccccccccccCcCceecceeeEEccc--cc Confidence 1235788888764 58889999999999999999999999999999987643 335567999999998654 66 Q ss_pred ccccCcceEEEEehhcceEeeeccceEEEEeccchhhhhhceeeEEEEeeeCcEEecccceEEEEeecc Q lcl|Aclame:pro 325 NTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFSAI 393 (408) Q Consensus 325 ~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~ 393 (408) +. +++..++||||+++|.+++|.++++.++++ +.++++.||++.|+|+.+++|+||++.++.+. T Consensus 395 ~~-~~~~~~~~~~f~~~~~~~~~~~~~v~~d~~----~~~~~~~~~~~~r~~~~v~~~~a~v~~~~aa~ 458 (458) T protein:vir:10 395 AK-ANSAEFAVIVYKDNFVMPRQRAVTVERERQ----AGKQRDAYYVTQRVNLQRYFANGVVSGTYAAS 458 (458) T ss_pred cc-cCCcceEEEEecccEEEEEeeceEEEeecc----cCCCceEEEEEEEecceEecccceEEEeeccC Confidence 63 456678999999889999999999988764 45889999999999999999999999888777 No 58 >protein:vir:105038 Length: 428 # NCBI annotation: major capsid head protein precursor # Family: family:all:21 # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006586;genbank:gi:46402092;genbank:GeneID:2777903 Probab=100.00 E-value=2.6e-59 Score=341.79 Aligned_cols=374 Identities=14% Similarity=0.209 Sum_probs=254.0 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccc Q lcl|Aclame:pro 1 MGVKLTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEA---MSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREE 77 (408) Q Consensus 1 M~~~~~i~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (408) |+. |++|++++.++.++++++.+.. .++...++++ +.+++.++++++.+++.++++.+...... ...+.. 
T Consensus 1 M~k---l~~L~e~r~~l~~~~~~l~~~~---~e~~~lt~ee~~~~~~l~~e~~~l~~~i~~~e~~e~~~~~~~-~~~~~~ 73 (428) T protein:vir:10 1 MPQ---IEELRRQRAGINEQIQALATIE---ATNGTLTAEQLTEFAGLQQQFTDISAKMDRMEATERAAALVA-KPVKAT 73 (428) T ss_pred Cch---HHHHHHHHHHHHHHHHHHHHHH---hccCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-hhhhch Confidence 777 5677777777777777776532 1222233333 34555566666666555443322211111 000100 Q ss_pred ccccc----ccchhhhHHHHHHHHHHHhhcc----------hhh-HHHHHHHHhhccccccCceecchhhhhhhhhhhhh Q lcl|Aclame:pro 78 EKGPL----NKSENELKDKFVKDFVNMVRNP----------MAF-MNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQ 142 (408) Q Consensus 78 ~~~~~----~~~~~~~~~~~~~a~~~~~~~~----------~~~-~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~ 142 (408) ..... ............+......+.. ... ......++. ..++++||++||+++.++|++.+++ T Consensus 74 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~gg~liP~~~~~~ii~~l~~ 152 (428) T protein:vir:10 74 QHGPAVIVKAEPKQYTGAGMTRMVMSIAAAQGNLQDAAKFASDELNDQSVSMAI-STAAGSGGVLIPQNIHSEVIELLRD 152 (428) T ss_pred hhccccccccccchhhhHHHHHHHHHHHHhhhhHHHHHHHhhhhhhhhhHhhhh-cccccCCccccchhHHHHHHHHHhh Confidence 00000 0000000111111111111110 000 111122222 2344578999999999999999999 Q ss_pred hhhhhhh-hceeecccCccceEEeeccCCccccchhcccccccccccccceeeeechheeeeehHHHHHHHhcchHHHHH Q lcl|Aclame:pro 143 YDSLQQY-VRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAENILA 221 (408) Q Consensus 143 ~~~l~~~-~~~~~~~~~~g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~~~~~~ 221 (408) .++|+++ ++++++. 
+|.+.+|+..+ .+.+.|++|++.+|+ ++++|++|++.+++++++++||+|+++|+.++|++ T Consensus 153 ~~~l~~~~~~~~~~~--~g~~~~p~~~~-~~~a~~v~Eg~~~~~-~~~~f~~i~~~~~k~~~~v~is~ell~ds~~~l~~ 228 (428) T protein:vir:10 153 RTIVRKLGARSIPLP--NGNMSLPRLAG-GATASYTGENQDAKV-SEARFDDVKLTAKTMIAMVPISNALIGRAGFNVEQ 228 (428) T ss_pred hchhhhhcceeeecC--CcceEEEEEeC-CcceeeeccCccccc-cccceeeEEeeeEEEEEeehhhHHHHhhhhHHHHH Confidence 9999998 5665544 56677777654 467899999999997 45899999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHhhccccccchhhh------------------hhHHH---HHHHH--HHhhhhhccCCCEEEE Q lcl|Aclame:pro 222 WLSSWIAKKVVVTRNQAIIEVMKAAPKKPTI------------------AKFDD---VITMI--NTAVDPAIIATSSLLT 278 (408) Q Consensus 222 ~v~~~l~~~~~~~~~~~~~~g~g~~~~~~~~------------------~~~d~---i~~~~--~~~l~~~~~~~a~~~~ 278 (408) ||.+.|++++++++|.+|++|+|++..+.+. .+.+. .++.+ .......+..+++|+| T Consensus 229 ~i~~~l~~ai~~~~d~~~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~ 308 (428) T protein:vir:10 229 LVLQDILTAISVREDKAFMRDDGTGDTPIGMKARATQWNRLLPWAADAAVNLDTIDTYLDSIILMSMDGNSNMISSGWGM 308 (428) T ss_pred HHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccccccccccccccHHHHHHHHHHHHHhhhccccccccCEEEE Confidence 9999999999999999999999975321111 11222 22222 1223445667889999 Q ss_pred cHHHHHHHHhhhcccCceeeccccccCCcccccccceEeeccccccc---cccCcceEEEEehhcceEeeeccceEEEEe Q lcl|Aclame:pro 279 NQSGLNKLALVKTAEGKYLLEPDPTKPNSYLIKGKQVIVVADRWLPN---TGSTVYPLYYGDMSQAITLFDRENMSLLPT 355 (408) Q Consensus 279 n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~---~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~ 355 (408) |+++|..|+++||++|+|+|++ . .+++|+|+||++++. +|. 
.+.++.+++||||++++ +..+++++++++ T Consensus 309 n~~~~~~L~~lkd~~G~~i~~~-~---~~g~l~G~pv~~~~~--~p~~~~~~~~~~~i~~gd~s~~~-i~~~~~i~i~~~ 381 (428) T protein:vir:10 309 SNRTYMKLFGLRDGNGNKVYPE-M---AQGMLKGYPIQRTSA--IPANLGEGGKESEIYFADFNDVV-IGEDGNMKVDFS 381 (428) T ss_pred cHHHHHHHHHhhccCCceeccC-C---CCCeeeceeeEEecc--ccccccCCCccceEEEEecceEE-EEEecceEEEee Confidence 9999999999999999999964 2 234899999998764 443 24466789999999755 568999999998 Q ss_pred ccch---------hhhhhceeeEEEEeeeCcEEecccceEEEEeecc Q lcl|Aclame:pro 356 NIGA---------GAFETDTTKIRVIDRFDVKATDSEALVAGSFSAI 393 (408) Q Consensus 356 ~~~~---------~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~ 393 (408) ++.. ..|.+|++.||++.|+||++.+|+||++++.... T Consensus 382 ~~~~~~~~~~~~~~~f~~~~~~~R~~~r~d~~v~~p~a~~~~t~~~~ 428 (428) T protein:vir:10 382 KEASYIDTDGKLVSAFSRNQSLIRVVTEHDIGFRHPEGLVLGTGVLF 428 (428) T ss_pred cccccccccccccchhhcchhheeeeeeeCceeeccceEEEEeccCC Confidence 8743 5699999999999999999999999999998888 No 59 >protein:vir:8420 Length: 477 # NCBI annotation: gp15 # Family: family:all:21 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818316;genbank:gi:29566752;genbank:GeneID:1260033 Probab=100.00 E-value=5.4e-58 Score=334.55 Aligned_cols=390 Identities=12% Similarity=0.106 Sum_probs=258.0 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh------cccHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHh Q lcl|Aclame:pro 1 MGVKLTVNQLNEAWIASGDKVTDFNDQINMALNDD------NFSAEAMSELKNKRDNEKVRRDALREQL---VEAQAEQV 71 (408) Q Consensus 1 M~~~~~i~el~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~ 71 (408) |+.+ |+||+.++.+++++..++.++++.+++.. ....++..+.+.++++++++++.+++.+ ++++.... 
T Consensus 1 ~~k~--~eem~~~i~eL~e~r~~l~~e~~~l~d~ak~e~~~~~~~~e~~e~~a~~~el~~ei~~le~~~~~~~~~~~~~~ 78 (477) T protein:vir:84 1 MEKH--LEELRALRAAAVEAVATLKAERQAIADGAKAEERAALSADETAEFRAKSASIKAELDKVEDLDEQIRELESEIE 78 (477) T ss_pred CchH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 5555 77888888888888887777776655432 2334445555555555555554433322 22211111 Q ss_pred hhcc----cccc---cccccc----hhhhHHHHHHHHHHHhhcchh----------------------hHHHHHHHHhhc Q lcl|Aclame:pro 72 VNMR----EEEK---GPLNKS----ENELKDKFVKDFVNMVRNPMA----------------------FMNTVSSKTETS 118 (408) Q Consensus 72 ~~~~----~~~~---~~~~~~----~~~~~~~~~~a~~~~~~~~~~----------------------~~~~~~~~a~~~ 118 (408) .... .... ...... .......+.+.+....+.... .....+.+.... T Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (477) T protein:vir:84 79 RSGKLEAETKTVRKATVEVNEALTYEKGNGQSYFRDLAMQTVGMADEPAKERLRRHMVDVESDKEIRKIAKVGEEYRDLD 158 (477) T ss_pred HhhcchhhhhhhcccccccccchhhhhhHHHHHHHHHHHHHhhhhhhHHHHHHHHHHhhhhhhhhHHHHHHhhhhhcccc Confidence 0000 0000 000000 000000111111111110000 000111222234 Q ss_pred cccccCceecchhh-hhhhhhhhhhhhhhhhhhceeecccCccceEEeeccCCccccchhcccccccc----ccccccee Q lcl|Aclame:pro 119 GSDSAAGLTIPQDI-RTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPD----LDNPQLTI 193 (408) Q Consensus 119 ~t~~~gg~~vP~~~-~~~ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~E~~~~~~----~~~~~f~~ 193 (408) ++++.||++||+++ .+.|++.+++.++++++++++++++.++++.+|...++...+.|++|++..++ .++++|++ T Consensus 159 ~~~~~gg~lv~~~~~~~~ii~~l~~~~~i~~~~~~~~~~~~~~~~~ip~~~~~~~~a~~~~Eg~~~~~~~~~~s~~~f~~ 238 (477) T protein:vir:84 159 RNGGTGGYAVPPLWMMNRFIELARAGRTYANLCPTEPLPGGTSSINIPKILTGTSTAIQAADNAALTAPSAHEVDLTDGF 238 (477) T ss_pred ccCCCcceeeccchhHHHHHHHhhhcchHHHhhceeeecCCcceeEEEEEecCcceeeeeccCcccccccccccccceee Confidence 55667888888875 67899999999999999999999999999999987777677789999875432 24578999 Q ss_pred eeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhccccccchhhh---------------h----- Q 
lcl|Aclame:pro 194 IKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVMKAAPKKPTI---------------A----- 253 (408) Q Consensus 194 v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~~g~g~~~~~~~~---------------~----- 253 (408) +++++++++++++||+|||+|+.+++++||.++|+++++.++|.+|++|+|++..+.+. . T Consensus 239 i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~~l~G~Gt~~~p~Gi~~~~~~~~~~~~~~~~t~~~~ 318 (477) T protein:vir:84 239 VQANVKTIAGQQGIAIQLLDQAAVSVDEFVFRDLAADYANKLNVQVISGTGSNNQVVGVRATAGITQVTATSAGSALEKH 318 (477) T ss_pred EEEeeeeEEeeeHHHHHHHhccchhHHHHHHHHHHHHHHHHHHHHHhccCCCCCccceeeeccccccccccccccchhhH Confidence 99999999999999999999999999999999999999999999999999965321111 0 Q ss_pred --hHHHHHHHHHHhhhhhccC-CCEEEEcHHHHHHHHhhhcccCceeeccc-------------cccCCcccccccceEe Q lcl|Aclame:pro 254 --KFDDVITMINTAVDPAIIA-TSSLLTNQSGLNKLALVKTAEGKYLLEPD-------------PTKPNSYLIKGKQVIV 317 (408) Q Consensus 254 --~~d~i~~~~~~~l~~~~~~-~a~~~~n~~~~~~l~~lkd~~G~~~~~~~-------------~~~~~~~~l~G~pv~~ 317 (408) .++++++++. .+...|.. .+.|+|||++|..|+++||++|||+|+|+ +..+.+++|+|+||++ T Consensus 319 ~~~~~~i~~~~~-~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~~~~~~~~~l~G~pVv~ 397 (477) T protein:vir:84 319 QIIYQKIADAIQ-RVHTSRFLEPEVIVMHPRRWASFHAIFAGDDRPLIVPSGPGFNNLGVLTEVASQRVVGQMHGLPVVT 397 (477) T ss_pred HHHHHHHHHHHh-hccccccCCccEEEEcHHHHHHHHHhhccCCCeeeecCcccccccccccccccccccchhcccceEe Confidence 1223344443 35556654 44799999999999999999999999875 3344557999999998 Q ss_pred ecccccccc---ccCcceEEEEehhcceEeeeccceEEEEeccchhhhhhceeeEEEEeeeCcEE-ecccceEEEEeecc Q lcl|Aclame:pro 318 VADRWLPNT---GSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKA-TDSEALVAGSFSAI 393 (408) Q Consensus 318 ~~~~~~~~~---~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v-~~~~a~~~l~~~~~ 393 (408) ++ .+|.. .++...++||||+++++ +. .++++.++++.+ +...++.|+++.++++.. ++|+||+.+++++. 
T Consensus 398 s~--~~p~~~~~~~d~~~i~~gd~~~~~i-~~-~~~~~~~~~~~~--~~~~~~~~~v~~~~~~~~~r~~~afv~~t~~~~ 471 (477) T protein:vir:84 398 DP--TLPTTLGTGTDQDVIHVLRASDLAL-FE-SSVRMRALQETR--AENLSVLLQVYGYLAFTAARFPQSVVEIGGTAL 471 (477) T ss_pred cC--cccccccccCCcceEEEEEeceEEE-Ee-eceeEEeccccc--cccceeeeeehhhhhhhhhccccceEEeecccc Confidence 65 46653 23455799999997554 43 578888887765 557888888888888744 56999999988876 Q ss_pred ccCCCC Q lcl|Aclame:pro 394 ADQVGN 399 (408) Q Consensus 394 ~~~~~~ 399 (408) |.+.-. T Consensus 472 ~~~~~~ 477 (477) T protein:vir:84 472 TAPTFA 477 (477) T ss_pred cccccC Confidence 654333 No 60 >protein:vir:78640 Length: 352 # NCBI annotation: phage capsid # Family: family:all:658 # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429943;genbank:gi:156603997;genbank:GeneID:5525386 Probab=100.00 E-value=2.1e-57 Score=331.26 Aligned_cols=332 Identities=10% Similarity=0.147 Sum_probs=230.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccchhhhHHHHHHHHHHHhhcch--------hhHHH Q lcl|Aclame:pro 39 AEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKGPLNKSENELKDKFVKDFVNMVRNPM--------AFMNT 110 (408) Q Consensus 39 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~--------~~~~~ 110 (408) .|+++++++++++++++++.+++++++++................. .......++|.+++++.. ..... 
T Consensus 1 ~eei~~l~~~~~~l~~~~~~l~~~~d~~e~e~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~ 77 (352) T protein:vir:78 1 MEDIKQLETEKAGLQQRFNIVERQVQDIEEKEKAKVKDKGEAYQSL---NDNEKLVKAKAEFYRHAILPNEFEKPSMEAQ 77 (352) T ss_pred ChhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccc---chhhhHHHHHHHHHHHHhhhhHHHHHHhhHH Confidence 1222222222222222222222222211111100000000000000 000111222333332221 11222 Q ss_pred HHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeecccCccceEEeeccCCccccchhccccccccccccc Q lcl|Aclame:pro 111 VSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQ 190 (408) Q Consensus 111 ~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~ 190 (408) ...++++.+++++||++||+++.++|++.+++.++|+++++++++.+. .+|......+.+.|++|++.++++ .++ T Consensus 78 ~~~~al~~~~~~~gG~lIP~~~~~~Ii~~l~~~s~l~~~~~v~~~~~~----~~p~~~~~~~~a~~v~E~~~~~~~-~~~ 152 (352) T protein:vir:78 78 RLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKGL----EIPRVSYTLDDDDFITDVETAKEL-KLK 152 (352) T ss_pred HHHHHhccCCCCCCceeccHhHHHHHHHHHHhhcchhhheeeEecCCc----eEEEEecCCCcccccccccccccc-ccc Confidence 344677888899999999999999999999999999999999876532 244444445677999999999975 589 Q ss_pred ceeeeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHH-HHhhccccccch-----------hhhhhHHHH Q lcl|Aclame:pro 191 LTIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQ-AIIEVMKAAPKK-----------PTIAKFDDV 258 (408) Q Consensus 191 f~~v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~-~~~~g~g~~~~~-----------~~~~~~d~i 258 (408) |++|++.+++++++++||+|||+||.++|++||.++|+++++++++. .+.+|+|++.+. 
++...+|++ T Consensus 153 f~~v~~~~~k~~~~i~is~ell~Ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~~~~~g~l~~~~~~~~t~~~~~d~i 232 (352) T protein:vir:78 153 GDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKSGLEHMSFYNGSVKEVEGANMYDAI 232 (352) T ss_pred ceeeeecceeEEeechhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhhhhcCCCCcccccceeccccccccccchHHHH Confidence 99999999999999999999999999999999999999999998655 566777665432 233347888 Q ss_pred HHHHHHhhhhhccCCCEEEEcHHHHHHHHhhhcccCceeeccccccCCcccccccceEeeccccccccccCcceEEEEeh Q lcl|Aclame:pro 259 ITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDM 338 (408) Q Consensus 259 ~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~ 338 (408) ++++. .+++.|+.+++|+||+.++..|.+++|.+|+|+|.. .+.+|+|+||+++++ ...++|||| T Consensus 233 ~~~~~-~l~~~~~~~a~~~mn~~t~~~l~~~~~~~~~~~~~~-----~~~~llG~PV~~~~~---------~~~~~~Gdf 297 (352) T protein:vir:78 233 INALA-DLHEDYRDNATIYMRYADYVKIISVLSNGTTNFFDT-----PAEKVFGKPVVFTDA---------AVKPIVGDF 297 (352) T ss_pred HHHHh-ccChhhhcCCEEEEehHHHHHHHHHHhccCCccccc-----CCccccccceEEecC---------CCceeEeeh Confidence 88886 699999999999999999999999988899998853 356899999998753 234799999 Q ss_pred hcceEeeeccceEEEEeccchhhhhhceeeEEEEeeeCcEEecccceEEEEeeccccCCCC Q lcl|Aclame:pro 339 SQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFSAIADQVGN 399 (408) Q Consensus 339 ~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~~ 399 (408) +.||.. +.++.+....+ +.++++.|++..|+|+++++|+||+.+++++.+.+-+. 
T Consensus 298 ~~~~~~--~~~~~~~~~~~----~~~g~~~f~~~~r~Dg~~~~~eA~~~l~~~a~~~~~~~ 352 (352) T protein:vir:78 298 NYFGIN--YDGTTYDTDKD----VKKGEYLFVLTAWYDQQRTLDSAFRIAKAKESTGSLPS 352 (352) T ss_pred hhhhhh--hhhheeeeecc----ccCCeeEEEEEeeeCceeechhheEEEEeecccCCCCC Confidence 987764 45566555433 34789999999999999999999999999998876655 No 61 >protein:vir:98635 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039923;genbank:gi:126011098;genbank:GeneID:4818471 Probab=100.00 E-value=1.9e-57 Score=331.60 Aligned_cols=342 Identities=11% Similarity=0.032 Sum_probs=244.8 Q ss_pred CChHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccc Q lcl|Aclame:pro 1 MGVKL-TVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEK 79 (408) Q Consensus 1 M~~~~-~i~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (408) |.+++ ++++++++++++.+.+++. ...++..+...+....+..++.. +..+ +........ 
T Consensus 1 M~i~~k~~~~~~~~~~~l~~~~~~~-----------~~~ee~~~~~~~~~~~~~~~~~~---~~~~-e~~~~~~~~---- 61 (377) T protein:vir:98 1 MAINLKELPKYREAVAELSAKISAG-----------ATSEEQEKLFEAAFTTMGDEILA---KNEE-EMERMFDLR---- 61 (377) T ss_pred CCCcHHHHHHHHHHHHHHHHHHHhh-----------hhhHHHHHHHHHHHHhHHHHHHH---HHHH-HHHHHHHhc---- Confidence 77763 3344444333332222211 11111111111111111111111 0000 001100000 Q ss_pred cccccchhhhHHHHHHHHHHHhhcchhhHHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeecccCc Q lcl|Aclame:pro 80 GPLNKSENELKDKFVKDFVNMVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSN 159 (408) Q Consensus 80 ~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~ 159 (408) ........++++.|.+.+ ..+++++||++||+++...|++.+.+.++++++|++.++++ T Consensus 62 ----~~~~~lt~ee~~~~~~~~---------------~~~~~~~gg~~vP~~~~~~I~~~l~~~s~i~~~~~v~~~~~-- 120 (377) T protein:vir:98 62 ----DKNRELTAEEIKFFNDID---------------KNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSL-- 120 (377) T ss_pred ----cCCcccCHHHHHHHHHHH---------------hccCCCCCccccCHHHHHHHHHHHHHhhhhhhheeeEecCc-- Confidence 011122344555554432 34577899999999999999999999999999999988753 Q ss_pred cceEEeeccCCccccchhcccccccccccccceeeeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 160 GSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAI 239 (408) Q Consensus 160 g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~ 239 (408) ...++.. 
+..+.+.|++|++++++.+.++|+++++++++++++++||++||+|+.+++++||.++|+++++++++.+| T Consensus 121 -~~~~~~~-~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~a~~~is~elL~ds~~~ie~~i~~~la~~~a~~~~~a~ 198 (377) T protein:vir:98 121 -RLKALTA-ETSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWIKQFITEQLKEAIAVALELAI 198 (377) T ss_pred -ceEEEEe-cCCcceeEeecccccCcccCccceeEeecceeEEeeecccHHhhhccHhHHHHHHHHHHHHHHHHHHhhce Confidence 3445544 35677899999988877778999999999999999999999999999999999999999999999999999 Q ss_pred hhccccccchhhhh--------------------hHHHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHhhhcccCceeec Q lcl|Aclame:pro 240 IEVMKAAPKKPTIA--------------------KFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLE 299 (408) Q Consensus 240 ~~g~g~~~~~~~~~--------------------~~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~lkd~~G~~~~~ 299 (408) ++|+|++.|.+-.. ..+.+.++ ...++..|+.+++|+||+.++..++++||.+|+|+|. T Consensus 199 i~G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l-~~~~~~~~~~~a~~~m~~~t~~~~~klkd~~G~~i~~ 277 (377) T protein:vir:98 199 VKGDGLLQPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADL-SDLTPDNAPKKLVPVMKHLSVNDKKRPLKIAGQVKLI 277 (377) T ss_pred EeccCCCcceeeeecccccccccccccccccccchhhhHhhh-hhhchhHHHHHHHHHHHHHHHHHHhhhhccCCceEEE Confidence 99999876542211 11233333 4668889999999999999999999999999999995 Q ss_pred ccc--------------ccCCcccccccceEeeccccccccccCcceEEEEehhcceEeeeccceEEEEeccchhhhhhc Q lcl|Aclame:pro 300 PDP--------------TKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETD 365 (408) Q Consensus 300 ~~~--------------~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~ 365 (408) .++ .+|.+.+++|+|+.++.+..+|. 
+.++||||++ |.+++|++++|+++++.+ |.+| T Consensus 278 ~n~~~~~~~~p~~~~~~~~G~~~t~lg~p~~vv~s~~~p~-----~~i~fgdf~~-Y~i~~r~~~~i~~~~~~~--~~~d 349 (377) T protein:vir:98 278 LNPEDRWALEAQFTSRNQFGEYVTVLPHGITILESLAVET-----GKAIAFVANR-YDAFMATASTIEEYDQTF--AMED 349 (377) T ss_pred ecccchhhccccccccCCCCccccccCCCceEEecCCCCc-----ccEEEEEecc-eeEEeecceEEEeechhh--hhcC Confidence 332 23455689999987766555554 4589999998 777899999999998765 8999 Q ss_pred eeeEEEEeeeCcEEecccceEEEEeecc Q lcl|Aclame:pro 366 TTKIRVIDRFDVKATDSEALVAGSFSAI 393 (408) Q Consensus 366 ~~~~r~~~r~d~~v~~~~a~~~l~~~~~ 393 (408) ++.||+..|+||++++|+||++++++-. T Consensus 350 ~~~f~~~~r~dg~~~~~~a~~vl~i~~~ 377 (377) T protein:vir:98 350 LQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) T ss_pred ceEEEEEEEEcCEEeccCcEEEEEEecC Confidence 9999999999999999999999999887 No 62 >protein:vir:4092 Length: 390 # NCBI annotation: major capsid protein a # Family: family:all:635 # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510986;swissprot:trembl:q8w604;genbank:gi:17488508;uniprot:Q8W604;genbank:GeneID:1260361 Probab=100.00 E-value=3.3e-56 Score=324.72 Aligned_cols=354 Identities=9% Similarity=0.045 Sum_probs=232.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-hhcccccccccccc Q lcl|Aclame:pro 7 VNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQV-VNMREEEKGPLNKS 85 (408) Q Consensus 7 i~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~ 85 (408) |++|++++.+..+..+++.+.++. .....+..+.+.+...... ++...+.+....... .......+ . 
T Consensus 1 ik~L~e~~~e~~e~~~~~~~~~~~----~~~~~e~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~-----~ 68 (390) T protein:vir:40 1 MNNLDKKDSETLNISTAFLNAIKE----GATEAEQVTAFTNMAEQIQ---NNIIAQARKEVNREMNDNNVLASR-----G 68 (390) T ss_pred CchHHHHHHHHHHHHHHHHHHHhh----hhhHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHhc-----C Confidence 556665555544444433333221 1111111111111111111 011011000000000 00000000 0 Q ss_pred hhhhHHHHHHHHHHHhhcchhhHHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeecccCccceEEe Q lcl|Aclame:pro 86 ENELKDKFVKDFVNMVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYE 165 (408) Q Consensus 86 ~~~~~~~~~~a~~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~ 165 (408) ......++++.+.. ....+++++||++||++++++|++.+++.++|+++|+++++.+.... ++ T Consensus 69 ~~~l~~~~r~~~~~---------------~~~~~~~~~gg~lvP~~~~~~I~~~~~~~s~i~~~~~~~~~~~~~~~--i~ 131 (390) T protein:vir:40 69 ANALTSDESKYYNE---------------VIAGNGFAGVTALLPPTVFERVFEDLTVEHPLLSKINFVNTTATTEW--II 131 (390) T ss_pred chhccHHHHHHHHH---------------HHhccCcccCcccccHHHHHHHHHHHHhhhhhhhhceeeecCCceeE--EE Confidence 01111222222211 22345677899999999999999999999999999999998765444 44 Q ss_pred eccCCccccchhcccccccccccccceeeeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhcccc Q lcl|Aclame:pro 166 KWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVMKA 245 (408) Q Consensus 166 ~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~~g~g~ 245 (408) .. 
+..+.+.|++|++++++.++++|++|++++++++++++||+||++|+.++|++||.++|+++++.+++.+|++|+|+ T Consensus 132 ~~-~~~~~a~~~~E~~~~~~~~~~~f~~i~l~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~~i~~~~~~a~l~G~G~ 210 (390) T protein:vir:40 132 SV-GDVATAWWGPLCAEIKEVLDNGFDKIQTGMYKLSAYIPVCNAMLDLGPSWLDQYVRTILGEAMALGLEAGIVNGSGK 210 (390) T ss_pred EE-cCCcceeeeccccccCccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhhcccCC Confidence 43 34567899999999887778999999999999999999999999999999999999999999999999999999998 Q ss_pred ccchhhh------------------hhHHHHHHHHH---Hhh---hhhccCCCEEEEcHHHH-H---HHHhhhcccCcee Q lcl|Aclame:pro 246 APKKPTI------------------AKFDDVITMIN---TAV---DPAIIATSSLLTNQSGL-N---KLALVKTAEGKYL 297 (408) Q Consensus 246 ~~~~~~~------------------~~~d~i~~~~~---~~l---~~~~~~~a~~~~n~~~~-~---~l~~lkd~~G~~~ 297 (408) +.|.+.. .+.+++.+++. ..+ ...+..+++|+||+.++ . .+++++|++|+|+ T Consensus 211 ~~P~Gil~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~~~~~a~~i~n~~t~~~~l~~~~~~~d~~G~~v 290 (390) T protein:vir:40 211 DQPIGMMRDLNNVTAGEHPVKTATPLTDLTPATLATKVMLPLTDNGKKSVSDAILVINPADYWSKIYAATSYMTPQGVWV 290 (390) T ss_pred CccceeeeccccccccccccccccccchhhHHHHHHHHHHHhhcchhhhhcCceEEEcchhHHHHHHHHhhccCCCCccc Confidence 7653111 11222222221 111 12345688999999873 3 4558999999999 Q ss_pred eccccccCCcccccccceEeeccccccccccCcceEEEEehhcceEeeeccceEEEEeccchhhhhhceeeEEEEeeeCc Q lcl|Aclame:pro 298 LEPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDV 377 (408) Q Consensus 298 ~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~ 377 (408) |.. .++|+||++++ .+|. +.++||||++ |.+++|++++++++++. 
.|.+|++.||++.|+|+ T Consensus 291 ~~~--------~~~g~pvv~~~--~~p~-----~~i~~Gd~s~-~~i~~~~~~~v~~~~~~--~f~~~~~~~r~~~r~dg 352 (390) T protein:vir:40 291 TGI--------LPVPLEIVQSV--AVPV-----GKAVAGRAKD-YFMGIGSEQVIRTSTEY--RLLDDETLYYAKQYANG 352 (390) T ss_pred ccc--------CCCceeEEEcC--CCCC-----CcEEEEeece-EEEEeecceEEEecchh--hhhcCcEEEEEEEEeCC Confidence 743 35799998854 3443 4589999998 56688999999998865 49999999999999999 Q ss_pred EEecccceEEEEeeccccC--CCCccCCCcccC Q lcl|Aclame:pro 378 KATDSEALVAGSFSAIADQ--VGNFKTTTSTAV 408 (408) Q Consensus 378 ~v~~~~a~~~l~~~~~~~~--~~~~~~~~~~~~ 408 (408) ++++|+||+++++++...+ .+.++++.++-- T Consensus 353 ~v~~~~A~~~l~~~~~~~~~~~~~~~~~~~~~~ 385 (390) T protein:vir:40 353 RPKDNSSFLVFDITGLEGSPAIDVNVVNNATPS 385 (390) T ss_pred EEecccceEEEEeeccCCCCCCCcceeeCCCCC Confidence 9999999999999998653 333333222211 No 63 >protein:vir:93616 Length: 645 # NCBI annotation: putative major head protein/prohead protease # Family: family:all:21 # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449293;genbank:gi:157166041;goa:Q6H9U8;interpro:IPR006433;uniprot:Q6H9U8;genbank:GeneID:5580438 Probab=100.00 E-value=1.1e-55 Score=321.93 Aligned_cols=384 Identities=13% Similarity=0.064 Sum_probs=253.5 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccc Q lcl|Aclame:pro 1 MGVKLTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAE---AMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREE 77 (408) Q Consensus 1 M~~~~~i~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (408) |+..+.+++|++++.++.++++.+.++..+. ....+.+ ++.++..+++++..+++.+++................ 
T Consensus 193 ~~~~e~i~~l~~~ra~~~~~~~~l~~~a~~~--g~~l~aee~~~~d~l~aei~~l~~~i~r~e~~e~~~a~~a~pv~~~~ 270 (645) T protein:vir:93 193 MNIGEQIKSFENKRAALAASLEEVMTKAAEE--GRTLDVEEEEHYDNTAAEIRQVDAHLKRLRELEAGKAATAQPVKQAG 270 (645) T ss_pred cchhhhhhhhhHHHHHHHHHhhhhhhhHhhh--ccccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccc Confidence 8888888889888888887777766543221 1122222 3444455555555444444332111111000000000 Q ss_pred c---ccccccc--hhhhHHHHHHHHHHHhhc----chh----------------hHHHHHHHH----hhccccccCceec Q lcl|Aclame:pro 78 E---KGPLNKS--ENELKDKFVKDFVNMVRN----PMA----------------FMNTVSSKT----ETSGSDSAAGLTI 128 (408) Q Consensus 78 ~---~~~~~~~--~~~~~~~~~~a~~~~~~~----~~~----------------~~~~~~~~a----~~~~t~~~gg~~v 128 (408) . ....... ....+......|.++++. +.. ........+ .++++.++||+++ T Consensus 271 ~~~~~~~~~~~~~~~~~~~~kg~~f~~~~~al~~~~g~~~~a~e~a~~~~~~~~~~~~~~~~a~~~~~~~~~~~~Gg~~v 350 (645) T protein:vir:93 271 NGNVAAVASAPVIRVEQKLDKGIGFARFAKSLAAAKGVRSEALEVARRQYPDDSRLHHVLKSAVGAGTTTDPQWAGSLSE 350 (645) T ss_pred ccccccccccccccchhhhhhhhhHHHHHHHHHhcccchhHHHHHHHhhcccchhhhhhhhhhhhccccccccccCCccC Confidence 0 0000000 000011111122222211 000 000001112 2233345689999 Q ss_pred chhhhhhhhhhhhhhhhhhhhhceeecc--cCccceEEeeccCCccccchhcccccccccccccceeeeechheeeeehH Q lcl|Aclame:pro 129 PQDIRTMINTLVRQYDSLQQYVRVESVS--TSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIIT 206 (408) Q Consensus 129 P~~~~~~ii~~~~~~~~l~~~~~~~~~~--~~~g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~ 206 (408) |+++..+||+.+++.+++++++...... 
+..+.+.+|...++ +.++|++|++.+|+ +.++|++++++++|++++++ T Consensus 351 p~~~~~~ii~~l~~~svv~~l~~~~~~~~~~~~~~~~ip~~t~~-~~a~wv~Eg~~~~~-s~~~f~~v~l~~~kla~~~~ 428 (645) T protein:vir:93 351 YQEYAQDFIDYLRPQTIIGRFGQGGIPALRQVPFNIRVHAQVSG-GAAGWVGEGKTKPL-TKFDFESITFSHAKVSAIAV 428 (645) T ss_pred chhhHHHHHHhhhhhhhHHhhccccccccccccCceeeeeeecC-cceEEeccCccccc-cccceeEEEEeeEEEEEeeh Confidence 9999999999999999999886543322 12346677776544 67899999999996 46899999999999999999 Q ss_pred HHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhcccccc----ch----------hhhhhHHHHHHHHHHhhhh-hcc Q lcl|Aclame:pro 207 ATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVMKAAP----KK----------PTIAKFDDVITMINTAVDP-AII 271 (408) Q Consensus 207 iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~~g~g~~~----~~----------~~~~~~d~i~~~~~~~l~~-~~~ 271 (408) +|+|||+|+.+++++||.++|++++++++|.+|++|+|++. |. ......+++..++...... ... T Consensus 429 iS~ell~ds~~~~~~~i~~~l~~aia~~~d~a~l~g~g~~~~~~~p~gi~~~~~~~~~~~~~~~d~~~~~~~~~~a~~~~ 508 (645) T protein:vir:93 429 LTEELIRFSSPAADALVRNALAEAVVARLDTDFVDPKKAAVADVSPASITHDVKGTASSGNPDADAEAAFGQFVAANLQP 508 (645) T ss_pred hHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHhhcCCCcccCCccccceeccccccccccchHHHHHHHHHHHHhcCCCc Confidence 99999999999999999999999999999999999887642 11 1112234555554332222 234 Q ss_pred CCCEEEEcHHHHHHHHhhhcccCceeeccccccCCcccccccceEeeccccccccccCcceEEEEehhcceEeeeccceE Q lcl|Aclame:pro 272 ATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMS 351 (408) Q Consensus 272 ~~a~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~ 351 (408) .+++|+|||.++.+|+++||++|+|+| |++. ..+++|+|+||++++. +|+ .++||||+++++ .+++++. 
T Consensus 509 ~~a~~vmn~~~~~~L~~lkd~~G~~~~-~~~~-~~~~tL~G~PV~~s~~--vp~------~~~~gd~s~~~i-g~~~~v~ 577 (645) T protein:vir:93 509 TGAVWLMSSTNALALSMRKNALGQKEY-PDMT-LLGGSFQGLPVIVSQY--VGD------QLVLVNAPDIYL-ADDGGVA 577 (645) T ss_pred cccEEEEcHHHHHHHHhccccCCceee-cCCC-CCCceeeceeeEEecc--CCc------ceeEeccccEEE-EEecceE Confidence 578999999999999999999999998 4443 3346999999998643 453 268999998654 5678888 Q ss_pred EEEeccch--------------------hhhhhceeeEEEEeeeCcEEecccceEEEEeeccccCCCC Q lcl|Aclame:pro 352 LLPTNIGA--------------------GAFETDTTKIRVIDRFDVKATDSEALVAGSFSAIADQVGN 399 (408) Q Consensus 352 i~~~~~~~--------------------~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~~ 399 (408) +.++.+.. ++|++|+++||+++|+||++++|+||++|+-.....+.+. T Consensus 578 i~~s~~a~~~~~~~~~~~~~~~~~~~~v~lf~~d~vaira~~r~d~~~~~p~a~~~lt~~~~g~~~~~ 645 (645) T protein:vir:93 578 VDMSREASLEMQSEPTGDSTTPSPVELVSMFQTGSVAIRAERWINWRRRRTAAVAVITGVNYGSASGG 645 (645) T ss_pred EEeecceeEEEeecccccccccccccchhHhhcCceEEEEEEEEcceeeCccceEEEecccCCcccCC Confidence 77765432 3599999999999999999999999999998888877777 No 64 >protein:vir:95963 Length: 395 # NCBI annotation: ORF009 # Family: family:all:635 # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239802;genbank:gi:66395459;genbank:GeneID:5132880 Probab=100.00 E-value=7.3e-55 Score=317.36 Aligned_cols=358 Identities=11% Similarity=0.030 Sum_probs=239.5 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccc Q lcl|Aclame:pro 1 MGVKLTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKG 80 (408) Q Consensus 1 M~~~~~i~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (408) |.....+++..+.+++.++++. ++.++....+++...+.+.++++..++...... +.+...........+. 
T Consensus 1 mt~~~~~~e~~~~~~e~~~~~~-------~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~--e~~~~~~~~~~~~~r~ 71 (395) T protein:vir:95 1 MADMKQNNVKLKNYHEHKKQFA-------NLVQNGASDEEQSKAFGAMFDALSNDLQEEITA--EINNRVVDNGILAKRS 71 (395) T ss_pred ChhHHHHHHHHHHHHHHHHHHH-------HHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHhhcC Confidence 7776444433333333333332 222322222223333232222222111110000 0000000000000000 Q ss_pred ccccchhhhHHHHHHHHHHHhhcchhhHHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeecccCcc Q lcl|Aclame:pro 81 PLNKSENELKDKFVKDFVNMVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNG 160 (408) Q Consensus 81 ~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g 160 (408) ......++++ |. .+...+++++||++||+++.++|++.+++.++|+++|+++++++ T Consensus 72 -----~~~l~~ee~~-~~---------------~~~~~~t~~~gG~liP~~~~~~Ii~~l~~~s~i~~~~~v~~~~~--- 127 (395) T protein:vir:95 72 -----QDPLTSEERK-FF---------------NDINYDVGYTDEKILPETVVERVFDDLQKDHPLLSKINFQNAGI--- 127 (395) T ss_pred -----ccccchHHHH-HH---------------HHHhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC--- Confidence 0001111111 11 23445678899999999999999999999999999999998853 Q ss_pred ceEEeeccCCccccchhcccccccccccccceeeeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 161 SRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAII 240 (408) Q Consensus 161 ~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~ 240 (408) ...++.. 
+..+.+.|++|+++++..+.++|++|++++|+++++++||+|||+|+.+++++||.+.|+++++++++.+|+ T Consensus 128 ~~~i~~~-~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~~~~~iS~ell~ds~~~ie~~i~~~la~~ia~~~~~a~i 206 (395) T protein:vir:95 128 KTRVIKA-DPAGQAVWGKVFGEIKGQLDAAFREENFTQYKLTCFVVLPDDLSTFGPAWIERFVRTQIQEAISVALESAII 206 (395) T ss_pred ceEEEEe-cCCcceEEeecccccCccccccceeeeeceeeEEEeecccHHHHhcchhHHHHHHHHHHHHHHHHHHhhhee Confidence 4455554 455677899888887666779999999999999999999999999999999999999999999999999999 Q ss_pred hcccccc--chhh------------------hhhHHHHHHHH------HHhh-------hhhccCCCEEEEcHHHHHHHH Q lcl|Aclame:pro 241 EVMKAAP--KKPT------------------IAKFDDVITMI------NTAV-------DPAIIATSSLLTNQSGLNKLA 287 (408) Q Consensus 241 ~g~g~~~--~~~~------------------~~~~d~i~~~~------~~~l-------~~~~~~~a~~~~n~~~~~~l~ 287 (408) +|+|++. |.+- ...++++...+ ...+ ...+..++.|+|||+++. T Consensus 207 ~G~G~~~~qP~Gil~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~mn~~t~~--- 283 (395) T protein:vir:95 207 NGGGAAKTQPVGLMKDVNTNSGAVTDKASSGTLTFADADTTILELNDVLKNLSVDEKGKELKIDGKVALVVNPRDSW--- 283 (395) T ss_pred eccCCCCcCceeeeecccccccccccccccchhhhhhhHhhHHHHHHHHHhhccccccchhhhcCceEEEEcchhhh--- Confidence 9999863 2110 01122222111 1111 224567889999998865 Q ss_pred hhhcccCceeeccccccCCcccccccceEeeccccccccccCcceEEEEehhcceEeeeccceEEEEeccchhhhhhcee Q lcl|Aclame:pro 288 LVKTAEGKYLLEPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTT 367 (408) Q Consensus 288 ~lkd~~G~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~ 367 (408) |.+|+|+|++ .+|.+.+++|+|+-++.+..+|. 
+.++||||++ |.+.+|++++++++++.+ |.+|++ T Consensus 284 ---~~~g~~~~~~--~~G~~~~~lg~g~~v~~~~~~p~-----~~i~fgdfs~-y~i~~r~~~~i~~~~~~~--~~~d~~ 350 (395) T protein:vir:95 284 ---DVQARYTYLT--ANGGFVTVLPYNVTIITSEFVPE-----GKLVAFVTDR-YNAVRGGGLTVKKFDQTL--ALEDAV 350 (395) T ss_pred ---hcCCcceecc--CCCcceeccCCcceEEEcCCCCC-----CcEEEEeccc-EEEEEecceEEEeccchh--hhCCcE Confidence 5579999987 45667788877754433445664 3489999998 677899999999998764 899999 Q ss_pred eEEEEeeeCcEEecccceEEEEeeccccCCCCccCCCcccC Q lcl|Aclame:pro 368 KIRVIDRFDVKATDSEALVAGSFSAIADQVGNFKTTTSTAV 408 (408) Q Consensus 368 ~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~~~~~~~~~~~ 408 (408) .||+..|+||++++++||++|+++...+++.++++.++++= T Consensus 351 ~f~~~~r~dg~~~~~~A~~~l~i~~~~~~~~~~~~~~~~~~ 391 (395) T protein:vir:95 351 LFTAKTFAYGQPDDNKASAVYDLKVASAPRRQTSAGGTTDG 391 (395) T ss_pred EEEEEEEECCEEeccccEEEEEeeccCCCCCCCCCCCCCCc Confidence 99999999999999999999999988888888887777766 No 65 >protein:vir:80128 Length: 466 # NCBI annotation: Phage capsid protein # Family: family:all:635 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425603;genbank:gi:155042936;genbank:GeneID:5469556 Probab=100.00 E-value=5.1e-54 Score=312.77 Aligned_cols=389 Identities=15% Similarity=0.159 Sum_probs=241.6 Q ss_pred CChHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHHhh--hcccHHHH-------HHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MGVKL-----TVNQLNEAWIASGDKVTDFNDQINMALND--DNFSAEAM-------SELKNKRDNEKVRRDALREQLVEA 66 (408) Q Consensus 1 M~~~~-----~i~el~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~ 66 (408) |.++. .|+++++++.++.++.+++.++.+++... +...+++. 
..+..+++++++++..+++++.++ T Consensus 1 ~~~~~~~l~~~~~~~~~~l~el~e~~~~l~k~~~el~~~l~ea~~~ee~~~~ee~i~~l~~~~~el~e~~~~l~~ei~~l 80 (466) T protein:vir:80 1 MALRQLMLAKKIEQRKAALAELLEQEKALQKRSEELEAAIDEANTDEEIAVVEDEINKLEGEKTELEEKKSKLEGEIKEL 80 (466) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 65553 34444444444544444443332222110 11122222 222222222222333333333332 Q ss_pred HHHHhhhc-ccccccccccchh-------hhHHHHHHHHHHHhhcch---h--------hHHHHHHHHhhccccccCcee Q lcl|Aclame:pro 67 QAEQVVNM-REEEKGPLNKSEN-------ELKDKFVKDFVNMVRNPM---A--------FMNTVSSKTETSGSDSAAGLT 127 (408) Q Consensus 67 ~~~~~~~~-~~~~~~~~~~~~~-------~~~~~~~~a~~~~~~~~~---~--------~~~~~~~~a~~~~t~~~gg~~ 127 (408) +....... ............. .......+++.+.+.... . .............+.++|+++ T Consensus 81 e~el~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~ 160 (466) T protein:vir:80 81 ENELEQLNNKEPKNNSEPAQVSGARTQQFVGGETRMKGFFRNMPYEQRAALIARSEVKEFLAQVRTLAQQKRAVSGAELT 160 (466) T ss_pred HHHHHHHHHhhhccCchhHHHHhhhhhHHhhHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHhhhhhhhcccccc Confidence 22211100 0000000000000 000111222211111000 0 000001111122234566789 Q ss_pred cchhhhhhhhhhhhhhhhhhhhhceeecccCccceEEeeccCCccccchhcccccccccccccceeeeechheeeeehHH Q lcl|Aclame:pro 128 IPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITA 207 (408) Q Consensus 128 vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~i 207 (408) ||+++.+.|++.+++.++|++++++.++++ ...++.. 
...+.+.|++|+++++++ +++|++|++.+++++++++| T Consensus 161 vP~~~~~~i~~~l~~~~~l~~~~~v~~~~g---~~~~~~~-~~~~~a~wv~E~~~~~~~-~~~f~~i~~~~~k~~~~~~i 235 (466) T protein:vir:80 161 IPDVMLELLRDNMHRYSKLISKVRLRPLKG---TARQNIA-GAIPEGVWTEAVANLNEL-SLSFSQIEVDGYKVGGFIPI 235 (466) T ss_pred ccHHHHHHHHHhhhhhhhhhhheeeeecCc---eeEeeee-cCCcceeecccccccccc-cccccceeecceeeeeehhh Confidence 999999999999999999999999998864 3445443 345667999999999975 59999999999999999999 Q ss_pred HHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhccccccchhhhh---------------------hHHHH-------- Q lcl|Aclame:pro 208 TNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVMKAAPKKPTIA---------------------KFDDV-------- 258 (408) Q Consensus 208 S~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~~g~g~~~~~~~~~---------------------~~d~i-------- 258 (408) |+|||+|+.++|++||..+|+++++.+++.+|++|+|++.|.+-.. ....+ T Consensus 236 S~ell~ds~~~l~~~i~~~la~~~~~~~~~ail~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 315 (466) T protein:vir:80 236 PNSTLEDSDLNLADEILDAIGQAIGFALDKAILYGTGTKMPVGIVTRLAQTTQPPNWGTKAPAWTNLSTTNLLKIDPTGK 315 (466) T ss_pred hHHHHhcchHHHHHHHHHHHHHHHHHHHhhheeeccCCCCcceeeecccccccccccccccccccccchhhhhhhhhhcc Confidence 9999999999999999999999999999999999999876532100 01111 Q ss_pred ---------HHHHHHhhhhhccCCCEEEEcHHHHHHHHhhh---cccCceeeccccccCCcccccccceEeecccccccc Q lcl|Aclame:pro 259 ---------ITMINTAVDPAIIATSSLLTNQSGLNKLALVK---TAEGKYLLEPDPTKPNSYLIKGKQVIVVADRWLPNT 326 (408) Q Consensus 259 ---------~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~lk---d~~G~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~ 326 (408) +..+.........+++.|+||+.++..|..++ +.+|.|++.+. + +..|+|+||+++++ +|. 
T Consensus 316 ~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~~~~~l~~~~~~~~~~g~~~~~~~--~--~~~i~G~pvv~s~~--~~~- 388 (466) T protein:vir:80 316 SAEEFFSELVLKLSKARANYSNGMKFWAMSSNTHAVLMSKAITFNSAGALVASLN--N--TMPIVGGDIVILDF--IPD- 388 (466) T ss_pred chhhHHHHHHHHHHhhhccccCCceeEEecchhHHHhhcccccccCCccccccCC--C--cccccccceeecCc--cCc- Confidence 11111212233445667999999999999887 66777776542 2 23599999988653 333 Q ss_pred ccCcceEEEEehhcceEeeeccceEEEEeccchhhhhhceeeEEEEeeeCcEEecccceEEEEeeccccCCCCccCCCcc Q lcl|Aclame:pro 327 GSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFSAIADQVGNFKTTTST 406 (408) Q Consensus 327 ~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~~~~~~~~~ 406 (408) +.++||||+. |.+++|++++|.++++. .|.+|++.||+..|+||++++|+||++++++...+.+....+.+.. T Consensus 389 ----~~~~~g~~~~-y~i~~r~~~~i~~~~~~--~f~~d~~~~r~~~r~dg~~~~~~afv~~~~~~~~~~~~~~~~~~~~ 461 (466) T protein:vir:80 389 ----NDIIGGYGSL-YLLAERADIKLAQSEHV--RFIEDQTVFKGTARYDGKPVFGEGFVAVNIANANPTTSITFAPDEA 461 (466) T ss_pred ----cceeeecccc-EEEEeecceEEEechhh--hhhcCcEEEEEEEEEccEEeccCceEEEEecCCCcccceeeecCcC Confidence 4489999996 56789999999998765 4999999999999999999999999999999877666555555444 Q ss_pred cC Q lcl|Aclame:pro 407 AV 408 (408) Q Consensus 407 ~~ 408 (408) -+ T Consensus 462 ~~ 463 (466) T protein:vir:80 462 NV 463 (466) T ss_pred cC Confidence 44 No 66 >protein:vir:7771 Length: 330 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817605;genbank:gi:29566035;genbank:GeneID:1259229 Probab=100.00 E-value=2.4e-55 Score=320.06 Aligned_cols=285 Identities=12% Similarity=0.080 Sum_probs=233.6 Q ss_pred HHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeecccCccceEEeeccCCccccchhcccccccccc Q lcl|Aclame:pro 108 MNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLD 187 (408) Q Consensus 108 ~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~E~~~~~~~~ 187 (408) 
....+.++..+.++.++|.+||+++.++|++.+++.++|+++++++++.+.. +.+|+..+ .+.+.|++|++++++. T Consensus 1 m~~~~~~a~~~~~t~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~--~~~p~~~~-~~~a~~v~Eg~~~~~~- 76 (330) T protein:vir:77 1 MAGSTVPSTQVALTGDFSAFLTPEQSQDYFAEIEKTSIVQRIARKVPMGPTG--ISIPHWTG-AVSASWTGEAERKPIT- 76 (330) T ss_pred CcccccchhhccccCCCcceechhHHHHHHHHHHhccchhhhcceeeccCCc--eEEEEEcC-CcceeEecCCCccccc- Confidence 3334456666666667777899999999999999999999999999987654 44555543 4567999999999975 Q ss_pred cccceeeeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhccccccchhhh--------------- Q lcl|Aclame:pro 188 NPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVMKAAPKKPTI--------------- 252 (408) Q Consensus 188 ~~~f~~v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~~g~g~~~~~~~~--------------- 252 (408) .++|++++++++|++++++||+|+++|+.+++++||.++|+++++++++.++++|+|++.+..+. T Consensus 77 ~~~f~~i~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~ai~~~~~~~~l~G~g~~~~~~g~~~~~~~~~~~~~~~~ 156 (330) T protein:vir:77 77 KGSFGKQELEPVKITTIFAESAEVVRLNPLNYLNTMRTKIAEAIALKFDAAAIHGIDKPSAFKGYLAETTKVVSLADTNL 156 (330) T ss_pred cceeeEEEEeEEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCCccccccccccccceeecccc Confidence 58999999999999999999999999999999999999999999999999999999976542111 Q ss_pred --------hhHHHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHhhhcccCceeeccccccC-----CcccccccceEeec Q lcl|Aclame:pro 253 --------AKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKP-----NSYLIKGKQVIVVA 319 (408) Q Consensus 253 --------~~~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~-----~~~~l~G~pv~~~~ 319 (408) ..++++.+++. 
.+...+..+++|+|||++|..|+++||++|+|+|++....+ .+.+|+|+||++++ T Consensus 157 ~~~~~~~~~~~~~l~~~~~-~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~l~G~PV~~~~ 235 (330) T protein:vir:77 157 TTASGPQGNAYLAVNNALS-LLVNSGKKWTGTLLDNVTEPILNTAVDGNGRPLFVESTYTEQVGAIREGRILGRPTYVAD 235 (330) T ss_pred cccccccchhHHHHHHHHH-hhhhcCCCccEEEEcHHHHHHHHHHhccCCceeecCccccccccccCCceecceeeEEec Confidence 12456666654 36677888889999999999999999999999999865544 34689999999876 Q ss_pred ccccccc-ccCcceEEEEehhcceEeeeccceEEEEeccch----------------hhhhhceeeEEEEeeeCcEEecc Q lcl|Aclame:pro 320 DRWLPNT-GSTVYPLYYGDMSQAITLFDRENMSLLPTNIGA----------------GAFETDTTKIRVIDRFDVKATDS 382 (408) Q Consensus 320 ~~~~~~~-~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~----------------~~f~~~~~~~r~~~r~d~~v~~~ 382 (408) + +|.. ..++..++||||++++ +.++++++++++++.+ +.|++|++.||++.|+|+++.+| T Consensus 236 ~--~p~~~~~~~~~~~~gd~s~~~-i~~~~~~~i~~~~e~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~ 312 (330) T protein:vir:77 236 N--VVNGTVGNRVVGVMGDFSQVI-WGQIGGLSFDVTDQATLDFGEEQGGVWVPKLISLWQHNMVAVRCEAEFAFMVNDK 312 (330) T ss_pred c--ccCCCCCCccEEEEEecceEE-EEEecCcEEEEeecceeeecccccccccccccchhhcCcEEEEEEEEeccEEecc Confidence 5 4432 3456779999999865 6789999999877643 56999999999999999999999 Q ss_pred cceEEEEeeccccCCCCc Q lcl|Aclame:pro 383 EALVAGSFSAIADQVGNF 400 (408) Q Consensus 383 ~a~~~l~~~~~~~~~~~~ 400 (408) +||++++.++..+.+-.- T Consensus 313 ~a~~~i~~~~~~~~~~~~ 330 (330) T protein:vir:77 313 DAFVKLTDQVAGTDPEEE 330 (330) T ss_pred cceEEEEeccCCcCCCCC Confidence 999999988866655554 No 67 >protein:vir:41 Length: 299 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463467;swissprot:trembl:q9t1b7;genbank:gi:16798789;uniprot:Q9T1B7;genbank:GeneID:922353 Probab=100.00 E-value=1.9e-55 Score=320.62 Aligned_cols=273 Identities=16% Similarity=0.132 Sum_probs=233.9 Q ss_pred HHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeecccCccceEEeeccCCccccchhccccccccccccc Q lcl|Aclame:pro 111 
VSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQ 190 (408) Q Consensus 111 ~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~ 190 (408) ....++...+.++||.+||++++++|++.+++.++|+++|+++++++.... ++... .+.+.|++|+++++++ .++ T Consensus 1 ~g~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~--~~~~~--~~~a~~v~E~~~~~~~-~~~ 75 (299) T protein:vir:41 1 MGFNPDTTTMQSAKTGSIPINISEQIITGVKNGSAAMKLAKAVPMTKPEEE--FTFMS--GVGAFWVDEAERIQTS-KPT 75 (299) T ss_pred CCcCCCcccccCCCceecchhHHHHHHHHHHhcchhhhhceeeecCCCcEE--EEEEc--CCceeeeecCcccccc-ccc Confidence 333455666778889999999999999999999999999999998765544 44443 3567999999999975 589 Q ss_pred ceeeeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhccccccch--------------hhhhhHH Q lcl|Aclame:pro 191 LTIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVMKAAPKK--------------PTIAKFD 256 (408) Q Consensus 191 f~~v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~~g~g~~~~~--------------~~~~~~d 256 (408) |++|++.+++++++++||+|+++|+.++|++||.+.|++++++++|.++++|+|++.+. .+...++ T Consensus 76 f~~v~l~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l~G~g~~~~~gil~~~~~~~~~~~~~~~~~~ 155 (299) T protein:vir:41 76 FTKAKMRSKKMGVIIPTTKENLNYSVTNFFSLMQAEIVEAFYKKFDQAVFTGVESPYNWNILKSATDASNLVEETANKYD 155 (299) T ss_pred eeEEEEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccccccccccccceeeccccccHH Confidence 99999999999999999999999999999999999999999999999999999876542 2334578 Q ss_pred HHHHHHHHhhhhhccCCCEEEEcHHHHHHHHhhhcccCceeeccccccCCcccccccceEeeccccccccccCcceEEEE Q lcl|Aclame:pro 257 DVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYG 336 (408) Q Consensus 257 ~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~g 336 (408) ++++++.. +...+..+++|+|||++|.+|+++||++|+|+|++++.++. ++|+|+||+++++ +|. 
+.++..++|| T Consensus 156 ~l~~~~~~-l~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~-~~l~G~PV~~~~~--~~~-~~~~~~~~~g 230 (299) T protein:vir:41 156 DLNEAIGL-IEAEDLEPNGIATIRKQRVKYRSTKDGNGMPIFNTATSNGV-DDVLGLPIAYTPK--YTF-GDKDISELVG 230 (299) T ss_pred HHHHHHHh-hhcccCCcCEEEEcHHHHHHHHHhhccCCceeecCCcCCCC-ceecceeeEEecc--cCC-CCCceEEEEE Confidence 88888764 78888999999999999999999999999999998777654 5899999999765 454 3466779999 Q ss_pred ehhcceEeeeccceEEEEeccch------------hhhhhceeeEEEEeeeCcEEecccceEEEEeeccc Q lcl|Aclame:pro 337 DMSQAITLFDRENMSLLPTNIGA------------GAFETDTTKIRVIDRFDVKATDSEALVAGSFSAIA 394 (408) Q Consensus 337 d~~~~~~~~~~~~~~i~~~~~~~------------~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~ 394 (408) ||++++ +.+|++++++++++.+ +.|++|++.+|++.|+|+++.+|+||++++.++.- T Consensus 231 dfs~~~-i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~A~~~l~~~aa~ 299 (299) T protein:vir:41 231 DWNQAY-YGILRGVEYEILTEATLTTVADETGKPLNLAERDMAAIKATFEVGFMVVKDEAFSAVQPKAGN 299 (299) T ss_pred ecccEE-EEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeccCC Confidence 999854 6789999999987643 35899999999999999999999999999988877 No 68 >protein:vir:97148 Length: 324 # NCBI annotation: ORF010 # Family: family:all:507 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239726;genbank:gi:66394880;genbank:GeneID:5130881 Probab=100.00 E-value=2.5e-54 Score=314.40 Aligned_cols=298 Identities=14% Similarity=0.099 Sum_probs=236.4 Q ss_pred cchhhhHHHHHHHHHHHhhcchhhHHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeecccCccceE Q lcl|Aclame:pro 84 KSENELKDKFVKDFVNMVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRV 163 (408) Q Consensus 84 ~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~~~ 163 (408) ..+........+.|....+.+. +.++....+..+||++||++++++|++.+++.++|+++++++++++.+ +. 
T Consensus 1 ~~~~~~~~~~~~~f~~~~~~~~------~~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~--~~ 72 (324) T protein:vir:97 1 MEQTQKLKLNLQHFASNNVKPQ------VFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTE--KK 72 (324) T ss_pred CccchhHHHHHHHHHHhhhhhh------hhccccccccCCCcceechhHHHHHHHHHHhhcchhhhcceeeccCCc--eE Confidence 2222222333444555444332 234555666778899999999999999999999999999999987654 44 Q ss_pred EeeccCCccccchhcccccccccccccceeeeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhcc Q lcl|Aclame:pro 164 YEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVM 243 (408) Q Consensus 164 ~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~~g~ 243 (408) +|+.. ..+.+.|++|++.+|+ +.++|++++++++|++++++||+|+++|+.+++++||.++|++++++++|.++++|+ T Consensus 73 ip~~~-~~~~a~~v~Eg~~~~~-~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l~G~ 150 (324) T protein:vir:97 73 FTFWA-DKPGAYWVGEGQKIET-SKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQ 150 (324) T ss_pred EEEEe-cCcceeEeccCccccc-cccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhccC Confidence 55544 3467799999999996 458999999999999999999999999999999999999999999999999999999 Q ss_pred ccccch--------------hhhhhHHHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHhhhcccCceeeccccccCCccc Q lcl|Aclame:pro 244 KAAPKK--------------PTIAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNSYL 309 (408) Q Consensus 244 g~~~~~--------------~~~~~~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~ 309 (408) |++... 
.+...++++++++ ..+...++.+++|+|||++|..|+++||++|+|+|.+ +.+++ T Consensus 151 g~~~~~~gi~~~~~~~~~~~~~~~~~~~i~~~~-~~l~~~~~~~~~~v~n~~~~~~L~~lkd~~g~~~~~~----~~~~t 225 (324) T protein:vir:97 151 GNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLE-ALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD----RNSDT 225 (324) T ss_pred CCCccCccccccccccceeccccCCHHHHHHHH-HhhhhccCCCCEEEEcHHHHHHHHHhhcCCCceeecC----CCCcc Confidence 876422 1333578888776 4588889999999999999999999999999999863 34568 Q ss_pred ccccceEeeccccccccccCcceEEEEehhcceEeeeccceEEEEeccc------------hhhhhhceeeEEEEeeeCc Q lcl|Aclame:pro 310 IKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIG------------AGAFETDTTKIRVIDRFDV 377 (408) Q Consensus 310 l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~------------~~~f~~~~~~~r~~~r~d~ 377 (408) |+|+||++++ +...++..++||||++++ +.++++++++++++. ++.|++|++.||++.|+|+ T Consensus 226 l~G~PV~~~~-----~~~~~~~~~~~gd~~~~~-i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r~d~ 299 (324) T protein:vir:97 226 LDGLPVVNLK-----SSNLKRGELITGDFDKLI-YGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVAL 299 (324) T ss_pred ccceeeEeec-----CCCCCcceEEEEecccEE-EEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEecc Confidence 9999998754 334556779999999865 568999999998764 3569999999999999999 Q ss_pred EEecccceEEEEeeccccCCCCccC Q lcl|Aclame:pro 378 KATDSEALVAGSFSAIADQVGNFKT 402 (408) Q Consensus 378 ~v~~~~a~~~l~~~~~~~~~~~~~~ 402 (408) ++.+|+||++++.+........... 
T Consensus 300 ~v~~~~a~~~l~~~~~~~~~~~~~~ 324 (324) T protein:vir:97 300 HIADDKAFAKLVPADKKTDSVPGEV 324 (324) T ss_pred EEecccceEEEEeccCCCCCCCCCC Confidence 9999999999987554332211111 No 69 >protein:vir:9574 Length: 300 # NCBI annotation: gp40 # Family: family:all:966 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862879;genbank:gi:32469471;genbank:GeneID:1461316 Probab=100.00 E-value=8.1e-55 Score=317.13 Aligned_cols=270 Identities=11% Similarity=0.024 Sum_probs=223.0 Q ss_pred hhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeecccCccceEEeeccCCccccchhcccccccccccccceeee Q lcl|Aclame:pro 116 ETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIK 195 (408) Q Consensus 116 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~ 195 (408) |..+++ ++|++||++++.+|++.+++.++++++|+++++++.... +|.... .+.++|++|++++|+ +.++|++++ T Consensus 1 ma~~t~-~~G~lip~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~~--~p~~~~-~~~a~wv~Eg~~~~~-s~~~f~~v~ 75 (300) T protein:vir:95 1 MSEAQL-SKGNLFNPELVTKVINKVKGHSSIAKLSPQKPIPFNGQR--EFVFDF-DSDIDIVAENGKKTH-GGVSLDPVT 75 (300) T ss_pred Cccccc-CCcceechhhHHHHHHHHHhhhhhhhhcceeeccCCceE--EEEEec-CcceEEeeCCccccc-ccccceeeE Confidence 555444 457789999999999999999999999999988765444 555443 467899999999996 558999999 Q ss_pred echheeeeehHHHHHHHh---cchHHHHHHHHHHHHHHHHHHHHHHHhhccccccc--------------------hhhh Q lcl|Aclame:pro 196 YLIKRYAGIITATNTSLK---DTAENILAWLSSWIAKKVVVTRNQAIIEVMKAAPK--------------------KPTI 252 (408) Q Consensus 196 ~~~~~~~~~~~iS~ell~---ds~~~~~~~v~~~l~~~~~~~~~~~~~~g~g~~~~--------------------~~~~ 252 (408) +++||++++++||+||+. |+.++++++|.++|++++++++|.++++|++.+.+ ..+. 
T Consensus 76 l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~l~~aia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 155 (300) T protein:vir:95 76 IVPLKVEYGARVSDEFLHASEEAKVDMLTDFVEGFSKKLARGLDIMSIHGINPRTKQASTIIGDNCFDKKVTQTVPFKDT 155 (300) T ss_pred eeeEEEEEeehhhHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCCCCCcccccccccccccceeeccccc Confidence 999999999999999994 67789999999999999999999999999532111 1122 Q ss_pred hhHHHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHhhhcccCceeeccccccCCcccccccceEeeccccccccc-cCcc Q lcl|Aclame:pro 253 AKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNSYLIKGKQVIVVADRWLPNTG-STVY 331 (408) Q Consensus 253 ~~~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~-~~~~ 331 (408) ..++++.+++ ..+...++.+++|+|||.++.+|++|||++|+|+|.+...++.+++|+|+||++++. +|... .... T Consensus 156 ~~~~~i~~~~-~~~~~~~~~~~~~vmn~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~l~G~Pv~~s~~--v~~~~~~~~~ 232 (300) T protein:vir:95 156 NPDESMEDAV-GMIDGSERDITGAILDPIFTTALSKMKNAEGGKLYPELAWGGVPDAINGLAVDKNRT--VSYSQTDPKN 232 (300) T ss_pred chHHHHHHHH-HHhhhcCCCccEEEECHHHHHHHHHhhccCCCeeccCccccCCCceecceeeEEecC--CCCCCCCCcc Confidence 2345565555 456677788889999999999999999999999998888888889999999998654 45433 3445 Q ss_pred eEEEEehhcceEeeeccceEEEEeccch------hhhhhceeeEEEEeeeCcEEecccceEEEEeecc Q lcl|Aclame:pro 332 PLYYGDMSQAITLFDRENMSLLPTNIGA------GAFETDTTKIRVIDRFDVKATDSEALVAGSFSAI 393 (408) Q Consensus 332 ~~~~gd~~~~~~~~~~~~~~i~~~~~~~------~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~ 393 (408) .+++|||++++.+..|++++++++++.. 
+.|++|++.+|+++|+|+++.+|+||++|+.++- T Consensus 233 ~~~~GDf~~~~~~~~~~~~~~~v~~~~~~d~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~g 300 (300) T protein:vir:95 233 TAIVGDFETMFKWGYAKEVPMEIIKYGDPDNSGRDLKGYNQIYIRCEAYIGWGIMDAASFARIVKTGG 300 (300) T ss_pred EEEEeeccceEEEEEecccEEEEeeccCCCCcchhhhhcCcEEEEEEEeecceeecccceEEEecCCC Confidence 6788999998877789999999987643 3599999999999999999999999999987777 No 70 >protein:vir:1638 Length: 298 # NCBI annotation: Structural protein # Family: family:all:966 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695059;genbank:gi:23455750;genbank:GeneID:955469 Probab=100.00 E-value=9.5e-55 Score=316.76 Aligned_cols=266 Identities=13% Similarity=0.009 Sum_probs=221.3 Q ss_pred ccccCceecchhhhhhhhhhhhhhhhhhhhhceeecccCccceEEeeccCCccccchhcccccccccccccceeeeechh Q lcl|Aclame:pro 120 SDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIK 199 (408) Q Consensus 120 t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~ 199 (408) ...+||++||++++.+|++.+++.++++++|+++++++.. ..+|... ..+.++|++|++++|++ .++|++++++++ T Consensus 1 ma~~gG~lvp~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~--~~ip~~~-~~~~a~~v~E~~~~~~~-~~~f~~v~l~~~ 76 (298) T protein:vir:16 1 MVLNKGTLFDPTLVTDLISKVAGKSSIARLSAQKPIPFNG--EKVFTFT-MDSEIDVVAESGKKTHG-GVTLAPQTMVPI 76 (298) T ss_pred CcccCcceechhHHHHHHHHHHhhhhhhhhcceeeccCCc--eEEEEEe-cCcceEEecCCcccccc-ccceeEEEEeee Confidence 4466789999999999999999999999999999987654 4455544 34678999999999965 589999999999 Q ss_pred eeeeehHHHHHHHh---cchHHHHHHHHHHHHHHHHHHHHHHHhhcccccc--c--h------------------hhhhh Q lcl|Aclame:pro 200 RYAGIITATNTSLK---DTAENILAWLSSWIAKKVVVTRNQAIIEVMKAAP--K--K------------------PTIAK 254 (408) Q Consensus 200 ~~~~~~~iS~ell~---ds~~~~~~~v~~~l~~~~~~~~~~~~~~g~g~~~--~--~------------------~~~~~ 254 (408) |+++++++|+||++ |+.++|++||.++|++++++++|.++++|.+.++ + . .+... 
T Consensus 77 k~a~~~~iS~ell~~s~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (298) T protein:vir:16 77 KVEYGARISDEFMYASDEEKINILQEFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADP 156 (298) T ss_pred eEEEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHHhhccccCCCCcccccccccccccccccccccccccccH Confidence 99999999999995 5668999999999999999999999999954221 1 0 00111 Q ss_pred HHHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHhhhcccCceeeccccccCCcccccccceEeecccccccc-ccCcceE Q lcl|Aclame:pro 255 FDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNSYLIKGKQVIVVADRWLPNT-GSTVYPL 333 (408) Q Consensus 255 ~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~-~~~~~~~ 333 (408) ++++.+++ ..+..++..+++|+|||+++..|+++||++|+|+|++.+.++.+++|+|+||++++. +|.. ..++..+ T Consensus 157 ~~~i~~~~-~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~l~G~PV~~~~~--v~~~~~~~~~~~ 233 (298) T protein:vir:16 157 NGAIENAV-ELLTGVDADVTGIAINPSFRSALAKQKDLQDNALFPELKWGATPDTINGLPVDVNKT--VSDMSLTQRDRA 233 (298) T ss_pred HHHHHHHH-HHhhhcCCCccEEEEcHHHHHHHHHhhccCCCeeecCcccCCCCceecceeeEEecc--cccccCCCccEE Confidence 34455555 456677888889999999999999999999999999988888899999999998654 4443 3456678 Q ss_pred EEEehhcceEeeeccceEEEEeccch------hhhhhceeeEEEEeeeCcEEecccceEEEEeec Q lcl|Aclame:pro 334 YYGDMSQAITLFDRENMSLLPTNIGA------GAFETDTTKIRVIDRFDVKATDSEALVAGSFSA 392 (408) Q Consensus 334 ~~gd~~~~~~~~~~~~~~i~~~~~~~------~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~ 392 (408) ++|||++++.+..|++++++++++.. 
+.|++|++.+|++.|+|+++++|+||++|+..+ T Consensus 234 ~~GDfs~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~ra~~r~d~~v~~~~a~~~l~~at 298 (298) T protein:vir:16 234 IIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) T ss_pred EEeeccceEEEEEecCceEEEeeccCCcCcchhhhhcCcEEEEEEEEEccEeecccceEEEeecC Confidence 99999998877889999999977532 359999999999999999999999999997766 No 71 >protein:vir:9643 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795405;genbank:gi:28876178;genbank:GeneID:1257724 Probab=100.00 E-value=1.8e-53 Score=309.81 Aligned_cols=334 Identities=12% Similarity=0.055 Sum_probs=233.6 Q ss_pred CChHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccc Q lcl|Aclame:pro 1 MGVKL-TVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEK 79 (408) Q Consensus 1 M~~~~-~i~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (408) |.+.+ ++++++++++++.+.+++. ...+++.+.+.+.+..+..++.+.... +........ T Consensus 1 M~i~~~~~~~~~e~~~~l~~~~~~~-----------~~~e~~~~~~~~~~~~~~~~~~~~~~~----e~~~~~~~~---- 61 (377) T protein:vir:96 1 MAINLKELPKYREAVAELSAKISAG-----------ATPEEQEKLFEAAFTTMGDEILAKNEE----EMERMFDLR---- 61 (377) T ss_pred CCccHHHHHHHHHHHHHHHHHHhhc-----------ccHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHhc---- Confidence 77764 3455554444443333221 111112222222222222211111000 000000000 Q ss_pred cccccchhhhHHHHHHHHHHHhhcchhhHHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeecccCc Q lcl|Aclame:pro 80 GPLNKSENELKDKFVKDFVNMVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSN 159 (408) Q Consensus 80 ~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~ 159 (408) ........++++.|.+.. 
..+++++||++||+++..+|++.+.+.++++++|++.++++ T Consensus 62 ----~~~~~lt~ee~~~~~~~~---------------~~~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~~-- 120 (377) T protein:vir:96 62 ----DKNRELTAEEIKFFNDID---------------KNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSL-- 120 (377) T ss_pred ----cCCcccCHHHHHHHHHHH---------------hcCCCCCCceecCHHHHHHHHHHHHhhhhhhhhceeEecCC-- Confidence 011122334455443322 34577889999999999999999999999999999998753 Q ss_pred cceEEeeccCCccccchhcccccccccccccceeeeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 160 GSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAI 239 (408) Q Consensus 160 g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~ 239 (408) ...++.. +..+.+.|++|++++++.+.++|+++++.+|+++++++||++||+|+.+++++||.+.|+++++++++.+| T Consensus 121 -~~~i~~~-~~~~~a~wv~e~~~~~~~~~~~f~~i~l~~~kl~~~~~is~~ll~ds~~~le~~i~~~l~~~~~~~~~~a~ 198 (377) T protein:vir:96 121 -RLKALTA-ETSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAI 198 (377) T ss_pred -ceEEEEe-cCCcceeEeecccccccccCccceeEeeeeeeEEeechhhHHHhhcchhhHHHHHHHHHHHHHHHHHhhce Confidence 4455554 45578899999998877778999999999999999999999999999999999999999999999999999 Q ss_pred hhccccccchhhh-------------------------------hhHHHHHHHHHHhhhh-----------hccCCCEEE Q lcl|Aclame:pro 240 IEVMKAAPKKPTI-------------------------------AKFDDVITMINTAVDP-----------AIIATSSLL 277 (408) Q Consensus 240 ~~g~g~~~~~~~~-------------------------------~~~d~i~~~~~~~l~~-----------~~~~~a~~~ 277 (408) ++|+|.+.|.+-. .+.+.+++.+.. +.. 
.+.++++|+ T Consensus 199 i~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-l~~~~~~~~~~~~~~~~~~a~~~ 277 (377) T protein:vir:96 199 VKGNGLLQPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVP-VMKHLSVNDKKHPLKIAGQVKLL 277 (377) T ss_pred EeccCCCcceeeeeccccccccccccccccceeeccccccccccCChhHHHHHHHH-HHHhhccccccccccccCceEEE Confidence 9999977553211 123344443322 222 234577899 Q ss_pred EcHHHHHHHHhhhcccCceeeccccccCCcccccccceEeeccccccccccCcceEEEEehhcceEeeeccceEEEEecc Q lcl|Aclame:pro 278 TNQSGLNKLALVKTAEGKYLLEPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNI 357 (408) Q Consensus 278 ~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~ 357 (408) |||.++..+ .|+|.|++ .+|.+.+++|+|+.++.+..+|. +.++||||++ |.+.+|++++|+.+++ T Consensus 278 mn~~t~~~~------~~~~~~~~--~~G~~~~~l~~p~~v~~s~~~p~-----~~i~fgdf~~-Y~i~~r~~~~i~~~~~ 343 (377) T protein:vir:96 278 LNPEDRWTL------EAKFTSRN--QFGEYVTVLPHGITILESLAVET-----GKAIAFVANR-YDAFMATASTIEEYDQ 343 (377) T ss_pred EchhhHHhc------cccccccC--CCCCceeccCCCceEEecCCCCc-----ccEEEEEcCc-EEEEEecccEEEeehh Confidence 999987654 57788875 35666789999988776655664 3489999998 7888999999999987 Q ss_pred chhhhhhceeeEEEEeeeCcEEecccceEEEEeecc Q lcl|Aclame:pro 358 GAGAFETDTTKIRVIDRFDVKATDSEALVAGSFSAI 393 (408) Q Consensus 358 ~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~ 393 (408) .+ |.+|++.||+..|+||++++|+||++++++-. 
T Consensus 344 ~~--~~~d~~~f~~~~r~dG~~~d~~a~~vl~l~~~ 377 (377) T protein:vir:96 344 TF--AMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) T ss_pred hh--hhcCCeEEEEEEEEcCEEecCCcEEEEEEecC Confidence 64 99999999999999999999999999999877 No 72 >protein:vir:9309 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803287;genbank:gi:29028597;genbank:GeneID:1258044 Probab=100.00 E-value=8e-54 Score=311.68 Aligned_cols=298 Identities=14% Similarity=0.091 Sum_probs=233.2 Q ss_pred cchhhhHHHHHHHHHHHhhcchhhHHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeecccCccceE Q lcl|Aclame:pro 84 KSENELKDKFVKDFVNMVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRV 163 (408) Q Consensus 84 ~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~~~ 163 (408) ..+........+.|......+ .+.++..+.+..+++.+||++++++|++.+++.++|+++|+++++++.... T Consensus 1 ~~~~~~~~~~~~~f~~~~~~~------~~~~a~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~-- 72 (324) T protein:vir:93 1 MEQTQKLKLNLQHFASNNVKP------QVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKK-- 72 (324) T ss_pred CchhHHHHHHHHHHHHhhhhh------hhcccccccccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccCCceE-- Confidence 222222222233344433322 122455556666778899999999999999999999999999998765544 Q ss_pred EeeccCCccccchhcccccccccccccceeeeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhcc Q lcl|Aclame:pro 164 YEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVM 243 (408) Q Consensus 164 ~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~~g~ 243 (408) +|+.. ..+.+.|++|++.+|+. 
.++|++++++++|++++++||+|+++||.+++++||.++|++++++++|.++++|+ T Consensus 73 ip~~~-~~~~a~~v~Eg~~~~~~-~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l~G~ 150 (324) T protein:vir:93 73 FTFWA-DKPGAYWVGEGQKIETS-KATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQ 150 (324) T ss_pred EEEEe-cCcceeeecCCcccccc-ccceeEEEEEeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhcCC Confidence 44443 34667999999999975 58999999999999999999999999999999999999999999999999999998 Q ss_pred ccccch--------------hhhhhHHHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHhhhcccCceeeccccccCCccc Q lcl|Aclame:pro 244 KAAPKK--------------PTIAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNSYL 309 (408) Q Consensus 244 g~~~~~--------------~~~~~~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~ 309 (408) |++... .+...++++++++. .+...+..+++|+|||++|..|++++|++|+|++.+ +.+++ T Consensus 151 g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~-~l~~~~~~~~~~v~n~~~~~~L~~l~d~~G~~~~~~----~~~~~ 225 (324) T protein:vir:93 151 GNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLEA-LLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD----RNSDS 225 (324) T ss_pred CCCCcCccccccccccceeccccccHHHHHHHHH-hhhhccCCCCEEEEcHHHHHHHHHhhCCCCCeeecC----CCCCc Confidence 865321 23345788887775 478888888999999999999999999999999863 35678 Q ss_pred ccccceEeeccccccccccCcceEEEEehhcceEeeeccceEEEEeccch------------hhhhhceeeEEEEeeeCc Q lcl|Aclame:pro 310 IKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGA------------GAFETDTTKIRVIDRFDV 377 (408) Q Consensus 310 l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~------------~~f~~~~~~~r~~~r~d~ 377 (408) |+|+||+++++ ...+++.+++|||+++ .+..+++++++++++.. 
+.|++|++.||+++|+|+ T Consensus 226 l~G~PVv~~~~-----~~~~~~~i~~gdfs~~-~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~n~~~~r~~~r~d~ 299 (324) T protein:vir:93 226 LDGLPVVNLKS-----SNLKRGELITGDFDKL-IYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVAL 299 (324) T ss_pred ccceeeEeecC-----CCCCcceEEEEecceE-EEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEecc Confidence 99999987542 3456678999999985 46789999999988753 569999999999999999 Q ss_pred EEecccceEEEEeeccccCCCCccCCCcccC Q lcl|Aclame:pro 378 KATDSEALVAGSFSAIADQVGNFKTTTSTAV 408 (408) Q Consensus 378 ~v~~~~a~~~l~~~~~~~~~~~~~~~~~~~~ 408 (408) ++.+|+||++|+.......+.... | T Consensus 300 ~v~~~~a~~~l~~a~~~~~~~~~~------~ 324 (324) T protein:vir:93 300 HIADDKAFAKLVPADKRTDSVPGE------V 324 (324) T ss_pred EEecccceEEEecccccCCCCCCC------C Confidence 999999999998554443332222 2 No 73 >protein:vir:8187 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817980;genbank:gi:29566414;genbank:GeneID:2700968 Probab=100.00 E-value=2.8e-54 Score=314.18 Aligned_cols=270 Identities=10% Similarity=0.030 Sum_probs=219.3 Q ss_pred ccccccCceecchhhhhhhhhhhhhhhhhhhhhceeecccCccceEEeeccCCccccchhcccccccccccccceeeeec Q lcl|Aclame:pro 118 SGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYL 197 (408) Q Consensus 118 ~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~ 197 (408) =.+.++||++||+++.+.|++.+++.++++++|++++++++. ..+|... ..+.++|++|++++|+ +.++|+++++. 
T Consensus 1 mat~~~gg~lvP~~~~~~ii~~~~~~s~i~~~~~~i~~~~~~--~~~p~~~-~~~~a~wv~Eg~~~~~-~~~~f~~v~l~ 76 (311) T protein:vir:81 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGE--QQYMTLT-APPRGEVVGEGAQKSE-STATFAPVTAI 76 (311) T ss_pred CceecCCceEcchhHHHHHHHHHHhcchhhhhcceeecCCCc--eEEEEEe-CCceeEEeecCccccc-ccceeeEEEEe Confidence 235567899999999999999999999999999999887654 5555553 3467799999999996 56899999999 Q ss_pred hheeeeehHHHHHHHh---cchHHHHHHHHHHHHHHHHHHHHHHHhhccccccch--------------------hhhhh Q lcl|Aclame:pro 198 IKRYAGIITATNTSLK---DTAENILAWLSSWIAKKVVVTRNQAIIEVMKAAPKK--------------------PTIAK 254 (408) Q Consensus 198 ~~~~~~~~~iS~ell~---ds~~~~~~~v~~~l~~~~~~~~~~~~~~g~g~~~~~--------------------~~~~~ 254 (408) ++|+++++++|+||++ |+.++|+++|.++|++++++++|.++++|++.++.. ..... T Consensus 77 ~~kl~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~a~l~G~~~~~~~~~~gi~~~~~~~~~~~~~~~~~~~~ 156 (311) T protein:vir:81 77 PRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTSAT 156 (311) T ss_pred eEEEEEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHhhhccccCCCCcccccccccccccceeeeecccccch Confidence 9999999999999996 566789999999999999999999999997533211 01112 Q ss_pred HHHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHhhhcccCceeeccccccCCcccccccceEeecccccc---------- Q lcl|Aclame:pro 255 FDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNSYLIKGKQVIVVADRWLP---------- 324 (408) Q Consensus 255 ~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~---------- 324 (408) .+..+..+...+........+|+|||.++.+|++|||++|+|+|.+....+.+++|+|+||++++. 
+| T Consensus 157 ~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~tl~G~Pv~~~~~--i~~~~~~~~~~~ 234 (311) T protein:vir:81 157 PDLAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDT--VRGGPEAVTAST 234 (311) T ss_pred HHHHHHHHHHHhhhcCCCceEEEEcHHHHHHHHhhhccCCCeeecCccccCCCceecceeEEeccc--cccccccccccc Confidence 333344443444444455557999999999999999999999999888888899999999998543 33 Q ss_pred ---ccccCcceEEEEehhcceEeeeccceEEEEeccch-----hhhhhceeeEEEEeeeCcEEecccceEEEEeeccc Q lcl|Aclame:pro 325 ---NTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGA-----GAFETDTTKIRVIDRFDVKATDSEALVAGSFSAIA 394 (408) Q Consensus 325 ---~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~-----~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~ 394 (408) ....++..++||||++++ +..+++++++++++.+ +.|++|++.+|++.|+|+++++|+||++|+....+ T Consensus 235 ~~~~~~~~~~~~~~gDfs~~~-i~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~r~~~r~d~~v~~~~a~~~l~~a~~~ 311 (311) T protein:vir:81 235 GVYRTTNPNVKAIAGDFSAFR-WGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) T ss_pred chhcccCCccEEEEEecccEE-EEEeccceEEEeccCCCCcchhhhhcCcEEEEEEEEeccEeecccceEEEEeeccC Confidence 233456678999999855 4578999999987643 45999999999999999999999999999887777 No 74 >protein:vir:9759 Length: 303 # NCBI annotation: putative structural protein # Family: family:all:966 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795521;genbank:gi:28876283;genbank:GeneID:1257824 Probab=100.00 E-value=2.9e-54 Score=314.10 Aligned_cols=269 Identities=13% Similarity=0.083 Sum_probs=223.7 Q ss_pred ccccccCceecchhhhhhhhhhhhhhhhhhhhhceeecccCccceEEeeccCCccccchhcccccccccccccceeeeec Q lcl|Aclame:pro 118 SGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYL 197 (408) Q Consensus 118 ~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~ 197 (408) -++.++||++||++++.+|++.+++.++++++|+++++++.... +|... 
..+.+.|++|++++|+ +.++|++++++ T Consensus 1 m~t~t~gg~liP~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~~--ip~~~-~~~~a~wv~E~~~~~~-s~~~f~~v~l~ 76 (303) T protein:vir:97 1 MGTETSKASLFDKHLVSDLINKVKGHSSLAKLSSQKPIPFNGSK--EFTFT-LDSDIDVVAENGKKTH-GGLSLEPVTIV 76 (303) T ss_pred CcccCCCCeEcchhHHHHHHHHHHhhchhhhhcceeecCCCceE--EEEEe-cCcceEEeecCccccc-cccceeeEEee Confidence 34667789999999999999999999999999999998765544 44443 3457799999999996 55899999999 Q ss_pred hheeeeehHHHHHHH---hcchHHHHHHHHHHHHHHHHHHHHHHHhhccccccc---------------------hhhhh Q lcl|Aclame:pro 198 IKRYAGIITATNTSL---KDTAENILAWLSSWIAKKVVVTRNQAIIEVMKAAPK---------------------KPTIA 253 (408) Q Consensus 198 ~~~~~~~~~iS~ell---~ds~~~~~~~v~~~l~~~~~~~~~~~~~~g~g~~~~---------------------~~~~~ 253 (408) ++|+++++++|+||+ .|+.++|++||.++|++++++++|.++++|++..+. ..+.. T Consensus 77 ~~kl~~~~~iS~ell~~~~d~~~~l~~~i~~~la~a~~~~ld~a~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (303) T protein:vir:97 77 PIKVEYGARLSDEFLYATEEEKIDILKAFNEGFAKKLARGIDLMAMHGINPRTKKASDVIGTNHFDSKVTQVVKFTESED 156 (303) T ss_pred eEEEEEeehhhHHHhhcCccchHHHHHHHHHHHHHHHHHHHHhhhhcccccCCccccccccccccccccccccccccccc Confidence 999999999999999 467789999999999999999999999999642111 01223 Q ss_pred hHHHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHhhhcccCceeecccccc-CCcccccccceEeeccccccc---cccC Q lcl|Aclame:pro 254 KFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTK-PNSYLIKGKQVIVVADRWLPN---TGST 329 (408) Q Consensus 254 ~~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~-~~~~~l~G~pv~~~~~~~~~~---~~~~ 329 (408) .++++.+++. .+...+..++.|+|||+++.+|+++||++|+|+|+++... +.+++|+|+||+++++ +|. 
...+ T Consensus 157 ~~~~i~~~~~-~~~~~~~~~~~~vmn~~~~~~L~~lkd~~g~~~~~~~~~~~~~~~~l~G~Pv~~s~~--v~~~~~~~~~ 233 (303) T protein:vir:97 157 ADANIEAAVN-LIQGAEGVVTGLAMDTEFSTALAKVTNGEMGPKMYPELAWGANPDSINGLKSSVNTT--VGAGADEAES 233 (303) T ss_pred hHHHHHHHHH-HHhhcCCCccEEEEcHHHHHHHHHhhccCCCeEEecCccCCCCCceecceeeEEecc--cCCccccCCC Confidence 3567777664 4666778888899999999999999999999999988654 4567999999999764 443 2345 Q ss_pred cceEEEEehhcceEeeeccceEEEEeccch------hhhhhceeeEEEEeeeCcEEecccceEEEEeecc Q lcl|Aclame:pro 330 VYPLYYGDMSQAITLFDRENMSLLPTNIGA------GAFETDTTKIRVIDRFDVKATDSEALVAGSFSAI 393 (408) Q Consensus 330 ~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~------~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~ 393 (408) ...++||||+..|.+..|++++++++++.. +.|++|++.+|++.|+|+++++|+||++|+-..+ T Consensus 234 ~~~~~~Gdf~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~n~~~~r~~~r~~~~v~~p~af~~l~~~~~ 303 (303) T protein:vir:97 234 KDLVIIGDFESMFKWGYAKQIPMEIIKYGDPDNSGKDLKGYNQIYLRAEAYIGWGILDAKSFARVTKGEV 303 (303) T ss_pred ccEEEEeeccccEEEEEecCcEEEEeeccCCCCcchhhhhcCcEEEEEEEEeccEeecccceEEeeCCCC Confidence 667999999887878889999999987643 3599999999999999999999999999998877 No 75 >protein:vir:78830 Length: 324 # NCBI annotation: major head protein # Family: family:all:507 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285361;genbank:gi:148717889;genbank:GeneID:5246961 Probab=100.00 E-value=1.1e-53 Score=310.99 Aligned_cols=298 Identities=14% Similarity=0.103 Sum_probs=235.1 Q ss_pred cchhhhHHHHHHHHHHHhhcchhhHHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeecccCccceE Q lcl|Aclame:pro 84 KSENELKDKFVKDFVNMVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRV 163 (408) Q Consensus 84 ~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~~~ 163 (408) +.+........+.|....... ...++....+.++||++||+++.+.|++.+++.++|+++++++++++.+.. 
T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~------~~~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~-- 72 (324) T protein:vir:78 1 MEQTQKLKLNLQHFASNNVKP------QVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKK-- 72 (324) T ss_pred CCcchhhhHHHHHHHHHhhhh------hhhccccccccCcCccccchhHHHHHHHHHHhhchhhhhcceeeccCCceE-- Confidence 222222333334444433322 123344556677889999999999999999999999999999998765444 Q ss_pred EeeccCCccccchhcccccccccccccceeeeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhcc Q lcl|Aclame:pro 164 YEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVM 243 (408) Q Consensus 164 ~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~~g~ 243 (408) +|... ..+.+.|++|++.+|+. .++|++++++++|++++++||+|+++|+.+++++||.++|++++++++|.++++|+ T Consensus 73 ~p~~~-~~~~a~~v~Eg~~~~~~-~~~~~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~l~G~ 150 (324) T protein:vir:78 73 FTFWA-DKPGAYWVGEGQKIETS-KATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQ 150 (324) T ss_pred EEEEe-cCcceeEecCCcccccc-ccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhccC Confidence 55543 34677999999999974 58999999999999999999999999999999999999999999999999999999 Q ss_pred ccccch--------------hhhhhHHHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHhhhcccCceeeccccccCCccc Q lcl|Aclame:pro 244 KAAPKK--------------PTIAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNSYL 309 (408) Q Consensus 244 g~~~~~--------------~~~~~~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~ 309 (408) |++... .+...++++.+++. 
.+...+..+++|+|||++|.+|++++|++|+|++.+ +.+++ T Consensus 151 g~~~~~~gi~~~~~~~~~~~~~~~t~~~i~~~~~-~l~~~~~~~~~~vmn~~~~~~L~~l~d~~G~~~~~~----~~~~~ 225 (324) T protein:vir:78 151 GNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLEA-LLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD----RNSDS 225 (324) T ss_pred CCCCcCccccccccccceeccccccHHHHHHHHH-hhhhccCCCCEEEEcHHHHHHHHHhhccCCCeeecC----CCCCc Confidence 865432 12334778888774 588889999999999999999999999999999853 45578 Q ss_pred ccccceEeeccccccccccCcceEEEEehhcceEeeeccceEEEEeccch------------hhhhhceeeEEEEeeeCc Q lcl|Aclame:pro 310 IKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGA------------GAFETDTTKIRVIDRFDV 377 (408) Q Consensus 310 l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~------------~~f~~~~~~~r~~~r~d~ 377 (408) |+|+||++++ ....+++.+++|||++++ +..+++++++++++.. +.|++|++.||+++|+|+ T Consensus 226 l~G~PV~~~~-----~~~~~~~~~~~gd~~~~~-~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r~d~ 299 (324) T protein:vir:78 226 LDGLPVVNLK-----SSNLKRGELITGDFDKLI-YGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVAL 299 (324) T ss_pred ccceeeEeeC-----CCCCCcceEEEEecceEE-EEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEcc Confidence 9999998754 334566789999999854 6789999999987643 569999999999999999 Q ss_pred EEecccceEEEEeeccccCCCCccCCCcccC Q lcl|Aclame:pro 378 KATDSEALVAGSFSAIADQVGNFKTTTSTAV 408 (408) Q Consensus 378 ~v~~~~a~~~l~~~~~~~~~~~~~~~~~~~~ 408 (408) ++.+|+||++|+.......+ |+.-| T Consensus 300 ~v~~~~A~~~l~~a~~~~~~------~~~~~ 324 (324) T protein:vir:78 300 HIADDKAFAKLVPADKRTDS------VPGEV 324 (324) T ss_pred EEecccceEEEecccccCCC------CCCCC Confidence 99999999999864443322 22222 No 76 >protein:vir:96392 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239648;genbank:gi:66395381;genbank:GeneID:5132868 Probab=100.00 E-value=1.1e-53 Score=310.99 Aligned_cols=298 Identities=14% Similarity=0.103 Sum_probs=235.1 Q ss_pred cchhhhHHHHHHHHHHHhhcchhhHHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeecccCccceE Q lcl|Aclame:pro 
84 KSENELKDKFVKDFVNMVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRV 163 (408) Q Consensus 84 ~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~~~ 163 (408) +.+........+.|....... ...++....+.++||++||+++.+.|++.+++.++|+++++++++++.+.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~------~~~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~-- 72 (324) T protein:vir:96 1 MEQTQKLKLNLQHFASNNVKP------QVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKK-- 72 (324) T ss_pred CCcchhhhHHHHHHHHHhhhh------hhhccccccccCcCccccchhHHHHHHHHHHhhchhhhhcceeeccCCceE-- Confidence 222222333334444433322 123344556677889999999999999999999999999999998765444 Q ss_pred EeeccCCccccchhcccccccccccccceeeeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhcc Q lcl|Aclame:pro 164 YEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVM 243 (408) Q Consensus 164 ~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~~g~ 243 (408) +|... ..+.+.|++|++.+|+. .++|++++++++|++++++||+|+++|+.+++++||.++|++++++++|.++++|+ T Consensus 73 ~p~~~-~~~~a~~v~Eg~~~~~~-~~~~~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~l~G~ 150 (324) T protein:vir:96 73 FTFWA-DKPGAYWVGEGQKIETS-KATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQ 150 (324) T ss_pred EEEEe-cCcceeEecCCcccccc-ccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhccC Confidence 55543 34677999999999974 58999999999999999999999999999999999999999999999999999999 Q ss_pred ccccch--------------hhhhhHHHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHhhhcccCceeeccccccCCccc Q lcl|Aclame:pro 244 KAAPKK--------------PTIAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNSYL 309 (408) Q Consensus 244 g~~~~~--------------~~~~~~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~ 309 (408) |++... .+...++++.+++. 
.+...+..+++|+|||++|.+|++++|++|+|++.+ +.+++ T Consensus 151 g~~~~~~gi~~~~~~~~~~~~~~~t~~~i~~~~~-~l~~~~~~~~~~vmn~~~~~~L~~l~d~~G~~~~~~----~~~~~ 225 (324) T protein:vir:96 151 GNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLEA-LLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD----RNSDS 225 (324) T ss_pred CCCCcCccccccccccceeccccccHHHHHHHHH-hhhhccCCCCEEEEcHHHHHHHHHhhccCCCeeecC----CCCCc Confidence 865432 12334778888774 588889999999999999999999999999999853 45578 Q ss_pred ccccceEeeccccccccccCcceEEEEehhcceEeeeccceEEEEeccch------------hhhhhceeeEEEEeeeCc Q lcl|Aclame:pro 310 IKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGA------------GAFETDTTKIRVIDRFDV 377 (408) Q Consensus 310 l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~------------~~f~~~~~~~r~~~r~d~ 377 (408) |+|+||++++ ....+++.+++|||++++ +..+++++++++++.. +.|++|++.||+++|+|+ T Consensus 226 l~G~PV~~~~-----~~~~~~~~~~~gd~~~~~-~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r~d~ 299 (324) T protein:vir:96 226 LDGLPVVNLK-----SSNLKRGELITGDFDKLI-YGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVAL 299 (324) T ss_pred ccceeeEeeC-----CCCCCcceEEEEecceEE-EEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEcc Confidence 9999998754 334566789999999854 6789999999987643 569999999999999999 Q ss_pred EEecccceEEEEeeccccCCCCccCCCcccC Q lcl|Aclame:pro 378 KATDSEALVAGSFSAIADQVGNFKTTTSTAV 408 (408) Q Consensus 378 ~v~~~~a~~~l~~~~~~~~~~~~~~~~~~~~ 408 (408) ++.+|+||++|+.......+ |+.-| T Consensus 300 ~v~~~~A~~~l~~a~~~~~~------~~~~~ 324 (324) T protein:vir:96 300 HIADDKAFAKLVPADKRTDS------VPGEV 324 (324) T ss_pred EEecccceEEEecccccCCC------CCCCC Confidence 99999999999864443322 22222 No 77 >protein:vir:105905 Length: 304 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004375;genbank:gi:122891830;genbank:GeneID:4712376 Probab=100.00 E-value=3e-54 Score=313.99 Aligned_cols=271 Identities=15% Similarity=0.120 Sum_probs=227.1 Q ss_pred 
HHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeecccCccceEEeeccCCccccchhcccccccccc Q lcl|Aclame:pro 108 MNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLD 187 (408) Q Consensus 108 ~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~E~~~~~~~~ 187 (408) .......+.++.++++||++||++++++|++.+++.++|+++++++++.+.. +.+|+.. ..+.+.|++|++++|+. T Consensus 1 ma~~~~~~~~~~~t~~gg~lip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~--~~ip~~~-~~~~a~~v~E~~~~~~~- 76 (304) T protein:vir:10 1 MATPTYTPGNVILSDFKNGVIPAEQGTLIMKDIMANSAIMKLAKNEPMTAQK--KKFTYLA-KGVGAYWVSETERIQTS- 76 (304) T ss_pred CcccccccccccccCCCceecchhHHHHHHHHHHhccchhhhcceeeccCCc--eEEEEEe-CCcceEEeecCcccccc- Confidence 1111224555677788999999999999999999999999999999987654 4455554 34667999999999975 Q ss_pred cccceeeeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhccccccch------------------ Q lcl|Aclame:pro 188 NPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVMKAAPKK------------------ 249 (408) Q Consensus 188 ~~~f~~v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~~g~g~~~~~------------------ 249 (408) .++|++++++++|++++++||+|+++|+.++|++||.++|++++++++|.++++|+|++.+. T Consensus 77 ~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~ 156 (304) T protein:vir:10 77 KPEYAQAEMEAKKIGVIIPLSKEFLKWTAKDFFNEVKPLIAEAFYKAFDQAVIFGTKSPYNTSTSGKPLVEGAEEKGNVV 156 (304) T ss_pred cceeeEEEEEEEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhheeccCCCccccccccccccccccccccc Confidence 58999999999999999999999999999999999999999999999999999999875431 Q ss_pred -hhhhhHHHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHhhhcccCceeeccccccCCcccccccceEeecccccccccc Q lcl|Aclame:pro 250 -PTIAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNSYLIKGKQVIVVADRWLPNTGS 328 (408) Q Consensus 250 -~~~~~~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~ 328 (408) .+...++++++++ ..+...+..+++|+|||++|..|+++||++|+|+|.++ +++|+|+||+++++ +|. .. 
T Consensus 157 ~~~~~~~~~i~~~~-~~l~~~~~~~~~~v~~~~~~~~L~~lkd~~G~~l~~~~-----~~~l~G~PV~~~~~--~~~-~~ 227 (304) T protein:vir:10 157 TDTNNLYVDLSALM-ATIEDEELDPNGVLTTRSFRSKMRNALDANDRPLFDAN-----GNEIMGLPLSYTGA--DVY-DK 227 (304) T ss_pred ccccchHHHHHHHH-HHhhhccCCcCEEEEcHHHHHHHHHhhccCCcEeecCC-----CccccceeeEEecc--ccc-CC Confidence 1223477787776 45888899999999999999999999999999999753 46899999998764 454 34 Q ss_pred CcceEEEEehhcceEeeeccceEEEEeccch--------------hhhhhceeeEEEEeeeCcEEecccceEEEEeec Q lcl|Aclame:pro 329 TVYPLYYGDMSQAITLFDRENMSLLPTNIGA--------------GAFETDTTKIRVIDRFDVKATDSEALVAGSFSA 392 (408) Q Consensus 329 ~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~--------------~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~ 392 (408) ++..++||||+++ ++..|++++++++++.. +.|++|++.||+++|+|+++++|+||++|+... T Consensus 228 ~~~~~~~gd~~~~-~~~~~~~~~i~~~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v~~~~a~~~l~~a~ 304 (304) T protein:vir:10 228 KKSLALMGDWDYA-RYGILQGIEYAISEDATLTTLQASDASGQPVSLFERDMFALRATMHIAYMNVKPEAFATLKPTE 304 (304) T ss_pred CCcEEEEEehhhE-EEEEecceEEEEeecceeeeecccccCccchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 5677999999985 46789999999877632 469999999999999999999999999998777 No 78 >protein:vir:94142 Length: 304 # NCBI annotation: ORF013 # Family: family:all:507 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240234;genbank:gi:66395898;genbank:GeneID:5133311 Probab=100.00 E-value=3e-54 Score=313.99 Aligned_cols=271 Identities=15% Similarity=0.120 Sum_probs=227.1 Q ss_pred HHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeecccCccceEEeeccCCccccchhcccccccccc Q lcl|Aclame:pro 108 MNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLD 187 (408) Q Consensus 108 ~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~E~~~~~~~~ 187 (408) .......+.++.++++||++||++++++|++.+++.++|+++++++++.+.. +.+|+.. ..+.+.|++|++++|+. 
T Consensus 1 ma~~~~~~~~~~~t~~gg~lip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~--~~ip~~~-~~~~a~~v~E~~~~~~~- 76 (304) T protein:vir:94 1 MATPTYTPGNVILSDFKNGVIPAEQGTLIMKDIMANSAIMKLAKNEPMTAQK--KKFTYLA-KGVGAYWVSETERIQTS- 76 (304) T ss_pred CcccccccccccccCCCceecchhHHHHHHHHHHhccchhhhcceeeccCCc--eEEEEEe-CCcceEEeecCcccccc- Confidence 1111224555677788999999999999999999999999999999987654 4455554 34667999999999975 Q ss_pred cccceeeeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhccccccch------------------ Q lcl|Aclame:pro 188 NPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVMKAAPKK------------------ 249 (408) Q Consensus 188 ~~~f~~v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~~g~g~~~~~------------------ 249 (408) .++|++++++++|++++++||+|+++|+.++|++||.++|++++++++|.++++|+|++.+. T Consensus 77 ~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~ 156 (304) T protein:vir:94 77 KPEYAQAEMEAKKIGVIIPLSKEFLKWTAKDFFNEVKPLIAEAFYKAFDQAVIFGTKSPYNTSTSGKPLVEGAEEKGNVV 156 (304) T ss_pred cceeeEEEEEEEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhheeccCCCccccccccccccccccccccc Confidence 58999999999999999999999999999999999999999999999999999999875431 Q ss_pred -hhhhhHHHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHhhhcccCceeeccccccCCcccccccceEeecccccccccc Q lcl|Aclame:pro 250 -PTIAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNSYLIKGKQVIVVADRWLPNTGS 328 (408) Q Consensus 250 -~~~~~~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~ 328 (408) .+...++++++++ ..+...+..+++|+|||++|..|+++||++|+|+|.++ +++|+|+||+++++ +|. .. 
T Consensus 157 ~~~~~~~~~i~~~~-~~l~~~~~~~~~~v~~~~~~~~L~~lkd~~G~~l~~~~-----~~~l~G~PV~~~~~--~~~-~~ 227 (304) T protein:vir:94 157 TDTNNLYVDLSALM-ATIEDEELDPNGVLTTRSFRSKMRNALDANDRPLFDAN-----GNEIMGLPLSYTGA--DVY-DK 227 (304) T ss_pred ccccchHHHHHHHH-HHhhhccCCcCEEEEcHHHHHHHHHhhccCCcEeecCC-----CccccceeeEEecc--ccc-CC Confidence 1223477787776 45888899999999999999999999999999999753 46899999998764 454 34 Q ss_pred CcceEEEEehhcceEeeeccceEEEEeccch--------------hhhhhceeeEEEEeeeCcEEecccceEEEEeec Q lcl|Aclame:pro 329 TVYPLYYGDMSQAITLFDRENMSLLPTNIGA--------------GAFETDTTKIRVIDRFDVKATDSEALVAGSFSA 392 (408) Q Consensus 329 ~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~--------------~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~ 392 (408) ++..++||||+++ ++..|++++++++++.. +.|++|++.||+++|+|+++++|+||++|+... T Consensus 228 ~~~~~~~gd~~~~-~~~~~~~~~i~~~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v~~~~a~~~l~~a~ 304 (304) T protein:vir:94 228 KKSLALMGDWDYA-RYGILQGIEYAISEDATLTTLQASDASGQPVSLFERDMFALRATMHIAYMNVKPEAFATLKPTE 304 (304) T ss_pred CCcEEEEEehhhE-EEEEecceEEEEeecceeeeecccccCccchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 5677999999985 46789999999877632 469999999999999999999999999998777 No 79 >protein:vir:4226 Length: 326 # NCBI annotation: observed 35.2Kd protein # Family: family:all:507 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039681;swissprot:sw:q05223;genbank:gi:9625447;uniprot:Q05223;genbank:GeneID:2942929 Probab=100.00 E-value=3.2e-54 Score=313.88 Aligned_cols=289 Identities=15% Similarity=0.138 Sum_probs=228.7 Q ss_pred cccchhhhHHHHHHHHHHHhhcchhhHHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeecccCccc Q lcl|Aclame:pro 82 LNKSENELKDKFVKDFVNMVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGS 161 (408) Q Consensus 82 ~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~ 161 (408) ++...+ ....+....+.|+.+++++++|| +||++++++|++.+++.++|+++++++++.+.... 
T Consensus 1 ~~~~~~---------------r~~~~~~~~e~~a~~~~~~~~g~-~ip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~ 64 (326) T protein:vir:42 1 MAVNPD---------------RTTPFLGVNDPKVAQTGDSMFEG-YLEPEQAQDYFAEAEKISIVQQFAQKIPMGTTGQK 64 (326) T ss_pred CCCCcc---------------chhhhcCcchhhheeccccCCcc-eechhhHHHHHHHHHhcchhhhhcceeeccCCceE Confidence 110000 01112233456777776665555 69999999999999999999999999998765544 Q ss_pred eEEeeccCCccccchhcccccccccccccceeeeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|Aclame:pro 162 RVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIE 241 (408) Q Consensus 162 ~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~~ 241 (408) +|... ..+.+.|++|++++|+. .++|++++++++++++++++|+|+++||.+++++||.++|++++++++|.++++ T Consensus 65 --~p~~~-~~~~a~~v~Eg~~~~~~-~~~f~~i~~~~~k~~~~v~iS~ell~~s~~~~~~~i~~~l~~a~~~~~d~a~l~ 140 (326) T protein:vir:42 65 --IPHWT-GDVSASWIGEGDMKPIT-KGNMTSQTIAPHKIATIFVASAETVRANPANYLGTMRTKVATAFAMAFDNAAIN 140 (326) T ss_pred --EEEEe-CCcceEEecCCcccccc-ccceeEEEEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHhhc Confidence 44444 34667999999999975 589999999999999999999999999999999999999999999999999999 Q ss_pred ccccccchhhh--------------------hhHHHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHhhhcccCceeeccc Q lcl|Aclame:pro 242 VMKAAPKKPTI--------------------AKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPD 301 (408) Q Consensus 242 g~g~~~~~~~~--------------------~~~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~lkd~~G~~~~~~~ 301 (408) |+|++.+.... ...+..+..+...+...+..+++|+|||+++..|++|||++|+|+|++. 
T Consensus 141 G~gs~~p~gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~ 220 (326) T protein:vir:42 141 GTDSPFPTFLAQTTKEVSLVDPDGTGSNADLTVYDAVAVNALSLLVNAGKKWTHTLLDDITEPILNGAKDKSGRPLFIES 220 (326) T ss_pred ccCCCccccccccccccceeecccccccccchhHHHHHHHHHhhhhhhccCccEEEEeHHHHHHHHHhhccCCceeeccc Confidence 99976442110 1112222333445677788899999999999999999999999999987 Q ss_pred cccCCc-----ccccccceEeeccccccccccCcceEEEEehhcceEeeeccceEEEEeccch------------hhhhh Q lcl|Aclame:pro 302 PTKPNS-----YLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGA------------GAFET 364 (408) Q Consensus 302 ~~~~~~-----~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~------------~~f~~ 364 (408) ...+.+ ++|+|+||+++++ +| .++..++||||++++ +..+++++++++++.+ +.|++ T Consensus 221 ~~~~~~~~~~~~~l~G~pv~~~~~--~~---~~~~~~~~Gd~s~~~-~~~~~~~~v~~~~e~~~~~~~~~~~~~~~~~~~ 294 (326) T protein:vir:42 221 TYTEENSPFRLGRIVARPTILSDH--VA---SGTVVGYQGDFRQLV-WGQVGGLSFDVTDQATLNLGTPQAPNFVSLWQH 294 (326) T ss_pred cccCccccccCceeeeeeEEEcCC--CC---CCceEEEEeecceEE-EEEecceEEEEeecceeeecccccccchhhhhc Confidence 665543 4799999998653 44 355667899999876 5689999999877643 45999 Q ss_pred ceeeEEEEeeeCcEEecccceEEEEeeccccC Q lcl|Aclame:pro 365 DTTKIRVIDRFDVKATDSEALVAGSFSAIADQ 396 (408) Q Consensus 365 ~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~~ 396 (408) |++.||+++|+|+++.+|+||++|+.++++++ T Consensus 295 d~~~~r~~~~~d~~v~~~~a~~~l~~~~~~~~ 326 (326) T protein:vir:42 295 NLVAVRVEAEYAFHCNDKDAFVKLTNVDATEA 326 (326) T ss_pred CcEEEEEEEEeccEEecccceEEEeeccccCC Confidence 99999999999999999999999999998888 No 80 >protein:vir:80684 Length: 315 # NCBI annotation: gp6 # Family: family:all:966 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285582;genbank:gi:148727088;genbank:GeneID:5247055 Probab=100.00 E-value=5e-54 Score=312.82 Aligned_cols=279 Identities=9% Similarity=0.028 Sum_probs=221.5 Q ss_pred hhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeecccCccceEEeeccCCccccchhcccccccccccccceeee Q lcl|Aclame:pro 116 
ETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIK 195 (408) Q Consensus 116 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~ 195 (408) |..+++++||++||++++.+|++.+++.++++++++++++.+.. +.+|+.. ..+.++|++|++++++ +.++|++++ T Consensus 1 Ma~~~~~~gg~~vP~~~~~~ii~~l~~~s~i~~l~~~i~~~~~~--~~ip~~~-~~~~a~wv~Eg~~~~~-s~~~f~~v~ 76 (315) T protein:vir:80 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGP--VKGAVFS-GVPRAKIVGEGEVKPS-ASVDVSAFT 76 (315) T ss_pred CCCCcCCcCceEcchHHHHHHHHHHHhhchhhhhcceeecCCCc--eEEEEEe-CCcceEEeeCCccccc-cccceeeeE Confidence 78888999999999999999999999999999999999887544 4555554 3567799999999996 569999999 Q ss_pred echheeeeehHHHHHHHhcchHH----HHHHHHHHHHHHHHHHHHHHHhhccccccc--h---------------hhhhh Q lcl|Aclame:pro 196 YLIKRYAGIITATNTSLKDTAEN----ILAWLSSWIAKKVVVTRNQAIIEVMKAAPK--K---------------PTIAK 254 (408) Q Consensus 196 ~~~~~~~~~~~iS~ell~ds~~~----~~~~v~~~l~~~~~~~~~~~~~~g~g~~~~--~---------------~~~~~ 254 (408) ++++|++++++||+||++|+..+ |+++|.++|++++++++|.++++|++.++. . .+... T Consensus 77 l~~~kl~~~~~iS~ell~~s~~~~~~~l~~~i~~~la~ai~~~~d~a~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (315) T protein:vir:80 77 AQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKAASAVHTSLNKTKNIVDATDSA 156 (315) T ss_pred eeeeeEEeeehhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHhhheeeccCCCCCccccccccccccccceeeccccc Confidence 99999999999999999988766 789999999999999999999999763211 1 11123 Q ss_pred HHHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHhhhcccCc-----eeeccccccCCcccccccceEeecccccccc--- Q lcl|Aclame:pro 255 FDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEGK-----YLLEPDPTKPNSYLIKGKQVIVVADRWLPNT--- 326 (408) Q Consensus 255 ~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~lkd~~G~-----~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~--- 326 (408) ++++++++.......+..+++|+|||.++..|++++|.+|+ |+| +++..+.+++|+|+||+++++ +|.. 
T Consensus 157 ~~d~~~~~~~~~~~~~~~~~~~imn~~~~~~L~~l~~~~g~~~~g~~~~-~~~~~g~~~tl~G~PV~~~~~--~~~~~~~ 233 (315) T protein:vir:80 157 TADLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMY-PAAGFAGLDNWRGLNVGASST--VSGAPEM 233 (315) T ss_pred hHHHHHHHHHHhhccCccceEEEEcHHHHHHHHHHhhccCCcccccccc-cccccCCCceecceeeEecCc--CCccccc Confidence 56677766443345566777899999999999999877665 455 456666778999999998764 4432 Q ss_pred -ccCcceEEEEehhcceEeeeccceEEEEeccch------hhhhhceeeEEEEeeeCcEEecccceEEEEeeccccCCCC Q lcl|Aclame:pro 327 -GSTVYPLYYGDMSQAITLFDRENMSLLPTNIGA------GAFETDTTKIRVIDRFDVKATDSEALVAGSFSAIADQVGN 399 (408) Q Consensus 327 -~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~------~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~~ 399 (408) ......++||||++++ +..+++++++++++.. +.|++|++.||+++|+|+++++|+||++|+.+++..+.+. T Consensus 234 ~~~~~~~~~~GDfs~~~-~g~~~~~~i~i~~~~~~~~~~~~~~~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~a~~~~~~ 312 (315) T protein:vir:80 234 SPASGVKAIVGDFSRVH-WGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAPKPNPP 312 (315) T ss_pred ccccccEEEEeecccEE-EEEecCeeEEEeccccccCcccchhhcCcEEEEEEEEecceeecccceEEEeeccCCCCCCC Confidence 2345678999999854 4568999999887643 4599999999999999999999999999987665332222 Q ss_pred ccC Q lcl|Aclame:pro 400 FKT 402 (408) Q Consensus 400 ~~~ 402 (408) ... 
T Consensus 313 ~~~ 315 (315) T protein:vir:80 313 AEN 315 (315) T ss_pred CCC Confidence 222 No 81 >protein:vir:99749 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004307;genbank:gi:122891761;genbank:GeneID:4712304 Probab=100.00 E-value=1.9e-53 Score=309.61 Aligned_cols=298 Identities=14% Similarity=0.105 Sum_probs=236.0 Q ss_pred cchhhhHHHHHHHHHHHhhcchhhHHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeecccCccceE Q lcl|Aclame:pro 84 KSENELKDKFVKDFVNMVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRV 163 (408) Q Consensus 84 ~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~~~ 163 (408) +..........+.|...+..+. ..++....+..++|.+||++++++|++.+++.++|+++|+++++.+.+ +. T Consensus 1 ~~k~~~~~~~~~~~~~~~~~~~------~~~a~~~~~~~~~~~lip~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~--~~ 72 (324) T protein:vir:99 1 MEQTQKLKLNLQHFASNNVKPQ------VFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMRLGKYEPMEGTE--KK 72 (324) T ss_pred CCCchHhhHHHHHHHHHhhhhh------hccccceeccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccCCc--eE Confidence 2222223334555665554432 223444555666778999999999999999999999999999987654 44 Q ss_pred EeeccCCccccchhcccccccccccccceeeeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhcc Q lcl|Aclame:pro 164 YEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVM 243 (408) Q Consensus 164 ~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~~g~ 243 (408) +|... ..+.+.|++|++.+|+. 
.++|++++++++|++++++||+|+++|+.+++++||.++|++++++++|.++++|+ T Consensus 73 ~p~~~-~~~~a~~v~Eg~~~~~~-~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~ai~~~~d~~~l~G~ 150 (324) T protein:vir:99 73 FTFWA-DKPGAYWVGEGQKIETS-KATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQ 150 (324) T ss_pred EEEEe-cCcceeEeccCcccccc-ccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhhcC Confidence 55544 35677999999999975 58999999999999999999999999999999999999999999999999999998 Q ss_pred ccccch--------------hhhhhHHHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHhhhcccCceeeccccccCCccc Q lcl|Aclame:pro 244 KAAPKK--------------PTIAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNSYL 309 (408) Q Consensus 244 g~~~~~--------------~~~~~~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~ 309 (408) |++... .+...++++++++ ..+.+.+..+++|+|||++|..|++++|++|+|+|.+ +.+++ T Consensus 151 g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~-~~l~~~~~~~~~~v~n~~~~~~L~~l~d~~g~~~~~~----~~~~~ 225 (324) T protein:vir:99 151 GNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLE-ALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD----RNSDT 225 (324) T ss_pred CCCccCccccccccccceeccccCCHHHHHHHH-HhhhhccCCCCEEEEcHHHHHHHHHhhcCCCceeecC----CCCcc Confidence 865321 2334577888776 4588888889999999999999999999999999853 34578 Q ss_pred ccccceEeeccccccccccCcceEEEEehhcceEeeeccceEEEEeccch------------hhhhhceeeEEEEeeeCc Q lcl|Aclame:pro 310 IKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGA------------GAFETDTTKIRVIDRFDV 377 (408) Q Consensus 310 l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~------------~~f~~~~~~~r~~~r~d~ 377 (408) |+|+||++++ ....++..+++|||++++ +.++++++|+++++.. 
+.|++|++.+|+++|+|+ T Consensus 226 l~G~PVv~~~-----~~~~~~~~~i~gd~~~~~-~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~ 299 (324) T protein:vir:99 226 LDGLPVVNLK-----SSNLKRGELITGDFDKLI-YGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVAL 299 (324) T ss_pred ccceeEEeec-----CCCCCcceEEEEecccEE-EEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEcc Confidence 9999998754 334556789999999854 6789999999987643 459999999999999999 Q ss_pred EEecccceEEEEeeccccCCCCccCCCcccC Q lcl|Aclame:pro 378 KATDSEALVAGSFSAIADQVGNFKTTTSTAV 408 (408) Q Consensus 378 ~v~~~~a~~~l~~~~~~~~~~~~~~~~~~~~ 408 (408) ++.+|+||++|+.+.....+. +..| T Consensus 300 ~v~~~~a~~~lt~a~~~~~~~------~~~~ 324 (324) T protein:vir:99 300 HIADDKAFAKLVPADKKTDSV------PGEV 324 (324) T ss_pred EEecccceEEEEeccCCCCCC------CCCC Confidence 999999999998765544332 2222 No 82 >protein:vir:2430 Length: 318 # NCBI annotation: major head subunit # Family: family:all:507 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046832;genbank:gi:9630400;genbank:GeneID:1261582 Probab=100.00 E-value=1.3e-53 Score=310.54 Aligned_cols=286 Identities=15% Similarity=0.104 Sum_probs=232.1 Q ss_pred hhcchhhHHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeecccCccceEEeeccCCccccchhccc Q lcl|Aclame:pro 101 VRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAED 180 (408) Q Consensus 101 ~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~E~ 180 (408) ++.+... ..+.+.+...+++++|.+||+++.++|++.+++.++|+++++++++.+.... +|+.. 
..+.+.|++|+ T Consensus 1 ~~~~~~~--~~e~~~~~~~~~~~~~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~--ip~~~-~~~~a~~v~Eg 75 (318) T protein:vir:24 1 MAAGTAF--AVDHAQIAQTGDTMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKVPMGTTGQK--IPHWV-GDVSAQWIGEG 75 (318) T ss_pred CCCCCCC--CHHHHHhhcccCcccceeechhHHHHHHHHHHhhchhhhhcceeeccCCceE--EEEEe-CCcceEEecCC Confidence 5554322 2244555556667778899999999999999999999999999998765544 45444 34677999999 Q ss_pred ccccccccccceeeeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhccccccchh---------- Q lcl|Aclame:pro 181 GKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVMKAAPKKP---------- 250 (408) Q Consensus 181 ~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~~g~g~~~~~~---------- 250 (408) +++++. .++|++++++++|+++++++|+|+++|+.+++++||.+.|++++++++|.++++|+|++.+.. T Consensus 76 ~~~~~~-~~~f~~i~~~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~~~~~~~d~a~l~G~g~~~~~~~~~~~~~~~~ 154 (318) T protein:vir:24 76 DMKPIT-KGNMTSQTIAPHKIATIFVASAETVRANPANYLGTMRTKVATAFAMAFDGAAMHGTDSPFPTYIGQTTKAISI 154 (318) T ss_pred cccccc-ccceeEEEEeeEEEEEeehhhHHHhhcChHHHHHHHHHHHHHHHHHHHHHhhhcccCCCCCcccccccccccc Confidence 999974 589999999999999999999999999999999999999999999999999999998754311 Q ss_pred -----hhhhHHHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHhhhcccCceeeccccccCCc-----ccccccceEeecc Q lcl|Aclame:pro 251 -----TIAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNS-----YLIKGKQVIVVAD 320 (408) Q Consensus 251 -----~~~~~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~-----~~l~G~pv~~~~~ 320 (408) ....+++.+..+...+...+..+++|+|||++|..|+++||++|+|+|++++.++.+ ..++|+||+++++ T Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~~~~~~i~g~pv~~~~~ 234 (318) T protein:vir:24 155 ADTTGATTVYDQVAVNGLSLLVNDGKKWTHTLLDDITEPILNGAKDQNGRPLFIESTYGEAASPFRSGRIVARPTILSDH 234 (318) T ss_pred cccccccchHHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhccCCceeecCccccCccccccCceEEEEeeEEeCC Confidence 112233333333456778899999999999999999999999999999988777654 4677888877543 Q ss_pred 
ccccccccCcceEEEEehhcceEeeeccceEEEEeccch------------hhhhhceeeEEEEeeeCcEEecccceEEE Q lcl|Aclame:pro 321 RWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGA------------GAFETDTTKIRVIDRFDVKATDSEALVAG 388 (408) Q Consensus 321 ~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~------------~~f~~~~~~~r~~~r~d~~v~~~~a~~~l 388 (408) + ..++..++||||++++ +..+++++++++++.+ +.|++|++.||+++|+|+++.+|+||++| T Consensus 235 --~---~~~~~~~~~gdfs~~~-~~~~~~l~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~i 308 (318) T protein:vir:24 235 --V---VEGTTVGFMGDFSQLI-WGQIGGLSFDVTDQATLNLGTVESPNFVSLWQHNLVAVRVEAEYAFHCNDAEAFVAL 308 (318) T ss_pred --C---CCCccEEEEeecceEE-EEEecCeEEEEeeccceeccccccccchhhhhcCcEEEEEEEEEccEEecccceEEE Confidence 2 3455678999999854 5689999999887643 45999999999999999999999999999 Q ss_pred EeeccccCCC Q lcl|Aclame:pro 389 SFSAIADQVG 398 (408) Q Consensus 389 ~~~~~~~~~~ 398 (408) +.++.+..-+ T Consensus 309 ~~~~a~~~~~ 318 (318) T protein:vir:24 309 TNVVSGGGEG 318 (318) T ss_pred EeeccCCCCC Confidence 9888887777 No 83 >protein:vir:103955 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873992;genbank:gi:118430767;genbank:GeneID:4525449 Probab=100.00 E-value=2.3e-53 Score=309.17 Aligned_cols=298 Identities=14% Similarity=0.109 Sum_probs=236.3 Q ss_pred cchhhhHHHHHHHHHHHhhcchhhHHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeecccCccceE Q lcl|Aclame:pro 84 KSENELKDKFVKDFVNMVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRV 163 (408) Q Consensus 84 ~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~~~ 163 (408) ..+........+.|...+..+. ..++....+..++|.+||++++++|++.+++.++|+++|+++++.+.+ +. 
T Consensus 1 ~~~~~~~~~~~~~f~~~~~~~~------~~~a~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~--~~ 72 (324) T protein:vir:10 1 MEQTQKLKLNLQHFASNNVKPQ------VFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTE--KK 72 (324) T ss_pred CCCchHHHHHHHHHHHHhhccc------eecccceeccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccCCc--eE Confidence 2222223334555666555442 123444555666778999999999999999999999999999987654 44 Q ss_pred EeeccCCccccchhcccccccccccccceeeeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhcc Q lcl|Aclame:pro 164 YEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVM 243 (408) Q Consensus 164 ~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~~g~ 243 (408) +|... ..+.+.|++|++++|+. .++|++++++++|++++++||+|+++|+.+++++||.++|++++++++|.++++|+ T Consensus 73 ~p~~~-~~~~a~~v~Eg~~~~~~-~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~ai~~~~d~a~l~G~ 150 (324) T protein:vir:10 73 FTFWA-DKPGAYWVGEGQKIETS-KATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQ 150 (324) T ss_pred EEEEe-CCcceeEeccCcccccc-ccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhhcC Confidence 55543 35678999999999975 58999999999999999999999999999999999999999999999999999998 Q ss_pred ccccch--------------hhhhhHHHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHhhhcccCceeeccccccCCccc Q lcl|Aclame:pro 244 KAAPKK--------------PTIAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNSYL 309 (408) Q Consensus 244 g~~~~~--------------~~~~~~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~ 309 (408) |++... .+...++++++++. 
.+...+..+++|+|||++|..|++++|++|+|+|.+ +.+++ T Consensus 151 g~~~~~~~i~~~~~~~~~~~~~~~t~~~i~~~~~-~l~~~~~~~~~~v~n~~~~~~L~~l~d~~g~~~~~~----~~~~~ 225 (324) T protein:vir:10 151 GNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLEA-LLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD----RNSDT 225 (324) T ss_pred CCCccCccccccccccceeccccCCHHHHHHHHH-hhhhccCCCCEEEEcHHHHHHHHHhhccCCceeecC----CCCcc Confidence 875321 13345778887764 578888889999999999999999999999999864 34568 Q ss_pred ccccceEeeccccccccccCcceEEEEehhcceEeeeccceEEEEeccch------------hhhhhceeeEEEEeeeCc Q lcl|Aclame:pro 310 IKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGA------------GAFETDTTKIRVIDRFDV 377 (408) Q Consensus 310 l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~------------~~f~~~~~~~r~~~r~d~ 377 (408) |+|+||++++ +...++..+++|||++++ +..+++++++++++.+ +.|++|++.||+++|+|+ T Consensus 226 l~G~PV~~~~-----~~~~~~~~~~~gd~~~~~-~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~d~ 299 (324) T protein:vir:10 226 LDGLPVVNLK-----SSNLKRGELITGDFDKLI-YGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVAL 299 (324) T ss_pred ccceeEEeec-----CCCCCcceEEEEecccEE-EEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEcc Confidence 9999998754 334566789999999865 5689999999987643 569999999999999999 Q ss_pred EEecccceEEEEeeccccCCCCccCCCcccC Q lcl|Aclame:pro 378 KATDSEALVAGSFSAIADQVGNFKTTTSTAV 408 (408) Q Consensus 378 ~v~~~~a~~~l~~~~~~~~~~~~~~~~~~~~ 408 (408) ++.+|+||++|+..+....+ ++..| T Consensus 300 ~v~~~~A~~~l~~a~~~~~~------~~~~~ 324 (324) T protein:vir:10 300 HIADDKAFAKLVPADKKTDS------VPGEV 324 (324) T ss_pred EEecccceEEEEeccCCCCC------CCCCC Confidence 99999999999865544322 22222 No 84 >protein:vir:96223 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239571;genbank:gi:66395304;genbank:GeneID:5132771 Probab=100.00 E-value=4e-53 Score=307.82 Aligned_cols=298 Identities=14% Similarity=0.104 Sum_probs=233.3 Q ss_pred cchhhhHHHHHHHHHHHhhcchhhHHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeecccCccceE Q lcl|Aclame:pro 84 
KSENELKDKFVKDFVNMVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRV 163 (408) Q Consensus 84 ~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~~~ 163 (408) ...........+.|...+..+. ..++.......++|.+||++++++|++.+++.++|+++++++++++.+.. T Consensus 1 ~~~~~~~~~~~~~f~~~~~~~~------~~~a~~~~~~~~~~~lip~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~-- 72 (324) T protein:vir:96 1 MEQTQKLKLNLQHFASNNVKPQ------VFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKK-- 72 (324) T ss_pred CCcchhhhHHHHHHHHhhhhhh------hcccccccccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccCCceE-- Confidence 2222222334444655544432 12334444456677899999999999999999999999999998865544 Q ss_pred EeeccCCccccchhcccccccccccccceeeeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhcc Q lcl|Aclame:pro 164 YEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVM 243 (408) Q Consensus 164 ~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~~g~ 243 (408) +|...+ .+.+.|++|++.+|+ +.++|++++++++|++++++||+|+++|+.+++++||.++|++++++++|.++++|+ T Consensus 73 ~p~~~~-~~~a~~v~Eg~~~~~-~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~l~~aia~~~d~~~l~G~ 150 (324) T protein:vir:96 73 FTFWAD-KPGAYWVGEGQKIET-SKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQ 150 (324) T ss_pred EEEEec-CcceeeecCCccccc-cccceeEEEEEeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhhcC Confidence 444433 456799999999997 458999999999999999999999999999999999999999999999999999998 Q ss_pred ccccch--------------hhhhhHHHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHhhhcccCceeeccccccCCccc Q lcl|Aclame:pro 244 KAAPKK--------------PTIAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNSYL 309 (408) Q Consensus 244 g~~~~~--------------~~~~~~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~ 309 (408) |++... .+...++++++++. 
.+...+..+++|+|||++|.+|++++|++|+|++.+ +.+++ T Consensus 151 g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~-~i~~~~~~~~~~i~n~~~~~~L~~lkd~~G~~~~~~----~~~~~ 225 (324) T protein:vir:96 151 GNNPFGKSIAQSIKKTNKVIKGDFTQDNIIDLEA-LLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD----RNSDS 225 (324) T ss_pred CCCCcCccccccccccceecccccchHHHHHHHH-hhhhccCCCCEEEEcHHHHHHHHHhhCCCCCeeecC----CCCCc Confidence 865332 12335788888775 477888888899999999999999999999999853 45678 Q ss_pred ccccceEeeccccccccccCcceEEEEehhcceEeeeccceEEEEeccch------------hhhhhceeeEEEEeeeCc Q lcl|Aclame:pro 310 IKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGA------------GAFETDTTKIRVIDRFDV 377 (408) Q Consensus 310 l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~------------~~f~~~~~~~r~~~r~d~ 377 (408) |+|+||+++. ....++..++||||+++ .+..+++++++++++.. +.|++|++.||+++|+|+ T Consensus 226 l~G~PV~~~~-----~~~~~~~~~~~gd~s~~-~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~n~v~~r~~~r~d~ 299 (324) T protein:vir:96 226 LDGLPVVNLK-----SSNLKRGELITGDFDKL-IYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVAL 299 (324) T ss_pred ccceeeEeec-----CCCCCcceEEEEecceE-EEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEecc Confidence 9999998753 33455677999999985 45689999999987643 569999999999999999 Q ss_pred EEecccceEEEEeeccccCCCCccCCCcccC Q lcl|Aclame:pro 378 KATDSEALVAGSFSAIADQVGNFKTTTSTAV 408 (408) Q Consensus 378 ~v~~~~a~~~l~~~~~~~~~~~~~~~~~~~~ 408 (408) ++.+|+||++|+.......+.... 
| T Consensus 300 ~v~~~~a~~~l~~a~~~~~~~~~~------~ 324 (324) T protein:vir:96 300 HIADDKAFAKLVPADKRTDSVPGE------V 324 (324) T ss_pred EEecccceEEEecccccCCCCCCC------C Confidence 999999999998554443332222 2 No 85 >protein:vir:94771 Length: 298 # NCBI annotation: major head protein # Family: family:all:966 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996706;genbank:gi:45597421;genbank:GeneID:2769044 Probab=100.00 E-value=1.5e-53 Score=310.26 Aligned_cols=266 Identities=13% Similarity=0.030 Sum_probs=220.9 Q ss_pred ccccCceecchhhhhhhhhhhhhhhhhhhhhceeecccCccceEEeeccCCccccchhcccccccccccccceeeeechh Q lcl|Aclame:pro 120 SDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIK 199 (408) Q Consensus 120 t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~ 199 (408) ...+||++||+++..+|++.+++.++++++|+++++++.. +.+|+.. ..+.+.|++|++++|+ +.++|++++++++ T Consensus 1 ma~~gG~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~--~~~p~~~-~~~~a~~v~Eg~~~~~-~~~~f~~v~l~~~ 76 (298) T protein:vir:94 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNG--EKVFTFT-MDSEIDVVAESGKKTH-GGVTLAPQTMVPI 76 (298) T ss_pred CeeccccccChhHHHHHHHHHHhhchhhhhcceeeccCCc--eEEEEEe-cCcceEEeeCCccccc-cccceeEEEEeee Confidence 4446789999999999999999999999999999987654 4455543 3466799999999996 5689999999999 Q ss_pred eeeeehHHHHHHHh---cchHHHHHHHHHHHHHHHHHHHHHHHhhccccc----cch------------------hhhhh Q lcl|Aclame:pro 200 RYAGIITATNTSLK---DTAENILAWLSSWIAKKVVVTRNQAIIEVMKAA----PKK------------------PTIAK 254 (408) Q Consensus 200 ~~~~~~~iS~ell~---ds~~~~~~~v~~~l~~~~~~~~~~~~~~g~g~~----~~~------------------~~~~~ 254 (408) |+++++++|+|+++ ++..+|+++|.++|++++++++|.++++|.+.+ ... ..... 
T Consensus 77 k~~~~~~iS~ell~~~~~~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (298) T protein:vir:94 77 KVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADP 156 (298) T ss_pred EEEEeeehhHHHhccCCccHHHHHHHHHHHHHHHHHHHHHHHhhcccccCCCcccccccccccccccccccccccccccH Confidence 99999999999996 456789999999999999999999999984321 110 01111 Q ss_pred HHHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHhhhcccCceeeccccccCCcccccccceEeecccccccc-ccCcceE Q lcl|Aclame:pro 255 FDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNSYLIKGKQVIVVADRWLPNT-GSTVYPL 333 (408) Q Consensus 255 ~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~-~~~~~~~ 333 (408) ++++.+++ ..+...+..+++|+|||+++.+|+++||++|+|+|++.+.++.+++|+|+||++++. +|.. ..+...+ T Consensus 157 ~~~i~~~~-~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~tl~G~PV~~~~~--v~~~~~~~~~~~ 233 (298) T protein:vir:94 157 NGAIENAV-ELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKT--VSDMSLTQRDRA 233 (298) T ss_pred HHHHHHHH-HhhhhcCCCccEEEEcHHHHHHHHHhhccCCCeeecCcccCCCCceecceeeEEecc--cccccCCCccEE Confidence 34555555 457777888889999999999999999999999999988989899999999998654 4433 3455678 Q ss_pred EEEehhcceEeeeccceEEEEeccch------hhhhhceeeEEEEeeeCcEEecccceEEEEeec Q lcl|Aclame:pro 334 YYGDMSQAITLFDRENMSLLPTNIGA------GAFETDTTKIRVIDRFDVKATDSEALVAGSFSA 392 (408) Q Consensus 334 ~~gd~~~~~~~~~~~~~~i~~~~~~~------~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~ 392 (408) ++|||++++.+..|++++++++++.. 
+.|++|++.+|++.|+|+++.+|+||++++..+ T Consensus 234 ~~Gdfs~~~~~~~~~~~~~~~~~~~~~d~~~~~~f~~~~v~~r~~~r~~~~~~~~~a~~~l~~~t 298 (298) T protein:vir:94 234 IIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) T ss_pred EEeeccceEEEEEecCceEEEeecCCCcCcchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 99999998877789999999877532 369999999999999999999999999997666 No 86 >protein:vir:2344 Length: 397 # NCBI annotation: gp14 # Family: family:all:507 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075281;genbank:gi:12657868;genbank:GeneID:920118 Probab=100.00 E-value=1.5e-53 Score=310.26 Aligned_cols=291 Identities=14% Similarity=0.095 Sum_probs=229.8 Q ss_pred hhhHHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeecccCccceEEeeccCCccccchhccccccc Q lcl|Aclame:pro 105 MAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIP 184 (408) Q Consensus 105 ~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~E~~~~~ 184 (408) ++.. .+.+.+...+++++|.+||+++..+|++.+++.++|+++++++++.+.+ +.+|+... .+.+.|++|+++++ T Consensus 1 ~g~~--~e~~~~~~~~t~~~~g~l~~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~--~~ip~~~~-~~~a~wv~Eg~~~~ 75 (397) T protein:vir:23 1 MGFS--ADHSQIAQTKDTMFTGYLDPVQAKDYFAEAEKTSIVQRVAQKIPMGATG--IVIPHWTG-DVSAQWIGEGDMKP 75 (397) T ss_pred CCcC--HHHHHHhhccCCCCccccchhHHHHHHHHHHhccchhhhcceeeccCCc--eEEEEEcC-CcceEEecCCcccc Confidence 2222 2233333334444455677788999999999999999999999987654 44555443 46679999999999 Q ss_pred ccccccceeeeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhccccccch-------------hh Q lcl|Aclame:pro 185 DLDNPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVMKAAPKK-------------PT 251 (408) Q Consensus 185 ~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~~g~g~~~~~-------------~~ 251 (408) + +.++|+++++++||++++++||+||++|+.+++++||.++|++++++++|.++++|+|++.+. 
.+ T Consensus 76 ~-s~~~f~~v~l~~~k~~~~v~iS~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l~G~gt~~~~~~~~~~~~~~~~~~~ 154 (397) T protein:vir:23 76 I-TKGNMTKRDVHPAKIATIFVASAETVRANPANYLGTMRTKVATAIAMAFDNAALHGTNAPSAFQGYLDQSNKTQSISP 154 (397) T ss_pred c-cccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhcccCCcccccccccccceeeecc Confidence 6 568999999999999999999999999999999999999999999999999999999876432 22 Q ss_pred hhhHHHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHhhhcccCceeeccccccCCc-----ccccccceEeecccccccc Q lcl|Aclame:pro 252 IAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNS-----YLIKGKQVIVVADRWLPNT 326 (408) Q Consensus 252 ~~~~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~-----~~l~G~pv~~~~~~~~~~~ 326 (408) ...++++++++. .+...+..+++|+||++++..|+++||++|+|+|+++...+.+ .+|+|+||++.++ +| T Consensus 155 ~~~~~~~~~~~~-~l~~~~~~~a~~vmn~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~~~~~tl~G~Pv~~s~~--~~-- 229 (397) T protein:vir:23 155 NAYQGLGVSGLT-KLVTDGKKWTHTLLDDTVEPVLNGSVDANGRPLFVESTYESLTTPFREGRILGRPTILSDH--VA-- 229 (397) T ss_pred cchhHHHHHHHH-hhhhcccCCCEEEEcHHHHHHHHHhhccCCceeecccccccccccccCceeeeeeEEEeCC--CC-- Confidence 233556666654 4778899999999999999999999999999999987766543 5899999998654 34 Q ss_pred ccCcceEEEEehhcceEeeeccceEEEEeccch------------hhhhhceeeEEEEeeeCcEEecccceEEEEeeccc Q lcl|Aclame:pro 327 GSTVYPLYYGDMSQAITLFDRENMSLLPTNIGA------------GAFETDTTKIRVIDRFDVKATDSEALVAGSFSAIA 394 (408) Q Consensus 327 ~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~------------~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~ 394 (408) .++..++||||++++ +..+++++++++++.+ +.|++|++.||+++|+|+++++|+||++++..+.. 
T Consensus 230 -~g~~~~~~gDfs~~~-i~~~~~i~i~~~~e~~~~~~~~~~~~~~~lf~~d~v~~ra~~r~d~~v~~~~a~~~~~~~~~~ 307 (397) T protein:vir:23 230 -EGDVVGYAGDFSQII-WGQVGGLSFDVTDQATLNLGSQESPNFVSLWQHNLVAVRVEAEYGLLINDVNAFVKLTFDPVL 307 (397) T ss_pred -CCceEEEEeecceEE-EEEEeceEEEEeeeeeeeeccccccceeeeeeccceeEEEEeeeccceecccceEEEeecccc Confidence 345567999999865 5678999999887642 56999999999999999999999999999987765 Q ss_pred cCCCCc-cCCCcccC Q lcl|Aclame:pro 395 DQVGNF-KTTTSTAV 408 (408) Q Consensus 395 ~~~~~~-~~~~~~~~ 408 (408) .+...+ +++++.+. T Consensus 308 ~~~~~~~~~~~~~~~ 322 (397) T protein:vir:23 308 TTYALDLDGASAGNF 322 (397) T ss_pred ceeeecccccCcceE Confidence 443322 33333332 No 87 >protein:vir:100632 Length: 381 # NCBI annotation: 77ORF006 # Family: family:all:635 # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958606;genbank:gi:41189521;genbank:GeneID:2743778 Probab=100.00 E-value=1.2e-52 Score=305.21 Aligned_cols=343 Identities=11% Similarity=0.027 Sum_probs=224.3 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccc Q lcl|Aclame:pro 1 MGVKLTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKG 80 (408) Q Consensus 1 M~~~~~i~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (408) |.+|.. +++++++.++...+++...+ ..+. +....+ +.++. +..+.+... +....... . 
T Consensus 1 m~~kl~-~~~~~~~~~~~~~~~~~~~~----~~~~----~~~~~~---~~~~~---~~~~~~~~~-e~~~~~~~---~-- 59 (381) T protein:vir:10 1 MTINLS-ETFANAKNEFINAVNNGEPQ----ERQN----ELYGDM---INQLF---EETKLQAKA-EAERVSSL---P-- 59 (381) T ss_pred CchhHH-HHHHHHHHHHHHHHHhhhHH----HHHH----HHHHHH---HHhhh---hhHHHHHHH-HHHHHHHh---c-- Confidence 777732 33443333322222211000 0000 000000 00000 000000000 00000000 0 Q ss_pred ccccchhhhHHHHHHHHHHHhhcchhhHHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeecccCcc Q lcl|Aclame:pro 81 PLNKSENELKDKFVKDFVNMVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNG 160 (408) Q Consensus 81 ~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g 160 (408) ........++++.| .+...+++++||++||+++.++|++.+++.++|+++|+++++++ T Consensus 60 ---~~~~~l~~~e~~~~----------------~~~~~~t~~~Gg~lvP~~~~~~I~~~l~~~spir~~a~v~~~~~--- 117 (381) T protein:vir:10 60 ---KSAQTLSANQRNFF----------------MDINKSVGYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL--- 117 (381) T ss_pred ---ccccccCHHHHHHH----------------HHHhhcCCCCCceecCHHHHHHHHHHHHhhcceeeeeeeEecCc--- Confidence 00011112222222 23456778899999999999999999999999999999998743 Q ss_pred ceEEeeccCCccccchhcccccccccccccceeeeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 161 SRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAII 240 (408) Q Consensus 161 ~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~ 240 (408) ...++.. 
+..+.+.|++|.++++..+.++|+++++.+++++++++||++||+|+.+++++||.++|+++++++++.+|+ T Consensus 118 ~~~i~~~-~~~~~a~W~~e~~~~~~~~~~~f~~i~l~~~kl~a~i~is~elL~Ds~~~le~~i~~~la~~~a~~~~~afi 196 (381) T protein:vir:10 118 RLKFLKS-ETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFL 196 (381) T ss_pred ceEEEee-cCCcceEEeecccccccccCccceeEeecceeEEeeccccHHHHhccHHHHHHHHHHHHHHHHHHHhhceeE Confidence 3445544 345777899998888767789999999999999999999999999999999999999999999999999999 Q ss_pred hccccccchhhhh----------------------hHHHHHHH------HHHhh-------hhhccCCCEEEEcHHHHHH Q lcl|Aclame:pro 241 EVMKAAPKKPTIA----------------------KFDDVITM------INTAV-------DPAIIATSSLLTNQSGLNK 285 (408) Q Consensus 241 ~g~g~~~~~~~~~----------------------~~d~i~~~------~~~~l-------~~~~~~~a~~~~n~~~~~~ 285 (408) +|+|+++|.+-.. ++.++... +...+ ...|..++.|+|||.++.. T Consensus 197 ~GdG~~qP~Gil~~~~~~~~~~~g~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~vmn~~t~~~ 276 (381) T protein:vir:10 197 KGTGKDQPIGLNRQVQKGVSVTDGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFE 276 (381) T ss_pred ecccCCCceeeeecCCccccccccccccccccccccccchhhHHHHHHHHHHhhhhhhccccccccCceEEEEchhhHHh Confidence 9999876632110 11111111 11111 2246778899999999988 Q ss_pred HHhhh---cccCceeeccccccCCcccccccceEeeccccccccccCcceEEEEehhcceEeeeccceEEEEeccchhhh Q lcl|Aclame:pro 286 LALVK---TAEGKYLLEPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAF 362 (408) Q Consensus 286 l~~lk---d~~G~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f 362 (408) |+.++ +++|+|+|... +|+||+.+ ..+|. 
+.|+||||++ |.+.+|.+++++++++.+ | T Consensus 277 l~~~~~~~~~~G~~v~~lp---------~g~~vv~~--~~~p~-----~~i~fGDfs~-Y~i~~r~~~~i~~~~~~~--~ 337 (381) T protein:vir:10 277 VQAQYTHLNANGVYVTALP---------FNLNVIES--TVQEA-----GKVLTYVKGL-YDGYLAGGINVQKFKETL--A 337 (381) T ss_pred hccccccCCCCCceeecCC---------CCceeEEc--CCCCc-----CcEEEEEccc-EEEEEecccEEEeechhh--h Confidence 88654 88899987531 46777664 33553 4589999998 788999999999998765 9 Q ss_pred hhceeeEEEEeeeCcEEecccceEEEEeeccccCCCCccCCCcc Q lcl|Aclame:pro 363 ETDTTKIRVIDRFDVKATDSEALVAGSFSAIADQVGNFKTTTST 406 (408) Q Consensus 363 ~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~~~~~~~~~ 406 (408) .+|++.||+..|+||++++|+||++++++.....|....+.-.- T Consensus 338 ~~d~~~f~a~~r~dG~~~~~~A~~v~~l~~~~~~~~~~~~~~~~ 381 (381) T protein:vir:10 338 LDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEDTEETL 381 (381) T ss_pred hcCceEEEEEEEEcCEEecCCcEEEEEEeecCCccccccccccC Confidence 99999999999999999999999998887554322222221111 No 88 >protein:vir:78523 Length: 338 # NCBI annotation: Putative head structural protein # Family: family:all:507 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491585;genbank:gi:157786408;genbank:GeneID:5625675 Probab=100.00 E-value=6e-53 Score=306.87 Aligned_cols=286 Identities=12% Similarity=0.033 Sum_probs=222.4 Q ss_pred hHHHHHHHHhhcccc------ccCceecchhhhhhhhhhhhhhhhhhhhhceeecccCccceEEeeccC-----Cccccc Q lcl|Aclame:pro 107 FMNTVSSKTETSGSD------SAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTD-----VTPLTV 175 (408) Q Consensus 107 ~~~~~~~~a~~~~t~------~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~-----~~~~~~ 175 (408) ...-.+.++...+++ +.++.+||++++++|++.+++.++|+++|+++++++.+..+++..... ....+. 
T Consensus 1 ~~~~~e~~~~~~~~~~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~~a~~v~~~~~~ 80 (338) T protein:vir:78 1 MATLNELAPNTAGSNHQGRLAHVPSDLLPKEIVGPIFDKAQESSLVLRLGENIPISYGETIIPTTVKRPEVGQVGVGTSN 80 (338) T ss_pred CcchHHhhhhhcccccccceecccccccchHHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCccceeecccccc Confidence 111122333333332 334558999999999999999999999999999987666655543221 224567 Q ss_pred hhcccccccccccccceeeeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhccccccch------ Q lcl|Aclame:pro 176 MDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVMKAAPKK------ 249 (408) Q Consensus 176 ~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~~g~g~~~~~------ 249 (408) |++|++++++ +.++|++++++++|++++++||+|+++|+.+++++||.++|++++++++|.++++|+|++++. T Consensus 81 ~~~Eg~~~~~-~~~~f~~v~l~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~a~~~~~d~~~l~G~g~~~~~~~~gi~ 159 (338) T protein:vir:78 81 EQREGGTKPL-SGTAWDTRSVAPIKLATIVTVSEEFARMNPSGLYTKLQADLAYAIGRGIDLAVFHGKSPLTGSALQGID 159 (338) T ss_pred cccccccccc-cccceeEEEEEEEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCccccccccc Confidence 8999999986 458999999999999999999999999999999999999999999999999999999864321 Q ss_pred ----------------hhhhhHHHHHHHHHHhhhhhccCCCEEEEcHHHHHHH---HhhhcccCceeeccccccCCcccc Q lcl|Aclame:pro 250 ----------------PTIAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKL---ALVKTAEGKYLLEPDPTKPNSYLI 310 (408) Q Consensus 250 ----------------~~~~~~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l---~~lkd~~G~~~~~~~~~~~~~~~l 310 (408) .....++++.+++............+|+|||+++..| ++++|++|+|+|.+...++.+++| T Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~~~~l~d~~g~~l~~~~~~~~~~~~l 239 (338) T protein:vir:78 160 TNNVIVNTTNVDYLQTGTTPLLDRFLDGYDLVSANTDVDFNGWAADPRYRARLLRSQAYRDANGNVDPTRINLAASAGDL 239 (338) T ss_pred cccccccccccccccccchhhHHHHHHHHHHhhhhccccceEEEEchHHHHHHHHHhhhccCCCceeecccccCCCCcee Confidence 0111244555554433333444556799999998776 457899999999988888889999 Q ss_pred 
cccceEeeccccccc----cccCcceEEEEehhcceEeeeccceEEEEeccc------------hhhhhhceeeEEEEee Q lcl|Aclame:pro 311 KGKQVIVVADRWLPN----TGSTVYPLYYGDMSQAITLFDRENMSLLPTNIG------------AGAFETDTTKIRVIDR 374 (408) Q Consensus 311 ~G~pv~~~~~~~~~~----~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~------------~~~f~~~~~~~r~~~r 374 (408) +|+||++++. +|+ ..+.+..++||||+. |++.++++++++++++. .+.|++|++.+|++.| T Consensus 240 ~G~PV~~~~~--ip~~~~~~~~~~~~~~~gdfs~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r 316 (338) T protein:vir:78 240 LGLPVQFGKA--VGGDLGAATDSKVRVVGGDFSQ-LKYGFADEIRVKMSDTATLTDNTSPTPQTVSMWQTNQIAILIEVT 316 (338) T ss_pred eeeeEEEccc--cCccccccCCcccEEEEEecce-EEEEeecccEEEEeecccccccccccccchhhhhcCcEEEEEEEE Confidence 9999998653 443 344567799999998 55678999999998764 2569999999999999 Q ss_pred eCcEEecccceEEEEeeccccC Q lcl|Aclame:pro 375 FDVKATDSEALVAGSFSAIADQ 396 (408) Q Consensus 375 ~d~~v~~~~a~~~l~~~~~~~~ 396 (408) +||++++|+||++|+..+...+ T Consensus 317 ~d~~v~~~~a~~~l~~~~~~~~ 338 (338) T protein:vir:78 317 FGWLLGDKQAFVKFVDDEDPDA 338 (338) T ss_pred eccEeecccceEEEecccCCCC Confidence 9999999999999987655555 No 89 >protein:vir:101291 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908831;genbank:gi:118725095;genbank:GeneID:4555862 Probab=100.00 E-value=2.5e-52 Score=303.51 Aligned_cols=339 Identities=12% Similarity=0.027 Sum_probs=222.8 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhhhcccccc Q lcl|Aclame:pro 1 MGVKLTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNK-RDNEKVRRDALREQLVEAQAEQVVNMREEEK 79 (408) Q Consensus 1 M~~~~~i~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (408) |.++.. +++.+++.+..+.+++. ... ++..+..++ ++.+.. ..+.+... +....... 
.+ T Consensus 1 m~ik~~-~~~~~~~~e~~~~~~~~-----------~~~-~~~~~~~~~~~~~~~~---~~~~~~~~-e~~~~~~~---~~ 60 (381) T protein:vir:10 1 MTINLS-ETFANAKNEFINAVNNG-----------EPQ-ERQNELYGDMINQLFE---ETKLQAKA-EAERVSSL---PK 60 (381) T ss_pred CchhhH-HHHHHHHHHHHHHHhhh-----------hhh-HHHHHHHHHHHHhhhh---hHHHHHHH-HHHHHHHh---cc Confidence 777643 33333332222222111 000 011111111 111100 00000000 00000000 00 Q ss_pred cccccchhhhHHHHHHHHHHHhhcchhhHHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeecccCc Q lcl|Aclame:pro 80 GPLNKSENELKDKFVKDFVNMVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSN 159 (408) Q Consensus 80 ~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~ 159 (408) .......++++.| .+...+++++||++||+++.++|++.+++.++|+++|++.++++ T Consensus 61 -----~~~~lt~~e~~~~----------------~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~~-- 117 (381) T protein:vir:10 61 -----SAQSLSANQRSFF----------------MDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL-- 117 (381) T ss_pred -----CcccccHHHHHHH----------------HHHhcccCCCCceecCHHHHHHHHHHHHhhccceeheeeEecCc-- Confidence 0111112223322 22345678899999999999999999999999999999998753 Q ss_pred cceEEeeccCCccccchhcccccccccccccceeeeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 160 GSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAI 239 (408) Q Consensus 160 g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~ 239 (408) ...++.. 
+..+.+.|++|+++++..+.++|+++++++|+++++++||++||+|+.++|++||.++|+++++++++.+| T Consensus 118 -~~~i~~~-~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~~~~~is~elL~Ds~~~ie~~i~~~la~~~a~~~~~a~ 195 (381) T protein:vir:10 118 -RLKFLKS-ETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAF 195 (381) T ss_pred -ceEEEEe-cCCcceeeecccccccccccccceeeeecceeEEeechhhHHHhhcCHHHHHHHHHHHHHHHHHHHhhhee Confidence 3455554 35578899999998876677999999999999999999999999999999999999999999999999999 Q ss_pred hhccccccchhhh----------------------hh-------HHHHHHHHHHhhh-------hhccCCCEEEEcHHHH Q lcl|Aclame:pro 240 IEVMKAAPKKPTI----------------------AK-------FDDVITMINTAVD-------PAIIATSSLLTNQSGL 283 (408) Q Consensus 240 ~~g~g~~~~~~~~----------------------~~-------~d~i~~~~~~~l~-------~~~~~~a~~~~n~~~~ 283 (408) ++|+|++.|.+-. .+ ++.+...+ ..+. ..|..+++|+|||.++ T Consensus 196 i~G~G~~qP~Gil~~~~~~~~~~~g~~~~~~~~~t~t~~~~~~~~~~l~~~~-~~~~~~~~~~~~~~~~~a~~~mn~~t~ 274 (381) T protein:vir:10 196 LKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVF-KYHSTNEKGKSVAVKGNVTMVVNPSDA 274 (381) T ss_pred EeccCCCCceeeeeccCcccccccccccccccccccccccchhhHHHHHHHH-HhhccccccccccccCceEEEEccccH Confidence 9999987653211 01 11222222 2222 2577889999999999 Q ss_pred HHHHhhh---cccCceeeccccccCCcccccccceEeeccccccccccCcceEEEEehhcceEeeeccceEEEEeccchh Q lcl|Aclame:pro 284 NKLALVK---TAEGKYLLEPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAG 360 (408) Q Consensus 284 ~~l~~lk---d~~G~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~ 360 (408) ..|+.++ +++|+|+|... +|.||+. 
+..+| ++.++||||++ |.+++|++++++++++.+ T Consensus 275 ~~l~~~~~~~~~~G~~v~~l~---------~g~~vv~--s~~~p-----~~~iifgDfs~-Y~i~~r~~~~i~~~~~~~- 336 (381) T protein:vir:10 275 FEVQAQYTHLNANGVYVTALP---------FNLNVIE--STVQE-----AGKVLTYVKGL-YDGYLAGGINVQKFKETL- 336 (381) T ss_pred HhhccccccCCCCCceeecCC---------CCceEEe--cCCCC-----cCcEEEEeccc-EEEEEecccEEEeechhH- Confidence 8888765 66788886421 2445554 33344 34589999998 788999999999998765 Q ss_pred hhhhceeeEEEEeeeCcEEecccceEEEEeecc--ccCCCCccCCC Q lcl|Aclame:pro 361 AFETDTTKIRVIDRFDVKATDSEALVAGSFSAI--ADQVGNFKTTT 404 (408) Q Consensus 361 ~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~--~~~~~~~~~~~ 404 (408) |.+|++.||+..|+||++++++||++++++.. .+.+..+..+- T Consensus 337 -~~~d~~~f~a~~r~dg~~~~~~A~~v~~l~~~~~~~~~~~~~~~~ 381 (381) T protein:vir:10 337 -ALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTEETL 381 (381) T ss_pred -hhcCCeEEEEEEEEcCEEecCceEEEEEEEecCCCcCcccccccC Confidence 99999999999999999999999999887763 22222222222 No 90 >protein:vir:9509 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835556;genbank:gi:30043951;genbank:GeneID:1260537 Probab=100.00 E-value=2.5e-52 Score=303.51 Aligned_cols=339 Identities=12% Similarity=0.027 Sum_probs=222.8 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhhhcccccc Q lcl|Aclame:pro 1 MGVKLTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNK-RDNEKVRRDALREQLVEAQAEQVVNMREEEK 79 (408) Q Consensus 1 M~~~~~i~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (408) |.++.. +++.+++.+..+.+++. ... ++..+..++ ++.+.. ..+.+... +....... 
.+ T Consensus 1 m~ik~~-~~~~~~~~e~~~~~~~~-----------~~~-~~~~~~~~~~~~~~~~---~~~~~~~~-e~~~~~~~---~~ 60 (381) T protein:vir:95 1 MTINLS-ETFANAKNEFINAVNNG-----------EPQ-ERQNELYGDMINQLFE---ETKLQAKA-EAERVSSL---PK 60 (381) T ss_pred CchhhH-HHHHHHHHHHHHHHhhh-----------hhh-HHHHHHHHHHHHhhhh---hHHHHHHH-HHHHHHHh---cc Confidence 777643 33333332222222111 000 011111111 111100 00000000 00000000 00 Q ss_pred cccccchhhhHHHHHHHHHHHhhcchhhHHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeecccCc Q lcl|Aclame:pro 80 GPLNKSENELKDKFVKDFVNMVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSN 159 (408) Q Consensus 80 ~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~ 159 (408) .......++++.| .+...+++++||++||+++.++|++.+++.++|+++|++.++++ T Consensus 61 -----~~~~lt~~e~~~~----------------~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~~-- 117 (381) T protein:vir:95 61 -----SAQSLSANQRSFF----------------MDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL-- 117 (381) T ss_pred -----CcccccHHHHHHH----------------HHHhcccCCCCceecCHHHHHHHHHHHHhhccceeheeeEecCc-- Confidence 0111112223322 22345678899999999999999999999999999999998753 Q ss_pred cceEEeeccCCccccchhcccccccccccccceeeeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 160 GSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAI 239 (408) Q Consensus 160 g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~ 239 (408) ...++.. 
+..+.+.|++|+++++..+.++|+++++++|+++++++||++||+|+.++|++||.++|+++++++++.+| T Consensus 118 -~~~i~~~-~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~~~~~is~elL~Ds~~~ie~~i~~~la~~~a~~~~~a~ 195 (381) T protein:vir:95 118 -RLKFLKS-ETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAF 195 (381) T ss_pred -ceEEEEe-cCCcceeeecccccccccccccceeeeecceeEEeechhhHHHhhcCHHHHHHHHHHHHHHHHHHHhhhee Confidence 3455554 35578899999998876677999999999999999999999999999999999999999999999999999 Q ss_pred hhccccccchhhh----------------------hh-------HHHHHHHHHHhhh-------hhccCCCEEEEcHHHH Q lcl|Aclame:pro 240 IEVMKAAPKKPTI----------------------AK-------FDDVITMINTAVD-------PAIIATSSLLTNQSGL 283 (408) Q Consensus 240 ~~g~g~~~~~~~~----------------------~~-------~d~i~~~~~~~l~-------~~~~~~a~~~~n~~~~ 283 (408) ++|+|++.|.+-. .+ ++.+...+ ..+. ..|..+++|+|||.++ T Consensus 196 i~G~G~~qP~Gil~~~~~~~~~~~g~~~~~~~~~t~t~~~~~~~~~~l~~~~-~~~~~~~~~~~~~~~~~a~~~mn~~t~ 274 (381) T protein:vir:95 196 LKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVF-KYHSTNEKGKSVAVKGNVTMVVNPSDA 274 (381) T ss_pred EeccCCCCceeeeeccCcccccccccccccccccccccccchhhHHHHHHHH-HhhccccccccccccCceEEEEccccH Confidence 9999987653211 01 11222222 2222 2577889999999999 Q ss_pred HHHHhhh---cccCceeeccccccCCcccccccceEeeccccccccccCcceEEEEehhcceEeeeccceEEEEeccchh Q lcl|Aclame:pro 284 NKLALVK---TAEGKYLLEPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAG 360 (408) Q Consensus 284 ~~l~~lk---d~~G~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~ 360 (408) ..|+.++ +++|+|+|... +|.||+. 
+..+| ++.++||||++ |.+++|++++++++++.+ T Consensus 275 ~~l~~~~~~~~~~G~~v~~l~---------~g~~vv~--s~~~p-----~~~iifgDfs~-Y~i~~r~~~~i~~~~~~~- 336 (381) T protein:vir:95 275 FEVQAQYTHLNANGVYVTALP---------FNLNVIE--STVQE-----AGKVLTYVKGL-YDGYLAGGINVQKFKETL- 336 (381) T ss_pred HhhccccccCCCCCceeecCC---------CCceEEe--cCCCC-----cCcEEEEeccc-EEEEEecccEEEeechhH- Confidence 8888765 66788886421 2445554 33344 34589999998 788999999999998765 Q ss_pred hhhhceeeEEEEeeeCcEEecccceEEEEeecc--ccCCCCccCCC Q lcl|Aclame:pro 361 AFETDTTKIRVIDRFDVKATDSEALVAGSFSAI--ADQVGNFKTTT 404 (408) Q Consensus 361 ~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~--~~~~~~~~~~~ 404 (408) |.+|++.||+..|+||++++++||++++++.. .+.+..+..+- T Consensus 337 -~~~d~~~f~a~~r~dg~~~~~~A~~v~~l~~~~~~~~~~~~~~~~ 381 (381) T protein:vir:95 337 -ALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTEETL 381 (381) T ss_pred -hhcCCeEEEEEEEEcCEEecCceEEEEEEEecCCCcCcccccccC Confidence 99999999999999999999999999887763 22222222222 No 91 >protein:vir:78350 Length: 383 # NCBI annotation: Cps # Family: family:all:635 # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468644;genbank:gi:157325222;genbank:GeneID:5601696 Probab=100.00 E-value=6.4e-52 Score=301.26 Aligned_cols=353 Identities=9% Similarity=0.031 Sum_probs=225.8 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhhhcccccc Q lcl|Aclame:pro 1 MGVKLTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDA-LREQLVEAQAEQVVNMREEEK 79 (408) Q Consensus 1 M~~~~~i~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~ 79 (408) |+++ +++..+++.+ +.+++.+. .+.++..+++.+.+.+.++.+...+.. .+.+..+........ 
T Consensus 1 M~~k--l~~~~~~~~e---~~~~l~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------ 65 (383) T protein:vir:78 1 MTIK--LKNNLANYEE---KRTAFVNA----VKNEDTQEIQNKAYVEMVDAMAADIMEQAKKEARQEADAYISA------ 65 (383) T ss_pred Cchh--HHHHHHHHHH---HHHHHHHH----HhccChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh------ Confidence 7766 3333333333 33332221 112222222222222222222211111 011111100000000 Q ss_pred cccccchhhhHHHHHHHHHHHhhcchhhHHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeecccCc Q lcl|Aclame:pro 80 GPLNKSENELKDKFVKDFVNMVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSN 159 (408) Q Consensus 80 ~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~ 159 (408) .........++++.| +++..+++++||++||+++.++|++.+++.++|+++|++.++++ T Consensus 66 ---~~g~~~lt~~e~~~~----------------~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~l~~~~~v~~~~~-- 124 (383) T protein:vir:78 66 ---SRTDKNITNEEIKFF----------------NDINKEVGYKEETLLPQTVVDEIFEDLTTEHPFLASIGMRTTGL-- 124 (383) T ss_pred ---cCChhhhhHHHHHHH----------------HHHhccCCCCCccccCHHHHHHHHHHHHhhccceeeeeeEecCC-- Confidence 000111112222222 23456778899999999999999999999999999999988753 Q ss_pred cceEEeeccCCccccchhcccccccccccccceeeeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 160 GSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAI 239 (408) Q Consensus 160 g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~ 239 (408) ...++.. 
+..+.+.|++|+++++..+.++|+++++++++++++++||++||+|+.+++++||.+.|+++++++++.+| T Consensus 125 -~~~i~~~-~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~~~i~is~ell~Ds~~~ie~~i~~~l~~~~a~~~~~a~ 202 (383) T protein:vir:78 125 -RTKFLKS-ETSGVAVWGKIFGEIKGQLDATFSDEESIQNKLTAFVVVPKDLEKFGPAWVKRFVVTQIEEAFAVALESAY 202 (383) T ss_pred -ceEEEEE-cCCcceEEeecccccccccCcceeeEeecceeeEeeccchHHHhhccHHHHHHHHHHHHHHHHHHHHhhhe Confidence 3445554 34567789999988876678999999999999999999999999999999999999999999999999999 Q ss_pred hhccccccchhhh----------------------hhHHHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHhhh---cccC Q lcl|Aclame:pro 240 IEVMKAAPKKPTI----------------------AKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVK---TAEG 294 (408) Q Consensus 240 ~~g~g~~~~~~~~----------------------~~~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~lk---d~~G 294 (408) ++|+|.+.|.+-. .+.+++.. +...+ ..+..++.|+||...+..+++++ +..+ T Consensus 203 i~G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~l-~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~ 280 (383) T protein:vir:78 203 IVGDGNDKPIGLNRKVGKGSTVVDGVYAEKAATGTLTFANPKT-TVNEL-TDVYKYHSVKENGHPLNVAGKVTLLVNPTD 280 (383) T ss_pred EeccCCCCceeeeeccCCcccccccccccccccchhhhhhhHH-HHHHH-HHHHhccchhcccchhhhcCceEEEEcCcc Confidence 9999977553211 11223322 22323 34555556666666666666554 2112 Q ss_pred ceeecccc----ccCCcccccccceEeeccccccccccCcceEEEEehhcceEeeeccceEEEEeccchhhhhhceeeEE Q lcl|Aclame:pro 295 KYLLEPDP----TKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIR 370 (408) Q Consensus 295 ~~~~~~~~----~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r 370 (408) .|.|+|.. .+|.+.+++|+|+.++.+..+|. 
+.++||||++ |.+.+|++++++++++.+ |.+|++.|| T Consensus 281 ~~~~~~~~~~~~~~G~~~t~l~~~~~iv~s~~~p~-----~~iifgdfs~-Y~i~~r~~~~i~~~~~~~--f~~d~~~f~ 352 (383) T protein:vir:78 281 AWDVKKQYTSLNANGVYVTALPFNLNIIESLFVPE-----KKAISYVAER-YDALIGGPLDIGTYDQTL--AIEDLNLYA 352 (383) T ss_pred hhhhccchhccCCCCceeeecCCCceEEecCCCCc-----ccEEEeeccc-eEEEecccceEEecchhh--hhcCceEEE Confidence 23344322 33445578899987665555553 4589999998 778899999999988765 999999999 Q ss_pred EEeeeCcEEecccceEEEEeeccccCCCCccCC Q lcl|Aclame:pro 371 VIDRFDVKATDSEALVAGSFSAIADQVGNFKTT 403 (408) Q Consensus 371 ~~~r~d~~v~~~~a~~~l~~~~~~~~~~~~~~~ 403 (408) +..|+||++++|+||++++++-.. ...|+.. T Consensus 353 ~~~r~dG~~~~~~A~~vl~~~~~~--~~~~~~~ 383 (383) T protein:vir:78 353 AKQFAYGKAKDDKAAAVWTLNINP--AEQTPEG 383 (383) T ss_pred EEEEEcCEEecCCeEEEEEEEecC--CCCCCCC Confidence 999999999999999999988322 1222222 No 92 >protein:vir:2504 Length: 305 # NCBI annotation: major capsid subunit gp9 # Family: family:all:507 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569745;genbank:gi:18496895;genbank:GeneID:932268 Probab=100.00 E-value=8.7e-53 Score=306.00 Aligned_cols=271 Identities=14% Similarity=0.081 Sum_probs=218.5 Q ss_pred hhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeecccCccceEEeeccCCccccchhcccccccc----cccccc Q lcl|Aclame:pro 116 ETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPD----LDNPQL 191 (408) Q Consensus 116 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~E~~~~~~----~~~~~f 191 (408) +...++++||++||++++.+|++.+++.++|+++++++++.+.+ +.+|... 
..+.+.|++|++..++ .+.++| T Consensus 1 ma~~t~~~gg~liP~~~~~~Ii~~~~~~s~l~~l~~~~~~~~~~--~~~p~~~-~~~~a~wv~E~~~~~~~~~~~s~~~f 77 (305) T protein:vir:25 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKT--THLPVLA-TLPEADWVGESATDPKGVKPTSKVTW 77 (305) T ss_pred CCCccCCccceecCHHHHHHHHHHHHhhchhhhhcceeeccCCc--EEEEEEe-CCcceEEeecccccccccccccccce Confidence 77888899999999999999999999999999999999987654 4455544 3467899999987554 346899 Q ss_pred eeeeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhccccccch-------------------hhh Q lcl|Aclame:pro 192 TIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVMKAAPKK-------------------PTI 252 (408) Q Consensus 192 ~~v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~~g~g~~~~~-------------------~~~ 252 (408) ++++++++|++++++||+||++|+.+++++||.++|++++++++|.++++|+|++.+. .+. T Consensus 78 ~~i~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~~~a~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (305) T protein:vir:25 78 ANRTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGGV 157 (305) T ss_pred eeEEeeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHhhhheeccCCCCCccccccccccccccccccccccc Confidence 9999999999999999999999999999999999999999999999999999864321 011 Q ss_pred hhHHHHHHHHHH---hhhhhccCCCEEEEcHHHHHHHHhhhcccCceeeccccccCCcccccccceEeeccccccccccC Q lcl|Aclame:pro 253 AKFDDVITMINT---AVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNSYLIKGKQVIVVADRWLPNTGST 329 (408) Q Consensus 253 ~~~d~i~~~~~~---~l~~~~~~~a~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~ 329 (408) ...+++...+.. .+...+.....|+|||.++..|+++||++|+|+|++ ++|+|+||++++. +|. 
..+ T Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~i~~~-------~~l~G~Pv~~~~~--~~~-~~~ 227 (305) T protein:vir:25 158 ANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFRD-------DSFAGFRTFFNRN--GAW-DAD 227 (305) T ss_pred hhhhHHHHHHHHHHHhhhhcccccceeEecHHHHHHHHHhhccCCceeecC-------CcccccceEEcCc--cCC-CCC Confidence 112333333322 222333344459999999999999999999999975 3899999998754 343 456 Q ss_pred cceEEEEehhcceEeeeccceEEEEeccc--------hhhhhhceeeEEEEeeeCcEEecccceEEEEeeccccCCCCc Q lcl|Aclame:pro 330 VYPLYYGDMSQAITLFDRENMSLLPTNIG--------AGAFETDTTKIRVIDRFDVKATDSEALVAGSFSAIADQVGNF 400 (408) Q Consensus 330 ~~~~~~gd~~~~~~~~~~~~~~i~~~~~~--------~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~~~ 400 (408) +.+++||||+++ .+..+++++++++++. +..|++|++.+|++.|+|+.+.+|+||++++..+++...+.+ T Consensus 228 ~~~~~~gd~s~~-~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~R~~~r~~~~v~~p~a~v~~~~~~~~~~~pa~ 305 (305) T protein:vir:25 228 AAIEVIADSSRV-KIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAVVAPAA 305 (305) T ss_pred ccEEEEEecceE-EEEEecCeEEEEeeeeeeecCCceeeeeecCcEEEEEEEeecceeeCcccEEEEccccccccCCCC Confidence 778999999985 5578999999988753 246999999999999999999999999999988876544433 No 93 >protein:vir:5739 Length: 366 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892050;genbank:gi:33770513;interpro:IPR006444;uniprot:Q7Y410;genbank:GeneID:1732928 Probab=100.00 E-value=1.1e-52 Score=305.46 Aligned_cols=317 Identities=14% Similarity=0.154 Sum_probs=221.0 Q ss_pred HHHHHHHHhhhc----ccccccccccchhhhHHHHHHHHHHHhhcch---------hh-HHHHHHHHhhccccccCceec Q lcl|Aclame:pro 63 LVEAQAEQVVNM----REEEKGPLNKSENELKDKFVKDFVNMVRNPM---------AF-MNTVSSKTETSGSDSAAGLTI 128 (408) Q Consensus 63 ~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~---------~~-~~~~~~~a~~~~t~~~gg~~v 128 (408) +.+..+...... ....+.............+.++... ..+. .. ......++.. 
.+.++||++| T Consensus 1 ~a~~~a~~~~~~~~~~~~~~~~~~~~~kg~~~~~~~~a~a~--~~g~~~~a~~~a~~~~~~~~~~~a~~-~~~~~Gg~lv 77 (366) T protein:vir:57 1 MAAAVAVPVKAHSVAPGIIIKEELQQYKGAGMTRMVMSIAA--GKGNLADAAKFAATELGDTGLSMAIS-TAAGSGGALI 77 (366) T ss_pred CcccccccccccccccccccccccccccchhHHHHHHHHHh--cccchhHHHHHHHHhhcchhhhhhcc-ccccCCcccc Confidence 111000000000 0000000000000000111111100 0000 00 0111123333 3456799999 Q ss_pred chhhhhhhhhhhhhhhhhhhh-hceeecccCccceEEeeccCCccccchhcccccccccccccceeeeechheeeeehHH Q lcl|Aclame:pro 129 PQDIRTMINTLVRQYDSLQQY-VRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITA 207 (408) Q Consensus 129 P~~~~~~ii~~~~~~~~l~~~-~~~~~~~~~~g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~i 207 (408) |+++.++|++.+++.++++++ ++++++. ++.+.+|+..+ .+.++|++|++.+|++ .++|++|+++++|++++++| T Consensus 78 P~~~~~~ii~~l~~~s~l~~lg~~~v~~~--~g~~~~p~~t~-~~~a~wv~E~~~~~~s-~~~f~~i~~~~~k~~~~~~i 153 (366) T protein:vir:57 78 PQNMQNEVIELLRDRTVVRILGARSIPLP--NGNLSMPRLSG-GATAGYVGEGKDVVAT-GATFDDVKLSAKTMIALVPV 153 (366) T ss_pred chhHHHHHHHHHhhhcchhhhceeeeecC--CCceEEEEEeC-CcceeeeccCcccccc-ccceeEEEEeeEEEEEeehh Confidence 999999999999999999987 7777654 45677777654 4678999999999975 58999999999999999999 Q ss_pred HHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhccccccchhhh-----------------h---hHHHHHHHHH--Hh Q lcl|Aclame:pro 208 TNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVMKAAPKKPTI-----------------A---KFDDVITMIN--TA 265 (408) Q Consensus 208 S~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~~g~g~~~~~~~~-----------------~---~~d~i~~~~~--~~ 265 (408) |+|||+|+.+++++||.++|++++++++|.+|++|+|++..+.+. . ..+..++.+. .. 
T Consensus 154 S~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~ 233 (366) T protein:vir:57 154 SNQLIGRAGFNVEQLLLGDILSAIATREDKAFLRDDGTGDTPKGMKAVATAANRLVAWTGTAINLTTIDEYLDSLILKHM 233 (366) T ss_pred hHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccceeeccccccceeeccccccchhhHHHHHHHHHHhhh Confidence 999999999999999999999999999999999999865221111 1 1222233322 22 Q ss_pred hhhhccCCCEEEEcHHHHHHHHhhhcccCceeeccccccCCcccccccceEeecccccccc---ccCcceEEEEehhcce Q lcl|Aclame:pro 266 VDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNSYLIKGKQVIVVADRWLPNT---GSTVYPLYYGDMSQAI 342 (408) Q Consensus 266 l~~~~~~~a~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~---~~~~~~~~~gd~~~~~ 342 (408) ....+..++.|+|||.++..|+++||++|+|+|.+ .. +++|+|+||++++. +|.. ..+...++||||++++ T Consensus 234 ~~~~~~~~a~~vmn~~~~~~L~~lkd~~G~~l~~~-~~---~g~l~G~Pvv~s~~--ip~~~~~~~~~~~i~~gdfs~~~ 307 (366) T protein:vir:57 234 DSNSNMIRCGWGLSNRTYMTLFGLRDGNGNKVYPE-MS---QGILKGYPIQRTSA--IPANLGDDGNESEIYFCDFNDVV 307 (366) T ss_pred ccccccccCEEEecHHHHHHHHhhhccCCceeccC-CC---CCeecceeeEEccc--cccccccCCCccEEEEEecceEE Confidence 34556788999999999999999999999999953 32 34899999998653 5543 3456779999999855 Q ss_pred EeeeccceEEEEeccch---------hhhhhceeeEEEEeeeCcEEecccceEEEEeecc Q lcl|Aclame:pro 343 TLFDRENMSLLPTNIGA---------GAFETDTTKIRVIDRFDVKATDSEALVAGSFSAI 393 (408) Q Consensus 343 ~~~~~~~~~i~~~~~~~---------~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~ 393 (408) +.+|++++++++++.. 
..|++|++.+|+++|+||++.||+||++++-..- T Consensus 308 -i~~~~~i~i~~~~ea~~~~~~g~~~~~f~~~~~~iR~~~~~d~~v~~~~a~~~lt~~~~ 366 (366) T protein:vir:57 308 -IGEDGMMKVDFSTEATYKDADGQLVSAFARNQSLIRVVTEHDIGFRHPEGLVLGTGVIW 366 (366) T ss_pred -EEEecceEEEEeeccccccccccchhhhhcCceeEEeeeeeCcEeeccccEEEEecccC Confidence 6789999999887632 4699999999999999999999999999987777 No 94 >protein:vir:95763 Length: 297 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950590;genbank:gi:119953785;genbank:GeneID:5076833 Probab=100.00 E-value=1.1e-52 Score=305.51 Aligned_cols=272 Identities=13% Similarity=0.102 Sum_probs=225.4 Q ss_pred HHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeecccCccceEEeeccCCccccchhcccccccccc Q lcl|Aclame:pro 108 MNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLD 187 (408) Q Consensus 108 ~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~E~~~~~~~~ 187 (408) ......++.+..+++++|.+||++++++|++.+++.++|+++|+++++.+.. ...++.. ...+.+.|++|++++++. T Consensus 1 m~~~~~~~~~~~~t~~~~~lvP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~-~~~~~~~-~~~~~a~~v~Eg~~~~~~- 77 (297) T protein:vir:95 1 MTVQTFNPENVLVSQKKDGTLHKEFTDIIMKEVAQNSLVMQLGQYQEMEGEQ-EKTVYVQ-TDGISAYWVNETEKIKTD- 77 (297) T ss_pred CCccccccccccccCCCcceechhHHHHHHHHHHhhchhhhhcceeecCCCc-cEEEEEE-cCCceeEEeecCcccccc- Confidence 1112234555666778889999999999999999999999999999987654 3334443 345677999999999975 Q ss_pred cccceeeeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhccccccch-------------hhhhh Q lcl|Aclame:pro 188 NPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVMKAAPKK-------------PTIAK 254 (408) Q Consensus 188 ~~~f~~v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~~g~g~~~~~-------------~~~~~ 254 (408) .++|+++++++++++++++||+|+++|+.+++++||.++|++++++++|.++++|+|++.+. 
.+..+ T Consensus 78 ~~~f~~v~l~~~k~~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~l~G~g~~~~~gi~~~~~~~~~~~~~~~t 157 (297) T protein:vir:95 78 KPEVVPVTLKAHKLGIILVTSREALNYTWKKFFEDMKPQIVEAFYKKIDEAGLLGHDTPFANSVAKAAKDANKVIGGPIN 157 (297) T ss_pred ccceeEEEEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhcccCCcccccccccccccceecccccC Confidence 58999999999999999999999999999999999999999999999999999999875431 23346 Q ss_pred HHHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHhhhcccCceeeccccccCCcccccccceEeeccccccccccCcceEE Q lcl|Aclame:pro 255 FDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLY 334 (408) Q Consensus 255 ~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~ 334 (408) ++++++++. .+...+..+++|+|||+++.+|++|+|++|+|+|.+. +++|+|+||++.. ....+++.++ T Consensus 158 ~~~i~~~~~-~l~~~~~~~~~~v~~~~~~~~L~~l~d~~G~~i~~~~-----~~~l~G~Pv~~~~-----~~~~~~~~~~ 226 (297) T protein:vir:95 158 YDNILKLQD-ALYDADVEPNAFVSKIQNRSALREARDGNKVSIYDKA-----ANTIDGITTVDLK-----SARFEKGDLL 226 (297) T ss_pred HHHHHHHHH-HhhhccCCcCEEEEcHHHHHHHHHhhccCCceeecCC-----CCcccceeeEeec-----CCCCCCceEE Confidence 888988875 4778888889999999999999999999999999653 4689999998753 3345667799 Q ss_pred EEehhcceEeeeccceEEEEeccch------------hhhhhceeeEEEEeeeCcEEecccceEEEEeeccc Q lcl|Aclame:pro 335 YGDMSQAITLFDRENMSLLPTNIGA------------GAFETDTTKIRVIDRFDVKATDSEALVAGSFSAIA 394 (408) Q Consensus 335 ~gd~~~~~~~~~~~~~~i~~~~~~~------------~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~ 394 (408) ||||++++ +..+++++++++++.+ +.|++|++.+|+++|+|+++++|+||++|+..+.- T Consensus 227 ~gd~s~~~-~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~at~~ 297 (297) T protein:vir:95 227 AGDFDNLI-YGVPYNITYKISEEGQISTITNADGTPINLFEQEMIAIRATMDIAVMITKTDAFAKLTPAERV 297 (297) T ss_pred EEecccEE-EEEecCeEEEEeeccccccccccCccchhhhhcCcEEEEEEEEeccEeecccceEEEeecCCC Confidence 99999854 6789999999987653 56999999999999999999999999998633322 No 95 >protein:vir:104085 Length: 320 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:1656 # MgeName: Che12 
# Cross-refs: genbank:acc:YP_655596;genbank:gi:109392467;genbank:GeneID:4156953 Probab=100.00 E-value=4.3e-52 Score=302.20 Aligned_cols=284 Identities=14% Similarity=0.103 Sum_probs=226.6 Q ss_pred hhcchhhHHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeecccCccceEEeeccCCccccchhccc Q lcl|Aclame:pro 101 VRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAED 180 (408) Q Consensus 101 ~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~E~ 180 (408) +..+.. ...+.+++...+++++|.+||++++++|++.+++.++|+++++++++.+.+.. +|... ..+.+.|++|+ T Consensus 1 ~~~~~~--~~~~~~~~~~t~~~~~~~~ip~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~--~p~~~-~~~~a~~v~E~ 75 (320) T protein:vir:10 1 MAAGTA--FQVDHAQIAQTGDTMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKVPMGTTGQK--IPHWI-GDVSAQWIGEG 75 (320) T ss_pred CCCCcc--CCHHHHHhhccccccccccccHHHHHHHHHHHHhccchhhhcceeeccCCceE--EEEEe-CCcceEEecCC Confidence 222221 12355556666667777789999999999999999999999999998765544 45443 34567999999 Q ss_pred ccccccccccceeeeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhccccccchh---------- Q lcl|Aclame:pro 181 GKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVMKAAPKKP---------- 250 (408) Q Consensus 181 ~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~~g~g~~~~~~---------- 250 (408) +++|+. .++|++++++++|++++++||+|+++|+.+++++||.+.|++++++++|+++++|+|++.+.. 
T Consensus 76 ~~~~~~-~~~f~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~l~~a~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~ 154 (320) T protein:vir:10 76 DMKPIT-KGNMTSQNIAPHKIATIFVASAETVRANPANYLGTMRTKVATAFAMAFDSAALNGTDSPFPTYLAQTTKSVSL 154 (320) T ss_pred cccccc-ccceeEEEEeeEEEEEeehhhHHHHhcChHHHHHHHHHHHHHHHHHHHHHHhhcccCCCCCcccccccccccc Confidence 999974 589999999999999999999999999999999999999999999999999999998654311 Q ss_pred ---h------hhhHHHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHhhhcccCceeeccccccCC-----cccccccceE Q lcl|Aclame:pro 251 ---T------IAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPN-----SYLIKGKQVI 316 (408) Q Consensus 251 ---~------~~~~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~-----~~~l~G~pv~ 316 (408) + ....++.+..+...+...+..+++|+|||++|.+|+++||++|+|+|++....+. +.+++|+||+ T Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~~~~~~i~g~pv~ 234 (320) T protein:vir:10 155 ADPGGATASDLTAYDAVAVNGLSLLVNAKKKWTHTLLDDIVEPILNGAKDKNGRPLFIESTYTDENSPFRAGRIVSRPTI 234 (320) T ss_pred eecccccccccccHHHHHHHHHhhhhcccCCCcEEEEcHHHHHHHHHhhccCCceeeccccccCccccccCceeeeeeeE Confidence 0 0112333333445677889999999999999999999999999999987655543 3579999998 Q ss_pred eeccccccccccCcceEEEEehhcceEeeeccceEEEEeccch------------hhhhhceeeEEEEeeeCcEEecccc Q lcl|Aclame:pro 317 VVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGA------------GAFETDTTKIRVIDRFDVKATDSEA 384 (408) Q Consensus 317 ~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~------------~~f~~~~~~~r~~~r~d~~v~~~~a 384 (408) +++. 
+| .++..++||||++++ +..|++++++++++.+ +.|++|++.||+++|+|+++++|+| T Consensus 235 ~~~~--~~---~~~~~~~~gd~~~~~-~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~~~d~~v~~~~a 308 (320) T protein:vir:10 235 LSDH--VA---DGTTVGYMGDFRNVI-WGQVGGLSFDVTDQATLNLGTPTEPNFVSLWQHNLVAVRVEAEYAFHNNDKDA 308 (320) T ss_pred ecCC--CC---CCceEEEEeecceEE-EEEecCeEEEEeecceeeeccccccccchhhhcCcEEEEEEEeeccEEecccc Confidence 8643 33 344567899999865 5679999999987654 4699999999999999999999999 Q ss_pred eEEEEeeccccC Q lcl|Aclame:pro 385 LVAGSFSAIADQ 396 (408) Q Consensus 385 ~~~l~~~~~~~~ 396 (408) |++|+..++.++ T Consensus 309 ~~~l~~~~ap~~ 320 (320) T protein:vir:10 309 FVKLTNVVTPDA 320 (320) T ss_pred eEEEEeccCCCC Confidence 999986554333 No 96 >protein:vir:78223 Length: 333 # NCBI annotation: Putative major head protein # Family: family:all:966 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491666;genbank:gi:157786490;genbank:GeneID:5625701 Probab=100.00 E-value=6.4e-52 Score=301.24 Aligned_cols=283 Identities=12% Similarity=0.023 Sum_probs=218.2 Q ss_pred hhhHHHHHHHHhhcccccc------CceecchhhhhhhhhhhhhhhhhhhhhceeecccCccceEEeeccCCccccchhc Q lcl|Aclame:pro 105 MAFMNTVSSKTETSGSDSA------AGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDA 178 (408) Q Consensus 105 ~~~~~~~~~~a~~~~t~~~------gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~ 178 (408) +... .|.++...++... ++.+||+++.++|++.+++.++++++++++++++.. +.+|+.. 
..+.+.|++ T Consensus 1 ~a~l--~el~~~~~~~~~~g~~~~~~~~liP~~~~~~ii~~l~~~s~l~~~~~~~~~~~~~--~~~p~~~-~~~~a~~v~ 75 (333) T protein:vir:78 1 MATL--NELLPNSAGSNHQGRLAHVPSDLLPKEIVGPIFDKAQESSLVLRMGEQIPISYGE--TIIPTTV-KRPEVGQVG 75 (333) T ss_pred Cchh--HHhhhhcccccccCceecCCccccchhHHHHHHHHHHhhchhhhhcceeeccCCc--eEEEEEe-CCceeEeec Confidence 1111 2334444444333 344899999999999999999999999999987654 4455443 334455555 Q ss_pred c--------cccccccccccceeeeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhccccccch- Q lcl|Aclame:pro 179 E--------DGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVMKAAPKK- 249 (408) Q Consensus 179 E--------~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~~g~g~~~~~- 249 (408) | ++.+++ +.++|++++++++|++++++||+|+++|+.+++++||.++|++++++++|.++++|+|++++. T Consensus 76 eg~~~~~~e~~~~~~-~~~~f~~i~l~~~kl~~~~~is~ell~~s~~~~~~~i~~~la~ai~~~~d~~~l~G~g~~~~~~ 154 (333) T protein:vir:78 76 VGTSNEQREGGLKPL-SGTAWDTRSVSPIKLATIVTVSEEFARMNPSGLYTKLQGDLAYAIGRGIDLAVFHGKSPLTGSA 154 (333) T ss_pred Ccccccccccccccc-cccceeEEEEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhcccCCCCCcc Confidence 5 456664 568999999999999999999999999999999999999999999999999999999865321 Q ss_pred ---------------------hhhhhHHHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHh---hhcccCceeeccccccC Q lcl|Aclame:pro 250 ---------------------PTIAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLAL---VKTAEGKYLLEPDPTKP 305 (408) Q Consensus 250 ---------------------~~~~~~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~---lkd~~G~~~~~~~~~~~ 305 (408) .+...++++++++.......+...+.|+|||.+|..|++ ++|++|+|+|.+.+..+ T Consensus 155 ~~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~~~~~~d~~G~~i~~~~~~~~ 234 (333) T protein:vir:78 155 LQGIDTDNVIANTTNVDYLQETGDPLLDRLLDGYDLVSANTDVEFNGWAVDPRFRAHLLRAQAYRDANGNVDPSRINLAA 234 (333) T ss_pred cccccccccccccccccccccccchhHHHHHHHHHhhccccccCceEEEEcchHHHHHHHHhhhcCCCCceeecCccccC Confidence 112236677777654333345556689999999877654 78999999999888888 Q ss_pred 
CcccccccceEeecccc--ccccccCcceEEEEehhcceEeeeccceEEEEeccch---------hhhhhceeeEEEEee Q lcl|Aclame:pro 306 NSYLIKGKQVIVVADRW--LPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGA---------GAFETDTTKIRVIDR 374 (408) Q Consensus 306 ~~~~l~G~pv~~~~~~~--~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~---------~~f~~~~~~~r~~~r 374 (408) .+++|+|+||++++... +++...++..++||||++++ +.++++++++++++.. +.|++|++.+|++.| T Consensus 235 ~~~~l~G~Pv~~~~~i~~~~~~~~~~~~~~~~gD~~~~~-~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~v~~r~~~r 313 (333) T protein:vir:78 235 QTGDVLGLPAQFGRAVGGDLGAAVDSKTRIIGGDFSQLK-FGFADEIRIKMSDTATLTDSGSATVSMWQTNQIAILIEVT 313 (333) T ss_pred CCceeeceeeEEccccCCCccccCCCccEEEEEecccEE-EEEeeccEEEEeccccccccccceeehhhcCcEEEEEEEE Confidence 89999999999865421 22334556789999999854 5689999999988642 469999999999999 Q ss_pred eCcEEecccceEEEEeecccc Q lcl|Aclame:pro 375 FDVKATDSEALVAGSFSAIAD 395 (408) Q Consensus 375 ~d~~v~~~~a~~~l~~~~~~~ 395 (408) +|+++++|+||++|+..+ +| T Consensus 314 ~d~~v~~~~a~~~l~~~~-a~ 333 (333) T protein:vir:78 314 FGWLLGDKQAFVKFVDDE-QP 333 (333) T ss_pred EccEEecccceEEEeccC-CC Confidence 999999999999985433 22 No 97 >protein:vir:99920 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655524;genbank:gi:109392294;genbank:GeneID:4157089 Probab=100.00 E-value=5.2e-52 Score=301.73 Aligned_cols=272 Identities=11% Similarity=0.019 Sum_probs=212.4 Q ss_pred hhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeecccCccceEEeeccCCccccchhcccccccccccccceeee Q lcl|Aclame:pro 116 ETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIK 195 (408) Q Consensus 116 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~ 195 (408) |.+ .+++||++||++++++|++.+++.++|+++|+++++.+... .+|... 
..+.+.|++|++++|+ ++++|++++ T Consensus 1 Mat-~tt~~g~~vP~~~~~~ii~~~~~~s~l~~~~~~i~~~~~~~--~~p~~~-~~~~a~wv~Eg~~~~~-~~~~f~~v~ 75 (311) T protein:vir:99 1 MAT-FGTGNLKNLPRNIADGMVKDVVQGSTVAVLSARKPQRFGNE--DIITFN-GRPKAEFVGEGQQKSS-TTGEFDFVT 75 (311) T ss_pred Cce-ecCCCceeccHHHHHHHHHHHHhhchhhhhcceeeccCCce--EEEEEe-CCceeEEeecCccccc-ccceeeEEE Confidence 444 45678899999999999999999999999999999876544 455544 3457899999999996 558999999 Q ss_pred echheeeeehHHHHHHHh---cchHHHHHHHHHHHHHHHHHHHHHHHhhccccccch--h------------------hh Q lcl|Aclame:pro 196 YLIKRYAGIITATNTSLK---DTAENILAWLSSWIAKKVVVTRNQAIIEVMKAAPKK--P------------------TI 252 (408) Q Consensus 196 ~~~~~~~~~~~iS~ell~---ds~~~~~~~v~~~l~~~~~~~~~~~~~~g~g~~~~~--~------------------~~ 252 (408) ++++|++++++||+||++ |+.++|++||.++|++++++++|.++++|+|++++. . +. T Consensus 76 l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~g~~~g~~~~g~~~~~~~~~~~~~~~~~~~ 155 (311) T protein:vir:99 76 STPKKAQVTMRFNEEVQWADEDYQLGVLQTLSEAGAEALARALDLGLYHRINPLTGTVIPGWSNYLGAASKRVELTADTI 155 (311) T ss_pred EeeEEEEEeehhhHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccCccccccccccccccceeecccccc Confidence 999999999999999994 778899999999999999999999999998754321 0 01 Q ss_pred hhHHHHHHHHHHhhhhh--ccCCCEEEEcHHHHHHHHhhhcccCceeeccccccCCcccccccceEeecccc-------- Q lcl|Aclame:pro 253 AKFDDVITMINTAVDPA--IIATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNSYLIKGKQVIVVADRW-------- 322 (408) Q Consensus 253 ~~~d~i~~~~~~~l~~~--~~~~a~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~l~G~pv~~~~~~~-------- 322 (408) ...+..+..+...+... ......|+|||.++..|++|||++|||+|++...++.+++|+|+||++++... 
T Consensus 156 ~~~~~~i~~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~Pv~~s~~i~~~~~~~~~ 235 (311) T protein:vir:99 156 ANPDLAIEAAVGLLVANGHPTPVNGLALHPSIAWGLSTARYTDGRKKFPELGLGIGVSSFEGIDASVSDTVNGGDEADPD 235 (311) T ss_pred chhHHHHHHHHHHHhhhccCCCccEEEEcHHHHHHHHhhhccCCCeeecCcccCCCCceecceeeEeecccccccccccc Confidence 11222233232222222 12233599999999999999999999999998888888999999999864311 Q ss_pred -ccccccCcceEEEEehhcceEeeeccceEEEEeccch-----hhhhhceeeEEEEeeeCcEEecccceEEEEeecc Q lcl|Aclame:pro 323 -LPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGA-----GAFETDTTKIRVIDRFDVKATDSEALVAGSFSAI 393 (408) Q Consensus 323 -~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~-----~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~ 393 (408) .+...++...+++|||++++.+..+.+++++++++.. +.|++|++.+|++.|+||++.+|+ |++++-+++ T Consensus 236 ~~~~~~~~~~~~~~Gdf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~r~~~r~d~~v~~~~-~v~~~~~~A 311 (311) T protein:vir:99 236 DEDLDAARAVRGIVGDFANGIHWGVQRDIPVELIKYGDPDGQGDLKRHNQIALRLEIVYGWYVFTDR-FVVIENAVA 311 (311) T ss_pred cchhhccCcceEEEeeccccEEEEEecCceEEEeecCCCCcchhhhhcCcEEEEEEEeecceecChh-HeeeecccC Confidence 1112345666899999998888889999999887643 459999999999999999999975 555443333 No 98 >protein:vir:96762 Length: 632 # NCBI annotation: putative phage-related protein # Family: family:all:21 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039818;genbank:gi:126010917;genbank:GeneID:5076272 Probab=100.00 E-value=2.6e-50 Score=292.39 Aligned_cols=368 Identities=12% Similarity=0.114 Sum_probs=225.3 Q ss_pred CCh-------------HHHHHHH--HHHHHHHHHHHHHHHHHHHHHHhhhcccHHHHHH-HHHHHHHHHHH-HHHH---- Q lcl|Aclame:pro 1 MGV-------------KLTVNQL--NEAWIASGDKVTDFNDQINMALNDDNFSAEAMSE-LKNKRDNEKVR-RDAL---- 59 (408) Q Consensus 1 M~~-------------~~~i~el--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~-~~~~---- 59 (408) +.. ++...|. ..++.++.++.. ..+...++..+. .+.++.++ +.+.+...... .... 
T Consensus 207 ~~a~~~~~~~~~a~~~~~~~~E~~r~~eI~~l~~~~~-~~~~~~~ai~~g-~sld~~ra~~ld~l~~~~~a~~~~~~a~~ 284 (632) T protein:vir:96 207 TGAKNPAPAASGANENDILSRERTRISEITAIGQQFS-QRSLAQEAIQKG-HTVDQFRALVLERMNPGQPGNFEKPGAGD 284 (632) T ss_pred hcccccchhhhhhhhhhhhhhhHHHHHHHHHHHHHhh-hhhhHHHHHhcc-ccHHHHHHHHHHHHhhhhhhhhhhhhhhh Confidence 000 0000000 001111111111 001111111111 11122111 11111000000 0000 Q ss_pred ---HHHHHH---HH--HHHhhhccc--ccccccccc--hhhhHHHHHHHHHH-Hhhcch---hhHHHHHHHHhhcccccc Q lcl|Aclame:pro 60 ---REQLVE---AQ--AEQVVNMRE--EEKGPLNKS--ENELKDKFVKDFVN-MVRNPM---AFMNTVSSKTETSGSDSA 123 (408) Q Consensus 60 ---~~~~~~---~~--~~~~~~~~~--~~~~~~~~~--~~~~~~~~~~a~~~-~~~~~~---~~~~~~~~~a~~~~t~~~ 123 (408) ...+.. +. ......... ......... ......+......+ ..+... -.....+.++..++++++ T Consensus 285 ~~~~~~~~~~~~i~~~~re~~~~~l~rai~a~a~~~~~~a~~~~e~a~~~a~~~G~~arg~~~~~~~l~~ra~~~~t~~~ 364 (632) T protein:vir:96 285 LPGKPAIHSARDLGIQHKELQQYSLMRAINAAATGDWSKAGFEREVSLAIADASGKEARGFYMPHEVLVQRQLEKKTAGK 364 (632) T ss_pred hhhhhhhhhhhhhhhhHHHHHHHHHHHHHHhhhccchhhhhhhhHHHHHHHHhhhhhhhhhhhhHHHHHHhhhhcccccc Confidence 000000 00 000000000 000000000 00000000000000 000000 112334567888889999 Q ss_pred Cceecchhh-hhhhhhhhhhhhhhhhh-hceeecccCccceEEeeccCCccccchhcccccccccccccceeeeechhee Q lcl|Aclame:pro 124 AGLTIPQDI-RTMINTLVRQYDSLQQY-VRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRY 201 (408) Q Consensus 124 gg~~vP~~~-~~~ii~~~~~~~~l~~~-~~~~~~~~~~g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~ 201 (408) ||++||+++ ...||+.+++.++++++ +++++ +.+|.+.+|+..+ .+.++|++|++.+++ +.++|++++++++++ T Consensus 365 gg~lvp~~~~~~~iie~lr~~s~i~~l~~~~~~--~~~g~~~ip~~~~-~~~a~wv~E~~~~~~-s~~~f~~i~l~~~k~ 440 (632) T protein:vir:96 365 GGELVATELLSEEFIDILRNKAIIGQMGARMLP--GLVGDVDIPKKTS-GANFYWIGEDEDVQD-SDFDFTTLSFSPKTI 440 (632) T ss_pred cccccccccchHHHHHHHhhcchhhhhcceEee--cCCcceEEEEEeC-CceeEeecCCccccc-cccceeeEEeeeeEE Confidence 999999986 57899999999999887 55554 4456778887764 467899999999997 558999999999999 Q ss_pred 
eeehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhccccccch---------------hhhhhHHHHHHHHHHhh Q lcl|Aclame:pro 202 AGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVMKAAPKK---------------PTIAKFDDVITMINTAV 266 (408) Q Consensus 202 ~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~~g~g~~~~~---------------~~~~~~d~i~~~~~~~l 266 (408) +++++||+|||.|+.++++++|.+.|.++++.++|.++++|+|++..+ .+..+++++.++.. .+ T Consensus 441 ~~~v~iS~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~~~~~~i~~~~~-~i 519 (632) T protein:vir:96 441 AGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTYPAGGVDWASVVDMET-KI 519 (632) T ss_pred EEehhhHHHHHhccchHHHHHHHHHHHHHHHHHHHHHhhcccCCCCccceeeecccccceecccccCCHHHHHHHHH-HH Confidence 999999999999999999999999999999999999999999864321 12234566666653 34 Q ss_pred hhhc--cCCCEEEEcHHHHHHHHh--hhcccCceeeccccccCCcccccccceEeeccccccccccCcceEEEEehhcce Q lcl|Aclame:pro 267 DPAI--IATSSLLTNQSGLNKLAL--VKTAEGKYLLEPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAI 342 (408) Q Consensus 267 ~~~~--~~~a~~~~n~~~~~~l~~--lkd~~G~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~ 342 (408) ...+ ..++.|+||+..+..+++ ++|++|+|+|.+ ++|+|+||++++ .+|. 
+.++||||++++ T Consensus 520 ~~~~~~~~~~~~~~~~~~~~~l~~~~l~d~~G~~i~~~-------~~l~G~pv~~s~--~ip~-----~~~~~gd~s~~~ 585 (632) T protein:vir:96 520 STFNADAGRLAYLTSVTQRGAAKKAQVFDNTGERIWQN-------NEVNGYRAEASN--QIPA-----DTWIFGDWSQIV 585 (632) T ss_pred hhcccccCccEEEEchhHHHHHHHHhccCCCCceeecC-------CeecccceEecc--cccc-----CcEEEeecceEE Confidence 4443 457789999998877765 789999999964 489999998853 3443 458999999865 Q ss_pred EeeeccceEEEEeccchhhhhhceeeEEEEeeeCcEEecccceEEEEeec Q lcl|Aclame:pro 343 TLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFSA 392 (408) Q Consensus 343 ~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~ 392 (408) +.++++++|.++++.+ |.+|++.||+++|+|+++++|+||++++.++ T Consensus 586 -i~~~~~~~i~~~~~~~--~~~~~v~~~~~~~~d~~v~~~~af~~~k~~A 632 (632) T protein:vir:96 586 -IAMWGVLDLKVDPYTK--AASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) T ss_pred -EEEecceEEEEccccc--cccCceEEEEEeecCceeechhhhhheeecC Confidence 5679999999998764 8899999999999999999999999999988 No 99 >protein:vir:97397 Length: 517 # NCBI annotation: major capsid protein # Family: family:all:11745 # MgeID: mge:1675 # MgeName: Q54 # Cross-refs: genbank:acc:YP_762590;genbank:gi:115304291;genbank:GeneID:5130600 Probab=100.00 E-value=6e-41 Score=241.08 Aligned_cols=369 Identities=14% Similarity=0.128 Sum_probs=207.4 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccHHH---HHHHHHHHHHH-HHHHHHHHHHHHHHHHHHhhhccc Q lcl|Aclame:pro 1 MGVKLTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEA---MSELKNKRDNE-KVRRDALREQLVEAQAEQVVNMRE 76 (408) Q Consensus 1 M~~~~~i~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~ 76 (408) ++-...|..+++.......++.+..+.+++.....+...++ ++++.+++.+. ....+.+..++...+...... 
T Consensus 124 a~~~a~I~~vke~~~~e~~~~~~~~a~~ee~~e~~~k~~el~a~l~~~~~~~~~~~~e~~~~l~a~~~~~~~~~~~~--- 200 (517) T protein:vir:97 124 SNKNAVVTYFREEKKKEENKMTFDQNLMQELLDAKKLAADLNAKLKERENGGDNAALKTVSELAANLMKQRESEKIL--- 200 (517) T ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHhhhhhhhhhHHHHHHhhhhc--- Confidence 44433333333222211111111111111111110001111 11111111110 111111111222111111100 Q ss_pred ccccccccchhhhHHHHHHHHHHHh--hcchhhHHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhceee Q lcl|Aclame:pro 77 EEKGPLNKSENELKDKFVKDFVNMV--RNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVES 154 (408) Q Consensus 77 ~~~~~~~~~~~~~~~~~~~a~~~~~--~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~ 154 (408) .... ........++...+.... ..+...................+|+++|+.+...|...+...++++.++++.+ T Consensus 201 -~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~i~~~~~~~~ 277 (517) T protein:vir:97 201 -GVEA--LKVTPEATEFLKTREAEVAYMSASLTKDPKAAWTAELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHEN 277 (517) T ss_pred -cccc--ccccchhhHHHHHHHHHHHHHHhcccccccceeeeecccccccccccchHHHHHHHHhhhhhccceeeeeecc Confidence 0000 000000111111111000 00000000000011112234457899999999999999999988888777655 Q ss_pred cccCccceEEeeccCCccccchhcccccccccccccceeeeechheeeeehHHHHHHHhcchHH----HHHHHHHHHHHH Q lcl|Aclame:pro 155 VSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAEN----ILAWLSSWIAKK 230 (408) Q Consensus 155 ~~~~~g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~~~----~~~~v~~~l~~~ 230 (408) +.. ..++. 
......+.|+.||+..|+ +.++|+++++.++++++++++|++||+|+.++ |++||.++|+++ T Consensus 278 i~~----~~~~~-~~~~~~a~~~~eG~~kp~-s~~tf~~~~~~~~~ia~~~~~S~qll~Ds~~dd~~~l~s~i~~~l~~~ 351 (517) T protein:vir:97 278 LPT----LVVGG-DNALTQGTGHTTGTDKTE-SNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDM 351 (517) T ss_pred ccc----eeeec-ccccceeeeeecCCcccc-cccceeeEEeeHhhhhhhhhhhHHHHHHhhhccHHHHHHHHHHHHHHH Confidence 432 22222 223345578999999986 45799999999999999999999999998887 999999999999 Q ss_pred HHHHHHHHHhhccccccchh------------hhhhHH---HHHHHHHHhhhhhccCCCEEEEcHHHHHHHHhhhcccCc Q lcl|Aclame:pro 231 VVVTRNQAIIEVMKAAPKKP------------TIAKFD---DVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEGK 295 (408) Q Consensus 231 ~~~~~~~~~~~g~g~~~~~~------------~~~~~d---~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~lkd~~G~ 295 (408) ++++++.+|++|+|++.... +....+ +++..+...+.. ..++.|||||.+|++|++|||++|| T Consensus 352 l~~~ee~a~l~GdGtg~~~~gi~~~a~~~~~~~~~~~~~~~d~i~~l~~a~~~--a~~a~~vmn~~t~~~I~klKD~~G~ 429 (517) T protein:vir:97 352 VIMAVNRAIIMGGVTGVSETQIYPVVGDAWATNVTGTTNIQELLEKLSVATPK--AADSTLVIHRNDLAAIRFLKDKNGN 429 (517) T ss_pred HHHHHHHHHhcccCCCcccccccccccccccccccccchHHHHHHHHHHHhhh--ccCCEEEECHHHHHHHHHhhcCCCC Confidence 99999999999998763211 111223 333333333222 2478899999999999999999999 Q ss_pred eeeccccccCCcccccccceEeeccccccccccCcceEEEEehhcceEeeeccceEEEEeccchhhhhhceeeEEEEeee Q lcl|Aclame:pro 296 YLLEPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRF 375 (408) Q Consensus 296 ~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~ 375 (408) |+|++...++.+.+++|..-. +|....+. ..+++++. |.+..+.++.+..+. 
.+.+|+..|+..+|+ T Consensus 430 Yl~~~~~~~~~~~~l~G~~~~------~~~~~~~~--~~~~~~~~-y~i~~~~g~~~~~~f----d~~~n~~~f~~~~~~ 496 (517) T protein:vir:97 430 YVFPVGVSNQTIATHFGFNRL------VQSVAVDE--KTAVSLSG-YVTNGSRGMEFEQGT----ILVENNKEYLFEMPI 496 (517) T ss_pred eeccCcCCcccccccCCcccc------ccccccCc--eeEeeccc-cEEEeecceeeeeee----ecccCceeEeeeeee Confidence 999988888888999985211 22222222 23444554 555667777654331 145788999999999 Q ss_pred CcEEecccceEEEEeeccccC Q lcl|Aclame:pro 376 DVKATDSEALVAGSFSAIADQ 396 (408) Q Consensus 376 d~~v~~~~a~~~l~~~~~~~~ 396 (408) ++.|+.|++|+++.+.+.+.- T Consensus 497 ~g~i~~~~r~a~~~~~p~~~~ 517 (517) T protein:vir:97 497 SGSLEYKGTTAYGTYTPPVAG 517 (517) T ss_pred ccccccccceEEEEEcCCCCC Confidence 999999999998876665443 No 100 >protein:vir:4159 Length: 315 # NCBI annotation: structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046968;genbank:gi:9630538;genbank:GeneID:1261712 Probab=100.00 E-value=1.6e-41 Score=244.19 Aligned_cols=283 Identities=11% Similarity=0.048 Sum_probs=207.7 Q ss_pred HHHHHHHHHHhhcchhhHHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeec-ccCccceEEeeccC Q lcl|Aclame:pro 91 DKFVKDFVNMVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESV-STSNGSRVYEKWTD 169 (408) Q Consensus 91 ~~~~~a~~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~-~~~~g~~~~~~~~~ 169 (408) -.. -+.++.+. .. ...++.++ ++.+||+++|... .++++.+.+.++++++|++++. .+..+. ++.... 
T Consensus 1 ~~~----~~~~~~~~--~~-~~~k~~t~-~d~~Gg~l~P~~~-~~~i~~~~e~s~~l~~~~vi~~~~~~~~~--i~~~g~ 69 (315) T protein:vir:41 1 MLT----IEDIRGGK--PF-EIVPKIDV-PDLGRGVLSVDRF-GEFVKAVRDSAVIIPEARIDNALKSYEKD--ISRLSL 69 (315) T ss_pred Ccc----cchhhcCC--hh-hhhhhcCC-cCCCCceechHHH-HHHHHHHHhhhhhhhhceeeecccccccc--cccccc Confidence 000 01112221 11 12344443 5668899898886 4688999999999999998754 333333 332211 Q ss_pred ---CccccchhcccccccccccccceeeeechheeeeehHHHHHHHhcchH--HHHHHHHHHHHHHHHHHHHHHHhhccc Q lcl|Aclame:pro 170 ---VTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAE--NILAWLSSWIAKKVVVTRNQAIIEVMK 244 (408) Q Consensus 170 ---~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~~--~~~~~v~~~l~~~~~~~~~~~~~~g~g 244 (408) ......|.+|.+..++ +.++|+++.+.++++...+.||+++|+|+.+ +|++||...+++++++.++.++++|+| T Consensus 70 ~~~~~~g~~~~~~~~~~~~-~~~~f~~~~l~~~~l~~~~~it~elL~D~~~~~~~e~~l~~~~a~~~a~~~~~~~~nGdg 148 (315) T protein:vir:41 70 VLDVGPGRDETGQKLAPPE-STAEVKTNTLYMREMVTKVVIHEDAIEDNIEGKAFEQKIVTLLGEGISYVLEKYYLHGDT 148 (315) T ss_pred CcccccccccccCcCCCCC-CccccceeeeceeeeeeeccccHHHHHhhhccccHHHHHHHHHHHHHHHHHHHHhhccCC Confidence 1122357778777664 6689999999999999999999999999964 999999999999999999999999988 Q ss_pred cccch------h-----------------hhhhHHHHHHHHHHhhhhhcc---CCCEEEEcHHHHHHHHhhhcccCceee Q lcl|Aclame:pro 245 AAPKK------P-----------------TIAKFDDVITMINTAVDPAII---ATSSLLTNQSGLNKLALVKTAEGKYLL 298 (408) Q Consensus 245 ~~~~~------~-----------------~~~~~d~i~~~~~~~l~~~~~---~~a~~~~n~~~~~~l~~lkd~~G~~~~ 298 (408) +...+ + +.....+.+..+.+.+++.|+ .+++|+||++++..|+++||++|+|+| T Consensus 149 ~s~~p~~~~~~G~l~~a~~~~~~~~~~~~a~~~~~d~l~~l~~sl~~~yr~~~~~~~~imn~~t~~~~rklk~~~g~~lw 228 (315) T protein:vir:41 149 SSSDPLLRMSDGWLKLASEKLTESDVDPEAEDWPMNLFDTMIESLPTPYRNNLPNMKFYVTWDIYRAYRDALKGRETGLG 228 (315) T ss_pred cCcCccccccccceecccccccccccccccccccHHHHHHHHHhcChHHhhcCCceEEEEcHHHHHHHHHHhccCCCccc Confidence 53110 0 001112344445567999997 467899999999999999999999999 Q ss_pred 
ccccccCCcccccccceEeeccccccccccCcceEEEEehhcceEeeeccceEEEEeccchhhhhhceeeEEEEeeeCcE Q lcl|Aclame:pro 299 EPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVK 378 (408) Q Consensus 299 ~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~ 378 (408) ++.+..+.+.+|+|+||..+++ +|....++..++||||+++ .+..+.+++++.+.+. ..+.+.|.+..|+|+. T Consensus 229 ~~~~~~g~~~tl~G~PV~~~~~--m~~~~~~~~~ilf~d~~nl-~~~~~~~i~i~~~~~a----~~~~~~~~~~~r~d~~ 301 (315) T protein:vir:41 229 DQALTGANSILYDGRPVQYVPA--LEALNDGKSRALFVVPTQL-VYGFWRNIKVVPDYDA----EMRLTKYVASLRTDNH 301 (315) T ss_pred cchhhcCCCceecccceEeccc--ccccCCCCccEEEecccce-EEEeccccEEEeeecC----CCCceEEEEEEEecee Confidence 9999999999999999988654 7777778889999999985 4467888888876543 3566778888999999 Q ss_pred EecccceEEEEeec Q lcl|Aclame:pro 379 ATDSEALVAGSFSA 392 (408) Q Consensus 379 v~~~~a~~~l~~~~ 392 (408) +.++++.++..++- T Consensus 302 ~~~~~~~a~~~~~v 315 (315) T protein:vir:41 302 YEDEEGAVSATITV 315 (315) T ss_pred EEeccceeEeeeeC Confidence 88877633333333 No 101 >protein:vir:4197 Length: 314 # NCBI annotation: putative structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071822;genbank:gi:11863105;genbank:GeneID:1257607 Probab=100.00 E-value=2.4e-41 Score=243.27 Aligned_cols=283 Identities=12% Similarity=0.034 Sum_probs=217.3 Q ss_pred HHHhhcchhhHHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeecccCccceEEeeccCC---cccc Q lcl|Aclame:pro 98 VNMVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDV---TPLT 174 (408) Q Consensus 98 ~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~---~~~~ 174 (408) ++++++-. ...++.++ ++.+||+++|.++ .++++.+++.++++++++++++.+ +++..++....+ .+.. 
T Consensus 1 ~~~~~~~~-----~~~k~it~-~d~~gG~L~P~~~-~~~i~~l~e~s~i~~~a~vi~t~~-s~~~~i~~i~~g~~~~~~~ 72 (314) T protein:vir:41 1 MDFLNKPF-----QITPKIDV-PDLGKGILAVQRF-GEFVREVRENSAIIKDARVLNALK-SYEVDISRISLGVELEPGR 72 (314) T ss_pred CchhhhHH-----Hhhccccc-ccCCCceeChHHH-HHHHHHHHhccchhhheeeecccC-ccceeecccccCccccccc Confidence 22333211 12344443 5667999999987 479999999999999999886422 234455544322 2344 Q ss_pred chhcccccccccccccceeeeechheeeeehHHHHHHHhcchH--HHHHHHHHHHHHHHHHHHHHHHhhccccccc---- Q lcl|Aclame:pro 175 VMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAE--NILAWLSSWIAKKVVVTRNQAIIEVMKAAPK---- 248 (408) Q Consensus 175 ~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~~--~~~~~v~~~l~~~~~~~~~~~~~~g~g~~~~---- 248 (408) .|.+|....++ +.++|+++.|.++++...++||+|+|+|+.+ +|++||...+++++++.++..+++|+|+... T Consensus 73 ~~~~~~~~~~~-~~~tf~~~~l~~~kl~~~v~is~e~L~D~a~~~~le~~i~~~~Ae~~g~~~~~~~~nGdg~~~s~~~~ 151 (314) T protein:vir:41 73 NTSGTKVAPTA-DEVTVSTNTLEMKELVTKVVLEDEALEDNIEQSAFEQTITSLLASGVTYDLECFFLHADSSLTTGREL 151 (314) T ss_pred ccccCCccCCc-ccccccceeeeeEEEEEeecccHHHHHhhhchhhHHHHHHHHHHHHHHHHHHHHhhccccCCcCcccc Confidence 56677666664 6689999999999999999999999999975 9999999999999999999999999985311 Q ss_pred -------------------hhhhhhHHHHHHHHHHhhhhhccC---CCEEEEcHHHHHHHHhhhcccCceeeccccccCC Q lcl|Aclame:pro 249 -------------------KPTIAKFDDVITMINTAVDPAIIA---TSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPN 306 (408) Q Consensus 249 -------------------~~~~~~~d~i~~~~~~~l~~~~~~---~a~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~ 306 (408) ..+....++.+..+.+.|++.|+. +++|+||+.++.+++++++.+|+++|++.+..+. 
T Consensus 152 ~~~p~G~l~~a~~~~~~~~~~~~~~~~~~~~~l~~sl~~~yr~~~~~~~~~m~~~t~~~~r~~l~~~~~~l~~~~~~~~~ 231 (314) T protein:vir:41 152 YRINDGWMKLAGNQYTDAEPEDENWPLNLFDGMMDELDTRYLQLKPRMKFYVSNEIYNGYRKQLLVRETGLGDSALIGAT 231 (314) T ss_pred hhcchhhhhhcccceeecCccccccHHHHHHHHHHhcCchhhcCCCceEEEecHHHHHHHHHHHhccCCcccchhhhCCC Confidence 011223455666666789999975 6689999999999999999999999999999999 Q ss_pred cccccccceEeeccccccccccCcceEEEEehhcceEeeeccceEEEEeccchhhhhhceeeEEEEeeeCcEEecccceE Q lcl|Aclame:pro 307 SYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALV 386 (408) Q Consensus 307 ~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~ 386 (408) +.+|+|+||+.++ .+|....++.+|+||||+++ ++..+..+++..+.+ ..++++.|.+..|+|+.+.++.|.+ T Consensus 232 ~~~l~G~PV~~~~--~~~~~~~~~~~i~fgd~~nl-v~~~~~~ir~~~~~~----a~~~~~~~~~~~r~d~~~~~~~aa~ 304 (314) T protein:vir:41 232 GLQYDGIPIQYVP--ALDALGDDKARALLTVPTNL-VYGFWRNIRIEPKRD----AAMRRTEYIASLRADCNYEDENAAV 304 (314) T ss_pred CceecceeeEecc--cccccCCCCceEEEechhhe-EEEeeceeEEeeccc----CcCCeEEEEEEEEeceEEEEcCcEE Confidence 9999999998865 46777888999999999985 445566666655443 3588999999999999999998877 Q ss_pred EEEeeccccC Q lcl|Aclame:pro 387 AGSFSAIADQ 396 (408) Q Consensus 387 ~l~~~~~~~~ 396 (408) +..+.....- T Consensus 305 ~~~~~~~~~~ 314 (314) T protein:vir:41 305 AAVIDMSSGG 314 (314) T ss_pred EEEeeccCCC Confidence 7655544332 No 102 >protein:vir:4074 Length: 480 # NCBI annotation: major capsid (head) protein # Family: family:all:11745 # MgeID: mge:85 # MgeName: c2 # Cross-refs: genbank:acc:NP_043553;genbank:gi:9628687;genbank:GeneID:1261180 Probab=100.00 E-value=3.6e-38 Score=225.85 Aligned_cols=352 Identities=10% Similarity=0.057 Sum_probs=191.9 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccc Q lcl|Aclame:pro 1 MGVKLTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKG 80 (408) Q Consensus 1 
M~~~~~i~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (408) ++-...+..+++.-...... ...++.++...+. .+.+....++.++++++...+++........... . T Consensus 111 a~~~a~v~~vks~~~~~e~~--~~~~e~~e~~~e~-------~e~~~~~~el~akl~el~k~~ee~k~~~~~~~~~--~- 178 (480) T protein:vir:40 111 SNKGAKVTKVREENKGEQEQ--MGANETQEIMKQA-------IEAGVKVRELEAKVEELNKEREELKKEREASIPS--E- 178 (480) T ss_pred cchhhhhhhhhhhhhhhhhh--hhhHHHHHHHHhh-------hhhhhhhhhHHHHHHHHHhHHHHHhhhhhhhccc--c- Confidence 55555554443321110000 0000000100000 0111111222223333322222221111111000 0 Q ss_pred ccccchhhhHHHHHHHHHHHhhcchhh--HHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeecccC Q lcl|Aclame:pro 81 PLNKSENELKDKFVKDFVNMVRNPMAF--MNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTS 158 (408) Q Consensus 81 ~~~~~~~~~~~~~~~a~~~~~~~~~~~--~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~ 158 (408) ..........+++..+++..... ......+. ..+...+++. +|+.+...+........++...+..... T Consensus 179 ----~~~~~~~~e~r~~~~~~~~~~e~~~~~~~~~~~-~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~--- 249 (480) T protein:vir:40 179 ----KPEDAERKFMRELGSKMAEMPEQGFLREFANGA-DLNVVNSLGS-ITSKYARKSGIYDGAMKARFQGLTLAED--- 249 (480) T ss_pred ----chhhhhhHHHHHHHHHhccchhhhhhhhhhhhc-cccccccccc-cccchhhheeechhhhhhhhhcceeeec--- Confidence 00111112233333333322111 11111111 2222333444 5555555555445555554443332211 Q ss_pred ccceEEeeccCCccccchhccccccccccc-ccceeeeec---hheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 159 NGSRVYEKWTDVTPLTVMDAEDGKIPDLDN-PQLTIIKYL---IKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVT 234 (408) Q Consensus 159 ~g~~~~~~~~~~~~~~~~~~E~~~~~~~~~-~~f~~v~~~---~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~ 234 (408) +.....|++|+...+.+.. .++.+..+. +++++.+...|+++|+|+. 
+|++||.++|+++++.+ T Consensus 250 -----------g~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~v~~l~~~~k~t~~lLDDa~-~l~~~i~~~l~~~~~~~ 317 (480) T protein:vir:40 250 -----------GVDDTFISGTFKAGTDKNKSQTATKRSLRPQMAEAYLQMDKATVRGVNDSG-ALSEYVMSEMVNRVIQK 317 (480) T ss_pred -----------cccceeeeeeeecccccccccccccchhhHHHHHHHHHhHHHHHHHhhhhH-HHHHHHHHHHHHHHHHH Confidence 1112245555544333221 133344444 5788888999999999987 79999999999999999 Q ss_pred HHHHHhhccccccc-----------hhhhhhHHHHHHHHHHhhhhhccCCC-EEEEcHHHHHHHHhhhcccCceeecccc Q lcl|Aclame:pro 235 RNQAIIEVMKAAPK-----------KPTIAKFDDVITMINTAVDPAIIATS-SLLTNQSGLNKLALVKTAEGKYLLEPDP 302 (408) Q Consensus 235 ~~~~~~~g~g~~~~-----------~~~~~~~d~i~~~~~~~l~~~~~~~a-~~~~n~~~~~~l~~lkd~~G~~~~~~~~ 302 (408) ++.+|++|+|++.. .+.....+++++.|..++...|+.++ .|||||.+|+.|++|||++|+|+|+|.+ T Consensus 318 ee~a~l~G~g~g~~~~~g~~~~~~~~~~~~~~~d~id~L~~al~~~y~~~a~~~vmn~~t~~~I~klKD~~G~Yi~q~~~ 397 (480) T protein:vir:40 318 VEYNMILGSVDGSNGFYGLKTATDGWTKQIEYTDLFEGITDAVAECSISDAITIVMSPQTFAELRKAKGTDGHSRFNELA 397 (480) T ss_pred HHHHhhccCCCCccccccceeecccccccchhHHHHHHHHHhhhHHhhCCCCEEEECHHHHHHHHHhhcCCCCeeccCcc Confidence 99999999665422 11223456677766677889998888 5999999999999999999999999999 Q ss_pred ccCCcccccccceEeeccccccccccCcceEEEEehhcceEeeeccceEEEEeccchhhhhhceeeEEEEeeeCcEEecc Q lcl|Aclame:pro 303 TKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDS 382 (408) Q Consensus 303 ~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~ 382 (408) +.+.+.+|+|+||++++. .+|. +...+|.++.|+.+++++ ++.. 
....+..++..++++.|+++.+..| T Consensus 398 ~~~~~~~llG~pvv~~~~-~~~~-----~~~~~~~~~~~~~~~d~~-~~~~----~~~~~~~~~~~~~~e~~v~g~~~~~ 466 (480) T protein:vir:40 398 TKEQIAQSFGAVNLETRV-WMPK-----DEVAVYNHDEYVLIGDLN-VENY----NDFDLRYNVEQWLSETLVGGSIRGK 466 (480) T ss_pred cccCcceecccceeeeec-cccC-----CcceeeeCCccEEEEecc-ccee----cccccccchhhhhhhhhhceeeEcc Confidence 999999999999976532 2222 112344445566666543 2221 1112457788899999999999999 Q ss_pred cceEEEEeeccccC Q lcl|Aclame:pro 383 EALVAGSFSAIADQ 396 (408) Q Consensus 383 ~a~~~l~~~~~~~~ 396 (408) +||+.++.+..=-- T Consensus 467 ~~~~~~~~~~~~~~ 480 (480) T protein:vir:40 467 NRSAYLKKKGSLGV 480 (480) T ss_pred ccEEEEEeccCcCC Confidence 99999987753222 No 103 >protein:vir:3158 Length: 321 # NCBI annotation: capsid protein gpE # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665929;genbank:gi:22091115;genbank:GeneID:951342 Probab=100.00 E-value=9.3e-35 Score=207.15 Aligned_cols=292 Identities=11% Similarity=-0.001 Sum_probs=207.1 Q ss_pred HHHHHHHHHhhcchhhHHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeecccCccceEEeeccCCc Q lcl|Aclame:pro 92 KFVKDFVNMVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVT 171 (408) Q Consensus 92 ~~~~a~~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~ 171 (408) ..++.|.++++. ...+.....++.++|++||+++..+|++.+.+.++++++++++++....+.++.. 
..+ T Consensus 1 ~~~k~~~~~l~~-------~~~~~~~~~~~~~~g~~v~~~~~~~l~~~i~e~s~~l~~i~v~~v~~~~~~i~~~--~~~- 70 (321) T protein:vir:31 1 MASRTINNDLSR-------ITEKNALTVDDLDAGGTLPDPLWDEFWTDMIEETPLLDAIRTETVGAKKTRIPTL--NIG- 70 (321) T ss_pred CchHHHHHHHHH-------HHHhccccccccCCcceeCHHHHHHHHHHHHHhhhhhhhceeeeccCcceeeeee--ccC- Confidence 233344444332 1222233345677889999999999999999999999999999998777765533 222 Q ss_pred cccchhc-ccccccccccccceeeeechheeeeehHHHHHHHhcch--HHHHHHHHHHHHHHHHHHHHHHHhhccccccc Q lcl|Aclame:pro 172 PLTVMDA-EDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTA--ENILAWLSSWIAKKVVVTRNQAIIEVMKAAPK 248 (408) Q Consensus 172 ~~~~~~~-E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~--~~~~~~v~~~l~~~~~~~~~~~~~~g~g~~~~ 248 (408) +...|++ |+......+.++|+++++.++++...++||+|+|+|+. ++|+++|.+.++++++..++..+++|+|.+.+ T Consensus 71 ~~~~~~~~e~~~~~~~~~~~~~~~~~~~~k~~~~~~it~e~L~d~a~~~d~e~~i~~~ia~~~a~~~~~~~~nGd~~~~~ 150 (321) T protein:vir:31 71 ERHRRPQDEGEWNENESDVSTGTIDISTEKATVAWDLPREVVQENPEGEALADRILNLMTDAWSADVEDLAANGDEDAED 150 (321) T ss_pred CcccccccccccccccccceeeeeeeeeEEEEeehhccHHHHHhhhcchhHHHHHHHHHHHHHHHHHHhheeeccccCCC Confidence 2335665 33333334668999999999999999999999999985 59999999999999999999999999986543 Q ss_pred h----------------------hhhhhHHHHHHHHHHhhhhhcc--CCCEEEEcHHHHHHHHh-hhcccCceeeccccc Q lcl|Aclame:pro 249 K----------------------PTIAKFDDVITMINTAVDPAII--ATSSLLTNQSGLNKLAL-VKTAEGKYLLEPDPT 303 (408) Q Consensus 249 ~----------------------~~~~~~d~i~~~~~~~l~~~~~--~~a~~~~n~~~~~~l~~-lkd~~G~~~~~~~~~ 303 (408) . ....+.+.+.++ ...+++.|+ ++.+|+||++++..++. ++|. +.++|.+.+. 
T Consensus 151 ~~~~~n~G~l~~a~~~~~~~~~~~~~~~~d~l~~l-~~~l~~~yr~~~~~v~im~~~~~~~~~~~l~~~-~~~~~~~~l~ 228 (321) T protein:vir:31 151 SFENQNDGFITVAEGDVETIDAADDILDNDLVIRT-IAGLDSKYRARMNPALIVSEDQLLSYHYTLTDR-DTPLGDNVIM 228 (321) T ss_pred cccccchhhhhhhccccccccccccccCHHHHHHH-HHhccHhHhcCCCeEEEechHHHHHHHHHHhcC-CCccccchhh Confidence 1 111234555544 466888887 46689999999887765 5665 4578888888 Q ss_pred cCCcccccccceEeeccccccccccCcceEEEEehhcceEeeeccceEEEEeccchhh-hhhceeeEEEEeeeCcEEecc Q lcl|Aclame:pro 304 KPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGA-FETDTTKIRVIDRFDVKATDS 382 (408) Q Consensus 304 ~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~-f~~~~~~~r~~~r~d~~v~~~ 382 (408) ++.+.+|+|+||+.+++ +|. +.++|+||++++.. .+.++++++....... ...+.+......++|+.+.++ T Consensus 229 ~~~~~tl~G~pvv~~~~--mP~-----~~il~t~~~nl~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ve~~ 300 (321) T protein:vir:31 229 GEADVNPFSFPIIGSGL--WPD-----DKAMFTDPQNLIYA-LYRDLEIDVLTESDKVSERDLHARYFMRGDDDFAIENT 300 (321) T ss_pred ccccccccceeEEEcCC--CCC-----CcEEEeccccEEEE-EeeccEEEEeecCccccccceeeEeeeeeecceeEecc Confidence 88888999999998653 554 45899999996543 4677777765443211 123334444555689999999 Q ss_pred cceEEEEeeccccCCCCccCC Q lcl|Aclame:pro 383 EALVAGSFSAIADQVGNFKTT 403 (408) Q Consensus 383 ~a~~~l~~~~~~~~~~~~~~~ 403 (408) +|++.++-...+-....-.++ T Consensus 301 ~a~a~~~~i~~~~~~~~~~~~ 321 (321) T protein:vir:31 301 EAVVLAEGLGDPLEHLEEETS 321 (321) T ss_pred ccEEEEecCCcchhcccCCCC Confidence 999999854433222222222 No 104 >protein:vir:3033 Length: 272 # NCBI annotation: major capsid protein # Family: family:all:522 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438146;genbank:gi:16271809;genbank:GeneID:929235 Probab=99.94 E-value=1.1e-28 Score=173.82 Aligned_cols=266 Identities=14% Similarity=0.135 Sum_probs=203.8 Q ss_pred hhccccccCceecchhhhhhhhhhhhhhhhhhhhhceee-cccCcc-ceEEeeccCCccccchhccccccccccccccee Q lcl|Aclame:pro 116 
ETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVES-VSTSNG-SRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTI 193 (408) Q Consensus 116 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~-~~~~~g-~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~ 193 (408) |..+++..+..++|+.++..+++.+.+.+.+.+++.+-. ..+..| .+.+|... ..+.+.|++||+.++. +.++++. T Consensus 1 MA~~~T~~~~~~iPev~s~~v~~~~~~~~~~~~~~~~~~~~~g~~G~tv~iP~~~-~~~~a~~v~eg~~i~~-~~~~~~~ 78 (272) T protein:vir:30 1 MAVGTTKMAQMLDPEVLADMIDAEVGKAIRFAPLAEVDTTLEGQPGTTLTVPKWD-YIGDAEDVAEGEAIPM-TQLGFKK 78 (272) T ss_pred CCCccccchheechHHHHHHHHHHHHHHhhhhccccccccccCCCCCEEEEEEec-CCCCcccccCCCcccc-cccccce Confidence 665556667799999999999999999988888776533 222223 46677664 3567789999999984 6689999 Q ss_pred eeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhcccccc-chhhhhhHHHHHHHHHHhhhhhccC Q lcl|Aclame:pro 194 IKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVMKAAP-KKPTIAKFDDVITMINTAVDPAIIA 272 (408) Q Consensus 194 v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~~g~g~~~-~~~~~~~~d~i~~~~~~~l~~~~~~ 272 (408) +++.+++++...++|+++..++..++.+++.+++++++++.+|..++....+.. ...+..+++++.+++. .+...+.. T Consensus 79 ~~~~~~~~~~~~~itd~~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~a~~~~~~~~t~d~i~da~~-~l~~~~~~ 157 (272) T protein:vir:30 79 TTMTIKKAGKGVEITDEAILSGYGDPVGQAAKQIVEAIDHKVDADVLDALSKSTQTVEATATVDGVSKALD-IFNDEDDA 157 (272) T ss_pred EEEEeeeeeeeeeecHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCHHHHHHHHH-HHhccCCC Confidence 999999999999999999999999999999999999999999999998765433 3345567888888764 46666777 Q ss_pred CCEEEEcHHHHHHHHhhhccc--C-ceeeccccccCCcccccccceEeeccccccccccCcceEEEEehhcceEeeeccc Q lcl|Aclame:pro 273 TSSLLTNQSGLNKLALVKTAE--G-KYLLEPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDREN 349 (408) Q Consensus 273 ~a~~~~n~~~~~~l~~lkd~~--G-~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~ 349 (408) ...|+|||.++..|++.+..+ + .....+.+.++..++|+|+||+++++ +|. +.+++.+.. 
++..+.+.+ T Consensus 158 ~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~~g~ig~i~G~~Vi~s~~--~p~-----~t~~~~~~~-a~~~~~~~~ 229 (272) T protein:vir:30 158 ETVIVMNPADASTLRLDAAKEWLGATEVGANRVVSGVYGEVLGVQIVRSRK--CPK-----GTAYMVRKG-ALRIMLKRN 229 (272) T ss_pred ccEEEEcHHHHHHHHHhccccccccccccccccccccchhhcCeeEEEcCC--CCc-----ceEEEEcCC-eEEEEecCC Confidence 889999999999998764221 1 11222334455557899999998654 442 335655555 456667888 Q ss_pred eEEEEeccchhhhhhceeeEEEEeeeCcEEecccceEEEEeeccccC Q lcl|Aclame:pro 350 MSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFSAIADQ 396 (408) Q Consensus 350 ~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~~ 396 (408) .+++.+++. .++...++...|+++++.+|++++++++++++.. T Consensus 230 ~~ve~~r~~----~~~~~~i~~~~~~~~~v~~~~~vv~~t~~~a~~~ 272 (272) T protein:vir:30 230 TMVETDRDI----TKAINQIVANKHYGVYLYKAEKAVKITLKDAAKK 272 (272) T ss_pred ceeeecccc----ccceeEEEEEEEEEEEEEcCCceEEEEecccccC Confidence 888766543 4567889999999999999999999999988888 No 105 >protein:vir:9820 Length: 272 # NCBI annotation: putative major capsid/head protein # Family: family:all:522 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795582;genbank:gi:28876339;genbank:GeneID:1257858 Probab=99.94 E-value=1.1e-28 Score=173.82 Aligned_cols=266 Identities=14% Similarity=0.135 Sum_probs=203.8 Q ss_pred hhccccccCceecchhhhhhhhhhhhhhhhhhhhhceee-cccCcc-ceEEeeccCCccccchhccccccccccccccee Q lcl|Aclame:pro 116 ETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVES-VSTSNG-SRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTI 193 (408) Q Consensus 116 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~-~~~~~g-~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~ 193 (408) |..+++..+..++|+.++..+++.+.+.+.+.+++.+-. ..+..| .+.+|... ..+.+.|++||+.++. +.++++. 
T Consensus 1 MA~~~T~~~~~~iPev~s~~v~~~~~~~~~~~~~~~~~~~~~g~~G~tv~iP~~~-~~~~a~~v~eg~~i~~-~~~~~~~ 78 (272) T protein:vir:98 1 MAVGTTKMAQMLDPEVLADMIDAEVGKAIRFAPLAEVDTTLEGQPGTTLTVPKWD-YIGDAEDVAEGEAIPM-TQLGFKK 78 (272) T ss_pred CCCccccchheechHHHHHHHHHHHHHHhhhhccccccccccCCCCCEEEEEEec-CCCCcccccCCCcccc-cccccce Confidence 665556667799999999999999999988888776533 222223 46677664 3567789999999984 6689999 Q ss_pred eeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhcccccc-chhhhhhHHHHHHHHHHhhhhhccC Q lcl|Aclame:pro 194 IKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVMKAAP-KKPTIAKFDDVITMINTAVDPAIIA 272 (408) Q Consensus 194 v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~~g~g~~~-~~~~~~~~d~i~~~~~~~l~~~~~~ 272 (408) +++.+++++...++|+++..++..++.+++.+++++++++.+|..++....+.. ...+..+++++.+++. .+...+.. T Consensus 79 ~~~~~~~~~~~~~itd~~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~a~~~~~~~~t~d~i~da~~-~l~~~~~~ 157 (272) T protein:vir:98 79 TTMTIKKAGKGVEITDEAILSGYGDPVGQAAKQIVEAIDHKVDADVLDALSKSTQTVEATATVDGVSKALD-IFNDEDDA 157 (272) T ss_pred EEEEeeeeeeeeeecHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCHHHHHHHHH-HHhccCCC Confidence 999999999999999999999999999999999999999999999998765433 3345567888888764 46666777 Q ss_pred CCEEEEcHHHHHHHHhhhccc--C-ceeeccccccCCcccccccceEeeccccccccccCcceEEEEehhcceEeeeccc Q lcl|Aclame:pro 273 TSSLLTNQSGLNKLALVKTAE--G-KYLLEPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDREN 349 (408) Q Consensus 273 ~a~~~~n~~~~~~l~~lkd~~--G-~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~ 349 (408) ...|+|||.++..|++.+..+ + .....+.+.++..++|+|+||+++++ +|. +.+++.+.. 
++..+.+.+ T Consensus 158 ~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~~g~ig~i~G~~Vi~s~~--~p~-----~t~~~~~~~-a~~~~~~~~ 229 (272) T protein:vir:98 158 ETVIVMNPADASTLRLDAAKEWLGATEVGANRVVSGVYGEVLGVQIVRSRK--CPK-----GTAYMVRKG-ALRIMLKRN 229 (272) T ss_pred ccEEEEcHHHHHHHHHhccccccccccccccccccccchhhcCeeEEEcCC--CCc-----ceEEEEcCC-eEEEEecCC Confidence 889999999999998764221 1 11222334455557899999998654 442 335655555 456667888 Q ss_pred eEEEEeccchhhhhhceeeEEEEeeeCcEEecccceEEEEeeccccC Q lcl|Aclame:pro 350 MSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFSAIADQ 396 (408) Q Consensus 350 ~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~~ 396 (408) .+++.+++. .++...++...|+++++.+|++++++++++++.. T Consensus 230 ~~ve~~r~~----~~~~~~i~~~~~~~~~v~~~~~vv~~t~~~a~~~ 272 (272) T protein:vir:98 230 TMVETDRDI----TKAINQIVANKHYGVYLYKAEKAVKITLKDAAKK 272 (272) T ss_pred ceeeecccc----ccceeEEEEEEEEEEEEEcCCceEEEEecccccC Confidence 888766543 4567889999999999999999999999988888 No 106 >protein:vir:3613 Length: 272 # NCBI annotation: MHP # Family: family:all:522 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112699;genbank:gi:13786567;genbank:GeneID:921035 Probab=99.80 E-value=2e-21 Score=134.05 Aligned_cols=267 Identities=14% Similarity=0.120 Sum_probs=189.8 Q ss_pred hhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeec-ccCcc-ceEEeeccCCccccchhccccccccccccccee Q lcl|Aclame:pro 116 ETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESV-STSNG-SRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTI 193 (408) Q Consensus 116 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~-~~~~g-~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~ 193 (408) |..+.+.-.-..+|+.|...+.+.+.+...+.+++.+-+. .+..| .+.+|.+. ..+.+.+.+||++++. 
+..+.++ T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~ti~iP~~~-~~gda~~~~eg~~i~~-~~lt~~~ 78 (272) T protein:vir:36 1 MSKQKTTLADLVNPEVLAPIVSYELNKALRFAPLAQVDTTLQGQPGNTLKFPAFT-YIGDAADVAEGGEISL-DKIGTTT 78 (272) T ss_pred CCCcceehhhhhchHHHHHHHHHHHHhhhhhccccccccccccCCCCEEEEeeec-cCccccccCCCCccCh-hhcCCcc Confidence 4433444445778999999999999888888888766543 22222 34566654 3355567899999974 5568889 Q ss_pred eeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhccccc-cchhhhhhHHHHHHHHHHhhhhhccC Q lcl|Aclame:pro 194 IKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVMKAA-PKKPTIAKFDDVITMINTAVDPAIIA 272 (408) Q Consensus 194 v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~~g~g~~-~~~~~~~~~d~i~~~~~~~l~~~~~~ 272 (408) .++..++.+....++++...++..++.+.+.++++..+++.+|..++....+. .......++|.+.+++.. +...... T Consensus 79 ~~~~i~~~~k~~~vtD~~~~~~~~d~~~~~~~~~a~~~a~~~d~~i~~~l~~~~~~~~~~~~~d~i~~A~~~-lgd~~~~ 157 (272) T protein:vir:36 79 KSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLANKVDDDLLSAAKTTSQTVSTKANVDGVQAALDI-FNDEDAQ 157 (272) T ss_pred eeEeeehhhccccccHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccHHHHHHHHHH-hhhcCCC Confidence 99999999888999999988888899999999999999999999998776443 334455678888887754 5555555 Q ss_pred CCEEEEcHHHHHHHHhhhcccCc--eeeccccccCCcccccccceEeeccccccccccCcceEEEEehhcceEeeeccce Q lcl|Aclame:pro 273 TSSLLTNQSGLNKLALVKTAEGK--YLLEPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENM 350 (408) Q Consensus 273 ~a~~~~n~~~~~~l~~lkd~~G~--~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~ 350 (408) ...++|||..+..|++..+-... +...+.+.++.-++++|+||+++++ +|...+....++++. 
.++..+...++ T Consensus 158 ~~~ivv~p~~~~~L~k~~~~~~~~~~~~~~~~~~G~ig~~~G~~Vv~s~~--~p~~~~~~~~~~~~~--gA~~~~~~~~~ 233 (272) T protein:vir:36 158 AYVLIVNPKDAAKIRKDANAKNIGSEVGANALINGTYADVLGAQIVRSKK--LAEGSALMFKIVSNS--PALKLVLKRGV 233 (272) T ss_pred ceEEEEcHHHHHHHhcccccccccccccccceeeeccceecCeeEEEeCC--CCCCceeEEEEEecc--cceeeeecCCc Confidence 66899999999998765322111 1111122334446899999998654 565443333455553 34444556677 Q ss_pred EEEEeccchhhhhhceeeEEEEeeeCcEEecccceEEEEeecc Q lcl|Aclame:pro 351 SLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFSAI 393 (408) Q Consensus 351 ~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~ 393 (408) +++.+++.. +....+++..++++++++|+++++++++.+ T Consensus 234 ~vE~~R~~~----~~~d~i~~~~~y~~~v~~~~~vv~~t~~g~ 272 (272) T protein:vir:36 234 QVETDRDIV----TKTTVITADEHYAAYLYDLTKVVNITFTGV 272 (272) T ss_pred ccccccchh----hcCcEEEEEEEEEEEEEcCccEEEEeecCC Confidence 777655432 445678899999999999999999999999 No 107 >protein:vir:93742 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240459;genbank:gi:66396126;genbank:GeneID:5133511 Probab=99.76 E-value=1.2e-19 Score=124.37 Aligned_cols=265 Identities=14% Similarity=0.099 Sum_probs=189.5 Q ss_pred hhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeec-ccCcc-ceEEeeccCCccccchhccccccccccccccee Q lcl|Aclame:pro 116 ETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESV-STSNG-SRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTI 193 (408) Q Consensus 116 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~-~~~~g-~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~ 193 (408) |....+.-+-.++|+.|...+.+.+.+...+.+++.+... .+..| .+.+|.+. ..+.+.+..||+.++ .+..+++. 
T Consensus 1 ma~~~T~~~~~iiPev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~ip~~~-~~g~~~~~~eg~~i~-~~~it~~~ 78 (274) T protein:vir:93 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFV-YSGDAQVVAEGEKIP-TDILETKK 78 (274) T ss_pred CCccceehhheechHHHHHHHHHHHHhhhhhcccccccccccCCCCCEEEEEeec-cCCCcccccCCCccc-ccccccce Confidence 4445555556889999999999999988888888766432 33223 45577654 335567889999987 45678899 Q ss_pred eeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhccccccc--hhhhhhHHHHHHHHHHhhhhhcc Q lcl|Aclame:pro 194 IKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVMKAAPK--KPTIAKFDDVITMINTAVDPAII 271 (408) Q Consensus 194 v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~~g~g~~~~--~~~~~~~d~i~~~~~~~l~~~~~ 271 (408) .++..++.+....++++...++..++.+.+.+++++++++++|..++....+++. ......++.+++++.. +..... T Consensus 79 ~~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~~d~~~~~~~~~a~~~~~~~~~~~d~i~dA~~~-l~d~~~ 157 (274) T protein:vir:93 79 REAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVNADITKLNGLQSAIDK-FNDEDL 157 (274) T ss_pred eEEEeeeecccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCHHHHHHHHHH-hhhccC Confidence 9999999998899999999998889999999999999999999999887765543 2234568888888744 555555 Q ss_pred CCCEEEEcHHHHHHHHhhhcccCceeec-----cccccCCcccccccceEeeccccccccccCcceEEEEehhcceEeee Q lcl|Aclame:pro 272 ATSSLLTNQSGLNKLALVKTAEGKYLLE-----PDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFD 346 (408) Q Consensus 272 ~~a~~~~n~~~~~~l~~lkd~~G~~~~~-----~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~ 346 (408) ...+++|||..+..|++. ..-+++-. +.+.++.-++++|+||+++++ +|. +..++.... ++..+. 
T Consensus 158 ~~~~ivv~p~~~~~L~k~--~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~--~p~-----~t~~l~~~g-ai~~~~ 227 (274) T protein:vir:93 158 EPMVLFINPLDAGKLRGD--ASTNFTRATELGDDIIVKGAFGEALGAIIVRTNK--LEA-----GTAILAKKG-AVKLIL 227 (274) T ss_pred CccEEEeCHHHHHHHHhh--hhhcccccccccccceeecccceecCeeEEEcCC--CCc-----ceEEEEeCC-eEEEEe Confidence 667899999999988643 21111111 112334456899999998654 442 334555544 445556 Q ss_pred ccceEEEEeccchhhhhhceeeEEEEeeeCcEEecccceEEEEeeccccCC Q lcl|Aclame:pro 347 RENMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFSAIADQV 397 (408) Q Consensus 347 ~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~ 397 (408) +.+++++.++.. .+....+++..++++++++|++++++++.+..-.- T Consensus 228 ~~~~~vE~~Rd~----~~~~d~i~~~~~y~~~~~~~~~~v~~t~~~~s~~~ 274 (274) T protein:vir:93 228 KRDFFLEVARDA----STKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) T ss_pred cCCcccccccch----hhcccEEEEEEEEEEEEEcCCceEEEeeCccccCC Confidence 677777665543 24567889999999999999999999866554433 No 108 >protein:vir:79928 Length: 393 # NCBI annotation: major head protein # Family: family:all:30335 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429616;genbank:gi:156564106;genbank:GeneID:5525693 Probab=99.76 E-value=1.2e-19 Score=124.30 Aligned_cols=348 Identities=14% Similarity=0.178 Sum_probs=197.6 Q ss_pred HHHHHHHHHHHHhhhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccchhhhHHHHHHHHHHH Q lcl|Aclame:pro 21 VTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKGPLNKSENELKDKFVKDFVNM 100 (408) Q Consensus 21 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~ 100 (408) ++.-.++ +++...++...+++++.+..++ + .+-+.+.+.... 
.-+...-+....|.++ T Consensus 1 ~~~~~~~----~~~~~~~~~~~~e~k~lr~~me----~-~et~~e~~~~~~-------------~~~~~e~el~E~f~Km 58 (393) T protein:vir:79 1 MENWLKQ----LKESGFTETQVQEQKSLRTRME----R-GETLAEADANKL-------------ALNEEETQILESFAKM 58 (393) T ss_pred CchHHHH----HHhccCchhHHHHHHHHHHHhh----h-hhhhhhhhhhhh-------------hcchhHHHHHHHHHHH Confidence 1111111 1122233333333333222221 1 001111111111 0111122334556666 Q ss_pred hhcchhhHHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeecccCccceEEeeccCCccccchhccc Q lcl|Aclame:pro 101 VRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAED 180 (408) Q Consensus 101 ~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~E~ 180 (408) +.+.- .+.+.+..-.-++.+|..+||..+++.+.+...+.....++...+....+. ++.++... ...+.-++|| T Consensus 59 m~G~~---p~~eV~~~e~mtt~~a~IliP~vis~v~~Eaaepl~~~~kl~qk~~L~~Gr-sm~F~~~g--~~Ra~~IgEG 132 (393) T protein:vir:79 59 MEGET---PTNEVNLREFMATPSAQILIPRVIVGTMREAAEPLYIGTKMLQKIRLKSGQ-SMIFPSIG--IMRAYDVAEG 132 (393) T ss_pred hcCCC---chhheehhhhhcCCCcceechhhhhhhhhhcccchhHHHHHHHHHhhhcCc-ceeccchh--eeeecccccc Confidence 65322 222233322345566789999999999999888877777777777763332 34455443 4455778999 Q ss_pred ccccccc--cccceeeeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhcccccc----------- Q lcl|Aclame:pro 181 GKIPDLD--NPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVMKAAP----------- 247 (408) Q Consensus 181 ~~~~~~~--~~~f~~v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~~g~g~~~----------- 247 (408) +++|+.+ ..+++.|+++.+|++..+.+|+|+++||..|+.+++.....+++++..+..++++..+.+ T Consensus 133 gE~~~~sld~~T~dsv~~~~gK~G~~Ia~SqEmIsDSg~Dvin~~l~aA~RaMaRkKee~a~n~fk~~ghtvfDa~st~t 212 (393) T protein:vir:79 133 QEIPEDSIDWQTHESPEIRVGKSGIRLRFTDEMISDSQWDLMSMMIKQAGRAMGRHKEQKAYHQFRSHGHTVFDNYSTNK 212 (393) T ss_pred ccccccchhhhcCCceeEEechhhhhhhhHHHHhhcchHHHHHHHHHHHHHHHHhhhHHHHHhhhhcccceeeeccccCc Confidence 9998755 368899999999999999999999999999999999999999999999999998765432 Q ss_pred 
-----------chhhhhhHHHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHh---hh----cccCceeec--cccccCCc Q lcl|Aclame:pro 248 -----------KKPTIAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLAL---VK----TAEGKYLLE--PDPTKPNS 307 (408) Q Consensus 248 -----------~~~~~~~~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~---lk----d~~G~~~~~--~~~~~~~~ 307 (408) ...+....+|+++++.... +.....++++|||-.|+.+.+ |. .+-|+|--. +....-+| T Consensus 213 ~ahptGr~~~~~qNGTlSleDllDm~~av~-~~hyt~svi~MHPLAWnv~AKna~me~~~~na~gN~~~~~~~ts~algp 291 (393) T protein:vir:79 213 LAHTTGLDKNGVQNDTFSAEDFLDLIIAVM-ANEYTPSDLMMHPLAWTVFAKNELMGSLQANPYGNYPAKGAPSSMALGP 291 (393) T ss_pred cceeecCCccccccccccHHHHHHHHHHHh-cccCCcceEEEcCchhhhhhhhhhhcceeeccccccCccccchhhhhch Confidence 2345677899999987755 444566689999999987764 32 222222111 11112233 Q ss_pred ccccc-----cceEeecccccccc-ccCcceEEEEehhcceEeeeccceEEEEeccchhhhhhceeeEEEEeeeCcEEec Q lcl|Aclame:pro 308 YLIKG-----KQVIVVADRWLPNT-GSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKATD 381 (408) Q Consensus 308 ~~l~G-----~pv~~~~~~~~~~~-~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~ 381 (408) ..|.| +.|++++ .+|-. .....+.+..|-.+.-++..+-+++.+..+ .-..|.+.++...|+|+.|++ T Consensus 292 ~~i~~~~~~nlnv~~sP--fvp~d~k~~rFd~~~Vd~NnvgvlLV~D~i~tdq~d----dk~rdiq~iKl~ERYG~gvLn 365 (393) T protein:vir:79 292 DSIQGRLPFNFNVNLSP--FIPLDKKSRRFDVYAVDRNNVGVLLVRDDLKTDQWD----EKARGLQNIKMIERYGIGILN 365 (393) T ss_pred hhhccccccceeEEEec--ccccccccceeeEEEeecCCceEEEEecCcceeccc----cccccceeeeeeeeeceeeee Confidence 34444 4555543 22321 223333344443332222333333332211 124788999999999999998 Q ss_pred c-cceEEEE---eeccccCCC--CccCC Q lcl|Aclame:pro 382 S-EALVAGS---FSAIADQVG--NFKTT 403 (408) Q Consensus 382 ~-~a~~~l~---~~~~~~~~~--~~~~~ 403 (408) . 
+|+.+.+ ++..-+.|- .++.- T Consensus 366 ~gkaiavakNI~~~k~y~~P~~~~~~~~ 393 (393) T protein:vir:79 366 EGKAIAVAKNISMDKSYAEPMLIKNVGN 393 (393) T ss_pred CCceEEEEecceeecccccchhhhccCC Confidence 8 5655554 222211111 11111 No 109 >protein:vir:94933 Length: 330 # NCBI annotation: putative phage structural protein # Family: family:all:1120 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239278;genbank:gi:66392060;genbank:GeneID:5076578 Probab=99.69 E-value=1.6e-18 Score=118.17 Aligned_cols=294 Identities=10% Similarity=-0.007 Sum_probs=196.1 Q ss_pred ccccccccchhhhHHHHHHHHHHHhhcchhhHHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeecc Q lcl|Aclame:pro 77 EEKGPLNKSENELKDKFVKDFVNMVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVS 156 (408) Q Consensus 77 ~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~ 156 (408) ..+.. ....+.-|+... + . .-..++++-|...++.+.|......|++.+.+.+.|++.+....+. T Consensus 1 ~~~~~--------~~~~~~~~~~~~-~---~---~p~l~m~alTLaea~~l~~d~~~~~VIE~l~~~s~iL~~lpf~~ve 65 (330) T protein:vir:94 1 MVRIC--------TPPLRGRWRTLT-H---Q---FPELKMPTVTLAESAKLSQDHLVSGLIETIVEVNPLYEMMPFTEIE 65 (330) T ss_pred Cceec--------CCccccceeehh-c---c---ccccchhhhhhhHHhhcCchhhHHHHHHhhhccchHHhhccccccc Confidence 00000 000000111100 0 0 0123455556677788999999999999999999999888776666 Q ss_pred cCccceEEeeccCCccccchhcccccccccccccceeeeechheeeeehHHHHHHH--hcchHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 157 TSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSL--KDTAENILAWLSSWIAKKVVVT 234 (408) Q Consensus 157 ~~~g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell--~ds~~~~~~~v~~~l~~~~~~~ 234 (408) +....+. +.. .-+.+.|...++..+.....+|.+++.+++.+.+.+.|.+.+. 
.....++..+-.+...+++.++ T Consensus 66 ~~~~~~~--r~~-~lp~a~~r~~n~~~~~~~~~Tf~q~t~~l~~l~~~~~Vd~~iadl~g~~~d~~~~q~~~~ieal~~~ 142 (330) T protein:vir:94 66 GNALAYN--REN-VLGDVQFLAVGGTITAKNPATFTKVTSELTTLIGDAEVNGLIQATRSDFMDQTSVQVASKAKSIGRQ 142 (330) T ss_pred CCcceee--eee-cCCcceeeeccccccccCcceeeeeeechhhhhhhHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHHH Confidence 5544443 333 3577789888888765444589999999999999999999995 4566788889999999999999 Q ss_pred HHHHHhhccccccc------------------hhhhhhHHHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHhhhcccCce Q lcl|Aclame:pro 235 RNQAIIEVMKAAPK------------------KPTIAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEGKY 296 (408) Q Consensus 235 ~~~~~~~g~g~~~~------------------~~~~~~~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~lkd~~G~~ 296 (408) .+.++|||++++.. .++..+.|++-.++ ..+......+.+|+||+....+|+.+....|+| T Consensus 143 ~e~~linGDs~~~~F~GL~~~~~~~q~i~tg~~gg~~T~d~LDeLl-~~v~~~~g~~~~~l~n~a~~r~I~a~~R~~~~~ 221 (330) T protein:vir:94 143 YQASMITGDGTGNSFQGMMGLVAASQTISAGANGGTLTFELLDQLL-DLVKDKDGQVDYLMSSFAMRRKYFSLLRALGGA 221 (330) T ss_pred HHHHhhccCCCCccccchhhcCCcccEEecCCCCCCCCHHHHHHHH-HHhcCCCCCCcEEEechhHHHHHHHHHHhccCC Confidence 99999999865321 12333445543333 334333346779999999999999999888877 Q ss_pred eecccccc---CCcccccccceEeecccccccc-----ccCcceEEEEehh-----cceEeee---ccceEEEEeccchh Q lcl|Aclame:pro 297 LLEPDPTK---PNSYLIKGKQVIVVADRWLPNT-----GSTVYPLYYGDMS-----QAITLFD---RENMSLLPTNIGAG 360 (408) Q Consensus 297 ~~~~~~~~---~~~~~l~G~pv~~~~~~~~~~~-----~~~~~~~~~gd~~-----~~~~~~~---~~~~~i~~~~~~~~ 360 (408) ...+...+ ....++.|.||+.++. +|.. .++...||+..|. +++.... ..++.+..-.. 
T Consensus 222 ~v~~~~~~~~G~~v~~~~GvPi~~~d~--ip~~~~~~~~~~ttsIyav~~G~~~~~qgV~Gl~~~g~~glsVr~~G~--- 296 (330) T protein:vir:94 222 AIGEVMTLPSGRQIPTYRGVPWFVNDF--IPSNMTQGTATNATAIFAGTFDDGSNKYGIAGLTARGSAGLRVQNVGA--- 296 (330) T ss_pred CCCCcccccCCCEEeeeCCeEEEeccc--ccCCCCcccCCCceeEEEEeecccccccceEeecCCCCCcceeeeCCC--- Confidence 65543322 2224688999887543 4432 3455667777663 3555543 23556543221 Q ss_pred hhhhceeeEEEEeeeCcEEecccceEEEEeeccc Q lcl|Aclame:pro 361 AFETDTTKIRVIDRFDVKATDSEALVAGSFSAIA 394 (408) Q Consensus 361 ~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~ 394 (408) .-+++...+++.++++.++.+|.|+.+|+-.... T Consensus 297 ~~~k~v~~~~v~~y~~~av~~~~a~~~L~~V~~g 330 (330) T protein:vir:94 297 KENADETITRVKMYCGFANFSQLGLAAIKGLIPG 330 (330) T ss_pred ccccceeeEEEEEeeeeEEechhheeeeccccCC Confidence 1246677789999999999999999999855555 No 110 >protein:vir:105334 Length: 276 # NCBI annotation: putative phage major capsid protein # Family: family:all:522 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950669;genbank:gi:119967839;genbank:GeneID:4643213 Probab=99.68 E-value=6.4e-18 Score=114.83 Aligned_cols=267 Identities=13% Similarity=0.117 Sum_probs=186.0 Q ss_pred hhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeec-ccC-ccceEEeeccCCccccchhccccccccccccccee Q lcl|Aclame:pro 116 ETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESV-STS-NGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTI 193 (408) Q Consensus 116 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~-~~~-~g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~ 193 (408) |...++.-.-.++|+.|...+.+.+.+...+.+++.+-+. .+. ...+.+|.+. ..+.+.+++||++++. 
+..+.++ T Consensus 1 Ma~~~T~l~d~i~Pev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~ti~iP~~~-~igda~~~~eg~~i~~-~~lt~~~ 78 (276) T protein:vir:10 1 MAQGTTTKSTQIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFV-YSGDATVVPEGQKIPV-DKIETNR 78 (276) T ss_pred CCcceeehhhhhchHHHHHHHHHHHHhhhhhcccceecccccCCCCCEEEeeeec-CCCccccccCCCccCc-cccccce Confidence 4433444445789999999999999999998888765442 221 2245566553 2345567899999874 5568889 Q ss_pred eeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhcccccc--chhhhhhHHHHHHHHHHhhhhhcc Q lcl|Aclame:pro 194 IKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVMKAAP--KKPTIAKFDDVITMINTAVDPAII 271 (408) Q Consensus 194 v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~~g~g~~~--~~~~~~~~d~i~~~~~~~l~~~~~ 271 (408) .+...++.+....++++....+..|+.+.+.++++..+++.++..++.-..++. ......+++.+.+++.. +..... T Consensus 79 ~~a~i~~~~k~~~~tD~a~~~~~~dp~~~~~~~~~~~~a~~~d~~~~~~l~~~~~~~~~~~~t~d~i~~A~~~-lgd~~~ 157 (276) T protein:vir:10 79 REAKIHKIGKGTDITDEALLSGYGDPQGEAVRQHGLAIANKVDNDVLEALRGTKLTVSADIGTLAGLEAAIDT-FDDEDL 157 (276) T ss_pred eeEEeehccccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCHHHHHHHHHH-hccccC Confidence 999999999999999999998888999999999999999999998876554432 23334568888877644 444444 Q ss_pred CCCEEEEcHHHHHHHHhhhcccCceeec-----cccccCCcccccccceEeeccccccccccCcceEEEEehhcceEeee Q lcl|Aclame:pro 272 ATSSLLTNQSGLNKLALVKTAEGKYLLE-----PDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFD 346 (408) Q Consensus 272 ~~a~~~~n~~~~~~l~~lkd~~G~~~~~-----~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~ 346 (408) ...+++|||..+..|+++.+.+ ++-. +.+.++.-++++|++|+++++ +|.. ..++|+. .++..+. 
T Consensus 158 ~~~~ivv~p~~~~~L~k~~~~~--f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~--~p~~----t~~l~~~--gAi~~~~ 227 (276) T protein:vir:10 158 EPMVLFINPKDAGKLRSSASDN--FTRATELGDNIIVKGAFGEALGAVIVRSKK--LDEG----EAILAKR--GAVKLIT 227 (276) T ss_pred cccEEEEcHHHHHHHHHhcccc--ccccccccccceeccccceecceeEEEcCC--CCcc----eEEEEec--cceeeee Confidence 5667999999999998764322 1111 112334446899999998654 3421 2245553 3455566 Q ss_pred ccceEEEEeccchhhhhhceeeEEEEeeeCcEEecccceEEEEeeccccCCCC Q lcl|Aclame:pro 347 RENMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFSAIADQVGN 399 (408) Q Consensus 347 ~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~~ 399 (408) ..+++++.++... +....+++..++++++.+|..++++++...+.+.+. T Consensus 228 ~~~~~vE~dRd~~----~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~~~~~~ 276 (276) T protein:vir:10 228 KRDFFLETDRDPS----TKTTALYSDKHYVAYLYDESKAVKVTKGAGTTDSGA 276 (276) T ss_pred cCCceeecccchh----hcccEEEEeeEEEEEEEcCcceEEEecCCcCCcCCC Confidence 7788887766543 446678888999999999999999976553333332 No 111 >protein:vir:96123 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240078;genbank:gi:66395742;genbank:GeneID:5133103 Probab=99.67 E-value=1.1e-17 Score=113.54 Aligned_cols=265 Identities=14% Similarity=0.124 Sum_probs=183.9 Q ss_pred hhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeec-ccC-ccceEEeeccCCccccchhccccccccccccccee Q lcl|Aclame:pro 116 ETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESV-STS-NGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTI 193 (408) Q Consensus 116 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~-~~~-~g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~ 193 (408) |...++.-+-..+|+.|+..+.+.+.....+.+++..-+. .+. ...+.+|.+. ..+.+....||+.++. +..+++. 
T Consensus 1 ma~~~T~~~d~i~Pev~s~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~ip~~~-~~g~~~~~~~g~~i~~-~~it~~~ 78 (274) T protein:vir:96 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFT-YSGDAQVIAEGEKIPV-DQIGTSK 78 (274) T ss_pred CCccccchhhhhhhHHHHHHHHHHHHhhhhhcccccccccccCCCCCEEEEEeec-cCCCccccCCCCcCch-hhcccce Confidence 4444445556889999999999999888888777765432 211 1245566654 3344456788888874 5578888 Q ss_pred eeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhccccccc--hhhhhhHHHHHHHHHHhhhhhcc Q lcl|Aclame:pro 194 IKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVMKAAPK--KPTIAKFDDVITMINTAVDPAII 271 (408) Q Consensus 194 v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~~g~g~~~~--~~~~~~~d~i~~~~~~~l~~~~~ 271 (408) .++..++.+....++++....+..++.+.+.++++..+++.+|..++....+++. .....+++.+++++.. +..... T Consensus 79 ~~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~~d~~i~~~l~~a~~~~~~~~~~~d~i~dA~~~-l~d~~~ 157 (274) T protein:vir:96 79 REAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATLTVEADITKLDGLQTAIDK-FNDEDL 157 (274) T ss_pred eEEEEEeeeceeeecHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCcCcccccHHHHHHHHHH-hcccCC Confidence 8899999888889999998888889999999999999999999988876554432 2234457888887654 544445 Q ss_pred CCCEEEEcHHHHHHHHhhhcccCceeec-----cccccCCcccccccceEeeccccccccccCcceEEEEehhcceEeee Q lcl|Aclame:pro 272 ATSSLLTNQSGLNKLALVKTAEGKYLLE-----PDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFD 346 (408) Q Consensus 272 ~~a~~~~n~~~~~~l~~lkd~~G~~~~~-----~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~ 346 (408) ....++|||..+..|++... .+++-. +.+..+.-++++|++|+++++ +|.. ..++|| ..++..+. 
T Consensus 158 ~~~~ivv~p~~~~~L~k~~~--~~f~~~~~~g~~~~~~g~ig~~~G~~Vi~s~~--~p~~----t~~l~~--~gA~~~~~ 227 (274) T protein:vir:96 158 EPMVLFVNPLDAGGLRTSAS--DNFTRPTQLGDNIIVKGAFGEALGAVIVRSNK--LNKG----EALLAK--KGAVKLIT 227 (274) T ss_pred CceEEEeCHHHHHHHHhccc--ccccccccccccceeecccceecCeeEEEcCC--CCcc----eEEEEe--Ccceeeee Confidence 66789999999999877531 112211 112234456899999988654 4532 224454 23455556 Q ss_pred ccceEEEEeccchhhhhhceeeEEEEeeeCcEEecccceEEEEeeccccCC Q lcl|Aclame:pro 347 RENMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFSAIADQV 397 (408) Q Consensus 347 ~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~ 397 (408) ..+++++.++.. .+....+++..++++++++|.++++++..+..+-- T Consensus 228 ~~~~~vE~~Rd~----~~~~d~i~~~~~yg~~~~~~~~vv~~t~~~~~~~~ 274 (274) T protein:vir:96 228 KRDFFLEKDRDA----SRKSTALYSDKHYVAYLYDESKVVKITKGAGDEVM 274 (274) T ss_pred cCCcccccccch----hhcccEEEEeeEEEEEEEcCccEEEEEcCcccccC Confidence 667777654433 24566788889999999999999999776655544 No 112 >protein:vir:80930 Length: 278 # NCBI annotation: Cps # Family: family:all:522 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468392;genbank:gi:157324966;genbank:GeneID:5601363 Probab=99.66 E-value=1.3e-17 Score=113.11 Aligned_cols=264 Identities=15% Similarity=0.058 Sum_probs=176.3 Q ss_pred hhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeec-ccCcc-ceEEeeccCCccccchhccccccccccccccee Q lcl|Aclame:pro 116 ETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESV-STSNG-SRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTI 193 (408) Q Consensus 116 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~-~~~~g-~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~ 193 (408) |...++.-+-..||+.|+..+.+.+.+...+.+++..... .+..| .+.+|.+. ..+.+.+..|++.++. 
+..++++ T Consensus 1 Ma~~~T~~~~~iiPev~s~~v~~~~~~~~v~~~~~~~~~~l~g~~G~tv~ip~~~-~~g~a~~~~~g~~i~~-~~lt~~~ 78 (278) T protein:vir:80 1 MADLTTKLANLIDPEVMGPMISAKLPKAIKFGKIAPIDNSLEGQPGSEITVPKYK-YIGDAQDVAEGAAIDY-SALETES 78 (278) T ss_pred CCCcceehhheecHHHHHHHHHHHHHHhhhhcccceecccccCCCCCEEEEeeec-cCCcceeecCCCcCcc-cccccce Confidence 4444444556889999999999999988888777654432 22223 45566654 2344567888888864 5578889 Q ss_pred eeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhccccccch----hhhh----hHHHHHHHHHHh Q lcl|Aclame:pro 194 IKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVMKAAPKK----PTIA----KFDDVITMINTA 265 (408) Q Consensus 194 v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~~g~g~~~~~----~~~~----~~d~i~~~~~~~ 265 (408) .++..++.+....++++....+..++.+.+.++++..+++..|..++....+.... .+.. .++.+.++.. . T Consensus 79 ~~~~i~~~~~a~~v~D~~~~~~~~d~~~~~~~~~a~~~a~~~d~~l~~~l~~a~~~~~~~~t~~~~~~~~~~~~da~~-~ 157 (278) T protein:vir:80 79 VKHGIKKAGKGVKLTDESVLSGYGDPVEEAQKQIRMAIASKVDNDILEEALTTTLEVKGAINIGLIDKIENTFTDAPD-A 157 (278) T ss_pred eeEeeehhhccccccHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccchhhhHHHHHHHHHH-h Confidence 99998888888899999999888899999999999999999999888765432211 1111 1233333332 2 Q ss_pred hhhhc-cCCCEEEEcHHHHHHHHhhhcccCc---eeeccccccCCcccccccceEeeccccccccccCcceEEEEehhcc Q lcl|Aclame:pro 266 VDPAI-IATSSLLTNQSGLNKLALVKTAEGK---YLLEPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQA 341 (408) Q Consensus 266 l~~~~-~~~a~~~~n~~~~~~l~~lkd~~G~---~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~ 341 (408) +..+. .....++|||..+..|++....+.. ..-.+.+.++.-++++|+||+++++ +|.. 
..++|+ ..+ T Consensus 158 l~~~~~~~~~~ivv~p~~~~~L~k~~~~~~~~~~~~g~~~~~~G~ig~~~G~~Vi~s~~--~p~~----t~~l~~--~gA 229 (278) T protein:vir:80 158 IEDESITTTGVLFLNYKDTAKLREEAAGSWTKASQLGDDLLVKGAFGELLGWEIVRTKK--LADG----NALAVK--AGA 229 (278) T ss_pred hcccCCCcccEEEECHHHHHHHHhhhhhhccccccccccceeeccceeecceeEEEcCC--CCcc----eEEEEe--ccc Confidence 33332 2344688999999988765321110 1111223344456899999999765 4431 234454 234 Q ss_pred eEeeeccceEEEEeccchhhhhhceeeEEEEeeeCcEEecccceEEEEeeccc Q lcl|Aclame:pro 342 ITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFSAIA 394 (408) Q Consensus 342 ~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~ 394 (408) +..+...+++++.++.. .+....+++..++++++++|+++++++..+.. T Consensus 230 i~~~~~~~~~vE~~Rd~----~~~~d~i~~~~~yg~~v~~~~~~v~it~~a~~ 278 (278) T protein:vir:80 230 LKTFLKRNLLAESGRDM----DHKLTKFNADQHYAVALVDETKAVKVVPVAGN 278 (278) T ss_pred eeeeecCCcccccccch----hhccceeeeeeEEEEEEEcCcceEEEeeccCC Confidence 54556677777655433 24566788889999999999999999887776 No 113 >protein:vir:97433 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240749;genbank:gi:66396420;genbank:GeneID:5133789 Probab=99.64 E-value=6.9e-17 Score=109.18 Aligned_cols=265 Identities=14% Similarity=0.102 Sum_probs=183.8 Q ss_pred hhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeec-ccCcc-ceEEeeccCCccccchhccccccccccccccee Q lcl|Aclame:pro 116 ETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESV-STSNG-SRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTI 193 (408) Q Consensus 116 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~-~~~~g-~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~ 193 (408) |....+.-+-.++|+.|...+.+.+.....+.+++.+-+. .+..| .+.+|.+. ..+.+....||+.++ .+..+.+. 
T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~-~~g~a~~~~~g~~i~-~~~lt~~~ 78 (274) T protein:vir:97 1 MPQGLTKTSDQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFV-YSGDAQVVAEGEKIP-TDILETKK 78 (274) T ss_pred CCccceehhheechHHHHHHHHHhhhhhhhhcccceecccccCCCCCEEEEeeec-CCCccccccCCCccc-ccccccce Confidence 4444445556889999999999999888777777766432 22222 45566554 234455678888886 45678888 Q ss_pred eeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhccccccc--hhhhhhHHHHHHHHHHhhhhhcc Q lcl|Aclame:pro 194 IKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVMKAAPK--KPTIAKFDDVITMINTAVDPAII 271 (408) Q Consensus 194 v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~~g~g~~~~--~~~~~~~d~i~~~~~~~l~~~~~ 271 (408) .++..++.+....++++....+..++.+.+.++++.++++.+|..++....+++. ....++++.+++++.. +..... T Consensus 79 ~~~~i~~~~~~~~i~D~~~~~~~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a~~~~~~~~~~~d~i~dA~~~-l~d~~~ 157 (274) T protein:vir:97 79 REAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVNADITKLNGLQSAIDK-FNDEDL 157 (274) T ss_pred eEEEeeeecceecccHHHHHhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccCccccccccCHHHHHHHHHH-hhccCC Confidence 9999999888889999998888888999999999999999999998876554432 2334567888888754 555555 Q ss_pred CCCEEEEcHHHHHHHHhhhcccCceeec-----cccccCCcccccccceEeeccccccccccCcceEEEEehhcceEeee Q lcl|Aclame:pro 272 ATSSLLTNQSGLNKLALVKTAEGKYLLE-----PDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFD 346 (408) Q Consensus 272 ~~a~~~~n~~~~~~l~~lkd~~G~~~~~-----~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~ 346 (408) ...+++|||..+..|++. ..-+++-. +-+.++.-++++|++|+++++ +|.. ..++||- .++..+. 
T Consensus 158 ~~~~ivv~p~~~~~L~k~--~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~--~p~~----t~~l~~~--gA~~~~~ 227 (274) T protein:vir:97 158 EPMVLFVNPLDAGKLRGD--ASTNFTRATELGDDIIVKGAFGEALGAIIVRTNK--LEAG----TAILAKK--GAVKLIL 227 (274) T ss_pred CceEEEeCHHHHHHHHhh--hhhhccccCcccccceeccccceecCeeEEEcCC--CCcc----eEEEEeC--cceEeee Confidence 667899999999998753 21122111 112334446899999998654 4421 2244542 3455556 Q ss_pred ccceEEEEeccchhhhhhceeeEEEEeeeCcEEecccceEEEEeeccccCC Q lcl|Aclame:pro 347 RENMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFSAIADQV 397 (408) Q Consensus 347 ~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~ 397 (408) ..++.++.++... +....+++..++++++++|.++++++++...-.- T Consensus 228 ~~~~~vE~~Rd~~----~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~~~~ 274 (274) T protein:vir:97 228 KRDFFLEVARDAS----TKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) T ss_pred cCCceeccccchh----hcccEEEEEEEEEEEEEcCCceEEEecCcccccC Confidence 6777777655432 3456788889999999999999999866554443 No 114 >protein:vir:94494 Length: 274 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240676;genbank:gi:66396348;genbank:GeneID:5133758 Probab=99.64 E-value=6.9e-17 Score=109.18 Aligned_cols=265 Identities=14% Similarity=0.102 Sum_probs=183.8 Q ss_pred hhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeec-ccCcc-ceEEeeccCCccccchhccccccccccccccee Q lcl|Aclame:pro 116 ETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESV-STSNG-SRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTI 193 (408) Q Consensus 116 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~-~~~~g-~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~ 193 (408) |....+.-+-.++|+.|...+.+.+.....+.+++.+-+. .+..| .+.+|.+. ..+.+....||+.++ .+..+.+. 
T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~-~~g~a~~~~~g~~i~-~~~lt~~~ 78 (274) T protein:vir:94 1 MPQGLTKTSDQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFV-YSGDAQVVAEGEKIP-TDILETKK 78 (274) T ss_pred CCccceehhheechHHHHHHHHHhhhhhhhhcccceecccccCCCCCEEEEeeec-CCCccccccCCCccc-ccccccce Confidence 4444445556889999999999999888777777766432 22222 45566554 234455678888886 45678888 Q ss_pred eeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhccccccc--hhhhhhHHHHHHHHHHhhhhhcc Q lcl|Aclame:pro 194 IKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVMKAAPK--KPTIAKFDDVITMINTAVDPAII 271 (408) Q Consensus 194 v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~~g~g~~~~--~~~~~~~d~i~~~~~~~l~~~~~ 271 (408) .++..++.+....++++....+..++.+.+.++++.++++.+|..++....+++. ....++++.+++++.. +..... T Consensus 79 ~~~~i~~~~~~~~i~D~~~~~~~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a~~~~~~~~~~~d~i~dA~~~-l~d~~~ 157 (274) T protein:vir:94 79 REAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVNADITKLNGLQSAIDK-FNDEDL 157 (274) T ss_pred eEEEeeeecceecccHHHHHhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccCccccccccCHHHHHHHHHH-hhccCC Confidence 9999999888889999998888888999999999999999999998876554432 2334567888888754 555555 Q ss_pred CCCEEEEcHHHHHHHHhhhcccCceeec-----cccccCCcccccccceEeeccccccccccCcceEEEEehhcceEeee Q lcl|Aclame:pro 272 ATSSLLTNQSGLNKLALVKTAEGKYLLE-----PDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFD 346 (408) Q Consensus 272 ~~a~~~~n~~~~~~l~~lkd~~G~~~~~-----~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~ 346 (408) ...+++|||..+..|++. ..-+++-. +-+.++.-++++|++|+++++ +|.. ..++||- .++..+. 
T Consensus 158 ~~~~ivv~p~~~~~L~k~--~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~--~p~~----t~~l~~~--gA~~~~~ 227 (274) T protein:vir:94 158 EPMVLFVNPLDAGKLRGD--ASTNFTRATELGDDIIVKGAFGEALGAIIVRTNK--LEAG----TAILAKK--GAVKLIL 227 (274) T ss_pred CceEEEeCHHHHHHHHhh--hhhhccccCcccccceeccccceecCeeEEEcCC--CCcc----eEEEEeC--cceEeee Confidence 667899999999998753 21122111 112334446899999998654 4421 2244542 3455556 Q ss_pred ccceEEEEeccchhhhhhceeeEEEEeeeCcEEecccceEEEEeeccccCC Q lcl|Aclame:pro 347 RENMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFSAIADQV 397 (408) Q Consensus 347 ~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~ 397 (408) ..++.++.++... +....+++..++++++++|.++++++++...-.- T Consensus 228 ~~~~~vE~~Rd~~----~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~~~~ 274 (274) T protein:vir:94 228 KRDFFLEVARDAS----TKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) T ss_pred cCCceeccccchh----hcccEEEEEEEEEEEEEcCCceEEEecCcccccC Confidence 6777777655432 3456788889999999999999999866554443 No 115 >protein:vir:96833 Length: 275 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240157;genbank:gi:66395822;genbank:GeneID:5133174 Probab=99.64 E-value=3.5e-17 Score=110.82 Aligned_cols=266 Identities=13% Similarity=0.098 Sum_probs=184.9 Q ss_pred HHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeec-ccCcc-ceEEeeccCCccccchhcccccccccccccc Q lcl|Aclame:pro 114 KTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESV-STSNG-SRVYEKWTDVTPLTVMDAEDGKIPDLDNPQL 191 (408) Q Consensus 114 ~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~-~~~~g-~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f 191 (408) -++... +.-.-.++|+.|...+.+.+.+...+.+++.+-+. .+..| .+.+|.+.. .+.+.+..||++++. +..+. 
T Consensus 1 ~~~~~~-T~l~d~i~PEv~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~iP~~~~-ig~a~~~~~g~~i~~-~~lt~ 77 (275) T protein:vir:96 1 MALENM-TKLANMVNPEVLAPMMQAELDKKLKFAQFADIDNTLVGQPGNTITFPAFVY-SGDAKVVPEGEEIPI-DLIET 77 (275) T ss_pred CCCccc-chhhhhhchHHHHHHHHHHHHHhhhhcccceecccccCCCCCEEEeeeecc-CCccccccCCCCcch-hhccc Confidence 222221 22233678999999999999999888888766543 22212 455665542 345567889999874 55688 Q ss_pred eeeeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhccccccc--hhhhhhHHHHHHHHHHhhhhh Q lcl|Aclame:pro 192 TIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVMKAAPK--KPTIAKFDDVITMINTAVDPA 269 (408) Q Consensus 192 ~~v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~~g~g~~~~--~~~~~~~d~i~~~~~~~l~~~ 269 (408) +..+...++.+....++++....+..|+.+.+.++++..+++.+|..++...+++.. ....+++|.+.+++.. +... T Consensus 78 ~~~~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~~~a~~~a~~~d~~ll~~l~~a~~~~~~~~~~~d~i~dA~~~-lgd~ 156 (275) T protein:vir:96 78 KKRQATIRKIGKGTVLTDEALLSGYGDPKGEAVRQHGLAIANKVDNDVLEALQGATLKVEADITKLAGLQTAIDK-FNDE 156 (275) T ss_pred ceeeEEeehhcccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCHHHHHHHHHH-hccc Confidence 888999999999999999998887778899999999999999999998876655432 2344568888887744 4444 Q ss_pred ccCCCEEEEcHHHHHHHHhhhcccCceee-----ccccccCCcccccccceEeeccccccccccCcceEEEEehhcceEe Q lcl|Aclame:pro 270 IIATSSLLTNQSGLNKLALVKTAEGKYLL-----EPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITL 344 (408) Q Consensus 270 ~~~~a~~~~n~~~~~~l~~lkd~~G~~~~-----~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~ 344 (408) ......++|||..+..|+++..- +++- .+.+.++.-++++|++|+++++ +|.. ..++|+- .++.. 
T Consensus 157 ~~~~~~ivv~p~~~~~L~k~~~~--~f~~~~~~g~~~~~~G~ig~~~G~~Vi~s~~--~p~~----t~~i~~~--gA~~~ 226 (275) T protein:vir:96 157 DLEPMVLFVNPLDAGKLRASATD--NFTRATLLGDNVIVKGAFGEALGAIIVRSNK--IKEG----EAILAKR--GAVKL 226 (275) T ss_pred cCCccEEEeCHHHHHHHHhcccc--cccccccccccceeccccceecCeeEEEeCC--CCcc----eEEEEec--cceee Confidence 44566899999999999776321 1111 1123344457899999998654 4432 2355653 34555 Q ss_pred eeccceEEEEeccchhhhhhceeeEEEEeeeCcEEecccceEEEEeeccccCC Q lcl|Aclame:pro 345 FDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFSAIADQV 397 (408) Q Consensus 345 ~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~ 397 (408) +.+.+++++.++... +....+++..++++++++|.+++++++++.+=-+ T Consensus 227 ~~~~~~~vE~~Rd~~----~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~~~~ 275 (275) T protein:vir:96 227 ITKRDFFLETERHAS----HKSTALFSDKHYVAYLYDESKVVKITKSASGLGV 275 (275) T ss_pred eecCCcccccccchh----hcCcEEEEeEEEEEEEEcCccEEEEEecccccCC Confidence 566777776655432 4567788889999999999999999886655444 No 116 >protein:vir:95107 Length: 270 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240822;genbank:gi:66394683;genbank:GeneID:5133901 Probab=99.60 E-value=1.6e-16 Score=107.25 Aligned_cols=266 Identities=14% Similarity=0.104 Sum_probs=181.2 Q ss_pred hhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeec-ccCcc-ceEEeeccCCccccchhccccccccccccccee Q lcl|Aclame:pro 116 ETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESV-STSNG-SRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTI 193 (408) Q Consensus 116 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~-~~~~g-~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~ 193 (408) |..+.-++ .++|+.|...+.+...+...+.+++.+-+. .+..| .+.+|.+. 
-.+.+..+.||++++ ....++++ T Consensus 1 Ma~T~~~d--~I~Pev~~~~V~e~~~~~~~~~~~~~~d~~L~g~~G~ti~~P~~~-~igdae~~~eg~~i~-~~~lt~~~ 76 (270) T protein:vir:95 1 MTQTKKAN--LINPEVLANVVSAQMQNAIRFTPYAVTDDTLVGQPGDTITRPKYA-YIGAAEDLQEGVAMD-TTQMSMTT 76 (270) T ss_pred CCceehhh--hcchHHHHHHHHHHHHhHHhhccccccccccCCCCCCEEEeeeec-CCCccccccCCCccc-hhhcccch Confidence 44433332 679999999999999888888888776443 22112 44455543 344555678999887 45667888 Q ss_pred eeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhccccccc-hhhhhhHHHHHHHHHHhhhhhccC Q lcl|Aclame:pro 194 IKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVMKAAPK-KPTIAKFDDVITMINTAVDPAIIA 272 (408) Q Consensus 194 v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~~g~g~~~~-~~~~~~~d~i~~~~~~~l~~~~~~ 272 (408) -....++.+....++++....+..|....+.++++..++++++..++.-..+... .....+++++++++.. +...... T Consensus 77 ~~a~i~~~gk~~~itD~a~~~~~~dp~~~~~~q~a~~~a~~~d~~li~~l~~a~~~~~~~~t~~~~~dA~~~-lgd~~~~ 155 (270) T protein:vir:95 77 TKVTVKETGKAVEVTQTAIITNVNGTLQEASRQLAMSLADKVEIDYIAELNKSKQTATVSADATGILDAIEV-FNSENDE 155 (270) T ss_pred heeeeehhhCcceecHHHHhhhccchHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCHHHHHHHHHH-hccccCC Confidence 8888899999999999988776667888999999999999999988765554432 2345667888888744 5455555 Q ss_pred CCEEEEcHHHHHHHHhhhcccC-ceeeccccccCCcccccccceEeeccccccccccCcceEEEEehhcceEeeeccceE Q lcl|Aclame:pro 273 TSSLLTNQSGLNKLALVKTAEG-KYLLEPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMS 351 (408) Q Consensus 273 ~a~~~~n~~~~~~l~~lkd~~G-~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~ 351 (408) ..+++|||..+..|++...-.+ ++. ...+.++.-++++|++|++.++. .+. ....+|+ +.++.++...++. 
T Consensus 156 ~~~i~vhs~~~~~Lrk~~~~~~~~~~-~~~~~~G~ig~~~G~~Viv~s~~-~~~----~~~~l~~--~gAi~~~~~~~~~ 227 (270) T protein:vir:95 156 DYVLYVNPKDYNKLVKSLFKVGGNVQ-DRAISKGDLVEIVGVSDIVKSKR-VSE----NTAFLQR--YGAMEIVNKKKPE 227 (270) T ss_pred CcEEEEcHHHHHHHHhhhcccccccc-cchhcccccceecceeEEEeCCC-CCc----eeEEEEe--ccceeeeecCCce Confidence 6689999999999986421111 111 11233445578999999886543 121 1234555 3456666777878 Q ss_pred EEEeccchhhhhhceeeEEEEeeeCcEEecccceEEEEeeccccCCC Q lcl|Aclame:pro 352 LLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFSAIADQVG 398 (408) Q Consensus 352 i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~ 398 (408) ++.++... +....+.+..++++++.+|..+++++++++...-- T Consensus 228 vEtdRd~~----~~~d~i~~~~~y~v~~~~~skvv~~t~~~a~~~~~ 270 (270) T protein:vir:95 228 AYTDFDIL----KRTHLLSTNYHYSVNLKDETGVVKVTFKPSGSLEM 270 (270) T ss_pred eeeccchh----hcccEEEeeeEEEEEEEccceEEEEEecCCCCcCC Confidence 87765443 44567778899999999999999999864433222 No 117 >protein:vir:1239 Length: 274 # NCBI annotation: similar to phage B1 major head protein # Family: family:all:522 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510938;genbank:gi:17426272;genbank:GeneID:927376 Probab=99.56 E-value=6.6e-16 Score=103.79 Aligned_cols=265 Identities=14% Similarity=0.095 Sum_probs=180.4 Q ss_pred hhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeec-ccCcc-ceEEeeccCCccccchhccccccccccccccee Q lcl|Aclame:pro 116 ETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESV-STSNG-SRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTI 193 (408) Q Consensus 116 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~-~~~~g-~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~ 193 (408) +....+.-.-.++|+.|...+.+.+.....+.+++.+-.- .+..| .+.+|.+. ..+.+....||+.++ .+..+.+. 
T Consensus 1 ma~~~T~l~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~-~ig~a~~~~~g~~i~-~~~lt~~~ 78 (274) T protein:vir:12 1 MAQGLTKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFV-YSGDAQVVAEGEKIP-TDILETKK 78 (274) T ss_pred CCcceeehhhhhchHHHHHHHHHHHHhhhhhcccceecccccCCCCCEEEEeeec-CCCccccccCCCccc-hhhcccce Confidence 4333444445789999999999988887777777665432 22223 44566554 234455678888886 45567788 Q ss_pred eeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhcccccc--chhhhhhHHHHHHHHHHhhhhhcc Q lcl|Aclame:pro 194 IKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVMKAAP--KKPTIAKFDDVITMINTAVDPAII 271 (408) Q Consensus 194 v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~~g~g~~~--~~~~~~~~d~i~~~~~~~l~~~~~ 271 (408) .++..++.+....++++....+..|+.+.+.++++.++++.+|..++....++. .......++.+.+++.. +..... T Consensus 79 ~~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~q~~~~~a~~vd~~~l~~~~~a~~~~~~~a~~~d~i~dA~~~-lgd~~~ 157 (274) T protein:vir:12 79 REAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVNADITKLNGLQSAIDK-FNDEDL 157 (274) T ss_pred eeEEeeeecceeeecHHHHHhcccchHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCHHHHHHHHHH-hccccc Confidence 888888888889999988777777889999999999999999998887655432 23344568888888754 555555 Q ss_pred CCCEEEEcHHHHHHHHhhhcccCceeec-----cccccCCcccccccceEeeccccccccccCcceEEEEehhcceEeee Q lcl|Aclame:pro 272 ATSSLLTNQSGLNKLALVKTAEGKYLLE-----PDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFD 346 (408) Q Consensus 272 ~~a~~~~n~~~~~~l~~lkd~~G~~~~~-----~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~ 346 (408) ...+++|||..+..|++.. .-+++-. +-+.++.-++++|++|+++++ +|.. ..++||.- ++..+. 
T Consensus 158 ~~~~ivv~p~~~~~L~k~~--~~~fv~~s~~g~~~~~~G~ig~~~G~~Vi~s~~--~p~~----t~~l~~~g--A~~~~~ 227 (274) T protein:vir:12 158 EPMVLFINPLDAGKLRGDA--STNFTRATELGDDIIVKGAFGEALGAIIVRSNK--LEAG----TAILAKKG--AVKLIL 227 (274) T ss_pred cccEEEeCHHHHHHHHhhh--hhhccccccccccceecccceeecCeeEEEeCC--CCcc----eEEEEecc--ceeeee Confidence 6678999999999887531 1111111 112334446799999998654 4432 23566643 445556 Q ss_pred ccceEEEEeccchhhhhhceeeEEEEeeeCcEEecccceEEEEeeccccCC Q lcl|Aclame:pro 347 RENMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFSAIADQV 397 (408) Q Consensus 347 ~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~ 397 (408) ..+++++.++... +....+++..++++++++|..+++++.....-.- T Consensus 228 ~~~~~vE~~Rd~~----~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~~~~ 274 (274) T protein:vir:12 228 KRDFFLEVARDAS----TKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) T ss_pred cCCceeccccchh----hcccEEEeeeEEEEEEEcCCceEEEEcCCccccC Confidence 6777877665443 3455788889999999999999999854443333 No 118 >protein:vir:93858 Length: 400 # NCBI annotation: putative structural protein # Family: family:all:2417 # MgeID: mge:1479 # MgeName: 712 # Cross-refs: genbank:acc:YP_764266;genbank:gi:115315579;genbank:GeneID:5141552 Probab=99.54 E-value=5.5e-15 Score=98.74 Aligned_cols=363 Identities=11% Similarity=0.054 Sum_probs=193.8 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhc-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccc Q lcl|Aclame:pro 1 MGVKLTVNQLNEAWIASGDKVTDFNDQINMALNDDN-FSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEK 79 (408) Q Consensus 1 M~~~~~i~el~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (408) |++.- |.|-++.+....+..-.++..+..+.-... -+.....+++.-+.++..++...+.++.. 
.....+ T Consensus 8 ~~k~~-~~ek~~~~~~~~e~~~~lks~~~g~~~~~~~~~~~k~~el~kT~Sel~~ei~k~e~eln~--------~~E~~K 78 (400) T protein:vir:93 8 MNKPD-LIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNA--------QEEKPK 78 (400) T ss_pred cccch-HHHHHHHHhhhhhhhhhhhhhhhccchhhhhhhchhHHHHHHHHHHhHHHHHHHhhhhhh--------hhhhcc Confidence 54443 222222333333333333332222200000 00111223333333333333322222211 011111 Q ss_pred cccccchhhhHHHHHHHHHHHhhcchh----hHHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeec Q lcl|Aclame:pro 80 GPLNKSENELKDKFVKDFVNMVRNPMA----FMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESV 155 (408) Q Consensus 80 ~~~~~~~~~~~~~~~~a~~~~~~~~~~----~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~ 155 (408) ......+.-...+-.-.|.+.+....+ ........+...-+..+.-..+|.-+...|.+.++...++.++.++.++ T Consensus 79 gk~~mtefLkT~~A~~~fa~~l~~nsg~sd~knaW~A~l~E~gvt~td~n~iLP~~il~aIq~al~~~~~~~~f~~v~n~ 158 (400) T protein:vir:93 79 GKDKMTNFIESQNAVTEFFDVLKKNSGKSEIKNAWSAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNV 158 (400) T ss_pred cchhHHHhhhhHHHHHHHHHHHHhhcCCcchhhhhhhhhhhcccccCCchhhcchHHHHHHHHhhhccCCcccceeeecC Confidence 111111111111122223222221110 0000011111112212223468999999999999999999999888877 Q ss_pred ccCccceEEeeccCCccccch-hcccccccccccccceeeeechheeeeehHHHHHHHhc--chHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 156 STSNGSRVYEKWTDVTPLTVM-DAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKD--TAENILAWLSSWIAKKVV 232 (408) Q Consensus 156 ~~~~g~~~~~~~~~~~~~~~~-~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~d--s~~~~~~~v~~~l~~~~~ 232 (408) ++ +.+. ....+...+| +--|.++++ +..+|..-++.|+-++.++.+.+-..++ +.-.|..||.++|..++. 
T Consensus 159 p~----l~V~-~~~dt~~qa~gHk~G~~K~e-q~~tl~~rtL~P~~VYk~~~la~~~~~~~~tygaL~nYVm~EL~q~vI 232 (400) T protein:vir:93 159 GA----LLVS-RSFDSANEAQVHKDGQTKTE-QAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIV 232 (400) T ss_pred Cc----eeee-cchhhhcccceeccCCcccc-eeeeeeeeccCHHHHHHHhhhhhhhhhccccHHHHHHHHHHHHHHHHH Confidence 42 2222 2233444566 455666654 4468999999999999988885555443 334579999999999999 Q ss_pred HH-HHHHHhhccccccc--------------------hhhhhhHHHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHhhhc Q lcl|Aclame:pro 233 VT-RNQAIIEVMKAAPK--------------------KPTIAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKT 291 (408) Q Consensus 233 ~~-~~~~~~~g~g~~~~--------------------~~~~~~~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~lkd 291 (408) .+ .+.+++-|+|.++. .++.+.+.+++.-+.....+-...+-.++++|+.|+.|+.|+| T Consensus 233 ~k~Ve~Aii~GdG~Ngf~~~dk~t~Ik~I~~dt~kt~~a~~~~~qdl~E~~~d~~~~~aad~~~Iv~s~d~~A~L~~lk~ 312 (400) T protein:vir:93 233 NKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPTAGRRYLIVKAEDRKALLDELRQ 312 (400) T ss_pred HHHhhhheeecccccccCCCcchhhhhhhhhhhhhhhhcCCccHHHHHHHHHhhhhhccCCceeEEeccchHHHHHHhcC Confidence 64 79999999887642 1223334455554433344445556679999999999999999 Q ss_pred ccCceeeccccccCCcccccccceEe-eccccccccccCcceEEEEehhcceEeeeccceEEEEeccchhhhhhceeeEE Q lcl|Aclame:pro 292 AEGKYLLEPDPTKPNSYLIKGKQVIV-VADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIR 370 (408) Q Consensus 292 ~~G~~~~~~~~~~~~~~~l~G~pv~~-~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r 370 (408) ++|.+.|.....+-+-.+-+|+-=.+ ..-.+++ +. .+..|-. +.+ +-.+++- .....|.+|+..|. 
T Consensus 313 a~~~a~f~~~n~d~~IA~~fGv~~Lv~~Tr~~~~-----kp-~V~VDek--~~i-~~~~~~t----~~sf~~~tNs~~il 379 (400) T protein:vir:93 313 ATANANVRIKNDDTEIASEVGVDEIIVYTGSKAL-----KP-TVLVDQK--YHI-DMQDLTK----VDAFEWKTNSNMIL 379 (400) T ss_pred CcceeeeeeccccchhhhhcccceeeeeccCCCC-----Cc-eeeeehh--hhc-cccCcee----ccceeeeeccceEE Confidence 99999995444443334455553222 1111122 22 2333533 222 3344442 22233667888899 Q ss_pred EEeeeCcEEecccceEEEEee Q lcl|Aclame:pro 371 VIDRFDVKATDSEALVAGSFS 391 (408) Q Consensus 371 ~~~r~d~~v~~~~a~~~l~~~ 391 (408) .+...+|.+.-|++-+++++. T Consensus 380 vetlv~Gsi~~~N~~ay~~v~ 400 (400) T protein:vir:93 380 VETLTSGHVETYNAGAVITVS 400 (400) T ss_pred eeeeeccceecccceeeEeeC Confidence 999999999999999998887 No 119 >protein:vir:96262 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240311;genbank:gi:66395978;genbank:GeneID:5133339 Probab=99.53 E-value=2.6e-15 Score=100.51 Aligned_cols=265 Identities=13% Similarity=0.084 Sum_probs=179.3 Q ss_pred hhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeec-ccC-ccceEEeeccCCccccchhccccccccccccccee Q lcl|Aclame:pro 116 ETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESV-STS-NGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTI 193 (408) Q Consensus 116 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~-~~~-~g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~ 193 (408) +....+.=.-..+|+.|+..+.+.+.....+.+++.+-+. .+. ...+.+|.+.. .+.+....|++.++. +..+.+. 
T Consensus 1 m~~~~T~l~d~i~Pev~~~~v~~~~~~~l~~~~~~~~~~~l~g~~G~tv~iP~~~~-ig~a~~~~~g~~i~~-~~lt~~~ 78 (274) T protein:vir:96 1 MAQGMTKLTNQIVPEVLAPMMQAELEKKLRFASFAEIDNTLVGQPGDTLTFPAFIY-SGDAKVVAEGEKIPT-DILETKK 78 (274) T ss_pred CCcceeehhheechHHHHHHHHHHHHhhhhccccceecccccCCCCCEEEeeeecC-CCccccccCCCccch-hhcccce Confidence 3333333345778999999999999888887777654432 211 12455665542 244456788888864 5567788 Q ss_pred eeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhccccccc--hhhhhhHHHHHHHHHHhhhhhcc Q lcl|Aclame:pro 194 IKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVMKAAPK--KPTIAKFDDVITMINTAVDPAII 271 (408) Q Consensus 194 v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~~g~g~~~~--~~~~~~~d~i~~~~~~~l~~~~~ 271 (408) .++..++.+....++++....+..++.+.+.++++.++++..|..++.-.+++.. .....+++.+.+++.. +..... T Consensus 79 ~~~~i~~~~~a~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~~~~~~~~~~~d~i~~A~~~-lgd~~~ 157 (274) T protein:vir:96 79 REAKIRKIAKGTSISDEALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSAKLTVEADITKLTGLQTAIDK-FNDEDL 157 (274) T ss_pred eEEEeeeeecceeehHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCHHHHHHHHHH-hccccc Confidence 8888888888889999988887778999999999999999999988876655432 2344568888887744 554445 Q ss_pred CCCEEEEcHHHHHHHHhhhcccCceeec-----cccccCCcccccccceEeeccccccccccCcceEEEEehhcceEeee Q lcl|Aclame:pro 272 ATSSLLTNQSGLNKLALVKTAEGKYLLE-----PDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFD 346 (408) Q Consensus 272 ~~a~~~~n~~~~~~l~~lkd~~G~~~~~-----~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~ 346 (408) ...+++|||..+..|++. ..-+++-. +.+.++.-++++|++|+++++ +|. ...++||.. ++..+. 
T Consensus 158 ~~~~ivv~p~~~~~L~k~--~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~--~~~----~t~~l~~~g--A~~~~~ 227 (274) T protein:vir:96 158 EPMVLFISPLDAGKLRGD--ATTNFTRATELGDDVIVKGAFGEALGAVIVRSNK--LEA----GTAILAKKG--AVKLIT 227 (274) T ss_pred cccEEEeCHHHHHHHHhh--ccccccccccccccceeccccceecCeEEEEeCC--CCC----ceEEEEecc--ceeeee Confidence 667899999999998753 11122211 112334456899999998654 342 233667743 445556 Q ss_pred ccceEEEEeccchhhhhhceeeEEEEeeeCcEEecccceEEEEeeccccCC Q lcl|Aclame:pro 347 RENMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFSAIADQV 397 (408) Q Consensus 347 ~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~ 397 (408) ..+++++.++.. .+....+++..++++++++|.++++++...-.=.- T Consensus 228 ~~~~~vE~~Rd~----~~~~d~i~~~~~y~~~~~~~~~~v~~tk~~~~~~~ 274 (274) T protein:vir:96 228 KRDFFLETDRDP----STKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) T ss_pred cCCccccccccc----ccccCEEEEeEEEEEEEEcCCcEEEEEcCCccccC Confidence 677777665543 24567788889999999999999998743322222 No 120 >protein:vir:95898 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240385;genbank:gi:66396054;genbank:GeneID:5133409 Probab=99.53 E-value=2.6e-15 Score=100.51 Aligned_cols=265 Identities=13% Similarity=0.084 Sum_probs=179.3 Q ss_pred hhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeec-ccC-ccceEEeeccCCccccchhccccccccccccccee Q lcl|Aclame:pro 116 ETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESV-STS-NGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTI 193 (408) Q Consensus 116 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~-~~~-~g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~ 193 (408) +....+.=.-..+|+.|+..+.+.+.....+.+++.+-+. .+. ...+.+|.+.. .+.+....|++.++. +..+.+. 
T Consensus 1 m~~~~T~l~d~i~Pev~~~~v~~~~~~~l~~~~~~~~~~~l~g~~G~tv~iP~~~~-ig~a~~~~~g~~i~~-~~lt~~~ 78 (274) T protein:vir:95 1 MAQGMTKLTNQIVPEVLAPMMQAELEKKLRFASFAEIDNTLVGQPGDTLTFPAFIY-SGDAKVVAEGEKIPT-DILETKK 78 (274) T ss_pred CCcceeehhheechHHHHHHHHHHHHhhhhccccceecccccCCCCCEEEeeeecC-CCccccccCCCccch-hhcccce Confidence 3333333345778999999999999888887777654432 211 12455665542 244456788888864 5567788 Q ss_pred eeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhccccccc--hhhhhhHHHHHHHHHHhhhhhcc Q lcl|Aclame:pro 194 IKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVMKAAPK--KPTIAKFDDVITMINTAVDPAII 271 (408) Q Consensus 194 v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~~g~g~~~~--~~~~~~~d~i~~~~~~~l~~~~~ 271 (408) .++..++.+....++++....+..++.+.+.++++.++++..|..++.-.+++.. .....+++.+.+++.. +..... T Consensus 79 ~~~~i~~~~~a~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~~~~~~~~~~~d~i~~A~~~-lgd~~~ 157 (274) T protein:vir:95 79 REAKIRKIAKGTSISDEALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSAKLTVEADITKLTGLQTAIDK-FNDEDL 157 (274) T ss_pred eEEEeeeeecceeehHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCHHHHHHHHHH-hccccc Confidence 8888888888889999988887778999999999999999999988876655432 2344568888887744 554445 Q ss_pred CCCEEEEcHHHHHHHHhhhcccCceeec-----cccccCCcccccccceEeeccccccccccCcceEEEEehhcceEeee Q lcl|Aclame:pro 272 ATSSLLTNQSGLNKLALVKTAEGKYLLE-----PDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFD 346 (408) Q Consensus 272 ~~a~~~~n~~~~~~l~~lkd~~G~~~~~-----~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~ 346 (408) ...+++|||..+..|++. ..-+++-. +.+.++.-++++|++|+++++ +|. ...++||.. ++..+. 
T Consensus 158 ~~~~ivv~p~~~~~L~k~--~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~--~~~----~t~~l~~~g--A~~~~~ 227 (274) T protein:vir:95 158 EPMVLFISPLDAGKLRGD--ATTNFTRATELGDDVIVKGAFGEALGAVIVRSNK--LEA----GTAILAKKG--AVKLIT 227 (274) T ss_pred cccEEEeCHHHHHHHHhh--ccccccccccccccceeccccceecCeEEEEeCC--CCC----ceEEEEecc--ceeeee Confidence 667899999999998753 11122211 112334456899999998654 342 233667743 445556 Q ss_pred ccceEEEEeccchhhhhhceeeEEEEeeeCcEEecccceEEEEeeccccCC Q lcl|Aclame:pro 347 RENMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFSAIADQV 397 (408) Q Consensus 347 ~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~ 397 (408) ..+++++.++.. .+....+++..++++++++|.++++++...-.=.- T Consensus 228 ~~~~~vE~~Rd~----~~~~d~i~~~~~y~~~~~~~~~~v~~tk~~~~~~~ 274 (274) T protein:vir:95 228 KRDFFLETDRDP----STKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) T ss_pred cCCccccccccc----ccccCEEEEeEEEEEEEEcCCcEEEEEcCCccccC Confidence 677777665543 24567788889999999999999998743322222 No 121 >protein:vir:8324 Length: 410 # NCBI annotation: gp41 # Family: family:all:30827 # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817892;genbank:gi:29566325;genbank:GeneID:1259520 Probab=99.52 E-value=4.4e-15 Score=99.29 Aligned_cols=364 Identities=12% Similarity=0.081 Sum_probs=189.2 Q ss_pred CChHHHHH-----HHHHHHHHH-----------HHHHHHHHHHHHHHHhhhcccHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MGVKLTVN-----QLNEAWIAS-----------GDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLV 64 (408) Q Consensus 1 M~~~~~i~-----el~~~~~~~-----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 64 (408) |.-.-.-+ .+..++... ..+++..+++..+...++....+.++...+..+++....++.+.... 
T Consensus 1 ~~n~t~a~d~~~RR~~~~L~~~EvSvv~~PAY~nA~vt~vRe~e~~~~~e~~~~~e~~en~~e~~~~~~~~~~E~Rs~~~ 80 (410) T protein:vir:83 1 MGNATTASDEYIRRLENELREKESLVRGIYDRANASNRDVNEEEGQMVAECRGRMEQIKNQMEQAQEVNRIAFETRSKGQ 80 (410) T ss_pred CCCcccchhhHHHHHHHHhhhhheeeeccccccccccccchhhhccccccccCcccchhhhhHHHHHHHHHHHHHHHHHH Confidence 65442221 111111000 11122222221111222222222222222222222211122111111 Q ss_pred HHHHHHhhhcccccccccccchhhhHHHHHHHHHHHh----hcchhhHHHHHH--HHhhccccccCceecchhhhhhhhh Q lcl|Aclame:pro 65 EAQAEQVVNMREEEKGPLNKSENELKDKFVKDFVNMV----RNPMAFMNTVSS--KTETSGSDSAAGLTIPQDIRTMINT 138 (408) Q Consensus 65 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~----~~~~~~~~~~~~--~a~~~~t~~~gg~~vP~~~~~~ii~ 138 (408) ++. .+..........+. .+-.. ...|.+.+ +.........+. ++....++.+....||+++....++ T Consensus 81 ~i~-~~~~~~r~~p~~~~--veyRS----aGE~lkal~~~~~Gd~~A~~~~e~~r~a~~~~~Tgd~~~~i~~~~v~d~i~ 153 (410) T protein:vir:83 81 AVD-AAISAMRGSPVGTE--VEYRS----AGEYMLDMWNSAQGNASAADRLEVYARAADHQKTGDLQGVIPDPIVGPVID 153 (410) T ss_pred HHH-hhhccCcCCCCCCC--ccccc----HHHHHHHHhccCCchHHHHHHHHHHHHhhccCcccccccccchhHhhhHHH Confidence 111 11111111111111 11111 12344444 333333333343 5555566666667789999999999 Q ss_pred hhhhhhhhhhhhceeecccCccceEEeeccCCcccc-------chhcccccccccccccceeeeechheeeeehHHHHHH Q lcl|Aclame:pro 139 LVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLT-------VMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTS 211 (408) Q Consensus 139 ~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~~-------~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~el 211 (408) .+.+..++.++....|.++.+..|++. . ..++. ....||...+ ..+.+|+..+...++++++..+|++. 
T Consensus 154 li~q~r~i~slf~tLP~~g~T~eY~v~--t-~~~tV~~q~~~~kqa~EGd~L~-~gKl~~~t~tA~ikTyGGyt~LSRQ~ 229 (410) T protein:vir:83 154 FIDSARPLVSTLGTLPLNNATFYRPIV--S-QRPAVGLQGVAGGASDEKTELD-SQKMVIDRLTVNAKTLGGYVNVSRQA 229 (410) T ss_pred HHhhccchhhhhhhCCCCCCeeEEeee--c-cccccccccccccccccccccc-ccceeeeeccceeehhcCccccccee Confidence 999999999998888887765555433 2 22221 2234666665 56678888889999999999999999 Q ss_pred HhcchHHHHHHHHHHHHHHHHHHHHH---HHhhccccccchhhhhhHHHHHHHHHHhhh---hh--ccCCCEEEEcHHHH Q lcl|Aclame:pro 212 LKDTAENILAWLSSWIAKKVVVTRNQ---AIIEVMKAAPKKPTIAKFDDVITMINTAVD---PA--IIATSSLLTNQSGL 283 (408) Q Consensus 212 l~ds~~~~~~~v~~~l~~~~~~~~~~---~~~~g~g~~~~~~~~~~~d~i~~~~~~~l~---~~--~~~~a~~~~n~~~~ 283 (408) ++.|.+...+...+.|..+++.+-+. +++..+-++....+..+.+.++.++..... .+ ...-..+.++|+.+ T Consensus 230 IERs~v~~L~~~lraL~~AYA~atea~vra~L~~t~t~~~a~~~~Tad~~~~~i~da~~~v~da~~~~~~~~i~vS~DVl 309 (410) T protein:vir:83 230 IDFSSPSALDLVVNGLGQQYAIETEALVGAALASTSTGAVGYGNATADNVASAIWQAAGAVYTAVKGMGRLVIAIAPDVL 309 (410) T ss_pred eecCChhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhccHHHHHHHHHHHHHHHhhhhccceeeeEEechhhh Confidence 99999999999999998888888664 345544443333344455555444432221 11 12223578899998 Q ss_pred HHHHhh-hcccCceeecc-------ccccCCcccccccceEeeccccccccccCcceEEEEehhcceEeeeccceEEEEe Q lcl|Aclame:pro 284 NKLALV-KTAEGKYLLEP-------DPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPT 355 (408) Q Consensus 284 ~~l~~l-kd~~G~~~~~~-------~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~ 355 (408) ..+..+ ++-+ +.+.. .+..+-.+.++|.||+..++ ...+.++|-|.. ++..+.-.+-.++.. 
T Consensus 310 ~~~~~~f~~~~--~~~~dt~Gfg~~~lg~gi~G~~~~ipVvm~~~-------a~AgTA~f~~~~-Ai~~~eS~~gp~qL~ 379 (410) T protein:vir:83 310 GDFGPLFAPVN--PTNAHSTGFEAGRFGQGVMGSISGIPVVMSAA-------LGSGDAYLFSTA-AIECFEQRVGTLQVV 379 (410) T ss_pred hhccceeeccC--CCCcccccccccccccchhhhhcccceEEecC-------CCcCeeeEeccc-eeeeeecCCceeEee Confidence 766544 2222 22211 11233457899999998543 334557787865 455554343233332 Q ss_pred c-cchhhhhhceeeEEEEeeeCcEEecccceEEEEee Q lcl|Aclame:pro 356 N-IGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFS 391 (408) Q Consensus 356 ~-~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~ 391 (408) + ...++ .+++- .++.+.+.++.+++-+... T Consensus 380 d~~i~nL-t~~yS-----gY~a~a~~~~~gliPv~g~ 410 (410) T protein:vir:83 380 EPSVFGL-QVAYA-----GYFSTLVVNEDAIVPLVGS 410 (410) T ss_pred CCchhhh-hhhhe-----eeeeeccccccceeeeccC Confidence 2 22222 23321 5578888999998887665 No 122 >protein:vir:739 Length: 231 # NCBI annotation: major structural protein 4 # Family: family:all:522 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108716;genbank:gi:13487838;genbank:GeneID:920884 Probab=99.42 E-value=1e-14 Score=97.24 Aligned_cols=228 Identities=15% Similarity=0.085 Sum_probs=159.6 Q ss_pred hceeecccCccceEEeeccCCccccchhcccccccccccccceeeeechheeeeehHHHHHHHhcchHHHHHHHHHHHHH Q lcl|Aclame:pro 150 VRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAK 229 (408) Q Consensus 150 ~~~~~~~~~~g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~ 229 (408) .+-++ ..-.+.+|.. .+.+..++||++++ ....+++.-+.+.++.+..+.|+++..-.+.-|......++++. 
T Consensus 1 ~~~~~---~Gdtit~P~~---iGda~~v~eG~~i~-~~~l~~t~~~atIk~~gk~~~itD~a~l~~~gDp~~ea~~Q~~~ 73 (231) T protein:vir:73 1 ENGIN---LANLCEYPND---IGDAADVAEGGEIS-LDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGL 73 (231) T ss_pred Ccccc---CCceEEeccc---ccchhhhcCCCcCC-hhhccccceeeeEeeeccceeeeHHHHhhccCchHHHHHHHHHH Confidence 11111 1124555554 23446689999997 45568889999999999999999999887777889999999999 Q ss_pred HHHHHHHHHHhhcccccc-chhhhhhHHHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHhhhcccCc--eeeccccccCC Q lcl|Aclame:pro 230 KVVVTRNQAIIEVMKAAP-KKPTIAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEGK--YLLEPDPTKPN 306 (408) Q Consensus 230 ~~~~~~~~~~~~g~g~~~-~~~~~~~~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~lkd~~G~--~~~~~~~~~~~ 306 (408) ++++++|..++.-..+.. ...+..+++.+.+++.. +........+++|||..+..|++..+.+.. ..-.+-+.+|. T Consensus 74 ~iA~kvD~di~~~~~~a~l~~~~~~t~d~i~~A~~~-fgde~~~~~vivv~p~~~~~Lrk~~~~~~~~~~~g~~i~~~G~ 152 (231) T protein:vir:73 74 SLANKVDDDLLKAAKTTSQTVSTKANVDGVQAALDI-FNDEDAQAYVLIVNPKDAAKIRKDANAKNIGSEVGANALINGT 152 (231) T ss_pred HHHHhhhHHHHHhhccccccccccccHHHHHHHHHH-hccccccceEEEEcchHHHhhhhccchhhhhhhhccceeeecc Confidence 999999999887655433 23445678888877744 555566677899999999999886543221 11112234455 Q ss_pred cccccccceEeeccccccccccCcceEEEEehhcceEeeeccceEEEEeccchhhhhhceeeEEEEeeeCcEEecccceE Q lcl|Aclame:pro 307 SYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALV 386 (408) Q Consensus 307 ~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~ 386 (408) -+.+.|+||+++++ +|...+-...+++ -+.++.++...+++++.+++. 
.+....+.+.+++++++.+|..++ T Consensus 153 iG~i~G~~Vi~S~~--~~~~~~~~~~~i~--~~gAl~~~~k~~~~vEtdRd~----~~k~~~i~~~~~y~v~l~~~~~vv 224 (231) T protein:vir:73 153 YADVLGAQIVRSKK--LAEGSALMFKIVS--NSPALKLVLKRGVQVETDRDI----VTKTTVITADEHYAAYLYDLTKVV 224 (231) T ss_pred cceEcceEEEEcCC--CCCCceeeeeEEe--eccceeeeecccceeeccccc----cccccEEEEeEEEEEEEEcCccEE Confidence 57899999998654 3432221111222 144566677788888876543 355677889999999999999999 Q ss_pred EEEeecc Q lcl|Aclame:pro 387 AGSFSAI 393 (408) Q Consensus 387 ~l~~~~~ 393 (408) +++++-+ T Consensus 225 ~~t~~g~ 231 (231) T protein:vir:73 225 NITFTGV 231 (231) T ss_pred EEEeecC Confidence 9999999 No 123 >protein:vir:97255 Length: 310 # NCBI annotation: hypothetical protein ORF017 # Family: family:all:1120 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294525;genbank:gi:149408246;genbank:GeneID:5237120 Probab=99.41 E-value=1.4e-13 Score=91.10 Aligned_cols=270 Identities=9% Similarity=0.039 Sum_probs=169.0 Q ss_pred hhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeecccCccceEEeeccCCccc--cchhccccccccccccccee Q lcl|Aclame:pro 116 ETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPL--TVMDAEDGKIPDLDNPQLTI 193 (408) Q Consensus 116 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~--~~~~~E~~~~~~~~~~~f~~ 193 (408) |..-|-...+.+.+......||+.+.+.+.|+..+...++.+....+.....-...+. 
..|..-.+..++ +..+|.+ T Consensus 1 mpaltLaea~k~~~d~l~~~ViE~~~~~s~lL~~LpF~~veg~~~~ynR~~~~~~~~~~~v~~~~~~~g~~~-~~~t~~~ 79 (310) T protein:vir:97 1 MASVTLAESAKLAQDELVAGVIENIITVNRMFDVLPFDSIEGNSLAYNRENVLGDVIMAGVGTTFSGAGAGK-AAATFTK 79 (310) T ss_pred CcccchHHHhhcCcchHHHHHHHHHhccchHHHhCCcccccCCcceeeEeeccCCcccccccccccCCCccc-cccccce Confidence 3333334445677888899999999999999988887777665544443322112111 122221122222 4468999 Q ss_pred eeechheeeeehHHHHHHHhc--c-hHHHHHHHHHHHHHHHHHHHHHHHhhccccccc------------------hhhh Q lcl|Aclame:pro 194 IKYLIKRYAGIITATNTSLKD--T-AENILAWLSSWIAKKVVVTRNQAIIEVMKAAPK------------------KPTI 252 (408) Q Consensus 194 v~~~~~~~~~~~~iS~ell~d--s-~~~~~~~v~~~l~~~~~~~~~~~~~~g~g~~~~------------------~~~~ 252 (408) ++...+.+++.+.|.+.+.+- + ..+...+=.+...+++.++.+..+|||+.++.+ .++. T Consensus 80 ~~~~L~i~~g~~~Vd~~i~dl~~~~~~dq~~~Ql~~~iea~~~~~e~~lINGD~a~n~F~GL~~~~~~~q~i~~~~~gg~ 159 (310) T protein:vir:97 80 VNSNLTTIMGDAEVNGLIQATRSGDGNDQTAVQIASKAKSAGRKYQDQLINGNGAGNEFAGLIQLCASGQKATTGATGSA 159 (310) T ss_pred eeeeeeeeeehhhhhhHHHhhhcCChHHHHHHHHHHHHHHHHHHHHHHhhccccCCCcccchhhcCCccceeecCCCCCC Confidence 999999999999999866542 2 344455556778899999999999999875432 1122 Q ss_pred hhHHHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHhh-hcccCceeeccc--cccCCcccccccceEeecccccccc--- Q lcl|Aclame:pro 253 AKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALV-KTAEGKYLLEPD--PTKPNSYLIKGKQVIVVADRWLPNT--- 326 (408) Q Consensus 253 ~~~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~l-kd~~G~~~~~~~--~~~~~~~~l~G~pv~~~~~~~~~~~--- 326 (408) .+.|++-.++ ..+....+.+.+++|||.++.+|+.+ +..+++.++.+. ..+....++.|.|++.++. +|.. 
T Consensus 160 ~t~d~LDeLl-~~v~~~~g~p~~~l~~~~~~r~i~A~~R~~~~~g~~~~~~~~~G~~v~~~~GiPi~~~d~--ip~~~~~ 236 (310) T protein:vir:97 160 ISFAILDELM-DLVVDKDGQVDYLTMHARTLRSYKALLRALGGASINEVVELPSGAEVPAYSGTPIFRNDY--IPTNQTK 236 (310) T ss_pred CCHHHHHHHH-HHHhcCCCCCCEEEecHHHHHHHHHHHHHhcCCCCCCccccCCCCEEeeeCCeEEEEeCc--cCCCccc Confidence 3444443333 33322344677899999998888755 344445554432 2333335899999988653 4432 Q ss_pred --ccCcceEEEEehh-----cceEeee---ccceEEEEeccchhhhhhceeeEEEEeeeCcEEecccceEEEE-eec Q lcl|Aclame:pro 327 --GSTVYPLYYGDMS-----QAITLFD---RENMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGS-FSA 392 (408) Q Consensus 327 --~~~~~~~~~gd~~-----~~~~~~~---~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~-~~~ 392 (408) ..+...||..-|. +++.... ..++.+..-... -+++-..+|+.++++.++..|+|+.+|. +.. T Consensus 237 ~~~~gtTsIya~r~Ge~~~~~Gv~Gl~~~~~~glsVr~~G~~---~~~~v~~~~V~~Y~~~av~~~~A~a~L~~V~~ 310 (310) T protein:vir:97 237 GGTTGCTTIFAGTLDDGSRTHGIAGLTATQAAGIQVVDVGES---EDSDEHIWRVKWYCGLALFSEKGLACADGITN 310 (310) T ss_pred cccCCceeEEEEeeCccccccceeccccCCccceeEEeCCcc---cCCcceeEEEEEeeeEEEecccceeeeccccC Confidence 3456667765553 3444332 234555443211 2456677889999999999999999987 444 No 124 >protein:vir:99424 Length: 360 # NCBI annotation: hypothetical protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919080;genbank:gi:119757038;genbank:GeneID:4606077 Probab=99.29 E-value=2.9e-12 Score=83.86 Aligned_cols=290 Identities=12% Similarity=0.053 Sum_probs=163.4 Q ss_pred cchhhhHHHHHHHHHHHhhcchhhHHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeecccCccceE Q lcl|Aclame:pro 84 KSENELKDKFVKDFVNMVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRV 163 (408) Q Consensus 84 ~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~~~ 163 (408) .+. .+..+.+++.. ...+.....+.+. =||.+++++....+++.+.+.+++++.++++++.+.++.+. 
T Consensus 1 ~~~--------~~~~~~~~n~~--~~~i~k~~it~~~--l~~g~L~p~~a~~Fl~~v~~~t~iL~~~r~~~~~s~~~ei~ 68 (360) T protein:vir:99 1 MSS--------NSTIDSVRNQN--MNSLSQKDIGLAE--LDGFQLPVDVTEEFLERMQKGVQILGMADTMTLARLEMEVP 68 (360) T ss_pred Ccc--------hhHHHHHhhhH--HHHHHhhhccccc--cCceeecHHHHHHHHHHHhhccchhhhcceeeccccccccc Confidence 111 11112222211 1111222222222 24677899999999999999999999999999888877654 Q ss_pred EeeccCCccccchhcccccccccccccceeeeec-hheeeeehHHHHHHHhcc----hHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 164 YEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYL-IKRYAGIITATNTSLKDT----AENILAWLSSWIAKKVVVTRNQA 238 (408) Q Consensus 164 ~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~-~~~~~~~~~iS~ell~ds----~~~~~~~v~~~l~~~~~~~~~~~ 238 (408) -....-....+ ..|+++.++...++...+.+. .+++.....+..+-+++. ...+++.|.+.+++++++-++.- T Consensus 69 kig~G~r~~r~--~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~~i~~~~~~~n~~~~~~~f~~~i~~~~ae~~~~Dle~l 146 (360) T protein:vir:99 69 QFGVPRLSGHT--RDEEGSRTENSEAESGSVKFNATDKSYYILVEPKRDALKNTHYGPDQFGDYIVDQFIERYGNDLGLM 146 (360) T ss_pred ccccceeeccc--cccCCCCCcCCcCccccCccccccceeeEeechHHHHHhhhhcccchhHHHHHHHHHHHHHHHHHHH Confidence 22211111111 123222222222333344442 344445555666665553 34678999999999999999988 Q ss_pred Hhhccccccch---h----------hh------------------------------------h----------hHHHHH Q lcl|Aclame:pro 239 IIEVMKAAPKK---P----------TI------------------------------------A----------KFDDVI 259 (408) Q Consensus 239 ~~~g~g~~~~~---~----------~~------------------------------------~----------~~d~i~ 259 (408) .++|+...... . +. . ....+. 
T Consensus 147 ~~~g~~ds~d~~~~~~~d~fl~~~dGwlKka~~~~~~id~a~d~t~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~lf 226 (360) T protein:vir:99 147 GIRAGASSGNLQSIGGAAELDNTFKGWIARAEGDAQSVDDAGDSTRIGLEDTATADADSMPSIANTDGSGNPQPVDTSLF 226 (360) T ss_pred HhhccchhcccccCcccchhhhhhHHHHHHhhcccchhhccccccccccccccccccccchhhhccccccccccchHHHH Confidence 88887542100 0 00 0 011233 Q ss_pred HHHHHhhhhhccCC----CEEEEcHHHHHHH-HhhhcccCceeeccccccCCcccccccceEeeccccccccccCcceEE Q lcl|Aclame:pro 260 TMINTAVDPAIIAT----SSLLTNQSGLNKL-ALVKTAEGKYLLEPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLY 334 (408) Q Consensus 260 ~~~~~~l~~~~~~~----a~~~~n~~~~~~l-~~lkd~~G~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~ 334 (408) ..+...|+..|+.+ -+|+|++..+... +.|.+-+. ++-...+.++..-+.+|+||+.++. +| ++.++ T Consensus 227 ~~~~~~Lp~kyr~~~~~~~~~~~s~~~~~~yr~~L~~R~t-~LGd~~l~g~~~~~~~Gipi~~v~~--~p-----d~~~m 298 (360) T protein:vir:99 227 NETIQTLDSRYRESDAYSPVLMTSPNQVQSYTMSLTERED-PLGSAVIFGDSDITPFSYDLVGVNG--FP-----DEYMM 298 (360) T ss_pred HHHHHhcchhhhcCcccceEEEccCchHHHHHHHHhccCc-ccchhheecccccccceeeeEEcCC--CC-----CCceE Confidence 34445678888753 3799999875544 44543332 2322224444445678999988753 44 34589 Q ss_pred EEehhcceEeeeccceEEEEeccchhhhhhc-eeeEEEEeeeCcEEecccceEEEEeeccccC Q lcl|Aclame:pro 335 YGDMSQAITLFDRENMSLLPTNIGAGAFETD-TTKIRVIDRFDVKATDSEALVAGSFSAIADQ 396 (408) Q Consensus 335 ~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~-~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~~ 396 (408) |-++++.+.. .+.++++..+........+. 
.+..-....+|+.+.+++|+++++-....++ T Consensus 299 lT~p~NLi~g-~~~~iri~~~~e~~~~~~~~~~~~~~~~~~~D~~iee~~Av~~vt~~~~~~~ 360 (360) T protein:vir:99 299 FTDPNNLAFG-LYEEMELDQSTDTDKVHEQRLHSRNWLEGQFDFQIKEQQAGVLVTDLETPTA 360 (360) T ss_pred EeccCceeEE-eeeeeEEeecccchhhhhhceeeeEEEEEEeeEEEEecccEEEEecCCCCCC Confidence 9999986554 46777776533321111111 1222245568999999999999886655544 No 125 >protein:vir:7990 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817344;genbank:gi:29565772;genbank:GeneID:1258978 Probab=99.05 E-value=3.3e-11 Score=78.05 Aligned_cols=258 Identities=12% Similarity=0.019 Sum_probs=142.7 Q ss_pred hhccccccCceecchhhhhhhhhhhhhhhhhhhhhceee-cccCcc-ceEEeeccCCccccchhccccccccccccccee Q lcl|Aclame:pro 116 ETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVES-VSTSNG-SRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTI 193 (408) Q Consensus 116 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~-~~~~~g-~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~ 193 (408) |... .++|+.|+..+++.++..+.+.+++..-. ..+..| ++.+|...... ......++..++ ....+... T Consensus 1 MA~~------~~~pei~~~~v~~~~~~~lv~~~l~~~~~~~~~~~GdTv~ip~~~~~~-~~d~~~~~~~~~-~~~~~~~~ 72 (273) T protein:vir:79 1 MAFN------NFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPT-VKDYKAAGRQTS-ADAISDTG 72 (273) T ss_pred Ccch------hhhHHHHHHHHHHHHHhhccchhhhhccccccccCCcEEEEeecCccc-ccccccCCCccC-ccccccce Confidence 2221 35899999999999999988877764421 111122 46666544322 223355666554 23345566 Q ss_pred eeechhee-eeehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhccccc---cchh---h-hhhHHHHHHHHHHh Q lcl|Aclame:pro 194 IKYLIKRY-AGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVMKAA---PKKP---T-IAKFDDVITMINTA 265 (408) Q Consensus 194 v~~~~~~~-~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~~g~g~~---~~~~---~-~~~~d~i~~~~~~~ 265 (408) ++++..+. +..+.|++.-...+..++.++ .+++..++++++|..++.-..+. .... + ...++.++.+. .. 
T Consensus 73 ~~~tid~~~~~~~~i~d~d~~~~~~~~~~~-~~~~~~ala~~vD~~i~~~~~~a~~~~~~~~~~~~~~~~~~i~~a~-~~ 150 (273) T protein:vir:79 73 VDLLIDQEKSIDFLVDDIDRVQVAGSLEAY-TRAGATALATDTDKFIADMLVDNGTALTGSAPSDADDAFDLIASAL-KE 150 (273) T ss_pred EEEEEeeecccceeeccHHHHhhcccHHHH-HHHHHHHHHHHHHHHHHHHHhhcccccccccccchhhHHHHHHHHH-HH Confidence 66666553 333455553333445578874 56788899999987665432211 1111 1 12234444443 44 Q ss_pred hhhhcc--CCCEEEEcHHHHHHHHhhhccc-Ccee-e-ccccccCCcccccccceEeeccccccccccCcceEEEEehhc Q lcl|Aclame:pro 266 VDPAII--ATSSLLTNQSGLNKLALVKTAE-GKYL-L-EPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQ 340 (408) Q Consensus 266 l~~~~~--~~a~~~~n~~~~~~l~~lkd~~-G~~~-~-~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~ 340 (408) ++.... .+..++++|..+..|.+..+.- .... . ...+.+|.-++|.|++|+.+.+ +|...+ .. ++.+- +. T Consensus 151 ld~~~vP~~~R~lvv~p~~~~~Ll~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~i~~s~~--lp~~~~-~~-~~a~~-~~ 225 (273) T protein:vir:79 151 LTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNN--LRDTDD-EQ-FVAFH-PS 225 (273) T ss_pred hhhccCCccCcEEEECHHHHHHHhhchhhhhhhhhcccccceeeeEeeEEeceEEEeccc--ccccCc-eE-EEEEe-cc Confidence 555543 3567899999999886643211 1111 1 1124455567899999988543 554322 11 22221 22 Q ss_pred ceEeeeccceEEEEeccchhhhhhceeeEEEEeeeCcEEecccceEEEEeecc Q lcl|Aclame:pro 341 AITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFSAI 393 (408) Q Consensus 341 ~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~ 393 (408) +..... ....++..... . 
+....+++.+++|+++++|++++.++.+.+ T Consensus 226 A~~~a~-~~~~~e~~r~~-~---~~~~~v~~~~~yg~~v~~p~~vv~~~~~g~ 273 (273) T protein:vir:79 226 AAAYVS-QIDTVEALRDQ-D---SFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred ceeeee-ehhhhhcccCc-c---cceeeeeeeeeeeeEEecCceEEEEeccCC Confidence 333222 11222221111 1 234567888999999999999999876665 No 126 >protein:vir:105822 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655767;genbank:gi:109522090;genbank:GeneID:4157630 Probab=99.04 E-value=5.9e-11 Score=76.67 Aligned_cols=258 Identities=11% Similarity=0.019 Sum_probs=139.6 Q ss_pred hhccccccCceecchhhhhhhhhhhhhhhhhhhhhcee-ecccCc-cceEEeeccCCccccchhccccccccccccccee Q lcl|Aclame:pro 116 ETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVE-SVSTSN-GSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTI 193 (408) Q Consensus 116 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~-~~~~~~-g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~ 193 (408) |... .++|+.|+..+++.++..+.+.+++..- ...... .++.+|...... ......++..++. ...+-+. T Consensus 1 MA~~------~~~pe~~~~~v~~~~~~~lv~~~l~~~~~~~~~~~Gdtv~ip~~~~~~-~~d~~~~~~~~~~-~~~~~~~ 72 (273) T protein:vir:10 1 MAFN------NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPT-VKDYKAAGRQTSA-DAISDTG 72 (273) T ss_pred Ccch------hhhHHHHHHHHHHHHHhhhccchhhccccccccccCceEEEeeccccc-ccccccCCCccCc-cccccce Confidence 2221 3579999999999999988888776432 111112 245555543322 2223445554432 2234455 Q ss_pred eeechhee-eeehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhccccc-------cchhhhhhHHHHHHHHHHh Q lcl|Aclame:pro 194 IKYLIKRY-AGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVMKAA-------PKKPTIAKFDDVITMINTA 265 (408) Q Consensus 194 v~~~~~~~-~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~~g~g~~-------~~~~~~~~~d~i~~~~~~~ 265 (408) ++++..+. +....|++.-...+..++.+ +.++..++++.+.|..++.-..+. .+......++.++.+. .. 
T Consensus 73 ~~~tid~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~alA~~vD~~i~~~~~~a~~~~~~~~~~~~~~~~~~i~~a~-~~ 150 (273) T protein:vir:10 73 VDLLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADMLVDNGTALTGSAPTDADDAFDLIAKAL-KE 150 (273) T ss_pred EEEEEeeeeecceEeecHHHhhhhccHHH-HHHHHHHHHHHHHHHHHHHHHhccccccccccccchhHHHHHHHHHH-HH Confidence 55555443 22334555333334557877 456788999999998766432211 1111112244454443 34 Q ss_pred hhhhcc--CCCEEEEcHHHHHHHHhhhcccCc-eee--ccccccCCcccccccceEeeccccccccccCcceEEEEehhc Q lcl|Aclame:pro 266 VDPAII--ATSSLLTNQSGLNKLALVKTAEGK-YLL--EPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQ 340 (408) Q Consensus 266 l~~~~~--~~a~~~~n~~~~~~l~~lkd~~G~-~~~--~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~ 340 (408) |+.... .+..++++|..+..|.+..+--.+ ... ...+.++.-++|.|++|+.+.+ +|...+ ..++.+- +. T Consensus 151 ld~~~vP~~~R~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~i~G~~v~~s~~--lp~~~~--~~~~~~~-~~ 225 (273) T protein:vir:10 151 LTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNN--LRDTDD--EQFVAFH-PS 225 (273) T ss_pred hhhcCCCcCCCEEEECHHHHHHHhcchhhhhhhhccccccceeeeeeeEEeceEEEEecc--cccCCc--cEEEEEe-cc Confidence 555543 356789999999988764321111 111 1123455567899999998543 564332 2233332 22 Q ss_pred ceEeeeccceEEEEeccchhhhhhceeeEEEEeeeCcEEecccceEEEEeecc Q lcl|Aclame:pro 341 AITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFSAI 393 (408) Q Consensus 341 ~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~ 393 (408) +..... ...+++.... .. 
+....+++...+|+++++|++++.++.+.+ T Consensus 226 A~~~a~-q~~~~e~~r~-~~---~~~~~v~~~~~yg~~v~~~~~~~~l~~~g~ 273 (273) T protein:vir:10 226 AAAYVS-QIDTVEALRD-QD---SFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred ceeeee-eeehhhcccC-CC---cceeeeeeeeeeeeeEeccceEEEEeccCC Confidence 332222 2122222111 11 224567788899999999999999876655 No 127 >protein:vir:102605 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_655002;genbank:gi:109392192;genbank:GeneID:4157227 Probab=99.04 E-value=5.9e-11 Score=76.67 Aligned_cols=258 Identities=11% Similarity=0.019 Sum_probs=139.6 Q ss_pred hhccccccCceecchhhhhhhhhhhhhhhhhhhhhcee-ecccCc-cceEEeeccCCccccchhccccccccccccccee Q lcl|Aclame:pro 116 ETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVE-SVSTSN-GSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTI 193 (408) Q Consensus 116 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~-~~~~~~-g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~ 193 (408) |... .++|+.|+..+++.++..+.+.+++..- ...... .++.+|...... ......++..++. ...+-+. T Consensus 1 MA~~------~~~pe~~~~~v~~~~~~~lv~~~l~~~~~~~~~~~Gdtv~ip~~~~~~-~~d~~~~~~~~~~-~~~~~~~ 72 (273) T protein:vir:10 1 MAFN------NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPT-VKDYKAAGRQTSA-DAISDTG 72 (273) T ss_pred Ccch------hhhHHHHHHHHHHHHHhhhccchhhccccccccccCceEEEeeccccc-ccccccCCCccCc-cccccce Confidence 2221 3579999999999999988888776432 111112 245555543322 2223445554432 2234455 Q ss_pred eeechhee-eeehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhccccc-------cchhhhhhHHHHHHHHHHh Q lcl|Aclame:pro 194 IKYLIKRY-AGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVMKAA-------PKKPTIAKFDDVITMINTA 265 (408) Q Consensus 194 v~~~~~~~-~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~~g~g~~-------~~~~~~~~~d~i~~~~~~~ 265 (408) ++++..+. +....|++.-...+..++.+ +.++..++++.+.|..++.-..+. .+......++.++.+. .. 
T Consensus 73 ~~~tid~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~alA~~vD~~i~~~~~~a~~~~~~~~~~~~~~~~~~i~~a~-~~ 150 (273) T protein:vir:10 73 VDLLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADMLVDNGTALTGSAPTDADDAFDLIAKAL-KE 150 (273) T ss_pred EEEEEeeeeecceEeecHHHhhhhccHHH-HHHHHHHHHHHHHHHHHHHHHhccccccccccccchhHHHHHHHHHH-HH Confidence 55555443 22334555333334557877 456788999999998766432211 1111112244454443 34 Q ss_pred hhhhcc--CCCEEEEcHHHHHHHHhhhcccCc-eee--ccccccCCcccccccceEeeccccccccccCcceEEEEehhc Q lcl|Aclame:pro 266 VDPAII--ATSSLLTNQSGLNKLALVKTAEGK-YLL--EPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQ 340 (408) Q Consensus 266 l~~~~~--~~a~~~~n~~~~~~l~~lkd~~G~-~~~--~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~ 340 (408) |+.... .+..++++|..+..|.+..+--.+ ... ...+.++.-++|.|++|+.+.+ +|...+ ..++.+- +. T Consensus 151 ld~~~vP~~~R~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~i~G~~v~~s~~--lp~~~~--~~~~~~~-~~ 225 (273) T protein:vir:10 151 LTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNN--LRDTDD--EQFVAFH-PS 225 (273) T ss_pred hhhcCCCcCCCEEEECHHHHHHHhcchhhhhhhhccccccceeeeeeeEEeceEEEEecc--cccCCc--cEEEEEe-cc Confidence 555543 356789999999988764321111 111 1123455567899999998543 564332 2233332 22 Q ss_pred ceEeeeccceEEEEeccchhhhhhceeeEEEEeeeCcEEecccceEEEEeecc Q lcl|Aclame:pro 341 AITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFSAI 393 (408) Q Consensus 341 ~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~ 393 (408) +..... ...+++.... .. 
+....+++...+|+++++|++++.++.+.+ T Consensus 226 A~~~a~-q~~~~e~~r~-~~---~~~~~v~~~~~yg~~v~~~~~~~~l~~~g~ 273 (273) T protein:vir:10 226 AAAYVS-QIDTVEALRD-QD---SFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred ceeeee-eeehhhcccC-CC---cceeeeeeeeeeeeeEeccceEEEEeccCC Confidence 332222 2122222111 11 224567788899999999999999876655 No 128 >protein:vir:8885 Length: 347 # NCBI annotation: major capsid protein A # Family: family:all:975 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813774;genbank:gi:29366729;genbank:GeneID:1258837 Probab=98.99 E-value=4.4e-11 Score=77.35 Aligned_cols=282 Identities=13% Similarity=0.042 Sum_probs=151.6 Q ss_pred hhhHHHHHHHHhhcccc-ccCc--eecchhhhhhhhhhhhhhhhhhhhhceeecccCccceEEeeccCCccccchhcccc Q lcl|Aclame:pro 105 MAFMNTVSSKTETSGSD-SAAG--LTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDG 181 (408) Q Consensus 105 ~~~~~~~~~~a~~~~t~-~~gg--~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~E~~ 181 (408) +.............+.. +++- .+-=+.|..++.......+.++++.++.++.++ .++.+++....+. .....|. T Consensus 1 ~a~~~~~~~~~~~~g~~~~~~d~~al~ie~~~geV~~~f~~~s~~~~~~~~r~i~~G-~sv~~~~iG~~~~--~~~~~g~ 77 (347) T protein:vir:88 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNG-KSASFPVMGRTKG--YYLAPGE 77 (347) T ss_pred CCCcccchhhhccCCCCccccchHHHHHHHHHHHHHHHHHHHhhhhhccccccccCc-ceEEEeeecceee--eeecccc Confidence 11111111111111111 1111 122277899999988888999999998887643 3566666554433 3344454 Q ss_pred ccccc-ccccceeeeechheeee-ehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhh-----------------c Q lcl|Aclame:pro 182 KIPDL-DNPQLTIIKYLIKRYAG-IITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIE-----------------V 242 (408) Q Consensus 182 ~~~~~-~~~~f~~v~~~~~~~~~-~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~~-----------------g 242 (408) ....+ ..+..+++++...++-- ...|.+-=.-++..|+.+.+.++.++++++..|+.++. 
| T Consensus 78 ~l~~~~~~~~~~~~~i~ID~~~y~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLA~~~D~~i~~~l~~~a~~~~~~~~~~~g 157 (347) T protein:vir:88 78 NLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIAG 157 (347) T ss_pred CCCCCCCCCccceEEEEEechhhhhhhhhhHHHHhhcCCchHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccCC Confidence 44321 22455666665555422 12333322223456789999999999999999988752 1 Q ss_pred cccccchh-------------hhhhHHHHHHHHHHhhhhhccC--CCEEEEcHHHHHHHHhhh-cccCceeeccccccCC Q lcl|Aclame:pro 243 MKAAPKKP-------------TIAKFDDVITMINTAVDPAIIA--TSSLLTNQSGLNKLALVK-TAEGKYLLEPDPTKPN 306 (408) Q Consensus 243 ~g~~~~~~-------------~~~~~d~i~~~~~~~l~~~~~~--~a~~~~n~~~~~~l~~lk-d~~G~~~~~~~~~~~~ 306 (408) .+++.... ....++.|.++. ..|+....+ +-.++++|..|..|.+-. ...+.|.-..++..+. T Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~-~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~~~~~~~~~~~~~G~ 236 (347) T protein:vir:88 158 LGQAVVLNIGAAADLVDVEARGKAILKGLTLAR-ARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALIDPETGN 236 (347) T ss_pred ccccccccccccccccchhhhHHHHHHHHHHHH-HHHhhcCCCCCCCEEEeCHHHHHHHhcchhhhhhhhccccchhcce Confidence 11111100 111244555544 335454433 457889999998875432 2334444444555566 Q ss_pred cccccccceEeecccccccccc------------------CcceEEEEehhcceEee-e--------ccceEEEEeccch Q lcl|Aclame:pro 307 SYLIKGKQVIVVADRWLPNTGS------------------TVYPLYYGDMSQAITLF-D--------RENMSLLPTNIGA 359 (408) Q Consensus 307 ~~~l~G~pv~~~~~~~~~~~~~------------------~~~~~~~gd~~~~~~~~-~--------~~~~~i~~~~~~~ 359 (408) .++++|++|+.+.+.++..... +...-+.+|++....++ . -.+++++..... 
T Consensus 237 vg~i~G~~V~~s~nlp~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~d~~~~~~l~~~~~a~g~v~~~d~~~e~~r~~- 315 (347) T protein:vir:88 237 IRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARRP- 315 (347) T ss_pred eeeeccceEEEeecccccccccccccccccccccccccccccccccccccCcEEEEEechhhhhheecccceeeeeech- Confidence 6789999999976543221110 01111444554422221 1 112233322111 Q ss_pred hhhhhceeeEEEEeeeCcEEecccceEEEEeeccc Q lcl|Aclame:pro 360 GAFETDTTKIRVIDRFDVKATDSEALVAGSFSAIA 394 (408) Q Consensus 360 ~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~ 394 (408) .+-...+++...+|.++++|++.+.+++++++ T Consensus 316 ---~~~~d~i~~~~~~G~~~~rPe~a~~~~~~~a~ 347 (347) T protein:vir:88 316 ---EFQADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) T ss_pred ---hhHHHHhhhhhhhcCceeccceEEEEEeCCCC Confidence 12233567778899999999999999988888 No 129 >protein:vir:108211 Length: 318 # NCBI annotation: gp9 # Family: family:all:6420 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552338;genbank:gi:160700658;genbank:GeneID:5758931 Probab=98.98 E-value=2.7e-11 Score=78.55 Aligned_cols=274 Identities=12% Similarity=0.065 Sum_probs=150.8 Q ss_pred hhcchhhHHHHHHHHhhccccccCceec------chhhhhhhhhhhhhhhhhhhhhceeecccCccceEEeecc--CCcc Q lcl|Aclame:pro 101 VRNPMAFMNTVSSKTETSGSDSAAGLTI------PQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWT--DVTP 172 (408) Q Consensus 101 ~~~~~~~~~~~~~~a~~~~t~~~gg~~v------P~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~--~~~~ 172 (408) +... .-.+.+..++..+| |+.+...|.+.+...-..-.+.+.+. ...++.+.+.... .... 
T Consensus 1 ~~~~----------~~i~s~~~~~~itv~~ll~~P~~I~~~i~e~~~~~~iad~lf~~~~-a~~~~~v~f~~~~p~~~~~ 69 (318) T protein:vir:10 1 MTAP----------TGIVSVSDGPAITVRELVGNPLWIPTALKKMMVNQFISESLFRNGG-ANPNGVVAYNEGNPSFLED 69 (318) T ss_pred CCCC----------CcceeeecCCceehHHhhCCchhHHHHHHHHHhccchhhhhhhccc-ccccceeEEEecccccccC Confidence 0000 00011112222222 44444555665544443333444432 2223445554432 1134 Q ss_pred ccchhcccccccccccccceeeee-chheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhccccc----- Q lcl|Aclame:pro 173 LTVMDAEDGKIPDLDNPQLTIIKY-LIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVMKAA----- 246 (408) Q Consensus 173 ~~~~~~E~~~~~~~~~~~f~~v~~-~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~~g~g~~----- 246 (408) ....++|++++|... +.++...+ ..+|.+.-+.||+|++.....+...-..+.+++.+++..|..++...-.+ T Consensus 70 d~e~VaEggEiP~~~-~~~G~~~ia~~~K~G~~~~vS~Em~~~n~~~~v~r~~~~l~Nti~r~~d~~a~dal~sa~t~~~ 148 (318) T protein:vir:10 70 DVADVAEFGEIPVSA-GARGLPRTAFAVKKALGVRVSKEMIDENRVGAVNDQMLQLRNTFIRANDRSAKALLQSPIVPTL 148 (318) T ss_pred cHhhccCcccccccC-CCCCchhhhhhehhccceeccHHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccc Confidence 556789999999644 67766655 55799999999999999999999888999999999999998776544211 Q ss_pred cchhhhhh----HHHHHHHH----------------HHhhhhhccCCCEEEEcHHHHHHHHh------hhcccCceeec- Q lcl|Aclame:pro 247 PKKPTIAK----FDDVITMI----------------NTAVDPAIIATSSLLTNQSGLNKLAL------VKTAEGKYLLE- 299 (408) Q Consensus 247 ~~~~~~~~----~d~i~~~~----------------~~~l~~~~~~~a~~~~n~~~~~~l~~------lkd~~G~~~~~- 299 (408) +...+... ..++.++. ...++-.|.++ .++|||..|..|++ +-..++.+++. 
T Consensus 149 ~~s~~w~~~~~~~~d~~~A~e~v~~a~~~~~~a~~~~~~~~~GY~pd-tIVlhP~~~~~l~~n~~~~~~y~~~a~~~~~~ 227 (318) T protein:vir:10 149 AVPTAWDNGGKVRTDIAIAIEQISTAAPTAYPAGVGSSDEYFGFIPD-TIVMHYALLPILMDNENFMKVYERNANYVSTA 227 (318) T ss_pred cCCcCCCCcccccccchhhhhhhhhhhhhhhhhhhhhhhhccCccce-eeEECHHHHHHHhcchhhhhhhhccchhhhhc Confidence 00001000 01111111 12233345555 68999999998843 32334444432 Q ss_pred cccccCCcccccccceEeeccccccccccCcceEEEEehhcceEeeeccceEEEEecc--chhhh-hhceeeEEEEeeeC Q lcl|Aclame:pro 300 PDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNI--GAGAF-ETDTTKIRVIDRFD 376 (408) Q Consensus 300 ~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~--~~~~f-~~~~~~~r~~~r~d 376 (408) +..++..+..++|+.|+.++ .+|. +.+++.+-...=.+.+-.+++...-.. ..... .+..+.+|+..+.. T Consensus 228 ~~~tg~~~g~~lGl~vi~s~--~~p~-----~~alvlq~g~vG~~~d~~pl~~t~~~~egg~~~g~~~~s~~~~~~~~~~ 300 (318) T protein:vir:10 228 PDWTGNFPGSVMGLNVIRSR--TFPI-----DRVLIMERGTVGFYSDTRPLQFTALYPEGNGPNGGPTESYRADASHKRA 300 (318) T ss_pred ccccccccceeeceEEeecC--ccCC-----CeeEEEecCCcceeeccccceeeecccCCCCCCCCcchhhheehheeee Confidence 23455556789999988744 4554 224555532211122333444332111 01111 23456678888888 Q ss_pred cEEecccceEEEEeeccc Q lcl|Aclame:pro 377 VKATDSEALVAGSFSAIA 394 (408) Q Consensus 377 ~~v~~~~a~~~l~~~~~~ 394 (408) ..|.+|+|+.+|+.--+- T Consensus 301 ~~V~~PkA~~~itgi~~~ 318 (318) T protein:vir:10 301 LAVDQPKAALWLTGIVTP 318 (318) T ss_pred eeeeCcceeEEEeeccCC Confidence 999999999999833222 No 130 >protein:vir:80213 Length: 334 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522884;genbank:gi:158345177;genbank:GeneID:5687476 Probab=98.97 E-value=5.5e-11 Score=76.81 Aligned_cols=280 Identities=6% Similarity=0.012 Sum_probs=151.5 Q ss_pred HHHhhcchhhHHHHHHHHhhccccccCceecc-hhhhhhhhhhhhhhhhhhhhhceeecccCccceEEeeccCCccccch Q lcl|Aclame:pro 98 VNMVRNPMAFMNTVSSKTETSGSDSAAGLTIP-QDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVM 176 (408) Q Consensus 98 
~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP-~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~ 176 (408) +..+..+. -.+. .-..+++-..++ ..|+.++.......+.++++.++.++.++ .++.++..... .... T Consensus 1 m~~~~~~~------~t~~--~~~~~~~~~~l~le~~~geV~~af~~~s~~~~~~~~r~i~~G-~s~~~~~iG~~--~~~~ 69 (334) T protein:vir:80 1 MTYPAANT------HTRP--GWGGANSDVSLHIEEHLGLVDASFMYSSKFASWMNVRSLRGT-NQLRVDRVGAS--TIAG 69 (334) T ss_pred CCCCcCCC------cccc--ccccccchheehhhhhhhHHHHHHHHhhhhhccceeeecccc-ceEEEeeecce--eeee Confidence 00000000 0000 001122223455 78999999999999999999999988754 36666655433 3344 Q ss_pred hcccccccccccccceeeeechheee-eehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhc------------- Q lcl|Aclame:pro 177 DAEDGKIPDLDNPQLTIIKYLIKRYA-GIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEV------------- 242 (408) Q Consensus 177 ~~E~~~~~~~~~~~f~~v~~~~~~~~-~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~~g------------- 242 (408) ..-|+.+.. +..+.++.+|....+- ....|.+-=--++..|+.+.+.+++++++++..|++++.. T Consensus 70 ~~~g~~l~~-~~~~~~~~~l~ID~~l~~~~~VddiD~~q~~~D~rse~~~~~G~aLA~~~D~~~~~~l~kaa~~~~~~~~ 148 (334) T protein:vir:80 70 RKAGEELVV-QKNVSDKLNLTVDTVLYARHFFDKFDEWTSNLDVRKETAREDGIALARQYDQACIIQLQKCGDFLAPAHL 148 (334) T ss_pred ecCCCCCCC-CCcccCceEEEEeeeeehhhhHhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccc Confidence 555666543 3345566666666532 2223333222245668999999999999999999977521 Q ss_pred -----ccc-------ccchhhhhhHHHHHHHH---HHhhhhhccC-----CCEEEEcHHHHHHHHhhhcccC-ceeecc- Q lcl|Aclame:pro 243 -----MKA-------APKKPTIAKFDDVITMI---NTAVDPAIIA-----TSSLLTNQSGLNKLALVKTAEG-KYLLEP- 300 (408) Q Consensus 243 -----~g~-------~~~~~~~~~~d~i~~~~---~~~l~~~~~~-----~a~~~~n~~~~~~l~~lkd~~G-~~~~~~- 300 (408) .|. ++......+.+.+..++ ...|.....+ ..+.+++|..|..|..-+.--. 
.|.-.+ T Consensus 149 ~~~~~~G~~~~~~~~g~~~~~~~~~~~l~~a~~~a~~~L~e~dvp~~~~~~R~~vv~P~~y~~Ll~~~r~~n~d~~~s~~ 228 (334) T protein:vir:80 149 KPAFHDGILLPSTISGLAADAAADADVLVAAHRQGVEAMVFRDLGDQLMSEGVTLLDPVIFSFLLEHDRLMNVEFGAKEG 228 (334) T ss_pred cccccCCcceeecccccccchhhhHHHHHHHHHHHHHHHHhcCCCCCcCCceEEEeChHHHHHHhcccccccceeccccc Confidence 110 11111223333443322 1223333333 4578999999998865321111 111111 Q ss_pred --ccccCCcccccccceEeeccccccccc------cCcceEEEEehhcceEeeec-cc--------eEEEEeccchhhhh Q lcl|Aclame:pro 301 --DPTKPNSYLIKGKQVIVVADRWLPNTG------STVYPLYYGDMSQAITLFDR-EN--------MSLLPTNIGAGAFE 363 (408) Q Consensus 301 --~~~~~~~~~l~G~pv~~~~~~~~~~~~------~~~~~~~~gd~~~~~~~~~~-~~--------~~i~~~~~~~~~f~ 363 (408) ...++.-.+++|+||+.+.+ +|... ++....+=|||+.....+-. +. ++.+.... ...| T Consensus 229 ~~~~~~g~i~~v~G~~V~~Sn~--~P~~~~t~~~~g~~~~~~agd~t~~~~~~~~~~Al~t~~~~~~~~e~~~~-~~~~- 304 (334) T protein:vir:80 229 GNSFVGGRIAMLNGVRVVETPR--FPQSAITANALGADFNVTDAEVRRKMITFIPSMALISAQVHPVSAQFWEE-KKDF- 304 (334) T ss_pred cccccceeEEEEeceEEEeecC--CCCccccccccccccccccccccceEEEEEeCceEEEEEEeecceeeeec-hhhH- Confidence 12233346899999998643 66543 22333566677654332222 21 12222111 1111 Q ss_pred hceeeEEEEeeeCcEEecccceEEEEeecccc Q lcl|Aclame:pro 364 TDTTKIRVIDRFDVKATDSEALVAGSFSAIAD 395 (408) Q Consensus 364 ~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~ 395 (408) ...+.+.+-+|.++++|++.+.++++-+-| T Consensus 305 --~d~i~~~~a~G~g~lRPeaa~vv~~~~~~~ 334 (334) T protein:vir:80 305 --GHYLDTFQSYNIGQRRPDAVAVHDITVTNP 334 (334) T ss_pred --HHHHHHHHHcCCceeccceEEEEEEeeecC Confidence 123344455899999999999999998888 No 131 >protein:vir:94576 Length: 347 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919012;genbank:gi:119637776;genbank:GeneID:5179336 Probab=98.89 E-value=1.4e-10 Score=74.54 Aligned_cols=278 Identities=12% Similarity=0.069 Sum_probs=150.4 Q ss_pred HHHhhcchhhHHHHHHHHhhcccc-ccCc--eecchhhhhhhhhhhhhhhhhhhhhceeecccCccceEEeeccCCcccc Q lcl|Aclame:pro 98 
VNMVRNPMAFMNTVSSKTETSGSD-SAAG--LTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLT 174 (408) Q Consensus 98 ~~~~~~~~~~~~~~~~~a~~~~t~-~~gg--~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~~ 174 (408) +.....+... ....+.. ++|. .+-=..|+.++.......+.++++.++.++.++ .+..+++.... .. T Consensus 1 ma~~~~~~~~-------~t~~g~~~~~~d~~al~ie~~~geV~~~f~~~s~~~~~~~~rti~~G-~sv~~~~iG~~--~~ 70 (347) T protein:vir:94 1 MANMNGGQQM-------GKDQGKGMSAGDKLALFLKVFGGEVLTAFTRTSVTMNKHLVRSIQSG-KSAQFPVLGRT--KA 70 (347) T ss_pred CCcccccccc-------ccccccCCcccchHHHHHHHHhHHHHHHHHHHHhhhhhhhheecccc-ceEEeeeccce--eE Confidence 0000011000 0001111 2221 122378899999999999999999998887653 35666655433 33 Q ss_pred chhccccccccc-ccccceeeeechhee--eeehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhh---------- Q lcl|Aclame:pro 175 VMDAEDGKIPDL-DNPQLTIIKYLIKRY--AGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIE---------- 241 (408) Q Consensus 175 ~~~~E~~~~~~~-~~~~f~~v~~~~~~~--~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~~---------- 241 (408) ..+..|.....+ ..+..++.++...++ .. ..|-+-=-.++..|+.+.+.++.++++++..|+.++. T Consensus 71 ~~~~~G~~l~~~~~~~~~~e~~ltID~~~y~~-~~VddiD~~q~~~D~rs~~~~~~g~ALA~~~D~~i~~~l~~~a~~~~ 149 (347) T protein:vir:94 71 AYLQPGENLDDKRKDMKHTEKTINIDGLLTAD-VLIYDIEDAMNHYDVRSEYTAQLGESLAMAADGAVLAEMAKLCNLPT 149 (347) T ss_pred eeeecCcCCCCCcCCccccceEEEEcchhhhh-hhhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccc Confidence 445666665432 234566665555543 33 2222222223556899999999999999999988862 Q ss_pred -------cccccc--------c------hhhhhhHHHHHHHHHHhhhhhccC--CCEEEEcHHHHHHHHhh-hcccCcee Q lcl|Aclame:pro 242 -------VMKAAP--------K------KPTIAKFDDVITMINTAVDPAIIA--TSSLLTNQSGLNKLALV-KTAEGKYL 297 (408) Q Consensus 242 -------g~g~~~--------~------~~~~~~~d~i~~~~~~~l~~~~~~--~a~~~~n~~~~~~l~~l-kd~~G~~~ 297 (408) |.+.+. . ......++.+.++. ..|+....+ +-.++++|..|..|.+. ....+.+. 
T Consensus 150 ~~~~~~~g~~~~~~v~i~~~~~~~~~~~~~~~~~~d~i~~a~-~~Lde~dVP~~~R~~vv~P~~y~~LLk~~~~~~~~~~ 228 (347) T protein:vir:94 150 ANNENIAGLGKAHVLEVGDQATLQGDQVKLGQAIIAQLTLAR-AKLTGNYVPSSDRVFYTTPDNYSAILAALMPNAANYQ 228 (347) T ss_pred ccccccccCCcceeEeeeccccccccccccHHHHHHHHHHHH-HHhhhcCCCCCCCEEEeChHHHHHHHHhhcccccccc Confidence 111100 0 00111234454443 334444433 34556689998887653 33334444 Q ss_pred eccccccCCcccccccceEeeccccccccccCc--------------------ceEEEEehhcceEee---------ecc Q lcl|Aclame:pro 298 LEPDPTKPNSYLIKGKQVIVVADRWLPNTGSTV--------------------YPLYYGDMSQAITLF---------DRE 348 (408) Q Consensus 298 ~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~--------------------~~~~~gd~~~~~~~~---------~~~ 348 (408) ...++..+.-.+++|+||+.+.+ +|....+. ..-|=+||++.+..+ .-. T Consensus 229 ~~~~~~~G~V~~v~G~~V~~Sn~--~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~d~~~~~~l~~~~~A~~tv~~~ 306 (347) T protein:vir:94 229 ALIDPSTGSIRNVMGFEVIEVPH--LTAGGAGDNRAEEGVAPTNQKHAFPDTASGDTRVALDNVVGLFNHRSAVGTVKLK 306 (347) T ss_pred cccccccceeEEeeceEEEEcCc--cccccCcccccccccccccccccccccccccccccccceEEEEechhhhhhhhhc Confidence 44455566667999999998765 33321110 111334444422211 122 Q ss_pred ceEEEEeccchhhhhhceeeEEEEeeeCcEEecccceEEEEeecc Q lcl|Aclame:pro 349 NMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFSAI 393 (408) Q Consensus 349 ~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~ 393 (408) ++++++.... 
.+-...+.+..-+|..+++|++.+.++++.+ T Consensus 307 ~~~~e~~~~~----~~~~~~i~~~~a~G~g~~rPe~a~~i~~~~a 347 (347) T protein:vir:94 307 DMALERARRA----NFQADQIIAKYAMGHGGLRPEACGALVFKKA 347 (347) T ss_pred ccceeeeech----hhhhhhhhhhhhhcCcccccceeEEEEecCC Confidence 3344433211 1223356677779999999999999888877 No 132 >protein:vir:5974 Length: 324 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690674;genbank:geneid:6329212;genbank:gi:22855068;goa:Q38582;uniprot:Q38582;genbank:GeneID:955303 Probab=98.81 E-value=2.1e-09 Score=68.13 Aligned_cols=277 Identities=6% Similarity=0.013 Sum_probs=142.8 Q ss_pred hhccccccCceecchhhhhhhhhhhhhhhhhhh---------hhceeecccCccceEEeeccCCccccchhccccccccc Q lcl|Aclame:pro 116 ETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQ---------YVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDL 186 (408) Q Consensus 116 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~---------~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~E~~~~~~~ 186 (408) |.++.- .-..+|+.|...+.....+.+.|.+ +.......++...+.+|.+..-.+.+--+.|+.+++. T Consensus 1 MA~T~l--sd~i~peVf~~yv~~~~~~~~~l~qSg~i~~~a~i~~~l~~~~~G~~i~~P~~~~l~Gd~~~v~~~~~i~~- 77 (324) T protein:vir:59 1 MAYTKI--SDVIVPELFNPYVINTTTQLSAFFQSGIAATDDELNALAKKAGGGSTLNMPYWNDLDGDSQVLNDTDDLVP- 77 (324) T ss_pred CCceee--eceechhHHHHHHHhhhHHHHHHhhcccccccHHHHHHhhccCCCCEEEecccccCCCcccccCCCcccch- Confidence 443222 2367888888887777777766532 1112111112223445544433333444677877764 Q ss_pred ccccceeeeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhcccc-------c-------cchhhh Q lcl|Aclame:pro 187 DNPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVMKA-------A-------PKKPTI 252 (408) Q Consensus 187 ~~~~f~~v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~~g~g~-------~-------~~~~~~ 252 (408) +..+-++-.-..+..+....++++...-+.-+....+.++++....+..+..++....+ . +..... 
T Consensus 78 ~~l~t~~~~a~i~~~~k~~~~tD~a~~~sg~dp~~~i~~q~a~~~~~~~~~~lia~l~g~~~~~~~~~~~~dvsa~~~~~ 157 (324) T protein:vir:59 78 QKINAGQDKAVLILRGNAWSSHDLAATLSGSDPMQAIGSRVAAYWAREMQKIVFAELAGVFSNDDMKDNKLDISGTADGI 157 (324) T ss_pred hhcccceeeEEEEeecCceeehhhhhhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccceeeeeccccce Confidence 33344443444444555555666555445667888899999999999888777644321 1 111122 Q ss_pred hhHHHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHhhhcccCceeeccccccCCcccccccceEeecccccccccc--Cc Q lcl|Aclame:pro 253 AKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNSYLIKGKQVIVVADRWLPNTGS--TV 330 (408) Q Consensus 253 ~~~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~--~~ 330 (408) .+++.+.+++.. +......-.+|+||+.++..|++..-. .++. +.-.+..-++++|+||++.|..+...... +. T Consensus 158 ~s~~~l~~A~~~-~GD~~~~~~~ivmhS~v~~~L~~~~li--~~~~-~s~~~~~i~~~~G~~VivdD~~p~~~~~~~~~~ 233 (324) T protein:vir:59 158 YSAETFVDASYK-LGDHESLLTAIGMHSATMASAVKQDLI--EFVK-DSQSGIRFPTYMNKRVIVDDSMPVETLEDGTKV 233 (324) T ss_pred ecHHHHHHHHHH-hCCcccCcEEEEEchHHHHHHHHhhhh--hhcc-ccccCceeeeecccEEEEeCCCCccccCCCCce Confidence 456777777755 434455667899999999999876311 1221 11112233678999999976533221111 11 Q ss_pred -ceEEEEehhcceEeee-ccceEEEEeccchhhhhhceeeEEEEeeeCcEEecccceEEEEeeccccCCCCccCCCcccC Q lcl|Aclame:pro 331 -YPLYYGDMSQAITLFD-RENMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFSAIADQVGNFKTTTSTAV 408 (408) Q Consensus 331 -~~~~~gd~~~~~~~~~-~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~~~~~~~~~~~ 408 (408) ...+||. .++.... +..+.++.++.. 
..+...+....++ +++|.++..-+-+..+..|..+.-.++..- T Consensus 234 y~s~l~~~--GAi~~~~~~~~v~vE~dRd~----~~g~~~l~~r~~~---~~~p~G~s~~~~~~~~~sPt~~~L~~~~NW 304 (324) T protein:vir:59 234 FTSYLFGA--GALGYAEGQPEVPTETARNA----LGSQDILINRKHF---VLHPRGVKFTENAMAGTTPTDEELANGANW 304 (324) T ss_pred EEEEEEec--CeEEEeecCCCcceecccCc----cccceEEEEeeEE---EeEeeeEEecccccCCCCCChhhhcCCccc Confidence 2345553 2233222 223444444332 2344555555554 456666555332222222222222222221 No 133 >protein:vir:100057 Length: 375 # NCBI annotation: T7-like capsid protein # Family: family:all:975 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214206;genbank:gi:61806429;genbank:GeneID:3294737 Probab=98.78 E-value=3.9e-09 Score=66.68 Aligned_cols=284 Identities=12% Similarity=0.028 Sum_probs=143.6 Q ss_pred HHHHhhcchhhHHHHHHHHhhcccc-ccCc-----eecchhhhhhhhhhhhhhhhhhhhhceeecccCccceEEeeccCC Q lcl|Aclame:pro 97 FVNMVRNPMAFMNTVSSKTETSGSD-SAAG-----LTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDV 170 (408) Q Consensus 97 ~~~~~~~~~~~~~~~~~~a~~~~t~-~~gg-----~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~ 170 (408) +...... ...+.+.++. .-|| .+-=+.|..++.......+.++++.++.++.++ .++.+++.... T Consensus 1 ~~~~~~~--------~~~~~n~~t~~~~~~~~~~~al~le~f~geV~~~f~~~si~~~~~~~rti~~G-ksv~f~~iG~~ 71 (375) T protein:vir:10 1 MANANQV--------ALGRSNLSTGTGYGGATDKYALYLKLFSGEMFKGFQHETIARDLVTKRTLKNG-KSLQFIYTGRM 71 (375) T ss_pred Ccccccc--------ccCccccCCccccccccchHHHHHHHHhHHHHHHHHHHHhhhccccccccccC-ceEEEEeeeee Confidence 0000000 0111111111 1111 122267889999999999999999999888754 36666666444 Q ss_pred ccccchhccccccccc--cccccee--eeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhcc--- Q lcl|Aclame:pro 171 TPLTVMDAEDGKIPDL--DNPQLTI--IKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVM--- 243 (408) Q Consensus 171 ~~~~~~~~E~~~~~~~--~~~~f~~--v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~~g~--- 243 (408) +. ....-|.++... .+....+ +++.-.++..+ .|.+-=-.++..++.+.+.++.++++++..|+.++.-. 
T Consensus 72 t~--~~~t~G~~i~~~~~~d~~~te~~l~ID~~~y~~~-~VdDiD~aqa~~Dlr~e~s~~~G~aLA~~~D~~i~~~l~ka 148 (375) T protein:vir:10 72 TS--SFHTPGTPILGNADKAPPVAEKTIVMDDLLISSA-FVYDLDETLAHYELRGEISKKIGYALAEKYDRLIFRSITRG 148 (375) T ss_pred EE--eeecCCcCcCCccccCCCCCceEEEecchhhhhh-hHhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 33 334334433211 1112233 44444433332 22221122356689999999999999999998876311 Q ss_pred ---------------c-------cccchh----hhhhHHHHHHHHHHhhhhhccC--CCEEEEcHHHHHHHHhhhcccCc Q lcl|Aclame:pro 244 ---------------K-------AAPKKP----TIAKFDDVITMINTAVDPAIIA--TSSLLTNQSGLNKLALVKTAEGK 295 (408) Q Consensus 244 ---------------g-------~~~~~~----~~~~~d~i~~~~~~~l~~~~~~--~a~~~~n~~~~~~l~~lkd~~G~ 295 (408) | ++.... ....++.+.++. ..|+....+ +.+++++|..|..|.+-+|.+ T Consensus 149 a~~~~p~~~~~~~~~Gg~~i~~~sg~~~~~~~ta~~~~~ai~~a~-~~Lde~~VP~~~R~~vv~P~~y~~Ll~~~d~~-- 225 (375) T protein:vir:10 149 ARSASPVSATNFVEPGGTQIRVGSGTNESDAFTASALVNAFYDAA-AAMDEKGVSSQGRCAVLNPRQYYALIQDIGSN-- 225 (375) T ss_pred hhhccccccccccccCcceeeeccccccccccCHHHHHHHHHHHH-HHHhhcCCCCCCCEEEeChHHHHHHHhcCCcc-- Confidence 0 011111 112244455444 345554443 456789999998886555432 Q ss_pred eeeccc------cccCCcccccccceEeecccccccccc--------------------------------CcceEEEEe Q lcl|Aclame:pro 296 YLLEPD------PTKPNSYLIKGKQVIVVADRWLPNTGS--------------------------------TVYPLYYGD 337 (408) Q Consensus 296 ~~~~~~------~~~~~~~~l~G~pv~~~~~~~~~~~~~--------------------------------~~~~~~~gd 337 (408) .+...+ ...+...++.|++|+.+.+ +|.... 
+.+.-|-+| T Consensus 226 ~~~n~d~~~~~~~~~g~v~~i~Gv~V~~Sn~--lP~~~~~~~~~g~~~~~~a~~~~~~~~~~~~~~~~~~~g~~~~y~~d 303 (375) T protein:vir:10 226 GLVNRDVQGSALQSGNGVIEIAGIHIYKSMN--IPFLGKYGVKYGGTTGETSPGNLGSHIGPTPENANATGGVNNDYGTN 303 (375) T ss_pred ceeeecccccceeccceEEEEeceEEEEecc--ccccccccccccccccccchhhhhccccccCCcceeecccccccccc Confidence 121111 1222235799999988543 443221 111234444 Q ss_pred h---hc-ceEeeec--------cceEEEEeccchhhhhhceeeEEEEeeeCcEEecccceEEEEeeccccCCCCc Q lcl|Aclame:pro 338 M---SQ-AITLFDR--------ENMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFSAIADQVGNF 400 (408) Q Consensus 338 ~---~~-~~~~~~~--------~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~~~ 400 (408) | +. .-+++.+ .+++++++...+. -.+-...+.+..-+|..+++|++.+.|+..++++.. + T Consensus 304 ~~~~~~~~~~~~~~~A~g~v~~~~~~~~~~~~~~~-~~~q~~~i~~~~a~G~~~lrp~~av~l~~~~~~~~~--~ 375 (375) T protein:vir:10 304 AELGAKSCGLIFQKEAAGVVEAIGPQVQVTNGDVS-VIYQGDVILGRMAMGADYLNPAAAVELYIGATAPSA--F 375 (375) T ss_pred ccccCceEEEEEchhheeeeeeeccccccccchhh-heeeeeeeeeeeeeccCccCceeEEEEecCcCcccc--C Confidence 4 11 1122222 2334443321111 112233455666789999999998888766554444 3 No 134 >protein:vir:2201 Length: 345 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041998;swissprot:sw:p19726;genbank:gi:9627469;goa:P19726;uniprot:P19726;genbank:GeneID:1261026 Probab=98.74 E-value=1.5e-09 Score=68.91 Aligned_cols=277 Identities=13% Similarity=0.062 Sum_probs=145.5 Q ss_pred hhhHHHHHHHHhhccccc--cCc--eecchhhhhhhhhhhhhhhhhhhhhceeecccCccceEEeeccCCccccchhccc Q lcl|Aclame:pro 105 MAFMNTVSSKTETSGSDS--AAG--LTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAED 180 (408) Q Consensus 105 ~~~~~~~~~~a~~~~t~~--~gg--~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~E~ 180 (408) +.............+... .|. .+-=+.|..++.......+.++++.++.++.++ .++.+++.... 
.......| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~al~le~f~geV~~~f~~~s~~~~~~~~r~i~~g-ks~~~~~iG~~--~~~~~~~G 77 (345) T protein:vir:22 1 MASMTGGQQMGTNQGKGVVAAGDKLALFLKVFGGEVLTAFARTSVTTSRHMVRSISSG-KSAQFPVLGRT--QAAYLAPG 77 (345) T ss_pred CcccccchhcccccccccccCCchhHHHHHHHhHHHHHHHHHHhhhcccceeeecccc-ceEEEeeecce--EEEeeecC Confidence 111100000000111111 111 223367889999999999999999999988754 35666665433 33456666 Q ss_pred ccccccc-ccccee--eeechheeee-ehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhc-------------- Q lcl|Aclame:pro 181 GKIPDLD-NPQLTI--IKYLIKRYAG-IITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEV-------------- 242 (408) Q Consensus 181 ~~~~~~~-~~~f~~--v~~~~~~~~~-~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~~g-------------- 242 (408) .....+. .+..++ +++.-.++.. .+.==++. ++..|+.+.+.+++++++++..|+.++.- T Consensus 78 ~~l~~~~~~~~~~e~~ltID~~~y~~~~VddiD~~--q~~~D~r~~~s~~~G~aLA~~~D~~i~~~l~k~a~~~~~~~~~ 155 (345) T protein:vir:22 78 ENLDDKRKDIKHTEKVITIDGLLTADVLIYDIEDA--MNHYDVRSEYTSQLGESLAMAADGAVLAEIAGLCNVESKYNEN 155 (345) T ss_pred CCCCCCCCCcccceEEEEecchhhhhhhHhhHHHH--hcCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc Confidence 6653321 345566 4444444333 22222222 35568999999999999999999887621 Q ss_pred ---cccc---------cc-----hhhhhhHHHHHHHHHHhhhhhccC--CCEEEEcHHHHHHHHhhhcc-cCceeecccc Q lcl|Aclame:pro 243 ---MKAA---------PK-----KPTIAKFDDVITMINTAVDPAIIA--TSSLLTNQSGLNKLALVKTA-EGKYLLEPDP 302 (408) Q Consensus 243 ---~g~~---------~~-----~~~~~~~d~i~~~~~~~l~~~~~~--~a~~~~n~~~~~~l~~lkd~-~G~~~~~~~~ 302 (408) .+.+ .. ......++.+.++. ..|+....+ +.+++++|..|..|..-+.- +..|.-..+. 
T Consensus 156 ~~~~~~~~~~~~~~~g~~~t~~~~~~~~~~~ai~~a~-~~Lde~~VP~~~R~~vv~P~~y~~Ll~~~~~~~~~~~~~~~~ 234 (345) T protein:vir:22 156 IEGLGTATVIETTQNKAALTDQVALGKEIIAALTKAR-AALTKNYVPAADRVFYCDPDSYSAILAALMPNAANYAALIDP 234 (345) T ss_pred ccccccccccccccccccccccccCHHHHHHHHHHHH-HHhhhcCCCccCCEEEeChHHHHHHhcccccccccccccccc Confidence 1110 00 00112244454443 335444433 34678999999987543221 2233322233 Q ss_pred ccCCcccccccceEeecccccccccc------------------CcceEEEEehh--------cceEeeeccceEEEEec Q lcl|Aclame:pro 303 TKPNSYLIKGKQVIVVADRWLPNTGS------------------TVYPLYYGDMS--------QAITLFDRENMSLLPTN 356 (408) Q Consensus 303 ~~~~~~~l~G~pv~~~~~~~~~~~~~------------------~~~~~~~gd~~--------~~~~~~~~~~~~i~~~~ 356 (408) ..|.-.+++|+||+.+.+ +|.... +......++-+ .++....-.+++++... T Consensus 235 ~~G~V~~i~G~~V~~sn~--lp~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~l~~h~~A~~~v~~~~~~~e~~r 312 (345) T protein:vir:22 235 EKGSIRNVMGFEVVEVPH--LTAGGAGTAREGTTGQKHVFPANKGEGNVKVAKDNVIGLFMHRSAVGTVKLRDLALERAR 312 (345) T ss_pred ccceEEEEeceEEEeccc--ccccccCccccCcccccccccccccceeeeeccCceEEEEEehhheeeeeeecceeeeee Confidence 344456899999998654 332111 11111111111 11222222233444433 Q ss_pred cchhhhhhceeeEEEEeeeCcEEecccceEEEEeecc Q lcl|Aclame:pro 357 IGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFSAI 393 (408) Q Consensus 357 ~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~ 393 (408) .. 
..| ...+++..-+|.++++|++.+.++++-- T Consensus 313 ~~-~~~---~d~I~~~~a~G~~vlRPeaa~~i~~~~~ 345 (345) T protein:vir:22 313 RA-NFQ---ADQIIAKYAMGHGGLRPEAAGAVVFKVE 345 (345) T ss_pred ch-hHH---HHHHHHHHhcCCcccccceeEEEEEeeC Confidence 22 112 2345666778999999999999988766 No 135 >protein:vir:3364 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523335;genbank:gi:17570826;genbank:GeneID:927448 Probab=98.73 E-value=1.1e-09 Score=69.75 Aligned_cols=281 Identities=10% Similarity=0.076 Sum_probs=145.3 Q ss_pred HHHhhcchhhHHHHHHHHhhccccccCc-eecchhhhhhhhhhhhhhhhhhhhhceeecccCccceEEeeccCCccccch Q lcl|Aclame:pro 98 VNMVRNPMAFMNTVSSKTETSGSDSAAG-LTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVM 176 (408) Q Consensus 98 ~~~~~~~~~~~~~~~~~a~~~~t~~~gg-~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~ 176 (408) +.....+... ..|.-..+..++.= ..| +.|+.++.......+.++++.++.++.++ .++.+++....+. .. T Consensus 1 ~~~~~~~~~~----~t~~g~~~~~~~~~al~i-e~~~g~V~~~f~~~s~~~~~v~~r~~~~G-~sv~i~~iG~~t~--~~ 72 (347) T protein:vir:33 1 MANIQGGQQI----GTNQGKGQSAADKLALFL-KVFGGEVLTAFARTSVTMPRHMLRSIASG-KSAQFPVIGRTKA--AY 72 (347) T ss_pred CCCCccCccc----ccccccCCcccchHHHHH-HHHHHHHHHHHHHHHhhhhhhcccccccc-ceeEeeeccceee--ee Confidence 0000011000 00000000111100 124 78899999989999999999988776643 3566666554433 33 Q ss_pred hccccccccc-ccccceeeeechh--eeee-ehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhcc--------- Q lcl|Aclame:pro 177 DAEDGKIPDL-DNPQLTIIKYLIK--RYAG-IITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVM--------- 243 (408) Q Consensus 177 ~~E~~~~~~~-~~~~f~~v~~~~~--~~~~-~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~~g~--------- 243 (408) ...|..+... ...+..+.++... ++.. ++.==++. ++..|+.+.+.++.++++++..|+.++.-. 
T Consensus 73 ~~~g~~l~~~~~~~~~~e~~ltiD~~~y~~~~VddiD~~--q~~~D~~~~~~~~~g~aLA~~~D~~i~~~l~~~~~~~~~ 150 (347) T protein:vir:33 73 LKPGENLDDKRKDIKHTEKVIHIDGLLTADVLIYDIEDA--MNHYDVRAEYTAQLGESLAMAADGAVLAELAGLVNLPDG 150 (347) T ss_pred ecCCCCCCCCCCCCccceEEEEechhhhhhHHHhhHHHH--hcCCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcc Confidence 4455554321 1234455445433 3322 33322222 345678999999999999999998876210 Q ss_pred ----------ccccc--h-hh----------hhhHHHHHHHHHHhhhhhccC--CCEEEEcHHHHHHHHhhhc-ccCcee Q lcl|Aclame:pro 244 ----------KAAPK--K-PT----------IAKFDDVITMINTAVDPAIIA--TSSLLTNQSGLNKLALVKT-AEGKYL 297 (408) Q Consensus 244 ----------g~~~~--~-~~----------~~~~d~i~~~~~~~l~~~~~~--~a~~~~n~~~~~~l~~lkd-~~G~~~ 297 (408) +++.+ . ++ ...++.++++. ..|+....+ +..++++|..|..|.+-.. .+..|. T Consensus 151 ~~~~~~~~~~~~~~~~~~~~tg~~~d~~~~a~~i~~~i~~a~-~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~~~d~~ 229 (347) T protein:vir:33 151 SNENIEGLGKPTVLTLVKPTTGSLTDPVELGKAIIAQLTIAR-ASLTKNYVPAADRTFYTTPDNYSAILAALMPNAANYQ 229 (347) T ss_pred cccccccccccccccccccccccccchhhhHHHHHHHHHHHH-HHHhhcCCCccCcEEEeCHHHHHHHhccccccccccc Confidence 00000 0 00 01133344333 334444432 5578899999998865322 223343 Q ss_pred eccccccCCcccccccceEeeccccccccccC---------cceE--------EEEehhcc---------eEeeeccceE Q lcl|Aclame:pro 298 LEPDPTKPNSYLIKGKQVIVVADRWLPNTGST---------VYPL--------YYGDMSQA---------ITLFDRENMS 351 (408) Q Consensus 298 ~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~---------~~~~--------~~gd~~~~---------~~~~~~~~~~ 351 (408) -...+..+.-.+++|++|+.+.+ +|..... .... +-++|+.. 
+..+.-.+++ T Consensus 230 ~~~~~~~G~V~~i~G~~V~~Sn~--lp~~~~~~~~~~~~ag~~~~~~~~~~~~~~~a~~~~~gl~~h~~A~g~v~~~~~~ 307 (347) T protein:vir:33 230 ALLDPERGTIRNVMGFEVVEVPH--LTAGGAGDTREDAPADQKHAFPATSSTTVKVALDNVVGLFQHRSAVGTVKLKDLA 307 (347) T ss_pred cccccccceeEEEeceeEEEecc--cccCccccccccccccccccccCCcccceeccccceeeeeecchhheeeeeecee Confidence 22234445557899999998654 5543221 1111 11222111 1011112223 Q ss_pred EEEeccchhhhhhceeeEEEEeeeCcEEecccceEEEEeecccc Q lcl|Aclame:pro 352 LLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFSAIAD 395 (408) Q Consensus 352 i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~ 395 (408) ++...... +....+++...+|.++++|++.+.++++.+.+ T Consensus 308 ~e~~r~~~----~~~d~i~~~~~~G~~vlrP~~av~i~~~~~~~ 347 (347) T protein:vir:33 308 LERARRAN----YQADQIIAKYAMGHGGLRPEAAGAIVLPKVSE 347 (347) T ss_pred eeeccchh----hhhHhhhhhhhcCCceecccceEEEecCCCCC Confidence 43332211 22234566677899999999999999999998 No 136 >protein:vir:94622 Length: 341 # NCBI annotation: PfWMP4_37 # Family: family:all:2203 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762667;genbank:gi:115304375;genbank:GeneID:5142322 Probab=98.69 E-value=1.6e-09 Score=68.86 Aligned_cols=281 Identities=11% Similarity=0.043 Sum_probs=143.8 Q ss_pred hhhHHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeecccCcc-ceEEeeccCCccccchhcccccc Q lcl|Aclame:pro 105 MAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNG-SRVYEKWTDVTPLTVMDAEDGKI 183 (408) Q Consensus 105 ~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g-~~~~~~~~~~~~~~~~~~E~~~~ 183 (408) +...+..-..+. +.+.-...||+.|+.+|++.+++...+.++++..+.....| ++.+|... 
.+......++..+ T Consensus 1 ~~~~~~~~~~~~---~t~~v~~fipei~s~~i~~~l~~~~v~~~~~~d~~~~~~~Gdtv~ip~~g--~~~~~d~~~~~~i 75 (341) T protein:vir:94 1 MALGNTITGPSI---NTQRGQQFIPEQWLSEVQMFRKAKMLDTSVVKTWGAQVKKGDTFHVPRIS--ELGVEDKATDVPV 75 (341) T ss_pred Ccchhhhccccc---cchhHHHHHHHHHHHHHHHHHHhhcchhhccccccccccCCceEEEeccC--cceeeeecCCCcc Confidence 111111111111 11222345899999999999999888888876554333333 45566543 3333445566665 Q ss_pred cccccccceeeeechhee-eeehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhcccccc-----------c--h Q lcl|Aclame:pro 184 PDLDNPQLTIIKYLIKRY-AGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVMKAAP-----------K--K 249 (408) Q Consensus 184 ~~~~~~~f~~v~~~~~~~-~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~~g~g~~~-----------~--~ 249 (408) + ....+-..++++..+. +....|++.-..++..++.+.+.++..+++++..|..++.-....+ . . T Consensus 76 ~-~~~~~~~~~~itiD~~~~~~~~i~d~d~~~~~~d~~~~~~~~~~~aLA~~~D~~i~~~~a~~~~~~~~~~~~~~~~~~ 154 (341) T protein:vir:94 76 G-VQPVNDTDFVITVDTDRTTAVALDDLLEIQASYDLRAPYLEAMGYALAKDMTGSILGLRAAVQNTASQNVFSSSNGAI 154 (341) T ss_pred c-cccccCceEEEEEeeeeecceeechHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCccccCccccc Confidence 4 2334445666666333 3445666655555677899999999999999999988764321110 0 0 Q ss_pred ---hhhhhHHHHHHHHHHhhhhhcc--CCCEEEEcHHHHHHHHhhhcccC-ceeeccccccCCcccccccceEeeccccc Q lcl|Aclame:pro 250 ---PTIAKFDDVITMINTAVDPAII--ATSSLLTNQSGLNKLALVKTAEG-KYLLEPDPTKPNSYLIKGKQVIVVADRWL 323 (408) Q Consensus 250 ---~~~~~~d~i~~~~~~~l~~~~~--~~a~~~~n~~~~~~l~~lkd~~G-~~~~~~~~~~~~~~~l~G~pv~~~~~~~~ 323 (408) .....++.++.+ ...|+.... .+..++++|..+..|.+...-.. 
.+.-...+..+.-++|+|++|+.+.+ + T Consensus 155 t~~~~~~~~~~i~~a-~~~Lde~~VP~~gR~lvv~P~~~~~Ll~~~~~~~~~~~g~~~l~~G~ig~i~G~~V~~Sn~--l 231 (341) T protein:vir:94 155 TGNGQAFSFAVFLAA-RRLLLEADVPEEKIVLLISPGQESALFTIPQFISKDFINNAPIAQGQIGSLMGVRVIRTSL--I 231 (341) T ss_pred cCchhhhhHHHHHHH-HHHHhhcCCCccCCEEEeCHHHHHHHhhchhhhhhhccccchhheeeeeeEeceEEEEecc--c Confidence 011123444443 344555543 34567889999999865321111 12212234555567899999998654 4 Q ss_pred cccccCcce-----E-------------EE----EehhcceEee--eccce-EEE-Eec-----------cchhhhh--h Q lcl|Aclame:pro 324 PNTGSTVYP-----L-------------YY----GDMSQAITLF--DRENM-SLL-PTN-----------IGAGAFE--T 364 (408) Q Consensus 324 ~~~~~~~~~-----~-------------~~----gd~~~~~~~~--~~~~~-~i~-~~~-----------~~~~~f~--~ 364 (408) |........ + .+ +|+.. +..+ .+..+ .++ +++ .....|. + T Consensus 232 p~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~-~~gl~~~~~av~~~k~~~~~~~~~~~~~~~~~~~~~~~~~ 310 (341) T protein:vir:94 232 GNNSATGWRNGAPTIAPAEATPGFTGSRYLPKQDSFTSL-PATFTGNSRPVHTAVMCHMDWAAAVVSKAPRVTQSFENRE 310 (341) T ss_pred cccccccccccccceeccccccccccccccccccccccc-EEEEEEecccccceeeecchhhhccccccccccccchhhh Confidence 432211100 0 00 01111 1110 11110 110 000 0001111 1 Q ss_pred ceeeEEEEeeeCcEEecccceEEEEeeccccCC Q lcl|Aclame:pro 365 DTTKIRVIDRFDVKATDSEALVAGSFSAIADQV 397 (408) Q Consensus 365 ~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~ 397 (408) -...+++..-+|+++++|++.+.| ...++++ T Consensus 311 ~~~~i~~~~~~G~~~lrp~~~v~~--~~~~~~~ 341 (341) T protein:vir:94 311 QVWLMVGRQAYGARLYRPLHAVNI--HTTGDTV 341 (341) T ss_pred hhhhhhhhhhhcccccCcceeEEE--ecCcCCC Confidence 223344555689999999996655 4444444 No 137 >protein:vir:1541 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052109;swissprot:trembl:q9t107;genbank:gi:9634035;uniprot:Q9T107;genbank:GeneID:1262383 Probab=98.68 E-value=4.7e-09 Score=66.25 Aligned_cols=282 Identities=10% Similarity=0.076 Sum_probs=141.7 Q ss_pred 
HHHhhcchhhHHHHHHHHhhccccccCc--eecchhhhhhhhhhhhhhhhhhhhhceeecccCccceEEeeccCCccccc Q lcl|Aclame:pro 98 VNMVRNPMAFMNTVSSKTETSGSDSAAG--LTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTV 175 (408) Q Consensus 98 ~~~~~~~~~~~~~~~~~a~~~~t~~~gg--~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~~~ 175 (408) +.-...+.... .+. .-..+++- .+-=+.|+.++....+..+.++++.++.++.++ .++.+++....+. . T Consensus 1 ma~~~~~~~~~----t~~--~~~~~~~~~~a~~ie~f~g~V~~~f~~~s~~~~~~~~~~~~~G-~sv~i~~ig~~t~--~ 71 (347) T protein:vir:15 1 MANIQGGQQIG----TNQ--GKGQSAADKLALFLKVFGGEVLTAFARTSVTMPRHMLRSIASG-KSAQFPVIGRTKA--A 71 (347) T ss_pred CCccccCCccc----ccc--ccCCCcchHHHHHHHHHHHHHHHHHHHhhhhhhcccccccccc-ceeEeeeccceee--e Confidence 00000010000 000 00001110 011256788899988898989999888877653 3666666654433 3 Q ss_pred hhccccccccc-ccccceeeeechh--eeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhccc-------- Q lcl|Aclame:pro 176 MDAEDGKIPDL-DNPQLTIIKYLIK--RYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVMK-------- 244 (408) Q Consensus 176 ~~~E~~~~~~~-~~~~f~~v~~~~~--~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~~g~g-------- 244 (408) ....|..++.. ...+.++.+|... ++..+ .|-+-=-.++..|+.+.+.++.++++++..|+.++.-.. T Consensus 72 ~~~~g~~l~~~~~~~~~~e~~ltID~~~~~~~-~VddlD~~q~~~D~~~~~~~~~g~aLA~~~D~~i~~~l~~~~~~~~~ 150 (347) T protein:vir:15 72 YLKPGENLDDKRKDIKHTEKVIHIDGLLTADV-LIYDIEDAMNHYDVRAEYTAQLGESLAMAADGAVLAELAGLVNLPDA 150 (347) T ss_pred eeccCCCCCCCCCCCccceEEEEechhhhhhH-HhhhHHHHhcCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccc Confidence 34455544321 1234556555433 33221 222211223556899999999999999999988863211 Q ss_pred -----------cccc---hh------hhhhHHHHHHHH---HHhhhhhcc--CCCEEEEcHHHHHHHHhhhccc-Cceee Q lcl|Aclame:pro 245 -----------AAPK---KP------TIAKFDDVITMI---NTAVDPAII--ATSSLLTNQSGLNKLALVKTAE-GKYLL 298 (408) Q Consensus 245 -----------~~~~---~~------~~~~~d~i~~~~---~~~l~~~~~--~~a~~~~n~~~~~~l~~lkd~~-G~~~~ 298 (408) .... .. .....+.+.+++ ...|+...- .+-.++++|..|..|.+-.+-. 
..|.- T Consensus 151 ~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~d~~~~a~~~Lde~~VP~~gR~~vv~P~~y~~LL~~~~~~~~d~~~ 230 (347) T protein:vir:15 151 SNENIEGLGKPTVLTLVKPTTGDLTDPVELGKAIIAQLTIARASLTKNYVPAADRTFYTTPDNYSAILAALMPNAANYQA 230 (347) T ss_pred ccccccccCccccccccccccccchhhhhHHHHHHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHhcccccccccccc Confidence 0000 00 001122333332 123444333 3445677999999886543222 22221 Q ss_pred ccccccCCcccccccceEeecccccccccc---------CcceEEEEe--------hhc---------ceEeeeccceEE Q lcl|Aclame:pro 299 EPDPTKPNSYLIKGKQVIVVADRWLPNTGS---------TVYPLYYGD--------MSQ---------AITLFDRENMSL 352 (408) Q Consensus 299 ~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~---------~~~~~~~gd--------~~~---------~~~~~~~~~~~i 352 (408) ...+..|.-.+++|++|+.+.+ +|.... +....+-++ |+. ++..+.-.++++ T Consensus 231 ~~~~~~G~Vg~i~G~~V~~Sn~--lp~~~~t~~~~~~~~g~~~~~~~~~~~~~~~~f~~~~~l~~h~~A~g~v~~~~~~~ 308 (347) T protein:vir:15 231 LIDHERGTIRNVMGFEVVEVPH--LTAGGAGDTREDAPADQKHAFPATSSTTVKVALDNVVGLFQHRSAVGTVKLKDLAL 308 (347) T ss_pred cccccceEEEEEeceEEEeccc--ccccccccccccccccccccccccccceeeeccccceeeeeccceeeeeEeeceee Confidence 2234445457899999998654 553221 111111111 111 111111122334 Q ss_pred EEeccchhhhhhceeeEEEEeeeCcEEecccceEEEEeecccc Q lcl|Aclame:pro 353 LPTNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFSAIAD 395 (408) Q Consensus 353 ~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~ 395 (408) +..... 
.+....+++...+|.++++|++.+.++++.+.+ T Consensus 309 e~~~~~----~~~~d~i~~~~~~G~~vlrP~~av~~~~~~~~~ 347 (347) T protein:vir:15 309 ERARRA----NYQADQIIAKYAMGHGGLRPEAAGAIVLPKVSE 347 (347) T ss_pred eecccc----hhhhhhhehhhhcCCceeccccEEEEecCCCCC Confidence 433221 122345667777899999999999999999988 No 138 >protein:vir:80180 Length: 381 # NCBI annotation: capsid protein # Family: family:all:2203 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285797;genbank:gi:148747831;genbank:GeneID:5220456 Probab=98.68 E-value=2e-09 Score=68.30 Aligned_cols=290 Identities=10% Similarity=0.033 Sum_probs=146.1 Q ss_pred HHHhhcchhhHHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeecccCcc-ceEEeeccCCccccch Q lcl|Aclame:pro 98 VNMVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNG-SRVYEKWTDVTPLTVM 176 (408) Q Consensus 98 ~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g-~~~~~~~~~~~~~~~~ 176 (408) +..++.+.... ..++ ..++.-..||+.|..++++.+++.+.+.++++.....+..| ++.+|... .+.... T Consensus 1 ~~~~~~~~~~~----~~~~---~~t~~~~fiPev~s~~v~~~l~~~lv~~~l~~~~~~~~~~GdTV~ip~~g--~~~a~d 71 (381) T protein:vir:80 1 MATIQGTGGYK----GSAV---DLSNVQVFIPEVWSSEVRMFRDQKFAALEATKKIPFEGKKGDLIHIPNIS--RAAVYD 71 (381) T ss_pred Cceeccccccc----Cccc---chhhHHhhhhHHHHHHHHHHHHHhhhhhhccccccceeecCceEEeeccC--cceeee Confidence 22222111100 0001 11112346899999999999999888888776544433333 44566543 344556 Q ss_pred hcccccccccccccceeeeechheee-eehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhccccc--------- Q lcl|Aclame:pro 177 DAEDGKIPDLDNPQLTIIKYLIKRYA-GIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVMKAA--------- 246 (408) Q Consensus 177 ~~E~~~~~~~~~~~f~~v~~~~~~~~-~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~~g~g~~--------- 246 (408) ..++..++- ...+...++++..+.. ....|++.-...+..++.+.+.+++..++++..|+.++...... 
T Consensus 72 ~~~g~~i~~-~~~~~~~~~itID~~~~~~~~Idd~D~~~~~~D~~~~~~~~~~~aLA~~~D~~i~~~~~~~~~~~~~~~~ 150 (381) T protein:vir:80 72 KQPQTPVNL-QARTDSEFTFTVTKYKESSFMIEDIVNTQASYTLRQYYTKEAGYALARDMDNFALAHRAVINAFPSQRIY 150 (381) T ss_pred ecCCCcccc-cccCCceEEEEEeeeeecceeechHHHHhhccChHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccc Confidence 777777643 4445566666664432 33566665555566789999999999999999999886432100 Q ss_pred ---------------cchhhhhhHHHHHHHHHHhhhhhccC--CCEEEEcHHHHHHHHhhhc-ccCceeeccccccCCcc Q lcl|Aclame:pro 247 ---------------PKKPTIAKFDDVITMINTAVDPAIIA--TSSLLTNQSGLNKLALVKT-AEGKYLLEPDPTKPNSY 308 (408) Q Consensus 247 ---------------~~~~~~~~~d~i~~~~~~~l~~~~~~--~a~~~~n~~~~~~l~~lkd-~~G~~~~~~~~~~~~~~ 308 (408) +.......++.|+++. ..|+....+ +-.++++|..+..|.+... .+-.+.-...+.++..+ T Consensus 151 t~~~~i~~~~~~~~~t~~~~~~t~~~i~~a~-~~Lde~~VP~egR~lvv~P~~~~~Ll~~~~~~~ad~~~~~~l~~G~Ig 229 (381) T protein:vir:80 151 SYDTTLGDGTVNAHLTGTPAPLTYAALLLAK-QKLDEADVPQEGRIVMVSPAQYIDLLSINQFISVDFSQVKPVTSGVVG 229 (381) T ss_pred cccccccccccccccccchhhHHHHHHHHHH-HHHhhcCCCcCCcEEEeCHHHHHHHhhchhhhhhhhccchhhhceeee Confidence 0001112345555554 345555443 4478999999998865421 11123333446666678 Q ss_pred cccccceEeeccccccccccCcceEEEEehhcceEeeeccceEEEEeccchhhhhhceeeEEEEeeeCcEEecc-cceEE Q lcl|Aclame:pro 309 LIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDS-EALVA 387 (408) Q Consensus 309 ~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~-~a~~~ 387 (408) +|+|++|+.+.+ +|........+..|-..... ..+ .-..+ ...|..+-..++....+|.++... ..+-. 
T Consensus 230 ~i~G~~Vv~Sn~--lp~~~~t~~~~~agap~~~~-----~~~--~~~~~-~g~~s~~a~av~~~k~yd~~~~~~~~~~~~ 299 (381) T protein:vir:80 230 TILGMEVIVTTQ--IGINSLTGYVNGQGAPTQPT-----PGV--LGSPY-LPDQAGTANVVNTGSASDLAVSLSYFGLPV 299 (381) T ss_pred EEcceEEEeecc--cccccccceeeecccccccc-----ccc--ccccc-ccccccceeeeeeeeeeceeeeeeecccee Confidence 999999998643 56543333333222211100 000 00011 111223334555555566655332 22222 Q ss_pred EEeeccccCCCCccCCCc-------ccC Q lcl|Aclame:pro 388 GSFSAIADQVGNFKTTTS-------TAV 408 (408) Q Consensus 388 l~~~~~~~~~~~~~~~~~-------~~~ 408 (408) +...-.+...+..+..+= .+| T Consensus 300 ~~g~~~~~~~~~~~~~~~~~~~~~~~~~ 327 (381) T protein:vir:80 300 FSGAGATAADGGQTLGSFGGANRWATAV 327 (381) T ss_pred eecceeeecCCCceeeeehhhhhhhhhc Confidence 221111111111111111 011 No 139 >protein:vir:10450 Length: 344 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848297;genbank:gi:30387487;genbank:GeneID:1733971 Probab=98.68 E-value=3.6e-09 Score=66.88 Aligned_cols=272 Identities=14% Similarity=0.091 Sum_probs=141.5 Q ss_pred hhhHHHHHHHHhhccccccC---------ceecchhhhhhhhhhhhhhhhhhhhhceeecccCccceEEeeccCCccccc Q lcl|Aclame:pro 105 MAFMNTVSSKTETSGSDSAA---------GLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTV 175 (408) Q Consensus 105 ~~~~~~~~~~a~~~~t~~~g---------g~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~~~ 175 (408) +...... ...++...+ ...| +.|+.++.......+.++++.++.++.++ .++.+++....+ .. 
T Consensus 1 ma~~~~~----~~~n~~~~~~~~~~~~~~al~i-e~~~geV~~~f~~~s~~~~~~~~r~i~~g-~s~~~~~iG~~~--~~ 72 (344) T protein:vir:10 1 MANMTGG----QQLGTNQGKDVMAAGDKLALFL-KVFGGEVLTAFARTSVTTSRHMVRSISSG-KSAQFPVLGRTQ--AA 72 (344) T ss_pred Ccccccc----ccCCcccCCccCCccchhHHHH-HHHHHHHHHHHHHHhhhcccceeeeeccc-ceEEEEeeceeE--EE Confidence 1000000 000111111 1123 77899999999999999999999988754 366666654333 34 Q ss_pred hhcccccccccc-cccceeeeechhe--eee-ehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhcc-------- Q lcl|Aclame:pro 176 MDAEDGKIPDLD-NPQLTIIKYLIKR--YAG-IITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVM-------- 243 (408) Q Consensus 176 ~~~E~~~~~~~~-~~~f~~v~~~~~~--~~~-~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~~g~-------- 243 (408) ....|+....+. .+.-++++|...+ +.. .+.==++. ++..|+.+.+.++.++++++..|+.++.-. T Consensus 73 ~~~~G~~l~~t~~~~~~~e~~l~ID~~~y~~~~VdDiD~~--q~~~D~r~~~~~~~G~aLA~~~D~~i~~~la~~a~~~~ 150 (344) T protein:vir:10 73 YLAPGENLDDIRKDIKHTEKVITIDGLLTADVLIYDIEDA--MNHYDVRSEYTSQLGESLAMAADGAVLAEIAGLCNVES 150 (344) T ss_pred eeecCCCCCCCCCCcccceEEEEEcchhhhhhhhhhHHHH--hcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccc Confidence 455666654332 2344554554443 332 22211222 355689999999999999999998775211 Q ss_pred ---------cccc------ch----hh----hhhHHHHHHHHHHhhhhhccC--CCEEEEcHHHHHHHHhhhcc-cCcee Q lcl|Aclame:pro 244 ---------KAAP------KK----PT----IAKFDDVITMINTAVDPAIIA--TSSLLTNQSGLNKLALVKTA-EGKYL 297 (408) Q Consensus 244 ---------g~~~------~~----~~----~~~~d~i~~~~~~~l~~~~~~--~a~~~~n~~~~~~l~~lkd~-~G~~~ 297 (408) +.+. .. .. ...++.+.++. ..|+....+ +-++|++|..|..|..-+.- ++.|. 
T Consensus 151 ~~~~~~~g~~~~~~~~~~~~~~~~t~~~~~~~~~~~~i~~a~-~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~~~~~~ 229 (344) T protein:vir:10 151 QYNENITGLGTATVIETTQDKTTLTDQVALGKEIIAALTKAR-AALTKNYVPSSDRVFYCDPDSYSAILAALMPNAANYA 229 (344) T ss_pred ccccccccccccceeecccccccccchhhhHHHHHHHHHHHH-HHHhhcCCCccCCEEEeChHHHHHHhhcccccccccc Confidence 0000 00 00 11133344333 335444433 34567899999887543211 12222 Q ss_pred eccccccCCcccccccceEeeccccccccccC----------------cceEEEEehhcce---------EeeeccceEE Q lcl|Aclame:pro 298 LEPDPTKPNSYLIKGKQVIVVADRWLPNTGST----------------VYPLYYGDMSQAI---------TLFDRENMSL 352 (408) Q Consensus 298 ~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~----------------~~~~~~gd~~~~~---------~~~~~~~~~i 352 (408) -......|.-.+++|+||+.+.+ +|..... ....+.++|++-. ....-.++++ T Consensus 230 ~~~~~~~G~V~~v~G~~V~~Sn~--lp~~~~~~~~~~~tg~~~~~~~~~~~~~~~~~s~~~~l~~h~~A~~~v~~~~~~~ 307 (344) T protein:vir:10 230 ALIDPEKGSIRNVMGFEVVEVPH--LTAGGAGTSREGTTGQKHAFPATKSGNDKVAKDNVIGLFMHRSAVGTVKLRDLAL 307 (344) T ss_pred cccceeeeEEEEEeceEEEeccc--cccccCCcccccccCccccccCCcccceeeecceeEEEeechhhhhhhhhcccee Confidence 22223334446799999998654 4422110 1111223443311 1111123344 Q ss_pred EEeccchhhhhhceeeEEEEeeeCcEEecccceEEEEeecc Q lcl|Aclame:pro 353 LPTNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFSAI 393 (408) Q Consensus 353 ~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~ 393 (408) +..... ..|. 
..+++..-+|.++++|++.+.+++++- T Consensus 308 e~~r~~-~~~~---d~i~g~~~~G~~vlRPe~a~~v~~~~~ 344 (344) T protein:vir:10 308 ERARRA-NFQA---DQIIAKYAMGHGGLRPEAAGAVVFKTK 344 (344) T ss_pred ecccch-hHHH---HHHHHHhhcccceecccceEEEEeecC Confidence 433221 1222 345667778999999999988888776 No 140 >protein:vir:94711 Length: 347 # NCBI annotation: capsid # Family: family:all:975 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338120;genbank:gi:77118198;genbank:GeneID:3707734 Probab=98.65 E-value=1.4e-09 Score=69.20 Aligned_cols=277 Identities=13% Similarity=0.075 Sum_probs=138.7 Q ss_pred hhcchhhHHHHHHHHhhcccc-ccCc--eecchhhhhhhhhhhhhhhhhhhhhceeecccCccceEEeeccCCccccchh Q lcl|Aclame:pro 101 VRNPMAFMNTVSSKTETSGSD-SAAG--LTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMD 177 (408) Q Consensus 101 ~~~~~~~~~~~~~~a~~~~t~-~~gg--~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~ 177 (408) +.... ..... ...+.. +++- .+-=+.|..+++......+.++++.++.++.++ .++.+++....+. ... T Consensus 1 m~~~~--~~~~~---t~~g~~~~~~d~~al~ik~f~~eV~~~f~~~s~~~~~~~~r~i~~G-~sv~i~~iG~~tv--~~~ 72 (347) T protein:vir:94 1 MANVP--GQKIG---TDQGKGKSSSDALALFLKVFAGEVLTAFTRRSVTADKHIVRTIQNG-KSAQFPVMGRTSG--VYL 72 (347) T ss_pred CCCCC--ccccc---cccccCCccccHHHHHHHHHhHHHHHHHHHHHhhhccccccccccc-ceEEEecccceee--eee Confidence 00000 00000 000111 1111 112257888888888888888999998887653 3566666544433 334 Q ss_pred cccccccccc-cccceeeeec--hheeee-ehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhcc---------- Q lcl|Aclame:pro 178 AEDGKIPDLD-NPQLTIIKYL--IKRYAG-IITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVM---------- 243 (408) Q Consensus 178 ~E~~~~~~~~-~~~f~~v~~~--~~~~~~-~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~~g~---------- 243 (408) ..|+..+..- ..+-.+++|. -.++.. ++.==++. ++..++.+.+.++.++++++..|+.++.-. 
T Consensus 73 t~G~~l~~~~~~~~~~e~~itID~~~~~~~~VddiD~~--q~~~D~~~~~~~~~g~aLa~~~D~~i~~~~~~~aa~~~~~ 150 (347) T protein:vir:94 73 APGERLSDKRKGIKHTEKVITIDGLLTADVMIFDIEDA--MNHYDVAGEYSNQLGEALAIAADGAVLAEMAILCNLPAAS 150 (347) T ss_pred cCCCCcCCCCCCCCcceEEEEecchhhhhHHhhhHHHH--hcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccc Confidence 4455442210 1223453443 333322 22211222 245678999999999999999998775311 Q ss_pred -------ccccch---------h----hhhhHHHHHHHHHHhhhhhccC--CCEEEEcHHHHHHHHhhhc-ccCceeecc Q lcl|Aclame:pro 244 -------KAAPKK---------P----TIAKFDDVITMINTAVDPAIIA--TSSLLTNQSGLNKLALVKT-AEGKYLLEP 300 (408) Q Consensus 244 -------g~~~~~---------~----~~~~~d~i~~~~~~~l~~~~~~--~a~~~~n~~~~~~l~~lkd-~~G~~~~~~ 300 (408) +.+... . ....++.|.++. ..|+....+ +.++|++|..|..|..-++ .+..|.-.. T Consensus 151 ~~~~~g~~~~s~~~~~~~~~~~~~~~~~~~~~~~i~~a~-~~Lde~~VP~~~R~~vv~P~~~~~Ll~~~~~~~~~~~~~~ 229 (347) T protein:vir:94 151 NENIAGLGTASVLEVGKKADLDTPAKLGEAIIGQLTIAR-AKLTSNYVPAGDRYFYTTPDNYSAILAALMPNAANYAALI 229 (347) T ss_pred ccccCCCcccceeeccccccccchhhhHHHHHHHHHHHH-HHHhhcCCCCCCcEEEeCHHHHHHHhccchhhhhhccccc Confidence 111000 0 011123333332 335544433 4467899999987743322 122233333 Q ss_pred ccccCCcccccccceEeecccccccccc-------------CcceEEEEe--------hhcceEe-eec--------cce Q lcl|Aclame:pro 301 DPTKPNSYLIKGKQVIVVADRWLPNTGS-------------TVYPLYYGD--------MSQAITL-FDR--------ENM 350 (408) Q Consensus 301 ~~~~~~~~~l~G~pv~~~~~~~~~~~~~-------------~~~~~~~gd--------~~~~~~~-~~~--------~~~ 350 (408) .+..|.-.+++|++|+.+.+ +|.... +...++-+| |+..... 
+.+ .++ T Consensus 230 ~~~~G~Vg~i~G~~V~~Sn~--lp~~~~t~~~~~~~~~~~aG~~~~~~~~~~~~~~~~~~~~~~l~~h~~A~~~v~~~~~ 307 (347) T protein:vir:94 230 DPETGNIRNVMGFVVVEVPH--LVQGGAGETRGDDGITIASGQKHAFPATASSDVKVTMDNVVGLFSHRSAVGTVKLRDL 307 (347) T ss_pred cccccceEEEeceEEEecCc--ccccccccccccCcceecCcccccccccchhhhcccccceeEEEeehhhhhhhhcccc Confidence 44555557899999998654 443211 112222223 3221111 111 122 Q ss_pred EEEEeccchhhhhhceeeEEEEeeeCcEEecccceEEEEeeccc Q lcl|Aclame:pro 351 SLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFSAIA 394 (408) Q Consensus 351 ~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~ 394 (408) +++..... .+-...+++...+|.++++|++.+.++++++- T Consensus 308 ~~e~~r~~----~~~~d~i~~~~~~G~~~~rP~~a~~~~~~~A~ 347 (347) T protein:vir:94 308 ALERDRDV----DAQGDLIVGKYAMGHGGLRPEAAGALVFSPAE 347 (347) T ss_pred cccchhch----hhHHHHhhhhhhhcCcccccceeEEEEecCCC Confidence 33322111 12233577888899999999999999887443 No 141 >protein:vir:95318 Length: 328 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512264;genbank:gi:89152431;genbank:GeneID:3952987 Probab=98.65 E-value=1.4e-09 Score=69.12 Aligned_cols=216 Identities=8% Similarity=-0.034 Sum_probs=137.6 Q ss_pred hhhHHHHHHHHhhcccccc-CceecchhhhhhhhhhhhhhhhhhhhhceeecccCccceEEeeccCCccccchhcccccc Q lcl|Aclame:pro 105 MAFMNTVSSKTETSGSDSA-AGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKI 183 (408) Q Consensus 105 ~~~~~~~~~~a~~~~t~~~-gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~E~~~~ 183 (408) +.. ..+...|-.. ...+-|......||+.+.+.++|+..+.......+++. .+... 
..-|.+.|..=++.+ T Consensus 1 m~~------~~~~~~TL~e~Akr~~~d~~~~~VIE~l~~~n~IL~~lpf~e~n~gt~~-~~~v~-~~LP~~~fR~lN~g~ 72 (328) T protein:vir:95 1 MAV------KGLTALTLADWGKRVDPNGKVDKIIELLGQTNPILQDMPFVEGNLPTGH-RTTIR-SGLPSATWRLLNYGV 72 (328) T ss_pred CCc------cccccccHHHHHhhhCcchhHHHHHHHHhccchhHhhcceeecccCCcc-eeeEe-eccCCceeeecCCcc Confidence 000 0000011111 12234556777899999999999999998887655443 22222 345788899888888 Q ss_pred cccccccceeeeechheeeeehHHHHHHHhcch--HHHHHHHHHHHHHHHHHHHHHHHhhccccccch------------ Q lcl|Aclame:pro 184 PDLDNPQLTIIKYLIKRYAGIITATNTSLKDTA--ENILAWLSSWIAKKVVVTRNQAIIEVMKAAPKK------------ 249 (408) Q Consensus 184 ~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~--~~~~~~v~~~l~~~~~~~~~~~~~~g~g~~~~~------------ 249 (408) +. +.+++.+++-..+-+.+.+.|.+.+.+... .++...-.....+++.+.+...||+|+.+..+. T Consensus 73 ~~-s~~tt~q~t~~l~ilgg~~eVDr~la~~~Gn~~~~ra~q~~~~~ka~~~~~~~~~iyGdsa~~p~~F~GL~~R~~~~ 151 (328) T protein:vir:95 73 QP-SKSTTVQVTDSVGMLETYAEVDKSLADLNGNTAEFRLSEDRAFIEAMNQQMAQTLFYGDSSVNPQQFMGLSSRYSSL 151 (328) T ss_pred Cc-ccceeEEEEEEEEEEecceeechHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCccCChhhhcchhhhcCcc Confidence 75 668999999999999999999998887653 234444556788999999999999985432100 Q ss_pred -------------------------------------------------------------------------------- Q lcl|Aclame:pro 250 -------------------------------------------------------------------------------- 249 (408) Q Consensus 250 -------------------------------------------------------------------------------- 249 (408) T Consensus 152 s~~~a~qiidaGgtg~~~TSi~~v~~g~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~g~~y~~y~~~~~w~~Gl~i~d~r 231 (328) T protein:vir:95 152 SAGNAQNIIDAGGTGTDNTSIWLVVWGENTVHGIFPKGKKAGIQMEDKGQVTLEDANGGKYEGYRTHYKWDNGLALRDWR 231 (328) T ss_pred ccccccceeecccCCCCceEEEEEEEcCCeEEEecccccccCceeeecCceeeecCCCCeeeEEEEEEEeeeeeEEcCcc Confidence Q ss_pred -------------hhhhhHHHHHHHHH---HhhhhhccCCCEEEEcHHHHHHHHhh-hcccCceeeccccccCCcccccc Q lcl|Aclame:pro 250 -------------PTIAKFDDVITMIN---TAVDPAIIATSSLLTNQSGLNKLALV-KTAEGKYLLEPDPTKPNSYLIKG 312 
(408) Q Consensus 250 -------------~~~~~~d~i~~~~~---~~l~~~~~~~a~~~~n~~~~~~l~~l-kd~~G~~~~~~~~~~~~~~~l~G 312 (408) .......+++++|. ..++.....+.+|+||+.....|++. .+....++-..+..+...-.++| T Consensus 232 ~vvrI~NId~~~l~~~~~~~~l~~lm~~a~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~n~~~~~~~~~g~~~t~~~g 311 (328) T protein:vir:95 232 YVVRIANIDVSNLSEPSSAANIAKLMVKALHRIPNRGMGRPVFYMNRTVGQALDLQSLEKTSLAISVKETEGEWWTSFRG 311 (328) T ss_pred cEEEEecCcccccccccChhhHHHHHHHHHHHhccCCCCcceeehhHHHHHHHHHHHhcCcceeeeeeccCCcceeEECC Confidence 00001122344433 23555667788999999999999875 44444444444555555667999 Q ss_pred cceEeeccccccccccCcceEE Q lcl|Aclame:pro 313 KQVIVVADRWLPNTGSTVYPLY 334 (408) Q Consensus 313 ~pv~~~~~~~~~~~~~~~~~~~ 334 (408) .||..+|. .+ .+...++ T Consensus 312 ipir~~da-i~----~tE~~vv 328 (328) T protein:vir:95 312 VPIRETDA-LL----ETEARVV 328 (328) T ss_pred eEEEEEee-ee----cCccccC Confidence 99999863 11 1222222 No 142 >protein:vir:78739 Length: 332 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285448;genbank:gi:148724482;genbank:GeneID:5220210 Probab=98.61 E-value=2e-09 Score=68.25 Aligned_cols=278 Identities=10% Similarity=0.009 Sum_probs=138.4 Q ss_pred HHHhhcchhhHHHHHHHHhhccccccCc----eecchhhhhhhhhhhhhhhhhhhhhceeecccCccceEEeeccCCccc Q lcl|Aclame:pro 98 VNMVRNPMAFMNTVSSKTETSGSDSAAG----LTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPL 173 (408) Q Consensus 98 ~~~~~~~~~~~~~~~~~a~~~~t~~~gg----~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~ 173 (408) +-++..- .... -+......+++. ..| ..|+.+++......+.++++.++.++.++ .++.+++....+.. 
T Consensus 1 ~~~~~~~---~~~~--~~~~~~~~~~~d~~~al~l-e~~~geV~~~f~~~s~~~~~~~~r~i~~G-~tv~i~~ig~~~~~ 73 (332) T protein:vir:78 1 MTTLSNF---SLPN--QANGGARNADYDVRYATAL-KLFSGEVFTAFNNASIFKGLVRSYDLRGG-KSKQFMFTGKLSAG 73 (332) T ss_pred Ccccccc---cCCc--cccCCccccccccchhhhh-hhhhhhHHHHHHHHhhhhhcccccccccc-ceEEEEeccceeEe Confidence 0000000 0000 000011111221 233 78899999999999999999998887743 46667766544433 Q ss_pred cchhcccccccccccccceeeeechh--eeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhcc----cccc Q lcl|Aclame:pro 174 TVMDAEDGKIPDLDNPQLTIIKYLIK--RYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVM----KAAP 247 (408) Q Consensus 174 ~~~~~E~~~~~~~~~~~f~~v~~~~~--~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~~g~----g~~~ 247 (408) ....|........++-++++|... ++..+ .|-+-=-.++..++.+.+.++.++++++..|+.++.-. .... T Consensus 74 --~~~~g~~l~~~~~~~~~~~~l~ID~~ky~~~-~VddiD~~q~~~dl~~~~~~~~g~aLA~~~D~~i~~~l~~aa~~~~ 150 (332) T protein:vir:78 74 --YHTPGTPIVGDAGIKANEKTLVMDDLLVSSQ-FVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEAS 150 (332) T ss_pred --eecCCCCCCCCCCCCCceEEEEEehhhhhHH-HHHhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccC Confidence 333344332111122234444443 33322 22221111355689999999999999999998776311 1100 Q ss_pred -------------chhhhh----hHHHHHHHHHHhhhhhccC--CCEEEEcHHHHHHHHhhhccc--Cc-eee-cccccc Q lcl|Aclame:pro 248 -------------KKPTIA----KFDDVITMINTAVDPAIIA--TSSLLTNQSGLNKLALVKTAE--GK-YLL-EPDPTK 304 (408) Q Consensus 248 -------------~~~~~~----~~d~i~~~~~~~l~~~~~~--~a~~~~n~~~~~~l~~lkd~~--G~-~~~-~~~~~~ 304 (408) +.++.+ .++.|+++. ..|+...-+ +-.++++|..|..|.+.+|.. 
.+ +.- ...+.+ T Consensus 151 ~~~~~~g~~~~~~~~~~~~~~~~~~~~i~~a~-~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~d~~~~n~~~~~~~~~~~~ 229 (332) T protein:vir:78 151 PVTGEPGGFHVNIGAGNTNDAQAIVDGFFEAA-AVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNS 229 (332) T ss_pred cccccccccccccCCccccCHHHHHHHHHHHH-HHHhhcCCCccCCEEEeCHHHHHHHHhhcCceeeeeeccccccceec Confidence 011112 234455544 345555443 334677999998886643321 00 000 011222 Q ss_pred CC-cccccccceEeecccccccccc---------CcceEEEEehhcceE-eeecc--------ceEEEEec--cchhhhh Q lcl|Aclame:pro 305 PN-SYLIKGKQVIVVADRWLPNTGS---------TVYPLYYGDMSQAIT-LFDRE--------NMSLLPTN--IGAGAFE 363 (408) Q Consensus 305 ~~-~~~l~G~pv~~~~~~~~~~~~~---------~~~~~~~gd~~~~~~-~~~~~--------~~~i~~~~--~~~~~f~ 363 (408) +. -.+++|++|+.+.+ +|...+ +....+-|||+.... ++-+. ++.++... .....| T Consensus 230 g~~i~~i~G~~V~~Sn~--lp~~~g~~~~~~~~~~~~n~~~~~~~~~~~~~~h~~a~~~v~~~~~~~~~t~~~~~~~~~- 306 (332) T protein:vir:78 230 GKGLYSIAGIRILKSNN--LAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQ- 306 (332) T ss_pred ceeeeEEeeeEEEecCc--cccCcccccccccccccccccccccccceEEeecccceeeeeeeccchhhhhcccchhhh- Confidence 22 36899999988654 554322 112234455544211 11222 22332211 111112 Q ss_pred hceeeEEEEeeeCcEEecccceEEEEee Q lcl|Aclame:pro 364 TDTTKIRVIDRFDVKATDSEALVAGSFS 391 (408) Q Consensus 364 ~~~~~~r~~~r~d~~v~~~~a~~~l~~~ 391 (408) ...+++...+|.++++|++.+.++-. 
T Consensus 307 --~d~i~~~~~~G~~v~rPe~~v~l~~a 332 (332) T protein:vir:78 307 --GDLIVGKLAMGCGSLRTSVAGSFQAA 332 (332) T ss_pred --HhhhhhhhhhcCceecccceEEEeeC Confidence 23456666799999999999988644 No 143 >protein:vir:6324 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877471;genbank:gi:33300843;uniprot:Q7Y2D3;genbank:GeneID:1482613 Probab=98.58 E-value=1.1e-08 Score=64.16 Aligned_cols=288 Identities=8% Similarity=0.055 Sum_probs=152.2 Q ss_pred hhhHHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeecccCccceEEeeccCCccccchhccccccc Q lcl|Aclame:pro 105 MAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIP 184 (408) Q Consensus 105 ~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~E~~~~~ 184 (408) |.... .-.|....++.++-...| ..|..++.......+.++++.++.++.++ .++.++..... ......-|.... T Consensus 1 ms~~~-~~tr~~~~~s~~d~al~l-e~f~geV~~af~~~s~~~~~~~~rti~~g-~s~~~~~iG~~--~~~~~~pG~~l~ 75 (335) T protein:vir:63 1 MSFLN-DLTRPNYAGKNADVDIHL-EEHLGIVDKHFAYTSKFAPLMNIRDLRGS-NVVRLDRLGNV--EAKGRRAGEELE 75 (335) T ss_pred CCCcc-cchhhhcccccchhheeh-hhhhhhHHHHHHhhhhhccccceeeeccc-eeEEEeeeeee--eeecccCCcCcC Confidence 11110 111222223333333334 78999999999999999999999998764 46667765433 334455555553 Q ss_pred ccccccceeeeechheeee-ehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHh----hcc--------------cc Q lcl|Aclame:pro 185 DLDNPQLTIIKYLIKRYAG-IITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAII----EVM--------------KA 245 (408) Q Consensus 185 ~~~~~~f~~v~~~~~~~~~-~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~----~g~--------------g~ 245 (408) .+.+..++.++....+-. ...|-+-=--++..|+.+.+.+++.+++++..|+.++ .+- |. 
T Consensus 76 -~~~~~~~k~~itVD~ll~a~~~I~dlDe~~~~yDvRse~s~e~G~aLA~~~D~~~~~~i~~aa~~~a~~~~~~~~~~G~ 154 (335) T protein:vir:63 76 -RSRVVNDKWNLTVDTLLYLRHQFDHQDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVDLEDAFSPGV 154 (335) T ss_pred -CCCccccceEEEecceeechhhhhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccCCCcCCCc Confidence 333455666666665432 1222221122356789999999999999999999875 111 10 Q ss_pred c-----cchhhhhhHHHHHHHH---HHhhhhhccC-----CCEEEEcHHHHHHHHhhhcccCc-eeec---cccccCCcc Q lcl|Aclame:pro 246 A-----PKKPTIAKFDDVITMI---NTAVDPAIIA-----TSSLLTNQSGLNKLALVKTAEGK-YLLE---PDPTKPNSY 308 (408) Q Consensus 246 ~-----~~~~~~~~~d~i~~~~---~~~l~~~~~~-----~a~~~~n~~~~~~l~~lkd~~G~-~~~~---~~~~~~~~~ 308 (408) . +......+.+.+..++ ...|...+.+ ..+.+++|..|..|..-+.--.+ |... .+...+... T Consensus 155 ~~~~~~tg~~~~~~~~~l~~a~~~a~~~L~e~dVP~~~~~dr~~vv~P~~y~~Ll~~~~l~n~~~~~s~~~~~~~~g~v~ 234 (335) T protein:vir:63 155 LEKLDLTGLTAKQAADKIVRMHRRVVETFIDRDLGDAVYSEGLTPMSPRVFSLLLEHDKLMNVEYQATGATNDYVKSRVA 234 (335) T ss_pred ceeeeeccCcccccHHHHHHHHHHHHHHHHhccCCCcccCceEEEeChHHHHHHhccccccccccccccccccccCceeE Confidence 0 0111112344443322 2334444433 35789999999988664221111 2111 122334456 Q ss_pred cccccceEeecccccccccc------CcceEEEEehhcceEeeeccce--EEEEeccchhhh---hhceeeEEEEeeeCc Q lcl|Aclame:pro 309 LIKGKQVIVVADRWLPNTGS------TVYPLYYGDMSQAITLFDRENM--SLLPTNIGAGAF---ETDTTKIRVIDRFDV 377 (408) Q Consensus 309 ~l~G~pv~~~~~~~~~~~~~------~~~~~~~gd~~~~~~~~~~~~~--~i~~~~~~~~~f---~~~~~~~r~~~r~d~ 377 (408) .++|+||+.+. .+|.... +....+=|||......+-.... +++.-+.....| .+-...+.+..-+|. 
T Consensus 235 ~v~Gv~V~~sn--~lP~~~~t~~~lg~a~n~~~~d~~~~~~~~~~~~Al~t~~~~~vt~e~~~~~~~~~~~i~~~~a~G~ 312 (335) T protein:vir:63 235 ILNGVKVLETP--RFATKAIAAHPLGRHFNVSAEESERQIALFLPSKTLITAQVAPVQAKLWEDNEKFSWVLDTFQMYNI 312 (335) T ss_pred EeeceEEEeec--cCCCCCcccccccccCCccccccceeEEEEEecceEEEEEEeecccceeeccchhhHHhHHHHHcCC Confidence 89999998864 3564332 2223344566443322222211 222111111000 111234455556899 Q ss_pred EEecccceEEEEeeccccCCCCc Q lcl|Aclame:pro 378 KATDSEALVAGSFSAIADQVGNF 400 (408) Q Consensus 378 ~v~~~~a~~~l~~~~~~~~~~~~ 400 (408) .++||++.+.++++....-.-+- T Consensus 313 g~lRPe~a~~i~~tg~~~~~~~~ 335 (335) T protein:vir:63 313 GARRPDTAGAIELKGIGAFDITA 335 (335) T ss_pred cccccceEEEEEEcCCCceeecC Confidence 99999999999875543321111 No 144 >protein:vir:78935 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522824;genbank:gi:158345059;genbank:GeneID:5687425 Probab=98.57 E-value=1.2e-08 Score=63.92 Aligned_cols=284 Identities=7% Similarity=0.040 Sum_probs=148.4 Q ss_pred hhhHHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeecccCccceEEeeccCCccccchhccccccc Q lcl|Aclame:pro 105 MAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIP 184 (408) Q Consensus 105 ~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~E~~~~~ 184 (408) |.... .-.|-...++.++-...| ..|+.++.......+.++++.++.++.++ .++.++..... ......-|+... 
T Consensus 1 ms~~~-~~t~~~~~~s~~d~al~l-e~f~geV~~af~~~s~~~~~~~~rti~~g-~s~~~~~iG~~--~~~~~~pG~~l~ 75 (335) T protein:vir:78 1 MSFLN-DLTRPNYAGKNADVDIHL-EEHLGIVDKHFAYTSKFAPLMNIRDLRGS-NVVRLDRLGNV--EAKGRRAGEELE 75 (335) T ss_pred CCccc-cccccccccccchhhhhh-hhhhhHHHHHHHHhhhhccccceeeeccc-eeEEEeeeeee--eecccccCcccC Confidence 11110 011111122333333334 78899999999999999999999998764 46667765433 334455555543 Q ss_pred ccccccceeeeechheeeee-hHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhh----cc--------------cc Q lcl|Aclame:pro 185 DLDNPQLTIIKYLIKRYAGI-ITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIE----VM--------------KA 245 (408) Q Consensus 185 ~~~~~~f~~v~~~~~~~~~~-~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~~----g~--------------g~ 245 (408) .+.+..++.++....+-.. ..|-+-=--++..|+.+.+.+++++++++..|+.++- +. |. T Consensus 76 -~~~~~~~k~~itID~ll~a~~~VddlDe~~~~yDvR~e~s~~~G~aLA~~~Dq~~~~~l~~aa~~~a~~~~~~~~~~G~ 154 (335) T protein:vir:78 76 -RSRVVNDKWNLTVDTLLYLRHQFDHQDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVDLEDAFSPGV 154 (335) T ss_pred -CCCcccCCeEEEecceeechhhHhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccCCCcCCCc Confidence 3345556666666654321 1222211223567899999999999999999998751 11 11 Q ss_pred cc-----chhhhhhHHHHHHHHH---HhhhhhccC-----CCEEEEcHHHHHHHHhhhcccCc-eeec---cccccCCcc Q lcl|Aclame:pro 246 AP-----KKPTIAKFDDVITMIN---TAVDPAIIA-----TSSLLTNQSGLNKLALVKTAEGK-YLLE---PDPTKPNSY 308 (408) Q Consensus 246 ~~-----~~~~~~~~d~i~~~~~---~~l~~~~~~-----~a~~~~n~~~~~~l~~lkd~~G~-~~~~---~~~~~~~~~ 308 (408) .. ......+.+.+.+++. ..+...+.+ ..+.+++|..|..|..-+.--.+ |... .+...+... 
T Consensus 155 ~~~~~~tg~~~~~~~~~l~~a~~~a~~~l~ekdvP~~~~~~rv~vv~P~~y~~Ll~~~~l~n~~~~~s~~~~~~~~g~v~ 234 (335) T protein:vir:78 155 LEKLDLTGLTAKEAAEKIVRMHRRVVETFIERDLGDAVYSEGLTPMSPRVFSLLLEHDKLMSVEYQATGATNDYVKSRVA 234 (335) T ss_pred ceeeeeccccccccHHHHHHHHHHHHHHHHhccCCCCCCCccEEEeChHHHHHHhcccccccccccccccccccccceeE Confidence 00 0111123333333321 223333332 45789999999988764221111 2111 123334457 Q ss_pred cccccceEeeccccccccccC------cceEEEEehhc-ceEeeecc--------ceEEEEeccchhhhhhceeeEEEEe Q lcl|Aclame:pro 309 LIKGKQVIVVADRWLPNTGST------VYPLYYGDMSQ-AITLFDRE--------NMSLLPTNIGAGAFETDTTKIRVID 373 (408) Q Consensus 309 ~l~G~pv~~~~~~~~~~~~~~------~~~~~~gd~~~-~~~~~~~~--------~~~i~~~~~~~~~f~~~~~~~r~~~ 373 (408) .++|+||+.+. .+|..... .+..+=+||++ ..+++.+. ++..++..+. ..| ...+.+.. T Consensus 235 ~v~Gv~V~~Sn--~lP~~~~t~~~lg~a~n~~~~d~~~~~~~~~~~~Al~t~~~~~~~~e~~~~~-~~~---~~~i~~~~ 308 (335) T protein:vir:78 235 ILNGVKVLETP--RFATKAISAHPLGRHFNVSAEEAERQIALFLPSKTLITAQVAPVQAKLWEDH-DQF---SWVLDTFQ 308 (335) T ss_pred EeeceEEEeec--cCCCCCCccccccccCCcccccccceEEEEEecceEEEEEEEecccceeecc-chh---hHhhhHHH Confidence 89999998854 46654321 11223334433 12222222 1222221111 112 23445556 Q ss_pred eeCcEEecccceEEEEeeccccCCCCc Q lcl|Aclame:pro 374 RFDVKATDSEALVAGSFSAIADQVGNF 400 (408) Q Consensus 374 r~d~~v~~~~a~~~l~~~~~~~~~~~~ 400 (408) -+|.+++||++.+.++++....-.-+- T Consensus 309 a~G~g~lRPe~a~~i~~tg~~~~~~~~ 335 (335) T protein:vir:78 309 MYNIGARRPDTAGAIELKGIEAFDITA 335 (335) T ss_pred HcCCcccCcceEEEEEecCCCcccccC Confidence 689999999999999866543221111 No 145 >protein:vir:1583 Length: 351 # NCBI annotation: minor capsid protein # Family: family:all:1522 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695165;swissprot:trembl:o03966;genbank:gi:23455804;uniprot:O03966;genbank:GeneID:955561 Probab=98.56 E-value=3.5e-08 Score=61.43 Aligned_cols=278 Identities=9% Similarity=-0.021 Sum_probs=130.4 Q ss_pred hhccccccCceecchhhhhhhhhhhhhhhhhhhh------hceee-cccCccceEEeeccCCccccchhccccccccccc Q lcl|Aclame:pro 116 
ETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQY------VRVES-VSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDN 188 (408) Q Consensus 116 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~------~~~~~-~~~~~g~~~~~~~~~~~~~~~~~~E~~~~~~~~~ 188 (408) |..+.- .-..+|+.|...+.+...+.+.|.+- ...-. ..++...+.+|.+..-.+.+--+.|+..++.... T Consensus 1 MA~T~l--sd~i~PEvf~~yv~~~~~~~~~l~qSG~i~~~~~l~~~~~~~G~~it~P~~~~l~Gd~~~~~~~~~i~~~ki 78 (351) T protein:vir:15 1 MAETHL--SDLIVPEVFGNYVVNQIIKTNRFVQSGILTPDPDLGPHLLEAGTRITVPFLNDLTGDPDNWTDSDDIDVNNL 78 (351) T ss_pred CCceee--eeeechhHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhcCCCEEEecccccCCCcccccCCCcccchhee Confidence 433222 23678888877777766666655331 11111 1122223444444322233344677777754333 Q ss_pred ccceeeeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhcccc-----------------ccchhh Q lcl|Aclame:pro 189 PQLTIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVMKA-----------------APKKPT 251 (408) Q Consensus 189 ~~f~~v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~~g~g~-----------------~~~~~~ 251 (408) .+ .+-.-..+..+....++++...-+.-+....+.++++....+..+..++....+ ...... T Consensus 79 tt-~~~~a~i~~~~kg~~~tD~a~~~sg~dp~~~i~~q~a~~w~~~~q~~lla~l~gv~~~~~~~~~~~~d~t~~~~~~~ 157 (351) T protein:vir:15 79 TS-GKQQGIKFYQTKAYGYTDLGTMISGAPVQETIGNRFAAFWQRADQKTLLSVLKGVMGVTKIANSKVYDQTKVSPSEP 157 (351) T ss_pred cc-cceeEEEEeeccceehhhhhHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhchhhcccceecccccccccc Confidence 23 332333334444455555544445557788899999998888888776643211 011223 Q ss_pred hhhHHHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHhhhcccCceeeccccccCCcccccccceEeeccccccccccCc- Q lcl|Aclame:pro 252 IAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNSYLIKGKQVIVVADRWLPNTGSTV- 330 (408) Q Consensus 252 ~~~~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~- 330 (408) ..+++.+.+++....+.....-.+|+||+.++..|++..--+ |+ ++.-.+..-++++|++|++.|..+........ 
T Consensus 158 ~is~~~l~~A~~~~GD~~~~~~~~ivmhS~v~~~L~~~~li~--~~-~~s~~~~~i~t~~G~~VivdD~~p~~~~~~~~~ 234 (351) T protein:vir:15 158 MFGAKGFTGAIGLMGDLQDTAFGAIAVNSATYSLMKVQGLIE--TI-QPQNGATPFEAYNGLRIVLDDDIEIDLTDKTKP 234 (351) T ss_pred ccCHHHHHHHHHHhccccccceEEEEEChHHHHHHHhhhhhh--hc-cccccCcccceecceEEEEcCCCccccCCCCCc Confidence 345677777775543333333578999999999988653110 11 11111222368999999987654332222211 Q ss_pred --ceEEEEehhcceEeeecc-ceEEEEeccchhhhhhceeeEEEEeeeCcEEecccceEEEEe----------------- Q lcl|Aclame:pro 331 --YPLYYGDMSQAITLFDRE-NMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSF----------------- 390 (408) Q Consensus 331 --~~~~~gd~~~~~~~~~~~-~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~----------------- 390 (408) ...+||. .++....+. ..++.+++... .++-.+....+ -+++|..+..-+- T Consensus 235 ~ytsyl~~~--GAi~~~~~~~~ve~~rd~~~~----~g~d~l~~r~~---~~~hp~G~s~~~~~~~~~~~sPt~~~L~~~ 305 (351) T protein:vir:15 235 VSTSYIFAP--GAVRYSTNMRSTETKYDPLIN----GGQDVIVQKRV---GTIHVAGTSIKASFSPSKASFPTIDELAKS 305 (351) T ss_pred eeEEEEEec--ceeeeecCCcCcceeecccCC----CCceEEEEeee---eeeeeeeeeecccccccCcCCcChHHhcCC Confidence 2344553 122222222 24444443321 12222222222 3466666555221 Q ss_pred -------------------eccccCCCCccCCCcccC Q lcl|Aclame:pro 391 -------------------SAIADQVGNFKTTTSTAV 408 (408) Q Consensus 391 -------------------~~~~~~~~~~~~~~~~~~ 408 (408) +.-.+....-.+.++.|= T Consensus 306 ~NW~~v~~~d~k~I~iv~~~~~~~~~~~~~~~~~~~~ 342 (351) T protein:vir:15 306 STWEVVDGIDVRSIGVVAYTAQLDPALTPGAQMPAAD 342 (351) T ss_pred cccccccCCCccccceEEEEEecCcccccCCcCcCCC Confidence 111111111111111111 No 146 >protein:vir:103759 Length: 330 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: genbank:acc:YP_024928;genbank:gi:48697198;genbank:GeneID:2846083 Probab=98.51 E-value=5e-09 Score=66.10 Aligned_cols=216 Identities=10% Similarity=-0.035 Sum_probs=135.8 Q ss_pred HhhcchhhHHHHHHHHhhcccccc-CceecchhhhhhhhhhhhhhhhhhhhhceeecccCccceEEeeccCCccccchhc Q lcl|Aclame:pro 100 
MVRNPMAFMNTVSSKTETSGSDSA-AGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDA 178 (408) Q Consensus 100 ~~~~~~~~~~~~~~~a~~~~t~~~-gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~ 178 (408) +..-+... .|-.+ ...+-|......|++.+.+.++|+..++.......++.... ....-|.+.|.. T Consensus 1 m~~~~~~a-----------~TL~e~AKr~~~d~~~~~IIE~l~~tn~IL~~lpf~e~N~~tg~~t~--vrt~LP~~~fR~ 67 (330) T protein:vir:10 1 MATLSTNN-----------PTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTS--VRTGLPTPTWRK 67 (330) T ss_pred CCcCCCCc-----------ccHHHHHhhcCcchhHHHHHHHHhcCchHHhhcchhhccCCccccee--EEeecCCchhhh Confidence 00000000 01001 11223445667899999999999888887765555544332 223457788998 Q ss_pred ccccccccccccceeeeechheeeeehHHHHHHHhcch--HHHHHHHHHHHHHHHHHHHHHHHhhccccccc-------- Q lcl|Aclame:pro 179 EDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTA--ENILAWLSSWIAKKVVVTRNQAIIEVMKAAPK-------- 248 (408) Q Consensus 179 E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~--~~~~~~v~~~l~~~~~~~~~~~~~~g~g~~~~-------- 248 (408) =++.++. +.+++.+++-+.+-+.+...|-+.+.+... 
.++...-.....+++.+.+...+|+|+.+..+ T Consensus 68 lN~g~~~-s~~tt~qvt~~l~ilgg~~eVDr~la~~~Gn~a~~ra~e~~~~ikam~q~~~~~~iyGD~a~~p~~F~GL~k 146 (330) T protein:vir:10 68 LYGGVLP-NKSSTAQVTDNCGMLEAYAEVDKALADLNGNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSP 146 (330) T ss_pred cCCcccc-ccceEEEEEEEeEEecchhhhhhHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCChhhccchhh Confidence 8888874 568999999999999999999998877533 24445566778999999999999998643210 Q ss_pred -------------------------------------------------------------------------------- Q lcl|Aclame:pro 249 -------------------------------------------------------------------------------- 248 (408) Q Consensus 249 -------------------------------------------------------------------------------- 248 (408) T Consensus 147 R~~~~ta~~~~qvIdaGGtG~~~TSi~~v~wg~~~~~giyPkG~kaGl~~~d~g~~~~~~~dg~gg~y~~~~~~~~w~~G 226 (330) T protein:vir:10 147 RYNSLSAENKDNVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIG 226 (330) T ss_pred hcCCCCCCchhheeeccccccCceEEEEEEEcCCeEEEEcccCccccceeeeccceeeecccCCCCceeEEeeeeeeeee Confidence Q ss_pred -------------------hhhhhhHHHHHHHHHH---hhhhhccCCCEEEEcHHHHHHHHhh-hcccCceeeccccccC Q lcl|Aclame:pro 249 -------------------KPTIAKFDDVITMINT---AVDPAIIATSSLLTNQSGLNKLALV-KTAEGKYLLEPDPTKP 305 (408) Q Consensus 249 -------------------~~~~~~~d~i~~~~~~---~l~~~~~~~a~~~~n~~~~~~l~~l-kd~~G~~~~~~~~~~~ 305 (408) .......++++++|.. .++.......+|+||+....+|++. .+++...+-...+.+. T Consensus 227 l~i~d~r~vvRI~NIdvs~l~~~~~~~~li~lm~~A~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~k~n~~l~~~~~~g~ 306 (330) T protein:vir:10 227 LTLRDWRYVARVCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSGE 306 (330) T ss_pred eEEeCcccEEEEeecccccCCCCccHHHHHHHHHHHHHhccCCCCCcceeeechHHHHHHHHHHhhcccceeeeeecCCe Confidence 0000113345555533 3455666778999999999999975 4554444433344444 Q ss_pred CcccccccceEeeccccccccccCcceEE Q lcl|Aclame:pro 306 NSYLIKGKQVIVVADRWLPNTGSTVYPLY 334 (408) Q Consensus 306 ~~~~l~G~pv~~~~~~~~~~~~~~~~~~~ 334 (408) ..-.++|.||..+|. 
...+...++ T Consensus 307 ~~t~~~gipir~~Da-----il~tE~~vv 330 (330) T protein:vir:10 307 RVMTFDGIPVQRTDA-----LLNTESRVV 330 (330) T ss_pred eeEEECCeEEEEEee-----eecCccccC Confidence 446799999999863 111222222 No 147 >protein:vir:103323 Length: 364 # NCBI annotation: major capsid-like protein # Family: family:all:2806 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039668;genbank:gi:125999997;genbank:GeneID:4818399 Probab=98.50 E-value=1.5e-07 Score=58.03 Aligned_cols=289 Identities=11% Similarity=0.024 Sum_probs=145.0 Q ss_pred hhhHHHHHHHHhhccccccCceecc-hhhhhhhhhhhhhhhhhhhhhceeecccCccceEEeeccCCccccchhcccccc Q lcl|Aclame:pro 105 MAFMNTVSSKTETSGSDSAAGLTIP-QDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKI 183 (408) Q Consensus 105 ~~~~~~~~~~a~~~~t~~~gg~~vP-~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~E~~~~ 183 (408) |...... .+ .....++.-..+. ..|..++.......+.++++.++.++.++ .++.++.....+. ....-|+.. T Consensus 1 ms~~n~~-t~--~~~~~~~~~~al~le~f~geV~taf~~~s~~~~~~~~rti~~g-kS~q~~~iG~~~~--~~~~~G~~l 74 (364) T protein:vir:10 1 MSNPNVL-TQ--PAVSASGEVDSLLIEKFNNRVHEQYLKGENLLQWFDVQEVVGT-NSVSNKYIGETEL--QVLSPGKSP 74 (364) T ss_pred CCCcccc-cc--cccccccchhhhhhhhhhhhHHHHHHHHHhhcCcceeeeeccc-ceEEeeeeeeeEE--eeeccCccc Confidence 1110000 00 0001111123344 67889999999899999999999988755 4666776643333 223333332 Q ss_pred cccccccceeeeechheeeee-hHH--HHHHHhcchHH-HHHHHHHHHHHHHHHHHHHHHhhcc---------------- Q lcl|Aclame:pro 184 PDLDNPQLTIIKYLIKRYAGI-ITA--TNTSLKDTAEN-ILAWLSSWIAKKVVVTRNQAIIEVM---------------- 243 (408) Q Consensus 184 ~~~~~~~f~~v~~~~~~~~~~-~~i--S~ell~ds~~~-~~~~v~~~l~~~~~~~~~~~~~~g~---------------- 243 (408) +...+..++.+|....+--+ ..| =+|.+ +.++ +.+.+.+++++++++..|+.++--. 
T Consensus 75 -d~~~~~~~k~~itID~ll~a~~~V~diDe~q--~~~D~vR~e~s~e~G~ALA~~~Dq~i~~~v~~aa~a~~~~~~~~~~ 151 (364) T protein:vir:10 75 -DASPTEFDKNRLVVDTTVIARNTVAHFHDVQ--NDIDGLKSKLSVNQAKKLKKMEDSMVIQQLVLGGISNTEAIRKNPR 151 (364) T ss_pred -CCCCcccCcEEEEecceeeechhhhhHHHHh--cCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccCCc Confidence 33445556666666554322 222 22333 3455 6889999999999999998774210 Q ss_pred --cc-------ccchhhhhhHHHHHHHH---HHhhhhhcc--CCCEEEEcHHHHHHHHhhhcccC-ceeec--cccccCC Q lcl|Aclame:pro 244 --KA-------APKKPTIAKFDDVITMI---NTAVDPAII--ATSSLLTNQSGLNKLALVKTAEG-KYLLE--PDPTKPN 306 (408) Q Consensus 244 --g~-------~~~~~~~~~~d~i~~~~---~~~l~~~~~--~~a~~~~n~~~~~~l~~lkd~~G-~~~~~--~~~~~~~ 306 (408) +. +.+....++.+.+.+++ ...|...+. ...+.+++|..|..|.+-.+=-. .|... .+...+. T Consensus 152 ~~~~g~~i~~~~~a~~~~~~~~~l~~ai~~a~~~LdEkdVP~~~R~~vv~P~~y~~Ll~~~~lvn~d~~~~~~~~~~~G~ 231 (364) T protein:vir:10 152 VAGHGFSIHIVGLASSFLTSPQYMMAAIEMAMEQQTEQEVDTSELCGLMPWTAFNCLRDADRIVDKSYTIAASDNTVDGF 231 (364) T ss_pred ccCCcceeeecccCcchhhhHHHHHHHHHHHHHHHhhcCCCccccEEEeChHHHHHHhcCCccccccccccCCCccccce Confidence 00 01111122223333322 122444433 34578999999987765311000 11110 1223344 Q ss_pred cccccccceEeecccccccccc--------Cc--------ceE--EEEehhcc-eEeeec--------cceEEEEeccch Q lcl|Aclame:pro 307 SYLIKGKQVIVVADRWLPNTGS--------TV--------YPL--YYGDMSQA-ITLFDR--------ENMSLLPTNIGA 359 (408) Q Consensus 307 ~~~l~G~pv~~~~~~~~~~~~~--------~~--------~~~--~~gd~~~~-~~~~~~--------~~~~i~~~~~~~ 359 (408) ..+++|+||+.+.+ +|.... .. +.- ..|||+.. .+.|.+ .+++.++.... 
T Consensus 232 v~~v~Gv~Vv~Sn~--lP~~~~~~~~t~~~t~h~ls~~~~g~~y~v~~d~~~~~~~~f~~~Al~tv~~~~~t~e~~~~~- 308 (364) T protein:vir:10 232 VLKSWNTPIVPSNR--FPKLSDNTEGTGNTKHHKLSNAGNGNRYDVTAGQTSAQAVLFTQDALLVGRTISITGDIFYEK- 308 (364) T ss_pred eEEEeceEEEeccc--cccccccccccccccccccccccCCcccccccccceeEEEEEecceEEEEEEecceeeeeecc- Confidence 46799999987543 553211 00 111 23454331 222333 23333332211 Q ss_pred hhhhhceeeEEEEeeeCcEEecccceEEEEeeccccCCCCccCCCcccC Q lcl|Aclame:pro 360 GAFETDTTKIRVIDRFDVKATDSEALVAGSFSAIADQVGNFKTTTSTAV 408 (408) Q Consensus 360 ~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~~~~~~~~~~~ 408 (408) .+-...+.+.+-+|..++||++.+.++...++.....--+--|++- T Consensus 309 ---~~~~~~ida~~a~G~g~lRPeaa~~i~~~~~~~~~~~~~~~~~~~~ 354 (364) T protein:vir:10 309 ---KEKTWYIDTFLAEGAIPDRWEAVAVVTAADTAELATDHNAILARAN 354 (364) T ss_pred ---ceeeeeeeeehcccCcccCccceEEEEecCCCCCccchhhhhhhcc Confidence 1223444566669999999999999876655544433333333332 No 148 >protein:vir:3136 Length: 322 # NCBI annotation: hypothetical protein # Family: family:all:11728 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640318;genbank:gi:21234405;genbank:GeneID:956058 Probab=98.43 E-value=3e-08 Score=61.86 Aligned_cols=266 Identities=12% Similarity=0.060 Sum_probs=140.0 Q ss_pred hhccc-cccCceec-chhhhhhhhhhhhhhhhhhhhhceeecccCccceEEeeccCCccccchhccccccc--ccccccc Q lcl|Aclame:pro 116 ETSGS-DSAAGLTI-PQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIP--DLDNPQL 191 (408) Q Consensus 116 ~~~~t-~~~gg~~v-P~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~E~~~~~--~~~~~~f 191 (408) +..+. ++++..+| |+.|+..|..-+.+......+.++.... ..-++.|+.....+... ..+++.+. +.+... 
T Consensus 1 ~~~~n~ts~~qafi~~EiWsa~il~~l~~~Lv~~~~~~~~d~g-~GDtV~InsIg~~tV~d--Y~~~~~i~~d~ltt~~- 76 (322) T protein:vir:31 1 MSTGNNTSNTQALIVSEIWADEIEDILHEKLLDVNIARVVDFP-DGDKLTIPSVGTPVVRS--RPEQGDFTFDNLDTGE- 76 (322) T ss_pred CCCCCCcccceEEeehhhhHHHHHHHhhhhhhhhhhhcccccC-CCCeEEecccccccccc--ccCCCCcccccCCCce- Confidence 33333 34444445 9999999998888887777766654432 23356666655444333 33333321 112212 Q ss_pred eeeeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhc----------ccccc------------ch Q lcl|Aclame:pro 192 TIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEV----------MKAAP------------KK 249 (408) Q Consensus 192 ~~v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~~g----------~g~~~------------~~ 249 (408) ..+.++..|+.++. |++... +...+|.+...++.+++++...|..+..- .++.. .. T Consensus 77 ~~l~IDq~KYfaf~-VdDD~~-Qa~~dl~~~~~~~aa~ala~~~D~fva~lL~~gA~~~~~~~~p~vin~~~~~iv~~gt 154 (322) T protein:vir:31 77 ISIILRDEVYAGNA-ISKKLR-QDSRWISNVGAMLPAEQARAIMERYQTDLLALGNAQFAGQNDPNVINGVPHRFVGTGT 154 (322) T ss_pred EEEEEehhhhhccc-cchhHH-HhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccCCcceecCCccceeccCC Confidence 25567777777765 777554 45678999999999999999888766431 11100 00 Q ss_pred hhhhhHHHHHHHHHHhhhhhccC--CCEEEEcHHHHHHHHhhh-----cccCcee--eccccccCC--cccccccceEee Q lcl|Aclame:pro 250 PTIAKFDDVITMINTAVDPAIIA--TSSLLTNQSGLNKLALVK-----TAEGKYL--LEPDPTKPN--SYLIKGKQVIVV 318 (408) Q Consensus 250 ~~~~~~d~i~~~~~~~l~~~~~~--~a~~~~n~~~~~~l~~lk-----d~~G~~~--~~~~~~~~~--~~~l~G~pv~~~ 318 (408) .....++.++++- ..|+...-+ +-.+|++|..+..|..+. -.++|+. ...+...+. 
.++++|+-|+++ T Consensus 155 ~~~~ay~~lv~l~-~kLdkanVP~~gR~vVV~P~~~~~L~~i~~~~~l~~D~rf~~i~~sG~a~g~~~Vg~~~GF~V~~S 233 (322) T protein:vir:31 155 DQTMDVTDFSRVN-YVMTQSKMPMGGMIGIIDPSVAHHLETITNISNISNNPRWEGIVESGIAPDMQFVRSVYGIDLFVS 233 (322) T ss_pred CchhhHHHHHHHH-HHhccccCCCCCeEEEeCchhhhhhhhhhhhhhhhccccccccccccchhhHHHHHHHhceeeeee Confidence 1123466666654 446555544 345677899987774321 1123432 111121111 468999999886 Q ss_pred ccccccccccCcceEEEE---------ehhcceEeee---------ccce-EEEEeccchhhhhhceeeEEEEeeeCcEE Q lcl|Aclame:pro 319 ADRWLPNTGSTVYPLYYG---------DMSQAITLFD---------RENM-SLLPTNIGAGAFETDTTKIRVIDRFDVKA 379 (408) Q Consensus 319 ~~~~~~~~~~~~~~~~~g---------d~~~~~~~~~---------~~~~-~i~~~~~~~~~f~~~~~~~r~~~r~d~~v 379 (408) .+ ++. ++.+++-| -++.+..+.+ +..| +-+...+. .+.--.+|+..|+|.++ T Consensus 234 N~--l~~---~~~~i~aG~d~~~t~ag~~n~f~~~~~~~~~~~~~~~~~l~~~e~~r~~----~~~~d~~~~~~~~g~g~ 304 (322) T protein:vir:31 234 NL--LAD---ANETINAGGDARSTTAGKCNMFMNVSDMGLLPFVVAWKEMPTTKSFIDD----YNDDLNTATTARWGNGL 304 (322) T ss_pred cc--ccc---cccccccCcccccccceeecccccccchhhhhhhhHhhhhhhhhcccCc----cccccceeeeeeeccee Confidence 43 321 11111111 1111111000 0111 00100000 13344578889999999 Q ss_pred ecccceEEEEeeccccCCCCc Q lcl|Aclame:pro 380 TDSEALVAGSFSAIADQVGNF 400 (408) Q Consensus 380 ~~~~a~~~l~~~~~~~~~~~~ 400 (408) ++|+..++|.- ...+.++ T Consensus 305 ~r~e~l~~~~a---~~~~~~~ 322 (322) T protein:vir:31 305 VRDENLVCVLA---NADKVTF 322 (322) T ss_pred ecccceEEEEe---ccccccC Confidence 99999988732 2222333 No 149 >protein:vir:97031 Length: 402 # NCBI annotation: 31 # Family: family:all:2806 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654132;genbank:gi:108862016;genbank:GeneID:5075980 Probab=98.36 E-value=6.2e-08 Score=60.11 Aligned_cols=293 Identities=13% Similarity=0.051 Sum_probs=142.9 Q ss_pred hhhHHHHHHHHhhccccccCceecc-hhhhhhhhhhhhhhhhhhhhhceeecccCccceEEeeccCCccccchhcccccc Q lcl|Aclame:pro 105 MAFMNTVSSKTETSGSDSAAGLTIP-QDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKI 183 (408) Q Consensus 105 
~~~~~~~~~~a~~~~t~~~gg~~vP-~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~E~~~~ 183 (408) |...... .+ .....++.-..+. ..|.+++.......+.++++.++.++.++ .++.+++....+. ....-|+.. T Consensus 1 Ms~~n~~-t~--~~~~~s~~~~al~le~f~geV~taF~~~si~~~~~~vrti~~G-kS~qf~~iG~~~a--~y~~~G~~l 74 (402) T protein:vir:97 1 MSTPNTL-TN--VAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDVQTVTGT-NTVSNKYLGETEL--QVLAPGQSP 74 (402) T ss_pred CCCcccc-cc--cccccccchhhhhhhhhhhhHHHHHHHHHhhcCcceeeeeccc-ceEEEEEEeeeEE--eeecccccc Confidence 1110000 00 0000111123344 67889999999899999999999988755 4666766543333 223333332 Q ss_pred cccccccceeeeechheeeee-hHH--HHHHHhcchHH-HHHHHHHHHHHHHHHHHHHHHhhcc-----------c---- Q lcl|Aclame:pro 184 PDLDNPQLTIIKYLIKRYAGI-ITA--TNTSLKDTAEN-ILAWLSSWIAKKVVVTRNQAIIEVM-----------K---- 244 (408) Q Consensus 184 ~~~~~~~f~~v~~~~~~~~~~-~~i--S~ell~ds~~~-~~~~v~~~l~~~~~~~~~~~~~~g~-----------g---- 244 (408) +...+..++..+....+-.. ..| =+|.+ +.++ +.+.+.+++++++++..|+.++.-. + T Consensus 75 -dg~~~~~~k~~ItID~lL~a~~~V~diDeaq--~~yD~vRse~s~e~G~ALA~~~Dq~ii~~i~~aa~a~t~~~~~~~~ 151 (402) T protein:vir:97 75 -NATPTQADKNQLVIDTTVIARNTVAHIHDVQ--GDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPR 151 (402) T ss_pred -CCCCcccccEEEEeCceeechhhhhhHHHHH--hcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccCc Confidence 33445556666665554322 222 22333 3455 6889999999999999999775210 0 Q ss_pred -----c-----ccchhhhhhHHHHHHHHH---Hhhhhhcc--CCCEEEEcHHHHHHHHhhhcc-cCceeec--cccccCC Q lcl|Aclame:pro 245 -----A-----APKKPTIAKFDDVITMIN---TAVDPAII--ATSSLLTNQSGLNKLALVKTA-EGKYLLE--PDPTKPN 306 (408) Q Consensus 245 -----~-----~~~~~~~~~~d~i~~~~~---~~l~~~~~--~~a~~~~n~~~~~~l~~lkd~-~G~~~~~--~~~~~~~ 306 (408) + ++.....++.+.+.+++. ..|...+. ...+++++|..|..|.+-.+= |-.|... .++..+. 
T Consensus 152 ~~~~g~s~~~~~t~~~a~~~~~~l~~ai~~a~~~LdEkdVP~~dRv~vv~P~~y~~Ll~~~rl~n~d~~~~~~g~~~~G~ 231 (402) T protein:vir:97 152 VKGHGFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATINGF 231 (402) T ss_pred ccccccccccccccchhhcCHHHHHHHHHHHHHHHHhcCCCccccEEEeChHHHHHHhhcccccchhhccccCCccccce Confidence 0 000001123333333221 22333332 234789999999887653210 1111111 1133344 Q ss_pred cccccccceEeeccccccccc----------cCcceE--EEEehhcc-eEeeeccc-eEEEEeccchhhhh---hceeeE Q lcl|Aclame:pro 307 SYLIKGKQVIVVADRWLPNTG----------STVYPL--YYGDMSQA-ITLFDREN-MSLLPTNIGAGAFE---TDTTKI 369 (408) Q Consensus 307 ~~~l~G~pv~~~~~~~~~~~~----------~~~~~~--~~gd~~~~-~~~~~~~~-~~i~~~~~~~~~f~---~~~~~~ 369 (408) ...++|+||+.+.+ +|... .+.+.. +-||++.. .+.|.++. .+++.-+.+.+.|. +-...+ T Consensus 232 v~~v~Gv~Vv~Snn--lP~~a~~it~~~ls~a~~G~~y~~t~d~t~~~~~~f~~~Av~tvk~~~vT~~~~~d~r~~~~~i 309 (402) T protein:vir:97 232 VLSSYNCPVIPSNR--FPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKEKTYYI 309 (402) T ss_pred eEEEeceEEEecCc--cccccccccccccccCCCCccCCcCcccceeEEEEEecceEEEEEeeccccchhhchhHHHHHH Confidence 46899999988654 44321 111122 33676532 22333332 23333222222211 122234 Q ss_pred EEEeeeCcEEecccceEEEEeecccc---CCCCccC---CCcccC Q lcl|Aclame:pro 370 RVIDRFDVKATDSEALVAGSFSAIAD---QVGNFKT---TTSTAV 408 (408) Q Consensus 370 r~~~r~d~~v~~~~a~~~l~~~~~~~---~~~~~~~---~~~~~~ 408 (408) .+.+-+|..+++|++..++.++.-.. 
+++.++- .-|+|- T Consensus 310 d~~~a~G~g~~RPeaa~vv~~~~~~t~~~~~~~~~~~~~~~~~~~ 354 (402) T protein:vir:97 310 DTFMAEGAIPDRWEAVSVVTTKRDATTGDAGGPGDDHATVLARAQ 354 (402) T ss_pred HHHHHhCCcccCccceEEEEEecccccccCCccccchhhhhcccc Confidence 45556899999999999998776221 1111110 001111 No 150 >protein:vir:102944 Length: 330 # NCBI annotation: major head protein # Family: family:all:1522 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945286;genbank:gi:39653721;uniprot:Q708M6;genbank:GeneID:2672858 Probab=98.36 E-value=2.5e-07 Score=56.77 Aligned_cols=275 Identities=10% Similarity=0.016 Sum_probs=130.4 Q ss_pred hhccccccCceecchhhhhhhhhhhhhhhhhhh---------hhceeecccCccceEEeeccCCccccchhcccc-cccc Q lcl|Aclame:pro 116 ETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQ---------YVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDG-KIPD 185 (408) Q Consensus 116 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~---------~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~E~~-~~~~ 185 (408) |...++.-.-..+|+.|...+.+...+.+.|.+ +......++. .+.+|.+..-.+..--+.|+. .++. T Consensus 1 Ma~~~T~l~d~i~pevf~~yv~~~~~~~~~l~qSG~i~~~~~i~~~~~~~G~--~i~~P~~~~l~G~~~~~~dg~~~i~~ 78 (330) T protein:vir:10 1 MANELTKILDTITPQQYNAYMQQYTAAKSAFVQSGIAVSDERVSKNITSGGL--LVNMPFWNDLTGDSEVLGNGDKALET 78 (330) T ss_pred CCCCceEeeeeechhHHHHHHHHHhHHhhhhhhcccccccHHHHHHhhcCCC--EEEecccccCCCcccccCCCccccch Confidence 333223333477898887777777766655532 1111112233 344444432223333355664 3542 Q ss_pred cccccceeeeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhcccc-------------------- Q lcl|Aclame:pro 186 LDNPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVMKA-------------------- 245 (408) Q Consensus 186 ~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~~g~g~-------------------- 245 (408) +..+-.+-.-..++.+....++++...-+..|....+.+++++...+..+..++....+ T Consensus 79 -~ki~t~~~~a~i~~~~k~~~~tD~a~~~~g~dp~~~i~~q~a~~w~~~~q~~lla~l~gvf~~~~~~~~~~~~~~~~~~ 157 (330) T protein:vir:10 79 -GKITAGADIACVLYRGRGWAANELTGVVAGSDPVRAILNRIGAYWLREDQKALIATLNGIFATGTAGEKGALEETHVSD 157 (330) T ss_pred 
-hhcccceeEEEEEeecceeeehhhhhhhcchhHHHHHHHHHHHHhhhhHHHHHHHHHHhhhhhhhcccchhhhhhheec Confidence 22232333333333444444444444345557788888998888777776655532210 Q ss_pred ccchhhhhhHHHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHhhhcccCceeeccccccCCcccccccceEeeccccccc Q lcl|Aclame:pro 246 APKKPTIAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNSYLIKGKQVIVVADRWLPN 325 (408) Q Consensus 246 ~~~~~~~~~~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~ 325 (408) ........+.+.+.+++.. +......-.+|+||+.++..|++.+=- .++ ++...+..-++++|++|++++. +|. T Consensus 158 ~~~~~a~~s~~~l~~A~~~-~GD~~~~~~~ivmhS~v~~~L~~~~li--~~~-~~s~~~~~i~~~~G~~VivdD~--~p~ 231 (330) T protein:vir:10 158 QSKASTGIDAGMVLDAKQL-LGDSADQVTAIAMHSAVYTKLQKDNLI--QYI-QPTTATINIPTYLGYRVIIDDG--IAP 231 (330) T ss_pred ccccccccCHHHHHHHHHH-hccccccceEEEEcHHHHHHHHHhhhh--hhh-cccccCcccccccceEEEEeCC--CCC Confidence 0111223445667777544 334444566899999999998864210 111 1111222336899999999765 443 Q ss_pred cccCcceEEEEehhcceEeeec---cceEEEEeccchhhhhhceeeEEEEeeeCcEEecccceEEEEe--eccccCCCCc Q lcl|Aclame:pro 326 TGSTVYPLYYGDMSQAITLFDR---ENMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSF--SAIADQVGNF 400 (408) Q Consensus 326 ~~~~~~~~~~gd~~~~~~~~~~---~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~--~~~~~~~~~~ 400 (408) ....-...+||. .++.+.+. ....++.++.. 
..++..+....+ -+++|..|..-.- +.....|..+ T Consensus 232 ~~~~yt~yl~~~--GAi~~~~~~~~~~v~~EtdRd~----~~g~~~l~~r~~---~~~hp~G~s~~~~~~~~~~~sPt~~ 302 (330) T protein:vir:10 232 TGDIYTSYLFRT--GSIGLNTGNPSGLTTFETSREA----AKGNDMIYTRRA---LVMHPYGVKWTGAEVDAGNITPSNA 302 (330) T ss_pred CCCceeEEEEec--CceeeecccCCccccccccCCc----cccceEEEEeeE---EEeeeeeeeecccccccCcCCcChH Confidence 333333445553 22222211 11233333322 133344444444 3456666665321 1111222222 Q ss_pred cCCCcccC Q lcl|Aclame:pro 401 KTTTSTAV 408 (408) Q Consensus 401 ~~~~~~~~ 408 (408) .-.++..- T Consensus 303 ~L~~~~NW 310 (330) T protein:vir:10 303 DLAKFKNW 310 (330) T ss_pred HhcCCcCc Confidence 22222222 No 151 >protein:vir:107826 Length: 331 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:1903 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996627;genbank:gi:45580761;genbank:GeneID:2767902 Probab=98.29 E-value=1.6e-07 Score=57.79 Aligned_cols=216 Identities=6% Similarity=-0.044 Sum_probs=131.3 Q ss_pred HhhcchhhHHHHHHHHhhccccccCc-eecch-hhhhhhhhhhhhhhhhhhhhceeecccCccceEEeeccCCccccchh Q lcl|Aclame:pro 100 MVRNPMAFMNTVSSKTETSGSDSAAG-LTIPQ-DIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMD 177 (408) Q Consensus 100 ~~~~~~~~~~~~~~~a~~~~t~~~gg-~~vP~-~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~ 177 (408) +..-+... .|-.+.. .+=|. .+...|++.+.+.++|+..+.........+... . ....-|.+.|. 
T Consensus 1 m~~~~~~~-----------~TL~e~Ak~~~~~~~l~~~IIE~l~~tn~IL~~lpf~e~N~~t~~~~-~-vrt~LP~~~fR 67 (331) T protein:vir:10 1 MPTLSTTN-----------PTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKT-T-VRSGLPTGTWR 67 (331) T ss_pred CCccccCc-----------ccHHHHHHhcCcchhHHHHHHHHHhcCchHHhhceeeeccCCcccee-e-EEeccCCchhh Confidence 00000000 0000000 01122 245679999999999999888887665555433 1 22456778999 Q ss_pred cccccccccccccceeeeechheeeeehHHHHHHHhcch--HHHHHHHHHHHHHHHHHHHHHHHhhccccccch------ Q lcl|Aclame:pro 178 AEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTA--ENILAWLSSWIAKKVVVTRNQAIIEVMKAAPKK------ 249 (408) Q Consensus 178 ~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~--~~~~~~v~~~l~~~~~~~~~~~~~~g~g~~~~~------ 249 (408) .=++..+. +.+++.+++-..+-+.+.+.|.+.+.+... .++...-.....+++.+.+...||+|+.+..+. T Consensus 68 ~lN~g~~~-s~~tt~q~t~~l~ilgg~~eVDk~la~~~Gn~~~~ra~e~~~~ik~m~~~~~~~~iyGD~a~~p~~F~GL~ 146 (331) T protein:vir:10 68 KLNYGVQP-EKSRTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLT 146 (331) T ss_pred ccCCccCc-ccceeEEEEEEEEEeccceeechHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccChhhhccch Confidence 88888874 668999999999999999999999887643 234444566788999999999999986421000 Q ss_pred -------------------------------------------------------------------------------- Q lcl|Aclame:pro 250 -------------------------------------------------------------------------------- 249 (408) Q Consensus 250 -------------------------------------------------------------------------------- 249 (408) T Consensus 147 kR~~~~~a~~~~q~IdaGgtG~~~TSI~~v~~~~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~G~~y~~y~~~~~w~~Gl 226 (331) T protein:vir:10 147 PRFNSLSAENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGL 226 (331) T ss_pred hhccccccccccceeecCCCCCCceEEEEEEEcCCeeEEecccccccCceEeecCceeeecCCCCeeeEEEEEEEeeeee Confidence Q ss_pred -------------h-------hhhhHHHHHHHHH---HhhhhhccCCCEEEEcHHHHHHHHhh-hcccCce-eecccccc Q lcl|Aclame:pro 250 -------------P-------TIAKFDDVITMIN---TAVDPAIIATSSLLTNQSGLNKLALV-KTAEGKY-LLEPDPTK 304 (408) Q 
Consensus 250 -------------~-------~~~~~d~i~~~~~---~~l~~~~~~~a~~~~n~~~~~~l~~l-kd~~G~~-~~~~~~~~ 304 (408) . ....-.+++++|. ..++.....+.+|+||+.....|++. .+....+ +-.....+ T Consensus 227 ~i~d~r~v~ri~NIdvs~l~~~~~~~~dl~~lm~~a~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~~~~~~~~~~~~g 306 (331) T protein:vir:10 227 TLRDWRYVVRIANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAG 306 (331) T ss_pred EEcCcccEEEEeccchhccCCCcchhhhHHHHHHHHHHHhcccCCCCeEEEechHHHHHHHHHHhhccceeeeeeeecCC Confidence 0 0000122344432 23444556778999999999999875 3443223 33333444 Q ss_pred CCcccccccceEeeccccccccccCcceEE Q lcl|Aclame:pro 305 PNSYLIKGKQVIVVADRWLPNTGSTVYPLY 334 (408) Q Consensus 305 ~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~ 334 (408) ...-.++|.||..+|.- ..+...++ T Consensus 307 ~~~t~~~gipir~~dai-----~~tE~~Vv 331 (331) T protein:vir:10 307 KKVVAFDGIPCRRTDAL-----LLTEARVV 331 (331) T ss_pred cceeEECCeeEEEeeee-----ecCccccC Confidence 45567999999998631 11122222 No 152 >protein:vir:98525 Length: 331 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:1903 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996579;genbank:gi:45569510;genbank:GeneID:2767853 Probab=98.29 E-value=1.6e-07 Score=57.79 Aligned_cols=216 Identities=6% Similarity=-0.044 Sum_probs=131.3 Q ss_pred HhhcchhhHHHHHHHHhhccccccCc-eecch-hhhhhhhhhhhhhhhhhhhhceeecccCccceEEeeccCCccccchh Q lcl|Aclame:pro 100 MVRNPMAFMNTVSSKTETSGSDSAAG-LTIPQ-DIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMD 177 (408) Q Consensus 100 ~~~~~~~~~~~~~~~a~~~~t~~~gg-~~vP~-~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~ 177 (408) +..-+... .|-.+.. .+=|. .+...|++.+.+.++|+..+.........+... . ....-|.+.|. 
T Consensus 1 m~~~~~~~-----------~TL~e~Ak~~~~~~~l~~~IIE~l~~tn~IL~~lpf~e~N~~t~~~~-~-vrt~LP~~~fR 67 (331) T protein:vir:98 1 MPTLSTTN-----------PTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKT-T-VRSGLPTGTWR 67 (331) T ss_pred CCccccCc-----------ccHHHHHHhcCcchhHHHHHHHHHhcCchHHhhceeeeccCCcccee-e-EEeccCCchhh Confidence 00000000 0000000 01122 245679999999999999888887665555433 1 22456778999 Q ss_pred cccccccccccccceeeeechheeeeehHHHHHHHhcch--HHHHHHHHHHHHHHHHHHHHHHHhhccccccch------ Q lcl|Aclame:pro 178 AEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTA--ENILAWLSSWIAKKVVVTRNQAIIEVMKAAPKK------ 249 (408) Q Consensus 178 ~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~--~~~~~~v~~~l~~~~~~~~~~~~~~g~g~~~~~------ 249 (408) .=++..+. +.+++.+++-..+-+.+.+.|.+.+.+... .++...-.....+++.+.+...||+|+.+..+. T Consensus 68 ~lN~g~~~-s~~tt~q~t~~l~ilgg~~eVDk~la~~~Gn~~~~ra~e~~~~ik~m~~~~~~~~iyGD~a~~p~~F~GL~ 146 (331) T protein:vir:98 68 KLNYGVQP-EKSRTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLT 146 (331) T ss_pred ccCCccCc-ccceeEEEEEEEEEeccceeechHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccChhhhccch Confidence 88888874 668999999999999999999999887643 234444566788999999999999986421000 Q ss_pred -------------------------------------------------------------------------------- Q lcl|Aclame:pro 250 -------------------------------------------------------------------------------- 249 (408) Q Consensus 250 -------------------------------------------------------------------------------- 249 (408) T Consensus 147 kR~~~~~a~~~~q~IdaGgtG~~~TSI~~v~~~~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~G~~y~~y~~~~~w~~Gl 226 (331) T protein:vir:98 147 PRFNSLSAENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGL 226 (331) T ss_pred hhccccccccccceeecCCCCCCceEEEEEEEcCCeeEEecccccccCceEeecCceeeecCCCCeeeEEEEEEEeeeee Confidence Q ss_pred -------------h-------hhhhHHHHHHHHH---HhhhhhccCCCEEEEcHHHHHHHHhh-hcccCce-eecccccc Q lcl|Aclame:pro 250 -------------P-------TIAKFDDVITMIN---TAVDPAIIATSSLLTNQSGLNKLALV-KTAEGKY-LLEPDPTK 304 (408) Q 
Consensus 250 -------------~-------~~~~~d~i~~~~~---~~l~~~~~~~a~~~~n~~~~~~l~~l-kd~~G~~-~~~~~~~~ 304 (408) . ....-.+++++|. ..++.....+.+|+||+.....|++. .+....+ +-.....+ T Consensus 227 ~i~d~r~v~ri~NIdvs~l~~~~~~~~dl~~lm~~a~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~~~~~~~~~~~~g 306 (331) T protein:vir:98 227 TLRDWRYVVRIANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAG 306 (331) T ss_pred EEcCcccEEEEeccchhccCCCcchhhhHHHHHHHHHHHhcccCCCCeEEEechHHHHHHHHHHhhccceeeeeeeecCC Confidence 0 0000122344432 23444556778999999999999875 3443223 33333444 Q ss_pred CCcccccccceEeeccccccccccCcceEE Q lcl|Aclame:pro 305 PNSYLIKGKQVIVVADRWLPNTGSTVYPLY 334 (408) Q Consensus 305 ~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~ 334 (408) ...-.++|.||..+|.- ..+...++ T Consensus 307 ~~~t~~~gipir~~dai-----~~tE~~Vv 331 (331) T protein:vir:98 307 KKVVAFDGIPCRRTDAL-----LLTEARVV 331 (331) T ss_pred cceeEECCeeEEEeeee-----ecCccccC Confidence 45567999999998631 11122222 No 153 >protein:vir:107388 Length: 331 # NCBI annotation: Bbp17 # Family: family:all:1903 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958686;genbank:gi:41179378;genbank:GeneID:2717182 Probab=98.29 E-value=1.6e-07 Score=57.79 Aligned_cols=216 Identities=6% Similarity=-0.044 Sum_probs=131.3 Q ss_pred HhhcchhhHHHHHHHHhhccccccCc-eecch-hhhhhhhhhhhhhhhhhhhhceeecccCccceEEeeccCCccccchh Q lcl|Aclame:pro 100 MVRNPMAFMNTVSSKTETSGSDSAAG-LTIPQ-DIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMD 177 (408) Q Consensus 100 ~~~~~~~~~~~~~~~a~~~~t~~~gg-~~vP~-~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~ 177 (408) +..-+... .|-.+.. .+=|. .+...|++.+.+.++|+..+.........+... . ....-|.+.|. 
T Consensus 1 m~~~~~~~-----------~TL~e~Ak~~~~~~~l~~~IIE~l~~tn~IL~~lpf~e~N~~t~~~~-~-vrt~LP~~~fR 67 (331) T protein:vir:10 1 MPTLSTTN-----------PTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKT-T-VRSGLPTGTWR 67 (331) T ss_pred CCccccCc-----------ccHHHHHHhcCcchhHHHHHHHHHhcCchHHhhceeeeccCCcccee-e-EEeccCCchhh Confidence 00000000 0000000 01122 245679999999999999888887665555433 1 22456778999 Q ss_pred cccccccccccccceeeeechheeeeehHHHHHHHhcch--HHHHHHHHHHHHHHHHHHHHHHHhhccccccch------ Q lcl|Aclame:pro 178 AEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTA--ENILAWLSSWIAKKVVVTRNQAIIEVMKAAPKK------ 249 (408) Q Consensus 178 ~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~--~~~~~~v~~~l~~~~~~~~~~~~~~g~g~~~~~------ 249 (408) .=++..+. +.+++.+++-..+-+.+.+.|.+.+.+... .++...-.....+++.+.+...||+|+.+..+. T Consensus 68 ~lN~g~~~-s~~tt~q~t~~l~ilgg~~eVDk~la~~~Gn~~~~ra~e~~~~ik~m~~~~~~~~iyGD~a~~p~~F~GL~ 146 (331) T protein:vir:10 68 KLNYGVQP-EKSRTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLT 146 (331) T ss_pred ccCCccCc-ccceeEEEEEEEEEeccceeechHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccChhhhccch Confidence 88888874 668999999999999999999999887643 234444566788999999999999986421000 Q ss_pred -------------------------------------------------------------------------------- Q lcl|Aclame:pro 250 -------------------------------------------------------------------------------- 249 (408) Q Consensus 250 -------------------------------------------------------------------------------- 249 (408) T Consensus 147 kR~~~~~a~~~~q~IdaGgtG~~~TSI~~v~~~~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~G~~y~~y~~~~~w~~Gl 226 (331) T protein:vir:10 147 PRFNSLSAENGQNIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGL 226 (331) T ss_pred hhccccccccccceeecCCCCCCceEEEEEEEcCCeeEEecccccccCceEeecCceeeecCCCCeeeEEEEEEEeeeee Confidence Q ss_pred -------------h-------hhhhHHHHHHHHH---HhhhhhccCCCEEEEcHHHHHHHHhh-hcccCce-eecccccc Q lcl|Aclame:pro 250 -------------P-------TIAKFDDVITMIN---TAVDPAIIATSSLLTNQSGLNKLALV-KTAEGKY-LLEPDPTK 304 (408) Q 
Consensus 250 -------------~-------~~~~~d~i~~~~~---~~l~~~~~~~a~~~~n~~~~~~l~~l-kd~~G~~-~~~~~~~~ 304 (408) . ....-.+++++|. ..++.....+.+|+||+.....|++. .+....+ +-.....+ T Consensus 227 ~i~d~r~v~ri~NIdvs~l~~~~~~~~dl~~lm~~a~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~~~~~~~~~~~~g 306 (331) T protein:vir:10 227 TLRDWRYVVRIANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAG 306 (331) T ss_pred EEcCcccEEEEeccchhccCCCcchhhhHHHHHHHHHHHhcccCCCCeEEEechHHHHHHHHHHhhccceeeeeeeecCC Confidence 0 0000122344432 23444556778999999999999875 3443223 33333444 Q ss_pred CCcccccccceEeeccccccccccCcceEE Q lcl|Aclame:pro 305 PNSYLIKGKQVIVVADRWLPNTGSTVYPLY 334 (408) Q Consensus 305 ~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~ 334 (408) ...-.++|.||..+|.- ..+...++ T Consensus 307 ~~~t~~~gipir~~dai-----~~tE~~Vv 331 (331) T protein:vir:10 307 KKVVAFDGIPCRRTDAL-----LLTEARVV 331 (331) T ss_pred cceeEECCeeEEEeeee-----ecCccccC Confidence 45567999999998631 11122222 No 154 >protein:vir:7324 Length: 335 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: genbank:acc:NP_848215;genbank:gi:30387386;genbank:GeneID:2641870 Probab=98.19 E-value=9e-08 Score=59.20 Aligned_cols=218 Identities=9% Similarity=-0.094 Sum_probs=130.2 Q ss_pred HhhcchhhHHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeecccCccceEEeeccCCccccchhcc Q lcl|Aclame:pro 100 MVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAE 179 (408) Q Consensus 100 ~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~E 179 (408) +..-+.....-.+. ...+-|......|++.+.+.++|+..++.......++.... 
....-|.+.|..= T Consensus 1 m~~~~~~a~TL~E~----------Akr~~~d~~~~~IIE~l~~tneIL~~lpf~e~N~~tg~~~~--vrt~LP~~~fR~l 68 (335) T protein:vir:73 1 MALIGQTLPSLLDI----------YNRTDKNGRIARIVEQLAKTNDILTDAIYVPCNDGSKHKTT--IRAGIPEPVWRRY 68 (335) T ss_pred CCcCCCCchhHHHH----------HhhcCcchhHHHHHHHHhcCchHHhhcchhcccCCccccee--EEEecCCchhhhc Confidence 00000000000000 01122344556699999999999888887765555544332 2234577889988 Q ss_pred cccccccccccceeeeechheeeeehHHHHHHHhcch--HHHHHHHHHHHHHHHHHHHHHHHhhccccccch-------- Q lcl|Aclame:pro 180 DGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTA--ENILAWLSSWIAKKVVVTRNQAIIEVMKAAPKK-------- 249 (408) Q Consensus 180 ~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~--~~~~~~v~~~l~~~~~~~~~~~~~~g~g~~~~~-------- 249 (408) ++.++. +.+++.+++-+.+-+.+.+.|-+.+.+... .++...-.....+++.+.+...+|+|+.+..+. T Consensus 69 N~g~~~-s~~tt~qvt~~l~ilgg~~eVDr~La~~~Gn~a~~ra~e~~~~ikam~q~~~~~~iyGDsa~~p~~FdGL~kR 147 (335) T protein:vir:73 69 NQGVQP-TKTQTVPVTDTTGMLYDLGFVDKALADRSNNAAAFRVSENMGKLQGFNNKVARYSIYGNTDAEPEAFMGLAPR 147 (335) T ss_pred CCcccc-ccceEEEEEEEEEEecchhhhhHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHhccCCcCCChhhccchhhh Confidence 888874 568999999999999999999997776543 244555566788999999999999985422110 Q ss_pred --------------------------------------------h----------------------------------- Q lcl|Aclame:pro 250 --------------------------------------------P----------------------------------- 250 (408) Q Consensus 250 --------------------------------------------~----------------------------------- 250 (408) + T Consensus 148 ~~~~st~~a~~a~~iIdaGGtG~~~TSi~~v~wg~~~~~giyPkG~kaGl~~~d~g~~~~~d~~G~~y~~~~~~~~w~~G 227 (335) T protein:vir:73 148 FNTLSTSKAASAENVFSAGGSGSTNTSIWFMSWGENTAHMIYPEGMVAGFQHEDLGDDLVSDGNGGQFRAYRDEFKWDIG 227 (335) T ss_pred hcCccccccCcccceeeccccccCceEEEEEEEcCCeeEEEcccCccccceeeeccceeeecCCCCEEeEEEeeeeeeee Confidence 0 Q ss_pred ----------------------hhhhHHHHHHHHHHhh-----hhhccCCCEEEEcHHHHHHHHhh-hcccCceeecccc Q lcl|Aclame:pro 251 
----------------------TIAKFDDVITMINTAV-----DPAIIATSSLLTNQSGLNKLALV-KTAEGKYLLEPDP 302 (408) Q Consensus 251 ----------------------~~~~~d~i~~~~~~~l-----~~~~~~~a~~~~n~~~~~~l~~l-kd~~G~~~~~~~~ 302 (408) ......+++++|..++ +.......+|+||+.....|++. +++....+-...+ T Consensus 228 l~i~d~r~vvRI~NIdvs~l~~d~~~~~~l~~lmi~a~~~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~n~~l~~~~~ 307 (335) T protein:vir:73 228 LSVRDWRSISRICNIDVTTLTKDASTGADLISMMVDAYYARDVAMLGDGKEVIYANKTIHAWLHKQAMNAKNVNLTIEEY 307 (335) T ss_pred eEEeCcccEEEEeecccccccccccchhhHHhhHHHHHHHHhccCCCCCceEEEechHHHHHHHHHHhccCceeeeeecc Confidence 0001123455554433 23334457899999999999875 4444333433334 Q ss_pred ccCCcccccccceEeeccccccccccCcceEEEE Q lcl|Aclame:pro 303 TKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYG 336 (408) Q Consensus 303 ~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~g 336 (408) .+...-.++|.||..+|.- +.+ .. .++. T Consensus 308 ~g~~~t~~~gipir~~Dai-l~t----E~-~v~~ 335 (335) T protein:vir:73 308 GGKKIVSFLGIPIRRVDAI-LNT----ES-AVTA 335 (335) T ss_pred CCceeEEECCeEEEEEeee-ecC----cc-cccC Confidence 3333456889999998631 111 11 1111 No 155 >protein:vir:7019 Length: 401 # NCBI annotation: major capsid protein # Family: family:all:2806 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853592;genbank:gi:31711674;genbank:GeneID:1481800 Probab=98.19 E-value=1.9e-07 Score=57.49 Aligned_cols=291 Identities=13% Similarity=0.074 Sum_probs=141.3 Q ss_pred hhhHHHHHHHHhhccccccC--ceecchhhhhhhhhhhhhhhhhhhhhceeecccCccceEEeeccCCccccchhccccc Q lcl|Aclame:pro 105 MAFMNTVSSKTETSGSDSAA--GLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGK 182 (408) Q Consensus 105 ~~~~~~~~~~a~~~~t~~~g--g~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~E~~~ 182 (408) |...... .+ .+..+.| =.+.=..|.+++.......+.++++.++.++.+++ ++.+++...... ....-|+. 
T Consensus 1 Ms~~n~~-t~---~~~~~sg~~~al~Le~f~GeV~taF~~~si~~~~~~vRti~~gk-S~qf~~~G~s~~--~~~~pG~~ 73 (401) T protein:vir:70 1 MSTPNNL-TN---VAVSASGEVDSLLIEKFNGKVNEQYLKGENIMSYFDVQTVTGTN-TVSNKYLGETEL--QVLAPGQS 73 (401) T ss_pred CCCCccc-cc---cccccccchhHhHHhHhcchHHHHHHHHhhhcccceeeeecccc-eEEEEEeeeeEe--eeecCCCC Confidence 1000000 00 0001111 12334567888888888999999999999887664 666666543333 33434443 Q ss_pred ccccccccceeeeechheeeee-hHHH--HHHHhcchHH-HHHHHHHHHHHHHHHHHHHHHhhcc--------------- Q lcl|Aclame:pro 183 IPDLDNPQLTIIKYLIKRYAGI-ITAT--NTSLKDTAEN-ILAWLSSWIAKKVVVTRNQAIIEVM--------------- 243 (408) Q Consensus 183 ~~~~~~~~f~~v~~~~~~~~~~-~~iS--~ell~ds~~~-~~~~v~~~l~~~~~~~~~~~~~~g~--------------- 243 (408) . +...+..++..|....+-.. ..|. ++.+ +.++ +.+.+.+++.+++++..|+.++--. T Consensus 74 l-d~~~~~~dK~~ItID~lL~a~~~V~dlDe~q--~~yD~vRse~s~e~G~ALA~~~Dq~iiq~i~~aa~ana~~~~~~p 150 (401) T protein:vir:70 74 P-AATSTQADKNQLVIDATVIARNTVAHLHDVQ--GDIDSLKPKLATNQAKQLKRMEDEMLIQQMMLGGIANTQAKRTNP 150 (401) T ss_pred c-CCCCcccccEEEEeCceeehhhhhhhHHHHH--hcccccchHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccCC Confidence 3 33445556665655554322 2222 2333 3455 6889999999999999998663211 Q ss_pred ---ccc-------cchhhhhhHHHHH----HHHHHhhhhhccC--CCEEEEcHHHHHHHHhhhcc-cCceeec--ccccc Q lcl|Aclame:pro 244 ---KAA-------PKKPTIAKFDDVI----TMINTAVDPAIIA--TSSLLTNQSGLNKLALVKTA-EGKYLLE--PDPTK 304 (408) Q Consensus 244 ---g~~-------~~~~~~~~~d~i~----~~~~~~l~~~~~~--~a~~~~n~~~~~~l~~lkd~-~G~~~~~--~~~~~ 304 (408) +.+ .......+.+.+. ++... |...+.+ .-+++++|..|..|..-..- |-.|... .+... 
T Consensus 151 ~~~~~G~~i~v~~~~~~~~~~~~~l~~ai~dA~~~-LdEkdVP~~r~vvl~pp~~Ys~Ll~~d~L~nrd~~~s~~g~~~~ 229 (401) T protein:vir:70 151 RVKGHGFSINVEVAEGEALVNPQYVMAAVEFALEQ-QLEQEVDISDVAILMPWRYFNVLRDADRIVDKTYTISQSGATIQ 229 (401) T ss_pred CcCCCceEEeccccccccccCHHHHHHHHHHHHHH-HHhcCCCccceEEEcCHHHHHHHHhcCcccchhhccccCCcccc Confidence 000 0011112222333 33322 3332222 23677788888877542100 1011111 11233 Q ss_pred CCcccccccceEeeccccccccc----------cCcceE--EEEehhcceE-eeeccc-eEEEEeccchhhh---hhcee Q lcl|Aclame:pro 305 PNSYLIKGKQVIVVADRWLPNTG----------STVYPL--YYGDMSQAIT-LFDREN-MSLLPTNIGAGAF---ETDTT 367 (408) Q Consensus 305 ~~~~~l~G~pv~~~~~~~~~~~~----------~~~~~~--~~gd~~~~~~-~~~~~~-~~i~~~~~~~~~f---~~~~~ 367 (408) +...++.|+||+.+.+ +|... .+.+.. +-|||+.... .|.+.. ..++.-+.+...| .+-.. T Consensus 230 G~v~~vaGv~Vv~Snn--lP~~a~~it~~~ls~a~~G~~y~~~~d~s~~~~v~f~~~Av~tvk~~~lt~~~~~d~r~~~~ 307 (401) T protein:vir:70 230 GFTLSSYNCPVIPSNR--FPKYSQGQTHHLLSNEDNGYRYDPLPAMNGAIAVLFTADALLVGRSIDVTGDIFYEKKEKTY 307 (401) T ss_pred ceEEEEeceEEEeecc--ccccccccccccccccCCCccCCCCccccceeEEEEehhheEEEEeeccccchhhhhhhhHH Confidence 3345799999998654 44321 111222 3367765322 333332 3333333222222 12223 Q ss_pred eEEEEeeeCcEEecccceEEEEeec--cccCCCCccCCCcccC Q lcl|Aclame:pro 368 KIRVIDRFDVKATDSEALVAGSFSA--IADQVGNFKTTTSTAV 408 (408) Q Consensus 368 ~~r~~~r~d~~v~~~~a~~~l~~~~--~~~~~~~~~~~~~~~~ 408 (408) .+-+.+-+|..+.+|++..+++.+- +++.+..++.+.-..+ T Consensus 308 ~id~~~a~g~g~~RPeaa~vv~~k~~~~~~~~~~~~~~~~~~~ 350 (401) T protein:vir:70 308 YIDTFMAEGAIPDRWEAVSVVTTKRNTTTGAVEGTDGAQHTIV 350 (401) T ss_pred HHHHHHHhCCcccchhheEEEeecCcccccccccCCcchhhhh Confidence 3445666899999999998885443 3444423321111111 No 156 >protein:vir:99675 Length: 324 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249589;genbank:gi:68299740;genbank:GeneID:3799990 Probab=98.18 E-value=3.5e-07 Score=55.99 Aligned_cols=246 Identities=13% Similarity=0.059 Sum_probs=119.1 Q ss_pred 
hhceeecccCccceEEeeccCCccccchhccccccccc-cccccee--eeechheeee-ehHHHHHHHhcchHHHHHHHH Q lcl|Aclame:pro 149 YVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDL-DNPQLTI--IKYLIKRYAG-IITATNTSLKDTAENILAWLS 224 (408) Q Consensus 149 ~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~E~~~~~~~-~~~~f~~--v~~~~~~~~~-~~~iS~ell~ds~~~~~~~v~ 224 (408) ++|. +.++ .++.+++....+ .....-|..+... ....-++ +++.-.++.. ++.==++. ++..|+.+.+. T Consensus 1 ~vr~--i~~g-~s~~~~~iG~~~--~~~~~~G~~l~~~~~~~~~~e~~itID~~l~~~~~VdDiD~~--qa~~Dlr~e~s 73 (324) T protein:vir:99 1 MTRT--ITSG-KSAQFPVMGRTK--ARYLKQGQSLDDGREDIKHTEKVITIDGLLTTDVLIYDIEDA--MNHYDVRSEYS 73 (324) T ss_pred Ceee--eecC-ceEEEeeeeeeE--eccccCCCCcCCCcCCcCcccEEEEecchhhhhhhhhhHHHH--hcCccchhHHH Confidence 4333 3332 355566553332 2334444444210 1122234 4444444443 22222222 25568999999 Q ss_pred HHHHHHHHHHHHHHHhhc------------------cccc-------cchhhhhh----HHHHHHHHHHhhhhhccC--C Q lcl|Aclame:pro 225 SWIAKKVVVTRNQAIIEV------------------MKAA-------PKKPTIAK----FDDVITMINTAVDPAIIA--T 273 (408) Q Consensus 225 ~~l~~~~~~~~~~~~~~g------------------~g~~-------~~~~~~~~----~d~i~~~~~~~l~~~~~~--~ 273 (408) ++.++++++..|+.++.- .|.+ .......+ ++.+.++. ..|+....+ + T Consensus 74 ~~~G~aLA~~~Dq~i~~~~a~~~~~~a~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~dai~~a~-~~Lde~~VP~~g 152 (324) T protein:vir:99 74 TQMGEALAMAADVANYAEMAKLVNSRKETTNENIEGLGAASLVKITGKKEDPAKYGTQVIQALTYAR-AAFAKKYIPAGD 152 (324) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhhcccccccCCcccCCccceecccccccccccCHHHHHHHHHHHH-HHHhhcCCCCCC Confidence 999999999999877521 1100 00011111 33333332 335444433 4 Q ss_pred CEEEEcHHHHHHHHhhh-cccCceeeccccccCCcccccccceEeeccccccccccCcc--------------------e Q lcl|Aclame:pro 274 SSLLTNQSGLNKLALVK-TAEGKYLLEPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVY--------------------P 332 (408) Q Consensus 274 a~~~~n~~~~~~l~~lk-d~~G~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~--------------------~ 332 (408) .+++++|..|..|..-+ -.++.|.-...+..+.-.+++|++|+.+.+ +|...+... . 
T Consensus 153 R~~vv~P~~y~~Ll~~~~~~~~~~~~~~~~~~G~V~~i~Gf~V~~Sn~--lp~~~~t~~~~a~~~~~~~~~~~~~~~~~~ 230 (324) T protein:vir:99 153 RTFYTDPDTYSAILAALMPNAANYAALIDPETGNIRNVMGFEVVETPH--MTAQMVTNPTDAFDGTGHIFPATGDSTTTG 230 (324) T ss_pred CEEEeChHHHHHHhhcccccccccccccceecceEEEEeceEEEecCC--cccccccccccccccccccccccccccccc Confidence 46789999998764332 223344444445556667899999998644 444211100 0 Q ss_pred EEEEehhcceE-eeec--------cceEEEEeccchhhhhhceeeEEEEeeeCcEEecccceEEEEeecccc-----CCC Q lcl|Aclame:pro 333 LYYGDMSQAIT-LFDR--------ENMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFSAIAD-----QVG 398 (408) Q Consensus 333 ~~~gd~~~~~~-~~~~--------~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~-----~~~ 398 (408) -+-+|++.... .+.+ .+++++..... .+-...+++..-+|.+++||++.+.+++++.+. .+. T Consensus 231 ky~~d~~~~~gl~~~~~a~~tv~~~~~~~e~~~~~----~~~~d~i~~~~a~G~~~lRPe~a~~v~l~~~~~~~~~~~~~ 306 (324) T protein:vir:99 231 KMTVGADNVVGLFVHRSAVATLKLKDMALERARRP----EYQADQIIAKYAMGHGGLRPEAVGAIIFEDGETPAVAPDVI 306 (324) T ss_pred ccccccCceeEEEEehhheEEEeeecceecceech----hhHHHhhhhhhhhcCcccccceEEEEEEccCccccccchhh Confidence 12333332111 1111 12233332211 122344566677899999999999888766542 222 Q ss_pred CccCCCcc-cC Q lcl|Aclame:pro 399 NFKTTTST-AV 408 (408) Q Consensus 399 ~~~~~~~~-~~ 408 (408) .+.+..+- +- T Consensus 307 ~~~~~~~~~~~ 317 (324) T protein:vir:99 307 TGVASFAAPAS 317 (324) T ss_pred hhhccccCccc Confidence 22111110 00 No 157 >protein:vir:9927 Length: 295 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795689;genbank:gi:28876459;genbank:GeneID:1258000 Probab=98.15 E-value=2e-07 Score=57.29 Aligned_cols=262 Identities=14% Similarity=0.030 Sum_probs=133.7 Q ss_pred hhccccccCceecch---hhhhhhhhhhhhhhhhhhhhceeecccCccceEEeeccCCccccchhcccccccccccccce Q lcl|Aclame:pro 116 ETSGSDSAAGLTIPQ---DIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLT 192 (408) Q Consensus 116 
~~~~t~~~gg~~vP~---~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~ 192 (408) |........--++|. ++.+.+-..+.+-..++...|..|+..++ .+.+|++. -.+.+.-++||+.+|- +..+.. T Consensus 1 mAe~nlt~~~dL~~~~sidfv~~f~~~i~~L~~~Lgi~r~~p~a~G~-tIt~pK~~-~tgda~dVaEGe~Ipl-skvt~~ 77 (295) T protein:vir:99 1 MAEKNLNTMADLGDIKSIDFVNKFSKNINDLLKLLGVTRRETLTNDL-KIQTYKWE-VTLDQTDPGEGETIPL-SKVTRT 77 (295) T ss_pred CCCcccccHhhccCceeehhhHHhhhhHHHHHHHhccccccccccCC-eEEeeeee-eecccccccCCcccch-hhheee Confidence 111111111122322 34444544444444455555778877654 66677654 3455567999999984 545544 Q ss_pred ---eeeechheeeeehHHHHHHHhcchH-HHHHHHHHHHHHHHHHHHHHHHhhccccccchhhhhh----HHHHHHHHHH Q lcl|Aclame:pro 193 ---IIKYLIKRYAGIITATNTSLKDTAE-NILAWLSSWIAKKVVVTRNQAIIEVMKAAPKKPTIAK----FDDVITMINT 264 (408) Q Consensus 193 ---~v~~~~~~~~~~~~iS~ell~ds~~-~~~~~v~~~l~~~~~~~~~~~~~~g~g~~~~~~~~~~----~d~i~~~~~~ 264 (408) ..+++.+|++..+ |.|.++.+.. +-...-.++|..++..+++..|+.-.++++....... +..+...+. T Consensus 78 ~~~t~t~kikK~rK~t--TdEAIqlsGygdpvgead~qL~~~ia~kId~D~~~~lktat~t~tg~~lq~a~a~~~~al~- 154 (295) T protein:vir:99 78 KDKDYTVKWFKKRRAT--TAEAIARHGAARAITEADKRIMRELQNGIKDAFFTFLKTKPTKVKGVGLQKALSASWAKLA- 154 (295) T ss_pred eeeeeEEEeeeecccc--cHHHHHhcCCCchhHHHHHHHHHHHHHhhhHHHHHHhccCceeeehhhHHHHHHHhhhhhh- Confidence 3677778877754 9999865543 3466778999999999999999988877665433222 122222221 Q ss_pred hhhhhccCCCEEEEcHHHHHHHHhhhcccCceeeccc--cccCCcccccccc-eEeeccccccccccCcceEEEE---eh Q lcl|Aclame:pro 265 AVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPD--PTKPNSYLIKGKQ-VIVVADRWLPNTGSTVYPLYYG---DM 338 (408) Q Consensus 265 ~l~~~~~~~a~~~~n~~~~~~l~~lkd~~G~~~~~~~--~~~~~~~~l~G~p-v~~~~~~~~~~~~~~~~~~~~g---d~ 338 (408) .....+..+.++++||.....+++-..-+ |+.. +...---.++|.. |+++.. +|. 
+.++.- |+ T Consensus 155 ~f~Ee~~~~~V~FVnP~D~a~yl~~A~~~----~~~a~~fG~~~L~nfLG~q~II~S~k--v~~-----G~~~aT~~~Ni 223 (295) T protein:vir:99 155 TFNEFEGSPLVSFVSPLDVANYLGDTKVG----ADASNVFGMTLLKNFLGMQNVIVMPS--VPE-----GKIYSTAVENL 223 (295) T ss_pred hcccccCCceEEEEehHHHHHHHhccccc----cchhhhhhhhhhhhhhccceEEEccc--CCC-----ceEEEeeccce Confidence 22334556678999999988876542211 2211 1000001378887 555332 221 111110 11 Q ss_pred hcceEeeeccceEEEEeccchhhhhhceeeEEEEee-------------eCcEE---ecccceEEEEeeccccCCCCc Q lcl|Aclame:pro 339 SQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDR-------------FDVKA---TDSEALVAGSFSAIADQVGNF 400 (408) Q Consensus 339 ~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r-------------~d~~v---~~~~a~~~l~~~~~~~~~~~~ 400 (408) .-+|.....+++.- .+ .+..|.+.+.+..+ +.+.. -++.++++.++++...+.... T Consensus 224 ~~ay~~~~~g~l~~-----~f-~~~~D~tglIg~~h~~~~~~~t~et~~~~~~~lfpE~~dgiv~~tI~~~~~~~~~~ 295 (295) T protein:vir:99 224 VFASLNVKGGDLGG-----LF-ADFTDETGLIAAARNRQLSNLTYESVFFGANVLFAEIPEGVVEATIEAAAVPGIGG 295 (295) T ss_pred EEEEecCCchhhhh-----hh-hhccCcccceEEEeccccceeeehhhhHhHHHhcccccceEEEEEEecCcCCCCCC Confidence 11222221122210 01 11223333333222 22333 344688888886655433333 No 158 >protein:vir:105645 Length: 400 # NCBI annotation: putative major capsid protein # Family: family:all:2806 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425009;genbank:gi:83571757;uniprot:Q2WC43;genbank:GeneID:3837286 Probab=98.07 E-value=4.8e-07 Score=55.20 Aligned_cols=293 Identities=13% Similarity=0.062 Sum_probs=145.4 Q ss_pred hhhHHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeecccCccceEEeeccCCccccchhccccccc Q lcl|Aclame:pro 105 MAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIP 184 (408) Q Consensus 105 ~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~E~~~~~ 184 (408) |...... ++-...++ ++--.+.=..|..++.......+.++++.++.++.+++ ++.+++.... ......-|+.. 
T Consensus 1 Ms~~n~~-t~p~~~gs-g~~~aL~Le~f~GeV~taF~~~si~~~~~~vRtI~~gk-S~qf~~lG~s--~a~y~~pG~~l- 74 (400) T protein:vir:10 1 MSTPNNL-TNVAVSAS-GEVDSLLIEKFNGKVNEQYLKGENIMSYFDVQTVTGTN-TVSNKYLGET--ELQVLAPGQSP- 74 (400) T ss_pred CCCCccc-cccccccc-cchhhhHHhHhcchHHHHHHHHhhhcccceeeeecccc-eEEEEEeeee--EEeeecCCCCc- Confidence 1000000 00000000 01112334567888888888999999999999887664 6666665333 33445555553 Q ss_pred ccccccceeeeechheeeee-hHH--HHHHHhcchHH-HHHHHHHHHHHHHHHHHHHHHhh----c-----------ccc Q lcl|Aclame:pro 185 DLDNPQLTIIKYLIKRYAGI-ITA--TNTSLKDTAEN-ILAWLSSWIAKKVVVTRNQAIIE----V-----------MKA 245 (408) Q Consensus 185 ~~~~~~f~~v~~~~~~~~~~-~~i--S~ell~ds~~~-~~~~v~~~l~~~~~~~~~~~~~~----g-----------~g~ 245 (408) +.+.+..++..+....+-.. ..| =+|.+ +.+| +.+.+.+++.+++++..|+.++. + .+. T Consensus 75 dg~~~~~dk~~ItIDtLL~a~~~V~dlDd~q--~~yD~vRse~s~e~G~ALA~~~Dq~iiq~i~~a~~a~t~~~~~~~~g 152 (400) T protein:vir:10 75 AATSTQADKNQLVIDATVIARNTVAHLHDVQ--GDIDSLKPKLATNQAKQLKKMEDEMLIQQMLLGGIANTQAKRTNPRV 152 (400) T ss_pred CCCCcccCcEEEEeCceeeecchhhhHHHHh--hccccccHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCCc Confidence 34445556666665554422 223 23333 3455 78999999999999999987752 1 011 Q ss_pred c----------cchhhhhhHHHHH----HHHHHhhhhhcc--CCCEEEEcHHHHHHHHhhhcc-cCceeec--cccccCC Q lcl|Aclame:pro 246 A----------PKKPTIAKFDDVI----TMINTAVDPAII--ATSSLLTNQSGLNKLALVKTA-EGKYLLE--PDPTKPN 306 (408) Q Consensus 246 ~----------~~~~~~~~~d~i~----~~~~~~l~~~~~--~~a~~~~n~~~~~~l~~lkd~-~G~~~~~--~~~~~~~ 306 (408) . .......+.+.+. .+... +...+. ..-++++.|..|..|..-.-- |-.|... .++..+. 
T Consensus 153 ~~~g~s~~v~~~~~~~~~~~~~l~~A~~~A~~~-LdEkdVP~~d~vvl~pp~~Ys~Ll~~dkLvnrdf~~s~~g~~~~g~ 231 (400) T protein:vir:10 153 KGHGFSVNVEVNEGEALVNPQYVMAAVEFALEQ-QLEQEVDISDVAILMPWRYFNVLRDADRIVDKSYTISQSGATIQGF 231 (400) T ss_pred cccccceeecccccccccCHHHHHHHHHHHHHH-HHhcCCCccceEEEcCHHHHHHHHhCCcccchhccccCCCccccce Confidence 0 0011111222232 22221 222221 123678889999887532100 0011111 1122233 Q ss_pred cccccccceEeeccccccccc----------cCcceE--EEEehhcceE-eeeccc-eEEEEeccchhhh---hhceeeE Q lcl|Aclame:pro 307 SYLIKGKQVIVVADRWLPNTG----------STVYPL--YYGDMSQAIT-LFDREN-MSLLPTNIGAGAF---ETDTTKI 369 (408) Q Consensus 307 ~~~l~G~pv~~~~~~~~~~~~----------~~~~~~--~~gd~~~~~~-~~~~~~-~~i~~~~~~~~~f---~~~~~~~ 369 (408) ..+++|+||+.+.+ +|... ++.+.. +-|||+.... .|.+.. ..++.-+.....| .+-...+ T Consensus 232 v~~v~Gv~Iv~Sn~--lP~~a~~~~~~~lS~a~~G~~y~~t~d~s~~~av~F~~sAv~tvk~~~lt~~~~~d~r~~~~~i 309 (400) T protein:vir:10 232 VLSSYNCPVIPSNR--FPKYSQGQKHHLLSNEDNGYRYDPIAEMNGAIAVLFTADALLVGRSIDVIGDIFYEKKEKTYYI 309 (400) T ss_pred EEEEeceEEEeeCc--CCcccCcccccccccCCCCccCCccccccceeEEEEehhheEEEEeeccccccccchhhHHHHH Confidence 35799999998654 55321 111222 3477765332 333332 3333322222222 1223344 Q ss_pred EEEeeeCcEEecccceEEEEeeccccCCCCccCCC-cccC Q lcl|Aclame:pro 370 RVIDRFDVKATDSEALVAGSFSAIADQVGNFKTTT-STAV 408 (408) Q Consensus 370 r~~~r~d~~v~~~~a~~~l~~~~~~~~~~~~~~~~-~~~~ 408 (408) .+.+-+|..+.+|++..+++.+-.++.....+..+ -.+| T Consensus 310 d~~~a~G~g~~RPeaa~vv~~~~~~~~~~~~~~~~~~~~~ 349 (400) T protein:vir:10 310 DTFMSEGAIPDRWEAVSVVTTKRQSTGAVDSGNAAQHTQV 349 (400) T ss_pred HHHHHhCCcccchhheEEEEecCCcccccccCcchhHHHH Confidence 56666899999999999998776654433322111 1111 No 159 >protein:vir:102655 Length: 322 # NCBI annotation: Hypothetical protein # Family: family:all:6384 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_052979;genbank:gi:50282923;genbank:GeneID:2948122 Probab=98.03 E-value=1.9e-06 Score=51.92 Aligned_cols=272 Identities=13% Similarity=0.003 Sum_probs=136.8 Q ss_pred 
HHHHHhhccccccCceecchh----hhhhhhhhhh-hhhhhhhhhceeecccCccceEEeeccCCccccc-------hhc Q lcl|Aclame:pro 111 VSSKTETSGSDSAAGLTIPQD----IRTMINTLVR-QYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTV-------MDA 178 (408) Q Consensus 111 ~~~~a~~~~t~~~gg~~vP~~----~~~~ii~~~~-~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~~~-------~~~ 178 (408) ....+...+... =+..||.. |..++.-..+ ..+.|++.++..+-.+.+..+..+. ....... -.+ T Consensus 1 ~~~~~~~~~~~~-Ms~~i~~~fv~qy~~~v~~~~qq~~s~L~~tV~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~ 77 (322) T protein:vir:10 1 MKLNAIMSMLPL-IAGDIDQAFVQTYETTLRILSQQKSAKLKQYCQHKNESSESHNWETLA--SMDPDAVKRKRSRQQSA 77 (322) T ss_pred Ccccceeeeeee-eechhhhHHHHHHHHHHHHHHHHhhhhhhcccccccccccccceeecc--ccccccccccccccccc Confidence 000011111000 01124444 4444444333 4456666555443333322222221 1111110 011 Q ss_pred ccc-ccccccc-ccceeeeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhccccc-------cc- Q lcl|Aclame:pro 179 EDG-KIPDLDN-PQLTIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVMKAA-------PK- 248 (408) Q Consensus 179 E~~-~~~~~~~-~~f~~v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~~g~g~~-------~~- 248 (408) .+. +.|.... .....+.+..+.. ...|.+.-+-+...|..+...+..+.+++++.|..++.+.-+. ++ T Consensus 78 d~~~dtp~~~~~~~~r~~~~~d~~~--~~~VDd~D~~k~~~D~~~~~~~~~a~AL~R~~D~~I~~a~~g~a~~~~~gt~v 155 (322) T protein:vir:10 78 DGTYPTPVNNKPFAKRRTNVDTYDT--GHVVEQEDISQMLLDPNSALITSQAYAMARKTDDLIIAGAWKPASIKGTGQPV 155 (322) T ss_pred CcccCCCccccccceEEEeeccccc--ceecchHHHHHhhcCchHHHHHHHHHHhhhHHHHHHHhhhhcccccccccccc Confidence 111 1111111 1222344444443 4455555555556788899999999999999999888642111 00 Q ss_pred ----------hhhhhhHHHHHHHHHHhhhhhccCC---CEEEEcHHHHHHHHhhhcc-cCceeecccc-ccCCccccccc Q lcl|Aclame:pro 249 ----------KPTIAKFDDVITMINTAVDPAIIAT---SSLLTNQSGLNKLALVKTA-EGKYLLEPDP-TKPNSYLIKGK 313 (408) Q Consensus 249 ----------~~~~~~~d~i~~~~~~~l~~~~~~~---a~~~~n~~~~~~l~~lkd~-~G~~~~~~~~-~~~~~~~l~G~ 313 (408) .....+++.++.+.. .++...-+. 
-.++++|..|..|.....- +..|.-...+ .+|..++++|+ T Consensus 156 ~~~ss~~i~~g~~g~t~~kl~~a~~-~l~~~dvp~d~~R~~vv~p~~~~~LL~d~~~ts~D~~~~~~l~~~G~ig~~lGf 234 (322) T protein:vir:10 156 EFLATQEIGDGTKPISFDYVTEITE-RFLENEIEPEVSKVIVIGPTQARKLLQITEATSADYTSAMDLQSKGIITNWMGY 234 (322) T ss_pred ccCCCcccccCccchhHHHHHHHHH-HHHhcCCCCCCCeEEEeCHHHHHHHhcchhhhhhhcccchhhhhcCeeeeeeeE Confidence 112334666665543 355544442 3578899999887654322 2334433334 33556789999 Q ss_pred ceEeecccccccccc-------------CcceEEEEehhcceEeeeccceEEEEeccchhhhhhceeeEEEEeeeCcEEe Q lcl|Aclame:pro 314 QVIVVADRWLPNTGS-------------TVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKAT 380 (408) Q Consensus 314 pv~~~~~~~~~~~~~-------------~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~ 380 (408) .++...+ +|.+.. .....+++- +.++.+....+++.+++..... .+...+++..-+|..++ T Consensus 235 ~~i~s~~--lp~~~~t~~~~~~~~~~~~~~~~~~a~~-k~Av~~a~~~dv~~~i~~~~~~---~~a~~I~~~~~~Ga~ri 308 (322) T protein:vir:10 235 TWIVSTR--LDKFDPTQWGMAAEDGPQGDEIWCIAMT-DMALGYHSCKDIWTKVAEDPSA---SFAWRIYSAFTADCVRV 308 (322) T ss_pred EEEEecc--CCccccccccccccCCCCccceeEEEEe-cCceeEEEeeeeeEEeeccCCc---chhhhhhhhhhhCceEe Confidence 9988643 442211 122233333 3356666566666666433322 22334556677999999 Q ss_pred cccceEEEEeeccc Q lcl|Aclame:pro 381 DSEALVAGSFSAIA 394 (408) Q Consensus 381 ~~~a~~~l~~~~~~ 394 (408) +|+.++.+..+..= T Consensus 309 ~~~gVv~i~~~e~~ 322 (322) T protein:vir:10 309 EDEHIFKLRLKNSL 322 (322) T ss_pred ccCcEEEEEEeccC Confidence 99999999987655 No 160 >protein:vir:9875 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795637;genbank:gi:28876404;genbank:GeneID:1257935 Probab=97.94 E-value=7.7e-07 Score=54.09 Aligned_cols=264 Identities=13% Similarity=0.082 Sum_probs=133.3 Q ss_pred HhhcchhhHHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeecccCccce-EEeeccCCccccchhc Q lcl|Aclame:pro 100 MVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSR-VYEKWTDVTPLTVMDA 178 (408) Q Consensus 
100 ~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~~-~~~~~~~~~~~~~~~~ 178 (408) ++.+. ..- ..+..+..+-+.....+|.+++-..+.+-.-++...|..|+..++ .+ .++.+ .-.+.+.-++ T Consensus 1 ~~~~~-----~~~--e~nlt~~~dl~~~~siDf~~~f~~~i~~L~~~LGv~r~~pla~Gs-tIkt~k~~-~y~gda~dVa 71 (296) T protein:vir:98 1 MVTSR-----TYP--EENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGM-TLKTYAGY-DVTLAEGNVP 71 (296) T ss_pred CCCcc-----ccC--cCCCcchhhhhhhhhhhhHHHHhhhHHHHHHHhhhcccccccCCC-EEeeccce-eeeecccccc Confidence 11000 000 001112223344455677777777666666666667888887654 34 23333 3445556799 Q ss_pred cccccccccccccee---eeechheeeeehHHHHHHHhcchH-HHHHHHHHHHHHHHHHHHHHHHhhccccccchhhhhh Q lcl|Aclame:pro 179 EDGKIPDLDNPQLTI---IKYLIKRYAGIITATNTSLKDTAE-NILAWLSSWIAKKVVVTRNQAIIEVMKAAPKKPTIAK 254 (408) Q Consensus 179 E~~~~~~~~~~~f~~---v~~~~~~~~~~~~iS~ell~ds~~-~~~~~v~~~l~~~~~~~~~~~~~~g~g~~~~~~~~~~ 254 (408) ||+.+|- +..+... .+++.+|++.-+ |.|.++.+.. +-...-.++|...+..+++..|+.-.++++.... .. T Consensus 72 EGe~Ipl-skvt~~~~~t~t~~ikK~rK~t--TdEAIqlsGyg~aVgetd~qL~~~iq~kId~d~~t~LktaT~t~~-~t 147 (296) T protein:vir:98 72 EGEVIPL-SKVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGTQD-AL 147 (296) T ss_pred CCcccch-hhheeeecceEEEEeecccccc--CHHHHHhhcCCchhHHHHHHHHHHHHHhhhHHHHHHHhcccceee-ec Confidence 9999984 5455443 677778887774 9999865443 3466778899999999999999988766543211 23 Q ss_pred HHHHHHHHH-------HhhhhhccCCCEEEEcHHHHHHHHhhhcccCceeeccccccCC-cccccccceEeecccccccc Q lcl|Aclame:pro 255 FDDVITMIN-------TAVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPN-SYLIKGKQVIVVADRWLPNT 326 (408) Q Consensus 255 ~d~i~~~~~-------~~l~~~~~~~a~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~-~~~l~G~pv~~~~~~~~~~~ 326 (408) .+.+..++. ...........++++||.....++ ++++ +-.....+.. .-.++|.-|+.+.. +|. 
T Consensus 148 ~~~lQ~Ala~~~~~l~~~feded~~~~V~FVnP~D~a~yl--g~a~---it~qt~fG~tyl~nfLG~~II~S~k--V~~- 219 (296) T protein:vir:98 148 GAGLQGALASAWGKLQVLFEDYGSERAIVFANSLDVAEYI--AKAG---ITTQTAFGLTYLVDFTGTVIISTND--VTK- 219 (296) T ss_pred hhhHHHHHHHHhhhhhhhccccCCCceEEEEehHHHHHHh--cCCc---cchhheechhhhhhccccEEEEcCc--CCC- Confidence 344444432 112222234668899999877653 3332 1001111111 01277865544322 232 Q ss_pred ccCcceEEEE---ehhcceEeeeccceEEEEeccchhhhhhceeeEEEEee-------------eCcEEe---cccceEE Q lcl|Aclame:pro 327 GSTVYPLYYG---DMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDR-------------FDVKAT---DSEALVA 387 (408) Q Consensus 327 ~~~~~~~~~g---d~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r-------------~d~~v~---~~~a~~~ 387 (408) +.++.- |+.-+|.....+++.-.+ .+..|.+.+.+..+ +.+..+ ++.++++ T Consensus 220 ----G~~~~T~~~Ni~~ay~~~~~~~l~~~f------~~~~d~tglIGv~h~~~~~~~t~eT~~~~~~~lfpE~~dgiv~ 289 (296) T protein:vir:98 220 ----GEIWATVPENIIFAYINPNNSELAKEF------NLYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIVK 289 (296) T ss_pred ----ceEEEeeecceEEEeecccccchhhhh------ccccccccceEEEeccccceeeehhHhHhHHHhcccccceEEE Confidence 111111 111122221112121111 11223333333222 223333 3467888 Q ss_pred EEeeccc Q lcl|Aclame:pro 388 GSFSAIA 394 (408) Q Consensus 388 l~~~~~~ 394 (408) .++++.- T Consensus 290 ~tI~~~~ 296 (296) T protein:vir:98 290 VTLTPGV 296 (296) T ss_pred EEecCCC Confidence 7775544 No 161 >protein:vir:103285 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277465;genbank:gi:71834107;genbank:GeneID:3562396 Probab=97.89 E-value=1.7e-06 Score=52.25 Aligned_cols=266 Identities=10% Similarity=-0.006 Sum_probs=139.4 Q ss_pred hhcc-ccccCceecch--hhhhhhhhhhhhhhhhhhhhceee-cccCccceEEeeccCCccccchhccccc-cccccccc Q lcl|Aclame:pro 116 ETSG-SDSAAGLTIPQ--DIRTMINTLVRQYDSLQQYVRVES-VSTSNGSRVYEKWTDVTPLTVMDAEDGK-IPDLDNPQ 190 (408) Q Consensus 116 ~~~~-t~~~gg~~vP~--~~~~~ii~~~~~~~~l~~~~~~~~-~~~~~g~~~~~~~~~~~~~~~~~~E~~~-~~~~~~~~ 190 (408) ++.- .++.|.+++.+ 
.+.+.+++...+....+.++.+.. ++...-++.+... +..+.+.|.+.++. +|..+ .. T Consensus 1 ~~~~~a~~~~~f~~~ql~~id~~v~e~~~~~l~~~~~i~v~~~~~~~~~~~~~~~~-~~~G~a~~~~~~~~dip~v~-~~ 78 (296) T protein:vir:10 1 MGVDKADAAGIWTVKQLTASLNKAYETEYDQNSVVNLFPVSNEIPGYAKYFEYPVF-DGVGIAQIVADYTDDLPLVD-AL 78 (296) T ss_pred CcccchhhhHHHHHHHHHHHHHHHHhhhhcccccceecccccCCCCceeEEEeeee-eccCceeEeCCCccccceee-cc Confidence 1111 12223344433 355667776666555555544332 1111123333222 23355567766554 44333 45 Q ss_pred ceeeeechheeeeehHHHHHHHhcc---hHHHHHHHHHHHHHHHHHHHHHHHhhcccccc------------------ch Q lcl|Aclame:pro 191 LTIIKYLIKRYAGIITATNTSLKDT---AENILAWLSSWIAKKVVVTRNQAIIEVMKAAP------------------KK 249 (408) Q Consensus 191 f~~v~~~~~~~~~~~~iS~ell~ds---~~~~~~~v~~~l~~~~~~~~~~~~~~g~g~~~------------------~~ 249 (408) ++......+.++.-..++..=++.+ ..++..--....++++.+.+|+-+++|+.... .. T Consensus 79 ~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~ka~aA~~~~~~~~n~~~f~G~~~~g~~GLlN~p~v~~~~~~~~W~ 158 (296) T protein:vir:10 79 ATERQGKVFRFGNAFLISIDEIKVGQATGQSLSTRKQSLAFEAHDKLLDKLVWSGSTAHGIPSVFDYPNINNVVSGGSWS 158 (296) T ss_pred ceeEEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEeecccccceeEeecCCCccccccCCcc Confidence 6777777788787777765544433 35677777888889999999999999876321 00 Q ss_pred hhhhhHHHHHHHHHHhh--hhhccCCCEEEEcHHHHHHHHhhhcccCceeeccccccCCcccccccceEeeccccccccc Q lcl|Aclame:pro 250 PTIAKFDDVITMINTAV--DPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNSYLIKGKQVIVVADRWLPNTG 327 (408) Q Consensus 250 ~~~~~~d~i~~~~~~~l--~~~~~~~a~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~ 327 (408) .....++|+..++.... ...+..+..++++|..+..|...-+..|.-++.---.+..+.+|.+.|... +... 
T Consensus 159 ~~t~i~~Di~~~~~~l~~~s~g~~~p~~l~L~p~~~~~L~~~~~~~~~t~l~~ik~~~~~l~i~~~~~l~------~a~~ 232 (296) T protein:vir:10 159 QPTTAVSDITSLLDIIETSTNGQHRATHLLLPTTARRIMQNLVPGTSVSYGEFFRQNNSGVTVEFVQYLN------DYNG 232 (296) T ss_pred CHHHHHHHHHHHHHHHHHhhCceecceeEEeCHHHHHHHhhccCCCCccHHHHHHHhcCCceEEEeeeec------cCCC Confidence 11122455555543211 123344557899999999887654544432222101111222344444322 1223 Q ss_pred cCcceEEEEehh-cceEeeeccceEEEEeccchhhhhhceeeEEEEeeeC-cEEecccceEEEEeeccc Q lcl|Aclame:pro 328 STVYPLYYGDMS-QAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFD-VKATDSEALVAGSFSAIA 394 (408) Q Consensus 328 ~~~~~~~~gd~~-~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d-~~v~~~~a~~~l~~~~~~ 394 (408) .++..+++.+-+ +++.+..-..++. .+.. ...-.+.+++..|++ +.+.+|.||+.++.-+.+ T Consensus 233 ~g~~~~v~~~~~~~~~~~~v~~~~~~--~~~e---~~~l~~~~~~~~~~~Gv~i~~P~ai~~~dGI~~~ 296 (296) T protein:vir:10 233 TGTSAAIAYEKDPNNMAIEIPEATNA--LPAQ---PKDLHFKIPVTSKATGLIVYRPLTMAVMKGITFA 296 (296) T ss_pred CcceEEEEEEcCCceEEEEcCcceee--eccc---ccCceEEEeeEeeEEEEEEECCceeEEEeeeecC Confidence 344444444432 2333222233332 2211 122345667788885 799999999999988887 No 162 >protein:vir:107687 Length: 319 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003898;genbank:gi:45686314;genbank:GeneID:2773027 Probab=97.65 E-value=8.2e-06 Score=48.47 Aligned_cols=280 Identities=11% Similarity=0.020 Sum_probs=134.9 Q ss_pred cchhhhHHHHHHHHHHHhhcchhhHHHHHHHHhhcccc----ccCceecc---hhhhhhhhhhhhhhhhhhhhhceeec- Q lcl|Aclame:pro 84 KSENELKDKFVKDFVNMVRNPMAFMNTVSSKTETSGSD----SAAGLTIP---QDIRTMINTLVRQYDSLQQYVRVESV- 155 (408) Q Consensus 84 ~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~a~~~~t~----~~gg~~vP---~~~~~~ii~~~~~~~~l~~~~~~~~~- 155 (408) .....-.+.+ ...+..+....+.. .+.|+..- +.+.+.+++...+....+.++.+... 
T Consensus 1 ~~~~~~~~~~--------------~~~~~~~~~~~~~~~da~~~~g~~~~~ql~~id~~v~e~~~~~l~~~~~i~v~~~~ 66 (319) T protein:vir:10 1 MTTKKFDEAD--------------KSNVEMYLIQAGVKQDAAATMGIWTAQELHRIKSQSYEEDYPVGSALRVFPVTTEL 66 (319) T ss_pred CCCcchhHHh--------------hHHHHHHHhhccchhhhhhhhhhHHHHHHHHHHHHHHhhhhcceechhhcccccCC Confidence 1111000001 11111111111111 12233323 23456677777666666666554321 Q ss_pred ccCccceEEeeccCCccccchhccccc-ccccccccceeeeechheeeeehHHHHHHHhcc---hHHHHHHHHHHHHHHH Q lcl|Aclame:pro 156 STSNGSRVYEKWTDVTPLTVMDAEDGK-IPDLDNPQLTIIKYLIKRYAGIITATNTSLKDT---AENILAWLSSWIAKKV 231 (408) Q Consensus 156 ~~~~g~~~~~~~~~~~~~~~~~~E~~~-~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds---~~~~~~~v~~~l~~~~ 231 (408) +...-++.+... +..+.+.|.+.++. +|..+ ..++......+.++....++..=++.+ ..++..--....++++ T Consensus 67 ~~~~~~~~~~~~-~~~G~a~~~~d~~~dip~v~-~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~ 144 (319) T protein:vir:10 67 SPTDKTFEYMTF-DKVGTAQIIADYTDDLPLVD-ALGTSEFGKVFRLGNAYLISIDEIKAGQATGRPLSTRKASACQLAH 144 (319) T ss_pred CCceEEEEeeee-ccccceeeecCcccccccee-ccceeeEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHH Confidence 111112333222 33456677777655 45433 456677777777777777765444432 3567777788888999 Q ss_pred HHHHHHHHhhcccccc----------------c--hhhhhhHH----HHHHHHHHhh--hhhccCCCEEEEcHHHHHHHH Q lcl|Aclame:pro 232 VVTRNQAIIEVMKAAP----------------K--KPTIAKFD----DVITMINTAV--DPAIIATSSLLTNQSGLNKLA 287 (408) Q Consensus 232 ~~~~~~~~~~g~g~~~----------------~--~~~~~~~d----~i~~~~~~~l--~~~~~~~a~~~~n~~~~~~l~ 287 (408) .+.+|+-+++|+.... . .....+.+ ++..++.... ......+..++++|+.|..|. 
T Consensus 145 ~~~~n~i~f~G~~~~g~~GLlN~p~~~~~~~~~~~~~~t~t~~~i~~di~~~~~~l~~~s~g~~~p~~L~L~p~~~~~L~ 224 (319) T protein:vir:10 145 DQLVNRLVFKGSAPHKIVSVFNHPNITKITSGKWIDVSTMKPETAEAELTQAIETIETITRGQHRATNILIPPSMRKVLA 224 (319) T ss_pred HHhhceEEEeecccccceeEEeCCCceeeecCCCCCccccCHHHHHHHHHHHHHHHHHhcCceeeceEEEecHHHHHhhh Confidence 9999999998865311 0 00111233 3333332211 112334557899999999997 Q ss_pred hhhcccCceeecccccc-CCcccccccceEeeccccccccccCcceEEEEeh-hcceEeeeccceEEEEeccchhhhhhc Q lcl|Aclame:pro 288 LVKTAEGKYLLEPDPTK-PNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDM-SQAITLFDRENMSLLPTNIGAGAFETD 365 (408) Q Consensus 288 ~lkd~~G~~~~~~~~~~-~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~-~~~~~~~~~~~~~i~~~~~~~~~f~~~ 365 (408) ......|.-++.- +.. ..+-+|.+.|... +....++..+++..- .+++.+..-..++. .+... ..= T Consensus 225 ~~~~~~~~t~l~~-lk~~~~~l~I~~~pel~------~ag~~g~~~~v~y~~~~~~~~~~v~~~~~~--~~~e~---~~l 292 (319) T protein:vir:10 225 IRMPETTMSYLDY-FKSQNSGIEIDSIAELE------DIDGAGTKGVLVYEKNPMNMSIEIPEAFNM--LPAQP---KDL 292 (319) T ss_pred cccCCCCeeHHHH-HHHhcCCceEEEeeeec------ccCCCcceEEEEEecCCceEEEecCcceee--eeeee---cCc Confidence 5555445433321 211 1222344444332 122334444444433 23333222233332 22211 111 Q ss_pred eeeEEEEeeeC-cEEecccceEEEEee Q lcl|Aclame:pro 366 TTKIRVIDRFD-VKATDSEALVAGSFS 391 (408) Q Consensus 366 ~~~~r~~~r~d-~~v~~~~a~~~l~~~ 391 (408) .+.+.+..|++ +.+.+|.||+.++.- T Consensus 293 ~~~~~~~~r~~Gv~i~~P~ai~~~dGI 319 (319) T protein:vir:10 293 HFKVPCTSKCTGLTIYRPMTIVLITGV 319 (319) T ss_pred eEEEeeeeeeEEEEEEccceeEeeecC Confidence 23344566665 678999999999866 No 163 >protein:vir:80068 Length: 301 # NCBI annotation: gp8 # Family: family:all:463 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468712;genbank:gi:157325292;genbank:GeneID:5601759 Probab=97.63 E-value=8.5e-06 Score=48.38 Aligned_cols=259 Identities=10% Similarity=-0.024 Sum_probs=133.9 Q ss_pred ccccccCceecch--hhhhhhhhhhhhhhhhhhhhceeec-ccCccceEEeeccCCccccchhccccc-cccccccccee Q lcl|Aclame:pro 118 
SGSDSAAGLTIPQ--DIRTMINTLVRQYDSLQQYVRVESV-STSNGSRVYEKWTDVTPLTVMDAEDGK-IPDLDNPQLTI 193 (408) Q Consensus 118 ~~t~~~gg~~vP~--~~~~~ii~~~~~~~~l~~~~~~~~~-~~~~g~~~~~~~~~~~~~~~~~~E~~~-~~~~~~~~f~~ 193 (408) -.++++|.+++-. .+.+.+++.+.+....+.++.+... +...-.+.+.. .+..+.+.|.+.++. +|. ....++. T Consensus 1 ~~~~~~g~f~~~~l~~id~~v~e~~~~~l~~r~l~~v~~~~~~~~~~~~~~~-~~~~G~~~~~~~~~~dip~-~~~~~~~ 78 (301) T protein:vir:80 1 MQGKITATIEARDLQAIDNVIYEPKQEELTARSVFPQKFDVNEGAESYSFDV-MTRSGAAKIIANGADDLPL-VDVDMVR 78 (301) T ss_pred CCccccchhhHHHHHHHHHHHHHhhhhhhhhhhhcccccCCCCceEEEEEee-eccceeEEEecCccccccc-cccccee Confidence 2344555544332 3557778888777777776554322 11111222222 234456677777655 343 3345667 Q ss_pred eeechheeeeehHHHHHHHhcc---hHHHHHHHHHHHHHHHHHHHHHHHhhcccccc-------ch--------h----- Q lcl|Aclame:pro 194 IKYLIKRYAGIITATNTSLKDT---AENILAWLSSWIAKKVVVTRNQAIIEVMKAAP-------KK--------P----- 250 (408) Q Consensus 194 v~~~~~~~~~~~~iS~ell~ds---~~~~~~~v~~~l~~~~~~~~~~~~~~g~g~~~-------~~--------~----- 250 (408) .......++.-..++..=++.+ ..++..--....++++.+.+|+-+++|+.... +. . T Consensus 79 ~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~f~G~~~~g~~GLlN~p~~~~~~~~~~~~~~~ 158 (301) T protein:vir:80 79 KSVPIYSIGIGLSYTIQDLRAARMQGTTVDAAKATTVRRAIAEKENSIAFRGEKKYAIKGAFEATGIQIDVSPTTGVGNV 158 (301) T ss_pred EEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEeeecccccceeeecCCCcccccccCcccccc Confidence 7777777777666665444432 45677777888899999999999999865321 00 0 Q ss_pred ---hhhhH----HHHHHHHHHhhh--hhccCCCEEEEcHHHHHHHHhhh--cccCceeeccccc-cCCcccccccceEee Q lcl|Aclame:pro 251 ---TIAKF----DDVITMINTAVD--PAIIATSSLLTNQSGLNKLALVK--TAEGKYLLEPDPT-KPNSYLIKGKQVIVV 318 (408) Q Consensus 251 ---~~~~~----d~i~~~~~~~l~--~~~~~~a~~~~n~~~~~~l~~lk--d~~G~~~~~~~~~-~~~~~~l~G~pv~~~ 318 (408) ...+. +++..++..... .....+-.++++|+.+..|.... +..|.-++.- +. +....+|.+.|-.. 
T Consensus 159 ~~w~~~t~~ei~~di~~~~~~l~~~s~g~~~p~~L~L~p~~~~~L~~~~~~~~~~~tvl~~-l~~~~~~~~I~~~p~L~- 236 (301) T protein:vir:80 159 SKWEKKTAEQIIDEIGEAHTKITVLPGYGTASLKLCLPPKQFELINKKRYSNEDSRSVLKV-LQDNAWFSAIVRVPDLA- 236 (301) T ss_pred cccccCCHHHHHHHHHHHHHHHHHhcCceecccEEEecHHHHHhhhhccccCCCCeeHHHH-HHHHcCcceEEEcceec- Confidence 01123 344333322111 12234457999999999997543 3334333321 11 11112344444322 Q ss_pred ccccccccccCcceEEEE-ehhcceEeeeccceEEEEeccchhhhhhc-eeeEEEEeee-CcEEecccceEEEEee Q lcl|Aclame:pro 319 ADRWLPNTGSTVYPLYYG-DMSQAITLFDRENMSLLPTNIGAGAFETD-TTKIRVIDRF-DVKATDSEALVAGSFS 391 (408) Q Consensus 319 ~~~~~~~~~~~~~~~~~g-d~~~~~~~~~~~~~~i~~~~~~~~~f~~~-~~~~r~~~r~-d~~v~~~~a~~~l~~~ 391 (408) +....+++.+++. +=.+.+.+..-..++ ..+... ++ .+...++.|+ |+.+.+|.||+.++.- T Consensus 237 -----~~g~~g~~~~v~~~~~~d~~~~~v~~~~~--~~~~e~----~~~~~~~~~~~r~~Gv~i~~P~ai~~~~GI 301 (301) T protein:vir:80 237 -----GMGTAGSDSFAVIHDSNETAELIIPMDIT--RHPEEY----SFPRTKVPFEERTAGVVVRFPAAIVRVDGI 301 (301) T ss_pred -----cCCCCcccEEEEEecCCcEEEEEecCcee--eeccee----cCceeEeeeeeeeEEEEEEccceEEEEecC Confidence 1122333333333 222223322223333 222111 12 1223345666 5689999999999966 No 164 >protein:vir:106647 Length: 303 # NCBI annotation: ORF011 # Family: family:all:1178 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239493;genbank:gi:66395226;genbank:GeneID:4555801 Probab=97.60 E-value=3.9e-06 Score=50.23 Aligned_cols=264 Identities=12% Similarity=0.072 Sum_probs=133.9 Q ss_pred hhhHHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeecccCccceEEeec--cCCccccchhccccc Q lcl|Aclame:pro 105 MAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKW--TDVTPLTVMDAEDGK 182 (408) Q Consensus 105 ~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~--~~~~~~~~~~~E~~~ 182 (408) +... .+..+..+-+..+..+|.+++-..+.+-.-++...|..|+..++ .+..+++ .+-.+.+.-++||+. 
T Consensus 1 M~~e-------~nl~~~~dL~~a~siDF~~~f~~~i~~L~~~LGv~r~~pla~Gt-~iktyK~~~~~y~gda~dVaEGe~ 72 (303) T protein:vir:10 1 MSAE-------NNLINVEALGKAKSIDFANKLGVGLNKLFEALAIQNKIPMNVGS-ALKQYRFKVEDSEKPNGDVAEGDV 72 (303) T ss_pred CCCC-------cCCcchhhcccceeehhhhhhhhhHHHHHHHhhhhccccccCCc-eeeeeeeeceeeccccccccCCcc Confidence 1000 01111222334455577777766666666666667888877544 2322221 123345567999999 Q ss_pred ccccccccce---eeeechheeeeehHHHHHHHhcchH-HHHHHHHHHHHHHHHHHHHHHHhhccccccc-----hhhhh Q lcl|Aclame:pro 183 IPDLDNPQLT---IIKYLIKRYAGIITATNTSLKDTAE-NILAWLSSWIAKKVVVTRNQAIIEVMKAAPK-----KPTIA 253 (408) Q Consensus 183 ~~~~~~~~f~---~v~~~~~~~~~~~~iS~ell~ds~~-~~~~~v~~~l~~~~~~~~~~~~~~g~g~~~~-----~~~~~ 253 (408) || .+..+.. ..+++.+|++..+ |.|.++.+.. +-...-.++|...+..+++..|+.-..+++. ..+.. T Consensus 73 Ip-lskvt~~~~~t~~~~~kK~rK~t--TdEAIqlsGyg~aVgetd~qL~~~Iq~kIdnd~~~~lktaT~t~~~t~~t~~ 149 (303) T protein:vir:10 73 IP-LTKVTREQVDITELQFAKYRKST--SAEAIQAHGYDLAINQTDNEMIKYVQKKFRAKFFETLKSAIENGKRTNKTKL 149 (303) T ss_pred cc-hhhheeeecceEEEEeecccccc--cHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhhcccccccccceee Confidence 99 4544533 4678888888855 9999865443 3466677889999999999988876655431 22334 Q ss_pred hHHHHHHHHHHh---h---hhhccCCCEEEEcHHHHHHHHhhhcccCc-eeeccccccCCcccccccceEeecccccccc Q lcl|Aclame:pro 254 KFDDVITMINTA---V---DPAIIATSSLLTNQSGLNKLALVKTAEGK-YLLEPDPTKPNSYLIKGKQVIVVADRWLPNT 326 (408) Q Consensus 254 ~~d~i~~~~~~~---l---~~~~~~~a~~~~n~~~~~~l~~lkd~~G~-~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~ 326 (408) +.+.+..++... + ... ..+.++++||.+...++.-..-+.+ --|--.+. -.++|.-|+++.. +|.. 
T Consensus 150 s~~glq~Al~~~~~kl~~~~ed-~~~~V~FvNP~Daa~yl~~A~i~~~~t~fG~n~L----~nfLG~~II~S~k--v~~G 222 (303) T protein:vir:10 150 SAENLQGALSKGRANLSVLLDD-EITPIAFVNPNDTAEYLANGFINSTGAQFGVNLL----TPYVGVKIVEFAD--VPQG 222 (303) T ss_pred cHHHHHHHHHhhhhhccccccc-cccEEEEEchHHHHHHhhcCCcchhhhhhhhhhh----hhhhcceEEEecc--CCCc Confidence 466665555321 1 222 3456899999998887532111101 00100011 1378887765432 2221 Q ss_pred -----ccCcceEEEEehhcceEeeeccceEEEEeccchhhhhhceeeEEEEee-------------eCcEEe---cccce Q lcl|Aclame:pro 327 -----GSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDR-------------FDVKAT---DSEAL 385 (408) Q Consensus 327 -----~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r-------------~d~~v~---~~~a~ 385 (408) ....=.+.+++.+. -+ ...+. |..|.+.+.+..+ +.+..+ ++.++ T Consensus 223 ~~~~T~~~Ni~~ay~~~~g-~l---~~~f~----------~t~D~tglIGv~h~~~~~~~t~eT~~~~~~~lfpE~~dgi 288 (303) T protein:vir:10 223 EVWMTVAENLNVAYANPRG-EL---SRAFA----------FATDATGFVGVLHDIQPQRLTSDTIYASAISMFPENIDAV 288 (303) T ss_pred eEEEeeccceEEEEecCch-hh---hhhhh----------hccccccceEEEeccccceeeehhHhHhHHHhcccccceE Confidence 11111122333221 00 01111 2222333322222 223333 44688 Q ss_pred EEEEeeccccCCCCccC Q lcl|Aclame:pro 386 VAGSFSAIADQVGNFKT 402 (408) Q Consensus 386 ~~l~~~~~~~~~~~~~~ 402 (408) ++.++++.- ++.++. 
T Consensus 289 v~~ti~~~e--~~~~~~ 303 (303) T protein:vir:10 289 IKVTIKKDE--AGELPS 303 (303) T ss_pred EEEEEeccc--cCCCCC Confidence 888775543 111111 No 165 >protein:vir:93966 Length: 400 # NCBI annotation: structural protein # Family: family:all:2417 # MgeID: mge:1487 # MgeName: jj50 # Cross-refs: genbank:acc:YP_764320;genbank:gi:115315634;genbank:GeneID:5176553 Probab=97.56 E-value=2.6e-05 Score=45.75 Aligned_cols=360 Identities=12% Similarity=0.082 Sum_probs=147.7 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhc-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccc Q lcl|Aclame:pro 1 MGVKLTVNQLNEAWIASGDKVTDFNDQINMALNDDN-FSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEK 79 (408) Q Consensus 1 M~~~~~i~el~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (408) |++.-. .|-++.+....+..-.++..+..+.-... -+.....+++.-+.+...+++.++..+...+. .++ T Consensus 8 ~~K~~l-~EK~~~~a~~~E~~~~LKS~~~G~evknaiedl~K~~EL~~TlS~~~iEI~~~en~LNa~~E--------~~K 78 (400) T protein:vir:93 8 MNKPDL-IEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEE--------KPK 78 (400) T ss_pred cccchH-HHHHHHHhhhhhhhhhhhhhhhcchhhhhhhhchhHHHHHHhHhhcchhhhhhhhhhhhhhh--------hhh Confidence 554432 22222333333333333332222200000 00112334444444444444444433332111 111 Q ss_pred cccccchhhhHHHHHHHHHHHhhcchhhHHHH---HH-HHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeec Q lcl|Aclame:pro 80 GPLNKSENELKDKFVKDFVNMVRNPMAFMNTV---SS-KTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESV 155 (408) Q Consensus 80 ~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~---~~-~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~ 155 (408) +......--......-.|.+.+....+..... +. .+....+-.+--+.+|..+...|...+....++++...+... 
T Consensus 79 GK~kMt~~i~sq~A~~eF~~vL~~N~G~S~~k~AW~A~L~E~GVtiTD~~~~LP~~lv~sI~~A~~n~n~v~~vfHVT~~ 158 (400) T protein:vir:93 79 GKDKMTNFIESQNAVTEFFDVLKKNSGKSEIKNAWSAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNV 158 (400) T ss_pred hhHHHHHHHhhHHHHHHHHHHHhccCCchhhhhhhhhhHhhcCcceeccchhccHHHHHHHHHhhhccCcceeeeeeccc Confidence 11111111122223334555554332222111 11 111222223344678998888898888888888776555443 Q ss_pred ccCccceEEeeccCCccccchhcccccccccccccceeeeechheeeeehHHHHHHH---hcchHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 156 STSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSL---KDTAENILAWLSSWIAKKVV 232 (408) Q Consensus 156 ~~~~g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell---~ds~~~~~~~v~~~l~~~~~ 232 (408) + .+.+.+.-.....+...-.|.+.++.+ .+|.--++.+--++....+. ++. +++...+..||..+|+.++. T Consensus 159 ~----~~~V~~s~~s~~~Aq~HkdGqTK~eqa-~~~~~~Tl~~~~VY~~~S~A-e~~K~~~~sYsel~N~i~~ELtQ~~v 232 (400) T protein:vir:93 159 G----ALLVSRSFDSANEAQVHKDGQTKTEQA-ATLTIDTLEPVMVYKLQSLA-ERVKRLQMSYSELYNLIVAELTQAIV 232 (400) T ss_pred h----hhhHHhhhhhhhhhhhhccCCccccce-eeeeeechhHHHHHHHHHHH-HHHHHhhhhHHHHHHHHHHHHHHHHH Confidence 2 222222211222333344555655433 35555555555444444442 232 33444568899999999999 Q ss_pred -HHHHHHHhhccccccchhhhh--------------------h-HHHHHHHHHHhhhhhccCCCEEEEcHHH-HHHHHhh Q lcl|Aclame:pro 233 -VTRNQAIIEVMKAAPKKPTIA--------------------K-FDDVITMINTAVDPAIIATSSLLTNQSG-LNKLALV 289 (408) Q Consensus 233 -~~~~~~~~~g~g~~~~~~~~~--------------------~-~d~i~~~~~~~l~~~~~~~a~~~~n~~~-~~~l~~l 289 (408) +..+.++.-|+|+++...... . +|.|-.+.-...+.+.+. .+++.... 
.+-|..| T Consensus 233 nk~Vd~AlV~GDG~N~f~~~DK~advK~I~~~Ttkaksagktpfadaieeavdfvrptagrr--ylivktedrkalldel 310 (400) T protein:vir:93 233 NKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPTAGRR--YLIVKTEDRKALLDEL 310 (400) T ss_pred HHHHHhhhheecCCCCccchhhHHHHHHHHHHhhhhhhcCCCchhHHHHHHHhhhccCCCce--EEEEeccchHHHHHHH Confidence 889999999999876321111 0 122222221111111111 12333322 3334444 Q ss_pred hcccCc--eeeccccccCCcccccccceEeeccccccccccCcceEEEEehhcceEeeeccceEEEEeccchhhhhhcee Q lcl|Aclame:pro 290 KTAEGK--YLLEPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTT 367 (408) Q Consensus 290 kd~~G~--~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~ 367 (408) +.+..+ .....+- ..-..--|..-+++. .++ .+- .+-++.|-+ |.+ +-.+++ ......|.+|.- T Consensus 311 rqatanahvrikndd--aeiasevgvdeiivy---tgs-kal-kptvlvdqk--yhi-dmqdlt----kvdafewktnsn 376 (400) T protein:vir:93 311 RQATANAHVRIKNDD--AEIASEVGVDEIIVY---TGS-KAL-KPTVLVDQK--YHI-DMQDLT----KVDAFEWKTNSN 376 (400) T ss_pred HhhccccceEeecch--hhhhhhcCcceeeee---ecc-ccc-cceeeeccc--ccc-chhhhh----hhhhheeccCCc Confidence 433222 1111110 000011122111110 000 010 111233322 111 112221 111223556666 Q ss_pred eEEEEeeeCcEEecccceEEEEee Q lcl|Aclame:pro 368 KIRVIDRFDVKATDSEALVAGSFS 391 (408) Q Consensus 368 ~~r~~~r~d~~v~~~~a~~~l~~~ 391 (408) .+.++.--.|.+.--+|-+++++. 
T Consensus 377 milvetltsghvetynagavitvs 400 (400) T protein:vir:93 377 MILVETLTSGHVETYNAGAVITVS 400 (400) T ss_pred eEEEeecccCcceeeccceeEeeC Confidence 666666666777666777777666 No 166 >protein:vir:8843 Length: 317 # NCBI annotation: major head protein # Family: family:all:3919 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775251;genbank:gi:27476049;genbank:GeneID:2700597 Probab=97.54 E-value=1.6e-05 Score=46.92 Aligned_cols=265 Identities=10% Similarity=0.017 Sum_probs=136.0 Q ss_pred hhccc---cccCceecchhhhhhhhhhhhhhhhhhhhhceeecccCccceEEeeccCCccccchhcccccccccccccce Q lcl|Aclame:pro 116 ETSGS---DSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLT 192 (408) Q Consensus 116 ~~~~t---~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~ 192 (408) |...+ .+...+..-.++...|...-....|+.+++.....++. .+.|....-..+...-..||++.++.....-. T Consensus 1 ma~~~~~~~t~~~~g~~~dl~~~I~~isp~dTPf~S~i~~~~a~~~--~~~W~~d~l~~~~~~~~~EG~da~~~~~~~r~ 78 (317) T protein:vir:88 1 MATPTNAVSTVEINGKREDLIDIIYNIAPYDTPFMSAIGKGVATAI--THEWQTDELRQPGKNTRVEGEDATIKAGSFTT 78 (317) T ss_pred CCccccceEeeeeeeeeechhhhheecCCccCcceeeecCceeccc--EEEEEeeecCCccccccccCcccccccccCCE Confidence 21111 11112334456778888877788898888777665433 33444332223333445688766543321111 Q ss_pred eeeechheeeeehHHHHHHHhcchHHHHHHHHHHHHH---HHHHHHHHHHhhcccc-----cc-c--------------- Q lcl|Aclame:pro 193 IIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAK---KVVVTRNQAIIEVMKA-----AP-K--------------- 248 (408) Q Consensus 193 ~v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~---~~~~~~~~~~~~g~g~-----~~-~--------------- 248 (408) .+.=-.+-+...+.||..+..-+..+....+..++++ .+.+-++.++|+|... 
.+ + T Consensus 79 ~~~N~tQIf~k~v~VSgTa~av~~~G~~~ela~q~~kk~~EikrdmE~~li~g~~a~~~~~~t~~r~~~Gl~~~i~t~~~ 158 (317) T protein:vir:88 79 MLNNYCQISDETLQVTGTADRVKKAGRKNELAYQLAKKSKELKLDMEYALVGAPQAKVQRNTTTPGQMANIFAYYKTNGS 158 (317) T ss_pred EeccEEEEEEeEEEEeehhhhhhhcCccchhHHHHHHHHHHHHHHHHHHHhcCeeeccCCCCccchhhhhHHHHhccCce Confidence 2222223334445555555443333333333333333 3556678888888632 10 0 Q ss_pred -------------------hhhhhhHHHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHhhhcccCceeeccccccCCc-- Q lcl|Aclame:pro 249 -------------------KPTIAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNS-- 307 (408) Q Consensus 249 -------------------~~~~~~~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~-- 307 (408) .....+-+++.+++...-......+ .+++|+.....|.++...++.++..+.-..-.+ T Consensus 159 ~~~~g~~~~~~~~~~~t~~t~~~lte~~l~~~l~~i~~~Gg~~~-~i~v~a~~k~~i~~~~~~~~~~i~~~~~~~~~g~~ 237 (317) T protein:vir:88 159 LGANGVAPVGDGSNTGTAGDLRLLTEDMLLNASESIWRNGGQAN-SIQTSSSIKKAISKNMKGRATEITLDASDNRIAQT 237 (317) T ss_pred eccCccccccCCCccccccccccccHHHHHHHHHHHHhcCCCCC-EEEeChHHHHHHHHHhcCCceeEEEcccCeEEEEE Confidence 0001344555666655444444344 578999999999888443444543221100000 Q ss_pred ----ccccccceEeeccccccccccCcceEEEEehhcceEeeeccceEEEEeccchhhhhhceeeEEEEeeeCcEEeccc Q lcl|Aclame:pro 308 ----YLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSE 383 (408) Q Consensus 308 ----~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~ 383 (408) -+=+|. |-++.++++|. +.+++.|+.. 
+.+..-.++..+.....+ +......+..++..+.+|+ T Consensus 238 v~~~~tdfG~-v~ii~~r~lp~-----~~~~~~D~~~-~~l~~Lr~~~~e~laKtG-----d~~k~~i~~E~tLe~~N~~ 305 (317) T protein:vir:88 238 VDVYESDFGK-YTIRANRWFHE-----NTLFVFDPKM-HSLCYLRPFFQHELAKTG-----DSEKRQLLVEYTFRVNNEK 305 (317) T ss_pred EEEEEeCCeE-EEEEeCCCCCC-----CeEEEEcccc-cceeecccceeeccCCCc-----ccceeEEEEEEEEEEcCcc Confidence 112342 44455666663 5689999875 333222344333322222 3334457778999999999 Q ss_pred ceEEEEeecccc Q lcl|Aclame:pro 384 ALVAGSFSAIAD 395 (408) Q Consensus 384 a~~~l~~~~~~~ 395 (408) |+.++...+++- T Consensus 306 a~a~i~~l~~~~ 317 (317) T protein:vir:88 306 SGALIRDVVAQL 317 (317) T ss_pred ceeEEEEecccC Confidence 999988555444 No 167 >protein:vir:1663 Length: 393 # NCBI annotation: unknown # Family: family:all:2417 # MgeID: mge:34 # MgeName: sk1 # Cross-refs: genbank:acc:NP_044952;genbank:gi:9629659;genbank:GeneID:1261309 Probab=97.43 E-value=3.7e-05 Score=44.86 Aligned_cols=356 Identities=14% Similarity=0.113 Sum_probs=149.1 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHhhhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcc Q lcl|Aclame:pro 1 MGVKLTVNQLNEAWIASGDKVTDFNDQI-----NMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMR 75 (408) Q Consensus 1 M~~~~~i~el~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 75 (408) ||..-.|+. 
+..+.++++.--.++.++ ..+.++- ....+++.-+.+...+++.++..+...+ T Consensus 1 mnkpdliek-qnrlaelkennvslksqisgfevknaiedl----~K~~ELe~TlSe~~iEI~k~en~LN~~e-------- 67 (393) T protein:vir:16 1 MNKPDLIEK-QNRLAELKENNVSLKSQISGFEVKNAIEDL----PKVQELEKTLSENSIEIIKIENELNAQE-------- 67 (393) T ss_pred CCCcchhhh-hhhhhhhhhcccchhhhccchhhhhhhhhc----hhHHHHHHhHhhcchhhhhhhhhhhhhh-------- Confidence 887765543 333333333222222221 1122211 1233444444444444444443333321 Q ss_pred cccccccccchhhhHHHHHHHHHHHhhcchhhHHHH---HH-HHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhc Q lcl|Aclame:pro 76 EEEKGPLNKSENELKDKFVKDFVNMVRNPMAFMNTV---SS-KTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVR 151 (408) Q Consensus 76 ~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~---~~-~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~ 151 (408) ..++++.....--......-.|.+.+....+..... +. .+....+-.+--+.+|..+...|...+....++++... T Consensus 68 E~~KGK~kMt~~iesq~A~~eF~~vL~~N~G~S~~k~AW~A~L~E~GVtiTD~~~~LP~~lv~sI~~A~~n~n~v~~vfH 147 (393) T protein:vir:16 68 EKPKGKDKMTNFIESQNAVTEFFDVLKKNSGKSEIKNAWSAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFH 147 (393) T ss_pred hcchhhHHHHHHHhhHHHHHHHHHHHhccCCchhhhhhhhhhHhhcCcceeccchhccHHHHHHHHHhhhccCcceeeee Confidence 111111111111122223334555554333222111 11 11122222334467899888889888888888877655 Q ss_pred eeecccCccceEEeeccCCccccchhcccccccccccccceeeeechheeeeehHHHHHHHh---cchHHHHHHHHHHHH Q lcl|Aclame:pro 152 VESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLK---DTAENILAWLSSWIA 228 (408) Q Consensus 152 ~~~~~~~~g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~---ds~~~~~~~v~~~l~ 228 (408) +...+ .+.+.+.-.....+...-.|.+.++.+ .+|.--++.+--++....+ -++.. 
.+...+..||..+|+ T Consensus 148 VT~~~----~~~V~~s~~s~~eAq~HkdGqTK~eqa-~~~~~~Tl~~~~VY~~~S~-Ae~~K~~~~sYsel~N~i~~ELt 221 (393) T protein:vir:16 148 VTNVG----ALLVSRSFDSANEAQVHKDGQTKTEQA-ATLTIDTLEPVMVYKLQSL-AERVKRLQMSYSELYNLIVAELT 221 (393) T ss_pred eccch----hhhHHhhhhhhhhhhhhccCCccccce-eeeeeechhHHHHHHHHHH-HHHHHHhhhhHHHHHHHHHHHHH Confidence 54432 222222111112333344555655433 3555555555444443334 22322 344456889999999 Q ss_pred HHHH-HHHHHHHhhccccccchhhhh--------------------h-HHHHHHHHHHhhhhhccCCCEEEEcHHH-HHH Q lcl|Aclame:pro 229 KKVV-VTRNQAIIEVMKAAPKKPTIA--------------------K-FDDVITMINTAVDPAIIATSSLLTNQSG-LNK 285 (408) Q Consensus 229 ~~~~-~~~~~~~~~g~g~~~~~~~~~--------------------~-~d~i~~~~~~~l~~~~~~~a~~~~n~~~-~~~ 285 (408) .++. +..+.++.-|+|+++...... . +|.|-.+.-...+.+.+. .+++.... .+- T Consensus 222 Q~~vnk~Vd~AlV~GDG~N~f~~~DK~advK~I~k~Ttkaksagktpfadaieeavdfvrptagrr--ylivktedrkal 299 (393) T protein:vir:16 222 QAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPTAGRR--YLIVKTEDRKAL 299 (393) T ss_pred HHHHHHHHHhhhheecCCCCccchhhHHHHHHHHHHhhhhhhcCCCchhHHHHHHHhhhccCCCce--EEEEeccchHHH Confidence 9999 889999999999876321111 1 122222221111111111 22333322 333 Q ss_pred HHhhhcccCc--eeeccccccCCcccccccceEeeccccccccccCcceEEEEehhcceEeeeccceEEEEeccchhhhh Q lcl|Aclame:pro 286 LALVKTAEGK--YLLEPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFE 363 (408) Q Consensus 286 l~~lkd~~G~--~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~ 363 (408) |..|+.+... .....+-+.- ..--|..-+++. .++ .+- .+-++.|-+ |.+ +-.+++ ......|. 
T Consensus 300 ldelrqatananvriknddtei--asevgvdeiivy---tgs-kal-kptvlvdqk--yhi-dmqdlt----kvdafewk 365 (393) T protein:vir:16 300 LDELRQATANANVRIKNDDTEI--ASEVGVDEIIVY---TGS-KAL-KPTVLVDQK--YHI-DMQDLT----KVDAFEWK 365 (393) T ss_pred HHHHHhhhccCceeeeccchhh--hhhcCcceeeee---ecc-ccc-cceeeeccc--ccc-chhhhh----hhhhheec Confidence 3444322221 1111110000 001122111110 000 010 111233322 111 112221 11122355 Q ss_pred hceeeEEEEeeeCcEEecccceEEEEee Q lcl|Aclame:pro 364 TDTTKIRVIDRFDVKATDSEALVAGSFS 391 (408) Q Consensus 364 ~~~~~~r~~~r~d~~v~~~~a~~~l~~~ 391 (408) ++.-.+.++.--.|.+.--+|-+++++. T Consensus 366 tnsnmilvetltsghvetynagavitvs 393 (393) T protein:vir:16 366 TNSNMILVETLTSGHVETYNAGAVITVS 393 (393) T ss_pred cCCceEEEeecccCcceeeccceeEeeC Confidence 6666666766667777666777777666 No 168 >protein:vir:99075 Length: 392 # NCBI annotation: gp30 # Family: family:all:10837 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655895;genbank:gi:109521467;genbank:GeneID:4158040 Probab=97.31 E-value=8.5e-05 Score=42.91 Aligned_cols=280 Identities=9% Similarity=0.058 Sum_probs=108.4 Q ss_pred hhccccccCceecchhhhhhhhhhhhhhhhhhhhhcee---ecccCcc-ceEEeeccCCccccc-h--hccccccccccc Q lcl|Aclame:pro 116 ETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVE---SVSTSNG-SRVYEKWTDVTPLTV-M--DAEDGKIPDLDN 188 (408) Q Consensus 116 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~---~~~~~~g-~~~~~~~~~~~~~~~-~--~~E~~~~~~~~~ 188 (408) |.. -.++|+.|+.++++.+++..++.+++..- .+.+..| .+.++.........+ + .+++..+. .+. 
T Consensus 1 Ma~------~~~~p~~~a~~~l~~l~~~lv~~~lv~~~~~~~~~~~~GdtV~i~~~~~~~~~~~~~~~~~~~~~~~-~~~ 73 (392) T protein:vir:99 1 MAN------AFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLT-VSD 73 (392) T ss_pred Ccc------ccccHHHHHHHHHHHHHhhccchhhhccccccccccCCCCeEEEeecccccceeeeccccccCCccc-ccc Confidence 221 24789999999999999999888776432 2222222 244443222211111 0 12222322 122 Q ss_pred ccceeeeech--heeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhcccc--------ccchhhhhhHHHH Q lcl|Aclame:pro 189 PQLTIIKYLI--KRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVMKA--------APKKPTIAKFDDV 258 (408) Q Consensus 189 ~~f~~v~~~~--~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~~g~g~--------~~~~~~~~~~d~i 258 (408) .+-+.+++.. ++... +.|+++-......++...+.+...++++.++|..++.-..+ .........++.+ T Consensus 74 ~~~~~~~~~id~~k~~~-~~i~d~e~~~~~~~~~~~~~~~a~~ala~~vd~~i~~~~~~a~~~~~~~~~~~~~~~~~~~i 152 (392) T protein:vir:99 74 FTEDSFPVTLTDVAYHL-GVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEAAGAVHEVAPDEFFKGV 152 (392) T ss_pred cccceEEEEEeeeeecc-eeechHHHhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccChhhhHHHH Confidence 2223444444 33333 34555544444567777777888899999998877643211 1111122335556 Q ss_pred HHHHHHhhhhhcc-CCCEEEEcHHHHHHHHhhhc-ccCceeec---cccccCCcccccccceEeeccccccccccCc--- Q lcl|Aclame:pro 259 ITMINTAVDPAII-ATSSLLTNQSGLNKLALVKT-AEGKYLLE---PDPTKPNSYLIKGKQVIVVADRWLPNTGSTV--- 330 (408) Q Consensus 259 ~~~~~~~l~~~~~-~~a~~~~n~~~~~~l~~lkd-~~G~~~~~---~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~--- 330 (408) +.+. ..|+.... .+.+++++|..+..|.+... .+-.+.-. ..+.++.-+++.|++|+...+ +|...+.. 
T Consensus 153 ~~a~-~~L~~~~vP~~R~~vv~p~~~~~l~~~~~~~~~~~~g~~~~~~l~~G~vg~i~G~~v~~s~~--~~~~t~~a~~~ 229 (392) T protein:vir:99 153 NGAR-RALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGRIYGYEIVESTL--IPHGDAYLYHP 229 (392) T ss_pred HHHH-HHHhhcCCCCCCEEEEcHHHHHHHhcccceeecccccchhhhhhhcceeeeeeeeEEEeecc--cccccceeeec Confidence 5554 33544333 34467889998887653310 00001100 113344556899999987543 22211100 Q ss_pred ceEEEE--------ehhcceEeeeccceEEEEeccchhhhhhceeeEEEEeeeCcEEecc---cceEEE-EeeccccCC- Q lcl|Aclame:pro 331 YPLYYG--------DMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDS---EALVAG-SFSAIADQV- 397 (408) Q Consensus 331 ~~~~~g--------d~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~---~a~~~l-~~~~~~~~~- 397 (408) ..+.++ +....+.......+...+.......+..+...+.. ..+...... .+|... .++...... T Consensus 230 ~a~~~at~a~v~~~~~~~~~s~s~~~~v~~~~~~~~~~t~~s~~~~v~~--~~g~~~v~~~~~~~~~~~~~~~~~~~~v~ 307 (392) T protein:vir:99 230 TAFIMATRAPAPPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLIDT--YFGLKVVEDPNGVGFVRARKIHLIPGSIE 307 (392) T ss_pred cccccccccccccccccceeEEecccceecceeecccceeeccccccce--eEEEEEEeeccccceeeeeeeeeecceee Confidence 000000 00000000000001111000000000011000000 000000000 000000 000000000 Q ss_pred ---------------CCccCCC-----------cccC Q lcl|Aclame:pro 398 ---------------GNFKTTT-----------STAV 408 (408) Q Consensus 398 ---------------~~~~~~~-----------~~~~ 408 (408) +.+.+.. 
.+.| T Consensus 308 v~~v~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~v 344 (392) T protein:vir:99 308 VAPEAGANATITAAAGEDHTVQLKVTDANGDDVTALC 344 (392) T ss_pred eeeeecccceeEeeeccceeEEEEEEecCCccccceE Confidence 0000000 0011 No 169 >protein:vir:95131 Length: 325 # NCBI annotation: hypothetical protein ORF010 # Family: family:all:47 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293417;genbank:gi:148912838;genbank:GeneID:5228206 Probab=97.01 E-value=0.00021 Score=40.72 Aligned_cols=286 Identities=9% Similarity=0.009 Sum_probs=96.3 Q ss_pred cccchhhhHHHHHHHHHHHhhcchhhHHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeecccCccc Q lcl|Aclame:pro 82 LNKSENELKDKFVKDFVNMVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGS 161 (408) Q Consensus 82 ~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~ 161 (408) +....... |..++... ..+.+..........+.|..++-. .++.+.-.. T Consensus 1 m~lsD~~v-------fN~~~~~a--~~e~~~q~~~~fn~as~gai~l~~----------------------~~~~Gd~~~ 49 (325) T protein:vir:95 1 MALSDLAV-------YSEYAYSA--FSETLRQQVDLFNTATGGAIMLQS----------------------AAHQGDFSD 49 (325) T ss_pred Cchhhhhh-------hhhhhhhh--hhhhhhhhHhhhhhcccceeEecc----------------------ccccCceee Confidence 11111110 11111100 000000000000111111111100 001111111 Q ss_pred eEEeec-cCCccccchhcccccccccccccceeeeechheeeeehH--HHHHHH-hcchHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 162 RVYEKW-TDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIIT--ATNTSL-KDTAENILAWLSSWIAKKVVVTRNQ 237 (408) Q Consensus 162 ~~~~~~-~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~--iS~ell-~ds~~~~~~~v~~~l~~~~~~~~~~ 237 (408) +|+... .++....--..+.+..+.....++.++......-.+... ++..+. .+....+...|.+.+++...+.+-. 
T Consensus 50 ~pf~~~l~g~~~~~~~~~~~~~vt~~kitt~~~~av~~~r~~g~~~~d~~~~~~g~~~~~~~~~~Ig~~~a~~~~~~~l~ 129 (325) T protein:vir:95 50 VAFFAKVTGGLVRRRNAYGSGTVAEKVLKHLVDTSVKVAAGTPPVRLDPGQFRWIQQNPEVAGAAMGQQLAVDTMADMLN 129 (325) T ss_pred ccccccccccccccccCCCCceeccceeccccceeeEEecccCcccccHHHHhhcCCCHHHHHHHHHHHHHHHHHHHHHH Confidence 122111 111111101122222322222334444444333333222 222221 1222233344444444443333222 Q ss_pred HHhhccccc---------------cchhhhhhHHHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHhhhcccCceeecccc Q lcl|Aclame:pro 238 AIIEVMKAA---------------PKKPTIAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDP 302 (408) Q Consensus 238 ~~~~g~g~~---------------~~~~~~~~~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~lkd~~G~~~~~~~~ 302 (408) .++.+..+. .......+...+.++... +-.....-..|+||..++..|.+++-.+...++..+- T Consensus 130 ~~~~~l~~a~~~~~~~v~dis~~~~~~~~~~s~~~l~~A~~k-lGD~~~~l~~~~MHS~v~~~L~~~~L~~~~~~~~~~g 208 (325) T protein:vir:95 130 VGLGSVYSALSQVSDVVYDATANTDAADKLPTWNNLNNGQAK-FGDQSSQIAAWIMHSTPMHKLYGSNLTNGERLFTYGT 208 (325) T ss_pred HHHHHHHHhhcccccceeeeecccCcccccccHHHHHHHHHH-hcccccceeEEEEchHHHHHHHHhhccccccccccCC Confidence 222222110 001112244566666644 3334445557999999999998765444433333221 Q ss_pred ccCCcccccccceEeeccccccccccC-cc-eEEEEehhcceEeeeccceEEEEeccchhhhhhceeeEEEEeeeCcEEe Q lcl|Aclame:pro 303 TKPNSYLIKGKQVIVVADRWLPNTGST-VY-PLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKAT 380 (408) Q Consensus 303 ~~~~~~~l~G~pv~~~~~~~~~~~~~~-~~-~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~ 380 (408) ... -++++|++|++.|.+++...+.. .. +.+||. .++......+......+... -++-...+|.+. 
--++ T Consensus 209 ~~~-i~t~~G~~VIVdD~~p~~~~g~~~~ytty~lg~--GAi~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~---tf~l 280 (325) T protein:vir:95 209 VNV-VRDPFGKLLVMTDSPNLFAAGTPNVYHILGLVP--GGVLIGQNNDFDANEETKNG--DENIIRTYQAEW---SYNI 280 (325) T ss_pred ccc-ccccCCcEEEEeCCCCCCCccCceeEEEEEEec--CeEEecCCCCccccccccCc--ccceeeeeeeee---eEEe Confidence 111 24789999999876444332211 11 123332 12222222222222111111 112222233221 1467 Q ss_pred cccceEEEEeeccccCCCCccCCCcccC Q lcl|Aclame:pro 381 DSEALVAGSFSAIADQVGNFKTTTSTAV 408 (408) Q Consensus 381 ~~~a~~~l~~~~~~~~~~~~~~~~~~~~ 408 (408) +|.++..- .+.....|..+.-.++.-- T Consensus 281 hp~G~sw~-~s~~g~sPt~aeL~~~~NW 307 (325) T protein:vir:95 281 GVKGFAWD-KANGGKSPTDAALFTSTNW 307 (325) T ss_pred ecceeeee-cccccCCcChHhhcCCcCc Confidence 88888772 2222222222222222111 No 170 >protein:vir:97331 Length: 319 # NCBI annotation: ORF011 # Family: family:all:701 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240611;genbank:gi:66396278;genbank:GeneID:5133687 Probab=97.00 E-value=0.00021 Score=40.69 Aligned_cols=295 Identities=10% Similarity=-0.028 Sum_probs=118.2 Q ss_pred HHHHhhcchhhHHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhh--hhceeecccCccceEEeeccCCcccc Q lcl|Aclame:pro 97 FVNMVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQ--YVRVESVSTSNGSRVYEKWTDVTPLT 174 (408) Q Consensus 97 ~~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~--~~~~~~~~~~~g~~~~~~~~~~~~~~ 174 (408) +.+.+++..+...- ...-...-+....-...-..++.. ++.+.....+-. .++...-..+..++.||+........ 
T Consensus 1 ~~~~~~~~~~~~~~-~~~~~~~~~~~~nt~~l~~k~~~~-LD~~~~~~~~s~~~~~N~~~e~~gg~tVkIp~i~~~gl~D 78 (319) T protein:vir:97 1 MNKTIKNATGMLKL-NLQHFANKSVEPGQTLLKNKHVGI-LERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTTELKD 78 (319) T ss_pred CCcccccccceeEe-ehhhhhccCCCcchHHHHHHHHHH-HHHHHHHhhhhhhcccCcceEeccCcEEEEeeeccccccc Confidence 00001100000000 000000001111112222233332 333333332211 12211112234567777776554444 Q ss_pred chhcccccccccccccceeeeechheeeeeh-HHHHHHHhcchHHH--HHHHHHHHHHHHHHHHHHHHhh----ccccc- Q lcl|Aclame:pro 175 VMDAEDGKIPDLDNPQLTIIKYLIKRYAGII-TATNTSLKDTAENI--LAWLSSWIAKKVVVTRNQAIIE----VMKAA- 246 (408) Q Consensus 175 ~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~-~iS~ell~ds~~~~--~~~v~~~l~~~~~~~~~~~~~~----g~g~~- 246 (408) +-.+.+-...+.+ .++...+++-.+.-.+. .--+ ...+...+ ...+.+.....+.-.+|.-.+. +.++. T Consensus 79 Y~R~~g~~~g~vt-~~~~t~tidqdR~~~F~VD~~D--~~Etn~~l~a~~i~~~~~~~~v~PEiDay~~skla~~a~~~~ 155 (319) T protein:vir:97 79 YKRNATNEFDHPK-IEETTYFLDQEKYWGRFVDALD--RKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAKHL 155 (319) T ss_pred ccCCCCcccCCcc-cceeEEEeecccccccccchhh--HhhhhchhhHHHHHHHHHHHHhhhhhhHHHHHHHHhhccccc Confidence 4444433332222 24445556555555442 1111 11122222 2223333344444444443222 11211 Q ss_pred -cchhhhhhHHHHHHHHHHhhhhh-ccCCCEEEEcHHHHHHHHhhhcccCce-eeccccccCCcccccccceEeeccccc Q lcl|Aclame:pro 247 -PKKPTIAKFDDVITMINTAVDPA-IIATSSLLTNQSGLNKLALVKTAEGKY-LLEPDPTKPNSYLIKGKQVIVVADRWL 323 (408) Q Consensus 247 -~~~~~~~~~d~i~~~~~~~l~~~-~~~~a~~~~n~~~~~~l~~lkd~~G~~-~~~~~~~~~~~~~l~G~pv~~~~~~~~ 323 (408) ...+....++.+.+++. .|+.. ...+.+++++|..+..|.+-..-.... 
+......++..++|.|+||+.+++..+ T Consensus 156 ~~~~t~~n~y~~i~~a~~-~Lde~~VP~~Rvl~Vtp~~~~~L~~~~~f~~~~~~~~~~~~~g~Vg~idG~~Vi~vps~~~ 234 (319) T protein:vir:97 156 TVGTGSDAQYDAVLDVSV-ELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKGVQGELDGFVIVKVPTKLL 234 (319) T ss_pred ccccCHHHHHHHHHHHHH-HHHhcCCCCCcEEEeCHHHHHHHHhhhhhhccccccccceeeeeceeecCeEEEEeccccc Confidence 11122233556666553 34443 344566788999988875432111110 112223445557899999998765433 Q ss_pred cccccCcceEEEEehhcceEeeeccceEEEEeccchhhhhhceeeEEEEeeeCcEEecccceEEEEeeccccCCCCccCC Q lcl|Aclame:pro 324 PNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFSAIADQVGNFKTT 403 (408) Q Consensus 324 ~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~~~~~~ 403 (408) ..-.+++|-.+ +..... +=-.++......+.| ...++...++|..|++|++..++....+.++... ..+ T Consensus 235 -----k~in~i~~h~~-A~~~~~-k~~~~~~~~p~~~~~---a~~v~gr~y~d~~V~~~k~~~Iy~~~~~~~~~~~-~~~ 303 (319) T protein:vir:97 235 -----QGLQAIAVVGE-VLASPI-QADLAKTNSNIPGMF---GTLAEQLLYTGAFVPEHLQKYIFTIGGTEVATKR-DGV 303 (319) T ss_pred -----ccceEEEEcCC-eeeeee-eeeeeeccCCCcccc---ceeeeeeeeeeeEEeccccceEEEeecCCcccCC-Ccc Confidence 23346666544 333222 211222211111112 3567788889999999997666543333222111 111 Q ss_pred Cccc---------C Q lcl|Aclame:pro 404 TSTA---------V 408 (408) Q Consensus 404 ~~~~---------~ 408 (408) .|.+ + T Consensus 304 ~~~~~~~~~~~~~~ 317 (319) T protein:vir:97 304 DAHADNVAKPSGSL 317 (319) T ss_pred ccccccccCCcccc Confidence 1111 1 No 171 >protein:vir:94800 Length: 319 # NCBI annotation: ORF012 # Family: family:all:701 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240536;genbank:gi:66396203;genbank:GeneID:5133580 Probab=97.00 E-value=0.00021 Score=40.69 Aligned_cols=295 Identities=10% Similarity=-0.028 Sum_probs=118.2 Q ss_pred HHHHhhcchhhHHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhh--hhceeecccCccceEEeeccCCcccc Q lcl|Aclame:pro 97 FVNMVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQ--YVRVESVSTSNGSRVYEKWTDVTPLT 174 (408) Q Consensus 97 
~~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~--~~~~~~~~~~~g~~~~~~~~~~~~~~ 174 (408) +.+.+++..+...- ...-...-+....-...-..++.. ++.+.....+-. .++...-..+..++.||+........ T Consensus 1 ~~~~~~~~~~~~~~-~~~~~~~~~~~~nt~~l~~k~~~~-LD~~~~~~~~s~~~~~N~~~e~~gg~tVkIp~i~~~gl~D 78 (319) T protein:vir:94 1 MNKTIKNATGMLKL-NLQHFANKSVEPGQTLLKNKHVGI-LERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTTELKD 78 (319) T ss_pred CCcccccccceeEe-ehhhhhccCCCcchHHHHHHHHHH-HHHHHHHhhhhhhcccCcceEeccCcEEEEeeeccccccc Confidence 00001100000000 000000001111112222233332 333333332211 12211112234567777776554444 Q ss_pred chhcccccccccccccceeeeechheeeeeh-HHHHHHHhcchHHH--HHHHHHHHHHHHHHHHHHHHhh----ccccc- Q lcl|Aclame:pro 175 VMDAEDGKIPDLDNPQLTIIKYLIKRYAGII-TATNTSLKDTAENI--LAWLSSWIAKKVVVTRNQAIIE----VMKAA- 246 (408) Q Consensus 175 ~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~-~iS~ell~ds~~~~--~~~v~~~l~~~~~~~~~~~~~~----g~g~~- 246 (408) +-.+.+-...+.+ .++...+++-.+.-.+. .--+ ...+...+ ...+.+.....+.-.+|.-.+. +.++. T Consensus 79 Y~R~~g~~~g~vt-~~~~t~tidqdR~~~F~VD~~D--~~Etn~~l~a~~i~~~~~~~~v~PEiDay~~skla~~a~~~~ 155 (319) T protein:vir:94 79 YKRNATNEFDHPK-IEETTYFLDQEKYWGRFVDALD--RKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAKHL 155 (319) T ss_pred ccCCCCcccCCcc-cceeEEEeecccccccccchhh--HhhhhchhhHHHHHHHHHHHHhhhhhhHHHHHHHHhhccccc Confidence 4444433332222 24445556555555442 1111 11122222 2223333344444444443222 11211 Q ss_pred -cchhhhhhHHHHHHHHHHhhhhh-ccCCCEEEEcHHHHHHHHhhhcccCce-eeccccccCCcccccccceEeeccccc Q lcl|Aclame:pro 247 -PKKPTIAKFDDVITMINTAVDPA-IIATSSLLTNQSGLNKLALVKTAEGKY-LLEPDPTKPNSYLIKGKQVIVVADRWL 323 (408) Q Consensus 247 -~~~~~~~~~d~i~~~~~~~l~~~-~~~~a~~~~n~~~~~~l~~lkd~~G~~-~~~~~~~~~~~~~l~G~pv~~~~~~~~ 323 (408) ...+....++.+.+++. .|+.. ...+.+++++|..+..|.+-..-.... 
+......++..++|.|+||+.+++..+ T Consensus 156 ~~~~t~~n~y~~i~~a~~-~Lde~~VP~~Rvl~Vtp~~~~~L~~~~~f~~~~~~~~~~~~~g~Vg~idG~~Vi~vps~~~ 234 (319) T protein:vir:94 156 TVGTGSDAQYDAVLDVSV-ELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKGVQGELDGFVIVKVPTKLL 234 (319) T ss_pred ccccCHHHHHHHHHHHHH-HHHhcCCCCCcEEEeCHHHHHHHHhhhhhhccccccccceeeeeceeecCeEEEEeccccc Confidence 11122233556666553 34443 344566788999988875432111110 112223445557899999998765433 Q ss_pred cccccCcceEEEEehhcceEeeeccceEEEEeccchhhhhhceeeEEEEeeeCcEEecccceEEEEeeccccCCCCccCC Q lcl|Aclame:pro 324 PNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFSAIADQVGNFKTT 403 (408) Q Consensus 324 ~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~~~~~~ 403 (408) ..-.+++|-.+ +..... +=-.++......+.| ...++...++|..|++|++..++....+.++... ..+ T Consensus 235 -----k~in~i~~h~~-A~~~~~-k~~~~~~~~p~~~~~---a~~v~gr~y~d~~V~~~k~~~Iy~~~~~~~~~~~-~~~ 303 (319) T protein:vir:94 235 -----QGLQAIAVVGE-VLASPI-QADLAKTNSNIPGMF---GTLAEQLLYTGAFVPEHLQKYIFTIGGTEVATKR-DGV 303 (319) T ss_pred -----ccceEEEEcCC-eeeeee-eeeeeeccCCCcccc---ceeeeeeeeeeeEEeccccceEEEeecCCcccCC-Ccc Confidence 23346666544 333222 211222211111112 3567788889999999997666543333222111 111 Q ss_pred Cccc---------C Q lcl|Aclame:pro 404 TSTA---------V 408 (408) Q Consensus 404 ~~~~---------~ 408 (408) .|.+ + T Consensus 304 ~~~~~~~~~~~~~~ 317 (319) T protein:vir:94 304 DAHADNVAKPSGSL 317 (319) T ss_pred ccccccccCCcccc Confidence 1111 1 No 172 >protein:vir:104342 Length: 314 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398971;genbank:gi:81343955;genbank:GeneID:3778874 Probab=96.91 E-value=8.3e-05 Score=42.96 Aligned_cols=285 Identities=9% Similarity=-0.013 Sum_probs=134.0 Q ss_pred cchhhhHHHHHHHHHHHhhcchhhHHHHHHHHhhccccccCceecch--hhhhhhhhhhhhhhhhhhhhceeec-ccCcc Q lcl|Aclame:pro 84 KSENELKDKFVKDFVNMVRNPMAFMNTVSSKTETSGSDSAAGLTIPQ--DIRTMINTLVRQYDSLQQYVRVESV-STSNG 160 (408) Q Consensus 84 
~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~--~~~~~ii~~~~~~~~l~~~~~~~~~-~~~~g 160 (408) ...+.. .+..+.-....+ ......+++|.+++.+ .+..+|++...+....+.++.+.+- +...- T Consensus 1 ~~~~~~-~~~~~~~~~~~~------------~~~~~~d~~~~fl~~ql~~id~~v~e~~~~~~~~~~~i~v~~~~~~~~e 67 (314) T protein:vir:10 1 MAIKFD-AEQAKITTHLEQ------------MGVEKADAAGIWAVSQLTAALNRAYEKEYAENSVVNIFPVTNEIPGHAK 67 (314) T ss_pred CccchH-HHHHHHHHHHHh------------hcccchhhhHHHHHHHHHHHHHHHhhhhccccccceeeccccCCCCcee Confidence 000000 011100000000 0011223444555553 3556677766555544444433221 11111 Q ss_pred ceEEeeccCCccccchhccccc-ccccccccceeeeechheeeeehHHHHHHHhcc---hHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 161 SRVYEKWTDVTPLTVMDAEDGK-IPDLDNPQLTIIKYLIKRYAGIITATNTSLKDT---AENILAWLSSWIAKKVVVTRN 236 (408) Q Consensus 161 ~~~~~~~~~~~~~~~~~~E~~~-~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds---~~~~~~~v~~~l~~~~~~~~~ 236 (408) ++.+... +..+.+.|.+.++. +|-.+ ..+++.....+.++....++..=++.+ ..++..--....++++.+.+| T Consensus 68 t~~~~~~-e~~G~a~~~~d~~~dip~vd-~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n 145 (314) T protein:vir:10 68 YFEYPEF-DGVGIAQIIADYSDDLPLVD-AFMTEKQGKVFRFGNAFLISTDEIKAGAATGQSLSARKQALAFEAHDNLLD 145 (314) T ss_pred EEEeeee-ccccceeeeCCcccccceee-cccceeEEEEEEEEeeEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhc Confidence 2333322 34466677777655 55333 456777777788887777764433332 346777777888889999999 Q ss_pred HHHhhcccccc--------------chhhhhhHHHHHHHHHHh---hhh---hccCCCEEEEcHHHHHHHHhhhcccCce Q lcl|Aclame:pro 237 QAIIEVMKAAP--------------KKPTIAKFDDVITMINTA---VDP---AIIATSSLLTNQSGLNKLALVKTAEGKY 296 (408) Q Consensus 237 ~~~~~g~g~~~--------------~~~~~~~~d~i~~~~~~~---l~~---~~~~~a~~~~n~~~~~~l~~lkd~~G~~ 296 (408) +-+++|+.... ......+.+.+++.++.. +.. 
....+..++++|+.+..|...-+..|.- T Consensus 146 ~i~f~G~~~~g~~GLlN~p~v~~~~~~~~WaT~~ei~~Di~~~~~~l~~~s~g~~~p~~l~Lpp~~~~~L~~~~~~~~~t 225 (314) T protein:vir:10 146 KLVWSGSAPHGIVSVFDQPNINNVVATPNWSVPQNAIDDVTAMIDAVESSTQGLHHVTDILLPASARRVMQGLVPQTNLS 225 (314) T ss_pred eEEEeecccccceeEeecCCCccccCCCCcccHHHHHHHHHHHHHHHHHhcCccccceeEEecHHHHHhhcccccCCCcc Confidence 99998864311 111223344433333222 211 2233446899999998775443433432 Q ss_pred eeccccccCCcccccccceEeeccccccccccCcceEEEE-ehhcceEeeeccceEEEEeccchhhhhhceeeEEEEeee Q lcl|Aclame:pro 297 LLEPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYG-DMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRF 375 (408) Q Consensus 297 ~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~g-d~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~ 375 (408) ++.--..+..+-+|.+.|-.. .....++..+++. +=.+.+.+..-..++ ..+... ..-.+...+..|+ T Consensus 226 vl~~l~~n~~~l~I~~~~el~------~ag~~g~~~~v~y~~~~~~~~~~vp~~~~--~l~~e~---~~~~~~~~~~~r~ 294 (314) T protein:vir:10 226 YGELFTRNNPGLTIRFLQFLD------NYDGAGGKAALAFEKSPLNMSIEIPEVTN--VLPAQP---KDLHFRYPVTSKA 294 (314) T ss_pred HHHHHHHhCCCcEEEEccccc------ccCCCcceEEEEEecCCcEEEEecCccce--eeccee---cCceEEEcceeee Confidence 221101111222344444322 1222333333333 222222222122222 222111 1112333355666 Q ss_pred -CcEEecccceEEEEeeccc Q lcl|Aclame:pro 376 -DVKATDSEALVAGSFSAIA 394 (408) Q Consensus 376 -d~~v~~~~a~~~l~~~~~~ 394 (408) |+.+.+|.||+.++.-+.+ T Consensus 295 ~Gv~i~~P~ai~~~dGI~~~ 314 (314) T protein:vir:10 295 TGLIVYRPLTMAVIKGITFA 314 (314) T ss_pred EEEEEECcceeEeeeeeecC Confidence 5689999999999988887 No 173 >protein:vir:98566 Length: 355 # NCBI annotation: gp5 # Family: family:all:201 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958060;genbank:gi:41057357;genbank:GeneID:2744237 Probab=96.87 E-value=0.00028 Score=40.02 Aligned_cols=293 Identities=12% Similarity=0.065 Sum_probs=145.7 Q ss_pred hHHHHHHHHHHHhhcchhhHHHHHHHHhhccc-cccCceecchhhhhhhhhhhhhhhhhhhhhceeecccCccceEEeec Q lcl|Aclame:pro 89 LKDKFVKDFVNMVRNPMAFMNTVSSKTETSGS-DSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKW 
167 (408) Q Consensus 89 ~~~~~~~a~~~~~~~~~~~~~~~~~~a~~~~t-~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~ 167 (408) .+.+-+..|..++....... .+.+ +.+-.+.|-+.....+.+.+.+.+.+++.++++++....|....... T Consensus 1 M~~~tr~~~~~y~~~~A~~n--------gv~~~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lgv 72 (355) T protein:vir:98 1 MRPETRFKFNAYLTRVAELN--------NISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGV 72 (355) T ss_pred CChHHHHHHHHHHHHHHHHh--------CCChhHccceeecCHHHHHHHHHHHHHHHHHhhcCceeccccceeeEeeecc Confidence 23333444444443321110 0110 12346788888889999999999999999999999988887665422 Q ss_pred cCCccccchh--cccccccccccccceeeeechheeeeehHHHHHHHhcch--HHHHHHHHHHHHHHHHHHHHHHHhhcc Q lcl|Aclame:pro 168 TDVTPLTVMD--AEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTA--ENILAWLSSWIAKKVVVTRNQAIIEVM 243 (408) Q Consensus 168 ~~~~~~~~~~--~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~--~~~~~~v~~~l~~~~~~~~~~~~~~g~ 243 (408) .+ +.++-+ +.+......+...++.-.+..++.---..|+.+.|+..+ .+|+.-+.+.+.++++.=.-.-.++|+ T Consensus 73 ~g--~iagrtdT~~~~~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~k~~ALD~i~IGfNG~ 150 (355) T protein:vir:98 73 TG--TIASTTDTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGT 150 (355) T ss_pred Cc--cccccccCCCCCCcccccccccCCCccEEEEeeeeeeecHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhcccce Confidence 11 111111 101111111222344445555555545566677776533 578888888888887755545555665 Q ss_pred cccc------------------------------------------------chhhhhhHHHHHH-HHHHhhhhhccC-- Q lcl|Aclame:pro 244 KAAP------------------------------------------------KKPTIAKFDDVIT-MINTAVDPAIIA-- 272 (408) Q Consensus 244 g~~~------------------------------------------------~~~~~~~~d~i~~-~~~~~l~~~~~~-- 272 (408) ..+. ..+.-.+.|.++. +++..+++.++. 
T Consensus 151 s~A~~Td~~~nPllqDVNkGWlQ~~Re~ap~~v~~~~~~~~~~~~~~~i~~G~~gdy~NLDAlV~D~~~~lI~~~~~~d~ 230 (355) T protein:vir:98 151 TRADTSDRTKNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDP 230 (355) T ss_pred eeeccCChhhCcCccccchhHHHHHHhcchhhhhhhhcccCccccccceeeCCCCCcccHHHHHHHHHhccCChHHhcCC Confidence 4221 0112234555554 344556777664 Q ss_pred CCEEEEcHHHHHH--HHhhhcccCcee--eccccccCCcccccccceEeeccccccccccCcceEEEEehhcceEeeecc Q lcl|Aclame:pro 273 TSSLLTNQSGLNK--LALVKTAEGKYL--LEPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRE 348 (408) Q Consensus 273 ~a~~~~n~~~~~~--l~~lkd~~G~~~--~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~ 348 (408) .-++||.++.++. ...+ +....|- ...++.. ...++-|+|.+.++. +|... +++=-|++.-..+..+ T Consensus 231 dLVvivG~dLla~k~~~l~-n~~~~ptE~~Aa~~i~-s~k~iGGlpa~~~Pf--fP~~~-----~lVT~L~NLsIY~Q~g 301 (355) T protein:vir:98 231 NLVAIVGRKLLADKYFPLV-NKQQENSESLAADIII-SQKRIGNLPAVRVPY--FPANA-----VLVTTLENLSIYFMDE 301 (355) T ss_pred CEEEEEchhhhHHHhhhHh-hccCCcHHHHHHHHHH-HhhhhCCceeEEccc--cCCCc-----eEEeeccccEEEEecC Confidence 4578888877552 3333 2222210 0001111 125789999998653 56533 4555555543434444 Q ss_pred ceEEEEec-cchhhhhhceeeEEEEeeeCcEEecccceEEEEeeccc-cCCCCccCCCc Q lcl|Aclame:pro 349 NMSLLPTN-IGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFSAIA-DQVGNFKTTTS 405 (408) Q Consensus 349 ~~~i~~~~-~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~-~~~~~~~~~~~ 405 (408) ..+=.+.+ ...+.++.++ ..--|..|-++..++.+.--..+ +..+.-+.+.| T Consensus 302 s~RR~~~d~p~r~rie~y~-----s~Ne~YvVEd~~~~a~ienI~~~~~~~~~~~~~~a 355 (355) T protein:vir:98 302 SHRRSIDENPKKDRVENYE-----SMNIDYVVEVYAAGCLLENITLGDFTAPAAPESGA 355 (355) T ss_pred cEEEEEEeccccccccchh-----hhcceeeeeccccEEEeeceeeeCCCCCcccccCC Confidence 43333222 1122222211 11234455666666655522222 22222222333 No 174 >protein:vir:80446 Length: 367 # NCBI annotation: BcepGomrgp07 # Family: family:all:1522 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210227;genbank:gi:146329919;genbank:GeneID:5123555 Probab=96.76 E-value=0.00036 
Score=39.48 Aligned_cols=281 Identities=10% Similarity=-0.000 Sum_probs=112.4 Q ss_pred hhhHHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhce---------eecccCccceEEeeccCCccccc Q lcl|Aclame:pro 105 MAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRV---------ESVSTSNGSRVYEKWTDVTPLTV 175 (408) Q Consensus 105 ~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~---------~~~~~~~g~~~~~~~~~~~~~~~ 175 (408) |...... +.-+ -..+|+.|...+.+...+.+.|.+-.-+ ....+....+|+...-++..... T Consensus 1 M~~~~~~-------T~l~--Dii~pEvF~~Yv~~~~~e~~~l~qSGiv~~d~~l~~~~~~gG~~v~iPf~~~L~g~~~n~ 71 (367) T protein:vir:80 1 MPDFNNQ-------VRLV--DAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNY 71 (367) T ss_pred Ccchhhh-------hhhh--hccchhhhhHHHhhhhhhhhhhhhcceeecCHHHHHHhhcCCCEEEeeeeccCCCCcccc Confidence 0000000 0000 0345555544444443333333211000 01122222333332222222212 Q ss_pred hhccccc-ccccccccceee--eechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhccc-------- Q lcl|Aclame:pro 176 MDAEDGK-IPDLDNPQLTII--KYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVMK-------- 244 (408) Q Consensus 176 ~~~E~~~-~~~~~~~~f~~v--~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~~g~g-------- 244 (408) +...... .+-....+..++ .+.-.+--..-.++..+-- .|..+.|.++++.--.+...+.++.... T Consensus 72 ~~d~~~~~~t~~kittg~~~a~v~~r~kaw~~~Dla~~lsG---~dpm~~Ia~qva~yW~r~~q~~Lla~L~Gvf~~~~a 148 (367) T protein:vir:80 72 GSDNPNVEAPIDGLGSGEMKTTKTWLNKAYGAMDLTAELAG---SNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLA 148 (367) T ss_pred CCCCCcccccccccccchheeeeehhcccchhhhHHHHhhC---chHHHHHHHHHHHHhhhhhHHHHHHHHHHhhccccc Confidence 1111111 110111122222 2222333334567776642 3567777777776655555444332111 Q ss_pred -----------------------------cccchhhhhhHHHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHhhhcccCc Q lcl|Aclame:pro 245 -----------------------------AAPKKPTIAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEGK 295 (408) Q Consensus 245 -----------------------------~~~~~~~~~~~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~lkd~~G~ 295 (408) ..........++.++++... +-.....-+.++||+.++..|++++= =. 
T Consensus 149 ~~~~~~~~~~~~~a~~~~~~~~~~~Dis~~t~~~~~~~s~~~~~~A~~~-lGD~~~~l~~i~mHS~V~~~L~~~~l--i~ 225 (367) T protein:vir:80 149 GNFATIKTRGRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFT-MGDHVGSIAAIAVHSMVYKRMTNNDE--IE 225 (367) T ss_pred cchhhhhhhhccccccccccCceeeeeeccCCCccceecHHHHHHHHHH-hccccccccEEEEchHHHHHHHhccc--cc Confidence 00011233556667777543 43445567789999999999987631 01 Q ss_pred eeeccccccCCcccccccceEeeccccccccccC-c-ceEEEEehhcceEeee---ccceEEEEeccchhhhhhceeeEE Q lcl|Aclame:pro 296 YLLEPDPTKPNSYLIKGKQVIVVADRWLPNTGST-V-YPLYYGDMSQAITLFD---RENMSLLPTNIGAGAFETDTTKIR 370 (408) Q Consensus 296 ~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~-~-~~~~~gd~~~~~~~~~---~~~~~i~~~~~~~~~f~~~~~~~r 370 (408) |+- +.-....-++++|++|++.|+.+....++. . .+.+||.= ++...+ ..+.+..+++...+. -++-.+ T Consensus 226 ~i~-~sd~~~~i~ty~G~~VIvDD~~Pv~~~~a~~~yttYlfg~G--Ai~~~~~~~~~~~E~~Rd~~~~~~--gG~d~L- 299 (367) T protein:vir:80 226 FIP-DSKGQLTIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGGA--AFGYADGAPQVPVAVGRRELRGNG--SGLEYI- 299 (367) T ss_pred ccc-CCCCccccceecceeEEEeCCCcccccCCCceEEEEEEecc--eeeecccCCccceecccchhhhcC--CceEEE- Confidence 111 111112236899999999766433222211 1 12455531 222111 123444444432111 122222 Q ss_pred EEeeeCcEEecccceEEEEeeccccCCCCcc---CCCccc-----C Q lcl|Aclame:pro 371 VIDRFDVKATDSEALVAGSFSAIADQVGNFK---TTTSTA-----V 408 (408) Q Consensus 371 ~~~r~d~~v~~~~a~~~l~~~~~~~~~~~~~---~~~~~~-----~ 408 (408) ...|. 
.++||.+|...+-.-++|....++ .+...+ | T Consensus 300 ~~Rr~--~~~hP~G~s~~~~~v~~~~~~~~~~~~~~~~~sPt~~eL 343 (367) T protein:vir:80 300 LERKE--WIVHPGGFNWLDADVTIPDNTGSPSGITSGPPAITLANL 343 (367) T ss_pred Eeeee--EEeecceeeecccccccccccccccccccccCCCChHHh Confidence 22222 688998887754332222211111 111111 1 No 175 >protein:vir:1829 Length: 355 # NCBI annotation: major capsid protein # Family: family:all:201 # MgeID: mge:324 # MgeName: 186 # Cross-refs: genbank:acc:NP_052253;genbank:gi:9634060;genbank:GeneID:1262428 Probab=96.74 E-value=0.00037 Score=39.38 Aligned_cols=294 Identities=13% Similarity=0.069 Sum_probs=146.0 Q ss_pred hHHHHHHHHHHHhhcchhhHHHHHHHHhhcc-ccccCceecchhhhhhhhhhhhhhhhhhhhhceeecccCccceEEeec Q lcl|Aclame:pro 89 LKDKFVKDFVNMVRNPMAFMNTVSSKTETSG-SDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKW 167 (408) Q Consensus 89 ~~~~~~~a~~~~~~~~~~~~~~~~~~a~~~~-t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~ 167 (408) .+...+..|..++..-.... .+. .+.+-.+.|-+.....+.+.+.+.+.+++.++++++....|....... T Consensus 1 M~~~tr~~~~~y~~~~A~~n--------gv~~~~~~~~Fsv~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lgv 72 (355) T protein:vir:18 1 MRQETRFKFNAYLTQLAKLN--------GISVDDVSKKFTVEPSVTQTLMNTVQASSAFLQMINILPVAEMKGEKIGVGV 72 (355) T ss_pred CChHHHHHHHHHHHHHHHHh--------CCChhHccceeccCHHHHHHHHHHHHHHHHHhhcCceeccccceeeEEeecc Confidence 23334444544443321111 010 112346788888999999999999999999999999988887664422 Q ss_pred cCCccccchh--cccccccccccccceeeeechheeeeehHHHHHHHhcch--HHHHHHHHHHHHHHHHHHHHHHHhhcc Q lcl|Aclame:pro 168 TDVTPLTVMD--AEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTA--ENILAWLSSWIAKKVVVTRNQAIIEVM 243 (408) Q Consensus 168 ~~~~~~~~~~--~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~--~~~~~~v~~~l~~~~~~~~~~~~~~g~ 243 (408) . 
.+.++-+ ..+......+...++.-.+..++.---..|+.+.|+..+ .+|+.-+.+.+.++++.=.-.-.++|+ T Consensus 73 ~--g~iagrtdT~~~~~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~k~~ALD~i~IGfNG~ 150 (355) T protein:vir:18 73 T--GTIASTTDTSGDKERQTADFTALESNKYECNQINFDFHLTYKRLDLWARFQDFQRRIRDAIVQRQALDFIMAGFNGT 150 (355) T ss_pred C--cceeeccccCCCCCcccccccccCCCccEEEEeeeeeeecHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhcccce Confidence 2 1111111 101111111222344445555555555566777776533 578888888888887755555556665 Q ss_pred ccccc------------------------------------------------hhhhhhHHHHHHHH-HHhhhhhccC-- Q lcl|Aclame:pro 244 KAAPK------------------------------------------------KPTIAKFDDVITMI-NTAVDPAIIA-- 272 (408) Q Consensus 244 g~~~~------------------------------------------------~~~~~~~d~i~~~~-~~~l~~~~~~-- 272 (408) ..+.. .+.-.+.|.++..+ +..+++.++. T Consensus 151 s~A~~Td~~~nPllqDVNkGWlQ~~Re~ap~rV~~~~~~~~~~~~~~~i~~G~~gdy~NLDAlV~d~~~~lI~~~~~~d~ 230 (355) T protein:vir:18 151 TRADTSDRVKNPMLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENLDALVMDGTNTLIDEIYQDDP 230 (355) T ss_pred eeeccCChhhCcCccccchhHHHHHHhcchhhhhccccccccccccceeeecCCCCcccHHHHHHHHHhccCChHHhcCC Confidence 42210 11123455655433 4456777664 Q ss_pred CCEEEEcHHHHHH--HHhhhcccCcee--eccccccCCcccccccceEeeccccccccccCcceEEEEehhcceEeeecc Q lcl|Aclame:pro 273 TSSLLTNQSGLNK--LALVKTAEGKYL--LEPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRE 348 (408) Q Consensus 273 ~a~~~~n~~~~~~--l~~lkd~~G~~~--~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~ 348 (408) .-++||.++.++. +..+ +..+.|- ...++. -...++-|+|.+.++. +|... 
+++=-|++.-..+..+ T Consensus 231 dLVvivG~dLla~k~~~l~-n~~~~ptE~~Aa~~i-~s~k~iGGlpa~~~Pf--fP~~~-----~lVT~L~NLsIY~Q~g 301 (355) T protein:vir:18 231 KLVAIVGRKLLADKYFPLV-NKQQENTESLAADII-ISQKRIGNLPAVRVPY--FPANA-----VFVTTLENLSIYFMDE 301 (355) T ss_pred CEEEEEchhhhHHHHhHHh-hccCChHHHHHHHHH-HHHHhhCCceeEEccc--cCCCc-----eEEeeccccEEEEecC Confidence 4578888877552 2223 2222211 000000 0125789999998653 56533 4555555543434444 Q ss_pred ceEEEEecc-chhhhhhceeeEEEEeeeCcEEecccceEEEEeeccccCCCCccCCCcc Q lcl|Aclame:pro 349 NMSLLPTNI-GAGAFETDTTKIRVIDRFDVKATDSEALVAGSFSAIADQVGNFKTTTST 406 (408) Q Consensus 349 ~~~i~~~~~-~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~~~~~~~~~ 406 (408) ..+=.+.+. ..+.++.++ ..--|..|-++..++.+.--..++.+..-..-.++ T Consensus 302 s~RR~~~d~p~r~rie~y~-----s~Ne~YvVEd~~~~a~ieni~~~~~~~~~~~~~g~ 355 (355) T protein:vir:18 302 SHRRSIDENPKKDRVENYE-----SMNIDYVVEAYAAGCLLENITLGDFTAPAAPEGGE 355 (355) T ss_pred cEEEEEEeccccccccchh-----hhcceeeeeccccEEEEeeeeecCCCCcccccCCC Confidence 433332221 112222211 11234455566666555533333322211112222 No 176 >protein:vir:1153 Length: 338 # NCBI annotation: predicted major capsid protein # Family: family:all:201 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490602;genbank:gi:17313222;genbank:GeneID:927319 Probab=96.71 E-value=0.00039 Score=39.25 Aligned_cols=281 Identities=12% Similarity=0.099 Sum_probs=148.1 Q ss_pred hHHHHHHHHHHHhhcchhhHHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeecccCccceEEeecc Q lcl|Aclame:pro 89 LKDKFVKDFVNMVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWT 168 (408) Q Consensus 89 ~~~~~~~a~~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~ 168 (408) .+..-+..|..++..-... ....+.+..+.|.+.....+.+.+.+.+.+++.++++++....|........ 
T Consensus 1 M~~~tr~~~~~y~~~~A~~---------ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~Invv~V~e~~Ge~v~lg~~ 71 (338) T protein:vir:11 1 MRNETRKQFDAYLAQLAKL---------NGVNSAVQTFAVEPSVQQKLEQRIQESSEFLKQINVYGVDELQGEKIGIGVS 71 (338) T ss_pred CCHHHHHHHHHHHHHHHHH---------hCCCcccceeeeCHHHHHHHHHHHHHHHHhhccCceecccceeeeEeeeccC Confidence 2233334444444321111 1112344578899999999999999999999999999999888876644222 Q ss_pred CCccccchh--cccccccccccccceeeeechheeeeehHHHHHHHhcc--hHHHHHHHHHHHHHHHHHHHHHHHhhccc Q lcl|Aclame:pro 169 DVTPLTVMD--AEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDT--AENILAWLSSWIAKKVVVTRNQAIIEVMK 244 (408) Q Consensus 169 ~~~~~~~~~--~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds--~~~~~~~v~~~l~~~~~~~~~~~~~~g~g 244 (408) .+.++-+ ..+......+...++.-.+..++.---..|+.+.|+.. ..+|+.-+.+.+.++++.=.-.-.++|+. T Consensus 72 --g~iagrtdT~~~~~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~k~~ALD~i~IGfnG~s 149 (338) T protein:vir:11 72 --GTIASRTDTTGDGVRKPRDVSALDNQRYECKHTDFDTAITYAMLDAWAKFPEFQALLRDAILKRQALDRLMIGFNGTS 149 (338) T ss_pred --ccccccccCCCCCccccccccccCCCccEEEEeeeeeeecHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhccccee Confidence 1111111 11111111111133444455555544456667777653 35788888888888877555444555554 Q ss_pred cccc--------------------------------------------hhhhhhHHHHHHH-HHHhhhhhccC--CCEEE Q lcl|Aclame:pro 245 AAPK--------------------------------------------KPTIAKFDDVITM-INTAVDPAIIA--TSSLL 277 (408) Q Consensus 245 ~~~~--------------------------------------------~~~~~~~d~i~~~-~~~~l~~~~~~--~a~~~ 277 (408) .+.. .+.-.+.|.++.. ++..+++.++. 
.-++| T Consensus 150 ~A~~Td~~~nPllqDVNkGWlQ~~Re~ap~rv~~~~~~~~~i~i~~g~~gdy~nLDalV~d~~~~lI~~~~~~d~dLVvi 229 (338) T protein:vir:11 150 AAATTNRAANPLLQDVNIGWFQQYRNNAPARVLKEGKTTGKVVVGNGADADYKNLDALVFDVVSSLIDPWHRRDPGLVVI 229 (338) T ss_pred eccCCChhhCcCccccchhHHHHHHhhhhhhhhhcccccceeeecCCCCCccccHHHHHHHHHhccCChHHhcCCCEEEE Confidence 2110 0123445666543 44567887765 45788 Q ss_pred EcHHHHHH--HHhhhcccCceeec---cccccCCcccccccceEeeccccccccccCcceEEEEehhcceEeeeccceEE Q lcl|Aclame:pro 278 TNQSGLNK--LALVKTAEGKYLLE---PDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSL 352 (408) Q Consensus 278 ~n~~~~~~--l~~lkd~~G~~~~~---~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i 352 (408) |.++.++. ...+ +. +++.-. .++. ....++-|+|.+.++. +|... +++=-|++.-..+..+..+= T Consensus 230 vG~dLladk~~~l~-n~-~~~ptE~~Aa~~~-~s~k~iGGlpa~~~Pf--fP~~~-----~lVT~L~NLsIY~Q~gs~RR 299 (338) T protein:vir:11 230 LGRELVHDKYFPMV-NK-DQPATEKIATDLI-LSQKRMGGLPPVEVPY--VPEKG-----LMVTTLKNLSLYWQIGGRRR 299 (338) T ss_pred EchhhhHHHHhHHH-hc-CCChHHHHHHHHH-HHhhhhCCceeEEccc--cCCCc-----eEEeeccccEEEEecCcEEE Confidence 88877652 2222 22 221110 0000 1124799999998653 56533 45555555444444444333 Q ss_pred EEecc-chhhhhhceeeEEEEeeeCcEEecccceEEEEeecccc Q lcl|Aclame:pro 353 LPTNI-GAGAFETDTTKIRVIDRFDVKATDSEALVAGSFSAIAD 395 (408) Q Consensus 353 ~~~~~-~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~ 395 (408) .+.+. 
..+.++.+ -..--|..|-++..++.+.-...++ T Consensus 300 ~~~d~p~r~rie~y-----~s~Ne~YvVEd~~~~a~ieni~~~~ 338 (338) T protein:vir:11 300 YLKEVPEKNRIENY-----ESSNDAYVVEDYGLGCLVENIEVAE 338 (338) T ss_pred EEEeccccccccch-----hhhccceeeeccccEEEeecceecC Confidence 32221 12222221 1122355677777777776555555 No 177 >protein:vir:5694 Length: 357 # NCBI annotation: gpN # Family: family:all:201 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839853;genbank:gi:30065708;genbank:GeneID:1260602 Probab=96.44 E-value=0.00062 Score=38.17 Aligned_cols=295 Identities=12% Similarity=0.076 Sum_probs=146.7 Q ss_pred hHHHHHHHHHHHhhcchhhHHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeecccCccceEEeecc Q lcl|Aclame:pro 89 LKDKFVKDFVNMVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWT 168 (408) Q Consensus 89 ~~~~~~~a~~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~ 168 (408) .+..-+..|..++..-.... .+.. .+.+-.+.|-+.....+...+.+.+.+++.++++++....|........ T Consensus 1 M~~~tr~~~~~y~~~~A~~n-gv~~------~d~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lg~~ 73 (357) T protein:vir:56 1 MRQETRFKFNAYLSRVAELN-GIDA------GDVSKKFTVEPSVTQTLMNTMQESSDFLTRINIVPVSEMKGEKIGIGVT 73 (357) T ss_pred CChHHHHHHHHHHHHHHHHh-CCCh------HHhcceeecCHHHHHHHHHHHHHHHHHhccCCccccccceeeEEecccC Confidence 23333444444443321110 0000 0123467888899999999999999999999999999888876644221 Q ss_pred CCccccchh--cccccccccccccceeeeechheeeeehHHHHHHHhcch--HHHHHHHHHHHHHHHHHHHHHHHhhccc Q lcl|Aclame:pro 169 DVTPLTVMD--AEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTA--ENILAWLSSWIAKKVVVTRNQAIIEVMK 244 (408) Q Consensus 169 ~~~~~~~~~--~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~--~~~~~~v~~~l~~~~~~~~~~~~~~g~g 244 (408) .+.++-+ +-+......+...++.-.+..++.---..|+.+.|...+ .+|..-+.+.+.++++.=.-.-.++|+. 
T Consensus 74 --g~iagrtdT~~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts 151 (357) T protein:vir:56 74 --GSIASTTDTAGGTERQPKDFSKLASNKYECDQINFDFYIRYKTLDLWARYQDFQLRVRNAIIKRQSLDFIMAGFNGVK 151 (357) T ss_pred --ccccccccCCCCCCcccccccccCCCccEEEEeeecccccHHHHHHHhcChhHHHHHHHHHHHHHhhccceeccccee Confidence 1111111 111111111112344444555544444566666666533 4777778777777776544344455543 Q ss_pred ccc------------------------------------------------chhhhhhHHHHHH-HHHHhhhhhccC--C Q lcl|Aclame:pro 245 AAP------------------------------------------------KKPTIAKFDDVIT-MINTAVDPAIIA--T 273 (408) Q Consensus 245 ~~~------------------------------------------------~~~~~~~~d~i~~-~~~~~l~~~~~~--~ 273 (408) .+. ..+.-.+.|.++. +++..+++.++. . T Consensus 152 ~A~~Td~~~nPllqDVN~GWlQ~~Re~ap~rVm~~~~~~~g~~~~~~i~~G~~gdy~NLDalV~D~~~~lI~~~~~~d~d 231 (357) T protein:vir:56 152 RAETSDRSSNPMLQDVAVGWLQKYRNEAPARVMSKVTDEEGHTTSEVIRVGKGGDYASLDALVMDATNNLIEPWYQEDPD 231 (357) T ss_pred eeccCChhhCcCccccchhHHHHHHhhchhhhhccccccCCccccceeeecCCCCcccHHHHHHHHHhccCChHHhcCCC Confidence 211 0112334566654 445567887765 4 Q ss_pred CEEEEcHHHHHH-HHhhhcccCcee--eccccccCCcccccccceEeeccccccccccCcceEEEEehhcceEeeeccce Q lcl|Aclame:pro 274 SSLLTNQSGLNK-LALVKTAEGKYL--LEPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENM 350 (408) Q Consensus 274 a~~~~n~~~~~~-l~~lkd~~G~~~--~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~ 350 (408) -++||-++.++. -..|-+..+.|- ...++. ....++-|+|.+.++. +|... +++=-|++.-..+.++.. 
T Consensus 232 LVvivG~dLla~k~~~l~n~~~~pTE~~Aa~~i-~s~k~iGGl~a~~~Pf--FP~~~-----llVT~L~NLsIY~Q~gs~ 303 (357) T protein:vir:56 232 LVVIVGRQLLADKYFPIVNKEQDNSEMLAADVI-ISQKRIGNLPAVRVPY--FPADA-----MLITKLENLSIYYMDDSH 303 (357) T ss_pred EEEEEchhhhhhhhhhHhhccCChHHHHHHHHH-HHhhhhCCceeEEccc--cCCCc-----eEEeeccccEEEEecCcE Confidence 678888877653 122222222211 000000 1124788999998653 56543 455555543333444443 Q ss_pred EEEEec-cchhhhhhceeeEEEEeeeCcEEecccceEEEEeecc---ccCCCCccCCCc Q lcl|Aclame:pro 351 SLLPTN-IGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFSAI---ADQVGNFKTTTS 405 (408) Q Consensus 351 ~i~~~~-~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~---~~~~~~~~~~~~ 405 (408) +=.+.+ ...+.++.++ ..--|..|-++..++.+.-... ++++...+...| T Consensus 304 RR~~~d~p~r~riE~y~-----s~Ne~YvVEd~~~~a~iE~i~i~~~~~~~~~~~~~~a 357 (357) T protein:vir:56 304 RRVIEENPKLDRVENYE-----SMNIDYVVEDYAAGCLVEKIKVGDFSTPAKATEEPGA 357 (357) T ss_pred EEEEEeccccccccchh-----hhcceeeeeccccEEEeeeeeeccCCCCcccCCCCCC Confidence 333222 1122222211 1123456666666666653333 333333344444 No 178 >protein:vir:104011 Length: 337 # NCBI annotation: P2 family phage major capsid protein # Family: family:all:201 # MgeID: mge:1665 # MgeName: phi52237 # Cross-refs: genbank:acc:YP_293748;genbank:gi:72537718;genbank:GeneID:3608142 Probab=96.39 E-value=0.00067 Score=37.99 Aligned_cols=284 Identities=11% Similarity=0.109 Sum_probs=146.8 Q ss_pred hHHHHHHHHHHHhhcchhhHHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeecccCccceEEeecc Q lcl|Aclame:pro 89 LKDKFVKDFVNMVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWT 168 (408) Q Consensus 89 ~~~~~~~a~~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~ 168 (408) .+...+..|.+++..-.... ...+.+-.+.|-+.....+.+.+.+.+.+++.++++++....|........ 
T Consensus 1 M~~~tr~~~~~y~~~~A~~n---------gv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~Invv~V~e~~Ge~v~lg~~ 71 (337) T protein:vir:10 1 MRKETRQAYEKYAAQIAKLN---------DTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVS 71 (337) T ss_pred CChHHHHHHHHHHHHHHHhc---------ChhhhcceeeecHHHHHHHHHHHHHHHHhhccCceeccccceeeEEeeccC Confidence 23334444555444321111 011233467788889999999999999999999999999888876644222 Q ss_pred CCccccchhcccccccccccccceeeeechheeeeehHHHHHHHhcc--hHHHHHHHHHHHHHHHHHHHHHHHhhccccc Q lcl|Aclame:pro 169 DVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDT--AENILAWLSSWIAKKVVVTRNQAIIEVMKAA 246 (408) Q Consensus 169 ~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds--~~~~~~~v~~~l~~~~~~~~~~~~~~g~g~~ 246 (408) +....-.-.+.+.-.| .+...++.-.+..++.---..|+.+.|+.. ..+|+.-+.+.+.++++.=.-.-.++|+..+ T Consensus 72 g~iagrt~t~~~~R~~-~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfnG~s~A 150 (337) T protein:vir:10 72 GPIASRTDTTKAARQP-IDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAA 150 (337) T ss_pred cceeeeecCCCCcccc-ccccccCCCccEEEEeeeeeeccHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhcccceeec Confidence 1111001111111111 122234444555555554556677777653 3578888888888887765555556665422 Q ss_pred c-------------------------------------------chhhhhhHHHHHH-HHHHhhhhhccC--CCEEEEcH Q lcl|Aclame:pro 247 P-------------------------------------------KKPTIAKFDDVIT-MINTAVDPAIIA--TSSLLTNQ 280 (408) Q Consensus 247 ~-------------------------------------------~~~~~~~~d~i~~-~~~~~l~~~~~~--~a~~~~n~ 280 (408) . ..+.-.+.|.++. +++..+++.++. 
.-++||.+ T Consensus 151 ~~Td~~~nPllqDVNkGWlQ~~Re~ap~rV~~~~~~~~~~i~iG~~gdy~nLDalV~D~~~~lI~~~~~~d~~LVvivG~ 230 (337) T protein:vir:10 151 ATTDRQANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGR 230 (337) T ss_pred cCCChhhCcCccccchhHHHHHHhcchhhhhccccccCcceeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEch Confidence 1 0112234556544 344567887765 45788888 Q ss_pred HHHHH--HHhhhcccCcee--eccccccCCcccccccceEeeccccccccccCcceEEEEehhcceEeeeccceEEEEec Q lcl|Aclame:pro 281 SGLNK--LALVKTAEGKYL--LEPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTN 356 (408) Q Consensus 281 ~~~~~--l~~lkd~~G~~~--~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~ 356 (408) +.++. ...+. ..+.|- ...++. -...++-|+|.+.++. +|... +++=-|++.-..+..+..+=.+.+ T Consensus 231 dLladk~~~l~n-~~~~ptE~~Aa~~i-~s~k~iGGlpa~~~Pf--fP~~~-----~lVT~L~NLsIY~Q~gs~RR~~~d 301 (337) T protein:vir:10 231 ELLHDKYFPIVN-ATQAPTERLAADLI-VSQKRIGNLPAVRVPF--FPKRA-----LMVTKLSNLSIYYQEGARRRTLKE 301 (337) T ss_pred hhhhHHhhHHhc-cCCCcHHHHHHHHH-HHhhhhCCceeEEccc--cCCCc-----eEEeechhcEEEEecCcEEEEEEE Confidence 77652 22222 211110 000000 0124789999998653 56533 455555554444444443333222 Q ss_pred c-chhhhhhceeeEEEEeeeCcEEecccceEEEEeeccccC Q lcl|Aclame:pro 357 I-GAGAFETDTTKIRVIDRFDVKATDSEALVAGSFSAIADQ 396 (408) Q Consensus 357 ~-~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~~ 396 (408) . 
..+.++.++ ..--|..|-++..++.+.--...++ T Consensus 302 ~p~r~rie~y~-----s~Ne~YvVEd~~~~a~ienI~~~~a 337 (337) T protein:vir:10 302 VPERDRIENYE-----SSNDAYVVEDFGCGCVAENIELAAA 337 (337) T ss_pred ccccccccchh-----hccceeeeeccccEEEEeceeecCC Confidence 1 122222211 1223556677777777663333333 No 179 >protein:vir:6061 Length: 357 # NCBI annotation: gpN # Family: family:all:201 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878202;genbank:gi:33438901;genbank:GeneID:1457736 Probab=96.38 E-value=0.00069 Score=37.93 Aligned_cols=294 Identities=12% Similarity=0.088 Sum_probs=145.9 Q ss_pred hHHHHHHHHHHHhhcchhhHHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeecccCccceEEeecc Q lcl|Aclame:pro 89 LKDKFVKDFVNMVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWT 168 (408) Q Consensus 89 ~~~~~~~a~~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~ 168 (408) .+..-+..|..++..-.... .+.. .+.+-.+.|-+.....+...+.+.+.+++.++++++....|........ T Consensus 1 M~~~tr~~~~~y~~~~A~~n-gv~~------~d~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lg~~ 73 (357) T protein:vir:60 1 MRQETRFKFNAYLSRVAELN-GIDA------GDVSKKFTVEPSVTQTLMNTMQESSDFLTRINIVPVSEMKGEKIGIGVT 73 (357) T ss_pred CChHHHHHHHHHHHHHHHHh-CCCh------HHhcceeecCHHHHHHHHHHHHHHHHHhccCCccccccceeeEEecccC Confidence 23333444444443321110 0000 0123467888899999999999999999999999999888876644221 Q ss_pred CCccccchh--cccccccccccccceeeeechheeeeehHHHHHHHhcch--HHHHHHHHHHHHHHHHHHHHHHHhhccc Q lcl|Aclame:pro 169 DVTPLTVMD--AEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTA--ENILAWLSSWIAKKVVVTRNQAIIEVMK 244 (408) Q Consensus 169 ~~~~~~~~~--~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~--~~~~~~v~~~l~~~~~~~~~~~~~~g~g 244 (408) .+.++-+ +-+......+...++.-.+..++.---..|+.+.|...+ .+|..-+.+.+.++++.=.-.-.++|+. 
T Consensus 74 --g~iagrtdT~~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts 151 (357) T protein:vir:60 74 --GSIASTTDTAGGTERQPKDFSKLASNKYECDQINFDFYIRYKTLDLWARYQDFQLRVRNAIIKRQSLDLIMAGFNGVR 151 (357) T ss_pred --cccccccccCCCCCcccccccccCCCccEEEEeeeeccccHHHHHHHhcChhHHHHHHHHHHHHHhhccceeccccee Confidence 1111111 111111111112344444555554444566666666533 4777778777777776544344455543 Q ss_pred ccc------------------------------------------------chhhhhhHHHHHH-HHHHhhhhhccC--C Q lcl|Aclame:pro 245 AAP------------------------------------------------KKPTIAKFDDVIT-MINTAVDPAIIA--T 273 (408) Q Consensus 245 ~~~------------------------------------------------~~~~~~~~d~i~~-~~~~~l~~~~~~--~ 273 (408) .+. ..+.-.+.|.++. +++..+++.++. . T Consensus 152 ~A~~Td~~~nPllqDVN~GWlQ~~Re~ap~rVm~~~~~~~g~~~~~~i~~G~~gdy~NLDalV~D~~~~lI~~~~~~d~d 231 (357) T protein:vir:60 152 RAETSDRSSNQMLQDVAVGWLQKYRNEAPARVMSKVTDEEGHTTSEVIRVGKGGDYASLDALVMDATNNLIEPWYQEDPD 231 (357) T ss_pred eeccCChhhCcCccccchhHHHHHHhhchhhhhccccccCCccccceeeecCCCCcccHHHHHHHHHhccCChHHhcCCC Confidence 211 0112334566654 445567887765 4 Q ss_pred CEEEEcHHHHHH--HHhhhcccCcee--eccccccCCcccccccceEeeccccccccccCcceEEEEehhcceEeeeccc Q lcl|Aclame:pro 274 SSLLTNQSGLNK--LALVKTAEGKYL--LEPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDREN 349 (408) Q Consensus 274 a~~~~n~~~~~~--l~~lkd~~G~~~--~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~ 349 (408) -++||.++.++. +..+ +..+.|- ...++. ....++-|+|.+.++. +|... +++=-|++.-..+.++. 
T Consensus 232 LVvivG~dLla~k~~~l~-n~~~~pTE~~Aa~~i-~s~k~iGGl~a~~~Pf--FP~~~-----llVT~L~NLsIY~Q~gs 302 (357) T protein:vir:60 232 LVVIVGRQLLADKYFPIV-NREQDNSEMLAADVI-ISQKRIGNLPAVRVPY--FPADA-----MLITKLENLSIYYMDDS 302 (357) T ss_pred EEEEEchhhhhHHhhhHh-hcCCChHHHHHHHHH-HHhhhhcCcceEEccc--cCCCc-----eEEeeccccEEEEecCc Confidence 678888877652 3333 2222210 000000 1124788999998653 56543 45555555333344444 Q ss_pred eEEEEec-cchhhhhhceeeEEEEeeeCcEEecccceEEEEeecc---ccCCCCccCCCc Q lcl|Aclame:pro 350 MSLLPTN-IGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFSAI---ADQVGNFKTTTS 405 (408) Q Consensus 350 ~~i~~~~-~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~---~~~~~~~~~~~~ 405 (408) .+=.+.+ ...+.++.++ ..--|..|-++..++.+.-... .++........| T Consensus 303 ~RR~~~d~p~r~riE~y~-----s~Ne~YvVEd~~~~a~iE~i~~~~~~~pa~~~~~~~a 357 (357) T protein:vir:60 303 HRRVIEENPKLDRVENYE-----SMNIDYVVEDYAAGCLVEKIKVGDFSTPAKATAEPGA 357 (357) T ss_pred EEEEEEeccccccccchh-----hhcceeeeeccccEEEeeeeeeccCcccccCCCCCCC Confidence 3333222 1122222211 1123456666666666653333 223333333333 No 180 >protein:vir:1383 Length: 421 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612835;genbank:gi:20065969;genbank:GeneID:935826 Probab=96.35 E-value=0.00071 Score=37.84 Aligned_cols=367 Identities=7% Similarity=-0.039 Sum_probs=102.5 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHhhhcccccc Q lcl|Aclame:pro 1 MGVKLTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQA-EQVVNMREEEK 79 (408) Q Consensus 1 M~~~~~i~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~ 79 (408) ..++..+++|.++++++.+++++..++... ++.+...+++++++++++.++++++.....+..... ........... 
T Consensus 8 kel~~~~~el~~~~~~~~~~~~~~~~e~~~--~e~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 85 (421) T protein:vir:13 8 KELRAKKKELEEKRCGIVEEIRSLAKEKKE--EEARSKALEREKIEARMEIIEEEIESVMTAIDEERKNTNFTGGRVIIN 85 (421) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhccch--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccc Confidence 445566777777777666666655443221 122223456677777777666666655544433221 11111111111 Q ss_pred -cccccchhhhHHHHHHHHH-----HHhhcchhhHHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhcee Q lcl|Aclame:pro 80 -GPLNKSENELKDKFVKDFV-----NMVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVE 153 (408) Q Consensus 80 -~~~~~~~~~~~~~~~~a~~-----~~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~ 153 (408) ...............+... ...+...... .........-.. -.++.......+..+-...++......+ T Consensus 86 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ra~~t~~----~gg~liP~~~~~-~Ii~~~~~~~~l~~l~~~~~~~~~~~~~ 160 (421) T protein:vir:13 86 GDSKEEKRSLQLSAMSKTIRGIQLSEEERDIMSST----NNGAVIPQEFVN-EFEKLKEGYPSLKEHCHVIPVNRNAGKM 160 (421) T ss_pred cchhHHHHHHHHHHHHHhhhccchhHHHhhccccC----CcceecchhhHH-HHHHHHHhhhhhhhhceeeeccCCceEE Confidence 1111111111111111110 0111100000 000000000000 0111111111111111111111111112 Q ss_pred ecccCccceEEeeccCCccccchhcccccccccccccceeeeechhe-eee-ehHHHHHHHhcchHHHHHHHHHHHHHHH Q lcl|Aclame:pro 154 SVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKR-YAG-IITATNTSLKDTAENILAWLSSWIAKKV 231 (408) Q Consensus 154 ~~~~~~g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~-~~~-~~~iS~ell~ds~~~~~~~v~~~l~~~~ 231 (408) ++......-.+-....+ ....+. .+..+..++...++...- +.. ++.-|..-+.. 
+ +...|.+.+...+ T Consensus 161 ~~~~~~~~~~~~~~~E~----~~~~~s--~~~f~~i~~~~~k~~~~v~iS~ell~ds~~~l~~--~-i~~~la~~~~~~~ 231 (421) T protein:vir:13 161 PVRAGASVDKLANLAKD----TELVKA--MLKTQPMAYDIDDYGLLAPIDNSLLEDSEINFLE--F-VNEEFAEFAVNTE 231 (421) T ss_pred EEeecCCccceeecccc----cccccc--ccceeEEEeeeeeeEeehhhhHHHHhhhHHHHHH--H-HHHHHHHHHHHHh Confidence 21111111001001111 111121 111222222221111110 110 01111110111 1 2333334444333 Q ss_pred HHHHHH---HHhhccccccchhhhhhHHHHHHHHH----------------HhhhhhccCCCEEEEcHHHHHHHHhhhcc Q lcl|Aclame:pro 232 VVTRNQ---AIIEVMKAAPKKPTIAKFDDVITMIN----------------TAVDPAIIATSSLLTNQSGLNKLALVKTA 292 (408) Q Consensus 232 ~~~~~~---~~~~g~g~~~~~~~~~~~d~i~~~~~----------------~~l~~~~~~~a~~~~n~~~~~~l~~lkd~ 292 (408) ....-. .+++..+... ....-+++..+. ..|..--..+..+++.+.....-..| T Consensus 232 ~~~i~~~~~g~~~~~~~~~----~d~i~~~~~~l~~~~~~~a~~v~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~tl--- 304 (421) T protein:vir:13 232 NAEIVKQAKAVLAEETIND----YAGLVKTINSLVPNARKRAIIVTNSDGRAYLDGLMDKQGRPLLKELSDGGDLVF--- 304 (421) T ss_pred hhhHhhhhhhccccccccc----hHHHHHHHHHhhhhhcCCCEEEEcHHHHHHHHHhhcCCCceeecCcCCCCCcee--- Confidence 333221 2222222211 111111111110 00111111233344433110000001 Q ss_pred cCceeeccc-cccCCcccccccceEeeccccccccccCcceEEEEehhcceEee------eccceEEEEeccchhhhhhc Q lcl|Aclame:pro 293 EGKYLLEPD-PTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLF------DRENMSLLPTNIGAGAFETD 365 (408) Q Consensus 293 ~G~~~~~~~-~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~------~~~~~~i~~~~~~~~~f~~~ 365 (408) .|.|+...+ ...+... -.++++-+ . +..+.+++.+..-+.. .+..+.+... 
+.-| T Consensus 305 ~G~pV~~~~~~~~~~~~---~~~~~~gd------~---~~~~~~~~~~~~~v~~~~~~~f~~~~~~~r~~------~r~d 366 (421) T protein:vir:13 305 KGRPVIELEESIFDVGD---ETKFIVSD------F---KTLIKFMDRKQYLIDQSKEAGYTKNETIARII------ERFD 366 (421) T ss_pred cceeeEEeccccccCCC---ceEEEEEe------c---cccEEEEEecceEEEeecccccccCeeEEEEE------eeec Confidence 244443221 1101000 01222211 0 0112333333221111 1111111111 0111 Q ss_pred eeeEEEEeeeCcEEecccceEEEEeeccccCCCCccCCCcccC Q lcl|Aclame:pro 366 TTKIRVIDRFDVKATDSEALVAGSFSAIADQVGNFKTTTSTAV 408 (408) Q Consensus 366 ~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~~~~~~~~~~~ 408 (408) ...+....-..+.+..+.+|+.+.-.+.+..+..+.....+-- T Consensus 367 ~~~~~~~a~~~~~~~~~~a~v~~~~~~~~~~~~~~~~~~~~~~ 409 (421) T protein:vir:13 367 VNSPLDKSSDAEKIRKFGVIVKLQEVLKSSPRSGKNKNESKEE 409 (421) T ss_pred ceeecchhhheeeecccceeeccccccCCCCcCCCCccccchh Confidence 1222222223457777788888755454444444444444433 No 181 >protein:vir:79171 Length: 337 # NCBI annotation: gp2, phage major capsid protein, P2 family # Family: family:all:201 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111033;genbank:gi:134288740;genbank:GeneID:4960690 Probab=96.28 E-value=0.00079 Score=37.59 Aligned_cols=284 Identities=11% Similarity=0.111 Sum_probs=146.5 Q ss_pred hHHHHHHHHHHHhhcchhhHHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeecccCccceEEeecc Q lcl|Aclame:pro 89 LKDKFVKDFVNMVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWT 168 (408) Q Consensus 89 ~~~~~~~a~~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~ 168 (408) .+...+..|.+++..-..... ..+.+-.+.|-+.....+.+.+.+.+.+++.++++++....|........ 
T Consensus 1 M~~~tr~~~~~y~~~~A~~ng---------v~~~~~~FsV~P~v~q~L~~~i~ess~FL~~Invv~V~e~~Ge~v~lg~~ 71 (337) T protein:vir:79 1 MRKETRQAYEKYAAQIAKLND---------TGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVS 71 (337) T ss_pred CChHHHHHHHHHHHHHHHhcC---------hhhhcceeeecHHHHHHHHHHHHHHHHhhccCceeccccceeeEEeeccC Confidence 233344445555443211110 11223357788888899999999999999999999999888876644222 Q ss_pred CCccccchhcccccccccccccceeeeechheeeeehHHHHHHHhcc--hHHHHHHHHHHHHHHHHHHHHHHHhhccccc Q lcl|Aclame:pro 169 DVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDT--AENILAWLSSWIAKKVVVTRNQAIIEVMKAA 246 (408) Q Consensus 169 ~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds--~~~~~~~v~~~l~~~~~~~~~~~~~~g~g~~ 246 (408) +....-.-.+.+.-.| .+...++.-.+..++.---..|+.+.|+.. ..+|+.-+.+.+.++++.=.-.-.++|+..+ T Consensus 72 g~iagrt~t~~~~R~~-~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfnG~s~A 150 (337) T protein:vir:79 72 GPIASRTDTTKAARQP-IDPTALDSNRYRCEKTDYDTAIPYRKLDAWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAA 150 (337) T ss_pred cceeeeecCCCCcccc-ccccccCCCccEEEEeeeeeeccHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhcccceeec Confidence 1111001111111111 122234444555555554456677777653 3578888888888887765555556665422 Q ss_pred c-------------------------------------------chhhhhhHHHHHH-HHHHhhhhhccC--CCEEEEcH Q lcl|Aclame:pro 247 P-------------------------------------------KKPTIAKFDDVIT-MINTAVDPAIIA--TSSLLTNQ 280 (408) Q Consensus 247 ~-------------------------------------------~~~~~~~~d~i~~-~~~~~l~~~~~~--~a~~~~n~ 280 (408) . ..+.-.+.|.++. +++..+++.++. 
.-++||-+ T Consensus 151 ~~Td~~~nPllqDVNkGWlQ~~Re~ap~rV~~~~~~~~~~i~iG~~gdy~nLDalV~D~~~~lI~~~~~~d~~LVvivG~ 230 (337) T protein:vir:79 151 ATTDRQANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVAICGR 230 (337) T ss_pred cCCChhhCcCccccchhHHHHHHhcchhhhhccccccCcceeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEch Confidence 1 0112234566544 344567887765 45788888 Q ss_pred HHHHH--HHhhhcccCcee--eccccccCCcccccccceEeeccccccccccCcceEEEEehhcceEeeeccceEEEEec Q lcl|Aclame:pro 281 SGLNK--LALVKTAEGKYL--LEPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTN 356 (408) Q Consensus 281 ~~~~~--l~~lkd~~G~~~--~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~ 356 (408) +.++. ...+. ..+.|- ...++. -...++-|+|.+.++. +|... +++=-|++.-..+..+..+=.+.+ T Consensus 231 dLladk~~~l~n-~~~~ptE~~Aa~~i-~s~k~iGGlpa~~~Pf--fP~~~-----~lVT~L~NLsIY~Q~gs~RR~~~d 301 (337) T protein:vir:79 231 ELLHDKYFPIVN-ATQAPTERLAADLI-VSQKRIGNLPAVRVPF--FPKRA-----LMVTKLSNLSIYYQEGARRRTLKE 301 (337) T ss_pred hhhhHHhhHHhc-cCCCcHHHHHHHHH-HHhhhhCCceeEEccc--cCCCc-----eEEeechhcEEEEecCcEEEEEEE Confidence 77652 22222 211110 000000 0124789999998653 56533 455555554444444443333222 Q ss_pred c-chhhhhhceeeEEEEeeeCcEEecccceEEEEeeccccC Q lcl|Aclame:pro 357 I-GAGAFETDTTKIRVIDRFDVKATDSEALVAGSFSAIADQ 396 (408) Q Consensus 357 ~-~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~~ 396 (408) . 
..+.++.++ ..--|..|-++..++.+.--...++ T Consensus 302 ~p~r~rie~y~-----s~Ne~YvVEd~~~~a~ienI~~~~a 337 (337) T protein:vir:79 302 VPERDRIENYE-----SSNDAYVVEDFGCGCVAENIELAAA 337 (337) T ss_pred ccccccccchh-----hccceeeeeccccEEEEeceeecCC Confidence 1 122222211 1223556777777777663333333 No 182 >protein:vir:79642 Length: 329 # NCBI annotation: HsbB # Family: family:all:463 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285525;genbank:gi:148734508;genbank:GeneID:5220000 Probab=96.23 E-value=0.00084 Score=37.44 Aligned_cols=290 Identities=10% Similarity=-0.009 Sum_probs=133.3 Q ss_pred cchhhhHHHHHHHHHHHhhcchhhHHHHHHHHh----hccccccCceecch--hhhhhhhhhhhhhhhhhhhhceee-cc Q lcl|Aclame:pro 84 KSENELKDKFVKDFVNMVRNPMAFMNTVSSKTE----TSGSDSAAGLTIPQ--DIRTMINTLVRQYDSLQQYVRVES-VS 156 (408) Q Consensus 84 ~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~a~----~~~t~~~gg~~vP~--~~~~~ii~~~~~~~~l~~~~~~~~-~~ 156 (408) ....-..++. +............+. ....++.|.+++.+ .+.+.|++...+....+.++.+.. ++ T Consensus 1 ~~~~~~~~~~--------~~d~~~~~~~a~~~~~~~~~~~~~~~~~f~~~ql~~id~~v~e~~~~~l~~~~~i~i~~~~~ 72 (329) T protein:vir:79 1 MRGNIMSKEM--------KYDEFEANVIANHMQLRGAKNDASDMGIWTSQELHKIKAQAYEKEYPAGSALRVFPVTSELS 72 (329) T ss_pred Cccchhhhhh--------ccchhhhhhHhhhcccccceeccchhhHHHHHHHHHHHHHHHhhhhcccchhhhcccccCCC Confidence 0000000000 000000000000111 11112223333332 355778877776666666655432 21 Q ss_pred cCccceEEeeccCCccccchhcccc-cccccccccceeeeechheeeeehHHHHHHHhc---chHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 157 TSNGSRVYEKWTDVTPLTVMDAEDG-KIPDLDNPQLTIIKYLIKRYAGIITATNTSLKD---TAENILAWLSSWIAKKVV 232 (408) Q Consensus 157 ~~~g~~~~~~~~~~~~~~~~~~E~~-~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~d---s~~~~~~~v~~~l~~~~~ 232 (408) -..-++.+... +..+.+.|.+.++ .+|. .+..+.......+.++....++..=++. ...++..--....++++. 
T Consensus 73 ~~~~~~t~~~~-~~~G~a~~~~d~~~dip~-vd~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~ 150 (329) T protein:vir:79 73 DTDKTFEYQTF-DKVGHAKIIADYTDDLST-VDALMTSEFGKVFRLGNAFLISIDEIKAGQRTGKSLSTRKANAAQNAHD 150 (329) T ss_pred CceeEEEeeee-ecceeeeeecCcccccce-eecccceeEEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHH Confidence 11123333332 3345567777654 3443 2234556666666667666665433332 234677777888889999 Q ss_pred HHHHHHHhhccccccc------------h--------hhhhhHH----HHHHHHHHhh-h-hhccCCCEEEEcHHHHHHH Q lcl|Aclame:pro 233 VTRNQAIIEVMKAAPK------------K--------PTIAKFD----DVITMINTAV-D-PAIIATSSLLTNQSGLNKL 286 (408) Q Consensus 233 ~~~~~~~~~g~g~~~~------------~--------~~~~~~d----~i~~~~~~~l-~-~~~~~~a~~~~n~~~~~~l 286 (408) +.+|+-+++|++.... . -...+++ ++..++.... . .....+..++++|+.+..| T Consensus 151 ~~~n~i~f~G~~~~g~~GLlN~p~v~~~~~~~~~~~~w~~kt~~ei~~di~~~~~~l~~~s~g~~~p~~L~Lpp~~~~~L 230 (329) T protein:vir:79 151 QLVNHLVFKGSKPHKIISVFEHPNLTTINSAGWNNAAGTGKKPETAQDELEQAIEKIETLTNGQHRANMILIPPSMRKVL 230 (329) T ss_pred HhhccEEEeecccccceeeecCCCccccccCCCCCccccccCHHHHHHHHHHHHHHHHHhcCceecccEEEecHHHHHHh Confidence 9999999998763110 0 0111233 3433332211 1 1222345789999999888 Q ss_pred HhhhcccCceeecccccc-CCcccccccceEeeccccccccccCcceEEEEehhcceEeeeccceEEEEeccchhhhhhc Q lcl|Aclame:pro 287 ALVKTAEGKYLLEPDPTK-PNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETD 365 (408) Q Consensus 287 ~~lkd~~G~~~~~~~~~~-~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~ 365 (408) .......|.-++.- +.. ..+-+|.+.|-. .+....++..+++.+-+.-+..+ .-++.+...+... 
..= T Consensus 231 ~~~~~~~~~tvl~~-lk~~~~~l~I~~~~el------~~ag~~g~~~~v~y~~~~~~~~~-~vp~~~~~l~~q~---~~~ 299 (329) T protein:vir:79 231 MVRMPETTMSYLDY-FKQQNGGITIESISEL------EDIDGAGTKAALVYEKDPMNMSI-EIPEAFNMLTAQP---KDL 299 (329) T ss_pred hcccCCCCccHHHH-HHHhCCCcEEEEcccc------cccCCCCceEEEEEecCCceEEE-ecCcceeeeecee---cCc Confidence 65544445333221 111 111233333322 12223344555554433322222 1222333322211 111 Q ss_pred eeeEEEEeeeC-cEEecccceEEEEeeccc Q lcl|Aclame:pro 366 TTKIRVIDRFD-VKATDSEALVAGSFSAIA 394 (408) Q Consensus 366 ~~~~r~~~r~d-~~v~~~~a~~~l~~~~~~ 394 (408) .+...+..|++ +.+.+|.||+.++.-.+. T Consensus 300 ~~~v~~~~r~~Gv~i~~P~ai~~~dGI~~~ 329 (329) T protein:vir:79 300 HFKVPCTSKCTGLTIYRPLTLVLIKGLVVG 329 (329) T ss_pred eEEEceeeeEEEEEEECcceeeeeeeeeeC Confidence 23334556665 688999999999977666 No 183 >protein:vir:2016 Length: 357 # NCBI annotation: gpN # Family: family:all:201 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046760;genbank:gi:9630331;genbank:GeneID:1261541 Probab=96.18 E-value=0.00091 Score=37.27 Aligned_cols=295 Identities=12% Similarity=0.075 Sum_probs=145.7 Q ss_pred hHHHHHHHHHHHhhcchhhHHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeecccCccceEEeecc Q lcl|Aclame:pro 89 LKDKFVKDFVNMVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWT 168 (408) Q Consensus 89 ~~~~~~~a~~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~ 168 (408) .+..-+..|..++..-.... .+.. .+.+-.+.|-+.....+...+.+.+.+++.++++++....|........ 
T Consensus 1 M~~~tr~~~~~y~~~~A~~n-gv~~------~d~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lg~~ 73 (357) T protein:vir:20 1 MRQETRFKFNAYLSRVAELN-GIDA------GDVSKKFTVEPSVTQTLMNTMQESSDFLTRINIVPVSEMKGEKIGIGVT 73 (357) T ss_pred CChHHHHHHHHHHHHHHHHh-CCCh------HHhcceeecCHHHHHHHHHHHHHHHHHhccCCccccccceeeEEecccC Confidence 23333444444443321110 0000 0123467888899999999999999999999999999888876644221 Q ss_pred CCccccchh--cccccccccccccceeeeechheeeeehHHHHHHHhcch--HHHHHHHHHHHHHHHHHHHHHHHhhccc Q lcl|Aclame:pro 169 DVTPLTVMD--AEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTA--ENILAWLSSWIAKKVVVTRNQAIIEVMK 244 (408) Q Consensus 169 ~~~~~~~~~--~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~--~~~~~~v~~~l~~~~~~~~~~~~~~g~g 244 (408) .+.++-+ +-+......+...++.-.+..++.---..|+.+.|...+ .+|..-+.+.+.++++.=.-.-.++|+. T Consensus 74 --g~iagrtdT~~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts 151 (357) T protein:vir:20 74 --GSIASTTDTAGGTERQPKDFSKLASNKYECDQINFDFYIRYKTLDLWARYQDFQLRIRNAIIKRQSLDFIMAGFNGVK 151 (357) T ss_pred --ccccccccCCCCCCcccccccccCCCccEEEEeeecccccHHHHHHHhcChhHHHHHHHHHHHHHhhccceeccccee Confidence 1111111 111111111112344444555554444566666666532 4777778777777776544344455543 Q ss_pred ccc------------------------------------------------chhhhhhHHHHHH-HHHHhhhhhccC--C Q lcl|Aclame:pro 245 AAP------------------------------------------------KKPTIAKFDDVIT-MINTAVDPAIIA--T 273 (408) Q Consensus 245 ~~~------------------------------------------------~~~~~~~~d~i~~-~~~~~l~~~~~~--~ 273 (408) .+. ..+.-.+.|.++. +++..+++.++. . 
T Consensus 152 ~A~~Td~~~nPllqDVN~GWlQ~~Re~ap~rVm~~~~~~~g~~~~~~i~~G~~gdy~NLDalV~D~~~~lI~~~~~~d~d 231 (357) T protein:vir:20 152 RAETSDRSSNPMLQDVAVGWLQKYRNEAPARVMSKVTDEEGRTTSEVIRVGKGGDYASLDALVMDATNNLIEPWYQEDPD 231 (357) T ss_pred eeccCChhhCcCccccchhHHHHHHhhchhhhhccccccccccccceeeecCCCCcccHHHHHHHHHhccCChHHhcCCC Confidence 211 0112334566654 445567887765 4 Q ss_pred CEEEEcHHHHHH-HHhhhcccCcee--eccccccCCcccccccceEeeccccccccccCcceEEEEehhcceEeeeccce Q lcl|Aclame:pro 274 SSLLTNQSGLNK-LALVKTAEGKYL--LEPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENM 350 (408) Q Consensus 274 a~~~~n~~~~~~-l~~lkd~~G~~~--~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~ 350 (408) -++||-++.++. -..|-+..+.|- ...++. ....++-|+|.+.++. +|... +++=-|++.-..+.++.. T Consensus 232 LVvivG~dLla~k~~~l~n~~~~ptE~~Aa~~i-~s~k~iGGl~a~~~Pf--FP~~~-----ilVT~L~NLsIY~Q~gs~ 303 (357) T protein:vir:20 232 LVVIVGRQLLADKYFPIVNKEQDNSEMLAADVI-ISQKRIGNLPAVRVPY--FPADA-----MLITKLENLSIYYMDDSH 303 (357) T ss_pred EEEEEchhhhhhhhhhHhhccCChHHHHHHHHH-HHhhhhCCceeEEccc--cCCCc-----eEEeeccccEEEEecCcE Confidence 678888877653 122222222211 000000 1124788999998653 56543 455555543333444443 Q ss_pred EEEEec-cchhhhhhceeeEEEEeeeCcEEecccceEEEEeecc---ccCCCCccCCCc Q lcl|Aclame:pro 351 SLLPTN-IGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFSAI---ADQVGNFKTTTS 405 (408) Q Consensus 351 ~i~~~~-~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~---~~~~~~~~~~~~ 405 (408) +=.+.+ ...+.++.++ ..--|..|-++..++.+.-... 
.++.....-++| T Consensus 304 RR~~~d~p~r~riE~y~-----s~Ne~YvVEd~~~~a~iE~i~~~~~~~p~~~~~~~~a 357 (357) T protein:vir:20 304 RRVIEENPKLDRVENYE-----SMNIDYVVEDYAAGCLVEKIKVGDFSTPAKATAEPGA 357 (357) T ss_pred EEEEEeccccccccchh-----hhcceeeeeccccEEEeeeeeeccccCCccCCCCCCC Confidence 333222 1122222211 1123556666666666653333 223333333333 No 184 >protein:vir:107120 Length: 329 # NCBI annotation: conserved phage protein # Family: family:all:701 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950606;genbank:gi:119953686;genbank:GeneID:4643129 Probab=95.99 E-value=0.0012 Score=36.69 Aligned_cols=306 Identities=10% Similarity=-0.058 Sum_probs=116.8 Q ss_pred cccccccccccchhhhHHHHHHHHHHHhhcchhhHHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhh-hhhce Q lcl|Aclame:pro 74 MREEEKGPLNKSENELKDKFVKDFVNMVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQ-QYVRV 152 (408) Q Consensus 74 ~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~-~~~~~ 152 (408) ... ... ...+-+.+-+++..+...- ...-...-+..-+....-..+...+-+.....+--. .+++. T Consensus 1 ~~~-----~~~-------~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~nt~~l~~k~~~~LD~~~~~~~~s~~~~~N~ 67 (329) T protein:vir:10 1 MDG-----IFI-------TGVKTMNKEIKNATGKLKL-NLQHFANKSVEPGDTLLKNKHVGILEKVTAANSYSAPAVISN 67 (329) T ss_pred CCc-----eEE-------echhhhhhhhhcccceeEE-ehhhhcCCccCCchhHHHHHHHHHHHHHHHhhceeeeeeccc Confidence 000 000 0000011111110000000 000000000011112222333333333332221101 11221 Q ss_pred -eecccCccceEEeeccCCccccchhcccccccccccccceeeeechheeeeehHHHHHHHhcchHH--HHHHHHHHHHH Q lcl|Aclame:pro 153 -ESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAEN--ILAWLSSWIAK 229 (408) Q Consensus 153 -~~~~~~~g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~~~--~~~~v~~~l~~ 229 (408) +. ..+..++.||+........+-.+.+-.....+ .++...+++-.+.-.+.-=.-. .+.+... +...+.+.... 
T Consensus 68 ~~e-~~~g~tVkIp~i~~~gl~DY~R~~g~~~g~vt-~~~~t~tidqdR~~~F~VD~~D-~dEtn~~l~a~~i~~~~~~~ 144 (329) T protein:vir:10 68 DAI-FMQGRSFTVIKGDVTELKDYKRNATNEFDHPQ-IQETTYFLDQEKYWGRFVDALD-RRDTEGNIDINYVVAKQASE 144 (329) T ss_pred cee-eccCcEEEEeeecccccccccCCCCccccccc-cceeEEEeecccceeeecchhh-HhhhhhhhhHHHHHHHHHHH Confidence 22 22345677777765544444334433332222 2445566666665555310000 1111111 12223333444 Q ss_pred HHHHHHHHHHhhcc----cc--ccchhhhhhHHHHHHHHHHhhhhh-ccCCCEEEEcHHHHHHHHhhhcccCce-eeccc Q lcl|Aclame:pro 230 KVVVTRNQAIIEVM----KA--APKKPTIAKFDDVITMINTAVDPA-IIATSSLLTNQSGLNKLALVKTAEGKY-LLEPD 301 (408) Q Consensus 230 ~~~~~~~~~~~~g~----g~--~~~~~~~~~~d~i~~~~~~~l~~~-~~~~a~~~~n~~~~~~l~~lkd~~G~~-~~~~~ 301 (408) .+.-.+|...+... ++ ....+....++.+.+++. .|+.. ...+..++++|..+..|.+-..-.... ..... T Consensus 145 ~v~pEiDay~~skla~~a~~~~~~~~t~~nay~~i~~a~~-~Lde~~vp~~Rvl~VtP~~~~~Lk~~~~f~~~~~~~~~~ 223 (329) T protein:vir:10 145 VVAPYLDNLRFATLARNKAKHLTVGSGADAQYDAVLDVSV-ELDEIGAGASRILFVTPKFYKGIKKFVIELPQGDNRQQV 223 (329) T ss_pred HhhhHHHHHHHHHHHhhcccccccccCHHHHHHHHHHHHH-HHHhcCCCCCcEEEeCHHHHHHHHhhhhhhccccccccc Confidence 44444554332211 11 111122233555655553 34443 334557788999998876421000000 11112 Q ss_pred cccCCcccccccceEeeccccccccccCcceEEEEehhcceEeee-ccceEEEEeccchhhhhhceeeEEEEeeeCcEEe Q lcl|Aclame:pro 302 PTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFD-RENMSLLPTNIGAGAFETDTTKIRVIDRFDVKAT 380 (408) Q Consensus 302 ~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~-~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~ 380 (408) ..++..++|.|+||+.+++..+. .-.+++|-.+. ..... ...+++-...+. 
++...++...++|+.|+ T Consensus 224 ~~~g~Vg~idG~~Ii~vps~~~k-----~in~ii~~~~A-~~~~~K~~~~~~~~p~~~-----~~a~~v~gr~yyd~~V~ 292 (329) T protein:vir:10 224 LGKGVQGELDGFTIVKVPSKMLQ-----GVEAMAVIGEV-MASPIQANEAKLNSNVPG-----MFGTLAEQMLYTGAFVP 292 (329) T ss_pred eeeeeeeeecCeEEEEecCCccc-----ceeEEEEcCCc-eeeeeeeeeeeeeCCCCc-----cchheeeeeeeeeeEEE Confidence 33445578999999987654332 23356665442 33222 112232221111 12356778888999999 Q ss_pred cccceEEEEeeccccCCCCccCCCcccC Q lcl|Aclame:pro 381 DSEALVAGSFSAIADQVGNFKTTTSTAV 408 (408) Q Consensus 381 ~~~a~~~l~~~~~~~~~~~~~~~~~~~~ 408 (408) +|++..++.....++....... -+.++ T Consensus 293 ~~k~~~I~~~~~~a~~~~~~~~-~~~~~ 319 (329) T protein:vir:10 293 EHLQKYIFTIGGKEVETNRDGV-DAHAD 319 (329) T ss_pred ccccCEEEEecccCcccCCCCC-Ccccc Confidence 9986665542222211111000 01111 No 185 >protein:vir:108303 Length: 418 # NCBI annotation: hypothetical protein # Family: family:all:1412 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552282;genbank:gi:160700607;genbank:GeneID:5758819 Probab=95.97 E-value=0.0012 Score=36.62 Aligned_cols=270 Identities=9% Similarity=-0.036 Sum_probs=116.9 Q ss_pred cccccCceecchhhhhhhhhhhhhhhhhhhhhceee---cccCccceEEeeccCCccccchhccccccccccccccee-- Q lcl|Aclame:pro 119 GSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVES---VSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTI-- 193 (408) Q Consensus 119 ~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~---~~~~~g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~-- 193 (408) -..-++.++-|+.|..++++.+++.+++.+++..-. +......+.++...... 
+.++..+.- +..+=.+ T Consensus 1 m~~~~N~~ltp~iia~~~l~~l~~~lV~~~lv~r~y~~e~~~~GDTV~I~vp~~~~-----v~dg~~~~~-~~~te~~v~ 74 (418) T protein:vir:10 1 MAVQDNNLLTDDVIAKEALRLLKNNLVMAKCVYRNYEKTFGKVGDTIRLKLPYRVK-----SASGRTLVK-QPMVDQTIP 74 (418) T ss_pred CCccccccccHHHHHHHHHHHHHHhccchhhhcCCCchHHhhCCCEEEEeeCCcee-----ecccCCccc-cccccceEE Confidence 122233466799999999999999999887765422 11111244444322211 223333321 2222234 Q ss_pred eeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhcccc-----ccchhhhhhHHHHHHHHHHhhhh Q lcl|Aclame:pro 194 IKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVMKA-----APKKPTIAKFDDVITMINTAVDP 268 (408) Q Consensus 194 v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~~g~g~-----~~~~~~~~~~d~i~~~~~~~l~~ 268 (408) ++++-+|...+ .|+.+=...+..+|...+.+...++++..+|..++.-... +++......+++++++- ..|+. T Consensus 75 l~id~~k~~~~-~itD~e~a~~~~d~~~~~l~~A~~aLA~~vD~~ia~l~~~a~~~~gt~gt~~~~~~~i~~a~-~~Ld~ 152 (418) T protein:vir:10 75 FKIAYQEHVGL-EYTVKDKTLDIMQFSERYLKSGMVQIANQIDRSLALTLKKAFHSSGTPGVRPGAFIDFANAG-AKQTT 152 (418) T ss_pred EEEecccccce-eechHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccCCcCcchHHHHHHHH-HHHHh Confidence 44444444444 3444333344567877777888889999998877643211 12222334577777653 45666 Q ss_pred hccCC---CEEEEcHHHHHHHHhhhcccCceeecc-----ccccCCcccccccceEeeccccccccccC---cceEEEEe Q lcl|Aclame:pro 269 AIIAT---SSLLTNQSGLNKLALVKTAEGKYLLEP-----DPTKPNSYLIKGKQVIVVADRWLPNTGST---VYPLYYGD 337 (408) Q Consensus 269 ~~~~~---a~~~~n~~~~~~l~~lkd~~G~~~~~~-----~~~~~~~~~l~G~pv~~~~~~~~~~~~~~---~~~~~~gd 337 (408) ..-+. -..+++|..+..|. ++... .+.. 
.+.++.-++|.|+.|+.+.+ +|....+ ....+.|- T Consensus 153 ~~VP~~G~R~lVv~P~~~~~L~--~~~~~--~~~~~~~~~~lr~G~IG~i~GF~V~~S~n--ip~~tag~~~~t~~v~ga 226 (418) T protein:vir:10 153 YAVPQDGMRHAVLDPFTCASLS--DEVTK--LFKESMVEQAYKMGYRGNVAAYEVYESQN--LPKHTVGDHGGTPLVNGT 226 (418) T ss_pred cCCCCCCceEEEeCHHHHHHHh--hhccc--cccccccchhhheeeeeeeeceEEEEecC--CCcccccccccceeeecc Confidence 65552 34578999887664 33221 2221 23445557899999988654 4432111 11223332 Q ss_pred hhcceEeeeccceEEEEeccc--hhhhhhceeeEEEEe---eeCcE-EecccceEE-------------EEeeccc---- Q lcl|Aclame:pro 338 MSQAITLFDRENMSLLPTNIG--AGAFETDTTKIRVID---RFDVK-ATDSEALVA-------------GSFSAIA---- 394 (408) Q Consensus 338 ~~~~~~~~~~~~~~i~~~~~~--~~~f~~~~~~~r~~~---r~d~~-v~~~~a~~~-------------l~~~~~~---- 394 (408) ...+... .+...... +..-.=+...|-+.. ++... ..++.=|++ +++.+.- T Consensus 227 ~~~~~~~------~~~~~t~s~~g~l~~Gd~~ti~gv~~v~~~t~~~~~~~~~f~V~~~~~~~~~~~~tv~i~p~~~~~~ 300 (418) T protein:vir:10 227 VVNGDTV------GFDGGTASTTGFLKAGDVITFGGVFGVNPQNYETTGLLQEFVVLEDVDTDAGGAGSIKISPSLNDGT 300 (418) T ss_pred cccceeE------EEeecceeeccceeeccEEEECceeecccccccccccceEEEEEeeccccccCcceeEecccccccc Confidence 2211111 11000000 000000111111100 00000 001111111 1111100 Q ss_pred ------------cCCCCcc---CCCcccC Q lcl|Aclame:pro 395 ------------DQVGNFK---TTTSTAV 408 (408) Q Consensus 395 ------------~~~~~~~---~~~~~~~ 408 (408) ..+..++ .....++ T Consensus 301 ~~~~~~~~~~~~~~~~~~v~a~~a~~~~i 329 (418) T protein:vir:10 301 ATINNENGDPVSLTAYQNVTALPADNAPI 329 (418) T ss_pred ccccccccccccccCCCcccccccCccee Confidence 0000000 0111111 No 186 >protein:vir:5255 Length: 304 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852760;genbank:gi:31544035;uniprot:Q7Y5U0;genbank:GeneID:2753552 Probab=95.96 E-value=0.00057 Score=38.36 Aligned_cols=261 Identities=15% Similarity=0.063 Sum_probs=124.3 Q ss_pred cccCceecch--hhhhhhhhhhhhhhhhhhhhceeecccCc-cceEEeeccCCccccc--hhcccc-cccccccccceee Q lcl|Aclame:pro 121 
DSAAGLTIPQ--DIRTMINTLVRQYDSLQQYVRVESVSTSN-GSRVYEKWTDVTPLTV--MDAEDG-KIPDLDNPQLTII 194 (408) Q Consensus 121 ~~~gg~~vP~--~~~~~ii~~~~~~~~l~~~~~~~~~~~~~-g~~~~~~~~~~~~~~~--~~~E~~-~~~~~~~~~f~~v 194 (408) -++.++++.+ .+..+|.+...+.-..+.++.+.+..... -++.+.. .+..+.+. |.+.++ ++|-. +..+++- T Consensus 1 ~~~lafl~~qL~~id~~vye~~~~~~~~~~lipv~t~~~~~~~~~~~~~-~d~~G~a~~~~i~~~a~dip~v-d~~~~~~ 78 (304) T protein:vir:52 1 MSLLAYVKNGLTAVSKDIAETKYPEIVFPQFVYVDQQTAVGITEKLHYG-ADEHGSLDDGLITVGTSTLDQV-EVGFTPT 78 (304) T ss_pred CchHHHHHHHHHHHhhhhhccccccchhhhhccccCCCCcccceEEEee-eeccCcccccccCCcCCcccee-eccccee Confidence 2333455443 13344555444433333333332211110 1222222 22233333 766553 44533 3466777 Q ss_pred eechheeeeehHHHHHHHhcc---hHHHHHHHHHHHHHHHHHHHHHHHhhcccc--c------cc----------hh--- Q lcl|Aclame:pro 195 KYLIKRYAGIITATNTSLKDT---AENILAWLSSWIAKKVVVTRNQAIIEVMKA--A------PK----------KP--- 250 (408) Q Consensus 195 ~~~~~~~~~~~~iS~ell~ds---~~~~~~~v~~~l~~~~~~~~~~~~~~g~g~--~------~~----------~~--- 250 (408) ..+.+.++.-..+|..=|+.+ ..++.+--.....+++...+|+-.+.|+.+ + .| .. T Consensus 79 ~~~i~~~~~~~~y~~~El~~a~~~g~~l~~~ka~aa~~a~~~~~n~v~~~Gd~~~~g~~GllN~p~v~~~~~~~~~a~~~ 158 (304) T protein:vir:52 79 RSYIVPWAKSVTWTKPELEQGKLLGLALNTAKIMALNKNAQQTLQKVAFLGHAKDSRLTGLLNNKSVEVYAIKGAAQNTK 158 (304) T ss_pred EEEEEEEeeeeeecHHHHHHHHHhCCCcHHHHHHHHHHHHHhhhceEEEEeeccccceEEEEeCCCcceeeecCCccCCc Confidence 777777776666554333322 235555556667777888888888888532 1 00 00 Q ss_pred -hhhhHHHHHHHHHHhhhhh------ccCCCEEEEcHHHHHHHHhhhccc-CceeeccccccCCcccccccceEeec--c Q lcl|Aclame:pro 251 -TIAKFDDVITMINTAVDPA------IIATSSLLTNQSGLNKLALVKTAE-GKYLLEPDPTKPNSYLIKGKQVIVVA--D 320 (408) Q Consensus 251 -~~~~~d~i~~~~~~~l~~~------~~~~a~~~~n~~~~~~l~~lkd~~-G~~~~~~~~~~~~~~~l~G~pv~~~~--~ 320 (408) ...+++.|++.++..+... ......++++|+.+..|....-++ |.-++. -+....+ -..|.|+-+.. . 
T Consensus 159 w~~~T~~eI~~di~~~~~~i~~~s~~~~~p~tl~Lpp~~~~~l~~~~~~~~~~Tvl~-~l~~n~~-~~~g~~l~I~~v~~ 236 (304) T protein:vir:52 159 VQAMDFDKAVAFFKEIFLKGMEKTKRIEAPNTFAIDSLDLAHLALVQRANTDTTALE-FLTKHLS-AAAGRQVAIKALPS 236 (304) T ss_pred cccCCHHHHHHHHHHHHHHHHhccCceecCceEEeCHHHHHHHhhccCCCCCchHHH-HHHHhcc-cccCCcceEEEecc Confidence 1124566665554433222 123446899999999886542222 221221 0111111 12355543321 1 Q ss_pred ccccccccCcceEEEEehhcceEeeeccceEEEEeccchhhhhhce--eeEEEEeeeCc-EEecccceEEEEe Q lcl|Aclame:pro 321 RWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDT--TKIRVIDRFDV-KATDSEALVAGSF 390 (408) Q Consensus 321 ~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~--~~~r~~~r~d~-~v~~~~a~~~l~~ 390 (408) .......++++.+++.+-+.=++.+ .-++.+.+.+.. .++. +..=++.|++| .+..|.|++.+++ T Consensus 237 ~~~~~g~~g~~r~vvY~~d~~~~~~-~vP~p~~~l~~q----~~~~~~~~vp~~~r~gGv~v~~P~a~~y~D~ 304 (304) T protein:vir:52 237 NYGTRVTDGKTRAMVYVNSKEHVIF-DVPMSPTVLDAQ----PKGLLAFESGLRMAFGGVTFMEPDSALYVDY 304 (304) T ss_pred cccccCCCCceEEEEEecChhheEE-ecCccccccchh----hcCCceEEecceeeeeeEEEEccceeeeecC Confidence 1122333455666666554433333 234444443322 2332 22335666665 8889999999999 No 187 >protein:vir:79157 Length: 339 # NCBI annotation: P2 family phage major capsid protein # Family: family:all:201 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165257;genbank:gi:145708082;genbank:GeneID:5247168 Probab=95.77 E-value=0.0015 Score=36.09 Aligned_cols=284 Identities=11% Similarity=0.100 Sum_probs=145.6 Q ss_pred hHHHHHHHHHHHhhcchhhHHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeecccCccceEEeecc Q lcl|Aclame:pro 89 LKDKFVKDFVNMVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWT 168 (408) Q Consensus 89 ~~~~~~~a~~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~ 168 (408) .+...+..|..|+..-... ....+.+-.+.|-+.....+...+.+.+.+++.++++++....|........ 
T Consensus 1 M~~~tr~~~~~y~~~~A~~---------ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~lg~~ 71 (339) T protein:vir:79 1 MRNDTRRLFAAYKAAIAKL---------NGVERVDEKFSVAPSVQQKLETKVQESSDFLKSINFYGVPEQEGEKIGLGVS 71 (339) T ss_pred CChHHHHHHHHHHHHHHHH---------hCcccccceeeecHHHHHHHHHHHHHHHHHhccCcccccccceeeEEeeccC Confidence 3333444455554432111 1112334568899999999999999999999999999999888876644221 Q ss_pred CCccccchhc-ccccccccccccceeeeechheeeeehHHHHHHHhcc--hHHHHHHHHHHHHHHHHHHHHHHHhhcccc Q lcl|Aclame:pro 169 DVTPLTVMDA-EDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDT--AENILAWLSSWIAKKVVVTRNQAIIEVMKA 245 (408) Q Consensus 169 ~~~~~~~~~~-E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds--~~~~~~~v~~~l~~~~~~~~~~~~~~g~g~ 245 (408) .+.++-+. .+......+...++.-.+..++.---..|+.+.|... ..+|..-+.+.+.++++.=.-.-.++|+.. T Consensus 72 --g~iagrtdt~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~ 149 (339) T protein:vir:79 72 --GPVASTTDTTQQDRETSDISTMDGRRYRCEQTNSDTHITYQKLDAWAKFADFQTRIRDAIIKRQALDRIMIGFNGVSR 149 (339) T ss_pred --cceeecccCCCCCcccccccccCCCccEEEEeeeeceecHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceee Confidence 11111110 0111111111234444444444444455666666643 347888888888887765443334455432 Q ss_pred cc--------------------------------------------chhhhhhHHHHHHH-HHHhhhhhccC--CCEEEE Q lcl|Aclame:pro 246 AP--------------------------------------------KKPTIAKFDDVITM-INTAVDPAIIA--TSSLLT 278 (408) Q Consensus 246 ~~--------------------------------------------~~~~~~~~d~i~~~-~~~~l~~~~~~--~a~~~~ 278 (408) +. ..+.-.+.|.++.. ++..+++.++. 
.-++|| T Consensus 150 A~~Td~~~nPllqDVN~GWlQ~~Re~ap~rV~~~g~~~s~~i~~~G~ggdy~NLDalV~d~~~~lId~~~~~d~dLVviv 229 (339) T protein:vir:79 150 AATSDRVANPMLQDVNKGWLQNLREQAPQRVMKEGKAAAGKITVGGAGADYGNLDALVYDITNHLVEPWYAEDPDLVVVC 229 (339) T ss_pred ecCCChhhCcCccccchhHHHHHHhhhhhhhhccceeccceeEeccCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEE Confidence 11 01123445666544 44567888874 457888 Q ss_pred cHHHHHH--HHhhhcccCcee--eccccccCCcccccccceEeeccccccccccCcceEEEEehhcceEeeeccceEEEE Q lcl|Aclame:pro 279 NQSGLNK--LALVKTAEGKYL--LEPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLP 354 (408) Q Consensus 279 n~~~~~~--l~~lkd~~G~~~--~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~ 354 (408) -++.++. ...+ +....|- ...++. -...++-|+|.+.++. +|... +++=-|++.-..+..+..+=.+ T Consensus 230 G~dLla~k~~~l~-n~~~~ptE~~Aa~~i-~s~k~iGGl~a~~~Pf--FP~~~-----llVT~L~NLsIY~Q~gs~RR~~ 300 (339) T protein:vir:79 230 GRNLLSDKYFPLV-NRDRDPVQQIAADLI-ISQKRIGNLPAIRVPY--FPANG-----LLVTRLDNLSIYYQEGGRRRTI 300 (339) T ss_pred chhhhhhHhhhHh-hcCCChHHHHHHHHH-HHhhhhCCceeEEccc--cCCCc-----eEEeechhcEEEEecCcEEEEE Confidence 8877652 2333 2211110 000000 1124788999998653 56543 4555555543434444443332 Q ss_pred ecc-chhhhhhceeeEEEEeeeCcEEecccceEEEEeeccccCC Q lcl|Aclame:pro 355 TNI-GAGAFETDTTKIRVIDRFDVKATDSEALVAGSFSAIADQV 397 (408) Q Consensus 355 ~~~-~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~ 397 (408) .+. 
..+.++.++ ..--|..|-++..++.+.-...++.+ T Consensus 301 ~d~p~r~rie~y~-----s~Ne~YvVEd~~~~a~iEni~~~~aa 339 (339) T protein:vir:79 301 LDNAKRDRIENYE-----SSNDAYVIEDLACAAMAENIALAAAA 339 (339) T ss_pred Eeccccccccchh-----hccceeeeeccccEEEeeeeecccCC Confidence 221 122222211 12235567777777776644444433 No 188 >protein:vir:100331 Length: 342 # NCBI annotation: major capsid protein N # Family: family:all:201 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655472;genbank:gi:109289940;genbank:GeneID:4157374 Probab=95.71 E-value=0.0016 Score=35.94 Aligned_cols=283 Identities=12% Similarity=0.087 Sum_probs=146.2 Q ss_pred hHHHHHHHHHHHhhcchhhHHHHHHHHhhccc-----cccCceecchhhhhhhhhhhhhhhhhhhhhceeecccCccceE Q lcl|Aclame:pro 89 LKDKFVKDFVNMVRNPMAFMNTVSSKTETSGS-----DSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRV 163 (408) Q Consensus 89 ~~~~~~~a~~~~~~~~~~~~~~~~~~a~~~~t-----~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~~~ 163 (408) -+.+-+..|.+++..-. ...+. +.+-.+.|-+.....+...+.+.+.+++.++++++....|... T Consensus 1 M~~~tr~~~~~y~~~~A----------~~ngv~~~~~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i 70 (342) T protein:vir:10 1 MKDLTLEKYNAYLARQA----------ELNNLPFNALATGIKFTVQPSVQQKLYEKVRESSDFLKSISFVFVDEQTGETL 70 (342) T ss_pred CChHHHHHHHHHHHHHH----------HHhCCChhHccccceeecChHHHHHHHHHHHHHHHHhccCcccccccceeeEE Confidence 22233334444433211 11111 1222488889999999999999999999999999998888766 Q ss_pred EeeccCCccccchhc-c-cccccccccccceeeeechheeeeehHHHHHHHhcc--hHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 164 YEKWTDVTPLTVMDA-E-DGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDT--AENILAWLSSWIAKKVVVTRNQAI 239 (408) Q Consensus 164 ~~~~~~~~~~~~~~~-E-~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds--~~~~~~~v~~~l~~~~~~~~~~~~ 239 (408) ..... .+.++-+. . .......+...++.-.+..++.---..|+.+.|... ..+|..-+.+.+.++++.=.-.-. 
T Consensus 71 ~lg~~--g~iagrtdT~~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IG 148 (342) T protein:vir:10 71 GLDSA--HTVASTTDTSGDGERKTTSIAKLVKQTYHCQQINFDTHINYKQLDMWAKFPDFQQKVANVAAKQRKRDLIMIG 148 (342) T ss_pred ecccC--cccccccccCCCCCcccccccccCCCccEEEEeeecccccHHHHHHHhcChhHHHHHHHHHHHHHhhccceec Confidence 44221 12211110 0 011111122234444455555544456666666653 357888888888887765444444 Q ss_pred hhcccccc------------------------------------------chhhhhhHHHHHH-HHHHhhhhhccC--CC Q lcl|Aclame:pro 240 IEVMKAAP------------------------------------------KKPTIAKFDDVIT-MINTAVDPAIIA--TS 274 (408) Q Consensus 240 ~~g~g~~~------------------------------------------~~~~~~~~d~i~~-~~~~~l~~~~~~--~a 274 (408) ++|+..+. ..+.-.+.|.++. +++..+++.++. .- T Consensus 149 fNGts~A~~Td~~~nPllqDVN~GWlQ~~Re~ap~rv~~~~~~~~~i~iG~~gdy~NLDalV~D~~~~lI~~~~~~d~dL 228 (342) T protein:vir:10 149 FNGTSRAATSDRNSNPLLQDVAKGWLQKMREDAKERVMNGESTDNQVLVGKGQEYANLDALVMDATEELIDEWHRDDTDL 228 (342) T ss_pred ccceeeccCCChhhCcCccccchHHHHHHHhhhhhhhcccceeccceeecCCCCcccHHHHHHHHHhccCChHHhcCCCE Confidence 55544221 1122334566654 445567887764 46 Q ss_pred EEEEcHHHHHH--HHhhhcccCcee--eccccccCCcccccccceEeeccccccccccCcceEEEEehhcceEeeeccce Q lcl|Aclame:pro 275 SLLTNQSGLNK--LALVKTAEGKYL--LEPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENM 350 (408) Q Consensus 275 ~~~~n~~~~~~--l~~lkd~~G~~~--~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~ 350 (408) ++||-++.++. +..+...+ .|- ...++. ....++-|+|.+.++. +|... +++=-|++.-..+..+.. 
T Consensus 229 VvivG~dLladk~~~l~n~~~-~ptE~~Aa~~i-~s~k~iGGl~a~~~Pf--FP~~~-----ilVT~L~NLsIY~Q~gs~ 299 (342) T protein:vir:10 229 VVITGRKLLADKYFPIVNQQN-APTEELAADIV-ISQKRIGGLKAVRVPF--FPANA-----ILITKLENLAIYVQEGTT 299 (342) T ss_pred EEEEchhhhHHHHHHHHhcCC-ChHHHHHHHHH-HhhhhhcCceeEEccc--cCCCc-----eEEeeccccEEEEecCcE Confidence 78888877652 22222111 110 000000 1124788999998653 56543 455555554333444443 Q ss_pred EEEEecc-chhhhhhceeeEEEEeeeCcEEecccceEEEEeeccccCC Q lcl|Aclame:pro 351 SLLPTNI-GAGAFETDTTKIRVIDRFDVKATDSEALVAGSFSAIADQV 397 (408) Q Consensus 351 ~i~~~~~-~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~ 397 (408) +=.+.+. ..+.++.+ -..--|..|-++..++.+.-...+++- T Consensus 300 RR~~~d~p~r~rie~y-----~s~Ne~YvVEd~~~~a~iE~i~i~~~~ 342 (342) T protein:vir:10 300 RKHIENVPKKDRIETY-----ESENIDYVVEDYGCAALIENITLKDKE 342 (342) T ss_pred EEEEEeccccccccch-----hhhccceeeeccccEEEeecceecCCC Confidence 3332221 12222221 112235677777777777755555444 No 189 >protein:vir:78186 Length: 337 # NCBI annotation: gp2, phage major capsid protein, P2 family # Family: family:all:201 # MgeID: mge:1848 # MgeName: phiE12-2 # Cross-refs: genbank:acc:YP_001111152;genbank:gi:134288735;genbank:GeneID:4960646 Probab=95.69 E-value=0.0016 Score=35.88 Aligned_cols=282 Identities=12% Similarity=0.122 Sum_probs=144.0 Q ss_pred hHHHHHHHHHHHhhcchhhHHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeecccCccceEEeecc Q lcl|Aclame:pro 89 LKDKFVKDFVNMVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWT 168 (408) Q Consensus 89 ~~~~~~~a~~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~ 168 (408) .+...+..|.+++..-.... ...+.+-.+.|-+.....+...+.+.+.+++.++++++....|........ 
T Consensus 1 M~~~tr~~~~~y~~~~A~~n---------gv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~lg~~ 71 (337) T protein:vir:78 1 MRKETRQAYEKYAAQIAKLN---------DTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVS 71 (337) T ss_pred CChHHHHHHHHHHHHHHHhc---------ChhhhcceeecChHHHHHHHHHHHHHHHHhccCCccccccceeeEEecccC Confidence 23334444554443321111 011233468888999999999999999999999999998888876543221 Q ss_pred CCccccch--hcccccccccccccceeeeechheeeeehHHHHHHHhcc--hHHHHHHHHHHHHHHHHHHHHHHHhhccc Q lcl|Aclame:pro 169 DVTPLTVM--DAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDT--AENILAWLSSWIAKKVVVTRNQAIIEVMK 244 (408) Q Consensus 169 ~~~~~~~~--~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds--~~~~~~~v~~~l~~~~~~~~~~~~~~g~g 244 (408) + +.++- .+-+.-.| .+...++.-.+..++.---..|+.+.|... ..+|..-+.+.+.++++.=.-.-.++|+. T Consensus 72 g--~iagrtdt~~~~R~~-~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts 148 (337) T protein:vir:78 72 G--PIASRTDTTKAARQP-IDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVK 148 (337) T ss_pred c--ceeeeecCCCccccc-ccccccCCCccEEEEeceecccCHHHHHHHhcChhHHHHHHHHHHHHHhhccceeccccee Confidence 1 11111 11111111 111233444444444444455666666643 35788888888888776544444455543 Q ss_pred ccc-------------------------------------------chhhhhhHHHHHHH-HHHhhhhhccC--CCEEEE Q lcl|Aclame:pro 245 AAP-------------------------------------------KKPTIAKFDDVITM-INTAVDPAIIA--TSSLLT 278 (408) Q Consensus 245 ~~~-------------------------------------------~~~~~~~~d~i~~~-~~~~l~~~~~~--~a~~~~ 278 (408) .+. ..+.-.+.|.++.. ++..+++.++. 
.-++|| T Consensus 149 ~A~~Td~~~nPllqDVN~GWlQ~~Re~ap~rVl~~~~~~~~~i~iG~~gdy~NLDalV~d~~~~lI~~~~~~d~dLVviv 228 (337) T protein:vir:78 149 AAATTDRQANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLIGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVIC 228 (337) T ss_pred eccCCChhhCcCccccchHHHHHHHhcchhhhhccccccCCceeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEE Confidence 221 11123345666544 44557887764 467888 Q ss_pred cHHHHHH--HHhhhcccCcee--eccccccCCcccccccceEeeccccccccccCcceEEEEehhcceEeeeccceEEEE Q lcl|Aclame:pro 279 NQSGLNK--LALVKTAEGKYL--LEPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLP 354 (408) Q Consensus 279 n~~~~~~--l~~lkd~~G~~~--~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~ 354 (408) .++.++. ...+. ..+.|- ...++. -...++-|+|.+.++. +|... +++=-|++.-..+..+..+=.+ T Consensus 229 G~dLladk~~~l~n-~~~~ptE~~Aa~~i-~s~k~iGGl~a~~~Pf--FP~~~-----ilVT~L~NLsIY~Q~gs~RR~~ 299 (337) T protein:vir:78 229 GRELLHDKYFPIVN-ATQAPTERLAADLI-VSQKRIGNLPAVRVPF--FPKRA-----LMVTKLSNLSIYYQEGARRRTL 299 (337) T ss_pred chhhhHHHHHHHHh-cCCCcHHHHHHHHH-HHhhhhcCcceEEccc--cCCCc-----eEEeechhcEEEEecCcEEEEE Confidence 8877653 22222 211110 000000 1124788999998653 56543 4555555543434444443332 Q ss_pred ecc-chhhhhhceeeEEEEeeeCcEEecccceEEEEeeccccC Q lcl|Aclame:pro 355 TNI-GAGAFETDTTKIRVIDRFDVKATDSEALVAGSFSAIADQ 396 (408) Q Consensus 355 ~~~-~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~~ 396 (408) .+. 
..+.++.++ ..--|..|-++..++.+.--...++ T Consensus 300 ~d~p~r~rie~y~-----s~Ne~YvVEd~~~~a~iEnI~~~~a 337 (337) T protein:vir:78 300 KEVPERDRIENYE-----SSNDAYVVEDFGCGCVAENIELAAA 337 (337) T ss_pred Eeccccccccchh-----hccceeeeeccccEEEEeceeecCC Confidence 221 122222211 1223556777777777663333333 No 190 >protein:vir:6212 Length: 434 # NCBI annotation: prohead protease # Family: family:all:21 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852592;genbank:gi:31415852;genbank:GeneID:1489210 Probab=93.85 E-value=0.0062 Score=32.70 Aligned_cols=377 Identities=8% Similarity=0.012 Sum_probs=93.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccc---cccc Q lcl|Aclame:pro 5 LTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREE---EKGP 81 (408) Q Consensus 5 ~~i~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~ 81 (408) |+|+|+.+++.+..++ ....+.... ..++ -..+++.+++.++++++.++..+........... .... T Consensus 1 M~l~el~~~~~~~~~~-------~~a~l~~~~-~~~~--~~~ee~~~~~~e~~~l~~~~~~l~~~i~~le~~~~~~~~~~ 70 (434) T protein:vir:62 1 MNLKEILNASLTRTKS-------RLAELQGKV-EKNE--VRSEELAAVKAEVEQLTKEIQTISEELAKLEEKEKEEDPAK 70 (434) T ss_pred CCHHHHHHHHHHHHHH-------HHHHHHHHH-hccC--ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 9999988776664322 112222221 1111 1234566677788888887777655432211111 1110 Q ss_pred cccchhhh-------H---------HHHHHHHHHHhhcchh------hHHHHHHHHhh----cccccc--Cceecchhhh Q lcl|Aclame:pro 82 LNKSENEL-------K---------DKFVKDFVNMVRNPMA------FMNTVSSKTET----SGSDSA--AGLTIPQDIR 133 (408) Q Consensus 82 ~~~~~~~~-------~---------~~~~~a~~~~~~~~~~------~~~~~~~~a~~----~~t~~~--gg~~vP~~~~ 133 (408) ........ . .+.+.++...+..+.. .......++.. ...... -...+.+.-. 
T Consensus 71 ~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~e~r~a~~~~l~~~~~~~e~~a~~~~t~~G 150 (434) T protein:vir:62 71 KKDDDPEKKEDPTAKENPNEKTELSEEQRSAISASIAAALSTKGHRTNKETEIRSVFANYIVGNIDEKEARALGLVTGNG 150 (434) T ss_pred hhcchhhhhcchhhhcchhhhHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHHHHHHhccccchhhhhhhccccccc Confidence 10000000 0 0011111111111100 00000111100 000000 0000111100 Q ss_pred hhhhhhhhhhhhhhhhhceee-cccCccceEEeec-------cCCccccchhcccccccccccccceeeeechheeeeeh Q lcl|Aclame:pro 134 TMINTLVRQYDSLQQYVRVES-VSTSNGSRVYEKW-------TDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGII 205 (408) Q Consensus 134 ~~ii~~~~~~~~l~~~~~~~~-~~~~~g~~~~~~~-------~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~ 205 (408) ..++..-.... +..+.+... +... +.. ++.. ....+.+.|..++++.. . .+....+|...++...- T Consensus 151 G~lvP~~~~~~-Ii~~l~~~~~i~~~-~~~-~~~~~~~~~p~~~~~~~a~~~~~~~e~~--~-~~~~~~~f~~v~~~~~k 224 (434) T protein:vir:62 151 SVTIPDFLSKE-IITYAQEENFLRRL-GTG-VKTKENIKYPVLVKKAEAQGHKNERTNN--E-MPETDIEFDEIELSPTE 224 (434) T ss_pred ceecchhhHHH-HHHhhhhhhhhhhh-cce-eccCCceEEEEEecCCcccceecccccc--c-ccccccceeeEEeehee Confidence 01111110011 111111111 1100 111 1111 11122223333322211 1 12233333333333221 Q ss_pred HHHHHHHhcchHHHHH-HHHHHHHHHHHHHHHHHHhhccccccchhhhhhHHHHHHHHHHhhhhhccCCCEEEEcHHHHH Q lcl|Aclame:pro 206 TATNTSLKDTAENILA-WLSSWIAKKVVVTRNQAIIEVMKAAPKKPTIAKFDDVITMINTAVDPAIIATSSLLTNQSGLN 284 (408) Q Consensus 206 ~iS~ell~ds~~~~~~-~v~~~l~~~~~~~~~~~~~~g~g~~~~~~~~~~~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~ 284 (408) --..--+.+....-.. -+...|...++.++...+-...=.|....... ..++.. ... ......+ ..-..... 
T Consensus 225 ~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~~~--~g~~~~--~~~-~~~~~~~--~~~d~l~~ 297 (434) T protein:vir:62 225 FDALATVTKKLLARTGLPIEQIVMDELKKAYVRKETQYMVNGDEANNIN--DGALAK--KAV-EFKTDEK--NLYDALVK 297 (434) T ss_pred eEeehhhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccc--cceeec--ccc-ccccccc--chhhHHHH Confidence 1111111111111111 24455555555555555443333333222211 111100 000 0000000 11111122 Q ss_pred HHHhhhcc---cCceeeccccccC--CcccccccceEeecccc---ccccccCcceEEEEehhc--------ceEeeecc Q lcl|Aclame:pro 285 KLALVKTA---EGKYLLEPDPTKP--NSYLIKGKQVIVVADRW---LPNTGSTVYPLYYGDMSQ--------AITLFDRE 348 (408) Q Consensus 285 ~l~~lkd~---~G~~~~~~~~~~~--~~~~l~G~pv~~~~~~~---~~~~~~~~~~~~~gd~~~--------~~~~~~~~ 348 (408) .+..+... ++.+++.+..... .-..=+|.|++..+... .|...-| .++++-+.-. .+.+.+++ T Consensus 298 l~~~l~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~g~~~tl~G-~pV~~~~~~~~~~~~~~~~i~~Gdfs 376 (434) T protein:vir:62 298 MKNTPVKEVRKKARWVLNTAALTKIETMKTDDGFPLLRPFNQAEGGIGYTLLG-FPVEEEDAIDIPDSPDTPVFYFGDFS 376 (434) T ss_pred HHhhcchhhhcCCEEEEcHHHHHHHHHhhccCCCEeeccCCCccCCCCceecc-eeeEEecCccCccCCCceEEEEeecc Confidence 22233322 3333433211000 00011466665321100 0111111 1122222100 01111222 Q ss_pred ceEEEEeccchhhhhhceeeEEEEeeeCcEEecccceEEEEeeccccCCCCcc-CCCccc Q lcl|Aclame:pro 349 NMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFSAIADQVGNFK-TTTSTA 407 (408) Q Consensus 349 ~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~~~~-~~~~~~ 407 (408) .+.|- +....-.+......+....++++.+..--. 
.++-+.+.+++.-..+ ..++.+ T Consensus 377 ~~~i~-~~~g~~~i~~~~~~~~~~~~v~~~~~~r~D-gk~i~~~~~~~~~~~~~~~~~~~ 434 (434) T protein:vir:62 377 KFYIQ-DVIGSLEVQKLVELFSRTNRVGFRIWNLLD-AQLIHSPFEVPVYKYVLKAPTGA 434 (434) T ss_pred ceEEE-EeeceeEEEeehhhhcccCceEEEEEeeec-ceeecCcccceEEEEEeccCCCC Confidence 11110 000000000000000001111111111000 0111223444333222 222222 No 191 >protein:vir:94989 Length: 349 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224029;genbank:gi:62327316;genbank:GeneID:5176817 Probab=93.74 E-value=0.0065 Score=32.56 Aligned_cols=276 Identities=8% Similarity=-0.033 Sum_probs=109.4 Q ss_pred hhccccccCceecch--hhhhhhhhhhhhhhhhhhhhceee----------cccCccceEEeeccCCcccc-chhccc-c Q lcl|Aclame:pro 116 ETSGSDSAAGLTIPQ--DIRTMINTLVRQYDSLQQYVRVES----------VSTSNGSRVYEKWTDVTPLT-VMDAED-G 181 (408) Q Consensus 116 ~~~~t~~~gg~~vP~--~~~~~ii~~~~~~~~l~~~~~~~~----------~~~~~g~~~~~~~~~~~~~~-~~~~E~-~ 181 (408) |..+.-++ ..||. .|...+.+...+.+.|.+- ..+. ..+....+|+....++.... .|..-. . T Consensus 1 Ma~T~l~D--~iipe~~vf~~Yv~~~~~e~~~l~qS-Gii~~d~~l~~~~~~gG~~~~iPf~~~l~g~~e~n~~~dt~~~ 77 (349) T protein:vir:94 1 MAITTIGN--IVTGNIPVLASYMTEDPVEKTAFFNS-GILTPTPYAAEIARGPSNIANLPFWKAIDTSIEPNYSNDVYQD 77 (349) T ss_pred CCceEEee--eeccChHHHHHHHHHhHHHhhhhhhc-cceeccHHHHHHHhcCCCEEEeeeeecCCCCcccccCCCCccc Confidence 22222121 33443 2444444444344433321 1111 11222233332221222111 111100 1 Q ss_pred cccccccccceeeeechheee--eehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhccc-------ccc----- Q lcl|Aclame:pro 182 KIPDLDNPQLTIIKYLIKRYA--GIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVMK-------AAP----- 247 (408) Q Consensus 182 ~~~~~~~~~f~~v~~~~~~~~--~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~~g~g-------~~~----- 247 (408) ..+.....+..++-...+.-. ..-.++.++-- .|..+.|.+++++...+...+.++.... .+. 
T Consensus 78 ~~t~~kit~~~~~a~~~~r~kaw~~~Dla~~lsG---~dpm~~Ia~~va~yW~r~~q~~Lia~L~Gvf~~~~~~~~~~~~ 154 (349) T protein:vir:94 78 IATPRAIQTGEMMARVAYLNEGFGQADLTVELTS---QNPLQSVASRLDNFWQRQAQRRLIATALGLYNDNVSATDAYHE 154 (349) T ss_pred ccccccccccceeeeeeeeccccchhHHHHHhhC---chHHHHHHHHHHHHHhhHHHHHHHHHHHhhhcccccccccccc Confidence 111111122333333333222 23456776643 3567778888877777666655543221 100 Q ss_pred --------chhhhhhHHHHHHHHHHhhhhh----ccCCCEEEEcHHHHHHHHhhhcccCceeeccccccCCcccccccce Q lcl|Aclame:pro 248 --------KKPTIAKFDDVITMINTAVDPA----IIATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNSYLIKGKQV 315 (408) Q Consensus 248 --------~~~~~~~~d~i~~~~~~~l~~~----~~~~a~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~l~G~pv 315 (408) ..........+++++....+.. ...-..++||+.++..|++++-= .|+ ++.-....-.+++|++| T Consensus 155 ~~~~~~d~~~~a~~~~~~~~~A~~~~Gdaa~Gd~~~~lt~i~mHS~v~~~L~~~~li--~~i-~~s~~~~~i~ty~G~~V 231 (349) T protein:vir:94 155 QNDMVVDVSATSGFDAGAFIDATQTMGDALMGNGGEVLGAIAMHSFVYAQARKAQLI--DFI-RDAENNTMFATYQGYRV 231 (349) T ss_pred cCceeEEecccCCCChhhHHHHHHHHHHHhccccccceeEEEEchHHHHHHHhcchh--hhc-cCcccCcccceecCcEE Confidence 0112234555666654432222 22345799999999998876310 011 11111112368999999 Q ss_pred Eeecccccccccc-Cc-ceEEEEehhcceEeeecc---ceEEEEeccchhhhhhceeeEEEEeeeCcEEecccceEEEEe Q lcl|Aclame:pro 316 IVVADRWLPNTGS-TV-YPLYYGDMSQAITLFDRE---NMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSF 390 (408) Q Consensus 316 ~~~~~~~~~~~~~-~~-~~~~~gd~~~~~~~~~~~---~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~ 390 (408) ++.|..++-..+. +. .+.+||. .++...+.. .+++.+++..++. 
.++..+....| .+++|.+|...+ T Consensus 232 ivDD~~Pv~~~g~~~~yttylfg~--GAi~~~~~~~~~~~E~~rd~~~g~~--~G~d~L~~R~~---~~~hp~G~s~~~- 303 (349) T protein:vir:94 232 IVDDSMTVVGQDTSRKFISIIFGQ--GAIGYGEGNPEMPLEYEREASRANG--GGVETLWTRKT---WLLHPFGYSFTS- 303 (349) T ss_pred EEeCCCccccCCCCceEEEEEeec--ceEEeecCCCCcceeeecccccCCc--ceeEEEEEeeE---EEeeeeeeeecc- Confidence 9977643322111 11 1235663 222222221 2444444432221 23333333333 367787776643 Q ss_pred eccccCCC----CccC----CCcccC Q lcl|Aclame:pro 391 SAIADQVG----NFKT----TTSTAV 408 (408) Q Consensus 391 ~~~~~~~~----~~~~----~~~~~~ 408 (408) +.++..+. .+++ .++..- T Consensus 304 a~v~~~~~~~~~~sPt~aeLa~~~NW 329 (349) T protein:vir:94 304 AVITGNGTETIARSASWQDLANAANW 329 (349) T ss_pred cccCCCccccccCCCChHHhcCCcCc Confidence 11111000 0111 000000 No 192 >protein:vir:95875 Length: 401 # NCBI annotation: major coat protein # Family: family:all:10944 # MgeID: mge:1586 # MgeName: N4 # Cross-refs: genbank:acc:YP_950534;genbank:gi:119952248;genbank:GeneID:5075702 Probab=93.73 E-value=0.0066 Score=32.55 Aligned_cols=286 Identities=13% Similarity=0.105 Sum_probs=133.4 Q ss_pred HHHHhhcchhhHHHHHHHHhhccccccCceecchhh-hhhhhhhhhhhhhhhhhhceeecccCccceEEeeccCCcccc- Q lcl|Aclame:pro 97 FVNMVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDI-RTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLT- 174 (408) Q Consensus 97 ~~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~-~~~ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~~- 174 (408) +.++..-. 
.+...++.++.|.-+-+.+ ..+.+..+.+...+.+++...+++...|.-...+.-..-+.+ T Consensus 1 ~~~~~a~~---------~~~~~s~~g~~~~~~~t~y~~~k~L~~Aa~~lv~~~fA~~~piPkn~GkTIk~r~y~pl~~~~ 71 (401) T protein:vir:95 1 MLNYNAPT---------DGQKSSIDGANSDQMQTFFWLKKAIITARKEQYFMPLASVTNMPKHYGKTIKVYEYVPLLDDR 71 (401) T ss_pred CCccCCCc---------ccccccccccccceeeehhhHHHHHhhhhhhhhhhhcccccccccccCCeEEEEecccccccc Confidence 11110000 0011122233344445533 233333344458889999999999888865433221111110 Q ss_pred chhcccccccc--------------ccc-------------------ccceeeeechheeeeehHHHHHHHh-cchHHHH Q lcl|Aclame:pro 175 VMDAEDGKIPD--------------LDN-------------------PQLTIIKYLIKRYAGIITATNTSLK-DTAENIL 220 (408) Q Consensus 175 ~~~~E~~~~~~--------------~~~-------------------~~f~~v~~~~~~~~~~~~iS~ell~-ds~~~~~ 220 (408) .-..||-+..- ..+ .+-..+..+.++++.+..+|++++. +++..+. T Consensus 72 ~pl~eGv~a~G~~~~~g~~y~~~rdv~~it~~m~~~t~~~~rvn~v~~~~~d~~g~l~qyG~~~e~Td~~~dt~~D~~l~ 151 (401) T protein:vir:95 72 NINDQGIDASGATIVNGNLYGSSKDIGNITSKLPLLTENGGRVNRVGFTRIAREGSIHKFGFFYEFTQESIDFDSDDGLM 151 (401) T ss_pred cchhcCCCcccccccCccccccccccceeecccccccccccccccccceeeeeeeeeeeccCccchhhhhhhhhcchHHH Confidence 01122221110 000 1112355678899999999998865 3445566 Q ss_pred HHHHHH-HHHHHHHH---HHHHHhhccc------c---------ccchhhhhhHHHHHHHHHHhhh-------------- Q lcl|Aclame:pro 221 AWLSSW-IAKKVVVT---RNQAIIEVMK------A---------APKKPTIAKFDDVITMINTAVD-------------- 267 (408) Q Consensus 221 ~~v~~~-l~~~~~~~---~~~~~~~g~g------~---------~~~~~~~~~~d~i~~~~~~~l~-------------- 267 (408) ..|..+ |.-+.... +-..++++-+ + .....+..+.+++..+- ..|. 
T Consensus 152 ~h~s~ell~g~~~~t~d~i~~dll~ag~~viyAg~ats~At~~~~~~~~t~vt~~~l~rl~-~~L~~nRapk~t~~i~~s 230 (401) T protein:vir:95 152 EHLSRELMNGATQITEAVLQKDLLAAAGTVLYAGAATSDATITGEGSTPSVVSYKNLMRLD-QILTENRTPTQTTIITGS 230 (401) T ss_pred HHHHHHHhhhhhhhHHHHHHHHHHhhcCeeecCCccceeeeccccccccceechhHHHHHH-HHHHhcccccchhhhhhh Confidence 554333 33333333 3344553321 1 11122333455543321 1111 Q ss_pred ----hhccCCC-EEEEcHHHHHHHHhhhcccCceeecc--------ccccCCcccccccceEeecccc------ccccc- Q lcl|Aclame:pro 268 ----PAIIATS-SLLTNQSGLNKLALVKTAEGKYLLEP--------DPTKPNSYLIKGKQVIVVADRW------LPNTG- 327 (408) Q Consensus 268 ----~~~~~~a-~~~~n~~~~~~l~~lkd~~G~~~~~~--------~~~~~~~~~l~G~pv~~~~~~~------~~~~~- 327 (408) ......+ +-+||+..-..|+.++|-.|.+-|.+ .+..+.-+.|-++.+++++-.- .+..+ T Consensus 231 ~~~dTk~i~~s~va~~h~~L~~di~a~~D~~~~~~fi~v~kYa~~~~i~~gEiG~i~~vR~i~~p~~~~w~~ag~~a~~~ 310 (401) T protein:vir:95 231 RMIDTKVIGATRVMYVGSELVPELKAMKDLFGNKAFIETQHYADAGTIMNGEVGSIDKFRIIQVPEMLHWAGAGAQATGA 310 (401) T ss_pred hccCccccccceEEEEecCchhHHHHHHHhcCCCCceehhhcCCccccccccccccCceeEEecccceeecCCccccccc Confidence 1112233 34679999999999999888887764 2344455678888877654210 00100 Q ss_pred -----------cCc----ceEEEEehhcceEeeeccce----EEEEecc-------chhhhhhceeeEEEEeeeCcEEec Q lcl|Aclame:pro 328 -----------STV----YPLYYGDMSQAITLFDRENM----SLLPTNI-------GAGAFETDTTKIRVIDRFDVKATD 381 (408) Q Consensus 328 -----------~~~----~~~~~gd~~~~~~~~~~~~~----~i~~~~~-------~~~~f~~~~~~~r~~~r~d~~v~~ 381 (408) ++. ..+++|+-.-+...+...+. 
.+.+..- ....-++..+.|++ ..++.+++ T Consensus 311 ~~~y~~~~~~~gg~~dVyp~lV~G~dAf~~~~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQ~g~vgwK~--~~a~~vL~ 388 (401) T protein:vir:95 311 NPGYRTSMVSGQEHYDVYPMLVVGDDSFTSIGFQTDGKSLKFTVMTKMPGKETADRNDPYGETGFSSIKW--YYGILVKR 388 (401) T ss_pred ccccccccccCCCcceeeeeeEEccccceecccccCCccccceeEeecCCcCCCCCCCcccceehhhhhh--hhhhheec Confidence 111 12567764333333332332 2222111 12222334444433 36778888 Q ss_pred ccceEEEEeeccccC Q lcl|Aclame:pro 382 SEALVAGSFSAIADQ 396 (408) Q Consensus 382 ~~a~~~l~~~~~~~~ 396 (408) ++=.+.++ +++|. T Consensus 389 ~e~m~~ie--s~a~~ 401 (401) T protein:vir:95 389 PERLALIK--TVAPL 401 (401) T ss_pred cceeEEEE--eecCC Confidence 88777764 33333 No 193 >protein:vir:3525 Length: 423 # NCBI annotation: major head protein # Family: family:all:1412 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050985;genbank:gi:9633571;genbank:GeneID:1262318 Probab=93.52 E-value=0.0073 Score=32.30 Aligned_cols=277 Identities=12% Similarity=-0.008 Sum_probs=106.5 Q ss_pred hhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeeccc----Ccc-ceEEeeccCCccccchhccccccccccccc Q lcl|Aclame:pro 116 ETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVST----SNG-SRVYEKWTDVTPLTVMDAEDGKIPDLDNPQ 190 (408) Q Consensus 116 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~----~~g-~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~ 190 (408) |. ++- ...||+.+..+.++.+++..++.+++..-.-.. ..| ++.++.........+-.+.+..+. 
.+..+ T Consensus 1 MA-N~l---lT~iP~iia~~al~~l~~~lV~~~lV~r~y~ge~~~a~~GDTV~I~~p~~~~v~d~~~~~~~~~~-~~~~~ 75 (423) T protein:vir:35 1 MA-NNL---ESNISQIVLKKFLPGFMSDIVLCKTVDRQLLSGEINSNTGDSVSFKRPHQFKSERTETGDITGKD-KNGLF 75 (423) T ss_pred Cc-cch---hhhhHHHHHHHHHHHHHhhcccchhcccCCCcccccccCCCEEEEeeCCcceeecccCcCCCCcc-ccccc Confidence 22 110 123799999999999999999888766533111 112 223333222111111111111111 11111 Q ss_pred ce--eeeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhcccc------ccchhhhhhHHHHHHHH Q lcl|Aclame:pro 191 LT--IIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVMKA------APKKPTIAKFDDVITMI 262 (408) Q Consensus 191 f~--~v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~~g~g~------~~~~~~~~~~d~i~~~~ 262 (408) =. .++++-+|...+.-=..|... +..+|++++...+ .+++..++..++...-. +++......+++++++- T Consensus 76 e~~v~l~id~~k~~a~~v~d~e~~l-~i~~~~~~l~~a~-~ala~~vd~~l~~~l~~~a~~~vgt~~t~~~~~~~i~~a~ 153 (423) T protein:vir:35 76 SAKATGKVGKYITVAVEWTQIEEAL-KLNQLDQILSPIH-ERMVTDLETELAHFMMNNGALSLGSPNTAIKKWADVAQTA 153 (423) T ss_pred cceeeEEeccceeccceeCHHHHHh-hHHHHHHHHHHHH-HHHHHHHHHHHHHHHhhccccccccccCCcchHHHHHHHH Confidence 12 366666666665544555443 3557777776554 66777777776542211 12222223466665553 Q ss_pred HHhhhhhccC--CCEEEEcHHHHHHHHhhhcc---cCceeeccccccCC-cccccccceEeeccccccccccCc--ceEE Q lcl|Aclame:pro 263 NTAVDPAIIA--TSSLLTNQSGLNKLALVKTA---EGKYLLEPDPTKPN-SYLIKGKQVIVVADRWLPNTGSTV--YPLY 334 (408) Q Consensus 263 ~~~l~~~~~~--~a~~~~n~~~~~~l~~lkd~---~G~~~~~~~~~~~~-~~~l~G~pv~~~~~~~~~~~~~~~--~~~~ 334 (408) ..|+....+ +...+++|..+..|.+- +. ...-.....+..+. .+++.|+.|+.+.+ +|....+. ..+. 
T Consensus 154 -~~Ld~~~vP~~~R~~Vv~p~~~a~Ll~~-~~~~~~~~~~~~~alr~g~i~G~i~GFdv~~Snn--vp~~T~gt~~~~~~ 229 (423) T protein:vir:35 154 -SFIKDIGIKTGENYAIMDPWSAQRLADA-QSGLHAADQLVRTAWENAQISGNFGGIRALMSNG--LASRKQGDFDGAIT 229 (423) T ss_pred -HHHHHhcCCcCCCEEEeCHHHHHHHhcc-ccceeccccchhHHHhhccceeeecceEEEEcCC--Ccccccccccccee Confidence 345554443 34568999998876421 11 00001111233333 36899999887543 44321111 1111 Q ss_pred E-----------EehhcceEeeeccceEEEEeccchhhhhhceeeEEEEeee---CcEE------ecccceEE------- Q lcl|Aclame:pro 335 Y-----------GDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRF---DVKA------TDSEALVA------- 387 (408) Q Consensus 335 ~-----------gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~---d~~v------~~~~a~~~------- 387 (408) . .+.+..+.. +...+....+....-|...|-+..-+ ...+ .++.=|++ T Consensus 230 v~~a~~v~~~a~~~~~~~~~~-----~~~~~~~~~g~l~~GD~~t~aGv~~v~~~t~~~~~~~~t~~~~~~~V~~~~~~~ 304 (423) T protein:vir:35 230 VKTAPNVDYLSVKDSYQFTVA-----LTGATPSKTGFLKAGDQLKFTSTHWLNQQSKQTLYNGSTAMSFTATVLEETNST 304 (423) T ss_pred eccccccccccccccccceee-----eeeeeeccCCcEEecceEEeeeeeeccccccceeecccCCceeEEEEecccccc Confidence 1 011111000 00000000000000111111110000 0000 00000111 Q ss_pred ------EEeeccc-cCCCC----cc---CCCcccC Q lcl|Aclame:pro 388 ------GSFSAIA-DQVGN----FK---TTTSTAV 408 (408) Q Consensus 388 ------l~~~~~~-~~~~~----~~---~~~~~~~ 408 (408) |++.+.- +..+. 
++ ..+..+| T Consensus 305 a~g~~~v~i~p~~~~~~~~~~~~~v~a~~a~~~~v 339 (423) T protein:vir:35 305 ASGDVTVKLSGVPIYDEKNSQYNAVDAKVKAGDAV 339 (423) T ss_pred ccCceeEEccccccccCCCcccccccccccCCcee Confidence 1111110 00000 00 0000000 No 194 >protein:vir:78777 Length: 358 # NCBI annotation: putative major capsid protein # Family: family:all:201 # MgeID: mge:1857 # MgeName: phiO18P # Cross-refs: genbank:acc:YP_001285647;genbank:gi:148727153;genbank:GeneID:5220125 Probab=92.98 E-value=0.0092 Score=31.74 Aligned_cols=296 Identities=11% Similarity=0.051 Sum_probs=152.1 Q ss_pred cccchhhhHHHHHHHHHHHhhcchhhHHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeecccCccc Q lcl|Aclame:pro 82 LNKSENELKDKFVKDFVNMVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGS 161 (408) Q Consensus 82 ~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~ 161 (408) ++.. .....+..|..++..-.... .+. ....+-.+.|.+.....+.+.+.+.+-+++.++++++....|. T Consensus 1 m~~~---M~~~tr~~~~~y~~~~A~~n-gv~------~~~~~~~Fsv~p~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge 70 (358) T protein:vir:78 1 MSQT---LTVQAEQRLNKYCDALAKAY-GID------ISKLDKQFSVTGPVETTLRSALLASVEFLGLITCLDVDQIKGQ 70 (358) T ss_pred Cccc---ccHHHHHHHHHHHHHHHHHh-CCC------hhHccceeeeChHHHHHHHHHHHHHHHHhhcCcccccccceee Confidence 2221 22334444544443221110 000 0122346889999999999999999999999999999988887 Q ss_pred eEEeeccCCccccchhcccccccccccccceeeeechheeeeehHHHHHHHhcch-----HHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 162 RVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTA-----ENILAWLSSWIAKKVVVTRN 236 (408) Q Consensus 162 ~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~-----~~~~~~v~~~l~~~~~~~~~ 236 (408) ...... ..+.++-...+. 
+ .+...++.-.+..++.---..|+.+.|...+ .+|..-+.+.+.++++.=.- T Consensus 71 ~v~lg~--~g~iagrt~tr~--~-~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~f~~~~dF~~r~~~~i~~~~ALD~i 145 (358) T protein:vir:78 71 VVQVGV--GQLYTGRKKGGR--F-KGKVGVDGNTYELTETDSCASLDWATLCTWANAGSEGEFIKLVGEFVNKAFALDML 145 (358) T ss_pred EEeecC--CcccceecCCCc--c-ccccccCCCccEEEEeceeeeccHHHHHHHHhCCChhHHHHHHHHHHHHHHhhccc Confidence 754322 112222111111 1 1223344445555555544667777777644 26888888888888765444 Q ss_pred HHHhhcccccc--------------------------------------------chhhhhhHHHHHHH-HHHhhhhhcc Q lcl|Aclame:pro 237 QAIIEVMKAAP--------------------------------------------KKPTIAKFDDVITM-INTAVDPAII 271 (408) Q Consensus 237 ~~~~~g~g~~~--------------------------------------------~~~~~~~~d~i~~~-~~~~l~~~~~ 271 (408) .-.++|+..+. ..+.-.+.|.++.. ++..+++.++ T Consensus 146 ~IGfNGts~A~~Td~~~nPllqDVN~GWlQ~~Re~a~~~v~~~~~~~~~i~ig~g~~Gdy~NLDalV~D~~~~lI~~~~~ 225 (358) T protein:vir:78 146 RVGWNGVSAADDTDPTANPLGQDVNKGWHQLAREWKGGSQIIKAAAGEKIYFDPDGKGEYKTLDEMASDLINTTIDPLFQ 225 (358) T ss_pred eecccceeeccCCChhhCcCccccchHHHHHHHhhchhhhhccccccCceeecCCCCCccccHHHHHHHHHhccCChHHh Confidence 44455543211 01223456666654 5677888777 Q ss_pred C--CCEEEEcHHHHHH--HHhhhcccCcee--eccccccCCcccccccceEeeccccccccccCcceEEEEehhcceEee Q lcl|Aclame:pro 272 A--TSSLLTNQSGLNK--LALVKTAEGKYL--LEPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLF 345 (408) Q Consensus 272 ~--~a~~~~n~~~~~~--l~~lkd~~G~~~--~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~ 345 (408) . .-++||.++.++. ...+ +..+.|- +..... ..+|-|+|.+.++. +|... 
+++=-|++.-..+ T Consensus 226 ~d~dLVvivG~dLla~k~~~l~-n~~~~pTE~~Aa~~i---~k~iGGlpa~~~Pf--FP~~~-----ilVT~L~NLsIY~ 294 (358) T protein:vir:78 226 QDPRLVVLVGTDLVAAAQAKLY-SEATKPSEQIAAQQL---AKSIAGRKAYIPPF--FPGKR-----MVVTTLDNLHCYT 294 (358) T ss_pred cCCCEEEEEchhhhhHHhhhHh-hcCCCcHHHHHHHHH---HHHhCCCeEEEccc--cCCCc-----eEEeeccccEEEE Confidence 5 4678888887652 3333 2222210 000111 15789999998653 56533 4555555543334 Q ss_pred eccceEEEEec-cchhhhhhceeeEEEEeeeCcEEecccceEEEEeeccc---cCCCCccCCCcccC Q lcl|Aclame:pro 346 DRENMSLLPTN-IGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFSAIA---DQVGNFKTTTSTAV 408 (408) Q Consensus 346 ~~~~~~i~~~~-~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~---~~~~~~~~~~~~~~ 408 (408) .++..+=.+.+ ...+.++.++ ..--|..|-++..++.+....+. +..+.-....+++= T Consensus 295 Q~gs~RR~~~d~p~r~riE~y~-----s~Ne~YvVEd~~~~a~iE~i~v~~~~~pa~~~~~~~~~~~ 356 (358) T protein:vir:78 295 QRGTRKRKADDNQDSKSFDNQY-----WRMEGYALGEHKAYGGFEEADIEIGADPAVLAVEAAAQAG 356 (358) T ss_pred ecCcEEEEEEeccccccccchh-----hhcceeeeeccccEEEEeeeeeeeCCCCCccccCCccccC Confidence 44443333222 1122222211 12235567777777776644432 22222222222222 No 195 >protein:vir:99311 Length: 463 # NCBI annotation: putative capsid protein # Family: family:all:2450 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024474;genbank:gi:48696433;genbank:GeneID:2948039 Probab=92.51 E-value=0.011 Score=31.30 Aligned_cols=305 Identities=11% Similarity=0.035 Sum_probs=137.9 Q ss_pred cccccchhhhHHHHHHHHHHHhhcchhhHHHHHHHHhhc------cccccCceecchhhhhhhhhhhhhhh--hhhhhhc Q lcl|Aclame:pro 80 GPLNKSENELKDKFVKDFVNMVRNPMAFMNTVSSKTETS------GSDSAAGLTIPQDIRTMINTLVRQYD--SLQQYVR 151 (408) Q Consensus 80 ~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~a~~~------~t~~~gg~~vP~~~~~~ii~~~~~~~--~l~~~~~ 151 (408) .+........++.+...|.+.+. ++.++ .+-.+||.+=-..+..+|..+..... .+.+-+. 
T Consensus 1 ~~~~~~~~~~~~~~~~~~~e~~~-----------KS~~tg~g~~p~~q~~~~AlR~EsL~~~i~~Lt~~~~~f~~~~~i~ 69 (463) T protein:vir:99 1 MTIEKNLSDVQQKYADQFQEDVV-----------KSFQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDIS 69 (463) T ss_pred CCcccccchHHHHHHhhhhHHHH-----------HHhhcCCccCCccccCcchhhhhhhhhhhheeeecccchhhhhhcC Confidence 22222222333333333322211 11222 11233444433444444433333222 2233334 Q ss_pred eeecccCccceEEeeccCCccccchhcccccccccccccceeeeechheeeeehHHHHHH-HhcchHHHHHHHHHHHHHH Q lcl|Aclame:pro 152 VESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTS-LKDTAENILAWLSSWIAKK 230 (408) Q Consensus 152 ~~~~~~~~g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~el-l~ds~~~~~~~v~~~l~~~ 230 (408) +.+..+.-..|......+..+.+.+++|++-.+ .+++++.+.....+-++....+|.-+ +.++..+.+..+.+.-... T Consensus 70 k~~a~STV~~y~~~~~~G~~g~~~f~~E~g~~~-~~d~~~~Rr~~~~K~l~~~~~VS~~~~l~n~~~d~~~~~~~dai~~ 148 (463) T protein:vir:99 70 RRPAQSTVVKYDQYLRHGNVGHSRFVKEIGVAP-VSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDAIAV 148 (463) T ss_pred CchhhhhhhhheeeeccCccccccccccccccc-cCCCceEEEEEEeeeeehhhhhhhHHHhhcccccHHHHHHHHHHHH Confidence 444444444444444445667788899999875 67899999999999999988887755 4566778888999999999 Q ss_pred HHHHHHHHHhhccccccchh--hhhhHHHHHHHH---------------------HHhhhhhccCCCEEEEcHHHHHHHH Q lcl|Aclame:pro 231 VVVTRNQAIIEVMKAAPKKP--TIAKFDDVITMI---------------------NTAVDPAIIATSSLLTNQSGLNKLA 287 (408) Q Consensus 231 ~~~~~~~~~~~g~g~~~~~~--~~~~~d~i~~~~---------------------~~~l~~~~~~~a~~~~n~~~~~~l~ 287 (408) ++..++.+.+.|+..-.+.. -...+|.+.+++ ...+..+|....-++|+.-+.+.+. 
T Consensus 149 ia~tiE~a~FyGds~l~~~~~~~gleFDGl~~lId~enviDarG~~Ls~~~ln~Aa~~i~~~fGt~TD~~lp~~vka~f~ 228 (463) T protein:vir:99 149 VAKTIEWASFYGDASLTSEVEGEGLEFDGLAKLIDKNNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHADFV 228 (463) T ss_pred HHHHHHHHHhhhhhccCCCcCccccchhhhhhhcCCCCeeecCCCcccHHHHhhhhhhhhcccCChhheecchHHHHHHH Confidence 99999999999987655521 223455543332 2223334554445677887777775 Q ss_pred h-hhcccCceeeccccccCCcccccccceEee-----ccccccccccCcceEEEE--------ehhcceEeeeccceEEE Q lcl|Aclame:pro 288 L-VKTAEGKYLLEPDPTKPNSYLIKGKQVIVV-----ADRWLPNTGSTVYPLYYG--------DMSQAITLFDRENMSLL 353 (408) Q Consensus 288 ~-lkd~~G~~~~~~~~~~~~~~~l~G~pv~~~-----~~~~~~~~~~~~~~~~~g--------d~~~~~~~~~~~~~~i~ 353 (408) . +-..+ |.+..++.. ....|+||--. +-..-++...+ .+.+++ +|.-... -.++. T Consensus 229 ~~~l~~q-rv~~~~N~~----~~~~G~~v~~f~s~~G~I~L~~s~~m~-~~~il~~~~~~~p~ap~~~~~-----tatv~ 297 (463) T protein:vir:99 229 NSILGRQ-MQLMQDNSG----NVNTGYSVNGFYSSRGFIKLHGSTVME-NELILDESLQPLPNAPQPAKV-----TATVE 297 (463) T ss_pred HHhcCce-EEEEcCCCC----ceeeeeeccceeeeeeeeeeCCceecC-CcccccchhhcCCCCccCcee-----EEEEe Confidence 2 21111 222222211 12334433210 00000111111 111111 1111000 01221 Q ss_pred Eeccchhhh---hhceeeEEEEeeeCcEEecccceEEEEeeccccCCCCccC-CCcccC Q lcl|Aclame:pro 354 PTNIGAGAF---ETDTTKIRVIDRFDVKATDSEALVAGSFSAIADQVGNFKT-TTSTAV 408 (408) Q Consensus 354 ~~~~~~~~f---~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~~~~~-~~~~~~ 408 (408) ..+. 
+..| ......+++...-+..--.|+.++-.++....+...-+-+ .+..++ T Consensus 298 ~~~~-~~~~~~~~~a~~~Y~vv~~s~~geS~pS~ivtaT~a~~~~gv~l~It~~a~~~~ 355 (463) T protein:vir:99 298 TKQK-GAFENEEDRAGLSYKVVVNSDDAQSAPSEEVTATVSNVDDGVKLSINVNAMYQQ 355 (463) T ss_pred eccC-CCCCCcccccceEEEEEEECCCCCcccchheeeeeeeccceEEEEEEecCCccc Confidence 1111 1011 1222334444444443334554444333322221111111 011111 No 196 >protein:vir:95603 Length: 463 # NCBI annotation: ORF016 # Family: family:all:2450 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240903;genbank:gi:66394965;genbank:GeneID:5132544 Probab=92.51 E-value=0.011 Score=31.30 Aligned_cols=305 Identities=11% Similarity=0.035 Sum_probs=137.9 Q ss_pred cccccchhhhHHHHHHHHHHHhhcchhhHHHHHHHHhhc------cccccCceecchhhhhhhhhhhhhhh--hhhhhhc Q lcl|Aclame:pro 80 GPLNKSENELKDKFVKDFVNMVRNPMAFMNTVSSKTETS------GSDSAAGLTIPQDIRTMINTLVRQYD--SLQQYVR 151 (408) Q Consensus 80 ~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~a~~~------~t~~~gg~~vP~~~~~~ii~~~~~~~--~l~~~~~ 151 (408) .+........++.+...|.+.+. ++.++ .+-.+||.+=-..+..+|..+..... .+.+-+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~e~~~-----------KS~~tg~g~~p~~q~~~~AlR~EsL~~~i~~Lt~~~~~f~~~~~i~ 69 (463) T protein:vir:95 1 MTIEKNLSDVQQKYADQFQEDVV-----------KSFQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDIS 69 (463) T ss_pred CCcccccchHHHHHHhhhhHHHH-----------HHhhcCCccCCccccCcchhhhhhhhhhhheeeecccchhhhhhcC Confidence 22222222333333333322211 11222 11233444433444444433333222 2233334 Q ss_pred eeecccCccceEEeeccCCccccchhcccccccccccccceeeeechheeeeehHHHHHH-HhcchHHHHHHHHHHHHHH Q lcl|Aclame:pro 152 VESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTS-LKDTAENILAWLSSWIAKK 230 (408) Q Consensus 152 ~~~~~~~~g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~el-l~ds~~~~~~~v~~~l~~~ 230 (408) +.+..+.-..|......+..+.+.+++|++-.+ .+++++.+.....+-++....+|.-+ +.++..+.+..+.+.-... 
T Consensus 70 k~~a~STV~~y~~~~~~G~~g~~~f~~E~g~~~-~~d~~~~Rr~~~~K~l~~~~~VS~~~~l~n~~~d~~~~~~~dai~~ 148 (463) T protein:vir:95 70 RRPAQSTVVKYDQYLRHGNVGHSRFVKEIGVAP-VSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDAIAV 148 (463) T ss_pred CchhhhhhhhheeeeccCccccccccccccccc-cCCCceEEEEEEeeeeehhhhhhhHHHhhcccccHHHHHHHHHHHH Confidence 444444444444444445667788899999875 67899999999999999988887755 4566778888999999999 Q ss_pred HHHHHHHHHhhccccccchh--hhhhHHHHHHHH---------------------HHhhhhhccCCCEEEEcHHHHHHHH Q lcl|Aclame:pro 231 VVVTRNQAIIEVMKAAPKKP--TIAKFDDVITMI---------------------NTAVDPAIIATSSLLTNQSGLNKLA 287 (408) Q Consensus 231 ~~~~~~~~~~~g~g~~~~~~--~~~~~d~i~~~~---------------------~~~l~~~~~~~a~~~~n~~~~~~l~ 287 (408) ++..++.+.+.|+..-.+.. -...+|.+.+++ ...+..+|....-++|+.-+.+.+. T Consensus 149 ia~tiE~a~FyGds~l~~~~~~~gleFDGl~~lId~enviDarG~~Ls~~~ln~Aa~~i~~~fGt~TD~~lp~~vka~f~ 228 (463) T protein:vir:95 149 VAKTIEWASFYGDASLTSEVEGEGLEFDGLAKLIDKNNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHADFV 228 (463) T ss_pred HHHHHHHHHhhhhhccCCCcCccccchhhhhhhcCCCCeeecCCCcccHHHHhhhhhhhhcccCChhheecchHHHHHHH Confidence 99999999999987655521 223455543332 2223334554445677887777775 Q ss_pred h-hhcccCceeeccccccCCcccccccceEee-----ccccccccccCcceEEEE--------ehhcceEeeeccceEEE Q lcl|Aclame:pro 288 L-VKTAEGKYLLEPDPTKPNSYLIKGKQVIVV-----ADRWLPNTGSTVYPLYYG--------DMSQAITLFDRENMSLL 353 (408) Q Consensus 288 ~-lkd~~G~~~~~~~~~~~~~~~l~G~pv~~~-----~~~~~~~~~~~~~~~~~g--------d~~~~~~~~~~~~~~i~ 353 (408) . +-..+ |.+..++.. ....|+||--. +-..-++...+ .+.+++ +|.-... -.++. 
T Consensus 229 ~~~l~~q-rv~~~~N~~----~~~~G~~v~~f~s~~G~I~L~~s~~m~-~~~il~~~~~~~p~ap~~~~~-----tatv~ 297 (463) T protein:vir:95 229 NSILGRQ-MQLMQDNSG----NVNTGYSVNGFYSSRGFIKLHGSTVME-NELILDESLQPLPNAPQPAKV-----TATVE 297 (463) T ss_pred HHhcCce-EEEEcCCCC----ceeeeeeccceeeeeeeeeeCCceecC-CcccccchhhcCCCCccCcee-----EEEEe Confidence 2 21111 222222211 12334433210 00000111111 111111 1111000 01221 Q ss_pred Eeccchhhh---hhceeeEEEEeeeCcEEecccceEEEEeeccccCCCCccC-CCcccC Q lcl|Aclame:pro 354 PTNIGAGAF---ETDTTKIRVIDRFDVKATDSEALVAGSFSAIADQVGNFKT-TTSTAV 408 (408) Q Consensus 354 ~~~~~~~~f---~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~~~~~-~~~~~~ 408 (408) ..+. +..| ......+++...-+..--.|+.++-.++....+...-+-+ .+..++ T Consensus 298 ~~~~-~~~~~~~~~a~~~Y~vv~~s~~geS~pS~ivtaT~a~~~~gv~l~It~~a~~~~ 355 (463) T protein:vir:95 298 TKQK-GAFENEEDRAGLSYKVVVNSDDAQSAPSEEVTATVSNVDDGVKLSINVNAMYQQ 355 (463) T ss_pred eccC-CCCCCcccccceEEEEEEECCCCCcccchheeeeeeeccceEEEEEEecCCccc Confidence 1111 1011 1222334444444443334554444333322221111111 011111 No 197 >protein:vir:80128 Length: 466 # NCBI annotation: Phage capsid protein # Family: family:all:635 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425603;genbank:gi:155042936;genbank:GeneID:5469556 Probab=92.32 E-value=0.012 Score=31.13 Aligned_cols=375 Identities=10% Similarity=0.012 Sum_probs=96.1 Q ss_pred HHHHHH--HHHHHHHHHHHHHHHHHHHHHHhhhcccHHHHHHHH--HHHHHHHHHHHHHHHHHHHHH-------HHHhhh Q lcl|Aclame:pro 5 LTVNQL--NEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELK--NKRDNEKVRRDALREQLVEAQ-------AEQVVN 73 (408) Q Consensus 5 ~~i~el--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~-------~~~~~~ 73 (408) |-+++| +++++.+..++.++.++..++.+........+.+.+ +++..+..+++.++....+++ ...... 
T Consensus 1 ~~~~~~~l~~~~~~~~~~l~el~e~~~~l~k~~~el~~~l~ea~~~ee~~~~ee~i~~l~~~~~el~e~~~~l~~ei~~l 80 (466) T protein:vir:80 1 MALRQLMLAKKIEQRKAALAELLEQEKALQKRSEELEAAIDEANTDEEIAVVEDEINKLEGEKTELEEKKSKLEGEIKEL 80 (466) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 988884 444666666777666665544332221111121111 111122223333333322222 111110 Q ss_pred cccccc-cccccchhhhHHHHHHHHHHHhhcchhhHHH------HHHHHhhccccccC------------ceecc-h-hh Q lcl|Aclame:pro 74 MREEEK-GPLNKSENELKDKFVKDFVNMVRNPMAFMNT------VSSKTETSGSDSAA------------GLTIP-Q-DI 132 (408) Q Consensus 74 ~~~~~~-~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~------~~~~a~~~~t~~~g------------g~~vP-~-~~ 132 (408) ...... .................+...++.+...... .+.+.......... +.... . .+ T Consensus 81 e~el~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~ 160 (466) T protein:vir:80 81 ENELEQLNNKEPKNNSEPAQVSGARTQQFVGGETRMKGFFRNMPYEQRAALIARSEVKEFLAQVRTLAQQKRAVSGAELT 160 (466) T ss_pred HHHHHHHHHhhhccCchhHHHHhhhhhHHhhHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHhhhhhhhcccccc Confidence 000000 0000000001111111222222221111100 01111100000000 00000 0 00 Q ss_pred h-hhhhhhhhhhhhhhhhhceeecccCccceEEeeccCCccccchhccc-----ccccccccccceeeeechheeeeehH Q lcl|Aclame:pro 133 R-TMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAED-----GKIPDLDNPQLTIIKYLIKRYAGIIT 206 (408) Q Consensus 133 ~-~~ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~E~-----~~~~~~~~~~f~~v~~~~~~~~~~~~ 206 (408) . ..++..+.+. +...-++.. ...+....+. .-|.-.+ .-..+.....-...+|..-.+...-. 
T Consensus 161 vP~~~~~~i~~~-----l~~~~~l~~---~~~v~~~~g~---~~~~~~~~~~~a~wv~E~~~~~~~~~~f~~i~~~~~k~ 229 (466) T protein:vir:80 161 IPDVMLELLRDN-----MHRYSKLIS---KVRLRPLKGT---ARQNIAGAIPEGVWTEAVANLNELSLSFSQIEVDGYKV 229 (466) T ss_pred ccHHHHHHHHHh-----hhhhhhhhh---heeeeecCce---eEeeeecCCcceeecccccccccccccccceeecceee Confidence 1 1122222111 111111110 1111111111 1111111 00111110000111122111111100 Q ss_pred HHHHHHhcchHH-HHHHHHHHHHHHHHHHHHHHHhhccccccchhhhhhHHHHHHHHHHh---------------hh--- Q lcl|Aclame:pro 207 ATNTSLKDTAEN-ILAWLSSWIAKKVVVTRNQAIIEVMKAAPKKPTIAKFDDVITMINTA---------------VD--- 267 (408) Q Consensus 207 iS~ell~ds~~~-~~~~v~~~l~~~~~~~~~~~~~~g~g~~~~~~~~~~~d~i~~~~~~~---------------l~--- 267 (408) -..--+.+...+ -...+...|...++.++-...-...=.|.... ...+|+..+... +. T Consensus 230 ~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~ail~G~G~~---~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~ 306 (466) T protein:vir:80 230 GGFIPIPNSTLEDSDLNLADEILDAIGQAIGFALDKAILYGTGTK---MPVGIVTRLAQTTQPPNWGTKAPAWTNLSTTN 306 (466) T ss_pred eeehhhhHHHHhcchHHHHHHHHHHHHHHHHHHHhhheeeccCCC---Ccceeeecccccccccccccccccccccchhh Confidence 000001111111 11235555555566555555544433343322 233554322100 00 Q ss_pred ----hhccCCCEEEEcHHHH-HHHHhhhcccCceeeccccccCCcccccccceEeeccccccccccCcceEEEEehhcce Q lcl|Aclame:pro 268 ----PAIIATSSLLTNQSGL-NKLALVKTAEGKYLLEPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAI 342 (408) Q Consensus 268 ----~~~~~~a~~~~n~~~~-~~l~~lkd~~G~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~ 342 (408) ..+..++.+.+++.++ ..+.+.++.+|.++|.++... ...|.|..+....+..+.....+..+ ++|-. 
+ T Consensus 307 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~~--~~~l~~~~~~~~~~g~~~~~~~~~~~-i~G~p---v 380 (466) T protein:vir:80 307 LLKIDPTGKSAEEFFSELVLKLSKARANYSNGMKFWAMSSNT--HAVLMSKAITFNSAGALVASLNNTMP-IVGGD---I 380 (466) T ss_pred hhhhhhhccchhhHHHHHHHHHHhhhccccCCceeEEecchh--HHHhhcccccccCCccccccCCCccc-ccccc---e Confidence 1112233333333332 234456677888888754332 22444544332211111110111111 22210 0 Q ss_pred Eeeecc-ceEEEEeccchhhhhhceeeEEEEeeeCcEEeccc------------ceEEEEeecccc---------CCCCc Q lcl|Aclame:pro 343 TLFDRE-NMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSE------------ALVAGSFSAIAD---------QVGNF 400 (408) Q Consensus 343 ~~~~~~-~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~------------a~~~l~~~~~~~---------~~~~~ 400 (408) +..+.. .-.+ .+..|..+.+. .|.+..+.... ++..+..+..-+ ....+ T Consensus 381 v~s~~~~~~~~-----~~g~~~~y~i~----~r~~~~i~~~~~~~f~~d~~~~r~~~r~dg~~~~~~afv~~~~~~~~~~ 451 (466) T protein:vir:80 381 VILDFIPDNDI-----IGGYGSLYLLA----ERADIKLAQSEHVRFIEDQTVFKGTARYDGKPVFGEGFVAVNIANANPT 451 (466) T ss_pred eecCccCccce-----eeeccccEEEE----eecceEEEechhhhhhcCcEEEEEEEEEccEEeccCceEEEEecCCCcc Confidence 000000 0000 11112221111 12222111111 122222222111 11111 Q ss_pred cCCCc-ccC Q lcl|Aclame:pro 401 KTTTS-TAV 408 (408) Q Consensus 401 ~~~~~-~~~ 408 (408) ++.++ .+. 
T Consensus 452 ~~~~~~~~~ 460 (466) T protein:vir:80 452 TSITFAPDE 460 (466) T ss_pred cceeeecCc Confidence 11111 111 No 198 >protein:vir:98856 Length: 343 # NCBI annotation: hypothetical protein # Family: family:all:201 # MgeID: mge:1495 # MgeName: F108 # Cross-refs: genbank:acc:YP_654732;genbank:gi:109302917;genbank:GeneID:4156061 Probab=92.15 E-value=0.013 Score=30.99 Aligned_cols=294 Identities=8% Similarity=0.048 Sum_probs=142.1 Q ss_pred hHHHHHHHHHHHhhcchhhHHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeecccCccceEEeecc Q lcl|Aclame:pro 89 LKDKFVKDFVNMVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWT 168 (408) Q Consensus 89 ~~~~~~~a~~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~ 168 (408) .+..-+..|..++..-.... .+. ....+.+.-+.|.+.....+.+.+.+.+-+++.++++++....|..... .. T Consensus 1 M~~~tr~~~~~y~~~~A~~n-gv~----~~~~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~q~~g~v~~~-~~ 74 (343) T protein:vir:98 1 MNKTAQELFYSLIGDAAEYY-GAN----PALALAGKQFSIEAPKESVLLGAIQQRSNFLEKINCVFSERYQRAIDLR-SN 74 (343) T ss_pred CChHHHHHHHHHHHHHHHHh-CCc----cchhccCceeeecHHHHHHHHHHHHHHHHHhhcCceecchhhcceEEEe-ec Confidence 22333344444443211110 000 0001223358899999999999999999999999999987665554332 21 Q ss_pred CCccccchhcccccccccccccceeeeechheeeeehHHHHHHHhcch--HH-HHHHHHHHHHHHHHHHHHHHHhhcccc Q lcl|Aclame:pro 169 DVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTA--EN-ILAWLSSWIAKKVVVTRNQAIIEVMKA 245 (408) Q Consensus 169 ~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~--~~-~~~~v~~~l~~~~~~~~~~~~~~g~g~ 245 (408) ++ +...-....+...+. ...+.-.+..++.---..|+.+.|...+ .| |..-+.+.+.++++.=.-.-.++|+.. 
T Consensus 75 sg-~~t~r~~t~~~~~~~--~~~~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~deF~~r~~~~i~~~~ALD~i~IGfNGts~ 151 (343) T protein:vir:98 75 RK-RHYGAHDRRTPIQQR--WTRQVMSMNVSRQIQACLIPWAKLDQWGHLKDKFASLYAEFVQNQIALDMIKIGFYGTSV 151 (343) T ss_pred Cc-cccCccccCCCcccc--ccCCCCccEEEEeeeeeeccHHHHHHhhcChhHHHHHHHHHHHHHHhhccceecccceee Confidence 11 111100010000000 0111112333333333456666666532 45 777777777777664433334455432 Q ss_pred cc----------------------------------------chhhhhhHHHHHHHHHHhhhhhccC--CCEEEEcHHHH Q lcl|Aclame:pro 246 AP----------------------------------------KKPTIAKFDDVITMINTAVDPAIIA--TSSLLTNQSGL 283 (408) Q Consensus 246 ~~----------------------------------------~~~~~~~~d~i~~~~~~~l~~~~~~--~a~~~~n~~~~ 283 (408) +. ..+.-.+.|.++..+...+++.++. .-++||.++.+ T Consensus 152 A~~T~nPllqDVN~GWLQ~~Re~ap~rVm~~~~~~~~~~~~G~ggdy~NLDalV~D~~~~I~~~~~~d~dLVvivG~dLl 231 (343) T protein:vir:98 152 GTDTSDPNLADVNKGWIQFVRENKATQILTQGATSGEIRLFGEGADYVNLDELAYDLKQGLDARHRDAGDLVFLVGADLV 231 (343) T ss_pred ccCCCCcchhhcchHHHHHHHhcchhhhhccceeccceeEecCCCCcccHHHHHHHHHhcCchHHhcCCCEEEEEchhhh Confidence 21 0112345666665555668887764 45788888775 Q ss_pred HHH-HhhhcccCceeeccccc--cCCcccccccceEeeccccccccccCcceEEEEehhcceEeeeccceEEEEec-cch Q lcl|Aclame:pro 284 NKL-ALVKTAEGKYLLEPDPT--KPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTN-IGA 359 (408) Q Consensus 284 ~~l-~~lkd~~G~~~~~~~~~--~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~-~~~ 359 (408) +.= ..|-++++++.-..-.. --...++-|+|.+.++. +|... +++=-|++.-..+.++..+=.+.+ ... 
T Consensus 232 a~~~~~l~n~~~~~ptEk~Aa~~~~~~k~iGGl~a~~~Pf--FP~~~-----llVT~L~NLsIY~Q~gs~RR~~~d~p~r 304 (343) T protein:vir:98 232 AKEASLVYKGNGLIATEKAALNTHDLMKSFGGMPAMIVPN--MPPRA-----AIVTSLSNLSIYTQEGSMRRGMKDDDDK 304 (343) T ss_pred hhhhhhhhhhcCCChHHHHHHHHHHHHHhhCCCeeEEccc--cCCCc-----eEEeeccccEEEEecCcEEEEEEecccc Confidence 531 22323333322111000 00125788999998653 56543 455555554333444444333222 112 Q ss_pred hhhhhceeeEEEEeeeCcEEecccceEEEEeeccccCCCCccCC Q lcl|Aclame:pro 360 GAFETDTTKIRVIDRFDVKATDSEALVAGSFSAIADQVGNFKTT 403 (408) Q Consensus 360 ~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~~~~~~ 403 (408) +.++.+ -..--|..|-++..++.+....++...+.+.=- T Consensus 305 ~rie~y-----~s~Ne~YvVEd~~~~a~iE~i~v~~~~~~g~w~ 343 (343) T protein:vir:98 305 KAVRDS-----YYRNEAYAVEDCGKFMAVDFTKVKLSSGKGTWK 343 (343) T ss_pred ccccch-----hhhcceeeeeccccEEEeeeeeeeecCCCCCCC Confidence 222221 112235677777888777766665544322111 No 199 >protein:vir:79548 Length: 652 # NCBI annotation: putative protease/scaffold protein # Family: family:all:62 # ACLAME annotation(s): go:0008236 - serine-type peptidase activity; phi:0000017 - phage prohead/capsid assembly # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272518;genbank:gi:148609387;genbank:GeneID:5204384 Probab=92.13 E-value=0.013 Score=30.98 Aligned_cols=359 Identities=13% Similarity=0.079 Sum_probs=148.8 Q ss_pred CChHH-----HHHHHHHH-HHHHHHHHHHHHHHHHH-----------HHhhhcccHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MGVKL-----TVNQLNEA-WIASGDKVTDFNDQINM-----------ALNDDNFSAEAMSELKNKRDNEKVRRDALREQL 63 (408) Q Consensus 1 M~~~~-----~i~el~~~-~~~~~~~~~~~~~~~~~-----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 63 (408) ...+. +-+.++++ +.+.++++..+++-... 
.+.+...+.|+ .+..+ T Consensus 216 ~pvqAaAP~~De~airAq~~aeeraRi~~I~~l~a~Fggr~~~l~~~~l~d~~~s~e~-----------------ar~~i 278 (652) T protein:vir:79 216 TPVQAAAPVVDENSIRAQVLAEQKARVNGINDLFAMFGGRYQTLQAQCLADPECSLEQ-----------------AREKL 278 (652) T ss_pred ccccccCCcCchhHHHHHHHHHHHHHHHHHHHHHHhhccccchHHHHHhhccCCCHHH-----------------HHHHH Confidence 11110 11122221 12222222222221111 11111222111 11111 Q ss_pred H-HHHHHHhhhcccccccccccchhhhHHHHHHHH--------------------HHHh-----hcc---hh-hHHHHHH Q lcl|Aclame:pro 64 V-EAQAEQVVNMREEEKGPLNKSENELKDKFVKDF--------------------VNMV-----RNP---MA-FMNTVSS 113 (408) Q Consensus 64 ~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~--------------------~~~~-----~~~---~~-~~~~~~~ 113 (408) - .+....................+......+.++ .++. +.| .+ ....... T Consensus 279 l~~l~~~~~p~~~~~~~~~~~~~g~~~~d~~~~aL~~R~g~~~~~~~~~~~g~~L~elAr~~L~~~G~~~~~~~~~~~v~ 358 (652) T protein:vir:79 279 LNEMGRESTPSNKNTPAHIYAGNGNFVGDGIRQALMARAGFEKTERDNVYNGMTLREYARMSLTERGIGVSSYNPMQMVG 358 (652) T ss_pred HHHHHhhcCCCCCCcceeEeeccchhhHHHHHHHHHhhcCCcccccCccccCccHHHHHHHHHHhhccCCCCCCHHHHHH Confidence 1 110000000000000000000000001111111 0000 001 00 1111222 Q ss_pred HHhhccccccCceecchhhhhhhhhhhhh-hhhhhhhhceeecccCccceEEeeccCCccccchhcccccccccccccce Q lcl|Aclame:pro 114 KTETSGSDSAAGLTIPQDIRTMINTLVRQ-YDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLT 192 (408) Q Consensus 114 ~a~~~~t~~~gg~~vP~~~~~~ii~~~~~-~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~ 192 (408) ++.+ .++++=+.++-......+...-.. ....+.+|++.+++-..-.-.+. . +..+.---|.|+++++...... . 
T Consensus 359 ~A~~-hsTsDFp~IL~~~~nk~l~~~y~~a~~t~~~~~~~~~~~DFk~~~~~~-l-g~~~~L~~V~E~gEyk~~t~~e-~ 434 (652) T protein:vir:79 359 AAFT-HSTSDFGNILLDVANKAILQGWEDAPETYEQWTRKGQLSDFKIAHRVG-M-GGFSALRQVREGAEYKYVTTGD-K 434 (652) T ss_pred HHhh-cCcchHHHHHHHHHHHHHHHHHhhhHHHHHHHhccCCCccccccceee-c-CCCCCccccCCCCccceeeecC-c Confidence 3332 233332222222222222222222 22456777777665444322222 2 3345556689999997644322 4 Q ss_pred eeeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhccccc-------cc-----------hhhhhh Q lcl|Aclame:pro 193 IIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVMKAA-------PK-----------KPTIAK 254 (408) Q Consensus 193 ~v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~~g~g~~-------~~-----------~~~~~~ 254 (408) .-++.+.+++..+.||++++-.-+.+..+-|...++++.++.++..++.-.-++ .+ ..+..+ T Consensus 435 ~e~~~l~tyG~~~~iTRqaiINDDL~a~~~ip~~~g~aA~~~~~~~vy~~l~~Np~~~~DGk~LF~hA~H~Nl~~~aa~~ 514 (652) T protein:vir:79 435 QATIALATYGELFSITRQAIINDDLNMLTDVPMKLGRAAKSTIADLVYAILTSNPKISTDNVSLFDKAKHANVLESAAMD 514 (652) T ss_pred cceeeeecccCeeeeehheeeccchhHHHHHHHHHHHHHHHHHHHHHHHHHhcCcccccCCceeecccccccccccccCC Confidence 678999999999999999987667788888899999999999987654322111 11 011112 Q ss_pred HHHH---HHHHHHhhhh---hccCCCEEEEcHHHHHHHHhhhcccCceeeccccccCCcccccccceEeecccccccccc Q lcl|Aclame:pro 255 FDDV---ITMINTAVDP---AIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNSYLIKGKQVIVVADRWLPNTGS 328 (408) Q Consensus 255 ~d~i---~~~~~~~l~~---~~~~~a~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~ 328 (408) .+.+ ..+|...-+. -...+..||+++.......++-.+.. +-..+...+...-+.|+.-++++ ..+..... 
T Consensus 515 ~~~l~~ar~aM~~Qk~g~~~l~i~P~~llvp~~le~~a~~ll~s~~--v~~a~~~~~~~Np~~~~~~~i~e-prL~~~s~ 591 (652) T protein:vir:79 515 VASLDKARQLMRVQKEGERHLNIRPAFVLVPTAMESVANQVIRSSS--VKGADINAGIINPVKDFATVIAE-PRLDDNSQ 591 (652) T ss_pred HHHHHHHHHHHHHhccCCccccccccEEEecchhHHHHHHHhccCC--Ccccccccccccccccccccccc-cccCCCCc Confidence 2222 2222211111 22356678888887655554432211 11011112222223443222221 12222111 Q ss_pred CcceEEEEehhc------ceEeeeccceEEEEeccchhhhhhceeeEEEEeeeCcEEecccceEEEEe Q lcl|Aclame:pro 329 TVYPLYYGDMSQ------AITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSF 390 (408) Q Consensus 329 ~~~~~~~gd~~~------~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~ 390 (408) ...|+++-.. +|+... .+-.|+. ...|..+.+.|++...+|++++|-.++++.+- T Consensus 592 --~~wylaa~~~~dtiev~yL~G~-~~P~ie~----~~gf~~dG~~~kvrlD~G~~~iD~RG~~k~t~ 652 (652) T protein:vir:79 592 --TTFYLAASKGSDTIEVAYLNGV-DTPYIDQ----MEGFSVDGVTTKVRIDAGVAPVDHRGLVKCTA 652 (652) T ss_pred --ccEEEecCCCCCeEEEEEecCC-CCCeeee----cCCCCcceEEEEEEEeccCceeeccceeeecC Confidence 1122232111 133322 2334432 23499999999999999999999999888543 No 200 >protein:vir:861 Length: 318 # NCBI annotation: putative minor structural protein # Family: family:all:2417 # MgeID: mge:18 # MgeName: bIL170 # Cross-refs: genbank:acc:NP_047120;genbank:gi:9630573;genbank:GeneID:1261764 Probab=91.20 E-value=0.017 Score=30.28 Aligned_cols=287 Identities=14% Similarity=0.120 Sum_probs=122.8 Q ss_pred cchhhhHHHHHHHHHHHhhcchhhHHHH---HH-HHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeecccCc Q lcl|Aclame:pro 84 KSENELKDKFVKDFVNMVRNPMAFMNTV---SS-KTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSN 159 (408) Q Consensus 84 ~~~~~~~~~~~~a~~~~~~~~~~~~~~~---~~-~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~ 159 (408) ...--......-.|.+.+....+..... +. .+...-+-.+--+.+|..+...|...+....++.+...+... 
T Consensus 1 mtn~iesq~A~~eF~~vL~~N~G~S~~k~AW~A~L~E~GVtiTD~~~~LP~~lv~sI~~A~~n~n~v~~vfHVT~~---- 76 (318) T protein:vir:86 1 MTNFIESQNAVTEFFDVLKKNSGKSEIKNAWNAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNV---- 76 (318) T ss_pred CcchhhhhHHHHHHHHHHhccCCchhhhhhhhhhhhhcCceeeccchhccHHHHHHHHHhhhccCcceeeeeeccc---- Confidence 1111111222233444433322221110 11 111112223334678998888898888888888776555443 Q ss_pred cceEEeeccCCccccchhcccccccccccccceeeeechheeeeehHHHHHHHh---cchHHHHHHHHHHHHHHHH-HHH Q lcl|Aclame:pro 160 GSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLK---DTAENILAWLSSWIAKKVV-VTR 235 (408) Q Consensus 160 g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~---ds~~~~~~~v~~~l~~~~~-~~~ 235 (408) +.+.+.+.-.....+.-.-.|.+.++.+ .+|.--++.+--++....+ -++.. .+-..+..||..+|+.++. +.. T Consensus 77 ~~~~V~~s~~s~AeAq~HkdGqTK~eqa-~~~~~~Tl~~~~VY~~~S~-Ae~~K~~~~sYsel~N~i~~ELtQ~~vnk~V 154 (318) T protein:vir:86 77 GALLVSRSFDSSAEAQVHKDGQTKTEQA-ATLTIDTLEPVMVYKLQSL-AERVKRLQMSYSELYNLIVAELTQAIVNKIV 154 (318) T ss_pred hhhhhhhhhhhhhhhhhhccCCccccce-eeeeeechhHHHHHHHHHH-HHHHHHhhhhHHHHHHHHHHHHHHHHHHHHH Confidence 2333322222233434445566665533 3555555555544444444 22322 3444568899999999999 889 Q ss_pred HHHHhhccccccchhhhhhHH--H---------------HHHHHHHhhhhhccC--CCE-EEEcHHH-HHHHHhhhcccC Q lcl|Aclame:pro 236 NQAIIEVMKAAPKKPTIAKFD--D---------------VITMINTAVDPAIIA--TSS-LLTNQSG-LNKLALVKTAEG 294 (408) Q Consensus 236 ~~~~~~g~g~~~~~~~~~~~d--~---------------i~~~~~~~l~~~~~~--~a~-~~~n~~~-~~~l~~lkd~~G 294 (408) +.++.-|+|.++.......++ . ..+++..+++- .++ +.. +++.... .+-|..|+.+.. 
T Consensus 155 d~AlV~GDG~N~f~~~DK~advK~I~k~Ttkaksagttpfanaieeavdf-vrptagrrylivkaedrkalldelrqata 233 (318) T protein:vir:86 155 DLALVEGDGSNGFKSIDKEADVKKIKKITTKAKSAGTTPFANAIEEAVDF-VRPTAGRRYLIVKAEDRKALLDELRQATA 233 (318) T ss_pred HhhheeecCCCCccchhhHHHHHHHHHHhhhhhccCCCchhhHHHHHHhh-hccCCCceEEEEeecchHHHHHHHHhhcc Confidence 999999999876321111000 0 01111111110 111 111 2333222 222334432222 Q ss_pred c--eeeccccccCCcccccccceEeeccccccccccCcceEEEEehhcceEeeeccceEEEEeccchhhhhhceeeEEEE Q lcl|Aclame:pro 295 K--YLLEPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVI 372 (408) Q Consensus 295 ~--~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~ 372 (408) + .....+-+.- ..--|..-+++. .++ .+- .+-++.|-+ |.+ +-++++ ......|.+|.-.+.++ T Consensus 234 nahvriknddtei--asevgvdeiivy---tgs-kal-kptvlvdqk--yhi-dmqdlt----kvdafewktnsnmilve 299 (318) T protein:vir:86 234 NAHVRIKNDDTEI--ASEVGVDEIIVY---TGS-KAL-KPTVLVDQK--YHI-DMQDLT----KVDAFEWKTNSNMILVE 299 (318) T ss_pred cceeEEeccchhh--hhhcCcceeeee---ecc-ccc-cceeeeccc--eec-chhhhh----hhhcceeccCCceEEEe Confidence 1 1111110000 001111111100 000 000 111233322 111 112221 01112355666666676 Q ss_pred eeeCcEEecccceEEEEee Q lcl|Aclame:pro 373 DRFDVKATDSEALVAGSFS 391 (408) Q Consensus 373 ~r~d~~v~~~~a~~~l~~~ 391 (408) .--.|.+.--+|-+++++. 
T Consensus 300 tltsghvetynagavitvs 318 (318) T protein:vir:86 300 TLTSGHVETYNAGAVITVS 318 (318) T ss_pred ecccCcceeecCceeEEeC Confidence 6667777666777777666 No 201 >protein:vir:270 Length: 341 # NCBI annotation: putative major capsid protein # Family: family:all:201 # MgeID: mge:7 # MgeName: K139 # Cross-refs: genbank:acc:NP_536650;genbank:gi:17975128;genbank:GeneID:929084 Probab=91.19 E-value=0.017 Score=30.28 Aligned_cols=291 Identities=12% Similarity=0.096 Sum_probs=144.5 Q ss_pred cccchhhhHHHHHHHHHHHhhcchhhHHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeecccCccc Q lcl|Aclame:pro 82 LNKSENELKDKFVKDFVNMVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGS 161 (408) Q Consensus 82 ~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~ 161 (408) ++. .....-+..|..++..-... ....+.+..+.|-+.....+.+.+.+.+.+++.++++++....|. T Consensus 1 m~~---~m~~~tr~~~~~y~~~~A~~---------ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~Invv~V~e~~Ge 68 (341) T protein:vir:27 1 MSQ---ILTQSAREYMDNFAQQLAKS---------YGVSNVAELFNVSPQLETKLRAAITESAEFLKMITVTTVDQIEGQ 68 (341) T ss_pred Ccc---cccHHHHHHHHHHHHHHHHH---------cCcccccceEeecHHHHHHHHHHHHhhHHhhhcCccccccceeee Confidence 221 12233334444444332111 111123446788888999999999999999999999999888887 Q ss_pred eEEeeccCCccccchhcccccccccccccceeeeechheeeeehHHHHHHHhcch-----HHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 162 RVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTA-----ENILAWLSSWIAKKVVVTRN 236 (408) Q Consensus 162 ~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~-----~~~~~~v~~~l~~~~~~~~~ 236 (408) ....... .+.++ ....+..+. 
.+.++...+...+.---..|+.+.|+..+ .+|+.-+.+.+.++++.=.- T Consensus 69 ~v~lg~~--g~iag-rtdt~R~~r--~~~l~~~~Y~c~qtn~dt~i~y~~lDaWA~~g~~~dF~~r~~~~i~~~~ALD~i 143 (341) T protein:vir:27 69 VVDVGVS--GLYTG-RKAGGRFTK--QVGVGGHKYKLAETDSCAAITWAMLCQWANQGGRDQFMKHLTEFSNQMFALDIM 143 (341) T ss_pred Eeecccc--cceee-ccCCCceec--ccccCCcceEEEEeeeeeeecHHHHHHHHhcCCChHHHHHHHHHHHHHHhhhhh Confidence 6543221 12222 111222221 12344444444444444555566665433 67888888888888876666 Q ss_pred HHHhhccccccc--------------------------------------hhhhhhHHHHHH-HHHHhhhhhccC--CCE Q lcl|Aclame:pro 237 QAIIEVMKAAPK--------------------------------------KPTIAKFDDVIT-MINTAVDPAIIA--TSS 275 (408) Q Consensus 237 ~~~~~g~g~~~~--------------------------------------~~~~~~~d~i~~-~~~~~l~~~~~~--~a~ 275 (408) .-.++|+..+.. .+.-.+.|.++. +++..+++.++. .-+ T Consensus 144 ~IGfnGts~A~~Td~~anPllqDVNkGWlQ~~Re~a~~rVl~~~~~~~g~~gdy~nLDAlV~D~~~~lI~~~~~~d~dLV 223 (341) T protein:vir:27 144 RIGWNGVSAEADTDPSANPLGQDVNEGWIAFVKNRKASQVVDVDVYFDETNGDYRTLDAMASDIINNQIHPMFRNDPRLT 223 (341) T ss_pred hhcccceeeccCCChhhcccccccchhHHHHHHhhcccceeccceeeccCCCccccHHHHHHHHHhcccChHHhcCCCEE Confidence 666777653211 112234565544 345567787765 457 Q ss_pred EEEcHHHHHH-HHhhhcccCce--eeccccccCCcccccccceEeeccccccccccCcceEEEEehhcceEeeeccceEE Q lcl|Aclame:pro 276 LLTNQSGLNK-LALVKTAEGKY--LLEPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSL 352 (408) Q Consensus 276 ~~~n~~~~~~-l~~lkd~~G~~--~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i 352 (408) +||.++.++. -..|-+....| ....... ..+|-|+|.+.++. +|... 
+++=-|++.-..+..+..+= T Consensus 224 vivG~dLla~k~~~l~n~~~~ptE~~Aa~~i---~k~iGGlpa~~~Pf--fP~~~-----~lVT~L~NLsIY~Q~gs~RR 293 (341) T protein:vir:27 224 VFVGSGLIGAAQAKLYDKADKPSEQIAAQKL---DKTIAGRPAYVPPF--LPDNA-----MVVTIPENLQVLTQHGTAQR 293 (341) T ss_pred EEEchhhhhhhhhhhhccCCCCHHHHHHHHH---HHhhCCCeEEEccc--cCCCc-----eEEeeccceEEEEecCcEEE Confidence 8888877652 22222211111 0000111 25899999998653 55533 45555555444344443333 Q ss_pred EEec-cchhhhhhceeeEEEEeeeCcEEecccceEEEEeeccccCCCCccCCCccc Q lcl|Aclame:pro 353 LPTN-IGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFSAIADQVGNFKTTTSTA 407 (408) Q Consensus 353 ~~~~-~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~~~~~~~~~~ 407 (408) .+.+ ...+.++.++= +..|-+-.+|..+.++.+.-+.+.---++-+- T Consensus 294 ~~~d~p~r~rie~yes--------~YvVEdyg~~~~~~~~~vkl~~~~~~~~~~~~ 341 (341) T protein:vir:27 294 KAKHESDRKRSKTHTG--------AWKVTQWVCWKRSPLTTQKKSTSALNHRSERN 341 (341) T ss_pred EEEeccccccccchhh--------hheeehhhhhhhccccccccCccccccccccC Confidence 3222 12222332211 23333334444433333332222211111111 No 202 >protein:vir:3643 Length: 336 # NCBI annotation: gp12 # Family: family:all:1653 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705638;genbank:gi:23752323;genbank:GeneID:955719 Probab=90.91 E-value=0.017 Score=30.33 Aligned_cols=296 Identities=6% Similarity=-0.085 Sum_probs=120.3 Q ss_pred HHHHHHHHHHHHHHHhhhcccccccccccchhhhHHHHHHHHHHHhhcchhhHHHHHHHHhhc-cccccCceecchhhhh Q lcl|Aclame:pro 56 RDALREQLVEAQAEQVVNMREEEKGPLNKSENELKDKFVKDFVNMVRNPMAFMNTVSSKTETS-GSDSAAGLTIPQDIRT 134 (408) Q Consensus 56 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~a~~~-~t~~~gg~~vP~~~~~ 134 (408) +++. +.+.+++. -... ..........+ ....++ .. +... 
+..+.+..-||..+.+ T Consensus 1 ~~~~-~~~~~l~~------~gi~---~~~~~~~~~~~-------~~~~~~------da-~d~~~~~~~~~~~~~~~~l~~ 56 (336) T protein:vir:36 1 MRDA-QRIQNLAR------AGVI---LPRSVQNVSTP-------LTEYAM------DA-ADLSPHLSSTGSSGIPNYLTT 56 (336) T ss_pred CchH-HHHHHHhh------cCee---ecchhhhhhhH-------HHHhhh------hh-hhccCccccCCCcchHHHHHH Confidence 0000 00000000 0000 00000000000 000000 00 0001 1111112235665543 Q ss_pred ----hhhhhhhhhhhhhhhhceeecccCccceEEeeccCCccccchhcccccccccccccceeeeechheeeeehHHH-H Q lcl|Aclame:pro 135 ----MINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITAT-N 209 (408) Q Consensus 135 ----~ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS-~ 209 (408) .+++.+........++.+.+...-.-...........+.+.+.+.+...|-.+ ......+-+.+.++....++ . T Consensus 57 ~i~p~~~~~~~~~~~~~~l~pv~t~g~W~~~~~~~~~~e~~G~a~~ygd~~D~P~~d-~~~~~~~~~v~~~~~g~~yg~~ 135 (336) T protein:vir:36 57 YVDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTKVATYGDYSSDGDSG-ANINYPQRQSYFFQTWTRWGER 135 (336) T ss_pred hhccceEeeecchhhhhhhccccccCCccceeEEEeeeeceeeEEEeeccCCCceee-cccceeeeeEEEEEeeeeeCHH Confidence 56666666666666666544321110122223334456667778888887544 34556666777777777777 5 Q ss_pred HHHhc--chHHHHHHHHHHHHHHHHHHHHHHHhhcccccc------------------chhhhhhHHHHHHHHH---Hhh Q lcl|Aclame:pro 210 TSLKD--TAENILAWLSSWIAKKVVVTRNQAIIEVMKAAP------------------KKPTIAKFDDVITMIN---TAV 266 (408) Q Consensus 210 ell~d--s~~~~~~~v~~~l~~~~~~~~~~~~~~g~g~~~------------------~~~~~~~~d~i~~~~~---~~l 266 (408) |+..- ...++.+--....++++.+.+|+-.+.|+.... +....++++.+++.+. 
..+ T Consensus 136 E~~~Aa~~~~~l~~~Ka~aA~~ale~~~N~i~~~Gd~~~~~yGllNdP~l~a~~t~~t~~~~~~t~~ei~~Di~~~~~~l 215 (336) T protein:vir:36 136 ELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQVL 215 (336) T ss_pred HHHHHHHhCCCcHHHHHHHHHHHHHHhhCcEEEEeccccceEEEEecCCCccccccCCCcccccCHHHHHHHHHHHHHHH Confidence 55432 224566666777777777888876676665321 1111122333333222 112 Q ss_pred hhhc------cCCCEEEEcHHHHHHHHhhhcccCceeeccccccCCcccccccceEeeccccccccccCcceEEEEehhc Q lcl|Aclame:pro 267 DPAI------IATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQ 340 (408) Q Consensus 267 ~~~~------~~~a~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~ 340 (408) ...- .....++++++.+..|..- +..|.-++. -+... +-++.+... ..+.+. ++....++.+-.. T Consensus 216 ~~qt~G~i~~~~~~tL~LP~~~~~~Ls~~-n~~g~Tvl~-~lk~n----~Pnl~i~t~--pEl~~a-~g~~~~l~~~~~~ 286 (336) T protein:vir:36 216 QTQSQGIITQEDVLRMGLPPTAMSDLSKT-NQYGLAAAA-KLKDI----FPKLEFVTI--PEYDTA-SGRLVQLWAPRVE 286 (336) T ss_pred HHhcCCeeeeccccEEEechHHHHhccCC-CccCccHHH-HHHHh----cCccEEEEc--cccccC-CCceEEEEEEecC Confidence 1111 1244689999988877532 333322221 01111 111112111 112221 2222222222100 Q ss_pred ceEeeeccceEEEEeccch----hhhhhceeeEEEEeeeC-cEEecccceEEEEee Q lcl|Aclame:pro 341 AITLFDRENMSLLPTNIGA----GAFETDTTKIRVIDRFD-VKATDSEALVAGSFS 391 (408) Q Consensus 341 ~~~~~~~~~~~i~~~~~~~----~~f~~~~~~~r~~~r~d-~~v~~~~a~~~l~~~ 391 (408) ...-..+.+ +... 
.....-.+..-+..|.+ +.+.+|.||+.++.- T Consensus 287 -----~~~t~~~~~-p~~~~~l~vq~~~~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 336 (336) T protein:vir:36 287 -----GKDTATCGF-TEKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) T ss_pred -----CCcceeeec-chhhhccceeecCceeEeccccceeeeeeeccchheeeecC Confidence 000011111 0000 00011123334555654 578899999999866 No 203 >protein:vir:1781 Length: 221 # NCBI annotation: minor capsid protein # Family: family:all:975 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570347;genbank:gi:18640506;genbank:GeneID:932719 Probab=90.61 E-value=0.02 Score=29.90 Aligned_cols=182 Identities=9% Similarity=-0.020 Sum_probs=84.1 Q ss_pred eeeehHHHHHHHh-----cchHHHHHHHHHHHHHHHHHHHHHHHhhccc----------------cccc-hhhhhh---- Q lcl|Aclame:pro 201 YAGIITATNTSLK-----DTAENILAWLSSWIAKKVVVTRNQAIIEVMK----------------AAPK-KPTIAK---- 254 (408) Q Consensus 201 ~~~~~~iS~ell~-----ds~~~~~~~v~~~l~~~~~~~~~~~~~~g~g----------------~~~~-~~~~~~---- 254 (408) +-+ .-+|.-++. ++..++.+...+++.++++...|+.++.-.- +... .....+ T Consensus 1 iD~-lL~a~~~VdDiD~aqa~~dvr~e~t~e~G~ALA~~~D~~i~~~~~~aA~~~~p~~~~~~g~~~~~~a~~t~~~~~l 79 (221) T protein:vir:17 1 MDD-LLVASQFVYDLDEILAQWNTRSEISKQIGEALAIHYDERIARVLASASIAAAPVTGQDGGFSVNIGAGNTNNAQAI 79 (221) T ss_pred CCc-chhHHHHHHhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCcccccccCcceeccccccCCHHHH Confidence 111 223444443 3567899999999999999999988754211 1000 111122 Q ss_pred HHHHHHHHHHhhhhhccC--CCEEEEcHHHHHHHHhhhcc-cCceeecc---ccccC-CcccccccceEeeccccccccc Q lcl|Aclame:pro 255 FDDVITMINTAVDPAIIA--TSSLLTNQSGLNKLALVKTA-EGKYLLEP---DPTKP-NSYLIKGKQVIVVADRWLPNTG 327 (408) Q Consensus 255 ~d~i~~~~~~~l~~~~~~--~a~~~~n~~~~~~l~~lkd~-~G~~~~~~---~~~~~-~~~~l~G~pv~~~~~~~~~~~~ 327 (408) ++.++++. ..|+..+.+ +-.++++|..|..|.+-.|. -..+-+.. .+.++ .-.++.|++|+.+. .+|... 
T Consensus 80 ~dai~~a~-~~LdekdVP~~gR~~vv~P~~y~~LL~~~d~~~~n~d~~~s~g~~~~g~~i~~v~G~~V~~Sn--nlP~~~ 156 (221) T protein:vir:17 80 VDGFFEAA-AVLDERSAPMDGRVAVLSPRQYYSLISSVDTNILNREIGNTQGDMNTGKGLYVNAGIRIYKSN--VLASLY 156 (221) T ss_pred HHHHHHHH-HHHhhcCCCCCCCEEEeCcHHHHHHHHhcCcceeeeecccccccccccceeeeecCcEEEEec--cCCccc Confidence 23344433 446555544 33466799887776542221 11111211 12222 23579999998864 467654 Q ss_pred cCcceEEEEehhcceEeeeccceEEEEeccchhhhhhceeeEEEEeeeCcEEecccceEEEEeeccccCCCCccCCCccc Q lcl|Aclame:pro 328 STVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFSAIADQVGNFKTTTSTA 407 (408) Q Consensus 328 ~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~~~~~~~~~~ 407 (408) +.+....-|+|... ....+.+... |.+ .=+.+.+|+|+..+++-..-.-++.. -+-=+ T Consensus 157 gt~~~~~ag~~~~~--~~~~~~yr~~--------fs~----------~~glv~~~~Avgtvkl~~~~~~~~~~--~~~~~ 214 (221) T protein:vir:17 157 GTNLVTDPGDATTS--GENNGSYRPA--------ITD----------RAGLVFHKEAADTVEVLLPPSRPPLV--ISMFS 214 (221) T ss_pred ccccccCCcccccc--cccccccccc--------ccc----------eEEEEEcchheeeeeeecCCCCCcee--eeeee Confidence 44322222333210 0111111111 111 12567788887766644322211110 00001 Q ss_pred C Q lcl|Aclame:pro 408 V 408 (408) Q Consensus 408 ~ 408 (408) + T Consensus 215 ~ 215 (221) T protein:vir:17 215 I 215 (221) T ss_pred c Confidence 1 No 204 >protein:vir:78387 Length: 349 # NCBI annotation: putative coat protein # Family: family:all:1522 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110837;genbank:gi:134288598;genbank:GeneID:5179650 Probab=89.85 E-value=0.024 Score=29.46 Aligned_cols=278 Identities=8% Similarity=-0.029 Sum_probs=109.1 Q ss_pred hhccccccCceecch--hhhhhhhhhhhhhhhhhhhhce---------eecccCccceEEeeccCCcccc-chh-ccccc Q lcl|Aclame:pro 116 ETSGSDSAAGLTIPQ--DIRTMINTLVRQYDSLQQYVRV---------ESVSTSNGSRVYEKWTDVTPLT-VMD-AEDGK 182 (408) Q Consensus 116 ~~~~t~~~gg~~vP~--~~~~~ii~~~~~~~~l~~~~~~---------~~~~~~~g~~~~~~~~~~~~~~-~~~-~E~~~ 182 (408) |..+.-++ ..||. 
.|...+.+...+.+.|.+-.=+ ....+....+|+....++.... .|. +.... T Consensus 1 Ma~T~l~D--~iipe~~vf~~Yv~~~~~e~~~l~qSGii~~d~~l~~~~~~gG~~~~iPf~~~L~g~~e~nv~~D~~~~~ 78 (349) T protein:vir:78 1 MAITTIGD--IVTGNIPVLASYMTEDPVEKTAFFDSGILTSTPYAAEIANGPSNIANLPFWKAIDTSIEPNYSNDVYQDI 78 (349) T ss_pred CCceEEee--eeccCHHHHHHHHHHhhHHhhhhhhccceeccHHHHHHhhcCCCEEEeeeeecCCCCcccccCCCCcccc Confidence 22222121 33444 2444444444344433221000 0111222233333222221111 111 11111 Q ss_pred ccccccccceeeeechheeee--ehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhccc--------ccc----- Q lcl|Aclame:pro 183 IPDLDNPQLTIIKYLIKRYAG--IITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVMK--------AAP----- 247 (408) Q Consensus 183 ~~~~~~~~f~~v~~~~~~~~~--~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~~g~g--------~~~----- 247 (408) .+.....+..++-...+.-.+ .-.++.++-- .|..+.|.+++++...+...+.++.... ++. T Consensus 79 ~t~~kitt~~~~a~~~~r~kaw~~~Dla~~lsG---~dpm~~Ia~~va~yW~r~~q~~Lia~L~Gvf~~~~~a~~~~~~~ 155 (349) T protein:vir:78 79 ATPRAIQTGEMMARVAYLNEGFGQADLTVELTS---QNPLQSVASRLDNFWQRQAQRRLIATALGLYNDNVSATDAYHEQ 155 (349) T ss_pred cccccccccceeeeeeeeccccchhHHHHHhhC---chHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcccccccchhhhc Confidence 111122233343333333333 3456766643 3567778888877666665554433211 000 Q ss_pred -------chhhhhhHHHHHHHHHHhhhh----hccCCCEEEEcHHHHHHHHhhhcccCceeeccccccCCcccccccceE Q lcl|Aclame:pro 248 -------KKPTIAKFDDVITMINTAVDP----AIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNSYLIKGKQVI 316 (408) Q Consensus 248 -------~~~~~~~~d~i~~~~~~~l~~----~~~~~a~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~l~G~pv~ 316 (408) ......+...++++....-+. 
....-..++||+.++..|++++-= .|+ ++.-....-.+++|++|+ T Consensus 156 ~~~t~d~s~~a~~~~~~~~dA~~~lgda~~Gd~~~~lt~i~mHS~v~~~L~~~~li--~~i-~~s~~~~~i~ty~G~~Vi 232 (349) T protein:vir:78 156 NDMVVDVSATLGFDAGAFIDATQTMGDALMGNGGEVLGAIAMHSFVYAQARKAQLI--DFI-RDAENNTMFATYQGYRVI 232 (349) T ss_pred ccceeeeccccCCChhhhhhhHHHHHHHhccccccceeEEEEchHHHHHHHhhhhh--hhc-cCcccCcccceecCeEEE Confidence 011123445555554332111 123345799999999998765310 111 111111123689999999 Q ss_pred eeccccccccccC-c-ceEEEEehhcceEeeecc---ceEEEEeccchhhhhhceeeEEEEeeeCcEEecccceEEEEee Q lcl|Aclame:pro 317 VVADRWLPNTGST-V-YPLYYGDMSQAITLFDRE---NMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFS 391 (408) Q Consensus 317 ~~~~~~~~~~~~~-~-~~~~~gd~~~~~~~~~~~---~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~ 391 (408) +.|..++...++. . .+.+||. .++...+.. .++..+++...+. .++..+....| .+++|.+|....-. T Consensus 233 vDD~~Pv~~~g~~~~yttylfg~--GAi~~~~~~~~~~~et~rd~~~g~~--~G~d~l~~R~~---~~~hp~G~s~~~a~ 305 (349) T protein:vir:78 233 VDDSMTVVGQGAQRKFISIIFGQ--GAIGYGEGNPVMPLEYEREASRANG--GGVETLWTRKT---WLLHPFGYRFTSAV 305 (349) T ss_pred EeCCCccccCCCCceEEEEEeec--ceEEEccCCCccceeeecccccCCc--ceeEEEEEeeE---EEeeeeeeeecccc Confidence 9776443322111 1 2245663 223222212 2454454433221 23333433333 35677777664321 Q ss_pred ccccC---CCCccC----CCcccC Q lcl|Aclame:pro 392 AIADQ---VGNFKT----TTSTAV 408 (408) Q Consensus 392 ~~~~~---~~~~~~----~~~~~~ 408 (408) ...+. 
-..+++ .++..- T Consensus 306 v~~~~~~~~~~sPt~aeLa~~~NW 329 (349) T protein:vir:78 306 ITGNGTETIARSASWQDLANATNW 329 (349) T ss_pred ccCCccccccCCCChHHhcCCcCc Confidence 11000 000111 000000 No 205 >protein:vir:94870 Length: 318 # NCBI annotation: putative structural protein # Family: family:all:2417 # MgeID: mge:1532 # MgeName: P008 # Cross-refs: genbank:acc:YP_762518;genbank:gi:115304217;genbank:GeneID:5141183 Probab=89.06 E-value=0.029 Score=29.05 Aligned_cols=287 Identities=13% Similarity=0.099 Sum_probs=132.4 Q ss_pred cchhhhHHHHHHHHHHHhhcchhhHHH----HHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeecccCc Q lcl|Aclame:pro 84 KSENELKDKFVKDFVNMVRNPMAFMNT----VSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSN 159 (408) Q Consensus 84 ~~~~~~~~~~~~a~~~~~~~~~~~~~~----~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~ 159 (408) ....-........|.+.++...+...- ....+.+..+-.+.-+.+|..+...|-..+...+|+.....+.++ T Consensus 1 mtnfiesqnavteffdvlkknsgkseiknawnaklaengvtitdttfqlprklvesintallntnpvfkvfhvtnv---- 76 (318) T protein:vir:94 1 MTNFIESQNAVTEFFDVLKKNSGKSEIKNAWNAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNV---- 76 (318) T ss_pred CccchhhhhhHHHHHHHHhcccChhhhhhhhhhhhhhCCceeecchhhhHHHHHHhhhhhhccCCcceeeeeehhh---- Confidence 111111122233455555443322111 111122223334445677888888888878778887766555544 Q ss_pred cceEEeeccCCccccchhcccccccccccccceeeeechheeeeehHHHHHH--HhcchHHHHHHHHHHHHHHHHHHH-H Q lcl|Aclame:pro 160 GSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTS--LKDTAENILAWLSSWIAKKVVVTR-N 236 (408) Q Consensus 160 g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~el--l~ds~~~~~~~v~~~l~~~~~~~~-~ 236 (408) |.+.+.+.-+.+..+....+|.+.++ ...++.--++.|--++.+..+.... 
|++|...+...|..+|..++..++ | T Consensus 77 gallvsrsfdssneaqvhkdgqtkte-qaatltidtlepvmvyklqslaervkrlqmsyselynlivaeltqaivnkivd 155 (318) T protein:vir:94 77 GALLVSRSFDSSNEAQVHKDGQTKTE-QAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVD 155 (318) T ss_pred hheeeeccccccchhhhhcccccccc-cceeeeecccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHhhhhh Confidence 34444443344444444566666654 3456776677777777666555544 556666788888888888888774 6 Q ss_pred HHHhhccccccchhhhhh---------------------HHHHHHHHHHhhhhhccCCCEEEEcHHH-HHHHHhhhcccC Q lcl|Aclame:pro 237 QAIIEVMKAAPKKPTIAK---------------------FDDVITMINTAVDPAIIATSSLLTNQSG-LNKLALVKTAEG 294 (408) Q Consensus 237 ~~~~~g~g~~~~~~~~~~---------------------~d~i~~~~~~~l~~~~~~~a~~~~n~~~-~~~l~~lkd~~G 294 (408) -++..|+|+++....... +|.|-.+.-...+.+.+. .+++.... .+-|..|+.+.. T Consensus 156 lalvegdgtngfksidkeadvkkikkittkaksagktpfadaieeavdfvrptagrr--ylivktedrkalldelrqata 233 (318) T protein:vir:94 156 LALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPTAGRR--YLIVKTEDRKALLDELRQATA 233 (318) T ss_pred eeeeecCCcchhhhhchhhhHHHHHHhhhhhhhcCCCchhHHHHHHHhhhccCCCce--EEEEeccchHHHHHHHHhhhc Confidence 778889988764332211 112222221111111111 12333333 233344432222 Q ss_pred c--eeeccccccCCcccccccceEeeccccccccccCcceEEEEehhcceEeeeccceEEEEeccchhhhhhceeeEEEE Q lcl|Aclame:pro 295 K--YLLEPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVI 372 (408) Q Consensus 295 ~--~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~ 372 (408) + .....+-+.- ..--|..-+++. 
.++ .+-+ +-++.|-+ |.+ +-++++- .....|.+|.-.+.++ T Consensus 234 nanvriknddtei--asevgvdeiivy---tgs-kavk-ptvlvdqk--yhi-dmqdltk----vdafewktnsnmilve 299 (318) T protein:vir:94 234 NANVRIKNDDTEI--ASEVGVDEIIVY---TGS-KAVK-PTVLVDQK--YHI-DMQDLTK----VDAFEWKTNSNMILVE 299 (318) T ss_pred ccceEEeccchhh--hhhcCcceeEEe---ecc-cccc-ceeEeccc--eec-chhhhhh----hhceeeccCCceEEEE Confidence 1 1111110000 011122111110 011 1111 12333322 211 2222220 1112355666666676 Q ss_pred eeeCcEEecccceEEEEee Q lcl|Aclame:pro 373 DRFDVKATDSEALVAGSFS 391 (408) Q Consensus 373 ~r~d~~v~~~~a~~~l~~~ 391 (408) .--.|.+.--+|-+++++. T Consensus 300 tltsghvetynagavitvs 318 (318) T protein:vir:94 300 TLTSGHVETYNAGAVITVS 318 (318) T ss_pred ecccCcceeecCceeEEeC Confidence 6667777666777777666 No 206 >protein:vir:174 Length: 423 # NCBI annotation: capsid protein # Family: family:all:1412 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112079;genbank:gi:13559869;genbank:GeneID:920999 Probab=88.40 E-value=0.033 Score=28.74 Aligned_cols=283 Identities=11% Similarity=0.030 Sum_probs=109.2 Q ss_pred hhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeecccC----cc-ceEEeeccCCccc--cchhccccccccccc Q lcl|Aclame:pro 116 ETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTS----NG-SRVYEKWTDVTPL--TVMDAEDGKIPDLDN 188 (408) Q Consensus 116 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~----~g-~~~~~~~~~~~~~--~~~~~E~~~~~~~~~ 188 (408) |.. +- -..+|+.+..+.++.+++..++.+++..-.-... .| ++.|+........ ..+.+.+....+... 
T Consensus 1 MaN-~l---lT~ip~iia~~al~~l~~~lV~~~lVnr~y~~e~~~~k~GDTV~I~~p~~~~~~~~~~~~~~~~~~~~l~e 76 (423) T protein:vir:17 1 MPN-NL---DSNVSQIVLKKFLPGFMSDLVLAKTVDRQLLAGEINSSTGDSVSFKRPHQFSSLRTPTGDISGQNKNNLIS 76 (423) T ss_pred Ccc-ch---hhhhHHHHHHHHHHHHHhhcccchhhcccCCcchhhcccCCEEEEeeCCcceeecccCcccCCcccCcccc Confidence 221 10 1137999999999999999988777655331111 11 3333322211111 111111112111111 Q ss_pred ccceeeeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhcc-cc-----ccchhhhhhHHHHHHHH Q lcl|Aclame:pro 189 PQLTIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVM-KA-----APKKPTIAKFDDVITMI 262 (408) Q Consensus 189 ~~f~~v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~~g~-g~-----~~~~~~~~~~d~i~~~~ 262 (408) .--.++++-+|...+--=..|+. ....++++++... .++++..+|..++.-. +. +.+......+++++++- T Consensus 77 -~~v~l~id~~k~va~~v~d~E~~-~~i~~~~~~l~~A-~~aLA~~vd~~ia~~~~~~a~~~~gt~~t~~~a~~~i~~a~ 153 (423) T protein:vir:17 77 -GKATGRVGNYITVAVEYQQLEEA-IKLNQLEEILAPV-RQRIVTDLETELAHFMMNNGALSLGSPNTPITKWSDVAQTA 153 (423) T ss_pred -ceeEEEeeceeeeeeeecHHHHh-cChhHHHHHHHHH-HHHHHHHHHHHHHHHHhhccccccccCCcccccHHHHHHHH Confidence 11246777777777665556654 3455687777555 5788888887775431 11 11222223466665543 Q ss_pred HHhhhhhccC--CCEEEEcHHHHHHHHhhhc--ccCceeeccccccCC-cccccccceEeeccccccccccCcc--eEE- Q lcl|Aclame:pro 263 NTAVDPAIIA--TSSLLTNQSGLNKLALVKT--AEGKYLLEPDPTKPN-SYLIKGKQVIVVADRWLPNTGSTVY--PLY- 334 (408) Q Consensus 263 ~~~l~~~~~~--~a~~~~n~~~~~~l~~lkd--~~G~~~~~~~~~~~~-~~~l~G~pv~~~~~~~~~~~~~~~~--~~~- 334 (408) ..|+....+ +...+++|..++.|.+-.. ..........+.++. .+++.|+.|+.+.+ +|....+.. .+. 
T Consensus 154 -~~Ld~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~~alr~g~i~G~i~GFdvy~Snn--ip~~T~gt~~~t~~~ 230 (423) T protein:vir:17 154 -SFLKDLGVNEGENYAVMDPWSAQRLADAQTGLHASDQLVRTAWENAQIPTNFGGIRALMSNG--LASRTQGAFGGTLTV 230 (423) T ss_pred -HHHHhccCCcCCCEEEeChHHHHHHhccccceecccccchHHHhhccceeeecceEEEEeCC--Cccccccceeceeee Confidence 335444433 4567899999887653210 000111111233333 25899999887543 442211110 000 Q ss_pred -EEeh-hcceE-ee--eccceEEEEeccchhhhhhceeeEEE---EeeeCcEEe------cccceEE------------- Q lcl|Aclame:pro 335 -YGDM-SQAIT-LF--DRENMSLLPTNIGAGAFETDTTKIRV---IDRFDVKAT------DSEALVA------------- 387 (408) Q Consensus 335 -~gd~-~~~~~-~~--~~~~~~i~~~~~~~~~f~~~~~~~r~---~~r~d~~v~------~~~a~~~------------- 387 (408) .+.. ..... .. ...++...+....+..-.-|.+.|-+ ..+....++ +++-|.+ T Consensus 231 ~~~~~v~~~a~~~~~~~~~~~~~~~~~~~g~l~~GD~~t~aGv~~v~~~tk~v~~~~~t~~~~~~~v~~~~~~~a~~~~t 310 (423) T protein:vir:17 231 KTQPTVTYNAVKDSYQFTVTLTGATTSVTGFLKAGDQVKFTNTYWLQQQTKQALYNGATPISFTATVTADANSDSSGDVT 310 (423) T ss_pred cccccccccccccccceeeeeeeeeeeccCceeecceEEecceeeecccccccccccccccceEEEEEecccccccCceE Confidence 0000 00000 00 00000101100000000011111111 000000000 1111111 Q ss_pred EEeecc-----ccCCCCccCC---CcccC Q lcl|Aclame:pro 388 GSFSAI-----ADQVGNFKTT---TSTAV 408 (408) Q Consensus 388 l~~~~~-----~~~~~~~~~~---~~~~~ 408 (408) +++.+. +..+..+++. 
++.+| T Consensus 311 v~i~p~~i~~~~~~~~~~v~a~~a~~~~v 339 (423) T protein:vir:17 311 VTLSGVPIYDTTNPQYNSVSRQVAAGDAV 339 (423) T ss_pred EEecCccccccCCcccccceecccCCcee Confidence 111110 0000000100 01111 No 207 >protein:vir:95512 Length: 693 # NCBI annotation: Putative Clp protease # Family: family:all:62 # ACLAME annotation(s): go:0008236 - serine-type peptidase activity; phi:0000017 - phage prohead/capsid assembly # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293349;genbank:gi:148912770;genbank:GeneID:5228164 Probab=88.31 E-value=0.033 Score=28.70 Aligned_cols=377 Identities=13% Similarity=0.048 Sum_probs=148.7 Q ss_pred CChHHHH-----HHHHHHHHHHHHHHHHH---HH-HHHHHHhhhcccHHHHHH-HHHHHHHHHH---------HHHHHHH Q lcl|Aclame:pro 1 MGVKLTV-----NQLNEAWIASGDKVTDF---ND-QINMALNDDNFSAEAMSE-LKNKRDNEKV---------RRDALRE 61 (408) Q Consensus 1 M~~~~~i-----~el~~~~~~~~~~~~~~---~~-~~~~~~~~~~~~~~~~~~-~~~~~~~~~~---------~~~~~~~ 61 (408) ...+..+ .+.++.++.+....... .. -..+.+.+...+.++.++ +-+.+..-.. .+..-+. T Consensus 258 ap~~adirA~~~aae~~r~aaI~a~fa~f~~~~a~l~a~~l~d~~~s~d~ar~~lL~~l~~~~~p~~~~~~~~~~~~~~g 337 (693) T protein:vir:95 258 APTEADIRARILAEESGRRSAITAAFGAFSTGHAELLATCLNDMNITVDQAREKLLAAIGADTQPAAALSAGAHIHAGNG 337 (693) T ss_pred CCCcchhhHHHHHHHHHHHHHHHHHHHhccCChHHHHHHHHhhcCCCHHHHHHHHHHHHhhccCCCCCcCcCccccCCch Confidence 2222111 11111222222111111 11 112222333344333221 1111100000 0000000 Q ss_pred HH-H-HHHHHHhhhc-ccccccccccchhhhHHHHHHHHHHHhhcchh-hHHHHHHHHhhccccccCceecchhhhhhhh Q lcl|Aclame:pro 62 QL-V-EAQAEQVVNM-REEEKGPLNKSENELKDKFVKDFVNMVRNPMA-FMNTVSSKTETSGSDSAAGLTIPQDIRTMIN 137 (408) Q Consensus 62 ~~-~-~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~-~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii 137 (408) .+ . ..+..-.... .........-......+.-+..+...-..... 
.......++.+ .++++=+.++-......++ T Consensus 338 ~~~~d~~~~al~~R~g~~~~~~~n~~~g~~L~elAr~~L~~rg~~~~~~~~~~~~~~a~~-htTSDFp~IL~~~~nk~l~ 416 (693) T protein:vir:95 338 NLVGDSVRASVLARIGRGERQADNAYNGMTLRELARASLVDRGIGVASLNAPQMVGLAFT-HTSSDFGLILLDVANKSVL 416 (693) T ss_pred hHHHHHHHHHHHHhcCcccccCCccccCCcHHHHHHHHHHhcCCccCCCCHHHHHHHHHh-cCcchhHHHHHHHHHHHHH Confidence 00 0 0000000000 00000000000001111111111000000000 11122233333 3334332222222222222 Q ss_pred hhhh-hhhhhhhhhceeecccCccceEEeeccCCccccchhcccccccccccccceeeeechheeeeehHHHHHHHhcch Q lcl|Aclame:pro 138 TLVR-QYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTA 216 (408) Q Consensus 138 ~~~~-~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~ 216 (408) ..-+ ........++..+++-..-...+. . +.-+.---|.|+++++...... ..-++.+.+++..+.||++++-+-+ T Consensus 417 ~~y~~a~~t~~~~~~~~~~~DFk~~~~~~-l-g~~~~L~~V~E~gEyk~~t~~e-~~e~~~l~tyG~~~~iTRqaiINDD 493 (693) T protein:vir:95 417 AGWEEAEETFPLWTKSGILTDFKPARRVG-L-GEFSSLRQVREGAEYKYVTLGE-RGEQIILATYGELFSITRQAIINDD 493 (693) T ss_pred HHHHhhhhHHHHHhccCCCCcccccceee-c-CCCCChhhcCCCCceeeeecCC-ccceeehhhcCCeeeecHHhhhccc Confidence 2111 223355666665554443222221 1 2334445688998886432211 2357889999999999999988767 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhccccc------c------------chhhhhhHHHHHH---HHHHhh--------h Q lcl|Aclame:pro 217 ENILAWLSSWIAKKVVVTRNQAIIEVMKAA------P------------KKPTIAKFDDVIT---MINTAV--------D 267 (408) Q Consensus 217 ~~~~~~v~~~l~~~~~~~~~~~~~~g~g~~------~------------~~~~~~~~d~i~~---~~~~~l--------~ 267 (408) .++.+-|...++++..+.++..++.-...+ . ......+.+.+-. +|...- . 
T Consensus 494 Lga~~~ip~~~g~aA~~~~~~~vy~~L~~Np~m~DGk~LFhadH~Nl~tga~sals~~sl~~a~~am~~qk~~~~~~~g~ 573 (693) T protein:vir:95 494 LQMLSDIPFKLGQAAKATIGDLVYAVLTGNPAMSDGKTLFHADHSNLLTGAASALSIDSLSKAKTQMATQKAQVEKGKGR 573 (693) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHhcCccccCCcceeeccccccccccccccChHHHHHHHHHHHHhhcchhccCCc Confidence 888888899999999999987665432211 0 0111223333322 221110 0 Q ss_pred hhccCCCEEEEcHHHHHHHHhhhcccCceeeccccccCCccccccc-ceEeeccccccccccCcceEEEEehh-----cc Q lcl|Aclame:pro 268 PAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNSYLIKGK-QVIVVADRWLPNTGSTVYPLYYGDMS-----QA 341 (408) Q Consensus 268 ~~~~~~a~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~l~G~-pv~~~~~~~~~~~~~~~~~~~~gd~~-----~~ 341 (408) .-...+..||+++........+-.+.-.+- .+...+...-+.|+ .|++ +..+.....+. =.++.|.. -+ T Consensus 574 ~L~i~P~~llvP~~le~~a~~l~~s~~~~~--a~~~~~~~NP~~~~~~vi~--~prL~~~s~~~-Wyl~a~~~~dtie~~ 648 (693) T protein:vir:95 574 TLNIRPGFVLTPVALEDKANQIINSESVPG--ADVNSGIVNPIRAFAQVIG--EPRLDDASATA-WYMAAKKGSDTIEVA 648 (693) T ss_pred eeecccceEEecchHHHHHHHHhccccccc--cccccccccchhccccccc--cceecCCCCCc-eEEecCCCCCeEEEE Confidence 122356678888887666665543322111 11122212224454 2322 11222211111 11223311 12 Q ss_pred eEeeeccceEEEEeccchhhhhhceeeEEEEeeeCcEEecccceEEEEee Q lcl|Aclame:pro 342 ITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFS 391 (408) Q Consensus 342 ~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~ 391 (408) |+... ++-.|+. ...|..|.+.+++...+|++++|-.++++=... 
T Consensus 649 yL~G~-~~P~ie~----~~gf~~dG~~~kvr~D~G~~~iD~Rg~~kn~GA 693 (693) T protein:vir:95 649 YLDGV-DTPYLEQ----QEGFTVDGVASKVRIDAGVAPLDFRGLQKSNGA 693 (693) T ss_pred EecCC-CCCeEee----cCCCCcceEEEEEEEeccCceeeccccccCCCC Confidence 33322 3344433 235999999999999999999998877663222 No 208 >protein:vir:105522 Length: 423 # NCBI annotation: phage major head protein # Family: family:all:1412 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516191;genbank:gi:89885994;genbank:GeneID:3964382 Probab=87.75 E-value=0.037 Score=28.46 Aligned_cols=277 Identities=10% Similarity=-0.022 Sum_probs=110.0 Q ss_pred hhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeecc-------cCccceEEeeccCCccccchhccccccccccc Q lcl|Aclame:pro 116 ETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVS-------TSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDN 188 (408) Q Consensus 116 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~-------~~~g~~~~~~~~~~~~~~~~~~E~~~~~~~~~ 188 (408) |. .+=..++|+-+..++++.+++..++.+++..-.-. +.+-++++|......-...+...+....+... T Consensus 1 MA----Nsl~~l~p~iia~~al~~l~~~lV~~~lV~r~y~~ef~~ak~GDTV~I~~P~~~~~~d~~~~~~t~~~~~~l~e 76 (423) T protein:vir:10 1 MA----NNLDANVSQIVLKKFLPGFMSDLVLCKTVDRQLLAGEINSSTGDSVSFKRPHQFKSERTMDGDITGKSKNSLIS 76 (423) T ss_pred Cc----cccccccHHHHHHHHHHHHHhhcccchhhccCCCccccccccCCEEEEeeCCceeeecccCcccCccccccccc Confidence 22 11112799999999999999999988877653311 22223333321111101111111111111110 Q ss_pred ccceeeeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhcccc------ccchhhhhhHHHHHHHH Q lcl|Aclame:pro 189 PQLTIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVMKA------APKKPTIAKFDDVITMI 262 (408) Q Consensus 189 ~~f~~v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~~g~g~------~~~~~~~~~~d~i~~~~ 262 (408) .--.++++-+|...+--=..|+. ....++++++... .++++..+|..+...... 
+.+......+++++++- T Consensus 77 -~~v~l~id~~k~~a~~v~d~E~~-l~i~~~~~~l~~A-~~aLA~~vd~~ia~~~~~~~~~~vgt~~t~~~a~~~~a~a~ 153 (423) T protein:vir:10 77 -AKATGEVGNYITVAVEYRQIEEA-LKLNQLDQILVPI-NERMVTDLETELALFMMKHGALSLGSPNTPIKKWSDVAQTA 153 (423) T ss_pred -ceEEEEecceeeeeeeeChHHHh-cChhHHHHHHHHH-HHHHHHHHHHHHHHHhhhcccccccccccccccHHHHHHHH Confidence 11256666666666655566655 4566788776555 678888888877532211 12222223466665542 Q ss_pred HHhhhhhccC--CCEEEEcHHHHHHHHh----hhcccCceeeccccccC-CcccccccceEeeccccccccccCcc---- Q lcl|Aclame:pro 263 NTAVDPAIIA--TSSLLTNQSGLNKLAL----VKTAEGKYLLEPDPTKP-NSYLIKGKQVIVVADRWLPNTGSTVY---- 331 (408) Q Consensus 263 ~~~l~~~~~~--~a~~~~n~~~~~~l~~----lkd~~G~~~~~~~~~~~-~~~~l~G~pv~~~~~~~~~~~~~~~~---- 331 (408) ..|+....+ +...+++|..+..|.+ +...++. -...+..+ ..+++.|+.++...+ +|....+.. T Consensus 154 -~~L~~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~--~~~alr~~~i~G~~~GFdi~~Sn~--vp~~T~g~~~ga~ 228 (423) T protein:vir:10 154 -SFLKDLGINSGENYAVMDPWAAQRLADAQSGLHVSEQL--VRTAWENAQISGNFGGIRALMSNG--LASRTQGAFGGKL 228 (423) T ss_pred -HHHhhccCCcCCCEEEeCHHHHHHHhhhhhhhcccccc--chHHHHhcccceeecceEEEEecC--Cccccccccccee Confidence 334443332 4567999999887653 2221110 01112333 336899999887543 442211111 Q ss_pred -----eEEEEehhcceEeeeccceEEEEeccchhhhh--hceeeEEE---EeeeCcEE--------------------ec Q lcl|Aclame:pro 332 -----PLYYGDMSQAITLFDRENMSLLPTNIGAGAFE--TDTTKIRV---IDRFDVKA--------------------TD 381 (408) Q Consensus 332 -----~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~--~~~~~~r~---~~r~d~~v--------------------~~ 381 (408) ..+-|+-.. ...-...+..........|. 
-|...|-+ ..++...+ .- T Consensus 229 ~~~~~~~vt~a~~~---~~~~~~~~~~~~T~s~~g~l~~GD~~t~aGv~~v~~~tk~~l~~~~~~~~~~~~V~~~~~~~a 305 (423) T protein:vir:10 229 TVKGTPEVNYDSVK---DSYAFTATLTGATASKKGFLKVGDQLQFDDTHWLNQQSKQTLYNGASALSFTATVMEDANAHS 305 (423) T ss_pred eeeeeeEEEecccc---cccccccceeeccceeceeEEecceEeecceeeecccccceeecccCCcceEEEEEecccccc Confidence 111111100 00000000000000000000 01111100 11111111 01 Q ss_pred ccceEEEEeecc-----ccCCCCccCCC---cccC Q lcl|Aclame:pro 382 SEALVAGSFSAI-----ADQVGNFKTTT---STAV 408 (408) Q Consensus 382 ~~a~~~l~~~~~-----~~~~~~~~~~~---~~~~ 408 (408) +.++. +++.+. +..+..+++.+ ..+| T Consensus 306 ~~~~t-v~i~p~~~~~~~~~~~~~V~a~~a~~~~v 339 (423) T protein:vir:10 306 SGDVT-VKISGVPIFDAGYPQYNAVDRLLAEGDTV 339 (423) T ss_pred cCceE-EEeccccccccCcccccceeccccCCcee Confidence 11111 111110 00000111110 0111 No 209 >protein:vir:79008 Length: 299 # NCBI annotation: putative main capsid protein # Family: family:all:701 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110725;genbank:gi:134287342;genbank:GeneID:4955182 Probab=87.29 E-value=0.04 Score=28.26 Aligned_cols=264 Identities=7% Similarity=-0.059 Sum_probs=110.9 Q ss_pred hhccccccCceecchhhhhhhhhhhhhhhhhhhhhce---eecc-cCccceEEeeccCCccccchhcccccccccccccc Q lcl|Aclame:pro 116 ETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRV---ESVS-TSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQL 191 (408) Q Consensus 116 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~---~~~~-~~~g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f 191 (408) |.+ .-.++.++..+.+.....+....++.. ..+. 
.+..++.+|+........+-.+..+-.+..-..++ T Consensus 1 MA~-------~n~a~~~~~~Ld~~~~~~l~~~~L~~~~~~~~v~~~gg~tVkI~~i~~~gl~DY~R~~~g~~~g~~~~~~ 73 (299) T protein:vir:79 1 MAA-------LNYAKEYSNVLAQAYPYTLNFGDLYATPNNGRYRWTGSKTIEIPTISTTGRVDSNRDTIAVAQRNYDNAW 73 (299) T ss_pred Ccc-------chhHHHHHHHHHHHHHhhceeeeeccCcccceeeecCCCEEEEeccccccccccccCCCcccccccCcce Confidence 221 113467777777777776655444322 1111 11235667777655444443332122221112355 Q ss_pred eeeeechheeeeeh----HHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhh-------ccccc---cchhhhhhHHH Q lcl|Aclame:pro 192 TIIKYLIKRYAGII----TATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIE-------VMKAA---PKKPTIAKFDD 257 (408) Q Consensus 192 ~~v~~~~~~~~~~~----~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~~-------g~g~~---~~~~~~~~~d~ 257 (408) ...+++-.+.-.+. .+-+.-.. ..+...+.+.....+.-.+|.-.+. +.|.. ...+....++. T Consensus 74 ~t~~ldqdr~~~f~vD~~Dvdet~~~---~~~a~v~~~~~~~~v~pEiDay~~skl~~~a~~~g~~~~~~~~T~~n~y~~ 150 (299) T protein:vir:79 74 EPKVLTNQRKWSTLVHPADINQTNYV---ASIGNITKVYNEEQKFPEMDAYCISKIYADWTALGNTADTTVLTTTNVLEV 150 (299) T ss_pred eEEEeeccccceeccchhhHHHHhhh---hHHHHHHHHHHHHHhhhHhhHHHHHHHHHhhhhcCCcccccccCHHHHHHH Confidence 66777777666553 11111111 1112222222222333333332221 11111 11122233555 Q ss_pred HHHHHHHhhhhhcc--CCCEEEEcHHHHHHHHhhhc--ccCceeeccccccCCcccccccceEeecccccccc------- Q lcl|Aclame:pro 258 VITMINTAVDPAII--ATSSLLTNQSGLNKLALVKT--AEGKYLLEPDPTKPNSYLIKGKQVIVVADRWLPNT------- 326 (408) Q Consensus 258 i~~~~~~~l~~~~~--~~a~~~~n~~~~~~l~~lkd--~~G~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~------- 326 (408) +.+++. .|+.... .+..++++|..+..|.+-+. .+..........++..++|.|+||+.+++..+.+. 
T Consensus 151 i~~~~~-~lde~~vP~~~rvl~vtp~~~~~L~~~~~f~k~~~~~~~~~~~~g~Vg~idG~~Ii~Vps~r~~t~~~~~~G~ 229 (299) T protein:vir:79 151 FDKLME-KMTEARVPENGRILYVTPVVNTLIKNAKEIQRTVNIKDAGTSLNRQTTDIDTVKIIKVPSNLMKTAYDFTTGW 229 (299) T ss_pred HHHHHH-HHHhcCCCCCCeEEEeCHHHHHHHhhchhhhcccccccccceeeeeeeeecceEEEEechhhcCccceeccCc Confidence 555554 3554433 35567889999887764321 11111111123344457899999998876555531 Q ss_pred ----ccCcceEEEEehhcceEeeec-cceEEEEeccchhhhhhceeeEEEEeeeCcEEeccc-ceEEEEeecccc Q lcl|Aclame:pro 327 ----GSTVYPLYYGDMSQAITLFDR-ENMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSE-ALVAGSFSAIAD 395 (408) Q Consensus 327 ----~~~~~~~~~gd~~~~~~~~~~-~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~-a~~~l~~~~~~~ 395 (408) ++.+-.++++..+. ..-... ..+.+ ..|.. +..+-..+.-..++|.=|.+.+ .-+.+.++++.. T Consensus 230 ~~~~~ak~in~ii~~~~a-~~~~~K~~~~~~-~~P~~---~~~~~~~~~~r~y~d~~v~~nk~~~i~~~~~~a~~ 299 (299) T protein:vir:79 230 KVGAGAKQIFMSLVHPSA-IITPVSYQFSKL-DEPTA---VTEGKYFYFEESFEDVFILNKKADAIQFVVEGAGA 299 (299) T ss_pred cccCcccccceEEEcCCe-eeeeEeeeeEEe-ecCCC---CCccceeeeeeeeeeeeeeccccCeEEEEeeecCC Confidence 11111245554332 222211 12222 22322 1221122333334555555542 222333443333 No 210 >protein:vir:96666 Length: 462 # NCBI annotation: ORF016 # Family: family:all:2450 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238545;genbank:gi:66391271;genbank:GeneID:5130448 Probab=87.06 E-value=0.041 Score=28.17 Aligned_cols=315 Identities=13% Similarity=0.068 Sum_probs=133.0 Q ss_pred cccccccccchhhhHHHHHHHHHHHhhcchhhHHHHHHHHhhccccccCceecchhhhhhhhhhhhhhh--hhhhhhcee Q lcl|Aclame:pro 76 EEEKGPLNKSENELKDKFVKDFVNMVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYD--SLQQYVRVE 153 (408) Q Consensus 76 ~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~--~l~~~~~~~ 153 (408) ..............-+....++.+.+..|.+. +-.+-.++|.+=-+.+..+|..+..... .+.+-+.+. 
T Consensus 1 ~~~~~~~~~~~~~~~~~~~e~~~KS~~tg~g~---------~p~~q~~~gAlR~esL~~~i~~Lt~~~~~~~~~~~i~k~ 71 (462) T protein:vir:96 1 MHKDTNLTAEQNKYADKFQEEVMKSYQTGYGI---------TPDTQVDAGALRREILDDQITMLTWTQDDLIFYREISRR 71 (462) T ss_pred CccccccchhhhhhhchhhHHHHHHHhcCCCc---------CCccccccchhhhhhhhhhhheeeecccchhhhhhcCCc Confidence 00000000111111111122333333332110 0011223344333444444433333222 222223344 Q ss_pred ecccCccceEEeeccCCccccchhcccccccccccccceeeeechheeeeehHHHHHH-HhcchHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 154 SVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTS-LKDTAENILAWLSSWIAKKVV 232 (408) Q Consensus 154 ~~~~~~g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~el-l~ds~~~~~~~v~~~l~~~~~ 232 (408) +..+.-..+......+..+...+++|++-.+ .+++.+.+.+...+-++.-..+|... +..+..+..+...+.-...++ T Consensus 72 ~a~sTv~~y~~~~~~G~~g~~~f~~E~g~~~-~~d~~~~R~~~~~k~l~~t~~vsi~~tl~n~~~d~~~~~~~dai~~~a 150 (462) T protein:vir:96 72 PAQSTVQKYDVYLRHGNVGHSRFVREVGVAP-VSDPNIRQKTVEMKYVSDTKNLSIASTLVNNIQDPMQILTEDAIAVVA 150 (462) T ss_pred hhhhhhhhheeeeccCccccccccccccccc-cCCCceEEEEEEEEEEeeeeeechhhhhccchhhHHHHHHHHHHHHHH Confidence 4444334444444445667788899999875 67899999999999999877666544 244566777888888889999 Q ss_pred HHHHHHHhhccccccc--hhhhhhHHHHHHHH---------------------HHhhhhhccCCCEEEEcHHHHHHHHh- Q lcl|Aclame:pro 233 VTRNQAIIEVMKAAPK--KPTIAKFDDVITMI---------------------NTAVDPAIIATSSLLTNQSGLNKLAL- 288 (408) Q Consensus 233 ~~~~~~~~~g~g~~~~--~~~~~~~d~i~~~~---------------------~~~l~~~~~~~a~~~~n~~~~~~l~~- 288 (408) ..++.+.+.|+..-.+ .+-...+|.|.+++ ...+..+|....-++|+.-+.+.+.. 
T Consensus 151 ~tiE~a~Fygds~l~~~~~~~gleFDGl~~lI~~~NViDarG~~Ls~~~ln~aa~~i~~~fGt~TD~~~p~~v~a~f~~~ 230 (462) T protein:vir:96 151 KTIEWASFYGDASLTADPTGQGLEFDGLAKLIDKDNVIDAKGESLTETLLNRSAVLIGKSFGTATDAYMPIGVHADFVNS 230 (462) T ss_pred HHHHHHHhhhhcccCCCccccccchhhhhhhcCCCceeecCCCCccHHHHhhhhhhcccccCChhheecchHHHHHHHHh Confidence 9999999999876544 12234455543332 22233445444456777777777652 Q ss_pred hhcccCceeeccccccCCcccccccceEee--c---cccccccccCcceEEEEehhc-ceEeeeccceEEEEeccchhhh Q lcl|Aclame:pro 289 VKTAEGKYLLEPDPTKPNSYLIKGKQVIVV--A---DRWLPNTGSTVYPLYYGDMSQ-AITLFDRENMSLLPTNIGAGAF 362 (408) Q Consensus 289 lkd~~G~~~~~~~~~~~~~~~l~G~pv~~~--~---~~~~~~~~~~~~~~~~gd~~~-~~~~~~~~~~~i~~~~~~~~~f 362 (408) +-..+ |.+.+++.. ....|+||--. . -..-++...+ .+.+++--.+ .-....-..++...-......| T Consensus 231 ~l~~q-rv~~~~n~g----~~~~G~~v~~f~s~~G~I~L~~s~~m~-~~~i~~~~~~~~p~ap~~~~vsaTv~t~~~g~f 304 (462) T protein:vir:96 231 VLGRQ-MQLMQDNSG----NVNAGYNVQGFYSSRGFIKLHGSTVME-NELILDESLQPLPNAPQPATVKATVETGKKGLF 304 (462) T ss_pred hcCce-EEEEcCCCC----ceeeeeeccceeeeeeeeeeCCceecC-cccccccccccCCCCCCCCceeEEEEeCCCCCC Confidence 21111 222222111 12333333210 0 0000111111 1111110000 0000000011111000000011 Q ss_pred ----hhceeeEEEEeeeCcEEecccceEEEEeeccccCCCCccCCCcccC Q lcl|Aclame:pro 363 ----ETDTTKIRVIDRFDVKATDSEALVAGSFSAIADQVGNFKTTTSTAV 408 (408) Q Consensus 363 ----~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~~~~~~~~~~~ 408 (408) ....+.+++...-+..--.|...+-+++.++++.+.-+-+ -+++ T Consensus 305 ~~~~d~~~y~Y~V~avs~dgeS~PS~~VtaTva~~~~gv~ltIt--~~a~ 352 (462) T protein:vir:96 305 TDEHDRAELTYKVVVNSDDAQSAPSEAVTATVNNATDGVKLEIS--VNAM 352 (462) T ss_pred CCccCceeEEEEEEEECCCCccccceeeEeeeecccccceEEEE--EcCC Confidence 1223333333333333334555555555444433333211 1122 No 211 >protein:vir:96792 Length: 315 # NCBI annotation: major capsid protein # Family: family:all:47 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224246;genbank:gi:62362381;genbank:GeneID:3345731 Probab=85.80 E-value=0.05 Score=27.71 Aligned_cols=270 Identities=8% 
Similarity=-0.021 Sum_probs=85.4 Q ss_pred hhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeec----ccCccceEEee---ccCCccccchhccccccccccc Q lcl|Aclame:pro 116 ETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESV----STSNGSRVYEK---WTDVTPLTVMDAEDGKIPDLDN 188 (408) Q Consensus 116 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~----~~~~g~~~~~~---~~~~~~~~~~~~E~~~~~~~~~ 188 (408) +.++..++- .+--+.+..-.++.+.+....++-..--.+ ....|.+..+. ..+.....- +...++.+.... T Consensus 1 ~~~t~~sdl-~vfn~~~~~a~~e~~~~~~~~Fnaas~Gai~l~~~~~~GDf~~~~ff~i~~~~~~rn-v~~~~~~t~~ki 78 (315) T protein:vir:96 1 MATTVNSDL-VIYNDTAQTAYLERNMDNLAVFNENSRAAIGLNSELIEGDLKLRSFYKVGGAIADRD-VNSTATVAGTKI 78 (315) T ss_pred Cceeeecce-eeehhhhhhhHHhhhHHHHHHhhhhcCCcccccccccccccccccccccccchhhcc-cCCCccccceec Confidence 222222221 111222223333333332222221110000 00011111111 000000000 000111111111 Q ss_pred ccceeeeechheeeeehH--HHHHHHh---cchHHHHHHHHHHHHHHHHHHHHHHHhhccccc---------cchhhhhh Q lcl|Aclame:pro 189 PQLTIIKYLIKRYAGIIT--ATNTSLK---DTAENILAWLSSWIAKKVVVTRNQAIIEVMKAA---------PKKPTIAK 254 (408) Q Consensus 189 ~~f~~v~~~~~~~~~~~~--iS~ell~---ds~~~~~~~v~~~l~~~~~~~~~~~~~~g~g~~---------~~~~~~~~ 254 (408) .+...+..+. ..+.-+ ++.+.+. +.+..+-.-|...+..+..+..-...+.+..+. .......+ T Consensus 79 t~~~dvaVk~--~~~~~~~~~~~~~~a~~g~dp~~~~~~i~~~~~~~~l~~~l~~~l~~~~aai~~~t~~~~~~~~a~~~ 156 (315) T protein:vir:96 79 AADEMVSVKV--PWKYGPYETTEEAFKRRARSPEEFSMLIGQDMADATMAGWIGYALNALQGAIGSNAGMNVSGELATEG 156 (315) T ss_pred ccccceeEEE--eecCCchhccHHHHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhcccccccccccccccC Confidence 1122222222 112223 3333333 222222222333333333333222222222110 11223344 Q ss_pred HHHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHhhhcccCceeeccc--cccCCcccccccceEeeccccccccccCcce Q lcl|Aclame:pro 255 FDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPD--PTKPNSYLIKGKQVIVVADRWLPNTGSTVYP 332 (408) Q Consensus 255 ~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~lkd~~G~~~~~~~--~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~ 332 (408) ...+.++... +-.....-..|+||..++..|.+ +. --..++... ..-+.+...+|+||++.|. +|.. .. 
T Consensus 157 ~~~l~dA~~k-lGD~~~~l~~~vMHS~v~~~L~~-q~-L~~~~~~~~~~~~~~~~~~~lGkrViVdD~--~P~~----~~ 227 (315) T protein:vir:96 157 KKVLTKGLRT-MGDKASSIAIWVMDSTSYFDIVD-EA-IDNKLYEEAGVVVYGGTPGTLGKPVLVTDQ--CPAT----KI 227 (315) T ss_pred HHHHHHHHHH-hcccccCeeEEEEchHHHHHHHH-hh-hhhhcccccceeEecCcCcccccEEEEECC--CCcc----ee Confidence 5566666644 43444455679999999998876 21 111222110 1111123455999999765 3431 11 Q ss_pred EEEEehhcceEeeeccceEEEEeccchhhhhhceeeEEEEeeeCc-EEecccceEEEEeeccccCCCCccCCCcccC Q lcl|Aclame:pro 333 LYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDV-KATDSEALVAGSFSAIADQVGNFKTTTSTAV 408 (408) Q Consensus 333 ~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~-~v~~~~a~~~l~~~~~~~~~~~~~~~~~~~~ 408 (408) ..||. .++......++. ..+... .++-.+....|..+ -.++|.+|..-+. ....|..+.-.++.-- T Consensus 228 ~gl~~--GAi~~~~~~~~~--~~~~~~----~g~e~l~~~~r~e~tf~l~p~G~sw~~~--~~~sPt~aeLat~~NW 294 (315) T protein:vir:96 228 FGLVA--GAVMITESQAPG--MRSYQI----DDQENLAIGFRAEGTANVEVLGYKWKTK--TNVNPASATLATTTNW 294 (315) T ss_pred eeeec--ceeeecCCCccc--cccccC----CCcceeEEEEeeeeEeeeeeeeEEeecC--CCcCCChHHhcCCcCc Confidence 11222 111111111111 111111 11122223333333 4677777766321 1111111111111111 No 212 >protein:vir:105374 Length: 423 # NCBI annotation: gene 5 protein # Family: family:all:1412 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958181;genbank:gi:41057283;genbank:GeneID:2716621 Probab=83.60 E-value=0.067 Score=27.01 Aligned_cols=283 Identities=11% Similarity=0.048 Sum_probs=109.5 Q ss_pred hhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeeccc----Ccc-ceEEeeccCCccccchhcccc--ccccccc Q lcl|Aclame:pro 116 ETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVST----SNG-SRVYEKWTDVTPLTVMDAEDG--KIPDLDN 188 (408) Q Consensus 116 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~----~~g-~~~~~~~~~~~~~~~~~~E~~--~~~~~~~ 188 (408) |.. + --..+|+.+..++++.+++..++.+++..-.-.. ..| ++.++.........+-...+. ..++... 
T Consensus 1 MaN-~---llT~~p~iia~~aL~~l~~~lV~~~lVnr~y~~ef~~~k~GDTV~I~~p~~~~~~d~~~~~~~~~~~~dl~e 76 (423) T protein:vir:10 1 MPN-N---LDSNVSQIVLKKFLPGFMSDLVLAKTVDRQLLAGEINSSTGDSVSFKRPHQFSSLRTPTGDISGQNKNNLIS 76 (423) T ss_pred Ccc-c---hhhhhHHHHHHHHHHHHHhhcccchhhcccCCCcccccccCCEEEEeeCCceeeeccCCccccccccCcccc Confidence 211 1 0113799999999999999999877766532111 112 222332222211111111211 1122111 Q ss_pred ccceeeeechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhc-ccc-----ccchhhhhhHHHHHHHH Q lcl|Aclame:pro 189 PQLTIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEV-MKA-----APKKPTIAKFDDVITMI 262 (408) Q Consensus 189 ~~f~~v~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~~g-~g~-----~~~~~~~~~~d~i~~~~ 262 (408) .-..++++-+|...+--=..|+.. ...++++++... .++++..+|..++.- .+. +.+......+++++++- T Consensus 77 -~~v~l~id~~k~va~~v~d~E~~~-~i~~~~~~l~~A-~~aLA~~vd~~ia~~~~~~~~~~~gt~~t~~~a~~~i~~a~ 153 (423) T protein:vir:10 77 -GKATGRVGNYITVAVEYQQLEEAI-KLNQLEEILAPV-RQRIVTDLETELAHFMMNNGALSLGSPNTPITKWSDVAQTA 153 (423) T ss_pred -ceeEEEeeceeeeeeeechHHHhc-ChhhHHHHHHHH-HHHHHHHHHHHHHHHHhhccccccccCCcccchHHHHHHHH Confidence 112467777777776655666643 455687777655 578888888877642 111 11112223466655443 Q ss_pred HHhhhhhccC--CCEEEEcHHHHHHHHhhhc--ccCceeeccccccCC-cccccccceEeeccccccccccCc--ce--E Q lcl|Aclame:pro 263 NTAVDPAIIA--TSSLLTNQSGLNKLALVKT--AEGKYLLEPDPTKPN-SYLIKGKQVIVVADRWLPNTGSTV--YP--L 333 (408) Q Consensus 263 ~~~l~~~~~~--~a~~~~n~~~~~~l~~lkd--~~G~~~~~~~~~~~~-~~~l~G~pv~~~~~~~~~~~~~~~--~~--~ 333 (408) ..|+....+ +...+++|..+..|.+-.. ..+.......+..+. .+++.|+.|+.+.+ +|....+. +. 
+ T Consensus 154 -~~Ld~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~~alr~g~i~G~i~GFdv~~Snn--ip~~T~gt~~~t~~~ 230 (423) T protein:vir:10 154 -SFLKDLGVNEGENYAVMDPWSAQRLADAQTGLHASDQLVRTAWENAQIPTNFGGIRALMSNG--LASRTQGAFGGTLTV 230 (423) T ss_pred -HHHHhccCCcCCCEEEeChHHHHHHhccccceecccccchhhhhhccceeeecceEEEEeCC--Cccccccccccceee Confidence 334444333 4567899999887653210 011111112233333 36899999887543 44321111 00 0 Q ss_pred EEEehhcceEeeeccceEEEEe--ccchhhhh--hceeeEEE---EeeeCcEEe------cccceEE------------- Q lcl|Aclame:pro 334 YYGDMSQAITLFDRENMSLLPT--NIGAGAFE--TDTTKIRV---IDRFDVKAT------DSEALVA------------- 387 (408) Q Consensus 334 ~~gd~~~~~~~~~~~~~~i~~~--~~~~~~f~--~~~~~~r~---~~r~d~~v~------~~~a~~~------------- 387 (408) ..|-.-.+-.-......++... ......+. -|...|-+ ..+....++ .+.-|++ T Consensus 231 ~~~~~v~~~a~~~a~~~~~~~~~~~~~~~~~l~~GD~~t~aGv~~v~~~tk~~~~~~~t~~~~~~~v~a~~~~~~~g~~t 310 (423) T protein:vir:10 231 KTQPTVTYNAVKDSYQFTVTLTGATASVTGFLKAGDQVKFTNTYWLQQQTKQALYNGATPISFTATVTADANSDSGGDVT 310 (423) T ss_pred eecceeccccccccceeeeeeeeccccccCceeecceEEecceeeecccccccccccccCcceEEEEEeeeeeccCCcee Confidence 0000000000000000111000 00000000 01111111 001111100 1111111 Q ss_pred EEeecc-----ccCCCCccCC---CcccC Q lcl|Aclame:pro 388 GSFSAI-----ADQVGNFKTT---TSTAV 408 (408) Q Consensus 388 l~~~~~-----~~~~~~~~~~---~~~~~ 408 (408) +++.+. +.....+++. 
++.+| T Consensus 311 v~i~p~~i~~~~~~~~~~v~a~~a~~~~v 339 (423) T protein:vir:10 311 VTLSGVPIYDTTNPQYNSVSRQVEAGDAV 339 (423) T ss_pred eeccCccccccCCcccccccccccCCcee Confidence 111110 0000001111 01111 No 213 >protein:vir:94070 Length: 339 # NCBI annotation: putative structural protein # Family: family:all:1653 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453625;genbank:gi:84662661;genbank:GeneID:5142580 Probab=81.28 E-value=0.087 Score=26.40 Aligned_cols=289 Identities=9% Similarity=-0.039 Sum_probs=124.3 Q ss_pred cccccchhhhHHHHHHHHHHH---hhcchhh--HHHHHHHHhh-------ccccccCceecch----hhhhhhhhhhhhh Q lcl|Aclame:pro 80 GPLNKSENELKDKFVKDFVNM---VRNPMAF--MNTVSSKTET-------SGSDSAAGLTIPQ----DIRTMINTLVRQY 143 (408) Q Consensus 80 ~~~~~~~~~~~~~~~~a~~~~---~~~~~~~--~~~~~~~a~~-------~~t~~~gg~~vP~----~~~~~ii~~~~~~ 143 (408) .......... +.++++ +.+.... .......++- ..+..+ .-||. .+.+.|++...+. T Consensus 1 ~~~~~~~~~~-----~~l~~~g~~~~~~~~~~~~~~~~~~a~d~~~~~~~~~~~~~--~~i~a~~~~~i~~~vy~~~~~~ 73 (339) T protein:vir:94 1 MSINNDRTDI-----KQLEKVGIIFDGYSPKSISSEVSAYAMDAVNLTPTLQTTAN--AGIPAWMTTFVDRRVIDIQLAP 73 (339) T ss_pred CceechHHHH-----HHHHhhceeeccchhhhcchhhHhhhccccccccccccccc--cchhhhhhhhhchhheeecccc Confidence 1111111111 111110 0000000 0001111111 011111 22443 3446677777777 Q ss_pred hhhhhhhceeecccC-ccceEEeeccCCccccchhcccccccccc-cccceeeeechheeeeehHHHHHHHhc--chHHH Q lcl|Aclame:pro 144 DSLQQYVRVESVSTS-NGSRVYEKWTDVTPLTVMDAEDGKIPDLD-NPQLTIIKYLIKRYAGIITATNTSLKD--TAENI 219 (408) Q Consensus 144 ~~l~~~~~~~~~~~~-~g~~~~~~~~~~~~~~~~~~E~~~~~~~~-~~~f~~v~~~~~~~~~~~~iS~ell~d--s~~~~ 219 (408) ...+.++.+.+...- ...+.+.. .+..+.+.+.+.++..|-.+ +..|.+.++....++-... 
..|+-.- ...++ T Consensus 74 ~~~~~l~pv~t~g~w~~~t~~y~~-~e~~G~a~~ygd~ad~Pl~~~~v~~~~~~v~~~~~g~~y~-~~E~~~A~~~g~~l 151 (339) T protein:vir:94 74 MAAAKIFPEVKKGDWTTTYGVFII-AEPVGQVATYSDWSANGMSKANVNFESRQNYRYQTWTEYG-DLEMATYGEAGIDY 151 (339) T ss_pred cchhhhcccccCCCCcccEEEEee-eecccceEEcccccCCCcccccceeeEEeEEEEEEEEeec-HHHHHHHHhhCCCh Confidence 777777777664321 12344443 34556667888888886544 2345555554444444333 2333221 23566 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhcccccc------------chh-----hhhhHHHHHHHHHHh---hhhhc----c--CC Q lcl|Aclame:pro 220 LAWLSSWIAKKVVVTRNQAIIEVMKAAP------------KKP-----TIAKFDDVITMINTA---VDPAI----I--AT 273 (408) Q Consensus 220 ~~~v~~~l~~~~~~~~~~~~~~g~g~~~------------~~~-----~~~~~d~i~~~~~~~---l~~~~----~--~~ 273 (408) .+--.....+++.+.+|+-.+.|+.... ..+ ...+.+.|++.++.. +...- . .. T Consensus 152 ~~~Ka~aA~~al~~~~N~i~~~Gd~~~~~~GLlN~P~l~~~v~~s~~Wa~kT~~eI~~Di~~~~~~l~~~s~g~~~~~~~ 231 (339) T protein:vir:94 152 VARQEISASLVMAKFANSSYLLGVAGIANYGLMNDPSLPAPVAATVNWATAAPEDIANDVVAMVGRLISQSGGLITGQER 231 (339) T ss_pred HHHHHHHHHHHHHHhhceEEeeeecccceEEEEeCCCccccccCCCCcccCCHHHHHHHHHHHHHHHHHhcCCeeeeccC Confidence 7777777888888888888787765321 000 112344444433322 21111 1 12 Q ss_pred CEEEEcHHHHHHHHhhhcccCceeeccccccCCcccccccceEeeccccccccccCcceEEEEeh---hcceEeeeccce Q lcl|Aclame:pro 274 SSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDM---SQAITLFDRENM 350 (408) Q Consensus 274 a~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~---~~~~~~~~~~~~ 350 (408) ..++++|+.+..|... +..|.-++. -+... +-++.+..+ ..+.+..++...+++-.. ...... 
-.+ T Consensus 232 ~~L~LP~~~~~~L~~~-n~~~~Tvl~-~lk~n----~pnl~i~~~--~el~~a~g~~~~~~~~~~~~~~~~~~~---~p~ 300 (339) T protein:vir:94 232 MVMALAPSALNNVNRT-NNFGLSAGA-KIAQT----YPNIQFVAV--PEFDTASGRLVQLWVPEVNGQPTGEVA---FAE 300 (339) T ss_pred cEEEecHHHHHhcccC-CcCCccHHH-HHHHh----cCCcEEEEc--cccccCCCceEEEEEEeccCCcceEEE---cch Confidence 3688999999888643 333332221 01111 112222222 123322222222221111 111111 111 Q ss_pred EEEEeccchhhhhhceeeEEEEee-eCcEEecccceEEEEee Q lcl|Aclame:pro 351 SLLPTNIGAGAFETDTTKIRVIDR-FDVKATDSEALVAGSFS 391 (408) Q Consensus 351 ~i~~~~~~~~~f~~~~~~~r~~~r-~d~~v~~~~a~~~l~~~ 391 (408) .+...+.. ...-.+..-+..| .|+.+.+|.||+.++.- T Consensus 301 ~~~~lpvq---~~~~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 339 (339) T protein:vir:94 301 KLRSHSIE---RYSTTTRQKHSGATFGAVIYQPWAVTQELGV 339 (339) T ss_pred hhhccccE---EcCceEEecceeeeeeEEEEccceeeeeecC Confidence 22221111 1111233456666 45688999999999866 No 214 >protein:vir:3746 Length: 336 # NCBI annotation: orf15 # Family: family:all:201 # MgeID: mge:79 # MgeName: HP1 # Cross-refs: genbank:acc:NP_043487;genbank:gi:9628622;genbank:GeneID:1261135 Probab=79.97 E-value=0.099 Score=26.08 Aligned_cols=285 Identities=11% Similarity=0.067 Sum_probs=134.9 Q ss_pred HHHHHHHHHhhcchhhHHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeecccCccceEEeeccCCc Q lcl|Aclame:pro 92 KFVKDFVNMVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVT 171 (408) Q Consensus 92 ~~~~a~~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~ 171 (408) .-+..|..++..-.... .+. ...++.+-.+.|.+.....+.+.+.+.+.+++.++++++....|........ . 
T Consensus 1 mtr~~~~~y~~~~A~~n-gv~----~a~~~~~~~Fsv~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~lg~~--g 73 (336) T protein:vir:37 1 MNKQAYYALAAALAKHF-NQP----LDSVLRGESFALKAPEAALLGENIQQRSDFLKQINMIQVAHTKGQKLFGATE--K 73 (336) T ss_pred CcHHHHHHHHHHHHHHh-CCC----hhhhccCceeecCHHHHHHHHHHHHHHHHHhhcCceeecccccceEeeeccC--c Confidence 11223333332211000 000 0001122358899999999999999999999999999999888876644221 1 Q ss_pred cccchhcccccccccccccceeeeechheeeeehHHHHHHHhcch--HHHH-HHHHHHHHHHHHHHHHHHHhhcccccc- Q lcl|Aclame:pro 172 PLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTA--ENIL-AWLSSWIAKKVVVTRNQAIIEVMKAAP- 247 (408) Q Consensus 172 ~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~--~~~~-~~v~~~l~~~~~~~~~~~~~~g~g~~~- 247 (408) +.++ ....+-.+ ....++.-.+..++.---..|+.+.|+..+ +|+. ..+...+.++++.=.-.-.++|+..+. T Consensus 74 ~iag-rtdt~R~~--~~~~l~~~~Y~c~qTn~dt~i~y~~LD~WA~~~df~~~~~~~~~~r~iALD~i~IGfnG~s~A~~ 150 (336) T protein:vir:37 74 GVTG-RKQTGRNL--ANLDHTQNGFELAETDSGIIVPWALFDSFAIFKDRLVELYSEYFQNQVALDILQIGWNGQSVADN 150 (336) T ss_pred cccc-ccCCCccc--cccCcCCcccEEEEeeeeeeecHHHHHHHhcChhHHHHHHHHHHHHHHhhchhhhcccceeeccC Confidence 2211 11111111 112344444555555445667777777653 2322 222233333333222223344443211 Q ss_pred ----------------------------------------chhhhhhHHHHHHHHHHhhhhhccC--CCEEEEcHHHHHH Q lcl|Aclame:pro 248 ----------------------------------------KKPTIAKFDDVITMINTAVDPAIIA--TSSLLTNQSGLNK 285 (408) Q Consensus 248 ----------------------------------------~~~~~~~~d~i~~~~~~~l~~~~~~--~a~~~~n~~~~~~ 285 (408) ..+.-.+.|.++..+...+++.++. .-++||.++.++. 
T Consensus 151 TdnPllqDVNkGWlQ~~Re~a~~~v~~~~~~~~g~i~~~G~~gdy~NLDalV~D~~~~I~~~~~~d~dLVvivG~dLla~ 230 (336) T protein:vir:37 151 TTKADLSDVNKGWLKLLQEQRAANFMTESTKSSGKITIFGDNADYANLDDLAFDLKQGLDFRHQNRNDLVFLVGADLVSK 230 (336) T ss_pred CCCCcccccchhHHHHHHhccchhhcccccccCCceEEecCCCCcccHHHHHHHHHhcCchHHhcCCCeEEEEchhhhhh Confidence 0112344566554443458887764 5578888877543 Q ss_pred -HHhhhcccC-cee--eccccccCCcccccccceEeeccccccccccCcceEEEEehhcceEeeeccceEEEEec-cchh Q lcl|Aclame:pro 286 -LALVKTAEG-KYL--LEPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTN-IGAG 360 (408) Q Consensus 286 -l~~lkd~~G-~~~--~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~-~~~~ 360 (408) -..|-+.+| +|- ...+... ...++-|+|.+.++. +|... +++=-|++.-..+..+..+=.+.+ ...+ T Consensus 231 ~~~~l~~~~~~~PtE~~Aa~~~~-~~k~iGGlpa~~~Pf--fP~~~-----~lVT~L~NLsIY~Q~gs~RR~~~d~p~r~ 302 (336) T protein:vir:37 231 ETKLIQQKHGLTPTEKAALGSHN-LMGSFGGMNAITPPN--FPARA-----AAVTTLKNLSVYTEAESVRRSLRNDEDKK 302 (336) T ss_pred hhhhhhhhcCCCHHHHHHHHHHH-HHHhhCCceeEEccc--cCCCc-----eEEeechhcEEEEecCcEEEEEEEccccc Confidence 122323332 221 0000010 124789999998653 56533 455555554444444443333222 1122 Q ss_pred hhhhceeeEEEEeeeCcEEecccceEEEEeeccccCCCC Q lcl|Aclame:pro 361 AFETDTTKIRVIDRFDVKATDSEALVAGSFSAIADQVGN 399 (408) Q Consensus 361 ~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~~ 399 (408) .++.+ -..--|..|-++..++.+.-..+....-- T Consensus 303 rie~y-----~s~Ne~YvVEd~~~~a~iE~i~v~~~~e~ 336 (336) T protein:vir:37 303 GLVTS-----YYRQEGYVVEDLGLMTAIDHTKVKLNGEV 336 (336) T ss_pred cccch-----hhhcceeeeeccccEEEeeeeeeeecCcC Confidence 22221 11223556667777776665444432222 No 215 >protein:vir:78558 Length: 336 # NCBI annotation: major capsid protein # Family: family:all:1653 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294848;genbank:gi:149882911;genbank:GeneID:5291029 Probab=79.15 E-value=0.11 Score=25.90 Aligned_cols=292 Identities=8% Similarity=-0.049 Sum_probs=121.4 Q ss_pred 
HHHHHHHHhhhcccccccccccchhhhHHHHHHHHHHHhhcchhhHHHHHHHHh----hccc-cccCceecchhhh---- Q lcl|Aclame:pro 63 LVEAQAEQVVNMREEEKGPLNKSENELKDKFVKDFVNMVRNPMAFMNTVSSKTE----TSGS-DSAAGLTIPQDIR---- 133 (408) Q Consensus 63 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~a~----~~~t-~~~gg~~vP~~~~---- 133 (408) +.+.+.......-. ....... .. ........++ ..+. .+.+...||..+. T Consensus 1 ~~~~~~~~~l~~~g-----i~~~~~~--~~--------------~~~~~~~~a~da~d~~~~~~t~~~~g~~~~l~~~i~ 59 (336) T protein:vir:78 1 MRDAQRIQNLARAG-----VILPRSV--KN--------------VSTPLAEYAMDAADLSPHLSSTGSSGIPNYLTTYVD 59 (336) T ss_pred CchHHHHHHHhccC-----eecchhh--hh--------------hhHHHHHHHHhhhhhccccccCCCcchHHHHHHhcc Confidence 11100000000000 0000000 00 0000000111 0111 1111122555433 Q ss_pred hhhhhhhhhhhhhhhhhceeecccCc-cceEEeeccCCccccchhcccccccccccccceeeeechheeeeehHHHHHHH Q lcl|Aclame:pro 134 TMINTLVRQYDSLQQYVRVESVSTSN-GSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSL 212 (408) Q Consensus 134 ~~ii~~~~~~~~l~~~~~~~~~~~~~-g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell 212 (408) +.+++.+........++.+.+...-. ..+.+ ......+.+.+.+.+..+|-.+ ...+..+-+.+.++..+.++..=+ T Consensus 60 p~~~~~~~~~~~~~~l~~v~t~g~W~~~~~~~-~~~e~~G~a~~ygd~~D~P~vd-~~~~~~~~~v~~~~~g~~yg~~El 137 (336) T protein:vir:78 60 PSVIDILVAPMKAAELVGESKKGDWTTLVAAF-ITAEPTTTVATYGDYSSDGDSG-TNINYPQRQSYFFQTWTRWGEREL 137 (336) T ss_pred cceeeehhhhhhhhhhcccccCCCccccEEEE-eeeecceeeEEeecccCCCeee-cceeeEEEEEEEEEeeeeecHHHH Confidence 45566666666555665554431110 12223 2334556667788888887544 456677777788887777774333 Q ss_pred hc---chHHHHHHHHHHHHHHHHHHHHHHHhhcccccc------------------chhhhhhHHHHHHHHHHhhh---- Q lcl|Aclame:pro 213 KD---TAENILAWLSSWIAKKVVVTRNQAIIEVMKAAP------------------KKPTIAKFDDVITMINTAVD---- 267 (408) Q Consensus 213 ~d---s~~~~~~~v~~~l~~~~~~~~~~~~~~g~g~~~------------------~~~~~~~~d~i~~~~~~~l~---- 267 (408) .. ...++.+--....++++.+.+|.-.+.|+.... +.-..++.+.+++.+...+. 
T Consensus 138 ~~A~~~g~~l~~~Ka~aA~~ale~~~N~~~~~Gd~~~~~~GllN~P~l~a~~t~~~~~w~~~T~~~I~~Di~~~~~~l~~ 217 (336) T protein:vir:78 138 EMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQVLQT 217 (336) T ss_pred HHHHHhCCCcHHHHHHHHHHHHHHhhCeEEEEeccccceEEEEeCCCCCcccccCcCcccccCHHHHHHHHHHHHHHHHH Confidence 32 234666666777777777888877777765321 00011234444443332211 Q ss_pred -hh--cc--CCCEEEEcHHHHHHHHhhhcccCceeeccccccCCcccccccceEeeccccccccccCcceEEEEehh--c Q lcl|Aclame:pro 268 -PA--II--ATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMS--Q 340 (408) Q Consensus 268 -~~--~~--~~a~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~--~ 340 (408) .. .. ....++++++.+..|..- +..|--++. -+....| ++.+...+ -+.+.+++...++.-++. . T Consensus 218 qt~g~~~~~~~~tL~Lp~~~~~~L~~~-n~~g~tv~~-~lk~n~P----nl~i~t~p--el~~Agg~~~~~~~~~~~~~~ 289 (336) T protein:vir:78 218 QSQGIITQEAVLHMGLPPTAMSDLSKT-NQYGLSAAA-KLKEIFP----KLEFVTIP--EYDTASGRLVQLWAPRVEGKD 289 (336) T ss_pred hcCCeeeeccceEEEechHHHHhccCC-CccCccHHH-HHHHhcC----ccEEEEcc--cccccCcceEEEEEeeccCCc Confidence 11 11 233689999999888542 333321211 0111111 11222211 122222222111111110 0 Q ss_pred -ceEeeeccceEEEEeccchhhhhhceeeEEEEeeeC-cEEecccceEEEEee Q lcl|Aclame:pro 341 -AITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFD-VKATDSEALVAGSFS 391 (408) Q Consensus 341 -~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d-~~v~~~~a~~~l~~~ 391 (408) .... -.+.+...+.. 
...-.+..-+..|.+ +.+.+|.||+.++.- T Consensus 290 t~~~~---~p~~f~~lpvq---~~~~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 336 (336) T protein:vir:78 290 TATCG---FTEKMRAHSIE---RYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) T ss_pred ceeee---cchhhhcccee---ecCceeEeccccceeeeeeeccchheeeccC Confidence 0000 01111111110 111223334555654 478899999999866 No 216 >protein:vir:3783 Length: 336 # NCBI annotation: capsid # Family: family:all:201 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536823;genbank:gi:17981832;genbank:GeneID:929211 Probab=78.77 E-value=0.11 Score=25.82 Aligned_cols=285 Identities=10% Similarity=0.055 Sum_probs=134.1 Q ss_pred HHHHHHHHHhhcchhhHHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeecccCccceEEeeccCCc Q lcl|Aclame:pro 92 KFVKDFVNMVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVT 171 (408) Q Consensus 92 ~~~~a~~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~ 171 (408) .-+..|..++..-.... .+. ......+..+.|.+.....+.+.+.+.+.+++.++++++....|........ . T Consensus 1 mtr~~~~~y~~~~A~~n-gv~----~a~~~~~~~Fsv~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~lg~~--g 73 (336) T protein:vir:37 1 MNKQAYYALAAALAKHF-NQP----LDSVLRGESFALKAPEAALLGENIQQRSDFLKGINMVQVAHTKGTKLFGATE--K 73 (336) T ss_pred CcHHHHHHHHHHHHHHh-CCC----hhhhcccceeecCHHHHHHHHHHHHHHHHHhhcCceeecccccceEEeeccC--c Confidence 11122333332211000 000 0001122358899999999999999999999999999999888876644221 1 Q ss_pred cccchhcccccccccccccceeeeechheeeeehHHHHHHHhcch--HHHH-HHHHHHHHHHHHHHHHHHHhhcccccc- Q lcl|Aclame:pro 172 PLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTA--ENIL-AWLSSWIAKKVVVTRNQAIIEVMKAAP- 247 (408) Q Consensus 172 ~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~--~~~~-~~v~~~l~~~~~~~~~~~~~~g~g~~~- 247 (408) +.++-...+.. . ....++.-.+..++.---..|+.+.|+..+ .|+. ..+...+.++++.=.-.-.++|+..+. 
T Consensus 74 ~iagrtdt~r~-r--~~~~l~~~~Y~c~qTn~dt~i~y~~LD~WA~~~d~~~~~~~~~~~r~iALD~i~IGfnG~s~A~~ 150 (336) T protein:vir:37 74 GVTGRKQTGRN-L--ATLDHSQNGYELSETDSGILVNWSLFDSFAIFKDRLVELYSEYFQNQVALDILQIGWNGQSVATN 150 (336) T ss_pred ccccccCCCCC-c--cccCCCCCccEEEEeeeeeeccHHHHHHHhcChhHHHHHHHHHHHHHHhcchhhhcccceeeccC Confidence 22111111111 1 111233344444444444567777777653 2322 222223333333222222344433210 Q ss_pred ----------------------------------------chhhhhhHHHHHHHHHHhhhhhccC--CCEEEEcHHHHHH Q lcl|Aclame:pro 248 ----------------------------------------KKPTIAKFDDVITMINTAVDPAIIA--TSSLLTNQSGLNK 285 (408) Q Consensus 248 ----------------------------------------~~~~~~~~d~i~~~~~~~l~~~~~~--~a~~~~n~~~~~~ 285 (408) ..+.-.+.|.++..+...+++.++. .-++||.++.++. T Consensus 151 TdnPllqDVNkGWlQ~~Re~a~~~v~~~~~~~~g~i~~~G~~gdy~NLDalV~D~~~~I~~~~~~d~dLVvivG~dLla~ 230 (336) T protein:vir:37 151 TTKTDLSDVNKGWLKLLQEQRAANFMTESTKSSGKITIFGDNADYANLDDLAFDLKQGLDFRHQNRNDLVFLVGADLVSK 230 (336) T ss_pred CCCccccccchhHHHHHHhccchhhcccccccCCceEEecCCCCcccHHHHHHHHHhccchHHhcCCCeEEEEchhhhhh Confidence 1112345666554443458887764 5578888877543 Q ss_pred -HHhhhcccC-cee--eccccccCCcccccccceEeeccccccccccCcceEEEEehhcceEeeeccceEEEEec-cchh Q lcl|Aclame:pro 286 -LALVKTAEG-KYL--LEPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTN-IGAG 360 (408) Q Consensus 286 -l~~lkd~~G-~~~--~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~-~~~~ 360 (408) -..|-..+| .|- ...+.. -...++-|+|.+.++. +|... 
+++=-|++.-..+..+..+=.+.+ ...+ T Consensus 231 ~~~~l~~~~~~~PtE~~Aa~~~-~~~k~iGGlpa~~~Pf--fP~~~-----~lVT~L~NLsIY~Q~gs~RR~~~d~p~r~ 302 (336) T protein:vir:37 231 ETKLIQQKHGLTPTEKAALGSH-NLMGSFGGMNAITPPN--FPARA-----AAVTTLKNLSVYTEAESVRRSLRNDEDKK 302 (336) T ss_pred hhhhhhhhcCCCHHHHHHHHHH-HHHHhhCCceEEEccc--cCCCc-----eEEeeccccEEEEecCcEEEEEEEccccc Confidence 122323222 221 000000 0125789999998653 56533 455555554344444443333222 1122 Q ss_pred hhhhceeeEEEEeeeCcEEecccceEEEEeeccccCCCC Q lcl|Aclame:pro 361 AFETDTTKIRVIDRFDVKATDSEALVAGSFSAIADQVGN 399 (408) Q Consensus 361 ~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~~ 399 (408) .++.+ -..--|..|-++..++.+....+....-- T Consensus 303 rie~y-----~s~Ne~YvVEd~~~~a~iE~i~v~~~~e~ 336 (336) T protein:vir:37 303 GLVTS-----YYRQEGYVVEDLGLMTAIDHTKVKLNGEV 336 (336) T ss_pred cccch-----hhhcceeeeeccccEEEeeeeeeeccccC Confidence 22221 11223556777777777765555432222 No 217 >protein:vir:95451 Length: 313 # NCBI annotation: hypothetical protein ORF044 # Family: family:all:11728 # MgeID: mge:1570 # MgeName: PA11 # Cross-refs: genbank:acc:YP_001294637;genbank:gi:149408203;genbank:GeneID:5237018 Probab=75.20 E-value=0.15 Score=25.11 Aligned_cols=271 Identities=15% Similarity=0.118 Sum_probs=124.5 Q ss_pred hhccccccCceecchhhhhhhhhhhhhhhhhhhhhc-eeecccCccceEEeeccCCccccchhcccccccccccccceee Q lcl|Aclame:pro 116 ETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVR-VESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTII 194 (408) Q Consensus 116 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~-~~~~~~~~g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v 194 (408) +.. ++..-.+.+...++..|...+.+...-..+.+ +...+ ..-++.++...+. ..-...|.++.. 
.+...-++| T Consensus 1 ~~~-TSNT~A~I~SE~~s~~I~~~LH~~LL~~~~~R~V~DF~-~G~~L~I~tiGs~--~~~~~~E~~~~~-~~~i~TGEI 75 (313) T protein:vir:95 1 MQL-TSNTRAFIESEQYSKFILLNLHDGLLPETFYRNVSDFG-SGETLHIKTIGSV--TLQEAEEDTPLI-YNPIETGEI 75 (313) T ss_pred Ccc-cccchheehhhhHHHHHHHHhhccccchhhhhhhccCC-CCCEEEecccCce--eeeccccCCCee-ecccccceE Confidence 221 22222344555666667666665543344455 33333 3346666655433 322233333332 233344678 Q ss_pred eechheeeeeh-HHHHHHHhcchH--HHHHHHHHHHHHHHHHHHHHHHhhc-------ccc------ccc------hhhh Q lcl|Aclame:pro 195 KYLIKRYAGII-TATNTSLKDTAE--NILAWLSSWIAKKVVVTRNQAIIEV-------MKA------APK------KPTI 252 (408) Q Consensus 195 ~~~~~~~~~~~-~iS~ell~ds~~--~~~~~v~~~l~~~~~~~~~~~~~~g-------~g~------~~~------~~~~ 252 (408) ++-...+.+-. .||+.|-+|+-. .+.+.+..+-++++....+.-++.- ..+ ... ..+. T Consensus 76 t~~i~~Y~G~A~~vt~~LR~D~~~I~~~~A~~~AE~~RAI~E~~~TD~L~~G~~~FA~~~~P~~vNG~PH~~V~~~T~~~ 155 (313) T protein:vir:95 76 TFQITEYKGDAWYVTDDLREDGTDIDRLMAERAAESTRAIQETFETDFLKTGAEYFAANPGPHNVNGFPHVIVSAETNGV 155 (313) T ss_pred EEEEEeecCChhhhhhhhhhcchhHHHHhhhcchhhHHHHHHHHhhHHHhhchhhhccCCCCcccccccceEEeccCCce Confidence 88888877755 799999999652 2333334444455555555544421 110 000 1111 Q ss_pred hhHHHHHHHHHHhhhhhcc--CCCEEEEcHHHHHHHHhhh------cccCceeeccccccCCc--ccccccceEeecccc Q lcl|Aclame:pro 253 AKFDDVITMINTAVDPAII--ATSSLLTNQSGLNKLALVK------TAEGKYLLEPDPTKPNS--YLIKGKQVIVVADRW 322 (408) Q Consensus 253 ~~~d~i~~~~~~~l~~~~~--~~a~~~~n~~~~~~l~~lk------d~~G~~~~~~~~~~~~~--~~l~G~pv~~~~~~~ 322 (408) ....+++. ++........ .+-++++.|+....|..+. ..+|+.++..++.-+.. ..+.|..+.+. +.. 
T Consensus 156 ~~~~~~~~-~~~~~~~a~~P~~G~v~IvDP~~~~~L~~l~~It~~vt~~~k~I~ESG~A~~~~Fi~~~YG~Di~~S-N~L 233 (313) T protein:vir:95 156 FALKHLIA-MRLAFDKANVPAEGRVFIVDPVAEATLNGLVTITHDVTDFGKMILESGMARGQRFIMNLYGWDILTS-NRL 233 (313) T ss_pred ehhhHHHH-hhhhhhhccCCccceEEEEcchhhhhhhhhheeecccccccceeeeccCCchhHHHHHHhhhhhhhh-hhh Confidence 22223322 2232322221 3457899999988877663 23566666544333322 45677776542 211 Q ss_pred cccc---ccCcceEEEEehhcceEeeeccceEEEEe--c----cchhhhhhceeeEEEEeeeCcEEecccceEEEEeecc Q lcl|Aclame:pro 323 LPNT---GSTVYPLYYGDMSQAITLFDRENMSLLPT--N----IGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFSAI 393 (408) Q Consensus 323 ~~~~---~~~~~~~~~gd~~~~~~~~~~~~~~i~~~--~----~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~ 393 (408) -..+ +.....-++|++--.+....-.++-..+- + ...+.-..+....++ |+|..+.+-+..+++--.++ T Consensus 234 ~~AN~~D~~tT~~G~~~NlFM~i~D~~~~P~~~AWr~MP~s~~~~~~~~~~~~~~~~~--R~G~Gi~R~~~L~~~~~~A~ 311 (313) T protein:vir:95 234 HVANYNDGTTTGNGYVGNLFMCILDDQTKPIMGAWRRMPKSEGERNKDRARDEHVVRC--RYGFGIQRLDTLGLLATSAT 311 (313) T ss_pred hhccccccccccCceeeeeeeeeecccccceeeeeccccccccccccccccccceeee--eecccceeecceeEEEeccc Confidence 1111 01111124444322221111111111110 0 011112345555555 67777777777777643333 Q ss_pred cc Q lcl|Aclame:pro 394 AD 395 (408) Q Consensus 394 ~~ 395 (408) += T Consensus 312 ~~ 313 (313) T protein:vir:95 312 AY 313 (313) T ss_pred cC Confidence 32 No 218 >protein:vir:80835 Length: 464 # NCBI annotation: putative major capsid protein # Family: family:all:2450 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504125;genbank:gi:158079312;genbank:GeneID:5666484 Probab=74.95 E-value=0.15 Score=25.07 Aligned_cols=312 Identities=13% Similarity=0.067 Sum_probs=119.8 Q ss_pred ccccccccccchhhhHHHHHHHHHHHhhcchhhHHHHHHHHhhccccccCceecchhhhhhhhhhhhhhh--hhhhhhce Q lcl|Aclame:pro 75 REEEKGPLNKSENELKDKFVKDFVNMVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYD--SLQQYVRV 152 (408) Q Consensus 75 ~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~--~l~~~~~~ 152 (408) -.+.+. 
.....+...++..++|. .+.+. ...+-.+|+.+=-+.+..+|..+..... .+..-+.+ T Consensus 1 ~~~~~n-~~~~~~~~~e~~~Ks~t----tgy~~---------~p~~q~~~~AlRrEsL~~~i~~Lt~~~~~f~f~~di~k 66 (464) T protein:vir:80 1 MTEKKN-TERQLTSVQEEVIKGFT----TGYGI---------TPESQTDAAALRREFLDDQITMLTWADGDLSFYRDITK 66 (464) T ss_pred CCcchh-hHhhcCcccHHHHHHHH----hCCcc---------CcccccCcchhhhhhhhhhhheeeecccchhhhhhcCC Confidence 000000 00000001111122221 11000 0011123344333334444433332222 22222344 Q ss_pred eecccCccceEEeeccCCccccchhcccccccccccccceeeeechheeeeehHHHHHH-HhcchHHHHHHHHHHHHHHH Q lcl|Aclame:pro 153 ESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTS-LKDTAENILAWLSSWIAKKV 231 (408) Q Consensus 153 ~~~~~~~g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~el-l~ds~~~~~~~v~~~l~~~~ 231 (408) .+..+.-..|......+..+...+++|++-.+ .+++.+.+.+...+-++..--+|.-+ |.++..+-.....+.-...+ T Consensus 67 ~~a~STV~~y~~~~~~G~~g~~~f~~E~g~~~-~~d~~~~Rr~~~~Kfl~~~r~vsia~~lvn~~~d~~~~~~~dai~~v 145 (464) T protein:vir:80 67 RPATSTVAKYDVYLAHGRVGHTRFTREIGVAP-ISDPNLRQKTVNMKYVSDTKNMSIATGLVNNIEDPMRILTDDAISVV 145 (464) T ss_pred chhhhhhhhhheeeccCccccccccccccccc-cCCCceEEEEEEeeeeecceeeeeehhhhcchhhHHHHHHHHHHHHH Confidence 44444334444333445667778899999876 67799999999888777544333333 33445566677777888889 Q ss_pred HHHHHHHHhhccccccch---hhhhhHHHHHHHH---------------------HHhhhhhccCCCEEEEcHHHHHHH- Q lcl|Aclame:pro 232 VVTRNQAIIEVMKAAPKK---PTIAKFDDVITMI---------------------NTAVDPAIIATSSLLTNQSGLNKL- 286 (408) Q Consensus 232 ~~~~~~~~~~g~g~~~~~---~~~~~~d~i~~~~---------------------~~~l~~~~~~~a~~~~n~~~~~~l- 286 (408) +..++.+.+.|+..-.+. 
....-+|.|.+++ ...+..+|....-++|+.-+.+.+ T Consensus 146 a~tiE~a~FyGds~l~~~~~~~~gleFDGl~~lI~~~NViDarG~~Ls~~~ln~Aa~~i~~~fGt~TD~~lp~~v~a~f~ 225 (464) T protein:vir:80 146 AKTIEWASFYGDSDLSENPDAGSGLEFDGLAKLIDKHNVLDAKGASLTEALLNQASVLVGKGYGTPTDAYMPIGVQADFV 225 (464) T ss_pred HHHHHHHHhhhccccCCCCCCccccchhhhHhhcCCCceeecCCCCcCHHHHhhhhhhhhcccCChhhcccchhHHHHHH Confidence 999999999998865542 2223466654332 122233444444456677666553 Q ss_pred HhhhcccCceeeccccccCCcccccccceEeecc-----ccccccccCcceEEEEehhcce-E-eeeccceEEEEeccch Q lcl|Aclame:pro 287 ALVKTAEGKYLLEPDPTKPNSYLIKGKQVIVVAD-----RWLPNTGSTVYPLYYGDMSQAI-T-LFDRENMSLLPTNIGA 359 (408) Q Consensus 287 ~~lkd~~G~~~~~~~~~~~~~~~l~G~pv~~~~~-----~~~~~~~~~~~~~~~gd~~~~~-~-~~~~~~~~i~~~~~~~ 359 (408) ...-+ .++.+.. ..++....|+||--.-. ..-+++..+...++ |+++-. . ...---++....+... T Consensus 226 n~~l~--~q~~~~~---~n~~~~~~G~~v~~f~sa~G~i~L~~s~~m~~~~~l--d~~~~~~~~apaapsvt~tv~~~~~ 298 (464) T protein:vir:80 226 NQQLD--RQVQVIS---DNGQNATMGFNVKGFNSARGFIRLHGSTVMELEQIL--DENRMQLPNAPQKATVKATLEAGTK 298 (464) T ss_pred hhhcC--ceeEEEc---CCCCcceeeeecccccccccceeccCccccCccccc--ccccccCCCCcCCceeEEEecCCcc Confidence 33222 2333322 11112234444421100 00001100000000 000000 0 0000112222222211 Q ss_pred hhhhh----ceeeEEEEeeeCcEEecccceEEEEeeccccCCC------CccCCCcccC Q lcl|Aclame:pro 360 GAFET----DTTKIRVIDRFDVKATDSEALVAGSFSAIADQVG------NFKTTTSTAV 408 (408) Q Consensus 360 ~~f~~----~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~------~~~~~~~~~~ 408 (408) ..|.. ..+.+++...-+..---|..++-.++...+..+. 
...+..++=| T Consensus 299 g~f~~~~~~~~~~Ykv~~vn~~GeS~ps~~~~~ti~~~~~~V~l~it~~~~~~~~p~yv 357 (464) T protein:vir:80 299 GKFRDEDLTIDTEYKVVVVSDDAESAPSDVASVVIDDKKKQVKLEITINNMYQARPQYV 357 (464) T ss_pred cCCccccccceeEEEEEEECCCCccccceeeeeeecCcccEEEEEEEeCCccccccceE Confidence 12221 1112222222222222222222222222222111 1111100111 No 219 >protein:vir:1084 Length: 437 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076738;genbank:gi:13095848;genbank:GeneID:920418 Probab=72.74 E-value=0.18 Score=24.68 Aligned_cols=361 Identities=10% Similarity=0.025 Sum_probs=72.8 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhc-------ccHHHHHHHHHHHHHHHHHHHHH-------------- Q lcl|Aclame:pro 1 MGVKLTVNQLNEAWIASGDKVTDFNDQINMALNDDN-------FSAEAMSELKNKRDNEKVRRDAL-------------- 59 (408) Q Consensus 1 M~~~~~i~el~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~-------------- 59 (408) =-++..++++++++.+..++++++.++.....++.+ ...+++.+++++++.+....+.. T Consensus 4 ~elk~el~~~~~el~~~~~elr~~~~~~~~~~~el~~~~~e~~~~~~ei~el~~~l~~~~~~~~~~~e~~~~~~~~~~~e 83 (437) T protein:vir:10 4 EKLKKDLATKTAELNTKKAEIRSFTESEDKTIDEVKAGMTEIKEKEDEIKEIRSNIEVLEQASALKVEEKRDDSDLVAPE 83 (437) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 112223333333333333333222222111111111 11112222222221111110000 Q ss_pred -------------HHHHHHHHHHH---h-----hhc--ccccccccccchhhhHHHHHHHHHHHhhcchhhHHHHHHHHh Q lcl|Aclame:pro 60 -------------REQLVEAQAEQ---V-----VNM--REEEKGPLNKSENELKDKFVKDFVNMVRNPMAFMNTVSSKTE 116 (408) Q Consensus 60 -------------~~~~~~~~~~~---~-----~~~--~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~a~ 116 (408) +.......... . ... ..................+..+|.+++..+............ 
T Consensus 84 ~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~ 163 (437) T protein:vir:10 84 LEENSADNEEDDPEKLKTETKSEAEKDKKTVKDEEKRDAGGLQDMKLKVGGEIADKKVTAFADYLKTGEVRDVTGIALKD 163 (437) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHhHHHHHHHHHHHHhhhhhhHHHHHhhhhhhhhhccccc Confidence 00000000000 0 000 000000001111112223344555555443221111100000 Q ss_pred hc-cccccCceecchhh-hhhhhhhhhhhhhhhhhhceeecccCc-cceEEeeccCCccccchhccccccccccccccee Q lcl|Aclame:pro 117 TS-GSDSAAGLTIPQDI-RTMINTLVRQYDSLQQYVRVESVSTSN-GSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTI 193 (408) Q Consensus 117 ~~-~t~~~gg~~vP~~~-~~~ii~~~~~~~~l~~~~~~~~~~~~~-g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~ 193 (408) .. ..+..-...|-... ...+...+.- .++......+++.... +...+....+. ..|... +..+..+|.. T Consensus 164 ~g~lvp~~~~~~i~~~~~~~~l~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~e~~~------~~e~~~-~~~~~v~~~~ 235 (437) T protein:vir:10 164 GKVIIPETILTPEKEVHQFPRLGSLVRT-ESVTTTTGKLPIFNNSTDLLTAHTEYGQ------TTKNAT-PVITPILWDL 235 (437) T ss_pred ccccchHHHHHHHHHhhhhhhhhhccee-EeeccCceeeEEeecccccccccccccc------cccccc-ccceeeeeeh Confidence 00 00000000000000 0011111110 0111111112221111 11111111111 111111 1122222222 Q ss_pred eeechh-eeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhccccccchhhhhhHHHHHH-HH--------- Q lcl|Aclame:pro 194 IKYLIK-RYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVMKAAPKKPTIAKFDDVIT-MI--------- 262 (408) Q Consensus 194 v~~~~~-~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~~g~g~~~~~~~~~~~d~i~~-~~--------- 262 (408) ..+... ++.- --+.+....-..+ +...|...++......+=...-+|...+....+.....+++. 
.+ T Consensus 236 ~k~~~~~~is~-ell~ds~~~~~~~-i~~~l~~~~~~~~~~~i~~g~g~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~ 313 (437) T protein:vir:10 236 KTYTGGYVFSQ-ELISDSSYDWQAE-LQSRLIELRDNTDDSLIITALTDGIKKTTSTYLLGDLKKVLNVTLKPQDSAAAS 313 (437) T ss_pred hheeeehhhhH-HHHhhhHHHHHHH-HHHHHHHHHHHHHHHHHhhhhcccccccccccchhhHHHHHHhhhhhhhhcCCE Confidence 111110 0100 0011111111111 444555555555555543333333333322222222222211 10 Q ss_pred -------HHhhhhhccCCCEEEEcHHHHHHH-HhhhcccCceeecc-ccccCCcccccccceEeeccccccccccCcceE Q lcl|Aclame:pro 263 -------NTAVDPAIIATSSLLTNQSGLNKL-ALVKTAEGKYLLEP-DPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPL 333 (408) Q Consensus 263 -------~~~l~~~~~~~a~~~~n~~~~~~l-~~lkd~~G~~~~~~-~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~ 333 (408) ...|..--..+..+++.|+.-... ..| .|+|+... +...+. ..-...|+++-+ .. ..+ T Consensus 314 ~~~~~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l---~G~pv~~~~~~~~~~-~~~~~~~~~~gd------~~---~~~ 380 (437) T protein:vir:10 314 IVMSQSAYNLFDMATDAMGRPLLQPNVTAATGYTL---LGKTVVIVDDKLFPS-ASAGDVNIVVAP------LK---KAV 380 (437) T ss_pred EEEcHHHHHHHHHhhccCCCeeeccCccCCCCccc---ccceeEEecccccCC-cCCCceEEEEee------cc---ccE Confidence 000111112233445544321100 011 46665432 110000 001112232211 00 001 Q ss_pred EEEehhcceEeeeccceEEEEeccchhhhhhceeeEEEEe---e-eCcEEec--ccceEEEEeeccccC Q lcl|Aclame:pro 334 YYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVID---R-FDVKATD--SEALVAGSFSAIADQ 396 (408) Q Consensus 334 ~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~---r-~d~~v~~--~~a~~~l~~~~~~~~ 396 (408) .++|-+..-+-. ... .+.. +....+.+|+.. + -.++.+- ..++.. ...|+. 
T Consensus 381 ~~~~r~~~~~~~--~~~---~~~~----~~~~~~~~r~d~~~~~~~a~~~l~~~~~~~~~---~~~~~~ 437 (437) T protein:vir:10 381 INFKLTEITGQF--QDT---YDIW----YKQLGIFLRQNVVQASKDLIVNLTGKLKAVTV---VQSTAV 437 (437) T ss_pred EEEeeeceEEEE--ecc---cccc----cceeeEEEEEccEEecccceEEEEeecccccc---CCCCCC Confidence 122211110000 000 0000 000011111100 0 1111111 122111 111111 No 220 >protein:vir:79712 Length: 285 # NCBI annotation: major capsid protein gp34 # Family: family:all:701 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285883;genbank:gi:148750840;genbank:GeneID:5220414 Probab=72.03 E-value=0.19 Score=24.56 Aligned_cols=260 Identities=10% Similarity=0.057 Sum_probs=95.4 Q ss_pred hhccccccCceecchhhhhhhhhhhhhhhhhhhhhce-----eecccCccceEEeeccC-Cccccchhcccccccccccc Q lcl|Aclame:pro 116 ETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRV-----ESVSTSNGSRVYEKWTD-VTPLTVMDAEDGKIPDLDNP 189 (408) Q Consensus 116 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~-----~~~~~~~g~~~~~~~~~-~~~~~~~~~E~~~~~~~~~~ 189 (408) |+. -.-+.+...|.+.....+....+.+. +.. .+..++.+|+..+ .....+-.+-|-...+.+ . T Consensus 1 Mai--------n~~~k~~~~ld~~~~~~~~~~~l~~~~n~~~~~~-~gak~VkIp~ist~~gl~dY~R~~g~~~g~v~-~ 70 (285) T protein:vir:79 1 MTV--------VLDSKDLARIDEEYKADSQVWSYLTGGNGVTQRF-RGHNEVRINKLSGFVDATAYKRGQDNARKTIS-V 70 (285) T ss_pred Ccc--------hhhHHHHHHHHHHHHHhhhhhhhcccCCcceeEe-cCCCEEEEeeecccccccccccccCccccccc-e Confidence 110 01123444455544444333333211 111 1223566776543 223333333333322222 2 Q ss_pred cceeeeechheeeeeh----HHHHHHHhcchHHHHHHHHHHH-HHHHHHHHHHHHh----hcccc--ccchhhhhhHHHH Q lcl|Aclame:pro 190 QLTIIKYLIKRYAGII----TATNTSLKDTAENILAWLSSWI-AKKVVVTRNQAII----EVMKA--APKKPTIAKFDDV 258 (408) Q Consensus 190 ~f~~v~~~~~~~~~~~----~iS~ell~ds~~~~~~~v~~~l-~~~~~~~~~~~~~----~g~g~--~~~~~~~~~~d~i 258 (408) ++...+++-.+--.+. .+- | . -... 
.+.+..++ .+.+.-.+|.-.+ .+.++ ....+....++.+ T Consensus 71 ~~et~tl~~DR~~~f~iD~mDvd-E--n-~~~~-~~ni~~ef~~~~vvPEiDayrfskla~~a~~~~~~~~T~~nv~~~i 145 (285) T protein:vir:79 71 GKETVKLTHEDWFGYDLDQFDMD-E--N-GAYT-VENVVREHNKMITIPHRDKVAVQKLFDSAAKKATDSITKDNALDAY 145 (285) T ss_pred eeeEEEeeccccceecccccchh-h--h-hhhh-HHHHHHHHHhhhhcchhhHHHHHHHHhhcccccccccCHHHHHHHH Confidence 3444444444433321 111 1 0 0111 11222221 2222122222111 11111 1111222234455 Q ss_pred HHHHHHhhhhhccCCCEEEEcHHHHHHHHhhhcccCceee-cccc---ccCCcccccc-cceEeeccccccccccCcc-e Q lcl|Aclame:pro 259 ITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLL-EPDP---TKPNSYLIKG-KQVIVVADRWLPNTGSTVY-P 332 (408) Q Consensus 259 ~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~lkd~~G~~~~-~~~~---~~~~~~~l~G-~pv~~~~~~~~~~~~~~~~-~ 332 (408) ..++...-+.....+.+++|+|.++..|.+-+.=+...-. +... .......|.| .|++.+++..+.+....+. . T Consensus 146 ~~~~~~lde~~vp~~rvl~vTp~~~~~Lk~s~~~~r~~~~~~~~~~~~i~~~V~~lDg~v~ii~Vps~r~kt~~~~k~In 225 (285) T protein:vir:79 146 DTAEAYMFDNEVPGGFVMFVSSAYYTALKQSAAVTRTFSTDGTMVINGIDRRVAQLDGGVPIVRVSSDRLKGLGITNHVN 225 (285) T ss_pred HHHHHHHHHcCCCCceEEEEChHHHHHHHhhhhhheecccccceeccceeeeeccccceeEEEEcchhhccCcCcchhcc Confidence 5555433333344566788999998876644321111111 1101 1122467998 8999988776765433222 2 Q ss_pred EEEEehhcceEeeeccceEEEEeccchhhhhhceeeEEEEeeeCcEEeccc--ceEEEEeecc Q lcl|Aclame:pro 333 LYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSE--ALVAGSFSAI 393 (408) Q Consensus 333 ~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~--a~~~l~~~~~ 393 (408) +++...+ +..-...-+..-..+|..... 
-|...+.-..++|.=|.+.+ ++.+-.-+++ T Consensus 226 fiiv~~~-a~i~~~K~~~~~~f~P~~~~~--~d~~~~~~R~Y~d~fv~~nk~~~Iy~~~~a~~ 285 (285) T protein:vir:79 226 FILTPLS-AIAPIVKYDSVSVIDPSTDRS--GNRWTIKGLSYYDAIVLDNAKKGIYVAATAGV 285 (285) T ss_pred EEEecCc-eeccceeeeeeEeECCCCCCC--cceeeeeeeeeeeeeehhhccceeeeeecccC Confidence 3444433 222222222222233332111 11122223334555555542 2222111111 No 221 >protein:vir:106734 Length: 336 # NCBI annotation: gp13 # Family: family:all:1653 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944321;genbank:gi:38638620;genbank:GeneID:2657363 Probab=70.64 E-value=0.21 Score=24.34 Aligned_cols=295 Identities=7% Similarity=-0.075 Sum_probs=115.8 Q ss_pred HHHHHHHHhhhcccccccccccchhhhHHHHHHHHHHHhhcchhhHHHHHHHHh----hccc-cccCceecchhhh---- Q lcl|Aclame:pro 63 LVEAQAEQVVNMREEEKGPLNKSENELKDKFVKDFVNMVRNPMAFMNTVSSKTE----TSGS-DSAAGLTIPQDIR---- 133 (408) Q Consensus 63 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~a~----~~~t-~~~gg~~vP~~~~---- 133 (408) +.+.+.......-. ....... .. ........++ ..+. .+.+...||..+. T Consensus 1 ~~~~~~~~~l~~~g-----i~~~~~~--~~--------------~~~~~~~~a~da~d~~~~~~t~~~~g~~~~l~~~i~ 59 (336) T protein:vir:10 1 MRDAQRIQNLARAG-----VILPRSV--KN--------------VSTPLAEYAMDAADLSPHLSSTGSSGIPNYLTTYVD 59 (336) T ss_pred CchHHHHHHHhccC-----eecchhh--hh--------------hhHHHHHHHHhhhhhccccccCCCcchHHHHHhhcC Confidence 11100000000000 0000000 00 0000000111 0111 1111122565443 Q ss_pred hhhhhhhhhhhhhhhhhceeecccCccceEEeeccCCccccchhcccccccccccccceeeeechheeeeehHHHHHHHh Q lcl|Aclame:pro 134 TMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLK 213 (408) Q Consensus 134 ~~ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ 213 (408) +.+++.+.....+..++.+.+...-.-...+.......+.+.+.+.....|-.+ ...+...-+.+.++....++..=+. 
T Consensus 60 p~~~~~~~~~~~~~~l~~v~t~g~w~~~~~~~~~~e~~G~a~~ygd~~d~P~~d-~~~~~~~~~v~~~~~g~~yg~~El~ 138 (336) T protein:vir:10 60 PSVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTKVATYGDYSSDGDSG-TNINYPQRQSYFFQTWTRWGERELE 138 (336) T ss_pred cceeeeeechhchhhhcccccCCCcceeeEEEEeeeeeeeEEEccccCCCccee-eeeeeeeeeEEEEEEEEeeCHHHHH Confidence 344555555544444444433211111222222334445556677777777544 3455556667777777777743333 Q ss_pred c---chHHHHHHHHHHHHHHHHHHHHHHHhhcccccc------------------chhhhhhHHHHHHHHHHh---hhhh Q lcl|Aclame:pro 214 D---TAENILAWLSSWIAKKVVVTRNQAIIEVMKAAP------------------KKPTIAKFDDVITMINTA---VDPA 269 (408) Q Consensus 214 d---s~~~~~~~v~~~l~~~~~~~~~~~~~~g~g~~~------------------~~~~~~~~d~i~~~~~~~---l~~~ 269 (408) . ...++.+--....++++.+.+|.-.+.|+.... +.-..++++.|++.+... +... T Consensus 139 ~A~~~g~~l~~~Ka~aA~~ale~~~N~~~~~Gd~~~~~~GllN~P~l~a~~t~~~~~w~~~T~~eI~~Di~~~~~~l~~q 218 (336) T protein:vir:10 139 MAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQVLQTQ 218 (336) T ss_pred HHHHhCCCcHHHHHHHHHHHHHHhhCeEEEEeecccceEEEeecCCCCcccccCcCcccccCHHHHHHHHHHHHHHHHHh Confidence 2 234666666777777777778877777765321 000112334444333221 2111 Q ss_pred ----cc--CCCEEEEcHHHHHHHHhhhcccCceeeccccccCCcccccccceEeeccccccccccCcceEEEEehhcceE Q lcl|Aclame:pro 270 ----II--ATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAIT 343 (408) Q Consensus 270 ----~~--~~a~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~ 343 (408) .. ....++++++.+..|..- +..|.-++. 
-+....| ++.+...+ -+.+.+++ ...+|.+-...-- T Consensus 219 t~g~i~~~~~~tL~Lp~~~~~~L~~~-n~~g~tv~~-~lk~n~P----nl~i~t~p--el~~Agg~-~~~~~~~~~~~~~ 289 (336) T protein:vir:10 219 SQGIITQEAVLHMGLPPTAMSDLSKT-NQYGLSAAA-KLKEIFP----KLEFVTIP--EYDTASGR-LVQLWAPRVEGKD 289 (336) T ss_pred cCCeeeeccceEEEechHHHHhccCC-CccCccHHH-HHHHhCC----ccEEEEcc--cccccCCc-eEEEEEecccCCc Confidence 11 233689999999888542 333321211 0111111 11222211 12222222 2222222110000 Q ss_pred ee-eccceEEEEeccchhhhhhceeeEEEEeeeC-cEEecccceEEEEee Q lcl|Aclame:pro 344 LF-DRENMSLLPTNIGAGAFETDTTKIRVIDRFD-VKATDSEALVAGSFS 391 (408) Q Consensus 344 ~~-~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d-~~v~~~~a~~~l~~~ 391 (408) .+ ..-...+...+.. ...-.+...+..|.+ +.+.+|.||+.++.- T Consensus 290 t~~~~~P~~f~~lpvq---~~~~~~~v~~~~rt~Gv~i~rP~ai~~~~GI 336 (336) T protein:vir:10 290 TATCGFTEKMRAHSIE---RYSSYFRQKKSAGTWGAVIFRPFAVAQMLGV 336 (336) T ss_pred ceeeecChhhhcccee---ecCceeEeccccceeeeeeeccchheeeccC Confidence 00 0000001111110 011223334555654 578899999999866 No 222 >protein:vir:10364 Length: 390 # NCBI annotation: head protein; major capsid subunit precursor # Family: family:all:585 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858956;genbank:gi:32128421;genbank:GeneID:2648357 Probab=68.12 E-value=0.24 Score=23.96 Aligned_cols=367 Identities=8% Similarity=-0.061 Sum_probs=110.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccch Q lcl|Aclame:pro 7 VNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKGPLNKSE 86 (408) Q Consensus 7 i~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 86 (408) |+|++ +++.+++.++.++++...+......+...+.+.+++++.++++.+++++++++................... 
T Consensus 1 m~e~~---~~l~~~~~~~~~~~~~~~e~~~~~~~~~~e~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~ 77 (390) T protein:vir:10 1 MTDIT---SKLEATLANVTDSLRAFGERAVRDGELNASARSKVDELFATVGNLSAEVQAARQRVAELEGNGAGGDVQHVS 77 (390) T ss_pred ChHHH---HHHHHHHHHHHHHHHHHHHHHHhhcccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccc Confidence 55444 445556666666776666655544444567777888888888888888887666544333322222222222 Q ss_pred hhhHHHHHHHHHHHhhcchhhHHHHHHHHhhccccccCcee--cchhhhhhhhhhhhhhhhhhhhhceeecccCccceEE Q lcl|Aclame:pro 87 NELKDKFVKDFVNMVRNPMAFMNTVSSKTETSGSDSAAGLT--IPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVY 164 (408) Q Consensus 87 ~~~~~~~~~a~~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~--vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~~~~ 164 (408) ..........+..+........................+.. -...+..+++..+ +..+....++...-.. + T Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~i-----i~~~~~~~~l~~~~~~--~ 150 (390) T protein:vir:10 78 VGDLFVASEQFQASAGRWNDRSARATMNIKAALNTASTDAAGSAGALTTPNRLPGF-----ITQPDARLTVRDLIGS--G 150 (390) T ss_pred hhhhhhhhHHHHHHHHhhhhhhhhhhhHHHHHHHhhhcccccccccccchhHHHHH-----HHHHHhhchhhhhcce--e Confidence 21122222334444333332222221111111111111110 1111222333222 1111111122111111 1 Q ss_pred eeccCCccccchhcc---cccccccccccceeeeechheeeeeh--HHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 165 EKWTDVTPLTVMDAE---DGKIPDLDNPQLTIIKYLIKRYAGII--TATNTSLKDTAENILAWLSSWIAKKVVVTRNQAI 239 (408) Q Consensus 165 ~~~~~~~~~~~~~~E---~~~~~~~~~~~f~~v~~~~~~~~~~~--~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~ 239 (408) +.....-....+.+. ..-..+.....-...++....+...- .+.. 
+.+....-...+...+...++.++...+ T Consensus 151 ~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~--is~ell~d~~~l~~~i~~~l~~~~~~~~ 228 (390) T protein:vir:10 151 RTDSALIEYVQETGFVNNAAIVAEGALKPESSLKFAKKTDTTHVIAHTMK--ATRQILSDAPQLASYMNNRLIRGLKVKE 228 (390) T ss_pred eccCCceEEEEEecCCcceeeecCCccccccccceeEEEEeeEEEEEeeh--hhHHHHHhHHHHHHHHHHHHHHHHHHHH Confidence 111111111112211 11111111111112233333222221 1111 1111112123445555555555555544 Q ss_pred hhccccccchhhhhhHHHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHhhhcccCc---eeeccc-------cccCCccc Q lcl|Aclame:pro 240 IEVMKAAPKKPTIAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEGK---YLLEPD-------PTKPNSYL 309 (408) Q Consensus 240 ~~g~g~~~~~~~~~~~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~lkd~~G~---~~~~~~-------~~~~~~~~ 309 (408) -...=.|... ......|+...... ......+....-......+..+...... .++.+. +.+. T Consensus 229 ~~~il~G~G~--~~~p~Gi~~~~~~~--~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~v~n~~~~~~L~~lkd~---- 300 (390) T protein:vir:10 229 DAEILRGTGA--NDGLLGLIPQATTY--AAPTTIAGATRVDQLRLAMLQASLAEYPASGIVINPIDWAAIELAKDA---- 300 (390) T ss_pred HHHHhhcCCC--Cccccccccccccc--cccccccccchHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhcC---- Confidence 3332222221 11233333221000 0000011111112223334444443222 222221 1111 Q ss_pred ccccceEeeccccccccccCcceEEEEeh--hcceEeeeccc-eEEEEeccchhhhhhceeeEEEEeeeCc---EEeccc Q lcl|Aclame:pro 310 IKGKQVIVVADRWLPNTGSTVYPLYYGDM--SQAITLFDREN-MSLLPTNIGAGAFETDTTKIRVIDRFDV---KATDSE 383 (408) Q Consensus 310 l~G~pv~~~~~~~~~~~~~~~~~~~~gd~--~~~~~~~~~~~-~~i~~~~~~~~~f~~~~~~~r~~~r~d~---~v~~~~ 383 (408) .|.|++..+....+....| .++++-++ ..-+.+.+... +.+ +.+..+.+......+. ..+.-. 
T Consensus 301 -~g~~l~~~~~~~~~~~l~G-~pv~~~~~~p~~~~~~gdf~~~~~~---------~~~~~~~i~~~~~~~~~~~~~~~~r 369 (390) T protein:vir:10 301 -NNQYLIGNARGTLTPTLWG-LPVVATQAMAPGEFLVGAFDLAAQI---------FDQWDARVEIGYVNDDFQRNMVTVL 369 (390) T ss_pred -CCceeecCCcCcCCceecc-eeeEEcCCCCCCcEEEEeccceEEE---------EEecceEEEEeecccccccCcEEEE Confidence 2444432110000000011 12222222 11111122111 111 1111111111100000 000000 Q ss_pred ceEEEEeeccccCCCCccCCC Q lcl|Aclame:pro 384 ALVAGSFSAIADQVGNFKTTT 404 (408) Q Consensus 384 a~~~l~~~~~~~~~~~~~~~~ 404 (408) ++..+.+...-|..--..+-+ T Consensus 370 ~~~r~d~~v~~~~a~~~~~~a 390 (390) T protein:vir:10 370 AEERLALVVYRPEALISGSFA 390 (390) T ss_pred EEEeeccEEeccccEEEEEeC Confidence 112222332223222222222 No 223 >protein:vir:3870 Length: 400 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680487;swissprot:trembl:q8ltc0;genbank:gi:22296527;interpro:IPR006444;uniprot:Q8LTC0;genbank:GeneID:951713 Probab=64.68 E-value=0.3 Score=23.48 Aligned_cols=364 Identities=11% Similarity=0.034 Sum_probs=104.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccc---c Q lcl|Aclame:pro 5 LTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAM--SELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEE---K 79 (408) Q Consensus 5 ~~i~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~ 79 (408) |+|+|.. +++.+++++++++++.+.++.+...++. ++...+.+++.++++.++.++++++........... . 
T Consensus 1 ~~l~e~i---~e~~~~l~el~~~~~~~~~e~r~~~e~~~~~~~~~~~~e~~~~~~~l~~ei~~l~e~~~~~~~~~~~~~~ 77 (400) T protein:vir:38 1 MTLDEKL---AAVKKQLDEKRSALPAMKTELRSLLEGEDSEENLKKAEGVRAKYDKAGKEIKDLEEKRDLYEAALKGNEQ 77 (400) T ss_pred CChHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 8877654 4445566667666666555443333332 222334456667778887777776654433222211 1 Q ss_pred cccccchhhhHHHHHHHHHHHhhcchhhHHHHHHHHhhccc-------------cccCceecchhhhhhhhhhhhhhhhh Q lcl|Aclame:pro 80 GPLNKSENELKDKFVKDFVNMVRNPMAFMNTVSSKTETSGS-------------DSAAGLTIPQDIRTMINTLVRQYDSL 146 (408) Q Consensus 80 ~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~a~~~~t-------------~~~gg~~vP~~~~~~ii~~~~~~~~l 146 (408) .................+....+.................. ....+. -+.. ...++..-.... + T Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~-gg~~vP~~~~~~-i 154 (400) T protein:vir:38 78 SSGKKPDHPEEHSYRDALNAYLHTRGRNTDGVNFEKTDVGTFAVLRAVPTDASDAVNAGV-KAAD-AASTIPETISNT-P 154 (400) T ss_pred cccccccchhhhhHHHHHHHHHhhHHHHHHHHHHHHHHHHHHhhhhhhhHHHHHHHhhcc-cccC-CcccccHHHHHH-H Confidence 11222223333444555555555443222211111000000 000000 0110 011111111111 1 Q ss_pred hhhhcee-ecccCccceEEeeccCCccc-cchh-c--ccccccccc-cccceeeeechheeeeehHHHHH-HHhcchHHH Q lcl|Aclame:pro 147 QQYVRVE-SVSTSNGSRVYEKWTDVTPL-TVMD-A--EDGKIPDLD-NPQLTIIKYLIKRYAGIITATNT-SLKDTAENI 219 (408) Q Consensus 147 ~~~~~~~-~~~~~~g~~~~~~~~~~~~~-~~~~-~--E~~~~~~~~-~~~f~~v~~~~~~~~~~~~iS~e-ll~ds~~~~ 219 (408) ..+.+.. ++...... ++. .+.... ..+. + ......+.. .+......+..-.+.. 
-++.-- -+.+...+- T Consensus 155 i~~~~~~~~l~~~~~~--~~~-~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~i~~~~-~k~~~~~~is~ell~d 230 (400) T protein:vir:38 155 QRELQTVVDLKPFTNV--FQA-STQKGTYPTVANATTKMVTVAELEKNPAMAKPEFKPVNWSV-ETYRQALPVSQESIDD 230 (400) T ss_pred HHHHHhhhhhhhccee--Eec-cCcceEEEEEecCCCccccccccccccccccccceeeEeeh-hheeeehhhHHHHHhh Confidence 1111111 11111111 111 111111 1111 1 111111111 0001111111111110 011100 000101111 Q ss_pred HHH-HHHHHHHHHHHHHHHHHhhccccccchhhhhhHHHHHHHHHHhhhhhccCCCEEEEcHHHHHH-HHhhhcc--cCc Q lcl|Aclame:pro 220 LAW-LSSWIAKKVVVTRNQAIIEVMKAAPKKPTIAKFDDVITMINTAVDPAIIATSSLLTNQSGLNK-LALVKTA--EGK 295 (408) Q Consensus 220 ~~~-v~~~l~~~~~~~~~~~~~~g~g~~~~~~~~~~~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~-l~~lkd~--~G~ 295 (408) ..+ +...+.+.++.++....-.....+..... ..-+..-+.+.. +....+. ++. T Consensus 231 s~~~~~~~i~~~l~~~~~~~~~~~i~~~~~~~~----------------------~~~~~~~~~~~~~~~~~~~~~~~a~ 288 (400) T protein:vir:38 231 SAIDLVGLIAQNGQQIKVNTTNGAVATLLKGFT----------------------AKTISSVDDLKHINNVDLDPAYSRV 288 (400) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHhhhhcccccc----------------------ccccccHHHHHHHHHhhhhhhhCcE Confidence 111 33334444444333322111111111111 011112222222 1211122 233 Q ss_pred eeeccccccC--CcccccccceEeeccccccccccCcceEEEEehhc---ceEeeeccceEEEEecc--chhhhhhceee Q lcl|Aclame:pro 296 YLLEPDPTKP--NSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQ---AITLFDRENMSLLPTNI--GAGAFETDTTK 368 (408) Q Consensus 296 ~~~~~~~~~~--~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~---~~~~~~~~~~~i~~~~~--~~~~f~~~~~~ 368 (408) +++.+..... .-..-+|.|++.. + ...+....++|-.-. ......-+...+.+-+. .+..|.+..+. 
T Consensus 289 ~v~~~~~~~~l~~lkd~~G~~i~~~-~-----~~~~~~~~l~G~pv~~~~~~~~~~~g~~~~~~gd~s~~~~~~~~~~~~ 362 (400) T protein:vir:38 289 IIASQSFYNFLDTVKDGNGRYLLQD-S-----ILTPSGKSVLGMPIAVVSDDTLGAAGEAHAFLGDIKRAILFANRADFM 362 (400) T ss_pred EEEcHHHHHHHHHhhccCCCeeeec-C-----cCCCCccccccceeEEecccccCCCCceEEEEEeccccEEEEeecceE Confidence 3333211000 0001247776532 1 111111123332200 00000001111111010 01112222222 Q ss_pred EEEEeeeCcEEecccceEEEEeeccccCCCCccCCCccc Q lcl|Aclame:pro 369 IRVIDRFDVKATDSEALVAGSFSAIADQVGNFKTTTSTA 407 (408) Q Consensus 369 ~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~~~~~~~~~~ 407 (408) ++......+.. .=.++..+.++..-+..-..-+.++.| T Consensus 363 ~~~~~~~~~~~-~~~~~~r~d~~~~~~~a~~~l~~~~~a 400 (400) T protein:vir:38 363 VRWVDDQIYGQ-FLQAGMRFGVSVADEKAGYFLTYTPKA 400 (400) T ss_pred EEEecccccce-eEEEEEEeccEEecccceEEEEeecCC Confidence 22221111111 112444555555555553333334444 No 224 >protein:vir:98143 Length: 524 # NCBI annotation: gp23 precursor of major head subunit # Family: family:all:364 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239203;genbank:gi:66391678;genbank:GeneID:3416245 Probab=63.55 E-value=0.32 Score=23.33 Aligned_cols=341 Identities=14% Similarity=0.101 Sum_probs=117.5 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccc Q lcl|Aclame:pro 1 MGVKLTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSE-LKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEK 79 (408) Q Consensus 1 M~~~~~i~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (408) |..+ ++|++++..+.+.-+. ..++.. -+..+ .+++ ++.|.++..... 
T Consensus 1 ~~~~---~~l~~kw~p~l~~~~~---------------~~~i~~~~~~~~---~a~l--lenq~~~~~~~~--------- 48 (524) T protein:vir:98 1 MSKK---NELMEKWNDLLESQEG---------------LPDIATKSKKQL---VAAI--LEAQEKDAETDP--------- 48 (524) T ss_pred Ccch---HHHHHHhHHHhcCCcC---------------cchhcchhhHHH---HHHH--HhhHHHHHhcCc--------- Confidence 6665 3466665554321110 011110 01111 1100 111111111100 Q ss_pred cccccchhhhHHHHHHHHHHHhhcchh----hHHHHHHHHhhccccccCceecchhhhhhhhhhhh---hhhhhhhhhce Q lcl|Aclame:pro 80 GPLNKSENELKDKFVKDFVNMVRNPMA----FMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVR---QYDSLQQYVRV 152 (408) Q Consensus 80 ~~~~~~~~~~~~~~~~a~~~~~~~~~~----~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~---~~~~l~~~~~~ 152 (408) ....+....+|..++....- ....... ..+.++|++ +.+.+.++..+| +.....+++.+ T Consensus 49 -------~~~~~~~~~~~~~~l~ea~~~~~~~~~~~~i----~~s~~t~~v---~~~~P~Li~lvRra~p~LIa~DIwGV 114 (524) T protein:vir:98 49 -------VYRDEKIVESFGGFLAEAEIAGDHNYDQTNI----ASGKSSGAI---TNIGPAVIGMVRRAIPNLIAFDICGV 114 (524) T ss_pred -------cccchHHHHhhhccccccccccccccccccc----ccccccccc---ccccchhhhHHHHHHHhhhhhhhhee Confidence 01112233445444332110 0000000 001111111 112233333343 33344566777 Q ss_pred eecccCccceE-----EeeccCCccc---------------cchh----------------------------------- Q lcl|Aclame:pro 153 ESVSTSNGSRV-----YEKWTDVTPL---------------TVMD----------------------------------- 177 (408) Q Consensus 153 ~~~~~~~g~~~-----~~~~~~~~~~---------------~~~~----------------------------------- 177 (408) .||++++|.+- +........+ ..|. 
T Consensus 115 QPMTgPTGLIFAmRsrY~n~~~~~gteA~~nEAf~~~ye~dt~fSG~g~~t~~s~~~~g~~~~~g~~~~~~~~~~g~~~~ 194 (524) T protein:vir:98 115 QPMTGPTGQVFALRAVYGKDPLAGGTPADVREAFHPMFAPDTMYSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYF 194 (524) T ss_pred ccCCchhhhhhhhheeecCCCCCcccccccccccccccccccccCCccccccccccccccccccccccccccccccceec Confidence 77777665221 1111000000 0000 Q ss_pred ----------------------------------ccccc---------ccccccccceeeeechheeeee-------hHH Q lcl|Aclame:pro 178 ----------------------------------AEDGK---------IPDLDNPQLTIIKYLIKRYAGI-------ITA 207 (408) Q Consensus 178 ----------------------------------~E~~~---------~~~~~~~~f~~v~~~~~~~~~~-------~~i 207 (408) +.+.. ....+...|.++.|+..|..+- ..+ T Consensus 195 ~~~~~g~~~~tgt~p~~~~~a~~~~~~~g~~~~~~~GmsTA~aEaL~~~g~ss~~~f~EMaFsIeKvtVtAKSRaLKAEY 274 (524) T protein:vir:98 195 QNVTSGNVTVTGADPAALDAAVIAENEKGTLAEISVGMATSVAELQENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQY 274 (524) T ss_pred cccccCcccccccccccccccccccccccceeecccccchhhhhhhccCCCCccccccceeeEEEEEEEeeecccccccc Confidence 00000 0001122355566666555543 568 Q ss_pred HHHHHhc----chHHHHHHHHHHHHHHHHHHHHHHHhhccc--------cc----cchhhhhhH-------------H-- Q lcl|Aclame:pro 208 TNTSLKD----TAENILAWLSSWIAKKVVVTRNQAIIEVMK--------AA----PKKPTIAKF-------------D-- 256 (408) Q Consensus 208 S~ell~d----s~~~~~~~v~~~l~~~~~~~~~~~~~~g~g--------~~----~~~~~~~~~-------------d-- 256 (408) |-||.+| ...|.++.|.+-|+..|...+|+.|+.-.. +. ....+.++. 
+ T Consensus 275 TiELAQDLKAVHGLDAEtELsNILSTEImlEINReii~~i~~~a~~~~~g~t~~~~~~~G~~dl~~~~d~~~~r~~~e~~ 354 (524) T protein:vir:98 275 SVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDLINYTAQVGKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESY 354 (524) T ss_pred cHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHHHHhhhheeceeecccccccccceeeccccccccccchhHHHH Confidence 9999888 346889999999999999999998874221 00 001122211 1 Q ss_pred -HHHHHH----HHh-hhhhccCCCEEEEcHHHHHHHHhh----hcccCce--eeccccccC-Cccccc-ccceEeecccc Q lcl|Aclame:pro 257 -DVITMI----NTA-VDPAIIATSSLLTNQSGLNKLALV----KTAEGKY--LLEPDPTKP-NSYLIK-GKQVIVVADRW 322 (408) Q Consensus 257 -~i~~~~----~~~-l~~~~~~~a~~~~n~~~~~~l~~l----kd~~G~~--~~~~~~~~~-~~~~l~-G~pv~~~~~~~ 322 (408) .+...+ +.. ..+.+...-.+|+++.....|..+ -+..|.- ....+.+.. ..+.|. ||+|++.++ T Consensus 355 ~~L~~~i~~~an~I~~~T~rg~~n~~i~S~~Va~~L~~~~~g~~~~s~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y-- 432 (524) T protein:vir:98 355 KALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPASQGLQKTLNVDTTKAVFAGVLGGTYKVYIDQY-- 432 (524) T ss_pred HHHHHHHHHHHHHHHHhhccccccEEEEchHHHHHHhhhhcccccccchhhcccccCCccceEEEEecCceEEEecCC-- Confidence 111111 111 123333344689999998888753 1111100 000000000 012333 567766322 Q ss_pred ccccccCcceEEEEehhcceEeeeccceE----EEEeccchhhh------hhceeeEEEEeeeCcEEecccceEEEEeec Q lcl|Aclame:pro 323 LPNTGSTVYPLYYGDMSQAITLFDRENMS----LLPTNIGAGAF------ETDTTKIRVIDRFDVKATDSEALVAGSFSA 392 (408) Q Consensus 323 ~~~~~~~~~~~~~gd~~~~~~~~~~~~~~----i~~~~~~~~~f------~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~ 392 (408) .+ .+-+++|- ++..+ +-..|+..-.+ .+-+-.+-+..|++.. .+|= +.- .. 
T Consensus 433 ~~-----~dy~~vG~---------KG~~~~~~glfyaPYv~l~~~~~~dp~sfqP~~g~~tRY~l~-~NP~--~~~--~~ 493 (524) T protein:vir:98 433 AR-----QDYFTVGF---------KGDNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGIG-INPF--ANS--RS 493 (524) T ss_pred CC-----cceEEEEe---------eCCcccccceeeccccccccccccCCccccceeeeeeeecee-ecCc--ccc--cC Confidence 11 22233331 11111 11112111000 0112222233333332 1221 100 00 Q ss_pred cccC-CCCccC----CCcccC Q lcl|Aclame:pro 393 IADQ-VGNFKT----TTSTAV 408 (408) Q Consensus 393 ~~~~-~~~~~~----~~~~~~ 408 (408) -+++ ...... .+.++. T Consensus 494 ~~~~~ri~~g~~~~~~ag~n~ 514 (524) T protein:vir:98 494 QAPADRITSGMISKEMCGKNA 514 (524) T ss_pred CccccccccCcchHhhcCccc Confidence 0010 000111 112222 No 225 >protein:vir:100851 Length: 514 # NCBI annotation: hypothetical protein # Family: family:all:2450 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164744;genbank:gi:56693157;genbank:GeneID:3197484 Probab=63.15 E-value=0.32 Score=23.28 Aligned_cols=319 Identities=11% Similarity=0.026 Sum_probs=123.5 Q ss_pred ccchhhhHHHHHHHHHHHhhc------chhhH-----HHHHHHHhhc------cccccCceecchhhhhhhhhhhhhhhh Q lcl|Aclame:pro 83 NKSENELKDKFVKDFVNMVRN------PMAFM-----NTVSSKTETS------GSDSAAGLTIPQDIRTMINTLVRQYDS 145 (408) Q Consensus 83 ~~~~~~~~~~~~~a~~~~~~~------~~~~~-----~~~~~~a~~~------~t~~~gg~~vP~~~~~~ii~~~~~~~~ 145 (408) --.++..+....+.|..--|. ..... ..+...+.++ .+-.+|+++=-..+..++..+...... 
T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~a~t~gy~~~~~~~t~gaAlR~EsLd~~l~~Lt~~~~~ 80 (514) T protein:vir:10 1 MYTQDKTKDIMKKSFFGGDRAVAFDTNKEDILNENLPENVKKSAFTAGHSITPDTQTDGAANRIESLNRDLKVTTWGERD 80 (514) T ss_pred CCccchhhHHHhhhhcccceeeeecCcHHHHHHHhcchhhhhhhhccccccCCccccCccchhhhhhccceeEeeecCcc Confidence 111122222222222110000 00000 0000111111 111233333333333333333222222 Q ss_pred --hhhhhceeecccCccceEEeeccCCccccchhcccccccccccccceeeeechheeeeehHHHHHH-HhcchHHHHHH Q lcl|Aclame:pro 146 --LQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTS-LKDTAENILAW 222 (408) Q Consensus 146 --l~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~el-l~ds~~~~~~~ 222 (408) +..-....+..+.-..|......+..+...+++|++-. +.+++.+.+..+..+-++.-..+|.-+ +.++..+.... T Consensus 81 ftf~~~i~k~~a~STV~ey~~~~~~G~~G~~~f~~E~gi~-~~~d~~~~rk~~~~k~l~~~~~vS~~~~l~n~i~d~~~~ 159 (514) T protein:vir:10 81 FTLYNDIAKQPVDNTVLKYTQYYSHGRTGHSLFQPEIGIG-DVNNPNERQRTINIKYIVDTHVTSIALQRANTIVDSLKV 159 (514) T ss_pred hhhhhhcCCchhhHHHhhhhhhcccCcccccccccccccC-cCCCcceEEEEEeeeeeeeeeeeeehhhhccchhhHHHH Confidence 22223333443333333333344566777889999854 578899999999999999876655544 34577788888 Q ss_pred HHHHHHHHHHHHHHHHHhhccccccc--hhhhhhHHHHHHHH---------------------HHhhhhhccCCCEEEEc Q lcl|Aclame:pro 223 LSSWIAKKVVVTRNQAIIEVMKAAPK--KPTIAKFDDVITMI---------------------NTAVDPAIIATSSLLTN 279 (408) Q Consensus 223 v~~~l~~~~~~~~~~~~~~g~g~~~~--~~~~~~~d~i~~~~---------------------~~~l~~~~~~~a~~~~n 279 (408) ..+.-...++..++.+.++|+..-.+ +....-+|.|++.+ ...+..+|....-++|+ T Consensus 160 ~~~dai~~ia~tiE~a~FyGDs~L~s~~~~~gleFDGl~~lI~~~NvIDarG~~Ls~~~ln~aA~~i~~gfGt~TD~ylp 239 (514) T protein:vir:10 160 QEYAAISTVIKTDEWAMFYGDADLTSGQKGEGLQFDGLFKLIAPENHIDLRGGRLSPAALNMAARKIGEGFGTPTDAYMP 239 (514) T ss_pred HHHHHHHHHHHHHHHHHhhhcccCCCccccCcchhhhHHHhhcCCCeEecCCCCccHHHHhhhhhhhhcccCChhheeCc Confidence 88999999999999999999876543 22235566665443 12222334433345666 Q ss_pred 
HHHHHHHHhhhcccCceeeccccccCCcccccccceEeeccccccccccCcceEEEEehhcc----eE---eeeccceEE Q lcl|Aclame:pro 280 QSGLNKLALVKTAEGKYLLEPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQA----IT---LFDRENMSL 352 (408) Q Consensus 280 ~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~----~~---~~~~~~~~i 352 (408) .-+.+.+..--.. ++.++.+.. .+....|+||--.-. ..+..... +..+.+..... -. --.-..+.+ T Consensus 240 ~~vka~f~~~~~~-~qRV~~~~n---~~~~~~G~~v~~f~s-~~G~I~L~-gs~im~~~n~L~~~~~~~~~Ap~~~~va~ 313 (514) T protein:vir:10 240 IGIKADFVNQHLN-GQRVMLPGQ---TGGMTTGLDIDKFLS-AHGSIRIQ-GSTIMDSDNKLDFDRPVSPTAPTAPQLSA 313 (514) T ss_pred hHHHHHHhhcccC-cceEEeecC---ccceeeeeeccceeE-eccceeec-CCeeecccccCccCCccCCcCCCCCcceE Confidence 6666554322111 112222211 011122333211000 00000000 00011110000 00 000000112 Q ss_pred EEeccchhhhh-------hc----------eeeEEEEeeeCcEEecccceEEEEeeccccCCCCccC-CCcccC Q lcl|Aclame:pro 353 LPTNIGAGAFE-------TD----------TTKIRVIDRFDVKATDSEALVAGSFSAIADQVGNFKT-TTSTAV 408 (408) Q Consensus 353 ~~~~~~~~~f~-------~~----------~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~~~~~-~~~~~~ 408 (408) .+.+.....|. 
++ +..+++...-+..--.|..++-.+........--+-+ .+...+ T Consensus 314 svT~~~~g~~~~ad~t~~~g~~~~~~~~g~~~sYaVv~~n~~GeS~ps~~vtaT~a~~~~~i~ltItp~~~~~~ 387 (514) T protein:vir:10 314 TVTPDGGGLWHEADKTDSKGEVILNKEVGVEQSYVAVMVSRHGDSRPSLVQTATPTKKDDAITLTITPNAMQNV 387 (514) T ss_pred EEecCcccccCcccccccccccccccccceeEEEEEEEECCCCcccccceeeeeeeccCceEEEEEEeccCccc Confidence 22111111111 00 1122222222222223444433322222211111111 011111 No 226 >protein:vir:101557 Length: 336 # NCBI annotation: gp12 # Family: family:all:1653 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958117;genbank:gi:41057663;genbank:GeneID:2716814 Probab=59.00 E-value=0.4 Score=22.76 Aligned_cols=299 Identities=6% Similarity=-0.091 Sum_probs=120.9 Q ss_pred HHHHHHHHHHHHHHHhhhcccccccccccchhhhHHHHHHHHHHHhhcchhhHHHHHHHHhhccccccCceecchhhh-- Q lcl|Aclame:pro 56 RDALREQLVEAQAEQVVNMREEEKGPLNKSENELKDKFVKDFVNMVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIR-- 133 (408) Q Consensus 56 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~-- 133 (408) +++. +.+.+++. -... ..........+ ... .+........+....+...||..+. 
T Consensus 1 ~~~~-~~~~~l~~------~gi~---~~~~~~~~~~~-------~~~------~~~da~d~~~~~~~~~~~~i~~~l~~~ 57 (336) T protein:vir:10 1 MRDA-QRIQNLAR------AGVI---LPRSVQNVSTP-------LTE------YAMDAADLSPHLSSTGSSGIPNYLTTY 57 (336) T ss_pred CchH-HHHHHHhh------cCee---ecchhhhhhhh-------HHH------hhhhhhhccCccccCCCchhHHHHHhh Confidence 0000 00000000 0000 00000000000 000 0000000001111122234565433 Q ss_pred --hhhhhhhhhhhhhhhhhceeecccCccceEEeeccCCccccchhcccccccccccccceeeeechheeeeehHHH-HH Q lcl|Aclame:pro 134 --TMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITAT-NT 210 (408) Q Consensus 134 --~~ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS-~e 210 (408) +.+++.+........++.+.+...-.-...........+.+.+.+.+...|-.+ ......+-+.+.++....++ .| T Consensus 58 i~p~~~~~~~~p~~a~~l~pv~t~g~W~~~~~~~~~~e~~G~a~~ygd~~D~P~~d-~~~~~~~~~v~~~~~g~~yg~~E 136 (336) T protein:vir:10 58 VDPAVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTKVATYGDYSSDGDSG-ANINYPQRQSYFFQTWTRWGERE 136 (336) T ss_pred cccceeeehhhhhhhhhhccccccCCccceeEEEeeeeceeeEEEeeccCCCceee-cccceeeeeEEEEEeeeeeCHHH Confidence 555666666665566666544321110122223334456667778888887544 34556666777777777777 44 Q ss_pred HHhc--chHHHHHHHHHHHHHHHHHHHHHHHhhcccccc------------------chhhhhhHH----HHHHHHHHhh Q lcl|Aclame:pro 211 SLKD--TAENILAWLSSWIAKKVVVTRNQAIIEVMKAAP------------------KKPTIAKFD----DVITMINTAV 266 (408) Q Consensus 211 ll~d--s~~~~~~~v~~~l~~~~~~~~~~~~~~g~g~~~------------------~~~~~~~~d----~i~~~~~~~l 266 (408) +-.- ...++.+--....++++.+.+|+-.+.|+.... +....++++ ||..++.. 
+ T Consensus 137 l~~A~~~g~~l~~~Ka~aA~~ale~~~N~i~~~Gd~~~~~yGllN~P~l~a~~t~~t~~~~~~t~eei~~Di~~~~~~-l 215 (336) T protein:vir:10 137 LEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQV-L 215 (336) T ss_pred HHHHHHhCCCcHHHHHHHHHHHHHHhhCcEEEEeccccceEEEEeCCCCccccccCCCcccccCHHHHHHHHHHHHHH-H Confidence 4332 234666777777778888888877777765321 111112233 33333322 2 Q ss_pred hhhc------cCCCEEEEcHHHHHHHHhhhcccCceeeccccccCCcccccccceEeeccccccccccCcceEEEEehhc Q lcl|Aclame:pro 267 DPAI------IATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQ 340 (408) Q Consensus 267 ~~~~------~~~a~~~~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~ 340 (408) ...- .....++++++.+..|..- +..|.-++. -+... +-++.+.. ...+.+. ++....++.+-.. T Consensus 216 ~~qs~G~i~~~~~~tL~LP~~~~~~Ls~~-n~~g~Tvl~-~lk~n----~Pnl~i~t--~pEl~~a-~G~~~~l~~~~~~ 286 (336) T protein:vir:10 216 QTQSQGIITQEDVLRMGLPPTAMSDLSKT-NQYGLAAAA-KLKDI----FPKLEFVT--IPEYDTA-SGRLVQLWAPRVE 286 (336) T ss_pred HHhcCCeecccCcceEEecHHHHHhccCC-CccCccHHH-HHHHh----cCccEEEE--ccccccC-CCceEEEEEEecC Confidence 1111 1245689999988877532 333322221 01111 11111211 1112221 2222222221100 Q ss_pred c-eEeeeccceEEEEeccchhhhhhceeeEEEEeeeC-cEEecccceEEEEee Q lcl|Aclame:pro 341 A-ITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFD-VKATDSEALVAGSFS 391 (408) Q Consensus 341 ~-~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d-~~v~~~~a~~~l~~~ 391 (408) . -..-..-...+...+.. 
...-.+..-+..|.+ +.+.+|.||+.++.- T Consensus 287 ~~~t~~~~~p~~~~~l~vq---~~~~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 336 (336) T protein:vir:10 287 GKDTATCGFTEKMRAHSIE---RYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) T ss_pred CCcceeeecchhhhcccee---ecCceeEeccccceeeeeeeccchheeeecC Confidence 0 00000000001111100 011123334555654 578899999999866 No 227 >protein:vir:81070 Length: 390 # NCBI annotation: p09 # Family: family:all:585 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285679;genbank:gi:148727187;genbank:GeneID:5247115 Probab=52.03 E-value=0.57 Score=21.94 Aligned_cols=359 Identities=9% Similarity=-0.009 Sum_probs=107.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccch Q lcl|Aclame:pro 7 VNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKGPLNKSE 86 (408) Q Consensus 7 i~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 86 (408) |.+|++++ .++++++.+++++..+......+...+.+.+++++.++++.++.++++++................... T Consensus 1 m~~l~~~l---~~~~~~~~~~~~~~~e~~~~~~~~~~e~~~~~~~l~~e~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~ 77 (390) T protein:vir:81 1 MTDITSKL---EATLANVTDSLRAFGERAVRDGELNASARSKVDELFATVGNLSAEVQAARQRVAELEGNGAGGDVQHVS 77 (390) T ss_pred ChHHHHHH---HHHHHHHHHHHHHHHHHHHhhcCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccc Confidence 55555444 445666666666666655444445567788888888888888888887766544333222222221111 Q ss_pred hhhHHHHHHHHHHHhhcchhhHHHHHHHHhhccccccCcee--cchhhhhhhhhhhhhhhhhhhhhceeecccCccceEE Q lcl|Aclame:pro 87 NELKDKFVKDFVNMVRNPMAFMNTVSSKTETSGSDSAAGLT--IPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVY 164 (408) Q Consensus 87 ~~~~~~~~~a~~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~--vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~~~~ 164 (408) ..........+..+........................+.. -...+..+++..+ +..+-...++...... 
+ T Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~i-----i~~~~~~~~l~~~~~~--~ 150 (390) T protein:vir:81 78 VGDMFVASEQFQASAGRWNDRSARATMNIKAALNTASTDAAGSAGALTTPNRLPGF-----ITPPDARLTVRDLIGS--G 150 (390) T ss_pred chhhhhhhHHHHHHHHHHhhhhhhhhhHHHHHHHhhccccccCCcceechhhhHHH-----HHHHhhhhhhhhhcce--e Confidence 11111122233333333222222111111111110000000 0001222222222 1111111112111111 1 Q ss_pred eeccCCccccchhcc---cccccccccccceeeeechheeeeeh--HHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 165 EKWTDVTPLTVMDAE---DGKIPDLDNPQLTIIKYLIKRYAGII--TATNTSLKDTAENILAWLSSWIAKKVVVTRNQAI 239 (408) Q Consensus 165 ~~~~~~~~~~~~~~E---~~~~~~~~~~~f~~v~~~~~~~~~~~--~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~ 239 (408) +.....-....+.+. ..-..+.........++....+...- .+.. +.+....-...+...+...++.++..++ T Consensus 151 ~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~--is~ell~d~~~~~~~i~~~l~~~~~~~~ 228 (390) T protein:vir:81 151 RTDSALIEYVQETGFVNNAAIVAEGALKPESSLKFAKKTDTTHVIAHTMK--ATRQILSDAPQLASYMNNRLIRGLKVKE 228 (390) T ss_pred eccCCceEEEEEecCCcceeeecCCcccccccceeeEEEEeeeEEEEeeh--hhHHHHHhHHHHHHHHHHHHHHHHHHHH Confidence 111111111111111 11112211111222333333333221 1221 1122222233444555555555555544 Q ss_pred hhccccccchhhhhhHHHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHhhhcccCc---eeeccc-------cccCCccc Q lcl|Aclame:pro 240 IEVMKAAPKKPTIAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEGK---YLLEPD-------PTKPNSYL 309 (408) Q Consensus 240 ~~g~g~~~~~~~~~~~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~lkd~~G~---~~~~~~-------~~~~~~~~ 309 (408) -...=.|... ...+..|+....... ....... ...-......+..+...... +++.+. +.+. 
T Consensus 229 d~a~l~G~g~--~~~~~Gi~~~~~~~~-~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~---- 300 (390) T protein:vir:81 229 DAEILRGTGA--NDGLLGLIPQATTYA-APTTIAG-ATRVDQLRLAMLQASLAEYNPSGIVINPIDWAAIELAKDA---- 300 (390) T ss_pred HHHHHhcCCC--CCcccceeecccccc-ccccccc-chhHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhcC---- Confidence 4322222211 112223321110000 0000000 01111222233444433221 222221 1111 Q ss_pred ccccceEeeccccccccccCcceEEEEehhcceEeeeccc-eEEEEeccchhhhhhceeeEEEEeeeCcEEec------- Q lcl|Aclame:pro 310 IKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDREN-MSLLPTNIGAGAFETDTTKIRVIDRFDVKATD------- 381 (408) Q Consensus 310 l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~-~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~------- 381 (408) .|.|++... . ..+ ..-++|=. .+. .+.-+ -++.+ ..|... +....|-+..+.. T Consensus 301 -~G~~l~~~~-----~-~~~-~~~l~G~p--v~~-~~~~p~~~~~~-----gd~~~~---~~~~~~~~~~v~~~~~~~~~ 361 (390) T protein:vir:81 301 -NNQYLIGNA-----R-GTL-TPTLWGLP--VVA-TQAMAPGEFLV-----GAFDLA---AQIFDQWDARVEIGYVGEDF 361 (390) T ss_pred -CCceeecCc-----c-ccc-Cceeccee--eEE-cCCCCCCcEEE-----Eehhce---EEEEEecceEEEEecccchh Confidence 244443210 0 010 11223321 000 00000 00100 011111 1111111111110 Q ss_pred ------ccceEEEEeeccccCCCCccCCC Q lcl|Aclame:pro 382 ------SEALVAGSFSAIADQVGNFKTTT 404 (408) Q Consensus 382 ------~~a~~~l~~~~~~~~~~~~~~~~ 404 (408) -.++..+.+...-|..--..+.+ T Consensus 362 ~~~~v~~r~~~r~d~~v~~~~a~v~~t~a 390 (390) T protein:vir:81 362 QRNMITVLAEERLALVVYRPEALISGSFA 390 (390) T ss_pred hcCcEEEEEEEeeccEEecccceEEEEeC Confidence 01222223333333222222222 No 228 >protein:vir:9410 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803388;genbank:gi:29028700;genbank:GeneID:1258136 Probab=47.81 E-value=0.69 Score=21.47 Aligned_cols=378 Identities=9% Similarity=-0.006 Sum_probs=88.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhhc--ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccc---c Q lcl|Aclame:pro 7 
VNQLNEAWIASGDKVTDFNDQINMALNDDN--FSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKG---P 81 (408) Q Consensus 7 i~el~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~ 81 (408) |+-+++. .++++++.+++....++.. .++++ .++.+.+..+++.++++++..+............. . T Consensus 1 mk~~~el----~~~l~el~~~~~~~~~~~~~~~~~~~----~e~~~~~~~ei~~l~~~i~~~~~~~~~~~~~~~~~~~~~ 72 (415) T protein:vir:94 1 MKTKEEL----QSEISDIKRQIDLKVKYATRALNNDE----LEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQ 72 (415) T ss_pred CChHHHH----HHHHHHHHHHHHHHHHHHHHHhchhH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcc Confidence 5544433 3334444443333332211 22222 23344555666666666665544322211111111 1 Q ss_pred cccchhhhHHHHHH-HHHHHhhc-chhhHHHHHHHHhh----ccccccCceecchhhhhhhhhhhhhhhhhhhhhceeec Q lcl|Aclame:pro 82 LNKSENELKDKFVK-DFVNMVRN-PMAFMNTVSSKTET----SGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESV 155 (408) Q Consensus 82 ~~~~~~~~~~~~~~-a~~~~~~~-~~~~~~~~~~~a~~----~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~ 155 (408) .............. ....+... ........+.++.. .......+. +++.-...++..-.....+..+-...++ T Consensus 73 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~-~~~~~g~~~iP~~~~~~ii~~~~~~~~l 151 (415) T protein:vir:94 73 QSVEVNEASTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGS-LKTDSGFVVIPEEIVTDILKLKEVEFNL 151 (415) T ss_pred ccccccchhhHHHHHHHHHHHhhhhhhhhhHHHHHHHHHHhhhhhhhhhhc-cccccccccCcHHHHHHHHHHHHhhhhh Confidence 11111111111111 11111100 00011111122111 001000111 1111111111111111111111122222 Q ss_pred ccCccceEEeeccCCccccchhc--cccccccccc-ccceeeeechheeeeehHHHHH-HHhcchHHHHHH-HHHHHHHH Q lcl|Aclame:pro 156 STSNGSRVYEKWTDVTPLTVMDA--EDGKIPDLDN-PQLTIIKYLIKRYAGIITATNT-SLKDTAENILAW-LSSWIAKK 230 (408) Q Consensus 156 ~~~~g~~~~~~~~~~~~~~~~~~--E~~~~~~~~~-~~f~~v~~~~~~~~~~~~iS~e-ll~ds~~~~~~~-v~~~l~~~ 230 (408) ......++++.....-+...+.+ +..-..+... +..+..++..-.+...- +..- -+.+.......+ +...+... 
T Consensus 152 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~~~~~i~~~~~k-~~~~~~is~ell~ds~~~~~~~i~~~ 230 (415) T protein:vir:94 152 DKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINT-HRGYFRISREAIEDAKVNVLQELKLW 230 (415) T ss_pred hhhcceeeccCCceeEEEEeecCCccceeccccccccccccccceeeEeehee-eeeechhhHHHHhhchHHHHHHHHHH Confidence 22111222211111111111111 1111111111 11111112222211111 0000 011111111111 33444444 Q ss_pred HHHHHHHHHhhccccccchhhhhhHHHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHhhhccc---CceeeccccccC-- Q lcl|Aclame:pro 231 VVVTRNQAIIEVMKAAPKKPTIAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAE---GKYLLEPDPTKP-- 305 (408) Q Consensus 231 ~~~~~~~~~~~g~g~~~~~~~~~~~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~lkd~~---G~~~~~~~~~~~-- 305 (408) ++..+...+-...-.+........ ........ ........-..-.+....+.++.+.. ..+++.+..... T Consensus 231 l~~~~~~~~~~~il~g~g~g~~~~--~~~~~~~~---~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~ 305 (415) T protein:vir:94 231 MARTIAATRNKAIIDVITKGSTGS--TSSGFEKE---GKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLD 305 (415) T ss_pred HHHHHHHHHHHHHhhccccCcccc--cccccccc---ccccccccccchHHHHHHHHhhhhhccCCCEEEEcHHHHHHHH Confidence 444444433222112211111110 00000000 00000000111112223333333322 122322211000 Q ss_pred CcccccccceEeecc-ccccccccCcceEEEEe------hhc-ceEeeeccceEEEEeccchhhhhhceeeEEEEeeeC- Q lcl|Aclame:pro 306 NSYLIKGKQVIVVAD-RWLPNTGSTVYPLYYGD------MSQ-AITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFD- 376 (408) Q Consensus 306 ~~~~l~G~pv~~~~~-~~~~~~~~~~~~~~~gd------~~~-~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d- 376 (408) .-..-+|.|++..+- ...+...-+. ++++-+ -.. .+.+.++..+.+..+ +..+.+....... 
T Consensus 306 ~lkd~~G~~l~~~~~~~~~~~~l~G~-pV~~~~~~~~~~~~~~~i~~gd~~~~~~~~~--------~~~~~v~~~~~~~~ 376 (415) T protein:vir:94 306 KMKDKLGNYLIQPDVKEKTQQRLLGA-KIEILPDEVLGQKGNNTLIIGNLKDAIVLFD--------RSQYQASWTDYMHF 376 (415) T ss_pred HhhccCCCeeeccCcCCCCCceecce-eeEEecccccCCCCccEEEEEehhccEEEEe--------ecceEEEEeccccC Confidence 001122444432110 0011111111 122221 111 123334443322222 1112222211111 Q ss_pred cEEec---ccceEEEEeeccccCCCCccCCCcccC Q lcl|Aclame:pro 377 VKATD---SEALVAGSFSAIADQVGNFKTTTSTAV 408 (408) Q Consensus 377 ~~v~~---~~a~~~l~~~~~~~~~~~~~~~~~~~~ 408 (408) ....+ --.+.++.-.++..-..+++++...++ T Consensus 377 ~~~~r~~~r~d~~~~~~~a~~~~~~~~~~~~~~~~ 411 (415) T protein:vir:94 377 GECLMIAVRQDCRILDYKSAIVIEYDDSERGEGDL 411 (415) T ss_pred ceEEEEEEEeccEEeccccEEEEEEeccCCCCCcc Confidence 11111 112223333333333344444445555 No 229 >protein:vir:80986 Length: 528 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469506;genbank:gi:157311463;genbank:GeneID:5602119 Probab=47.02 E-value=0.72 Score=21.38 Aligned_cols=340 Identities=13% Similarity=0.093 Sum_probs=116.3 Q ss_pred HHHHHHHHHHHHHHHhhhcccHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccchhhhHHHHHHH Q lcl|Aclame:pro 18 GDKVTDFNDQINMALNDDNFSAEAMSEL-KNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKGPLNKSENELKDKFVKD 96 (408) Q Consensus 18 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a 96 (408) ....+++.++=.-+++.+.. .++... +..+ .+++ ++.|.+++... 
.....+..+.+ T Consensus 1 ~~~~~~l~~kw~p~l~~~~~--~~i~~~~~~~~---~a~l--lenq~~~~~~~----------------~~~~~~~~~~~ 57 (528) T protein:vir:80 1 MKTTKELMEKWSPLLENEKL--PEIATASKQKL---VAKI--LESQEADFAVD----------------PIYKDEKVVEA 57 (528) T ss_pred CcchHHHHHhhhHhhcCCcc--chhcchhhhhh---hhhh--hhhhhHHhhcc----------------ccccchHHHHh Confidence 11122222222223322211 111110 0000 0000 11111110000 01112233445 Q ss_pred HHHHhhcch----hhHHHHHHHHhhccccccCceecchhhhhhhhhhhh---hhhhhhhhhceeecccCccceEEeecc- Q lcl|Aclame:pro 97 FVNMVRNPM----AFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVR---QYDSLQQYVRVESVSTSNGSRVYEKWT- 168 (408) Q Consensus 97 ~~~~~~~~~----~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~---~~~~l~~~~~~~~~~~~~g~~~~~~~~- 168 (408) |..++.... .-...-. ...++ +++++. .+.+.++..+| +.....+++.+.||++++|-+--.+-. T Consensus 58 ~~~~l~ea~~~~~~~~~~~~---i~es~-~t~~v~---~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY 130 (528) T protein:vir:80 58 FGGFIAEAEVAGDHGYDASQ---IAAGQ-TTGAIT---NVGPAVIGMVRRAIPNLIAFDICGVQPMSTPTSQIFAIRSVY 130 (528) T ss_pred hhhhccccccccccCCcccc---ccccc-cccccc---cCCchhhhHHHHHHhhhhhhhhheeccCCchhhhheeeeeee Confidence 544443211 0000000 00011 111111 23333444444 344456778888888776533211100 Q ss_pred -CC-----------------------------------c----------------------------------------- Q lcl|Aclame:pro 169 -DV-----------------------------------T----------------------------------------- 171 (408) Q Consensus 169 -~~-----------------------------------~----------------------------------------- 171 (408) .. 
+ T Consensus 131 ~~~~~~~~~~ea~~~~~~~da~fS~~~t~~~a~~~ea~t~fs~~~~~~~~~~G~~~~~t~~~tg~~~~~~~~~~~~~~~~ 210 (528) T protein:vir:80 131 GPNPLASQAKEAFHPMYAPDAFHSSLAAKGAAVGSPTGTPFAKLAIGTQIEAGDIVHHTFAETGIAYLQNVTAEQVTPTK 210 (528) T ss_pred cCCccccccccccccccccccccccccccccccccccccccccccccccccccceeccccccccccccccccccccCccc Confidence 00 0 Q ss_pred ------------------------cccchhcccc-cccccccccceeeeechheeeee-------hHHHHHHHhc----c Q lcl|Aclame:pro 172 ------------------------PLTVMDAEDG-KIPDLDNPQLTIIKYLIKRYAGI-------ITATNTSLKD----T 215 (408) Q Consensus 172 ------------------------~~~~~~~E~~-~~~~~~~~~f~~v~~~~~~~~~~-------~~iS~ell~d----s 215 (408) +...-.+|.- .-...+...|.++.|+..|..+- ..+|-||.+| . T Consensus 211 ~gt~~~~~~~~~~~~~~~~~~~~~Gm~Ta~AE~le~lg~ss~~~f~EMaFsIEKvTVtAKSRaLKAEYTiELAQDLKAIH 290 (528) T protein:vir:80 211 AGSESEDEVVMKLMEEGKLAEIAFGMATSIAEIQEGFNGSSNNPWAEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVH 290 (528) T ss_pred cCCcccccccccccccccccccccccchhhhhhhcccCCCccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhc Confidence 0000001100 00001122355666666555543 5689999888 3 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHhhcccc---c---------cchhhhhhH-------------HH---HHHHH----H Q lcl|Aclame:pro 216 AENILAWLSSWIAKKVVVTRNQAIIEVMKA---A---------PKKPTIAKF-------------DD---VITMI----N 263 (408) Q Consensus 216 ~~~~~~~v~~~l~~~~~~~~~~~~~~g~g~---~---------~~~~~~~~~-------------d~---i~~~~----~ 263 (408) ..|.++.|.+-|+..|...+|+.|+.-... . ....+..+. +. 
+...+ + T Consensus 291 GLDAEtELaNILStEImlEINReii~~i~~~a~~~~~~~t~~~~~~~G~~dl~~~~d~~g~r~~~e~~k~L~~~i~~~an 370 (528) T protein:vir:80 291 GMDADAELNAILANEVLLEINREIVDVINFTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAA 370 (528) T ss_pred CCChHHHHHHHHHHHHHHHhhHHHHhhhhheeeeeeeeeeeccccccceeeccccccccccchhHHHHHHHHHHHHHHHH Confidence 468899999999999999999999542210 0 001111111 11 11111 1 Q ss_pred Hh-hhhhccCCCEEEEcHHHHHHHHhh-----hcccC-ceeeccccccCC-ccccc-ccceEeeccccccccccCcceEE Q lcl|Aclame:pro 264 TA-VDPAIIATSSLLTNQSGLNKLALV-----KTAEG-KYLLEPDPTKPN-SYLIK-GKQVIVVADRWLPNTGSTVYPLY 334 (408) Q Consensus 264 ~~-l~~~~~~~a~~~~n~~~~~~l~~l-----kd~~G-~~~~~~~~~~~~-~~~l~-G~pv~~~~~~~~~~~~~~~~~~~ 334 (408) .. ..+.+...-.+++++.....|... ....| +..+..+.+... .+.|. ||+|++.++ .+ .+-++ T Consensus 371 ~I~~~T~~~~gn~vi~S~~Va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y--~~-----~dy~~ 443 (528) T protein:vir:80 371 EIARQTGRGAGNFVIASRNVVNILASADQGISLAMQGAAKGLNTDTTKAVFAGVLAGKYKVFIDQY--AR-----QDYFT 443 (528) T ss_pred HHHHhhccccccEEEEchHHHHHHhhccccccccccccccccccCCCCceEEEEecCceEEEecCC--CC-----cceEE Confidence 11 123344445789999988877643 11111 122222222211 23444 567766322 11 12233 Q ss_pred EEehhcceEeeeccceEE----EEeccchhhh------hhceeeEEEEeeeCcEEecccceEEEEeecccc-CCCCccC- Q lcl|Aclame:pro 335 YGDMSQAITLFDRENMSL----LPTNIGAGAF------ETDTTKIRVIDRFDVKATDSEALVAGSFSAIAD-QVGNFKT- 402 (408) Q Consensus 335 ~gd~~~~~~~~~~~~~~i----~~~~~~~~~f------~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~-~~~~~~~- 402 (408) +|- ++..++ =..|+..-.| .+-+-.+-+..|++.. .+|= +.- ..-++ .-..... 
T Consensus 444 vG~---------KG~~~~~~glfy~PYv~l~~~~~~dp~sfqP~~g~~tRY~l~-~NP~--~~~--~~~~~~~r~~~g~~ 509 (528) T protein:vir:80 444 VGY---------KGDNEMDAGIYYAPYVALTPLRATDPQSFHPVLGFKTRYGIG-INPF--ADS--KSQAPSARITSGML 509 (528) T ss_pred EEE---------eCCcccccceeecccccceeeEeeCCccccceeeeeeeecee-ecCc--ccc--cCCcccccccccch Confidence 331 111111 1111110000 0111122222333322 1221 000 00000 0000111 Q ss_pred ---CCcccC Q lcl|Aclame:pro 403 ---TTSTAV 408 (408) Q Consensus 403 ---~~~~~~ 408 (408) ++.++. T Consensus 510 ~~~~ag~n~ 518 (528) T protein:vir:80 510 SKDSVGKNA 518 (528) T ss_pred hhhhcCccc Confidence 111222 No 230 >protein:vir:107732 Length: 379 # NCBI annotation: gp23 # Family: family:all:1653 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024871;genbank:gi:48697513;genbank:GeneID:2948349 Probab=45.44 E-value=0.77 Score=21.21 Aligned_cols=314 Identities=7% Similarity=0.019 Sum_probs=117.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccchhhhHHHHHHHHHHHh---hcchhhHHHHHHHH Q lcl|Aclame:pro 39 AEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKGPLNKSENELKDKFVKDFVNMV---RNPMAFMNTVSSKT 115 (408) Q Consensus 39 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~---~~~~~~~~~~~~~a 115 (408) ....+.....+.. ....+........+-...++++++- .+........-..| T Consensus 1 ~~~~~~~~~~~~~-------------------------~~~~~~~~~~~~~~~~~~~~l~~~gi~~~~~~~~~~~~~~~a 55 (379) T protein:vir:10 1 MPQISKIHSSLNA-------------------------RQMTQMVMDSADVTLDNLKHLESYGIHLNGRKNKLFELMQFA 55 (379) T ss_pred CCCcceeeeecCc-------------------------cccchhhhccccccHHHHHHHHhcCccccchhhhhhhhhhhh Confidence 0000000000000 0000000000000111112222210 00000000000011 Q ss_pred hhcc------cc-c----cCceecchh---hhhhhhhhhhhhhhhhhhhceeecccCccceEEeeccCCccccchhcccc Q lcl|Aclame:pro 116 ETSG------SD-S----AAGLTIPQD---IRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDG 181 (408) Q Consensus 116 ~~~~------t~-~----~gg~~vP~~---~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~E~~ 181 (408) +... +. + .+-..+|.. 
+...+++.+-....+..++.+.+...-.-...........+.+.+.+.+. T Consensus 56 md~~~~~~~~~~~~~l~~~~~~g~~~~l~~~~p~~i~~~tap~~a~~l~pv~t~g~W~~~~~~~~v~e~~G~A~~ygd~~ 135 (379) T protein:vir:10 56 MDSNDIGPIPTPLSPLSPVSIPGLIQFLQNWLPGHVRILTAVREADEFLGLSTVGQWDDEQIVQRVLEGLGTAQPYTDGG 135 (379) T ss_pred hccccccccccccCccccccccchHHHHHhhcchHHHHHhhhhhhhhhcccccCCCceeeeEEEeeeeeeeeeEEecccc Confidence 1110 00 0 001123443 33556666655555555555544211110112222233445666778777 Q ss_pred cccccccccceeeeechheeeeehHHHH-HHHh--cchHHHHHHHHHHHHHHHHHHHHHHHhhcccccc----------- Q lcl|Aclame:pro 182 KIPDLDNPQLTIIKYLIKRYAGIITATN-TSLK--DTAENILAWLSSWIAKKVVVTRNQAIIEVMKAAP----------- 247 (408) Q Consensus 182 ~~~~~~~~~f~~v~~~~~~~~~~~~iS~-ell~--ds~~~~~~~v~~~l~~~~~~~~~~~~~~g~g~~~----------- 247 (408) ++|-.+ ...+...-..+.+...+.++. |+.. ....++.+--.....+++.+.+|+-.+.|.+... T Consensus 136 d~pl~d-~~~~~~~r~v~~~~~g~~yg~~El~~Aa~~g~~l~~~Ka~aA~~ale~~~N~i~f~G~~d~~~~~yGllNdP~ 214 (379) T protein:vir:10 136 NMALMS-WTPTFETRTVVRFEAGLQVAPLEEARSSRVQVSSADEKRAMVGEALEVQRNRVAFYGYNDGSGRTFGFLNDPN 214 (379) T ss_pred CCCeee-eeeeeeeeeeEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCCCcceEEEEeCCC Confidence 776433 233333344455555555543 2222 1235677777888888888889988898843110 Q ss_pred ------chh--------hhhhHHHHHHHHHHhhhh-----h--ccCC---CEEEEcHHHHHHHHhhhcccCceeeccccc Q lcl|Aclame:pro 248 ------KKP--------TIAKFDDVITMINTAVDP-----A--IIAT---SSLLTNQSGLNKLALVKTAEGKYLLEPDPT 303 (408) Q Consensus 248 ------~~~--------~~~~~d~i~~~~~~~l~~-----~--~~~~---a~~~~n~~~~~~l~~lkd~~G~~~~~~~~~ 303 (408) +.+ ...+.+.|++.+...+.. . ..++ ..+++++..+..|..- +..|.-++. -+. 
T Consensus 215 l~a~~t~atg~~~~t~Wa~kT~~eI~~Di~~~~~~l~~qs~g~~~~~~~~~tL~LP~~~~~~L~~~-n~~g~Tvl~-~lk 292 (379) T protein:vir:10 215 LPAYVAVPNGAGGSPLWAQKTTLEIIADLRNGLTALQVQSMGRIKSNKTPITIGIPNAYENYITTP-TELGYSVAQ-YMR 292 (379) T ss_pred CcccccccCCcccccccccCCHHHHHHHHHHHHHHHHHhhCCeecccccceeEEecHHHHHhhccc-cccCccHHH-HHH Confidence 000 011334444333322211 1 1122 2588899998888643 222321211 011 Q ss_pred cCCcccccccceEeeccccccccccCcceEEEEehhcc---------eEeeeccceEEEEeccchhhhhhceeeEEEEee Q lcl|Aclame:pro 304 KPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQA---------ITLFDRENMSLLPTNIGAGAFETDTTKIRVIDR 374 (408) Q Consensus 304 ~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~---------~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r 374 (408) .. +-++.++..+ ..-...++++...+|.|-... ...+ .+.++ ..+.. ...-.+..-+..| T Consensus 293 ~n----~Pnl~i~t~p-EL~~aggg~~~~~~~~~~~~~~~t~~~~~~~~~~-p~k~~--~l~ve---~~~~~~~~~~~~r 361 (379) T protein:vir:10 293 ES----YPNVTFVSAP-ELNDANGGSSAIYYYADAVENNGTDDGRTWLQVV-PTKMF--TLGVE---KKIKGYAEGYTNA 361 (379) T ss_pred Hh----cCCcEEEEcc-cccccCCCccEEEEEeeccCCCccCCcceEEEec-chhhh--hccce---ecCceeEeccccc Confidence 11 1122222211 111222333344555542110 0000 11111 11100 0011122234445 Q ss_pred -eCcEEecccceEEEEee Q lcl|Aclame:pro 375 -FDVKATDSEALVAGSFS 391 (408) Q Consensus 375 -~d~~v~~~~a~~~l~~~ 391 (408) .|+.+.+|.||+.+... 
T Consensus 362 t~Gv~ir~P~Ai~~~~G~ 379 (379) T protein:vir:10 362 TAGAMLKRPFATYRQTGA 379 (379) T ss_pred eeeeeeecchhhheecCC Confidence 45588899999998877 No 231 >protein:vir:78920 Length: 290 # NCBI annotation: Cps # Family: family:all:701 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468846;genbank:gi:157325479;genbank:GeneID:5601917 Probab=44.41 E-value=0.81 Score=21.09 Aligned_cols=260 Identities=8% Similarity=-0.030 Sum_probs=103.4 Q ss_pred hhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeecccCccceEEeeccCCccccchhcccccccccccccceeee Q lcl|Aclame:pro 116 ETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIK 195 (408) Q Consensus 116 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~ 195 (408) |+... -+.++..+.+.+...+....+...-.-..+..++.+++........+-.+.|-..++.+ .++...+ T Consensus 1 Main~--------a~~~~~~Ld~~~~~~~~t~~l~~~~~~~~ggktVkI~~i~~~gl~DY~R~~g~~~g~v~-~~~et~t 71 (290) T protein:vir:78 1 MAINY--------VDKYGKELDQKLVFGTYTNELETPNLLWLDAKTFKIQTITTTGLKAHTRNKGYNEGSAS-NTNKSYT 71 (290) T ss_pred CchhH--------HHHHHHHHHHHHHhhheeeeccccceeeccCCEEEEeeeccCcccccccCCCcccCccc-cceeeEE Confidence 11100 12333444444433333222221111111223566776665544444444443333332 2455566 Q ss_pred echheeeeeh----HHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhh----ccccc----cc-hhhhhhHHHHHHHH Q lcl|Aclame:pro 196 YLIKRYAGII----TATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIE----VMKAA----PK-KPTIAKFDDVITMI 262 (408) Q Consensus 196 ~~~~~~~~~~----~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~~----g~g~~----~~-~~~~~~~d~i~~~~ 262 (408) ++-.+.-.+. .+-+.-. ...+...+.+...+.+.-.+|.-.+. +.++. .. 
.+....++.+.+++ T Consensus 72 l~qdR~~~F~vD~~DvDEt~~---~~~~~nv~~ef~~~~v~PEiDayr~skla~~a~~~~~~~~~t~t~~n~~~~i~~~~ 148 (290) T protein:vir:78 72 IDFDRDVEFFVDVMDVDETGQ---ALSAANVTKEFNSRHAGPEMDAYRFSKLATAAKTNSNSVAEEITKDNVFTKLKAAI 148 (290) T ss_pred eeccccceeeccccchhHHhh---hhhHHHHHHHHHHHHhhhhhhHHHHHHHHhhhhccCcccccccCHHHHHHHHHHHH Confidence 6665555442 1211111 12233344444444444444433221 11111 11 11122344444444 Q ss_pred HHhhhhhccCCCEEEEcHHHHHHHHhhhcccCceee---ccccccCCcccccccceEeeccc-cc-----------cccc Q lcl|Aclame:pro 263 NTAVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLL---EPDPTKPNSYLIKGKQVIVVADR-WL-----------PNTG 327 (408) Q Consensus 263 ~~~l~~~~~~~a~~~~n~~~~~~l~~lkd~~G~~~~---~~~~~~~~~~~l~G~pv~~~~~~-~~-----------~~~~ 327 (408) ..++.....+..++++|..+..|.+-+.=+...-. .....++..++|.|++|+.+++. -+ +... T Consensus 149 -~~ldevp~~~rvl~vtp~~~~lL~~~~~f~r~~~~~~~~~~~i~~~V~~idG~~ii~vps~~r~~t~~~f~~G~~~~~~ 227 (290) T protein:vir:78 149 -RKVKKYGTQNLVMYVSPDVMAALELSDDFVRAINVQNIGPSSIETRITAIDGTRIVEVEAEDRFYDTFDFTDGYKPAAG 227 (290) T ss_pred -HHHHhcCCCCeEEEECHHHHHHHhhChhhhccccccccccccccceeeeecCcEEEEecccchhhhhhhhcccccccCC Confidence 33554444566778999999877533211111111 01122344578999999987642 11 2222 Q ss_pred cCcceEEEEehhcceEeeeccceEEEEe-ccchhhhhh-ceeeEEEEeeeCcEEecccceEEEEeecc Q lcl|Aclame:pro 328 STVYPLYYGDMSQAITLFDRENMSLLPT-NIGAGAFET-DTTKIRVIDRFDVKATDSEALVAGSFSAI 393 (408) Q Consensus 328 ~~~~~~~~gd~~~~~~~~~~~~~~i~~~-~~~~~~f~~-~~~~~r~~~r~d~~v~~~~a~~~l~~~~~ 393 (408) +.+-.+++...+. ..-...-. .+... |.. 
+++ +...+.-..++|.=|.+.+.=.+..=.++ T Consensus 228 ak~in~ii~~~~a-~i~~~K~~-~~~~~~P~~---~~~~d~~~~~~r~y~d~~v~~nk~~~i~~~~~~ 290 (290) T protein:vir:78 228 AKKLNFLLVNKGS-VVGGAKHA-SIYLHAPGS---VGQGDGWLYQYRVYHDIFVLDQQKDGVIASTEV 290 (290) T ss_pred ccceeEEEEcCCc-eeeeeeee-EEEeeCCCC---CcCcceeeeeeeeeeeeeeeccccCeeEEEeeC Confidence 2223345555443 22222111 23222 222 111 22333333446666665542222111111 No 232 >protein:vir:103463 Length: 521 # NCBI annotation: major head subunit precursor # Family: family:all:364 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803115;genbank:gi:116326395;genbank:GeneID:4405492 Probab=43.50 E-value=0.84 Score=20.99 Aligned_cols=338 Identities=13% Similarity=0.117 Sum_probs=120.5 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccc Q lcl|Aclame:pro 1 MGVKLTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKG 80 (408) Q Consensus 1 M~~~~~i~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (408) |.++- .++|.+++..+.+. +.. .++..-+..+- +++ ++.|.++++.. + T Consensus 1 ~~~~~-~~~l~~kw~p~l~~--------------~~~--~~i~~~~~~~~---a~~--~enq~~~~~~~------~---- 48 (521) T protein:vir:10 1 MTIKT-KAELLNKWKPLLEG--------------EGL--PEIANSKQAII---AKI--FENQEKDFQTA------P---- 48 (521) T ss_pred CCcch-hHHHHHhhhhhhcc--------------CCC--Cccccchhhhh---hhh--hhhhhhhhhhc------c---- Confidence 66663 24466666554222 110 11111111111 000 11111111000 0 Q ss_pred ccccchhhhHHHHHHHHHHHhhcc-----hhhHHHHHHHHhhccccccCceecchhhhhhhhhhhh---hhhhhhhhhce Q lcl|Aclame:pro 81 PLNKSENELKDKFVKDFVNMVRNP-----MAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVR---QYDSLQQYVRV 152 (408) Q Consensus 81 ~~~~~~~~~~~~~~~a~~~~~~~~-----~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~---~~~~l~~~~~~ 152 (408) ....+....+|.+++-.. .+.... ....++ ++|++. 
.+.+.++..+| +.....+++.+ T Consensus 49 ------~~~~~~~~~~~~~~l~e~~~~~~~~~~~~----~i~es~-~t~~v~---~~~P~Li~lvRra~p~LIa~DIwGV 114 (521) T protein:vir:10 49 ------EYKDEKIAQAFGSFLTEAEIGGDHGYNAT----NIAAGQ-TSGAVT---QIGPAVMGMVRRAIPNLIAFDICGV 114 (521) T ss_pred ------ccchhHHHHHHhhhhhhhcccCccccccc----cccccc-cccccc---cCCchhhhHHHHHHhhhhhhhceee Confidence 011122333444443221 111000 001111 111111 23344444444 34445677888 Q ss_pred eecccCccceEEeecc--CCc--------------cccchhcc------------------------------------- Q lcl|Aclame:pro 153 ESVSTSNGSRVYEKWT--DVT--------------PLTVMDAE------------------------------------- 179 (408) Q Consensus 153 ~~~~~~~g~~~~~~~~--~~~--------------~~~~~~~E------------------------------------- 179 (408) .||++++|-+--.+-. ... +.+.|-+. T Consensus 115 QPMTgPTGLIFAMRsrY~~q~~~~~g~eaf~~~~~ada~fSG~~~at~~s~~~~~~~~~~Gd~~~~~~~~~g~~~~~~~~ 194 (521) T protein:vir:10 115 QPMNSPTGQVFALRAVYGKDPIAAGAKEAFHPMYGPDAMFSGQGAAKKFAALAASTQTTVGDIYTHFFQDTGTVYLQASA 194 (521) T ss_pred ccCCchhhhheeeeeeccCCccccccccccchhccccccccccccccccccccccccccccccccccccccccceecccc Confidence 8888887633211110 000 00000000 Q ss_pred --------------------------------cc---------cccccccccceeeeechheeeee-------hHHHHHH Q lcl|Aclame:pro 180 --------------------------------DG---------KIPDLDNPQLTIIKYLIKRYAGI-------ITATNTS 211 (408) Q Consensus 180 --------------------------------~~---------~~~~~~~~~f~~v~~~~~~~~~~-------~~iS~el 211 (408) +. 
.....+...|.++.|+..|..+- ..+|-|| T Consensus 195 ~~t~~~t~~d~~~~~~~~~~~~~~~~~y~~~~GmsTa~aEal~~~g~ss~~~f~EMaFsIeKvtVtAKSRaLKAEYTiEL 274 (521) T protein:vir:10 195 QVTISSTADDAAKLDAEIKKQMEAGALVEIAEGMATSIAELQESFNGSTDNPWNEMGFRIDKQVIEAKSRQLKAAYSIEL 274 (521) T ss_pred cccCCCcccccccccccccccccccceeecccccchhhHhhhccCCCCccccccceeeEEEEEEEeeeccceeccccHHH Confidence 00 00001122355666666665543 5689999 Q ss_pred Hhc----chHHHHHHHHHHHHHHHHHHHHHHHhhcccc--------cc----chhhhhhHH----------------HHH Q lcl|Aclame:pro 212 LKD----TAENILAWLSSWIAKKVVVTRNQAIIEVMKA--------AP----KKPTIAKFD----------------DVI 259 (408) Q Consensus 212 l~d----s~~~~~~~v~~~l~~~~~~~~~~~~~~g~g~--------~~----~~~~~~~~d----------------~i~ 259 (408) .+| ...|.++.|.+-|+..|...+|+.|+.-... .+ ...+..+.+ .+. T Consensus 275 AQDLKAVHGLDAEtELaNILSTEImlEINReii~~i~~sa~~~~~g~t~~~~~~~G~~d~~~~~d~~~~~~~~e~~k~L~ 354 (521) T protein:vir:10 275 AQDLRAVHGMDADAELSGILATEIMLEINREVVDWINYSAQVGKSGMTLTPGSKAGVFDFQDPIDIRGARWAGESFKALL 354 (521) T ss_pred HHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHhhhhhheeeeeeeeeeeccCccccceecccccccccchHHHHHHHHHH Confidence 888 3468899999999999999999998843210 00 111222111 111 Q ss_pred HHH----HHhh-hhhccCCCEEEEcHHHHHHHHhhh-----cccC-ceeeccccccCC-ccccc-ccceEeecccccccc Q lcl|Aclame:pro 260 TMI----NTAV-DPAIIATSSLLTNQSGLNKLALVK-----TAEG-KYLLEPDPTKPN-SYLIK-GKQVIVVADRWLPNT 326 (408) Q Consensus 260 ~~~----~~~l-~~~~~~~a~~~~n~~~~~~l~~lk-----d~~G-~~~~~~~~~~~~-~~~l~-G~pv~~~~~~~~~~~ 326 (408) .-+ +... .+.-...-.+|+++.....|...- .++| ..-|..+.+... -+.|. ||+|++.++ . 
T Consensus 355 ~~i~~~an~i~~~T~r~~~n~~i~S~~Va~~L~~~~~~~~~~~~~~~~g~~~d~~~~~~~G~l~~~~~vy~D~y--~--- 429 (521) T protein:vir:10 355 FQIDKEAVEIARQTGRGEGNFIIASRNVVNVLASVDTGISYAAQGLATGFNTDTTKSVFAGVLGGKYRVYIDQY--A--- 429 (521) T ss_pred HHHHHHHHHHHHhcccccceEEEEchHHHHHHhhcccccccccccccccccccCCCceEEEEecCceEEEecCC--C--- Confidence 111 1111 122122335889999988887531 1111 112222221111 13443 567766322 1 Q ss_pred ccCcceEEEEehhcceEeeeccceE----EEEeccchhhh------hhceeeEEEEeeeCcEEecccceEEEEeeccccC Q lcl|Aclame:pro 327 GSTVYPLYYGDMSQAITLFDRENMS----LLPTNIGAGAF------ETDTTKIRVIDRFDVKATDSEALVAGSFSAIADQ 396 (408) Q Consensus 327 ~~~~~~~~~gd~~~~~~~~~~~~~~----i~~~~~~~~~f------~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~~ 396 (408) ..+-+++|- ++..+ +-..|+..-.+ .+-+-.+-+..|++.. .+|= +. .+.+ T Consensus 430 --~~dy~~vG~---------KG~~~~~~glfyaPYv~l~~~~~~dp~sfqP~~g~~tRY~l~-~NP~--~~-----~~~~ 490 (521) T protein:vir:10 430 --KQDYFTVGY---------KGPNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGIG-INPF--AE-----SAAQ 490 (521) T ss_pred --CcceEEEEE---------eCCcccccceeeccccccccccccCCccccceeeeeeeecee-ecCc--cc-----ccCC Confidence 122233331 11111 11112111000 0111122222333322 1221 00 0000 Q ss_pred CCC----cc---CCCcccC Q lcl|Aclame:pro 397 VGN----FK---TTTSTAV 408 (408) Q Consensus 397 ~~~----~~---~~~~~~~ 408 (408) .+. .. 
..+.++= T Consensus 491 ~~~~~i~~~~~~~~a~~~~ 509 (521) T protein:vir:10 491 APASRIQSGMPSILNSLGK 509 (521) T ss_pred ccceeecccchhhhccccc Confidence 000 00 0000110 No 233 >protein:vir:4339 Length: 395 # NCBI annotation: major head protein # Family: family:all:585 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061502;genbank:gi:9635591;genbank:GeneID:1262860 Probab=40.84 E-value=0.95 Score=20.70 Aligned_cols=373 Identities=9% Similarity=-0.048 Sum_probs=103.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccc-ccccc Q lcl|Aclame:pro 7 VNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKG-PLNKS 85 (408) Q Consensus 7 i~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~ 85 (408) |.++.+++.++.++++++.+++++..+.. .+++.++.+..++++++.++++.++.+.+............. ..... T Consensus 1 m~~~~k~l~el~~~~~~~~~~~~~~~e~~---~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (395) T protein:vir:43 1 MSDFEKQIGELNASLKQVGDQIKSQAEQV---NTQIANFGEMNKETRAKVDELLTAQGELQARLSAAEQAMLANEKRDGG 77 (395) T ss_pred ChhHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccc Confidence 66888888888888888888877665442 345555666666667777777776666554432221111111 11111 Q ss_pred hhhhH-HHHHHHHHHHhhcchhhHHHHHHHHhh-ccccccCceecchhhhhhhhhhhhhhhhhhhhhceeecccCccceE Q lcl|Aclame:pro 86 ENELK-DKFVKDFVNMVRNPMAFMNTVSSKTET-SGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRV 163 (408) Q Consensus 86 ~~~~~-~~~~~a~~~~~~~~~~~~~~~~~~a~~-~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~~~ 163 (408) ....+ ...........+.-............. ....+.. 
..-...+..++...+ +..+-...++...-...+ T Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~g~~vp~~~~~~i-----i~~~~~~~~l~~l~~~~~ 151 (395) T protein:vir:43 78 EEAPKTAGQMVAESLKEQGVTSSLRGSHRVSMPRSAITSID-GSGGALVAPDRRPGV-----VAAPQRRLTIRDLVAPGT 151 (395) T ss_pred cchhhhHHHHHHHHHHHHHHHHHhhhhhhhhhhhhhhcccC-CCCccccchhhHHHH-----HHHHHhhhhHHhhcccee Confidence 11100 000001111111111111111111111 1111110 000011222222222 211111111211111111 Q ss_pred EeeccCCccccchhcc-c-----ccccccccccceeeeechheeeeeh--HHHHHHHhcchHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 164 YEKWTDVTPLTVMDAE-D-----GKIPDLDNPQLTIIKYLIKRYAGII--TATNTSLKDTAENILAWLSSWIAKKVVVTR 235 (408) Q Consensus 164 ~~~~~~~~~~~~~~~E-~-----~~~~~~~~~~f~~v~~~~~~~~~~~--~iS~ell~ds~~~~~~~v~~~l~~~~~~~~ 235 (408) + . +... .|.-+ + .-..+.....-..+++....+...- .+.. +.+...+-...+...+.+.++.++ T Consensus 152 ~--~-~~~~--~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~i~~~~~k~~~~~~--is~ell~d~~~l~~~v~~~la~a~ 224 (395) T protein:vir:43 152 T--E-SNSV--EYVRETGFVNNAAPVSEGTQKPYSDLTFELENAPVRTIAHLFK--ASRQILDDASALQSYIDARARYGL 224 (395) T ss_pred c--C-CCce--EEEEEecCCCceeeecCCccccccccceeEEEEeeeeEEEeeh--hhHHHHHhHHHHHHHHHHHHHHHH Confidence 1 1 1111 11111 1 1122221111122333333322221 1111 112122222234444555555555 Q ss_pred HHHHhhccccccchhhhhhHHHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHhhhcccCc---eeeccccccC--Ccccc Q lcl|Aclame:pro 236 NQAIIEVMKAAPKKPTIAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEGK---YLLEPDPTKP--NSYLI 310 (408) Q Consensus 236 ~~~~~~g~g~~~~~~~~~~~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~lkd~~G~---~~~~~~~~~~--~~~~l 310 (408) ...+-...=.|... ......|+...................-......+..+...... ++..+..... 
.-..= T Consensus 225 ~~~~d~~~l~G~g~--~~~~~Gi~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~ 302 (395) T protein:vir:43 225 MLVEECQLLYGNGT--GANLHGIIPQAQAYAPPSGVVVTAEQRIDRIRLAILQAQLAEFPASGIVLNPIDWALIELNKDA 302 (395) T ss_pred HHHHHHHHHhccCC--CCccccccccccccccccccccccchhHHHHHHHHHhhccccCCCcEEEEcHHHHHHHHHhhcc Confidence 54444332222221 11222332221100000000000011111222223334333221 2222210000 00011 Q ss_pred cccceEeeccccccccccCcceEEEEeh--hcceEeeeccceEEEEeccchhhhhhceeeEEEEeeeCcEE-ecccceE- Q lcl|Aclame:pro 311 KGKQVIVVADRWLPNTGSTVYPLYYGDM--SQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKA-TDSEALV- 386 (408) Q Consensus 311 ~G~pv~~~~~~~~~~~~~~~~~~~~gd~--~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v-~~~~a~~- 386 (408) .|.|++.......+...-+ .++++-|+ ..-+.+.++....+ .+.+....+......+... .+--+|. T Consensus 303 ~G~~i~~~~~~~~~~~l~G-~pVv~~~~~~~~~~~~gd~~~~~~--------~~~~~~~~i~~~~~~~~~f~~~~~~~r~ 373 (395) T protein:vir:43 303 ENRYIIGSPQNGTTPTLWR-LPVVETQAITQDEFLTGAFSLGAQ--------IFDRMDIEVLVSTENDKDFENNMVTIRA 373 (395) T ss_pred CCceeccccccCCCceecc-eeeEEcCCCCCCcEEEEeccceEE--------EEEecceEEEEeccccchhhcCcEEEEE Confidence 2455432100000000001 11222221 00011111111100 0111111111111100000 0000111 Q ss_pred --EEEeeccccCCCCccCCCcc Q lcl|Aclame:pro 387 --AGSFSAIADQVGNFKTTTST 406 (408) Q Consensus 387 --~l~~~~~~~~~~~~~~~~~~ 406 (408) .+.+...-|..--..+.++. 
T Consensus 374 ~~r~d~~v~~~~a~~~~~~taa 395 (395) T protein:vir:43 374 EERLAFAVYRPEAFVTGSLTAS 395 (395) T ss_pred EEeeccEEecccceEEEEeccC Confidence 12222222222211111111 No 234 >protein:vir:63741 Length: 468 # NCBI annotation: Cps # Family: family:all:2450 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547622;genbank:GeneID:3783474 Probab=39.64 E-value=1 Score=20.56 Aligned_cols=308 Identities=12% Similarity=0.058 Sum_probs=124.5 Q ss_pred cccccchhhhHHH----HHHHHHHHhhcchhhHHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhh--hhhhcee Q lcl|Aclame:pro 80 GPLNKSENELKDK----FVKDFVNMVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSL--QQYVRVE 153 (408) Q Consensus 80 ~~~~~~~~~~~~~----~~~a~~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l--~~~~~~~ 153 (408) .+....+....+. .-.++.+.+..+.+. +..+-.+|+.+=-..+..+|..+......+ ..-+.+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~e~~~Ks~~agy~~---------~p~~q~~~~AlR~EsL~~~i~~L~~~~~~f~~~~di~k~ 71 (468) T protein:vir:63 1 MPKNNKEEEVKEVNLNSVQEDALKSFTTGYGI---------TPDTQTDAGALRREFLDDQISMLTWTENDLTFYKDIAKK 71 (468) T ss_pred CCCCcchhhccccChhHHHHHHHHHHHcCccc---------CCccccCcchhhhhhhhhhhheeeecccchhhhhhcccc Confidence 1111111111111 112233333322111 001122334443334444444333322222 2222333 Q ss_pred ecccCccceEEeeccCCccccchhcccccccccccccceeeeechheeeeehHHHHHH-HhcchHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 154 SVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTS-LKDTAENILAWLSSWIAKKVV 232 (408) Q Consensus 154 ~~~~~~g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~el-l~ds~~~~~~~v~~~l~~~~~ 232 (408) +..+.-..|......+..+...+++|++-.+ .+++.+.+.....+-++.-..+|.-+ +..+..+..+...+.-...++ T Consensus 72 ~a~stv~~y~~~~~~G~~g~~~f~~E~g~~~-~~~~~~~r~~~~~k~l~~~~~vs~~~~l~n~i~d~~~~~~~~ai~~~a 150 (468) T protein:vir:63 72 PATSTVAKYDVYMQHGKVGHTRFTREIGVAP-VSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTDDAIVNIA 150 (468) T ss_pred hhhhhhhhheeeeccCccccccccccccccc-cCCCceEEEEEEeeeeeeeeeehhhhhhhcchhhHHHHHHHHHHHHHH Confidence 3333333444333445667778899999875 67899999999999999866655543 233455677888888888999 Q 
ss_pred HHHHHHHhhccccccch---hhhhhHHHH---------------------HHHHHHhhhhhccCCCEEEEcHHHHHHHH- Q lcl|Aclame:pro 233 VTRNQAIIEVMKAAPKK---PTIAKFDDV---------------------ITMINTAVDPAIIATSSLLTNQSGLNKLA- 287 (408) Q Consensus 233 ~~~~~~~~~g~g~~~~~---~~~~~~d~i---------------------~~~~~~~l~~~~~~~a~~~~n~~~~~~l~- 287 (408) ..++.+.+.|+..-.+. .-..-+|.| ++.+......+|....-++|+.-+.+.+. T Consensus 151 ~tiE~a~FyGds~l~~s~~~~~glqfDGi~~li~~enviDa~G~~ls~~~lneaa~~i~~gfG~~td~~~~~~v~a~~~~ 230 (468) T protein:vir:63 151 KTIEWASFFGDSDLSDSPEPQAGLEFDGLAKLINQDNVHDARGASLTESLLNQAAVMISKGYGTPTDAYMPVGVQADFVN 230 (468) T ss_pred HHHHHHhhhcccccccCCCccccccccceeEEecCCceeccCCCccCHHHHHHHhhhccccccChhhhhcchhHHhhhhh Confidence 99999999998765321 111223332 22222222334444444566666655542 Q ss_pred hhhcccCceeeccccccCCcccccccceEeeccccccccc--cCcceEEEEehhcceEeeeccceEEEEec------c-- Q lcl|Aclame:pro 288 LVKTAEGKYLLEPDPTKPNSYLIKGKQVIVVADRWLPNTG--STVYPLYYGDMSQAITLFDRENMSLLPTN------I-- 357 (408) Q Consensus 288 ~lkd~~G~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~--~~~~~~~~gd~~~~~~~~~~~~~~i~~~~------~-- 357 (408) ..-. .++.+.. +.......|+||-- .++..+ .-.+..++||... ...++.+.....++ + T Consensus 231 ~~L~--~q~~v~~---~n~~~~~~G~~v~g----~~sa~G~I~l~gs~il~~~~~--l~~~~~~~~~Apsp~~vsaT~~~ 299 (468) T protein:vir:63 231 QQLS--KQTQLVR---DNGNNVSVGFNIQG----FHSARGFIKLHGSTVMENEQI--LDERILALPTAPQPAKVTATQEA 299 (468) T ss_pred hhcC--ceEEEEc---CCCCceeeeecccc----eecceeeeeecCceeeccccC--CCcccccccccccCCccceeeec Confidence 1111 1122221 11112233444421 001000 0001123443321 11111111111000 0 Q ss_pred -chhhhhh---ceeeEEEEeeeCcEEecccceEEEEeeccccCCCCccC------CCcccC Q lcl|Aclame:pro 358 -GAGAFET---DTTKIRVIDRFDVKATDSEALVAGSFSAIADQVGNFKT------TTSTAV 408 (408) Q Consensus 358 -~~~~f~~---~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~~~~~------~~~~~~ 408 (408) ....|.. 
..+.+++...-+..--.|...+-+++.+.......+.+ +.++=| T Consensus 300 ~~~g~~~~~~~a~y~Y~v~~vs~~GES~pS~~vtvTVaa~~dg~~ltIt~~~~~~~~p~yv 360 (468) T protein:vir:63 300 GKKGQFRAEDLAAHEYKVVVSSDDAESIASEVATATVTAKDDGVKLEIELAPMYSSRPQFV 360 (468) T ss_pred ccCCcccCCCcceEEEEEEEECCCCccccccceEEEecCcccceeEEEEecCCCCCcceEE Confidence 0000100 11233333332222333444444444432222211111 001001 No 235 >protein:vir:102823 Length: 470 # NCBI annotation: major structural protein # Family: family:all:2450 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874086;genbank:gi:118197693;genbank:GeneID:4496015 Probab=39.15 E-value=0.81 Score=21.09 Aligned_cols=286 Identities=14% Similarity=0.011 Sum_probs=103.5 Q ss_pred cccccchhhhHHHHHHHHHHHhhcchhhHHHHHHHHhhccccccCceecchhhhhhhhhhhhhhh--hhhhhhceeeccc Q lcl|Aclame:pro 80 GPLNKSENELKDKFVKDFVNMVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYD--SLQQYVRVESVST 157 (408) Q Consensus 80 ~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~--~l~~~~~~~~~~~ 157 (408) .+ -..-+...+...+++.+....+.. +==+.+..++..+..... .+..-....+..+ T Consensus 1 ~~-~~~~~~~~~a~~~al~~a~~~g~A--------------------lR~EsLd~~l~~lt~~~~~ftf~~~i~k~~a~S 59 (470) T protein:vir:10 1 MP-YEHLKHLDEATLKALNAAGQVAES--------------------LEREDLEPEVTQLNVLDTPLTDLLSKNAVKAKA 59 (470) T ss_pred CC-hhHhhhhhHHHHHHHHHhhhcchh--------------------hhhhhhccceeEeeecCccchhhhhcCCchhhh Confidence 00 000011122223334333333321 111111111111111111 1111112222222 Q ss_pred CccceEEeec-cCCccccchhcccccccccccccceeeeechheeeeehHHHHHH---HhcchHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 158 SNGSRVYEKW-TDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTS---LKDTAENILAWLSSWIAKKVVV 233 (408) Q Consensus 158 ~~g~~~~~~~-~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~el---l~ds~~~~~~~v~~~l~~~~~~ 233 (408) .-..|..... .+..+.. ...|++-. +.+++++.+.+...+-++.-..+|.-. 
++....+++..+.+.-.-.+++ T Consensus 60 TV~ey~~~~~rhG~~g~s-~~~E~~l~-~~~d~~~~Rr~v~~K~l~~~~~VT~~a~~~~~n~v~d~~~~~~~dai~~ia~ 137 (470) T protein:vir:10 60 YEHEYNVVTARHDKIGYA-AFREGGLP-RTVEVNVVRRRIRPMLVGHRITVTELATRTTQNGVMQIDELVKREKMIAVAN 137 (470) T ss_pred Hhhhhhhhccccccccce-eecccccC-ccCCCceEEEEEEEEEEeecchhhhhhhhhhhccccchHHHHHHHHHHHHHH Confidence 2222221111 1222222 34677765 467899999999999999998998764 3445558888888888889999 Q ss_pred HHHHHHhhcccccc---c-hhhhhhHHHHHHHHHHhhh-hhccCCCEEEEcHHHHHHHH-hhhc--ccCce--eeccc-c Q lcl|Aclame:pro 234 TRNQAIIEVMKAAP---K-KPTIAKFDDVITMINTAVD-PAIIATSSLLTNQSGLNKLA-LVKT--AEGKY--LLEPD-P 302 (408) Q Consensus 234 ~~~~~~~~g~g~~~---~-~~~~~~~d~i~~~~~~~l~-~~~~~~a~~~~n~~~~~~l~-~lkd--~~G~~--~~~~~-~ 302 (408) +++.+.+.|+..-+ + .....-+|.+.+++...-+ .-+......+ +.+.+.+.. .++- +-|.+ +|.|. . T Consensus 138 tiE~a~FyGDs~l~s~~~g~~~gleFDGl~~lId~~~~~NViDarG~~L-s~~~L~~aa~~I~~~~~fGt~TD~~lp~~v 216 (470) T protein:vir:10 138 EFEYLAFYGDNLLGDDVPGSPNNLQQDGIINIIKRGAPQNVLDAGGRPL-SIDLLWEAESRVVSTQAFANPTAVFISYVD 216 (470) T ss_pred HHHhhhhhhccccccccCcccCceeccchhhhccCCCCccccccCCCCc-cHHHHHHHHhhhcccccccChhhhccchhH Confidence 99999999976432 1 1233447777665531100 1111111122 444444433 2321 11111 12111 1 Q ss_pred ccCCc-ccccccceEeeccccccccccCcceEEEE-ehhcceEeeeccceEEEEeccchhhhhhceeeEEEEeeeCcEE- Q lcl|Aclame:pro 303 TKPNS-YLIKGKQVIVVADRWLPNTGSTVYPLYYG-DMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKA- 379 (408) Q Consensus 303 ~~~~~-~~l~G~pv~~~~~~~~~~~~~~~~~~~~g-d~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v- 379 (408) .+.-. ..+..+.|...+| .. ....| |...+ .. .++.+.+-.+. 
|..+....+ ..+++..+ T Consensus 217 ka~f~~~~~~~qRv~~~~N------~~---~~~~G~~v~~f-~s-a~G~I~L~~s~-----~m~~~~k~~-p~~l~~~v~ 279 (470) T protein:vir:10 217 KLNLQASFYQISRVMTTAD------RR---AGLLGADAQSY-IG-VRGEHSLYPSQ-----FLGDFHKFN-PARFGAEVG 279 (470) T ss_pred HHHHHHhhcCceEEEEecC------CC---ceeeeeeccce-ee-eeeeeeecccc-----cccchhhcC-cccCCcccC Confidence 11111 1122233322111 00 01111 22221 11 13333332211 111000000 01233321 Q ss_pred --ecccceEEEEeeccccCCCCccCCC-----cccC Q lcl|Aclame:pro 380 --TDSEALVAGSFSAIADQVGNFKTTT-----STAV 408 (408) Q Consensus 380 --~~~~a~~~l~~~~~~~~~~~~~~~~-----~~~~ 408 (408) .-|..++- +..+.+....-+.+- +.-| T Consensus 280 ~~aAP~~~~t--v~~t~~~~a~~~~sk~g~~~~~~v 313 (470) T protein:vir:10 280 DFAAPSNSWT--VSTTDNFVTLPYNSGLGDPANTTV 313 (470) T ss_pred CcccCceeEE--eecCCCceeecccCCCCcccCcce Confidence 12221111 111111000000000 0111 No 236 >protein:vir:4830 Length: 397 # NCBI annotation: MPL-7201 # Family: family:all:21 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038327;genbank:gi:9634653;genbank:GeneID:1262632 Probab=38.13 E-value=1.1 Score=20.40 Aligned_cols=356 Identities=13% Similarity=0.009 Sum_probs=98.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccch Q lcl|Aclame:pro 7 VNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKGPLNKSE 86 (408) Q Consensus 7 i~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 86 (408) |+.+++ +++.+.++.+++.++.+..+....+.....+++++++.+++.+.++++.+.................... 
T Consensus 1 Mk~~~e----l~~~~~~~~~~i~~~~~~~~~~~~~~~~~~ee~~~l~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 76 (397) T protein:vir:48 1 MKTSNE----LHDLWVAQGDKVENLNEKLNVAMLDDSVTAEELQAIKNERDTAKMKRDMFKEQYTEARANEVVNMSEEEK 76 (397) T ss_pred CchHHH----HHHHHHHHHHHHHHHHHHHHHhhcchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhcc Confidence 555544 3333444444433332222111122233445667777777777777666554433322222111111111 Q ss_pred hhhHHHHHHHHHHHhhcchhhHHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeecccCccceEEee Q lcl|Aclame:pro 87 NELKDKFVKDFVNMVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEK 166 (408) Q Consensus 87 ~~~~~~~~~a~~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~ 166 (408) .............+.+.................+.+.+ ..-...+-.++...+.+. +++. .++......+++.. T Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~-~~gg~~iP~~~~~~ii~~--~~~~---~~l~~~~~~~~~~~ 150 (397) T protein:vir:48 77 KPLTKSEEEVKAGFVKDFKNLVRGRYQNLLDSKTDASG-SDAGLTIPQDIQTAIHTL--VRQY---DSLQEYVNVENVTT 150 (397) T ss_pred ccccchhhHHHHHHHHHHHHHHhhhhhHHHHHhhccCC-ccccccccHHHHHHHHHH--HHHH---HHHHhhhceeeccC Confidence 11111111111112221111222222222222221111 001111222232222111 1111 11221112222221 Q ss_pred ccCCccccchhccc---ccccccc-cccceeeeechheeeeeh--HHHHHHHhcchHHHHH-HHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 167 WTDVTPLTVMDAED---GKIPDLD-NPQLTIIKYLIKRYAGII--TATNTSLKDTAENILA-WLSSWIAKKVVVTRNQAI 239 (408) Q Consensus 167 ~~~~~~~~~~~~E~---~~~~~~~-~~~f~~v~~~~~~~~~~~--~iS~ell~ds~~~~~~-~v~~~l~~~~~~~~~~~~ 239 (408) ....-+...+..-. .-..+.. .++.+..++...++...- .+.. +.+....-.. 
-+...+...++.++...+ T Consensus 151 ~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~--iS~ell~ds~~~l~~~v~~~l~~~~~~~~ 228 (397) T protein:vir:48 151 LTGSRVYEKWADITGLAKLDDEAGSIGTNDDPKLYPIRYAIKRYAGIST--VTNSLLADSAENILAWLSGWIAKKVVVTR 228 (397) T ss_pred CcceEEEEeecCCCcceeeeccccccccccccceeeEEeeheeeeeehh--hHHHHHhhchHHHHHHHHHHHHHHHHHHH Confidence 11111111111111 1111111 122223344433333211 1111 1121121112 245555555655555544 Q ss_pred hhccccccchhhhhhHHHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHhhhcc---cCceeeccccccC--Ccccccccc Q lcl|Aclame:pro 240 IEVMKAAPKKPTIAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTA---EGKYLLEPDPTKP--NSYLIKGKQ 314 (408) Q Consensus 240 ~~g~g~~~~~~~~~~~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~lkd~---~G~~~~~~~~~~~--~~~~l~G~p 314 (408) -...=.+...... ......-......+.+++.. +..++..+..... .-..-+|.| T Consensus 229 d~~il~G~g~~~~--------------------~~~~~~~d~i~~~~~~l~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~ 288 (397) T protein:vir:48 229 NKAILEAIATLPT--------------------KPTLTKWDDIIDLQAKVDPAIKQTSFFLTNTSGFTALKKVKNAFGDY 288 (397) T ss_pred HHHHhhccccccc--------------------ccccccHHHHHHHHHHhhhhhcCCCEEEECHHHHHHHHHhhcCCCce Confidence 3322222111110 00011111122223344322 1222322211000 000124667 Q ss_pred eEeeccccccccccCcceEEEE------ehh--------c-ceEeeeccceEEEEeccchhhhhhceeeEEEEee----- Q lcl|Aclame:pro 315 VIVVADRWLPNTGSTVYPLYYG------DMS--------Q-AITLFDRENMSLLPTNIGAGAFETDTTKIRVIDR----- 374 (408) Q Consensus 315 v~~~~~~~~~~~~~~~~~~~~g------d~~--------~-~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r----- 374 (408) ++..+ ...+....++| +.. . .+++.+++.+.. .|....+.+..... 
T Consensus 289 i~~~~------~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~~~~gd~~~~~~--------~~~~~~~~i~~~~~~~~~~ 354 (397) T protein:vir:48 289 LMERD------VKSPTGYSIDGFAVKEVADRWLANASSGAMPLYFGDLKQAVT--------LFDRQQMSLLSTNIGGGAF 354 (397) T ss_pred eeccC------cCCCCCceeccceeEEecccccCCcCCCceEEEEEeccceEE--------EEeecceEEEEeccchhhh Confidence 65321 11111122333 100 0 011111111100 01111111111100 Q ss_pred -eCcEEec---ccceEEEEe-----eccccCCCCccCCCcccC Q lcl|Aclame:pro 375 -FDVKATD---SEALVAGSF-----SAIADQVGNFKTTTSTAV 408 (408) Q Consensus 375 -~d~~v~~---~~a~~~l~~-----~~~~~~~~~~~~~~~~~~ 408 (408) .|....+ --.+..+.- -..+..+.......+.+| T Consensus 355 ~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~~~~~~~~~~~~ 397 (397) T protein:vir:48 355 ETDTTKIRVIDRFDVVATDTESFVPASFKAIADQKGNLGSTAV 397 (397) T ss_pred hcCceeEEEEeeeccEEecccceEEEEecccccCCCCccccCC Confidence 0000000 011222221 122344444445555555 No 237 >protein:vir:1328 Length: 392 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047927;swissprot:trembl:q9zwv6;genbank:gi:9631145;uniprot:Q9ZWV6;genbank:GeneID:2715889 Probab=33.74 E-value=1.3 Score=19.89 Aligned_cols=360 Identities=9% Similarity=0.033 Sum_probs=72.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccc Q lcl|Aclame:pro 5 LTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKGPLNK 84 (408) Q Consensus 5 ~~i~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 84 (408) |+...|++.+++ ..++.++++++.++.. ..+...+.+++++.++.+++.+++++++............. .... 
T Consensus 1 m~~~~l~~l~e~----r~~~~~e~~~l~~~~~-~~~~~~e~~~~~~~l~~e~~~l~~~i~~~~e~~~~~~~~~~--~~~~ 73 (392) T protein:vir:13 1 MDATTLSANFEA----RERATAELRSLTDEFA-GKEMTAEAREKEERLLTAVADFDGRIKRGIDAIKATDAVTS--LLSG 73 (392) T ss_pred CCHHHHHHHHHH----HHHHHHHHHHHHHHhh-cccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--Hhcc Confidence 666655444333 3333444444444322 22233444455555566666665555432221111000000 0000 Q ss_pred chhhhHHHHHHHHHHHhhcchhh---HHHHHHHHhhccccccCcee--cchhhhhhhhhhhhhhhhhhhhhceeecccCc Q lcl|Aclame:pro 85 SENELKDKFVKDFVNMVRNPMAF---MNTVSSKTETSGSDSAAGLT--IPQDIRTMINTLVRQYDSLQQYVRVESVSTSN 159 (408) Q Consensus 85 ~~~~~~~~~~~a~~~~~~~~~~~---~~~~~~~a~~~~t~~~gg~~--vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~ 159 (408) ....... .... ........ ....+.++........++.. -|..+...++.. .+..+....++.. . T Consensus 74 ~~~~~~~--~~~~--~~~~~~~~~r~g~~~~~~~~~~~~~~~~~t~~~~g~~~~~~~~~~-----~i~~~~~~~~~l~-~ 143 (392) T protein:vir:13 74 LQGSGSG--AQRS--ADHDDDAVLRAGNLGEARSFEFAPEKRDGTKAGNPNVLSRTLYGQ-----LIAQAVERSAIMR-G 143 (392) T ss_pred cCCcccc--hhhh--hhHHHHHHHhccchhhhHHHHhhhhhhcccccCCCccccccchHH-----HHHHHHhhhhhhh-h Confidence 0000000 0000 00000000 00000111110000000000 000111111111 1111111111100 0 Q ss_pred cceEEeeccCCc-cccchhccccc--ccccccccceeeeechheeeeehHHHHHHHhcchHHHHHH-HHHHHHHHHHHHH Q lcl|Aclame:pro 160 GSRVYEKWTDVT-PLTVMDAEDGK--IPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAENILAW-LSSWIAKKVVVTR 235 (408) Q Consensus 160 g~~~~~~~~~~~-~~~~~~~E~~~--~~~~~~~~f~~v~~~~~~~~~~~~iS~ell~ds~~~~~~~-v~~~l~~~~~~~~ 235 (408) .-..++...... ......+.... 
..+.....-...++....+...---..--+.+....-..+ +...+...++.++ T Consensus 144 ~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~f~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i 223 (392) T protein:vir:13 144 GASTFTTSDANPMDFTVITGRATAGIVGETAEIPESYPATTQRSMGGFKYGFASVVSYEFATDQVLDLVGFLVSDAGPAI 223 (392) T ss_pred cceeeecCCCceeEEEEEcCCcceeeecccccccccccceeeEEeeeeeEEeeehhHHHHHhcchHHHHHHHHHHHHHHH Confidence 001111111100 00000110000 0110000001111111111111000000011111110111 2333334444444 Q ss_pred HHHHhhccccccchhhhhhHHHHHHHHHHhhhhhc-cCCCEEEEcHHHHHHHHhhhc---ccCceeeccc-------ccc Q lcl|Aclame:pro 236 NQAIIEVMKAAPKKPTIAKFDDVITMINTAVDPAI-IATSSLLTNQSGLNKLALVKT---AEGKYLLEPD-------PTK 304 (408) Q Consensus 236 ~~~~~~g~g~~~~~~~~~~~d~i~~~~~~~l~~~~-~~~a~~~~n~~~~~~l~~lkd---~~G~~~~~~~-------~~~ 304 (408) ...+-...=.|... .....|+..... ..... ...+..+.-......+..|+. .++.++..+. +.+ T Consensus 224 ~~~~d~~~l~G~Gt---~~p~Gil~~~~~-~~~~~~~~~~~~~~~d~l~~~~~~l~~~~~~~a~~v~n~~~~~~l~~lkd 299 (392) T protein:vir:13 224 GDAMGRHFLTGTGT---GQPRGILTDATG-ANAAFGEADADSKVSDALIDLFHEVPSAYRKNAKFVVNDLRAAQMRKLKD 299 (392) T ss_pred HHHHHHHHhcccCC---cccccccccccc-ccccccccccccccHHHHHHHHHhhhhhhhcCCEEEEcHHHHHHHHHhhc Confidence 33332222222211 111222211100 00000 000111111111111112221 1222332221 111 Q ss_pred CCcccccccceEeecccccc--ccccCcceEEEEehh--cceEeeeccceEEEEeccchhhhhhceeeEEEEeeeCc--- Q lcl|Aclame:pro 305 PNSYLIKGKQVIVVADRWLP--NTGSTVYPLYYGDMS--QAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDV--- 377 (408) Q Consensus 305 ~~~~~l~G~pv~~~~~~~~~--~~~~~~~~~~~gd~~--~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~--- 377 (408) . .|.|++.. +...+ ....| .++++-|+- .-+++.+ |....+..+...+++. 
T Consensus 300 ~-----~G~~l~~~-~~~~g~~~~l~G-~Pv~~~~~~~~~~i~~Gd---------------f~~~~i~~~~~~~i~~~~~ 357 (392) T protein:vir:13 300 A-----NGQYLWQS-ALTVGAPDTFNG-KVVETDDGMPADKVLFAD---------------LSKYRVRFAGSLRVDRSVD 357 (392) T ss_pred c-----CCceeecC-CcCCCCCceecc-eeeEEcCCCCCCcEEEee---------------ccceeEEeecceEEEeecc Confidence 1 24444321 10000 00001 112221110 0011111 1111111111111110 Q ss_pred -----EEecccceEEEEeeccccCCCCccCCCccc Q lcl|Aclame:pro 378 -----KATDSEALVAGSFSAIADQVGNFKTTTSTA 407 (408) Q Consensus 378 -----~v~~~~a~~~l~~~~~~~~~~~~~~~~~~~ 407 (408) ....=.++..+.+...-|..--....++.| T Consensus 358 ~~~~~~~~~~r~~~r~d~~~~~~~A~~~~~~~~aa 392 (392) T protein:vir:13 358 AKFSTDQIVYRFLQRADGLLVDARGAKVLTVTPAA 392 (392) T ss_pred ccccCCcEEEEEEEEeccEEecccceEEEEeeccC Confidence 000001222222333223222211222222 No 238 >protein:vir:6901 Length: 522 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861877;genbank:gi:32453668;genbank:GeneID:1494303 Probab=30.77 E-value=1.5 Score=19.54 Aligned_cols=346 Identities=12% Similarity=0.090 Sum_probs=123.7 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccc Q lcl|Aclame:pro 1 MGVKLTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKG 80 (408) Q Consensus 1 M~~~~~i~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (408) |+...+.++|++++..+.+. +.. .++..-+..+- +++ ++.|.+++.... 
T Consensus 1 ~~~~~~~e~l~~kw~p~l~~--------------~~~--~~~~~~~~~~~---a~l--~enq~~~~~~~~---------- 49 (522) T protein:vir:69 1 MTTIKTKAQLVDKWKELLEG--------------EGL--PEIANSKQAII---AKI--FENQEKDFEVSP---------- 49 (522) T ss_pred CCccchHHHHHHhhHHHhcC--------------CCC--Cccccchhhhh---hhh--hhhhhHHhhccc---------- Confidence 88887777788777664322 110 11111111110 000 111111111000 Q ss_pred ccccchhhhHHHHHHHHHHHhhcch----hhHHHHHHHHhhccccccCceecchhhhhhhhhhh---hhhhhhhhhhcee Q lcl|Aclame:pro 81 PLNKSENELKDKFVKDFVNMVRNPM----AFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLV---RQYDSLQQYVRVE 153 (408) Q Consensus 81 ~~~~~~~~~~~~~~~a~~~~~~~~~----~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~---~~~~~l~~~~~~~ 153 (408) ....+....+|..++.... ..... .....++.+ +++ ..+.+.++..+ -+.....+++.+. T Consensus 50 ------~~~~~~~~~~~~~~l~ea~~~~~~~~~~---~~i~es~~t-~~v---~~~~P~li~lvrRa~p~LIa~DIwGVQ 116 (522) T protein:vir:69 50 ------EYKDEKIAQAFGSFLTEAEIGGDHGYNA---QNIAAGQTS-GAV---TQIGPAVMGMVRRAIPNLIAFDICGVQ 116 (522) T ss_pred ------ccchhHHHHhhhhhhhhhccccccCCCc---ccccccccc-ccc---ccccchHHHHHHHHHhhhhhhhceeec Confidence 0111223344444433210 00000 000111111 111 12333333333 3344456777888 Q ss_pred ecccCccceEEee-----ccC-----------Cccccch----------------------------------------- Q lcl|Aclame:pro 154 SVSTSNGSRVYEK-----WTD-----------VTPLTVM----------------------------------------- 176 (408) Q Consensus 154 ~~~~~~g~~~~~~-----~~~-----------~~~~~~~----------------------------------------- 176 (408) ||++++|-+--.+ ... 
..+...| T Consensus 117 PMTgPTGLIFAMRsrY~~q~~~~~~~eaf~~~neadt~fSG~~~~t~~~~~~~~~~t~~G~~~~~~~~~~gt~~~~~~a~ 196 (522) T protein:vir:69 117 PMNSPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPDAMFSGQGAAKKFPALAASTQTKVGDIYTHFFQETGTVYLQASAQ 196 (522) T ss_pred cCCchhhhheeeeeeccCCcccCccccccccccccccccccccccccccccccccccccccccccccccccceeeecccC Confidence 8877776221111 000 0000000 Q ss_pred ----------------------------------hccccc---ccccccccceeeeechheeee-------ehHHHHHHH Q lcl|Aclame:pro 177 ----------------------------------DAEDGK---IPDLDNPQLTIIKYLIKRYAG-------IITATNTSL 212 (408) Q Consensus 177 ----------------------------------~~E~~~---~~~~~~~~f~~v~~~~~~~~~-------~~~iS~ell 212 (408) .+.++. ....+...|.++.|+..|..+ ...+|-||. T Consensus 197 ~t~~~t~~~~~~~~~ai~s~~~~~~~y~~g~GmsTa~aEal~~lggss~~~f~EMaFsIeKvTVtAKSRaLKAEYTiELA 276 (522) T protein:vir:69 197 VTISSSADDAAKLDAEIIKQMEAGALVEIAEGMATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKAAYSIELA 276 (522) T ss_pred CcCCCCCcccccccchhccccccccceeeccccchhhhhhcccCCCCcccchhhhcceEeeEEEeeecccccccccHHHH Confidence 000000 000111224445555555443 356899998 Q ss_pred hc----chHHHHHHHHHHHHHHHHHHHHHHHhhccc--------ccc----chhhhhhHH----------------HHHH Q lcl|Aclame:pro 213 KD----TAENILAWLSSWIAKKVVVTRNQAIIEVMK--------AAP----KKPTIAKFD----------------DVIT 260 (408) Q Consensus 213 ~d----s~~~~~~~v~~~l~~~~~~~~~~~~~~g~g--------~~~----~~~~~~~~d----------------~i~~ 260 (408) +| ...|.++.|.+-|+..|...+|+.|+.-.. +.. ...+..+.. .+.. 
T Consensus 277 QDLKAIHGLDAEtELaNILSTEImlEINReii~~i~~sa~~~~~g~t~~~~~~~Gv~Dl~~~~~~~~~rw~~e~~k~L~~ 356 (522) T protein:vir:69 277 QDLRAVHGMDADAELSGILATEIMLEINREVVDWINYSAQVGKSGMTNIVGSKAGVFDFQDPIDIRGARWAGESFKALLF 356 (522) T ss_pred HHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhhheeeccccccccccccceeecccccccccchhHHHHHHHHHH Confidence 88 346889999999999999999998874321 000 011111111 1111 Q ss_pred HH----HHh-hhhhccCCCEEEEcHHHHHHHHhhh-----cccC-ceeeccccccCC-ccccc-ccceEeeccccccccc Q lcl|Aclame:pro 261 MI----NTA-VDPAIIATSSLLTNQSGLNKLALVK-----TAEG-KYLLEPDPTKPN-SYLIK-GKQVIVVADRWLPNTG 327 (408) Q Consensus 261 ~~----~~~-l~~~~~~~a~~~~n~~~~~~l~~lk-----d~~G-~~~~~~~~~~~~-~~~l~-G~pv~~~~~~~~~~~~ 327 (408) .+ +.. ..+.+...-.+|+++.....|...- .++| ..-|..+.+... -+.|. ||+|++.++ . T Consensus 357 ~i~~~an~i~~~T~rg~~n~~i~S~~Va~~L~~~~~~~~~~~~~~~~g~~~d~~~~~~~G~l~~~~~vy~D~y--~---- 430 (522) T protein:vir:69 357 QIDKEAVEIARQTGRGEGNFIIASRNVVNVLASVDTGISYAAQGLASGFNTDTTKSVFAGVLGGKYRVYIDQY--A---- 430 (522) T ss_pred HHHHHHHHHHHhcccccccEEEEchhHHHHHhhcccccccccccccccccccCCCceEEEEecCceEEEecCC--C---- Confidence 11 111 1223333446899999988887531 1111 122222221111 13443 567776322 1 Q ss_pred cCcceEEEEehhcceEeeeccceEEEEeccchhhh------hhceeeEEEEeeeCcEEecccceEE-E------Eeeccc Q lcl|Aclame:pro 328 STVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAF------ETDTTKIRVIDRFDVKATDSEALVA-G------SFSAIA 394 (408) Q Consensus 328 ~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f------~~~~~~~r~~~r~d~~v~~~~a~~~-l------~~~~~~ 394 (408) ..+-+++|- + .-.--.-.+-..|+..-.+ .+-+-.+-+..|++.. .+|= +. . 
.+...+ T Consensus 431 -~~dy~~vG~-K----G~~~~~~glfyaPYv~l~~~~~~dp~sfqP~~g~~tRY~l~-vNP~--~~~~~~~~~~ri~~g~ 501 (522) T protein:vir:69 431 -KQDYFTVGY-K----GANEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGIG-VNPF--AESSLQAPGARIQSGM 501 (522) T ss_pred -CcceEEEEE-e----CCcccccceeeccccccccccccCCccccceeeeeeeecee-ecCc--ccccCCcccceeeccc Confidence 122233331 0 0000000111112111000 0112222233344332 1221 10 0 000000 Q ss_pred cCCCCccCCCcccC Q lcl|Aclame:pro 395 DQVGNFKTTTSTAV 408 (408) Q Consensus 395 ~~~~~~~~~~~~~~ 408 (408) |+..+ +..+.. T Consensus 502 p~~~~---~~~~n~ 512 (522) T protein:vir:69 502 PSILN---SLGKNA 512 (522) T ss_pred chhhc---ccCCcc Confidence 10000 111111 No 239 >protein:vir:4600 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058445;genbank:gi:9635171;genbank:GeneID:1262708 Probab=30.77 E-value=1.5 Score=19.54 Aligned_cols=372 Identities=9% Similarity=-0.010 Sum_probs=73.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccc--c Q lcl|Aclame:pro 7 VNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKGPLN--K 84 (408) Q Consensus 7 i~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~ 84 (408) ||.+++.. ++++++++++....++.... .-++-.++.+.++.++++++.++++++................ . 
T Consensus 1 mk~~~em~----~~l~el~~~~~~~~~e~~~~--~~~~~~e~~~~~~~ev~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~ 74 (415) T protein:vir:46 1 MKTKEELQ----SEISDIKRQIDLKVKYATRA--LNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDRTSENNQQS 74 (415) T ss_pred CchHHHHH----HHHHHHHHHHHHHHHHHHHH--hchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccc Confidence 44333322 33333333333322221110 0011112234444555555555554433322111111110000 0 Q ss_pred chhhhHHHHHHHHHH--Hhhc-chhhHHHHHHHHhh----ccccccCceecchhhhhhhhhhhhhhhhhhhhhceeeccc Q lcl|Aclame:pro 85 SENELKDKFVKDFVN--MVRN-PMAFMNTVSSKTET----SGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVST 157 (408) Q Consensus 85 ~~~~~~~~~~~a~~~--~~~~-~~~~~~~~~~~a~~----~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~ 157 (408) ............... .... ........+.++.. ..... ....+++.-...++........+..+-...++.. T Consensus 75 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~t~~g~~~iP~~~~~~ii~~~~~~~~l~~ 153 (415) T protein:vir:46 75 VEVNEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDI-QGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDK 153 (415) T ss_pred cccchhhhhHHHHHHHHHHHhhhhhhhhHHHHHHHHHHHhhhhhh-hhccccccCCcccccHHHHHHHHHHHHhhhhhhh Confidence 000000001000000 0000 00000111111110 00000 0111121111111111111111111111111211 Q ss_pred CccceEEeeccCCccccchhc--ccccccccc-cccceeeeechheeeeeh--HHHHHHHhcchHHHHHH-HHHHHHHHH Q lcl|Aclame:pro 158 SNGSRVYEKWTDVTPLTVMDA--EDGKIPDLD-NPQLTIIKYLIKRYAGII--TATNTSLKDTAENILAW-LSSWIAKKV 231 (408) Q Consensus 158 ~~g~~~~~~~~~~~~~~~~~~--E~~~~~~~~-~~~f~~v~~~~~~~~~~~--~iS~ell~ds~~~~~~~-v~~~l~~~~ 231 (408) ....+++......-+...+.+ +-.-..+.. .+..+..++..-.+...- .+.. 
+.+...+-..+ +...+...+ T Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~~~~~v~~~~~k~~~~~~--iS~ell~ds~~~l~~~i~~~l 231 (415) T protein:vir:46 154 YVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFR--ISREAIEDAKVNVLQELKLWM 231 (415) T ss_pred hcceeeccCCceeEEEEEecCCcceeecccccccccccccceeeEEeeeeeeEeeeh--hhHHHHhhchHHHHHHHHHHH Confidence 111111111110001100000 001111110 011111122222221111 1110 11111111111 333333444 Q ss_pred HHHHHHHHhhccccccchhhhhhHHHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHhhhccc---Cceeeccc------- Q lcl|Aclame:pro 232 VVTRNQAIIEVMKAAPKKPTIAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAE---GKYLLEPD------- 301 (408) Q Consensus 232 ~~~~~~~~~~g~g~~~~~~~~~~~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~lkd~~---G~~~~~~~------- 301 (408) +.++...+-...=.+........ .+...... ...........-.+....+..+.+.. ..+++.+. T Consensus 232 ~~~i~~~~d~~il~g~g~g~~~~--~~~~~~~~---~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~L~~ 306 (415) T protein:vir:46 232 ARTIAATRNKAIIDVITKGSTGS--TSSGFEKE---GKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDK 306 (415) T ss_pred HHHHHHHHHHHHhhccccCCccc--cccccccc---cceeccccccchHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHH Confidence 44333333222222211111100 00000000 00000000011111122222222221 11222221 Q ss_pred cccCCcccccccceEeec-cccccccccCcceEEEEe------hhc-ceEeeeccceEEEEeccchhhhhhceeeEEEEe Q lcl|Aclame:pro 302 PTKPNSYLIKGKQVIVVA-DRWLPNTGSTVYPLYYGD------MSQ-AITLFDRENMSLLPTNIGAGAFETDTTKIRVID 373 (408) Q Consensus 302 ~~~~~~~~l~G~pv~~~~-~~~~~~~~~~~~~~~~gd------~~~-~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~ 373 (408) +.+.. |.|++..+ ....+....+.. +++.+ -.. .+++.++..+.+..+ +..+.+.... 
T Consensus 307 lkd~~-----G~~i~~~~~~~~~~~~l~G~p-V~~~~~~~~~~~~~~~~~~gd~~~~~~~~~--------~~~~~v~~~~ 372 (415) T protein:vir:46 307 MKDKL-----GNYLIQPDVKEKTQQRLLGAK-IEILPDEVLGQKGNNTLIIGNLKDAIVLFD--------RSQYQASWTD 372 (415) T ss_pred hhccC-----CCeeeccCcCCCCCcccccee-eEEeccccccCCCccEEEEEehhccEEEEe--------ecceEEEeec Confidence 11111 23332110 000111111111 22211 000 122233332211111 1111111111 Q ss_pred ee-CcEEecccceEEEEee-----ccccCCCCccCCCcccC Q lcl|Aclame:pro 374 RF-DVKATDSEALVAGSFS-----AIADQVGNFKTTTSTAV 408 (408) Q Consensus 374 r~-d~~v~~~~a~~~l~~~-----~~~~~~~~~~~~~~~~~ 408 (408) .. +.... .++..+.+. ++.--..+++++.+.++ T Consensus 373 ~~~~~~~~--~~~~r~d~~v~~~~a~~~~~~~~~~~~~~~~ 411 (415) T protein:vir:46 373 YMHFGECL--MIAVRQDCRILDYKSAIVIEYDDSERGEGDL 411 (415) T ss_pred cccCceEE--EEEEEeccEEeccccEEEEEeeccCCCCCCc Confidence 00 00111 122222222 22222233333334444 No 240 >protein:vir:4700 Length: 415 # NCBI annotation: phi PVL ORF 7 homologue # Family: family:all:21 # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061632;genbank:gi:9635719;genbank:GeneID:1262976 Probab=30.77 E-value=1.5 Score=19.54 Aligned_cols=372 Identities=9% Similarity=-0.010 Sum_probs=73.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccc--c Q lcl|Aclame:pro 7 VNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKGPLN--K 84 (408) Q Consensus 7 i~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~ 84 (408) ||.+++.. ++++++++++....++.... .-++-.++.+.++.++++++.++++++................ . 
T Consensus 1 mk~~~em~----~~l~el~~~~~~~~~e~~~~--~~~~~~e~~~~~~~ev~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~ 74 (415) T protein:vir:47 1 MKTKEELQ----SEISDIKRQIDLKVKYATRA--LNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDRTSENNQQS 74 (415) T ss_pred CchHHHHH----HHHHHHHHHHHHHHHHHHHH--hchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccc Confidence 44333322 33333333333322221110 0011112234444555555555554433322111111110000 0 Q ss_pred chhhhHHHHHHHHHH--Hhhc-chhhHHHHHHHHhh----ccccccCceecchhhhhhhhhhhhhhhhhhhhhceeeccc Q lcl|Aclame:pro 85 SENELKDKFVKDFVN--MVRN-PMAFMNTVSSKTET----SGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVST 157 (408) Q Consensus 85 ~~~~~~~~~~~a~~~--~~~~-~~~~~~~~~~~a~~----~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~ 157 (408) ............... .... ........+.++.. ..... ....+++.-...++........+..+-...++.. T Consensus 75 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~t~~g~~~iP~~~~~~ii~~~~~~~~l~~ 153 (415) T protein:vir:47 75 VEVNEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDI-QGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDK 153 (415) T ss_pred cccchhhhhHHHHHHHHHHHhhhhhhhhHHHHHHHHHHHhhhhhh-hhccccccCCcccccHHHHHHHHHHHHhhhhhhh Confidence 000000001000000 0000 00000111111110 00000 0111121111111111111111111111111211 Q ss_pred CccceEEeeccCCccccchhc--ccccccccc-cccceeeeechheeeeeh--HHHHHHHhcchHHHHHH-HHHHHHHHH Q lcl|Aclame:pro 158 SNGSRVYEKWTDVTPLTVMDA--EDGKIPDLD-NPQLTIIKYLIKRYAGII--TATNTSLKDTAENILAW-LSSWIAKKV 231 (408) Q Consensus 158 ~~g~~~~~~~~~~~~~~~~~~--E~~~~~~~~-~~~f~~v~~~~~~~~~~~--~iS~ell~ds~~~~~~~-v~~~l~~~~ 231 (408) ....+++......-+...+.+ +-.-..+.. .+..+..++..-.+...- .+.. 
+.+...+-..+ +...+...+ T Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~~~~~v~~~~~k~~~~~~--iS~ell~ds~~~l~~~i~~~l 231 (415) T protein:vir:47 154 YVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFR--ISREAIEDAKVNVLQELKLWM 231 (415) T ss_pred hcceeeccCCceeEEEEEecCCcceeecccccccccccccceeeEEeeeeeeEeeeh--hhHHHHhhchHHHHHHHHHHH Confidence 111111111110001100000 001111110 011111122222221111 1110 11111111111 333333444 Q ss_pred HHHHHHHHhhccccccchhhhhhHHHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHhhhccc---Cceeeccc------- Q lcl|Aclame:pro 232 VVTRNQAIIEVMKAAPKKPTIAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAE---GKYLLEPD------- 301 (408) Q Consensus 232 ~~~~~~~~~~g~g~~~~~~~~~~~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~lkd~~---G~~~~~~~------- 301 (408) +.++...+-...=.+........ .+...... ...........-.+....+..+.+.. ..+++.+. T Consensus 232 ~~~i~~~~d~~il~g~g~g~~~~--~~~~~~~~---~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~L~~ 306 (415) T protein:vir:47 232 ARTIAATRNKAIIDVITKGSTGS--TSSGFEKE---GKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDK 306 (415) T ss_pred HHHHHHHHHHHHhhccccCCccc--cccccccc---cceeccccccchHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHH Confidence 44333333222222211111100 00000000 00000000011111122222222221 11222221 Q ss_pred cccCCcccccccceEeec-cccccccccCcceEEEEe------hhc-ceEeeeccceEEEEeccchhhhhhceeeEEEEe Q lcl|Aclame:pro 302 PTKPNSYLIKGKQVIVVA-DRWLPNTGSTVYPLYYGD------MSQ-AITLFDRENMSLLPTNIGAGAFETDTTKIRVID 373 (408) Q Consensus 302 ~~~~~~~~l~G~pv~~~~-~~~~~~~~~~~~~~~~gd------~~~-~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~ 373 (408) +.+.. |.|++..+ ....+....+.. +++.+ -.. .+++.++..+.+..+ +..+.+.... 
T Consensus 307 lkd~~-----G~~i~~~~~~~~~~~~l~G~p-V~~~~~~~~~~~~~~~~~~gd~~~~~~~~~--------~~~~~v~~~~ 372 (415) T protein:vir:47 307 MKDKL-----GNYLIQPDVKEKTQQRLLGAK-IEILPDEVLGQKGNNTLIIGNLKDAIVLFD--------RSQYQASWTD 372 (415) T ss_pred hhccC-----CCeeeccCcCCCCCcccccee-eEEeccccccCCCccEEEEEehhccEEEEe--------ecceEEEeec Confidence 11111 23332110 000111111111 22211 000 122233332211111 1111111111 Q ss_pred ee-CcEEecccceEEEEee-----ccccCCCCccCCCcccC Q lcl|Aclame:pro 374 RF-DVKATDSEALVAGSFS-----AIADQVGNFKTTTSTAV 408 (408) Q Consensus 374 r~-d~~v~~~~a~~~l~~~-----~~~~~~~~~~~~~~~~~ 408 (408) .. +.... .++..+.+. ++.--..+++++.+.++ T Consensus 373 ~~~~~~~~--~~~~r~d~~v~~~~a~~~~~~~~~~~~~~~~ 411 (415) T protein:vir:47 373 YMHFGECL--MIAVRQDCRILDYKSAIVIEYDDSERGEGDL 411 (415) T ss_pred cccCceEE--EEEEEeccEEeccccEEEEEeeccCCCCCCc Confidence 00 00111 122222222 22222233333334444 No 241 >protein:vir:80491 Length: 467 # NCBI annotation: Cps # Family: family:all:2450 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468466;genbank:gi:157325041;genbank:GeneID:5601449 Probab=29.61 E-value=1.6 Score=19.40 Aligned_cols=308 Identities=11% Similarity=0.042 Sum_probs=124.3 Q ss_pred cccccchhhhHHH---HHHHHHHHhhcchhhHHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhh--hhhhceee Q lcl|Aclame:pro 80 GPLNKSENELKDK---FVKDFVNMVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSL--QQYVRVES 154 (408) Q Consensus 80 ~~~~~~~~~~~~~---~~~a~~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l--~~~~~~~~ 154 (408) .+....++...+. .-..|.+.+..+.+. 
+-.+-.+|+.+=-+.+..+|..+......+ ..-+.+.+ T Consensus 1 ~~~~~~~~~~~~n~~~~~e~~~Ks~~agy~~---------~p~tq~~~~AlR~EsL~~~i~~Lt~~~~~f~~~~di~k~~ 71 (467) T protein:vir:80 1 MPKNNKEEVKEVNLNSVQEDALKSFTTGYGI---------TPDTQTDAGALRREFLDDQISMLTWTENDLTFYKDIAKKP 71 (467) T ss_pred CCCcchhhhhhcccccCHHHHHHHHHccccc---------CCccccCcchhhhhhhhhhhheeeccccchhhhhhcccch Confidence 1111111100000 111223333222111 001123344444444444444433322222 22223333 Q ss_pred cccCccceEEeeccCCccccchhcccccccccccccceeeeechheeeeehHHHHHH-HhcchHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 155 VSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTS-LKDTAENILAWLSSWIAKKVVV 233 (408) Q Consensus 155 ~~~~~g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~~~~~~~iS~el-l~ds~~~~~~~v~~~l~~~~~~ 233 (408) ..+.-..|......+..+...+++|++-.+ .+++.+.+.....+-++.-..+|.-+ +..+..+..+...+.-...++. T Consensus 72 a~stv~~y~~~~~~G~~g~~~f~~E~g~~~-~~~~~~~r~~~~~k~l~~~~~vs~~~~l~n~i~d~~~~~~~~ai~~~a~ 150 (467) T protein:vir:80 72 ATSTVAKYDVYMQHGKVGHTRFTREIGVAP-VSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTDDAIVNIAK 150 (467) T ss_pred hhhhhhhheeeeccCccccccccccccccc-cCCCceEEEEEEeeeeeeeeeehhhhhhhcchhhHHHHHHHHHHHHHHH Confidence 333333444333445667778899999875 67899999999999999866655543 2334556778888888889999 Q ss_pred HHHHHHhhccccccch---hhhhhHHHH---------------------HHHHHHhhhhhccCCCEEEEcHHHHHHHH-h Q lcl|Aclame:pro 234 TRNQAIIEVMKAAPKK---PTIAKFDDV---------------------ITMINTAVDPAIIATSSLLTNQSGLNKLA-L 288 (408) Q Consensus 234 ~~~~~~~~g~g~~~~~---~~~~~~d~i---------------------~~~~~~~l~~~~~~~a~~~~n~~~~~~l~-~ 288 (408) .++.+.+.|+..-.+. .-..-+|.| ++.+......+|....-++|+.-+.+.+. . 
T Consensus 151 tiE~a~FyGds~l~~s~~~~~glqfDGi~~li~~enviDa~G~~ls~~~lneaa~~i~~gfG~~td~~~p~~v~a~~~~~ 230 (467) T protein:vir:80 151 TIEWASFFGDSDLSDSPEPQAGLEFDGLAKLINQDNVHDARGASLTESLLNQAAVMISKGYGTPTDAYMPVGVQADFVNQ 230 (467) T ss_pred HHHHHhhhcccccccCCCccccccccceeEEecCCceeccCCCccCHHHHHHHhhhccccccChhhhhcchhHHhhhhhh Confidence 9999999998765321 111223332 22222222334444444566666655542 1 Q ss_pred hhcccCceeeccccccCCcccccccceEeeccccccccc--cCcceEEEEehhcceEeeeccceEEEEec------c--- Q lcl|Aclame:pro 289 VKTAEGKYLLEPDPTKPNSYLIKGKQVIVVADRWLPNTG--STVYPLYYGDMSQAITLFDRENMSLLPTN------I--- 357 (408) Q Consensus 289 lkd~~G~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~--~~~~~~~~gd~~~~~~~~~~~~~~i~~~~------~--- 357 (408) .-. .++.+.. +.......|+||-- .+...+ .-.+..++||... ...++.+.....++ + T Consensus 231 ~L~--~q~~v~~---~n~~~~~~G~~v~g----~~sa~G~I~l~gs~il~~~~~--l~~~~~~~~~Apsp~~vsaT~~~~ 299 (467) T protein:vir:80 231 QLS--KQTQLVR---DNGNNVSVGFNIQG----FHSARGFIKLHGSTVMENEQI--LDERILALPTAPQPAKVTATQEAG 299 (467) T ss_pred hcC--ceEEEEc---CCCCceeeeecccc----eecceeeeeecCceeeccccC--CCcccccccccccCCccceeeecc Confidence 111 1122221 11112233444421 001000 0001123443321 11111111111000 0 Q ss_pred chhhhhh---ceeeEEEEeeeCcEEecccceEEEEeeccccCCCCccC------CCcccC Q lcl|Aclame:pro 358 GAGAFET---DTTKIRVIDRFDVKATDSEALVAGSFSAIADQVGNFKT------TTSTAV 408 (408) Q Consensus 358 ~~~~f~~---~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~~~~~------~~~~~~ 408 (408) ....|.. 
..+.+++...-+..--.|...+-+++.+.......+.+ +.++=| T Consensus 300 ~~g~~~~~~~a~y~Y~v~~vs~~GES~pS~~vtvTVaa~~dg~~ltIt~~~~~~~~p~yv 359 (467) T protein:vir:80 300 KKGQFRAEDLAAHEYKVVVSSDDAESIASEVATATVTAKDDGVKLEIELAPMYSSRPQFV 359 (467) T ss_pred cCCcccCCCcceEEEEEEEECCCCccccccceEEEecCcccceeEEEEecCCCCCcceEE Confidence 0000100 11233333332222333444444444432222211111 001001 No 242 >protein:vir:1268 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690760;genbank:gi:22855000;genbank:GeneID:955203 Probab=28.77 E-value=1.7 Score=19.30 Aligned_cols=361 Identities=12% Similarity=0.055 Sum_probs=108.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccc---cccc Q lcl|Aclame:pro 5 LTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREE---EKGP 81 (408) Q Consensus 5 ~~i~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~ 81 (408) |-++ |++++.++.++++++.++++.++.+.+ .++.+.+.++++.+.++++.+.+................ .... T Consensus 1 ~~~~-m~k~l~el~~~~~~~~~~~~~~~~~~~--~ee~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (397) T protein:vir:12 1 MPMQ-MSKKEIALRQQFTEKKQQADKALQEGN--TDEARALLDEVKQLKNQIELMTEGRSLDVPDLPGGVNFVPEQERNP 77 (397) T ss_pred CCCc-HHHHHHHHHHHHHHHHHHHHHHhhhhh--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhh Confidence 7766 777788899999999988888766543 456777777888887777776554443332222111111 1111 Q ss_pred cccchhhhH-HHHHHHHHH-HhhcchhhHHHHHHHHhhccccccCceecchhhhhhhhhhhhhhhhhhhhhceeecccCc Q lcl|Aclame:pro 82 LNKSENELK-DKFVKDFVN-MVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSN 159 (408) Q Consensus 82 ~~~~~~~~~-~~~~~a~~~-~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~ 159 (408) ......... 
........+ +.+...+.....+.+..............+..-..-++........+..+....++...- T Consensus 78 ~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~lvP~~~~~~ii~~~~~~~~l~~~~ 157 (397) T protein:vir:12 78 EGQRSQGQGNEERQQQYSKAFLKGLRGKRLTDEERDLLDSPEFRAMSGINDEDGGILIPEDIGRQIHEFKRQFEPLEQYV 157 (397) T ss_pred cccccccchhhHHHHHHHHHHHHHHhccCCcHHHHHHHhhhhhhhccccccccCcccCchhHHHHHHHhhhhhhhHHhhc Confidence 111111111 111111111 111111111122222111110000000111111111111111111111111111121111 Q ss_pred cceEEeeccCCccccchh--ccccccccccc-ccceeeeechheeeeeh--HHHHHHHhcchHHHHH-HHHHHHHHHHHH Q lcl|Aclame:pro 160 GSRVYEKWTDVTPLTVMD--AEDGKIPDLDN-PQLTIIKYLIKRYAGII--TATNTSLKDTAENILA-WLSSWIAKKVVV 233 (408) Q Consensus 160 g~~~~~~~~~~~~~~~~~--~E~~~~~~~~~-~~f~~v~~~~~~~~~~~--~iS~ell~ds~~~~~~-~v~~~l~~~~~~ 233 (408) ..+++......-...... ....-..+... +..+..++..-.+...- .+.. +.+....-.. -+.+.+...++. T Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~~v~~~~~k~~~~~~--is~e~l~ds~~~l~~~i~~~l~~ 235 (397) T protein:vir:12 158 TVEPVTTRSGTRLLEKNADMVPFSPVEELGNLPEIDQPRFTKVSYSIIDYGGIMT--LSNSMLNDSDQAIMTYVAKWFAK 235 (397) T ss_pred ceeeccCCceeEEEEEecCCcceeeecccccccccccccceeEEeeheeeEeeeh--hhHHHHhhchHHHHHHHHHHHHH Confidence 122221111110000000 11111111111 11111222222222211 1111 2222222222 255566666666 Q ss_pred HHHHHHhhccccccchhhhhhHHHHHHHHHHhhhhhccCCCEEEEcHHHH-HHH-Hhhhcc---cCceeeccccccCC-- Q lcl|Aclame:pro 234 TRNQAIIEVMKAAPKKPTIAKFDDVITMINTAVDPAIIATSSLLTNQSGL-NKL-ALVKTA---EGKYLLEPDPTKPN-- 306 (408) Q Consensus 234 ~~~~~~~~g~g~~~~~~~~~~~d~i~~~~~~~l~~~~~~~a~~~~n~~~~-~~l-~~lkd~---~G~~~~~~~~~~~~-- 306 (408) ++..++-...-.+......... .+-+.+ ..+ ..++.. 
+..++..+.....- T Consensus 236 ~~~~~~d~~il~G~g~~~~~g~----------------------~~~~~i~~~~~~~l~~~~~~~a~~~~n~~~~~~L~~ 293 (397) T protein:vir:12 236 KSVVTRNNLILAAIASLKKVDI----------------------DGLDGIKKALNVTLDPMVAPGSIVLTNQDGYDWLDT 293 (397) T ss_pred HHHHHHHHHHHhcccccccccc----------------------ccHHHHHHHHhhccchhhhCCCEEEEcHHHHHHHHH Confidence 6665554333333222211111 111111 112 122211 11222222110000 Q ss_pred cccccccceEeeccccccccccCcceEEEEehh----cc----------eEeeeccceEEEEeccchhhhhhceeeEEEE Q lcl|Aclame:pro 307 SYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMS----QA----------ITLFDRENMSLLPTNIGAGAFETDTTKIRVI 372 (408) Q Consensus 307 ~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~----~~----------~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~ 372 (408) -..=.|.|++.. + ...+...-++|-.- +. +++.+++.+.+..+ .....+... T Consensus 294 lkd~~G~~l~~~-~-----~~~g~~~~l~G~pv~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~--------~~~~~i~~~ 359 (397) T protein:vir:12 294 LKDGTGRYLLQP-D-----PTNPTKKLLDGRPVVPFTNRVLKTQKGKAPLIIGNLKEAIVLFD--------REQQSIAST 359 (397) T ss_pred hhccCCceeecc-c-----ccCCCCccccceeeEEecccccccCCCccEEEEEehhceEEEEe--------ecceEEEEe Confidence 000135665421 1 01111111222210 00 11111111111000 001111111 Q ss_pred eeeCc-EEecc---cceEEEEeeccccCCCCccCCCcc Q lcl|Aclame:pro 373 DRFDV-KATDS---EALVAGSFSAIADQVGNFKTTTST 406 (408) Q Consensus 373 ~r~d~-~v~~~---~a~~~l~~~~~~~~~~~~~~~~~~ 406 (408) ...+. 
...+- .++..+.+...-|..--..+.|++ T Consensus 360 ~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~t~~ 397 (397) T protein:vir:12 360 DTGAGAFETNSTKVRGIEREDVRKWDEDAVVFGQITVE 397 (397) T ss_pred ccccchhhcCceEEEEEEeeccEEecccceEEEEEeeC Confidence 00000 00011 133333444444444444555555 No 243 >protein:vir:103886 Length: 302 # NCBI annotation: putative major head subunit protein # Family: family:all:776 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938242;genbank:gi:38229147;genbank:GeneID:2648201 Probab=28.39 E-value=1.8 Score=19.25 Aligned_cols=257 Identities=9% Similarity=0.050 Sum_probs=107.7 Q ss_pred hhccccccCceecchhhhhhhhhhhhhh-hhhhhhhceeecccCccceEEeeccCCccccchhcccccccccccccceee Q lcl|Aclame:pro 116 ETSGSDSAAGLTIPQDIRTMINTLVRQY-DSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTII 194 (408) Q Consensus 116 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~-~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v 194 (408) |..+... - -++-..+...+....... .....+|++++-.....++.+. ...-...-|.+| ++- ....=..- T Consensus 1 m~it~~~-l-~~l~~~~~~~~~~~y~~a~~~~~~~a~~~~sdf~~~~~~~l--g~~p~l~e~~Ge---~~~-~~l~~~~~ 72 (302) T protein:vir:10 1 MLINKQS-L-NAAFVAIKTIFNNAFAAAPTTWQKIAMEVPSNTSSNDYKWL--STFPKMRRWIGA---KVV-KNLKAYKY 72 (302) T ss_pred CcccHHH-H-HHHHHHHHHHHHHHHHhhhhhhhceeeecCCCcceeeceec--CCCCCccccccc---eee-ccccccce Confidence 1111000 0 000111112222222211 1234555555432222333322 221112244444 321 11222457 Q ss_pred eechheeeeehHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhccccc--------cc------h----------- Q lcl|Aclame:pro 195 KYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVMKAA--------PK------K----------- 249 (408) Q Consensus 195 ~~~~~~~~~~~~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~~~g~g~~--------~~------~----------- 249 (408) ++..++++..+.||++.+.|-..++.+-+.+.+.++.++.+++.++.-...+ .+ + T Consensus 73 ~i~~~~~g~~v~i~R~~i~nDdlg~~~~~~~~~G~aaa~~~~~lv~~~L~~g~~~~~~DG~~fF~~dH~~g~~~~~N~g~ 152 (302) T protein:vir:10 73 VVENEDFEATVEVDRNDIEDDQIGIYSPQAKMAGYSAAQLPDELVYEAVNGAFTKPCFDGQYFIDTDHPVGDASVSNKGT 152 (302) T ss_pred 
eEEeecccceecccHHhhcccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCCcccCCcceecccccccccccccccc Confidence 7999999999999999999878888888999999999999988765432211 00 0 Q ss_pred ---------hhhhhHHHHHHHHHHh----hhhhccCCCEEEEcHHHHHHHHhh-hcccCceeeccccccCCccccccc-c Q lcl|Aclame:pro 250 ---------PTIAKFDDVITMINTA----VDPAIIATSSLLTNQSGLNKLALV-KTAEGKYLLEPDPTKPNSYLIKGK-Q 314 (408) Q Consensus 250 ---------~~~~~~d~i~~~~~~~----l~~~~~~~a~~~~n~~~~~~l~~l-kd~~G~~~~~~~~~~~~~~~l~G~-p 314 (408) ....++.+...+|... -.+-...+..+|++|+....-+.+ .+. ++. ++...-+.|. . T Consensus 153 ~~~~~~~~~l~~~~~~aa~~am~~~k~~~G~~L~i~P~~LiVp~~le~~A~~ll~~~--~~~------~g~~Np~~g~~~ 224 (302) T protein:vir:10 153 APLSNASQAAAKAGYGAARTAMKKFKDEEGRSLNVSPNVLLVGPALEDVAKMLLTNP--KLA------DNTPNPYVGTAE 224 (302) T ss_pred hhhhhcccccchHHHHHHHHHHHHHhhhcccccccCCCEEEecchhHHHHHHHhhcc--ccC------CCCcceeccceE Confidence 0011122222222211 111222344677777665554443 221 111 1111122232 2 Q ss_pred eEeeccccccccccCcceEEEEehhcceE--eeeccceEEEEeccchhhhhhceeeEEEEeeeCcEEecccceE--EEEe Q lcl|Aclame:pro 315 VIVVADRWLPNTGSTVYPLYYGDMSQAIT--LFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALV--AGSF 390 (408) Q Consensus 315 v~~~~~~~~~~~~~~~~~~~~gd~~~~~~--~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~--~l~~ 390 (408) +++ +..+.+ +..=.++.|.+..-. +--+++.++... ..|..+.+-++....+|+.-+-..+|. 
.+-+ T Consensus 225 ~vv--~p~L~s---~~aWyL~a~~~~i~~~~l~g~~~P~~~~~----~~~~~dgv~~k~~~d~Gvd~R~~~G~~~wq~a~ 295 (302) T protein:vir:10 225 LVV--DGRIES---DTAWFLLDTTKPVKPFIFQPRKQPEFVSQ----VNLDSDDVFNLRKLKFGAEARAAAGYGFWQLAY 295 (302) T ss_pred EEE--eeccCC---CCceEEEecCCccceEEEcCccccEEEec----cCCCCCceEEEEEEEEeeeeeeecchhhhhhhh Confidence 222 222322 222344445432111 112334444432 236677777777666664222211111 0111 Q ss_pred eccccCC Q lcl|Aclame:pro 391 SAIADQV 397 (408) Q Consensus 391 ~~~~~~~ 397 (408) ....+++ T Consensus 296 ~s~g~~~ 302 (302) T protein:vir:10 296 GSTGTGA 302 (302) T ss_pred ccCccCC Confidence 1111111 No 244 >protein:vir:97053 Length: 390 # NCBI annotation: putative head protein # Family: family:all:585 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453565;genbank:gi:84662600;genbank:GeneID:5142468 Probab=27.45 E-value=1.8 Score=19.13 Aligned_cols=373 Identities=8% Similarity=-0.070 Sum_probs=109.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccch Q lcl|Aclame:pro 7 VNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKGPLNKSE 86 (408) Q Consensus 7 i~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 86 (408) |.++++++.+ +++++.++++.+.+......+...+.+++++.++.+++.++.++++++................... 
T Consensus 1 m~~~~~~l~~---~~~~~~~~~~~~~e~~~~~~~~~~e~~~~~~~~~~e~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~ 77 (390) T protein:vir:97 1 MTDITAKLEA---TLANVTDSLKAFGERAVRDGELNASARSKVDELFATVGNLSAEVQAARQRVAELEGNGAGGDVQHVS 77 (390) T ss_pred ChHHHHHHHH---HHHHHHHHHHHHHHHHHhhcCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccc Confidence 6666554444 4555556666655554444444566777777888888888888777665544333322222221111 Q ss_pred hhhHHHHHHHHHHHhhcchhhHHHHHHHHh--hccccccCceecchhhhhhhhhhhhhhhhhhhhhceeecccCccceEE Q lcl|Aclame:pro 87 NELKDKFVKDFVNMVRNPMAFMNTVSSKTE--TSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVY 164 (408) Q Consensus 87 ~~~~~~~~~a~~~~~~~~~~~~~~~~~~a~--~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~g~~~~ 164 (408) ..........+..++............... .....+++...-...+..+++..+.+. ++. ..++...-..+++ T Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~lip~~~~~~ii~~--~~~---~~~i~~~~~~~~~ 152 (390) T protein:vir:97 78 VGDMFVASEQFQASTGRWNDRSARATMNIKAALNTASTDAAGSAGALTTPNRLPGFITP--PDA---RLTVRDLIGSGRT 152 (390) T ss_pred chhhhhhhHHHHHHHHHhhhhhhhhhhHHHHHHHhhhcccccccccccchhhhHHHHHH--Hhh---hhhhHhhcceeec Confidence 111112222333333333222222111111 111111111101111222222222111 111 1112111111111 Q ss_pred eeccCCccccchhc---ccccccccccccceeeeechheeeeeh--HHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 165 EKWTDVTPLTVMDA---EDGKIPDLDNPQLTIIKYLIKRYAGII--TATNTSLKDTAENILAWLSSWIAKKVVVTRNQAI 239 (408) Q Consensus 165 ~~~~~~~~~~~~~~---E~~~~~~~~~~~f~~v~~~~~~~~~~~--~iS~ell~ds~~~~~~~v~~~l~~~~~~~~~~~~ 239 (408) ....-....+.+ ...-..+.....-...++....+...- .+.. 
+.+....-...+...+...++.++..++ T Consensus 153 --~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~--is~ell~ds~~l~~~i~~~la~a~~~~~ 228 (390) T protein:vir:97 153 --DSALIEYVQETGFVNNAAIVAEGALKPESSLKFAKKTDTTHVIAHTMK--ATRQILSDAPQLASYMNNRLIRGLKVKE 228 (390) T ss_pred --cCCceEEEEEecCCcceeeecCCccccccccceeEEEEeeeeEEEeeh--hhHHHHHhHHHHHHHHHHHHHHHHHHHH Confidence 111101111111 111112211111123344333333321 1211 1111222223455556666666666665 Q ss_pred hhccccccchhhhhhHHHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHhhhcccCc---eeeccccccC--Ccccccccc Q lcl|Aclame:pro 240 IEVMKAAPKKPTIAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEGK---YLLEPDPTKP--NSYLIKGKQ 314 (408) Q Consensus 240 ~~g~g~~~~~~~~~~~d~i~~~~~~~l~~~~~~~a~~~~n~~~~~~l~~lkd~~G~---~~~~~~~~~~--~~~~l~G~p 314 (408) -...=.+... ......|+..... .......+....-......+..++..... ++..+..... .-..=.|.| T Consensus 229 d~a~l~G~g~--~~~p~Gi~~~~~~--~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~ 304 (390) T protein:vir:97 229 DAEILRGTGA--NDGLLGLIPQATT--YAAPTTIAGATRVDQLRLAMLQASLAEYPASGIVINPIDWAAIELAKDANNQY 304 (390) T ss_pred HHHHhhcCCC--Cccccceeecccc--ccccccccccchHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhcCCCce Confidence 5433333221 1122333221100 00000000011111222333444433221 2222210000 000112455 Q ss_pred eEeeccccccccccCcceEEEEeh--hcceEeeeccceEEEEeccchhhhhhceeeEEEEeeeCc---EEecccceEEEE Q lcl|Aclame:pro 315 VIVVADRWLPNTGSTVYPLYYGDM--SQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDV---KATDSEALVAGS 389 (408) Q Consensus 315 v~~~~~~~~~~~~~~~~~~~~gd~--~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~---~v~~~~a~~~l~ 389 (408) ++..+....+....| .++++-|. ..-+++.+.....+- +....+.+......+. ..+.-.+...+. 
T Consensus 305 l~~~~~~~~~~~l~G-~pV~~~~~~~~~~~~~gd~~~~~~~--------~~~~~~~i~~~~~~~~f~~~~~~~r~~~r~d 375 (390) T protein:vir:97 305 LIGNARGTLTPTLWG-LPVVATQAMAPGEFLVGAFDLAAQI--------FDQWDARVEIGYVNDDFQRNMVTVLAEERLA 375 (390) T ss_pred eecCccCCCCceecc-eeeEEcCCCCCCcEEEEeccceEEE--------EEecceEEEEeecccccccCcEEEEEEEeec Confidence 432110000000001 12222222 001111111110000 0011111111000000 000001111222 Q ss_pred eeccccCCCCccCCC Q lcl|Aclame:pro 390 FSAIADQVGNFKTTT 404 (408) Q Consensus 390 ~~~~~~~~~~~~~~~ 404 (408) +...-|..--..+.+ T Consensus 376 ~~v~~~~a~v~~~~a 390 (390) T protein:vir:97 376 LVVYRPEALITGSFA 390 (390) T ss_pred cEEeccccEEEEEeC Confidence 222222222222222 No 245 >protein:vir:107947 Length: 519 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595301;genbank:gi:161622607;genbank:GeneID:5783666 Probab=26.94 E-value=1.9 Score=19.06 Aligned_cols=338 Identities=15% Similarity=0.134 Sum_probs=117.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccc Q lcl|Aclame:pro 5 LTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMS-ELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKGPLN 83 (408) Q Consensus 5 ~~i~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 83 (408) |..++|.+++..+.+. +. .-++. .-|+.+ .+++ ++++.+++... +. 
T Consensus 1 ~~~~~l~~kw~p~l~~--------------~~--~~~i~~~~~~~i---~~~~--~en~~~~~~~~------~~------ 47 (519) T protein:vir:10 1 MKKNALVQKWSALLEN--------------EA--LPEIVGASKQAI---IAKI--FENQEQDILTA------PE------ 47 (519) T ss_pred CchhHHHHHhHHhhcc--------------cc--cchhhhhhhHHH---HHHH--HHHHHHHhhhc------cc------ Confidence 5566677766664331 00 00111 111111 1111 01111111000 00 Q ss_pred cchhhhHHHHHHHHHHHhhc-----chhhHHHHHHHHhhccccccCceecchhhhhhhhhhh---hhhhhhhhhhceeec Q lcl|Aclame:pro 84 KSENELKDKFVKDFVNMVRN-----PMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLV---RQYDSLQQYVRVESV 155 (408) Q Consensus 84 ~~~~~~~~~~~~a~~~~~~~-----~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~---~~~~~l~~~~~~~~~ 155 (408) ...+....+|.+++-. +........ ..++.+ |++ .++.+.++..+ -+.....+++.+.|| T Consensus 48 ----~~~~~~~~~~~~~l~e~~~~~~~~~~~t~i----~~~~~t-~~v---~~~~P~l~~l~rRa~p~LIa~DIwGVQPM 115 (519) T protein:vir:10 48 ----YRDEKISEAFGSFLTEAEIGGDHGYDATNI----AAGQTS-GAV---TQIGPAVMGMVRRAIPHLIAFDICGVQPL 115 (519) T ss_pred ----ccchHHHHHHhhhcchhccCCccccCcccc----cccccc-ccc---cccchhHHHHHHHHHHhhhhhhhheeecC Confidence 0001122233333211 111100000 001111 111 12333334444 233344667777777 Q ss_pred ccCccceE-----EeeccCC-----------ccccchh------------------------------------------ Q lcl|Aclame:pro 156 STSNGSRV-----YEKWTDV-----------TPLTVMD------------------------------------------ 177 (408) Q Consensus 156 ~~~~g~~~-----~~~~~~~-----------~~~~~~~------------------------------------------ 177 (408) ++++|-+- +...... 
.+...|- T Consensus 116 TgPTGLIFAMRsrY~n~~~~~~g~ea~~~~nEadt~fSG~~~~~~~~~~~~~~~~~~g~~~~~~~~~s~~~~~~~~~~~t 195 (519) T protein:vir:10 116 NNPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPNAMFSGQGAAETFEALAASKVLEVGKIYSHFFEATGSAHFQAVEAVT 195 (519) T ss_pred CchhhhhheeeeeecCCccccccccccccccccccccCccccccccccccccccccccccccccccccccceeccccccc Confidence 77766322 1110000 0000000 Q ss_pred ---------------------------ccccc---------ccccccccceeeeechheeee-------ehHHHHHHHhc Q lcl|Aclame:pro 178 ---------------------------AEDGK---------IPDLDNPQLTIIKYLIKRYAG-------IITATNTSLKD 214 (408) Q Consensus 178 ---------------------------~E~~~---------~~~~~~~~f~~v~~~~~~~~~-------~~~iS~ell~d 214 (408) +++.. -...+...|.++.|+..|..+ ...+|-||.+| T Consensus 196 ~~ag~t~~~~~~~a~~~~~~~~~~~~~~~gmsTa~aEal~~lggss~~~f~EMaFsIeKvTVtAKSRaLKAEYTiELAQD 275 (519) T protein:vir:10 196 VDAGATDAAKLDAAVTALVEAGQLAEIAEGMATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQD 275 (519) T ss_pred cCCCCcCccccccccccccccccccccccccccchhhccccCCCccccchhhhceeEEEEEEeeecccccccccHHHHHH Confidence 01100 000011124445555555443 35689999888 Q ss_pred ----chHHHHHHHHHHHHHHHHHHHHHHHhhccc--------ccc----chhhhhhHHH----------------HHHHH Q lcl|Aclame:pro 215 ----TAENILAWLSSWIAKKVVVTRNQAIIEVMK--------AAP----KKPTIAKFDD----------------VITMI 262 (408) Q Consensus 215 ----s~~~~~~~v~~~l~~~~~~~~~~~~~~g~g--------~~~----~~~~~~~~d~----------------i~~~~ 262 (408) ...|.++.|.+-|+..|...+|+.|+.-.. 
+.+ ...+..++++ +...+ T Consensus 276 LKAVHGLDAEtELaNILSTEImlEINReii~~i~~sa~~~~~g~t~~~~~~aGv~d~~~~~d~~~~rw~~e~~k~L~~~i 355 (519) T protein:vir:10 276 LRAVHGMDADAELSGILATEIMLEINREVIDWINYSAQVGKSGMTNTVGAKAGVFDFQDPIDIRGARWAGESFKALLFQI 355 (519) T ss_pred HHHhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhhhhcceeecccCcccccceeecccccccccchHHHHHHHHHHHHH Confidence 346889999999999999999999985221 011 1112222111 11111 Q ss_pred ----HHh-hhhhccCCCEEEEcHHHHHHHHhhh-----cccC-ceeeccccccC-Cccccc-ccceEeeccccccccccC Q lcl|Aclame:pro 263 ----NTA-VDPAIIATSSLLTNQSGLNKLALVK-----TAEG-KYLLEPDPTKP-NSYLIK-GKQVIVVADRWLPNTGST 329 (408) Q Consensus 263 ----~~~-l~~~~~~~a~~~~n~~~~~~l~~lk-----d~~G-~~~~~~~~~~~-~~~~l~-G~pv~~~~~~~~~~~~~~ 329 (408) +.. ..+.+...-.+|+++.....|...- .+.| +..+..+.+.. .-+.|. ||+|++.++ .+ T Consensus 356 ~~~an~I~~~T~r~~gn~ii~S~~Va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y--~~----- 428 (519) T protein:vir:10 356 DKEAAEIARQTGRGAGNFIIASRNVVNVLAAVDTSVSYAAQGLGQGFNVDTTKAVFAGVLGGKYRVYIDQY--AR----- 428 (519) T ss_pred HHHHHHHHHhhccccccEEEEchHHHHHHhhccchhccccccccccccccCCCceEEEEecCceEEEecCC--CC----- Confidence 111 1233444457899999888876542 0111 11111111111 012343 567776322 22 Q ss_pred cceEEEEehhcceEeeeccceE----EEEeccchhhh------hhceeeEEEEeeeCcEEecccceEEEEeeccccC--C Q lcl|Aclame:pro 330 VYPLYYGDMSQAITLFDRENMS----LLPTNIGAGAF------ETDTTKIRVIDRFDVKATDSEALVAGSFSAIADQ--V 397 (408) Q Consensus 330 ~~~~~~gd~~~~~~~~~~~~~~----i~~~~~~~~~f------~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~~--~ 397 (408) .+-+++|- ++..+ +=..|+..-.+ .+-+-.+-+..|++.. .+| |.- ...-++. . 
T Consensus 429 ~dy~~vG~---------KG~~~~~~glfyaPYv~l~~~~~~dp~sfqP~~g~~tRY~l~-~NP--~~~--~~~~~~~~~i 494 (519) T protein:vir:10 429 SDYFTIGY---------KGSNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGIG-INP--FAD--PAAQAPTKRI 494 (519) T ss_pred cceEEEEE---------ecCcccccceeeccccccccccccCCccccceeeeeeeecee-ecC--ccc--ccccCcccee Confidence 12233331 11111 11111111000 0112222233333332 122 110 0000000 0 Q ss_pred CC----ccCCCcccC Q lcl|Aclame:pro 398 GN----FKTTTSTAV 408 (408) Q Consensus 398 ~~----~~~~~~~~~ 408 (408) .+ -..+.-.++ T Consensus 495 ~~g~~~~a~~~~~n~ 509 (519) T protein:vir:10 495 QNGMPDIVNSLGLNG 509 (519) T ss_pred ccCchhhhccccCce Confidence 00 011111222 No 246 >protein:vir:7409 Length: 408 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839926;genbank:gi:30089896;genbank:GeneID:1260683 Probab=25.31 E-value=2.1 Score=18.85 Aligned_cols=356 Identities=12% Similarity=0.039 Sum_probs=87.5 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HhhhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccc Q lcl|Aclame:pro 1 MGVKLTVNQLNEAWIASGDKVTDFNDQINMA---LNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREE 77 (408) Q Consensus 1 M~~~~~i~el~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (408) |++++..+++.+..+.+++..+++.++.+.. .++......++.+++++++.+..++++.+................. 
T Consensus 5 m~i~el~~~~~~~~~~~~~~~~e~~~~~~~~~~~~e~i~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 84 (408) T protein:vir:74 5 LTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKGPLNK 84 (408) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccc Confidence 9998765665555555444444444433221 2222222334555566666666555544332222111111110000 Q ss_pred cccccccchhhhHHHHHHHHHHHhhcchhhHHHHHHHHhh-cc--ccccCceecchhhhhh------hhhhhhhhhhhhh Q lcl|Aclame:pro 78 EKGPLNKSENELKDKFVKDFVNMVRNPMAFMNTVSSKTET-SG--SDSAAGLTIPQDIRTM------INTLVRQYDSLQQ 148 (408) Q Consensus 78 ~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~a~~-~~--t~~~gg~~vP~~~~~~------ii~~~~~~~~l~~ 148 (408) ...............+.+.+......... ......... .+ .+. .+...+... |...+. .-++.. T Consensus 85 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~a~~~~~~~~gg~~vP~----~~~~~Ii~~~~~~~~l~~~~~-~~~~~~ 157 (408) T protein:vir:74 85 SENELKDKFVKDFVNMVRNPMAFLNTVSS--KTETSGSDSAAGLTIPQ----DIRTMINTLVRQYDSLQQYVR-VESVST 157 (408) T ss_pred hhhhhHHHHHHHHHHHHhcchhhhhhhhh--hhhcccccCCCceeech----hHhhHHHHHHhhhcchhhhcc-eeeccC Confidence 00011111111111111111111111000 000000000 00 010 111121111 111111 111111 Q ss_pred hhceeecc--cCccceEEeeccCCccccchhcccccccccccccceeeeechhe-eee-ehHHHHHHHhcchHHHHHHHH Q lcl|Aclame:pro 149 YVRVESVS--TSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKR-YAG-IITATNTSLKDTAENILAWLS 224 (408) Q Consensus 149 ~~~~~~~~--~~~g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~-~~~-~~~iS~ell~ds~~~~~~~v~ 224 (408) ....+++. ...+..... ...+ .-..|... +..+..++...++...- +.- ++.-|..-+.. -+.+.|. 
T Consensus 158 ~~~~~~~~~~~~~~~~~~~-v~E~----~~~~~~~~-~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~---~i~~~l~ 228 (408) T protein:vir:74 158 SSGSRVYEKWTDVTPLKAM-DEED----GKIPDLDN-PRLTIIKYLIKRYAGIITATNTLLKDTAENILA---WLSSWIA 228 (408) T ss_pred CcceEEEEeecCCcccccc-cccc----cccccccc-cceeeEEeeeeeEEeeehhHHHHHhhchHHHHH---HHHHHHH Confidence 11111111 111111100 0101 11122221 12232333322221111 110 11122211111 1444555 Q ss_pred HHHHHHHHHHHHHHHhhccccccchhhhhhHH---------------------HHHHHHHHhhhhhccCCCEEEEcHHHH Q lcl|Aclame:pro 225 SWIAKKVVVTRNQAIIEVMKAAPKKPTIAKFD---------------------DVITMINTAVDPAIIATSSLLTNQSGL 283 (408) Q Consensus 225 ~~l~~~~~~~~~~~~~~g~g~~~~~~~~~~~d---------------------~i~~~~~~~l~~~~~~~a~~~~n~~~~ 283 (408) +.++..+..++=..--++...++.. +..... .....+.. +. ..+..++..++.. T Consensus 229 ~~~~~~~d~~il~G~G~~~~~~~~~-~~~~i~~~~~~~l~~~~~~~a~~v~n~~~~~~l~~-lk---d~~G~~l~~~~~~ 303 (408) T protein:vir:74 229 KKVVVTRNQAIIAAMGTVPKKPTIA-NFDDVITMINTSVDPAIIATSSLLTNQSGLNKLAL-VK---TAEGKYLLEPDPT 303 (408) T ss_pred HHHHHHHHHHHhhcccccccccccc-cHHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHH-hh---cCCCceEeccCcC Confidence 5555555555422211121111110 000000 01111111 21 1223344433221 Q ss_pred HHH-HhhhcccCceeeccc-cccCCcccccc-cceEeeccccccccccCcceEEEEehhcceEeeec--------cceEE Q lcl|Aclame:pro 284 NKL-ALVKTAEGKYLLEPD-PTKPNSYLIKG-KQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDR--------ENMSL 352 (408) Q Consensus 284 ~~l-~~lkd~~G~~~~~~~-~~~~~~~~l~G-~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~--------~~~~i 352 (408) ..- ..| .|.|++..+ ..-+... .+ .++++-+ . 
+..+.++|.+..-+.+++ ....+ T Consensus 304 ~~~~~~l---~G~pV~~~~~~~~~~~~--~~~~~i~~gd------~---~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~ 369 (408) T protein:vir:74 304 KPNSYLI---KGKQVIVVADRWLPNSG--STVYPLYYGD------M---SQAITLFDRENMSLLPTNIGAGAFETDTTKI 369 (408) T ss_pred CCCCcee---cceeeEEecCccccccc--CCcceEEEEe------h---hccEEEEEecceEEEEeccccchhhcceeeE Confidence 000 011 355543211 0000000 00 1122111 0 011223332221111111 11111 Q ss_pred EEeccchhhhhhceeeEEEEeeeCcEEecccceEEEEeeccccCCCCccCCCc Q lcl|Aclame:pro 353 LPTNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFSAIADQVGNFKTTTS 405 (408) Q Consensus 353 ~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~~~~~~~~ 405 (408) ...-. +.+....--.+.++. +-.+++..+..+++++++- T Consensus 370 r~~~r---------~d~~~~~~~a~~~~~-----~~~~~~~~~~~~~~~~~~~ 408 (408) T protein:vir:74 370 RVIDR---------FDVKATDSEALVAGS-----FTAIADQVGNFKTTTSTAV 408 (408) T ss_pred EEEEe---------eCcEEecccceEEEE-----eecccCCCCCCCCCccccC Confidence 11000 000000001111221 1122333333333444444 No 247 >protein:vir:1025 Length: 408 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076679;genbank:gi:13095788;genbank:GeneID:920362 Probab=24.31 E-value=2.2 Score=18.71 Aligned_cols=365 Identities=11% Similarity=-0.003 Sum_probs=96.7 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHhhhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccc Q lcl|Aclame:pro 1 MGVKLTVNQLNEAWIASGDKVTDFNDQINM---ALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREE 77 (408) Q Consensus 1 M~~~~~i~el~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (408) |++++..+++.+...++.+..+++.+++.. ..++.+...+++.++..+++++..++++.+................. 
T Consensus 5 m~l~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 84 (408) T protein:vir:10 5 LTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKGPLNK 84 (408) T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccc Confidence 999876666655555554444444433321 11222223344555566666666666555443222111111111111 Q ss_pred cccccccchhhhHHHHHHHHHHHhhcchhhHHHHHHHHhhc---ccccc-Cceecchhhhh-hhhhhhhhhhhhhhhhce Q lcl|Aclame:pro 78 EKGPLNKSENELKDKFVKDFVNMVRNPMAFMNTVSSKTETS---GSDSA-AGLTIPQDIRT-MINTLVRQYDSLQQYVRV 152 (408) Q Consensus 78 ~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~a~~~---~t~~~-gg~~vP~~~~~-~ii~~~~~~~~l~~~~~~ 152 (408) ...............+.+.......... ..+........ ..+.. ...+|..-... .|...+.. -++...... T Consensus 85 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~a~~~~t~~~gg~~vP~~~~~~Ii~~~~~~~~l~~~~~~-~~~~~~~~~ 161 (408) T protein:vir:10 85 SENELKDKFVKDFVNMVRNPMAFMNTVS--SKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRV-ESVSTSNGS 161 (408) T ss_pred chhhhHHHHHHHHHHHhhcchhhhhhhh--hhhhhcccccCCceeccHhHHHHHHHHHHhhchhhhhcce-eeccCCcce Confidence 1111111111111111111111110000 00000000000 00100 00011111111 11111111 111111111 Q ss_pred eecc---cCccceEEeeccCCccccchhcccccccccccccceeeeechhe-eee-ehHHHHHHHhcchHHHHHHHHHHH Q lcl|Aclame:pro 153 ESVS---TSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKR-YAG-IITATNTSLKDTAENILAWLSSWI 227 (408) Q Consensus 153 ~~~~---~~~g~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~v~~~~~~-~~~-~~~iS~ell~ds~~~~~~~v~~~l 227 (408) +++. +.++...+... + ....|.. .+..+..+|....+...- +.. ++.-|..-+. 
.+ +...|.+.+ T Consensus 162 ~~~~~~~~~~~~a~~v~E--~----~~~~~~~-~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~--~~-i~~~l~~~~ 231 (408) T protein:vir:10 162 RVYEKWTDVTPLTVMDAE--D----GKIPDLD-NPQLTIIKYLIKRYAGIITATNTSLKDTAENIL--AW-LSSWIAKKV 231 (408) T ss_pred EEEeeccccccceeeecC--c----ccccccc-CcceeeEEeeeeeEEeeehhHHHHHhhchHHHH--HH-HHHHHHHHH Confidence 1211 11122111111 1 1122211 122333344433322111 110 1111111111 11 455566666 Q ss_pred HHHHHHHHHHHHhhccccccchhhhhhHHHHHHHH----------------HHhhhhhccCCCEEEEcHHHHH-HHHhhh Q lcl|Aclame:pro 228 AKKVVVTRNQAIIEVMKAAPKKPTIAKFDDVITMI----------------NTAVDPAIIATSSLLTNQSGLN-KLALVK 290 (408) Q Consensus 228 ~~~~~~~~~~~~~~g~g~~~~~~~~~~~d~i~~~~----------------~~~l~~~~~~~a~~~~n~~~~~-~l~~lk 290 (408) +..+..++-...-.|....+........+.+...+ ...|...-..+...+..+..-. .-..| T Consensus 232 ~~~~~~~il~g~g~~~~~~~~~~~~~l~~~~~~~~~~~~~~~a~~v~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~l- 310 (408) T protein:vir:10 232 VVTRNQAIIEVMKAAPKKPTIAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNSYLI- 310 (408) T ss_pred HHHHHHHHhhcccccccccccccHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCceEeccCcCCCCCcee- Confidence 66666665443333333221111111111110000 0001111112223333322100 00011 Q ss_pred cccCceeeccc-cccCCcccccccceEeeccccccccccCcceEEEEehhcceEeeecc--------ceEEEEeccchhh Q lcl|Aclame:pro 291 TAEGKYLLEPD-PTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRE--------NMSLLPTNIGAGA 361 (408) Q Consensus 291 d~~G~~~~~~~-~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~--------~~~i~~~~~~~~~ 361 (408) .|.|+...+ ...+.. .-.-.++++-+ . +..+.++|....-+..+.. ...+... 
T Consensus 311 --~G~PV~~~~~~~~~~~-~~~~~~i~~gd------~---~~~~~~~~~~~~~v~~~~~~~~~f~~~~~~~r~~------ 372 (408) T protein:vir:10 311 --KGKQVIVVADRWLPNT-GSTVYPLYYGD------M---SQAITLFDRENMSLLPTNIGAGAFETDTTKIRVI------ 372 (408) T ss_pred --cceeeEEecccccCcc-CCCceEEEEEe------h---hccEEEEEecceEEEEcccccchhhcCceEEEEE------ Confidence 466554311 000100 00112222211 0 0112344433221111111 0111110 Q ss_pred hhhceeeEEEEeeeCcEEecccceEEEEeeccccCCCCccCCCc Q lcl|Aclame:pro 362 FETDTTKIRVIDRFDVKATDSEALVAGSFSAIADQVGNFKTTTS 405 (408) Q Consensus 362 f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~~~~~~~~ 405 (408) ..+.+....--.+..+. +-..++..+..+++.+++- T Consensus 373 ---~r~d~~v~~~~a~~~~~-----~~~~~~~~~~~~~~~~~~~ 408 (408) T protein:vir:10 373 ---DRFDVKATDSEALVAGS-----FSAIADQVGNFKTTTSTAV 408 (408) T ss_pred ---EeeccEEeccccEEEEE-----eeccccCCCCCCCCCcccC Confidence 00001111111122222 2222234444444444444 No 248 >protein:vir:108295 Length: 711 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552284;genbank:gi:160700609;genbank:GeneID:5758811 Probab=21.98 E-value=2.5 Score=18.39 Aligned_cols=92 Identities=8% Similarity=0.045 Sum_probs=7.7 Q ss_pred CChHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHhhhccc Q lcl|Aclame:pro 1 MGVKL--TVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRD--NEKVRRDALREQLVEAQAEQVVNMRE 76 (408) Q Consensus 1 M~~~~--~i~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~ 76 (408) |.-++ ...+.+.+..+...++.++......+ +......+...++...+ +...+...++..... .......... 
T Consensus 616 ~~~~~qq~~~e~qq~~~~~q~~~~~~q~~~~qa--~ae~~~Aqae~~qa~~e~~~~q~q~~~~~~~aq~-~~~~~qq~~~ 692 (711) T protein:vir:10 616 EREAIEEDMPEQTEPTPEQQVEMAKSQADMAQA--EADTAQAQADMLKAQLETEEAQKQLAMIEDMAQG-GDVVYQQVRE 692 (711) T ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHH Confidence 21111 00111111111111111111110000 00000000000111100 000000000000000 0000000000 Q ss_pred ccccccccchhhhHHHHHHHHHHHhhc Q lcl|Aclame:pro 77 EEKGPLNKSENELKDKFVKDFVNMVRN 103 (408) Q Consensus 77 ~~~~~~~~~~~~~~~~~~~a~~~~~~~ 103 (408) .. .....+....-.+..+. T Consensus 693 ~l--------~~~qaelq~~q~~~~q~ 711 (711) T protein:vir:10 693 LV--------AQALAEITASQANVTEQ 711 (711) T ss_pred HH--------HHHHHHHHHHHHHhhcC Confidence 00 00000000000000000 No 249 >protein:vir:7214 Length: 521 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049787;genbank:gi:9632597;genbank:GeneID:1258751 Probab=21.10 E-value=2.7 Score=18.26 Aligned_cols=332 Identities=13% Similarity=0.140 Sum_probs=116.9 Q ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccc Q lcl|Aclame:pro 1 MGVKLTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKG 80 (408) Q Consensus 1 M~~~~~i~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (408) |.++- .++|++++..+.+. +.. .++..-+..+- +++ ++.|.++++.. +. 
T Consensus 1 ~~~~~-~~~l~~kw~p~l~~--------------~~~--~~i~~~~~~~~---a~~--~enq~~~~~~~------~~--- 49 (521) T protein:vir:72 1 MTIKT-KAELLNKWKPLLEG--------------EGL--PEIANSKQAII---AKI--FENQEKDFQTA------PE--- 49 (521) T ss_pred CCcch-hHHHHHhhhhhhcc--------------CCC--Cccccchhhhh---hhh--hhhhhhhhhhc------cc--- Confidence 66652 23376666554222 110 11111111111 100 11111111000 00 Q ss_pred ccccchhhhHHHHHHHHHHHhhcc-----hhhHHHHHHHHhhccccccCceecchhhhhhhhhhhh---hhhhhhhhhce Q lcl|Aclame:pro 81 PLNKSENELKDKFVKDFVNMVRNP-----MAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVR---QYDSLQQYVRV 152 (408) Q Consensus 81 ~~~~~~~~~~~~~~~a~~~~~~~~-----~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~---~~~~l~~~~~~ 152 (408) ...+....+|.+++-.. .+..... ...++ ++|++. .+.+.++..+| +.....+++.+ T Consensus 50 -------~~~~~~~~~~~~~l~e~~~~~~~~~~~~~----iaes~-~t~~v~---~~~P~Li~lvRra~p~LIa~DIwGV 114 (521) T protein:vir:72 50 -------YKDEKIAQAFGSFLTEAEIGGDHGYNATN----IAAGQ-TSGAVT---QIGPAVMGMVRRAIPNLIAFDICGV 114 (521) T ss_pred -------ccchHHHHHHhhhhhhhcccCccccCccc----ccccc-cccccc---cCCchhhhHHHHHHhhhhhhhceee Confidence 00111222333332211 1110000 00111 111111 23334444444 33344677778 Q ss_pred eecccCccceEEeecc--CCc--------------cccchhc-------------------------------------- Q lcl|Aclame:pro 153 ESVSTSNGSRVYEKWT--DVT--------------PLTVMDA-------------------------------------- 178 (408) Q Consensus 153 ~~~~~~~g~~~~~~~~--~~~--------------~~~~~~~-------------------------------------- 178 (408) .||++++|-+--.+-. ... 
+...|.+ T Consensus 115 QPMTgPTGLIFAMRsrY~~q~~~~~g~ea~~~e~~~da~fSG~~~~~~~~~~~~~~~~a~Gd~~~~~~~~~gt~~~~~~~ 194 (521) T protein:vir:72 115 QPMNSPTGQVFALRAVYGKDPVAAGAKEAFHPMYGPDAMFSGQGAAKKFPALAASTQTTVGDIYTHFFQETGTVYLQASV 194 (521) T ss_pred ccCCchhhhheeeeeeecCCCCCcccccccchhccccccccccccccccccccccccccccccccccccccccccccccc Confidence 8877777632211100 000 0000000 Q ss_pred -------------------------------c------ccc---ccccccccceeeeechheeeee-------hHHHHHH Q lcl|Aclame:pro 179 -------------------------------E------DGK---IPDLDNPQLTIIKYLIKRYAGI-------ITATNTS 211 (408) Q Consensus 179 -------------------------------E------~~~---~~~~~~~~f~~v~~~~~~~~~~-------~~iS~el 211 (408) + .+. ....+...|.++.|+..|..+- ..+|-|| T Consensus 195 ~~~~~~g~t~~~~t~~~v~~~~~a~~~y~~g~gm~Ta~aEal~~~g~ss~~~f~EMaFsIeK~tVtAKSRaLKAEYTiEL 274 (521) T protein:vir:72 195 QVTIDAGATDAAKLDAEIKKQMEAGALVEIAEGMATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQLKAAYSIEL 274 (521) T ss_pred ccccCCCCCCccccccccccccccCceeeeecccchhhhhhhcccCCcccccccceeeEEEEEEEeeeccceeccccHHH Confidence 0 000 0001112355555555555543 5689999 Q ss_pred Hhc----chHHHHHHHHHHHHHHHHHHHHHHHhhcccc--------cc----chhhhhhHH----------------HHH Q lcl|Aclame:pro 212 LKD----TAENILAWLSSWIAKKVVVTRNQAIIEVMKA--------AP----KKPTIAKFD----------------DVI 259 (408) Q Consensus 212 l~d----s~~~~~~~v~~~l~~~~~~~~~~~~~~g~g~--------~~----~~~~~~~~d----------------~i~ 259 (408) .+| ...|.++.|.+-|+..|...+|+.|+.-... .+ ...+..+.+ .+. 
T Consensus 275 AQDLKAVHGLDAEtELaNILSTEImlEINReii~~i~~sa~~g~~g~t~~~~~~~G~~d~~~~~d~~~~~~~~e~~k~L~ 354 (521) T protein:vir:72 275 AQDLRAVHGMDADAELSGILATEIMLEINREVVDWINYSAQVGKSGMTLTPGSKAGVFDFQDPIDIRGARWAGESFKALL 354 (521) T ss_pred HHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHhhhhhheeeeeeeeeeeccCccccceecccccccccchHHHHHHHHHH Confidence 888 3468899999999999999999998842210 00 111222111 111 Q ss_pred HHH----HHhh-hhhccCCCEEEEcHHHHHHHHhhh-----cccC-ceeeccccccC-Ccccc-cccceEeecccccccc Q lcl|Aclame:pro 260 TMI----NTAV-DPAIIATSSLLTNQSGLNKLALVK-----TAEG-KYLLEPDPTKP-NSYLI-KGKQVIVVADRWLPNT 326 (408) Q Consensus 260 ~~~----~~~l-~~~~~~~a~~~~n~~~~~~l~~lk-----d~~G-~~~~~~~~~~~-~~~~l-~G~pv~~~~~~~~~~~ 326 (408) .-+ +... .+.-...-.+|+++.....|...- .++| .--|..+.+.. ..+.| .||+|++.++ . T Consensus 355 ~~i~~~an~i~~~T~r~~~n~~i~S~~Va~~L~~~~~~~~~~~~~~~~g~~~d~~~~~~~G~l~~~~~vy~D~y--~--- 429 (521) T protein:vir:72 355 FQIDKEAVEIARQTGRGEGNFIIASRNVVNVLASVDTGISYAAQGLATGFSTDTTKSVFAGVLGGKYRVYIDQY--A--- 429 (521) T ss_pred HHHHHHHHHHHHhcccccceEEEEchHHHHHHhhcccccccccccccccccccCCCceEEEEccCceEEEecCC--C--- Confidence 111 1111 122122335889999988887531 0111 11111111111 01233 3567766322 1 Q ss_pred ccCcceEEEEehhcceEeeeccceE----EEEeccchhhhhhceeeEEEEeeeCcEEecccceEEEE--eeccccCCCCc Q lcl|Aclame:pro 327 GSTVYPLYYGDMSQAITLFDRENMS----LLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGS--FSAIADQVGNF 400 (408) Q Consensus 327 ~~~~~~~~~gd~~~~~~~~~~~~~~----i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~--~~~~~~~~~~~ 400 (408) ..+-+++|- ++..+ +-..|+..-. .....||+.|--.. .+.++=.+..+ T Consensus 430 --~~dy~~vG~---------KG~~~~~~glfyaPYv~l~--------------~~~~~dp~sfqP~~g~~tRY~l~~NP~ 484 (521) T protein:vir:72 430 --KQDYFTVGY---------KGPNEMDAGIYYAPYVALT--------------PLRGSDPKNFQPVMGFKTRYGIGINPF 484 (521) T ss_pred --CcceEEEEE---------eCCcccccceeeccccccc--------------cccccCCccccceeeeeeeeceeecCc Confidence 122233331 11111 1111211100 00123444432211 11111111111 Q ss_pred cCCCc----ccC Q lcl|Aclame:pro 401 KTTTS----TAV 408 (408) Q Consensus 401 ~~~~~----~~~ 408 (408) +..+. 
+-+ T Consensus 485 ~~~~~~~~a~~i 496 (521) T protein:vir:72 485 AESAAQAPASRI 496 (521) T ss_pred ccccCcccceee Confidence 11111 111 Done!