Query lcl|NC_011614.1_cdsid_YP_002332517.1 [gene=SauSIPLA88_gp42] [protein=major head protein] [protein_id=YP_002332517.1] [location=20439..21413]
Match_columns 324
No_of_seqs 120 out of 1113
Neff 9.5
Searched_HMMs 1612
Date Thu Nov 7 13:13:59 2013
Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_41 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_41_vs_rec_db.hhr

No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 protein:vir:9309 Length: 324 # 100.0 1.3E-75 8.4E-79 431.0 34.1 324 1-324 1-324 (324)
2 protein:vir:97148 Length: 324 100.0 4.1E-75 2.5E-78 428.4 34.0 324 1-324 1-324 (324)
3 protein:vir:96392 Length: 324 100.0 1.4E-74 8.6E-78 425.5 33.5 324 1-324 1-324 (324)
4 protein:vir:78830 Length: 324 100.0 1.4E-74 8.6E-78 425.5 33.5 324 1-324 1-324 (324)
5 protein:vir:96223 Length: 324 100.0 1.9E-74 1.2E-77 424.8 33.7 324 1-324 1-324 (324)
6 protein:vir:103955 Length: 324 100.0 4.1E-74 2.5E-77 422.9 33.7 324 1-324 1-324 (324)
7 protein:vir:99749 Length: 324 100.0 4.8E-74 3E-77 422.6 33.8 324 1-324 1-324 (324)
8 protein:vir:95763 Length: 297 100.0 1.4E-63 8.5E-67 365.2 31.1 296 19-316 1-297 (297)
9 protein:vir:41 Length: 299 # N 100.0 2.4E-62 1.5E-65 358.5 30.7 293 22-316 1-299 (299)
10 protein:vir:5739 Length: 366 # 100.0 1.4E-61 8.6E-65 354.2 29.8 312 1-315 26-366 (366)
11 protein:vir:94142 Length: 304 100.0 4E-61 2.5E-64 351.7 30.4 295 19-314 1-304 (304)
12 protein:vir:105905 Length: 304 100.0 4E-61 2.5E-64 351.7 30.4 295 19-314 1-304 (304)
13 protein:vir:2430 Length: 318 # 100.0 8.6E-61 5.3E-64 349.9 30.9 306 14-320 1-318 (318)
14 protein:vir:7771 Length: 330 # 100.0 1E-60 6.3E-64 349.5 31.2 304 19-324 1-330 (330)
15 protein:vir:2344 Length: 397 # 100.0 2.5E-60 1.6E-63 347.3 29.6 306 15-324 1-315 (397)
16 protein:vir:105038 Length: 428 100.0 1.5E-60 9.1E-64 348.6 28.1 311 1-315 83-428 (428)
17 protein:vir:104085 Length: 320 100.0 4.7E-60 2.9E-63 345.8
30.1 304 14-318 1-320 (320) 18 protein:vir:4226 Length: 326 # 100.0 4.2E-60 2.6E-63 346.1 28.7 309 1-318 1-326 (326) 19 protein:vir:80376 Length: 435 100.0 1.4E-59 8.7E-63 343.2 30.8 314 1-317 92-435 (435) 20 protein:vir:485 Length: 407 # 100.0 1.6E-59 1E-62 342.9 30.4 305 1-322 80-407 (407) 21 protein:vir:80684 Length: 315 100.0 1.4E-59 8.6E-63 343.3 29.8 292 27-324 1-315 (315) 22 protein:vir:1433 Length: 435 # 100.0 2.5E-59 1.6E-62 341.8 30.0 314 1-317 91-435 (435) 23 protein:vir:9574 Length: 300 # 100.0 4E-59 2.5E-62 340.8 29.7 282 28-315 1-300 (300) 24 protein:vir:4456 Length: 401 # 100.0 1.1E-58 6.7E-62 338.4 29.1 298 1-315 81-401 (401) 25 protein:vir:100247 Length: 425 100.0 4.3E-58 2.7E-61 335.1 29.5 299 1-316 102-425 (425) 26 protein:vir:9759 Length: 303 # 100.0 6.9E-58 4.3E-61 334.0 30.1 281 29-315 1-303 (303) 27 protein:vir:1638 Length: 298 # 100.0 8.2E-58 5.1E-61 333.6 30.0 278 31-314 1-298 (298) 28 protein:vir:8187 Length: 311 # 100.0 1.1E-57 7.1E-61 332.8 30.2 281 29-316 1-311 (311) 29 protein:vir:94771 Length: 298 100.0 1.5E-57 9.1E-61 332.2 29.6 278 31-314 1-298 (298) 30 protein:vir:93616 Length: 645 100.0 5.2E-57 3.2E-60 329.1 31.1 315 1-321 289-645 (645) 31 protein:vir:100135 Length: 418 100.0 5.5E-57 3.4E-60 329.0 30.4 303 1-318 106-418 (418) 32 protein:vir:2504 Length: 305 # 100.0 5.1E-57 3.1E-60 329.2 28.3 289 27-323 1-305 (305) 33 protein:vir:78223 Length: 333 100.0 2.7E-56 1.6E-59 325.3 29.9 296 18-316 1-333 (333) 34 protein:vir:78523 Length: 338 100.0 3.7E-56 2.3E-59 324.5 30.2 301 1-318 1-338 (338) 35 protein:vir:1886 Length: 385 # 100.0 5.8E-56 3.6E-59 323.4 30.6 301 1-316 75-385 (385) 36 protein:vir:191 Length: 385 # 100.0 5.8E-56 3.6E-59 323.4 30.6 301 1-316 75-385 (385) 37 protein:vir:8102 Length: 543 # 100.0 3.6E-56 2.3E-59 324.5 29.5 302 1-316 220-543 (543) 38 protein:vir:4339 Length: 395 # 100.0 7E-56 4.3E-59 323.0 30.3 299 1-315 80-395 (395) 39 protein:vir:101650 Length: 497 100.0 9E-56 5.6E-59 322.4 29.2 303 1-319 116-497 (497) 40 
protein:vir:7855 Length: 497 # 100.0 9E-56 5.6E-59 322.4 29.2 303 1-319 116-497 (497) 41 protein:vir:1328 Length: 392 # 100.0 1.6E-55 1E-58 321.0 30.0 299 1-316 83-392 (392) 42 protein:vir:6242 Length: 390 # 100.0 1.3E-55 8.2E-59 321.5 29.5 297 1-316 81-390 (390) 43 protein:vir:102119 Length: 404 100.0 3.4E-55 2.1E-58 319.2 31.2 307 1-319 78-404 (404) 44 protein:vir:97053 Length: 390 100.0 2.1E-55 1.3E-58 320.3 29.7 297 1-313 81-390 (390) 45 protein:vir:6212 Length: 434 # 100.0 2.8E-55 1.7E-58 319.7 29.8 301 1-318 114-434 (434) 46 protein:vir:10364 Length: 390 100.0 3.8E-55 2.4E-58 318.9 30.4 297 1-313 81-390 (390) 47 protein:vir:81070 Length: 390 100.0 3.5E-55 2.2E-58 319.1 29.6 297 1-313 81-390 (390) 48 protein:vir:95376 Length: 425 100.0 4.4E-55 2.7E-58 318.6 28.2 302 1-319 110-425 (425) 49 protein:vir:4953 Length: 397 # 100.0 1.2E-54 7.6E-58 316.1 30.5 297 1-324 84-395 (397) 50 protein:vir:107593 Length: 392 100.0 1.4E-54 8.9E-58 315.8 29.3 296 1-323 70-392 (392) 51 protein:vir:102873 Length: 392 100.0 1.4E-54 8.9E-58 315.8 29.3 296 1-323 70-392 (392) 52 protein:vir:102082 Length: 392 100.0 1.4E-54 8.9E-58 315.8 29.3 296 1-323 70-392 (392) 53 protein:vir:105004 Length: 392 100.0 1.4E-54 8.9E-58 315.8 29.3 296 1-323 70-392 (392) 54 protein:vir:4997 Length: 397 # 100.0 3.5E-54 2.2E-57 313.6 31.0 298 1-324 79-394 (397) 55 protein:vir:4830 Length: 397 # 100.0 3.4E-54 2.1E-57 313.7 30.5 297 1-324 79-395 (397) 56 protein:vir:1025 Length: 408 # 100.0 5.3E-54 3.3E-57 312.6 31.3 298 1-324 84-408 (408) 57 protein:vir:4511 Length: 409 # 100.0 3.7E-54 2.3E-57 313.5 28.5 304 1-318 85-409 (409) 58 protein:vir:4856 Length: 293 # 100.0 7.8E-54 4.8E-57 311.7 30.0 276 23-324 1-291 (293) 59 protein:vir:81160 Length: 371 100.0 1E-53 6.4E-57 311.1 30.5 287 1-315 64-371 (371) 60 protein:vir:3991 Length: 404 # 100.0 1.5E-53 9.2E-57 310.2 31.1 297 1-323 87-404 (404) 61 protein:vir:4600 Length: 415 # 100.0 2.9E-53 1.8E-56 308.6 31.1 304 1-324 95-411 (415) 62 protein:vir:4700 Length: 
415 # 100.0 2.9E-53 1.8E-56 308.6 31.1 304 1-324 95-411 (415) 63 protein:vir:1268 Length: 397 # 100.0 1.5E-53 9E-57 310.3 29.3 288 1-315 92-397 (397) 64 protein:vir:81100 Length: 415 100.0 3.8E-53 2.3E-56 308.0 31.5 304 1-324 96-411 (415) 65 protein:vir:79987 Length: 415 100.0 3.8E-53 2.3E-56 308.0 31.5 304 1-324 96-411 (415) 66 protein:vir:98339 Length: 415 100.0 3.8E-53 2.3E-56 308.0 31.5 304 1-324 96-411 (415) 67 protein:vir:3845 Length: 395 # 100.0 2.7E-53 1.6E-56 308.8 30.6 295 1-324 81-392 (395) 68 protein:vir:7409 Length: 408 # 100.0 4E-53 2.5E-56 307.8 31.2 298 1-324 84-408 (408) 69 protein:vir:104256 Length: 458 100.0 2.8E-53 1.8E-56 308.7 29.9 298 1-315 126-458 (458) 70 protein:vir:99920 Length: 311 100.0 3.4E-53 2.1E-56 308.3 29.0 280 28-315 1-311 (311) 71 protein:vir:81227 Length: 413 100.0 4.2E-53 2.6E-56 307.7 29.2 303 1-321 85-413 (413) 72 protein:vir:96762 Length: 632 100.0 2.4E-53 1.5E-56 309.0 27.3 295 1-314 299-632 (632) 73 protein:vir:9410 Length: 415 # 100.0 1.1E-52 6.6E-56 305.5 30.6 304 1-324 96-411 (415) 74 protein:vir:1383 Length: 421 # 100.0 1E-52 6.5E-56 305.6 28.4 294 1-324 90-392 (421) 75 protein:vir:101607 Length: 379 100.0 7E-52 4.4E-55 301.0 29.2 290 1-315 76-379 (379) 76 protein:vir:98635 Length: 377 100.0 2.4E-52 1.5E-55 303.6 22.3 299 1-315 39-377 (377) 77 protein:vir:4092 Length: 390 # 100.0 1.8E-51 1.1E-54 298.8 26.9 303 1-324 38-379 (390) 78 protein:vir:9704 Length: 394 # 100.0 1.3E-50 8.1E-54 294.1 28.7 285 1-322 85-394 (394) 79 protein:vir:100172 Length: 394 100.0 3.5E-50 2.2E-53 291.7 30.7 293 1-324 84-393 (394) 80 protein:vir:78640 Length: 352 100.0 4.6E-51 2.9E-54 296.5 24.6 294 1-322 46-352 (352) 81 protein:vir:100884 Length: 389 100.0 4.1E-50 2.5E-53 291.4 29.8 289 1-323 83-389 (389) 82 protein:vir:94673 Length: 419 100.0 6.2E-50 3.8E-53 290.4 29.2 301 1-317 87-419 (419) 83 protein:vir:96978 Length: 387 100.0 1.7E-50 1.1E-53 293.4 24.3 294 1-322 85-387 (387) 84 protein:vir:94424 Length: 387 100.0 1.7E-50 1.1E-53 293.4 
24.3 294 1-322 85-387 (387) 85 protein:vir:2685 Length: 387 # 100.0 1.7E-50 1.1E-53 293.4 24.3 294 1-322 85-387 (387) 86 protein:vir:3870 Length: 400 # 100.0 1.1E-49 7E-53 288.9 27.5 283 1-316 89-400 (400) 87 protein:vir:1084 Length: 437 # 100.0 1.1E-49 6.8E-53 289.0 26.7 294 1-324 126-437 (437) 88 protein:vir:9509 Length: 381 # 100.0 2E-49 1.2E-52 287.6 27.1 301 1-324 37-378 (381) 89 protein:vir:101291 Length: 381 100.0 2E-49 1.2E-52 287.6 27.1 301 1-324 37-378 (381) 90 protein:vir:93881 Length: 387 100.0 9.1E-50 5.6E-53 289.4 24.7 294 1-322 81-387 (387) 91 protein:vir:9361 Length: 402 # 100.0 4.7E-50 2.9E-53 291.0 22.9 294 1-322 98-402 (402) 92 protein:vir:100632 Length: 381 100.0 1.6E-49 9.7E-53 288.2 25.1 301 1-324 20-380 (381) 93 protein:vir:9643 Length: 377 # 100.0 2.3E-49 1.4E-52 287.2 25.6 292 1-315 39-377 (377) 94 protein:vir:95963 Length: 395 100.0 6.1E-49 3.8E-52 284.9 26.8 303 1-324 45-389 (395) 95 protein:vir:80128 Length: 466 100.0 7.1E-49 4.4E-52 284.6 25.5 306 1-324 103-457 (466) 96 protein:vir:78350 Length: 383 100.0 2.9E-49 1.8E-52 286.7 22.0 300 1-323 45-383 (383) 97 protein:vir:8420 Length: 477 # 100.0 2E-48 1.2E-51 282.1 25.6 306 1-322 103-477 (477) 98 protein:vir:962 Length: 397 # 100.0 3E-47 1.9E-50 275.6 25.3 277 1-315 110-397 (397) 99 protein:vir:4197 Length: 314 # 100.0 8.3E-40 5.2E-43 234.8 25.3 287 1-318 1-314 (314) 100 protein:vir:4159 Length: 315 # 100.0 1.3E-38 8E-42 228.3 23.9 288 1-314 1-315 (315) 101 protein:vir:3158 Length: 321 # 100.0 1.2E-35 7.6E-39 212.0 25.0 299 1-322 1-321 (321) 102 protein:vir:9820 Length: 272 # 100.0 7.8E-35 4.8E-38 207.6 24.7 261 27-318 1-272 (272) 103 protein:vir:3033 Length: 272 # 100.0 7.8E-35 4.8E-38 207.6 24.7 261 27-318 1-272 (272) 104 protein:vir:97397 Length: 517 100.0 1.1E-34 6.7E-38 206.8 22.2 294 1-318 190-517 (517) 105 protein:vir:4074 Length: 480 # 99.9 2E-30 1.3E-33 183.3 16.0 284 1-318 151-480 (480) 106 protein:vir:93742 Length: 274 99.9 1.9E-27 1.2E-30 167.0 23.0 263 27-319 1-274 (274) 107 
protein:vir:3613 Length: 272 # 99.9 1.9E-26 1.2E-29 161.6 20.7 259 27-315 1-272 (272) 108 protein:vir:96833 Length: 275 99.9 4.9E-26 3E-29 159.3 21.4 264 24-319 1-275 (275) 109 protein:vir:105334 Length: 276 99.9 6.1E-26 3.8E-29 158.8 21.9 265 26-322 1-276 (276) 110 protein:vir:96123 Length: 274 99.9 3.1E-25 2E-28 154.9 22.4 263 27-319 1-274 (274) 111 protein:vir:94494 Length: 274 99.9 5.6E-25 3.5E-28 153.5 23.2 263 26-319 1-274 (274) 112 protein:vir:97433 Length: 274 99.9 5.6E-25 3.5E-28 153.5 23.2 263 26-319 1-274 (274) 113 protein:vir:1239 Length: 274 # 99.9 6.8E-24 4.2E-27 147.6 22.1 263 26-319 1-274 (274) 114 protein:vir:96262 Length: 274 99.9 8.5E-24 5.3E-27 147.1 22.6 263 26-319 1-274 (274) 115 protein:vir:95898 Length: 274 99.9 8.5E-24 5.3E-27 147.1 22.6 263 26-319 1-274 (274) 116 protein:vir:80930 Length: 278 99.9 5.8E-24 3.6E-27 148.0 21.5 266 27-316 1-278 (278) 117 protein:vir:95107 Length: 270 99.9 3.7E-23 2.3E-26 143.5 20.9 262 29-324 1-270 (270) 118 protein:vir:79928 Length: 393 99.9 1.9E-23 1.2E-26 145.2 18.7 306 1-324 28-386 (393) 119 protein:vir:94933 Length: 330 99.9 7.6E-23 4.7E-26 141.8 20.7 293 1-316 1-330 (330) 120 protein:vir:739 Length: 231 # 99.8 9.6E-21 6E-24 130.3 17.6 223 61-315 1-231 (231) 121 protein:vir:97255 Length: 310 99.7 2.7E-18 1.7E-21 116.9 21.7 274 24-315 1-310 (310) 122 protein:vir:99424 Length: 360 99.7 1.6E-17 1E-20 112.6 21.8 298 1-318 1-360 (360) 123 protein:vir:7990 Length: 273 # 99.6 3.7E-16 2.3E-19 105.2 19.4 258 31-315 1-273 (273) 124 protein:vir:5974 Length: 324 # 99.6 6.5E-16 4E-19 103.9 18.7 277 29-324 1-298 (324) 125 protein:vir:105822 Length: 273 99.6 1.1E-15 7E-19 102.5 19.8 258 31-315 1-273 (273) 126 protein:vir:102605 Length: 273 99.6 1.1E-15 7E-19 102.5 19.8 258 31-315 1-273 (273) 127 protein:vir:108211 Length: 318 99.5 5.4E-16 3.4E-19 104.3 17.8 282 24-316 1-318 (318) 128 protein:vir:94622 Length: 341 99.5 2.1E-15 1.3E-18 101.0 17.7 290 24-319 1-341 (341) 129 protein:vir:8324 Length: 410 # 99.5 1.6E-15 1E-18 
101.7 14.9 280 1-313 62-410 (410) 130 protein:vir:80213 Length: 334 99.4 4.4E-14 2.7E-17 93.8 17.8 283 15-317 1-334 (334) 131 protein:vir:1583 Length: 351 # 99.4 1.9E-13 1.2E-16 90.3 18.5 272 29-324 1-302 (351) 132 protein:vir:102944 Length: 330 99.3 4.2E-13 2.6E-16 88.4 20.1 278 27-324 1-304 (330) 133 protein:vir:80180 Length: 381 99.3 2.8E-13 1.8E-16 89.4 19.0 292 11-324 1-314 (381) 134 protein:vir:2201 Length: 345 # 99.3 2.6E-13 1.6E-16 89.6 17.1 282 15-315 1-345 (345) 135 protein:vir:8885 Length: 347 # 99.3 3.4E-13 2.1E-16 88.9 16.1 285 15-316 1-347 (347) 136 protein:vir:10450 Length: 344 99.2 2.7E-13 1.7E-16 89.5 13.5 284 15-315 1-344 (344) 137 protein:vir:94576 Length: 347 99.2 8.4E-13 5.2E-16 86.8 15.8 284 15-315 1-347 (347) 138 protein:vir:100057 Length: 375 99.2 8.1E-12 5E-15 81.4 20.6 292 15-320 1-375 (375) 139 protein:vir:6324 Length: 335 # 99.1 2E-11 1.3E-14 79.2 19.3 287 15-322 1-335 (335) 140 protein:vir:3364 Length: 347 # 99.1 1.7E-11 1.1E-14 79.6 17.9 286 15-317 1-347 (347) 141 protein:vir:1541 Length: 347 # 99.1 2.1E-11 1.3E-14 79.2 18.3 286 15-317 1-347 (347) 142 protein:vir:78739 Length: 332 99.1 9.5E-12 5.9E-15 81.0 16.3 287 12-313 1-332 (332) 143 protein:vir:103323 Length: 364 99.1 9.5E-11 5.9E-14 75.5 21.6 290 15-324 1-349 (364) 144 protein:vir:94711 Length: 347 99.1 3E-12 1.9E-15 83.7 13.2 284 15-316 1-347 (347) 145 protein:vir:78935 Length: 335 99.1 4.5E-11 2.8E-14 77.3 19.3 288 15-322 1-335 (335) 146 protein:vir:9927 Length: 295 # 99.1 8.8E-12 5.5E-15 81.2 14.6 265 24-322 1-295 (295) 147 protein:vir:99675 Length: 324 99.0 2.8E-11 1.7E-14 78.5 15.9 246 60-324 1-306 (324) 148 protein:vir:93858 Length: 400 99.0 3.3E-11 2.1E-14 78.0 15.5 289 1-313 87-400 (400) 149 protein:vir:106647 Length: 303 99.0 8.5E-11 5.3E-14 75.8 16.0 271 24-322 1-303 (303) 150 protein:vir:102655 Length: 322 99.0 1.4E-10 9E-14 74.5 17.2 282 18-316 1-322 (322) 151 protein:vir:9875 Length: 296 # 99.0 9.6E-11 6E-14 75.5 16.2 260 18-316 1-296 (296) 152 protein:vir:3136 
Length: 322 # 99.0 8.5E-11 5.3E-14 75.8 15.8 288 24-319 1-322 (322) 153 protein:vir:95318 Length: 328 98.9 1.5E-10 9.3E-14 74.5 15.8 225 21-245 1-328 (328) 154 protein:vir:97031 Length: 402 98.9 6.1E-10 3.8E-13 71.1 17.6 298 15-324 1-342 (402) 155 protein:vir:97331 Length: 319 98.9 2.4E-09 1.5E-12 67.9 20.8 283 1-324 1-303 (319) 156 protein:vir:94800 Length: 319 98.9 2.4E-09 1.5E-12 67.9 20.8 283 1-324 1-303 (319) 157 protein:vir:105645 Length: 400 98.8 1.4E-09 8.5E-13 69.2 17.0 290 15-324 1-345 (400) 158 protein:vir:103285 Length: 296 98.8 3E-09 1.8E-12 67.3 17.5 271 27-316 1-296 (296) 159 protein:vir:107120 Length: 329 98.7 8.9E-09 5.6E-12 64.7 19.8 285 1-324 12-314 (329) 160 protein:vir:8843 Length: 317 # 98.7 8.6E-09 5.3E-12 64.8 19.0 276 24-317 1-317 (317) 161 protein:vir:7019 Length: 401 # 98.7 2.4E-09 1.5E-12 67.9 15.5 290 15-324 1-346 (401) 162 protein:vir:107687 Length: 319 98.7 1.3E-08 7.9E-12 63.9 19.1 290 1-313 1-319 (319) 163 protein:vir:80068 Length: 301 98.6 2.5E-08 1.6E-11 62.3 19.1 265 30-313 1-301 (301) 164 protein:vir:107826 Length: 331 98.6 7.3E-09 4.6E-12 65.2 15.5 225 21-245 1-331 (331) 165 protein:vir:98525 Length: 331 98.6 7.3E-09 4.6E-12 65.2 15.5 225 21-245 1-331 (331) 166 protein:vir:107388 Length: 331 98.6 7.3E-09 4.6E-12 65.2 15.5 225 21-245 1-331 (331) 167 protein:vir:103759 Length: 330 98.5 8.2E-09 5.1E-12 64.9 13.9 225 21-245 1-330 (330) 168 protein:vir:79548 Length: 652 98.5 1.1E-07 7E-11 58.7 19.1 301 1-312 319-652 (652) 169 protein:vir:104342 Length: 314 98.5 4.1E-08 2.5E-11 61.1 16.5 286 1-316 1-314 (314) 170 protein:vir:80446 Length: 367 98.5 1.9E-07 1.2E-10 57.4 19.7 278 21-324 1-343 (367) 171 protein:vir:7324 Length: 335 # 98.4 2.5E-08 1.5E-11 62.3 14.3 226 21-246 1-335 (335) 172 protein:vir:79642 Length: 329 98.4 1.4E-07 8.9E-11 58.1 18.5 295 2-316 1-329 (329) 173 protein:vir:95131 Length: 325 98.4 1.2E-07 7.5E-11 58.5 16.7 275 1-324 1-301 (325) 174 protein:vir:99075 Length: 392 98.3 3.1E-07 1.9E-10 56.3 17.9 277 31-324 1-313 
(392) 175 protein:vir:108303 Length: 418 98.3 1.3E-06 7.8E-10 52.9 20.6 278 30-324 1-319 (418) 176 protein:vir:95512 Length: 693 98.2 8E-07 4.9E-10 54.0 17.2 300 1-313 357-693 (693) 177 protein:vir:78387 Length: 349 98.2 3.5E-06 2.2E-09 50.5 20.9 274 30-324 1-323 (349) 178 protein:vir:94989 Length: 349 98.0 8.7E-06 5.4E-09 48.3 21.4 274 30-324 1-323 (349) 179 protein:vir:99311 Length: 463 97.8 5.3E-06 3.3E-09 49.5 15.3 300 1-324 1-331 (463) 180 protein:vir:95603 Length: 463 97.8 5.3E-06 3.3E-09 49.5 15.3 300 1-324 1-331 (463) 181 protein:vir:3525 Length: 423 # 97.5 5E-05 3.1E-08 44.2 18.0 268 31-324 1-323 (423) 182 protein:vir:96792 Length: 315 97.5 5.6E-05 3.5E-08 43.9 17.3 266 29-324 1-288 (315) 183 protein:vir:105522 Length: 423 97.5 6.2E-05 3.8E-08 43.7 19.3 279 31-324 1-333 (423) 184 protein:vir:95875 Length: 401 97.4 5E-05 3.1E-08 44.2 16.1 291 21-316 1-401 (401) 185 protein:vir:5255 Length: 304 # 97.3 6.3E-05 3.9E-08 43.6 15.2 264 33-312 1-304 (304) 186 protein:vir:105374 Length: 423 97.0 0.00021 1.3E-07 40.7 18.4 275 31-324 1-333 (423) 187 protein:vir:101557 Length: 336 97.0 3.3E-05 2E-08 45.2 10.9 293 1-313 1-336 (336) 188 protein:vir:3643 Length: 336 # 96.9 4E-05 2.5E-08 44.7 10.9 292 1-313 1-336 (336) 189 protein:vir:94070 Length: 339 96.9 0.00022 1.3E-07 40.7 14.6 292 1-313 1-339 (339) 190 protein:vir:107732 Length: 379 96.8 0.00017 1.1E-07 41.2 13.6 294 1-313 21-379 (379) 191 protein:vir:174 Length: 423 # 96.7 0.0004 2.5E-07 39.2 19.3 272 31-324 1-318 (423) 192 protein:vir:1781 Length: 221 # 96.6 0.00022 1.4E-07 40.6 13.0 183 108-324 1-209 (221) 193 protein:vir:80835 Length: 464 96.5 0.00021 1.3E-07 40.7 12.2 303 1-324 1-328 (464) 194 protein:vir:63741 Length: 468 96.4 0.00056 3.5E-07 38.4 14.1 298 1-324 1-331 (468) 195 protein:vir:96666 Length: 462 96.4 0.00046 2.8E-07 38.9 13.3 298 1-324 3-331 (462) 196 protein:vir:78558 Length: 336 96.2 0.00031 1.9E-07 39.8 11.4 293 1-313 1-336 (336) 197 protein:vir:80491 Length: 467 96.1 0.0009 5.6E-07 37.3 13.6 
298 1-324 1-330 (467) 198 protein:vir:103886 Length: 302 96.0 0.0012 7.4E-07 36.6 16.2 265 1-315 1-302 (302) 199 protein:vir:96079 Length: 382 95.6 0.0018 1.1E-06 35.7 14.7 301 1-313 1-382 (382) 200 protein:vir:95451 Length: 313 95.6 0.0018 1.1E-06 35.6 16.0 272 29-316 1-313 (313) 201 protein:vir:861 Length: 318 # 95.3 0.00064 4E-07 38.1 9.8 286 1-313 5-318 (318) 202 protein:vir:99576 Length: 388 95.2 0.0016 1E-06 35.9 11.5 298 1-313 30-388 (388) 203 protein:vir:1829 Length: 355 # 95.1 0.0028 1.7E-06 34.6 18.4 297 1-324 1-351 (355) 204 protein:vir:106734 Length: 336 94.9 0.0018 1.1E-06 35.6 11.0 294 1-313 1-336 (336) 205 protein:vir:1663 Length: 393 # 94.7 0.00088 5.4E-07 37.3 9.0 286 1-313 80-393 (393) 206 protein:vir:98566 Length: 355 94.3 0.0047 2.9E-06 33.4 17.5 299 1-324 1-351 (355) 207 protein:vir:93966 Length: 400 94.0 0.0015 9.4E-07 36.0 8.7 286 1-313 87-400 (400) 208 protein:vir:78777 Length: 358 91.8 0.014 8.7E-06 30.7 17.6 300 1-324 1-353 (358) 209 protein:vir:94870 Length: 318 91.4 0.016 9.9E-06 30.4 11.9 287 1-313 5-318 (318) 210 protein:vir:104011 Length: 337 90.2 0.022 1.4E-05 29.7 19.1 290 1-318 1-337 (337) 211 protein:vir:79171 Length: 337 90.2 0.022 1.4E-05 29.6 19.1 290 1-318 1-337 (337) 212 protein:vir:79008 Length: 299 89.6 0.025 1.6E-05 29.3 20.9 271 30-317 1-299 (299) 213 protein:vir:102823 Length: 470 89.2 0.015 9.6E-06 30.5 8.7 276 1-324 1-313 (470) 214 protein:vir:1153 Length: 338 # 87.8 0.036 2.3E-05 28.5 17.4 291 1-317 1-338 (338) 215 protein:vir:78186 Length: 337 87.0 0.041 2.6E-05 28.2 16.7 290 1-318 1-337 (337) 216 protein:vir:6061 Length: 357 # 86.3 0.047 2.9E-05 27.9 16.3 299 1-324 1-351 (357) 217 protein:vir:348 Length: 321 # 86.1 0.048 3E-05 27.8 15.1 276 1-313 1-321 (321) 218 protein:vir:270 Length: 341 # 85.5 0.052 3.2E-05 27.6 15.0 299 1-324 1-340 (341) 219 protein:vir:100851 Length: 514 85.3 0.054 3.3E-05 27.6 13.5 301 1-324 1-363 (514) 220 protein:vir:78920 Length: 290 85.2 0.054 3.4E-05 27.5 20.4 270 1-315 1-290 (290) 221 
protein:vir:3746 Length: 336 # 82.4 0.077 4.8E-05 26.7 19.7 293 1-324 1-336 (336) 222 protein:vir:5942 Length: 523 # 82.3 0.078 4.8E-05 26.7 13.5 301 1-317 162-523 (523) 223 protein:vir:2016 Length: 357 # 80.2 0.097 6E-05 26.1 17.7 299 1-324 1-351 (357) 224 protein:vir:5694 Length: 357 # 77.7 0.12 7.6E-05 25.6 17.4 299 1-324 1-351 (357) 225 protein:vir:3783 Length: 336 # 76.7 0.13 8.3E-05 25.4 20.6 294 1-324 1-336 (336) 226 protein:vir:98856 Length: 343 75.3 0.15 9.2E-05 25.1 17.3 298 1-324 1-342 (343) 227 protein:vir:100331 Length: 342 75.3 0.15 9.2E-05 25.1 16.4 294 1-319 1-342 (342) 228 protein:vir:2736 Length: 348 # 65.4 0.28 0.00018 23.6 19.4 285 31-316 1-348 (348) 229 protein:vir:96490 Length: 348 64.8 0.29 0.00018 23.5 19.0 284 31-316 1-348 (348) 230 protein:vir:79157 Length: 339 64.5 0.3 0.00018 23.5 18.5 291 1-319 1-339 (339) 231 protein:vir:107882 Length: 307 63.8 0.31 0.00019 23.4 13.5 271 30-315 1-307 (307) 232 protein:vir:99888 Length: 309 62.6 0.33 0.00021 23.2 13.5 278 33-316 1-309 (309) 233 protein:vir:103463 Length: 521 61.4 0.35 0.00022 23.1 17.2 306 1-324 28-510 (521) 234 protein:vir:105464 Length: 346 61.0 0.36 0.00022 23.0 19.8 278 30-324 1-310 (346) 235 protein:vir:102335 Length: 312 59.5 0.39 0.00024 22.8 19.9 277 31-319 1-312 (312) 236 protein:vir:79712 Length: 285 55.8 0.47 0.00029 22.4 17.6 259 31-316 1-285 (285) 237 protein:vir:7214 Length: 521 # 52.0 0.56 0.00035 21.9 16.8 306 1-324 28-510 (521) 238 protein:vir:79078 Length: 307 50.6 0.61 0.00038 21.8 12.8 270 30-315 1-307 (307) 239 protein:vir:103370 Length: 418 49.5 0.64 0.0004 21.7 16.2 305 1-323 9-418 (418) 240 protein:vir:106590 Length: 349 43.4 0.85 0.00053 21.0 20.7 285 2-313 1-349 (349) 241 protein:vir:4902 Length: 348 # 40.9 0.95 0.00059 20.7 18.4 284 31-316 1-348 (348) 242 protein:vir:3424 Length: 341 # 36.2 1.2 0.00073 20.2 20.2 272 35-313 1-341 (341) 243 protein:vir:107947 Length: 519 35.8 1.2 0.00075 20.1 16.6 303 1-324 25-507 (519) 244 protein:vir:96442 Length: 418 34.3 1.3 
0.00081 20.0 14.7 309 1-323 9-418 (418) 245 protein:vir:98480 Length: 348 33.8 1.3 0.00083 19.9 17.5 281 29-314 1-348 (348) 246 protein:vir:99523 Length: 311 28.8 1.7 0.0011 19.3 19.5 275 30-315 1-311 (311) 247 protein:vir:93696 Length: 364 20.8 2.7 0.0017 18.2 16.9 280 27-320 1-364 (364) No 1 >protein:vir:9309 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803287;genbank:gi:29028597;genbank:GeneID:1258044 Probab=100.00 E-value=1.3e-75 Score=431.05 Aligned_cols=324 Identities=99% Similarity=1.399 Sum_probs=314.0 Q ss_pred CchhhHHHHHHHHHhhccchhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCceEEEEEeCCc Q lcl|NC_011614. 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKP 80 (324) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~ 80 (324) ||++++.|.+.|+|+.+....+++++++++.+++++++||++++++|++.+++.+++++++++++++++..+||+.++.+ T Consensus 1 ~~~~~~~~~~~~~f~~~~~~~~~~~a~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~ 80 (324) T protein:vir:93 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKP 80 (324) T ss_pred CchhHHHHHHHHHHHHhhhhhhhcccccccccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCc Confidence 99999999999999999999999999999989999999999999999999999999999999999999999999999999 Q ss_pred ceeeecccccccccccceeeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhccCcCcCCcccc Q lcl|NC_011614. 81 GAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIA 160 (324) Q Consensus 81 ~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~~~~~~~~ 160 (324) .+.|++||+++|+++++|++++++++|++++++||+|+++||.++++++|.++|++++++++|+++|+|+|++..+.++. 
T Consensus 81 ~a~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l~G~g~~~~~~~~~ 160 (324) T protein:vir:93 81 GAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIA 160 (324) T ss_pred ceeeecCCccccccccceeEEEEEeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999988888888 Q ss_pred cccccccceeecccchhHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCceeeccCCCceecccceEeecCccCC Q lcl|NC_011614. 161 QSIEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDRNSDSLDGLPVVNLKSSNLK 240 (324) Q Consensus 161 ~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~g~~~~~~~~~~~l~G~pv~~~~~~~~~ 240 (324) ......+....+..+++++.+++.++..+++.++.|+|||+++..|++++|++|++++..+.+++|+|+||+++.+...+ T Consensus 161 ~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~n~~~~~~L~~l~d~~G~~~~~~~~~~~l~G~PVv~~~~~~~~ 240 (324) T protein:vir:93 161 QSIEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDRNSDSLDGLPVVNLKSSNLK 240 (324) T ss_pred ccccccceeccccccHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhCCCCCeeecCCCCCcccceeeEeecCCCCC Confidence 88877777777889999999999999999999999999999999999999999999999999999999999998888889 Q ss_pred CceEEEeecccEEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeeccCCCCc Q lcl|NC_011614. 
241 RGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADAKPSSV 320 (324) Q Consensus 241 ~~~i~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~ 320 (324) ++.+++|||++++++.++++++++++++......+.++.++++|++|+++||++.|+||.+.+|+||++|+.+++.+++| T Consensus 241 ~~~i~~gdfs~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~n~~~~r~~~r~d~~v~~~~a~~~l~~a~~~~~~~ 320 (324) T protein:vir:93 241 RGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDSV 320 (324) T ss_pred cceEEEEecceEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEecccccCCCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccC Q lcl|NC_011614. 321 PGEV 324 (324) Q Consensus 321 ~~~~ 324 (324) |||| T Consensus 321 ~~~~ 324 (324) T protein:vir:93 321 PGEV 324 (324) T ss_pred CCCC Confidence 9999 No 2 >protein:vir:97148 Length: 324 # NCBI annotation: ORF010 # Family: family:all:507 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239726;genbank:gi:66394880;genbank:GeneID:5130881 Probab=100.00 E-value=4.1e-75 Score=428.42 Aligned_cols=324 Identities=98% Similarity=1.395 Sum_probs=314.4 Q ss_pred CchhhHHHHHHHHHhhccchhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCceEEEEEeCCc Q lcl|NC_011614. 
1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKP 80 (324) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~ 80 (324) ||++++.+.++++|+......+++++++++.+++++.+||++++++|++.+++.+++++++++++++++..++|+.++.+ T Consensus 1 ~~~~~~~~~~~~~f~~~~~~~~~~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~~~~~ 80 (324) T protein:vir:97 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKP 80 (324) T ss_pred CccchhHHHHHHHHHHhhhhhhhhccccccccCCCcceechhHHHHHHHHHHhhcchhhhcceeeccCCceEEEEEecCc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceeeecccccccccccceeeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhccCcCcCCcccc Q lcl|NC_011614. 81 GAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIA 160 (324) Q Consensus 81 ~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~~~~~~~~ 160 (324) .+.|++||+++|+++++|++++++++|++++++||+|+++|+.++++++|.++|++++++++|+++|+|+|++..+.++. T Consensus 81 ~a~~v~Eg~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l~G~g~~~~~~gi~ 160 (324) T protein:vir:97 81 GAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIA 160 (324) T ss_pred ceeEeccCccccccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccCcccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999998888888 Q ss_pred cccccccceeecccchhHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCceeeccCCCceecccceEeecCccCC Q lcl|NC_011614. 
161 QSIEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDRNSDSLDGLPVVNLKSSNLK 240 (324) Q Consensus 161 ~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~g~~~~~~~~~~~l~G~pv~~~~~~~~~ 240 (324) ......+....+..+++++.+++.++..+++.+++|+|||.++..|++++|++|++++..+..++|+|+||+++++.+.+ T Consensus 161 ~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~n~~~~~~L~~lkd~~g~~~~~~~~~~tl~G~PV~~~~~~~~~ 240 (324) T protein:vir:97 161 QSIEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDRNSDTLDGLPVVNLKSSNLK 240 (324) T ss_pred ccccccceeccccCCHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhcCCCceeecCCCCccccceeeEeecCCCCC Confidence 88777777777889999999999999999999999999999999999999999999999998999999999999888899 Q ss_pred CceEEEeecccEEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeeccCCCCc Q lcl|NC_011614. 241 RGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADAKPSSV 320 (324) Q Consensus 241 ~~~i~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~ 320 (324) +..+++|||++++++.++++++++++++......+.++.++++|++|+++||++.|+|+++.+|+||++|+++++.++.| T Consensus 241 ~~~~~~gd~~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~ 320 (324) T protein:vir:97 241 RGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKKTDSV 320 (324) T ss_pred cceEEEEecccEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeccCCCCCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccC Q lcl|NC_011614. 
321 PGEV 324 (324) Q Consensus 321 ~~~~ 324 (324) |||| T Consensus 321 ~~~~ 324 (324) T protein:vir:97 321 PGEV 324 (324) T ss_pred CCCC Confidence 9999 No 3 >protein:vir:96392 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239648;genbank:gi:66395381;genbank:GeneID:5132868 Probab=100.00 E-value=1.4e-74 Score=425.51 Aligned_cols=324 Identities=98% Similarity=1.395 Sum_probs=313.6 Q ss_pred CchhhHHHHHHHHHhhccchhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCceEEEEEeCCc Q lcl|NC_011614. 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKP 80 (324) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~ 80 (324) |++.++.+.++++|+.+....+.+++.+++.+++++++||+++.++|++.+++.+++++++++++++++..+||+.++.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~ 80 (324) T protein:vir:96 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKP 80 (324) T ss_pred CCcchhhhHHHHHHHHHhhhhhhhccccccccCcCccccchhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCc Confidence 99999999999999999999999999999999999999999999999999999999999999999999889999999999 Q ss_pred ceeeecccccccccccceeeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhccCcCcCCcccc Q lcl|NC_011614. 81 GAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIA 160 (324) Q Consensus 81 ~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~~~~~~~~ 160 (324) .+.|++|++++|+++++|++++++++|++++++||+|+++++.++++++|.++|++++++++|.++|+|+|++..+.++. 
T Consensus 81 ~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~l~G~g~~~~~~gi~ 160 (324) T protein:vir:96 81 GAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIA 160 (324) T ss_pred ceeEecCCccccccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCcCcccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999998888888 Q ss_pred cccccccceeecccchhHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCceeeccCCCceecccceEeecCccCC Q lcl|NC_011614. 161 QSIEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDRNSDSLDGLPVVNLKSSNLK 240 (324) Q Consensus 161 ~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~g~~~~~~~~~~~l~G~pv~~~~~~~~~ 240 (324) ......+....++.+++++.+++.++..+++.+++|+|||+++.+|++++|++|++++..+.+++|+|+||+++++...+ T Consensus 161 ~~~~~~~~~~~~~~t~~~i~~~~~~l~~~~~~~~~~vmn~~~~~~L~~l~d~~G~~~~~~~~~~~l~G~PV~~~~~~~~~ 240 (324) T protein:vir:96 161 QSIEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDRNSDSLDGLPVVNLKSSNLK 240 (324) T ss_pred ccccccceeccccccHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhccCCCeeecCCCCCcccceeeEeeCCCCCC Confidence 87777777777889999999999999999999999999999999999999999999999999999999999999888899 Q ss_pred CceEEEeecccEEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeeccCCCCc Q lcl|NC_011614. 
241 RGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADAKPSSV 320 (324) Q Consensus 241 ~~~i~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~ 320 (324) ++.+++|||++++++.++++++++++++......+.++.++++|++|+++||++.|+||.+.+|+||++|+++.+.+++| T Consensus 241 ~~~~~~gd~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~A~~~l~~a~~~~~~~ 320 (324) T protein:vir:96 241 RGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDSV 320 (324) T ss_pred cceEEEEecceEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEecccccCCCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccC Q lcl|NC_011614. 321 PGEV 324 (324) Q Consensus 321 ~~~~ 324 (324) |||| T Consensus 321 ~~~~ 324 (324) T protein:vir:96 321 PGEV 324 (324) T ss_pred CCCC Confidence 9999 No 4 >protein:vir:78830 Length: 324 # NCBI annotation: major head protein # Family: family:all:507 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285361;genbank:gi:148717889;genbank:GeneID:5246961 Probab=100.00 E-value=1.4e-74 Score=425.51 Aligned_cols=324 Identities=98% Similarity=1.395 Sum_probs=313.6 Q ss_pred CchhhHHHHHHHHHhhccchhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCceEEEEEeCCc Q lcl|NC_011614. 
1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKP 80 (324) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~ 80 (324) |++.++.+.++++|+.+....+.+++.+++.+++++++||+++.++|++.+++.+++++++++++++++..+||+.++.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~ 80 (324) T protein:vir:78 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKP 80 (324) T ss_pred CCcchhhhHHHHHHHHHhhhhhhhccccccccCcCccccchhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCc Confidence 99999999999999999999999999999999999999999999999999999999999999999999889999999999 Q ss_pred ceeeecccccccccccceeeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhccCcCcCCcccc Q lcl|NC_011614. 81 GAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIA 160 (324) Q Consensus 81 ~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~~~~~~~~ 160 (324) .+.|++|++++|+++++|++++++++|++++++||+|+++++.++++++|.++|++++++++|.++|+|+|++..+.++. T Consensus 81 ~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~l~G~g~~~~~~gi~ 160 (324) T protein:vir:78 81 GAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIA 160 (324) T ss_pred ceeEecCCccccccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCcCcccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999998888888 Q ss_pred cccccccceeecccchhHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCceeeccCCCceecccceEeecCccCC Q lcl|NC_011614. 
161 QSIEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDRNSDSLDGLPVVNLKSSNLK 240 (324) Q Consensus 161 ~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~g~~~~~~~~~~~l~G~pv~~~~~~~~~ 240 (324) ......+....++.+++++.+++.++..+++.+++|+|||+++.+|++++|++|++++..+.+++|+|+||+++++...+ T Consensus 161 ~~~~~~~~~~~~~~t~~~i~~~~~~l~~~~~~~~~~vmn~~~~~~L~~l~d~~G~~~~~~~~~~~l~G~PV~~~~~~~~~ 240 (324) T protein:vir:78 161 QSIEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDRNSDSLDGLPVVNLKSSNLK 240 (324) T ss_pred ccccccceeccccccHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhccCCCeeecCCCCCcccceeeEeeCCCCCC Confidence 87777777777889999999999999999999999999999999999999999999999999999999999999888899 Q ss_pred CceEEEeecccEEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeeccCCCCc Q lcl|NC_011614. 241 RGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADAKPSSV 320 (324) Q Consensus 241 ~~~i~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~ 320 (324) ++.+++|||++++++.++++++++++++......+.++.++++|++|+++||++.|+||.+.+|+||++|+++.+.+++| T Consensus 241 ~~~~~~gd~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~A~~~l~~a~~~~~~~ 320 (324) T protein:vir:78 241 RGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDSV 320 (324) T ss_pred cceEEEEecceEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEecccccCCCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccC Q lcl|NC_011614. 
321 PGEV 324 (324) Q Consensus 321 ~~~~ 324 (324) |||| T Consensus 321 ~~~~ 324 (324) T protein:vir:78 321 PGEV 324 (324) T ss_pred CCCC Confidence 9999 No 5 >protein:vir:96223 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239571;genbank:gi:66395304;genbank:GeneID:5132771 Probab=100.00 E-value=1.9e-74 Score=424.77 Aligned_cols=324 Identities=98% Similarity=1.396 Sum_probs=313.3 Q ss_pred CchhhHHHHHHHHHhhccchhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCceEEEEEeCCc Q lcl|NC_011614. 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKP 80 (324) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~ 80 (324) |||.++.+.+.|+|+......+++++++++.+++++++||++++++|++.+++.+++++++++++++++..+||+.++.+ T Consensus 1 ~~~~~~~~~~~~~f~~~~~~~~~~~a~~~~~~~~~~~lip~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~ 80 (324) T protein:vir:96 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKP 80 (324) T ss_pred CCcchhhhHHHHHHHHhhhhhhhcccccccccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCc Confidence 99999999999999999999999999999888889999999999999999999999999999999999999999999999 Q ss_pred ceeeecccccccccccceeeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhccCcCcCCcccc Q lcl|NC_011614. 81 GAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIA 160 (324) Q Consensus 81 ~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~~~~~~~~ 160 (324) .+.|++|++.+|+++++|+++++.++|++++++||+|+++|+.++++++|.++|++++++++|+++|+|+|++..+.++. 
T Consensus 81 ~a~~v~Eg~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~l~~aia~~~d~~~l~G~g~~~~~~~~~ 160 (324) T protein:vir:96 81 GAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIA 160 (324) T ss_pred ceeeecCCccccccccceeEEEEEeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCCcCcccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999998888888 Q ss_pred cccccccceeecccchhHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCceeeccCCCceecccceEeecCccCC Q lcl|NC_011614. 161 QSIEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDRNSDSLDGLPVVNLKSSNLK 240 (324) Q Consensus 161 ~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~g~~~~~~~~~~~l~G~pv~~~~~~~~~ 240 (324) ...........+.++++++.+++.++..+++.+++|+|||+++.+|++++|++|++++..+.+++|+|+||+++.+...+ T Consensus 161 ~~~~~~~~~~~~~~~~~~i~~~~~~i~~~~~~~~~~i~n~~~~~~L~~lkd~~G~~~~~~~~~~~l~G~PV~~~~~~~~~ 240 (324) T protein:vir:96 161 QSIKKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDRNSDSLDGLPVVNLKSSNLK 240 (324) T ss_pred ccccccceecccccchHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhCCCCCeeecCCCCCcccceeeEeecCCCCC Confidence 87777777777889999999999999999999999999999999999999999999999889999999999998888899 Q ss_pred CceEEEeecccEEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeeccCCCCc Q lcl|NC_011614. 
241 RGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADAKPSSV 320 (324) Q Consensus 241 ~~~i~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~ 320 (324) +..+++|||++++++.++++++++++++....+.+.++.++++|++|++++|++.|+||.+.+|+||++|+.+.+.+++| T Consensus 241 ~~~~~~gd~s~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~n~v~~r~~~r~d~~v~~~~a~~~l~~a~~~~~~~ 320 (324) T protein:vir:96 241 RGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDSV 320 (324) T ss_pred cceEEEEecceEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEecccccCCCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccC Q lcl|NC_011614. 321 PGEV 324 (324) Q Consensus 321 ~~~~ 324 (324) |||| T Consensus 321 ~~~~ 324 (324) T protein:vir:96 321 PGEV 324 (324) T ss_pred CCCC Confidence 9999 No 6 >protein:vir:103955 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873992;genbank:gi:118430767;genbank:GeneID:4525449 Probab=100.00 E-value=4.1e-74 Score=422.93 Aligned_cols=324 Identities=99% Similarity=1.398 Sum_probs=313.8 Q ss_pred CchhhHHHHHHHHHhhccchhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCceEEEEEeCCc Q lcl|NC_011614. 
1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKP 80 (324) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~ 80 (324) |+|.++.+.++|+|+.+....+++++++++.+++++++||++++++|++.+++.+++++++++++++++...||++++.+ T Consensus 1 ~~~~~~~~~~~~~f~~~~~~~~~~~a~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~ 80 (324) T protein:vir:10 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKP 80 (324) T ss_pred CCCchHHHHHHHHHHHHhhccceecccceeccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEeCCc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceeeecccccccccccceeeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhccCcCcCCcccc Q lcl|NC_011614. 81 GAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIA 160 (324) Q Consensus 81 ~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~~~~~~~~ 160 (324) .+.|++||+++|+++++|+++++.++|++++++||+|+++|+.++++++|.++|++++++++|.++|+|+|++..+.++. T Consensus 81 ~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~ai~~~~d~a~l~G~g~~~~~~~i~ 160 (324) T protein:vir:10 81 GAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIA 160 (324) T ss_pred ceeEeccCccccccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCccCcccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999998888888 Q ss_pred cccccccceeecccchhHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCceeeccCCCceecccceEeecCccCC Q lcl|NC_011614. 
161 QSIEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDRNSDSLDGLPVVNLKSSNLK 240 (324) Q Consensus 161 ~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~g~~~~~~~~~~~l~G~pv~~~~~~~~~ 240 (324) ......+....++++++++.+++.++..+++.+++|+|||+++..|++++|++|++++..+.+++|+|+||+++++...+ T Consensus 161 ~~~~~~~~~~~~~~t~~~i~~~~~~l~~~~~~~~~~v~n~~~~~~L~~l~d~~g~~~~~~~~~~~l~G~PV~~~~~~~~~ 240 (324) T protein:vir:10 161 QSIEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDRNSDTLDGLPVVNLKSSNLK 240 (324) T ss_pred ccccccceeccccCCHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhccCCceeecCCCCccccceeEEeecCCCCC Confidence 88777777777889999999999999999999999999999999999999999999999999999999999999888899 Q ss_pred CceEEEeecccEEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeeccCCCCc Q lcl|NC_011614. 241 RGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADAKPSSV 320 (324) Q Consensus 241 ~~~i~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~ 320 (324) +..+++|||++++++.++++++++++++..+...+.++.++++|++|++++|++.|+||.+.+|+||++|+++++.++.| T Consensus 241 ~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~A~~~l~~a~~~~~~~ 320 (324) T protein:vir:10 241 RGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKKTDSV 320 (324) T ss_pred cceEEEEecccEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEEeccCCCCCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccC Q lcl|NC_011614. 
321 PGEV 324 (324) Q Consensus 321 ~~~~ 324 (324) |||| T Consensus 321 ~~~~ 324 (324) T protein:vir:10 321 PGEV 324 (324) T ss_pred CCCC Confidence 9999 No 7 >protein:vir:99749 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004307;genbank:gi:122891761;genbank:GeneID:4712304 Probab=100.00 E-value=4.8e-74 Score=422.56 Aligned_cols=324 Identities=98% Similarity=1.397 Sum_probs=313.9 Q ss_pred CchhhHHHHHHHHHhhccchhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCceEEEEEeCCc Q lcl|NC_011614. 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKP 80 (324) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~ 80 (324) |||.++.+.++++|+++....+++++++++.+++++++||++++++|++.+++.+++++++++++++++..+||+.++.+ T Consensus 1 ~~k~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~lip~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~ 80 (324) T protein:vir:99 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMRLGKYEPMEGTEKKFTFWADKP 80 (324) T ss_pred CCCchHhhHHHHHHHHHhhhhhhccccceeccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceeeecccccccccccceeeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhccCcCcCCcccc Q lcl|NC_011614. 81 GAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIA 160 (324) Q Consensus 81 ~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~~~~~~~~ 160 (324) .+.|++||+++|+++++|++++++++|++++++||+|+++|+.++++++|.++|++++++++|+++|+|+|++..+.++. 
T Consensus 81 ~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~~~~~ 160 (324) T protein:vir:99 81 GAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIA 160 (324) T ss_pred ceeEeccCccccccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCccCcccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999998888888 Q ss_pred cccccccceeecccchhHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCceeeccCCCceecccceEeecCccCC Q lcl|NC_011614. 161 QSIEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDRNSDSLDGLPVVNLKSSNLK 240 (324) Q Consensus 161 ~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~g~~~~~~~~~~~l~G~pv~~~~~~~~~ 240 (324) ...........++++++++.+++.++..+++.+++|+|||+++..|++++|++|++++....+++|+|+||+++++.+.+ T Consensus 161 ~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~n~~~~~~L~~l~d~~g~~~~~~~~~~~l~G~PVv~~~~~~~~ 240 (324) T protein:vir:99 161 QSIEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDRNSDTLDGLPVVNLKSSNLK 240 (324) T ss_pred ccccccceeccccCCHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhcCCCceeecCCCCccccceeEEeecCCCCC Confidence 88777777777889999999999999999999999999999999999999999999999988999999999999888889 Q ss_pred CceEEEeecccEEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeeccCCCCc Q lcl|NC_011614. 
241 RGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADAKPSSV 320 (324) Q Consensus 241 ~~~i~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~ 320 (324) +..+++|||++++++.+++++|++++++..+...+.++..+++|++|++++|++.|+||.+.+|+||++|+++++.++.| T Consensus 241 ~~~~i~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~lt~a~~~~~~~ 320 (324) T protein:vir:99 241 RGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKKTDSV 320 (324) T ss_pred cceEEEEecccEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEEeccCCCCCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccC Q lcl|NC_011614. 321 PGEV 324 (324) Q Consensus 321 ~~~~ 324 (324) |||| T Consensus 321 ~~~~ 324 (324) T protein:vir:99 321 PGEV 324 (324) T ss_pred CCCC Confidence 9999 No 8 >protein:vir:95763 Length: 297 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950590;genbank:gi:119953785;genbank:GeneID:5076833 Probab=100.00 E-value=1.4e-63 Score=365.23 Aligned_cols=296 Identities=49% Similarity=0.811 Sum_probs=274.8 Q ss_pred chhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCc-eEEEEEeCCcceeeecccccccccccc Q lcl|NC_011614. 19 VKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTE-KKFTFWADKPGAYWVGEGQKIETSKAT 97 (324) Q Consensus 19 ~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~-~~ip~~~~~~~a~~v~Eg~~~~~~~~~ 97 (324) ...+.++++++++++++|++||++++++|++.+++.++++++++++++++.. 
..+|+..+.+.+.|++||+++++++++ T Consensus 1 m~~~~~~~~~~~~t~~~~~lvP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~ 80 (297) T protein:vir:95 1 MTVQTFNPENVLVSQKKDGTLHKEFTDIIMKEVAQNSLVMQLGQYQEMEGEQEKTVYVQTDGISAYWVNETEKIKTDKPE 80 (297) T ss_pred CCccccccccccccCCCcceechhHHHHHHHHHHhhchhhhhcceeecCCCccEEEEEEcCCceeEEeecCccccccccc Confidence 5566778888999999999999999999999999999999999999997654 578888888999999999999999999 Q ss_pred eeeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhccCcCcCCcccccccccccceeecccchh Q lcl|NC_011614. 98 WVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQD 177 (324) Q Consensus 98 ~~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 177 (324) |++++++++|++++++||+|+++||.++++++|.++|++++++++|+++|+|+|++ .+.++...........++.++++ T Consensus 81 f~~v~l~~~k~~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~l~G~g~~-~~~gi~~~~~~~~~~~~~~~t~~ 159 (297) T protein:vir:95 81 VVPVTLKAHKLGIILVTSREALNYTWKKFFEDMKPQIVEAFYKKIDEAGLLGHDTP-FANSVAKAAKDANKVIGGPINYD 159 (297) T ss_pred eeEEEEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhcccCCc-ccccccccccccceecccccCHH Confidence 99999999999999999999999999999999999999999999999999999865 45666666666666677789999 Q ss_pred HHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCceeeccCCCceecccceEeecCccCCCceEEEeecccEEEEEe Q lcl|NC_011614. 178 NIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDRNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYGIP 257 (324) Q Consensus 178 ~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~g~~~~~~~~~~~l~G~pv~~~~~~~~~~~~i~~gd~~~~~~~~~ 257 (324) ++.+++.++..++..+++|+|||+++.+|++++|++|+|++.. 
.+++|+|+||+.+.+...+++.+++|||++++++.+ T Consensus 160 ~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~G~~i~~~-~~~~l~G~Pv~~~~~~~~~~~~~~~gd~s~~~~~~~ 238 (297) T protein:vir:95 160 NILKLQDALYDADVEPNAFVSKIQNRSALREARDGNKVSIYDK-AANTIDGITTVDLKSARFEKGDLLAGDFDNLIYGVP 238 (297) T ss_pred HHHHHHHHhhhccCCcCEEEEcHHHHHHHHHhhccCCceeecC-CCCcccceeeEeecCCCCCCceEEEEecccEEEEEe Confidence 9999999999999999999999999999999999999999864 467899999999888888999999999999999999 Q ss_pred cceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeeccC Q lcl|NC_011614. 258 QLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADAK 316 (324) Q Consensus 258 ~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~~~ 316 (324) ++++++++++...+...+.++..+++|++|++++|++.|+|+++.+|+||++|+.++.. T Consensus 239 ~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~at~~ 297 (297) T protein:vir:95 239 YNITYKISEEGQISTITNADGTPINLFEQEMIAIRATMDIAVMITKTDAFAKLTPAERV 297 (297) T ss_pred cCeEEEEeeccccccccccCccchhhhhcCcEEEEEEEEeccEeecccceEEEeecCCC Confidence 99999999999999999999999999999999999999999999999999999998877 No 9 >protein:vir:41 Length: 299 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463467;swissprot:trembl:q9t1b7;genbank:gi:16798789;uniprot:Q9T1B7;genbank:GeneID:922353 Probab=100.00 E-value=2.4e-62 Score=358.45 Aligned_cols=293 Identities=34% Similarity=0.534 Sum_probs=265.3 Q ss_pred hhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCceEEEEEeCCcceeeecccccccccccceeeE Q lcl|NC_011614. 
22 QVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNA 101 (324) Q Consensus 22 ~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~v 101 (324) --+++++.+++++++++||++++++|++.+++.++++++++++|++++...+|+.+ .+.+.|++|++++|+++++|+++ T Consensus 1 ~g~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~-~~~a~~v~E~~~~~~~~~~f~~v 79 (299) T protein:vir:41 1 MGFNPDTTTMQSAKTGSIPINISEQIITGVKNGSAAMKLAKAVPMTKPEEEFTFMS-GVGAFWVDEAERIQTSKPTFTKA 79 (299) T ss_pred CCcCCCcccccCCCceecchhHHHHHHHHHHhcchhhhhceeeecCCCcEEEEEEc-CCceeeeecCccccccccceeEE Confidence 45667888889999999999999999999999999999999999999999999876 47799999999999999999999 Q ss_pred EeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhccCcCcCCcccccc-cccccceeecccchhHHH Q lcl|NC_011614. 102 TMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQS-IEKTNKVIKGDFTQDNII 180 (324) Q Consensus 102 ~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~~~~~~~~~~-~~~~~~~~~~~~~~~~i~ 180 (324) ++.++|++++++||+|++++|..+++++|.+.|++++++++|+++|+|+|++.+ .+++.. ....+....+..+++++. T Consensus 80 ~l~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l~G~g~~~~-~gil~~~~~~~~~~~~~~~~~~~l~ 158 (299) T protein:vir:41 80 KMRSKKMGVIIPTTKENLNYSVTNFFSLMQAEIVEAFYKKFDQAVFTGVESPYN-WNILKSATDASNLVEETANKYDDLN 158 (299) T ss_pred EEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhhcccCccc-ccccccccccceeeccccccHHHHH Confidence 999999999999999999999999999999999999999999999999987644 444443 333344455678899999 Q ss_pred HHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCceeeccC---CCceecccceEeecCccCC--CceEEEeecccEEEE Q lcl|NC_011614. 181 DLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDR---NSDSLDGLPVVNLKSSNLK--RGELITGDFDKLIYG 255 (324) Q Consensus 181 ~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~g~~~~~~~---~~~~l~G~pv~~~~~~~~~--~~~i~~gd~~~~~~~ 255 (324) +++.++..+++.+++|+|||+++.+|++++|++|+|++.+. 
..++|+|+||++++..+.+ +..+++|||++++++ T Consensus 159 ~~~~~l~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~gdfs~~~i~ 238 (299) T protein:vir:41 159 EAIGLIEAEDLEPNGIATIRKQRVKYRSTKDGNGMPIFNTATSNGVDDVLGLPIAYTPKYTFGDKDISELVGDWNQAYYG 238 (299) T ss_pred HHHHhhhcccCCcCEEEEcHHHHHHHHHhhccCCceeecCCcCCCCceecceeeEEecccCCCCCceEEEEEecccEEEE Confidence 99999999999999999999999999999999999998643 3468999999998766543 345999999999999 Q ss_pred EecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeeccC Q lcl|NC_011614. 256 IPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADAK 316 (324) Q Consensus 256 ~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~~~ 316 (324) .+++++++++++.......+.++.++++|++|++.+|++.|+|+++.+|+||++++.+++. T Consensus 239 ~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~A~~~l~~~aa~ 299 (299) T protein:vir:41 239 ILRGVEYEILTEATLTTVADETGKPLNLAERDMAAIKATFEVGFMVVKDEAFSAVQPKAGN 299 (299) T ss_pred EecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeccCC Confidence 9999999999999999999999999999999999999999999999999999999999888 No 10 >protein:vir:5739 Length: 366 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892050;genbank:gi:33770513;interpro:IPR006444;uniprot:Q7Y410;genbank:GeneID:1732928 Probab=100.00 E-value=1.4e-61 Score=354.24 Aligned_cols=312 Identities=12% Similarity=0.165 Sum_probs=250.8 Q ss_pred CchhhHHHHHHH-------------HHhhccchhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhh-ceeeec Q lcl|NC_011614. 
1 MEQTQKLKLNLQ-------------HFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQL-GKYEPM 66 (324) Q Consensus 1 m~~~~~~~~~~~-------------~~~~~~~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l-~~~~~~ 66 (324) ..|...+....+ +++..............+++++||.+||+++.++|++.+++.++++++ ++++++ T Consensus 26 ~~kg~~~~~~~~a~a~~~g~~~~a~~~a~~~~~~~~~~~a~~~~~~~Gg~lvP~~~~~~ii~~l~~~s~l~~lg~~~v~~ 105 (366) T protein:vir:57 26 QYKGAGMTRMVMSIAAGKGNLADAAKFAATELGDTGLSMAISTAAGSGGALIPQNMQNEVIELLRDRTVVRILGARSIPL 105 (366) T ss_pred cccchhHHHHHHHHHhcccchhHHHHHHHHhhcchhhhhhccccccCCccccchhHHHHHHHHHhhhcchhhhceeeeec Confidence 111111111111 111111111111112234556788899999999999999999999998 888999 Q ss_pred CCCceEEEEEeCCcceeeecccccccccccceeeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011614. 67 EGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAG 146 (324) Q Consensus 67 ~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~ 146 (324) .++.+++|+.++.+.+.|++|++.+|+++++|++++++++|++++++||+|+++||.++++++|.++|++++++++|+++ T Consensus 106 ~~g~~~~p~~t~~~~a~wv~E~~~~~~s~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~ 185 (366) T protein:vir:57 106 PNGNLSMPRLSGGATAGYVGEGKDVVATGATFDDVKLSAKTMIALVPVSNQLIGRAGFNVEQLLLGDILSAIATREDKAF 185 (366) T ss_pred CCCceEEEEEeCCcceeeeccCccccccccceeEEEEeeEEEEEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHh Confidence 99899999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HhccCcCcCCcccccccccccceee---cccchhH------HHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCcee Q lcl|NC_011614. 147 ILNQGNNPFGKSIAQSIEKTNKVIK---GDFTQDN------IIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKER 217 (324) Q Consensus 147 l~g~g~~~~~~~~~~~~~~~~~~~~---~~~~~~~------i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~g~~~ 217 (324) |+|+|++..|.++............ ...++.. 
+..+.......+...+.|+|||.++.+|++++|++|+|+ T Consensus 186 l~G~G~~~~p~Gi~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~vmn~~~~~~L~~lkd~~G~~l 265 (366) T protein:vir:57 186 LRDDGTGDTPKGMKAVATAANRLVAWTGTAINLTTIDEYLDSLILKHMDSNSNMIRCGWGLSNRTYMTLFGLRDGNGNKV 265 (366) T ss_pred hccCCCCccccceeeccccccceeeccccccchhhHHHHHHHHHHhhhccccccccCEEEecHHHHHHHHhhhccCCcee Confidence 9999988788887765443322211 1222222 223333344556778999999999999999999999999 Q ss_pred eccCCCceecccceEeecCccC------CCceEEEeecccEEEEEecceEEEEeecccccccccccccchhhhhcCcEEE Q lcl|NC_011614. 218 IYDRNSDSLDGLPVVNLKSSNL------KRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVAL 291 (324) Q Consensus 218 ~~~~~~~~l~G~pv~~~~~~~~------~~~~i~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~ 291 (324) |.....++|+|+||+.++..+. +...+++|||++++++.+++++++++++++ +.+.++..+++|++|++++ T Consensus 266 ~~~~~~g~l~G~Pvv~s~~ip~~~~~~~~~~~i~~gdfs~~~i~~~~~i~i~~~~ea~---~~~~~g~~~~~f~~~~~~i 342 (366) T protein:vir:57 266 YPEMSQGILKGYPIQRTSAIPANLGDDGNESEIYFCDFNDVVIGEDGMMKVDFSTEAT---YKDADGQLVSAFARNQSLI 342 (366) T ss_pred ccCCCCCeecceeeEEccccccccccCCCccEEEEEecceEEEEEecceEEEEeeccc---cccccccchhhhhcCceeE Confidence 9888889999999999775433 346799999999999999999999999974 5566778889999999999 Q ss_pred EEEEEeccEEecccceEEEEeecc Q lcl|NC_011614. 292 RATMHVALHIADDKAFAKLVPADA 315 (324) Q Consensus 292 r~~~r~d~~v~~~~a~~~l~~~~~ 315 (324) |+++|+||++.||+||++++...| T Consensus 343 R~~~~~d~~v~~~~a~~~lt~~~~ 366 (366) T protein:vir:57 343 RVVTEHDIGFRHPEGLVLGTGVIW 366 (366) T ss_pred EeeeeeCcEeeccccEEEEecccC Confidence 999999999999999999999999 No 11 >protein:vir:94142 Length: 304 # NCBI annotation: ORF013 # Family: family:all:507 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240234;genbank:gi:66395898;genbank:GeneID:5133311 Probab=100.00 E-value=4e-61 Score=351.73 Aligned_cols=295 Identities=39% Similarity=0.627 Sum_probs=261.0 Q ss_pred chhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCceEEEEEeCCcceeeecccccccccccce Q lcl|NC_011614. 
19 VKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKPGAYWVGEGQKIETSKATW 98 (324) Q Consensus 19 ~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~ 98 (324) ...++.++.++++++++|.+||++++++|++.+++.+++++++++++++++..+||++++.+.+.|++|++++|+++++| T Consensus 1 ma~~~~~~~~~~~t~~gg~lip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~~~~~~~~ 80 (304) T protein:vir:94 1 MATPTYTPGNVILSDFKNGVIPAEQGTLIMKDIMANSAIMKLAKNEPMTAQKKKFTYLAKGVGAYWVSETERIQTSKPEY 80 (304) T ss_pred CcccccccccccccCCCceecchhHHHHHHHHHHhccchhhhcceeeccCCceEEEEEeCCcceEEeecCccccccccee Confidence 45556677888899999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhccCcCcCCc----ccccccccc-cceeecc Q lcl|NC_011614. 99 VNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGK----SIAQSIEKT-NKVIKGD 173 (324) Q Consensus 99 ~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~~~~~----~~~~~~~~~-~~~~~~~ 173 (324) ++++++++|++++++||+|++++|..+++++|.++|++++++++|+++|+|+|++.... ++....... .....+. T Consensus 81 ~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (304) T protein:vir:94 81 AQAEMEAKKIGVIIPLSKEFLKWTAKDFFNEVKPLIAEAFYKAFDQAVIFGTKSPYNTSTSGKPLVEGAEEKGNVVTDTN 160 (304) T ss_pred eEEEEEEEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhheeccCCCccccccccccccccccccccccccc Confidence 99999999999999999999999999999999999999999999999999998754322 222222222 2223456 Q ss_pred cchhHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCceeeccCCCceecccceEeecCcc--CCCceEEEeeccc Q lcl|NC_011614. 174 FTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDRNSDSLDGLPVVNLKSSN--LKRGELITGDFDK 251 (324) Q Consensus 174 ~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~g~~~~~~~~~~~l~G~pv~~~~~~~--~~~~~i~~gd~~~ 251 (324) .+++++.+++.++..++..+++|+|||+++.+|++++|++|+|+|... 
+++|+|+||+++++.+ .++..+++|||++ T Consensus 161 ~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~lkd~~G~~l~~~~-~~~l~G~PV~~~~~~~~~~~~~~~~~gd~~~ 239 (304) T protein:vir:94 161 NLYVDLSALMATIEDEELDPNGVLTTRSFRSKMRNALDANDRPLFDAN-GNEIMGLPLSYTGADVYDKKKSLALMGDWDY 239 (304) T ss_pred chHHHHHHHHHHhhhccCCcCEEEEcHHHHHHHHHhhccCCcEeecCC-CccccceeeEEecccccCCCCcEEEEEehhh Confidence 789999999999999999999999999999999999999999998654 5789999999887654 3567899999999 Q ss_pred EEEEEecceEEEEeeccccc--ccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeec Q lcl|NC_011614. 252 LIYGIPQLIEYKIDETAQLS--TVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPAD 314 (324) Q Consensus 252 ~~~~~~~~~~i~~~~~~~~~--~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~ 314 (324) ++++.++++++++++++... .+.+.++..+++|++|++++|+++|+|+.+.+|+||++||.+. T Consensus 240 ~~~~~~~~~~i~~~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v~~~~a~~~l~~a~ 304 (304) T protein:vir:94 240 ARYGILQGIEYAISEDATLTTLQASDASGQPVSLFERDMFALRATMHIAYMNVKPEAFATLKPTE 304 (304) T ss_pred EEEEEecceEEEEeecceeeeecccccCccchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 99999999999999997644 4556777889999999999999999999999999999999987 No 12 >protein:vir:105905 Length: 304 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004375;genbank:gi:122891830;genbank:GeneID:4712376 Probab=100.00 E-value=4e-61 Score=351.73 Aligned_cols=295 Identities=39% Similarity=0.627 Sum_probs=261.0 Q ss_pred chhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCceEEEEEeCCcceeeecccccccccccce Q lcl|NC_011614. 
19 VKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKPGAYWVGEGQKIETSKATW 98 (324) Q Consensus 19 ~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~ 98 (324) ...++.++.++++++++|.+||++++++|++.+++.+++++++++++++++..+||++++.+.+.|++|++++|+++++| T Consensus 1 ma~~~~~~~~~~~t~~gg~lip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~~~~~~~~ 80 (304) T protein:vir:10 1 MATPTYTPGNVILSDFKNGVIPAEQGTLIMKDIMANSAIMKLAKNEPMTAQKKKFTYLAKGVGAYWVSETERIQTSKPEY 80 (304) T ss_pred CcccccccccccccCCCceecchhHHHHHHHHHHhccchhhhcceeeccCCceEEEEEeCCcceEEeecCccccccccee Confidence 45556677888899999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhccCcCcCCc----ccccccccc-cceeecc Q lcl|NC_011614. 99 VNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGK----SIAQSIEKT-NKVIKGD 173 (324) Q Consensus 99 ~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~~~~~----~~~~~~~~~-~~~~~~~ 173 (324) ++++++++|++++++||+|++++|..+++++|.++|++++++++|+++|+|+|++.... ++....... .....+. T Consensus 81 ~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (304) T protein:vir:10 81 AQAEMEAKKIGVIIPLSKEFLKWTAKDFFNEVKPLIAEAFYKAFDQAVIFGTKSPYNTSTSGKPLVEGAEEKGNVVTDTN 160 (304) T ss_pred eEEEEEEEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhheeccCCCccccccccccccccccccccccccc Confidence 99999999999999999999999999999999999999999999999999998754322 222222222 2223456 Q ss_pred cchhHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCceeeccCCCceecccceEeecCcc--CCCceEEEeeccc Q lcl|NC_011614. 174 FTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDRNSDSLDGLPVVNLKSSN--LKRGELITGDFDK 251 (324) Q Consensus 174 ~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~g~~~~~~~~~~~l~G~pv~~~~~~~--~~~~~i~~gd~~~ 251 (324) .+++++.+++.++..++..+++|+|||+++.+|++++|++|+|+|... 
+++|+|+||+++++.+ .++..+++|||++ T Consensus 161 ~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~lkd~~G~~l~~~~-~~~l~G~PV~~~~~~~~~~~~~~~~~gd~~~ 239 (304) T protein:vir:10 161 NLYVDLSALMATIEDEELDPNGVLTTRSFRSKMRNALDANDRPLFDAN-GNEIMGLPLSYTGADVYDKKKSLALMGDWDY 239 (304) T ss_pred chHHHHHHHHHHhhhccCCcCEEEEcHHHHHHHHHhhccCCcEeecCC-CccccceeeEEecccccCCCCcEEEEEehhh Confidence 789999999999999999999999999999999999999999998654 5789999999887654 3567899999999 Q ss_pred EEEEEecceEEEEeeccccc--ccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeec Q lcl|NC_011614. 252 LIYGIPQLIEYKIDETAQLS--TVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPAD 314 (324) Q Consensus 252 ~~~~~~~~~~i~~~~~~~~~--~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~ 314 (324) ++++.++++++++++++... .+.+.++..+++|++|++++|+++|+|+.+.+|+||++||.+. T Consensus 240 ~~~~~~~~~~i~~~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v~~~~a~~~l~~a~ 304 (304) T protein:vir:10 240 ARYGILQGIEYAISEDATLTTLQASDASGQPVSLFERDMFALRATMHIAYMNVKPEAFATLKPTE 304 (304) T ss_pred EEEEEecceEEEEeecceeeeecccccCccchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 99999999999999997644 4556777889999999999999999999999999999999987 No 13 >protein:vir:2430 Length: 318 # NCBI annotation: major head subunit # Family: family:all:507 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046832;genbank:gi:9630400;genbank:GeneID:1261582 Probab=100.00 E-value=8.6e-61 Score=349.89 Aligned_cols=306 Identities=19% Similarity=0.235 Sum_probs=265.7 Q ss_pred HhhccchhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCceEEEEEeCCcceeeecccccccc Q lcl|NC_011614. 
14 FASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKPGAYWVGEGQKIET 93 (324) Q Consensus 14 ~~~~~~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~ 93 (324) .++......+.+....+++++++++||+++.++|++.+++.+++++++++++++++..+||+.++.+.++|++|++++++ T Consensus 1 ~~~~~~~~~e~~~~~~~~~~~~~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~ 80 (318) T protein:vir:24 1 MAAGTAFAVDHAQIAQTGDTMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKVPMGTTGQKIPHWVGDVSAQWIGEGDMKPI 80 (318) T ss_pred CCCCCCCCHHHHHhhcccCcccceeechhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEeCCcceEEecCCccccc Confidence 22222222233444456677788999999999999999999999999999999999999999999999999999999999 Q ss_pred cccceeeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhccCcCcCCcccccccccccce---e Q lcl|NC_011614. 94 SKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKV---I 170 (324) Q Consensus 94 ~~~~~~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~~~~~~~~~~~~~~~~~---~ 170 (324) ++++|++++++++|+++++++|+|+++||.++++++|.+.|++++++++|+++|+|+|++. +.++.......... . T Consensus 81 ~~~~f~~i~~~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~~~~~~~d~a~l~G~g~~~-~~~~~~~~~~~~~~~~~~ 159 (318) T protein:vir:24 81 TKGNMTSQTIAPHKIATIFVASAETVRANPANYLGTMRTKVATAFAMAFDGAAMHGTDSPF-PTYIGQTTKAISIADTTG 159 (318) T ss_pred cccceeEEEEeeEEEEEeehhhHHHhhcChHHHHHHHHHHHHHHHHHHHHHhhhcccCCCC-Cccccccccccccccccc Confidence 9999999999999999999999999999999999999999999999999999999998653 44444433322221 1 Q ss_pred ecccchhHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCceeeccCC---------CceecccceEeecCccCCC Q lcl|NC_011614. 171 KGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDRN---------SDSLDGLPVVNLKSSNLKR 241 (324) Q Consensus 171 ~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~g~~~~~~~~---------~~~l~G~pv~~~~~~~~~~ 241 (324) ......+++.+++..+...++.+++|+|||+++..|+++||++|+|++.+.. 
.+.++|+|++++++.+.++ T Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~~~~~~i~g~pv~~~~~~~~~~ 239 (318) T protein:vir:24 160 ATTVYDQVAVNGLSLLVNDGKKWTHTLLDDITEPILNGAKDQNGRPLFIESTYGEAASPFRSGRIVARPTILSDHVVEGT 239 (318) T ss_pred ccchHHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhccCCceeecCccccCccccccCceEEEEeeEEeCCCCCCc Confidence 2233446678899999999999999999999999999999999999986432 2468999999988888888 Q ss_pred ceEEEeecccEEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeeccCCCCc Q lcl|NC_011614. 242 GELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADAKPSSV 320 (324) Q Consensus 242 ~~i~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~ 320 (324) ..+++|||+.++++.++++++++++++.++...+.++.++++|++|++++|+++|+|+++.+|+||++|+++++.++.- T Consensus 240 ~~~~~gdfs~~~~~~~~~l~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~i~~~~a~~~~~ 318 (318) T protein:vir:24 240 TVGFMGDFSQLIWGQIGGLSFDVTDQATLNLGTVESPNFVSLWQHNLVAVRVEAEYAFHCNDAEAFVALTNVVSGGGEG 318 (318) T ss_pred cEEEEeecceEEEEEecCeEEEEeeccceeccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEEeeccCCCCC Confidence 8899999999999999999999999999999999999999999999999999999999999999999999998888777 No 14 >protein:vir:7771 Length: 330 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817605;genbank:gi:29566035;genbank:GeneID:1259229 Probab=100.00 E-value=1e-60 Score=349.50 Aligned_cols=304 Identities=19% Similarity=0.264 Sum_probs=263.6 Q ss_pred chhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCceEEEEEeCCcceeeecccccccccccce Q lcl|NC_011614. 
19 VKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKPGAYWVGEGQKIETSKATW 98 (324) Q Consensus 19 ~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~ 98 (324) ....+.++..++.+.++|+++|++++++|++.+++.++++++++++++.++.+++|++++.+.+.|++||+++++++++| T Consensus 1 m~~~~~~a~~~~~t~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~f 80 (330) T protein:vir:77 1 MAGSTVPSTQVALTGDFSAFLTPEQSQDYFAEIEKTSIVQRIARKVPMGPTGISIPHWTGAVSASWTGEAERKPITKGSF 80 (330) T ss_pred CcccccchhhccccCCCcceechhHHHHHHHHHHhccchhhhcceeeccCCceEEEEEcCCcceeEecCCCcccccccee Confidence 56667788888888899999999999999999999999999999999999989999999999999999999999999999 Q ss_pred eeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhccCcCcCCcccccccccc---------cce Q lcl|NC_011614. 99 VNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKT---------NKV 169 (324) Q Consensus 99 ~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~~~~~~~~~~~~~~---------~~~ 169 (324) ++++++++|++++++||+|+++++.++++++|.++|++++++++|+++|+|+|++..+.++....... ... T Consensus 81 ~~i~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~ai~~~~~~~~l~G~g~~~~~~g~~~~~~~~~~~~~~~~~~~~ 160 (330) T protein:vir:77 81 GKQELEPVKITTIFAESAEVVRLNPLNYLNTMRTKIAEAIALKFDAAAIHGIDKPSAFKGYLAETTKVVSLADTNLTTAS 160 (330) T ss_pred eEEEEeEEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCCccccccccccccceeecccccccc Confidence 99999999999999999999999999999999999999999999999999999887766655433221 111 Q ss_pred eecccchhHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCceeeccC---------CCceecccceEeecCccC- Q lcl|NC_011614. 170 IKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDR---------NSDSLDGLPVVNLKSSNL- 239 (324) Q Consensus 170 ~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~g~~~~~~~---------~~~~l~G~pv~~~~~~~~- 239 (324) ......++++.+++.++..++..+++|+|||+++..|+++||++|+|+|.+. .+++|+|+||+++++.+. 
T Consensus 161 ~~~~~~~~~l~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~l~G~PV~~~~~~p~~ 240 (330) T protein:vir:77 161 GPQGNAYLAVNNALSLLVNSGKKWTGTLLDNVTEPILNTAVDGNGRPLFVESTYTEQVGAIREGRILGRPTYVADNVVNG 240 (330) T ss_pred cccchhHHHHHHHHHhhhhcCCCccEEEEcHHHHHHHHHHhccCCceeecCccccccccccCCceecceeeEEeccccCC Confidence 2234568899999999999999999999999999999999999999998642 235899999999876543 Q ss_pred ---CCceEEEeecccEEEEEecceEEEEeecccccccccc----cccchhhhhcCcEEEEEEEEeccEEecccceEEEEe Q lcl|NC_011614. 240 ---KRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNE----DGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVP 312 (324) Q Consensus 240 ---~~~~i~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~----~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~ 312 (324) ++..+++|||+.++++.++++++++++++......+. ....+++|++|++++|++.|+|+++.+|+||++|+. T Consensus 241 ~~~~~~~~~~gd~s~~~i~~~~~~~i~~~~e~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~i~~ 320 (330) T protein:vir:77 241 TVGNRVVGVMGDFSQVIWGQIGGLSFDVTDQATLDFGEEQGGVWVPKLISLWQHNMVAVRCEAEFAFMVNDKDAFVKLTD 320 (330) T ss_pred CCCCccEEEEEecceEEEEEecCcEEEEeecceeeecccccccccccccchhhcCcEEEEEEEEeccEEecccceEEEEe Confidence 3567999999999999999999999999876544332 345678999999999999999999999999999999 Q ss_pred eccCCCCccccC Q lcl|NC_011614. 313 ADAKPSSVPGEV 324 (324) Q Consensus 313 ~~~~~~~~~~~~ 324 (324) +++ ..+|.|- T Consensus 321 ~~~--~~~~~~~ 330 (330) T protein:vir:77 321 QVA--GTDPEEE 330 (330) T ss_pred ccC--CcCCCCC Confidence 874 3445555 No 15 >protein:vir:2344 Length: 397 # NCBI annotation: gp14 # Family: family:all:507 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075281;genbank:gi:12657868;genbank:GeneID:920118 Probab=100.00 E-value=2.5e-60 Score=347.34 Aligned_cols=306 Identities=17% Similarity=0.196 Sum_probs=265.2 Q ss_pred hhccchhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCceEEEEEeCCcceeeeccccccccc Q lcl|NC_011614. 
15 ASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKPGAYWVGEGQKIETS 94 (324) Q Consensus 15 ~~~~~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~ 94 (324) ++.+...+ ....+++++++++||++++++|++.+++.+++++++++++++++..+||+.+..+.+.|++|+++++++ T Consensus 1 ~g~~~e~~---~~~~~~t~~~~g~l~~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~wv~Eg~~~~~s 77 (397) T protein:vir:23 1 MGFSADHS---QIAQTKDTMFTGYLDPVQAKDYFAEAEKTSIVQRVAQKIPMGATGIVIPHWTGDVSAQWIGEGDMKPIT 77 (397) T ss_pred CCcCHHHH---HHhhccCCCCccccchhHHHHHHHHHHhccchhhhcceeeccCCceEEEEEcCCcceEEecCCcccccc Confidence 33322222 222345556677899999999999999999999999999999999999999999999999999999999 Q ss_pred ccceeeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhccCcCcCCcccccccccccceeeccc Q lcl|NC_011614. 95 KATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDF 174 (324) Q Consensus 95 ~~~~~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~~~~~~~~~~~~~~~~~~~~~~ 174 (324) +++|++++++++|++++++||+|+++++.++++++|.++|++++++++|+++|+|+|+.....++...... ....++.. T Consensus 78 ~~~f~~v~l~~~k~~~~v~iS~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l~G~gt~~~~~~~~~~~~~-~~~~~~~~ 156 (397) T protein:vir:23 78 KGNMTKRDVHPAKIATIFVASAETVRANPANYLGTMRTKVATAIAMAFDNAALHGTNAPSAFQGYLDQSNK-TQSISPNA 156 (397) T ss_pred ccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhcccCCcccccccccccc-eeeecccc Confidence 99999999999999999999999999999999999999999999999999999999987655555444333 33344567 Q ss_pred chhHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCceeeccCC---------CceecccceEeecCccCCCceEE Q lcl|NC_011614. 175 TQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDRN---------SDSLDGLPVVNLKSSNLKRGELI 245 (324) Q Consensus 175 ~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~g~~~~~~~~---------~~~l~G~pv~~~~~~~~~~~~i~ 245 (324) .++++.++..++..+++.++.|+|||+++.+|+++||++|+|+|.+.. 
+++++|+||++++..+.++..++ T Consensus 157 ~~~~~~~~~~~l~~~~~~~a~~vmn~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~~~~~tl~G~Pv~~s~~~~~g~~~~~ 236 (397) T protein:vir:23 157 YQGLGVSGLTKLVTDGKKWTHTLLDDTVEPVLNGSVDANGRPLFVESTYESLTTPFREGRILGRPTILSDHVAEGDVVGY 236 (397) T ss_pred hhHHHHHHHHhhhhcccCCCEEEEcHHHHHHHHHhhccCCceeecccccccccccccCceeeeeeEEEeCCCCCCceEEE Confidence 788899999999999999999999999999999999999999986532 24799999999988887777889 Q ss_pred EeecccEEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeeccCCCCccccC Q lcl|NC_011614. 246 TGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADAKPSSVPGEV 324 (324) Q Consensus 246 ~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~~~~~ 324 (324) +|||++++++.+++++++++++..+....+..+.++++|++|+++||++.|+||++.+|+||++++..+.....+..+- T Consensus 237 ~gDfs~~~i~~~~~i~i~~~~e~~~~~~~~~~~~~~~lf~~d~v~~ra~~r~d~~v~~~~a~~~~~~~~~~~~~~~~~~ 315 (397) T protein:vir:23 237 AGDFSQIIWGQVGGLSFDVTDQATLNLGSQESPNFVSLWQHNLVAVRVEAEYGLLINDVNAFVKLTFDPVLTTYALDLD 315 (397) T ss_pred EeecceEEEEEEeceEEEEeeeeeeeeccccccceeeeeeccceeEEEEeeeccceecccceEEEeeccccceeeeccc Confidence 9999999999999999999999999999999999999999999999999999999999999999998665444432211 No 16 >protein:vir:105038 Length: 428 # NCBI annotation: major capsid head protein precursor # Family: family:all:21 # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006586;genbank:gi:46402092;genbank:GeneID:2777903 Probab=100.00 E-value=1.5e-60 Score=348.62 Aligned_cols=311 Identities=13% Similarity=0.164 Sum_probs=249.6 Q ss_pred Cc----hhhHHHHHHH-------------HHhhccchh-hhhccccccccCCCcceechhhhHHHHHHHHhhcchhhh-c Q lcl|NC_011614. 1 ME----QTQKLKLNLQ-------------HFASNNVKP-QVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQL-G 61 (324) Q Consensus 1 m~----~~~~~~~~~~-------------~~~~~~~~~-~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l-~ 61 (324) ++ +...+....+ .++...... 
.+.++ ..+.+++||.+||+++.++|++.+++.++++++ + T Consensus 83 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~gg~liP~~~~~~ii~~l~~~~~l~~~~~ 161 (428) T protein:vir:10 83 AEPKQYTGAGMTRMVMSIAAAQGNLQDAAKFASDELNDQSVSMA-ISTAAGSGGVLIPQNIHSEVIELLRDRTIVRKLGA 161 (428) T ss_pred cccchhhhHHHHHHHHHHHHhhhhHHHHHHHhhhhhhhhhHhhh-hcccccCCccccchhHHHHHHHHHhhhchhhhhcc Confidence 11 1111111111 111111111 12222 234455678899999999999999999999999 7 Q ss_pred eeeecCCCceEEEEEeCCcceeeecccccccccccceeeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHH Q lcl|NC_011614. 62 KYEPMEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKK 141 (324) Q Consensus 62 ~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~ 141 (324) +++++.++.+.+|+.++.+.+.|++||+.+|+++++|+++++.++|++++++||+|+++||.++++++|.+.|+++++++ T Consensus 162 ~~~~~~~g~~~~p~~~~~~~a~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~v~is~ell~ds~~~l~~~i~~~l~~ai~~~ 241 (428) T protein:vir:10 162 RSIPLPNGNMSLPRLAGGATASYTGENQDAKVSEARFDDVKLTAKTMIAMVPISNALIGRAGFNVEQLVLQDILTAISVR 241 (428) T ss_pred eeeecCCcceEEEEEeCCcceeeeccCccccccccceeeEEeeeEEEEEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHH Confidence 78999888899999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHhccCcCcCCcccccccccccce----eecccchhHHHH------HHHHhhhhccCCCEEEEcHHHHHHHHHhhc Q lcl|NC_011614. 142 FDEAGILNQGNNPFGKSIAQSIEKTNKV----IKGDFTQDNIID------LEALLEDDELEANAFISKTQNRSLLRKIVD 211 (324) Q Consensus 142 ~d~a~l~g~g~~~~~~~~~~~~~~~~~~----~~~~~~~~~i~~------~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d 211 (324) +|+++|+|+|++..|.|+.......... .....+++.+.. 
+.......+..++.|+|||.++.+|++++| T Consensus 242 ~d~~~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd 321 (428) T protein:vir:10 242 EDKAFMRDDGTGDTPIGMKARATQWNRLLPWAADAAVNLDTIDTYLDSIILMSMDGNSNMISSGWGMSNRTYMKLFGLRD 321 (428) T ss_pred HHHHHhccCCCCccccccccccccccccccccccccccHHHHHHHHHHHHHhhhccccccccCEEEEcHHHHHHHHHhhc Confidence 9999999999988888887655433221 112233333322 333444556677899999999999999999 Q ss_pred cCCceeeccCCCceecccceEeecCccC------CCceEEEeecccEEEEEecceEEEEeecccccccccccccchhhhh Q lcl|NC_011614. 212 PETKERIYDRNSDSLDGLPVVNLKSSNL------KRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFE 285 (324) Q Consensus 212 ~~g~~~~~~~~~~~l~G~pv~~~~~~~~------~~~~i~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~ 285 (324) ++|+|+|.+..+++|+|+||++++..+. +...+++|||++++++.++++++++++++. +.+..+..+++|+ T Consensus 322 ~~G~~i~~~~~~g~l~G~pv~~~~~~p~~~~~~~~~~~i~~gd~s~~~i~~~~~i~i~~~~~~~---~~~~~~~~~~~f~ 398 (428) T protein:vir:10 322 GNGNKVYPEMAQGMLKGYPIQRTSAIPANLGEGGKESEIYFADFNDVVIGEDGNMKVDFSKEAS---YIDTDGKLVSAFS 398 (428) T ss_pred cCCceeccCCCCCeeeceeeEEeccccccccCCCccceEEEEecceEEEEEecceEEEeecccc---cccccccccchhh Confidence 9999999888888999999998765432 356799999999999999999999999875 4455667778999 Q ss_pred cCcEEEEEEEEeccEEecccceEEEEeecc Q lcl|NC_011614. 
286 QDMVALRATMHVALHIADDKAFAKLVPADA 315 (324) Q Consensus 286 ~~~v~~r~~~r~d~~v~~~~a~~~l~~~~~ 315 (324) +|+++||+++||||++.+|+||++++...| T Consensus 399 ~~~~~~R~~~r~d~~v~~p~a~~~~t~~~~ 428 (428) T protein:vir:10 399 RNQSLIRVVTEHDIGFRHPEGLVLGTGVLF 428 (428) T ss_pred cchhheeeeeeeCceeeccceEEEEeccCC Confidence 999999999999999999999999999999 No 17 >protein:vir:104085 Length: 320 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655596;genbank:gi:109392467;genbank:GeneID:4156953 Probab=100.00 E-value=4.7e-60 Score=345.83 Aligned_cols=304 Identities=18% Similarity=0.239 Sum_probs=257.7 Q ss_pred HhhccchhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCceEEEEEeCCcceeeecccccccc Q lcl|NC_011614. 14 FASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKPGAYWVGEGQKIET 93 (324) Q Consensus 14 ~~~~~~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~ 93 (324) -++......+.++...++++++|++||++++++|++.+++.+++++++++++++++..+||+.++.+++.|++|++++|+ T Consensus 1 ~~~~~~~~~~~~~~~~t~~~~~~~~ip~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~ 80 (320) T protein:vir:10 1 MAAGTAFQVDHAQIAQTGDTMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKVPMGTTGQKIPHWIGDVSAQWIGEGDMKPI 80 (320) T ss_pred CCCCccCCHHHHHhhccccccccccccHHHHHHHHHHHHhccchhhhcceeeccCCceEEEEEeCCcceEEecCCccccc Confidence 11222122333444456667778899999999999999999999999999999999999999999999999999999999 Q ss_pred cccceeeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhccCcCcCCcccccccccccce---- Q lcl|NC_011614. 94 SKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKV---- 169 (324) Q Consensus 94 ~~~~~~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~~~~~~~~~~~~~~~~~---- 169 (324) ++++|++++++++|++++++||+|++++|.++++++|.++|++++++++|+++|+|+|++... ++.......... 
T Consensus 81 ~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~l~~a~a~~~d~a~l~G~g~~~~~-~~~~~~~~~~~~~~~~ 159 (320) T protein:vir:10 81 TKGNMTSQNIAPHKIATIFVASAETVRANPANYLGTMRTKVATAFAMAFDSAALNGTDSPFPT-YLAQTTKSVSLADPGG 159 (320) T ss_pred cccceeEEEEeeEEEEEeehhhHHHHhcChHHHHHHHHHHHHHHHHHHHHHHhhcccCCCCCc-ccccccccccceeccc Confidence 999999999999999999999999999999999999999999999999999999999875432 222222222111 Q ss_pred -eecccc-h-hHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCceeeccC---------CCceecccceEeecCc Q lcl|NC_011614. 170 -IKGDFT-Q-DNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDR---------NSDSLDGLPVVNLKSS 237 (324) Q Consensus 170 -~~~~~~-~-~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~g~~~~~~~---------~~~~l~G~pv~~~~~~ 237 (324) ..+..+ . +++.++...+...+..+++|+|||+++.+|+++||++|++++... ..++++|+||++++.. T Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~~~~~~i~g~pv~~~~~~ 239 (320) T protein:vir:10 160 ATASDLTAYDAVAVNGLSLLVNAKKKWTHTLLDDIVEPILNGAKDKNGRPLFIESTYTDENSPFRAGRIVSRPTILSDHV 239 (320) T ss_pred ccccccccHHHHHHHHHhhhhcccCCCcEEEEcHHHHHHHHHhhccCCceeeccccccCccccccCceeeeeeeEecCCC Confidence 112222 2 346788888889999999999999999999999999999998642 1357899999998877 Q ss_pred cCCCceEEEeecccEEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeeccCC Q lcl|NC_011614. 238 NLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADAKP 317 (324) Q Consensus 238 ~~~~~~i~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~~~~ 317 (324) +.++..+++|||++++++.+++++++++++.......+.++.++++|++|++++|+++|+|+++.+|+||++|+++++.+ T Consensus 240 ~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~~~d~~v~~~~a~~~l~~~~ap~ 319 (320) T protein:vir:10 240 ADGTTVGYMGDFRNVIWGQVGGLSFDVTDQATLNLGTPTEPNFVSLWQHNLVAVRVEAEYAFHNNDKDAFVKLTNVVTPD 319 (320) T ss_pred CCCceEEEEeecceEEEEEecCeEEEEeecceeeeccccccccchhhhcCcEEEEEEEeeccEEecccceEEEEeccCCC Confidence 77777789999999999999999999999999999999999999999999999999999999999999999999888655 Q ss_pred C Q lcl|NC_011614. 
318 S 318 (324) Q Consensus 318 ~ 318 (324) + T Consensus 320 ~ 320 (320) T protein:vir:10 320 A 320 (320) T ss_pred C Confidence 5 No 18 >protein:vir:4226 Length: 326 # NCBI annotation: observed 35.2Kd protein # Family: family:all:507 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039681;swissprot:sw:q05223;genbank:gi:9625447;uniprot:Q05223;genbank:GeneID:2942929 Probab=100.00 E-value=4.2e-60 Score=346.09 Aligned_cols=309 Identities=19% Similarity=0.269 Sum_probs=259.0 Q ss_pred CchhhHHHHHHHHHhhccchhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCceEEEEEeCCc Q lcl|NC_011614. 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKP 80 (324) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~ 80 (324) |--+. .+ +.+-+...+.|+.++ +++++|++||++++++|++.+++.+++++++++++++++..++|+.++.+ T Consensus 1 ~~~~~-~r------~~~~~~~~e~~a~~~-~~~~~g~~ip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~p~~~~~~ 72 (326) T protein:vir:42 1 MAVNP-DR------TTPFLGVNDPKVAQT-GDSMFEGYLEPEQAQDYFAEAEKISIVQQFAQKIPMGTTGQKIPHWTGDV 72 (326) T ss_pred CCCCc-cc------hhhhcCcchhhheec-cccCCcceechhhHHHHHHHHHhcchhhhhcceeeccCCceEEEEEeCCc Confidence 21111 10 011122335555544 44556779999999999999999999999999999999999999999999 Q ss_pred ceeeecccccccccccceeeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhccCcCcCCcccc Q lcl|NC_011614. 81 GAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIA 160 (324) Q Consensus 81 ~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~~~~~~~~ 160 (324) .+.|++||+++|+++++|++++++++|++++++||+|++++|.++++++|.++|++++++++|+++|+|+|++ .+.++. 
T Consensus 73 ~a~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~v~iS~ell~~s~~~~~~~i~~~l~~a~~~~~d~a~l~G~gs~-~p~gi~ 151 (326) T protein:vir:42 73 SASWIGEGDMKPITKGNMTSQTIAPHKIATIFVASAETVRANPANYLGTMRTKVATAFAMAFDNAAINGTDSP-FPTFLA 151 (326) T ss_pred ceEEecCCccccccccceeEEEEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHhhcccCCC-cccccc Confidence 9999999999999999999999999999999999999999999999999999999999999999999999975 344444 Q ss_pred cccccccc------eeecccchhH--HHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCceeeccCC---------C Q lcl|NC_011614. 161 QSIEKTNK------VIKGDFTQDN--IIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDRN---------S 223 (324) Q Consensus 161 ~~~~~~~~------~~~~~~~~~~--i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~g~~~~~~~~---------~ 223 (324) ........ ......++.+ +.++...+...++.++.|+|||+++.+|+++||++|+|+|.+.. . T Consensus 152 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~~~~ 231 (326) T protein:vir:42 152 QTTKEVSLVDPDGTGSNADLTVYDAVAVNALSLLVNAGKKWTHTLLDDITEPILNGAKDKSGRPLFIESTYTEENSPFRL 231 (326) T ss_pred ccccccceeecccccccccchhHHHHHHHHHhhhhhhccCccEEEEeHHHHHHHHHhhccCCceeeccccccCccccccC Confidence 33322111 1122233333 34566667777888899999999999999999999999986532 3 Q ss_pred ceecccceEeecCccCCCceEEEeecccEEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEec Q lcl|NC_011614. 
224 DSLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIAD 303 (324) Q Consensus 224 ~~l~G~pv~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~ 303 (324) ++++|+||++++..+.++..+++|||++++++.+++++++++++..+....+.++.++++|++|++++|+++|+|+++.+ T Consensus 232 ~~l~G~pv~~~~~~~~~~~~~~~Gd~s~~~~~~~~~~~v~~~~e~~~~~~~~~~~~~~~~~~~d~~~~r~~~~~d~~v~~ 311 (326) T protein:vir:42 232 GRIVARPTILSDHVASGTVVGYQGDFRQLVWGQVGGLSFDVTDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYAFHCND 311 (326) T ss_pred ceeeeeeEEEcCCCCCCceEEEEeecceEEEEEecceEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEec Confidence 46999999999888888888899999999999999999999999999988899999999999999999999999999999 Q ss_pred ccceEEEEeeccCCC Q lcl|NC_011614. 304 DKAFAKLVPADAKPS 318 (324) Q Consensus 304 ~~a~~~l~~~~~~~~ 318 (324) |+||++|+++++.++ T Consensus 312 ~~a~~~l~~~~~~~~ 326 (326) T protein:vir:42 312 KDAFVKLTNVDATEA 326 (326) T ss_pred ccceEEEeeccccCC Confidence 999999999999999 No 19 >protein:vir:80376 Length: 435 # NCBI annotation: gp6, major capsid head protein # Family: family:all:21 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111085;genbank:gi:134288639;genbank:GeneID:4960624 Probab=100.00 E-value=1.4e-59 Score=343.25 Aligned_cols=314 Identities=15% Similarity=0.190 Sum_probs=254.8 Q ss_pred CchhhHHHHHHHHHh--------------hccchhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhh-ceeee Q lcl|NC_011614. 1 MEQTQKLKLNLQHFA--------------SNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQL-GKYEP 65 (324) Q Consensus 1 m~~~~~~~~~~~~~~--------------~~~~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l-~~~~~ 65 (324) ..+...+....+... 
+........++.+..++..||.+||+++.++|++.+++.++++++ +++++ T Consensus 92 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~lvP~~~~~~ii~~l~~~~~i~~~~~~~v~ 171 (435) T protein:vir:80 92 EVKGAKMARMVRALAAARGDAQLASKLAIERGFGEEVAMSLNTLSPGAGGVLVPENLSSEVIELLRPKSVVRKLGARTLP 171 (435) T ss_pred hhhHHHHHHHHHHHHhccchhHHHHHHHHhhhhhhhhhhhhcccCCCCCccccchhHHHHHHHHHhhhchhhhccceeee Confidence 111112222221111 111112222334455666788899999999999999999999998 78899 Q ss_pred cCCCceEEEEEeCCcceeeecccccccccccceeeEEeeeeeEEEeehhHHHHHhcCh--hHHHHHHHHHHHHHHHHHHH Q lcl|NC_011614. 66 MEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTY--SQFFEEMKPMIAEAFYKKFD 143 (324) Q Consensus 66 ~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~v~iS~ell~~s~--~~~~~~v~~~l~~ai~~~~d 143 (324) +.++..++|+.++.+.+.|++|++.+|+++++|+++++.++|++++++||+|+++||. ++++++|.++|++++++++| T Consensus 172 ~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~f~~i~~~~~k~~~~~~is~ell~ds~~~~~l~~~i~~~l~~a~~~~~d 251 (435) T protein:vir:80 172 LSNGNITIPRLKGGAIVGYIGADTDIPTTQQQFDDLKLTAKKMAALVPIANDLIKYAGVNPNVDQIVVGDLTAAIGARED 251 (435) T ss_pred cCCCceEEEEEeCCcceeeeccCccccccccceeeEEEeeEEEEEeehhhHHHHHhhcccHHHHHHHHHHHHHHHHHHHH Confidence 9999999999999999999999999999999999999999999999999999999995 47999999999999999999 Q ss_pred HHHHhccCcCcCCcccccccccccce-eec----ccchhHHHHHHHHhhhh--ccCCCEEEEcHHHHHHHHHhhccCCce Q lcl|NC_011614. 144 EAGILNQGNNPFGKSIAQSIEKTNKV-IKG----DFTQDNIIDLEALLEDD--ELEANAFISKTQNRSLLRKIVDPETKE 216 (324) Q Consensus 144 ~a~l~g~g~~~~~~~~~~~~~~~~~~-~~~----~~~~~~i~~~~~~l~~~--~~~~~~~v~~~~~~~~L~~l~d~~g~~ 216 (324) .++|+|+|++..|.++.......... .+. ...+.++.+++..+... 
++.+++|+|||.++.+|++++|++|+| T Consensus 252 ~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~~G~~ 331 (435) T protein:vir:80 252 KAFIRDDGTANTPKGLRFWALPGNVITASDGSTLQKIETDLGKAILALENADANLTQPGWIMAPRTFRFLEGLRDGNGNK 331 (435) T ss_pred HHhhccCCCCCcccceeecccccceeecccccchhhHHHHHHHHHHHhhccccccccCEEEEcHHHHHHHHhhhccCCce Confidence 99999999887777776654332221 111 22345667777776554 566789999999999999999999999 Q ss_pred eeccCCCceecccceEeecCccC------CCceEEEeecccEEEEEecceEEEEeecccccccccccccchhhhhcCcEE Q lcl|NC_011614. 217 RIYDRNSDSLDGLPVVNLKSSNL------KRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVA 290 (324) Q Consensus 217 ~~~~~~~~~l~G~pv~~~~~~~~------~~~~i~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~ 290 (324) +|....+++|+|+||++++..+. +...+++|||++++++.++++++++++++.. .+..+..+++|++|+++ T Consensus 332 l~~~~~~~~l~G~pv~~~~~~p~~~~~~~~~~~i~~gd~s~~~i~~~~~~~i~~~~~~~~---~~~~~~~~~~f~~n~~~ 408 (435) T protein:vir:80 332 VYPELANGMLKGYPVGKTTQVPINLGEAGKESEIYFTDFGDVFIGEEETLEIDYSKEATY---KDADGHMVSAFQRDQTL 408 (435) T ss_pred eccCCCCCeEeeeeeEEeccccccccCCCCcceEEEEEcccEEEEeecceEEEEeccccc---cccccchhhhhhcCcce Confidence 99877888999999998765432 3457999999999999999999999999853 45666788999999999 Q ss_pred EEEEEEeccEEecccceEEEEeeccCC Q lcl|NC_011614. 291 LRATMHVALHIADDKAFAKLVPADAKP 317 (324) Q Consensus 291 ~r~~~r~d~~v~~~~a~~~l~~~~~~~ 317 (324) ||++.|+||++.+|+||++|++..|.. 
T Consensus 409 ~r~~~r~d~~~~~~~a~~~l~~~~~~~ 435 (435) T protein:vir:80 409 IRVIAKNDFGPRHVESIAVLSGVAWGA 435 (435) T ss_pred eeeeeeeCcEeecccceEEEeccCCCC Confidence 999999999999999999999999988 No 20 >protein:vir:485 Length: 407 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543092;swissprot:trembl:q8w627;genbank:gi:18249904;uniprot:Q8W627;genbank:GeneID:929693 Probab=100.00 E-value=1.6e-59 Score=342.90 Aligned_cols=305 Identities=13% Similarity=0.072 Sum_probs=257.9 Q ss_pred CchhhHHHHHHHHHhhccchhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCceEEEEEeCCc Q lcl|NC_011614. 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKP 80 (324) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~ 80 (324) .+....+...+|+.........+.++....++.+||.+||++++++|++.+++.++++++++++++.++.+.+|+..+++ T Consensus 80 ~e~~~a~~~~l~~g~~~~~~~~e~~a~~~~t~~~gG~~iP~~~~~~I~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~ 159 (407) T protein:vir:48 80 SEHKEAFIGFMRKGREDGLRELERKALQVGNDEDGGYAIPEELDRTILTLLKDEVVMRQEATVITLGGSDYKKLVNLGGT 159 (407) T ss_pred hHHHHHHHHHHhccchhhhhHHHHHhhhcccCCCCcccccHhHHHHHHHHHHhhhhhhhhceeeecCCCceEEEEecCCc Confidence 22222233334444445555667788877888889999999999999999999999999999999999999999999999 Q ss_pred ceeeeccccccccc-ccceeeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhccCcCcCCccc Q lcl|NC_011614. 
81 GAYWVGEGQKIETS-KATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSI 159 (324) Q Consensus 81 ~a~~v~Eg~~~~~~-~~~~~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~~~~~~~ 159 (324) .+.|++|++.+|++ .++|+++++.++|++++++||+|+++||.++++++|.++|++++++++|.++++|+|++ .|.|+ T Consensus 160 ~a~~v~E~~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~~~a~l~G~G~~-~p~Gi 238 (407) T protein:vir:48 160 TSGWVGETDARPETATSKLGLIEPFMGEIYGNPQATQKMLDDAFFNVEDWINSELALEFAEQEEIAFTSGDGSK-KPKGF 238 (407) T ss_pred ceeeecccccccccccccceeEEeeeeeeEeehhhHHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhhccCCCC-cccee Confidence 99999999999976 57999999999999999999999999999999999999999999999999999999985 45665 Q ss_pred ccccccc--------------cceeecccchhHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCceeecc----C Q lcl|NC_011614. 160 AQSIEKT--------------NKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD----R 221 (324) Q Consensus 160 ~~~~~~~--------------~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~g~~~~~~----~ 221 (324) +...... ....++.++++++++++.+|..+|+.+++|+||+.++..|++++|++|+|+|.+ + T Consensus 239 l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~i~~l~~~l~~~~~~~a~~v~n~~~~~~L~~lkD~~Gr~l~~~~~~~g 318 (407) T protein:vir:48 239 LAYESTDEDDKTRAFGKLQHIASGAASGVTADAIIKLIYTLRKAHRSGAKFMMNNSSLFAIRLLKDNDGNYLWRPGIELG 318 (407) T ss_pred eecccccccccccccccccccccccccccChHHHHHHHHhhchhhhcCCEEEEcHHHHHHHHHhhccCCceeeccCcCCC Confidence 5432211 122345678999999999999999999999999999999999999999999854 3 Q ss_pred CCceecccceEeecCcc---CCCceEEEeeccc-EEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEe Q lcl|NC_011614. 222 NSDSLDGLPVVNLKSSN---LKRGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHV 297 (324) Q Consensus 222 ~~~~l~G~pv~~~~~~~---~~~~~i~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~ 297 (324) .+++|+|+||++++..+ .+...+++|||+. 
+.++.+.++++..++ +|++|++.||++.|+ T Consensus 319 ~~~~l~G~PV~~~~~~p~~~~~~~~i~~Gd~~~~~~i~~~~~~~i~~d~----------------~~~~~~~~~~~~~r~ 382 (407) T protein:vir:48 319 QPSSLAGYGIVENEQMPDIAADAKAIAFGNFKRGYTIVDRIGTRILRDP----------------YTNKPFVGFYTTKRT 382 (407) T ss_pred CCceecceeeEEecCcCCccCCccEEEEEeccccEEEEEeeceEEEeec----------------cccCCcEEEEEEEEe Confidence 45689999999886543 3456689999985 778889988876542 467899999999999 Q ss_pred ccEEecccceEEEEeeccCCCCccc Q lcl|NC_011614. 298 ALHIADDKAFAKLVPADAKPSSVPG 322 (324) Q Consensus 298 d~~v~~~~a~~~l~~~~~~~~~~~~ 322 (324) |+++.+|+||++++.++++++.+-+ T Consensus 383 d~~v~~~~a~~~l~~~aa~~~~~~~ 407 (407) T protein:vir:48 383 GGMLVDSQAIKLMKIGAATRQKAAA 407 (407) T ss_pred ccEEecccceEEEEeeccCCCCCCC Confidence 9999999999999999999988888 No 21 >protein:vir:80684 Length: 315 # NCBI annotation: gp6 # Family: family:all:966 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285582;genbank:gi:148727088;genbank:GeneID:5247055 Probab=100.00 E-value=1.4e-59 Score=343.26 Aligned_cols=292 Identities=17% Similarity=0.148 Sum_probs=247.1 Q ss_pred ccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCceEEEEEeCCcceeeecccccccccccceeeEEeeee Q lcl|NC_011614. 27 DNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAF 106 (324) Q Consensus 27 ~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~ 106 (324) +...++++||++||++++.+|++.+++.+++++++++++++++..+||++++.+.|+|++||+++++++++|++++++++ T Consensus 1 Ma~~~~~~gg~~vP~~~~~~ii~~l~~~s~i~~l~~~i~~~~~~~~ip~~~~~~~a~wv~Eg~~~~~s~~~f~~v~l~~~ 80 (315) T protein:vir:80 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSGVPRAKIVGEGEVKPSASVDVSAFTAQPI 80 (315) T ss_pred CCCCcCCcCceEcchHHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCcceEEeeCCccccccccceeeeEeeee Confidence 44456678999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eEEEeehhHHHHHhcChhH----HHHHHHHHHHHHHHHHHHHHHHhccCc--CcCCccccccccccc-ceeecccchhHH Q lcl|NC_011614. 
107 KLGVILPVTKEFLNYTYSQ----FFEEMKPMIAEAFYKKFDEAGILNQGN--NPFGKSIAQSIEKTN-KVIKGDFTQDNI 179 (324) Q Consensus 107 k~~~~v~iS~ell~~s~~~----~~~~v~~~l~~ai~~~~d~a~l~g~g~--~~~~~~~~~~~~~~~-~~~~~~~~~~~i 179 (324) |++++++||+|+++++..+ ++++|.+.|++++++++|.++|+|++. +..+.++........ ........++++ T Consensus 81 kl~~~~~iS~ell~~s~~~~~~~l~~~i~~~la~ai~~~~d~a~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~ 160 (315) T protein:vir:80 81 KVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKAASAVHTSLNKTKNIVDATDSATADL 160 (315) T ss_pred eEEeeehhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHhhheeeccCCCCCccccccccccccccceeeccccchHHH Confidence 9999999999999988765 789999999999999999999999763 333333333332222 223345568899 Q ss_pred HHHHHHhhhh-ccCCCEEEEcHHHHHHHHHhhccCCc-----eeec---cCCCceecccceEeecCccC-------CCce Q lcl|NC_011614. 180 IDLEALLEDD-ELEANAFISKTQNRSLLRKIVDPETK-----ERIY---DRNSDSLDGLPVVNLKSSNL-------KRGE 243 (324) Q Consensus 180 ~~~~~~l~~~-~~~~~~~v~~~~~~~~L~~l~d~~g~-----~~~~---~~~~~~l~G~pv~~~~~~~~-------~~~~ 243 (324) .+++.++..+ +..+++|+|||.++..|+++++.+|+ +++. .+.+++|+|+||++++..+. ++.. T Consensus 161 ~~~~~~~~~~~~~~~~~~imn~~~~~~L~~l~~~~g~~~~g~~~~~~~~~g~~~tl~G~PV~~~~~~~~~~~~~~~~~~~ 240 (315) T protein:vir:80 161 VKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEMSPASGVK 240 (315) T ss_pred HHHHHHHhhccCccceEEEEcHHHHHHHHHHhhccCCcccccccccccccCCCceecceeeEecCcCCcccccccccccE Confidence 9999888654 45567899999999999999877664 4443 23456899999998775542 3456 Q ss_pred EEEeecccEEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeeccCCCCcccc Q lcl|NC_011614. 244 LITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADAKPSSVPGE 323 (324) Q Consensus 244 i~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~~~~ 323 (324) +++|||++++++.+++++++++++.. 
.++..+++|++|+++||++.|+|+++.+|+||++|+.+++.....|+| T Consensus 241 ~~~GDfs~~~~g~~~~~~i~i~~~~~------~~~~~~~~~~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~a~~~~~~~~ 314 (315) T protein:vir:80 241 AIVGDFSRVHWGFQRNFPIELIEYGD------PDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAPKPNPPAE 314 (315) T ss_pred EEEeecccEEEEEecCeeEEEecccc------ccCcccchhhcCcEEEEEEEEecceeecccceEEEeeccCCCCCCCCC Confidence 89999999999999999999998753 455678899999999999999999999999999999999999999999 Q ss_pred C Q lcl|NC_011614. 324 V 324 (324) Q Consensus 324 ~ 324 (324) - T Consensus 315 ~ 315 (315) T protein:vir:80 315 N 315 (315) T ss_pred C Confidence 9 No 22 >protein:vir:1433 Length: 435 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536362;genbank:gi:17975167;genbank:GeneID:929171 Probab=100.00 E-value=2.5e-59 Score=341.83 Aligned_cols=314 Identities=15% Similarity=0.198 Sum_probs=254.6 Q ss_pred Cchhh-HHHHHHHHHh--------------hccchhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhh-ceee Q lcl|NC_011614. 1 MEQTQ-KLKLNLQHFA--------------SNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQL-GKYE 64 (324) Q Consensus 1 m~~~~-~~~~~~~~~~--------------~~~~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l-~~~~ 64 (324) ++..+ .+..+.+... .........++.+..++..||.+||+++.++|++.+++.++++++ ++.+ T Consensus 91 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~l~~~~~i~~~~~~~~ 170 (435) T protein:vir:14 91 LEVKGAKMARMVRALAAARGDAQLASKLAIERGFGEEVAMSLNTLSPGAGGVLVPENLSSEVIELLRPKSVVRKLGARTL 170 (435) T ss_pred hhhhHHHHHHHHHHHHhhcchhhHHHHHHHhhhhhhhhhhhcccCCcCCCccccchhHHHHHHHHHhhhchhhhhcceee Confidence 11111 1111111111 011112223344556667788899999999999999999999998 7789 Q ss_pred ecCCCceEEEEEeCCcceeeecccccccccccceeeEEeeeeeEEEeehhHHHHHhcCh--hHHHHHHHHHHHHHHHHHH Q lcl|NC_011614. 
65 PMEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTY--SQFFEEMKPMIAEAFYKKF 142 (324) Q Consensus 65 ~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~v~iS~ell~~s~--~~~~~~v~~~l~~ai~~~~ 142 (324) ++.++...+|+.++.+.+.|++|++.+|+++++|+++++.++|++++++||+|+++|+. ++++++|.+.|++++++++ T Consensus 171 ~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~~~l~~~i~~~l~~ai~~~~ 250 (435) T protein:vir:14 171 PLSNGNITIPRLKGGAIVGYIGADTDIPTTQQQFDDLKLTAKKMAALVPIANDLIKYAGVNPNVDQIVVGDLTAAIGARE 250 (435) T ss_pred ecCCCceEEEEEeCCcceeeeccCccccccccceeEEEeeeEEEEEeehhhHHHHHhhccCHHHHHHHHHHHHHHHHHHH Confidence 99888999999999999999999999999999999999999999999999999999995 4699999999999999999 Q ss_pred HHHHHhccCcCcCCcccccccccccce-----eecccchhHHHHHHHHhhhh--ccCCCEEEEcHHHHHHHHHhhccCCc Q lcl|NC_011614. 143 DEAGILNQGNNPFGKSIAQSIEKTNKV-----IKGDFTQDNIIDLEALLEDD--ELEANAFISKTQNRSLLRKIVDPETK 215 (324) Q Consensus 143 d~a~l~g~g~~~~~~~~~~~~~~~~~~-----~~~~~~~~~i~~~~~~l~~~--~~~~~~~v~~~~~~~~L~~l~d~~g~ 215 (324) |.+|++|+|++..|.++.......... .+......++.+++..+... ++.+++|+|||.++.+|++++|++|+ T Consensus 251 d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~ 330 (435) T protein:vir:14 251 DKAFIRDDGTANTPKGLRFWALPSNVITASDASTLQKIETDLGKVILALENADANLTQPGWIMAPRTFRFLEGLRDGNGN 330 (435) T ss_pred HHHhhccCCCCccccceeecccccceeccccccchhhHHHHHHHHHHHhhhccccccCCEEEEcHHHHHHHHHhhccCCc Confidence 999999999987788876543322211 12223345677777777654 56678999999999999999999999 Q ss_pred eeeccCCCceecccceEeecCccC------CCceEEEeecccEEEEEecceEEEEeecccccccccccccchhhhhcCcE Q lcl|NC_011614. 216 ERIYDRNSDSLDGLPVVNLKSSNL------KRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMV 289 (324) Q Consensus 216 ~~~~~~~~~~l~G~pv~~~~~~~~------~~~~i~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v 289 (324) |+|.....++|+|+||++++..+. +...+++|||+.++++.+++++++++++.. 
+.+.++..+.+|++|++ T Consensus 331 ~l~~~~~~g~l~G~Pv~~~~~~p~~~~~~~~~~~i~~gd~s~~~i~~~~~~~~~~~~~~~---~~~~~~~~~~~f~~~~~ 407 (435) T protein:vir:14 331 KVYPELANGMLKGYPVGKTTQVPINLGETGKESEIYFTDFGDVFIGEEETLEIDYSKEAT---YKDADGHMVSAFQRDQT 407 (435) T ss_pred eeccCCCCCeeecceeEeeccccccccCCCccceEEEeecccEEEEEecccEEEEecccc---ccccccchhhhhhcChh Confidence 999877888999999998765432 445799999999999999999999999875 44556677889999999 Q ss_pred EEEEEEEeccEEecccceEEEEeeccCC Q lcl|NC_011614. 290 ALRATMHVALHIADDKAFAKLVPADAKP 317 (324) Q Consensus 290 ~~r~~~r~d~~v~~~~a~~~l~~~~~~~ 317 (324) +||+++|+||++.+|+||++|++++|.. T Consensus 408 ~~r~~~r~d~~~~~~~a~~~l~~~~~~~ 435 (435) T protein:vir:14 408 LIRVIAKNDFGPRHVESIAVLAGVAWGA 435 (435) T ss_pred heeeeeeeCceeecccceEEEecCCCCC Confidence 9999999999999999999999998887 No 23 >protein:vir:9574 Length: 300 # NCBI annotation: gp40 # Family: family:all:966 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862879;genbank:gi:32469471;genbank:GeneID:1461316 Probab=100.00 E-value=4e-59 Score=340.76 Aligned_cols=282 Identities=15% Similarity=0.187 Sum_probs=242.3 Q ss_pred cccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCceEEEEEeCCcceeeecccccccccccceeeEEeeeee Q lcl|NC_011614. 28 NVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFK 107 (324) Q Consensus 28 ~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k 107 (324) +..+++++|.+||++++.+|++.+++.+++++++++++++++..++|+.++.+.|.|++||+++|+++++|++++++++| T Consensus 1 ma~~t~~~G~lip~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~~~p~~~~~~~a~wv~Eg~~~~~s~~~f~~v~l~~~k 80 (300) T protein:vir:95 1 MSEAQLSKGNLFNPELVTKVINKVKGHSSIAKLSPQKPIPFNGQREFVFDFDSDIDIVAENGKKTHGGVSLDPVTIVPLK 80 (300) T ss_pred CcccccCCcceechhhHHHHHHHHHhhhhhhhhcceeeccCCceEEEEEecCcceEEeeCCcccccccccceeeEeeeEE Confidence 45677778999999999999999999999999999999999889999999999999999999999999999999999999 Q ss_pred EEEeehhHHHHHh---cChhHHHHHHHHHHHHHHHHHHHHHHHhccCc----CcCCcccccc--cccccceeecccchhH Q lcl|NC_011614. 
108 LGVILPVTKEFLN---YTYSQFFEEMKPMIAEAFYKKFDEAGILNQGN----NPFGKSIAQS--IEKTNKVIKGDFTQDN 178 (324) Q Consensus 108 ~~~~v~iS~ell~---~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~----~~~~~~~~~~--~~~~~~~~~~~~~~~~ 178 (324) ++++++||+|+++ ++.++++++|.++|++++++++|.++|+|++. +..+.+.... .........+..++++ T Consensus 81 ~~~~~~iS~ell~~~~d~~~~l~~~i~~~l~~aia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (300) T protein:vir:95 81 VEYGARVSDEFLHASEEAKVDMLTDFVEGFSKKLARGLDIMSIHGINPRTKQASTIIGDNCFDKKVTQTVPFKDTNPDES 160 (300) T ss_pred EEEeehhhHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCCCCCcccccccccccccceeecccccchHHH Confidence 9999999999994 56789999999999999999999999999532 2222222221 1222222345677899 Q ss_pred HHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCceeecc----CCCceecccceEeecCccC----CCceEEEeecc Q lcl|NC_011614. 179 IIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD----RNSDSLDGLPVVNLKSSNL----KRGELITGDFD 250 (324) Q Consensus 179 i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~g~~~~~~----~~~~~l~G~pv~~~~~~~~----~~~~i~~gd~~ 250 (324) +.++..++...++.+++|+|||.++.+|++++|++|+|+|.. +.+++|+|+||++++..+. ++..+++|||+ T Consensus 161 i~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~~~~~GDf~ 240 (300) T protein:vir:95 161 MEDAVGMIDGSERDITGAILDPIFTTALSKMKNAEGGKLYPELAWGGVPDAINGLAVDKNRTVSYSQTDPKNTAIVGDFE 240 (300) T ss_pred HHHHHHHhhhcCCCccEEEECHHHHHHHHHhhccCCCeeccCccccCCCceecceeeEEecCCCCCCCCCccEEEEeecc Confidence 999999999999999999999999999999999999999853 4568999999999876543 34568889999 Q ss_pred cEE-EEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeecc Q lcl|NC_011614. 251 KLI-YGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADA 315 (324) Q Consensus 251 ~~~-~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~~ 315 (324) .++ ++.+++++++++++. 
+.++..+++|++|++++|+++|+||.+.+|+||++|++++- T Consensus 241 ~~~~~~~~~~~~~~v~~~~------~~d~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~g 300 (300) T protein:vir:95 241 TMFKWGYAKEVPMEIIKYG------DPDNSGRDLKGYNQIYIRCEAYIGWGIMDAASFARIVKTGG 300 (300) T ss_pred ceEEEEEecccEEEEeecc------CCCCcchhhhhcCcEEEEEEEeecceeecccceEEEecCCC Confidence 765 999999999998754 34566789999999999999999999999999999999765 No 24 >protein:vir:4456 Length: 401 # NCBI annotation: Major capsid protein precursor # Family: family:all:21 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700379;genbank:gi:23505451;genbank:GeneID:955658 Probab=100.00 E-value=1.1e-58 Score=338.37 Aligned_cols=298 Identities=14% Similarity=0.073 Sum_probs=251.3 Q ss_pred CchhhHHHHHHHHHhhccchhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCceEEEEEeCCc Q lcl|NC_011614. 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKP 80 (324) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~ 80 (324) ++.++.+..++|+.........+.++....++++||.+||++++++|++.+++.+++++++++++++++...+|+...++ T Consensus 81 ~e~~~a~~~~lr~~~~~~~~~~e~~a~~~~~~~~GG~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~ 160 (401) T protein:vir:44 81 AEHKDAFVGFLRKGREDGLRDLERKALQVGTDEDGGYAVPEELDRSILSLLKDEVVMRQEATVITVGGSDYKKLVNLGGT 160 (401) T ss_pred HHHHHHHHHHHhhhhhhhhHHHHHHHhhcCCCCCCceeccHhHHHHHHHHHHhhhhhhhhceeeecCCCceEEEEecCCc Confidence 33333344444444445555567777777777788999999999999999999999999999999999999999999999 Q ss_pred ceeeeccccccccc-ccceeeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhccCcCcCCccc Q lcl|NC_011614. 
81 GAYWVGEGQKIETS-KATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSI 159 (324) Q Consensus 81 ~a~~v~Eg~~~~~~-~~~~~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~~~~~~~ 159 (324) .+.|++|++.+|++ .++|+++++.++|++++++||+|+++||.++++++|.+.|++++++++|.++|+|+|++ .|.|+ T Consensus 161 ~a~wv~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~ai~~~~~~~~l~G~G~~-~p~Gi 239 (401) T protein:vir:44 161 ASGWVGETDTRSQTATSRLGLIEPFMGEIYGNPQATQKMLDDAFFNVEAWINSELATEFAEQEEIAFTTGDGTK-KPKGF 239 (401) T ss_pred cceeeccccccCccccccceeeeeehhheeeehhhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhhccCCCC-cccee Confidence 99999999999865 58999999999999999999999999999999999999999999999999999999984 55665 Q ss_pred ccccccc--------------cceeecccchhHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCceeecc----C Q lcl|NC_011614. 160 AQSIEKT--------------NKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD----R 221 (324) Q Consensus 160 ~~~~~~~--------------~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~g~~~~~~----~ 221 (324) +...... ....++.++++++++++..|..+|..+++|+||++++..|++++|++|+|+|.+ + T Consensus 240 l~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~d~i~~~~~~l~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~g 319 (401) T protein:vir:44 240 LAYESTEESDKARAFGKLQHIVSGEATAVTADAIIKLIYTLRKAHRTGAKFMMNNNSLFAIRLLKDTEGNYLWRPGLELG 319 (401) T ss_pred eccccccccccccccccccccccccccccCHHHHHHHHHhcchhhhcCCEEEEcHHHHHHHHHhhccCCceeecCCcCCC Confidence 5432211 112345578999999999999999999999999999999999999999999854 4 Q ss_pred CCceecccceEeecCcc---CCCceEEEeeccc-EEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEe Q lcl|NC_011614. 222 NSDSLDGLPVVNLKSSN---LKRGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHV 297 (324) Q Consensus 222 ~~~~l~G~pv~~~~~~~---~~~~~i~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~ 297 (324) .+++|+|+||++++..+ .+...+++|||+. 
|.++.+.++++..++ +|++|++.||++.|+ T Consensus 320 ~~~~l~G~PVv~~~~~p~~~~~~~~i~~Gd~~~~~~i~~~~~~~~~~~~----------------~~~~~~v~~~a~~r~ 383 (401) T protein:vir:44 320 QPSSLAGYGIAENEQMPDIAADAKAIAFGNFKRGYTIVDRIGTRILRDP----------------YTNKPFVGFYTTKRT 383 (401) T ss_pred CCceecceeeEEecCcCCccCCccEEEEeehhccEEEEEecceEEeeec----------------cccCCcEEEEEEEEe Confidence 55689999999886533 3456689999986 678889888876442 478999999999999 Q ss_pred ccEEecccceEEEEeecc Q lcl|NC_011614. 298 ALHIADDKAFAKLVPADA 315 (324) Q Consensus 298 d~~v~~~~a~~~l~~~~~ 315 (324) |+++.+|+||++++.+++ T Consensus 384 d~~~~~~~a~~~l~~~aa 401 (401) T protein:vir:44 384 GGMLVDSQAIKLLKIAAA 401 (401) T ss_pred ccEEecccceEEEEeecC Confidence 999999999999999988 No 25 >protein:vir:100247 Length: 425 # NCBI annotation: gp76 # Family: family:all:21 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355412;genbank:gi:77864702;genbank:GeneID:3725969 Probab=100.00 E-value=4.3e-58 Score=335.06 Aligned_cols=299 Identities=16% Similarity=0.132 Sum_probs=245.9 Q ss_pred Cchhh-HH-HHHHHHHhhccchhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCceEEEEEeC Q lcl|NC_011614. 1 MEQTQ-KL-KLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWAD 78 (324) Q Consensus 1 m~~~~-~~-~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~ 78 (324) .+... .. ....+.|.+.....++.++.+..++++||.+||++++++|++.+++.++++++|+++++.++..++|+..+ T Consensus 102 ~~~~~~~~~~~~~~af~~~l~~~e~~~al~~~t~~~gG~lvP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~~~~~~ 181 (425) T protein:vir:10 102 ANGVKPLRDPEYTEAFKAHVKRGDVQAALNKGEDSEGGYLTPIEWDRTITNKLVLISPMRQLCRVQPVSKAGFSKLFNMG 181 (425) T ss_pred cccccccccHHHHHHHHHHhhhhhhHHHhhcCcCCCCceeccHhHHHHHHHHHHhhhhhhhhceeeeccCCceEEEEEcC Confidence 11111 00 11122222222222344556667778889999999999999999999999999999999999999999999 Q ss_pred Ccceeeecccccccccc-cceeeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhccCcCcCCc Q lcl|NC_011614. 
79 KPGAYWVGEGQKIETSK-ATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGK 157 (324) Q Consensus 79 ~~~a~~v~Eg~~~~~~~-~~~~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~~~~~ 157 (324) .+.+.|++|++.+|+++ ++|+++++.++|++++++||+|+++|+.++++++|.++|++++++++|.++|+|+|++ .|. T Consensus 182 ~~~a~wv~E~~~~~~~~~~~f~~v~~~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~ai~~~~d~~~l~G~G~~-~p~ 260 (425) T protein:vir:10 182 GTTSGWVGEASQRPQTNAATFQPLSFASGEIYANPAATQQILDDAEIDLESWLATEVQTEFAKQEGKAFLAGDGTN-KPN 260 (425) T ss_pred CcceeeeccccccccccccccceeeeeheeeEeehHhHHHHHhcchhHHHHHHHHHHHHHHHHHHHhhhhcccCCC-Ccc Confidence 99999999999999875 7999999999999999999999999999999999999999999999999999999975 566 Q ss_pred cccccccccc--------------ceeecccchhHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCceeecc--- Q lcl|NC_011614. 158 SIAQSIEKTN--------------KVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD--- 220 (324) Q Consensus 158 ~~~~~~~~~~--------------~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~g~~~~~~--- 220 (324) |++....... ...++.++++++++++..+...|+.+++|+|||+++.+|++++|++|+|+|.+ T Consensus 261 Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~l~~l~~~l~~~~~~~a~~vmn~~~~~~L~~lkD~~G~~l~~~~~~ 340 (425) T protein:vir:10 261 GLLTYIAGGANAAKHPFGAIEVVNSGAAADITSDGIIDLVYDLPSAFTGNARFAMNRNTQRQVRKLKDGQGNYLWQPSYV 340 (425) T ss_pred eeeeccccccccccccccccccccccccccccHHHHHHHHhhhhhhhccCCEEEEchHHHHHHHHhhcCCCceeeccCcc Confidence 6655433211 22345678999999999999999999999999999999999999999999854 Q ss_pred -CCCceecccceEeecCcc---CCCceEEEeeccc-EEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEE Q lcl|NC_011614. 221 -RNSDSLDGLPVVNLKSSN---LKRGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATM 295 (324) Q Consensus 221 -~~~~~l~G~pv~~~~~~~---~~~~~i~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~ 295 (324) +.+++|+|+||++++..+ .+...+++|||+. ++++.+.++++..+. +|.+|++.||++. 
T Consensus 341 ~g~~~~l~G~PV~~~~~~p~~~~~~~~i~~Gd~~~~~~i~~~~~~~v~~d~----------------~~~~~~~~~~~~~ 404 (425) T protein:vir:10 341 AGQPATLAGYPVTEVPDMPDVAANSTPILFGDFQQTYLIIDRIGVRVLRDP----------------YTAKPYVLFYTTK 404 (425) T ss_pred CCCCceecceeeEEecCcCCccCCccEEEEEehhccEEEEEecceEEEecc----------------cccCCcEEEEEEE Confidence 455789999999986543 3456699999997 578888887764332 4779999999999 Q ss_pred EeccEEecccceEEEEeeccC Q lcl|NC_011614. 296 HVALHIADDKAFAKLVPADAK 316 (324) Q Consensus 296 r~d~~v~~~~a~~~l~~~~~~ 316 (324) |+|+++.+|+||++++.+++. T Consensus 405 r~d~~v~~~~A~~~l~~~as~ 425 (425) T protein:vir:10 405 RVGGGLLNPEPMRAMKVAASE 425 (425) T ss_pred EeccEeecccceEEEEeeccC Confidence 999999999999999999888 No 26 >protein:vir:9759 Length: 303 # NCBI annotation: putative structural protein # Family: family:all:966 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795521;genbank:gi:28876283;genbank:GeneID:1257824 Probab=100.00 E-value=6.9e-58 Score=333.95 Aligned_cols=281 Identities=18% Similarity=0.214 Sum_probs=240.4 Q ss_pred ccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCceEEEEEeCCcceeeecccccccccccceeeEEeeeeeE Q lcl|NC_011614. 29 VMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKL 108 (324) Q Consensus 29 ~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~ 108 (324) ..+.+++|.+||++++++|++.+++.+++++++++++++++..++|+.++++.|+|++|++++|+++++|++++++++|+ T Consensus 1 m~t~t~gg~liP~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~wv~E~~~~~~s~~~f~~v~l~~~kl 80 (303) T protein:vir:97 1 MGTETSKASLFDKHLVSDLINKVKGHSSLAKLSSQKPIPFNGSKEFTFTLDSDIDVVAENGKKTHGGLSLEPVTIVPIKV 80 (303) T ss_pred CcccCCCCeEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEecCcceEEeecCccccccccceeeEEeeeEEE Confidence 34556788999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEeehhHHHHHh---cChhHHHHHHHHHHHHHHHHHHHHHHHhccCcC----cC--Cccccccc-ccccceeecccchhH Q lcl|NC_011614. 
109 GVILPVTKEFLN---YTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNN----PF--GKSIAQSI-EKTNKVIKGDFTQDN 178 (324) Q Consensus 109 ~~~v~iS~ell~---~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~----~~--~~~~~~~~-~~~~~~~~~~~~~~~ 178 (324) ++++++|+|+++ ++.++++++|.++|++++++++|.++|+|+++. .. +....... .......++..++++ T Consensus 81 ~~~~~iS~ell~~~~d~~~~l~~~i~~~la~a~~~~ld~a~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (303) T protein:vir:97 81 EYGARLSDEFLYATEEEKIDILKAFNEGFAKKLARGIDLMAMHGINPRTKKASDVIGTNHFDSKVTQVVKFTESEDADAN 160 (303) T ss_pred EEeehhhHHHhhcCccchHHHHHHHHHHHHHHHHHHHHhhhhcccccCCccccccccccccccccccccccccccchHHH Confidence 999999999994 567899999999999999999999999996432 11 11122221 222233345678999 Q ss_pred HHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCceeeccC-----CCceecccceEeecCcc------CCCceEEEe Q lcl|NC_011614. 179 IIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDR-----NSDSLDGLPVVNLKSSN------LKRGELITG 247 (324) Q Consensus 179 i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~g~~~~~~~-----~~~~l~G~pv~~~~~~~------~~~~~i~~g 247 (324) +.+++.++..+++.++.|+|||+++.+|++++|++|+|++... .+++|+|+||++++..+ .+...+++| T Consensus 161 i~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~~g~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~~~~~~~G 240 (303) T protein:vir:97 161 IEAAVNLIQGAEGVVTGLAMDTEFSTALAKVTNGEMGPKMYPELAWGANPDSINGLKSSVNTTVGAGADEAESKDLVIIG 240 (303) T ss_pred HHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCCeEEecCccCCCCCceecceeeEEecccCCccccCCCccEEEEe Confidence 9999999999999999999999999999999999999998643 34689999999976543 245579999 Q ss_pred ecc-cEEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeecc Q lcl|NC_011614. 
248 DFD-KLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADA 315 (324) Q Consensus 248 d~~-~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~~ 315 (324) ||+ .+.++.+++++++++++ .+.++..+++|++|++++|++.|+|+++.+|+||++||++.- T Consensus 241 df~~~~~~~~~~~~~~~~~~~------~~~d~~~~~~~~~n~~~~r~~~r~~~~v~~p~af~~l~~~~~ 303 (303) T protein:vir:97 241 DFESMFKWGYAKQIPMEIIKY------GDPDNSGKDLKGYNQIYLRAEAYIGWGILDAKSFARVTKGEV 303 (303) T ss_pred eccccEEEEEecCcEEEEeec------cCCCCcchhhhhcCcEEEEEEEEeccEeecccceEEeeCCCC Confidence 996 46799999999999864 346677889999999999999999999999999999998655 No 27 >protein:vir:1638 Length: 298 # NCBI annotation: Structural protein # Family: family:all:966 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695059;genbank:gi:23455750;genbank:GeneID:955469 Probab=100.00 E-value=8.2e-58 Score=333.55 Aligned_cols=278 Identities=19% Similarity=0.194 Sum_probs=237.0 Q ss_pred ccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCceEEEEEeCCcceeeecccccccccccceeeEEeeeeeEEE Q lcl|NC_011614. 31 MHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGV 110 (324) Q Consensus 31 ~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~ 110 (324) +..++|.+||++++++|++.+++++++++++++++++++..++|+.++.++|+|++|++++|+++++|++++++++|+++ T Consensus 1 ma~~gG~lvp~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~~~~~~~f~~v~l~~~k~a~ 80 (298) T protein:vir:16 1 MVLNKGTLFDPTLVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQTMVPIKVEY 80 (298) T ss_pred CcccCcceechhHHHHHHHHHHhhhhhhhhcceeeccCCceEEEEEecCcceEEecCCccccccccceeEEEEeeeeEEE Confidence 55677899999999999999999999999999999999889999999999999999999999999999999999999999 Q ss_pred eehhHHHHHh---cChhHHHHHHHHHHHHHHHHHHHHHHHhccCc----CcCCccccccccc----ccceeecccchhHH Q lcl|NC_011614. 
111 ILPVTKEFLN---YTYSQFFEEMKPMIAEAFYKKFDEAGILNQGN----NPFGKSIAQSIEK----TNKVIKGDFTQDNI 179 (324) Q Consensus 111 ~v~iS~ell~---~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~----~~~~~~~~~~~~~----~~~~~~~~~~~~~i 179 (324) +++||+|+++ ++..+++++|.++|++++++++|.++|+|++. ...+.+....... +.........++++ T Consensus 81 ~~~iS~ell~~s~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i 160 (298) T protein:vir:16 81 GARISDEFMYASDEEKINILQEFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADPNGAI 160 (298) T ss_pred eehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHHhhccccCCCCcccccccccccccccccccccccccccHHHHH Confidence 9999999996 45578999999999999999999999999532 2222221111111 11112223447789 Q ss_pred HHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCceeecc----CCCceecccceEeecCcc----CCCceEEEeeccc Q lcl|NC_011614. 180 IDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD----RNSDSLDGLPVVNLKSSN----LKRGELITGDFDK 251 (324) Q Consensus 180 ~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~g~~~~~~----~~~~~l~G~pv~~~~~~~----~~~~~i~~gd~~~ 251 (324) .+++.++..+++.+++|+|||+++.+|++++|++|+|+|.+ +.+++|+|+||++++..+ .++..+++|||+. T Consensus 161 ~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~l~G~PV~~~~~v~~~~~~~~~~~~~GDfs~ 240 (298) T protein:vir:16 161 ENAVELLTGVDADVTGIAINPSFRSALAKQKDLQDNALFPELKWGATPDTINGLPVDVNKTVSDMSLTQRDRAIIGDFAN 240 (298) T ss_pred HHHHHHhhhcCCCccEEEEcHHHHHHHHHhhccCCCeeecCcccCCCCceecceeeEEecccccccCCCccEEEEeeccc Confidence 99999999999999999999999999999999999999864 345799999999886543 3456799999988 Q ss_pred E-EEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeec Q lcl|NC_011614. 252 L-IYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPAD 314 (324) Q Consensus 252 ~-~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~ 314 (324) + .++.+++++++++++. 
+.++..+++|++|++++|++.|+||++.+|+||++|++++ T Consensus 241 ~~~~~~~~~~~~~~~~~~------~~~~~~~~~f~~~~v~~ra~~r~d~~v~~~~a~~~l~~at 298 (298) T protein:vir:16 241 GFKWGYAKEVPLEVIQYG------DPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) T ss_pred eEEEEEecCceEEEeecc------CCcCcchhhhhcCcEEEEEEEEEccEeecccceEEEeecC Confidence 5 5899999999998765 3456778999999999999999999999999999999998 No 28 >protein:vir:8187 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817980;genbank:gi:29566414;genbank:GeneID:2700968 Probab=100.00 E-value=1.1e-57 Score=332.77 Aligned_cols=281 Identities=19% Similarity=0.182 Sum_probs=237.5 Q ss_pred ccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCceEEEEEeCCcceeeecccccccccccceeeEEeeeeeE Q lcl|NC_011614. 29 VMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKL 108 (324) Q Consensus 29 ~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~ 108 (324) ..+.++||.+||++++++|++.+++++++++++++++++++..++|++++.++|+|++||+++|+++++|+++++.++|+ T Consensus 1 mat~~~gg~lvP~~~~~~ii~~~~~~s~i~~~~~~i~~~~~~~~~p~~~~~~~a~wv~Eg~~~~~~~~~f~~v~l~~~kl 80 (311) T protein:vir:81 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTAPPRGEVVGEGAQKSESTATFAPVTAIPRKV 80 (311) T ss_pred CceecCCceEcchhHHHHHHHHHHhcchhhhhcceeecCCCceEEEEEeCCceeEEeecCcccccccceeeEEEEeeEEE Confidence 45666789999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEeehhHHHHHh---cChhHHHHHHHHHHHHHHHHHHHHHHHhccC--cCcCCccccccccccc----ceeec-ccchhH Q lcl|NC_011614. 109 GVILPVTKEFLN---YTYSQFFEEMKPMIAEAFYKKFDEAGILNQG--NNPFGKSIAQSIEKTN----KVIKG-DFTQDN 178 (324) Q Consensus 109 ~~~v~iS~ell~---~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g--~~~~~~~~~~~~~~~~----~~~~~-~~~~~~ 178 (324) +++++||+|+++ ++..+++++|.+++++++++++|.++|+|++ ++..+.++........ 
....+ ...+.+ T Consensus 81 ~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~a~l~G~~~~~~~~~~gi~~~~~~~~~~~~~~~~~~~~~~~~ 160 (311) T protein:vir:81 81 QVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTSATPDLA 160 (311) T ss_pred EEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHhhhccccCCCCcccccccccccccceeeeecccccchHHHH Confidence 999999999996 4557899999999999999999999999964 3333333333322221 11112 233456 Q ss_pred HHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCceeecc----CCCceecccceEeecCcc---------------- Q lcl|NC_011614. 179 IIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD----RNSDSLDGLPVVNLKSSN---------------- 238 (324) Q Consensus 179 i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~g~~~~~~----~~~~~l~G~pv~~~~~~~---------------- 238 (324) +.+++.++...+..+.+|+|||.++.+|+++||++|+|+|.. +.+++|+|+||++++..+ T Consensus 161 i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~tl~G~Pv~~~~~i~~~~~~~~~~~~~~~~~ 240 (311) T protein:vir:81 161 VEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPEAVTASTGVYRTT 240 (311) T ss_pred HHHHHHHhhhcCCCceEEEEcHHHHHHHHhhhccCCCeeecCccccCCCceecceeEEecccccccccccccccchhccc Confidence 777888888878888889999999999999999999999864 356799999999865332 Q ss_pred CCCceEEEeecccEEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeeccC Q lcl|NC_011614. 239 LKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADAK 316 (324) Q Consensus 239 ~~~~~i~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~~~ 316 (324) .++..+++|||++++++.+++++++++++.. . 
+..+++|++|++++|++.|+|+++.+|+||++|++++.+ T Consensus 241 ~~~~~~~~gDfs~~~i~~~~~~~~~~~~~~~------~-~~~~~~~~~~~v~~r~~~r~d~~v~~~~a~~~l~~a~~~ 311 (311) T protein:vir:81 241 NPNVKAIAGDFSAFRWGVQVSIPLELIEFGD------P-DGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) T ss_pred CCccEEEEEecccEEEEEeccceEEEeccCC------C-CcchhhhhcCcEEEEEEEEeccEeecccceEEEEeeccC Confidence 2455789999999999999999999987752 2 234688999999999999999999999999999998887 No 29 >protein:vir:94771 Length: 298 # NCBI annotation: major head protein # Family: family:all:966 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996706;genbank:gi:45597421;genbank:GeneID:2769044 Probab=100.00 E-value=1.5e-57 Score=332.16 Aligned_cols=278 Identities=18% Similarity=0.194 Sum_probs=238.1 Q ss_pred ccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCceEEEEEeCCcceeeecccccccccccceeeEEeeeeeEEE Q lcl|NC_011614. 31 MHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGV 110 (324) Q Consensus 31 ~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~ 110 (324) ++.++|.+||++++++|++.+++++++++++++++++++..++|+.++.++|.|++||+++|+++++|++++++++|+++ T Consensus 1 ma~~gG~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~f~~v~l~~~k~~~ 80 (298) T protein:vir:94 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQTMVPIKVEY 80 (298) T ss_pred CeeccccccChhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCcceEEeeCCccccccccceeEEEEeeeEEEE Confidence 45577899999999999999999999999999999999889999999999999999999999999999999999999999 Q ss_pred eehhHHHHHhc---ChhHHHHHHHHHHHHHHHHHHHHHHHhccC----cCcCCcccccccccc----cceeecccchhHH Q lcl|NC_011614. 
111 ILPVTKEFLNY---TYSQFFEEMKPMIAEAFYKKFDEAGILNQG----NNPFGKSIAQSIEKT----NKVIKGDFTQDNI 179 (324) Q Consensus 111 ~v~iS~ell~~---s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g----~~~~~~~~~~~~~~~----~~~~~~~~~~~~i 179 (324) +++||+|++++ +..+++++|.++|++++++++|.++|+|.+ +...+.+.......+ .........++++ T Consensus 81 ~~~iS~ell~~~~~~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i 160 (298) T protein:vir:94 81 GARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADPNGAI 160 (298) T ss_pred eeehhHHHhccCCccHHHHHHHHHHHHHHHHHHHHHHHhhcccccCCCcccccccccccccccccccccccccccHHHHH Confidence 99999999964 457899999999999999999999999843 233333222211111 1122234457899 Q ss_pred HHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCceeecc----CCCceecccceEeecCcc----CCCceEEEeeccc Q lcl|NC_011614. 180 IDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD----RNSDSLDGLPVVNLKSSN----LKRGELITGDFDK 251 (324) Q Consensus 180 ~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~g~~~~~~----~~~~~l~G~pv~~~~~~~----~~~~~i~~gd~~~ 251 (324) .+++.++..++..+++|+|||+++.+|++++|++|+|+|.+ +.+++|+|+||++++..+ .++..+++|||+. T Consensus 161 ~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~tl~G~PV~~~~~v~~~~~~~~~~~~~Gdfs~ 240 (298) T protein:vir:94 161 ENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKTVSDMSLTQRDRAIIGDFAN 240 (298) T ss_pred HHHHHhhhhcCCCccEEEEcHHHHHHHHHhhccCCCeeecCcccCCCCceecceeeEEecccccccCCCccEEEEeeccc Confidence 99999999999999999999999999999999999999864 456789999999887543 3456799999998 Q ss_pred E-EEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeec Q lcl|NC_011614. 252 L-IYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPAD 314 (324) Q Consensus 252 ~-~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~ 314 (324) + .++.+++++++++++. 
+.++..+++|++|++++|++.|+||.+.+|+||++|++++ T Consensus 241 ~~~~~~~~~~~~~~~~~~------~~d~~~~~~f~~~~v~~r~~~r~~~~~~~~~a~~~l~~~t 298 (298) T protein:vir:94 241 GFKWGYAKEVPLEVIQYG------DPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) T ss_pred eEEEEEecCceEEEeecC------CCcCcchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 6 4899999999998765 3456778899999999999999999999999999999998 No 30 >protein:vir:93616 Length: 645 # NCBI annotation: putative major head protein/prohead protease # Family: family:all:21 # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449293;genbank:gi:157166041;goa:Q6H9U8;interpro:IPR006433;uniprot:Q6H9U8;genbank:GeneID:5580438 Probab=100.00 E-value=5.2e-57 Score=329.14 Aligned_cols=315 Identities=14% Similarity=0.125 Sum_probs=244.2 Q ss_pred CchhhHHHHHHHHHhh------------------ccchhhhh-----ccccccccCCCcceechhhhHHHHHHHHhhcch Q lcl|NC_011614. 1 MEQTQKLKLNLQHFAS------------------NNVKPQVF-----NPDNVMMHEKKDGTLLNDFTTPILQEVMENSKI 57 (324) Q Consensus 1 m~~~~~~~~~~~~~~~------------------~~~~~~~~-----~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l 57 (324) +.|...+...++.... .......+ ++.+...+..||.++|+++..+|++.+++.+++ T Consensus 289 ~~kg~~f~~~~~al~~~~g~~~~a~e~a~~~~~~~~~~~~~~~~a~~~~~~~~~~~~Gg~~vp~~~~~~ii~~l~~~svv 368 (645) T protein:vir:93 289 LDKGIGFARFAKSLAAAKGVRSEALEVARRQYPDDSRLHHVLKSAVGAGTTTDPQWAGSLSEYQEYAQDFIDYLRPQTII 368 (645) T ss_pred hhhhhhHHHHHHHHHhcccchhHHHHHHHhhcccchhhhhhhhhhhhccccccccccCCccCchhhHHHHHHhhhhhhhH Confidence 2222222222222111 10101111 111112233466779999999999999999999 Q ss_pred hhhceeeecC----CCceEEEEEeCCcceeeecccccccccccceeeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHH Q lcl|NC_011614. 58 MQLGKYEPME----GTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPM 133 (324) Q Consensus 58 ~~l~~~~~~~----~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~ 133 (324) ++++....++ .+..++|+.++++.++|++||+.+|+++++|++++++++|+++++++|+||++||.++++++|.+. 
T Consensus 369 ~~l~~~~~~~~~~~~~~~~ip~~t~~~~a~wv~Eg~~~~~s~~~f~~v~l~~~kla~~~~iS~ell~ds~~~~~~~i~~~ 448 (645) T protein:vir:93 369 GRFGQGGIPALRQVPFNIRVHAQVSGGAAGWVGEGKTKPLTKFDFESITFSHAKVSAIAVLTEELIRFSSPAADALVRNA 448 (645) T ss_pred HhhccccccccccccCceeeeeeecCcceEEeccCccccccccceeEEEEeeEEEEEeehhHHHHHhhchHHHHHHHHHH Confidence 9997654332 235789999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHhccCcCc---CCcccccccccccceeecccchhHHHHHHHHhhhhcc--CCCEEEEcHHHHHHHHH Q lcl|NC_011614. 134 IAEAFYKKFDEAGILNQGNNP---FGKSIAQSIEKTNKVIKGDFTQDNIIDLEALLEDDEL--EANAFISKTQNRSLLRK 208 (324) Q Consensus 134 l~~ai~~~~d~a~l~g~g~~~---~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~--~~~~~v~~~~~~~~L~~ 208 (324) |++++++++|.++|+|++++. .|.++.... ..+.+...+..++.+++.++..++. ..++|+|||.++.+|++ T Consensus 449 l~~aia~~~d~a~l~g~g~~~~~~~p~gi~~~~---~~~~~~~~~~~d~~~~~~~~~~a~~~~~~a~~vmn~~~~~~L~~ 525 (645) T protein:vir:93 449 LAEAVVARLDTDFVDPKKAAVADVSPASITHDV---KGTASSGNPDADAEAAFGQFVAANLQPTGAVWLMSSTNALALSM 525 (645) T ss_pred HHHHHHHHHHHHhhcCCCcccCCccccceeccc---cccccccchHHHHHHHHHHHHhcCCCccccEEEEcHHHHHHHHh Confidence 999999999999999887653 344444332 2222334456788888888876643 35789999999999999 Q ss_pred hhccCCceeecc--CCCceecccceEeecCccCCCceEEEeecccEEEEEecceEEEEeecccccccc--------cccc Q lcl|NC_011614. 209 IVDPETKERIYD--RNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVK--------NEDG 278 (324) Q Consensus 209 l~d~~g~~~~~~--~~~~~l~G~pv~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~--------~~~~ 278 (324) ++|++|++++.. ..+++|+|+||++++..+ . .+++|||+.+++|.++++.+.+++++.+.... 
...+ T Consensus 526 lkd~~G~~~~~~~~~~~~tL~G~PV~~s~~vp--~-~~~~gd~s~~~ig~~~~v~i~~s~~a~~~~~~~~~~~~~~~~~~ 602 (645) T protein:vir:93 526 RKNALGQKEYPDMTLLGGSFQGLPVIVSQYVG--D-QLVLVNAPDIYLADDGGVAVDMSREASLEMQSEPTGDSTTPSPV 602 (645) T ss_pred ccccCCceeecCCCCCCceeeceeeEEeccCC--c-ceeEeccccEEEEEecceEEEeecceeEEEeecccccccccccc Confidence 999999998743 346799999999987653 2 47899999999999999999999998765433 3444 Q ss_pred cchhhhhcCcEEEEEEEEeccEEecccceEEEEeeccCCCCcc Q lcl|NC_011614. 279 TPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADAKPSSVP 321 (324) Q Consensus 279 ~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~~ 321 (324) .++++|++|+++||+++|+||++.+|+||++|+.+.|.++.-- T Consensus 603 ~~v~lf~~d~vaira~~r~d~~~~~p~a~~~lt~~~~g~~~~~ 645 (645) T protein:vir:93 603 ELVSMFQTGSVAIRAERWINWRRRRTAAVAVITGVNYGSASGG 645 (645) T ss_pred cchhHhhcCceEEEEEEEEcceeeCccceEEEecccCCcccCC Confidence 6789999999999999999999999999999999999887765 No 31 >protein:vir:100135 Length: 418 # NCBI annotation: gp5 # Family: family:all:585 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945035;genbank:gi:38707895;genbank:GeneID:2744182 Probab=100.00 E-value=5.5e-57 Score=329.04 Aligned_cols=303 Identities=13% Similarity=0.075 Sum_probs=250.2 Q ss_pred CchhhHHHHHHHHHhh---ccchhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCceEEEEEe Q lcl|NC_011614. 1 MEQTQKLKLNLQHFAS---NNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWA 77 (324) Q Consensus 1 m~~~~~~~~~~~~~~~---~~~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~~ 77 (324) +.....+..+.+.-.. ........+....++++++|++||++++++|++.+++.+++++++++++++++.+.+|+.. 
T Consensus 106 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~lvp~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~ 185 (418) T protein:vir:10 106 SEEMKGMDGSARKSVRVRVDRKSIMNVPATVGSGVSGSNSLVVADRQAGIIAPPQRKMTIRDLLMPGQTSSSSIEYTVET 185 (418) T ss_pred HHHHHHHHHHHhhhhhhhhHHHHHHHhhhhccCCCCCCccccchhHHHHHHHHHhhhhhHHhhcceeeccCCceeEEEEe Confidence 1111111111111111 1111112223334566678889999999999999999999999999999999889999987 Q ss_pred C-CcceeeecccccccccccceeeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhccCcCcCC Q lcl|NC_011614. 78 D-KPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFG 156 (324) Q Consensus 78 ~-~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~~~~ 156 (324) . .+.+.|++||+++++++++|+++++.++|++++++||+|+++++. +++++|.+.|++++++++|.++|+|+|++..| T Consensus 186 ~~~~~a~~v~E~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~-~l~~~i~~~l~~a~~~~~d~a~l~G~g~~~~p 264 (418) T protein:vir:10 186 GFTNNAAAVAEGAQKPTSDLKFNLKNQPVRTIAHLFKASRQILDDAP-ALQSYIDGRARYGLQLTEEGQILKGDGTGANI 264 (418) T ss_pred cCCCceeeeccCccccccccceeeEEEeeeeEEEeehhhHHHHHhHH-HHHHHHHHHHHHHHHHHHHHHHhccCCCCccc Confidence 6 578999999999999999999999999999999999999999885 89999999999999999999999999998878 Q ss_pred cccccccccccc--eeecccchhHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCceeecc---CCCceecccce Q lcl|NC_011614. 157 KSIAQSIEKTNK--VIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD---RNSDSLDGLPV 231 (324) Q Consensus 157 ~~~~~~~~~~~~--~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~g~~~~~~---~~~~~l~G~pv 231 (324) .+++........ ..++..+++++.+++.++...++.+++|+|||.++..|++++|++|+|+|.. 
+.+++|+|+|| T Consensus 265 ~Gi~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~i~~~~~~~~~~~l~G~pV 344 (418) T protein:vir:10 265 LGILPQASAFMPSITLANATPIDKIRLALLQAVLAEFPATGIVLNPIDWASIELTKDSQGRYIVGNPVNGTTPRLWNLPV 344 (418) T ss_pred cccccccccccccccccccccHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhcCCCceeccccccCCCceecceee Confidence 887766544333 3334567899999999999999999999999999999999999999999854 45678999999 Q ss_pred EeecCccCCCceEEEeeccc-EEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEE Q lcl|NC_011614. 232 VNLKSSNLKRGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKL 310 (324) Q Consensus 232 ~~~~~~~~~~~~i~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l 310 (324) ++++ .++.+.+++|||++ ++++++++++++++++.. .+|++|++.||++.|+||.+.+|+||+++ T Consensus 345 ~~~~--~~p~~~~~~gd~s~~~~~~~~~~~~i~~~~~~~------------~~f~~~~~~~r~~~~~d~~~~~~~a~~~~ 410 (418) T protein:vir:10 345 VETQ--AMTANEFLVGAFSMAAQIFDRMEIEVLLSTENV------------DDFEKNMVSIRAEERLALAVYRPESFVTG 410 (418) T ss_pred EEcC--CCCCCcEEEeeccceEEEEEecceEEEEecccc------------hhhhcCceEEEEEEeeccEEecccceEEE Confidence 9866 45677899999997 668889999999887643 36999999999999999999999999999 Q ss_pred EeeccCCC Q lcl|NC_011614. 311 VPADAKPS 318 (324) Q Consensus 311 ~~~~~~~~ 318 (324) +.+++++. T Consensus 411 ~~~~~~~g 418 (418) T protein:vir:10 411 ALVEQAGG 418 (418) T ss_pred EeccCCCC Confidence 99877777 No 32 >protein:vir:2504 Length: 305 # NCBI annotation: major capsid subunit gp9 # Family: family:all:507 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569745;genbank:gi:18496895;genbank:GeneID:932268 Probab=100.00 E-value=5.1e-57 Score=329.21 Aligned_cols=289 Identities=20% Similarity=0.256 Sum_probs=240.4 Q ss_pred ccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCceEEEEEeCCcceeeeccccc-----ccccccceeeE Q lcl|NC_011614. 
27 DNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKPGAYWVGEGQK-----IETSKATWVNA 101 (324) Q Consensus 27 ~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~-----~~~~~~~~~~v 101 (324) +..++++++|.+||++++++|++.+++.++++++++++++.++..++|+.+..+.|.|++|++. +|.++++|+++ T Consensus 1 ma~~t~~~gg~liP~~~~~~Ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~a~wv~E~~~~~~~~~~~s~~~f~~i 80 (305) T protein:vir:25 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLATLPEADWVGESATDPKGVKPTSKVTWANR 80 (305) T ss_pred CCCccCCccceecCHHHHHHHHHHHHhhchhhhhcceeeccCCcEEEEEEeCCcceEEeecccccccccccccccceeeE Confidence 5567777889999999999999999999999999999999999999999999999999999986 45578999999 Q ss_pred EeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhccCcCcC--Cccccccccc---ccceeecccch Q lcl|NC_011614. 102 TMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPF--GKSIAQSIEK---TNKVIKGDFTQ 176 (324) Q Consensus 102 ~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~~~--~~~~~~~~~~---~~~~~~~~~~~ 176 (324) +++++|++++++||+|+++||.++++++|.+.|++++++++|.++|+|+|++.. +......... ........... T Consensus 81 ~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~~~a~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (305) T protein:vir:25 81 TLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGGVANE 160 (305) T ss_pred EeeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHhhhheeccCCCCCccccccccccccccccccccccchhh Confidence 999999999999999999999999999999999999999999999999986422 1122211111 11111122233 Q ss_pred h----HHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCceeeccCCCceecccceEeecCcc--CCCceEEEeecc Q lcl|NC_011614. 
177 D----NIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDRNSDSLDGLPVVNLKSSN--LKRGELITGDFD 250 (324) Q Consensus 177 ~----~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~g~~~~~~~~~~~l~G~pv~~~~~~~--~~~~~i~~gd~~ 250 (324) + .+.++...+...++..+.|+|||.++..|++++|++|+|+|.+ ++|+|+|+++++..+ .++..+++|||+ T Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~i~~~---~~l~G~Pv~~~~~~~~~~~~~~~~~gd~s 237 (305) T protein:vir:25 161 SDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFRD---DSFAGFRTFFNRNGAWDADAAIEVIADSS 237 (305) T ss_pred hHHHHHHHHHHHhhhhcccccceeEecHHHHHHHHHhhccCCceeecC---CcccccceEEcCccCCCCCccEEEEEecc Confidence 3 3444555555566667789999999999999999999999964 589999999886543 356689999999 Q ss_pred cEEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeeccCCCCcccc Q lcl|NC_011614. 251 KLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADAKPSSVPGE 323 (324) Q Consensus 251 ~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~~~~ 323 (324) +++++.+++++++++++..... ++..+++|++|++++|++.|+||.+.+|+||++++...++ .++|+- T Consensus 238 ~~~i~~~~~~~i~~~~~~~~~~----~~~~~~~~~~~~~~~R~~~r~~~~v~~p~a~v~~~~~~~~-~~~pa~ 305 (305) T protein:vir:25 238 RVKIGVRQDITVKFLDQATLGT----GENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVA-VVAPAA 305 (305) T ss_pred eEEEEEecCeEEEEeeeeeeec----CCceeeeeecCcEEEEEEEeecceeeCcccEEEEcccccc-ccCCCC Confidence 9999999999999999986553 4567889999999999999999999999999999997654 667777 No 33 >protein:vir:78223 Length: 333 # NCBI annotation: Putative major head protein # Family: family:all:966 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491666;genbank:gi:157786490;genbank:GeneID:5625701 Probab=100.00 E-value=2.7e-56 Score=325.27 Aligned_cols=296 Identities=19% Similarity=0.279 Sum_probs=242.1 Q ss_pred cchhhhhccccccc------cCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCceEEEEEeCCcceeeeccc--- Q lcl|NC_011614. 
18 NVKPQVFNPDNVMM------HEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKPGAYWVGEG--- 88 (324) Q Consensus 18 ~~~~~~~~a~~~~~------~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~~a~~v~Eg--- 88 (324) -...+|+|+..... ++.++++||+++.++|++.+++.+++++++++++++++..++|+.++.+.|.|++|| T Consensus 1 ~a~l~el~~~~~~~~~~g~~~~~~~~liP~~~~~~ii~~l~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~eg~~~ 80 (333) T protein:vir:78 1 MATLNELLPNSAGSNHQGRLAHVPSDLLPKEIVGPIFDKAQESSLVLRMGEQIPISYGETIIPTTVKRPEVGQVGVGTSN 80 (333) T ss_pred CchhHHhhhhcccccccCceecCCccccchhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEeCCceeEeecCcccc Confidence 22234444332222 223445899999999999999999999999999999999999999999999888765 Q ss_pred -----ccccccccceeeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhccCcCcC--Cccccc Q lcl|NC_011614. 89 -----QKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPF--GKSIAQ 161 (324) Q Consensus 89 -----~~~~~~~~~~~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~~~--~~~~~~ 161 (324) +.+++++++|++++++++|++++++||+|+++++.++++++|.++|++++++++|.++|+|+|++.. +.++.. T Consensus 81 ~~~e~~~~~~~~~~f~~i~l~~~kl~~~~~is~ell~~s~~~~~~~i~~~la~ai~~~~d~~~l~G~g~~~~~~~~g~~~ 160 (333) T protein:vir:78 81 EQREGGLKPLSGTAWDTRSVSPIKLATIVTVSEEFARMNPSGLYTKLQGDLAYAIGRGIDLAVFHGKSPLTGSALQGIDT 160 (333) T ss_pred cccccccccccccceeEEEEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhcccCCCCCcccccccc Confidence 4578899999999999999999999999999999999999999999999999999999999986432 222222 Q ss_pred c------cccccceeecccchhHHHHHHHHhhhh-ccCCCEEEEcHHHHHHHHH---hhccCCceeecc----CCCceec Q lcl|NC_011614. 162 S------IEKTNKVIKGDFTQDNIIDLEALLEDD-ELEANAFISKTQNRSLLRK---IVDPETKERIYD----RNSDSLD 227 (324) Q Consensus 162 ~------~~~~~~~~~~~~~~~~i~~~~~~l~~~-~~~~~~~v~~~~~~~~L~~---l~d~~g~~~~~~----~~~~~l~ 227 (324) . .........+..+++++.+++..+..+ ++.++.|+|||.++..|++ ++|.+|+|++.. 
+.+++|+ T Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~~~~~~d~~G~~i~~~~~~~~~~~~l~ 240 (333) T protein:vir:78 161 DNVIANTTNVDYLQETGDPLLDRLLDGYDLVSANTDVEFNGWAVDPRFRAHLLRAQAYRDANGNVDPSRINLAAQTGDVL 240 (333) T ss_pred cccccccccccccccccchhHHHHHHHHHhhccccccCceEEEEcchHHHHHHHHhhhcCCCCceeecCccccCCCceee Confidence 1 111222334556789999999988765 4556789999999987764 678999999864 4567999 Q ss_pred ccceEeecCccC-------CCceEEEeecccEEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccE Q lcl|NC_011614. 228 GLPVVNLKSSNL-------KRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALH 300 (324) Q Consensus 228 G~pv~~~~~~~~-------~~~~i~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~ 300 (324) |+||++++..+. ++..+++|||+.++++.++++++++++++. ..+.++.++++|++|++.+|++.|+|++ T Consensus 241 G~Pv~~~~~i~~~~~~~~~~~~~~~~gD~~~~~~g~~~~~~i~~~~~~~---~~~~~~~~~~~~~~~~v~~r~~~r~d~~ 317 (333) T protein:vir:78 241 GLPAQFGRAVGGDLGAAVDSKTRIIGGDFSQLKFGFADEIRIKMSDTAT---LTDSGSATVSMWQTNQIAILIEVTFGWL 317 (333) T ss_pred ceeeEEccccCCCccccCCCccEEEEEecccEEEEEeeccEEEEecccc---ccccccceeehhhcCcEEEEEEEEEccE Confidence 999998764432 356799999999999999999999999875 4566777889999999999999999999 Q ss_pred EecccceEEEEeeccC Q lcl|NC_011614. 301 IADDKAFAKLVPADAK 316 (324) Q Consensus 301 v~~~~a~~~l~~~~~~ 316 (324) +.+|+||++|+++++. T Consensus 318 v~~~~a~~~l~~~~a~ 333 (333) T protein:vir:78 318 LGDKQAFVKFVDDEQP 333 (333) T ss_pred EecccceEEEeccCCC Confidence 9999999999987665 No 34 >protein:vir:78523 Length: 338 # NCBI annotation: Putative head structural protein # Family: family:all:507 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491585;genbank:gi:157786408;genbank:GeneID:5625675 Probab=100.00 E-value=3.7e-56 Score=324.45 Aligned_cols=301 Identities=20% Similarity=0.278 Sum_probs=246.6 Q ss_pred CchhhHHHHHHHHHhhccchhhhhccccc------cccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCceEEE Q lcl|NC_011614. 
1 MEQTQKLKLNLQHFASNNVKPQVFNPDNV------MMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFT 74 (324) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~a~~~------~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip 74 (324) |- ...|+|+... ..++.++.+||++++++|++.+++.++++++|++++++++..++| T Consensus 1 ~~-----------------~~~e~~~~~~~~~~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~ip 63 (338) T protein:vir:78 1 MA-----------------TLNELAPNTAGSNHQGRLAHVPSDLLPKEIVGPIFDKAQESSLVLRLGENIPISYGETIIP 63 (338) T ss_pred Cc-----------------chHHhhhhhcccccccceecccccccchHHHHHHHHHHHhhchhhhhcceeeccCCceEEE Confidence 22 2223332211 112234569999999999999999999999999999999999999 Q ss_pred EEeCCcc--------eeeecccccccccccceeeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011614. 75 FWADKPG--------AYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAG 146 (324) Q Consensus 75 ~~~~~~~--------a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~ 146 (324) +.+..+. +.|++||+++++++++|++++++++|++++++||+|+++++.++++++|.++|++++++++|.++ T Consensus 64 ~~~~~~~a~~v~~~~~~~~~Eg~~~~~~~~~f~~v~l~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~a~~~~~d~~~ 143 (338) T protein:vir:78 64 TTVKRPEVGQVGVGTSNEQREGGTKPLSGTAWDTRSVAPIKLATIVTVSEEFARMNPSGLYTKLQADLAYAIGRGIDLAV 143 (338) T ss_pred EEecCccceeecccccccccccccccccccceeEEEEEEEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHh Confidence 9876544 55677999999999999999999999999999999999999999999999999999999999999 Q ss_pred HhccCcCc--CCccccccccccc------ceeecccchhHHHHHHHHhhh-hccCCCEEEEcHHHHHHHH---HhhccCC Q lcl|NC_011614. 147 ILNQGNNP--FGKSIAQSIEKTN------KVIKGDFTQDNIIDLEALLED-DELEANAFISKTQNRSLLR---KIVDPET 214 (324) Q Consensus 147 l~g~g~~~--~~~~~~~~~~~~~------~~~~~~~~~~~i~~~~~~l~~-~~~~~~~~v~~~~~~~~L~---~l~d~~g 214 (324) |+|+|++. .+.++........ ........++++.++..++.. 
.....++|+|||+++..|+ +++|++| T Consensus 144 l~G~g~~~~~~~~gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~~~~l~d~~g 223 (338) T protein:vir:78 144 FHGKSPLTGSALQGIDTNNVIVNTTNVDYLQTGTTPLLDRFLDGYDLVSANTDVDFNGWAADPRYRARLLRSQAYRDANG 223 (338) T ss_pred hcccCCCccccccccccccccccccccccccccchhhHHHHHHHHHHhhhhccccceEEEEchHHHHHHHHHhhhccCCC Confidence 99998642 3333333221111 112234567888888888765 3456678999999988775 5779999 Q ss_pred ceeecc----CCCceecccceEeecCcc-------CCCceEEEeecccEEEEEecceEEEEeecccccccccccccchhh Q lcl|NC_011614. 215 KERIYD----RNSDSLDGLPVVNLKSSN-------LKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNL 283 (324) Q Consensus 215 ~~~~~~----~~~~~l~G~pv~~~~~~~-------~~~~~i~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~ 283 (324) +|++.. +.+++|+|+||++++..+ .++..+++|||+.++++++++++++++++.......++...++++ T Consensus 224 ~~l~~~~~~~~~~~~l~G~PV~~~~~ip~~~~~~~~~~~~~~~gdfs~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~ 303 (338) T protein:vir:78 224 NVDPTRINLAASAGDLLGLPVQFGKAVGGDLGAATDSKVRVVGGDFSQLKYGFADEIRVKMSDTATLTDNTSPTPQTVSM 303 (338) T ss_pred ceeecccccCCCCceeeeeeEEEccccCccccccCCcccEEEEEecceEEEEeecccEEEEeecccccccccccccchhh Confidence 999854 456789999999876433 345679999999999999999999999999999988999999999 Q ss_pred hhcCcEEEEEEEEeccEEecccceEEEEeeccCCC Q lcl|NC_011614. 
284 FEQDMVALRATMHVALHIADDKAFAKLVPADAKPS 318 (324) Q Consensus 284 f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~~~~~ 318 (324) |++|++++|++.|+||++.+|+||++|+++++..+ T Consensus 304 ~~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~~ 338 (338) T protein:vir:78 304 WQTNQIAILIEVTFGWLLGDKQAFVKFVDDEDPDA 338 (338) T ss_pred hhcCcEEEEEEEEeccEeecccceEEEecccCCCC Confidence 99999999999999999999999999999877777 No 35 >protein:vir:1886 Length: 385 # NCBI annotation: major capsid subunit precursor # Family: family:all:585 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037666;genbank:gi:9634124;genbank:GeneID:1262513 Probab=100.00 E-value=5.8e-56 Score=323.41 Aligned_cols=301 Identities=13% Similarity=0.084 Sum_probs=251.2 Q ss_pred CchhhHHHHHHHH-Hh--hccchhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCceEEEEEe Q lcl|NC_011614. 1 MEQTQKLKLNLQH-FA--SNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWA 77 (324) Q Consensus 1 m~~~~~~~~~~~~-~~--~~~~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~~ 77 (324) +...+++....+. +. .......+.+....++++.+|+++|++++..|++.+++.++|++++++++++++.+.+|+.+ T Consensus 75 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~ 154 (385) T protein:vir:18 75 KSFSERAAEELIKSWDGKQGTFGAKTFNKSLGSDADSAGSLIQPMQIPGIIMPGLRRLTIRDLLAQGRTSSNALEYVREE 154 (385) T ss_pred hhhHHHHHHHHHHHHHHhhccchhhHHHhhhccccccCCceecchhhhHHHHHhhhccchhhhcceecccCcceEEEEEe Confidence 2222222121111 11 12222334444445666778889999999999999999999999999999999889999987 Q ss_pred C-CcceeeecccccccccccceeeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhccCcCcCC Q lcl|NC_011614. 78 D-KPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFG 156 (324) Q Consensus 78 ~-~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~~~~ 156 (324) . .+.+.|++||+++|+++++|+++++.++|++++++||+|+++++. 
+++++|.+.|++++++++|.++|+|+|++..+ T Consensus 155 ~~~~~a~~v~E~~~~~~~~~~~~~~~~~~~k~~~~~~is~ell~d~~-~l~~~i~~~la~a~~~~~d~~~l~G~g~~~~~ 233 (385) T protein:vir:18 155 VFTNNADVVAEKALKPESDITFSKQTANVKTIAHWVQASRQVMDDAP-MLQSYINNRLMYGLALKEEGQLLNGDGTGDNL 233 (385) T ss_pred cCCcceeeeccCccccccccceeEEEEeeeeEEEeehhhHHHHhhHH-HHHHHHHHHHHHHHHHHHHHHHHhccCCCCcc Confidence 5 578999999999999999999999999999999999999999875 79999999999999999999999999999888 Q ss_pred cccccccccccc--eeecccchhHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCceeecc---CCCceecccce Q lcl|NC_011614. 157 KSIAQSIEKTNK--VIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD---RNSDSLDGLPV 231 (324) Q Consensus 157 ~~~~~~~~~~~~--~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~g~~~~~~---~~~~~l~G~pv 231 (324) .++......... ..++..+++++.+++.++...++.+++|+|||+++.+|++++|++|+|++.. +.+++|+|+|| T Consensus 234 ~Gi~~~~~~~~~~~~~~~~~~~d~i~~~~~~l~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~l~G~pV 313 (385) T protein:vir:18 234 EGLNKVATAYDTSLNATGDTRADIIAHAIYQVTESEFSASGIVLNPRDWHNIALLKDNEGRYIFGGPQAFTSNIMWGLPV 313 (385) T ss_pred cccccccccccccccccccchHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhcCCCceeccCcccCCCceecceee Confidence 777665544332 2345678999999999999999999999999999999999999999999854 55678999999 Q ss_pred EeecCccCCCceEEEeeccc-EEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEE Q lcl|NC_011614. 232 VNLKSSNLKRGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKL 310 (324) Q Consensus 232 ~~~~~~~~~~~~i~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l 310 (324) ++++. ++++.+++|||+. +.++.+++++++++++.. 
.+|++|++.||++.|+|+.+.+|+||+++ T Consensus 314 ~~~~~--~p~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~------------~~~~~~~~~~~~~~r~~~~v~~~~a~~~~ 379 (385) T protein:vir:18 314 VPTKA--QAAGTFTVGGFDMASQVWDRMDATVEVSREDR------------DNFVKNMLTILCEERLALAHYRPTAIIKG 379 (385) T ss_pred EEcCc--CCCCcEEEeecccEEEEEEecceEEEEecccc------------chhhcCcEEEEEEEeeccEEecccceEEE Confidence 98654 5677899999986 678999999998877642 36999999999999999999999999999 Q ss_pred EeeccC Q lcl|NC_011614. 311 VPADAK 316 (324) Q Consensus 311 ~~~~~~ 316 (324) +.++++ T Consensus 380 ~~~aa~ 385 (385) T protein:vir:18 380 TFSSGS 385 (385) T ss_pred EeccCC Confidence 998888 No 36 >protein:vir:191 Length: 385 # NCBI annotation: major head subunit precursor # Family: family:all:585 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037701;genbank:gi:9634158;genbank:GeneID:1262530 Probab=100.00 E-value=5.8e-56 Score=323.41 Aligned_cols=301 Identities=13% Similarity=0.084 Sum_probs=251.2 Q ss_pred CchhhHHHHHHHH-Hh--hccchhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCceEEEEEe Q lcl|NC_011614. 1 MEQTQKLKLNLQH-FA--SNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWA 77 (324) Q Consensus 1 m~~~~~~~~~~~~-~~--~~~~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~~ 77 (324) +...+++....+. +. .......+.+....++++.+|+++|++++..|++.+++.++|++++++++++++.+.+|+.+ T Consensus 75 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~ 154 (385) T protein:vir:19 75 KSFSERAAEELIKSWDGKQGTFGAKTFNKSLGSDADSAGSLIQPMQIPGIIMPGLRRLTIRDLLAQGRTSSNALEYVREE 154 (385) T ss_pred hhhHHHHHHHHHHHHHHhhccchhhHHHhhhccccccCCceecchhhhHHHHHhhhccchhhhcceecccCcceEEEEEe Confidence 2222222121111 11 12222334444445666778889999999999999999999999999999999889999987 Q ss_pred C-CcceeeecccccccccccceeeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhccCcCcCC Q lcl|NC_011614. 
78 D-KPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFG 156 (324) Q Consensus 78 ~-~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~~~~ 156 (324) . .+.+.|++||+++|+++++|+++++.++|++++++||+|+++++. +++++|.+.|++++++++|.++|+|+|++..+ T Consensus 155 ~~~~~a~~v~E~~~~~~~~~~~~~~~~~~~k~~~~~~is~ell~d~~-~l~~~i~~~la~a~~~~~d~~~l~G~g~~~~~ 233 (385) T protein:vir:19 155 VFTNNADVVAEKALKPESDITFSKQTANVKTIAHWVQASRQVMDDAP-MLQSYINNRLMYGLALKEEGQLLNGDGTGDNL 233 (385) T ss_pred cCCcceeeeccCccccccccceeEEEEeeeeEEEeehhhHHHHhhHH-HHHHHHHHHHHHHHHHHHHHHHHhccCCCCcc Confidence 5 578999999999999999999999999999999999999999875 79999999999999999999999999999888 Q ss_pred cccccccccccc--eeecccchhHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCceeecc---CCCceecccce Q lcl|NC_011614. 157 KSIAQSIEKTNK--VIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD---RNSDSLDGLPV 231 (324) Q Consensus 157 ~~~~~~~~~~~~--~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~g~~~~~~---~~~~~l~G~pv 231 (324) .++......... ..++..+++++.+++.++...++.+++|+|||+++.+|++++|++|+|++.. +.+++|+|+|| T Consensus 234 ~Gi~~~~~~~~~~~~~~~~~~~d~i~~~~~~l~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~l~G~pV 313 (385) T protein:vir:19 234 EGLNKVATAYDTSLNATGDTRADIIAHAIYQVTESEFSASGIVLNPRDWHNIALLKDNEGRYIFGGPQAFTSNIMWGLPV 313 (385) T ss_pred cccccccccccccccccccchHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhcCCCceeccCcccCCCceecceee Confidence 777665544332 2345678999999999999999999999999999999999999999999854 55678999999 Q ss_pred EeecCccCCCceEEEeeccc-EEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEE Q lcl|NC_011614. 232 VNLKSSNLKRGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKL 310 (324) Q Consensus 232 ~~~~~~~~~~~~i~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l 310 (324) ++++. ++++.+++|||+. +.++.+++++++++++.. 
.+|++|++.||++.|+|+.+.+|+||+++ T Consensus 314 ~~~~~--~p~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~------------~~~~~~~~~~~~~~r~~~~v~~~~a~~~~ 379 (385) T protein:vir:19 314 VPTKA--QAAGTFTVGGFDMASQVWDRMDATVEVSREDR------------DNFVKNMLTILCEERLALAHYRPTAIIKG 379 (385) T ss_pred EEcCc--CCCCcEEEeecccEEEEEEecceEEEEecccc------------chhhcCcEEEEEEEeeccEEecccceEEE Confidence 98654 5677899999986 678999999998877642 36999999999999999999999999999 Q ss_pred EeeccC Q lcl|NC_011614. 311 VPADAK 316 (324) Q Consensus 311 ~~~~~~ 316 (324) +.++++ T Consensus 380 ~~~aa~ 385 (385) T protein:vir:19 380 TFSSGS 385 (385) T ss_pred EeccCC Confidence 998888 No 37 >protein:vir:8102 Length: 543 # NCBI annotation: gp6 # Family: family:all:21 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817683;genbank:gi:29566114;genbank:GeneID:1259308 Probab=100.00 E-value=3.6e-56 Score=324.52 Aligned_cols=302 Identities=10% Similarity=0.035 Sum_probs=244.3 Q ss_pred CchhhHHHHHHHHHhhccchhh------hhccccccccCCCcceechhhhHHHH-HHHHhhcchhhhceeeecCCCceEE Q lcl|NC_011614. 1 MEQTQKLKLNLQHFASNNVKPQ------VFNPDNVMMHEKKDGTLLNDFTTPIL-QEVMENSKIMQLGKYEPMEGTEKKF 73 (324) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~~------~~~a~~~~~~~~~g~lip~~~~~~i~-~~~~~~s~l~~l~~~~~~~~~~~~i 73 (324) ++++. +...++.......... +.++. ..+++++|.+||+++..++| +.+.+.++++++++++++ ++.+.+ T Consensus 220 ~~~~a-~~~~~~~~~~~~l~~~e~~~~~~~~~~-~~t~~~gg~lip~~~~~~ii~~~~~~~~~l~~~~~~~~~-~g~~~~ 296 (543) T protein:vir:81 220 AYLRA-WSKMARNPHAAILTEEEKRAINEVRAM-GLTKADGGYLVPFQLDPTVIITSNGSLNDIRRFARQVVA-TGDVWH 296 (543) T ss_pred hhhhH-HHHHHHhhHHHHhhhhhhhhhhhhhhc-ccccccCcccCchhhhhHHHHHHHhhhchhhhhcccccC-CcceEE Confidence 11100 0011111111111111 12222 24566788999999998877 557788999999988766 456889 Q ss_pred EEEeCCcceeeecccccccccccceeeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhccCcC Q lcl|NC_011614. 
74 TFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNN 153 (324) Q Consensus 74 p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~ 153 (324) |+.++.+.+.|++||+.+|+++++|++++++++|++++++||+|+++|+ +++.++|.+.|++++++++|.+||+|+|++ T Consensus 297 ~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~-~~~~~~i~~~l~~~~~~~~d~ail~G~Gt~ 375 (543) T protein:vir:81 297 GVSSAAVQWSWDAEFEEVSDDSPEFGQPEIPVKKAQGFVPISIEALQDE-ANVTETVALLFAEGKDELEAVTLTTGTGQG 375 (543) T ss_pred EEecCCcceeecccCccccccccccceeeeeeeeeEeeehhhHHHHhcc-HHHHHHHHHHHHHHHHHHHHHHHhccCCCC Confidence 9999999999999999999999999999999999999999999999998 599999999999999999999999999998 Q ss_pred cCCccccccccc----ccceeecccchhHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCceeecc---CCCcee Q lcl|NC_011614. 154 PFGKSIAQSIEK----TNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD---RNSDSL 226 (324) Q Consensus 154 ~~~~~~~~~~~~----~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~g~~~~~~---~~~~~l 226 (324) ..|.|+...... ......+.++++++.+++..++.+|..+++|+|||.++..|++++|++|+|+|.+ +.+++| T Consensus 376 ~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~v~n~~~~~~l~~lkd~~G~~l~~~~~~g~~~~l 455 (543) T protein:vir:81 376 NQPTGIVTALAGTAAEIAPVTAETFALADVYAVYEQLAARHRRQGAWLANNLIYNKIRQFDTQGGAGLWTTIGNGEPSQL 455 (543) T ss_pred cccccchhhcccccccccccccccccHHHHHHHHHhhhccccCCcEEEEcHHHHHHHHHhhcCCCceeccCcCCCCCccc Confidence 888887654332 2233445688999999999999999999999999999999999999999999864 345789 Q ss_pred cccceEeecCccC--------CCceEEEeecccEEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEec Q lcl|NC_011614. 227 DGLPVVNLKSSNL--------KRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVA 298 (324) Q Consensus 227 ~G~pv~~~~~~~~--------~~~~i~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d 298 (324) +|+||+++++.+. +...+++|||++++++.+++++++++.+... 
...|.+|+++||++.|+| T Consensus 456 ~G~pv~~~~~~~~~~~~~~~~~~~~i~~gd~~~~~i~~~~~~~i~~~~~~~~----------~~~~~~~~~~~~~~~r~d 525 (543) T protein:vir:81 456 LGRPVGEAEAMDANWNTSASADNFVLLYGNFQNYVIADRIGMTVEFIPHLFG----------TNRRPNGSRGWFAYYRMG 525 (543) T ss_pred cceeeEEeccccccccccccCCcceEEEeeccceeEEeecccEEEEeccccc----------cchhhcCceEEEEEEeec Confidence 9999999875432 4556999999999999999999998876431 124789999999999999 Q ss_pred cEEecccceEEEEeeccC Q lcl|NC_011614. 299 LHIADDKAFAKLVPADAK 316 (324) Q Consensus 299 ~~v~~~~a~~~l~~~~~~ 316 (324) +.+.+|+||++++.++++ T Consensus 526 ~~v~~~~A~~~l~~~~~a 543 (543) T protein:vir:81 526 ADVVNPNAFRLLNVETAS 543 (543) T ss_pred cEeecccceEEEEecccC Confidence 999999999999998888 No 38 >protein:vir:4339 Length: 395 # NCBI annotation: major head protein # Family: family:all:585 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061502;genbank:gi:9635591;genbank:GeneID:1262860 Probab=100.00 E-value=7e-56 Score=322.97 Aligned_cols=299 Identities=14% Similarity=0.098 Sum_probs=247.5 Q ss_pred Cch-------hh-HHHHHHHHHhhccchhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCceE Q lcl|NC_011614. 1 MEQ-------TQ-KLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKK 72 (324) Q Consensus 1 m~~-------~~-~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ 72 (324) ..+ .. ..+.+.+..........+.++ ...++.++|+++|++++++|++.+++.++|++++++++++++.+. T Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~g~~vp~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~ 158 (395) T protein:vir:43 80 APKTAGQMVAESLKEQGVTSSLRGSHRVSMPRSA-ITSIDGSGGALVAPDRRPGVVAAPQRRLTIRDLVAPGTTESNSVE 158 (395) T ss_pred hhhhHHHHHHHHHHHHHHHHHhhhhhhhhhhhhh-hcccCCCCccccchhhHHHHHHHHHhhhhHHhhccceecCCCceE Confidence 111 11 111222222222222223333 345667788999999999999999999999999999999998899 Q ss_pred EEEEeC-CcceeeecccccccccccceeeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhccC Q lcl|NC_011614. 
73 FTFWAD-KPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQG 151 (324) Q Consensus 73 ip~~~~-~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g 151 (324) +|+.+. .+.+.|++||+.+|+++++|++++++++|++++++||+|++++++ +++++|.+.|++++++++|.++|+|+| T Consensus 159 ~~~~~~~~~~a~~v~E~~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~~-~l~~~v~~~la~a~~~~~d~~~l~G~g 237 (395) T protein:vir:43 159 YVRETGFVNNAAPVSEGTQKPYSDLTFELENAPVRTIAHLFKASRQILDDAS-ALQSYIDARARYGLMLVEECQLLYGNG 237 (395) T ss_pred EEEEecCCCceeeecCCccccccccceeEEEEeeeeEEEeehhhHHHHHhHH-HHHHHHHHHHHHHHHHHHHHHHHhccC Confidence 999876 468999999999999999999999999999999999999999875 799999999999999999999999999 Q ss_pred cCcCCcccccccccccc----eeecccchhHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCceeecc---CCCc Q lcl|NC_011614. 152 NNPFGKSIAQSIEKTNK----VIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD---RNSD 224 (324) Q Consensus 152 ~~~~~~~~~~~~~~~~~----~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~g~~~~~~---~~~~ 224 (324) ++.++.+++........ ..++...++++.+++.++...+..+++|+|||.++.+|++++|++|+|++.. +..+ T Consensus 238 ~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~~~ 317 (395) T protein:vir:43 238 TGANLHGIIPQAQAYAPPSGVVVTAEQRIDRIRLAILQAQLAEFPASGIVLNPIDWALIELNKDAENRYIIGSPQNGTTP 317 (395) T ss_pred CCCccccccccccccccccccccccchhHHHHHHHHHhhccccCCCcEEEEcHHHHHHHHHhhccCCceeccccccCCCc Confidence 98887777765443322 2334467899999999999999999999999999999999999999999854 4466 Q ss_pred eecccceEeecCccCCCceEEEeeccc-EEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEec Q lcl|NC_011614. 225 SLDGLPVVNLKSSNLKRGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIAD 303 (324) Q Consensus 225 ~l~G~pv~~~~~~~~~~~~i~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~ 303 (324) +|+|+||++++. ++++.+++|||+. +.++++++++++++++.. 
.+|++|++.||++.|+||++.+ T Consensus 318 ~l~G~pVv~~~~--~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~------------~~f~~~~~~~r~~~r~d~~v~~ 383 (395) T protein:vir:43 318 TLWRLPVVETQA--ITQDEFLTGAFSLGAQIFDRMDIEVLVSTEND------------KDFENNMVTIRAEERLAFAVYR 383 (395) T ss_pred eecceeeEEcCC--CCCCcEEEEeccceEEEEEecceEEEEecccc------------chhhcCcEEEEEEEeeccEEec Confidence 899999998764 5677899999998 567889999999887542 3699999999999999999999 Q ss_pred ccceEEEEeecc Q lcl|NC_011614. 304 DKAFAKLVPADA 315 (324) Q Consensus 304 ~~a~~~l~~~~~ 315 (324) |+||++++.+++ T Consensus 384 ~~a~~~~~~taa 395 (395) T protein:vir:43 384 PEAFVTGSLTAS 395 (395) T ss_pred ccceEEEEeccC Confidence 999999999887 No 39 >protein:vir:101650 Length: 497 # NCBI annotation: gp13 # Family: family:all:585 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654768;genbank:gi:109302766;genbank:GeneID:4156084 Probab=100.00 E-value=9e-56 Score=322.36 Aligned_cols=303 Identities=15% Similarity=0.122 Sum_probs=236.3 Q ss_pred Cchhh---------HHHHHHHHHhhccchhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCce Q lcl|NC_011614. 1 MEQTQ---------KLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEK 71 (324) Q Consensus 1 m~~~~---------~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~ 71 (324) +..++ +.......+........+.+.....+++++|++||+++.++|++.+++.+++++++++++++++.+ T Consensus 116 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vp~~~~~~ii~~~~~~~~i~~l~~~~~~~~~~~ 195 (497) T protein:vir:10 116 VSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNL 195 (497) T ss_pred hhhhhhhhhhhhHHHHHHHHHHHhhhhhhHHHHHhhhcccCcccccccchhhhHHHHHHHHhhhhHHhhccccccCCCce Confidence 11111 111112223333333344555556677788999999999999999999999999999999999999 Q ss_pred EEEEEeC-CcceeeecccccccccccceeeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_011614. 
72 KFTFWAD-KPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQ 150 (324) Q Consensus 72 ~ip~~~~-~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~ 150 (324) .||+.++ .+.+.|++||+.+|+++++|+++++.++|++++++||+|+++|++ +++++|.+.|++++++++|.+||+|+ T Consensus 196 ~~~~~~~~~~~a~wv~E~~~~~~s~~~f~~i~~~~~k~a~~~~iS~ell~d~~-~l~~~i~~~l~~~i~~~~d~~~l~G~ 274 (497) T protein:vir:10 196 SYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTITDEGLRDAP-ELFNFVQGRLLEGIQRKEEVQLLAGG 274 (497) T ss_pred EEEEEcCCCCcceeeccCcccccccccceeeEeeeeeeEeecHhHHHHHHhHH-HHHHHHHHHHHHHHHHHHHHHhhcCC Confidence 9999876 468999999999999999999999999999999999999999985 69999999999999999999999999 Q ss_pred CcCcCCcccccccccccceee--------------------------------------------------------ccc Q lcl|NC_011614. 151 GNNPFGKSIAQSIEKTNKVIK--------------------------------------------------------GDF 174 (324) Q Consensus 151 g~~~~~~~~~~~~~~~~~~~~--------------------------------------------------------~~~ 174 (324) |++ .|.+++........... ... T Consensus 275 G~~-~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 353 (497) T protein:vir:10 275 GYP-GVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAE 353 (497) T ss_pred Ccc-cccccccccccccccccccchhhhhhhhhhhhhhcccccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhh Confidence 976 35555543322111000 001 Q ss_pred chhHHHHHHHHhhh-hccCCCEEEEcHHHHHHHHHhhccCCceeeccCC----------CceecccceEeecCccCCCce Q lcl|NC_011614. 175 TQDNIIDLEALLED-DELEANAFISKTQNRSLLRKIVDPETKERIYDRN----------SDSLDGLPVVNLKSSNLKRGE 243 (324) Q Consensus 175 ~~~~i~~~~~~l~~-~~~~~~~~v~~~~~~~~L~~l~d~~g~~~~~~~~----------~~~l~G~pv~~~~~~~~~~~~ 243 (324) ..+++..++..+.. .++.+++|+|||.++..|+++||++|+|+|.+.. .++|+|+||+++++.+ .+. 
T Consensus 354 ~~~~~~~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~pV~~t~~~~--~~~ 431 (497) T protein:vir:10 354 IAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIP--LGT 431 (497) T ss_pred hhhHHHHHHhhhhhhcccCCCeEEEchHHHHHHHHhhcCCCceeccCcccccccccccCCceeeceeeEecCCCC--CCc Confidence 11223333333433 3455678999999999999999999999986432 3489999999987654 567 Q ss_pred EEEeeccc--EEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeeccCCCC Q lcl|NC_011614. 244 LITGDFDK--LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADAKPSS 319 (324) Q Consensus 244 i~~gd~~~--~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~ 319 (324) +++|||+. +.++++.+++++++++.. .+|++|+++||++.|+|+.|.+|+||++++.++...++ T Consensus 432 ~~~Gd~~~~~~~i~~r~~~~v~~~~~~~------------~~f~~n~v~~r~~~r~~~~v~~p~A~~~l~~~~~~~~~ 497 (497) T protein:vir:10 432 ILVGHFAPSVIQTARREGVTMQMTNSNG------------TDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) T ss_pred eEEeecccceEEEEEecccEEEeecccc------------hhhhcCcEEEEEEEeecceeeccccEEEEEecCCccCC Confidence 89999986 457889999999987642 35999999999999999999999999999999888888 No 40 >protein:vir:7855 Length: 497 # NCBI annotation: gp12 # Family: family:all:585 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817462;genbank:gi:29565891;genbank:GeneID:1259081 Probab=100.00 E-value=9e-56 Score=322.36 Aligned_cols=303 Identities=15% Similarity=0.122 Sum_probs=236.3 Q ss_pred Cchhh---------HHHHHHHHHhhccchhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCce Q lcl|NC_011614. 
1 MEQTQ---------KLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEK 71 (324) Q Consensus 1 m~~~~---------~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~ 71 (324) +..++ +.......+........+.+.....+++++|++||+++.++|++.+++.+++++++++++++++.+ T Consensus 116 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vp~~~~~~ii~~~~~~~~i~~l~~~~~~~~~~~ 195 (497) T protein:vir:78 116 VSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNL 195 (497) T ss_pred hhhhhhhhhhhhHHHHHHHHHHHhhhhhhHHHHHhhhcccCcccccccchhhhHHHHHHHHhhhhHHhhccccccCCCce Confidence 11111 111112223333333344555556677788999999999999999999999999999999999999 Q ss_pred EEEEEeC-CcceeeecccccccccccceeeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_011614. 72 KFTFWAD-KPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQ 150 (324) Q Consensus 72 ~ip~~~~-~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~ 150 (324) .||+.++ .+.+.|++||+.+|+++++|+++++.++|++++++||+|+++|++ +++++|.+.|++++++++|.+||+|+ T Consensus 196 ~~~~~~~~~~~a~wv~E~~~~~~s~~~f~~i~~~~~k~a~~~~iS~ell~d~~-~l~~~i~~~l~~~i~~~~d~~~l~G~ 274 (497) T protein:vir:78 196 SYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTITDEGLRDAP-ELFNFVQGRLLEGIQRKEEVQLLAGG 274 (497) T ss_pred EEEEEcCCCCcceeeccCcccccccccceeeEeeeeeeEeecHhHHHHHHhHH-HHHHHHHHHHHHHHHHHHHHHhhcCC Confidence 9999876 468999999999999999999999999999999999999999985 69999999999999999999999999 Q ss_pred CcCcCCcccccccccccceee--------------------------------------------------------ccc Q lcl|NC_011614. 151 GNNPFGKSIAQSIEKTNKVIK--------------------------------------------------------GDF 174 (324) Q Consensus 151 g~~~~~~~~~~~~~~~~~~~~--------------------------------------------------------~~~ 174 (324) |++ .|.+++........... ... 
T Consensus 275 G~~-~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 353 (497) T protein:vir:78 275 GYP-GVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAE 353 (497) T ss_pred Ccc-cccccccccccccccccccchhhhhhhhhhhhhhcccccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhh Confidence 976 35555543322111000 001 Q ss_pred chhHHHHHHHHhhh-hccCCCEEEEcHHHHHHHHHhhccCCceeeccCC----------CceecccceEeecCccCCCce Q lcl|NC_011614. 175 TQDNIIDLEALLED-DELEANAFISKTQNRSLLRKIVDPETKERIYDRN----------SDSLDGLPVVNLKSSNLKRGE 243 (324) Q Consensus 175 ~~~~i~~~~~~l~~-~~~~~~~~v~~~~~~~~L~~l~d~~g~~~~~~~~----------~~~l~G~pv~~~~~~~~~~~~ 243 (324) ..+++..++..+.. .++.+++|+|||.++..|+++||++|+|+|.+.. .++|+|+||+++++.+ .+. T Consensus 354 ~~~~~~~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~pV~~t~~~~--~~~ 431 (497) T protein:vir:78 354 IAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIP--LGT 431 (497) T ss_pred hhhHHHHHHhhhhhhcccCCCeEEEchHHHHHHHHhhcCCCceeccCcccccccccccCCceeeceeeEecCCCC--CCc Confidence 11223333333433 3455678999999999999999999999986432 3489999999987654 567 Q ss_pred EEEeeccc--EEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeeccCCCC Q lcl|NC_011614. 244 LITGDFDK--LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADAKPSS 319 (324) Q Consensus 244 i~~gd~~~--~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~ 319 (324) +++|||+. +.++++.+++++++++.. 
.+|++|+++||++.|+|+.|.+|+||++++.++...++ T Consensus 432 ~~~Gd~~~~~~~i~~r~~~~v~~~~~~~------------~~f~~n~v~~r~~~r~~~~v~~p~A~~~l~~~~~~~~~ 497 (497) T protein:vir:78 432 ILVGHFAPSVIQTARREGVTMQMTNSNG------------TDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) T ss_pred eEEeecccceEEEEEecccEEEeecccc------------hhhhcCcEEEEEEEeecceeeccccEEEEEecCCccCC Confidence 89999986 457889999999987642 35999999999999999999999999999999888888 No 41 >protein:vir:1328 Length: 392 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047927;swissprot:trembl:q9zwv6;genbank:gi:9631145;uniprot:Q9ZWV6;genbank:GeneID:2715889 Probab=100.00 E-value=1.6e-55 Score=320.98 Aligned_cols=299 Identities=14% Similarity=0.123 Sum_probs=238.7 Q ss_pred CchhhHHHHHHHHHhhccchhhhhcc-ccccccCCCcceechhhhHHHHHHHHh-hcchhhhceeeecCCC-ceEEEEEe Q lcl|NC_011614. 1 MEQTQKLKLNLQHFASNNVKPQVFNP-DNVMMHEKKDGTLLNDFTTPILQEVME-NSKIMQLGKYEPMEGT-EKKFTFWA 77 (324) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~a-~~~~~~~~~g~lip~~~~~~i~~~~~~-~s~l~~l~~~~~~~~~-~~~ip~~~ 77 (324) -...+..+..+|+.........+... ....+++.+|+++|+++..+++..+.+ .++++.+++++++.++ .+.+|+.+ T Consensus 83 ~~~~~~~~~~~r~g~~~~~~~~~~~~~~~~~t~~~~g~~~~~~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~ 162 (392) T protein:vir:13 83 RSADHDDDAVLRAGNLGEARSFEFAPEKRDGTKAGNPNVLSRTLYGQLIAQAVERSAIMRGGASTFTTSDANPMDFTVIT 162 (392) T ss_pred hhhhHHHHHHHhccchhhhHHHHhhhhhhcccccCCCccccccchHHHHHHHHhhhhhhhhcceeeecCCCceeEEEEEc Confidence 11111122222222211111111111 112344556778888888888876555 5567778888888654 47899999 Q ss_pred CCcceeeecccccccccccceeeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhccCcCcCCc Q lcl|NC_011614. 78 DKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGK 157 (324) Q Consensus 78 ~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~~~~~ 157 (324) +.+.+.|++|++++|+++++|+++++.++|++++++||+|+++|+.++++++|.++|++++++++|.++|+|+|++ .|. 
T Consensus 163 ~~~~a~~v~E~~~~~~~~~~f~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~l~G~Gt~-~p~ 241 (392) T protein:vir:13 163 GRATAGIVGETAEIPESYPATTQRSMGGFKYGFASVVSYEFATDQVLDLVGFLVSDAGPAIGDAMGRHFLTGTGTG-QPR 241 (392) T ss_pred CCcceeeecccccccccccceeeEEeeeeeEEeeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhcccCCc-ccc Confidence 9999999999999999999999999999999999999999999999999999999999999999999999999975 456 Q ss_pred ccccccccc----cceeecccchhHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCceeecc----CCCceeccc Q lcl|NC_011614. 158 SIAQSIEKT----NKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD----RNSDSLDGL 229 (324) Q Consensus 158 ~~~~~~~~~----~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~g~~~~~~----~~~~~l~G~ 229 (324) |++...... ....++..+++++++++..+...++.+++|+|||+++..|++++|++|+|+|.+ +.+++|+|+ T Consensus 242 Gil~~~~~~~~~~~~~~~~~~~~d~l~~~~~~l~~~~~~~a~~v~n~~~~~~l~~lkd~~G~~l~~~~~~~g~~~~l~G~ 321 (392) T protein:vir:13 242 GILTDATGANAAFGEADADSKVSDALIDLFHEVPSAYRKNAKFVVNDLRAAQMRKLKDANGQYLWQSALTVGAPDTFNGK 321 (392) T ss_pred ccccccccccccccccccccccHHHHHHHHHhhhhhhhcCCEEEEcHHHHHHHHHhhccCCceeecCCcCCCCCceecce Confidence 665543322 122345678999999999999999999999999999999999999999999864 345689999 Q ss_pred ceEeecCccCCCceEEEeecccEEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEE Q lcl|NC_011614. 230 PVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAK 309 (324) Q Consensus 230 pv~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~ 309 (324) ||++++. ++.+.+++|||++++++.+++++++.+.+. +|.+|++.||++.|+|+++.+|+||++ T Consensus 322 Pv~~~~~--~~~~~i~~Gdf~~~~i~~~~~~~i~~~~~~--------------~~~~~~~~~r~~~r~d~~~~~~~A~~~ 385 (392) T protein:vir:13 322 VVETDDG--MPADKVLFADLSKYRVRFAGSLRVDRSVDA--------------KFSTDQIVYRFLQRADGLLVDARGAKV 385 (392) T ss_pred eeEEcCC--CCCCcEEEeeccceeEEeecceEEEeeccc--------------cccCCcEEEEEEEEeccEEecccceEE Confidence 9998765 456789999999999999999999887653 589999999999999999999999999 Q ss_pred EEeeccC Q lcl|NC_011614. 
310 LVPADAK 316 (324) Q Consensus 310 l~~~~~~ 316 (324) ++.++++ T Consensus 386 ~~~~~aa 392 (392) T protein:vir:13 386 LTVTPAA 392 (392) T ss_pred EEeeccC Confidence 9998887 No 42 >protein:vir:6242 Length: 390 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813696;swissprot:trembl:q859c1;genbank:gi:29366756;interpro:IPR006444;uniprot:Q859C1;genbank:GeneID:1258897 Probab=100.00 E-value=1.3e-55 Score=321.46 Aligned_cols=297 Identities=14% Similarity=0.111 Sum_probs=235.0 Q ss_pred Cchhh--HHHHHHHHHhhccchhhhhccc-cccccCCCcceechhhhHHHHH-HHHhhcchhhhceeeecCCC-ceEEEE Q lcl|NC_011614. 1 MEQTQ--KLKLNLQHFASNNVKPQVFNPD-NVMMHEKKDGTLLNDFTTPILQ-EVMENSKIMQLGKYEPMEGT-EKKFTF 75 (324) Q Consensus 1 m~~~~--~~~~~~~~~~~~~~~~~~~~a~-~~~~~~~~g~lip~~~~~~i~~-~~~~~s~l~~l~~~~~~~~~-~~~ip~ 75 (324) .++.. ..+..+|+..-......+.... ...+++.+|+++|+++..+++. .+++.++++.+++++++.++ .+.+|+ T Consensus 81 ~~~~~~~~~~~~~r~~~~~~~r~~~~~~~~~~~t~~~~g~~~~~~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~p~ 160 (390) T protein:vir:62 81 AQRSADVDDDATLRAGNLGEARSFEFAPEKRDGTKAGNPNVLSRTLYGQLIAQAVERSAIMRGGATTFTTSDANPLDFTV 160 (390) T ss_pred chhhcchHHHHHHhhhhhhhhHHHHhhhhhhcccccCCCccccccchHHHHHHHHhhhhhhhhcceeeecCCCceeEEEE Confidence 11111 1122222221111111111111 1234444566777777766665 45667678889999998764 478999 Q ss_pred EeCCcceeeecccccccccccceeeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhccCcCcC Q lcl|NC_011614. 76 WADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPF 155 (324) Q Consensus 76 ~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~~~ 155 (324) .++.+.+.|++|++.+|+++++|++++++++|++++++||+|+++||.++++++|.+.|+++++.++|.++|+|+|. 
T Consensus 161 ~~~~~~a~wv~E~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~l~G~G~--- 237 (390) T protein:vir:62 161 ITGRSSASIVGETAEIPESYPATAQRSMGGFKYGFASVVSYEFATDQVLDLVGFLVSDAGPAIGDAMGRHFITGTGQ--- 237 (390) T ss_pred EcCCcceeeecccccccccccceeeeEeeeeeEEeehHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHhhhhccCCc--- Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999883 Q ss_pred Cccccccccc----ccceeecccchhHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCceeecc----CCCceec Q lcl|NC_011614. 156 GKSIAQSIEK----TNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD----RNSDSLD 227 (324) Q Consensus 156 ~~~~~~~~~~----~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~g~~~~~~----~~~~~l~ 227 (324) |.|+...... .....++..+++++++|+.+|..+|..+++|+||++++..|++++|++|+|+|.+ +.+++|+ T Consensus 238 p~Gi~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~l~~~~~~~a~~vmn~~~~~~L~~lkd~~g~~l~~~~~~~g~~~~l~ 317 (390) T protein:vir:62 238 PRGILTDASPATATFLATDTDSKVSDALIDLFHEVPSAYRANAKYVVNDLRAAQMRKLKDANGQYLWQSGLTVGAPSLFN 317 (390) T ss_pred cccccccccccccceecccccccchHHHHHHHHhhhhhhhcCCEEEEchHHHHHHHHhhccCCCeeecCCcCCCccceec Confidence 4455443322 1222345678999999999999999999999999999999999999999999854 3456899 Q ss_pred ccceEeecCccCCCceEEEeecccEEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccce Q lcl|NC_011614. 228 GLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAF 307 (324) Q Consensus 228 G~pv~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~ 307 (324) |+||++++.. +...+++|||++++++.+++++++.+.+. +|.+|++.||++.|+|+++.+|+|| T Consensus 318 G~Pv~~~~~~--p~~~i~~gd~s~~~i~~~~~~~v~~~~~~--------------~~~~~~~~~~~~~r~d~~~~~~~A~ 381 (390) T protein:vir:62 318 GKVVETDDGM--PADKILFADLSKYRVRFAGSLRVDRSVDA--------------KFSTDQIVYRFLQRADGLLVDARGA 381 (390) T ss_pred ccceEEecCC--CCccEEEeeccceeEEeecceEEEeeccc--------------cccCCcEEEEEEEEeCcEeechhhe Confidence 9999987654 55679999999999999999999988653 5899999999999999999999999 Q ss_pred EEEEeeccC Q lcl|NC_011614. 
308 AKLVPADAK 316 (324) Q Consensus 308 ~~l~~~~~~ 316 (324) ++|+.++++ T Consensus 382 ~~l~~~~~a 390 (390) T protein:vir:62 382 KVLTVTPGA 390 (390) T ss_pred EEEEeecCC Confidence 999988777 No 43 >protein:vir:102119 Length: 404 # NCBI annotation: phage major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699941;genbank:gi:110804052;genbank:GeneID:4206662 Probab=100.00 E-value=3.4e-55 Score=319.21 Aligned_cols=307 Identities=12% Similarity=0.141 Sum_probs=253.3 Q ss_pred CchhhHHHH----HHHHH--hhccchhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCC--CceE Q lcl|NC_011614. 1 MEQTQKLKL----NLQHF--ASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEG--TEKK 72 (324) Q Consensus 1 m~~~~~~~~----~~~~~--~~~~~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~--~~~~ 72 (324) ..+....+. .++.. ........+.++.+..++++||.+||++++++|++.+++.++|+++++++++++ +.+. T Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~a~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~g~~~ 157 (404) T protein:vir:10 78 YNGALFVRAIADNLLKQKNQRGLNLSEKEINAISENIDEDGGYAVPEDIQTKINTRLKDTTDLYNMVDYEPVFTRSGSRT 157 (404) T ss_pred HHHHHHHHHHHHHHHHHHHhhhhcchhhHHhhhccccCCCCceeechhHHHHHHHHHhhhhhHhhhhceeeccCCccceE Confidence 000000011 11111 111234456677777777889999999999999999999999999999998874 4567 Q ss_pred EEEEeCCcceeeeccccccccc--ccceeeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_011614. 
73 FTFWADKPGAYWVGEGQKIETS--KATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQ 150 (324) Q Consensus 73 ip~~~~~~~a~~v~Eg~~~~~~--~~~~~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~ 150 (324) +|+.++.+.+.|++|++.++.+ +++|++++++++|++++++||+|+++|+.++++++|.+.|++++++++|.++|+|+ T Consensus 158 ~~~~~~~~~~~~v~e~~~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~il~G~ 237 (404) T protein:vir:10 158 YEKRSKQKPMKPLSENQQIPTNGDNGKLERFNFKLKDLADFMSIPNDLLKFADKSLEDWIINWFVDKVRITRNAEILYGA 237 (404) T ss_pred EEEecCCcceeeccccccccccccccceeeeEeeheeeEeeehhhHHHHhhcHHHHHHHHHHHHHHHHHHHHHHHHhhcC Confidence 8888888999999999999875 58899999999999999999999999999999999999999999999999999999 Q ss_pred CcCcCCcccccccccccceeecccchhHHHHHHH-HhhhhccCCCEEEEcHHHHHHHHHhhccCCceeecc----CCCce Q lcl|NC_011614. 151 GNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLEA-LLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD----RNSDS 225 (324) Q Consensus 151 g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~-~l~~~~~~~~~~v~~~~~~~~L~~l~d~~g~~~~~~----~~~~~ 225 (324) |++..+.++...........++..+++++.+++. .+...+..+++|+|||+++.+|++++|++|+|+|.+ +.+++ T Consensus 238 g~~~~~~gi~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~ 317 (404) T protein:vir:10 238 GGDEHATGIMTANKFKKITLPKSPALKDFKKCKNVELLNVFKATSSWIVNQDGFNYLDSLEDKTGRPYLQPDPKDPTQYR 317 (404) T ss_pred CCCCcccceeeccccceeeccccccHHHHHHHHHhhhhccccCCCEEEEcHHHHHHHHHhhccCCceeeccCcCCCCCcc Confidence 9988888887776666666677788999988776 677888888999999999999999999999999864 34568 Q ss_pred ecccceEeecC----ccCCCceEEEeeccc-EEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccE Q lcl|NC_011614. 226 LDGLPVVNLKS----SNLKRGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALH 300 (324) Q Consensus 226 l~G~pv~~~~~----~~~~~~~i~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~ 300 (324) |+|+||+++++ ...++..+++|||++ +.++.+++++++++++. +..|++|++.||++.|+|+. 
T Consensus 318 l~G~PV~~~~~~~~~~~~~~~~~~~gd~s~~~~~~~~~~~~i~~~~~~------------~~~~~~~~~~~~~~~r~d~~ 385 (404) T protein:vir:10 318 FLGLPVIELPNDLLLSTESAIPVLLGDTKEAYKYVSDGAYELATTNIG------------AGAFETNTTKARIIMRIDGN 385 (404) T ss_pred ccceeeEEecccccCCCCCccEEEEEeccccEEEEEecceEEEEeccc------------cchhhcCceEEEEEEeeccE Confidence 99999986543 234566799999996 67888999999988653 24699999999999999999 Q ss_pred EecccceEEEEeeccCCCC Q lcl|NC_011614. 301 IADDKAFAKLVPADAKPSS 319 (324) Q Consensus 301 v~~~~a~~~l~~~~~~~~~ 319 (324) +.+|+||++++.++++... T Consensus 386 v~~~~a~~~~~~~~aa~~~ 404 (404) T protein:vir:10 386 VKDSEALLIAEIPVESVQA 404 (404) T ss_pred EecccceEEEEeecccCCC Confidence 9999999999988766655 No 44 >protein:vir:97053 Length: 390 # NCBI annotation: putative head protein # Family: family:all:585 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453565;genbank:gi:84662600;genbank:GeneID:5142468 Probab=100.00 E-value=2.1e-55 Score=320.29 Aligned_cols=297 Identities=15% Similarity=0.084 Sum_probs=248.1 Q ss_pred CchhhH-HHHHHHHHhhc-----cchhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCceEEE Q lcl|NC_011614. 1 MEQTQK-LKLNLQHFASN-----NVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFT 74 (324) Q Consensus 1 m~~~~~-~~~~~~~~~~~-----~~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip 74 (324) +.+.+. ++.+....... .......+.....++.++|+++|++++++|++.+++.+++++++++++++++.+.+| T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~lip~~~~~~ii~~~~~~~~i~~~~~~~~~~~~~~~~~ 160 (390) T protein:vir:97 81 MFVASEQFQASTGRWNDRSARATMNIKAALNTASTDAAGSAGALTTPNRLPGFITPPDARLTVRDLIGSGRTDSALIEYV 160 (390) T ss_pred hhhhhHHHHHHHHHhhhhhhhhhhHHHHHHHhhhcccccccccccchhhhHHHHHHHhhhhhhHhhcceeeccCCceEEE Confidence 211111 12222111111 111223334445667788999999999999999999999999999999999999999 Q ss_pred EEeCC-cceeeecccccccccccceeeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhccCcC Q lcl|NC_011614. 
75 FWADK-PGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNN 153 (324) Q Consensus 75 ~~~~~-~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~ 153 (324) +.++. +.+.|++||+++|+++++|+++++.++|++++++||+|+++++. +++++|.+.|++++++++|.++|+|+|++ T Consensus 161 ~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~ds~-~l~~~i~~~la~a~~~~~d~a~l~G~g~~ 239 (390) T protein:vir:97 161 QETGFVNNAAIVAEGALKPESSLKFAKKTDTTHVIAHTMKATRQILSDAP-QLASYMNNRLIRGLKVKEDAEILRGTGAN 239 (390) T ss_pred EEecCCcceeeecCCccccccccceeEEEEeeeeEEEeehhhHHHHHhHH-HHHHHHHHHHHHHHHHHHHHHHhhcCCCC Confidence 98764 68999999999999999999999999999999999999999985 89999999999999999999999999998 Q ss_pred cCCccccccccccc--ceeecccchhHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCceeecc---CCCceecc Q lcl|NC_011614. 154 PFGKSIAQSIEKTN--KVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD---RNSDSLDG 228 (324) Q Consensus 154 ~~~~~~~~~~~~~~--~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~g~~~~~~---~~~~~l~G 228 (324) ..|.+++....... ...++...++++.+++.++...+..+++|+|||+++.+|++++|++|+|+|.+ +.+++|+| T Consensus 240 ~~p~Gi~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~l~G 319 (390) T protein:vir:97 240 DGLLGLIPQATTYAAPTTIAGATRVDQLRLAMLQASLAEYPASGIVINPIDWAAIELAKDANNQYLIGNARGTLTPTLWG 319 (390) T ss_pred ccccceeeccccccccccccccchHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhcCCCceeecCccCCCCceecc Confidence 88888876544332 33445678899999999999999999999999999999999999999999864 34568999 Q ss_pred cceEeecCccCCCceEEEeeccc-EEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccce Q lcl|NC_011614. 229 LPVVNLKSSNLKRGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAF 307 (324) Q Consensus 229 ~pv~~~~~~~~~~~~i~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~ 307 (324) +||++++. ++++++++|||+. +.++.+++++++++++. 
.+|++|++++|++.|+||.+.+|+|| T Consensus 320 ~pV~~~~~--~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~-------------~~f~~~~~~~r~~~r~d~~v~~~~a~ 384 (390) T protein:vir:97 320 LPVVATQA--MAPGEFLVGAFDLAAQIFDQWDARVEIGYVN-------------DDFQRNMVTVLAEERLALVVYRPEAL 384 (390) T ss_pred eeeEEcCC--CCCCcEEEEeccceEEEEEecceEEEEeecc-------------cccccCcEEEEEEEeeccEEeccccE Confidence 99999764 5677899999996 66889999999988654 25899999999999999999999999 Q ss_pred EEEEee Q lcl|NC_011614. 308 AKLVPA 313 (324) Q Consensus 308 ~~l~~~ 313 (324) ++++.+ T Consensus 385 v~~~~a 390 (390) T protein:vir:97 385 ITGSFA 390 (390) T ss_pred EEEEeC Confidence 999998 No 45 >protein:vir:6212 Length: 434 # NCBI annotation: prohead protease # Family: family:all:21 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852592;genbank:gi:31415852;genbank:GeneID:1489210 Probab=100.00 E-value=2.8e-55 Score=319.68 Aligned_cols=301 Identities=10% Similarity=0.080 Sum_probs=242.4 Q ss_pred Cchhh---HHHHHHHHHhhccchhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCceEEEEEe Q lcl|NC_011614. 1 MEQTQ---KLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWA 77 (324) Q Consensus 1 m~~~~---~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~~ 77 (324) .++.+ ..+...+.+........+.++.+ .++++||.+||++++++|++.++++++++++++++++.+ ..++|+.. T Consensus 114 ~~~~~~~~e~r~a~~~~l~~~~~~~e~~a~~-~~t~~GG~lvP~~~~~~Ii~~l~~~~~i~~~~~~~~~~~-~~~~p~~~ 191 (434) T protein:vir:62 114 GHRTNKETEIRSVFANYIVGNIDEKEARALG-LVTGNGSVTIPDFLSKEIITYAQEENFLRRLGTGVKTKE-NIKYPVLV 191 (434) T ss_pred cccchHHHHHHHHHHHHhccccchhhhhhhc-ccccccceecchhhHHHHHHhhhhhhhhhhhcceeccCC-ceEEEEEe Confidence 11111 12222333444445555666654 344578899999999999999999999999999988865 57899998 Q ss_pred CCcceeee---cccccccccccceeeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhccCcCc Q lcl|NC_011614. 
78 DKPGAYWV---GEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNP 154 (324) Q Consensus 78 ~~~~a~~v---~Eg~~~~~~~~~~~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~~ 154 (324) ..+.+.|. +|++.++.++++|+++++.++|++++++||+|+++|+.++++++|.+.|++++++++|.++|+|+|++. T Consensus 192 ~~~~a~~~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~ 271 (434) T protein:vir:62 192 KKAEAQGHKNERTNNEMPETDIEFDEIELSPTEFDALATVTKKLLARTGLPIEQIVMDELKKAYVRKETQYMVNGDEANN 271 (434) T ss_pred cCCcccceecccccccccccccceeeEEeeheeeEeehhhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhccCCCCc Confidence 88877775 567888999999999999999999999999999999999999999999999999999999999999887 Q ss_pred CCcccccccccccceeecccchhHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCceeecc------CCCceecc Q lcl|NC_011614. 155 FGKSIAQSIEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD------RNSDSLDG 228 (324) Q Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~g~~~~~~------~~~~~l~G 228 (324) .+.++...... ....++..+++++++|+.++..+++.+++|+|||.++.+|++++|++|+|+|.+ +.+.+|+| T Consensus 272 ~~~g~~~~~~~-~~~~~~~~~~d~l~~l~~~l~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~g~~~tl~G 350 (434) T protein:vir:62 272 INDGALAKKAV-EFKTDEKNLYDALVKMKNTPVKEVRKKARWVLNTAALTKIETMKTDDGFPLLRPFNQAEGGIGYTLLG 350 (434) T ss_pred cccceeecccc-cccccccchhhHHHHHHhhcchhhhcCCEEEEcHHHHHHHHHhhccCCCEeeccCCCccCCCCceecc Confidence 77666554333 333455678999999999999999999999999999999999999999999854 33458999 Q ss_pred cceEeecCccCC----CceEEEeecccEEEEEec-ceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEec Q lcl|NC_011614. 229 LPVVNLKSSNLK----RGELITGDFDKLIYGIPQ-LIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIAD 303 (324) Q Consensus 229 ~pv~~~~~~~~~----~~~i~~gd~~~~~~~~~~-~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~ 303 (324) +||++++..+.+ ...+++|||++++++.+. 
.++++.+.+ .+|.+|+|+||++.|+|+++++ T Consensus 351 ~pV~~~~~~~~~~~~~~~~i~~Gdfs~~~i~~~~g~~~i~~~~~--------------~~~~~~~v~~~~~~r~Dgk~i~ 416 (434) T protein:vir:62 351 FPVEEEDAIDIPDSPDTPVFYFGDFSKFYIQDVIGSLEVQKLVE--------------LFSRTNRVGFRIWNLLDAQLIH 416 (434) T ss_pred eeeEEecCccCccCCCceEEEEeeccceEEEEeeceeEEEeehh--------------hhcccCceEEEEEeeecceeec Confidence 999998765433 344889999999988875 466776654 3578999999999999999876 Q ss_pred -ccceEEEE--eeccCCC Q lcl|NC_011614. 304 -DKAFAKLV--PADAKPS 318 (324) Q Consensus 304 -~~a~~~l~--~~~~~~~ 318 (324) |.++++++ ++.++++ T Consensus 417 ~~~~~~~~~~~~~~~~~~ 434 (434) T protein:vir:62 417 SPFEVPVYKYVLKAPTGA 434 (434) T ss_pred CcccceEEEEEeccCCCC Confidence 99888764 4444444 No 46 >protein:vir:10364 Length: 390 # NCBI annotation: head protein; major capsid subunit precursor # Family: family:all:585 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858956;genbank:gi:32128421;genbank:GeneID:2648357 Probab=100.00 E-value=3.8e-55 Score=318.93 Aligned_cols=297 Identities=14% Similarity=0.097 Sum_probs=245.5 Q ss_pred Cchhh-HHHHHHHHHhhccch-----hhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCceEEE Q lcl|NC_011614. 1 MEQTQ-KLKLNLQHFASNNVK-----PQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFT 74 (324) Q Consensus 1 m~~~~-~~~~~~~~~~~~~~~-----~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip 74 (324) +...+ .++.+.......... ....+.....++..+|+++|+++.++|++.+++.++|++++++++++++.+.+| T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~ 160 (390) T protein:vir:10 81 LFVASEQFQASAGRWNDRSARATMNIKAALNTASTDAAGSAGALTTPNRLPGFITQPDARLTVRDLIGSGRTDSALIEYV 160 (390) T ss_pred hhhhhHHHHHHHHhhhhhhhhhhhHHHHHHHhhhcccccccccccchhHHHHHHHHHHhhchhhhhcceeeccCCceEEE Confidence 11111 122222221111100 111122333455667889999999999999999999999999999999999999 Q ss_pred EEeCC-cceeeecccccccccccceeeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhccCcC Q lcl|NC_011614. 
75 FWADK-PGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNN 153 (324) Q Consensus 75 ~~~~~-~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~ 153 (324) ++++. +.+.|++||+++|+++++|+++++.++|++++++||+|+++++. +++++|.+.|++++++++|+++|+|+|++ T Consensus 161 ~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~~-~l~~~i~~~l~~~~~~~~~~~il~G~G~~ 239 (390) T protein:vir:10 161 QETGFVNNAAIVAEGALKPESSLKFAKKTDTTHVIAHTMKATRQILSDAP-QLASYMNNRLIRGLKVKEDAEILRGTGAN 239 (390) T ss_pred EEecCCcceeeecCCccccccccceeEEEEeeEEEEEeehhhHHHHHhHH-HHHHHHHHHHHHHHHHHHHHHHhhcCCCC Confidence 98865 68999999999999999999999999999999999999999985 89999999999999999999999999998 Q ss_pred cCCccccccccccc--ceeecccchhHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCceeeccC---CCceecc Q lcl|NC_011614. 154 PFGKSIAQSIEKTN--KVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDR---NSDSLDG 228 (324) Q Consensus 154 ~~~~~~~~~~~~~~--~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~g~~~~~~~---~~~~l~G 228 (324) ..|.|++....... ...++...++++.+++.++...++.+++|+|||+++..|++++|++|+|+|... .+++|+| T Consensus 240 ~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~v~n~~~~~~L~~lkd~~g~~l~~~~~~~~~~~l~G 319 (390) T protein:vir:10 240 DGLLGLIPQATTYAAPTTIAGATRVDQLRLAMLQASLAEYPASGIVINPIDWAAIELAKDANNQYLIGNARGTLTPTLWG 319 (390) T ss_pred ccccccccccccccccccccccchHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhcCCCceeecCCcCcCCceecc Confidence 88888877654433 334456678999999999999999999999999999999999999999998643 3568999 Q ss_pred cceEeecCccCCCceEEEeeccc-EEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccce Q lcl|NC_011614. 229 LPVVNLKSSNLKRGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAF 307 (324) Q Consensus 229 ~pv~~~~~~~~~~~~i~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~ 307 (324) +||++++. ++.+.+++|||+. 
+.++.++++++++++++ .+|++|++.||++.|+|+++.+|+|| T Consensus 320 ~pv~~~~~--~p~~~~~~gdf~~~~~~~~~~~~~i~~~~~~-------------~~~~~~~~~~r~~~r~d~~v~~~~a~ 384 (390) T protein:vir:10 320 LPVVATQA--MAPGEFLVGAFDLAAQIFDQWDARVEIGYVN-------------DDFQRNMVTVLAEERLALVVYRPEAL 384 (390) T ss_pred eeeEEcCC--CCCCcEEEEeccceEEEEEecceEEEEeecc-------------cccccCcEEEEEEEeeccEEeccccE Confidence 99998764 5567899999997 56789999999988754 24899999999999999999999999 Q ss_pred EEEEee Q lcl|NC_011614. 308 AKLVPA 313 (324) Q Consensus 308 ~~l~~~ 313 (324) ++++.+ T Consensus 385 ~~~~~a 390 (390) T protein:vir:10 385 ISGSFA 390 (390) T ss_pred EEEEeC Confidence 999998 No 47 >protein:vir:81070 Length: 390 # NCBI annotation: p09 # Family: family:all:585 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285679;genbank:gi:148727187;genbank:GeneID:5247115 Probab=100.00 E-value=3.5e-55 Score=319.15 Aligned_cols=297 Identities=14% Similarity=0.089 Sum_probs=246.1 Q ss_pred Cchhh-HHHHHHHHHhhc-cchh----hhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCceEEE Q lcl|NC_011614. 1 MEQTQ-KLKLNLQHFASN-NVKP----QVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFT 74 (324) Q Consensus 1 m~~~~-~~~~~~~~~~~~-~~~~----~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip 74 (324) +...+ .++.+....... .... ...+.....+++++|+++|+++..+|++.+++.++|++++++++++++.+.+| T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~ 160 (390) T protein:vir:81 81 MFVASEQFQASAGRWNDRSARATMNIKAALNTASTDAAGSAGALTTPNRLPGFITPPDARLTVRDLIGSGRTDSALIEYV 160 (390) T ss_pred hhhhhHHHHHHHHHHhhhhhhhhhHHHHHHHhhccccccCCcceechhhhHHHHHHHhhhhhhhhhcceeeccCCceEEE Confidence 11111 122222211111 1111 11122233456778899999999999999999999999999999999999999 Q ss_pred EEeCC-cceeeecccccccccccceeeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhccCcC Q lcl|NC_011614. 
75 FWADK-PGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNN 153 (324) Q Consensus 75 ~~~~~-~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~ 153 (324) +.+.. +.+.|++||+.+|+++++|+++++.++|++++++||+|+++++. +++++|.+.|++++++++|.++|+|+|++ T Consensus 161 ~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~~-~~~~~i~~~l~~~~~~~~d~a~l~G~g~~ 239 (390) T protein:vir:81 161 QETGFVNNAAIVAEGALKPESSLKFAKKTDTTHVIAHTMKATRQILSDAP-QLASYMNNRLIRGLKVKEDAEILRGTGAN 239 (390) T ss_pred EEecCCcceeeecCCcccccccceeeEEEEeeeEEEEeehhhHHHHHhHH-HHHHHHHHHHHHHHHHHHHHHHHhcCCCC Confidence 98764 58999999999999999999999999999999999999999985 89999999999999999999999999998 Q ss_pred cCCccccccccccc--ceeecccchhHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCceeecc---CCCceecc Q lcl|NC_011614. 154 PFGKSIAQSIEKTN--KVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD---RNSDSLDG 228 (324) Q Consensus 154 ~~~~~~~~~~~~~~--~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~g~~~~~~---~~~~~l~G 228 (324) ..+.+++....... ...++..+++++.+++.++...++.+++|+|||+++..|++++|++|+|+|.. +.+++|+| T Consensus 240 ~~~~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~l~G 319 (390) T protein:vir:81 240 DGLLGLIPQATTYAAPTTIAGATRVDQLRLAMLQASLAEYNPSGIVINPIDWAAIELAKDANNQYLIGNARGTLTPTLWG 319 (390) T ss_pred CcccceeecccccccccccccchhHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhcCCCceeecCcccccCceecc Confidence 88888876554432 33445678999999999999999999999999999999999999999999864 34568999 Q ss_pred cceEeecCccCCCceEEEeeccc-EEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccce Q lcl|NC_011614. 229 LPVVNLKSSNLKRGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAF 307 (324) Q Consensus 229 ~pv~~~~~~~~~~~~i~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~ 307 (324) +||++++. ++++.+++|||+. +.++.+++++++.+++.. 
+|++|++.||++.|+|+++.+|+|| T Consensus 320 ~pv~~~~~--~p~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~-------------~~~~~~v~~r~~~r~d~~v~~~~a~ 384 (390) T protein:vir:81 320 LPVVATQA--MAPGEFLVGAFDLAAQIFDQWDARVEIGYVGE-------------DFQRNMITVLAEERLALVVYRPEAL 384 (390) T ss_pred eeeEEcCC--CCCCcEEEEehhceEEEEEecceEEEEecccc-------------hhhcCcEEEEEEEeeccEEecccce Confidence 99998764 5677899999997 567889999999886542 5999999999999999999999999 Q ss_pred EEEEee Q lcl|NC_011614. 308 AKLVPA 313 (324) Q Consensus 308 ~~l~~~ 313 (324) ++++.+ T Consensus 385 v~~t~a 390 (390) T protein:vir:81 385 ISGSFA 390 (390) T ss_pred EEEEeC Confidence 999998 No 48 >protein:vir:95376 Length: 425 # NCBI annotation: phage major capsid protein # Family: family:all:635 # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764476;genbank:gi:115334630;genbank:GeneID:5179263 Probab=100.00 E-value=4.4e-55 Score=318.60 Aligned_cols=302 Identities=16% Similarity=0.142 Sum_probs=240.8 Q ss_pred CchhhHHHHH--HHHHhhccchhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCceEEEEEeC Q lcl|NC_011614. 1 MEQTQKLKLN--LQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWAD 78 (324) Q Consensus 1 m~~~~~~~~~--~~~~~~~~~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~ 78 (324) +.+....+.. ...+.+........+.....+++++|.+||+++.++|++.+++.++++++++++++.+ ..++|+..+ T Consensus 110 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vP~~~~~~Ii~~l~~~~~i~~~~~~~~~~g-~~~ip~~~~ 188 (425) T protein:vir:95 110 MNRLQVREMLKTGEYYKRSEVVEFYEKFRNLRAVAGGELTIPEVVVNRIMDIMGDYTTLYPLVDKIRVKG-TTRILVDTD 188 (425) T ss_pred HHHHHHHHHHhhhhhhhhhHHHHHHHHHHhhcccccCceeccHHHHHHHHHHHHhhhhHHHhhceeecCc-eeEEEEecC Confidence 0000000000 0111111111111112233455678889999999999999999999999999999865 578999999 Q ss_pred Ccceeeecccccccccc-cceeeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhccCcC-cCC Q lcl|NC_011614. 
79 KPGAYWVGEGQKIETSK-ATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNN-PFG 156 (324) Q Consensus 79 ~~~a~~v~Eg~~~~~~~-~~~~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~-~~~ 156 (324) .+.+.|++|++++|+++ ++|++|++++++++++++||+|+++|+.++++++|.++|++++++++|.++|+|+|++ ..| T Consensus 189 ~~~a~~v~E~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~il~G~G~~~~~p 268 (425) T protein:vir:95 189 TSPATWIEQSGALPTGDVGTIASIDFDGFKVGKVTFVDNYLLQDSIINLDDYVTKKIARAIAKALDLAIVKGTGAANKQP 268 (425) T ss_pred CccccccccccccccccccccceeeeeheeeeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcccc Confidence 99999999999999876 6899999999999999999999999999999999999999999999999999999975 456 Q ss_pred ccccccccccc--ceeecccchhHHHHHHHHhhhhcc--CCCEEEEcHHHHH----HHHHhhccCCceeec--cCCCcee Q lcl|NC_011614. 157 KSIAQSIEKTN--KVIKGDFTQDNIIDLEALLEDDEL--EANAFISKTQNRS----LLRKIVDPETKERIY--DRNSDSL 226 (324) Q Consensus 157 ~~~~~~~~~~~--~~~~~~~~~~~i~~~~~~l~~~~~--~~~~~v~~~~~~~----~L~~l~d~~g~~~~~--~~~~~~l 226 (324) .|++....... ...++..+++++.+++..+..++. .+++|+||+.++. .|+.++|++|+|++. ....++| T Consensus 269 ~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~l~~l~~~kd~~g~~i~~~~~~~~~~l 348 (425) T protein:vir:95 269 LGIIPSLPPENQVTVEADNNLLKNLVKQIGLIDTGDDSVGEIVAVMKRSTYYNRLVEFSIQVDSNGNVVGKLPNLRTPDL 348 (425) T ss_pred ceeecccccccccccccccchHHHHHHHHHhhhhhccccCceEEEEeChHHHHHHHHHHhhcCCCCceeeccCCCCCccc Confidence 77766543322 334567789999999998877664 4567999999853 456788999999975 3445789 Q ss_pred cccceEeecCccCCCceEEEeecccEEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccc Q lcl|NC_011614. 227 DGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKA 306 (324) Q Consensus 227 ~G~pv~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a 306 (324) +|+||+++++. ++..++||||++++++.+++++++++++. 
+|.+|+++||++.|+|+++.+|+| T Consensus 349 ~G~pvv~~~~~--~~~~i~~Gd~~~~~~~~~~~~~i~~~~~~--------------~f~~~~~~~~~~~r~d~~~~~~~a 412 (425) T protein:vir:95 349 LGLRVVFNNFL--DDDTVLFGEFEQYTLVERENITIDSSTHV--------------KFTEDQTAFRGKGRFDGKPVKPEA 412 (425) T ss_pred cceeeEEcCcC--CCccEEEEecccEEEEeecceEEEeeccc--------------ccccCceEEEEEEeeCcEeecccc Confidence 99999987654 56689999999999999999999998764 489999999999999999999999 Q ss_pred eEEEEeeccCCCC Q lcl|NC_011614. 307 FAKLVPADAKPSS 319 (324) Q Consensus 307 ~~~l~~~~~~~~~ 319 (324) |++++.+++.... T Consensus 413 ~~~~~i~~~~~g~ 425 (425) T protein:vir:95 413 FVLVTITDPVQGA 425 (425) T ss_pred eEEEEecCcCCCC Confidence 9999998866655 No 49 >protein:vir:4953 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049929;genbank:gi:9632900;genbank:GeneID:1262076 Probab=100.00 E-value=1.2e-54 Score=316.15 Aligned_cols=297 Identities=16% Similarity=0.148 Sum_probs=239.2 Q ss_pred CchhhHHHHHHHHHhhccchhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCc--eEEEEEeC Q lcl|NC_011614. 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTE--KKFTFWAD 78 (324) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~--~~ip~~~~ 78 (324) ++.+.......+.+.+.... ....+....++++||.+||++++++|++.+++.++|+++|+++++++.. +.+|+... T Consensus 84 ~~~~~~~~~~~~~~l~~~~~-~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~ 162 (397) T protein:vir:49 84 EEVKAGFVKDFKNLVRGRYQ-NLLDSKTDASGSDAGLTIPQDIQTAIHTLVSQYDSLQEYVNVENVTTLTGSRVYEKWTD 162 (397) T ss_pred hHHHHHHHHHHHHHHhcchh-HHHHHhhccccccCcccccHhHHHHHHHHHHhhhhHHhhhceeecccCccceEEEeecc Confidence 22222222222223222221 1222334456677899999999999999999999999999999987554 45566544 Q ss_pred -Ccceeeecccccccc-cccceeeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhccCcCcCC Q lcl|NC_011614. 
79 -KPGAYWVGEGQKIET-SKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFG 156 (324) Q Consensus 79 -~~~a~~v~Eg~~~~~-~~~~~~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~~~~ 156 (324) .+.+.|++||+.+++ ++++|++++++++|++++++||+|+++||.++++++|.+.|++++++++|.++|+|+|++... T Consensus 163 ~~~~a~~v~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~ai~~G~g~~~~~ 242 (397) T protein:vir:49 163 ITGLANIDDEAGKIADVDDPKLSLIKYTIKRYAGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILEAIAALPTK 242 (397) T ss_pred CCcceeeecCccccccccccceeeEEeeeeeEEeeehhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc Confidence 467999999999996 679999999999999999999999999999999999999999999999999999998876532 Q ss_pred cccccccccccceeecccchhHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCceeecc----CCCceecccceE Q lcl|NC_011614. 157 KSIAQSIEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD----RNSDSLDGLPVV 232 (324) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~g~~~~~~----~~~~~l~G~pv~ 232 (324) .+..+++++.+++.++..++..+++|+|||+++..|++++|++|+|+|.. +.+++|+|+||+ T Consensus 243 --------------~~~~~~d~i~~~~~~l~~~~~~~a~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~ 308 (397) T protein:vir:49 243 --------------PTLTKWDDIIDLEAKVDPAIKQTSFFLTNTSGFTALKKVKNALGDYLMERDVKSPTGYSIDGFAVK 308 (397) T ss_pred --------------cccccHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHHhhcCCCceeeccCcCCCCCceecceeeE Confidence 23457999999999999999999999999999999999999999999853 345689999998 Q ss_pred eecC-----ccCCCceEEEeeccc-EEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccc Q lcl|NC_011614. 233 NLKS-----SNLKRGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKA 306 (324) Q Consensus 233 ~~~~-----~~~~~~~i~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a 306 (324) ++++ ...++..+++|||+. +.++.+++++++++++.. 
++|++|++.+|++.|+|+++.+|+| T Consensus 309 ~~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~~i~~~~~~~------------~~~~~~~~~~r~~~r~d~~~~~~~a 376 (397) T protein:vir:49 309 EVADRWLANGTGGAMPLYFGDLKQAVTLFDRQHMSLLSTNIGG------------GAFETDTTKVRVIDRFDVVATDTEA 376 (397) T ss_pred EecccccccccCCceeEEEeeccceEEEEeecceEEEEecccc------------chhhcCceeEEEEeeeCcEEecccc Confidence 7543 334566799999997 678999999999987642 4699999999999999999999999 Q ss_pred eEEEEeec-cCCCCccccC Q lcl|NC_011614. 307 FAKLVPAD-AKPSSVPGEV 324 (324) Q Consensus 307 ~~~l~~~~-~~~~~~~~~~ 324 (324) |++++.++ +.++++.+-. T Consensus 377 ~~~~~~~~~~~~~~~~~~~ 395 (397) T protein:vir:49 377 FVPASFKAIADQKGNLGST 395 (397) T ss_pred eEEEEeecccCCCCCcccc Confidence 99999665 3344443333 No 50 >protein:vir:107593 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338188;genbank:gi:77020144;genbank:GeneID:3703724 Probab=100.00 E-value=1.4e-54 Score=315.78 Aligned_cols=296 Identities=14% Similarity=0.103 Sum_probs=240.2 Q ss_pred CchhhHHHHHHHHHhhccc----------hhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCc Q lcl|NC_011614. 1 MEQTQKLKLNLQHFASNNV----------KPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTE 70 (324) Q Consensus 1 m~~~~~~~~~~~~~~~~~~----------~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~ 70 (324) .+...+.+....++.++.. ...+.++....++++||.+||+++.++|++.+++.++|++++++++++++. T Consensus 70 ~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~ 149 (392) T protein:vir:10 70 VDGEMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRS 149 (392) T ss_pred ccchHHHHHHHHHHHhcccccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCc Confidence 1111111111111111111 112333444556677888999999999999999999999999999998665 Q ss_pred e--EEEEEeCCcceeeeccccccccc-ccceeeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011614. 
71 K--KFTFWADKPGAYWVGEGQKIETS-KATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGI 147 (324) Q Consensus 71 ~--~ip~~~~~~~a~~v~Eg~~~~~~-~~~~~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l 147 (324) . .+|+.++.+.+.|++|+++++++ .++|+++++.++|++++++||+|+++||.++++++|.+.|++++++++|.+++ T Consensus 150 ~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~ 229 (392) T protein:vir:10 150 GSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLIL 229 (392) T ss_pred eeEEEEeecCCccceeecccccccccccccceeEEeeeeeEEEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 4 46666777889999999999976 58999999999999999999999999999999999999999999999999999 Q ss_pred hccCcCcCCcccccccccccceeecccchhHHHHHH-HHhhhhccCCCEEEEcHHHHHHHHHhhccCCceeecc----CC Q lcl|NC_011614. 148 LNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLE-ALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD----RN 222 (324) Q Consensus 148 ~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~-~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~g~~~~~~----~~ 222 (324) +|+|++.. .+..+++++++++ ..+...++.++.|+|||+++.+|+++||++|+|+|.. +. T Consensus 230 ~g~g~~~~---------------~~~~~~d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~ 294 (392) T protein:vir:10 230 GVIEKLTK---------------QAIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKN 294 (392) T ss_pred hccccccc---------------cCccCHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCCeEeecCccCCc Confidence 99876432 2346788999876 5888899999999999999999999999999999853 44 Q ss_pred CceecccceEeec--------CccCCCceEEEeeccc-EEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEE Q lcl|NC_011614. 223 SDSLDGLPVVNLK--------SSNLKRGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRA 293 (324) Q Consensus 223 ~~~l~G~pv~~~~--------~~~~~~~~i~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~ 293 (324) +++|+|+|+++.. ....+...+++|||+. +.++.+.+++++++++.. 
.+|++|++.||+ T Consensus 295 ~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~------------~~f~~~~~~~r~ 362 (392) T protein:vir:10 295 KKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGG------------KAFTRNTLDLRA 362 (392) T ss_pred cccccCcccEEEecccccCCCcccCCceEEEEEehhceEEEEeecceEEEEecccc------------chhhcCceEEEE Confidence 5789998766532 2334666799999997 568999999999887542 369999999999 Q ss_pred EEEeccEEecccceEEEEeeccCCCCcccc Q lcl|NC_011614. 294 TMHVALHIADDKAFAKLVPADAKPSSVPGE 323 (324) Q Consensus 294 ~~r~d~~v~~~~a~~~l~~~~~~~~~~~~~ 323 (324) +.|+|+++.+|+||++++.++++|+.+|+- T Consensus 363 ~~r~d~~v~~~~a~~~l~~~~~a~~~~~~~ 392 (392) T protein:vir:10 363 IQRDDVQMWDNEAAVYGEIDLSAPVEQPQG 392 (392) T ss_pred EEeeccEEecccceEEEEecccccccCCCC Confidence 999999999999999999999999999988 No 51 >protein:vir:102873 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338137;genbank:gi:77020198;genbank:GeneID:3703782 Probab=100.00 E-value=1.4e-54 Score=315.78 Aligned_cols=296 Identities=14% Similarity=0.103 Sum_probs=240.2 Q ss_pred CchhhHHHHHHHHHhhccc----------hhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCc Q lcl|NC_011614. 1 MEQTQKLKLNLQHFASNNV----------KPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTE 70 (324) Q Consensus 1 m~~~~~~~~~~~~~~~~~~----------~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~ 70 (324) .+...+.+....++.++.. ...+.++....++++||.+||+++.++|++.+++.++|++++++++++++. 
T Consensus 70 ~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~ 149 (392) T protein:vir:10 70 VDGEMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRS 149 (392) T ss_pred ccchHHHHHHHHHHHhcccccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCc Confidence 1111111111111111111 112333444556677888999999999999999999999999999998665 Q ss_pred e--EEEEEeCCcceeeeccccccccc-ccceeeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011614. 71 K--KFTFWADKPGAYWVGEGQKIETS-KATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGI 147 (324) Q Consensus 71 ~--~ip~~~~~~~a~~v~Eg~~~~~~-~~~~~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l 147 (324) . .+|+.++.+.+.|++|+++++++ .++|+++++.++|++++++||+|+++||.++++++|.+.|++++++++|.+++ T Consensus 150 ~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~ 229 (392) T protein:vir:10 150 GSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLIL 229 (392) T ss_pred eeEEEEeecCCccceeecccccccccccccceeEEeeeeeEEEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 4 46666777889999999999976 58999999999999999999999999999999999999999999999999999 Q ss_pred hccCcCcCCcccccccccccceeecccchhHHHHHH-HHhhhhccCCCEEEEcHHHHHHHHHhhccCCceeecc----CC Q lcl|NC_011614. 148 LNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLE-ALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD----RN 222 (324) Q Consensus 148 ~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~-~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~g~~~~~~----~~ 222 (324) +|+|++.. .+..+++++++++ ..+...++.++.|+|||+++.+|+++||++|+|+|.. +. 
T Consensus 230 ~g~g~~~~---------------~~~~~~d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~ 294 (392) T protein:vir:10 230 GVIEKLTK---------------QAIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKN 294 (392) T ss_pred hccccccc---------------cCccCHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCCeEeecCccCCc Confidence 99876432 2346788999876 5888899999999999999999999999999999853 44 Q ss_pred CceecccceEeec--------CccCCCceEEEeeccc-EEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEE Q lcl|NC_011614. 223 SDSLDGLPVVNLK--------SSNLKRGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRA 293 (324) Q Consensus 223 ~~~l~G~pv~~~~--------~~~~~~~~i~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~ 293 (324) +++|+|+|+++.. ....+...+++|||+. +.++.+.+++++++++.. .+|++|++.||+ T Consensus 295 ~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~------------~~f~~~~~~~r~ 362 (392) T protein:vir:10 295 KKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGG------------KAFTRNTLDLRA 362 (392) T ss_pred cccccCcccEEEecccccCCCcccCCceEEEEEehhceEEEEeecceEEEEecccc------------chhhcCceEEEE Confidence 5789998766532 2334666799999997 568999999999887542 369999999999 Q ss_pred EEEeccEEecccceEEEEeeccCCCCcccc Q lcl|NC_011614. 294 TMHVALHIADDKAFAKLVPADAKPSSVPGE 323 (324) Q Consensus 294 ~~r~d~~v~~~~a~~~l~~~~~~~~~~~~~ 323 (324) +.|+|+++.+|+||++++.++++|+.+|+- T Consensus 363 ~~r~d~~v~~~~a~~~l~~~~~a~~~~~~~ 392 (392) T protein:vir:10 363 IQRDDVQMWDNEAAVYGEIDLSAPVEQPQG 392 (392) T ss_pred EEeeccEEecccceEEEEecccccccCCCC Confidence 999999999999999999999999999988 No 52 >protein:vir:102082 Length: 392 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512315;genbank:gi:89152484;genbank:GeneID:3953075 Probab=100.00 E-value=1.4e-54 Score=315.78 Aligned_cols=296 Identities=14% Similarity=0.103 Sum_probs=240.2 Q ss_pred CchhhHHHHHHHHHhhccc----------hhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCc Q lcl|NC_011614. 
1 MEQTQKLKLNLQHFASNNV----------KPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTE 70 (324) Q Consensus 1 m~~~~~~~~~~~~~~~~~~----------~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~ 70 (324) .+...+.+....++.++.. ...+.++....++++||.+||+++.++|++.+++.++|++++++++++++. T Consensus 70 ~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~ 149 (392) T protein:vir:10 70 VDGEMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRS 149 (392) T ss_pred ccchHHHHHHHHHHHhcccccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCc Confidence 1111111111111111111 112333444556677888999999999999999999999999999998665 Q ss_pred e--EEEEEeCCcceeeeccccccccc-ccceeeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011614. 71 K--KFTFWADKPGAYWVGEGQKIETS-KATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGI 147 (324) Q Consensus 71 ~--~ip~~~~~~~a~~v~Eg~~~~~~-~~~~~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l 147 (324) . .+|+.++.+.+.|++|+++++++ .++|+++++.++|++++++||+|+++||.++++++|.+.|++++++++|.+++ T Consensus 150 ~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~ 229 (392) T protein:vir:10 150 GSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLIL 229 (392) T ss_pred eeEEEEeecCCccceeecccccccccccccceeEEeeeeeEEEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 4 46666777889999999999976 58999999999999999999999999999999999999999999999999999 Q ss_pred hccCcCcCCcccccccccccceeecccchhHHHHHH-HHhhhhccCCCEEEEcHHHHHHHHHhhccCCceeecc----CC Q lcl|NC_011614. 148 LNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLE-ALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD----RN 222 (324) Q Consensus 148 ~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~-~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~g~~~~~~----~~ 222 (324) +|+|++.. .+..+++++++++ ..+...++.++.|+|||+++.+|+++||++|+|+|.. +. 
T Consensus 230 ~g~g~~~~---------------~~~~~~d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~ 294 (392) T protein:vir:10 230 GVIEKLTK---------------QAIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKN 294 (392) T ss_pred hccccccc---------------cCccCHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCCeEeecCccCCc Confidence 99876432 2346788999876 5888899999999999999999999999999999853 44 Q ss_pred CceecccceEeec--------CccCCCceEEEeeccc-EEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEE Q lcl|NC_011614. 223 SDSLDGLPVVNLK--------SSNLKRGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRA 293 (324) Q Consensus 223 ~~~l~G~pv~~~~--------~~~~~~~~i~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~ 293 (324) +++|+|+|+++.. ....+...+++|||+. +.++.+.+++++++++.. .+|++|++.||+ T Consensus 295 ~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~------------~~f~~~~~~~r~ 362 (392) T protein:vir:10 295 KKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGG------------KAFTRNTLDLRA 362 (392) T ss_pred cccccCcccEEEecccccCCCcccCCceEEEEEehhceEEEEeecceEEEEecccc------------chhhcCceEEEE Confidence 5789998766532 2334666799999997 568999999999887542 369999999999 Q ss_pred EEEeccEEecccceEEEEeeccCCCCcccc Q lcl|NC_011614. 294 TMHVALHIADDKAFAKLVPADAKPSSVPGE 323 (324) Q Consensus 294 ~~r~d~~v~~~~a~~~l~~~~~~~~~~~~~ 323 (324) +.|+|+++.+|+||++++.++++|+.+|+- T Consensus 363 ~~r~d~~v~~~~a~~~l~~~~~a~~~~~~~ 392 (392) T protein:vir:10 363 IQRDDVQMWDNEAAVYGEIDLSAPVEQPQG 392 (392) T ss_pred EEeeccEEecccceEEEEecccccccCCCC Confidence 999999999999999999999999999988 No 53 >protein:vir:105004 Length: 392 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459969;genbank:gi:85701384;genbank:GeneID:3882145 Probab=100.00 E-value=1.4e-54 Score=315.78 Aligned_cols=296 Identities=14% Similarity=0.103 Sum_probs=240.2 Q ss_pred CchhhHHHHHHHHHhhccc----------hhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCc Q lcl|NC_011614. 
1 MEQTQKLKLNLQHFASNNV----------KPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTE 70 (324) Q Consensus 1 m~~~~~~~~~~~~~~~~~~----------~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~ 70 (324) .+...+.+....++.++.. ...+.++....++++||.+||+++.++|++.+++.++|++++++++++++. T Consensus 70 ~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~ 149 (392) T protein:vir:10 70 VDGEMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRS 149 (392) T ss_pred ccchHHHHHHHHHHHhcccccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCc Confidence 1111111111111111111 112333444556677888999999999999999999999999999998665 Q ss_pred e--EEEEEeCCcceeeeccccccccc-ccceeeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011614. 71 K--KFTFWADKPGAYWVGEGQKIETS-KATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGI 147 (324) Q Consensus 71 ~--~ip~~~~~~~a~~v~Eg~~~~~~-~~~~~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l 147 (324) . .+|+.++.+.+.|++|+++++++ .++|+++++.++|++++++||+|+++||.++++++|.+.|++++++++|.+++ T Consensus 150 ~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~ 229 (392) T protein:vir:10 150 GSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLIL 229 (392) T ss_pred eeEEEEeecCCccceeecccccccccccccceeEEeeeeeEEEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 4 46666777889999999999976 58999999999999999999999999999999999999999999999999999 Q ss_pred hccCcCcCCcccccccccccceeecccchhHHHHHH-HHhhhhccCCCEEEEcHHHHHHHHHhhccCCceeecc----CC Q lcl|NC_011614. 148 LNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLE-ALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD----RN 222 (324) Q Consensus 148 ~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~-~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~g~~~~~~----~~ 222 (324) +|+|++.. .+..+++++++++ ..+...++.++.|+|||+++.+|+++||++|+|+|.. +. 
T Consensus 230 ~g~g~~~~---------------~~~~~~d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~ 294 (392) T protein:vir:10 230 GVIEKLTK---------------QAIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKN 294 (392) T ss_pred hccccccc---------------cCccCHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCCeEeecCccCCc Confidence 99876432 2346788999876 5888899999999999999999999999999999853 44 Q ss_pred CceecccceEeec--------CccCCCceEEEeeccc-EEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEE Q lcl|NC_011614. 223 SDSLDGLPVVNLK--------SSNLKRGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRA 293 (324) Q Consensus 223 ~~~l~G~pv~~~~--------~~~~~~~~i~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~ 293 (324) +++|+|+|+++.. ....+...+++|||+. +.++.+.+++++++++.. .+|++|++.||+ T Consensus 295 ~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~------------~~f~~~~~~~r~ 362 (392) T protein:vir:10 295 KKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGG------------KAFTRNTLDLRA 362 (392) T ss_pred cccccCcccEEEecccccCCCcccCCceEEEEEehhceEEEEeecceEEEEecccc------------chhhcCceEEEE Confidence 5789998766532 2334666799999997 568999999999887542 369999999999 Q ss_pred EEEeccEEecccceEEEEeeccCCCCcccc Q lcl|NC_011614. 294 TMHVALHIADDKAFAKLVPADAKPSSVPGE 323 (324) Q Consensus 294 ~~r~d~~v~~~~a~~~l~~~~~~~~~~~~~ 323 (324) +.|+|+++.+|+||++++.++++|+.+|+- T Consensus 363 ~~r~d~~v~~~~a~~~l~~~~~a~~~~~~~ 392 (392) T protein:vir:10 363 IQRDDVQMWDNEAAVYGEIDLSAPVEQPQG 392 (392) T ss_pred EEeeccEEecccceEEEEecccccccCCCC Confidence 999999999999999999999999999988 No 54 >protein:vir:4997 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049971;genbank:gi:9632943;genbank:GeneID:1262106 Probab=100.00 E-value=3.5e-54 Score=313.64 Aligned_cols=298 Identities=16% Similarity=0.126 Sum_probs=238.0 Q ss_pred Cchhh--HHHHHHHHHhhccc--hhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCceE--EE Q lcl|NC_011614. 
1 MEQTQ--KLKLNLQHFASNNV--KPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKK--FT 74 (324) Q Consensus 1 m~~~~--~~~~~~~~~~~~~~--~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~--ip 74 (324) ..+.. ....+++.|..... .....+.....++++||.+||++++++|++.+++.++|++++++++++++... +| T Consensus 79 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~ 158 (397) T protein:vir:49 79 LTKNEEEVKANFVKDFKNLVRGRYQNLLDSKTDGSGSDAGLTIPQDIRTAINTLVRQFDSLQEYVNVENVTTLTGSRVYE 158 (397) T ss_pred ccchhhHHHHHHHHHHHHHhhcchhhHHHhhhccCCccCcceecHHHHHHHHHHHHhhhhHhhhcceeeccCCcceEEEE Confidence 11111 11222222222111 11233344456677888999999999999999999999999999998876554 55 Q ss_pred EEeC-Ccceeeecccccccccc-cceeeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhccCc Q lcl|NC_011614. 75 FWAD-KPGAYWVGEGQKIETSK-ATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGN 152 (324) Q Consensus 75 ~~~~-~~~a~~v~Eg~~~~~~~-~~~~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~ 152 (324) +... .+.+.|++|++.+|+++ ++|++|+++++|++++++||+|+++|+..+++++|.+.|++++++++|.+||+|+|+ T Consensus 159 ~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~ail~G~g~ 238 (397) T protein:vir:49 159 KWADITGLAKLDDEGGQIGQNDDPKLSLIRYAIKRYAGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILEAIGT 238 (397) T ss_pred eeccCCcceeeeccccccccccccceeeeEeeeeeeEeehhhHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccc Confidence 5544 46789999999999865 899999999999999999999999999999999999999999999999999999987 Q ss_pred CcCCcccccccccccceeecccchhHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCceeecc----CCCceecc Q lcl|NC_011614. 153 NPFGKSIAQSIEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD----RNSDSLDG 228 (324) Q Consensus 153 ~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~g~~~~~~----~~~~~l~G 228 (324) +.. ..+..+++++.+++.++..+++.++.|+|||.++..|++++|++|+|+|.. 
+.+++|+| T Consensus 239 ~~~--------------~~~~~~~d~i~~~~~~l~~~~~~~a~~v~n~~~~~~l~~lkd~~g~~l~~~~~~~g~~~~l~G 304 (397) T protein:vir:49 239 LPN--------------KPTLAKWDDIIDLQAKVDPAIKQTSLFLTNTSGFTALKKVKNAMGDYLMERDVKSPTGYSIDG 304 (397) T ss_pred ccc--------------cccccCHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHHhhccCCceeecccccCCCCceecc Confidence 543 124568999999999999999999999999999999999999999999753 44578999 Q ss_pred cceEeecC-----ccCCCceEEEeeccc-EEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEe Q lcl|NC_011614. 229 LPVVNLKS-----SNLKRGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIA 302 (324) Q Consensus 229 ~pv~~~~~-----~~~~~~~i~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~ 302 (324) +||+++.+ ...++..+++|||+. +.++++++++++++++.. ++|++|++.||++.|+|+.+. T Consensus 305 ~pV~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~------------~~~~~~~~~~~~~~r~d~~~~ 372 (397) T protein:vir:49 305 FVVKEISDRFLPNGTGGAMPLYFGDLKQAVTLFDRQHLSLLSTNIGG------------GAFETDTTKVRVIDRFDVVST 372 (397) T ss_pred eeeEEecccccccccCCceeEEEeeccceEEEEeecccEEEEecccc------------chhhcCeeeEEEEEeeccEEe Confidence 99987653 334566799999996 678999999999987643 469999999999999999999 Q ss_pred cccceEEEEeeccCCCCccccC Q lcl|NC_011614. 303 DDKAFAKLVPADAKPSSVPGEV 324 (324) Q Consensus 303 ~~~a~~~l~~~~~~~~~~~~~~ 324 (324) +|+||++++.++...+.-.... T Consensus 373 ~~~a~~~~~~~~~~~~~~~~~~ 394 (397) T protein:vir:49 373 DTEAFVPASFKAIADQKAKLST 394 (397) T ss_pred cccceEEEEecccccccCcccc Confidence 9999999987654432221111 No 55 >protein:vir:4830 Length: 397 # NCBI annotation: MPL-7201 # Family: family:all:21 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038327;genbank:gi:9634653;genbank:GeneID:1262632 Probab=100.00 E-value=3.4e-54 Score=313.72 Aligned_cols=297 Identities=14% Similarity=0.115 Sum_probs=239.4 Q ss_pred Cch-----hhHHHHHHHHHhhccchhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCceEEEE Q lcl|NC_011614. 
1 MEQ-----TQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTF 75 (324) Q Consensus 1 m~~-----~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~ 75 (324) ++. .+......+++.+..... ...+....+++++|.+||++++++|++.+++.++|+++++++++++....+|+ T Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~ 157 (397) T protein:vir:48 79 LTKSEEEVKAGFVKDFKNLVRGRYQN-LLDSKTDASGSDAGLTIPQDIQTAIHTLVRQYDSLQEYVNVENVTTLTGSRVY 157 (397) T ss_pred ccchhhHHHHHHHHHHHHHHhhhhhH-HHHHhhccCCccccccccHHHHHHHHHHHHHHHHHHhhhceeeccCCcceEEE Confidence 222 222222233333332211 11222335566788999999999999999999999999999999887776665 Q ss_pred E---eCCcceeeeccccccccc-ccceeeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhccC Q lcl|NC_011614. 76 W---ADKPGAYWVGEGQKIETS-KATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQG 151 (324) Q Consensus 76 ~---~~~~~a~~v~Eg~~~~~~-~~~~~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g 151 (324) . +..+.+.|++|++.++++ +++|+++++++++++++++||+|++++|..+++++|.+.|++++++++|.++|+|+| T Consensus 158 ~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~v~~~l~~~~~~~~d~~il~G~g 237 (397) T protein:vir:48 158 EKWADITGLAKLDDEAGSIGTNDDPKLYPIRYAIKRYAGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILEAIA 237 (397) T ss_pred EeecCCCcceeeeccccccccccccceeeEEeeheeeeeehhhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccc Confidence 4 334678999999999976 689999999999999999999999999999999999999999999999999999988 Q ss_pred cCcCCcccccccccccceeecccchhHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCceeecc----CCCceec Q lcl|NC_011614. 152 NNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD----RNSDSLD 227 (324) Q Consensus 152 ~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~g~~~~~~----~~~~~l~ 227 (324) ++... .+..+++++.+++.+++.++..+++|+|||.++..|++++|++|+|+|.. 
+.+++|+ T Consensus 238 ~~~~~--------------~~~~~~d~i~~~~~~l~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~l~ 303 (397) T protein:vir:48 238 TLPTK--------------PTLTKWDDIIDLQAKVDPAIKQTSFFLTNTSGFTALKKVKNAFGDYLMERDVKSPTGYSID 303 (397) T ss_pred ccccc--------------cccccHHHHHHHHHHhhhhhcCCCEEEECHHHHHHHHHhhcCCCceeeccCcCCCCCceec Confidence 75432 24568999999999999999999999999999999999999999999853 3457899 Q ss_pred ccceEeecC-----ccCCCceEEEeeccc-EEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEE Q lcl|NC_011614. 228 GLPVVNLKS-----SNLKRGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHI 301 (324) Q Consensus 228 G~pv~~~~~-----~~~~~~~i~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v 301 (324) |+||+++.+ ...++..+++|||+. +.++.+++++++++++.. ++|.+|++.||++.|+|+.+ T Consensus 304 G~PV~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~------------~~~~~~~~~~r~~~r~d~~~ 371 (397) T protein:vir:48 304 GFAVKEVADRWLANASSGAMPLYFGDLKQAVTLFDRQQMSLLSTNIGG------------GAFETDTTKIRVIDRFDVVA 371 (397) T ss_pred cceeEEecccccCCcCCCceEEEEEeccceEEEEeecceEEEEeccch------------hhhhcCceeEEEEeeeccEE Confidence 999987643 334577899999996 568999999999987642 46999999999999999999 Q ss_pred ecccceEEEEeeccC-CCCccccC Q lcl|NC_011614. 302 ADDKAFAKLVPADAK-PSSVPGEV 324 (324) Q Consensus 302 ~~~~a~~~l~~~~~~-~~~~~~~~ 324 (324) .+|+||++++.+++. +..+-+-+ T Consensus 372 ~~~~a~~~~~~~~~~~~~~~~~~~ 395 (397) T protein:vir:48 372 TDTESFVPASFKAIADQKGNLGST 395 (397) T ss_pred ecccceEEEEecccccCCCCcccc Confidence 999999999976553 33333333 No 56 >protein:vir:1025 Length: 408 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076679;genbank:gi:13095788;genbank:GeneID:920362 Probab=100.00 E-value=5.3e-54 Score=312.64 Aligned_cols=298 Identities=14% Similarity=0.080 Sum_probs=239.9 Q ss_pred CchhhHHHHHHHHHhh---c---cchhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCceEEE Q lcl|NC_011614. 
1 MEQTQKLKLNLQHFAS---N---NVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFT 74 (324) Q Consensus 1 m~~~~~~~~~~~~~~~---~---~~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip 74 (324) ....+......+.|.. . .....+.++...+++.+||.+||++++++|++.+++.++++++++++++++....+| T Consensus 84 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~ 163 (408) T protein:vir:10 84 KSENELKDKFVKDFVNMVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRV 163 (408) T ss_pred cchhhhHHHHHHHHHHHhhcchhhhhhhhhhhhhcccccCCceeccHhHHHHHHHHHHhhchhhhhcceeeccCCcceEE Confidence 2222222223333322 2 222346667777788889999999999999999999999999999999987766665 Q ss_pred EE--e-CCcceeeeccccccccc-ccceeeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_011614. 75 FW--A-DKPGAYWVGEGQKIETS-KATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQ 150 (324) Q Consensus 75 ~~--~-~~~~a~~v~Eg~~~~~~-~~~~~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~ 150 (324) +. . ..+.+.|++|++++|++ .++|++|++.++|++++++||+|+++|+.+++.++|.+.|++++++++|.+|++|+ T Consensus 164 ~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~ 243 (408) T protein:vir:10 164 YEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVM 243 (408) T ss_pred EeeccccccceeeecCccccccccCcceeeEEeeeeeEEeeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhcc Confidence 54 3 34678999999999975 58999999999999999999999999999999999999999999999999999998 Q ss_pred CcCcCCcccccccccccceeecccchhHHHHHH-HHhhhhccCCCEEEEcHHHHHHHHHhhccCCceeecc----CCCce Q lcl|NC_011614. 151 GNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLE-ALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD----RNSDS 225 (324) Q Consensus 151 g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~-~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~g~~~~~~----~~~~~ 225 (324) |++... .+..+++++.+++ ..+..+++.++.|+|||.++.+|++++|++|+|+|.. 
+.+++ T Consensus 244 g~~~~~--------------~~~~~~~~l~~~~~~~~~~~~~~~a~~v~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~ 309 (408) T protein:vir:10 244 KAAPKK--------------PTIAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNSYL 309 (408) T ss_pred cccccc--------------cccccHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCceEeccCcCCCCCce Confidence 875431 2345788898876 5788899999999999999999999999999999864 34568 Q ss_pred ecccceEeecCcc-----CCCceEEEeeccc-EEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEecc Q lcl|NC_011614. 226 LDGLPVVNLKSSN-----LKRGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVAL 299 (324) Q Consensus 226 l~G~pv~~~~~~~-----~~~~~i~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~ 299 (324) |+|+||+++++.. .+...+++|||+. +.++.+++++++++++.. ..|++|++.||++.|+|+ T Consensus 310 l~G~PV~~~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~~v~~~~~~~------------~~f~~~~~~~r~~~r~d~ 377 (408) T protein:vir:10 310 IKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGA------------GAFETDTTKIRVIDRFDV 377 (408) T ss_pred ecceeeEEecccccCccCCCceEEEEEehhccEEEEEecceEEEEccccc------------chhhcCceEEEEEEeecc Confidence 9999999865433 3445699999997 578999999999987642 459999999999999999 Q ss_pred EEecccceEEEEeeccCCCC------ccccC Q lcl|NC_011614. 300 HIADDKAFAKLVPADAKPSS------VPGEV 324 (324) Q Consensus 300 ~v~~~~a~~~l~~~~~~~~~------~~~~~ 324 (324) .+.+|+||++++.++.++.+ +-.-| T Consensus 378 ~v~~~~a~~~~~~~~~~~~~~~~~~~~~~~~ 408 (408) T protein:vir:10 378 KATDSEALVAGSFSAIADQVGNFKTTTSTAV 408 (408) T ss_pred EEeccccEEEEEeeccccCCCCCCCCCcccC Confidence 99999999999987644322 12222 No 57 >protein:vir:4511 Length: 409 # NCBI annotation: capsid # Family: family:all:21 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599037;genbank:gi:19548995;genbank:GeneID:935211 Probab=100.00 E-value=3.7e-54 Score=313.54 Aligned_cols=304 Identities=16% Similarity=0.151 Sum_probs=245.9 Q ss_pred Cchhh-HHHHHHHHHhh----ccc-hhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCce-EE Q lcl|NC_011614. 
1 MEQTQ-KLKLNLQHFAS----NNV-KPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEK-KF 73 (324) Q Consensus 1 m~~~~-~~~~~~~~~~~----~~~-~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~-~i 73 (324) +++.. .+...+++-.. ... ...+.++..++++.+||.+||+++.++|++.+++.++|+++++++++.++.. .+ T Consensus 85 ~~~~~~a~~~~l~~~~~~~~~~e~~~~~~~~a~~~~~~~~gg~liP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~ 164 (409) T protein:vir:45 85 DEKRAQVFDKWMRHGASELTSEERKALRELRAQGVAQDEKGGYTVPETFLAKVVEKMKSYGGIASVAQILTTSDGRTMEW 164 (409) T ss_pred hHHHHHHHHHHHHhhhhhccHHHHHHHHHHhhccCccCcCCceeccHhHHHHHHHHHHhhhhhhhhceeeecCCCceEEE Confidence 12111 12222221111 011 1135566666777788999999999999999999999999999999977654 44 Q ss_pred EEEeCC-cceeeecccccccccccceeeEEeeeeeEE-EeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhccC Q lcl|NC_011614. 74 TFWADK-PGAYWVGEGQKIETSKATWVNATMRAFKLG-VILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQG 151 (324) Q Consensus 74 p~~~~~-~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~-~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g 151 (324) |+..+. ..+.|++|++.+|+++++|.++++.++|++ ++++||+|+++|+.++++++|.++|++++++++|.+||+|+| T Consensus 165 ~~~~~~~~~~~~v~E~~~~~~~~~~f~~~~l~~~k~~~~~i~is~ell~ds~~~l~~~i~~~la~a~~~~~~~a~l~G~G 244 (409) T protein:vir:45 165 ATADGTSEVGVLLGENEEAGEEDTDFGMGSLGALKMTSKIIRVSNELLQDSAIDMEAYLARRIAERIGRGEARYLIQGTG 244 (409) T ss_pred EeeccCccccccccccccccccccccceeeeeeeeeeeeehhhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHhhccCC Confidence 555443 457899999999999999999999999985 679999999999999999999999999999999999999998 Q ss_pred cC--cCCccccccccc-ccceeecccchhHHHHHHHHhhhhccCCCEE--EEcHHHHHHHHHhhccCCceeecc----CC Q lcl|NC_011614. 152 NN--PFGKSIAQSIEK-TNKVIKGDFTQDNIIDLEALLEDDELEANAF--ISKTQNRSLLRKIVDPETKERIYD----RN 222 (324) Q Consensus 152 ~~--~~~~~~~~~~~~-~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~--v~~~~~~~~L~~l~d~~g~~~~~~----~~ 222 (324) ++ ..|.+++..... .....++.++++++.+++..|..++..++.| +||+.++.+|++++|++|+|+|.. +. 
T Consensus 245 ~~~~~~p~Gil~~~~~~~~~~~~~~~~~d~i~~l~~~l~~~~~~~a~~~~~~n~~~~~~l~~lkd~~G~~i~~~~~~~~~ 324 (409) T protein:vir:45 245 AGTPKQPKGLAASVTGTTQTAAANAVKWQEILALKHSIDPAYRRGPKFRLAFNDNTLKLISEMEDGQGRPLWLPDIVGVA 324 (409) T ss_pred CCCccccceeeeccccccccccccccchHHHHHHHHhhhhhhccCCeEEEEECHHHHHHHHHhhcCCCceeeccCcCCCC Confidence 75 345666655433 3334456789999999999999999888765 679999999999999999999853 34 Q ss_pred CceecccceEeecCcc---CCCceEEEeecccEEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEecc Q lcl|NC_011614. 223 SDSLDGLPVVNLKSSN---LKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVAL 299 (324) Q Consensus 223 ~~~l~G~pv~~~~~~~---~~~~~i~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~ 299 (324) +.+|+|+||++++..+ .++..+++|||++++++.++++.++.+++. +|++|+++||++.|+|+ T Consensus 325 ~~~l~G~PV~~~~~~p~~~~~~~~i~~Gd~~~~~i~~~~~~~~~~~~d~--------------~~~~~~~~~~~~~r~d~ 390 (409) T protein:vir:45 325 PASVLNVPYVIDQEIDDIGAGKKFMFCGDFDRFIIRRVRYMILKRLVER--------------YAEYDQTGFLAFHRFDC 390 (409) T ss_pred CceecceeeEEecCcCCccCCccEEEEeehhhhheeeccceEEEEeecc--------------cccCCcEEEEEEEEecc Confidence 5689999999987543 345679999999999999999999887653 47899999999999999 Q ss_pred EEecccceEEEEeeccCCC Q lcl|NC_011614. 300 HIADDKAFAKLVPADAKPS 318 (324) Q Consensus 300 ~v~~~~a~~~l~~~~~~~~ 318 (324) ++.+|+||++++.++++++ T Consensus 391 ~~~~~~A~~~l~~k~s~~~ 409 (409) T protein:vir:45 391 ILEDTSAIKALVGKGSVGG 409 (409) T ss_pred EeechhheEEEEeccCCCC Confidence 9999999999999988888 No 58 >protein:vir:4856 Length: 293 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049396;genbank:gi:9632424;genbank:GeneID:1258532 Probab=100.00 E-value=7.8e-54 Score=311.75 Aligned_cols=276 Identities=16% Similarity=0.124 Sum_probs=235.9 Q ss_pred hhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCc--eEEEEEe-CCcceeeecccccccc-cccce Q lcl|NC_011614. 
23 VFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTE--KKFTFWA-DKPGAYWVGEGQKIET-SKATW 98 (324) Q Consensus 23 ~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~--~~ip~~~-~~~~a~~v~Eg~~~~~-~~~~~ 98 (324) .+++....++++||.+||++++++|++.++++++++++++++++.+.. ..+|.+. ..+.+.|++||+++++ ++++| T Consensus 1 ~l~~~~~~t~~~gg~liP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~ 80 (293) T protein:vir:48 1 MLDSKTDHSGSDAGLTIPQDIRTAINTLVRQYDSLQEYVNVENVTTLTGSRVYEKWTDITGLANIDDEAGKIADIDDPKL 80 (293) T ss_pred CceeecccccCcCceEechhHHHHHHHHHHhhhhhhhhceeeeccCCcceEEEEeecCCCcceeeecCCcccccccccce Confidence 778888888888999999999999999999999999999999987654 4566665 4577999999999997 57999 Q ss_pred eeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhccCcCcCCcccccccccccceeecccchhH Q lcl|NC_011614. 99 VNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDN 178 (324) Q Consensus 99 ~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 178 (324) ++++++++|++++++||+|+++|+.++++++|.++|++++++++|++|+.|.++... ..+..++++ T Consensus 81 ~~i~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~i~~g~~~~~~--------------~~~~~~~d~ 146 (293) T protein:vir:48 81 SLIKYTIKRYAGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILGVVDKLPT--------------KPTLTKWDD 146 (293) T ss_pred eEEEEeeeEEEEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHhHHhhccccccc--------------cccccCHHH Confidence 999999999999999999999999999999999999999999999999998775432 235678999 Q ss_pred HHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCceeecc----CCCceecccceEeecCc-----cCCCceEEEeec Q lcl|NC_011614. 179 IIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD----RNSDSLDGLPVVNLKSS-----NLKRGELITGDF 249 (324) Q Consensus 179 i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~g~~~~~~----~~~~~l~G~pv~~~~~~-----~~~~~~i~~gd~ 249 (324) +.++++++..+++.+++|+|||+++..|++++|++|+|+|.+ +.+++|+|+||+++.+. 
..++..+++||| T Consensus 147 i~~~~~~l~~~~~~~a~~vmn~~~~~~L~~lkd~~g~~l~~~~~~~~~~~~l~G~Pv~~~~~~~~~~~~~~~~~~~~gd~ 226 (293) T protein:vir:48 147 IIDLEAKVDPAIKQTSFFLTNTSGFTALKKVKNALGDYLMERDVKSPTGYSIAGFAVKEISDRWLPNASSGVMPLYFGDL 226 (293) T ss_pred HHHHHHhhhhhhcCCCEEEEcHHHHHHHHHhhccCCceEeecCcCCCCCceecceeeEEecccccCCccCCceEEEEEec Confidence 999999999999999999999999999999999999999864 34568999999876533 334557999999 Q ss_pred cc-EEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeeccCCC-CccccC Q lcl|NC_011614. 250 DK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADAKPS-SVPGEV 324 (324) Q Consensus 250 ~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~~~~~-~~~~~~ 324 (324) +. +.++++++++++++++.. ++|++|++.||++.|+|+++.+|+||++++.+++.++ .|-+-. T Consensus 227 ~~~~~~~~~~~~~i~~~~~~~------------~~~~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~~~~~~~~~~~~ 291 (293) T protein:vir:48 227 KQAVTLFDRQQMSLLSTNIGG------------GAFETDTTKVRVIDRFDVVATDTEAFVPASFKAIADQKGNIGST 291 (293) T ss_pred cceEEEEEecceEEEEecccc------------hhhhcCeEEEEEEEeeCcEEecccceEEEEeeccccCCcccccc Confidence 97 568899999999987642 4699999999999999999999999999996544332 221111 No 59 >protein:vir:81160 Length: 371 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285811;genbank:gi:148747732;genbank:GeneID:5247203 Probab=100.00 E-value=1e-53 Score=311.06 Aligned_cols=287 Identities=18% Similarity=0.142 Sum_probs=237.6 Q ss_pred Cc--hhhHHHHHHHHHhhccchhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCceE--EEEE Q lcl|NC_011614. 1 ME--QTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKK--FTFW 76 (324) Q Consensus 1 m~--~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~--ip~~ 76 (324) +. ..+..+..+++|.... ...+.++....++.+||.+||++++++|++.+++.+++++++++++++++... +|+. 
T Consensus 64 ~~~~~~~~~~~~~~~~~~~l-~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~~~~~~ 142 (371) T protein:vir:81 64 PLKPTVQVKENEVEAFVNHI-RTRFRNAMSEGSNQDGGYTVPQDIQTRINELRESKDALQNLITVEPVTTLSGSRVFKKR 142 (371) T ss_pred ccccchhhHHHHHHHHHHHH-HHHHHHhhccCCCccCceeecHhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEee Confidence 11 1111223344444322 22344566667778899999999999999999999999999999999876655 4555 Q ss_pred eCCcceeeecccccccc-cccceeeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhccCcCcC Q lcl|NC_011614. 77 ADKPGAYWVGEGQKIET-SKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPF 155 (324) Q Consensus 77 ~~~~~a~~v~Eg~~~~~-~~~~~~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~~~ 155 (324) ...+.+.|++||+++|+ ++++|++++++++|++++++||+|+++|+.++++++|.+.|++++++++|.++++|+|++.. T Consensus 143 ~~~~~a~~v~Eg~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~a~~~~~~~~i~~g~g~~~~ 222 (371) T protein:vir:81 143 SQQTGFVEVAEGAAIGEKATPQFTLLQYQVKKYAGFFRVTNELLNDSTEAIVNTLVRWIGDESRVTRNGLIINVLNTKAK 222 (371) T ss_pred cCCcceeeeccccccccccccceeeEEeeeeEEEEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc Confidence 66778999999999986 67999999999999999999999999999999999999999999999999999999886532 Q ss_pred CcccccccccccceeecccchhHHHHHH-HHhhhhccCCCEEEEcHHHHHHHHHhhccCCceeecc----CCCceecccc Q lcl|NC_011614. 156 GKSIAQSIEKTNKVIKGDFTQDNIIDLE-ALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD----RNSDSLDGLP 230 (324) Q Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~~i~~~~-~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~g~~~~~~----~~~~~l~G~p 230 (324) .+..+++++..++ ..+...+..+++|+|||+++.+|++++|++|+|+|.. 
+.+++|+|+| T Consensus 223 ---------------~~~~~~~~i~~~~~~~l~~~~~~~a~~vmn~~~~~~L~~lkd~~g~~l~~~~~~~~~~~~l~G~p 287 (371) T protein:vir:81 223 ---------------TAIADLDGLKQIINVQLDPVFRSTSSVIVNQDAFNWLDTLKDQNGQYLLQPSISSPTGRQLLGLP 287 (371) T ss_pred ---------------cccccHHHHHHHHHhhcchhhhcCCEEEEcHHHHHHHHHhhccCCCeeeecccCCCCCceeccee Confidence 2345677888765 5788889999999999999999999999999999853 4457899999 Q ss_pred eEeecCcc----------CCCceEEEeeccc-EEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEecc Q lcl|NC_011614. 231 VVNLKSSN----------LKRGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVAL 299 (324) Q Consensus 231 v~~~~~~~----------~~~~~i~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~ 299 (324) |+++++.+ .+...+++|||+. +.++.+++++++++++.. ++|++|++.||++.|+|+ T Consensus 288 V~~~~~~~~~~~~~~~~~~~~~~i~~Gd~~~~~~~~~~~~~~i~~~~~~~------------~~f~~~~v~~~~~~r~d~ 355 (371) T protein:vir:81 288 VVIVSNKVLANRVDGGTGAQFAPIIVGDLKEAVVMFDRQRTEIMSSNVAM------------DAFETDATLWRAIERMDV 355 (371) T ss_pred EEEecccccCccccccccCCcceEEEEehhceEEEEeecceEEEEecccc------------chhhcCceEEEEEEeecc Confidence 99976543 3566799999997 578899999999987643 469999999999999999 Q ss_pred EEecccceEEEEeecc Q lcl|NC_011614. 300 HIADDKAFAKLVPADA 315 (324) Q Consensus 300 ~v~~~~a~~~l~~~~~ 315 (324) .+.+|+||++++.+++ T Consensus 356 ~~~~~~a~~~~~~~~A 371 (371) T protein:vir:81 356 KMRDDEAFVFGEVQLA 371 (371) T ss_pred EEecccceEEEEEecC Confidence 9999999999999888 No 60 >protein:vir:3991 Length: 404 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116499;genbank:gi:14251132;genbank:GeneID:921252 Probab=100.00 E-value=1.5e-53 Score=310.21 Aligned_cols=297 Identities=14% Similarity=0.104 Sum_probs=239.6 Q ss_pred CchhhHHHHHHHHHhhc---cchhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCceEEEEE- Q lcl|NC_011614. 
1 MEQTQKLKLNLQHFASN---NVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFW- 76 (324) Q Consensus 1 m~~~~~~~~~~~~~~~~---~~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~- 76 (324) ++.+.+.......+.+. .....+.++...+++++||.+||++++++|++.+++.++|++++++++++++...+|++ T Consensus 87 ~~~~~~~~~~~~~~~~~~~~~~~~~e~~a~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~ 166 (404) T protein:vir:39 87 YELKDKFVKEFVNMVRNPMAFLNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEK 166 (404) T ss_pred hhhHHHHHHHHHHHHhcchhhhhhhhhhhhhcccccCCceeccHHHHHHHHHHHHhhhhHHhhcceeeccCCcceEEEEe Confidence 22222222222223222 22345667777778888999999999999999999999999999999998776666654 Q ss_pred -e-CCcceeeecccccccc-cccceeeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhccCcC Q lcl|NC_011614. 77 -A-DKPGAYWVGEGQKIET-SKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNN 153 (324) Q Consensus 77 -~-~~~~a~~v~Eg~~~~~-~~~~~~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~ 153 (324) . ..+.+.|++||+.+|+ ++++|++++++++|++++++||+|+++|+.++++++|.+.|++++++++|.++|+|+|++ T Consensus 167 ~~~~~~~a~~v~Eg~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~il~g~g~~ 246 (404) T protein:vir:39 167 WTDVTPLTVMDAEDGKIPDLDNPRLTIIKYLIKRYAGIITATNTLLKDTAENILAWLSSWIAKKVVVTRNQAIIAAMGTV 246 (404) T ss_pred ecCCccceeeecCccccccccccceeeEEeeeeeEEeeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHhccccc Confidence 3 3467899999999997 579999999999999999999999999999999999999999999999999999998875 Q ss_pred cCCcccccccccccceeecccchhHHHHHHH-HhhhhccCCCEEEEcHHHHHHHHHhhccCCceeecc----CCCceecc Q lcl|NC_011614. 154 PFGKSIAQSIEKTNKVIKGDFTQDNIIDLEA-LLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD----RNSDSLDG 228 (324) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~-~l~~~~~~~~~~v~~~~~~~~L~~l~d~~g~~~~~~----~~~~~l~G 228 (324) ... .+..+++++.+++. .+...+..+++|+|||+++..|++++|++|+|++.. 
+.+++|+| T Consensus 247 ~~~--------------~~~~~~~~i~~~~~~~~~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G 312 (404) T protein:vir:39 247 PKK--------------PTIAKFDDVITMINTSVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNSYLIKG 312 (404) T ss_pred ccc--------------cccccHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCceeeccCcCCCCcceecc Confidence 421 23456888888765 677888888999999999999999999999999854 34568999 Q ss_pred cceEeecCcc-----CCCceEEEeeccc-EEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEe Q lcl|NC_011614. 229 LPVVNLKSSN-----LKRGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIA 302 (324) Q Consensus 229 ~pv~~~~~~~-----~~~~~i~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~ 302 (324) +||+++++.. .+...+++|||+. +.++.+++++++++++.. ++|++|++.+|++.|+|+.+. T Consensus 313 ~pV~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~------------~~~~~~~~~~r~~~r~d~~~~ 380 (404) T protein:vir:39 313 KKVIVVADRWLPNSGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGA------------GAFETDTTKIRVIDRFDVKTT 380 (404) T ss_pred eeEEEecccccCccCCCccEEEEEeccccEEEEeecceEEEEeccch------------hhhhhceeeEEEEeeeccEEe Confidence 9999875433 3455799999996 668999999999987642 469999999999999999999 Q ss_pred cccceEEEEeeccCCC---Ccccc Q lcl|NC_011614. 303 DDKAFAKLVPADAKPS---SVPGE 323 (324) Q Consensus 303 ~~~a~~~l~~~~~~~~---~~~~~ 323 (324) +|+||++++.++.+++ ++.|- T Consensus 381 ~~~a~~~~~~~~~a~~~~~~~~~~ 404 (404) T protein:vir:39 381 DSEALVAGSFTAIADQVGNFTAGK 404 (404) T ss_pred cccceEEEEeeccccCCCCCCCCC Confidence 9999999986655442 33444 No 61 >protein:vir:4600 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058445;genbank:gi:9635171;genbank:GeneID:1262708 Probab=100.00 E-value=2.9e-53 Score=308.60 Aligned_cols=304 Identities=13% Similarity=0.069 Sum_probs=243.5 Q ss_pred Cchh-hHHHHHHHHHhhccchhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCceEEEEE--e Q lcl|NC_011614. 
1 MEQT-QKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFW--A 77 (324) Q Consensus 1 m~~~-~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~--~ 77 (324) ..+. .....+++.|........+.++. ..++++++.+||+++.++|++.+++.++|++++++++++++...+|+. . T Consensus 95 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~t~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~ 173 (415) T protein:vir:46 95 SIQNTKVTSQEVRDFTEYLETRNDIQGG-SLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQS 173 (415) T ss_pred hhhhhhhhHHHHHHHHHHHhhhhhhhhc-cccccCCcccccHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEEec Confidence 1111 11222344444433333333332 345667888999999999999999999999999999999888777765 4 Q ss_pred CCcceeeecccccccc-cccceeeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhccCcCcCC Q lcl|NC_011614. 78 DKPGAYWVGEGQKIET-SKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFG 156 (324) Q Consensus 78 ~~~~a~~v~Eg~~~~~-~~~~~~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~~~~ 156 (324) ..+.+.|++||+++|+ +.++|+++++.+++++++++||+|+++|+.++++++|.+.|++++++++|.++|+|+|++... T Consensus 174 ~~~~~~~v~Eg~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~~ 253 (415) T protein:vir:46 174 EVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTG 253 (415) T ss_pred CCcceeecccccccccccccceeeEEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCCcc Confidence 5567899999999997 568999999999999999999999999999999999999999999999999999999876554 Q ss_pred cccccccc-cccceeecccchhHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCceeecc----CCCceecccce Q lcl|NC_011614. 157 KSIAQSIE-KTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD----RNSDSLDGLPV 231 (324) Q Consensus 157 ~~~~~~~~-~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~g~~~~~~----~~~~~l~G~pv 231 (324) ........ ......++..+++++.+++.++...++.+++|+|||+++.+|++++|++|+|+|.. 
+.+++|+|+|| T Consensus 254 ~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~l~G~pV 333 (415) T protein:vir:46 254 STSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKI 333 (415) T ss_pred ccccccccccceeccccccchHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHHhhccCCCeeeccCcCCCCCccccceee Confidence 44433332 23334456789999999999999999999999999999999999999999999853 44578999999 Q ss_pred EeecCccC---CCceEEEeeccc-EEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccce Q lcl|NC_011614. 232 VNLKSSNL---KRGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAF 307 (324) Q Consensus 232 ~~~~~~~~---~~~~i~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~ 307 (324) +++++.+. ++..+++|||+. +.++.+++++++.++ |.++++.+|++.|+|+++.+|+|| T Consensus 334 ~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~v~~~~-----------------~~~~~~~~~~~~r~d~~v~~~~a~ 396 (415) T protein:vir:46 334 EILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTD-----------------YMHFGECLMIAVRQDCRILDYKSA 396 (415) T ss_pred EEeccccccCCCccEEEEEehhccEEEEeecceEEEeec-----------------cccCceEEEEEEEeccEEeccccE Confidence 98876543 345699999997 567888999887764 456778899999999999999999 Q ss_pred EEEEeeccCCCCccccC Q lcl|NC_011614. 308 AKLVPADAKPSSVPGEV 324 (324) Q Consensus 308 ~~l~~~~~~~~~~~~~~ 324 (324) ++++..++.+ -||+. T Consensus 397 ~~~~~~~~~~--~~~~~ 411 (415) T protein:vir:46 397 IVIEYDDSER--GEGDL 411 (415) T ss_pred EEEEeeccCC--CCCCc Confidence 9998754433 34555 No 62 >protein:vir:4700 Length: 415 # NCBI annotation: phi PVL ORF 7 homologue # Family: family:all:21 # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061632;genbank:gi:9635719;genbank:GeneID:1262976 Probab=100.00 E-value=2.9e-53 Score=308.60 Aligned_cols=304 Identities=13% Similarity=0.069 Sum_probs=243.5 Q ss_pred Cchh-hHHHHHHHHHhhccchhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCceEEEEE--e Q lcl|NC_011614. 
1 MEQT-QKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFW--A 77 (324) Q Consensus 1 m~~~-~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~--~ 77 (324) ..+. .....+++.|........+.++. ..++++++.+||+++.++|++.+++.++|++++++++++++...+|+. . T Consensus 95 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~t~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~ 173 (415) T protein:vir:47 95 SIQNTKVTSQEVRDFTEYLETRNDIQGG-SLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQS 173 (415) T ss_pred hhhhhhhhHHHHHHHHHHHhhhhhhhhc-cccccCCcccccHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEEec Confidence 1111 11222344444433333333332 345667888999999999999999999999999999999888777765 4 Q ss_pred CCcceeeecccccccc-cccceeeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhccCcCcCC Q lcl|NC_011614. 78 DKPGAYWVGEGQKIET-SKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFG 156 (324) Q Consensus 78 ~~~~a~~v~Eg~~~~~-~~~~~~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~~~~ 156 (324) ..+.+.|++||+++|+ +.++|+++++.+++++++++||+|+++|+.++++++|.+.|++++++++|.++|+|+|++... T Consensus 174 ~~~~~~~v~Eg~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~~ 253 (415) T protein:vir:47 174 EVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTG 253 (415) T ss_pred CCcceeecccccccccccccceeeEEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCCcc Confidence 5567899999999997 568999999999999999999999999999999999999999999999999999999876554 Q ss_pred cccccccc-cccceeecccchhHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCceeecc----CCCceecccce Q lcl|NC_011614. 157 KSIAQSIE-KTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD----RNSDSLDGLPV 231 (324) Q Consensus 157 ~~~~~~~~-~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~g~~~~~~----~~~~~l~G~pv 231 (324) ........ ......++..+++++.+++.++...++.+++|+|||+++.+|++++|++|+|+|.. 
+.+++|+|+|| T Consensus 254 ~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~l~G~pV 333 (415) T protein:vir:47 254 STSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKI 333 (415) T ss_pred ccccccccccceeccccccchHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHHhhccCCCeeeccCcCCCCCccccceee Confidence 44433332 23334456789999999999999999999999999999999999999999999853 44578999999 Q ss_pred EeecCccC---CCceEEEeeccc-EEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccce Q lcl|NC_011614. 232 VNLKSSNL---KRGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAF 307 (324) Q Consensus 232 ~~~~~~~~---~~~~i~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~ 307 (324) +++++.+. ++..+++|||+. +.++.+++++++.++ |.++++.+|++.|+|+++.+|+|| T Consensus 334 ~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~v~~~~-----------------~~~~~~~~~~~~r~d~~v~~~~a~ 396 (415) T protein:vir:47 334 EILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTD-----------------YMHFGECLMIAVRQDCRILDYKSA 396 (415) T ss_pred EEeccccccCCCccEEEEEehhccEEEEeecceEEEeec-----------------cccCceEEEEEEEeccEEeccccE Confidence 98876543 345699999997 567888999887764 456778899999999999999999 Q ss_pred EEEEeeccCCCCccccC Q lcl|NC_011614. 308 AKLVPADAKPSSVPGEV 324 (324) Q Consensus 308 ~~l~~~~~~~~~~~~~~ 324 (324) ++++..++.+ -||+. T Consensus 397 ~~~~~~~~~~--~~~~~ 411 (415) T protein:vir:47 397 IVIEYDDSER--GEGDL 411 (415) T ss_pred EEEEeeccCC--CCCCc Confidence 9998754433 34555 No 63 >protein:vir:1268 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690760;genbank:gi:22855000;genbank:GeneID:955203 Probab=100.00 E-value=1.5e-53 Score=310.25 Aligned_cols=288 Identities=13% Similarity=0.080 Sum_probs=235.3 Q ss_pred CchhhHHHHHHHH-----HhhccchhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCC--ceEE Q lcl|NC_011614. 
1 MEQTQKLKLNLQH-----FASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGT--EKKF 73 (324) Q Consensus 1 m~~~~~~~~~~~~-----~~~~~~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~--~~~i 73 (324) ++..+.+...+++ ..+......+.++...+++++||.+||++++++|++.+++.++++++++++++++. .+.+ T Consensus 92 ~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~lvP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~ 171 (397) T protein:vir:12 92 QQYSKAFLKGLRGKRLTDEERDLLDSPEFRAMSGINDEDGGILIPEDIGRQIHEFKRQFEPLEQYVTVEPVTTRSGTRLL 171 (397) T ss_pred HHHHHHHHHHHhccCCcHHHHHHHhhhhhhhccccccccCcccCchhHHHHHHHhhhhhhhHHhhcceeeccCCceeEEE Confidence 1111111111111 01111122355566667778899999999999999999999999999999998754 4556 Q ss_pred EEEeCCcceeeeccccccccc-ccceeeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhccCc Q lcl|NC_011614. 74 TFWADKPGAYWVGEGQKIETS-KATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGN 152 (324) Q Consensus 74 p~~~~~~~a~~v~Eg~~~~~~-~~~~~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~ 152 (324) |+.++.+.+.|++||+++|++ .++|+++++.++|++++++||+|+++|+.++++++|.+.|++++++++|.++++|+|+ T Consensus 172 ~~~~~~~~a~~v~Eg~~~~~~~~~~~~~v~~~~~k~~~~~~is~e~l~ds~~~l~~~i~~~l~~~~~~~~d~~il~G~g~ 251 (397) T protein:vir:12 172 EKNADMVPFSPVEELGNLPEIDQPRFTKVSYSIIDYGGIMTLSNSMLNDSDQAIMTYVAKWFAKKSVVTRNNLILAAIAS 251 (397) T ss_pred EEecCCcceeeecccccccccccccceeEEeeheeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHhcccc Confidence 677778899999999999974 6999999999999999999999999999999999999999999999999999999886 Q ss_pred CcCCcccccccccccceeecccchhHHHHHH-HHhhhhccCCCEEEEcHHHHHHHHHhhccCCceeecc----CCCceec Q lcl|NC_011614. 153 NPFGKSIAQSIEKTNKVIKGDFTQDNIIDLE-ALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD----RNSDSLD 227 (324) Q Consensus 153 ~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~-~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~g~~~~~~----~~~~~l~ 227 (324) +.+ .+..+++++.+++ ..+..++..+++|+|||.++.+|++++|++|+|+|.. 
+.+++|+ T Consensus 252 ~~~---------------~g~~~~~~i~~~~~~~l~~~~~~~a~~~~n~~~~~~L~~lkd~~G~~l~~~~~~~g~~~~l~ 316 (397) T protein:vir:12 252 LKK---------------VDIDGLDGIKKALNVTLDPMVAPGSIVLTNQDGYDWLDTLKDGTGRYLLQPDPTNPTKKLLD 316 (397) T ss_pred ccc---------------cccccHHHHHHHHhhccchhhhCCCEEEEcHHHHHHHHHhhccCCceeecccccCCCCcccc Confidence 532 2345688899866 5888899999999999999999999999999999853 4457899 Q ss_pred ccceEeecC----ccCCCceEEEeeccc-EEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEe Q lcl|NC_011614. 228 GLPVVNLKS----SNLKRGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIA 302 (324) Q Consensus 228 G~pv~~~~~----~~~~~~~i~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~ 302 (324) |+||+++++ ...++..+++|||+. +.++.+++++++++++.. .+|++|++.||++.|+|+.+. T Consensus 317 G~pv~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~------------~~f~~~~~~~r~~~r~d~~~~ 384 (397) T protein:vir:12 317 GRPVVPFTNRVLKTQKGKAPLIIGNLKEAIVLFDREQQSIASTDTGA------------GAFETNSTKVRGIEREDVRKW 384 (397) T ss_pred ceeeEEecccccccCCCccEEEEEehhceEEEEeecceEEEEecccc------------chhhcCceEEEEEEeeccEEe Confidence 999987653 234566799999997 468889999999887642 469999999999999999999 Q ss_pred cccceEEEEeecc Q lcl|NC_011614. 303 DDKAFAKLVPADA 315 (324) Q Consensus 303 ~~~a~~~l~~~~~ 315 (324) +|+||++++.++. T Consensus 385 ~~~a~~~~~~t~~ 397 (397) T protein:vir:12 385 DEDAVVFGQITVE 397 (397) T ss_pred cccceEEEEEeeC Confidence 9999999999877 No 64 >protein:vir:81100 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429874;genbank:gi:156603927;genbank:GeneID:5525320 Probab=100.00 E-value=3.8e-53 Score=308.00 Aligned_cols=304 Identities=13% Similarity=0.080 Sum_probs=242.1 Q ss_pred CchhhHHHHHHHHHhhccchhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCceEEEE--EeC Q lcl|NC_011614. 
1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTF--WAD 78 (324) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~--~~~ 78 (324) ....+....+++.|........+.++. ..++++||.+||+++.+.|++.+++.++|++++++++|+++...+|+ .++ T Consensus 96 ~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~ 174 (415) T protein:vir:81 96 IQNTKVTSQEVRDFTEYLETRNDIQGG-SLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSE 174 (415) T ss_pred hhhhhhHHHHHHHHHHHHhhhhhhhhc-cccccccccccchHHHHHHHHHHHhhhhhhhheeeeeccCCceeEEEEeecC Confidence 111112233344454443333343333 34556788899999999999999999999999999999877666554 556 Q ss_pred Ccceeeeccccccccc-ccceeeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhccCcCcCCc Q lcl|NC_011614. 79 KPGAYWVGEGQKIETS-KATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGK 157 (324) Q Consensus 79 ~~~a~~v~Eg~~~~~~-~~~~~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~~~~~ 157 (324) ...+.|++|++++++. .++|+++++.+++++++++||+|+++||.++++++|.+.|++++++++|.++++|+|++.... T Consensus 175 ~~~~~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~ 254 (415) T protein:vir:81 175 VAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGS 254 (415) T ss_pred CccceeeccccccCcccccceeeEEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCcccc Confidence 6788999999999975 689999999999999999999999999999999999999999999999999999998765443 Q ss_pred cccc-ccccccceeecccchhHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCceeecc----CCCceecccceE Q lcl|NC_011614. 158 SIAQ-SIEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD----RNSDSLDGLPVV 232 (324) Q Consensus 158 ~~~~-~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~g~~~~~~----~~~~~l~G~pv~ 232 (324) .... 
..........+..+++++.+++.++...++.+++|+|||+++.+|++++|++|+|+|.+ +.+++|+|+||+ T Consensus 255 ~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~ 334 (415) T protein:vir:81 255 TSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIE 334 (415) T ss_pred ccccccccccccccccccchhHHHHHHHhhhhhccCCCEEEEcHHHHHHHHHhhccCCceeeccCcCCCCCceecceeeE Confidence 3333 22333444556789999999999999999999999999999999999999999999854 345689999999 Q ss_pred eecCccC---CCceEEEeeccc-EEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceE Q lcl|NC_011614. 233 NLKSSNL---KRGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFA 308 (324) Q Consensus 233 ~~~~~~~---~~~~i~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~ 308 (324) ++++.+. ++..+++|||+. ++++.+.+++++.++ |.++++.+|++.|+|+.+.+|+||+ T Consensus 335 ~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~v~~~~-----------------~~~~~~~~~~~~r~d~~v~~~~a~~ 397 (415) T protein:vir:81 335 ILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTD-----------------YMHFGECLMIAVRQDCRILDYKSAI 397 (415) T ss_pred EecccccCCCCccEEEEEehhccEEEEeecceEEEEec-----------------cccCceEEEEEEEeccEEeccccEE Confidence 8876543 455699999997 557889999998764 4456778999999999999999999 Q ss_pred EEEeeccCCCCccccC Q lcl|NC_011614. 309 KLVPADAKPSSVPGEV 324 (324) Q Consensus 309 ~l~~~~~~~~~~~~~~ 324 (324) +++..++ ++-+|+. T Consensus 398 ~~~~~~~--~~~~~~~ 411 (415) T protein:vir:81 398 VIEYDDS--ERGEGDL 411 (415) T ss_pred EEEEecc--CCCCCcc Confidence 9997543 3445565 No 65 >protein:vir:79987 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430002;genbank:gi:156604057;genbank:GeneID:5525447 Probab=100.00 E-value=3.8e-53 Score=308.00 Aligned_cols=304 Identities=13% Similarity=0.080 Sum_probs=242.1 Q ss_pred CchhhHHHHHHHHHhhccchhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCceEEEE--EeC Q lcl|NC_011614. 
1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTF--WAD 78 (324) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~--~~~ 78 (324) ....+....+++.|........+.++. ..++++||.+||+++.+.|++.+++.++|++++++++|+++...+|+ .++ T Consensus 96 ~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~ 174 (415) T protein:vir:79 96 IQNTKVTSQEVRDFTEYLETRNDIQGG-SLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSE 174 (415) T ss_pred hhhhhhHHHHHHHHHHHHhhhhhhhhc-cccccccccccchHHHHHHHHHHHhhhhhhhheeeeeccCCceeEEEEeecC Confidence 111112233344454443333343333 34556788899999999999999999999999999999877666554 556 Q ss_pred Ccceeeeccccccccc-ccceeeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhccCcCcCCc Q lcl|NC_011614. 79 KPGAYWVGEGQKIETS-KATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGK 157 (324) Q Consensus 79 ~~~a~~v~Eg~~~~~~-~~~~~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~~~~~ 157 (324) ...+.|++|++++++. .++|+++++.+++++++++||+|+++||.++++++|.+.|++++++++|.++++|+|++.... T Consensus 175 ~~~~~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~ 254 (415) T protein:vir:79 175 VAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGS 254 (415) T ss_pred CccceeeccccccCcccccceeeEEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCcccc Confidence 6788999999999975 689999999999999999999999999999999999999999999999999999998765443 Q ss_pred cccc-ccccccceeecccchhHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCceeecc----CCCceecccceE Q lcl|NC_011614. 158 SIAQ-SIEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD----RNSDSLDGLPVV 232 (324) Q Consensus 158 ~~~~-~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~g~~~~~~----~~~~~l~G~pv~ 232 (324) .... 
..........+..+++++.+++.++...++.+++|+|||+++.+|++++|++|+|+|.+ +.+++|+|+||+ T Consensus 255 ~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~ 334 (415) T protein:vir:79 255 TSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIE 334 (415) T ss_pred ccccccccccccccccccchhHHHHHHHhhhhhccCCCEEEEcHHHHHHHHHhhccCCceeeccCcCCCCCceecceeeE Confidence 3333 22333444556789999999999999999999999999999999999999999999854 345689999999 Q ss_pred eecCccC---CCceEEEeeccc-EEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceE Q lcl|NC_011614. 233 NLKSSNL---KRGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFA 308 (324) Q Consensus 233 ~~~~~~~---~~~~i~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~ 308 (324) ++++.+. ++..+++|||+. ++++.+.+++++.++ |.++++.+|++.|+|+.+.+|+||+ T Consensus 335 ~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~v~~~~-----------------~~~~~~~~~~~~r~d~~v~~~~a~~ 397 (415) T protein:vir:79 335 ILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTD-----------------YMHFGECLMIAVRQDCRILDYKSAI 397 (415) T ss_pred EecccccCCCCccEEEEEehhccEEEEeecceEEEEec-----------------cccCceEEEEEEEeccEEeccccEE Confidence 8876543 455699999997 557889999998764 4456778999999999999999999 Q ss_pred EEEeeccCCCCccccC Q lcl|NC_011614. 309 KLVPADAKPSSVPGEV 324 (324) Q Consensus 309 ~l~~~~~~~~~~~~~~ 324 (324) +++..++ ++-+|+. T Consensus 398 ~~~~~~~--~~~~~~~ 411 (415) T protein:vir:79 398 VIEYDDS--ERGEGDL 411 (415) T ss_pred EEEEecc--CCCCCcc Confidence 9997543 3445565 No 66 >protein:vir:98339 Length: 415 # NCBI annotation: putative capsid protein # Family: family:all:21 # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918931;genbank:gi:119443693;genbank:GeneID:4594501 Probab=100.00 E-value=3.8e-53 Score=308.00 Aligned_cols=304 Identities=13% Similarity=0.080 Sum_probs=242.1 Q ss_pred CchhhHHHHHHHHHhhccchhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCceEEEE--EeC Q lcl|NC_011614. 
1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTF--WAD 78 (324) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~--~~~ 78 (324) ....+....+++.|........+.++. ..++++||.+||+++.+.|++.+++.++|++++++++|+++...+|+ .++ T Consensus 96 ~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~ 174 (415) T protein:vir:98 96 IQNTKVTSQEVRDFTEYLETRNDIQGG-SLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSE 174 (415) T ss_pred hhhhhhHHHHHHHHHHHHhhhhhhhhc-cccccccccccchHHHHHHHHHHHhhhhhhhheeeeeccCCceeEEEEeecC Confidence 111112233344454443333343333 34556788899999999999999999999999999999877666554 556 Q ss_pred Ccceeeeccccccccc-ccceeeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhccCcCcCCc Q lcl|NC_011614. 79 KPGAYWVGEGQKIETS-KATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGK 157 (324) Q Consensus 79 ~~~a~~v~Eg~~~~~~-~~~~~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~~~~~ 157 (324) ...+.|++|++++++. .++|+++++.+++++++++||+|+++||.++++++|.+.|++++++++|.++++|+|++.... T Consensus 175 ~~~~~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~ 254 (415) T protein:vir:98 175 VAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGS 254 (415) T ss_pred CccceeeccccccCcccccceeeEEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCcccc Confidence 6788999999999975 689999999999999999999999999999999999999999999999999999998765443 Q ss_pred cccc-ccccccceeecccchhHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCceeecc----CCCceecccceE Q lcl|NC_011614. 158 SIAQ-SIEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD----RNSDSLDGLPVV 232 (324) Q Consensus 158 ~~~~-~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~g~~~~~~----~~~~~l~G~pv~ 232 (324) .... 
..........+..+++++.+++.++...++.+++|+|||+++.+|++++|++|+|+|.+ +.+++|+|+||+ T Consensus 255 ~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~ 334 (415) T protein:vir:98 255 TSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIE 334 (415) T ss_pred ccccccccccccccccccchhHHHHHHHhhhhhccCCCEEEEcHHHHHHHHHhhccCCceeeccCcCCCCCceecceeeE Confidence 3333 22333444556789999999999999999999999999999999999999999999854 345689999999 Q ss_pred eecCccC---CCceEEEeeccc-EEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceE Q lcl|NC_011614. 233 NLKSSNL---KRGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFA 308 (324) Q Consensus 233 ~~~~~~~---~~~~i~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~ 308 (324) ++++.+. ++..+++|||+. ++++.+.+++++.++ |.++++.+|++.|+|+.+.+|+||+ T Consensus 335 ~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~v~~~~-----------------~~~~~~~~~~~~r~d~~v~~~~a~~ 397 (415) T protein:vir:98 335 ILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTD-----------------YMHFGECLMIAVRQDCRILDYKSAI 397 (415) T ss_pred EecccccCCCCccEEEEEehhccEEEEeecceEEEEec-----------------cccCceEEEEEEEeccEEeccccEE Confidence 8876543 455699999997 557889999998764 4456778999999999999999999 Q ss_pred EEEeeccCCCCccccC Q lcl|NC_011614. 309 KLVPADAKPSSVPGEV 324 (324) Q Consensus 309 ~l~~~~~~~~~~~~~~ 324 (324) +++..++ ++-+|+. T Consensus 398 ~~~~~~~--~~~~~~~ 411 (415) T protein:vir:98 398 VIEYDDS--ERGEGDL 411 (415) T ss_pred EEEEecc--CCCCCcc Confidence 9997543 3445565 No 67 >protein:vir:3845 Length: 395 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050151;swissprot:trembl:q9t1f6;genbank:gi:9633043;uniprot:Q9T1F6;genbank:GeneID:1262163 Probab=100.00 E-value=2.7e-53 Score=308.82 Aligned_cols=295 Identities=17% Similarity=0.142 Sum_probs=234.6 Q ss_pred CchhhH---HHHHHHHHhhccchhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCceEEEEE- Q lcl|NC_011614. 
1 MEQTQK---LKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFW- 76 (324) Q Consensus 1 m~~~~~---~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~- 76 (324) |...+. .+...+++.+... ...+ ...+++++||.+||++++++|++.+++.++++++++++++++....++++ T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~--~~~~-~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~ 157 (395) T protein:vir:38 81 LPVKDGKPDAQAMKNQFVKDFK--NLVT-SGTTGTGNAGLTIPEDIQLQIRTLTRSFTSLESLANVENVTTSHGSRVYEK 157 (395) T ss_pred cchhhhhHHHHHHHHHHHHHHH--HHHh-hccCccCCCceecchhHhhHHHHHHHhhcchhhhcceeeccCCcceEEEEe Confidence 221111 1122222222111 1112 23455667899999999999999999999999999999998766665544 Q ss_pred -eC-Ccceeeeccccccccc-ccceeeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhccCcC Q lcl|NC_011614. 77 -AD-KPGAYWVGEGQKIETS-KATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNN 153 (324) Q Consensus 77 -~~-~~~a~~v~Eg~~~~~~-~~~~~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~ 153 (324) .. .+.+.|++|++.+|++ .++|++++++++|++++++||+|+++|+.++++++|.+.|++++++++|.+||+|+|++ T Consensus 158 ~~~~~~~a~~v~E~~~~~~~~~~~f~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~il~g~g~~ 237 (395) T protein:vir:38 158 LADITPLKDLDDESALIGDNDDPELTVVKYLIHRYAGITTVTNTLLKDTVDNIIQWLVNWAAKKDVVTRNAKILEVMGKA 237 (395) T ss_pred eccCCccccccccccccccccccceeeEEeeeeeeEeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccc Confidence 33 4568999999999976 58999999999999999999999999999999999999999999999999999998876 Q ss_pred cCCcccccccccccceeecccchhHHHHHHH-HhhhhccCCCEEEEcHHHHHHHHHhhccCCceeecc----CCCceecc Q lcl|NC_011614. 154 PFGKSIAQSIEKTNKVIKGDFTQDNIIDLEA-LLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD----RNSDSLDG 228 (324) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~-~l~~~~~~~~~~v~~~~~~~~L~~l~d~~g~~~~~~----~~~~~l~G 228 (324) .... +..+++++.+++. .+...++.+++|+|||.++.+|++++|++|+|+|.. 
+.+++|+| T Consensus 238 ~~~~--------------~~~~~~~i~~~~~~~l~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G 303 (395) T protein:vir:38 238 PKKP--------------TISQFDNIKDLENNTLDPAIESTSSFITNQSGYNILSKVKDADGRYLMQPDVTSPDKYLIDG 303 (395) T ss_pred cccc--------------ccccHHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHHhhccCCceeeccCcCCCCcceecc Confidence 4321 2346788888765 788889999999999999999999999999999854 34568999 Q ss_pred cceEeecCcc----CCCceEEEeeccc-EEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEec Q lcl|NC_011614. 229 LPVVNLKSSN----LKRGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIAD 303 (324) Q Consensus 229 ~pv~~~~~~~----~~~~~i~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~ 303 (324) +||+++++.. .++..+++|||+. +.++.+++++++++++.. .+|++|++.+|++.|+|+++.+ T Consensus 304 ~pV~~~~~~~~~~~~~~~~i~~gd~~~~~~i~~~~~~~i~~~~~~~------------~~~~~~~~~~r~~~r~d~~~~~ 371 (395) T protein:vir:38 304 KPVIRIADKWLPDVSGSHPLYFGDLKQGITLFDRQQMQIDTTNVGA------------GSFEHDTTKLRFIDRFDVQLID 371 (395) T ss_pred ceeEEecccccCcCCCcceEEEEeccccEEEEEecceEEEEecccc------------chhhcCceEEEEEEeeccEEec Confidence 9999876432 3456699999996 678999999999987643 3599999999999999999999 Q ss_pred ccceEEEEeeccCCCCccccC Q lcl|NC_011614. 304 DKAFAKLVPADAKPSSVPGEV 324 (324) Q Consensus 304 ~~a~~~l~~~~~~~~~~~~~~ 324 (324) |+||++++.++.++.+.+.-. T Consensus 372 ~~a~~~~~~~~~~~~~~~~~~ 392 (395) T protein:vir:38 372 DGAFAAASFKTVANQAQGTAG 392 (395) T ss_pred ccceEEEEeecccCCCCCccC Confidence 999999998766544443322 No 68 >protein:vir:7409 Length: 408 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839926;genbank:gi:30089896;genbank:GeneID:1260683 Probab=100.00 E-value=4e-53 Score=307.84 Aligned_cols=298 Identities=15% Similarity=0.113 Sum_probs=238.9 Q ss_pred CchhhHHHHHHHHHhhc------cchhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCceEE- Q lcl|NC_011614. 
1 MEQTQKLKLNLQHFASN------NVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKF- 73 (324) Q Consensus 1 m~~~~~~~~~~~~~~~~------~~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~i- 73 (324) +...+....+++.|... .....+.++.+..++.+||.+||++++++|++.+++.++|++++++++++++...+ T Consensus 84 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~ 163 (408) T protein:vir:74 84 KSENELKDKFVKDFVNMVRNPMAFLNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSSGSRV 163 (408) T ss_pred chhhhhHHHHHHHHHHHHhcchhhhhhhhhhhhcccccCCCceeechhHhhHHHHHHhhhcchhhhcceeeccCCcceEE Confidence 22222222333333321 22334666666777888999999999999999999999999999999998766554 Q ss_pred -EEEeC-Ccceeeecccccccc-cccceeeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_011614. 74 -TFWAD-KPGAYWVGEGQKIET-SKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQ 150 (324) Q Consensus 74 -p~~~~-~~~a~~v~Eg~~~~~-~~~~~~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~ 150 (324) |+... .+.+.|++|++.+++ ++++|++++++++|++++++||+|+++|+..+++++|.+.|++++++++|.++|+|+ T Consensus 164 ~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~il~G~ 243 (408) T protein:vir:74 164 YEKWTDVTPLKAMDEEDGKIPDLDNPRLTIIKYLIKRYAGIITATNTLLKDTAENILAWLSSWIAKKVVVTRNQAIIAAM 243 (408) T ss_pred EEeecCCcccccccccccccccccccceeeEEeeeeeEEeeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhcc Confidence 44444 456789999999997 569999999999999999999999999999999999999999999999999999999 Q ss_pred CcCcCCcccccccccccceeecccchhHHHHHH-HHhhhhccCCCEEEEcHHHHHHHHHhhccCCceeecc----CCCce Q lcl|NC_011614. 151 GNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLE-ALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD----RNSDS 225 (324) Q Consensus 151 g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~-~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~g~~~~~~----~~~~~ 225 (324) |++... .+..+++++++++ ..+..+++.+++|+|||.++.+|++++|++|+|+|.. 
+.+++ T Consensus 244 G~~~~~--------------~~~~~~~~i~~~~~~~l~~~~~~~a~~v~n~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~ 309 (408) T protein:vir:74 244 GTVPKK--------------PTIANFDDVITMINTSVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNSYL 309 (408) T ss_pred cccccc--------------cccccHHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHHhhcCCCceEeccCcCCCCCce Confidence 876432 2345688898875 6888999999999999999999999999999999863 44578 Q ss_pred ecccceEeecC-----ccCCCceEEEeeccc-EEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEecc Q lcl|NC_011614. 226 LDGLPVVNLKS-----SNLKRGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVAL 299 (324) Q Consensus 226 l~G~pv~~~~~-----~~~~~~~i~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~ 299 (324) |+|+||+++++ ...++..+++|||+. +.++.+++++++++++. +..|.+|++.+|++.|+|+ T Consensus 310 l~G~pV~~~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~~i~~~~~~------------~~~f~~~~~~~r~~~r~d~ 377 (408) T protein:vir:74 310 IKGKQVIVVADRWLPNSGSTVYPLYYGDMSQAITLFDRENMSLLPTNIG------------AGAFETDTTKIRVIDRFDV 377 (408) T ss_pred ecceeeEEecCcccccccCCcceEEEEehhccEEEEEecceEEEEeccc------------cchhhcceeeEEEEEeeCc Confidence 99999988653 334567799999996 67899999999998763 2459999999999999999 Q ss_pred EEecccceEEEEeeccCCCC--cc----ccC Q lcl|NC_011614. 300 HIADDKAFAKLVPADAKPSS--VP----GEV 324 (324) Q Consensus 300 ~v~~~~a~~~l~~~~~~~~~--~~----~~~ 324 (324) ++.+|+||++++.++..++. || .-| T Consensus 378 ~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~ 408 (408) T protein:vir:74 378 KATDSEALVAGSFTAIADQVGNFKTTTSTAV 408 (408) T ss_pred EEecccceEEEEeecccCCCCCCCCCccccC Confidence 99999999999875433221 11 122 No 69 >protein:vir:104256 Length: 458 # NCBI annotation: major head protein precursor # Family: family:all:27070 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006977;genbank:gi:46401878;genbank:GeneID:2777673 Probab=100.00 E-value=2.8e-53 Score=308.67 Aligned_cols=298 Identities=14% Similarity=0.080 Sum_probs=237.7 Q ss_pred CchhhHHHHHHHHHh----hccchh-----hhhcccc-ccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCc Q lcl|NC_011614. 
1 MEQTQKLKLNLQHFA----SNNVKP-----QVFNPDN-VMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTE 70 (324) Q Consensus 1 m~~~~~~~~~~~~~~----~~~~~~-----~~~~a~~-~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~ 70 (324) ++++...+...+++. +..... +...+.+ .++...++.+||++++++|++.+++.++++++++++|++++. T Consensus 126 ~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~ 205 (458) T protein:vir:10 126 TQENFEDEVEKLVLLSYVMEKGVFETEHGQRHLKAVNQSSSVEVSSESYETIFSQRIIRDLQKELVVGALFEELPMSSKI 205 (458) T ss_pred hhhhHHHHHHHHHHHHHHHhhccchhhhhhhhhhhhhhcccCccccceehhhHhHHHHHHHHhhhhHHhhcceeecCCcc Confidence 222222222222222 111111 1111222 234456788999999999999999999999999999999999 Q ss_pred eEEEEEeCCcceeeeccccccccc------ccceeeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011614. 71 KKFTFWADKPGAYWVGEGQKIETS------KATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDE 144 (324) Q Consensus 71 ~~ip~~~~~~~a~~v~Eg~~~~~~------~~~~~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~ 144 (324) ..+|+.+..+.|.|++|++.++++ +++|+++++.++|++++++||+|+++|+.++++++|.+.|++++++++|. T Consensus 206 ~~~~~~~~~~~a~~v~e~~~~~~~~~~~~~~~~~~~i~~~~~k~~~~v~is~ell~ds~~~~~~~i~~~l~~~i~~~~d~ 285 (458) T protein:vir:10 206 LTMLVEPDAGKATWVAASTYGTDTTTGEEVKGALKEIHFSTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEE 285 (458) T ss_pred eEEEEecCCcceeecccccccccccccccccccceeeEeeeeeEEeeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHH Confidence 999999999999999999988754 57899999999999999999999999999999999999999999999999 Q ss_pred HHHhccCcCcCCcccccccccc--------cceeecccchhHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCce Q lcl|NC_011614. 145 AGILNQGNNPFGKSIAQSIEKT--------NKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKE 216 (324) Q Consensus 145 a~l~g~g~~~~~~~~~~~~~~~--------~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~g~~ 216 (324) ++|+|+|++ .|.|++...... 
.....+..+++++++++..+..+|+.++.|+|||.+|..|++++|++|+| T Consensus 286 ~~l~G~G~~-~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~ 364 (458) T protein:vir:10 286 AFMTGDGSG-KPKGLLTLASEDSAKVVTEAKADGSVLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQD 364 (458) T ss_pred HhhcCCCCC-ccceeeecccccccceeecccccccccccHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHhhcccCCce Confidence 999999975 455555543221 12223456899999999999999999999999999999999999999999 Q ss_pred eecc--------CCCceecccceEeecCccC--CCceEEEeecc-cEEEEEecceEEEEeecccccccccccccchhhhh Q lcl|NC_011614. 217 RIYD--------RNSDSLDGLPVVNLKSSNL--KRGELITGDFD-KLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFE 285 (324) Q Consensus 217 ~~~~--------~~~~~l~G~pv~~~~~~~~--~~~~i~~gd~~-~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~ 285 (324) ++.. +.+++|+|+||++++..+. +...+++|||. .+.++++.++++..++ ++. T Consensus 365 i~~~~~~~~~~~~~~~~l~G~pv~~~~~~p~~~~~~~~~~~~f~~~~~~~~~~~~~v~~d~----------------~~~ 428 (458) T protein:vir:10 365 VAQVGNDSVKLQGQVGRIYGLPVVVSEYFPAKANSAEFAVIVYKDNFVMPRQRAVTVERER----------------QAG 428 (458) T ss_pred eeccccccccccCcCceecceeeEEccccccccCCcceEEEEecccEEEEEeeceEEEeec----------------ccC Confidence 8642 2345799999999876543 34578999996 4779999999887643 357 Q ss_pred cCcEEEEEEEEeccEEecccceEEEEeecc Q lcl|NC_011614. 
286 QDMVALRATMHVALHIADDKAFAKLVPADA 315 (324) Q Consensus 286 ~~~v~~r~~~r~d~~v~~~~a~~~l~~~~~ 315 (324) +++++||++.|+|+.+.+|+||++.+.+++ T Consensus 429 ~~~~~~~~~~r~~~~v~~~~a~v~~~~aa~ 458 (458) T protein:vir:10 429 KQRDAYYVTQRVNLQRYFANGVVSGTYAAS 458 (458) T ss_pred CCceEEEEEEEecceEecccceEEEeeccC Confidence 899999999999999999999999988877 No 70 >protein:vir:99920 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655524;genbank:gi:109392294;genbank:GeneID:4157089 Probab=100.00 E-value=3.4e-53 Score=308.26 Aligned_cols=280 Identities=15% Similarity=0.134 Sum_probs=227.0 Q ss_pred cccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCceEEEEEeCCcceeeecccccccccccceeeEEeeeee Q lcl|NC_011614. 28 NVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFK 107 (324) Q Consensus 28 ~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k 107 (324) +.+.++++|++||++++++|++.+++++++++++++++++++..+||++++.+.|+|++|++++|+++++|++++++++| T Consensus 1 Mat~tt~~g~~vP~~~~~~ii~~~~~~s~l~~~~~~i~~~~~~~~~p~~~~~~~a~wv~Eg~~~~~~~~~f~~v~l~~~k 80 (311) T protein:vir:99 1 MATFGTGNLKNLPRNIADGMVKDVVQGSTVAVLSARKPQRFGNEDIITFNGRPKAEFVGEGQQKSSTTGEFDFVTSTPKK 80 (311) T ss_pred CceecCCCceeccHHHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEeCCceeEEeecCcccccccceeeEEEEeeEE Confidence 45667788999999999999999999999999999999999889999999999999999999999999999999999999 Q ss_pred EEEeehhHHHHHh---cChhHHHHHHHHHHHHHHHHHHHHHHHhccCcCc--CCcccccccc----cccceeec-ccchh Q lcl|NC_011614. 108 LGVILPVTKEFLN---YTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNP--FGKSIAQSIE----KTNKVIKG-DFTQD 177 (324) Q Consensus 108 ~~~~v~iS~ell~---~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~~--~~~~~~~~~~----~~~~~~~~-~~~~~ 177 (324) ++++++||+|+++ ++..+++++|.++|++++++++|+++|+|+|++. .+.+...... 
.......+ ...++ T Consensus 81 ~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~g~~~g~~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 160 (311) T protein:vir:99 81 AQVTMRFNEEVQWADEDYQLGVLQTLSEAGAEALARALDLGLYHRINPLTGTVIPGWSNYLGAASKRVELTADTIANPDL 160 (311) T ss_pred EEEeehhhHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccCccccccccccccccceeeccccccchhHH Confidence 9999999999995 6678999999999999999999999999987532 2222222111 11111111 22345 Q ss_pred HHHHHHHHhhhhc--cCCCEEEEcHHHHHHHHHhhccCCceeecc----CCCceecccceEeecCcc------------- Q lcl|NC_011614. 178 NIIDLEALLEDDE--LEANAFISKTQNRSLLRKIVDPETKERIYD----RNSDSLDGLPVVNLKSSN------------- 238 (324) Q Consensus 178 ~i~~~~~~l~~~~--~~~~~~v~~~~~~~~L~~l~d~~g~~~~~~----~~~~~l~G~pv~~~~~~~------------- 238 (324) ++.+++..+...+ ...+.|+|||.++..|++++|++|+|+|.+ ..+++|+|+||++++..+ T Consensus 161 ~i~~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~Pv~~s~~i~~~~~~~~~~~~~~ 240 (311) T protein:vir:99 161 AIEAAVGLLVANGHPTPVNGLALHPSIAWGLSTARYTDGRKKFPELGLGIGVSSFEGIDASVSDTVNGGDEADPDDEDLD 240 (311) T ss_pred HHHHHHHHHhhhccCCCccEEEEcHHHHHHHHhhhccCCCeeecCcccCCCCceecceeeEeecccccccccccccchhh Confidence 6677777766543 444679999999999999999999999864 345689999999875332 Q ss_pred -CCCceEEEeecccE-EEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeecc Q lcl|NC_011614. 239 -LKRGELITGDFDKL-IYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADA 315 (324) Q Consensus 239 -~~~~~i~~gd~~~~-~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~~ 315 (324) .+...+++|||+.. .++.++++++++++++. 
.+..+++|++|++++|+++|+||.+.+| +|++++.+++ T Consensus 241 ~~~~~~~~~Gdf~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~d~~~~r~~~r~d~~v~~~-~~v~~~~~~A 311 (311) T protein:vir:99 241 AARAVRGIVGDFANGIHWGVQRDIPVELIKYGD-------PDGQGDLKRHNQIALRLEIVYGWYVFTD-RFVVIENAVA 311 (311) T ss_pred ccCcceEEEeeccccEEEEEecCceEEEeecCC-------CCcchhhhhcCcEEEEEEEeecceecCh-hHeeeecccC Confidence 24556788999874 58899999998887653 2345788999999999999999999996 6777777777 No 71 >protein:vir:81227 Length: 413 # NCBI annotation: gp6, major capsid protein # Family: family:all:585 # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456736;genbank:gi:157168379;hssp:P49861;interpro:IPR006444;uniprot:Q9MBJ9;genbank:GeneID:5580350 Probab=100.00 E-value=4.2e-53 Score=307.73 Aligned_cols=303 Identities=15% Similarity=0.120 Sum_probs=236.9 Q ss_pred CchhhHHHHH--HHHHhhccchhhhhc-----cccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCceEE Q lcl|NC_011614. 1 MEQTQKLKLN--LQHFASNNVKPQVFN-----PDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKF 73 (324) Q Consensus 1 m~~~~~~~~~--~~~~~~~~~~~~~~~-----a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~i 73 (324) +....+.+.. ............+.+ .....++++++++||++++++|++.+++.++|++++++++++++...+ T Consensus 85 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~ 164 (413) T protein:vir:81 85 AGDQIKQQAGGAQLNYSVGEYVAPRVKAASDPASTATLTDEFQGGYGTTWNRNIIYRRREKLVVADLMDNLTMTNTTIKY 164 (413) T ss_pred hhhHHHHHHHHHHhhhhhhhhhhhHHHhhhhhhhhcccccccccccchhhHHHHHHHHhhhhhHHhhcceeeccCCceeE Confidence 1100000000 001111111111111 223355668899999999999999999999999999999999998999 Q ss_pred EEEeCC----cceeeecccccccccc-cceeeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_011614. 74 TFWADK----PGAYWVGEGQKIETSK-ATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGIL 148 (324) Q Consensus 74 p~~~~~----~~a~~v~Eg~~~~~~~-~~~~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~ 148 (324) |+.... 
..+.|++||+.+|+++ ++|+++++.++|++++++||+|+++|++ .++++|.+.|++++++++|+++|+ T Consensus 165 ~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~-~l~~~i~~~la~~~~~~~d~~~l~ 243 (413) T protein:vir:81 165 LMEKANRVVEGGFKTVAEGGKKPYMRFADFDIVTESLSKIAGLTKITDEMIEDYD-FLVSYINARLLEELAIEEERQLLL 243 (413) T ss_pred EEeccccccccccceecCcccccccCcccceeeEeeeeeEEEeehhhHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhc Confidence 987643 4579999999999987 6899999999999999999999999996 599999999999999999999999 Q ss_pred ccCcCcCCcccccccccccceee-cccchhHHHHHHHHhhhh-ccCCCEEEEcHHHHHHHHHhhccCCceeeccC----- Q lcl|NC_011614. 149 NQGNNPFGKSIAQSIEKTNKVIK-GDFTQDNIIDLEALLEDD-ELEANAFISKTQNRSLLRKIVDPETKERIYDR----- 221 (324) Q Consensus 149 g~g~~~~~~~~~~~~~~~~~~~~-~~~~~~~i~~~~~~l~~~-~~~~~~~v~~~~~~~~L~~l~d~~g~~~~~~~----- 221 (324) |+|++.++.++............ +...++++.+++..+... ++.+.+|+|||.++.+|+++||++|+|+|... T Consensus 244 G~G~~~~~~Gi~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~ 323 (413) T protein:vir:81 244 GDGTGNNLTGLLKRDGIQTLAVSNKDELADSIYKAMTNISLATPFQADALVINPLDYQELRLAKDANGQYYGGGVFQGQY 323 (413) T ss_pred cCCCCCcccccccccccccccccccchhHHHHHHHHHHhhhhccCCCcEEEEcHHHHHHHHHhhccCCceeccccccccc Confidence 99998887777776555444333 334577777777776544 45566799999999999999999999998532 Q ss_pred ------CCceecccceEeecCccCCCceEEEeeccc-EEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEE Q lcl|NC_011614. 222 ------NSDSLDGLPVVNLKSSNLKRGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRAT 294 (324) Q Consensus 222 ------~~~~l~G~pv~~~~~~~~~~~~i~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~ 294 (324) ..++|+|+||+++++ ++.+.+++|||+. +.++.+++++++++++.. 
++|++|++.||++ T Consensus 324 ~~~~~~~~~~l~G~pv~~s~~--~~~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~------------~~~~~~~~~~r~~ 389 (413) T protein:vir:81 324 GSGGIMLDPAPWGLRTVQSQV--VPVGKPVVGAFRSAASVLRKGGVRIDSTNTNV------------DDFENNLITVRAE 389 (413) T ss_pred cccccccCceecceeeEEcCC--CCcccEEEEecccEEEEEEecceEEEEecccc------------chhhcCcEEEEEE Confidence 234799999998764 4567899999996 678889999999987653 3699999999999 Q ss_pred EEeccEEecccceEEEEeeccCCCCcc Q lcl|NC_011614. 295 MHVALHIADDKAFAKLVPADAKPSSVP 321 (324) Q Consensus 295 ~r~d~~v~~~~a~~~l~~~~~~~~~~~ 321 (324) .|+|+.+.+|+||++++.++ +++| T Consensus 390 ~r~d~~~~~~~a~~~l~~~~---~~~p 413 (413) T protein:vir:81 390 ERVGLMVTFPEAIVQLDVAE---VVTP 413 (413) T ss_pred EeeccEEecccceEEEEecC---CCCC Confidence 99999999999999998754 4445 No 72 >protein:vir:96762 Length: 632 # NCBI annotation: putative phage-related protein # Family: family:all:21 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039818;genbank:gi:126010917;genbank:GeneID:5076272 Probab=100.00 E-value=2.4e-53 Score=309.03 Aligned_cols=295 Identities=12% Similarity=0.171 Sum_probs=243.0 Q ss_pred CchhhH---------------------HHHH-----HHHHhhc------cchhhhhccccccccCCCcceechhh-hHHH Q lcl|NC_011614. 1 MEQTQK---------------------LKLN-----LQHFASN------NVKPQVFNPDNVMMHEKKDGTLLNDF-TTPI 47 (324) Q Consensus 1 m~~~~~---------------------~~~~-----~~~~~~~------~~~~~~~~a~~~~~~~~~g~lip~~~-~~~i 47 (324) +..++. ++.. .+++.+. .....+.|+....++++||.+||+++ ..+| T Consensus 299 ~~~re~~~~~l~rai~a~a~~~~~~a~~~~e~a~~~a~~~G~~arg~~~~~~~l~~ra~~~~t~~~gg~lvp~~~~~~~i 378 (632) T protein:vir:96 299 IQHKELQQYSLMRAINAAATGDWSKAGFEREVSLAIADASGKEARGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEEF 378 (632) T ss_pred hhHHHHHHHHHHHHHHhhhccchhhhhhhhHHHHHHHHhhhhhhhhhhhhHHHHHHhhhhcccccccccccccccchHHH Confidence 100000 0000 0000000 01112345666677778999999887 5899 Q ss_pred HHHHHhhcchhhh-ceeeecCCCceEEEEEeCCcceeeecccccccccccceeeEEeeeeeEEEeehhHHHHHhcChhHH Q lcl|NC_011614. 
48 LQEVMENSKIMQL-GKYEPMEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQF 126 (324) Q Consensus 48 ~~~~~~~s~l~~l-~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~v~iS~ell~~s~~~~ 126 (324) ++.+++.++++++ ++++++.++.+.||+.++++.++|++|++.+++++++|+++++.++|++++++||+|++++|.+++ T Consensus 379 ie~lr~~s~i~~l~~~~~~~~~g~~~ip~~~~~~~a~wv~E~~~~~~s~~~f~~i~l~~~k~~~~v~iS~ell~ds~~~~ 458 (632) T protein:vir:96 379 IDILRNKAIIGQMGARMLPGLVGDVDIPKKTSGANFYWIGEDEDVQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQSSIHV 458 (632) T ss_pred HHHHhhcchhhhhcceEeecCCcceEEEEEeCCceeEeecCCccccccccceeeEEeeeeEEEEehhhHHHHHhccchHH Confidence 9999999999998 688999989999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhccCcCcCCcccccccccccc-eeecccchhHHHHHHHHhhhhc--cCCCEEEEcHHHH Q lcl|NC_011614. 127 FEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNK-VIKGDFTQDNIIDLEALLEDDE--LEANAFISKTQNR 203 (324) Q Consensus 127 ~~~v~~~l~~ai~~~~d~a~l~g~g~~~~~~~~~~~~~~~~~-~~~~~~~~~~i~~~~~~l~~~~--~~~~~~v~~~~~~ 203 (324) +++|.+.|.+++++++|.++|+|+|++..|.|++...+.... ..++.++++++.++..++...+ ..++.|+|||.++ T Consensus 459 ~~~i~~~l~~a~~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~~~~~~i~~~~~~i~~~~~~~~~~~~~~~~~~~ 538 (632) T protein:vir:96 459 ENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTYPAGGVDWASVVDMETKISTFNADAGRLAYLTSVTQR 538 (632) T ss_pred HHHHHHHHHHHHHHHHHHHhhcccCCCCccceeeecccccceecccccCCHHHHHHHHHHHhhcccccCccEEEEchhHH Confidence 999999999999999999999999987778887765544332 2345678999999999998876 3457899999988 Q ss_pred HHHHH--hhccCCceeeccCCCceecccceEeecCccCCCceEEEeecccEEEEEecceEEEEeecccccccccccccch Q lcl|NC_011614. 204 SLLRK--IVDPETKERIYDRNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPV 281 (324) Q Consensus 204 ~~L~~--l~d~~g~~~~~~~~~~~l~G~pv~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~ 281 (324) ..|.. ++|++|+|+|.+ ++|+|+|+++++. ++.+.+++|||+.++++.++++++.++++. 
T Consensus 539 ~~l~~~~l~d~~G~~i~~~---~~l~G~pv~~s~~--ip~~~~~~gd~s~~~i~~~~~~~i~~~~~~------------- 600 (632) T protein:vir:96 539 GAAKKAQVFDNTGERIWQN---NEVNGYRAEASNQ--IPADTWIFGDWSQIVIAMWGVLDLKVDPYT------------- 600 (632) T ss_pred HHHHHHhccCCCCceeecC---CeecccceEeccc--cccCcEEEeecceEEEEEecceEEEEcccc------------- Confidence 77765 779999999864 5899999998764 556679999999999999999999998754 Q ss_pred hhhhcCcEEEEEEEEeccEEecccceEEEEeec Q lcl|NC_011614. 282 NLFEQDMVALRATMHVALHIADDKAFAKLVPAD 314 (324) Q Consensus 282 ~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~ 314 (324) +|.+|++.||++.|+|+++.+|++|+++++++ T Consensus 601 -~~~~~~v~~~~~~~~d~~v~~~~af~~~k~~A 632 (632) T protein:vir:96 601 -KAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) T ss_pred -ccccCceEEEEEeecCceeechhhhhheeecC Confidence 47899999999999999999999999999988 No 73 >protein:vir:9410 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803388;genbank:gi:29028700;genbank:GeneID:1258136 Probab=100.00 E-value=1.1e-52 Score=305.52 Aligned_cols=304 Identities=13% Similarity=0.073 Sum_probs=242.7 Q ss_pred CchhhHHHHHHHHHhhccchhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCceEEE--EEeC Q lcl|NC_011614. 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFT--FWAD 78 (324) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip--~~~~ 78 (324) ....+....+++.|........+.++. 
..++++||.+||+++.++|++.+++.++|++++++++++++...+| +.++ T Consensus 96 ~~~~~~~~~e~~~~~~~~~~~~~~~~~-~~~~~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~ 174 (415) T protein:vir:94 96 IQNTKVTSQEVRDFTEYLETRNDIQGG-SLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSE 174 (415) T ss_pred hhhhhhhHHHHHHHHHHhhhhhhhhhh-ccccccccccCcHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEeecC Confidence 111111223344555444444444443 3556678899999999999999999999999999999987766555 4556 Q ss_pred Ccceeeeccccccccc-ccceeeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhccCcCcCCc Q lcl|NC_011614. 79 KPGAYWVGEGQKIETS-KATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGK 157 (324) Q Consensus 79 ~~~a~~v~Eg~~~~~~-~~~~~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~~~~~ 157 (324) .+.+.|++||+++|+. .++|+++++.+++++++++||+|+++||.++++++|.++|++++++++|+++++|+|++.... T Consensus 175 ~~~~~~v~Eg~~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~ 254 (415) T protein:vir:94 175 VAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGS 254 (415) T ss_pred CccceeccccccccccccccceeeEeeheeeeeechhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCcccc Confidence 7789999999999964 689999999999999999999999999999999999999999999999999999998765544 Q ss_pred ccccc-cccccceeecccchhHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCceeecc----CCCceecccceE Q lcl|NC_011614. 158 SIAQS-IEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD----RNSDSLDGLPVV 232 (324) Q Consensus 158 ~~~~~-~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~g~~~~~~----~~~~~l~G~pv~ 232 (324) ..... 
........++..+++++.+++.++...++.+++|+|||++|.+|++++|++|+|+|.+ +.+++|+|+||+ T Consensus 255 ~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~ 334 (415) T protein:vir:94 255 TSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIE 334 (415) T ss_pred ccccccccccccccccccchHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHHhhccCCCeeeccCcCCCCCceecceeeE Confidence 33332 2233344456688999999999999999999999999999999999999999999853 345689999999 Q ss_pred eecCccC---CCceEEEeeccc-EEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceE Q lcl|NC_011614. 233 NLKSSNL---KRGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFA 308 (324) Q Consensus 233 ~~~~~~~---~~~~i~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~ 308 (324) ++++.+. ++..+++|||+. ++++.+.+++++.++ |.++++.+|++.|+|+.+.+|+||+ T Consensus 335 ~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~~v~~~~-----------------~~~~~~~~r~~~r~d~~~~~~~a~~ 397 (415) T protein:vir:94 335 ILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTD-----------------YMHFGECLMIAVRQDCRILDYKSAI 397 (415) T ss_pred EecccccCCCCccEEEEEehhccEEEEeecceEEEEec-----------------cccCceEEEEEEEeccEEeccccEE Confidence 8876543 345699999997 567888999988764 4567788999999999999999999 Q ss_pred EEEeeccCCCCccccC Q lcl|NC_011614. 309 KLVPADAKPSSVPGEV 324 (324) Q Consensus 309 ~l~~~~~~~~~~~~~~ 324 (324) +++..+. ++-+|+. T Consensus 398 ~~~~~~~--~~~~~~~ 411 (415) T protein:vir:94 398 VIEYDDS--ERGEGDL 411 (415) T ss_pred EEEEecc--CCCCCcc Confidence 9996533 3335555 No 74 >protein:vir:1383 Length: 421 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612835;genbank:gi:20065969;genbank:GeneID:935826 Probab=100.00 E-value=1e-52 Score=305.57 Aligned_cols=294 Identities=11% Similarity=0.081 Sum_probs=240.7 Q ss_pred CchhhHHHHHHHHHhhccchhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCceEEEEEeCCc Q lcl|NC_011614. 
1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKP 80 (324) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~ 80 (324) +++.+.......++.+......+.|+. .++++||.+||++++++|++.+++.++++++++++++.++...+|+....+ T Consensus 90 ~~~~~~~~~~~~~~~~~~~~~~~~ra~--~t~~~gg~liP~~~~~~Ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~ 167 (421) T protein:vir:13 90 EEKRSLQLSAMSKTIRGIQLSEEERDI--MSSTNNGAVIPQEFVNEFEKLKEGYPSLKEHCHVIPVNRNAGKMPVRAGAS 167 (421) T ss_pred HHHHHHHHHHHHHhhhccchhHHHhhc--cccCCcceecchhhHHHHHHHHHhhhhhhhhceeeeccCCceEEEEeecCC Confidence 333333332333333344444455553 455678899999999999999999999999999999999999999987654 Q ss_pred c--eeeecccccccccccceeeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhccCcCcCCcc Q lcl|NC_011614. 81 G--AYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKS 158 (324) Q Consensus 81 ~--a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~~~~~~ 158 (324) . +.|++|++.+++++++|+++++++++++++++||+|+++|+.++++++|.+.|++++.+++|.++++. +.+ T Consensus 168 ~~~~~~~~E~~~~~~s~~~f~~i~~~~~k~~~~v~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~i~~~------~~g 241 (421) T protein:vir:13 168 VDKLANLAKDTELVKAMLKTQPMAYDIDDYGLLAPIDNSLLEDSEINFLEFVNEEFAEFAVNTENAEIVKQ------AKA 241 (421) T ss_pred ccceeeccccccccccccceeEEEeeeeeeEeehhhhHHHHhhhHHHHHHHHHHHHHHHHHHHhhhhHhhh------hhh Confidence 4 56799999999999999999999999999999999999999999999999999999999999988752 111 Q ss_pred cccccccccceeecccchhHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCceeecc---CCCceecccceEeec Q lcl|NC_011614. 
159 IAQSIEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD---RNSDSLDGLPVVNLK 235 (324) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~g~~~~~~---~~~~~l~G~pv~~~~ 235 (324) ++ ..++..+++++++++.++..+++.+++|+|||.+|.+|++++|++|+|+|.+ +.+++|+|+||++++ T Consensus 242 ~~--------~~~~~~~~d~i~~~~~~l~~~~~~~a~~v~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~tl~G~pV~~~~ 313 (421) T protein:vir:13 242 VL--------AEETINDYAGLVKTINSLVPNARKRAIIVTNSDGRAYLDGLMDKQGRPLLKELSDGGDLVFKGRPVIELE 313 (421) T ss_pred cc--------ccccccchHHHHHHHHHhhhhhcCCCEEEEcHHHHHHHHHhhcCCCceeecCcCCCCCceecceeeEEec Confidence 11 1223457999999999999999999999999999999999999999999864 445689999999887 Q ss_pred CccC---CCceEEEeeccc-EEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEE Q lcl|NC_011614. 236 SSNL---KRGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLV 311 (324) Q Consensus 236 ~~~~---~~~~i~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~ 311 (324) +.+. +...+++|||+. +.++.+++++++.+++. +|.+|++.||++.|+|+++.+++||+.++ T Consensus 314 ~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~v~~~~~~--------------~f~~~~~~~r~~~r~d~~~~~~~a~~~~~ 379 (421) T protein:vir:13 314 ESIFDVGDETKFIVSDFKTLIKFMDRKQYLIDQSKEA--------------GYTKNETIARIIERFDVNSPLDKSSDAEK 379 (421) T ss_pred cccccCCCceEEEEEeccccEEEEEecceEEEeeccc--------------ccccCeeEEEEEeeecceeecchhhheee Confidence 5443 455799999997 67899999999998764 49999999999999999999999988877 Q ss_pred eeccCCCCccccC Q lcl|NC_011614. 312 PADAKPSSVPGEV 324 (324) Q Consensus 312 ~~~~~~~~~~~~~ 324 (324) ....+.-++..+. 
T Consensus 380 ~~~~~a~v~~~~~ 392 (421) T protein:vir:13 380 IRKFGVIVKLQEV 392 (421) T ss_pred ecccceeeccccc Confidence 6655544444333 No 75 >protein:vir:101607 Length: 379 # NCBI annotation: major capsid protein precursor # Family: family:all:585 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112497;genbank:gi:53793597;uniprot:Q5ZGF6;genbank:GeneID:3101715 Probab=100.00 E-value=7e-52 Score=301.02 Aligned_cols=290 Identities=10% Similarity=0.040 Sum_probs=233.6 Q ss_pred CchhhHHHHHHHH---H--hhccc-hhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCceEEE Q lcl|NC_011614. 1 MEQTQKLKLNLQH---F--ASNNV-KPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFT 74 (324) Q Consensus 1 m~~~~~~~~~~~~---~--~~~~~-~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip 74 (324) ...+...+...+. . .+... .....+ ...++++++++++|+++..+|++.+.+.++++++++++++.++.+.|| T Consensus 76 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~ip~~~~~~ii~~~~~~~~i~~~~~~~~~~~~~~~~~ 154 (379) T protein:vir:10 76 DKSDSLVKSITENFNDIKEVRNGKSIQVKAV-GDMTLPVNLTGAQPKDYNFDVVLNPSQMLNVSDIVGAVSISGGTYTFV 154 (379) T ss_pred ccchhHHHHHHHHHHhHHHHHhhhhhhhhhh-cccccCCCCccccchhhhhHHHHhHHhhhhHHhhceeeeccCCceEEE Confidence 1111111111110 0 00110 011111 233556677788999999999999999999999999999999999999 Q ss_pred EEeCC--cceeeecccccccccccceeeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhccCc Q lcl|NC_011614. 75 FWADK--PGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGN 152 (324) Q Consensus 75 ~~~~~--~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~ 152 (324) +.++. 
..+.|++||+.+|+++++|+++++.++|++++++||+|+++|++ +++++|.+.|++++++++|.+++.|+++ T Consensus 155 ~~~~~~~~~~~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~D~~-~l~~~i~~~la~~~~~~~~~~~~~g~~~ 233 (379) T protein:vir:10 155 RENGAGEGAIGAQVEGATKGQKDYDISMIDVNTDFIAGFTRYSKKMANNLP-FLTSFIPNALRRDYAKAENAAFNAVLAA 233 (379) T ss_pred EeecCCCcccccccCCccccccccceeeeEeeeeeEEeeehhhHHHHhhHH-HHHHHHHHHHHHHHHHHHHHHHhccccc Confidence 98754 45678999999999999999999999999999999999999986 6999999999999999999999988775 Q ss_pred CcCCcccccccccccceeecccchhHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCceeeccC------CCcee Q lcl|NC_011614. 153 NPFGKSIAQSIEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDR------NSDSL 226 (324) Q Consensus 153 ~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~g~~~~~~~------~~~~l 226 (324) +.... ....++..+++++.+++.++..+++.+++|+|||.+|..|+++||++|+|++... .+.+| T Consensus 234 ~~~~~---------~~~~~~~~~~d~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~l 304 (379) T protein:vir:10 234 NATAS---------TEIITNKNKVEMLINEIAKQENLDFPVTAIVLRPTDYYDILVTQKSVGAGYGLPGVVTQDNGVLRI 304 (379) T ss_pred ccccc---------cccccCcccHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhccCCceeccCCccCCCCCccee Confidence 42111 1122345568899999999999999999999999999999999999999998642 33589 Q ss_pred cccceEeecCccCCCceEEEeecccEEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccc Q lcl|NC_011614. 227 DGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKA 306 (324) Q Consensus 227 ~G~pv~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a 306 (324) +|+||++++. ++.+.+++|||+.+.+..+++++++++++.. 
++|++|++.||++.|+|+.|.+|+| T Consensus 305 ~G~pvv~s~~--~~ag~~~~gdf~~~~~~~~~~~~i~~~~~~~------------~~f~~~~~~~r~~~R~~~~v~~p~a 370 (379) T protein:vir:10 305 NGIPLFRATW--LAANKYYVGDWTRVTKVTTEGLSLEFSEVEG------------TNFVKNNITARIEAQVALAVEQPAA 370 (379) T ss_pred cceeeEecCC--CCCCceEEeecccEEEEEEeceEEEEeeccc------------ccccCCcEEEEEEEEeccEEecCcc Confidence 9999998764 4567899999999999999999999987642 3599999999999999999999999 Q ss_pred eEEEEeecc Q lcl|NC_011614. 307 FAKLVPADA 315 (324) Q Consensus 307 ~~~l~~~~~ 315 (324) |++++.++- T Consensus 371 ~v~~~~~~~ 379 (379) T protein:vir:10 371 LIFGDFTAV 379 (379) T ss_pred EEEEEecCC Confidence 999998766 No 76 >protein:vir:98635 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039923;genbank:gi:126011098;genbank:GeneID:4818471 Probab=100.00 E-value=2.4e-52 Score=303.59 Aligned_cols=299 Identities=11% Similarity=0.069 Sum_probs=239.1 Q ss_pred CchhhH-----HHHHHHHHhh-----ccchhhhhc----cccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeec Q lcl|NC_011614. 1 MEQTQK-----LKLNLQHFAS-----NNVKPQVFN----PDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPM 66 (324) Q Consensus 1 m~~~~~-----~~~~~~~~~~-----~~~~~~~~~----a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~ 66 (324) |+.-.+ .+.+.+.... .....+|.+ .....+.+++|.+||+++.++|++.+.+.++++++|+++++ T Consensus 39 ~~~~~~~~~~~~~~e~~~~~~~~~~~~~lt~ee~~~~~~~~~~~~~~~gg~~vP~~~~~~I~~~l~~~s~i~~~~~v~~~ 118 (377) T protein:vir:98 39 FTTMGDEILAKNEEEMERMFDLRDKNRELTAEEIKFFNDIDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNT 118 (377) T ss_pred HHhHHHHHHHHHHHHHHHHHHhccCCcccCHHHHHHHHHHHhccCCCCCccccCHHHHHHHHHHHHHhhhhhhheeeEec Confidence 110000 0000110000 111222222 12335566788999999999999999999999999999998 Q ss_pred CCCceEEEEEeCCcceeeeccccccc-ccccceeeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011614. 
67 EGTEKKFTFWADKPGAYWVGEGQKIE-TSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEA 145 (324) Q Consensus 67 ~~~~~~ip~~~~~~~a~~v~Eg~~~~-~~~~~~~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a 145 (324) ++. .++|+.++.+.|.|++|+++.+ +++++|+++++.++|++++++||+||++||.+++++||.+.|++++++++|.+ T Consensus 119 ~~~-~~~~~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~a~~~is~elL~ds~~~ie~~i~~~la~~~a~~~~~a 197 (377) T protein:vir:98 119 SLR-LKALTAETSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWIKQFITEQLKEAIAVALELA 197 (377) T ss_pred Ccc-eEEEEecCCcceeEeecccccCcccCccceeEeecceeEEeeecccHHhhhccHhHHHHHHHHHHHHHHHHHHhhc Confidence 764 6899999999999999988765 57899999999999999999999999999999999999999999999999999 Q ss_pred HHhccCcCcCCcccccccccccc-------eeecccchhHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCceee Q lcl|NC_011614. 146 GILNQGNNPFGKSIAQSIEKTNK-------VIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERI 218 (324) Q Consensus 146 ~l~g~g~~~~~~~~~~~~~~~~~-------~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~g~~~~ 218 (324) |++|+|++ +|.|++........ ..+.....+.+.++...+...++.+++|+||..++..+++++|.+|+++| T Consensus 198 ~i~G~G~~-qP~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~a~~~m~~~t~~~~~klkd~~G~~i~ 276 (377) T protein:vir:98 198 IVKGDGLL-QPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLTPDNAPKKLVPVMKHLSVNDKKRPLKIAGQVKL 276 (377) T ss_pred eEeccCCC-cceeeeecccccccccccccccccccchhhhHhhhhhhchhHHHHHHHHHHHHHHHHHHhhhhccCCceEE Confidence 99999976 67777654322111 11112234678888889999999999999999999999999999999998 Q ss_pred c------------------cCCCceecccceEeecCccCCCceEEEeecccEEEEEecceEEEEeecccccccccccccc Q lcl|NC_011614. 219 Y------------------DRNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTP 280 (324) Q Consensus 219 ~------------------~~~~~~l~G~pv~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~ 280 (324) . .+.+.+++|+|+.+..+..+++..+++|||++|+++.+++++++.+++. 
T Consensus 277 ~~n~~~~~~~~p~~~~~~~~G~~~t~lg~p~~vv~s~~~p~~~i~fgdf~~Y~i~~r~~~~i~~~~~~------------ 344 (377) T protein:vir:98 277 ILNPEDRWALEAQFTSRNQFGEYVTVLPHGITILESLAVETGKAIAFVANRYDAFMATASTIEEYDQT------------ 344 (377) T ss_pred EecccchhhccccccccCCCCccccccCCCceEEecCCCCcccEEEEEecceeEEeecceEEEeechh------------ Confidence 3 2334578999987666777888899999999999999999999998765 Q ss_pred hhhhhcCcEEEEEEEEeccEEecccceEEEEeecc Q lcl|NC_011614. 281 VNLFEQDMVALRATMHVALHIADDKAFAKLVPADA 315 (324) Q Consensus 281 ~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~~ 315 (324) +|.+|++.||+..|+|+++++++||++++.+.- T Consensus 345 --~~~~d~~~f~~~~r~dg~~~~~~a~~vl~i~~~ 377 (377) T protein:vir:98 345 --FAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) T ss_pred --hhhcCceEEEEEEEEcCEEeccCcEEEEEEecC Confidence 489999999999999999999999999998654 No 77 >protein:vir:4092 Length: 390 # NCBI annotation: major capsid protein a # Family: family:all:635 # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510986;swissprot:trembl:q8w604;genbank:gi:17488508;uniprot:Q8W604;genbank:GeneID:1260361 Probab=100.00 E-value=1.8e-51 Score=298.82 Aligned_cols=303 Identities=11% Similarity=0.062 Sum_probs=231.1 Q ss_pred Cchhh------HHHHH----HHHHhh------ccchhhhhc----cccccccCCCcceechhhhHHHHHHHHhhcchhhh Q lcl|NC_011614. 1 MEQTQ------KLKLN----LQHFAS------NNVKPQVFN----PDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQL 60 (324) Q Consensus 1 m~~~~------~~~~~----~~~~~~------~~~~~~~~~----a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l 60 (324) |.+.. ..+.. .+.+.. 
......+.+ .....+++++|.+||++++++|++.+++.++++++ T Consensus 38 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~r~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~~~~~s~i~~~ 117 (390) T protein:vir:40 38 MAEQIQNNIIAQARKEVNREMNDNNVLASRGANALTSDESKYYNEVIAGNGFAGVTALLPPTVFERVFEDLTVEHPLLSK 117 (390) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCchhccHHHHHHHHHHHhccCcccCcccccHHHHHHHHHHHHhhhhhhhh Confidence 10000 00000 000000 011111111 11224566789999999999999999999999999 Q ss_pred ceeeecCCCceEEEEEeCCcceeeecccccccc-cccceeeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHH Q lcl|NC_011614. 61 GKYEPMEGTEKKFTFWADKPGAYWVGEGQKIET-SKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFY 139 (324) Q Consensus 61 ~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~-~~~~~~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~ 139 (324) +++++++++...+|+.++.+.+.|++|++++++ ++++|++++++++|++++++||+|+++|+.++++++|.+.|+++++ T Consensus 118 ~~~~~~~~~~~~i~~~~~~~~a~~~~E~~~~~~~~~~~f~~i~l~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~~i~ 197 (390) T protein:vir:40 118 INFVNTTATTEWIISVGDVATAWWGPLCAEIKEVLDNGFDKIQTGMYKLSAYIPVCNAMLDLGPSWLDQYVRTILGEAMA 197 (390) T ss_pred ceeeecCCceeEEEEEcCCcceeeeccccccCccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHH Confidence 999999999999999999999999999998874 6899999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhccCcCcCCccccccccccc-----ceeecccchhHHHHHHHHhhhh-------ccCCCEEEEcHHHHH--- Q lcl|NC_011614. 140 KKFDEAGILNQGNNPFGKSIAQSIEKTN-----KVIKGDFTQDNIIDLEALLEDD-------ELEANAFISKTQNRS--- 204 (324) Q Consensus 140 ~~~d~a~l~g~g~~~~~~~~~~~~~~~~-----~~~~~~~~~~~i~~~~~~l~~~-------~~~~~~~v~~~~~~~--- 204 (324) +++|++||+|+|++ .|.|++....... ......+++++..++...+... ...+++|+|||.++. 
T Consensus 198 ~~~~~a~l~G~G~~-~P~Gil~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~~~~~a~~i~n~~t~~~~l 276 (390) T protein:vir:40 198 LGLEAGIVNGSGKD-QPIGMMRDLNNVTAGEHPVKTATPLTDLTPATLATKVMLPLTDNGKKSVSDAILVINPADYWSKI 276 (390) T ss_pred HHHHhhhhcccCCC-ccceeeeccccccccccccccccccchhhHHHHHHHHHHHhhcchhhhhcCceEEEcchhHHHHH Confidence 99999999999975 4667665433222 1223345666666666655443 345688999998842 Q ss_pred -HHHHhhccCCceeeccCCCceecccceEeecCccCCCceEEEeecccEEEEEecceEEEEeecccccccccccccchhh Q lcl|NC_011614. 205 -LLRKIVDPETKERIYDRNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNL 283 (324) Q Consensus 205 -~L~~l~d~~g~~~~~~~~~~~l~G~pv~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~ 283 (324) .++.++|.+|+|++.. .++|+||+.++. ++++.+++|||++++++.+++++++++++. + T Consensus 277 ~~~~~~~d~~G~~v~~~----~~~g~pvv~~~~--~p~~~i~~Gd~s~~~i~~~~~~~v~~~~~~--------------~ 336 (390) T protein:vir:40 277 YAATSYMTPQGVWVTGI----LPVPLEIVQSVA--VPVGKAVAGRAKDYFMGIGSEQVIRTSTEY--------------R 336 (390) T ss_pred HHHhhccCCCCcccccc----CCCceeEEEcCC--CCCCcEEEEeeceEEEEeecceEEEecchh--------------h Confidence 4457899999998633 458999998654 456789999999999999999999988754 5 Q ss_pred hhcCcEEEEEEEEeccEEecccceEEEEeeccCCC--CccccC Q lcl|NC_011614. 
284 FEQDMVALRATMHVALHIADDKAFAKLVPADAKPS--SVPGEV 324 (324) Q Consensus 284 f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~~~~~--~~~~~~ 324 (324) |.+|++.||+..|+|+++.+|+||++++.++..++ .+|..+ T Consensus 337 f~~~~~~~r~~~r~dg~v~~~~A~~~l~~~~~~~~~~~~~~~~ 379 (390) T protein:vir:40 337 LLDDETLYYAKQYANGRPKDNSSFLVFDITGLEGSPAIDVNVV 379 (390) T ss_pred hhcCcEEEEEEEEeCCEEecccceEEEEeeccCCCCCCCccee Confidence 89999999999999999999999999987777554 222222 No 78 >protein:vir:9704 Length: 394 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795466;genbank:gi:28876225;genbank:GeneID:1257769 Probab=100.00 E-value=1.3e-50 Score=294.07 Aligned_cols=285 Identities=14% Similarity=0.085 Sum_probs=224.6 Q ss_pred CchhhHHHHHHHH-------------H----hh-ccchhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhce Q lcl|NC_011614. 1 MEQTQKLKLNLQH-------------F----AS-NNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGK 62 (324) Q Consensus 1 m~~~~~~~~~~~~-------------~----~~-~~~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~ 62 (324) ++....+....+. + .. ......+... ...++.+||.+||+++++.|++.+++.++++++++ T Consensus 85 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~t~~~gg~liP~~~~~~ii~~~~~~~~l~~~~~ 163 (394) T protein:vir:97 85 KTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQK-DGIKKENAKPVSSEEILYTPAREVKTVVDLKPFTT 163 (394) T ss_pred HHHHHHHHHHHHHHHHHhhhhhhhhhHHHHHHHHHhhhhhhhhc-cccccccccccChHHHHHHHHHHhhhhhhhhhhce Confidence 1111111111000 0 00 0111111111 23456678899999999999999999999999999 Q ss_pred eeecCCCceEEEEEeC-Ccceeeecccccccc-cccceeeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHH Q lcl|NC_011614. 63 YEPMEGTEKKFTFWAD-KPGAYWVGEGQKIET-SKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYK 140 (324) Q Consensus 63 ~~~~~~~~~~ip~~~~-~~~a~~v~Eg~~~~~-~~~~~~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~ 140 (324) ++++.++...+|+... 
...+.|++||+++|+ ++++|++|++.++|++++++||+|+++|+.++++++|.+.|++++++ T Consensus 164 ~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~i~is~ell~ds~~~~~~~i~~~la~~~~~ 243 (394) T protein:vir:97 164 VYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQIKVN 243 (394) T ss_pred eeeccCcceEEEEEecCCCccceecccccccccccccceeEEeehhheeeehhhHHHHHhhhhHHHHHHHHHHHHHHHHH Confidence 9999999899999764 467899999999997 57999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHhccCcCcCCcccccccccccceeecccchhHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCceeecc Q lcl|NC_011614. 141 KFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD 220 (324) Q Consensus 141 ~~d~a~l~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~g~~~~~~ 220 (324) ++|.++++|.++.. ..+..+++++++++......+ .++.|+|||.++.+|++++|++|+|+|.. T Consensus 244 ~~~~~i~~g~~~~~---------------~~~~~~~~~~~~~~~~~~~~~-~~a~~v~n~~~~~~l~~lkd~~G~~i~~~ 307 (394) T protein:vir:97 244 TTNDAIAKVLKSFT---------------TKTVKNLDEIKALLNGGFDPA-YNVSLIVSQSFYQTLDTLKDGNGRYLLQD 307 (394) T ss_pred HHHHHHhhcccccc---------------ccccccHHHHHHHHHhhhhhh-hCCEEEEcHHHHHHHHHhhccCCCeeeec Confidence 99999999865432 123457888988887654433 36889999999999999999999999854 Q ss_pred ----CCCceecccceEeecCccCCCceEEEeeccc-EEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEE Q lcl|NC_011614. 221 ----RNSDSLDGLPVVNLKSSNLKRGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATM 295 (324) Q Consensus 221 ----~~~~~l~G~pv~~~~~~~~~~~~i~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~ 295 (324) +.+++|+|+||+++++...+...+++|||+. +.++.+++++++.+++ ..+...+|++. 
T Consensus 308 ~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~~~~~~~-----------------~~~~~~~~~~~ 370 (394) T protein:vir:97 308 DITAVSGKVLLGKPVFVLSDEVLGANKAFIGDFKRGVLFADRKDLGLRWADN-----------------EIYGQYLQAVL 370 (394) T ss_pred CcCCCCCceeccceeEEecccccCCccEEEeeccccEEEEEecceEEEEecc-----------------cccceeEEEEE Confidence 3456899999999888889999999999987 5788899999987654 23456799999 Q ss_pred EeccEEecccceEEEEeeccCCCCccc Q lcl|NC_011614. 296 HVALHIADDKAFAKLVPADAKPSSVPG 322 (324) Q Consensus 296 r~d~~v~~~~a~~~l~~~~~~~~~~~~ 322 (324) |+|+.+.+|+||++++..+. ++|= T Consensus 371 r~d~~v~~~~a~~~~~~~~~---~~p~ 394 (394) T protein:vir:97 371 RFGVSKVDDKAGYYVTFTPE---PLPL 394 (394) T ss_pred EEccEEecccceEEEEeccc---ccCC Confidence 99999999999999998533 2333 No 79 >protein:vir:100172 Length: 394 # NCBI annotation: putative major head protein # Family: family:all:21 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025031;genbank:gi:48697264;genbank:GeneID:2948270 Probab=100.00 E-value=3.5e-50 Score=291.73 Aligned_cols=293 Identities=12% Similarity=0.107 Sum_probs=235.2 Q ss_pred Cchhh--HHHHHHHHHhhccchhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCceEEEEEeC Q lcl|NC_011614. 1 MEQTQ--KLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWAD 78 (324) Q Consensus 1 m~~~~--~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~ 78 (324) +...+ ..+...++|.+......+.++ ...++++||.+||++++++|++.+++.+++++++++++++++...+|+... T Consensus 84 ~~~~~~~~~~~~~~~~l~~~~~~~~~~~-~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~ 162 (394) T protein:vir:10 84 LKKKPIDAKKKAINDFIHSHGKVIDNAA-GHVTSTEAGVLIPEEIIYDPTAEVNSVVDLSTLVTKTPVTTPKGTYPILKR 162 (394) T ss_pred hhhhHHHHHHHHHHHHHhccchhhhhhh-cccccccCceeccHHHHHHHHHHHHhhhhhhhhceeeeccCCceEEEEEec Confidence 22211 122334555555444444443 346677888999999999999999999999999999999999899998765 Q ss_pred -Ccceeeecccccccc-cccceeeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhccCcCcCC Q lcl|NC_011614. 
79 -KPGAYWVGEGQKIET-SKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFG 156 (324) Q Consensus 79 -~~~a~~v~Eg~~~~~-~~~~~~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~~~~ 156 (324) ...+.|++|++++++ ++++|++|++.++|++++++||+|+++||.++++++|.+.|++++++++|.++++|+|++.. T Consensus 163 ~~~~~~~~~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~il~g~g~~~~- 241 (394) T protein:vir:10 163 ATDRFSSVAELAENPALAEPEFEQVDWSVSTYRGAIPLSEEAIADSAVDLTSLVGQSINEKSVNTYNAMIAPVLQSFTA- 241 (394) T ss_pred CCCccccccccccccccccccceeEEeeeeeeEeeehhHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhhccccccc- Confidence 467899999999996 67999999999999999999999999999999999999999999999999999999876432 Q ss_pred cccccccccccceeecccchhHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCceeeccC--------CCceecc Q lcl|NC_011614. 157 KSIAQSIEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDR--------NSDSLDG 228 (324) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~g~~~~~~~--------~~~~l~G 228 (324) ....+..+++++.+++.......+ +++|+|||+++.+|++++|++|+|+|... .+++|+| T Consensus 242 -----------~~~~~~~~~d~l~~~~~~~~~~~~-~a~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~L~G 309 (394) T protein:vir:10 242 -----------KATTTDTLVDSLKHILNVDLDPAY-SRALVVTQSLFNTLDTLKDKNGRYLLHDASDSITDGTAKGTVLG 309 (394) T ss_pred -----------ccccccccHHHHHHHHHhhhhhhc-cCEEEecHHHHHHHHHhhccCCCeeeeccccccccCCccccccc Confidence 122345678899988764444333 58999999999999999999999998542 2358999 Q ss_pred cceEeecCcc----CCCceEEEeeccc-EEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEec Q lcl|NC_011614. 229 LPVVNLKSSN----LKRGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIAD 303 (324) Q Consensus 229 ~pv~~~~~~~----~~~~~i~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~ 303 (324) +||+++++.. .++..+++|||+. ++++.++++++.++++.. |. 
..+|++.|+|+++.+ T Consensus 310 ~PV~~~~~~~~~~~~~~~~i~~gd~s~~~~~~~~~~~~v~~~~~~~--------------~~---~~~~~~~r~d~~~~~ 372 (394) T protein:vir:10 310 VPVYVVGDALLGSAAGDQKAFVGDLKRGVLFADRQQVTLAWEDSKI--------------YG---RYLGAAFRFGVKQAD 372 (394) T ss_pred ceeEEecccccCCCCCceEEEEeeccccEEEEeecceEEEEecccc--------------cc---eeEEEEEEeccEEec Confidence 9999875432 3455699999997 677888999998876532 33 458999999999999 Q ss_pred ccceEEEEeeccCCCCccccC Q lcl|NC_011614. 304 DKAFAKLVPADAKPSSVPGEV 324 (324) Q Consensus 304 ~~a~~~l~~~~~~~~~~~~~~ 324 (324) |+||+.++.+++.++++-|.= T Consensus 373 ~~ai~~~~~~~~~~~~~~~~~ 393 (394) T protein:vir:10 373 SNAGYFVTNTDAASGSTSGTG 393 (394) T ss_pred cccEEEEEeecccCCCCCCCC Confidence 999999999888888775544 No 80 >protein:vir:78640 Length: 352 # NCBI annotation: phage capsid # Family: family:all:658 # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429943;genbank:gi:156603997;genbank:GeneID:5525386 Probab=100.00 E-value=4.6e-51 Score=296.54 Aligned_cols=294 Identities=13% Similarity=0.109 Sum_probs=225.7 Q ss_pred Cchhh----HHHHHHHHH------hhc-cchhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCC Q lcl|NC_011614. 1 MEQTQ----KLKLNLQHF------ASN-NVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGT 69 (324) Q Consensus 1 m~~~~----~~~~~~~~~------~~~-~~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~ 69 (324) +.... .+..+.|+. .+. .....+.++.+..++++||.+||+++.++|++.++++++|+++++++++++ T Consensus 46 ~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~al~~~~~~~gG~lIP~~~~~~Ii~~l~~~s~l~~~~~v~~~~~- 124 (352) T protein:vir:78 46 LNDNEKLVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKG- 124 (352) T ss_pred cchhhhHHHHHHHHHHHHhhhhHHHHHHhhHHHHHHHhccCCCCCCceeccHhHHHHHHHHHHhhcchhhheeeEecCC- Confidence 11101 111111111 111 111234455566677888999999999999999999999999999998865 Q ss_pred ceEEEEEeC-CcceeeecccccccccccceeeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHH-HHH Q lcl|NC_011614. 
70 EKKFTFWAD-KPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDE-AGI 147 (324) Q Consensus 70 ~~~ip~~~~-~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~-a~l 147 (324) ..+|+.+. .+++.|++|++.+++++++|+++++.++|++++++||+|+++||.++++++|.+.|+++++++++. .+. T Consensus 125 -~~~p~~~~~~~~a~~v~E~~~~~~~~~~f~~v~~~~~k~~~~i~is~ell~Ds~~~l~~~i~~~la~~~~~~e~~~~~~ 203 (352) T protein:vir:78 125 -LEIPRVSYTLDDDDFITDVETAKELKLKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALA 203 (352) T ss_pred -ceEEEEecCCCcccccccccccccccccceeeeecceeEEeechhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhhhh Confidence 35777654 468999999999999999999999999999999999999999999999999999999999998655 444 Q ss_pred hccCcCcCCcccccccccccceeecccchhHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCceeeccCCCceec Q lcl|NC_011614. 148 LNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDRNSDSLD 227 (324) Q Consensus 148 ~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~g~~~~~~~~~~~l~ 227 (324) .|+|++.+...+... . ....++..++|++++++..+..+|+.+++|+||+.++..|.++++.+|++++. +.+.+|+ T Consensus 204 ~g~g~~~~~g~l~~~-~--~~~~t~~~~~d~i~~~~~~l~~~~~~~a~~~mn~~t~~~l~~~~~~~~~~~~~-~~~~~ll 279 (352) T protein:vir:78 204 VSPKSGLEHMSFYNG-S--VKEVEGANMYDAIINALADLHEDYRDNATIYMRYADYVKIISVLSNGTTNFFD-TPAEKVF 279 (352) T ss_pred cCCCCcccccceecc-c--cccccccchHHHHHHHHhccChhhhcCCEEEEehHHHHHHHHHHhccCCcccc-cCCcccc Confidence 666655433322221 1 12234555799999999999999999999999999999999999989999875 4467899 Q ss_pred ccceEeecCccCCCceEEEeecccEEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccce Q lcl|NC_011614. 228 GLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAF 307 (324) Q Consensus 228 G~pv~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~ 307 (324) |+||+++++ ...+++|||+++++. 
+.++.++.+++ ..++++.||+..|+|+++.+|+|| T Consensus 280 G~PV~~~~~----~~~~~~Gdf~~~~~~-~~~~~~~~~~~----------------~~~g~~~f~~~~r~Dg~~~~~eA~ 338 (352) T protein:vir:78 280 GKPVVFTDA----AVKPIVGDFNYFGIN-YDGTTYDTDKD----------------VKKGEYLFVLTAWYDQQRTLDSAF 338 (352) T ss_pred ccceEEecC----CCceeEeehhhhhhh-hhhheeeeecc----------------ccCCeeEEEEEeeeCceeechhhe Confidence 999998764 345889999988765 45566655543 236899999999999999999999 Q ss_pred EEEEeeccCCCCccc Q lcl|NC_011614. 308 AKLVPADAKPSSVPG 322 (324) Q Consensus 308 ~~l~~~~~~~~~~~~ 322 (324) ++++.++ +++++|. T Consensus 339 ~~l~~~a-~~~~~~~ 352 (352) T protein:vir:78 339 RIAKAKE-STGSLPS 352 (352) T ss_pred EEEEeec-ccCCCCC Confidence 9999864 4455666 No 81 >protein:vir:100884 Length: 389 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358764;genbank:gi:78000028;genbank:GeneID:3726155 Probab=100.00 E-value=4.1e-50 Score=291.36 Aligned_cols=289 Identities=12% Similarity=0.122 Sum_probs=232.1 Q ss_pred CchhhHH--HHHHHHHhhccchhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCceEEEEEeC Q lcl|NC_011614. 1 MEQTQKL--KLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWAD 78 (324) Q Consensus 1 m~~~~~~--~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~ 78 (324) |...+.. +...++|.+... .+.++....++++||.+||+++.++|++.++++++++++|++++++++...+|+... T Consensus 83 ~~~~~~~~~~~~~~~~lr~~~--~~~~~~~~~t~~~gg~~vP~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~ 160 (389) T protein:vir:10 83 LSKKPIDAKKKAINDFIHSHG--KVIDATSKVTSTEAGVLIPEEIIYDPTAEVNSVVDLSTLVTKTPVTTPKGTYPILKR 160 (389) T ss_pred cchhHHHHHHHHHHHHhhcch--hhhhhhcccccCCcceeehHHHHHHHHHHHHhhhhHHhhcceeeccCCeeEEEEEec Confidence 3222211 122333433322 233444556777889999999999999999999999999999999999899998875 Q ss_pred -Ccceeeecccccccc-cccceeeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhccCcCcCC Q lcl|NC_011614. 
79 -KPGAYWVGEGQKIET-SKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFG 156 (324) Q Consensus 79 -~~~a~~v~Eg~~~~~-~~~~~~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~~~~ 156 (324) ...+.|++|++++++ ++++|+++++.++|+++++++|+|+++||.++++++|.+.|++++++++|.+|++|.+++.. T Consensus 161 ~~~~~~~~~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~i~~g~~~~~~- 239 (389) T protein:vir:10 161 ATDRFSSVAELAENPKLAEPEFNKVDWSVATYRGAIPLSEEAIADSAVDLTALVGQSIKEKSVNTYNAMIAPVLQSFTA- 239 (389) T ss_pred CCCccccccccccccccccccceeeeeeheeeEeeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhhhhccccc- Confidence 456689999999985 78999999999999999999999999999999999999999999999999999998765421 Q ss_pred cccccccccccceeecccchhHHHHHHH-HhhhhccCCCEEEEcHHHHHHHHHhhccCCceeeccC--------CCceec Q lcl|NC_011614. 157 KSIAQSIEKTNKVIKGDFTQDNIIDLEA-LLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDR--------NSDSLD 227 (324) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~i~~~~~-~l~~~~~~~~~~v~~~~~~~~L~~l~d~~g~~~~~~~--------~~~~l~ 227 (324) ...++..+++++.++++ .+..++ +++|+|||.++.+|++++|++|+|+|..+ .+++|+ T Consensus 240 -----------~~~~~~~~~d~l~~~~~~~~~~~~--~a~~~~n~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~~~~~l~ 306 (389) T protein:vir:10 240 -----------KKTTTDTLVDSLKHILNVDLDPAY--SRALVVTQSLFNTLDTLKDKNGRYLLHDASDSITDGTAKGTIL 306 (389) T ss_pred -----------ccccccccHHHHHHHHHhhhhhhh--CcEEEecHHHHHHHHHhhccCCCeeeecCcccccccccccccc Confidence 22345678899998876 444444 58999999999999999999999998533 235799 Q ss_pred ccceEeecCcc----CCCceEEEeeccc-EEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEe Q lcl|NC_011614. 228 GLPVVNLKSSN----LKRGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIA 302 (324) Q Consensus 228 G~pv~~~~~~~----~~~~~i~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~ 302 (324) |+||+++++.. .++..+++|||+. +.++++++++++++++.. |. ..+|+..|+|+++. 
T Consensus 307 G~pV~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~--------------~~---~~~~~~~r~d~~~~ 369 (389) T protein:vir:10 307 GVPVYVVGDTLLGSLAGDQKAFVGDLKRGVLFTDRQQVTLAWEDSKI--------------YG---KYLGAAFRFGVQKA 369 (389) T ss_pred cceeEEecccccCCCCCceEEEEeeccccEEEEeecceEEEeecccc--------------cc---ceEEEEEEeccEEe Confidence 99998865432 2445699999997 679999999999887542 32 45899999999999 Q ss_pred cccceEEEEeeccCCCCcccc Q lcl|NC_011614. 303 DDKAFAKLVPADAKPSSVPGE 323 (324) Q Consensus 303 ~~~a~~~l~~~~~~~~~~~~~ 323 (324) +|+||++++. +..++++||+ T Consensus 370 ~~~a~~~~~~-~~~~~~~~~~ 389 (389) T protein:vir:10 370 DSKAGYFVTN-TDVPGSALGK 389 (389) T ss_pred cccceEEEEe-eccCCCCCCC Confidence 9999999987 4778888999 No 82 >protein:vir:94673 Length: 419 # NCBI annotation: major capsid protein # Family: family:all:585 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579208;genbank:gi:93007444;genbank:GeneID:5076792 Probab=100.00 E-value=6.2e-50 Score=290.36 Aligned_cols=301 Identities=12% Similarity=0.058 Sum_probs=229.9 Q ss_pred Cc-hhhHHHHHHHHH--------hhcc-chhhhhcccccc-ccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCC Q lcl|NC_011614. 1 ME-QTQKLKLNLQHF--------ASNN-VKPQVFNPDNVM-MHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGT 69 (324) Q Consensus 1 m~-~~~~~~~~~~~~--------~~~~-~~~~~~~a~~~~-~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~ 69 (324) .. +...++...... .... ......+..... .+..++.++|..+...|...+.....++++++++++.++ T Consensus 87 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~i~~~~~~~~~~~~ 166 (419) T protein:vir:94 87 RFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNADYN 166 (419) T ss_pred hhhhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhccccccccccCCcccccchhhhHHHHHHHhhhhhhhhcceeeeccCC Confidence 00 001111111110 0000 001111112222 233344456666667777777888899999999999998 Q ss_pred ceEEEEEeC--------CcceeeecccccccccccceeeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHH Q lcl|NC_011614. 
70 EKKFTFWAD--------KPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKK 141 (324) Q Consensus 70 ~~~ip~~~~--------~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~ 141 (324) .+.+|+.++ ...++|++||+.+|+++++|+++++.++|++++++||+|+++|+. +++++|.++|+++++++ T Consensus 167 ~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~~-~l~~~i~~~la~a~~~~ 245 (419) T protein:vir:94 167 VLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDNS-QLMGYIQGRLTYGLRFL 245 (419) T ss_pred ceeeeeeccccccccccCcccceecCCccccccccceeeEEeeeeeEEEeehhhHHHHHhHH-HHHHHHHHHHHHHHHHH Confidence 888887653 345789999999999999999999999999999999999999874 79999999999999999 Q ss_pred HHHHHHhccCcCcCCcccccccccc-------cceeecccchhHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCC Q lcl|NC_011614. 142 FDEAGILNQGNNPFGKSIAQSIEKT-------NKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPET 214 (324) Q Consensus 142 ~d~a~l~g~g~~~~~~~~~~~~~~~-------~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~g 214 (324) +|.+||+|+|++ .|.|++...... ....++...++++.+++..+...+..+++|+|||+++..|++++|.+| T Consensus 246 ~d~aii~G~G~~-~p~Gi~~~~~~~~~~~~~~~~~~t~~~~~~~l~~~~~~~~~~~~~~~~~v~n~~~~~~l~~~k~~~~ 324 (419) T protein:vir:94 246 RDRQLLNGNGST-EMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESIELDQAPGS 324 (419) T ss_pred HHHHHHhccCcc-cccceecccccccccccccccccccchhHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHHhhcCC Confidence 999999999975 455655432221 112334557899999999999999999999999999999999999877 Q ss_pred ceee-c----cCCCceecccceEeecCccCCCceEEEeeccc-EEEEEecceEEEEeecccccccccccccchhhhhcCc Q lcl|NC_011614. 215 KERI-Y----DRNSDSLDGLPVVNLKSSNLKRGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDM 288 (324) Q Consensus 215 ~~~~-~----~~~~~~l~G~pv~~~~~~~~~~~~i~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~ 288 (324) ++++ + .+.+++|+|+||++++. ++++.+++|||+. +.++.+++++++++++.. 
++|++|+ T Consensus 325 ~~~~~~~~~~~~~~~~l~G~pV~~~~~--~~~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~------------~~~~~~~ 390 (419) T protein:vir:94 325 GVFRVIANVQGEATPRIWGLNVVSTVA--IAQGTALVGGFRQGATLWSRQGITVLMTDSHA------------DFFTANT 390 (419) T ss_pred CceeecCCcccCCCccccceeeEEcCC--CCCccEEEeeccceEEEEEecceEEEEecccc------------chhhcCc Confidence 7543 3 34467999999998764 5677899999997 467889999999887643 3699999 Q ss_pred EEEEEEEEeccEEecccceEEEEeeccCC Q lcl|NC_011614. 289 VALRATMHVALHIADDKAFAKLVPADAKP 317 (324) Q Consensus 289 v~~r~~~r~d~~v~~~~a~~~l~~~~~~~ 317 (324) ++||++.|+|+++.+|+||++++.+++.+ T Consensus 391 ~~~r~~~r~d~~v~~~~a~~~~~~~aa~~ 419 (419) T protein:vir:94 391 LVILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) T ss_pred EEEEEEEeeccEEeccccEEEEEeccCCC Confidence 99999999999999999999999987777 No 83 >protein:vir:96978 Length: 387 # NCBI annotation: ORF009 # Family: family:all:658 # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239859;genbank:gi:66395517;genbank:GeneID:5133011 Probab=100.00 E-value=1.7e-50 Score=293.38 Aligned_cols=294 Identities=12% Similarity=0.106 Sum_probs=224.7 Q ss_pred CchhhHHHHHHHHHhhc------c-chhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCceEE Q lcl|NC_011614. 1 MEQTQKLKLNLQHFASN------N-VKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKF 73 (324) Q Consensus 1 m~~~~~~~~~~~~~~~~------~-~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~i 73 (324) +...+.+..++|++... . ....+.++....++++||.+||++++++|++.++++++++++++++++++. .+ T Consensus 85 ~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~--~~ 162 (387) T protein:vir:96 85 EKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKGL--EI 162 (387) T ss_pred HHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhhhccCCCCCCceeechhHHHHHHHHHHhhchhhhhceeeecCCc--ee Confidence 11111122223332211 1 112334455566677789999999999999999999999999999998764 57 Q ss_pred EEEe-CCcceeeecccccccccccceeeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHH-hccC Q lcl|NC_011614. 
74 TFWA-DKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGI-LNQG 151 (324) Q Consensus 74 p~~~-~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l-~g~g 151 (324) |+.. ..+.+.|++||+.+++++++|+++++.++|++++++||+|+++||.+++++||.+.|+++++++++..+| .|+| T Consensus 163 p~~~~~~~~a~~v~Eg~~~~~~~~~f~~v~l~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g 242 (387) T protein:vir:96 163 PRVSYTLDDDDFITDVETAKELKAKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPK 242 (387) T ss_pred eeeeccCCccccccccccccccccccceeeechheeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCC Confidence 7765 4578999999999999999999999999999999999999999999999999999999999999776544 5555 Q ss_pred cCcCCcccccccccccceeecccchhHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCceeeccCCCceecccce Q lcl|NC_011614. 152 NNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDRNSDSLDGLPV 231 (324) Q Consensus 152 ~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~g~~~~~~~~~~~l~G~pv 231 (324) ++. +.+++.... ....++..++|++++++..+..+|+.++.|+||+.++..+.++++..|++++. +.+++|+|+|| T Consensus 243 ~g~-~~g~~~~~~--~~~~~~~~~~d~i~~~~~~l~~~y~~na~~imn~~t~~~~~~~~~~~~~~~~~-~~~~~llG~PV 318 (387) T protein:vir:96 243 SGL-EHMSFYNGS--VKEVEGADMYDAIINALADLHEDYRDNATIYMRYADYVKIISVLSNGTTNFFD-TPAEKVFGKPV 318 (387) T ss_pred ccc-cceeeeccc--cccccccchHHHHHHHHhccChhhhcCCEEEEechHHHHHHHHHhcCCCcccc-cCCccccccce Confidence 543 333332221 22234566799999999999999999999999999988887777777777765 45679999999 Q ss_pred EeecCccCCCceEEEeecccEEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEE Q lcl|NC_011614. 232 VNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLV 311 (324) Q Consensus 232 ~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~ 311 (324) +++++. .++++|||+++++. 
+.++.++.+++ ..++++.||+..|+|+++.+|+||++++ T Consensus 319 ~~~~~~----~~~~~GDf~~~~~~-~~~~~~~~~~~----------------~~~~~~~~~~~~r~Dg~v~~~~A~~~l~ 377 (387) T protein:vir:96 319 VFTDAA----VKPIVGDFNYFGIN-YDGTTYDTDKD----------------VKKGEYLFVLTAWYDQQRTLDSAFRIAK 377 (387) T ss_pred EEecCC----Cceeeechhhhhhh-hhhhhheeccc----------------ccCCceEEEEEEEeCcEeechhheEEEE Confidence 997643 36899999987665 44555554443 2368999999999999999999999999 Q ss_pred eeccCCCCccc Q lcl|NC_011614. 312 PADAKPSSVPG 322 (324) Q Consensus 312 ~~~~~~~~~~~ 322 (324) .+++. ++||- T Consensus 378 ~ka~~-~~~~~ 387 (387) T protein:vir:96 378 AKENT-GPLPS 387 (387) T ss_pred eecCC-CCCCC Confidence 86444 34444 No 84 >protein:vir:94424 Length: 387 # NCBI annotation: ORF010 # Family: family:all:658 # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240005;genbank:gi:66395666;genbank:GeneID:5133084 Probab=100.00 E-value=1.7e-50 Score=293.38 Aligned_cols=294 Identities=12% Similarity=0.106 Sum_probs=224.7 Q ss_pred CchhhHHHHHHHHHhhc------c-chhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCceEE Q lcl|NC_011614. 1 MEQTQKLKLNLQHFASN------N-VKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKF 73 (324) Q Consensus 1 m~~~~~~~~~~~~~~~~------~-~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~i 73 (324) +...+.+..++|++... . ....+.++....++++||.+||++++++|++.++++++++++++++++++. .+ T Consensus 85 ~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~--~~ 162 (387) T protein:vir:94 85 EKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKGL--EI 162 (387) T ss_pred HHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhhhccCCCCCCceeechhHHHHHHHHHHhhchhhhhceeeecCCc--ee Confidence 11111122223332211 1 112334455566677789999999999999999999999999999998764 57 Q ss_pred EEEe-CCcceeeecccccccccccceeeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHH-hccC Q lcl|NC_011614. 
74 TFWA-DKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGI-LNQG 151 (324) Q Consensus 74 p~~~-~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l-~g~g 151 (324) |+.. ..+.+.|++||+.+++++++|+++++.++|++++++||+|+++||.+++++||.+.|+++++++++..+| .|+| T Consensus 163 p~~~~~~~~a~~v~Eg~~~~~~~~~f~~v~l~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g 242 (387) T protein:vir:94 163 PRVSYTLDDDDFITDVETAKELKAKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPK 242 (387) T ss_pred eeeeccCCccccccccccccccccccceeeechheeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCC Confidence 7765 4578999999999999999999999999999999999999999999999999999999999999776544 5555 Q ss_pred cCcCCcccccccccccceeecccchhHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCceeeccCCCceecccce Q lcl|NC_011614. 152 NNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDRNSDSLDGLPV 231 (324) Q Consensus 152 ~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~g~~~~~~~~~~~l~G~pv 231 (324) ++. +.+++.... ....++..++|++++++..+..+|+.++.|+||+.++..+.++++..|++++. +.+++|+|+|| T Consensus 243 ~g~-~~g~~~~~~--~~~~~~~~~~d~i~~~~~~l~~~y~~na~~imn~~t~~~~~~~~~~~~~~~~~-~~~~~llG~PV 318 (387) T protein:vir:94 243 SGL-EHMSFYNGS--VKEVEGADMYDAIINALADLHEDYRDNATIYMRYADYVKIISVLSNGTTNFFD-TPAEKVFGKPV 318 (387) T ss_pred ccc-cceeeeccc--cccccccchHHHHHHHHhccChhhhcCCEEEEechHHHHHHHHHhcCCCcccc-cCCccccccce Confidence 543 333332221 22234566799999999999999999999999999988887777777777765 45679999999 Q ss_pred EeecCccCCCceEEEeecccEEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEE Q lcl|NC_011614. 232 VNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLV 311 (324) Q Consensus 232 ~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~ 311 (324) +++++. .++++|||+++++. 
+.++.++.+++ ..++++.||+..|+|+++.+|+||++++ T Consensus 319 ~~~~~~----~~~~~GDf~~~~~~-~~~~~~~~~~~----------------~~~~~~~~~~~~r~Dg~v~~~~A~~~l~ 377 (387) T protein:vir:94 319 VFTDAA----VKPIVGDFNYFGIN-YDGTTYDTDKD----------------VKKGEYLFVLTAWYDQQRTLDSAFRIAK 377 (387) T ss_pred EEecCC----Cceeeechhhhhhh-hhhhhheeccc----------------ccCCceEEEEEEEeCcEeechhheEEEE Confidence 997643 36899999987665 44555554443 2368999999999999999999999999 Q ss_pred eeccCCCCccc Q lcl|NC_011614. 312 PADAKPSSVPG 322 (324) Q Consensus 312 ~~~~~~~~~~~ 322 (324) .+++. ++||- T Consensus 378 ~ka~~-~~~~~ 387 (387) T protein:vir:94 378 AKENT-GPLPS 387 (387) T ss_pred eecCC-CCCCC Confidence 86444 34444 No 85 >protein:vir:2685 Length: 387 # NCBI annotation: hypothetical protein # Family: family:all:658 # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075504;genbank:gi:12719433;genbank:GeneID:920169 Probab=100.00 E-value=1.7e-50 Score=293.38 Aligned_cols=294 Identities=12% Similarity=0.106 Sum_probs=224.7 Q ss_pred CchhhHHHHHHHHHhhc------c-chhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCceEE Q lcl|NC_011614. 1 MEQTQKLKLNLQHFASN------N-VKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKF 73 (324) Q Consensus 1 m~~~~~~~~~~~~~~~~------~-~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~i 73 (324) +...+.+..++|++... . ....+.++....++++||.+||++++++|++.++++++++++++++++++. .+ T Consensus 85 ~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~--~~ 162 (387) T protein:vir:26 85 EKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKGL--EI 162 (387) T ss_pred HHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhhhccCCCCCCceeechhHHHHHHHHHHhhchhhhhceeeecCCc--ee Confidence 11111122223332211 1 112334455566677789999999999999999999999999999998764 57 Q ss_pred EEEe-CCcceeeecccccccccccceeeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHH-hccC Q lcl|NC_011614. 
74 TFWA-DKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGI-LNQG 151 (324) Q Consensus 74 p~~~-~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l-~g~g 151 (324) |+.. ..+.+.|++||+.+++++++|+++++.++|++++++||+|+++||.+++++||.+.|+++++++++..+| .|+| T Consensus 163 p~~~~~~~~a~~v~Eg~~~~~~~~~f~~v~l~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g 242 (387) T protein:vir:26 163 PRVSYTLDDDDFITDVETAKELKAKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPK 242 (387) T ss_pred eeeeccCCccccccccccccccccccceeeechheeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCC Confidence 7765 4578999999999999999999999999999999999999999999999999999999999999776544 5555 Q ss_pred cCcCCcccccccccccceeecccchhHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCceeeccCCCceecccce Q lcl|NC_011614. 152 NNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDRNSDSLDGLPV 231 (324) Q Consensus 152 ~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~g~~~~~~~~~~~l~G~pv 231 (324) ++. +.+++.... ....++..++|++++++..+..+|+.++.|+||+.++..+.++++..|++++. +.+++|+|+|| T Consensus 243 ~g~-~~g~~~~~~--~~~~~~~~~~d~i~~~~~~l~~~y~~na~~imn~~t~~~~~~~~~~~~~~~~~-~~~~~llG~PV 318 (387) T protein:vir:26 243 SGL-EHMSFYNGS--VKEVEGADMYDAIINALADLHEDYRDNATIYMRYADYVKIISVLSNGTTNFFD-TPAEKVFGKPV 318 (387) T ss_pred ccc-cceeeeccc--cccccccchHHHHHHHHhccChhhhcCCEEEEechHHHHHHHHHhcCCCcccc-cCCccccccce Confidence 543 333332221 22234566799999999999999999999999999988887777777777765 45679999999 Q ss_pred EeecCccCCCceEEEeecccEEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEE Q lcl|NC_011614. 232 VNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLV 311 (324) Q Consensus 232 ~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~ 311 (324) +++++. .++++|||+++++. 
+.++.++.+++ ..++++.||+..|+|+++.+|+||++++ T Consensus 319 ~~~~~~----~~~~~GDf~~~~~~-~~~~~~~~~~~----------------~~~~~~~~~~~~r~Dg~v~~~~A~~~l~ 377 (387) T protein:vir:26 319 VFTDAA----VKPIVGDFNYFGIN-YDGTTYDTDKD----------------VKKGEYLFVLTAWYDQQRTLDSAFRIAK 377 (387) T ss_pred EEecCC----Cceeeechhhhhhh-hhhhhheeccc----------------ccCCceEEEEEEEeCcEeechhheEEEE Confidence 997643 36899999987665 44555554443 2368999999999999999999999999 Q ss_pred eeccCCCCccc Q lcl|NC_011614. 312 PADAKPSSVPG 322 (324) Q Consensus 312 ~~~~~~~~~~~ 322 (324) .+++. ++||- T Consensus 378 ~ka~~-~~~~~ 387 (387) T protein:vir:26 378 AKENT-GPLPS 387 (387) T ss_pred eecCC-CCCCC Confidence 86444 34444 No 86 >protein:vir:3870 Length: 400 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680487;swissprot:trembl:q8ltc0;genbank:gi:22296527;interpro:IPR006444;uniprot:Q8LTC0;genbank:GeneID:951713 Probab=100.00 E-value=1.1e-49 Score=288.94 Aligned_cols=283 Identities=13% Similarity=0.089 Sum_probs=224.1 Q ss_pred Cc--------------hh---hHHHHHHHHHhhccchhhhhcc--ccccccCCCcceechhhhHHHHHHHHhhcchhhhc Q lcl|NC_011614. 1 ME--------------QT---QKLKLNLQHFASNNVKPQVFNP--DNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLG 61 (324) Q Consensus 1 m~--------------~~---~~~~~~~~~~~~~~~~~~~~~a--~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~ 61 (324) +. .. +..+...+.+........+.+. ....++++||.+||+++.++|++.+++.+++++++ T Consensus 89 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~ 168 (400) T protein:vir:38 89 HSYRDALNAYLHTRGRNTDGVNFEKTDVGTFAVLRAVPTDASDAVNAGVKAADAASTIPETISNTPQRELQTVVDLKPFT 168 (400) T ss_pred hhHHHHHHHHHhhHHHHHHHHHHHHHHHHHHhhhhhhhHHHHHHHhhcccccCCcccccHHHHHHHHHHHHhhhhhhhcc Confidence 00 00 0011111112222222222221 12245667889999999999999999999999999 Q ss_pred eeeecCCCceEEEEEeC-Ccceeeecccccccc-cccceeeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHH Q lcl|NC_011614. 
62 KYEPMEGTEKKFTFWAD-KPGAYWVGEGQKIET-SKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFY 139 (324) Q Consensus 62 ~~~~~~~~~~~ip~~~~-~~~a~~v~Eg~~~~~-~~~~~~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~ 139 (324) ++++++++...+|+... .+.+.|++|++.+++ ++++|+++++.++|++++++||+|+++||.++++++|.+.|+++++ T Consensus 169 ~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~i~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~~~~ 248 (400) T protein:vir:38 169 NVFQASTQKGTYPTVANATTKMVTVAELEKNPAMAKPEFKPVNWSVETYRQALPVSQESIDDSAIDLVGLIAQNGQQIKV 248 (400) T ss_pred eeEeccCcceEEEEEecCCCccccccccccccccccccceeeEeehhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHH Confidence 99999999999999874 467899999999986 6899999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhccCcCcCCcccccccccccceeecccchhHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCceeec Q lcl|NC_011614. 140 KKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIY 219 (324) Q Consensus 140 ~~~d~a~l~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~g~~~~~ 219 (324) .++|.++++|+|+... .+..+++++.+++....+.. .+++|+|||.++.+|++++|++|+|+|. T Consensus 249 ~~~~~~i~~~~~~~~~---------------~~~~~~~~~~~~~~~~~~~~-~~a~~v~~~~~~~~l~~lkd~~G~~i~~ 312 (400) T protein:vir:38 249 NTTNGAVATLLKGFTA---------------KTISSVDDLKHINNVDLDPA-YSRVIIASQSFYNFLDTVKDGNGRYLLQ 312 (400) T ss_pred HHHHHhhhhccccccc---------------cccccHHHHHHHHHhhhhhh-hCcEEEEcHHHHHHHHHhhccCCCeeee Confidence 9999999998775421 23456888888877554433 3689999999999999999999999985 Q ss_pred c----CCCceecccceEeecCcc---CCCceEEEeeccc-EEEEEecceEEEEeecccccccccccccchhhhhcCcEEE Q lcl|NC_011614. 220 D----RNSDSLDGLPVVNLKSSN---LKRGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVAL 291 (324) Q Consensus 220 ~----~~~~~l~G~pv~~~~~~~---~~~~~i~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~ 291 (324) + +.+++|+|+||+++++.+ .++..+++|||+. +.++.+.+++++++++. 
.+...+ T Consensus 313 ~~~~~~~~~~l~G~pv~~~~~~~~~~~g~~~~~~gd~s~~~~~~~~~~~~~~~~~~~-----------------~~~~~~ 375 (400) T protein:vir:38 313 DSILTPSGKSVLGMPIAVVSDDTLGAAGEAHAFLGDIKRAILFANRADFMVRWVDDQ-----------------IYGQFL 375 (400) T ss_pred cCcCCCCccccccceeEEecccccCCCCceEEEEEeccccEEEEeecceEEEEeccc-----------------ccceeE Confidence 4 345689999999987644 3466799999997 56788999999887653 234579 Q ss_pred EEEEEeccEEecccceEEEEeeccC Q lcl|NC_011614. 292 RATMHVALHIADDKAFAKLVPADAK 316 (324) Q Consensus 292 r~~~r~d~~v~~~~a~~~l~~~~~~ 316 (324) |+..|+|+.+.+|+||++|+.++.+ T Consensus 376 ~~~~r~d~~~~~~~a~~~l~~~~~a 400 (400) T protein:vir:38 376 QAGMRFGVSVADEKAGYFLTYTPKA 400 (400) T ss_pred EEEEEeccEEecccceEEEEeecCC Confidence 9999999999999999999998777 No 87 >protein:vir:1084 Length: 437 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076738;genbank:gi:13095848;genbank:GeneID:920418 Probab=100.00 E-value=1.1e-49 Score=289.01 Aligned_cols=294 Identities=11% Similarity=0.049 Sum_probs=228.2 Q ss_pred CchhhHHHHHHHHH----hhccchhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCceEEEEE Q lcl|NC_011614. 1 MEQTQKLKLNLQHF----ASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFW 76 (324) Q Consensus 1 m~~~~~~~~~~~~~----~~~~~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~ 76 (324) .+..........+. ........+.++....++.++|.+||+++.+.| ..+.+.+.++.++++++++++...+|+. T Consensus 126 ~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~g~lvp~~~~~~i-~~~~~~~~l~~~~~~~~~~~~~~~~~~~ 204 (437) T protein:vir:10 126 QDMKLKVGGEIADKKVTAFADYLKTGEVRDVTGIALKDGKVIIPETILTPE-KEVHQFPRLGSLVRTESVTTTTGKLPIF 204 (437) T ss_pred hHHHHHHHHHHHHhhhhhhHHHHHhhhhhhhhhcccccccccchHHHHHHH-HHhhhhhhhhhcceeEeeccCceeeEEe Confidence 00000000000000 011122234455555677889999999998866 4568888999999999999988999998 Q ss_pred eC-Ccceeeecccccccc-cccceeeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhccCcCc Q lcl|NC_011614. 
77 AD-KPGAYWVGEGQKIET-SKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNP 154 (324) Q Consensus 77 ~~-~~~a~~v~Eg~~~~~-~~~~~~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~~ 154 (324) .. .+.+.|++|++.+++ ++++|+++++.++|++++++||+|+++|+.++++++|.+.|++++++++|.+|++|+|++. T Consensus 205 ~~~~~~~~~~~e~~~~~e~~~~~~~~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~~~~~~~~~~i~~g~g~~~ 284 (437) T protein:vir:10 205 NNSTDLLTAHTEYGQTTKNATPVITPILWDLKTYTGGYVFSQELISDSSYDWQAELQSRLIELRDNTDDSLIITALTDGI 284 (437) T ss_pred eccccccccccccccccccccccceeeeeehhheeeehhhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccc Confidence 64 567899999999996 5689999999999999999999999999999999999999999999999999999988653 Q ss_pred CCcccccccccccceeecccchhHHHHHHH-HhhhhccCCCEEEEcHHHHHHHHHhhccCCceeecc----CCCceeccc Q lcl|NC_011614. 155 FGKSIAQSIEKTNKVIKGDFTQDNIIDLEA-LLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD----RNSDSLDGL 229 (324) Q Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~-~l~~~~~~~~~~v~~~~~~~~L~~l~d~~g~~~~~~----~~~~~l~G~ 229 (324) .. .++..+++++.+++. .+..+|..+++|+|||+++..|++++|++|+|+|.+ +.+++|+|+ T Consensus 285 ~~-------------~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~ 351 (437) T protein:vir:10 285 KK-------------TTSTYLLGDLKKVLNVTLKPQDSAAASIVMSQSAYNLFDMATDAMGRPLLQPNVTAATGYTLLGK 351 (437) T ss_pred cc-------------cccccchhhHHHHHHhhhhhhhhcCCEEEEcHHHHHHHHHhhccCCCeeeccCccCCCCcccccc Confidence 21 224456777888664 788889899999999999999999999999999853 345689999 Q ss_pred ceEeecCc-----cCCCceEEEeeccc-EEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEec Q lcl|NC_011614. 230 PVVNLKSS-----NLKRGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIAD 303 (324) Q Consensus 230 pv~~~~~~-----~~~~~~i~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~ 303 (324) ||+++++. ..++..+++|||+. +.++++.+++++.++. 
|..+...+|+..|+|+.+++ T Consensus 352 pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~r~~~~~~~~~~----------------~~~~~~~~~~~~r~d~~~~~ 415 (437) T protein:vir:10 352 TVVIVDDKLFPSASAGDVNIVVAPLKKAVINFKLTEITGQFQDT----------------YDIWYKQLGIFLRQNVVQAS 415 (437) T ss_pred eeEEecccccCCcCCCceEEEEeeccccEEEEeeeceEEEEecc----------------cccccceeeEEEEEccEEec Confidence 99987543 34566699999997 5688899999876542 45566789999999999999 Q ss_pred ccceEEEEeeccCCCCc-cccC Q lcl|NC_011614. 304 DKAFAKLVPADAKPSSV-PGEV 324 (324) Q Consensus 304 ~~a~~~l~~~~~~~~~~-~~~~ 324 (324) |+||++|+.+.++.+++ |+-| T Consensus 416 ~~a~~~l~~~~~~~~~~~~~~~ 437 (437) T protein:vir:10 416 KDLIVNLTGKLKAVTVVQSTAV 437 (437) T ss_pred ccceEEEEeeccccccCCCCCC Confidence 99999999664433333 4455 No 88 >protein:vir:9509 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835556;genbank:gi:30043951;genbank:GeneID:1260537 Probab=100.00 E-value=2e-49 Score=287.63 Aligned_cols=301 Identities=14% Similarity=0.067 Sum_probs=224.5 Q ss_pred Cch-----hhHHHHHHHHHhh-----ccchhh---hhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecC Q lcl|NC_011614. 1 MEQ-----TQKLKLNLQHFAS-----NNVKPQ---VFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPME 67 (324) Q Consensus 1 m~~-----~~~~~~~~~~~~~-----~~~~~~---~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~ 67 (324) ++. ....+.+.+.+.. ...... .+++....++++||++||+++.++|++.+.+.++++++|++++++ T Consensus 37 ~~~~~~~~~~~~~~e~~~~~~~~~~~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~ 116 (381) T protein:vir:95 37 INQLFEETKLQAKAEAERVSSLPKSAQSLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAG 116 (381) T ss_pred HHhhhhhHHHHHHHHHHHHHHhccCcccccHHHHHHHHHHhcccCCCCceecCHHHHHHHHHHHHhhccceeheeeEecC Confidence 111 0111111111111 111111 223444566678899999999999999999999999999999987 Q ss_pred CCceEEEEEeCCcceeeeccccccc-ccccceeeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011614. 
68 GTEKKFTFWADKPGAYWVGEGQKIE-TSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAG 146 (324) Q Consensus 68 ~~~~~ip~~~~~~~a~~v~Eg~~~~-~~~~~~~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~ 146 (324) + ..++|+.++.+.|.|++|+++++ +++++|+++++.++|++++++||+||++|+.+++++||.+.|++++++++|.+| T Consensus 117 ~-~~~i~~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~~~~~is~elL~Ds~~~ie~~i~~~la~~~a~~~~~a~ 195 (381) T protein:vir:95 117 L-RLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAF 195 (381) T ss_pred c-ceEEEEecCCcceeeecccccccccccccceeeeecceeEEeechhhHHHhhcCHHHHHHHHHHHHHHHHHHHhhhee Confidence 6 46899999999999999988876 568999999999999999999999999999999999999999999999999999 Q ss_pred HhccCcCcCCcccccccccccc---------eeecccc-------hhHHHHHHHHhhh-------hccCCCEEEEcHHHH Q lcl|NC_011614. 147 ILNQGNNPFGKSIAQSIEKTNK---------VIKGDFT-------QDNIIDLEALLED-------DELEANAFISKTQNR 203 (324) Q Consensus 147 l~g~g~~~~~~~~~~~~~~~~~---------~~~~~~~-------~~~i~~~~~~l~~-------~~~~~~~~v~~~~~~ 203 (324) ++|+|++ +|.|++........ ...+..+ ++.+.+++..+.. .+..++.|+|||.++ T Consensus 196 i~G~G~~-qP~Gil~~~~~~~~~~~g~~~~~~~~~t~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~mn~~t~ 274 (381) T protein:vir:95 196 LKGTGKD-QPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDA 274 (381) T ss_pred EeccCCC-CceeeeeccCcccccccccccccccccccccccchhhHHHHHHHHHhhccccccccccccCceEEEEccccH Confidence 9999976 56666543321100 1112222 3445555555532 356667899999999 Q ss_pred HHHHHhh---ccCCceeeccCCCceecccceEeecCccCCCceEEEeecccEEEEEecceEEEEeecccccccccccccc Q lcl|NC_011614. 204 SLLRKIV---DPETKERIYDRNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTP 280 (324) Q Consensus 204 ~~L~~l~---d~~g~~~~~~~~~~~l~G~pv~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~ 280 (324) ..|+.++ +.+|+|.+.-+ +|.+|+.+ ..++++.++||||++|.++++++++++++++. 
T Consensus 275 ~~l~~~~~~~~~~G~~v~~l~-----~g~~vv~s--~~~p~~~iifgDfs~Y~i~~r~~~~i~~~~~~------------ 335 (381) T protein:vir:95 275 FEVQAQYTHLNANGVYVTALP-----FNLNVIES--TVQEAGKVLTYVKGLYDGYLAGGINVQKFKET------------ 335 (381) T ss_pred HhhccccccCCCCCceeecCC-----CCceEEec--CCCCcCcEEEEecccEEEEEecccEEEeechh------------ Confidence 9887655 56677764311 34445554 45677889999999999999999999999874 Q ss_pred hhhhhcCcEEEEEEEEeccEEecccceEEEEeeccCC-CCccccC Q lcl|NC_011614. 281 VNLFEQDMVALRATMHVALHIADDKAFAKLVPADAKP-SSVPGEV 324 (324) Q Consensus 281 ~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~~~~-~~~~~~~ 324 (324) +|.+|++.||+..|+|+++++++||++++.+.... .++++.= T Consensus 336 --~~~~d~~~f~a~~r~dg~~~~~~A~~v~~l~~~~~~~~~~~~~ 378 (381) T protein:vir:95 336 --LALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTE 378 (381) T ss_pred --HhhcCCeEEEEEEEEcCEEecCceEEEEEEEecCCCcCccccc Confidence 59999999999999999999999999988655433 3334443 No 89 >protein:vir:101291 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908831;genbank:gi:118725095;genbank:GeneID:4555862 Probab=100.00 E-value=2e-49 Score=287.63 Aligned_cols=301 Identities=14% Similarity=0.067 Sum_probs=224.5 Q ss_pred Cch-----hhHHHHHHHHHhh-----ccchhh---hhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecC Q lcl|NC_011614. 1 MEQ-----TQKLKLNLQHFAS-----NNVKPQ---VFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPME 67 (324) Q Consensus 1 m~~-----~~~~~~~~~~~~~-----~~~~~~---~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~ 67 (324) ++. ....+.+.+.+.. ...... 
.+++....++++||++||+++.++|++.+.+.++++++|++++++ T Consensus 37 ~~~~~~~~~~~~~~e~~~~~~~~~~~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~ 116 (381) T protein:vir:10 37 INQLFEETKLQAKAEAERVSSLPKSAQSLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAG 116 (381) T ss_pred HHhhhhhHHHHHHHHHHHHHHhccCcccccHHHHHHHHHHhcccCCCCceecCHHHHHHHHHHHHhhccceeheeeEecC Confidence 111 0111111111111 111111 223444566678899999999999999999999999999999987 Q ss_pred CCceEEEEEeCCcceeeeccccccc-ccccceeeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011614. 68 GTEKKFTFWADKPGAYWVGEGQKIE-TSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAG 146 (324) Q Consensus 68 ~~~~~ip~~~~~~~a~~v~Eg~~~~-~~~~~~~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~ 146 (324) + ..++|+.++.+.|.|++|+++++ +++++|+++++.++|++++++||+||++|+.+++++||.+.|++++++++|.+| T Consensus 117 ~-~~~i~~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~~~~~is~elL~Ds~~~ie~~i~~~la~~~a~~~~~a~ 195 (381) T protein:vir:10 117 L-RLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAF 195 (381) T ss_pred c-ceEEEEecCCcceeeecccccccccccccceeeeecceeEEeechhhHHHhhcCHHHHHHHHHHHHHHHHHHHhhhee Confidence 6 46899999999999999988876 568999999999999999999999999999999999999999999999999999 Q ss_pred HhccCcCcCCcccccccccccc---------eeecccc-------hhHHHHHHHHhhh-------hccCCCEEEEcHHHH Q lcl|NC_011614. 147 ILNQGNNPFGKSIAQSIEKTNK---------VIKGDFT-------QDNIIDLEALLED-------DELEANAFISKTQNR 203 (324) Q Consensus 147 l~g~g~~~~~~~~~~~~~~~~~---------~~~~~~~-------~~~i~~~~~~l~~-------~~~~~~~~v~~~~~~ 203 (324) ++|+|++ +|.|++........ ...+..+ ++.+.+++..+.. 
.+..++.|+|||.++ T Consensus 196 i~G~G~~-qP~Gil~~~~~~~~~~~g~~~~~~~~~t~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~mn~~t~ 274 (381) T protein:vir:10 196 LKGTGKD-QPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDA 274 (381) T ss_pred EeccCCC-CceeeeeccCcccccccccccccccccccccccchhhHHHHHHHHHhhccccccccccccCceEEEEccccH Confidence 9999976 56666543321100 1112222 3445555555532 356667899999999 Q ss_pred HHHHHhh---ccCCceeeccCCCceecccceEeecCccCCCceEEEeecccEEEEEecceEEEEeecccccccccccccc Q lcl|NC_011614. 204 SLLRKIV---DPETKERIYDRNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTP 280 (324) Q Consensus 204 ~~L~~l~---d~~g~~~~~~~~~~~l~G~pv~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~ 280 (324) ..|+.++ +.+|+|.+.-+ +|.+|+.+ ..++++.++||||++|.++++++++++++++. T Consensus 275 ~~l~~~~~~~~~~G~~v~~l~-----~g~~vv~s--~~~p~~~iifgDfs~Y~i~~r~~~~i~~~~~~------------ 335 (381) T protein:vir:10 275 FEVQAQYTHLNANGVYVTALP-----FNLNVIES--TVQEAGKVLTYVKGLYDGYLAGGINVQKFKET------------ 335 (381) T ss_pred HhhccccccCCCCCceeecCC-----CCceEEec--CCCCcCcEEEEecccEEEEEecccEEEeechh------------ Confidence 9887655 56677764311 34445554 45677889999999999999999999999874 Q ss_pred hhhhhcCcEEEEEEEEeccEEecccceEEEEeeccCC-CCccccC Q lcl|NC_011614. 281 VNLFEQDMVALRATMHVALHIADDKAFAKLVPADAKP-SSVPGEV 324 (324) Q Consensus 281 ~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~~~~-~~~~~~~ 324 (324) +|.+|++.||+..|+|+++++++||++++.+.... 
.++++.= T Consensus 336 --~~~~d~~~f~a~~r~dg~~~~~~A~~v~~l~~~~~~~~~~~~~ 378 (381) T protein:vir:10 336 --LALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTE 378 (381) T ss_pred --HhhcCCeEEEEEEEEcCEEecCceEEEEEEEecCCCcCccccc Confidence 59999999999999999999999999988655433 3334443 No 90 >protein:vir:93881 Length: 387 # NCBI annotation: ORF011 # Family: family:all:658 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239938;genbank:gi:66395599;genbank:GeneID:5130947 Probab=100.00 E-value=9.1e-50 Score=289.45 Aligned_cols=294 Identities=12% Similarity=0.113 Sum_probs=222.8 Q ss_pred CchhhH----HHHHHHHHhhc-------cchhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCC Q lcl|NC_011614. 1 MEQTQK----LKLNLQHFASN-------NVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGT 69 (324) Q Consensus 1 m~~~~~----~~~~~~~~~~~-------~~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~ 69 (324) +....+ +..++|++... .....+.++.+..++++||.+||+++.++|++.++++++++++++++++++. T Consensus 81 ~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~al~~~t~s~gG~~IP~~~~~~Ii~~~~~~~~l~~~~~v~~~~~~ 160 (387) T protein:vir:93 81 LNDHEKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKGL 160 (387) T ss_pred cchhhHHHHHHHHHHHHHhhhhhhhhhhhhhHHHHHhhccCcCCCCceeechhHHHHHHHHHHhhchhhhheeeeecCCc Confidence 111110 11123322211 1122355666667777889999999999999999999999999999998754 Q ss_pred ceEEEEEe-CCcceeeecccccccccccceeeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHH- Q lcl|NC_011614. 70 EKKFTFWA-DKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGI- 147 (324) Q Consensus 70 ~~~ip~~~-~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l- 147 (324) .+|+.. 
....+.|++|++..++++++|+++++.++|++++++||+|+++||.++++++|.+.|+++++++++..+| T Consensus 161 --~~p~~~~~~~~a~~v~E~~~~~~~~~~f~~v~~~~~k~~~~~~iS~ell~Ds~~~l~~~i~~~la~~~~~~e~~~~~~ 238 (387) T protein:vir:93 161 --EIPRVSYTLDDDDFITDVETAKELKLKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALA 238 (387) T ss_pred --eEEEEeecCCccccccCcccccccccccceeeeeheeeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhh Confidence 578755 4577999999999999999999999999999999999999999999999999999999999999876554 Q ss_pred hccCcCcCCcccccccccccceeecccchhHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCceeeccCCCceec Q lcl|NC_011614. 148 LNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDRNSDSLD 227 (324) Q Consensus 148 ~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~g~~~~~~~~~~~l~ 227 (324) .|+|++. |.+++.... ....++..++|+++++++++..+|+.++.|+||+.++..+.++++.+|++++. +.+.+|+ T Consensus 239 ~g~g~g~-p~g~l~~~~--~~~v~~~~~~d~i~~~~~~l~~~~~~~a~~~mn~~t~~~~~~~~~d~~~~~~~-~~~~~ll 314 (387) T protein:vir:93 239 VSPKSGL-DHMSFYNGS--VKEVEGADMYDAIINALADLHEDYRDNATIYMRYADYVKIISVLSNGTTNFFD-TPAEKVF 314 (387) T ss_pred cCCCccc-cceeeeccc--cccccccchHHHHHHHHhccChhhhcCCEEEEechHHHHHHHHHhcCCCcccc-cCCcccc Confidence 5665553 333333221 12234556799999999999999999999999999987765544444555543 4567999 Q ss_pred ccceEeecCccCCCceEEEeecccEEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccce Q lcl|NC_011614. 228 GLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAF 307 (324) Q Consensus 228 G~pv~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~ 307 (324) |+||+++.+. ..+++|||+++++. 
+.++.++.+++ +.++++.|++..|+|+++.+|+|| T Consensus 315 G~PV~~~~~~----~~~~~GDf~~~~~~-~~~~~~~~~~~----------------~~~~~~~~~~~~r~d~~v~~~eA~ 373 (387) T protein:vir:93 315 GKPVVFTDAA----VKPIVGDFNYFGIN-YDGTTYDTDKD----------------VKKGEYLFVLTAWYDQQRTLDSAF 373 (387) T ss_pred ccceEEecCC----Cceeeeehhhhhee-hhhheeeeccc----------------ccCCceeEEEEeeeCceeechhhe Confidence 9999997643 35899999998765 45555554433 457899999999999999999999 Q ss_pred EEEEeeccCCCCccc Q lcl|NC_011614. 308 AKLVPADAKPSSVPG 322 (324) Q Consensus 308 ~~l~~~~~~~~~~~~ 322 (324) ++++.++++ +++|. T Consensus 374 ~~l~~k~~~-~~~~~ 387 (387) T protein:vir:93 374 RIAKAKENT-GSLPS 387 (387) T ss_pred EEEEeecCC-CCCCC Confidence 999987544 44555 No 91 >protein:vir:9361 Length: 402 # NCBI annotation: SLT orf 37-like protein # Family: family:all:658 # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803339;genbank:gi:29028650;genbank:GeneID:1258088 Probab=100.00 E-value=4.7e-50 Score=291.00 Aligned_cols=294 Identities=12% Similarity=0.112 Sum_probs=222.9 Q ss_pred Cchh--hHHHHHHHHHh------hcc-chhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCce Q lcl|NC_011614. 1 MEQT--QKLKLNLQHFA------SNN-VKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEK 71 (324) Q Consensus 1 m~~~--~~~~~~~~~~~------~~~-~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~ 71 (324) ++.+ +.+..++|++. ... ....+.++....++++||.+||++++++|++.++++++++++++++++++ . T Consensus 98 ~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~a~~~~t~~~GG~lIP~~~~~~Ii~~~~~~~~l~~~~~v~~~~~--~ 175 (402) T protein:vir:93 98 DNEKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKG--L 175 (402) T ss_pred hhHHHHHHHHHHHHHHHhhhhHHHHHHhHHHHHhhhccCCCcCCccccchhHHHHHHHhHHhhhhhhhhceeeecCC--c Confidence 1110 01111122211 111 11234445556677778999999999999999999999999999999875 4 Q ss_pred EEEEEe-CCcceeeecccccccccccceeeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHH-hc Q lcl|NC_011614. 
72 KFTFWA-DKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGI-LN 149 (324) Q Consensus 72 ~ip~~~-~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l-~g 149 (324) .+|+.. ..+.+.|++||+.+++++++|+++++.+++++++++||+|+++||.++++++|.+.|+++++++++..+| .| T Consensus 176 ~~p~~~~~~~~a~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~i~iS~ell~Ds~~~l~~~i~~~la~~~~~~e~~~~~~~g 255 (402) T protein:vir:93 176 EIPRVSYTLDDDDFITDVETAKELKAKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVS 255 (402) T ss_pred eeeeeeccCCccccccccccccccccccceeeecceeeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcC Confidence 578765 4577999999999999999999999999999999999999999999999999999999999999776544 55 Q ss_pred cCcCcCCcccccccccccceeecccchhHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCceeeccCCCceeccc Q lcl|NC_011614. 150 QGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDRNSDSLDGL 229 (324) Q Consensus 150 ~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~g~~~~~~~~~~~l~G~ 229 (324) +|++. +.+++.... ....++..++|+|++++.++..+|+.++.|+||+.++..+.++++.+|++++. +.+.+|+|+ T Consensus 256 ~g~g~-p~g~~~~~~--~~~~~~~~~~d~l~~~~~~l~~~y~~na~~imn~~t~~~~~~~~~d~~~~~~~-~~~~~llG~ 331 (402) T protein:vir:93 256 PKSGL-EHMSFYNGS--VKEVEGADMYDAIINALADLHEDYRDNATIYMRYADYVKIISVLSNGTTNFFD-TPAEKVFGK 331 (402) T ss_pred CCccc-cceeeeccc--cccccccchHHHHHHHHhccChhhhcCCEEEEechHHHHHHHHHhcCCCcccc-cCCcccccc Confidence 55543 334333221 22234556799999999999999999999999999988877777667777764 456789999 Q ss_pred ceEeecCccCCCceEEEeecccEEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEE Q lcl|NC_011614. 230 PVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAK 309 (324) Q Consensus 230 pv~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~ 309 (324) ||+++++. .++++|||+++++.. 
.++.++.++++ .++++.||+..|+|+++.+|+||++ T Consensus 332 PV~~t~~~----~~i~~GDf~~~~~~~-~~~~~~~~~~~----------------~~~~~~~~~~~r~Dg~v~~~~A~~~ 390 (402) T protein:vir:93 332 PVVFTDAA----VKPIVGDFNYFGINY-DGTTYDTDKDV----------------KKGEYLFVLTAWYDQQRTLDSAFRI 390 (402) T ss_pred ceEEecCC----Cceeeechhhhhhhh-hhhhhhhhhcc----------------cCCceEEEEEEEeCcEEechhheEE Confidence 99997643 368999999876543 34444443332 3589999999999999999999999 Q ss_pred EEeeccCCCCccc Q lcl|NC_011614. 310 LVPADAKPSSVPG 322 (324) Q Consensus 310 l~~~~~~~~~~~~ 322 (324) |+.+++ .++||. T Consensus 391 l~ik~~-~~~~~~ 402 (402) T protein:vir:93 391 AKAKEN-TGPLPS 402 (402) T ss_pred EEeecC-CCCCCC Confidence 998754 556666 No 92 >protein:vir:100632 Length: 381 # NCBI annotation: 77ORF006 # Family: family:all:635 # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958606;genbank:gi:41189521;genbank:GeneID:2743778 Probab=100.00 E-value=1.6e-49 Score=288.15 Aligned_cols=301 Identities=14% Similarity=0.046 Sum_probs=223.2 Q ss_pred Cc----------------------hhhHHHHHHHHHh-----hccchh---hhhccccccccCCCcceechhhhHHHHHH Q lcl|NC_011614. 1 ME----------------------QTQKLKLNLQHFA-----SNNVKP---QVFNPDNVMMHEKKDGTLLNDFTTPILQE 50 (324) Q Consensus 1 m~----------------------~~~~~~~~~~~~~-----~~~~~~---~~~~a~~~~~~~~~g~lip~~~~~~i~~~ 50 (324) |+ ...+.+.+.+... ...+.. +.+++.+..++.+||++||+++.++|++. T Consensus 20 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~l~~~e~~~~~~~~~~t~~~Gg~lvP~~~~~~I~~~ 99 (381) T protein:vir:10 20 VNNGEPQERQNELYGDMINQLFEETKLQAKAEAERVSSLPKSAQTLSANQRNFFMDINKSVGYKEEKLLPEETIDRIFED 99 (381) T ss_pred HHhhhHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHhcccccccCHHHHHHHHHHhhcCCCCCceecCHHHHHHHHHH Confidence 00 0000111111000 011111 12234556677788999999999999999 Q ss_pred HHhhcchhhhceeeecCCCceEEEEEeCCcceeeeccccccc-ccccceeeEEeeeeeEEEeehhHHHHHhcChhHHHHH Q lcl|NC_011614. 
51 VMENSKIMQLGKYEPMEGTEKKFTFWADKPGAYWVGEGQKIE-TSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEE 129 (324) Q Consensus 51 ~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~-~~~~~~~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~ 129 (324) +.+.|+++++|+++++++ ..++|+.+..+.|.|++|+++.+ +++++|+++++.++|++++++||+||++|+.+++++| T Consensus 100 l~~~spir~~a~v~~~~~-~~~i~~~~~~~~a~W~~e~~~~~~~~~~~f~~i~l~~~kl~a~i~is~elL~Ds~~~le~~ 178 (381) T protein:vir:10 100 LTTNHPLLADLGIKNAGL-RLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERF 178 (381) T ss_pred HHhhcceeeeeeeEecCc-ceEEEeecCCcceEEeecccccccccCccceeEeecceeEEeeccccHHHHhccHHHHHHH Confidence 999999999999999865 56899999889999999987765 6789999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHhccCcCcCCccccccccccc---------ceeecccchhHHHHHHHHhh------------- Q lcl|NC_011614. 130 MKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTN---------KVIKGDFTQDNIIDLEALLE------------- 187 (324) Q Consensus 130 v~~~l~~ai~~~~d~a~l~g~g~~~~~~~~~~~~~~~~---------~~~~~~~~~~~i~~~~~~l~------------- 187 (324) |.+.|++++++++|.+|++|+|++ +|.|++....... ....+.+++.++..++..+. T Consensus 179 i~~~la~~~a~~~~~afi~GdG~~-qP~Gil~~~~~~~~~~~g~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~~~~~ 257 (381) T protein:vir:10 179 VRVQIEEAFAVALETAFLKGTGKD-QPIGLNRQVQKGVSVTDGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGK 257 (381) T ss_pred HHHHHHHHHHHHhhceeEecccCC-CceeeeecCCccccccccccccccccccccccchhhHHHHHHHHHHhhhhhhccc Confidence 999999999999999999999976 5666654322111 11122334444433332221 Q ss_pred -hhccCCCEEEEcHHHHHHHHHhh---ccCCceeeccCCCceecccceEeecCccCCCceEEEeecccEEEEEecceEEE Q lcl|NC_011614. 
188 -DDELEANAFISKTQNRSLLRKIV---DPETKERIYDRNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYK 263 (324) Q Consensus 188 -~~~~~~~~~v~~~~~~~~L~~l~---d~~g~~~~~~~~~~~l~G~pv~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~i~ 263 (324) ..+..++.|+|||.++..|+.++ +++|+|++..+ +|+|++.++ .++++.++||||++|.++++++++++ T Consensus 258 ~~~~~~~~~~vmn~~t~~~l~~~~~~~~~~G~~v~~lp-----~g~~vv~~~--~~p~~~i~fGDfs~Y~i~~r~~~~i~ 330 (381) T protein:vir:10 258 SVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTALP-----FNLNVIEST--VQEAGKVLTYVKGLYDGYLAGGINVQ 330 (381) T ss_pred cccccCceEEEEchhhHHhhccccccCCCCCceeecCC-----CCceeEEcC--CCCcCcEEEEEcccEEEEEecccEEE Confidence 13456678999999999887544 77888875422 466777654 45677899999999999999999999 Q ss_pred EeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeeccCCCCc---cccC Q lcl|NC_011614. 264 IDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADAKPSSV---PGEV 324 (324) Q Consensus 264 ~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~---~~~~ 324 (324) .+++. +|.+|++.||+..|+|+++++++||++++.+.....++ |-|- T Consensus 331 ~~~~~--------------~~~~d~~~f~a~~r~dG~~~~~~A~~v~~l~~~~~~~~~~~~~~~ 380 (381) T protein:vir:10 331 KFKET--------------LALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEDTEET 380 (381) T ss_pred eechh--------------hhhcCceEEEEEEEEcCEEecCCcEEEEEEeecCCcccccccccc Confidence 99874 59999999999999999999999999988765553222 2222 No 93 >protein:vir:9643 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795405;genbank:gi:28876178;genbank:GeneID:1257724 Probab=100.00 E-value=2.3e-49 Score=287.24 Aligned_cols=292 Identities=11% Similarity=0.069 Sum_probs=222.2 Q ss_pred Cchhh-----HHHHHHHHHhh-----ccchhhhhc----cccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeec Q lcl|NC_011614. 1 MEQTQ-----KLKLNLQHFAS-----NNVKPQVFN----PDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPM 66 (324) Q Consensus 1 m~~~~-----~~~~~~~~~~~-----~~~~~~~~~----a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~ 66 (324) |+.-+ +.+.+.+.... 
..+..+|.+ .....+.++||.+||+++.++|++.+.+.++++++|+++++ T Consensus 39 ~~~~~~~~~~~~~~e~~~~~~~~~~~~~lt~ee~~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~ 118 (377) T protein:vir:96 39 FTTMGDEILAKNEEEMERMFDLRDKNRELTAEEIKFFNDIDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNT 118 (377) T ss_pred HHHHHHHHHHHHHHHHHHHHHhccCCcccCHHHHHHHHHHHhcCCCCCCceecCHHHHHHHHHHHHhhhhhhhhceeEec Confidence 11000 00111111111 112222222 12235566788999999999999999999999999999998 Q ss_pred CCCceEEEEEeCCcceeeeccccccc-ccccceeeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011614. 67 EGTEKKFTFWADKPGAYWVGEGQKIE-TSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEA 145 (324) Q Consensus 67 ~~~~~~ip~~~~~~~a~~v~Eg~~~~-~~~~~~~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a 145 (324) ++ ..++|+.++.+.|.|++|+++++ .++++|+++++.++|++++++||+||++||.+++++||.+.|++++++++|.+ T Consensus 119 ~~-~~~i~~~~~~~~a~wv~e~~~~~~~~~~~f~~i~l~~~kl~~~~~is~~ll~ds~~~le~~i~~~l~~~~~~~~~~a 197 (377) T protein:vir:96 119 SL-RLKALTAETSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELA 197 (377) T ss_pred CC-ceEEEEecCCcceeEeecccccccccCccceeEeeeeeeEEeechhhHHHhhcchhhHHHHHHHHHHHHHHHHHhhc Confidence 65 57899999899999999988875 57899999999999999999999999999999999999999999999999999 Q ss_pred HHhccCcCcCCcccccccccccce------------------eecccchhHHHHHHHHhhhhcc-----------CCCEE Q lcl|NC_011614. 146 GILNQGNNPFGKSIAQSIEKTNKV------------------IKGDFTQDNIIDLEALLEDDEL-----------EANAF 196 (324) Q Consensus 146 ~l~g~g~~~~~~~~~~~~~~~~~~------------------~~~~~~~~~i~~~~~~l~~~~~-----------~~~~~ 196 (324) |++|+|++ +|.|++......... .....+.+.+.++...+...+. 
.++.| T Consensus 198 ~i~G~G~~-~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~a~~ 276 (377) T protein:vir:96 198 IVKGNGLL-QPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKL 276 (377) T ss_pred eEeccCCC-cceeeeeccccccccccccccccceeeccccccccccCChhHHHHHHHHHHHhhccccccccccccCceEE Confidence 99999976 566776533221100 0112345666666666644432 34579 Q ss_pred EEcHHHHHHHH---HhhccCCceeeccCCCceecccceEeecCccCCCceEEEeecccEEEEEecceEEEEeeccccccc Q lcl|NC_011614. 197 ISKTQNRSLLR---KIVDPETKERIYDRNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTV 273 (324) Q Consensus 197 v~~~~~~~~L~---~l~d~~g~~~~~~~~~~~l~G~pv~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~i~~~~~~~~~~~ 273 (324) +|||.++..+. ...+.+| .+.+++|+|+.+..+..++++.+++|||++|.++++++++++.+++. T Consensus 277 ~mn~~t~~~~~~~~~~~~~~G-------~~~~~l~~p~~v~~s~~~p~~~i~fgdf~~Y~i~~r~~~~i~~~~~~----- 344 (377) T protein:vir:96 277 LLNPEDRWTLEAKFTSRNQFG-------EYVTVLPHGITILESLAVETGKAIAFVANRYDAFMATASTIEEYDQT----- 344 (377) T ss_pred EEchhhHHhccccccccCCCC-------CceeccCCCceEEecCCCCcccEEEEEcCcEEEEEecccEEEeehhh----- Confidence 99999987652 2333333 34478899987777777888889999999999999999999999874 Q ss_pred ccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeecc Q lcl|NC_011614. 
274 KNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADA 315 (324) Q Consensus 274 ~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~~ 315 (324) +|.+|++.||+..|+|+++++++||++++.+-- T Consensus 345 ---------~~~~d~~~f~~~~r~dG~~~d~~a~~vl~l~~~ 377 (377) T protein:vir:96 345 ---------FAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) T ss_pred ---------hhhcCCeEEEEEEEEcCEEecCCcEEEEEEecC Confidence 599999999999999999999999999998654 No 94 >protein:vir:95963 Length: 395 # NCBI annotation: ORF009 # Family: family:all:635 # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239802;genbank:gi:66395459;genbank:GeneID:5132880 Probab=100.00 E-value=6.1e-49 Score=284.93 Aligned_cols=303 Identities=13% Similarity=0.097 Sum_probs=225.1 Q ss_pred Cc--hhhHHHHHHHHHhhc----------cchh---hhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeee Q lcl|NC_011614. 1 ME--QTQKLKLNLQHFASN----------NVKP---QVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEP 65 (324) Q Consensus 1 m~--~~~~~~~~~~~~~~~----------~~~~---~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~ 65 (324) ++ ..+..+.+.+....+ .... +.+++....++.+||.+||++++++|++.+++.+++++++++++ T Consensus 45 ~~~~~~~~~~~e~~~~~~~~~~~~~r~~~~l~~ee~~~~~~~~~~t~~~gG~liP~~~~~~Ii~~l~~~s~i~~~~~v~~ 124 (395) T protein:vir:95 45 LSNDLQEEITAEINNRVVDNGILAKRSQDPLTSEERKFFNDINYDVGYTDEKILPETVVERVFDDLQKDHPLLSKINFQN 124 (395) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhcCccccchHHHHHHHHHhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEe Confidence 10 011111111111110 1111 11223344567788999999999999999999999999999999 Q ss_pred cCCCceEEEEEeCCcceeeecccccc-cccccceeeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011614. 66 MEGTEKKFTFWADKPGAYWVGEGQKI-ETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDE 144 (324) Q Consensus 66 ~~~~~~~ip~~~~~~~a~~v~Eg~~~-~~~~~~~~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~ 144 (324) +++ ...+|+.++.+.+.|++|+++. ++++++|+++++.++|++++++||+||++|+.+++++||.+.|++++++++|. 
T Consensus 125 ~~~-~~~i~~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~~~~~iS~ell~ds~~~ie~~i~~~la~~ia~~~~~ 203 (395) T protein:vir:95 125 AGI-KTRVIKADPAGQAVWGKVFGEIKGQLDAAFREENFTQYKLTCFVVLPDDLSTFGPAWIERFVRTQIQEAISVALES 203 (395) T ss_pred cCC-ceEEEEecCCcceEEeecccccCccccccceeeeeceeeEEEeecccHHHHhcchhHHHHHHHHHHHHHHHHHHhh Confidence 976 4689999999999999887665 56899999999999999999999999999999999999999999999999999 Q ss_pred HHHhccCcCc-CCcccccccccccc-----eeecccchhHHHHHHHHhhh--------------hccCCCEEEEcHHHHH Q lcl|NC_011614. 145 AGILNQGNNP-FGKSIAQSIEKTNK-----VIKGDFTQDNIIDLEALLED--------------DELEANAFISKTQNRS 204 (324) Q Consensus 145 a~l~g~g~~~-~~~~~~~~~~~~~~-----~~~~~~~~~~i~~~~~~l~~--------------~~~~~~~~v~~~~~~~ 204 (324) +||+|+|++. +|.|++........ ..++..+++++..+...+.. .+..+..|+|||.++. T Consensus 204 a~i~G~G~~~~qP~Gil~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~mn~~t~~ 283 (395) T protein:vir:95 204 AIINGGGAAKTQPVGLMKDVNTNSGAVTDKASSGTLTFADADTTILELNDVLKNLSVDEKGKELKIDGKVALVVNPRDSW 283 (395) T ss_pred heeeccCCCCcCceeeeecccccccccccccccchhhhhhhHhhHHHHHHHHHhhccccccchhhhcCceEEEEcchhhh Confidence 9999999874 67887765433222 22333455554444333322 3445678999999875 Q ss_pred HHHHhhccCCceeecc--CCCceecccceEeecCccCCCceEEEeecccEEEEEecceEEEEeecccccccccccccchh Q lcl|NC_011614. 205 LLRKIVDPETKERIYD--RNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVN 282 (324) Q Consensus 205 ~L~~l~d~~g~~~~~~--~~~~~l~G~pv~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~ 282 (324) +.+|+|+|.+ +.+.+++|+|+.+..+..++++.++||||++|+++.+++++++++++. T Consensus 284 ------~~~g~~~~~~~~G~~~~~lg~g~~v~~~~~~p~~~i~fgdfs~y~i~~r~~~~i~~~~~~-------------- 343 (395) T protein:vir:95 284 ------DVQARYTYLTANGGFVTVLPYNVTIITSEFVPEGKLVAFVTDRYNAVRGGGLTVKKFDQT-------------- 343 (395) T ss_pred ------hcCCcceeccCCCcceeccCCcceEEEcCCCCCCcEEEEecccEEEEEecceEEEeccch-------------- Confidence 4467777754 344567766643333456677789999999999999999999998764 Q ss_pred hhhcCcEEEEEEEEeccEEecccceEEEEeecc----CCCCccccC Q lcl|NC_011614. 
283 LFEQDMVALRATMHVALHIADDKAFAKLVPADA----KPSSVPGEV 324 (324) Q Consensus 283 ~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~~----~~~~~~~~~ 324 (324) +|.+|++.||+..|+|+++.+++||++|+.... .+..+||.. T Consensus 344 ~~~~d~~~f~~~~r~dg~~~~~~A~~~l~i~~~~~~~~~~~~~~~~ 389 (395) T protein:vir:95 344 LALEDAVLFTAKTFAYGQPDDNKASAVYDLKVASAPRRQTSAGGTT 389 (395) T ss_pred hhhCCcEEEEEEEEECCEEeccccEEEEEeeccCCCCCCCCCCCCC Confidence 489999999999999999999999999887633 333446666 No 95 >protein:vir:80128 Length: 466 # NCBI annotation: Phage capsid protein # Family: family:all:635 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425603;genbank:gi:155042936;genbank:GeneID:5469556 Probab=100.00 E-value=7.1e-49 Score=284.57 Aligned_cols=306 Identities=13% Similarity=0.154 Sum_probs=225.4 Q ss_pred CchhhHHH---HHHHHHh-------------hccch--hhhhc--cccccccCCCcceechhhhHHHHHHHHhhcchhhh Q lcl|NC_011614. 1 MEQTQKLK---LNLQHFA-------------SNNVK--PQVFN--PDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQL 60 (324) Q Consensus 1 m~~~~~~~---~~~~~~~-------------~~~~~--~~~~~--a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l 60 (324) .++.+..+ ..++.+. +.... ..+.+ .....+.++++.+||+++++.|++.+.+.++++++ T Consensus 103 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~vP~~~~~~i~~~l~~~~~l~~~ 182 (466) T protein:vir:80 103 GARTQQFVGGETRMKGFFRNMPYEQRAALIARSEVKEFLAQVRTLAQQKRAVSGAELTIPDVMLELLRDNMHRYSKLISK 182 (466) T ss_pred hhhhhHHhhHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHhhhhhhhccccccccHHHHHHHHHhhhhhhhhhhh Confidence 10000000 0000000 00000 00111 11112233445689999999999999999999999 Q ss_pred ceeeecCCCceEEEEEeCCcceeeecccccccccccceeeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHH Q lcl|NC_011614. 61 GKYEPMEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYK 140 (324) Q Consensus 61 ~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~ 140 (324) +++.++++ ..++|+....+.+.|++|++.+++++++|+++++.+++++++++||+|+++||.++++++|...|+++++. 
T Consensus 183 ~~v~~~~g-~~~~~~~~~~~~a~wv~E~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~ 261 (466) T protein:vir:80 183 VRLRPLKG-TARQNIAGAIPEGVWTEAVANLNELSLSFSQIEVDGYKVGGFIPIPNSTLEDSDLNLADEILDAIGQAIGF 261 (466) T ss_pred eeeeecCc-eeEeeeecCCcceeecccccccccccccccceeecceeeeeehhhhHHHHhcchHHHHHHHHHHHHHHHHH Confidence 99999876 46889988888999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHhccCcCcCCccccccccccccee--------ecccchh-----------------HHHHHHHHhhhhccCC-C Q lcl|NC_011614. 141 KFDEAGILNQGNNPFGKSIAQSIEKTNKVI--------KGDFTQD-----------------NIIDLEALLEDDELEA-N 194 (324) Q Consensus 141 ~~d~a~l~g~g~~~~~~~~~~~~~~~~~~~--------~~~~~~~-----------------~i~~~~~~l~~~~~~~-~ 194 (324) ++|.+||+|+|++. |.|+++......... ...++.. ++...+..+...+..+ . T Consensus 262 ~~~~ail~G~G~~~-P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 340 (466) T protein:vir:80 262 ALDKAILYGTGTKM-PVGIVTRLAQTTQPPNWGTKAPAWTNLSTTNLLKIDPTGKSAEEFFSELVLKLSKARANYSNGMK 340 (466) T ss_pred HHhhheeeccCCCC-cceeeecccccccccccccccccccccchhhhhhhhhhccchhhHHHHHHHHHHhhhccccCCce Confidence 99999999999864 566665432211110 0111211 2222233334444444 4 Q ss_pred EEEEcHHHHHHHHHhh---ccCCceeeccCCCceecccceEeecCccCCCceEEEeecccEEEEEecceEEEEeeccccc Q lcl|NC_011614. 195 AFISKTQNRSLLRKIV---DPETKERIYDRNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLS 271 (324) Q Consensus 195 ~~v~~~~~~~~L~~l~---d~~g~~~~~~~~~~~l~G~pv~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~i~~~~~~~~~ 271 (324) .|+||+.++..|..++ +.+|.+.+.......++|+||+.+++ ++++.+++|+|+.|++++++++++.++++. 
T Consensus 341 ~w~~~~~~~~~l~~~~~~~~~~g~~~~~~~~~~~i~G~pvv~s~~--~~~~~~~~g~~~~y~i~~r~~~~i~~~~~~--- 415 (466) T protein:vir:80 341 FWAMSSNTHAVLMSKAITFNSAGALVASLNNTMPIVGGDIVILDF--IPDNDIIGGYGSLYLLAERADIKLAQSEHV--- 415 (466) T ss_pred eEEecchhHHHhhcccccccCCccccccCCCcccccccceeecCc--cCccceeeeccccEEEEeecceEEEechhh--- Confidence 6999999999998887 55666666555555699999998764 456679999999999999999999988663 Q ss_pred ccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeeccCCCCccccC Q lcl|NC_011614. 272 TVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADAKPSSVPGEV 324 (324) Q Consensus 272 ~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~~~~~ 324 (324) .|.+|++.||+..|+|+++.+|+||++++.+..++.+++..+ T Consensus 416 -----------~f~~d~~~~r~~~r~dg~~~~~~afv~~~~~~~~~~~~~~~~ 457 (466) T protein:vir:80 416 -----------RFIEDQTVFKGTARYDGKPVFGEGFVAVNIANANPTTSITFA 457 (466) T ss_pred -----------hhhcCcEEEEEEEEEccEEeccCceEEEEecCCCcccceeee Confidence 489999999999999999999999999999888777774433 No 96 >protein:vir:78350 Length: 383 # NCBI annotation: Cps # Family: family:all:635 # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468644;genbank:gi:157325222;genbank:GeneID:5601696 Probab=100.00 E-value=2.9e-49 Score=286.66 Aligned_cols=300 Identities=15% Similarity=0.092 Sum_probs=226.3 Q ss_pred CchhhHHHHHH----HHHhh-----ccchh---hhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCC Q lcl|NC_011614. 1 MEQTQKLKLNL----QHFAS-----NNVKP---QVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEG 68 (324) Q Consensus 1 m~~~~~~~~~~----~~~~~-----~~~~~---~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~ 68 (324) .++....+.+. +.+.. ..+.. 
+.+++....++++||++||++++++|++.+.+.++++++|+++++++ T Consensus 45 ~~~~~~~~~~~~~~~~~~~~~~~g~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~l~~~~~v~~~~~ 124 (383) T protein:vir:78 45 ADIMEQAKKEARQEADAYISASRTDKNITNEEIKFFNDINKEVGYKEETLLPQTVVDEIFEDLTTEHPFLASIGMRTTGL 124 (383) T ss_pred HHHHHHHHHHHHHHHHHHHHhcCChhhhhHHHHHHHHHHhccCCCCCccccCHHHHHHHHHHHHhhccceeeeeeEecCC Confidence 01111111111 11110 01111 12234455678889999999999999999999999999999999876 Q ss_pred CceEEEEEeCCcceeeeccccccc-ccccceeeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011614. 69 TEKKFTFWADKPGAYWVGEGQKIE-TSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGI 147 (324) Q Consensus 69 ~~~~ip~~~~~~~a~~v~Eg~~~~-~~~~~~~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l 147 (324) . .+||+.+..+.|.|++|+++++ .++++|+++++.++|++++++||+||++||.+++++||.+.|++++++++|.+|+ T Consensus 125 ~-~~i~~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~~~i~is~ell~Ds~~~ie~~i~~~l~~~~a~~~~~a~i 203 (383) T protein:vir:78 125 R-TKFLKSETSGVAVWGKIFGEIKGQLDATFSDEESIQNKLTAFVVVPKDLEKFGPAWVKRFVVTQIEEAFAVALESAYI 203 (383) T ss_pred c-eEEEEEcCCcceEEeecccccccccCcceeeEeecceeeEeeccchHHHhhccHHHHHHHHHHHHHHHHHHHHhhheE Confidence 5 6899999999999999987765 5789999999999999999999999999999999999999999999999999999 Q ss_pred hccCcCcCCcccccccccccc---------eeecccchhHHHHHHHHhhhhc--------------cCCCEEEEcHHHHH Q lcl|NC_011614. 148 LNQGNNPFGKSIAQSIEKTNK---------VIKGDFTQDNIIDLEALLEDDE--------------LEANAFISKTQNRS 204 (324) Q Consensus 148 ~g~g~~~~~~~~~~~~~~~~~---------~~~~~~~~~~i~~~~~~l~~~~--------------~~~~~~v~~~~~~~ 204 (324) +|+|++ +|.|++........ ...+..+.+++..+...+...+ ..+..|+|||.++. 
T Consensus 204 ~G~G~~-qP~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~ 282 (383) T protein:vir:78 204 VGDGND-KPIGLNRKVGKGSTVVDGVYAEKAATGTLTFANPKTTVNELTDVYKYHSVKENGHPLNVAGKVTLLVNPTDAW 282 (383) T ss_pred eccCCC-CceeeeeccCCcccccccccccccccchhhhhhhHHHHHHHHHHHhccchhcccchhhhcCceEEEEcCcchh Confidence 999965 56677654322211 1233445666655555554211 12335777776644 Q ss_pred HHH---HhhccCCceeeccCCCceecccceEeecCccCCCceEEEeecccEEEEEecceEEEEeecccccccccccccch Q lcl|NC_011614. 205 LLR---KIVDPETKERIYDRNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPV 281 (324) Q Consensus 205 ~L~---~l~d~~g~~~~~~~~~~~l~G~pv~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~ 281 (324) .+. ...+.+|+| .+++|+|+.+..+..++++.+++|||++|.++++++++++.+++. T Consensus 283 ~~~~~~~~~~~~G~~-------~t~l~~~~~iv~s~~~p~~~iifgdfs~Y~i~~r~~~~i~~~~~~------------- 342 (383) T protein:vir:78 283 DVKKQYTSLNANGVY-------VTALPFNLNIIESLFVPEKKAISYVAERYDALIGGPLDIGTYDQT------------- 342 (383) T ss_pred hhccchhccCCCCce-------eeecCCCceEEecCCCCcccEEEeeccceEEEecccceEEecchh------------- Confidence 332 122334433 367888876555667778889999999999999999999998764 Q ss_pred hhhhcCcEEEEEEEEeccEEecccceEEEEeeccCCCCcccc Q lcl|NC_011614. 
282 NLFEQDMVALRATMHVALHIADDKAFAKLVPADAKPSSVPGE 323 (324) Q Consensus 282 ~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~~~~ 323 (324) +|.+|++.||+..|+|+++++++||++++.+...+.+||+- T Consensus 343 -~f~~d~~~f~~~~r~dG~~~~~~A~~vl~~~~~~~~~~~~~ 383 (383) T protein:vir:78 343 -LAIEDLNLYAAKQFAYGKAKDDKAAAVWTLNINPAEQTPEG 383 (383) T ss_pred -hhhcCceEEEEEEEEcCEEecCCeEEEEEEEecCCCCCCCC Confidence 59999999999999999999999999999999999999988 No 97 >protein:vir:8420 Length: 477 # NCBI annotation: gp15 # Family: family:all:21 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818316;genbank:gi:29566752;genbank:GeneID:1260033 Probab=100.00 E-value=2e-48 Score=282.08 Aligned_cols=306 Identities=11% Similarity=0.025 Sum_probs=225.6 Q ss_pred Cchh-------------------hHHHHHHHH---Hhh-----ccchhhhhccccccccCCCcceechhh-hHHHHHHHH Q lcl|NC_011614. 1 MEQT-------------------QKLKLNLQH---FAS-----NNVKPQVFNPDNVMMHEKKDGTLLNDF-TTPILQEVM 52 (324) Q Consensus 1 m~~~-------------------~~~~~~~~~---~~~-----~~~~~~~~~a~~~~~~~~~g~lip~~~-~~~i~~~~~ 52 (324) +++. ...+...+. ... ......+.+....++++.||++||+++ .++|++.++ T Consensus 103 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~lv~~~~~~~~ii~~l~ 182 (477) T protein:vir:84 103 YEKGNGQSYFRDLAMQTVGMADEPAKERLRRHMVDVESDKEIRKIAKVGEEYRDLDRNGGTGGYAVPPLWMMNRFIELAR 182 (477) T ss_pred hhhhHHHHHHHHHHHHHhhhhhhHHHHHHHHHHhhhhhhhhHHHHHHhhhhhccccccCCCcceeeccchhHHHHHHHhh Confidence 0000 000000000 000 000111122233355666788998885 678999999 Q ss_pred hhcchhhhceeeecCC--CceEEEEEeCCc-ceeeeccccc-----ccccccceeeEEeeeeeEEEeehhHHHHHhcChh Q lcl|NC_011614. 53 ENSKIMQLGKYEPMEG--TEKKFTFWADKP-GAYWVGEGQK-----IETSKATWVNATMRAFKLGVILPVTKEFLNYTYS 124 (324) Q Consensus 53 ~~s~l~~l~~~~~~~~--~~~~ip~~~~~~-~a~~v~Eg~~-----~~~~~~~~~~v~~~~~k~~~~v~iS~ell~~s~~ 124 (324) +.++++++++++++++ +.+.||+.++++ .+.|++||+. 
+|+++++|++++++++|++++++||+|+++||.+ T Consensus 183 ~~~~i~~~~~~~~~~~~~~~~~ip~~~~~~~~a~~~~Eg~~~~~~~~~~s~~~f~~i~~~~~k~~~~~~iS~ell~ds~~ 262 (477) T protein:vir:84 183 AGRTYANLCPTEPLPGGTSSINIPKILTGTSTAIQAADNAALTAPSAHEVDLTDGFVQANVKTIAGQQGIAIQLLDQAAV 262 (477) T ss_pred hcchHHHhhceeeecCCcceeEEEEEecCcceeeeeccCcccccccccccccceeeEEEeeeeEEeeeHHHHHHHhccch Confidence 9999999999988754 457899876654 5779999864 5788899999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCcCcCCcccccccccccceeec--------ccchhHHHHHHHHhhhhccCC-CE Q lcl|NC_011614. 125 QFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKG--------DFTQDNIIDLEALLEDDELEA-NA 195 (324) Q Consensus 125 ~~~~~v~~~l~~ai~~~~d~a~l~g~g~~~~~~~~~~~~~~~~~~~~~--------~~~~~~i~~~~~~l~~~~~~~-~~ 195 (324) +++++|.+.|++++++++|.++|+|+|++..|.|+++.........+. ...++++++++..+...+..+ +. T Consensus 263 ~l~~~i~~~l~~~~~~~~d~~~l~G~Gt~~~p~Gi~~~~~~~~~~~~~~~~t~~~~~~~~~~i~~~~~~~~~~~~~~~~~ 342 (477) T protein:vir:84 263 SVDEFVFRDLAADYANKLNVQVISGTGSNNQVVGVRATAGITQVTATSAGSALEKHQIIYQKIADAIQRVHTSRFLEPEV 342 (477) T ss_pred hHHHHHHHHHHHHHHHHHHHHHhccCCCCCccceeeeccccccccccccccchhhHHHHHHHHHHHHhhccccccCCccE Confidence 999999999999999999999999999887888887654432222111 124556777777777776654 57 Q ss_pred EEEcHHHHHHHHHhhccCCceeeccC-----------------CCceecccceEeecCccC------CCceEEEeecccE Q lcl|NC_011614. 196 FISKTQNRSLLRKIVDPETKERIYDR-----------------NSDSLDGLPVVNLKSSNL------KRGELITGDFDKL 252 (324) Q Consensus 196 ~v~~~~~~~~L~~l~d~~g~~~~~~~-----------------~~~~l~G~pv~~~~~~~~------~~~~i~~gd~~~~ 252 (324) |+|||.++..|++++|++|+|+|.+. ..++|+|+||++++..+. 
+...+++|||+.+ T Consensus 343 ~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~~~~~~~~~l~G~pVv~s~~~p~~~~~~~d~~~i~~gd~~~~ 422 (477) T protein:vir:84 343 IVMHPRRWASFHAIFAGDDRPLIVPSGPGFNNLGVLTEVASQRVVGQMHGLPVVTDPTLPTTLGTGTDQDVIHVLRASDL 422 (477) T ss_pred EEEcHHHHHHHHHhhccCCCeeeecCcccccccccccccccccccchhcccceEecCcccccccccCCcceEEEEEeceE Confidence 99999999999999999999998542 245899999999876543 2346899999999 Q ss_pred EEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEec-ccceEEEEeeccCCCCccc Q lcl|NC_011614. 253 IYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIAD-DKAFAKLVPADAKPSSVPG 322 (324) Q Consensus 253 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~-~~a~~~l~~~~~~~~~~~~ 322 (324) +++. .+++++++++. ++.++++.||+..++++..++ |+||++++++ +..++|-+ T Consensus 423 ~i~~-~~~~~~~~~~~--------------~~~~~~~~~~v~~~~~~~~~r~~~afv~~t~~-~~~~~~~~ 477 (477) T protein:vir:84 423 ALFE-SSVRMRALQET--------------RAENLSVLLQVYGYLAFTAARFPQSVVEIGGT-ALTAPTFA 477 (477) T ss_pred EEEe-eceeEEecccc--------------ccccceeeeeehhhhhhhhhccccceEEeecc-cccccccC Confidence 8887 46777776543 356788889988888876655 9999999985 55566656 No 98 >protein:vir:962 Length: 397 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076616;genbank:gi:13095724;genbank:GeneID:920264 Probab=100.00 E-value=3e-47 Score=275.65 Aligned_cols=277 Identities=13% Similarity=0.063 Sum_probs=220.9 Q ss_pred CchhhHHHHHHHHHhhccchhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCceEEEEEeC-C Q lcl|NC_011614. 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWAD-K 79 (324) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~-~ 79 (324) ++..+..+.....+ ......+.....++.+++.++|+++.+.|++ +.+...+++++++++++++...+|+... . 
T Consensus 110 ~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~-~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~ 184 (397) T protein:vir:96 110 EEELAEKRSAINAF----VKSKGAEKRDGFTSVEGGALIPQELLQPQLE-PKDIVDLSKYVRSVPVNSASGKFPVISKSG 184 (397) T ss_pred hHHHHHHHHHHHHH----HHhhhhhhhhcccccccccchhHHHHHHHHH-hhhhhhHHHhhhhccccccceeEEEEeccC Confidence 00000111111111 1112222333456677889999999999987 5788899999999999998888998764 4 Q ss_pred cceeeecccccccc-cccceeeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhccCcCcCCcc Q lcl|NC_011614. 80 PGAYWVGEGQKIET-SKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKS 158 (324) Q Consensus 80 ~~a~~v~Eg~~~~~-~~~~~~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~~~~~~ 158 (324) ..+.|++|++..++ ++++|++++++++++++++++|+|+++||.++++++|.+.|+++++.++|.++++|+|.+. T Consensus 185 ~~~~~~~E~~~~~~~~~~~~~~i~~~~~~~~~~~~~s~ell~ds~~~l~~~i~~~l~~~~~~~~~~~i~~g~g~~~---- 260 (397) T protein:vir:96 185 SKMATVQQLEKNPQLANPKMVEIDYSVATRRGYIPISQEMIDDASYDVTGLIADEIQDQSLNTKNADIAAVLKTAT---- 260 (397) T ss_pred CccccccccccccccccccccceeecHhHhhcchhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc---- Confidence 67889999999986 6899999999999999999999999999999999999999999999999999999987543 Q ss_pred cccccccccceeecccchhHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCceeecc----CCCceecccceEee Q lcl|NC_011614. 
159 IAQSIEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD----RNSDSLDGLPVVNL 234 (324) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~g~~~~~~----~~~~~l~G~pv~~~ 234 (324) ..+..+++++.+++......++ +++|+|||++|..|++++|++|+|+|.+ +.+++|+|+||+++ T Consensus 261 -----------~~~~~~~d~~~~~~~~~~~~~~-~a~~v~n~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~l~G~pv~~~ 328 (397) T protein:vir:96 261 -----------AKSVVGVDGLKDLINKEIKKVY-DVKLFISASMYSELDKLKDKNGRYLLQDSITAASGKQLLGKEVVVL 328 (397) T ss_pred -----------cccccchHHHHHHHHHhhhhhc-CcEEEEcHHHHHHHHHhhccCCCeEeccCccCCCcccccccceEEe Confidence 1245689999999887655544 6899999999999999999999999853 34568999999886 Q ss_pred cC----ccCCCceEEEeeccc-EEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEE Q lcl|NC_011614. 235 KS----SNLKRGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAK 309 (324) Q Consensus 235 ~~----~~~~~~~i~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~ 309 (324) ++ ...++..+++|||+. +.++.++++++..+++. .+.+.+|++.|+|+.+.+|+||++ T Consensus 329 ~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~~~~~~~~-----------------~~~~~~~~~~r~d~~~~~~~a~~~ 391 (397) T protein:vir:96 329 DDDVIGKSVGNVVGFIGDAKAFASFFDRKQVSVSWVDNN-----------------IYGQLLAGIIRYDVKATDKKAGFY 391 (397) T ss_pred cccccCCCCCceEEEEeehhcceEeEeecceEEEEeccc-----------------ccceeEEEEEEEccEEecccceEE Confidence 53 334566799999997 67899999999887643 234579999999999999999999 Q ss_pred EEeecc Q lcl|NC_011614. 
310 LVPADA 315 (324) Q Consensus 310 l~~~~~ 315 (324) ++.+++ T Consensus 392 ~~~~~a 397 (397) T protein:vir:96 392 VTFTIG 397 (397) T ss_pred EEeecC Confidence 998877 No 99 >protein:vir:4197 Length: 314 # NCBI annotation: putative structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071822;genbank:gi:11863105;genbank:GeneID:1257607 Probab=100.00 E-value=8.3e-40 Score=234.83 Aligned_cols=287 Identities=14% Similarity=0.093 Sum_probs=220.1 Q ss_pred CchhhHHHHHHHHHhhccchhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeee-cCCCceEEEEEeCC Q lcl|NC_011614. 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEP-MEGTEKKFTFWADK 79 (324) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~-~~~~~~~ip~~~~~ 79 (324) |+-=| +. .+..++.++ ...+||.|+|+++ .++++.+++.+++++++++++ +.+....||....+ T Consensus 1 ~~~~~------~~-------~~~~k~it~-~d~~gG~L~P~~~-~~~i~~l~e~s~i~~~a~vi~t~~s~~~~i~~i~~g 65 (314) T protein:vir:41 1 MDFLN------KP-------FQITPKIDV-PDLGKGILAVQRF-GEFVREVRENSAIIKDARVLNALKSYEVDISRISLG 65 (314) T ss_pred Cchhh------hH-------HHhhccccc-ccCCCceeChHHH-HHHHHHHHhccchhhheeeecccCccceeecccccC Confidence 22111 11 112223333 3345777888776 579999999999999999986 57777889887543 Q ss_pred ----cceeeecccccccccccceeeEEeeeeeEEEeehhHHHHHhcChh--HHHHHHHHHHHHHHHHHHHHHHHhccCcC Q lcl|NC_011614. 80 ----PGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYS--QFFEEMKPMIAEAFYKKFDEAGILNQGNN 153 (324) Q Consensus 80 ----~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~v~iS~ell~~s~~--~~~~~v~~~l~~ai~~~~d~a~l~g~g~~ 153 (324) +.+.|.+|..+.++++++|+++++.++|+...++||+|+++|+.. +|+++|..++++++++.++.++++|+|+. 
T Consensus 66 ~~~~~~~~~~~~~~~~~~~~~tf~~~~l~~~kl~~~v~is~e~L~D~a~~~~le~~i~~~~Ae~~g~~~~~~~~nGdg~~ 145 (314) T protein:vir:41 66 VELEPGRNTSGTKVAPTADEVTVSTNTLEMKELVTKVVLEDEALEDNIEQSAFEQTITSLLASGVTYDLECFFLHADSSL 145 (314) T ss_pred cccccccccccCCccCCcccccccceeeeeEEEEEeecccHHHHHhhhchhhHHHHHHHHHHHHHHHHHHHHhhccccCC Confidence 234567788888999999999999999999999999999999964 99999999999999999999999999863 Q ss_pred c-------CCcccccccccc-cc--eeecccchhHHHHHHHHhhhhccC---CCEEEEcHHHHHHHHHhhccCCceee-- Q lcl|NC_011614. 154 P-------FGKSIAQSIEKT-NK--VIKGDFTQDNIIDLEALLEDDELE---ANAFISKTQNRSLLRKIVDPETKERI-- 218 (324) Q Consensus 154 ~-------~~~~~~~~~~~~-~~--~~~~~~~~~~i~~~~~~l~~~~~~---~~~~v~~~~~~~~L~~l~d~~g~~~~-- 218 (324) . .+.|++..+... .. ..+...+.+.+.+++..|++.|++ +.+|+||+.++.+++++++.++++++ T Consensus 146 ~s~~~~~~~p~G~l~~a~~~~~~~~~~~~~~~~~~~~~l~~sl~~~yr~~~~~~~~~m~~~t~~~~r~~l~~~~~~l~~~ 225 (314) T protein:vir:41 146 TTGRELYRINDGWMKLAGNQYTDAEPEDENWPLNLFDGMMDELDTRYLQLKPRMKFYVSNEIYNGYRKQLLVRETGLGDS 225 (314) T ss_pred cCcccchhcchhhhhhcccceeecCccccccHHHHHHHHHHhcCchhhcCCCceEEEecHHHHHHHHHHHhccCCcccch Confidence 2 345555432211 11 122345677788999999999875 45799999999999999998887765 Q ss_pred --ccCCCceecccceEeecCc---cCCCceEEEeecccEEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEE Q lcl|NC_011614. 219 --YDRNSDSLDGLPVVNLKSS---NLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRA 293 (324) Q Consensus 219 --~~~~~~~l~G~pv~~~~~~---~~~~~~i~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~ 293 (324) ..+.+.+++|+||+.++.. ..++..+++|||++++++.+..++++..+.+ .++++.|.+ T Consensus 226 ~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~i~fgd~~nlv~~~~~~ir~~~~~~a----------------~~~~~~~~~ 289 (314) T protein:vir:41 226 ALIGATGLQYDGIPIQYVPALDALGDDKARALLTVPTNLVYGFWRNIRIEPKRDA----------------AMRRTEYIA 289 (314) T ss_pred hhhCCCCceecceeeEecccccccCCCCceEEEechhheEEEeeceeEEeecccC----------------cCCeEEEEE Confidence 3456678999999987654 4578999999999999999999888876543 578999999 Q ss_pred EEEeccEEecccceEEEEeeccCCC Q lcl|NC_011614. 
294 TMHVALHIADDKAFAKLVPADAKPS 318 (324) Q Consensus 294 ~~r~d~~v~~~~a~~~l~~~~~~~~ 318 (324) ..|+|+.+.+++|.++.....+... T Consensus 290 ~~r~d~~~~~~~aa~~~~~~~~~~~ 314 (314) T protein:vir:41 290 SLRADCNYEDENAAVAAVIDMSSGG 314 (314) T ss_pred EEEeceEEEEcCcEEEEEeeccCCC Confidence 9999999998877776654333333 No 100 >protein:vir:4159 Length: 315 # NCBI annotation: structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046968;genbank:gi:9630538;genbank:GeneID:1261712 Probab=100.00 E-value=1.3e-38 Score=228.30 Aligned_cols=288 Identities=12% Similarity=0.063 Sum_probs=213.4 Q ss_pred CchhhHHHHHHHHHhhccchhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeee-cCCCceEEEEEeC- Q lcl|NC_011614. 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEP-MEGTEKKFTFWAD- 78 (324) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~-~~~~~~~ip~~~~- 78 (324) |..=+ +.+.....+...+.++ ...+||.++| +...++++.+.+.|++++++++++ +.+....++.... T Consensus 1 ~~~~~--------~~~~~~~~~~~k~~t~-~d~~Gg~l~P-~~~~~~i~~~~e~s~~l~~~~vi~~~~~~~~~i~~~g~~ 70 (315) T protein:vir:41 1 MLTIE--------DIRGGKPFEIVPKIDV-PDLGRGVLSV-DRFGEFVKAVRDSAVIIPEARIDNALKSYEKDISRLSLV 70 (315) T ss_pred Ccccc--------hhhcCChhhhhhhcCC-cCCCCceech-HHHHHHHHHHHhhhhhhhhceeeeccccccccccccccC Confidence 22111 1122223333344333 2234445555 455679999999999999999865 5554455554321 Q ss_pred ---CcceeeecccccccccccceeeEEeeeeeEEEeehhHHHHHhcCh--hHHHHHHHHHHHHHHHHHHHHHHHhccCcC Q lcl|NC_011614. 79 ---KPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTY--SQFFEEMKPMIAEAFYKKFDEAGILNQGNN 153 (324) Q Consensus 79 ---~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~v~iS~ell~~s~--~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~ 153 (324) .+...|.+|+++.++++++|+++++.++++...+.||+|+++|+. ++++++|..++++++++.++.++++|+|+. 
T Consensus 71 ~~~~~g~~~~~~~~~~~~~~~~f~~~~l~~~~l~~~~~it~elL~D~~~~~~~e~~l~~~~a~~~a~~~~~~~~nGdg~s 150 (315) T protein:vir:41 71 LDVGPGRDETGQKLAPPESTAEVKTNTLYMREMVTKVVIHEDAIEDNIEGKAFEQKIVTLLGEGISYVLEKYYLHGDTSS 150 (315) T ss_pred cccccccccccCcCCCCCCccccceeeeceeeeeeeccccHHHHHhhhccccHHHHHHHHHHHHHHHHHHHHhhccCCcC Confidence 123568889989999999999999999999999999999999996 599999999999999999999999999865 Q ss_pred cC-----Ccccccccccc-----cceeecccchhHHHHHHHHhhhhccC---CCEEEEcHHHHHHHHHhhccCCceeec- Q lcl|NC_011614. 154 PF-----GKSIAQSIEKT-----NKVIKGDFTQDNIIDLEALLEDDELE---ANAFISKTQNRSLLRKIVDPETKERIY- 219 (324) Q Consensus 154 ~~-----~~~~~~~~~~~-----~~~~~~~~~~~~i~~~~~~l~~~~~~---~~~~v~~~~~~~~L~~l~d~~g~~~~~- 219 (324) .. +.|++..+... ....+...+.+.+.+|+..|+..|++ +++|+||+.++.+++++++.+|++++. T Consensus 151 ~~p~~~~~~G~l~~a~~~~~~~~~~~~a~~~~~d~l~~l~~sl~~~yr~~~~~~~~imn~~t~~~~rklk~~~g~~lw~~ 230 (315) T protein:vir:41 151 SDPLLRMSDGWLKLASEKLTESDVDPEAEDWPMNLFDTMIESLPTPYRNNLPNMKFYVTWDIYRAYRDALKGRETGLGDQ 230 (315) T ss_pred cCccccccccceecccccccccccccccccccHHHHHHHHHhcChHHhhcCCceEEEEcHHHHHHHHHHhccCCCccccc Confidence 33 34554432211 11122345678899999999999874 468999999999999999999999874 Q ss_pred ---cCCCceecccceEeecCcc---CCCceEEEeecccEEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEE Q lcl|NC_011614. 220 ---DRNSDSLDGLPVVNLKSSN---LKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRA 293 (324) Q Consensus 220 ---~~~~~~l~G~pv~~~~~~~---~~~~~i~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~ 293 (324) .+.+.+|+|+||+.+++.+ .++..+++|||++++++.+.+++++.++++ .++.+.|.+ T Consensus 231 ~~~~g~~~tl~G~PV~~~~~m~~~~~~~~~ilf~d~~nl~~~~~~~i~i~~~~~a----------------~~~~~~~~~ 294 (315) T protein:vir:41 231 ALTGANSILYDGRPVQYVPALEALNDGKSRALFVVPTQLVYGFWRNIKVVPDYDA----------------EMRLTKYVA 294 (315) T ss_pred hhhcCCCceecccceEecccccccCCCCccEEEecccceEEEeccccEEEeeecC----------------CCCceEEEE Confidence 4567799999998876543 478889999999999999999999887653 356677888 Q ss_pred EEEeccEEecccceEEEEeec Q lcl|NC_011614. 
294 TMHVALHIADDKAFAKLVPAD 314 (324) Q Consensus 294 ~~r~d~~v~~~~a~~~l~~~~ 314 (324) ..|+|+.+.++++.++...+. T Consensus 295 ~~r~d~~~~~~~~~a~~~~~v 315 (315) T protein:vir:41 295 SLRTDNHYEDEEGAVSATITV 315 (315) T ss_pred EEEeceeEEeccceeEeeeeC Confidence 899999888777633332222 No 101 >protein:vir:3158 Length: 321 # NCBI annotation: capsid protein gpE # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665929;genbank:gi:22091115;genbank:GeneID:951342 Probab=100.00 E-value=1.2e-35 Score=211.99 Aligned_cols=299 Identities=11% Similarity=0.055 Sum_probs=219.9 Q ss_pred CchhhHHHHHHHHHhhccchhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCceEEEEEeCCc Q lcl|NC_011614. 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKP 80 (324) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~ 80 (324) |.++--.. .++.. .+.+.. .....++|++||+++.+++++.+.+.++++++++++++.+....+|.+..++ T Consensus 1 ~~~k~~~~-~l~~~-------~~~~~~-~~~~~~~g~~v~~~~~~~l~~~i~e~s~~l~~i~v~~v~~~~~~i~~~~~~~ 71 (321) T protein:vir:31 1 MASRTINN-DLSRI-------TEKNAL-TVDDLDAGGTLPDPLWDEFWTDMIEETPLLDAIRTETVGAKKTRIPTLNIGE 71 (321) T ss_pred CchHHHHH-HHHHH-------HHhccc-cccccCCcceeCHHHHHHHHHHHHHhhhhhhhceeeeccCcceeeeeeccCC Confidence 55432221 12111 112222 2344567889999999999999999999999999999999889999988777 Q ss_pred ceeeec-cc-ccccccccceeeEEeeeeeEEEeehhHHHHHhcCh--hHHHHHHHHHHHHHHHHHHHHHHHhccCcCcCC Q lcl|NC_011614. 81 GAYWVG-EG-QKIETSKATWVNATMRAFKLGVILPVTKEFLNYTY--SQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFG 156 (324) Q Consensus 81 ~a~~v~-Eg-~~~~~~~~~~~~v~~~~~k~~~~v~iS~ell~~s~--~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~~~~ 156 (324) .+.|++ |+ .....++++|+++++.++++...++||+|+++|+. 
++|+++|.+.+++++++.++.++|+|++.+..+ T Consensus 72 ~~~~~~~e~~~~~~~~~~~~~~~~~~~~k~~~~~~it~e~L~d~a~~~d~e~~i~~~ia~~~a~~~~~~~~nGd~~~~~~ 151 (321) T protein:vir:31 72 RHRRPQDEGEWNENESDVSTGTIDISTEKATVAWDLPREVVQENPEGEALADRILNLMTDAWSADVEDLAANGDEDAEDS 151 (321) T ss_pred cccccccccccccccccceeeeeeeeeEEEEeehhccHHHHHhhhcchhHHHHHHHHHHHHHHHHHHhheeeccccCCCc Confidence 777876 43 34556789999999999999999999999999985 599999999999999999999999999876543 Q ss_pred -----cccccccccc---cceeecccchhHHHHHHHHhhhhccC--CCEEEEcHHHHHHHHH-hhccCCceee----ccC Q lcl|NC_011614. 157 -----KSIAQSIEKT---NKVIKGDFTQDNIIDLEALLEDDELE--ANAFISKTQNRSLLRK-IVDPETKERI----YDR 221 (324) Q Consensus 157 -----~~~~~~~~~~---~~~~~~~~~~~~i~~~~~~l~~~~~~--~~~~v~~~~~~~~L~~-l~d~~g~~~~----~~~ 221 (324) .|++..+... ....++..+++.+.+++..|+..|+. ..+|+||+.++.+++. +++.++ +++ ..+ T Consensus 152 ~~~~n~G~l~~a~~~~~~~~~~~~~~~~d~l~~l~~~l~~~yr~~~~~v~im~~~~~~~~~~~l~~~~~-~~~~~~l~~~ 230 (321) T protein:vir:31 152 FENQNDGFITVAEGDVETIDAADDILDNDLVIRTIAGLDSKYRARMNPALIVSEDQLLSYHYTLTDRDT-PLGDNVIMGE 230 (321) T ss_pred ccccchhhhhhhccccccccccccccCHHHHHHHHHhccHhHhcCCCeEEEechHHHHHHHHHHhcCCC-ccccchhhcc Confidence 3444322111 12234457889999999999998875 3589999999887764 555554 343 344 Q ss_pred CCceecccceEeecCccCCCceEEEeecccEEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEE Q lcl|NC_011614. 222 NSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHI 301 (324) Q Consensus 222 ~~~~l~G~pv~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v 301 (324) .+.+|+|+||+.++ .+|+..+++++|+++.++.++++++++..+.... 
...++.+......++|+.| T Consensus 231 ~~~tl~G~pvv~~~--~mP~~~il~t~~~nl~~~~~~~~~~~~~~~~~~~-----------~~~~~~~~~~~~~~~~~~v 297 (321) T protein:vir:31 231 ADVNPFSFPIIGSG--LWPDDKAMFTDPQNLIYALYRDLEIDVLTESDKV-----------SERDLHARYFMRGDDDFAI 297 (321) T ss_pred ccccccceeEEEcC--CCCCCcEEEeccccEEEEEeeccEEEEeecCccc-----------cccceeeEeeeeeecceeE Confidence 56689999999866 4567789999999999999999999887664211 0123334444556799999 Q ss_pred ecccceEEEEeeccCC---CCccc Q lcl|NC_011614. 302 ADDKAFAKLVPADAKP---SSVPG 322 (324) Q Consensus 302 ~~~~a~~~l~~~~~~~---~~~~~ 322 (324) .+++|++.+++....- ..+|. T Consensus 298 e~~~a~a~~~~i~~~~~~~~~~~~ 321 (321) T protein:vir:31 298 ENTEAVVLAEGLGDPLEHLEEETS 321 (321) T ss_pred eccccEEEEecCCcchhcccCCCC Confidence 9999999999653210 01111 No 102 >protein:vir:9820 Length: 272 # NCBI annotation: putative major capsid/head protein # Family: family:all:522 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795582;genbank:gi:28876339;genbank:GeneID:1257858 Probab=100.00 E-value=7.8e-35 Score=207.58 Aligned_cols=261 Identities=16% Similarity=0.143 Sum_probs=215.9 Q ss_pred ccccccCCCcceechhhhHHHHHHHHhhcchhhhceeee----cCCCceEEEEEeCCcceeeecccccccccccceeeEE Q lcl|NC_011614. 27 DNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEP----MEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNAT 102 (324) Q Consensus 27 ~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~----~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~v~ 102 (324) +..++|..+..++|+.++..+.+.+.+.+.+.+++.+-. .+|..++||++...+++.|++||+.++.+++++++++ T Consensus 1 MA~~~T~~~~~~iPev~s~~v~~~~~~~~~~~~~~~~~~~~~g~~G~tv~iP~~~~~~~a~~v~eg~~i~~~~~~~~~~~ 80 (272) T protein:vir:98 1 MAVGTTKMAQMLDPEVLADMIDAEVGKAIRFAPLAEVDTTLEGQPGTTLTVPKWDYIGDAEDVAEGEAIPMTQLGFKKTT 80 (272) T ss_pred CCCccccchheechHHHHHHHHHHHHHHhhhhccccccccccCCCCCEEEEEEecCCCCcccccCCCcccccccccceEE Confidence 222345566788999999999999999999888876522 3466789999988889999999999999999999999 Q ss_pred eeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhccCcCcCCcccccccccccceeecccchhHHHHH Q lcl|NC_011614. 
103 MRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDL 182 (324) Q Consensus 103 ~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~ 182 (324) +.+++++..+++|+++..++..++.+.+.+++++++++++|+.++....+ .....++..+++.+.++ T Consensus 81 ~~~~~~~~~~~itd~~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~-------------a~~~~~~~~t~d~i~da 147 (272) T protein:vir:98 81 MTIKKAGKGVEITDEAILSGYGDPVGQAAKQIVEAIDHKVDADVLDALSK-------------STQTVEATATVDGVSKA 147 (272) T ss_pred EEeeeeeeeeeecHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHhcc-------------cccccccccCHHHHHHH Confidence 99999999999999999999999999999999999999999999863211 11223345689999999 Q ss_pred HHHhhhhccCCCEEEEcHHHHHHHHHhhccC-------CceeeccCCCceecccceEeecCccCCCceEEEeecccEEEE Q lcl|NC_011614. 183 EALLEDDELEANAFISKTQNRSLLRKIVDPE-------TKERIYDRNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYG 255 (324) Q Consensus 183 ~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~-------g~~~~~~~~~~~l~G~pv~~~~~~~~~~~~i~~gd~~~~~~~ 255 (324) ..++.+++.....|+|||.++..|++.+..+ |...+..+..++++|+||++++. ++++.+++.+.+.+.++ T Consensus 148 ~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~~g~ig~i~G~~Vi~s~~--~p~~t~~~~~~~a~~~~ 225 (272) T protein:vir:98 148 LDIFNDEDDAETVIVMNPADASTLRLDAAKEWLGATEVGANRVVSGVYGEVLGVQIVRSRK--CPKGTAYMVRKGALRIM 225 (272) T ss_pred HHHHhccCCCccEEEEcHHHHHHHHHhccccccccccccccccccccchhhcCeeEEEcCC--CCcceEEEEcCCeEEEE Confidence 9999999888899999999999998653221 22334456667999999999765 56778888888999999 Q ss_pred EecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeeccCCC Q lcl|NC_011614. 256 IPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADAKPS 318 (324) Q Consensus 256 ~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~~~~~ 318 (324) .+++++++.+++. 
.++...+++..||++++.+|+++++++.++++.- T Consensus 226 ~~~~~~ve~~r~~----------------~~~~~~i~~~~~~~~~v~~~~~vv~~t~~~a~~~ 272 (272) T protein:vir:98 226 LKRNTMVETDRDI----------------TKAINQIVANKHYGVYLYKAEKAVKITLKDAAKK 272 (272) T ss_pred ecCCceeeecccc----------------ccceeEEEEEEEEEEEEEcCCceEEEEecccccC Confidence 9999999887764 2456789999999999999999999999877776 No 103 >protein:vir:3033 Length: 272 # NCBI annotation: major capsid protein # Family: family:all:522 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438146;genbank:gi:16271809;genbank:GeneID:929235 Probab=100.00 E-value=7.8e-35 Score=207.58 Aligned_cols=261 Identities=16% Similarity=0.143 Sum_probs=215.9 Q ss_pred ccccccCCCcceechhhhHHHHHHHHhhcchhhhceeee----cCCCceEEEEEeCCcceeeecccccccccccceeeEE Q lcl|NC_011614. 27 DNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEP----MEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNAT 102 (324) Q Consensus 27 ~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~----~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~v~ 102 (324) +..++|..+..++|+.++..+.+.+.+.+.+.+++.+-. .+|..++||++...+++.|++||+.++.+++++++++ T Consensus 1 MA~~~T~~~~~~iPev~s~~v~~~~~~~~~~~~~~~~~~~~~g~~G~tv~iP~~~~~~~a~~v~eg~~i~~~~~~~~~~~ 80 (272) T protein:vir:30 1 MAVGTTKMAQMLDPEVLADMIDAEVGKAIRFAPLAEVDTTLEGQPGTTLTVPKWDYIGDAEDVAEGEAIPMTQLGFKKTT 80 (272) T ss_pred CCCccccchheechHHHHHHHHHHHHHHhhhhccccccccccCCCCCEEEEEEecCCCCcccccCCCcccccccccceEE Confidence 222345566788999999999999999999888876522 3466789999988889999999999999999999999 Q ss_pred eeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhccCcCcCCcccccccccccceeecccchhHHHHH Q lcl|NC_011614. 
103 MRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDL 182 (324) Q Consensus 103 ~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~ 182 (324) +.+++++..+++|+++..++..++.+.+.+++++++++++|+.++....+ .....++..+++.+.++ T Consensus 81 ~~~~~~~~~~~itd~~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~-------------a~~~~~~~~t~d~i~da 147 (272) T protein:vir:30 81 MTIKKAGKGVEITDEAILSGYGDPVGQAAKQIVEAIDHKVDADVLDALSK-------------STQTVEATATVDGVSKA 147 (272) T ss_pred EEeeeeeeeeeecHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHhcc-------------cccccccccCHHHHHHH Confidence 99999999999999999999999999999999999999999999863211 11223345689999999 Q ss_pred HHHhhhhccCCCEEEEcHHHHHHHHHhhccC-------CceeeccCCCceecccceEeecCccCCCceEEEeecccEEEE Q lcl|NC_011614. 183 EALLEDDELEANAFISKTQNRSLLRKIVDPE-------TKERIYDRNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYG 255 (324) Q Consensus 183 ~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~-------g~~~~~~~~~~~l~G~pv~~~~~~~~~~~~i~~gd~~~~~~~ 255 (324) ..++.+++.....|+|||.++..|++.+..+ |...+..+..++++|+||++++. ++++.+++.+.+.+.++ T Consensus 148 ~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~~g~ig~i~G~~Vi~s~~--~p~~t~~~~~~~a~~~~ 225 (272) T protein:vir:30 148 LDIFNDEDDAETVIVMNPADASTLRLDAAKEWLGATEVGANRVVSGVYGEVLGVQIVRSRK--CPKGTAYMVRKGALRIM 225 (272) T ss_pred HHHHhccCCCccEEEEcHHHHHHHHHhccccccccccccccccccccchhhcCeeEEEcCC--CCcceEEEEcCCeEEEE Confidence 9999999888899999999999998653221 22334456667999999999765 56778888888999999 Q ss_pred EecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeeccCCC Q lcl|NC_011614. 256 IPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADAKPS 318 (324) Q Consensus 256 ~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~~~~~ 318 (324) .+++++++.+++. 
.++...+++..||++++.+|+++++++.++++.- T Consensus 226 ~~~~~~ve~~r~~----------------~~~~~~i~~~~~~~~~v~~~~~vv~~t~~~a~~~ 272 (272) T protein:vir:30 226 LKRNTMVETDRDI----------------TKAINQIVANKHYGVYLYKAEKAVKITLKDAAKK 272 (272) T ss_pred ecCCceeeecccc----------------ccceeEEEEEEEEEEEEEcCCceEEEEecccccC Confidence 9999999887764 2456789999999999999999999999877776 No 104 >protein:vir:97397 Length: 517 # NCBI annotation: major capsid protein # Family: family:all:11745 # MgeID: mge:1675 # MgeName: Q54 # Cross-refs: genbank:acc:YP_762590;genbank:gi:115304291;genbank:GeneID:5130600 Probab=100.00 E-value=1.1e-34 Score=206.80 Aligned_cols=294 Identities=8% Similarity=0.009 Sum_probs=197.0 Q ss_pred CchhhHHH---------------HHHHHHh-------hc-cchhhhhccccccccCCCcceechhhhHHHHHHHHhhcch Q lcl|NC_011614. 1 MEQTQKLK---------------LNLQHFA-------SN-NVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKI 57 (324) Q Consensus 1 m~~~~~~~---------------~~~~~~~-------~~-~~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l 57 (324) .+..++.+ ..+..+. .. .................++.+.|+.+...+...+...+++ T Consensus 190 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~i 269 (517) T protein:vir:97 190 LMKQRESEKILGVEALKVTPEATEFLKTREAEVAYMSASLTKDPKAAWTAELKERGISGMPAPAGILKRIQDAVNDEGSL 269 (517) T ss_pred HHHHHHhhhhcccccccccchhhHHHHHHHHHHHHHHhcccccccceeeeecccccccccccchHHHHHHHHhhhhhccc Confidence 00000000 0000000 00 0000000111112223357788999999999999999888 Q ss_pred hhhceeeecCCCceEEEEEeCCcceeeecccccccccccceeeEEeeeeeEEEeehhHHHHHhcChhH----HHHHHHHH Q lcl|NC_011614. 58 MQLGKYEPMEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQ----FFEEMKPM 133 (324) Q Consensus 58 ~~l~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~v~iS~ell~~s~~~----~~~~v~~~ 133 (324) +..++..+.+. ..+|..+....+.|+.||+.+|+++++|+++++.++++++++++|+++++|+..+ +++||... 
T Consensus 270 ~~~~~~~~i~~--~~~~~~~~~~~a~~~~eG~~kp~s~~tf~~~~~~~~~ia~~~~~S~qll~Ds~~dd~~~l~s~i~~~ 347 (517) T protein:vir:97 270 LPFIRHENLPT--LVVGGDNALTQGTGHTTGTDKTESNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNR 347 (517) T ss_pred eeeeeeccccc--eeeecccccceeeeeecCCcccccccceeeEEeeHhhhhhhhhhhHHHHHHhhhccHHHHHHHHHHH Confidence 88877655543 4567777777788999999999999999999999999999999999999998776 99999999 Q ss_pred HHHHHHHHHHHHHHhccCcCcCCcccccccccc-cceeecccchhHHHHHHHHhhhhcc--CCCEEEEcHHHHHHHHHhh Q lcl|NC_011614. 134 IAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKT-NKVIKGDFTQDNIIDLEALLEDDEL--EANAFISKTQNRSLLRKIV 210 (324) Q Consensus 134 l~~ai~~~~d~a~l~g~g~~~~~~~~~~~~~~~-~~~~~~~~~~~~i~~~~~~l~~~~~--~~~~~v~~~~~~~~L~~l~ 210 (324) |++++++++|.+||+|+|++..+.++....... .....++.+.++++ ..+..++. .++.|+|||.+|.+|+++| T Consensus 348 l~~~l~~~ee~a~l~GdGtg~~~~gi~~~a~~~~~~~~~~~~~~~d~i---~~l~~a~~~a~~a~~vmn~~t~~~I~klK 424 (517) T protein:vir:97 348 LPDMVIMAVNRAIIMGGVTGVSETQIYPVVGDAWATNVTGTTNIQELL---EKLSVATPKAADSTLVIHRNDLAAIRFLK 424 (517) T ss_pred HHHHHHHHHHHHHhcccCCCcccccccccccccccccccccchHHHHH---HHHHHHhhhccCCEEEECHHHHHHHHHhh Confidence 999999999999999999887665555443222 11222233333433 33333332 3678999999999999999 Q ss_pred ccCCceeeccC----CCceecccceEeecCccCCCceEEEeecccEEEEEecceEEEEeecccccccccccccchhhhhc Q lcl|NC_011614. 211 DPETKERIYDR----NSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQ 286 (324) Q Consensus 211 d~~g~~~~~~~----~~~~l~G~pv~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~ 286 (324) |++|+|+|+.. ...+++|..-+. +..... ....+..+.+.++.+.++++..+.+ +.+ T Consensus 425 D~~G~Yl~~~~~~~~~~~~l~G~~~~~-~~~~~~--~~~~~~~~~y~i~~~~g~~~~~~fd----------------~~~ 485 (517) T protein:vir:97 425 DKNGNYVFPVGVSNQTIATHFGFNRLV-QSVAVD--EKTAVSLSGYVTNGSRGMEFEQGTI----------------LVE 485 (517) T ss_pred cCCCCeeccCcCCcccccccCCccccc-cccccC--ceeEeeccccEEEeecceeeeeeee----------------ccc Confidence 99999999653 335667742211 222222 2334456677777777655322111 357 Q ss_pred CcEEEEEEEEeccEEecccceEEEEeeccCCC Q lcl|NC_011614. 
287 DMVALRATMHVALHIADDKAFAKLVPADAKPS 318 (324) Q Consensus 287 ~~v~~r~~~r~d~~v~~~~a~~~l~~~~~~~~ 318 (324) |+..|+.++|+++.|..|++|+.++...+... T Consensus 486 n~~~f~~~~~~~g~i~~~~r~a~~~~~p~~~~ 517 (517) T protein:vir:97 486 NNKEYLFEMPISGSLEYKGTTAYGTYTPPVAG 517 (517) T ss_pred CceeEeeeeeeccccccccceEEEEEcCCCCC Confidence 88899999999999999999999876544443 No 105 >protein:vir:4074 Length: 480 # NCBI annotation: major capsid (head) protein # Family: family:all:11745 # MgeID: mge:85 # MgeName: c2 # Cross-refs: genbank:acc:NP_043553;genbank:gi:9628687;genbank:GeneID:1261180 Probab=99.95 E-value=2e-30 Score=183.35 Aligned_cols=284 Identities=11% Similarity=0.002 Sum_probs=168.4 Q ss_pred Cchhh---HHH------------------------HHHHH---Hhhc----cchhhhhccccccccCCCcceechhhhHH Q lcl|NC_011614. 1 MEQTQ---KLK------------------------LNLQH---FASN----NVKPQVFNPDNVMMHEKKDGTLLNDFTTP 46 (324) Q Consensus 1 m~~~~---~~~------------------------~~~~~---~~~~----~~~~~~~~a~~~~~~~~~g~lip~~~~~~ 46 (324) ++... +++ .+.+. +... ........+.. ......++.+|+.+... T Consensus 151 ~el~akl~el~k~~ee~k~~~~~~~~~~~~~~~~~~e~r~~~~~~~~~~e~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~ 229 (480) T protein:vir:40 151 RELEAKVEELNKEREELKKEREASIPSEKPEDAERKFMRELGSKMAEMPEQGFLREFANGAD-LNVVNSLGSITSKYARK 229 (480) T ss_pred hhHHHHHHHHHhHHHHHhhhhhhhccccchhhhhhHHHHHHHHHhccchhhhhhhhhhhhcc-ccccccccccccchhhh Confidence 00000 000 00000 0000 00000000111 11222334455555544 Q ss_pred HHHHHHhhcchhhhceeeecCCCceEEEEEeCCcceeeeccccccccc--ccceeeEEee---eeeEEEeehhHHHHHhc Q lcl|NC_011614. 47 ILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKPGAYWVGEGQKIETS--KATWVNATMR---AFKLGVILPVTKEFLNY 121 (324) Q Consensus 47 i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~--~~~~~~v~~~---~~k~~~~v~iS~ell~~ 121 (324) +.......+++...++.. ..+.....|++|+...+.. ..++.+..+. 
.++++.....|.++++| T Consensus 230 ~~~~~~~~~~~~~~~~~~-----------~~g~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~v~~l~~~~k~t~~lLDD 298 (480) T protein:vir:40 230 SGIYDGAMKARFQGLTLA-----------EDGVDDTFISGTFKAGTDKNKSQTATKRSLRPQMAEAYLQMDKATVRGVND 298 (480) T ss_pred eeechhhhhhhhhcceee-----------eccccceeeeeeeecccccccccccccchhhHHHHHHHHHhHHHHHHHhhh Confidence 443333333333332221 1233445677765544332 2234455554 46788888999999999 Q ss_pred ChhHHHHHHHHHHHHHHHHHHHHHHHhccCcCcCCcccccccccccceeecccc-hhHHHHHHHHhhhhccCCC-EEEEc Q lcl|NC_011614. 122 TYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFT-QDNIIDLEALLEDDELEAN-AFISK 199 (324) Q Consensus 122 s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~~~~~~~~~~~~~~~~~~~~~~~-~~~i~~~~~~l~~~~~~~~-~~v~~ 199 (324) +. ++++||.++|++.++++++.+|++|+|++............ ..+...+ .+.+.+++.++...|+.++ .|+|| T Consensus 299 a~-~l~~~i~~~l~~~~~~~ee~a~l~G~g~g~~~~~g~~~~~~---~~~~~~~~~d~id~L~~al~~~y~~~a~~~vmn 374 (480) T protein:vir:40 299 SG-ALSEYVMSEMVNRVIQKVEYNMILGSVDGSNGFYGLKTATD---GWTKQIEYTDLFEGITDAVAECSISDAITIVMS 374 (480) T ss_pred hH-HHHHHHHHHHHHHHHHHHHHHhhccCCCCccccccceeecc---cccccchhHHHHHHHHHhhhHHhhCCCCEEEEC Confidence 87 79999999999999999999999997766433222221111 1122233 3445568889988888777 69999 Q ss_pred HHHHHHHHHhhccCCceeecc----CCCceecccceEeecCccCCCce-EEEeecccEEEEEecceEEEEeecccccccc Q lcl|NC_011614. 200 TQNRSLLRKIVDPETKERIYD----RNSDSLDGLPVVNLKSSNLKRGE-LITGDFDKLIYGIPQLIEYKIDETAQLSTVK 274 (324) Q Consensus 200 ~~~~~~L~~l~d~~g~~~~~~----~~~~~l~G~pv~~~~~~~~~~~~-i~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~ 274 (324) |.+|++|++|||++|+|+|++ +.+.+|+|+||+.+.. ..+.+. .+..+..++++++++ .+ ..+.. 
T Consensus 375 ~~t~~~I~klKD~~G~Yi~q~~~~~~~~~~llG~pvv~~~~-~~~~~~~~~~~~~~~~~~~d~~-~~--~~~~~------ 444 (480) T protein:vir:40 375 PQTFAELRKAKGTDGHSRFNELATKEQIAQSFGAVNLETRV-WMPKDEVAVYNHDEYVLIGDLN-VE--NYNDF------ 444 (480) T ss_pred HHHHHHHHHhhcCCCCeeccCcccccCcceecccceeeeec-cccCCcceeeeCCccEEEEecc-cc--eeccc------ Confidence 999999999999999999965 3467899999876532 222222 333333455677653 22 21111 Q ss_pred cccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeeccCCC Q lcl|NC_011614. 275 NEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADAKPS 318 (324) Q Consensus 275 ~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~~~~~ 318 (324) .+..++..|+++.|+++.+..|+|++.++++..=+- T Consensus 445 --------~~~~~~~~~~~e~~v~g~~~~~~~~~~~~~~~~~~~ 480 (480) T protein:vir:40 445 --------DLRYNVEQWLSETLVGGSIRGKNRSAYLKKKGSLGV 480 (480) T ss_pred --------ccccchhhhhhhhhhceeeEccccEEEEEeccCcCC Confidence 145788899999999999999999999999866655 No 106 >protein:vir:93742 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240459;genbank:gi:66396126;genbank:GeneID:5133511 Probab=99.93 E-value=1.9e-27 Score=167.03 Aligned_cols=263 Identities=14% Similarity=0.081 Sum_probs=212.5 Q ss_pred ccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeec----CCCceEEEEEeCCcceeeecccccccccccceeeEE Q lcl|NC_011614. 
27 DNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPM----EGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNAT 102 (324) Q Consensus 27 ~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~----~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~v~ 102 (324) +....|.-...++|+.++..+.+.+.+...+.+++....- +|..++||+|...+++.++.||+.++.++.++++.+ T Consensus 1 ma~~~T~~~~~iiPev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~ip~~~~~g~~~~~~eg~~i~~~~it~~~~~ 80 (274) T protein:vir:93 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKRE 80 (274) T ss_pred CCccceehhheechHHHHHHHHHHHHhhhhhcccccccccccCCCCCEEEEEeeccCCCcccccCCCcccccccccceeE Confidence 2123444556789999999999999999888888765431 355789999987778999999999999999999999 Q ss_pred eeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhccCcCcCCcccccccccccceeecccchhHHHHH Q lcl|NC_011614. 103 MRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDL 182 (324) Q Consensus 103 ~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~ 182 (324) +..++.+..+.++++....+..++.+.+.+++++++++++|+.++..-.+.. ....+..++++.+.++ T Consensus 81 ~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~~d~~~~~~~~~a~------------~~~~~~~~~~d~i~dA 148 (274) T protein:vir:93 81 AKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAK------------LTVNADITKLNGLQSA 148 (274) T ss_pred EEeeeecccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhccc------------ccccccccCHHHHHHH Confidence 9999999999999999999989999999999999999999999986432211 1122345689999999 Q ss_pred HHHhhhhccCCCEEEEcHHHHHHHHHhh------cc-CCceeeccCCCceecccceEeecCccCCCceEEEeecccEEEE Q lcl|NC_011614. 183 EALLEDDELEANAFISKTQNRSLLRKIV------DP-ETKERIYDRNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYG 255 (324) Q Consensus 183 ~~~l~~~~~~~~~~v~~~~~~~~L~~l~------d~-~g~~~~~~~~~~~l~G~pv~~~~~~~~~~~~i~~gd~~~~~~~ 255 (324) ..++.++......++|||..+..|++-. ++ .|..++..+..++++|++|++++. 
++++..++.....+.++ T Consensus 149 ~~~l~d~~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~--~p~~t~~l~~~gai~~~ 226 (274) T protein:vir:93 149 IDKFNDEDLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNK--LEAGTAILAKKGAVKLI 226 (274) T ss_pred HHHhhhccCCccEEEeCHHHHHHHHhhhhhcccccccccccceeecccceecCeeEEEcCC--CCcceEEEEeCCeEEEE Confidence 9999988878889999999999997531 11 234455667778999999998764 56778888888888888 Q ss_pred EecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeeccCCCC Q lcl|NC_011614. 256 IPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADAKPSS 319 (324) Q Consensus 256 ~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~ 319 (324) ...++.+|.+|+.. +....+++..++++++++|+++++++++.++-+- T Consensus 227 ~~~~~~vE~~Rd~~----------------~~~d~i~~~~~y~~~~~~~~~~v~~t~~~~s~~~ 274 (274) T protein:vir:93 227 LKRDFFLEVARDAS----------------TKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) T ss_pred ecCCcccccccchh----------------hcccEEEEEEEEEEEEEcCCceEEEeeCccccCC Confidence 88888888777642 3456789999999999999999999988777655 No 107 >protein:vir:3613 Length: 272 # NCBI annotation: MHP # Family: family:all:522 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112699;genbank:gi:13786567;genbank:GeneID:921035 Probab=99.91 E-value=1.9e-26 Score=161.59 Aligned_cols=259 Identities=14% Similarity=0.107 Sum_probs=201.3 Q ss_pred ccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeec----CCCceEEEEEeCCcceeeecccccccccccceeeEE Q lcl|NC_011614. 27 DNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPM----EGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNAT 102 (324) Q Consensus 27 ~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~----~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~v~ 102 (324) +..+.|.-...++|+.+...+.+.+.+...+.+++..-+. 
+|..+.||.|....++.+++||+.++..+.+.++.+ T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~ti~iP~~~~~gda~~~~eg~~i~~~~lt~~~~~ 80 (272) T protein:vir:36 1 MSKQKTTLADLVNPEVLAPIVSYELNKALRFAPLAQVDTTLQGQPGNTLKFPAFTYIGDAADVAEGGEISLDKIGTTTKS 80 (272) T ss_pred CCCcceehhhhhchHHHHHHHHHHHHhhhhhccccccccccccCCCCEEEEeeeccCccccccCCCCccChhhcCCccee Confidence 1112344456678999999999999999888888765442 366789999987778899999999999999999999 Q ss_pred eeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhccCcCcCCcccccccccccceeecccchhHHHHH Q lcl|NC_011614. 103 MRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDL 182 (324) Q Consensus 103 ~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~ 182 (324) +..++++..+.++++....+..++.+.+.++++.++++.+|+.++..-.+ .....+...+++.+.++ T Consensus 81 ~~i~~~~k~~~vtD~~~~~~~~d~~~~~~~~~a~~~a~~~d~~i~~~l~~-------------~~~~~~~~~~~d~i~~A 147 (272) T protein:vir:36 81 VTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLANKVDDDLLSAAKT-------------TSQTVSTKANVDGVQAA 147 (272) T ss_pred EeeehhhccccccHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhcc-------------ccccccccccHHHHHHH Confidence 99999999999999999999899999999999999999999998853211 11122356789999999 Q ss_pred HHHhhhhccCCCEEEEcHHHHHHHHHhh------ccCCceeeccCCCceecccceEeecCccCCCce---EEEeecccEE Q lcl|NC_011614. 183 EALLEDDELEANAFISKTQNRSLLRKIV------DPETKERIYDRNSDSLDGLPVVNLKSSNLKRGE---LITGDFDKLI 253 (324) Q Consensus 183 ~~~l~~~~~~~~~~v~~~~~~~~L~~l~------d~~g~~~~~~~~~~~l~G~pv~~~~~~~~~~~~---i~~gd~~~~~ 253 (324) ..++.+.+....+++|||.++..|++.. +..|..++..+..++++|++|++++..+.+... ++++ ...+. 
T Consensus 148 ~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~~~~~~~~~~~~~~G~ig~~~G~~Vv~s~~~p~~~~~~~~~~~~-~gA~~ 226 (272) T protein:vir:36 148 LDIFNDEDAQAYVLIVNPKDAAKIRKDANAKNIGSEVGANALINGTYADVLGAQIVRSKKLAEGSALMFKIVSN-SPALK 226 (272) T ss_pred HHHhhhcCCCceEEEEcHHHHHHHhcccccccccccccccceeeeccceecCeeEEEeCCCCCCceeEEEEEec-cccee Confidence 9999998888889999999999997643 233445555666789999999998766544332 2222 23444 Q ss_pred EEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeecc Q lcl|NC_011614. 254 YGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADA 315 (324) Q Consensus 254 ~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~~ 315 (324) ++..+++.+|..|+.. +....+++..+|+.++++|+++++++.+-. T Consensus 227 ~~~~~~~~vE~~R~~~----------------~~~d~i~~~~~y~~~v~~~~~vv~~t~~g~ 272 (272) T protein:vir:36 227 LVLKRGVQVETDRDIV----------------TKTTVITADEHYAAYLYDLTKVVNITFTGV 272 (272) T ss_pred eeecCCcccccccchh----------------hcCcEEEEEEEEEEEEEcCccEEEEeecCC Confidence 5666777777766542 344568899999999999999999988755 No 108 >protein:vir:96833 Length: 275 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240157;genbank:gi:66395822;genbank:GeneID:5133174 Probab=99.91 E-value=4.9e-26 Score=159.34 Aligned_cols=264 Identities=16% Similarity=0.078 Sum_probs=209.2 Q ss_pred hccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeec----CCCceEEEEEeCCcceeeeccccccccccccee Q lcl|NC_011614. 24 FNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPM----EGTEKKFTFWADKPGAYWVGEGQKIETSKATWV 99 (324) Q Consensus 24 ~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~----~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~ 99 (324) +. +...|.-...++|+.++..+.+.+.+...+.+++..-+. 
+|..+.||.|....++.++.||+.++..+.+.+ T Consensus 1 ~~--~~~~T~l~d~i~PEv~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~~~lt~~ 78 (275) T protein:vir:96 1 MA--LENMTKLANMVNPEVLAPMMQAELDKKLKFAQFADIDNTLVGQPGNTITFPAFVYSGDAKVVPEGEEIPIDLIETK 78 (275) T ss_pred CC--CcccchhhhhhchHHHHHHHHHHHHHhhhhcccceecccccCCCCCEEEeeeeccCCccccccCCCCcchhhcccc Confidence 21 222344455678999999999999999999888765442 466789999987778899999999999999999 Q ss_pred eEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhccCcCcCCcccccccccccceeecccchhHH Q lcl|NC_011614. 100 NATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNI 179 (324) Q Consensus 100 ~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i 179 (324) +.++..++.+..+.++++....+..++.+.+.++++.++++++|+.++.--++. .....+..++++.+ T Consensus 79 ~~~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~~~a~~~a~~~d~~ll~~l~~a------------~~~~~~~~~~~d~i 146 (275) T protein:vir:96 79 KRQATIRKIGKGTVLTDEALLSGYGDPKGEAVRQHGLAIANKVDNDVLEALQGA------------TLKVEADITKLAGL 146 (275) T ss_pred eeeEEeehhcccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhcc------------cccccccccCHHHH Confidence 999999999999999999998888899999999999999999999988532221 11123356789999 Q ss_pred HHHHHHhhhhccCCCEEEEcHHHHHHHHHhh-------ccCCceeeccCCCceecccceEeecCccCCCceEEEeecccE Q lcl|NC_011614. 180 IDLEALLEDDELEANAFISKTQNRSLLRKIV-------DPETKERIYDRNSDSLDGLPVVNLKSSNLKRGELITGDFDKL 252 (324) Q Consensus 180 ~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~-------d~~g~~~~~~~~~~~l~G~pv~~~~~~~~~~~~i~~gd~~~~ 252 (324) .++..++.++......++|||..+..|++.. +..|..++..+..++++|++|+.++. 
++.+..++.....+ T Consensus 147 ~dA~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~f~~~~~~g~~~~~~G~ig~~~G~~Vi~s~~--~p~~t~~i~~~gA~ 224 (275) T protein:vir:96 147 QTAIDKFNDEDLEPMVLFVNPLDAGKLRASATDNFTRATLLGDNVIVKGAFGEALGAIIVRSNK--IKEGEAILAKRGAV 224 (275) T ss_pred HHHHHHhccccCCccEEEeCHHHHHHHHhcccccccccccccccceeccccceecCeeEEEeCC--CCcceEEEEeccce Confidence 9999999887777788999999999997652 12244556677788999999999765 45556666666777 Q ss_pred EEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeeccCCCC Q lcl|NC_011614. 253 IYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADAKPSS 319 (324) Q Consensus 253 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~ 319 (324) .++...++.+|.+|+.. +....+++..+|+.++++|+++++++.+.+.=.+ T Consensus 225 ~~~~~~~~~vE~~Rd~~----------------~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~~~~ 275 (275) T protein:vir:96 225 KLITKRDFFLETERHAS----------------HKSTALFSDKHYVAYLYDESKVVKITKSASGLGV 275 (275) T ss_pred eeeecCCcccccccchh----------------hcCcEEEEeEEEEEEEEcCccEEEEEecccccCC Confidence 78888888888877642 3456788899999999999999999886555555 No 109 >protein:vir:105334 Length: 276 # NCBI annotation: putative phage major capsid protein # Family: family:all:522 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950669;genbank:gi:119967839;genbank:GeneID:4643213 Probab=99.91 E-value=6.1e-26 Score=158.81 Aligned_cols=265 Identities=17% Similarity=0.108 Sum_probs=211.1 Q ss_pred cccccccCCCcceechhhhHHHHHHHHhhcchhhhceeee----cCCCceEEEEEeCCcceeeecccccccccccceeeE Q lcl|NC_011614. 26 PDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEP----MEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNA 101 (324) Q Consensus 26 a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~----~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~v 101 (324) +.+ ..|.-...++|+.+++.+.+.+.+...+.+++...+ .++..+.||.|....++.+++||+.++..+.+.++. 
T Consensus 1 Ma~-~~T~l~d~i~Pev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~ti~iP~~~~igda~~~~eg~~i~~~~lt~~~~ 79 (276) T protein:vir:10 1 MAQ-GTTTKSTQIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFVYSGDATVVPEGQKIPVDKIETNRR 79 (276) T ss_pred CCc-ceeehhhhhchHHHHHHHHHHHHhhhhhcccceecccccCCCCCEEEeeeecCCCccccccCCCccCcccccccee Confidence 111 134445667899999999999999999999887543 357778999998778889999999999999999999 Q ss_pred EeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhccCcCcCCcccccccccccceeecccchhHHHH Q lcl|NC_011614. 102 TMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIID 181 (324) Q Consensus 102 ~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~ 181 (324) ....++.+..+.++++....+..|+.+.+.++++.++++++|+.++.--.+ ......++.++++.+.+ T Consensus 80 ~a~i~~~~k~~~~tD~a~~~~~~dp~~~~~~~~~~~~a~~~d~~~~~~l~~------------~~~~~~~~~~t~d~i~~ 147 (276) T protein:vir:10 80 EAKIHKIGKGTDITDEALLSGYGDPQGEAVRQHGLAIANKVDNDVLEALRG------------TKLTVSADIGTLAGLEA 147 (276) T ss_pred eEEeehccccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhc------------ccccccccccCHHHHHH Confidence 999999999999999999999899999999999999999999988753111 11122345678999999 Q ss_pred HHHHhhhhccCCCEEEEcHHHHHHHHHhhc------c-CCceeeccCCCceecccceEeecCccCCCceEEEeecccEEE Q lcl|NC_011614. 182 LEALLEDDELEANAFISKTQNRSLLRKIVD------P-ETKERIYDRNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIY 254 (324) Q Consensus 182 ~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d------~-~g~~~~~~~~~~~l~G~pv~~~~~~~~~~~~i~~gd~~~~~~ 254 (324) +..++.+......+++|||..+..|++... . .|...+..+..++++|++|++++. 
++.+..++.....+.+ T Consensus 148 A~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~--~p~~t~~l~~~gAi~~ 225 (276) T protein:vir:10 148 AIDTFDDEDLEPMVLFINPKDAGKLRSSASDNFTRATELGDNIIVKGAFGEALGAVIVRSKK--LDEGEAILAKRGAVKL 225 (276) T ss_pred HHHHhccccCcccEEEEcHHHHHHHHHhccccccccccccccceeccccceecceeEEEcCC--CCcceEEEEeccceee Confidence 999998877778899999999999987532 1 233445566778999999999764 5567777777777778 Q ss_pred EEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeeccCCCCccc Q lcl|NC_011614. 255 GIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADAKPSSVPG 322 (324) Q Consensus 255 ~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~~~ 322 (324) +..+++.+|.+|+.. +....+++..+|+.++.+|++++++++++ .+.++-+ T Consensus 226 ~~~~~~~vE~dRd~~----------------~~~d~i~~~~~y~~~~~~~~~vv~~t~~~-~~~~~~~ 276 (276) T protein:vir:10 226 ITKRDFFLETDRDPS----------------TKTTALYSDKHYVAYLYDESKAVKVTKGA-GTTDSGA 276 (276) T ss_pred eecCCceeecccchh----------------hcccEEEEeeEEEEEEEcCcceEEEecCC-cCCcCCC Confidence 888888888887653 34566888899999999999999999765 4444444 No 110 >protein:vir:96123 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240078;genbank:gi:66395742;genbank:GeneID:5133103 Probab=99.90 E-value=3.1e-25 Score=154.91 Aligned_cols=263 Identities=14% Similarity=0.092 Sum_probs=207.6 Q ss_pred ccccccCCCcceechhhhHHHHHHHHhhcchhhhceeee----cCCCceEEEEEeCCcceeeecccccccccccceeeEE Q lcl|NC_011614. 
27 DNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEP----MEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNAT 102 (324) Q Consensus 27 ~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~----~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~v~ 102 (324) +....|.-...++|+.++..+.+.+.+...+.+++..-+ -+|..+.||+|....++..+.||+.++..+.++++.+ T Consensus 1 ma~~~T~~~d~i~Pev~s~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~ip~~~~~g~~~~~~~g~~i~~~~it~~~~~ 80 (274) T protein:vir:96 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVDQIGTSKRE 80 (274) T ss_pred CCccccchhhhhhhHHHHHHHHHHHHhhhhhcccccccccccCCCCCEEEEEeeccCCCccccCCCCcCchhhcccceeE Confidence 111234445678899999999999988888888776533 2366789999987678888999999999999999999 Q ss_pred eeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhccCcCcCCcccccccccccceeecccchhHHHHH Q lcl|NC_011614. 103 MRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDL 182 (324) Q Consensus 103 ~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~ 182 (324) +..++.+..+.++++....+..++.+.+.++++.++++.+|+.++..-.+. .....+..++++.+.++ T Consensus 81 ~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~~d~~i~~~l~~a------------~~~~~~~~~~~d~i~dA 148 (274) T protein:vir:96 81 AKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGA------------TLTVEADITKLDGLQTA 148 (274) T ss_pred EEEEeeeceeeecHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHHHHhcC------------CCCcCcccccHHHHHHH Confidence 999999999999999999998899999999999999999999888642211 11223355789999999 Q ss_pred HHHhhhhccCCCEEEEcHHHHHHHHHhh------cc-CCceeeccCCCceecccceEeecCccCCCceEEEeecccEEEE Q lcl|NC_011614. 183 EALLEDDELEANAFISKTQNRSLLRKIV------DP-ETKERIYDRNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYG 255 (324) Q Consensus 183 ~~~l~~~~~~~~~~v~~~~~~~~L~~l~------d~-~g~~~~~~~~~~~l~G~pv~~~~~~~~~~~~i~~gd~~~~~~~ 255 (324) ..+|.++......++|||..+..|++.. +. .|..++..+..++++|++|++++. 
+|.+..++.....+.++ T Consensus 149 ~~~l~d~~~~~~~ivv~p~~~~~L~k~~~~~f~~~~~~g~~~~~~g~ig~~~G~~Vi~s~~--~p~~t~~l~~~gA~~~~ 226 (274) T protein:vir:96 149 IDKFNDEDLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNK--LNKGEALLAKKGAVKLI 226 (274) T ss_pred HHHhcccCCCceEEEeCHHHHHHHHhcccccccccccccccceeecccceecCeeEEEcCC--CCcceEEEEeCcceeee Confidence 9999988877888999999999997653 11 234455667788999999998765 55667777777778888 Q ss_pred EecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeeccCCCC Q lcl|NC_011614. 256 IPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADAKPSS 319 (324) Q Consensus 256 ~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~ 319 (324) ...++.+|.+|+.. +....+++..+|+.++++|+++++++++++..-- T Consensus 227 ~~~~~~vE~~Rd~~----------------~~~d~i~~~~~yg~~~~~~~~vv~~t~~~~~~~~ 274 (274) T protein:vir:96 227 TKRDFFLEKDRDAS----------------RKSTALYSDKHYVAYLYDESKVVKITKGAGDEVM 274 (274) T ss_pred ecCCcccccccchh----------------hcccEEEEeeEEEEEEEcCccEEEEEcCcccccC Confidence 88887887766542 3456788889999999999999999998776544 No 111 >protein:vir:94494 Length: 274 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240676;genbank:gi:66396348;genbank:GeneID:5133758 Probab=99.90 E-value=5.6e-25 Score=153.54 Aligned_cols=263 Identities=14% Similarity=0.084 Sum_probs=210.4 Q ss_pred cccccccCCCcceechhhhHHHHHHHHhhcchhhhceeee----cCCCceEEEEEeCCcceeeecccccccccccceeeE Q lcl|NC_011614. 26 PDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEP----MEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNA 101 (324) Q Consensus 26 a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~----~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~v 101 (324) +- -..|.-...++|+.+...+.+.+.+...+.+++..-+ .++..+.||+|....++..+.||+.++..+.+.++. 
T Consensus 1 ma-~~~T~~~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~~g~a~~~~~g~~i~~~~lt~~~~ 79 (274) T protein:vir:94 1 MP-QGLTKTSDQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKR 79 (274) T ss_pred CC-ccceehhheechHHHHHHHHHhhhhhhhhcccceecccccCCCCCEEEEeeecCCCccccccCCCccccccccccee Confidence 11 1234445678999999999999988888888876533 246678999998767888999999999999999999 Q ss_pred EeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhccCcCcCCcccccccccccceeecccchhHHHH Q lcl|NC_011614. 102 TMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIID 181 (324) Q Consensus 102 ~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~ 181 (324) ++..++.+..+.++++....+..++.+.+.+++++++++++|+.++.--.+.. ....+..++++.+.+ T Consensus 80 ~~~i~~~~~~~~i~D~~~~~~~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a~------------~~~~~~~~~~d~i~d 147 (274) T protein:vir:94 80 EAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAK------------LTVNADITKLNGLQS 147 (274) T ss_pred EEEeeeecceecccHHHHHhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccC------------ccccccccCHHHHHH Confidence 99999999999999999999888999999999999999999999885422211 112234578999999 Q ss_pred HHHHhhhhccCCCEEEEcHHHHHHHHHh------hcc-CCceeeccCCCceecccceEeecCccCCCceEEEeecccEEE Q lcl|NC_011614. 182 LEALLEDDELEANAFISKTQNRSLLRKI------VDP-ETKERIYDRNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIY 254 (324) Q Consensus 182 ~~~~l~~~~~~~~~~v~~~~~~~~L~~l------~d~-~g~~~~~~~~~~~l~G~pv~~~~~~~~~~~~i~~gd~~~~~~ 254 (324) +..++.++......++|||..+..|++- +.+ .|..++..+..++++|++|++++. 
++.+..++.....+.+ T Consensus 148 A~~~l~d~~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~--~p~~t~~l~~~gA~~~ 225 (274) T protein:vir:94 148 AIDKFNDEDLEPMVLFVNPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNK--LEAGTAILAKKGAVKL 225 (274) T ss_pred HHHHhhccCCCceEEEeCHHHHHHHHhhhhhhccccCcccccceeccccceecCeeEEEcCC--CCcceEEEEeCcceEe Confidence 9999998887788899999999999753 111 244556677788999999999765 5577788888888888 Q ss_pred EEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeeccCCCC Q lcl|NC_011614. 255 GIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADAKPSS 319 (324) Q Consensus 255 ~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~ 319 (324) +..+++.+|.+|+.. +....+++..+|+.++++|+++++++++.++-+- T Consensus 226 ~~~~~~~vE~~Rd~~----------------~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~~~~ 274 (274) T protein:vir:94 226 ILKRDFFLEVARDAS----------------TKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) T ss_pred eecCCceeccccchh----------------hcccEEEEEEEEEEEEEcCCceEEEecCcccccC Confidence 888888888887643 2345678889999999999999999988777655 No 112 >protein:vir:97433 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240749;genbank:gi:66396420;genbank:GeneID:5133789 Probab=99.90 E-value=5.6e-25 Score=153.54 Aligned_cols=263 Identities=14% Similarity=0.084 Sum_probs=210.4 Q ss_pred cccccccCCCcceechhhhHHHHHHHHhhcchhhhceeee----cCCCceEEEEEeCCcceeeecccccccccccceeeE Q lcl|NC_011614. 26 PDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEP----MEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNA 101 (324) Q Consensus 26 a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~----~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~v 101 (324) +- -..|.-...++|+.+...+.+.+.+...+.+++..-+ .++..+.||+|....++..+.||+.++..+.+.++. 
T Consensus 1 ma-~~~T~~~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~~g~a~~~~~g~~i~~~~lt~~~~ 79 (274) T protein:vir:97 1 MP-QGLTKTSDQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKR 79 (274) T ss_pred CC-ccceehhheechHHHHHHHHHhhhhhhhhcccceecccccCCCCCEEEEeeecCCCccccccCCCccccccccccee Confidence 11 1234445678999999999999988888888876533 246678999998767888999999999999999999 Q ss_pred EeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhccCcCcCCcccccccccccceeecccchhHHHH Q lcl|NC_011614. 102 TMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIID 181 (324) Q Consensus 102 ~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~ 181 (324) ++..++.+..+.++++....+..++.+.+.+++++++++++|+.++.--.+.. ....+..++++.+.+ T Consensus 80 ~~~i~~~~~~~~i~D~~~~~~~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a~------------~~~~~~~~~~d~i~d 147 (274) T protein:vir:97 80 EAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAK------------LTVNADITKLNGLQS 147 (274) T ss_pred EEEeeeecceecccHHHHHhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccC------------ccccccccCHHHHHH Confidence 99999999999999999999888999999999999999999999885422211 112234578999999 Q ss_pred HHHHhhhhccCCCEEEEcHHHHHHHHHh------hcc-CCceeeccCCCceecccceEeecCccCCCceEEEeecccEEE Q lcl|NC_011614. 182 LEALLEDDELEANAFISKTQNRSLLRKI------VDP-ETKERIYDRNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIY 254 (324) Q Consensus 182 ~~~~l~~~~~~~~~~v~~~~~~~~L~~l------~d~-~g~~~~~~~~~~~l~G~pv~~~~~~~~~~~~i~~gd~~~~~~ 254 (324) +..++.++......++|||..+..|++- +.+ .|..++..+..++++|++|++++. 
++.+..++.....+.+ T Consensus 148 A~~~l~d~~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~--~p~~t~~l~~~gA~~~ 225 (274) T protein:vir:97 148 AIDKFNDEDLEPMVLFVNPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNK--LEAGTAILAKKGAVKL 225 (274) T ss_pred HHHHhhccCCCceEEEeCHHHHHHHHhhhhhhccccCcccccceeccccceecCeeEEEcCC--CCcceEEEEeCcceEe Confidence 9999998887788899999999999753 111 244556677788999999999765 5577788888888888 Q ss_pred EEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeeccCCCC Q lcl|NC_011614. 255 GIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADAKPSS 319 (324) Q Consensus 255 ~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~ 319 (324) +..+++.+|.+|+.. +....+++..+|+.++++|+++++++++.++-+- T Consensus 226 ~~~~~~~vE~~Rd~~----------------~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~~~~ 274 (274) T protein:vir:97 226 ILKRDFFLEVARDAS----------------TKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) T ss_pred eecCCceeccccchh----------------hcccEEEEEEEEEEEEEcCCceEEEecCcccccC Confidence 888888888887643 2345678889999999999999999988777655 No 113 >protein:vir:1239 Length: 274 # NCBI annotation: similar to phage B1 major head protein # Family: family:all:522 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510938;genbank:gi:17426272;genbank:GeneID:927376 Probab=99.88 E-value=6.8e-24 Score=147.61 Aligned_cols=263 Identities=14% Similarity=0.091 Sum_probs=207.8 Q ss_pred cccccccCCCcceechhhhHHHHHHHHhhcchhhhceeee----cCCCceEEEEEeCCcceeeecccccccccccceeeE Q lcl|NC_011614. 26 PDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEP----MEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNA 101 (324) Q Consensus 26 a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~----~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~v 101 (324) ..+ ..|.-...++|+.+...+.+.+.+...+.+++..-. .+|..+.||.|...+++..+.||+.++..+.+.++. 
T Consensus 1 ma~-~~T~l~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~~~lt~~~~ 79 (274) T protein:vir:12 1 MAQ-GLTKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKR 79 (274) T ss_pred CCc-ceeehhhhhchHHHHHHHHHHHHhhhhhcccceecccccCCCCCEEEEeeecCCCccccccCCCccchhhccccee Confidence 111 234445668899999999999988888878776532 246678999998777888999999999999999999 Q ss_pred EeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhccCcCcCCcccccccccccceeecccchhHHHH Q lcl|NC_011614. 102 TMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIID 181 (324) Q Consensus 102 ~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~ 181 (324) ++..++.+..+.++++....+..++.+.+.++++.++++++|+.++.--.+. ........++++.+.+ T Consensus 80 ~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~q~~~~~a~~vd~~~l~~~~~a------------~~~~~~~a~~~d~i~d 147 (274) T protein:vir:12 80 EAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGA------------KLTVNADITKLNGLQS 147 (274) T ss_pred eEEeeeecceeeecHHHHHhcccchHHHHHHHHHHHHHHHHHHHHHHHHhcc------------cccccccccCHHHHHH Confidence 9999999999999999888888899999999999999999999988542221 1122335678999999 Q ss_pred HHHHhhhhccCCCEEEEcHHHHHHHHHh------hccC-CceeeccCCCceecccceEeecCccCCCceEEEeecccEEE Q lcl|NC_011614. 182 LEALLEDDELEANAFISKTQNRSLLRKI------VDPE-TKERIYDRNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIY 254 (324) Q Consensus 182 ~~~~l~~~~~~~~~~v~~~~~~~~L~~l------~d~~-g~~~~~~~~~~~l~G~pv~~~~~~~~~~~~i~~gd~~~~~~ 254 (324) +..+|.+......+++|||..+..|++. ++++ |..++..+..++++|++|+.++. 
++.+..++.....+.+ T Consensus 148 A~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~fv~~s~~g~~~~~~G~ig~~~G~~Vi~s~~--~p~~t~~l~~~gA~~~ 225 (274) T protein:vir:12 148 AIDKFNDEDLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRSNK--LEAGTAILAKKGAVKL 225 (274) T ss_pred HHHHhccccccccEEEeCHHHHHHHHhhhhhhccccccccccceecccceeecCeeEEEeCC--CCcceEEEEeccceee Confidence 9999988777778899999999998763 1222 44556677788999999999765 4455666666677778 Q ss_pred EEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeeccCCCC Q lcl|NC_011614. 255 GIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADAKPSS 319 (324) Q Consensus 255 ~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~ 319 (324) +..+++.+|.+|+.. +....+++..+|+.++++|+++++++++.|+-+- T Consensus 226 ~~~~~~~vE~~Rd~~----------------~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~~~~ 274 (274) T protein:vir:12 226 ILKRDFFLEVARDAS----------------TKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) T ss_pred eecCCceeccccchh----------------hcccEEEeeeEEEEEEEcCCceEEEEcCCccccC Confidence 888888888887753 2345788889999999999999999988777655 No 114 >protein:vir:96262 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240311;genbank:gi:66395978;genbank:GeneID:5133339 Probab=99.88 E-value=8.5e-24 Score=147.06 Aligned_cols=263 Identities=14% Similarity=0.079 Sum_probs=206.4 Q ss_pred cccccccCCCcceechhhhHHHHHHHHhhcchhhhceeee----cCCCceEEEEEeCCcceeeecccccccccccceeeE Q lcl|NC_011614. 26 PDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEP----MEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNA 101 (324) Q Consensus 26 a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~----~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~v 101 (324) ..+ ..|.-...++|+.++..+.+.+.+...+.+++..-+ -+|..+.||.|....++..+.||+.++..+.+.++. 
T Consensus 1 m~~-~~T~l~d~i~Pev~~~~v~~~~~~~l~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~~~lt~~~~ 79 (274) T protein:vir:96 1 MAQ-GMTKLTNQIVPEVLAPMMQAELEKKLRFASFAEIDNTLVGQPGDTLTFPAFIYSGDAKVVAEGEKIPTDILETKKR 79 (274) T ss_pred CCc-ceeehhheechHHHHHHHHHHHHhhhhccccceecccccCCCCCEEEeeeecCCCccccccCCCccchhhccccee Confidence 111 233445567899999999999988888888865433 246788999998777888899999999999999999 Q ss_pred EeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhccCcCcCCcccccccccccceeecccchhHHHH Q lcl|NC_011614. 102 TMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIID 181 (324) Q Consensus 102 ~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~ 181 (324) ++..++.+..+.++++....+..++.+.+.++++.++++++|+.++.--.+. .....+++++++.+.+ T Consensus 80 ~~~i~~~~~a~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a------------~~~~~~~~~~~d~i~~ 147 (274) T protein:vir:96 80 EAKIRKIAKGTSISDEALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSA------------KLTVEADITKLTGLQT 147 (274) T ss_pred EEEeeeeecceeehHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhcc------------cccccccccCHHHHHH Confidence 9999999999999999888888899999999999999999999988532221 1112335678999999 Q ss_pred HHHHhhhhccCCCEEEEcHHHHHHHHHhh------cc-CCceeeccCCCceecccceEeecCccCCCceEEEeecccEEE Q lcl|NC_011614. 182 LEALLEDDELEANAFISKTQNRSLLRKIV------DP-ETKERIYDRNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIY 254 (324) Q Consensus 182 ~~~~l~~~~~~~~~~v~~~~~~~~L~~l~------d~-~g~~~~~~~~~~~l~G~pv~~~~~~~~~~~~i~~gd~~~~~~ 254 (324) +..++.++.....+++|||..+..|++.. ++ .|..++..+..++++|++|+.++. 
++++..++.....+.+ T Consensus 148 A~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~--~~~~t~~l~~~gA~~~ 225 (274) T protein:vir:96 148 AIDKFNDEDLEPMVLFISPLDAGKLRGDATTNFTRATELGDDVIVKGAFGEALGAVIVRSNK--LEAGTAILAKKGAVKL 225 (274) T ss_pred HHHHhccccccccEEEeCHHHHHHHHhhccccccccccccccceeccccceecCeEEEEeCC--CCCceEEEEeccceee Confidence 99999887777788999999999997631 12 234566677788999999999765 4455666655667777 Q ss_pred EEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeeccCCCC Q lcl|NC_011614. 255 GIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADAKPSS 319 (324) Q Consensus 255 ~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~ 319 (324) +..+++.+|.+|+.. +....+++..+|++++++|++++++++.+|.=+- T Consensus 226 ~~~~~~~vE~~Rd~~----------------~~~d~i~~~~~y~~~~~~~~~~v~~tk~~~~~~~ 274 (274) T protein:vir:96 226 ITKRDFFLETDRDPS----------------TKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) T ss_pred eecCCcccccccccc----------------cccCEEEEeEEEEEEEEcCCcEEEEEcCCccccC Confidence 778888888877643 4556788889999999999999999988776544 No 115 >protein:vir:95898 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240385;genbank:gi:66396054;genbank:GeneID:5133409 Probab=99.88 E-value=8.5e-24 Score=147.06 Aligned_cols=263 Identities=14% Similarity=0.079 Sum_probs=206.4 Q ss_pred cccccccCCCcceechhhhHHHHHHHHhhcchhhhceeee----cCCCceEEEEEeCCcceeeecccccccccccceeeE Q lcl|NC_011614. 26 PDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEP----MEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNA 101 (324) Q Consensus 26 a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~----~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~v 101 (324) ..+ ..|.-...++|+.++..+.+.+.+...+.+++..-+ -+|..+.||.|....++..+.||+.++..+.+.++. 
T Consensus 1 m~~-~~T~l~d~i~Pev~~~~v~~~~~~~l~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~~~lt~~~~ 79 (274) T protein:vir:95 1 MAQ-GMTKLTNQIVPEVLAPMMQAELEKKLRFASFAEIDNTLVGQPGDTLTFPAFIYSGDAKVVAEGEKIPTDILETKKR 79 (274) T ss_pred CCc-ceeehhheechHHHHHHHHHHHHhhhhccccceecccccCCCCCEEEeeeecCCCccccccCCCccchhhccccee Confidence 111 233445567899999999999988888888865433 246788999998777888899999999999999999 Q ss_pred EeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhccCcCcCCcccccccccccceeecccchhHHHH Q lcl|NC_011614. 102 TMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIID 181 (324) Q Consensus 102 ~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~ 181 (324) ++..++.+..+.++++....+..++.+.+.++++.++++++|+.++.--.+. .....+++++++.+.+ T Consensus 80 ~~~i~~~~~a~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a------------~~~~~~~~~~~d~i~~ 147 (274) T protein:vir:95 80 EAKIRKIAKGTSISDEALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSA------------KLTVEADITKLTGLQT 147 (274) T ss_pred EEEeeeeecceeehHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhcc------------cccccccccCHHHHHH Confidence 9999999999999999888888899999999999999999999988532221 1112335678999999 Q ss_pred HHHHhhhhccCCCEEEEcHHHHHHHHHhh------cc-CCceeeccCCCceecccceEeecCccCCCceEEEeecccEEE Q lcl|NC_011614. 182 LEALLEDDELEANAFISKTQNRSLLRKIV------DP-ETKERIYDRNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIY 254 (324) Q Consensus 182 ~~~~l~~~~~~~~~~v~~~~~~~~L~~l~------d~-~g~~~~~~~~~~~l~G~pv~~~~~~~~~~~~i~~gd~~~~~~ 254 (324) +..++.++.....+++|||..+..|++.. ++ .|..++..+..++++|++|+.++. 
++++..++.....+.+ T Consensus 148 A~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~--~~~~t~~l~~~gA~~~ 225 (274) T protein:vir:95 148 AIDKFNDEDLEPMVLFISPLDAGKLRGDATTNFTRATELGDDVIVKGAFGEALGAVIVRSNK--LEAGTAILAKKGAVKL 225 (274) T ss_pred HHHHhccccccccEEEeCHHHHHHHHhhccccccccccccccceeccccceecCeEEEEeCC--CCCceEEEEeccceee Confidence 99999887777788999999999997631 12 234566677788999999999765 4455666655667777 Q ss_pred EEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeeccCCCC Q lcl|NC_011614. 255 GIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADAKPSS 319 (324) Q Consensus 255 ~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~ 319 (324) +..+++.+|.+|+.. +....+++..+|++++++|++++++++.+|.=+- T Consensus 226 ~~~~~~~vE~~Rd~~----------------~~~d~i~~~~~y~~~~~~~~~~v~~tk~~~~~~~ 274 (274) T protein:vir:95 226 ITKRDFFLETDRDPS----------------TKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) T ss_pred eecCCcccccccccc----------------cccCEEEEeEEEEEEEEcCCcEEEEEcCCccccC Confidence 778888888877643 4556788889999999999999999988776544 No 116 >protein:vir:80930 Length: 278 # NCBI annotation: Cps # Family: family:all:522 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468392;genbank:gi:157324966;genbank:GeneID:5601363 Probab=99.88 E-value=5.8e-24 Score=147.97 Aligned_cols=266 Identities=15% Similarity=0.124 Sum_probs=199.8 Q ss_pred ccccccCCCcceechhhhHHHHHHHHhhcchhhhceeee----cCCCceEEEEEeCCcceeeecccccccccccceeeEE Q lcl|NC_011614. 27 DNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEP----MEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNAT 102 (324) Q Consensus 27 ~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~----~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~v~ 102 (324) +..++|..+..++|+.++..+.+.+.+...+.+++.... 
-++..+.||+|....++.++.||+.++..+.++++.+ T Consensus 1 Ma~~~T~~~~~iiPev~s~~v~~~~~~~~v~~~~~~~~~~l~g~~G~tv~ip~~~~~g~a~~~~~g~~i~~~~lt~~~~~ 80 (278) T protein:vir:80 1 MADLTTKLANLIDPEVMGPMISAKLPKAIKFGKIAPIDNSLEGQPGSEITVPKYKYIGDAQDVAEGAAIDYSALETESVK 80 (278) T ss_pred CCCcceehhheecHHHHHHHHHHHHHHhhhhcccceecccccCCCCCEEEEeeeccCCcceeecCCCcCcccccccceee Confidence 111245556678999999999999999888888775433 2355688999987778899999999999999999999 Q ss_pred eeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhccCcCcCCcccccccccccceeecccchhHHHHH Q lcl|NC_011614. 103 MRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDL 182 (324) Q Consensus 103 ~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~ 182 (324) +..++.+..+.++++....+..++.+.+.+++++++++.+|+.++..-.+... ..............++.+.++ T Consensus 81 ~~i~~~~~a~~v~D~~~~~~~~d~~~~~~~~~a~~~a~~~d~~l~~~l~~a~~------~~~~~~t~~~~~~~~~~~~da 154 (278) T protein:vir:80 81 HGIKKAGKGVKLTDESVLSGYGDPVEEAQKQIRMAIASKVDNDILEEALTTTL------EVKGAINIGLIDKIENTFTDA 154 (278) T ss_pred EeeehhhccccccHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHHhcccc------ccccccccchhhhHHHHHHHH Confidence 99999999999999999999899999999999999999999988864221110 000001111122346677788 Q ss_pred HHHhhhhccCC-CEEEEcHHHHHHHHHhhc-------cCCceeeccCCCceecccceEeecCccCCCceEEEeecccEEE Q lcl|NC_011614. 183 EALLEDDELEA-NAFISKTQNRSLLRKIVD-------PETKERIYDRNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIY 254 (324) Q Consensus 183 ~~~l~~~~~~~-~~~v~~~~~~~~L~~l~d-------~~g~~~~~~~~~~~l~G~pv~~~~~~~~~~~~i~~gd~~~~~~ 254 (324) ..++..++... ..++|||..+..|++... ..|..++..+..++++|++|++++. 
++.+..++...+.+.+ T Consensus 155 ~~~l~~~~~~~~~~ivv~p~~~~~L~k~~~~~~~~~~~~g~~~~~~G~ig~~~G~~Vi~s~~--~p~~t~~l~~~gAi~~ 232 (278) T protein:vir:80 155 PDAIEDESITTTGVLFLNYKDTAKLREEAAGSWTKASQLGDDLLVKGAFGELLGWEIVRTKK--LADGNALAVKAGALKT 232 (278) T ss_pred HHhhcccCCCcccEEEECHHHHHHHHhhhhhhccccccccccceeeccceeecceeEEEcCC--CCcceEEEEeccceee Confidence 87877665443 358899999999976431 1244455667778999999999775 4556667666777777 Q ss_pred EEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeeccC Q lcl|NC_011614. 255 GIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADAK 316 (324) Q Consensus 255 ~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~~~ 316 (324) +..+++.+|.+|+.. +.+..+++..+|+.++++|++++++++.+.. T Consensus 233 ~~~~~~~vE~~Rd~~----------------~~~d~i~~~~~yg~~v~~~~~~v~it~~a~~ 278 (278) T protein:vir:80 233 FLKRNLLAESGRDMD----------------HKLTKFNADQHYAVALVDETKAVKVVPVAGN 278 (278) T ss_pred eecCCcccccccchh----------------hccceeeeeeEEEEEEEcCcceEEEeeccCC Confidence 777888888776642 3455788889999999999999999998766 No 117 >protein:vir:95107 Length: 270 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240822;genbank:gi:66394683;genbank:GeneID:5133901 Probab=99.86 E-value=3.7e-23 Score=143.53 Aligned_cols=262 Identities=11% Similarity=0.077 Sum_probs=204.7 Q ss_pred ccccCCCcceechhhhHHHHHHHHhhcchhhhceeee----cCCCceEEEEEeCCcceeeecccccccccccceeeEEee Q lcl|NC_011614. 
29 VMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEP----MEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMR 104 (324) Q Consensus 29 ~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~----~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~ 104 (324) .+.|.-...++|+.+.+.+.+.+.+...+.+++..-+ .+|..+.+|.|....++..+.||+.++..+.++++...+ T Consensus 1 Ma~T~~~d~I~Pev~~~~V~e~~~~~~~~~~~~~~d~~L~g~~G~ti~~P~~~~igdae~~~eg~~i~~~~lt~~~~~a~ 80 (270) T protein:vir:95 1 MTQTKKANLINPEVLANVVSAQMQNAIRFTPYAVTDDTLVGQPGDTITRPKYAYIGAAEDLQEGVAMDTTQMSMTTTKVT 80 (270) T ss_pred CCceehhhhcchHHHHHHHHHHHHhHHhhccccccccccCCCCCCEEEeeeecCCCccccccCCCccchhhcccchheee Confidence 2223344566899999999999999888888877533 256778999999878899999999999999999999999 Q ss_pred eeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhccCcCcCCcccccccccccceeecccchhHHHHHHH Q lcl|NC_011614. 105 AFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLEA 184 (324) Q Consensus 105 ~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~ 184 (324) .++.+..+.++++....+..+....+.++++.++++++|+.++.--.+ +....+..++++++.++.. T Consensus 81 i~~~gk~~~itD~a~~~~~~dp~~~~~~q~a~~~a~~~d~~li~~l~~-------------a~~~~~~~~t~~~~~dA~~ 147 (270) T protein:vir:95 81 VKETGKAVEVTQTAIITNVNGTLQEASRQLAMSLADKVEIDYIAELNK-------------SKQTATVSADATGILDAIE 147 (270) T ss_pred eehhhCcceecHHHHhhhccchHHHHHHHHHHHHHHHHHHHHHHHhcc-------------cccccccccCHHHHHHHHH Confidence 999999999999988888788999999999999999999988742111 1111234678999999999 Q ss_pred HhhhhccCCCEEEEcHHHHHHHHHhhc---c-CCceeeccCCCceecccceEeecCccCCCceEEEeecccEEEEEecce Q lcl|NC_011614. 185 LLEDDELEANAFISKTQNRSLLRKIVD---P-ETKERIYDRNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLI 260 (324) Q Consensus 185 ~l~~~~~~~~~~v~~~~~~~~L~~l~d---~-~g~~~~~~~~~~~l~G~pv~~~~~~~~~~~~i~~gd~~~~~~~~~~~~ 260 (324) ++.+......+++|||.++..|++-.. . .+..++..+..++++|++|++... 
..+++..++.....+.++..+++ T Consensus 148 ~lgd~~~~~~~i~vhs~~~~~Lrk~~~~~~~~~~~~~~~~G~ig~~~G~~Viv~s~-~~~~~~~~l~~~gAi~~~~~~~~ 226 (270) T protein:vir:95 148 VFNSENDEDYVLYVNPKDYNKLVKSLFKVGGNVQDRAISKGDLVEIVGVSDIVKSK-RVSENTAFLQRYGAMEIVNKKKP 226 (270) T ss_pred HhccccCCCcEEEEcHHHHHHHHhhhcccccccccchhcccccceecceeEEEeCC-CCCceeEEEEeccceeeeecCCc Confidence 999988888999999999999985321 1 233344556778999999987543 45666777766777788888888 Q ss_pred EEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeeccCCCCccccC Q lcl|NC_011614. 261 EYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADAKPSSVPGEV 324 (324) Q Consensus 261 ~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~~~~~ 324 (324) .+|.+|+.. +....+++..+|++.+.+|+.+++++.+.+.+. || T Consensus 227 ~vEtdRd~~----------------~~~d~i~~~~~y~v~~~~~skvv~~t~~~a~~~----~~ 270 (270) T protein:vir:95 227 EAYTDFDIL----------------KRTHLLSTNYHYSVNLKDETGVVKVTFKPSGSL----EM 270 (270) T ss_pred eeeeccchh----------------hcccEEEeeeEEEEEEEccceEEEEEecCCCCc----CC Confidence 899888753 334567788999999999999999987533222 34 No 118 >protein:vir:79928 Length: 393 # NCBI annotation: major head protein # Family: family:all:30335 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429616;genbank:gi:156564106;genbank:GeneID:5525693 Probab=99.86 E-value=1.9e-23 Score=145.19 Aligned_cols=306 Identities=14% Similarity=0.115 Sum_probs=212.7 Q ss_pred CchhhHH-HHH-------------HHHHhh---ccchhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhcee Q lcl|NC_011614. 1 MEQTQKL-KLN-------------LQHFAS---NNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKY 63 (324) Q Consensus 1 m~~~~~~-~~~-------------~~~~~~---~~~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~ 63 (324) ||++..+ +.. ..+|+. .++...++|-...+++.++..+||..+++-+.+..++-....+++.. 
T Consensus 28 me~~et~~e~~~~~~~~~~~e~el~E~f~Kmm~G~~p~~eV~~~e~mtt~~a~IliP~vis~v~~Eaaepl~~~~kl~qk 107 (393) T protein:vir:79 28 MERGETLAEADANKLALNEEETQILESFAKMMEGETPTNEVNLREFMATPSAQILIPRVIVGTMREAAEPLYIGTKMLQK 107 (393) T ss_pred hhhhhhhhhhhhhhhhcchhHHHHHHHHHHHhcCCCchhheehhhhhcCCCcceechhhhhhhhhhcccchhHHHHHHHH Confidence 7776532 221 223443 56666667766668888899999999999999988888888888888 Q ss_pred eecCCC-ceEEEEEeCCcceeeecccccccccc---cceeeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHH Q lcl|NC_011614. 64 EPMEGT-EKKFTFWADKPGAYWVGEGQKIETSK---ATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFY 139 (324) Q Consensus 64 ~~~~~~-~~~ip~~~~~~~a~~v~Eg~~~~~~~---~~~~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~ 139 (324) +....+ +..+|.. +.-.+.-|+||+++|+.. .++++++++..|.|..+.+|+|++.||..++.++......++++ T Consensus 108 ~~L~~Grsm~F~~~-g~~Ra~~IgEGgE~~~~sld~~T~dsv~~~~gK~G~~Ia~SqEmIsDSg~Dvin~~l~aA~RaMa 186 (393) T protein:vir:79 108 IRLKSGQSMIFPSI-GIMRAYDVAEGQEIPEDSIDWQTHESPEIRVGKSGIRLRFTDEMISDSQWDLMSMMIKQAGRAMG 186 (393) T ss_pred HhhhcCcceeccch-heeeeccccccccccccchhhhcCCceeEEechhhhhhhhHHHHhhcchHHHHHHHHHHHHHHHH Confidence 887444 3344433 355678899999999754 56889999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhccCcCcC--Cccc-cccccccc-----ceeecccchhHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhh- Q lcl|NC_011614. 140 KKFDEAGILNQGNNPF--GKSI-AQSIEKTN-----KVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIV- 210 (324) Q Consensus 140 ~~~d~a~l~g~g~~~~--~~~~-~~~~~~~~-----~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~- 210 (324) ++.|..++++..+... ..++ ......++ +.-.++++.+|+.+|..++..+.+.+++++|||-.|+.+++-. 
T Consensus 187 RkKee~a~n~fk~~ghtvfDa~st~t~ahptGr~~~~~qNGTlSleDllDm~~av~~~hyt~svi~MHPLAWnv~AKna~ 266 (393) T protein:vir:79 187 RHKEQKAYHQFRSHGHTVFDNYSTNKLAHTTGLDKNGVQNDTFSAEDFLDLIIAVMANEYTPSDLMMHPLAWTVFAKNEL 266 (393) T ss_pred hhhHHHHHhhhhcccceeeeccccCccceeecCCccccccccccHHHHHHHHHHHhcccCCcceEEEcCchhhhhhhhhh Confidence 9999999998765433 1111 11111111 2456789999999999999999999999999999999987521 Q ss_pred --c----cCCceeeccCC------Cceec-----ccceEeecCcc----CCCceEEEeecccEEEE-EecceEEEEeecc Q lcl|NC_011614. 211 --D----PETKERIYDRN------SDSLD-----GLPVVNLKSSN----LKRGELITGDFDKLIYG-IPQLIEYKIDETA 268 (324) Q Consensus 211 --d----~~g~~~~~~~~------~~~l~-----G~pv~~~~~~~----~~~~~i~~gd~~~~~~~-~~~~~~i~~~~~~ 268 (324) . +-|++.-..-. +..|. .+.|++++-.+ ..+.+++..|.++.-+- .+.++..+..++. T Consensus 267 me~~~~na~gN~~~~~~~ts~algp~~i~~~~~~nlnv~~sPfvp~d~k~~rFd~~~Vd~NnvgvlLV~D~i~tdq~ddk 346 (393) T protein:vir:79 267 MGSLQANPYGNYPAKGAPSSMALGPDSIQGRLPFNFNVNLSPFIPLDKKSRRFDVYAVDRNNVGVLLVRDDLKTDQWDEK 346 (393) T ss_pred hcceeeccccccCccccchhhhhchhhhccccccceeEEEecccccccccceeeEEEeecCCceEEEEecCcceeccccc Confidence 1 11211110000 11222 25566665333 24556777777775432 3445555544432 Q ss_pred cccccccccccchhhhhcCcEEEEEEEEeccEEec-ccceEEEEeeccCCCCccccC Q lcl|NC_011614. 
269 QLSTVKNEDGTPVNLFEQDMVALRATMHVALHIAD-DKAFAKLVPADAKPSSVPGEV 324 (324) Q Consensus 269 ~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~-~~a~~~l~~~~~~~~~~~~~~ 324 (324) .+|.+.+....|+|+.|++ .+|++..+..+-..+ .|.-| T Consensus 347 ----------------~rdiq~iKl~ERYG~gvLn~gkaiavakNI~~~k~-y~~P~ 386 (393) T protein:vir:79 347 ----------------ARGLQNIKMIERYGIGILNEGKAIAVAKNISMDKS-YAEPM 386 (393) T ss_pred ----------------cccceeeeeeeeeceeeeeCCceEEEEecceeecc-cccch Confidence 4678889999999999998 677777765443332 22333 No 119 >protein:vir:94933 Length: 330 # NCBI annotation: putative phage structural protein # Family: family:all:1120 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239278;genbank:gi:66392060;genbank:GeneID:5076578 Probab=99.85 E-value=7.6e-23 Score=141.85 Aligned_cols=293 Identities=12% Similarity=0.110 Sum_probs=213.8 Q ss_pred Cchhh------HHHHHHHHHhhccchhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCceEEE Q lcl|NC_011614. 1 MEQTQ------KLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFT 74 (324) Q Consensus 1 m~~~~------~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip 74 (324) |-|-- .|+....+|.. +. +...+-...+.+.|......|++.+.+.+.+++++...++.++.+.++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~p~-------l~-m~alTLaea~~l~~d~~~~~VIE~l~~~s~iL~~lpf~~ve~~~~~~~ 72 (330) T protein:vir:94 1 MVRICTPPLRGRWRTLTHQFPE-------LK-MPTVTLAESAKLSQDHLVSGLIETIVEVNPLYEMMPFTEIEGNALAYN 72 (330) T ss_pred CceecCCccccceeehhccccc-------cc-hhhhhhhHHhhcCchhhHHHHHHhhhccchHHhhcccccccCCcceee Confidence 32211 01111111111 11 112233346778899999999999999999999999888888889999 Q ss_pred EEeCCcceeeecccccccccc-cceeeEEeeeeeEEEeehhHHHHHh--cChhHHHHHHHHHHHHHHHHHHHHHHHhccC Q lcl|NC_011614. 75 FWADKPGAYWVGEGQKIETSK-ATWVNATMRAFKLGVILPVTKEFLN--YTYSQFFEEMKPMIAEAFYKKFDEAGILNQG 151 (324) Q Consensus 75 ~~~~~~~a~~v~Eg~~~~~~~-~~~~~v~~~~~k~~~~v~iS~ell~--~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g 151 (324) +....+.+.|+..++.++++. .+|.+++...+.+++.+.|.+.+.+ .+..+...+..+...+++..++++.+|+|+. 
T Consensus 73 r~~~lp~a~~r~~n~~~~~~~~~Tf~q~t~~l~~l~~~~~Vd~~iadl~g~~~d~~~~q~~~~ieal~~~~e~~linGDs 152 (330) T protein:vir:94 73 RENVLGDVQFLAVGGTITAKNPATFTKVTSELTTLIGDAEVNGLIQATRSDFMDQTSVQVASKAKSIGRQYQASMITGDG 152 (330) T ss_pred eeecCCcceeeeccccccccCcceeeeeeechhhhhhhHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHHHHhhccCC Confidence 999899999999998888765 5799999999999999999999865 3456888888999999999999999999987 Q ss_pred cCcCCcccccccccccce----eecccchhHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCceeecc------- Q lcl|NC_011614. 152 NNPFGKSIAQSIEKTNKV----IKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD------- 220 (324) Q Consensus 152 ~~~~~~~~~~~~~~~~~~----~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~g~~~~~~------- 220 (324) ++....|+.......+.. ..+.++.|++-+++..+......++.|+||+.+..+|+.+.+..|++...+ T Consensus 153 ~~~~F~GL~~~~~~~q~i~tg~~gg~~T~d~LDeLl~~v~~~~g~~~~~l~n~a~~r~I~a~~R~~~~~~v~~~~~~~~G 232 (330) T protein:vir:94 153 TGNSFQGMMGLVAASQTISAGANGGTLTFELLDQLLDLVKDKDGQVDYLMSSFAMRRKYFSLLRALGGAAIGEVMTLPSG 232 (330) T ss_pred CCccccchhhcCCcccEEecCCCCCCCCHHHHHHHHHHhcCCCCCCcEEEechhHHHHHHHHHHhccCCCCCCcccccCC Confidence 766665666655444433 336788999999999887766678999999999999999988777655322 Q ss_pred CCCceecccceEeecCccC--------CCceEEEeecc-----cEEEEEe----cceEEEEeecccccccccccccchhh Q lcl|NC_011614. 221 RNSDSLDGLPVVNLKSSNL--------KRGELITGDFD-----KLIYGIP----QLIEYKIDETAQLSTVKNEDGTPVNL 283 (324) Q Consensus 221 ~~~~~l~G~pv~~~~~~~~--------~~~~i~~gd~~-----~~~~~~~----~~~~i~~~~~~~~~~~~~~~~~~~~~ 283 (324) .....+.|.|++.++-.+. +...||+..|. +-+.|.. .+++++..-. . T Consensus 233 ~~v~~~~GvPi~~~d~ip~~~~~~~~~~ttsIyav~~G~~~~~qgV~Gl~~~g~~glsVr~~G~---------------~ 297 (330) T protein:vir:94 233 RQIPTYRGVPWFVNDFIPSNMTQGTATNATAIFAGTFDDGSNKYGIAGLTARGSAGLRVQNVGA---------------K 297 (330) T ss_pred CEEeeeCCeEEEecccccCCCCcccCCCceeEEEEeecccccccceEeecCCCCCcceeeeCCC---------------c Confidence 1123578999887653333 23457766654 2445553 2344433211 0 Q ss_pred hhcCcEEEEEEEEeccEEecccceEEEEeeccC Q lcl|NC_011614. 
284 FEQDMVALRATMHVALHIADDKAFAKLVPADAK 316 (324) Q Consensus 284 f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~~~ 316 (324) =.++...+|+++||+..+.+|+|+++|++.... T Consensus 298 ~~k~v~~~~v~~y~~~av~~~~a~~~L~~V~~g 330 (330) T protein:vir:94 298 ENADETITRVKMYCGFANFSQLGLAAIKGLIPG 330 (330) T ss_pred cccceeeEEEEEeeeeEEechhheeeeccccCC Confidence 135678899999999999999999999987655 No 120 >protein:vir:739 Length: 231 # NCBI annotation: major structural protein 4 # Family: family:all:522 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108716;genbank:gi:13487838;genbank:GeneID:920884 Probab=99.78 E-value=9.6e-21 Score=130.32 Aligned_cols=223 Identities=13% Similarity=0.098 Sum_probs=175.6 Q ss_pred ceeeecCCCceEEEEEeCCcceeeecccccccccccceeeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHH Q lcl|NC_011614. 61 GKYEPMEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYK 140 (324) Q Consensus 61 ~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~ 140 (324) -+-+++ |..+.+|.+ ..+|..++||++++..+.++++.+.+.++++..++|+++....+..++.....++++.+|++ T Consensus 1 ~~~~~~-Gdtit~P~~--iGda~~v~eG~~i~~~~l~~t~~~atIk~~gk~~~itD~a~l~~~gDp~~ea~~Q~~~~iA~ 77 (231) T protein:vir:73 1 ENGINL-ANLCEYPND--IGDAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLAN 77 (231) T ss_pred CccccC-CceEEeccc--ccchhhhcCCCcCChhhccccceeeeEeeeccceeeeHHHHhhccCchHHHHHHHHHHHHHH Confidence 122333 456889987 44778899999999999999999999999999999999999888889999999999999999 Q ss_pred HHHHHHHhccCcCcCCcccccccccccceeecccchhHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhc------cCC Q lcl|NC_011614. 
141 KFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVD------PET 214 (324) Q Consensus 141 ~~d~a~l~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d------~~g 214 (324) ++|..++.-..+ .....+..++++.+.++..++.+....+.+++|||..+..|++..+ ..| T Consensus 78 kvD~di~~~~~~-------------a~l~~~~~~t~d~i~~A~~~fgde~~~~~vivv~p~~~~~Lrk~~~~~~~~~~~g 144 (231) T protein:vir:73 78 KVDDDLLKAAKT-------------TSQTVSTKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDANAKNIGSEVG 144 (231) T ss_pred hhhHHHHHhhcc-------------ccccccccccHHHHHHHHHHhccccccceEEEEcchHHHhhhhccchhhhhhhhc Confidence 999998842111 1112234689999999999999988888889999999999987443 335 Q ss_pred ceeeccCCCceecccceEeecCccCCCceE--EEeecccEEEEEecceEEEEeecccccccccccccchhhhhcCcEEEE Q lcl|NC_011614. 215 KERIYDRNSDSLDGLPVVNLKSSNLKRGEL--ITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALR 292 (324) Q Consensus 215 ~~~~~~~~~~~l~G~pv~~~~~~~~~~~~i--~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r 292 (324) ..++..+..++++|++|+.++..+.+.... +..-...+.+...+++.+|.+|+.. +....++ T Consensus 145 ~~i~~~G~iG~i~G~~Vi~S~~~~~~~~~~~~~i~~~gAl~~~~k~~~~vEtdRd~~----------------~k~~~i~ 208 (231) T protein:vir:73 145 ANALINGTYADVLGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRDIV----------------TKTTVIT 208 (231) T ss_pred cceeeecccceEcceEEEEcCCCCCCceeeeeEEeeccceeeeecccceeecccccc----------------ccccEEE Confidence 667778888999999999987555432211 2222456678888888999887753 4456688 Q ss_pred EEEEeccEEecccceEEEEeecc Q lcl|NC_011614. 293 ATMHVALHIADDKAFAKLVPADA 315 (324) Q Consensus 293 ~~~r~d~~v~~~~a~~~l~~~~~ 315 (324) +..+|+..+.+|+.+++++.+-. 
T Consensus 209 ~~~~y~v~l~~~~~vv~~t~~g~ 231 (231) T protein:vir:73 209 ADEHYAAYLYDLTKVVNITFTGV 231 (231) T ss_pred EeEEEEEEEEcCccEEEEEeecC Confidence 89999999999999999987755 No 121 >protein:vir:97255 Length: 310 # NCBI annotation: hypothetical protein ORF017 # Family: family:all:1120 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294525;genbank:gi:149408246;genbank:GeneID:5237120 Probab=99.71 E-value=2.7e-18 Score=116.93 Aligned_cols=274 Identities=11% Similarity=0.074 Sum_probs=191.7 Q ss_pred hccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCceEEEEEeCCcceeeeccc-----ccccccccce Q lcl|NC_011614. 24 FNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKPGAYWVGEG-----QKIETSKATW 98 (324) Q Consensus 24 ~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~~a~~v~Eg-----~~~~~~~~~~ 98 (324) +.+ .+-. ..+-+.+......||+.+.+.+.++++....++.++.+.+.+...-+.+.+.+.+ +..+++..+| T Consensus 1 mpa--ltLa-ea~k~~~d~l~~~ViE~~~~~s~lL~~LpF~~veg~~~~ynR~~~~~~~~~~~v~~~~~~~g~~~~~~t~ 77 (310) T protein:vir:97 1 MAS--VTLA-ESAKLAQDELVAGVIENIITVNRMFDVLPFDSIEGNSLAYNRENVLGDVIMAGVGTTFSGAGAGKAAATF 77 (310) T ss_pred Ccc--cchH-HHhhcCcchHHHHHHHHHhccchHHHhCCcccccCCcceeeEeeccCCcccccccccccCCCcccccccc Confidence 111 1122 2345778889999999999999999999999888888888887766555544332 3446788999 Q ss_pred eeEEeeeeeEEEeehhHHHHHhc--C-hhHHHHHHHHHHHHHHHHHHHHHHHhccCcCcCCcccccccccccce----ee Q lcl|NC_011614. 99 VNATMRAFKLGVILPVTKEFLNY--T-YSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKV----IK 171 (324) Q Consensus 99 ~~v~~~~~k~~~~v~iS~ell~~--s-~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~~~~~~~~~~~~~~~~~----~~ 171 (324) ++++...+-+++.+.|-+.+.+- + ..+...+-.++..+++.++++..+|+|+.++....|+...+...+.. .. 
T Consensus 78 ~~~~~~L~i~~g~~~Vd~~i~dl~~~~~~dq~~~Ql~~~iea~~~~~e~~lINGD~a~n~F~GL~~~~~~~q~i~~~~~g 157 (310) T protein:vir:97 78 TKVNSNLTTIMGDAEVNGLIQATRSGDGNDQTAVQIASKAKSAGRKYQDQLINGNGAGNEFAGLIQLCASGQKATTGATG 157 (310) T ss_pred ceeeeeeeeeeehhhhhhHHHhhhcCChHHHHHHHHHHHHHHHHHHHHHHhhccccCCCcccchhhcCCccceeecCCCC Confidence 99999999999999998765442 3 33555555677789999999999999998766555666655444433 23 Q ss_pred cccchhHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhcc-CCceeec------cCCCceecccceEeecCccC----- Q lcl|NC_011614. 172 GDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDP-ETKERIY------DRNSDSLDGLPVVNLKSSNL----- 239 (324) Q Consensus 172 ~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~-~g~~~~~------~~~~~~l~G~pv~~~~~~~~----- 239 (324) +.++.|++-.++..+......++.|+|||+++.+|+.+... +++.++. .....++.|.|++.++..+. T Consensus 158 g~~t~d~LDeLl~~v~~~~g~p~~~l~~~~~~r~i~A~~R~~~~~g~~~~~~~~~G~~v~~~~GiPi~~~d~ip~~~~~~ 237 (310) T protein:vir:97 158 SAISFAILDELMDLVVDKDGQVDYLTMHARTLRSYKALLRALGGASINEVVELPSGAEVPAYSGTPIFRNDYIPTNQTKG 237 (310) T ss_pred CCCCHHHHHHHHHHHhcCCCCCCEEEecHHHHHHHHHHHHHhcCCCCCCccccCCCCEEeeeCCeEEEEeCccCCCcccc Confidence 56789999999998877777889999999997777655433 2222211 12224688999988764433 Q ss_pred ---CCceEEEeeccc-----EEEEEe----cceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccce Q lcl|NC_011614. 240 ---KRGELITGDFDK-----LIYGIP----QLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAF 307 (324) Q Consensus 240 ---~~~~i~~gd~~~-----~~~~~~----~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~ 307 (324) +...||+..|.. -++|.. .+++++...+ .=.++...+|+++||+..+..|+|+ T Consensus 238 ~~~gtTsIya~r~Ge~~~~~Gv~Gl~~~~~~glsVr~~G~---------------~~~~~v~~~~V~~Y~~~av~~~~A~ 302 (310) T protein:vir:97 238 GTTGCTTIFAGTLDDGSRTHGIAGLTATQAAGIQVVDVGE---------------SEDSDEHIWRVKWYCGLALFSEKGL 302 (310) T ss_pred ccCCceeEEEEeeCccccccceeccccCCccceeEEeCCc---------------ccCCcceeEEEEEeeeEEEecccce Confidence 244567666653 234432 2233332211 0145677899999999999999999 Q ss_pred EEEEeecc Q lcl|NC_011614. 
308 AKLVPADA 315 (324) Q Consensus 308 ~~l~~~~~ 315 (324) ++|++..- T Consensus 303 a~L~~V~~ 310 (310) T protein:vir:97 303 ACADGITN 310 (310) T ss_pred eeeccccC Confidence 99998776 No 122 >protein:vir:99424 Length: 360 # NCBI annotation: hypothetical protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919080;genbank:gi:119757038;genbank:GeneID:4606077 Probab=99.68 E-value=1.6e-17 Score=112.61 Aligned_cols=298 Identities=13% Similarity=0.099 Sum_probs=190.9 Q ss_pred CchhhHHHHHHHHHhhccchhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCceEEEEEeCCc Q lcl|NC_011614. 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKP 80 (324) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~ 80 (324) |.-+-.+...+ ++...+-.. ++ .+.++-+++++++++..++++.+++.++++++++++++.+....|+....+. T Consensus 1 ~~~~~~~~~~~----n~~~~~i~k-~~-it~~~l~~g~L~p~~a~~Fl~~v~~~t~iL~~~r~~~~~s~~~ei~kig~G~ 74 (360) T protein:vir:99 1 MSSNSTIDSVR----NQNMNSLSQ-KD-IGLAELDGFQLPVDVTEEFLERMQKGVQILGMADTMTLARLEMEVPQFGVPR 74 (360) T ss_pred CcchhHHHHHh----hhHHHHHHh-hh-ccccccCceeecHHHHHHHHHHHhhccchhhhcceeecccccccccccccce Confidence 65433332222 222221111 22 3334446899999999999999999999999999999999988888765443 Q ss_pred ceee-ecccccc-cccccceeeEEe-eeeeEEEeehhHHHHHhcCh----hHHHHHHHHHHHHHHHHHHHHHHHhccCcC Q lcl|NC_011614. 81 GAYW-VGEGQKI-ETSKATWVNATM-RAFKLGVILPVTKEFLNYTY----SQFFEEMKPMIAEAFYKKFDEAGILNQGNN 153 (324) Q Consensus 81 ~a~~-v~Eg~~~-~~~~~~~~~v~~-~~~k~~~~v~iS~ell~~s~----~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~ 153 (324) ...- -.|+... ...+.+...+.+ ..+++-....+..+.+++.. ..+++.|+++|++++++.++.-.++|+... 
T Consensus 75 r~~r~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~~i~~~~~~~n~~~~~~~f~~~i~~~~ae~~~~Dle~l~~~g~~ds 154 (360) T protein:vir:99 75 LSGHTRDEEGSRTENSEAESGSVKFNATDKSYYILVEPKRDALKNTHYGPDQFGDYIVDQFIERYGNDLGLMGIRAGASS 154 (360) T ss_pred eeccccccCCCCCcCCcCccccCccccccceeeEeechHHHHHhhhhcccchhHHHHHHHHHHHHHHHHHHHHhhccchh Confidence 2211 1232222 223444555555 34566666777777777653 367899999999999999999999997543 Q ss_pred c--------CC-----cccccccc-cc---------cce--------e----------ec-----ccchhHHHHHHHHhh Q lcl|NC_011614. 154 P--------FG-----KSIAQSIE-KT---------NKV--------I----------KG-----DFTQDNIIDLEALLE 187 (324) Q Consensus 154 ~--------~~-----~~~~~~~~-~~---------~~~--------~----------~~-----~~~~~~i~~~~~~l~ 187 (324) . .+ .|++..+. .+ ..+ . .+ ..+-..+.+++..|+ T Consensus 155 ~d~~~~~~~d~fl~~~dGwlKka~~~~~~id~a~d~t~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~lf~~~~~~Lp 234 (360) T protein:vir:99 155 GNLQSIGGAAELDNTFKGWIARAEGDAQSVDDAGDSTRIGLEDTATADADSMPSIANTDGSGNPQPVDTSLFNETIQTLD 234 (360) T ss_pred cccccCcccchhhhhhHHHHHHhhcccchhhccccccccccccccccccccchhhhccccccccccchHHHHHHHHHhcc Confidence 2 10 11111110 00 000 0 00 012334678999999 Q ss_pred hhccCC----CEEEEcHHHHHHHHH-hhc---cCCceeeccCCCceecccceEeecCccCCCceEEEeecccEEEEEecc Q lcl|NC_011614. 188 DDELEA----NAFISKTQNRSLLRK-IVD---PETKERIYDRNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQL 259 (324) Q Consensus 188 ~~~~~~----~~~v~~~~~~~~L~~-l~d---~~g~~~~~~~~~~~l~G~pv~~~~~~~~~~~~i~~gd~~~~~~~~~~~ 259 (324) ..|++. -.|+||+..+...+. |.+ +-|...+.....-.+.|+|++..+ ..+++.+++.++.++++|.+.+ T Consensus 235 ~kyr~~~~~~~~~~~s~~~~~~yr~~L~~R~t~LGd~~l~g~~~~~~~Gipi~~v~--~~pd~~~mlT~p~NLi~g~~~~ 312 (360) T protein:vir:99 235 SRYRESDAYSPVLMTSPNQVQSYTMSLTEREDPLGSAVIFGDSDITPFSYDLVGVN--GFPDEYMMFTDPNNLAFGLYEE 312 (360) T ss_pred hhhhcCcccceEEEccCchHHHHHHHHhccCcccchhheecccccccceeeeEEcC--CCCCCceEEeccCceeEEeeee Confidence 998764 279999998665543 333 234445555555567899998766 4567789999999999999999 Q ss_pred eEEEEeecccccccccccccchhhhhcC-cEEEEEEEEeccEEecccceEEEEeeccCCC Q lcl|NC_011614. 
260 IEYKIDETAQLSTVKNEDGTPVNLFEQD-MVALRATMHVALHIADDKAFAKLVPADAKPS 318 (324) Q Consensus 260 ~~i~~~~~~~~~~~~~~~~~~~~~f~~~-~v~~r~~~r~d~~v~~~~a~~~l~~~~~~~~ 318 (324) ++++.+.+... +-++. .+..-....+|+.+.+++|+|++++.....+ T Consensus 313 iri~~~~e~~~------------~~~~~~~~~~~~~~~~D~~iee~~Av~~vt~~~~~~~ 360 (360) T protein:vir:99 313 MELDQSTDTDK------------VHEQRLHSRNWLEGQFDFQIKEQQAGVLVTDLETPTA 360 (360) T ss_pred eEEeecccchh------------hhhhceeeeEEEEEEeeEEEEecccEEEEecCCCCCC Confidence 99987665321 01111 1223345779999999999999998755444 No 123 >protein:vir:7990 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817344;genbank:gi:29565772;genbank:GeneID:1258978 Probab=99.58 E-value=3.7e-16 Score=105.16 Aligned_cols=258 Identities=9% Similarity=-0.030 Sum_probs=167.9 Q ss_pred ccCCCcceechhhhHHHHHHHHhhcchhhhcee----eecCCCceEEEEEeCCcceeeecccccccccccceeeEEeeee Q lcl|NC_011614. 31 MHEKKDGTLLNDFTTPILQEVMENSKIMQLGKY----EPMEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAF 106 (324) Q Consensus 31 ~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~----~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~ 106 (324) ++. -.++|+.++.++++.+++.+.+.+++.. +...+.++.||++.....+....++..++..+.+.++++++.. T Consensus 1 MA~--~~~~pei~~~~v~~~~~~~lv~~~l~~~~~~~~~~~GdTv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~tid 78 (273) T protein:vir:79 1 MAF--NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLLID 78 (273) T ss_pred Ccc--hhhhHHHHHHHHHHHHHhhccchhhhhccccccccCCcEEEEeecCcccccccccCCCccCccccccceEEEEEe Confidence 111 2368999999999999999998888643 3334667899998766666678888888888888888888886 Q ss_pred eE-EEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhccCcCcCCcccccccccccceeecccchhHHHHHHHH Q lcl|NC_011614. 107 KL-GVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLEAL 185 (324) Q Consensus 107 k~-~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ 185 (324) +. ..-+.|++.-...+..++.+ +.+++.+++++++|+.++.--..... 
........+....++.+.++..+ T Consensus 79 ~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~vD~~i~~~~~~a~~-------~~~~~~~~~~~~~~~~i~~a~~~ 150 (273) T protein:vir:79 79 QEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADMLVDNGT-------ALTGSAPSDADDAFDLIASALKE 150 (273) T ss_pred eecccceeeccHHHHhhcccHHH-HHHHHHHHHHHHHHHHHHHHHhhccc-------ccccccccchhhHHHHHHHHHHH Confidence 63 55567777444555567876 66778899999999877632111000 00111111223457788999999 Q ss_pred hhhhcc--CCCEEEEcHHHHHHHHHhh----c--cCC-ceeeccCCCceecccceEeecCccCCCc-eEEEeecccEEEE Q lcl|NC_011614. 186 LEDDEL--EANAFISKTQNRSLLRKIV----D--PET-KERIYDRNSDSLDGLPVVNLKSSNLKRG-ELITGDFDKLIYG 255 (324) Q Consensus 186 l~~~~~--~~~~~v~~~~~~~~L~~l~----d--~~g-~~~~~~~~~~~l~G~pv~~~~~~~~~~~-~i~~gd~~~~~~~ 255 (324) |..++. ..-.++++|..+..|.+.. + ..| ...+..+..++++|++|+.++..+.... ..+.+-.+.+... T Consensus 151 ld~~~vP~~~R~lvv~p~~~~~Ll~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~i~~s~~lp~~~~~~~~a~~~~A~~~a 230 (273) T protein:vir:79 151 LTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYV 230 (273) T ss_pred hhhccCCccCcEEEECHHHHHHHhhchhhhhhhhhcccccceeeeEeeEEeceEEEecccccccCceEEEEEeccceeee Confidence 988775 2346899999998885432 1 122 2335566778999999998766554333 2334433333332 Q ss_pred EecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeecc Q lcl|NC_011614. 256 IPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADA 315 (324) Q Consensus 256 ~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~~ 315 (324) .+ ...++..+.. 
..| ...+++..++|+.+++|++++.++...+ T Consensus 231 ~~-~~~~e~~r~~-------------~~~---~~~v~~~~~yg~~v~~p~~vv~~~~~g~ 273 (273) T protein:vir:79 231 SQ-IDTVEALRDQ-------------DSF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred ee-hhhhhcccCc-------------ccc---eeeeeeeeeeeeEEecCceEEEEeccCC Confidence 21 1122222221 123 4568889999999999999999887655 No 124 >protein:vir:5974 Length: 324 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690674;genbank:geneid:6329212;genbank:gi:22855068;goa:Q38582;uniprot:Q38582;genbank:GeneID:955303 Probab=99.55 E-value=6.5e-16 Score=103.86 Aligned_cols=277 Identities=11% Similarity=0.036 Sum_probs=178.8 Q ss_pred ccccCCCcceechhhhHHHHHHHHhhcchhh---------hceee--ecCCCceEEEEEeC-Ccceeeeccccccccccc Q lcl|NC_011614. 29 VMMHEKKDGTLLNDFTTPILQEVMENSKIMQ---------LGKYE--PMEGTEKKFTFWAD-KPGAYWVGEGQKIETSKA 96 (324) Q Consensus 29 ~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~---------l~~~~--~~~~~~~~ip~~~~-~~~a~~v~Eg~~~~~~~~ 96 (324) ..+|.-...++|+.+...+.+...+.+.+.+ +.... ..+|..+.+|.|.. +.++.-+.|+..++..+. T Consensus 1 MA~T~lsd~i~peVf~~yv~~~~~~~~~l~qSg~i~~~a~i~~~l~~~~~G~~i~~P~~~~l~Gd~~~v~~~~~i~~~~l 80 (324) T protein:vir:59 1 MAYTKISDVIVPELFNPYVINTTTQLSAFFQSGIAATDDELNALAKKAGGGSTLNMPYWNDLDGDSQVLNDTDDLVPQKI 80 (324) T ss_pred CCceeeeceechhHHHHHHHhhhHHHHHHhhcccccccHHHHHHhhccCCCCEEEecccccCCCcccccCCCcccchhhc Confidence 1233334556788777777777777766633 12222 23566789999975 367888899999999999 Q ss_pred ceeeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhccCcCcCCcccccccccccceeecccch Q lcl|NC_011614. 
97 TWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQ 176 (324) Q Consensus 97 ~~~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~~~~~~~~~~~~~~~~~~~~~~~~ 176 (324) +.++-....++.+..+.++++...-+..++...+.+++++..++..++.+|.-...-.............+......+++ T Consensus 81 ~t~~~~a~i~~~~k~~~~tD~a~~~sg~dp~~~i~~q~a~~~~~~~~~~lia~l~g~~~~~~~~~~~~dvsa~~~~~~s~ 160 (324) T protein:vir:59 81 NAGQDKAVLILRGNAWSSHDLAATLSGSDPMQAIGSRVAAYWAREMQKIVFAELAGVFSNDDMKDNKLDISGTADGIYSA 160 (324) T ss_pred ccceeeEEEEeecCceeehhhhhhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccceeeeeccccceecH Confidence 98888888888999999999888888889999999999999999999877643110000000000111112222345788 Q ss_pred hHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCC-ceeeccCCCceecccceEeecCccCC-------CceEEEee Q lcl|NC_011614. 177 DNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPET-KERIYDRNSDSLDGLPVVNLKSSNLK-------RGELITGD 248 (324) Q Consensus 177 ~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~g-~~~~~~~~~~~l~G~pv~~~~~~~~~-------~~~i~~gd 248 (324) +.+.++..++.+....-.+|+||+.++..|++..--+. ++--.....+.++|++|+++++.+.. +...|+-- T Consensus 161 ~~l~~A~~~~GD~~~~~~~ivmhS~v~~~L~~~~li~~~~~s~~~~~i~~~~G~~VivdD~~p~~~~~~~~~~y~s~l~~ 240 (324) T protein:vir:59 161 ETFVDASYKLGDHESLLTAIGMHSATMASAVKQDLIEFVKDSQSGIRFPTYMNKRVIVDDSMPVETLEDGTKVFTSYLFG 240 (324) T ss_pred HHHHHHHHHhCCcccCcEEEEEchHHHHHHHHhhhhhhccccccCceeeeecccEEEEeCCCCccccCCCCceEEEEEEe Confidence 99999999999988778899999999999986431111 01111223467899999998755431 22233222 Q ss_pred cccEEEEE-ecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeeccCCCCccccC Q lcl|NC_011614. 249 FDKLIYGI-PQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADAKPSSVPGEV 324 (324) Q Consensus 249 ~~~~~~~~-~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~~~~~ 324 (324) ...+.++. ...+.+|.+|+.. .+...+..+.++ +++|.++..-+.+.+...+|.+|. 
T Consensus 241 ~GAi~~~~~~~~v~vE~dRd~~----------------~g~~~l~~r~~~---~~~p~G~s~~~~~~~~~sPt~~~L 298 (324) T protein:vir:59 241 AGALGYAEGQPEVPTETARNAL----------------GSQDILINRKHF---VLHPRGVKFTENAMAGTTPTDEEL 298 (324) T ss_pred cCeEEEeecCCCcceecccCcc----------------ccceEEEEeeEE---EeEeeeEEecccccCCCCCChhhh Confidence 23344444 3345666666542 233444555554 467777777665556677777777 No 125 >protein:vir:105822 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655767;genbank:gi:109522090;genbank:GeneID:4157630 Probab=99.55 E-value=1.1e-15 Score=102.53 Aligned_cols=258 Identities=10% Similarity=-0.016 Sum_probs=166.0 Q ss_pred ccCCCcceechhhhHHHHHHHHhhcchhhhceee----ecCCCceEEEEEeCCcceeeecccccccccccceeeEEeeee Q lcl|NC_011614. 31 MHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYE----PMEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAF 106 (324) Q Consensus 31 ~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~----~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~ 106 (324) ++- -.++|+.|+.++++.+++.+++.+++..- ...+.++.||+......+....++..++..+.+.++++++.. T Consensus 1 MA~--~~~~pe~~~~~v~~~~~~~lv~~~l~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~tid 78 (273) T protein:vir:10 1 MAF--NNFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLLID 78 (273) T ss_pred Ccc--hhhhHHHHHHHHHHHHHhhhccchhhccccccccccCceEEEeecccccccccccCCCccCccccccceEEEEEe Confidence 211 24689999999999999999988886431 223567889997765556667777777777777777777775 Q ss_pred eE-EEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhccCcCcCCcccccccccccceeecccchhHHHHHHHH Q lcl|NC_011614. 107 KL-GVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLEAL 185 (324) Q Consensus 107 k~-~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ 185 (324) +. ...+.|++.-...+..++++ +.+++.++++.++|..++.--..... 
........+....++.+.++..+ T Consensus 79 ~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~alA~~vD~~i~~~~~~a~~-------~~~~~~~~~~~~~~~~i~~a~~~ 150 (273) T protein:vir:10 79 QEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADMLVDNGT-------ALTGSAPTDADDAFDLIAKALKE 150 (273) T ss_pred eeeecceEeecHHHhhhhccHHH-HHHHHHHHHHHHHHHHHHHHHhcccc-------ccccccccchhHHHHHHHHHHHH Confidence 53 55556777444455567877 66778999999999887742111000 00111112223457889999999 Q ss_pred hhhhccC--CCEEEEcHHHHHHHHHh----hc--cCC-ceeeccCCCceecccceEeecCccCCC-ceEEEeecccEEEE Q lcl|NC_011614. 186 LEDDELE--ANAFISKTQNRSLLRKI----VD--PET-KERIYDRNSDSLDGLPVVNLKSSNLKR-GELITGDFDKLIYG 255 (324) Q Consensus 186 l~~~~~~--~~~~v~~~~~~~~L~~l----~d--~~g-~~~~~~~~~~~l~G~pv~~~~~~~~~~-~~i~~gd~~~~~~~ 255 (324) |..++.. .-.++++|..+..|.+. ++ ..| ...+..+..+++.|++|+.++..+.+. ...+.+..+.+... T Consensus 151 ld~~~vP~~~R~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~i~G~~v~~s~~lp~~~~~~~~~~~~~A~~~a 230 (273) T protein:vir:10 151 LTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYV 230 (273) T ss_pred hhhcCCCcCCCEEEECHHHHHHHhcchhhhhhhhccccccceeeeeeeEEeceEEEEecccccCCccEEEEEeccceeee Confidence 9887753 34689999999988642 22 112 234556677899999999876655433 23445444444333 Q ss_pred EecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeecc Q lcl|NC_011614. 256 IPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADA 315 (324) Q Consensus 256 ~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~~ 315 (324) .. ...++..+.. 
+.| ...+++...+|+++++|++++.++...+ T Consensus 231 ~q-~~~~e~~r~~-------------~~~---~~~v~~~~~yg~~v~~~~~~~~l~~~g~ 273 (273) T protein:vir:10 231 SQ-IDTVEALRDQ-------------DSF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred ee-eehhhcccCC-------------Ccc---eeeeeeeeeeeeeEeccceEEEEeccCC Confidence 21 1122222211 123 4558889999999999999999887655 No 126 >protein:vir:102605 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_655002;genbank:gi:109392192;genbank:GeneID:4157227 Probab=99.55 E-value=1.1e-15 Score=102.53 Aligned_cols=258 Identities=10% Similarity=-0.016 Sum_probs=166.0 Q ss_pred ccCCCcceechhhhHHHHHHHHhhcchhhhceee----ecCCCceEEEEEeCCcceeeecccccccccccceeeEEeeee Q lcl|NC_011614. 31 MHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYE----PMEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAF 106 (324) Q Consensus 31 ~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~----~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~ 106 (324) ++- -.++|+.|+.++++.+++.+++.+++..- ...+.++.||+......+....++..++..+.+.++++++.. T Consensus 1 MA~--~~~~pe~~~~~v~~~~~~~lv~~~l~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~tid 78 (273) T protein:vir:10 1 MAF--NNFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLLID 78 (273) T ss_pred Ccc--hhhhHHHHHHHHHHHHHhhhccchhhccccccccccCceEEEeecccccccccccCCCccCccccccceEEEEEe Confidence 211 24689999999999999999988886431 223567889997765556667777777777777777777775 Q ss_pred eE-EEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhccCcCcCCcccccccccccceeecccchhHHHHHHHH Q lcl|NC_011614. 107 KL-GVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLEAL 185 (324) Q Consensus 107 k~-~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ 185 (324) +. ...+.|++.-...+..++++ +.+++.++++.++|..++.--..... 
........+....++.+.++..+ T Consensus 79 ~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~alA~~vD~~i~~~~~~a~~-------~~~~~~~~~~~~~~~~i~~a~~~ 150 (273) T protein:vir:10 79 QEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADMLVDNGT-------ALTGSAPTDADDAFDLIAKALKE 150 (273) T ss_pred eeeecceEeecHHHhhhhccHHH-HHHHHHHHHHHHHHHHHHHHHhcccc-------ccccccccchhHHHHHHHHHHHH Confidence 53 55556777444455567877 66778999999999887742111000 00111112223457889999999 Q ss_pred hhhhccC--CCEEEEcHHHHHHHHHh----hc--cCC-ceeeccCCCceecccceEeecCccCCC-ceEEEeecccEEEE Q lcl|NC_011614. 186 LEDDELE--ANAFISKTQNRSLLRKI----VD--PET-KERIYDRNSDSLDGLPVVNLKSSNLKR-GELITGDFDKLIYG 255 (324) Q Consensus 186 l~~~~~~--~~~~v~~~~~~~~L~~l----~d--~~g-~~~~~~~~~~~l~G~pv~~~~~~~~~~-~~i~~gd~~~~~~~ 255 (324) |..++.. .-.++++|..+..|.+. ++ ..| ...+..+..+++.|++|+.++..+.+. ...+.+..+.+... T Consensus 151 ld~~~vP~~~R~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~i~G~~v~~s~~lp~~~~~~~~~~~~~A~~~a 230 (273) T protein:vir:10 151 LTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYV 230 (273) T ss_pred hhhcCCCcCCCEEEECHHHHHHHhcchhhhhhhhccccccceeeeeeeEEeceEEEEecccccCCccEEEEEeccceeee Confidence 9887753 34689999999988642 22 112 234556677899999999876655433 23445444444333 Q ss_pred EecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeecc Q lcl|NC_011614. 256 IPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADA 315 (324) Q Consensus 256 ~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~~ 315 (324) .. ...++..+.. 
+.| ...+++...+|+++++|++++.++...+ T Consensus 231 ~q-~~~~e~~r~~-------------~~~---~~~v~~~~~yg~~v~~~~~~~~l~~~g~ 273 (273) T protein:vir:10 231 SQ-IDTVEALRDQ-------------DSF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred ee-eehhhcccCC-------------Ccc---eeeeeeeeeeeeeEeccceEEEEeccCC Confidence 21 1122222211 123 4558889999999999999999887655 No 127 >protein:vir:108211 Length: 318 # NCBI annotation: gp9 # Family: family:all:6420 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552338;genbank:gi:160700658;genbank:GeneID:5758931 Probab=99.55 E-value=5.4e-16 Score=104.26 Aligned_cols=282 Identities=11% Similarity=0.019 Sum_probs=165.0 Q ss_pred hc-cccccccCCCcce-----e--chhhhHHHHHHHHhhcchhhhceeeec-CCCceEEEEEeC---Ccceeeecccccc Q lcl|NC_011614. 24 FN-PDNVMMHEKKDGT-----L--LNDFTTPILQEVMENSKIMQLGKYEPM-EGTEKKFTFWAD---KPGAYWVGEGQKI 91 (324) Q Consensus 24 ~~-a~~~~~~~~~g~l-----i--p~~~~~~i~~~~~~~s~l~~l~~~~~~-~~~~~~ip~~~~---~~~a~~v~Eg~~~ 91 (324) +. ..++..+.+++.+ + |+-+-+.|.+.+.+.-+.-.+.+.+.. .++...+-.... ..++.-|+||+++ T Consensus 1 ~~~~~~i~s~~~~~~itv~~ll~~P~~I~~~i~e~~~~~~iad~lf~~~~a~~~~~v~f~~~~p~~~~~d~e~VaEggEi 80 (318) T protein:vir:10 1 MTAPTGIVSVSDGPAITVRELVGNPLWIPTALKKMMVNQFISESLFRNGGANPNGVVAYNEGNPSFLEDDVADVAEFGEI 80 (318) T ss_pred CCCCCcceeeecCCceehHHhhCCchhHHHHHHHHHhccchhhhhhhcccccccceeEEEecccccccCcHhhccCcccc Confidence 11 1122223333332 1 444445555555444444445565544 344343322221 2466778999999 Q ss_pred cccccceeeEEe-eeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhccCcCcCCcccccccccc-cce Q lcl|NC_011614. 92 ETSKATWVNATM-RAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKT-NKV 169 (324) Q Consensus 92 ~~~~~~~~~v~~-~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~~~~~~~~~~~~~~-~~~ 169 (324) |...+.++...+ ..+|.|..+.||+|+++.+..+..+-...+++.+++++.|+.++.---+...+.......... ... 
T Consensus 81 P~~~~~~G~~~ia~~~K~G~~~~vS~Em~~~n~~~~v~r~~~~l~Nti~r~~d~~a~dal~sa~t~~~~~s~~w~~~~~~ 160 (318) T protein:vir:10 81 PVSAGARGLPRTAFAVKKALGVRVSKEMIDENRVGAVNDQMLQLRNTFIRANDRSAKALLQSPIVPTLAVPTAWDNGGKV 160 (318) T ss_pred cccCCCCCchhhhhhehhccceeccHHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccCCcCCCCcccc Confidence 999999988777 558999999999999999999999999999999999999987765321111010000000000 000 Q ss_pred eecccchh-HHHHHHH--------Hh-hhhccCCCEEEEcHHHHHHHH------HhhccCCceeec-----cCCCceecc Q lcl|NC_011614. 170 IKGDFTQD-NIIDLEA--------LL-EDDELEANAFISKTQNRSLLR------KIVDPETKERIY-----DRNSDSLDG 228 (324) Q Consensus 170 ~~~~~~~~-~i~~~~~--------~l-~~~~~~~~~~v~~~~~~~~L~------~l~d~~g~~~~~-----~~~~~~l~G 228 (324) ..+..... .+..+.. .. ..-++.++.++|||.+|..|. ++...++.+.+. ...+++++| T Consensus 161 ~~d~~~A~e~v~~a~~~~~~a~~~~~~~~~GY~pdtIVlhP~~~~~l~~n~~~~~~y~~~a~~~~~~~~~tg~~~g~~lG 240 (318) T protein:vir:10 161 RTDIAIAIEQISTAAPTAYPAGVGSSDEYFGFIPDTIVMHYALLPILMDNENFMKVYERNANYVSTAPDWTGNFPGSVMG 240 (318) T ss_pred cccchhhhhhhhhhhhhhhhhhhhhhhhccCccceeeEECHHHHHHHhcchhhhhhhhccchhhhhcccccccccceeec Confidence 01111111 1111111 01 133577899999999999994 444445555542 223567899 Q ss_pred cceEeecCccCCCceEEEeecccE-EEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccce Q lcl|NC_011614. 229 LPVVNLKSSNLKRGELITGDFDKL-IYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAF 307 (324) Q Consensus 229 ~pv~~~~~~~~~~~~i~~gd~~~~-~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~ 307 (324) +.|+.++ ..+.+.++..+...+ ++++.+.+..+-.+.-.. ++++. .+.+..+|+.......|.+|+|+ T Consensus 241 l~vi~s~--~~p~~~alvlq~g~vG~~~d~~pl~~t~~~~egg----~~~g~-----~~~s~~~~~~~~~~~~V~~PkA~ 309 (318) T protein:vir:10 241 LNVIRSR--TFPIDRVLIMERGTVGFYSDTRPLQFTALYPEGN----GPNGG-----PTESYRADASHKRALAVDQPKAA 309 (318) T ss_pred eEEeecC--ccCCCeeEEEecCCcceeeccccceeeecccCCC----CCCCC-----cchhhheehheeeeeeeeCccee Confidence 9998865 455667888887655 345555555443332111 11111 23445578888889999999999 Q ss_pred EEEEeeccC Q lcl|NC_011614. 
308 AKLVPADAK 316 (324) Q Consensus 308 ~~l~~~~~~ 316 (324) ++||+.-.. T Consensus 310 ~~itgi~~~ 318 (318) T protein:vir:10 310 LWLTGIVTP 318 (318) T ss_pred EEEeeccCC Confidence 999986433 No 128 >protein:vir:94622 Length: 341 # NCBI annotation: PfWMP4_37 # Family: family:all:2203 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762667;genbank:gi:115304375;genbank:GeneID:5142322 Probab=99.51 E-value=2.1e-15 Score=101.00 Aligned_cols=290 Identities=7% Similarity=-0.037 Sum_probs=169.4 Q ss_pred hcccccc-----ccCCCcceechhhhHHHHHHHHhhcchhhhceeee---cCCCceEEEEEeCCcceeeecccccccccc Q lcl|NC_011614. 24 FNPDNVM-----MHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEP---MEGTEKKFTFWADKPGAYWVGEGQKIETSK 95 (324) Q Consensus 24 ~~a~~~~-----~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~---~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~ 95 (324) +.-.|.. ++..-..+||+.++.++++.+++...+.++++... ..+.++.||+.. .+.+.-+.++.+++..+ T Consensus 1 ~~~~~~~~~~~~~t~~v~~fipei~s~~i~~~l~~~~v~~~~~~d~~~~~~~Gdtv~ip~~g-~~~~~d~~~~~~i~~~~ 79 (341) T protein:vir:94 1 MALGNTITGPSINTQRGQQFIPEQWLSEVQMFRKAKMLDTSVVKTWGAQVKKGDTFHVPRIS-ELGVEDKATDVPVGVQP 79 (341) T ss_pred CcchhhhccccccchhHHHHHHHHHHHHHHHHHHhhcchhhccccccccccCCceEEEeccC-cceeeeecCCCcccccc Confidence 1111111 12223347899999999999999999888876543 235678899865 56677788888888777 Q ss_pred cceeeEEeeeee-EEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhccC--cCcCCcccccccccccceeec Q lcl|NC_011614. 96 ATWVNATMRAFK-LGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQG--NNPFGKSIAQSIEKTNKVIKG 172 (324) Q Consensus 96 ~~~~~v~~~~~k-~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g--~~~~~~~~~~~~~~~~~~~~~ 172 (324) .+-.++++...+ ....+.|+++-...+..++.+.+.++..+++++++|+.++.--. +.................... 
T Consensus 80 ~~~~~~~itiD~~~~~~~~i~d~d~~~~~~d~~~~~~~~~~~aLA~~~D~~i~~~~a~~~~~~~~~~~~~~~~~~t~~~~ 159 (341) T protein:vir:94 80 VNDTDFVITVDTDRTTAVALDDLLEIQASYDLRAPYLEAMGYALAKDMTGSILGLRAAVQNTASQNVFSSSNGAITGNGQ 159 (341) T ss_pred ccCceEEEEEeeeeecceeechHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCccccCccccccCchh Confidence 777888888744 45667788866666778999999999999999999998775311 111101111111111111223 Q ss_pred ccchhHHHHHHHHhhhhccCC--CEEEEcHHHHHHHHHhh-----ccCCceeeccCCCceecccceEeecCccCCCceEE Q lcl|NC_011614. 173 DFTQDNIIDLEALLEDDELEA--NAFISKTQNRSLLRKIV-----DPETKERIYDRNSDSLDGLPVVNLKSSNLKRGELI 245 (324) Q Consensus 173 ~~~~~~i~~~~~~l~~~~~~~--~~~v~~~~~~~~L~~l~-----d~~g~~~~~~~~~~~l~G~pv~~~~~~~~~~~~i~ 245 (324) ..+++.+.++...|..++... -.++++|..+..|.+.. +..|...+..+..++++|++|+.++..+......+ T Consensus 160 ~~~~~~i~~a~~~Lde~~VP~~gR~lvv~P~~~~~Ll~~~~~~~~~~~g~~~l~~G~ig~i~G~~V~~Sn~lp~~~~~~~ 239 (341) T protein:vir:94 160 AFSFAVFLAARRLLLEADVPEEKIVLLISPGQESALFTIPQFISKDFINNAPIAQGQIGSLMGVRVIRTSLIGNNSATGW 239 (341) T ss_pred hhhHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHhhchhhhhhhccccchhheeeeeeEeceEEEEeccccccccccc Confidence 457888999999998876533 35788999999986421 22344445566678999999998765443221111 Q ss_pred -------------------------Eeeccc--EEEEEecce-EEEEee-----cccccccccccccchhhhhcCcEEEE Q lcl|NC_011614. 246 -------------------------TGDFDK--LIYGIPQLI-EYKIDE-----TAQLSTVKNEDGTPVNLFEQDMVALR 292 (324) Q Consensus 246 -------------------------~gd~~~--~~~~~~~~~-~i~~~~-----~~~~~~~~~~~~~~~~~f~~~~v~~r 292 (324) -++++. .+++.+..+ .+++.+ ................ +-...++ T Consensus 240 ~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~gl~~~~~av~~~k~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~i~ 316 (341) T protein:vir:94 240 RNGAPTIAPAEATPGFTGSRYLPKQDSFTSLPATFTGNSRPVHTAVMCHMDWAAAVVSKAPRVTQSFENR---EQVWLMV 316 (341) T ss_pred cccccceecccccccccccccccccccccccEEEEEEecccccceeeecchhhhccccccccccccchhh---hhhhhhh Confidence 011111 011111111 111111 0000001111111111 1123356 Q ss_pred EEEEeccEEecccceEEEEeeccCCCC Q lcl|NC_011614. 
293 ATMHVALHIADDKAFAKLVPADAKPSS 319 (324) Q Consensus 293 ~~~r~d~~v~~~~a~~~l~~~~~~~~~ 319 (324) +..-||.+++||+|.+.|.. ...++ T Consensus 317 ~~~~~G~~~lrp~~~v~~~~--~~~~~ 341 (341) T protein:vir:94 317 GRQAYGARLYRPLHAVNIHT--TGDTV 341 (341) T ss_pred hhhhhcccccCcceeEEEec--CcCCC Confidence 66778999999999665543 33344 No 129 >protein:vir:8324 Length: 410 # NCBI annotation: gp41 # Family: family:all:30827 # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817892;genbank:gi:29566325;genbank:GeneID:1259520 Probab=99.48 E-value=1.6e-15 Score=101.66 Aligned_cols=280 Identities=13% Similarity=0.119 Sum_probs=179.8 Q ss_pred CchhhH---------------------------------------HHHHH---HHHhhccchhhhhcc-ccccccCCCcc Q lcl|NC_011614. 1 MEQTQK---------------------------------------LKLNL---QHFASNNVKPQVFNP-DNVMMHEKKDG 37 (324) Q Consensus 1 m~~~~~---------------------------------------~~~~~---~~~~~~~~~~~~~~a-~~~~~~~~~g~ 37 (324) ||...+ +|... +..+......+++|. .....+.+..+ T Consensus 62 ~e~~~~~~~~~~E~Rs~~~~i~~~~~~~r~~p~~~~veyRSaGE~lkal~~~~~Gd~~A~~~~e~~r~a~~~~~Tgd~~~ 141 (410) T protein:vir:83 62 MEQAQEVNRIAFETRSKGQAVDAAISAMRGSPVGTEVEYRSAGEYMLDMWNSAQGNASAADRLEVYARAADHQKTGDLQG 141 (410) T ss_pred hHHHHHHHHHHHHHHHHHHHHHhhhccCcCCCCCCCcccccHHHHHHHHhccCCchHHHHHHHHHHHHhhccCccccccc Confidence 222211 11111 111111111233332 22233444556 Q ss_pred eechhhhHHHHHHHHhhcchhhhceeeecCCCceEEEEEeCCccee-------eecccccccccccceeeEEeeeeeEEE Q lcl|NC_011614. 
38 TLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKPGAY-------WVGEGQKIETSKATWVNATMRAFKLGV 110 (324) Q Consensus 38 lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~~a~-------~v~Eg~~~~~~~~~~~~v~~~~~k~~~ 110 (324) +||+++..+.++.+.+..++.+++...|.++.++.+|+.+..+..+ --.||...+..+.+|+..+...+.+|+ T Consensus 142 ~i~~~~v~d~i~li~q~r~i~slf~tLP~~g~T~eY~v~t~~~tV~~q~~~~kqa~EGd~L~~gKl~~~t~tA~ikTyGG 221 (410) T protein:vir:83 142 VIPDPIVGPVIDFIDSARPLVSTLGTLPLNNATFYRPIVSQRPAVGLQGVAGGASDEKTELDSQKMVIDRLTVNAKTLGG 221 (410) T ss_pred ccchhHhhhHHHHHhhccchhhhhhhCCCCCCeeEEeeecccccccccccccccccccccccccceeeeeccceeehhcC Confidence 7899999999999999999999999999999999999887766542 124899999999999999999999999 Q ss_pred eehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHH---HHhccCcCcCCcccccccccccceeecccchhHH----HHHH Q lcl|NC_011614. 111 ILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEA---GILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNI----IDLE 183 (324) Q Consensus 111 ~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a---~l~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i----~~~~ 183 (324) +..+|++.++.|.+...+..++.|+-+++++-+.+ +|..+-+. ....+..|.+.| .+.. T Consensus 222 yt~LSRQ~IERs~v~~L~~~lraL~~AYA~atea~vra~L~~t~t~--------------~~a~~~~Tad~~~~~i~da~ 287 (410) T protein:vir:83 222 YVNVSRQAIDFSSPSALDLVVNGLGQQYAIETEALVGAALASTSTG--------------AVGYGNATADNVASAIWQAA 287 (410) T ss_pred cccccceeeecCChhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh--------------hhhhhhccHHHHHHHHHHHH Confidence 99999999999999999999999999999988854 44332111 011223344444 4445 Q ss_pred HHhhhh--ccCCCEEEEcHHHHHHHHHh--------hccCCcee--eccCCCceecccceEeecCccCCCceEEEeeccc Q lcl|NC_011614. 184 ALLEDD--ELEANAFISKTQNRSLLRKI--------VDPETKER--IYDRNSDSLDGLPVVNLKSSNLKRGELITGDFDK 251 (324) Q Consensus 184 ~~l~~~--~~~~~~~v~~~~~~~~L~~l--------~d~~g~~~--~~~~~~~~l~G~pv~~~~~~~~~~~~i~~gd~~~ 251 (324) .++.++ +..-..+.++|+.+..+.++ .|..|-.. +-.+..+.+++.||++.+ ..+.++++|-|... 
T Consensus 288 ~~v~da~~~~~~~~i~vS~DVl~~~~~~f~~~~~~~~dt~Gfg~~~lg~gi~G~~~~ipVvm~~--~a~AgTA~f~~~~A 365 (410) T protein:vir:83 288 GAVYTAVKGMGRLVIAIAPDVLGDFGPLFAPVNPTNAHSTGFEAGRFGQGVMGSISGIPVVMSA--ALGSGDAYLFSTAA 365 (410) T ss_pred HHHhhhhccceeeeEEechhhhhhccceeeccCCCCcccccccccccccchhhhhcccceEEec--CCCcCeeeEeccce Confidence 555554 34445689999998776542 22223111 224556789999999865 45556777777665 Q ss_pred EEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEee Q lcl|NC_011614. 252 LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPA 313 (324) Q Consensus 252 ~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~ 313 (324) +.......-.+.+.++ .+.+|-..-+ .||.+.+..+.++.=|.+. T Consensus 366 i~~~eS~~gp~qL~d~-----------~i~nLt~~yS------gY~a~a~~~~~gliPv~g~ 410 (410) T protein:vir:83 366 IECFEQRVGTLQVVEP-----------SVFGLQVAYA------GYFSTLVVNEDAIVPLVGS 410 (410) T ss_pred eeeeecCCceeEeeCC-----------chhhhhhhhe------eeeeeccccccceeeeccC Confidence 5554444322333322 2222221111 5778889999988877765 No 130 >protein:vir:80213 Length: 334 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522884;genbank:gi:158345177;genbank:GeneID:5687476 Probab=99.40 E-value=4.4e-14 Score=93.82 Aligned_cols=283 Identities=10% Similarity=-0.023 Sum_probs=166.0 Q ss_pred hhccchhhhhccccccccCCCc--ceechhhhHHHHHHHHhhcchhhhceeeecC-CCceEEEEEeCCcceeeecccccc Q lcl|NC_011614. 15 ASNNVKPQVFNPDNVMMHEKKD--GTLLNDFTTPILQEVMENSKIMQLGKYEPME-GTEKKFTFWADKPGAYWVGEGQKI 91 (324) Q Consensus 15 ~~~~~~~~~~~a~~~~~~~~~g--~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~-~~~~~ip~~~~~~~a~~v~Eg~~~ 91 (324) |.+......-|... ...++ .+.-+.+..+|++.....++++++.++.++. +.+..||+. 
+.++++...-|+++ T Consensus 1 m~~~~~~~~t~~~~---~~~~~~~~l~le~~~geV~~af~~~s~~~~~~~~r~i~~G~s~~~~~i-G~~~~~~~~~g~~l 76 (334) T protein:vir:80 1 MTYPAANTHTRPGW---GGANSDVSLHIEEHLGLVDASFMYSSKFASWMNVRSLRGTNQLRVDRV-GASTIAGRKAGEEL 76 (334) T ss_pred CCCCcCCCcccccc---ccccchheehhhhhhhHHHHHHHHhhhhhccceeeeccccceEEEeee-cceeeeeecCCCCC Confidence 33222211112111 11122 2334999999999999999999999988876 556788876 56777888888898 Q ss_pred cccccceeeEEeeeee-EEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhc----cCcC-------cCCccc Q lcl|NC_011614. 92 ETSKATWVNATMRAFK-LGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILN----QGNN-------PFGKSI 159 (324) Q Consensus 92 ~~~~~~~~~v~~~~~k-~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g----~g~~-------~~~~~~ 159 (324) ..+..+.++.++.... +.....|.+----++..|+.+.+.+++++++++..|++++.- .... +.+.+. T Consensus 77 ~~~~~~~~~~~l~ID~~l~~~~~VddiD~~q~~~D~rse~~~~~G~aLA~~~D~~~~~~l~kaa~~~~~~~~~~~~~~G~ 156 (334) T protein:vir:80 77 VVQKNVSDKLNLTVDTVLYARHFFDKFDEWTSNLDVRKETAREDGIALARQYDQACIIQLQKCGDFLAPAHLKPAFHDGI 156 (334) T ss_pred CCCCcccCceEEEEeeeeehhhhHhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccccCCc Confidence 8888888888887766 344455544333345578999999999999999999987521 1110 010111 Q ss_pred ccccccccceeecccc----hhHHHHHHHHhhhhccC-----CCEEEEcHHHHHHHHHhhc---c-----CCceeeccCC Q lcl|NC_011614. 160 AQSIEKTNKVIKGDFT----QDNIIDLEALLEDDELE-----ANAFISKTQNRSLLRKIVD---P-----ETKERIYDRN 222 (324) Q Consensus 160 ~~~~~~~~~~~~~~~~----~~~i~~~~~~l~~~~~~-----~~~~v~~~~~~~~L~~l~d---~-----~g~~~~~~~~ 222 (324) ......+........+ ++.+..+...|...+.. .-..+++|..|..|..-.. . .+.-.+..+. 
T Consensus 157 ~~~~~~~g~~~~~~~~~~~l~~a~~~a~~~L~e~dvp~~~~~~R~~vv~P~~y~~Ll~~~r~~n~d~~~s~~~~~~~~g~ 236 (334) T protein:vir:80 157 LLPSTISGLAADAAADADVLVAAHRQGVEAMVFRDLGDQLMSEGVTLLDPVIFSFLLEHDRLMNVEFGAKEGGNSFVGGR 236 (334) T ss_pred ceeecccccccchhhhHHHHHHHHHHHHHHHHhcCCCCCcCCceEEEeChHHHHHHhcccccccceecccccccccccee Confidence 1111111111111122 23344566666665544 2467899999999864321 1 1111233444 Q ss_pred CceecccceEeecCccC---------CCceEEEeecccEE----------EEEecceEEEEeecccccccccccccchhh Q lcl|NC_011614. 223 SDSLDGLPVVNLKSSNL---------KRGELITGDFDKLI----------YGIPQLIEYKIDETAQLSTVKNEDGTPVNL 283 (324) Q Consensus 223 ~~~l~G~pv~~~~~~~~---------~~~~i~~gd~~~~~----------~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~ 283 (324) .++++|++|+.++..+. .....+.|||+... .+...++..++.++.. .... T Consensus 237 i~~v~G~~V~~Sn~~P~~~~t~~~~g~~~~~~agd~t~~~~~~~~~~Al~t~~~~~~~~e~~~~~~----------~~~d 306 (334) T protein:vir:80 237 IAMLNGVRVVETPRFPQSAITANALGADFNVTDAEVRRKMITFIPSMALISAQVHPVSAQFWEEKK----------DFGH 306 (334) T ss_pred EEEEeceEEEeecCCCCccccccccccccccccccccceEEEEEeCceEEEEEEeecceeeeechh----------hHHH Confidence 56889999998764432 23346677766532 1222222223322221 1111 Q ss_pred hhcCcEEEEEEEEeccEEecccceEEEEeeccCC Q lcl|NC_011614. 
284 FEQDMVALRATMHVALHIADDKAFAKLVPADAKP 317 (324) Q Consensus 284 f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~~~~ 317 (324) | +.+..-+|.+++||+|++.++.....| T Consensus 307 ~------i~~~~a~G~g~lRPeaa~vv~~~~~~~ 334 (334) T protein:vir:80 307 Y------LDTFQSYNIGQRRPDAVAVHDITVTNP 334 (334) T ss_pred H------HHHHHHcCCceeccceEEEEEEeeecC Confidence 1 233355799999999999998876666 No 131 >protein:vir:1583 Length: 351 # NCBI annotation: minor capsid protein # Family: family:all:1522 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695165;swissprot:trembl:o03966;genbank:gi:23455804;uniprot:O03966;genbank:GeneID:955561 Probab=99.35 E-value=1.9e-13 Score=90.31 Aligned_cols=272 Identities=10% Similarity=0.012 Sum_probs=169.1 Q ss_pred ccccCCCcceechhhhHHHHHHHHhhcchhh---------hceeeecCCCceEEEEEeC-Ccceeeecccccccccccce Q lcl|NC_011614. 29 VMMHEKKDGTLLNDFTTPILQEVMENSKIMQ---------LGKYEPMEGTEKKFTFWAD-KPGAYWVGEGQKIETSKATW 98 (324) Q Consensus 29 ~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~---------l~~~~~~~~~~~~ip~~~~-~~~a~~v~Eg~~~~~~~~~~ 98 (324) ..+|.-...++|+.+...+.+...+.+.+++ +.....-++..+.+|.|.. +.++.-+.|+..++..+.+- T Consensus 1 MA~T~lsd~i~PEvf~~yv~~~~~~~~~l~qSG~i~~~~~l~~~~~~~G~~it~P~~~~l~Gd~~~~~~~~~i~~~kitt 80 (351) T protein:vir:15 1 MAETHLSDLIVPEVFGNYVVNQIIKTNRFVQSGILTPDPDLGPHLLEAGTRITVPFLNDLTGDPDNWTDSDDIDVNNLTS 80 (351) T ss_pred CCceeeeeeechhHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhcCCCEEEecccccCCCcccccCCCcccchheecc Confidence 1233334556777777777676666665543 1122334577789999975 35788889999999999998 Q ss_pred eeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhcc----CcCcCCcccccccccccceeeccc Q lcl|NC_011614. 99 VNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQ----GNNPFGKSIAQSIEKTNKVIKGDF 174 (324) Q Consensus 99 ~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~----g~~~~~~~~~~~~~~~~~~~~~~~ 174 (324) ++-....++.+..+.++++...-+..++...+.+++++..++..++.+|.-- +............. 
........+ T Consensus 81 ~~~~a~i~~~~kg~~~tD~a~~~sg~dp~~~i~~q~a~~w~~~~q~~lla~l~gv~~~~~~~~~~~~d~t-~~~~~~~~i 159 (351) T protein:vir:15 81 GKQQGIKFYQTKAYGYTDLGTMISGAPVQETIGNRFAAFWQRADQKTLLSVLKGVMGVTKIANSKVYDQT-KVSPSEPMF 159 (351) T ss_pred cceeEEEEeeccceehhhhhHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhchhhcccceeccc-ccccccccc Confidence 8888888889999999998888888899999999999999999998777521 00000000000011 111233457 Q ss_pred chhHHHHHHHHhhhhccC-CCEEEEcHHHHHHHHHhh------ccCCceeeccCCCceecccceEeecCccCC------- Q lcl|NC_011614. 175 TQDNIIDLEALLEDDELE-ANAFISKTQNRSLLRKIV------DPETKERIYDRNSDSLDGLPVVNLKSSNLK------- 240 (324) Q Consensus 175 ~~~~i~~~~~~l~~~~~~-~~~~v~~~~~~~~L~~l~------d~~g~~~~~~~~~~~l~G~pv~~~~~~~~~------- 240 (324) +++.+.++..++.+.... -.+|+||+.++..|++.. .++| ....++++|++|++++..+.. T Consensus 160 s~~~l~~A~~~~GD~~~~~~~~ivmhS~v~~~L~~~~li~~~~~s~~-----~~~i~t~~G~~VivdD~~p~~~~~~~~~ 234 (351) T protein:vir:15 160 GAKGFTGAIGLMGDLQDTAFGAIAVNSATYSLMKVQGLIETIQPQNG-----ATPFEAYNGLRIVLDDDIEIDLTDKTKP 234 (351) T ss_pred CHHHHHHHHHHhccccccceEEEEEChHHHHHHHhhhhhhhcccccc-----CcccceecceEEEEcCCCccccCCCCCc Confidence 889999999999886544 588999999999997533 2222 223578999999997655431 Q ss_pred CceEEEeecccEEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEE--eeccCCC Q lcl|NC_011614. 241 RGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLV--PADAKPS 318 (324) Q Consensus 241 ~~~i~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~--~~~~~~~ 318 (324) +...|+--...+.++.. ...+++.++.... .++-.+..+.+ .+++|.++..-+ ..+...+ T Consensus 235 ~ytsyl~~~GAi~~~~~-~~~ve~~rd~~~~--------------~g~d~l~~r~~---~~~hp~G~s~~~~~~~~~~~s 296 (351) T protein:vir:15 235 VSTSYIFAPGAVRYSTN-MRSTETKYDPLIN--------------GGQDVIVQKRV---GTIHVAGTSIKASFSPSKASF 296 (351) T ss_pred eeEEEEEecceeeeecC-CcCcceeecccCC--------------CCceEEEEeee---eeeeeeeeeecccccccCcCC Confidence 12222222223333333 3345555554321 11222222222 256777776643 2334455 Q ss_pred CccccC Q lcl|NC_011614. 
319 SVPGEV 324 (324) Q Consensus 319 ~~~~~~ 324 (324) +|.+|. T Consensus 297 Pt~~~L 302 (351) T protein:vir:15 297 PTIDEL 302 (351) T ss_pred cChHHh Confidence 666766 No 132 >protein:vir:102944 Length: 330 # NCBI annotation: major head protein # Family: family:all:1522 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945286;genbank:gi:39653721;uniprot:Q708M6;genbank:GeneID:2672858 Probab=99.35 E-value=4.2e-13 Score=88.41 Aligned_cols=278 Identities=12% Similarity=0.031 Sum_probs=173.0 Q ss_pred ccccccCCCcceechhhhHHHHHHHHhhcchhhh---------ceeeecCCCceEEEEEeC-Ccceeeecccc-cccccc Q lcl|NC_011614. 27 DNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQL---------GKYEPMEGTEKKFTFWAD-KPGAYWVGEGQ-KIETSK 95 (324) Q Consensus 27 ~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l---------~~~~~~~~~~~~ip~~~~-~~~a~~v~Eg~-~~~~~~ 95 (324) +.-..|.-...++|+.+...+.+...+.+.+++- ......++..+.+|.|.. +.++.-+.||+ .++..+ T Consensus 1 Ma~~~T~l~d~i~pevf~~yv~~~~~~~~~l~qSG~i~~~~~i~~~~~~~G~~i~~P~~~~l~G~~~~~~dg~~~i~~~k 80 (330) T protein:vir:10 1 MANELTKILDTITPQQYNAYMQQYTAAKSAFVQSGIAVSDERVSKNITSGGLLVNMPFWNDLTGDSEVLGNGDKALETGK 80 (330) T ss_pred CCCCceEeeeeechhHHHHHHHHHhHHhhhhhhcccccccHHHHHHhhcCCCEEEecccccCCCcccccCCCccccchhh Confidence 1111233445667888877777777666655332 222334677789999974 46777888986 588889 Q ss_pred cceeeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhcc------CcCcCCcccccccccccce Q lcl|NC_011614. 96 ATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQ------GNNPFGKSIAQSIEKTNKV 169 (324) Q Consensus 96 ~~~~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~------g~~~~~~~~~~~~~~~~~~ 169 (324) .+-++-....++.+..+.++++...-+..+....+.+++++...+..++.+|.-- ................... 
T Consensus 81 i~t~~~~a~i~~~~k~~~~tD~a~~~~g~dp~~~i~~q~a~~w~~~~q~~lla~l~gvf~~~~~~~~~~~~~~~~~~~~~ 160 (330) T protein:vir:10 81 ITAGADIACVLYRGRGWAANELTGVVAGSDPVRAILNRIGAYWLREDQKALIATLNGIFATGTAGEKGALEETHVSDQSK 160 (330) T ss_pred cccceeEEEEEeecceeeehhhhhhhcchhHHHHHHHHHHHHhhhhHHHHHHHHHHhhhhhhhcccchhhhhhheecccc Confidence 9988888888999999999999888888899999999999999998887666421 1110000000010111122 Q ss_pred eecccchhHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhh--ccCCceeeccCCCceecccceEeecCccCCC--ceEE Q lcl|NC_011614. 170 IKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIV--DPETKERIYDRNSDSLDGLPVVNLKSSNLKR--GELI 245 (324) Q Consensus 170 ~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~--d~~g~~~~~~~~~~~l~G~pv~~~~~~~~~~--~~i~ 245 (324) ....++++.+.++..++.+....-.+|+||+.++..|++.. +.. ++.-.....++++|++|++++..+... ...| T Consensus 161 ~~a~~s~~~l~~A~~~~GD~~~~~~~ivmhS~v~~~L~~~~li~~~-~~s~~~~~i~~~~G~~VivdD~~p~~~~~yt~y 239 (330) T protein:vir:10 161 ASTGIDAGMVLDAKQLLGDSADQVTAIAMHSAVYTKLQKDNLIQYI-QPTTATINIPTYLGYRVIIDDGIAPTGDIYTSY 239 (330) T ss_pred cccccCHHHHHHHHHHhccccccceEEEEcHHHHHHHHHhhhhhhh-cccccCcccccccceEEEEeCCCCCCCCceeEE Confidence 33457889999999999988877889999999999997632 111 111112345789999999987665432 2222 Q ss_pred EeecccEEEEE---ecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEee--ccCCCCc Q lcl|NC_011614. 246 TGDFDKLIYGI---PQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPA--DAKPSSV 320 (324) Q Consensus 246 ~gd~~~~~~~~---~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~--~~~~~~~ 320 (324) +--...+.++. .....+|.+|+.. +++..+-...+ .+++|.++..-+.. ....++| T Consensus 240 l~~~GAi~~~~~~~~~~v~~EtdRd~~----------------~g~~~l~~r~~---~~~hp~G~s~~~~~~~~~~~sPt 300 (330) T protein:vir:10 240 LFRTGSIGLNTGNPSGLTTFETSREAA----------------KGNDMIYTRRA---LVMHPYGVKWTGAEVDAGNITPS 300 (330) T ss_pred EEecCceeeecccCCccccccccCCcc----------------ccceEEEEeeE---EEeeeeeeeecccccccCcCCcC Confidence 22222333332 1224556555532 22233333333 35667877776543 2445667 Q ss_pred cccC Q lcl|NC_011614. 
321 PGEV 324 (324) Q Consensus 321 ~~~~ 324 (324) -+|. T Consensus 301 ~~~L 304 (330) T protein:vir:10 301 NADL 304 (330) T ss_pred hHHh Confidence 7776 No 133 >protein:vir:80180 Length: 381 # NCBI annotation: capsid protein # Family: family:all:2203 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285797;genbank:gi:148747831;genbank:GeneID:5220456 Probab=99.35 E-value=2.8e-13 Score=89.37 Aligned_cols=292 Identities=8% Similarity=0.011 Sum_probs=169.3 Q ss_pred HHHHhhccchhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeec---CCCceEEEEEeCCcceeeecc Q lcl|NC_011614. 11 LQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPM---EGTEKKFTFWADKPGAYWVGE 87 (324) Q Consensus 11 ~~~~~~~~~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~---~~~~~~ip~~~~~~~a~~v~E 87 (324) +.....+ .-+.+- ...++..-.++|+.++.++++.+++...+.+++..... .+.++.||+.. .+.+..+.+ T Consensus 1 ~~~~~~~----~~~~~~-~~~~t~~~~fiPev~s~~v~~~l~~~lv~~~l~~~~~~~~~~GdTV~ip~~g-~~~a~d~~~ 74 (381) T protein:vir:80 1 MATIQGT----GGYKGS-AVDLSNVQVFIPEVWSSEVRMFRDQKFAALEATKKIPFEGKKGDLIHIPNIS-RAAVYDKQP 74 (381) T ss_pred Cceeccc----ccccCc-ccchhhHHhhhhHHHHHHHHHHHHHhhhhhhccccccceeecCceEEeeccC-cceeeeecC Confidence 1111111 001111 12222335688999999999999999998888765432 35678899865 567888899 Q ss_pred cccccccccceeeEEeeeeeE-EEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhccC---cCcCCc------ Q lcl|NC_011614. 88 GQKIETSKATWVNATMRAFKL-GVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQG---NNPFGK------ 157 (324) Q Consensus 88 g~~~~~~~~~~~~v~~~~~k~-~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g---~~~~~~------ 157 (324) +.+++..+.+.++++++..+. .....|++.-...+..++.+.+.+++..++++++|+.++.--. ....+. 
T Consensus 75 g~~i~~~~~~~~~~~itID~~~~~~~~Idd~D~~~~~~D~~~~~~~~~~~aLA~~~D~~i~~~~~~~~~~~~~~~~t~~~ 154 (381) T protein:vir:80 75 QTPVNLQARTDSEFTFTVTKYKESSFMIEDIVNTQASYTLRQYYTKEAGYALARDMDNFALAHRAVINAFPSQRIYSYDT 154 (381) T ss_pred CCcccccccCCceEEEEEeeeeecceeechHHHHhhccChHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccc Confidence 999888888888877777443 4446777765666667999999999999999999998874311 110000 Q ss_pred cccc-ccccccceeecccchhHHHHHHHHhhhhccCC--CEEEEcHHHHHHHHHhh-----ccCCceeeccCCCceeccc Q lcl|NC_011614. 158 SIAQ-SIEKTNKVIKGDFTQDNIIDLEALLEDDELEA--NAFISKTQNRSLLRKIV-----DPETKERIYDRNSDSLDGL 229 (324) Q Consensus 158 ~~~~-~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~--~~~v~~~~~~~~L~~l~-----d~~g~~~~~~~~~~~l~G~ 229 (324) .+.. .............+++.|+++...|..++... -.++++|..+..|.+.. +..+...+..+..++++|+ T Consensus 155 ~i~~~~~~~~~t~~~~~~t~~~i~~a~~~Lde~~VP~egR~lvv~P~~~~~Ll~~~~~~~ad~~~~~~l~~G~Ig~i~G~ 234 (381) T protein:vir:80 155 TLGDGTVNAHLTGTPAPLTYAALLLAKQKLDEADVPQEGRIVMVSPAQYIDLLSINQFISVDFSQVKPVTSGVVGTILGM 234 (381) T ss_pred cccccccccccccchhhHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhhchhhhhhhhccchhhhceeeeEEcce Confidence 0000 01111111223457889999999998876532 36899999999886432 2223334556667899999 Q ss_pred ceEeecCccCCCceEEEeecccEEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEec-ccceE Q lcl|NC_011614. 230 PVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIAD-DKAFA 308 (324) Q Consensus 230 pv~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~-~~a~~ 308 (324) +|+.++..+......+...+ +.... .........+. ..|..+..++|....+|..+.. -..+- T Consensus 235 ~Vv~Sn~lp~~~~t~~~~~a-----gap~~----~~~~~~~~~~~-------g~~s~~a~av~~~k~yd~~~~~~~~~~~ 298 (381) T protein:vir:80 235 EVIVTTQIGINSLTGYVNGQ-----GAPTQ----PTPGVLGSPYL-------PDQAGTANVVNTGSASDLAVSLSYFGLP 298 (381) T ss_pred EEEeecccccccccceeeec-----ccccc----ccccccccccc-------cccccceeeeeeeeeeceeeeeeeccce Confidence 99987654432111111100 00000 00000000000 1134455667777777777743 44444 Q ss_pred EEEeeccCCCCccccC Q lcl|NC_011614. 
309 KLVPADAKPSSVPGEV 324 (324) Q Consensus 309 ~l~~~~~~~~~~~~~~ 324 (324) ...++.+..+...+.. T Consensus 299 ~~~g~~~~~~~~~~~~ 314 (381) T protein:vir:80 299 VFSGAGATAADGGQTL 314 (381) T ss_pred eeecceeeecCCCcee Confidence 4444444444443333 No 134 >protein:vir:2201 Length: 345 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041998;swissprot:sw:p19726;genbank:gi:9627469;goa:P19726;uniprot:P19726;genbank:GeneID:1261026 Probab=99.31 E-value=2.6e-13 Score=89.62 Aligned_cols=282 Identities=11% Similarity=0.023 Sum_probs=164.3 Q ss_pred hhccchhhhhccccccccCCCc-----ceechhhhHHHHHHHHhhcchhhhceeeecC-CCceEEEEEeCCcceeeeccc Q lcl|NC_011614. 15 ASNNVKPQVFNPDNVMMHEKKD-----GTLLNDFTTPILQEVMENSKIMQLGKYEPME-GTEKKFTFWADKPGAYWVGEG 88 (324) Q Consensus 15 ~~~~~~~~~~~a~~~~~~~~~g-----~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~-~~~~~ip~~~~~~~a~~v~Eg 88 (324) |.+-.. ..+..+.+.+..+| .+.-+.+..+|++.....+.++++.++.++. +.+.++|+. +..++.....| T Consensus 1 ~~~~~~--~~~~~~~~~~~~~~~~~~~al~le~f~geV~~~f~~~s~~~~~~~~r~i~~gks~~~~~i-G~~~~~~~~~G 77 (345) T protein:vir:22 1 MASMTG--GQQMGTNQGKGVVAAGDKLALFLKVFGGEVLTAFARTSVTTSRHMVRSISSGKSAQFPVL-GRTQAAYLAPG 77 (345) T ss_pred Cccccc--chhcccccccccccCCchhHHHHHHHhHHHHHHHHHHhhhcccceeeeccccceEEEeee-cceEEEeeecC Confidence 111111 01111112222111 3555899999999999999999999988876 556778876 66678888888 Q ss_pred cccccc--ccceeeEEeeeeeE-EEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhcc----C-c-----CcC Q lcl|NC_011614. 89 QKIETS--KATWVNATMRAFKL-GVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQ----G-N-----NPF 155 (324) Q Consensus 89 ~~~~~~--~~~~~~v~~~~~k~-~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~----g-~-----~~~ 155 (324) ++...+ ++...+.+|...+. .....|.+----++..++.+.+.+++++++++..|+.++.-- . . .+. 
T Consensus 78 ~~l~~~~~~~~~~e~~ltID~~~y~~~~VddiD~~q~~~D~r~~~s~~~G~aLA~~~D~~i~~~l~k~a~~~~~~~~~~~ 157 (345) T protein:vir:22 78 ENLDDKRKDIKHTEKVITIDGLLTADVLIYDIEDAMNHYDVRSEYTSQLGESLAMAADGAVLAEIAGLCNVESKYNENIE 157 (345) T ss_pred CCCCCCCCCcccceEEEEecchhhhhhhHhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccc Confidence 887554 46677755554332 222334332223455789999999999999999999887311 0 0 011 Q ss_pred C--ccccccccccc-----ceeecccchhHHHHHHHHhhhhccCCC--EEEEcHHHHHHHHHhhccC-----CceeeccC Q lcl|NC_011614. 156 G--KSIAQSIEKTN-----KVIKGDFTQDNIIDLEALLEDDELEAN--AFISKTQNRSLLRKIVDPE-----TKERIYDR 221 (324) Q Consensus 156 ~--~~~~~~~~~~~-----~~~~~~~~~~~i~~~~~~l~~~~~~~~--~~v~~~~~~~~L~~l~d~~-----g~~~~~~~ 221 (324) + .+......... ....+...++.+.++..+|...+...+ .++++|..+..|..-+.-+ |.-....+ T Consensus 158 ~~~~~~~~~~~~~g~~~t~~~~~~~~~~~ai~~a~~~Lde~~VP~~~R~~vv~P~~y~~Ll~~~~~~~~~~~~~~~~~~G 237 (345) T protein:vir:22 158 GLGTATVIETTQNKAALTDQVALGKEIIAALTKARAALTKNYVPAADRVFYCDPDSYSAILAALMPNAANYAALIDPEKG 237 (345) T ss_pred ccccccccccccccccccccccCHHHHHHHHHHHHHHhhhcCCCccCCEEEeChHHHHHHhccccccccccccccccccc Confidence 1 11110111100 011122347778888888887775543 5789999999886433221 11112344 Q ss_pred CCceecccceEeecCccCC------------------------------CceEEEeecccEEEEEecceEEEEeeccccc Q lcl|NC_011614. 222 NSDSLDGLPVVNLKSSNLK------------------------------RGELITGDFDKLIYGIPQLIEYKIDETAQLS 271 (324) Q Consensus 222 ~~~~l~G~pv~~~~~~~~~------------------------------~~~i~~gd~~~~~~~~~~~~~i~~~~~~~~~ 271 (324) ..+++.|++|+.++..+.. ....++...+.+..+...++++|..++.. T Consensus 238 ~V~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~l~~h~~A~~~v~~~~~~~e~~r~~~-- 315 (345) T protein:vir:22 238 SIRNVMGFEVVEVPHLTAGGAGTAREGTTGQKHVFPANKGEGNVKVAKDNVIGLFMHRSAVGTVKLRDLALERARRAN-- 315 (345) T ss_pred eEEEEeceEEEecccccccccCccccCcccccccccccccceeeeeccCceEEEEEehhheeeeeeecceeeeeechh-- Confidence 4568899999887532210 00112222333334444444555554321 Q ss_pred ccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeecc Q lcl|NC_011614. 
272 TVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADA 315 (324) Q Consensus 272 ~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~~ 315 (324) .|. ..+++..-+|.+++||+|.+.|+-+-. T Consensus 316 -----------~~~---d~I~~~~a~G~~vlRPeaa~~i~~~~~ 345 (345) T protein:vir:22 316 -----------FQA---DQIIAKYAMGHGGLRPEAAGAVVFKVE 345 (345) T ss_pred -----------HHH---HHHHHHHhcCCcccccceeEEEEEeeC Confidence 222 236677789999999999999987766 No 135 >protein:vir:8885 Length: 347 # NCBI annotation: major capsid protein A # Family: family:all:975 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813774;genbank:gi:29366729;genbank:GeneID:1258837 Probab=99.28 E-value=3.4e-13 Score=88.94 Aligned_cols=285 Identities=12% Similarity=0.091 Sum_probs=163.9 Q ss_pred hhccchhhhhccccccccCCCc--ceechhhhHHHHHHHHhhcchhhhceeeecC-CCceEEEEEeCCcceeeecccccc Q lcl|NC_011614. 15 ASNNVKPQVFNPDNVMMHEKKD--GTLLNDFTTPILQEVMENSKIMQLGKYEPME-GTEKKFTFWADKPGAYWVGEGQKI 91 (324) Q Consensus 15 ~~~~~~~~~~~a~~~~~~~~~g--~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~-~~~~~ip~~~~~~~a~~v~Eg~~~ 91 (324) |.+-.....+....-....++. .+.-+++..+|++.....+.++++.++.+.. +.+..||+.. ...+.....|.++ T Consensus 1 ~a~~~~~~~~~~~~g~~~~~~d~~al~ie~~~geV~~~f~~~s~~~~~~~~r~i~~G~sv~~~~iG-~~~~~~~~~g~~l 79 (347) T protein:vir:88 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVMG-RTKGYYLAPGENL 79 (347) T ss_pred CCCcccchhhhccCCCCccccchHHHHHHHHHHHHHHHHHHHhhhhhccccccccCcceEEEeeec-ceeeeeeccccCC Confidence 2211111110000001111111 3445999999999999999999999876655 5567788654 4456667777776 Q ss_pred cc--cccceeeEEeeeeeE-EEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhc----cC----cCcCCcccc Q lcl|NC_011614. 92 ET--SKATWVNATMRAFKL-GVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILN----QG----NNPFGKSIA 160 (324) Q Consensus 92 ~~--~~~~~~~v~~~~~k~-~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g----~g----~~~~~~~~~ 160 (324) .. .++..+++++...+. .....|.+--.-++..++.+.+.++.++++++..|+.++.- .. .+....++. 
T Consensus 80 ~~~~~~~~~~~~~i~ID~~~y~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLA~~~D~~i~~~l~~~a~~~~~~~~~~~g~~ 159 (347) T protein:vir:88 80 DDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIAGLG 159 (347) T ss_pred CCCCCCCccceEEEEEechhhhhhhhhhHHHHhhcCCchHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccCCcc Confidence 54 356788888877664 33445555444455578999999999999999999988732 11 001111111 Q ss_pred c----ccccccce----eecccchhHHHHHHHHhhhhccCC--CEEEEcHHHHHHHHHhhcc-----CCceeeccCCCce Q lcl|NC_011614. 161 Q----SIEKTNKV----IKGDFTQDNIIDLEALLEDDELEA--NAFISKTQNRSLLRKIVDP-----ETKERIYDRNSDS 225 (324) Q Consensus 161 ~----~~~~~~~~----~~~~~~~~~i~~~~~~l~~~~~~~--~~~v~~~~~~~~L~~l~d~-----~g~~~~~~~~~~~ 225 (324) . .....+.. ......++.|.++..+|...+... -.++++|..+..|.+-... ++.-.+..+..++ T Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~~~~~~~~~~~~~G~vg~ 239 (347) T protein:vir:88 160 QAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALIDPETGNIRN 239 (347) T ss_pred ccccccccccccccchhhhHHHHHHHHHHHHHHHhhcCCCCCCCEEEeCHHHHHHHhcchhhhhhhhccccchhcceeee Confidence 1 11111110 111223677888888888776532 3688999999888543221 2222244566678 Q ss_pred ecccceEeecCccCCCc-----------------------eEEEeecccEE----------EEEecceEEEEeecccccc Q lcl|NC_011614. 226 LDGLPVVNLKSSNLKRG-----------------------ELITGDFDKLI----------YGIPQLIEYKIDETAQLST 272 (324) Q Consensus 226 l~G~pv~~~~~~~~~~~-----------------------~i~~gd~~~~~----------~~~~~~~~i~~~~~~~~~~ 272 (324) +.|++|+.++..+.+.. .-+.+|++... .+.-.++.+|..++. T Consensus 240 i~G~~V~~s~nlp~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~d~~~~~~l~~~~~a~g~v~~~d~~~e~~r~~---- 315 (347) T protein:vir:88 240 VMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARRP---- 315 (347) T ss_pred eccceEEEeecccccccccccccccccccccccccccccccccccccCcEEEEEechhhhhheecccceeeeeech---- Confidence 99999998765543111 01222333211 111223334433322 Q ss_pred cccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeeccC Q lcl|NC_011614. 
273 VKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADAK 316 (324) Q Consensus 273 ~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~~~ 316 (324) ..| ...+++...+|.+++||++.+.++...++ T Consensus 316 ---------~~~---~d~i~~~~~~G~~~~rPe~a~~~~~~~a~ 347 (347) T protein:vir:88 316 ---------EFQ---ADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) T ss_pred ---------hhH---HHHhhhhhhhcCceeccceEEEEEeCCCC Confidence 122 22467888999999999998888766555 No 136 >protein:vir:10450 Length: 344 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848297;genbank:gi:30387487;genbank:GeneID:1733971 Probab=99.24 E-value=2.7e-13 Score=89.50 Aligned_cols=284 Identities=13% Similarity=0.067 Sum_probs=159.4 Q ss_pred hhccchhh---hhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecC-CCceEEEEEeCCcceeeeccccc Q lcl|NC_011614. 15 ASNNVKPQ---VFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPME-GTEKKFTFWADKPGAYWVGEGQK 90 (324) Q Consensus 15 ~~~~~~~~---~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~-~~~~~ip~~~~~~~a~~v~Eg~~ 90 (324) +.+-.... ..+.-....+.+--.+.-+.+..+|++.....++++++.++.++. +.+.++|+. +..++..+..|++ T Consensus 1 ma~~~~~~~~n~~~~~~~~~~~~~~al~ie~~~geV~~~f~~~s~~~~~~~~r~i~~g~s~~~~~i-G~~~~~~~~~G~~ 79 (344) T protein:vir:10 1 MANMTGGQQLGTNQGKDVMAAGDKLALFLKVFGGEVLTAFARTSVTTSRHMVRSISSGKSAQFPVL-GRTQAAYLAPGEN 79 (344) T ss_pred CccccccccCCcccCCccCCccchhHHHHHHHHHHHHHHHHHHhhhcccceeeeecccceEEEEee-ceeEEEeeecCCC Confidence 11100000 000000011111112334899999999999999999999987776 556778876 5566777888888 Q ss_pred cccc--ccceeeEEeeeee-EEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhc----cCc------CcCC- Q lcl|NC_011614. 91 IETS--KATWVNATMRAFK-LGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILN----QGN------NPFG- 156 (324) Q Consensus 91 ~~~~--~~~~~~v~~~~~k-~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g----~g~------~~~~- 156 (324) .+.+ ++.-+++++...+ ......|.+----++..++.+.+.++.+.++++..|+.++.- ... 
.+.+ T Consensus 80 l~~t~~~~~~~e~~l~ID~~~y~~~~VdDiD~~q~~~D~r~~~~~~~G~aLA~~~D~~i~~~la~~a~~~~~~~~~~~g~ 159 (344) T protein:vir:10 80 LDDIRKDIKHTEKVITIDGLLTADVLIYDIEDAMNHYDVRSEYTSQLGESLAMAADGAVLAEIAGLCNVESQYNENITGL 159 (344) T ss_pred CCCCCCCcccceEEEEEcchhhhhhhhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccccc Confidence 7654 4667776676655 233344444333345578999999999999999999988631 110 0110 Q ss_pred -ccccccccccc-----ceeecccchhHHHHHHHHhhhhccCCC--EEEEcHHHHHHHHHhhcc-----CCceeeccCCC Q lcl|NC_011614. 157 -KSIAQSIEKTN-----KVIKGDFTQDNIIDLEALLEDDELEAN--AFISKTQNRSLLRKIVDP-----ETKERIYDRNS 223 (324) Q Consensus 157 -~~~~~~~~~~~-----~~~~~~~~~~~i~~~~~~l~~~~~~~~--~~v~~~~~~~~L~~l~d~-----~g~~~~~~~~~ 223 (324) .+......... ....+..-++.+.++..+|...+.... .++++|..+..|..-+.- .|.-.+..+.. T Consensus 160 ~~~~~~~~~~~~~~~t~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~~~~~~~~~~~~~G~V 239 (344) T protein:vir:10 160 GTATVIETTQDKTTLTDQVALGKEIIAALTKARAALTKNYVPSSDRVFYCDPDSYSAILAALMPNAANYAALIDPEKGSI 239 (344) T ss_pred cccceeecccccccccchhhhHHHHHHHHHHHHHHHhhcCCCccCCEEEeChHHHHHHhhcccccccccccccceeeeEE Confidence 11111111111 111112235667888888887775432 567899999988643221 12222334555 Q ss_pred ceecccceEeecCccCC----CceEE---------------EeecccE----------EEEEecceEEEEeecccccccc Q lcl|NC_011614. 224 DSLDGLPVVNLKSSNLK----RGELI---------------TGDFDKL----------IYGIPQLIEYKIDETAQLSTVK 274 (324) Q Consensus 224 ~~l~G~pv~~~~~~~~~----~~~i~---------------~gd~~~~----------~~~~~~~~~i~~~~~~~~~~~~ 274 (324) +++.|++|+.++..+.+ ..... ..+|+.. ..+...++++|..++. T Consensus 240 ~~v~G~~V~~Sn~lp~~~~~~~~~~~tg~~~~~~~~~~~~~~~~~s~~~~l~~h~~A~~~v~~~~~~~e~~r~~------ 313 (344) T protein:vir:10 240 RNVMGFEVVEVPHLTAGGAGTSREGTTGQKHAFPATKSGNDKVAKDNVIGLFMHRSAVGTVKLRDLALERARRA------ 313 (344) T ss_pred EEEeceEEEeccccccccCCcccccccCccccccCCcccceeeecceeEEEeechhhhhhhhhccceeecccch------ Confidence 67899999987644311 00111 1122221 1222233334443321 Q ss_pred cccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeecc Q lcl|NC_011614. 
275 NEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADA 315 (324) Q Consensus 275 ~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~~ 315 (324) ..|. ..+++..-+|.+++||+|.+.++.++- T Consensus 314 -------~~~~---d~i~g~~~~G~~vlRPe~a~~v~~~~~ 344 (344) T protein:vir:10 314 -------NFQA---DQIIAKYAMGHGGLRPEAAGAVVFKTK 344 (344) T ss_pred -------hHHH---HHHHHHhhcccceecccceEEEEeecC Confidence 2232 245677889999999999987777644 No 137 >protein:vir:94576 Length: 347 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919012;genbank:gi:119637776;genbank:GeneID:5179336 Probab=99.23 E-value=8.4e-13 Score=86.77 Aligned_cols=284 Identities=13% Similarity=0.077 Sum_probs=163.8 Q ss_pred hhccchhhhh--ccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecC-CCceEEEEEeCCcceeeecccccc Q lcl|NC_011614. 15 ASNNVKPQVF--NPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPME-GTEKKFTFWADKPGAYWVGEGQKI 91 (324) Q Consensus 15 ~~~~~~~~~~--~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~-~~~~~ip~~~~~~~a~~v~Eg~~~ 91 (324) +.+-...... |......+.+--.+.-+.+..+|++.....+.++++.++..+. +.+..||+. +..++..+..|.++ T Consensus 1 ma~~~~~~~~~t~~g~~~~~~d~~al~ie~~~geV~~~f~~~s~~~~~~~~rti~~G~sv~~~~i-G~~~~~~~~~G~~l 79 (347) T protein:vir:94 1 MANMNGGQQMGKDQGKGMSAGDKLALFLKVFGGEVLTAFTRTSVTMNKHLVRSIQSGKSAQFPVL-GRTKAAYLQPGENL 79 (347) T ss_pred CCccccccccccccccCCcccchHHHHHHHHhHHHHHHHHHHHhhhhhhhheeccccceEEeeec-cceeEeeeecCcCC Confidence 1111111100 0000001111112445999999999999999999999887755 556778864 45567778888887 Q ss_pred cc--cccceeeEEeeeeeE-EEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHh----ccC----cCcCCccc- Q lcl|NC_011614. 92 ET--SKATWVNATMRAFKL-GVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGIL----NQG----NNPFGKSI- 159 (324) Q Consensus 92 ~~--~~~~~~~v~~~~~k~-~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~----g~g----~~~~~~~~- 159 (324) .. .++..++.++...++ .....|-+---.++..++.+.+.++.+.++++..|+.++. +.. +...+.+. 
T Consensus 80 ~~~~~~~~~~e~~ltID~~~y~~~~VddiD~~q~~~D~rs~~~~~~g~ALA~~~D~~i~~~l~~~a~~~~~~~~~~~g~~ 159 (347) T protein:vir:94 80 DDKRKDMKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTAQLGESLAMAADGAVLAEMAKLCNLPTANNENIAGLG 159 (347) T ss_pred CCCcCCccccceEEEEcchhhhhhhhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccCC Confidence 55 357788877776554 3334454433344557899999999999999999998862 111 00001110 Q ss_pred ---cccccc-----ccceeecccchhHHHHHHHHhhhhccCC--CEEEEcHHHHHHHHHhhccC-Cce----eeccCCCc Q lcl|NC_011614. 160 ---AQSIEK-----TNKVIKGDFTQDNIIDLEALLEDDELEA--NAFISKTQNRSLLRKIVDPE-TKE----RIYDRNSD 224 (324) Q Consensus 160 ---~~~~~~-----~~~~~~~~~~~~~i~~~~~~l~~~~~~~--~~~v~~~~~~~~L~~l~d~~-g~~----~~~~~~~~ 224 (324) ...+.. ......+...++.+.++..+|...+... -.++++|..+..|.+..+.+ +.+ .+..+..+ T Consensus 160 ~~~~v~i~~~~~~~~~~~~~~~~~~d~i~~a~~~Lde~dVP~~~R~~vv~P~~y~~LLk~~~~~~~~~~~~~~~~~G~V~ 239 (347) T protein:vir:94 160 KAHVLEVGDQATLQGDQVKLGQAIIAQLTLARAKLTGNYVPSSDRVFYTTPDNYSAILAALMPNAANYQALIDPSTGSIR 239 (347) T ss_pred cceeEeeeccccccccccccHHHHHHHHHHHHHHhhhcCCCCCCCEEEeChHHHHHHHHhhcccccccccccccccceeE Confidence 000100 0011112334777888989988777543 24677999998876543322 221 23345567 Q ss_pred eecccceEeecCccCCC-----------------------ceEEEeecccE----------EEEEecceEEEEeeccccc Q lcl|NC_011614. 225 SLDGLPVVNLKSSNLKR-----------------------GELITGDFDKL----------IYGIPQLIEYKIDETAQLS 271 (324) Q Consensus 225 ~l~G~pv~~~~~~~~~~-----------------------~~i~~gd~~~~----------~~~~~~~~~i~~~~~~~~~ 271 (324) ++.|++|+.++..+... ..-|-+||+.. ..+.-.++++++.++.. T Consensus 240 ~v~G~~V~~Sn~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~d~~~~~~l~~~~~A~~tv~~~~~~~e~~~~~~-- 317 (347) T protein:vir:94 240 NVMGFEVIEVPHLTAGGAGDNRAEEGVAPTNQKHAFPDTASGDTRVALDNVVGLFNHRSAVGTVKLKDMALERARRAN-- 317 (347) T ss_pred EeeceEEEEcCccccccCcccccccccccccccccccccccccccccccceEEEEechhhhhhhhhcccceeeeechh-- Confidence 89999999876543211 01133344332 12223344444443321 Q ss_pred ccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeecc Q lcl|NC_011614. 
272 TVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADA 315 (324) Q Consensus 272 ~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~~ 315 (324) .| ...+.+..-+|..++||++.+.++.+.+ T Consensus 318 -----------~~---~~~i~~~~a~G~g~~rPe~a~~i~~~~a 347 (347) T protein:vir:94 318 -----------FQ---ADQIIAKYAMGHGGLRPEACGALVFKKA 347 (347) T ss_pred -----------hh---hhhhhhhhhhcCcccccceeEEEEecCC Confidence 12 2235677789999999999988876655 No 138 >protein:vir:100057 Length: 375 # NCBI annotation: T7-like capsid protein # Family: family:all:975 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214206;genbank:gi:61806429;genbank:GeneID:3294737 Probab=99.22 E-value=8.1e-12 Score=81.38 Aligned_cols=292 Identities=9% Similarity=0.008 Sum_probs=161.1 Q ss_pred hhccchhhhhccccccccCCCc-----ceechhhhHHHHHHHHhhcchhhhceeeecC-CCceEEEEEeCCcceeeeccc Q lcl|NC_011614. 15 ASNNVKPQVFNPDNVMMHEKKD-----GTLLNDFTTPILQEVMENSKIMQLGKYEPME-GTEKKFTFWADKPGAYWVGEG 88 (324) Q Consensus 15 ~~~~~~~~~~~a~~~~~~~~~g-----~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~-~~~~~ip~~~~~~~a~~v~Eg 88 (324) +.+....+-=+.+..+.+..+| .+.-+.+..+|++.....++++++.++..+. +.+.+||+. +..++....-| T Consensus 1 ~~~~~~~~~~~~n~~t~~~~~~~~~~~al~le~f~geV~~~f~~~si~~~~~~~rti~~Gksv~f~~i-G~~t~~~~t~G 79 (375) T protein:vir:10 1 MANANQVALGRSNLSTGTGYGGATDKYALYLKLFSGEMFKGFQHETIARDLVTKRTLKNGKSLQFIYT-GRMTSSFHTPG 79 (375) T ss_pred CccccccccCccccCCccccccccchHHHHHHHHhHHHHHHHHHHHhhhccccccccccCceEEEEee-eeeEEeeecCC Confidence 3332222211111112222111 3455889999999999999999999987766 555678886 55667777667 Q ss_pred cccc---ccccceeeEEeeeeeE-EEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHh----ccCcCc------ Q lcl|NC_011614. 89 QKIE---TSKATWVNATMRAFKL-GVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGIL----NQGNNP------ 154 (324) Q Consensus 89 ~~~~---~~~~~~~~v~~~~~k~-~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~----g~g~~~------ 154 (324) +++. ..+...++.++...+. .....|.+---.++..++.+.+.++.++++++..|+.++. +..... 
T Consensus 80 ~~i~~~~~~d~~~te~~l~ID~~~y~~~~VdDiD~aqa~~Dlr~e~s~~~G~aLA~~~D~~i~~~l~kaa~~~~p~~~~~ 159 (375) T protein:vir:10 80 TPILGNADKAPPVAEKTIVMDDLLISSAFVYDLDETLAHYELRGEISKKIGYALAEKYDRLIFRSITRGARSASPVSATN 159 (375) T ss_pred cCcCCccccCCCCCceEEEecchhhhhhhHhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccc Confidence 6653 2344455544554433 3334444432334557899999999999999999998863 111100 Q ss_pred --CCcccccc---cccccceeecccchhHHHHHHHHhhhhccCC--CEEEEcHHHHHHHHHhhccC--------Cceeec Q lcl|NC_011614. 155 --FGKSIAQS---IEKTNKVIKGDFTQDNIIDLEALLEDDELEA--NAFISKTQNRSLLRKIVDPE--------TKERIY 219 (324) Q Consensus 155 --~~~~~~~~---~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~--~~~v~~~~~~~~L~~l~d~~--------g~~~~~ 219 (324) .+.+.... ........+....++.+.++..+|...+... -.++++|..|..|.+-+|.+ +..+.. T Consensus 160 ~~~~Gg~~i~~~sg~~~~~~~ta~~~~~ai~~a~~~Lde~~VP~~~R~~vv~P~~y~~Ll~~~d~~~~~n~d~~~~~~~~ 239 (375) T protein:vir:10 160 FVEPGGTQIRVGSGTNESDAFTASALVNAFYDAAAAMDEKGVSSQGRCAVLNPRQYYALIQDIGSNGLVNRDVQGSALQS 239 (375) T ss_pred ccccCcceeeeccccccccccCHHHHHHHHHHHHHHHhhcCCCCCCCEEEeChHHHHHHHhcCCccceeeecccccceec Confidence 00111100 1111111223445788888988988777643 35789999998886544322 222233 Q ss_pred cCCCceecccceEeecCccCCC-----------------------------------ceEEEeec-------------cc Q lcl|NC_011614. 220 DRNSDSLDGLPVVNLKSSNLKR-----------------------------------GELITGDF-------------DK 251 (324) Q Consensus 220 ~~~~~~l~G~pv~~~~~~~~~~-----------------------------------~~i~~gd~-------------~~ 251 (324) .+..+++.|++|+.++..+... ..-|-+|| +. T Consensus 240 ~g~v~~i~Gv~V~~Sn~lP~~~~~~~~~g~~~~~~a~~~~~~~~~~~~~~~~~~~g~~~~y~~d~~~~~~~~~~~~~~~A 319 (375) T protein:vir:10 240 GNGVIEIAGIHIYKSMNIPFLGKYGVKYGGTTGETSPGNLGSHIGPTPENANATGGVNNDYGTNAELGAKSCGLIFQKEA 319 (375) T ss_pred cceEEEEeceEEEEeccccccccccccccccccccchhhhhccccccCCcceeeccccccccccccccCceEEEEEchhh Confidence 3334578899998865433211 11223333 11 Q ss_pred EEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeeccCCCCc Q lcl|NC_011614. 
252 LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADAKPSSV 320 (324) Q Consensus 252 ~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~ 320 (324) +..+.-.++++++++. ... -.+-...+.+..-+|..++||+|.+.|+..++.+..= T Consensus 320 ~g~v~~~~~~~~~~~~--------~~~-----~~~q~~~i~~~~a~G~~~lrp~~av~l~~~~~~~~~~ 375 (375) T protein:vir:10 320 AGVVEAIGPQVQVTNG--------DVS-----VIYQGDVILGRMAMGADYLNPAAAVELYIGATAPSAF 375 (375) T ss_pred eeeeeeeccccccccc--------hhh-----heeeeeeeeeeeeeccCccCceeEEEEecCcCccccC Confidence 2222223333333210 000 0122334667788999999999988886543222222 No 139 >protein:vir:6324 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877471;genbank:gi:33300843;uniprot:Q7Y2D3;genbank:GeneID:1482613 Probab=99.14 E-value=2e-11 Score=79.22 Aligned_cols=287 Identities=11% Similarity=0.017 Sum_probs=162.5 Q ss_pred hhccchhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecC-CCceEEEEEeCCcceeeecccccccc Q lcl|NC_011614. 15 ASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPME-GTEKKFTFWADKPGAYWVGEGQKIET 93 (324) Q Consensus 15 ~~~~~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~-~~~~~ip~~~~~~~a~~v~Eg~~~~~ 93 (324) |.+.. ..-|..-.....+- .+.-+++..+|.+.....++++++.++.++. +.+..+|+. +..+++...-|+++.. T Consensus 1 ms~~~--~~tr~~~~~s~~d~-al~le~f~geV~~af~~~s~~~~~~~~rti~~g~s~~~~~i-G~~~~~~~~pG~~l~~ 76 (335) T protein:vir:63 1 MSFLN--DLTRPNYAGKNADV-DIHLEEHLGIVDKHFAYTSKFAPLMNIRDLRGSNVVRLDRL-GNVEAKGRRAGEELER 76 (335) T ss_pred CCCcc--cchhhhcccccchh-heehhhhhhhHHHHHHhhhhhccccceeeeccceeEEEeee-eeeeeecccCCcCcCC Confidence 22110 00011111122222 3555999999999999999999999888866 445688886 5677888888888877 Q ss_pred cccceeeEEeeeeeE-EEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHH----hccCcCc----CC---ccccc Q lcl|NC_011614. 
94 SKATWVNATMRAFKL-GVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGI----LNQGNNP----FG---KSIAQ 161 (324) Q Consensus 94 ~~~~~~~v~~~~~k~-~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l----~g~g~~~----~~---~~~~~ 161 (324) +.+..++..+....+ .....|-+----++..++.+.+.+++++++++..|++++ .+..... .+ .++.. T Consensus 77 ~~~~~~k~~itVD~ll~a~~~I~dlDe~~~~yDvRse~s~e~G~aLA~~~D~~~~~~i~~aa~~~a~~~~~~~~~~G~~~ 156 (335) T protein:vir:63 77 SRVVNDKWNLTVDTLLYLRHQFDHQDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVDLEDAFSPGVLE 156 (335) T ss_pred CCccccceEEEecceeechhhhhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccCCCcCCCcce Confidence 777778877777553 333444432223455789999999999999999999875 2211110 00 11111 Q ss_pred ccccccceeeccc--chhHHHHHHHHhhhhccC-----CCEEEEcHHHHHHHHHhhccC--------CceeeccCCCcee Q lcl|NC_011614. 162 SIEKTNKVIKGDF--TQDNIIDLEALLEDDELE-----ANAFISKTQNRSLLRKIVDPE--------TKERIYDRNSDSL 226 (324) Q Consensus 162 ~~~~~~~~~~~~~--~~~~i~~~~~~l~~~~~~-----~~~~v~~~~~~~~L~~l~d~~--------g~~~~~~~~~~~l 226 (324) ....+........ -++.+.++.++|...+.. .-..+++|..|..|..-..-- |.-.+..+....+ T Consensus 157 ~~~~tg~~~~~~~~~l~~a~~~a~~~L~e~dVP~~~~~dr~~vv~P~~y~~Ll~~~~l~n~~~~~s~~~~~~~~g~v~~v 236 (335) T protein:vir:63 157 KLDLTGLTAKQAADKIVRMHRRVVETFIDRDLGDAVYSEGLTPMSPRVFSLLLEHDKLMNVEYQATGATNDYVKSRVAIL 236 (335) T ss_pred eeeeccCcccccHHHHHHHHHHHHHHHHhccCCCcccCceEEEeChHHHHHHhccccccccccccccccccccCceeEEe Confidence 1111111111111 123445677777766543 246899999999986532111 1122334445678 Q ss_pred cccceEeecCccC---------CCceEEEeecccE----------EEEEecceEEEEeecccccccccccccchhhhhcC Q lcl|NC_011614. 227 DGLPVVNLKSSNL---------KRGELITGDFDKL----------IYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQD 287 (324) Q Consensus 227 ~G~pv~~~~~~~~---------~~~~i~~gd~~~~----------~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~ 287 (324) .|+||+.++..+. +....+.+|+... ..+...++..++.++.. .|.. 
T Consensus 237 ~Gv~V~~sn~lP~~~~t~~~lg~a~n~~~~d~~~~~~~~~~~~Al~t~~~~~vt~e~~~~~~-------------~~~~- 302 (335) T protein:vir:63 237 NGVKVLETPRFATKAIAAHPLGRHFNVSAEESERQIALFLPSKTLITAQVAPVQAKLWEDNE-------------KFSW- 302 (335) T ss_pred eceEEEeeccCCCCCcccccccccCCccccccceeEEEEEecceEEEEEEeecccceeeccc-------------hhhH- Confidence 9999987754321 2333445555332 22222333333332221 1211 Q ss_pred cEEEEEEEEeccEEecccceEEEEeeccCCCCc-cc Q lcl|NC_011614. 288 MVALRATMHVALHIADDKAFAKLVPADAKPSSV-PG 322 (324) Q Consensus 288 ~v~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~-~~ 322 (324) .+.+..-+|..++||+|.+.++.+ -.|+.. .+ T Consensus 303 --~i~~~~a~G~g~lRPe~a~~i~~t-g~~~~~~~~ 335 (335) T protein:vir:63 303 --VLDTFQMYNIGARRPDTAGAIELK-GIGAFDITA 335 (335) T ss_pred --HhHHHHHcCCcccccceEEEEEEc-CCCceeecC Confidence 133444589999999999999863 333332 44 No 140 >protein:vir:3364 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523335;genbank:gi:17570826;genbank:GeneID:927448 Probab=99.12 E-value=1.7e-11 Score=79.61 Aligned_cols=286 Identities=13% Similarity=0.057 Sum_probs=156.8 Q ss_pred hhccchhhhh--ccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecC-CCceEEEEEeCCcceeeecccccc Q lcl|NC_011614. 15 ASNNVKPQVF--NPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPME-GTEKKFTFWADKPGAYWVGEGQKI 91 (324) Q Consensus 15 ~~~~~~~~~~--~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~-~~~~~ip~~~~~~~a~~v~Eg~~~ 91 (324) +.+-..-... |.......++.-.+.-+.+..+|++.....+.++++.+..+.. +.+..||+.. 
..++.-...|+++ T Consensus 1 ~~~~~~~~~~~t~~g~~~~~~~~~al~ie~~~g~V~~~f~~~s~~~~~v~~r~~~~G~sv~i~~iG-~~t~~~~~~g~~l 79 (347) T protein:vir:33 1 MANIQGGQQIGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTMPRHMLRSIASGKSAQFPVIG-RTKAAYLKPGENL 79 (347) T ss_pred CCCCccCcccccccccCCcccchHHHHHHHHHHHHHHHHHHHHhhhhhhccccccccceeEeeecc-ceeeeeecCCCCC Confidence 1111110000 0000001111111334999999999999999999998876644 5567787754 4556667777776 Q ss_pred cc--cccceeeEEeeeeeE-EEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHh-----ccCc------CcCC- Q lcl|NC_011614. 92 ET--SKATWVNATMRAFKL-GVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGIL-----NQGN------NPFG- 156 (324) Q Consensus 92 ~~--~~~~~~~v~~~~~k~-~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~-----g~g~------~~~~- 156 (324) +. .+....+.++...+. .....|.+---.++..++.+.+.++.+.++++..|+.++. +... .+.+ T Consensus 80 ~~~~~~~~~~e~~ltiD~~~y~~~~VddiD~~q~~~D~~~~~~~~~g~aLA~~~D~~i~~~l~~~~~~~~~~~~~~~~~~ 159 (347) T protein:vir:33 80 DDKRKDIKHTEKVIHIDGLLTADVLIYDIEDAMNHYDVRAEYTAQLGESLAMAADGAVLAELAGLVNLPDGSNENIEGLG 159 (347) T ss_pred CCCCCCCccceEEEEechhhhhhHHHhhHHHHhcCCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccccc Confidence 54 346667766655433 2223343333334557899999999999999999998872 1110 0000 Q ss_pred ccc-cccccccccee-----ecccchhHHHHHHHHhhhhccCC--CEEEEcHHHHHHHHHhh-----ccCCceeeccCCC Q lcl|NC_011614. 157 KSI-AQSIEKTNKVI-----KGDFTQDNIIDLEALLEDDELEA--NAFISKTQNRSLLRKIV-----DPETKERIYDRNS 223 (324) Q Consensus 157 ~~~-~~~~~~~~~~~-----~~~~~~~~i~~~~~~l~~~~~~~--~~~v~~~~~~~~L~~l~-----d~~g~~~~~~~~~ 223 (324) ... .......+... .+..-++.+.++..+|..++... -.++++|..+..|.+-. |..|...+..+.. 
T Consensus 160 ~~~~~~~~~~~tg~~~d~~~~a~~i~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~~~d~~~~~~~~~G~V 239 (347) T protein:vir:33 160 KPTVLTLVKPTTGSLTDPVELGKAIIAQLTIARASLTKNYVPAADRTFYTTPDNYSAILAALMPNAANYQALLDPERGTI 239 (347) T ss_pred ccccccccccccccccchhhhHHHHHHHHHHHHHHHhhcCCCccCcEEEeCHHHHHHHhcccccccccccccccccccee Confidence 000 00000011111 11233677788888888777632 35789999999886432 2223334455666 Q ss_pred ceecccceEeecCccCCCc------------eE--------EEeecccE--E--------EEEecceEEEEeeccccccc Q lcl|NC_011614. 224 DSLDGLPVVNLKSSNLKRG------------EL--------ITGDFDKL--I--------YGIPQLIEYKIDETAQLSTV 273 (324) Q Consensus 224 ~~l~G~pv~~~~~~~~~~~------------~i--------~~gd~~~~--~--------~~~~~~~~i~~~~~~~~~~~ 273 (324) +++.|++|+.++..+.... .. +.++|+.. + .....++.++..++. T Consensus 240 ~~i~G~~V~~Sn~lp~~~~~~~~~~~~ag~~~~~~~~~~~~~~~a~~~~~gl~~h~~A~g~v~~~~~~~e~~r~~----- 314 (347) T protein:vir:33 240 RNVMGFEVVEVPHLTAGGAGDTREDAPADQKHAFPATSSTTVKVALDNVVGLFQHRSAVGTVKLKDLALERARRA----- 314 (347) T ss_pred EEEeceeEEEecccccCccccccccccccccccccCCcccceeccccceeeeeecchhheeeeeeceeeeeccch----- Confidence 7899999998764432110 01 12222211 1 111222233333321 Q ss_pred ccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeeccCC Q lcl|NC_011614. 274 KNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADAKP 317 (324) Q Consensus 274 ~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~~~~ 317 (324) ..| .-.+++...+|.+++||++.+.++.+--.. 
T Consensus 315 --------~~~---~d~i~~~~~~G~~vlrP~~av~i~~~~~~~ 347 (347) T protein:vir:33 315 --------NYQ---ADQIIAKYAMGHGGLRPEAAGAIVLPKVSE 347 (347) T ss_pred --------hhh---hHhhhhhhhcCCceecccceEEEecCCCCC Confidence 122 233567778899999999988887543333 No 141 >protein:vir:1541 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052109;swissprot:trembl:q9t107;genbank:gi:9634035;uniprot:Q9T107;genbank:GeneID:1262383 Probab=99.12 E-value=2.1e-11 Score=79.18 Aligned_cols=286 Identities=12% Similarity=0.040 Sum_probs=154.5 Q ss_pred hhccchhhhh--ccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecC-CCceEEEEEeCCcceeeecccccc Q lcl|NC_011614. 15 ASNNVKPQVF--NPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPME-GTEKKFTFWADKPGAYWVGEGQKI 91 (324) Q Consensus 15 ~~~~~~~~~~--~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~-~~~~~ip~~~~~~~a~~v~Eg~~~ 91 (324) |.+-..-... |........+--.+.-+.+..+|++.....+.++++.++.+.. +.+..||+.. ..++.-...|.++ T Consensus 1 ma~~~~~~~~~t~~~~~~~~~~~~a~~ie~f~g~V~~~f~~~s~~~~~~~~~~~~~G~sv~i~~ig-~~t~~~~~~g~~l 79 (347) T protein:vir:15 1 MANIQGGQQIGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTMPRHMLRSIASGKSAQFPVIG-RTKAAYLKPGENL 79 (347) T ss_pred CCccccCCccccccccCCCcchHHHHHHHHHHHHHHHHHHHhhhhhhccccccccccceeEeeecc-ceeeeeeccCCCC Confidence 1111110000 0000000111112344778889999999999999998876654 5667788765 4566777778877 Q ss_pred cc--cccceeeEEeeeeeE-EEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhcc--C--cC---cCC---cc Q lcl|NC_011614. 92 ET--SKATWVNATMRAFKL-GVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQ--G--NN---PFG---KS 158 (324) Q Consensus 92 ~~--~~~~~~~v~~~~~k~-~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~--g--~~---~~~---~~ 158 (324) +. .+.+.++.++...+. .....|.+---.++..++.+.+.++.+.++++..|+.++.-- . .. 
..+ .+ T Consensus 80 ~~~~~~~~~~e~~ltID~~~~~~~~VddlD~~q~~~D~~~~~~~~~g~aLA~~~D~~i~~~l~~~~~~~~~~~~~~~~~g 159 (347) T protein:vir:15 80 DDKRKDIKHTEKVIHIDGLLTADVLIYDIEDAMNHYDVRAEYTAQLGESLAMAADGAVLAELAGLVNLPDASNENIEGLG 159 (347) T ss_pred CCCCCCCccceEEEEechhhhhhHHhhhHHHHhcCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccC Confidence 54 446677766665433 222334333333455789999999999999999999887310 0 00 000 00 Q ss_pred ---cccccccccceeec-----ccchhHHHHHHHHhhhhccCC--CEEEEcHHHHHHHHHhhc-----cCCceeeccCCC Q lcl|NC_011614. 159 ---IAQSIEKTNKVIKG-----DFTQDNIIDLEALLEDDELEA--NAFISKTQNRSLLRKIVD-----PETKERIYDRNS 223 (324) Q Consensus 159 ---~~~~~~~~~~~~~~-----~~~~~~i~~~~~~l~~~~~~~--~~~v~~~~~~~~L~~l~d-----~~g~~~~~~~~~ 223 (324) +............. ..-++.+.++..+|..++... -.++++|..+..|.+-.+ ..|...+..+.. T Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~i~d~~~~a~~~Lde~~VP~~gR~~vv~P~~y~~LL~~~~~~~~d~~~~~~~~~G~V 239 (347) T protein:vir:15 160 KPTVLTLVKPTTGDLTDPVELGKAIIAQLTIARASLTKNYVPAADRTFYTTPDNYSAILAALMPNAANYQALIDHERGTI 239 (347) T ss_pred ccccccccccccccchhhhhHHHHHHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHhcccccccccccccccccceEE Confidence 00000000000001 111455666777787766532 356789999988864322 223333455666 Q ss_pred ceecccceEeecCccCCCc------------eE--------EEeecc----------cEEEEEecceEEEEeeccccccc Q lcl|NC_011614. 224 DSLDGLPVVNLKSSNLKRG------------EL--------ITGDFD----------KLIYGIPQLIEYKIDETAQLSTV 273 (324) Q Consensus 224 ~~l~G~pv~~~~~~~~~~~------------~i--------~~gd~~----------~~~~~~~~~~~i~~~~~~~~~~~ 273 (324) +++.|++|+.++..+.... .. +.++|+ .+..+...++.++..++. T Consensus 240 g~i~G~~V~~Sn~lp~~~~t~~~~~~~~g~~~~~~~~~~~~~~~~f~~~~~l~~h~~A~g~v~~~~~~~e~~~~~----- 314 (347) T protein:vir:15 240 RNVMGFEVVEVPHLTAGGAGDTREDAPADQKHAFPATSSTTVKVALDNVVGLFQHRSAVGTVKLKDLALERARRA----- 314 (347) T ss_pred EEEeceEEEecccccccccccccccccccccccccccccceeeeccccceeeeeccceeeeeEeeceeeeecccc----- Confidence 7899999998764432110 01 111111 111222233344443322 Q ss_pred ccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeeccCC Q lcl|NC_011614. 
274 KNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADAKP 317 (324) Q Consensus 274 ~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~~~~ 317 (324) ..| ...+|+...+|.+++||++.+.++.+--.. T Consensus 315 --------~~~---~d~i~~~~~~G~~vlrP~~av~~~~~~~~~ 347 (347) T protein:vir:15 315 --------NYQ---ADQIIAKYAMGHGGLRPEAAGAIVLPKVSE 347 (347) T ss_pred --------hhh---hhhhehhhhcCCceeccccEEEEecCCCCC Confidence 112 234677778899999999988886543332 No 142 >protein:vir:78739 Length: 332 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285448;genbank:gi:148724482;genbank:GeneID:5220210 Probab=99.11 E-value=9.5e-12 Score=81.02 Aligned_cols=287 Identities=14% Similarity=0.036 Sum_probs=161.7 Q ss_pred HHHhhccchhhhhccccccccCCCc-ceechhhhHHHHHHHHhhcchhhhceeeecC-CCceEEEEEeCCcceeeecccc Q lcl|NC_011614. 12 QHFASNNVKPQVFNPDNVMMHEKKD-GTLLNDFTTPILQEVMENSKIMQLGKYEPME-GTEKKFTFWADKPGAYWVGEGQ 89 (324) Q Consensus 12 ~~~~~~~~~~~~~~a~~~~~~~~~g-~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~-~~~~~ip~~~~~~~a~~v~Eg~ 89 (324) ..+-++--.+...|......+.+.. .+.-+.+..+|++.....+.++++.+..+.. +.+..||+. +..++.-...|. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~d~~~al~le~~~geV~~~f~~~s~~~~~~~~r~i~~G~tv~i~~i-g~~~~~~~~~g~ 79 (332) T protein:vir:78 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFT-GKLSAGYHTPGT 79 (332) T ss_pred CcccccccCCccccCCccccccccchhhhhhhhhhhHHHHHHHHhhhhhccccccccccceEEEEec-cceeEeeecCCC Confidence 1111121122222222112222211 3445999999999999999999998876655 566788886 455666667777 Q ss_pred cccc-cccceeeEEeeeee-EEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHh----ccCcCcCCccc---- Q lcl|NC_011614. 90 KIET-SKATWVNATMRAFK-LGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGIL----NQGNNPFGKSI---- 159 (324) Q Consensus 90 ~~~~-~~~~~~~v~~~~~k-~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~----g~g~~~~~~~~---- 159 (324) .+.. .+++-+++++...+ ......|.+---.++..++.+.+.++.+.++++.+|+.++. +...+....+. 
T Consensus 80 ~l~~~~~~~~~~~~l~ID~~ky~~~~VddiD~~q~~~dl~~~~~~~~g~aLA~~~D~~i~~~l~~aa~~~~~~~~~~g~~ 159 (332) T protein:vir:78 80 PIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGEPGGF 159 (332) T ss_pred CCCCCCCCCCceEEEEEehhhhhHHHHHhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccCccccccccc Confidence 7643 34666666666654 23333443322234556899999999999999999988763 21111111111 Q ss_pred ccccccccceeecccchhHHHHHHHHhhhhccCCC--EEEEcHHHHHHHHHhhccC-------C-ceeeccC-CCceecc Q lcl|NC_011614. 160 AQSIEKTNKVIKGDFTQDNIIDLEALLEDDELEAN--AFISKTQNRSLLRKIVDPE-------T-KERIYDR-NSDSLDG 228 (324) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~--~~v~~~~~~~~L~~l~d~~-------g-~~~~~~~-~~~~l~G 228 (324) ...+... ...+....++.|.++..+|...+.... .++++|..+..|.+.+|.. + .-.+..+ ..+++.| T Consensus 160 ~~~~~~~-~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~d~~~~n~~~~~~~~~~~~g~~i~~i~G 238 (332) T protein:vir:78 160 HVNIGAG-NTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKGLYSIAG 238 (332) T ss_pred ccccCCc-cccCHHHHHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHHhhcCceeeeeeccccccceecceeeeEEee Confidence 1111111 111223356778899999988776433 4677999998886543321 1 1122222 3568999 Q ss_pred cceEeecCccCC------------CceEEEeecccE--EEEEe--------cceEEEEeecccccccccccccchhhhhc Q lcl|NC_011614. 229 LPVVNLKSSNLK------------RGELITGDFDKL--IYGIP--------QLIEYKIDETAQLSTVKNEDGTPVNLFEQ 286 (324) Q Consensus 229 ~pv~~~~~~~~~------------~~~i~~gd~~~~--~~~~~--------~~~~i~~~~~~~~~~~~~~~~~~~~~f~~ 286 (324) ++|+.++..+.. ....+.|+|+.. ++..+ .++++++.+. ......|. T Consensus 239 ~~V~~Sn~lp~~~g~~~~~~~~~~~~n~~~~~~~~~~~~~~h~~a~~~v~~~~~~~~~t~~----------~~~~~~~~- 307 (332) T protein:vir:78 239 IRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSG----------DFNVQYQG- 307 (332) T ss_pred eEEEecCccccCcccccccccccccccccccccccceEEeecccceeeeeeeccchhhhhc----------ccchhhhH- Confidence 999987644321 122344555431 11111 2222222110 01112232 Q ss_pred CcEEEEEEEEeccEEecccceEEEEee Q lcl|NC_011614. 
287 DMVALRATMHVALHIADDKAFAKLVPA 313 (324) Q Consensus 287 ~~v~~r~~~r~d~~v~~~~a~~~l~~~ 313 (324) -.++....+|.+++||++++.|+.+ T Consensus 308 --d~i~~~~~~G~~v~rPe~~v~l~~a 332 (332) T protein:vir:78 308 --DLIVGKLAMGCGSLRTSVAGSFQAA 332 (332) T ss_pred --hhhhhhhhhcCceecccceEEEeeC Confidence 2466777899999999999999887 No 143 >protein:vir:103323 Length: 364 # NCBI annotation: major capsid-like protein # Family: family:all:2806 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039668;genbank:gi:125999997;genbank:GeneID:4818399 Probab=99.11 E-value=9.5e-11 Score=75.54 Aligned_cols=290 Identities=11% Similarity=-0.005 Sum_probs=157.6 Q ss_pred hhccchhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecC-CCceEEEEEeCCcceeeecccccccc Q lcl|NC_011614. 15 ASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPME-GTEKKFTFWADKPGAYWVGEGQKIET 93 (324) Q Consensus 15 ~~~~~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~-~~~~~ip~~~~~~~a~~v~Eg~~~~~ 93 (324) |..... .+--....+.+--.+.-+.+..+|.+.....+.++++..+.++. +.+.++|+. +..+++...-|++... T Consensus 1 ms~~n~---~t~~~~~~~~~~~al~le~f~geV~taf~~~s~~~~~~~~rti~~gkS~q~~~i-G~~~~~~~~~G~~ld~ 76 (364) T protein:vir:10 1 MSNPNV---LTQPAVSASGEVDSLLIEKFNNRVHEQYLKGENLLQWFDVQEVVGTNSVSNKYI-GETELQVLSPGKSPDA 76 (364) T ss_pred CCCccc---ccccccccccchhhhhhhhhhhhHHHHHHHHHhhcCcceeeeecccceEEeeee-eeeEEeeeccCcccCC Confidence 221100 00001111112223455899999999999999999999888766 455788886 4556666666666655 Q ss_pred cccceeeEEeeeeeE-EEeehhHHHHHhcChhH-HHHHHHHHHHHHHHHHHHHHHHh----ccCcC-----cCCcccccc Q lcl|NC_011614. 94 SKATWVNATMRAFKL-GVILPVTKEFLNYTYSQ-FFEEMKPMIAEAFYKKFDEAGIL----NQGNN-----PFGKSIAQS 162 (324) Q Consensus 94 ~~~~~~~v~~~~~k~-~~~v~iS~ell~~s~~~-~~~~v~~~l~~ai~~~~d~a~l~----g~g~~-----~~~~~~~~~ 162 (324) +.+.-++.++....+ .....|-+=---++..+ +.+.+.+++++++++..|+.++. 
+.-.+ ..+.+...+ T Consensus 77 ~~~~~~k~~itID~ll~a~~~V~diDe~q~~~D~vR~e~s~e~G~ALA~~~Dq~i~~~v~~aa~a~~~~~~~~~~~~~~g 156 (364) T protein:vir:10 77 SPTEFDKNRLVVDTTVIARNTVAHFHDVQNDIDGLKSKLSVNQAKKLKKMEDSMVIQQLVLGGISNTEAIRKNPRVAGHG 156 (364) T ss_pred CCcccCcEEEEecceeeechhhhhHHHHhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccCCcccCCc Confidence 666677767766543 22233322111234456 78899999999999999998752 10011 111111100 Q ss_pred c-ccccceee-----cccchhHHHHHHHHhhhhccCC--CEEEEcHHHHHHHHHhhcc-------CCceeeccCCCceec Q lcl|NC_011614. 163 I-EKTNKVIK-----GDFTQDNIIDLEALLEDDELEA--NAFISKTQNRSLLRKIVDP-------ETKERIYDRNSDSLD 227 (324) Q Consensus 163 ~-~~~~~~~~-----~~~~~~~i~~~~~~l~~~~~~~--~~~v~~~~~~~~L~~l~d~-------~g~~~~~~~~~~~l~ 227 (324) . .......+ ...-.+.+.++...|.+.+... -.++++|..|..|.+-.+- .+.-.+..+....+. T Consensus 157 ~~i~~~~~a~~~~~~~~~l~~ai~~a~~~LdEkdVP~~~R~~vv~P~~y~~Ll~~~~lvn~d~~~~~~~~~~~G~v~~v~ 236 (364) T protein:vir:10 157 FSIHIVGLASSFLTSPQYMMAAIEMAMEQQTEQEVDTSELCGLMPWTAFNCLRDADRIVDKSYTIAASDNTVDGFVLKSW 236 (364) T ss_pred ceeeecccCcchhhhHHHHHHHHHHHHHHHhhcCCCccccEEEeChHHHHHHhcCCccccccccccCCCccccceeEEEe Confidence 0 00011111 1122444556777777766543 4688999999988653211 011223345556789 Q ss_pred ccceEeecCccCC-------------------------------CceEEEeecccEEEEEecceEEEEeecccccccccc Q lcl|NC_011614. 228 GLPVVNLKSSNLK-------------------------------RGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNE 276 (324) Q Consensus 228 G~pv~~~~~~~~~-------------------------------~~~i~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~ 276 (324) |+||+.++..+.. ....++...+.+..+...++..++.++... T Consensus 237 Gv~Vv~Sn~lP~~~~~~~~t~~~t~h~ls~~~~g~~y~v~~d~~~~~~~~f~~~Al~tv~~~~~t~e~~~~~~~------ 310 (364) T protein:vir:10 237 NTPIVPSNRFPKLSDNTEGTGNTKHHKLSNAGNGNRYDVTAGQTSAQAVLFTQDALLVGRTISITGDIFYEKKE------ 310 (364) T ss_pred ceEEEeccccccccccccccccccccccccccCCcccccccccceeEEEEEecceEEEEEEecceeeeeeccce------ Confidence 9999876543210 111122222223333344555554443221 Q ss_pred cccchhhhhcCcEEEEEEEEeccEEecccceEEEEeeccCCCCc-cccC Q lcl|NC_011614. 
277 DGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADAKPSSV-PGEV 324 (324) Q Consensus 277 ~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~-~~~~ 324 (324) | ...+.+..-+|..++||+|.+.++..++.+..+ -+-+ T Consensus 311 -------~---~~~ida~~a~G~g~lRPeaa~~i~~~~~~~~~~~~~~~ 349 (364) T protein:vir:10 311 -------K---TWYIDTFLAEGAIPDRWEAVAVVTAADTAELATDHNAI 349 (364) T ss_pred -------e---eeeeeeehcccCcccCccceEEEEecCCCCCccchhhh Confidence 1 122345566999999999999997655544433 2223 No 144 >protein:vir:94711 Length: 347 # NCBI annotation: capsid # Family: family:all:975 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338120;genbank:gi:77118198;genbank:GeneID:3707734 Probab=99.10 E-value=3e-12 Score=83.73 Aligned_cols=284 Identities=13% Similarity=0.092 Sum_probs=154.7 Q ss_pred hhccchhhhhccccccccCCCc--ceechhhhHHHHHHHHhhcchhhhceeeecC-CCceEEEEEeCCcceeeecccccc Q lcl|NC_011614. 15 ASNNVKPQVFNPDNVMMHEKKD--GTLLNDFTTPILQEVMENSKIMQLGKYEPME-GTEKKFTFWADKPGAYWVGEGQKI 91 (324) Q Consensus 15 ~~~~~~~~~~~a~~~~~~~~~g--~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~-~~~~~ip~~~~~~~a~~v~Eg~~~ 91 (324) |.+..- ..+....-....++. .+.-+++..+|+......+.++++.+..++. +.+..||+. +..++.-+..|+++ T Consensus 1 m~~~~~-~~~~t~~g~~~~~~d~~al~ik~f~~eV~~~f~~~s~~~~~~~~r~i~~G~sv~i~~i-G~~tv~~~t~G~~l 78 (347) T protein:vir:94 1 MANVPG-QKIGTDQGKGKSSSDALALFLKVFAGEVLTAFTRRSVTADKHIVRTIQNGKSAQFPVM-GRTSGVYLAPGERL 78 (347) T ss_pred CCCCCc-cccccccccCCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccccceEEEecc-cceeeeeecCCCCc Confidence 222111 101000001111111 3445889999999999999999998887765 555678886 56677777878877 Q ss_pred ccc--ccceeeEEeeeeeE-EEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhc-----cCcCc---CCccc- Q lcl|NC_011614. 92 ETS--KATWVNATMRAFKL-GVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILN-----QGNNP---FGKSI- 159 (324) Q Consensus 92 ~~~--~~~~~~v~~~~~k~-~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g-----~g~~~---~~~~~- 159 (324) +.+ ..+-.+++|...+. .....|-+---.++..++.+.+.++.+.++++..|+.++.- ...+. ...+. 
T Consensus 79 ~~~~~~~~~~e~~itID~~~~~~~~VddiD~~q~~~D~~~~~~~~~g~aLa~~~D~~i~~~~~~~aa~~~~~~~~~~g~~ 158 (347) T protein:vir:94 79 SDKRKGIKHTEKVITIDGLLTADVMIFDIEDAMNHYDVAGEYSNQLGEALAIAADGAVLAEMAILCNLPAASNENIAGLG 158 (347) T ss_pred CCCCCCCCcceEEEEecchhhhhHHhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccCCCc Confidence 543 34455544554443 22223323212234568999999999999999999987631 11010 00010 Q ss_pred cccc---ccccceee----cccchhHHHHHHHHhhhhccCC--CEEEEcHHHHHHHHHhhccC-----CceeeccCCCce Q lcl|NC_011614. 160 AQSI---EKTNKVIK----GDFTQDNIIDLEALLEDDELEA--NAFISKTQNRSLLRKIVDPE-----TKERIYDRNSDS 225 (324) Q Consensus 160 ~~~~---~~~~~~~~----~~~~~~~i~~~~~~l~~~~~~~--~~~v~~~~~~~~L~~l~d~~-----g~~~~~~~~~~~ 225 (324) .... .......+ ....++.|.++..+|..++... -.++++|..+..|..-++.+ +.-.+..+..++ T Consensus 159 ~~s~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~~R~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~Vg~ 238 (347) T protein:vir:94 159 TASVLEVGKKADLDTPAKLGEAIIGQLTIARAKLTSNYVPAGDRYFYTTPDNYSAILAALMPNAANYAALIDPETGNIRN 238 (347) T ss_pred ccceeeccccccccchhhhHHHHHHHHHHHHHHHhhcCCCCCCcEEEeCHHHHHHHhccchhhhhhccccccccccceEE Confidence 0000 01111101 1223566777788887766532 36789999998774332211 112234455678 Q ss_pred ecccceEeecCccCC---------CceE---------------EEeecccE--EE--------EEecceEEEEeeccccc Q lcl|NC_011614. 226 LDGLPVVNLKSSNLK---------RGEL---------------ITGDFDKL--IY--------GIPQLIEYKIDETAQLS 271 (324) Q Consensus 226 l~G~pv~~~~~~~~~---------~~~i---------------~~gd~~~~--~~--------~~~~~~~i~~~~~~~~~ 271 (324) ++|++|+.++..+.. ...+ +.++|+.. ++ +...++++|..++ T Consensus 239 i~G~~V~~Sn~lp~~~~t~~~~~~~~~~~aG~~~~~~~~~~~~~~~~~~~~~~l~~h~~A~~~v~~~~~~~e~~r~---- 314 (347) T protein:vir:94 239 VMGFVVVEVPHLVQGGAGETRGDDGITIASGQKHAFPATASSDVKVTMDNVVGLFSHRSAVGTVKLRDLALERDRD---- 314 (347) T ss_pred EeceEEEecCcccccccccccccCcceecCcccccccccchhhhcccccceeEEEeehhhhhhhhcccccccchhc---- Confidence 999999987654321 0011 12222221 11 1111222232222 Q ss_pred ccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeeccC Q lcl|NC_011614. 
272 TVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADAK 316 (324) Q Consensus 272 ~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~~~ 316 (324) .+.|. ..+++..-+|.+++||++.+.++..++. T Consensus 315 ---------~~~~~---d~i~~~~~~G~~~~rP~~a~~~~~~~A~ 347 (347) T protein:vir:94 315 ---------VDAQG---DLIVGKYAMGHGGLRPEAAGALVFSPAE 347 (347) T ss_pred ---------hhhHH---HHhhhhhhhcCcccccceeEEEEecCCC Confidence 12232 3578888999999999999999876555 No 145 >protein:vir:78935 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522824;genbank:gi:158345059;genbank:GeneID:5687425 Probab=99.10 E-value=4.5e-11 Score=77.30 Aligned_cols=288 Identities=11% Similarity=0.049 Sum_probs=160.6 Q ss_pred hhccchhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecC-CCceEEEEEeCCcceeeecccccccc Q lcl|NC_011614. 15 ASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPME-GTEKKFTFWADKPGAYWVGEGQKIET 93 (324) Q Consensus 15 ~~~~~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~-~~~~~ip~~~~~~~a~~v~Eg~~~~~ 93 (324) |.+.. ..-|..-....++- .+.-+.+..+|++.....++++++.++.++. +.+..+|+. +..+++...-|++... T Consensus 1 ms~~~--~~t~~~~~~s~~d~-al~le~f~geV~~af~~~s~~~~~~~~rti~~g~s~~~~~i-G~~~~~~~~pG~~l~~ 76 (335) T protein:vir:78 1 MSFLN--DLTRPNYAGKNADV-DIHLEEHLGIVDKHFAYTSKFAPLMNIRDLRGSNVVRLDRL-GNVEAKGRRAGEELER 76 (335) T ss_pred CCccc--cccccccccccchh-hhhhhhhhhHHHHHHHHhhhhccccceeeeccceeEEEeee-eeeeecccccCcccCC Confidence 22210 00111111222222 3555999999999999999999999888766 455788876 5667777888888877 Q ss_pred cccceeeEEeeeeeE-EEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHh----ccCcC-------cCCccccc Q lcl|NC_011614. 94 SKATWVNATMRAFKL-GVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGIL----NQGNN-------PFGKSIAQ 161 (324) Q Consensus 94 ~~~~~~~v~~~~~k~-~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~----g~g~~-------~~~~~~~~ 161 (324) +.+..++..+....+ .....|-+----++..++.+.+.+++++++++..|++++. +.... ....++.. 
T Consensus 77 ~~~~~~k~~itID~ll~a~~~VddlDe~~~~yDvR~e~s~~~G~aLA~~~Dq~~~~~l~~aa~~~a~~~~~~~~~~G~~~ 156 (335) T protein:vir:78 77 SRVVNDKWNLTVDTLLYLRHQFDHQDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVDLEDAFSPGVLE 156 (335) T ss_pred CCcccCCeEEEecceeechhhHhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccCCCcCCCcce Confidence 777778877776553 3334443322234557899999999999999999998762 21111 00111111 Q ss_pred ccccccce--eecccchhHHHHHHHHhhhhccC-----CCEEEEcHHHHHHHHHhhcc--------CCceeeccCCCcee Q lcl|NC_011614. 162 SIEKTNKV--IKGDFTQDNIIDLEALLEDDELE-----ANAFISKTQNRSLLRKIVDP--------ETKERIYDRNSDSL 226 (324) Q Consensus 162 ~~~~~~~~--~~~~~~~~~i~~~~~~l~~~~~~-----~~~~v~~~~~~~~L~~l~d~--------~g~~~~~~~~~~~l 226 (324) ....+... .......+.+.++..++...+.. .-+.+++|..|..|..-..- +|.-.+..+....+ T Consensus 157 ~~~~tg~~~~~~~~~l~~a~~~a~~~l~ekdvP~~~~~~rv~vv~P~~y~~Ll~~~~l~n~~~~~s~~~~~~~~g~v~~v 236 (335) T protein:vir:78 157 KLDLTGLTAKEAAEKIVRMHRRVVETFIERDLGDAVYSEGLTPMSPRVFSLLLEHDKLMSVEYQATGATNDYVKSRVAIL 236 (335) T ss_pred eeeeccccccccHHHHHHHHHHHHHHHHhccCCCCCCCccEEEeChHHHHHHhcccccccccccccccccccccceeEEe Confidence 11111111 11112234445555556554442 24689999999998653211 11222334455678 Q ss_pred cccceEeecCccC---------CCceEEEeeccc----------EEEEEecceEEEEeecccccccccccccchhhhhcC Q lcl|NC_011614. 227 DGLPVVNLKSSNL---------KRGELITGDFDK----------LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQD 287 (324) Q Consensus 227 ~G~pv~~~~~~~~---------~~~~i~~gd~~~----------~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~ 287 (324) .|+||+.++..+. +.+..+.+|++. +..+...++.-++.++.. .|.. T Consensus 237 ~Gv~V~~Sn~lP~~~~t~~~lg~a~n~~~~d~~~~~~~~~~~~Al~t~~~~~~~~e~~~~~~-------------~~~~- 302 (335) T protein:vir:78 237 NGVKVLETPRFATKAISAHPLGRHFNVSAEEAERQIALFLPSKTLITAQVAPVQAKLWEDHD-------------QFSW- 302 (335) T ss_pred eceEEEeeccCCCCCCccccccccCCcccccccceEEEEEecceEEEEEEEecccceeeccc-------------hhhH- Confidence 9999988754332 122233344432 223333333333333221 1211 Q ss_pred cEEEEEEEEeccEEecccceEEEEeeccCCCCccc Q lcl|NC_011614. 
288 MVALRATMHVALHIADDKAFAKLVPADAKPSSVPG 322 (324) Q Consensus 288 ~v~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~~~ 322 (324) .+.+..-+|..++||+|.+.++.+....-.-.+ T Consensus 303 --~i~~~~a~G~g~lRPe~a~~i~~tg~~~~~~~~ 335 (335) T protein:vir:78 303 --VLDTFQMYNIGARRPDTAGAIELKGIEAFDITA 335 (335) T ss_pred --hhhHHHHcCCcccCcceEEEEEecCCCcccccC Confidence 233445589999999999998754322222234 No 146 >protein:vir:9927 Length: 295 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795689;genbank:gi:28876459;genbank:GeneID:1258000 Probab=99.08 E-value=8.8e-12 Score=81.18 Aligned_cols=265 Identities=12% Similarity=0.053 Sum_probs=151.8 Q ss_pred hccccccccCCCcceechhh---hHHHHHHHHhhcchhhhceeeecCC-CceEEEEEeCCcceeeeccccccccccccee Q lcl|NC_011614. 24 FNPDNVMMHEKKDGTLLNDF---TTPILQEVMENSKIMQLGKYEPMEG-TEKKFTFWADKPGAYWVGEGQKIETSKATWV 99 (324) Q Consensus 24 ~~a~~~~~~~~~g~lip~~~---~~~i~~~~~~~s~l~~l~~~~~~~~-~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~ 99 (324) +.-.+.+.. ..|.+++. .+.+-+.+.+-..++...+.+||.. ..+++|.|.....+.-|+||+++|-++.+.+ T Consensus 1 mAe~nlt~~---~dL~~~~sidfv~~f~~~i~~L~~~Lgi~r~~p~a~G~tIt~pK~~~tgda~dVaEGe~Iplskvt~~ 77 (295) T protein:vir:99 1 MAEKNLNTM---ADLGDIKSIDFVNKFSKNINDLLKLLGVTRRETLTNDLKIQTYKWEVTLDQTDPGEGETIPLSKVTRT 77 (295) T ss_pred CCCcccccH---hhccCceeehhhHHhhhhHHHHHHHhccccccccccCCeEEeeeeeeecccccccCCcccchhhheee Confidence 111112222 33443333 3444444444445555568888874 4579999998888899999999999999876 Q ss_pred ---eEEeeeeeEEEeehhHHHHHhcCh-hHHHHHHHHHHHHHHHHHHHHHHHhccCcCcCCcccccccccccceeeccc- Q lcl|NC_011614. 100 ---NATMRAFKLGVILPVTKEFLNYTY-SQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDF- 174 (324) Q Consensus 100 ---~v~~~~~k~~~~v~iS~ell~~s~-~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~~~~~~~~~~~~~~~~~~~~~~- 174 (324) ..+++.+|++..+ |+|.++.|. .+....-.++|..+++.++|+.+|.--.++ ..+.++.. 
T Consensus 78 ~~~t~t~kikK~rK~t--TdEAIqlsGygdpvgead~qL~~~ia~kId~D~~~~lkta-------------t~t~tg~~l 142 (295) T protein:vir:99 78 KDKDYTVKWFKKRRAT--TAEAIARHGAARAITEADKRIMRELQNGIKDAFFTFLKTK-------------PTKVKGVGL 142 (295) T ss_pred eeeeeEEEeeeecccc--cHHHHHhcCCCchhHHHHHHHHHHHHHhhhHHHHHHhccC-------------ceeeehhhH Confidence 4777778887754 999986554 578888999999999999999998642211 11111221 Q ss_pred --chhHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCC-ce-eeccCCCceecccc-eEeecCccCCCceEEEeec Q lcl|NC_011614. 175 --TQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPET-KE-RIYDRNSDSLDGLP-VVNLKSSNLKRGELITGDF 249 (324) Q Consensus 175 --~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~g-~~-~~~~~~~~~l~G~p-v~~~~~~~~~~~~i~~gd~ 249 (324) .+..+.+....+.+.+..+.+.++||.+...|++-..-+. +. .|-...--.++|.- ++.+ ..++++.++..-. T Consensus 143 q~a~a~~~~al~~f~Ee~~~~~V~FVnP~D~a~yl~~A~~~~~~a~~fG~~~L~nfLG~q~II~S--~kv~~G~~~aT~~ 220 (295) T protein:vir:99 143 QKALSASWAKLATFNEFEGSPLVSFVSPLDVANYLGDTKVGADASNVFGMTLLKNFLGMQNVIVM--PSVPEGKIYSTAV 220 (295) T ss_pred HHHHHHhhhhhhhcccccCCceEEEEehHHHHHHHhccccccchhhhhhhhhhhhhhccceEEEc--ccCCCceEEEeec Confidence 2333333344444455556688999999998865332211 10 01000011488987 6554 4566667776665 Q ss_pred ccEEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEE-------------ec---cEEecccceEEEEee Q lcl|NC_011614. 250 DKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMH-------------VA---LHIADDKAFAKLVPA 313 (324) Q Consensus 250 ~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r-------------~d---~~v~~~~a~~~l~~~ 313 (324) .++.+.+..-=.=++. + .. + |..|.+.+-...+ +. .-.-+++++++.+.. T Consensus 221 ~Ni~~ay~~~~~g~l~-~-~f-----------~-~~~D~tglIg~~h~~~~~~~t~et~~~~~~~lfpE~~dgiv~~tI~ 286 (295) T protein:vir:99 221 ENLVFASLNVKGGDLG-G-LF-----------A-DFTDETGLIAAARNRQLSNLTYESVFFGANVLFAEIPEGVVEATIE 286 (295) T ss_pred cceEEEEecCCchhhh-h-hh-----------h-hccCcccceEEEeccccceeeehhhhHhHHHhcccccceEEEEEEe Confidence 5554433210000000 0 00 0 1122222221111 11 113467889999998 Q ss_pred ccCCCCccc Q lcl|NC_011614. 
314 DAKPSSVPG 322 (324) Q Consensus 314 ~~~~~~~~~ 322 (324) ++.++..-| T Consensus 287 ~~~~~~~~~ 295 (295) T protein:vir:99 287 AAAVPGIGG 295 (295) T ss_pred cCcCCCCCC Confidence 888777777 No 147 >protein:vir:99675 Length: 324 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249589;genbank:gi:68299740;genbank:GeneID:3799990 Probab=99.04 E-value=2.8e-11 Score=78.46 Aligned_cols=246 Identities=12% Similarity=0.078 Sum_probs=135.9 Q ss_pred hceeeecCCCceEEEEEeCCcceeeecccccccc--cccceeeEEee--eeeEEEeehhHHHHHhcChhHHHHHHHHHHH Q lcl|NC_011614. 60 LGKYEPMEGTEKKFTFWADKPGAYWVGEGQKIET--SKATWVNATMR--AFKLGVILPVTKEFLNYTYSQFFEEMKPMIA 135 (324) Q Consensus 60 l~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~--~~~~~~~v~~~--~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~ 135 (324) +++.+.- +.+.++|+. +..++....-|+++.. .++.-.+.++. ..++.. ..|-+---.++..++.+...++.+ T Consensus 1 ~vr~i~~-g~s~~~~~i-G~~~~~~~~~G~~l~~~~~~~~~~e~~itID~~l~~~-~~VdDiD~~qa~~Dlr~e~s~~~G 77 (324) T protein:vir:99 1 MTRTITS-GKSAQFPVM-GRTKARYLKQGQSLDDGREDIKHTEKVITIDGLLTTD-VLIYDIEDAMNHYDVRSEYSTQMG 77 (324) T ss_pred Ceeeeec-CceEEEeee-eeeEeccccCCCCcCCCcCCcCcccEEEEecchhhhh-hhhhhHHHHhcCccchhHHHHHHH Confidence 5555444 567889887 5667777777777643 44555554444 333332 333332223455789999999999 Q ss_pred HHHHHHHHHHHHhc--------cCc---CcCCcccccccccc----cceeecccchhHHHHHHHHhhhhccCC--CEEEE Q lcl|NC_011614. 136 EAFYKKFDEAGILN--------QGN---NPFGKSIAQSIEKT----NKVIKGDFTQDNIIDLEALLEDDELEA--NAFIS 198 (324) Q Consensus 136 ~ai~~~~d~a~l~g--------~g~---~~~~~~~~~~~~~~----~~~~~~~~~~~~i~~~~~~l~~~~~~~--~~~v~ 198 (324) .++++.+|+.++.- ... +..+.+........ .........++.+.++..+|...+... 
-.+++ T Consensus 78 ~aLA~~~Dq~i~~~~a~~~~~~a~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~dai~~a~~~Lde~~VP~~gR~~vv 157 (324) T protein:vir:99 78 EALAMAADVANYAEMAKLVNSRKETTNENIEGLGAASLVKITGKKEDPAKYGTQVIQALTYARAAFAKKYIPAGDRTFYT 157 (324) T ss_pred HHHHHHHHHHHHHHHHHhhhcccccccCCcccCCccceecccccccccccCHHHHHHHHHHHHHHHhhcCCCCCCCEEEe Confidence 99999999887621 000 11111111111111 111112234677888888888777543 35789 Q ss_pred cHHHHHHHHHhhc-----cCCceeeccCCCceecccceEeecCccCCCc-----------------------eEEEeecc Q lcl|NC_011614. 199 KTQNRSLLRKIVD-----PETKERIYDRNSDSLDGLPVVNLKSSNLKRG-----------------------ELITGDFD 250 (324) Q Consensus 199 ~~~~~~~L~~l~d-----~~g~~~~~~~~~~~l~G~pv~~~~~~~~~~~-----------------------~i~~gd~~ 250 (324) +|..+..|..-+. .++.-.+..+..+++.|++|+.++..+.... .-|.+|++ T Consensus 158 ~P~~y~~Ll~~~~~~~~~~~~~~~~~~G~V~~i~Gf~V~~Sn~lp~~~~t~~~~a~~~~~~~~~~~~~~~~~~ky~~d~~ 237 (324) T protein:vir:99 158 DPDTYSAILAALMPNAANYAALIDPETGNIRNVMGFEVVETPHMTAQMVTNPTDAFDGTGHIFPATGDSTTTGKMTVGAD 237 (324) T ss_pred ChHHHHHHhhcccccccccccccceecceEEEEeceEEEecCCccccccccccccccccccccccccccccccccccccC Confidence 9999988853321 1222234455667899999998764432110 01333333 Q ss_pred c----------EEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeeccC-CCC Q lcl|NC_011614. 251 K----------LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADAK-PSS 319 (324) Q Consensus 251 ~----------~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~~~-~~~ 319 (324) . +..+...++..+..++. ..|. -.+++..-+|..++||+|.+.++..+.. |.+ T Consensus 238 ~~~gl~~~~~a~~tv~~~~~~~e~~~~~-------------~~~~---d~i~~~~a~G~~~lRPe~a~~v~l~~~~~~~~ 301 (324) T protein:vir:99 238 NVVGLFVHRSAVATLKLKDMALERARRP-------------EYQA---DQIIAKYAMGHGGLRPEAVGAIIFEDGETPAV 301 (324) T ss_pred ceeEEEEehhheEEEeeecceecceech-------------hhHH---HhhhhhhhhcCcccccceEEEEEEccCccccc Confidence 2 12222222333333321 1222 3356667789999999999888754444 344 Q ss_pred ccccC Q lcl|NC_011614. 
320 VPGEV 324 (324) Q Consensus 320 ~~~~~ 324 (324) +|--+ T Consensus 302 ~~~~~ 306 (324) T protein:vir:99 302 APDVI 306 (324) T ss_pred cchhh Confidence 44332 No 148 >protein:vir:93858 Length: 400 # NCBI annotation: putative structural protein # Family: family:all:2417 # MgeID: mge:1479 # MgeName: 712 # Cross-refs: genbank:acc:YP_764266;genbank:gi:115315579;genbank:GeneID:5141552 Probab=99.02 E-value=3.3e-11 Score=78.03 Aligned_cols=289 Identities=13% Similarity=0.163 Sum_probs=171.3 Q ss_pred CchhhHHHHHHHHHhhccchhh---hhcc----ccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCceEE Q lcl|NC_011614. 1 MEQTQKLKLNLQHFASNNVKPQ---VFNP----DNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKF 73 (324) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~~---~~~a----~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~i 73 (324) .+.++....+.+-.+.+..... .-+| ..++. ++.-..+|.-+...|-..+..+.++++...+.+.++-...- T Consensus 87 LkT~~A~~~fa~~l~~nsg~sd~knaW~A~l~E~gvt~-td~n~iLP~~il~aIq~al~~~~~~~~f~~v~n~p~l~V~~ 165 (400) T protein:vir:93 87 IESQNAVTEFFDVLKKNSGKSEIKNAWSAKLAENGVTI-TDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSR 165 (400) T ss_pred hhhHHHHHHHHHHHHhhcCCcchhhhhhhhhhhccccc-CCchhhcchHHHHHHHHhhhccCCcccceeeecCCceeeec Confidence 1233333333222222222111 1111 12221 22334789999999999999999999988888885533221 Q ss_pred EEEeCCcceee-ecccccccccccceeeEEeeeeeEEEeehhHHHHHh--cChhHHHHHHHHHHHHHHHH-HHHHHHHhc Q lcl|NC_011614. 74 TFWADKPGAYW-VGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLN--YTYSQFFEEMKPMIAEAFYK-KFDEAGILN 149 (324) Q Consensus 74 p~~~~~~~a~~-v~Eg~~~~~~~~~~~~v~~~~~k~~~~v~iS~ell~--~s~~~~~~~v~~~l~~ai~~-~~d~a~l~g 149 (324) +- ....-+| ..-|+++.++..+|..-++.|+-+..+..+.+-..+ ++...+.+||+.+|...+.. 
..+.+++-| T Consensus 166 ~~--dt~~qa~gHk~G~~K~eq~~tl~~rtL~P~~VYk~~~la~~~~~~~~tygaL~nYVm~EL~q~vI~k~Ve~Aii~G 243 (400) T protein:vir:93 166 SF--DSANEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEG 243 (400) T ss_pred ch--hhhcccceeccCCcccceeeeeeeeccCHHHHHHHhhhhhhhhhccccHHHHHHHHHHHHHHHHHHHHhhhheeec Confidence 21 2222345 678999999999999999999888888777433332 22456899999999999996 579999999 Q ss_pred cCcCcC-Ccccccc----ccc-ccceeecccchhHHHH-HHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCceeeccCC Q lcl|NC_011614. 150 QGNNPF-GKSIAQS----IEK-TNKVIKGDFTQDNIID-LEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDRN 222 (324) Q Consensus 150 ~g~~~~-~~~~~~~----~~~-~~~~~~~~~~~~~i~~-~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~g~~~~~~~~ 222 (324) +|++.. ...-... +.. .....++...+.++.. +..-+.+-..+...++++|..|+.|+.++|++|.+.|..+. T Consensus 244 dG~Ngf~~~dk~t~Ik~I~~dt~kt~~a~~~~~qdl~E~~~d~~~~~aad~~~Iv~s~d~~A~L~~lk~a~~~a~f~~~n 323 (400) T protein:vir:93 244 DGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPTAGRRYLIVKAEDRKALLDELRQATANANVRIKN 323 (400) T ss_pred ccccccCCCcchhhhhhhhhhhhhhhhcCCccHHHHHHHHHhhhhhccCCceeEEeccchHHHHHHhcCCcceeeeeecc Confidence 887642 2111111 111 1111234444444433 33333333444567899999999999999999999985543 Q ss_pred Cc----eecccc-eEeecCccCCCceEEEeecccEEEEEecceEEEEeecccccccccccccchhh--hhcCcEEEEEEE Q lcl|NC_011614. 223 SD----SLDGLP-VVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNL--FEQDMVALRATM 295 (324) Q Consensus 223 ~~----~l~G~p-v~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~--f~~~~v~~r~~~ 295 (324) .. +-.|+. ++.......++..++. |-.. .++. .++ + .++. |.+|+--+.++. T Consensus 324 ~d~~IA~~fGv~~Lv~~Tr~~~~kp~V~V-Dek~-~i~~-~~~--~----------------t~~sf~~~tNs~~ilvet 382 (400) T protein:vir:93 324 DDTEIASEVGVDEIIVYTGSKALKPTVLV-DQKY-HIDM-QDL--T----------------KVDAFEWKTNSNMILVET 382 (400) T ss_pred ccchhhhhcccceeeeeccCCCCCceeee-ehhh-hccc-cCc--e----------------eccceeeeeccceEEeee Confidence 22 223432 2222233344433333 3221 1211 111 1 1111 356777788899 Q ss_pred EeccEEecccceEEEEee Q lcl|NC_011614. 
296 HVALHIADDKAFAKLVPA 313 (324) Q Consensus 296 r~d~~v~~~~a~~~l~~~ 313 (324) +.++.+.-|++-+.++.. T Consensus 383 lv~Gsi~~~N~~ay~~v~ 400 (400) T protein:vir:93 383 LTSGHVETYNAGAVITVS 400 (400) T ss_pred eeccceecccceeeEeeC Confidence 999999999999999887 No 149 >protein:vir:106647 Length: 303 # NCBI annotation: ORF011 # Family: family:all:1178 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239493;genbank:gi:66395226;genbank:GeneID:4555801 Probab=98.97 E-value=8.5e-11 Score=75.79 Aligned_cols=271 Identities=13% Similarity=0.111 Sum_probs=156.9 Q ss_pred hccc-cccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCc-e---EEEEEeCCcceeeecccccccccccce Q lcl|NC_011614. 24 FNPD-NVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTE-K---KFTFWADKPGAYWVGEGQKIETSKATW 98 (324) Q Consensus 24 ~~a~-~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~-~---~ip~~~~~~~a~~v~Eg~~~~~~~~~~ 98 (324) +.+. +.+.+++-+..+--++.+++-+.+.+-..++...+.+||..++ + ++|.++....+.-|+||+.||-++.+. T Consensus 1 M~~e~nl~~~~dL~~a~siDF~~~f~~~i~~L~~~LGv~r~~pla~Gt~iktyK~~~~~y~gda~dVaEGe~Iplskvt~ 80 (303) T protein:vir:10 1 MSAENNLINVEALGKAKSIDFANKLGVGLNKLFEALAIQNKIPMNVGSALKQYRFKVEDSEKPNGDVAEGDVIPLTKVTR 80 (303) T ss_pred CCCCcCCcchhhcccceeehhhhhhhhhHHHHHHHhhhhccccccCCceeeeeeeeceeeccccccccCCcccchhhhee Confidence 2221 2233344444455666777766666666677777888887543 3 355556667788999999999999886 Q ss_pred e---eEEeeeeeEEEeehhHHHHHhcC-hhHHHHHHHHHHHHHHHHHHHHHHHhccCcCcCCcccccccccccceeeccc Q lcl|NC_011614. 99 V---NATMRAFKLGVILPVTKEFLNYT-YSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDF 174 (324) Q Consensus 99 ~---~v~~~~~k~~~~v~iS~ell~~s-~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~~~~~~~~~~~~~~~~~~~~~~ 174 (324) . ..+++.+|++..+ |.|.++.+ ..+....-.++|.++++.++|+.+|.--.+ +..+...+..... 
T Consensus 81 ~~~~t~~~~~kK~rK~t--TdEAIqlsGyg~aVgetd~qL~~~Iq~kIdnd~~~~lkt---------aT~t~~~t~~t~~ 149 (303) T protein:vir:10 81 EQVDITELQFAKYRKST--SAEAIQAHGYDLAINQTDNEMIKYVQKKFRAKFFETLKS---------AIENGKRTNKTKL 149 (303) T ss_pred eecceEEEEeecccccc--cHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhh---------cccccccccceee Confidence 5 5778888888855 99999655 457788899999999999999999853211 1111122223345 Q ss_pred chhHHHHHHHHhh------hhccCCCEEEEcHHHHHHHHHhhccCCc-eeeccCCCceecccceEeecCccCCCceEEEe Q lcl|NC_011614. 175 TQDNIIDLEALLE------DDELEANAFISKTQNRSLLRKIVDPETK-ERIYDRNSDSLDGLPVVNLKSSNLKRGELITG 247 (324) Q Consensus 175 ~~~~i~~~~~~l~------~~~~~~~~~v~~~~~~~~L~~l~d~~g~-~~~~~~~~~~l~G~pv~~~~~~~~~~~~i~~g 247 (324) +.+.+.+++.... .......++++||.+...++.-..-+.+ .-|-...--.++|.-++.+ ..++++.++.. T Consensus 150 s~~glq~Al~~~~~kl~~~~ed~~~~V~FvNP~Daa~yl~~A~i~~~~t~fG~n~L~nfLG~~II~S--~kv~~G~~~~T 227 (303) T protein:vir:10 150 SAENLQGALSKGRANLSVLLDDEITPIAFVNPNDTAEYLANGFINSTGAQFGVNLLTPYVGVKIVEF--ADVPQGEVWMT 227 (303) T ss_pred cHHHHHHHHHhhhhhccccccccccEEEEEchHHHHHHhhcCCcchhhhhhhhhhhhhhhcceEEEe--ccCCCceEEEe Confidence 6666666655442 1222345889999999887642211111 1110000114889987664 45666677766 Q ss_pred ecccEEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEE-------------ec---cEEecccceEEEE Q lcl|NC_011614. 248 DFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMH-------------VA---LHIADDKAFAKLV 311 (324) Q Consensus 248 d~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r-------------~d---~~v~~~~a~~~l~ 311 (324) -..++.+.+... +=++. ...+ |..|.+.+-...+ +. .-.-+++++++.+ T Consensus 228 ~~~Ni~~ay~~~-~g~l~--~~f~------------~t~D~tglIGv~h~~~~~~~t~eT~~~~~~~lfpE~~dgiv~~t 292 (303) T protein:vir:10 228 VAENLNVAYANP-RGELS--RAFA------------FATDATGFVGVLHDIQPQRLTSDTIYASAISMFPENIDAVIKVT 292 (303) T ss_pred eccceEEEEecC-chhhh--hhhh------------hccccccceEEEeccccceeeehhHhHhHHHhcccccceEEEEE Confidence 655544433211 00000 0000 1112222211111 11 1134577899999 Q ss_pred eeccCCCCccc Q lcl|NC_011614. 
312 PADAKPSSVPG 322 (324) Q Consensus 312 ~~~~~~~~~~~ 322 (324) ......+-+|+ T Consensus 293 i~~~e~~~~~~ 303 (303) T protein:vir:10 293 IKKDEAGELPS 303 (303) T ss_pred EeccccCCCCC Confidence 87666666677 No 150 >protein:vir:102655 Length: 322 # NCBI annotation: Hypothetical protein # Family: family:all:6384 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_052979;genbank:gi:50282923;genbank:GeneID:2948122 Probab=98.97 E-value=1.4e-10 Score=74.53 Aligned_cols=282 Identities=11% Similarity=0.005 Sum_probs=155.7 Q ss_pred cchhhhhccccccccCCCcceechhhhHHHHHHHHh-hcchhhhceeeecCCCceEEEEEeCCcceeee---------cc Q lcl|NC_011614. 18 NVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVME-NSKIMQLGKYEPMEGTEKKFTFWADKPGAYWV---------GE 87 (324) Q Consensus 18 ~~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~-~s~l~~l~~~~~~~~~~~~ip~~~~~~~a~~v---------~E 87 (324) -.....+.+.-.+.++-...+| +++..++....++ .+.|++-++..+-.++.-.+-.+.. ....-+ .+ T Consensus 1 ~~~~~~~~~~~~Ms~~i~~~fv-~qy~~~v~~~~qq~~s~L~~tV~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~d 78 (322) T protein:vir:10 1 MKLNAIMSMLPLIAGDIDQAFV-QTYETTLRILSQQKSAKLKQYCQHKNESSESHNWETLAS-MDPDAVKRKRSRQQSAD 78 (322) T ss_pred CcccceeeeeeeeechhhhHHH-HHHHHHHHHHHHHhhhhhhcccccccccccccceeeccc-ccccccccccccccccC Confidence 0000111122223333222333 6666666655544 4556665553333333211111111 111111 11 Q ss_pred cc-cccccccceeeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhcc-CcCcCCc-ccccccc Q lcl|NC_011614. 88 GQ-KIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQ-GNNPFGK-SIAQSIE 164 (324) Q Consensus 88 g~-~~~~~~~~~~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~-g~~~~~~-~~~~~~~ 164 (324) +. ..|.....++...+..........|.+.-......++.+...+..+.+++++.|+.++.+- +....+. +...... 
T Consensus 79 ~~~dtp~~~~~~~~r~~~~~d~~~~~~VDd~D~~k~~~D~~~~~~~~~a~AL~R~~D~~I~~a~~g~a~~~~~gt~v~~~ 158 (322) T protein:vir:10 79 GTYPTPVNNKPFAKRRTNVDTYDTGHVVEQEDISQMLLDPNSALITSQAYAMARKTDDLIIAGAWKPASIKGTGQPVEFL 158 (322) T ss_pred cccCCCccccccceEEEeecccccceecchHHHHHhhcCchHHHHHHHHHHhhhHHHHHHHhhhhccccccccccccccC Confidence 11 2333333344434433333445677676666666789999999999999999999888642 1111111 0001111 Q ss_pred ccc--ceeecccchhHHHHHHHHhhhhccCCC---EEEEcHHHHHHHHHhh-----ccCC-ceeeccCCCceecccceEe Q lcl|NC_011614. 165 KTN--KVIKGDFTQDNIIDLEALLEDDELEAN---AFISKTQNRSLLRKIV-----DPET-KERIYDRNSDSLDGLPVVN 233 (324) Q Consensus 165 ~~~--~~~~~~~~~~~i~~~~~~l~~~~~~~~---~~v~~~~~~~~L~~l~-----d~~g-~~~~~~~~~~~l~G~pv~~ 233 (324) ... ......++++.++++...|..+..... .++++|..+..|.... |.+| ..++..+..++++|+.++. T Consensus 159 ss~~i~~g~~g~t~~kl~~a~~~l~~~dvp~d~~R~~vv~p~~~~~LL~d~~~ts~D~~~~~~l~~~G~ig~~lGf~~i~ 238 (322) T protein:vir:10 159 ATQEIGDGTKPISFDYVTEITERFLENEIEPEVSKVIVIGPTQARKLLQITEATSADYTSAMDLQSKGIITNWMGYTWIV 238 (322) T ss_pred CCcccccCccchhHHHHHHHHHHHHhcCCCCCCCeEEEeCHHHHHHHhcchhhhhhhcccchhhhhcCeeeeeeeEEEEE Confidence 111 112236789999999999988776542 4788999998885432 2222 3344557778999999988 Q ss_pred ecCccCC----------------CceEEEeecccEEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEe Q lcl|NC_011614. 234 LKSSNLK----------------RGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHV 297 (324) Q Consensus 234 ~~~~~~~----------------~~~i~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~ 297 (324) ++..+.. ....++...+.+.++...++..+++..... .+...+++.+-+ T Consensus 239 s~~lp~~~~t~~~~~~~~~~~~~~~~~~a~~k~Av~~a~~~dv~~~i~~~~~~---------------~~a~~I~~~~~~ 303 (322) T protein:vir:10 239 STRLDKFDPTQWGMAAEDGPQGDEIWCIAMTDMALGYHSCKDIWTKVAEDPSA---------------SFAWRIYSAFTA 303 (322) T ss_pred eccCCccccccccccccCCCCccceeEEEEecCceeEEEeeeeeEEeeccCCc---------------chhhhhhhhhhh Confidence 7543321 122445556677777777777776554321 122335667889 Q ss_pred ccEEecccceEEEEeeccC Q lcl|NC_011614. 
298 ALHIADDKAFAKLVPADAK 316 (324) Q Consensus 298 d~~v~~~~a~~~l~~~~~~ 316 (324) |..+++|++++.+....+- T Consensus 304 Ga~ri~~~gVv~i~~~e~~ 322 (322) T protein:vir:10 304 DCVRVEDEHIFKLRLKNSL 322 (322) T ss_pred CceEeccCcEEEEEEeccC Confidence 9999999999999987665 No 151 >protein:vir:9875 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795637;genbank:gi:28876404;genbank:GeneID:1257935 Probab=98.97 E-value=9.6e-11 Score=75.50 Aligned_cols=260 Identities=12% Similarity=0.106 Sum_probs=154.3 Q ss_pred cchhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCc-e-EEEEEeCCcceeeecccccccccc Q lcl|NC_011614. 18 NVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTE-K-KFTFWADKPGAYWVGEGQKIETSK 95 (324) Q Consensus 18 ~~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~-~-~ip~~~~~~~a~~v~Eg~~~~~~~ 95 (324) -+.++.+--.+.+.+++-+...--++.+++-+.+.+-..++...+.+||..++ + .+|.|+....+.-|+||+.+|-++ T Consensus 1 ~~~~~~~~e~nlt~~~dl~~~~siDf~~~f~~~i~~L~~~LGv~r~~pla~GstIkt~k~~~y~gda~dVaEGe~Iplsk 80 (296) T protein:vir:98 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGEVIPLSK 80 (296) T ss_pred CCCccccCcCCCcchhhhhhhhhhhhHHHHhhhHHHHHHHhhhcccccccCCCEEeeccceeeeeccccccCCcccchhh Confidence 22333333334455555555555667777777777766677777889988654 4 456788888889999999999999 Q ss_pred cceee---EEeeeeeEEEeehhHHHHHhcCh-hHHHHHHHHHHHHHHHHHHHHHHHhccCcCcCCcccccccccccceee Q lcl|NC_011614. 96 ATWVN---ATMRAFKLGVILPVTKEFLNYTY-SQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIK 171 (324) Q Consensus 96 ~~~~~---v~~~~~k~~~~v~iS~ell~~s~-~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~~~~~~~~~~~~~~~~~~~ 171 (324) .+... .+++.+|++..+ |+|.++.|. .+....-.++|..++++++|+.++.-..+.. 
..+ T Consensus 81 vt~~~~~t~t~~ikK~rK~t--TdEAIqlsGyg~aVgetd~qL~~~iq~kId~d~~t~LktaT------------~t~-- 144 (296) T protein:vir:98 81 VERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGT------------GTQ-- 144 (296) T ss_pred heeeecceEEEEeecccccc--CHHHHHhhcCCchhHHHHHHHHHHHHHhhhHHHHHHHhccc------------cee-- Confidence 98764 777778888775 999986554 5778889999999999999999986422111 000 Q ss_pred cccchhHH--------HHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccC-CceeeccCCCc-eecccceEeecCccCCC Q lcl|NC_011614. 172 GDFTQDNI--------IDLEALLEDDELEANAFISKTQNRSLLRKIVDPE-TKERIYDRNSD-SLDGLPVVNLKSSNLKR 241 (324) Q Consensus 172 ~~~~~~~i--------~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~-g~~~~~~~~~~-~l~G~pv~~~~~~~~~~ 241 (324) ..+.+.+ .++...+++.+....+.++||.+...+++ +++ +.......+.. .++|.-++. +..+++ T Consensus 145 -~~t~~~lQ~Ala~~~~~l~~~feded~~~~V~FVnP~D~a~ylg--~a~it~qt~fG~tyl~nfLG~~II~--S~kV~~ 219 (296) T protein:vir:98 145 -DALGAGLQGALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIA--KAGITTQTAFGLTYLVDFTGTVIIS--TNDVTK 219 (296) T ss_pred -eechhhHHHHHHHHhhhhhhhccccCCCceEEEEehHHHHHHhc--CCccchhheechhhhhhccccEEEE--cCcCCC Confidence 0122223 33334555554455678999999877642 221 11111122222 388875555 445667 Q ss_pred ceEEEeecccEEEEEec----ceEEEEeecccccccccccccchhhhhcCcEEEEEEEE-------------ec---cEE Q lcl|NC_011614. 242 GELITGDFDKLIYGIPQ----LIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMH-------------VA---LHI 301 (324) Q Consensus 242 ~~i~~gd~~~~~~~~~~----~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r-------------~d---~~v 301 (324) +.++..-..++.+.+.. ++.-.+. |..|.+.+-...+ +. .-. T Consensus 220 G~~~~T~~~Ni~~ay~~~~~~~l~~~f~------------------~~~d~tglIGv~h~~~~~~~t~eT~~~~~~~lfp 281 (296) T protein:vir:98 220 GEIWATVPENIIFAYINPNNSELAKEFN------------------LYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYP 281 (296) T ss_pred ceEEEeeecceEEEeecccccchhhhhc------------------cccccccceEEEeccccceeeehhHhHhHHHhcc Confidence 78887776665544321 1111110 1112222211111 11 113 Q ss_pred ecccceEEEEeeccC Q lcl|NC_011614. 
302 ADDKAFAKLVPADAK 316 (324) Q Consensus 302 ~~~~a~~~l~~~~~~ 316 (324) -+++++++.+..++. T Consensus 282 E~~dgiv~~tI~~~~ 296 (296) T protein:vir:98 282 ERIDGIVKVTLTPGV 296 (296) T ss_pred cccceEEEEEecCCC Confidence 456778888875444 No 152 >protein:vir:3136 Length: 322 # NCBI annotation: hypothetical protein # Family: family:all:11728 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640318;genbank:gi:21234405;genbank:GeneID:956058 Probab=98.96 E-value=8.5e-11 Score=75.78 Aligned_cols=288 Identities=8% Similarity=-0.029 Sum_probs=163.2 Q ss_pred hccccccccCCCccee-chhhhHHHHHHHHhhcchhhhceeeec-CCCceEEEEEeCCcceeeecccccccccccceeeE Q lcl|NC_011614. 24 FNPDNVMMHEKKDGTL-LNDFTTPILQEVMENSKIMQLGKYEPM-EGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNA 101 (324) Q Consensus 24 ~~a~~~~~~~~~g~li-p~~~~~~i~~~~~~~s~l~~l~~~~~~-~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~v 101 (324) +.- ...++..-.+| |+.|+.+|+.-+.+......+.++... .|..+.||... .+...-..+++.+.-...+-.++ T Consensus 1 ~~~--~n~ts~~qafi~~EiWsa~il~~l~~~Lv~~~~~~~~d~g~GDtV~InsIg-~~tV~dY~~~~~i~~d~ltt~~~ 77 (322) T protein:vir:31 1 MST--GNNTSNTQALIVSEIWADEIEDILHEKLLDVNIARVVDFPDGDKLTIPSVG-TPVVRSRPEQGDFTFDNLDTGEI 77 (322) T ss_pred CCC--CCCcccceEEeehhhhHHHHHHHhhhhhhhhhhhcccccCCCCeEEecccc-ccccccccCCCCcccccCCCceE Confidence 111 11333444555 999999999888888877777665443 35567777654 44555566777766555666655 Q ss_pred Eeee--eeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHh--ccCcC---cCCc-cccccccc--ccceee Q lcl|NC_011614. 102 TMRA--FKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGIL--NQGNN---PFGK-SIAQSIEK--TNKVIK 171 (324) Q Consensus 102 ~~~~--~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~--g~g~~---~~~~-~~~~~~~~--~~~~~~ 171 (324) ++.. .|+.+ ..|++... +...++.+...++.+++++..+|+.+.. -++.. ..+. ........ +..... 
T Consensus 78 ~l~IDq~KYfa-f~VdDD~~-Qa~~dl~~~~~~~aa~ala~~~D~fva~lL~~gA~~~~~~~~p~vin~~~~~iv~~gt~ 155 (322) T protein:vir:31 78 SIILRDEVYAG-NAISKKLR-QDSRWISNVGAMLPAEQARAIMERYQTDLLALGNAQFAGQNDPNVINGVPHRFVGTGTD 155 (322) T ss_pred EEEEehhhhhc-cccchhHH-HhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccCCcceecCCccceeccCCC Confidence 5554 44444 44888554 4667999999999999999999987632 11110 0000 00111110 111112 Q ss_pred cccchhHHHHHHHHhhhhccCC-C-EEEEcHHHHHHHHH-------hhccCCceeeccCC------CceecccceEeecC Q lcl|NC_011614. 172 GDFTQDNIIDLEALLEDDELEA-N-AFISKTQNRSLLRK-------IVDPETKERIYDRN------SDSLDGLPVVNLKS 236 (324) Q Consensus 172 ~~~~~~~i~~~~~~l~~~~~~~-~-~~v~~~~~~~~L~~-------l~d~~g~~~~~~~~------~~~l~G~pv~~~~~ 236 (324) ....|+.+++|..+|..+.... . ..|++|..+..|.. ++|..--.+...+. .++++|+-|+.++. T Consensus 156 ~~~ay~~lv~l~~kLdkanVP~~gR~vVV~P~~~~~L~~i~~~~~l~~D~rf~~i~~sG~a~g~~~Vg~~~GF~V~~SN~ 235 (322) T protein:vir:31 156 QTMDVTDFSRVNYVMTQSKMPMGGMIGIIDPSVAHHLETITNISNISNNPRWEGIVESGIAPDMQFVRSVYGIDLFVSNL 235 (322) T ss_pred chhhHHHHHHHHHHhccccCCCCCeEEEeCchhhhhhhhhhhhhhhhccccccccccccchhhHHHHHHHhceeeeeecc Confidence 3457899999999998877654 2 45788998877743 23321101112211 47899999999987 Q ss_pred ccCCCceEEEeecccEE-EEEecceEEEEeeccccc------ccccccccchhhhhcCcEEEEEEEEeccEEecccceEE Q lcl|NC_011614. 237 SNLKRGELITGDFDKLI-YGIPQLIEYKIDETAQLS------TVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAK 309 (324) Q Consensus 237 ~~~~~~~i~~gd~~~~~-~~~~~~~~i~~~~~~~~~------~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~ 309 (324) .+.++..++.|.-.... -|.....-...+.+...- +-....... -.+.--.+|..+|+|.++++|+.++. T Consensus 236 l~~~~~~i~aG~d~~~t~ag~~n~f~~~~~~~~~~~~~~~~~l~~~e~~r~---~~~~~d~~~~~~~~g~g~~r~e~l~~ 312 (322) T protein:vir:31 236 LADANETINAGGDARSTTAGKCNMFMNVSDMGLLPFVVAWKEMPTTKSFID---DYNDDLNTATTARWGNGLVRDENLVC 312 (322) T ss_pred ccccccccccCcccccccceeecccccccchhhhhhhhHhhhhhhhhcccC---ccccccceeeeeeecceeecccceEE Confidence 76666666666543221 122211111111111000 000000000 11334558899999999999999988 Q ss_pred EEeeccCCCC Q lcl|NC_011614. 
310 LVPADAKPSS 319 (324) Q Consensus 310 l~~~~~~~~~ 319 (324) |.--+...+- T Consensus 313 ~~a~~~~~~~ 322 (322) T protein:vir:31 313 VLANADKVTF 322 (322) T ss_pred EEeccccccC Confidence 8654333222 No 153 >protein:vir:95318 Length: 328 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512264;genbank:gi:89152431;genbank:GeneID:3952987 Probab=98.93 E-value=1.5e-10 Score=74.45 Aligned_cols=225 Identities=12% Similarity=0.046 Sum_probs=150.4 Q ss_pred hhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecC-CCceEEEEEeCCcceeeeccccccccccccee Q lcl|NC_011614. 21 PQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPME-GTEKKFTFWADKPGAYWVGEGQKIETSKATWV 99 (324) Q Consensus 21 ~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~-~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~ 99 (324) ..++.....+..+...-+-|......|++.+.+.++++..++.+... +....+.+.++-|++.|..=++.++.++.++. T Consensus 1 m~~~~~~~~TL~e~Akr~~~d~~~~~VIE~l~~~n~IL~~lpf~e~n~gt~~~~~v~~~LP~~~fR~lN~g~~~s~~tt~ 80 (328) T protein:vir:95 1 MAVKGLTALTLADWGKRVDPNGKVDKIIELLGQTNPILQDMPFVEGNLPTGHRTTIRSGLPSATWRLLNYGVQPSKSTTV 80 (328) T ss_pred CCccccccccHHHHHhhhCcchhHHHHHHHHhccchhHhhcceeecccCCcceeeEeeccCCceeeecCCccCcccceeE Confidence 11111111111111222445667778999999999999999998875 33467889999999999999999999999999 Q ss_pred eEEeeeeeEEEeehhHHHHHhcCh--hHHHHHHHHHHHHHHHHHHHHHHHhccCcCc-C-Cccc---------------c Q lcl|NC_011614. 100 NATMRAFKLGVILPVTKEFLNYTY--SQFFEEMKPMIAEAFYKKFDEAGILNQGNNP-F-GKSI---------------A 160 (324) Q Consensus 100 ~v~~~~~k~~~~v~iS~ell~~s~--~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~~-~-~~~~---------------~ 160 (324) +++....-+++.+.|.+.+.+... .++...-.....+++..++...+|+|+.+.. . ..|+ . 
T Consensus 81 q~t~~l~ilgg~~eVDr~la~~~Gn~~~~ra~q~~~~~ka~~~~~~~~~iyGdsa~~p~~F~GL~~R~~~~s~~~a~qii 160 (328) T protein:vir:95 81 QVTDSVGMLETYAEVDKSLADLNGNTAEFRLSEDRAFIEAMNQQMAQTLFYGDSSVNPQQFMGLSSRYSSLSAGNAQNII 160 (328) T ss_pred EEEEEEEEEecceeechHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCccCChhhhcchhhhcCcccccccccee Confidence 999999999999999998887664 3444556667899999999999999854310 0 0000 0 Q ss_pred ccccccc------------------------------------------------------------------------- Q lcl|NC_011614. 161 QSIEKTN------------------------------------------------------------------------- 167 (324) Q Consensus 161 ~~~~~~~------------------------------------------------------------------------- 167 (324) ++.++.+ T Consensus 161 daGgtg~~~TSi~~v~~g~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~g~~y~~y~~~~~w~~Gl~i~d~r~vvrI~NId 240 (328) T protein:vir:95 161 DAGGTGTDNTSIWLVVWGENTVHGIFPKGKKAGIQMEDKGQVTLEDANGGKYEGYRTHYKWDNGLALRDWRYVVRIANID 240 (328) T ss_pred ecccCCCCceEEEEEEEcCCeEEEecccccccCceeeecCceeeecCCCCeeeEEEEEEEeeeeeEEcCcccEEEEecCc Confidence 0000000 Q ss_pred -----ceeecccchhHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhh-ccCCcee----eccCCCceecccceEeecCc Q lcl|NC_011614. 168 -----KVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIV-DPETKER----IYDRNSDSLDGLPVVNLKSS 237 (324) Q Consensus 168 -----~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~-d~~g~~~----~~~~~~~~l~G~pv~~~~~~ 237 (324) ..+......+.+++++.+++.....+.+|+||.+....|++.. +.+.-.+ +.....-.+.|+||..+++. T Consensus 241 ~~~l~~~~~~~~l~~lm~~a~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~n~~~~~~~~~g~~~t~~~gipir~~dai 320 (328) T protein:vir:95 241 VSNLSEPSSAANIAKLMVKALHRIPNRGMGRPVFYMNRTVGQALDLQSLEKTSLAISVKETEGEWWTSFRGVPIRETDAL 320 (328) T ss_pred ccccccccChhhHHHHHHHHHHHhccCCCCcceeehhHHHHHHHHHHHhcCcceeeeeeccCCcceeEECCeEEEEEeee Confidence 0000111233446667777777788899999999999998643 3333222 22334457899999988877 Q ss_pred cCCCceEE Q lcl|NC_011614. 
238 NLKRGELI 245 (324) Q Consensus 238 ~~~~~~i~ 245 (324) ..++..++ T Consensus 321 ~~tE~~vv 328 (328) T protein:vir:95 321 LETEARVV 328 (328) T ss_pred ecCccccC Confidence 76666555 No 154 >protein:vir:97031 Length: 402 # NCBI annotation: 31 # Family: family:all:2806 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654132;genbank:gi:108862016;genbank:GeneID:5075980 Probab=98.88 E-value=6.1e-10 Score=71.12 Aligned_cols=298 Identities=8% Similarity=-0.041 Sum_probs=154.2 Q ss_pred hhccchhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecC-CCceEEEEEeCCcceeeecccccccc Q lcl|NC_011614. 15 ASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPME-GTEKKFTFWADKPGAYWVGEGQKIET 93 (324) Q Consensus 15 ~~~~~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~-~~~~~ip~~~~~~~a~~v~Eg~~~~~ 93 (324) |..... .+--....+.+--.+.-+.+..+|.+.....++++++..+.++. +.+.++|+. +..+++...-|++... T Consensus 1 Ms~~n~---~t~~~~~~s~~~~al~le~f~geV~taF~~~si~~~~~~vrti~~GkS~qf~~i-G~~~a~y~~~G~~ldg 76 (402) T protein:vir:97 1 MSTPNT---LTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYL-GETELQVLAPGQSPNA 76 (402) T ss_pred CCCccc---ccccccccccchhhhhhhhhhhhHHHHHHHHHhhcCcceeeeecccceEEEEEE-eeeEEeeeccccccCC Confidence 221110 00001111112223555899999999999999999999887766 455788886 5556666666666655 Q ss_pred cccceeeEEeeeeeE-EEeehhHHHHHhcChhH-HHHHHHHHHHHHHHHHHHHHHHh-----ccC-c---CcCCcccc-c Q lcl|NC_011614. 94 SKATWVNATMRAFKL-GVILPVTKEFLNYTYSQ-FFEEMKPMIAEAFYKKFDEAGIL-----NQG-N---NPFGKSIA-Q 161 (324) Q Consensus 94 ~~~~~~~v~~~~~k~-~~~v~iS~ell~~s~~~-~~~~v~~~l~~ai~~~~d~a~l~-----g~g-~---~~~~~~~~-~ 161 (324) +.+..++..+....+ .....|-+=---++..+ +.+.+.+++++++++..|+.+|. +-. + ...+.... . 
T Consensus 77 ~~~~~~k~~ItID~lL~a~~~V~diDeaq~~yD~vRse~s~e~G~ALA~~~Dq~ii~~i~~aa~a~t~~~~~~~~~~~~g 156 (402) T protein:vir:97 77 TPTQADKNQLVIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKGHG 156 (402) T ss_pred CCcccccEEEEeCceeechhhhhhHHHHHhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccCcccccc Confidence 566667766666543 22233322111234456 78899999999999999997763 100 0 00000000 0 Q ss_pred ccccccceee-cccc----hhHHHHHHHHhhhhccCC--CEEEEcHHHHHHHHHhhcc-------CCceeeccCCCceec Q lcl|NC_011614. 162 SIEKTNKVIK-GDFT----QDNIIDLEALLEDDELEA--NAFISKTQNRSLLRKIVDP-------ETKERIYDRNSDSLD 227 (324) Q Consensus 162 ~~~~~~~~~~-~~~~----~~~i~~~~~~l~~~~~~~--~~~v~~~~~~~~L~~l~d~-------~g~~~~~~~~~~~l~ 227 (324) .......+.. ...+ .+.+.++..+|...+... -+++++|..|..|.+-.+- .+.-.+..+....+. T Consensus 157 ~s~~~~~t~~~a~~~~~~l~~ai~~a~~~LdEkdVP~~dRv~vv~P~~y~~Ll~~~rl~n~d~~~~~~g~~~~G~v~~v~ 236 (402) T protein:vir:97 157 FSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATINGFVLSSY 236 (402) T ss_pred cccccccccchhhcCHHHHHHHHHHHHHHHHhcCCCccccEEEeChHHHHHHhhcccccchhhccccCCccccceeEEEe Confidence 1111111111 1122 344556777776655443 3689999999988653211 111223455557899 Q ss_pred ccceEeecCccCCC-------------ceEE--EeecccE--EEEEecceEEEEeecccccccccccccchhhhhcCcEE Q lcl|NC_011614. 228 GLPVVNLKSSNLKR-------------GELI--TGDFDKL--IYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVA 290 (324) Q Consensus 228 G~pv~~~~~~~~~~-------------~~i~--~gd~~~~--~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~ 290 (324) |+||+.++..+... +.-| -+|++.- ++..+..+-.-.....+...+.+.. ...+| T Consensus 237 Gv~Vv~SnnlP~~a~~it~~~ls~a~~G~~y~~t~d~t~~~~~~f~~~Av~tvk~~~vT~~~~~d~r--~~~~~------ 308 (402) T protein:vir:97 237 NCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKK--EKTYY------ 308 (402) T ss_pred ceEEEecCccccccccccccccccCCCCccCCcCcccceeEEEEEecceEEEEEeeccccchhhchh--HHHHH------ Confidence 99998876443211 1111 1444321 1212211111000011111111111 11111 Q ss_pred EEEEEEeccEEecccceEEEEeeccCCCCccccC Q lcl|NC_011614. 
291 LRATMHVALHIADDKAFAKLVPADAKPSSVPGEV 324 (324) Q Consensus 291 ~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~~~~~ 324 (324) +-+..-+|..+.||+|..+++.+.-.++...+++ T Consensus 309 id~~~a~G~g~~RPeaa~vv~~~~~~t~~~~~~~ 342 (402) T protein:vir:97 309 IDTFMAEGAIPDRWEAVSVVTTKRDATTGDAGGP 342 (402) T ss_pred HHHHHHhCCcccCccceEEEEEecccccccCCcc Confidence 1223457999999999999987765555555555 No 155 >protein:vir:97331 Length: 319 # NCBI annotation: ORF011 # Family: family:all:701 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240611;genbank:gi:66396278;genbank:GeneID:5133687 Probab=98.88 E-value=2.4e-09 Score=67.87 Aligned_cols=283 Identities=16% Similarity=0.053 Sum_probs=157.3 Q ss_pred Cchhh-----HHHHHHHHHhhccchhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhh-h-ce--eeecCCCce Q lcl|NC_011614. 1 MEQTQ-----KLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQ-L-GK--YEPMEGTEK 71 (324) Q Consensus 1 m~~~~-----~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~-l-~~--~~~~~~~~~ 71 (324) |.+.- ++|.+++||+.....+.. ..+-+-++. +++.+.....+.. + +. ....++.++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~nt-------------~~l~~k~~~-~LD~~~~~~~~s~~~~~N~~~e~~gg~tV 66 (319) T protein:vir:97 1 MNKTIKNATGMLKLNLQHFANKSVEPGQ-------------TLLKNKHVG-ILERVTAVNAYSTPALISNDAIFMEGRSF 66 (319) T ss_pred CCcccccccceeEeehhhhhccCCCcch-------------HHHHHHHHH-HHHHHHHHhhhhhhcccCcceEeccCcEE Confidence 77654 478999999998766321 122233333 4444444443332 1 11 344578889 Q ss_pred EEEEEeCCcceeeecccccccccccc--eeeEEeeeeeEEEe-ehhHHHHHhcChh--HHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011614. 72 KFTFWADKPGAYWVGEGQKIETSKAT--WVNATMRAFKLGVI-LPVTKEFLNYTYS--QFFEEMKPMIAEAFYKKFDEAG 146 (324) Q Consensus 72 ~ip~~~~~~~a~~v~Eg~~~~~~~~~--~~~v~~~~~k~~~~-v~iS~ell~~s~~--~~~~~v~~~l~~ai~~~~d~a~ 146 (324) +||..+.. ...-..-+......+++ ....++.-.|.-.+ +.--+ .+++.. .+...+.+.....++-.+|... 
T Consensus 67 kIp~i~~~-gl~DY~R~~g~~~g~vt~~~~t~tidqdR~~~F~VD~~D--~~Etn~~l~a~~i~~~~~~~~v~PEiDay~ 143 (319) T protein:vir:97 67 TVMKGDTT-ELKDYKRNATNEFDHPKIEETTYFLDQEKYWGRFVDALD--RKDTEGNIDINYVVARQGAEVVAPYLDNLR 143 (319) T ss_pred EEeeeccc-ccccccCCCCcccCCcccceeEEEeecccccccccchhh--HhhhhchhhHHHHHHHHHHHHhhhhhhHHH Confidence 99998763 32223222223333333 44455555444333 22222 223322 3344556666777777888765 Q ss_pred HhccCcCcCCcccccccccccceeecccchhHHHHHHHHhhhhccCC-CEEEEcHHHHHHHHHhh----cc-CCceeecc Q lcl|NC_011614. 147 ILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLEALLEDDELEA-NAFISKTQNRSLLRKIV----DP-ETKERIYD 220 (324) Q Consensus 147 l~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~-~~~v~~~~~~~~L~~l~----d~-~g~~~~~~ 220 (324) +.--..+. ........+.+..|+.|.++..+|...+... -.++++|..+..|.+-. .. .+...... T Consensus 144 ~skla~~a--------~~~~~~~~t~~n~y~~i~~a~~~Lde~~VP~~Rvl~Vtp~~~~~L~~~~~f~~~~~~~~~~~~~ 215 (319) T protein:vir:97 144 FATLARNK--------AKHLTVGTGSDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGK 215 (319) T ss_pred HHHHHhhc--------ccccccccCHHHHHHHHHHHHHHHHhcCCCCCcEEEeCHHHHHHHHhhhhhhccccccccceee Confidence 54321110 0111111233446888999999998766543 35689999998885422 11 22334456 Q ss_pred CCCceecccceEeecCccCCCceEEEeecccEEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccE Q lcl|NC_011614. 221 RNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALH 300 (324) Q Consensus 221 ~~~~~l~G~pv~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~ 300 (324) +..++|.|++|+.+++....+..+++|..+...... +--.+++.+. .. .+..-.++...++|.. T Consensus 216 g~Vg~idG~~Vi~vps~~~k~in~i~~h~~A~~~~~-k~~~~~~~~p--------~~-------~~~a~~v~gr~y~d~~ 279 (319) T protein:vir:97 216 GVQGELDGFVIVKVPTKLLQGLQAIAVVGEVLASPI-QADLAKTNSN--------IP-------GMFGTLAEQLLYTGAF 279 (319) T ss_pred eeceeecCeEEEEecccccccceEEEEcCCeeeeee-eeeeeeccCC--------Cc-------cccceeeeeeeeeeeE Confidence 777899999999887776666667777654433222 1111222110 00 1123467888999999 Q ss_pred EecccceEEEEeeccCCCCccccC Q lcl|NC_011614. 
301 IADDKAFAKLVPADAKPSSVPGEV 324 (324) Q Consensus 301 v~~~~a~~~l~~~~~~~~~~~~~~ 324 (324) |++|++..+.......++..+.-. T Consensus 280 V~~~k~~~Iy~~~~~~~~~~~~~~ 303 (319) T protein:vir:97 280 VPEHLQKYIFTIGGTEVATKRDGV 303 (319) T ss_pred EeccccceEEEeecCCcccCCCcc Confidence 999998888877666666665444 No 156 >protein:vir:94800 Length: 319 # NCBI annotation: ORF012 # Family: family:all:701 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240536;genbank:gi:66396203;genbank:GeneID:5133580 Probab=98.88 E-value=2.4e-09 Score=67.87 Aligned_cols=283 Identities=16% Similarity=0.053 Sum_probs=157.3 Q ss_pred Cchhh-----HHHHHHHHHhhccchhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhh-h-ce--eeecCCCce Q lcl|NC_011614. 1 MEQTQ-----KLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQ-L-GK--YEPMEGTEK 71 (324) Q Consensus 1 m~~~~-----~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~-l-~~--~~~~~~~~~ 71 (324) |.+.- ++|.+++||+.....+.. ..+-+-++. +++.+.....+.. + +. ....++.++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~nt-------------~~l~~k~~~-~LD~~~~~~~~s~~~~~N~~~e~~gg~tV 66 (319) T protein:vir:94 1 MNKTIKNATGMLKLNLQHFANKSVEPGQ-------------TLLKNKHVG-ILERVTAVNAYSTPALISNDAIFMEGRSF 66 (319) T ss_pred CCcccccccceeEeehhhhhccCCCcch-------------HHHHHHHHH-HHHHHHHHhhhhhhcccCcceEeccCcEE Confidence 77654 478999999998766321 122233333 4444444443332 1 11 344578889 Q ss_pred EEEEEeCCcceeeecccccccccccc--eeeEEeeeeeEEEe-ehhHHHHHhcChh--HHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011614. 72 KFTFWADKPGAYWVGEGQKIETSKAT--WVNATMRAFKLGVI-LPVTKEFLNYTYS--QFFEEMKPMIAEAFYKKFDEAG 146 (324) Q Consensus 72 ~ip~~~~~~~a~~v~Eg~~~~~~~~~--~~~v~~~~~k~~~~-v~iS~ell~~s~~--~~~~~v~~~l~~ai~~~~d~a~ 146 (324) +||..+.. ...-..-+......+++ ....++.-.|.-.+ +.--+ .+++.. .+...+.+.....++-.+|... 
T Consensus 67 kIp~i~~~-gl~DY~R~~g~~~g~vt~~~~t~tidqdR~~~F~VD~~D--~~Etn~~l~a~~i~~~~~~~~v~PEiDay~ 143 (319) T protein:vir:94 67 TVMKGDTT-ELKDYKRNATNEFDHPKIEETTYFLDQEKYWGRFVDALD--RKDTEGNIDINYVVARQGAEVVAPYLDNLR 143 (319) T ss_pred EEeeeccc-ccccccCCCCcccCCcccceeEEEeecccccccccchhh--HhhhhchhhHHHHHHHHHHHHhhhhhhHHH Confidence 99998763 32223222223333333 44455555444333 22222 223322 3344556666777777888765 Q ss_pred HhccCcCcCCcccccccccccceeecccchhHHHHHHHHhhhhccCC-CEEEEcHHHHHHHHHhh----cc-CCceeecc Q lcl|NC_011614. 147 ILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLEALLEDDELEA-NAFISKTQNRSLLRKIV----DP-ETKERIYD 220 (324) Q Consensus 147 l~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~-~~~v~~~~~~~~L~~l~----d~-~g~~~~~~ 220 (324) +.--..+. ........+.+..|+.|.++..+|...+... -.++++|..+..|.+-. .. .+...... T Consensus 144 ~skla~~a--------~~~~~~~~t~~n~y~~i~~a~~~Lde~~VP~~Rvl~Vtp~~~~~L~~~~~f~~~~~~~~~~~~~ 215 (319) T protein:vir:94 144 FATLARNK--------AKHLTVGTGSDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGK 215 (319) T ss_pred HHHHHhhc--------ccccccccCHHHHHHHHHHHHHHHHhcCCCCCcEEEeCHHHHHHHHhhhhhhccccccccceee Confidence 54321110 0111111233446888999999998766543 35689999998885422 11 22334456 Q ss_pred CCCceecccceEeecCccCCCceEEEeecccEEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccE Q lcl|NC_011614. 221 RNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALH 300 (324) Q Consensus 221 ~~~~~l~G~pv~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~ 300 (324) +..++|.|++|+.+++....+..+++|..+...... +--.+++.+. .. .+..-.++...++|.. T Consensus 216 g~Vg~idG~~Vi~vps~~~k~in~i~~h~~A~~~~~-k~~~~~~~~p--------~~-------~~~a~~v~gr~y~d~~ 279 (319) T protein:vir:94 216 GVQGELDGFVIVKVPTKLLQGLQAIAVVGEVLASPI-QADLAKTNSN--------IP-------GMFGTLAEQLLYTGAF 279 (319) T ss_pred eeceeecCeEEEEecccccccceEEEEcCCeeeeee-eeeeeeccCC--------Cc-------cccceeeeeeeeeeeE Confidence 777899999999887776666667777654433222 1111222110 00 1123467888999999 Q ss_pred EecccceEEEEeeccCCCCccccC Q lcl|NC_011614. 
301 IADDKAFAKLVPADAKPSSVPGEV 324 (324) Q Consensus 301 v~~~~a~~~l~~~~~~~~~~~~~~ 324 (324) |++|++..+.......++..+.-. T Consensus 280 V~~~k~~~Iy~~~~~~~~~~~~~~ 303 (319) T protein:vir:94 280 VPEHLQKYIFTIGGTEVATKRDGV 303 (319) T ss_pred EeccccceEEEeecCCcccCCCcc Confidence 999998888877666666665444 No 157 >protein:vir:105645 Length: 400 # NCBI annotation: putative major capsid protein # Family: family:all:2806 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425009;genbank:gi:83571757;uniprot:Q2WC43;genbank:GeneID:3837286 Probab=98.80 E-value=1.4e-09 Score=69.17 Aligned_cols=290 Identities=11% Similarity=-0.027 Sum_probs=156.2 Q ss_pred hhccchhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCC-ceEEEEEeCCcceeeecccccccc Q lcl|NC_011614. 15 ASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGT-EKKFTFWADKPGAYWVGEGQKIET 93 (324) Q Consensus 15 ~~~~~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~-~~~ip~~~~~~~a~~v~Eg~~~~~ 93 (324) |.+..... | -....+.+--.+.-+.+..+|.+.....++++++..+.++.++ +..+|+. +..+++...-|+++.. T Consensus 1 Ms~~n~~t--~-p~~~gsg~~~aL~Le~f~GeV~taF~~~si~~~~~~vRtI~~gkS~qf~~l-G~s~a~y~~pG~~ldg 76 (400) T protein:vir:10 1 MSTPNNLT--N-VAVSASGEVDSLLIEKFNGKVNEQYLKGENIMSYFDVQTVTGTNTVSNKYL-GETELQVLAPGQSPAA 76 (400) T ss_pred CCCCcccc--c-cccccccchhhhHHhHhcchHHHHHHHHhhhcccceeeeecccceEEEEEe-eeeEEeeecCCCCcCC Confidence 22111100 0 0111122223466799999999999999999999999887654 5678886 6678888888888766 Q ss_pred cccceeeEEeeeeeE-EEeehhHHHHHhcChhH-HHHHHHHHHHHHHHHHHHHHHHh----c----cCcC-cCCcccccc Q lcl|NC_011614. 94 SKATWVNATMRAFKL-GVILPVTKEFLNYTYSQ-FFEEMKPMIAEAFYKKFDEAGIL----N----QGNN-PFGKSIAQS 162 (324) Q Consensus 94 ~~~~~~~v~~~~~k~-~~~v~iS~ell~~s~~~-~~~~v~~~l~~ai~~~~d~a~l~----g----~g~~-~~~~~~~~~ 162 (324) +.+..++..+....+ .....|.+=---++..+ +.+.+.+++++++++..|+.+|. + .... ..+.+.... 
T Consensus 77 ~~~~~dk~~ItIDtLL~a~~~V~dlDd~q~~yD~vRse~s~e~G~ALA~~~Dq~iiq~i~~a~~a~t~~~~~~~~g~~~g 156 (400) T protein:vir:10 77 TSTQADKNQLVIDATVIARNTVAHLHDVQGDIDSLKPKLATNQAKQLKKMEDEMLIQQMLLGGIANTQAKRTNPRVKGHG 156 (400) T ss_pred CCcccCcEEEEeCceeeecchhhhHHHHhhccccccHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCCccccc Confidence 667777777766553 34444433222234466 78999999999999999997763 1 1000 001111111 Q ss_pred c-ccccceeec-ccc----hhHHHHHHHHhhhhccCC--CEEEEcHHHHHHHHHhh-----ccC--CceeeccCCCceec Q lcl|NC_011614. 163 I-EKTNKVIKG-DFT----QDNIIDLEALLEDDELEA--NAFISKTQNRSLLRKIV-----DPE--TKERIYDRNSDSLD 227 (324) Q Consensus 163 ~-~~~~~~~~~-~~~----~~~i~~~~~~l~~~~~~~--~~~v~~~~~~~~L~~l~-----d~~--g~~~~~~~~~~~l~ 227 (324) . ........+ ..+ ...+.++..+|...+... -++++.|..|..|..-. +-+ +.-.+..+....+. T Consensus 157 ~s~~v~~~~~~~~~~~~~l~~A~~~A~~~LdEkdVP~~d~vvl~pp~~Ys~Ll~~dkLvnrdf~~s~~g~~~~g~v~~v~ 236 (400) T protein:vir:10 157 FSVNVEVNEGEALVNPQYVMAAVEFALEQQLEQEVDISDVAILMPWRYFNVLRDADRIVDKSYTISQSGATIQGFVLSSY 236 (400) T ss_pred cceeecccccccccCHHHHHHHHHHHHHHHHhcCCCccceEEEcCHHHHHHHHhCCcccchhccccCCCccccceEEEEe Confidence 0 111111111 112 233456666666554332 35788888888775321 110 11112333345789 Q ss_pred ccceEeecCccCCC-------------ceE--EEeecccE----------EEEEecceEEEEeecccccccccccccchh Q lcl|NC_011614. 228 GLPVVNLKSSNLKR-------------GEL--ITGDFDKL----------IYGIPQLIEYKIDETAQLSTVKNEDGTPVN 282 (324) Q Consensus 228 G~pv~~~~~~~~~~-------------~~i--~~gd~~~~----------~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~ 282 (324) |+||+.++..+... +.- +-+|++.. ..+...+++-++.++.. T Consensus 237 Gv~Iv~Sn~lP~~a~~~~~~~lS~a~~G~~y~~t~d~s~~~av~F~~sAv~tvk~~~lt~~~~~d~r------------- 303 (400) T protein:vir:10 237 NCPVIPSNRFPKYSQGQKHHLLSNEDNGYRYDPIAEMNGAIAVLFTADALLVGRSIDVIGDIFYEKK------------- 303 (400) T ss_pred ceEEEeeCcCCcccCcccccccccCCCCccCCccccccceeEEEEehhheEEEEeeccccccccchh------------- Confidence 99998875443211 111 22454432 22222222222222211 Q ss_pred hhhcCcEEEEEEEEeccEEecccceEEEEeeccCCCCc---cccC Q lcl|NC_011614. 
283 LFEQDMVALRATMHVALHIADDKAFAKLVPADAKPSSV---PGEV 324 (324) Q Consensus 283 ~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~---~~~~ 324 (324) .|..- +-+..-++..+.||+|+++++.+......+ |++- T Consensus 304 ~~~~~---id~~~a~G~g~~RPeaa~vv~~~~~~~~~~~~~~~~~ 345 (400) T protein:vir:10 304 EKTYY---IDTFMSEGAIPDRWEAVSVVTTKRQSTGAVDSGNAAQ 345 (400) T ss_pred hHHHH---HHHHHHhCCcccchhheEEEEecCCcccccccCcchh Confidence 11111 223345899999999999998765544433 3333 No 158 >protein:vir:103285 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277465;genbank:gi:71834107;genbank:GeneID:3562396 Probab=98.76 E-value=3e-09 Score=67.33 Aligned_cols=271 Identities=10% Similarity=0.042 Sum_probs=159.0 Q ss_pred ccccccCCCcceech---hhhHHHHHHHHhhcchhhhceeee---cCCCceEEEEEeCCcceeeeccc-cccccccccee Q lcl|NC_011614. 27 DNVMMHEKKDGTLLN---DFTTPILQEVMENSKIMQLGKYEP---MEGTEKKFTFWADKPGAYWVGEG-QKIETSKATWV 99 (324) Q Consensus 27 ~~~~~~~~~g~lip~---~~~~~i~~~~~~~s~l~~l~~~~~---~~~~~~~ip~~~~~~~a~~v~Eg-~~~~~~~~~~~ 99 (324) +++-..+++|.++-. .+.+.|++...+.-..++++.+.. ....++.+++.+....+.|++.+ ..+|..+..++ T Consensus 1 ~~~~~a~~~~~f~~~ql~~id~~v~e~~~~~l~~~~~i~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip~v~~~~~ 80 (296) T protein:vir:10 1 MGVDKADAAGIWTVKQLTASLNKAYETEYDQNSVVNLFPVSNEIPGYAKYFEYPVFDGVGIAQIVADYTDDLPLVDALAT 80 (296) T ss_pred CcccchhhhHHHHHHHHHHHHHHHHhhhhcccccceecccccCCCCceeEEEeeeeeccCceeEeCCCccccceeeccce Confidence 222233455566653 345666766666666666665433 22234556666667788898754 45888888888 Q ss_pred eEEeeeeeEEEeehhHHHHHhcC---hhHHHHHHHHHHHHHHHHHHHHHHHhccCcCcCCccccc--cccccccee---e Q lcl|NC_011614. 100 NATMRAFKLGVILPVTKEFLNYT---YSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQ--SIEKTNKVI---K 171 (324) Q Consensus 100 ~v~~~~~k~~~~v~iS~ell~~s---~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~~~~~~~~~--~~~~~~~~~---~ 171 (324) ......+.++..+.++.+=++.+ ..++...-....++++++.+|+.+|+|+..... .|+++ ++....... 
+ T Consensus 81 ~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~ka~aA~~~~~~~~n~~~f~G~~~~g~-~GLlN~p~v~~~~~~~~W~~ 159 (296) T protein:vir:10 81 ERQGKVFRFGNAFLISIDEIKVGQATGQSLSTRKQSLAFEAHDKLLDKLVWSGSTAHGI-PSVFDYPNINNVVSGGSWSQ 159 (296) T ss_pred eEEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEeecccccc-eeEeecCCCccccccCCccC Confidence 98888899888888877555544 457888888999999999999999999754211 12222 211111110 1 Q ss_pred cccchhHHHHHHHHhhh---hccCCCEEEEcHHHHHHHHHhhccCCceeec----cCCCceecccceEeecCccCCCceE Q lcl|NC_011614. 172 GDFTQDNIIDLEALLED---DELEANAFISKTQNRSLLRKIVDPETKERIY----DRNSDSLDGLPVVNLKSSNLKRGEL 244 (324) Q Consensus 172 ~~~~~~~i~~~~~~l~~---~~~~~~~~v~~~~~~~~L~~l~d~~g~~~~~----~~~~~~l~G~pv~~~~~~~~~~~~i 244 (324) .+.-++|+.+++.++.. ....+..++++|..+..|.......|..++. ...+.++.+.|.... +...++..+ T Consensus 160 ~t~i~~Di~~~~~~l~~~s~g~~~p~~l~L~p~~~~~L~~~~~~~~~t~l~~ik~~~~~l~i~~~~~l~~-a~~~g~~~~ 238 (296) T protein:vir:10 160 PTTAVSDITSLLDIIETSTNGQHRATHLLLPTTARRIMQNLVPGTSVSYGEFFRQNNSGVTVEFVQYLND-YNGTGTSAA 238 (296) T ss_pred HHHHHHHHHHHHHHHHHhhCceecceeEEeCHHHHHHHhhccCCCCccHHHHHHHhcCCceEEEeeeecc-CCCCcceEE Confidence 12237788888877754 3456678999999999986554433322211 112234444444322 112222333 Q ss_pred EEee--cccEEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEec-cEEecccceEEEEeeccC Q lcl|NC_011614. 245 ITGD--FDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVA-LHIADDKAFAKLVPADAK 316 (324) Q Consensus 245 ~~gd--~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d-~~v~~~~a~~~l~~~~~~ 316 (324) +.-. ...+-+...++++..... 
...-...++...|++ ..+.+|.|++++++.+-+ T Consensus 239 v~~~~~~~~~~~~v~~~~~~~~~e-----------------~~~l~~~~~~~~~~~Gv~i~~P~ai~~~dGI~~~ 296 (296) T protein:vir:10 239 IAYEKDPNNMAIEIPEATNALPAQ-----------------PKDLHFKIPVTSKATGLIVYRPLTMAVMKGITFA 296 (296) T ss_pred EEEEcCCceEEEEcCcceeeeccc-----------------ccCceEEEeeEeeEEEEEEECCceeEEEeeeecC Confidence 3322 222333333333322110 112234466778885 788889999999888777 No 159 >protein:vir:107120 Length: 329 # NCBI annotation: conserved phage protein # Family: family:all:701 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950606;genbank:gi:119953686;genbank:GeneID:4643129 Probab=98.74 E-value=8.9e-09 Score=64.71 Aligned_cols=285 Identities=16% Similarity=0.060 Sum_probs=149.4 Q ss_pred Cchhh-----HHHHHHHHHhhccchhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhh-hce--eeecCCCceE Q lcl|NC_011614. 1 MEQTQ-----KLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQ-LGK--YEPMEGTEKK 72 (324) Q Consensus 1 m~~~~-----~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~-l~~--~~~~~~~~~~ 72 (324) |.+.- ++|.+++||+++.+.+- ...+-+-+...+-+.+...+--.. ++. ....++++++ T Consensus 12 ~~~~~~~~~~~~~~~~~~~~~~~~~~n-------------t~~l~~k~~~~LD~~~~~~~~s~~~~~N~~~e~~~g~tVk 78 (329) T protein:vir:10 12 MNKEIKNATGKLKLNLQHFANKSVEPG-------------DTLLKNKHVGILEKVTAANSYSAPAVISNDAIFMQGRSFT 78 (329) T ss_pred hhhhhhcccceeEEehhhhcCCccCCc-------------hhHHHHHHHHHHHHHHHhhceeeeeecccceeeccCcEEE Confidence 65533 57899999998766521 112223333333333222211111 111 3445788899 Q ss_pred EEEEeCCcceeeeccccccccc--ccceeeEEeeeeeEEEeehhHHHHHhcChh--HHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_011614. 73 FTFWADKPGAYWVGEGQKIETS--KATWVNATMRAFKLGVILPVTKEFLNYTYS--QFFEEMKPMIAEAFYKKFDEAGIL 148 (324) Q Consensus 73 ip~~~~~~~a~~v~Eg~~~~~~--~~~~~~v~~~~~k~~~~v~iS~ell~~s~~--~~~~~v~~~l~~ai~~~~d~a~l~ 148 (324) ||..+.. ...-..-+...... ..++...++.-.|.-.+. |-+--.+.+.. .+...+.+.....++-.+|...+. 
T Consensus 79 Ip~i~~~-gl~DY~R~~g~~~g~vt~~~~t~tidqdR~~~F~-VD~~D~dEtn~~l~a~~i~~~~~~~~v~pEiDay~~s 156 (329) T protein:vir:10 79 VIKGDVT-ELKDYKRNATNEFDHPQIQETTYFLDQEKYWGRF-VDALDRRDTEGNIDINYVVAKQASEVVAPYLDNLRFA 156 (329) T ss_pred Eeeeccc-ccccccCCCCccccccccceeEEEeecccceeee-cchhhHhhhhhhhhHHHHHHHHHHHHhhhHHHHHHHH Confidence 9998753 32223322223323 334445555555544332 21111222222 334556666777778788876654 Q ss_pred ccCcCcCCcccccccccccceeecccchhHHHHHHHHhhhhccCC-CEEEEcHHHHHHHHHhh----cc-CCceeeccCC Q lcl|NC_011614. 149 NQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLEALLEDDELEA-NAFISKTQNRSLLRKIV----DP-ETKERIYDRN 222 (324) Q Consensus 149 g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~-~~~v~~~~~~~~L~~l~----d~-~g~~~~~~~~ 222 (324) ---.+. ........+.+.-|+.+.++..+|..++... -.++++|..+..|.+-. .. .+......+. T Consensus 157 kla~~a--------~~~~~~~~t~~nay~~i~~a~~~Lde~~vp~~Rvl~VtP~~~~~Lk~~~~f~~~~~~~~~~~~~g~ 228 (329) T protein:vir:10 157 TLARNK--------AKHLTVGSGADAQYDAVLDVSVELDEIGAGASRILFVTPKFYKGIKKFVIELPQGDNRQQVLGKGV 228 (329) T ss_pred HHHhhc--------ccccccccCHHHHHHHHHHHHHHHHhcCCCCCcEEEeCHHHHHHHHhhhhhhccccccccceeeee Confidence 211110 0011111223445788899999998765433 35689999998886421 11 1122345666 Q ss_pred CceecccceEeecCccCCCceEEEeecccEEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEe Q lcl|NC_011614. 223 SDSLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIA 302 (324) Q Consensus 223 ~~~l~G~pv~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~ 302 (324) .++|.|++|+.+++.......++++..+....... --.+++.+.. . .++.-.++...++|..|+ T Consensus 229 Vg~idG~~Ii~vps~~~k~in~ii~~~~A~~~~~K-~~~~~~~~p~--------~-------~~~a~~v~gr~yyd~~V~ 292 (329) T protein:vir:10 229 QGELDGFTIVKVPSKMLQGVEAMAVIGEVMASPIQ-ANEAKLNSNV--------P-------GMFGTLAEQMLYTGAFVP 292 (329) T ss_pred eeeecCeEEEEecCCcccceeEEEEcCCceeeeee-eeeeeeeCCC--------C-------ccchheeeeeeeeeeEEE Confidence 78899999998877777666667666554333221 1122222110 0 112346788899999999 Q ss_pred cccceEEEEeeccCCCCccccC Q lcl|NC_011614. 
303 DDKAFAKLVPADAKPSSVPGEV 324 (324) Q Consensus 303 ~~~a~~~l~~~~~~~~~~~~~~ 324 (324) +|++..+.......++..+.-. T Consensus 293 ~~k~~~I~~~~~~a~~~~~~~~ 314 (329) T protein:vir:10 293 EHLQKYIFTIGGKEVETNRDGV 314 (329) T ss_pred ccccCEEEEecccCcccCCCCC Confidence 9997776654333333332222 No 160 >protein:vir:8843 Length: 317 # NCBI annotation: major head protein # Family: family:all:3919 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775251;genbank:gi:27476049;genbank:GeneID:2700597 Probab=98.72 E-value=8.6e-09 Score=64.80 Aligned_cols=276 Identities=9% Similarity=-0.042 Sum_probs=162.0 Q ss_pred hccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCceEEEEEeCCcce-eeecccccccccccceeeEE Q lcl|NC_011614. 24 FNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKPGA-YWVGEGQKIETSKATWVNAT 102 (324) Q Consensus 24 ~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~~a-~~v~Eg~~~~~~~~~~~~v~ 102 (324) +..-..+-++......-+++...|...-....|+.++.......+..+.++..+....+ .-..||+..+.....-.... T Consensus 1 ma~~~~~~~t~~~~g~~~dl~~~I~~isp~dTPf~S~i~~~~a~~~~~~W~~d~l~~~~~~~~~EG~da~~~~~~~r~~~ 80 (317) T protein:vir:88 1 MATPTNAVSTVEINGKREDLIDIIYNIAPYDTPFMSAIGKGVATAITHEWQTDELRQPGKNTRVEGEDATIKAGSFTTML 80 (317) T ss_pred CCccccceEeeeeeeeeechhhhheecCCccCcceeeecCceecccEEEEEeeecCCccccccccCcccccccccCCEEe Confidence 11110011112233455778888888889999999998877766666667665544333 22358887665443322221 Q ss_pred eee-eeEEEeehhHHHHHhcChh---HHHHHHHHHHHHHHHHHHHHHHHhccCc-----CcCC---ccccccccc----- Q lcl|NC_011614. 103 MRA-FKLGVILPVTKEFLNYTYS---QFFEEMKPMIAEAFYKKFDEAGILNQGN-----NPFG---KSIAQSIEK----- 165 (324) Q Consensus 103 ~~~-~k~~~~v~iS~ell~~s~~---~~~~~v~~~l~~ai~~~~d~a~l~g~g~-----~~~~---~~~~~~~~~----- 165 (324) -.. +=+...+.||.-+..-+.. +...+-...=...+.+.+|+++|+|... .+.+ .|+...+.. 
T Consensus 81 ~N~tQIf~k~v~VSgTa~av~~~G~~~ela~q~~kk~~EikrdmE~~li~g~~a~~~~~~t~~r~~~Gl~~~i~t~~~~~ 160 (317) T protein:vir:88 81 NNYCQISDETLQVTGTADRVKKAGRKNELAYQLAKKSKELKLDMEYALVGAPQAKVQRNTTTPGQMANIFAYYKTNGSLG 160 (317) T ss_pred ccEEEEEEeEEEEeehhhhhhhcCccchhHHHHHHHHHHHHHHHHHHHhcCeeeccCCCCccchhhhhHHHHhccCceec Confidence 111 1233344455433332222 3233333333456888999999998532 1111 111111100 Q ss_pred ----------c---cceeecccchhHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCceeeccCCCcee------ Q lcl|NC_011614. 166 ----------T---NKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDRNSDSL------ 226 (324) Q Consensus 166 ----------~---~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~g~~~~~~~~~~~l------ 226 (324) . .......++.+++.+++.++-.++..+..++|+|.....|.++...++.++........+ T Consensus 161 ~~g~~~~~~~~~~~t~~t~~~lte~~l~~~l~~i~~~Gg~~~~i~v~a~~k~~i~~~~~~~~~~i~~~~~~~~~g~~v~~ 240 (317) T protein:vir:88 161 ANGVAPVGDGSNTGTAGDLRLLTEDMLLNASESIWRNGGQANSIQTSSSIKKAISKNMKGRATEITLDASDNRIAQTVDV 240 (317) T ss_pred cCccccccCCCccccccccccccHHHHHHHHHHHHhcCCCCCEEEeChHHHHHHHHHhcCCceeEEEcccCeEEEEEEEE Confidence 0 001112478899999999999999888999999999999988854455555433222111 Q ss_pred ----cccceEeecCccCCCceEEEeecccEEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEe Q lcl|NC_011614. 227 ----DGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIA 302 (324) Q Consensus 227 ----~G~pv~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~ 302 (324) +| .|-+..+..++.+.+++.|++++-+..-+.+..+..-. .-+......+..++..+. T Consensus 241 ~~tdfG-~v~ii~~r~lp~~~~~~~D~~~~~l~~Lr~~~~e~laK-----------------tGd~~k~~i~~E~tLe~~ 302 (317) T protein:vir:88 241 YESDFG-KYTIRANRWFHENTLFVFDPKMHSLCYLRPFFQHELAK-----------------TGDSEKRQLLVEYTFRVN 302 (317) T ss_pred EEeCCe-EEEEEeCCCCCCCeEEEEcccccceeecccceeeccCC-----------------CcccceeEEEEEEEEEEc Confidence 22 12223466778889999999988776655554432211 124455677889999999 Q ss_pred cccceEEEEeeccCC Q lcl|NC_011614. 
303 DDKAFAKLVPADAKP 317 (324) Q Consensus 303 ~~~a~~~l~~~~~~~ 317 (324) +|+|.++++..++.= T Consensus 303 N~~a~a~i~~l~~~~ 317 (317) T protein:vir:88 303 NEKSGALIRDVVAQL 317 (317) T ss_pred CccceeEEEEecccC Confidence 999999999875554 No 161 >protein:vir:7019 Length: 401 # NCBI annotation: major capsid protein # Family: family:all:2806 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853592;genbank:gi:31711674;genbank:GeneID:1481800 Probab=98.70 E-value=2.4e-09 Score=67.85 Aligned_cols=290 Identities=11% Similarity=0.025 Sum_probs=153.5 Q ss_pred hhccchhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCC-CceEEEEEeCCcceeeecccccccc Q lcl|NC_011614. 15 ASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEG-TEKKFTFWADKPGAYWVGEGQKIET 93 (324) Q Consensus 15 ~~~~~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~-~~~~ip~~~~~~~a~~v~Eg~~~~~ 93 (324) |.+....+ | -....+.+--.+.-+.+..+|.+.....++++++..+.++.+ .+.++|+. +..+++...-|++... T Consensus 1 Ms~~n~~t--~-~~~~~sg~~~al~Le~f~GeV~taF~~~si~~~~~~vRti~~gkS~qf~~~-G~s~~~~~~pG~~ld~ 76 (401) T protein:vir:70 1 MSTPNNLT--N-VAVSASGEVDSLLIEKFNGKVNEQYLKGENIMSYFDVQTVTGTNTVSNKYL-GETELQVLAPGQSPAA 76 (401) T ss_pred CCCCcccc--c-cccccccchhHhHHhHhcchHHHHHHHHhhhcccceeeeecccceEEEEEe-eeeEeeeecCCCCcCC Confidence 22111100 0 011111222346778999999999999999999999988765 45678886 5667777777777766 Q ss_pred cccceeeEEeeeeeE-EEeehhHHHHHhcChhH-HHHHHHHHHHHHHHHHHHHHHHhcc---C------cCcCCcccc-- Q lcl|NC_011614. 94 SKATWVNATMRAFKL-GVILPVTKEFLNYTYSQ-FFEEMKPMIAEAFYKKFDEAGILNQ---G------NNPFGKSIA-- 160 (324) Q Consensus 94 ~~~~~~~v~~~~~k~-~~~v~iS~ell~~s~~~-~~~~v~~~l~~ai~~~~d~a~l~g~---g------~~~~~~~~~-- 160 (324) +.+..++..+....+ .....|-+=---++..+ +.+.+.+++++++++..|+.++.-- + ....+.+.. 
T Consensus 77 ~~~~~dK~~ItID~lL~a~~~V~dlDe~q~~yD~vRse~s~e~G~ALA~~~Dq~iiq~i~~aa~ana~~~~~~p~~~~~G 156 (401) T protein:vir:70 77 TSTQADKNQLVIDATVIARNTVAHLHDVQGDIDSLKPKLATNQAKQLKRMEDEMLIQQMMLGGIANTQAKRTNPRVKGHG 156 (401) T ss_pred CCcccccEEEEeCceeehhhhhhhHHHHHhcccccchHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccCCCcCCCc Confidence 677777777766553 33334433112234456 7889999999999999998764311 0 001111110 Q ss_pred --cccccc--cceeecccchhHHHHHHHHhhhhccCCC--EEEEcHHHHHHHHHh---hccC----CceeeccCCCceec Q lcl|NC_011614. 161 --QSIEKT--NKVIKGDFTQDNIIDLEALLEDDELEAN--AFISKTQNRSLLRKI---VDPE----TKERIYDRNSDSLD 227 (324) Q Consensus 161 --~~~~~~--~~~~~~~~~~~~i~~~~~~l~~~~~~~~--~~v~~~~~~~~L~~l---~d~~----g~~~~~~~~~~~l~ 227 (324) ..+... ........-.+.+.++...|...+.... +++++|..|..|..- -+.. +.-.+..+....+. T Consensus 157 ~~i~v~~~~~~~~~~~~~l~~ai~dA~~~LdEkdVP~~r~vvl~pp~~Ys~Ll~~d~L~nrd~~~s~~g~~~~G~v~~va 236 (401) T protein:vir:70 157 FSINVEVAEGEALVNPQYVMAAVEFALEQQLEQEVDISDVAILMPWRYFNVLRDADRIVDKTYTISQSGATIQGFTLSSY 236 (401) T ss_pred eEEeccccccccccCHHHHHHHHHHHHHHHHhcCCCccceEEEcCHHHHHHHHhcCcccchhhccccCCccccceEEEEe Confidence 011110 0111111234456677777776665433 567788888777532 1111 11123344445789 Q ss_pred ccceEeecCccCCC-------------ceEE--EeecccE----------EEEEecceEEEEeecccccccccccccchh Q lcl|NC_011614. 228 GLPVVNLKSSNLKR-------------GELI--TGDFDKL----------IYGIPQLIEYKIDETAQLSTVKNEDGTPVN 282 (324) Q Consensus 228 G~pv~~~~~~~~~~-------------~~i~--~gd~~~~----------~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~ 282 (324) |+||+.++..+... +.-+ -+|++.. ..+...+++-++.++.. T Consensus 237 Gv~Vv~SnnlP~~a~~it~~~ls~a~~G~~y~~~~d~s~~~~v~f~~~Av~tvk~~~lt~~~~~d~r------------- 303 (401) T protein:vir:70 237 NCPVIPSNRFPKYSQGQTHHLLSNEDNGYRYDPLPAMNGAIAVLFTADALLVGRSIDVTGDIFYEKK------------- 303 (401) T ss_pred ceEEEeeccccccccccccccccccCCCccCCCCccccceeEEEEehhheEEEEeeccccchhhhhh------------- Confidence 99998876443211 1111 1444322 11222222222222211 Q ss_pred hhhcCcEEEEEEEEeccEEecccceEEEEeecc----CCCCccccC Q lcl|NC_011614. 
283 LFEQDMVALRATMHVALHIADDKAFAKLVPADA----KPSSVPGEV 324 (324) Q Consensus 283 ~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~~----~~~~~~~~~ 324 (324) -|..- +-+..-+|..+.||+|.++++.+-. .+.+|++.- T Consensus 304 ~~~~~---id~~~a~g~g~~RPeaa~vv~~k~~~~~~~~~~~~~~~ 346 (401) T protein:vir:70 304 EKTYY---IDTFMAEGAIPDRWEAVSVVTTKRNTTTGAVEGTDGAQ 346 (401) T ss_pred hhHHH---HHHHHHhCCcccchhheEEEeecCcccccccccCCcch Confidence 11111 1233458999999999999865433 222233222 No 162 >protein:vir:107687 Length: 319 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003898;genbank:gi:45686314;genbank:GeneID:2773027 Probab=98.69 E-value=1.3e-08 Score=63.86 Aligned_cols=290 Identities=8% Similarity=0.019 Sum_probs=159.7 Q ss_pred CchhhHHHHHHHHHhhccchhhhhccccccccCCCcceec---hhhhHHHHHHHHhhcchhhhceeee---cCCCceEEE Q lcl|NC_011614. 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLL---NDFTTPILQEVMENSKIMQLGKYEP---MEGTEKKFT 74 (324) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~lip---~~~~~~i~~~~~~~s~l~~l~~~~~---~~~~~~~ip 74 (324) |+..+..+.+.+..+....... ++-+ ...+.|.+.. +.+.+.+++...+.-..+++..+.. ....++.+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~-~~~d---a~~~~g~~~~~ql~~id~~v~e~~~~~l~~~~~i~v~~~~~~~~~~~~~~ 76 (319) T protein:vir:10 1 MTTKKFDEADKSNVEMYLIQAG-VKQD---AAATMGIWTAQELHRIKSQSYEEDYPVGSALRVFPVTTELSPTDKTFEYM 76 (319) T ss_pred CCCcchhHHhhHHHHHHHhhcc-chhh---hhhhhhhHHHHHHHHHHHHHHhhhhcceechhhcccccCCCCceEEEEee Confidence 7776665555544333222211 1211 1122344444 3444567777777777777766543 223344566 Q ss_pred EEeCCcceeeecccc-cccccccceeeEEeeeeeEEEeehhHHHHHhcC---hhHHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_011614. 
75 FWADKPGAYWVGEGQ-KIETSKATWVNATMRAFKLGVILPVTKEFLNYT---YSQFFEEMKPMIAEAFYKKFDEAGILNQ 150 (324) Q Consensus 75 ~~~~~~~a~~v~Eg~-~~~~~~~~~~~v~~~~~k~~~~v~iS~ell~~s---~~~~~~~v~~~l~~ai~~~~d~a~l~g~ 150 (324) +.+....+.|++.++ .+|..+..++......+.++..+.++..=++.+ ..++...-....++++++.+|+.+|+|+ T Consensus 77 ~~~~~G~a~~~~d~~~dip~v~~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i~f~G~ 156 (319) T protein:vir:10 77 TFDKVGTAQIIADYTDDLPLVDALGTSEFGKVFRLGNAYLISIDEIKAGQATGRPLSTRKASACQLAHDQLVNRLVFKGS 156 (319) T ss_pred eeccccceeeecCccccccceeccceeeEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEeec Confidence 666667789997654 478888888888888888888888876444444 3578888889999999999999999997 Q ss_pred CcCcCCccccccccccccee---------ecccchhHHHHHHHHhhh---hccCCCEEEEcHHHHHHHHHhhccCCceee Q lcl|NC_011614. 151 GNNPFGKSIAQSIEKTNKVI---------KGDFTQDNIIDLEALLED---DELEANAFISKTQNRSLLRKIVDPETKERI 218 (324) Q Consensus 151 g~~~~~~~~~~~~~~~~~~~---------~~~~~~~~i~~~~~~l~~---~~~~~~~~v~~~~~~~~L~~l~d~~g~~~~ 218 (324) ..... .|+++.-+...... +...-++|+..++.++.. ....+..++++|..+..|.......|...+ T Consensus 157 ~~~g~-~GLlN~p~~~~~~~~~~~~~~t~t~~~i~~di~~~~~~l~~~s~g~~~p~~L~L~p~~~~~L~~~~~~~~~t~l 235 (319) T protein:vir:10 157 APHKI-VSVFNHPNITKITSGKWIDVSTMKPETAEAELTQAIETIETITRGQHRATNILIPPSMRKVLAIRMPETTMSYL 235 (319) T ss_pred ccccc-eeEEeCCCceeeecCCCCCccccCHHHHHHHHHHHHHHHHHhcCceeeceEEEecHHHHHhhhcccCCCCeeHH Confidence 54211 12222211111111 111234667777777753 234567899999999999654443332222 Q ss_pred c---c-CCCceecccceEeecCccCCCceEEEeec--ccEEEEEecceEEEEeecccccccccccccchhhhhcCcEEEE Q lcl|NC_011614. 219 Y---D-RNSDSLDGLPVVNLKSSNLKRGELITGDF--DKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALR 292 (324) Q Consensus 219 ~---~-~~~~~l~G~pv~~~~~~~~~~~~i~~gd~--~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r 292 (324) . . ..+.++.+.|-... +...++..++.... ..+-+.....++.... +. ..-...+. 
T Consensus 236 ~~lk~~~~~l~I~~~pel~~-ag~~g~~~~v~y~~~~~~~~~~v~~~~~~~~~-e~----------------~~l~~~~~ 297 (319) T protein:vir:10 236 DYFKSQNSGIEIDSIAELED-IDGAGTKGVLVYEKNPMNMSIEIPEAFNMLPA-QP----------------KDLHFKVP 297 (319) T ss_pred HHHHHhcCCceEEEeeeecc-cCCCcceEEEEEecCCceEEEecCcceeeeee-ee----------------cCceEEEe Confidence 1 1 12234555554332 11122222222222 2222222233322111 00 01112233 Q ss_pred EEEEec-cEEecccceEEEEee Q lcl|NC_011614. 293 ATMHVA-LHIADDKAFAKLVPA 313 (324) Q Consensus 293 ~~~r~d-~~v~~~~a~~~l~~~ 313 (324) ...|++ ..+.+|.|++++++. T Consensus 298 ~~~r~~Gv~i~~P~ai~~~dGI 319 (319) T protein:vir:10 298 CTSKCTGLTIYRPMTIVLITGV 319 (319) T ss_pred eeeeeEEEEEEccceeEeeecC Confidence 455554 667779999999998 No 163 >protein:vir:80068 Length: 301 # NCBI annotation: gp8 # Family: family:all:463 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468712;genbank:gi:157325292;genbank:GeneID:5601759 Probab=98.63 E-value=2.5e-08 Score=62.26 Aligned_cols=265 Identities=10% Similarity=0.014 Sum_probs=151.0 Q ss_pred cccCCCcceec---hhhhHHHHHHHHhhcchhhhceee---ecCCCceEEEEEeCCcceeeeccccc-ccccccceeeEE Q lcl|NC_011614. 30 MMHEKKDGTLL---NDFTTPILQEVMENSKIMQLGKYE---PMEGTEKKFTFWADKPGAYWVGEGQK-IETSKATWVNAT 102 (324) Q Consensus 30 ~~~~~~g~lip---~~~~~~i~~~~~~~s~l~~l~~~~---~~~~~~~~ip~~~~~~~a~~v~Eg~~-~~~~~~~~~~v~ 102 (324) +.+++.|.+.. +.+.+.+++.+.+....+++..+. +.....+.+++.+....+.|+++++. +|..+..++... T Consensus 1 ~~~~~~g~f~~~~l~~id~~v~e~~~~~l~~r~l~~v~~~~~~~~~~~~~~~~~~~G~~~~~~~~~~dip~~~~~~~~~~ 80 (301) T protein:vir:80 1 MQGKITATIEARDLQAIDNVIYEPKQEELTARSVFPQKFDVNEGAESYSFDVMTRSGAAKIIANGADDLPLVDVDMVRKS 80 (301) T ss_pred CCccccchhhHHHHHHHHHHHHHhhhhhhhhhhhcccccCCCCceEEEEEeeeccceeEEEecCcccccccccccceeEE Confidence 34444444443 344567788888888888876553 33334455666666678899876554 788888888888 Q ss_pred eeeeeEEEeehhHHHHHhcC---hhHHHHHHHHHHHHHHHHHHHHHHHhccCcC-cCCccccccccccccee-------- Q lcl|NC_011614. 
103 MRAFKLGVILPVTKEFLNYT---YSQFFEEMKPMIAEAFYKKFDEAGILNQGNN-PFGKSIAQSIEKTNKVI-------- 170 (324) Q Consensus 103 ~~~~k~~~~v~iS~ell~~s---~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~-~~~~~~~~~~~~~~~~~-------- 170 (324) .....++.-+.++..=++.+ ..++...-....++++++.+|+.+|+|+... ..|.............. T Consensus 81 ~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~f~G~~~~g~~GLlN~p~~~~~~~~~~~~~~~~~ 160 (301) T protein:vir:80 81 VPIYSIGIGLSYTIQDLRAARMQGTTVDAAKATTVRRAIAEKENSIAFRGEKKYAIKGAFEATGIQIDVSPTTGVGNVSK 160 (301) T ss_pred EEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEeeecccccceeeecCCCcccccccCcccccccc Confidence 88888888888877545444 4678888889999999999999999997642 12211111111111100 Q ss_pred ----ecccchhHHHHHHHHhhhh---ccCCCEEEEcHHHHHHHHHhh--ccCCceeec----cCCCceecccceEeecCc Q lcl|NC_011614. 171 ----KGDFTQDNIIDLEALLEDD---ELEANAFISKTQNRSLLRKIV--DPETKERIY----DRNSDSLDGLPVVNLKSS 237 (324) Q Consensus 171 ----~~~~~~~~i~~~~~~l~~~---~~~~~~~v~~~~~~~~L~~l~--d~~g~~~~~----~~~~~~l~G~pv~~~~~~ 237 (324) +..--++|+.+++.++... ...+..++++|..+..|.... +..|..++. .....++...|-..... T Consensus 161 w~~~t~~ei~~di~~~~~~l~~~s~g~~~p~~L~L~p~~~~~L~~~~~~~~~~~tvl~~l~~~~~~~~I~~~p~L~~~g- 239 (301) T protein:vir:80 161 WEKKTAEQIIDEIGEAHTKITVLPGYGTASLKLCLPPKQFELINKKRYSNEDSRSVLKVLQDNAWFSAIVRVPDLAGMG- 239 (301) T ss_pred cccCCHHHHHHHHHHHHHHHHHhcCceecccEEEecHHHHHhhhhccccCCCCeeHHHHHHHHcCcceEEEcceeccCC- Confidence 0111257788888887542 245678999999999997543 333332221 11123444444433211 Q ss_pred cCCCceEEE-ee-cccEEEEEecceEEEEeecccccccccccccchhhhhcCc-EEEEEEEEe-ccEEecccceEEEEee Q lcl|NC_011614. 238 NLKRGELIT-GD-FDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDM-VALRATMHV-ALHIADDKAFAKLVPA 313 (324) Q Consensus 238 ~~~~~~i~~-gd-~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~-v~~r~~~r~-d~~v~~~~a~~~l~~~ 313 (324) ..++..++. .+ ...+-+...+.++...- -.++. ...-...|+ +..+.+|.|++++++. 
T Consensus 240 ~~g~~~~v~~~~~~d~~~~~v~~~~~~~~~------------------e~~~~~~~~~~~~r~~Gv~i~~P~ai~~~~GI 301 (301) T protein:vir:80 240 TAGSDSFAVIHDSNETAELIIPMDITRHPE------------------EYSFPRTKVPFEERTAGVVVRFPAAIVRVDGI 301 (301) T ss_pred CCcccEEEEEecCCcEEEEEecCceeeecc------------------eecCceeEeeeeeeeEEEEEEccceEEEEecC Confidence 112222222 21 11122222222221110 01121 112234555 5678889999999998 No 164 >protein:vir:107826 Length: 331 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:1903 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996627;genbank:gi:45580761;genbank:GeneID:2767902 Probab=98.61 E-value=7.3e-09 Score=65.18 Aligned_cols=225 Identities=13% Similarity=0.116 Sum_probs=143.1 Q ss_pred hhhhccccccccCCCcceechh-hhHHHHHHHHhhcchhhhceeeecCCCc-eEEEEEeCCcceeeecccccccccccce Q lcl|NC_011614. 21 PQVFNPDNVMMHEKKDGTLLND-FTTPILQEVMENSKIMQLGKYEPMEGTE-KKFTFWADKPGAYWVGEGQKIETSKATW 98 (324) Q Consensus 21 ~~~~~a~~~~~~~~~g~lip~~-~~~~i~~~~~~~s~l~~l~~~~~~~~~~-~~ip~~~~~~~a~~v~Eg~~~~~~~~~~ 98 (324) ..++.....+..+...-+-|.. +...|++.+.+.++|+..++.+....+. ....+.++-|.+.|..=++.++.++.++ T Consensus 1 m~~~~~~~~TL~e~Ak~~~~~~~l~~~IIE~l~~tn~IL~~lpf~e~N~~t~~~~~vrt~LP~~~fR~lN~g~~~s~~tt 80 (331) T protein:vir:10 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTWRKLNYGVQPEKSRT 80 (331) T ss_pred CCccccCcccHHHHHHhcCcchhHHHHHHHHHhcCchHHhhceeeeccCCccceeeEEeccCCchhhccCCccCccccee Confidence 0000000000000000011322 3457999999999999999988654333 3466778889999999999999999999 Q ss_pred eeEEeeeeeEEEeehhHHHHHhcCh--hHHHHHHHHHHHHHHHHHHHHHHHhccCcC-cCC-c---------------cc Q lcl|NC_011614. 99 VNATMRAFKLGVILPVTKEFLNYTY--SQFFEEMKPMIAEAFYKKFDEAGILNQGNN-PFG-K---------------SI 159 (324) Q Consensus 99 ~~v~~~~~k~~~~v~iS~ell~~s~--~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~-~~~-~---------------~~ 159 (324) .+++....-+++.+.|.+.+.+... .++.....+.+.+++...+.+.+|+|+.+. +.. . 
.+ T Consensus 81 ~q~t~~l~ilgg~~eVDk~la~~~Gn~~~~ra~e~~~~ik~m~~~~~~~~iyGD~a~~p~~F~GL~kR~~~~~a~~~~q~ 160 (331) T protein:vir:10 81 VQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAENGQNI 160 (331) T ss_pred EEEEEEEEEeccceeechHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccChhhhccchhhccccccccccce Confidence 9999999999999999998888764 345555677789999999999999986321 000 0 00 Q ss_pred cccccccc------------------------------------------c----------------------------- Q lcl|NC_011614. 160 AQSIEKTN------------------------------------------K----------------------------- 168 (324) Q Consensus 160 ~~~~~~~~------------------------------------------~----------------------------- 168 (324) .++.++.+ . T Consensus 161 IdaGgtG~~~TSI~~v~~~~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~G~~y~~y~~~~~w~~Gl~i~d~r~v~ri~NI 240 (331) T protein:vir:10 161 IDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVRIANV 240 (331) T ss_pred eecCCCCCCceEEEEEEEcCCeeEEecccccccCceEeecCceeeecCCCCeeeEEEEEEEeeeeeEEcCcccEEEEecc Confidence 00000000 0 Q ss_pred --------eeecccchhHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHh-hccCCce-e----eccCCCceecccceEee Q lcl|NC_011614. 169 --------VIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKI-VDPETKE-R----IYDRNSDSLDGLPVVNL 234 (324) Q Consensus 169 --------~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l-~d~~g~~-~----~~~~~~~~l~G~pv~~~ 234 (324) ..++..-.+.++.+..+++.....+.+|+||.+....|++. .+.+... + +.....-.+.|+||..+ T Consensus 241 dvs~l~~~~~~~~dl~~lm~~a~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~~~~~~~~~~~~g~~~t~~~gipir~~ 320 (331) T protein:vir:10 241 DVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFDGIPCRRT 320 (331) T ss_pred chhccCCCcchhhhHHHHHHHHHHHhcccCCCCeEEEechHHHHHHHHHHhhccceeeeeeeecCCcceeEECCeeEEEe Confidence 00000012334556666777777788999999999999864 3332222 2 12334457899999988 Q ss_pred cCccCCCceEE Q lcl|NC_011614. 
235 KSSNLKRGELI 245 (324) Q Consensus 235 ~~~~~~~~~i~ 245 (324) ++...++..++ T Consensus 321 dai~~tE~~Vv 331 (331) T protein:vir:10 321 DALLLTEARVV 331 (331) T ss_pred eeeecCccccC Confidence 87776666555 No 165 >protein:vir:98525 Length: 331 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:1903 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996579;genbank:gi:45569510;genbank:GeneID:2767853 Probab=98.61 E-value=7.3e-09 Score=65.18 Aligned_cols=225 Identities=13% Similarity=0.116 Sum_probs=143.1 Q ss_pred hhhhccccccccCCCcceechh-hhHHHHHHHHhhcchhhhceeeecCCCc-eEEEEEeCCcceeeecccccccccccce Q lcl|NC_011614. 21 PQVFNPDNVMMHEKKDGTLLND-FTTPILQEVMENSKIMQLGKYEPMEGTE-KKFTFWADKPGAYWVGEGQKIETSKATW 98 (324) Q Consensus 21 ~~~~~a~~~~~~~~~g~lip~~-~~~~i~~~~~~~s~l~~l~~~~~~~~~~-~~ip~~~~~~~a~~v~Eg~~~~~~~~~~ 98 (324) ..++.....+..+...-+-|.. +...|++.+.+.++|+..++.+....+. ....+.++-|.+.|..=++.++.++.++ T Consensus 1 m~~~~~~~~TL~e~Ak~~~~~~~l~~~IIE~l~~tn~IL~~lpf~e~N~~t~~~~~vrt~LP~~~fR~lN~g~~~s~~tt 80 (331) T protein:vir:98 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTWRKLNYGVQPEKSRT 80 (331) T ss_pred CCccccCcccHHHHHHhcCcchhHHHHHHHHHhcCchHHhhceeeeccCCccceeeEEeccCCchhhccCCccCccccee Confidence 0000000000000000011322 3457999999999999999988654333 3466778889999999999999999999 Q ss_pred eeEEeeeeeEEEeehhHHHHHhcCh--hHHHHHHHHHHHHHHHHHHHHHHHhccCcC-cCC-c---------------cc Q lcl|NC_011614. 99 VNATMRAFKLGVILPVTKEFLNYTY--SQFFEEMKPMIAEAFYKKFDEAGILNQGNN-PFG-K---------------SI 159 (324) Q Consensus 99 ~~v~~~~~k~~~~v~iS~ell~~s~--~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~-~~~-~---------------~~ 159 (324) .+++....-+++.+.|.+.+.+... .++.....+.+.+++...+.+.+|+|+.+. +.. . 
.+ T Consensus 81 ~q~t~~l~ilgg~~eVDk~la~~~Gn~~~~ra~e~~~~ik~m~~~~~~~~iyGD~a~~p~~F~GL~kR~~~~~a~~~~q~ 160 (331) T protein:vir:98 81 VQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAENGQNI 160 (331) T ss_pred EEEEEEEEEeccceeechHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccChhhhccchhhccccccccccce Confidence 9999999999999999998888764 345555677789999999999999986321 000 0 00 Q ss_pred cccccccc------------------------------------------c----------------------------- Q lcl|NC_011614. 160 AQSIEKTN------------------------------------------K----------------------------- 168 (324) Q Consensus 160 ~~~~~~~~------------------------------------------~----------------------------- 168 (324) .++.++.+ . T Consensus 161 IdaGgtG~~~TSI~~v~~~~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~G~~y~~y~~~~~w~~Gl~i~d~r~v~ri~NI 240 (331) T protein:vir:98 161 IDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVRIANV 240 (331) T ss_pred eecCCCCCCceEEEEEEEcCCeeEEecccccccCceEeecCceeeecCCCCeeeEEEEEEEeeeeeEEcCcccEEEEecc Confidence 00000000 0 Q ss_pred --------eeecccchhHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHh-hccCCce-e----eccCCCceecccceEee Q lcl|NC_011614. 169 --------VIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKI-VDPETKE-R----IYDRNSDSLDGLPVVNL 234 (324) Q Consensus 169 --------~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l-~d~~g~~-~----~~~~~~~~l~G~pv~~~ 234 (324) ..++..-.+.++.+..+++.....+.+|+||.+....|++. .+.+... + +.....-.+.|+||..+ T Consensus 241 dvs~l~~~~~~~~dl~~lm~~a~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~~~~~~~~~~~~g~~~t~~~gipir~~ 320 (331) T protein:vir:98 241 DVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFDGIPCRRT 320 (331) T ss_pred chhccCCCcchhhhHHHHHHHHHHHhcccCCCCeEEEechHHHHHHHHHHhhccceeeeeeeecCCcceeEECCeeEEEe Confidence 00000012334556666777777788999999999999864 3332222 2 12334457899999988 Q ss_pred cCccCCCceEE Q lcl|NC_011614. 
235 KSSNLKRGELI 245 (324) Q Consensus 235 ~~~~~~~~~i~ 245 (324) ++...++..++ T Consensus 321 dai~~tE~~Vv 331 (331) T protein:vir:98 321 DALLLTEARVV 331 (331) T ss_pred eeeecCccccC Confidence 87776666555 No 166 >protein:vir:107388 Length: 331 # NCBI annotation: Bbp17 # Family: family:all:1903 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958686;genbank:gi:41179378;genbank:GeneID:2717182 Probab=98.61 E-value=7.3e-09 Score=65.18 Aligned_cols=225 Identities=13% Similarity=0.116 Sum_probs=143.1 Q ss_pred hhhhccccccccCCCcceechh-hhHHHHHHHHhhcchhhhceeeecCCCc-eEEEEEeCCcceeeecccccccccccce Q lcl|NC_011614. 21 PQVFNPDNVMMHEKKDGTLLND-FTTPILQEVMENSKIMQLGKYEPMEGTE-KKFTFWADKPGAYWVGEGQKIETSKATW 98 (324) Q Consensus 21 ~~~~~a~~~~~~~~~g~lip~~-~~~~i~~~~~~~s~l~~l~~~~~~~~~~-~~ip~~~~~~~a~~v~Eg~~~~~~~~~~ 98 (324) ..++.....+..+...-+-|.. +...|++.+.+.++|+..++.+....+. ....+.++-|.+.|..=++.++.++.++ T Consensus 1 m~~~~~~~~TL~e~Ak~~~~~~~l~~~IIE~l~~tn~IL~~lpf~e~N~~t~~~~~vrt~LP~~~fR~lN~g~~~s~~tt 80 (331) T protein:vir:10 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTWRKLNYGVQPEKSRT 80 (331) T ss_pred CCccccCcccHHHHHHhcCcchhHHHHHHHHHhcCchHHhhceeeeccCCccceeeEEeccCCchhhccCCccCccccee Confidence 0000000000000000011322 3457999999999999999988654333 3466778889999999999999999999 Q ss_pred eeEEeeeeeEEEeehhHHHHHhcCh--hHHHHHHHHHHHHHHHHHHHHHHHhccCcC-cCC-c---------------cc Q lcl|NC_011614. 99 VNATMRAFKLGVILPVTKEFLNYTY--SQFFEEMKPMIAEAFYKKFDEAGILNQGNN-PFG-K---------------SI 159 (324) Q Consensus 99 ~~v~~~~~k~~~~v~iS~ell~~s~--~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~-~~~-~---------------~~ 159 (324) .+++....-+++.+.|.+.+.+... .++.....+.+.+++...+.+.+|+|+.+. +.. . 
.+ T Consensus 81 ~q~t~~l~ilgg~~eVDk~la~~~Gn~~~~ra~e~~~~ik~m~~~~~~~~iyGD~a~~p~~F~GL~kR~~~~~a~~~~q~ 160 (331) T protein:vir:10 81 VQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAENGQNI 160 (331) T ss_pred EEEEEEEEEeccceeechHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccChhhhccchhhccccccccccce Confidence 9999999999999999998888764 345555677789999999999999986321 000 0 00 Q ss_pred cccccccc------------------------------------------c----------------------------- Q lcl|NC_011614. 160 AQSIEKTN------------------------------------------K----------------------------- 168 (324) Q Consensus 160 ~~~~~~~~------------------------------------------~----------------------------- 168 (324) .++.++.+ . T Consensus 161 IdaGgtG~~~TSI~~v~~~~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~G~~y~~y~~~~~w~~Gl~i~d~r~v~ri~NI 240 (331) T protein:vir:10 161 IDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVRIANV 240 (331) T ss_pred eecCCCCCCceEEEEEEEcCCeeEEecccccccCceEeecCceeeecCCCCeeeEEEEEEEeeeeeEEcCcccEEEEecc Confidence 00000000 0 Q ss_pred --------eeecccchhHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHh-hccCCce-e----eccCCCceecccceEee Q lcl|NC_011614. 169 --------VIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKI-VDPETKE-R----IYDRNSDSLDGLPVVNL 234 (324) Q Consensus 169 --------~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l-~d~~g~~-~----~~~~~~~~l~G~pv~~~ 234 (324) ..++..-.+.++.+..+++.....+.+|+||.+....|++. .+.+... + +.....-.+.|+||..+ T Consensus 241 dvs~l~~~~~~~~dl~~lm~~a~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~~~~~~~~~~~~g~~~t~~~gipir~~ 320 (331) T protein:vir:10 241 DVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFDGIPCRRT 320 (331) T ss_pred chhccCCCcchhhhHHHHHHHHHHHhcccCCCCeEEEechHHHHHHHHHHhhccceeeeeeeecCCcceeEECCeeEEEe Confidence 00000012334556666777777788999999999999864 3332222 2 12334457899999988 Q ss_pred cCccCCCceEE Q lcl|NC_011614. 
235 KSSNLKRGELI 245 (324) Q Consensus 235 ~~~~~~~~~i~ 245 (324) ++...++..++ T Consensus 321 dai~~tE~~Vv 331 (331) T protein:vir:10 321 DALLLTEARVV 331 (331) T ss_pred eeeecCccccC Confidence 87776666555 No 167 >protein:vir:103759 Length: 330 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: genbank:acc:YP_024928;genbank:gi:48697198;genbank:GeneID:2846083 Probab=98.53 E-value=8.2e-09 Score=64.91 Aligned_cols=225 Identities=12% Similarity=0.061 Sum_probs=143.1 Q ss_pred hhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCc-eEEEEEeCCcceeeeccccccccccccee Q lcl|NC_011614. 21 PQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTE-KKFTFWADKPGAYWVGEGQKIETSKATWV 99 (324) Q Consensus 21 ~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~-~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~ 99 (324) ..++.....+..+...-+-|......|++.+.+.++|+..++.+...... ....+.++-|++.|..=++.++.++.++. T Consensus 1 m~~~~~~a~TL~e~AKr~~~d~~~~~IIE~l~~tn~IL~~lpf~e~N~~tg~~t~vrt~LP~~~fR~lN~g~~~s~~tt~ 80 (330) T protein:vir:10 1 MATLSTNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSVRTGLPTPTWRKLYGGVLPNKSSTA 80 (330) T ss_pred CCcCCCCcccHHHHHhhcCcchhHHHHHHHHhcCchHHhhcchhhccCCcccceeEEeecCCchhhhcCCccccccceEE Confidence 11111011111111122335566678999999999999998886533222 12345677789999999999999999999 Q ss_pred eEEeeeeeEEEeehhHHHHHhcCh--hHHHHHHHHHHHHHHHHHHHHHHHhccCcC-cC-Cccc---------------c Q lcl|NC_011614. 100 NATMRAFKLGVILPVTKEFLNYTY--SQFFEEMKPMIAEAFYKKFDEAGILNQGNN-PF-GKSI---------------A 160 (324) Q Consensus 100 ~v~~~~~k~~~~v~iS~ell~~s~--~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~-~~-~~~~---------------~ 160 (324) +++....-+++.+.|-+.+.+.+. .++.....+...+++...+.+.+|+|+.+. +. ..|+ . 
T Consensus 81 qvt~~l~ilgg~~eVDr~la~~~Gn~a~~ra~e~~~~ikam~q~~~~~~iyGD~a~~p~~F~GL~kR~~~~ta~~~~qvI 160 (330) T protein:vir:10 81 QVTDNCGMLEAYAEVDKALADLNGNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAENKDNVI 160 (330) T ss_pred EEEEEeEEecchhhhhhHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCChhhccchhhhcCCCCCCchhhee Confidence 999999999999999888877654 345566777889999999999999995431 00 0000 0 Q ss_pred ccccccc-----------------------c-----------ee------------------------------------ Q lcl|NC_011614. 161 QSIEKTN-----------------------K-----------VI------------------------------------ 170 (324) Q Consensus 161 ~~~~~~~-----------------------~-----------~~------------------------------------ 170 (324) ++.++.. . +. T Consensus 161 daGGtG~~~TSi~~v~wg~~~~~giyPkG~kaGl~~~d~g~~~~~~~dg~gg~y~~~~~~~~w~~Gl~i~d~r~vvRI~N 240 (330) T protein:vir:10 161 DAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYVARVCN 240 (330) T ss_pred eccccccCceEEEEEEEcCCeEEEEcccCccccceeeeccceeeecccCCCCceeEEeeeeeeeeeeEEeCcccEEEEee Confidence 0000000 0 00 Q ss_pred ------ecccchhHH----HHHHHHhhhhccCCCEEEEcHHHHHHHHHh-hccCCcee-e---ccCCCceecccceEeec Q lcl|NC_011614. 171 ------KGDFTQDNI----IDLEALLEDDELEANAFISKTQNRSLLRKI-VDPETKER-I---YDRNSDSLDGLPVVNLK 235 (324) Q Consensus 171 ------~~~~~~~~i----~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l-~d~~g~~~-~---~~~~~~~l~G~pv~~~~ 235 (324) ......+++ +.+..+++.......+|+||...+..|++. .+.++..+ . .....-.+.|+||..++ T Consensus 241 Idvs~l~~~~~~~~li~lm~~A~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~k~n~~l~~~~~~g~~~t~~~gipir~~D 320 (330) T protein:vir:10 241 IDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSGERVMTFDGIPVQRTD 320 (330) T ss_pred cccccCCCCccHHHHHHHHHHHHHhccCCCCCcceeeechHHHHHHHHHHhhcccceeeeeecCCeeeEEECCeEEEEEe Confidence 000012233 445577777788889999999999999864 34433211 1 12223468999999888 Q ss_pred CccCCCceEE Q lcl|NC_011614. 
236 SSNLKRGELI 245 (324) Q Consensus 236 ~~~~~~~~i~ 245 (324) +...++..++ T Consensus 321 ail~tE~~vv 330 (330) T protein:vir:10 321 ALLNTESRVV 330 (330) T ss_pred eeecCccccC Confidence 7776666555 No 168 >protein:vir:79548 Length: 652 # NCBI annotation: putative protease/scaffold protein # Family: family:all:62 # ACLAME annotation(s): go:0008236 - serine-type peptidase activity; phi:0000017 - phage prohead/capsid assembly # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272518;genbank:gi:148609387;genbank:GeneID:5204384 Probab=98.48 E-value=1.1e-07 Score=58.67 Aligned_cols=301 Identities=10% Similarity=0.079 Sum_probs=151.5 Q ss_pred Cc---hhhHHHHH-HHHHhhccchhh----------h-hccccccccCCCcceechhhhHHHHHHH-Hhhc-chhhhcee Q lcl|NC_011614. 1 ME---QTQKLKLN-LQHFASNNVKPQ----------V-FNPDNVMMHEKKDGTLLNDFTTPILQEV-MENS-KIMQLGKY 63 (324) Q Consensus 1 m~---~~~~~~~~-~~~~~~~~~~~~----------~-~~a~~~~~~~~~g~lip~~~~~~i~~~~-~~~s-~l~~l~~~ 63 (324) |+ +.|.+... ++..++..+..+ + ++..-..+|++-+.++ ..+.++.+... +... .+...+++ T Consensus 319 ~~~~~~~~~~~g~~L~elAr~~L~~~G~~~~~~~~~~~v~~A~~hsTsDFp~IL-~~~~nk~l~~~y~~a~~t~~~~~~~ 397 (652) T protein:vir:79 319 FEKTERDNVYNGMTLREYARMSLTERGIGVSSYNPMQMVGAAFTHSTSDFGNIL-LDVANKAILQGWEDAPETYEQWTRK 397 (652) T ss_pred CcccccCccccCccHHHHHHHHHHhhccCCCCCCHHHHHHHHhhcCcchHHHHH-HHHHHHHHHHHHhhhHHHHHHHhcc Confidence 22 22211111 222222211111 1 1111112333433333 33333333332 2222 24444554 Q ss_pred eecCC-CceEEEEEeCCcceeeecccccccccccceeeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHH Q lcl|NC_011614. 
64 EPMEG-TEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKF 142 (324) Q Consensus 64 ~~~~~-~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~ 142 (324) -+.+- .........+-++..-|.|++++......=+..++...++|.++.||++++-.-..+..+-|...++++.++.+ T Consensus 398 ~~~~DFk~~~~~~lg~~~~L~~V~E~gEyk~~t~~e~~e~~~l~tyG~~~~iTRqaiINDDL~a~~~ip~~~g~aA~~~~ 477 (652) T protein:vir:79 398 GQLSDFKIAHRVGMGGFSALRQVREGAEYKYVTTGDKQATIALATYGELFSITRQAIINDDLNMLTDVPMKLGRAAKSTI 477 (652) T ss_pred CCCccccccceeecCCCCCccccCCCCccceeeecCccceeeeecccCeeeeehheeeccchhHHHHHHHHHHHHHHHHH Confidence 44321 11123344556777789999999776555567789999999999999999987778889999999999999999 Q ss_pred HHHHHh---ccCcC-cCCcccccccccccceeecccchhHHHHHHHHh---hhh----ccCCCEEEEcHHHHHHHHHhhc Q lcl|NC_011614. 143 DEAGIL---NQGNN-PFGKSIAQSIEKTNKVIKGDFTQDNIIDLEALL---EDD----ELEANAFISKTQNRSLLRKIVD 211 (324) Q Consensus 143 d~a~l~---g~g~~-~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l---~~~----~~~~~~~v~~~~~~~~L~~l~d 211 (324) ++.++. ++..- ..+..++..+...+...+++++.+.+..+...+ .+. +..|..|+++|.......++.. T Consensus 478 ~~~vy~~l~~Np~~~~DGk~LF~hA~H~Nl~~~aa~~~~~l~~ar~aM~~Qk~g~~~l~i~P~~llvp~~le~~a~~ll~ 557 (652) T protein:vir:79 478 ADLVYAILTSNPKISTDNVSLFDKAKHANVLESAAMDVASLDKARQLMRVQKEGERHLNIRPAFVLVPTAMESVANQVIR 557 (652) T ss_pred HHHHHHHHhcCcccccCCceeecccccccccccccCCHHHHHHHHHHHHHhccCCccccccccEEEecchhHHHHHHHhc Confidence 975543 33221 133344422222222223455655555443332 221 2455678888887766655543 Q ss_pred cCCcee--eccCCCceeccc-ceEeecCccCC-CceEEEeecccEEEEEecceEEEEeecccccccccccccchhhhhcC Q lcl|NC_011614. 212 PETKER--IYDRNSDSLDGL-PVVNLKSSNLK-RGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQD 287 (324) Q Consensus 212 ~~g~~~--~~~~~~~~l~G~-pv~~~~~~~~~-~~~i~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~ 287 (324) +...+- ...+....+.|+ .+++.+--+.. ....|+.+... 
...+++-+ ++....+.-....-|..| T Consensus 558 s~~v~~a~~~~~~~Np~~~~~~~i~eprL~~~s~~~wylaa~~~-----~dtiev~y-----L~G~~~P~ie~~~gf~~d 627 (652) T protein:vir:79 558 SSSVKGADINAGIINPVKDFATVIAEPRLDDNSQTTFYLAASKG-----SDTIEVAY-----LNGVDTPYIDQMEGFSVD 627 (652) T ss_pred cCCCcccccccccccccccccccccccccCCCCcccEEEecCCC-----CCeEEEEE-----ecCCCCCeeeecCCCCcc Confidence 221111 111122234443 22222211111 11122222110 00112211 222222222222349999 Q ss_pred cEEEEEEEEeccEEecccceEEEEe Q lcl|NC_011614. 288 MVALRATMHVALHIADDKAFAKLVP 312 (324) Q Consensus 288 ~v~~r~~~r~d~~v~~~~a~~~l~~ 312 (324) .+.+|+...||.+++|=.++.+.+- T Consensus 628 G~~~kvrlD~G~~~iD~RG~~k~t~ 652 (652) T protein:vir:79 628 GVTTKVRIDAGVAPVDHRGLVKCTA 652 (652) T ss_pred eEEEEEEEeccCceeeccceeeecC Confidence 9999999999999999999988877 No 169 >protein:vir:104342 Length: 314 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398971;genbank:gi:81343955;genbank:GeneID:3778874 Probab=98.48 E-value=4.1e-08 Score=61.08 Aligned_cols=286 Identities=7% Similarity=0.014 Sum_probs=153.7 Q ss_pred CchhhHHHHHHHHHhhc--cchhhhhccccccccCCCcceech---hhhHHHHHHHHhhcchhhhceeeec---CCCceE Q lcl|NC_011614. 1 MEQTQKLKLNLQHFASN--NVKPQVFNPDNVMMHEKKDGTLLN---DFTTPILQEVMENSKIMQLGKYEPM---EGTEKK 72 (324) Q Consensus 1 m~~~~~~~~~~~~~~~~--~~~~~~~~a~~~~~~~~~g~lip~---~~~~~i~~~~~~~s~l~~l~~~~~~---~~~~~~ 72 (324) |- .+|+.+ .+..+..+.+ ....+.+|.++-. .+.+.|++...+.-..++++.+..- ...++. T Consensus 1 ~~---------~~~~~~~~~~~~~~~~~~-~~~~d~~~~fl~~ql~~id~~v~e~~~~~~~~~~~i~v~~~~~~~~et~~ 70 (314) T protein:vir:10 1 MA---------IKFDAEQAKITTHLEQMG-VEKADAAGIWAVSQLTAALNRAYEKEYAENSVVNIFPVTNEIPGHAKYFE 70 (314) T ss_pred Cc---------cchHHHHHHHHHHHHhhc-ccchhhhHHHHHHHHHHHHHHHhhhhccccccceeeccccCCCCceeEEE Confidence 21 112211 2222333333 2333344555553 4445666666666566665554332 222455 Q ss_pred EEEEeCCcceeeeccccc-ccccccceeeEEeeeeeEEEeehhHHHHHhcC---hhHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_011614. 
73 FTFWADKPGAYWVGEGQK-IETSKATWVNATMRAFKLGVILPVTKEFLNYT---YSQFFEEMKPMIAEAFYKKFDEAGIL 148 (324) Q Consensus 73 ip~~~~~~~a~~v~Eg~~-~~~~~~~~~~v~~~~~k~~~~v~iS~ell~~s---~~~~~~~v~~~l~~ai~~~~d~a~l~ 148 (324) +++.+....+.|++.++. +|..+..+++.....+.++..+.++..=++.+ ..++...-....++++.+.+|+.+|+ T Consensus 71 ~~~~e~~G~a~~~~d~~~dip~vd~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i~f~ 150 (314) T protein:vir:10 71 YPEFDGVGIAQIIADYSDDLPLVDAFMTEKQGKVFRFGNAFLISTDEIKAGAATGQSLSARKQALAFEAHDNLLDKLVWS 150 (314) T ss_pred eeeeccccceeeeCCcccccceeecccceeEEEEEEEEeeEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEe Confidence 666666778899976544 88888888888888888888888876444434 35788888899999999999999999 Q ss_pred ccCcC-cCCcccccccccccce---eecccchhHHHHHHHHhhh---hccCCCEEEEcHHHHHHHHHhhccCCceeec-- Q lcl|NC_011614. 149 NQGNN-PFGKSIAQSIEKTNKV---IKGDFTQDNIIDLEALLED---DELEANAFISKTQNRSLLRKIVDPETKERIY-- 219 (324) Q Consensus 149 g~g~~-~~~~~~~~~~~~~~~~---~~~~~~~~~i~~~~~~l~~---~~~~~~~~v~~~~~~~~L~~l~d~~g~~~~~-- 219 (324) |+... ..|......+...+.. .+..--++|+..++.++.. ....+..++++|..+..|....+..|..++. T Consensus 151 G~~~~g~~GLlN~p~v~~~~~~~~WaT~~ei~~Di~~~~~~l~~~s~g~~~p~~l~Lpp~~~~~L~~~~~~~~~tvl~~l 230 (314) T protein:vir:10 151 GSAPHGIVSVFDQPNINNVVATPNWSVPQNAIDDVTAMIDAVESSTQGLHHVTDILLPASARRVMQGLVPQTNLSYGELF 230 (314) T ss_pred ecccccceeEeecCCCccccCCCCcccHHHHHHHHHHHHHHHHHhcCccccceeEEecHHHHHhhcccccCCCccHHHHH Confidence 96542 1222111111111111 1111226778888888764 2345678999999998875443333322211 Q ss_pred --cCCCceecccceEeecCccCCCceEEEe--ecccEEEEEecceEEEEeecccccccccccccchhhhhcC--cEEEEE Q lcl|NC_011614. 220 --DRNSDSLDGLPVVNLKSSNLKRGELITG--DFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQD--MVALRA 293 (324) Q Consensus 220 --~~~~~~l~G~pv~~~~~~~~~~~~i~~g--d~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~--~v~~r~ 293 (324) ...+-+|.+.|-... +...++..+++- +...+-+.....++..- .+.. ...+.. 
T Consensus 231 ~~n~~~l~I~~~~el~~-ag~~g~~~~v~y~~~~~~~~~~vp~~~~~l~-------------------~e~~~~~~~~~~ 290 (314) T protein:vir:10 231 TRNNPGLTIRFLQFLDN-YDGAGGKAALAFEKSPLNMSIEIPEVTNVLP-------------------AQPKDLHFRYPV 290 (314) T ss_pred HHhCCCcEEEEcccccc-cCCCcceEEEEEecCCcEEEEecCccceeec-------------------ceecCceEEEcc Confidence 111223444444322 111122222211 11112222222222110 0111 122334 Q ss_pred EEEe-ccEEecccceEEEEeeccC Q lcl|NC_011614. 294 TMHV-ALHIADDKAFAKLVPADAK 316 (324) Q Consensus 294 ~~r~-d~~v~~~~a~~~l~~~~~~ 316 (324) ..|+ |..+.+|.|++++++.+-+ T Consensus 291 ~~r~~Gv~i~~P~ai~~~dGI~~~ 314 (314) T protein:vir:10 291 TSKATGLIVYRPLTMAVIKGITFA 314 (314) T ss_pred eeeeEEEEEECcceeEeeeeeecC Confidence 5566 5677789999999988877 No 170 >protein:vir:80446 Length: 367 # NCBI annotation: BcepGomrgp07 # Family: family:all:1522 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210227;genbank:gi:146329919;genbank:GeneID:5123555 Probab=98.46 E-value=1.9e-07 Score=57.40 Aligned_cols=278 Identities=10% Similarity=0.049 Sum_probs=146.9 Q ss_pred hhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhce---------eeecCCCceEEEEEeC-Ccceeeecccc- Q lcl|NC_011614. 21 PQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGK---------YEPMEGTEKKFTFWAD-KPGAYWVGEGQ- 89 (324) Q Consensus 21 ~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~---------~~~~~~~~~~ip~~~~-~~~a~~v~Eg~- 89 (324) ...+++ -|.-...++|+.+...+.+...+.+.|++-.- ....++....+|.|.. ..+..-+.+.. T Consensus 1 M~~~~~----~T~l~Dii~pEvF~~Yv~~~~~e~~~l~qSGiv~~d~~l~~~~~~gG~~v~iPf~~~L~g~~~n~~~d~~ 76 (367) T protein:vir:80 1 MPDFNN----QVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNP 76 (367) T ss_pred Ccchhh----hhhhhhccchhhhhHHHhhhhhhhhhhhhcceeecCHHHHHHhhcCCCEEEeeeeccCCCCccccCCCCC Confidence 000111 01112246777777777666666666554322 1335566789999854 33333343332 Q ss_pred --cccccccceeeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHh---c----cCcCc------ Q lcl|NC_011614. 
90 --KIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGIL---N----QGNNP------ 154 (324) Q Consensus 90 --~~~~~~~~~~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~---g----~g~~~------ 154 (324) ..+..+.+-++-.......+.....++-...-+..++.+.|.+++++.-.+...+.+|. | +..+. T Consensus 77 ~~~~t~~kittg~~~a~v~~r~kaw~~~Dla~~lsG~dpm~~Ia~qva~yW~r~~q~~Lla~L~Gvf~~~~a~~~~~~~~ 156 (367) T protein:vir:80 77 NVEAPIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKT 156 (367) T ss_pred cccccccccccchheeeeehhcccchhhhHHHHhhCchHHHHHHHHHHHHhhhhhHHHHHHHHHHhhccccccchhhhhh Confidence 24445555554444444445555555544444556889999999987777766665443 2 11100 Q ss_pred -----------CCcccccccccccceeecccchhHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhh------ccCCcee Q lcl|NC_011614. 155 -----------FGKSIAQSIEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIV------DPETKER 217 (324) Q Consensus 155 -----------~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~------d~~g~~~ 217 (324) .....+... .........++++.++++..++.+....-++++||+..+..|++++ +++| T Consensus 157 ~~~~~a~~~~~~~~~~~Dis-~~t~~~~~~~s~~~~~~A~~~lGD~~~~l~~i~mHS~V~~~L~~~~li~~i~~sd~--- 232 (367) T protein:vir:80 157 RGRVPAEVLGTAGDMVIDIS-GQTNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDEIEFIPDSKG--- 232 (367) T ss_pred hhccccccccccCceeeeee-ccCCCccceecHHHHHHHHHHhccccccccEEEEchHHHHHHHhccccccccCCCC--- Confidence 000111111 1111122457888999999999988888899999999999998653 3333 Q ss_pred eccCCCceecccceEeecCccCC------Cce-EEEeecccEEEEEecc-eEEEEeecccccccccccccchhhhhcCcE Q lcl|NC_011614. 218 IYDRNSDSLDGLPVVNLKSSNLK------RGE-LITGDFDKLIYGIPQL-IEYKIDETAQLSTVKNEDGTPVNLFEQDMV 289 (324) Q Consensus 218 ~~~~~~~~l~G~pv~~~~~~~~~------~~~-i~~gd~~~~~~~~~~~-~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v 289 (324) ....++++|++|++.++.+.. ... .+||. ..+.++.... ..+|+.|+.... +. 
.++- T Consensus 233 --~~~i~ty~G~~VIvDD~~Pv~~~~a~~~yttYlfg~-GAi~~~~~~~~~~~E~~Rd~~~~---~~---------gG~d 297 (367) T protein:vir:80 233 --QLTIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGG-AAFGYADGAPQVPVAVGRRELRG---NG---------SGLE 297 (367) T ss_pred --ccccceecceeEEEeCCCcccccCCCceEEEEEEec-ceeeecccCCccceecccchhhh---cC---------CceE Confidence 223568899999998766542 112 22332 2233333221 224555544210 00 1122 Q ss_pred EEEEEEEeccEEecccceEEEEeecc--------------CCCCccccC Q lcl|NC_011614. 290 ALRATMHVALHIADDKAFAKLVPADA--------------KPSSVPGEV 324 (324) Q Consensus 290 ~~r~~~r~d~~v~~~~a~~~l~~~~~--------------~~~~~~~~~ 324 (324) .+.-..| .+++|.+|...+..-. ...+|..|. T Consensus 298 ~L~~Rr~---~~~hP~G~s~~~~~v~~~~~~~~~~~~~~~~~sPt~~eL 343 (367) T protein:vir:80 298 YILERKE---WIVHPGGFNWLDADVTIPDNTGSPSGITSGPPAITLANL 343 (367) T ss_pred EEEeeee---EEeecceeeecccccccccccccccccccccCCCChHHh Confidence 2222222 4788888877654322 234566666 No 171 >protein:vir:7324 Length: 335 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: genbank:acc:NP_848215;genbank:gi:30387386;genbank:GeneID:2641870 Probab=98.44 E-value=2.5e-08 Score=62.30 Aligned_cols=226 Identities=14% Similarity=0.089 Sum_probs=138.1 Q ss_pred hhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCc-eEEEEEeCCcceeeeccccccccccccee Q lcl|NC_011614. 21 PQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTE-KKFTFWADKPGAYWVGEGQKIETSKATWV 99 (324) Q Consensus 21 ~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~-~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~ 99 (324) ..++.-...+-.....-+-+......|++.+.+.+.|+..++........ ....+.++-|++.|..=++.++.++.++. 
T Consensus 1 m~~~~~~a~TL~E~Akr~~~d~~~~~IIE~l~~tneIL~~lpf~e~N~~tg~~~~vrt~LP~~~fR~lN~g~~~s~~tt~ 80 (335) T protein:vir:73 1 MALIGQTLPSLLDIYNRTDKNGRIARIVEQLAKTNDILTDAIYVPCNDGSKHKTTIRAGIPEPVWRRYNQGVQPTKTQTV 80 (335) T ss_pred CCcCCCCchhHHHHHhhcCcchhHHHHHHHHhcCchHHhhcchhcccCCcccceeEEEecCCchhhhcCCccccccceEE Confidence 00000000011111111224455667999999999999998887543222 22355677789999999999999999999 Q ss_pred eEEeeeeeEEEeehhHHHHHhcCh--hHHHHHHHHHHHHHHHHHHHHHHHhccCcC-cC-Cccc---------------- Q lcl|NC_011614. 100 NATMRAFKLGVILPVTKEFLNYTY--SQFFEEMKPMIAEAFYKKFDEAGILNQGNN-PF-GKSI---------------- 159 (324) Q Consensus 100 ~v~~~~~k~~~~v~iS~ell~~s~--~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~-~~-~~~~---------------- 159 (324) +++....-+++.+.|-+.+.+.+. .++.........+++...+.+.+|+|+.+. +. ..|+ T Consensus 81 qvt~~l~ilgg~~eVDr~La~~~Gn~a~~ra~e~~~~ikam~q~~~~~~iyGDsa~~p~~FdGL~kR~~~~st~~a~~a~ 160 (335) T protein:vir:73 81 PVTDTTGMLYDLGFVDKALADRSNNAAAFRVSENMGKLQGFNNKVARYSIYGNTDAEPEAFMGLAPRFNTLSTSKAASAE 160 (335) T ss_pred EEEEEEEEecchhhhhHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHhccCCcCCChhhccchhhhhcCccccccCccc Confidence 999999999999999887777654 345666667789999999999999995421 00 0000 Q ss_pred --cccccccc---------------------------------------------------------------------- Q lcl|NC_011614. 160 --AQSIEKTN---------------------------------------------------------------------- 167 (324) Q Consensus 160 --~~~~~~~~---------------------------------------------------------------------- 167 (324) .++.++.. T Consensus 161 ~iIdaGGtG~~~TSi~~v~wg~~~~~giyPkG~kaGl~~~d~g~~~~~d~~G~~y~~~~~~~~w~~Gl~i~d~r~vvRI~ 240 (335) T protein:vir:73 161 NVFSAGGSGSTNTSIWFMSWGENTAHMIYPEGMVAGFQHEDLGDDLVSDGNGGQFRAYRDEFKWDIGLSVRDWRSISRIC 240 (335) T ss_pred ceeeccccccCceEEEEEEEcCCeeEEEcccCccccceeeeccceeeecCCCCEEeEEEeeeeeeeeeEEeCcccEEEEe Confidence 00000000 Q ss_pred -----ceeecccchhHHHHHH-H-----HhhhhccCCCEEEEcHHHHHHHHHhhccCCceee-----ccCCCceecccce Q lcl|NC_011614. 
168 -----KVIKGDFTQDNIIDLE-A-----LLEDDELEANAFISKTQNRSLLRKIVDPETKERI-----YDRNSDSLDGLPV 231 (324) Q Consensus 168 -----~~~~~~~~~~~i~~~~-~-----~l~~~~~~~~~~v~~~~~~~~L~~l~d~~g~~~~-----~~~~~~~l~G~pv 231 (324) ...++..+..+|++++ . .++.......+|+||.+.+..|++..-..++..+ .+...-.+.|+|| T Consensus 241 NIdvs~l~~d~~~~~~l~~lmi~a~~~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~n~~l~~~~~~g~~~t~~~gipi 320 (335) T protein:vir:73 241 NIDVTTLTKDASTGADLISMMVDAYYARDVAMLGDGKEVIYANKTIHAWLHKQAMNAKNVNLTIEEYGGKKIVSFLGIPI 320 (335) T ss_pred ecccccccccccchhhHHhhHHHHHHHHhccCCCCCceEEEechHHHHHHHHHHhccCceeeeeeccCCceeEEECCeEE Confidence 0000111223344432 2 3344455557899999999999864433333322 1222346899999 Q ss_pred EeecCccCCCceEEE Q lcl|NC_011614. 232 VNLKSSNLKRGELIT 246 (324) Q Consensus 232 ~~~~~~~~~~~~i~~ 246 (324) ..+++...++..+.. T Consensus 321 r~~Dail~tE~~v~~ 335 (335) T protein:vir:73 321 RRVDAILNTESAVTA 335 (335) T ss_pred EEEeeeecCcccccC Confidence 988877766665555 No 172 >protein:vir:79642 Length: 329 # NCBI annotation: HsbB # Family: family:all:463 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285525;genbank:gi:148734508;genbank:GeneID:5220000 Probab=98.43 E-value=1.4e-07 Score=58.10 Aligned_cols=295 Identities=8% Similarity=0.011 Sum_probs=159.3 Q ss_pred chhhHHHHHHHHHhhcc-chhhhhccc-cccccCCCcceec---hhhhHHHHHHHHhhcchhhhceeee---cCCCceEE Q lcl|NC_011614. 2 EQTQKLKLNLQHFASNN-VKPQVFNPD-NVMMHEKKDGTLL---NDFTTPILQEVMENSKIMQLGKYEP---MEGTEKKF 73 (324) Q Consensus 2 ~~~~~~~~~~~~~~~~~-~~~~~~~a~-~~~~~~~~g~lip---~~~~~~i~~~~~~~s~l~~l~~~~~---~~~~~~~i 73 (324) -|.|-...+++.+..+. ......+.. .....+..+.++- +.+.+.|++...+.-..+++..... 
....++.+ T Consensus 1 ~~~~~~~~~~~~d~~~~~~~a~~~~~~~~~~~~~~~~~f~~~ql~~id~~v~e~~~~~l~~~~~i~i~~~~~~~~~~~t~ 80 (329) T protein:vir:79 1 MRGNIMSKEMKYDEFEANVIANHMQLRGAKNDASDMGIWTSQELHKIKAQAYEKEYPAGSALRVFPVTSELSDTDKTFEY 80 (329) T ss_pred CccchhhhhhccchhhhhhHhhhcccccceeccchhhHHHHHHHHHHHHHHHhhhhcccchhhhcccccCCCCceeEEEe Confidence 34444444444333221 222222211 1222233345555 3346777877777777777766433 33334566 Q ss_pred EEEeCCcceeeeccc-ccccccccceeeEEeeeeeEEEeehhHHHHHhcC---hhHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_011614. 74 TFWADKPGAYWVGEG-QKIETSKATWVNATMRAFKLGVILPVTKEFLNYT---YSQFFEEMKPMIAEAFYKKFDEAGILN 149 (324) Q Consensus 74 p~~~~~~~a~~v~Eg-~~~~~~~~~~~~v~~~~~k~~~~v~iS~ell~~s---~~~~~~~v~~~l~~ai~~~~d~a~l~g 149 (324) ++.+....+.|++.+ ..+|..+..++......+.++..+.++..=++.+ ..++...-....++++++.+|+-+|+| T Consensus 81 ~~~~~~G~a~~~~d~~~dip~vd~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i~f~G 160 (329) T protein:vir:79 81 QTFDKVGHAKIIADYTDDLSTVDALMTSEFGKVFRLGNAFLISIDEIKAGQRTGKSLSTRKANAAQNAHDQLVNHLVFKG 160 (329) T ss_pred eeeecceeeeeecCcccccceeecccceeEEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEee Confidence 666767788999764 5678888888887788888888877776444434 357888888999999999999999999 Q ss_pred cCcC-cCCccccccccccccee---------ecccchhHHHHHHHHhhhh--c-cCCCEEEEcHHHHHHHHHhhccCCce Q lcl|NC_011614. 150 QGNN-PFGKSIAQSIEKTNKVI---------KGDFTQDNIIDLEALLEDD--E-LEANAFISKTQNRSLLRKIVDPETKE 216 (324) Q Consensus 150 ~g~~-~~~~~~~~~~~~~~~~~---------~~~~~~~~i~~~~~~l~~~--~-~~~~~~v~~~~~~~~L~~l~d~~g~~ 216 (324) +... ..|......+....... +...-++|+.+++.++... + ..+..++++|+.+..|.......|.. 
T Consensus 161 ~~~~g~~GLlN~p~v~~~~~~~~~~~~w~~kt~~ei~~di~~~~~~l~~~s~g~~~p~~L~Lpp~~~~~L~~~~~~~~~t 240 (329) T protein:vir:79 161 SKPHKIISVFEHPNLTTINSAGWNNAAGTGKKPETAQDELEQAIEKIETLTNGQHRANMILIPPSMRKVLMVRMPETTMS 240 (329) T ss_pred cccccceeeecCCCccccccCCCCCccccccCHHHHHHHHHHHHHHHHHhcCceecccEEEecHHHHHHhhcccCCCCcc Confidence 7542 22222222222111110 0111256778888887543 2 44578999999998886544333422 Q ss_pred eec---c-CCCceecccceEeecCccCCCceEEEeecc--cEEEEEecceEEEEeecccccccccccccchhhhhcCc-- Q lcl|NC_011614. 217 RIY---D-RNSDSLDGLPVVNLKSSNLKRGELITGDFD--KLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDM-- 288 (324) Q Consensus 217 ~~~---~-~~~~~l~G~pv~~~~~~~~~~~~i~~gd~~--~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~-- 288 (324) ++. . ..+-+|.+.|-... +...+...++..+.+ .+-+.....++.... +... T Consensus 241 vl~~lk~~~~~l~I~~~~el~~-ag~~g~~~~v~y~~~~~~~~~~vp~~~~~l~~-------------------q~~~~~ 300 (329) T protein:vir:79 241 YLDYFKQQNGGITIESISELED-IDGAGTKAALVYEKDPMNMSIEIPEAFNMLTA-------------------QPKDLH 300 (329) T ss_pred HHHHHHHhCCCcEEEEcccccc-cCCCCceEEEEEecCCceEEEecCcceeeeec-------------------eecCce Confidence 211 1 11223444443221 112222333332222 222222233322111 1111 Q ss_pred EEEEEEEEec-cEEecccceEEEEeeccC Q lcl|NC_011614. 289 VALRATMHVA-LHIADDKAFAKLVPADAK 316 (324) Q Consensus 289 v~~r~~~r~d-~~v~~~~a~~~l~~~~~~ 316 (324) ..+....|++ ..+.+|.|++++.+.... T Consensus 301 ~~v~~~~r~~Gv~i~~P~ai~~~dGI~~~ 329 (329) T protein:vir:79 301 FKVPCTSKCTGLTIYRPLTLVLIKGLVVG 329 (329) T ss_pred EEEceeeeEEEEEEECcceeeeeeeeeeC Confidence 2233345554 667779999999987665 No 173 >protein:vir:95131 Length: 325 # NCBI annotation: hypothetical protein ORF010 # Family: family:all:47 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293417;genbank:gi:148912838;genbank:GeneID:5228206 Probab=98.38 E-value=1.2e-07 Score=58.52 Aligned_cols=275 Identities=10% Similarity=-0.014 Sum_probs=127.5 Q ss_pred CchhhHHHHHHHHHhhccchhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhc-------eeeecCCCceEE Q lcl|NC_011614. 
1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLG-------KYEPMEGTEKKF 73 (324) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~-------~~~~~~~~~~~i 73 (324) |..+.-. ...+.+....++.+.+.....+.+ ...++.+.-+.+ T Consensus 1 m~lsD~~------------------------------vfN~~~~~a~~e~~~q~~~~fn~as~gai~l~~~~~~Gd~~~~ 50 (325) T protein:vir:95 1 MALSDLA------------------------------VYSEYAYSAFSETLRQQVDLFNTATGGAIMLQSAAHQGDFSDV 50 (325) T ss_pred Cchhhhh------------------------------hhhhhhhhhhhhhhhhhHhhhhhcccceeEeccccccCceeec Confidence 2222211 112333333444444333333221 123344666678 Q ss_pred EEEeCC-c---ceeeecccccccccccc-eeeEEeeeeeEEEe--ehhHHHHHh-cChhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011614. 74 TFWADK-P---GAYWVGEGQKIETSKAT-WVNATMRAFKLGVI--LPVTKEFLN-YTYSQFFEEMKPMIAEAFYKKFDEA 145 (324) Q Consensus 74 p~~~~~-~---~a~~v~Eg~~~~~~~~~-~~~v~~~~~k~~~~--v~iS~ell~-~s~~~~~~~v~~~l~~ai~~~~d~a 145 (324) |.|..- . +..-+.+...++..+.+ ..++..+..+-.+. ..++..+.. +....+.+.|.++++++..+.+-+. T Consensus 51 pf~~~l~g~~~~~~~~~~~~~vt~~kitt~~~~av~~~r~~g~~~~d~~~~~~g~~~~~~~~~~Ig~~~a~~~~~~~l~~ 130 (325) T protein:vir:95 51 AFFAKVTGGLVRRRNAYGSGTVAEKVLKHLVDTSVKVAAGTPPVRLDPGQFRWIQQNPEVAGAAMGQQLAVDTMADMLNV 130 (325) T ss_pred cccccccccccccccCCCCceeccceeccccceeeEEecccCcccccHHHHhhcCCCHHHHHHHHHHHHHHHHHHHHHHH Confidence 877532 1 22334444455555544 44444444433332 233332222 2223445555555555554444444 Q ss_pred HHhccCcC--cCCcccccccccccceeecccchhHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCceeeccCC- Q lcl|NC_011614. 146 GILNQGNN--PFGKSIAQSIEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDRN- 222 (324) Q Consensus 146 ~l~g~g~~--~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~g~~~~~~~~- 222 (324) +|.+.... ....... .+..........+++..+.++.+++.+....-+.|+||..++..|.+..-.+...++.... 
T Consensus 131 ~~~~l~~a~~~~~~~v~-dis~~~~~~~~~~s~~~l~~A~~klGD~~~~l~~~~MHS~v~~~L~~~~L~~~~~~~~~~g~ 209 (325) T protein:vir:95 131 GLGSVYSALSQVSDVVY-DATANTDAADKLPTWNNLNNGQAKFGDQSSQIAAWIMHSTPMHKLYGSNLTNGERLFTYGTV 209 (325) T ss_pred HHHHHHHhhccccccee-eeecccCcccccccHHHHHHHHHHhcccccceeEEEEchHHHHHHHHhhccccccccccCCc Confidence 44322110 0001111 1111111122235788999999999988888899999999999998755444433332221 Q ss_pred --CceecccceEeecCccCCCc------eEEEeecccEEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEE Q lcl|NC_011614. 223 --SDSLDGLPVVNLKSSNLKRG------ELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRAT 294 (324) Q Consensus 223 --~~~l~G~pv~~~~~~~~~~~------~i~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~ 294 (324) .++++|++|++.++.+.... ..|.--...+.++...+.... .+.... -.+...++| T Consensus 210 ~~i~t~~G~~VIVdD~~p~~~~g~~~~ytty~lg~GAi~~~~~~~~~~~--------~~~~~~------~~~~~~~~~-- 273 (325) T protein:vir:95 210 NVVRDPFGKLLVMTDSPNLFAAGTPNVYHILGLVPGGVLIGQNNDFDAN--------EETKNG------DENIIRTYQ-- 273 (325) T ss_pred ccccccCCcEEEEeCCCCCCCccCceeEEEEEEecCeEEecCCCCcccc--------ccccCc------ccceeeeee-- Confidence 24789999999876554321 111111112222222221111 111110 022233344 Q ss_pred EEeccEEecccceEEEEeeccCCCCccccC Q lcl|NC_011614. 295 MHVALHIADDKAFAKLVPADAKPSSVPGEV 324 (324) Q Consensus 295 ~r~d~~v~~~~a~~~l~~~~~~~~~~~~~~ 324 (324) .|+- -+++|.++.. +......++|-+|. T Consensus 274 ~~~t-f~lhp~G~sw-~~s~~g~sPt~aeL 301 (325) T protein:vir:95 274 AEWS-YNIGVKGFAW-DKANGGKSPTDAAL 301 (325) T ss_pred eeee-EEeecceeee-ecccccCCcChHhh Confidence 2332 3789999988 33333445666676 No 174 >protein:vir:99075 Length: 392 # NCBI annotation: gp30 # Family: family:all:10837 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655895;genbank:gi:109521467;genbank:GeneID:4158040 Probab=98.33 E-value=3.1e-07 Score=56.26 Aligned_cols=277 Identities=9% Similarity=0.018 Sum_probs=138.5 Q ss_pred ccCCCcceechhhhHHHHHHHHhhcchhhhceee---ec---CCCceEEEEEeCCcceeee-----ccccccccccccee Q lcl|NC_011614. 
31 MHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYE---PM---EGTEKKFTFWADKPGAYWV-----GEGQKIETSKATWV 99 (324) Q Consensus 31 ~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~---~~---~~~~~~ip~~~~~~~a~~v-----~Eg~~~~~~~~~~~ 99 (324) ++ .-.++|+.|+.++++.+++..++.+++.+- .. .+..++||+... ..+.+. +++.++...+.+-+ T Consensus 1 Ma--~~~~~p~~~a~~~l~~l~~~lv~~~lv~~~~~~~~~~~~GdtV~i~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (392) T protein:vir:99 1 MA--NAFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAP-SRGHTRKLRGAGAERNLTVSDFTED 77 (392) T ss_pred Cc--cccccHHHHHHHHHHHHHhhccchhhhccccccccccCCCCeEEEeeccc-ccceeeeccccccCCcccccccccc Confidence 22 234889999999999999999998887431 11 255678887543 333332 34555555666667 Q ss_pred eEEeee-eeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhccCcCcCCcccccccccccceeecccchhH Q lcl|NC_011614. 100 NATMRA-FKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDN 178 (324) Q Consensus 100 ~v~~~~-~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 178 (324) ++++.. +....-+.|+++-...+..++...+.++..++++.++|..++.--..... ..............++. T Consensus 78 ~~~~~id~~k~~~~~i~d~e~~~~~~~~~~~~~~~a~~ala~~vd~~i~~~~~~a~~------~~~~~~~~~~~~~~~~~ 151 (392) T protein:vir:99 78 SFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPY------EAAGAVHEVAPDEFFKG 151 (392) T ss_pred eEEEEEeeeeecceeechHHHhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccc------cccccccccChhhhHHH Confidence 777766 33455577778766667778888899999999999999887742111100 00111111223456888 Q ss_pred HHHHHHHhhhhccCCC-EEEEcHHHHHHHHHh---h--ccCC---ceeeccCCCceecccceEeecCccCCCceEEEeec Q lcl|NC_011614. 179 IIDLEALLEDDELEAN-AFISKTQNRSLLRKI---V--DPET---KERIYDRNSDSLDGLPVVNLKSSNLKRGELITGDF 249 (324) Q Consensus 179 i~~~~~~l~~~~~~~~-~~v~~~~~~~~L~~l---~--d~~g---~~~~~~~~~~~l~G~pv~~~~~~~~~~~~i~~gd~ 249 (324) ++++..+|...+.... .++++|..+..|.+. . +.-| ...+..+..++++|++++.+...+.. ..+.+.. 
T Consensus 152 i~~a~~~L~~~~vP~~R~~vv~p~~~~~l~~~~~~~~~~~~g~~~~~~l~~G~vg~i~G~~v~~s~~~~~~--t~~a~~~ 229 (392) T protein:vir:99 152 VNGARRALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGRIYGYEIVESTLIPHG--DAYLYHP 229 (392) T ss_pred HHHHHHHHhhcCCCCCCEEEEcHHHHHHHhcccceeecccccchhhhhhhcceeeeeeeeEEEeecccccc--cceeeec Confidence 9999999987665433 578899998887642 1 1112 23345666789999999886654333 3333332 Q ss_pred ccEEEEEecceEE-------EEeec--ccccccccccccchhhhhcCcEEEEEEEEeccEEec---ccceEE---EEeec Q lcl|NC_011614. 250 DKLIYGIPQLIEY-------KIDET--AQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIAD---DKAFAK---LVPAD 314 (324) Q Consensus 250 ~~~~~~~~~~~~i-------~~~~~--~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~---~~a~~~---l~~~~ 314 (324) +.+.......... ..+.. .......+.+. -+..+...+ ....+..... ..++.. ++... T Consensus 230 ~a~~~at~a~v~~~~~~~~~s~s~~~~v~~~~~~~~~~----t~~s~~~~v--~~~~g~~~v~~~~~~~~~~~~~~~~~~ 303 (392) T protein:vir:99 230 TAFIMATRAPAPPMGAVRSTAISGDQRIAMRWLVDYDS----TITSNRSLI--DTYFGLKVVEDPNGVGFVRARKIHLIP 303 (392) T ss_pred cccccccccccccccccceeEEecccceecceeecccc----eeecccccc--ceeEEEEEEeeccccceeeeeeeeeec Confidence 2222211110000 00000 00000000000 000111111 0011111111 111111 01000 Q ss_pred cCCCCccccC Q lcl|NC_011614. 315 AKPSSVPGEV 324 (324) Q Consensus 315 ~~~~~~~~~~ 324 (324) ..-+.+|..+ T Consensus 304 ~~v~v~~v~~ 313 (392) T protein:vir:99 304 GSIEVAPEAG 313 (392) T ss_pred ceeeeeeeec Confidence 0000011000 No 175 >protein:vir:108303 Length: 418 # NCBI annotation: hypothetical protein # Family: family:all:1412 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552282;genbank:gi:160700607;genbank:GeneID:5758819 Probab=98.30 E-value=1.3e-06 Score=52.94 Aligned_cols=278 Identities=10% Similarity=-0.026 Sum_probs=135.9 Q ss_pred cccCCCcceechhhhHHHHHHHHhhcchhhhceeee-----cCCCceEEEEEeCCcceeeecccccccccccceeeEEee Q lcl|NC_011614. 
30 MMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEP-----MEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMR 104 (324) Q Consensus 30 ~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~-----~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~ 104 (324) |.+..+..+-|+.|+.++++.+++..++.+++.+-. -.+..++||+... .-+.++..+...+.+-.++.++ T Consensus 1 m~~~~N~~ltp~iia~~~l~~l~~~lV~~~lv~r~y~~e~~~~GDTV~I~vp~~----~~v~dg~~~~~~~~te~~v~l~ 76 (418) T protein:vir:10 1 MAVQDNNLLTDDVIAKEALRLLKNNLVMAKCVYRNYEKTFGKVGDTIRLKLPYR----VKSASGRTLVKQPMVDQTIPFK 76 (418) T ss_pred CCccccccccHHHHHHHHHHHHHHhccchhhhcCCCchHHhhCCCEEEEeeCCc----eeecccCCccccccccceEEEE Confidence 556566677799999999999999999988875421 1245778887331 2233455555555555666666 Q ss_pred e-eeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhccCcCcCCcccccccccccceeecccchhHHHHHH Q lcl|NC_011614. 105 A-FKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLE 183 (324) Q Consensus 105 ~-~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~ 183 (324) . +.....+.++++-...+..++.+.+.+...++++..+|..++.-- .+ .........+....++++.++. T Consensus 77 id~~k~~~~~itD~e~a~~~~d~~~~~l~~A~~aLA~~vD~~ia~l~-~~--------a~~~~gt~gt~~~~~~~i~~a~ 147 (418) T protein:vir:10 77 IAYQEHVGLEYTVKDKTLDIMQFSERYLKSGMVQIANQIDRSLALTL-KK--------AFHSSGTPGVRPGAFIDFANAG 147 (418) T ss_pred EecccccceeechHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHH-hh--------cccccccCCcCcchHHHHHHHH Confidence 5 334555677776556677789889999999999999998876421 00 0011111112234589999999 Q ss_pred HHhhhhccCC-C--EEEEcHHHHHHHHHhhcc----C-CceeeccCCCceecccceEeecCccCCCceEEEeecc--cEE Q lcl|NC_011614. 184 ALLEDDELEA-N--AFISKTQNRSLLRKIVDP----E-TKERIYDRNSDSLDGLPVVNLKSSNLKRGELITGDFD--KLI 253 (324) Q Consensus 184 ~~l~~~~~~~-~--~~v~~~~~~~~L~~l~d~----~-g~~~~~~~~~~~l~G~pv~~~~~~~~~~~~i~~gd~~--~~~ 253 (324) .+|...+... . ..+++|..+..|.+-... . ....+..+..+++.|+.++.++..+.... |.+. ... 
T Consensus 148 ~~Ld~~~VP~~G~R~lVv~P~~~~~L~~~~~~~~~~~~~~~~lr~G~IG~i~GF~V~~S~nip~~ta----g~~~~t~~v 223 (418) T protein:vir:10 148 AKQTTYAVPQDGMRHAVLDPFTCASLSDEVTKLFKESMVEQAYKMGYRGNVAAYEVYESQNLPKHTV----GDHGGTPLV 223 (418) T ss_pred HHHHhcCCCCCCceEEEeCHHHHHHHhhhccccccccccchhhheeeeeeeeceEEEEecCCCcccc----cccccceee Confidence 9998877653 2 357999998877532111 1 12234566678899999988765432110 1110 111 Q ss_pred EEE-ecceEEEEeeccccc-ccccccc--cchhhhhc---------CcEEEEEEEEec------cEEe-cccc-----eE Q lcl|NC_011614. 254 YGI-PQLIEYKIDETAQLS-TVKNEDG--TPVNLFEQ---------DMVALRATMHVA------LHIA-DDKA-----FA 308 (324) Q Consensus 254 ~~~-~~~~~i~~~~~~~~~-~~~~~~~--~~~~~f~~---------~~v~~r~~~r~d------~~v~-~~~a-----~~ 308 (324) .|. ..+..+.++-..... .....++ .+-..|.- +...|++..-.. ..|. .|.- .. T Consensus 224 ~ga~~~~~~~~~~~~t~s~~g~l~~Gd~~ti~gv~~v~~~t~~~~~~~~~f~V~~~~~~~~~~~~tv~i~p~~~~~~~~~ 303 (418) T protein:vir:10 224 NGTVVNGDTVGFDGGTASTTGFLKAGDVITFGGVFGVNPQNYETTGLLQEFVVLEDVDTDAGGAGSIKISPSLNDGTATI 303 (418) T ss_pred ecccccceeEEEeecceeeccceeeccEEEECceeecccccccccccceEEEEEeeccccccCcceeEeccccccccccc Confidence 111 111111111000000 0000000 00000000 111122222110 0110 0000 00 Q ss_pred EEEeeccCCCCccccC Q lcl|NC_011614. 309 KLVPADAKPSSVPGEV 324 (324) Q Consensus 309 ~l~~~~~~~~~~~~~~ 324 (324) .-+.....+......| T Consensus 304 ~~~~~~~~~~~~~~~v 319 (418) T protein:vir:10 304 NNENGDPVSLTAYQNV 319 (418) T ss_pred cccccccccccCCCcc Confidence 0000000000001111 No 176 >protein:vir:95512 Length: 693 # NCBI annotation: Putative Clp protease # Family: family:all:62 # ACLAME annotation(s): go:0008236 - serine-type peptidase activity; phi:0000017 - phage prohead/capsid assembly # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293349;genbank:gi:148912770;genbank:GeneID:5228164 Probab=98.18 E-value=8e-07 Score=54.02 Aligned_cols=300 Identities=10% Similarity=0.031 Sum_probs=147.0 Q ss_pred CchhhHHHHH-HHHHhhccch------------hhhhccccccccCCCcceechhhhHHHHHH-HHhh-cchhhhceeee Q lcl|NC_011614. 
1 MEQTQKLKLN-LQHFASNNVK------------PQVFNPDNVMMHEKKDGTLLNDFTTPILQE-VMEN-SKIMQLGKYEP 65 (324) Q Consensus 1 m~~~~~~~~~-~~~~~~~~~~------------~~~~~a~~~~~~~~~g~lip~~~~~~i~~~-~~~~-s~l~~l~~~~~ 65 (324) .++.|.+... ++..++..+. .-.-++. ..+|++-+.+ -..+.++.+.. -+.. ......++..+ T Consensus 357 ~~~~n~~~g~~L~elAr~~L~~rg~~~~~~~~~~~~~~a~-~htTSDFp~I-L~~~~nk~l~~~y~~a~~t~~~~~~~~~ 434 (693) T protein:vir:95 357 RQADNAYNGMTLRELARASLVDRGIGVASLNAPQMVGLAF-THTSSDFGLI-LLDVANKSVLAGWEEAEETFPLWTKSGI 434 (693) T ss_pred ccCCccccCCcHHHHHHHHHHhcCCccCCCCHHHHHHHHH-hcCcchhHHH-HHHHHHHHHHHHHHhhhhHHHHHhccCC Confidence 2222222111 1111111111 1111121 1233333332 23333333322 2111 12333333333 Q ss_pred cCCC-ceEEEEEeCCcceeeecccccccccccceeeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011614. 66 MEGT-EKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDE 144 (324) Q Consensus 66 ~~~~-~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~ 144 (324) .+-- ........+-++-.-|.|++++......=+.-++...++|.++.||++++-+-..++.+.|...++++.++.+++ T Consensus 435 ~~DFk~~~~~~lg~~~~L~~V~E~gEyk~~t~~e~~e~~~l~tyG~~~~iTRqaiINDDLga~~~ip~~~g~aA~~~~~~ 514 (693) T protein:vir:95 435 LTDFKPARRVGLGEFSSLRQVREGAEYKYVTLGERGEQIILATYGELFSITRQAIINDDLQMLSDIPFKLGQAAKATIGD 514 (693) T ss_pred CCcccccceeecCCCCChhhcCCCCceeeeecCCccceeehhhcCCeeeecHHhhhccchHHHHHHHHHHHHHHHHHHHH Confidence 2211 111223344456667889988865444434467788999999999999998888888999999999999999997 Q ss_pred HHH---hccCcCcCCccccccccccc-ceeecccchhHHHHHHHHhhh------------hccCCCEEEEcHHHHHHHHH Q lcl|NC_011614. 145 AGI---LNQGNNPFGKSIAQSIEKTN-KVIKGDFTQDNIIDLEALLED------------DELEANAFISKTQNRSLLRK 208 (324) Q Consensus 145 a~l---~g~g~~~~~~~~~~~~~~~~-~~~~~~~~~~~i~~~~~~l~~------------~~~~~~~~v~~~~~~~~L~~ 208 (324) .++ .++..-..+..++.+.-..- ......++.+.+..+..++.. 
-+..|..|++++.......+ T Consensus 515 ~vy~~L~~Np~m~DGk~LFhadH~Nl~tga~sals~~sl~~a~~am~~qk~~~~~~~g~~L~i~P~~llvP~~le~~a~~ 594 (693) T protein:vir:95 515 LVYAVLTGNPAMSDGKTLFHADHSNLLTGAASALSIDSLSKAKTQMATQKAQVEKGKGRTLNIRPGFVLTPVALEDKANQ 594 (693) T ss_pred HHHHHHhcCccccCCcceeeccccccccccccccChHHHHHHHHHHHHhhcchhccCCceeecccceEEecchHHHHHHH Confidence 555 33332223333333322221 112345666666555333311 12456678888888776666 Q ss_pred hhccCCcee--eccCCCceeccc-ceEeecCccC-CCce-EEEeecccEEEEEecceEEEEeecccccccccccccchhh Q lcl|NC_011614. 209 IVDPETKER--IYDRNSDSLDGL-PVVNLKSSNL-KRGE-LITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNL 283 (324) Q Consensus 209 l~d~~g~~~--~~~~~~~~l~G~-pv~~~~~~~~-~~~~-i~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~ 283 (324) +..+...+. ...+....+.|+ .++..+-... .... .++.+... ..+++- .++....+.-..... T Consensus 595 l~~s~~~~~a~~~~~~~NP~~~~~~vi~~prL~~~s~~~Wyl~a~~~~------dtie~~-----yL~G~~~P~ie~~~g 663 (693) T protein:vir:95 595 IINSESVPGADVNSGIVNPIRAFAQVIGEPRLDDASATAWYMAAKKGS------DTIEVA-----YLDGVDTPYLEQQEG 663 (693) T ss_pred HhccccccccccccccccchhccccccccceecCCCCCceEEecCCCC------CeEEEE-----EecCCCCCeEeecCC Confidence 553322111 011112234443 2322211110 1011 11222110 011221 222222222222335 Q ss_pred hhcCcEEEEEEEEeccEEecccceEEEEee Q lcl|NC_011614. 
284 FEQDMVALRATMHVALHIADDKAFAKLVPA 313 (324) Q Consensus 284 f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~ 313 (324) |..|.+.+|+...||.+++|=.++.+=.++ T Consensus 664 f~~dG~~~kvr~D~G~~~iD~Rg~~kn~GA 693 (693) T protein:vir:95 664 FTVDGVASKVRIDAGVAPLDFRGLQKSNGA 693 (693) T ss_pred CCcceEEEEEEEeccCceeeccccccCCCC Confidence 999999999999999999999988887776 No 177 >protein:vir:78387 Length: 349 # NCBI annotation: putative coat protein # Family: family:all:1522 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110837;genbank:gi:134288598;genbank:GeneID:5179650 Probab=98.15 E-value=3.5e-06 Score=50.50 Aligned_cols=274 Identities=12% Similarity=0.057 Sum_probs=137.8 Q ss_pred cccCC-Ccceech--hhhHHHHHHHHhhcchhhhce---------eeecCCCceEEEEEeC-Cc--ceeeeccc--cccc Q lcl|NC_011614. 30 MMHEK-KDGTLLN--DFTTPILQEVMENSKIMQLGK---------YEPMEGTEKKFTFWAD-KP--GAYWVGEG--QKIE 92 (324) Q Consensus 30 ~~~~~-~g~lip~--~~~~~i~~~~~~~s~l~~l~~---------~~~~~~~~~~ip~~~~-~~--~a~~v~Eg--~~~~ 92 (324) |.++. ....+|+ .+...+.+...+.+.|++-.- ....++..+.+|.|.. .. +..+-+.+ ...+ T Consensus 1 Ma~T~l~D~iipe~~vf~~Yv~~~~~e~~~l~qSGii~~d~~l~~~~~~gG~~~~iPf~~~L~g~~e~nv~~D~~~~~~t 80 (349) T protein:vir:78 1 MAITTIGDIVTGNIPVLASYMTEDPVEKTAFFDSGILTSTPYAAEIANGPSNIANLPFWKAIDTSIEPNYSNDVYQDIAT 80 (349) T ss_pred CCceEEeeeeccCHHHHHHHHHHhhHHhhhhhhccceeccHHHHHHhhcCCCEEEeeeeecCCCCcccccCCCCcccccc Confidence 23333 3344565 366666666666666554221 1234566789999864 22 22222332 2334 Q ss_pred ccccceeeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHh---ccCc-CcCCccccccc-cccc Q lcl|NC_011614. 93 TSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGIL---NQGN-NPFGKSIAQSI-EKTN 167 (324) Q Consensus 93 ~~~~~~~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~---g~g~-~~~~~~~~~~~-~~~~ 167 (324) ..+.+-++-....+..+.....++=...-|..++.+.|.+++++...+...+.+|. |-=. +.......... ..+. 
T Consensus 81 ~~kitt~~~~a~~~~r~kaw~~~Dla~~lsG~dpm~~Ia~~va~yW~r~~q~~Lia~L~Gvf~~~~~a~~~~~~~~~~t~ 160 (349) T protein:vir:78 81 PRAIQTGEMMARVAYLNEGFGQADLTVELTSQNPLQSVASRLDNFWQRQAQRRLIATALGLYNDNVSATDAYHEQNDMVV 160 (349) T ss_pred cccccccceeeeeeeeccccchhHHHHHhhCchHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcccccccchhhhccccee Confidence 44544443333333334444444422333445788999999998887777765553 2100 00000000000 0011 Q ss_pred c-eeecccchhHHHHHHHHhhhh-----ccCCCEEEEcHHHHHHHHHhh------ccCCceeeccCCCceecccceEeec Q lcl|NC_011614. 168 K-VIKGDFTQDNIIDLEALLEDD-----ELEANAFISKTQNRSLLRKIV------DPETKERIYDRNSDSLDGLPVVNLK 235 (324) Q Consensus 168 ~-~~~~~~~~~~i~~~~~~l~~~-----~~~~~~~v~~~~~~~~L~~l~------d~~g~~~~~~~~~~~l~G~pv~~~~ 235 (324) . .....++...++++..++.+. ...-+.++||+.++..|.+.+ +.+| ....++++|++|++.+ T Consensus 161 d~s~~a~~~~~~~~dA~~~lgda~~Gd~~~~lt~i~mHS~v~~~L~~~~li~~i~~s~~-----~~~i~ty~G~~VivDD 235 (349) T protein:vir:78 161 DVSATLGFDAGAFIDATQTMGDALMGNGGEVLGAIAMHSFVYAQARKAQLIDFIRDAEN-----NTMFATYQGYRVIVDD 235 (349) T ss_pred eeccccCCChhhhhhhHHHHHHHhccccccceeEEEEchHHHHHHHhhhhhhhccCccc-----CcccceecCeEEEEeC Confidence 1 122236777888888777665 233478999999999987543 2222 2224678999999987 Q ss_pred CccCCC------ce-EEEeecccEEEEEecc-eEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccce Q lcl|NC_011614. 236 SSNLKR------GE-LITGDFDKLIYGIPQL-IEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAF 307 (324) Q Consensus 236 ~~~~~~------~~-i~~gd~~~~~~~~~~~-~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~ 307 (324) +.++.. .. .+|| -..+.++.... ..+++.++..... ..++..+..+.++ +++|.++ T Consensus 236 ~~Pv~~~g~~~~yttylfg-~GAi~~~~~~~~~~~et~rd~~~g~------------~~G~d~l~~R~~~---~~hp~G~ 299 (349) T protein:vir:78 236 SMTVVGQGAQRKFISIIFG-QGAIGYGEGNPVMPLEYEREASRAN------------GGGVETLWTRKTW---LLHPFGY 299 (349) T ss_pred CCccccCCCCceEEEEEee-cceEEEccCCCccceeeecccccCC------------cceeEEEEEeeEE---Eeeeeee Confidence 766532 12 2333 23333443222 2355555542100 0122223222332 6778887 Q ss_pred EEEEeecc-------CCCCccccC Q lcl|NC_011614. 
308 AKLVPADA-------KPSSVPGEV 324 (324) Q Consensus 308 ~~l~~~~~-------~~~~~~~~~ 324 (324) ..-+.... ..++|.+|. T Consensus 300 s~~~a~v~~~~~~~~~~sPt~aeL 323 (349) T protein:vir:78 300 RFTSAVITGNGTETIARSASWQDL 323 (349) T ss_pred eeccccccCCccccccCCCChHHh Confidence 77654333 245666677 No 178 >protein:vir:94989 Length: 349 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224029;genbank:gi:62327316;genbank:GeneID:5176817 Probab=97.97 E-value=8.7e-06 Score=48.32 Aligned_cols=274 Identities=11% Similarity=0.059 Sum_probs=136.6 Q ss_pred cccCC-Ccceech--hhhHHHHHHHHhhcchhhhcee---------eecCCCceEEEEEeC-Cccee--eeccc--cccc Q lcl|NC_011614. 30 MMHEK-KDGTLLN--DFTTPILQEVMENSKIMQLGKY---------EPMEGTEKKFTFWAD-KPGAY--WVGEG--QKIE 92 (324) Q Consensus 30 ~~~~~-~g~lip~--~~~~~i~~~~~~~s~l~~l~~~---------~~~~~~~~~ip~~~~-~~~a~--~v~Eg--~~~~ 92 (324) |.++. ....+|+ .+...+.+...+.+.|++-.-. ...++..+.+|.|.. ..++. +-+.. ...+ T Consensus 1 Ma~T~l~D~iipe~~vf~~Yv~~~~~e~~~l~qSGii~~d~~l~~~~~~gG~~~~iPf~~~l~g~~e~n~~~dt~~~~~t 80 (349) T protein:vir:94 1 MAITTIGNIVTGNIPVLASYMTEDPVEKTAFFNSGILTPTPYAAEIARGPSNIANLPFWKAIDTSIEPNYSNDVYQDIAT 80 (349) T ss_pred CCceEEeeeeccChHHHHHHHHHhHHHhhhhhhccceeccHHHHHHHhcCCCEEEeeeeecCCCCcccccCCCCcccccc Confidence 22333 3345565 3666666666666666553211 234566778998864 23322 22221 2344 Q ss_pred ccccceeeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHh---ccCc-CcCCccccc-cccc-c Q lcl|NC_011614. 93 TSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGIL---NQGN-NPFGKSIAQ-SIEK-T 166 (324) Q Consensus 93 ~~~~~~~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~---g~g~-~~~~~~~~~-~~~~-~ 166 (324) ..+.+-++-.....-.+.....++=...-+..++.+.|.+++++...+...+.+|. |-=. +........ .... . 
T Consensus 81 ~~kit~~~~~a~~~~r~kaw~~~Dla~~lsG~dpm~~Ia~~va~yW~r~~q~~Lia~L~Gvf~~~~~~~~~~~~~~~~~~ 160 (349) T protein:vir:94 81 PRAIQTGEMMARVAYLNEGFGQADLTVELTSQNPLQSVASRLDNFWQRQAQRRLIATALGLYNDNVSATDAYHEQNDMVV 160 (349) T ss_pred cccccccceeeeeeeeccccchhHHHHHhhCchHHHHHHHHHHHHHhhHHHHHHHHHHHhhhcccccccccccccCceeE Confidence 44444333222222233333333322233445788999999998888887776554 2100 000000000 0000 0 Q ss_pred cceeecccchhHHHHHHHHhhhh-----ccCCCEEEEcHHHHHHHHHhh------ccCCceeeccCCCceecccceEeec Q lcl|NC_011614. 167 NKVIKGDFTQDNIIDLEALLEDD-----ELEANAFISKTQNRSLLRKIV------DPETKERIYDRNSDSLDGLPVVNLK 235 (324) Q Consensus 167 ~~~~~~~~~~~~i~~~~~~l~~~-----~~~~~~~v~~~~~~~~L~~l~------d~~g~~~~~~~~~~~l~G~pv~~~~ 235 (324) .......++...++++..++.+. ...-+.++||+.++..|.+.+ +.+|. ...++++|++|++.+ T Consensus 161 d~~~~a~~~~~~~~~A~~~~Gdaa~Gd~~~~lt~i~mHS~v~~~L~~~~li~~i~~s~~~-----~~i~ty~G~~VivDD 235 (349) T protein:vir:94 161 DVSATSGFDAGAFIDATQTMGDALMGNGGEVLGAIAMHSFVYAQARKAQLIDFIRDAENN-----TMFATYQGYRVIVDD 235 (349) T ss_pred EecccCCCChhhHHHHHHHHHHHhccccccceeEEEEchHHHHHHHhcchhhhccCcccC-----cccceecCcEEEEeC Confidence 11122346777788888777664 233478999999999987643 22221 224679999999987 Q ss_pred CccCC------CceE-EEeecccEEEEEec-ceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccce Q lcl|NC_011614. 236 SSNLK------RGEL-ITGDFDKLIYGIPQ-LIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAF 307 (324) Q Consensus 236 ~~~~~------~~~i-~~gd~~~~~~~~~~-~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~ 307 (324) +.+.. +... +|| -..+.++... ...+++.++..... ..++..+....++ +++|.++ T Consensus 236 ~~Pv~~~g~~~~yttylfg-~GAi~~~~~~~~~~~E~~rd~~~g~------------~~G~d~L~~R~~~---~~hp~G~ 299 (349) T protein:vir:94 236 SMTVVGQDTSRKFISIIFG-QGAIGYGEGNPEMPLEYEREASRAN------------GGGVETLWTRKTW---LLHPFGY 299 (349) T ss_pred CCccccCCCCceEEEEEee-cceEEeecCCCCcceeeecccccCC------------cceeEEEEEeeEE---Eeeeeee Confidence 66542 2222 233 2334444432 22355555542110 0112222222222 6788888 Q ss_pred EEEEeecc-------CCCCccccC Q lcl|NC_011614. 
308 AKLVPADA-------KPSSVPGEV 324 (324) Q Consensus 308 ~~l~~~~~-------~~~~~~~~~ 324 (324) ..-+.... ..++|.+|. T Consensus 300 s~~~a~v~~~~~~~~~~sPt~aeL 323 (349) T protein:vir:94 300 SFTSAVITGNGTETIARSASWQDL 323 (349) T ss_pred eecccccCCCccccccCCCChHHh Confidence 77764333 235666677 No 179 >protein:vir:99311 Length: 463 # NCBI annotation: putative capsid protein # Family: family:all:2450 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024474;genbank:gi:48696433;genbank:GeneID:2948039 Probab=97.79 E-value=5.3e-06 Score=49.52 Aligned_cols=300 Identities=13% Similarity=0.055 Sum_probs=150.9 Q ss_pred CchhhHHHHH----HHHHhhccchhhhhcccccc--ccC-CCcceechhhhHHHHHHHH--hhcchhhhceeeecCCCce Q lcl|NC_011614. 1 MEQTQKLKLN----LQHFASNNVKPQVFNPDNVM--MHE-KKDGTLLNDFTTPILQEVM--ENSKIMQLGKYEPMEGTEK 71 (324) Q Consensus 1 m~~~~~~~~~----~~~~~~~~~~~~~~~a~~~~--~~~-~~g~lip~~~~~~i~~~~~--~~s~l~~l~~~~~~~~~~~ 71 (324) |.+.++++.- +.+++.+..++ +.+.-.. .+. +++++=-+.+..+|..... +.-.+.+-..+.+..+... T Consensus 1 ~~~~~~~~~~~~~~~~~~~e~~~KS--~~tg~g~~p~~q~~~~AlR~EsL~~~i~~Lt~~~~~f~~~~~i~k~~a~STV~ 78 (463) T protein:vir:99 1 MTIEKNLSDVQQKYADQFQEDVVKS--FQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDISRRPAQSTVV 78 (463) T ss_pred CCcccccchHHHHHHhhhhHHHHHH--hhcCCccCCccccCcchhhhhhhhhhhheeeecccchhhhhhcCCchhhhhhh Confidence 7666665333 44555555433 2222111 112 2344434444444433222 2222333334444444333 Q ss_pred EEEE---EeCCcceeeecccccccccccceeeEEeeeeeEEEeehhHHHH-HhcChhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011614. 72 KFTF---WADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEF-LNYTYSQFFEEMKPMIAEAFYKKFDEAGI 147 (324) Q Consensus 72 ~ip~---~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~v~iS~el-l~~s~~~~~~~v~~~l~~ai~~~~d~a~l 147 (324) .+-. 
+.....+.++.|+...+.+++++.......+-++....+|.-+ +.++..+.+..+.+.-.-.++..+|.++| T Consensus 79 ~y~~~~~~G~~g~~~f~~E~g~~~~~d~~~~Rr~~~~K~l~~~~~VS~~~~l~n~~~d~~~~~~~dai~~ia~tiE~a~F 158 (463) T protein:vir:99 79 KYDQYLRHGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDAIAVVAKTIEWASF 158 (463) T ss_pred hheeeeccCccccccccccccccccCCCceEEEEEEeeeeehhhhhhhHHHhhcccccHHHHHHHHHHHHHHHHHHHHHh Confidence 3333 2333567899999999999999999999999998887777633 34556688888888889999999999999 Q ss_pred hccCc-Cc-------CCcccccccccccceee--cccchhHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCcee Q lcl|NC_011614. 148 LNQGN-NP-------FGKSIAQSIEKTNKVIK--GDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKER 217 (324) Q Consensus 148 ~g~g~-~~-------~~~~~~~~~~~~~~~~~--~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~g~~~ 217 (324) .|+.. ++ +..|+...+...+.... +.++.+.|..+-..+..++..++-++|+..+.+.|..---...+.+ T Consensus 159 yGds~l~~~~~~~gleFDGl~~lId~enviDarG~~Ls~~~ln~Aa~~i~~~fGt~TD~~lp~~vka~f~~~~l~~qrv~ 238 (463) T protein:vir:99 159 YGDASLTSEVEGEGLEFDGLAKLIDKNNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHADFVNSILGRQMQL 238 (463) T ss_pred hhhhccCCCcCccccchhhhhhhcCCCCeeecCCCcccHHHHhhhhhhhhcccCChhheecchHHHHHHHHHhcCceEEE Confidence 99753 11 22233333333333222 4567777777777777888888889999999999864332223333 Q ss_pred eccCCCceecccceEe--ec--CccCCCceEEEeecccEEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEE Q lcl|NC_011614. 218 IYDRNSDSLDGLPVVN--LK--SSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRA 293 (324) Q Consensus 218 ~~~~~~~~l~G~pv~~--~~--~~~~~~~~i~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~ 293 (324) .....+....|+|+-- +. ...+....++- +...+ .-+++. 
..+.|.--.+..-+ T Consensus 239 ~~~N~~~~~~G~~v~~f~s~~G~I~L~~s~~m~---~~~il--------~~~~~~-----------~p~ap~~~~~tatv 296 (463) T protein:vir:99 239 MQDNSGNVNTGYSVNGFYSSRGFIKLHGSTVME---NELIL--------DESLQP-----------LPNAPQPAKVTATV 296 (463) T ss_pred EcCCCCceeeeeeccceeeeeeeeeeCCceecC---Ccccc--------cchhhc-----------CCCCccCceeEEEE Confidence 3333333466776621 10 00011000000 00000 000000 00011111111111 Q ss_pred EEEeccEEecccceE----EEEeeccCCCCccccC Q lcl|NC_011614. 294 TMHVALHIADDKAFA----KLVPADAKPSSVPGEV 324 (324) Q Consensus 294 ~~r~d~~v~~~~a~~----~l~~~~~~~~~~~~~~ 324 (324) +.--....-+++..+ ++......+.+.|-++ T Consensus 297 ~~~~~~~~~~~~~~a~~~Y~vv~~s~~geS~pS~i 331 (463) T protein:vir:99 297 ETKQKGAFENEEDRAGLSYKVVVNSDDAQSAPSEE 331 (463) T ss_pred eeccCCCCCCcccccceEEEEEEECCCCCcccchh Confidence 111111111111111 1223334445555555 No 180 >protein:vir:95603 Length: 463 # NCBI annotation: ORF016 # Family: family:all:2450 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240903;genbank:gi:66394965;genbank:GeneID:5132544 Probab=97.79 E-value=5.3e-06 Score=49.52 Aligned_cols=300 Identities=13% Similarity=0.055 Sum_probs=150.9 Q ss_pred CchhhHHHHH----HHHHhhccchhhhhcccccc--ccC-CCcceechhhhHHHHHHHH--hhcchhhhceeeecCCCce Q lcl|NC_011614. 1 MEQTQKLKLN----LQHFASNNVKPQVFNPDNVM--MHE-KKDGTLLNDFTTPILQEVM--ENSKIMQLGKYEPMEGTEK 71 (324) Q Consensus 1 m~~~~~~~~~----~~~~~~~~~~~~~~~a~~~~--~~~-~~g~lip~~~~~~i~~~~~--~~s~l~~l~~~~~~~~~~~ 71 (324) |.+.++++.- +.+++.+..++ +.+.-.. .+. +++++=-+.+..+|..... +.-.+.+-..+.+..+... 
T Consensus 1 ~~~~~~~~~~~~~~~~~~~e~~~KS--~~tg~g~~p~~q~~~~AlR~EsL~~~i~~Lt~~~~~f~~~~~i~k~~a~STV~ 78 (463) T protein:vir:95 1 MTIEKNLSDVQQKYADQFQEDVVKS--FQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDISRRPAQSTVV 78 (463) T ss_pred CCcccccchHHHHHHhhhhHHHHHH--hhcCCccCCccccCcchhhhhhhhhhhheeeecccchhhhhhcCCchhhhhhh Confidence 7666665333 44555555433 2222111 112 2344434444444433222 2222333334444444333 Q ss_pred EEEE---EeCCcceeeecccccccccccceeeEEeeeeeEEEeehhHHHH-HhcChhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011614. 72 KFTF---WADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEF-LNYTYSQFFEEMKPMIAEAFYKKFDEAGI 147 (324) Q Consensus 72 ~ip~---~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~v~iS~el-l~~s~~~~~~~v~~~l~~ai~~~~d~a~l 147 (324) .+-. +.....+.++.|+...+.+++++.......+-++....+|.-+ +.++..+.+..+.+.-.-.++..+|.++| T Consensus 79 ~y~~~~~~G~~g~~~f~~E~g~~~~~d~~~~Rr~~~~K~l~~~~~VS~~~~l~n~~~d~~~~~~~dai~~ia~tiE~a~F 158 (463) T protein:vir:95 79 KYDQYLRHGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDAIAVVAKTIEWASF 158 (463) T ss_pred hheeeeccCccccccccccccccccCCCceEEEEEEeeeeehhhhhhhHHHhhcccccHHHHHHHHHHHHHHHHHHHHHh Confidence 3333 2333567899999999999999999999999998887777633 34556688888888889999999999999 Q ss_pred hccCc-Cc-------CCcccccccccccceee--cccchhHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCcee Q lcl|NC_011614. 148 LNQGN-NP-------FGKSIAQSIEKTNKVIK--GDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKER 217 (324) Q Consensus 148 ~g~g~-~~-------~~~~~~~~~~~~~~~~~--~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~g~~~ 217 (324) .|+.. ++ +..|+...+...+.... 
+.++.+.|..+-..+..++..++-++|+..+.+.|..---...+.+ T Consensus 159 yGds~l~~~~~~~gleFDGl~~lId~enviDarG~~Ls~~~ln~Aa~~i~~~fGt~TD~~lp~~vka~f~~~~l~~qrv~ 238 (463) T protein:vir:95 159 YGDASLTSEVEGEGLEFDGLAKLIDKNNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHADFVNSILGRQMQL 238 (463) T ss_pred hhhhccCCCcCccccchhhhhhhcCCCCeeecCCCcccHHHHhhhhhhhhcccCChhheecchHHHHHHHHHhcCceEEE Confidence 99753 11 22233333333333222 4567777777777777888888889999999999864332223333 Q ss_pred eccCCCceecccceEe--ec--CccCCCceEEEeecccEEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEE Q lcl|NC_011614. 218 IYDRNSDSLDGLPVVN--LK--SSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRA 293 (324) Q Consensus 218 ~~~~~~~~l~G~pv~~--~~--~~~~~~~~i~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~ 293 (324) .....+....|+|+-- +. ...+....++- +...+ .-+++. ..+.|.--.+..-+ T Consensus 239 ~~~N~~~~~~G~~v~~f~s~~G~I~L~~s~~m~---~~~il--------~~~~~~-----------~p~ap~~~~~tatv 296 (463) T protein:vir:95 239 MQDNSGNVNTGYSVNGFYSSRGFIKLHGSTVME---NELIL--------DESLQP-----------LPNAPQPAKVTATV 296 (463) T ss_pred EcCCCCceeeeeeccceeeeeeeeeeCCceecC---Ccccc--------cchhhc-----------CCCCccCceeEEEE Confidence 3333333466776621 10 00011000000 00000 000000 00011111111111 Q ss_pred EEEeccEEecccceE----EEEeeccCCCCccccC Q lcl|NC_011614. 
294 TMHVALHIADDKAFA----KLVPADAKPSSVPGEV 324 (324) Q Consensus 294 ~~r~d~~v~~~~a~~----~l~~~~~~~~~~~~~~ 324 (324) +.--....-+++..+ ++......+.+.|-++ T Consensus 297 ~~~~~~~~~~~~~~a~~~Y~vv~~s~~geS~pS~i 331 (463) T protein:vir:95 297 ETKQKGAFENEEDRAGLSYKVVVNSDDAQSAPSEE 331 (463) T ss_pred eeccCCCCCCcccccceEEEEEEECCCCCcccchh Confidence 111111111111111 1223334445555555 No 181 >protein:vir:3525 Length: 423 # NCBI annotation: major head protein # Family: family:all:1412 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050985;genbank:gi:9633571;genbank:GeneID:1262318 Probab=97.53 E-value=5e-05 Score=44.15 Aligned_cols=268 Identities=12% Similarity=0.055 Sum_probs=125.3 Q ss_pred ccCCCcceechhhhHHHHHHHHhhcchhhhceeee-----cC--CCceEEEEEeCCcceeeec-ccccccccccceeeEE Q lcl|NC_011614. 31 MHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEP-----ME--GTEKKFTFWADKPGAYWVG-EGQKIETSKATWVNAT 102 (324) Q Consensus 31 ~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~-----~~--~~~~~ip~~~~~~~a~~v~-Eg~~~~~~~~~~~~v~ 102 (324) +..+--..||+.++.+.++.+++..++.+++.+-. .. +..++||+........... .+..+..++..-.++. T Consensus 1 MAN~llT~iP~iia~~al~~l~~~lV~~~lV~r~y~ge~~~a~~GDTV~I~~p~~~~v~d~~~~~~~~~~~~~~~e~~v~ 80 (423) T protein:vir:35 1 MANNLESNISQIVLKKFLPGFMSDIVLCKTVDRQLLSGEINSNTGDSVSFKRPHQFKSERTETGDITGKDKNGLFSAKAT 80 (423) T ss_pred CccchhhhhHHHHHHHHHHHHHhhcccchhcccCCCcccccccCCCEEEEeeCCcceeecccCcCCCCccccccccceee Confidence 22222345899999999999999999999876521 11 4566777643221111111 1223333444444555 Q ss_pred eeeee-EEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhccCcCcCCcccccccccccceeecccchhHHHH Q lcl|NC_011614. 103 MRAFK-LGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIID 181 (324) Q Consensus 103 ~~~~k-~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~ 181 (324) ++..+ ....+.++++-...+..++++++...+ ++++..+|..++..--.+.. 
.......+....|+++.+ T Consensus 81 l~id~~k~~a~~v~d~e~~l~i~~~~~~l~~a~-~ala~~vd~~l~~~l~~~a~--------~~vgt~~t~~~~~~~i~~ 151 (423) T protein:vir:35 81 GKVGKYITVAVEWTQIEEALKLNQLDQILSPIH-ERMVTDLETELAHFMMNNGA--------LSLGSPNTAIKKWADVAQ 151 (423) T ss_pred EEeccceeccceeCHHHHHhhHHHHHHHHHHHH-HHHHHHHHHHHHHHHhhccc--------cccccccCCcchHHHHHH Confidence 54433 233456666555556678887777665 77899999888752111110 011111112245899999 Q ss_pred HHHHhhhhccCC--CEEEEcHHHHHHHHH----hhccC--CceeeccCC-CceecccceEeecCccCCCceEEEeecccE Q lcl|NC_011614. 182 LEALLEDDELEA--NAFISKTQNRSLLRK----IVDPE--TKERIYDRN-SDSLDGLPVVNLKSSNLKRGELITGDFDKL 252 (324) Q Consensus 182 ~~~~l~~~~~~~--~~~v~~~~~~~~L~~----l~d~~--g~~~~~~~~-~~~l~G~pv~~~~~~~~~~~~i~~gd~~~~ 252 (324) +..+|...+... -..+++|..+..|.+ +...+ +...+..+. .+++.|+.++.+++.+.... |.+... T Consensus 152 a~~~Ld~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~~alr~g~i~G~i~GFdv~~Snnvp~~T~----gt~~~~ 227 (423) T protein:vir:35 152 TASFIKDIGIKTGENYAIMDPWSAQRLADAQSGLHAADQLVRTAWENAQISGNFGGIRALMSNGLASRKQ----GDFDGA 227 (423) T ss_pred HHHHHHHhcCCcCCCEEEeCHHHHHHHhccccceeccccchhHHHhhccceeeecceEEEEcCCCccccc----cccccc Confidence 999998776553 346899999877752 21111 122233443 47999999988765442111 111111 Q ss_pred EEE------------EecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEeccc--------------c Q lcl|NC_011614. 253 IYG------------IPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDK--------------A 306 (324) Q Consensus 253 ~~~------------~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~--------------a 306 (324) ... ......+.+.-. +. ...+. +-..|.+ ...|+..++|. - T Consensus 228 ~~v~~a~~v~~~a~~~~~~~~~~~~~~-~~----~~~g~---l~~GD~~-----t~aGv~~v~~~t~~~~~~~~t~~~~~ 294 (423) T protein:vir:35 228 ITVKTAPNVDYLSVKDSYQFTVALTGA-TP----SKTGF---LKAGDQL-----KFTSTHWLNQQSKQTLYNGSTAMSFT 294 (423) T ss_pred eeeccccccccccccccccceeeeeee-ee----ccCCc---EEecceE-----EeeeeeeccccccceeecccCCceeE Confidence 000 000000000000 00 00000 0011211 22333333211 1 Q ss_pred eEEEEee-ccCCCCc-----cc-----cC Q lcl|NC_011614. 
307 FAKLVPA-DAKPSSV-----PG-----EV 324 (324) Q Consensus 307 ~~~l~~~-~~~~~~~-----~~-----~~ 324 (324) |+++... +.++..+ |+ +- T Consensus 295 ~~V~~~~~~~a~g~~~v~i~p~~~~~~~~ 323 (423) T protein:vir:35 295 ATVLEETNSTASGDVTVKLSGVPIYDEKN 323 (423) T ss_pred EEEeccccccccCceeEEccccccccCCC Confidence 2221111 0011111 11 11 No 182 >protein:vir:96792 Length: 315 # NCBI annotation: major capsid protein # Family: family:all:47 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224246;genbank:gi:62362381;genbank:GeneID:3345731 Probab=97.49 E-value=5.6e-05 Score=43.88 Aligned_cols=266 Identities=11% Similarity=0.037 Sum_probs=109.3 Q ss_pred ccccCCCc-ceechhhhHHHHHHHHhhcchhhhcee-------eecCCCceEEEEEe-CCcce-eeecccccccccccc- Q lcl|NC_011614. 29 VMMHEKKD-GTLLNDFTTPILQEVMENSKIMQLGKY-------EPMEGTEKKFTFWA-DKPGA-YWVGEGQKIETSKAT- 97 (324) Q Consensus 29 ~~~~~~~g-~lip~~~~~~i~~~~~~~s~l~~l~~~-------~~~~~~~~~ip~~~-~~~~a-~~v~Eg~~~~~~~~~- 97 (324) .++|-... .+--+.+....++.+.+....++.+.. .++.+.-...|-+. ++... .-+.....+...+.+ T Consensus 1 ~~~t~~sdl~vfn~~~~~a~~e~~~~~~~~Fnaas~Gai~l~~~~~~GDf~~~~ff~i~~~~~~rnv~~~~~~t~~kit~ 80 (315) T protein:vir:96 1 MATTVNSDLVIYNDTAQTAYLERNMDNLAVFNENSRAAIGLNSELIEGDLKLRSFYKVGGAIADRDVNSTATVAGTKIAA 80 (315) T ss_pred CceeeecceeeehhhhhhhHHhhhHHHHHHhhhhcCCcccccccccccccccccccccccchhhcccCCCccccceeccc Confidence 11111111 123455555666666655544443221 12222222222221 11100 111112222223322 Q ss_pred eeeEEeeeeeEEEeeh--hHHHHHh---cChhHHHHHHHHHHHHHHHHHHHHHHHhccCcCcCCcccccccccccceeec Q lcl|NC_011614. 98 WVNATMRAFKLGVILP--VTKEFLN---YTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKG 172 (324) Q Consensus 98 ~~~v~~~~~k~~~~v~--iS~ell~---~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~~~~~~~~~~~~~~~~~~~~ 172 (324) ...+.++. ..+.-+ .+.+.+. ..+..+...|...+..+..+.+=...+.+.-... .+ ........... 
T Consensus 81 ~~dvaVk~--~~~~~~~~~~~~~~a~~g~dp~~~~~~i~~~~~~~~l~~~l~~~l~~~~aai--~~---~t~~~~~~~~a 153 (315) T protein:vir:96 81 DEMVSVKV--PWKYGPYETTEEAFKRRARSPEEFSMLIGQDMADATMAGWIGYALNALQGAI--GS---NAGMNVSGELA 153 (315) T ss_pred ccceeEEE--eecCCchhccHHHHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhh--cc---ccccccccccc Confidence 22233322 222232 3333333 2333333434444444443333222222211100 00 00001112334 Q ss_pred ccchhHHHHHHHHhhhhccCCCEEEEcHHHHHHHHH-----hhccCCceeeccCCCceecccceEeecCccCCCceEEEe Q lcl|NC_011614. 173 DFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRK-----IVDPETKERIYDRNSDSLDGLPVVNLKSSNLKRGELITG 247 (324) Q Consensus 173 ~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~-----l~d~~g~~~~~~~~~~~l~G~pv~~~~~~~~~~~~i~~g 247 (324) .++...+.++.+++.+....-+.|+||..++..|.+ .....++-+.....++ .+|++|++.++.+. ..++.- T Consensus 154 ~~~~~~l~dA~~klGD~~~~l~~~vMHS~v~~~L~~q~L~~~~~~~~~~~~~~~~~~-~lGkrViVdD~~P~--~~~~gl 230 (315) T protein:vir:96 154 TEGKKVLTKGLRTMGDKASSIAIWVMDSTSYFDIVDEAIDNKLYEEAGVVVYGGTPG-TLGKPVLVTDQCPA--TKIFGL 230 (315) T ss_pred ccCHHHHHHHHHHhcccccCeeEEEEchHHHHHHHHhhhhhhcccccceeEecCcCc-ccccEEEEECCCCc--ceeeee Confidence 678889999999999888888999999999988865 1122222232333333 45999999765443 222211 Q ss_pred ecccEEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEecc-EEecccceEEEEeeccCCCCccccC Q lcl|NC_011614. 248 DFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVAL-HIADDKAFAKLVPADAKPSSVPGEV 324 (324) Q Consensus 248 d~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~-~v~~~~a~~~l~~~~~~~~~~~~~~ 324 (324) --..+.++....+ ........+ .+....| .|..| -.++|.+|..-+ +...++|-+|. 
T Consensus 231 ~~GAi~~~~~~~~--------~~~~~~~~g--------~e~l~~~--~r~e~tf~l~p~G~sw~~--~~~~sPt~aeL 288 (315) T protein:vir:96 231 VAGAVMITESQAP--------GMRSYQIDD--------QENLAIG--FRAEGTANVEVLGYKWKT--KTNVNPASATL 288 (315) T ss_pred ecceeeecCCCcc--------ccccccCCC--------cceeEEE--EeeeeEeeeeeeeEEeec--CCCcCCChHHh Confidence 0111112211111 000001110 1112222 22222 367888888742 23445666666 No 183 >protein:vir:105522 Length: 423 # NCBI annotation: phage major head protein # Family: family:all:1412 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516191;genbank:gi:89885994;genbank:GeneID:3964382 Probab=97.46 E-value=6.2e-05 Score=43.66 Aligned_cols=279 Identities=9% Similarity=-0.021 Sum_probs=120.0 Q ss_pred ccCCCcceechhhhHHHHHHHHhhcchhhhceeee-----c--CCCceEEEEEeCCcceeeeccc--ccccccccceee- Q lcl|NC_011614. 31 MHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEP-----M--EGTEKKFTFWADKPGAYWVGEG--QKIETSKATWVN- 100 (324) Q Consensus 31 ~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~-----~--~~~~~~ip~~~~~~~a~~v~Eg--~~~~~~~~~~~~- 100 (324) +..+-..++|+-++.++++.+++..++.+++.+-. . .+..++||+-... .+.-...+ .....++..-.+ T Consensus 1 MANsl~~l~p~iia~~al~~l~~~lV~~~lV~r~y~~ef~~ak~GDTV~I~~P~~~-~~~d~~~~~~t~~~~~~l~e~~v 79 (423) T protein:vir:10 1 MANNLDANVSQIVLKKFLPGFMSDLVLCKTVDRQLLAGEINSSTGDSVSFKRPHQF-KSERTMDGDITGKSKNSLISAKA 79 (423) T ss_pred CccccccccHHHHHHHHHHHHHhhcccchhhccCCCccccccccCCEEEEeeCCce-eeecccCcccCcccccccccceE Confidence 33444569999999999999999999999876522 1 2456677653311 11111111 111112222233 Q ss_pred -EEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhccCcCcCCcccccccccccceeecccchhHH Q lcl|NC_011614. 101 -ATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNI 179 (324) Q Consensus 101 -v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i 179 (324) +++.-+|.. .+.++++=...+..++++++... .++++..+|..+......... ...+. 
..+....|+++ T Consensus 80 ~l~id~~k~~-a~~v~d~E~~l~i~~~~~~l~~A-~~aLA~~vd~~ia~~~~~~~~-----~~vgt---~~t~~~a~~~~ 149 (423) T protein:vir:10 80 TGEVGNYITV-AVEYRQIEEALKLNQLDQILVPI-NERMVTDLETELALFMMKHGA-----LSLGS---PNTPIKKWSDV 149 (423) T ss_pred EEEecceeee-eeeeChHHHhcChhHHHHHHHHH-HHHHHHHHHHHHHHHhhhccc-----ccccc---cccccccHHHH Confidence 444444443 34565544446677887755544 689999999988642211111 01111 11112358899 Q ss_pred HHHHHHhhhhccCC--CEEEEcHHHHHHHHH----hhcc--CCceeeccC-CCceecccceEeecCccC-CCce--EEEe Q lcl|NC_011614. 180 IDLEALLEDDELEA--NAFISKTQNRSLLRK----IVDP--ETKERIYDR-NSDSLDGLPVVNLKSSNL-KRGE--LITG 247 (324) Q Consensus 180 ~~~~~~l~~~~~~~--~~~v~~~~~~~~L~~----l~d~--~g~~~~~~~-~~~~l~G~pv~~~~~~~~-~~~~--i~~g 247 (324) .++..+|...+... -..+++|..+..|.+ +... .+..-+..+ ..++++|+.++.++..+. .++. ..+. T Consensus 150 a~a~~~L~~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~~alr~~~i~G~~~GFdi~~Sn~vp~~T~g~~~ga~~ 229 (423) T protein:vir:10 150 AQTASFLKDLGINSGENYAVMDPWAAQRLADAQSGLHVSEQLVRTAWENAQISGNFGGIRALMSNGLASRTQGAFGGKLT 229 (423) T ss_pred HHHHHHHhhccCCcCCCEEEeCHHHHHHHhhhhhhhccccccchHHHHhcccceeecceEEEEecCCcccccccccceee Confidence 99999987766543 357899999888752 2221 122223334 447999999988765431 1111 0000 Q ss_pred ecccE-EEEE-e-----cceEEE---Eeecccccccc--cccc-cchhhhhc---------CcEEEEEEEEeccEEeccc Q lcl|NC_011614. 248 DFDKL-IYGI-P-----QLIEYK---IDETAQLSTVK--NEDG-TPVNLFEQ---------DMVALRATMHVALHIADDK 305 (324) Q Consensus 248 d~~~~-~~~~-~-----~~~~i~---~~~~~~~~~~~--~~~~-~~~~~f~~---------~~v~~r~~~r~d~~v~~~~ 305 (324) .-... +.+. . .+.+.. .+...++.... .-.| ..++-..+ ....|++.. |....-+. 
T Consensus 230 ~~~~~~vt~a~~~~~~~~~~~~~~~T~s~~g~l~~GD~~t~aGv~~v~~~tk~~l~~~~~~~~~~~~V~~--~~~~~a~~ 307 (423) T protein:vir:10 230 VKGTPEVNYDSVKDSYAFTATLTGATASKKGFLKVGDQLQFDDTHWLNQQSKQTLYNGASALSFTATVME--DANAHSSG 307 (423) T ss_pred eeeeeEEEecccccccccccceeeccceeceeEEecceEeecceeeecccccceeecccCCcceEEEEEe--cccccccC Confidence 00000 0000 0 000000 00000000000 0000 00000000 001111111 11111111 Q ss_pred ceEEEEeeccCC---------CCccccC Q lcl|NC_011614. 306 AFAKLVPADAKP---------SSVPGEV 324 (324) Q Consensus 306 a~~~l~~~~~~~---------~~~~~~~ 324 (324) ++. |+.- +.+ ...++-+ T Consensus 308 ~~t-v~i~-p~~~~~~~~~~~~~V~a~~ 333 (423) T protein:vir:10 308 DVT-VKIS-GVPIFDAGYPQYNAVDRLL 333 (423) T ss_pred ceE-EEec-cccccccCcccccceeccc Confidence 221 1110 000 0001111 No 184 >protein:vir:95875 Length: 401 # NCBI annotation: major coat protein # Family: family:all:10944 # MgeID: mge:1586 # MgeName: N4 # Cross-refs: genbank:acc:YP_950534;genbank:gi:119952248;genbank:GeneID:5075702 Probab=97.43 E-value=5e-05 Score=44.17 Aligned_cols=291 Identities=11% Similarity=0.096 Sum_probs=147.6 Q ss_pred hhhhccccc----cccCCCcceech-hhhHHHHHHHHhhcchhhhceeeecCC---CceEEEEEeCCcce-eeecccc-- Q lcl|NC_011614. 21 PQVFNPDNV----MMHEKKDGTLLN-DFTTPILQEVMENSKIMQLGKYEPMEG---TEKKFTFWADKPGA-YWVGEGQ-- 89 (324) Q Consensus 21 ~~~~~a~~~----~~~~~~g~lip~-~~~~~i~~~~~~~s~l~~l~~~~~~~~---~~~~ip~~~~~~~a-~~v~Eg~-- 89 (324) ...++++.. ++..+.+.-+-. .+..+.+..+.+.-.+.+++...|++. .++++.+...-+.+ ....||- T Consensus 1 ~~~~~a~~~~~~~s~~g~~~~~~~t~y~~~k~L~~Aa~~lv~~~fA~~~piPkn~GkTIk~r~y~pl~~~~~pl~eGv~a 80 (401) T protein:vir:95 1 MLNYNAPTDGQKSSIDGANSDQMQTFFWLKKAIITARKEQYFMPLASVTNMPKHYGKTIKVYEYVPLLDDRNINDQGIDA 80 (401) T ss_pred CCccCCCcccccccccccccceeeehhhHHHHHhhhhhhhhhhhcccccccccccCCeEEEEecccccccccchhcCCCc Confidence 122222211 111122222333 233556666666778888999998873 34444333332222 1122332 Q ss_pred ---cc-----------------------------cccccceeeEEeeeeeEEEeehhHHHHHh-cChhHHHHHH-HHHHH Q lcl|NC_011614. 
90 ---KI-----------------------------ETSKATWVNATMRAFKLGVILPVTKEFLN-YTYSQFFEEM-KPMIA 135 (324) Q Consensus 90 ---~~-----------------------------~~~~~~~~~v~~~~~k~~~~v~iS~ell~-~s~~~~~~~v-~~~l~ 135 (324) ++ ...+.+-.++..+.++++.+.++|+++.+ ++...+.+.+ .+.|. T Consensus 81 ~G~~~~~g~~y~~~rdv~~it~~m~~~t~~~~rvn~v~~~~~d~~g~l~qyG~~~e~Td~~~dt~~D~~l~~h~s~ell~ 160 (401) T protein:vir:95 81 SGATIVNGNLYGSSKDIGNITSKLPLLTENGGRVNRVGFTRIAREGSIHKFGFFYEFTQESIDFDSDDGLMEHLSRELMN 160 (401) T ss_pred ccccccCccccccccccceeecccccccccccccccccceeeeeeeeeeeccCccchhhhhhhhhcchHHHHHHHHHHhh Confidence 11 11233334567788999999999998877 3445666644 44555 Q ss_pred HHHHHHHH---HHHHhccCcCc-CCcccccccccccceeecccchhHHHHHHHHhhhhc------------------cCC Q lcl|NC_011614. 136 EAFYKKFD---EAGILNQGNNP-FGKSIAQSIEKTNKVIKGDFTQDNIIDLEALLEDDE------------------LEA 193 (324) Q Consensus 136 ~ai~~~~d---~a~l~g~g~~~-~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~------------------~~~ 193 (324) .+-...+| +.+|+.-+.-- ++.....+.......+.+..+++++..+...|..+. ... T Consensus 161 g~~~~t~d~i~~dll~ag~~viyAg~ats~At~~~~~~~~t~vt~~~l~rl~~~L~~nRapk~t~~i~~s~~~dTk~i~~ 240 (401) T protein:vir:95 161 GATQITEAVLQKDLLAAAGTVLYAGAATSDATITGEGSTPSVVSYKNLMRLDQILTENRTPTQTTIITGSRMIDTKVIGA 240 (401) T ss_pred hhhhhHHHHHHHHHHhhcCeeecCCccceeeeccccccccceechhHHHHHHHHHHhcccccchhhhhhhhccCcccccc Confidence 55444443 45664432211 111112222222333456678999988887776411 112 Q ss_pred CE-EEEcHHHHHHHHHhhccCCceeecc------------CCCceecccceEeecCcc------C--------------- Q lcl|NC_011614. 194 NA-FISKTQNRSLLRKIVDPETKERIYD------------RNSDSLDGLPVVNLKSSN------L--------------- 239 (324) Q Consensus 194 ~~-~v~~~~~~~~L~~l~d~~g~~~~~~------------~~~~~l~G~pv~~~~~~~------~--------------- 239 (324) +. -+||+.....|+.++|-.|.+-|.+ +..|.+.++.++.++..- . 
T Consensus 241 s~va~~h~~L~~di~a~~D~~~~~~fi~v~kYa~~~~i~~gEiG~i~~vR~i~~p~~~~w~~ag~~a~~~~~~y~~~~~~ 320 (401) T protein:vir:95 241 TRVMYVGSELVPELKAMKDLFGNKAFIETQHYADAGTIMNGEVGSIDKFRIIQVPEMLHWAGAGAQATGANPGYRTSMVS 320 (401) T ss_pred ceEEEEecCchhHHHHHHHhcCCCCceehhhcCCccccccccccccCceeEEecccceeecCCccccccccccccccccc Confidence 22 4789999999988888766665532 234556666666543200 0 Q ss_pred --CCc----eEEEeecccEEEEEecce-E--EEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEE Q lcl|NC_011614. 240 --KRG----ELITGDFDKLIYGIPQLI-E--YKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKL 310 (324) Q Consensus 240 --~~~----~i~~gd~~~~~~~~~~~~-~--i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l 310 (324) +.. .+++|.-....++..++- . +.+. .........+..-.+-|+..+.+. +++++.+++++-.+++ T Consensus 321 ~gg~~dVyp~lV~G~dAf~~~~l~g~g~~~~~~~i---vk~pG~~~ad~~DPlgQ~g~vgwK--~~~a~~vL~~e~m~~i 395 (401) T protein:vir:95 321 GQEHYDVYPMLVVGDDSFTSIGFQTDGKSLKFTVM---TKMPGKETADRNDPYGETGFSSIK--WYYGILVKRPERLALI 395 (401) T ss_pred CCCcceeeeeeEEccccceecccccCCccccceeE---eecCCcCCCCCCCcccceehhhhh--hhhhhheeccceeEEE Confidence 111 133444333333332221 1 1111 111111101111123455666664 5678899999999999 Q ss_pred EeeccC Q lcl|NC_011614. 311 VPADAK 316 (324) Q Consensus 311 ~~~~~~ 316 (324) +.++.- T Consensus 396 es~a~~ 401 (401) T protein:vir:95 396 KTVAPL 401 (401) T ss_pred EeecCC Confidence 887666 No 185 >protein:vir:5255 Length: 304 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852760;genbank:gi:31544035;uniprot:Q7Y5U0;genbank:GeneID:2753552 Probab=97.29 E-value=6.3e-05 Score=43.62 Aligned_cols=264 Identities=8% Similarity=-0.047 Sum_probs=132.0 Q ss_pred CCCcceechhh---hHHHHHHHHhhcchhhhcee---eecCCCceEEEEEeCCccee--eec-ccccccccccceeeEEe Q lcl|NC_011614. 
33 EKKDGTLLNDF---TTPILQEVMENSKIMQLGKY---EPMEGTEKKFTFWADKPGAY--WVG-EGQKIETSKATWVNATM 103 (324) Q Consensus 33 ~~~g~lip~~~---~~~i~~~~~~~s~l~~l~~~---~~~~~~~~~ip~~~~~~~a~--~v~-Eg~~~~~~~~~~~~v~~ 103 (324) -++.+++-.++ .+.|.+...+.-..++++.+ .+....++.+...+....|. |++ ....+|..+..+++-.. T Consensus 1 ~~~lafl~~qL~~id~~vye~~~~~~~~~~lipv~t~~~~~~~~~~~~~~d~~G~a~~~~i~~~a~dip~vd~~~~~~~~ 80 (304) T protein:vir:52 1 MSLLAYVKNGLTAVSKDIAETKYPEIVFPQFVYVDQQTAVGITEKLHYGADEHGSLDDGLITVGTSTLDQVEVGFTPTRS 80 (304) T ss_pred CchHHHHHHHHHHHhhhhhccccccchhhhhccccCCCCcccceEEEeeeeccCcccccccCCcCCccceeecccceeEE Confidence 12333333333 23444444444444445443 33333345566666555666 986 45668888888888888 Q ss_pred eeeeEEEeehhHHHHHhcCh---hHHHHHHHHHHHHHHHHHHHHHHHhccCc--CcCCccccccccccccee--ec---- Q lcl|NC_011614. 104 RAFKLGVILPVTKEFLNYTY---SQFFEEMKPMIAEAFYKKFDEAGILNQGN--NPFGKSIAQSIEKTNKVI--KG---- 172 (324) Q Consensus 104 ~~~k~~~~v~iS~ell~~s~---~~~~~~v~~~l~~ai~~~~d~a~l~g~g~--~~~~~~~~~~~~~~~~~~--~~---- 172 (324) +.+.++..+.+|-+=++.+. .++...-.+...+++...+|+..+.|+.. +..|......+....... ++ T Consensus 81 ~i~~~~~~~~y~~~El~~a~~~g~~l~~~ka~aa~~a~~~~~n~v~~~Gd~~~~g~~GllN~p~v~~~~~~~~~a~~~w~ 160 (304) T protein:vir:52 81 YIVPWAKSVTWTKPELEQGKLLGLALNTAKIMALNKNAQQTLQKVAFLGHAKDSRLTGLLNNKSVEVYAIKGAAQNTKVQ 160 (304) T ss_pred EEEEEeeeeeecHHHHHHHHHhCCCcHHHHHHHHHHHHHhhhceEEEEeeccccceEEEEeCCCcceeeecCCccCCccc Confidence 88888777777654344332 46777777778889999999999999643 222222222222111110 01 Q ss_pred ccc----hhHHHHHHHHhhhh--c-cCCCEEEEcHHHHHHHHHhhccCCc-eee--ccCCCceecccceEe-------ec Q lcl|NC_011614. 173 DFT----QDNIIDLEALLEDD--E-LEANAFISKTQNRSLLRKIVDPETK-ERI--YDRNSDSLDGLPVVN-------LK 235 (324) Q Consensus 173 ~~~----~~~i~~~~~~l~~~--~-~~~~~~v~~~~~~~~L~~l~d~~g~-~~~--~~~~~~~l~G~pv~~-------~~ 235 (324) +-| .+++.+++.++... + ..+..++++|+.+..|....-.+++ .++ ...+.....|.|+-+ .. 
T Consensus 161 ~~T~~eI~~di~~~~~~i~~~s~~~~~p~tl~Lpp~~~~~l~~~~~~~~~~Tvl~~l~~n~~~~~g~~l~I~~v~~~~~~ 240 (304) T protein:vir:52 161 AMDFDKAVAFFKEIFLKGMEKTKRIEAPNTFAIDSLDLAHLALVQRANTDTTALEFLTKHLSAAAGRQVAIKALPSNYGT 240 (304) T ss_pred cCCHHHHHHHHHHHHHHHHhccCceecCceEEeCHHHHHHHhhccCCCCCchHHHHHHHhcccccCCcceEEEecccccc Confidence 113 34455566665332 2 4456899999999998654333332 221 111111122333311 11 Q ss_pred CccCCCceEEEeecccEEEEEecceEEEEeecccccccccccccchhhhhcCcEE--EEEEEEeccE-EecccceEEEEe Q lcl|NC_011614. 236 SSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVA--LRATMHVALH-IADDKAFAKLVP 312 (324) Q Consensus 236 ~~~~~~~~i~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~--~r~~~r~d~~-v~~~~a~~~l~~ 312 (324) ....++..++.-+.+.-.+...-.+.+.+. ....++... +=.+.|+++. +..|.+++.+.- T Consensus 241 ~g~~g~~r~vvY~~d~~~~~~~vP~p~~~l----------------~~q~~~~~~~~vp~~~r~gGv~v~~P~a~~y~D~ 304 (304) T protein:vir:52 241 RVTDGKTRAMVYVNSKEHVIFDVPMSPTVL----------------DAQPKGLLAFESGLRMAFGGVTFMEPDSALYVDY 304 (304) T ss_pred cCCCCceEEEEEecChhheEEecCcccccc----------------chhhcCCceEEecceeeeeeEEEEccceeeeecC Confidence 222233334433333222211111111110 012333322 2245566554 555999999988 No 186 >protein:vir:105374 Length: 423 # NCBI annotation: gene 5 protein # Family: family:all:1412 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958181;genbank:gi:41057283;genbank:GeneID:2716621 Probab=97.00 E-value=0.00021 Score=40.69 Aligned_cols=275 Identities=10% Similarity=0.015 Sum_probs=121.6 Q ss_pred ccCCCcceechhhhHHHHHHHHhhcchhhhcee-ee----c--CCCceEEEEEeCCcceeee-cccccccccccceee-- Q lcl|NC_011614. 31 MHEKKDGTLLNDFTTPILQEVMENSKIMQLGKY-EP----M--EGTEKKFTFWADKPGAYWV-GEGQKIETSKATWVN-- 100 (324) Q Consensus 31 ~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~-~~----~--~~~~~~ip~~~~~~~a~~v-~Eg~~~~~~~~~~~~-- 100 (324) +..+--..+|+.++.++++.+++..++.+++.+ .. . .+.+++|++-......... 
.++..+...+..-.+ T Consensus 1 MaN~llT~~p~iia~~aL~~l~~~lV~~~lVnr~y~~ef~~~k~GDTV~I~~p~~~~~~d~~~~~~~~~~~~dl~e~~v~ 80 (423) T protein:vir:10 1 MPNNLDSNVSQIVLKKFLPGFMSDLVLAKTVDRQLLAGEINSSTGDSVSFKRPHQFSSLRTPTGDISGQNKNNLISGKAT 80 (423) T ss_pred CccchhhhhHHHHHHHHHHHHHhhcccchhhcccCCCcccccccCCEEEEeeCCceeeeccCCccccccccCccccceeE Confidence 322222347999999999999999999988765 21 1 2556677654322221222 133333333444344 Q ss_pred EEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhccCcCcCCcccccccccccceeecccchhHHH Q lcl|NC_011614. 101 ATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNII 180 (324) Q Consensus 101 v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~ 180 (324) +++.-+|...+ .++++=......++++++... .++++..+|..++.--..... ...+. . .+....|+++. T Consensus 81 l~id~~k~va~-~v~d~E~~~~i~~~~~~l~~A-~~aLA~~vd~~ia~~~~~~~~-----~~~gt-~--~t~~~a~~~i~ 150 (423) T protein:vir:10 81 GRVGNYITVAV-EYQQLEEAIKLNQLEEILAPV-RQRIVTDLETELAHFMMNNGA-----LSLGS-P--NTPITKWSDVA 150 (423) T ss_pred EEeeceeeeee-eechHHHhcChhhHHHHHHHH-HHHHHHHHHHHHHHHHhhccc-----ccccc-C--CcccchHHHHH Confidence 45555555444 455543445556787766555 588999999988753111110 00010 0 11123588999 Q ss_pred HHHHHhhhhccCC--CEEEEcHHHHHHHHH----hhc--cCCceeeccCC-CceecccceEeecCccCC-CceEEEe--- Q lcl|NC_011614. 181 DLEALLEDDELEA--NAFISKTQNRSLLRK----IVD--PETKERIYDRN-SDSLDGLPVVNLKSSNLK-RGELITG--- 247 (324) Q Consensus 181 ~~~~~l~~~~~~~--~~~v~~~~~~~~L~~----l~d--~~g~~~~~~~~-~~~l~G~pv~~~~~~~~~-~~~i~~g--- 247 (324) ++..+|...+... -..+++|..+..|.+ +.. ..+...+..++ .+++.|+.++.++..+.. ++. 
+.+ T Consensus 151 ~a~~~Ld~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~~alr~g~i~G~i~GFdv~~Snnip~~T~gt-~~~t~~ 229 (423) T protein:vir:10 151 QTASFLKDLGVNEGENYAVMDPWSAQRLADAQTGLHASDQLVRTAWENAQIPTNFGGIRALMSNGLASRTQGA-FGGTLT 229 (423) T ss_pred HHHHHHHhccCCcCCCEEEeChHHHHHHhccccceecccccchhhhhhccceeeecceEEEEeCCCccccccc-ccccee Confidence 9999998776543 357899999887753 111 11222344443 378999999886544321 110 000 Q ss_pred -ecccEE----EEEecceEEEEeecccccccccccccchhhhhcCcEEE---EEEEEeccEEe------cccceEEEEee Q lcl|NC_011614. 248 -DFDKLI----YGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVAL---RATMHVALHIA------DDKAFAKLVPA 313 (324) Q Consensus 248 -d~~~~~----~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~---r~~~r~d~~v~------~~~a~~~l~~~ 313 (324) .+...+ ........+.+.... . ...+.. -.-|.+.| ....+....++ ++.-|.++..+ T Consensus 230 ~~~~~~v~~~a~~~a~~~~~~~~~~~-~----~~~~~l---~~GD~~t~aGv~~v~~~tk~~~~~~~t~~~~~~~v~a~~ 301 (423) T protein:vir:10 230 VKTQPTVTYNAVKDSYQFTVTLTGAT-A----SVTGFL---KAGDQVKFTNTYWLQQQTKQALYNGATPISFTATVTADA 301 (423) T ss_pred eeecceeccccccccceeeeeeeecc-c----cccCce---eecceEEecceeeecccccccccccccCcceEEEEEeee Confidence 000000 000011111111000 0 000000 00011100 00001111111 11122222111 Q ss_pred cc-CCC--------------------CccccC Q lcl|NC_011614. 314 DA-KPS--------------------SVPGEV 324 (324) Q Consensus 314 ~~-~~~--------------------~~~~~~ 324 (324) .. +++ ...+.+ T Consensus 302 ~~~~~g~~tv~i~p~~i~~~~~~~~~~v~a~~ 333 (423) T protein:vir:10 302 NSDSGGDVTVTLSGVPIYDTTNPQYNSVSRQV 333 (423) T ss_pred eeccCCceeeeccCccccccCCcccccccccc Confidence 11 000 011111 No 187 >protein:vir:101557 Length: 336 # NCBI annotation: gp12 # Family: family:all:1653 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958117;genbank:gi:41057663;genbank:GeneID:2716814 Probab=96.98 E-value=3.3e-05 Score=45.16 Aligned_cols=293 Identities=11% Similarity=0.016 Sum_probs=142.2 Q ss_pred CchhhHHHHHHH---HHhhc--cch-h-hhhccc--cc--cccCCCcceechhhh----HHHHHHHHhhcchhhhceeee Q lcl|NC_011614. 
1 MEQTQKLKLNLQ---HFASN--NVK-P-QVFNPD--NV--MMHEKKDGTLLNDFT----TPILQEVMENSKIMQLGKYEP 65 (324) Q Consensus 1 m~~~~~~~~~~~---~~~~~--~~~-~-~~~~a~--~~--~~~~~~g~lip~~~~----~~i~~~~~~~s~l~~l~~~~~ 65 (324) |+.-+.++.-.+ .|... .+. + ..+..+ .. ..++.....||.-+. ..+++.+...-....+..+.. T Consensus 1 ~~~~~~~~~l~~~gi~~~~~~~~~~~~~~~~~~da~d~~~~~~~~~~~~i~~~l~~~i~p~~~~~~~~p~~a~~l~pv~t 80 (336) T protein:vir:10 1 MRDAQRIQNLARAGVILPRSVQNVSTPLTEYAMDAADLSPHLSSTGSSGIPNYLTTYVDPAVIDILVAPMKAAELVGESK 80 (336) T ss_pred CchHHHHHHHhhcCeeecchhhhhhhhHHHhhhhhhhccCccccCCCchhHHHHHhhcccceeeehhhhhhhhhhccccc Confidence 776665543322 11100 000 0 011111 00 011112223443333 334455555555555666555 Q ss_pred cCCC---ceEEEEEeCCcceeeecccccccccccceeeEEeeeeeEEEeehhH-HHHHhcC--hhHHHHHHHHHHHHHHH Q lcl|NC_011614. 66 MEGT---EKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVT-KEFLNYT--YSQFFEEMKPMIAEAFY 139 (324) Q Consensus 66 ~~~~---~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~v~iS-~ell~~s--~~~~~~~v~~~l~~ai~ 139 (324) .+.. ...+++.+....|.+++.+...|..+...+..+-+.+.++..+.++ .|+.... ..++...-....++++. T Consensus 81 ~g~W~~~~~~~~~~e~~G~a~~ygd~~D~P~~d~~~~~~~~~v~~~~~g~~yg~~El~~A~~~g~~l~~~Ka~aA~~ale 160 (336) T protein:vir:10 81 KGDWTTLVAAFITAEPTTKVATYGDYSSDGDSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLA 160 (336) T ss_pred cCCccceeEEEeeeeceeeEEEeeccCCCceeecccceeeeeEEEEEeeeeeCHHHHHHHHHhCCCcHHHHHHHHHHHHH Confidence 4432 3456666666778899999999988877777777778888888887 4544433 35788888888899999 Q ss_pred HHHHHHHHhccCcCc-CCccccccccccc---c----eeecccchhHHHHHHHHhhhhc------cCCCEEEEcHHHHHH Q lcl|NC_011614. 140 KKFDEAGILNQGNNP-FGKSIAQSIEKTN---K----VIKGDFTQDNIIDLEALLEDDE------LEANAFISKTQNRSL 205 (324) Q Consensus 140 ~~~d~a~l~g~g~~~-~~~~~~~~~~~~~---~----~~~~~~~~~~i~~~~~~l~~~~------~~~~~~v~~~~~~~~ 205 (324) +.+++-.+.|+.... .|......+.... + ..+...-++|+..++.+|...- ..+..++|+|..+.. 
T Consensus 161 ~~~N~i~~~Gd~~~~~yGllN~P~l~a~~t~~t~~~~~~t~eei~~Di~~~~~~l~~qs~G~i~~~~~~tL~LP~~~~~~ 240 (336) T protein:vir:10 161 KFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSD 240 (336) T ss_pred HhhCcEEEEeccccceEEEEeCCCCccccccCCCcccccCHHHHHHHHHHHHHHHHHhcCCeecccCcceEEecHHHHHh Confidence 999998888875431 1111111121110 0 0111224677888888886532 246789999999888 Q ss_pred HHHhhccCCceeec--cCCC--ceecccceEeecCccCCCceEEEeecccEEEEEecc---eEEEEeecccccccccccc Q lcl|NC_011614. 206 LRKIVDPETKERIY--DRNS--DSLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQL---IEYKIDETAQLSTVKNEDG 278 (324) Q Consensus 206 L~~l~d~~g~~~~~--~~~~--~~l~G~pv~~~~~~~~~~~~i~~gd~~~~~~~~~~~---~~i~~~~~~~~~~~~~~~~ 278 (324) |.+. +..|..++. ..+. -++...|=. ....+.... +++-...+ ..+.+......+... T Consensus 241 Ls~~-n~~g~Tvl~~lk~n~Pnl~i~t~pEl---~~a~G~~~~-------l~~~~~~~~~t~~~~~p~~~~~l~vq---- 305 (336) T protein:vir:10 241 LSKT-NQYGLAAAAKLKDIFPKLEFVTIPEY---DTASGRLVQ-------LWAPRVEGKDTATCGFTEKMRAHSIE---- 305 (336) T ss_pred ccCC-CccCccHHHHHHHhcCccEEEEcccc---ccCCCceEE-------EEEEecCCCcceeeecchhhhcccee---- Confidence 8532 222322211 1110 012122211 001111111 11111111 111111111111000 Q ss_pred cchhhhhcCcEEEEEEEEecc-EEecccceEEEEee Q lcl|NC_011614. 279 TPVNLFEQDMVALRATMHVAL-HIADDKAFAKLVPA 313 (324) Q Consensus 279 ~~~~~f~~~~v~~r~~~r~d~-~v~~~~a~~~l~~~ 313 (324) ...-....-+..|+++ .+.+|.||+++++. T Consensus 306 -----~~~~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 336 (336) T protein:vir:10 306 -----RYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) T ss_pred -----ecCceeEeccccceeeeeeeccchheeeecC Confidence 0011122333455554 45559999999998 No 188 >protein:vir:3643 Length: 336 # NCBI annotation: gp12 # Family: family:all:1653 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705638;genbank:gi:23752323;genbank:GeneID:955719 Probab=96.92 E-value=4e-05 Score=44.67 Aligned_cols=292 Identities=10% Similarity=-0.005 Sum_probs=142.3 Q ss_pred CchhhHHHHHHH---HHhhc------cchhhhhcccccc--ccCCCcceechhhhH----HHHHHHHhhcchhhhceeee Q lcl|NC_011614. 
1 MEQTQKLKLNLQ---HFASN------NVKPQVFNPDNVM--MHEKKDGTLLNDFTT----PILQEVMENSKIMQLGKYEP 65 (324) Q Consensus 1 m~~~~~~~~~~~---~~~~~------~~~~~~~~a~~~~--~~~~~g~lip~~~~~----~i~~~~~~~s~l~~l~~~~~ 65 (324) |+.-+.++.-.+ .|... +...-.+-++... ..+....-||..+.+ .+++.+...-....+..+.. T Consensus 1 ~~~~~~~~~l~~~gi~~~~~~~~~~~~~~~~~~da~d~~~~~~~~~~~~~~~~l~~~i~p~~~~~~~~~~~~~~l~pv~t 80 (336) T protein:vir:36 1 MRDAQRIQNLARAGVILPRSVQNVSTPLTEYAMDAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELVGESK 80 (336) T ss_pred CchHHHHHHHhhcCeeecchhhhhhhHHHHhhhhhhhccCccccCCCcchHHHHHHhhccceEeeecchhhhhhhccccc Confidence 776666543322 11110 0010111111111 000111224444433 33444455555555555555 Q ss_pred cCCC---ceEEEEEeCCcceeeecccccccccccceeeEEeeeeeEEEeehhH-HHHHhcC--hhHHHHHHHHHHHHHHH Q lcl|NC_011614. 66 MEGT---EKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVT-KEFLNYT--YSQFFEEMKPMIAEAFY 139 (324) Q Consensus 66 ~~~~---~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~v~iS-~ell~~s--~~~~~~~v~~~l~~ai~ 139 (324) .+.. ...+++.+....|.+++.+...|..+...+..+-+.+.++..+.++ .|+.... ..++.+.-....++++. T Consensus 81 ~g~W~~~~~~~~~~e~~G~a~~ygd~~D~P~~d~~~~~~~~~v~~~~~g~~yg~~E~~~Aa~~~~~l~~~Ka~aA~~ale 160 (336) T protein:vir:36 81 KGDWTTLVAAFITAEPTTKVATYGDYSSDGDSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLA 160 (336) T ss_pred cCCccceeEEEeeeeceeeEEEeeccCCCceeecccceeeeeEEEEEeeeeeCHHHHHHHHHhCCCcHHHHHHHHHHHHH Confidence 4432 3456666666778899999999988877777777778888888887 4555533 35777888888899999 Q ss_pred HHHHHHHHhccCcCcCCccccc--ccccccc-------eeecccchhHHHHHHHHhhhhc------cCCCEEEEcHHHHH Q lcl|NC_011614. 140 KKFDEAGILNQGNNPFGKSIAQ--SIEKTNK-------VIKGDFTQDNIIDLEALLEDDE------LEANAFISKTQNRS 204 (324) Q Consensus 140 ~~~d~a~l~g~g~~~~~~~~~~--~~~~~~~-------~~~~~~~~~~i~~~~~~l~~~~------~~~~~~v~~~~~~~ 204 (324) +.+++-.+.|+.... ..|+++ .+..... ..+...-++|+.+++.++...- ..+..++|+|..+. 
T Consensus 161 ~~~N~i~~~Gd~~~~-~yGllNdP~l~a~~t~~t~~~~~~t~~ei~~Di~~~~~~l~~qt~G~i~~~~~~tL~LP~~~~~ 239 (336) T protein:vir:36 161 KFLNGSYLFGVAGLE-NYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMS 239 (336) T ss_pred HhhCcEEEEeccccc-eEEEEecCCCccccccCCCcccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccEEEechHHHH Confidence 999988888875431 112222 1211100 0111224678888888886532 24678999999988 Q ss_pred HHHHhhccCCceeec--cCCC--ceecccceEeecCccCCCceEEEeecccEEEEEecc---eEEEEeeccccccccccc Q lcl|NC_011614. 205 LLRKIVDPETKERIY--DRNS--DSLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQL---IEYKIDETAQLSTVKNED 277 (324) Q Consensus 205 ~L~~l~d~~g~~~~~--~~~~--~~l~G~pv~~~~~~~~~~~~i~~gd~~~~~~~~~~~---~~i~~~~~~~~~~~~~~~ 277 (324) .|.+. +..|..++. ..+. -++...|=. ....+.... +++-...+ ..+.+......+... T Consensus 240 ~Ls~~-n~~g~Tvl~~lk~n~Pnl~i~t~pEl---~~a~g~~~~-------l~~~~~~~~~t~~~~~p~~~~~l~vq--- 305 (336) T protein:vir:36 240 DLSKT-NQYGLAAAAKLKDIFPKLEFVTIPEY---DTASGRLVQ-------LWAPRVEGKDTATCGFTEKMRAHSIE--- 305 (336) T ss_pred hccCC-CccCccHHHHHHHhcCccEEEEcccc---ccCCCceEE-------EEEEecCCCcceeeecchhhhcccee--- Confidence 88532 222322211 1110 012222211 001111111 11111111 111111111111000 Q ss_pred ccchhhhhcCcEEEEEEEEecc-EEecccceEEEEee Q lcl|NC_011614. 278 GTPVNLFEQDMVALRATMHVAL-HIADDKAFAKLVPA 313 (324) Q Consensus 278 ~~~~~~f~~~~v~~r~~~r~d~-~v~~~~a~~~l~~~ 313 (324) ...-....-+..|+++ .+.+|.||+++++. 
T Consensus 306 ------~~~~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 336 (336) T protein:vir:36 306 ------RYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) T ss_pred ------ecCceeEeccccceeeeeeeccchheeeecC Confidence 0011122333455554 45559999999998 No 189 >protein:vir:94070 Length: 339 # NCBI annotation: putative structural protein # Family: family:all:1653 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453625;genbank:gi:84662661;genbank:GeneID:5142580 Probab=96.88 E-value=0.00022 Score=40.67 Aligned_cols=292 Identities=8% Similarity=-0.018 Sum_probs=144.1 Q ss_pred CchhhHHHHH--HHH----H-------hhccchhhhhcccc----ccccCCCcc--eechhhhHHHHHHHHhhcchhhhc Q lcl|NC_011614. 1 MEQTQKLKLN--LQH----F-------ASNNVKPQVFNPDN----VMMHEKKDG--TLLNDFTTPILQEVMENSKIMQLG 61 (324) Q Consensus 1 m~~~~~~~~~--~~~----~-------~~~~~~~~~~~a~~----~~~~~~~g~--lip~~~~~~i~~~~~~~s~l~~l~ 61 (324) |.-+..-+.. +++ | .........+.+.. .+++...|. ..++.+.+.|++...+.-..++++ T Consensus 1 ~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~a~d~~~~~~~~~~~~~~~i~a~~~~~i~~~vy~~~~~~~~~~~l~ 80 (339) T protein:vir:94 1 MSINNDRTDIKQLEKVGIIFDGYSPKSISSEVSAYAMDAVNLTPTLQTTANAGIPAWMTTFVDRRVIDIQLAPMAAAKIF 80 (339) T ss_pred CceechHHHHHHHHhhceeeccchhhhcchhhHhhhccccccccccccccccchhhhhhhhhchhheeecccccchhhhc Confidence 4433322111 111 1 10111111111111 111111111 133344467777777887888888 Q ss_pred eeeecCC---CceEEEEEeCCcceeeecccccccccc--cceeeEEeeeeeEEEeehhHHHHHhcC--hhHHHHHHHHHH Q lcl|NC_011614. 62 KYEPMEG---TEKKFTFWADKPGAYWVGEGQKIETSK--ATWVNATMRAFKLGVILPVTKEFLNYT--YSQFFEEMKPMI 134 (324) Q Consensus 62 ~~~~~~~---~~~~ip~~~~~~~a~~v~Eg~~~~~~~--~~~~~v~~~~~k~~~~v~iS~ell~~s--~~~~~~~v~~~l 134 (324) .+.+.+. ..+.+++.+....|.+++.++..|..+ .++.+.++....++-.+.+ .|+...+ ..++.+.-.... 
T Consensus 81 pv~t~g~w~~~t~~y~~~e~~G~a~~ygd~ad~Pl~~~~v~~~~~~v~~~~~g~~y~~-~E~~~A~~~g~~l~~~Ka~aA 159 (339) T protein:vir:94 81 PEVKKGDWTTTYGVFIIAEPVGQVATYSDWSANGMSKANVNFESRQNYRYQTWTEYGD-LEMATYGEAGIDYVARQEISA 159 (339) T ss_pred ccccCCCCcccEEEEeeeecccceEEcccccCCCcccccceeeEEeEEEEEEEEeecH-HHHHHHHhhCCChHHHHHHHH Confidence 8777653 356788888888999999988887665 5577777777777766663 5554433 367888888899 Q ss_pred HHHHHHHHHHHHHhccCcCcCCccccc--cccc-cccee-----ecccchhHHHHHHHHhhhhc------cCCCEEEEcH Q lcl|NC_011614. 135 AEAFYKKFDEAGILNQGNNPFGKSIAQ--SIEK-TNKVI-----KGDFTQDNIIDLEALLEDDE------LEANAFISKT 200 (324) Q Consensus 135 ~~ai~~~~d~a~l~g~g~~~~~~~~~~--~~~~-~~~~~-----~~~~~~~~i~~~~~~l~~~~------~~~~~~v~~~ 200 (324) ++++.+.+|+..+.|+.... ..|+++ .+.. ++... +..--++|+.+++.++...- ..+..++++| T Consensus 160 ~~al~~~~N~i~~~Gd~~~~-~~GLlN~P~l~~~v~~s~~Wa~kT~~eI~~Di~~~~~~l~~~s~g~~~~~~~~~L~LP~ 238 (339) T protein:vir:94 160 SLVMAKFANSSYLLGVAGIA-NYGLMNDPSLPAPVAATVNWATAAPEDIANDVVAMVGRLISQSGGLITGQERMVMALAP 238 (339) T ss_pred HHHHHHhhceEEeeeecccc-eEEEEeCCCccccccCCCCcccCCHHHHHHHHHHHHHHHHHhcCCeeeeccCcEEEecH Confidence 99999999999999864321 122222 1211 11100 11112567777877774432 1245799999 Q ss_pred HHHHHHHHhhccCCceeec--cCCCceecccceEeecC-ccCCCceEEEeecccEEEEEe---cceEEEEeecccccccc Q lcl|NC_011614. 201 QNRSLLRKIVDPETKERIY--DRNSDSLDGLPVVNLKS-SNLKRGELITGDFDKLIYGIP---QLIEYKIDETAQLSTVK 274 (324) Q Consensus 201 ~~~~~L~~l~d~~g~~~~~--~~~~~~l~G~pv~~~~~-~~~~~~~i~~gd~~~~~~~~~---~~~~i~~~~~~~~~~~~ 274 (324) ..+..|... ...|..++. ..+ +-++-++..+. ...+. +...+++... .-..+.+......+... 
T Consensus 239 ~~~~~L~~~-n~~~~Tvl~~lk~n---~pnl~i~~~~el~~a~g------~~~~~~~~~~~~~~~~~~~~p~~~~~lpvq 308 (339) T protein:vir:94 239 SALNNVNRT-NNFGLSAGAKIAQT---YPNIQFVAVPEFDTASG------RLVQLWVPEVNGQPTGEVAFAEKLRSHSIE 308 (339) T ss_pred HHHHhcccC-CcCCccHHHHHHHh---cCCcEEEEccccccCCC------ceEEEEEEeccCCcceEEEcchhhhccccE Confidence 999988643 232322211 111 11111211110 01111 1111111111 11122222111111000 Q ss_pred cccccchhhhhcCcEEEEEEEE-eccEEecccceEEEEee Q lcl|NC_011614. 275 NEDGTPVNLFEQDMVALRATMH-VALHIADDKAFAKLVPA 313 (324) Q Consensus 275 ~~~~~~~~~f~~~~v~~r~~~r-~d~~v~~~~a~~~l~~~ 313 (324) ...-....-+..| .|..+.+|.||+++++. T Consensus 309 ---------~~~~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 339 (339) T protein:vir:94 309 ---------RYSTTTRQKHSGATFGAVIYQPWAVTQELGV 339 (339) T ss_pred ---------EcCceEEecceeeeeeEEEEccceeeeeecC Confidence 0011222334455 45556669999999998 No 190 >protein:vir:107732 Length: 379 # NCBI annotation: gp23 # Family: family:all:1653 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024871;genbank:gi:48697513;genbank:GeneID:2948349 Probab=96.82 E-value=0.00017 Score=41.24 Aligned_cols=294 Identities=10% Similarity=-0.034 Sum_probs=131.9 Q ss_pred Cchhh-HHHHHHHHHhhc----------cchhhhhccc--cccc-cCCCcce-------ech---hhhHHHHHHHHhhcc Q lcl|NC_011614. 1 MEQTQ-KLKLNLQHFASN----------NVKPQVFNPD--NVMM-HEKKDGT-------LLN---DFTTPILQEVMENSK 56 (324) Q Consensus 1 m~~~~-~~~~~~~~~~~~----------~~~~~~~~a~--~~~~-~~~~g~l-------ip~---~~~~~i~~~~~~~s~ 56 (324) |-..+ +.. ..++..+- ......+..| .... .+....+ +|. .+...+++-+..-.. 
T Consensus 21 ~~~~~~~~~-~~~~l~~~gi~~~~~~~~~~~~~~~amd~~~~~~~~~~~~~l~~~~~~g~~~~l~~~~p~~i~~~tap~~ 99 (379) T protein:vir:10 21 MDSADVTLD-NLKHLESYGIHLNGRKNKLFELMQFAMDSNDIGPIPTPLSPLSPVSIPGLIQFLQNWLPGHVRILTAVRE 99 (379) T ss_pred hccccccHH-HHHHHHhcCccccchhhhhhhhhhhhhccccccccccccCccccccccchHHHHHhhcchHHHHHhhhhh Confidence 22222 111 11221111 1111111111 1110 0000111 122 223455666655555 Q ss_pred hhhhceeeecCCC---ceEEEEEeCCcceeeecccccccccccceeeEEeeeeeEEEeehhHH-HHHhc--ChhHHHHHH Q lcl|NC_011614. 57 IMQLGKYEPMEGT---EKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTK-EFLNY--TYSQFFEEM 130 (324) Q Consensus 57 l~~l~~~~~~~~~---~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~v~iS~-ell~~--s~~~~~~~v 130 (324) +..+..+.+.+.. ...+++.+....|.+++.++..|..+...+...-..+.++..+.++. |+... ...++...- T Consensus 100 a~~l~pv~t~g~W~~~~~~~~v~e~~G~A~~ygd~~d~pl~d~~~~~~~r~v~~~~~g~~yg~~El~~Aa~~g~~l~~~K 179 (379) T protein:vir:10 100 ADEFLGLSTVGQWDDEQIVQRVLEGLGTAQPYTDGGNMALMSWTPTFETRTVVRFEAGLQVAPLEEARSSRVQVSSADEK 179 (379) T ss_pred hhhhcccccCCCceeeeEEEeeeeeeeeeEEeccccCCCeeeeeeeeeeeeeEEEEEEEeecHHHHHHHHHhCCChHHHH Confidence 5666665554432 34566777777888999888888777666655555566666666654 33332 236788889 Q ss_pred HHHHHHHHHHHHHHHHHhccCcCcCC-ccccc--ccccccceee------------cccchhHHHHHHHHhhhh--c--- Q lcl|NC_011614. 131 KPMIAEAFYKKFDEAGILNQGNNPFG-KSIAQ--SIEKTNKVIK------------GDFTQDNIIDLEALLEDD--E--- 190 (324) Q Consensus 131 ~~~l~~ai~~~~d~a~l~g~g~~~~~-~~~~~--~~~~~~~~~~------------~~~~~~~i~~~~~~l~~~--~--- 190 (324) ....++++.+.+|+-.|.|.+..... .|+++ .+......++ ..--++|+..++.++... 
+ T Consensus 180 a~aA~~ale~~~N~i~f~G~~d~~~~~yGllNdP~l~a~~t~atg~~~~t~Wa~kT~~eI~~Di~~~~~~l~~qs~g~~~ 259 (379) T protein:vir:10 180 RAMVGEALEVQRNRVAFYGYNDGSGRTFGFLNDPNLPAYVAVPNGAGGSPLWAQKTTLEIIADLRNGLTALQVQSMGRIK 259 (379) T ss_pred HHHHHHHHHHhhceEEEEeecCCCcceEEEEeCCCCcccccccCCcccccccccCCHHHHHHHHHHHHHHHHHhhCCeec Confidence 99999999999999999995322111 12222 2211111110 111246677777765432 2 Q ss_pred --cCCCEEEEcHHHHHHHHHhhccCCceee--ccCCC--ceecccceEeecCccCCCceEEEeecccEEEEEecceEE-- Q lcl|NC_011614. 191 --LEANAFISKTQNRSLLRKIVDPETKERI--YDRNS--DSLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEY-- 262 (324) Q Consensus 191 --~~~~~~v~~~~~~~~L~~l~d~~g~~~~--~~~~~--~~l~G~pv~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~i-- 262 (324) ..+..++++|..+..|... +..|..++ ...+. -++...|=.. .+...++...++.+- ..+.+. T Consensus 260 ~~~~~~tL~LP~~~~~~L~~~-n~~g~Tvl~~lk~n~Pnl~i~t~pEL~-~aggg~~~~~~~~~~-------~~~~~t~~ 330 (379) T protein:vir:10 260 SNKTPITIGIPNAYENYITTP-TELGYSVAQYMRESYPNVTFVSAPELN-DANGGSSAIYYYADA-------VENNGTDD 330 (379) T ss_pred ccccceeEEecHHHHHhhccc-cccCccHHHHHHHhcCCcEEEEccccc-ccCCCccEEEEEeec-------cCCCccCC Confidence 2234789999999998643 22222221 11110 1122222111 111111111222211 111100 Q ss_pred ------EEeecccccccccccccchhhhhcCcEEEEEEEEe-ccEEecccceEEEEee Q lcl|NC_011614. 263 ------KIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHV-ALHIADDKAFAKLVPA 313 (324) Q Consensus 263 ------~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~-d~~v~~~~a~~~l~~~ 313 (324) .+......+... 
-..-....-...|+ |..+.+|.||+++.++ T Consensus 331 ~~~~~~~~p~k~~~l~ve---------~~~~~~~~~~~~rt~Gv~ir~P~Ai~~~~G~ 379 (379) T protein:vir:10 331 GRTWLQVVPTKMFTLGVE---------KKIKGYAEGYTNATAGAMLKRPFATYRQTGA 379 (379) T ss_pred cceEEEecchhhhhccce---------ecCceeEeccccceeeeeeecchhhheecCC Confidence 000000000000 00011112233444 5555669999999998 No 191 >protein:vir:174 Length: 423 # NCBI annotation: capsid protein # Family: family:all:1412 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112079;genbank:gi:13559869;genbank:GeneID:920999 Probab=96.71 E-value=0.0004 Score=39.24 Aligned_cols=272 Identities=10% Similarity=0.018 Sum_probs=123.4 Q ss_pred ccCCCcceechhhhHHHHHHHHhhcchhhhceeee-----c--CCCceEEEEEeCCcceeee--cccccccccccceee- Q lcl|NC_011614. 31 MHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEP-----M--EGTEKKFTFWADKPGAYWV--GEGQKIETSKATWVN- 100 (324) Q Consensus 31 ~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~-----~--~~~~~~ip~~~~~~~a~~v--~Eg~~~~~~~~~~~~- 100 (324) +..+--..+|+.++.+.++.+++..++.+++.+-. . .+.+++||+-. ...+.-. ..+..+..++..-.+ T Consensus 1 MaN~llT~ip~iia~~al~~l~~~lV~~~lVnr~y~~e~~~~k~GDTV~I~~p~-~~~~~~~~~~~~~~~~~~~l~e~~v 79 (423) T protein:vir:17 1 MPNNLDSNVSQIVLKKFLPGFMSDLVLAKTVDRQLLAGEINSSTGDSVSFKRPH-QFSSLRTPTGDISGQNKNNLISGKA 79 (423) T ss_pred CccchhhhhHHHHHHHHHHHHHhhcccchhhcccCCcchhhcccCCEEEEeeCC-cceeecccCcccCCcccCcccccee Confidence 22222345799999999999999999998876422 1 25566777532 2222111 122223333333333 Q ss_pred -EEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhccCcCcCCcccccccccccceeecccchhHH Q lcl|NC_011614. 101 -ATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNI 179 (324) Q Consensus 101 -v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i 179 (324) +++.-+|... +.++++=......++++++... .++++..+|..++.--..... ...+. 
..+....|+++ T Consensus 80 ~l~id~~k~va-~~v~d~E~~~~i~~~~~~l~~A-~~aLA~~vd~~ia~~~~~~a~-----~~~gt---~~t~~~a~~~i 149 (423) T protein:vir:17 80 TGRVGNYITVA-VEYQQLEEAIKLNQLEEILAPV-RQRIVTDLETELAHFMMNNGA-----LSLGS---PNTPITKWSDV 149 (423) T ss_pred EEEeeceeeee-eeecHHHHhcChhHHHHHHHHH-HHHHHHHHHHHHHHHHhhccc-----ccccc---CCcccccHHHH Confidence 4444455444 4455544445666787766555 588999999887743111100 00111 11112358999 Q ss_pred HHHHHHhhhhccCC--CEEEEcHHHHHHHHH----hhc--cCCceeeccCC-CceecccceEeecCccC-CCceEEEee- Q lcl|NC_011614. 180 IDLEALLEDDELEA--NAFISKTQNRSLLRK----IVD--PETKERIYDRN-SDSLDGLPVVNLKSSNL-KRGELITGD- 248 (324) Q Consensus 180 ~~~~~~l~~~~~~~--~~~v~~~~~~~~L~~----l~d--~~g~~~~~~~~-~~~l~G~pv~~~~~~~~-~~~~i~~gd- 248 (324) .++..+|...+... -..+++|..+..|.+ +.. ..+...+..+. .+++.|+.++.++..+. .++. +.+- T Consensus 150 ~~a~~~Ld~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~~alr~g~i~G~i~GFdvy~Snnip~~T~gt-~~~t~ 228 (423) T protein:vir:17 150 AQTASFLKDLGVNEGENYAVMDPWSAQRLADAQTGLHASDQLVRTAWENAQIPTNFGGIRALMSNGLASRTQGA-FGGTL 228 (423) T ss_pred HHHHHHHHhccCCcCCCEEEeChHHHHHHhccccceecccccchHHHhhccceeeecceEEEEeCCCccccccc-eecee Confidence 99999998776543 357899999887753 111 11223344443 37899999988764442 1111 0000 Q ss_pred ---cccEE-EEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEe--------------cccceEEE Q lcl|NC_011614. 249 ---FDKLI-YGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIA--------------DDKAFAKL 310 (324) Q Consensus 249 ---~~~~~-~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~--------------~~~a~~~l 310 (324) ....+ .+...+.......-.. .+....+. +-.-|.+ ...|+..+ ++.-|.+. T Consensus 229 ~~~~~~~v~~~a~~~~~~~~~~~~~--~~~~~~g~---l~~GD~~-----t~aGv~~v~~~tk~v~~~~~t~~~~~~~v~ 298 (423) T protein:vir:17 229 TVKTQPTVTYNAVKDSYQFTVTLTG--ATTSVTGF---LKAGDQV-----KFTNTYWLQQQTKQALYNGATPISFTATVT 298 (423) T ss_pred eecccccccccccccccceeeeeee--eeeeccCc---eeecceE-----EecceeeecccccccccccccccceEEEEE Confidence 00000 0000000000000000 00000000 0011111 12232222 22223332 Q ss_pred Eeecc-CCCCc-----cccC Q lcl|NC_011614. 
311 VPADA-KPSSV-----PGEV 324 (324) Q Consensus 311 ~~~~~-~~~~~-----~~~~ 324 (324) ..+.. .+..+ |+=. T Consensus 299 ~~~~~~a~~~~tv~i~p~~i 318 (423) T protein:vir:17 299 ADANSDSSGDVTVTLSGVPI 318 (423) T ss_pred ecccccccCceEEEecCccc Confidence 21110 11111 1111 No 192 >protein:vir:1781 Length: 221 # NCBI annotation: minor capsid protein # Family: family:all:975 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570347;genbank:gi:18640506;genbank:GeneID:932719 Probab=96.62 E-value=0.00022 Score=40.60 Aligned_cols=183 Identities=14% Similarity=0.045 Sum_probs=93.0 Q ss_pred EEEeehhHHHHHh-----cChhHHHHHHHHHHHHHHHHHHHHHHHh----ccCcCc-CC--cccccccccccceeecccc Q lcl|NC_011614. 108 LGVILPVTKEFLN-----YTYSQFFEEMKPMIAEAFYKKFDEAGIL----NQGNNP-FG--KSIAQSIEKTNKVIKGDFT 175 (324) Q Consensus 108 ~~~~v~iS~ell~-----~s~~~~~~~v~~~l~~ai~~~~d~a~l~----g~g~~~-~~--~~~~~~~~~~~~~~~~~~~ 175 (324) +- -.-+|+-+++ ++..++.+...+++++++++..|+.++. +..+.. .. .+..........+...... T Consensus 1 iD-~lL~a~~~VdDiD~aqa~~dvr~e~t~e~G~ALA~~~D~~i~~~~~~aA~~~~p~~~~~~g~~~~~~a~~t~~~~~l 79 (221) T protein:vir:17 1 MD-DLLVASQFVYDLDEILAQWNTRSEISKQIGEALAIHYDERIARVLASASIAAAPVTGQDGGFSVNIGAGNTNNAQAI 79 (221) T ss_pred CC-cchhHHHHHHhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCcccccccCcceeccccccCCHHHH Confidence 11 1223333333 3457899999999999999999998864 211111 00 0111110111111222344 Q ss_pred hhHHHHHHHHhhhhccCCC--EEEEcHHHHHHHHHhhcc-------CC-c-eeeccCCCceecccceEeecCccCCCce- Q lcl|NC_011614. 176 QDNIIDLEALLEDDELEAN--AFISKTQNRSLLRKIVDP-------ET-K-ERIYDRNSDSLDGLPVVNLKSSNLKRGE- 243 (324) Q Consensus 176 ~~~i~~~~~~l~~~~~~~~--~~v~~~~~~~~L~~l~d~-------~g-~-~~~~~~~~~~l~G~pv~~~~~~~~~~~~- 243 (324) ++.+.++..+|...+.... .++++|..+..|-+-.+. .+ + .+......+++.|++|+.++..+...+. 
T Consensus 80 ~dai~~a~~~LdekdVP~~gR~~vv~P~~y~~LL~~~d~~~~n~d~~~s~g~~~~g~~i~~v~G~~V~~SnnlP~~~gt~ 159 (221) T protein:vir:17 80 VDGFFEAAAVLDERSAPMDGRVAVLSPRQYYSLISSVDTNILNREIGNTQGDMNTGKGLYVNAGIRIYKSNVLASLYGTN 159 (221) T ss_pred HHHHHHHHHHHhhcCCCCCCCEEEeCcHHHHHHHHhcCcceeeeecccccccccccceeeeecCcEEEEeccCCcccccc Confidence 6778889999988776543 467799987777542221 01 1 1112223567899999988765542222 Q ss_pred --EEEeecccEEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeeccCCCCcc Q lcl|NC_011614. 244 --LITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADAKPSSVP 321 (324) Q Consensus 244 --i~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~~ 321 (324) ...|+|. ........++.+ |. + .+ +.+.+|+|+..+|.- .|.+-| T Consensus 160 ~~~~ag~~~-~~~~~~~~yr~~--------------------fs-~--------~~-glv~~~~Avgtvkl~--~~~~~~ 206 (221) T protein:vir:17 160 LVTDPGDAT-TSGENNGSYRPA--------------------IT-D--------RA-GLVFHKEAADTVEVL--LPPSRP 206 (221) T ss_pred cccCCcccc-cccccccccccc--------------------cc-c--------eE-EEEEcchheeeeeee--cCCCCC Confidence 1122221 000000011100 11 1 12 458899998888864 344444 Q ss_pred ccC Q lcl|NC_011614. 322 GEV 324 (324) Q Consensus 322 ~~~ 324 (324) --| T Consensus 207 ~~~ 209 (221) T protein:vir:17 207 PLV 209 (221) T ss_pred cee Confidence 444 No 193 >protein:vir:80835 Length: 464 # NCBI annotation: putative major capsid protein # Family: family:all:2450 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504125;genbank:gi:158079312;genbank:GeneID:5666484 Probab=96.51 E-value=0.00021 Score=40.73 Aligned_cols=303 Identities=11% Similarity=0.034 Sum_probs=134.0 Q ss_pred CchhhHHHHHHHHHhhccchhhhhccccccccC-CCcceechhhhHHHHHHHH--hhcchhhhceeeecCCCceEEEE-- Q lcl|NC_011614. 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHE-KKDGTLLNDFTTPILQEVM--ENSKIMQLGKYEPMEGTEKKFTF-- 75 (324) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~-~~g~lip~~~~~~i~~~~~--~~s~l~~l~~~~~~~~~~~~ip~-- 75 (324) |......+..+.+.....++.-...-...-.+. 
++++|=-+.+..+|..... +.-.+.+-..+.+..+...++-. T Consensus 1 ~~~~~n~~~~~~~~~e~~~Ks~ttgy~~~p~~q~~~~AlRrEsL~~~i~~Lt~~~~~f~f~~di~k~~a~STV~~y~~~~ 80 (464) T protein:vir:80 1 MTEKKNTERQLTSVQEEVIKGFTTGYGITPESQTDAAALRREFLDDQITMLTWADGDLSFYRDITKRPATSTVAKYDVYL 80 (464) T ss_pred CCcchhhHhhcCcccHHHHHHHHhCCccCcccccCcchhhhhhhhhhhheeeecccchhhhhhcCCchhhhhhhhhheee Confidence 543332333333333322222111111111112 2344433444444433222 22223333444444443333332 Q ss_pred -EeCCcceeeecccccccccccceeeEEeeeeeEEEe--ehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhccCc Q lcl|NC_011614. 76 -WADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVI--LPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGN 152 (324) Q Consensus 76 -~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~--v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~ 152 (324) +.....+.++.|+...+.+++++.......+-+... +.+-..+. ++..+.+....+.-.-.++..+|.+.|.|+.. T Consensus 81 ~~G~~g~~~f~~E~g~~~~~d~~~~Rr~~~~Kfl~~~r~vsia~~lv-n~~~d~~~~~~~dai~~va~tiE~a~FyGds~ 159 (464) T protein:vir:80 81 AHGRVGHTRFTREIGVAPISDPNLRQKTVNMKYVSDTKNMSIATGLV-NNIEDPMRILTDDAISVVAKTIEWASFYGDSD 159 (464) T ss_pred ccCccccccccccccccccCCCceEEEEEEeeeeecceeeeeehhhh-cchhhHHHHHHHHHHHHHHHHHHHHHhhhccc Confidence 233356789999999999999999988876654333 33333433 45667777777778888999999999999753 Q ss_pred -Cc--------CCcccccccccccce--eecccchhHHHHHHHHhhhhccCCCEEEEcHHHHHHH-HHhhccCCceeec- Q lcl|NC_011614. 153 -NP--------FGKSIAQSIEKTNKV--IKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLL-RKIVDPETKERIY- 219 (324) Q Consensus 153 -~~--------~~~~~~~~~~~~~~~--~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L-~~l~d~~g~~~~~- 219 (324) ++ +..|+...+...+.. -...++.+.|..+-..+..++..++-++|+..+.+.+ ....+. ++.+. 
T Consensus 160 l~~~~~~~~gleFDGl~~lI~~~NViDarG~~Ls~~~ln~Aa~~i~~~fGt~TD~~lp~~v~a~f~n~~l~~--q~~~~~ 237 (464) T protein:vir:80 160 LSENPDAGSGLEFDGLAKLIDKHNVLDAKGASLTEALLNQASVLVGKGYGTPTDAYMPIGVQADFVNQQLDR--QVQVIS 237 (464) T ss_pred cCCCCCCccccchhhhHhhcCCCceeecCCCCcCHHHHhhhhhhhhcccCChhhcccchhHHHHHHhhhcCc--eeEEEc Confidence 11 112222222222222 2234667777777777777888888899999998775 443322 23332 Q ss_pred cCCCceecccceEeecCccCCCceEEEeecccEEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEecc Q lcl|NC_011614. 220 DRNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVAL 299 (324) Q Consensus 220 ~~~~~~l~G~pv~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~ 299 (324) +...+...|+|+--..+. .+ .+-+- ++. ++.... -++..+. .....|+.-++..-+..--.+ T Consensus 238 ~n~~~~~~G~~v~~f~sa-~G--~i~L~-~s~-~m~~~~--~ld~~~~-----------~~~~apaapsvt~tv~~~~~g 299 (464) T protein:vir:80 238 DNGQNATMGFNVKGFNSA-RG--FIRLH-GST-VMELEQ--ILDENRM-----------QLPNAPQKATVKATLEAGTKG 299 (464) T ss_pred CCCCcceeeeeccccccc-cc--ceecc-Ccc-ccCccc--ccccccc-----------cCCCCcCCceeEEEecCCccc Confidence 223334567666211110 00 00000 000 000000 0000000 000011111221111110000 Q ss_pred --EEec-cc-ceEEEEeeccCCCCccccC Q lcl|NC_011614. 300 --HIAD-DK-AFAKLVPADAKPSSVPGEV 324 (324) Q Consensus 300 --~v~~-~~-a~~~l~~~~~~~~~~~~~~ 324 (324) .-.+ +. --=++......+.+.|-++ T Consensus 300 ~f~~~~~~~~~~Ykv~~vn~~GeS~ps~~ 328 (464) T protein:vir:80 300 KFRDEDLTIDTEYKVVVVSDDAESAPSDV 328 (464) T ss_pred CCccccccceeEEEEEEECCCCcccccee Confidence 0000 00 0112223344455555443 No 194 >protein:vir:63741 Length: 468 # NCBI annotation: Cps # Family: family:all:2450 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547622;genbank:GeneID:3783474 Probab=96.44 E-value=0.00056 Score=38.41 Aligned_cols=298 Identities=11% Similarity=0.020 Sum_probs=142.5 Q ss_pred CchhhHHHHH----HHHHhhccchhhhhccccccccC-CCcceechhhhHHHHHHHHhhc--chhhhceeeecCCCceEE Q lcl|NC_011614. 
1 MEQTQKLKLN----LQHFASNNVKPQVFNPDNVMMHE-KKDGTLLNDFTTPILQEVMENS--KIMQLGKYEPMEGTEKKF 73 (324) Q Consensus 1 m~~~~~~~~~----~~~~~~~~~~~~~~~a~~~~~~~-~~g~lip~~~~~~i~~~~~~~s--~l~~l~~~~~~~~~~~~i 73 (324) |-|+||.+.- .++.+....+.....-...-.+. +++++=-+.+..+|........ .+.+-..+.+..+...++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~e~~~Ks~~agy~~~p~~q~~~~AlR~EsL~~~i~~L~~~~~~f~~~~di~k~~a~stv~~y 80 (468) T protein:vir:63 1 MPKNNKEEEVKEVNLNSVQEDALKSFTTGYGITPDTQTDAGALRREFLDDQISMLTWTENDLTFYKDIAKKPATSTVAKY 80 (468) T ss_pred CCCCcchhhccccChhHHHHHHHHHHHcCcccCCccccCcchhhhhhhhhhhheeeecccchhhhhhcccchhhhhhhhh Confidence 9999986433 33333332222111111111112 2344434445555543322222 222323334444333233 Q ss_pred EE---EeCCcceeeecccccccccccceeeEEeeeeeEEEeehhHHHHHhc-ChhHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_011614. 74 TF---WADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNY-TYSQFFEEMKPMIAEAFYKKFDEAGILN 149 (324) Q Consensus 74 p~---~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~v~iS~ell~~-s~~~~~~~v~~~l~~ai~~~~d~a~l~g 149 (324) -. +.....+.++.|+...+.+++++.......+-++....+|.-+-.. +..+.+....+.-.-.++..+|.++|.| T Consensus 81 ~~~~~~G~~g~~~f~~E~g~~~~~~~~~~r~~~~~k~l~~~~~vs~~~~l~n~i~d~~~~~~~~ai~~~a~tiE~a~FyG 160 (468) T protein:vir:63 81 DVYMQHGKVGHTRFTREIGVAPVSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTDDAIVNIAKTIEWASFFG 160 (468) T ss_pred eeeeccCccccccccccccccccCCCceEEEEEEeeeeeeeeeehhhhhhhcchhhHHHHHHHHHHHHHHHHHHHHhhhc Confidence 22 2333567899999999999999999999999998877776643332 3457777888888888999999999998 Q ss_pred cCcC----c-----CCccccccccccccee--ecccchhHHHHHHHHhhhhccCCCEEEEcHHHHHHHH-HhhccCCcee Q lcl|NC_011614. 150 QGNN----P-----FGKSIAQSIEKTNKVI--KGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLR-KIVDPETKER 217 (324) Q Consensus 150 ~g~~----~-----~~~~~~~~~~~~~~~~--~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~-~l~d~~g~~~ 217 (324) +..- . +..|+........... ...++.+++..+...+...+..++-++|+..+.+.|- ... ..+.. 
T Consensus 161 ds~l~~s~~~~~glqfDGi~~li~~enviDa~G~~ls~~~lneaa~~i~~gfG~~td~~~~~~v~a~~~~~~L--~~q~~ 238 (468) T protein:vir:63 161 DSDLSDSPEPQAGLEFDGLAKLINQDNVHDARGASLTESLLNQAAVMISKGYGTPTDAYMPVGVQADFVNQQL--SKQTQ 238 (468) T ss_pred ccccccCCCccccccccceeEEecCCceeccCCCccCHHHHHHHhhhccccccChhhhhcchhHHhhhhhhhc--CceEE Confidence 7532 1 2223332222221111 1234555666666666667777778899999887772 221 11222 Q ss_pred e-ccCCCceecccceE--eec--CccCCCceEEEeecccEEEEEecceEEEEeecccccccccccccchhhhhcCcEEEE Q lcl|NC_011614. 218 I-YDRNSDSLDGLPVV--NLK--SSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALR 292 (324) Q Consensus 218 ~-~~~~~~~l~G~pv~--~~~--~~~~~~~~i~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r 292 (324) + .+.......|.||- ++. ...+.. ..++++...+ .-++ .+.. .. ..-..+. T Consensus 239 v~~~n~~~~~~G~~v~g~~sa~G~I~l~g-s~il~~~~~l--------~~~~--~~~~-~A----------psp~~vs-- 294 (468) T protein:vir:63 239 LVRDNGNNVSVGFNIQGFHSARGFIKLHG-STVMENEQIL--------DERI--LALP-TA----------PQPAKVT-- 294 (468) T ss_pred EEcCCCCceeeeecccceecceeeeeecC-ceeeccccCC--------Cccc--cccc-cc----------ccCCccc-- Confidence 2 22233445666662 111 001111 1222222111 0000 0000 00 0000000 Q ss_pred EEEEeccEEe----cccceE-EEEeeccCCCCccccC Q lcl|NC_011614. 293 ATMHVALHIA----DDKAFA-KLVPADAKPSSVPGEV 324 (324) Q Consensus 293 ~~~r~d~~v~----~~~a~~-~l~~~~~~~~~~~~~~ 324 (324) +....++.-. ++.... +++.....+++.|.+. 
T Consensus 295 aT~~~~~~g~~~~~~~a~y~Y~v~~vs~~GES~pS~~ 331 (468) T protein:vir:63 295 ATQEAGKKGQFRAEDLAAHEYKVVVSSDDAESIASEV 331 (468) T ss_pred eeeecccCCcccCCCcceEEEEEEEECCCCccccccc Confidence 1111111111 111111 2344455667777665 No 195 >protein:vir:96666 Length: 462 # NCBI annotation: ORF016 # Family: family:all:2450 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238545;genbank:gi:66391271;genbank:GeneID:5130448 Probab=96.37 E-value=0.00046 Score=38.90 Aligned_cols=298 Identities=12% Similarity=0.069 Sum_probs=146.8 Q ss_pred CchhhHHHHHH--HHHhhccchhhhhcccccc---ccCCCcceechhhhHHHHHHHH--hhcchhhhceeeecCCCceEE Q lcl|NC_011614. 1 MEQTQKLKLNL--QHFASNNVKPQVFNPDNVM---MHEKKDGTLLNDFTTPILQEVM--ENSKIMQLGKYEPMEGTEKKF 73 (324) Q Consensus 1 m~~~~~~~~~~--~~~~~~~~~~~~~~a~~~~---~~~~~g~lip~~~~~~i~~~~~--~~s~l~~l~~~~~~~~~~~~i 73 (324) |+++-.++.+. .++..+.. .-+.+.-.. +-.++|+|=-+.+..+|..... +.-.+.+-..+.+..+...++ T Consensus 3 ~~~~~~~~~~~~~~~~~e~~~--KS~~tg~g~~p~~q~~~gAlR~esL~~~i~~Lt~~~~~~~~~~~i~k~~a~sTv~~y 80 (462) T protein:vir:96 3 KDTNLTAEQNKYADKFQEEVM--KSYQTGYGITPDTQVDAGALRREILDDQITMLTWTQDDLIFYREISRRPAQSTVQKY 80 (462) T ss_pred cccccchhhhhhhchhhHHHH--HHHhcCCCcCCccccccchhhhhhhhhhhheeeecccchhhhhhcCCchhhhhhhhh Confidence 55544443332 13332111 112222111 1122444433445554433322 222233334444444433333 Q ss_pred EE---EeCCcceeeecccccccccccceeeEEeeeeeEEEeehhHHHHH-hcChhHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_011614. 74 TF---WADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFL-NYTYSQFFEEMKPMIAEAFYKKFDEAGILN 149 (324) Q Consensus 74 p~---~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~v~iS~ell-~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g 149 (324) -. 
+.....+.++.|+...+.+++++.......+-++..-.+|...- ..+..+.+....+.-.-.++..+|.+.|.| T Consensus 81 ~~~~~~G~~g~~~f~~E~g~~~~~d~~~~R~~~~~k~l~~t~~vsi~~tl~n~~~d~~~~~~~dai~~~a~tiE~a~Fyg 160 (462) T protein:vir:96 81 DVYLRHGNVGHSRFVREVGVAPVSDPNIRQKTVEMKYVSDTKNLSIASTLVNNIQDPMQILTEDAIAVVAKTIEWASFYG 160 (462) T ss_pred eeeeccCccccccccccccccccCCCceEEEEEEEEEEeeeeeechhhhhccchhhHHHHHHHHHHHHHHHHHHHHHhhh Confidence 33 23335678999999999999999999999999888766666433 234567778888888889999999999999 Q ss_pred cCc---Cc-----CCcccccccccccce--eecccchhHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccCCceeec Q lcl|NC_011614. 150 QGN---NP-----FGKSIAQSIEKTNKV--IKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIY 219 (324) Q Consensus 150 ~g~---~~-----~~~~~~~~~~~~~~~--~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~g~~~~~ 219 (324) +.. +. +..|+...+...+.. -...++.+.|..+--.+..++..++-++|+..+.+.|..---...+.++. T Consensus 161 ds~l~~~~~~~gleFDGl~~lI~~~NViDarG~~Ls~~~ln~aa~~i~~~fGt~TD~~~p~~v~a~f~~~~l~~qrv~~~ 240 (462) T protein:vir:96 161 DASLTADPTGQGLEFDGLAKLIDKDNVIDAKGESLTETLLNRSAVLIGKSFGTATDAYMPIGVHADFVNSVLGRQMQLMQ 240 (462) T ss_pred hcccCCCccccccchhhhhhhcCCCceeecCCCCccHHHHhhhhhhcccccCChhheecchHHHHHHHHhhcCceEEEEc Confidence 753 11 122332222222222 22345666666666667777888888999999999986433233333333 Q ss_pred cCCCceecccceEe--ecC--ccCCCceEEEeecccEEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEE Q lcl|NC_011614. 220 DRNSDSLDGLPVVN--LKS--SNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATM 295 (324) Q Consensus 220 ~~~~~~l~G~pv~~--~~~--~~~~~~~i~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~ 295 (324) ...+....|+|+-- +.. ..+....++-. ... +.-+.+. .. ..-....+.+.. 
T Consensus 241 ~n~g~~~~G~~v~~f~s~~G~I~L~~s~~m~~---~~i--------~~~~~~~--~p-----------~ap~~~~vsaTv 296 (462) T protein:vir:96 241 DNSGNVNAGYNVQGFYSSRGFIKLHGSTVMEN---ELI--------LDESLQP--LP-----------NAPQPATVKATV 296 (462) T ss_pred CCCCceeeeeeccceeeeeeeeeeCCceecCc---ccc--------ccccccc--CC-----------CCCCCCceeEEE Confidence 33334567777621 110 01111100000 000 1111000 00 000111233332 Q ss_pred Eec--cEEeccc-c---eEEEEeeccCCCCccccC Q lcl|NC_011614. 296 HVA--LHIADDK-A---FAKLVPADAKPSSVPGEV 324 (324) Q Consensus 296 r~d--~~v~~~~-a---~~~l~~~~~~~~~~~~~~ 324 (324) ..+ +...++. + ==+++.....+.+.|.|. T Consensus 297 ~t~~~g~f~~~~d~~~y~Y~V~avs~dgeS~PS~~ 331 (462) T protein:vir:96 297 ETGKKGLFTDEHDRAELTYKVVVNSDDAQSAPSEA 331 (462) T ss_pred EeCCCCCCCCccCceeEEEEEEEECCCCcccccee Confidence 222 3333432 1 011333344555556554 No 196 >protein:vir:78558 Length: 336 # NCBI annotation: major capsid protein # Family: family:all:1653 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294848;genbank:gi:149882911;genbank:GeneID:5291029 Probab=96.19 E-value=0.00031 Score=39.80 Aligned_cols=293 Identities=10% Similarity=0.029 Sum_probs=143.5 Q ss_pred CchhhHHHHHHH---HHhhc--cchhh--hhccc--cccc--cCCCcceechhhh----HHHHHHHHhhcchhhhceeee Q lcl|NC_011614. 1 MEQTQKLKLNLQ---HFASN--NVKPQ--VFNPD--NVMM--HEKKDGTLLNDFT----TPILQEVMENSKIMQLGKYEP 65 (324) Q Consensus 1 m~~~~~~~~~~~---~~~~~--~~~~~--~~~a~--~~~~--~~~~g~lip~~~~----~~i~~~~~~~s~l~~l~~~~~ 65 (324) |+.-+.++.-.+ +|... .+..+ .+..+ .... ++....-||..+. ..+++.+........+..+.. 
T Consensus 1 ~~~~~~~~~l~~~gi~~~~~~~~~~~~~~~~a~da~d~~~~~~t~~~~g~~~~l~~~i~p~~~~~~~~~~~~~~l~~v~t 80 (336) T protein:vir:78 1 MRDAQRIQNLARAGVILPRSVKNVSTPLAEYAMDAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELVGESK 80 (336) T ss_pred CchHHHHHHHhccCeecchhhhhhhHHHHHHHHhhhhhccccccCCCcchHHHHHHhcccceeeehhhhhhhhhhccccc Confidence 776666544333 12111 11111 11111 1110 1011111333333 344455555555556666555 Q ss_pred cCCC---ceEEEEEeCCcceeeecccccccccccceeeEEeeeeeEEEeehhHH-HHHhcC--hhHHHHHHHHHHHHHHH Q lcl|NC_011614. 66 MEGT---EKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTK-EFLNYT--YSQFFEEMKPMIAEAFY 139 (324) Q Consensus 66 ~~~~---~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~v~iS~-ell~~s--~~~~~~~v~~~l~~ai~ 139 (324) .+.. ...+++.+....|.+++.++..|..+...+...-+.+.++..+.++. |+.... ..++.+.-....++++. T Consensus 81 ~g~W~~~~~~~~~~e~~G~a~~ygd~~D~P~vd~~~~~~~~~v~~~~~g~~yg~~El~~A~~~g~~l~~~Ka~aA~~ale 160 (336) T protein:vir:78 81 KGDWTTLVAAFITAEPTTTVATYGDYSSDGDSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLA 160 (336) T ss_pred CCCccccEEEEeeeecceeeEEeecccCCCeeecceeeEEEEEEEEEeeeeecHHHHHHHHHhCCCcHHHHHHHHHHHHH Confidence 4433 34577777777888999999999999888888888888888888885 443322 36788888888899999 Q ss_pred HHHHHHHHhccCcCcCCccccc--cccccccee-------ecccchhHHHHHHHHhhhhc------cCCCEEEEcHHHHH Q lcl|NC_011614. 140 KKFDEAGILNQGNNPFGKSIAQ--SIEKTNKVI-------KGDFTQDNIIDLEALLEDDE------LEANAFISKTQNRS 204 (324) Q Consensus 140 ~~~d~a~l~g~g~~~~~~~~~~--~~~~~~~~~-------~~~~~~~~i~~~~~~l~~~~------~~~~~~v~~~~~~~ 204 (324) +.+++-.+.|+.... ..|+++ .+....... +..--++|+..++.++...- ..+..++++|..+. 
T Consensus 161 ~~~N~~~~~Gd~~~~-~~GllN~P~l~a~~t~~~~~w~~~T~~~I~~Di~~~~~~l~~qt~g~~~~~~~~tL~Lp~~~~~ 239 (336) T protein:vir:78 161 KFLNGSYLFGVAGLE-NYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMS 239 (336) T ss_pred HhhCeEEEEeccccc-eEEEEeCCCCCcccccCcCcccccCHHHHHHHHHHHHHHHHHhcCCeeeeccceEEEechHHHH Confidence 999988888875431 112222 221111111 11223567777777774322 12457999999999 Q ss_pred HHHHhhccCCceeec--cCCCceecccceEeecC-ccCCCceEEEeecccEEEEEec---ceEEEEeecccccccccccc Q lcl|NC_011614. 205 LLRKIVDPETKERIY--DRNSDSLDGLPVVNLKS-SNLKRGELITGDFDKLIYGIPQ---LIEYKIDETAQLSTVKNEDG 278 (324) Q Consensus 205 ~L~~l~d~~g~~~~~--~~~~~~l~G~pv~~~~~-~~~~~~~i~~gd~~~~~~~~~~---~~~i~~~~~~~~~~~~~~~~ 278 (324) .|... +..|-.++. ..+ +-++-++..+. ...+ |+...++.-... -.++.+...-..+.... T Consensus 240 ~L~~~-n~~g~tv~~~lk~n---~Pnl~i~t~pel~~Ag------g~~~~~~~~~~~~~~t~~~~~p~~f~~lpvq~--- 306 (336) T protein:vir:78 240 DLSKT-NQYGLSAAAKLKEI---FPKLEFVTIPEYDTAS------GRLVQLWAPRVEGKDTATCGFTEKMRAHSIER--- 306 (336) T ss_pred hccCC-CccCccHHHHHHHh---cCccEEEEcccccccC------cceEEEEEeeccCCcceeeecchhhhccceee--- Confidence 98643 222322211 111 00111211110 0111 111111111111 11222211111111000 Q ss_pred cchhhhhcCcEEEEEEEEecc-EEecccceEEEEee Q lcl|NC_011614. 279 TPVNLFEQDMVALRATMHVAL-HIADDKAFAKLVPA 313 (324) Q Consensus 279 ~~~~~f~~~~v~~r~~~r~d~-~v~~~~a~~~l~~~ 313 (324) ..-....-...|+++ .+.+|.||+++++. 
T Consensus 307 ------~~~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 336 (336) T protein:vir:78 307 ------YSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) T ss_pred ------cCceeEeccccceeeeeeeccchheeeccC Confidence 011122233445544 45569999999998 No 197 >protein:vir:80491 Length: 467 # NCBI annotation: Cps # Family: family:all:2450 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468466;genbank:gi:157325041;genbank:GeneID:5601449 Probab=96.11 E-value=0.0009 Score=37.29 Aligned_cols=298 Identities=11% Similarity=0.027 Sum_probs=144.1 Q ss_pred CchhhHHHHH---HHHHhhccchhhhhccccccccC-CCcceechhhhHHHHHHHHhhc--chhhhceeeecCCCceEEE Q lcl|NC_011614. 1 MEQTQKLKLN---LQHFASNNVKPQVFNPDNVMMHE-KKDGTLLNDFTTPILQEVMENS--KIMQLGKYEPMEGTEKKFT 74 (324) Q Consensus 1 m~~~~~~~~~---~~~~~~~~~~~~~~~a~~~~~~~-~~g~lip~~~~~~i~~~~~~~s--~l~~l~~~~~~~~~~~~ip 74 (324) |-++|+.+.. .++.+.+-.++-...-...-.+. +++++=-+.+..+|........ .+.+-..+.+..+...++- T Consensus 1 ~~~~~~~~~~~~n~~~~~e~~~Ks~~agy~~~p~tq~~~~AlR~EsL~~~i~~Lt~~~~~f~~~~di~k~~a~stv~~y~ 80 (467) T protein:vir:80 1 MPKNNKEEVKEVNLNSVQEDALKSFTTGYGITPDTQTDAGALRREFLDDQISMLTWTENDLTFYKDIAKKPATSTVAKYD 80 (467) T ss_pred CCCcchhhhhhcccccCHHHHHHHHHcccccCCccccCcchhhhhhhhhhhheeeccccchhhhhhcccchhhhhhhhhe Confidence 9999885443 33444443333211111111112 2344444555555543332222 2223233344443333333 Q ss_pred E---EeCCcceeeecccccccccccceeeEEeeeeeEEEeehhHHHHHhc-ChhHHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|NC_011614. 75 F---WADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNY-TYSQFFEEMKPMIAEAFYKKFDEAGILNQ 150 (324) Q Consensus 75 ~---~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~v~iS~ell~~-s~~~~~~~v~~~l~~ai~~~~d~a~l~g~ 150 (324) . +.....+.++.|+...+.+++++.......+-++....+|.-+-.. 
+..+.+....+.-.-.++..+|.++|.|+ T Consensus 81 ~~~~~G~~g~~~f~~E~g~~~~~~~~~~r~~~~~k~l~~~~~vs~~~~l~n~i~d~~~~~~~~ai~~~a~tiE~a~FyGd 160 (467) T protein:vir:80 81 VYMQHGKVGHTRFTREIGVAPVSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTDDAIVNIAKTIEWASFFGD 160 (467) T ss_pred eeeccCccccccccccccccccCCCceEEEEEEeeeeeeeeeehhhhhhhcchhhHHHHHHHHHHHHHHHHHHHHhhhcc Confidence 2 2333567899999999999999999999999998877776643332 34577778888888889999999999987 Q ss_pred CcC----c-----CCccccccccccccee--ecccchhHHHHHHHHhhhhccCCCEEEEcHHHHHHHH-HhhccCCceee Q lcl|NC_011614. 151 GNN----P-----FGKSIAQSIEKTNKVI--KGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLR-KIVDPETKERI 218 (324) Q Consensus 151 g~~----~-----~~~~~~~~~~~~~~~~--~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~-~l~d~~g~~~~ 218 (324) ..- . +..|+........... ...++.+++..+...+...+..++-++|+..+.+.|- ... ..+..+ T Consensus 161 s~l~~s~~~~~glqfDGi~~li~~enviDa~G~~ls~~~lneaa~~i~~gfG~~td~~~p~~v~a~~~~~~L--~~q~~v 238 (467) T protein:vir:80 161 SDLSDSPEPQAGLEFDGLAKLINQDNVHDARGASLTESLLNQAAVMISKGYGTPTDAYMPVGVQADFVNQQL--SKQTQL 238 (467) T ss_pred cccccCCCccccccccceeEEecCCceeccCCCccCHHHHHHHhhhccccccChhhhhcchhHHhhhhhhhc--CceEEE Confidence 532 1 2223332222221111 1234555666666666667777778899999887772 221 112222 Q ss_pred -ccCCCceecccceE--eec--CccCCCceEEEeecccEEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEE Q lcl|NC_011614. 219 -YDRNSDSLDGLPVV--NLK--SSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRA 293 (324) Q Consensus 219 -~~~~~~~l~G~pv~--~~~--~~~~~~~~i~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~ 293 (324) .+.......|.||- ++. ...+.. ..++++...+ +-++ .+... . ..-..+. 
+ T Consensus 239 ~~~n~~~~~~G~~v~g~~sa~G~I~l~g-s~il~~~~~l--------~~~~--~~~~~-A----------psp~~vs--a 294 (467) T protein:vir:80 239 VRDNGNNVSVGFNIQGFHSARGFIKLHG-STVMENEQIL--------DERI--LALPT-A----------PQPAKVT--A 294 (467) T ss_pred EcCCCCceeeeecccceecceeeeeecC-ceeeccccCC--------Cccc--ccccc-c----------ccCCccc--e Confidence 22233445666662 111 001111 1222222111 0000 00000 0 0000000 1 Q ss_pred EEEeccEEe----cccceE-EEEeeccCCCCccccC Q lcl|NC_011614. 294 TMHVALHIA----DDKAFA-KLVPADAKPSSVPGEV 324 (324) Q Consensus 294 ~~r~d~~v~----~~~a~~-~l~~~~~~~~~~~~~~ 324 (324) ....++.-. ++.... +++.....+++.|.+. T Consensus 295 T~~~~~~g~~~~~~~a~y~Y~v~~vs~~GES~pS~~ 330 (467) T protein:vir:80 295 TQEAGKKGQFRAEDLAAHEYKVVVSSDDAESIASEV 330 (467) T ss_pred eeecccCCcccCCCcceEEEEEEEECCCCccccccc Confidence 111111111 111111 2344455667777665 No 198 >protein:vir:103886 Length: 302 # NCBI annotation: putative major head subunit protein # Family: family:all:776 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938242;genbank:gi:38229147;genbank:GeneID:2648201 Probab=95.96 E-value=0.0012 Score=36.61 Aligned_cols=265 Identities=8% Similarity=0.038 Sum_probs=122.7 Q ss_pred CchhhHHHHHHHHHhhccchhhhhccccccccCCCcceechhhhHHHHHHHHhh-cchhhhceeeecCCCceEEEEEeCC Q lcl|NC_011614. 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMEN-SKIMQLGKYEPMEGTEKKFTFWADK 79 (324) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~-s~l~~l~~~~~~~~~~~~ip~~~~~ 79 (324) |.-+..+= .++-..+...+.+..... ....+++++++-+....++.....- T Consensus 1 m~it~~~l----------------------------~~l~~~~~~~~~~~y~~a~~~~~~~a~~~~sdf~~~~~~~lg~~ 52 (302) T protein:vir:10 1 MLINKQSL----------------------------NAAFVAIKTIFNNAFAAAPTTWQKIAMEVPSNTSSNDYKWLSTF 52 (302) T ss_pred CcccHHHH----------------------------HHHHHHHHHHHHHHHHhhhhhhhceeeecCCCcceeeceecCCC Confidence 32222110 001111111122222111 1244455655544444445444444 Q ss_pred cce-eeecccccccccccceeeEEeeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhccCc--C--- Q lcl|NC_011614. 
80 PGA-YWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGN--N--- 153 (324) Q Consensus 80 ~~a-~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~--~--- 153 (324) |.. .|++| +......=...+++.++++..+.||++.+.+-..++..-+.+.++++.++.+|+.++.--.+ + T Consensus 53 p~l~e~~Ge---~~~~~l~~~~~~i~~~~~g~~v~i~R~~i~nDdlg~~~~~~~~~G~aaa~~~~~lv~~~L~~g~~~~~ 129 (302) T protein:vir:10 53 PKMRRWIGA---KVVKNLKAYKYVVENEDFEATVEVDRNDIEDDQIGIYSPQAKMAGYSAAQLPDELVYEAVNGAFTKPC 129 (302) T ss_pred CCccccccc---eeeccccccceeEEeecccceecccHHhhcccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCCcc Confidence 543 56544 44445555567799999999999999999988888999999999999999999876642111 1 Q ss_pred cCCccccccccc-----------cc-ceeecccc---hhHHHHHHHHhhhh-----ccCCCEEEEcHHHHHHHHHhhccC Q lcl|NC_011614. 154 PFGKSIAQSIEK-----------TN-KVIKGDFT---QDNIIDLEALLEDD-----ELEANAFISKTQNRSLLRKIVDPE 213 (324) Q Consensus 154 ~~~~~~~~~~~~-----------~~-~~~~~~~~---~~~i~~~~~~l~~~-----~~~~~~~v~~~~~~~~L~~l~d~~ 213 (324) ..+......--. .. ......++ ++....++.++... ...|..+++.|......+++-.. T Consensus 130 ~DG~~fF~~dH~~g~~~~~N~g~~~~~~~~~~l~~~~~~aa~~am~~~k~~~G~~L~i~P~~LiVp~~le~~A~~ll~~- 208 (302) T protein:vir:10 130 FDGQYFIDTDHPVGDASVSNKGTAPLSNASQAAAKAGYGAARTAMKKFKDEEGRSLNVSPNVLLVGPALEDVAKMLLTN- 208 (302) T ss_pred cCCcceecccccccccccccccchhhhhcccccchHHHHHHHHHHHHHhhhcccccccCCCEEEecchhHHHHHHHhhc- Confidence 111111111000 00 00112233 33344444444332 34456788888887666554321 Q ss_pred CceeeccCCCceeccc-ceEeecCccCCCceEEEeeccc---EEEEEecceEEEEeecccccccccccccchhhhhcCcE Q lcl|NC_011614. 214 TKERIYDRNSDSLDGL-PVVNLKSSNLKRGELITGDFDK---LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMV 289 (324) Q Consensus 214 g~~~~~~~~~~~l~G~-pv~~~~~~~~~~~~i~~gd~~~---~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v 289 (324) ++ ...+....+.|. -+++.+....+..=.++.+.+. 
+++.-+++..++..+ -|..+.+ T Consensus 209 ~~--~~~g~~Np~~g~~~~vv~p~L~s~~aWyL~a~~~~i~~~~l~g~~~P~~~~~~----------------~~~~dgv 270 (302) T protein:vir:10 209 PK--LADNTPNPYVGTAELVVDGRIESDTAWFLLDTTKPVKPFIFQPRKQPEFVSQV----------------NLDSDDV 270 (302) T ss_pred cc--cCCCCcceeccceEEEEeeccCCCCceEEEecCCccceEEEcCccccEEEecc----------------CCCCCce Confidence 11 111222223332 2333322222222233333332 233334444444322 2566777 Q ss_pred EEEEEEEeccE------EecccceEEEEeecc Q lcl|NC_011614. 290 ALRATMHVALH------IADDKAFAKLVPADA 315 (324) Q Consensus 290 ~~r~~~r~d~~------v~~~~a~~~l~~~~~ 315 (324) .+|.+..+|.. ...+...-.-+..++ T Consensus 271 ~~k~~~d~Gvd~R~~~G~~~wq~a~~s~g~~~ 302 (302) T protein:vir:10 271 FNLRKLKFGAEARAAAGYGFWQLAYGSTGTGA 302 (302) T ss_pred EEEEEEEEeeeeeeecchhhhhhhhccCccCC Confidence 77766666641 111111111122111 No 199 >protein:vir:96079 Length: 382 # NCBI annotation: hypothetical protein ORF023 # Family: family:all:1653 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294440;genbank:gi:149408337;genbank:GeneID:5237198 Probab=95.60 E-value=0.0018 Score=35.68 Aligned_cols=301 Identities=10% Similarity=-0.070 Sum_probs=127.4 Q ss_pred Cch------------------hh-------HHHHH------------HHHHhhccchhhhh--ccccccccCCCcceech Q lcl|NC_011614. 1 MEQ------------------TQ-------KLKLN------------LQHFASNNVKPQVF--NPDNVMMHEKKDGTLLN 41 (324) Q Consensus 1 m~~------------------~~-------~~~~~------------~~~~~~~~~~~~~~--~a~~~~~~~~~g~lip~ 41 (324) |.+ .+ +++.. .+...+.......+ -++.....+.+..=||- T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~gi~~~~~~~~~~~~~~~~~~~~~~~~amDa~~~~~~t~~~~g~p~ 80 (382) T protein:vir:96 1 MSHISKTHSRLAGRHAKPFDLKNVTHEAVAALGRIGLVFDHAVVQDQIKALAKAGAFRSGSAMDSNFTAPVTTPSIPTPI 80 (382) T ss_pred CCCcceeeeecCCccccchhhhcccHHHHHHHhccccccCcccchhHhhhhhhhhhhhhhcccccccCCccccCCccHHH Confidence 111 11 11100 01111111100111 11111111122222465 Q ss_pred hhhH----HHHHHHHhhcchhhhceeeecCCC---ceEEEEEeCCcceeeecccccccccccceeeEEeeeeeEEEeehh Q lcl|NC_011614. 
42 DFTT----PILQEVMENSKIMQLGKYEPMEGT---EKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPV 114 (324) Q Consensus 42 ~~~~----~i~~~~~~~s~l~~l~~~~~~~~~---~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~v~i 114 (324) .+.+ .+++-+.+--....+..+...+.. .+.+++.+....|.+++.++..|..+...+..+-..+.+.....+ T Consensus 81 ~~l~~~~p~~~~~~~~p~~~~~l~pv~t~g~W~~~t~ty~~~e~~G~A~~ygd~~D~Pl~d~~~~~~~r~v~~~~~g~~y 160 (382) T protein:vir:96 81 QFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVEPAGTAVEYGDHTNIPLTSWNANFERRTIVRGELGLLV 160 (382) T ss_pred HHHhhhhhhhhhhhhhhhhhhhhccccccCCccceEEEEeeeecccceEEeecccCCCccccccceeEEEEEEEEEeeee Confidence 5554 445555555556666666554432 346777777788889998888887665544433334444444444 Q ss_pred -HHHHHhcC--hhHHHHHHHHHHHHHHHHHHHHHHHhccCcC-cCC-ccccc--ccccccceee-------cccchhHHH Q lcl|NC_011614. 115 -TKEFLNYT--YSQFFEEMKPMIAEAFYKKFDEAGILNQGNN-PFG-KSIAQ--SIEKTNKVIK-------GDFTQDNII 180 (324) Q Consensus 115 -S~ell~~s--~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~-~~~-~~~~~--~~~~~~~~~~-------~~~~~~~i~ 180 (324) ..|+.+.+ ..++.+.-....++++.+.+|+-.|.|+..+ ..+ .|+++ .+.+....++ ..--++|+. T Consensus 161 g~lE~~rAa~~~~~l~~~Ka~aA~~ale~~~N~i~f~G~~~g~~~~~yGllNdP~l~a~~t~a~~~Wa~kT~~eI~~Di~ 240 (382) T protein:vir:96 161 GTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPPFQTPPSQGWATADWAGIIGDIR 240 (382) T ss_pred cHHHHHHHHhhCCCcHHHHHHHHHHHHHHhhceEEEEeeecCcCcceEEEEeCCCcccccccCCCCcccccHHHHHHHHH Confidence 45655543 3577777888888999999999999996322 111 12222 2211111111 111246777 Q ss_pred HHHHHhhhhcc-------CCCEEEEcHHHHHHHHHhhccCCceeec--cCC-C-ceecccceEeecCccCCCceEEEeec Q lcl|NC_011614. 181 DLEALLEDDEL-------EANAFISKTQNRSLLRKIVDPETKERIY--DRN-S-DSLDGLPVVNLKSSNLKRGELITGDF 249 (324) Q Consensus 181 ~~~~~l~~~~~-------~~~~~v~~~~~~~~L~~l~d~~g~~~~~--~~~-~-~~l~G~pv~~~~~~~~~~~~i~~gd~ 249 (324) .++.++...-. .+..++++|..+..|... +..|-.++. 
..+ + -++-..|=..........+ T Consensus 241 ~l~~~i~~qt~G~~~~~~~~~~L~LP~~~~~~Ls~~-n~~g~Tvl~~lk~n~Pnl~i~t~peL~~a~~~g~g~------- 312 (382) T protein:vir:96 241 EAVRQLRIQSQDQIDPKAEKITMALATSKVDYLSVT-TPYGISVSDWIEQTYPKMRIVSAPELSGVQMQGKTP------- 312 (382) T ss_pred HHHHHHHhccCCeeeecccceEEeechHHHhhcccc-CccCccHHHHHHHhcCCcEEEEccccccccCCCccc------- Confidence 78777743221 133688999988887432 222221111 110 0 0111111100000000000 Q ss_pred ccEEEEEecceEE--EEeecccccccccccccchhhhhcCcE-----EE--EEEE-EeccEEecccceEEEEee Q lcl|NC_011614. 250 DKLIYGIPQLIEY--KIDETAQLSTVKNEDGTPVNLFEQDMV-----AL--RATM-HVALHIADDKAFAKLVPA 313 (324) Q Consensus 250 ~~~~~~~~~~~~i--~~~~~~~~~~~~~~~~~~~~~f~~~~v-----~~--r~~~-r~d~~v~~~~a~~~l~~~ 313 (324) ....+-...++.. ..+.+.. ..+. +.....|+.-.+ .+ -... ..|..+.+|.||+++++. T Consensus 313 ~~~~~~~~~e~~~~~~~s~~~p-~~f~---q~~p~~~~~l~ve~~~~~~~~~~s~~t~Gv~i~~P~ai~~~~GI 382 (382) T protein:vir:96 313 EDALVLFVEEVDASVDGSTDGG-SVFS---QLVQSKFITLGVEKRAKSYVEDFSNGTAGALCKRPWAVVRYLGI 382 (382) T ss_pred eeEEEEecchhhhhcccccccC-ccee---ccccceeeeccceeecceeEeccccceeeeEEEcchhhhhccCC Confidence 0000000000000 0000000 0000 000000000000 00 0111 256667779999999988 No 200 >protein:vir:95451 Length: 313 # NCBI annotation: hypothetical protein ORF044 # Family: family:all:11728 # MgeID: mge:1570 # MgeName: PA11 # Cross-refs: genbank:acc:YP_001294637;genbank:gi:149408203;genbank:GeneID:5237018 Probab=95.59 E-value=0.0018 Score=35.64 Aligned_cols=272 Identities=12% Similarity=0.071 Sum_probs=142.5 Q ss_pred ccccCCCcceec-hhhhHHHHHHHHhhcchhhhce-eeecCCC-ceEEEEEeCCcceeeecccccccccccceeeEEeee Q lcl|NC_011614. 29 VMMHEKKDGTLL-NDFTTPILQEVMENSKIMQLGK-YEPMEGT-EKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRA 105 (324) Q Consensus 29 ~~~~~~~g~lip-~~~~~~i~~~~~~~s~l~~l~~-~~~~~~~-~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~ 105 (324) ...+++.-++|- ++++..|...+.+.-.--...+ +.-.+.+ ...||.. +.+...-..|..+..-...+.+++++-. 
T Consensus 1 ~~~TSNT~A~I~SE~~s~~I~~~LH~~LL~~~~~R~V~DF~~G~~L~I~ti-Gs~~~~~~~E~~~~~~~~i~TGEIt~~i 79 (313) T protein:vir:95 1 MQLTSNTRAFIESEQYSKFILLNLHDGLLPETFYRNVSDFGSGETLHIKTI-GSVTLQEAEEDTPLIYNPIETGEITFQI 79 (313) T ss_pred CcccccchheehhhhHHHHHHHHhhccccchhhhhhhccCCCCCEEEeccc-CceeeeccccCCCeeecccccceEEEEE Confidence 233444455655 5556666555555444444455 4444444 3455543 3445555566666667777888999999 Q ss_pred eeEEEe-ehhHHHHHhcCh--hHHHHHHHHHHHHHHHHHHHHHHHhccCc---CcCCcccccccccc--cceeecccchh Q lcl|NC_011614. 106 FKLGVI-LPVTKEFLNYTY--SQFFEEMKPMIAEAFYKKFDEAGILNQGN---NPFGKSIAQSIEKT--NKVIKGDFTQD 177 (324) Q Consensus 106 ~k~~~~-v~iS~ell~~s~--~~~~~~v~~~l~~ai~~~~d~a~l~g~g~---~~~~~~~~~~~~~~--~~~~~~~~~~~ 177 (324) ..+.+- ..||++|-+|+. ..+......+-+++|....+.-++.-... +..+-...++.... +....+..... T Consensus 80 ~~Y~G~A~~vt~~LR~D~~~I~~~~A~~~AE~~RAI~E~~~TD~L~~G~~~FA~~~~P~~vNG~PH~~V~~~T~~~~~~~ 159 (313) T protein:vir:95 80 TEYKGDAWYVTDDLREDGTDIDRLMAERAAESTRAIQETFETDFLKTGAEYFAANPGPHNVNGFPHVIVSAETNGVFALK 159 (313) T ss_pred EeecCChhhhhhhhhhcchhHHHHhhhcchhhHHHHHHHHhhHHHhhchhhhccCCCCcccccccceEEeccCCceehhh Confidence 887655 589999999885 23444444455667777777666642111 11111222333332 22233456677 Q ss_pred HHHHHHHHhhhhccCC--CEEEEcHHHHHHHHHhhc------cCCceeeccCCC------ceecccceEeec-------- Q lcl|NC_011614. 178 NIIDLEALLEDDELEA--NAFISKTQNRSLLRKIVD------PETKERIYDRNS------DSLDGLPVVNLK-------- 235 (324) Q Consensus 178 ~i~~~~~~l~~~~~~~--~~~v~~~~~~~~L~~l~d------~~g~~~~~~~~~------~~l~G~pv~~~~-------- 235 (324) +++.+...+..+.... -.+++.|.....|..+.. .+|+-++..+.. 
..++|..+.+++ T Consensus 160 ~~~~~~~~~~~a~~P~~G~v~IvDP~~~~~L~~l~~It~~vt~~~k~I~ESG~A~~~~Fi~~~YG~Di~~SN~L~~AN~~ 239 (313) T protein:vir:95 160 HLIAMRLAFDKANVPAEGRVFIVDPVAEATLNGLVTITHDVTDFGKMILESGMARGQRFIMNLYGWDILTSNRLHVANYN 239 (313) T ss_pred HHHHhhhhhhhccCCccceEEEEcchhhhhhhhhheeecccccccceeeeccCCchhHHHHHHhhhhhhhhhhhhhcccc Confidence 8888877776655332 368999999999887642 234555443322 246676665442 Q ss_pred -CccCCCce---EEEe--ec-ccEEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccc-e Q lcl|NC_011614. 236 -SSNLKRGE---LITG--DF-DKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKA-F 307 (324) Q Consensus 236 -~~~~~~~~---i~~g--d~-~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a-~ 307 (324) +...+.+. ++.+ |. -.-+++-|+.+-- +.+...++-..+....| +|+|+.+.+-+- . T Consensus 240 D~~tT~~G~~~NlFM~i~D~~~~P~~~AWr~MP~-------------s~~~~~~~~~~~~~~~~--~R~G~Gi~R~~~L~ 304 (313) T protein:vir:95 240 DGTTTGNGYVGNLFMCILDDQTKPIMGAWRRMPK-------------SEGERNKDRARDEHVVR--CRYGFGIQRLDTLG 304 (313) T ss_pred ccccccCceeeeeeeeeecccccceeeeeccccc-------------cccccccccccccceee--eeecccceeeccee Confidence 11111111 1111 10 0113333433211 11111112234445454 566777766554 4 Q ss_pred EEEEeeccC Q lcl|NC_011614. 308 AKLVPADAK 316 (324) Q Consensus 308 ~~l~~~~~~ 316 (324) ++++.+++. T Consensus 305 ~~~~~A~~~ 313 (313) T protein:vir:95 305 LLATSATAY 313 (313) T ss_pred EEEeccccC Confidence 555777777 No 201 >protein:vir:861 Length: 318 # NCBI annotation: putative minor structural protein # Family: family:all:2417 # MgeID: mge:18 # MgeName: bIL170 # Cross-refs: genbank:acc:NP_047120;genbank:gi:9630573;genbank:GeneID:1261764 Probab=95.31 E-value=0.00064 Score=38.10 Aligned_cols=286 Identities=17% Similarity=0.192 Sum_probs=141.1 Q ss_pred CchhhHHHHHHHHHhhccchhhhhcc-------ccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCceEE Q lcl|NC_011614. 
1 MEQTQKLKLNLQHFASNNVKPQVFNP-------DNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKF 73 (324) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~a-------~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~i 73 (324) ++.+|..-.+.+-.+.+.-.+...++ .+++.+ +....+|..+.-.|-..+..+.++++.+-+.+.+.-..+. T Consensus 5 iesq~A~~eF~~vL~~N~G~S~~k~AW~A~L~E~GVtiT-D~~~~LP~~lv~sI~~A~~n~n~v~~vfHVT~~~~~~V~~ 83 (318) T protein:vir:86 5 IESQNAVTEFFDVLKKNSGKSEIKNAWNAKLAENGVTIT-DTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSR 83 (318) T ss_pred hhhhHHHHHHHHHHhccCCchhhhhhhhhhhhhcCceee-ccchhccHHHHHHHHHhhhccCcceeeeeeccchhhhhhh Confidence 56677777777766666555533332 122222 3345689999888888899999999876666655443322 Q ss_pred EEEeCCcceeeecccccccccccceeeEEeeeeeEEEeehhHHHHHh---cChhHHHHHHHHHHHHHHH-HHHHHHHHhc Q lcl|NC_011614. 74 TFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLN---YTYSQFFEEMKPMIAEAFY-KKFDEAGILN 149 (324) Q Consensus 74 p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~v~iS~ell~---~s~~~~~~~v~~~l~~ai~-~~~d~a~l~g 149 (324) . +.+...|...-.|..+.+...+|..-++.+.-+.....+ -|+.+ .+...+..+|+.+|+.++. +..|.+++-| T Consensus 84 s-~~s~AeAq~HkdGqTK~eqa~~~~~~Tl~~~~VY~~~S~-Ae~~K~~~~sYsel~N~i~~ELtQ~~vnk~Vd~AlV~G 161 (318) T protein:vir:86 84 S-FDSSAEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSL-AERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEG 161 (318) T ss_pred h-hhhhhhhhhhccCCccccceeeeeeechhHHHHHHHHHH-HHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHhhheee Confidence 2 223356777788999999888888877776554444444 33333 4455679999999999999 8899999999 Q ss_pred cCcCcCCccc-cc---cccc-c-cceeecccchhHHH-HHHHHh-hhhccCCCEEEEcHHHHH-HHHHhhccCCc--eee Q lcl|NC_011614. 150 QGNNPFGKSI-AQ---SIEK-T-NKVIKGDFTQDNII-DLEALL-EDDELEANAFISKTQNRS-LLRKIVDPETK--ERI 218 (324) Q Consensus 150 ~g~~~~~~~~-~~---~~~~-~-~~~~~~~~~~~~i~-~~~~~l-~~~~~~~~~~v~~~~~~~-~L~~l~d~~g~--~~~ 218 (324) +|++.....- .. .+.+ + ....+++..+.+.+ .+..-+ +.++ .-.+++...+.. 
.|..++.+..+ ..+ T Consensus 162 DG~N~f~~~DK~advK~I~k~Ttkaksagttpfanaieeavdfvrptag--rrylivkaedrkalldelrqatanahvri 239 (318) T protein:vir:86 162 DGSNGFKSIDKEADVKKIKKITTKAKSAGTTPFANAIEEAVDFVRPTAG--RRYLIVKAEDRKALLDELRQATANAHVRI 239 (318) T ss_pred cCCCCccchhhHHHHHHHHHHhhhhhccCCCchhhHHHHHHhhhccCCC--ceEEEEeecchHHHHHHHHhhcccceeEE Confidence 9987522110 00 0111 1 11122333332222 221111 1112 123556555544 34455544332 221 Q ss_pred c-cCC-Cceecccc-e-EeecCccCCCceEEEeecccEEEEEecceEEEEeecccccccccccccchhh--hhcCcEEEE Q lcl|NC_011614. 219 Y-DRN-SDSLDGLP-V-VNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNL--FEQDMVALR 292 (324) Q Consensus 219 ~-~~~-~~~l~G~p-v-~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~--f~~~~v~~r 292 (324) . +.. ..+--|.. + +.+.+-.+ +..+ +.| +...+++.+- +-++. |.+|+--+. T Consensus 240 knddteiasevgvdeiivytgskal-kptv-lvd---------qkyhidmqdl-----------tkvdafewktnsnmil 297 (318) T protein:vir:86 240 KNDDTEIASEVGVDEIIVYTGSKAL-KPTV-LVD---------QKYHIDMQDL-----------TKVDAFEWKTNSNMIL 297 (318) T ss_pred eccchhhhhhcCcceeeeeeccccc-ccee-eec---------cceecchhhh-----------hhhhcceeccCCceEE Confidence 1 110 00111211 1 11111111 1111 111 1111111111 11111 344544455 Q ss_pred EEEEeccEEecccceEEEEee Q lcl|NC_011614. 293 ATMHVALHIADDKAFAKLVPA 313 (324) Q Consensus 293 ~~~r~d~~v~~~~a~~~l~~~ 313 (324) ++....+.+-.-+|=++++.. T Consensus 298 vetltsghvetynagavitvs 318 (318) T protein:vir:86 298 VETLTSGHVETYNAGAVITVS 318 (318) T ss_pred EeecccCcceeecCceeEEeC Confidence 555555544443443444432 No 202 >protein:vir:99576 Length: 388 # NCBI annotation: hypothetical protein # Family: family:all:1653 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039801;genbank:gi:126011051;genbank:GeneID:4818271 Probab=95.15 E-value=0.0016 Score=35.89 Aligned_cols=298 Identities=10% Similarity=-0.055 Sum_probs=128.9 Q ss_pred CchhhHHHHHHH--HHhhccchh------------hhhccccccccCCCcceechhhhHHH----HHHHHhhcchhhhce Q lcl|NC_011614. 
1 MEQTQKLKLNLQ--HFASNNVKP------------QVFNPDNVMMHEKKDGTLLNDFTTPI----LQEVMENSKIMQLGK 62 (324) Q Consensus 1 m~~~~~~~~~~~--~~~~~~~~~------------~~~~a~~~~~~~~~g~lip~~~~~~i----~~~~~~~s~l~~l~~ 62 (324) |...+..+.+.. ++.+..... ..+.++.....+.++.=||-.+.+.| ++-+..--....++. T Consensus 30 ~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~a~da~~~~~~t~~~~gip~~~~~~~~p~~~~~~~~p~~~~~l~p 109 (388) T protein:vir:99 30 LTDMAVRELKKFGLVFDHATVKRQIELLHEGGVATQAFDSAYVAPTTQASIPTPIQFLQQWLPGFVKVLTSARKIDEILG 109 (388) T ss_pred eechhhHhhhhcceeccCccchhhhhhhhhhhhhhcccCcccccccccCcccHHHHHhhhhccceeeeeechhhhhhhcc Confidence 322222111111 222211110 11111111111122222566665533 343444444444555 Q ss_pred eeecCCC---ceEEEEEeCCcceeeecccccccccccceeeEEeeeeeEEEeehhHH-HHHhc--ChhHHHHHHHHHHHH Q lcl|NC_011614. 63 YEPMEGT---EKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTK-EFLNY--TYSQFFEEMKPMIAE 136 (324) Q Consensus 63 ~~~~~~~---~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~v~iS~-ell~~--s~~~~~~~v~~~l~~ 136 (324) +.+.+.. ...+++.+....|.+++.++..|..+...+...-..+.+.....++. |+-.. ...++...-.....+ T Consensus 110 v~t~g~W~~~~~~f~v~e~~G~A~~ygd~~D~Pl~d~~~~~~~r~v~~~~~g~~yg~~El~~A~~~g~~l~~~Ka~AA~~ 189 (388) T protein:vir:99 110 VKTVGSWEDQEIVQGIVEPAGTAMEYGDLTNIPLSSWNVNFERRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAV 189 (388) T ss_pred ccccCCccceeEEEeeeecceeEEEeecccCCCceeccceeeeeeEEEEEeeeeecHHHHHHHHhhCCCcHHHHHHHHHH Confidence 5444332 44566767677888999988888777665555555555555555554 33332 236788888888999 Q ss_pred HHHHHHHHHHHhccCcC-c-CCcccccc--ccc---cccee--------ecccchhHHHHHHHHhhhhcc-------CCC Q lcl|NC_011614. 137 AFYKKFDEAGILNQGNN-P-FGKSIAQS--IEK---TNKVI--------KGDFTQDNIIDLEALLEDDEL-------EAN 194 (324) Q Consensus 137 ai~~~~d~a~l~g~g~~-~-~~~~~~~~--~~~---~~~~~--------~~~~~~~~i~~~~~~l~~~~~-------~~~ 194 (324) ++.+.+++-.|.|.... . ...|+++. +.. ..+.. +..--++|+..++.++...-. .+. 
T Consensus 190 ale~~~N~i~f~G~~g~~~~~~yGllNdP~l~a~v~at~~~~~~~Wa~kT~~eI~~Di~~~~~~i~~qs~g~~~~~~~~~ 269 (388) T protein:vir:99 190 QLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLPAIASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDI 269 (388) T ss_pred HHHhhhceEEEEeecCCCccceEEEeeCCCcccccccccCCcCcccccCCHHHHHHHHHHHHHHHHHhcCCeeeecccce Confidence 99999999999884321 1 12222221 111 00000 111125667778777743221 123 Q ss_pred EEEEcHHHHHHHHHhhccCCceeec--cCCCceecccceEee---c-Cc-cCCCceEEE-eec-ccEEEEEe-cceE--E Q lcl|NC_011614. 195 AFISKTQNRSLLRKIVDPETKERIY--DRNSDSLDGLPVVNL---K-SS-NLKRGELIT-GDF-DKLIYGIP-QLIE--Y 262 (324) Q Consensus 195 ~~v~~~~~~~~L~~l~d~~g~~~~~--~~~~~~l~G~pv~~~---~-~~-~~~~~~i~~-gd~-~~~~~~~~-~~~~--i 262 (324) .+++.|..+..|... +..|-.++. ..+ +-++-++.. . +. ..+...+++ .+. .....+.. +... . T Consensus 270 tL~LP~~~~~~Ls~~-n~~g~Tvl~~lk~n---~Pnl~i~t~pEl~~a~~tgg~~~~~~~~~~~~~~~~~~~~~~~t~~~ 345 (388) T protein:vir:99 270 TLVLPMNKVDMLSVV-TDLGISVRDWLKQT---YPRVRVMSAPELQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQ 345 (388) T ss_pred EEEechHHHHhcccc-CcCCccHHHHHHHh---cCCcEEEEecccccccccCCceeEEEEecccccccccCccCcceeEE Confidence 688999988888533 222221211 111 111111111 0 10 011111111 110 00000000 0000 0 Q ss_pred EEeecccccccccccccchhhhhcCc--EEEEEEEE-eccEEecccceEEEEee Q lcl|NC_011614. 263 KIDETAQLSTVKNEDGTPVNLFEQDM--VALRATMH-VALHIADDKAFAKLVPA 313 (324) Q Consensus 263 ~~~~~~~~~~~~~~~~~~~~~f~~~~--v~~r~~~r-~d~~v~~~~a~~~l~~~ 313 (324) .+........ -+... ...-...| .|..+.+|.||+++++. 
T Consensus 346 ~~p~~~~~l~-----------vq~~~~~~~~~~~~rt~Gv~ir~P~Ai~~~~GI 388 (388) T protein:vir:99 346 LVQSKFVTLG-----------VEKRVKNYVEAYSNATAGVMLKRPWAVVRLIGL 388 (388) T ss_pred eccccccccc-----------ceecCceeEeccccceeeeEEeccchhheeccC Confidence 0000000000 01111 11112233 45566679999999998 No 203 >protein:vir:1829 Length: 355 # NCBI annotation: major capsid protein # Family: family:all:201 # MgeID: mge:324 # MgeName: 186 # Cross-refs: genbank:acc:NP_052253;genbank:gi:9634060;genbank:GeneID:1262428 Probab=95.09 E-value=0.0028 Score=34.60 Aligned_cols=297 Identities=14% Similarity=0.132 Sum_probs=159.3 Q ss_pred Cchhh--HHHHHHHHHhhccchhhhhccccccc-cCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCce-EEEEE Q lcl|NC_011614. 1 MEQTQ--KLKLNLQHFASNNVKPQVFNPDNVMM-HEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEK-KFTFW 76 (324) Q Consensus 1 m~~~~--~~~~~~~~~~~~~~~~~~~~a~~~~~-~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~-~ip~~ 76 (324) |.+.- ++...+.+.+... ++.. ..+..+.|-|.+.+.+...+.+.+-++++.+.+++..-.. .+-.- T Consensus 1 M~~~tr~~~~~y~~~~A~~n---------gv~~~~~~~~Fsv~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lg 71 (355) T protein:vir:18 1 MRQETRFKFNAYLTQLAKLN---------GISVDDVSKKFTVEPSVTQTLMNTVQASSAFLQMINILPVAEMKGEKIGVG 71 (355) T ss_pred CChHHHHHHHHHHHHHHHHh---------CCChhHccceeccCHHHHHHHHHHHHHHHHHhhcCceeccccceeeEEeec Confidence 76633 3333344443321 1111 1123566778889999999999999999999988875443 23333 Q ss_pred eCCcceeeec--cc-ccccccccceeeEEeeeeeEEEeehhHHHHHhc--ChhHHHHHHHHHHHHHHHHHHHHHHHhccC Q lcl|NC_011614. 77 ADKPGAYWVG--EG-QKIETSKATWVNATMRAFKLGVILPVTKEFLNY--TYSQFFEEMKPMIAEAFYKKFDEAGILNQG 151 (324) Q Consensus 77 ~~~~~a~~v~--Eg-~~~~~~~~~~~~v~~~~~k~~~~v~iS~ell~~--s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g 151 (324) .+++-+.-+. .+ ...|.....++.-.+..++.-.-..|+.+.++. ..++|...+.+.+.+.++..+=.-.|+|+. 
T Consensus 72 v~g~iagrtdT~~~~~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~k~~ALD~i~IGfNG~s 151 (355) T protein:vir:18 72 VTGTIASTTDTSGDKERQTADFTALESNKYECNQINFDFHLTYKRLDLWARFQDFQRRIRDAIVQRQALDFIMAGFNGTT 151 (355) T ss_pred cCcceeeccccCCCCCcccccccccCCCccEEEEeeeeeeecHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhccccee Confidence 4444444332 11 223334445666667676666666666666653 236899999999999888887778888854 Q ss_pred ----c----CcCC----ccccccccc-----------------ccc-e-eecccchhHH----HHHHHH-hhhhccCC-- Q lcl|NC_011614. 152 ----N----NPFG----KSIAQSIEK-----------------TNK-V-IKGDFTQDNI----IDLEAL-LEDDELEA-- 193 (324) Q Consensus 152 ----~----~~~~----~~~~~~~~~-----------------~~~-~-~~~~~~~~~i----~~~~~~-l~~~~~~~-- 193 (324) + ++.+ .|.+...-. +.. . ....-+|..| .++... ++..+++. T Consensus 152 ~A~~Td~~~nPllqDVNkGWlQ~~Re~ap~rV~~~~~~~~~~~~~~~i~~G~~gdy~NLDAlV~d~~~~lI~~~~~~d~d 231 (355) T protein:vir:18 152 RADTSDRVKNPMLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENLDALVMDGTNTLIDEIYQDDPK 231 (355) T ss_pred eeccCChhhCcCccccchhHHHHHHhcchhhhhccccccccccccceeeecCCCCcccHHHHHHHHHhccCChHHhcCCC Confidence 1 1111 111111000 000 0 0111233332 345543 45555544 Q ss_pred CEEEEcHHHHH-HHHHhhccCCcee--ec-cCC--CceecccceEeecCccCCCceEEEeecccEEEEEecc-eEEEEee Q lcl|NC_011614. 194 NAFISKTQNRS-LLRKIVDPETKER--IY-DRN--SDSLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQL-IEYKIDE 266 (324) Q Consensus 194 ~~~v~~~~~~~-~L~~l~d~~g~~~--~~-~~~--~~~l~G~pv~~~~~~~~~~~~i~~gd~~~~~~~~~~~-~~i~~~~ 266 (324) -+.+|...... +-..+-...+.|- .. +.. 
..++.|+|.+..+ ..|...++...++++-+-...+ .+=.+.+ T Consensus 232 LVvivG~dLla~k~~~l~n~~~~ptE~~Aa~~i~s~k~iGGlpa~~~P--ffP~~~~lVT~L~NLsIY~Q~gs~RR~~~d 309 (355) T protein:vir:18 232 LVAIVGRKLLADKYFPLVNKQQENTESLAADIIISQKRIGNLPAVRVP--YFPANAVFVTTLENLSIYFMDESHRRSIDE 309 (355) T ss_pred EEEEEchhhhHHHHhHHhhccCChHHHHHHHHHHHHHhhCCceeEEcc--ccCCCceEEeeccccEEEEecCcEEEEEEe Confidence 36778777543 2223322322222 11 111 3579999998755 4556678888888875544333 3323322 Q ss_pred cccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeeccCCCCccccC Q lcl|NC_011614. 267 TAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADAKPSSVPGEV 324 (324) Q Consensus 267 ~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~~~~~ 324 (324) ....+ ...++.. ..-|+.|-+..++|.++...-.....|++. T Consensus 310 ~p~r~-------rie~y~s---------~Ne~YvVEd~~~~a~ieni~~~~~~~~~~~ 351 (355) T protein:vir:18 310 NPKKD-------RVENYES---------MNIDYVVEAYAAGCLLENITLGDFTAPAAP 351 (355) T ss_pred ccccc-------cccchhh---------hcceeeeeccccEEEEeeeeecCCCCcccc Confidence 22111 1111222 234677788888888776666555555555 No 204 >protein:vir:106734 Length: 336 # NCBI annotation: gp13 # Family: family:all:1653 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944321;genbank:gi:38638620;genbank:GeneID:2657363 Probab=94.85 E-value=0.0018 Score=35.64 Aligned_cols=294 Identities=11% Similarity=0.037 Sum_probs=138.5 Q ss_pred CchhhHHHHHHH---HHhhc--cchhh--hhccc--cccc-cCCCc-ceechhhhH----HHHHHHHhhcchhhhceeee Q lcl|NC_011614. 1 MEQTQKLKLNLQ---HFASN--NVKPQ--VFNPD--NVMM-HEKKD-GTLLNDFTT----PILQEVMENSKIMQLGKYEP 65 (324) Q Consensus 1 m~~~~~~~~~~~---~~~~~--~~~~~--~~~a~--~~~~-~~~~g-~lip~~~~~----~i~~~~~~~s~l~~l~~~~~ 65 (324) |+.-+.++.-.+ +|... .+..+ .+..+ .... 
-++++ .-||..+.+ .+++.+.....+..+..+.+ T Consensus 1 ~~~~~~~~~l~~~gi~~~~~~~~~~~~~~~~a~da~d~~~~~~t~~~~g~~~~l~~~i~p~~~~~~~~~~~~~~l~~v~t 80 (336) T protein:vir:10 1 MRDAQRIQNLARAGVILPRSVKNVSTPLAEYAMDAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELVGESK 80 (336) T ss_pred CchHHHHHHHhccCeecchhhhhhhHHHHHHHHhhhhhccccccCCCcchHHHHHhhcCcceeeeeechhchhhhccccc Confidence 776666544333 12111 11111 11111 1110 01111 113433332 33344444444444555444 Q ss_pred cCC---CceEEEEEeCCcceeeecccccccccccceeeEEeeeeeEEEeehhHHHHHhcC---hhHHHHHHHHHHHHHHH Q lcl|NC_011614. 66 MEG---TEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYT---YSQFFEEMKPMIAEAFY 139 (324) Q Consensus 66 ~~~---~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~v~iS~ell~~s---~~~~~~~v~~~l~~ai~ 139 (324) .+. ....+++.+....+.+.+.....|..+...+...-+.+.++..+.++.+=+..+ ..++.+.-....++++. T Consensus 81 ~g~w~~~~~~~~~~e~~G~a~~ygd~~d~P~~d~~~~~~~~~v~~~~~g~~yg~~El~~A~~~g~~l~~~Ka~aA~~ale 160 (336) T protein:vir:10 81 KGDWTTLVAAFITAEPTTKVATYGDYSSDGDSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLA 160 (336) T ss_pred CCCcceeeEEEEeeeeeeeEEEccccCCCcceeeeeeeeeeeEEEEEEEEeeCHHHHHHHHHhCCCcHHHHHHHHHHHHH Confidence 332 223456666666778888888999988887777777888888888885333333 35788888888889999 Q ss_pred HHHHHHHHhccCcC-cCCcccccccccccce-------eecccchhHHHHHHHHhhhhc------cCCCEEEEcHHHHHH Q lcl|NC_011614. 140 KKFDEAGILNQGNN-PFGKSIAQSIEKTNKV-------IKGDFTQDNIIDLEALLEDDE------LEANAFISKTQNRSL 205 (324) Q Consensus 140 ~~~d~a~l~g~g~~-~~~~~~~~~~~~~~~~-------~~~~~~~~~i~~~~~~l~~~~------~~~~~~v~~~~~~~~ 205 (324) +.+++-.+.|+... ..|......+...... .+..--++|+..++.++...- ..+..++++|..+.. 
T Consensus 161 ~~~N~~~~~Gd~~~~~~GllN~P~l~a~~t~~~~~w~~~T~~eI~~Di~~~~~~l~~qt~g~i~~~~~~tL~Lp~~~~~~ 240 (336) T protein:vir:10 161 KFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSD 240 (336) T ss_pred HhhCeEEEEeecccceEEEeecCCCCcccccCcCcccccCHHHHHHHHHHHHHHHHHhcCCeeeeccceEEEechHHHHh Confidence 99998888887643 1111111122111111 111223567777888774432 124579999999999 Q ss_pred HHHhhccCCceeec--cCCCceecccceEeecC-ccCCCceEEEeecccEEEEEec---ceEEEEeeccccccccccccc Q lcl|NC_011614. 206 LRKIVDPETKERIY--DRNSDSLDGLPVVNLKS-SNLKRGELITGDFDKLIYGIPQ---LIEYKIDETAQLSTVKNEDGT 279 (324) Q Consensus 206 L~~l~d~~g~~~~~--~~~~~~l~G~pv~~~~~-~~~~~~~i~~gd~~~~~~~~~~---~~~i~~~~~~~~~~~~~~~~~ 279 (324) |... +..|-.++. ..+ +-++-++..+. ...+ |+...++.-... -.++.+...-..+... . T Consensus 241 L~~~-n~~g~tv~~~lk~n---~Pnl~i~t~pel~~Ag------g~~~~~~~~~~~~~~t~~~~~P~~f~~lpvq----~ 306 (336) T protein:vir:10 241 LSKT-NQYGLSAAAKLKEI---FPKLEFVTIPEYDTAS------GRLVQLWAPRVEGKDTATCGFTEKMRAHSIE----R 306 (336) T ss_pred ccCC-CccCccHHHHHHHh---CCccEEEEcccccccC------CceEEEEEecccCCcceeeecChhhhcccee----e Confidence 8643 222322211 111 00111211110 0111 111111111111 1122221111111100 0 Q ss_pred chhhhhcCcEEEEEEEEecc-EEecccceEEEEee Q lcl|NC_011614. 280 PVNLFEQDMVALRATMHVAL-HIADDKAFAKLVPA 313 (324) Q Consensus 280 ~~~~f~~~~v~~r~~~r~d~-~v~~~~a~~~l~~~ 313 (324) ..-....-+..|+++ .+.+|-||+++++. T Consensus 307 -----~~~~~~v~~~~rt~Gv~i~rP~ai~~~~GI 336 (336) T protein:vir:10 307 -----YSSYFRQKKSAGTWGAVIFRPFAVAQMLGV 336 (336) T ss_pred -----cCceeEeccccceeeeeeeccchheeeccC Confidence 011122233445544 45569999999998 No 205 >protein:vir:1663 Length: 393 # NCBI annotation: unknown # Family: family:all:2417 # MgeID: mge:34 # MgeName: sk1 # Cross-refs: genbank:acc:NP_044952;genbank:gi:9629659;genbank:GeneID:1261309 Probab=94.74 E-value=0.00088 Score=37.34 Aligned_cols=286 Identities=16% Similarity=0.180 Sum_probs=138.8 Q ss_pred CchhhHHHHHHHHHhhccchhhhhcc-------ccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCceEE Q lcl|NC_011614. 
1 MEQTQKLKLNLQHFASNNVKPQVFNP-------DNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKF 73 (324) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~a-------~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~i 73 (324) ++.+|..-.+.+-.+.+.-.+...++ .+++.+ +....+|..+.-.|-..+..+.++++.+-+...+.-..+. T Consensus 80 iesq~A~~eF~~vL~~N~G~S~~k~AW~A~L~E~GVtiT-D~~~~LP~~lv~sI~~A~~n~n~v~~vfHVT~~~~~~V~~ 158 (393) T protein:vir:16 80 IESQNAVTEFFDVLKKNSGKSEIKNAWSAKLAENGVTIT-DTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSR 158 (393) T ss_pred HhhHHHHHHHHHHHhccCCchhhhhhhhhhHhhcCccee-ccchhccHHHHHHHHHhhhccCcceeeeeeccchhhhHHh Confidence 35555555555555555444432221 122222 3346689999888888899999998876665555433221 Q ss_pred EEEeCCcceeeecccccccccccceeeEEeeeeeEEEeehhHHHHHh---cChhHHHHHHHHHHHHHHH-HHHHHHHHhc Q lcl|NC_011614. 74 TFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLN---YTYSQFFEEMKPMIAEAFY-KKFDEAGILN 149 (324) Q Consensus 74 p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~v~iS~ell~---~s~~~~~~~v~~~l~~ai~-~~~d~a~l~g 149 (324) . +.+...|...-.|..+.+...+|..-++.+.-+.....+ -++.+ .+...+..+|+..|+.++. +..|.+++-| T Consensus 159 s-~~s~~eAq~HkdGqTK~eqa~~~~~~Tl~~~~VY~~~S~-Ae~~K~~~~sYsel~N~i~~ELtQ~~vnk~Vd~AlV~G 236 (393) T protein:vir:16 159 S-FDSANEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSL-AERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEG 236 (393) T ss_pred h-hhhhhhhhhhccCCccccceeeeeeechhHHHHHHHHHH-HHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHhhhhee Confidence 1 223346777788999998888888777776554444444 23333 4455679999999999999 8899999999 Q ss_pred cCcCcCCccc-ccc---ccc--ccceeecccchhH-HHHHHHHh-hhhccCCCEEEEcHHHHHH-HHHhhccCC--ceee Q lcl|NC_011614. 150 QGNNPFGKSI-AQS---IEK--TNKVIKGDFTQDN-IIDLEALL-EDDELEANAFISKTQNRSL-LRKIVDPET--KERI 218 (324) Q Consensus 150 ~g~~~~~~~~-~~~---~~~--~~~~~~~~~~~~~-i~~~~~~l-~~~~~~~~~~v~~~~~~~~-L~~l~d~~g--~~~~ 218 (324) +|++.....- ... +.. +....++...+.+ +..+..-+ +.++ .-.+++...+..+ |..++.+.. 
+..+ T Consensus 237 DG~N~f~~~DK~advK~I~k~Ttkaksagktpfadaieeavdfvrptag--rrylivktedrkalldelrqatananvri 314 (393) T protein:vir:16 237 DGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPTAG--RRYLIVKTEDRKALLDELRQATANANVRI 314 (393) T ss_pred cCCCCccchhhHHHHHHHHHHhhhhhhcCCCchhHHHHHHHhhhccCCC--ceEEEEeccchHHHHHHHHhhhccCceee Confidence 9987522110 000 011 1111233333333 33332222 1122 1235665555443 445554332 2222 Q ss_pred cc-CC-Cceecccc-e-EeecCccCCCceEEEeecccEEEEEecceEEEEeecccccccccccccchhh--hhcCcEEEE Q lcl|NC_011614. 219 YD-RN-SDSLDGLP-V-VNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNL--FEQDMVALR 292 (324) Q Consensus 219 ~~-~~-~~~l~G~p-v-~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~--f~~~~v~~r 292 (324) .. .. ..+--|.. + +++.+-.+ +..+ +.| +...+++.+- +-++. |.+|+--+. T Consensus 315 knddteiasevgvdeiivytgskal-kptv-lvd---------qkyhidmqdl-----------tkvdafewktnsnmil 372 (393) T protein:vir:16 315 KNDDTEIASEVGVDEIIVYTGSKAL-KPTV-LVD---------QKYHIDMQDL-----------TKVDAFEWKTNSNMIL 372 (393) T ss_pred eccchhhhhhcCcceeeeeeccccc-ccee-eec---------cccccchhhh-----------hhhhhheeccCCceEE Confidence 11 10 00111211 1 11111111 1111 111 1111111111 11112 344544455 Q ss_pred EEEEeccEEecccceEEEEee Q lcl|NC_011614. 293 ATMHVALHIADDKAFAKLVPA 313 (324) Q Consensus 293 ~~~r~d~~v~~~~a~~~l~~~ 313 (324) ++....+.|-.-+|=++++.. T Consensus 373 vetltsghvetynagavitvs 393 (393) T protein:vir:16 373 VETLTSGHVETYNAGAVITVS 393 (393) T ss_pred EeecccCcceeeccceeEeeC Confidence 555555544443443444432 No 206 >protein:vir:98566 Length: 355 # NCBI annotation: gp5 # Family: family:all:201 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958060;genbank:gi:41057357;genbank:GeneID:2744237 Probab=94.35 E-value=0.0047 Score=33.36 Aligned_cols=299 Identities=12% Similarity=0.106 Sum_probs=156.0 Q ss_pred CchhhHHHHHHHHHhhccchhhhhccccccc-cCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCce-EEEEEeC Q lcl|NC_011614. 
1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMM-HEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEK-KFTFWAD 78 (324) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~-~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~-~ip~~~~ 78 (324) |...-. ....+|..... + ..++.. ..+..+.|-|.+.+.+...+.+.+-++++.+.+++..-.. .+-.-.+ T Consensus 1 M~~~tr--~~~~~y~~~~A---~--~ngv~~~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lgv~ 73 (355) T protein:vir:98 1 MRPETR--FKFNAYLTRVA---E--LNNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGVT 73 (355) T ss_pred CChHHH--HHHHHHHHHHH---H--HhCCChhHccceeecCHHHHHHHHHHHHHHHHHhhcCceeccccceeeEeeeccC Confidence 654432 22223322211 1 111111 1123466788888899999999999999999988875433 2333334 Q ss_pred Ccceeeec--cc-ccccccccceeeEEeeeeeEEEeehhHHHHHhc--ChhHHHHHHHHHHHHHHHHHHHHHHHhccC-- Q lcl|NC_011614. 79 KPGAYWVG--EG-QKIETSKATWVNATMRAFKLGVILPVTKEFLNY--TYSQFFEEMKPMIAEAFYKKFDEAGILNQG-- 151 (324) Q Consensus 79 ~~~a~~v~--Eg-~~~~~~~~~~~~v~~~~~k~~~~v~iS~ell~~--s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g-- 151 (324) ++-+.-+. .+ ...|.....++.-.+..++.-.-..|+.+.|+. ..++|...+.+.+.+.++..+=.-.|+|+. T Consensus 74 g~iagrtdT~~~~~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~k~~ALD~i~IGfNG~s~A 153 (355) T protein:vir:98 74 GTIASTTDTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTTRA 153 (355) T ss_pred ccccccccCCCCCCcccccccccCCCccEEEEeeeeeeecHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhcccceeee Confidence 44444331 11 222333444666666666666666666666652 236899999999999888887778888854 Q ss_pred --c----CcCC----ccccccccc-----------------ccc--eeecccchhHH----HHHHHH-hhhhccCC--CE Q lcl|NC_011614. 152 --N----NPFG----KSIAQSIEK-----------------TNK--VIKGDFTQDNI----IDLEAL-LEDDELEA--NA 195 (324) Q Consensus 152 --~----~~~~----~~~~~~~~~-----------------~~~--~~~~~~~~~~i----~~~~~~-l~~~~~~~--~~ 195 (324) + ++.+ .|.+...-. +.. .....-+|..| .++... ++..+++. 
-+ T Consensus 154 ~~Td~~~nPllqDVNkGWlQ~~Re~ap~~v~~~~~~~~~~~~~~~i~~G~~gdy~NLDAlV~D~~~~lI~~~~~~d~dLV 233 (355) T protein:vir:98 154 DTSDRTKNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDPNLV 233 (355) T ss_pred ccCChhhCcCccccchhHHHHHHhcchhhhhhhhcccCccccccceeeCCCCCcccHHHHHHHHHhccCChHHhcCCCEE Confidence 1 1111 111111000 000 00111223332 345554 45555443 36 Q ss_pred EEEcHHHHH-HHHHhhccCCcee--e-ccC--CCceecccceEeecCccCCCceEEEeecccEEEEEecc-eEEEEeecc Q lcl|NC_011614. 196 FISKTQNRS-LLRKIVDPETKER--I-YDR--NSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQL-IEYKIDETA 268 (324) Q Consensus 196 ~v~~~~~~~-~L~~l~d~~g~~~--~-~~~--~~~~l~G~pv~~~~~~~~~~~~i~~gd~~~~~~~~~~~-~~i~~~~~~ 268 (324) .+|...... +-..+-.....|- + .+. ...++.|+|.+..+ ..|...++...++++-+-...+ .+=.+.+.. T Consensus 234 vivG~dLla~k~~~l~n~~~~ptE~~Aa~~i~s~k~iGGlpa~~~P--ffP~~~~lVT~L~NLsIY~Q~gs~RR~~~d~p 311 (355) T protein:vir:98 234 AIVGRKLLADKYFPLVNKQQENSESLAADIIISQKRIGNLPAVRVP--YFPANAVLVTTLENLSIYFMDESHRRSIDENP 311 (355) T ss_pred EEEchhhhHHHhhhHhhccCCcHHHHHHHHHHHhhhhCCceeEEcc--ccCCCceEEeeccccEEEEecCcEEEEEEecc Confidence 778877543 2223322222221 0 111 13579999998755 4556678888888875544333 332332222 Q ss_pred cccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeeccCCCCccccC Q lcl|NC_011614. 269 QLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADAKPSSVPGEV 324 (324) Q Consensus 269 ~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~~~~~ 324 (324) ..+ ...++.. 
..-|+.|-+..++|.++...-.....|++- T Consensus 312 ~r~-------rie~y~s---------~Ne~YvVEd~~~~a~ienI~~~~~~~~~~~ 351 (355) T protein:vir:98 312 KKD-------RVENYES---------MNIDYVVEVYAAGCLLENITLGDFTAPAAP 351 (355) T ss_pred ccc-------cccchhh---------hcceeeeeccccEEEeeceeeeCCCCCccc Confidence 111 1111222 234667778888888776555555555554 No 207 >protein:vir:93966 Length: 400 # NCBI annotation: structural protein # Family: family:all:2417 # MgeID: mge:1487 # MgeName: jj50 # Cross-refs: genbank:acc:YP_764320;genbank:gi:115315634;genbank:GeneID:5176553 Probab=93.99 E-value=0.0015 Score=36.04 Aligned_cols=286 Identities=16% Similarity=0.180 Sum_probs=139.0 Q ss_pred CchhhHHHHHHHHHhhccchhhhhcc-------ccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCceEE Q lcl|NC_011614. 1 MEQTQKLKLNLQHFASNNVKPQVFNP-------DNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKF 73 (324) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~a-------~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~i 73 (324) ++.+|..-.+.+-.+.+.-.+...++ .+++.+ +....+|..+.-.|-..+..+.++++.+-+.+.+.-..+. T Consensus 87 i~sq~A~~eF~~vL~~N~G~S~~k~AW~A~L~E~GVtiT-D~~~~LP~~lv~sI~~A~~n~n~v~~vfHVT~~~~~~V~~ 165 (400) T protein:vir:93 87 IESQNAVTEFFDVLKKNSGKSEIKNAWSAKLAENGVTIT-DTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSR 165 (400) T ss_pred HhhHHHHHHHHHHHhccCCchhhhhhhhhhHhhcCccee-ccchhccHHHHHHHHHhhhccCcceeeeeeccchhhhHHh Confidence 35555555555555555444432221 222222 3346689999888888899999998876665555433221 Q ss_pred EEEeCCcceeeecccccccccccceeeEEeeeeeEEEeehhHHHHHh---cChhHHHHHHHHHHHHHHH-HHHHHHHHhc Q lcl|NC_011614. 74 TFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLN---YTYSQFFEEMKPMIAEAFY-KKFDEAGILN 149 (324) Q Consensus 74 p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~v~iS~ell~---~s~~~~~~~v~~~l~~ai~-~~~d~a~l~g 149 (324) . +.+...|...-.|..+.+...+|.--++.+.-+.....+ -++.+ ++...+..+|+..|+.++. 
+..|.+++-| T Consensus 166 s-~~s~~~Aq~HkdGqTK~eqa~~~~~~Tl~~~~VY~~~S~-Ae~~K~~~~sYsel~N~i~~ELtQ~~vnk~Vd~AlV~G 243 (400) T protein:vir:93 166 S-FDSANEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSL-AERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEG 243 (400) T ss_pred h-hhhhhhhhhhccCCccccceeeeeeechhHHHHHHHHHH-HHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHhhhhee Confidence 1 223346777788999998888888877776554444444 23333 4455679999999999999 8899999999 Q ss_pred cCcCcCCccc-ccc---ccc--ccceeecccchhH-HHHHHHHh-hhhccCCCEEEEcHHHHH-HHHHhhccCCc--eee Q lcl|NC_011614. 150 QGNNPFGKSI-AQS---IEK--TNKVIKGDFTQDN-IIDLEALL-EDDELEANAFISKTQNRS-LLRKIVDPETK--ERI 218 (324) Q Consensus 150 ~g~~~~~~~~-~~~---~~~--~~~~~~~~~~~~~-i~~~~~~l-~~~~~~~~~~v~~~~~~~-~L~~l~d~~g~--~~~ 218 (324) +|++.....- ... ... +....++...+.+ +..+..-+ +.++ .-.+++...+.. .|..++.+..+ ..+ T Consensus 244 DG~N~f~~~DK~advK~I~~~Ttkaksagktpfadaieeavdfvrptag--rrylivktedrkalldelrqatanahvri 321 (400) T protein:vir:93 244 DGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPTAG--RRYLIVKTEDRKALLDELRQATANAHVRI 321 (400) T ss_pred cCCCCccchhhHHHHHHHHHHhhhhhhcCCCchhHHHHHHHhhhccCCC--ceEEEEeccchHHHHHHHHhhccccceEe Confidence 9987522110 000 011 1111233333333 33332222 1122 123566555544 34455544322 221 Q ss_pred cc--CCCceecccc-e-EeecCccCCCceEEEeecccEEEEEecceEEEEeecccccccccccccchhh--hhcCcEEEE Q lcl|NC_011614. 219 YD--RNSDSLDGLP-V-VNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNL--FEQDMVALR 292 (324) Q Consensus 219 ~~--~~~~~l~G~p-v-~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~--f~~~~v~~r 292 (324) .. ....+--|.. + +++.+-.+ +..+ +.|. ...+++.+- +-++. |.+|+--+. T Consensus 322 knddaeiasevgvdeiivytgskal-kptv-lvdq---------kyhidmqdl-----------tkvdafewktnsnmil 379 (400) T protein:vir:93 322 KNDDAEIASEVGVDEIIVYTGSKAL-KPTV-LVDQ---------KYHIDMQDL-----------TKVDAFEWKTNSNMIL 379 (400) T ss_pred ecchhhhhhhcCcceeeeeeccccc-ccee-eecc---------ccccchhhh-----------hhhhhheeccCCceEE Confidence 11 1001111211 1 11111111 1111 1111 111111111 11112 344544455 Q ss_pred EEEEeccEEecccceEEEEee Q lcl|NC_011614. 
293 ATMHVALHIADDKAFAKLVPA 313 (324) Q Consensus 293 ~~~r~d~~v~~~~a~~~l~~~ 313 (324) ++....+.|-.-+|=++++.. T Consensus 380 vetltsghvetynagavitvs 400 (400) T protein:vir:93 380 VETLTSGHVETYNAGAVITVS 400 (400) T ss_pred EeecccCcceeeccceeEeeC Confidence 555555544443443444432 No 208 >protein:vir:78777 Length: 358 # NCBI annotation: putative major capsid protein # Family: family:all:201 # MgeID: mge:1857 # MgeName: phiO18P # Cross-refs: genbank:acc:YP_001285647;genbank:gi:148727153;genbank:GeneID:5220125 Probab=91.82 E-value=0.014 Score=30.74 Aligned_cols=300 Identities=13% Similarity=0.076 Sum_probs=153.2 Q ss_pred Cchh--hHHHHHHHHHhhccchhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCceE-EEEEe Q lcl|NC_011614. 1 MEQT--QKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKK-FTFWA 77 (324) Q Consensus 1 m~~~--~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~-ip~~~ 77 (324) |.+. +..+....+|..... +..+... ...+..+.|.+.+.+.+...+.+.+-++++.+.+++..-... +-.-. T Consensus 1 m~~~M~~~tr~~~~~y~~~~A---~~ngv~~-~~~~~~Fsv~p~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~lg~ 76 (358) T protein:vir:78 1 MSQTLTVQAEQRLNKYCDALA---KAYGIDI-SKLDKQFSVTGPVETTLRSALLASVEFLGLITCLDVDQIKGQVVQVGV 76 (358) T ss_pred CcccccHHHHHHHHHHHHHHH---HHhCCCh-hHccceeeeChHHHHHHHHHHHHHHHHhhcCcccccccceeeEEeecC Confidence 6552 222333333333211 1111111 111345778899999999999999999999998888754332 33333 Q ss_pred CCcceeeecccccccccccceeeEEeeeeeEEEeehhHHHHHhcC-----hhHHHHHHHHHHHHHHHHHHHHHHHhccCc Q lcl|NC_011614. 78 DKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYT-----YSQFFEEMKPMIAEAFYKKFDEAGILNQGN 152 (324) Q Consensus 78 ~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~v~iS~ell~~s-----~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~ 152 (324) +++-++-+.. 
..+.....++...+..++.-.-..|+.+.++.= ..+|...+.+.+.+.++...=.-.|+|+.- T Consensus 77 ~g~iagrt~t--r~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~f~~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~ 154 (358) T protein:vir:78 77 GQLYTGRKKG--GRFKGKVGVDGNTYELTETDSCASLDWATLCTWANAGSEGEFIKLVGEFVNKAFALDMLRVGWNGVSA 154 (358) T ss_pred CcccceecCC--CccccccccCCCccEEEEeceeeeccHHHHHHHHhCCChhHHHHHHHHHHHHHHhhccceecccceee Confidence 4444444333 223444556666666666666666777666521 136888999989988887777777788431 Q ss_pred --------CcCC----ccccc------------ccccccce-e-ec-ccch---hHH-HHHH-HHhhhhccCC--CEEEE Q lcl|NC_011614. 153 --------NPFG----KSIAQ------------SIEKTNKV-I-KG-DFTQ---DNI-IDLE-ALLEDDELEA--NAFIS 198 (324) Q Consensus 153 --------~~~~----~~~~~------------~~~~~~~~-~-~~-~~~~---~~i-~~~~-~~l~~~~~~~--~~~v~ 198 (324) ++.+ .|.+. ........ . .+ .-+| |.+ .++. ..|+..+++. -+.+| T Consensus 155 A~~Td~~~nPllqDVN~GWlQ~~Re~a~~~v~~~~~~~~~i~ig~g~~Gdy~NLDalV~D~~~~lI~~~~~~d~dLVviv 234 (358) T protein:vir:78 155 ADDTDPTANPLGQDVNKGWHQLAREWKGGSQIIKAAAGEKIYFDPDGKGEYKTLDEMASDLINTTIDPLFQQDPRLVVLV 234 (358) T ss_pred ccCCChhhCcCccccchHHHHHHHhhchhhhhccccccCceeecCCCCCccccHHHHHHHHHhccCChHHhcCCCEEEEE Confidence 1111 11111 00011111 0 01 1233 333 3454 3445555554 36778 Q ss_pred cHHHHH-HHHHhhccCCcee---eccCCCceecccceEeecCccCCCceEEEeecccEEEE-EecceEEEEeeccccccc Q lcl|NC_011614. 199 KTQNRS-LLRKIVDPETKER---IYDRNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYG-IPQLIEYKIDETAQLSTV 273 (324) Q Consensus 199 ~~~~~~-~L~~l~d~~g~~~---~~~~~~~~l~G~pv~~~~~~~~~~~~i~~gd~~~~~~~-~~~~~~i~~~~~~~~~~~ 273 (324) ...... +-..+-...+.|- -......++-|+|.+..+ ..|...++...++++-+- ..+..+=.+.++...+. 
T Consensus 235 G~dLla~k~~~l~n~~~~pTE~~Aa~~i~k~iGGlpa~~~P--fFP~~~ilVT~L~NLsIY~Q~gs~RR~~~d~p~r~r- 311 (358) T protein:vir:78 235 GTDLVAAAQAKLYSEATKPSEQIAAQQLAKSIAGRKAYIPP--FFPGKRMVVTTLDNLHCYTQRGTRKRKADDNQDSKS- 311 (358) T ss_pred chhhhhHHhhhHhhcCCCcHHHHHHHHHHHHhCCCeEEEcc--ccCCCceEEeeccccEEEEecCcEEEEEEecccccc- Confidence 777654 2223322222221 111112578999998755 455567788888876443 33333333333222111 Q ss_pred ccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeeccC----CCCc--cccC Q lcl|NC_011614. 274 KNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADAK----PSSV--PGEV 324 (324) Q Consensus 274 ~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~~~----~~~~--~~~~ 324 (324) ..++.. ..-|+.|-+..+++.++...-. |+.. +++- T Consensus 312 ------iE~y~s---------~Ne~YvVEd~~~~a~iE~i~v~~~~~pa~~~~~~~~ 353 (358) T protein:vir:78 312 ------FDNQYW---------RMEGYALGEHKAYGGFEEADIEIGADPAVLAVEAAA 353 (358) T ss_pred ------ccchhh---------hcceeeeeccccEEEEeeeeeeeCCCCCccccCCcc Confidence 111222 2356778888888887654322 1111 1111 No 209 >protein:vir:94870 Length: 318 # NCBI annotation: putative structural protein # Family: family:all:2417 # MgeID: mge:1532 # MgeName: P008 # Cross-refs: genbank:acc:YP_762518;genbank:gi:115304217;genbank:GeneID:5141183 Probab=91.40 E-value=0.016 Score=30.43 Aligned_cols=287 Identities=15% Similarity=0.176 Sum_probs=140.8 Q ss_pred CchhhHHHHHHHHHhhccchhhhhcc-------ccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCceEE Q lcl|NC_011614. 1 MEQTQKLKLNLQHFASNNVKPQVFNP-------DNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKF 73 (324) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~a-------~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~i 73 (324) +|.+|....+..-...+.-.++-.++ ++++. 
++...-+|..+...|-..+....++.+.+-+.+++.--.+ T Consensus 5 iesqnavteffdvlkknsgkseiknawnaklaengvti-tdttfqlprklvesintallntnpvfkvfhvtnvgallvs- 82 (318) T protein:vir:94 5 IESQNAVTEFFDVLKKNSGKSEIKNAWNAKLAENGVTI-TDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVS- 82 (318) T ss_pred hhhhhhHHHHHHHHhcccChhhhhhhhhhhhhhCCcee-ecchhhhHHHHHHhhhhhhccCCcceeeeeehhhhheeee- Confidence 66677665554433333222222222 22222 2334557888888888888888888887777776554221 Q ss_pred EEEeCCcceeeecccccccccccceeeEEeeeeeEEEeehhHH--HHHhcChhHHHHHHHHHHHHHHHHH-HHHHHHhcc Q lcl|NC_011614. 74 TFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTK--EFLNYTYSQFFEEMKPMIAEAFYKK-FDEAGILNQ 150 (324) Q Consensus 74 p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~v~iS~--ell~~s~~~~~~~v~~~l~~ai~~~-~d~a~l~g~ 150 (324) ..+++..+|....+|+.+.+...++.--++.|.-+.....+-+ +-+++|...+...|...|..++..+ .|-+++-|+ T Consensus 83 rsfdssneaqvhkdgqtkteqaatltidtlepvmvyklqslaervkrlqmsyselynlivaeltqaivnkivdlalvegd 162 (318) T protein:vir:94 83 RSFDSSNEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGD 162 (318) T ss_pred ccccccchhhhhcccccccccceeeeecccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHhhhhheeeeecC Confidence 2345566788888999999988888877777766666555544 3466788889999999999999888 466777788 Q ss_pred CcCcCCccccc-cccc-----ccceeecccchhH-HHHHHHHh-hhhccCCCEEEEcHHHHHH-HHHhhccCC--ceeec Q lcl|NC_011614. 151 GNNPFGKSIAQ-SIEK-----TNKVIKGDFTQDN-IIDLEALL-EDDELEANAFISKTQNRSL-LRKIVDPET--KERIY 219 (324) Q Consensus 151 g~~~~~~~~~~-~~~~-----~~~~~~~~~~~~~-i~~~~~~l-~~~~~~~~~~v~~~~~~~~-L~~l~d~~g--~~~~~ 219 (324) |++.....-.. .+.+ +....++...+.| +..+..-+ +.++ .-.+++...+..+ |..++.+.. +..+. 
T Consensus 163 gtngfksidkeadvkkikkittkaksagktpfadaieeavdfvrptag--rrylivktedrkalldelrqatananvrik 240 (318) T protein:vir:94 163 GTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPTAG--RRYLIVKTEDRKALLDELRQATANANVRIK 240 (318) T ss_pred CcchhhhhchhhhHHHHHHhhhhhhhcCCCchhHHHHHHHhhhccCCC--ceEEEEeccchHHHHHHHHhhhcccceEEe Confidence 87643211111 1100 1111223333333 32332222 1122 1235666555443 445554332 22221 Q ss_pred c-CC-Cceecccc-eE-eecCccCCCceEEEeecccEEEEEecceEEEEeecccccccccccccchhh--hhcCcEEEEE Q lcl|NC_011614. 220 D-RN-SDSLDGLP-VV-NLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNL--FEQDMVALRA 293 (324) Q Consensus 220 ~-~~-~~~l~G~p-v~-~~~~~~~~~~~i~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~--f~~~~v~~r~ 293 (324) . .. ..+--|.. ++ ++.+-.+ +. ..+.|. ...+++.+- +-++. |.+|+--+.+ T Consensus 241 nddteiasevgvdeiivytgskav-kp-tvlvdq---------kyhidmqdl-----------tkvdafewktnsnmilv 298 (318) T protein:vir:94 241 NDDTEIASEVGVDEIIVYTGSKAV-KP-TVLVDQ---------KYHIDMQDL-----------TKVDAFEWKTNSNMILV 298 (318) T ss_pred ccchhhhhhcCcceeEEeeccccc-cc-eeEecc---------ceecchhhh-----------hhhhceeeccCCceEEE Confidence 1 10 00111211 11 1111111 11 111111 111111111 11111 3445444555 Q ss_pred EEEeccEEecccceEEEEee Q lcl|NC_011614. 294 TMHVALHIADDKAFAKLVPA 313 (324) Q Consensus 294 ~~r~d~~v~~~~a~~~l~~~ 313 (324) +....+.+-.-+|=++++.. T Consensus 299 etltsghvetynagavitvs 318 (318) T protein:vir:94 299 ETLTSGHVETYNAGAVITVS 318 (318) T ss_pred EecccCcceeecCceeEEeC Confidence 55555544443443444432 No 210 >protein:vir:104011 Length: 337 # NCBI annotation: P2 family phage major capsid protein # Family: family:all:201 # MgeID: mge:1665 # MgeName: phi52237 # Cross-refs: genbank:acc:YP_293748;genbank:gi:72537718;genbank:GeneID:3608142 Probab=90.21 E-value=0.022 Score=29.67 Aligned_cols=290 Identities=13% Similarity=0.079 Sum_probs=157.2 Q ss_pred Cchhh--HHHHHHHHHhhccchhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCce-EEEEEe Q lcl|NC_011614. 
1 MEQTQ--KLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEK-KFTFWA 77 (324) Q Consensus 1 m~~~~--~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~-~ip~~~ 77 (324) |.+.- ++...+.+.+... ++.. .+..+.|-|.+.+.+...+.+.+-++++.+.+++..-.. .+-.-. T Consensus 1 M~~~tr~~~~~y~~~~A~~n---------gv~~-~~~~FsV~P~v~q~L~~~i~ess~FL~~Invv~V~e~~Ge~v~lg~ 70 (337) T protein:vir:10 1 MRKETRQAYEKYAAQIAKLN---------DTGD-VSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSV 70 (337) T ss_pred CChHHHHHHHHHHHHHHHhc---------Chhh-hcceeeecHHHHHHHHHHHHHHHHhhccCceeccccceeeEEeecc Confidence 76633 3333344333322 1211 123456778888999999999999999999988875433 233333 Q ss_pred CCcceeeec--ccccccccccceeeEEeeeeeEEEeehhHHHHHhc--ChhHHHHHHHHHHHHHHHHHHHHHHHhccC-- Q lcl|NC_011614. 78 DKPGAYWVG--EGQKIETSKATWVNATMRAFKLGVILPVTKEFLNY--TYSQFFEEMKPMIAEAFYKKFDEAGILNQG-- 151 (324) Q Consensus 78 ~~~~a~~v~--Eg~~~~~~~~~~~~v~~~~~k~~~~v~iS~ell~~--s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g-- 151 (324) +++-+.-.. .+...|.+-..++.-.+..++.-.-..|+.+.++. ..++|...+.+.+.+.++..+=.-.|+|+. T Consensus 71 ~g~iagrt~t~~~~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfnG~s~A 150 (337) T protein:vir:10 71 SGPIASRTDTTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAA 150 (337) T ss_pred CcceeeeecCCCCccccccccccCCCccEEEEeeeeeeccHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhcccceeec Confidence 444443332 22223334455677777777766667777776663 236899999999999988887778888854 Q ss_pred --c----CcCC----ccccccc------------cccc--ceeecccchhH----HHHHHHH-hhhhccCC--CEEEEcH Q lcl|NC_011614. 152 --N----NPFG----KSIAQSI------------EKTN--KVIKGDFTQDN----IIDLEAL-LEDDELEA--NAFISKT 200 (324) Q Consensus 152 --~----~~~~----~~~~~~~------------~~~~--~~~~~~~~~~~----i~~~~~~-l~~~~~~~--~~~v~~~ 200 (324) + ++.+ .|.+... ...+ ......-+|.. +.++... +++.+++. -+.+|.. 
T Consensus 151 ~~Td~~~nPllqDVNkGWlQ~~Re~ap~rV~~~~~~~~~~i~iG~~gdy~nLDalV~D~~~~lI~~~~~~d~~LVvivG~ 230 (337) T protein:vir:10 151 ATTDRQANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGR 230 (337) T ss_pred cCCChhhCcCccccchhHHHHHHhcchhhhhccccccCcceeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEch Confidence 1 1111 1111110 0000 00111123333 3455543 45555544 3677777 Q ss_pred HHHH-HHHHhhccCCcee--ec-cC--CCceecccceEeecCccCCCceEEEeecccEEEEEecc-eEEEEeeccccccc Q lcl|NC_011614. 201 QNRS-LLRKIVDPETKER--IY-DR--NSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQL-IEYKIDETAQLSTV 273 (324) Q Consensus 201 ~~~~-~L~~l~d~~g~~~--~~-~~--~~~~l~G~pv~~~~~~~~~~~~i~~gd~~~~~~~~~~~-~~i~~~~~~~~~~~ 273 (324) .... +-..+-...+.|- .. +. ...++.|+|.+..+ ..|+..++...++++-+-...+ .+=.+.+... T Consensus 231 dLladk~~~l~n~~~~ptE~~Aa~~i~s~k~iGGlpa~~~P--ffP~~~~lVT~L~NLsIY~Q~gs~RR~~~d~p~---- 304 (337) T protein:vir:10 231 ELLHDKYFPIVNATQAPTERLAADLIVSQKRIGNLPAVRVP--FFPKRALMVTKLSNLSIYYQEGARRRTLKEVPE---- 304 (337) T ss_pred hhhhHHhhHHhccCCCcHHHHHHHHHHHhhhhCCceeEEcc--ccCCCceEEeechhcEEEEecCcEEEEEEEccc---- Confidence 6654 2222222222221 11 11 12578999998755 4556678888888875544333 3333322221 Q ss_pred ccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeeccCCC Q lcl|NC_011614. 
274 KNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADAKPS 318 (324) Q Consensus 274 ~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~~~~~ 318 (324) ++.+.-.=...-|+.|-+..++|.++...-..+ T Consensus 305 ------------r~rie~y~s~Ne~YvVEd~~~~a~ienI~~~~a 337 (337) T protein:vir:10 305 ------------RDRIENYESSNDAYVVEDFGCGCVAENIELAAA 337 (337) T ss_pred ------------cccccchhhccceeeeeccccEEEEeceeecCC Confidence 121111112235778888899998876555444 No 211 >protein:vir:79171 Length: 337 # NCBI annotation: gp2, phage major capsid protein, P2 family # Family: family:all:201 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111033;genbank:gi:134288740;genbank:GeneID:4960690 Probab=90.16 E-value=0.022 Score=29.63 Aligned_cols=290 Identities=13% Similarity=0.074 Sum_probs=157.0 Q ss_pred Cchhh--HHHHHHHHHhhccchhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCce-EEEEEe Q lcl|NC_011614. 1 MEQTQ--KLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEK-KFTFWA 77 (324) Q Consensus 1 m~~~~--~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~-~ip~~~ 77 (324) |.+.- ++...+.+.+... ++.. .+..+.|-|.+.+.+...+.+.+-++++.+.+++..-.. .+-.-. T Consensus 1 M~~~tr~~~~~y~~~~A~~n---------gv~~-~~~~FsV~P~v~q~L~~~i~ess~FL~~Invv~V~e~~Ge~v~lg~ 70 (337) T protein:vir:79 1 MRKETRQAYEKYAAQIAKLN---------DTGD-VSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSV 70 (337) T ss_pred CChHHHHHHHHHHHHHHHhc---------Chhh-hcceeeecHHHHHHHHHHHHHHHHhhccCceeccccceeeEEeecc Confidence 76633 3333344333322 1211 123456778888999999999999999999988875433 233333 Q ss_pred CCcceeeec--ccccccccccceeeEEeeeeeEEEeehhHHHHHhc--ChhHHHHHHHHHHHHHHHHHHHHHHHhccC-- Q lcl|NC_011614. 78 DKPGAYWVG--EGQKIETSKATWVNATMRAFKLGVILPVTKEFLNY--TYSQFFEEMKPMIAEAFYKKFDEAGILNQG-- 151 (324) Q Consensus 78 ~~~~a~~v~--Eg~~~~~~~~~~~~v~~~~~k~~~~v~iS~ell~~--s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g-- 151 (324) +++-+.-.. .+...|.+-..++.-.+..++.-.-..|+.+.++. ..++|...+.+.+.+.++..+=.-.|+|+. 
T Consensus 71 ~g~iagrt~t~~~~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfnG~s~A 150 (337) T protein:vir:79 71 SGPIASRTDTTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDAWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAA 150 (337) T ss_pred CcceeeeecCCCCccccccccccCCCccEEEEeeeeeeccHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhcccceeec Confidence 444443332 22223334455677777777766667777776663 236899999999999988887778888854 Q ss_pred --c----CcCC----ccccccc------------cccc--ceeecccchhH----HHHHHHH-hhhhccCC--CEEEEcH Q lcl|NC_011614. 152 --N----NPFG----KSIAQSI------------EKTN--KVIKGDFTQDN----IIDLEAL-LEDDELEA--NAFISKT 200 (324) Q Consensus 152 --~----~~~~----~~~~~~~------------~~~~--~~~~~~~~~~~----i~~~~~~-l~~~~~~~--~~~v~~~ 200 (324) + ++.+ .|.+... ...+ ......-+|.. +.++... +++.+++. -+.+|.. T Consensus 151 ~~Td~~~nPllqDVNkGWlQ~~Re~ap~rV~~~~~~~~~~i~iG~~gdy~nLDalV~D~~~~lI~~~~~~d~~LVvivG~ 230 (337) T protein:vir:79 151 ATTDRQANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVAICGR 230 (337) T ss_pred cCCChhhCcCccccchhHHHHHHhcchhhhhccccccCcceeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEch Confidence 1 1111 1111110 0000 01111123333 3455543 45555544 3677777 Q ss_pred HHHH-HHHHhhccCCcee--ec-cC--CCceecccceEeecCccCCCceEEEeecccEEEEEecc-eEEEEeeccccccc Q lcl|NC_011614. 201 QNRS-LLRKIVDPETKER--IY-DR--NSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQL-IEYKIDETAQLSTV 273 (324) Q Consensus 201 ~~~~-~L~~l~d~~g~~~--~~-~~--~~~~l~G~pv~~~~~~~~~~~~i~~gd~~~~~~~~~~~-~~i~~~~~~~~~~~ 273 (324) .... +-..+-...+.|- .. +. ...++.|+|.+..+ ..|+..++...++++-+-...+ .+=.+.+... T Consensus 231 dLladk~~~l~n~~~~ptE~~Aa~~i~s~k~iGGlpa~~~P--ffP~~~~lVT~L~NLsIY~Q~gs~RR~~~d~p~---- 304 (337) T protein:vir:79 231 ELLHDKYFPIVNATQAPTERLAADLIVSQKRIGNLPAVRVP--FFPKRALMVTKLSNLSIYYQEGARRRTLKEVPE---- 304 (337) T ss_pred hhhhHHhhHHhccCCCcHHHHHHHHHHHhhhhCCceeEEcc--ccCCCceEEeechhcEEEEecCcEEEEEEEccc---- Confidence 6654 2222222222221 11 11 12578999998755 4556678888888875544333 3333322221 Q ss_pred ccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeeccCCC Q lcl|NC_011614. 
274 KNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADAKPS 318 (324) Q Consensus 274 ~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~~~~~ 318 (324) ++.+.-.=...-|+.|-+..++|.++...-..+ T Consensus 305 ------------r~rie~y~s~Ne~YvVEd~~~~a~ienI~~~~a 337 (337) T protein:vir:79 305 ------------RDRIENYESSNDAYVVEDFGCGCVAENIELAAA 337 (337) T ss_pred ------------cccccchhhccceeeeeccccEEEEeceeecCC Confidence 121111112235777888899888876555444 No 212 >protein:vir:79008 Length: 299 # NCBI annotation: putative main capsid protein # Family: family:all:701 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110725;genbank:gi:134287342;genbank:GeneID:4955182 Probab=89.61 E-value=0.025 Score=29.33 Aligned_cols=271 Identities=9% Similarity=0.008 Sum_probs=125.4 Q ss_pred cccCCCcceechhhhHHHHHHHHhhcchhhhce------eeecCCCceEEEEEeCCcceee-ecc-cccccccccceeeE Q lcl|NC_011614. 30 MMHEKKDGTLLNDFTTPILQEVMENSKIMQLGK------YEPMEGTEKKFTFWADKPGAYW-VGE-GQKIETSKATWVNA 101 (324) Q Consensus 30 ~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~------~~~~~~~~~~ip~~~~~~~a~~-v~E-g~~~~~~~~~~~~v 101 (324) |.+ .-.++.++..+.+.+...+....|.. +...++.+++||..+...-... .+- |..-...+.++... T Consensus 1 MA~----~n~a~~~~~~Ld~~~~~~l~~~~L~~~~~~~~v~~~gg~tVkI~~i~~~gl~DY~R~~~g~~~g~~~~~~~t~ 76 (299) T protein:vir:79 1 MAA----LNYAKEYSNVLAQAYPYTLNFGDLYATPNNGRYRWTGSKTIEIPTISTTGRVDSNRDTIAVAQRNYDNAWEPK 76 (299) T ss_pred Ccc----chhHHHHHHHHHHHHHhhceeeeeccCcccceeeecCCCEEEEeccccccccccccCCCcccccccCcceeEE Confidence 111 11247778888888877776665532 2234567899998875332222 121 22222334556667 Q ss_pred EeeeeeEEEe-ehhHHHHHhcC--hhHHHHHHHHHHHHHHHHHHHHHHHhccCcCcCCcccccccccccceeecccchhH Q lcl|NC_011614. 102 TMRAFKLGVI-LPVTKEFLNYT--YSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDN 178 (324) Q Consensus 102 ~~~~~k~~~~-v~iS~ell~~s--~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 178 (324) ++.-.|.-.+ +.--+ .+.+ ...+...+.+...+.++-.+|...+..--++....+ ........+.+.-++. 
T Consensus 77 ~ldqdr~~~f~vD~~D--vdet~~~~~~a~v~~~~~~~~v~pEiDay~~skl~~~a~~~g----~~~~~~~~T~~n~y~~ 150 (299) T protein:vir:79 77 VLTNQRKWSTLVHPAD--INQTNYVASIGNITKVYNEEQKFPEMDAYCISKIYADWTALG----NTADTTVLTTTNVLEV 150 (299) T ss_pred Eeeccccceeccchhh--HHHHhhhhHHHHHHHHHHHHHhhhHhhHHHHHHHHHhhhhcC----CcccccccCHHHHHHH Confidence 7766665443 22112 1112 122344455555666666777766643111110000 0011111223445788 Q ss_pred HHHHHHHhhhhccCC--CEEEEcHHHHHHHHHhhc----c--CCceeeccCCCceecccceEeecCccCC-CceEEEeec Q lcl|NC_011614. 179 IIDLEALLEDDELEA--NAFISKTQNRSLLRKIVD----P--ETKERIYDRNSDSLDGLPVVNLKSSNLK-RGELITGDF 249 (324) Q Consensus 179 i~~~~~~l~~~~~~~--~~~v~~~~~~~~L~~l~d----~--~g~~~~~~~~~~~l~G~pv~~~~~~~~~-~~~i~~gd~ 249 (324) +.+++.+|..+.... -.++++|..+..|.+... . ........+..++|.|+||+..++.-+. +..+.-|.. T Consensus 151 i~~~~~~lde~~vP~~~rvl~vtp~~~~~L~~~~~f~k~~~~~~~~~~~~g~Vg~idG~~Ii~Vps~r~~t~~~~~~G~~ 230 (299) T protein:vir:79 151 FDKLMEKMTEARVPENGRILYVTPVVNTLIKNAKEIQRTVNIKDAGTSLNRQTTDIDTVKIIKVPSNLMKTAYDFTTGWK 230 (299) T ss_pred HHHHHHHHHhcCCCCCCeEEEeCHHHHHHHhhchhhhcccccccccceeeeeeeeecceEEEEechhhcCccceeccCcc Confidence 889999998877543 457899999998864321 1 1112344566688999999875543221 111111100 Q ss_pred -------ccEEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccc-eEEEEeeccCC Q lcl|NC_011614. 250 -------DKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKA-FAKLVPADAKP 317 (324) Q Consensus 250 -------~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a-~~~l~~~~~~~ 317 (324) -++++. .....+.+.......... ++. .+++--.+.-..|.|.-|.+.+. 
-+.+..+++.+ T Consensus 231 ~~~~ak~in~ii~-~~~a~~~~~K~~~~~~~~-P~~-----~~~~~~~~~~r~y~d~~v~~nk~~~i~~~~~~a~~ 299 (299) T protein:vir:79 231 VGAGAKQIFMSLV-HPSAIITPVSYQFSKLDE-PTA-----VTEGKYFYFEESFEDVFILNKKADAIQFVVEGAGA 299 (299) T ss_pred ccCcccccceEEE-cCCeeeeeEeeeeEEeec-CCC-----CCccceeeeeeeeeeeeeeccccCeEEEEeeecCC Confidence 012222 122222222222211111 110 01111112223456665665333 22344444444 No 213 >protein:vir:102823 Length: 470 # NCBI annotation: major structural protein # Family: family:all:2450 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874086;genbank:gi:118197693;genbank:GeneID:4496015 Probab=89.22 E-value=0.015 Score=30.51 Aligned_cols=276 Identities=13% Similarity=0.055 Sum_probs=128.2 Q ss_pred CchhhHHHHHHHHHhhccchhhhhccccccccCCCcceechhhhHHHHHHH--HhhcchhhhceeeecCCCceEEEEEe- Q lcl|NC_011614. 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEV--MENSKIMQLGKYEPMEGTEKKFTFWA- 77 (324) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~--~~~s~l~~l~~~~~~~~~~~~ip~~~- 77 (324) |- -.+++|..... ..|.+. ....|+++=-+.+..++.... .+.-.+.+-..+.+..+....+-... T Consensus 1 ~~-----~~~~~~~~~a~-----~~al~~-a~~~g~AlR~EsLd~~l~~lt~~~~~ftf~~~i~k~~a~STV~ey~~~~~ 69 (470) T protein:vir:10 1 MP-----YEHLKHLDEAT-----LKALNA-AGQVAESLEREDLEPEVTQLNVLDTPLTDLLSKNAVKAKAYEHEYNVVTA 69 (470) T ss_pred CC-----hhHhhhhhHHH-----HHHHHH-hhhcchhhhhhhhccceeEeeecCccchhhhhcCCchhhhHhhhhhhhcc Confidence 32 12233322221 112222 111223331122222221111 11111222222333333322332211 Q ss_pred --CCcceeeecccccccccccceeeEEeeeeeEEEeehhHHHH---HhcChhHHHHHHHHHHHHHHHHHHHHHHHhccCc Q lcl|NC_011614. 78 --DKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEF---LNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGN 152 (324) Q Consensus 78 --~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~v~iS~el---l~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~ 152 (324) +....+...|++-.+.+++++....+..+-++....+|.-. ++....+++..+.+.---.+++.+|.++|.|+.. 
T Consensus 70 rhG~~g~s~~~E~~l~~~~d~~~~Rr~v~~K~l~~~~~VT~~a~~~~~n~v~d~~~~~~~dai~~ia~tiE~a~FyGDs~ 149 (470) T protein:vir:10 70 RHDKIGYAAFREGGLPRTVEVNVVRRRIRPMLVGHRITVTELATRTTQNGVMQIDELVKREKMIAVANEFEYLAFYGDNL 149 (470) T ss_pred ccccccceeecccccCccCCCceEEEEEEEEEEeecchhhhhhhhhhhccccchHHHHHHHHHHHHHHHHHhhhhhhccc Confidence 22223345899999999999999999999999998998753 3334457888888888888999999999999641 Q ss_pred ---C-------cCCcccccccc-----cccceeecccchhHHHHHHHHh--hhhccCCCEEEEcHHHHHHHHHhhccCCc Q lcl|NC_011614. 153 ---N-------PFGKSIAQSIE-----KTNKVIKGDFTQDNIIDLEALL--EDDELEANAFISKTQNRSLLRKIVDPETK 215 (324) Q Consensus 153 ---~-------~~~~~~~~~~~-----~~~~~~~~~~~~~~i~~~~~~l--~~~~~~~~~~v~~~~~~~~L~~l~d~~g~ 215 (324) . -+..|+...+. .+-..-...++.+.|..+...+ ..++..++-++|+..+.+.|..--....+ T Consensus 150 l~s~~~g~~~gleFDGl~~lId~~~~~NViDarG~~Ls~~~L~~aa~~I~~~~~fGt~TD~~lp~~vka~f~~~~~~~qR 229 (470) T protein:vir:10 150 LGDDVPGSPNNLQQDGIINIIKRGAPQNVLDAGGRPLSIDLLWEAESRVVSTQAFANPTAVFISYVDKLNLQASFYQISR 229 (470) T ss_pred cccccCcccCceeccchhhhccCCCCccccccCCCCccHHHHHHHHhhhcccccccChhhhccchhHHHHHHHhhcCceE Confidence 1 11222222111 1111222345566666666666 45778888899999999998765555445 Q ss_pred eeeccCCCceecccceEeecCccCCCceEEEeecccEEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEE Q lcl|NC_011614. 216 ERIYDRNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATM 295 (324) Q Consensus 216 ~~~~~~~~~~l~G~pv~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~ 295 (324) .++....+....|+|+--.- +.++++.+.-+. ....+.. + +- . T Consensus 230 v~~~~N~~~~~~G~~v~~f~-------------------sa~G~I~L~~s~--~m~~~~k--------~--~p------~ 272 (470) T protein:vir:10 230 VMTTADRRAGLLGADAQSYI-------------------GVRGEHSLYPSQ--FLGDFHK--------F--NP------A 272 (470) T ss_pred EEEecCCCceeeeeecccee-------------------eeeeeeeecccc--cccchhh--------c--Cc------c Confidence 55554444456787762110 011111110000 0000000 0 00 0 Q ss_pred EeccEE---ecccceEEEE---eeccCCCCc------cccC Q lcl|NC_011614. 
296 HVALHI---ADDKAFAKLV---PADAKPSSV------PGEV 324 (324) Q Consensus 296 r~d~~v---~~~~a~~~l~---~~~~~~~~~------~~~~ 324 (324) +++-.+ .-|...+-++ +.++.+..+ +.+| T Consensus 273 ~l~~~v~~~aAP~~~~tv~~t~~~~a~~~~sk~g~~~~~~v 313 (470) T protein:vir:10 273 RFGAEVGDFAAPSNSWTVSTTDNFVTLPYNSGLGDPANTTV 313 (470) T ss_pred cCCcccCCcccCceeEEeecCCCceeecccCCCCcccCcce Confidence 111111 1222222211 112222222 3334 No 214 >protein:vir:1153 Length: 338 # NCBI annotation: predicted major capsid protein # Family: family:all:201 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490602;genbank:gi:17313222;genbank:GeneID:927319 Probab=87.79 E-value=0.036 Score=28.47 Aligned_cols=291 Identities=13% Similarity=0.102 Sum_probs=153.6 Q ss_pred CchhhHHHHHHHHHhhccchhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCce-EEEEEeCC Q lcl|NC_011614. 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEK-KFTFWADK 79 (324) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~-~ip~~~~~ 79 (324) |...-. ....+|... .....++. ..+..+.|.|.+.+.+...+.+.+-++++.+.+++..-.. .+-.-.++ T Consensus 1 M~~~tr--~~~~~y~~~-----~A~~ngv~-~~~~~FsV~P~v~q~L~~~i~ess~FL~~Invv~V~e~~Ge~v~lg~~g 72 (338) T protein:vir:11 1 MRNETR--KQFDAYLAQ-----LAKLNGVN-SAVQTFAVEPSVQQKLEQRIQESSEFLKQINVYGVDELQGEKIGIGVSG 72 (338) T ss_pred CCHHHH--HHHHHHHHH-----HHHHhCCC-cccceeeeCHHHHHHHHHHHHHHHHhhccCceecccceeeeEeeeccCc Confidence 654432 222333322 11112222 2244567889999999999999999999999988875443 33333444 Q ss_pred cceeeec--ccc-cccccccceeeEEeeeeeEEEeehhHHHHHhc--ChhHHHHHHHHHHHHHHHHHHHHHHHhccC--- Q lcl|NC_011614. 80 PGAYWVG--EGQ-KIETSKATWVNATMRAFKLGVILPVTKEFLNY--TYSQFFEEMKPMIAEAFYKKFDEAGILNQG--- 151 (324) Q Consensus 80 ~~a~~v~--Eg~-~~~~~~~~~~~v~~~~~k~~~~v~iS~ell~~--s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g--- 151 (324) +-+.-+. .+. ..|..-..++.-.+..++.-.-..|+.+.++. ..++|...+.+.+.+.++..+=.-.|+|+. 
T Consensus 73 ~iagrtdT~~~~~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~k~~ALD~i~IGfnG~s~A~ 152 (338) T protein:vir:11 73 TIASRTDTTGDGVRKPRDVSALDNQRYECKHTDFDTAITYAMLDAWAKFPEFQALLRDAILKRQALDRLMIGFNGTSAAA 152 (338) T ss_pred cccccccCCCCCccccccccccCCCccEEEEeeeeeeecHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhcccceeecc Confidence 4444332 122 22222234566666666666666666666653 236899999999999988887788888854 Q ss_pred -c----CcCC----ccccccc-----------cccc-ce-e-ec-ccchhHH----HHHHH-HhhhhccCC--CEEEEcH Q lcl|NC_011614. 152 -N----NPFG----KSIAQSI-----------EKTN-KV-I-KG-DFTQDNI----IDLEA-LLEDDELEA--NAFISKT 200 (324) Q Consensus 152 -~----~~~~----~~~~~~~-----------~~~~-~~-~-~~-~~~~~~i----~~~~~-~l~~~~~~~--~~~v~~~ 200 (324) + ++.+ .|.+... ++.+ .. . .+ .-.|..| .++.. .+++.+++. -+.+|.. T Consensus 153 ~Td~~~nPllqDVNkGWlQ~~Re~ap~rv~~~~~~~~~i~i~~g~~gdy~nLDalV~d~~~~lI~~~~~~d~dLVvivG~ 232 (338) T protein:vir:11 153 TTNRAANPLLQDVNIGWFQQYRNNAPARVLKEGKTTGKVVVGNGADADYKNLDALVFDVVSSLIDPWHRRDPGLVVILGR 232 (338) T ss_pred CCChhhCcCccccchhHHHHHHhhhhhhhhhcccccceeeecCCCCCccccHHHHHHHHHhccCChHHhcCCCEEEEEch Confidence 1 1111 1111100 0000 00 0 01 1223332 34554 345555544 3677887 Q ss_pred HHHH-HHHHhhccCCcee--ec-cC--CCceecccceEeecCccCCCceEEEeecccEEEEEecc-eEEEEeeccccccc Q lcl|NC_011614. 201 QNRS-LLRKIVDPETKER--IY-DR--NSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQL-IEYKIDETAQLSTV 273 (324) Q Consensus 201 ~~~~-~L~~l~d~~g~~~--~~-~~--~~~~l~G~pv~~~~~~~~~~~~i~~gd~~~~~~~~~~~-~~i~~~~~~~~~~~ 273 (324) .... +-..+-.....|- .. +. ...++.|+|.+..+ ..|...++...++++-+-...+ .+=.+.+... T Consensus 233 dLladk~~~l~n~~~~ptE~~Aa~~~~s~k~iGGlpa~~~P--ffP~~~~lVT~L~NLsIY~Q~gs~RR~~~d~p~---- 306 (338) T protein:vir:11 233 ELVHDKYFPMVNKDQPATEKIATDLILSQKRMGGLPPVEVP--YVPEKGLMVTTLKNLSLYWQIGGRRRYLKEVPE---- 306 (338) T ss_pred hhhHHHHhHHHhcCCChHHHHHHHHHHHhhhhCCceeEEcc--ccCCCceEEeeccccEEEEecCcEEEEEEeccc---- Confidence 7543 2122222222221 11 11 13579999998755 4556678888888875544333 3322322221 Q ss_pred ccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeeccCC Q lcl|NC_011614. 
274 KNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADAKP 317 (324) Q Consensus 274 ~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~~~~ 317 (324) ++.+.-.=...-|+.|-+..++|.++...-.- T Consensus 307 ------------r~rie~y~s~Ne~YvVEd~~~~a~ieni~~~~ 338 (338) T protein:vir:11 307 ------------KNRIENYESSNDAYVVEDYGLGCLVENIEVAE 338 (338) T ss_pred ------------cccccchhhhccceeeeccccEEEeecceecC Confidence 11111111123577788888888887654444 No 215 >protein:vir:78186 Length: 337 # NCBI annotation: gp2, phage major capsid protein, P2 family # Family: family:all:201 # MgeID: mge:1848 # MgeName: phiE12-2 # Cross-refs: genbank:acc:YP_001111152;genbank:gi:134288735;genbank:GeneID:4960646 Probab=87.03 E-value=0.041 Score=28.16 Aligned_cols=290 Identities=13% Similarity=0.068 Sum_probs=154.9 Q ss_pred Cchhh--HHHHHHHHHhhccchhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCce-EEEEEe Q lcl|NC_011614. 1 MEQTQ--KLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEK-KFTFWA 77 (324) Q Consensus 1 m~~~~--~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~-~ip~~~ 77 (324) |.+.- ++...+.+.+...- +. ..+..+.|-|.+.+.+...+.+.+-++++.+.+++..-.. .+-.-. T Consensus 1 M~~~tr~~~~~y~~~~A~~ng---------v~-~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~lg~ 70 (337) T protein:vir:78 1 MRKETRQAYEKYAAQIAKLND---------TG-DVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSV 70 (337) T ss_pred CChHHHHHHHHHHHHHHHhcC---------hh-hhcceeecChHHHHHHHHHHHHHHHHhccCCccccccceeeEEeccc Confidence 76633 33333444333221 11 1234566888899999999999999999999888774433 233333 Q ss_pred CCcceeeec--ccccccccccceeeEEeeeeeEEEeehhHHHHHhc--ChhHHHHHHHHHHHHHHHHHHHHHHHhccCc- Q lcl|NC_011614. 78 DKPGAYWVG--EGQKIETSKATWVNATMRAFKLGVILPVTKEFLNY--TYSQFFEEMKPMIAEAFYKKFDEAGILNQGN- 152 (324) Q Consensus 78 ~~~~a~~v~--Eg~~~~~~~~~~~~v~~~~~k~~~~v~iS~ell~~--s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~- 152 (324) +++-++-.. -+...|.+-..++.-.+..++.-.-..|+.+.++. 
..++|...+.+.+.+.++...=.-.|+|+.- T Consensus 71 ~g~iagrtdt~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A 150 (337) T protein:vir:78 71 SGPIASRTDTTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAA 150 (337) T ss_pred CcceeeeecCCCcccccccccccCCCccEEEEeceecccCHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeec Confidence 444443332 12222333345666666666666666676666652 2368999999999988887777777788431 Q ss_pred -------CcCC----cccc------------ccccccc--ceeecccchhHH----HHHHHH-hhhhccCC--CEEEEcH Q lcl|NC_011614. 153 -------NPFG----KSIA------------QSIEKTN--KVIKGDFTQDNI----IDLEAL-LEDDELEA--NAFISKT 200 (324) Q Consensus 153 -------~~~~----~~~~------------~~~~~~~--~~~~~~~~~~~i----~~~~~~-l~~~~~~~--~~~v~~~ 200 (324) ++.+ .|.+ ......+ ......-+|..| .++... +++.+++. -+.+|.. T Consensus 151 ~~Td~~~nPllqDVN~GWlQ~~Re~ap~rVl~~~~~~~~~i~iG~~gdy~NLDalV~d~~~~lI~~~~~~d~dLVvivG~ 230 (337) T protein:vir:78 151 ATTDRQANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLIGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGR 230 (337) T ss_pred cCCChhhCcCccccchHHHHHHHhcchhhhhccccccCCceeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEch Confidence 1111 1111 1000000 001111233332 455653 46655544 3677777 Q ss_pred HHHHH-HHHhhccCCcee--ec-c--CCCceecccceEeecCccCCCceEEEeecccEEEE-EecceEEEEeeccccccc Q lcl|NC_011614. 201 QNRSL-LRKIVDPETKER--IY-D--RNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYG-IPQLIEYKIDETAQLSTV 273 (324) Q Consensus 201 ~~~~~-L~~l~d~~g~~~--~~-~--~~~~~l~G~pv~~~~~~~~~~~~i~~gd~~~~~~~-~~~~~~i~~~~~~~~~~~ 273 (324) ..... -..+-...+.|- .. + ....++.|+|.+..+ ..|...++...++++-+- ..+..+=.+.++.. 
T Consensus 231 dLladk~~~l~n~~~~ptE~~Aa~~i~s~k~iGGl~a~~~P--fFP~~~ilVT~L~NLsIY~Q~gs~RR~~~d~p~---- 304 (337) T protein:vir:78 231 ELLHDKYFPIVNATQAPTERLAADLIVSQKRIGNLPAVRVP--FFPKRALMVTKLSNLSIYYQEGARRRTLKEVPE---- 304 (337) T ss_pred hhhHHHHHHHHhcCCCcHHHHHHHHHHHhhhhcCcceEEcc--ccCCCceEEeechhcEEEEecCcEEEEEEeccc---- Confidence 76542 122222222221 10 1 113578999998755 455667788888876443 33333333333222 Q ss_pred ccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeeccCCC Q lcl|NC_011614. 274 KNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADAKPS 318 (324) Q Consensus 274 ~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~~~~~ 318 (324) ++.+.-.=...-|+.|-+..+++.++...-..+ T Consensus 305 ------------r~rie~y~s~Ne~YvVEd~~~~a~iEnI~~~~a 337 (337) T protein:vir:78 305 ------------RDRIENYESSNDAYVVEDFGCGCVAENIELAAA 337 (337) T ss_pred ------------cccccchhhccceeeeeccccEEEEeceeecCC Confidence 111111111235778888899888876555444 No 216 >protein:vir:6061 Length: 357 # NCBI annotation: gpN # Family: family:all:201 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878202;genbank:gi:33438901;genbank:GeneID:1457736 Probab=86.31 E-value=0.047 Score=27.89 Aligned_cols=299 Identities=15% Similarity=0.131 Sum_probs=156.0 Q ss_pred CchhhHHHHHHHHHhhccchhhhhccccccc-cCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCce-EEEEEeC Q lcl|NC_011614. 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMM-HEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEK-KFTFWAD 78 (324) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~-~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~-~ip~~~~ 78 (324) |...-. ....+|.... .+. .++.. ..+..+.|-|.+.+.+...+.+.+-++++.+.+++..-.. 
.+-.-.+ T Consensus 1 M~~~tr--~~~~~y~~~~---A~~--ngv~~~d~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lg~~ 73 (357) T protein:vir:60 1 MRQETR--FKFNAYLSRV---AEL--NGIDAGDVSKKFTVEPSVTQTLMNTMQESSDFLTRINIVPVSEMKGEKIGIGVT 73 (357) T ss_pred CChHHH--HHHHHHHHHH---HHH--hCCChHHhcceeecCHHHHHHHHHHHHHHHHHhccCCccccccceeeEEecccC Confidence 654432 2223333221 111 11111 1123466888899999999999999999999888775433 2333334 Q ss_pred Ccceeeec--ccccc-cccccceeeEEeeeeeEEEeehhHHHHHhc--ChhHHHHHHHHHHHHHHHHHHHHHHHhccCc- Q lcl|NC_011614. 79 KPGAYWVG--EGQKI-ETSKATWVNATMRAFKLGVILPVTKEFLNY--TYSQFFEEMKPMIAEAFYKKFDEAGILNQGN- 152 (324) Q Consensus 79 ~~~a~~v~--Eg~~~-~~~~~~~~~v~~~~~k~~~~v~iS~ell~~--s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~- 152 (324) ++-++-+. -+.+. |..-..++.-.+..++.-.-..|+.+.++. ..++|...+.+.+.+.++...=.-.|+|+.- T Consensus 74 g~iagrtdT~~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A 153 (357) T protein:vir:60 74 GSIASTTDTAGGTERQPKDFSKLASNKYECDQINFDFYIRYKTLDLWARYQDFQLRVRNAIIKRQSLDLIMAGFNGVRRA 153 (357) T ss_pred cccccccccCCCCCcccccccccCCCccEEEEeeeeccccHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeee Confidence 44444331 11222 222245666666676666666677766652 2368888888888888887777777788431 Q ss_pred -------CcCC----ccccccc----------------cccc-c-e-eecccch---hHH-HHHHHH-hhhhccCC--CE Q lcl|NC_011614. 153 -------NPFG----KSIAQSI----------------EKTN-K-V-IKGDFTQ---DNI-IDLEAL-LEDDELEA--NA 195 (324) Q Consensus 153 -------~~~~----~~~~~~~----------------~~~~-~-~-~~~~~~~---~~i-~~~~~~-l~~~~~~~--~~ 195 (324) ++.+ .|.+... +.+. . . ....-+| |.+ .++... ++..+++. 
-+ T Consensus 154 ~~Td~~~nPllqDVN~GWlQ~~Re~ap~rVm~~~~~~~g~~~~~~i~~G~~gdy~NLDalV~D~~~~lI~~~~~~d~dLV 233 (357) T protein:vir:60 154 ETSDRSSNQMLQDVAVGWLQKYRNEAPARVMSKVTDEEGHTTSEVIRVGKGGDYASLDALVMDATNNLIEPWYQEDPDLV 233 (357) T ss_pred ccCChhhCcCccccchhHHHHHHhhchhhhhccccccCCccccceeeecCCCCcccHHHHHHHHHhccCChHHhcCCCEE Confidence 1111 1111100 0000 0 0 0111233 332 345654 46655544 36 Q ss_pred EEEcHHHHH-HHHHhhccCCcee--ec-cC--CCceecccceEeecCccCCCceEEEeecccEEEE-EecceEEEEeecc Q lcl|NC_011614. 196 FISKTQNRS-LLRKIVDPETKER--IY-DR--NSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYG-IPQLIEYKIDETA 268 (324) Q Consensus 196 ~v~~~~~~~-~L~~l~d~~g~~~--~~-~~--~~~~l~G~pv~~~~~~~~~~~~i~~gd~~~~~~~-~~~~~~i~~~~~~ 268 (324) .+|...... +-..+-...+.|- +. +. ...++.|+|.+..+ ..|...++...++++-+- ..+..+=.+.+.. T Consensus 234 vivG~dLla~k~~~l~n~~~~pTE~~Aa~~i~s~k~iGGl~a~~~P--fFP~~~llVT~L~NLsIY~Q~gs~RR~~~d~p 311 (357) T protein:vir:60 234 VIVGRQLLADKYFPIVNREQDNSEMLAADVIISQKRIGNLPAVRVP--YFPADAMLITKLENLSIYYMDDSHRRVIEENP 311 (357) T ss_pred EEEchhhhhHHhhhHhhcCCChHHHHHHHHHHHhhhhcCcceEEcc--ccCCCceEEeeccccEEEEecCcEEEEEEecc Confidence 777777643 2223322222221 11 11 13578999998755 455567788888876443 3333333333322 Q ss_pred cccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeeccCCCCccccC Q lcl|NC_011614. 269 QLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADAKPSSVPGEV 324 (324) Q Consensus 269 ~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~~~~~ 324 (324) ..+. ..++.. 
..-|+.|-+..++|.++...-.....|++- T Consensus 312 ~r~r-------iE~y~s---------~Ne~YvVEd~~~~a~iE~i~~~~~~~pa~~ 351 (357) T protein:vir:60 312 KLDR-------VENYES---------MNIDYVVEDYAAGCLVEKIKVGDFSTPAKA 351 (357) T ss_pred cccc-------ccchhh---------hcceeeeeccccEEEeeeeeeccCcccccC Confidence 2111 111122 235777888888888876655555556665 No 217 >protein:vir:348 Length: 321 # NCBI annotation: major virion structural protein # Family: family:all:3198 # MgeID: mge:9 # MgeName: Mx8 # Cross-refs: genbank:acc:NP_203462;genbank:gi:15320618;genbank:GeneID:921734 Probab=86.10 E-value=0.048 Score=27.82 Aligned_cols=276 Identities=11% Similarity=-0.007 Sum_probs=136.6 Q ss_pred CchhhHHHHHHHHHhhccchhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhc----eeeecC-CCceEEEE Q lcl|NC_011614. 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLG----KYEPME-GTEKKFTF 75 (324) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~----~~~~~~-~~~~~ip~ 75 (324) |--.|.-+...-. -.+.+..+.+.+...++|+... ++.+.+ +.++..|. T Consensus 1 mp~~~lsel~t~t--------------------------l~~rs~~~~D~v~~~n~LL~~L~~kG~~~~~~gg~~I~~~l 54 (321) T protein:vir:34 1 MPFPNISDIITTT--------------------------IESRSGVIADNVTKNNAILARLAKRGKPRLVSGGYTILEEL 54 (321) T ss_pred CCCchHHHHHHHH--------------------------HHhhcchhhhhhhcccHHHHHHHhcCcccccCCCeeEEEEE Confidence 3222211111000 1122233444455555555432 222333 34566666 Q ss_pred EeC-Ccceeee-cccccccccccceeeEEeeeeeEEEeehhHH-HHHhcCh----hHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_011614. 76 WAD-KPGAYWV-GEGQKIETSKATWVNATMRAFKLGVILPVTK-EFLNYTY----SQFFEEMKPMIAEAFYKKFDEAGIL 148 (324) Q Consensus 76 ~~~-~~~a~~v-~Eg~~~~~~~~~~~~v~~~~~k~~~~v~iS~-ell~~s~----~~~~~~v~~~l~~ai~~~~d~a~l~ 148 (324) .-. ..++.|. ++..-.......|+..++..+.++..+.||- |++..+. ++|...-.+...+.++..+|..+-. 
T Consensus 55 ~y~~~s~~~wy~Gyd~l~~~p~d~~~~Aef~wk~aa~~~~isg~e~l~n~g~~~~idll~~~~~~ae~t~~n~l~~~l~s 134 (321) T protein:vir:34 55 SFSGNSNGGWYSGYDVLPTAPQDVISSAEYALKQYAVPVVISGLEMLQNSGKEAQLDLLEARMNVAEATMANDISAALYG 134 (321) T ss_pred eeccCcceeEEEeeeeeccchhhhccccccchhheeEeeEEehhHHhhccchHHHHHHHHHHHHHHHHHHHhhhhHhhhc Confidence 544 7788997 4544445556779999999999999999987 5555554 4666666677778888888887664 Q ss_pred -ccCcC-cCCcccccccccc--ccee------------------ecccchhHHHHHHHHh----hhhccCCCEEEEcHHH Q lcl|NC_011614. 149 -NQGNN-PFGKSIAQSIEKT--NKVI------------------KGDFTQDNIIDLEALL----EDDELEANAFISKTQN 202 (324) Q Consensus 149 -g~g~~-~~~~~~~~~~~~~--~~~~------------------~~~~~~~~i~~~~~~l----~~~~~~~~~~v~~~~~ 202 (324) |++.+ .+-.|+...+... .++. .+..+...+..++.++ .-....|..|++...- T Consensus 135 dGTa~g~~~i~GL~~lv~~~p~tGtvGGIdra~~~~WRn~~~d~~~~~t~~tl~~~m~~~w~~~~Rg~~~PDlii~~~~~ 214 (321) T protein:vir:34 135 DGTAFGGRAINGLDGAVPVDPTVGTYGGINRALWPFWRSQVEDMAAVATINTIQPAMTKLWSRCVRGADMPDLIMSGNDA 214 (321) T ss_pred cccccccchhhhhhhhcccCCCCceeccccccchhhhhhhhhhhhhcccHHHHHHHHHHHHHhhccCCCCccEEEechHH Confidence 44422 2222222222211 1111 1112333343333333 3345577889999888 Q ss_pred HHHHHHhhccCCceeeccCC-----CceecccceEeecC--ccCCCceEEEeecccEEEEEecceEEEEeeccccccccc Q lcl|NC_011614. 203 RSLLRKIVDPETKERIYDRN-----SDSLDGLPVVNLKS--SNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKN 275 (324) Q Consensus 203 ~~~L~~l~d~~g~~~~~~~~-----~~~l~G~pv~~~~~--~~~~~~~i~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~ 275 (324) |...+.-.....|+.-.+.. .=.+.|..|+.-+. ...+....|+-|-+++-+-...+-.+......... T Consensus 215 y~~y~~s~q~~qR~~~~~~a~~Gf~~Lky~~~div~D~~~g~~~pan~~yfiNT~yl~~r~h~~~~~~pi~p~r~~---- 290 (321) T protein:vir:34 215 WTTYSNSLQVLQRFTSAEEANLGFRSLKFLSTDVVLDGGIGGFAGANTMYFLNTKYLHFRPHKDRNMVPLSPSRRA---- 290 (321) T ss_pred HHHHHHhhheeeeecccccccccceeeeeeeEEEEEeCCCCCCccccceeeeecceEEEEEcCCCceeecCccccc---- Confidence 87776533332222211111 11245555555432 34677788888888776665544333332221100 Q ss_pred ccccchhhhhcCcEEEEEEEEeccEEecccceEEEEee Q lcl|NC_011614. 
276 EDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPA 313 (324) Q Consensus 276 ~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~ 313 (324) + +-+|.+.--...+.....-++.+=.+|+.. T Consensus 291 ----~---~NqdA~~q~I~~~GnL~~sn~~~~~vL~~~ 321 (321) T protein:vir:34 291 ----A---FNQDAEAQILAWAGNLTCSGAQFQGRLIAE 321 (321) T ss_pred ----c---cchhHHhhhhhhhheeeeecccceeEEeeC Confidence 0 011111111122333344455555555543 No 218 >protein:vir:270 Length: 341 # NCBI annotation: putative major capsid protein # Family: family:all:201 # MgeID: mge:7 # MgeName: K139 # Cross-refs: genbank:acc:NP_536650;genbank:gi:17975128;genbank:GeneID:929084 Probab=85.55 E-value=0.052 Score=27.62 Aligned_cols=299 Identities=10% Similarity=0.044 Sum_probs=143.5 Q ss_pred Cch--hhHHHHHHHHHhhccchhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCceE-EEEEe Q lcl|NC_011614. 1 MEQ--TQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKK-FTFWA 77 (324) Q Consensus 1 m~~--~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~-ip~~~ 77 (324) |.+ +++.+....+|..... ...++.. ....+.|-|.+.+.+...+.+.+-+++..+.+++..-... +-.-. T Consensus 1 m~~~m~~~tr~~~~~y~~~~A-----~~ngv~~-~~~~FsV~P~v~q~L~~~i~ess~FL~~Invv~V~e~~Ge~v~lg~ 74 (341) T protein:vir:27 1 MSQILTQSAREYMDNFAQQLA-----KSYGVSN-VAELFNVSPQLETKLRAAITESAEFLKMITVTTVDQIEGQVVDVGV 74 (341) T ss_pred CcccccHHHHHHHHHHHHHHH-----HHcCccc-ccceEeecHHHHHHHHHHHHhhHHhhhcCccccccceeeeEeeccc Confidence 664 2333333333333211 1112222 2334667788889999999999999999998888754432 33333 Q ss_pred CCcceeeecccccccccccceeeEEeeeeeEEEeehhHHHHHhcC-----hhHHHHHHHHHHHHHHHHHHHHHHHhccCc Q lcl|NC_011614. 
78 DKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYT-----YSQFFEEMKPMIAEAFYKKFDEAGILNQGN 152 (324) Q Consensus 78 ~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~v~iS~ell~~s-----~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~ 152 (324) +++-+.-+.- +..|- ++.++...+...+.-.-..|+.+.++.= -++|...+.+.+.++++..+=.-.|+|+.- T Consensus 75 ~g~iagrtdt-~R~~r-~~~l~~~~Y~c~qtn~dt~i~y~~lDaWA~~g~~~dF~~r~~~~i~~~~ALD~i~IGfnGts~ 152 (341) T protein:vir:27 75 SGLYTGRKAG-GRFTK-QVGVGGHKYKLAETDSCAAITWAMLCQWANQGGRDQFMKHLTEFSNQMFALDIMRIGWNGVSA 152 (341) T ss_pred ccceeeccCC-Cceec-ccccCCcceEEEEeeeeeeecHHHHHHHHhcCCChHHHHHHHHHHHHHHhhhhhhhcccceee Confidence 3444433321 22222 2356666666666555566666655421 367999999999999888887888888541 Q ss_pred --------CcCC----ccccccccc--------cccee-ecccchhH----HHHHHHH-hhhhccCC--CEEEEcHHHHH Q lcl|NC_011614. 153 --------NPFG----KSIAQSIEK--------TNKVI-KGDFTQDN----IIDLEAL-LEDDELEA--NAFISKTQNRS 204 (324) Q Consensus 153 --------~~~~----~~~~~~~~~--------~~~~~-~~~~~~~~----i~~~~~~-l~~~~~~~--~~~v~~~~~~~ 204 (324) ++.+ .|.+...-. ...+. ...-+|.. +.++... |+..+++. -+.+|...... T Consensus 153 A~~Td~~anPllqDVNkGWlQ~~Re~a~~rVl~~~~~~~g~~gdy~nLDAlV~D~~~~lI~~~~~~d~dLVvivG~dLla 232 (341) T protein:vir:27 153 EADTDPSANPLGQDVNEGWIAFVKNRKASQVVDVDVYFDETNGDYRTLDAMASDIINNQIHPMFRNDPRLTVFVGSGLIG 232 (341) T ss_pred ccCCChhhcccccccchhHHHHHHhhcccceeccceeeccCCCccccHHHHHHHHHhcccChHHhcCCCEEEEEchhhhh Confidence 1111 111111100 00011 11223333 3445543 45555544 36778766644 Q ss_pred -HHHHhhccCCcee---eccCCCceecccceEeecCccCCCceEEEeecccEEEEEecc-eEEEEeeccccccccccccc Q lcl|NC_011614. 205 -LLRKIVDPETKER---IYDRNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQL-IEYKIDETAQLSTVKNEDGT 279 (324) Q Consensus 205 -~L~~l~d~~g~~~---~~~~~~~~l~G~pv~~~~~~~~~~~~i~~gd~~~~~~~~~~~-~~i~~~~~~~~~~~~~~~~~ 279 (324) +-..+-.....|- -......++.|+|.+..+ ..|...++...++++-+-...+ .+=.+-+....+. 
T Consensus 233 ~k~~~l~n~~~~ptE~~Aa~~i~k~iGGlpa~~~P--ffP~~~~lVT~L~NLsIY~Q~gs~RR~~~d~p~r~r------- 303 (341) T protein:vir:27 233 AAQAKLYDKADKPSEQIAAQKLDKTIAGRPAYVPP--FLPDNAMVVTIPENLQVLTQHGTAQRKAKHESDRKR------- 303 (341) T ss_pred hhhhhhhccCCCCHHHHHHHHHHHhhCCCeEEEcc--ccCCCceEEeeccceEEEEecCcEEEEEEecccccc------- Confidence 2222322211111 111113589999998755 4566678888888876544433 2222222221111 Q ss_pred chhhhhcCcEEEEEEEEeccEEecccceEEEEeeccCCCCccccC Q lcl|NC_011614. 280 PVNLFEQDMVALRATMHVALHIADDKAFAKLVPADAKPSSVPGEV 324 (324) Q Consensus 280 ~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~~~~~ 324 (324) ..+ |++ +|.++-+ |. ...-.|..++..+++=- ...|- T Consensus 304 ie~-yes---~YvVEdy-g~--~~~~~~~~vkl~~~~~~-~~~~~ 340 (341) T protein:vir:27 304 SKT-HTG---AWKVTQW-VC--WKRSPLTTQKKSTSALN-HRSER 340 (341) T ss_pred ccc-hhh---hheeehh-hh--hhhccccccccCccccc-ccccc Confidence 111 222 2332222 11 11222334443221100 01111 No 219 >protein:vir:100851 Length: 514 # NCBI annotation: hypothetical protein # Family: family:all:2450 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164744;genbank:gi:56693157;genbank:GeneID:3197484 Probab=85.34 E-value=0.054 Score=27.55 Aligned_cols=301 Identities=11% Similarity=-0.006 Sum_probs=137.4 Q ss_pred CchhhHHHHHHHH-Hhhc-------cchhh---hhccccccc------------cCCCcceec-hhhhHHHHHHH--Hhh Q lcl|NC_011614. 1 MEQTQKLKLNLQH-FASN-------NVKPQ---VFNPDNVMM------------HEKKDGTLL-NDFTTPILQEV--MEN 54 (324) Q Consensus 1 m~~~~~~~~~~~~-~~~~-------~~~~~---~~~a~~~~~------------~~~~g~lip-~~~~~~i~~~~--~~~ 54 (324) |-.+.|.|..+++ |-+. -..++ +...+++.- +..+|+.+- +.+..++.... .+. 
T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~a~t~gy~~~~~~~t~gaAlR~EsLd~~l~~Lt~~~~~ 80 (514) T protein:vir:10 1 MYTQDKTKDIMKKSFFGGDRAVAFDTNKEDILNENLPENVKKSAFTAGHSITPDTQTDGAANRIESLNRDLKVTTWGERD 80 (514) T ss_pred CCccchhhHHHhhhhcccceeeeecCcHHHHHHHhcchhhhhhhhccccccCCccccCccchhhhhhccceeEeeecCcc Confidence 7777666666543 1111 00111 111121111 111222221 22222221111 111 Q ss_pred cchhhhceeeecCCCceEEE---EEeCCcceeeecccccccccccceeeEEeeeeeEEEeehhHH--HHHhcChhHHHHH Q lcl|NC_011614. 55 SKIMQLGKYEPMEGTEKKFT---FWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTK--EFLNYTYSQFFEE 129 (324) Q Consensus 55 s~l~~l~~~~~~~~~~~~ip---~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~v~iS~--ell~~s~~~~~~~ 129 (324) -.+.+-..+.+..+....+- .+.....+.++.|+.-.+.+++.+....+..+-++....+|. ++. ++..+.+.. T Consensus 81 ftf~~~i~k~~a~STV~ey~~~~~~G~~G~~~f~~E~gi~~~~d~~~~rk~~~~k~l~~~~~vS~~~~l~-n~i~d~~~~ 159 (514) T protein:vir:10 81 FTLYNDIAKQPVDNTVLKYTQYYSHGRTGHSLFQPEIGIGDVNNPNERQRTINIKYIVDTHVTSIALQRA-NTIVDSLKV 159 (514) T ss_pred hhhhhhcCCchhhHHHhhhhhhcccCcccccccccccccCcCCCcceEEEEEeeeeeeeeeeeeehhhhc-cchhhHHHH Confidence 11122222233333222222 222334678899999999999999999988888777655554 443 366688888 Q ss_pred HHHHHHHHHHHHHHHHHHhccCc--------CcCCccccccccccccee--ecccchhHHHHHHHHhhhhccCCCEEEEc Q lcl|NC_011614. 130 MKPMIAEAFYKKFDEAGILNQGN--------NPFGKSIAQSIEKTNKVI--KGDFTQDNIIDLEALLEDDELEANAFISK 199 (324) Q Consensus 130 v~~~l~~ai~~~~d~a~l~g~g~--------~~~~~~~~~~~~~~~~~~--~~~~~~~~i~~~~~~l~~~~~~~~~~v~~ 199 (324) ..+.-.-.++..+|.++|.|+.. +-+..|+...+...+... 
...++.+.|..+--.+..++..++-++|+ T Consensus 160 ~~~dai~~ia~tiE~a~FyGDs~L~s~~~~~gleFDGl~~lI~~~NvIDarG~~Ls~~~ln~aA~~i~~gfGt~TD~ylp 239 (514) T protein:vir:10 160 QEYAAISTVIKTDEWAMFYGDADLTSGQKGEGLQFDGLFKLIAPENHIDLRGGRLSPAALNMAARKIGEGFGTPTDAYMP 239 (514) T ss_pred HHHHHHHHHHHHHHHHHhhhcccCCCccccCcchhhhHHHhhcCCCeEecCCCCccHHHHhhhhhhhhcccCChhheeCc Confidence 88888889999999999988653 112233333333222221 22456555655555566677778889999 Q ss_pred HHHHHHHHHhhccCCceeeccCCCceecccceE--eec--CccCCCceEEEeecccEEEEEecceEEEEeeccccccccc Q lcl|NC_011614. 200 TQNRSLLRKIVDPETKERIYDRNSDSLDGLPVV--NLK--SSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKN 275 (324) Q Consensus 200 ~~~~~~L~~l~d~~g~~~~~~~~~~~l~G~pv~--~~~--~~~~~~~~i~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~ 275 (324) ..+.+.|..--....+.++....++...|+|+- .+. ...+.... +.+.++. +........+ T Consensus 240 ~~vka~f~~~~~~~qRV~~~~n~~~~~~G~~v~~f~s~~G~I~L~gs~-im~~~n~----------L~~~~~~~~~---- 304 (514) T protein:vir:10 240 IGIKADFVNQHLNGQRVMLPGQTGGMTTGLDIDKFLSAHGSIRIQGST-IMDSDNK----------LDFDRPVSPT---- 304 (514) T ss_pred hHHHHHHhhcccCcceEEeecCccceeeeeeccceeEeccceeecCCe-eeccccc----------CccCCccCCc---- Confidence 999888764333333444433334445666652 111 11111111 1111111 1000000000 Q ss_pred ccccchhhhhcCcEEEEEEEEec-------c------EEecccce----EEEEeeccCCCCccccC Q lcl|NC_011614. 276 EDGTPVNLFEQDMVALRATMHVA-------L------HIADDKAF----AKLVPADAKPSSVPGEV 324 (324) Q Consensus 276 ~~~~~~~~f~~~~v~~r~~~r~d-------~------~v~~~~a~----~~l~~~~~~~~~~~~~~ 324 (324) .-...++.+-+...-+ . 
.+.+.++- =++......+.+.|-++ T Consensus 305 -------Ap~~~~va~svT~~~~g~~~~ad~t~~~g~~~~~~~~g~~~sYaVv~~n~~GeS~ps~~ 363 (514) T protein:vir:10 305 -------APTAPQLSATVTPDGGGLWHEADKTDSKGEVILNKEVGVEQSYVAVMVSRHGDSRPSLV 363 (514) T ss_pred -------CCCCCcceEEEecCcccccCcccccccccccccccccceeEEEEEEEECCCCcccccce Confidence 0011111111111100 0 01222221 12344455555666665 No 220 >protein:vir:78920 Length: 290 # NCBI annotation: Cps # Family: family:all:701 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468846;genbank:gi:157325479;genbank:GeneID:5601917 Probab=85.23 E-value=0.054 Score=27.52 Aligned_cols=270 Identities=8% Similarity=-0.038 Sum_probs=123.7 Q ss_pred CchhhHHHHHHHHHhhccchhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhc--eeeecCCCceEEEEEeC Q lcl|NC_011614. 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLG--KYEPMEGTEKKFTFWAD 78 (324) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~--~~~~~~~~~~~ip~~~~ 78 (324) |.-+ .-+.++..+.+.+...+....+. .+.-.++.+++||..+. T Consensus 1 Main----------------------------------~a~~~~~~Ld~~~~~~~~t~~l~~~~~~~~ggktVkI~~i~~ 46 (290) T protein:vir:78 1 MAIN----------------------------------YVDKYGKELDQKLVFGTYTNELETPNLLWLDAKTFKIQTITT 46 (290) T ss_pred Cchh----------------------------------HHHHHHHHHHHHHHhhheeeeccccceeeccCCEEEEeeecc Confidence 1110 01344555555554444433332 22335677899998775 Q ss_pred Ccce-eeecccccccccccceeeEEeeeeeEEEe-ehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhccCcCcCC Q lcl|NC_011614. 79 KPGA-YWVGEGQKIETSKATWVNATMRAFKLGVI-LPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFG 156 (324) Q Consensus 79 ~~~a-~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~-v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~~~~ 156 (324) ..-. +-.+.|-..+.-+.++...++.-.+.-.+ +.--+.-.......+...+.+...+.++-.+|...+.---+.... 
T Consensus 47 ~gl~DY~R~~g~~~g~v~~~~et~tl~qdR~~~F~vD~~DvDEt~~~~~~~nv~~ef~~~~v~PEiDayr~skla~~a~~ 126 (290) T protein:vir:78 47 TGLKAHTRNKGYNEGSASNTNKSYTIDFDRDVEFFVDVMDVDETGQALSAANVTKEFNSRHAGPEMDAYRFSKLATAAKT 126 (290) T ss_pred CcccccccCCCcccCccccceeeEEeeccccceeeccccchhHHhhhhhHHHHHHHHHHHHhhhhhhHHHHHHHHhhhhc Confidence 3222 22233333344445566666766664333 322121111112456677778888888888897766421110000 Q ss_pred cccccccccccceeecccchhHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccC-----C--ceeeccCCCceeccc Q lcl|NC_011614. 157 KSIAQSIEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPE-----T--KERIYDRNSDSLDGL 229 (324) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~-----g--~~~~~~~~~~~l~G~ 229 (324) .+ .....+.+.+.-++.+.+++.++......+-.++++|..+..|.....-. + ......+..+++.|+ T Consensus 127 ~~-----~~~~~t~t~~n~~~~i~~~~~~ldevp~~~rvl~vtp~~~~lL~~~~~f~r~~~~~~~~~~~i~~~V~~idG~ 201 (290) T protein:vir:78 127 NS-----NSVAEEITKDNVFTKLKAAIRKVKKYGTQNLVMYVSPDVMAALELSDDFVRAINVQNIGPSSIETRITAIDGT 201 (290) T ss_pred cC-----cccccccCHHHHHHHHHHHHHHHHhcCCCCeEEEECHHHHHHHhhChhhhccccccccccccccceeeeecCc Confidence 00 01111122334567777888888766555567889999999886432111 1 111235556789999 Q ss_pred ceEeecCcc--CCCceEEEeec-------ccEEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccE Q lcl|NC_011614. 230 PVVNLKSSN--LKRGELITGDF-------DKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALH 300 (324) Q Consensus 230 pv~~~~~~~--~~~~~i~~gd~-------~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~ 300 (324) +|+..++.. ..+..+.-|-. -++++. .....+-+..+....... ++.. -+-+.-.+.-..+.|.- T Consensus 202 ~ii~vps~~r~~t~~~f~~G~~~~~~ak~in~ii~-~~~a~i~~~K~~~~~~~~-P~~~----~~~d~~~~~~r~y~d~~ 275 (290) T protein:vir:78 202 RIVEVEAEDRFYDTFDFTDGYKPAAGAKKLNFLLV-NKGSVVGGAKHASIYLHA-PGSV----GQGDGWLYQYRVYHDIF 275 (290) T ss_pred EEEEecccchhhhhhhhcccccccCCccceeEEEE-cCCceeeeeeeeEEEeeC-CCCC----cCcceeeeeeeeeeeee Confidence 998766421 11111111110 012222 222222222222222111 1100 01122334444566766 Q ss_pred EecccceEEEEeecc Q lcl|NC_011614. 
301 IADDKAFAKLVPADA 315 (324) Q Consensus 301 v~~~~a~~~l~~~~~ 315 (324) |.+.+.=.+....+. T Consensus 276 v~~nk~~~i~~~~~~ 290 (290) T protein:vir:78 276 VLDQQKDGVIASTEV 290 (290) T ss_pred eeccccCeeEEEeeC Confidence 666444333332222 No 221 >protein:vir:3746 Length: 336 # NCBI annotation: orf15 # Family: family:all:201 # MgeID: mge:79 # MgeName: HP1 # Cross-refs: genbank:acc:NP_043487;genbank:gi:9628622;genbank:GeneID:1261135 Probab=82.44 E-value=0.077 Score=26.69 Aligned_cols=293 Identities=11% Similarity=0.031 Sum_probs=149.3 Q ss_pred CchhhHHHHHHHHHhhcc-chhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCce-EEEEEeC Q lcl|NC_011614. 1 MEQTQKLKLNLQHFASNN-VKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEK-KFTFWAD 78 (324) Q Consensus 1 m~~~~~~~~~~~~~~~~~-~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~-~ip~~~~ 78 (324) |.| +++...+.+.+... +... ..+.+.-+.|.|.+.+.+...+.+.+-++++.+.+++..-.. .+-.-.+ T Consensus 1 mtr-~~~~~y~~~~A~~ngv~~a-------~~~~~~~Fsv~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~lg~~ 72 (336) T protein:vir:37 1 MNK-QAYYALAAALAKHFNQPLD-------SVLRGESFALKAPEAALLGENIQQRSDFLKQINMIQVAHTKGQKLFGATE 72 (336) T ss_pred CcH-HHHHHHHHHHHHHhCCChh-------hhccCceeecCHHHHHHHHHHHHHHHHHhhcCceeecccccceEeeeccC Confidence 877 34444444444321 1111 112223467889999999999999999999999988874433 2333333 Q ss_pred CcceeeecccccccccccceeeEEeeeeeEEEeehhHHHHHhcC--hhHHH-HHHHHHHHHHHHHHHHHHHHhccC---- Q lcl|NC_011614. 79 KPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYT--YSQFF-EEMKPMIAEAFYKKFDEAGILNQG---- 151 (324) Q Consensus 79 ~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~v~iS~ell~~s--~~~~~-~~v~~~l~~ai~~~~d~a~l~g~g---- 151 (324) ++-+.-..- ...+ .+..++.-.+..++.-.-..|+.+.++.= .+++. ..+...+.+.++..+=.-.|+|+. 
T Consensus 73 g~iagrtdt-~R~~-~~~~l~~~~Y~c~qTn~dt~i~y~~LD~WA~~~df~~~~~~~~~~r~iALD~i~IGfnG~s~A~~ 150 (336) T protein:vir:37 73 KGVTGRKQT-GRNL-ANLDHTQNGFELAETDSGIIVPWALFDSFAIFKDRLVELYSEYFQNQVALDILQIGWNGQSVADN 150 (336) T ss_pred cccccccCC-Cccc-cccCcCCcccEEEEeeeeeeecHHHHHHHhcChhHHHHHHHHHHHHHHhhchhhhcccceeeccC Confidence 443332221 1222 22456666677777666677777766632 13432 333333455555444455667742 Q ss_pred -cCcCC----ccccc------------cc-ccccce-e-eccc---chhH-HHHHHHHhhhhccCC--CEEEEcHHHHH- Q lcl|NC_011614. 152 -NNPFG----KSIAQ------------SI-EKTNKV-I-KGDF---TQDN-IIDLEALLEDDELEA--NAFISKTQNRS- 204 (324) Q Consensus 152 -~~~~~----~~~~~------------~~-~~~~~~-~-~~~~---~~~~-i~~~~~~l~~~~~~~--~~~v~~~~~~~- 204 (324) .++.+ .|.+. .. ...... . ...- +.|. +.++...|+..+++. -+.+|...... T Consensus 151 TdnPllqDVNkGWlQ~~Re~a~~~v~~~~~~~~g~i~~~G~~gdy~NLDalV~D~~~~I~~~~~~d~dLVvivG~dLla~ 230 (336) T protein:vir:37 151 TTKADLSDVNKGWLKLLQEQRAANFMTESTKSSGKITIFGDNADYANLDDLAFDLKQGLDFRHQNRNDLVFLVGADLVSK 230 (336) T ss_pred CCCCcccccchhHHHHHHhccchhhcccccccCCceEEecCCCCcccHHHHHHHHHhcCchHHhcCCCeEEEEchhhhhh Confidence 11111 11111 00 000000 0 1111 2333 356666676666553 36677776532 Q ss_pred HHHHhhccCC-cee--ec---cCCCceecccceEeecCccCCCceEEEeecccEEEEEecc-eEEEEeeccccccccccc Q lcl|NC_011614. 205 LLRKIVDPET-KER--IY---DRNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQL-IEYKIDETAQLSTVKNED 277 (324) Q Consensus 205 ~L~~l~d~~g-~~~--~~---~~~~~~l~G~pv~~~~~~~~~~~~i~~gd~~~~~~~~~~~-~~i~~~~~~~~~~~~~~~ 277 (324) ....+-...+ +|- .. .....++.|+|.+..+ ..|...++...++++-+-...+ .+=.+-+....+ T Consensus 231 ~~~~l~~~~~~~PtE~~Aa~~~~~~k~iGGlpa~~~P--ffP~~~~lVT~L~NLsIY~Q~gs~RR~~~d~p~r~------ 302 (336) T protein:vir:37 231 ETKLIQQKHGLTPTEKAALGSHNLMGSFGGMNAITPP--NFPARAAAVTTLKNLSVYTEAESVRRSLRNDEDKK------ 302 (336) T ss_pred hhhhhhhhcCCCHHHHHHHHHHHHHHhhCCceeEEcc--ccCCCceEEeechhcEEEEecCcEEEEEEEccccc------ Confidence 2222333332 221 11 1123578999998755 4556678888888875544333 332332222111 Q ss_pred ccchhhhhcCcEEEEEEEEeccEEecccceEEEEeeccCCCCccccC Q lcl|NC_011614. 
278 GTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADAKPSSVPGEV 324 (324) Q Consensus 278 ~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~~~~~ 324 (324) ...++. ...-|+.|-+..++|.++.... .-|+|| T Consensus 303 -rie~y~---------s~Ne~YvVEd~~~~a~iE~i~v---~~~~e~ 336 (336) T protein:vir:37 303 -GLVTSY---------YRQEGYVVEDLGLMTAIDHTKV---KLNGEV 336 (336) T ss_pred -cccchh---------hhcceeeeeccccEEEeeeeee---eecCcC Confidence 111112 1235777888999998876533 236777 No 222 >protein:vir:5942 Length: 523 # NCBI annotation: similar to major head protein # Family: family:all:364 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835728;genbank:gi:30044131 Probab=82.34 E-value=0.078 Score=26.67 Aligned_cols=301 Identities=10% Similarity=-0.003 Sum_probs=116.8 Q ss_pred CchhhHHHHH----HHHHhhccchhhhhccccccccCCCcceechhhhHHHHH---HHHhhcchhhhce---ee-----e Q lcl|NC_011614. 1 MEQTQKLKLN----LQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQ---EVMENSKIMQLGK---YE-----P 65 (324) Q Consensus 1 m~~~~~~~~~----~~~~~~~~~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~---~~~~~s~l~~l~~---~~-----~ 65 (324) |.-+. .|.. .+....+-......+......+.....-......+.... ............+ .. . T Consensus 162 ~s~si-~k~~vTa~s~agta~~~li~A~~~q~itg~tga~fa~s~~~an~astAss~Al~gEA~t~~sTd~at~~~Gtt~ 240 (523) T protein:vir:59 162 SSGAV-YYVDVPVASLPGVADVNTVRFWQYDDASGDPENTVAYPLPRYNRIVGAVGSALYARLFFVTGSDFATVAGGTPS 240 (523) T ss_pred cccce-eeeeccccccccccccccccccccccccccccccccchhhccccccccccccccccccccccccccccCCCccc Confidence 21110 0000 000000000000000000000000000000000000000 0000000000000 00 0 Q ss_pred cCCCceEEEEEe----CCcc-----eeeecccccccccccceeeEEeeeeeEEEeehhHHHHHhcC-----hhHHHHHHH Q lcl|NC_011614. 66 MEGTEKKFTFWA----DKPG-----AYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYT-----YSQFFEEMK 131 (324) Q Consensus 66 ~~~~~~~ip~~~----~~~~-----a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~v~iS~ell~~s-----~~~~~~~v~ 131 (324) ..+....+.... ...+ ...-.++...++-..+++++++.++..+-....|-||.+|- ..|.++.|. 
T Consensus 241 t~~~~~lyt~~~g~~t~~~~~~~~~~~~~~~~~~~~eM~FsIeK~tVtAkSRaLKAeYT~ELAQDLKAiH~GLDAE~ELa 320 (523) T protein:vir:59 241 TQDLDLVYYIDARNDFEDQSTDPDYPDPGFQSLDIPEINLELRSRPVATKTRKLRAAWTPEAMQDLAAYHKGVDLENEIV 320 (523) T ss_pred ccccccccccccccchhhccccccccccccccccccceeeEEEeEEEeeecccccccccHHHHHHHHHHhcCCChhHHHH Confidence 000000000000 0001 11123455678888889999999999999999999999984 456999999 Q ss_pred HHHHHHHHHHHHHHHHhccCcC--------cCCcccccccccccceeecccc----hhHHHHHHHHhh-------h--hc Q lcl|NC_011614. 132 PMIAEAFYKKFDEAGILNQGNN--------PFGKSIAQSIEKTNKVIKGDFT----QDNIIDLEALLE-------D--DE 190 (324) Q Consensus 132 ~~l~~ai~~~~d~a~l~g~g~~--------~~~~~~~~~~~~~~~~~~~~~~----~~~i~~~~~~l~-------~--~~ 190 (324) +.|...|...+++.||.---+. ....++..........-...-. .+-...|+-++. . .+ T Consensus 321 nILStEImlEINR~ii~~~~~~a~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~n~i~~~t~~ 400 (523) T protein:vir:59 321 TLMSQYIAREIDLEILSTIMAHARRTDNYGFWSEVVGEYYDETSGNFVAGNFYGSKQEWLATLMIELNKVSNRIQQKTAV 400 (523) T ss_pred HHHHHHHHHHhhHHHHHhHhhhheeeeeccccccceeeecccccchhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHhccc Confidence 9999999999999988632111 0111111110000000000001 122233333332 1 12 Q ss_pred cCCCEEEEcHHHHHHHHHhhccCCcee-eccCC----Cceec-ccceEeecCccCCCceEEEeecccEEEEEecce---- Q lcl|NC_011614. 191 LEANAFISKTQNRSLLRKIVDPETKER-IYDRN----SDSLD-GLPVVNLKSSNLKRGELITGDFDKLIYGIPQLI---- 260 (324) Q Consensus 191 ~~~~~~v~~~~~~~~L~~l~d~~g~~~-~~~~~----~~~l~-G~pv~~~~~~~~~~~~i~~gd~~~~~~~~~~~~---- 260 (324) .....++|+++....|...---+++.. -...+ .|.|. |++|++.+. .+..-++.| +.+.. T Consensus 401 ~~~~~~~~s~~v~~~l~~~~~~~~~~~~~~~~~~~~~~g~l~~~~~vy~d~~--~~~dy~~~g--------~k~~~~~~~ 470 (523) T protein:vir:59 401 AGANFLVTSPQVAALLESMPGFTPGNDNRDGGTGIFYVGMVQGRYRLYKNIY--QNQPVIIMG--------NQDLNTPWQ 470 (523) T ss_pred ccccEEEEchhHHHHHHhccccccCCccccccccceeEEEecCceEEEecCC--CCcceEEEE--------ecccCCccc Confidence 245678999999998864211111111 11111 13343 456665332 222222222 22211 Q ss_pred -EEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeeccCC Q lcl|NC_011614. 
261 -EYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADAKP 317 (324) Q Consensus 261 -~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~~~~ 317 (324) .+-+.....+..+.... -|. -| +=.+=...|++..|.+|-+..+|-.+--.| T Consensus 471 ~~~~y~Py~~l~~~~~~~-dp~-s~---qp~~~~~tRY~l~v~nP~~~~~~~~~~~~~ 523 (523) T protein:vir:59 471 TGAVYAPYVPLLFTPTIV-DPV-NF---SYRRGLMTRYALEVVRPEFYGLLYVKLLQP 523 (523) T ss_pred ccceecccchhhcccccc-cCC-cc---cceeeeeeehhheecchhHhhhhhhhhcCC Confidence 11111111111111000 000 13 223444689999999999988776555555 No 223 >protein:vir:2016 Length: 357 # NCBI annotation: gpN # Family: family:all:201 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046760;genbank:gi:9630331;genbank:GeneID:1261541 Probab=80.19 E-value=0.097 Score=26.13 Aligned_cols=299 Identities=15% Similarity=0.143 Sum_probs=156.7 Q ss_pred CchhhHHHHHHHHHhhccchhhhhccccccc-cCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCce-EEEEEeC Q lcl|NC_011614. 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMM-HEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEK-KFTFWAD 78 (324) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~-~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~-~ip~~~~ 78 (324) |...-. ....+|.... .+. .++.. ..+..+.|-|.+.+.+...+.+.+-++++.+.+++..-.. .+-.-.+ T Consensus 1 M~~~tr--~~~~~y~~~~---A~~--ngv~~~d~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lg~~ 73 (357) T protein:vir:20 1 MRQETR--FKFNAYLSRV---AEL--NGIDAGDVSKKFTVEPSVTQTLMNTMQESSDFLTRINIVPVSEMKGEKIGIGVT 73 (357) T ss_pred CChHHH--HHHHHHHHHH---HHH--hCCChHHhcceeecCHHHHHHHHHHHHHHHHHhccCCccccccceeeEEecccC Confidence 654432 2223333221 111 11111 1123466888899999999999999999999888775433 2333334 Q ss_pred Ccceeeec--ccccc-cccccceeeEEeeeeeEEEeehhHHHHHhc--ChhHHHHHHHHHHHHHHHHHHHHHHHhccCc- Q lcl|NC_011614. 79 KPGAYWVG--EGQKI-ETSKATWVNATMRAFKLGVILPVTKEFLNY--TYSQFFEEMKPMIAEAFYKKFDEAGILNQGN- 152 (324) Q Consensus 79 ~~~a~~v~--Eg~~~-~~~~~~~~~v~~~~~k~~~~v~iS~ell~~--s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~- 152 (324) ++-++-+. -+.+. |..-..++.-.+..++.-.-..|+.+.++. 
..++|...+.+.+.+.++...=.-.|+|+.- T Consensus 74 g~iagrtdT~~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A 153 (357) T protein:vir:20 74 GSIASTTDTAGGTERQPKDFSKLASNKYECDQINFDFYIRYKTLDLWARYQDFQLRIRNAIIKRQSLDFIMAGFNGVKRA 153 (357) T ss_pred ccccccccCCCCCCcccccccccCCCccEEEEeeecccccHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeee Confidence 44444331 11222 222235666666666666666677666652 2368888888888888887777777788431 Q ss_pred -------CcCC----ccccccc----------------cccc-c-e-eecccch---hHH-HHHHHH-hhhhccCC--CE Q lcl|NC_011614. 153 -------NPFG----KSIAQSI----------------EKTN-K-V-IKGDFTQ---DNI-IDLEAL-LEDDELEA--NA 195 (324) Q Consensus 153 -------~~~~----~~~~~~~----------------~~~~-~-~-~~~~~~~---~~i-~~~~~~-l~~~~~~~--~~ 195 (324) ++.+ .|.+... +.+. . . ....-+| |.+ .++... ++..+++. -+ T Consensus 154 ~~Td~~~nPllqDVN~GWlQ~~Re~ap~rVm~~~~~~~g~~~~~~i~~G~~gdy~NLDalV~D~~~~lI~~~~~~d~dLV 233 (357) T protein:vir:20 154 ETSDRSSNPMLQDVAVGWLQKYRNEAPARVMSKVTDEEGRTTSEVIRVGKGGDYASLDALVMDATNNLIEPWYQEDPDLV 233 (357) T ss_pred ccCChhhCcCccccchhHHHHHHhhchhhhhccccccccccccceeeecCCCCcccHHHHHHHHHhccCChHHhcCCCEE Confidence 1111 1111100 0000 0 0 0111233 332 345654 46655544 36 Q ss_pred EEEcHHHHH-HHHHhhccCCcee--ecc-C--CCceecccceEeecCccCCCceEEEeecccEEEE-EecceEEEEeecc Q lcl|NC_011614. 196 FISKTQNRS-LLRKIVDPETKER--IYD-R--NSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYG-IPQLIEYKIDETA 268 (324) Q Consensus 196 ~v~~~~~~~-~L~~l~d~~g~~~--~~~-~--~~~~l~G~pv~~~~~~~~~~~~i~~gd~~~~~~~-~~~~~~i~~~~~~ 268 (324) .+|...... +-..+-...+.|- +.. . ...++.|+|.+..+ ..|...++...++++-+- ..+..+=.+.+.. 
T Consensus 234 vivG~dLla~k~~~l~n~~~~ptE~~Aa~~i~s~k~iGGl~a~~~P--fFP~~~ilVT~L~NLsIY~Q~gs~RR~~~d~p 311 (357) T protein:vir:20 234 VIVGRQLLADKYFPIVNKEQDNSEMLAADVIISQKRIGNLPAVRVP--YFPADAMLITKLENLSIYYMDDSHRRVIEENP 311 (357) T ss_pred EEEchhhhhhhhhhHhhccCChHHHHHHHHHHHhhhhCCceeEEcc--ccCCCceEEeeccccEEEEecCcEEEEEEecc Confidence 777777643 2223322222221 111 1 13578999998755 455567788888876443 3333333333322 Q ss_pred cccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeeccCCCCccccC Q lcl|NC_011614. 269 QLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADAKPSSVPGEV 324 (324) Q Consensus 269 ~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~~~~~ 324 (324) ..+. ..++.. ..-|+.|-+..++|.++...-.....|++. T Consensus 312 ~r~r-------iE~y~s---------~Ne~YvVEd~~~~a~iE~i~~~~~~~p~~~ 351 (357) T protein:vir:20 312 KLDR-------VENYES---------MNIDYVVEDYAAGCLVEKIKVGDFSTPAKA 351 (357) T ss_pred cccc-------ccchhh---------hcceeeeeccccEEEeeeeeeccccCCccC Confidence 2111 111121 235777888888888886666555556666 No 224 >protein:vir:5694 Length: 357 # NCBI annotation: gpN # Family: family:all:201 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839853;genbank:gi:30065708;genbank:GeneID:1260602 Probab=77.72 E-value=0.12 Score=25.60 Aligned_cols=299 Identities=15% Similarity=0.140 Sum_probs=156.6 Q ss_pred CchhhHHHHHHHHHhhccchhhhhccccccc-cCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCce-EEEEEeC Q lcl|NC_011614. 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMM-HEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEK-KFTFWAD 78 (324) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~-~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~-~ip~~~~ 78 (324) |...-. ....+|.... .+. .++.. ..+..+.|-|.+.+.+...+.+.+-++++.+.+++..-.. 
.+-.-.+ T Consensus 1 M~~~tr--~~~~~y~~~~---A~~--ngv~~~d~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lg~~ 73 (357) T protein:vir:56 1 MRQETR--FKFNAYLSRV---AEL--NGIDAGDVSKKFTVEPSVTQTLMNTMQESSDFLTRINIVPVSEMKGEKIGIGVT 73 (357) T ss_pred CChHHH--HHHHHHHHHH---HHH--hCCChHHhcceeecCHHHHHHHHHHHHHHHHHhccCCccccccceeeEEecccC Confidence 654432 2223333221 111 11111 1123466888899999999999999999999888775433 2333334 Q ss_pred Ccceeeec--ccccc-cccccceeeEEeeeeeEEEeehhHHHHHhc--ChhHHHHHHHHHHHHHHHHHHHHHHHhccCc- Q lcl|NC_011614. 79 KPGAYWVG--EGQKI-ETSKATWVNATMRAFKLGVILPVTKEFLNY--TYSQFFEEMKPMIAEAFYKKFDEAGILNQGN- 152 (324) Q Consensus 79 ~~~a~~v~--Eg~~~-~~~~~~~~~v~~~~~k~~~~v~iS~ell~~--s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~- 152 (324) ++-++-+. -+.+. |..-..++.-.+..++.-.-..|+.+.++. ..++|...+.+.+.+.++...=.-.|+|+.- T Consensus 74 g~iagrtdT~~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A 153 (357) T protein:vir:56 74 GSIASTTDTAGGTERQPKDFSKLASNKYECDQINFDFYIRYKTLDLWARYQDFQLRVRNAIIKRQSLDFIMAGFNGVKRA 153 (357) T ss_pred ccccccccCCCCCCcccccccccCCCccEEEEeeecccccHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeee Confidence 44444331 11222 222245666666666666666677666652 2368888888888888887777777788431 Q ss_pred -------CcCC----ccccccc----------------cccc-c-e-eecccch---hHH-HHHHHH-hhhhccCC--CE Q lcl|NC_011614. 153 -------NPFG----KSIAQSI----------------EKTN-K-V-IKGDFTQ---DNI-IDLEAL-LEDDELEA--NA 195 (324) Q Consensus 153 -------~~~~----~~~~~~~----------------~~~~-~-~-~~~~~~~---~~i-~~~~~~-l~~~~~~~--~~ 195 (324) ++.+ .|.+... +.+. . . ....-+| |.+ .++... ++..+++. 
-+ T Consensus 154 ~~Td~~~nPllqDVN~GWlQ~~Re~ap~rVm~~~~~~~g~~~~~~i~~G~~gdy~NLDalV~D~~~~lI~~~~~~d~dLV 233 (357) T protein:vir:56 154 ETSDRSSNPMLQDVAVGWLQKYRNEAPARVMSKVTDEEGHTTSEVIRVGKGGDYASLDALVMDATNNLIEPWYQEDPDLV 233 (357) T ss_pred ccCChhhCcCccccchhHHHHHHhhchhhhhccccccCCccccceeeecCCCCcccHHHHHHHHHhccCChHHhcCCCEE Confidence 1111 1111100 0000 0 0 1111233 332 345654 46655544 36 Q ss_pred EEEcHHHHH-HHHHhhccCCcee--ecc-C--CCceecccceEeecCccCCCceEEEeecccEEEE-EecceEEEEeecc Q lcl|NC_011614. 196 FISKTQNRS-LLRKIVDPETKER--IYD-R--NSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYG-IPQLIEYKIDETA 268 (324) Q Consensus 196 ~v~~~~~~~-~L~~l~d~~g~~~--~~~-~--~~~~l~G~pv~~~~~~~~~~~~i~~gd~~~~~~~-~~~~~~i~~~~~~ 268 (324) .+|...... +-..+-...+.|- +.. . ...++.|+|.+..+ ..|...++...++++-+- ..+..+=.+.+.. T Consensus 234 vivG~dLla~k~~~l~n~~~~pTE~~Aa~~i~s~k~iGGl~a~~~P--fFP~~~llVT~L~NLsIY~Q~gs~RR~~~d~p 311 (357) T protein:vir:56 234 VIVGRQLLADKYFPIVNKEQDNSEMLAADVIISQKRIGNLPAVRVP--YFPADAMLITKLENLSIYYMDDSHRRVIEENP 311 (357) T ss_pred EEEchhhhhhhhhhHhhccCChHHHHHHHHHHHhhhhCCceeEEcc--ccCCCceEEeeccccEEEEecCcEEEEEEecc Confidence 777777643 2223322222221 111 1 13578999998755 455567788888876443 3333333333322 Q ss_pred cccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeeccCCCCccccC Q lcl|NC_011614. 269 QLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADAKPSSVPGEV 324 (324) Q Consensus 269 ~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~~~~~ 324 (324) ..+. ..++.. 
..-|+.|-+..++|.++...-.....|++- T Consensus 312 ~r~r-------iE~y~s---------~Ne~YvVEd~~~~a~iE~i~i~~~~~~~~~ 351 (357) T protein:vir:56 312 KLDR-------VENYES---------MNIDYVVEDYAAGCLVEKIKVGDFSTPAKA 351 (357) T ss_pred cccc-------ccchhh---------hcceeeeeccccEEEeeeeeeccCCCCccc Confidence 2111 111121 235777888888888877666655556665 No 225 >protein:vir:3783 Length: 336 # NCBI annotation: capsid # Family: family:all:201 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536823;genbank:gi:17981832;genbank:GeneID:929211 Probab=76.67 E-value=0.13 Score=25.39 Aligned_cols=294 Identities=11% Similarity=0.044 Sum_probs=147.4 Q ss_pred CchhhHHHHHHHHHhhccchhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCce-EEEEEeCC Q lcl|NC_011614. 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEK-KFTFWADK 79 (324) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~-~ip~~~~~ 79 (324) |.| +++...+.+.+...-.. ....+.+.-+.|.|.+.+.+...+.+.+-++++.+.+++..-.. .+-.-.++ T Consensus 1 mtr-~~~~~y~~~~A~~ngv~------~a~~~~~~~Fsv~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~lg~~g 73 (336) T protein:vir:37 1 MNK-QAYYALAAALAKHFNQP------LDSVLRGESFALKAPEAALLGENIQQRSDFLKGINMVQVAHTKGTKLFGATEK 73 (336) T ss_pred CcH-HHHHHHHHHHHHHhCCC------hhhhcccceeecCHHHHHHHHHHHHHHHHHhhcCceeecccccceEEeeccCc Confidence 877 34444444444321110 00112223467889999999999999999999999988874433 23333344 Q ss_pred cceeeecccccccccccceeeEEeeeeeEEEeehhHHHHHhcC--hhH-HHHHHHHHHHHHHHHHHHHHHHhccC----c Q lcl|NC_011614. 80 PGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYT--YSQ-FFEEMKPMIAEAFYKKFDEAGILNQG----N 152 (324) Q Consensus 80 ~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~v~iS~ell~~s--~~~-~~~~v~~~l~~ai~~~~d~a~l~g~g----~ 152 (324) +-+.-..-+.. .....++.-.+..++.-.-..|+.+.++.= .++ +...+...+.+.++..+=.-.|+|+. 
+ T Consensus 74 ~iagrtdt~r~--r~~~~l~~~~Y~c~qTn~dt~i~y~~LD~WA~~~d~~~~~~~~~~~r~iALD~i~IGfnG~s~A~~T 151 (336) T protein:vir:37 74 GVTGRKQTGRN--LATLDHSQNGYELSETDSGILVNWSLFDSFAIFKDRLVELYSEYFQNQVALDILQIGWNGQSVATNT 151 (336) T ss_pred ccccccCCCCC--ccccCCCCCccEEEEeeeeeeccHHHHHHHhcChhHHHHHHHHHHHHHHhcchhhhcccceeeccCC Confidence 44433322211 112335556666666666667777766632 133 22333333444444444455567742 1 Q ss_pred -CcCC----cccccc------------c-ccccce-e-eccc---chhH-HHHHHHHhhhhccCC--CEEEEcHHHHH-H Q lcl|NC_011614. 153 -NPFG----KSIAQS------------I-EKTNKV-I-KGDF---TQDN-IIDLEALLEDDELEA--NAFISKTQNRS-L 205 (324) Q Consensus 153 -~~~~----~~~~~~------------~-~~~~~~-~-~~~~---~~~~-i~~~~~~l~~~~~~~--~~~v~~~~~~~-~ 205 (324) ++.+ .|.+.. . ...... . ...- +.|. +.++...|+..+++. -+.+|...... . T Consensus 152 dnPllqDVNkGWlQ~~Re~a~~~v~~~~~~~~g~i~~~G~~gdy~NLDalV~D~~~~I~~~~~~d~dLVvivG~dLla~~ 231 (336) T protein:vir:37 152 TKTDLSDVNKGWLKLLQEQRAANFMTESTKSSGKITIFGDNADYANLDDLAFDLKQGLDFRHQNRNDLVFLVGADLVSKE 231 (336) T ss_pred CCccccccchhHHHHHHhccchhhcccccccCCceEEecCCCCcccHHHHHHHHHhccchHHhcCCCeEEEEchhhhhhh Confidence 1111 111110 0 000000 0 1111 2333 356666676666553 36677776543 2 Q ss_pred HHHhhccCC-cee--ec---cCCCceecccceEeecCccCCCceEEEeecccEEEEEecc-eEEEEeecccccccccccc Q lcl|NC_011614. 206 LRKIVDPET-KER--IY---DRNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQL-IEYKIDETAQLSTVKNEDG 278 (324) Q Consensus 206 L~~l~d~~g-~~~--~~---~~~~~~l~G~pv~~~~~~~~~~~~i~~gd~~~~~~~~~~~-~~i~~~~~~~~~~~~~~~~ 278 (324) ...+-...+ +|- .. .....++.|+|.+..+ ..|...++...++++-+-...+ .+=.+-+....+ T Consensus 232 ~~~l~~~~~~~PtE~~Aa~~~~~~k~iGGlpa~~~P--ffP~~~~lVT~L~NLsIY~Q~gs~RR~~~d~p~r~------- 302 (336) T protein:vir:37 232 TKLIQQKHGLTPTEKAALGSHNLMGSFGGMNAITPP--NFPARAAAVTTLKNLSVYTEAESVRRSLRNDEDKK------- 302 (336) T ss_pred hhhhhhhcCCCHHHHHHHHHHHHHHhhCCceEEEcc--ccCCCceEEeeccccEEEEecCcEEEEEEEccccc------- Confidence 222333322 221 11 1123578999998755 4556678888888875544333 332332222111 Q ss_pred cchhhhhcCcEEEEEEEEeccEEecccceEEEEeeccCCCCccccC Q lcl|NC_011614. 
279 TPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADAKPSSVPGEV 324 (324) Q Consensus 279 ~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~~~~~ 324 (324) ...++. ...-|+.|-+..++|.++.... .-|+|| T Consensus 303 rie~y~---------s~Ne~YvVEd~~~~a~iE~i~v---~~~~e~ 336 (336) T protein:vir:37 303 GLVTSY---------YRQEGYVVEDLGLMTAIDHTKV---KLNGEV 336 (336) T ss_pred cccchh---------hhcceeeeeccccEEEeeeeee---eccccC Confidence 111112 1235777888999999876543 236777 No 226 >protein:vir:98856 Length: 343 # NCBI annotation: hypothetical protein # Family: family:all:201 # MgeID: mge:1495 # MgeName: F108 # Cross-refs: genbank:acc:YP_654732;genbank:gi:109302917;genbank:GeneID:4156061 Probab=75.30 E-value=0.15 Score=25.13 Aligned_cols=298 Identities=11% Similarity=0.015 Sum_probs=138.3 Q ss_pred Cchhh--HHHHHHHHHhhccchhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCceEE-EEEe Q lcl|NC_011614. 1 MEQTQ--KLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKF-TFWA 77 (324) Q Consensus 1 m~~~~--~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~i-p~~~ 77 (324) |.+.- ++...+.+.+...-. .......+.-+.|.|.+.+.+...+.+.+-++++.+.+++.--...+ .... T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv------~~~~~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~q~~g~v~~~~~ 74 (343) T protein:vir:98 1 MNKTAQELFYSLIGDAAEYYGA------NPALALAGKQFSIEAPKESVLLGAIQQRSNFLEKINCVFSERYQRAIDLRSN 74 (343) T ss_pred CChHHHHHHHHHHHHHHHHhCC------ccchhccCceeeecHHHHHHHHHHHHHHHHHhhcCceecchhhcceEEEeec Confidence 66532 233333333322111 00011122346788899999999999999999999888875322222 2222 Q ss_pred CCcceeeeccccc-ccccccceeeEEeeeeeEEEeehhHHHHHhc--ChhH-HHHHHHHHHHHHHHHHHHHHHHhccCc- Q lcl|NC_011614. 78 DKPGAYWVGEGQK-IETSKATWVNATMRAFKLGVILPVTKEFLNY--TYSQ-FFEEMKPMIAEAFYKKFDEAGILNQGN- 152 (324) Q Consensus 78 ~~~~a~~v~Eg~~-~~~~~~~~~~v~~~~~k~~~~v~iS~ell~~--s~~~-~~~~v~~~l~~ai~~~~d~a~l~g~g~- 152 (324) ++..+.-...... .... ..+.-.+..++.-.-..|+.+.++. 
..++ |...+.+.+.+.++...=.-.|+|+.- T Consensus 75 sg~~t~r~~t~~~~~~~~--~~~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~deF~~r~~~~i~~~~ALD~i~IGfNGts~A 152 (343) T protein:vir:98 75 RKRHYGAHDRRTPIQQRW--TRQVMSMNVSRQIQACLIPWAKLDQWGHLKDKFASLYAEFVQNQIALDMIKIGFYGTSVG 152 (343) T ss_pred CccccCccccCCCccccc--cCCCCccEEEEeeeeeeccHHHHHHhhcChhHHHHHHHHHHHHHHhhccceecccceeec Confidence 3332221111111 1111 1111234444444445555555542 1255 888888888888877766777777431 Q ss_pred --CcCCc------cccc------------ccccccce-e-ecccchhH----HHHHHHHhhhhccCC--CEEEEcHHHHH Q lcl|NC_011614. 153 --NPFGK------SIAQ------------SIEKTNKV-I-KGDFTQDN----IIDLEALLEDDELEA--NAFISKTQNRS 204 (324) Q Consensus 153 --~~~~~------~~~~------------~~~~~~~~-~-~~~~~~~~----i~~~~~~l~~~~~~~--~~~v~~~~~~~ 204 (324) ...|. |.+. .....+.. . ...-+|.. +.++...|+..+++. -+.+|...... T Consensus 153 ~~T~nPllqDVN~GWLQ~~Re~ap~rVm~~~~~~~~~~~~G~ggdy~NLDalV~D~~~~I~~~~~~d~dLVvivG~dLla 232 (343) T protein:vir:98 153 TDTSDPNLADVNKGWIQFVRENKATQILTQGATSGEIRLFGEGADYVNLDELAYDLKQGLDARHRDAGDLVFLVGADLVA 232 (343) T ss_pred cCCCCcchhhcchHHHHHHHhcchhhhhccceeccceeEecCCCCcccHHHHHHHHHhcCchHHhcCCCEEEEEchhhhh Confidence 11111 1110 00000000 0 11112322 234555666655554 36677777643 Q ss_pred H-HHHhhccCCcee---ec---cCCCceecccceEeecCccCCCceEEEeecccEEEE-EecceEEEEeecccccccccc Q lcl|NC_011614. 205 L-LRKIVDPETKER---IY---DRNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYG-IPQLIEYKIDETAQLSTVKNE 276 (324) Q Consensus 205 ~-L~~l~d~~g~~~---~~---~~~~~~l~G~pv~~~~~~~~~~~~i~~gd~~~~~~~-~~~~~~i~~~~~~~~~~~~~~ 276 (324) . -..+-...+++. .. .....++-|+|.+..+ ..|...++...++++-+- ..+..+=.+.+....+. 
T Consensus 233 ~~~~~l~n~~~~~ptEk~Aa~~~~~~k~iGGl~a~~~P--fFP~~~llVT~L~NLsIY~Q~gs~RR~~~d~p~r~r---- 306 (343) T protein:vir:98 233 KEASLVYKGNGLIATEKAALNTHDLMKSFGGMPAMIVP--NMPPRAAIVTSLSNLSIYTQEGSMRRGMKDDDDKKA---- 306 (343) T ss_pred hhhhhhhhhcCCChHHHHHHHHHHHHHhhCCCeeEEcc--ccCCCceEEeeccccEEEEecCcEEEEEEecccccc---- Confidence 2 122333333211 00 0113578999998755 455667888888876543 33333333333222111 Q ss_pred cccchhhhhcCcEEEEEEEEeccEEecccceEEEEeeccCCCCccccC Q lcl|NC_011614. 277 DGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADAKPSSVPGEV 324 (324) Q Consensus 277 ~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~~~~~ 324 (324) ..++.. ..-|+.|-+..++|.++....+-....+-- T Consensus 307 ---ie~y~s---------~Ne~YvVEd~~~~a~iE~i~v~~~~~~g~w 342 (343) T protein:vir:98 307 ---VRDSYY---------RNEAYAVEDCGKFMAVDFTKVKLSSGKGTW 342 (343) T ss_pred ---ccchhh---------hcceeeeeccccEEEeeeeeeeecCCCCCC Confidence 111122 224566667777777654332222111100 No 227 >protein:vir:100331 Length: 342 # NCBI annotation: major capsid protein N # Family: family:all:201 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655472;genbank:gi:109289940;genbank:GeneID:4157374 Probab=75.26 E-value=0.15 Score=25.12 Aligned_cols=294 Identities=12% Similarity=0.126 Sum_probs=151.8 Q ss_pred CchhhHHHHHHHHHhhccchhhhhcccccc---ccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCce-EEEEE Q lcl|NC_011614. 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVM---MHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEK-KFTFW 76 (324) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~~~~~a~~~~---~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~-~ip~~ 76 (324) |...-. ....+|..... ...++. ...+--+.|-|.+.+.+...+.+.+-++++.+.+++..-.. 
.+-.- T Consensus 1 M~~~tr--~~~~~y~~~~A-----~~ngv~~~~~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lg 73 (342) T protein:vir:10 1 MKDLTL--EKYNAYLARQA-----ELNNLPFNALATGIKFTVQPSVQQKLYEKVRESSDFLKSISFVFVDEQTGETLGLD 73 (342) T ss_pred CChHHH--HHHHHHHHHHH-----HHhCCChhHccccceeecChHHHHHHHHHHHHHHHHhccCcccccccceeeEEecc Confidence 765432 22233333111 111111 11122466888899999999999999999999888775433 33333 Q ss_pred eCCcceeeec-c--cccccccccceeeEEeeeeeEEEeehhHHHHHhc--ChhHHHHHHHHHHHHHHHHHHHHHHHhccC Q lcl|NC_011614. 77 ADKPGAYWVG-E--GQKIETSKATWVNATMRAFKLGVILPVTKEFLNY--TYSQFFEEMKPMIAEAFYKKFDEAGILNQG 151 (324) Q Consensus 77 ~~~~~a~~v~-E--g~~~~~~~~~~~~v~~~~~k~~~~v~iS~ell~~--s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g 151 (324) .+++-++-+. . +...|.+-..++.-.+..++.-.-..|+.+.++. ..++|...+.+.+.+.++...=.-.|+|+. T Consensus 74 ~~g~iagrtdT~~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts 153 (342) T protein:vir:10 74 SAHTVASTTDTSGDGERKTTSIAKLVKQTYHCQQINFDTHINYKQLDMWAKFPDFQQKVANVAAKQRKRDLIMIGFNGTS 153 (342) T ss_pred cCcccccccccCCCCCcccccccccCCCccEEEEeeecccccHHHHHHHhcChhHHHHHHHHHHHHHhhccceeccccee Confidence 4444444332 1 1223333345666666666666666677666652 236899999999998888777777778843 Q ss_pred c--------CcCC----cccccc-------------cccccceeecccch---hHH-HHHHHH-hhhhccCC--CEEEEc Q lcl|NC_011614. 152 N--------NPFG----KSIAQS-------------IEKTNKVIKGDFTQ---DNI-IDLEAL-LEDDELEA--NAFISK 199 (324) Q Consensus 152 ~--------~~~~----~~~~~~-------------~~~~~~~~~~~~~~---~~i-~~~~~~-l~~~~~~~--~~~v~~ 199 (324) - ++.+ .|.+.. ...........-+| |.+ .++... ++..+++. -+.+|. 
T Consensus 154 ~A~~Td~~~nPllqDVN~GWlQ~~Re~ap~rv~~~~~~~~~i~iG~~gdy~NLDalV~D~~~~lI~~~~~~d~dLVvivG 233 (342) T protein:vir:10 154 RAATSDRNSNPLLQDVAKGWLQKMREDAKERVMNGESTDNQVLVGKGQEYANLDALVMDATEELIDEWHRDDTDLVVITG 233 (342) T ss_pred eccCCChhhCcCccccchHHHHHHHhhhhhhhcccceeccceeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEc Confidence 1 1111 111110 00000011111233 332 345654 46655544 367777 Q ss_pred HHHHH-HHHHhhccCCcee--e-ccC--CCceecccceEeecCccCCCceEEEeecccEEE-EEecceEEEEeecccccc Q lcl|NC_011614. 200 TQNRS-LLRKIVDPETKER--I-YDR--NSDSLDGLPVVNLKSSNLKRGELITGDFDKLIY-GIPQLIEYKIDETAQLST 272 (324) Q Consensus 200 ~~~~~-~L~~l~d~~g~~~--~-~~~--~~~~l~G~pv~~~~~~~~~~~~i~~gd~~~~~~-~~~~~~~i~~~~~~~~~~ 272 (324) ..... +-..+-.....|- . .+. ...++.|+|.+..+ ..|...++...++++-+ ...+..+=.+.+....+. T Consensus 234 ~dLladk~~~l~n~~~~ptE~~Aa~~i~s~k~iGGl~a~~~P--fFP~~~ilVT~L~NLsIY~Q~gs~RR~~~d~p~r~r 311 (342) T protein:vir:10 234 RKLLADKYFPIVNQQNAPTEELAADIVISQKRIGGLKAVRVP--FFPANAILITKLENLAIYVQEGTTRKHIENVPKKDR 311 (342) T ss_pred hhhhHHHHHHHHhcCCChHHHHHHHHHHhhhhhcCceeEEcc--ccCCCceEEeeccccEEEEecCcEEEEEEecccccc Confidence 77654 2222222222221 1 111 13578999998755 45556778888887644 333333333333222111 Q ss_pred cccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeeccCCCC Q lcl|NC_011614. 273 VKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADAKPSS 319 (324) Q Consensus 273 ~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~ 319 (324) ..++. 
...-|+.|-+..++|.++...-...- T Consensus 312 -------ie~y~---------s~Ne~YvVEd~~~~a~iE~i~i~~~~ 342 (342) T protein:vir:10 312 -------IETYE---------SENIDYVVEDYGCAALIENITLKDKE 342 (342) T ss_pred -------ccchh---------hhccceeeeccccEEEeecceecCCC Confidence 11111 12357778888888888754332211 No 228 >protein:vir:2736 Length: 348 # NCBI annotation: putative structural protein # Family: family:all:1083 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695109;genbank:gi:23455878;genbank:GeneID:955608 Probab=65.37 E-value=0.28 Score=23.58 Aligned_cols=285 Identities=11% Similarity=0.026 Sum_probs=117.9 Q ss_pred ccCCCcceechhhhHHHHHHHHhhcchh--hhceeeecCCCceEEEEEe-CCc-ceeeecccccccc-cccceeeEEeee Q lcl|NC_011614. 31 MHEKKDGTLLNDFTTPILQEVMENSKIM--QLGKYEPMEGTEKKFTFWA-DKP-GAYWVGEGQKIET-SKATWVNATMRA 105 (324) Q Consensus 31 ~~~~~g~lip~~~~~~i~~~~~~~s~l~--~l~~~~~~~~~~~~ip~~~-~~~-~a~~v~Eg~~~~~-~~~~~~~v~~~~ 105 (324) ++.--..+-+.++...|-+.......++ .+++..++.+-.+.+.... ... .|.+++.+.+.+. ....++...+.+ T Consensus 1 M~~i~d~f~~~~l~~~v~~~~~~~~~~l~~~~Fp~~~~~~~~~~~~~~~~~~~~~a~~v~~~~~~~~~~r~~~~~~~~~~ 80 (348) T protein:vir:27 1 MGLIYDKVTASNIAGYFNALQENVSSTLGESIFPARKQLGTKLSYIKGASGQSVALKAAAFDTNVTIRDRVSAEMHDEQM 80 (348) T ss_pred CcchhhhcCHHHHHHHHHhccchhhhhhHhhcCCCccccceeEEEEeeccCceeEeeeecCCCCcceecccceeeeeeec Confidence 1111112223333332222222222222 2344444444333332222 222 3567777666553 345577777777 Q ss_pred eeEEEeehhHHHHH------hcCh-hHHHHHH-------HHHHHHHHHHHHHH----HHHhcc----CcCcCCccccccc Q lcl|NC_011614. 106 FKLGVILPVTKEFL------NYTY-SQFFEEM-------KPMIAEAFYKKFDE----AGILNQ----GNNPFGKSIAQSI 163 (324) Q Consensus 106 ~k~~~~v~iS~ell------~~s~-~~~~~~v-------~~~l~~ai~~~~d~----a~l~g~----g~~~~~~~~~~~~ 163 (324) -.++-...++.+-+ ..+. .+..+.+ ...+.+++.+.+|. ++.+|- +.+.. ..+.-+. 
T Consensus 81 p~i~~~~~i~~~d~~~~~~~~~~~~~~~~~~~~~~i~~d~~~l~~~i~~r~E~m~~~al~~Gki~i~~~~~~-~~vdfg~ 159 (348) T protein:vir:27 81 PFFKEAMLVKENDRQQLNLVKDSGNAVLVNTIVAGIFNDNLTLVNGARARLEAMRMQVLATGKIAFTSDGVN-KDIDYGV 159 (348) T ss_pred CccccccccCHHHHHHHHHhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCeeEEecCCee-EEEeecC Confidence 66766666654332 2111 1222222 22234555555554 333331 11100 0000000 Q ss_pred c-------cccceeecccchhHHHHHHHHhhhhccCCCEEEEcHHHHHHHHH---hhccC----Cce-eeccCC----Cc Q lcl|NC_011614. 164 E-------KTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRK---IVDPE----TKE-RIYDRN----SD 224 (324) Q Consensus 164 ~-------~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~---l~d~~----g~~-~~~~~~----~~ 224 (324) . ......++...++||.++...+...+..+..++|++..+.+|.+ +++.- +.. .+.... -+ T Consensus 160 ~~~~~~t~~~~W~~~~adp~~di~~~~~~~~~~G~~~~~ii~~~~~~~~l~~~~~v~~~~~~~~~~~~~i~~~~~~~~~~ 239 (348) T protein:vir:27 160 KPDHKKQVSKSWAEPGATPLADLEDAIETARELGLNPERAVMNAKTFGLIRKAASTVKVIKPLAGDGSAVTKAELENYIA 239 (348) T ss_pred CcccceeeeeccCCCCCCHHHHHHHHHHHHHhcCCcccEEEECHHHHHHHhcCHHHHHHhcccCccccccCHHHHHHHHH Confidence 0 01111223455678888877777778888899999999999853 33221 111 111110 12 Q ss_pred eecccceEeecC----------ccCCCceEEEeeccc---EEEEEe-cceEEEEeecccccccccccccchhhhh-cC-- Q lcl|NC_011614. 225 SLDGLPVVNLKS----------SNLKRGELITGDFDK---LIYGIP-QLIEYKIDETAQLSTVKNEDGTPVNLFE-QD-- 287 (324) Q Consensus 225 ~l~G~pv~~~~~----------~~~~~~~i~~gd~~~---~~~~~~-~~~~i~~~~~~~~~~~~~~~~~~~~~f~-~~-- 287 (324) ++.|+++++.+. ...+++.+++..... ..+|.- ++..................+.....|. .| T Consensus 240 ~~~g~~i~~yd~~y~d~~G~~~~~~p~~~vvl~~~~~~G~~~yG~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~dP~ 319 (348) T protein:vir:27 240 DNFGVSIVLENGTYRNDKGEVSKFYPDGHLTLIPNGPLGNTVFGTTPEESDLFADNTVNAEVEIVDNGIAVTTTKTTDPV 319 (348) T ss_pred hhcCceEEEEeeEEEcCCCcCcccccCCeEEEEcCCcceeEEeccCcchhhhhhccccccceeeeCCeeEEEeeecCCCc Confidence 345666654221 112444454443322 222211 0000000000000000000001111111 11 Q ss_pred cEEEEEEEEeccEEecccceEEEEeeccC Q lcl|NC_011614. 
288 MVALRATMHVALHIADDKAFAKLVPADAK 316 (324) Q Consensus 288 ~v~~r~~~r~d~~v~~~~a~~~l~~~~~~ 316 (324) ...+.+..+.=-.+.+|+++.+++..++. T Consensus 320 ~~~~~~~s~~lPv~~~~~~~~~a~Vl~~~ 348 (348) T protein:vir:27 320 NVQTKVSMVALPSFERLDDVYMLTVIPAV 348 (348) T ss_pred eEEEEEeeeeeccccCCCcEEEEEEecCC Confidence 23344555555566778888888887776 No 229 >protein:vir:96490 Length: 348 # NCBI annotation: head protein # Family: family:all:1083 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238492;genbank:gi:66391768;genbank:GeneID:5176912 Probab=64.84 E-value=0.29 Score=23.50 Aligned_cols=284 Identities=11% Similarity=0.048 Sum_probs=118.0 Q ss_pred ccCCCcceechhhhHHHHHHHH-hhcchh--hhceeeecCCCceEEEEE-eCCc-ceeeecccccccc-cccceeeEEee Q lcl|NC_011614. 31 MHEKKDGTLLNDFTTPILQEVM-ENSKIM--QLGKYEPMEGTEKKFTFW-ADKP-GAYWVGEGQKIET-SKATWVNATMR 104 (324) Q Consensus 31 ~~~~~g~lip~~~~~~i~~~~~-~~s~l~--~l~~~~~~~~~~~~ip~~-~~~~-~a~~v~Eg~~~~~-~~~~~~~v~~~ 104 (324) ++.--..+-+.++.. +++.+. ....++ .+++..+..+..+.+... .... .|.++..+.+.+. ....++...+. T Consensus 1 M~~i~d~f~~~~l~~-~i~~~~~~~~~~l~~~~Fp~~~~~~~~~~~~~~~~~~~~~a~~v~~~~~~~~~~r~~~~~~~~~ 79 (348) T protein:vir:96 1 MGLIYDKVTASNIAG-YFNTLQENVDSTLGESIFPARKQLGTKLSYIKGASGQSVALKAAAFDTNVTIRDRVSAEIHDEQ 79 (348) T ss_pred CcchhhccCHHHHHH-HHHhcccchhhhhhhhcCCCccccceeEEEEeecCCceeEeeeecCCCCcceecccceeeeeee Confidence 111111222233333 333332 222222 334544444433333222 2222 3678887766554 34557777777 Q ss_pred eeeEEEeehhHHHHHh------cC-hhHHHHHHHH-------HHHHHHHHHHHH----HHHhcc----CcCcCCcccccc Q lcl|NC_011614. 105 AFKLGVILPVTKEFLN------YT-YSQFFEEMKP-------MIAEAFYKKFDE----AGILNQ----GNNPFGKSIAQS 162 (324) Q Consensus 105 ~~k~~~~v~iS~ell~------~s-~~~~~~~v~~-------~l~~ai~~~~d~----a~l~g~----g~~~~~~~~~~~ 162 (324) +-.++-...++.+-++ .+ .....+.+.+ .+.+.+.+.+|. ++.+|- +.+.. 
..+.-+ T Consensus 80 ~p~i~~~~~i~~~d~~~l~~~~~~~~~~~~~~~~~~i~~d~~~l~~~i~~r~E~m~~qal~~Gki~~~~~~~~-~~vdfg 158 (348) T protein:vir:96 80 MPFFKEALLVKENDRQQLNLVKDTGNEALINTIVAGIFNDDVTLINGARARLEAMRMQVLATGKIAFTSDGVN-KDIDYG 158 (348) T ss_pred cCccccccccCHHHHHHHHhhhccCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCeeEeecCCee-EEEecc Confidence 7766666665542211 11 1112222222 233455555553 333331 11100 000000 Q ss_pred c-------ccccceeecccchhHHHHHHHHhhhhccCCCEEEEcHHHHHHHHH---hhc----cCCcee-ecc----CCC Q lcl|NC_011614. 163 I-------EKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRK---IVD----PETKER-IYD----RNS 223 (324) Q Consensus 163 ~-------~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~---l~d----~~g~~~-~~~----~~~ 223 (324) . .......++...+.+|.++...+...+..+..++|++..+.+|.+ +++ .++... +.. ..- T Consensus 159 ~~~~~~~t~~~~W~~~~adp~~di~~~~~~~~~~G~~~~~~i~~~~~~~~l~~~~~v~~~~~~~~~~~~~~~~~~~~~~~ 238 (348) T protein:vir:96 159 VKADHKKQVSKSWAEPGATPLADLEDAIETARELGLNPERAIMNAKTFGLIRKAASTVKAIKPLAGDGSSVTKAELQNYV 238 (348) T ss_pred CCcccceeeccccCCCCCCHHHHHHHHHHHHHhcCCcccEEEeCHHHHHHHhcCHHHHHHHhccCCccccccHHHHHHHH Confidence 0 001112234456678888877777778888899999999999852 322 111111 110 011 Q ss_pred ceecccceEeecCc----------cCCCceEEEeecccE---EEEEe-cceEEEEeecccccccccccccchhhh-hcC- Q lcl|NC_011614. 224 DSLDGLPVVNLKSS----------NLKRGELITGDFDKL---IYGIP-QLIEYKIDETAQLSTVKNEDGTPVNLF-EQD- 287 (324) Q Consensus 224 ~~l~G~pv~~~~~~----------~~~~~~i~~gd~~~~---~~~~~-~~~~i~~~~~~~~~~~~~~~~~~~~~f-~~~- 287 (324) +.+.|+++++.+.. ..+++.+++...... .+|.- ++...............-..+.....| +.| T Consensus 239 ~~~~g~~i~~y~~~y~d~~G~~~~~~p~~~v~l~~~~~~G~~~yg~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~dP 318 (348) T protein:vir:96 239 ADNYGVEIVLENGTYRNEKGEVSKFFPDGHLTLIPNGPLGNTVFGTTPEESDLFADNTVNADVEIVDSGIAVTTTKTTDP 318 (348) T ss_pred hhhcCceEEEEccEEEecCCcEeccccCCeEEEEcCCCceeEEeccChhhhhhhhcccccccceecCCeeEEEeeecCCC Confidence 23456666543211 123344444332221 12210 000000000000000000001111112 112 Q ss_pred -cEEEEEEEEeccEEecccceEEEEeeccC Q lcl|NC_011614. 
288 -MVALRATMHVALHIADDKAFAKLVPADAK 316 (324) Q Consensus 288 -~v~~r~~~r~d~~v~~~~a~~~l~~~~~~ 316 (324) ...+++..+.=-.+.+|+++.+++..++. T Consensus 319 ~~~~~~~~s~plPv~~~~~~~~~a~Vl~~~ 348 (348) T protein:vir:96 319 VNVQTKVSMVALPSFERLGDVYMLTVIPGV 348 (348) T ss_pred ceEEEEEeeeeeccccCCCcEEEEEEecCC Confidence 23355555555556678899998887766 No 230 >protein:vir:79157 Length: 339 # NCBI annotation: P2 family phage major capsid protein # Family: family:all:201 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165257;genbank:gi:145708082;genbank:GeneID:5247168 Probab=64.55 E-value=0.3 Score=23.46 Aligned_cols=291 Identities=15% Similarity=0.106 Sum_probs=151.9 Q ss_pred CchhhH--HHHHHHHHhhccchhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCce-EEEEEe Q lcl|NC_011614. 1 MEQTQK--LKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEK-KFTFWA 77 (324) Q Consensus 1 m~~~~~--~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~-~ip~~~ 77 (324) |.+.-. +.....+.+... ++. ..+..+.|-|.+.+.+...+.+.+-++++.+.+++..-.. .+-.-. T Consensus 1 M~~~tr~~~~~y~~~~A~~n---------gv~-~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~lg~ 70 (339) T protein:vir:79 1 MRNDTRRLFAAYKAAIAKLN---------GVE-RVDEKFSVAPSVQQKLETKVQESSDFLKSINFYGVPEQEGEKIGLGV 70 (339) T ss_pred CChHHHHHHHHHHHHHHHHh---------Ccc-cccceeeecHHHHHHHHHHHHHHHHHhccCcccccccceeeEEeecc Confidence 654432 222233322211 121 2234567888899999999999999999999888775433 233333 Q ss_pred CCcceeeec-cccc-ccccccceeeEEeeeeeEEEeehhHHHHHhc--ChhHHHHHHHHHHHHHHHHHHHHHHHhccCc- Q lcl|NC_011614. 78 DKPGAYWVG-EGQK-IETSKATWVNATMRAFKLGVILPVTKEFLNY--TYSQFFEEMKPMIAEAFYKKFDEAGILNQGN- 152 (324) Q Consensus 78 ~~~~a~~v~-Eg~~-~~~~~~~~~~v~~~~~k~~~~v~iS~ell~~--s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~- 152 (324) +++-++-+. -+.+ .|.+-..++.-.+..++.-.-..|+.+.++. 
..++|...+.+.+.+.++...=.-.|+|+.- T Consensus 71 ~g~iagrtdt~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A 150 (339) T protein:vir:79 71 SGPVASTTDTTQQDRETSDISTMDGRRYRCEQTNSDTHITYQKLDAWAKFADFQTRIRDAIIKRQALDRIMIGFNGVSRA 150 (339) T ss_pred CcceeecccCCCCCcccccccccCCCccEEEEeeeeceecHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeee Confidence 444443321 1222 2222245666666666666666666666652 2368999999999988887777777788431 Q ss_pred -------CcCC----ccccc------------cccccc-ce-e-ecccchhHH----HHHHH-HhhhhccCC--CEEEEc Q lcl|NC_011614. 153 -------NPFG----KSIAQ------------SIEKTN-KV-I-KGDFTQDNI----IDLEA-LLEDDELEA--NAFISK 199 (324) Q Consensus 153 -------~~~~----~~~~~------------~~~~~~-~~-~-~~~~~~~~i----~~~~~-~l~~~~~~~--~~~v~~ 199 (324) ++.+ .|.+. ....++ .. . ...-.|..| .++.. .+++.+++. -+.+|. T Consensus 151 ~~Td~~~nPllqDVN~GWlQ~~Re~ap~rV~~~g~~~s~~i~~~G~ggdy~NLDalV~d~~~~lId~~~~~d~dLVvivG 230 (339) T protein:vir:79 151 ATSDRVANPMLQDVNKGWLQNLREQAPQRVMKEGKAAAGKITVGGAGADYGNLDALVYDITNHLVEPWYAEDPDLVVVCG 230 (339) T ss_pred cCCChhhCcCccccchhHHHHHHhhhhhhhhccceeccceeEeccCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEc Confidence 1111 11111 000000 00 1 111123332 45564 345666544 367777 Q ss_pred HHHHH-HHHHhhccCCcee--ec-cC--CCceecccceEeecCccCCCceEEEeecccEEEEE-ecceEEEEeecccccc Q lcl|NC_011614. 200 TQNRS-LLRKIVDPETKER--IY-DR--NSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYGI-PQLIEYKIDETAQLST 272 (324) Q Consensus 200 ~~~~~-~L~~l~d~~g~~~--~~-~~--~~~~l~G~pv~~~~~~~~~~~~i~~gd~~~~~~~~-~~~~~i~~~~~~~~~~ 272 (324) ..... +-..+-.....|- .. +. ...++.|+|.+..+ ..|...++...++++-+-. .+..+=.+.+... 
T Consensus 231 ~dLla~k~~~l~n~~~~ptE~~Aa~~i~s~k~iGGl~a~~~P--fFP~~~llVT~L~NLsIY~Q~gs~RR~~~d~p~--- 305 (339) T protein:vir:79 231 RNLLSDKYFPLVNRDRDPVQQIAADLIISQKRIGNLPAIRVP--YFPANGLLVTRLDNLSIYYQEGGRRRTILDNAK--- 305 (339) T ss_pred hhhhhhHhhhHhhcCCChHHHHHHHHHHHhhhhCCceeEEcc--ccCCCceEEeechhcEEEEecCcEEEEEEeccc--- Confidence 77654 2223322222221 11 11 13578999998755 4556677888888765433 3333333333222 Q ss_pred cccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeeccCCCC Q lcl|NC_011614. 273 VKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADAKPSS 319 (324) Q Consensus 273 ~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~ 319 (324) ++.+.-.=...-|+.|-+..+++.++...-..+. T Consensus 306 -------------r~rie~y~s~Ne~YvVEd~~~~a~iEni~~~~aa 339 (339) T protein:vir:79 306 -------------RDRIENYESSNDAYVIEDLACAAMAENIALAAAA 339 (339) T ss_pred -------------cccccchhhccceeeeeccccEEEeeeeecccCC Confidence 1111111112347778888888888755443333 No 231 >protein:vir:107882 Length: 307 # NCBI annotation: gp34 # Family: family:all:908 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024707;genbank:gi:48696944;genbank:GeneID:2845970 Probab=63.81 E-value=0.31 Score=23.37 Aligned_cols=271 Identities=9% Similarity=-0.032 Sum_probs=112.8 Q ss_pred cccCCCcceechhhhHHHHHHHHhhcchhhhceeeecCCCceEEEEEeCCcceeeec--cccccc-cccccee---eEEe Q lcl|NC_011614. 30 MMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKPGAYWVG--EGQKIE-TSKATWV---NATM 103 (324) Q Consensus 30 ~~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~~a~~v~--Eg~~~~-~~~~~~~---~v~~ 103 (324) +++-....++-|.+.+--+..-.+.-+--.+++++|++....+|+.+. .++.-+. +-+... ....+|. .... 
T Consensus 1 m~~~~~~~~~dp~LT~~A~gy~n~~~ia~~l~P~vpv~~~~~k~~~f~--~eaF~~~~t~r~~~~~~~~v~~~~~~~~~~ 78 (307) T protein:vir:10 1 MGRLSKLRIVDPVLTNLAIGYTNAEFIGQSLMPVVEVEKEGGKIPKFG--KESFRLYKTERALRARSNRMNPEDLGSIDI 78 (307) T ss_pred CCCCCCCcccChhHHHHHHhhcchhhhhhhcCCcccccccccceeeEC--cccccchhhhcccCCCcceeeccccccccc Confidence 333333334334444433333333333345678899888888888875 2332122 111111 1122222 2222 Q ss_pred eeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHH---HhccCcC-cCCcccccccccccceeecccchhHH Q lcl|NC_011614. 104 RAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAG---ILNQGNN-PFGKSIAQSIEKTNKVIKGDFTQDNI 179 (324) Q Consensus 104 ~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~---l~g~g~~-~~~~~~~~~~~~~~~~~~~~~~~~~i 179 (324) .....+-..++...--..+..++++.-.+.+.+.|.+..|..+ +....+= ......+. ++......+.....+| T Consensus 79 ~~~~~~L~~~id~r~~~~~~~~~~~~av~~l~d~I~l~~E~~~A~l~~~~~~y~~~~k~tLs--Gt~~Wsd~~sDPi~di 156 (307) T protein:vir:10 79 VLDEHDLEYPIDYREDQESAFPLEQAAVQTATEAIQLRREKMVADLAQNPNSYAGGNKKQLS--ATEKFTAAGSDPVGVI 156 (307) T ss_pred ccccccccccCChhhcCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCccccCCCceEEec--cccccCCCCCCcHHHH Confidence 2222222344444444455567777777777777776665422 2211110 11111111 1222222344566677 Q ss_pred HHHHHHhh-hhccCCCEEEEcHHHHHHHHH---hh---ccCCceeeccCCCceecccceEeecCccC----CCceEEEee Q lcl|NC_011614. 180 IDLEALLE-DDELEANAFISKTQNRSLLRK---IV---DPETKERIYDRNSDSLDGLPVVNLKSSNL----KRGELITGD 248 (324) Q Consensus 180 ~~~~~~l~-~~~~~~~~~v~~~~~~~~L~~---l~---d~~g~~~~~~~~~~~l~G~pv~~~~~~~~----~~~~i~~gd 248 (324) .+...++. ..+..+..++|.+..|.+|.. +. +..+.-.+....-..++|+--+....... 
++..-+.|+ T Consensus 157 ~~~~~ai~~~~g~~Pn~~vlg~~a~~al~~hp~i~e~lk~~~~g~it~~~la~ll~v~~i~vg~a~~~~~~~~~~~iw~~ 236 (307) T protein:vir:10 157 EDGKEAIRTKIGRRPNTMVIGASAYKTLKAHPQLIEKIKYSMKGIVTVDLLKEIFEVENIAVGEAIYADDKDRFTDIWGA 236 (307) T ss_pred HHHHHHHHhhhCCccceEEeCHHHHHHHhcCHHHHHHhCCccccccCHHHHHHHhCceeEEEeeeeeeccCCccceeCCC Confidence 77666664 457889999999999988853 11 12221111111112345543322211100 000000011 Q ss_pred cccEEEEEec---------------ceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEee Q lcl|NC_011614. 249 FDKLIYGIPQ---------------LIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPA 313 (324) Q Consensus 249 ~~~~~~~~~~---------------~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~ 313 (324) .+++.+.. +.+.. ++. ....+.. ....+.-.+|+.....-.+.-++|=..++.+ T Consensus 237 --~~vl~yv~~~~~~~~~~~~epsfGyT~~--~~g--~~~~d~~-----~~~~~~~~~r~~~~~~~~i~~~~~G~li~~~ 305 (307) T protein:vir:10 237 --NIVLAYVPLQRGGQQRTPYEPSYGYTLR--KKG--NPVVDTR-----IEDGKLELVRSTDIFRPYLLGADAGYLISGI 305 (307) T ss_pred --ceEEEecccccCCCCCcccccccceeEE--EcC--CeEeece-----ecCCceeEEeccccccceeecccccceeccC Confidence 01111100 00000 000 0000000 0112333355555555555555555555554 Q ss_pred cc Q lcl|NC_011614. 314 DA 315 (324) Q Consensus 314 ~~ 315 (324) .- T Consensus 306 ~~ 307 (307) T protein:vir:10 306 NG 307 (307) T ss_pred CC Confidence 33 No 232 >protein:vir:99888 Length: 309 # NCBI annotation: capsid protein # Family: family:all:908 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164075;genbank:gi:56692607;genbank:GeneID:3192616 Probab=62.59 E-value=0.33 Score=23.21 Aligned_cols=278 Identities=12% Similarity=-0.017 Sum_probs=118.0 Q ss_pred CCCcceechhhhHHHHHHHHhhcc-hhhhceeeecCCCceEEEEEeCCccee-----eecccccccccccceeeEEeeee Q lcl|NC_011614. 33 EKKDGTLLNDFTTPILQEVMENSK-IMQLGKYEPMEGTEKKFTFWADKPGAY-----WVGEGQKIETSKATWVNATMRAF 106 (324) Q Consensus 33 ~~~g~lip~~~~~~i~~~~~~~s~-l~~l~~~~~~~~~~~~ip~~~~~~~a~-----~v~Eg~~~~~~~~~~~~v~~~~~ 106 (324) =+.+..++..+.+.+-...++..- --.+++.+|++....+||++... ++. 
-++-++....-+++....+...+ T Consensus 1 ~~~~~~~~dp~LT~~A~gy~n~~~Ia~~l~P~vpV~~~~~~~~~f~~~-e~F~~~~t~r~~~~~~~~v~~~~~~~~~~~~ 79 (309) T protein:vir:99 1 MSNAPFPIDPELTAIAIAYRNGRMISDEVLPRVPVGKQEFKFWKYDLA-QGFTVPETLVGRKSKPNEVEFSATDETGSTE 79 (309) T ss_pred CCCCCcCcCHhHHHHHhhccChhhhhhhcCCccccCccccceeeechh-hcccccchhhccCCCcceEeecccCceeeec Confidence 112334444444434333222222 23467888988888888887532 221 12333333333344444455555 Q ss_pred eEEEeehhHHHHHhcC--hhHHHHHHHHHHHHHHHHHHHHHH---HhccCcCcCC-cccccccccccceeecccchhHHH Q lcl|NC_011614. 107 KLGVILPVTKEFLNYT--YSQFFEEMKPMIAEAFYKKFDEAG---ILNQGNNPFG-KSIAQSIEKTNKVIKGDFTQDNII 180 (324) Q Consensus 107 k~~~~v~iS~ell~~s--~~~~~~~v~~~l~~ai~~~~d~a~---l~g~g~~~~~-~~~~~~~~~~~~~~~~~~~~~~i~ 180 (324) ..+-..+|..+-..++ ..+.++.-.+.+.+.|....|..+ +....+-+.. ...+. ++......+.....+|. T Consensus 80 ~~~L~~~i~~~~~~~a~~~~d~~~~Av~~l~~~i~l~rE~~~A~lv~~~a~y~~~~k~~Ls--gt~~wsd~~SDPi~~i~ 157 (309) T protein:vir:99 80 DHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAAGNKTTLS--GADQWSDPTSNPLPVIT 157 (309) T ss_pred ccceeecCCchhhhhccCCCCHHHHHHHHHHHHHHHHHHHHHHHHhcChhhcCCCceEEec--CccccCCCCCCcHHHHH Confidence 5555566666655544 367788888888887777666422 2221111111 01111 11111112233444444 Q ss_pred HHHHHhhhhccCCCEEEEcHHHHHHHHH-------hhccCCcee-eccCCCceecccc-eEeecCcc---C-CCc-eE-- Q lcl|NC_011614. 181 DLEALLEDDELEANAFISKTQNRSLLRK-------IVDPETKER-IYDRNSDSLDGLP-VVNLKSSN---L-KRG-EL-- 244 (324) Q Consensus 181 ~~~~~l~~~~~~~~~~v~~~~~~~~L~~-------l~d~~g~~~-~~~~~~~~l~G~p-v~~~~~~~---~-~~~-~i-- 244 (324) +.+. .-++.+..++|....|.+|+. ++-..+... +....--.++|+- |++..+.- . ++. 
.+ T Consensus 158 ~~~~---~~g~~PN~~vlg~~~~~~l~~hp~i~~~ik~~~~~~g~it~~~la~l~~ve~V~vg~a~~n~a~~g~~~~~~~ 234 (309) T protein:vir:99 158 DALD---SVILRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPMAFLQELLELDAIYIGEARLNIARPGQNPNLIR 234 (309) T ss_pred HHHH---hhCCCcceEEechHHHHHHhhCHHHHHHhcCCCccccccCHHHHHHHhCcceEEeecceeecccccccccccc Confidence 4444 447889999999999988753 222222221 2222223355553 33321110 0 000 00 Q ss_pred EEeecccEEEEEecceEEE-Eeeccccccccccccc-chhhh-hcCcEEEEEEEEeccEEecccceEEEEeeccC Q lcl|NC_011614. 245 ITGDFDKLIYGIPQLIEYK-IDETAQLSTVKNEDGT-PVNLF-EQDMVALRATMHVALHIADDKAFAKLVPADAK 316 (324) Q Consensus 245 ~~gd~~~~~~~~~~~~~i~-~~~~~~~~~~~~~~~~-~~~~f-~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~~~ 316 (324) +-|+..-+.+.....-.++ .+-..+.....-..+. ....+ ..+.-.+|+..+..-.+.-+++=..++.+.+. T Consensus 235 iwg~~~~L~y~~~~~~~~~~ps~G~t~~~~~r~~g~~~d~~~~~~g~~~vr~~~~~k~~i~~~d~G~li~~~va~ 309 (309) T protein:vir:99 235 AWGPHASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIADPNIGLRGGQRVRVGESVKELVTAPDLGFFFENAVAA 309 (309) T ss_pred ccCCcEEEEEcCCCCCCcccccccceeecccccCCceeeeeeccCCceEEEEeccccchhcchhcchhhhhcccC Confidence 0011101111100000000 0000000000000000 00001 23334467666666666666666666666555 No 233 >protein:vir:103463 Length: 521 # NCBI annotation: major head subunit precursor # Family: family:all:364 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803115;genbank:gi:116326395;genbank:GeneID:4405492 Probab=61.39 E-value=0.35 Score=23.05 Aligned_cols=306 Identities=11% Similarity=0.093 Sum_probs=125.8 Q ss_pred CchhhHH----------------------HHHHHHHhhccch-hhhhccccccccCCCcce--echhhhHHHHHHHHhhc Q lcl|NC_011614. 1 MEQTQKL----------------------KLNLQHFASNNVK-PQVFNPDNVMMHEKKDGT--LLNDFTTPILQEVMENS 55 (324) Q Consensus 1 m~~~~~~----------------------~~~~~~~~~~~~~-~~~~~a~~~~~~~~~g~l--ip~~~~~~i~~~~~~~s 55 (324) ++|+--. +.+-+..+...+. .++..+-+...++..+.. .-|.+. .+.+++.... 
T Consensus 28 ~~~~~~a~~~enq~~~~~~~~~~~~~~~~~~~~~~l~e~~~~~~~~~~~~~i~es~~t~~v~~~~P~Li-~lvRra~p~L 106 (521) T protein:vir:10 28 SKQAIIAKIFENQEKDFQTAPEYKDEKIAQAFGSFLTEAEIGGDHGYNATNIAAGQTSGAVTQIGPAVM-GMVRRAIPNL 106 (521) T ss_pred chhhhhhhhhhhhhhhhhhccccchhHHHHHHhhhhhhhcccCccccccccccccccccccccCCchhh-hHHHHHHhhh Confidence 1111100 0111111110000 011111111111111111 112222 2444555666 Q ss_pred chhhhceeeecCCCceEEE----EEeCC---------------cceee-------------------------------- Q lcl|NC_011614. 56 KIMQLGKYEPMEGTEKKFT----FWADK---------------PGAYW-------------------------------- 84 (324) Q Consensus 56 ~l~~l~~~~~~~~~~~~ip----~~~~~---------------~~a~~-------------------------------- 84 (324) +..+++.+-||.++..-|- +.... +++.| T Consensus 107 Ia~DIwGVQPMTgPTGLIFAMRsrY~~q~~~~~g~eaf~~~~~ada~fSG~~~at~~s~~~~~~~~~~Gd~~~~~~~~~g 186 (521) T protein:vir:10 107 IAFDICGVQPMNSPTGQVFALRAVYGKDPIAAGAKEAFHPMYGPDAMFSGQGAAKKFAALAASTQTTVGDIYTHFFQDTG 186 (521) T ss_pred hhhhceeeccCCchhhhheeeeeeccCCccccccccccchhccccccccccccccccccccccccccccccccccccccc Confidence 7777888877765432211 00000 00000 Q ss_pred -------------------------------------e--------cc---------cccccccccceeeEEeeeeeEEE Q lcl|NC_011614. 85 -------------------------------------V--------GE---------GQKIETSKATWVNATMRAFKLGV 110 (324) Q Consensus 85 -------------------------------------v--------~E---------g~~~~~~~~~~~~v~~~~~k~~~ 110 (324) + +| +...++-..+++++++.++..+- T Consensus 187 ~~~~~~~~~~t~~~t~~d~~~~~~~~~~~~~~~~~y~~~~GmsTa~aEal~~~g~ss~~~f~EMaFsIeKvtVtAKSRaL 266 (521) T protein:vir:10 187 TVYLQASAQVTISSTADDAAKLDAEIKKQMEAGALVEIAEGMATSIAELQESFNGSTDNPWNEMGFRIDKQVIEAKSRQL 266 (521) T ss_pred cceecccccccCCCcccccccccccccccccccceeecccccchhhHhhhccCCCCccccccceeeEEEEEEEeeeccce Confidence 0 01 11245556667788888888888 Q ss_pred eehhHHHHHhcC----hhHHHHHHHHHHHHHHHHHHHHHHHhccCcCc-CC-cccccccccccce--ee------ccc-c Q lcl|NC_011614. 
111 ILPVTKEFLNYT----YSQFFEEMKPMIAEAFYKKFDEAGILNQGNNP-FG-KSIAQSIEKTNKV--IK------GDF-T 175 (324) Q Consensus 111 ~v~iS~ell~~s----~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~~-~~-~~~~~~~~~~~~~--~~------~~~-~ 175 (324) ....|-||.+|- ..|.+++|.+.|...|...+++.+|.--..+. .+ .++...-+...+. .. +.- . T Consensus 267 KAEYTiELAQDLKAVHGLDAEtELaNILSTEImlEINReii~~i~~sa~~~~~g~t~~~~~~~G~~d~~~~~d~~~~~~~ 346 (521) T protein:vir:10 267 KAAYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVVDWINYSAQVGKSGMTLTPGSKAGVFDFQDPIDIRGARWA 346 (521) T ss_pred eccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHhhhhhheeeeeeeeeeeccCccccceecccccccccchHH Confidence 899999999986 36899999999999999999999984211110 00 0000000000000 00 001 1 Q ss_pred hhHHHHHHHHhh-------h--hccCCCEEEEcHHHHHHHHHhh--c---cC-Cceee-ccCCC----cee-cccceEee Q lcl|NC_011614. 176 QDNIIDLEALLE-------D--DELEANAFISKTQNRSLLRKIV--D---PE-TKERI-YDRNS----DSL-DGLPVVNL 234 (324) Q Consensus 176 ~~~i~~~~~~l~-------~--~~~~~~~~v~~~~~~~~L~~l~--d---~~-g~~~~-~~~~~----~~l-~G~pv~~~ 234 (324) .+-+..|+.++. . .-.....++|+++....|...- + +. +...+ .+.+. |.| .|++|++. T Consensus 347 ~e~~k~L~~~i~~~an~i~~~T~r~~~n~~i~S~~Va~~L~~~~~~~~~~~~~~~~g~~~d~~~~~~~G~l~~~~~vy~D 426 (521) T protein:vir:10 347 GESFKALLFQIDKEAVEIARQTGRGEGNFIIASRNVVNVLASVDTGISYAAQGLATGFNTDTTKSVFAGVLGGKYRVYID 426 (521) T ss_pred HHHHHHHHHHHHHHHHHHHHhcccccceEEEEchHHHHHHhhcccccccccccccccccccCCCceEEEEecCceEEEec Confidence 122233444432 2 1133356899999999997531 1 11 11112 22222 233 34566653 Q ss_pred cCccCCCceEEEeecccEEEEEecceEEE----EeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccce--- Q lcl|NC_011614. 235 KSSNLKRGELITGDFDKLIYGIPQLIEYK----IDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAF--- 307 (324) Q Consensus 235 ~~~~~~~~~i~~gd~~~~~~~~~~~~~i~----~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~--- 307 (324) +. .+..-+ .+|+.++..++ +.....+......| + .-|| =.+=...|++. ..+|=+. 
T Consensus 427 ~y--~~~dy~--------~vG~KG~~~~~~glfyaPYv~l~~~~~~d--p-~sfq---P~~g~~tRY~l-~~NP~~~~~~ 489 (521) T protein:vir:10 427 QY--AKQDYF--------TVGYKGPNEMDAGIYYAPYVALTPLRGSD--P-KNFQ---PVMGFKTRYGI-GINPFAESAA 489 (521) T ss_pred CC--CCcceE--------EEEEeCCcccccceeeccccccccccccC--C-cccc---ceeeeeeeece-eecCcccccC Confidence 32 222222 23333222211 11111111111111 0 1132 22334567777 4455221 Q ss_pred ----EEEEeeccCCCCccccC Q lcl|NC_011614. 308 ----AKLVPADAKPSSVPGEV 324 (324) Q Consensus 308 ----~~l~~~~~~~~~~~~~~ 324 (324) .+|+...|.-....+++ T Consensus 490 ~~~~~~i~~~~~~~~a~~~~~ 510 (521) T protein:vir:10 490 QAPASRIQSGMPSILNSLGKN 510 (521) T ss_pred Cccceeecccchhhhcccccc Confidence 23333344444445555 No 234 >protein:vir:105464 Length: 346 # NCBI annotation: putative phage major capsid protein # Family: family:all:701 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529874;genbank:gi:90592614;genbank:GeneID:3974528 Probab=61.04 E-value=0.36 Score=23.01 Aligned_cols=278 Identities=7% Similarity=-0.013 Sum_probs=120.4 Q ss_pred cccCCCcceechhhhHHHHHHHHhhcchhh-------hceeeecCCCceEEEEEeCC--cceeeecccccc-ccccccee Q lcl|NC_011614. 30 MMHEKKDGTLLNDFTTPILQEVMENSKIMQ-------LGKYEPMEGTEKKFTFWADK--PGAYWVGEGQKI-ETSKATWV 99 (324) Q Consensus 30 ~~~~~~g~lip~~~~~~i~~~~~~~s~l~~-------l~~~~~~~~~~~~ip~~~~~--~~a~~v~Eg~~~-~~~~~~~~ 99 (324) |+ .-.-+.++..+.+.+.....-.. ...+.-.++.+++||..+.. -.-+-..-|-.. ..-+.++. T Consensus 1 Ma-----inya~~~~~~Ld~~~~~~~lts~~l~~~~~~~~v~~~ggktVkIp~is~tsGl~DY~R~~g~~~~g~v~~~~e 75 (346) T protein:vir:10 1 MT-----INYAEKYQAAVQQAFYDGHLYSAELWNSPSNSIIKFDGAKHIKVPRLEITSGRKDRQRRTITTPVANYSNDWD 75 (346) T ss_pred Cc-----chhHHHHHHHHHHHHHhhhccchhhcccccccceEecCCCEEEEEEeeeecccccccccCCccccccccccee Confidence 11 00123445555444444321101 11233356788999998632 221212222211 22344555 Q ss_pred eEEeeeeeEEEe-ehhHHHHHhcC--hhHHHHHHHHHHHHHHHHHHHHHHHhccCcCcCCcccccccccccceeecccch Q lcl|NC_011614. 
100 NATMRAFKLGVI-LPVTKEFLNYT--YSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQ 176 (324) Q Consensus 100 ~v~~~~~k~~~~-v~iS~ell~~s--~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~~~~~~~~~~~~~~~~~~~~~~~~ 176 (324) ..++.-.+.-.+ +.--+ ++.+ ...+...+.+...+.++=.+|...|.---+.... .........+.+.+.-+ T Consensus 76 t~tl~qDR~~~F~vD~mD--vDETn~~~~~anv~~ef~r~~vvPEiDayrfskLa~~a~~---~~~~~~~~~a~T~~ni~ 150 (346) T protein:vir:10 76 SYELKNERYWSTLVDPSD--IDETNMVVSLANITKQFNLDSKMPEKDRYMFSHLYSGKEA---AHDGGITTNTLDEKNIL 150 (346) T ss_pred EEEeeccccceecccccc--hHHHHHHhHHHHHHHHHHHHhhcchhhHHHHHHHHHhhhh---hccccccccccCHHHHH Confidence 666665554333 22111 1111 1234444445555556667777655321000000 00001111112233456 Q ss_pred hHHHHHHHHhhhhcc--CCCEEEEcHHHHHHHHHhhcc-----CCceeeccCCCceecccceEeecCccCC-CceEEEe- Q lcl|NC_011614. 177 DNIIDLEALLEDDEL--EANAFISKTQNRSLLRKIVDP-----ETKERIYDRNSDSLDGLPVVNLKSSNLK-RGELITG- 247 (324) Q Consensus 177 ~~i~~~~~~l~~~~~--~~~~~v~~~~~~~~L~~l~d~-----~g~~~~~~~~~~~l~G~pv~~~~~~~~~-~~~i~~g- 247 (324) +.+.+++.++..+.. .+..++++|..+..|.+...- .++.....+..+++.|+||+..++.-+. +..+.-| T Consensus 151 ~~i~~~~~~lde~~vp~~~rvl~vTp~~~~lLk~s~~f~k~~~v~~~~~i~~~V~siDGv~Ii~VPs~r~~t~~~f~~G~ 230 (346) T protein:vir:10 151 PAFDNMMLDFDEARIPSTNRILYVTPKTNAILKRAEAMNRALTLKDPNNIQRTVYSLDDVTIRVVPSDLMQTAYDFSDGS 230 (346) T ss_pred HHHHHHHHHHHHccCCCCCeEEEECHHHHHHHhhchhheeccccccccccceeeeeecCeEEEEcchhhcccchhhccCc Confidence 778888888877665 335678999999988643211 1122223556678999999875533221 1111111 Q ss_pred ----ec--ccEEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccc---eEEEEeeccCCC Q lcl|NC_011614. 248 ----DF--DKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKA---FAKLVPADAKPS 318 (324) Q Consensus 248 ----d~--~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a---~~~l~~~~~~~~ 318 (324) +- -++++. .....+-+..+..........+ ..|.-.+.-..+.|.-|.+.+. ++-++.+.+.+. 
T Consensus 231 ~~~t~ak~INfiiv-~~~A~ia~~K~~~~~if~P~~~------~~g~~l~~~R~Y~D~fv~~nk~~~Iyv~~~~a~~~~~ 303 (346) T protein:vir:10 231 KIIDTAKQIEMFLI-YNGVQIAPEKYSFVGFDQPSAA------TSGNYLYYEQSYDDVLLLNTKTKGIQFVVSDKPKKDQ 303 (346) T ss_pred cccCCccceeEEEE-CCceeeeeeeeeeeEeeCCCCC------cccceeeeeeeeeeeeeeccccceEEEeeecccccCc Confidence 00 012222 2233333333322222222111 1222233344566776666443 333455555555 Q ss_pred Ccccc-C Q lcl|NC_011614. 319 SVPGE-V 324 (324) Q Consensus 319 ~~~~~-~ 324 (324) .+++. + T Consensus 304 ~~~~~~~ 310 (346) T protein:vir:10 304 EQSGQDA 310 (346) T ss_pred cCccccc Confidence 55443 3 No 235 >protein:vir:102335 Length: 312 # NCBI annotation: putative capsid protein # Family: family:all:701 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529560;genbank:gi:90592716;genbank:GeneID:3974467 Probab=59.48 E-value=0.39 Score=22.82 Aligned_cols=277 Identities=12% Similarity=0.073 Sum_probs=120.7 Q ss_pred ccCCCcceechhhhHHHHHHHHhhcchhh-h---c-eeeecCCCceEEEEEeCCcce-eeeccccccc--ccccceeeEE Q lcl|NC_011614. 31 MHEKKDGTLLNDFTTPILQEVMENSKIMQ-L---G-KYEPMEGTEKKFTFWADKPGA-YWVGEGQKIE--TSKATWVNAT 102 (324) Q Consensus 31 ~~~~~g~lip~~~~~~i~~~~~~~s~l~~-l---~-~~~~~~~~~~~ip~~~~~~~a-~~v~Eg~~~~--~~~~~~~~v~ 102 (324) ++.+ .-..+.+...+-+.+...+ +.. | . .+.-.++.+++||..+...-. +-..-+.... +-+.+++..+ T Consensus 1 Mant--l~ya~~~~~~LD~~~~~~~-~s~~l~~~~~~v~~~ggktVkIp~i~~~gl~DY~R~~g~~~~~g~v~~~~et~t 77 (312) T protein:vir:10 1 MANT--LAYGQVLQQGLDKQATQEL-LTGWMDSNAKQIKYEGGKEVKIGKLSTDGLGDYSRGSANAYVGGDVKFEYETKT 77 (312) T ss_pred CCcc--hhHHHHHHHHHHHHHHhhh-ccccccCCCceEEEecCcEEEEEeeecccccccccccCCccccccccccceeEE Confidence 1111 1122444554433333332 222 1 1 122356788999987643322 2222222222 2234455666 Q ss_pred eeeeeEEEe-ehhHHHHHhcC--hhHHHHHHHHHHHHHHHHHHHHHHHhccCcCcCCcccccccccccceeecccchhHH Q lcl|NC_011614. 
103 MRAFKLGVI-LPVTKEFLNYT--YSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNI 179 (324) Q Consensus 103 ~~~~k~~~~-v~iS~ell~~s--~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i 179 (324) +.-.+.-.+ +.--+ ++.+ ...+...+.+...+.+.=.+|...|.---......+.... .....+.+.+.-++.+ T Consensus 78 l~qDR~~~F~vD~mD--vDETn~~~s~anv~~ef~r~~vvPEiDayrfskla~~a~~~~~~~~-~~~~~~~T~~ni~~~i 154 (312) T protein:vir:10 78 MTQDRGRKFTLDAMD--VDETNFLVTATTVMGEFQRLKVIPEIDAYRLSRLATIAIGIKGDTN-VEYSYSVNSSTIINKI 154 (312) T ss_pred eeecccceeeccccc--hhhHhhHHHHHHHHHHHHHhhhcchhhHHHHHHHHhhhhccccccc-cccccccCHHHHHHHH Confidence 665554333 32222 1222 2345556666677777778888766421110000000000 0111112233446667 Q ss_pred HHHHHHhhhhccC-CCEEEEcHHHHHHHHHhhc-----cCCceeeccCCCceecccceEeecCccC-CCceEEEee---- Q lcl|NC_011614. 180 IDLEALLEDDELE-ANAFISKTQNRSLLRKIVD-----PETKERIYDRNSDSLDGLPVVNLKSSNL-KRGELITGD---- 248 (324) Q Consensus 180 ~~~~~~l~~~~~~-~~~~v~~~~~~~~L~~l~d-----~~g~~~~~~~~~~~l~G~pv~~~~~~~~-~~~~i~~gd---- 248 (324) .+++.++.+++.. +..++|.|..+..|.+-.. .+..........++|.|+||+..++.-+ .+..+.-|- T Consensus 155 ~~~~~~lde~~vp~~rvl~vTp~~~~lLk~~~~~~~~~~~~~~~~i~~~V~~iDgv~Ii~VPs~r~~t~~~f~dG~t~~~ 234 (312) T protein:vir:10 155 KTGIKIIRENGYNGPLVCHLTYDSMFAIEEKVLEKLTAVTFAQGGIQTQVPSIDGCALIKTPQNRMYSSILLNDGTTSNQ 234 (312) T ss_pred HHHHHHHHHccCCCceEEEeChHHHHHHhhhhhceecccccccceeeeeeeeecccEEEEchhhhccceeeeccCccccc Confidence 7888888886654 4567889998877764210 1111122345557899999987654332 111111110 Q ss_pred ----c--------ccEEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccc-eEEEEeecc Q lcl|NC_011614. 249 ----F--------DKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKA-FAKLVPADA 315 (324) Q Consensus 249 ----~--------~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a-~~~l~~~~~ 315 (324) | =++++. .....+-+..+...... .++.. -..|.-.+.-..+.|.-|.+.+. 
-+.+..+++ T Consensus 235 ~~gg~~~~~~ak~INfiiv-~~~a~i~~~K~~~~~if-~P~~~----~~~d~~~~~~R~Y~D~fv~~nk~~~Iyv~~k~a 308 (312) T protein:vir:10 235 TAGGYLKGTKALDTNFIIA-PVDVPLAITKQDKMRIF-DPETN----QTANAWSMDYRRYHDLWVTDNKANSVYANFKDA 308 (312) T ss_pred ccCceeecCcccccceEEe-CCceeeceeeeeeeeee-CCCCC----CCcceeeeeeeeeeeeeeeccccCeEEEEeecc Confidence 0 012222 22233333333222211 11100 01122233344566766666433 223555556 Q ss_pred CCCC Q lcl|NC_011614. 316 KPSS 319 (324) Q Consensus 316 ~~~~ 319 (324) .|.+ T Consensus 309 ~~~~ 312 (312) T protein:vir:10 309 KPVG 312 (312) T ss_pred cCCC Confidence 6655 No 236 >protein:vir:79712 Length: 285 # NCBI annotation: major capsid protein gp34 # Family: family:all:701 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285883;genbank:gi:148750840;genbank:GeneID:5220414 Probab=55.80 E-value=0.47 Score=22.38 Aligned_cols=259 Identities=14% Similarity=0.075 Sum_probs=113.2 Q ss_pred ccCCCcceechhhhHHHHHHHHhhcchhhhc------eeeecCCCceEEEEEeC--CcceeeecccccccccccceeeEE Q lcl|NC_011614. 31 MHEKKDGTLLNDFTTPILQEVMENSKIMQLG------KYEPMEGTEKKFTFWAD--KPGAYWVGEGQKIETSKATWVNAT 102 (324) Q Consensus 31 ~~~~~g~lip~~~~~~i~~~~~~~s~l~~l~------~~~~~~~~~~~ip~~~~--~~~a~~v~Eg~~~~~~~~~~~~v~ 102 (324) ++ .-.-+.+...+.+.....+....+. .+...++.+++||..+. +-..+-.+-|-....-..++...+ T Consensus 1 Ma----in~~~k~~~~ld~~~~~~~~~~~l~~~~n~~~~~~~gak~VkIp~ist~~gl~dY~R~~g~~~g~v~~~~et~t 76 (285) T protein:vir:79 1 MT----VVLDSKDLARIDEEYKADSQVWSYLTGGNGVTQRFRGHNEVRINKLSGFVDATAYKRGQDNARKTISVGKETVK 76 (285) T ss_pred Cc----chhhHHHHHHHHHHHHHhhhhhhhcccCCcceeEecCCCEEEEeeecccccccccccccCccccccceeeeEEE Confidence 11 1123455666666666655554442 23445677899999853 222232333433334445555666 Q ss_pred eeeeeEEEe-ehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHhccCcCcCCcccccccccccceeecccchhHHHH Q lcl|NC_011614. 
103 MRAFKLGVI-LPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIID 181 (324) Q Consensus 103 ~~~~k~~~~-v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~ 181 (324) +.-.+.-.+ +.--+. -+.....+...+.+...+.+.=.+|...|.---++. ....+.+.+.+.-++.+.+ T Consensus 77 l~~DR~~~f~iD~mDv-dEn~~~~~~ni~~ef~~~~vvPEiDayrfskla~~a--------~~~~~~~~T~~nv~~~i~~ 147 (285) T protein:vir:79 77 LTHEDWFGYDLDQFDM-DENGAYTVENVVREHNKMITIPHRDKVAVQKLFDSA--------AKKATDSITKDNALDAYDT 147 (285) T ss_pred eeccccceecccccch-hhhhhhhHHHHHHHHHhhhhcchhhHHHHHHHHhhc--------ccccccccCHHHHHHHHHH Confidence 665553332 322121 111122333334444445555567765443211100 0111111223334677788 Q ss_pred HHHHhhhhcc-CCCEEEEcHHHHHHHHHhhccC-----Cceee---ccCCCceecc-cceEeecCccCCCc------eEE Q lcl|NC_011614. 182 LEALLEDDEL-EANAFISKTQNRSLLRKIVDPE-----TKERI---YDRNSDSLDG-LPVVNLKSSNLKRG------ELI 245 (324) Q Consensus 182 ~~~~l~~~~~-~~~~~v~~~~~~~~L~~l~d~~-----g~~~~---~~~~~~~l~G-~pv~~~~~~~~~~~------~i~ 245 (324) ++.++...+. .+-.++|+|..+..|.+.+.-. .+... .....+.|.| .|++..++.-+... .++ T Consensus 148 ~~~~lde~~vp~~rvl~vTp~~~~~Lk~s~~~~r~~~~~~~~~~~~i~~~V~~lDg~v~ii~Vps~r~kt~~~~k~Infi 227 (285) T protein:vir:79 148 AEAYMFDNEVPGGFVMFVSSAYYTALKQSAAVTRTFSTDGTMVINGIDRRVAQLDGGVPIVRVSSDRLKGLGITNHVNFI 227 (285) T ss_pred HHHHHHHcCCCCceEEEEChHHHHHHHhhhhhheecccccceeccceeeeeccccceeEEEEcchhhccCcCcchhccEE Confidence 8888887765 3346789999999886543211 11111 1233567888 89987665444321 122 Q ss_pred EeecccEEEEEecceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceE-EEEeeccC Q lcl|NC_011614. 246 TGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFA-KLVPADAK 316 (324) Q Consensus 246 ~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~-~l~~~~~~ 316 (324) +...+. .+...+--.+.+. .+..++++ |.-.+.-..+.|.=|.+.+.=. .+..+++. 
T Consensus 228 iv~~~a-~i~~~K~~~~~~f-----~P~~~~~~--------d~~~~~~R~Y~d~fv~~nk~~~Iy~~~~a~~ 285 (285) T protein:vir:79 228 LTPLSA-IAPIVKYDSVSVI-----DPSTDRSG--------NRWTIKGLSYYDAIVLDNAKKGIYVAATAGV 285 (285) T ss_pred EecCce-eccceeeeeeEeE-----CCCCCCCc--------ceeeeeeeeeeeeeehhhccceeeeeecccC Confidence 222221 1211111111111 01111111 1122333345565555533211 12222222 No 237 >protein:vir:7214 Length: 521 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049787;genbank:gi:9632597;genbank:GeneID:1258751 Probab=52.04 E-value=0.56 Score=21.94 Aligned_cols=306 Identities=12% Similarity=0.101 Sum_probs=128.1 Q ss_pred CchhhHHHHHHHH---H-----h---------hccchhh------hhccccccccCCCcce--echhhhHHHHHHHHhhc Q lcl|NC_011614. 1 MEQTQKLKLNLQH---F-----A---------SNNVKPQ------VFNPDNVMMHEKKDGT--LLNDFTTPILQEVMENS 55 (324) Q Consensus 1 m~~~~~~~~~~~~---~-----~---------~~~~~~~------~~~a~~~~~~~~~g~l--ip~~~~~~i~~~~~~~s 55 (324) ++|+--.+.+-.+ + . ...+.+. ++.+-+...++..+.. .-|.+. .+.+++.... T Consensus 28 ~~~~~~a~~~enq~~~~~~~~~~~~~~~~~~~~~~l~e~~~~~~~~~~~~~iaes~~t~~v~~~~P~Li-~lvRra~p~L 106 (521) T protein:vir:72 28 SKQAIIAKIFENQEKDFQTAPEYKDEKIAQAFGSFLTEAEIGGDHGYNATNIAAGQTSGAVTQIGPAVM-GMVRRAIPNL 106 (521) T ss_pred chhhhhhhhhhhhhhhhhhcccccchHHHHHHhhhhhhhcccCccccCcccccccccccccccCCchhh-hHHHHHHhhh Confidence 2111100000000 0 0 0111111 1111111111111111 112222 2444455666 Q ss_pred chhhhceeeecCCCceEEE----EE-eCC--------------cce---------------------------------- Q lcl|NC_011614. 56 KIMQLGKYEPMEGTEKKFT----FW-ADK--------------PGA---------------------------------- 82 (324) Q Consensus 56 ~l~~l~~~~~~~~~~~~ip----~~-~~~--------------~~a---------------------------------- 82 (324) +..+++.+-||.++..-|- +. ... 
+.+ T Consensus 107 Ia~DIwGVQPMTgPTGLIFAMRsrY~~q~~~~~g~ea~~~e~~~da~fSG~~~~~~~~~~~~~~~~a~Gd~~~~~~~~~g 186 (521) T protein:vir:72 107 IAFDICGVQPMNSPTGQVFALRAVYGKDPVAAGAKEAFHPMYGPDAMFSGQGAAKKFPALAASTQTTVGDIYTHFFQETG 186 (521) T ss_pred hhhhceeeccCCchhhhheeeeeeecCCCCCcccccccchhccccccccccccccccccccccccccccccccccccccc Confidence 6777777777765432111 00 000 000 Q ss_pred -----------------------------------e--------eecc---------cccccccccceeeEEeeeeeEEE Q lcl|NC_011614. 83 -----------------------------------Y--------WVGE---------GQKIETSKATWVNATMRAFKLGV 110 (324) Q Consensus 83 -----------------------------------~--------~v~E---------g~~~~~~~~~~~~v~~~~~k~~~ 110 (324) . -.+| +...++-..+++++++.++..+- T Consensus 187 t~~~~~~~~~~~~~g~t~~~~t~~~v~~~~~a~~~y~~g~gm~Ta~aEal~~~g~ss~~~f~EMaFsIeK~tVtAKSRaL 266 (521) T protein:vir:72 187 TVYLQASVQVTIDAGATDAAKLDAEIKKQMEAGALVEIAEGMATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSRQL 266 (521) T ss_pred ccccccccccccCCCCCCccccccccccccccCceeeeecccchhhhhhhcccCCcccccccceeeEEEEEEEeeeccce Confidence 0 0011 11134444555788888888888 Q ss_pred eehhHHHHHhcC----hhHHHHHHHHHHHHHHHHHHHHHHHhccCcCc-CCc-ccccccccccce--ee------ccc-c Q lcl|NC_011614. 111 ILPVTKEFLNYT----YSQFFEEMKPMIAEAFYKKFDEAGILNQGNNP-FGK-SIAQSIEKTNKV--IK------GDF-T 175 (324) Q Consensus 111 ~v~iS~ell~~s----~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~~-~~~-~~~~~~~~~~~~--~~------~~~-~ 175 (324) ....|-||.+|- ..|.+++|.+.|...|...+++.+|.--..+. .+. ++...-+...+. .. +.- . T Consensus 267 KAEYTiELAQDLKAVHGLDAEtELaNILSTEImlEINReii~~i~~sa~~g~~g~t~~~~~~~G~~d~~~~~d~~~~~~~ 346 (521) T protein:vir:72 267 KAAYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVVDWINYSAQVGKSGMTLTPGSKAGVFDFQDPIDIRGARWA 346 (521) T ss_pred eccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHhhhhhheeeeeeeeeeeccCccccceecccccccccchHH Confidence 899999999986 36899999999999999999999984211110 000 000000000000 00 001 1 Q ss_pred hhHHHHHHHHhh-------h--hccCCCEEEEcHHHHHHHHHhh--cc---CC-ceee-ccCCC----cee-cccceEee Q lcl|NC_011614. 
176 QDNIIDLEALLE-------D--DELEANAFISKTQNRSLLRKIV--DP---ET-KERI-YDRNS----DSL-DGLPVVNL 234 (324) Q Consensus 176 ~~~i~~~~~~l~-------~--~~~~~~~~v~~~~~~~~L~~l~--d~---~g-~~~~-~~~~~----~~l-~G~pv~~~ 234 (324) .+-+..|+.++. . .-.....++|+++....|...- +. .+ ...+ .+.+. |.| .|++|++. T Consensus 347 ~e~~k~L~~~i~~~an~i~~~T~r~~~n~~i~S~~Va~~L~~~~~~~~~~~~~~~~g~~~d~~~~~~~G~l~~~~~vy~D 426 (521) T protein:vir:72 347 GESFKALLFQIDKEAVEIARQTGRGEGNFIIASRNVVNVLASVDTGISYAAQGLATGFSTDTTKSVFAGVLGGKYRVYID 426 (521) T ss_pred HHHHHHHHHHHHHHHHHHHHhcccccceEEEEchHHHHHHhhcccccccccccccccccccCCCceEEEEccCceEEEec Confidence 122233444432 2 1133356899999999997531 11 10 1112 22222 223 34666653 Q ss_pred cCccCCCceEEEeecccEEEEEecceEEE----EeecccccccccccccchhhhhcCcEEEEEEEEeccEEecc------ Q lcl|NC_011614. 235 KSSNLKRGELITGDFDKLIYGIPQLIEYK----IDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADD------ 304 (324) Q Consensus 235 ~~~~~~~~~i~~gd~~~~~~~~~~~~~i~----~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~------ 304 (324) +. .+..-+ .+|+.++..++ +.....+......| + .-|| =.+=...|++. ..+| T Consensus 427 ~y--~~~dy~--------~vG~KG~~~~~~glfyaPYv~l~~~~~~d--p-~sfq---P~~g~~tRY~l-~~NP~~~~~~ 489 (521) T protein:vir:72 427 QY--AKQDYF--------TVGYKGPNEMDAGIYYAPYVALTPLRGSD--P-KNFQ---PVMGFKTRYGI-GINPFAESAA 489 (521) T ss_pred CC--CCcceE--------EEEEeCCcccccceeeccccccccccccC--C-cccc---ceeeeeeeece-eecCcccccC Confidence 32 222222 23333222211 11111111111111 0 1132 22334567777 4455 Q ss_pred -cceEEEEeeccCCCCccccC Q lcl|NC_011614. 
305 -KAFAKLVPADAKPSSVPGEV 324 (324) Q Consensus 305 -~a~~~l~~~~~~~~~~~~~~ 324 (324) +-..+++...|......+++ T Consensus 490 ~~~a~~i~~~~~~~~a~~~~~ 510 (521) T protein:vir:72 490 QAPASRIQSGMPSILNSLGKN 510 (521) T ss_pred cccceeecCcChhhhcCcccc Confidence 22356677667666667777 No 238 >protein:vir:79078 Length: 307 # NCBI annotation: gp8 # Family: family:all:908 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111208;genbank:gi:134288798;genbank:GeneID:4960752 Probab=50.57 E-value=0.61 Score=21.78 Aligned_cols=270 Identities=9% Similarity=-0.016 Sum_probs=107.5 Q ss_pred cccCCCcceechhhhHHHHHHHHhhcch-hhhceeeecCCCceEEEEEeCCcceeee--ccccccc-ccccc---eeeEE Q lcl|NC_011614. 30 MMHEKKDGTLLNDFTTPILQEVMENSKI-MQLGKYEPMEGTEKKFTFWADKPGAYWV--GEGQKIE-TSKAT---WVNAT 102 (324) Q Consensus 30 ~~~~~~g~lip~~~~~~i~~~~~~~s~l-~~l~~~~~~~~~~~~ip~~~~~~~a~~v--~Eg~~~~-~~~~~---~~~v~ 102 (324) +++-....++ ..+.+.+-...++..-+ -.+++.+++.....+|+++.. ++.-+ .+-+... ....+ ++... T Consensus 1 m~~~~~~~~~-dp~LT~~A~gy~n~~~Iad~lfP~vpV~~~~~k~~~f~~--e~f~~~~t~ra~~~~~~~v~~~~~~~~~ 77 (307) T protein:vir:79 1 MGRLSKLRIV-DPVLTNLAIGYTNAEFIGQTLMPVVEVEKEGGKIPKFGK--ESFRLYQTERALRAKSNRMNPEDIDSVD 77 (307) T ss_pred CCCCCCCccc-CHHHHHHHhhccchhhhhhhcCCcccccccccceeeecc--ccccccccccccCCCcceeeeecccccc Confidence 3333333333 33333333333322223 245678888877788887742 22111 1111111 11122 22223 Q ss_pred eeeeeEEEeehhHHHHHhcChhHHHHHHHHHHHHHHHHHHHHHH---HhccCc-CcCCcccccccccccceeecccchhH Q lcl|NC_011614. 103 MRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAG---ILNQGN-NPFGKSIAQSIEKTNKVIKGDFTQDN 178 (324) Q Consensus 103 ~~~~k~~~~v~iS~ell~~s~~~~~~~v~~~l~~ai~~~~d~a~---l~g~g~-~~~~~~~~~~~~~~~~~~~~~~~~~~ 178 (324) +.....+-..+|....-..+..++++.-.+.+.+.|.+..|..+ +....+ .......+. 
++......+.....+ T Consensus 78 ~~~~~~~l~~~id~r~~~~~~~~~~~~Av~~l~d~I~l~~E~~~A~l~~~~~~y~~~~k~tLs--gt~~Wsd~~sDPi~d 155 (307) T protein:vir:79 78 VNLDEHDLEYPIDYREDQESAFPLEQAAVQTATDAIQLRREKMIADLSQNPSSYAAGNKKQLS--ATEKFTAANSDPVGV 155 (307) T ss_pred ccccccchhhcccchhcCCCCCCHHHHHHHHHHHHHHhHHHHHHHHHhccccccCCCceEEEc--cCcccCCCCCCcHHH Confidence 33333333344444333344556777767777777766666422 221111 011111111 122222334556667 Q ss_pred HHHHHHHhh-hhccCCCEEEEcHHHHHHHHH---hh---ccCCceeeccCCCceecccc-eEeecCccC---CCceEEEe Q lcl|NC_011614. 179 IIDLEALLE-DDELEANAFISKTQNRSLLRK---IV---DPETKERIYDRNSDSLDGLP-VVNLKSSNL---KRGELITG 247 (324) Q Consensus 179 i~~~~~~l~-~~~~~~~~~v~~~~~~~~L~~---l~---d~~g~~~~~~~~~~~l~G~p-v~~~~~~~~---~~~~i~~g 247 (324) |.+...++. ..+..+..++|.+..|.+|.. +. ...+.-++....-..++|+. |.+-.+.-. ++..-+.| T Consensus 156 i~~~~~ai~~~~g~~Pn~~vlg~~a~~~l~~h~~i~~~lk~~~~g~it~~~la~l~~v~~V~vg~a~y~~~~~~~~~iw~ 235 (307) T protein:vir:79 156 IEDGKEAIRTKIGRRPNTMVIGASAYKTLKAHPQLIEKIKYSMKGIVTVDLLKEIFEVENIAVGEAIYADDKDRFTDIWG 235 (307) T ss_pred HHHHHHHHHHhhCCccceEEeCHHHHHHHhcCHHHHHHhcCccccccCHHHHHHHhCceeEEEeeeeeecccccchhcCC Confidence 777666665 457889999999999988853 11 11122122211223455554 222111100 00000001 Q ss_pred ecccEEEEEec---------------ceEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEe Q lcl|NC_011614. 248 DFDKLIYGIPQ---------------LIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVP 312 (324) Q Consensus 248 d~~~~~~~~~~---------------~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~ 312 (324) + .+++.+.. +.++.. +. ....+. .....+.-.+|+.....-.+.-|++=..++. T Consensus 236 ~--~~~l~y~~~~~~~~~~~~~~ps~Gyt~~~--~g--~~~~d~-----~~~~~~~~~vrv~~~~~~~i~~~~~G~li~~ 304 (307) T protein:vir:79 236 A--NIVLAYVPLQRGGQQRTPYEPSYGYTLRK--KG--NPVVDT-----RIEDGKLELVRATDIFRPYLLGADAGYLISG 304 (307) T ss_pred C--ceEEEecccccCCCCCcccccccceeEEe--cC--ceEEec-----ccCCCceeEEeecccccceeeccccchhhcc Confidence 0 11111100 000000 00 000000 0011223335555555555555555444444 Q ss_pred ecc Q lcl|NC_011614. 
313 ADA 315 (324) Q Consensus 313 ~~~ 315 (324) +.- T Consensus 305 ~v~ 307 (307) T protein:vir:79 305 ING 307 (307) T ss_pred CCC Confidence 332 No 239 >protein:vir:103370 Length: 418 # NCBI annotation: hypothetical protein # Family: family:all:11266 # MgeID: mge:1621 # MgeName: PaP2 # Cross-refs: genbank:acc:YP_024741;genbank:gi:48697083;genbank:GeneID:2846038 Probab=49.49 E-value=0.64 Score=21.66 Aligned_cols=305 Identities=11% Similarity=0.004 Sum_probs=126.9 Q ss_pred CchhhHHHHHHHHHhhccchh----------hhhccccccc--c-------------------CCCc--c-eechhhhHH Q lcl|NC_011614. 1 MEQTQKLKLNLQHFASNNVKP----------QVFNPDNVMM--H-------------------EKKD--G-TLLNDFTTP 46 (324) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~----------~~~~a~~~~~--~-------------------~~~g--~-lip~~~~~~ 46 (324) =..-|-.+.++|.|++..+.+ ..-++.-.+. + ..++ . .++. .+ T Consensus 9 ~~~~~~~~~~~~~~~~~~~~~~PN~~~pll~li~~g~~~ta~ast~~w~~d~~~~~~~~~ta~a~a~~T~l~ve~---~~ 85 (418) T protein:vir:10 9 NTTLNPQELNMKSFAGTILRRVPNGSAPLLAMTSVVGSTTAKASTHGYFSKTMVFASAVVTAEAAADATVLTVEN---SD 85 (418) T ss_pred ccCCChhhhchhhhhhhhhhhcCCcchhhhhhhhcccccccceeEEEEEEEEEeeeeEEEEEEEecCceEEEEcC---cc Confidence 123334455555555532211 1111110000 0 0000 0 1111 11 Q ss_pred HHHHHHhhcch-----hhhceeeecCCCceEEEEEeCCcceeeec-------------ccccccccccce-eeEEeeeee Q lcl|NC_011614. 
47 ILQEVMENSKI-----MQLGKYEPMEGTEKKFTFWADKPGAYWVG-------------EGQKIETSKATW-VNATMRAFK 107 (324) Q Consensus 47 i~~~~~~~s~l-----~~l~~~~~~~~~~~~ip~~~~~~~a~~v~-------------Eg~~~~~~~~~~-~~v~~~~~k 107 (324) + +.+...+ ..+.++..+.+...++-+-..+..|+-++ ||++.++....- ..+.=-.+= T Consensus 86 ~---f~~~~l~~~~~~~Evirv~sVng~~lTV~Rg~~~t~aaaia~n~~~~~Ig~~~eEGsd~~ta~~~k~~~vsNvtQI 162 (418) T protein:vir:10 86 G---LTKGMIFYNEATGENMRLELVNGLNLTVKRQTGRISAAIIAANTKLIVIGTAFEEGSQRPTARSIQPVYVPNFTQI 162 (418) T ss_pred e---eccccEEEEccCCeEEEEEEEeCCEEEEEEecCCeeEEEEecCceEEEeccccccccccCCcceecceeccchhhh Confidence 1 1111111 11334444555555555544444433332 455544332111 111100111 Q ss_pred EEEeehhHHHHHhc-C--h-hH-HHHHHHHHHHHHHHHHHHHHHHhcc----CcCcCC----cccccccc---cccceee Q lcl|NC_011614. 108 LGVILPVTKEFLNY-T--Y-SQ-FFEEMKPMIAEAFYKKFDEAGILNQ----GNNPFG----KSIAQSIE---KTNKVIK 171 (324) Q Consensus 108 ~~~~v~iS~ell~~-s--~-~~-~~~~v~~~l~~ai~~~~d~a~l~g~----g~~~~~----~~~~~~~~---~~~~~~~ 171 (324) +.-.+.||...... . . .+ +++...+.+-++ ..+|+++|+|. +++..+ .+++..+. ..+...+ T Consensus 163 F~~avsvSgTaqAs~~q~Gvsn~~ese~drk~~~a--v~iEkalI~G~~~~~~~~~g~~R~m~GIl~~vr~~~~gnVv~a 240 (418) T protein:vir:10 163 FRNAWALTDTARASYAEAGYSNITESRRDCMDFHA--TEQETAIFFGQAFMGTYNGQPLHTTQGIVDAVRQYAPDNVNAM 240 (418) T ss_pred hhhhhhhhhhhhhccccccCchHHHHHHHHHHHHH--HHHHHHHhcccccCCCcCCcchhhHHHHHHHHhhhcccceecc Confidence 22334455442221 0 1 12 344444444443 47899999995 222222 23332211 1222222 Q ss_pred ---cccchhHHHHHHHHhhhhc----cCC----CEEEEcHHHHHHHHHhhccCCceeeccCCCcee---------cccce Q lcl|NC_011614. 172 ---GDFTQDNIIDLEALLEDDE----LEA----NAFISKTQNRSLLRKIVDPETKERIYDRNSDSL---------DGLPV 231 (324) Q Consensus 172 ---~~~~~~~i~~~~~~l~~~~----~~~----~~~v~~~~~~~~L~~l~d~~g~~~~~~~~~~~l---------~G~pv 231 (324) +.++++++.++.......+ ... =.++++++....|.++- +. +-... ..+- +|+-. 
T Consensus 241 ~~~t~~s~d~l~~a~~~af~~g~~~G~~~q~~~f~~~V~~~~k~~I~k~~---~~-I~~~~-~e~~~G~vv~~~~~~~G~ 315 (418) T protein:vir:10 241 PNPTAVTYDDVVDATIDAFKWSVNVGDNTQRVMFCDTVGMRTMQDIGRFF---GE-VTVTQ-RETSYGMVFTEWKFFKGR 315 (418) T ss_pred CCCCccCHHHHHHHHHHHhhccCCCcccccceeEEEEeChHHHHHhhhhh---hh-eeecc-cceeeeEEEEEEEcceEE Confidence 3578899988877764321 111 12567888888876663 22 21111 1111 12111 Q ss_pred Ee------ecCccCCCceEEEeecccEEEEEe--cceEEEEeecccc----cccccccccchhhhhcCcEEEEEEEEecc Q lcl|NC_011614. 232 VN------LKSSNLKRGELITGDFDKLIYGIP--QLIEYKIDETAQL----STVKNEDGTPVNLFEQDMVALRATMHVAL 299 (324) Q Consensus 232 ~~------~~~~~~~~~~i~~gd~~~~~~~~~--~~~~i~~~~~~~~----~~~~~~~~~~~~~f~~~~v~~r~~~r~d~ 299 (324) +. .++..++++.++..|.+++-+.+- +.+..+..-.... ......++..++ .++++ ....+.. T Consensus 316 I~L~~~p~~~~~~lp~g~mlVvD~~~vkL~~L~~R~~~~E~l~k~G~~~~~~~~~~~~~~~~D-~~kG~----iv~E~tL 390 (418) T protein:vir:10 316 LILKEHPLFSAIGISPGFAVVVDVPAVKLAYMDGRNAKVENYGQGGGENKSGATDYSYGHGVD-AQGGS----LTSEWAL 390 (418) T ss_pred EEeecccccccccCCCceEEEEccccceEEEeccccccchhcccCCCcccccccccccccccc-cccce----EEEEeee Confidence 11 134467888899999887766554 5555555432210 000001111111 23333 3567888 Q ss_pred EEecccceEEEEee----ccCCCCcccc Q lcl|NC_011614. 300 HIADDKAFAKLVPA----DAKPSSVPGE 323 (324) Q Consensus 300 ~v~~~~a~~~l~~~----~~~~~~~~~~ 323 (324) .+++|.|.+++++. ...+...|+- T Consensus 391 e~~N~~a~avitgl~~~~~~~~~t~p~~ 418 (418) T protein:vir:10 391 ELLNPQGCAVITGLQKAKERVYLTAPAP 418 (418) T ss_pred eeecccceEEeeccceecccccCCCCCC Confidence 99999999998632 1222222333 No 240 >protein:vir:106590 Length: 349 # NCBI annotation: putative major head protein # Family: family:all:1083 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958585;genbank:gi:41179245;genbank:GeneID:2717126 Probab=43.39 E-value=0.85 Score=20.98 Aligned_cols=285 Identities=13% Similarity=0.029 Sum_probs=106.4 Q ss_pred chhhHHHHHHHHHhhc---cchhhhhccccccccCCCcceechhhhHHHHHHHHhhcchhh-hceeeecCCCceEEEEEe Q lcl|NC_011614. 
2 EQTQKLKLNLQHFASN---NVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQ-LGKYEPMEGTEKKFTFWA 77 (324) Q Consensus 2 ~~~~~~~~~~~~~~~~---~~~~~~~~a~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~-l~~~~~~~~~~~~ip~~~ 77 (324) -+++++..++|.|+.- -+....+.+ +++.....+-+.. +++..++.+..+.+.... T Consensus 1 ~~~~~~~~~~~~~~~~~~d~~~~~~l~~--------------------~~~~~~~~~~l~~~~Fp~~~~~~~~~~~~~~~ 60 (349) T protein:vir:10 1 MKNQKLQLDLQRFATPILDMFSQNTVLD--------------------YTRNRQYPEMLGDTLFPAVKVPTLEVDILKAG 60 (349) T ss_pred CCcchhhHHHHHHHHHhhcccCHHHHHH--------------------HHHhcCcchhhHhhcCCccccccceeEEEeec Confidence 4677788888877553 122222221 1111111111111 223233322222222211 Q ss_pred C-Cc-ceeeecccccccccccceeeEEeeeeeEEEeehhHHHHHh---c-ChhHHHHHHH-------HHHHHHHHHHHHH Q lcl|NC_011614. 78 D-KP-GAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLN---Y-TYSQFFEEMK-------PMIAEAFYKKFDE 144 (324) Q Consensus 78 ~-~~-~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~v~iS~ell~---~-s~~~~~~~v~-------~~l~~ai~~~~d~ 144 (324) . .+ .|.+++.+++.+..+-........+-.++-...++.+-+. . ...+....+. ..+.+.+.+.+|. T Consensus 61 ~~~~~~a~~v~~~~~~~~~~r~~~~~~~~~p~ik~~~~i~e~dl~~~~~~~~~~~~~~~~~~i~~d~~~l~~~i~~r~E~ 140 (349) T protein:vir:10 61 SRVPTIASVSAFDAEAEIGTREASKMTAELAYVKRKMQITEEMLIKLQSPRNTAEENYLKQYVFDDIDAMVQAVKARGEK 140 (349) T ss_pred cCcceeeeeecCCCCcceecccceeEEeeccccccccccCHHHHHHHhhccCcchHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1 12 2455555555443222233344444444444445432221 1 1112222222 2333445555553 Q ss_pred ----HHHhccCc-CcCCccccccc---------ccccceeecccchhHHHHHHHHhhhhccCCCEEEEcHHHHHHHHH-- Q lcl|NC_011614. 145 ----AGILNQGN-NPFGKSIAQSI---------EKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRK-- 208 (324) Q Consensus 145 ----a~l~g~g~-~~~~~~~~~~~---------~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~-- 208 (324) ++.+|.-. ...+..+.-.. +......++....+||.++.. 
..+..+..++|++.++..|.+ T Consensus 141 m~~q~l~~Gki~~~~~g~~vD~g~~~~~~~~lt~~~~Ws~~~adpi~Di~~~~~---~~g~~p~~~vm~~~~~~~l~~~~ 217 (349) T protein:vir:10 141 MTMEMFATGKITDKKNGIAIDYGVPKKHQETLSGTKTWDKSDASIIDNLQDWSD---SLDVTPTRALTSKKVLRILMRST 217 (349) T ss_pred HHHHHHhCCeeEEcCCcEEEecccCccceeEecCcccCCCCCCCHHHHHHHHHH---HhCCCccEEEeCHHHHHHHhcCH Confidence 33333100 00000000000 111111122334455555544 446778899999999998852 Q ss_pred -hh---ccCCceeecc-CC----CceecccceEeecC--------------ccCCCceEEEeecc---cEEEEEecceEE Q lcl|NC_011614. 209 -IV---DPETKERIYD-RN----SDSLDGLPVVNLKS--------------SNLKRGELITGDFD---KLIYGIPQLIEY 262 (324) Q Consensus 209 -l~---d~~g~~~~~~-~~----~~~l~G~pv~~~~~--------------~~~~~~~i~~gd~~---~~~~~~~~~~~i 262 (324) ++ +.++...... .. -+.+.|.++++... ...+++.+++.... ...+|.- . T Consensus 218 ~i~~~~~~~~~~~~~~~~~~~~~l~~~~~~~i~~yd~~y~d~~~~~~~t~~~~~p~~~v~l~~~~~~G~~~yG~~----~ 293 (349) T protein:vir:10 218 EIKEAIFGKDTGRVVGQADLDQWMTAQGLPIIRAYDGKYRDEDSRGNLTTNSYFPEDRIVLFNDEVPGQKIYGPT----P 293 (349) T ss_pred HHHHHhcccccccccCHHHHHHHHHhcCCceEEEEeeEEEeecCCCceeecccccCCeEEEecCCCceeEEeecc----c Confidence 22 2222211111 10 11234444544321 12244444444322 1223221 1 Q ss_pred EEeec-ccccccccccc-cchhhh-hcC--cEEEEEEEEeccEEecccceEEEEee Q lcl|NC_011614. 263 KIDET-AQLSTVKNEDG-TPVNLF-EQD--MVALRATMHVALHIADDKAFAKLVPA 313 (324) Q Consensus 263 ~~~~~-~~~~~~~~~~~-~~~~~f-~~~--~v~~r~~~r~d~~v~~~~a~~~l~~~ 313 (324) +.++. .........+. .....+ +.| ...+++..+.=-.+.+|+++.++++. 
T Consensus 294 e~~~~~~g~~~~~~~~~~~~~~~~~~~dP~~~~~~~~s~~lPv~~~~~~~~~a~Vl 349 (349) T protein:vir:10 294 EENRLISSNAQVSNVGNIMAKIYETSEDPIGTWILASATMLPSFASADDVFQAKVL 349 (349) T ss_pred hhhhhcccccceeeccceEEEeeeecCCCceEEEEEeeeeeeeecCCCcEEEEEeC Confidence 11100 00000000000 111111 112 33345555555556778888888877 No 241 >protein:vir:4902 Length: 348 # NCBI annotation: gp348 # Family: family:all:1083 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056680;genbank:gi:9635015;genbank:GeneID:1262657 Probab=40.94 E-value=0.95 Score=20.71 Aligned_cols=284 Identities=11% Similarity=0.059 Sum_probs=114.7 Q ss_pred ccCCCcceechhhhHHHHHHHH-hhcc-hh-hhceeeecCCCceEEEE-EeCCc-ceeeecccccccc-cccceeeEEee Q lcl|NC_011614. 31 MHEKKDGTLLNDFTTPILQEVM-ENSK-IM-QLGKYEPMEGTEKKFTF-WADKP-GAYWVGEGQKIET-SKATWVNATMR 104 (324) Q Consensus 31 ~~~~~g~lip~~~~~~i~~~~~-~~s~-l~-~l~~~~~~~~~~~~ip~-~~~~~-~a~~v~Eg~~~~~-~~~~~~~v~~~ 104 (324) ++.--..+-+.++.. ++..+. .... +. .+++..++......... ..... .|.++..+.+.+. ....++...+. T Consensus 1 M~~l~d~f~~~~l~~-~v~~~~~~~~~~l~~~~Fp~~~~~~~~~~~~~~~~~~~~~a~~v~~~~~~~~~~r~~~~~~~~~ 79 (348) T protein:vir:49 1 MGLIYDKVTASNIAG-YFNALQENVDSTLGESIFPARKQLGTKLSYITGASGQSVALKAAAFDTNVTVRDRVSAEMHDEQ 79 (348) T ss_pred CcchhhhcCHHHHHH-HHHhccccchhhhHhhcCCCccccCceeEEEEeecCceeeeeeecCCCCcceecccceeeeeee Confidence 111111122233322 222221 1222 21 22343333333333222 22223 4567776666543 34556777777 Q ss_pred eeeEEEeehhHHHHH------hcC-hhHHHHHHHHH-------HHHHHHHHHHH----HHHhcc----CcCcCCcccccc Q lcl|NC_011614. 105 AFKLGVILPVTKEFL------NYT-YSQFFEEMKPM-------IAEAFYKKFDE----AGILNQ----GNNPFGKSIAQS 162 (324) Q Consensus 105 ~~k~~~~v~iS~ell------~~s-~~~~~~~v~~~-------l~~ai~~~~d~----a~l~g~----g~~~~~~~~~~~ 162 (324) +-.++-...++.+-+ .++ ..+..+.+.+. +.+++.+.+|. ++.+|. +.+. 
...+.-+ T Consensus 80 ~p~i~~~~~i~~~d~~~l~~~~~~~~~~~~~~~~~~i~~d~~~l~~~i~~r~E~m~~qal~~Gki~i~~~g~-~~~vdyg 158 (348) T protein:vir:49 80 MPFFKEAMLVKENDRQQLNLVKDSGNAALVNTIVAGIFNDNLTLVNGARARLEAMRMQVLATGKIAFTSDGV-NKDIDYG 158 (348) T ss_pred cCccccccccCHHHHHHHHHHhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCeEEEecCCc-eEEEeec Confidence 766666665554321 111 12222223222 33455555553 333331 1110 0000000 Q ss_pred cc-------cccceeecccchhHHHHHHHHhhhhccCCCEEEEcHHHHHHHHH---hhc---c-CCcee-eccC----CC Q lcl|NC_011614. 163 IE-------KTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRK---IVD---P-ETKER-IYDR----NS 223 (324) Q Consensus 163 ~~-------~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~---l~d---~-~g~~~-~~~~----~~ 223 (324) .. ......++...+.||.++...+...+..+..++|++..+.+|.+ +++ . ++... +... .- T Consensus 159 ~~~~~~~t~~~~W~~~~adp~~di~~~~~~~~~~G~~~~~ii~~~~~~~~l~~~~~v~~~~~~~~~~~~~i~~~~~~~~~ 238 (348) T protein:vir:49 159 VKPDHKKQVSKSWAEPGATPLADLEDAIETARELGLNPERAVMNAKTFGLIRKAASTVKVIKPLAGDGSSVTKAELDNYI 238 (348) T ss_pred CCcccceeeeeccCCCCCCHHHHHHHHHHHHHhcCCcccEEEeCHHHHHHHhcCHHHHHHhhccCcccccccHHHHHHHH Confidence 00 01112234455678888877777778888999999999998843 221 1 11111 1110 01 Q ss_pred ceecccceEeecCc----------cCCCceEEEeecc---cEEEEEecc-eEEEEeecccccccccccccchhhhhc-C- Q lcl|NC_011614. 224 DSLDGLPVVNLKSS----------NLKRGELITGDFD---KLIYGIPQL-IEYKIDETAQLSTVKNEDGTPVNLFEQ-D- 287 (324) Q Consensus 224 ~~l~G~pv~~~~~~----------~~~~~~i~~gd~~---~~~~~~~~~-~~i~~~~~~~~~~~~~~~~~~~~~f~~-~- 287 (324) ..+.|+++++.... ..+++.++++... ...+|.--+ ..................+.....|.+ | T Consensus 239 ~~~~g~~i~~y~~~y~d~dG~~~~~~p~~~v~l~~~~~~G~~~yg~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~dP 318 (348) T protein:vir:49 239 ADNFGVTVVLENGTYRNEKGEVSKFFPDGHLTLIPNGPLGNTVFGTTPEESDLFADNTVNADVEIVDNGIAVTTTKTTDP 318 (348) T ss_pred HhhcCceEEEEeeEEEecCCcEeeeecCCeEEEecCCCcceeEEecChhhhhhccccccccceeecCCeEEEeeeecCCC Confidence 23456666543211 1233444444322 122221100 000000000000000001111111221 1 Q ss_pred -cEEEEEEEEeccEEecccceEEEEeeccC Q lcl|NC_011614. 
288 -MVALRATMHVALHIADDKAFAKLVPADAK 316 (324) Q Consensus 288 -~v~~r~~~r~d~~v~~~~a~~~l~~~~~~ 316 (324) ...+.+....=-.+.+|+++.+++..++. T Consensus 319 ~~~~~~~~s~~lPv~~~~~~~~~a~Vl~~~ 348 (348) T protein:vir:49 319 VNVQTKVSMVALPSFERLDDVYMLTVIPAV 348 (348) T ss_pred ceEEEEEeeeccccccCCCcEEEEEEecCC Confidence 23344444444556678899888887777 No 242 >protein:vir:3424 Length: 341 # NCBI annotation: capsid component # Family: family:all:1021 # MgeID: mge:70 # MgeName: lambda # Cross-refs: genbank:acc:NP_040587;genbank:gi:9626251;genbank:GeneID:2703482 Probab=36.25 E-value=1.2 Score=20.18 Aligned_cols=272 Identities=11% Similarity=0.025 Sum_probs=112.9 Q ss_pred CcceechhhhHHHHHHHHhhcchhhhc--eeeecCCCceEEEEEeCC-cceeeecccccccc-cccceeeEEeeeeeEEE Q lcl|NC_011614. 35 KDGTLLNDFTTPILQEVMENSKIMQLG--KYEPMEGTEKKFTFWADK-PGAYWVGEGQKIET-SKATWVNATMRAFKLGV 110 (324) Q Consensus 35 ~g~lip~~~~~~i~~~~~~~s~l~~l~--~~~~~~~~~~~ip~~~~~-~~a~~v~Eg~~~~~-~~~~~~~v~~~~~k~~~ 110 (324) -..+-+.++...+-+.....+.|+++. ...+.+...+.+-...+. .-|.++..+.+... ..-.+....+.+-.+.. T Consensus 1 ~d~f~~~~L~~~i~~~~~~~~~l~d~~fp~~~~~~t~~v~~~~~~~~~~lap~v~~~~~~~~~~~~~~~~~~~~~p~i~~ 80 (341) T protein:vir:34 1 MSMYTTAQLLAANEQKFKFDPLFLRLFFRESYPFTTEKVYLSQIPGLVNMALYVSPIVSGEVIRSRGGSTSEFTPGYVKP 80 (341) T ss_pred CCCcCHHHHHHHHHhccCccchhHHhcCCcccccccceEEEEEeeCCeeEEEeecCCCCcceeccCceeeeEEecCccCc Confidence 233444555543433334445555543 122333333333333332 33555655444332 22245555555555555 Q ss_pred eehhHHHHHh-cC-------hhH----HHHHHH---HHHHHHHHHHHHHHH----Hhcc----CcCcCCccccccccccc Q lcl|NC_011614. 111 ILPVTKEFLN-YT-------YSQ----FFEEMK---PMIAEAFYKKFDEAG----ILNQ----GNNPFGKSIAQSIEKTN 167 (324) Q Consensus 111 ~v~iS~ell~-~s-------~~~----~~~~v~---~~l~~ai~~~~d~a~----l~g~----g~~~~~~~~~~~~~~~~ 167 (324) ...|+.+-+. .. ... +.+.+. ..+.+.+...+|..+ .+|. +.+.....+.-...... 
T Consensus 81 ~~~i~~~d~~~r~~g~~~~~~~~~~~~~~~~i~~~l~~l~~~i~~~~E~m~~qaL~~Gki~~~~~g~~~~~vDfg~~~~~ 160 (341) T protein:vir:34 81 KHEVNPQMTLRRLPDEDPQNLADPAYRRRRIIMQNMRDEELAIAQVEEMQAVSAVLKGKYTMTGEAFDPVEVDMGRSEEN 160 (341) T ss_pred cceeCHHHHHHHhhccccccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcEEEecCCccEEEEEeCCCCcc Confidence 5555532221 00 001 111222 233345555556433 3331 11100000111111111 Q ss_pred c---------eeecccchhHHHHHHHHhhhhccCCCEEEEcHHHHHHHHH---hhc-------cCCcee--ecc-CC--- Q lcl|NC_011614. 168 K---------VIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRK---IVD-------PETKER--IYD-RN--- 222 (324) Q Consensus 168 ~---------~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~---l~d-------~~g~~~--~~~-~~--- 222 (324) . ...+..+++.+.++...+...+..+..++|++..+.+|.. +++ .++... ... .. T Consensus 161 ~~~~t~~~~W~~~~~~~~d~l~di~~~~~~~g~~~~~~i~~~~~~~~l~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~ 240 (341) T protein:vir:34 161 NITQSGGTEWSKRDKSTYDPTDDIEAYALNASGVVNIIVFDPKGWALFRSFKAVKEKLDTRRGSNSELETAVKDLGKAVS 240 (341) T ss_pred ceEecCCccCCcCCCchHHHHHHHHHHHHhcCCceEEEEeCHHHHHHHhcCHHHHHHHhhccccccccccccccccccee Confidence 0 1112234567777777777778888889999999988842 221 111111 011 11 Q ss_pred -CceecccceEeecCc---------cCCCceEEEeecc---cEEEEEecceEEEEeecccccccccccccchhhh--h-- Q lcl|NC_011614. 223 -SDSLDGLPVVNLKSS---------NLKRGELITGDFD---KLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLF--E-- 285 (324) Q Consensus 223 -~~~l~G~pv~~~~~~---------~~~~~~i~~gd~~---~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f--~-- 285 (324) .+++.|++++..+.. ..+++.+++.... ...+|...++.. .... .. ........| + T Consensus 241 ~~~~~~g~~i~~y~~~y~ddG~~~~~ip~~~v~l~p~g~~g~~~yg~~~d~~~--~~~~-~~----~~~~~~~~~~~~~d 313 (341) T protein:vir:34 241 YKGMYGDVAIVVYSGQYVENGVKKNFLPDNTMVLGNTQARGLRTYGCIQDADA--QREG-IN----ASARYPKNWVTTGD 313 (341) T ss_pred eeeecCCceEEEEcCEEEECCcEEeeecCCeEEEeeCCCcceEEEeecccccc--cccc-ee----eeeEeeeeeeecCC Confidence 123456666543221 1344445554432 223333222110 0000 00 000000001 1 Q ss_pred cCcEEEEEEEEeccEEecccceEEEEee Q lcl|NC_011614. 
286 QDMVALRATMHVALHIADDKAFAKLVPA 313 (324) Q Consensus 286 ~~~v~~r~~~r~d~~v~~~~a~~~l~~~ 313 (324) -....+++..+.=-.+.+|+++.+++.+ T Consensus 314 p~~~~~~~~s~pLPv~~~pd~~~~a~V~ 341 (341) T protein:vir:34 314 PAREFTMIQSAPLMLLADPDEFVSVQLA 341 (341) T ss_pred CcEEEEEEcccceeeeeCCCcEEEEEeC Confidence 1234455666655667789999999987 No 243 >protein:vir:107947 Length: 519 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595301;genbank:gi:161622607;genbank:GeneID:5783666 Probab=35.78 E-value=1.2 Score=20.13 Aligned_cols=303 Identities=11% Similarity=0.113 Sum_probs=121.3 Q ss_pred CchhhH---HHHHHHHHhh---------------ccchhhh------hccccccccCCCcceechhhhHH---HHHHHHh Q lcl|NC_011614. 1 MEQTQK---LKLNLQHFAS---------------NNVKPQV------FNPDNVMMHEKKDGTLLNDFTTP---ILQEVME 53 (324) Q Consensus 1 m~~~~~---~~~~~~~~~~---------------~~~~~~~------~~a~~~~~~~~~g~lip~~~~~~---i~~~~~~ 53 (324) +.|+-- +=.+.+++-. ..+.+++ +.+-+...++..+.+ ..+... .+++..+ T Consensus 25 ~~~~~i~~~~~en~~~~~~~~~~~~~~~~~~~~~~~l~e~~~~~~~~~~~t~i~~~~~t~~v--~~~~P~l~~l~rRa~p 102 (519) T protein:vir:10 25 ASKQAIIAKIFENQEQDILTAPEYRDEKISEAFGSFLTEAEIGGDHGYDATNIAAGQTSGAV--TQIGPAVMGMVRRAIP 102 (519) T ss_pred hhhHHHHHHHHHHHHHHhhhcccccchHHHHHHhhhcchhccCCccccCccccccccccccc--cccchhHHHHHHHHHH Confidence 111110 0000111000 1111111 112111122222222 122222 3445556 Q ss_pred hcchhhhceeeecCCCceEEE-----EEeC-----C---------ccee------------------------------- Q lcl|NC_011614. 54 NSKIMQLGKYEPMEGTEKKFT-----FWAD-----K---------PGAY------------------------------- 83 (324) Q Consensus 54 ~s~l~~l~~~~~~~~~~~~ip-----~~~~-----~---------~~a~------------------------------- 83 (324) ..+..+++.+-||.++..-|- +.+. . +.+. 
T Consensus 103 ~LIa~DIwGVQPMTgPTGLIFAMRsrY~n~~~~~~g~ea~~~~nEadt~fSG~~~~~~~~~~~~~~~~~~g~~~~~~~~~ 182 (519) T protein:vir:10 103 HLIAFDICGVQPLNNPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPNAMFSGQGAAETFEALAASKVLEVGKIYSHFFEA 182 (519) T ss_pred hhhhhhhheeecCCchhhhhheeeeeecCCccccccccccccccccccccCccccccccccccccccccccccccccccc Confidence 667777888877765432111 0000 0 0000 Q ss_pred --------------------------------------e--------ecc---------cccccccccceeeEEeeeeeE Q lcl|NC_011614. 84 --------------------------------------W--------VGE---------GQKIETSKATWVNATMRAFKL 108 (324) Q Consensus 84 --------------------------------------~--------v~E---------g~~~~~~~~~~~~v~~~~~k~ 108 (324) - .+| +.+.++-..+++++++.++.. T Consensus 183 s~~~~~~~~~~~t~~ag~t~~~~~~~a~~~~~~~~~~~~~~~gmsTa~aEal~~lggss~~~f~EMaFsIeKvTVtAKSR 262 (519) T protein:vir:10 183 TGSAHFQAVEAVTVDAGATDAAKLDAAVTALVEAGQLAEIAEGMATSIAELQEGFNGSTDNPWNEMGFRIDKQVIEAKSR 262 (519) T ss_pred cccceeccccccccCCCCcCccccccccccccccccccccccccccchhhccccCCCccccchhhhceeEEEEEEeeecc Confidence 0 011 112345556677888888888 Q ss_pred EEeehhHHHHHhcC----hhHHHHHHHHHHHHHHHHHHHHHHHhccCcCcC-C-cccccccccccce--------eeccc Q lcl|NC_011614. 109 GVILPVTKEFLNYT----YSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPF-G-KSIAQSIEKTNKV--------IKGDF 174 (324) Q Consensus 109 ~~~v~iS~ell~~s----~~~~~~~v~~~l~~ai~~~~d~a~l~g~g~~~~-~-~~~~~~~~~~~~~--------~~~~~ 174 (324) +-....|-||.+|- ..|.+++|.+.|...|...+++.+|.--..+.. + .++...-....+. ..+.. T Consensus 263 aLKAEYTiELAQDLKAVHGLDAEtELaNILSTEImlEINReii~~i~~sa~~~~~g~t~~~~~~aGv~d~~~~~d~~~~r 342 (519) T protein:vir:10 263 QLKASYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVIDWINYSAQVGKSGMTNTVGAKAGVFDFQDPIDIRGAR 342 (519) T ss_pred cccccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhhhhcceeecccCcccccceeecccccccccch Confidence 88899999999986 368999999999999999999999952111100 0 0011000000000 00111 Q ss_pred -chhHHHHHHHHh-------hh--hccCCCEEEEcHHHHHHHHHhhc-----cCC-ceee-ccCCC----cee-cccceE Q lcl|NC_011614. 
175 -TQDNIIDLEALL-------ED--DELEANAFISKTQNRSLLRKIVD-----PET-KERI-YDRNS----DSL-DGLPVV 232 (324) Q Consensus 175 -~~~~i~~~~~~l-------~~--~~~~~~~~v~~~~~~~~L~~l~d-----~~g-~~~~-~~~~~----~~l-~G~pv~ 232 (324) ..+-+..|+.++ .. .+.....++|+++....|...-. +.+ +..+ .+.+. |.| .|++|+ T Consensus 343 w~~e~~k~L~~~i~~~an~I~~~T~r~~gn~ii~S~~Va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy 422 (519) T protein:vir:10 343 WAGESFKALLFQIDKEAAEIARQTGRGAGNFIIASRNVVNVLAAVDTSVSYAAQGLGQGFNVDTTKAVFAGVLGGKYRVY 422 (519) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhccccccEEEEchHHHHHHhhccchhccccccccccccccCCCceEEEEecCceEEE Confidence 112223333333 22 22333678999999999875431 000 1111 12222 233 345666 Q ss_pred eecCccCCCceEEEeecccEEEEEecceEEE----EeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceE Q lcl|NC_011614. 233 NLKSSNLKRGELITGDFDKLIYGIPQLIEYK----IDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFA 308 (324) Q Consensus 233 ~~~~~~~~~~~i~~gd~~~~~~~~~~~~~i~----~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~ 308 (324) +.+. .+.. ++.+|+.++..++ +.....+......| + .-|| =.+=...|++. ..+| |+ T Consensus 423 ~D~y--~~~d--------y~~vG~KG~~~~~~glfyaPYv~l~~~~~~d--p-~sfq---P~~g~~tRY~l-~~NP--~~ 483 (519) T protein:vir:10 423 IDQY--ARSD--------YFTIGYKGSNEMDAGIYYAPYVALTPLRGSD--P-KNFQ---PVMGFKTRYGI-GINP--FA 483 (519) T ss_pred ecCC--CCcc--------eEEEEEecCcccccceeeccccccccccccC--C-cccc---ceeeeeeeece-eecC--cc Confidence 5332 2222 2233333222211 11111111111111 0 0122 22333567776 4555 33 Q ss_pred EEEeecc---CCCCccc-----cC Q lcl|NC_011614. 309 KLVPADA---KPSSVPG-----EV 324 (324) Q Consensus 309 ~l~~~~~---~~~~~~~-----~~ 324 (324) .....+. ...+.|. 
++ T Consensus 484 ~~~~~~~~~~i~~g~~~~a~~~~~ 507 (519) T protein:vir:10 484 DPAAQAPTKRIQNGMPDIVNSLGL 507 (519) T ss_pred cccccCccceeccCchhhhccccC Confidence 2111110 0111111 11 No 244 >protein:vir:96442 Length: 418 # NCBI annotation: hypothetical protein # Family: family:all:11266 # MgeID: mge:1616 # MgeName: 119X # Cross-refs: genbank:acc:YP_001218814;genbank:gi:147917331;genbank:GeneID:5142645 Probab=34.31 E-value=1.3 Score=19.96 Aligned_cols=309 Identities=12% Similarity=0.040 Sum_probs=125.3 Q ss_pred CchhhHHHHHHHHHhhccchh----------hhhccc-------------------cccc--cCCCc---ceechhhhHH Q lcl|NC_011614. 1 MEQTQKLKLNLQHFASNNVKP----------QVFNPD-------------------NVMM--HEKKD---GTLLNDFTTP 46 (324) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~----------~~~~a~-------------------~~~~--~~~~g---~lip~~~~~~ 46 (324) =..-|-.+.++|.|++..+.+ ..-.+. .+.+ ...++ ..++.. + T Consensus 9 ~~~~~~~~~~~~~~~~~~~~~~PN~~~p~l~~i~~g~~~~~~~~t~~w~~d~l~~~~~~~ta~~~a~~T~i~V~~~---~ 85 (418) T protein:vir:96 9 NTTLNPQELNMKSFAGTILRRVPNGSAPLLAMTSVVGSTTAKASTHGYFSKTMVFASAVVTAEALADATVLTVENS---D 85 (418) T ss_pred ccCCChhhhchhhhhhhhhhhcCCcccchhhhhcccCccccceeEEEEEeeEeeeeeEEEEEEEecCceEEEecCC---c Confidence 222333455555555432111 100000 0000 00011 111111 1 Q ss_pred HHHHHHhhcch-----hhhceeeecCCCceEEEEEeCCcceeeecccc-------cccccccceeeEEeeeeeEEEeehh Q lcl|NC_011614. 47 ILQEVMENSKI-----MQLGKYEPMEGTEKKFTFWADKPGAYWVGEGQ-------KIETSKATWVNATMRAFKLGVILPV 114 (324) Q Consensus 47 i~~~~~~~s~l-----~~l~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~-------~~~~~~~~~~~v~~~~~k~~~~v~i 114 (324) . +.+...+ ..+.++..+.+....+-+--.+..|+-++.|. 
.+++..-..+....++..+.-+..| T Consensus 86 ~---f~~~~l~~~~~~~EvirVtsVng~~lTV~RG~~~t~aa~iaag~~~~~ig~~~eEGsd~~ta~~~k~~~vsN~tQI 162 (418) T protein:vir:96 86 G---LTKGMIFYNEATGENMRLELVNGLNLTVKRQTGRIAAAIIAANTKLIVIGTAFEEGSQRPTARSIQPVYVPNFTQI 162 (418) T ss_pred c---cccccEEEEecCCeEEEEEEEeCCEEEEEEccCCeeeeeeecCceEEEeecCcccccccCCcceecceeccchhhe Confidence 1 1222221 12344445555555555544443333333222 2233222222223333444444444 Q ss_pred HHHHHhcChh-----------HHHHHHHHHHHHHHHHHHHHHHHhccC----cCcCC-------c-ccccccccccceee Q lcl|NC_011614. 115 TKEFLNYTYS-----------QFFEEMKPMIAEAFYKKFDEAGILNQG----NNPFG-------K-SIAQSIEKTNKVIK 171 (324) Q Consensus 115 S~ell~~s~~-----------~~~~~v~~~l~~ai~~~~d~a~l~g~g----~~~~~-------~-~~~~~~~~~~~~~~ 171 (324) -++.+.-|.. ++.....+.|.+. ...+|.++++|.. .+..+ . ++..-+ ..+...+ T Consensus 163 f~e~vsVSgTAqA~v~qaGvsn~~~~e~d~l~~~-kv~iE~ali~g~~~~~~~ng~p~~~t~R~m~gI~~f~-~~Nvi~a 240 (418) T protein:vir:96 163 FRNAWALTDTARASYAEAGYSNITESRRDCMDFH-ATEQETAIFFGQAFMGTYNGQPLHTTQGIVDAIRQYA-PDNVNAM 240 (418) T ss_pred ehhhhhhhhhhhhhhhhcCcchhHHHHHHHHHHH-HHHHHHhhhccccccCCCCCcccccccchhHHHHhhc-ccccccc Confidence 4444443322 1222223444444 4467888888852 22212 1 111111 1111122 Q ss_pred ---cccchhHHHHHHHHhhhh----ccCCC----EEEEcHHHHHHHHHhhccCCceeeccCCCc-------eecc-cceE Q lcl|NC_011614. 172 ---GDFTQDNIIDLEALLEDD----ELEAN----AFISKTQNRSLLRKIVDPETKERIYDRNSD-------SLDG-LPVV 232 (324) Q Consensus 172 ---~~~~~~~i~~~~~~l~~~----~~~~~----~~v~~~~~~~~L~~l~d~~g~~~~~~~~~~-------~l~G-~pv~ 232 (324) ..++.+.+.++..+.-.. +.... .++++++...+|.++-. 
.-+..-.+...+ +-+| ++++ T Consensus 241 g~~~~~t~d~L~~~~~~a~~~g~n~G~~~~~~~y~~~V~a~~k~~I~k~~~-~I~~~~~en~~G~vv~~~~Td~G~v~ii 319 (418) T protein:vir:96 241 PNPTAVTYDDVVDATIDAFKWSVNVGDNTQRVMFCDTVGMRTMQDIGRFFG-EVTVTQRETSYGMVFTEWKFFKGRLIIK 319 (418) T ss_pred CCCCcCCHHHHHHHHHHHHhhcCCCCCcccceEEEEEeChHHHHHHhhhhc-eeEeccccceeceEEEEEEeeccEEEEE Confidence 246788888876665331 22221 25789998888887642 111111111111 1123 3444 Q ss_pred eec---CccCCCceEEEeecccEEEEEe--cceEEEEeecccc----cccccccccchhhhhcCcEEEEEEEEeccEEec Q lcl|NC_011614. 233 NLK---SSNLKRGELITGDFDKLIYGIP--QLIEYKIDETAQL----STVKNEDGTPVNLFEQDMVALRATMHVALHIAD 303 (324) Q Consensus 233 ~~~---~~~~~~~~i~~gd~~~~~~~~~--~~~~i~~~~~~~~----~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~ 303 (324) .++ +..++++.+++.|.+.+-+.+- +.+..+..-.... ......++..++ -++++ ....+...+++ T Consensus 320 ~n~~~pad~I~~g~mlVvD~~~vkL~yL~~R~~~~E~l~k~G~~~~~~~~~~~~~~~~D-~~~G~----l~~Eltle~~N 394 (418) T protein:vir:96 320 EHPLFSAIGISPGFAVVVDVPAVKLAYMDGRNAKVENYGQGGGENKSGATDYSYGHGVD-AQGGS----LTSEWALELLN 394 (418) T ss_pred ecCCCCccccCcceEEEEecCceEEEEecCCCccchhcccCCCcccccccccccccccc-cccCE----EEEEEEEEeec Confidence 433 2234555677777777655543 4444443322110 000000011111 12333 35678889999 Q ss_pred ccceEEEEee-ccCCCCc---ccc Q lcl|NC_011614. 304 DKAFAKLVPA-DAKPSSV---PGE 323 (324) Q Consensus 304 ~~a~~~l~~~-~~~~~~~---~~~ 323 (324) |.+.+++++. .+++.+- |+- T Consensus 395 ~~a~a~itgl~~~~~~~~~~~~~~ 418 (418) T protein:vir:96 395 PQGCAVITGLQKAKERVYLTAPAP 418 (418) T ss_pred ccccEEeecccccccccccCCCCC Confidence 9999999743 3333332 222 No 245 >protein:vir:98480 Length: 348 # NCBI annotation: ORFp38 # Family: family:all:1083 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958280;genbank:gi:41057254;uniprot:Q38595;genbank:GeneID:2732864 Probab=33.79 E-value=1.3 Score=19.90 Aligned_cols=281 Identities=9% Similarity=-0.009 Sum_probs=113.0 Q ss_pred ccccCCCcceechhhhHHHHHHHHhhcc----h-hhhceeeecCCCceEEEEEe-CCc-ceeeecccccccccc-cceee Q lcl|NC_011614. 
29 VMMHEKKDGTLLNDFTTPILQEVMENSK----I-MQLGKYEPMEGTEKKFTFWA-DKP-GAYWVGEGQKIETSK-ATWVN 100 (324) Q Consensus 29 ~~~~~~~g~lip~~~~~~i~~~~~~~s~----l-~~l~~~~~~~~~~~~ip~~~-~~~-~a~~v~Eg~~~~~~~-~~~~~ 100 (324) ...+-.. -++-+.....+++.+....+ + -.+++..+..+-.+.+-... ..+ .+.+++.+++.+..+ ..++. T Consensus 1 M~~~~~~-d~~~~~~l~~~i~~~~~~~~~~~~l~~~~fp~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~r~g~~~ 79 (348) T protein:vir:98 1 MSWTLDT-EFIEPTQLTGLIREALRDLQVNRFRLARWLPNVDVDDITFEFLRGGGGLAETASYRSWDTESKIGRREGLAK 79 (348) T ss_pred Ccchhhh-hccCHHHHHHHHHHHhhccCcchhhHHhcCCCccccceEEEEEeccCCceeeeeeecCCCccceeeccccee Confidence 1111111 12333333334444432222 2 22344333333222221111 112 356777777666543 45788 Q ss_pred EEeeeeeEEEeehhHHHHHhc---Ch-hHHHHHHH---HHHHHHHHHHHH----HHHHhcc----CcCcCC-ccccc--c Q lcl|NC_011614. 101 ATMRAFKLGVILPVTKEFLNY---TY-SQFFEEMK---PMIAEAFYKKFD----EAGILNQ----GNNPFG-KSIAQ--S 162 (324) Q Consensus 101 v~~~~~k~~~~v~iS~ell~~---s~-~~~~~~v~---~~l~~ai~~~~d----~a~l~g~----g~~~~~-~~~~~--~ 162 (324) .++.+-.++-...++.+-+.. +. ..+...+. ..+.+++.+.+| .++.+|- |.+-.- -+... . T Consensus 80 ~~~~~~~i~~~~~i~~~d~~~~~~~~~~~~~~~i~~d~~~l~~~i~~r~E~m~~qal~~Gki~~~g~~~~vDyg~~~~~~ 159 (348) T protein:vir:98 80 VMGELPPISEKIPLNEYDRLRLRKLSRDEALPFIARDAQRLARNIGARFEVARGSALVNATVPVTELQQTVDFGRIGSHS 159 (348) T ss_pred eeeeccccccccccCHHHHHHhcCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCeEEEecCceEEccccCcccc Confidence 888887777776776642221 11 12222222 234455555555 3444441 111000 00000 0 Q ss_pred -cccccc-eeecccchhHHHHHHHHhhh-hccCCCEEEEcHHHHHHHHH---hhcc----C---CceeeccCCCc---ee Q lcl|NC_011614. 163 -IEKTNK-VIKGDFTQDNIIDLEALLED-DELEANAFISKTQNRSLLRK---IVDP----E---TKERIYDRNSD---SL 226 (324) Q Consensus 163 -~~~~~~-~~~~~~~~~~i~~~~~~l~~-~~~~~~~~v~~~~~~~~L~~---l~d~----~---g~~~~~~~~~~---~l 226 (324) .++... ..++...++||.++...+.. .+..+..++|++..+.+|.+ +++. + ...++...... 
.- T Consensus 160 ~t~~~~Ws~~~~adp~~di~~~~~~~~~~~G~~p~~~vm~~~~~~~l~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 239 (348) T protein:vir:98 160 VVAAVLWSVHATATPISDLESWVATYEDTNGQSPGVILMPKAAVSHMRQCEEVIRQVFPLAPSGTAPMVSVEQLNTVLSS 239 (348) T ss_pred cccccccCCCCCCCHHHHHHHHHHHHHHccCCcceEEEeCHHHHHHHhcCHHHHHHHhccCccccccccCHHHHHHHHHh Confidence 001111 11233456788888777765 47778899999999998852 3321 1 11222111111 12 Q ss_pred cccceEeecC----------ccCCCceEEEeecc------------cEEEEEecceEEEEeecccccccccccccchhhh Q lcl|NC_011614. 227 DGLPVVNLKS----------SNLKRGELITGDFD------------KLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLF 284 (324) Q Consensus 227 ~G~pv~~~~~----------~~~~~~~i~~gd~~------------~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f 284 (324) +|.|.+...+ ...+++.+++..-. ..++|. ..+..+...........+.....| T Consensus 240 ~g~~~i~~~d~~~~~~g~~~~~~p~~~i~l~p~~~~~~~~~~~~~G~t~~G~----~~e~~~~~~~~~~~~~~~i~~~~~ 315 (348) T protein:vir:98 240 MGLPPIEVYDAKVAVDGVSTRITPANAIALLPEPGATDAAQPTELGATLLGT----TAESLEDDYALAPGEQPGIVAATW 315 (348) T ss_pred hCCeEEEEeeeEEEcCCceeceecCCeEEEEecCCcccccccccccceeccc----chhhhccccccceeccCceeeeee Confidence 3554333211 01122233322110 001110 000010000000000111111112 Q ss_pred hc-C--cEEEEEEEEeccEEecccceEEEEeec Q lcl|NC_011614. 285 EQ-D--MVALRATMHVALHIADDKAFAKLVPAD 314 (324) Q Consensus 285 ~~-~--~v~~r~~~r~d~~v~~~~a~~~l~~~~ 314 (324) .+ | ...+++..+.=-.+.+|+++.++++.+ T Consensus 316 ~~~dP~~~~~~~~s~~lPv~~~~~~~~~a~Vl~ 348 (348) T protein:vir:98 316 KTKDPVRLWTHAAAVGIPVLREPNLTFKAQVLA 348 (348) T ss_pred eecCCcEEEEEEeeeeeccccCCCcEEEEEEeC Confidence 11 1 333455555545566788888888776 No 246 >protein:vir:99523 Length: 311 # NCBI annotation: putative protein # Family: family:all:701 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958538;genbank:gi:41179320;genbank:GeneID:2717161 Probab=28.83 E-value=1.7 Score=19.30 Aligned_cols=275 Identities=11% Similarity=0.007 Sum_probs=108.7 Q ss_pred cccCCCccee--chhhhHHHHHHHHhhcchhhhc----eeeecCCCceEEEEEeCCcce-eeecccccccccccceeeEE Q lcl|NC_011614. 
30 MMHEKKDGTL--LNDFTTPILQEVMENSKIMQLG----KYEPMEGTEKKFTFWADKPGA-YWVGEGQKIETSKATWVNAT 102 (324) Q Consensus 30 ~~~~~~g~li--p~~~~~~i~~~~~~~s~l~~l~----~~~~~~~~~~~ip~~~~~~~a-~~v~Eg~~~~~~~~~~~~v~ 102 (324) +.+..+...+ -+.+...+-+.+... .+.+.. ..+-.++.+++||..+...-. +-...|-....-..+++..+ T Consensus 1 ~~~~an~mAlnya~~~~~~Ld~~~~~~-~~t~~l~~~~~~~~~Gak~VkIp~i~~~gl~dY~R~~g~~~g~v~~~~et~t 79 (311) T protein:vir:99 1 MPTDAETRGFNYVTKDGNLLDQKITAG-LFTAALGTPEVDLVNGGRSFTLKTISTSGLKDHTRGKGFNSGTISDEKTIYT 79 (311) T ss_pred CCCcchhhHHHHHHHHHHHHHHHHHhh-hcccceecCchheeecCCEEEEEeeeeccccccccccCccccceeeeeeEEE Confidence 2222221112 233344333333332 222221 112246778999998743322 22333333223344555566 Q ss_pred eeeeeEEEe-ehhHHHHHhcCh--hHHHHHHHHHHHHHHHHHHHHHHHhcc---CcCcCCcccccccccccceeecccc- Q lcl|NC_011614. 103 MRAFKLGVI-LPVTKEFLNYTY--SQFFEEMKPMIAEAFYKKFDEAGILNQ---GNNPFGKSIAQSIEKTNKVIKGDFT- 175 (324) Q Consensus 103 ~~~~k~~~~-v~iS~ell~~s~--~~~~~~v~~~l~~ai~~~~d~a~l~g~---g~~~~~~~~~~~~~~~~~~~~~~~~- 175 (324) +.-.+.-.+ +.--+ ++.+. ..+...+.+...+.+.=.+|...|.-- .+...+.................++ T Consensus 80 l~~DR~~~f~vD~mD--vdETn~~~~~ani~~~f~r~~vvPEiDayrfskla~~a~~~~~~~~~~~~~~~~~~~~~~lt~ 157 (311) T protein:vir:99 80 MGQDRDVEFYLDRQD--VDETDNELAMANISNVFITEHVQPELDSYRFSKIATSFDNLDGTDTEGTLLAKTHKTEETLDE 157 (311) T ss_pred eeeccceeeecchhc--hhhhhhhhHHHHHHHHHHHhhhcchhhHHHHHHHHhhhhcccccccchhhhccccccccccCH Confidence 655553332 33222 12221 234444555555556666776544311 1110000000011111112222333 Q ss_pred ---hhHHHHHHHHhhhhccCCCEEEEcHHHHHHHHHhhccC-----C--ceeeccCCCceecccceEee-cCccC-CCce Q lcl|NC_011614. 176 ---QDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPE-----T--KERIYDRNSDSLDGLPVVNL-KSSNL-KRGE 243 (324) Q Consensus 176 ---~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~-----g--~~~~~~~~~~~l~G~pv~~~-~~~~~-~~~~ 243 (324) ++.|...+.++......+-.++|+|..+..|...+.-. . ...-.....++|.|.|++-. ++.-. .+.. 
T Consensus 158 ~nvl~~l~~~~~~~~~v~~~~rvl~vTp~~~~lLk~~~~~~r~~~~~~~~~~~i~~~V~~lDgv~Ii~V~ps~r~~t~~~ 237 (311) T protein:vir:99 158 TNAYSQLKTGIGKVRKYGTQNLVGYVSSEVMDALERSKEFTRNITNQNVGTTALESRITSIDGVQLIEVYESNRFMTKYD 237 (311) T ss_pred HHHHHHHHHHHHHHHhcCCCCeEEEEChHHHHHHhhchhhheeeecccccccccccccceecCeEEEEecCchhhcchhh Confidence 45555666666554555567889999988876432111 0 11113445678999998765 33211 1111 Q ss_pred EEEe-----ecc-cEEEEEecceEEEEeeccccccc---ccccccchhhhhcCcEEEEEEEEeccEEecccc-eEEEEee Q lcl|NC_011614. 244 LITG-----DFD-KLIYGIPQLIEYKIDETAQLSTV---KNEDGTPVNLFEQDMVALRATMHVALHIADDKA-FAKLVPA 313 (324) Q Consensus 244 i~~g-----d~~-~~~~~~~~~~~i~~~~~~~~~~~---~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a-~~~l~~~ 313 (324) +.-| +-. --++-......+-+..+...+.. .++. -|.-.+.-..+.|.-|.+.+. -+.+..+ T Consensus 238 ft~G~~~~~~ak~INfiiv~~~a~i~~~K~~~v~~f~P~~~~~--------gd~~l~~~R~Y~D~fv~~nk~~~Iyv~~k 309 (311) T protein:vir:99 238 FTDGAKPTEDAKAINFLVVAKPAVISIVKENAVFLFAPGQHTD--------GDGYLYQNRLYHDLFIKKHKRDGIFVSVK 309 (311) T ss_pred hcCCccccCcccccceEEeCCCeeeeeeeeeeeeeeCCCCCCC--------cceeeeeeeeeeeeeeeccccCeEEEeee Confidence 1101 000 00111122222333332222211 1111 112223334556666666332 1123333 Q ss_pred cc Q lcl|NC_011614. 314 DA 315 (324) Q Consensus 314 ~~ 315 (324) .+ T Consensus 310 ~A 311 (311) T protein:vir:99 310 KA 311 (311) T ss_pred cC Confidence 33 No 247 >protein:vir:93696 Length: 364 # NCBI annotation: Bcep22gp55 # Family: family:all:974 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944284;genbank:gi:38640361;genbank:GeneID:2658350 Probab=20.79 E-value=2.7 Score=18.21 Aligned_cols=280 Identities=12% Similarity=0.057 Sum_probs=120.5 Q ss_pred ccccccCCCcceechhhhHHHHHHHHhhcchhh-hcee--------ee-c---CCCceEEEEEeCCcceeeeccccc--c Q lcl|NC_011614. 27 DNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQ-LGKY--------EP-M---EGTEKKFTFWADKPGAYWVGEGQK--I 91 (324) Q Consensus 27 ~~~~~~~~~g~lip~~~~~~i~~~~~~~s~l~~-l~~~--------~~-~---~~~~~~ip~~~~~~~a~~v~Eg~~--~ 91 (324) +..+....+.......++..+.....+.+.+.+ +... +. . 
.+.+++++....- ...+|.+++. . T Consensus 1 Ma~T~~~~~~p~a~~~ws~~l~~~~~~~s~f~~~l~G~~~~~~I~~~~dL~k~~Gd~v~f~L~~~L-~g~gv~Gd~~leG 79 (364) T protein:vir:93 1 MSQTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDRITFDLSVHL-RGKPTYGDARVEG 79 (364) T ss_pred CceeccCcCCHHHHHHHHHHHHHHHHhhCccccccccCCCCCcEEEeeecCCCCCceEEeeeeeec-ccCCcccCceeec Confidence 212222222234456677788877777776665 4221 10 0 1222333322211 2334433333 3 Q ss_pred cccccceeeEEeeeeeEEEeehhHHHHH-hcChhHHHHHHHHHHHHHHHHHHHHHHHh-ccCc---C--------cCCcc Q lcl|NC_011614. 92 ETSKATWVNATMRAFKLGVILPVTKEFL-NYTYSQFFEEMKPMIAEAFYKKFDEAGIL-NQGN---N--------PFGKS 158 (324) Q Consensus 92 ~~~~~~~~~v~~~~~k~~~~v~iS~ell-~~s~~~~~~~v~~~l~~ai~~~~d~a~l~-g~g~---~--------~~~~~ 158 (324) .+...+|.+-++..-.+..-+.....+- +.+..+|...-.+.|..-+.+..|..+|. -.|. + ..+.+ T Consensus 80 nee~L~~~~~~i~idq~r~~V~~~g~ms~qRt~~dlr~~ar~~L~~w~~~~~d~~~f~~laGarg~~~~~~~~~~~~~~~ 159 (364) T protein:vir:93 80 KEESLRFYQDEVRIDQVRHSVSAGGRMSRKRTVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFIETPDFTGYA 159 (364) T ss_pred cccceeEEeeEEEEeeccccccccCchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccCccccc Confidence 4556666666665555544444322222 34678999999999999999999986663 2221 0 00000 Q ss_pred c-----------ccc---cccccceeecccchhHHHHHHHHhhhhccC----------------CCEEEEcHHHHHHHHH Q lcl|NC_011614. 159 I-----------AQS---IEKTNKVIKGDFTQDNIIDLEALLEDDELE----------------ANAFISKTQNRSLLRK 208 (324) Q Consensus 159 ~-----------~~~---~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~----------------~~~~v~~~~~~~~L~~ 208 (324) . +-. ....+...++.++.+.|.++...+...+.. .=+++|||..+..|+. 
T Consensus 160 ~N~v~aPt~~r~~~~~~at~~~~l~stD~~sl~~id~a~~~a~~~~~~~~~~~~~~Pv~~~g~~~yV~~l~p~q~~~Lr~ 239 (364) T protein:vir:93 160 GNPLDAPDVDHLLYGGVATSKASLAATDIMAPLVIEKAVEKAAMMQAENPDVANMVPVSIDGDDHYVCVMSEYQATDMRT 239 (364) T ss_pred ccccCCCCCCcEEeccccCchhhccccccccHHHHHHHHHHHHHhCCCCCCCcccceeEecCcceeEEEEcchhhhhhhh Confidence 0 000 111112234556777777776665443211 1157899999988874 Q ss_pred hhcc--------------CCceeeccCCCceecccceEeecCc-------cCCC---c-eEEEeeccc-EEEEEecceEE Q lcl|NC_011614. 209 IVDP--------------ETKERIYDRNSDSLDGLPVVNLKSS-------NLKR---G-ELITGDFDK-LIYGIPQLIEY 262 (324) Q Consensus 209 l~d~--------------~g~~~~~~~~~~~l~G~pv~~~~~~-------~~~~---~-~i~~gd~~~-~~~~~~~~~~i 262 (324) -.++ ..+|+|. +.-+.+.|++++-.+.. .... . -+++|--.. +.+|-.++.+. T Consensus 240 ~t~~~w~d~qk~A~~~~g~~nPlF~-G~~gm~ngvii~~~~~vi~~~~~~~~~~v~~~ralllGaQA~~~a~g~~~g~~~ 318 (364) T protein:vir:93 240 AAGGTWIDFQKAAAAAEGRNNPIFK-GGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIAYGTANGLRF 318 (364) T ss_pred cCCHHHHHHHHHhhhcccccCCcee-cCeeeEcCeEEeccCCcccccccccCccccchhhheecceeeEEEeecCCCCCc Confidence 3321 1245555 44567788776532211 1110 0 123332221 12233334433 Q ss_pred EEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeeccCCCCc Q lcl|NC_011614. 263 KIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADAKPSSV 320 (324) Q Consensus 263 ~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~ 320 (324) ...++.. . +.+..+..+ .+-.+.-+.|..- -|+.++--+..+++- + T Consensus 319 ~w~Ee~~-D-~gn~~~i~~-~~i~G~kK~rF~~-~DfGvi~idtaa~~~--------~ 364 (364) T protein:vir:93 319 DWEETVK-D-YGNEPAIAA-GFIAGMKKARFNN-KDFGVISIDTAAKKH--------S 364 (364) T ss_pred eeeeccc-C-CCCchhhhh-hhHhhhhhcccCC-ccceEEEeccccccc--------C Confidence 2222211 0 001100000 0112222222211 122222211111111 1 Done!