Query         lcl|NC_011085.2_cdsid_YP_002048657.1 [gene=MmP1_gp36] [protein=major capsid protein] [protein_id=YP_002048657.1] [location=20830..21861]
Match_columns 343
No_of_seqs    147 out of 165
Neff          7.8
Searched_HMMs 1612
Date          Thu Nov 7 12:51:55 2013
Command       /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_36 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_36_vs_rec_db.hhr

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM Template HMM
  1 protein:vir:8885 Length: 347 # 100.0  2E-104  1E-107  588.8  28.7  343     1-343     1-346 (347)
  2 protein:vir:94576 Length: 347  100.0  2E-104  1E-107  589.4  27.9  343     1-343     1-347 (347)
  3 protein:vir:3364 Length: 347 # 100.0  6E-104  4E-107  586.3  28.9  343     1-343     1-345 (347)
  4 protein:vir:10450 Length: 344  100.0  2E-103  1E-106  583.2  28.6  342     1-343     1-344 (344)
  5 protein:vir:1541 Length: 347 # 100.0  3E-102  2E-105  577.3  28.5  343     1-343     1-345 (347)
  6 protein:vir:94711 Length: 347  100.0  4E-102  2E-105  576.5  27.9  342     1-343     1-346 (347)
  7 protein:vir:2201 Length: 345 # 100.0  9E-102  5E-105  574.7  28.4  341     1-343     1-345 (345)
  8 protein:vir:100057 Length: 375 100.0 4.4E-94 2.8E-97  532.4  25.9  341     1-343     1-370 (375)
  9 protein:vir:80213 Length: 334  100.0 1.8E-92 1.1E-95  523.6  28.2  326     1-343     1-332 (334)
 10 protein:vir:103323 Length: 364 100.0 7.1E-92 4.4E-95  520.3  27.8  332     4-343     1-339 (364)
 11 protein:vir:6324 Length: 335 # 100.0 2.6E-91 1.6E-94  517.2  26.4  319     4-343     1-327 (335)
 12 protein:vir:78739 Length: 332  100.0 6.8E-91 4.2E-94  514.9  25.4  321     1-341     4-332 (332)
 13 protein:vir:78935 Length: 335  100.0 4.3E-90 2.7E-93  510.5  27.0  322     4-343     1-328 (335)
 14 protein:vir:97031 Length: 402  100.0 4.4E-90 2.7E-93  510.5  26.3  326     4-343     1-333 (402)
 15 protein:vir:7019 Length: 401 # 100.0 1.5E-86 9.1E-90  491.2  25.9  325     4-343     1-333 (401)
 16 protein:vir:105645 Length: 400 100.0 4.2E-86 2.6E-89  488.6  27.6  324     4-343     1-333 (400)
 17 protein:vir:99675 Length: 324  100.0   1E-83 6.4E-87  475.6  23.5  292    50-343     1-296
(324) 18 protein:vir:94622 Length: 341 100.0 2.3E-71 1.4E-74 407.9 25.4 319 1-343 3-339 (341) 19 protein:vir:80180 Length: 381 100.0 1.2E-65 7.3E-69 376.6 23.7 331 1-343 1-381 (381) 20 protein:vir:3136 Length: 322 # 100.0 2.1E-59 1.3E-62 342.3 15.0 300 4-343 1-318 (322) 21 protein:vir:105822 Length: 273 100.0 9.9E-58 6.1E-61 333.1 22.1 266 1-343 1-273 (273) 22 protein:vir:102605 Length: 273 100.0 9.9E-58 6.1E-61 333.1 22.1 266 1-343 1-273 (273) 23 protein:vir:7990 Length: 273 # 100.0 2.6E-56 1.6E-59 325.3 21.1 266 1-343 1-273 (273) 24 protein:vir:102655 Length: 322 100.0 4E-55 2.5E-58 318.8 24.3 310 1-343 1-321 (322) 25 protein:vir:1781 Length: 221 # 100.0 1.3E-48 8E-52 283.1 14.9 217 95-335 1-221 (221) 26 protein:vir:80930 Length: 278 100.0 5.3E-46 3.3E-49 268.8 19.4 270 1-343 1-277 (278) 27 protein:vir:94800 Length: 319 100.0 6.8E-44 4.2E-47 257.2 21.2 284 1-343 5-294 (319) 28 protein:vir:97331 Length: 319 100.0 6.8E-44 4.2E-47 257.2 21.2 284 1-343 5-294 (319) 29 protein:vir:107120 Length: 329 100.0 8.8E-44 5.5E-47 256.6 21.4 284 1-343 16-306 (329) 30 protein:vir:108303 Length: 418 100.0 3E-43 1.9E-46 253.7 22.5 295 1-343 1-417 (418) 31 protein:vir:96123 Length: 274 100.0 2.6E-43 1.6E-46 254.0 19.9 263 1-343 1-270 (274) 32 protein:vir:95898 Length: 274 100.0 1.7E-42 1.1E-45 249.5 18.7 263 1-343 1-270 (274) 33 protein:vir:96262 Length: 274 100.0 1.7E-42 1.1E-45 249.5 18.7 263 1-343 1-270 (274) 34 protein:vir:93742 Length: 274 100.0 3.2E-42 2E-45 248.1 19.0 264 1-343 1-270 (274) 35 protein:vir:1239 Length: 274 # 100.0 5.6E-42 3.5E-45 246.8 19.6 264 1-343 1-270 (274) 36 protein:vir:97433 Length: 274 100.0 9.1E-42 5.6E-45 245.6 19.3 264 1-343 1-270 (274) 37 protein:vir:94494 Length: 274 100.0 9.1E-42 5.6E-45 245.6 19.3 264 1-343 1-270 (274) 38 protein:vir:96833 Length: 275 100.0 8.4E-42 5.2E-45 245.8 19.0 265 1-343 1-271 (275) 39 protein:vir:99075 Length: 392 100.0 2.7E-41 1.7E-44 243.0 21.3 284 1-343 1-323 (392) 40 protein:vir:3525 Length: 423 # 100.0 8.5E-41 
5.2E-44 240.3 23.3 296 1-342 1-423 (423) 41 protein:vir:174 Length: 423 # 100.0 2E-40 1.2E-43 238.2 23.2 298 1-342 1-423 (423) 42 protein:vir:105374 Length: 423 100.0 4.1E-40 2.5E-43 236.5 23.2 298 1-342 1-423 (423) 43 protein:vir:3613 Length: 272 # 100.0 2.4E-40 1.5E-43 237.8 19.4 267 1-343 1-272 (272) 44 protein:vir:105522 Length: 423 100.0 5.2E-38 3.2E-41 225.0 22.0 298 1-342 1-423 (423) 45 protein:vir:105334 Length: 276 100.0 4.6E-38 2.8E-41 225.3 19.0 264 1-343 1-270 (276) 46 protein:vir:3033 Length: 272 # 100.0 3.4E-36 2.1E-39 215.1 19.7 263 1-343 1-269 (272) 47 protein:vir:9820 Length: 272 # 100.0 3.4E-36 2.1E-39 215.1 19.7 263 1-343 1-269 (272) 48 protein:vir:79008 Length: 299 100.0 2E-35 1.2E-38 210.9 21.8 284 1-343 1-298 (299) 49 protein:vir:78920 Length: 290 100.0 2.2E-32 1.4E-35 194.1 20.6 277 1-343 1-290 (290) 50 protein:vir:102335 Length: 312 99.9 1.2E-29 7.4E-33 179.2 20.6 298 1-343 1-307 (312) 51 protein:vir:739 Length: 231 # 99.9 3.4E-30 2.1E-33 182.1 15.9 230 51-343 1-231 (231) 52 protein:vir:95107 Length: 270 99.9 2.4E-29 1.5E-32 177.5 18.3 261 1-343 1-266 (270) 53 protein:vir:105464 Length: 346 99.9 6E-29 3.7E-32 175.3 20.1 284 1-343 1-301 (346) 54 protein:vir:99523 Length: 311 99.9 4.3E-24 2.7E-27 148.7 20.4 297 4-343 1-310 (311) 55 protein:vir:79712 Length: 285 99.9 1.2E-24 7.7E-28 151.6 17.3 266 24-343 1-284 (285) 56 protein:vir:95451 Length: 313 99.8 3.7E-24 2.3E-27 149.0 11.9 299 17-343 1-311 (313) 57 protein:vir:78090 Length: 302 99.8 1.7E-22 1E-25 140.0 19.5 284 1-343 1-299 (302) 58 protein:vir:9265 Length: 430 # 99.8 1.6E-20 9.9E-24 129.1 19.8 299 1-343 1-429 (430) 59 protein:vir:100939 Length: 430 99.8 1.6E-20 9.9E-24 129.1 19.8 299 1-343 1-429 (430) 60 protein:vir:2106 Length: 430 # 99.8 2E-20 1.2E-23 128.6 19.9 299 1-343 1-429 (430) 61 protein:vir:41 Length: 299 # N 99.7 3.1E-17 2E-20 111.1 21.2 281 10-343 1-298 (299) 62 protein:vir:78523 Length: 338 99.6 3.2E-17 2E-20 111.0 19.3 307 1-343 1-336 (338) 63 protein:vir:6242 Length: 390 # 
99.6 3.8E-17 2.4E-20 110.6 16.4 289 1-343 97-389 (390) 64 protein:vir:1328 Length: 392 # 99.6 1.1E-16 6.7E-20 108.1 17.5 292 1-343 97-391 (392) 65 protein:vir:78223 Length: 333 99.6 3.5E-16 2.2E-19 105.3 20.0 306 1-343 1-332 (333) 66 protein:vir:7771 Length: 330 # 99.6 1.5E-15 9.3E-19 101.9 21.0 297 1-343 1-324 (330) 67 protein:vir:94142 Length: 304 99.5 1.8E-15 1.1E-18 101.4 20.7 285 1-342 1-304 (304) 68 protein:vir:105905 Length: 304 99.5 1.8E-15 1.1E-18 101.4 20.7 285 1-342 1-304 (304) 69 protein:vir:96223 Length: 324 99.5 5.7E-15 3.5E-18 98.7 19.0 284 1-343 15-315 (324) 70 protein:vir:9309 Length: 324 # 99.5 1.1E-14 6.5E-18 97.2 20.4 278 1-343 21-315 (324) 71 protein:vir:97053 Length: 390 99.5 3.7E-15 2.3E-18 99.7 17.6 287 1-341 99-390 (390) 72 protein:vir:4511 Length: 409 # 99.5 2.8E-15 1.7E-18 100.4 16.9 297 1-343 93-406 (409) 73 protein:vir:98339 Length: 415 99.5 9.2E-15 5.7E-18 97.5 19.6 291 1-343 109-404 (415) 74 protein:vir:81100 Length: 415 99.5 9.2E-15 5.7E-18 97.5 19.6 291 1-343 109-404 (415) 75 protein:vir:79987 Length: 415 99.5 9.2E-15 5.7E-18 97.5 19.6 291 1-343 109-404 (415) 76 protein:vir:78830 Length: 324 99.5 1.1E-14 6.8E-18 97.1 19.5 284 1-343 15-315 (324) 77 protein:vir:96392 Length: 324 99.5 1.1E-14 6.8E-18 97.1 19.5 284 1-343 15-315 (324) 78 protein:vir:9574 Length: 300 # 99.5 2.4E-14 1.5E-17 95.3 21.2 283 1-343 1-300 (300) 79 protein:vir:4700 Length: 415 # 99.5 3.8E-15 2.3E-18 99.7 16.5 292 1-343 110-404 (415) 80 protein:vir:4600 Length: 415 # 99.5 3.8E-15 2.3E-18 99.7 16.5 292 1-343 110-404 (415) 81 protein:vir:1886 Length: 385 # 99.5 5.8E-15 3.6E-18 98.6 17.1 286 1-343 93-384 (385) 82 protein:vir:191 Length: 385 # 99.5 5.8E-15 3.6E-18 98.6 17.1 286 1-343 93-384 (385) 83 protein:vir:103955 Length: 324 99.5 2.8E-14 1.7E-17 94.9 20.0 284 1-343 15-315 (324) 84 protein:vir:8187 Length: 311 # 99.4 3.2E-14 2E-17 94.6 20.2 294 1-343 1-310 (311) 85 protein:vir:9410 Length: 415 # 99.4 9.7E-15 6E-18 97.4 17.1 292 1-343 109-404 (415) 86 
protein:vir:97148 Length: 324 99.4 2.8E-14 1.8E-17 94.9 19.7 286 1-343 1-315 (324) 87 protein:vir:94771 Length: 298 99.4 3.6E-14 2.2E-17 94.3 20.1 282 1-342 1-298 (298) 88 protein:vir:4339 Length: 395 # 99.4 3.3E-14 2.1E-17 94.5 19.3 291 1-343 98-395 (395) 89 protein:vir:99749 Length: 324 99.4 6.5E-14 4E-17 92.9 20.1 281 1-343 18-315 (324) 90 protein:vir:9759 Length: 303 # 99.4 4.5E-14 2.8E-17 93.7 19.0 285 1-343 1-303 (303) 91 protein:vir:8102 Length: 543 # 99.4 4.5E-14 2.8E-17 93.7 18.6 297 1-343 237-542 (543) 92 protein:vir:3870 Length: 400 # 99.4 1.4E-14 8.6E-18 96.6 15.7 277 1-343 120-399 (400) 93 protein:vir:104085 Length: 320 99.4 1.1E-13 6.8E-17 91.6 20.6 295 1-343 1-318 (320) 94 protein:vir:104256 Length: 458 99.4 4.6E-14 2.8E-17 93.7 18.4 294 1-343 155-458 (458) 95 protein:vir:1638 Length: 298 # 99.4 1.2E-13 7.2E-17 91.5 20.2 281 1-342 1-298 (298) 96 protein:vir:95763 Length: 297 99.4 8.1E-14 5E-17 92.4 19.3 278 1-343 1-296 (297) 97 protein:vir:80684 Length: 315 99.4 6.5E-14 4E-17 92.9 18.5 285 1-343 1-306 (315) 98 protein:vir:99920 Length: 311 99.4 1.4E-13 8.5E-17 91.1 19.8 296 1-342 1-311 (311) 99 protein:vir:10364 Length: 390 99.4 8.6E-14 5.3E-17 92.2 18.6 279 1-341 107-390 (390) 100 protein:vir:2344 Length: 397 # 99.4 9.5E-14 5.9E-17 92.0 18.5 284 1-343 1-306 (397) 101 protein:vir:4830 Length: 397 # 99.4 1.1E-13 7.1E-17 91.5 18.9 283 1-343 94-385 (397) 102 protein:vir:81070 Length: 390 99.4 8.2E-14 5.1E-17 92.3 17.9 287 1-341 95-390 (390) 103 protein:vir:94673 Length: 419 99.4 7.3E-14 4.5E-17 92.6 17.6 294 1-343 110-417 (419) 104 protein:vir:485 Length: 407 # 99.4 1.6E-13 9.9E-17 90.7 19.2 296 1-343 87-400 (407) 105 protein:vir:100135 Length: 418 99.4 1.1E-13 6.9E-17 91.6 18.3 287 1-343 121-415 (418) 106 protein:vir:4856 Length: 293 # 99.4 2.2E-13 1.3E-16 90.0 19.9 273 1-343 1-281 (293) 107 protein:vir:101607 Length: 379 99.4 1.2E-13 7.7E-17 91.3 18.2 271 1-343 100-379 (379) 108 protein:vir:100247 Length: 425 99.3 1.9E-13 1.2E-16 90.3 18.3 294 1-343 
117-424 (425) 109 protein:vir:4997 Length: 397 # 99.3 2.5E-13 1.5E-16 89.7 18.9 283 1-343 95-385 (397) 110 protein:vir:4456 Length: 401 # 99.3 3.7E-13 2.3E-16 88.7 19.9 300 1-343 88-401 (401) 111 protein:vir:1433 Length: 435 # 99.3 1.3E-12 7.8E-16 85.8 20.6 295 1-343 105-433 (435) 112 protein:vir:102119 Length: 404 99.3 6.2E-13 3.9E-16 87.5 18.5 296 1-343 92-400 (404) 113 protein:vir:102944 Length: 330 99.3 2.5E-13 1.6E-16 89.6 16.3 280 1-343 1-296 (330) 114 protein:vir:4953 Length: 397 # 99.3 6.7E-13 4.2E-16 87.3 18.5 281 1-343 95-385 (397) 115 protein:vir:80376 Length: 435 99.3 2.5E-12 1.5E-15 84.2 21.5 296 1-343 105-433 (435) 116 protein:vir:2430 Length: 318 # 99.3 1.5E-12 9.6E-16 85.3 19.9 289 1-343 1-315 (318) 117 protein:vir:3991 Length: 404 # 99.3 1.7E-12 1.1E-15 85.0 20.1 284 1-343 98-393 (404) 118 protein:vir:1268 Length: 397 # 99.3 2E-12 1.3E-15 84.7 20.3 281 1-343 98-397 (397) 119 protein:vir:81160 Length: 371 99.3 9.7E-13 6E-16 86.4 18.2 281 1-343 84-371 (371) 120 protein:vir:5739 Length: 366 # 99.3 2.9E-12 1.8E-15 83.8 20.8 295 1-343 52-366 (366) 121 protein:vir:105038 Length: 428 99.3 4.1E-12 2.5E-15 83.0 21.5 296 1-343 113-428 (428) 122 protein:vir:5974 Length: 324 # 99.3 1E-12 6.5E-16 86.3 17.8 275 1-343 1-290 (324) 123 protein:vir:2504 Length: 305 # 99.2 4.4E-12 2.7E-15 82.8 20.3 281 1-343 1-298 (305) 124 protein:vir:95376 Length: 425 99.2 6E-13 3.7E-16 87.6 15.5 291 1-343 119-424 (425) 125 protein:vir:1383 Length: 421 # 99.2 1.1E-12 6.6E-16 86.2 16.6 276 1-343 101-384 (421) 126 protein:vir:7409 Length: 408 # 99.2 3E-12 1.9E-15 83.7 18.7 284 1-343 97-393 (408) 127 protein:vir:6212 Length: 434 # 99.2 2E-12 1.2E-15 84.8 17.0 292 1-343 127-433 (434) 128 protein:vir:1583 Length: 351 # 99.2 1.4E-12 8.9E-16 85.5 15.7 281 1-343 1-299 (351) 129 protein:vir:96762 Length: 632 99.2 4.9E-12 3.1E-15 82.6 18.3 285 1-342 334-632 (632) 130 protein:vir:1025 Length: 408 # 99.2 6E-12 3.7E-15 82.1 18.7 284 1-343 97-393 (408) 131 protein:vir:4226 Length: 326 # 99.2 
1E-11 6.5E-15 80.8 19.8 295 1-343 1-323 (326) 132 protein:vir:100172 Length: 394 99.2 7.5E-12 4.6E-15 81.6 19.0 278 1-343 100-384 (394) 133 protein:vir:81227 Length: 413 99.2 4.9E-12 3E-15 82.6 17.7 290 1-343 105-410 (413) 134 protein:vir:962 Length: 397 # 99.2 2.4E-12 1.5E-15 84.3 14.8 275 1-343 121-397 (397) 135 protein:vir:3845 Length: 395 # 99.1 2.3E-11 1.4E-14 78.9 19.0 274 1-343 102-383 (395) 136 protein:vir:1084 Length: 437 # 99.1 1.1E-11 7.1E-15 80.6 17.3 279 1-343 141-427 (437) 137 protein:vir:7855 Length: 497 # 99.1 1.7E-11 1.1E-14 79.6 17.6 299 1-343 138-493 (497) 138 protein:vir:101650 Length: 497 99.1 1.7E-11 1.1E-14 79.6 17.6 299 1-343 138-493 (497) 139 protein:vir:9704 Length: 394 # 99.1 1.7E-11 1.1E-14 79.6 17.5 273 1-343 115-390 (394) 140 protein:vir:93616 Length: 645 99.1 5.7E-11 3.5E-14 76.8 19.7 290 1-343 315-640 (645) 141 protein:vir:102873 Length: 392 99.1 4.9E-11 3.1E-14 77.1 19.4 284 1-343 84-392 (392) 142 protein:vir:105004 Length: 392 99.1 4.9E-11 3.1E-14 77.1 19.4 284 1-343 84-392 (392) 143 protein:vir:107593 Length: 392 99.1 4.9E-11 3.1E-14 77.1 19.4 284 1-343 84-392 (392) 144 protein:vir:102082 Length: 392 99.1 4.9E-11 3.1E-14 77.1 19.4 284 1-343 84-392 (392) 145 protein:vir:100884 Length: 389 99.1 4.3E-11 2.7E-14 77.4 18.5 280 1-343 95-384 (389) 146 protein:vir:4092 Length: 390 # 99.1 8.3E-11 5.1E-14 75.9 19.5 293 1-343 69-368 (390) 147 protein:vir:8420 Length: 477 # 99.0 1.1E-10 7E-14 75.1 19.6 296 1-343 148-471 (477) 148 protein:vir:105610 Length: 430 99.0 2.2E-10 1.3E-13 73.6 20.2 326 5-343 1-422 (430) 149 protein:vir:78640 Length: 352 99.0 2E-11 1.2E-14 79.3 13.3 269 1-343 64-346 (352) 150 protein:vir:2770 Length: 318 # 99.0 4.5E-11 2.8E-14 77.3 14.9 261 1-287 1-318 (318) 151 protein:vir:9875 Length: 296 # 99.0 2.9E-10 1.8E-13 72.9 18.9 280 1-343 1-295 (296) 152 protein:vir:9361 Length: 402 # 99.0 2E-11 1.2E-14 79.2 12.0 271 1-343 115-396 (402) 153 protein:vir:93881 Length: 387 98.9 9.2E-11 5.7E-14 75.6 15.3 272 1-343 100-381 (387) 
154 protein:vir:9927 Length: 295 # 98.9 3.4E-10 2.1E-13 72.5 16.9 271 1-343 1-288 (295) 155 protein:vir:93696 Length: 364 98.9 1E-09 6.3E-13 69.9 19.5 303 1-343 1-359 (364) 156 protein:vir:94424 Length: 387 98.9 3.2E-11 2E-14 78.1 10.4 271 1-343 99-381 (387) 157 protein:vir:96978 Length: 387 98.9 3.2E-11 2E-14 78.1 10.4 271 1-343 99-381 (387) 158 protein:vir:2685 Length: 387 # 98.9 3.2E-11 2E-14 78.1 10.4 271 1-343 99-381 (387) 159 protein:vir:9643 Length: 377 # 98.8 8.2E-10 5.1E-13 70.4 16.8 288 1-343 59-377 (377) 160 protein:vir:3298 Length: 404 # 98.8 2.2E-09 1.3E-12 68.1 19.1 335 1-343 1-401 (404) 161 protein:vir:104439 Length: 404 98.8 2.2E-09 1.3E-12 68.1 19.1 335 1-343 1-401 (404) 162 protein:vir:10123 Length: 404 98.8 2.2E-09 1.3E-12 68.1 19.1 335 1-343 1-401 (404) 163 protein:vir:819 Length: 404 # 98.8 2.2E-09 1.3E-12 68.1 19.1 335 1-343 1-401 (404) 164 protein:vir:108211 Length: 318 98.8 2.1E-09 1.3E-12 68.2 17.9 293 1-341 1-318 (318) 165 protein:vir:100632 Length: 381 98.7 3.5E-09 2.1E-12 67.0 17.6 288 1-343 56-368 (381) 166 protein:vir:4197 Length: 314 # 98.7 3.8E-09 2.3E-12 66.8 17.6 295 1-343 1-314 (314) 167 protein:vir:9509 Length: 381 # 98.7 2.4E-09 1.5E-12 67.9 16.5 288 1-343 57-370 (381) 168 protein:vir:101291 Length: 381 98.7 2.4E-09 1.5E-12 67.9 16.5 288 1-343 57-370 (381) 169 protein:vir:106647 Length: 303 98.7 2.3E-09 1.5E-12 67.9 16.2 277 1-343 1-296 (303) 170 protein:vir:98635 Length: 377 98.7 3.1E-09 1.9E-12 67.2 16.2 284 1-343 59-377 (377) 171 protein:vir:4159 Length: 315 # 98.7 5.3E-09 3.3E-12 65.9 17.2 298 1-342 7-315 (315) 172 protein:vir:78350 Length: 383 98.7 6.5E-09 4E-12 65.5 16.8 285 1-343 64-375 (383) 173 protein:vir:95963 Length: 395 98.6 5.3E-09 3.3E-12 66.0 15.3 291 1-343 66-376 (395) 174 protein:vir:3158 Length: 321 # 98.6 4.4E-09 2.7E-12 66.4 14.6 294 1-343 1-312 (321) 175 protein:vir:95875 Length: 401 98.6 3.4E-08 2.1E-11 61.5 19.4 321 1-343 1-400 (401) 176 protein:vir:80128 Length: 466 98.5 4.8E-09 3E-12 66.2 13.0 292 1-343 
123-448 (466) 177 protein:vir:80446 Length: 367 98.2 5.7E-07 3.5E-10 54.8 15.9 294 1-343 1-330 (367) 178 protein:vir:79928 Length: 393 98.0 3E-07 1.8E-10 56.4 12.2 299 1-343 59-381 (393) 179 protein:vir:78387 Length: 349 97.5 2.6E-05 1.6E-08 45.8 15.7 286 1-343 1-348 (349) 180 protein:vir:107687 Length: 319 97.4 5E-05 3.1E-08 44.2 15.6 296 1-341 1-319 (319) 181 protein:vir:94989 Length: 349 97.3 0.00011 7E-08 42.2 16.2 288 1-343 1-348 (349) 182 protein:vir:80068 Length: 301 97.2 0.00014 8.9E-08 41.7 17.9 284 16-341 1-301 (301) 183 protein:vir:3969 Length: 287 # 96.9 0.0002 1.2E-07 40.9 14.6 263 10-343 1-286 (287) 184 protein:vir:98871 Length: 314 96.7 0.00029 1.8E-07 40.0 14.1 282 1-343 11-311 (314) 185 protein:vir:103285 Length: 296 96.7 0.00043 2.7E-07 39.0 16.8 275 16-341 1-296 (296) 186 protein:vir:97397 Length: 517 96.1 0.0011 6.5E-07 36.9 15.4 280 1-343 226-517 (517) 187 protein:vir:79548 Length: 652 96.0 0.0011 6.8E-07 36.8 14.9 293 1-340 336-652 (652) 188 protein:vir:104342 Length: 314 95.8 0.0013 8.2E-07 36.4 13.0 290 1-341 1-314 (314) 189 protein:vir:5942 Length: 523 # 95.3 0.0023 1.5E-06 35.0 14.8 308 1-343 162-523 (523) 190 protein:vir:94528 Length: 286 95.1 0.0028 1.7E-06 34.6 19.0 268 1-343 1-286 (286) 191 protein:vir:4074 Length: 480 # 95.0 0.003 1.9E-06 34.4 13.7 274 1-343 171-477 (480) 192 protein:vir:103181 Length: 457 94.2 0.0052 3.2E-06 33.1 13.8 305 1-343 97-439 (457) 193 protein:vir:97255 Length: 310 94.0 0.0058 3.6E-06 32.8 19.3 291 1-343 1-310 (310) 194 protein:vir:79642 Length: 329 93.9 0.0061 3.8E-06 32.7 16.5 298 1-343 14-329 (329) 195 protein:vir:95512 Length: 693 93.1 0.0089 5.5E-06 31.8 13.3 293 1-343 371-692 (693) 196 protein:vir:4786 Length: 295 # 93.0 0.0093 5.7E-06 31.7 15.1 256 4-331 1-295 (295) 197 protein:vir:8324 Length: 410 # 89.3 0.027 1.7E-05 29.1 10.6 275 1-341 85-410 (410) 198 protein:vir:10324 Length: 320 84.9 0.057 3.5E-05 27.4 11.0 292 10-343 1-317 (320) 199 protein:vir:101811 Length: 529 84.1 0.063 3.9E-05 27.1 14.8 
314 1-343 127-508 (529) 200 protein:vir:101039 Length: 529 83.6 0.067 4.2E-05 27.0 15.1 305 1-343 151-508 (529) 201 protein:vir:94070 Length: 339 83.1 0.072 4.4E-05 26.9 12.9 287 1-341 35-339 (339) 202 protein:vir:94933 Length: 330 82.8 0.074 4.6E-05 26.8 16.3 300 1-343 1-330 (330) 203 protein:vir:95131 Length: 325 82.0 0.081 5E-05 26.6 14.7 274 18-342 1-325 (325) 204 protein:vir:78148 Length: 123 80.0 0.024 1.5E-05 29.4 5.3 119 206-343 1-123 (123) 205 protein:vir:79078 Length: 307 79.5 0.1 6.5E-05 26.0 12.2 290 1-343 1-307 (307) 206 protein:vir:5670 Length: 514 # 78.7 0.11 6.9E-05 25.8 14.4 302 1-343 114-493 (514) 207 protein:vir:96079 Length: 382 77.8 0.12 7.5E-05 25.6 16.2 302 1-341 51-382 (382) 208 protein:vir:99424 Length: 360 75.3 0.15 9.2E-05 25.1 14.2 302 1-343 1-356 (360) 209 protein:vir:107882 Length: 307 74.5 0.16 9.7E-05 25.0 13.2 294 1-343 1-307 (307) 210 protein:vir:99576 Length: 388 68.2 0.24 0.00015 24.0 11.8 308 1-341 57-388 (388) 211 protein:vir:107732 Length: 379 64.8 0.29 0.00018 23.5 15.8 300 1-341 56-379 (379) 212 protein:vir:104549 Length: 462 64.8 0.29 0.00018 23.5 17.1 299 1-343 97-448 (462) 213 protein:vir:106286 Length: 534 61.9 0.35 0.00021 23.1 16.9 309 1-343 125-512 (534) 214 protein:vir:103886 Length: 302 61.1 0.36 0.00022 23.0 15.9 273 16-343 1-293 (302) 215 protein:vir:95258 Length: 368 55.0 0.49 0.0003 22.3 16.6 317 1-343 1-366 (368) 216 protein:vir:100603 Length: 529 54.9 0.49 0.00031 22.3 15.1 309 1-343 144-508 (529) 217 protein:vir:6601 Length: 528 # 54.3 0.51 0.00031 22.2 16.6 312 1-343 116-507 (528) 218 protein:vir:99888 Length: 309 53.7 0.52 0.00032 22.1 10.3 279 1-343 1-300 (309) 219 protein:vir:5255 Length: 304 # 39.0 1 0.00064 20.5 13.2 273 21-340 1-304 (304) 220 protein:vir:3643 Length: 336 # 26.4 1.9 0.0012 19.0 11.5 283 1-341 34-336 (336) 221 protein:vir:270 Length: 341 # 25.6 2 0.0013 18.9 9.8 296 1-343 1-332 (341) 222 protein:vir:78558 Length: 336 25.0 2.1 0.0013 18.8 12.0 283 1-341 34-336 (336) 223 protein:vir:103463 
Length: 521 24.9 2.1 0.0013 18.8 16.0 304 1-343 127-499 (521) 224 protein:vir:101557 Length: 336 24.6 2.2 0.0013 18.8 13.1 283 1-341 34-336 (336) 225 protein:vir:7214 Length: 521 # 22.5 2.4 0.0015 18.5 16.2 304 1-343 127-499 (521) 226 protein:vir:106734 Length: 336 21.7 2.6 0.0016 18.3 10.9 283 1-341 34-336 (336) 227 protein:vir:98143 Length: 524 20.5 2.8 0.0017 18.2 15.2 297 1-343 127-503 (524) No 1 >protein:vir:8885 Length: 347 # NCBI annotation: major capsid protein A # Family: family:all:975 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813774;genbank:gi:29366729;genbank:GeneID:1258837 Probab=100.00 E-value=2.2e-104 Score=588.83 Aligned_cols=343 Identities=73% Similarity=1.128 Sum_probs=326.4 Q ss_pred CCCCCccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEeccCcceeeeecCCCcCC Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVLGRTRAAYLQAGQSLD 80 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG~~t~~~~~~g~~i~ 80 (343) |||+++|++++|||||+++++|+++||||+|+|||+++|+++|+|++++++|++++||++|||++|++++.+|+||++++ T Consensus 1 ~a~~~~~~~~~~~~g~~~~~~d~~al~ie~~~geV~~~f~~~s~~~~~~~~r~i~~G~sv~~~~iG~~~~~~~~~g~~l~ 80 (347) T protein:vir:88 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVMGRTKGYYLAPGENLD 80 (347) T ss_pred CCCcccchhhhccCCCCccccchHHHHHHHHHHHHHHHHHHHhhhhhccccccccCcceEEEeeecceeeeeeccccCCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccccCC Q lcl|NC_011085. 
81 DKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENIAGLGS 160 (343) Q Consensus 81 ~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~g~~~ 160 (343) ++.+++++++++|+||+++|++|.|||+|++|+++|+|+++++++|++||+++|++|+++++++++.+.+....++|+++ T Consensus 81 ~~~~~~~~~~~~i~ID~~~y~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLA~~~D~~i~~~l~~~a~~~~~~~~~~~g~~~ 160 (347) T protein:vir:88 81 DKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIAGLGQ 160 (347) T ss_pred CCCCCCccceEEEEEechhhhhhhhhhHHHHhhcCCchHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccCCccc Confidence 87788999999999999999999999999999999999999999999999999999999999999988888889999999 Q ss_pred ceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhccccccchhcceeEEE Q lcl|NC_011085. 161 ASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAALIDPERGSIRNV 240 (343) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~V~~i 240 (343) +..+..+++++.+++...++++++.|++|+++|+|++||++|||+||+|++|++||+++++++.+|.+...+++|.|+++ T Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~~~~~~~~~~~~~G~vg~i 240 (347) T protein:vir:88 161 AVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALIDPETGNIRNV 240 (347) T ss_pred cccccccccccccchhhhHHHHHHHHHHHHHHHhhcCCCCCCCEEEeCHHHHHHHhcchhhhhhhhccccchhcceeeee Confidence 99999998889999999999999999999999999999999999999999999999999999999999899999999999 Q ss_pred eceEEEEeccccccccccccc---cccccccccccccccccccccccceEeEeechhhheeeeeeeeEEeeeeccchhhh Q lcl|NC_011085. 241 MGFEVVEVPHLTAGGAGDDRE---DETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERARRAEYQAD 317 (343) Q Consensus 241 ~Gf~V~~sn~lp~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~~~~~~~d 317 (343) +||+||+|||+|.+..+..+. 
...+...|.+......+|+.++++.++|+||++|+++++++++++|.+|++++|+| T Consensus 241 ~G~~V~~s~nlp~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~d~~~~~~l~~~~~a~g~v~~~d~~~e~~r~~~~~~d 320 (347) T protein:vir:88 241 MGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARRPEFQAD 320 (347) T ss_pred ccceEEEeecccccccccccccccccccccccccccccccccccccCcEEEEEechhhhhheecccceeeeeechhhHHH Confidence 999999999999876655433 23455667777777888999999999999999999999999999999999999999 Q ss_pred hhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 318 QIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 318 ~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) +|+++++||++++||||+|+|+++.= T Consensus 321 ~i~~~~~~G~~~~rPe~a~~~~~~~a 346 (347) T protein:vir:88 321 QIIGKYAMGHGGLRPEAAGALVFTPA 346 (347) T ss_pred HhhhhhhhcCceeccceEEEEEeCCC Confidence 99999999999999999999999988 No 2 >protein:vir:94576 Length: 347 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919012;genbank:gi:119637776;genbank:GeneID:5179336 Probab=100.00 E-value=1.7e-104 Score=589.43 Aligned_cols=343 Identities=81% Similarity=1.201 Sum_probs=321.1 Q ss_pred CCCCCccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEeccCcceeeeecCCCcCC Q lcl|NC_011085. 
1 MADMKGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVLGRTRAAYLQAGQSLD 80 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG~~t~~~~~~g~~i~ 80 (343) |||+++|++++|||||+++++|+++||||+|+|||+++|+++|+|++++++|+|++|||++||++|++++.+|+||++++ T Consensus 1 ma~~~~~~~~~t~~g~~~~~~d~~al~ie~~~geV~~~f~~~s~~~~~~~~rti~~G~sv~~~~iG~~~~~~~~~G~~l~ 80 (347) T protein:vir:94 1 MANMNGGQQMGKDQGKGMSAGDKLALFLKVFGGEVLTAFTRTSVTMNKHLVRSIQSGKSAQFPVLGRTKAAYLQPGENLD 80 (347) T ss_pred CCccccccccccccccCCcccchHHHHHHHHhHHHHHHHHHHHhhhhhhhheeccccceEEeeeccceeEeeeecCcCCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccccCC Q lcl|NC_011085. 81 DKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENIAGLGS 160 (343) Q Consensus 81 ~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~g~~~ 160 (343) ++.+++++++++|+||+++|++|.|||+|++|+++|+|+++++++|++||+++||+|+++++++++++.+....+.|.++ T Consensus 81 ~~~~~~~~~e~~ltID~~~y~~~~VddiD~~q~~~D~rs~~~~~~g~ALA~~~D~~i~~~l~~~a~~~~~~~~~~~g~~~ 160 (347) T protein:vir:94 81 DKRKDMKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTAQLGESLAMAADGAVLAEMAKLCNLPTANNENIAGLGK 160 (347) T ss_pred CCcCCccccceEEEEcchhhhhhhhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccCCc Confidence 87788999999999999999999999999999999999999999999999999999999999999998888888888888 Q ss_pred ceeecccc-cccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhccccccchhcceeEE Q lcl|NC_011085. 161 ASILEVGA-KGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAALIDPERGSIRN 239 (343) Q Consensus 161 ~~~~~~~~-~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~V~~ 239 (343) ++.+..+. 
.+...++++.+.++++.|++|+++|+|++||++|||+||+|++|+.||+...+...++.+...+++|.|++ T Consensus 161 ~~~v~i~~~~~~~~~~~~~~~~~~d~i~~a~~~Lde~dVP~~~R~~vv~P~~y~~LLk~~~~~~~~~~~~~~~~~G~V~~ 240 (347) T protein:vir:94 161 AHVLEVGDQATLQGDQVKLGQAIIAQLTLARAKLTGNYVPSSDRVFYTTPDNYSAILAALMPNAANYQALIDPSTGSIRN 240 (347) T ss_pred ceeEeeeccccccccccccHHHHHHHHHHHHHHhhhcCCCCCCCEEEeChHHHHHHHHhhcccccccccccccccceeEE Confidence 88776654 34455677788999999999999999999999999999999999999998888888888888899999999 Q ss_pred EeceEEEEeccccccccccccccc---cccccccccccccccccccccceEeEeechhhheeeeeeeeEEeeeeccchhh Q lcl|NC_011085. 240 VMGFEVVEVPHLTAGGAGDDREDE---TTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERARRAEYQA 316 (343) Q Consensus 240 i~Gf~V~~sn~lp~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~~~~~~~ 316 (343) ++||+||+|||+|....+.++... ++...|.+....+.+|+++|+++++|+||++|+++++++++++|.+|++++|+ T Consensus 241 v~G~~V~~Sn~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~d~~~~~~l~~~~~A~~tv~~~~~~~e~~~~~~~~~ 320 (347) T protein:vir:94 241 VMGFEVIEVPHLTAGGAGDNRAEEGVAPTNQKHAFPDTASGDTRVALDNVVGLFNHRSAVGTVKLKDMALERARRANFQA 320 (347) T ss_pred eeceEEEEcCccccccCcccccccccccccccccccccccccccccccceEEEEechhhhhhhhhcccceeeeechhhhh Confidence 999999999999987766655443 45667778888888999999999999999999999999999999999999999 Q ss_pred hhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 
317 DQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 317 d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) |+|+++++|||+++||||+|+|+++.- T Consensus 321 ~~i~~~~a~G~g~~rPe~a~~i~~~~a 347 (347) T protein:vir:94 321 DQIIAKYAMGHGGLRPEACGALVFKKA 347 (347) T ss_pred hhhhhhhhhcCcccccceeEEEEecCC Confidence 999999999999999999999999988 No 3 >protein:vir:3364 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523335;genbank:gi:17570826;genbank:GeneID:927448 Probab=100.00 E-value=6.5e-104 Score=586.30 Aligned_cols=343 Identities=80% Similarity=1.169 Sum_probs=318.4 Q ss_pred CCCCCccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEeccCcceeeeecCCCcCC Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVLGRTRAAYLQAGQSLD 80 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG~~t~~~~~~g~~i~ 80 (343) |||+++|++++|||||+++++|+++||||+|++||+++|+++|+|+++++.|++++||++|||++|++++++|++|++++ T Consensus 1 ~~~~~~~~~~~t~~g~~~~~~~~~al~ie~~~g~V~~~f~~~s~~~~~v~~r~~~~G~sv~i~~iG~~t~~~~~~g~~l~ 80 (347) T protein:vir:33 1 MANIQGGQQIGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTMPRHMLRSIASGKSAQFPVIGRTKAAYLKPGENLD 80 (347) T ss_pred CCCCccCcccccccccCCcccchHHHHHHHHHHHHHHHHHHHHhhhhhhccccccccceeEeeeccceeeeeecCCCCCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccccc-- Q lcl|NC_011085. 
81 DKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENIAGL-- 158 (343) Q Consensus 81 ~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~g~-- 158 (343) ++++++++++++|+||+++||+|.|||+|++|+++|+|+++++++|++||+++|++|+++++++.+.+..+....+++ T Consensus 81 ~~~~~~~~~e~~ltiD~~~y~~~~VddiD~~q~~~D~~~~~~~~~g~aLA~~~D~~i~~~l~~~~~~~~~~~~~~~~~~~ 160 (347) T protein:vir:33 81 DKRKDIKHTEKVIHIDGLLTADVLIYDIEDAMNHYDVRAEYTAQLGESLAMAADGAVLAELAGLVNLPDGSNENIEGLGK 160 (347) T ss_pred CCCCCCccceEEEEechhhhhhHHHhhHHHHhcCCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccccccc Confidence 988889999999999999999999999999999999999999999999999999999999998877655444433333 Q ss_pred CCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhccccccchhcceeE Q lcl|NC_011085. 159 GSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAALIDPERGSIR 238 (343) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~V~ 238 (343) ..++.+..++++...++...++++++.|++|+++|+|++||++|||+||+|++|++||+++++++++|.+++.+++|.|+ T Consensus 161 ~~~~~~~~~~tg~~~d~~~~a~~i~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~~~d~~~~~~~~~G~V~ 240 (347) T protein:vir:33 161 PTVLTLVKPTTGSLTDPVELGKAIIAQLTIARASLTKNYVPAADRTFYTTPDNYSAILAALMPNAANYQALLDPERGTIR 240 (347) T ss_pred cccccccccccccccchhhhHHHHHHHHHHHHHHHhhcCCCccCcEEEeCHHHHHHHhccccccccccccccccccceeE Confidence 33344555666677788888999999999999999999999999999999999999999999999999988999999999 Q ss_pred EEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEeeeeccchhhhh Q lcl|NC_011085. 
239 NVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERARRAEYQADQ 318 (343) Q Consensus 239 ~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~~~~~~~d~ 318 (343) +++||+||+|||||..+.+......+++..+.+.......++.+|+..+||+||++|+++++++++++|++|++++|+|+ T Consensus 241 ~i~G~~V~~Sn~lp~~~~~~~~~~~~ag~~~~~~~~~~~~~~~a~~~~~gl~~h~~A~g~v~~~~~~~e~~r~~~~~~d~ 320 (347) T protein:vir:33 241 NVMGFEVVEVPHLTAGGAGDTREDAPADQKHAFPATSSTTVKVALDNVVGLFQHRSAVGTVKLKDLALERARRANYQADQ 320 (347) T ss_pred EEeceeEEEecccccCccccccccccccccccccCCcccceeccccceeeeeecchhheeeeeeceeeeeccchhhhhHh Confidence 99999999999999988877777777777888777777889999999999999999999999999999999999999999 Q ss_pred hhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 319 IIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 319 i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) |+++++||++++||||+|+|+++.= T Consensus 321 i~~~~~~G~~vlrP~~av~i~~~~~ 345 (347) T protein:vir:33 321 IIAKYAMGHGGLRPEAAGAIVLPKV 345 (347) T ss_pred hhhhhhcCCceecccceEEEecCCC Confidence 9999999999999999999999876 No 4 >protein:vir:10450 Length: 344 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848297;genbank:gi:30387487;genbank:GeneID:1733971 Probab=100.00 E-value=2.3e-103 Score=583.25 Aligned_cols=342 Identities=78% Similarity=1.146 Sum_probs=318.2 Q ss_pred CCCCCccccccccccc-cccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEeccCcceeeeecCCCcC Q lcl|NC_011085. 1 MADMKGGQQLGKDQGK-GQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVLGRTRAAYLQAGQSL 79 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~-~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG~~t~~~~~~g~~i 79 (343) |||++++++.++.++. 
.++++|+++||||+|+|||+++|+++|+|++++++|+|++|||++||++|++++++|+||++| T Consensus 1 ma~~~~~~~~n~~~~~~~~~~~~~~al~ie~~~geV~~~f~~~s~~~~~~~~r~i~~g~s~~~~~iG~~~~~~~~~G~~l 80 (344) T protein:vir:10 1 MANMTGGQQLGTNQGKDVMAAGDKLALFLKVFGGEVLTAFARTSVTTSRHMVRSISSGKSAQFPVLGRTQAAYLAPGENL 80 (344) T ss_pred CccccccccCCcccCCccCCccchhHHHHHHHHHHHHHHHHHHhhhcccceeeeecccceEEEEeeceeEEEeeecCCCC Confidence 9999999887744433 378888999999999999999999999999999999999999999999999999999999999 Q ss_pred CCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccccC Q lcl|NC_011085. 80 DDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENIAGLG 159 (343) Q Consensus 80 ~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~g~~ 159 (343) +++.+++++++++|+||+.+|++|.|||+|++|++||+|+++++++|++||+++|++|+++++++++.++|.+..+++++ T Consensus 81 ~~t~~~~~~~e~~l~ID~~~y~~~~VdDiD~~q~~~D~r~~~~~~~G~aLA~~~D~~i~~~la~~a~~~~~~~~~~~g~~ 160 (344) T protein:vir:10 81 DDIRKDIKHTEKVITIDGLLTADVLIYDIEDAMNHYDVRSEYTSQLGESLAMAADGAVLAEIAGLCNVESQYNENITGLG 160 (344) T ss_pred CCCCCCcccceEEEEEcchhhhhhhhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccc Confidence 99888999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred Cceeecccc-cccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhccccccchhcceeE Q lcl|NC_011085. 160 SASILEVGA-KGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAALIDPERGSIR 238 (343) Q Consensus 160 ~~~~~~~~~-~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~V~ 238 (343) +++++.... 
+...++++..++++++.|++|+++|+|++||++|||+||+|++|++||+++++++.+|.++..+++|+|+ T Consensus 161 ~~~~~~~~~~~~~~t~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~~~~~~~~~~~~~G~V~ 240 (344) T protein:vir:10 161 TATVIETTQDKTTLTDQVALGKEIIAALTKARAALTKNYVPSSDRVFYCDPDSYSAILAALMPNAANYAALIDPEKGSIR 240 (344) T ss_pred ccceeecccccccccchhhhHHHHHHHHHHHHHHHhhcCCCccCCEEEeChHHHHHHhhcccccccccccccceeeeEEE Confidence 998877654 4455677888899999999999999999999999999999999999999999999999999999999999 Q ss_pred EEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEeeeeccchhhhh Q lcl|NC_011085. 239 NVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERARRAEYQADQ 318 (343) Q Consensus 239 ~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~~~~~~~d~ 318 (343) +++||+||+|||+|.++.+ .+...+++.+|.+.....++++.+|+++|||+|||+|+++++++++++|.+|++++|+|+ T Consensus 241 ~v~G~~V~~Sn~lp~~~~~-~~~~~~tg~~~~~~~~~~~~~~~~~s~~~~l~~h~~A~~~v~~~~~~~e~~r~~~~~~d~ 319 (344) T protein:vir:10 241 NVMGFEVVEVPHLTAGGAG-TSREGTTGQKHAFPATKSGNDKVAKDNVIGLFMHRSAVGTVKLRDLALERARRANFQADQ 319 (344) T ss_pred EEeceEEEeccccccccCC-cccccccCccccccCCcccceeeecceeEEEeechhhhhhhhhccceeecccchhHHHHH Confidence 9999999999999987554 455566777888777777889999999999999999999999999999999999999999 Q ss_pred hhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 
319 IIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 319 i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) |+++|+|||+++||||+++|+++.- T Consensus 320 i~g~~~~G~~vlRPe~a~~v~~~~~ 344 (344) T protein:vir:10 320 IIAKYAMGHGGLRPEAAGAVVFKTK 344 (344) T ss_pred HHHHhhcccceecccceEEEEeecC Confidence 9999999999999999998888888 No 5 >protein:vir:1541 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052109;swissprot:trembl:q9t107;genbank:gi:9634035;uniprot:Q9T107;genbank:GeneID:1262383 Probab=100.00 E-value=2.9e-102 Score=577.26 Aligned_cols=343 Identities=81% Similarity=1.174 Sum_probs=318.4 Q ss_pred CCCCCccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEeccCcceeeeecCCCcCC Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVLGRTRAAYLQAGQSLD 80 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG~~t~~~~~~g~~i~ 80 (343) |||+++|++++|||||+++++|+++||||+|+++|+++|+++|++++++++|++++||++|||++|++++++|++|++++ T Consensus 1 ma~~~~~~~~~t~~~~~~~~~~~~a~~ie~f~g~V~~~f~~~s~~~~~~~~~~~~~G~sv~i~~ig~~t~~~~~~g~~l~ 80 (347) T protein:vir:15 1 MANIQGGQQIGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTMPRHMLRSIASGKSAQFPVIGRTKAAYLKPGENLD 80 (347) T ss_pred CCccccCCccccccccCCCcchHHHHHHHHHHHHHHHHHHHhhhhhhccccccccccceeEeeeccceeeeeeccCCCCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccccCC Q lcl|NC_011085. 81 DKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENIAGLGS 160 (343) Q Consensus 81 ~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~g~~~ 160 (343) ++++++++++++|+||+++|++|.|||+|++|+++|+|+++++++|++||+++|++|+++++++++++........+.+. 
T Consensus 81 ~~~~~~~~~e~~ltID~~~~~~~~VddlD~~q~~~D~~~~~~~~~g~aLA~~~D~~i~~~l~~~~~~~~~~~~~~~~~g~ 160 (347) T protein:vir:15 81 DKRKDIKHTEKVIHIDGLLTADVLIYDIEDAMNHYDVRAEYTAQLGESLAMAADGAVLAELAGLVNLPDASNENIEGLGK 160 (347) T ss_pred CCCCCCccceEEEEechhhhhhHHhhhHHHHhcCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccCc Confidence 88888999999999999999999999999999999999999999999999999999999999887765544444433332 Q ss_pred c--eeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhccccccchhcceeE Q lcl|NC_011085. 161 A--SILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAALIDPERGSIR 238 (343) Q Consensus 161 ~--~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~V~ 238 (343) . .......+++.++|...+++|+++|++|+++|+|++||++|||+||+|++|+.||+++++++.+|.++..+++|.|+ T Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~i~d~~~~a~~~Lde~~VP~~gR~~vv~P~~y~~LL~~~~~~~~d~~~~~~~~~G~Vg 240 (347) T protein:vir:15 161 PTVLTLVKPTTGDLTDPVELGKAIIAQLTIARASLTKNYVPAADRTFYTTPDNYSAILAALMPNAANYQALIDHERGTIR 240 (347) T ss_pred cccccccccccccchhhhhHHHHHHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHhcccccccccccccccccceEEE Confidence 2 33344566678889999999999999999999999999999999999999999999999999999998899999999 Q ss_pred EEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEeeeeccchhhhh Q lcl|NC_011085. 
239 NVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERARRAEYQADQ 318 (343) Q Consensus 239 ~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~~~~~~~d~ 318 (343) +++||+||+|||||..+.+......+++.++.+........+.+|+..++|+||++|+++++++++++|.+|++++|+|+ T Consensus 241 ~i~G~~V~~Sn~lp~~~~t~~~~~~~~g~~~~~~~~~~~~~~~~f~~~~~l~~h~~A~g~v~~~~~~~e~~~~~~~~~d~ 320 (347) T protein:vir:15 241 NVMGFEVVEVPHLTAGGAGDTREDAPADQKHAFPATSSTTVKVALDNVVGLFQHRSAVGTVKLKDLALERARRANYQADQ 320 (347) T ss_pred EEeceEEEecccccccccccccccccccccccccccccceeeeccccceeeeeccceeeeeEeeceeeeecccchhhhhh Confidence 99999999999999988777777777888888887777788999999999999999999999999999999999999999 Q ss_pred hhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 319 IIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 319 i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) |+++++||++++||||+|+|+++.= T Consensus 321 i~~~~~~G~~vlrP~~av~~~~~~~ 345 (347) T protein:vir:15 321 IIAKYAMGHGGLRPEAAGAIVLPKV 345 (347) T ss_pred hehhhhcCCceeccccEEEEecCCC Confidence 9999999999999999999999876 No 6 >protein:vir:94711 Length: 347 # NCBI annotation: capsid # Family: family:all:975 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338120;genbank:gi:77118198;genbank:GeneID:3707734 Probab=100.00 E-value=4e-102 Score=576.47 Aligned_cols=342 Identities=73% Similarity=1.115 Sum_probs=321.2 Q ss_pred CCCCCccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEeccCcceeeeecCCCcCC Q lcl|NC_011085. 
1 MADMKGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVLGRTRAAYLQAGQSLD 80 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG~~t~~~~~~g~~i~ 80 (343) |||.+ ++.++|||||+++++|+++||||+|.+||+++|+++|+++++++.|+|++||++|||++|++++++|+||++|+ T Consensus 1 m~~~~-~~~~~t~~g~~~~~~d~~al~ik~f~~eV~~~f~~~s~~~~~~~~r~i~~G~sv~i~~iG~~tv~~~t~G~~l~ 79 (347) T protein:vir:94 1 MANVP-GQKIGTDQGKGKSSSDALALFLKVFAGEVLTAFTRRSVTADKHIVRTIQNGKSAQFPVMGRTSGVYLAPGERLS 79 (347) T ss_pred CCCCC-ccccccccccCCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccccceEEEecccceeeeeecCCCCcC Confidence 99995 68889999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccccCC Q lcl|NC_011085. 81 DKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENIAGLGS 160 (343) Q Consensus 81 ~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~g~~~ 160 (343) ++++++++++++|+||+++|++|.|||+|++|+++|+|+++++++|++||+++|++|++++++.++++.+.+..++|++. T Consensus 80 ~~~~~~~~~e~~itID~~~~~~~~VddiD~~q~~~D~~~~~~~~~g~aLa~~~D~~i~~~~~~~aa~~~~~~~~~~g~~~ 159 (347) T protein:vir:94 80 DKRKGIKHTEKVITIDGLLTADVMIFDIEDAMNHYDVAGEYSNQLGEALAIAADGAVLAEMAILCNLPAASNENIAGLGT 159 (347) T ss_pred CCCCCCCcceEEEEecchhhhhHHhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccCCCcc Confidence 98888999999999999999999999999999999999999999999999999999999999999888888888999999 Q ss_pred ceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhccccccchhcceeEEE Q lcl|NC_011085. 
161 ASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAALIDPERGSIRNV 240 (343) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~V~~i 240 (343) +++++.+..++..++++..+++++.|++|+++|+|++||++|||+||+|++|++||+++++++.++.++..+++|.|+++ T Consensus 160 ~s~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~~R~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~Vg~i 239 (347) T protein:vir:94 160 ASVLEVGKKADLDTPAKLGEAIIGQLTIARAKLTSNYVPAGDRYFYTTPDNYSAILAALMPNAANYAALIDPETGNIRNV 239 (347) T ss_pred cceeeccccccccchhhhHHHHHHHHHHHHHHHhhcCCCCCCcEEEeCHHHHHHHhccchhhhhhccccccccccceEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999889999999999 Q ss_pred eceEEEEecccccccccccccc----ccccccccccccccccccccccceEeEeechhhheeeeeeeeEEeeeeccchhh Q lcl|NC_011085. 241 MGFEVVEVPHLTAGGAGDDRED----ETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERARRAEYQA 316 (343) Q Consensus 241 ~Gf~V~~sn~lp~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~~~~~~~ 316 (343) +||+||+|||||..+.+..+.. ...+-.+.+......+|+++|++.++++|||+|+++++++++++|.+|++++|+ T Consensus 240 ~G~~V~~Sn~lp~~~~t~~~~~~~~~~~aG~~~~~~~~~~~~~~~~~~~~~~l~~h~~A~~~v~~~~~~~e~~r~~~~~~ 319 (347) T protein:vir:94 240 MGFVVVEVPHLVQGGAGETRGDDGITIASGQKHAFPATASSDVKVTMDNVVGLFSHRSAVGTVKLRDLALERDRDVDAQG 319 (347) T ss_pred eceEEEecCcccccccccccccCcceecCcccccccccchhhhcccccceeEEEeehhhhhhhhcccccccchhchhhHH Confidence 9999999999998766554332 222234555556667899999999999999999999999999999999999999 Q ss_pred hhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 
317 DQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 317 d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) |+|+++++||++++||||+|+|+++.= T Consensus 320 d~i~~~~~~G~~~~rP~~a~~~~~~~A 346 (347) T protein:vir:94 320 DLIVGKYAMGHGGLRPEAAGALVFSPA 346 (347) T ss_pred HHhhhhhhhcCcccccceeEEEEecCC Confidence 999999999999999999999999988 No 7 >protein:vir:2201 Length: 345 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041998;swissprot:sw:p19726;genbank:gi:9627469;goa:P19726;uniprot:P19726;genbank:GeneID:1261026 Probab=100.00 E-value=8.6e-102 Score=574.66 Aligned_cols=341 Identities=78% Similarity=1.151 Sum_probs=311.2 Q ss_pred CCCCCcccccc--ccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEeccCcceeeeecCCCc Q lcl|NC_011085. 1 MADMKGGQQLG--KDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVLGRTRAAYLQAGQS 78 (343) Q Consensus 1 ~~~~~~~~~~~--t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG~~t~~~~~~g~~ 78 (343) ||+++++++.+ |||||+ +++|+++||||+|+|||+++|+++|++++++++|+|++|||++||++|++++++|+||++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~-~~~~~~al~le~f~geV~~~f~~~s~~~~~~~~r~i~~gks~~~~~iG~~~~~~~~~G~~ 79 (345) T protein:vir:22 1 MASMTGGQQMGTNQGKGVV-AAGDKLALFLKVFGGEVLTAFARTSVTTSRHMVRSISSGKSAQFPVLGRTQAAYLAPGEN 79 (345) T ss_pred Ccccccchhcccccccccc-cCCchhHHHHHHHhHHHHHHHHHHhhhcccceeeeccccceEEEeeecceEEEeeecCCC Confidence 99999998877 666776 577999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccccc Q lcl|NC_011085. 
79 LDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENIAGL 158 (343) Q Consensus 79 i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~g~ 158 (343) |++++++++++|++|+||+.+|++|.|||+|++|++||+|+++++|+|++||+++||+|+++++++++.+.+.++.++++ T Consensus 80 l~~~~~~~~~~e~~ltID~~~y~~~~VddiD~~q~~~D~r~~~s~~~G~aLA~~~D~~i~~~l~k~a~~~~~~~~~~~~~ 159 (345) T protein:vir:22 80 LDDKRKDIKHTEKVITIDGLLTADVLIYDIEDAMNHYDVRSEYTSQLGESLAMAADGAVLAEIAGLCNVESKYNENIEGL 159 (345) T ss_pred CCCCCCCcccceEEEEecchhhhhhhHhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccc Confidence 99988889999999999999999999999999999999999999999999999999999999999999999999999998 Q ss_pred CCceeecccc-cccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhccccccchhccee Q lcl|NC_011085. 159 GSASILEVGA-KGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAALIDPERGSI 237 (343) Q Consensus 159 ~~~~~~~~~~-~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~V 237 (343) +.+..+.... ....+++...++.+++.|++|+++|+|++||.+|||+||+|++|++||+++++++.+|.++..+++|+| T Consensus 160 ~~~~~~~~~~~g~~~t~~~~~~~~~~~ai~~a~~~Lde~~VP~~~R~~vv~P~~y~~Ll~~~~~~~~~~~~~~~~~~G~V 239 (345) T protein:vir:22 160 GTATVIETTQNKAALTDQVALGKEIIAALTKARAALTKNYVPAADRVFYCDPDSYSAILAALMPNAANYAALIDPEKGSI 239 (345) T ss_pred ccccccccccccccccccccCHHHHHHHHHHHHHHhhhcCCCccCCEEEeChHHHHHHhccccccccccccccccccceE Confidence 8888776554 344556777788999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEeceEEEEeccccccccccccccccccccccccc-cccccccccccceEeEeechhhheeeeeeeeEEeeeeccchhh Q lcl|NC_011085. 238 RNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPK-TAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERARRAEYQA 316 (343) Q Consensus 238 ~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~~~~~~~ 316 (343) ++++||+||+|||+|....+.... .+....|.++. 
..+.++..+.+++|+++|||+|+++++++++++|.+|++++|+ T Consensus 240 ~~i~G~~V~~sn~lp~~~~~~~~~-~~~~~~~~~~~~~g~~~~~~~~~~~~~l~~h~~A~~~v~~~~~~~e~~r~~~~~~ 318 (345) T protein:vir:22 240 RNVMGFEVVEVPHLTAGGAGTARE-GTTGQKHVFPANKGEGNVKVAKDNVIGLFMHRSAVGTVKLRDLALERARRANFQA 318 (345) T ss_pred EEEeceEEEecccccccccCcccc-CcccccccccccccceeeeeccCceEEEEEehhheeeeeeecceeeeeechhHHH Confidence 999999999999999765554433 34444444443 3345677788999999999999999999999999999999999 Q ss_pred hhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 317 DQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 317 d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) |+|+++++|||+++||||+++|+++-- T Consensus 319 d~I~~~~a~G~~vlRPeaa~~i~~~~~ 345 (345) T protein:vir:22 319 DQIIAKYAMGHGGLRPEAAGAVVFKVE 345 (345) T ss_pred HHHHHHHhcCCcccccceeEEEEEeeC Confidence 999999999999999999999998888 No 8 >protein:vir:100057 Length: 375 # NCBI annotation: T7-like capsid protein # Family: family:all:975 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214206;genbank:gi:61806429;genbank:GeneID:3294737 Probab=100.00 E-value=4.4e-94 Score=532.38 Aligned_cols=341 Identities=24% Similarity=0.338 Sum_probs=295.2 Q ss_pred CCC----CCccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEeccCcceeeeecCC Q lcl|NC_011085. 
1 MAD----MKGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVLGRTRAAYLQAG 76 (343) Q Consensus 1 ~~~----~~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG~~t~~~~~~g 76 (343) |++ .-++.|.+|||||+++ +|+++||||+|+|||+++|+++|++++++++|+|++|||++|+++|++++++|+|| T Consensus 1 ~~~~~~~~~~~~n~~t~~~~~~~-~~~~al~le~f~geV~~~f~~~si~~~~~~~rti~~Gksv~f~~iG~~t~~~~t~G 79 (375) T protein:vir:10 1 MANANQVALGRSNLSTGTGYGGA-TDKYALYLKLFSGEMFKGFQHETIARDLVTKRTLKNGKSLQFIYTGRMTSSFHTPG 79 (375) T ss_pred CccccccccCccccCCccccccc-cchHHHHHHHHhHHHHHHHHHHHhhhccccccccccCceEEEEeeeeeEEeeecCC Confidence 443 3466788999999954 68899999999999999999999999999999999999999999999999999999 Q ss_pred CcCCCcc-CCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccc Q lcl|NC_011085. 77 QSLDDKR-KDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENI 155 (343) Q Consensus 77 ~~i~~~~-~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~ 155 (343) ++|++++ .++++++++|+||+.+||+|.|||+|++|+++|+|+++++|+|++||+++|++|+++++++++...|..... T Consensus 80 ~~i~~~~~~d~~~te~~l~ID~~~y~~~~VdDiD~aqa~~Dlr~e~s~~~G~aLA~~~D~~i~~~l~kaa~~~~p~~~~~ 159 (375) T protein:vir:10 80 TPILGNADKAPPVAEKTIVMDDLLISSAFVYDLDETLAHYELRGEISKKIGYALAEKYDRLIFRSITRGARSASPVSATN 159 (375) T ss_pred cCcCCccccCCCCCceEEEecchhhhhhhHhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccc Confidence 9998763 567889999999999999999999999999999999999999999999999999999999999988877766 Q ss_pred cccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhcc---chhhhhccccccch Q lcl|NC_011085. 156 AGLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAA---LMPNAANYAALIDP 232 (343) Q Consensus 156 ~g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~---~~~~~~~~~~~~~~ 232 (343) ...++++.+..++.+.. +.+..++++++.|++++++|+|++||++|||+||+|++|++||++ +++++.+|+++... 
T Consensus 160 ~~~~Gg~~i~~~sg~~~-~~~~ta~~~~~ai~~a~~~Lde~~VP~~~R~~vv~P~~y~~Ll~~~d~~~~~n~d~~~~~~~ 238 (375) T protein:vir:10 160 FVEPGGTQIRVGSGTNE-SDAFTASALVNAFYDAAAAMDEKGVSSQGRCAVLNPRQYYALIQDIGSNGLVNRDVQGSALQ 238 (375) T ss_pred ccccCcceeeecccccc-ccccCHHHHHHHHHHHHHHHhhcCCCCCCCEEEeChHHHHHHHhcCCccceeeeccccccee Confidence 66677777766543332 223346788999999999999999999999999999999999986 67889999988889 Q ss_pred hcceeEEEeceEEEEecccccccccccccccccc-------cccccccc--------ccccccccc---cceEeEeechh Q lcl|NC_011085. 233 ERGSIRNVMGFEVVEVPHLTAGGAGDDREDETTN-------QKHAFPKT--------AEGDTKVAL---DNVVGLFQHRS 294 (343) Q Consensus 233 ~~G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~-------~~~~~~~~--------~~~~~~~~~---~~~~~l~~~~~ 294 (343) .+|.|++++||+||+|||+|..+.+.+..+.+.+ ..|.++.. .+.+|+++| +++||++|||+ T Consensus 239 ~~g~v~~i~Gv~V~~Sn~lP~~~~~~~~~g~~~~~~a~~~~~~~~~~~~~~~~~~~g~~~~y~~d~~~~~~~~~~~~~~~ 318 (375) T protein:vir:10 239 SGNGVIEIAGIHIYKSMNIPFLGKYGVKYGGTTGETSPGNLGSHIGPTPENANATGGVNNDYGTNAELGAKSCGLIFQKE 318 (375) T ss_pred ccceEEEEeceEEEEeccccccccccccccccccccchhhhhccccccCCcceeeccccccccccccccCceEEEEEchh Confidence 9999999999999999999987665433322211 11222111 224689898 89999999999 Q ss_pred hheeeeeeeeEEeee---eccchhhhhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 
295 AVGTVKLKDLSLERA---RRAEYQADQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 295 Av~~~~~~~~~~e~~---~~~~~~~d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) |+++++++++++|.+ |+++||+|+|+++|+|||+++||||+|+|++++= T Consensus 319 A~g~v~~~~~~~~~~~~~~~~~~q~~~i~~~~a~G~~~lrp~~av~l~~~~~ 370 (375) T protein:vir:10 319 AAGVVEAIGPQVQVTNGDVSVIYQGDVILGRMAMGADYLNPAAAVELYIGAT 370 (375) T ss_pred heeeeeeeccccccccchhhheeeeeeeeeeeeeccCccCceeEEEEecCcC Confidence 999999999999987 6999999999999999999999999999999832 No 9 >protein:vir:80213 Length: 334 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522884;genbank:gi:158345177;genbank:GeneID:5687476 Probab=100.00 E-value=1.8e-92 Score=523.62 Aligned_cols=326 Identities=16% Similarity=0.131 Sum_probs=284.5 Q ss_pred CCCCCccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEeccCcceeeeecCCCcCC Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVLGRTRAAYLQAGQSLD 80 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG~~t~~~~~~g~~i~ 80 (343) |++.+.+.+ |||+|++.++| ++||||+|+|||+++|+++|+|++++++|+|++|||+|||++|++++++|+||++++ T Consensus 1 m~~~~~~~~--t~~~~~~~~~~-~~l~le~~~geV~~af~~~s~~~~~~~~r~i~~G~s~~~~~iG~~~~~~~~~g~~l~ 77 (334) T protein:vir:80 1 MTYPAANTH--TRPGWGGANSD-VSLHIEEHLGLVDASFMYSSKFASWMNVRSLRGTNQLRVDRVGASTIAGRKAGEELV 77 (334) T ss_pred CCCCcCCCc--cccccccccch-heehhhhhhhHHHHHHHHhhhhhccceeeeccccceEEEeeecceeeeeecCCCCCC Confidence 888876444 89999988776 789999999999999999999999999999999999999999999999999999998 Q ss_pred CccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccccCC Q lcl|NC_011085. 
81 DKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENIAGLGS 160 (343) Q Consensus 81 ~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~g~~~ 160 (343) ++ ++++++++|+||+.+|++++|||+|++|++||+|+++++|+|++||+++||+|+++++++++.+.|.........+ T Consensus 78 ~~--~~~~~~~~l~ID~~l~~~~~VddiD~~q~~~D~rse~~~~~G~aLA~~~D~~~~~~l~kaa~~~~~~~~~~~~~~G 155 (334) T protein:vir:80 78 VQ--KNVSDKLNLTVDTVLYARHFFDKFDEWTSNLDVRKETAREDGIALARQYDQACIIQLQKCGDFLAPAHLKPAFHDG 155 (334) T ss_pred CC--CcccCceEEEEeeeeehhhhHhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccccCC Confidence 86 4889999999999999999999999999999999999999999999999999999999999988776544332222 Q ss_pred ceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCc---CCcEEEeCHHHHHHHhccchhhhhcccc---ccchhc Q lcl|NC_011085. 161 ASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPS---ADRTFYTTPEVYSAILAALMPNAANYAA---LIDPER 234 (343) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~---~gR~~vv~P~~~~~Ll~~~~~~~~~~~~---~~~~~~ 234 (343) +....... +...+.+..++.++++++.|++.|+|++||+ .|||+||+|++|++||++++|++++|++ ...+.+ T Consensus 156 ~~~~~~~~-g~~~~~~~~~~~l~~a~~~a~~~L~e~dvp~~~~~~R~~vv~P~~y~~Ll~~~r~~n~d~~~s~~~~~~~~ 234 (334) T protein:vir:80 156 ILLPSTIS-GLAADAAADADVLVAAHRQGVEAMVFRDLGDQLMSEGVTLLDPVIFSFLLEHDRLMNVEFGAKEGGNSFVG 234 (334) T ss_pred cceeeccc-ccccchhhhHHHHHHHHHHHHHHHHhcCCCCCcCCceEEEeChHHHHHHhcccccccceeccccccccccc Confidence 22222221 1222344557888999999999999999995 6799999999999999999999999864 346899 Q ss_pred ceeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEeeeeccch Q lcl|NC_011085. 235 GSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERARRAEY 314 (343) Q Consensus 235 G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~~~~~ 314 (343) |+|++++||+||+|||+|..+.+.+.. .. 
....|+++|++.+++|+|++|+++++++++++|.+|++++ T Consensus 235 g~i~~v~G~~V~~Sn~~P~~~~t~~~~----------g~-~~~~~agd~t~~~~~~~~~~Al~t~~~~~~~~e~~~~~~~ 303 (334) T protein:vir:80 235 GRIAMLNGVRVVETPRFPQSAITANAL----------GA-DFNVTDAEVRRKMITFIPSMALISAQVHPVSAQFWEEKKD 303 (334) T ss_pred eeEEEEeceEEEeecCCCCcccccccc----------cc-ccccccccccceEEEEEeCceEEEEEEeecceeeeechhh Confidence 999999999999999999876554322 11 1236889999999999999999999999999999999999 Q ss_pred hhhhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 315 QADQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 315 ~~d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) |+|+|+++++|||+++||||+++++++-= T Consensus 304 ~~d~i~~~~a~G~g~lRPeaa~vv~~~~~ 332 (334) T protein:vir:80 304 FGHYLDTFQSYNIGQRRPDAVAVHDITVT 332 (334) T ss_pred HHHHHHHHHHcCCceeccceEEEEEEeee Confidence 99999999999999999999999998866 No 10 >protein:vir:103323 Length: 364 # NCBI annotation: major capsid-like protein # Family: family:all:2806 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039668;genbank:gi:125999997;genbank:GeneID:4818399 Probab=100.00 E-value=7.1e-92 Score=520.28 Aligned_cols=332 Identities=14% Similarity=0.109 Sum_probs=286.9 Q ss_pred CCccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEeccCcceeeeecCCCcCCCcc Q lcl|NC_011085. 4 MKGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVLGRTRAAYLQAGQSLDDKR 83 (343) Q Consensus 4 ~~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG~~t~~~~~~g~~i~~~~ 83 (343) |+.++. 
.|||||+++ +|+++||||+|+|||+++|+++|++++++++|+|++|||++||++|++++++|+||++++++ T Consensus 1 ms~~n~-~t~~~~~~~-~~~~al~le~f~geV~taf~~~s~~~~~~~~rti~~gkS~q~~~iG~~~~~~~~~G~~ld~~- 77 (364) T protein:vir:10 1 MSNPNV-LTQPAVSAS-GEVDSLLIEKFNNRVHEQYLKGENLLQWFDVQEVVGTNSVSNKYIGETELQVLSPGKSPDAS- 77 (364) T ss_pred CCCccc-ccccccccc-cchhhhhhhhhhhhHHHHHHHHHhhcCcceeeeecccceEEeeeeeeeEEeeeccCcccCCC- Confidence 777666 599999854 47799999999999999999999999999999999999999999999999999999999874 Q ss_pred CCCccceEEEEeeeeeeeeeeccchHHHHhchh-hHHHHHHHHHHHHHHHHHHHHHHHHHhhhh-ccccccccccccCCc Q lcl|NC_011085. 84 KDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYD-VRSEYTSQIGESLAMAADGAVLAELAGLCN-MPAASNENIAGLGSA 161 (343) Q Consensus 84 ~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d-~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~-~~~~~~~~~~g~~~~ 161 (343) ++.+++++|+||+.+|++++|+|+|++|++|| +|+++++|+|++||+++||+|++++.+++. .+.+....+.+.+.| T Consensus 78 -~~~~~k~~itID~ll~a~~~V~diDe~q~~~D~vR~e~s~e~G~ALA~~~Dq~i~~~v~~aa~a~~~~~~~~~~~~~~g 156 (364) T protein:vir:10 78 -PTEFDKNRLVVDTTVIARNTVAHFHDVQNDIDGLKSKLSVNQAKKLKKMEDSMVIQQLVLGGISNTEAIRKNPRVAGHG 156 (364) T ss_pred -CcccCcEEEEecceeeechhhhhHHHHhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccCCcccCCc Confidence 58899999999999999999999999999999 899999999999999999999988765542 222333334444444 Q ss_pred eeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhccc--cccchhcceeEE Q lcl|NC_011085. 162 SILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYA--ALIDPERGSIRN 239 (343) Q Consensus 162 ~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~--~~~~~~~G~V~~ 239 (343) ..+.... 
...+....+.+++++|++|.+.|+|++||.+|||+||+|++|++||+++++++.+|+ ++..+.+|+|++ T Consensus 157 ~~i~~~~--~a~~~~~~~~~l~~ai~~a~~~LdEkdVP~~~R~~vv~P~~y~~Ll~~~~lvn~d~~~~~~~~~~~G~v~~ 234 (364) T protein:vir:10 157 FSIHIVG--LASSFLTSPQYMMAAIEMAMEQQTEQEVDTSELCGLMPWTAFNCLRDADRIVDKSYTIAASDNTVDGFVLK 234 (364) T ss_pred ceeeecc--cCcchhhhHHHHHHHHHHHHHHHhhcCCCccccEEEeChHHHHHHhcCCccccccccccCCCccccceeEE Confidence 4443322 223445567889999999999999999999999999999999999999999999986 567799999999 Q ss_pred EeceEEEEecccccccccccccccccccccccccccccc-c--cccccceEeEeechhhheeeeeeeeEEeeeeccchhh Q lcl|NC_011085. 240 VMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGD-T--KVALDNVVGLFQHRSAVGTVKLKDLSLERARRAEYQA 316 (343) Q Consensus 240 i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~-~--~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~~~~~~~ 316 (343) ++||+||+|||||+.+.++... .....|.+++.++++ | .++++++++++|||+|+++++++++++|.+|++++|+ T Consensus 235 v~Gv~Vv~Sn~lP~~~~~~~~t--~~~t~h~ls~~~~g~~y~v~~d~~~~~~~~f~~~Al~tv~~~~~t~e~~~~~~~~~ 312 (364) T protein:vir:10 235 SWNTPIVPSNRFPKLSDNTEGT--GNTKHHKLSNAGNGNRYDVTAGQTSAQAVLFTQDALLVGRTISITGDIFYEKKEKT 312 (364) T ss_pred EeceEEEecccccccccccccc--ccccccccccccCCcccccccccceeEEEEEecceEEEEEEecceeeeeeccceee Confidence 9999999999999876554332 234567777777664 3 3788999999999999999999999999999999999 Q ss_pred hhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 317 DQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 317 d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) |+|+++++|||+++||||+++|++.+. 
T Consensus 313 ~~ida~~a~G~g~lRPeaa~~i~~~~~ 339 (364) T protein:vir:10 313 WYIDTFLAEGAIPDRWEAVAVVTAADT 339 (364) T ss_pred eeeeeehcccCcccCccceEEEEecCC Confidence 999999999999999999999999887 No 11 >protein:vir:6324 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877471;genbank:gi:33300843;uniprot:Q7Y2D3;genbank:GeneID:1482613 Probab=100.00 E-value=2.6e-91 Score=517.21 Aligned_cols=319 Identities=18% Similarity=0.182 Sum_probs=280.6 Q ss_pred CCccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEeccCcceeeeecCCCcCCCcc Q lcl|NC_011085. 4 MKGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVLGRTRAAYLQAGQSLDDKR 83 (343) Q Consensus 4 ~~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG~~t~~~~~~g~~i~~~~ 83 (343) |++.++ .|||||+++++|. +||||+|+|||+++|+++|+|++++++|+|++|||+|||++|+.++++|+||++|++++ T Consensus 1 ms~~~~-~tr~~~~~s~~d~-al~le~f~geV~~af~~~s~~~~~~~~rti~~g~s~~~~~iG~~~~~~~~pG~~l~~~~ 78 (335) T protein:vir:63 1 MSFLND-LTRPNYAGKNADV-DIHLEEHLGIVDKHFAYTSKFAPLMNIRDLRGSNVVRLDRLGNVEAKGRRAGEELERSR 78 (335) T ss_pred CCCccc-chhhhcccccchh-heehhhhhhhHHHHHHhhhhhccccceeeeccceeEEEeeeeeeeeecccCCcCcCCCC Confidence 666666 4999999999985 89999999999999999999999999999999999999999999999999999999874 Q ss_pred CCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccccCCc-- Q lcl|NC_011085. 84 KDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENIAGLGSA-- 161 (343) Q Consensus 84 ~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~g~~~~-- 161 (343) +.+++++|+||+.+|++++|||+|++|++||+|+|+++|+|++||+++||+++++++++++.+++..... 
++..| T Consensus 79 --~~~~k~~itVD~ll~a~~~I~dlDe~~~~yDvRse~s~e~G~aLA~~~D~~~~~~i~~aa~~~a~~~~~~-~~~~G~~ 155 (335) T protein:vir:63 79 --VVNDKWNLTVDTLLYLRHQFDHQDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVDLED-AFSPGVL 155 (335) T ss_pred --ccccceEEEecceeechhhhhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccCC-CcCCCcc Confidence 6889999999999999999999999999999999999999999999999999999999999887665433 32222 Q ss_pred eeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCC---cEEEeCHHHHHHHhccchhhhhcccc---ccchhcc Q lcl|NC_011085. 162 SILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSAD---RTFYTTPEVYSAILAALMPNAANYAA---LIDPERG 235 (343) Q Consensus 162 ~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~g---R~~vv~P~~~~~Ll~~~~~~~~~~~~---~~~~~~G 235 (343) ..+..++.+.. ..++++++++++|.++|+|++||+++ ||++|+|++|++||+++++++.+|++ ...+.+| T Consensus 156 ~~~~~tg~~~~----~~~~~l~~a~~~a~~~L~e~dVP~~~~~dr~~vv~P~~y~~Ll~~~~l~n~~~~~s~~~~~~~~g 231 (335) T protein:vir:63 156 EKLDLTGLTAK----QAADKIVRMHRRVVETFIDRDLGDAVYSEGLTPMSPRVFSLLLEHDKLMNVEYQATGATNDYVKS 231 (335) T ss_pred eeeeeccCccc----ccHHHHHHHHHHHHHHHHhccCCCcccCceEEEeChHHHHHHhccccccccccccccccccccCc Confidence 22222222222 23678899999999999999999755 99999999999999999999999863 4568999 Q ss_pred eeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEeeeeccchh Q lcl|NC_011085. 236 SIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERARRAEYQ 315 (343) Q Consensus 236 ~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~~~~~~ 315 (343) +|++++||+|++|||||..+.++ |.+++.. 
..|++++++.++++||++|++++++++++.|.+|++++| T Consensus 232 ~v~~v~Gv~V~~sn~lP~~~~t~----------~~lg~a~-n~~~~d~~~~~~~~~~~~Al~t~~~~~vt~e~~~~~~~~ 300 (335) T protein:vir:63 232 RVAILNGVKVLETPRFATKAIAA----------HPLGRHF-NVSAEESERQIALFLPSKTLITAQVAPVQAKLWEDNEKF 300 (335) T ss_pred eeEEeeceEEEeeccCCCCCccc----------ccccccC-CccccccceeEEEEEecceEEEEEEeecccceeeccchh Confidence 99999999999999999876543 3333333 458899999999999999999999999999999999999 Q ss_pred hhhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 316 ADQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 316 ~d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) +|+|+++++|||+++||||+++|++ -| T Consensus 301 ~~~i~~~~a~G~g~lRPe~a~~i~~-tg 327 (335) T protein:vir:63 301 SWVLDTFQMYNIGARRPDTAGAIEL-KG 327 (335) T ss_pred hHHhHHHHHcCCcccccceEEEEEE-cC Confidence 9999999999999999999999998 56 No 12 >protein:vir:78739 Length: 332 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285448;genbank:gi:148724482;genbank:GeneID:5220210 Probab=100.00 E-value=6.8e-91 Score=514.90 Aligned_cols=321 Identities=25% Similarity=0.378 Sum_probs=280.7 Q ss_pred CCCCCccccccccccccccccchh-HHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEeccCcceeeeecCCCcC Q lcl|NC_011085. 
1 MADMKGGQQLGKDQGKGQSGGDKL-ALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVLGRTRAAYLQAGQSL 79 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~-al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG~~t~~~~~~g~~i 79 (343) ++||+.+++ .|+||+++++|.+ +||||+|+|||+++|+++|+++++++.|++++|+||+||++|++++++|++|+++ T Consensus 4 ~~~~~~~~~--~~~~~~~~~~d~~~al~le~~~geV~~~f~~~s~~~~~~~~r~i~~G~tv~i~~ig~~~~~~~~~g~~l 81 (332) T protein:vir:78 4 LSNFSLPNQ--ANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFTGKLSAGYHTPGTPI 81 (332) T ss_pred cccccCCcc--ccCCccccccccchhhhhhhhhhhHHHHHHHHhhhhhccccccccccceEEEEeccceeEeeecCCCCC Confidence 889998888 7999999999976 9999999999999999999999999999999999999999999999999999999 Q ss_pred CCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccccC Q lcl|NC_011085. 80 DDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENIAGLG 159 (343) Q Consensus 80 ~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~g~~ 159 (343) ++. +++++++++|+||+.+|++|.|||+|++|+++|+|+++++++|++||+++|++|+++++++++...+... .+ T Consensus 82 ~~~-~~~~~~~~~l~ID~~ky~~~~VddiD~~q~~~dl~~~~~~~~g~aLA~~~D~~i~~~l~~aa~~~~~~~~----~~ 156 (332) T protein:vir:78 82 VGD-AGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTG----EP 156 (332) T ss_pred CCC-CCCCCceEEEEEehhhhhHHHHHhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccCcccc----cc Confidence 874 4689999999999999999999999999999999999999999999999999999999998877655443 33 Q ss_pred CceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhc--cchhhhhcccc-ccchhcce Q lcl|NC_011085. 160 SASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILA--ALMPNAANYAA-LIDPERGS 236 (343) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~--~~~~~~~~~~~-~~~~~~G~ 236 (343) ++..+..+. +..+++ +++++.|++|+++|+|++||.+|||+||+|++|+.||+ ++++++.++.+ ++.+++|. 
T Consensus 157 g~~~~~~~~-~~~~~~----~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~d~~~~n~~~~~~~~~~~~g~ 231 (332) T protein:vir:78 157 GGFHVNIGA-GNTNDA----QAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGK 231 (332) T ss_pred cccccccCC-ccccCH----HHHHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHHhhcCceeeeeeccccccceecce Confidence 344443333 234444 56778888999999999999999999999999999998 78899998866 46788886 Q ss_pred -eEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEe---eeecc Q lcl|NC_011085. 237 -IRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLE---RARRA 312 (343) Q Consensus 237 -V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e---~~~~~ 312 (343) |++++||+||+|||||..+.+....... ...+..|+++|++.++++||++|+++++++++++| .+|++ T Consensus 232 ~i~~i~G~~V~~Sn~lp~~~g~~~~~~~~--------~~~~n~~~~~~~~~~~~~~h~~a~~~v~~~~~~~~~t~~~~~~ 303 (332) T protein:vir:78 232 GLYSIAGIRILKSNNLAGLYGQDLSSAAV--------TGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNV 303 (332) T ss_pred eeeEEeeeEEEecCccccCcccccccccc--------cccccccccccccceEEeecccceeeeeeeccchhhhhcccch Confidence 8999999999999999766544332211 11234689999999999999999999999988665 57899 Q ss_pred chhhhhhhhhhhhccceecccceEEEEec Q lcl|NC_011085. 313 EYQADQIIARYAMGHGGLRPEAAGALVFT 341 (343) Q Consensus 313 ~~~~d~i~~~~~~G~~v~rpe~~~~i~~~ 341 (343) ++|+|+|+++++||++++||||+++|+.. T Consensus 304 ~~~~d~i~~~~~~G~~v~rPe~~v~l~~a 332 (332) T protein:vir:78 304 QYQGDLIVGKLAMGCGSLRTSVAGSFQAA 332 (332) T ss_pred hhhHhhhhhhhhhcCceecccceEEEeeC Confidence 99999999999999999999999999999 No 13 >protein:vir:78935 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522824;genbank:gi:158345059;genbank:GeneID:5687425 Probab=100.00 E-value=4.3e-90 Score=510.50 Aligned_cols=322 Identities=17% Similarity=0.147 Sum_probs=281.9 Q ss_pred CCccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEeccCcceeeeecCCCcCCCcc Q lcl|NC_011085. 
4 MKGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVLGRTRAAYLQAGQSLDDKR 83 (343) Q Consensus 4 ~~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG~~t~~~~~~g~~i~~~~ 83 (343) |++.++ .|||||+++++|. +||||+|+|||+++|+++|+|++++++|+|++|||+|||++|+.++++++||+++++++ T Consensus 1 ms~~~~-~t~~~~~~s~~d~-al~le~f~geV~~af~~~s~~~~~~~~rti~~g~s~~~~~iG~~~~~~~~pG~~l~~~~ 78 (335) T protein:vir:78 1 MSFLND-LTRPNYAGKNADV-DIHLEEHLGIVDKHFAYTSKFAPLMNIRDLRGSNVVRLDRLGNVEAKGRRAGEELERSR 78 (335) T ss_pred CCcccc-ccccccccccchh-hhhhhhhhhHHHHHHHHhhhhccccceeeeccceeEEEeeeeeeeecccccCcccCCCC Confidence 666666 5999999998885 89999999999999999999999999999999999999999999999999999999874 Q ss_pred CCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccccCCcee Q lcl|NC_011085. 84 KDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENIAGLGSASI 163 (343) Q Consensus 84 ~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~g~~~~~~ 163 (343) +++++++|+||+.+|++++|||+|++|++||+|+++++|+|++||+++||+++++++++++.+.+.+...+-++++.. T Consensus 79 --~~~~k~~itID~ll~a~~~VddlDe~~~~yDvR~e~s~~~G~aLA~~~Dq~~~~~l~~aa~~~a~~~~~~~~~~G~~~ 156 (335) T protein:vir:78 79 --VVNDKWNLTVDTLLYLRHQFDHQDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVDLEDAFSPGVLE 156 (335) T ss_pred --cccCCeEEEecceeechhhHhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccCCCcCCCcce Confidence 788999999999999999999999999999999999999999999999999999999999988776644332222222 Q ss_pred ecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcC---CcEEEeCHHHHHHHhccchhhhhcccc---ccchhccee Q lcl|NC_011085. 164 LEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSA---DRTFYTTPEVYSAILAALMPNAANYAA---LIDPERGSI 237 (343) Q Consensus 164 ~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~---gR~~vv~P~~~~~Ll~~~~~~~~~~~~---~~~~~~G~V 237 (343) ....+.. +....++.++++++++.+.|+|++||+. 
|||++|+|++|++|++++++++.+|++ ...+.+|+| T Consensus 157 ~~~~tg~---~~~~~~~~l~~a~~~a~~~l~ekdvP~~~~~~rv~vv~P~~y~~Ll~~~~l~n~~~~~s~~~~~~~~g~v 233 (335) T protein:vir:78 157 KLDLTGL---TAKEAAEKIVRMHRRVVETFIERDLGDAVYSEGLTPMSPRVFSLLLEHDKLMSVEYQATGATNDYVKSRV 233 (335) T ss_pred eeeeccc---cccccHHHHHHHHHHHHHHHHhccCCCCCCCccEEEeChHHHHHHhccccccccccccccccccccccee Confidence 2211111 2223467889999999999999999975 699999999999999999999999863 456899999 Q ss_pred EEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEeeeeccchhhh Q lcl|NC_011085. 238 RNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERARRAEYQAD 317 (343) Q Consensus 238 ~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~~~~~~~d 317 (343) ++++||+|++|||||..+.+ +|.+++.+ ..|+.++++.++++||++|++++++++++.|.+|++++|+| T Consensus 234 ~~v~Gv~V~~Sn~lP~~~~t----------~~~lg~a~-n~~~~d~~~~~~~~~~~~Al~t~~~~~~~~e~~~~~~~~~~ 302 (335) T protein:vir:78 234 AILNGVKVLETPRFATKAIS----------AHPLGRHF-NVSAEEAERQIALFLPSKTLITAQVAPVQAKLWEDHDQFSW 302 (335) T ss_pred EEeeceEEEeeccCCCCCCc----------cccccccC-CcccccccceEEEEEecceEEEEEEEecccceeeccchhhH Confidence 99999999999999987644 34444443 45778999999999999999999999999999999999999 Q ss_pred hhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 318 QIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 318 ~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) +|+++++|||+++||||+|+|+++-= T Consensus 303 ~i~~~~a~G~g~lRPe~a~~i~~tg~ 328 (335) T protein:vir:78 303 VLDTFQMYNIGARRPDTAGAIELKGI 328 (335) T ss_pred hhhHHHHcCCcccCcceEEEEEecCC Confidence 99999999999999999999997632 No 14 >protein:vir:97031 Length: 402 # NCBI annotation: 31 # Family: family:all:2806 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654132;genbank:gi:108862016;genbank:GeneID:5075980 Probab=100.00 E-value=4.4e-90 Score=510.47 Aligned_cols=326 Identities=14% Similarity=0.119 Sum_probs=282.0 Q ss_pred CCccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEeccCcceeeeecCCCcCCCcc Q lcl|NC_011085. 
4 MKGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVLGRTRAAYLQAGQSLDDKR 83 (343) Q Consensus 4 ~~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG~~t~~~~~~g~~i~~~~ 83 (343) |+.+|. .|||||+++ +|+++||||+|+|||+++|+++|++++++++|+|++|||++||++|++++++|+||++++++ T Consensus 1 Ms~~n~-~t~~~~~~s-~~~~al~le~f~geV~taF~~~si~~~~~~vrti~~GkS~qf~~iG~~~a~y~~~G~~ldg~- 77 (402) T protein:vir:97 1 MSTPNT-LTNVAVSAS-GEVDSLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYLGETELQVLAPGQSPNAT- 77 (402) T ss_pred CCCccc-ccccccccc-cchhhhhhhhhhhhHHHHHHHHHhhcCcceeeeecccceEEEEEEeeeEEeeeccccccCCC- Confidence 777766 599999854 47799999999999999999999999999999999999999999999999999999999875 Q ss_pred CCCccceEEEEeeeeeeeeeeccchHHHHhchh-hHHHHHHHHHHHHHHHHHHHHHHHHHhhhhc-cccccccccccCCc Q lcl|NC_011085. 84 KDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYD-VRSEYTSQIGESLAMAADGAVLAELAGLCNM-PAASNENIAGLGSA 161 (343) Q Consensus 84 ~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d-~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~-~~~~~~~~~g~~~~ 161 (343) ++.+++++|+||+.+|++++|+|+|++|++|| +|+++++|+|++||+++||+|++++..++.. +.++...+.+.+.+ T Consensus 78 -~~~~~k~~ItID~lL~a~~~V~diDeaq~~yD~vRse~s~e~G~ALA~~~Dq~ii~~i~~aa~a~t~~~~~~~~~~~~g 156 (402) T protein:vir:97 78 -PTQADKNQLVIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKGHG 156 (402) T ss_pred -CcccccEEEEeCceeechhhhhhHHHHHhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccCcccccc Confidence 57899999999999999999999999999999 8999999999999999999998887665542 34455555555544 Q ss_pred eeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhccc--cccchhcceeEE Q lcl|NC_011085. 162 SILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYA--ALIDPERGSIRN 239 (343) Q Consensus 162 ~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~--~~~~~~~G~V~~ 239 (343) ..+....+.. 
.....+.+++++|++|.++|+|++||.+|||++|+|++|++||+++++++++|+ +.+.+.+|+|++ T Consensus 157 ~s~~~~~t~~--~a~~~~~~l~~ai~~a~~~LdEkdVP~~dRv~vv~P~~y~~Ll~~~rl~n~d~~~~~~g~~~~G~v~~ 234 (402) T protein:vir:97 157 FSINVNVTES--EALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATINGFVLS 234 (402) T ss_pred cccccccccc--hhhcCHHHHHHHHHHHHHHHHhcCCCccccEEEeChHHHHHHhhcccccchhhccccCCccccceeEE Confidence 4444332211 113346788999999999999999999999999999999999999999999984 567799999999 Q ss_pred EeceEEEEecccccccccccccccccccccccccccccc---ccccccceEeEeechhhheeeeeeeeEEeeeeccchhh Q lcl|NC_011085. 240 VMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGD---TKVALDNVVGLFQHRSAVGTVKLKDLSLERARRAEYQA 316 (343) Q Consensus 240 i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~~~~~~~ 316 (343) ++||+||+|||||+.+. ....|...+..+++ ++++++++++++|||+|++++++++++.|.+||+++|+ T Consensus 235 v~Gv~Vv~SnnlP~~a~--------~it~~~ls~a~~G~~y~~t~d~t~~~~~~f~~~Av~tvk~~~vT~~~~~d~r~~~ 306 (402) T protein:vir:97 235 SYNCPVIPSNRFPTFAQ--------DQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKEKT 306 (402) T ss_pred EeceEEEecCccccccc--------cccccccccCCCCccCCcCcccceeEEEEEecceEEEEEeeccccchhhchhHHH Confidence 99999999999997531 12234444444443 66999999999999999999999999999999999999 Q ss_pred hhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 
317 DQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 317 d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) |+|+++++||++++||||++++++..| T Consensus 307 ~~id~~~a~G~g~~RPeaa~vv~~~~~ 333 (402) T protein:vir:97 307 YYIDTFMAEGAIPDRWEAVSVVTTKRD 333 (402) T ss_pred HHHHHHHHhCCcccCccceEEEEEecc Confidence 999999999999999999999999986 No 15 >protein:vir:7019 Length: 401 # NCBI annotation: major capsid protein # Family: family:all:2806 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853592;genbank:gi:31711674;genbank:GeneID:1481800 Probab=100.00 E-value=1.5e-86 Score=491.16 Aligned_cols=325 Identities=14% Similarity=0.123 Sum_probs=280.8 Q ss_pred CCccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEeccCcceeeeecCCCcCCCcc Q lcl|NC_011085. 4 MKGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVLGRTRAAYLQAGQSLDDKR 83 (343) Q Consensus 4 ~~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG~~t~~~~~~g~~i~~~~ 83 (343) |++.++ .|||||++++ +.++||||+|+|||+++|+++|++++++++|+|++|||++||++|+.++++|+||++++++ T Consensus 1 Ms~~n~-~t~~~~~~sg-~~~al~Le~f~GeV~taF~~~si~~~~~~vRti~~gkS~qf~~~G~s~~~~~~pG~~ld~~- 77 (401) T protein:vir:70 1 MSTPNN-LTNVAVSASG-EVDSLLIEKFNGKVNEQYLKGENIMSYFDVQTVTGTNTVSNKYLGETELQVLAPGQSPAAT- 77 (401) T ss_pred CCCCcc-cccccccccc-chhHhHHhHhcchHHHHHHHHhhhcccceeeeecccceEEEEEeeeeEeeeecCCCCcCCC- Confidence 666665 5999998555 7799999999999999999999999999999999999999999999999999999999875 Q ss_pred CCCccceEEEEeeeeeeeeeeccchHHHHhchh-hHHHHHHHHHHHHHHHHHHHHHHHHHhhhhc-cccccccccccCCc Q lcl|NC_011085. 84 KDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYD-VRSEYTSQIGESLAMAADGAVLAELAGLCNM-PAASNENIAGLGSA 161 (343) Q Consensus 84 ~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d-~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~-~~~~~~~~~g~~~~ 161 (343) ++.++|++|+||+.+|++++|+|+|++|++|| +|+++++++|++||+++||+|++.+..++.. 
+.++...+.+.++| T Consensus 78 -~~~~dK~~ItID~lL~a~~~V~dlDe~q~~yD~vRse~s~e~G~ALA~~~Dq~iiq~i~~aa~ana~~~~~~p~~~~~G 156 (401) T protein:vir:70 78 -STQADKNQLVIDATVIARNTVAHLHDVQGDIDSLKPKLATNQAKQLKRMEDEMLIQQMMLGGIANTQAKRTNPRVKGHG 156 (401) T ss_pred -CcccccEEEEeCceeehhhhhhhHHHHHhcccccchHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccCCCcCCCc Confidence 57899999999999999999999999999999 8999999999999999999998887655532 45667778888888 Q ss_pred eeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEe-CHHHHHHHhccchhhhhccc--cccchhcceeE Q lcl|NC_011085. 162 SILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYT-TPEVYSAILAALMPNAANYA--ALIDPERGSIR 238 (343) Q Consensus 162 ~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv-~P~~~~~Ll~~~~~~~~~~~--~~~~~~~G~V~ 238 (343) ..+..+...+.. ....++++++|++|...|+|++||.. |+++| +|.+|++|+..+++++++|+ +.+.+.+|+|. T Consensus 157 ~~i~v~~~~~~~--~~~~~~l~~ai~dA~~~LdEkdVP~~-r~vvl~pp~~Ys~Ll~~d~L~nrd~~~s~~g~~~~G~v~ 233 (401) T protein:vir:70 157 FSINVEVAEGEA--LVNPQYVMAAVEFALEQQLEQEVDIS-DVAILMPWRYFNVLRDADRIVDKTYTISQSGATIQGFTL 233 (401) T ss_pred eEEecccccccc--ccCHHHHHHHHHHHHHHHHhcCCCcc-ceEEEcCHHHHHHHHhcCcccchhhccccCCccccceEE Confidence 888876544332 22346688899999999999999965 55555 77778899999999999986 45779999999 Q ss_pred EEeceEEEEeccccccccccccccccccccccccccccc---cccccccceEeEeechhhheeeeeeeeEEeeeeccchh Q lcl|NC_011085. 
239 NVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEG---DTKVALDNVVGLFQHRSAVGTVKLKDLSLERARRAEYQ 315 (343) Q Consensus 239 ~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~~~~~~ 315 (343) +++||+||+|||+|+++.+ ..+|..++..++ +++++++++++++|||+|+++++.++++.|.+||+++| T Consensus 234 ~vaGv~Vv~SnnlP~~a~~--------it~~~ls~a~~G~~y~~~~d~s~~~~v~f~~~Av~tvk~~~lt~~~~~d~r~~ 305 (401) T protein:vir:70 234 SSYNCPVIPSNRFPKYSQG--------QTHHLLSNEDNGYRYDPLPAMNGAIAVLFTADALLVGRSIDVTGDIFYEKKEK 305 (401) T ss_pred EEeceEEEeeccccccccc--------cccccccccCCCccCCCCccccceeEEEEehhheEEEEeeccccchhhhhhhh Confidence 9999999999999975421 123444444443 36699999999999999999999999999999999999 Q ss_pred hhhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 316 ADQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 316 ~d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) +|+|+++++||++++||||++++++..+ T Consensus 306 ~~~id~~~a~g~g~~RPeaa~vv~~k~~ 333 (401) T protein:vir:70 306 TYYIDTFMAEGAIPDRWEAVSVVTTKRN 333 (401) T ss_pred HHHHHHHHHhCCcccchhheEEEeecCc Confidence 9999999999999999999999887776 No 16 >protein:vir:105645 Length: 400 # NCBI annotation: putative major capsid protein # Family: family:all:2806 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425009;genbank:gi:83571757;uniprot:Q2WC43;genbank:GeneID:3837286 Probab=100.00 E-value=4.2e-86 Score=488.65 Aligned_cols=324 Identities=14% Similarity=0.127 Sum_probs=275.8 Q ss_pred CCccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEeccCcceeeeecCCCcCCCcc Q lcl|NC_011085. 
4 MKGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVLGRTRAAYLQAGQSLDDKR 83 (343) Q Consensus 4 ~~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG~~t~~~~~~g~~i~~~~ 83 (343) |++.++ .|||||+++ +|.++||||+|+|||+++|+++|++++++++|+|++|||++||++|++++++++||++|+++ T Consensus 1 Ms~~n~-~t~p~~~gs-g~~~aL~Le~f~GeV~taF~~~si~~~~~~vRtI~~gkS~qf~~lG~s~a~y~~pG~~ldg~- 77 (400) T protein:vir:10 1 MSTPNN-LTNVAVSAS-GEVDSLLIEKFNGKVNEQYLKGENIMSYFDVQTVTGTNTVSNKYLGETELQVLAPGQSPAAT- 77 (400) T ss_pred CCCCcc-ccccccccc-cchhhhHHhHhcchHHHHHHHHhhhcccceeeeecccceEEEEEeeeeEEeeecCCCCcCCC- Confidence 666666 599999854 48899999999999999999999999999999999999999999999999999999999876 Q ss_pred CCCccceEEEEeeeeeeeeeeccchHHHHhchh-hHHHHHHHHHHHHHHHHHHHHHHHHHhhhhc-ccccccccccc--C Q lcl|NC_011085. 84 KDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYD-VRSEYTSQIGESLAMAADGAVLAELAGLCNM-PAASNENIAGL--G 159 (343) Q Consensus 84 ~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d-~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~-~~~~~~~~~g~--~ 159 (343) ++.+++++|+||+.+|++++|+|+|++|++|| +|+|+++|+|++||+++||++|+++..++.. +..+...+.|. + T Consensus 78 -~~~~dk~~ItIDtLL~a~~~V~dlDd~q~~yD~vRse~s~e~G~ALA~~~Dq~iiq~i~~a~~a~t~~~~~~~~g~~~g 156 (400) T protein:vir:10 78 -STQADKNQLVIDATVIARNTVAHLHDVQGDIDSLKPKLATNQAKQLKKMEDEMLIQQMLLGGIANTQAKRTNPRVKGHG 156 (400) T ss_pred -CcccCcEEEEeCceeeecchhhhHHHHhhccccccHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCCccccc Confidence 57899999999999999999999999999999 9999999999999999999999887666421 22233333333 2 Q ss_pred CceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhccc--cccchhccee Q lcl|NC_011085. 
160 SASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYA--ALIDPERGSI 237 (343) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~--~~~~~~~G~V 237 (343) .+..+........++ +++++++|++|.+.|+|++||.++++++++|++|++|+.++++++++|+ +++++.+|+| T Consensus 157 ~s~~v~~~~~~~~~~----~~~l~~A~~~A~~~LdEkdVP~~d~vvl~pp~~Ys~Ll~~dkLvnrdf~~s~~g~~~~g~v 232 (400) T protein:vir:10 157 FSVNVEVNEGEALVN----PQYVMAAVEFALEQQLEQEVDISDVAILMPWRYFNVLRDADRIVDKSYTISQSGATIQGFV 232 (400) T ss_pred cceeecccccccccC----HHHHHHHHHHHHHHHHhcCCCccceEEEcCHHHHHHHHhCCcccchhccccCCCccccceE Confidence 233332222222333 4678889999999999999997766667788888899999999999986 4577999999 Q ss_pred EEEeceEEEEecccccccccccccccccccccccccccccc---ccccccceEeEeechhhheeeeeeeeEEeeeeccch Q lcl|NC_011085. 238 RNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGD---TKVALDNVVGLFQHRSAVGTVKLKDLSLERARRAEY 314 (343) Q Consensus 238 ~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~~~~~ 314 (343) .+++|++||+|||+|+.+. ....|.+++...++ ++++++++++++|||+|++++++++++.|.+||+++ T Consensus 233 ~~v~Gv~Iv~Sn~lP~~a~--------~~~~~~lS~a~~G~~y~~t~d~s~~~av~F~~sAv~tvk~~~lt~~~~~d~r~ 304 (400) T protein:vir:10 233 LSSYNCPVIPSNRFPKYSQ--------GQKHHLLSNEDNGYRYDPIAEMNGAIAVLFTADALLVGRSIDVIGDIFYEKKE 304 (400) T ss_pred EEEeceEEEeeCcCCcccC--------cccccccccCCCCccCCccccccceeEEEEehhheEEEEeeccccccccchhh Confidence 9999999999999997532 23345555555554 669999999999999999999999999999999999 Q ss_pred hhhhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 
315 QADQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 315 ~~d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) |+|+|+++++||++++||||++++++..+ T Consensus 305 ~~~~id~~~a~G~g~~RPeaa~vv~~~~~ 333 (400) T protein:vir:10 305 KTYYIDTFMSEGAIPDRWEAVSVVTTKRQ 333 (400) T ss_pred HHHHHHHHHHhCCcccchhheEEEEecCC Confidence 99999999999999999999999999999 No 17 >protein:vir:99675 Length: 324 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249589;genbank:gi:68299740;genbank:GeneID:3799990 Probab=100.00 E-value=1e-83 Score=475.56 Aligned_cols=292 Identities=63% Similarity=0.935 Sum_probs=260.4 Q ss_pred cccccccceEEEEeccCcceeeeecCCCcCCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHH Q lcl|NC_011085. 50 IMRSISSGKSAQFPVLGRTRAAYLQAGQSLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESL 129 (343) Q Consensus 50 ~~~~i~~G~tv~i~~iG~~t~~~~~~g~~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aL 129 (343) .+|+|++|||++||++|++++++|+||++|+++++++++++++|+||+++|++|.|||+|++|++||+|+++++|+|++| T Consensus 1 ~vr~i~~g~s~~~~~iG~~~~~~~~~G~~l~~~~~~~~~~e~~itID~~l~~~~~VdDiD~~qa~~Dlr~e~s~~~G~aL 80 (324) T protein:vir:99 1 MTRTITSGKSAQFPVMGRTKARYLKQGQSLDDGREDIKHTEKVITIDGLLTTDVLIYDIEDAMNHYDVRSEYSTQMGEAL 80 (324) T ss_pred CeeeeecCceEEEeeeeeeEeccccCCCCcCCCcCCcCcccEEEEecchhhhhhhhhhHHHHhcCccchhHHHHHHHHHH Confidence 77899999999999999999999999999999888899999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHhhhhccccccccccccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCH Q lcl|NC_011085. 
130 AMAADGAVLAELAGLCNMPAASNENIAGLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTP 209 (343) Q Consensus 130 a~~~D~~i~~~~~~~a~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P 209 (343) |+.+||+|++++++.++...+....+.+..++..+...+.+ ..+++..++++++.|++|+++|||++||++|||+||+| T Consensus 81 A~~~Dq~i~~~~a~~~~~~a~~~~~~~~~~g~~~~~~~~~~-~~~~~~~~~~~~dai~~a~~~Lde~~VP~~gR~~vv~P 159 (324) T protein:vir:99 81 AMAADVANYAEMAKLVNSRKETTNENIEGLGAASLVKITGK-KEDPAKYGTQVIQALTYARAAFAKKYIPAGDRTFYTDP 159 (324) T ss_pred HHHHHHHHHHHHHHhhhcccccccCCcccCCccceeccccc-ccccccCHHHHHHHHHHHHHHHhhcCCCCCCCEEEeCh Confidence 99999999999998888776665544443334333333322 33555667889999999999999999999999999999 Q ss_pred HHHHHHhccchhhhhccccccchhcceeEEEeceEEEEeccccccccccccccccccccccccccccc----cccccccc Q lcl|NC_011085. 210 EVYSAILAALMPNAANYAALIDPERGSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEG----DTKVALDN 285 (343) Q Consensus 210 ~~~~~Ll~~~~~~~~~~~~~~~~~~G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~ 285 (343) ++|++||++.++++.+|++++.+++|.|++++||+||+|||+|....+.. .......+|.+++..+. +|++++++ T Consensus 160 ~~y~~Ll~~~~~~~~~~~~~~~~~~G~V~~i~Gf~V~~Sn~lp~~~~t~~-~~a~~~~~~~~~~~~~~~~~~ky~~d~~~ 238 (324) T protein:vir:99 160 DTYSAILAALMPNAANYAALIDPETGNIRNVMGFEVVETPHMTAQMVTNP-TDAFDGTGHIFPATGDSTTTGKMTVGADN 238 (324) T ss_pred HHHHHHhhcccccccccccccceecceEEEEeceEEEecCCccccccccc-cccccccccccccccccccccccccccCc Confidence 99999998888889999999999999999999999999999998765543 34556667777776554 59999999 Q ss_pred eEeEeechhhheeeeeeeeEEeeeeccchhhhhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 
286 VVGLFQHRSAVGTVKLKDLSLERARRAEYQADQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 286 ~~~l~~~~~Av~~~~~~~~~~e~~~~~~~~~d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) .+||+||++|+++++++++++|.+|++++|+|+|+++|+|||+++||||++++++++| T Consensus 239 ~~gl~~~~~a~~tv~~~~~~~e~~~~~~~~~d~i~~~~a~G~~~lRPe~a~~v~l~~~ 296 (324) T protein:vir:99 239 VVGLFVHRSAVATLKLKDMALERARRPEYQADQIIAKYAMGHGGLRPEAVGAIIFEDG 296 (324) T ss_pred eeEEEEehhheEEEeeecceecceechhhHHHhhhhhhhhcCcccccceEEEEEEccC Confidence 9999999999999999999999999999999999999999999999999999999999 No 18 >protein:vir:94622 Length: 341 # NCBI annotation: PfWMP4_37 # Family: family:all:2203 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762667;genbank:gi:115304375;genbank:GeneID:5142322 Probab=100.00 E-value=2.3e-71 Score=407.88 Aligned_cols=319 Identities=16% Similarity=0.141 Sum_probs=258.9 Q ss_pred CCCCCccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccc--cccceEEEEeccCcceeeeecCCCc Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRS--ISSGKSAQFPVLGRTRAAYLQAGQS 78 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~--i~~G~tv~i~~iG~~t~~~~~~g~~ 78 (343) |+|+-+|+++ .+..+..+..|+|+++|++.|+++++++++++.++ +++|+|||||++|++++++|++|.+ T Consensus 3 ~~~~~~~~~~--------~t~~v~~fipei~s~~i~~~l~~~~v~~~~~~d~~~~~~~Gdtv~ip~~g~~~~~d~~~~~~ 74 (341) T protein:vir:94 3 LGNTITGPSI--------NTQRGQQFIPEQWLSEVQMFRKAKMLDTSVVKTWGAQVKKGDTFHVPRISELGVEDKATDVP 74 (341) T ss_pred chhhhccccc--------cchhHHHHHHHHHHHHHHHHHHhhcchhhccccccccccCCceEEEeccCcceeeeecCCCc Confidence 6666666555 34444443349999999999999999999998764 5679999999999999999999998 Q ss_pred CCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccccc Q lcl|NC_011085. 79 LDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENIAGL 158 (343) Q Consensus 79 i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~g~ 158 (343) ++. +++++++++|+||+++|+++.|+|+|+.|+++|+|++++++++++||+++|+.|+..++.++..+.+. 
T Consensus 75 i~~--~~~~~~~~~itiD~~~~~~~~i~d~d~~~~~~d~~~~~~~~~~~aLA~~~D~~i~~~~a~~~~~~~~~------- 145 (341) T protein:vir:94 75 VGV--QPVNDTDFVITVDTDRTTAVALDDLLEIQASYDLRAPYLEAMGYALAKDMTGSILGLRAAVQNTASQN------- 145 (341) T ss_pred ccc--ccccCceEEEEEeeeeecceeechHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCc------- Confidence 876 46889999999999999999999999999999999999999999999999999998776554322111 Q ss_pred CCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhccccccchhcceeE Q lcl|NC_011085. 159 GSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAALIDPERGSIR 238 (343) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~V~ 238 (343) .+.........++ ....++.|++|+++||+++||.+|||+||+|++|+.||++++|++.++.++..+++|.|+ T Consensus 146 ----~~~~~~~~~t~~~---~~~~~~~i~~a~~~Lde~~VP~~gR~lvv~P~~~~~Ll~~~~~~~~~~~g~~~l~~G~ig 218 (341) T protein:vir:94 146 ----VFSSSNGAITGNG---QAFSFAVFLAARRLLLEADVPEEKIVLLISPGQESALFTIPQFISKDFINNAPIAQGQIG 218 (341) T ss_pred ----cccCccccccCch---hhhhHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHhhchhhhhhhccccchhheeeee Confidence 0100011111111 123467788999999999999999999999999999999999999999998889999999 Q ss_pred EEeceEEEEecccccccccccccccccccccccc-----ccccccccccccceEeEeechhhheeeeeee---------- Q lcl|NC_011085. 239 NVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFP-----KTAEGDTKVALDNVVGLFQHRSAVGTVKLKD---------- 303 (343) Q Consensus 239 ~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~---------- 303 (343) +++||+||+||++|..+.+...........+... 
....+.++++++..+||++|++|++.++..+ T Consensus 219 ~i~G~~V~~Sn~lp~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~gl~~~~~av~~~k~~~~~~~~~~~~~ 298 (341) T protein:vir:94 219 SLMGVRVIRTSLIGNNSATGWRNGAPTIAPAEATPGFTGSRYLPKQDSFTSLPATFTGNSRPVHTAVMCHMDWAAAVVSK 298 (341) T ss_pred eEeceEEEEeccccccccccccccccceecccccccccccccccccccccccEEEEEEecccccceeeecchhhhccccc Confidence 9999999999999987766544443333333222 2233468899999999999999999998555 Q ss_pred -eEEeeeeccchhhhhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 304 -LSLERARRAEYQADQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 304 -~~~e~~~~~~~~~d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) +.+|..|++.+|+|+|+|+++|||+++||||++.|++.+= T Consensus 299 ~~~~~~~~~~~~~~~~i~~~~~~G~~~lrp~~~v~~~~~~~ 339 (341) T protein:vir:94 299 APRVTQSFENREQVWLMVGRQAYGARLYRPLHAVNIHTTGD 339 (341) T ss_pred cccccccchhhhhhhhhhhhhhhcccccCcceeEEEecCcC Confidence 6678889999999999999999999999999988877666 No 19 >protein:vir:80180 Length: 381 # NCBI annotation: capsid protein # Family: family:all:2203 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285797;genbank:gi:148747831;genbank:GeneID:5220456 Probab=100.00 E-value=1.2e-65 Score=376.56 Aligned_cols=331 Identities=17% Similarity=0.198 Sum_probs=260.5 Q ss_pred CCCCCccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccc--cccceEEEEeccCcceeeeecCCCc Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRS--ISSGKSAQFPVLGRTRAAYLQAGQS 78 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~--i~~G~tv~i~~iG~~t~~~~~~g~~ 78 (343) ||++|+. 
+...|++.+..+..++..|+|+++|++.|++.+++.++++.++ .+.|+|||||++|++++.+|++|.+ T Consensus 1 ~~~~~~~---~~~~~~~~~~t~~~~fiPev~s~~v~~~l~~~lv~~~l~~~~~~~~~~GdTV~ip~~g~~~a~d~~~g~~ 77 (381) T protein:vir:80 1 MATIQGT---GGYKGSAVDLSNVQVFIPEVWSSEVRMFRDQKFAALEATKKIPFEGKKGDLIHIPNISRAAVYDKQPQTP 77 (381) T ss_pred Cceeccc---ccccCcccchhhHHhhhhHHHHHHHHHHHHHhhhhhhccccccceeecCceEEeeccCcceeeeecCCCc Confidence 9999944 4677888888888776669999999999999999999887654 4689999999999999999999998 Q ss_pred CCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccccc Q lcl|NC_011085. 79 LDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENIAGL 158 (343) Q Consensus 79 i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~g~ 158 (343) ++.+ ++++++++++||+.+|+++.|+|+|+.|+++|+|++++++++++||+++|+.|+..+.+......+... T Consensus 78 i~~~--~~~~~~~~itID~~~~~~~~Idd~D~~~~~~D~~~~~~~~~~~aLA~~~D~~i~~~~~~~~~~~~~~~~----- 150 (381) T protein:vir:80 78 VNLQ--ARTDSEFTFTVTKYKESSFMIEDIVNTQASYTLRQYYTKEAGYALARDMDNFALAHRAVINAFPSQRIY----- 150 (381) T ss_pred cccc--ccCCceEEEEEeeeeecceeechHHHHhhccChHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccc----- Confidence 8764 678899999999999999999999999999999999999999999999999999887665543322111 Q ss_pred CCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhccccccchhcceeE Q lcl|NC_011085. 159 GSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAALIDPERGSIR 238 (343) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~V~ 238 (343) .....+..+...... 
.....+.+++.|++|+++||+++||.+|||+||+|++|+.||+++++++++|.++..+++|.|+ T Consensus 151 t~~~~i~~~~~~~~~-t~~~~~~t~~~i~~a~~~Lde~~VP~egR~lvv~P~~~~~Ll~~~~~~~ad~~~~~~l~~G~Ig 229 (381) T protein:vir:80 151 SYDTTLGDGTVNAHL-TGTPAPLTYAALLLAKQKLDEADVPQEGRIVMVSPAQYIDLLSINQFISVDFSQVKPVTSGVVG 229 (381) T ss_pred ccccccccccccccc-ccchhhHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhhchhhhhhhhccchhhhceeee Confidence 111111111111110 1112345678889999999999999999999999999999999999999999988899999999 Q ss_pred EEeceEEEEeccccccccccccccccccccccccccccccccccc----------------------------------- Q lcl|NC_011085. 239 NVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVAL----------------------------------- 283 (343) Q Consensus 239 ~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------------------------------- 283 (343) +++||+||+||+||..+.+.......... +..+......|.+++ T Consensus 230 ~i~G~~Vv~Sn~lp~~~~t~~~~~agap~-~~~~~~~~~~~~g~~s~~a~av~~~k~yd~~~~~~~~~~~~~~g~~~~~~ 308 (381) T protein:vir:80 230 TILGMEVIVTTQIGINSLTGYVNGQGAPT-QPTPGVLGSPYLPDQAGTANVVNTGSASDLAVSLSYFGLPVFSGAGATAA 308 (381) T ss_pred EEcceEEEeecccccccccceeeeccccc-cccccccccccccccccceeeeeeeeeeceeeeeeeccceeeecceeeec Confidence 99999999999999865543322211110 100110111122222 Q ss_pred -------------cceEeEeechhhheeeeeeeeEEeeeeccchhhhhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 
284 -------------DNVVGLFQHRSAVGTVKLKDLSLERARRAEYQADQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 284 -------------~~~~~l~~~~~Av~~~~~~~~~~e~~~~~~~~~d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) .-..++++|+++.+.+.++.++.+..+...|++|.|.|+++||++++||++++.|.++-= T Consensus 309 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 381 (381) T protein:vir:80 309 DGGQTLGSFGGANRWATAVVCHPDWLAVGVQQNVKSESSRETMYLADAFVTSCVYGAKVFRPDHCVLLHTSGI 381 (381) T ss_pred CCCceeeeehhhhhhhhhcccccccccccceeEeecccchhheeehhhhhhhhhhccccccchhhhhhhhcCC Confidence 223567889998888888888998999999999999999999999999999999986533 No 20 >protein:vir:3136 Length: 322 # NCBI annotation: hypothetical protein # Family: family:all:11728 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640318;genbank:gi:21234405;genbank:GeneID:956058 Probab=100.00 E-value=2.1e-59 Score=342.30 Aligned_cols=300 Identities=12% Similarity=0.068 Sum_probs=224.1 Q ss_pred CCccccccccccccccccchhHHHH-HHHHHHHHHHHHHhhhhccCccccccccceEEEEeccCcceeeeecCCCcCCCc Q lcl|NC_011085. 4 MKGGQQLGKDQGKGQSGGDKLALFL-KVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVLGRTRAAYLQAGQSLDDK 82 (343) Q Consensus 4 ~~~~~~~~t~~g~~~~~~d~~al~i-e~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG~~t~~~~~~g~~i~~~ 82 (343) |++|+|+ ++..++|. |+|+.+++..+++..+...+.+......|+|||||+||++++++|++++++.. T Consensus 1 ~~~~n~t----------s~~qafi~~EiWsa~il~~l~~~Lv~~~~~~~~d~g~GDtV~InsIg~~tV~dY~~~~~i~~- 69 (322) T protein:vir:31 1 MSTGNNT----------SNTQALIVSEIWADEIEDILHEKLLDVNIARVVDFPDGDKLTIPSVGTPVVRSRPEQGDFTF- 69 (322) T ss_pred CCCCCCc----------ccceEEeehhhhHHHHHHHhhhhhhhhhhhcccccCCCCeEEeccccccccccccCCCCccc- Confidence 6667653 22345663 99999999999999999998887777789999999999999999999999866 Q ss_pred cCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccc-ccc-ccccccCC Q lcl|NC_011085. 
83 RKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPA-ASN-ENIAGLGS 160 (343) Q Consensus 83 ~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~-~~~-~~~~g~~~ 160 (343) +++++++.+|+|||.|||+|.||| |++|+++|+++.++++++|+||+.+|+++...+..++.... ... ..+.+.+ T Consensus 70 -d~ltt~~~~l~IDq~KYfaf~VdD-D~~Qa~~dl~~~~~~~aa~ala~~~D~fva~lL~~gA~~~~~~~~p~vin~~~- 146 (322) T protein:vir:31 70 -DNLDTGEISIILRDEVYAGNAISK-KLRQDSRWISNVGAMLPAEQARAIMERYQTDLLALGNAQFAGQNDPNVINGVP- 146 (322) T ss_pred -ccCCCceEEEEEehhhhhccccch-hHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccCCcceecCCc- Confidence 468999999999999999999999 99999999999999999999999999999887776653221 111 1112221 Q ss_pred ceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHH---------Hhccchhhhhccccccc Q lcl|NC_011085. 161 ASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSA---------ILAALMPNAANYAALID 231 (343) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~---------Ll~~~~~~~~~~~~~~~ 231 (343) ..+++ ..++|.. .|+.|++++.+|||+|||.+|||+||+|+++.. |+++++|+..+-.|. T Consensus 147 ~~iv~-----~gt~~~~----ay~~lv~l~~kLdkanVP~~gR~vVV~P~~~~~L~~i~~~~~l~~D~rf~~i~~sG~-- 215 (322) T protein:vir:31 147 HRFVG-----TGTDQTM----DVTDFSRVNYVMTQSKMPMGGMIGIIDPSVAHHLETITNISNISNNPRWEGIVESGI-- 215 (322) T ss_pred cceec-----cCCCchh----hHHHHHHHHHHhccccCCCCCeEEEeCchhhhhhhhhhhhhhhhccccccccccccc-- Confidence 11221 1223322 366778889999999999999999999999764 577888876544433 Q ss_pred hhcc--eeEEEeceEEEEeccccccccc--ccccccccc--ccccccccccccccccccceEeEeechhhheeeeeeeeE Q lcl|NC_011085. 232 PERG--SIRNVMGFEVVEVPHLTAGGAG--DDREDETTN--QKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLS 305 (343) Q Consensus 232 ~~~G--~V~~i~Gf~V~~sn~lp~~~~~--~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~ 305 (343) .+| .||+++||+||+||++|..+-+ ++....++. .-..|.+.. 
-+.++ ...++..+.++ T Consensus 216 -a~g~~~Vg~~~GF~V~~SN~l~~~~~~i~aG~d~~~t~ag~~n~f~~~~-------------~~~~~-~~~~~~~~l~~ 280 (322) T protein:vir:31 216 -APDMQFVRSVYGIDLFVSNLLADANETINAGGDARSTTAGKCNMFMNVS-------------DMGLL-PFVVAWKEMPT 280 (322) T ss_pred -hhhHHHHHHHhceeeeeeccccccccccccCcccccccceeeccccccc-------------chhhh-hhhhHhhhhhh Confidence 223 4999999999999999843321 111111110 001111110 01122 34455566778 Q ss_pred EeeeeccchhhhhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 306 LERARRAEYQADQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 306 ~e~~~~~~~~~d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) .|.+|++++|+|.++++++||++++|||.++++...+- T Consensus 281 ~e~~r~~~~~~d~~~~~~~~g~g~~r~e~l~~~~a~~~ 318 (322) T protein:vir:31 281 TKSFIDDYNDDLNTATTARWGNGLVRDENLVCVLANAD 318 (322) T ss_pred hhcccCccccccceeeeeeecceeecccceEEEEeccc Confidence 99999999999999999999999999999999999999 No 21 >protein:vir:105822 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655767;genbank:gi:109522090;genbank:GeneID:4157630 Probab=100.00 E-value=9.9e-58 Score=333.10 Aligned_cols=266 Identities=20% Similarity=0.194 Sum_probs=221.0 Q ss_pred CCCCCccccccccccccccccchhHHHH-HHHHHHHHHHHHHhhhhccCcccc---ccccceEEEEeccCcceeeeecC- Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQSGGDKLALFL-KVFGGEVLTAFARTSVTTNRHIMR---SISSGKSAQFPVLGRTRAAYLQA- 75 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~i-e~~~g~V~~~f~~~s~~~~~~~~~---~i~~G~tv~i~~iG~~t~~~~~~- 75 (343) ||+. 
.|+ |+|+++|++.|++.+++.++++.+ +++.|+||+||++|++++.+|++ T Consensus 1 MA~~---------------------~~~pe~~~~~v~~~~~~~lv~~~l~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~ 59 (273) T protein:vir:10 1 MAFN---------------------NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAA 59 (273) T ss_pred Ccch---------------------hhhHHHHHHHHHHHHHhhhccchhhccccccccccCceEEEeecccccccccccC Confidence 4441 355 999999999999999999998654 57789999999999999999986 Q ss_pred CCcCCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccc Q lcl|NC_011085. 76 GQSLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENI 155 (343) Q Consensus 76 g~~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~ 155 (343) +..++. +++.+++++++||+.+|+++.|+|+|+.|+++|+++ ++++++++||+++|+.++..++.++... T Consensus 60 ~~~~~~--~~~~~~~~~~tid~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~alA~~vD~~i~~~~~~a~~~~------- 129 (273) T protein:vir:10 60 GRQTSA--DAISDTGVDLLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADMLVDNGTAL------- 129 (273) T ss_pred CCccCc--cccccceEEEEEeeeeecceEeecHHHhhhhccHHH-HHHHHHHHHHHHHHHHHHHHHhcccccc------- Confidence 444443 568899999999999999999999999999999865 9999999999999999998775432110 Q ss_pred cccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhh-hhcccc-ccchh Q lcl|NC_011085. 
156 AGLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPN-AANYAA-LIDPE 233 (343) Q Consensus 156 ~g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~-~~~~~~-~~~~~ 233 (343) ..+...++ ..+++.|++|+++|++++||.+|||+||+|++|+.|++++.++ +.++.+ ...++ T Consensus 130 ------------~~~~~~~~----~~~~~~i~~a~~~ld~~~vP~~~R~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~ 193 (273) T protein:vir:10 130 ------------TGSAPTDA----DDAFDLIAKALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLR 193 (273) T ss_pred ------------ccccccch----hHHHHHHHHHHHHhhhcCCCcCCCEEEECHHHHHHHhcchhhhhhhhcccccccee Confidence 00111222 3457788899999999999999999999999999999988655 556554 46789 Q ss_pred cceeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEeeeeccc Q lcl|NC_011085. 234 RGSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERARRAE 313 (343) Q Consensus 234 ~G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~~~~ 313 (343) +|.||+++||+||+||+||..+.. -.+++|++|+++++++. ++|..|+++ T Consensus 194 ~G~ig~i~G~~v~~s~~lp~~~~~-----------------------------~~~~~~~~A~~~a~q~~-~~e~~r~~~ 243 (273) T protein:vir:10 194 AGTIGNLLGARIVESNNLRDTDDE-----------------------------QFVAFHPSAAAYVSQID-TVEALRDQD 243 (273) T ss_pred eeeeeEEeceEEEEecccccCCcc-----------------------------EEEEEeccceeeeeeee-hhhcccCCC Confidence 999999999999999999953210 12689999999998765 899999999 Q ss_pred hhhhhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 
314 YQADQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 314 ~~~d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) +|+|.|+|+++||++++|||++++|+.+.= T Consensus 244 ~~~~~v~~~~~yg~~v~~~~~~~~l~~~g~ 273 (273) T protein:vir:10 244 SFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred cceeeeeeeeeeeeeEeccceEEEEeccCC Confidence 999999999999999999999999876655 No 22 >protein:vir:102605 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_655002;genbank:gi:109392192;genbank:GeneID:4157227 Probab=100.00 E-value=9.9e-58 Score=333.10 Aligned_cols=266 Identities=20% Similarity=0.194 Sum_probs=221.0 Q ss_pred CCCCCccccccccccccccccchhHHHH-HHHHHHHHHHHHHhhhhccCcccc---ccccceEEEEeccCcceeeeecC- Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQSGGDKLALFL-KVFGGEVLTAFARTSVTTNRHIMR---SISSGKSAQFPVLGRTRAAYLQA- 75 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~i-e~~~g~V~~~f~~~s~~~~~~~~~---~i~~G~tv~i~~iG~~t~~~~~~- 75 (343) ||+. .|+ |+|+++|++.|++.+++.++++.+ +++.|+||+||++|++++.+|++ T Consensus 1 MA~~---------------------~~~pe~~~~~v~~~~~~~lv~~~l~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~ 59 (273) T protein:vir:10 1 MAFN---------------------NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAA 59 (273) T ss_pred Ccch---------------------hhhHHHHHHHHHHHHHhhhccchhhccccccccccCceEEEeecccccccccccC Confidence 4441 355 999999999999999999998654 57789999999999999999986 Q ss_pred CCcCCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccc Q lcl|NC_011085. 76 GQSLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENI 155 (343) Q Consensus 76 g~~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~ 155 (343) +..++. +++.+++++++||+.+|+++.|+|+|+.|+++|+++ ++++++++||+++|+.++..++.++... 
T Consensus 60 ~~~~~~--~~~~~~~~~~tid~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~alA~~vD~~i~~~~~~a~~~~------- 129 (273) T protein:vir:10 60 GRQTSA--DAISDTGVDLLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADMLVDNGTAL------- 129 (273) T ss_pred CCccCc--cccccceEEEEEeeeeecceEeecHHHhhhhccHHH-HHHHHHHHHHHHHHHHHHHHHhcccccc------- Confidence 444443 568899999999999999999999999999999865 9999999999999999998775432110 Q ss_pred cccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhh-hhcccc-ccchh Q lcl|NC_011085. 156 AGLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPN-AANYAA-LIDPE 233 (343) Q Consensus 156 ~g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~-~~~~~~-~~~~~ 233 (343) ..+...++ ..+++.|++|+++|++++||.+|||+||+|++|+.|++++.++ +.++.+ ...++ T Consensus 130 ------------~~~~~~~~----~~~~~~i~~a~~~ld~~~vP~~~R~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~ 193 (273) T protein:vir:10 130 ------------TGSAPTDA----DDAFDLIAKALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLR 193 (273) T ss_pred ------------ccccccch----hHHHHHHHHHHHHhhhcCCCcCCCEEEECHHHHHHHhcchhhhhhhhcccccccee Confidence 00111222 3457788899999999999999999999999999999988655 556554 46789 Q ss_pred cceeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEeeeeccc Q lcl|NC_011085. 234 RGSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERARRAE 313 (343) Q Consensus 234 ~G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~~~~ 313 (343) +|.||+++||+||+||+||..+.. -.+++|++|+++++++. ++|..|+++ T Consensus 194 ~G~ig~i~G~~v~~s~~lp~~~~~-----------------------------~~~~~~~~A~~~a~q~~-~~e~~r~~~ 243 (273) T protein:vir:10 194 AGTIGNLLGARIVESNNLRDTDDE-----------------------------QFVAFHPSAAAYVSQID-TVEALRDQD 243 (273) T ss_pred eeeeeEEeceEEEEecccccCCcc-----------------------------EEEEEeccceeeeeeee-hhhcccCCC Confidence 999999999999999999953210 12689999999998765 899999999 Q ss_pred hhhhhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 
314 YQADQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 314 ~~~d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) +|+|.|+|+++||++++|||++++|+.+.= T Consensus 244 ~~~~~v~~~~~yg~~v~~~~~~~~l~~~g~ 273 (273) T protein:vir:10 244 SFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred cceeeeeeeeeeeeeEeccceEEEEeccCC Confidence 999999999999999999999999876655 No 23 >protein:vir:7990 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817344;genbank:gi:29565772;genbank:GeneID:1258978 Probab=100.00 E-value=2.6e-56 Score=325.33 Aligned_cols=266 Identities=20% Similarity=0.185 Sum_probs=219.3 Q ss_pred CCCCCccccccccccccccccchhHHHH-HHHHHHHHHHHHHhhhhccCcccc---ccccceEEEEeccCcceeeeecC- Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQSGGDKLALFL-KVFGGEVLTAFARTSVTTNRHIMR---SISSGKSAQFPVLGRTRAAYLQA- 75 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~i-e~~~g~V~~~f~~~s~~~~~~~~~---~i~~G~tv~i~~iG~~t~~~~~~- 75 (343) ||+. .|+ |+|+++|++.|++.+++.++++.+ ....|+||+||++|.+++.+|++ T Consensus 1 MA~~---------------------~~~pei~~~~v~~~~~~~lv~~~l~~~~~~~~~~~GdTv~ip~~~~~~~~d~~~~ 59 (273) T protein:vir:79 1 MAFN---------------------NFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAA 59 (273) T ss_pred Ccch---------------------hhhHHHHHHHHHHHHHhhccchhhhhccccccccCCcEEEEeecCcccccccccC Confidence 5552 255 999999999999999999987654 33469999999999999998874 Q ss_pred CCcCCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccc Q lcl|NC_011085. 76 GQSLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENI 155 (343) Q Consensus 76 g~~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~ 155 (343) |..++. +++++++++++||+.+++++.|+|+|+.|+++|++ +++++++++||+++|+.++..+..++... 
T Consensus 60 ~~~~~~--~~~~~~~~~~tid~~~~~~~~i~d~d~~~~~~~~~-~~~~~~~~ala~~vD~~i~~~~~~a~~~~------- 129 (273) T protein:vir:79 60 GRQTSA--DAISDTGVDLLIDQEKSIDFLVDDIDRVQVAGSLE-AYTRAGATALATDTDKFIADMLVDNGTAL------- 129 (273) T ss_pred CCccCc--cccccceEEEEEeeecccceeeccHHHHhhcccHH-HHHHHHHHHHHHHHHHHHHHHHhhccccc------- Confidence 555543 56889999999999999999999999999999987 59999999999999999987775432110 Q ss_pred cccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccch-hhhhcccc-ccchh Q lcl|NC_011085. 156 AGLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALM-PNAANYAA-LIDPE 233 (343) Q Consensus 156 ~g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~-~~~~~~~~-~~~~~ 233 (343) .. +...++ ..+++.|.+|+++||+++||.+|||+||+|++|+.||+++. +.+.++.+ +..++ T Consensus 130 ---------~~---~~~~~~----~~~~~~i~~a~~~ld~~~vP~~~R~lvv~p~~~~~Ll~~~~~~~~~~~~~~~~~l~ 193 (273) T protein:vir:79 130 ---------TG---SAPSDA----DDAFDLIASALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLR 193 (273) T ss_pred ---------cc---ccccch----hhHHHHHHHHHHHhhhccCCccCcEEEECHHHHHHHhhchhhhhhhhhccccccee Confidence 00 111122 33567888999999999999999999999999999999875 55667665 45789 Q ss_pred cceeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEeeeeccc Q lcl|NC_011085. 234 RGSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERARRAE 313 (343) Q Consensus 234 ~G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~~~~ 313 (343) +|.||+++||+|++||++|..+. ...+++|++|+++++++. ++|..|+++ T Consensus 194 ~G~ig~~~G~~i~~s~~lp~~~~-----------------------------~~~~a~~~~A~~~a~~~~-~~e~~r~~~ 243 (273) T protein:vir:79 194 AGTIGNLLGARIVESNNLRDTDD-----------------------------EQFVAFHPSAAAYVSQID-TVEALRDQD 243 (273) T ss_pred eeEeeEEeceEEEecccccccCc-----------------------------eEEEEEeccceeeeeehh-hhhcccCcc Confidence 99999999999999999995321 012678999999998765 899999999 Q ss_pred hhhhhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 
314 YQADQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 314 ~~~d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) +|+|+|+++++||++++|||++++|+.+.= T Consensus 244 ~~~~~v~~~~~yg~~v~~p~~vv~~~~~g~ 273 (273) T protein:vir:79 244 SFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred cceeeeeeeeeeeeEEecCceEEEEeccCC Confidence 999999999999999999999999776555 No 24 >protein:vir:102655 Length: 322 # NCBI annotation: Hypothetical protein # Family: family:all:6384 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_052979;genbank:gi:50282923;genbank:GeneID:2948122 Probab=100.00 E-value=4e-55 Score=318.81 Aligned_cols=310 Identities=13% Similarity=0.077 Sum_probs=224.9 Q ss_pred CCCCCccccccccccccccccchhHHHHHHHHHHHHHHHHH-hhhhccCccccccc-cceEEE------EeccCcceeee Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFAR-TSVTTNRHIMRSIS-SGKSAQ------FPVLGRTRAAY 72 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~-~s~~~~~~~~~~i~-~G~tv~------i~~iG~~t~~~ 72 (343) |+= ++ +-.|--.=+.++.+.|+|+|+.+|+..||. .|+|++.++.++-. ++++++ ++.+++..+.. T Consensus 1 ~~~---~~---~~~~~~~Ms~~i~~~fv~qy~~~v~~~~qq~~s~L~~tV~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 74 (322) T protein:vir:10 1 MKL---NA---IMSMLPLIAGDIDQAFVQTYETTLRILSQQKSAKLKQYCQHKNESSESHNWETLASMDPDAVKRKRSRQ 74 (322) T ss_pred Ccc---cc---eeeeeeeeechhhhHHHHHHHHHHHHHHHHhhhhhhcccccccccccccceeecccccccccccccccc Confidence 321 11 111111122357788999999999999884 59999999988543 444434 44455555555 Q ss_pred ecCCCcCCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccc Q lcl|NC_011085. 73 LQAGQSLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASN 152 (343) Q Consensus 73 ~~~g~~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~ 152 (343) +.+++..+-...+.+++.+.+.++++ |++++|||+|++|+++|++++|++++++||+|++|+.|+..+...+....+. 
T Consensus 75 ~~~d~~~dtp~~~~~~~~r~~~~~d~-~~~~~VDd~D~~k~~~D~~~~~~~~~a~AL~R~~D~~I~~a~~g~a~~~~~g- 152 (322) T protein:vir:10 75 QSADGTYPTPVNNKPFAKRRTNVDTY-DTGHVVEQEDISQMLLDPNSALITSQAYAMARKTDDLIIAGAWKPASIKGTG- 152 (322) T ss_pred cccCcccCCCccccccceEEEeeccc-ccceecchHHHHHhhcCchHHHHHHHHHHhhhHHHHHHHhhhhccccccccc- Confidence 44444332222346778888777666 7889999999999999999999999999999999999987665444321111 Q ss_pred ccccccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCC-cEEEeCHHHHHHHhccchhhhhccccccc Q lcl|NC_011085. 153 ENIAGLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSAD-RTFYTTPEVYSAILAALMPNAANYAALID 231 (343) Q Consensus 153 ~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~g-R~~vv~P~~~~~Ll~~~~~~~~~~~~~~~ 231 (343) ....++++..+..+.. ...++.|++|++.|++++||+++ ||+||+|++|++||++++|+++||.+... T Consensus 153 -t~v~~~ss~~i~~g~~----------g~t~~kl~~a~~~l~~~dvp~d~~R~~vv~p~~~~~LL~d~~~ts~D~~~~~~ 221 (322) T protein:vir:10 153 -QPVEFLATQEIGDGTK----------PISFDYVTEITERFLENEIEPEVSKVIVIGPTQARKLLQITEATSADYTSAMD 221 (322) T ss_pred -cccccCCCcccccCcc----------chhHHHHHHHHHHHHhcCCCCCCCeEEEeCHHHHHHHhcchhhhhhhcccchh Confidence 1111222222222111 12367788999999999999875 99999999999999999999999998777 Q ss_pred h-hcceeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEeeee Q lcl|NC_011085. 232 P-ERGSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERAR 310 (343) Q Consensus 232 ~-~~G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~ 310 (343) + ++|.|++++||+|++||+||..+.+..+. ... ...+...+. 
+++||++|+++++.++++++..+ T Consensus 222 l~~~G~ig~~lGf~~i~s~~lp~~~~t~~~~--------~~~-----~~~~~~~~~-~~a~~k~Av~~a~~~dv~~~i~~ 287 (322) T protein:vir:10 222 LQSKGIITNWMGYTWIVSTRLDKFDPTQWGM--------AAE-----DGPQGDEIW-CIAMTDMALGYHSCKDIWTKVAE 287 (322) T ss_pred hhhcCeeeeeeeEEEEEeccCCccccccccc--------ccc-----CCCCcccee-EEEEecCceeEEEeeeeeEEeec Confidence 6 67999999999999999999765433211 111 112222333 37999999999999999999887 Q ss_pred cc-chhhhhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 311 RA-EYQADQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 311 ~~-~~~~d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) +| +++++.|.++++||+++++|+++++|.+..- T Consensus 288 ~~~~~~a~~I~~~~~~Ga~ri~~~gVv~i~~~e~ 321 (322) T protein:vir:10 288 DPSASFAWRIYSAFTADCVRVEDEHIFKLRLKNS 321 (322) T ss_pred cCCcchhhhhhhhhhhCceEeccCcEEEEEEecc Confidence 66 5579999999999999999999999999888 No 25 >protein:vir:1781 Length: 221 # NCBI annotation: minor capsid protein # Family: family:all:975 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570347;genbank:gi:18640506;genbank:GeneID:932719 Probab=100.00 E-value=1.3e-48 Score=283.13 Aligned_cols=217 Identities=22% Similarity=0.308 Sum_probs=164.9 Q ss_pred eeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccccCCceeecccccccccc Q lcl|NC_011085. 95 IDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENIAGLGSASILEVGAKGDLTS 174 (343) Q Consensus 95 iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~g~~~~~~~~~~~~~~~~~ 174 (343) ||++++++|.|||+|++|++||+|+++++|+|++||+++|++|+++++++++...|.+..+.+ +.. ... 
.+.+++ T Consensus 1 iD~lL~a~~~VdDiD~aqa~~dvr~e~t~e~G~ALA~~~D~~i~~~~~~aA~~~~p~~~~~~g---~~~-~~~-a~~t~~ 75 (221) T protein:vir:17 1 MDDLLVASQFVYDLDEILAQWNTRSEISKQIGEALAIHYDERIARVLASASIAAAPVTGQDGG---FSV-NIG-AGNTNN 75 (221) T ss_pred CCcchhHHHHHHhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCcccccccC---cce-ecc-ccccCC Confidence 999999999999999999999999999999999999999999999999999877766554322 221 111 123334 Q ss_pred hHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhc--cchhhhhcccc-ccchhcc-eeEEEeceEEEEecc Q lcl|NC_011085. 175 PVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILA--ALMPNAANYAA-LIDPERG-SIRNVMGFEVVEVPH 250 (343) Q Consensus 175 ~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~--~~~~~~~~~~~-~~~~~~G-~V~~i~Gf~V~~sn~ 250 (343) ++++++.|++|+++|||++||++|||+||+|++|+.||+ ++++.+.++.+ .+.+++| .|++++||+||+||| T Consensus 76 ----~~~l~dai~~a~~~LdekdVP~~gR~~vv~P~~y~~LL~~~d~~~~n~d~~~s~g~~~~g~~i~~v~G~~V~~Snn 151 (221) T protein:vir:17 76 ----AQAIVDGFFEAAAVLDERSAPMDGRVAVLSPRQYYSLISSVDTNILNREIGNTQGDMNTGKGLYVNAGIRIYKSNV 151 (221) T ss_pred ----HHHHHHHHHHHHHHHhhcCCCCCCCEEEeCcHHHHHHHHhcCcceeeeecccccccccccceeeeecCcEEEEecc Confidence 456678888999999999999999999999999998886 46788888865 4568888 499999999999999 Q ss_pred ccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEeeeeccchhhhhhhhhhhhcccee Q lcl|NC_011085. 251 LTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERARRAEYQADQIIARYAMGHGGL 330 (343) Q Consensus 251 lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~~~~~~~d~i~~~~~~G~~v~ 330 (343) +|+...+..... .+..........+|+++|++++||+|||+|+++++.+.+- .|+| ++..+ ..+. T Consensus 152 lP~~~gt~~~~~---ag~~~~~~~~~~~yr~~fs~~~glv~~~~Avgtvkl~~~~---~~~~-----~~~~~----~~~~ 216 (221) T protein:vir:17 152 LASLYGTNLVTD---PGDATTSGENNGSYRPAITDRAGLVFHKEAADTVEVLLPP---SRPP-----LVISM----FSIR 216 (221) T ss_pred CCcccccccccC---CccccccccccccccccccceEEEEEcchheeeeeeecCC---CCCc-----eeeee----eecc Confidence 998654433221 1111222233458999999999999999999999987543 2332 11111 2345 Q ss_pred cccce Q lcl|NC_011085. 
331 RPEAA 335 (343) Q Consensus 331 rpe~~ 335 (343) |||.- T Consensus 217 ~~~~~ 221 (221) T protein:vir:17 217 RPDRR 221 (221) T ss_pred CCCCC Confidence 56555 No 26 >protein:vir:80930 Length: 278 # NCBI annotation: Cps # Family: family:all:522 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468392;genbank:gi:157324966;genbank:GeneID:5601363 Probab=100.00 E-value=5.3e-46 Score=268.81 Aligned_cols=270 Identities=16% Similarity=0.123 Sum_probs=221.0 Q ss_pred CCCCCccccccccccccccccchhHHHH-HHHHHHHHHHHHHhhhhccCcccc-cc--ccceEEEEeccCcc-eeeeecC Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQSGGDKLALFL-KVFGGEVLTAFARTSVTTNRHIMR-SI--SSGKSAQFPVLGRT-RAAYLQA 75 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~i-e~~~g~V~~~f~~~s~~~~~~~~~-~i--~~G~tv~i~~iG~~-t~~~~~~ 75 (343) |||+.+--. .+|+ |+|+..|.+.|.+..++.++.... ++ +.|++|+||+++.. .+.+|.. T Consensus 1 Ma~~~T~~~---------------~~iiPev~s~~v~~~~~~~~v~~~~~~~~~~l~g~~G~tv~ip~~~~~g~a~~~~~ 65 (278) T protein:vir:80 1 MADLTTKLA---------------NLIDPEVMGPMISAKLPKAIKFGKIAPIDNSLEGQPGSEITVPKYKYIGDAQDVAE 65 (278) T ss_pred CCCcceehh---------------heecHHHHHHHHHHHHHHhhhhcccceecccccCCCCCEEEEeeeccCCcceeecC Confidence 888632221 2355 999999999999999998886543 44 46999999998754 4678999 Q ss_pred CCcCCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccc Q lcl|NC_011085. 76 GQSLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENI 155 (343) Q Consensus 76 g~~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~ 155 (343) |+.++. ++++.++.+++|++.. ..|.|+|++..++..|++++++++++++|++++|+.++..+..+.... 
T Consensus 66 g~~i~~--~~lt~~~~~~~i~~~~-~a~~v~D~~~~~~~~d~~~~~~~~~a~~~a~~~d~~l~~~l~~a~~~~------- 135 (278) T protein:vir:80 66 GAAIDY--SALETESVKHGIKKAG-KGVKLTDESVLSGYGDPVEEAQKQIRMAIASKVDNDILEEALTTTLEV------- 135 (278) T ss_pred CCcCcc--cccccceeeEeeehhh-ccccccHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHHhcccccc------- Confidence 998875 4689999999999975 589999999999999999999999999999999999998775432110 Q ss_pred cccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccc--hhhhhccccccchh Q lcl|NC_011085. 156 AGLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAAL--MPNAANYAALIDPE 233 (343) Q Consensus 156 ~g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~--~~~~~~~~~~~~~~ 233 (343) ..+... ......++.+.++..+|+++++|. .|+++|+|++|+.|+++. +|+.....++..++ T Consensus 136 -----------~~~~t~----~~~~~~~~~~~da~~~l~~~~~~~-~~~ivv~p~~~~~L~k~~~~~~~~~~~~g~~~~~ 199 (278) T protein:vir:80 136 -----------KGAINI----GLIDKIENTFTDAPDAIEDESITT-TGVLFLNYKDTAKLREEAAGSWTKASQLGDDLLV 199 (278) T ss_pred -----------cccccc----chhhhHHHHHHHHHHhhcccCCCc-ccEEEECHHHHHHHHhhhhhhcccccccccccee Confidence 000000 112233566777889999999995 678999999999998875 56666666777899 Q ss_pred cceeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEeeeeccc Q lcl|NC_011085. 234 RGSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERARRAE 313 (343) Q Consensus 234 ~G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~~~~ 313 (343) +|.|++++||+||+||++|.+. +.++|+.|++++..+++++|.+|+++ T Consensus 200 ~G~ig~~~G~~Vi~s~~~p~~t--------------------------------~~l~~~gAi~~~~~~~~~vE~~Rd~~ 247 (278) T protein:vir:80 200 KGAFGELLGWEIVRTKKLADGN--------------------------------ALAVKAGALKTFLKRNLLAESGRDMD 247 (278) T ss_pred eccceeecceeEEEcCCCCcce--------------------------------EEEEeccceeeeecCCcccccccchh Confidence 9999999999999999998421 25788999999999999999999999 Q ss_pred hhhhhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 
314 YQADQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 314 ~~~d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) +++|.|.++++||++++||++++.|+..+| T Consensus 248 ~~~d~i~~~~~yg~~v~~~~~~v~it~~a~ 277 (278) T protein:vir:80 248 HKLTKFNADQHYAVALVDETKAVKVVPVAG 277 (278) T ss_pred hccceeeeeeEEEEEEEcCcceEEEeeccC Confidence 999999999999999999999999999999 No 27 >protein:vir:94800 Length: 319 # NCBI annotation: ORF012 # Family: family:all:701 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240536;genbank:gi:66396203;genbank:GeneID:5133580 Probab=100.00 E-value=6.8e-44 Score=257.23 Aligned_cols=284 Identities=13% Similarity=0.027 Sum_probs=218.1 Q ss_pred CCCCCccccccccccccccccchhHHHH-HHHHHHHHHHHHHhhhhccC-ccc-cccccceEEEEeccCcceeeeecCCC Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQSGGDKLALFL-KVFGGEVLTAFARTSVTTNR-HIM-RSISSGKSAQFPVLGRTRAAYLQAGQ 77 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~i-e~~~g~V~~~f~~~s~~~~~-~~~-~~i~~G~tv~i~~iG~~t~~~~~~g~ 77 (343) .-|.++--.. .-+|-.+.+=++..+.+ |+|++.+++.+...++...+ ++. ....+|++|+||+++.+.++||++++ T Consensus 5 ~~~~~~~~~~-~~~~~~~~~~~~nt~~l~~k~~~~LD~~~~~~~~s~~~~~N~~~e~~gg~tVkIp~i~~~gl~DY~R~~ 83 (319) T protein:vir:94 5 IKNATGMLKL-NLQHFANKSVEPGQTLLKNKHVGILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTTELKDYKRNA 83 (319) T ss_pred cccccceeEe-ehhhhhccCCCcchHHHHHHHHHHHHHHHHHhhhhhhcccCcceEeccCcEEEEeeecccccccccCCC Confidence 2222222221 23455555556565555 99999999988888776543 342 36678999999999999999999988 Q ss_pred cCCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhh--HHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccc Q lcl|NC_011085. 78 SLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDV--RSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENI 155 (343) Q Consensus 78 ~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~--~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~ 155 (343) .... ++++.+..+++||+.+||.|.||++|..|++.++ ...+.+.+...++..+|.+.+..++..+... 
T Consensus 84 g~~~--g~vt~~~~t~tidqdR~~~F~VD~~D~~Etn~~l~a~~i~~~~~~~~v~PEiDay~~skla~~a~~~------- 154 (319) T protein:vir:94 84 TNEF--DHPKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAKH------- 154 (319) T ss_pred Cccc--CCcccceeEEEeecccccccccchhhHhhhhchhhHHHHHHHHHHHHhhhhhhHHHHHHHHhhcccc------- Confidence 6654 5688999999999999999999999999998876 4456778888999999988887765432110 Q ss_pred cccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhccccccchhcc Q lcl|NC_011085. 156 AGLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAALIDPERG 235 (343) Q Consensus 156 ~g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G 235 (343) . +.. ...+++++.|+++.++|+|++|| ++||++|+|++|.+|+++++|+.....++..+.+| T Consensus 155 --------~-----~~~----~t~~n~y~~i~~a~~~Lde~~VP-~~Rvl~Vtp~~~~~L~~~~~f~~~~~~~~~~~~~g 216 (319) T protein:vir:94 155 --------L-----TVG----TGSDAQYDAVLDVSVELDEIKAP-ENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKG 216 (319) T ss_pred --------c-----ccc----cCHHHHHHHHHHHHHHHHhcCCC-CCcEEEeCHHHHHHHHhhhhhhccccccccceeee Confidence 0 001 12356788889999999999999 69999999999999999999988665566778999 Q ss_pred eeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEeeee-ccch Q lcl|NC_011085. 236 SIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERAR-RAEY 314 (343) Q Consensus 236 ~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~-~~~~ 314 (343) .|++++||+|+++|+..... .. -++.|++|+.++.+.+ .+|.++ .+.+ T Consensus 217 ~Vg~idG~~Vi~vps~~~k~-------------------------in-----~i~~h~~A~~~~~k~~-~~~~~~p~~~~ 265 (319) T protein:vir:94 217 VQGELDGFVIVKVPTKLLQG-------------------------LQ-----AIAVVGEVLASPIQAD-LAKTNSNIPGM 265 (319) T ss_pred eceeecCeEEEEeccccccc-------------------------ce-----EEEEcCCeeeeeeeee-eeeccCCCccc Confidence 99999999999976532100 11 2789999999888776 678776 5889 Q ss_pred hhhhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 
315 QADQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 315 ~~d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) |+|.|+++++||++|+||+..++++...= T Consensus 266 ~a~~v~gr~y~d~~V~~~k~~~Iy~~~~~ 294 (319) T protein:vir:94 266 FGTLAEQLLYTGAFVPEHLQKYIFTIGGT 294 (319) T ss_pred cceeeeeeeeeeeEEeccccceEEEeecC Confidence 99999999999999999998777763222 No 28 >protein:vir:97331 Length: 319 # NCBI annotation: ORF011 # Family: family:all:701 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240611;genbank:gi:66396278;genbank:GeneID:5133687 Probab=100.00 E-value=6.8e-44 Score=257.23 Aligned_cols=284 Identities=13% Similarity=0.027 Sum_probs=218.1 Q ss_pred CCCCCccccccccccccccccchhHHHH-HHHHHHHHHHHHHhhhhccC-ccc-cccccceEEEEeccCcceeeeecCCC Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQSGGDKLALFL-KVFGGEVLTAFARTSVTTNR-HIM-RSISSGKSAQFPVLGRTRAAYLQAGQ 77 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~i-e~~~g~V~~~f~~~s~~~~~-~~~-~~i~~G~tv~i~~iG~~t~~~~~~g~ 77 (343) .-|.++--.. .-+|-.+.+=++..+.+ |+|++.+++.+...++...+ ++. ....+|++|+||+++.+.++||++++ T Consensus 5 ~~~~~~~~~~-~~~~~~~~~~~~nt~~l~~k~~~~LD~~~~~~~~s~~~~~N~~~e~~gg~tVkIp~i~~~gl~DY~R~~ 83 (319) T protein:vir:97 5 IKNATGMLKL-NLQHFANKSVEPGQTLLKNKHVGILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTTELKDYKRNA 83 (319) T ss_pred cccccceeEe-ehhhhhccCCCcchHHHHHHHHHHHHHHHHHhhhhhhcccCcceEeccCcEEEEeeecccccccccCCC Confidence 2222222221 23455555556565555 99999999988888776543 342 36678999999999999999999988 Q ss_pred cCCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhh--HHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccc Q lcl|NC_011085. 78 SLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDV--RSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENI 155 (343) Q Consensus 78 ~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~--~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~ 155 (343) .... ++++.+..+++||+.+||.|.||++|..|++.++ ...+.+.+...++..+|.+.+..++..+... 
T Consensus 84 g~~~--g~vt~~~~t~tidqdR~~~F~VD~~D~~Etn~~l~a~~i~~~~~~~~v~PEiDay~~skla~~a~~~------- 154 (319) T protein:vir:97 84 TNEF--DHPKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAKH------- 154 (319) T ss_pred Cccc--CCcccceeEEEeecccccccccchhhHhhhhchhhHHHHHHHHHHHHhhhhhhHHHHHHHHhhcccc------- Confidence 6654 5688999999999999999999999999998876 4456778888999999988887765432110 Q ss_pred cccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhccccccchhcc Q lcl|NC_011085. 156 AGLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAALIDPERG 235 (343) Q Consensus 156 ~g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G 235 (343) . +.. ...+++++.|+++.++|+|++|| ++||++|+|++|.+|+++++|+.....++..+.+| T Consensus 155 --------~-----~~~----~t~~n~y~~i~~a~~~Lde~~VP-~~Rvl~Vtp~~~~~L~~~~~f~~~~~~~~~~~~~g 216 (319) T protein:vir:97 155 --------L-----TVG----TGSDAQYDAVLDVSVELDEIKAP-ENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKG 216 (319) T ss_pred --------c-----ccc----cCHHHHHHHHHHHHHHHHhcCCC-CCcEEEeCHHHHHHHHhhhhhhccccccccceeee Confidence 0 001 12356788889999999999999 69999999999999999999988665566778999 Q ss_pred eeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEeeee-ccch Q lcl|NC_011085. 236 SIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERAR-RAEY 314 (343) Q Consensus 236 ~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~-~~~~ 314 (343) .|++++||+|+++|+..... .. -++.|++|+.++.+.+ .+|.++ .+.+ T Consensus 217 ~Vg~idG~~Vi~vps~~~k~-------------------------in-----~i~~h~~A~~~~~k~~-~~~~~~p~~~~ 265 (319) T protein:vir:97 217 VQGELDGFVIVKVPTKLLQG-------------------------LQ-----AIAVVGEVLASPIQAD-LAKTNSNIPGM 265 (319) T ss_pred eceeecCeEEEEeccccccc-------------------------ce-----EEEEcCCeeeeeeeee-eeeccCCCccc Confidence 99999999999976532100 11 2789999999888776 678776 5889 Q ss_pred hhhhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 
315 QADQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 315 ~~d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) |+|.|+++++||++|+||+..++++...= T Consensus 266 ~a~~v~gr~y~d~~V~~~k~~~Iy~~~~~ 294 (319) T protein:vir:97 266 FGTLAEQLLYTGAFVPEHLQKYIFTIGGT 294 (319) T ss_pred cceeeeeeeeeeeEEeccccceEEEeecC Confidence 99999999999999999998777763222 No 29 >protein:vir:107120 Length: 329 # NCBI annotation: conserved phage protein # Family: family:all:701 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950606;genbank:gi:119953686;genbank:GeneID:4643129 Probab=100.00 E-value=8.8e-44 Score=256.62 Aligned_cols=284 Identities=13% Similarity=0.023 Sum_probs=219.8 Q ss_pred CCCCCccccccccccccccccchhHHHH-HHHHHHHHHHHHHhhhhccC-cc-ccccccceEEEEeccCcceeeeecCCC Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQSGGDKLALFL-KVFGGEVLTAFARTSVTTNR-HI-MRSISSGKSAQFPVLGRTRAAYLQAGQ 77 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~i-e~~~g~V~~~f~~~s~~~~~-~~-~~~i~~G~tv~i~~iG~~t~~~~~~g~ 77 (343) .-|..+--.. .-+|..+.+-.+..+-+ |+|++.|++.|...++...+ ++ .....+|++|+||+++.+.++||+++. T Consensus 16 ~~~~~~~~~~-~~~~~~~~~~~~nt~~l~~k~~~~LD~~~~~~~~s~~~~~N~~~e~~~g~tVkIp~i~~~gl~DY~R~~ 94 (329) T protein:vir:10 16 IKNATGKLKL-NLQHFANKSVEPGDTLLKNKHVGILEKVTAANSYSAPAVISNDAIFMQGRSFTVIKGDVTELKDYKRNA 94 (329) T ss_pred hhcccceeEE-ehhhhcCCccCCchhHHHHHHHHHHHHHHHhhceeeeeecccceeeccCcEEEEeeecccccccccCCC Confidence 2233222222 33455566666665544 99999999999988776543 33 235678999999999999999999988 Q ss_pred cCCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhh--HHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccc Q lcl|NC_011085. 78 SLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDV--RSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENI 155 (343) Q Consensus 78 ~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~--~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~ 155 (343) .... ++++.+..+++||+.+||.|.||++|..|++.++ ...+.+.+.+.++..+|.+.+..++..+... 
T Consensus 95 g~~~--g~vt~~~~t~tidqdR~~~F~VD~~D~dEtn~~l~a~~i~~~~~~~~v~pEiDay~~skla~~a~~~------- 165 (329) T protein:vir:10 95 TNEF--DHPQIQETTYFLDQEKYWGRFVDALDRRDTEGNIDINYVVAKQASEVVAPYLDNLRFATLARNKAKH------- 165 (329) T ss_pred Cccc--cccccceeEEEeecccceeeecchhhHhhhhhhhhHHHHHHHHHHHHhhhHHHHHHHHHHHhhcccc------- Confidence 6654 5688999999999999999999999999998776 4456677899999999999887775432110 Q ss_pred cccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhccccccchhcc Q lcl|NC_011085. 156 AGLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAALIDPERG 235 (343) Q Consensus 156 ~g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G 235 (343) . +... ..+++++.|+++.++|+|++|| ++||++|+|++|.+|+++++|+.....++..+++| T Consensus 166 --------~-----~~~~----t~~nay~~i~~a~~~Lde~~vp-~~Rvl~VtP~~~~~Lk~~~~f~~~~~~~~~~~~~g 227 (329) T protein:vir:10 166 --------L-----TVGS----GADAQYDAVLDVSVELDEIGAG-ASRILFVTPKFYKGIKKFVIELPQGDNRQQVLGKG 227 (329) T ss_pred --------c-----cccc----CHHHHHHHHHHHHHHHHhcCCC-CCcEEEeCHHHHHHHHhhhhhhccccccccceeee Confidence 0 0111 2356788889999999999999 59999999999999999999987655556678999 Q ss_pred eeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEeeee-ccch Q lcl|NC_011085. 236 SIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERAR-RAEY 314 (343) Q Consensus 236 ~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~-~~~~ 314 (343) .|++++||+|+++|+..... .. .++.|++|+.++.+.+ .+|.++ .+.+ T Consensus 228 ~Vg~idG~~Ii~vps~~~k~-------------------------in-----~ii~~~~A~~~~~K~~-~~~~~~p~~~~ 276 (329) T protein:vir:10 228 VQGELDGFTIVKVPSKMLQG-------------------------VE-----AMAVIGEVMASPIQAN-EAKLNSNVPGM 276 (329) T ss_pred eeeeecCeEEEEecCCcccc-------------------------ee-----EEEEcCCceeeeeeee-eeeeeCCCCcc Confidence 99999999999987543210 11 2789999999988877 788876 4889 Q ss_pred hhhhhhhhhhhccceecccceEEEEecC-C Q lcl|NC_011085. 
315 QADQIIARYAMGHGGLRPEAAGALVFTA-G 343 (343) Q Consensus 315 ~~d~i~~~~~~G~~v~rpe~~~~i~~~~-g 343 (343) ++|.|+++++||++|+||++.++++... . T Consensus 277 ~a~~v~gr~yyd~~V~~~k~~~I~~~~~~a 306 (329) T protein:vir:10 277 FGTLAEQMLYTGAFVPEHLQKYIFTIGGKE 306 (329) T ss_pred chheeeeeeeeeeEEEccccCEEEEecccC Confidence 9999999999999999999877765322 1 No 30 >protein:vir:108303 Length: 418 # NCBI annotation: hypothetical protein # Family: family:all:1412 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552282;genbank:gi:160700607;genbank:GeneID:5758819 Probab=100.00 E-value=3e-43 Score=253.68 Aligned_cols=295 Identities=17% Similarity=0.131 Sum_probs=211.7 Q ss_pred CCCCCccccccccccccccccchhHHH-HHHHHHHHHHHHHHhhhhccCcccc---cc-ccceEEEEeccCcceeeeecC Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQSGGDKLALF-LKVFGGEVLTAFARTSVTTNRHIMR---SI-SSGKSAQFPVLGRTRAAYLQA 75 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~-ie~~~g~V~~~f~~~s~~~~~~~~~---~i-~~G~tv~i~~iG~~t~~~~~~ 75 (343) ||-+ +|. +. .|+|+.++++.|++++++.++++.+ ++ +.|+|||||+.+..+++++. T Consensus 1 m~~~---~N~---------------~ltp~iia~~~l~~l~~~lV~~~lv~r~y~~e~~~~GDTV~I~vp~~~~v~dg~- 61 (418) T protein:vir:10 1 MAVQ---DNN---------------LLTDDVIAKEALRLLKNNLVMAKCVYRNYEKTFGKVGDTIRLKLPYRVKSASGR- 61 (418) T ss_pred CCcc---ccc---------------cccHHHHHHHHHHHHHHhccchhhhcCCCchHHhhCCCEEEEeeCCceeecccC- Confidence 5443 221 12 3799999999999999999988764 33 35999999999999998865 Q ss_pred CCcCCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccc Q lcl|NC_011085. 76 GQSLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENI 155 (343) Q Consensus 76 g~~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~ 155 (343) .+. .+++..++++|+||+.+|++|.|+|.|++|...|++++++++++++||+.+|+.++..+..+++.. 
T Consensus 62 --~~~--~~~~te~~v~l~id~~k~~~~~itD~e~a~~~~d~~~~~l~~A~~aLA~~vD~~ia~l~~~a~~~~------- 130 (418) T protein:vir:10 62 --TLV--KQPMVDQTIPFKIAYQEHVGLEYTVKDKTLDIMQFSERYLKSGMVQIANQIDRSLALTLKKAFHSS------- 130 (418) T ss_pred --Ccc--ccccccceEEEEEecccccceeechHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccc------- Confidence 343 356888999999999999999999999999999999999999999999999999987654433211 Q ss_pred cccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCC-cEEEeCHHHHHHHhccchhhhhccccccchhc Q lcl|NC_011085. 156 AGLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSAD-RTFYTTPEVYSAILAALMPNAANYAALIDPER 234 (343) Q Consensus 156 ~g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~g-R~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~ 234 (343) ++.+ +.+ . .++.|.+++.+|++++||.+| ||+||+|++|+.|++++++......++..+++ T Consensus 131 -----------gt~g--t~~-~----~~~~i~~a~~~Ld~~~VP~~G~R~lVv~P~~~~~L~~~~~~~~~~~~~~~~lr~ 192 (418) T protein:vir:10 131 -----------GTPG--VRP-G----AFIDFANAGAKQTTYAVPQDGMRHAVLDPFTCASLSDEVTKLFKESMVEQAYKM 192 (418) T ss_pred -----------ccCC--cCc-c----hHHHHHHHHHHHHhcCCCCCCceEEEeCHHHHHHHhhhccccccccccchhhhe Confidence 0001 111 1 256677889999999999985 99999999999999888776544555667999 Q ss_pred ceeEEEeceEEEEecccccccccccccccccccc---cc-c------------ccccc---------------------- Q lcl|NC_011085. 235 GSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQK---HA-F------------PKTAE---------------------- 276 (343) Q Consensus 235 G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~---~~-~------------~~~~~---------------------- 276 (343) |.||+++||+||+|||+|...........+..+. +. . ...+. T Consensus 193 G~IG~i~GF~V~~S~nip~~tag~~~~t~~v~ga~~~~~~~~~~~~t~s~~g~l~~Gd~~ti~gv~~v~~~t~~~~~~~~ 272 (418) T protein:vir:10 193 GYRGNVAAYEVYESQNLPKHTVGDHGGTPLVNGTVVNGDTVGFDGGTASTTGFLKAGDVITFGGVFGVNPQNYETTGLLQ 272 (418) T ss_pred eeeeeeeceEEEEecCCCcccccccccceeeecccccceeEEEeecceeeccceeeccEEEECceeecccccccccccce Confidence 9999999999999999996432221111110000 00 0 00000 Q ss_pred -------------ccccc---------------------------------------------cccceEeEeechhhhee Q lcl|NC_011085. 
277 -------------GDTKV---------------------------------------------ALDNVVGLFQHRSAVGT 298 (343) Q Consensus 277 -------------~~~~~---------------------------------------------~~~~~~~l~~~~~Av~~ 298 (343) +.... ..+....++||++|+.. T Consensus 273 ~f~V~~~~~~~~~~~~tv~i~p~~~~~~~~~~~~~~~~~~~~~~~~v~a~~a~~~~it~~~~a~~~~~~nl~f~~~a~~l 352 (418) T protein:vir:10 273 EFVVLEDVDTDAGGAGSIKISPSLNDGTATINNENGDPVSLTAYQNVTALPADNAPITVLGAANTTYEQNYLFHRDAIAL 352 (418) T ss_pred EEEEEeeccccccCcceeEeccccccccccccccccccccccCCCcccccccCcceeeeecccccceeeeeeeecceEEE Confidence 00000 00112248999998877 Q ss_pred eeeee--------------------eEEeeeeccchhhhhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 299 VKLKD--------------------LSLERARRAEYQADQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 299 ~~~~~--------------------~~~e~~~~~~~~~d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) +.... +++-.+||.+..-+.++.-..||.+.+|||.++.|.-++- T Consensus 353 ~~~~l~~p~g~~~~~~~~~~~~G~s~r~~~~~d~~~~~~~~r~d~l~g~~~~~p~~~~~~~g~~~ 417 (418) T protein:vir:10 353 AMIDLELPQSAVIKSRAADPETGLSLTLTGAYDINEQSEIHRIDAVWGADMIYGELALRLWGAAS 417 (418) T ss_pred EEeeccCCCCCCcceEEEeccCCeEEEEEEcccccccceEEEEEeecCceeecccceEEEEeecC Confidence 65433 2222336666677778888899999999999988877777 No 31 >protein:vir:96123 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240078;genbank:gi:66395742;genbank:GeneID:5133103 Probab=100.00 E-value=2.6e-43 Score=254.01 Aligned_cols=263 Identities=16% Similarity=0.177 Sum_probs=218.9 Q ss_pred CCCCCccccccccccccccccchhHHHH-HHHHHHHHHHHHHhhhhccCcccc-cc--ccceEEEEeccCc-ceeeeecC Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQSGGDKLALFL-KVFGGEVLTAFARTSVTTNRHIMR-SI--SSGKSAQFPVLGR-TRAAYLQA 75 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~i-e~~~g~V~~~f~~~s~~~~~~~~~-~i--~~G~tv~i~~iG~-~t~~~~~~ 75 (343) |||.++ +. -.+++ |+|+..|.+.|.+..++.++.+.. ++ +.|++|+||+++. 
..+.+|.+ T Consensus 1 ma~~~T------~~---------~d~i~Pev~s~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~ip~~~~~g~~~~~~~ 65 (274) T protein:vir:96 1 MAQGTT------KV---------SNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAE 65 (274) T ss_pred CCcccc------ch---------hhhhhhHHHHHHHHHHHHhhhhhcccccccccccCCCCCEEEEEeeccCCCccccCC Confidence 887643 21 12455 999999999999999999987665 33 3599999999875 46789999 Q ss_pred CCcCCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccc Q lcl|NC_011085. 76 GQSLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENI 155 (343) Q Consensus 76 g~~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~ 155 (343) |+.++. .+++.++.+++|++. ++.|.|+|++..++..|++++++++++++|++++|+.++..+..+... T Consensus 66 g~~i~~--~~it~~~~~~~i~~~-~~~~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~~d~~i~~~l~~a~~~-------- 134 (274) T protein:vir:96 66 GEKIPV--DQIGTSKREAKVRKI-GKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATLT-------- 134 (274) T ss_pred CCcCch--hhcccceeEEEEEee-eceeeecHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCC-------- Confidence 998875 468899999999884 788999999999999999999999999999999999998766432100 Q ss_pred cccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccc--hhhhhccccccchh Q lcl|NC_011085. 156 AGLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAAL--MPNAANYAALIDPE 233 (343) Q Consensus 156 ~g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~--~~~~~~~~~~~~~~ 233 (343) ..++.. .++.|.+|..+|+++++ ++||++|+|++|..|+++. 
+|+.....++..++ T Consensus 135 ------------~~~~~~--------~~d~i~dA~~~l~d~~~--~~~~ivv~p~~~~~L~k~~~~~f~~~~~~g~~~~~ 192 (274) T protein:vir:96 135 ------------VEADIT--------KLDGLQTAIDKFNDEDL--EPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIV 192 (274) T ss_pred ------------cCcccc--------cHHHHHHHHHHhcccCC--CceEEEeCHHHHHHHHhccccccccccccccccee Confidence 000111 15667788999999886 6899999999999998874 56766666777899 Q ss_pred cceeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEeeeeccc Q lcl|NC_011085. 234 RGSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERARRAE 313 (343) Q Consensus 234 ~G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~~~~ 313 (343) +|.|++++||+|++||++|.+. ++++++.|++++..+++++|..|+++ T Consensus 193 ~g~ig~~~G~~Vi~s~~~p~~t--------------------------------~~l~~~gA~~~~~~~~~~vE~~Rd~~ 240 (274) T protein:vir:96 193 KGAFGEALGAVIVRSNKLNKGE--------------------------------ALLAKKGAVKLITKRDFFLEKDRDAS 240 (274) T ss_pred ecccceecCeeEEEcCCCCcce--------------------------------EEEEeCcceeeeecCCcccccccchh Confidence 9999999999999999998421 26788999999999999999999999 Q ss_pred hhhhhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 314 YQADQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 314 ~~~d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) +++|.|.++++||++++||+++++|+...| T Consensus 241 ~~~d~i~~~~~yg~~~~~~~~vv~~t~~~~ 270 (274) T protein:vir:96 241 RKSTALYSDKHYVAYLYDESKVVKITKGAG 270 (274) T ss_pred hcccEEEEeeEEEEEEEcCccEEEEEcCcc Confidence 999999999999999999999999999999 No 32 >protein:vir:95898 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240385;genbank:gi:66396054;genbank:GeneID:5133409 Probab=100.00 E-value=1.7e-42 Score=249.55 Aligned_cols=263 Identities=16% Similarity=0.145 Sum_probs=216.4 Q ss_pred CCCCCccccccccccccccccchhHHHH-HHHHHHHHHHHHHhhhhccCcccc-ccc--cceEEEEeccCcc-eeeeecC Q lcl|NC_011085. 
1 MADMKGGQQLGKDQGKGQSGGDKLALFL-KVFGGEVLTAFARTSVTTNRHIMR-SIS--SGKSAQFPVLGRT-RAAYLQA 75 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~i-e~~~g~V~~~f~~~s~~~~~~~~~-~i~--~G~tv~i~~iG~~-t~~~~~~ 75 (343) |||..+ +. ..+++ |+|+..|.+.+.+..++.++.... ++. .|++|+||..... .+.+|.. T Consensus 1 m~~~~T------~l---------~d~i~Pev~~~~v~~~~~~~l~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~ 65 (274) T protein:vir:95 1 MAQGMT------KL---------TNQIVPEVLAPMMQAELEKKLRFASFAEIDNTLVGQPGDTLTFPAFIYSGDAKVVAE 65 (274) T ss_pred CCccee------eh---------hheechHHHHHHHHHHHHhhhhccccceecccccCCCCCEEEeeeecCCCccccccC Confidence 877532 21 12454 999999999999999999986544 444 5999999996643 5678999 Q ss_pred CCcCCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccc Q lcl|NC_011085. 76 GQSLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENI 155 (343) Q Consensus 76 g~~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~ 155 (343) |+.++. +.++.++.+++|++. ++.|.|+|++..++..|++++++++++++||+++|+.++..+.++.... T Consensus 66 g~~i~~--~~lt~~~~~~~i~~~-~~a~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~~~~------- 135 (274) T protein:vir:95 66 GEKIPT--DILETKKREAKIRKI-AKGTSISDEALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSAKLTV------- 135 (274) T ss_pred CCccch--hhcccceeEEEeeee-ecceeehHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhcccccc------- Confidence 998865 568999999999884 8899999999999999999999999999999999999987664322110 Q ss_pred cccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccc--hhhhhccccccchh Q lcl|NC_011085. 156 AGLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAAL--MPNAANYAALIDPE 233 (343) Q Consensus 156 ~g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~--~~~~~~~~~~~~~~ 233 (343) . ++.. .++.|.+|..+|++.+. .+||++|+|++|+.|+++. 
+|+..+-.+...++ T Consensus 136 ---------~----~~~~--------~~d~i~~A~~~lgd~~~--~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~ 192 (274) T protein:vir:95 136 ---------E----ADIT--------KLTGLQTAIDKFNDEDL--EPMVLFISPLDAGKLRGDATTNFTRATELGDDVIV 192 (274) T ss_pred ---------c----cccc--------CHHHHHHHHHHhccccc--cccEEEeCHHHHHHHHhhcccccccccccccccee Confidence 0 0011 14556678889988774 7899999999999999985 66766656678899 Q ss_pred cceeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEeeeeccc Q lcl|NC_011085. 234 RGSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERARRAE 313 (343) Q Consensus 234 ~G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~~~~ 313 (343) +|.|++++||+||+||++|.+. ++++++.|++++..+++++|..||++ T Consensus 193 ~G~ig~~~G~~Vi~s~~~~~~t--------------------------------~~l~~~gA~~~~~~~~~~vE~~Rd~~ 240 (274) T protein:vir:95 193 KGAFGEALGAVIVRSNKLEAGT--------------------------------AILAKKGAVKLITKRDFFLETDRDPS 240 (274) T ss_pred ccccceecCeEEEEeCCCCCce--------------------------------EEEEeccceeeeecCCcccccccccc Confidence 9999999999999999998321 25778889999989999999999999 Q ss_pred hhhhhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 314 YQADQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 314 ~~~d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) ++.|.|.+++.||++++||++++.++...| T Consensus 241 ~~~d~i~~~~~y~~~~~~~~~~v~~tk~~~ 270 (274) T protein:vir:95 241 TKTTALYSDKHYVAYLYDESKAVKITKGSG 270 (274) T ss_pred cccCEEEEeEEEEEEEEcCCcEEEEEcCCc Confidence 999999999999999999999999999999 No 33 >protein:vir:96262 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240311;genbank:gi:66395978;genbank:GeneID:5133339 Probab=100.00 E-value=1.7e-42 Score=249.55 Aligned_cols=263 Identities=16% Similarity=0.145 Sum_probs=216.4 Q ss_pred CCCCCccccccccccccccccchhHHHH-HHHHHHHHHHHHHhhhhccCcccc-ccc--cceEEEEeccCcc-eeeeecC Q lcl|NC_011085. 
1 MADMKGGQQLGKDQGKGQSGGDKLALFL-KVFGGEVLTAFARTSVTTNRHIMR-SIS--SGKSAQFPVLGRT-RAAYLQA 75 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~i-e~~~g~V~~~f~~~s~~~~~~~~~-~i~--~G~tv~i~~iG~~-t~~~~~~ 75 (343) |||..+ +. ..+++ |+|+..|.+.+.+..++.++.... ++. .|++|+||..... .+.+|.. T Consensus 1 m~~~~T------~l---------~d~i~Pev~~~~v~~~~~~~l~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~ 65 (274) T protein:vir:96 1 MAQGMT------KL---------TNQIVPEVLAPMMQAELEKKLRFASFAEIDNTLVGQPGDTLTFPAFIYSGDAKVVAE 65 (274) T ss_pred CCccee------eh---------hheechHHHHHHHHHHHHhhhhccccceecccccCCCCCEEEeeeecCCCccccccC Confidence 877532 21 12454 999999999999999999986544 444 5999999996643 5678999 Q ss_pred CCcCCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccc Q lcl|NC_011085. 76 GQSLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENI 155 (343) Q Consensus 76 g~~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~ 155 (343) |+.++. +.++.++.+++|++. ++.|.|+|++..++..|++++++++++++||+++|+.++..+.++.... T Consensus 66 g~~i~~--~~lt~~~~~~~i~~~-~~a~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~~~~------- 135 (274) T protein:vir:96 66 GEKIPT--DILETKKREAKIRKI-AKGTSISDEALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSAKLTV------- 135 (274) T ss_pred CCccch--hhcccceeEEEeeee-ecceeehHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhcccccc------- Confidence 998865 568999999999884 8899999999999999999999999999999999999987664322110 Q ss_pred cccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccc--hhhhhccccccchh Q lcl|NC_011085. 156 AGLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAAL--MPNAANYAALIDPE 233 (343) Q Consensus 156 ~g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~--~~~~~~~~~~~~~~ 233 (343) . ++.. .++.|.+|..+|++.+. .+||++|+|++|+.|+++. 
+|+..+-.+...++ T Consensus 136 ---------~----~~~~--------~~d~i~~A~~~lgd~~~--~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~ 192 (274) T protein:vir:96 136 ---------E----ADIT--------KLTGLQTAIDKFNDEDL--EPMVLFISPLDAGKLRGDATTNFTRATELGDDVIV 192 (274) T ss_pred ---------c----cccc--------CHHHHHHHHHHhccccc--cccEEEeCHHHHHHHHhhcccccccccccccccee Confidence 0 0011 14556678889988774 7899999999999999985 66766656678899 Q ss_pred cceeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEeeeeccc Q lcl|NC_011085. 234 RGSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERARRAE 313 (343) Q Consensus 234 ~G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~~~~ 313 (343) +|.|++++||+||+||++|.+. ++++++.|++++..+++++|..||++ T Consensus 193 ~G~ig~~~G~~Vi~s~~~~~~t--------------------------------~~l~~~gA~~~~~~~~~~vE~~Rd~~ 240 (274) T protein:vir:96 193 KGAFGEALGAVIVRSNKLEAGT--------------------------------AILAKKGAVKLITKRDFFLETDRDPS 240 (274) T ss_pred ccccceecCeEEEEeCCCCCce--------------------------------EEEEeccceeeeecCCcccccccccc Confidence 9999999999999999998321 25778889999989999999999999 Q ss_pred hhhhhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 314 YQADQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 314 ~~~d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) ++.|.|.+++.||++++||++++.++...| T Consensus 241 ~~~d~i~~~~~y~~~~~~~~~~v~~tk~~~ 270 (274) T protein:vir:96 241 TKTTALYSDKHYVAYLYDESKAVKITKGSG 270 (274) T ss_pred cccCEEEEeEEEEEEEEcCCcEEEEEcCCc Confidence 999999999999999999999999999999 No 34 >protein:vir:93742 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240459;genbank:gi:66396126;genbank:GeneID:5133511 Probab=100.00 E-value=3.2e-42 Score=248.09 Aligned_cols=264 Identities=16% Similarity=0.144 Sum_probs=218.0 Q ss_pred CCCCCccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCcccc-cc--ccceEEEEeccCc-ceeeeecCC Q lcl|NC_011085. 
1 MADMKGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMR-SI--SSGKSAQFPVLGR-TRAAYLQAG 76 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~-~i--~~G~tv~i~~iG~-~t~~~~~~g 76 (343) |||..+ +.. +-+..|+|+..|.+.+.+..++.++.... ++ +.|++|+||++.. ..+++|..| T Consensus 1 ma~~~T------~~~--------~~iiPev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~ip~~~~~g~~~~~~eg 66 (274) T protein:vir:93 1 MPQGIT------KTS--------NQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEG 66 (274) T ss_pred CCccce------ehh--------heechHHHHHHHHHHHHhhhhhcccccccccccCCCCCEEEEEeeccCCCcccccCC Confidence 887533 221 11344999999999999999999888664 33 3599999999765 367899999 Q ss_pred CcCCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccc Q lcl|NC_011085. 77 QSLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENIA 156 (343) Q Consensus 77 ~~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~ 156 (343) +.++. ..++.++.+++|++. ++.|.|+|++..++..|++++.+++++++|++++|+.++..+.++.... T Consensus 67 ~~i~~--~~it~~~~~~~i~~~-~~~~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~~d~~~~~~~~~a~~~~-------- 135 (274) T protein:vir:93 67 EKIPT--DILETKKREAKIRKI-AKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTV-------- 135 (274) T ss_pred Ccccc--cccccceeEEEeeee-cccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhcccccc-------- Confidence 99875 468899999999885 6899999999999999999999999999999999999987764322110 Q ss_pred ccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccc--hhhhhccccccchhc Q lcl|NC_011085. 157 GLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAAL--MPNAANYAALIDPER 234 (343) Q Consensus 157 g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~--~~~~~~~~~~~~~~~ 234 (343) .++.++ ++.|.+|..+|+++++ ++||++|+|++|+.|+++. 
+|+.....++..+++ T Consensus 136 ------------~~~~~~--------~d~i~dA~~~l~d~~~--~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~ 193 (274) T protein:vir:93 136 ------------NADITK--------LNGLQSAIDKFNDEDL--EPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVK 193 (274) T ss_pred ------------cccccC--------HHHHHHHHHHhhhccC--CccEEEeCHHHHHHHHhhhhhcccccccccccceee Confidence 011111 4556677889998875 6899999999999999885 566666666778999 Q ss_pred ceeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEeeeeccch Q lcl|NC_011085. 235 GSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERARRAEY 314 (343) Q Consensus 235 G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~~~~~ 314 (343) |.|++++||+|++||++|.+. ++++++.|++++..+++++|..|++++ T Consensus 194 G~ig~~~G~~Vi~s~~~p~~t--------------------------------~~l~~~gai~~~~~~~~~vE~~Rd~~~ 241 (274) T protein:vir:93 194 GAFGEALGAIIVRTNKLEAGT--------------------------------AILAKKGAVKLILKRDFFLEVARDAST 241 (274) T ss_pred cccceecCeeEEEcCCCCcce--------------------------------EEEEeCCeEEEEecCCcccccccchhh Confidence 999999999999999998321 268899999999999999999999999 Q ss_pred hhhhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 315 QADQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 315 ~~d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) ++|.|+++++||++++||++++.++...| T Consensus 242 ~~d~i~~~~~y~~~~~~~~~~v~~t~~~~ 270 (274) T protein:vir:93 242 KTTALYSDKHYVAYLYDESKAVKITKGSG 270 (274) T ss_pred cccEEEEEEEEEEEEEcCCceEEEeeCcc Confidence 99999999999999999999999999999 No 35 >protein:vir:1239 Length: 274 # NCBI annotation: similar to phage B1 major head protein # Family: family:all:522 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510938;genbank:gi:17426272;genbank:GeneID:927376 Probab=100.00 E-value=5.6e-42 Score=246.75 Aligned_cols=264 Identities=17% Similarity=0.146 Sum_probs=217.4 Q ss_pred CCCCCccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCcccc-cc--ccceEEEEeccCcc-eeeeecCC Q lcl|NC_011085. 
1 MADMKGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMR-SI--SSGKSAQFPVLGRT-RAAYLQAG 76 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~-~i--~~G~tv~i~~iG~~-t~~~~~~g 76 (343) |||.. |+. .+-+..|+|+..|.+.|.+..++.+++... ++ ++|++|+||..+.. .+.+|..| T Consensus 1 ma~~~------T~l--------~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~ig~a~~~~~g 66 (274) T protein:vir:12 1 MAQGL------TKT--------SNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEG 66 (274) T ss_pred CCcce------eeh--------hhhhchHHHHHHHHHHHHhhhhhcccceecccccCCCCCEEEEeeecCCCccccccCC Confidence 87753 221 122344999999999999999999988775 33 45999999996643 56789999 Q ss_pred CcCCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccc Q lcl|NC_011085. 77 QSLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENIA 156 (343) Q Consensus 77 ~~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~ 156 (343) +.++. +.++.++.+++|++ .++.|.|+|++..++..|++++++++++++||+++|+.++..+.++.... T Consensus 67 ~~i~~--~~lt~~~~~~~i~~-~~~~~~i~D~~~~~~~~d~~~~~~~q~~~~~a~~vd~~~l~~~~~a~~~~-------- 135 (274) T protein:vir:12 67 EKIPT--DILETKKREAKIRK-IAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTV-------- 135 (274) T ss_pred Cccch--hhcccceeeEEeee-ecceeeecHHHHHhcccchHHHHHHHHHHHHHHHHHHHHHHHHhcccccc-------- Confidence 98865 56899999999988 58899999999999999999999999999999999999987765322110 Q ss_pred ccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccc--hhhhhccccccchhc Q lcl|NC_011085. 157 GLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAAL--MPNAANYAALIDPER 234 (343) Q Consensus 157 g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~--~~~~~~~~~~~~~~~ 234 (343) . .... . ++.|.+|..+|++++. .+||++|+|++|..|+++. 
+|+...-.+...+++ T Consensus 136 ----------~--~~a~----~----~d~i~dA~~~lgd~~~--~~~~ivv~p~~~~~L~k~~~~~fv~~s~~g~~~~~~ 193 (274) T protein:vir:12 136 ----------N--ADIT----K----LNGLQSAIDKFNDEDL--EPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVK 193 (274) T ss_pred ----------c--cccc----C----HHHHHHHHHHhccccc--cccEEEeCHHHHHHHHhhhhhhccccccccccceec Confidence 0 0111 1 4566677888988764 7899999999999999985 677766566778999 Q ss_pred ceeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEeeeeccch Q lcl|NC_011085. 235 GSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERARRAEY 314 (343) Q Consensus 235 G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~~~~~ 314 (343) |.|++++||+|++||++|.+. +.++++.|++++..+++++|..||+++ T Consensus 194 G~ig~~~G~~Vi~s~~~p~~t--------------------------------~~l~~~gA~~~~~~~~~~vE~~Rd~~~ 241 (274) T protein:vir:12 194 GAFGEALGAIIVRSNKLEAGT--------------------------------AILAKKGAVKLILKRDFFLEVARDAST 241 (274) T ss_pred ccceeecCeeEEEeCCCCcce--------------------------------EEEEeccceeeeecCCceeccccchhh Confidence 999999999999999998421 257788899999999999999999999 Q ss_pred hhhhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 315 QADQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 315 ~~d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) +.|.|.+++.||++++||+++++++...| T Consensus 242 ~~d~i~~~~~y~~~~~~~~~vv~~t~~~~ 270 (274) T protein:vir:12 242 KTTALYSDKHYVAYLYDESKAVKITKGSG 270 (274) T ss_pred cccEEEeeeEEEEEEEcCCceEEEEcCCc Confidence 99999999999999999999999999988 No 36 >protein:vir:97433 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240749;genbank:gi:66396420;genbank:GeneID:5133789 Probab=100.00 E-value=9.1e-42 Score=245.59 Aligned_cols=264 Identities=16% Similarity=0.148 Sum_probs=217.7 Q ss_pred CCCCCccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCcccc-cc--ccceEEEEeccCcc-eeeeecCC Q lcl|NC_011085. 
1 MADMKGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMR-SI--SSGKSAQFPVLGRT-RAAYLQAG 76 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~-~i--~~G~tv~i~~iG~~-t~~~~~~g 76 (343) |||.. |+. + +-+..|+|+..|.+.+.+..++.++.... ++ ++|++|+||+++.. .+.+|..| T Consensus 1 ma~~~------T~~------~--d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~~g~a~~~~~g 66 (274) T protein:vir:97 1 MPQGL------TKT------S--DQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEG 66 (274) T ss_pred CCccc------eeh------h--heechHHHHHHHHHhhhhhhhhcccceecccccCCCCCEEEEeeecCCCccccccCC Confidence 88753 221 1 22344999999999999999999888765 33 45999999997643 56789999 Q ss_pred CcCCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccc Q lcl|NC_011085. 77 QSLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENIA 156 (343) Q Consensus 77 ~~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~ 156 (343) +.++. ..++.++.+++|++. ++.|.|+|++..++..|++++++++++++|++++|+.++..+.++.... T Consensus 67 ~~i~~--~~lt~~~~~~~i~~~-~~~~~i~D~~~~~~~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a~~~~-------- 135 (274) T protein:vir:97 67 EKIPT--DILETKKREAKIRKI-AKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTV-------- 135 (274) T ss_pred Ccccc--cccccceeEEEeeee-cceecccHHHHHhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccCccc-------- Confidence 98865 468899999999885 6889999999999999999999999999999999999987764322110 Q ss_pred ccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccc--hhhhhccccccchhc Q lcl|NC_011085. 157 GLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAAL--MPNAANYAALIDPER 234 (343) Q Consensus 157 g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~--~~~~~~~~~~~~~~~ 234 (343) .++..+ ++.|.+|..+|++++. .+||++|+|++|..|+++. 
+|+...-.++..+++ T Consensus 136 ------------~~~~~~--------~d~i~dA~~~l~d~~~--~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~ 193 (274) T protein:vir:97 136 ------------NADITK--------LNGLQSAIDKFNDEDL--EPMVLFVNPLDAGKLRGDASTNFTRATELGDDIIVK 193 (274) T ss_pred ------------cccccC--------HHHHHHHHHHhhccCC--CceEEEeCHHHHHHHHhhhhhhccccCcccccceec Confidence 011111 4567778889998875 6899999999999999885 677766667778899 Q ss_pred ceeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEeeeeccch Q lcl|NC_011085. 235 GSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERARRAEY 314 (343) Q Consensus 235 G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~~~~~ 314 (343) |.|++++||+|++||++|.+. +.++++.|++++..+++++|..||+++ T Consensus 194 G~ig~~~G~~Vi~s~~~p~~t--------------------------------~~l~~~gA~~~~~~~~~~vE~~Rd~~~ 241 (274) T protein:vir:97 194 GAFGEALGAIIVRTNKLEAGT--------------------------------AILAKKGAVKLILKRDFFLEVARDAST 241 (274) T ss_pred cccceecCeeEEEcCCCCcce--------------------------------EEEEeCcceEeeecCCceeccccchhh Confidence 999999999999999998321 267889999999999999999999999 Q ss_pred hhhhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 315 QADQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 315 ~~d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) +.|.|.+++.||+++++|++++.++.+.| T Consensus 242 ~~d~i~~~~~y~~~~~~~~~vv~~t~~~~ 270 (274) T protein:vir:97 242 KTTALYSDKHYVAYLYDESKAVKITKGSG 270 (274) T ss_pred cccEEEEEEEEEEEEEcCCceEEEecCcc Confidence 99999999999999999999999999988 No 37 >protein:vir:94494 Length: 274 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240676;genbank:gi:66396348;genbank:GeneID:5133758 Probab=100.00 E-value=9.1e-42 Score=245.59 Aligned_cols=264 Identities=16% Similarity=0.148 Sum_probs=217.7 Q ss_pred CCCCCccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCcccc-cc--ccceEEEEeccCcc-eeeeecCC Q lcl|NC_011085. 
1 MADMKGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMR-SI--SSGKSAQFPVLGRT-RAAYLQAG 76 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~-~i--~~G~tv~i~~iG~~-t~~~~~~g 76 (343) |||.. |+. + +-+..|+|+..|.+.+.+..++.++.... ++ ++|++|+||+++.. .+.+|..| T Consensus 1 ma~~~------T~~------~--d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~~g~a~~~~~g 66 (274) T protein:vir:94 1 MPQGL------TKT------S--DQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEG 66 (274) T ss_pred CCccc------eeh------h--heechHHHHHHHHHhhhhhhhhcccceecccccCCCCCEEEEeeecCCCccccccCC Confidence 88753 221 1 22344999999999999999999888765 33 45999999997643 56789999 Q ss_pred CcCCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccc Q lcl|NC_011085. 77 QSLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENIA 156 (343) Q Consensus 77 ~~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~ 156 (343) +.++. ..++.++.+++|++. ++.|.|+|++..++..|++++++++++++|++++|+.++..+.++.... T Consensus 67 ~~i~~--~~lt~~~~~~~i~~~-~~~~~i~D~~~~~~~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a~~~~-------- 135 (274) T protein:vir:94 67 EKIPT--DILETKKREAKIRKI-AKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTV-------- 135 (274) T ss_pred Ccccc--cccccceeEEEeeee-cceecccHHHHHhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccCccc-------- Confidence 98865 468899999999885 6889999999999999999999999999999999999987764322110 Q ss_pred ccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccc--hhhhhccccccchhc Q lcl|NC_011085. 157 GLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAAL--MPNAANYAALIDPER 234 (343) Q Consensus 157 g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~--~~~~~~~~~~~~~~~ 234 (343) .++..+ ++.|.+|..+|++++. .+||++|+|++|..|+++. 
+|+...-.++..+++ T Consensus 136 ------------~~~~~~--------~d~i~dA~~~l~d~~~--~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~ 193 (274) T protein:vir:94 136 ------------NADITK--------LNGLQSAIDKFNDEDL--EPMVLFVNPLDAGKLRGDASTNFTRATELGDDIIVK 193 (274) T ss_pred ------------cccccC--------HHHHHHHHHHhhccCC--CceEEEeCHHHHHHHHhhhhhhccccCcccccceec Confidence 011111 4567778889998875 6899999999999999885 677766667778899 Q ss_pred ceeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEeeeeccch Q lcl|NC_011085. 235 GSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERARRAEY 314 (343) Q Consensus 235 G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~~~~~ 314 (343) |.|++++||+|++||++|.+. +.++++.|++++..+++++|..||+++ T Consensus 194 G~ig~~~G~~Vi~s~~~p~~t--------------------------------~~l~~~gA~~~~~~~~~~vE~~Rd~~~ 241 (274) T protein:vir:94 194 GAFGEALGAIIVRTNKLEAGT--------------------------------AILAKKGAVKLILKRDFFLEVARDAST 241 (274) T ss_pred cccceecCeeEEEcCCCCcce--------------------------------EEEEeCcceEeeecCCceeccccchhh Confidence 999999999999999998321 267889999999999999999999999 Q ss_pred hhhhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 315 QADQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 315 ~~d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) +.|.|.+++.||+++++|++++.++.+.| T Consensus 242 ~~d~i~~~~~y~~~~~~~~~vv~~t~~~~ 270 (274) T protein:vir:94 242 KTTALYSDKHYVAYLYDESKAVKITKGSG 270 (274) T ss_pred cccEEEEEEEEEEEEEcCCceEEEecCcc Confidence 99999999999999999999999999988 No 38 >protein:vir:96833 Length: 275 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240157;genbank:gi:66395822;genbank:GeneID:5133174 Probab=100.00 E-value=8.4e-42 Score=245.77 Aligned_cols=265 Identities=16% Similarity=0.158 Sum_probs=215.0 Q ss_pred CCCCCccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCcccc-ccc--cceEEEEeccCcc-eeeeecCC Q lcl|NC_011085. 
1 MADMKGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMR-SIS--SGKSAQFPVLGRT-RAAYLQAG 76 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~-~i~--~G~tv~i~~iG~~-t~~~~~~g 76 (343) ||.. +. |+. .+-+..|+|+..|.+.+.+..+|.++.... ++. .|++|+||..... .+.+|..| T Consensus 1 ~~~~---~~--T~l--------~d~i~PEv~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~g 67 (275) T protein:vir:96 1 MALE---NM--TKL--------ANMVNPEVLAPMMQAELDKKLKFAQFADIDNTLVGQPGNTITFPAFVYSGDAKVVPEG 67 (275) T ss_pred CCCc---cc--chh--------hhhhchHHHHHHHHHHHHHhhhhcccceecccccCCCCCEEEeeeeccCCccccccCC Confidence 4443 11 221 121345999999999999999999997654 343 5999999997653 56789999 Q ss_pred CcCCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccc Q lcl|NC_011085. 77 QSLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENIA 156 (343) Q Consensus 77 ~~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~ 156 (343) +.|+. ..++.++.+.+|.+ .++.|.|+|++..++..|++.+++++++++||+++|+.++..+.++.... T Consensus 68 ~~i~~--~~lt~~~~~~~i~~-~~~~~~i~D~~~~~~~~d~~~~~~~~~a~~~a~~~d~~ll~~l~~a~~~~-------- 136 (275) T protein:vir:96 68 EEIPI--DLIETKKRQATIRK-IGKGTVLTDEALLSGYGDPKGEAVRQHGLAIANKVDNDVLEALQGATLKV-------- 136 (275) T ss_pred CCcch--hhcccceeeEEeeh-hcccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhcccccc-------- Confidence 99875 46888999999976 58999999999999999999999999999999999999987664322110 Q ss_pred ccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccc--hhhhhccccccchhc Q lcl|NC_011085. 157 GLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAAL--MPNAANYAALIDPER 234 (343) Q Consensus 157 g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~--~~~~~~~~~~~~~~~ 234 (343) .+... .++.|.+|..+|.+.+. ++||++|+|++|..|+++. 
+|+..+..++..+++ T Consensus 137 ------------~~~~~--------~~d~i~dA~~~lgd~~~--~~~~ivv~p~~~~~L~k~~~~~f~~~~~~g~~~~~~ 194 (275) T protein:vir:96 137 ------------EADIT--------KLAGLQTAIDKFNDEDL--EPMVLFVNPLDAGKLRASATDNFTRATLLGDNVIVK 194 (275) T ss_pred ------------ccccc--------CHHHHHHHHHHhccccC--CccEEEeCHHHHHHHHhcccccccccccccccceec Confidence 00111 14566778888987764 7899999999999998874 677777777788999 Q ss_pred ceeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEeeeeccch Q lcl|NC_011085. 235 GSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERARRAEY 314 (343) Q Consensus 235 G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~~~~~ 314 (343) |.|++++||+|++||++|.+. ++++++.|++++..+++++|..|++++ T Consensus 195 G~ig~~~G~~Vi~s~~~p~~t--------------------------------~~i~~~gA~~~~~~~~~~vE~~Rd~~~ 242 (275) T protein:vir:96 195 GAFGEALGAIIVRSNKIKEGE--------------------------------AILAKRGAVKLITKRDFFLETERHASH 242 (275) T ss_pred cccceecCeeEEEeCCCCcce--------------------------------EEEEeccceeeeecCCcccccccchhh Confidence 999999999999999998421 257788999999999999999999999 Q ss_pred hhhhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 315 QADQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 315 ~~d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) +.|.|++++.||++++||++++.++++.+ T Consensus 243 ~~d~i~~~~~y~~~~~~~~~vv~~t~~~~ 271 (275) T protein:vir:96 243 KSTALFSDKHYVAYLYDESKVVKITKSAS 271 (275) T ss_pred cCcEEEEeEEEEEEEEcCccEEEEEeccc Confidence 99999999999999999999999998877 No 39 >protein:vir:99075 Length: 392 # NCBI annotation: gp30 # Family: family:all:10837 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655895;genbank:gi:109521467;genbank:GeneID:4158040 Probab=100.00 E-value=2.7e-41 Score=243.02 Aligned_cols=284 Identities=12% Similarity=0.086 Sum_probs=186.3 Q ss_pred CCCCCccccccccccccccccchhHHHH-HHHHHHHHHHHHHhhhhccCcccc---ccc--cceEEEEeccCcceeeeec Q lcl|NC_011085. 
1 MADMKGGQQLGKDQGKGQSGGDKLALFL-KVFGGEVLTAFARTSVTTNRHIMR---SIS--SGKSAQFPVLGRTRAAYLQ 74 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~i-e~~~g~V~~~f~~~s~~~~~~~~~---~i~--~G~tv~i~~iG~~t~~~~~ 74 (343) |||+ +|+ |+|+.++++.|+++.+|.++++.. +++ .|++|+|++.+..++++|+ T Consensus 1 Ma~~---------------------~~~p~~~a~~~l~~l~~~lv~~~lv~~~~~~~~~~~~GdtV~i~~~~~~~~~~~~ 59 (392) T protein:vir:99 1 MANA---------------------FSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRK 59 (392) T ss_pred Cccc---------------------cccHHHHHHHHHHHHHhhccchhhhccccccccccCCCCeEEEeecccccceeee Confidence 5542 355 899999999999999999998654 554 5999999999999999987 Q ss_pred C-----CCcCCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccc Q lcl|NC_011085. 75 A-----GQSLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPA 149 (343) Q Consensus 75 ~-----g~~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~ 149 (343) + +.++. .+++.+++++++||+.+|++|.|+|.|+.|...|++.++.++++++||+++|+.++..+..+.... T Consensus 60 ~~~~~~~~~~~--~~~~~~~~~~~~id~~k~~~~~i~d~e~~~~~~~~~~~~~~~a~~ala~~vd~~i~~~~~~a~~~~- 136 (392) T protein:vir:99 60 LRGAGAERNLT--VSDFTEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEA- 136 (392) T ss_pred ccccccCCccc--ccccccceEEEEEeeeeecceeechHHHhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccc- Confidence 5 33333 357888999999999999999999999999999999999999999999999999987665322110 Q ss_pred cccccccccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhccccc Q lcl|NC_011085. 150 ASNENIAGLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAAL 229 (343) Q Consensus 150 ~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~ 229 (343) .......+|.. .++.|.+|+++|++++||. 
|||+||+|++|+.|+++++|.+.++.++ T Consensus 137 -----------------~~~~~~~~~~~----~~~~i~~a~~~L~~~~vP~-~R~~vv~p~~~~~l~~~~~~~~~~~~g~ 194 (392) T protein:vir:99 137 -----------------AGAVHEVAPDE----FFKGVNGARRALNELYIPQ-GRVLVVGTAVTEQILNDDRFIKYESQGQ 194 (392) T ss_pred -----------------cccccccChhh----hHHHHHHHHHHHhhcCCCC-CCEEEEcHHHHHHHhcccceeecccccc Confidence 11122234433 4566778899999999996 8999999999999999999998877654 Q ss_pred ---cchhcceeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEE Q lcl|NC_011085. 230 ---IDPERGSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSL 306 (343) Q Consensus 230 ---~~~~~G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~ 306 (343) ..+++|.||+++||+||+|+++|..........................+........ ++ -... T Consensus 195 ~~~~~l~~G~vg~i~G~~v~~s~~~~~~t~~a~~~~a~~~at~a~v~~~~~~~~~s~s~~~-------~v------~~~~ 261 (392) T protein:vir:99 195 SAVSALQEARLGRIYGYEIVESTLIPHGDAYLYHPTAFIMATRAPAPPMGAVRSTAISGDQ-------RI------AMRW 261 (392) T ss_pred hhhhhhhcceeeeeeeeEEEeecccccccceeeeccccccccccccccccccceeEEeccc-------ce------ecce Confidence 4589999999999999999999976543322211111111100000000000000000 00 0011 Q ss_pred eeeeccchhhhhhhhhhhhccceecccceEEE-------------------------EecCC Q lcl|NC_011085. 
307 ERARRAEYQADQIIARYAMGHGGLRPEAAGAL-------------------------VFTAG 343 (343) Q Consensus 307 e~~~~~~~~~d~i~~~~~~G~~v~rpe~~~~i-------------------------~~~~g 343 (343) -..++.....+........|.+.+.......+ .+..| T Consensus 262 ~~~~~~t~~s~~~~v~~~~g~~~v~~~~~~~~~~~~~~~~~~~~v~v~~v~~~~~~~~~~~~ 323 (392) T protein:vir:99 262 LVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKIHLIPGSIEVAPEAGANATITAAAG 323 (392) T ss_pred eecccceeeccccccceeEEEEEEeeccccceeeeeeeeeecceeeeeeeecccceeEeeec Confidence 11122333333332222333333322211111 11111 No 40 >protein:vir:3525 Length: 423 # NCBI annotation: major head protein # Family: family:all:1412 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050985;genbank:gi:9633571;genbank:GeneID:1262318 Probab=100.00 E-value=8.5e-41 Score=240.28 Aligned_cols=296 Identities=14% Similarity=0.132 Sum_probs=210.1 Q ss_pred CCCCCccccccccccccccccchhHHHH-HHHHHHHHHHHHHhhhhccCcccc---cc---ccceEEEEeccCcceeeee Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQSGGDKLALFL-KVFGGEVLTAFARTSVTTNRHIMR---SI---SSGKSAQFPVLGRTRAAYL 73 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~i-e~~~g~V~~~f~~~s~~~~~~~~~---~i---~~G~tv~i~~iG~~t~~~~ 73 (343) |||+ ...|| |+|+.++++.|+++.++.++++.. ++ +.|+||+|++.++.++++| T Consensus 1 MAN~-------------------llT~iP~iia~~al~~l~~~lV~~~lV~r~y~ge~~~a~~GDTV~I~~p~~~~v~d~ 61 (423) T protein:vir:35 1 MANN-------------------LESNISQIVLKKFLPGFMSDIVLCKTVDRQLLSGEINSNTGDSVSFKRPHQFKSERT 61 (423) T ss_pred Cccc-------------------hhhhhHHHHHHHHHHHHHhhcccchhcccCCCcccccccCCCEEEEeeCCcceeecc Confidence 6643 22354 999999999999999999998764 34 3499999999999999999 Q ss_pred cCC--CcCCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccc Q lcl|NC_011085. 74 QAG--QSLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAAS 151 (343) Q Consensus 74 ~~g--~~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~ 151 (343) .++ ..+. ++++...++.|+||+.+|++|.++|.|+.|..-|+ ..+.+.++++|++.+|+.++..+...+.. 
T Consensus 62 ~~~~~~~~~--~~~~~e~~v~l~id~~k~~a~~v~d~e~~l~i~~~-~~~l~~a~~ala~~vd~~l~~~l~~~a~~---- 134 (423) T protein:vir:35 62 ETGDITGKD--KNGLFSAKATGKVGKYITVAVEWTQIEEALKLNQL-DQILSPIHERMVTDLETELAHFMMNNGAL---- 134 (423) T ss_pred cCcCCCCcc--ccccccceeeEEeccceeccceeCHHHHHhhHHHH-HHHHHHHHHHHHHHHHHHHHHHHhhcccc---- Confidence 764 3444 35677788999999999999999999999988888 46778889999999999998766543210 Q ss_pred cccccccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccch-hhhhcccccc Q lcl|NC_011085. 152 NENIAGLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALM-PNAANYAALI 230 (343) Q Consensus 152 ~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~-~~~~~~~~~~ 230 (343) .. ++.+ ++.. .++.+.+++.+|++++||..|||+||+|++|..|++++. +.+.+..++. T Consensus 135 ---~v----------gt~~---t~~~----~~~~i~~a~~~Ld~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~~ 194 (423) T protein:vir:35 135 ---SL----------GSPN---TAIK----KWADVAQTASFIKDIGIKTGENYAIMDPWSAQRLADAQSGLHAADQLVRT 194 (423) T ss_pred ---cc----------cccc---CCcc----hHHHHHHHHHHHHHhcCCcCCCEEEeCHHHHHHHhccccceeccccchhH Confidence 00 0101 1111 156788899999999999999999999999999997665 5555555667 Q ss_pred chhccee-EEEeceEEEEecccccccccccccccccccc--------------------------------ccccccc-- Q lcl|NC_011085. 231 DPERGSI-RNVMGFEVVEVPHLTAGGAGDDREDETTNQK--------------------------------HAFPKTA-- 275 (343) Q Consensus 231 ~~~~G~V-~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~--------------------------------~~~~~~~-- 275 (343) .+++|.| |+++||+||+||++|.....+.....+...+ -.+.-++ T Consensus 195 alr~g~i~G~i~GFdv~~Snnvp~~T~gt~~~~~~v~~a~~v~~~a~~~~~~~~~~~~~~~~~~~g~l~~GD~~t~aGv~ 274 (423) T protein:vir:35 195 AWENAQISGNFGGIRALMSNGLASRKQGDFDGAITVKTAPNVDYLSVKDSYQFTVALTGATPSKTGFLKAGDQLKFTSTH 274 (423) T ss_pred HHhhccceeeecceEEEEcCCCccccccccccceeeccccccccccccccccceeeeeeeeeccCCcEEecceEEeeeee Confidence 7999876 9999999999999996432221111000000 0000000 Q ss_pred -------------------------------ccccc----------------------------------ccccceEeEe Q lcl|NC_011085. 
276 -------------------------------EGDTK----------------------------------VALDNVVGLF 290 (343) Q Consensus 276 -------------------------------~~~~~----------------------------------~~~~~~~~l~ 290 (343) .+.+. ...+..+.|+ T Consensus 275 ~v~~~t~~~~~~~~t~~~~~~~V~~~~~~~a~g~~~v~i~p~~~~~~~~~~~~~v~a~~a~~~~vt~~~~a~~~~~~nl~ 354 (423) T protein:vir:35 275 WLNQQSKQTLYNGSTAMSFTATVLEETNSTASGDVTVKLSGVPIYDEKNSQYNAVDAKVKAGDAVSIIGTAKQQMKPNLF 354 (423) T ss_pred eccccccceeecccCCceeEEEEeccccccccCceeEEccccccccCCCcccccccccccCCceeeeeecCCCceeEEEe Confidence 00000 0001225689 Q ss_pred echhhheeeeee-----------------eeEEeeeeccchhhhhhhhhhhhccceecccceEEEEecC Q lcl|NC_011085. 291 QHRSAVGTVKLK-----------------DLSLERARRAEYQADQIIARYAMGHGGLRPEAAGALVFTA 342 (343) Q Consensus 291 ~~~~Av~~~~~~-----------------~~~~e~~~~~~~~~d~i~~~~~~G~~v~rpe~~~~i~~~~ 342 (343) |||+|+..+... .+++..+||.+..-+.++.-..||.+.+|||.++.+.-.- T Consensus 355 ~~~~a~~l~~~~l~~~~~~~~~~~~~~g~s~r~~~~~d~~~~~~~~r~d~l~g~~~~~p~~~~~~~g~~ 423 (423) T protein:vir:35 355 YNKFFCGLGTIPLPKLHSLDSAVATYEGFSIRVHKYADGDANKQMMRFDLLPAYVCFNPHMGGQFFGNP 423 (423) T ss_pred ecCceeEEEEEccccCCccceeeccccCceEEEEEeeccccCceEEEEEeecceeeecccceEEEEecC Confidence 999988876543 3344455677666777888888999999999997776544 No 41 >protein:vir:174 Length: 423 # NCBI annotation: capsid protein # Family: family:all:1412 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112079;genbank:gi:13559869;genbank:GeneID:920999 Probab=100.00 E-value=2e-40 Score=238.23 Aligned_cols=298 Identities=12% Similarity=0.095 Sum_probs=208.1 Q ss_pred CCCCCccccccccccccccccchhHHH-HHHHHHHHHHHHHHhhhhccCcccc---cc---ccceEEEEeccCcceeeee Q lcl|NC_011085. 
1 MADMKGGQQLGKDQGKGQSGGDKLALF-LKVFGGEVLTAFARTSVTTNRHIMR---SI---SSGKSAQFPVLGRTRAAYL 73 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~-ie~~~g~V~~~f~~~s~~~~~~~~~---~i---~~G~tv~i~~iG~~t~~~~ 73 (343) |||+ ...| .++|+.++++.|+++.++.++++.+ ++ +.|+||+|++.+..++++| T Consensus 1 MaN~-------------------llT~ip~iia~~al~~l~~~lV~~~lVnr~y~~e~~~~k~GDTV~I~~p~~~~~~~~ 61 (423) T protein:vir:17 1 MPNN-------------------LDSNVSQIVLKKFLPGFMSDLVLAKTVDRQLLAGEINSSTGDSVSFKRPHQFSSLRT 61 (423) T ss_pred Cccc-------------------hhhhhHHHHHHHHHHHHHhhcccchhhcccCCcchhhcccCCEEEEeeCCcceeecc Confidence 5544 1234 4999999999999999999998764 33 3599999999999999999 Q ss_pred cCCCcCCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccc Q lcl|NC_011085. 74 QAGQSLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNE 153 (343) Q Consensus 74 ~~g~~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~ 153 (343) +......-+.+++...++.|+||+.||++|.++|.|+.+.--|+ +++.+.++++||+.+|+.++..+.+.+... T Consensus 62 ~~~~~~~~~~~~l~e~~v~l~id~~k~va~~v~d~E~~~~i~~~-~~~l~~A~~aLA~~vd~~ia~~~~~~a~~~----- 135 (423) T protein:vir:17 62 PTGDISGQNKNNLISGKATGRVGNYITVAVEYQQLEEAIKLNQL-EEILAPVRQRIVTDLETELAHFMMNNGALS----- 135 (423) T ss_pred cCcccCCcccCccccceeEEEeeceeeeeeeecHHHHhcChhHH-HHHHHHHHHHHHHHHHHHHHHHHhhccccc----- Confidence 75332111346777888999999999999999999998765555 889999999999999999987754432110 Q ss_pred cccccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchh-hhhccccccch Q lcl|NC_011085. 154 NIAGLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMP-NAANYAALIDP 232 (343) Q Consensus 154 ~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~-~~~~~~~~~~~ 232 (343) . ++.+. .+ . 
.++.+.+++.+|++++||.+|||+||+|++|..|++++.+ ...+-.++..+ T Consensus 136 --~----------gt~~t--~~-~----a~~~i~~a~~~Ld~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~~al 196 (423) T protein:vir:17 136 --L----------GSPNT--PI-T----KWSDVAQTASFLKDLGVNEGENYAVMDPWSAQRLADAQTGLHASDQLVRTAW 196 (423) T ss_pred --c----------ccCCc--cc-c----cHHHHHHHHHHHHhccCCcCCCEEEeChHHHHHHhccccceecccccchHHH Confidence 0 00011 00 1 1567888999999999999999999999999999987754 44455566779 Q ss_pred hccee-EEEeceEEEEecccccccccccccc--------ccc-------------------------------ccc---- Q lcl|NC_011085. 233 ERGSI-RNVMGFEVVEVPHLTAGGAGDDRED--------ETT-------------------------------NQK---- 268 (343) Q Consensus 233 ~~G~V-~~i~Gf~V~~sn~lp~~~~~~~~~~--------~~~-------------------------------~~~---- 268 (343) ++|.| |+++||+||+||++|.....+.... .+. ..+ T Consensus 197 r~g~i~G~i~GFdvy~Snnip~~T~gt~~~t~~~~~~~~v~~~a~~~~~~~~~~~~~~~~~~~g~l~~GD~~t~aGv~~v 276 (423) T protein:vir:17 197 ENAQIPTNFGGIRALMSNGLASRTQGAFGGTLTVKTQPTVTYNAVKDSYQFTVTLTGATTSVTGFLKAGDQVKFTNTYWL 276 (423) T ss_pred hhccceeeecceEEEEeCCCccccccceeceeeecccccccccccccccceeeeeeeeeeeccCceeecceEEecceeee Confidence 99987 8999999999999995322221100 000 000 Q ss_pred ----------------cccccc------ccccc--------------------c--------------ccccceEeEeec Q lcl|NC_011085. 269 ----------------HAFPKT------AEGDT--------------------K--------------VALDNVVGLFQH 292 (343) Q Consensus 269 ----------------~~~~~~------~~~~~--------------------~--------------~~~~~~~~l~~~ 292 (343) ..+.-. ..+.. . ...+..+.|+|| T Consensus 277 ~~~tk~v~~~~~t~~~~~~~v~~~~~~~a~~~~tv~i~p~~i~~~~~~~~~~v~a~~a~~~~vT~~~~a~~t~~~nl~~~ 356 (423) T protein:vir:17 277 QQQTKQALYNGATPISFTATVTADANSDSSGDVTVTLSGVPIYDTTNPQYNSVSRQVAAGDAVSVVGTASQTMKPNLFYN 356 (423) T ss_pred cccccccccccccccceEEEEEecccccccCceEEEecCccccccCCcccccceecccCCceeeccccccCCeeEEEEec Confidence 000000 00000 0 001123458999 Q ss_pred hhhheeeee-----------------eeeEEeeeeccchhhhhhhhhhhhccceecccceEEEEecC Q lcl|NC_011085. 
293 RSAVGTVKL-----------------KDLSLERARRAEYQADQIIARYAMGHGGLRPEAAGALVFTA 342 (343) Q Consensus 293 ~~Av~~~~~-----------------~~~~~e~~~~~~~~~d~i~~~~~~G~~v~rpe~~~~i~~~~ 342 (343) |+|+..+.. ..+++-.+||.+..-+.++.-..||.+.+|||.++.+.-.- T Consensus 357 ~~a~~l~~~pl~~~~~~~~~~~~~~g~s~r~~~~~d~~~~~~~~r~d~l~g~~~~~p~~~~~~~g~~ 423 (423) T protein:vir:17 357 KFFCGLGSIPLPKLHSIDSAVATYEGFSIRVHKYADGDANVQKMRFDLLPAYVCFNPHMGGQFFGNP 423 (423) T ss_pred CcceEEEEEcccCCCccceeecccCCcEEEEEEecccccceeEEEEEeecceeeeccceEEEEEecC Confidence 999887653 33344445666666666888888999999999997776555 No 42 >protein:vir:105374 Length: 423 # NCBI annotation: gene 5 protein # Family: family:all:1412 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958181;genbank:gi:41057283;genbank:GeneID:2716621 Probab=100.00 E-value=4.1e-40 Score=236.52 Aligned_cols=298 Identities=12% Similarity=0.082 Sum_probs=209.1 Q ss_pred CCCCCccccccccccccccccchhHHH-HHHHHHHHHHHHHHhhhhccCcccc---cc---ccceEEEEeccCcceeeee Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQSGGDKLALF-LKVFGGEVLTAFARTSVTTNRHIMR---SI---SSGKSAQFPVLGRTRAAYL 73 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~-ie~~~g~V~~~f~~~s~~~~~~~~~---~i---~~G~tv~i~~iG~~t~~~~ 73 (343) |||+ ...| .|+|+.++++.|+++.++.++++.+ ++ +.|+||+|++.+..++++| T Consensus 1 MaN~-------------------llT~~p~iia~~aL~~l~~~lV~~~lVnr~y~~ef~~~k~GDTV~I~~p~~~~~~d~ 61 (423) T protein:vir:10 1 MPNN-------------------LDSNVSQIVLKKFLPGFMSDLVLAKTVDRQLLAGEINSSTGDSVSFKRPHQFSSLRT 61 (423) T ss_pred Cccc-------------------hhhhhHHHHHHHHHHHHHhhcccchhhcccCCCcccccccCCEEEEeeCCceeeecc Confidence 5543 2234 4899999999999999999998764 34 3599999999999999999 Q ss_pred cCCCcCCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccc Q lcl|NC_011085. 
74 QAGQSLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNE 153 (343) Q Consensus 74 ~~g~~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~ 153 (343) +++..-.-+.+++...+++|+||+.||++|.++|.|+.+.--|+ +++.+++.++||+.+|+.++..+...+... T Consensus 62 ~~~~~~~~~~~dl~e~~v~l~id~~k~va~~v~d~E~~~~i~~~-~~~l~~A~~aLA~~vd~~ia~~~~~~~~~~----- 135 (423) T protein:vir:10 62 PTGDISGQNKNNLISGKATGRVGNYITVAVEYQQLEEAIKLNQL-EEILAPVRQRIVTDLETELAHFMMNNGALS----- 135 (423) T ss_pred CCccccccccCccccceeEEEeeceeeeeeeechHHHhcChhhH-HHHHHHHHHHHHHHHHHHHHHHHhhccccc----- Confidence 86421111346788899999999999999999999988655555 889999999999999999987654322110 Q ss_pred cccccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhh-hhccccccch Q lcl|NC_011085. 154 NIAGLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPN-AANYAALIDP 232 (343) Q Consensus 154 ~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~-~~~~~~~~~~ 232 (343) . ++.+.. + . .++.+.+++.+|++++||..|||+||+|++|..|++++.+. ..+..++..+ T Consensus 136 --~----------gt~~t~--~-~----a~~~i~~a~~~Ld~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~~al 196 (423) T protein:vir:10 136 --L----------GSPNTP--I-T----KWSDVAQTASFLKDLGVNEGENYAVMDPWSAQRLADAQTGLHASDQLVRTAW 196 (423) T ss_pred --c----------ccCCcc--c-c----hHHHHHHHHHHHHhccCCcCCCEEEeChHHHHHHhccccceecccccchhhh Confidence 0 011111 1 1 15678889999999999999999999999999999877644 4455566779 Q ss_pred hccee-EEEeceEEEEecccccccccccccc--------c--------------------c--cc--ccccccccc---- Q lcl|NC_011085. 233 ERGSI-RNVMGFEVVEVPHLTAGGAGDDRED--------E--------------------T--TN--QKHAFPKTA---- 275 (343) Q Consensus 233 ~~G~V-~~i~Gf~V~~sn~lp~~~~~~~~~~--------~--------------------~--~~--~~~~~~~~~---- 275 (343) ++|.| |+++||+||+||++|.....+.... . . +. 
.+-.+..++ T Consensus 197 r~g~i~G~i~GFdv~~Snnip~~T~gt~~~t~~~~~~~~v~~~a~~~a~~~~~~~~~~~~~~~~~l~~GD~~t~aGv~~v 276 (423) T protein:vir:10 197 ENAQIPTNFGGIRALMSNGLASRTQGAFGGTLTVKTQPTVTYNAVKDSYQFTVTLTGATASVTGFLKAGDQVKFTNTYWL 276 (423) T ss_pred hhccceeeecceEEEEeCCCccccccccccceeeeecceeccccccccceeeeeeeeccccccCceeecceEEecceeee Confidence 99987 8999999999999996322211100 0 0 00 000000000 Q ss_pred -----------------------------ccccc----------------------------------ccccceEeEeec Q lcl|NC_011085. 276 -----------------------------EGDTK----------------------------------VALDNVVGLFQH 292 (343) Q Consensus 276 -----------------------------~~~~~----------------------------------~~~~~~~~l~~~ 292 (343) .+... ...+..+.|+|| T Consensus 277 ~~~tk~~~~~~~t~~~~~~~v~a~~~~~~~g~~tv~i~p~~i~~~~~~~~~~v~a~~a~~~~vT~~~~a~~t~~~nl~~~ 356 (423) T protein:vir:10 277 QQQTKQALYNGATPISFTATVTADANSDSGGDVTVTLSGVPIYDTTNPQYNSVSRQVEAGDAVSVVGTASQTMKPNLFYN 356 (423) T ss_pred cccccccccccccCcceEEEEEeeeeeccCCceeeeccCccccccCCcccccccccccCCceeeccccccCCeeEEEEec Confidence 00000 011123558999 Q ss_pred hhhheeeee-----------------eeeEEeeeeccchhhhhhhhhhhhccceecccceEEEEecC Q lcl|NC_011085. 293 RSAVGTVKL-----------------KDLSLERARRAEYQADQIIARYAMGHGGLRPEAAGALVFTA 342 (343) Q Consensus 293 ~~Av~~~~~-----------------~~~~~e~~~~~~~~~d~i~~~~~~G~~v~rpe~~~~i~~~~ 342 (343) |+|+..+.. 
..+++-.+||.+..-+.++.-..||.+.+|||.++.+.-.- T Consensus 357 ~~a~~l~~~pl~~~~~~~~~~~~~~g~s~r~~~~~d~~~~~~~~r~d~l~g~~~~~p~~~~~~~g~~ 423 (423) T protein:vir:10 357 KFFCGLGSIPLPKLHSIDSAVATYEGFSIRVHKYADGDANVQKMRFDLLPAYVCFNPHMGGQFFGNP 423 (423) T ss_pred CcceEEEEEcccCCCccceeeccccCceEEEEEeeeccccceEEEEEeecceeeeccceEEEEEecC Confidence 999877643 33445555777666777888888999999999997776555 No 43 >protein:vir:3613 Length: 272 # NCBI annotation: MHP # Family: family:all:522 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112699;genbank:gi:13786567;genbank:GeneID:921035 Probab=100.00 E-value=2.4e-40 Score=237.76 Aligned_cols=267 Identities=18% Similarity=0.150 Sum_probs=214.9 Q ss_pred CCCCCccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCcccc-ccc--cceEEEEeccCcc-eeeeecCC Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMR-SIS--SGKSAQFPVLGRT-RAAYLQAG 76 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~-~i~--~G~tv~i~~iG~~-t~~~~~~g 76 (343) |||.. |+. .+-+..|+|+..|.+.|.+..++.++.... ++. .|++|+||..+.. ...++..| T Consensus 1 ma~~~------T~~--------~d~iiPev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~ti~iP~~~~~gda~~~~eg 66 (272) T protein:vir:36 1 MSKQK------TTL--------ADLVNPEVLAPIVSYELNKALRFAPLAQVDTTLQGQPGNTLKFPAFTYIGDAADVAEG 66 (272) T ss_pred CCCcc------eeh--------hhhhchHHHHHHHHHHHHhhhhhccccccccccccCCCCEEEEeeeccCccccccCCC Confidence 88753 221 122345999999999999999999887664 343 5999999997665 35678889 Q ss_pred CcCCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccc Q lcl|NC_011085. 77 QSLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENIA 156 (343) Q Consensus 77 ~~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~ 156 (343) ..++. +.++.++.+++|.+. ...|.|+|++..++..|++++++++++++||+++|+.++..+..... 
T Consensus 67 ~~i~~--~~lt~~~~~~~i~~~-~k~~~vtD~~~~~~~~d~~~~~~~~~a~~~a~~~d~~i~~~l~~~~~---------- 133 (272) T protein:vir:36 67 GEISL--DKIGTTTKSVTIKKA-AKGTEITDEAALSGYGDPIGESNKQLGLSLANKVDDDLLSAAKTTSQ---------- 133 (272) T ss_pred CccCh--hhcCCcceeEeeehh-hccccccHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhccccc---------- Confidence 98875 468899999999886 57899999999999999999999999999999999999866532111 Q ss_pred ccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhh-ccccccchhcc Q lcl|NC_011085. 157 GLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAA-NYAALIDPERG 235 (343) Q Consensus 157 g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~-~~~~~~~~~~G 235 (343) .. +. ...++.|.+|..+|.+.++| .||++|+|++|+.|+++.++... ++.+...+++| T Consensus 134 --------~~-------~~----~~~~d~i~~A~~~lgd~~~~--~~~ivv~p~~~~~L~k~~~~~~~~~~~~~~~~~~G 192 (272) T protein:vir:36 134 --------TV-------ST----KANVDGVQAALDIFNDEDAQ--AYVLIVNPKDAAKIRKDANAKNIGSEVGANALING 192 (272) T ss_pred --------cc-------cc----cccHHHHHHHHHHhhhcCCC--ceEEEEcHHHHHHHhcccccccccccccccceeee Confidence 00 00 11256677889999999875 68999999999999999888765 45667789999 Q ss_pred eeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEeeeeccchh Q lcl|NC_011085. 236 SIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERARRAEYQ 315 (343) Q Consensus 236 ~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~~~~~~ 315 (343) .|++++|++|++||++|.... ....+++.+.|+++...+++++|..|+++++ T Consensus 193 ~ig~~~G~~Vv~s~~~p~~~~----------------------------~~~~~~~~~gA~~~~~~~~~~vE~~R~~~~~ 244 (272) T protein:vir:36 193 TYADVLGAQIVRSKKLAEGSA----------------------------LMFKIVSNSPALKLVLKRGVQVETDRDIVTK 244 (272) T ss_pred ccceecCeeEEEeCCCCCCce----------------------------eEEEEEecccceeeeecCCcccccccchhhc Confidence 999999999999999994321 0122567888999999999999999999999 Q ss_pred hhhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 
316 ADQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 316 ~d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) +|.|+++++||++++||++++.++++-= T Consensus 245 ~d~i~~~~~y~~~v~~~~~vv~~t~~g~ 272 (272) T protein:vir:36 245 TTVITADEHYAAYLYDLTKVVNITFTGV 272 (272) T ss_pred CcEEEEEEEEEEEEEcCccEEEEeecCC Confidence 9999999999999999999999876422 No 44 >protein:vir:105522 Length: 423 # NCBI annotation: phage major head protein # Family: family:all:1412 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516191;genbank:gi:89885994;genbank:GeneID:3964382 Probab=100.00 E-value=5.2e-38 Score=224.97 Aligned_cols=298 Identities=12% Similarity=0.071 Sum_probs=206.0 Q ss_pred CCCCCccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCcccc---cc---ccceEEEEeccCcceeeeec Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMR---SI---SSGKSAQFPVLGRTRAAYLQ 74 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~---~i---~~G~tv~i~~iG~~t~~~~~ 74 (343) |||+- .+|-.++|+.+.++.|+++.++.++++.. ++ +.|+||+|++.+..++++.. T Consensus 1 MANsl------------------~~l~p~iia~~al~~l~~~lV~~~lV~r~y~~ef~~ak~GDTV~I~~P~~~~~~d~~ 62 (423) T protein:vir:10 1 MANNL------------------DANVSQIVLKKFLPGFMSDLVLCKTVDRQLLAGEINSSTGDSVSFKRPHQFKSERTM 62 (423) T ss_pred Ccccc------------------ccccHHHHHHHHHHHHHhhcccchhhccCCCccccccccCCEEEEeeCCceeeeccc Confidence 66441 12345899999999999999999998764 33 25999999999999988754 Q ss_pred CCCcCCC-ccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccc Q lcl|NC_011085. 75 AGQSLDD-KRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNE 153 (343) Q Consensus 75 ~g~~i~~-~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~ 153 (343) +..+.+ ..+++...++.++||+.+|++|.++|.|+.+.--|+ +.+.+++.++||+.+|+.++..++..+.. 
T Consensus 63 -~~~~t~~~~~~l~e~~v~l~id~~k~~a~~v~d~E~~l~i~~~-~~~l~~A~~aLA~~vd~~ia~~~~~~~~~------ 134 (423) T protein:vir:10 63 -DGDITGKSKNSLISAKATGEVGNYITVAVEYRQIEEALKLNQL-DQILVPINERMVTDLETELALFMMKHGAL------ 134 (423) T ss_pred -CcccCcccccccccceEEEEecceeeeeeeeChHHHhcChhHH-HHHHHHHHHHHHHHHHHHHHHHhhhcccc------ Confidence 333333 234566678999999999999999999988554445 88999999999999999997655432210 Q ss_pred cccccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhh-hhccccccch Q lcl|NC_011085. 154 NIAGLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPN-AANYAALIDP 232 (343) Q Consensus 154 ~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~-~~~~~~~~~~ 232 (343) .. +..+...+ -++.+.+++.+|++.+||..+||+||+|++|..|++++.+. ..+..++..+ T Consensus 135 -~v----------gt~~t~~~-------a~~~~a~a~~~L~~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~~al 196 (423) T protein:vir:10 135 -SL----------GSPNTPIK-------KWSDVAQTASFLKDLGINSGENYAVMDPWAAQRLADAQSGLHVSEQLVRTAW 196 (423) T ss_pred -cc----------cccccccc-------cHHHHHHHHHHHhhccCCcCCCEEEeCHHHHHHHhhhhhhhccccccchHHH Confidence 00 11111111 15678889999999999999999999999999999776544 4455667779 Q ss_pred hccee-EEEeceEEEEeccccccccccccc-----------cccc-----ccc------c-------------------- Q lcl|NC_011085. 233 ERGSI-RNVMGFEVVEVPHLTAGGAGDDRE-----------DETT-----NQK------H-------------------- 269 (343) Q Consensus 233 ~~G~V-~~i~Gf~V~~sn~lp~~~~~~~~~-----------~~~~-----~~~------~-------------------- 269 (343) ++|.| |+++||+||+||++|....++.+. ..+. ... + T Consensus 197 r~~~i~G~~~GFdi~~Sn~vp~~T~g~~~ga~~~~~~~~vt~a~~~~~~~~~~~~~~~T~s~~g~l~~GD~~t~aGv~~v 276 (423) T protein:vir:10 197 ENAQISGNFGGIRALMSNGLASRTQGAFGGKLTVKGTPEVNYDSVKDSYAFTATLTGATASKKGFLKVGDQLQFDDTHWL 276 (423) T ss_pred HhcccceeecceEEEEecCCcccccccccceeeeeeeeEEEecccccccccccceeeccceeceeEEecceEeecceeee Confidence 99976 999999999999999532221110 0000 000 0 Q ss_pred -----------------ccccc------cccccc----------------------------------ccccceEeEeec Q lcl|NC_011085. 
270 -----------------AFPKT------AEGDTK----------------------------------VALDNVVGLFQH 292 (343) Q Consensus 270 -----------------~~~~~------~~~~~~----------------------------------~~~~~~~~l~~~ 292 (343) .+.-. ..+... ...+..+.|+|| T Consensus 277 ~~~tk~~l~~~~~~~~~~~~V~~~~~~~a~~~~tv~i~p~~~~~~~~~~~~~V~a~~a~~~~vT~~~~~~~t~~~nl~~~ 356 (423) T protein:vir:10 277 NQQSKQTLYNGASALSFTATVMEDANAHSSGDVTVKISGVPIFDAGYPQYNAVDRLLAEGDTVSVIGTSKQAMKPNLFYN 356 (423) T ss_pred cccccceeecccCCcceEEEEEecccccccCceEEEeccccccccCcccccceeccccCCceeEEeeccCCceeEEEEec Confidence 00000 000000 001123458999 Q ss_pred hhhheeeee-----------------eeeEEeeeeccchhhhhhhhhhhhccceecccceEEEEecC Q lcl|NC_011085. 293 RSAVGTVKL-----------------KDLSLERARRAEYQADQIIARYAMGHGGLRPEAAGALVFTA 342 (343) Q Consensus 293 ~~Av~~~~~-----------------~~~~~e~~~~~~~~~d~i~~~~~~G~~v~rpe~~~~i~~~~ 342 (343) |+|+..+.. ..+++-.+||.+..-+.++.-..||.+.+|||.++.+.-.- T Consensus 357 ~~a~~l~~~pl~~~~~~~~~~~~~~g~s~r~~~~~d~~~~~~~~r~d~l~g~~~~~p~~~~~~~g~~ 423 (423) T protein:vir:10 357 KLFCGLGTIPLPKLHSIDSAVATYEGFSIRVHKYADGDANKQMMRFDLLPAYVCYNPHMGGQFFGNP 423 (423) T ss_pred CcceEEEEEcccCCCccceeecccccceEEEEEeeeccccceEEEEEeecceeeeccceEEEEEecC Confidence 998876643 34445556777767777888888999999999997776555 No 45 >protein:vir:105334 Length: 276 # NCBI annotation: putative phage major capsid protein # Family: family:all:522 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950669;genbank:gi:119967839;genbank:GeneID:4643213 Probab=100.00 E-value=4.6e-38 Score=225.30 Aligned_cols=264 Identities=17% Similarity=0.166 Sum_probs=215.9 Q ss_pred CCCCCccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCcccc-cc--ccceEEEEeccCcc-eeeeecCC Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMR-SI--SSGKSAQFPVLGRT-RAAYLQAG 76 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~-~i--~~G~tv~i~~iG~~-t~~~~~~g 76 (343) |||.++ +. .+-+..|+|+..|.+.+.+..++.++.... ++ +.|++|+||..+.. 
.+.++..| T Consensus 1 Ma~~~T------~l--------~d~i~Pev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~ti~iP~~~~igda~~~~eg 66 (276) T protein:vir:10 1 MAQGTT------TK--------STQIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFVYSGDATVVPEG 66 (276) T ss_pred CCccee------eh--------hhhhchHHHHHHHHHHHHhhhhhcccceecccccCCCCCEEEeeeecCCCccccccCC Confidence 887532 21 122355999999999999999999998765 34 36999999987654 45678889 Q ss_pred CcCCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccc Q lcl|NC_011085. 77 QSLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENIA 156 (343) Q Consensus 77 ~~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~ 156 (343) +.|+. ..++.++.+.+|.+ .+..|.++|++..++..|++.+++++++++||+++|+.++..+..+... T Consensus 67 ~~i~~--~~lt~~~~~a~i~~-~~k~~~~tD~a~~~~~~dp~~~~~~~~~~~~a~~~d~~~~~~l~~~~~~--------- 134 (276) T protein:vir:10 67 QKIPV--DKIETNRREAKIHK-IGKGTDITDEALLSGYGDPQGEAVRQHGLAIANKVDNDVLEALRGTKLT--------- 134 (276) T ss_pred CccCc--cccccceeeEEeeh-ccccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhccccc--------- Confidence 98875 46888999999966 5899999999999999999999999999999999999998765432110 Q ss_pred ccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhcc--chhhhhccccccchhc Q lcl|NC_011085. 157 GLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAA--LMPNAANYAALIDPER 234 (343) Q Consensus 157 g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~--~~~~~~~~~~~~~~~~ 234 (343) + . 
++..+ ++.|.+|..+|.++++ +.++++|+|++|..|+++ .+|+..+-.++..+++ T Consensus 135 -------~--~--~~~~t--------~d~i~~A~~~lgd~~~--~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~ 193 (276) T protein:vir:10 135 -------V--S--ADIGT--------LAGLEAAIDTFDDEDL--EPMVLFINPKDAGKLRSSASDNFTRATELGDNIIVK 193 (276) T ss_pred -------c--c--ccccC--------HHHHHHHHHHhccccC--cccEEEEcHHHHHHHHHhccccccccccccccceec Confidence 0 0 01111 4567778889988875 789999999999999764 5788776667778899 Q ss_pred ceeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEeeeeccch Q lcl|NC_011085. 235 GSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERARRAEY 314 (343) Q Consensus 235 G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~~~~~ 314 (343) |.|++++|++|++|+++|.+. ++++++.|++++..+++++|..|++++ T Consensus 194 G~ig~~~G~~Vi~s~~~p~~t--------------------------------~~l~~~gAi~~~~~~~~~vE~dRd~~~ 241 (276) T protein:vir:10 194 GAFGEALGAVIVRSKKLDEGE--------------------------------AILAKRGAVKLITKRDFFLETDRDPST 241 (276) T ss_pred cccceecceeEEEcCCCCcce--------------------------------EEEEeccceeeeecCCceeecccchhh Confidence 999999999999999998421 257888999999999999999999999 Q ss_pred hhhhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 315 QADQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 315 ~~d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) +.|.|.+++.||+++++|++++.++...| T Consensus 242 ~~d~i~~~~~y~~~~~~~~~vv~~t~~~~ 270 (276) T protein:vir:10 242 KTTALYSDKHYVAYLYDESKAVKVTKGAG 270 (276) T ss_pred cccEEEEeeEEEEEEEcCcceEEEecCCc Confidence 99999999999999999999999998888 No 46 >protein:vir:3033 Length: 272 # NCBI annotation: major capsid protein # Family: family:all:522 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438146;genbank:gi:16271809;genbank:GeneID:929235 Probab=100.00 E-value=3.4e-36 Score=215.06 Aligned_cols=263 Identities=15% Similarity=0.123 Sum_probs=211.6 Q ss_pred CCCCCccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCcccc-cc--ccceEEEEeccCc-ceeeeecCC Q lcl|NC_011085. 
1 MADMKGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMR-SI--SSGKSAQFPVLGR-TRAAYLQAG 76 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~-~i--~~G~tv~i~~iG~-~t~~~~~~g 76 (343) ||+.++ +. + +-+..|+|+..|.+.+.+.+++.++.... ++ ..|++|+||+.+. ..+.++..| T Consensus 1 MA~~~T------~~------~--~~~iPev~s~~v~~~~~~~~~~~~~~~~~~~~~g~~G~tv~iP~~~~~~~a~~v~eg 66 (272) T protein:vir:30 1 MAVGTT------KM------A--QMLDPEVLADMIDAEVGKAIRFAPLAEVDTTLEGQPGTTLTVPKWDYIGDAEDVAEG 66 (272) T ss_pred CCCccc------cc------h--heechHHHHHHHHHHHHHHhhhhccccccccccCCCCCEEEEEEecCCCCcccccCC Confidence 887642 21 1 12345999999999999999998887764 33 3599999999864 567889889 Q ss_pred CcCCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccc Q lcl|NC_011085. 77 QSLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENIA 156 (343) Q Consensus 77 ~~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~ 156 (343) +.++. .+++.++.++++.+. ...+.|+|.+..++..|+++++.+++++++++++|+.++..+.++... T Consensus 67 ~~i~~--~~~~~~~~~~~~~~~-~~~~~itd~~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~a~~~--------- 134 (272) T protein:vir:30 67 EAIPM--TQLGFKKTTMTIKKA-GKGVEITDEAILSGYGDPVGQAAKQIVEAIDHKVDADVLDALSKSTQT--------- 134 (272) T ss_pred Ccccc--cccccceEEEEeeee-eeeeeecHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccc--------- Confidence 98875 458889999999885 577999999999999999999999999999999999998765332110 Q ss_pred ccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccc--hhhhhccccccchhc Q lcl|NC_011085. 157 GLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAAL--MPNAANYAALIDPER 234 (343) Q Consensus 157 g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~--~~~~~~~~~~~~~~~ 234 (343) + . +. . .++.|.+|..+|++.+ ...|+++|+|++|..|+++. 
++......+...+++ T Consensus 135 -------~--~--~~-----~----t~d~i~da~~~l~~~~--~~~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~~ 192 (272) T protein:vir:30 135 -------V--E--AT-----A----TVDGVSKALDIFNDED--DAETVIVMNPADASTLRLDAAKEWLGATEVGANRVVS 192 (272) T ss_pred -------c--c--cc-----c----CHHHHHHHHHHHhccC--CCccEEEEcHHHHHHHHHhcccccccccccccccccc Confidence 0 0 00 0 1456677788898776 45799999999999998774 444444445667899 Q ss_pred ceeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEeeeeccch Q lcl|NC_011085. 235 GSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERARRAEY 314 (343) Q Consensus 235 G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~~~~~ 314 (343) |.|++++|++|++|+++|.+. .+++++.|++.+..+++++|.+|++++ T Consensus 193 g~ig~i~G~~Vi~s~~~p~~t--------------------------------~~~~~~~a~~~~~~~~~~ve~~r~~~~ 240 (272) T protein:vir:30 193 GVYGEVLGVQIVRSRKCPKGT--------------------------------AYMVRKGALRIMLKRNTMVETDRDITK 240 (272) T ss_pred ccchhhcCeeEEEcCCCCcce--------------------------------EEEEcCCeEEEEecCCceeeecccccc Confidence 999999999999999998321 257888999999999999999999999 Q ss_pred hhhhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 315 QADQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 315 ~~d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) +.|.|.++++||.++++|++++.+++.+= T Consensus 241 ~~~~i~~~~~~~~~v~~~~~vv~~t~~~a 269 (272) T protein:vir:30 241 AINQIVANKHYGVYLYKAEKAVKITLKDA 269 (272) T ss_pred ceeEEEEEEEEEEEEEcCCceEEEEeccc Confidence 99999999999999999999999988765 No 47 >protein:vir:9820 Length: 272 # NCBI annotation: putative major capsid/head protein # Family: family:all:522 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795582;genbank:gi:28876339;genbank:GeneID:1257858 Probab=100.00 E-value=3.4e-36 Score=215.06 Aligned_cols=263 Identities=15% Similarity=0.123 Sum_probs=211.6 Q ss_pred CCCCCccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCcccc-cc--ccceEEEEeccCc-ceeeeecCC Q lcl|NC_011085. 
1 MADMKGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMR-SI--SSGKSAQFPVLGR-TRAAYLQAG 76 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~-~i--~~G~tv~i~~iG~-~t~~~~~~g 76 (343) ||+.++ +. + +-+..|+|+..|.+.+.+.+++.++.... ++ ..|++|+||+.+. ..+.++..| T Consensus 1 MA~~~T------~~------~--~~~iPev~s~~v~~~~~~~~~~~~~~~~~~~~~g~~G~tv~iP~~~~~~~a~~v~eg 66 (272) T protein:vir:98 1 MAVGTT------KM------A--QMLDPEVLADMIDAEVGKAIRFAPLAEVDTTLEGQPGTTLTVPKWDYIGDAEDVAEG 66 (272) T ss_pred CCCccc------cc------h--heechHHHHHHHHHHHHHHhhhhccccccccccCCCCCEEEEEEecCCCCcccccCC Confidence 887642 21 1 12345999999999999999998887764 33 3599999999864 567889889 Q ss_pred CcCCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccc Q lcl|NC_011085. 77 QSLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENIA 156 (343) Q Consensus 77 ~~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~ 156 (343) +.++. .+++.++.++++.+. ...+.|+|.+..++..|+++++.+++++++++++|+.++..+.++... T Consensus 67 ~~i~~--~~~~~~~~~~~~~~~-~~~~~itd~~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~a~~~--------- 134 (272) T protein:vir:98 67 EAIPM--TQLGFKKTTMTIKKA-GKGVEITDEAILSGYGDPVGQAAKQIVEAIDHKVDADVLDALSKSTQT--------- 134 (272) T ss_pred Ccccc--cccccceEEEEeeee-eeeeeecHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccc--------- Confidence 98875 458889999999885 577999999999999999999999999999999999998765332110 Q ss_pred ccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccc--hhhhhccccccchhc Q lcl|NC_011085. 157 GLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAAL--MPNAANYAALIDPER 234 (343) Q Consensus 157 g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~--~~~~~~~~~~~~~~~ 234 (343) + . +. . .++.|.+|..+|++.+ ...|+++|+|++|..|+++. 
++......+...+++ T Consensus 135 -------~--~--~~-----~----t~d~i~da~~~l~~~~--~~~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~~ 192 (272) T protein:vir:98 135 -------V--E--AT-----A----TVDGVSKALDIFNDED--DAETVIVMNPADASTLRLDAAKEWLGATEVGANRVVS 192 (272) T ss_pred -------c--c--cc-----c----CHHHHHHHHHHHhccC--CCccEEEEcHHHHHHHHHhcccccccccccccccccc Confidence 0 0 00 0 1456677788898776 45799999999999998774 444444445667899 Q ss_pred ceeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEeeeeccch Q lcl|NC_011085. 235 GSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERARRAEY 314 (343) Q Consensus 235 G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~~~~~ 314 (343) |.|++++|++|++|+++|.+. .+++++.|++.+..+++++|.+|++++ T Consensus 193 g~ig~i~G~~Vi~s~~~p~~t--------------------------------~~~~~~~a~~~~~~~~~~ve~~r~~~~ 240 (272) T protein:vir:98 193 GVYGEVLGVQIVRSRKCPKGT--------------------------------AYMVRKGALRIMLKRNTMVETDRDITK 240 (272) T ss_pred ccchhhcCeeEEEcCCCCcce--------------------------------EEEEcCCeEEEEecCCceeeecccccc Confidence 999999999999999998321 257888999999999999999999999 Q ss_pred hhhhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 315 QADQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 315 ~~d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) +.|.|.++++||.++++|++++.+++.+= T Consensus 241 ~~~~i~~~~~~~~~v~~~~~vv~~t~~~a 269 (272) T protein:vir:98 241 AINQIVANKHYGVYLYKAEKAVKITLKDA 269 (272) T ss_pred ceeEEEEEEEEEEEEEcCCceEEEEeccc Confidence 99999999999999999999999988765 No 48 >protein:vir:79008 Length: 299 # NCBI annotation: putative main capsid protein # Family: family:all:701 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110725;genbank:gi:134287342;genbank:GeneID:4955182 Probab=100.00 E-value=2e-35 Score=210.87 Aligned_cols=284 Identities=13% Similarity=0.059 Sum_probs=189.1 Q ss_pred CCCCCccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCcccc---cc--ccceEEEEeccCcceeeeecC Q lcl|NC_011085. 
1 MADMKGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMR---SI--SSGKSAQFPVLGRTRAAYLQA 75 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~---~i--~~G~tv~i~~iG~~t~~~~~~ 75 (343) ||.. + |.|+|++++++.|...+++..+.+.. .+ .+|++||||+++.+.++||++ T Consensus 1 MA~~--------------------n-~a~~~~~~Ld~~~~~~l~~~~L~~~~~~~~v~~~gg~tVkI~~i~~~gl~DY~R 59 (299) T protein:vir:79 1 MAAL--------------------N-YAKEYSNVLAQAYPYTLNFGDLYATPNNGRYRWTGSKTIEIPTISTTGRVDSNR 59 (299) T ss_pred Cccc--------------------h-hHHHHHHHHHHHHHhhceeeeeccCcccceeeecCCCEEEEecccccccccccc Confidence 3321 1 67999999999999999988765543 23 479999999999999999998 Q ss_pred CCcCCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhh--HHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccc Q lcl|NC_011085. 76 GQSLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDV--RSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNE 153 (343) Q Consensus 76 g~~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~--~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~ 153 (343) ++.... ..+++.+..+++||+.+||.|.||++|..|++..+ -....+.+.+.++..+|.+.+..|+..+.... T Consensus 60 ~~~g~~-~g~~~~~~~t~~ldqdr~~~f~vD~~Dvdet~~~~~~a~v~~~~~~~~v~pEiDay~~skl~~~a~~~g---- 134 (299) T protein:vir:79 60 DTIAVA-QRNYDNAWEPKVLTNQRKWSTLVHPADINQTNYVASIGNITKVYNEEQKFPEMDAYCISKIYADWTALG---- 134 (299) T ss_pred CCCccc-ccccCcceeEEEeeccccceeccchhhHHHHhhhhHHHHHHHHHHHHHhhhHhhHHHHHHHHHhhhhcC---- Confidence 764332 34578889999999999999999977777766554 23345556677888888888877764432110 Q ss_pred cccccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhc-cccccch Q lcl|NC_011085. 154 NIAGLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAAN-YAALIDP 232 (343) Q Consensus 154 ~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~-~~~~~~~ 232 (343) ..+..+.+ +.+++++.|+++.++|+|++||.+|||++|+|++|.+|+++++|++.. ....... 
T Consensus 135 --------------~~~~~~~~--T~~n~y~~i~~~~~~lde~~vP~~~rvl~vtp~~~~~L~~~~~f~k~~~~~~~~~~ 198 (299) T protein:vir:79 135 --------------NTADTTVL--TTTNVLEVFDKLMEKMTEARVPENGRILYVTPVVNTLIKNAKEIQRTVNIKDAGTS 198 (299) T ss_pred --------------Cccccccc--CHHHHHHHHHHHHHHHHhcCCCCCCeEEEeCHHHHHHHhhchhhhcccccccccce Confidence 00111111 135678888999999999999999999999999999999999988654 4444567 Q ss_pred hcceeEEEeceEEEE--eccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEeeee Q lcl|NC_011085. 233 ERGSIRNVMGFEVVE--VPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERAR 310 (343) Q Consensus 233 ~~G~V~~i~Gf~V~~--sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~ 310 (343) ++|.|++++||+|++ |++++..-...... + + ..+.-...| ++.|++|+......+ .+..+. T Consensus 199 ~~g~Vg~idG~~Ii~Vps~r~~t~~~~~~G~--~---~------~~~ak~in~-----ii~~~~a~~~~~K~~-~~~~~~ 261 (299) T protein:vir:79 199 LNRQTTDIDTVKIIKVPSNLMKTAYDFTTGW--K---V------GAGAKQIFM-----SLVHPSAIITPVSYQ-FSKLDE 261 (299) T ss_pred eeeeeeeecceEEEEechhhcCccceeccCc--c---c------cCcccccce-----EEEcCCeeeeeEeee-eEEeec Confidence 899999999999998 56676321000000 0 0 000001122 788999887665544 344433 Q ss_pred cc-chhhh-hhhhhhhhccceecccceEE--EEecCC Q lcl|NC_011085. 311 RA-EYQAD-QIIARYAMGHGGLRPEAAGA--LVFTAG 343 (343) Q Consensus 311 ~~-~~~~d-~i~~~~~~G~~v~rpe~~~~--i~~~~g 343 (343) +. 
...+| ++..+.-+..-++.....++ -.-++| T Consensus 262 P~~~~~~~~~~~~r~y~d~~v~~nk~~~i~~~~~~a~ 298 (299) T protein:vir:79 262 PTAVTEGKYFYFEESFEDVFILNKKADAIQFVVEGAG 298 (299) T ss_pred CCCCCccceeeeeeeeeeeeeeccccCeEEEEeeecC Confidence 22 22333 33344444666665544333 233344 No 49 >protein:vir:78920 Length: 290 # NCBI annotation: Cps # Family: family:all:701 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468846;genbank:gi:157325479;genbank:GeneID:5601917 Probab=100.00 E-value=2.2e-32 Score=194.09 Aligned_cols=277 Identities=14% Similarity=0.083 Sum_probs=197.7 Q ss_pred CCCCCccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCcccc-ccccceEEEEeccCcceeeeecCCCcC Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMR-SISSGKSAQFPVLGRTRAAYLQAGQSL 79 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~-~i~~G~tv~i~~iG~~t~~~~~~g~~i 79 (343) || .. |.++|++.+++.|...+++..+.+.+ ...+|++|+||+++.+.+++|++++.. T Consensus 1 Ma---------------------in-~a~~~~~~Ld~~~~~~~~t~~l~~~~~~~~ggktVkI~~i~~~gl~DY~R~~g~ 58 (290) T protein:vir:78 1 MA---------------------IN-YVDKYGKELDQKLVFGTYTNELETPNLLWLDAKTFKIQTITTTGLKAHTRNKGY 58 (290) T ss_pred Cc---------------------hh-HHHHHHHHHHHHHHhhheeeeccccceeeccCCEEEEeeeccCcccccccCCCc Confidence 11 11 45899999999999999988776543 567899999999999999999998866 Q ss_pred CCccCCCccceEEEEeeeeeeeeeecc--chHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccc Q lcl|NC_011085. 80 DDKRKDIKHTEKTIVIDGLLTADVLIY--DIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENIAG 157 (343) Q Consensus 80 ~~~~~~~~~~~~~l~iD~~~~~~~~Id--d~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~g 157 (343) .. .+++.+..+++||+.++|.|.|| |+||.+....+.....+.+.+.++..+|.+.+..|+..+.... 
T Consensus 59 ~~--g~v~~~~et~tl~qdR~~~F~vD~~DvDEt~~~~~~~nv~~ef~~~~v~PEiDayr~skla~~a~~~~-------- 128 (290) T protein:vir:78 59 NE--GSASNTNKSYTIDFDRDVEFFVDVMDVDETGQALSAANVTKEFNSRHAGPEMDAYRFSKLATAAKTNS-------- 128 (290) T ss_pred cc--CccccceeeEEeeccccceeeccccchhHHhhhhhHHHHHHHHHHHHhhhhhhHHHHHHHHhhhhccC-------- Confidence 54 45788899999999999999999 8899888888888899999999999999998887765542210 Q ss_pred cCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhcc-cc-ccchhcc Q lcl|NC_011085. 158 LGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANY-AA-LIDPERG 235 (343) Q Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~-~~-~~~~~~G 235 (343) . . .....+ .+++++.|+++.++|+| ||.+|||++|+|++|.+|+++++|...-. .. .....+| T Consensus 129 ----~-~----~~~t~t----~~n~~~~i~~~~~~lde--vp~~~rvl~vtp~~~~lL~~~~~f~r~~~~~~~~~~~i~~ 193 (290) T protein:vir:78 129 ----N-S----VAEEIT----KDNVFTKLKAAIRKVKK--YGTQNLVMYVSPDVMAALELSDDFVRAINVQNIGPSSIET 193 (290) T ss_pred ----c-c----cccccC----HHHHHHHHHHHHHHHHh--cCCCCeEEEECHHHHHHHhhChhhhccccccccccccccc Confidence 0 0 011122 34667778888899987 89999999999999999999999886432 22 2344599 Q ss_pred eeEEEeceEEEEecc---ccc-cccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEeeeec Q lcl|NC_011085. 236 SIRNVMGFEVVEVPH---LTA-GGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERARR 311 (343) Q Consensus 236 ~V~~i~Gf~V~~sn~---lp~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~~ 311 (343) +|++++||+|++.++ +-. .....+ ..+ ... .-... -++.|++|+......+ .+..+ + T Consensus 194 ~V~~idG~~ii~vps~~r~~t~~~f~~G------~~~-----~~~-ak~in-----~ii~~~~a~i~~~K~~-~~~~~-~ 254 (290) T protein:vir:78 194 RITAIDGTRIVEVEAEDRFYDTFDFTDG------YKP-----AAG-AKKLN-----FLLVNKGSVVGGAKHA-SIYLH-A 254 (290) T ss_pred eeeeecCcEEEEecccchhhhhhhhccc------ccc-----cCC-cccee-----EEEEcCCceeeeeeee-EEEee-C Confidence 999999999999652 110 011100 000 000 00112 2788998876665544 34443 3 Q ss_pred cch----hhhhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 
312 AEY----QADQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 312 ~~~----~~d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) |.. -+|.+..+.-+.+-++.....++++-..= T Consensus 255 P~~~~~~d~~~~~~r~y~d~~v~~nk~~~i~~~~~~ 290 (290) T protein:vir:78 255 PGSVGQGDGWLYQYRVYHDIFVLDQQKDGVIASTEV 290 (290) T ss_pred CCCCcCcceeeeeeeeeeeeeeeccccCeeEEEeeC Confidence 322 36788888888888887776655544433 No 50 >protein:vir:102335 Length: 312 # NCBI annotation: putative capsid protein # Family: family:all:701 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529560;genbank:gi:90592716;genbank:GeneID:3974467 Probab=99.95 E-value=1.2e-29 Score=179.17 Aligned_cols=298 Identities=11% Similarity=0.012 Sum_probs=195.3 Q ss_pred CCCCCccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccc-c--ccccceEEEEeccCcceeeeecCCC Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIM-R--SISSGKSAQFPVLGRTRAAYLQAGQ 77 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~-~--~i~~G~tv~i~~iG~~t~~~~~~g~ 77 (343) |||. . =|.++|+.++++.|...+++..+... . .+.+|++|+||+|....+++|++++ T Consensus 1 Mant-------------------l-~ya~~~~~~LD~~~~~~~~s~~l~~~~~~v~~~ggktVkIp~i~~~gl~DY~R~~ 60 (312) T protein:vir:10 1 MANT-------------------L-AYGQVLQQGLDKQATQELLTGWMDSNAKQIKYEGGKEVKIGKLSTDGLGDYSRGS 60 (312) T ss_pred CCcc-------------------h-hHHHHHHHHHHHHHHhhhccccccCCCceEEEecCcEEEEEeeeccccccccccc Confidence 5543 1 17799999999999999887766422 2 4678999999999999999999976 Q ss_pred cCCCccCCCccceEEEEeeeeeeeeeecc--chHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccc Q lcl|NC_011085. 78 SLDDKRKDIKHTEKTIVIDGLLTADVLIY--DIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENI 155 (343) Q Consensus 78 ~i~~~~~~~~~~~~~l~iD~~~~~~~~Id--d~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~ 155 (343) ..-....+++.+..++++++.++|.|.|| |+||.+....+..-+.+.+.+.....+|.+.+..|+..+...... 
T Consensus 61 g~~~~~g~v~~~~et~tl~qDR~~~F~vD~mDvDETn~~~s~anv~~ef~r~~vvPEiDayrfskla~~a~~~~~~---- 136 (312) T protein:vir:10 61 ANAYVGGDVKFEYETKTMTQDRGRKFTLDAMDVDETNFLVTATTVMGEFQRLKVIPEIDAYRLSRLATIAIGIKGD---- 136 (312) T ss_pred CCccccccccccceeEEeeecccceeeccccchhhHhhHHHHHHHHHHHHHhhhcchhhHHHHHHHHhhhhccccc---- Confidence 63333356888999999999999999999 888887777777777777888899999999887776554322100 Q ss_pred cccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhccccccchhcc Q lcl|NC_011085. 156 AGLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAALIDPERG 235 (343) Q Consensus 156 ~g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G 235 (343) +.+ +..+ +-+..++++.|.++.++|+|+.|| .+|+++|+|+++.+|.++..+............+| T Consensus 137 -----~~~------~~~~--~~T~~ni~~~i~~~~~~lde~~vp-~~rvl~vTp~~~~lLk~~~~~~~~~~~~~~~~i~~ 202 (312) T protein:vir:10 137 -----TNV------EYSY--SVNSSTIINKIKTGIKIIRENGYN-GPLVCHLTYDSMFAIEEKVLEKLTAVTFAQGGIQT 202 (312) T ss_pred -----ccc------cccc--ccCHHHHHHHHHHHHHHHHHccCC-CceEEEeChHHHHHHhhhhhceecccccccceeee Confidence 000 0001 112466788899999999999999 69999999999987776543332222333445699 Q ss_pred eeEEEeceEEEEeccccccccccccccccccccc-cccccccccccccccceEeEeechhhheeeeeeeeEEeeee---c Q lcl|NC_011085. 236 SIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKH-AFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERAR---R 311 (343) Q Consensus 236 ~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~---~ 311 (343) +|++++|++|++.+.--....-.-..+.+.+... .+.... 
+.-...| ++.|++|+......+ .+..+- + T Consensus 203 ~V~~iDgv~Ii~VPs~r~~t~~~f~dG~t~~~~~gg~~~~~-~ak~INf-----iiv~~~a~i~~~K~~-~~~if~P~~~ 275 (312) T protein:vir:10 203 QVPSIDGCALIKTPQNRMYSSILLNDGTTSNQTAGGYLKGT-KALDTNF-----IIAPVDVPLAITKQD-KMRIFDPETN 275 (312) T ss_pred eeeeecccEEEEchhhhccceeeeccCcccccccCceeecC-cccccce-----EEeCCceeeceeeee-eeeeeCCCCC Confidence 9999999999985422111110000000000000 000000 0011222 788998776554443 333331 2 Q ss_pred cchhhhhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 312 AEYQADQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 312 ~~~~~d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) +...+|.+..+.-+.+-++.....++..--.. T Consensus 276 ~~~d~~~~~~R~Y~D~fv~~nk~~~Iyv~~k~ 307 (312) T protein:vir:10 276 QTANAWSMDYRRYHDLWVTDNKANSVYANFKD 307 (312) T ss_pred CCcceeeeeeeeeeeeeeeccccCeEEEEeec Confidence 23346899999988999988877666433222 No 51 >protein:vir:739 Length: 231 # NCBI annotation: major structural protein 4 # Family: family:all:522 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108716;genbank:gi:13487838;genbank:GeneID:920884 Probab=99.94 E-value=3.4e-30 Score=182.13 Aligned_cols=230 Identities=17% Similarity=0.137 Sum_probs=185.3 Q ss_pred ccccccceEEEEeccCcceeeeecCCCcCCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHH Q lcl|NC_011085. 51 MRSISSGKSAQFPVLGRTRAAYLQAGQSLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLA 130 (343) Q Consensus 51 ~~~i~~G~tv~i~~iG~~t~~~~~~g~~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa 130 (343) ..-+..|+|++||.. -..+.++..|+.++. +.++.++.+.+|.+. 
...|.|.|.+..++..|++.+.++|++.+|| T Consensus 1 ~~~~~~Gdtit~P~~-iGda~~v~eG~~i~~--~~l~~t~~~atIk~~-gk~~~itD~a~l~~~gDp~~ea~~Q~~~~iA 76 (231) T protein:vir:73 1 ENGINLANLCEYPND-IGDAADVAEGGEISL--DKIGTTTKSVTIKKA-AKGTEITDEAALSGYGDPIGESNKQLGLSLA 76 (231) T ss_pred CccccCCceEEeccc-ccchhhhcCCCcCCh--hhccccceeeeEeee-ccceeeeHHHHhhccCchHHHHHHHHHHHHH Confidence 224567999999864 234578889999975 468889999999775 7899999999999999999999999999999 Q ss_pred HHHHHHHHHHHHhhhhccccccccccccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHH Q lcl|NC_011085. 131 MAADGAVLAELAGLCNMPAASNENIAGLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPE 210 (343) Q Consensus 131 ~~~D~~i~~~~~~~a~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~ 210 (343) +++|..++.++.+++- .. .+. . -++.|.+|..+|.+.+ ..+++++|+|+ T Consensus 77 ~kvD~di~~~~~~a~l------------------~~--~~~-~--------t~d~i~~A~~~fgde~--~~~~vivv~p~ 125 (231) T protein:vir:73 77 NKVDDDLLKAAKTTSQ------------------TV--STK-A--------NVDGVQAALDIFNDED--AQAYVLIVNPK 125 (231) T ss_pred HhhhHHHHHhhccccc------------------cc--ccc-c--------cHHHHHHHHHHhcccc--ccceEEEEcch Confidence 9999999866543221 00 010 0 1566777888898876 46789999999 Q ss_pred HHHHHhccchhhhh-ccccccchhcceeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeE Q lcl|NC_011085. 211 VYSAILAALMPNAA-NYAALIDPERGSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGL 289 (343) Q Consensus 211 ~~~~Ll~~~~~~~~-~~~~~~~~~~G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l 289 (343) .|+.|.++.++... +..+...+++|.||.+.|++|+.|+++|.++.. .+-+ T Consensus 126 ~~~~Lrk~~~~~~~~~~~g~~i~~~G~iG~i~G~~Vi~S~~~~~~~~~----------------------------~~~~ 177 (231) T protein:vir:73 126 DAAKIRKDANAKNIGSEVGANALINGTYADVLGAQIVRSKKLAEGSAL----------------------------MFKI 177 (231) T ss_pred HHHhhhhccchhhhhhhhccceeeecccceEcceEEEEcCCCCCCcee----------------------------eeeE Confidence 99999998877664 355677899999999999999999999953210 0113 Q ss_pred eechhhheeeeeeeeEEeeeeccchhhhhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 
290 FQHRSAVGTVKLKDLSLERARRAEYQADQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 290 ~~~~~Av~~~~~~~~~~e~~~~~~~~~d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) ++.+.|++....+++++|..||++.+.|.|.+.+.|++++.+|++++.++++-= T Consensus 178 i~~~gAl~~~~k~~~~vEtdRd~~~k~~~i~~~~~y~v~l~~~~~vv~~t~~g~ 231 (231) T protein:vir:73 178 VSNSPALKLVLKRGVQVETDRDIVTKTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) T ss_pred EeeccceeeeecccceeeccccccccccEEEEeEEEEEEEEcCccEEEEEeecC Confidence 456789999999999999999999999999999999999999999999877533 No 52 >protein:vir:95107 Length: 270 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240822;genbank:gi:66394683;genbank:GeneID:5133901 Probab=99.94 E-value=2.4e-29 Score=177.48 Aligned_cols=261 Identities=15% Similarity=0.142 Sum_probs=201.7 Q ss_pred CCCCCccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccc-c--ccceEEEEeccCcc-eeeeecCC Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRS-I--SSGKSAQFPVLGRT-RAAYLQAG 76 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~-i--~~G~tv~i~~iG~~-t~~~~~~g 76 (343) ||.+.-. +-+..|+|+..|.+.+.+..+|.++....+ + +.|++|+||..... .+.++..| T Consensus 1 Ma~T~~~----------------d~I~Pev~~~~V~e~~~~~~~~~~~~~~d~~L~g~~G~ti~~P~~~~igdae~~~eg 64 (270) T protein:vir:95 1 MTQTKKA----------------NLINPEVLANVVSAQMQNAIRFTPYAVTDDTLVGQPGDTITRPKYAYIGAAEDLQEG 64 (270) T ss_pred CCceehh----------------hhcchHHHHHHHHHHHHhHHhhccccccccccCCCCCCEEEeeeecCCCccccccCC Confidence 6554211 223569999999999999999998887763 3 46999999987543 45678889 Q ss_pred CcCCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccc Q lcl|NC_011085. 77 QSLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENIA 156 (343) Q Consensus 77 ~~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~ 156 (343) +.|+. +.++.++.+.+|-+. ...|.|+|++...+..|++.+.+++++.+||+++|..++..+..+.... 
T Consensus 65 ~~i~~--~~lt~~~~~a~i~~~-gk~~~itD~a~~~~~~dp~~~~~~q~a~~~a~~~d~~li~~l~~a~~~~-------- 133 (270) T protein:vir:95 65 VAMDT--TQMSMTTTKVTVKET-GKAVEVTQTAIITNVNGTLQEASRQLAMSLADKVEIDYIAELNKSKQTA-------- 133 (270) T ss_pred Cccch--hhcccchheeeeehh-hCcceecHHHHhhhccchHHHHHHHHHHHHHHHHHHHHHHHhccccccc-------- Confidence 98865 568888889999665 6789999998888888999999999999999999999987664321100 Q ss_pred ccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhccccccchhcce Q lcl|NC_011085. 157 GLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAALIDPERGS 236 (343) Q Consensus 157 g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~ 236 (343) +.... .+.|.+|..+|.+.. ....+++|+|..|+.|+++..+.. .-.+...+++|. T Consensus 134 -----------------~~~~t----~~~~~dA~~~lgd~~--~~~~~i~vhs~~~~~Lrk~~~~~~-~~~~~~~~~~G~ 189 (270) T protein:vir:95 134 -----------------TVSAD----ATGILDAIEVFNSEN--DEDYVLYVNPKDYNKLVKSLFKVG-GNVQDRAISKGD 189 (270) T ss_pred -----------------ccccC----HHHHHHHHHHhcccc--CCCcEEEEcHHHHHHHHhhhcccc-cccccchhcccc Confidence 00011 344556677785543 345799999999999998764332 223456789999 Q ss_pred eEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEeeeeccchhh Q lcl|NC_011085. 237 IRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERARRAEYQA 316 (343) Q Consensus 237 V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~~~~~~~ 316 (343) |+.+.|++|+++.+.|... .+.++++.|++++..+++++|..||++++. T Consensus 190 ig~~~G~~Viv~s~~~~~~-------------------------------~~~l~~~gAi~~~~~~~~~vEtdRd~~~~~ 238 (270) T protein:vir:95 190 LVEIVGVSDIVKSKRVSEN-------------------------------TAFLQRYGAMEIVNKKKPEAYTDFDILKRT 238 (270) T ss_pred cceecceeEEEeCCCCCce-------------------------------eEEEEeccceeeeecCCceeeeccchhhcc Confidence 9999999998877665211 136788999999999999999999999999 Q ss_pred hhhhhhhhhccceecccceEEEEe-cCC Q lcl|NC_011085. 
317 DQIIARYAMGHGGLRPEAAGALVF-TAG 343 (343) Q Consensus 317 d~i~~~~~~G~~v~rpe~~~~i~~-~~g 343 (343) |.+.+++.||.++++|++++.+++ ++| T Consensus 239 d~i~~~~~y~v~~~~~skvv~~t~~~a~ 266 (270) T protein:vir:95 239 HLLSTNYHYSVNLKDETGVVKVTFKPSG 266 (270) T ss_pred cEEEeeeEEEEEEEccceEEEEEecCCC Confidence 999999999999999999999985 455 No 53 >protein:vir:105464 Length: 346 # NCBI annotation: putative phage major capsid protein # Family: family:all:701 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529874;genbank:gi:90592614;genbank:GeneID:3974528 Probab=99.94 E-value=6e-29 Score=175.30 Aligned_cols=284 Identities=10% Similarity=0.037 Sum_probs=185.5 Q ss_pred CCCCCccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhcc-C-----ccccccccceEEEEeccC-cceeeee Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTN-R-----HIMRSISSGKSAQFPVLG-RTRAAYL 73 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~-~-----~~~~~i~~G~tv~i~~iG-~~t~~~~ 73 (343) || .. |.++|+.+|++.|..+++... + .......+|++|+||++. .+.+++| T Consensus 1 Ma---------------------in-ya~~~~~~Ld~~~~~~~lts~~l~~~~~~~~v~~~ggktVkIp~is~tsGl~DY 58 (346) T protein:vir:10 1 MT---------------------IN-YAEKYQAAVQQAFYDGHLYSAELWNSPSNSIIKFDGAKHIKVPRLEITSGRKDR 58 (346) T ss_pred Cc---------------------ch-hHHHHHHHHHHHHHhhhccchhhcccccccceEecCCCEEEEEEeeeecccccc Confidence 11 11 568999999999988766532 2 112245689999999996 5679999 Q ss_pred cCCCcCCCccCCCccceEEEEeeeeeeeeeecc--chHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccc Q lcl|NC_011085. 74 QAGQSLDDKRKDIKHTEKTIVIDGLLTADVLIY--DIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAAS 151 (343) Q Consensus 74 ~~g~~i~~~~~~~~~~~~~l~iD~~~~~~~~Id--d~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~ 151 (343) +++..... ...++.+..+++|++.++|.|.|| |+||.+.......-+.+.+....+..+|.+.+..|+..+.... 
T Consensus 59 ~R~~g~~~-~g~v~~~~et~tl~qDR~~~F~vD~mDvDETn~~~~~anv~~ef~r~~vvPEiDayrfskLa~~a~~~~-- 135 (346) T protein:vir:10 59 QRRTITTP-VANYSNDWDSYELKNERYWSTLVDPSDIDETNMVVSLANITKQFNLDSKMPEKDRYMFSHLYSGKEAAH-- 135 (346) T ss_pred cccCCccc-ccccccceeEEEeeccccceecccccchHHHHHHhHHHHHHHHHHHHhhcchhhHHHHHHHHHhhhhhc-- Confidence 98664432 245788999999999999999999 6666654454444444556666778889887776654432211 Q ss_pred cccccccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhccccccc Q lcl|NC_011085. 152 NENIAGLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAALID 231 (343) Q Consensus 152 ~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~ 231 (343) ++... +...+ .+++++.|+++.++|+|+.||.+|||++|+|++|.+|+++++|......++.. T Consensus 136 --------~~~~~-----~~a~T----~~ni~~~i~~~~~~lde~~vp~~~rvl~vTp~~~~lLk~s~~f~k~~~v~~~~ 198 (346) T protein:vir:10 136 --------DGGIT-----TNTLD----EKNILPAFDNMMLDFDEARIPSTNRILYVTPKTNAILKRAEAMNRALTLKDPN 198 (346) T ss_pred --------ccccc-----ccccC----HHHHHHHHHHHHHHHHHccCCCCCeEEEECHHHHHHHhhchhheecccccccc Confidence 00000 11112 35678888899999999999999999999999999999999888654434444 Q ss_pred hhcceeEEEeceEEEE--eccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEeee Q lcl|NC_011085. 232 PERGSIRNVMGFEVVE--VPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERA 309 (343) Q Consensus 232 ~~~G~V~~i~Gf~V~~--sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~ 309 (343) ..+|+|++++||+|++ |++++..=.-.. +.. ... +.-...| ++.|++|+......+ .+..+ T Consensus 199 ~i~~~V~siDGv~Ii~VPs~r~~t~~~f~~-----G~~-----~~t-~ak~INf-----iiv~~~A~ia~~K~~-~~~if 261 (346) T protein:vir:10 199 NIQRTVYSLDDVTIRVVPSDLMQTAYDFSD-----GSK-----IID-TAKQIEM-----FLIYNGVQIAPEKYS-FVGFD 261 (346) T ss_pred ccceeeeeecCeEEEEcchhhcccchhhcc-----Ccc-----ccC-CccceeE-----EEECCceeeeeeeee-eeEee Confidence 5699999999999998 556652110000 000 000 0001122 788998876555444 33433 Q ss_pred ec-cchhh-hhhhhhhhhccceecccceEEEEec----CC Q lcl|NC_011085. 
310 RR-AEYQA-DQIIARYAMGHGGLRPEAAGALVFT----AG 343 (343) Q Consensus 310 ~~-~~~~~-d~i~~~~~~G~~v~rpe~~~~i~~~----~g 343 (343) -+ +...+ |.+..+.-+.+-++.....++..-- +| T Consensus 262 ~P~~~~~g~~l~~~R~Y~D~fv~~nk~~~Iyv~~~~a~~~ 301 (346) T protein:vir:10 262 QPSAATSGNYLYYEQSYDDVLLLNTKTKGIQFVVSDKPKK 301 (346) T ss_pred CCCCCcccceeeeeeeeeeeeeeccccceEEEeeeccccc Confidence 22 23333 5788888888888887666553322 22 No 54 >protein:vir:99523 Length: 311 # NCBI annotation: putative protein # Family: family:all:701 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958538;genbank:gi:41179320;genbank:GeneID:2717161 Probab=99.88 E-value=4.3e-24 Score=148.69 Aligned_cols=297 Identities=11% Similarity=0.084 Sum_probs=185.2 Q ss_pred CCccccccccccccccccchhHH-HHHHHHHHHHHHHHHhhhhccCcccc-cc-ccceEEEEeccCcceeeeecCCCcCC Q lcl|NC_011085. 4 MKGGQQLGKDQGKGQSGGDKLAL-FLKVFGGEVLTAFARTSVTTNRHIMR-SI-SSGKSAQFPVLGRTRAAYLQAGQSLD 80 (343) Q Consensus 4 ~~~~~~~~t~~g~~~~~~d~~al-~ie~~~g~V~~~f~~~s~~~~~~~~~-~i-~~G~tv~i~~iG~~t~~~~~~g~~i~ 80 (343) |.+- ++-.|| |.++|+.++++.|...++...+.+.. .+ .+|++|+||++....+++|++++... T Consensus 1 ~~~~-------------an~mAlnya~~~~~~Ld~~~~~~~~t~~l~~~~~~~~~Gak~VkIp~i~~~gl~dY~R~~g~~ 67 (311) T protein:vir:99 1 MPTD-------------AETRGFNYVTKDGNLLDQKITAGLFTAALGTPEVDLVNGGRSFTLKTISTSGLKDHTRGKGFN 67 (311) T ss_pred CCCc-------------chhhHHHHHHHHHHHHHHHHHhhhcccceecCchheeecCCEEEEEeeeeccccccccccCcc Confidence 2222 223455 78999999999999998776665433 34 48999999999999999999987543 Q ss_pred CccCCCccceEEEEeeeeeeeeeecc--chHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccccc Q lcl|NC_011085. 81 DKRKDIKHTEKTIVIDGLLTADVLIY--DIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENIAGL 158 (343) Q Consensus 81 ~~~~~~~~~~~~l~iD~~~~~~~~Id--d~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~g~ 158 (343) ..+++.+..++++++.+++.|.|| |+||.......-.-..+.......-.+|..-+..|+..+...... 
T Consensus 68 --~g~v~~~~et~tl~~DR~~~f~vD~mDvdETn~~~~~ani~~~f~r~~vvPEiDayrfskla~~a~~~~~~------- 138 (311) T protein:vir:99 68 --SGTISDEKTIYTMGQDRDVEFYLDRQDVDETDNELAMANISNVFITEHVQPELDSYRFSKIATSFDNLDGT------- 138 (311) T ss_pred --ccceeeeeeEEEeeeccceeeecchhchhhhhhhhHHHHHHHHHHHhhhcchhhHHHHHHHHhhhhccccc------- Confidence 467889999999999999999999 555544333322233334444566778888777776444322111 Q ss_pred CCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhh-cccc-ccchhcce Q lcl|NC_011085. 159 GSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAA-NYAA-LIDPERGS 236 (343) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~-~~~~-~~~~~~G~ 236 (343) ..+.....+. ......-..+++++.|..+..++++ ||.++|+++|+|++|.+|.+++.|... +... ...-.++. T Consensus 139 ~~~~~~~~~~--~~~~~~lt~~nvl~~l~~~~~~~~~--v~~~~rvl~vTp~~~~lLk~~~~~~r~~~~~~~~~~~i~~~ 214 (311) T protein:vir:99 139 DTEGTLLAKT--HKTEETLDETNAYSQLKTGIGKVRK--YGTQNLVGYVSSEVMDALERSKEFTRNITNQNVGTTALESR 214 (311) T ss_pred ccchhhhccc--cccccccCHHHHHHHHHHHHHHHHh--cCCCCeEEEEChHHHHHHhhchhhheeeecccccccccccc Confidence 0000000011 0111112245678888888888887 788999999999999999888777642 2211 12235888 Q ss_pred eEEEeceEEEEe---ccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEeeee--- Q lcl|NC_011085. 237 IRNVMGFEVVEV---PHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERAR--- 310 (343) Q Consensus 237 V~~i~Gf~V~~s---n~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~--- 310 (343) |+.++|++|++. +++...-....+... ..+.-...| ++.|++|+......+ .+..+- T Consensus 215 V~~lDgv~Ii~V~ps~r~~t~~~ft~G~~~-----------~~~ak~INf-----iiv~~~a~i~~~K~~-~v~~f~P~~ 277 (311) T protein:vir:99 215 ITSIDGVQLIEVYESNRFMTKYDFTDGAKP-----------TEDAKAINF-----LVVAKPAVISIVKEN-AVFLFAPGQ 277 (311) T ss_pred cceecCeEEEEecCchhhcchhhhcCCccc-----------cCcccccce-----EEeCCCeeeeeeeee-eeeeeCCCC Confidence 999999999975 345422110000000 000001222 788998776554433 233221 Q ss_pred ccchhhhhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 
311 RAEYQADQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 311 ~~~~~~d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) +.+-.+|.+..+.-+.+-++.....++..--.+ T Consensus 278 ~~~gd~~l~~~R~Y~D~fv~~nk~~~Iyv~~k~ 310 (311) T protein:vir:99 278 HTDGDGYLYQNRLYHDLFIKKHKRDGIFVSVKK 310 (311) T ss_pred CCCcceeeeeeeeeeeeeeeccccCeEEEeeec Confidence 223347888888888888888876665443333 No 55 >protein:vir:79712 Length: 285 # NCBI annotation: major capsid protein gp34 # Family: family:all:701 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285883;genbank:gi:148750840;genbank:GeneID:5220414 Probab=99.88 E-value=1.2e-24 Score=151.64 Aligned_cols=266 Identities=14% Similarity=0.057 Sum_probs=175.9 Q ss_pred hHH-HHHHHHHHHHHHHHHhhhhccCccc-----cccccceEEEEeccCc-ceeeeecCCCcCCCccCCCccceEEEEee Q lcl|NC_011085. 24 LAL-FLKVFGGEVLTAFARTSVTTNRHIM-----RSISSGKSAQFPVLGR-TRAAYLQAGQSLDDKRKDIKHTEKTIVID 96 (343) Q Consensus 24 ~al-~ie~~~g~V~~~f~~~s~~~~~~~~-----~~i~~G~tv~i~~iG~-~t~~~~~~g~~i~~~~~~~~~~~~~l~iD 96 (343) -++ +.++|+..+++.|...+++..+... ....+|++|+||++.. ..+++|+++...+ ..+++.+..+++++ T Consensus 1 Main~~~k~~~~ld~~~~~~~~~~~l~~~~n~~~~~~~gak~VkIp~ist~~gl~dY~R~~g~~--~g~v~~~~et~tl~ 78 (285) T protein:vir:79 1 MTVVLDSKDLARIDEEYKADSQVWSYLTGGNGVTQRFRGHNEVRINKLSGFVDATAYKRGQDNA--RKTISVGKETVKLT 78 (285) T ss_pred CcchhhHHHHHHHHHHHHHhhhhhhhcccCCcceeEecCCCEEEEeeecccccccccccccCcc--ccccceeeeEEEee Confidence 111 5689999999999988887766443 2456899999999964 6799999977543 45688899999999 Q ss_pred eeeeeeeeccchHHHHhchhhHHHHHHH-HHHHHHHHHHHHHHHHHHhhhhccccccccccccCCceeecccccccccch Q lcl|NC_011085. 97 GLLTADVLIYDIEDAMNHYDVRSEYTSQ-IGESLAMAADGAVLAELAGLCNMPAASNENIAGLGSASILEVGAKGDLTSP 175 (343) Q Consensus 97 ~~~~~~~~Idd~D~~q~~~d~~~~~~~~-~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~ 175 (343) +.+++.|.||.+|..++..=..+.++.+ ........+|..-+..++..+.. .. +... 
T Consensus 79 ~DR~~~f~iD~mDvdEn~~~~~~ni~~ef~~~~vvPEiDayrfskla~~a~~---------------~~-----~~~~-- 136 (285) T protein:vir:79 79 HEDWFGYDLDQFDMDENGAYTVENVVREHNKMITIPHRDKVAVQKLFDSAAK---------------KA-----TDSI-- 136 (285) T ss_pred ccccceecccccchhhhhhhhHHHHHHHHHhhhhcchhhHHHHHHHHhhccc---------------cc-----cccc-- Confidence 9999999999666555321112333333 23344567777666555432210 00 0111 Q ss_pred HHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhcccccc---chhcceeEEEec-eEEEEe--c Q lcl|NC_011085. 176 VELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAALI---DPERGSIRNVMG-FEVVEV--P 249 (343) Q Consensus 176 ~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~---~~~~G~V~~i~G-f~V~~s--n 249 (343) +.+++++.|.++.++|+|..|| .+||++|+|++|.+|++++.|...-..+.. .-.++.|+.++| ++|++. + T Consensus 137 --T~~nv~~~i~~~~~~lde~~vp-~~rvl~vTp~~~~~Lk~s~~~~r~~~~~~~~~~~~i~~~V~~lDg~v~ii~Vps~ 213 (285) T protein:vir:79 137 --TKDNALDAYDTAEAYMFDNEVP-GGFVMFVSSAYYTALKQSAAVTRTFSTDGTMVINGIDRRVAQLDGGVPIVRVSSD 213 (285) T ss_pred --CHHHHHHHHHHHHHHHHHcCCC-CceEEEEChHHHHHHHhhhhhheecccccceeccceeeeeccccceeEEEEcchh Confidence 2456788899999999999999 699999999999999999888764322221 224668999999 899984 3 Q ss_pred cccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEeeeecc---chhhhhhhhhhhhc Q lcl|NC_011085. 250 HLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERARRA---EYQADQIIARYAMG 326 (343) Q Consensus 250 ~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~~~---~~~~d~i~~~~~~G 326 (343) ++.....+ =...| ++.|++|+......+ .+..+-++ .--+|.+..+.-++ T Consensus 214 r~kt~~~~---------------------k~Inf-----iiv~~~a~i~~~K~~-~~~~f~P~~~~~~d~~~~~~R~Y~d 266 (285) T protein:vir:79 214 RLKGLGIT---------------------NHVNF-----ILTPLSAIAPIVKYD-SVSVIDPSTDRSGNRWTIKGLSYYD 266 (285) T ss_pred hccCcCcc---------------------hhccE-----EEecCceeccceeee-eeEeECCCCCCCcceeeeeeeeeee Confidence 44321100 01222 788998765554433 23332222 33467888888888 Q ss_pred cceecccceEEEE-ecCC Q lcl|NC_011085. 
327 HGGLRPEAAGALV-FTAG 343 (343) Q Consensus 327 ~~v~rpe~~~~i~-~~~g 343 (343) +-++.....++.. .++| T Consensus 267 ~fv~~nk~~~Iy~~~~a~ 284 (285) T protein:vir:79 267 AIVLDNAKKGIYVAATAG 284 (285) T ss_pred eeehhhccceeeeeeccc Confidence 8888776655533 4444 No 56 >protein:vir:95451 Length: 313 # NCBI annotation: hypothetical protein ORF044 # Family: family:all:11728 # MgeID: mge:1570 # MgeName: PA11 # Cross-refs: genbank:acc:YP_001294637;genbank:gi:149408203;genbank:GeneID:5237018 Probab=99.84 E-value=3.7e-24 Score=149.03 Aligned_cols=299 Identities=15% Similarity=0.149 Sum_probs=208.7 Q ss_pred cccccchhHHHH-HHHHHHHHHHHHHhhhhccCcc-ccccccceEEEEeccCcceeeeecCCCcCCCccCCCccceEEEE Q lcl|NC_011085. 17 GQSGGDKLALFL-KVFGGEVLTAFARTSVTTNRHI-MRSISSGKSAQFPVLGRTRAAYLQAGQSLDDKRKDIKHTEKTIV 94 (343) Q Consensus 17 ~~~~~d~~al~i-e~~~g~V~~~f~~~s~~~~~~~-~~~i~~G~tv~i~~iG~~t~~~~~~g~~i~~~~~~~~~~~~~l~ 94 (343) -+.+++..|+.. |+|+.+++-.+.++.+-..+.+ +-+.-.|++.||+.+|.++++.....+++.. .++++.+.++. T Consensus 1 ~~~TSNT~A~I~SE~~s~~I~~~LH~~LL~~~~~R~V~DF~~G~~L~I~tiGs~~~~~~~E~~~~~~--~~i~TGEIt~~ 78 (313) T protein:vir:95 1 MQLTSNTRAFIESEQYSKFILLNLHDGLLPETFYRNVSDFGSGETLHIKTIGSVTLQEAEEDTPLIY--NPIETGEITFQ 78 (313) T ss_pred CcccccchheehhhhHHHHHHHHhhccccchhhhhhhccCCCCCEEEecccCceeeeccccCCCeee--cccccceEEEE Confidence 122333345544 9999999888877754334443 4456689999999999999998887787765 56899999999 Q ss_pred eeeeeeeeeec-cchHHHHhchh-hHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccc-cccccccccCCceeeccccccc Q lcl|NC_011085. 
95 IDGLLTADVLI-YDIEDAMNHYD-VRSEYTSQIGESLAMAADGAVLAELAGLCNMPA-ASNENIAGLGSASILEVGAKGD 171 (343) Q Consensus 95 iD~~~~~~~~I-dd~D~~q~~~d-~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~-~~~~~~~g~~~~~~~~~~~~~~ 171 (343) |.+++--+.+| +|+-+.-..+| ++++...|.++|+.+.+...+|..- .+.+++ +....+.|++.-.+ ++.++ T Consensus 79 i~~Y~G~A~~vt~~LR~D~~~I~~~~A~~~AE~~RAI~E~~~TD~L~~G--~~~FA~~~~P~~vNG~PH~~V---~~~T~ 153 (313) T protein:vir:95 79 ITEYKGDAWYVTDDLREDGTDIDRLMAERAAESTRAIQETFETDFLKTG--AEYFAANPGPHNVNGFPHVIV---SAETN 153 (313) T ss_pred EEeecCChhhhhhhhhhcchhHHHHhhhcchhhHHHHHHHHhhHHHhhc--hhhhccCCCCcccccccceEE---eccCC Confidence 99988777777 45555556666 8999999999999999988776432 222333 33334555554333 22222 Q ss_pred ccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhh-hccccccchhcc------eeEEEeceE Q lcl|NC_011085. 172 LTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNA-ANYAALIDPERG------SIRNVMGFE 244 (343) Q Consensus 172 ~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~-~~~~~~~~~~~G------~V~~i~Gf~ 244 (343) .+ .-+..+..++-.+++.++|.+||+.||+|.....|-.-..+.+ ....+.-.+.+| .|.+++|++ T Consensus 154 ~~-------~~~~~~~~~~~~~~~a~~P~~G~v~IvDP~~~~~L~~l~~It~~vt~~~k~I~ESG~A~~~~Fi~~~YG~D 226 (313) T protein:vir:95 154 GV-------FALKHLIAMRLAFDKANVPAEGRVFIVDPVAEATLNGLVTITHDVTDFGKMILESGMARGQRFIMNLYGWD 226 (313) T ss_pred ce-------ehhhHHHHhhhhhhhccCCccceEEEEcchhhhhhhhhheeecccccccceeeeccCCchhHHHHHHhhhh Confidence 11 1144566778899999999999999999998887765433333 111222233444 678999999 Q ss_pred EEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEeeeeccchhhhhhhhhhh Q lcl|NC_011085. 245 VVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERARRAEYQADQIIARYA 324 (343) Q Consensus 245 V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~~~~~~~d~i~~~~~ 324 (343) ++.||.|...+-+....+.-+..++.|.+.. 
-.+-.-+..++.+.+++|.+++..+-.+.-...++ T Consensus 227 i~~SN~L~~AN~~D~~tT~~G~~~NlFM~i~--------------D~~~~P~~~AWr~MP~s~~~~~~~~~~~~~~~~~R 292 (313) T protein:vir:95 227 ILTSNRLHVANYNDGTTTGNGYVGNLFMCIL--------------DDQTKPIMGAWRRMPKSEGERNKDRARDEHVVRCR 292 (313) T ss_pred hhhhhhhhhccccccccccCceeeeeeeeee--------------cccccceeeeeccccccccccccccccccceeeee Confidence 9999999877665443332222233333322 12233455677778899999999999999999999 Q ss_pred hccceecccceEEEEecCC Q lcl|NC_011085. 325 MGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 325 ~G~~v~rpe~~~~i~~~~g 343 (343) ||.++.|-|.++++.+.+- T Consensus 293 ~G~Gi~R~~~L~~~~~~A~ 311 (313) T protein:vir:95 293 YGFGIQRLDTLGLLATSAT 311 (313) T ss_pred ecccceeecceeEEEeccc Confidence 9999999999999999888 No 57 >protein:vir:78090 Length: 302 # NCBI annotation: Cps # Family: family:all:701 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468790;genbank:gi:157325371;genbank:GeneID:5601852 Probab=99.84 E-value=1.7e-22 Score=140.00 Aligned_cols=284 Identities=13% Similarity=0.108 Sum_probs=184.7 Q ss_pred CCCCCccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCcccc---ccccceEEEEeccC-----cceeee Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMR---SISSGKSAQFPVLG-----RTRAAY 72 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~---~i~~G~tv~i~~iG-----~~t~~~ 72 (343) |||. . =|.++|+.++++.|...+++..+.... .+.+|++|+||.|. ++-+++ T Consensus 1 Mant-------------------l-~ya~~~~~~Ld~~~~~~~~t~~l~~~~~~v~~~Gak~vkIp~is~~~~~TsGl~d 60 (302) T protein:vir:78 1 MANS-------------------L-ALAQIYQDNIDKAIAVNSKSAFLEANPNNVQYNGGNTIKIADISFGSGTTGDLKA 60 (302) T ss_pred CCch-------------------h-HHHHHHHHHHHHHHHhhhceeecccCCceEEEecCcEEEEEEEEeeccccccccc Confidence 5533 1 177999999999999999877763322 46789999999995 556889 Q ss_pred ecCCCcCCCccCCCccceEEEEeeeeeeeeeeccchHHHHhc--hhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccc Q lcl|NC_011085. 
73 LQAGQSLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNH--YDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAA 150 (343) Q Consensus 73 ~~~g~~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~--~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~ 150 (343) |++++... ...++.+..++++++.+++.|.||-+|..+++ ...-.-..+.......-.+|..-+..|+..+.... T Consensus 61 y~R~~g~~--~g~v~~~~et~tlt~DR~~~f~vD~mDvdETn~~~~~ani~~ef~r~~vvPEiDayrfskla~~a~~~~- 137 (302) T protein:vir:78 61 YNRSTGFT--QGSVTLAWSDYTLDYDLAQSFQIDAMDVDETKNLATVGNVLSEYQRTKIVPAIDKYRFTKLANDGTGVG- 137 (302) T ss_pred cccccCcc--ccceeeeeeeEEeeeccceeeeccccchhhhhhhhHHHHHHHHHHHhhhcchhhHHHHHHHHHhhhccC- Confidence 99987543 35678888999999999999999955544443 32223333335556677888877766654332110 Q ss_pred ccccccccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhh-cc-cc Q lcl|NC_011085. 151 SNENIAGLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAA-NY-AA 228 (343) Q Consensus 151 ~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~-~~-~~ 228 (343) .....+.+...++++++.|..+.+.++++ ++|+++|+|+++.+|.+++.+... +. .. T Consensus 138 -----------------~~~~~~~~~~t~~nvl~~i~~~~~~~~e~----~~~vl~vtp~~~~~Lk~a~~~~~~~~~~~~ 196 (302) T protein:vir:78 138 -----------------GVIDLSKPDASAQALMGDIATAMELVDDS----NQLILVTSPTTLAGLLNTALIRESKNTQVL 196 (302) T ss_pred -----------------ccccccccchhHHHHHHHHHHHHHHhhcc----CCeEEEEChHHHHHHhcchhhccceecccc Confidence 00111122234678889999999999996 599999999999999988766532 11 11 Q ss_pred ccchhcceeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEee Q lcl|NC_011085. 229 LIDPERGSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLER 308 (343) Q Consensus 229 ~~~~~~G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~ 308 (343) ...-.+++|+.++|++|++.+.--....... ..++. ... +.-...| ++.|++|+......+ .+.. 
T Consensus 197 ~~~~i~~~V~~lDgv~Ii~VPs~r~~t~~~f---~~G~~-----~~~-~ak~INf-----iiv~~~a~ia~~K~~-~~~i 261 (302) T protein:vir:78 197 RRGEVDTKITFIQDVEVLQVPSEYLYDKVAP---KVGVP-----DYT-GAKKIPY-----MIFKRDAPTGIVKTD-KVRV 261 (302) T ss_pred ccccccceeeeecccEEEEchhhhcccceec---cCCcc-----ccC-CccceeE-----EEECCCeeeeeeeee-eeEe Confidence 2233488999999999998543221111110 00000 000 0011222 788998776555444 3333 Q ss_pred e-eccchhh--hhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 309 A-RRAEYQA--DQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 309 ~-~~~~~~~--d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) + +.+...+ |.+..+.-+.+-++.....++++-.-+ T Consensus 262 f~P~~~~~gd~~l~~~R~Y~D~fV~~nk~~gI~~~~~~ 299 (302) T protein:vir:78 262 FEPDTNQSADAYKVDLRLYHDLIVPKNQRPGIIKASFG 299 (302) T ss_pred eCCCCCCCcceeeeeeeeEeeeeeeccccCeEEEeecc Confidence 3 3345554 588999889999999988777776666 No 58 >protein:vir:9265 Length: 430 # NCBI annotation: 5 # Family: family:all:1412 # MgeID: mge:164 # MgeName: ST64T # Cross-refs: genbank:acc:NP_720329;genbank:gi:24371587;genbank:GeneID:955820 Probab=99.78 E-value=1.6e-20 Score=129.11 Aligned_cols=299 Identities=13% Similarity=0.069 Sum_probs=190.6 Q ss_pred CCCCCccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCcc---ccc---cccceEEEEeccCcceeeeec Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHI---MRS---ISSGKSAQFPVLGRTRAAYLQ 74 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~---~~~---i~~G~tv~i~~iG~~t~~~~~ 74 (343) |||. +...+++-..|.++.|+...++...+. ..+ -+.|++|.+|.--.....+ T Consensus 1 MAn~-------------------l~~~~~ii~~eal~~l~n~~v~a~~~~~~r~~d~~~~r~Gdti~~p~~~~~~~~~-- 59 (430) T protein:vir:92 1 MALN-------------------EGQIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQE-- 59 (430) T ss_pred Cccc-------------------hhhHHHHHHHHHHHHHhhhhhhhhhhcccCCchhhhhcccceEEecccccccccc-- Confidence 6665 222456777888999999988886433 222 2569999988765544433 Q ss_pred CCCcCCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccc Q lcl|NC_011085. 
75 AGQSLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNEN 154 (343) Q Consensus 75 ~g~~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~ 154 (343) |..+.+.+.++...++.++||+.+-..|.+.+-| +...+....+.+.+..+||.++|..++..++...+.... T Consensus 60 -G~~~t~~~~~i~e~~v~~~v~~~k~V~~~~~~ke--l~~~~~~~~~i~~Am~~LA~~Vd~dl~~~~~~~~~~v~~---- 132 (430) T protein:vir:92 60 -GWDLTDKATGLLELNVAVNMGEPDNDFFQLRADD--LRDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVIT---- 132 (430) T ss_pred -CcccCCCCCccccceEEEEEeeeccceEEechhH--hcChhHHHHHhHHHHHHHHHHHHHHHHHHhhhccccccc---- Confidence 6666665555666788999999999999998644 567777788889999999999999998765433222110 Q ss_pred ccccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcC-CcEEEeCHHHHHHHhcc-chhhhhccccccch Q lcl|NC_011085. 155 IAGLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSA-DRTFYTTPEVYSAILAA-LMPNAANYAALIDP 232 (343) Q Consensus 155 ~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~-gR~~vv~P~~~~~Ll~~-~~~~~~~~~~~~~~ 232 (343) .. .++....+++ +..+..+.+.|++..||.+ +|.++++|+.+..|... .++...+-.....+ T Consensus 133 ---~~------~~t~~~~~~~-------~~~~A~a~~~L~~~~vP~~~~R~~vldp~~~~~l~~~l~~l~~~~~~~~~A~ 196 (430) T protein:vir:92 133 ---SP------DAIGTNTADA-------WNFVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAY 196 (430) T ss_pred ---cc------ccCCCcCCcc-------hhhHHHHHHHHHHhcCCCCCCcEEEeChHHHHHHHhhhccccccccchhHHH Confidence 00 0111111221 4566678899999999995 89999999999998653 23333333345678 Q ss_pred hcceeEE-EeceE-EEEeccccccccccccccccccc------------------------------------ccccccc Q lcl|NC_011085. 233 ERGSIRN-VMGFE-VVEVPHLTAGGAGDDREDETTNQ------------------------------------KHAFPKT 274 (343) Q Consensus 233 ~~G~V~~-i~Gf~-V~~sn~lp~~~~~~~~~~~~~~~------------------------------------~~~~~~~ 274 (343) ++|.|++ +.||+ +|+++++|....+........+. 
+-.|.-+ T Consensus 197 r~g~i~~~~~Gfd~~~~~~~~~~~t~g~~t~~tv~gA~~~~~~~~~v~~~g~~~~~d~~~~tit~s~tg~l~~GD~ftia 276 (430) T protein:vir:92 197 RDGTIQRQVAGFDDVLRSPKLPVLTKSTATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFT 276 (430) T ss_pred hhccccccchhhhhhhhcCCcccccCccCcCceeccccccccccceecccccccccccccceeeeecccceecccEEEec Confidence 9999996 99995 78999999633222111100000 0000000 Q ss_pred cc--------------ccccc----------------------------c-------------------ccceEeEeech Q lcl|NC_011085. 275 AE--------------GDTKV----------------------------A-------------------LDNVVGLFQHR 293 (343) Q Consensus 275 ~~--------------~~~~~----------------------------~-------------------~~~~~~l~~~~ 293 (343) +- -.|.. . ......++||| T Consensus 277 GV~~v~~~tkq~~~~l~~F~Vt~~~~atsv~I~paii~~~~~~~~~~~~~y~nVsaspa~~aavTvv~~a~~~~Nl~fhr 356 (430) T protein:vir:92 277 GVKFLGQMAKNVLAQDATFSVVRVVDGTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWAD 356 (430) T ss_pred ceeeeccccccccCCccEEEEEEecCCceeEEeccccccccccccccccccceeccccccCceeEEeccCCcccceeEcc Confidence 00 00000 0 00023589999 Q ss_pred hhheeeeeee-----------------------eEEeeeeccchhhhhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 294 SAVGTVKLKD-----------------------LSLERARRAEYQADQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 294 ~Av~~~~~~~-----------------------~~~e~~~~~~~~~d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) +|+..+.... 
+.+-..+|.+..-..++.-..||.+.+|||.++++..-|- T Consensus 357 ~A~aLa~~pL~~~~~~~~~~~~~~~~~~~~Glsirv~~~yd~~~~~~~~r~DvLyG~~~v~Pe~a~v~l~g~~ 429 (430) T protein:vir:92 357 DAIRIVSQPIPANHELFAGMKTTSFSIPDVGLNGIFATQGDISTLSGLCRIALWYGVNATRPEAIGVGLPGQT 429 (430) T ss_pred cceEEEEecccCCCCHHHhhhhheeccccceEEEEEEEecccccCceEEEEeeeccceecCcceEEEEcCCCC Confidence 9988776543 1122235555555667778889999999999887776555 No 59 >protein:vir:100939 Length: 430 # NCBI annotation: Gp5 # Family: family:all:1412 # MgeID: mge:1509 # MgeName: ST104 # Cross-refs: genbank:acc:YP_006408;genbank:gi:46358700;genbank:GeneID:2777089 Probab=99.78 E-value=1.6e-20 Score=129.11 Aligned_cols=299 Identities=13% Similarity=0.069 Sum_probs=190.6 Q ss_pred CCCCCccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCcc---ccc---cccceEEEEeccCcceeeeec Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHI---MRS---ISSGKSAQFPVLGRTRAAYLQ 74 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~---~~~---i~~G~tv~i~~iG~~t~~~~~ 74 (343) |||. +...+++-..|.++.|+...++...+. ..+ -+.|++|.+|.--.....+ T Consensus 1 MAn~-------------------l~~~~~ii~~eal~~l~n~~v~a~~~~~~r~~d~~~~r~Gdti~~p~~~~~~~~~-- 59 (430) T protein:vir:10 1 MALN-------------------EGQIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQE-- 59 (430) T ss_pred Cccc-------------------hhhHHHHHHHHHHHHHhhhhhhhhhhcccCCchhhhhcccceEEecccccccccc-- Confidence 6665 222456777888999999988886433 222 2569999988765544433 Q ss_pred CCCcCCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccc Q lcl|NC_011085. 75 AGQSLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNEN 154 (343) Q Consensus 75 ~g~~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~ 154 (343) |..+.+.+.++...++.++||+.+-..|.+.+-| +...+....+.+.+..+||.++|..++..++...+.... 
T Consensus 60 -G~~~t~~~~~i~e~~v~~~v~~~k~V~~~~~~ke--l~~~~~~~~~i~~Am~~LA~~Vd~dl~~~~~~~~~~v~~---- 132 (430) T protein:vir:10 60 -GWDLTDKATGLLELNVAVNMGEPDNDFFQLRADD--LRDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVIT---- 132 (430) T ss_pred -CcccCCCCCccccceEEEEEeeeccceEEechhH--hcChhHHHHHhHHHHHHHHHHHHHHHHHHhhhccccccc---- Confidence 6666665555666788999999999999998644 567777788889999999999999998765433222110 Q ss_pred ccccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcC-CcEEEeCHHHHHHHhcc-chhhhhccccccch Q lcl|NC_011085. 155 IAGLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSA-DRTFYTTPEVYSAILAA-LMPNAANYAALIDP 232 (343) Q Consensus 155 ~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~-gR~~vv~P~~~~~Ll~~-~~~~~~~~~~~~~~ 232 (343) .. .++....+++ +..+..+.+.|++..||.+ +|.++++|+.+..|... .++...+-.....+ T Consensus 133 ---~~------~~t~~~~~~~-------~~~~A~a~~~L~~~~vP~~~~R~~vldp~~~~~l~~~l~~l~~~~~~~~~A~ 196 (430) T protein:vir:10 133 ---SP------DAIGTNTADA-------WNFVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAY 196 (430) T ss_pred ---cc------ccCCCcCCcc-------hhhHHHHHHHHHHhcCCCCCCcEEEeChHHHHHHHhhhccccccccchhHHH Confidence 00 0111111221 4566678899999999995 89999999999998653 23333333345678 Q ss_pred hcceeEE-EeceE-EEEeccccccccccccccccccc------------------------------------ccccccc Q lcl|NC_011085. 233 ERGSIRN-VMGFE-VVEVPHLTAGGAGDDREDETTNQ------------------------------------KHAFPKT 274 (343) Q Consensus 233 ~~G~V~~-i~Gf~-V~~sn~lp~~~~~~~~~~~~~~~------------------------------------~~~~~~~ 274 (343) ++|.|++ +.||+ +|+++++|....+........+. +-.|.-+ T Consensus 197 r~g~i~~~~~Gfd~~~~~~~~~~~t~g~~t~~tv~gA~~~~~~~~~v~~~g~~~~~d~~~~tit~s~tg~l~~GD~ftia 276 (430) T protein:vir:10 197 RDGTIQRQVAGFDDVLRSPKLPVLTKSTATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFT 276 (430) T ss_pred hhccccccchhhhhhhhcCCcccccCccCcCceeccccccccccceecccccccccccccceeeeecccceecccEEEec Confidence 9999996 99995 78999999633222111100000 0000000 Q ss_pred cc--------------ccccc----------------------------c-------------------ccceEeEeech Q lcl|NC_011085. 
275 AE--------------GDTKV----------------------------A-------------------LDNVVGLFQHR 293 (343) Q Consensus 275 ~~--------------~~~~~----------------------------~-------------------~~~~~~l~~~~ 293 (343) +- -.|.. . ......++||| T Consensus 277 GV~~v~~~tkq~~~~l~~F~Vt~~~~atsv~I~paii~~~~~~~~~~~~~y~nVsaspa~~aavTvv~~a~~~~Nl~fhr 356 (430) T protein:vir:10 277 GVKFLGQMAKNVLAQDATFSVVRVVDGTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWAD 356 (430) T ss_pred ceeeeccccccccCCccEEEEEEecCCceeEEeccccccccccccccccccceeccccccCceeEEeccCCcccceeEcc Confidence 00 00000 0 00023589999 Q ss_pred hhheeeeeee-----------------------eEEeeeeccchhhhhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 294 SAVGTVKLKD-----------------------LSLERARRAEYQADQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 294 ~Av~~~~~~~-----------------------~~~e~~~~~~~~~d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) +|+..+.... +.+-..+|.+..-..++.-..||.+.+|||.++++..-|- T Consensus 357 ~A~aLa~~pL~~~~~~~~~~~~~~~~~~~~Glsirv~~~yd~~~~~~~~r~DvLyG~~~v~Pe~a~v~l~g~~ 429 (430) T protein:vir:10 357 DAIRIVSQPIPANHELFAGMKTTSFSIPDVGLNGIFATQGDISTLSGLCRIALWYGVNATRPEAIGVGLPGQT 429 (430) T ss_pred cceEEEEecccCCCCHHHhhhhheeccccceEEEEEEEecccccCceEEEEeeeccceecCcceEEEEcCCCC Confidence 9988776543 1122235555555667778889999999999887776555 No 60 >protein:vir:2106 Length: 430 # NCBI annotation: coat protein # Family: family:all:1412 # MgeID: mge:46 # MgeName: P22 # Cross-refs: genbank:acc:NP_059630;genbank:gi:9635538;genbank:GeneID:1262831 Probab=99.78 E-value=2e-20 Score=128.58 Aligned_cols=299 Identities=13% Similarity=0.096 Sum_probs=188.9 Q ss_pred CCCCCccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCcc---cccc---ccceEEEEeccCcceeeeec Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHI---MRSI---SSGKSAQFPVLGRTRAAYLQ 74 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~---~~~i---~~G~tv~i~~iG~~t~~~~~ 74 (343) ||++ -++ ++++=-.|+++.|....+++.++. ..+. +.|+++.+|.--..... 
T Consensus 1 Ma~~--~~~-----------------~lti~~~eal~~~~n~lV~a~~~~~~r~~d~~~~r~Gdti~ip~p~~~~~~--- 58 (430) T protein:vir:21 1 MALN--EGQ-----------------IVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQ--- 58 (430) T ss_pred Cccc--cch-----------------hhHHHHHHHHHHhhhhhhhhhhhhccCCchhhhhcccceEEeecccccccc--- Confidence 7665 111 233322889999999999887533 2232 57999998865443322 Q ss_pred CCCcCCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccc Q lcl|NC_011085. 75 AGQSLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNEN 154 (343) Q Consensus 75 ~g~~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~ 154 (343) .|..+.++..++...++.++||+.+-..|.+.+ +| +...|....+.+.+..+||.++|..++..++........ T Consensus 59 ~G~~~t~~~~~~~e~~v~~~~~~~~~V~~~~~~-kE-l~~~~~~er~l~pAm~~LA~~Vd~dl~~~~~~~~~~v~~---- 132 (430) T protein:vir:21 59 EGWDLTDKATGLLELNVAVNMGEPDNDFFQLRA-DD-LRDETAYRRRIQSAARKLANNVELKVANMAAEMGSLVIT---- 132 (430) T ss_pred ccccccCCCccceeeeEeEEEeeeccceEEeeh-hH-hcChhhHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcccc---- Confidence 255555655567778889999999988888874 33 567778889999999999999999998776543322110 Q ss_pred ccccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcC-CcEEEeCHHHHHHHhcc-chhhhhccccccch Q lcl|NC_011085. 155 IAGLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSA-DRTFYTTPEVYSAILAA-LMPNAANYAALIDP 232 (343) Q Consensus 155 ~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~-gR~~vv~P~~~~~Ll~~-~~~~~~~~~~~~~~ 232 (343) .. .++.+..+++ ++.+..++..|++..||.+ +|.++++|+.+..|... 
.++...+-.+...+ T Consensus 133 ---~~------~~t~~~~~~~-------~~~~A~a~~~L~~~~vP~~~~R~~~~~p~~~~~l~~~l~~~~~~~~~~~~A~ 196 (430) T protein:vir:21 133 ---SP------DAIGTNTADA-------WNFVADAEEIMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAY 196 (430) T ss_pred ---cc------CCCCCCCCcc-------hhhHHHHHHHHHHhcCCCCCCcEEEeChHHHHHHhhhhccccccccchhHHH Confidence 00 0111112222 4566677889999999995 79999999999988653 33444444455678 Q ss_pred hcceeEE-EeceE-EEEeccccccccccccccccccc------------------------------------ccccccc Q lcl|NC_011085. 233 ERGSIRN-VMGFE-VVEVPHLTAGGAGDDREDETTNQ------------------------------------KHAFPKT 274 (343) Q Consensus 233 ~~G~V~~-i~Gf~-V~~sn~lp~~~~~~~~~~~~~~~------------------------------------~~~~~~~ 274 (343) ++|.|++ +.||+ +|+++++|....+........+. +-.|.-+ T Consensus 197 r~g~i~r~~~Gfd~~~~s~~~~~~t~gt~t~~tv~gA~~~~~~~~tv~~~g~~~~~d~~~~~it~s~tg~l~~GD~ftia 276 (430) T protein:vir:21 197 RDGTIQRQVAGFDDVLRSPKLPVLTKSTATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGMKRGDKISFA 276 (430) T ss_pred hhcccccccchhhhhhhcCCcccccCccCcCceeccccccccccceeccccccccccccceeeeeecccceecccEEEec Confidence 9999996 99996 78999999633222111100000 0000000 Q ss_pred c------------------------ccc-cc-----------------cc-------------------ccceEeEeech Q lcl|NC_011085. 275 A------------------------EGD-TK-----------------VA-------------------LDNVVGLFQHR 293 (343) Q Consensus 275 ~------------------------~~~-~~-----------------~~-------------------~~~~~~l~~~~ 293 (343) + ++. .. .. ......++||+ T Consensus 277 GV~~v~~itk~~~~~l~qf~V~a~~~~ttv~I~Pai~~~~~~~~~~~~~~y~nVsaspa~~aavT~v~~a~~~~Nl~fh~ 356 (430) T protein:vir:21 277 GVKFLGQMAKNVLAQDATFSVVRVVDGTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWAD 356 (430) T ss_pred ceeeeccccccccCCcceEEEEEecCCceeEEeecccccccccccccccccceeccccccCceeEEeccCCcccceeEcc Confidence 0 000 00 00 00023489999 Q ss_pred hhheeeeeee-----------------------eEEeeeeccchhhhhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 
294 SAVGTVKLKD-----------------------LSLERARRAEYQADQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 294 ~Av~~~~~~~-----------------------~~~e~~~~~~~~~d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) +|+..+.... +.+-..+|.+.....++.-..||.+.+|||.++++..-|- T Consensus 357 ~A~~La~~pl~~p~~~~~~~~~~~~~~~~~Glsirv~~~yd~~~~~~~~r~DilyG~~~l~Pe~a~v~l~g~~ 429 (430) T protein:vir:21 357 DAIRIVSQPIPANHELFAGMKTTSFSIPDVGLNGIFATQGDISTLSGLCRIALWYGVNATRPEAIGVGLPGQT 429 (430) T ss_pred ceeEEEEecccCCCChhHhhheeeeeccccceEEEEEEccccccCceEEEEEeecCccccCcceEEEEcCCCC Confidence 9988776543 1122224555556677888899999999999887776555 No 61 >protein:vir:41 Length: 299 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463467;swissprot:trembl:q9t1b7;genbank:gi:16798789;uniprot:Q9T1B7;genbank:GeneID:922353 Probab=99.66 E-value=3.1e-17 Score=111.06 Aligned_cols=281 Identities=14% Similarity=0.117 Sum_probs=175.1 Q ss_pred ccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEeccCcceeeeecCCCcCCCccCCCccc Q lcl|NC_011085. 10 LGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVLGRTRAAYLQAGQSLDDKRKDIKHT 89 (343) Q Consensus 10 ~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG~~t~~~~~~g~~i~~~~~~~~~~ 89 (343) ++.++-.+..+++.-.+..+.++.++.+..++.++++.+.++.++. +.+.++|....+.+..+.+|+.++.+ +++.+ T Consensus 1 ~g~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~~~~~~~~~-~~~~~~~~~~~~~a~~v~E~~~~~~~--~~~f~ 77 (299) T protein:vir:41 1 MGFNPDTTTMQSAKTGSIPINISEQIITGVKNGSAAMKLAKAVPMT-KPEEEFTFMSGVGAFWVDEAERIQTS--KPTFT 77 (299) T ss_pred CCcCCCcccccCCCceecchhHHHHHHHHHHhcchhhhhceeeecC-CCcEEEEEEcCCceeeeecCcccccc--cccee Confidence 3333333333333334567999999999999999999998877764 56778888887888888888888654 46677 Q ss_pred eEEEEeeeeeeeeeeccchHHHH-hchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccccCCceeecccc Q lcl|NC_011085. 
90 EKTIVIDGLLTADVLIYDIEDAM-NHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENIAGLGSASILEVGA 168 (343) Q Consensus 90 ~~~l~iD~~~~~~~~Idd~D~~q-~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~g~~~~~~~~~~~ 168 (343) ++++...+. +..+.|.+ +-.+ +..|+.+.+.++.++++++.+|+.++.- .... .+.|. +.... T Consensus 78 ~v~l~~~k~-~~~~~is~-ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l~G----~g~~-----~~~gi-----l~~~~ 141 (299) T protein:vir:41 78 KAKMRSKKM-GVIIPTTK-ENLNYSVTNFFSLMQAEIVEAFYKKFDQAVFTG----VESP-----YNWNI-----LKSAT 141 (299) T ss_pred EEEEeeEEE-EEeehhhH-HHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhhc----ccCc-----ccccc-----ccccc Confidence 777777554 34455654 3333 4588999999999999999999988732 1100 11111 11000 Q ss_pred cccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhccccccchhcceeEEEeceEEEEe Q lcl|NC_011085. 169 KGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAALIDPERGSIRNVMGFEVVEV 248 (343) Q Consensus 169 ~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~V~~i~Gf~V~~s 248 (343) ....+ .......++.|.++...|...+.+ +-.++++|..|..|.+-.. .+..+........ ..++++|.+|+.+ T Consensus 142 ~~~~~--~~~~~~~~~~l~~~~~~l~~~~~~--~~~~v~n~~~~~~L~~lkd-~~G~~l~~~~~~~-~~~~l~G~PV~~~ 215 (299) T protein:vir:41 142 DASNL--VEETANKYDDLNEAIGLIEAEDLE--PNGIATIRKQRVKYRSTKD-GNGMPIFNTATSN-GVDDVLGLPIAYT 215 (299) T ss_pred cccee--eccccccHHHHHHHHHhhhcccCC--cCEEEEcHHHHHHHHHhhc-cCCceeecCCcCC-CCceecceeeEEe Confidence 00000 000111245666677778777764 3357999999999886322 1223322223333 3468999999999 Q ss_pred ccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEeeeeccc--------------h Q lcl|NC_011085. 249 PHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERARRAE--------------Y 314 (343) Q Consensus 249 n~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~~~~--------------~ 314 (343) +++|.... +...++.+.+-+..+..+++++|..++.. 
+ T Consensus 216 ~~~~~~~~----------------------------~~~~~~gdfs~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~ 267 (299) T protein:vir:41 216 PKYTFGDK----------------------------DISELVGDWNQAYYGILRGVEYEILTEATLTTVADETGKPLNLA 267 (299) T ss_pred cccCCCCC----------------------------ceEEEEEecccEEEEEecCcEEEEeecccccccccccccchhhh Confidence 99984321 01112222222223444555666655432 2 Q ss_pred hhh--hhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 315 QAD--QIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 315 ~~d--~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) +.+ .++...++|.++.+|++.+.|+.+++ T Consensus 268 ~~~~~~~r~~~~~d~~v~~~~A~~~l~~~aa 298 (299) T protein:vir:41 268 ERDMAAIKATFEVGFMVVKDEAFSAVQPKAG 298 (299) T ss_pred hcCcEEEEEEEEeccEEecccceEEEEeccC Confidence 233 34666788999999999999999999 No 62 >protein:vir:78523 Length: 338 # NCBI annotation: Putative head structural protein # Family: family:all:507 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491585;genbank:gi:157786408;genbank:GeneID:5625675 Probab=99.64 E-value=3.2e-17 Score=110.98 Aligned_cols=307 Identities=11% Similarity=0.023 Sum_probs=172.0 Q ss_pred CCCCCcccccccc-ccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEeccC---------ccee Q lcl|NC_011085. 1 MADMKGGQQLGKD-QGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVLG---------RTRA 70 (343) Q Consensus 1 ~~~~~~~~~~~t~-~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG---------~~t~ 70 (343) ||.++.-..+.+- ..++......-+|+.+.|..++.+..++.|.++.+.++.++. +..++||++. ..++ T Consensus 1 ~~~~~e~~~~~~~~~~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~l~~~~~~~-~~~~~ip~~~~~~~a~~v~~~~~ 79 (338) T protein:vir:78 1 MATLNELAPNTAGSNHQGRLAHVPSDLLPKEIVGPIFDKAQESSLVLRLGENIPIS-YGETIIPTTVKRPEVGQVGVGTS 79 (338) T ss_pred CcchHHhhhhhcccccccceecccccccchHHHHHHHHHHHhhchhhhhcceeecc-CCceEEEEEecCccceeeccccc Confidence 7777654443221 223444434445788999999999999999999998887764 5677777752 2334 Q ss_pred eeecCCCcCCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccc Q lcl|NC_011085. 
71 AYLQAGQSLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAA 150 (343) Q Consensus 71 ~~~~~g~~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~ 150 (343) .....|+.++.+ +++..++++..-+. +.-..|.+-=-.++.+|+.+.+.++.++++++.+|+.++.--. +.. + T Consensus 80 ~~~~Eg~~~~~~--~~~f~~v~l~~~k~-~~~~~is~ell~ds~~~~~~~i~~~la~a~~~~~d~~~l~G~g--~~~--~ 152 (338) T protein:vir:78 80 NEQREGGTKPLS--GTAWDTRSVAPIKL-ATIVTVSEEFARMNPSGLYTKLQADLAYAIGRGIDLAVFHGKS--PLT--G 152 (338) T ss_pred cccccccccccc--ccceeEEEEEEEEE-EEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHhhcccC--CCc--c Confidence 444455555432 34556666655433 2334454311123568999999999999999999998873211 000 0 Q ss_pred ccccccccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhh--cccc Q lcl|NC_011085. 151 SNENIAGLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAA--NYAA 228 (343) Q Consensus 151 ~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~--~~~~ 228 (343) ..+.+.............+ .........++.|.++...+. .+........+++|..|..|++.....+. .+.- T Consensus 153 --~~~~gi~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~m~~~~~~~L~~~~~l~d~~g~~l~ 227 (338) T protein:vir:78 153 --SALQGIDTNNVIVNTTNVD--YLQTGTTPLLDRFLDGYDLVS-ANTDVDFNGWAADPRYRARLLRSQAYRDANGNVDP 227 (338) T ss_pred --ccccccccccccccccccc--cccccchhhHHHHHHHHHHhh-hhccccceEEEEchHHHHHHHHHhhhccCCCceee Confidence 0011111111110000001 111112234566666555543 33333445788999999998765544332 2333 Q ss_pred ccchhcceeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEee Q lcl|NC_011085. 229 LIDPERGSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLER 308 (343) Q Consensus 229 ~~~~~~G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~ 308 (343) ......|..++++|++|+.++++|........ ++...++...+....+..++++++. 
T Consensus 228 ~~~~~~~~~~~l~G~PV~~~~~ip~~~~~~~~-----------------------~~~~~~~gdfs~~~~~~~~~~~i~~ 284 (338) T protein:vir:78 228 TRINLAASAGDLLGLPVQFGKAVGGDLGAATD-----------------------SKVRVVGGDFSQLKYGFADEIRVKM 284 (338) T ss_pred cccccCCCCceeeeeeEEEccccCccccccCC-----------------------cccEEEEEecceEEEEeecccEEEE Confidence 34456677789999999999999953211100 0111122222222233344455555 Q ss_pred eeccc--------------hhh--hhhhhhhhhccceecccceEEEEec-CC Q lcl|NC_011085. 309 ARRAE--------------YQA--DQIIARYAMGHGGLRPEAAGALVFT-AG 343 (343) Q Consensus 309 ~~~~~--------------~~~--d~i~~~~~~G~~v~rpe~~~~i~~~-~g 343 (343) .++.. ++. ..++..+++|.+++||++.+.|+-- ++ T Consensus 285 ~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~ 336 (338) T protein:vir:78 285 SDTATLTDNTSPTPQTVSMWQTNQIAILIEVTFGWLLGDKQAFVKFVDDEDP 336 (338) T ss_pred eecccccccccccccchhhhhcCcEEEEEEEEeccEeecccceEEEecccCC Confidence 54321 112 2356778899999999998776543 23 No 63 >protein:vir:6242 Length: 390 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813696;swissprot:trembl:q859c1;genbank:gi:29366756;interpro:IPR006444;uniprot:Q859C1;genbank:GeneID:1258897 Probab=99.61 E-value=3.8e-17 Score=110.60 Aligned_cols=289 Identities=11% Similarity=0.063 Sum_probs=163.9 Q ss_pred CCCCCccccccccccccccccchhHHHH-HHHHHHHHHHHHHhhhhccCccccccccceEEEEecc-CcceeeeecCCCc Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQSGGDKLALFL-KVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVL-GRTRAAYLQAGQS 78 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~i-e~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~i-G~~t~~~~~~g~~ 78 (343) +....-+.........+..+++. .+.+ +++...+....+..++++.+.++....++..+.||+. |...+.....|+. 
T Consensus 97 ~~~~~r~~~~~~~~~~~t~~~~g-~~~~~~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~p~~~~~~~a~wv~E~~~ 175 (390) T protein:vir:62 97 NLGEARSFEFAPEKRDGTKAGNP-NVLSRTLYGQLIAQAVERSAIMRGGATTFTTSDANPLDFTVITGRSSASIVGETAE 175 (390) T ss_pred hhhhhHHHHhhhhhhcccccCCC-ccccccchHHHHHHHHhhhhhhhhcceeeecCCCceeEEEEEcCCcceeeeccccc Confidence 11100000000000011111111 1234 5666667767777788888888777777788999877 5556666777888 Q ss_pred CCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccccc Q lcl|NC_011085. 79 LDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENIAGL 158 (343) Q Consensus 79 i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~g~ 158 (343) ++.+ +++..++++.+-+.- .-..|.+-=-.++.+|+.+.+.++.++++++..|+.++. +.. . +.|. T Consensus 176 ~~~~--~~~f~~i~~~~~k~~-~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~l~----G~G---~----p~Gi 241 (390) T protein:vir:62 176 IPES--YPATAQRSMGGFKYG-FASVVSYEFATDQVLDLVGFLVSDAGPAIGDAMGRHFIT----GTG---Q----PRGI 241 (390) T ss_pred cccc--ccceeeeEeeeeeEE-eehHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHhhhhc----cCC---c----cccc Confidence 7664 456777777775542 334454322224667999999999999999999998862 111 1 1111 Q ss_pred CCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhccccccchhcceeE Q lcl|NC_011085. 159 GSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAALIDPERGSIR 238 (343) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~V~ 238 (343) .................... ++.|.++...|+..... +-..|++|..|..|.+-.. .+..|.-...+..|... 
T Consensus 242 ~~~~~~~~~~~~~~~~~~~~----~~~l~~~~~~l~~~~~~--~a~~vmn~~~~~~L~~lkd-~~g~~l~~~~~~~g~~~ 314 (390) T protein:vir:62 242 LTDASPATATFLATDTDSKV----SDALIDLFHEVPSAYRA--NAKYVVNDLRAAQMRKLKD-ANGQYLWQSGLTVGAPS 314 (390) T ss_pred cccccccccceecccccccc----hHHHHHHHHhhhhhhhc--CCEEEEchHHHHHHHHhhc-cCCCeeecCCcCCCccc Confidence 11100000000111111112 34444555566655432 3356889999998854211 12234333445667777 Q ss_pred EEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEeeeeccchhhh- Q lcl|NC_011085. 239 NVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERARRAEYQAD- 317 (343) Q Consensus 239 ~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~~~~~~~d- 317 (343) +++|++|+.++++|...+ +-++|+.. ......+++++...+..+..| T Consensus 315 ~l~G~Pv~~~~~~p~~~i----------------------~~gd~s~~----------~i~~~~~~~v~~~~~~~~~~~~ 362 (390) T protein:vir:62 315 LFNGKVVETDDGMPADKI----------------------LFADLSKY----------RVRFAGSLRVDRSVDAKFSTDQ 362 (390) T ss_pred eecccceEEecCCCCccE----------------------EEeeccce----------eEEeecceEEEeeccccccCCc Confidence 899999999999984211 01233221 122234455555555544333 Q ss_pred -hhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 318 -QIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 318 -~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) .+++.+++|+++++|+++.+|+++++ T Consensus 363 ~~~~~~~r~d~~~~~~~A~~~l~~~~~ 389 (390) T protein:vir:62 363 IVYRFLQRADGLLVDARGAKVLTVTPG 389 (390) T ss_pred EEEEEEEEeCcEeechhheEEEEeecC Confidence 45788899999999999999999999 No 64 >protein:vir:1328 Length: 392 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047927;swissprot:trembl:q9zwv6;genbank:gi:9631145;uniprot:Q9ZWV6;genbank:GeneID:2715889 Probab=99.59 E-value=1.1e-16 Score=108.11 Aligned_cols=292 Identities=13% Similarity=0.061 Sum_probs=164.1 Q ss_pred CCCCCccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEecc-CcceeeeecCCCcC Q lcl|NC_011085. 
1 MADMKGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVL-GRTRAAYLQAGQSL 79 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~i-G~~t~~~~~~g~~i 79 (343) +.....+.........+-.+++..-+-.+++...+.....+.++++.+.++....++..+.+|+. +.+++.-+..|+.+ T Consensus 97 ~~~~~~~~~~~~~~~~~t~~~~g~~~~~~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~ 176 (392) T protein:vir:13 97 NLGEARSFEFAPEKRDGTKAGNPNVLSRTLYGQLIAQAVERSAIMRGGASTFTTSDANPMDFTVITGRATAGIVGETAEI 176 (392) T ss_pred chhhhHHHHhhhhhhcccccCCCccccccchHHHHHHHHhhhhhhhhcceeeecCCCceeEEEEEcCCcceeeecccccc Confidence 10000000000000001111111112236777888888888889888888777777788888776 44666667778877 Q ss_pred CCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccccC Q lcl|NC_011085. 80 DDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENIAGLG 159 (343) Q Consensus 80 ~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~g~~ 159 (343) +.+ +++.+++++.+-+. +.-..|.+-=-.++.+|+.+.+.++.++++++..|+.+|.- .++ ..+.|.. T Consensus 177 ~~~--~~~f~~v~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~l~G--~Gt-------~~p~Gil 244 (392) T protein:vir:13 177 PES--YPATTQRSMGGFKY-GFASVVSYEFATDQVLDLVGFLVSDAGPAIGDAMGRHFLTG--TGT-------GQPRGIL 244 (392) T ss_pred ccc--ccceeeEEeeeeeE-EeeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhcc--cCC-------ccccccc Confidence 654 35667777766543 23344543222235678999999999999999999988731 111 1111211 Q ss_pred CceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhccccccchhcceeEE Q lcl|NC_011085. 160 SASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAALIDPERGSIRN 239 (343) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~V~~ 239 (343) ..........+..... ...++.|.++...|...... .. ..|++|..+..|..-.. 
.+..|.-...+..|...+ T Consensus 245 ~~~~~~~~~~~~~~~~----~~~~d~l~~~~~~l~~~~~~-~a-~~v~n~~~~~~l~~lkd-~~G~~l~~~~~~~g~~~~ 317 (392) T protein:vir:13 245 TDATGANAAFGEADAD----SKVSDALIDLFHEVPSAYRK-NA-KFVVNDLRAAQMRKLKD-ANGQYLWQSALTVGAPDT 317 (392) T ss_pred cccccccccccccccc----cccHHHHHHHHHhhhhhhhc-CC-EEEEcHHHHHHHHHhhc-cCCceeecCCcCCCCCce Confidence 1111000111111111 11244455555556554321 23 45779999998864221 122232223455676779 Q ss_pred EeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEeeeeccchhh--h Q lcl|NC_011085. 240 VMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERARRAEYQA--D 317 (343) Q Consensus 240 i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~~~~~~~--d 317 (343) ++|.+|+.++++|...+ +-++|+. ...+....++++.++++.+.. . T Consensus 318 l~G~Pv~~~~~~~~~~i----------------------~~Gdf~~----------~~i~~~~~~~i~~~~~~~~~~~~~ 365 (392) T protein:vir:13 318 FNGKVVETDDGMPADKV----------------------LFADLSK----------YRVRFAGSLRVDRSVDAKFSTDQI 365 (392) T ss_pred ecceeeEEcCCCCCCcE----------------------EEeeccc----------eeEEeecceEEEeeccccccCCcE Confidence 99999999999984211 0122322 222333455666665554433 4 Q ss_pred hhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 318 QIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 318 ~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) .+++..++|.++++|++.+.++++++ T Consensus 366 ~~r~~~r~d~~~~~~~A~~~~~~~~a 391 (392) T protein:vir:13 366 VYRFLQRADGLLVDARGAKVLTVTPA 391 (392) T ss_pred EEEEEEEeccEEecccceEEEEeecc Confidence 56788899999999999999999999 No 65 >protein:vir:78223 Length: 333 # NCBI annotation: Putative major head protein # Family: family:all:966 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491666;genbank:gi:157786490;genbank:GeneID:5625701 Probab=99.59 E-value=3.5e-16 Score=105.30 Aligned_cols=306 Identities=13% Similarity=0.107 Sum_probs=169.2 Q ss_pred CCCCCcccc--ccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEecc-CcceeeeecCCC Q lcl|NC_011085. 
1 MADMKGGQQ--LGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVL-GRTRAAYLQAGQ 77 (343) Q Consensus 1 ~~~~~~~~~--~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~i-G~~t~~~~~~g~ 77 (343) ||.++.-.. +++.+ .+......-+|+.+.+..++.+..++.|.++.+.++.++.+ ...++|+. +.+++.....|. T Consensus 1 ~a~l~el~~~~~~~~~-~g~~~~~~~~liP~~~~~~ii~~l~~~s~l~~~~~~~~~~~-~~~~~p~~~~~~~a~~v~eg~ 78 (333) T protein:vir:78 1 MATLNELLPNSAGSNH-QGRLAHVPSDLLPKEIVGPIFDKAQESSLVLRMGEQIPISY-GETIIPTTVKRPEVGQVGVGT 78 (333) T ss_pred CchhHHhhhhcccccc-cCceecCCccccchhHHHHHHHHHHhhchhhhhcceeeccC-CceEEEEEeCCceeEeecCcc Confidence 777765532 22222 33333333347889999999999999999999988877654 55567765 444444444343 Q ss_pred cCCCc------cCCCccceEEEEeeeeeeee-eeccchHHH-HhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccc Q lcl|NC_011085. 78 SLDDK------RKDIKHTEKTIVIDGLLTAD-VLIYDIEDA-MNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPA 149 (343) Q Consensus 78 ~i~~~------~~~~~~~~~~l~iD~~~~~~-~~Idd~D~~-q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~ 149 (343) ..... ...++..++++ ...+... ..|.+ +-. ++..|+.+.+.++.++++++.+|+.++.- ..... T Consensus 79 ~~~~~e~~~~~~~~~~f~~i~l--~~~kl~~~~~is~-ell~~s~~~~~~~i~~~la~ai~~~~d~~~l~G----~g~~~ 151 (333) T protein:vir:78 79 SNEQREGGLKPLSGTAWDTRSV--SPIKLATIVTVSE-EFARMNPSGLYTKLQGDLAYAIGRGIDLAVFHG----KSPLT 151 (333) T ss_pred cccccccccccccccceeEEEE--eeEEEEEeehhhH-HHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhcc----cCCCC Confidence 22110 01233334444 4444444 33443 222 46788999999999999999999988732 11111 Q ss_pred cccccccccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhh--hccc Q lcl|NC_011085. 150 ASNENIAGLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNA--ANYA 227 (343) Q Consensus 150 ~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~--~~~~ 227 (343) + ..+.|......+...+ ........+...++.|+++...+..+. ....-..+++|..|..|++.....+ ..|. 
T Consensus 152 ~--~~~~g~~~~~~~~~~~--~~~~~~~~~~~~~~~i~~~~~~~~~~~-~~~~~~~vmn~~~~~~L~~~~~~~d~~G~~i 226 (333) T protein:vir:78 152 G--SALQGIDTDNVIANTT--NVDYLQETGDPLLDRLLDGYDLVSANT-DVEFNGWAVDPRFRAHLLRAQAYRDANGNVD 226 (333) T ss_pred C--cccccccccccccccc--cccccccccchhHHHHHHHHHhhcccc-ccCceEEEEcchHHHHHHHHhhhcCCCCcee Confidence 1 1111111111111111 001111112234566666665555432 2233467889999999987554433 2343 Q ss_pred cccchhcceeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEe Q lcl|NC_011085. 228 ALIDPERGSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLE 307 (343) Q Consensus 228 ~~~~~~~G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e 307 (343) -......|..++++|++|+.|+++|........ .+...++...+-+..+..+.++++ T Consensus 227 ~~~~~~~~~~~~l~G~Pv~~~~~i~~~~~~~~~-----------------------~~~~~~~gD~~~~~~g~~~~~~i~ 283 (333) T protein:vir:78 227 PSRINLAAQTGDVLGLPAQFGRAVGGDLGAAVD-----------------------SKTRIIGGDFSQLKFGFADEIRIK 283 (333) T ss_pred ecCccccCCCceeeceeeEEccccCCCccccCC-----------------------CccEEEEEecccEEEEEeeccEEE Confidence 344566677889999999999999954211100 011112333332333444555555 Q ss_pred eeeccc-----------hhhh--hhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 308 RARRAE-----------YQAD--QIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 308 ~~~~~~-----------~~~d--~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) ..++.. 
++.| .++..++++.++++|++.+.|+-..= T Consensus 284 ~~~~~~~~~~~~~~~~~~~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~a 332 (333) T protein:vir:78 284 MSDTATLTDSGSATVSMWQTNQIAILIEVTFGWLLGDKQAFVKFVDDEQ 332 (333) T ss_pred EeccccccccccceeehhhcCcEEEEEEEEEccEEecccceEEEeccCC Confidence 544321 1222 35777889999999999998864444 No 66 >protein:vir:7771 Length: 330 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817605;genbank:gi:29566035;genbank:GeneID:1259229 Probab=99.56 E-value=1.5e-15 Score=101.86 Aligned_cols=297 Identities=14% Similarity=0.057 Sum_probs=168.0 Q ss_pred CCCCCccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEecc-CcceeeeecCCCcC Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVL-GRTRAAYLQAGQSL 79 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~i-G~~t~~~~~~g~~i 79 (343) ||...--....+-+ ++.-.+..+.+..++.+..++.++++++.+..+..+ ..+.+|+. +.+.+..+..|+.+ T Consensus 1 m~~~~~~a~~~~~t------~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~-~~~~~p~~~~~~~a~~v~Eg~~~ 73 (330) T protein:vir:77 1 MAGSTVPSTQVALT------GDFSAFLTPEQSQDYFAEIEKTSIVQRIARKVPMGP-TGISIPHWTGAVSASWTGEAERK 73 (330) T ss_pred Ccccccchhhcccc------CCCcceechhHHHHHHHHHHhccchhhhcceeeccC-CceEEEEEcCCcceeEecCCCcc Confidence 77664333322222 111223456677888899999999999988777554 45778876 56667777888888 Q ss_pred CCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccccC Q lcl|NC_011085. 80 DDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENIAGLG 159 (343) Q Consensus 80 ~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~g~~ 159 (343) +.+ +++..++++..-+. +.-..|.+-=-.++.+|+.+.+.++.++++++++|+.+|.- .....+......... 
T Consensus 74 ~~~--~~~f~~i~~~~~k~-~~~~~is~ell~ds~~~~~~~i~~~l~~ai~~~~~~~~l~G----~g~~~~~~g~~~~~~ 146 (330) T protein:vir:77 74 PIT--KGSFGKQELEPVKI-TTIFAESAEVVRLNPLNYLNTMRTKIAEAIALKFDAAAIHG----IDKPSAFKGYLAETT 146 (330) T ss_pred ccc--cceeeEEEEeEEEE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcc----cCCCCcccccccccc Confidence 754 35666666666443 23344544112235689999999999999999999988721 111111111111111 Q ss_pred CceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhccccccc-----hhc Q lcl|NC_011085. 160 SASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAALID-----PER 234 (343) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~-----~~~ 234 (343) ....... +...+.......+++.|.++...+..++.+ ....+++|..|..|.+-..- +..+.-... ... T Consensus 147 ~~~~~~~---~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~--~~~~vmn~~~~~~l~~lkd~-~G~~l~~~~~~~~~~~~ 220 (330) T protein:vir:77 147 KVVSLAD---TNLTTASGPQGNAYLAVNNALSLLVNSGKK--WTGTLLDNVTEPILNTAVDG-NGRPLFVESTYTEQVGA 220 (330) T ss_pred ccceeec---ccccccccccchhHHHHHHHHHhhhhcCCC--ccEEEEcHHHHHHHHHHhcc-CCceeecCccccccccc Confidence 1111100 011111111223456677777777777654 33578999999988753211 112211111 122 Q ss_pred ceeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEeeeeccc- Q lcl|NC_011085. 235 GSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERARRAE- 313 (343) Q Consensus 235 G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~~~~- 313 (343) ..-++++|++|+.++++|...... +.+.++...+.+..+..+.++++..++.. T Consensus 221 ~~~~~l~G~PV~~~~~~p~~~~~~--------------------------~~~~~~gd~s~~~i~~~~~~~i~~~~e~~~ 274 (330) T protein:vir:77 221 IREGRILGRPTYVADNVVNGTVGN--------------------------RVVGVMGDFSQVIWGQIGGLSFDVTDQATL 274 (330) T ss_pred cCCceecceeeEEeccccCCCCCC--------------------------ccEEEEEecceEEEEEecCcEEEEeeccee Confidence 234689999999999999532211 11223333333333444445555444321 Q ss_pred -----------------hh--hhhhhhhhhhccceecccceEEEEecC-C Q lcl|NC_011085. 
314 -----------------YQ--ADQIIARYAMGHGGLRPEAAGALVFTA-G 343 (343) Q Consensus 314 -----------------~~--~d~i~~~~~~G~~v~rpe~~~~i~~~~-g 343 (343) +. ...++..+++|.++++|++.+.|+... | T Consensus 275 ~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~i~~~~~~ 324 (330) T protein:vir:77 275 DFGEEQGGVWVPKLISLWQHNMVAVRCEAEFAFMVNDKDAFVKLTDQVAG 324 (330) T ss_pred eecccccccccccccchhhcCcEEEEEEEEeccEEecccceEEEEeccCC Confidence 11 244677889999999999988886554 4 No 67 >protein:vir:94142 Length: 304 # NCBI annotation: ORF013 # Family: family:all:507 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240234;genbank:gi:66395898;genbank:GeneID:5133311 Probab=99.55 E-value=1.8e-15 Score=101.40 Aligned_cols=285 Identities=13% Similarity=0.042 Sum_probs=168.5 Q ss_pred CCCCCccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEecc-CcceeeeecCCCcC Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVL-GRTRAAYLQAGQSL 79 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~i-G~~t~~~~~~g~~i 79 (343) ||-..-.....+.+..+ -.+..+.+..++.+..++.+.++.++++..+. +.+++||+. +...+.-+..++.+ T Consensus 1 ma~~~~~~~~~~~t~~g------g~lip~~~~~~ii~~~~~~~~l~~~~~~~~~~-~~~~~ip~~~~~~~a~~v~E~~~~ 73 (304) T protein:vir:94 1 MATPTYTPGNVILSDFK------NGVIPAEQGTLIMKDIMANSAIMKLAKNEPMT-AQKKKFTYLAKGVGAYWVSETERI 73 (304) T ss_pred CcccccccccccccCCC------ceecchhHHHHHHHHHHhccchhhhcceeecc-CCceEEEEEeCCcceEEeecCccc Confidence 88776444433332211 13577899999999999999999888877764 456788887 55567777778777 Q ss_pred CCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccccC Q lcl|NC_011085. 80 DDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENIAGLG 159 (343) Q Consensus 80 ~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~g~~ 159 (343) +.+ +++.+++++.+.+.- .-+.|.+-=..++.+|+.+.+.++.++++++.+|+.++.- .....+... .. 
T Consensus 74 ~~~--~~~~~~i~~~~~k~~-~~~~iS~ell~ds~~~l~~~i~~~l~~~ia~~~d~~~l~G----~g~~~~~~~----~~ 142 (304) T protein:vir:94 74 QTS--KPEYAQAEMEAKKIG-VIIPLSKEFLKWTAKDFFNEVKPLIAEAFYKAFDQAVIFG----TKSPYNTST----SG 142 (304) T ss_pred ccc--cceeeEEEEEEEEEE-EeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhheec----cCCCccccc----cc Confidence 653 466777777665543 3345544222235689999999999999999999988632 111111110 00 Q ss_pred CceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhccccccchhcceeEE Q lcl|NC_011085. 160 SASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAALIDPERGSIRN 239 (343) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~V~~ 239 (343) .+.. ..... ..++ .......++.|.++...+...+... ..++++|..|..|.+-. +..|.- +-.+..++ T Consensus 143 ~~~~-~~~~~-~~~~-~~~~~~~~~~i~~~~~~l~~~~~~~--~~~v~~~~~~~~L~~lk-----d~~G~~-l~~~~~~~ 211 (304) T protein:vir:94 143 KPLV-EGAEE-KGNV-VTDTNNLYVDLSALMATIEDEELDP--NGVLTTRSFRSKMRNAL-----DANDRP-LFDANGNE 211 (304) T ss_pred cccc-ccccc-cccc-cccccchHHHHHHHHHHhhhccCCc--CEEEEcHHHHHHHHHhh-----ccCCcE-eecCCCcc Confidence 1111 00000 0000 0111223566777777777776543 35789999999987532 222221 12333578 Q ss_pred EeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEeeeeccc------ Q lcl|NC_011085. 240 VMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERARRAE------ 313 (343) Q Consensus 240 i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~~~~------ 313 (343) ++|.+|+.++++|....... -+-++|++. ..+...+++++..++.. T Consensus 212 l~G~PV~~~~~~~~~~~~~~------------------~~~gd~~~~----------~~~~~~~~~i~~~~e~~~~~~~~ 263 (304) T protein:vir:94 212 IMGLPLSYTGADVYDKKKSL------------------ALMGDWDYA----------RYGILQGIEYAISEDATLTTLQA 263 (304) T ss_pred ccceeeEEecccccCCCCcE------------------EEEEehhhE----------EEEEecceEEEEeecceeeeecc Confidence 99999999999985321100 011233322 22223333444333321 Q ss_pred ----------hhh--hhhhhhhhhccceecccceEEEEecC Q lcl|NC_011085. 
314 ----------YQA--DQIIARYAMGHGGLRPEAAGALVFTA 342 (343) Q Consensus 314 ----------~~~--d~i~~~~~~G~~v~rpe~~~~i~~~~ 342 (343) +.. ..++..+++|..+++|++.+.|+... T Consensus 264 ~~~~g~~~~~f~~~~~~~r~~~r~~~~v~~~~a~~~l~~a~ 304 (304) T protein:vir:94 264 SDASGQPVSLFERDMFALRATMHIAYMNVKPEAFATLKPTE 304 (304) T ss_pred cccCccchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 121 34567788999999999999999999 No 68 >protein:vir:105905 Length: 304 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004375;genbank:gi:122891830;genbank:GeneID:4712376 Probab=99.55 E-value=1.8e-15 Score=101.40 Aligned_cols=285 Identities=13% Similarity=0.042 Sum_probs=168.5 Q ss_pred CCCCCccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEecc-CcceeeeecCCCcC Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVL-GRTRAAYLQAGQSL 79 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~i-G~~t~~~~~~g~~i 79 (343) ||-..-.....+.+..+ -.+..+.+..++.+..++.+.++.++++..+. +.+++||+. +...+.-+..++.+ T Consensus 1 ma~~~~~~~~~~~t~~g------g~lip~~~~~~ii~~~~~~~~l~~~~~~~~~~-~~~~~ip~~~~~~~a~~v~E~~~~ 73 (304) T protein:vir:10 1 MATPTYTPGNVILSDFK------NGVIPAEQGTLIMKDIMANSAIMKLAKNEPMT-AQKKKFTYLAKGVGAYWVSETERI 73 (304) T ss_pred CcccccccccccccCCC------ceecchhHHHHHHHHHHhccchhhhcceeecc-CCceEEEEEeCCcceEEeecCccc Confidence 88776444433332211 13577899999999999999999888877764 456788887 55567777778777 Q ss_pred CCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccccC Q lcl|NC_011085. 80 DDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENIAGLG 159 (343) Q Consensus 80 ~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~g~~ 159 (343) +.+ +++.+++++.+.+.- .-+.|.+-=..++.+|+.+.+.++.++++++.+|+.++.- .....+... .. 
T Consensus 74 ~~~--~~~~~~i~~~~~k~~-~~~~iS~ell~ds~~~l~~~i~~~l~~~ia~~~d~~~l~G----~g~~~~~~~----~~ 142 (304) T protein:vir:10 74 QTS--KPEYAQAEMEAKKIG-VIIPLSKEFLKWTAKDFFNEVKPLIAEAFYKAFDQAVIFG----TKSPYNTST----SG 142 (304) T ss_pred ccc--cceeeEEEEEEEEEE-EeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhheec----cCCCccccc----cc Confidence 653 466777777665543 3345544222235689999999999999999999988632 111111110 00 Q ss_pred CceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhccccccchhcceeEE Q lcl|NC_011085. 160 SASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAALIDPERGSIRN 239 (343) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~V~~ 239 (343) .+.. ..... ..++ .......++.|.++...+...+... ..++++|..|..|.+-. +..|.- +-.+..++ T Consensus 143 ~~~~-~~~~~-~~~~-~~~~~~~~~~i~~~~~~l~~~~~~~--~~~v~~~~~~~~L~~lk-----d~~G~~-l~~~~~~~ 211 (304) T protein:vir:10 143 KPLV-EGAEE-KGNV-VTDTNNLYVDLSALMATIEDEELDP--NGVLTTRSFRSKMRNAL-----DANDRP-LFDANGNE 211 (304) T ss_pred cccc-ccccc-cccc-cccccchHHHHHHHHHHhhhccCCc--CEEEEcHHHHHHHHHhh-----ccCCcE-eecCCCcc Confidence 1111 00000 0000 0111223566777777777776543 35789999999987532 222221 12333578 Q ss_pred EeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEeeeeccc------ Q lcl|NC_011085. 240 VMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERARRAE------ 313 (343) Q Consensus 240 i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~~~~------ 313 (343) ++|.+|+.++++|....... -+-++|++. ..+...+++++..++.. T Consensus 212 l~G~PV~~~~~~~~~~~~~~------------------~~~gd~~~~----------~~~~~~~~~i~~~~e~~~~~~~~ 263 (304) T protein:vir:10 212 IMGLPLSYTGADVYDKKKSL------------------ALMGDWDYA----------RYGILQGIEYAISEDATLTTLQA 263 (304) T ss_pred ccceeeEEecccccCCCCcE------------------EEEEehhhE----------EEEEecceEEEEeecceeeeecc Confidence 99999999999985321100 011233322 22223333444333321 Q ss_pred ----------hhh--hhhhhhhhhccceecccceEEEEecC Q lcl|NC_011085. 
314 ----------YQA--DQIIARYAMGHGGLRPEAAGALVFTA 342 (343) Q Consensus 314 ----------~~~--d~i~~~~~~G~~v~rpe~~~~i~~~~ 342 (343) +.. ..++..+++|..+++|++.+.|+... T Consensus 264 ~~~~g~~~~~f~~~~~~~r~~~r~~~~v~~~~a~~~l~~a~ 304 (304) T protein:vir:10 264 SDASGQPVSLFERDMFALRATMHIAYMNVKPEAFATLKPTE 304 (304) T ss_pred cccCccchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 121 34567788999999999999999999 No 69 >protein:vir:96223 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239571;genbank:gi:66395304;genbank:GeneID:5132771 Probab=99.49 E-value=5.7e-15 Score=98.68 Aligned_cols=284 Identities=11% Similarity=0.015 Sum_probs=165.6 Q ss_pred CCCCCccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEecc-CcceeeeecCCCcC Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVL-GRTRAAYLQAGQSL 79 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~i-G~~t~~~~~~g~~i 79 (343) +.+...++. .+........+...|..+.+..++.+..+..|.++++.+..++. |.+++||+. +.+.+.-+.+|+.+ T Consensus 15 ~~~~~~~~~--~~a~~~~~~~~~~~lip~~~~~~ii~~~~~~s~l~~l~~~~~~~-~~~~~~p~~~~~~~a~~v~Eg~~~ 91 (324) T protein:vir:96 15 ASNNVKPQV--FNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPME-GTEKKFTFWADKPGAYWVGEGQKI 91 (324) T ss_pred HHhhhhhhh--cccccccccCCCcceechhHHHHHHHHHHhhchhhhhcceeecc-CCceEEEEEecCcceeeecCCccc Confidence 111111111 11111111122223566899999999999999999998887765 456888886 55666777788887 Q ss_pred CCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccccC Q lcl|NC_011085. 80 DDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENIAGLG 159 (343) Q Consensus 80 ~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~g~~ 159 (343) +.+ +++.+++++..-+.. .-..|.+-=-.++..|+.+.+.++.++++++++|+.+|.--. +. ..+.+.. 
T Consensus 92 ~~~--~~~f~~v~~~~~k~~-~~~~is~ell~ds~~~l~~~i~~~l~~aia~~~d~~~l~G~g-----~~---~~~~~~~ 160 (324) T protein:vir:96 92 ETS--KATWVNATMRAFKLG-VILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQG-----NN---PFGKSIA 160 (324) T ss_pred ccc--ccceeEEEEEeEEEE-EeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhhcCC-----CC---CcCcccc Confidence 653 466777777665543 334555421223468899999999999999999998873211 00 0111111 Q ss_pred CceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhccccccchhcceeEE Q lcl|NC_011085. 160 SASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAALIDPERGSIRN 239 (343) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~V~~ 239 (343) ... ......... ...++.|.++...+...+... -.++++|..+..|..-. +-.|...+..+..++ T Consensus 161 ~~~----~~~~~~~~~----~~~~~~i~~~~~~i~~~~~~~--~~~i~n~~~~~~L~~lk-----d~~G~~~~~~~~~~~ 225 (324) T protein:vir:96 161 QSI----KKTNKVIKG----DFTQDNIIDLEALLEDDELEA--NAFISKTQNRSLLRKIV-----DPETKERIYDRNSDS 225 (324) T ss_pred ccc----cccceeccc----ccchHHHHHHHHhhhhccCCC--CEEEEcHHHHHHHHHhh-----CCCCCeeecCCCCCc Confidence 000 000000001 112455556666676666532 35789999999887532 223334455667788 Q ss_pred EeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEeeeeccc------ Q lcl|NC_011085. 240 VMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERARRAE------ 313 (343) Q Consensus 240 i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~~~~------ 313 (343) ++|++|+.++..+.... ..++.+.+.+..+...+++++..++.. T Consensus 226 l~G~PV~~~~~~~~~~~------------------------------~~~~gd~s~~~~~~~~~~~i~~~~~~~~~~~~~ 275 (324) T protein:vir:96 226 LDGLPVVNLKSSNLKRG------------------------------ELITGDFDKLIYGIPQLIEYKIDETAQLSTVKN 275 (324) T ss_pred ccceeeEeecCCCCCcc------------------------------eEEEEecceEEEEEecCcEEEEeeccccccccc Confidence 99999998776542110 012222222223344455555554431 Q ss_pred --------hhh--hhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 
314 --------YQA--DQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 314 --------~~~--d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) +.. -.++..+++|.++++|++.+.|+...+ T Consensus 276 ~~~~~~~~~~~n~v~~r~~~r~d~~v~~~~a~~~l~~a~~ 315 (324) T protein:vir:96 276 EDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADK 315 (324) T ss_pred ccccchhhhhcCcEEEEEEEEeccEEecccceEEEecccc Confidence 122 346777889999999999999987666 No 70 >protein:vir:9309 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803287;genbank:gi:29028597;genbank:GeneID:1258044 Probab=99.49 E-value=1.1e-14 Score=97.21 Aligned_cols=278 Identities=10% Similarity=0.028 Sum_probs=166.5 Q ss_pred CCCCCccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEecc-CcceeeeecCCCcC Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVL-GRTRAAYLQAGQSL 79 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~i-G~~t~~~~~~g~~i 79 (343) +.... +++. + ...+...|..+.|..++.+..++.|.++.+.++.++.+ ..++||+. +.+.+.-+..|+.+ T Consensus 21 ~~~~~-a~~~-~------~~~~~~~liP~~~~~~ii~~~~~~s~l~~l~~~~~~~~-~~~~ip~~~~~~~a~~v~Eg~~~ 91 (324) T protein:vir:93 21 PQVFN-PDNV-M------MHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEG-TEKKFTFWADKPGAYWVGEGQKI 91 (324) T ss_pred hhhcc-cccc-c------ccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccC-CceEEEEEecCcceeeecCCccc Confidence 22221 1111 1 11111235678999999999999999999988777654 55778776 66677777888888 Q ss_pred CCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccccC Q lcl|NC_011085. 80 DDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENIAGLG 159 (343) Q Consensus 80 ~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~g~~ 159 (343) +.+ .++.+++++..-+. +.-+.|.+-=-.++.+|+.+.+.++.++++++.+|+.+|.-- ... ..+.+.. 
T Consensus 92 ~~~--~~~f~~i~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l~G~----g~~----~~~~~~~ 160 (324) T protein:vir:93 92 ETS--KATWVNATMRAFKL-GVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQ----GNN----PFGKSIA 160 (324) T ss_pred ccc--ccceeEEEEEeEEE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhcCC----CCC----CcCcccc Confidence 764 35677777766544 233556542222356899999999999999999999887321 100 0011110 Q ss_pred CceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhccccccchhcceeEE Q lcl|NC_011085. 160 SASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAALIDPERGSIRN 239 (343) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~V~~ 239 (343) .. ......... +...++.|.++...|..++... ..++++|..|..|.+- .+-.|...+..+..++ T Consensus 161 ~~----~~~~~~~~~----~~~~~~~i~~~~~~l~~~~~~~--~~~v~n~~~~~~L~~l-----~d~~G~~~~~~~~~~~ 225 (324) T protein:vir:93 161 QS----IEKTNKVIK----GDFTQDNIIDLEALLEDDELEA--NAFISKTQNRSLLRKI-----VDPETKERIYDRNSDS 225 (324) T ss_pred cc----ccccceecc----ccccHHHHHHHHHhhhhccCCC--CEEEEcHHHHHHHHHh-----hCCCCCeeecCCCCCc Confidence 00 000000000 1112556666777777776532 3688999999998753 2233444555666788 Q ss_pred EeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEeeeeccc------ Q lcl|NC_011085. 240 VMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERARRAE------ 313 (343) Q Consensus 240 i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~~~~------ 313 (343) ++|.+|+.+++.+.... ..++...+-+..+..++++++..++.. T Consensus 226 l~G~PVv~~~~~~~~~~------------------------------~i~~gdfs~~~~~~~~~~~i~~~~~~~~~~~~~ 275 (324) T protein:vir:93 226 LDGLPVVNLKSSNLKRG------------------------------ELITGDFDKLIYGIPQLIEYKIDETAQLSTVKN 275 (324) T ss_pred ccceeeEeecCCCCCcc------------------------------eEEEEecceEEEEEecCcEEEEeeccccccccc Confidence 99999998776542110 012222222223344555666555431 Q ss_pred --------hh--hhhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 
314 --------YQ--ADQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 314 --------~~--~d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) ++ .-.++..+++|.++++|++.+.|+.... T Consensus 276 ~~~~~~~~f~~n~~~~r~~~r~d~~v~~~~a~~~l~~a~~ 315 (324) T protein:vir:93 276 EDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADK 315 (324) T ss_pred ccccchhhhhcCcEEEEEEEEeccEEecccceEEEecccc Confidence 11 2466778889999999999999986555 No 71 >protein:vir:97053 Length: 390 # NCBI annotation: putative head protein # Family: family:all:585 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453565;genbank:gi:84662600;genbank:GeneID:5142468 Probab=99.49 E-value=3.7e-15 Score=99.70 Aligned_cols=287 Identities=16% Similarity=0.091 Sum_probs=169.5 Q ss_pred CCCCCccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEeccC--cceeeeecCCCc Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVLG--RTRAAYLQAGQS 78 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG--~~t~~~~~~g~~ 78 (343) .+....-.....+.+....+++.-.|..+.+...+.+..+..+.++++++..++.+ .++.+|+.. ...+..+..|+. T Consensus 99 ~~~~~~~~~~~~~~~~~~~~~~~g~lip~~~~~~ii~~~~~~~~i~~~~~~~~~~~-~~~~~~~~~~~~~~a~~v~Eg~~ 177 (390) T protein:vir:97 99 SARATMNIKAALNTASTDAAGSAGALTTPNRLPGFITPPDARLTVRDLIGSGRTDS-ALIEYVQETGFVNNAAIVAEGAL 177 (390) T ss_pred hhhhhhHHHHHHHhhhcccccccccccchhhhHHHHHHHhhhhhhHhhcceeeccC-CceEEEEEecCCcceeeecCCcc Confidence 00000000000111111222222335668889999999999999999888777654 467777763 345667777887 Q ss_pred CCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccccc Q lcl|NC_011085. 79 LDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENIAGL 158 (343) Q Consensus 79 i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~g~ 158 (343) ++.+ +++..++++.+.+.. .-..|.+ +-.+.+.++.+.+.++.++++++++|+.+|.- ... ...+.|. 
T Consensus 178 ~~~~--~~~~~~i~~~~~k~~-~~~~is~-ell~ds~~l~~~i~~~la~a~~~~~d~a~l~G----~g~----~~~p~Gi 245 (390) T protein:vir:97 178 KPES--SLKFAKKTDTTHVIA-HTMKATR-QILSDAPQLASYMNNRLIRGLKVKEDAEILRG----TGA----NDGLLGL 245 (390) T ss_pred cccc--ccceeEEEEeeeeEE-EeehhhH-HHHHhHHHHHHHHHHHHHHHHHHHHHHHHhhc----CCC----Cccccce Confidence 7653 456777888777653 3345554 23334567889999999999999999988731 100 1112221 Q ss_pred CCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhccccccchhcceeE Q lcl|NC_011085. 159 GSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAALIDPERGSIR 238 (343) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~V~ 238 (343) - ........ .....+...++.+.++...+.....+.. .+|++|..|..|.+-.. .+..|.-.. ...|..+ T Consensus 246 ~-----~~~~~~~~-~~~~~~~~~~d~~~~~~~~~~~~~~~~~--~~v~n~~~~~~L~~lkd-~~G~~l~~~-~~~~~~~ 315 (390) T protein:vir:97 246 I-----PQATTYAA-PTTIAGATRVDQLRLAMLQASLAEYPAS--GIVINPIDWAAIELAKD-ANNQYLIGN-ARGTLTP 315 (390) T ss_pred e-----eccccccc-cccccccchHHHHHHHHHhhccccCCCC--EEEEcHHHHHHHHHhhc-CCCceeecC-ccCCCCc Confidence 1 11110000 0011122335666777778888877544 56789999998875332 122222111 2345567 Q ss_pred EEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEeeeecc-chhhh Q lcl|NC_011085. 239 NVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERARRA-EYQAD 317 (343) Q Consensus 239 ~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~~~-~~~~d 317 (343) +++|.+|+.|+.+|.+.. +-++|.. +...+..++++++..++. .+..+ T Consensus 316 ~l~G~pV~~~~~~~~~~~----------------------~~gd~~~---------~~~~~~~~~~~i~~~~~~~~f~~~ 364 (390) T protein:vir:97 316 TLWGLPVVATQAMAPGEF----------------------LVGAFDL---------AAQIFDQWDARVEIGYVNDDFQRN 364 (390) T ss_pred eecceeeEEcCCCCCCcE----------------------EEEeccc---------eEEEEEecceEEEEeecccccccC Confidence 899999999999984210 0122221 233344566677777654 34445 Q ss_pred h--hhhhhhhccceecccceEEEEec Q lcl|NC_011085. 
318 Q--IIARYAMGHGGLRPEAAGALVFT 341 (343) Q Consensus 318 ~--i~~~~~~G~~v~rpe~~~~i~~~ 341 (343) . ++..++||.++++|++.+.+.+. T Consensus 365 ~~~~r~~~r~d~~v~~~~a~v~~~~a 390 (390) T protein:vir:97 365 MVTVLAEERLALVVYRPEALITGSFA 390 (390) T ss_pred cEEEEEEEeeccEEeccccEEEEEeC Confidence 4 66777899999999999999999 No 72 >protein:vir:4511 Length: 409 # NCBI annotation: capsid # Family: family:all:21 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599037;genbank:gi:19548995;genbank:GeneID:935211 Probab=99.49 E-value=2.8e-15 Score=100.36 Aligned_cols=297 Identities=11% Similarity=0.037 Sum_probs=157.5 Q ss_pred CCCCCccccccc-------ccc-ccccccch--hHHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEeccCcce- Q lcl|NC_011085. 1 MADMKGGQQLGK-------DQG-KGQSGGDK--LALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVLGRTR- 69 (343) Q Consensus 1 ~~~~~~~~~~~t-------~~g-~~~~~~d~--~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG~~t- 69 (343) +.-+..+.+..+ +.- ..+.+.+. -.|..+.|..++.+..+..+.++++.++.++.++..+.++..+... T Consensus 93 ~~~l~~~~~~~~~~e~~~~~~~~a~~~~~~~~gg~liP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~ 172 (409) T protein:vir:45 93 DKWMRHGASELTSEERKALRELRAQGVAQDEKGGYTVPETFLAKVVEKMKSYGGIASVAQILTTSDGRTMEWATADGTSE 172 (409) T ss_pred HHHHHhhhhhccHHHHHHHHHHhhccCccCcCCceeccHhHHHHHHHHHHhhhhhhhhceeeecCCCceEEEEeeccCcc Confidence 000100000000 000 00001111 1234589999999999999999999888888888888888775432 Q ss_pred -eeeecCCCcCCCccCCCccceEEEEeeeeeeee--eeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhh Q lcl|NC_011085. 70 -AAYLQAGQSLDDKRKDIKHTEKTIVIDGLLTAD--VLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCN 146 (343) Q Consensus 70 -~~~~~~g~~i~~~~~~~~~~~~~l~iD~~~~~~--~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~ 146 (343) ......|..++.+ +++...+++ ...+... +.|.+-=-.++.+|+.+.+.++.+++++++.|+.|+.-- ++. 
T Consensus 173 ~~~~v~E~~~~~~~--~~~f~~~~l--~~~k~~~~~i~is~ell~ds~~~l~~~i~~~la~a~~~~~~~a~l~G~--G~~ 246 (409) T protein:vir:45 173 VGVLLGENEEAGEE--DTDFGMGSL--GALKMTSKIIRVSNELLQDSAIDMEAYLARRIAERIGRGEARYLIQGT--GAG 246 (409) T ss_pred cccccccccccccc--ccccceeee--eeeeeeeeehhhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHhhccC--CCC Confidence 2344445555443 344444444 4444432 235442222356899999999999999999999887310 010 Q ss_pred ccccccccccccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcE-EEeCHHHHHHHhccchhhhhc Q lcl|NC_011085. 147 MPAASNENIAGLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRT-FYTTPEVYSAILAALMPNAAN 225 (343) Q Consensus 147 ~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~-~vv~P~~~~~Ll~~~~~~~~~ 225 (343) . ...+.|........ ..+. .+.... ++.|.++...|..... ....| ++++|..|..|.+-.. .+.. T Consensus 247 ~----~~~p~Gil~~~~~~--~~~~-~~~~~~----~d~i~~l~~~l~~~~~-~~a~~~~~~n~~~~~~l~~lkd-~~G~ 313 (409) T protein:vir:45 247 T----PKQPKGLAASVTGT--TQTA-AANAVK----WQEILALKHSIDPAYR-RGPKFRLAFNDNTLKLISEMED-GQGR 313 (409) T ss_pred C----ccccceeeeccccc--cccc-cccccc----hHHHHHHHHhhhhhhc-cCCeEEEEECHHHHHHHHHhhc-CCCc Confidence 0 01111211100000 0000 010111 3445555556655543 33456 4679999888754221 1223 Q ss_pred cccccchhcceeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeE Q lcl|NC_011085. 226 YAALIDPERGSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLS 305 (343) Q Consensus 226 ~~~~~~~~~G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~ 305 (343) |.-...+..|...+++|.+|+.++++|..+.+.... +-++|.. ........+. T Consensus 314 ~i~~~~~~~~~~~~l~G~PV~~~~~~p~~~~~~~~i-----------------~~Gd~~~----------~~i~~~~~~~ 366 (409) T protein:vir:45 314 PLWLPDIVGVAPASVLNVPYVIDQEIDDIGAGKKFM-----------------FCGDFDR----------FIIRRVRYMI 366 (409) T ss_pred eeeccCcCCCCCceecceeeEEecCcCCccCCccEE-----------------EEeehhh----------hheeeccceE Confidence 332344566777899999999999999532211100 0012221 1112223344 Q ss_pred Eeeeeccchhhh--hhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 
306 LERARRAEYQAD--QIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 306 ~e~~~~~~~~~d--~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) ++...|+-..-+ .+++.++||.++++|++.+.++.+.. T Consensus 367 ~~~~~d~~~~~~~~~~~~~~r~d~~~~~~~A~~~l~~k~s 406 (409) T protein:vir:45 367 LKRLVERYAEYDQTGFLAFHRFDCILEDTSAIKALVGKGS 406 (409) T ss_pred EEEeecccccCCcEEEEEEEEeccEeechhheEEEEeccC Confidence 555555432223 37888899999999999998877555 No 73 >protein:vir:98339 Length: 415 # NCBI annotation: putative capsid protein # Family: family:all:21 # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918931;genbank:gi:119443693;genbank:GeneID:4594501 Probab=99.48 E-value=9.2e-15 Score=97.54 Aligned_cols=291 Identities=8% Similarity=0.033 Sum_probs=166.0 Q ss_pred CCC-CCcccccccccccc-ccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccce-EEEEec-cCcceeeeecCC Q lcl|NC_011085. 1 MAD-MKGGQQLGKDQGKG-QSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGK-SAQFPV-LGRTRAAYLQAG 76 (343) Q Consensus 1 ~~~-~~~~~~~~t~~g~~-~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~-tv~i~~-iG~~t~~~~~~g 76 (343) +.+ ...+... +.+.. ...| -.+..+.|..++.+..+..+.+++++++.++.++. ++.++. .+...+.....| T Consensus 109 ~~~~~~~~~~~--~~~~~~~~~g--g~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~ 184 (415) T protein:vir:98 109 FTEYLETRNDI--QGGSLKTDSG--FVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEEL 184 (415) T ss_pred HHHHHhhhhhh--hhcccccccc--ccccchHHHHHHHHHHHhhhhhhhheeeeeccCCceeEEEEeecCCccceeeccc Confidence 000 0000000 00000 0011 12455899999999999999999999888776432 344443 455566666667 Q ss_pred CcCCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccc Q lcl|NC_011085. 77 QSLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENIA 156 (343) Q Consensus 77 ~~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~ 156 (343) ..++.+. .++.+++++.+.+.- .-+.|.+-=-.++.+|+.+.+.++.++++++..|+.++.....+. +... 
T Consensus 185 ~~~~~~~-~~~~~~v~~~~~k~~-~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~----~~~~--- 255 (415) T protein:vir:98 185 EENPELA-VKPFFQLAYDINTHR-GYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGS----TGST--- 255 (415) T ss_pred cccCccc-ccceeeEEeeeeeeE-eeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCc----cccc--- Confidence 7665422 234566666665443 224454322234578899999999999999999998875432111 0000 Q ss_pred ccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhccccccchhcce Q lcl|NC_011085. 157 GLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAALIDPERGS 236 (343) Q Consensus 157 g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~ 236 (343) +...... .. ....+.... ++.|.++...+...+.. .-.+|++|..|..|..-.. .+..|.-...+.+|. T Consensus 256 ~~~~~~~---~~-~~~~~~~~~----~~~i~~~~~~~~~~~~~--~~~~v~n~~~~~~l~~lkd-~~G~~l~~~~~~~~~ 324 (415) T protein:vir:98 256 SSGFEKE---GK-KLEVKKAKS----LDDIKDAINLNVKPNYE--HNVAIVSQTMFAKLDKMKD-KLGNYLIQPDVKEKT 324 (415) T ss_pred ccccccc---cc-ccccccccc----hhHHHHHHHhhhhhccC--CCEEEEcHHHHHHHHHhhc-cCCceeeccCcCCCC Confidence 0000000 00 001111122 44555556667666653 2256789999999875321 123343334566777 Q ss_pred eEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechh-hheeeeeeeeEEeeeeccchh Q lcl|NC_011085. 237 IRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRS-AVGTVKLKDLSLERARRAEYQ 315 (343) Q Consensus 237 V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-Av~~~~~~~~~~e~~~~~~~~ 315 (343) .++++|++|+.++++|..+.+.. +.++.+.+ ++.......++++..+. ..+ T Consensus 325 ~~~l~G~pV~~~~~~~~~~~~~~---------------------------~~~~Gd~~~~~~~~~~~~~~v~~~~~-~~~ 376 (415) T protein:vir:98 325 QQRLLGAKIEILPDEVLGQKGNN---------------------------TLIIGNLKDAIVLFDRSQYQASWTDY-MHF 376 (415) T ss_pred CceecceeeEEecccccCCCCcc---------------------------EEEEEehhccEEEEeecceEEEEecc-ccC Confidence 78999999999999985432211 11333322 33334445566665543 334 Q ss_pred hhhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 
316 ADQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 316 ~d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) ...+++.++++.++.+|++.+.++++.- T Consensus 377 ~~~~~~~~r~d~~v~~~~a~~~~~~~~~ 404 (415) T protein:vir:98 377 GECLMIAVRQDCRILDYKSAIVIEYDDS 404 (415) T ss_pred ceEEEEEEEeccEEeccccEEEEEEecc Confidence 4567888999999999999999988765 No 74 >protein:vir:81100 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429874;genbank:gi:156603927;genbank:GeneID:5525320 Probab=99.48 E-value=9.2e-15 Score=97.54 Aligned_cols=291 Identities=8% Similarity=0.033 Sum_probs=166.0 Q ss_pred CCC-CCcccccccccccc-ccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccce-EEEEec-cCcceeeeecCC Q lcl|NC_011085. 1 MAD-MKGGQQLGKDQGKG-QSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGK-SAQFPV-LGRTRAAYLQAG 76 (343) Q Consensus 1 ~~~-~~~~~~~~t~~g~~-~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~-tv~i~~-iG~~t~~~~~~g 76 (343) +.+ ...+... +.+.. ...| -.+..+.|..++.+..+..+.+++++++.++.++. ++.++. .+...+.....| T Consensus 109 ~~~~~~~~~~~--~~~~~~~~~g--g~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~ 184 (415) T protein:vir:81 109 FTEYLETRNDI--QGGSLKTDSG--FVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEEL 184 (415) T ss_pred HHHHHhhhhhh--hhcccccccc--ccccchHHHHHHHHHHHhhhhhhhheeeeeccCCceeEEEEeecCCccceeeccc Confidence 000 0000000 00000 0011 12455899999999999999999999888776432 344443 455566666667 Q ss_pred CcCCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccc Q lcl|NC_011085. 77 QSLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENIA 156 (343) Q Consensus 77 ~~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~ 156 (343) ..++.+. .++.+++++.+.+.- .-+.|.+-=-.++.+|+.+.+.++.++++++..|+.++.....+. +... 
T Consensus 185 ~~~~~~~-~~~~~~v~~~~~k~~-~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~----~~~~--- 255 (415) T protein:vir:81 185 EENPELA-VKPFFQLAYDINTHR-GYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGS----TGST--- 255 (415) T ss_pred cccCccc-ccceeeEEeeeeeeE-eeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCc----cccc--- Confidence 7665422 234566666665443 224454322234578899999999999999999998875432111 0000 Q ss_pred ccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhccccccchhcce Q lcl|NC_011085. 157 GLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAALIDPERGS 236 (343) Q Consensus 157 g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~ 236 (343) +...... .. ....+.... ++.|.++...+...+.. .-.+|++|..|..|..-.. .+..|.-...+.+|. T Consensus 256 ~~~~~~~---~~-~~~~~~~~~----~~~i~~~~~~~~~~~~~--~~~~v~n~~~~~~l~~lkd-~~G~~l~~~~~~~~~ 324 (415) T protein:vir:81 256 SSGFEKE---GK-KLEVKKAKS----LDDIKDAINLNVKPNYE--HNVAIVSQTMFAKLDKMKD-KLGNYLIQPDVKEKT 324 (415) T ss_pred ccccccc---cc-ccccccccc----hhHHHHHHHhhhhhccC--CCEEEEcHHHHHHHHHhhc-cCCceeeccCcCCCC Confidence 0000000 00 001111122 44555556667666653 2256789999999875321 123343334566777 Q ss_pred eEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechh-hheeeeeeeeEEeeeeccchh Q lcl|NC_011085. 237 IRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRS-AVGTVKLKDLSLERARRAEYQ 315 (343) Q Consensus 237 V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-Av~~~~~~~~~~e~~~~~~~~ 315 (343) .++++|++|+.++++|..+.+.. +.++.+.+ ++.......++++..+. ..+ T Consensus 325 ~~~l~G~pV~~~~~~~~~~~~~~---------------------------~~~~Gd~~~~~~~~~~~~~~v~~~~~-~~~ 376 (415) T protein:vir:81 325 QQRLLGAKIEILPDEVLGQKGNN---------------------------TLIIGNLKDAIVLFDRSQYQASWTDY-MHF 376 (415) T ss_pred CceecceeeEEecccccCCCCcc---------------------------EEEEEehhccEEEEeecceEEEEecc-ccC Confidence 78999999999999985432211 11333322 33334445566665543 334 Q ss_pred hhhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 
316 ADQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 316 ~d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) ...+++.++++.++.+|++.+.++++.- T Consensus 377 ~~~~~~~~r~d~~v~~~~a~~~~~~~~~ 404 (415) T protein:vir:81 377 GECLMIAVRQDCRILDYKSAIVIEYDDS 404 (415) T ss_pred ceEEEEEEEeccEEeccccEEEEEEecc Confidence 4567888999999999999999988765 No 75 >protein:vir:79987 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430002;genbank:gi:156604057;genbank:GeneID:5525447 Probab=99.48 E-value=9.2e-15 Score=97.54 Aligned_cols=291 Identities=8% Similarity=0.033 Sum_probs=166.0 Q ss_pred CCC-CCcccccccccccc-ccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccce-EEEEec-cCcceeeeecCC Q lcl|NC_011085. 1 MAD-MKGGQQLGKDQGKG-QSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGK-SAQFPV-LGRTRAAYLQAG 76 (343) Q Consensus 1 ~~~-~~~~~~~~t~~g~~-~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~-tv~i~~-iG~~t~~~~~~g 76 (343) +.+ ...+... +.+.. ...| -.+..+.|..++.+..+..+.+++++++.++.++. ++.++. .+...+.....| T Consensus 109 ~~~~~~~~~~~--~~~~~~~~~g--g~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~ 184 (415) T protein:vir:79 109 FTEYLETRNDI--QGGSLKTDSG--FVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEEL 184 (415) T ss_pred HHHHHhhhhhh--hhcccccccc--ccccchHHHHHHHHHHHhhhhhhhheeeeeccCCceeEEEEeecCCccceeeccc Confidence 000 0000000 00000 0011 12455899999999999999999999888776432 344443 455566666667 Q ss_pred CcCCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccc Q lcl|NC_011085. 77 QSLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENIA 156 (343) Q Consensus 77 ~~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~ 156 (343) ..++.+. .++.+++++.+.+.- .-+.|.+-=-.++.+|+.+.+.++.++++++..|+.++.....+. +... 
T Consensus 185 ~~~~~~~-~~~~~~v~~~~~k~~-~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~----~~~~--- 255 (415) T protein:vir:79 185 EENPELA-VKPFFQLAYDINTHR-GYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGS----TGST--- 255 (415) T ss_pred cccCccc-ccceeeEEeeeeeeE-eeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCc----cccc--- Confidence 7665422 234566666665443 224454322234578899999999999999999998875432111 0000 Q ss_pred ccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhccccccchhcce Q lcl|NC_011085. 157 GLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAALIDPERGS 236 (343) Q Consensus 157 g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~ 236 (343) +...... .. ....+.... ++.|.++...+...+.. .-.+|++|..|..|..-.. .+..|.-...+.+|. T Consensus 256 ~~~~~~~---~~-~~~~~~~~~----~~~i~~~~~~~~~~~~~--~~~~v~n~~~~~~l~~lkd-~~G~~l~~~~~~~~~ 324 (415) T protein:vir:79 256 SSGFEKE---GK-KLEVKKAKS----LDDIKDAINLNVKPNYE--HNVAIVSQTMFAKLDKMKD-KLGNYLIQPDVKEKT 324 (415) T ss_pred ccccccc---cc-ccccccccc----hhHHHHHHHhhhhhccC--CCEEEEcHHHHHHHHHhhc-cCCceeeccCcCCCC Confidence 0000000 00 001111122 44555556667666653 2256789999999875321 123343334566777 Q ss_pred eEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechh-hheeeeeeeeEEeeeeccchh Q lcl|NC_011085. 237 IRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRS-AVGTVKLKDLSLERARRAEYQ 315 (343) Q Consensus 237 V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-Av~~~~~~~~~~e~~~~~~~~ 315 (343) .++++|++|+.++++|..+.+.. +.++.+.+ ++.......++++..+. ..+ T Consensus 325 ~~~l~G~pV~~~~~~~~~~~~~~---------------------------~~~~Gd~~~~~~~~~~~~~~v~~~~~-~~~ 376 (415) T protein:vir:79 325 QQRLLGAKIEILPDEVLGQKGNN---------------------------TLIIGNLKDAIVLFDRSQYQASWTDY-MHF 376 (415) T ss_pred CceecceeeEEecccccCCCCcc---------------------------EEEEEehhccEEEEeecceEEEEecc-ccC Confidence 78999999999999985432211 11333322 33334445566665543 334 Q ss_pred hhhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 
316 ADQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 316 ~d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) ...+++.++++.++.+|++.+.++++.- T Consensus 377 ~~~~~~~~r~d~~v~~~~a~~~~~~~~~ 404 (415) T protein:vir:79 377 GECLMIAVRQDCRILDYKSAIVIEYDDS 404 (415) T ss_pred ceEEEEEEEeccEEeccccEEEEEEecc Confidence 4567888999999999999999988765 No 76 >protein:vir:78830 Length: 324 # NCBI annotation: major head protein # Family: family:all:507 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285361;genbank:gi:148717889;genbank:GeneID:5246961 Probab=99.48 E-value=1.1e-14 Score=97.10 Aligned_cols=284 Identities=11% Similarity=0.023 Sum_probs=164.9 Q ss_pred CCCCCccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEecc-CcceeeeecCCCcC Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVL-GRTRAAYLQAGQSL 79 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~i-G~~t~~~~~~g~~i 79 (343) +.+...... .+......+++.-.+..+.|..++.+..+..|.++++++..++. |.+++||+. +.+.+.-+..|+.+ T Consensus 15 ~~~~~~~~~--~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~l~~~~~~~-~~~~~~p~~~~~~~a~~v~Eg~~~ 91 (324) T protein:vir:78 15 ASNNVKPQV--FNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPME-GTEKKFTFWADKPGAYWVGEGQKI 91 (324) T ss_pred HHHhhhhhh--hccccccccCcCccccchhHHHHHHHHHHhhchhhhhcceeecc-CCceEEEEEecCcceeEecCCccc Confidence 111111111 00001111122223566889999999999999999998877764 556888886 56667777788888 Q ss_pred CCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccccC Q lcl|NC_011085. 80 DDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENIAGLG 159 (343) Q Consensus 80 ~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~g~~ 159 (343) +.+ +++.+++++..-+.. .-..|.+-=-.++.+|+.+.+.++.++++++++|+.+|.-- .... .+.+.. 
T Consensus 92 ~~~--~~~~~~v~~~~~k~~-~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~l~G~----g~~~----~~~gi~ 160 (324) T protein:vir:78 92 ETS--KATWVNATMRAFKLG-VILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQ----GNNP----FGKSIA 160 (324) T ss_pred ccc--ccceeEEEEeeEEEE-EeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhccC----CCCC----cCcccc Confidence 754 467777777765442 33455542122346899999999999999999999887321 1100 011110 Q ss_pred CceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhccccccchhcceeEE Q lcl|NC_011085. 160 SASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAALIDPERGSIRN 239 (343) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~V~~ 239 (343) .. .. ........ ...++.|.++...|..++... -.++++|..|..|.+-. +-.|...+..|..++ T Consensus 161 ~~--~~--~~~~~~~~----~~t~~~i~~~~~~l~~~~~~~--~~~vmn~~~~~~L~~l~-----d~~G~~~~~~~~~~~ 225 (324) T protein:vir:78 161 QS--IE--KTNKVIKG----DFTQDNIIDLEALLEDDELEA--NAFISKTQNRSLLRKIV-----DPETKERIYDRNSDS 225 (324) T ss_pred cc--cc--ccceeccc----cccHHHHHHHHHhhhhccCCC--CEEEEcHHHHHHHHHhh-----ccCCCeeecCCCCCc Confidence 00 00 00000011 112455666667777766532 35789999999886532 223334455677788 Q ss_pred EeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEeeeeccc------ Q lcl|NC_011085. 240 VMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERARRAE------ 313 (343) Q Consensus 240 i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~~~~------ 313 (343) ++|++|+.++..+.... ..++.+.+-+..+..+++++|..++.. T Consensus 226 l~G~PV~~~~~~~~~~~------------------------------~~~~gd~~~~~~g~~~~~~i~~~~~~~~~~~~~ 275 (324) T protein:vir:78 226 LDGLPVVNLKSSNLKRG------------------------------ELITGDFDKLIYGIPQLIEYKIDETAQLSTVKN 275 (324) T ss_pred ccceeeEeeCCCCCCcc------------------------------eEEEEecceEEEEEecCcEEEEeeccccccccc Confidence 99999998776542110 012222222223444455666555431 Q ss_pred --------hh--hhhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 
314 --------YQ--ADQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 314 --------~~--~d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) +. ...++..+++|.+++||++.+.|+.... T Consensus 276 ~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~A~~~l~~a~~ 315 (324) T protein:vir:78 276 EDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADK 315 (324) T ss_pred ccccchhhhhcCcEEEEEEEEEccEEecccceEEEecccc Confidence 11 2445677889999999999998886444 No 77 >protein:vir:96392 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239648;genbank:gi:66395381;genbank:GeneID:5132868 Probab=99.48 E-value=1.1e-14 Score=97.10 Aligned_cols=284 Identities=11% Similarity=0.023 Sum_probs=164.9 Q ss_pred CCCCCccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEecc-CcceeeeecCCCcC Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVL-GRTRAAYLQAGQSL 79 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~i-G~~t~~~~~~g~~i 79 (343) +.+...... .+......+++.-.+..+.|..++.+..+..|.++++++..++. |.+++||+. +.+.+.-+..|+.+ T Consensus 15 ~~~~~~~~~--~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~l~~~~~~~-~~~~~~p~~~~~~~a~~v~Eg~~~ 91 (324) T protein:vir:96 15 ASNNVKPQV--FNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPME-GTEKKFTFWADKPGAYWVGEGQKI 91 (324) T ss_pred HHHhhhhhh--hccccccccCcCccccchhHHHHHHHHHHhhchhhhhcceeecc-CCceEEEEEecCcceeEecCCccc Confidence 111111111 00001111122223566889999999999999999998877764 556888886 56667777788888 Q ss_pred CCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccccC Q lcl|NC_011085. 80 DDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENIAGLG 159 (343) Q Consensus 80 ~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~g~~ 159 (343) +.+ +++.+++++..-+.. .-..|.+-=-.++.+|+.+.+.++.++++++++|+.+|.-- .... .+.+.. 
T Consensus 92 ~~~--~~~~~~v~~~~~k~~-~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~l~G~----g~~~----~~~gi~ 160 (324) T protein:vir:96 92 ETS--KATWVNATMRAFKLG-VILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQ----GNNP----FGKSIA 160 (324) T ss_pred ccc--ccceeEEEEeeEEEE-EeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhccC----CCCC----cCcccc Confidence 754 467777777765442 33455542122346899999999999999999999887321 1100 011110 Q ss_pred CceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhccccccchhcceeEE Q lcl|NC_011085. 160 SASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAALIDPERGSIRN 239 (343) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~V~~ 239 (343) .. .. ........ ...++.|.++...|..++... -.++++|..|..|.+-. +-.|...+..|..++ T Consensus 161 ~~--~~--~~~~~~~~----~~t~~~i~~~~~~l~~~~~~~--~~~vmn~~~~~~L~~l~-----d~~G~~~~~~~~~~~ 225 (324) T protein:vir:96 161 QS--IE--KTNKVIKG----DFTQDNIIDLEALLEDDELEA--NAFISKTQNRSLLRKIV-----DPETKERIYDRNSDS 225 (324) T ss_pred cc--cc--ccceeccc----cccHHHHHHHHHhhhhccCCC--CEEEEcHHHHHHHHHhh-----ccCCCeeecCCCCCc Confidence 00 00 00000011 112455666667777766532 35789999999886532 223334455677788 Q ss_pred EeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEeeeeccc------ Q lcl|NC_011085. 240 VMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERARRAE------ 313 (343) Q Consensus 240 i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~~~~------ 313 (343) ++|++|+.++..+.... ..++.+.+-+..+..+++++|..++.. T Consensus 226 l~G~PV~~~~~~~~~~~------------------------------~~~~gd~~~~~~g~~~~~~i~~~~~~~~~~~~~ 275 (324) T protein:vir:96 226 LDGLPVVNLKSSNLKRG------------------------------ELITGDFDKLIYGIPQLIEYKIDETAQLSTVKN 275 (324) T ss_pred ccceeeEeeCCCCCCcc------------------------------eEEEEecceEEEEEecCcEEEEeeccccccccc Confidence 99999998776542110 012222222223444455666555431 Q ss_pred --------hh--hhhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 
314 --------YQ--ADQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 314 --------~~--~d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) +. ...++..+++|.+++||++.+.|+.... T Consensus 276 ~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~A~~~l~~a~~ 315 (324) T protein:vir:96 276 EDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADK 315 (324) T ss_pred ccccchhhhhcCcEEEEEEEEEccEEecccceEEEecccc Confidence 11 2445677889999999999998886444 No 78 >protein:vir:9574 Length: 300 # NCBI annotation: gp40 # Family: family:all:966 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862879;genbank:gi:32469471;genbank:GeneID:1461316 Probab=99.47 E-value=2.4e-14 Score=95.27 Aligned_cols=283 Identities=11% Similarity=0.028 Sum_probs=162.9 Q ss_pred CCCCCccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEecc-CcceeeeecCCCcC Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVL-GRTRAAYLQAGQSL 79 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~i-G~~t~~~~~~g~~i 79 (343) ||..+...- .|..+.++.++.+..++.|.++.+.+...+.+| .+.+|++ +.+.+.-+..|+.+ T Consensus 1 ma~~t~~~G---------------~lip~~~~~~ii~~l~~~s~i~~l~~~~~~~~~-~~~~p~~~~~~~a~wv~Eg~~~ 64 (300) T protein:vir:95 1 MSEAQLSKG---------------NLFNPELVTKVINKVKGHSSIAKLSPQKPIPFN-GQREFVFDFDSDIDIVAENGKK 64 (300) T ss_pred CcccccCCc---------------ceechhhHHHHHHHHHhhhhhhhhcceeeccCC-ceEEEEEecCcceEEeeCCccc Confidence 777643322 134578899999999999998888777766544 4667764 55667777778777 Q ss_pred CCccCCCccceEEEEeeeeeeeeeeccchHHHH-----hchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccc Q lcl|NC_011085. 80 DDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAM-----NHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNEN 154 (343) Q Consensus 80 ~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q-----~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~ 154 (343) +.+ +++.+++++..-+. +.-..|.+ |.. ...|+.+.+.++.+++++++.|+.++.-.. ........ 
T Consensus 65 ~~s--~~~f~~v~l~~~k~-~~~~~iS~--ell~~~~d~~~~l~~~i~~~l~~aia~~~d~~~l~G~~----~~~g~~~~ 135 (300) T protein:vir:95 65 THG--GVSLDPVTIVPLKV-EYGARVSD--EFLHASEEAKVDMLTDFVEGFSKKLARGLDIMSIHGIN----PRTKQAST 135 (300) T ss_pred ccc--cccceeeEeeeEEE-EEeehhhH--HHhccCCCCHHHHHHHHHHHHHHHHHHHHHHhhhhccc----CCCCCCcc Confidence 653 35666777665433 23334433 332 347889999999999999999999873211 00000000 Q ss_pred ccccC-CceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhccccccchh Q lcl|NC_011085. 155 IAGLG-SASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAALIDPE 233 (343) Q Consensus 155 ~~g~~-~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~ 233 (343) +.+.. ..........+ +. ...++.|.++...+...+.. ....+++|..+..|.+-..- +..+.-..... T Consensus 136 ~~~~~~~~~~~~~~~~~---~~----~~~~~~i~~~~~~~~~~~~~--~~~~vmn~~~~~~L~~lkd~-~G~~i~~~~~~ 205 (300) T protein:vir:95 136 IIGDNCFDKKVTQTVPF---KD----TNPDESMEDAVGMIDGSERD--ITGAILDPIFTTALSKMKNA-EGGKLYPELAW 205 (300) T ss_pred cccccccccccceeecc---cc----cchHHHHHHHHHHhhhcCCC--ccEEEECHHHHHHHHHhhcc-CCCeeccCccc Confidence 00000 00000000000 11 11245566666677766543 22578999999988653321 12222123344 Q ss_pred cceeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEeee--ec Q lcl|NC_011085. 234 RGSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERA--RR 311 (343) Q Consensus 234 ~G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~--~~ 311 (343) .|..++++|++|+.|+.+|......... -+-+||++.+-+ ...+.++++.. .+ T Consensus 206 ~~~~~~l~G~Pv~~s~~v~~~~~~~~~~----------------~~~GDf~~~~~~---------~~~~~~~~~v~~~~~ 260 (300) T protein:vir:95 206 GGVPDAINGLAVDKNRTVSYSQTDPKNT----------------AIVGDFETMFKW---------GYAKEVPMEIIKYGD 260 (300) T ss_pred cCCCceecceeeEEecCCCCCCCCCccE----------------EEEeeccceEEE---------EEecccEEEEeeccC Confidence 5677899999999999998543211100 011344333211 11222233322 12 Q ss_pred cc------hhh--hhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 
312 AE------YQA--DQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 312 ~~------~~~--d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) ++ ++. -.++..+++|.++++|++.+.|+-.+| T Consensus 261 ~d~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~g 300 (300) T protein:vir:95 261 PDNSGRDLKGYNQIYIRCEAYIGWGIMDAASFARIVKTGG 300 (300) T ss_pred CCCcchhhhhcCcEEEEEEEeecceeecccceEEEecCCC Confidence 21 222 345778889999999999999999999 No 79 >protein:vir:4700 Length: 415 # NCBI annotation: phi PVL ORF 7 homologue # Family: family:all:21 # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061632;genbank:gi:9635719;genbank:GeneID:1262976 Probab=99.47 E-value=3.8e-15 Score=99.66 Aligned_cols=292 Identities=7% Similarity=-0.002 Sum_probs=162.3 Q ss_pred CCCCCccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccce-EEEEec-cCcceeeeecCCCc Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGK-SAQFPV-LGRTRAAYLQAGQS 78 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~-tv~i~~-iG~~t~~~~~~g~~ 78 (343) +..+...... +. ....+++--.+..+.|.+++.+..+..+.+++++++.++.++. ++.++. .+...+.....|.. T Consensus 110 ~~~~~~~~~~--~~-~~~~t~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~Eg~~ 186 (415) T protein:vir:47 110 TEYLETRNDI--QG-GSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEE 186 (415) T ss_pred HHHHhhhhhh--hh-ccccccCCcccccHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEEecCCcceeecccccc Confidence 0000000000 00 0000111112455899999999999999999999888776543 333333 34455666667776 Q ss_pred CCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccccc Q lcl|NC_011085. 79 LDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENIAGL 158 (343) Q Consensus 79 i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~g~ 158 (343) ++.+. .++.+++++..-+.- .-+.|.+-=-.++.+|+.+.+.++.+++|++..|+.|+.-...+.. .. . 
T Consensus 187 ~~~~~-~~~~~~v~~~~~k~~-~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~----~~-----~ 255 (415) T protein:vir:47 187 NPELA-VKPFFQLAYDINTHR-GYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGST----GS-----T 255 (415) T ss_pred ccccc-ccceeeEEeeeeeeE-eeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCCc----cc-----c Confidence 65422 234555666554432 2244543222235688999999999999999999998743221110 00 0 Q ss_pred CCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhccccccchhcceeE Q lcl|NC_011085. 159 GSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAALIDPERGSIR 238 (343) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~V~ 238 (343) ...... .......+.... ++.|.++...+...... .=.+|++|..|..|..-.. .+..|.-...+.+|..+ T Consensus 256 ~~~~~~--~~~~~~~~~~~~----~~~i~~~~~~~~~~~~~--~~~~v~n~~~~~~L~~lkd-~~G~~i~~~~~~~~~~~ 326 (415) T protein:vir:47 256 SSGFEK--EGKKLEVKKAKS----LDDIKDAINLNVKPNYE--HNVAIVSQTMFAKLDKMKD-KLGNYLIQPDVKEKTQQ 326 (415) T ss_pred cccccc--ccceeccccccc----hHHHHHHHHhhhhhccC--CCEEEEcHHHHHHHHHhhc-cCCCeeeccCcCCCCCc Confidence 000000 000011111112 34444555555555543 2357899999998865321 22334433456678788 Q ss_pred EEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechh-hheeeeeeeeEEeeeeccchhhh Q lcl|NC_011085. 239 NVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRS-AVGTVKLKDLSLERARRAEYQAD 317 (343) Q Consensus 239 ~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-Av~~~~~~~~~~e~~~~~~~~~d 317 (343) +++|++|+.++++|..+.+.. ..++...+ ++..+..++++++.... ..+.. T Consensus 327 ~l~G~pV~~~~~~~~~~~~~~---------------------------~~~~gd~~~~~~~~~~~~~~v~~~~~-~~~~~ 378 (415) T protein:vir:47 327 RLLGAKIEILPDEVLGQKGNN---------------------------TLIIGNLKDAIVLFDRSQYQASWTDY-MHFGE 378 (415) T ss_pred cccceeeEEeccccccCCCcc---------------------------EEEEEehhccEEEEeecceEEEeecc-ccCce Confidence 999999999999985432211 01222222 23334445556655443 33345 Q ss_pred hhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 
318 QIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 318 ~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) .+++.++++.++++|++.+.++++.- T Consensus 379 ~~~~~~r~d~~v~~~~a~~~~~~~~~ 404 (415) T protein:vir:47 379 CLMIAVRQDCRILDYKSAIVIEYDDS 404 (415) T ss_pred EEEEEEEeccEEeccccEEEEEeecc Confidence 67888999999999999999887654 No 80 >protein:vir:4600 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058445;genbank:gi:9635171;genbank:GeneID:1262708 Probab=99.47 E-value=3.8e-15 Score=99.66 Aligned_cols=292 Identities=7% Similarity=-0.002 Sum_probs=162.3 Q ss_pred CCCCCccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccce-EEEEec-cCcceeeeecCCCc Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGK-SAQFPV-LGRTRAAYLQAGQS 78 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~-tv~i~~-iG~~t~~~~~~g~~ 78 (343) +..+...... +. ....+++--.+..+.|.+++.+..+..+.+++++++.++.++. ++.++. .+...+.....|.. T Consensus 110 ~~~~~~~~~~--~~-~~~~t~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~Eg~~ 186 (415) T protein:vir:46 110 TEYLETRNDI--QG-GSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEE 186 (415) T ss_pred HHHHhhhhhh--hh-ccccccCCcccccHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEEecCCcceeecccccc Confidence 0000000000 00 0000111112455899999999999999999999888776543 333333 34455666667776 Q ss_pred CCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccccc Q lcl|NC_011085. 79 LDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENIAGL 158 (343) Q Consensus 79 i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~g~ 158 (343) ++.+. .++.+++++..-+.- .-+.|.+-=-.++.+|+.+.+.++.+++|++..|+.|+.-...+.. .. . 
T Consensus 187 ~~~~~-~~~~~~v~~~~~k~~-~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~----~~-----~ 255 (415) T protein:vir:46 187 NPELA-VKPFFQLAYDINTHR-GYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGST----GS-----T 255 (415) T ss_pred ccccc-ccceeeEEeeeeeeE-eeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCCc----cc-----c Confidence 65422 234555666554432 2244543222235688999999999999999999998743221110 00 0 Q ss_pred CCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhccccccchhcceeE Q lcl|NC_011085. 159 GSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAALIDPERGSIR 238 (343) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~V~ 238 (343) ...... .......+.... ++.|.++...+...... .=.+|++|..|..|..-.. .+..|.-...+.+|..+ T Consensus 256 ~~~~~~--~~~~~~~~~~~~----~~~i~~~~~~~~~~~~~--~~~~v~n~~~~~~L~~lkd-~~G~~i~~~~~~~~~~~ 326 (415) T protein:vir:46 256 SSGFEK--EGKKLEVKKAKS----LDDIKDAINLNVKPNYE--HNVAIVSQTMFAKLDKMKD-KLGNYLIQPDVKEKTQQ 326 (415) T ss_pred cccccc--ccceeccccccc----hHHHHHHHHhhhhhccC--CCEEEEcHHHHHHHHHhhc-cCCCeeeccCcCCCCCc Confidence 000000 000011111112 34444555555555543 2357899999998865321 22334433456678788 Q ss_pred EEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechh-hheeeeeeeeEEeeeeccchhhh Q lcl|NC_011085. 239 NVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRS-AVGTVKLKDLSLERARRAEYQAD 317 (343) Q Consensus 239 ~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-Av~~~~~~~~~~e~~~~~~~~~d 317 (343) +++|++|+.++++|..+.+.. ..++...+ ++..+..++++++.... ..+.. T Consensus 327 ~l~G~pV~~~~~~~~~~~~~~---------------------------~~~~gd~~~~~~~~~~~~~~v~~~~~-~~~~~ 378 (415) T protein:vir:46 327 RLLGAKIEILPDEVLGQKGNN---------------------------TLIIGNLKDAIVLFDRSQYQASWTDY-MHFGE 378 (415) T ss_pred cccceeeEEeccccccCCCcc---------------------------EEEEEehhccEEEEeecceEEEeecc-ccCce Confidence 999999999999985432211 01222222 23334445556655443 33345 Q ss_pred hhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 
318 QIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 318 ~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) .+++.++++.++++|++.+.++++.- T Consensus 379 ~~~~~~r~d~~v~~~~a~~~~~~~~~ 404 (415) T protein:vir:46 379 CLMIAVRQDCRILDYKSAIVIEYDDS 404 (415) T ss_pred EEEEEEEeccEEeccccEEEEEeecc Confidence 67888999999999999999887654 No 81 >protein:vir:1886 Length: 385 # NCBI annotation: major capsid subunit precursor # Family: family:all:585 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037666;genbank:gi:9634124;genbank:GeneID:1262513 Probab=99.46 E-value=5.8e-15 Score=98.63 Aligned_cols=286 Identities=16% Similarity=0.127 Sum_probs=164.9 Q ss_pred CCCCCccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEeccC--cceeeeecCCCc Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVLG--RTRAAYLQAGQS 78 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG--~~t~~~~~~g~~ 78 (343) +.+..... .+-......++.-.+..+.+..++.+.....+.++.++++.++. +.++++|+.. ..++.....|+. T Consensus 93 ~~~~~~~~---~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~-~~~~~~~~~~~~~~~a~~v~E~~~ 168 (385) T protein:vir:18 93 QGTFGAKT---FNKSLGSDADSAGSLIQPMQIPGIIMPGLRRLTIRDLLAQGRTS-SNALEYVREEVFTNNADVVAEKAL 168 (385) T ss_pred hccchhhH---HHhhhccccccCCceecchhhhHHHHHhhhccchhhhcceeccc-CcceEEEEEecCCcceeeeccCcc Confidence 11110000 00001111111112455788889999999899999998887764 4578888863 345556667777 Q ss_pred CCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccccc Q lcl|NC_011085. 79 LDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENIAGL 158 (343) Q Consensus 79 i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~g~ 158 (343) ++.. +++..++++.+.+.- ..+.|.+ +-.+...++.+.+.++.++++++.+|+.+|.- .... ..+.|. 
T Consensus 169 ~~~~--~~~~~~~~~~~~k~~-~~~~is~-ell~d~~~l~~~i~~~la~a~~~~~d~~~l~G----~g~~----~~~~Gi 236 (385) T protein:vir:18 169 KPES--DITFSKQTANVKTIA-HWVQASR-QVMDDAPMLQSYINNRLMYGLALKEEGQLLNG----DGTG----DNLEGL 236 (385) T ss_pred cccc--ccceeEEEEeeeeEE-EeehhhH-HHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhc----cCCC----Cccccc Confidence 6553 356777777776653 3345654 33344466888999999999999999988732 1111 111111 Q ss_pred CCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhccccccchhcceeE Q lcl|NC_011085. 159 GSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAALIDPERGSIR 238 (343) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~V~ 238 (343) . ..+..... +........++.|.++...|.....+ .-.++++|..|..|.+-.. .+..|... ....|..+ T Consensus 237 ~-----~~~~~~~~-~~~~~~~~~~d~i~~~~~~l~~~~~~--~~~~~~~~~~~~~l~~lkd-~~G~~l~~-~~~~~~~~ 306 (385) T protein:vir:18 237 N-----KVATAYDT-SLNATGDTRADIIAHAIYQVTESEFS--ASGIVLNPRDWHNIALLKD-NEGRYIFG-GPQAFTSN 306 (385) T ss_pred c-----cccccccc-cccccccchHHHHHHHHHhhccccCC--CCEEEEcHHHHHHHHHhhc-CCCceecc-CcccCCCc Confidence 1 11100000 00001122356666777777666643 2368899999998875332 12223221 23466778 Q ss_pred EEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEeeeecc-c-hhh Q lcl|NC_011085. 239 NVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERARRA-E-YQA 316 (343) Q Consensus 239 ~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~~~-~-~~~ 316 (343) +++|.+|+.|+.+|.+.. +-++|.. ++..+..++++++..+.. . +.. T Consensus 307 ~l~G~pV~~~~~~p~~~~----------------------~~gd~~~---------~~~~~~~~~~~v~~~~~~~~~~~~ 355 (385) T protein:vir:18 307 IMWGLPVVPTKAQAAGTF----------------------TVGGFDM---------ASQVWDRMDATVEVSREDRDNFVK 355 (385) T ss_pred eecceeeEEcCcCCCCcE----------------------EEeeccc---------EEEEEEecceEEEEeccccchhhc Confidence 999999999999984311 0112222 222233344555554433 1 222 Q ss_pred --hhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 
317 --DQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 317 --d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) ..++..+++|.++++|++.+.++++++ T Consensus 356 ~~~~~~~~~r~~~~v~~~~a~~~~~~~aa 384 (385) T protein:vir:18 356 NMLTILCEERLALAHYRPTAIIKGTFSSG 384 (385) T ss_pred CcEEEEEEEeeccEEecccceEEEEeccC Confidence 355777889999999999999999999 No 82 >protein:vir:191 Length: 385 # NCBI annotation: major head subunit precursor # Family: family:all:585 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037701;genbank:gi:9634158;genbank:GeneID:1262530 Probab=99.46 E-value=5.8e-15 Score=98.63 Aligned_cols=286 Identities=16% Similarity=0.127 Sum_probs=164.9 Q ss_pred CCCCCccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEeccC--cceeeeecCCCc Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVLG--RTRAAYLQAGQS 78 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG--~~t~~~~~~g~~ 78 (343) +.+..... .+-......++.-.+..+.+..++.+.....+.++.++++.++. +.++++|+.. ..++.....|+. T Consensus 93 ~~~~~~~~---~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~-~~~~~~~~~~~~~~~a~~v~E~~~ 168 (385) T protein:vir:19 93 QGTFGAKT---FNKSLGSDADSAGSLIQPMQIPGIIMPGLRRLTIRDLLAQGRTS-SNALEYVREEVFTNNADVVAEKAL 168 (385) T ss_pred hccchhhH---HHhhhccccccCCceecchhhhHHHHHhhhccchhhhcceeccc-CcceEEEEEecCCcceeeeccCcc Confidence 11110000 00001111111112455788889999999899999998887764 4578888863 345556667777 Q ss_pred CCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccccc Q lcl|NC_011085. 79 LDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENIAGL 158 (343) Q Consensus 79 i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~g~ 158 (343) ++.. +++..++++.+.+.- ..+.|.+ +-.+...++.+.+.++.++++++.+|+.+|.- .... ..+.|. 
T Consensus 169 ~~~~--~~~~~~~~~~~~k~~-~~~~is~-ell~d~~~l~~~i~~~la~a~~~~~d~~~l~G----~g~~----~~~~Gi 236 (385) T protein:vir:19 169 KPES--DITFSKQTANVKTIA-HWVQASR-QVMDDAPMLQSYINNRLMYGLALKEEGQLLNG----DGTG----DNLEGL 236 (385) T ss_pred cccc--ccceeEEEEeeeeEE-EeehhhH-HHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhc----cCCC----Cccccc Confidence 6553 356777777776653 3345654 33344466888999999999999999988732 1111 111111 Q ss_pred CCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhccccccchhcceeE Q lcl|NC_011085. 159 GSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAALIDPERGSIR 238 (343) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~V~ 238 (343) . ..+..... +........++.|.++...|.....+ .-.++++|..|..|.+-.. .+..|... ....|..+ T Consensus 237 ~-----~~~~~~~~-~~~~~~~~~~d~i~~~~~~l~~~~~~--~~~~~~~~~~~~~l~~lkd-~~G~~l~~-~~~~~~~~ 306 (385) T protein:vir:19 237 N-----KVATAYDT-SLNATGDTRADIIAHAIYQVTESEFS--ASGIVLNPRDWHNIALLKD-NEGRYIFG-GPQAFTSN 306 (385) T ss_pred c-----cccccccc-cccccccchHHHHHHHHHhhccccCC--CCEEEEcHHHHHHHHHhhc-CCCceecc-CcccCCCc Confidence 1 11100000 00001122356666777777666643 2368899999998875332 12223221 23466778 Q ss_pred EEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEeeeecc-c-hhh Q lcl|NC_011085. 239 NVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERARRA-E-YQA 316 (343) Q Consensus 239 ~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~~~-~-~~~ 316 (343) +++|.+|+.|+.+|.+.. +-++|.. ++..+..++++++..+.. . +.. T Consensus 307 ~l~G~pV~~~~~~p~~~~----------------------~~gd~~~---------~~~~~~~~~~~v~~~~~~~~~~~~ 355 (385) T protein:vir:19 307 IMWGLPVVPTKAQAAGTF----------------------TVGGFDM---------ASQVWDRMDATVEVSREDRDNFVK 355 (385) T ss_pred eecceeeEEcCcCCCCcE----------------------EEeeccc---------EEEEEEecceEEEEeccccchhhc Confidence 999999999999984311 0112222 222233344555554433 1 222 Q ss_pred --hhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 
317 --DQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 317 --d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) ..++..+++|.++++|++.+.++++++ T Consensus 356 ~~~~~~~~~r~~~~v~~~~a~~~~~~~aa 384 (385) T protein:vir:19 356 NMLTILCEERLALAHYRPTAIIKGTFSSG 384 (385) T ss_pred CcEEEEEEEeeccEEecccceEEEEeccC Confidence 355777889999999999999999999 No 83 >protein:vir:103955 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873992;genbank:gi:118430767;genbank:GeneID:4525449 Probab=99.45 E-value=2.8e-14 Score=94.88 Aligned_cols=284 Identities=11% Similarity=0.032 Sum_probs=162.3 Q ss_pred CCCCCccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEecc-CcceeeeecCCCcC Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVL-GRTRAAYLQAGQSL 79 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~i-G~~t~~~~~~g~~i 79 (343) ..++..++.. +........+...|..+.|..++.+...+.|.++.+.+..++. +.+++||+. +...+.-+..|+.+ T Consensus 15 ~~~~~~~~~~--~a~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~-~~~~~~p~~~~~~~a~~v~Eg~~~ 91 (324) T protein:vir:10 15 ASNNVKPQVF--NPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPME-GTEKKFTFWADKPGAYWVGEGQKI 91 (324) T ss_pred HHHhhcccee--cccceeccCCCcceechhHHHHHHHHHHhhchhhhhcceeecc-CCceEEEEEeCCcceeEeccCccc Confidence 1111111110 0001111122223566899999999999999999998877765 456888886 55667777888887 Q ss_pred CCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccccC Q lcl|NC_011085. 80 DDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENIAGLG 159 (343) Q Consensus 80 ~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~g~~ 159 (343) +.+ +++.+++++..-+. ..-..|.+-=-.++..|+.+.+.++.++++++++|+.+|.--. . . ..+.+.. 
T Consensus 92 ~~~--~~~~~~v~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~ai~~~~d~a~l~G~g--~--~----~~~~~i~ 160 (324) T protein:vir:10 92 ETS--KATWVNATMRAFKL-GVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQG--N--N----PFGKSIA 160 (324) T ss_pred ccc--ccceeEEEEeeEEE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhhcCC--C--C----ccCcccc Confidence 653 35666777665443 2334454411223468899999999999999999998873211 0 0 0111111 Q ss_pred CceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhccccccchhcceeEE Q lcl|NC_011085. 160 SASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAALIDPERGSIRN 239 (343) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~V~~ 239 (343) .+ +. ........ ...++.|.++...|..++.... .++++|..|..|.+-. +.+|...+..+.-++ T Consensus 161 ~~--~~--~~~~~~~~----~~t~~~i~~~~~~l~~~~~~~~--~~v~n~~~~~~L~~l~-----d~~g~~~~~~~~~~~ 225 (324) T protein:vir:10 161 QS--IE--KTNKVIKG----DFTQDNIIDLEALLEDDELEAN--AFISKTQNRSLLRKIV-----DPETKERIYDRNSDT 225 (324) T ss_pred cc--cc--ccceeccc----cCCHHHHHHHHHhhhhccCCCC--EEEEcHHHHHHHHHhh-----ccCCceeecCCCCcc Confidence 00 00 00000111 1124556666777777665322 5689999999887532 223334445555578 Q ss_pred EeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEeeeeccc------ Q lcl|NC_011085. 240 VMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERARRAE------ 313 (343) Q Consensus 240 i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~~~~------ 313 (343) ++|.+|+.++..+.... ..++.+.+-+..+..++++++..++.. T Consensus 226 l~G~PV~~~~~~~~~~~------------------------------~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~ 275 (324) T protein:vir:10 226 LDGLPVVNLKSSNLKRG------------------------------ELITGDFDKLIYGIPQLIEYKIDETAQLSTVKN 275 (324) T ss_pred ccceeEEeecCCCCCcc------------------------------eEEEEecccEEEEEecCcEEEEeeccccccccc Confidence 99999998776542110 012222222223334455555554421 Q ss_pred --------hh--hhhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 
314 --------YQ--ADQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 314 --------~~--~d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) ++ .-.++..+++|.++++|++.+.|+...- T Consensus 276 ~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~A~~~l~~a~~ 315 (324) T protein:vir:10 276 EDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADK 315 (324) T ss_pred ccccchhhhhcCcEEEEEEEEEccEEecccceEEEEeccC Confidence 11 2445667889999999999998876555 No 84 >protein:vir:8187 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817980;genbank:gi:29566414;genbank:GeneID:2700968 Probab=99.45 E-value=3.2e-14 Score=94.58 Aligned_cols=294 Identities=13% Similarity=0.047 Sum_probs=165.4 Q ss_pred CCCCCccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEecc-CcceeeeecCCCcC Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVL-GRTRAAYLQAGQSL 79 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~i-G~~t~~~~~~g~~i 79 (343) ||-.+.|+- +..+.|..++.+..+..|+++.+.++.++.+| .+++|+. +.+.+.-+..|+.+ T Consensus 1 mat~~~gg~----------------lvP~~~~~~ii~~~~~~s~i~~~~~~i~~~~~-~~~~p~~~~~~~a~wv~Eg~~~ 63 (311) T protein:vir:81 1 MVALATGTF----------------QLPKHLVPGVWQKAQGQSVLARLSMAEPQEFG-EQQYMTLTAPPRGEVVGEGAQK 63 (311) T ss_pred CceecCCce----------------EcchhHHHHHHHHHHhcchhhhhcceeecCCC-ceEEEEEeCCceeEEeecCccc Confidence 776655432 23378899999999999999999887776554 5788886 67777778888888 Q ss_pred CCccCCCccceEEEEeeeeeeeeeeccchHHHH-----hchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccc Q lcl|NC_011085. 80 DDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAM-----NHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNEN 154 (343) Q Consensus 80 ~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q-----~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~ 154 (343) +.+ +++.+++++..-+.- .-..|.+ |.. ...++.+.+.++.+++|++.+|+.++.--.... ....... 
T Consensus 64 ~~~--~~~f~~v~l~~~kl~-~~~~iS~--ell~~~~d~~~~l~~~i~~~la~ai~~~~d~a~l~G~~~~~--~~~~~gi 136 (311) T protein:vir:81 64 SES--TATFAPVTAIPRKVQ-VTQRFSQ--EVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLT--GAALSGS 136 (311) T ss_pred ccc--cceeeEEEEeeEEEE-EeehhhH--HHhhcCcccHHHHHHHHHHHHHHHHHHHHHHhhhccccCCC--Ccccccc Confidence 754 456677777664442 2234433 322 345688999999999999999998874311000 0001111 Q ss_pred ccccC-CceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhccccccchh Q lcl|NC_011085. 155 IAGLG-SASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAALIDPE 233 (343) Q Consensus 155 ~~g~~-~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~ 233 (343) ..+.. ....+..+. .+.. .+...+.++...+...+.. ....+++|..+..|.+-..- +..+.-..... T Consensus 137 ~~~~~~~~~~~~~~~----~~~~----~~~~~i~~~~~~~~~~~~~--~~~~vmn~~~~~~l~~lkd~-~G~~l~~~~~~ 205 (311) T protein:vir:81 137 PAKILDTTNIVELTT----GTSA----TPDLAVEAAVGLVLGDNLS--PDGVALDNTFSFMLATQRDS-QGRKLYPELGF 205 (311) T ss_pred cccccccceeeeecc----cccc----hHHHHHHHHHHHhhhcCCC--ceEEEEcHHHHHHHHhhhcc-CCCeeecCccc Confidence 11110 111111111 1111 1122343444555555542 23578999999988653211 22222223344 Q ss_pred cceeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEeeeeccc Q lcl|NC_011085. 234 RGSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERARRAE 313 (343) Q Consensus 234 ~G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~~~~ 313 (343) .|..++++|.+|+.++++|........... .+..+..+...++...+-+.......++++..++.. 
T Consensus 206 ~~~~~tl~G~Pv~~~~~i~~~~~~~~~~~~--------------~~~~~~~~~~~~~gDfs~~~i~~~~~~~~~~~~~~~ 271 (311) T protein:vir:81 206 GTDVASFAGLNAAVSDTVRGGPEAVTASTG--------------VYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGD 271 (311) T ss_pred cCCCceecceeEEecccccccccccccccc--------------hhcccCCccEEEEEecccEEEEEeccceEEEeccCC Confidence 566789999999999999854322111100 000111112223333333333344445566554421 Q ss_pred -------hhhh--hhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 314 -------YQAD--QIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 314 -------~~~d--~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) +..+ .+++.+++|.++++|++.+.|+-..= T Consensus 272 ~~~~~~~~~~~~v~~r~~~r~d~~v~~~~a~~~l~~a~~ 310 (311) T protein:vir:81 272 PDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADE 310 (311) T ss_pred CCcchhhhhcCcEEEEEEEEeccEeecccceEEEEeecc Confidence 2222 45667889999999999888754333 No 85 >protein:vir:9410 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803388;genbank:gi:29028700;genbank:GeneID:1258136 Probab=99.45 E-value=9.7e-15 Score=97.42 Aligned_cols=292 Identities=8% Similarity=0.017 Sum_probs=164.9 Q ss_pred CCC-CCccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccc-eEEEEecc-CcceeeeecCCC Q lcl|NC_011085. 1 MAD-MKGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSG-KSAQFPVL-GRTRAAYLQAGQ 77 (343) Q Consensus 1 ~~~-~~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G-~tv~i~~i-G~~t~~~~~~g~ 77 (343) +.+ ...... .+.+.. ..++--.+..+.|.+++.+..+..+.+++++++..+.++ .++.++.. +...+.....|. 
T Consensus 109 ~~~~~~~~~~--~~~~~~-~~~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~Eg~ 185 (415) T protein:vir:94 109 FTEYLETRND--IQGGSL-KTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELE 185 (415) T ss_pred HHHHhhhhhh--hhhhcc-ccccccccCcHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEeecCCccceeccccc Confidence 000 000000 000000 011111234488999999999999999999998887654 34555443 455566666777 Q ss_pred cCCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccc Q lcl|NC_011085. 78 SLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENIAG 157 (343) Q Consensus 78 ~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~g 157 (343) .++.+. .++.+++++.+-+.- ..+.|.+-=-.++.+|+.+.+.++.++++++.+|+.|+.-...+.. . ....+ T Consensus 186 ~~~~~~-~~~~~~i~~~~~k~~-~~~~is~ell~ds~~~~~~~i~~~l~~~~~~~~~~~il~g~g~g~~----~-~~~~~ 258 (415) T protein:vir:94 186 ENPELA-VKPFFQLAYDINTHR-GYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGST----G-STSSG 258 (415) T ss_pred cccccc-cccceeeEeeheeee-eechhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCcc----c-ccccc Confidence 665322 234556666554442 2234543212235688999999999999999999988754321110 0 00000 Q ss_pred cCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhccccccchhccee Q lcl|NC_011085. 158 LGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAALIDPERGSI 237 (343) Q Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~V 237 (343) . . ........+.... ++.|.++...+...+.. .-.+|++|..|..|..-.. .+..|.-...+.+|.. 
T Consensus 259 ~----~--~~~~~~~~~~~~~----~~~i~~~~~~~~~~~~~--~~~~vmn~~~~~~l~~lkd-~~G~~l~~~~~~~~~~ 325 (415) T protein:vir:94 259 F----E--KEGKKLEVKKAKS----LDDIKDAINLNVKPNYE--HNVAIVSQTMFAKLDKMKD-KLGNYLIQPDVKEKTQ 325 (415) T ss_pred c----c--ccccccccccccc----hHHHHHHHHhhhhhccC--CCEEEEcHHHHHHHHHhhc-cCCCeeeccCcCCCCC Confidence 0 0 0000011111122 34455555566666653 3357889999999875321 1223333335567778 Q ss_pred EEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechh-hheeeeeeeeEEeeeeccchhh Q lcl|NC_011085. 238 RNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRS-AVGTVKLKDLSLERARRAEYQA 316 (343) Q Consensus 238 ~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-Av~~~~~~~~~~e~~~~~~~~~ 316 (343) ++++|++|+.++++|....+.. ..++...+ ++..+....++++..+. ..+. T Consensus 326 ~~l~G~pV~~~~~~~~~~~~~~---------------------------~i~~gd~~~~~~~~~~~~~~v~~~~~-~~~~ 377 (415) T protein:vir:94 326 QRLLGAKIEILPDEVLGQKGNN---------------------------TLIIGNLKDAIVLFDRSQYQASWTDY-MHFG 377 (415) T ss_pred ceecceeeEEecccccCCCCcc---------------------------EEEEEehhccEEEEeecceEEEEecc-ccCc Confidence 8999999999999985432211 01222222 23334445556654443 3445 Q ss_pred hhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 317 DQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 317 d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) ..+++.++++.++++|++++.++++.- T Consensus 378 ~~~r~~~r~d~~~~~~~a~~~~~~~~~ 404 (415) T protein:vir:94 378 ECLMIAVRQDCRILDYKSAIVIEYDDS 404 (415) T ss_pred eEEEEEEEeccEEeccccEEEEEEecc Confidence 678889999999999999999987765 No 86 >protein:vir:97148 Length: 324 # NCBI annotation: ORF010 # Family: family:all:507 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239726;genbank:gi:66394880;genbank:GeneID:5130881 Probab=99.45 E-value=2.8e-14 Score=94.85 Aligned_cols=286 Identities=10% Similarity=0.016 Sum_probs=165.4 Q ss_pred CCCCCccccc------------cccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEecc-Cc Q lcl|NC_011085. 
1 MADMKGGQQL------------GKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVL-GR 67 (343) Q Consensus 1 ~~~~~~~~~~------------~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~i-G~ 67 (343) |=.++..... ..+......+.+.-.+..+.|..++.+..++.+.++.+.+..++. +.+++||+. +. T Consensus 1 ~~~~~~~~~~~~~f~~~~~~~~~~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~~~~~~~~~-~~~~~ip~~~~~ 79 (324) T protein:vir:97 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPME-GTEKKFTFWADK 79 (324) T ss_pred CccchhHHHHHHHHHHhhhhhhhhccccccccCCCcceechhHHHHHHHHHHhhcchhhhcceeecc-CCceEEEEEecC Confidence 2111111100 001101111122223566899999999999999999998777754 566888886 55 Q ss_pred ceeeeecCCCcCCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhc Q lcl|NC_011085. 68 TRAAYLQAGQSLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNM 147 (343) Q Consensus 68 ~t~~~~~~g~~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~ 147 (343) +.+.-...|+.++.+ +++.+++++..-+. ..-..|.+---.++.+++.+.+.++.++++++++|+.+|.-- . T Consensus 80 ~~a~~v~Eg~~~~~~--~~~f~~v~~~~~k~-~~~~~is~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l~G~----g- 151 (324) T protein:vir:97 80 PGAYWVGEGQKIETS--KATWVNATMRAFKL-GVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQ----G- 151 (324) T ss_pred cceeEeccCcccccc--ccceeEEEEeeEEE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhccC----C- Confidence 666777778877653 46677777766544 233455542122346889999999999999999999887321 0 Q ss_pred cccccccccccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhccc Q lcl|NC_011085. 148 PAASNENIAGLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYA 227 (343) Q Consensus 148 ~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~ 227 (343) +. ..+.+... .. ......... ...++.|.++...|...+... -.++++|..|..|.+-. 
|.+ T Consensus 152 ~~---~~~~gi~~--~~--~~~~~~~~~----~~~~~~i~~~~~~l~~~~~~~--~~~v~n~~~~~~L~~lk-----d~~ 213 (324) T protein:vir:97 152 NN---PFGKSIAQ--SI--EKTNKVIKG----DFTQDNIIDLEALLEDDELEA--NAFISKTQNRSLLRKIV-----DPE 213 (324) T ss_pred CC---ccCccccc--cc--cccceeccc----cCCHHHHHHHHHhhhhccCCC--CEEEEcHHHHHHHHHhh-----cCC Confidence 00 00111000 00 000000111 112455666677777776532 25789999999887532 223 Q ss_pred cccchhcceeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEe Q lcl|NC_011085. 228 ALIDPERGSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLE 307 (343) Q Consensus 228 ~~~~~~~G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e 307 (343) |...+..+.-++++|.+|+.++..+.... ..++...+-+..+..+++++| T Consensus 214 g~~~~~~~~~~tl~G~PV~~~~~~~~~~~------------------------------~~~~gd~~~~~i~~~~~~~i~ 263 (324) T protein:vir:97 214 TKERIYDRNSDTLDGLPVVNLKSSNLKRG------------------------------ELITGDFDKLIYGIPQLIEYK 263 (324) T ss_pred CceeecCCCCccccceeeEeecCCCCCcc------------------------------eEEEEecccEEEEEecCcEEE Confidence 33444455567899999999876653210 012222222223445556666 Q ss_pred eeeccc--------------hh--hhhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 308 RARRAE--------------YQ--ADQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 308 ~~~~~~--------------~~--~d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) ..++.. ++ .-.++..+++|.++++|++.+.|+.... 
T Consensus 264 ~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~a~~~l~~~~~ 315 (324) T protein:vir:97 264 IDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADK 315 (324) T ss_pred EeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeccC Confidence 655432 12 2345666889999999999999988777 No 87 >protein:vir:94771 Length: 298 # NCBI annotation: major head protein # Family: family:all:966 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996706;genbank:gi:45597421;genbank:GeneID:2769044 Probab=99.44 E-value=3.6e-14 Score=94.31 Aligned_cols=282 Identities=12% Similarity=0.021 Sum_probs=164.1 Q ss_pred CCCCCccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEecc-CcceeeeecCCCcC Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVL-GRTRAAYLQAGQSL 79 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~i-G~~t~~~~~~g~~i 79 (343) ||- .+|. |..+.+..++.+..++.|+++.+.+..++.+| .++||++ +.+++.-+..|+.+ T Consensus 1 ma~---------------~gG~---lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~-~~~~p~~~~~~~a~~v~Eg~~~ 61 (298) T protein:vir:94 1 MVL---------------NKGT---LFDPELVTDLISKVAGKSSIARLSAQKPIPFN-GEKVFTFTMDSEIDVVAESGKK 61 (298) T ss_pred Cee---------------cccc---ccChhHHHHHHHHHHhhchhhhhcceeeccCC-ceEEEEEecCcceEEeeCCccc Confidence 222 2221 34578899999999999999998887776554 5788886 66678888888877 Q ss_pred CCccCCCccceEEEEeeeeeeeeeeccchHHHH-----hchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccc Q lcl|NC_011085. 80 DDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAM-----NHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNEN 154 (343) Q Consensus 80 ~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q-----~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~ 154 (343) +.+ +++.+++++..-+.. ..+.|.+ |.. ...++.+.+.++.+++|++.+|+.++....... ... 
T Consensus 62 ~~~--~~~f~~v~l~~~k~~-~~~~iS~--ell~~~~~~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~------g~~ 130 (298) T protein:vir:94 62 THG--GVTLAPQTMVPIKVE-YGARISD--EFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRL------GTA 130 (298) T ss_pred ccc--ccceeEEEEeeeEEE-EeeehhH--HHhccCCccHHHHHHHHHHHHHHHHHHHHHHHhhcccccCC------Ccc Confidence 654 456667777654442 3344543 322 235688999999999999999998874311000 000 Q ss_pred ccccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhccccccchhc Q lcl|NC_011085. 155 IAGLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAALIDPER 234 (343) Q Consensus 155 ~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~ 234 (343) ..+.+...... ..+...........+++.|.++..+|...+... ...+++|..+..|.+-..- +..|.-...... T Consensus 131 ~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~--~~~vmn~~~~~~l~~lkd~-~G~~l~~~~~~~ 205 (298) T protein:vir:94 131 SAVIGTNHFDS--KVTQKVEAPRGIADPNGAIENAVELLTGVDADV--TGIAINPSFRSALAKQKDL-QGNALFPELKWG 205 (298) T ss_pred ccccccccccc--ccccccccccccccHHHHHHHHHHhhhhcCCCc--cEEEEcHHHHHHHHHhhcc-CCCeeecCcccC Confidence 01110000000 000000000111234566777788888877643 3689999999988653221 223322344556 Q ss_pred ceeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEeeee--cc Q lcl|NC_011085. 235 GSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERAR--RA 312 (343) Q Consensus 235 G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~--~~ 312 (343) |..++++|++|+.++++|........ .-+-++|...+ .....++++++..+ ++ T Consensus 206 ~~~~tl~G~PV~~~~~v~~~~~~~~~----------------~~~~Gdfs~~~---------~~~~~~~~~~~~~~~~~~ 260 (298) T protein:vir:94 206 ATPDTINGLPVDVNKTVSDMSLTQRD----------------RAIIGDFANGF---------KWGYAKEVPLEVIQYGDP 260 (298) T ss_pred CCCceecceeeEEecccccccCCCcc----------------EEEEeeccceE---------EEEEecCceEEEeecCCC Confidence 77789999999999999853211100 00112333222 12223334444433 22 Q ss_pred c------hhhh--hhhhhhhhccceecccceEEEEecC Q lcl|NC_011085. 
313 E------YQAD--QIIARYAMGHGGLRPEAAGALVFTA 342 (343) Q Consensus 313 ~------~~~d--~i~~~~~~G~~v~rpe~~~~i~~~~ 342 (343) + ++.| .++..+++|.+++||++.+.|+-.- T Consensus 261 d~~~~~~f~~~~v~~r~~~r~~~~~~~~~a~~~l~~~t 298 (298) T protein:vir:94 261 DNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) T ss_pred cCcchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 1 2223 3677888999999999998886555 No 88 >protein:vir:4339 Length: 395 # NCBI annotation: major head protein # Family: family:all:585 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061502;genbank:gi:9635591;genbank:GeneID:1262860 Probab=99.43 E-value=3.3e-14 Score=94.47 Aligned_cols=291 Identities=14% Similarity=0.112 Sum_probs=166.9 Q ss_pred CCCCCccccccc-cccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEecc-C-cceeeeecCCC Q lcl|NC_011085. 1 MADMKGGQQLGK-DQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVL-G-RTRAAYLQAGQ 77 (343) Q Consensus 1 ~~~~~~~~~~~t-~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~i-G-~~t~~~~~~g~ 77 (343) +..+..+..... +......+++.-.+..+.|+.++.+..+..+.++++++..++. |.++.+++. + ..++..+..|+ T Consensus 98 ~~~~~~~~~~~~~~~~~~~~~~~~g~~vp~~~~~~ii~~~~~~~~l~~l~~~~~~~-~~~~~~~~~~~~~~~a~~v~E~~ 176 (395) T protein:vir:43 98 TSSLRGSHRVSMPRSAITSIDGSGGALVAPDRRPGVVAAPQRRLTIRDLVAPGTTE-SNSVEYVRETGFVNNAAPVSEGT 176 (395) T ss_pred HHHhhhhhhhhhhhhhhcccCCCCccccchhhHHHHHHHHHhhhhHHhhccceecC-CCceEEEEEecCCCceeeecCCc Confidence 111111111100 0000011111123567889999999999999999999888765 456777775 3 34566667777 Q ss_pred cCCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccc Q lcl|NC_011085. 78 SLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENIAG 157 (343) Q Consensus 78 ~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~g 157 (343) .++.. +++.+++++.+.+... 
-+.|.+ +-.+...++.+.+.++.+.++++.+|..++.- .....+ +.| T Consensus 177 ~~~~~--~~~~~~i~~~~~k~~~-~~~is~-ell~d~~~l~~~v~~~la~a~~~~~d~~~l~G----~g~~~~----~~G 244 (395) T protein:vir:43 177 QKPYS--DLTFELENAPVRTIAH-LFKASR-QILDDASALQSYIDARARYGLMLVEECQLLYG----NGTGAN----LHG 244 (395) T ss_pred ccccc--ccceeEEEEeeeeEEE-eehhhH-HHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhc----cCCCCc----ccc Confidence 76653 4567777777766543 245554 33444557888889999999999999988732 111111 111 Q ss_pred cCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhccccccchhccee Q lcl|NC_011085. 158 LGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAALIDPERGSI 237 (343) Q Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~V 237 (343) ......+.....+. .......++.|.++...+...+.+. -.+|++|..|..|.+-..- +..|... ...+|.. T Consensus 245 i~~~~~~~~~~~~~----~~~~~~~~~~i~~~~~~~~~~~~~~--~~~vmn~~~~~~l~~lkd~-~G~~i~~-~~~~~~~ 316 (395) T protein:vir:43 245 IIPQAQAYAPPSGV----VVTAEQRIDRIRLAILQAQLAEFPA--SGIVLNPIDWALIELNKDA-ENRYIIG-SPQNGTT 316 (395) T ss_pred cccccccccccccc----ccccchhHHHHHHHHHhhccccCCC--cEEEEcHHHHHHHHHhhcc-CCceecc-ccccCCC Confidence 11110000000111 1112234666777777777776542 3678999999988653221 2223222 3456667 Q ss_pred EEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEeeeeccc--hh Q lcl|NC_011085. 238 RNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERARRAE--YQ 315 (343) Q Consensus 238 ~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~~~~--~~ 315 (343) ++++|.+|+.++.+|.+.. +-++|+... ..+....++++..+... +. 
T Consensus 317 ~~l~G~pVv~~~~~~~~~~----------------------~~gd~~~~~---------~~~~~~~~~i~~~~~~~~~f~ 365 (395) T protein:vir:43 317 PTLWRLPVVETQAITQDEF----------------------LTGAFSLGA---------QIFDRMDIEVLVSTENDKDFE 365 (395) T ss_pred ceecceeeEEcCCCCCCcE----------------------EEEeccceE---------EEEEecceEEEEeccccchhh Confidence 8899999999999984321 112332221 11222334455544332 22 Q ss_pred hh--hhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 316 AD--QIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 316 ~d--~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) .+ .++..+++|.++++|++.+.+.++.= T Consensus 366 ~~~~~~r~~~r~d~~v~~~~a~~~~~~taa 395 (395) T protein:vir:43 366 NNMVTIRAEERLAFAVYRPEAFVTGSLTAS 395 (395) T ss_pred cCcEEEEEEEeeccEEecccceEEEEeccC Confidence 23 56777889999999999999987777 No 89 >protein:vir:99749 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004307;genbank:gi:122891761;genbank:GeneID:4712304 Probab=99.42 E-value=6.5e-14 Score=92.88 Aligned_cols=281 Identities=10% Similarity=-0.000 Sum_probs=164.6 Q ss_pred CCCCCccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEecc-CcceeeeecCCCcC Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVL-GRTRAAYLQAGQSL 79 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~i-G~~t~~~~~~g~~i 79 (343) |...+...-. ......+...|..+.|..++.+...+.+.++.+.++.++. +.+++||+. +...+.-...|+.+ T Consensus 18 ~~~~~~~~a~-----~~~~~~~~~~lip~~~~~~ii~~~~~~s~l~~~~~~~~~~-~~~~~~p~~~~~~~a~~v~Eg~~~ 91 (324) T protein:vir:99 18 NVKPQVFNPD-----NVMMHEKKDGTLLNDFTTPILQEVMENSKIMRLGKYEPME-GTEKKFTFWADKPGAYWVGEGQKI 91 (324) T ss_pred hhhhhhcccc-----ceeccCCCcceechhHHHHHHHHHHhhchhhhhcceeecc-CCceEEEEEecCcceeEeccCccc Confidence 2111111110 0111122223567899999999999999999998877765 456888886 45667777788887 Q ss_pred CCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccccC Q lcl|NC_011085. 
80 DDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENIAGLG 159 (343) Q Consensus 80 ~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~g~~ 159 (343) +.+ +++.+++++..-+.- .-..|.+-=-.++..|+.+.+.++.++++++++|+.+|.-- +. . ..+.+.. T Consensus 92 ~~~--~~~~~~v~~~~~k~~-~~~~iS~ell~ds~~~l~~~i~~~l~~ai~~~~d~~~l~G~--g~---~---~~~~~~~ 160 (324) T protein:vir:99 92 ETS--KATWVNATMRAFKLG-VILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQ--GN---N---PFGKSIA 160 (324) T ss_pred ccc--ccceeEEEEeeEEEE-EeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhhcC--CC---C---ccCcccc Confidence 754 456777777665442 33445542122345889999999999999999999887321 00 0 0111111 Q ss_pred CceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhccccccchhcceeEE Q lcl|NC_011085. 160 SASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAALIDPERGSIRN 239 (343) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~V~~ 239 (343) .+ .......... ...++.|.++...|..++.... .++++|..|..|.+-. |.+|...+..+.-++ T Consensus 161 ~~----~~~~~~~~~~----~~~~~~i~~~~~~l~~~~~~~~--~~v~n~~~~~~L~~l~-----d~~g~~~~~~~~~~~ 225 (324) T protein:vir:99 161 QS----IEKTNKVIKG----DFTQDNIIDLEALLEDDELEAN--AFISKTQNRSLLRKIV-----DPETKERIYDRNSDT 225 (324) T ss_pred cc----ccccceeccc----cCCHHHHHHHHHhhhhccCCCC--EEEEcHHHHHHHHHhh-----cCCCceeecCCCCcc Confidence 00 0000111111 1124566667777877765322 5789999999887532 233334444555578 Q ss_pred EeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEeeeeccc------ Q lcl|NC_011085. 240 VMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERARRAE------ 313 (343) Q Consensus 240 i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~~~~------ 313 (343) ++|.+|+.++.++.... ..++.+.+-+..+..+++++|..++.. 
T Consensus 226 l~G~PVv~~~~~~~~~~------------------------------~~i~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~ 275 (324) T protein:vir:99 226 LDGLPVVNLKSSNLKRG------------------------------ELITGDFDKLIYGIPQLIEYKIDETAQLSTVKN 275 (324) T ss_pred ccceeEEeecCCCCCcc------------------------------eEEEEecccEEEEEecCcEEEEeeccccccccc Confidence 99999999877652110 012222222223344455666554431 Q ss_pred --------hh--hhhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 314 --------YQ--ADQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 314 --------~~--~d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) ++ .-.++..+++|.++++|++.+.|+.... T Consensus 276 ~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~lt~a~~ 315 (324) T protein:vir:99 276 EDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADK 315 (324) T ss_pred ccccchhhhhcCcEEEEEEEEEccEEecccceEEEEeccC Confidence 11 2445667889999999999999887665 No 90 >protein:vir:9759 Length: 303 # NCBI annotation: putative structural protein # Family: family:all:966 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795521;genbank:gi:28876283;genbank:GeneID:1257824 Probab=99.42 E-value=4.5e-14 Score=93.74 Aligned_cols=285 Identities=10% Similarity=0.041 Sum_probs=158.3 Q ss_pred CCCCCccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEecc-CcceeeeecCCCcC Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVL-GRTRAAYLQAGQSL 79 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~i-G~~t~~~~~~g~~i 79 (343) ||....++. +..+.++.++.+..+..|.++.+.++.++.+ .+++||+. 
+.+.+..+.+|+.+ T Consensus 1 m~t~t~gg~----------------liP~~~~~~ii~~l~~~s~i~~l~~~~~~~~-~~~~ip~~~~~~~a~wv~E~~~~ 63 (303) T protein:vir:97 1 MGTETSKAS----------------LFDKHLVSDLINKVKGHSSLAKLSSQKPIPF-NGSKEFTFTLDSDIDVVAENGKK 63 (303) T ss_pred CcccCCCCe----------------EcchhHHHHHHHHHHhhchhhhhcceeecCC-CceEEEEEecCcceEEeecCccc Confidence 553322221 3457899999999999999999988777654 56777774 56677778788877 Q ss_pred CCccCCCccceEEEEeeeeeeeeeeccchHHH-----HhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccc Q lcl|NC_011085. 80 DDKRKDIKHTEKTIVIDGLLTADVLIYDIEDA-----MNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNEN 154 (343) Q Consensus 80 ~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~-----q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~ 154 (343) +.+ +++-+++++..-+. .....|.+ |. ....++.+.+.++.+++|++.+|+.++.-.-. ....... T Consensus 64 ~~s--~~~f~~v~l~~~kl-~~~~~iS~--ell~~~~d~~~~l~~~i~~~la~a~~~~ld~a~l~G~~~----~~g~~~~ 134 (303) T protein:vir:97 64 THG--GLSLEPVTIVPIKV-EYGARLSD--EFLYATEEEKIDILKAFNEGFAKKLARGIDLMAMHGINP----RTKKASD 134 (303) T ss_pred ccc--ccceeeEEeeeEEE-EEeehhhH--HHhhcCccchHHHHHHHHHHHHHHHHHHHHhhhhccccc----CCccccc Confidence 653 35666666655333 22334432 32 23467889999999999999999988743210 0000000 Q ss_pred ccccCCceeecccc-cccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhccccc-cch Q lcl|NC_011085. 155 IAGLGSASILEVGA-KGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAAL-IDP 232 (343) Q Consensus 155 ~~g~~~~~~~~~~~-~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~-~~~ 232 (343) +.+. +......+ ....++. ...++.|.++...+...+... ..++++|..+..|.+-..... .+.-. ... 
T Consensus 135 ~~~~--~~~~~~~~~~~~~~~~----~~~~~~i~~~~~~~~~~~~~~--~~~vmn~~~~~~L~~lkd~~g-~~~~~~~~~ 205 (303) T protein:vir:97 135 VIGT--NHFDSKVTQVVKFTES----EDADANIEAAVNLIQGAEGVV--TGLAMDTEFSTALAKVTNGEM-GPKMYPELA 205 (303) T ss_pred cccc--cccccccccccccccc----cchHHHHHHHHHHHhhcCCCc--cEEEEcHHHHHHHHHhhccCC-CeEEecCcc Confidence 0000 00000000 0000111 123455666666666665432 348889999998875322111 11111 112 Q ss_pred hcceeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEeeee-- Q lcl|NC_011085. 233 ERGSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERAR-- 310 (343) Q Consensus 233 ~~G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~-- 310 (343) ..+..++++|.+|+.|+++|......... ..-+-++|.+.+.+ ...+.+++|... T Consensus 206 ~~~~~~~l~G~Pv~~s~~v~~~~~~~~~~--------------~~~~~Gdf~~~~~~---------~~~~~~~~~~~~~~ 262 (303) T protein:vir:97 206 WGANPDSINGLKSSVNTTVGAGADEAESK--------------DLVIIGDFESMFKW---------GYAKQIPMEIIKYG 262 (303) T ss_pred CCCCCceecceeeEEecccCCccccCCCc--------------cEEEEeeccccEEE---------EEecCcEEEEeecc Confidence 23455789999999999999543221110 00122344333322 222233343332 Q ss_pred ccc------hhhh--hhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 
311 RAE------YQAD--QIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 311 ~~~------~~~d--~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) +++ +..| .++...+++.++++|++.+.|+-..= T Consensus 263 ~~d~~~~~~~~~n~~~~r~~~r~~~~v~~p~af~~l~~~~~ 303 (303) T protein:vir:97 263 DPDNSGKDLKGYNQIYLRAEAYIGWGILDAKSFARVTKGEV 303 (303) T ss_pred CCCCcchhhhhcCcEEEEEEEEeccEeecccceEEeeCCCC Confidence 211 2222 56778899999999999887764333 No 91 >protein:vir:8102 Length: 543 # NCBI annotation: gp6 # Family: family:all:21 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817683;genbank:gi:29566114;genbank:GeneID:1259308 Probab=99.41 E-value=4.5e-14 Score=93.73 Aligned_cols=297 Identities=13% Similarity=0.059 Sum_probs=156.9 Q ss_pred CCCCCccc-cccccccccccccchhHHHHHHHHHHHH-HHHHHhhhhccCccccccccceEEEEec-cCcceeeeecCCC Q lcl|NC_011085. 1 MADMKGGQ-QLGKDQGKGQSGGDKLALFLKVFGGEVL-TAFARTSVTTNRHIMRSISSGKSAQFPV-LGRTRAAYLQAGQ 77 (343) Q Consensus 1 ~~~~~~~~-~~~t~~g~~~~~~d~~al~ie~~~g~V~-~~f~~~s~~~~~~~~~~i~~G~tv~i~~-iG~~t~~~~~~g~ 77 (343) +.....-. ......+....++- .|..+.|..++. ..+...+.+..+.++... .|+ +.+|+ .+...+..+..|. T Consensus 237 l~~~e~~~~~~~~~~~~t~~~gg--~lip~~~~~~ii~~~~~~~~~l~~~~~~~~~-~g~-~~~~~~~~~~~a~~v~Eg~ 312 (543) T protein:vir:81 237 LTEEEKRAINEVRAMGLTKADGG--YLVPFQLDPTVIITSNGSLNDIRRFARQVVA-TGD-VWHGVSSAAVQWSWDAEFE 312 (543) T ss_pred hhhhhhhhhhhhhhcccccccCc--ccCchhhhhHHHHHHHhhhchhhhhcccccC-Ccc-eEEEEecCCcceeecccCc Confidence 00000000 00000000011111 234467776654 566777888877775443 344 44544 4666677777787 Q ss_pred cCCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccc Q lcl|NC_011085. 78 SLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENIAG 157 (343) Q Consensus 78 ~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~g 157 (343) .++.+ .++..++++.+.+.-. -+.|.+ +-.+.+.|+.+.+.++.++++++..|+.||.- .... 
..+.| T Consensus 313 ~~~~~--~~~~~~i~~~~~k~~~-~~~is~-ell~d~~~~~~~i~~~l~~~~~~~~d~ail~G----~Gt~----~~p~G 380 (543) T protein:vir:81 313 EVSDD--SPEFGQPEIPVKKAQG-FVPISI-EALQDEANVTETVALLFAEGKDELEAVTLTTG----TGQG----NQPTG 380 (543) T ss_pred ccccc--ccccceeeeeeeeeEe-eehhhH-HHHhccHHHHHHHHHHHHHHHHHHHHHHHhcc----CCCC----ccccc Confidence 77653 4567777777665532 245554 34445689999999999999999999988621 1100 11122 Q ss_pred cCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhccccccchhccee Q lcl|NC_011085. 158 LGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAALIDPERGSI 237 (343) Q Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~V 237 (343) ............+..++. ...++.+.++...|...+-+ .-.++++|..|..|.+-..- +..|.- ..+..|.. T Consensus 381 i~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~l~~~~~~--~~~~v~n~~~~~~l~~lkd~-~G~~l~-~~~~~g~~ 452 (543) T protein:vir:81 381 IVTALAGTAAEIAPVTAE----TFALADVYAVYEQLAARHRR--QGAWLANNLIYNKIRQFDTQ-GGAGLW-TTIGNGEP 452 (543) T ss_pred chhhcccccccccccccc----cccHHHHHHHHHhhhccccC--CcEEEEcHHHHHHHHHhhcC-CCceec-cCcCCCCC Confidence 111000000000011111 11245555566666655533 23678999999998753211 222221 13445666 Q ss_pred EEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEeeeecc----- Q lcl|NC_011085. 238 RNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERARRA----- 312 (343) Q Consensus 238 ~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~~~----- 312 (343) ++++|.+|+.++++|......... ...+.++.+.+-+..+...+++++...+. T Consensus 453 ~~l~G~pv~~~~~~~~~~~~~~~~----------------------~~~~i~~gd~~~~~i~~~~~~~i~~~~~~~~~~~ 510 (543) T protein:vir:81 453 SQLLGRPVGEAEAMDANWNTSASA----------------------DNFVLLYGNFQNYVIADRIGMTVEFIPHLFGTNR 510 (543) T ss_pred ccccceeeEEeccccccccccccC----------------------CcceEEEeeccceeEEeecccEEEEeccccccch Confidence 789999999999999654221110 00111223333333333334444432211 Q ss_pred -chhhhhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 
313 -EYQADQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 313 -~~~~d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) ....-.+++.+++|.++++|++.+.++++.. T Consensus 511 ~~~~~~~~~~~~r~d~~v~~~~A~~~l~~~~~ 542 (543) T protein:vir:81 511 RPNGSRGWFAYYRMGADVVNPNAFRLLNVETA 542 (543) T ss_pred hhcCceEEEEEEeeccEeecccceEEEEeccc Confidence 1112345677788999999999999999999 No 92 >protein:vir:3870 Length: 400 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680487;swissprot:trembl:q8ltc0;genbank:gi:22296527;interpro:IPR006444;uniprot:Q8LTC0;genbank:GeneID:951713 Probab=99.41 E-value=1.4e-14 Score=96.57 Aligned_cols=277 Identities=11% Similarity=0.051 Sum_probs=157.5 Q ss_pred CCCCCccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEec--cCcceeeeecCCCc Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPV--LGRTRAAYLQAGQS 78 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~--iG~~t~~~~~~g~~ 78 (343) ................+...++--.+..+.|..++.+.....+.+++++++.++.++ +..+|. .+...+..+..+.. T Consensus 120 ~~~~~~~~~~~~~~~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~E~~~ 198 (400) T protein:vir:38 120 AVLRAVPTDASDAVNAGVKAADAASTIPETISNTPQRELQTVVDLKPFTNVFQASTQ-KGTYPTVANATTKMVTVAELEK 198 (400) T ss_pred hhhhhhhHHHHHHHhhcccccCCcccccHHHHHHHHHHHHhhhhhhhcceeEeccCc-ceEEEEEecCCCcccccccccc Confidence 000000000000000111111111245589999999999999999999988776543 445554 34445566666665 Q ss_pred CCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccccc Q lcl|NC_011085. 79 LDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENIAGL 158 (343) Q Consensus 79 i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~g~ 158 (343) .... ..+...++++.+.+. +.-+.|.+-=-.++.+|+.+.+.++.+++|+...|+.|+.... 
T Consensus 199 ~~~~-~~~~f~~i~~~~~k~-~~~~~is~ell~ds~~~~~~~i~~~l~~~~~~~~~~~i~~~~~---------------- 260 (400) T protein:vir:38 199 NPAM-AKPEFKPVNWSVETY-RQALPVSQESIDDSAIDLVGLIAQNGQQIKVNTTNGAVATLLK---------------- 260 (400) T ss_pred cccc-ccccceeeEeehhhe-eeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhhhhccc---------------- Confidence 5432 234555666655433 2223444311123568899999999999999999988863211 Q ss_pred CCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhccccccchhcceeE Q lcl|NC_011085. 159 GSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAALIDPERGSIR 238 (343) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~V~ 238 (343) ++. .....+ ++.+.+.+ ...++. ...-..|++|..|..|.+-.. .+..|.-...+.+|.-+ T Consensus 261 -~~~------~~~~~~----~~~~~~~~---~~~~~~----~~~a~~v~~~~~~~~l~~lkd-~~G~~i~~~~~~~~~~~ 321 (400) T protein:vir:38 261 -GFT------AKTISS----VDDLKHIN---NVDLDP----AYSRVIIASQSFYNFLDTVKD-GNGRYLLQDSILTPSGK 321 (400) T ss_pred -ccc------cccccc----HHHHHHHH---Hhhhhh----hhCcEEEEcHHHHHHHHHhhc-cCCCeeeecCcCCCCcc Confidence 000 001111 22222221 112222 123467889999999875321 12333323345667778 Q ss_pred EEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechh-hheeeeeeeeEEeeeeccchhhh Q lcl|NC_011085. 239 NVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRS-AVGTVKLKDLSLERARRAEYQAD 317 (343) Q Consensus 239 ~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-Av~~~~~~~~~~e~~~~~~~~~d 317 (343) +++|++|+.+++.|..+.+.. +.++...+ ++..+..+.++++..++ .++.. T Consensus 322 ~l~G~pv~~~~~~~~~~~g~~---------------------------~~~~gd~s~~~~~~~~~~~~~~~~~~-~~~~~ 373 (400) T protein:vir:38 322 SVLGMPIAVVSDDTLGAAGEA---------------------------HAFLGDIKRAILFANRADFMVRWVDD-QIYGQ 373 (400) T ss_pred ccccceeEEecccccCCCCce---------------------------EEEEEeccccEEEEeecceEEEEecc-cccce Confidence 999999999999985432211 11232322 33344445556666544 45667 Q ss_pred hhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 
318 QIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 318 ~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) .+++.+++|.++++|++.+.|+++.. T Consensus 374 ~~~~~~r~d~~~~~~~a~~~l~~~~~ 399 (400) T protein:vir:38 374 FLQAGMRFGVSVADEKAGYFLTYTPK 399 (400) T ss_pred eEEEEEEeccEEecccceEEEEeecC Confidence 89999999999999999999999999 No 93 >protein:vir:104085 Length: 320 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655596;genbank:gi:109392467;genbank:GeneID:4156953 Probab=99.41 E-value=1.1e-13 Score=91.62 Aligned_cols=295 Identities=13% Similarity=0.016 Sum_probs=161.0 Q ss_pred CCCCCccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEecc-CcceeeeecCCCcC Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVL-GRTRAAYLQAGQSL 79 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~i-G~~t~~~~~~g~~i 79 (343) ||-....+- ..+.-..-.+++.-.+..+.|..++.+..++.+.++.+.+...+. +.+.+||+. +.+.+.-+..|+.+ T Consensus 1 ~~~~~~~~~-~~~~~~~t~~~~~~~~ip~~~~~~ii~~~~~~s~l~~~~~~~~~~-~~~~~~p~~~~~~~a~~v~E~~~~ 78 (320) T protein:vir:10 1 MAAGTAFQV-DHAQIAQTGDTMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKVPMG-TTGQKIPHWIGDVSAQWIGEGDMK 78 (320) T ss_pred CCCCccCCH-HHHHhhccccccccccccHHHHHHHHHHHHhccchhhhcceeecc-CCceEEEEEeCCcceEEecCCccc Confidence 555444321 122111111222122556889999999999999999988877764 456788876 55667777778887 Q ss_pred CCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccccC Q lcl|NC_011085. 80 DDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENIAGLG 159 (343) Q Consensus 80 ~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~g~~ 159 (343) +.+ +++.+++++.+-+. ..-+.|.+-=-.++..|+.+.+.++.++++++.+|+.+|.- ... .....+.+.. 
T Consensus 79 ~~~--~~~f~~v~~~~~k~-~~~~~is~ell~ds~~~l~~~i~~~l~~a~a~~~d~a~l~G----~g~--~~~~~~~~~~ 149 (320) T protein:vir:10 79 PIT--KGNMTSQNIAPHKI-ATIFVASAETVRANPANYLGTMRTKVATAFAMAFDSAALNG----TDS--PFPTYLAQTT 149 (320) T ss_pred ccc--ccceeEEEEeeEEE-EEeehhhHHHHhcChHHHHHHHHHHHHHHHHHHHHHHhhcc----cCC--CCCccccccc Confidence 653 45666777666543 23344544212235689999999999999999999988632 110 0001111111 Q ss_pred CceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhccccccch-----hc Q lcl|NC_011085. 160 SASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAALIDP-----ER 234 (343) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~-----~~ 234 (343) .+..+.........+ -..+-+.+.++...+...+. ..-+.+++|..|..|.+-..- +..+...... .. T Consensus 150 ~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~--~~~~~v~n~~~~~~L~~lkd~-~G~~l~~~~~~~~~~~~ 222 (320) T protein:vir:10 150 KSVSLADPGGATASD----LTAYDAVAVNGLSLLVNAKK--KWTHTLLDDIVEPILNGAKDK-NGRPLFIESTYTDENSP 222 (320) T ss_pred ccccceecccccccc----cccHHHHHHHHHhhhhcccC--CCcEEEEcHHHHHHHHHhhcc-CCceeeccccccCcccc Confidence 111100000000000 01112234445555555543 344788999999999753221 1122111111 11 Q ss_pred ceeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEeeeeccc- Q lcl|NC_011085. 235 GSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERARRAE- 313 (343) Q Consensus 235 G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~~~~- 313 (343) ..-++++|++|+.++++|.... .+++.+.+-+..+....++++..++.. T Consensus 223 ~~~~~i~g~pv~~~~~~~~~~~------------------------------~~~~gd~~~~~~~~~~~~~i~~~~~~~~ 272 (320) T protein:vir:10 223 FRAGRIVSRPTILSDHVADGTT------------------------------VGYMGDFRNVIWGQVGGLSFDVTDQATL 272 (320) T ss_pred ccCceeeeeeeEecCCCCCCce------------------------------EEEEeecceEEEEEecCeEEEEeeccee Confidence 1124789999999999884221 112222222223344445555554431 Q ss_pred -------------hhh--hhhhhhhhhccceecccceEEEE-ecCC Q lcl|NC_011085. 
314 -------------YQA--DQIIARYAMGHGGLRPEAAGALV-FTAG 343 (343) Q Consensus 314 -------------~~~--d~i~~~~~~G~~v~rpe~~~~i~-~~~g 343 (343) +.. -.++..+++|.+++||++.+.|+ .++. T Consensus 273 ~~~~~~~~~~~~~f~~~~~~~r~~~~~d~~v~~~~a~~~l~~~~ap 318 (320) T protein:vir:10 273 NLGTPTEPNFVSLWQHNLVAVRVEAEYAFHNNDKDAFVKLTNVVTP 318 (320) T ss_pred eeccccccccchhhhcCcEEEEEEEeeccEEecccceEEEEeccCC Confidence 111 34577788999999999998886 3344 No 94 >protein:vir:104256 Length: 458 # NCBI annotation: major head protein precursor # Family: family:all:27070 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006977;genbank:gi:46401878;genbank:GeneID:2777673 Probab=99.41 E-value=4.6e-14 Score=93.72 Aligned_cols=294 Identities=13% Similarity=0.044 Sum_probs=153.4 Q ss_pred CCCCCccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEec-cCcceeeeecCCCcC Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPV-LGRTRAAYLQAGQSL 79 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~-iG~~t~~~~~~g~~i 79 (343) +......+.. ....+...+..+.|..++.+..+..+.++.+.++.++.++ ...+++ .+.+.+.....+... T Consensus 155 ~~~~~a~~~~-------~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~a~~v~e~~~~ 226 (458) T protein:vir:10 155 QRHLKAVNQS-------SSVEVSSESYETIFSQRIIRDLQKELVVGALFEELPMSSK-ILTMLVEPDAGKATWVAASTYG 226 (458) T ss_pred hhhhhhhhhc-------ccCccccceehhhHhHHHHHHHHhhhhHHhhcceeecCCc-ceEEEEecCCcceeeccccccc Confidence 1111100110 0111122356789999999999999998888887776554 445544 344455444555444 Q ss_pred CCcc----CCCccceEEEEeeeeeeee-eeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccc Q lcl|NC_011085. 80 DDKR----KDIKHTEKTIVIDGLLTAD-VLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNEN 154 (343) Q Consensus 80 ~~~~----~~~~~~~~~l~iD~~~~~~-~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~ 154 (343) +.+. ..++..++++ ...++.. +.|.+-=-.++.+++.+.+.++.+++|++.+|+.+|.- .++ . . 
T Consensus 227 ~~~~~~~~~~~~~~~i~~--~~~k~~~~v~is~ell~ds~~~~~~~i~~~l~~~i~~~~d~~~l~G--~G~---~----~ 295 (458) T protein:vir:10 227 TDTTTGEEVKGALKEIHF--STYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMTG--DGS---G----K 295 (458) T ss_pred ccccccccccccceeeEe--eeeeEEeeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcC--CCC---C----c Confidence 3321 1223444444 4444444 34443212224589999999999999999999988731 111 1 1 Q ss_pred ccccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhcccc----cc Q lcl|NC_011085. 155 IAGLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAA----LI 230 (343) Q Consensus 155 ~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~----~~ 230 (343) +.|................+........++.|.++...|...+.. .=..|++|..|..|..-... +..|.. .. T Consensus 296 p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~--~~~~v~~~~~~~~l~~lkd~-~G~~i~~~~~~~ 372 (458) T protein:vir:10 296 PKGLLTLASEDSAKVVTEAKADGSVLVTAKTISKLRRKLGRHGLK--LSKLVLIVSMDAYYDLLEDE-EWQDVAQVGNDS 372 (458) T ss_pred cceeeecccccccceeecccccccccccHHHHHHHHHhhhhhhcC--CCEEEEcHHHHHHHHhhccc-CCceeecccccc Confidence 111111100000000000000000011144555566667666543 33568899999887643211 222221 22 Q ss_pred chhcceeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEeeee Q lcl|NC_011085. 231 DPERGSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERAR 310 (343) Q Consensus 231 ~~~~G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~ 310 (343) ....|...+++|.+|+.++.+|..+.... -.+|.|. +....+....++++... T Consensus 373 ~~~~~~~~~l~G~pv~~~~~~p~~~~~~~-------------------------~~~~~f~--~~~~~~~~~~~~v~~d~ 425 (458) T protein:vir:10 373 VKLQGQVGRIYGLPVVVSEYFPAKANSAE-------------------------FAVIVYK--DNFVMPRQRAVTVERER 425 (458) T ss_pred ccccCcCceecceeeEEccccccccCCcc-------------------------eEEEEec--ccEEEEEeeceEEEeec Confidence 34456667899999999999995421110 0111111 12222333344443221 Q ss_pred ccchhhhhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 
311 RAEYQADQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 311 ~~~~~~d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) -...-...++...++|..+.+|++.+..++.+- T Consensus 426 ~~~~~~~~~~~~~r~~~~v~~~~a~v~~~~aa~ 458 (458) T protein:vir:10 426 QAGKQRDAYYVTQRVNLQRYFANGVVSGTYAAS 458 (458) T ss_pred ccCCCceEEEEEEEecceEecccceEEEeeccC Confidence 112222346777889999999999999888888 No 95 >protein:vir:1638 Length: 298 # NCBI annotation: Structural protein # Family: family:all:966 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695059;genbank:gi:23455750;genbank:GeneID:955469 Probab=99.40 E-value=1.2e-13 Score=91.51 Aligned_cols=281 Identities=12% Similarity=0.015 Sum_probs=161.4 Q ss_pred CCCCCccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEecc-CcceeeeecCCCcC Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVL-GRTRAAYLQAGQSL 79 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~i-G~~t~~~~~~g~~i 79 (343) ||- .+|. |..+.+..++.+..++.|.++.+.+..++.+|+ +.||+. +.+++.-+..|+.+ T Consensus 1 ma~---------------~gG~---lvp~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~-~~ip~~~~~~~a~~v~E~~~~ 61 (298) T protein:vir:16 1 MVL---------------NKGT---LFDPTLVTDLISKVAGKSSIARLSAQKPIPFNG-EKVFTFTMDSEIDVVAESGKK 61 (298) T ss_pred Ccc---------------cCcc---eechhHHHHHHHHHHhhhhhhhhcceeeccCCc-eEEEEEecCcceEEecCCccc Confidence 441 1121 355678888888899899999998877766544 567774 66778888888877 Q ss_pred CCccCCCccceEEEEeeeeeeee-eeccchHHHH-----hchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccc Q lcl|NC_011085. 80 DDKRKDIKHTEKTIVIDGLLTAD-VLIYDIEDAM-----NHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNE 153 (343) Q Consensus 80 ~~~~~~~~~~~~~l~iD~~~~~~-~~Idd~D~~q-----~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~ 153 (343) +.+ +++.+++++..- ++.. ..|.+ |.. +..++.+.+.++.++++++.+|+.++.-..... . 
T Consensus 62 ~~~--~~~f~~v~l~~~--k~a~~~~iS~--ell~~s~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~-------g 128 (298) T protein:vir:16 62 THG--GVTLAPQTMVPI--KVEYGARISD--EFMYASDEEKINILQEFNDGFAKKVARGIDLMAFHGVNPRL-------G 128 (298) T ss_pred ccc--ccceeEEEEeee--eEEEeehhhH--HHhhcCcccHHHHHHHHHHHHHHHHHHHHHHHhhccccCCC-------C Confidence 654 355566666554 3333 33432 322 346788999999999999999998874311111 0 Q ss_pred cccccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhccccccchh Q lcl|NC_011085. 154 NIAGLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAALIDPE 233 (343) Q Consensus 154 ~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~ 233 (343) ...+..+-.... ...+...........+++.|.++...+...+.+.. ..+++|..+..|.+-... +..|.-..... T Consensus 129 ~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~--~~vmn~~~~~~l~~lkd~-~G~~i~~~~~~ 204 (298) T protein:vir:16 129 TASAVIGTNHFD-SKVTQKVEAPRGIADPNGAIENAVELLTGVDADVT--GIAINPSFRSALAKQKDL-QDNALFPELKW 204 (298) T ss_pred cccccccccccc-cccccccccccccccHHHHHHHHHHHhhhcCCCcc--EEEEcHHHHHHHHHhhcc-CCCeeecCccc Confidence 000000000000 00000011111112235567777777877776433 477899999988764322 22232233456 Q ss_pred cceeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEeeeec-- Q lcl|NC_011085. 234 RGSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERARR-- 311 (343) Q Consensus 234 ~G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~~-- 311 (343) .|..++++|.+|+.++++|....+... .-+-++|+..+. ......++++..++ T Consensus 205 ~~~~~~l~G~PV~~~~~v~~~~~~~~~----------------~~~~GDfs~~~~---------~~~~~~~~~~~~~~~~ 259 (298) T protein:vir:16 205 GATPDTINGLPVDVNKTVSDMSLTQRD----------------RAIIGDFANGFK---------WGYAKEVPLEVIQYGD 259 (298) T ss_pred CCCCceecceeeEEecccccccCCCcc----------------EEEEeeccceEE---------EEEecCceEEEeeccC Confidence 677789999999999999953221100 011234433322 12222334443332 Q ss_pred cc------hhh--hhhhhhhhhccceecccceEEEEecC Q lcl|NC_011085. 
312 AE------YQA--DQIIARYAMGHGGLRPEAAGALVFTA 342 (343) Q Consensus 312 ~~------~~~--d~i~~~~~~G~~v~rpe~~~~i~~~~ 342 (343) +. ++. -.++..+++|.+++||++.+.|+-.- T Consensus 260 ~~~~~~~~f~~~~v~~ra~~r~d~~v~~~~a~~~l~~at 298 (298) T protein:vir:16 260 PDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) T ss_pred CcCcchhhhhcCcEEEEEEEEEccEeecccceEEEeecC Confidence 21 122 34677888999999999999886555 No 96 >protein:vir:95763 Length: 297 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950590;genbank:gi:119953785;genbank:GeneID:5076833 Probab=99.40 E-value=8.1e-14 Score=92.36 Aligned_cols=278 Identities=12% Similarity=0.045 Sum_probs=162.9 Q ss_pred CCCCCccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEecc-CcceeeeecCCCcC Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVL-GRTRAAYLQAGQSL 79 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~i-G~~t~~~~~~g~~i 79 (343) |.-+.--+- ....+++.-.|..+.|..++.+..++.+.++.+.+...+.++..+.+++. +...+..+..|+.+ T Consensus 1 m~~~~~~~~------~~~~t~~~~~lvP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~ 74 (297) T protein:vir:95 1 MTVQTFNPE------NVLVSQKKDGTLHKEFTDIIMKEVAQNSLVMQLGQYQEMEGEQEKTVYVQTDGISAYWVNETEKI 74 (297) T ss_pred CCccccccc------cccccCCCcceechhHHHHHHHHHHhhchhhhhcceeecCCCccEEEEEEcCCceeEEeecCccc Confidence 322211111 11112222236779999999999999999999888777655555566644 45677778888888 Q ss_pred CCccCCCccceEEEEeeeeeeeeeeccchHHHH-hchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccccc Q lcl|NC_011085. 80 DDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAM-NHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENIAGL 158 (343) Q Consensus 80 ~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q-~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~g~ 158 (343) +.+ +++.+++++...+. .....|.+ +-.+ +..|+.+.+.++.++++++++|+.++.- .....+ .+. 
T Consensus 75 ~~~--~~~f~~v~l~~~k~-~~~~~is~-ell~ds~~~l~~~i~~~la~ai~~~~d~a~l~G----~g~~~~-----~gi 141 (297) T protein:vir:95 75 KTD--KPEVVPVTLKAHKL-GIILVTSR-EALNYTWKKFFEDMKPQIVEAFYKKIDEAGLLG----HDTPFA-----NSV 141 (297) T ss_pred ccc--ccceeEEEEeeEEE-EEeehhhH-HHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhcc----cCCccc-----ccc Confidence 654 36677777766544 33345554 2233 5689999999999999999999998731 111111 111 Q ss_pred CCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhccccccchhcceeE Q lcl|NC_011085. 159 GSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAALIDPERGSIR 238 (343) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~V~ 238 (343) ... . ......... ...++.|.++..+|..++.+.. .++++|..|..|.+-. +..|. .+-++..+ T Consensus 142 ~~~--~--~~~~~~~~~----~~t~~~i~~~~~~l~~~~~~~~--~~v~~~~~~~~L~~l~-----d~~G~-~i~~~~~~ 205 (297) T protein:vir:95 142 AKA--A--KDANKVIGG----PINYDNILKLQDALYDADVEPN--AFVSKIQNRSALREAR-----DGNKV-SIYDKAAN 205 (297) T ss_pred ccc--c--cccceeccc----ccCHHHHHHHHHHhhhccCCcC--EEEEcHHHHHHHHHhh-----ccCCc-eeecCCCC Confidence 000 0 000000000 1124556666777777776433 5788999999987522 11221 12244557 Q ss_pred EEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEeeeeccc----- Q lcl|NC_011085. 239 NVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERARRAE----- 313 (343) Q Consensus 239 ~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~~~~----- 313 (343) +++|++|+.+++.+..... .++...+.+..+...+++++..++.. T Consensus 206 ~l~G~Pv~~~~~~~~~~~~------------------------------~~~gd~s~~~~~~~~~~~i~~~~~~~~~~~~ 255 (297) T protein:vir:95 206 TIDGITTVDLKSARFEKGD------------------------------LLAGDFDNLIYGVPYNITYKISEEGQISTIT 255 (297) T ss_pred cccceeeEeecCCCCCCce------------------------------EEEEecccEEEEEecCeEEEEeecccccccc Confidence 8999999987765421100 12222222223344445555554431 Q ss_pred ---------hhh--hhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 
314 ---------YQA--DQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 314 ---------~~~--d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) ++. -.++...++|.++++|++.+.|+..-. T Consensus 256 ~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~at~ 296 (297) T protein:vir:95 256 NADGTPINLFEQEMIAIRATMDIAVMITKTDAFAKLTPAER 296 (297) T ss_pred ccCccchhhhhcCcEEEEEEEEeccEeecccceEEEeecCC Confidence 222 345666789999999999999988888 No 97 >protein:vir:80684 Length: 315 # NCBI annotation: gp6 # Family: family:all:966 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285582;genbank:gi:148727088;genbank:GeneID:5247055 Probab=99.40 E-value=6.5e-14 Score=92.89 Aligned_cols=285 Identities=12% Similarity=0.056 Sum_probs=156.7 Q ss_pred CCCCCccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEecc-CcceeeeecCCCcC Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVL-GRTRAAYLQAGQSL 79 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~i-G~~t~~~~~~g~~i 79 (343) ||..++... . .+..+.+++++.+..++.|+++.+.++.... +..++||+. |.+.+.-+..|+.+ T Consensus 1 Ma~~~~~~g------------g--~~vP~~~~~~ii~~l~~~s~i~~l~~~i~~~-~~~~~ip~~~~~~~a~wv~Eg~~~ 65 (315) T protein:vir:80 1 MADDFLSAG------------K--LELPGSMIGAVRDRAIDSGVLAKLSPEQPTI-FGPVKGAVFSGVPRAKIVGEGEVK 65 (315) T ss_pred CCCCcCCcC------------c--eEcchHHHHHHHHHHHhhchhhhhcceeecC-CCceEEEEEeCCcceEEeeCCccc Confidence 664422111 1 1345899999999999999999888776654 456788885 56677777888877 Q ss_pred CCccCCCccceEEEEeeeeeeee-eeccchHHHH--hchh----hHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccc Q lcl|NC_011085. 80 DDKRKDIKHTEKTIVIDGLLTAD-VLIYDIEDAM--NHYD----VRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASN 152 (343) Q Consensus 80 ~~~~~~~~~~~~~l~iD~~~~~~-~~Idd~D~~q--~~~d----~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~ 152 (343) +.+ +++.+++++.. .+... ..|.+ |.. +..| +.+.+.++.+++|++++|+.++.- .... .. 
T Consensus 66 ~~s--~~~f~~v~l~~--~kl~~~~~iS~--ell~~s~~~~~~~l~~~i~~~la~ai~~~~d~a~~~G----~~~~--~~ 133 (315) T protein:vir:80 66 PSA--SVDVSAFTAQP--IKVVTQQRVSD--EFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHG----IDPA--TG 133 (315) T ss_pred ccc--ccceeeeEeee--eeEEeeehhhH--HHhhcCchhHHHHHHHHHHHHHHHHHHHHHhhheeec----cCCC--CC Confidence 654 35666666654 33333 33432 222 2233 678889999999999999888631 1100 00 Q ss_pred ccccccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhccccc--- Q lcl|NC_011085. 153 ENIAGLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAAL--- 229 (343) Q Consensus 153 ~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~--- 229 (343) ..+.+.... +.... ... ......++.|.++...+..++.-... ..+++|..+..|.+-......+..+. T Consensus 134 ~~~~~~~~~--~~~~~-~~~----~~~~~~~~d~~~~~~~~~~~~~~~~~-~~imn~~~~~~L~~l~~~~g~~~~g~~~~ 205 (315) T protein:vir:80 134 KAASAVHTS--LNKTK-NIV----DATDSATADLVKAVGLIAGAGLQVPN-GVALDPAFSFALSTEVYPKGSPLAGQPMY 205 (315) T ss_pred ccccccccc--ccccc-cee----eccccchHHHHHHHHHHhhccCccce-EEEEcHHHHHHHHHHhhccCCcccccccc Confidence 011111110 00000 000 00111234445555555555443223 46789999999875433222222222 Q ss_pred cchhcceeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEeee Q lcl|NC_011085. 230 IDPERGSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERA 309 (343) Q Consensus 230 ~~~~~G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~ 309 (343) ..+..|..++++|.+|+.++++|.......... ..-+-+||++.. ++. .+.+++|.. T Consensus 206 ~~~~~g~~~tl~G~PV~~~~~~~~~~~~~~~~~-------------~~~~~GDfs~~~--------~g~--~~~~~i~i~ 262 (315) T protein:vir:80 206 PAAGFAGLDNWRGLNVGASSTVSGAPEMSPASG-------------VKAIVGDFSRVH--------WGF--QRNFPIELI 262 (315) T ss_pred cccccCCCceecceeeEecCcCCcccccccccc-------------cEEEEeecccEE--------EEE--ecCeeEEEe Confidence 134456668999999999999985432211100 000123443322 222 223344443 Q ss_pred eccc--------hhh--hhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 
310 RRAE--------YQA--DQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 310 ~~~~--------~~~--d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) ++.. ++. -.+++..++|.++++|++.+.|+...= T Consensus 263 ~~~~~~~~~~~~~~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~a 306 (315) T protein:vir:80 263 EYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAA 306 (315) T ss_pred ccccccCcccchhhcCcEEEEEEEEecceeecccceEEEeeccC Confidence 3211 222 245667889999999999999985542 No 98 >protein:vir:99920 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655524;genbank:gi:109392294;genbank:GeneID:4157089 Probab=99.39 E-value=1.4e-13 Score=91.11 Aligned_cols=296 Identities=12% Similarity=0.050 Sum_probs=156.6 Q ss_pred CCCCCccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEecc-CcceeeeecCCCcC Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVL-GRTRAAYLQAGQSL 79 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~i-G~~t~~~~~~g~~i 79 (343) ||.+++.... +..+.++.++.+..+..|+++.+.++..+.+ ..++||++ +.+++.-+..|+.+ T Consensus 1 Mat~tt~~g~---------------~vP~~~~~~ii~~~~~~s~l~~~~~~i~~~~-~~~~~p~~~~~~~a~wv~Eg~~~ 64 (311) T protein:vir:99 1 MATFGTGNLK---------------NLPRNIADGMVKDVVQGSTVAVLSARKPQRF-GNEDIITFNGRPKAEFVGEGQQK 64 (311) T ss_pred CceecCCCce---------------eccHHHHHHHHHHHHhhchhhhhcceeeccC-CceEEEEEeCCceeEEeecCccc Confidence 8866433321 3447888999999999999998888766654 44688886 67777777788888 Q ss_pred CCccCCCccceEEEEeeeeeeeeeeccchHHH-----HhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccc Q lcl|NC_011085. 80 DDKRKDIKHTEKTIVIDGLLTADVLIYDIEDA-----MNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNEN 154 (343) Q Consensus 80 ~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~-----q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~ 154 (343) +.+ +++..++++..-+. ..-+.|.+ |. ++..|+.+.+.++.+++|++++|+.+|.-.. +.. ... 
T Consensus 65 ~~~--~~~f~~v~l~~~k~-~~~~~iS~--ell~~~~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~g--~~~----g~~ 133 (311) T protein:vir:99 65 SST--TGEFDFVTSTPKKA-QVTMRFNE--EVQWADEDYQLGVLQTLSEAGAEALARALDLGLYHRIN--PLT----GTV 133 (311) T ss_pred ccc--cceeeEEEEeeEEE-EEeehhhH--HHhhcccccHHHHHHHHHHHHHHHHHHHHHHHhhcccC--ccc----Ccc Confidence 753 35566666655332 22234443 32 2457899999999999999999998874211 000 000 Q ss_pred ccccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhccccccchhc Q lcl|NC_011085. 155 IAGLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAALIDPER 234 (343) Q Consensus 155 ~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~ 234 (343) +.+.. ..+.... ...+........+.+.+..+...+.........-..+++|..+..|.+-..- +..|.-...... T Consensus 134 ~~g~~--~~~~~~~-~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~-~G~~l~~~~~~~ 209 (311) T protein:vir:99 134 IPGWS--NYLGAAS-KRVELTADTIANPDLAIEAAVGLLVANGHPTPVNGLALHPSIAWGLSTARYT-DGRKKFPELGLG 209 (311) T ss_pred ccccc--ccccccc-ceeeccccccchhHHHHHHHHHHHhhhccCCCccEEEEcHHHHHHHHhhhcc-CCCeeecCcccC Confidence 11100 0000000 0000000000111122223333333332211111278899999998653221 122322233445 Q ss_pred ceeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEeeeec--c Q lcl|NC_011085. 235 GSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERARR--A 312 (343) Q Consensus 235 G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~~--~ 312 (343) +..++++|++|+.|+++|......... ..........-+-++|... +.....++++++..+. 
+ T Consensus 210 ~~~~~l~G~Pv~~s~~i~~~~~~~~~~------~~~~~~~~~~~~~Gdf~~~---------~~~~~~~~~~~~~~~~~~~ 274 (311) T protein:vir:99 210 IGVSSFEGIDASVSDTVNGGDEADPDD------EDLDAARAVRGIVGDFANG---------IHWGVQRDIPVELIKYGDP 274 (311) T ss_pred CCCceecceeeEeeccccccccccccc------chhhccCcceEEEeecccc---------EEEEEecCceEEEeecCCC Confidence 667899999999999998543221110 0000000000111233222 2222233334443322 1 Q ss_pred c-----hhhhh--hhhhhhhccceecccceEEEEecC Q lcl|NC_011085. 313 E-----YQADQ--IIARYAMGHGGLRPEAAGALVFTA 342 (343) Q Consensus 313 ~-----~~~d~--i~~~~~~G~~v~rpe~~~~i~~~~ 342 (343) + +..|. +++..++|..+++|++++.....+ T Consensus 275 ~~~~~~~~~d~~~~r~~~r~d~~v~~~~~v~~~~~~A 311 (311) T protein:vir:99 275 DGQGDLKRHNQIALRLEIVYGWYVFTDRFVVIENAVA 311 (311) T ss_pred CcchhhhhcCcEEEEEEEeecceecChhHeeeecccC Confidence 2 23333 477788999999998887777777 No 99 >protein:vir:10364 Length: 390 # NCBI annotation: head protein; major capsid subunit precursor # Family: family:all:585 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858956;genbank:gi:32128421;genbank:GeneID:2648357 Probab=99.39 E-value=8.6e-14 Score=92.23 Aligned_cols=279 Identities=16% Similarity=0.104 Sum_probs=161.1 Q ss_pred CCCCCccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEeccC--cceeeeecCCCc Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVLG--RTRAAYLQAGQS 78 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG--~~t~~~~~~g~~ 78 (343) ++..... ....+++.-.+.+..+...+.+.....+.++++++..++.+ .++.+|+.. ..++.....|+. 
T Consensus 107 ~~~~~~~--------~~~~~~~~g~~~~~~~~~~ii~~~~~~~~l~~~~~~~~~~~-~~~~~~~~~~~~~~a~~v~Eg~~ 177 (390) T protein:vir:10 107 KAALNTA--------STDAAGSAGALTTPNRLPGFITQPDARLTVRDLIGSGRTDS-ALIEYVQETGFVNNAAIVAEGAL 177 (390) T ss_pred HHHHHhh--------hcccccccccccchhHHHHHHHHHHhhchhhhhcceeeccC-CceEEEEEecCCcceeeecCCcc Confidence 1111111 11111111234566677777778888888888888777644 467777753 345666677777 Q ss_pred CCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccccc Q lcl|NC_011085. 79 LDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENIAGL 158 (343) Q Consensus 79 i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~g~ 158 (343) ++.. +++.+++++.+.+.. .-+.|.+ +-.+...++.+.+.++.++++++..|+.++.- ... +..+.|. T Consensus 178 ~~~~--~~~~~~i~~~~~k~~-~~~~is~-ell~d~~~l~~~i~~~l~~~~~~~~~~~il~G----~G~----~~~p~Gi 245 (390) T protein:vir:10 178 KPES--SLKFAKKTDTTHVIA-HTMKATR-QILSDAPQLASYMNNRLIRGLKVKEDAEILRG----TGA----NDGLLGL 245 (390) T ss_pred cccc--ccceeEEEEeeEEEE-EeehhhH-HHHHhHHHHHHHHHHHHHHHHHHHHHHHHhhc----CCC----Ccccccc Confidence 6653 456777777776553 3344554 23344568889999999999999999988731 110 1112221 Q ss_pred CCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhccccccchhcceeE Q lcl|NC_011085. 159 GSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAALIDPERGSIR 238 (343) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~V~ 238 (343) .......... . ...+...++.+..+...|...+.+.. .+|++|..|..|.+-..- +..|.-... 
..+..+ T Consensus 246 ~~~~~~~~~~--~----~~~~~~~~~~~~~~~~~l~~~~~~~~--~~v~n~~~~~~L~~lkd~-~g~~l~~~~-~~~~~~ 315 (390) T protein:vir:10 246 IPQATTYAAP--T----TIAGATRVDQLRLAMLQASLAEYPAS--GIVINPIDWAAIELAKDA-NNQYLIGNA-RGTLTP 315 (390) T ss_pred cccccccccc--c----cccccchHHHHHHHHHhhccccCCCC--EEEEcHHHHHHHHHhhcC-CCceeecCC-cCcCCc Confidence 1111000000 0 00112234566667777887777543 567999999988753321 222321111 234456 Q ss_pred EEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEeeeecc-chhhh Q lcl|NC_011085. 239 NVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERARRA-EYQAD 317 (343) Q Consensus 239 ~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~~~-~~~~d 317 (343) +++|.+|+.++.+|.+.. +-++|+..+ ..+....++++..+.. .+..+ T Consensus 316 ~l~G~pv~~~~~~p~~~~----------------------~~gdf~~~~---------~~~~~~~~~i~~~~~~~~~~~~ 364 (390) T protein:vir:10 316 TLWGLPVVATQAMAPGEF----------------------LVGAFDLAA---------QIFDQWDARVEIGYVNDDFQRN 364 (390) T ss_pred eecceeeEEcCCCCCCcE----------------------EEEeccceE---------EEEEecceEEEEeecccccccC Confidence 899999999999984210 112333222 2223344566666543 33344 Q ss_pred --hhhhhhhhccceecccceEEEEec Q lcl|NC_011085. 318 --QIIARYAMGHGGLRPEAAGALVFT 341 (343) Q Consensus 318 --~i~~~~~~G~~v~rpe~~~~i~~~ 341 (343) .+++.++++.++++|++.+.+.+. T Consensus 365 ~~~~r~~~r~d~~v~~~~a~~~~~~a 390 (390) T protein:vir:10 365 MVTVLAEERLALVVYRPEALISGSFA 390 (390) T ss_pred cEEEEEEEeeccEEeccccEEEEEeC Confidence 455668899999999999999999 No 100 >protein:vir:2344 Length: 397 # NCBI annotation: gp14 # Family: family:all:507 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075281;genbank:gi:12657868;genbank:GeneID:920118 Probab=99.38 E-value=9.5e-14 Score=91.99 Aligned_cols=284 Identities=14% Similarity=0.071 Sum_probs=157.5 Q ss_pred CCCCCccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEecc-CcceeeeecCCCcC Q lcl|NC_011085. 
1 MADMKGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVL-GRTRAAYLQAGQSL 79 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~i-G~~t~~~~~~g~~i 79 (343) |+-- .. .+.......++.-++..+.+..++.+..++.+.++.+.+..++. +.+++||+. +.+.+.-+..|..+ T Consensus 1 ~g~~---~e--~~~~~~~~t~~~~g~l~~~~~~~ii~~l~~~s~i~~l~~~~~~~-~~~~~ip~~~~~~~a~wv~Eg~~~ 74 (397) T protein:vir:23 1 MGFS---AD--HSQIAQTKDTMFTGYLDPVQAKDYFAEAEKTSIVQRVAQKIPMG-ATGIVIPHWTGDVSAQWIGEGDMK 74 (397) T ss_pred CCcC---HH--HHHHhhccCCCCccccchhHHHHHHHHHHhccchhhhcceeecc-CCceEEEEEcCCcceEEecCCccc Confidence 3211 11 12111111111112455666778888888889989888877764 456788876 45556666677777 Q ss_pred CCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccccC Q lcl|NC_011085. 80 DDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENIAGLG 159 (343) Q Consensus 80 ~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~g~~ 159 (343) +.+ +++..++++.+-+. ..-+.|.+-=-.++.+|+.+.+.++.+++|++++|+.+|.-- ....+ +.+. T Consensus 75 ~~s--~~~f~~v~l~~~k~-~~~v~iS~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l~G~----gt~~~----~~~~- 142 (397) T protein:vir:23 75 PIT--KGNMTKRDVHPAKI-ATIFVASAETVRANPANYLGTMRTKVATAIAMAFDNAALHGT----NAPSA----FQGY- 142 (397) T ss_pred ccc--ccceeEEEEeeEEE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhcc----cCCcc----cccc- Confidence 653 45667777766443 233445442122356899999999999999999999887321 11000 0111 Q ss_pred CceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhccc-----cccchhc Q lcl|NC_011085. 160 SASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYA-----ALIDPER 234 (343) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~-----~~~~~~~ 234 (343) .............. ..+.+.++...|.....+ .-..+++|..|..|.+-..- +..|. ....... 
T Consensus 143 ----~~~~~~~~~~~~~~----~~~~~~~~~~~l~~~~~~--~a~~vmn~~~~~~L~~lkd~-~G~~i~~~~~~~~~~~~ 211 (397) T protein:vir:23 143 ----LDQSNKTQSISPNA----YQGLGVSGLTKLVTDGKK--WTHTLLDDTVEPVLNGSVDA-NGRPLFVESTYESLTTP 211 (397) T ss_pred ----cccccceeeecccc----hhHHHHHHHHhhhhcccC--CCEEEEcHHHHHHHHHhhcc-CCceeeccccccccccc Confidence 11111111111111 123344455556666543 34579999999998864322 12221 1112222 Q ss_pred ceeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEeeeeccc- Q lcl|NC_011085. 235 GSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERARRAE- 313 (343) Q Consensus 235 G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~~~~- 313 (343) +..++++|++|+.++++|..... -+-++|++.+ .+..+.+.+|..++.. T Consensus 212 ~~~~tl~G~Pv~~s~~~~~g~~~--------------------~~~gDfs~~~----------i~~~~~i~i~~~~e~~~ 261 (397) T protein:vir:23 212 FREGRILGRPTILSDHVAEGDVV--------------------GYAGDFSQII----------WGQVGGLSFDVTDQATL 261 (397) T ss_pred ccCceeeeeeEEEeCCCCCCceE--------------------EEEeecceEE----------EEEEeceEEEEeeeeee Confidence 34468999999999999843210 0112333322 1222333444333321 Q ss_pred -------------hhhh--hhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 314 -------------YQAD--QIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 314 -------------~~~d--~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) +..| .++..++++.++++|++.+.+..... 
T Consensus 262 ~~~~~~~~~~~~lf~~d~v~~ra~~r~d~~v~~~~a~~~~~~~~~ 306 (397) T protein:vir:23 262 NLGSQESPNFVSLWQHNLVAVRVEAEYGLLINDVNAFVKLTFDPV 306 (397) T ss_pred eeccccccceeeeeeccceeEEEEeeeccceecccceEEEeeccc Confidence 2223 45677889999999999999998776 No 101 >protein:vir:4830 Length: 397 # NCBI annotation: MPL-7201 # Family: family:all:21 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038327;genbank:gi:9634653;genbank:GeneID:1262632 Probab=99.38 E-value=1.1e-13 Score=91.53 Aligned_cols=283 Identities=14% Similarity=0.058 Sum_probs=160.7 Q ss_pred CCCC-CccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCcccccccc--ceEEEEecc-CcceeeeecCC Q lcl|NC_011085. 1 MADM-KGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISS--GKSAQFPVL-GRTRAAYLQAG 76 (343) Q Consensus 1 ~~~~-~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~--G~tv~i~~i-G~~t~~~~~~g 76 (343) +..+ .+.............+++.-.+..+.|..++.+..+..+.+++++++.++.+ |+....+.. +...+.....| T Consensus 94 ~~~~~~~~~~~~~~~~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~ 173 (397) T protein:vir:48 94 FKNLVRGRYQNLLDSKTDASGSDAGLTIPQDIQTAIHTLVRQYDSLQEYVNVENVTTLTGSRVYEKWADITGLAKLDDEA 173 (397) T ss_pred HHHHHhhhhhHHHHHhhccCCccccccccHHHHHHHHHHHHHHHHHHhhhceeeccCCcceEEEEeecCCCcceeeeccc Confidence 1000 0000000000001111111124568999999999999999999988877653 333333332 22334555566 Q ss_pred CcCCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccc Q lcl|NC_011085. 77 QSLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENIA 156 (343) Q Consensus 77 ~~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~ 156 (343) +.++.+ ..++..++++.+.+.- ....|.+-=-.++.+|+.+.+.++.++++++..|+.|+.-. 
T Consensus 174 ~~~~~~-~~~~~~~v~~~~~k~~-~~~~iS~ell~ds~~~l~~~v~~~l~~~~~~~~d~~il~G~--------------- 236 (397) T protein:vir:48 174 GSIGTN-DDPKLYPIRYAIKRYA-GISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILEAI--------------- 236 (397) T ss_pred cccccc-cccceeeEEeeheeee-eehhhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhcc--------------- Confidence 666432 2345667777765542 33455542222357899999999999999999999887321 Q ss_pred ccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhccccccchhcce Q lcl|NC_011085. 157 GLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAALIDPERGS 236 (343) Q Consensus 157 g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~ 236 (343) +.+... + .. .. ++.|.++...|.....+ +=.++++|..|..|.+-..- +..|.-...+..|. T Consensus 237 --g~~~~~--~---~~----~~----~d~i~~~~~~l~~~~~~--~a~~v~n~~~~~~L~~lkd~-~G~~i~~~~~~~~~ 298 (397) T protein:vir:48 237 --ATLPTK--P---TL----TK----WDDIIDLQAKVDPAIKQ--TSFFLTNTSGFTALKKVKNA-FGDYLMERDVKSPT 298 (397) T ss_pred --cccccc--c---cc----cc----HHHHHHHHHHhhhhhcC--CCEEEECHHHHHHHHHhhcC-CCceeeccCcCCCC Confidence 111110 0 01 11 34455556667766654 33667899999998763222 22333334566777 Q ss_pred eEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechh-hheeeeeeeeEEeeeeccc-h Q lcl|NC_011085. 237 IRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRS-AVGTVKLKDLSLERARRAE-Y 314 (343) Q Consensus 237 V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-Av~~~~~~~~~~e~~~~~~-~ 314 (343) -++++|++|+.+.+.+...... ...+.++...+ ++..+....++++..+... + T Consensus 299 ~~~l~G~PV~~~~~~~~~~~~~-------------------------~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~ 353 (397) T protein:vir:48 299 GYSIDGFAVKEVADRWLANASS-------------------------GAMPLYFGDLKQAVTLFDRQQMSLLSTNIGGGA 353 (397) T ss_pred CceeccceeEEecccccCCcCC-------------------------CceEEEEEeccceEEEEeecceEEEEeccchhh Confidence 7899999999876533211110 00111233322 4444445556666655432 2 Q ss_pred h---hhhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 
315 Q---ADQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 315 ~---~d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) | ...+++.++++.++++|++.+.+++++. T Consensus 354 ~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~ 385 (397) T protein:vir:48 354 FETDTTKIRVIDRFDVVATDTESFVPASFKAI 385 (397) T ss_pred hhcCceeEEEEeeeccEEecccceEEEEeccc Confidence 2 2467788889999999999999998887 No 102 >protein:vir:81070 Length: 390 # NCBI annotation: p09 # Family: family:all:585 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285679;genbank:gi:148727187;genbank:GeneID:5247115 Probab=99.38 E-value=8.2e-14 Score=92.33 Aligned_cols=287 Identities=15% Similarity=0.080 Sum_probs=164.0 Q ss_pred CCCCCccc----cccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEeccC--cceeeeec Q lcl|NC_011085. 1 MADMKGGQ----QLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVLG--RTRAAYLQ 74 (343) Q Consensus 1 ~~~~~~~~----~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG--~~t~~~~~ 74 (343) +.....-. ....+......+++.-.+..+.|...+.+.....+.+++++++.++. +.++++++.. ..++.-+. T Consensus 95 ~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ii~~~~~~~~l~~~~~~~~~~-~~~~~~~~~~~~~~~a~~v~ 173 (390) T protein:vir:81 95 WNDRSARATMNIKAALNTASTDAAGSAGALTTPNRLPGFITPPDARLTVRDLIGSGRTD-SALIEYVQETGFVNNAAIVA 173 (390) T ss_pred HhhhhhhhhhHHHHHHHhhccccccCCcceechhhhHHHHHHHhhhhhhhhhcceeecc-CCceEEEEEecCCcceeeec Confidence 00000000 00000001111222223566778888888999899999988877654 4567777753 34566677 Q ss_pred CCCcCCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccc Q lcl|NC_011085. 75 AGQSLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNEN 154 (343) Q Consensus 75 ~g~~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~ 154 (343) .|+.++.. +++.+++++.+.+.- .-..|.+ +-.+.+.++.+.+.++.+.++++.+|+.++.- ... +.. 
T Consensus 174 Eg~~~~~~--~~~~~~i~~~~~k~~-~~~~is~-ell~d~~~~~~~i~~~l~~~~~~~~d~a~l~G----~g~----~~~ 241 (390) T protein:vir:81 174 EGALKPES--SLKFAKKTDTTHVIA-HTMKATR-QILSDAPQLASYMNNRLIRGLKVKEDAEILRG----TGA----NDG 241 (390) T ss_pred CCcccccc--cceeeEEEEeeeEEE-EeehhhH-HHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhc----CCC----CCc Confidence 78877653 356677777776553 2344544 33344567888999999999999999988732 100 111 Q ss_pred ccccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhccccccchhc Q lcl|NC_011085. 155 IAGLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAALIDPER 234 (343) Q Consensus 155 ~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~ 234 (343) +.|.-. .+.... ..........++.|..+...+...+.+.. .+|++|..|..|.+-..- +..|.-. .... T Consensus 242 ~~Gi~~-----~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~v~~~~~~~~l~~lkd~-~G~~l~~-~~~~ 311 (390) T protein:vir:81 242 LLGLIP-----QATTYA-APTTIAGATRVDQLRLAMLQASLAEYNPS--GIVINPIDWAAIELAKDA-NNQYLIG-NARG 311 (390) T ss_pred ccceee-----cccccc-cccccccchhHHHHHHHHHhhccccCCCC--EEEEcHHHHHHHHHhhcC-CCceeec-Cccc Confidence 222111 110000 00001112235566667777877776543 567899999988753321 1222211 1234 Q ss_pred ceeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEeeeeccch Q lcl|NC_011085. 235 GSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERARRAEY 314 (343) Q Consensus 235 G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~~~~~ 314 (343) |...+++|.+|+.++.+|.+.. +-++|+.. +..+....++++..+...+ T Consensus 312 ~~~~~l~G~pv~~~~~~p~~~~----------------------~~gd~~~~---------~~~~~~~~~~v~~~~~~~~ 360 (390) T protein:vir:81 312 TLTPTLWGLPVVATQAMAPGEF----------------------LVGAFDLA---------AQIFDQWDARVEIGYVGED 360 (390) T ss_pred ccCceecceeeEEcCCCCCCcE----------------------EEEehhce---------EEEEEecceEEEEecccch Confidence 4556899999999999984311 11233222 2223345567776665444 Q ss_pred h-hh--hhhhhhhhccceecccceEEEEec Q lcl|NC_011085. 
315 Q-AD--QIIARYAMGHGGLRPEAAGALVFT 341 (343) Q Consensus 315 ~-~d--~i~~~~~~G~~v~rpe~~~~i~~~ 341 (343) | .+ .++..++++.++++|++.+.+++. T Consensus 361 ~~~~~v~~r~~~r~d~~v~~~~a~v~~t~a 390 (390) T protein:vir:81 361 FQRNMITVLAEERLALVVYRPEALISGSFA 390 (390) T ss_pred hhcCcEEEEEEEeeccEEecccceEEEEeC Confidence 3 34 467888899999999999999999 No 103 >protein:vir:94673 Length: 419 # NCBI annotation: major capsid protein # Family: family:all:585 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579208;genbank:gi:93007444;genbank:GeneID:5076792 Probab=99.38 E-value=7.3e-14 Score=92.60 Aligned_cols=294 Identities=11% Similarity=0.105 Sum_probs=160.2 Q ss_pred CCCCCccccccccccccccccchhH-HHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEeccCcce---------e Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQSGGDKLA-LFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVLGRTR---------A 70 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~a-l~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG~~t---------~ 70 (343) +.......... +....+...+... +.-+.+.+.+.......+.++++++..+.. +.++++++....+ + T Consensus 110 ~~~~~~~~~~~-~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~i~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~a 187 (419) T protein:vir:94 110 MRDIDPNRLLS-RDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNAD-YNVLEYIRDTSGTAGAGSTWNKA 187 (419) T ss_pred HHHHHHHHhhc-cccccccccCCcccccchhhhHHHHHHHhhhhhhhhcceeeecc-CCceeeeeeccccccccccCccc Confidence 00000000000 0011111111111 233677888877777777778887766653 5667777643222 2 Q ss_pred eeecCCCcCCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccc Q lcl|NC_011085. 
71 AYLQAGQSLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAA 150 (343) Q Consensus 71 ~~~~~g~~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~ 150 (343) ..+..|+.++.+ +++..++++.+.+.- .-+.|.+ +-.+...++.+.+.++.++++++.+|+.||.- .++ T Consensus 188 ~~v~Eg~~~~~~--~~~~~~i~~~~~k~~-~~~~is~-ell~d~~~l~~~i~~~la~a~~~~~d~aii~G--~G~----- 256 (419) T protein:vir:94 188 AVVPEGTAKPQS--TLSFDTITTTLKTVA-HWLPITR-QAADDNSQLMGYIQGRLTYGLRFLRDRQLLNG--NGS----- 256 (419) T ss_pred ceecCCcccccc--ccceeeEEeeeeeEE-EeehhhH-HHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhc--cCc----- Confidence 333445555432 355666676665543 3345543 23334456888899999999999999998731 111 Q ss_pred ccccccccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhcccccc Q lcl|NC_011085. 151 SNENIAGLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAALI 230 (343) Q Consensus 151 ~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~ 230 (343) ..+.|.....-+.......... .......++.|.++...+...+.+. -.++++|..|..|+.-..-....|.-.. T Consensus 257 --~~p~Gi~~~~~~~~~~~~~~~~-~~t~~~~~~~l~~~~~~~~~~~~~~--~~~v~n~~~~~~l~~~k~~~~~~~~~~~ 331 (419) T protein:vir:94 257 --TEMQGILTTPGIGTYQQPKPTA-PATDEPPLVDIRRAKTVAEIAGFPP--DGVVVHPQDWESIELDQAPGSGVFRVIA 331 (419) T ss_pred --ccccceeccccccccccccccc-ccccchhHHHHHHHHHhhhhccCCC--CEEEEcHHHHHHHHHHhhcCCCceeecC Confidence 1111211100000000000000 1112234677777777787776643 3679999999998765433333333233 Q ss_pred chhcceeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEeeee Q lcl|NC_011085. 231 DPERGSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERAR 310 (343) Q Consensus 231 ~~~~G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~ 310 (343) ....|..++++|++|+.++.+|.+.. +-++|+... 
..+..+.++++..+ T Consensus 332 ~~~~~~~~~l~G~pV~~~~~~~~~~~----------------------~~gd~~~~~---------~~~~~~~~~v~~~~ 380 (419) T protein:vir:94 332 NVQGEATPRIWGLNVVSTVAIAQGTA----------------------LVGGFRQGA---------TLWSRQGITVLMTD 380 (419) T ss_pred CcccCCCccccceeeEEcCCCCCccE----------------------EEeeccceE---------EEEEecceEEEEec Confidence 45567778999999999999984310 112333222 12223345555544 Q ss_pred ccc-hh---hhhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 311 RAE-YQ---ADQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 311 ~~~-~~---~d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) ... +| ...++...+++.++++|++.+.+++++= T Consensus 381 ~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~~~~~aa 417 (419) T protein:vir:94 381 SHADFFTANTLVILAEFRANLAVYQPKAFVRVTFAAA 417 (419) T ss_pred cccchhhcCcEEEEEEEeeccEEeccccEEEEEeccC Confidence 332 22 2456788899999999999999999888 No 104 >protein:vir:485 Length: 407 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543092;swissprot:trembl:q8w627;genbank:gi:18249904;uniprot:Q8W627;genbank:GeneID:929693 Probab=99.37 E-value=1.6e-13 Score=90.75 Aligned_cols=296 Identities=13% Similarity=0.046 Sum_probs=157.4 Q ss_pred CCCCCcccccccc--------ccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEec-cCcceee Q lcl|NC_011085. 1 MADMKGGQQLGKD--------QGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPV-LGRTRAA 71 (343) Q Consensus 1 ~~~~~~~~~~~t~--------~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~-iG~~t~~ 71 (343) +.-+..+.....+ .+....+| .+..+.|..++.+..+..+.+++++++.++.++ +..+|+ .+.+++. 
T Consensus 87 ~~~l~~g~~~~~~~~e~~a~~~~t~~~gG---~~iP~~~~~~I~~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~a~ 162 (407) T protein:vir:48 87 IGFMRKGREDGLRELERKALQVGNDEDGG---YAIPEELDRTILTLLKDEVVMRQEATVITLGGS-DYKKLVNLGGTTSG 162 (407) T ss_pred HHHHhccchhhhhHHHHHhhhcccCCCCc---ccccHhHHHHHHHHHHhhhhhhhhceeeecCCC-ceEEEEecCCccee Confidence 1101111100000 00000111 145589999999999999999998887776555 455544 4556666 Q ss_pred eecCCCcCCCccCCCccceEEEEeeeeeeee-eeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccc Q lcl|NC_011085. 72 YLQAGQSLDDKRKDIKHTEKTIVIDGLLTAD-VLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAA 150 (343) Q Consensus 72 ~~~~g~~i~~~~~~~~~~~~~l~iD~~~~~~-~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~ 150 (343) -...|..++.+. ..+..++++.+- ++.. +.|.+-=-.++.+|+.+.+.++.++++++..|+.++.- .+++ T Consensus 163 ~v~E~~~~~~~~-~~~f~~i~~~~~--k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~~~a~l~G--~G~~---- 233 (407) T protein:vir:48 163 WVGETDARPETA-TSKLGLIEPFMG--EIYGNPQATQKMLDDAFFNVEDWINSELALEFAEQEEIAFTSG--DGSK---- 233 (407) T ss_pred eecccccccccc-cccceeEEeeee--eeEeehhhHHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhhcc--CCCC---- Confidence 666666655432 134455555554 4343 34443222235679999999999999999999987631 1110 Q ss_pred ccccccccCCceeecccc------cccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhh Q lcl|NC_011085. 151 SNENIAGLGSASILEVGA------KGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAA 224 (343) Q Consensus 151 ~~~~~~g~~~~~~~~~~~------~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~ 224 (343) .+.|........... .............-++.|.++...|.....+ +=..|++|..|..|.+-..- +. 
T Consensus 234 ---~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~i~~l~~~l~~~~~~--~a~~v~n~~~~~~L~~lkD~-~G 307 (407) T protein:vir:48 234 ---KPKGFLAYESTDEDDKTRAFGKLQHIASGAASGVTADAIIKLIYTLRKAHRS--GAKFMMNNSSLFAIRLLKDN-DG 307 (407) T ss_pred ---ccceeeecccccccccccccccccccccccccccChHHHHHHHHhhchhhhc--CCEEEEcHHHHHHHHHhhcc-CC Confidence 111111000000000 0000000000111245566666667666543 22457999999988643211 22 Q ss_pred ccccccchhcceeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeee Q lcl|NC_011085. 225 NYAALIDPERGSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDL 304 (343) Q Consensus 225 ~~~~~~~~~~G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~ 304 (343) .|.-...+..|..++++|.+|+.++++|..+.+.... +-++|+. ++..+....+ T Consensus 308 r~l~~~~~~~g~~~~l~G~PV~~~~~~p~~~~~~~~i-----------------~~Gd~~~---------~~~i~~~~~~ 361 (407) T protein:vir:48 308 NYLWRPGIELGQPSSLAGYGIVENEQMPDIAADAKAI-----------------AFGNFKR---------GYTIVDRIGT 361 (407) T ss_pred ceeeccCcCCCCCceecceeeEEecCcCCccCCccEE-----------------EEEeccc---------cEEEEEeece Confidence 2322234567778899999999999999533211100 0022222 1222222223 Q ss_pred EEeeeeccc--hhhhhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 
305 SLERARRAE--YQADQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 305 ~~e~~~~~~--~~~d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) +++ +++- +-...+++.+++|+++++|++.+.|+.++- T Consensus 362 ~i~--~d~~~~~~~~~~~~~~r~d~~v~~~~a~~~l~~~aa 400 (407) T protein:vir:48 362 RIL--RDPYTNKPFVGFYTTKRTGGMLVDSQAIKLMKIGAA 400 (407) T ss_pred EEE--eeccccCCcEEEEEEEEeccEEecccceEEEEeecc Confidence 333 3332 222347788899999999999999999888 No 105 >protein:vir:100135 Length: 418 # NCBI annotation: gp5 # Family: family:all:585 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945035;genbank:gi:38707895;genbank:GeneID:2744182 Probab=99.37 E-value=1.1e-13 Score=91.62 Aligned_cols=287 Identities=14% Similarity=0.106 Sum_probs=158.4 Q ss_pred CCCCC--ccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEeccCc--ceeeeecCC Q lcl|NC_011085. 1 MADMK--GGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVLGR--TRAAYLQAG 76 (343) Q Consensus 1 ~~~~~--~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG~--~t~~~~~~g 76 (343) |.... ...+.....+.+...+ -.|..+.|+.++.+.....+.++++++..++. +.++.+|+... .++.....| T Consensus 121 ~~~~~~~~~~~~~~~~~~~~~~~--g~lvp~~~~~~ii~~~~~~~~l~~~~~~~~~~-~~~~~~~~~~~~~~~a~~v~E~ 197 (418) T protein:vir:10 121 RVRVDRKSIMNVPATVGSGVSGS--NSLVVADRQAGIIAPPQRKMTIRDLLMPGQTS-SSSIEYTVETGFTNNAAAVAEG 197 (418) T ss_pred hhhhHHHHHHHhhhhccCCCCCC--ccccchhHHHHHHHHHhhhhhHHhhcceeecc-CCceeEEEEecCCCceeeeccC Confidence 11000 0000000111111111 23567999999999999999999998877764 55677777533 455556667 Q ss_pred CcCCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccc Q lcl|NC_011085. 77 QSLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENIA 156 (343) Q Consensus 77 ~~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~ 156 (343) +.++.+ +++.+++++...+... -..|.+ +-.+.+.++.+.+.++.++++++.+|+.++.- .... ..+. 
T Consensus 198 ~~~~~~--~~~f~~v~~~~~k~~~-~~~is~-ell~ds~~l~~~i~~~l~~a~~~~~d~a~l~G----~g~~----~~p~ 265 (418) T protein:vir:10 198 AQKPTS--DLKFNLKNQPVRTIAH-LFKASR-QILDDAPALQSYIDGRARYGLQLTEEGQILKG----DGTG----ANIL 265 (418) T ss_pred cccccc--ccceeeEEEeeeeEEE-eehhhH-HHHHhHHHHHHHHHHHHHHHHHHHHHHHHhcc----CCCC----cccc Confidence 776543 3566667766655432 244543 33444568999999999999999999988731 1000 1112 Q ss_pred ccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhccccccchhcce Q lcl|NC_011085. 157 GLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAALIDPERGS 236 (343) Q Consensus 157 g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~ 236 (343) |.-.......... ..+. ...++.|+.+...+...+.+.. .+|++|..|..|.+-..- +..|... ...+|. T Consensus 266 Gi~~~~~~~~~~~--~~~~----~~~~~~i~~~~~~~~~~~~~~~--~~v~n~~~~~~L~~lkd~-~G~~i~~-~~~~~~ 335 (418) T protein:vir:10 266 GILPQASAFMPSI--TLAN----ATPIDKIRLALLQAVLAEFPAT--GIVLNPIDWASIELTKDS-QGRYIVG-NPVNGT 335 (418) T ss_pred ccccccccccccc--cccc----cccHHHHHHHHHhhccccCCCC--EEEEcHHHHHHHHHhhcC-CCceecc-ccccCC Confidence 2111000000000 0011 1124455555556665554322 477899999988653211 2223221 234666 Q ss_pred eEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEeeeeccc-hh Q lcl|NC_011085. 237 IRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERARRAE-YQ 315 (343) Q Consensus 237 V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~~~~-~~ 315 (343) .++++|++|+.|+++|.+.. +-++|+..+ ..+....++++..++.. .| T Consensus 336 ~~~l~G~pV~~~~~~p~~~~----------------------~~gd~s~~~---------~~~~~~~~~i~~~~~~~~~f 384 (418) T protein:vir:10 336 TPRLWNLPVVETQAMTANEF----------------------LVGAFSMAA---------QIFDRMEIEVLLSTENVDDF 384 (418) T ss_pred CceecceeeEEcCCCCCCcE----------------------EEeeccceE---------EEEEecceEEEEecccchhh Confidence 78999999999999984321 112232221 11223344555544332 12 Q ss_pred -h--hhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 
316 -A--DQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 316 -~--d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) . ..++..++++.++++|++.+.+.++.- T Consensus 385 ~~~~~~~r~~~~~d~~~~~~~a~~~~~~~~~ 415 (418) T protein:vir:10 385 EKNMVSIRAEERLALAVYRPESFVTGALVEQ 415 (418) T ss_pred hcCceEEEEEEeeccEEecccceEEEEeccC Confidence 2 355677789999999999988776654 No 106 >protein:vir:4856 Length: 293 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049396;genbank:gi:9632424;genbank:GeneID:1258532 Probab=99.37 E-value=2.2e-13 Score=90.01 Aligned_cols=273 Identities=15% Similarity=0.057 Sum_probs=163.0 Q ss_pred CCCCCccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCcccccccc-ceEEEEeccC--cceeeeecCCC Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISS-GKSAQFPVLG--RTRAAYLQAGQ 77 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~-G~tv~i~~iG--~~t~~~~~~g~ 77 (343) |.+..... |- .+| -.+..+.|..++.+..+..+.++++.+...+.. ..+..|+... ...+.....|+ T Consensus 1 ~l~~~~~~---t~-----~~g--g~liP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~a~~v~Eg~ 70 (293) T protein:vir:48 1 MLDSKTDH---SG-----SDA--GLTIPQDIRTAINTLVRQYDSLQEYVNVENVTTLTGSRVYEKWTDITGLANIDDEAG 70 (293) T ss_pred Cceeeccc---cc-----CcC--ceEechhHHHHHHHHHHhhhhhhhhceeeeccCCcceEEEEeecCCCcceeeecCCc Confidence 22221111 11 111 124568999999999999999999888776653 3456666543 34456666777 Q ss_pred cCCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccc Q lcl|NC_011085. 78 SLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENIAG 157 (343) Q Consensus 78 ~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~g 157 (343) .++.+. .++..++++...+.. 
..+.|.+-=-.++.+|+.+.+.++.++++++..|+.|+..+.+.+ T Consensus 71 ~~~~~~-~~~~~~i~l~~~k~~-~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~i~~g~~~~~------------ 136 (293) T protein:vir:48 71 KIADID-DPKLSLIKYTIKRYA-GISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILGVVDKLP------------ 136 (293) T ss_pred cccccc-ccceeEEEEeeeEEE-EeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHhHHhhcccccc------------ Confidence 765432 245667777665543 334565422234568999999999999999999998874321100 Q ss_pred cCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhccccccchhccee Q lcl|NC_011085. 158 LGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAALIDPERGSI 237 (343) Q Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~V 237 (343) ..+... . ++.|.++..+|..+..+ .-..+++|..|..|.+-..- +..+.-...+.+|.. T Consensus 137 ----------~~~~~~----~----~d~i~~~~~~l~~~~~~--~a~~vmn~~~~~~L~~lkd~-~g~~l~~~~~~~~~~ 195 (293) T protein:vir:48 137 ----------TKPTLT----K----WDDIIDLEAKVDPAIKQ--TSFFLTNTSGFTALKKVKNA-LGDYLMERDVKSPTG 195 (293) T ss_pred ----------cccccc----C----HHHHHHHHHhhhhhhcC--CCEEEEcHHHHHHHHHhhcc-CCceEeecCcCCCCC Confidence 001111 1 34455556666655443 33567899999988653322 223333345667778 Q ss_pred EEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeech-hhheeeeeeeeEEeeeecc-chh Q lcl|NC_011085. 238 RNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHR-SAVGTVKLKDLSLERARRA-EYQ 315 (343) Q Consensus 238 ~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~-~Av~~~~~~~~~~e~~~~~-~~~ 315 (343) ++++|.+|+.+.+.+....... ..+.++... +++..+....++++..+.. ++| T Consensus 196 ~~l~G~Pv~~~~~~~~~~~~~~-------------------------~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~ 250 (293) T protein:vir:48 196 YSIAGFAVKEISDRWLPNASSG-------------------------VMPLYFGDLKQAVTLFDRQQMSLLSTNIGGGAF 250 (293) T ss_pred ceecceeeEEecccccCCccCC-------------------------ceEEEEEeccceEEEEEecceEEEEecccchhh Confidence 8999999998766543221110 111133332 2444444555666665532 222 Q ss_pred ---hhhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 
316 ---ADQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 316 ---~d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) ...++..+++|.++++|++.+.++++.. T Consensus 251 ~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~~ 281 (293) T protein:vir:48 251 ETDTTKVRVIDRFDVVATDTEAFVPASFKAI 281 (293) T ss_pred hcCeEEEEEEEeeCcEEecccceEEEEeecc Confidence 2457788889999999999999998776 No 107 >protein:vir:101607 Length: 379 # NCBI annotation: major capsid protein precursor # Family: family:all:585 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112497;genbank:gi:53793597;uniprot:Q5ZGF6;genbank:GeneID:3101715 Probab=99.37 E-value=1.2e-13 Score=91.34 Aligned_cols=271 Identities=16% Similarity=0.028 Sum_probs=155.8 Q ss_pred CCCCCccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEeccC---cceeeeecCCC Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVLG---RTRAAYLQAGQ 77 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG---~~t~~~~~~g~ 77 (343) ...+..... ....++.-.+..+.|..++.+...+.+.++++.++.++. +.++.||+.- .........|+ T Consensus 100 ~~~~~~~~~-------~~~~~~~~~~ip~~~~~~ii~~~~~~~~i~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~v~Eg~ 171 (379) T protein:vir:10 100 SIQVKAVGD-------MTLPVNLTGAQPKDYNFDVVLNPSQMLNVSDIVGAVSIS-GGTYTFVRENGAGEGAIGAQVEGA 171 (379) T ss_pred hhhhhhhcc-------cccCCCCccccchhhhhHHHHhHHhhhhHHhhceeeecc-CCceEEEEeecCCCcccccccCCc Confidence 111111111 111122222456889999999988888899988877764 4567777642 23334455666 Q ss_pred cCCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccc Q lcl|NC_011085. 78 SLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENIAG 157 (343) Q Consensus 78 ~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~g 157 (343) .++.+ +++.+++++.+.++- .-+.|.+ +-.+...++.+.+..+.+++|++..|+.++.-+. 
T Consensus 172 ~~~~~--~~~f~~i~~~~~k~~-~~~~iS~-ell~D~~~l~~~i~~~la~~~~~~~~~~~~~g~~--------------- 232 (379) T protein:vir:10 172 TKGQK--DYDISMIDVNTDFIA-GFTRYSK-KMANNLPFLTSFIPNALRRDYAKAENAAFNAVLA--------------- 232 (379) T ss_pred ccccc--ccceeeeEeeeeeEE-eeehhhH-HHHhhHHHHHHHHHHHHHHHHHHHHHHHHhcccc--------------- Confidence 66543 356677777665543 2234443 2233344578888889999999999987753211 Q ss_pred cCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhccccc--cchhcc Q lcl|NC_011085. 158 LGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAAL--IDPERG 235 (343) Q Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~--~~~~~G 235 (343) .+......+.+..+ .++.|.++...+...+.+.. .+|++|..|..|.+-..- +..|... .....| T Consensus 233 --~~~~~~~~~~~~~~--------~~d~i~~~~~~~~~~~~~~~--~~vmn~~~~~~l~~lkd~-~G~~l~~~~~~~~~~ 299 (379) T protein:vir:10 233 --ANATASTEIITNKN--------KVEMLINEIAKQENLDFPVT--AIVLRPTDYYDILVTQKS-VGAGYGLPGVVTQDN 299 (379) T ss_pred --cccccccccccCcc--------cHHHHHHHHHhhhhccCCCC--EEEEcHHHHHHHHHhhcc-CCceeccCCccCCCC Confidence 00001111111111 13455566666666665432 467899999988653322 2333222 223456 Q ss_pred eeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEeeeeccc-h Q lcl|NC_011085. 236 SIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERARRAE-Y 314 (343) Q Consensus 236 ~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~~~~-~ 314 (343) ...+++|++|+.|+.+|.+.. +-++|+... ....+.++++..+++. + T Consensus 300 ~~~~l~G~pvv~s~~~~ag~~----------------------~~gdf~~~~----------~~~~~~~~i~~~~~~~~~ 347 (379) T protein:vir:10 300 GVLRINGIPLFRATWLAANKY----------------------YVGDWTRVT----------KVTTEGLSLEFSEVEGTN 347 (379) T ss_pred CcceecceeeEecCCCCCCce----------------------EEeecccEE----------EEEEeceEEEEeeccccc Confidence 667899999999999984210 113343322 1222344566655542 2 Q ss_pred hh---hhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 
315 QA---DQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 315 ~~---d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) |. ..+++..++|..+++|++.+.+.+++= T Consensus 348 f~~~~~~~r~~~R~~~~v~~p~a~v~~~~~~~ 379 (379) T protein:vir:10 348 FVKNNITARIEAQVALAVEQPAALIFGDFTAV 379 (379) T ss_pred ccCCcEEEEEEEEeccEEecCccEEEEEecCC Confidence 22 355667899999999999999999888 No 108 >protein:vir:100247 Length: 425 # NCBI annotation: gp76 # Family: family:all:21 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355412;genbank:gi:77864702;genbank:GeneID:3725969 Probab=99.35 E-value=1.9e-13 Score=90.32 Aligned_cols=294 Identities=14% Similarity=0.086 Sum_probs=155.2 Q ss_pred CCC-CCccc---cccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEec-cCcceeeeecC Q lcl|NC_011085. 1 MAD-MKGGQ---QLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPV-LGRTRAAYLQA 75 (343) Q Consensus 1 ~~~-~~~~~---~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~-iG~~t~~~~~~ 75 (343) +.+ +..+. ...+ +....+| .|..+.|..++.+..+..+.++.+.++.++.++. +++|+ .+.+++.-... T Consensus 117 f~~~l~~~e~~~al~~--~t~~~gG---~lvP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~-~~~~~~~~~~~a~wv~E 190 (425) T protein:vir:10 117 FKAHVKRGDVQAALNK--GEDSEGG---YLTPIEWDRTITNKLVLISPMRQLCRVQPVSKAG-FSKLFNMGGTTSGWVGE 190 (425) T ss_pred HHHHhhhhhhHHHhhc--CcCCCCc---eeccHhHHHHHHHHHHhhhhhhhhceeeeccCCc-eEEEEEcCCcceeeecc Confidence 000 00000 0000 0000111 1455899999999999999999998887765543 45544 45556655566 Q ss_pred CCcCCCccCCCccceEEEEeeeeeeee-eeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccc Q lcl|NC_011085. 76 GQSLDDKRKDIKHTEKTIVIDGLLTAD-VLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNEN 154 (343) Q Consensus 76 g~~i~~~~~~~~~~~~~l~iD~~~~~~-~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~ 154 (343) |+.++.+. .++..++++.. .++.. ..|.+-=-.++.+|+.+.+.++.++++++..|+.++.- .+.+ .+.+. 
T Consensus 191 ~~~~~~~~-~~~f~~v~~~~--~k~~~~i~iS~ell~ds~~~l~~~i~~~la~ai~~~~d~~~l~G--~G~~---~p~Gi 262 (425) T protein:vir:10 191 ASQRPQTN-AATFQPLSFAS--GEIYANPAATQQILDDAEIDLESWLATEVQTEFAKQEGKAFLAG--DGTN---KPNGL 262 (425) T ss_pred cccccccc-ccccceeeeeh--eeeEeehHhHHHHHhcchhHHHHHHHHHHHHHHHHHHHhhhhcc--cCCC---Cccee Confidence 66654321 12345555554 34333 33443222235689999999999999999999988631 0110 11111 Q ss_pred ccccCCceeec------ccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhcccc Q lcl|NC_011085. 155 IAGLGSASILE------VGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAA 228 (343) Q Consensus 155 ~~g~~~~~~~~------~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~ 228 (343) ......+.... ....+...+.. ..++.|.++...|+.... .+=..|++|..|..|.+-..- +..|.= T Consensus 263 l~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~d~l~~l~~~l~~~~~--~~a~~vmn~~~~~~L~~lkD~-~G~~l~ 335 (425) T protein:vir:10 263 LTYIAGGANAAKHPFGAIEVVNSGAAAD----ITSDGIIDLVYDLPSAFT--GNARFAMNRNTQRQVRKLKDG-QGNYLW 335 (425) T ss_pred eecccccccccccccccccccccccccc----ccHHHHHHHHhhhhhhhc--cCCEEEEchHHHHHHHHhhcC-CCceee Confidence 11000000000 00000001111 124445555555655443 233568999999988653221 122322 Q ss_pred ccchhcceeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEee Q lcl|NC_011085. 229 LIDPERGSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLER 308 (343) Q Consensus 229 ~~~~~~G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~ 308 (343) ...+.+|.-++++|.+|+.++++|....+..+. +-++|+.. +..+..+.++ . T Consensus 336 ~~~~~~g~~~~l~G~PV~~~~~~p~~~~~~~~i-----------------~~Gd~~~~---------~~i~~~~~~~--v 387 (425) T protein:vir:10 336 QPSYVAGQPATLAGYPVTEVPDMPDVAANSTPI-----------------LFGDFQQT---------YLIIDRIGVR--V 387 (425) T ss_pred ccCccCCCCceecceeeEEecCcCCccCCccEE-----------------EEEehhcc---------EEEEEecceE--E Confidence 234567777899999999999999543221110 01223222 1122222222 2 Q ss_pred eeccc--hhhhhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 
309 ARRAE--YQADQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 309 ~~~~~--~~~d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) .+++- +--..+++..+++.++++|++...|...+= T Consensus 388 ~~d~~~~~~~~~~~~~~r~d~~v~~~~A~~~l~~~as 424 (425) T protein:vir:10 388 LRDPYTAKPYVLFYTTKRVGGGLLNPEPMRAMKVAAS 424 (425) T ss_pred EecccccCCcEEEEEEEEeccEeecccceEEEEeecc Confidence 33332 222356778889999999999999999888 No 109 >protein:vir:4997 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049971;genbank:gi:9632943;genbank:GeneID:1262106 Probab=99.35 E-value=2.5e-13 Score=89.70 Aligned_cols=283 Identities=14% Similarity=0.071 Sum_probs=159.4 Q ss_pred CCCCCccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccce-EEEEeccCc--ceeeeecCCC Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGK-SAQFPVLGR--TRAAYLQAGQ 77 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~-tv~i~~iG~--~t~~~~~~g~ 77 (343) ..-+.++....-+.......++--.+..+.|..++.+..+..+.++.++++..+..+. ++.++.... ..+.....|. T Consensus 95 ~~~l~~~~~~~~~~~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~ 174 (397) T protein:vir:49 95 KNLVRGRYQNLLDSKTDGSGSDAGLTIPQDIRTAINTLVRQFDSLQEYVNVENVTTLTGSRVYEKWADITGLAKLDDEGG 174 (397) T ss_pred HHHhhcchhhHHHhhhccCCccCcceecHHHHHHHHHHHHhhhhHhhhcceeeccCCcceEEEEeeccCCcceeeecccc Confidence 0001111110000000111111112456899999999999999999988888776432 344554432 2344445566 Q ss_pred cCCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccc Q lcl|NC_011085. 78 SLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENIAG 157 (343) Q Consensus 78 ~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~g 157 (343) .++.+ ..++.+++++.+.+.- .-+.|.+-=-.++.+|+.+.+.++.+++|++..|+.|+.-. 
T Consensus 175 ~~~~~-~~~~~~~v~~~~~k~~-~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~ail~G~---------------- 236 (397) T protein:vir:49 175 QIGQN-DDPKLSLIRYAIKRYA-GISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILEAI---------------- 236 (397) T ss_pred ccccc-cccceeeeEeeeeeeE-eehhhHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcc---------------- Confidence 65432 1234566666665442 33445432223356899999999999999999999887321 Q ss_pred cCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhccccccchhccee Q lcl|NC_011085. 158 LGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAALIDPERGSI 237 (343) Q Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~V 237 (343) +.++.. +... . ++.|.++...|+....+. -..+++|..|..|.+-..- +..|.-...+..|.- T Consensus 237 -g~~~~~-----~~~~----~----~d~i~~~~~~l~~~~~~~--a~~v~n~~~~~~l~~lkd~-~g~~l~~~~~~~g~~ 299 (397) T protein:vir:49 237 -GTLPNK-----PTLA----K----WDDIIDLQAKVDPAIKQT--SLFLTNTSGFTALKKVKNA-MGDYLMERDVKSPTG 299 (397) T ss_pred -cccccc-----cccc----C----HHHHHHHHHhhhhhhcCC--CEEEEcHHHHHHHHHhhcc-CCceeecccccCCCC Confidence 111100 0111 1 344556666777766543 4788999999988653211 222322234566777 Q ss_pred EEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeec-hhhheeeeeeeeEEeeeeccc--- Q lcl|NC_011085. 238 RNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQH-RSAVGTVKLKDLSLERARRAE--- 313 (343) Q Consensus 238 ~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~-~~Av~~~~~~~~~~e~~~~~~--- 313 (343) ++++|++|+.+.+.+....+.. ....++.. ++++..+....++++..+... T Consensus 300 ~~l~G~pV~~~~~~~~~~~~~~-------------------------~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~ 354 (397) T protein:vir:49 300 YSIDGFVVKEISDRFLPNGTGG-------------------------AMPLYFGDLKQAVTLFDRQHLSLLSTNIGGGAF 354 (397) T ss_pred ceecceeeEEecccccccccCC-------------------------ceeEEEeeccceEEEEeecccEEEEeccccchh Confidence 8999999998665432211100 01112222 224444555556666554321 Q ss_pred -hhhhhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 
314 -YQADQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 314 -~~~d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) +-...+++..++|.++++|++.+.+++++. T Consensus 355 ~~~~~~~~~~~r~d~~~~~~~a~~~~~~~~~ 385 (397) T protein:vir:49 355 ETDTTKVRVIDRFDVVSTDTEAFVPASFKAI 385 (397) T ss_pred hcCeeeEEEEEeeccEEecccceEEEEeccc Confidence 222457888899999999999999999988 No 110 >protein:vir:4456 Length: 401 # NCBI annotation: Major capsid protein precursor # Family: family:all:21 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700379;genbank:gi:23505451;genbank:GeneID:955658 Probab=99.35 E-value=3.7e-13 Score=88.73 Aligned_cols=300 Identities=12% Similarity=0.074 Sum_probs=153.2 Q ss_pred CCCCCcccccc-----cc---ccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEec-cCcceee Q lcl|NC_011085. 1 MADMKGGQQLG-----KD---QGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPV-LGRTRAA 71 (343) Q Consensus 1 ~~~~~~~~~~~-----t~---~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~-iG~~t~~ 71 (343) +..+..+.... .+ .+....+| -+..+.|..++.+..+..+.++.+.+..++.++ ...+++ .+.+.+. T Consensus 88 ~~~lr~~~~~~~~~~e~~a~~~~~~~~GG---~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~a~ 163 (401) T protein:vir:44 88 VGFLRKGREDGLRDLERKALQVGTDEDGG---YAVPEELDRSILSLLKDEVVMRQEATVITVGGS-DYKKLVNLGGTASG 163 (401) T ss_pred HHHHhhhhhhhhHHHHHHHhhcCCCCCCc---eeccHhHHHHHHHHHHhhhhhhhhceeeecCCC-ceEEEEecCCccce Confidence 00000000000 00 00000111 134589999999999999999998887776544 444554 4545454 Q ss_pred eecCCCcCCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccc Q lcl|NC_011085. 72 YLQAGQSLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAAS 151 (343) Q Consensus 72 ~~~~g~~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~ 151 (343) -...|...+.+. .++.+++++.+-+.. .-+.|.+---.++.+|+.+.+.++.++++++..|+.++.- .++. 
.+ T Consensus 164 wv~E~~~~~~~~-~~~~~~v~~~~~k~~-~~~~iS~ell~ds~~~l~~~i~~~la~ai~~~~~~~~l~G--~G~~---~p 236 (401) T protein:vir:44 164 WVGETDTRSQTA-TSRLGLIEPFMGEIY-GNPQATQKMLDDAFFNVEAWINSELATEFAEQEEIAFTTG--DGTK---KP 236 (401) T ss_pred eeccccccCccc-cccceeeeeehhhee-eehhhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhhcc--CCCC---cc Confidence 444555544321 234555555554332 2233443222235678999999999999999999988731 1110 01 Q ss_pred cccccccCCcee---ecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhcccc Q lcl|NC_011085. 152 NENIAGLGSASI---LEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAA 228 (343) Q Consensus 152 ~~~~~g~~~~~~---~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~ 228 (343) .+.......... ...+.....++ .......++.|.++...|..... .+=..+++|..|..|.+-..- +..+.- T Consensus 237 ~Gil~~~~~~~~~~~~~~~~~~~~~t-~~~~~~~~d~i~~~~~~l~~~~~--~~a~~v~n~~~~~~L~~lkd~-~G~~l~ 312 (401) T protein:vir:44 237 KGFLAYESTEESDKARAFGKLQHIVS-GEATAVTADAIIKLIYTLRKAHR--TGAKFMMNNNSLFAIRLLKDT-EGNYLW 312 (401) T ss_pred ceeecccccccccccccccccccccc-ccccccCHHHHHHHHHhcchhhh--cCCEEEEcHHHHHHHHHhhcc-CCceee Confidence 010000000000 00000000000 00011114455555556655433 233567999999988643221 122322 Q ss_pred ccchhcceeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEee Q lcl|NC_011085. 229 LIDPERGSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLER 308 (343) Q Consensus 229 ~~~~~~G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~ 308 (343) ...+.+|..++++|.+|+.++++|..+.+.... +-++|.. ++.....+.++++ T Consensus 313 ~~~~~~g~~~~l~G~PVv~~~~~p~~~~~~~~i-----------------~~Gd~~~---------~~~i~~~~~~~~~- 365 (401) T protein:vir:44 313 RPGLELGQPSSLAGYGIAENEQMPDIAADAKAI-----------------AFGNFKR---------GYTIVDRIGTRIL- 365 (401) T ss_pred cCCcCCCCCceecceeeEEecCcCCccCCccEE-----------------EEeehhc---------cEEEEEecceEEe- Confidence 234567777899999999999999533221100 0122222 2222223333333 Q ss_pred eeccchhh--hhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 
309 ARRAEYQA--DQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 309 ~~~~~~~~--d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) +++-... ..+++.+++|+++++|++.+.|++++= T Consensus 366 -~~~~~~~~~v~~~a~~r~d~~~~~~~a~~~l~~~aa 401 (401) T protein:vir:44 366 -RDPYTNKPFVGFYTTKRTGGMLVDSQAIKLLKIAAA 401 (401) T ss_pred -eeccccCCcEEEEEEEEeccEEecccceEEEEeecC Confidence 3332222 346777889999999999999999888 No 111 >protein:vir:1433 Length: 435 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536362;genbank:gi:17975167;genbank:GeneID:929171 Probab=99.31 E-value=1.3e-12 Score=85.82 Aligned_cols=295 Identities=14% Similarity=0.125 Sum_probs=151.2 Q ss_pred CCCCCcc-------------cc---ccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccC-ccccccccceEEEEe Q lcl|NC_011085. 1 MADMKGG-------------QQ---LGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNR-HIMRSISSGKSAQFP 63 (343) Q Consensus 1 ~~~~~~~-------------~~---~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~-~~~~~i~~G~tv~i~ 63 (343) ++...+. .. .....+....+| .+..+.|..++.+..+..+.++.+ ++..+..+| .+.+| T Consensus 105 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg---~~vP~~~~~~ii~~l~~~~~i~~~~~~~~~~~~~-~~~~p 180 (435) T protein:vir:14 105 LAAARGDAQLASKLAIERGFGEEVAMSLNTLSPGAGG---VLVPENLSSEVIELLRPKSVVRKLGARTLPLSNG-NITIP 180 (435) T ss_pred HHhhcchhhHHHHHHHhhhhhhhhhhhcccCCcCCCc---cccchhHHHHHHHHHhhhchhhhhcceeeecCCC-ceEEE Confidence 0000000 00 000000111111 134478888888888877777765 444344444 57888 Q ss_pred cc-CcceeeeecCCCcCCCccCCCccceEEEEeeeeeeeeeeccchHHHHhc--hhhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011085. 64 VL-GRTRAAYLQAGQSLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNH--YDVRSEYTSQIGESLAMAADGAVLAE 140 (343) Q Consensus 64 ~i-G~~t~~~~~~g~~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~--~d~~~~~~~~~~~aLa~~~D~~i~~~ 140 (343) ++ +.+.+.-...|..++.+ +++..++++..-+.- .-+.|.+-=-.++. 
.++.+.+..+.+++++++.|+.++.- T Consensus 181 ~~~~~~~a~~v~E~~~~~~~--~~~f~~i~~~~~k~~-~~~~iS~ell~ds~~~~~l~~~i~~~l~~ai~~~~d~a~l~G 257 (435) T protein:vir:14 181 RLKGGAIVGYIGADTDIPTT--QQQFDDLKLTAKKMA-ALVPIANDLIKYAGVNPNVDQIVVGDLTAAIGAREDKAFIRD 257 (435) T ss_pred EEeCCcceeeeccCcccccc--ccceeEEEeeeEEEE-EeehhhHHHHHhhccCHHHHHHHHHHHHHHHHHHHHHHhhcc Confidence 86 55556666667766543 355666666664442 23445431111232 34888899999999999999988621 Q ss_pred HHhhhhccccccccccccCCceeec-ccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccc Q lcl|NC_011085. 141 LAGLCNMPAASNENIAGLGSASILE-VGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAAL 219 (343) Q Consensus 141 ~~~~a~~~~~~~~~~~g~~~~~~~~-~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~ 219 (343) ... ...+.|........ .... +....+..+.+.+.++...+...+.-......|++|..|..|.+-. T Consensus 258 ----~G~----~~~p~Gi~~~~~~~~~~~~----~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~v~n~~~~~~L~~lk 325 (435) T protein:vir:14 258 ----DGT----ANTPKGLRFWALPSNVITA----SDASTLQKIETDLGKVILALENADANLTQPGWIMAPRTFRFLEGLR 325 (435) T ss_pred ----CCC----Cccccceeecccccceecc----ccccchhhHHHHHHHHHHHhhhccccccCCEEEEcHHHHHHHHHhh Confidence 100 01122221110000 0011 1111233344445555555655544333456789999999886533 Q ss_pred hhhhhccccccchhcceeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheee Q lcl|NC_011085. 220 MPNAANYAALIDPERGSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTV 299 (343) Q Consensus 220 ~~~~~~~~~~~~~~~G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~ 299 (343) . .+..|.- ..... +.++|++|+.++.+|......... . .=+-++|+.. 
..+ T Consensus 326 d-~~G~~l~-~~~~~---g~l~G~Pv~~~~~~p~~~~~~~~~-----~---------~i~~gd~s~~----------~i~ 376 (435) T protein:vir:14 326 D-GNGNKVY-PELAN---GMLKGYPVGKTTQVPINLGETGKE-----S---------EIYFTDFGDV----------FIG 376 (435) T ss_pred c-cCCceec-cCCCC---CeeecceeEeeccccccccCCCcc-----c---------eEEEeecccE----------EEE Confidence 2 1222211 01222 379999999999999642211100 0 0011233221 122 Q ss_pred eeeeeEEeeeeccc-----------hh--hhhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 300 KLKDLSLERARRAE-----------YQ--ADQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 300 ~~~~~~~e~~~~~~-----------~~--~d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) ...+++++..++.. ++ .-.++..++++.++.||++.+.|.--.+ T Consensus 377 ~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~~ 433 (435) T protein:vir:14 377 EEETLEIDYSKEATYKDADGHMVSAFQRDQTLIRVIAKNDFGPRHVESIAVLAGVAW 433 (435) T ss_pred EecccEEEEeccccccccccchhhhhhcChhheeeeeeeCceeecccceEEEecCCC Confidence 33444555444321 11 2567888999999999999888887666 No 112 >protein:vir:102119 Length: 404 # NCBI annotation: phage major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699941;genbank:gi:110804052;genbank:GeneID:4206662 Probab=99.30 E-value=6.2e-13 Score=87.50 Aligned_cols=296 Identities=9% Similarity=0.019 Sum_probs=160.4 Q ss_pred CCCCCccccc----cccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccc-eEEEEec-cCcceeeeec Q lcl|NC_011085. 1 MADMKGGQQL----GKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSG-KSAQFPV-LGRTRAAYLQ 74 (343) Q Consensus 1 ~~~~~~~~~~----~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G-~tv~i~~-iG~~t~~~~~ 74 (343) +......+.. -.+--..+..++--.+..+.|.+++.+..+..+.++++++..++.++ -++.+++ .+...+.... 
T Consensus 92 ~~~~~~~~~~~~~~e~~a~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~g~~~~~~~~~~~~~~~v~ 171 (404) T protein:vir:10 92 LKQKNQRGLNLSEKEINAISENIDEDGGYAVPEDIQTKINTRLKDTTDLYNMVDYEPVFTRSGSRTYEKRSKQKPMKPLS 171 (404) T ss_pred HHHHHhhhhcchhhHHhhhccccCCCCceeechhHHHHHHHHHhhhhhHhhhhceeeccCCccceEEEEecCCcceeecc Confidence 1110000000 00000000001111234588999999999999999999888877642 3455554 5666777777 Q ss_pred CCCcCCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccc Q lcl|NC_011085. 75 AGQSLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNEN 154 (343) Q Consensus 75 ~g~~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~ 154 (343) .|+..+.+...++..++++...+.- .-+.|.+-=-.++.+++.+.+.++.++++++.+|+.|+.- ..... . T Consensus 172 e~~~~~~~~~~~~f~~i~~~~~k~~-~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~il~G----~g~~~----~ 242 (404) T protein:vir:10 172 ENQQIPTNGDNGKLERFNFKLKDLA-DFMSIPNDLLKFADKSLEDWIINWFVDKVRITRNAEILYG----AGGDE----H 242 (404) T ss_pred ccccccccccccceeeeEeeheeeE-eeehhhHHHHhhcHHHHHHHHHHHHHHHHHHHHHHHHhhc----CCCCC----c Confidence 7777655433455566666655442 2344544212235678999999999999999999988732 11111 1 Q ss_pred ccccCCceeecccccccccchHHHHHHHHHHHHHHHH-HHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhccccccchh Q lcl|NC_011085. 155 IAGLGSASILEVGAKGDLTSPVELGKAVIAQLTIARA-KLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAALIDPE 233 (343) Q Consensus 155 ~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~-~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~ 233 (343) +.|......+. +...+.... ++.+..+.. .|....-+ +-.++++|..|..|.+-.. .+..|.-...+. 
T Consensus 243 ~~gi~~~~~~~----~~~~~~~~~----~~~~~~~~~~~l~~~~~~--~~~~v~n~~~~~~L~~lkd-~~G~~l~~~~~~ 311 (404) T protein:vir:10 243 ATGIMTANKFK----KITLPKSPA----LKDFKKCKNVELLNVFKA--TSSWIVNQDGFNYLDSLED-KTGRPYLQPDPK 311 (404) T ss_pred ccceeeccccc----eeecccccc----HHHHHHHHHhhhhccccC--CCEEEEcHHHHHHHHHhhc-cCCceeeccCcC Confidence 11111111000 111111112 233332222 34333322 2357899999998875322 123333333456 Q ss_pred cceeEEEeceEEEEec-cccccccccccccccccccccccccccccccccccceEeEeechh-hheeeeeeeeEEeeeec Q lcl|NC_011085. 234 RGSIRNVMGFEVVEVP-HLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRS-AVGTVKLKDLSLERARR 311 (343) Q Consensus 234 ~G~V~~i~Gf~V~~sn-~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-Av~~~~~~~~~~e~~~~ 311 (343) +|...+++|.+|+.++ .+|..+.+ ..+.++++.+ ++..+....++++..++ T Consensus 312 ~~~~~~l~G~PV~~~~~~~~~~~~~---------------------------~~~~~~gd~s~~~~~~~~~~~~i~~~~~ 364 (404) T protein:vir:10 312 DPTQYRFLGLPVIELPNDLLLSTES---------------------------AIPVLLGDTKEAYKYVSDGAYELATTNI 364 (404) T ss_pred CCCCccccceeeEEecccccCCCCC---------------------------ccEEEEEeccccEEEEEecceEEEEecc Confidence 7777899999998644 33322111 1112333322 44444445556665544 Q ss_pred cc----hhhhhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 
312 AE----YQADQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 312 ~~----~~~d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) +- .-...++..+++|.++++|++.+.++++.- T Consensus 365 ~~~~~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~a 400 (404) T protein:vir:10 365 GAGAFETNTTKARIIMRIDGNVKDSEALLIAEIPVE 400 (404) T ss_pred ccchhhcCceEEEEEEeeccEEecccceEEEEeecc Confidence 31 223458899999999999999999999988 No 113 >protein:vir:102944 Length: 330 # NCBI annotation: major head protein # Family: family:all:1522 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945286;genbank:gi:39653721;uniprot:Q708M6;genbank:GeneID:2672858 Probab=99.30 E-value=2.5e-13 Score=89.65 Aligned_cols=280 Identities=14% Similarity=0.113 Sum_probs=165.8 Q ss_pred CCCCCccccccccccccccccchhHHHH-HHHHHHHHHHHHHhhhhc--c-Ccccccc-----ccceEEEEeccCcc--e Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQSGGDKLALFL-KVFGGEVLTAFARTSVTT--N-RHIMRSI-----SSGKSAQFPVLGRT--R 69 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~i-e~~~g~V~~~f~~~s~~~--~-~~~~~~i-----~~G~tv~i~~iG~~--t 69 (343) ||+.. |+. -.+++ |+|...|.+...+.+.|. + +++...+ .+|+++.+|..+.. . T Consensus 1 Ma~~~------T~l---------~d~i~pevf~~yv~~~~~~~~~l~qSG~i~~~~~i~~~~~~~G~~i~~P~~~~l~G~ 65 (330) T protein:vir:10 1 MANEL------TKI---------LDTITPQQYNAYMQQYTAAKSAFVQSGIAVSDERVSKNITSGGLLVNMPFWNDLTGD 65 (330) T ss_pred CCCCc------eEe---------eeeechhHHHHHHHHHhHHhhhhhhcccccccHHHHHHhhcCCCEEEecccccCCCc Confidence 88742 332 12455 999999999888877663 2 2232222 36999999998755 3 Q ss_pred eeeecCCC-cCCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcc Q lcl|NC_011085. 70 AAYLQAGQ-SLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMP 148 (343) Q Consensus 70 ~~~~~~g~-~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~ 148 (343) ..++..|. .+.. ..++.++..-+|=. .-..+.+.|+....+-.|++.++.++.+...+++.+..++..+.+.-+.. 
T Consensus 66 ~~~~~dg~~~i~~--~ki~t~~~~a~i~~-~~k~~~~tD~a~~~~g~dp~~~i~~q~a~~w~~~~q~~lla~l~gvf~~~ 142 (330) T protein:vir:10 66 SEVLGNGDKALET--GKITAGADIACVLY-RGRGWAANELTGVVAGSDPVRAILNRIGAYWLREDQKALIATLNGIFATG 142 (330) T ss_pred ccccCCCccccch--hhcccceeEEEEEe-ecceeeehhhhhhhcchhHHHHHHHHHHHHhhhhHHHHHHHHHHhhhhhh Confidence 55665553 5643 44666665555433 34568899998877888999999999999999988888887665443321 Q ss_pred ccccccccccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhh-ccc Q lcl|NC_011085. 149 AASNENIAGLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAA-NYA 227 (343) Q Consensus 149 ~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~-~~~ 227 (343) ..... ..................+ ++.|.+|..+|.++. ..-..++++|..|..|.+.. +++. .+. T Consensus 143 ~~~~~--~~~~~~~~~~~~~~~a~~s--------~~~l~~A~~~~GD~~--~~~~~ivmhS~v~~~L~~~~-li~~~~~s 209 (330) T protein:vir:10 143 TAGEK--GALEETHVSDQSKASTGID--------AGMVLDAKQLLGDSA--DQVTAIAMHSAVYTKLQKDN-LIQYIQPT 209 (330) T ss_pred hcccc--hhhhhhheecccccccccC--------HHHHHHHHHHhcccc--ccceEEEEcHHHHHHHHHhh-hhhhhccc Confidence 11100 0000000000000000001 345667777887664 34578999999999998753 3322 221 Q ss_pred cccchhcceeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeee---e Q lcl|NC_011085. 228 ALIDPERGSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKD---L 304 (343) Q Consensus 228 ~~~~~~~G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~---~ 304 (343) -.++.|+.++|.+|+.+..+|.... .| ...++-+-|++.....+ + T Consensus 210 ----~~~~~i~~~~G~~VivdD~~p~~~~---------------------~y-------t~yl~~~GAi~~~~~~~~~~v 257 (330) T protein:vir:10 210 ----TATINIPTYLGYRVIIDDGIAPTGD---------------------IY-------TSYLFRTGSIGLNTGNPSGLT 257 (330) T ss_pred ----ccCcccccccceEEEEeCCCCCCCC---------------------ce-------eEEEEecCceeeecccCCccc Confidence 1246789999999999999984210 01 11344455666554332 4 Q ss_pred EEeeeeccchhhhhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 
305 SLERARRAEYQADQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 305 ~~e~~~~~~~~~d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) .+|..|+++.-.|.+..++.|...+.=-..........| T Consensus 258 ~~EtdRd~~~g~~~l~~r~~~~~hp~G~s~~~~~~~~~~ 296 (330) T protein:vir:10 258 TFETSREAAKGNDMIYTRRALVMHPYGVKWTGAEVDAGN 296 (330) T ss_pred cccccCCccccceEEEEeeEEEeeeeeeeecccccccCc Confidence 678889988777777777766544322221111112223 No 114 >protein:vir:4953 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049929;genbank:gi:9632900;genbank:GeneID:1262076 Probab=99.30 E-value=6.7e-13 Score=87.32 Aligned_cols=281 Identities=16% Similarity=0.087 Sum_probs=160.4 Q ss_pred CCCCCccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCcccccccc-ceEEEEecc--CcceeeeecCCC Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISS-GKSAQFPVL--GRTRAAYLQAGQ 77 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~-G~tv~i~~i--G~~t~~~~~~g~ 77 (343) ...+..+.+.....-....+++--.+..+.|..++.+..+..+.++++++...+.+ .-+..++.. +...+.....|. T Consensus 95 ~~~l~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~ 174 (397) T protein:vir:49 95 KNLVRGRYQNLLDSKTDASGSDAGLTIPQDIQTAIHTLVSQYDSLQEYVNVENVTTLTGSRVYEKWTDITGLANIDDEAG 174 (397) T ss_pred HHHHhcchhHHHHHhhccccccCcccccHhHHHHHHHHHHhhhhHHhhhceeecccCccceEEEeeccCCcceeeecCcc Confidence 00011111000000000111111124558999999999999999999988877653 223444543 334466667777 Q ss_pred cCCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccc Q lcl|NC_011085. 78 SLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENIAG 157 (343) Q Consensus 78 ~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~g 157 (343) .++.+. .++.+++++.+.+. +....|.+-=-.++.+|+.+.+.++.+++|++..|+.|+.-.. 
T Consensus 175 ~~~~~~-~~~~~~i~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~ai~~G~g--------------- 237 (397) T protein:vir:49 175 KIADVD-DPKLSLIKYTIKRY-AGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILEAIA--------------- 237 (397) T ss_pred cccccc-ccceeeEEeeeeeE-EeeehhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhhcc--------------- Confidence 765422 35567777776544 3334555422223568999999999999999999998873211 Q ss_pred cCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhccccccchhccee Q lcl|NC_011085. 158 LGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAALIDPERGSI 237 (343) Q Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~V 237 (343) .+.. .+ ..+ . ++.|.++...|.....+ .-.++++|..|..|..-..- +..|.-...+..|.- T Consensus 238 --~~~~--~~---~~~----~----~d~i~~~~~~l~~~~~~--~a~~vmn~~~~~~l~~lkd~-~G~~l~~~~~~~~~~ 299 (397) T protein:vir:49 238 --ALPT--KP---TLT----K----WDDIIDLEAKVDPAIKQ--TSFFLTNTSGFTALKKVKNA-LGDYLMERDVKSPTG 299 (397) T ss_pred --cccc--cc---ccc----c----HHHHHHHHHhhhhhhcC--CCEEEEcHHHHHHHHHhhcC-CCceeeccCcCCCCC Confidence 1110 00 001 1 34455556667666654 34678999999998753221 223332334566777 Q ss_pred EEEeceEEEEecc--ccccccccccccccccccccccccccccccccccceEeEeech-hhheeeeeeeeEEeeeecc-c Q lcl|NC_011085. 238 RNVMGFEVVEVPH--LTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHR-SAVGTVKLKDLSLERARRA-E 313 (343) Q Consensus 238 ~~i~Gf~V~~sn~--lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~-~Av~~~~~~~~~~e~~~~~-~ 313 (343) ++++|++|+.+.+ +|...... .+.++... +++..+..+.++++..+.. + T Consensus 300 ~~l~G~PV~~~~~~~~~~~~~~~---------------------------~~i~~gd~~~~~~~~~~~~~~i~~~~~~~~ 352 (397) T protein:vir:49 300 YSIDGFAVKEVADRWLANGTGGA---------------------------MPLYFGDLKQAVTLFDRQHMSLLSTNIGGG 352 (397) T ss_pred ceecceeeEEecccccccccCCc---------------------------eeEEEeeccceEEEEeecceEEEEeccccc Confidence 8999999998654 33211110 01122222 2344444455566654432 1 Q ss_pred ---hhhhhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 
314 ---YQADQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 314 ---~~~d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) +....++...+++.++++|++.+.++++.. T Consensus 353 ~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~ 385 (397) T protein:vir:49 353 AFETDTTKVRVIDRFDVVATDTEAFVPASFKAI 385 (397) T ss_pred hhhcCceeEEEEeeeCcEEecccceEEEEeecc Confidence 223457888899999999999999999887 No 115 >protein:vir:80376 Length: 435 # NCBI annotation: gp6, major capsid head protein # Family: family:all:21 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111085;genbank:gi:134288639;genbank:GeneID:4960624 Probab=99.30 E-value=2.5e-12 Score=84.22 Aligned_cols=296 Identities=15% Similarity=0.121 Sum_probs=155.3 Q ss_pred CCCCCcc--------------cc--ccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccC-ccccccccceEEEEe Q lcl|NC_011085. 1 MADMKGG--------------QQ--LGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNR-HIMRSISSGKSAQFP 63 (343) Q Consensus 1 ~~~~~~~--------------~~--~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~-~~~~~i~~G~tv~i~ 63 (343) |+...+- .. .....+....+| .+..+.|..++.+..+..+.++.+ .++.+...| .+.+| T Consensus 105 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg---~lvP~~~~~~ii~~l~~~~~i~~~~~~~v~~~~~-~~~~p 180 (435) T protein:vir:80 105 LAAARGDAQLASKLAIERGFGEEVAMSLNTLSPGAGG---VLVPENLSSEVIELLRPKSVVRKLGARTLPLSNG-NITIP 180 (435) T ss_pred HHhccchhHHHHHHHHhhhhhhhhhhhhcccCCCCCc---cccchhHHHHHHHHHhhhchhhhccceeeecCCC-ceEEE Confidence 1110000 00 000000011111 134578889999888888887776 333343344 57777 Q ss_pred cc-CcceeeeecCCCcCCCccCCCccceEEEEeeeeeeeeeeccch--HHHHhchhhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011085. 64 VL-GRTRAAYLQAGQSLDDKRKDIKHTEKTIVIDGLLTADVLIYDI--EDAMNHYDVRSEYTSQIGESLAMAADGAVLAE 140 (343) Q Consensus 64 ~i-G~~t~~~~~~g~~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~--D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~ 140 (343) +. +.+.+.-...|+.++.+ +++.+++++.+.+.. 
..+.|.+- +.....+++.+.+.++.+++++++.|+.++.- T Consensus 181 ~~~~~~~a~~v~E~~~~~~~--~~~f~~i~~~~~k~~-~~~~is~ell~ds~~~~~l~~~i~~~l~~a~~~~~d~a~l~G 257 (435) T protein:vir:80 181 RLKGGAIVGYIGADTDIPTT--QQQFDDLKLTAKKMA-ALVPIANDLIKYAGVNPNVDQIVVGDLTAAIGAREDKAFIRD 257 (435) T ss_pred EEeCCcceeeeccCcccccc--ccceeeEEEeeEEEE-EeehhhHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHHhhcc Confidence 76 55566666667766543 356666676665543 33445431 11112457889999999999999999988731 Q ss_pred HHhhhhccccccccccccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccch Q lcl|NC_011085. 141 LAGLCNMPAASNENIAGLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALM 220 (343) Q Consensus 141 ~~~~a~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~ 220 (343) .... ..+.|......... ....+.....+.+...+.++...|...+....+-..|++|..|..|..-.. T Consensus 258 ----~G~~----~~p~Gi~~~~~~~~---~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd 326 (435) T protein:vir:80 258 ----DGTA----NTPKGLRFWALPGN---VITASDGSTLQKIETDLGKAILALENADANLTQPGWIMAPRTFRFLEGLRD 326 (435) T ss_pred ----CCCC----Ccccceeecccccc---eeecccccchhhHHHHHHHHHHHhhccccccccCEEEEcHHHHHHHHhhhc Confidence 1000 11222111110000 000011111233344455566666666654444456899999988855321 Q ss_pred hhhhccccccchhcceeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeee Q lcl|NC_011085. 221 PNAANYAALIDPERGSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVK 300 (343) Q Consensus 221 ~~~~~~~~~~~~~~G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~ 300 (343) .+..|.-. .... ++++|++|+.++++|.....+... . .-+-++|.. +..+. 
T Consensus 327 -~~G~~l~~-~~~~---~~l~G~pv~~~~~~p~~~~~~~~~------~--------~i~~gd~s~----------~~i~~ 377 (435) T protein:vir:80 327 -GNGNKVYP-ELAN---GMLKGYPVGKTTQVPINLGEAGKE------S--------EIYFTDFGD----------VFIGE 377 (435) T ss_pred -cCCceecc-CCCC---CeEeeeeeEEeccccccccCCCCc------c--------eEEEEEccc----------EEEEe Confidence 12222110 1122 379999999999999542111000 0 001122322 22233 Q ss_pred eeeeEEeeeeccc-----------hh--hhhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 301 LKDLSLERARRAE-----------YQ--ADQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 301 ~~~~~~e~~~~~~-----------~~--~d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) ...++++..++.. ++ ...++...+|+.++.||++.+.|.--.. T Consensus 378 ~~~~~i~~~~~~~~~~~~~~~~~~f~~n~~~~r~~~r~d~~~~~~~a~~~l~~~~~ 433 (435) T protein:vir:80 378 EETLEIDYSKEATYKDADGHMVSAFQRDQTLIRVIAKNDFGPRHVESIAVLSGVAW 433 (435) T ss_pred ecceEEEEeccccccccccchhhhhhcCcceeeeeeeeCcEeecccceEEEeccCC Confidence 4455566555432 11 2466788899999999999999987776 No 116 >protein:vir:2430 Length: 318 # NCBI annotation: major head subunit # Family: family:all:507 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046832;genbank:gi:9630400;genbank:GeneID:1261582 Probab=99.29 E-value=1.5e-12 Score=85.33 Aligned_cols=289 Identities=13% Similarity=0.055 Sum_probs=154.5 Q ss_pred CCC-CCccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEecc-CcceeeeecCCCc Q lcl|NC_011085. 1 MAD-MKGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVL-GRTRAAYLQAGQS 78 (343) Q Consensus 1 ~~~-~~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~i-G~~t~~~~~~g~~ 78 (343) |+- ...... .+.-..-.+++.-.+..+.+..++.+..++.++++.+.++.++. +.+++||+. +.+.+.-...|+. 
T Consensus 1 ~~~~~~~~~e--~~~~~~~~~~~~~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~-~~~~~ip~~~~~~~a~~v~Eg~~ 77 (318) T protein:vir:24 1 MAAGTAFAVD--HAQIAQTGDTMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKVPMG-TTGQKIPHWVGDVSAQWIGEGDM 77 (318) T ss_pred CCCCCCCCHH--HHHhhcccCcccceeechhHHHHHHHHHHhhchhhhhcceeecc-CCceEEEEEeCCcceEEecCCcc Confidence 111 000000 00000001111112456889999999999999999998877765 456777765 5566777778888 Q ss_pred CCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccccc Q lcl|NC_011085. 79 LDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENIAGL 158 (343) Q Consensus 79 i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~g~ 158 (343) ++.+ +++.+++++..-+.- .-..|.+-=-.++.+|+.+.+.++.++++++++|+.++.-- .... +.+. T Consensus 78 ~~~~--~~~f~~i~~~~~k~~-~~~~iS~e~l~ds~~~~~~~i~~~l~~~~~~~~d~a~l~G~----g~~~-----~~~~ 145 (318) T protein:vir:24 78 KPIT--KGNMTSQTIAPHKIA-TIFVASAETVRANPANYLGTMRTKVATAFAMAFDGAAMHGT----DSPF-----PTYI 145 (318) T ss_pred cccc--ccceeEEEEeeEEEE-EeehhhHHHhhcChHHHHHHHHHHHHHHHHHHHHHhhhccc----CCCC-----Cccc Confidence 7654 356666666554432 22345431112356889999999999999999999886321 1100 1111 Q ss_pred CCce-eecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhccccccchhcc-- Q lcl|NC_011085. 159 GSAS-ILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAALIDPERG-- 235 (343) Q Consensus 159 ~~~~-~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G-- 235 (343) .... .+..+...... .. 
..+.+.++...+...+ ...-.++++|..|..|.+-..- +..|........| T Consensus 146 ~~~~~~~~~~~~~~~~--~~----~~~~~~~~~~~~~~~~--~~~~~~v~n~~~~~~L~~lkd~-~G~~l~~~~~~~~~~ 216 (318) T protein:vir:24 146 GQTTKAISIADTTGAT--TV----YDQVAVNGLSLLVNDG--KKWTHTLLDDITEPILNGAKDQ-NGRPLFIESTYGEAA 216 (318) T ss_pred cccccccccccccccc--ch----HHHHHHHHHHhhcccc--CCCCEEEEcHHHHHHHHHhhcc-CCceeecCccccCcc Confidence 1100 01111111110 01 1122333344444443 3344679999999998753222 2222211111122 Q ss_pred ---eeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEeeeecc Q lcl|NC_011085. 236 ---SIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERARRA 312 (343) Q Consensus 236 ---~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~~~ 312 (343) .-+.+.|++|+.++++|.+.. ++++...+.+..+..+++.+|..++. T Consensus 217 ~~~~~~~i~g~pv~~~~~~~~~~~------------------------------~~~~gdfs~~~~~~~~~l~i~~~~~~ 266 (318) T protein:vir:24 217 SPFRSGRIVARPTILSDHVVEGTT------------------------------VGFMGDFSQLIWGQIGGLSFDVTDQA 266 (318) T ss_pred ccccCceEEEEeeEEeCCCCCCcc------------------------------EEEEeecceEEEEEecCeEEEEeecc Confidence 225789999999998874221 01122222222333444555554442 Q ss_pred c--------------hhh--hhhhhhhhhccceecccceEEEEecC--C Q lcl|NC_011085. 313 E--------------YQA--DQIIARYAMGHGGLRPEAAGALVFTA--G 343 (343) Q Consensus 313 ~--------------~~~--d~i~~~~~~G~~v~rpe~~~~i~~~~--g 343 (343) . +.. -.++..+++|.++++|++.+.|+... 
| T Consensus 267 ~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~i~~~~a~~ 315 (318) T protein:vir:24 267 TLNLGTVESPNFVSLWQHNLVAVRVEAEYAFHCNDAEAFVALTNVVSGG 315 (318) T ss_pred ceeccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEEeeccCC Confidence 1 222 34578889999999999998887743 2 No 117 >protein:vir:3991 Length: 404 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116499;genbank:gi:14251132;genbank:GeneID:921252 Probab=99.28 E-value=1.7e-12 Score=85.04 Aligned_cols=284 Identities=12% Similarity=0.063 Sum_probs=157.3 Q ss_pred CCCCCccc----cccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccc-eEEEEeccC--cceeeee Q lcl|NC_011085. 1 MADMKGGQ----QLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSG-KSAQFPVLG--RTRAAYL 73 (343) Q Consensus 1 ~~~~~~~~----~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G-~tv~i~~iG--~~t~~~~ 73 (343) +..+..+. ....|.-.....++--.+..+.|..++.+..+..+.++++++..++.++ -++.++... ...+... T Consensus 98 ~~~~~~~~~~~~~~e~~a~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v 177 (404) T protein:vir:39 98 VNMVRNPMAFLNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMD 177 (404) T ss_pred HHHHhcchhhhhhhhhhhhhcccccCCceeccHHHHHHHHHHHHhhhhHHhhcceeeccCCcceEEEEeecCCccceeee Confidence 10000000 0001100011111111245689999999999999999999988877643 244444432 2344556 Q ss_pred cCCCcCCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccc Q lcl|NC_011085. 
74 QAGQSLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNE 153 (343) Q Consensus 74 ~~g~~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~ 153 (343) ..|+.++.+ ..++..++++.+.+.- ..+.|.+-=-..+.+|+.+.+.++.++++++..|+.|+.- T Consensus 178 ~Eg~~~~~~-~~~~f~~i~~~~~k~~-~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~il~g------------- 242 (404) T protein:vir:39 178 AEDGKIPDL-DNPRLTIIKYLIKRYA-GIITATNTLLKDTAENILAWLSSWIAKKVVVTRNQAIIAA------------- 242 (404) T ss_pred cCccccccc-cccceeeEEeeeeeEE-eeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHhc------------- Confidence 667666532 1345677777776553 3345654222235688999999999999999999988732 Q ss_pred cccccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhccccccchh Q lcl|NC_011085. 154 NIAGLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAALIDPE 233 (343) Q Consensus 154 ~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~ 233 (343) .+.+.. .+. ..+ ++.+.+++. ..++...-+ +=.+|++|..|..|..-.. .+..|.-...+. T Consensus 243 ----~g~~~~--~~~---~~~----~~~i~~~~~---~~~~~~~~~--~a~~v~n~~~~~~L~~lkd-~~G~~l~~~~~~ 303 (404) T protein:vir:39 243 ----MGTVPK--KPT---IAK----FDDVITMIN---TSVDPAIIA--TSSLLTNQSGLNKLALVKT-AEGKYLLEPDPT 303 (404) T ss_pred ----cccccc--ccc---ccc----HHHHHHHHH---Hhhhhhhcc--CCEEEEcHHHHHHHHHhhc-cCCceeeccCcC Confidence 111110 011 111 222333321 223333222 2367899999999985322 122333333455 Q ss_pred cceeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechh-hheeeeeeeeEEeeeecc Q lcl|NC_011085. 234 RGSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRS-AVGTVKLKDLSLERARRA 312 (343) Q Consensus 234 ~G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-Av~~~~~~~~~~e~~~~~ 312 (343) .|...+++|++|+.+.+.+....+.. ....++...+ ++..+..+.++++..+.. 
T Consensus 304 ~~~~~~l~G~pV~~~~~~~~~~~~~~-------------------------~~~~~~gd~~~~~~~~~~~~~~i~~~~~~ 358 (404) T protein:vir:39 304 KPNSYLIKGKKVIVVADRWLPNSGST-------------------------VYPLYYGDMSQAITLFDRENMSLLPTNIG 358 (404) T ss_pred CCCcceecceeEEEecccccCccCCC-------------------------ccEEEEEeccccEEEEeecceEEEEeccc Confidence 66778999999999776432211100 0011233322 344444455666665543 Q ss_pred c----hhhhhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 313 E----YQADQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 313 ~----~~~d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) . +....++..++||.++++|++.+.++++.- T Consensus 359 ~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~ 393 (404) T protein:vir:39 359 AGAFETDTTKIRVIDRFDVKTTDSEALVAGSFTAI 393 (404) T ss_pred hhhhhhceeeEEEEeeeccEEecccceEEEEeecc Confidence 2 223457788999999999999999987665 No 118 >protein:vir:1268 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690760;genbank:gi:22855000;genbank:GeneID:955203 Probab=99.28 E-value=2e-12 Score=84.70 Aligned_cols=281 Identities=9% Similarity=0.017 Sum_probs=159.1 Q ss_pred CCC-CCcccc----------ccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccc-eEEEEec-cCc Q lcl|NC_011085. 1 MAD-MKGGQQ----------LGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSG-KSAQFPV-LGR 67 (343) Q Consensus 1 ~~~-~~~~~~----------~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G-~tv~i~~-iG~ 67 (343) ++. +.+... ...|...+...++--.+..+.|..++.+.....+.++.++++..+.++ ..+.+++ .+. T Consensus 98 ~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~lvP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~ 177 (397) T protein:vir:12 98 FLKGLRGKRLTDEERDLLDSPEFRAMSGINDEDGGILIPEDIGRQIHEFKRQFEPLEQYVTVEPVTTRSGTRLLEKNADM 177 (397) T ss_pred HHHHHhccCCcHHHHHHHhhhhhhhccccccccCcccCchhHHHHHHHhhhhhhhHHhhcceeeccCCceeEEEEEecCC Confidence 000 000000 000110111111111245589999999999999999998888777642 2444554 456 Q ss_pred ceeeeecCCCcCCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhc Q lcl|NC_011085. 
68 TRAAYLQAGQSLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNM 147 (343) Q Consensus 68 ~t~~~~~~g~~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~ 147 (343) ..+..+..|..++.+. .++.+++++...+.- .-..|.+-=-..+.+|+.+.+.++.+++|++..|+.|+.-. T Consensus 178 ~~a~~v~Eg~~~~~~~-~~~~~~v~~~~~k~~-~~~~is~e~l~ds~~~l~~~i~~~l~~~~~~~~d~~il~G~------ 249 (397) T protein:vir:12 178 VPFSPVEELGNLPEID-QPRFTKVSYSIIDYG-GIMTLSNSMLNDSDQAIMTYVAKWFAKKSVVTRNNLILAAI------ 249 (397) T ss_pred cceeeecccccccccc-cccceeEEeeheeeE-eeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHhcc------ Confidence 6677777787765432 245666777665443 22345432222356799999999999999999999887321 Q ss_pred cccccccccccCCceeecccccccccchHHHHHHHHHHHHHHH-HHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhcc Q lcl|NC_011085. 148 PAASNENIAGLGSASILEVGAKGDLTSPVELGKAVIAQLTIAR-AKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANY 226 (343) Q Consensus 148 ~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~-~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~ 226 (343) +.+. +.+. .+ ++.|.++. ..|+...- .+=..+++|..|..|.+-..- +..| T Consensus 250 -----------g~~~--~~g~----~~--------~~~i~~~~~~~l~~~~~--~~a~~~~n~~~~~~L~~lkd~-~G~~ 301 (397) T protein:vir:12 250 -----------ASLK--KVDI----DG--------LDGIKKALNVTLDPMVA--PGSIVLTNQDGYDWLDTLKDG-TGRY 301 (397) T ss_pred -----------cccc--cccc----cc--------HHHHHHHHhhccchhhh--CCCEEEEcHHHHHHHHHhhcc-CCce Confidence 1111 0000 11 22233322 23433332 234578999999988653211 2333 Q ss_pred ccccchhcceeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechh-hheeeeeeeeE Q lcl|NC_011085. 227 AALIDPERGSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRS-AVGTVKLKDLS 305 (343) Q Consensus 227 ~~~~~~~~G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-Av~~~~~~~~~ 305 (343) .-...+.+|..++++|.+|+.+++........ 
....++...+ ++..+..+.++ T Consensus 302 l~~~~~~~g~~~~l~G~pv~~~~~~~~~~~~~--------------------------~~~~~~gd~~~~~~~~~~~~~~ 355 (397) T protein:vir:12 302 LLQPDPTNPTKKLLDGRPVVPFTNRVLKTQKG--------------------------KAPLIIGNLKEAIVLFDREQQS 355 (397) T ss_pred eecccccCCCCccccceeeEEecccccccCCC--------------------------ccEEEEEehhceEEEEeecceE Confidence 33334567777899999999887643211100 0001233322 33344445556 Q ss_pred Eeeeeccc----hhhhhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 306 LERARRAE----YQADQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 306 ~e~~~~~~----~~~d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) ++..+... .-...+++.++++.++++|++.+.+++++= T Consensus 356 i~~~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~t~~ 397 (397) T protein:vir:12 356 IASTDTGAGAFETNSTKVRGIEREDVRKWDEDAVVFGQITVE 397 (397) T ss_pred EEEeccccchhhcCceEEEEEEeeccEEecccceEEEEEeeC Confidence 66554432 223578888999999999999999999999 No 119 >protein:vir:81160 Length: 371 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285811;genbank:gi:148747732;genbank:GeneID:5247203 Probab=99.27 E-value=9.7e-13 Score=86.44 Aligned_cols=281 Identities=10% Similarity=0.027 Sum_probs=160.1 Q ss_pred CCCCCccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccc-eEEEEeccC-cceeeeecCCCc Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSG-KSAQFPVLG-RTRAAYLQAGQS 78 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G-~tv~i~~iG-~~t~~~~~~g~~ 78 (343) -++..-+.+.+|- ..+. .+..+.|..++.+..+..+.++++++...+.++ -++.++..+ ...+.....|+. 
T Consensus 84 ~~~~~~a~~~~t~-----~~gg--~~vP~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~ 156 (371) T protein:vir:81 84 RTRFRNAMSEGSN-----QDGG--YTVPQDIQTRINELRESKDALQNLITVEPVTTLSGSRVFKKRSQQTGFVEVAEGAA 156 (371) T ss_pred HHHHHHhhccCCC-----ccCc--eeecHhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCcceeeeccccc Confidence 0000000111111 1111 135588999999999999999999988877643 344455543 456777777877 Q ss_pred CCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccccc Q lcl|NC_011085. 79 LDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENIAGL 158 (343) Q Consensus 79 i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~g~ 158 (343) ++.+. .++.+++++...+.- ..+.|.+-=-..+.+|+.+.+.++.++++++..|+.++.-.. T Consensus 157 ~~~~~-~~~f~~i~~~~~k~~-~~~~iS~ell~ds~~~l~~~i~~~l~~a~~~~~~~~i~~g~g---------------- 218 (371) T protein:vir:81 157 IGEKA-TPQFTLLQYQVKKYA-GFFRVTNELLNDSTEAIVNTLVRWIGDESRVTRNGLIINVLN---------------- 218 (371) T ss_pred ccccc-ccceeeEEeeeeEEE-EeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcc---------------- Confidence 65422 245566666665442 234554421223468999999999999999999988864211 Q ss_pred CCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhccccccchhcceeE Q lcl|NC_011085. 159 GSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAALIDPERGSIR 238 (343) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~V~ 238 (343) .+. + +... .++.+.+.+ ...|+...- ..=..|++|..|..|.+-.. 
.+..|.-...+..|..+ T Consensus 219 -~~~--~----~~~~----~~~~i~~~~---~~~l~~~~~--~~a~~vmn~~~~~~L~~lkd-~~g~~l~~~~~~~~~~~ 281 (371) T protein:vir:81 219 -TKA--K----TAIA----DLDGLKQII---NVQLDPVFR--STSSVIVNQDAFNWLDTLKD-QNGQYLLQPSISSPTGR 281 (371) T ss_pred -ccc--c----cccc----cHHHHHHHH---Hhhcchhhh--cCCEEEEcHHHHHHHHHhhc-cCCCeeeecccCCCCCc Confidence 000 0 0001 122222222 123333322 23367899999998875322 12333333345667778 Q ss_pred EEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechh-hheeeeeeeeEEeeeeccc-hh- Q lcl|NC_011085. 239 NVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRS-AVGTVKLKDLSLERARRAE-YQ- 315 (343) Q Consensus 239 ~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-Av~~~~~~~~~~e~~~~~~-~~- 315 (343) +++|.+|+.++++|.+......... .....++...+ .+..+....++++..+... .| T Consensus 282 ~l~G~pV~~~~~~~~~~~~~~~~~~--------------------~~~~i~~Gd~~~~~~~~~~~~~~i~~~~~~~~~f~ 341 (371) T protein:vir:81 282 QLLGLPVVIVSNKVLANRVDGGTGA--------------------QFAPIIVGDLKEAVVMFDRQRTEIMSSNVAMDAFE 341 (371) T ss_pred eecceeEEEecccccCccccccccC--------------------CcceEEEEehhceEEEEeecceEEEEeccccchhh Confidence 9999999999999965432111000 00011222222 2333444455555554431 22 Q ss_pred --hhhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 
316 --ADQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 316 --~d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) ...+++.+++|.++++|++.+.++++.= T Consensus 342 ~~~v~~~~~~r~d~~~~~~~a~~~~~~~~A 371 (371) T protein:vir:81 342 TDATLWRAIERMDVKMRDDEAFVFGEVQLA 371 (371) T ss_pred cCceEEEEEEeeccEEecccceEEEEEecC Confidence 3467888889999999999999998877 No 120 >protein:vir:5739 Length: 366 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892050;genbank:gi:33770513;interpro:IPR006444;uniprot:Q7Y410;genbank:GeneID:1732928 Probab=99.27 E-value=2.9e-12 Score=83.82 Aligned_cols=295 Identities=14% Similarity=0.077 Sum_probs=148.5 Q ss_pred CCCCCcccc-----ccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccC-ccccccccceEEEEecc-Ccceeeee Q lcl|NC_011085. 1 MADMKGGQQ-----LGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNR-HIMRSISSGKSAQFPVL-GRTRAAYL 73 (343) Q Consensus 1 ~~~~~~~~~-----~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~-~~~~~i~~G~tv~i~~i-G~~t~~~~ 73 (343) ||...++.. ..+-.+ +|- .|..+.+.+++.+..+..++++.+ .+.-+...| .+.+|+. +.+.+.-. T Consensus 52 ~a~~~~~~~~~~~a~~~~~~----~Gg--~lvP~~~~~~ii~~l~~~s~l~~lg~~~v~~~~g-~~~~p~~t~~~~a~wv 124 (366) T protein:vir:57 52 FAATELGDTGLSMAISTAAG----SGG--ALIPQNMQNEVIELLRDRTVVRILGARSIPLPNG-NLSMPRLSGGATAGYV 124 (366) T ss_pred HHHHhhcchhhhhhcccccc----CCc--cccchhHHHHHHHHHhhhcchhhhceeeeecCCC-ceEEEEEeCCcceeee Confidence 222111111 111111 111 134578899999988888888766 444344445 4777776 55566667 Q ss_pred cCCCcCCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccc Q lcl|NC_011085. 74 QAGQSLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNE 153 (343) Q Consensus 74 ~~g~~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~ 153 (343) ..|+.++.+ +++.+++++..-+. 
+.-..|.+-=-.++.+++.+.+.++.++++++.+|+.+|.- ......+.+ T Consensus 125 ~E~~~~~~s--~~~f~~i~~~~~k~-~~~~~iS~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l~G----~G~~~~p~G 197 (366) T protein:vir:57 125 GEGKDVVAT--GATFDDVKLSAKTM-IALVPVSNQLIGRAGFNVEQLLLGDILSAIATREDKAFLRD----DGTGDTPKG 197 (366) T ss_pred ccCcccccc--ccceeEEEEeeEEE-EEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHhhcc----CCCCccccc Confidence 778877654 35666666665433 23344543212356789999999999999999999988732 111101111 Q ss_pred cccccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhccccccchh Q lcl|NC_011085. 154 NIAGLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAALIDPE 233 (343) Q Consensus 154 ~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~ 233 (343) .......++.. ....+...+.. .....++. +...+...+.-...=..+++|..|..|.+-.. -.|.-.+. T Consensus 198 i~~~~~~~~~~-~~~~~t~~~~~-~~~~~~~~---~~~~~~~~~~~~~~a~~vmn~~~~~~L~~lkd-----~~G~~l~~ 267 (366) T protein:vir:57 198 MKAVATAANRL-VAWTGTAINLT-TIDEYLDS---LILKHMDSNSNMIRCGWGLSNRTYMTLFGLRD-----GNGNKVYP 267 (366) T ss_pred eeeccccccce-eeccccccchh-hHHHHHHH---HHHhhhccccccccCEEEecHHHHHHHHhhhc-----cCCceecc Confidence 10000000000 00000011110 01111221 12222222222223345799999998875321 11111111 Q ss_pred cceeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEeeeeccc Q lcl|NC_011085. 234 RGSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERARRAE 313 (343) Q Consensus 234 ~G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~~~~ 313 (343) ...-+.++|++|+.|+++|........ ....++...+-+..+....++++..++.. 
T Consensus 268 ~~~~g~l~G~Pvv~s~~ip~~~~~~~~------------------------~~~i~~gdfs~~~i~~~~~i~i~~~~ea~ 323 (366) T protein:vir:57 268 EMSQGILKGYPIQRTSAIPANLGDDGN------------------------ESEIYFCDFNDVVIGEDGMMKVDFSTEAT 323 (366) T ss_pred CCCCCeecceeeEEccccccccccCCC------------------------ccEEEEEecceEEEEEecceEEEEeeccc Confidence 112247999999999999953211100 00012223332223334445555555432 Q ss_pred -----------hhh--hhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 314 -----------YQA--DQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 314 -----------~~~--d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) ++. -.++..++++.+++||++.+.|+-..= T Consensus 324 ~~~~~g~~~~~f~~~~~~iR~~~~~d~~v~~~~a~~~lt~~~~ 366 (366) T protein:vir:57 324 YKDADGQLVSAFARNQSLIRVVTEHDIGFRHPEGLVLGTGVIW 366 (366) T ss_pred cccccccchhhhhcCceeEEeeeeeCcEeeccccEEEEecccC Confidence 112 267788889999999998887753322 No 121 >protein:vir:105038 Length: 428 # NCBI annotation: major capsid head protein precursor # Family: family:all:21 # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006586;genbank:gi:46402092;genbank:GeneID:2777903 Probab=99.27 E-value=4.1e-12 Score=83.03 Aligned_cols=296 Identities=11% Similarity=0.077 Sum_probs=147.8 Q ss_pred CCCCCcccccccccccc-ccccchhHHHHHHHHHHHHHHHHHhhhhccC-ccccccccceEEEEecc-CcceeeeecCCC Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKG-QSGGDKLALFLKVFGGEVLTAFARTSVTTNR-HIMRSISSGKSAQFPVL-GRTRAAYLQAGQ 77 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~-~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~-~~~~~i~~G~tv~i~~i-G~~t~~~~~~g~ 77 (343) |+.-........+.-.. 
-.+|- -|..+.|..++.+..+..++++.+ ++.-+..+| .+.||++ +.+++.-...|+ T Consensus 113 ~~~~~~~~~~~~~~~~~~~~~gg--~liP~~~~~~ii~~l~~~~~l~~~~~~~~~~~~g-~~~~p~~~~~~~a~~v~Eg~ 189 (428) T protein:vir:10 113 FASDELNDQSVSMAISTAAGSGG--VLIPQNIHSEVIELLRDRTIVRKLGARSIPLPNG-NMSLPRLAGGATASYTGENQ 189 (428) T ss_pred HhhhhhhhhhHhhhhcccccCCc--cccchhHHHHHHHHHhhhchhhhhcceeeecCCc-ceEEEEEeCCcceeeeccCc Confidence 22111111111111000 00111 123477888888888888888777 333222233 3778876 445666667777 Q ss_pred cCCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccc Q lcl|NC_011085. 78 SLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENIAG 157 (343) Q Consensus 78 ~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~g 157 (343) .++.+ +++.+++++...+. +.-+.|.+-=-.++.+++.+.+.++.+++|++..|+.+|.- ... ...+.| T Consensus 190 ~~~~~--~~~f~~i~~~~~k~-~~~v~is~ell~ds~~~l~~~i~~~l~~ai~~~~d~~~l~G----~G~----~~~p~G 258 (428) T protein:vir:10 190 DAKVS--EARFDDVKLTAKTM-IAMVPISNALIGRAGFNVEQLVLQDILTAISVREDKAFMRD----DGT----GDTPIG 258 (428) T ss_pred ccccc--ccceeeEEeeeEEE-EEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhcc----CCC----Cccccc Confidence 77653 35666777766444 23345554222346789999999999999999999988621 110 111122 Q ss_pred cCC----ceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhccccccchh Q lcl|NC_011085. 158 LGS----ASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAALIDPE 233 (343) Q Consensus 158 ~~~----~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~ 233 (343) .-. .+.+.........+ .......++.+ .......+.....-..+++|..|..|.+-.. .+..|.-. ... 
T Consensus 259 i~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~---~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd-~~G~~i~~-~~~ 332 (428) T protein:vir:10 259 MKARATQWNRLLPWAADAAVN-LDTIDTYLDSI---ILMSMDGNSNMISSGWGMSNRTYMKLFGLRD-GNGNKVYP-EMA 332 (428) T ss_pred ccccccccccccccccccccc-HHHHHHHHHHH---HHhhhccccccccCEEEEcHHHHHHHHHhhc-cCCceecc-CCC Confidence 111 11111001111111 11111112221 1111111111122345679999988865321 12222110 122 Q ss_pred cceeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEeeeeccc Q lcl|NC_011085. 234 RGSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERARRAE 313 (343) Q Consensus 234 ~G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~~~~ 313 (343) . ++++|.+|+.++++|....... +..+.++.+.+-+..+....++++..++.. T Consensus 333 ~---g~l~G~pv~~~~~~p~~~~~~~------------------------~~~~i~~gd~s~~~i~~~~~i~i~~~~~~~ 385 (428) T protein:vir:10 333 Q---GMLKGYPIQRTSAIPANLGEGG------------------------KESEIYFADFNDVVIGEDGNMKVDFSKEAS 385 (428) T ss_pred C---CeeeceeeEEeccccccccCCC------------------------ccceEEEEecceEEEEEecceEEEeecccc Confidence 3 3799999999999996421110 001112223332333334445555555432 Q ss_pred -----------hh--hhhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 
314 -----------YQ--ADQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 314 -----------~~--~d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) ++ .-.++...+++.++.||++.+.++--.= T Consensus 386 ~~~~~~~~~~~f~~~~~~~R~~~r~d~~v~~p~a~~~~t~~~~ 428 (428) T protein:vir:10 386 YIDTDGKLVSAFSRNQSLIRVVTEHDIGFRHPEGLVLGTGVLF 428 (428) T ss_pred cccccccccchhhcchhheeeeeeeCceeeccceEEEEeccCC Confidence 11 2456888899999999999988764444 No 122 >protein:vir:5974 Length: 324 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690674;genbank:geneid:6329212;genbank:gi:22855068;goa:Q38582;uniprot:Q38582;genbank:GeneID:955303 Probab=99.26 E-value=1e-12 Score=86.26 Aligned_cols=275 Identities=15% Similarity=0.106 Sum_probs=171.4 Q ss_pred CCCCCccccccccccccccccchhHHHH-HHHHHHHHHHHHHhhhhc--cCc-c---ccc----cccceEEEEeccCcc- Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQSGGDKLALFL-KVFGGEVLTAFARTSVTT--NRH-I---MRS----ISSGKSAQFPVLGRT- 68 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~i-e~~~g~V~~~f~~~s~~~--~~~-~---~~~----i~~G~tv~i~~iG~~- 68 (343) ||.+ +. + .+++ |+|...|.+...+.+.|. +.+ + ... -.+|+++.+|..+.. T Consensus 1 MA~T--------~l------s---d~i~peVf~~yv~~~~~~~~~l~qSg~i~~~a~i~~~l~~~~~G~~i~~P~~~~l~ 63 (324) T protein:vir:59 1 MAYT--------KI------S---DVIVPELFNPYVINTTTQLSAFFQSGIAATDDELNALAKKAGGGSTLNMPYWNDLD 63 (324) T ss_pred CCce--------ee------e---ceechhHHHHHHHhhhHHHHHHhhcccccccHHHHHHhhccCCCCEEEecccccCC Confidence 6632 21 1 2455 999999998888887763 222 1 111 136999999998765 Q ss_pred -eeeeecCCCcCCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhc Q lcl|NC_011085. 69 -RAAYLQAGQSLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNM 147 (343) Q Consensus 69 -t~~~~~~g~~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~ 147 (343) ...++..++.+.. ..+..++..-+|= .....+.+.|+....+--|++.++.++.+..++++.+..+|..|...... 
T Consensus 64 Gd~~~v~~~~~i~~--~~l~t~~~~a~i~-~~~k~~~~tD~a~~~sg~dp~~~i~~q~a~~~~~~~~~~lia~l~g~~~~ 140 (324) T protein:vir:59 64 GDSQVLNDTDDLVP--QKINAGQDKAVLI-LRGNAWSSHDLAATLSGSDPMQAIGSRVAAYWAREMQKIVFAELAGVFSN 140 (324) T ss_pred CcccccCCCcccch--hhcccceeeEEEE-eecCceeehhhhhhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhc Confidence 4678888888764 4566666665553 46677889998877788899999999999999999999988776543321 Q ss_pred cccccccccccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhccc Q lcl|NC_011085. 148 PAASNENIAGLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYA 227 (343) Q Consensus 148 ~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~ 227 (343) ... ....++..+.++. .-. .+.|.+|..+|.++. ..-..++++|..|..|.+........+. T Consensus 141 ~~~---------~~~~~dvsa~~~~---~~s----~~~l~~A~~~~GD~~--~~~~~ivmhS~v~~~L~~~~li~~~~~s 202 (324) T protein:vir:59 141 DDM---------KDNKLDISGTADG---IYS----AETFVDASYKLGDHE--SLLTAIGMHSATMASAVKQDLIEFVKDS 202 (324) T ss_pred ccc---------ccceeeeeccccc---eec----HHHHHHHHHHhCCcc--cCcEEEEEchHHHHHHHHhhhhhhcccc Confidence 111 1111222221111 111 245666777887753 3446889999999999876422211121 Q ss_pred cccchhcceeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeee-eeEE Q lcl|NC_011085. 228 ALIDPERGSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLK-DLSL 306 (343) Q Consensus 228 ~~~~~~~G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~-~~~~ 306 (343) . .++.|+.++|.+|+++..+|....... ......+.+-+-|++....+ ++.+ T Consensus 203 -~---~~~~i~~~~G~~VivdD~~p~~~~~~~-----------------------~~~y~s~l~~~GAi~~~~~~~~v~v 255 (324) T protein:vir:59 203 -Q---SGIRFPTYMNKRVIVDDSMPVETLEDG-----------------------TKVFTSYLFGAGALGYAEGQPEVPT 255 (324) T ss_pred -c---cCceeeeecccEEEEeCCCCccccCCC-----------------------CceEEEEEEecCeEEEeecCCCcce Confidence 1 245789999999999999995321110 01122355666777777654 4568 Q ss_pred eeeeccchhhhhhhhhhhhccceecccceEEEEec-CC Q lcl|NC_011085. 
307 ERARRAEYQADQIIARYAMGHGGLRPEAAGALVFT-AG 343 (343) Q Consensus 307 e~~~~~~~~~d~i~~~~~~G~~v~rpe~~~~i~~~-~g 343 (343) |..|++....|.+.....|...+. +.--.... .| T Consensus 256 E~dRd~~~g~~~l~~r~~~~~~p~---G~s~~~~~~~~ 290 (324) T protein:vir:59 256 ETARNALGSQDILINRKHFVLHPR---GVKFTENAMAG 290 (324) T ss_pred ecccCccccceEEEEeeEEEeEee---eEEecccccCC Confidence 999999877777777666654332 11111111 12 No 123 >protein:vir:2504 Length: 305 # NCBI annotation: major capsid subunit gp9 # Family: family:all:507 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569745;genbank:gi:18496895;genbank:GeneID:932268 Probab=99.24 E-value=4.4e-12 Score=82.83 Aligned_cols=281 Identities=13% Similarity=0.058 Sum_probs=149.6 Q ss_pred CCCCCccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEeccC-cceeeeecCCCcC Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVLG-RTRAAYLQAGQSL 79 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG-~~t~~~~~~g~~i 79 (343) ||..++.... .|..+.++.++.+..++.+.++.++++.++. +.+.+||+.. .+.+.-+..|+.. T Consensus 1 ma~~t~~~gg--------------~liP~~~~~~Ii~~~~~~s~l~~l~~~~~~~-~~~~~~p~~~~~~~a~wv~E~~~~ 65 (305) T protein:vir:25 1 MADISRAEVA--------------SLIQEAYSDTLLAAAKQGSTVLSAFQNVNMG-TKTTHLPVLATLPEADWVGESATD 65 (305) T ss_pred CCCccCCccc--------------eecCHHHHHHHHHHHHhhchhhhhcceeecc-CCcEEEEEEeCCcceEEeeccccc Confidence 7776443321 2345788999999999999999999887765 4567787754 4456666666654 Q ss_pred CCcc---CCCccceEEEEeeeeeeee-eeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccc Q lcl|NC_011085. 80 DDKR---KDIKHTEKTIVIDGLLTAD-VLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENI 155 (343) Q Consensus 80 ~~~~---~~~~~~~~~l~iD~~~~~~-~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~ 155 (343) +... .+++..++++.. .|+.. ..|.+-=-.++.+|+.+.+.++.+++|++.+|+.++.-- ....+. ... 
T Consensus 66 ~~~~~~~s~~~f~~i~~~~--~k~~~~~~is~ell~ds~~~~~~~i~~~l~~~~a~~~d~a~~~G~----g~~~~~-~~~ 138 (305) T protein:vir:25 66 PKGVKPTSKVTWANRTLVA--EEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGT----DKPASW-VSP 138 (305) T ss_pred ccccccccccceeeEEeee--EEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHhhhheecc----CCCCCc-ccc Confidence 4321 123344444444 34333 445432122356889999999999999999999987311 000000 000 Q ss_pred cccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhccccccchhcc Q lcl|NC_011085. 156 AGLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAALIDPERG 235 (343) Q Consensus 156 ~g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G 235 (343) .......... ..............+++.+..+...+....-.. . -++++|..|..|.+-. |-+|.-.+.. T Consensus 139 ~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~-~~v~~~~~~~~l~~lk-----d~~G~~i~~~- 208 (305) T protein:vir:25 139 ALIPAAVTAG--QAVEVVGGVANESDIVGATNRAAKAVASAGWAP-D-TLLSSLALRYEVANIR-----DANGNPVFRD- 208 (305) T ss_pred cccccccccc--ccccccccchhhhHHHHHHHHHHHhhhhccccc-c-eeEecHHHHHHHHHhh-----ccCCceeecC- Confidence 0000000000 000011111112334555555544444332211 1 2677999999886422 2222223333 Q ss_pred eeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEeeeecc--- Q lcl|NC_011085. 236 SIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERARRA--- 312 (343) Q Consensus 236 ~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~~~--- 312 (343) +.++|++|+.++++|...... . -+-++|+. ...+..+.++++..++. T Consensus 209 --~~l~G~Pv~~~~~~~~~~~~~-~-----------------~~~gd~s~----------~~i~~~~~~~i~~~~~~~~~ 258 (305) T protein:vir:25 209 --DSFAGFRTFFNRNGAWDADAA-I-----------------EVIADSSR----------VKIGVRQDITVKFLDQATLG 258 (305) T ss_pred --CcccccceEEcCccCCCCCcc-E-----------------EEEEecce----------EEEEEecCeEEEEeeeeeee Confidence 369999999999987432100 0 01122322 12222333344433321 Q ss_pred -------chhh--hhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 
313 -------EYQA--DQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 313 -------~~~~--d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) .++. -.++...++|..++||++++.+....- T Consensus 259 ~~~~~~~~~~~~~~~~R~~~r~~~~v~~p~a~v~~~~~~~ 298 (305) T protein:vir:25 259 TGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPV 298 (305) T ss_pred cCCceeeeeecCcEEEEEEEeecceeeCcccEEEEccccc Confidence 1222 245677789999999999888876533 No 124 >protein:vir:95376 Length: 425 # NCBI annotation: phage major capsid protein # Family: family:all:635 # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764476;genbank:gi:115334630;genbank:GeneID:5179263 Probab=99.24 E-value=6e-13 Score=87.58 Aligned_cols=291 Identities=10% Similarity=0.065 Sum_probs=149.3 Q ss_pred CCCCCcccccc-----ccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEeccCcc-eeeeec Q lcl|NC_011085. 1 MADMKGGQQLG-----KDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVLGRT-RAAYLQ 74 (343) Q Consensus 1 ~~~~~~~~~~~-----t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG~~-t~~~~~ 74 (343) |.......... .........++--.+..+.+..++.+..+..+.+++++++.++. |+ ++||+.... .+.... T Consensus 119 ~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vP~~~~~~Ii~~l~~~~~i~~~~~~~~~~-g~-~~ip~~~~~~~a~~v~ 196 (425) T protein:vir:95 119 LKTGEYYKRSEVVEFYEKFRNLRAVAGGELTIPEVVVNRIMDIMGDYTTLYPLVDKIRVK-GT-TRILVDTDTSPATWIE 196 (425) T ss_pred HhhhhhhhhhHHHHHHHHHHhhcccccCceeccHHHHHHHHHHHHhhhhHHHhhceeecC-ce-eEEEEecCCccccccc Confidence 11110000000 00000011111112455889999999999999999998877753 44 467776544 344555 Q ss_pred CCCcCCCccCCCccceEEEEeeeeeeee-eeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccc Q lcl|NC_011085. 75 AGQSLDDKRKDIKHTEKTIVIDGLLTAD-VLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNE 153 (343) Q Consensus 75 ~g~~i~~~~~~~~~~~~~l~iD~~~~~~-~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~ 153 (343) .|..++.+. ..+.+++++.. .++.. +.|.+-=-.++..++.+.+.++.++++++..|+.||.- ... ... 
T Consensus 197 E~~~~~~~~-~~~f~~i~l~~--~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~il~G----~G~---~~~ 266 (425) T protein:vir:95 197 QSGALPTGD-VGTIASIDFDG--FKVGKVTFVDNYLLQDSIINLDDYVTKKIARAIAKALDLAIVKG----TGA---ANK 266 (425) T ss_pred ccccccccc-ccccceeeeeh--eeeeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHhhcc----CCC---Ccc Confidence 666664432 12345555544 44443 34544222334578999999999999999999988731 100 000 Q ss_pred cccccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHH-HHHHhccchhh--hhcccccc Q lcl|NC_011085. 154 NIAGLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEV-YSAILAALMPN--AANYAALI 230 (343) Q Consensus 154 ~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~-~~~Ll~~~~~~--~~~~~~~~ 230 (343) .+.|...+ +............. .++.+.++...+.....+...-+.+++|.. |..|..-.... +..|.. T Consensus 267 ~p~Gil~~--~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~l~~l~~~kd~~g~~i~-- 338 (425) T protein:vir:95 267 QPLGIIPS--LPPENQVTVEADNN----LLKNLVKQIGLIDTGDDSVGEIVAVMKRSTYYNRLVEFSIQVDSNGNVVG-- 338 (425) T ss_pred ccceeecc--cccccccccccccc----hHHHHHHHHHhhhhhccccCceEEEEeChHHHHHHHHHHhhcCCCCceee-- Confidence 11111110 00000000011111 234444444555555443344344555554 44443222111 222321 Q ss_pred chhcceeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEeeee Q lcl|NC_011085. 231 DPERGSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERAR 310 (343) Q Consensus 231 ~~~~G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~ 310 (343) ....+...+++|.+|+.++++|...+ +-++|... ..+..+.++++... T Consensus 339 ~~~~~~~~~l~G~pvv~~~~~~~~~i----------------------~~Gd~~~~----------~~~~~~~~~i~~~~ 386 (425) T protein:vir:95 339 KLPNLRTPDLLGLRVVFNNFLDDDTV----------------------LFGEFEQY----------TLVERENITIDSST 386 (425) T ss_pred ccCCCCCccccceeeEEcCcCCCccE----------------------EEEecccE----------EEEeecceEEEeec Confidence 12355567899999999999984311 01223221 12223444555554 Q ss_pred ccchh--hhhhhhhhhhccceecccceEEEEecC---C Q lcl|NC_011085. 
311 RAEYQ--ADQIIARYAMGHGGLRPEAAGALVFTA---G 343 (343) Q Consensus 311 ~~~~~--~d~i~~~~~~G~~v~rpe~~~~i~~~~---g 343 (343) +..+- ...+++..++++++++|++.+.+.++. | T Consensus 387 ~~~f~~~~~~~~~~~r~d~~~~~~~a~~~~~i~~~~~g 424 (425) T protein:vir:95 387 HVKFTEDQTAFRGKGRFDGKPVKPEAFVLVTITDPVQG 424 (425) T ss_pred ccccccCceEEEEEEeeCcEeecccceEEEEecCcCCC Confidence 43221 246777889999999999999988876 4 No 125 >protein:vir:1383 Length: 421 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612835;genbank:gi:20065969;genbank:GeneID:935826 Probab=99.24 E-value=1.1e-12 Score=86.23 Aligned_cols=276 Identities=11% Similarity=0.033 Sum_probs=158.7 Q ss_pred CCCCCccccc-cccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEeccCcc---eeeeecCC Q lcl|NC_011085. 1 MADMKGGQQL-GKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVLGRT---RAAYLQAG 76 (343) Q Consensus 1 ~~~~~~~~~~-~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG~~---t~~~~~~g 76 (343) +..+.+.... -.|.+.....|. .|..+.|..++.+..+..+.++++++..++.++ ++++|+.... .+.....| T Consensus 101 ~~~~~~~~~~~~~ra~~t~~~gg--~liP~~~~~~Ii~~~~~~~~l~~l~~~~~~~~~-~~~~~~~~~~~~~~~~~~~E~ 177 (421) T protein:vir:13 101 SKTIRGIQLSEEERDIMSSTNNG--AVIPQEFVNEFEKLKEGYPSLKEHCHVIPVNRN-AGKMPVRAGASVDKLANLAKD 177 (421) T ss_pred HHhhhccchhHHHhhccccCCcc--eecchhhHHHHHHHHHhhhhhhhhceeeeccCC-ceEEEEeecCCccceeecccc Confidence 0011110000 022222222221 245588999999999888999999888776543 4565543322 24445556 Q ss_pred CcCCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccc Q lcl|NC_011085. 77 QSLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENIA 156 (343) Q Consensus 77 ~~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~ 156 (343) ..++.+ +++..++++.+.+.. .-+.|.+-=-.++.+|+.+.+.++.+++++...|..++..+... 
T Consensus 178 ~~~~~s--~~~f~~i~~~~~k~~-~~v~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~i~~~~~g~------------ 242 (421) T protein:vir:13 178 TELVKA--MLKTQPMAYDIDDYG-LLAPIDNSLLEDSEINFLEFVNEEFAEFAVNTENAEIVKQAKAV------------ 242 (421) T ss_pred cccccc--ccceeEEEeeeeeeE-eehhhhHHHHhhhHHHHHHHHHHHHHHHHHHHhhhhHhhhhhhc------------ Confidence 665542 356666777665442 33445442223356889999999999999999998887532110 Q ss_pred ccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhccccccchhcce Q lcl|NC_011085. 157 GLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAALIDPERGS 236 (343) Q Consensus 157 g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~ 236 (343) +... ... . ++.|.++...|..+..+. -.+|++|..|..|..-..- +..|.- .....|. T Consensus 243 -------~~~~---~~~----~----~d~i~~~~~~l~~~~~~~--a~~v~n~~~~~~l~~lkd~-~G~~i~-~~~~~~~ 300 (421) T protein:vir:13 243 -------LAEE---TIN----D----YAGLVKTINSLVPNARKR--AIIVTNSDGRAYLDGLMDK-QGRPLL-KELSDGG 300 (421) T ss_pred -------cccc---ccc----c----hHHHHHHHHHhhhhhcCC--CEEEEcHHHHHHHHHhhcC-CCceee-cCcCCCC Confidence 0000 011 1 334445555566555432 2567899999988753211 222321 1245666 Q ss_pred eEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechh-hheeeeeeeeEEeeeeccchh Q lcl|NC_011085. 237 IRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRS-AVGTVKLKDLSLERARRAEYQ 315 (343) Q Consensus 237 V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-Av~~~~~~~~~~e~~~~~~~~ 315 (343) ..+++|.+|+.++++|..+.+. ...+++..+ ++.......++++..++..+. T Consensus 301 ~~tl~G~pV~~~~~~~~~~~~~---------------------------~~~~~gd~~~~~~~~~~~~~~v~~~~~~~f~ 353 (421) T protein:vir:13 301 DLVFKGRPVIELEESIFDVGDE---------------------------TKFIVSDFKTLIKFMDRKQYLIDQSKEAGYT 353 (421) T ss_pred CceecceeeEEeccccccCCCc---------------------------eEEEEEeccccEEEEEecceEEEeecccccc Confidence 7899999999999988432110 112333333 344455566778877766543 Q ss_pred h--hhhhhhhhhccceecccceEEEEecC-C Q lcl|NC_011085. 
316 A--DQIIARYAMGHGGLRPEAAGALVFTA-G 343 (343) Q Consensus 316 ~--d~i~~~~~~G~~v~rpe~~~~i~~~~-g 343 (343) . ..+++..+++.++++|+++..+.... | T Consensus 354 ~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~ 384 (421) T protein:vir:13 354 KNETIARIIERFDVNSPLDKSSDAEKIRKFG 384 (421) T ss_pred cCeeEEEEEeeecceeecchhhheeeecccc Confidence 3 36788899999999999976544433 2 No 126 >protein:vir:7409 Length: 408 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839926;genbank:gi:30089896;genbank:GeneID:1260683 Probab=99.23 E-value=3e-12 Score=83.75 Aligned_cols=284 Identities=12% Similarity=0.080 Sum_probs=154.7 Q ss_pred CCCC-Ccc----ccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccc-eEEEEeccCcc-eeeee Q lcl|NC_011085. 1 MADM-KGG----QQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSG-KSAQFPVLGRT-RAAYL 73 (343) Q Consensus 1 ~~~~-~~~----~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G-~tv~i~~iG~~-t~~~~ 73 (343) +.+. ..+ .....+.-.....++--.+..+.|..++.+..+..+.++++++..++.++ .++.+++.... ....+ T Consensus 97 ~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 176 (408) T protein:vir:74 97 FVNMVRNPMAFLNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSSGSRVYEKWTDVTPLKAM 176 (408) T ss_pred HHHHHhcchhhhhhhhhhhhcccccCCCceeechhHhhHHHHHHhhhcchhhhcceeeccCCcceEEEEeecCCcccccc Confidence 0000 000 00001100001111111235589999999999999999999988877653 34556654332 22223 Q ss_pred -cCCCcCCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccc Q lcl|NC_011085. 74 -QAGQSLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASN 152 (343) Q Consensus 74 -~~g~~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~ 152 (343) ..|+.++.. 
..++..+++++..+.- .-..|.+-=-.++.+|+.+.+.++.+++|++..|+.|+.- T Consensus 177 v~E~~~~~~~-~~~~~~~i~~~~~k~~-~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~il~G------------ 242 (408) T protein:vir:74 177 DEEDGKIPDL-DNPRLTIIKYLIKRYA-GIITATNTLLKDTAENILAWLSSWIAKKVVVTRNQAIIAA------------ 242 (408) T ss_pred cccccccccc-cccceeeEEeeeeeEE-eeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhc------------ Confidence 334554432 1245566677665542 3345554222335789999999999999999999988631 Q ss_pred ccccccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhccccccch Q lcl|NC_011085. 153 ENIAGLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAALIDP 232 (343) Q Consensus 153 ~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~ 232 (343) .+.+.. .+ .. ..++.+++++ ...|+....+ +=..|++|..|..|..-.. .+..|.-...+ T Consensus 243 -----~G~~~~--~~---~~----~~~~~i~~~~---~~~l~~~~~~--~a~~v~n~~~~~~l~~lkd-~~G~~l~~~~~ 302 (408) T protein:vir:74 243 -----MGTVPK--KP---TI----ANFDDVITMI---NTSVDPAIIA--TSSLLTNQSGLNKLALVKT-AEGKYLLEPDP 302 (408) T ss_pred -----cccccc--cc---cc----ccHHHHHHHH---HHhhhhhhcC--CCEEEEcHHHHHHHHHhhc-CCCceEeccCc Confidence 111110 00 00 1123333332 2345555443 2357789999999975321 22333323345 Q ss_pred hcceeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeech-hhheeeeeeeeEEeeeec Q lcl|NC_011085. 233 ERGSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHR-SAVGTVKLKDLSLERARR 311 (343) Q Consensus 233 ~~G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~-~Av~~~~~~~~~~e~~~~ 311 (343) ..|.-++++|++|+.+++.+.....+. ..+.++... +++..+..+.++++..+. 
T Consensus 303 ~~~~~~~l~G~pV~~~~~~~~~~~~~~-------------------------~~~i~~gd~~~~~~~~~~~~~~i~~~~~ 357 (408) T protein:vir:74 303 TKPNSYLIKGKQVIVVADRWLPNSGST-------------------------VYPLYYGDMSQAITLFDRENMSLLPTNI 357 (408) T ss_pred CCCCCceecceeeEEecCcccccccCC-------------------------cceEEEEehhccEEEEEecceEEEEecc Confidence 566678999999998875332111100 001122222 233344445555655443 Q ss_pred c----chhhhhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 312 A----EYQADQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 312 ~----~~~~d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) . .+....++..++++.++++|++.+.++++.. T Consensus 358 ~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~ 393 (408) T protein:vir:74 358 GAGAFETDTTKIRVIDRFDVKATDSEALVAGSFTAI 393 (408) T ss_pred ccchhhcceeeEEEEEeeCcEEecccceEEEEeecc Confidence 2 2334557788899999999999999998777 No 127 >protein:vir:6212 Length: 434 # NCBI annotation: prohead protease # Family: family:all:21 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852592;genbank:gi:31415852;genbank:GeneID:1489210 Probab=99.22 E-value=2e-12 Score=84.76 Aligned_cols=292 Identities=11% Similarity=0.052 Sum_probs=145.1 Q ss_pred CCCCCccccccccccc-cccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEecc-Ccceeeee---cC Q lcl|NC_011085. 1 MADMKGGQQLGKDQGK-GQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVL-GRTRAAYL---QA 75 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~-~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~i-G~~t~~~~---~~ 75 (343) ..++-.++....+.-. +..+++-=.|..+.|+.+|.+..+..+.++.+.++.... | .+.+|++ +..+.... .. 
T Consensus 127 ~~~~l~~~~~~~e~~a~~~~t~~GG~lvP~~~~~~Ii~~l~~~~~i~~~~~~~~~~-~-~~~~p~~~~~~~a~~~~~~~e 204 (434) T protein:vir:62 127 FANYIVGNIDEKEARALGLVTGNGSVTIPDFLSKEIITYAQEENFLRRLGTGVKTK-E-NIKYPVLVKKAEAQGHKNERT 204 (434) T ss_pred HHHHhccccchhhhhhhcccccccceecchhhHHHHHHhhhhhhhhhhhcceeccC-C-ceEEEEEecCCcccceecccc Confidence 0000000000000000 011111111344899999999999999998888765543 3 3667664 22222221 22 Q ss_pred CCcCCCccCCCccceEEEEeeeeeeee-eeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccc Q lcl|NC_011085. 76 GQSLDDKRKDIKHTEKTIVIDGLLTAD-VLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNEN 154 (343) Q Consensus 76 g~~i~~~~~~~~~~~~~l~iD~~~~~~-~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~ 154 (343) |..++. .+++..++++.+- ++.. +.|.+-=-.++.+|+.+.+.++.+++|++..|+.++.- .....+ T Consensus 205 ~~~~~~--~~~~f~~v~~~~~--k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~d~~~l~G----~G~~~~---- 272 (434) T protein:vir:62 205 NNEMPE--TDIEFDEIELSPT--EFDALATVTKKLLARTGLPIEQIVMDELKKAYVRKETQYMVNG----DEANNI---- 272 (434) T ss_pred cccccc--cccceeeEEeehe--eeEeehhhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhcc----CCCCcc---- Confidence 333332 2344555555554 3333 33433212235689999999999999999999988721 111111 Q ss_pred ccccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhcccc--ccch Q lcl|NC_011085. 155 IAGLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAA--LIDP 232 (343) Q Consensus 155 ~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~--~~~~ 232 (343) +.|. +.....+...+... .++.|.++...|+....+ .. ..|++|..|..|.+-..- +..|.- .... 
T Consensus 273 ~~g~-----~~~~~~~~~~~~~~----~~d~l~~l~~~l~~~~~~-~a-~~v~n~~~~~~L~~lkd~-~G~~l~~~~~~~ 340 (434) T protein:vir:62 273 NDGA-----LAKKAVEFKTDEKN----LYDALVKMKNTPVKEVRK-KA-RWVLNTAALTKIETMKTD-DGFPLLRPFNQA 340 (434) T ss_pred ccce-----eecccccccccccc----hhhHHHHHHhhcchhhhc-CC-EEEEcHHHHHHHHHhhcc-CCCEeeccCCCc Confidence 1111 11111111111122 355566666677665543 23 347899999988643211 223321 1233 Q ss_pred hcceeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEeeeecc Q lcl|NC_011085. 233 ERGSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERARRA 312 (343) Q Consensus 233 ~~G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~~~ 312 (343) ..|.-.+++|++|+.++.+|....+..... +-++|+... ++.+. -.+.++...+. T Consensus 341 ~~g~~~tl~G~pV~~~~~~~~~~~~~~~~i----------------~~Gdfs~~~--i~~~~-------g~~~i~~~~~~ 395 (434) T protein:vir:62 341 EGGIGYTLLGFPVEEEDAIDIPDSPDTPVF----------------YFGDFSKFY--IQDVI-------GSLEVQKLVEL 395 (434) T ss_pred cCCCCceecceeeEEecCccCccCCCceEE----------------EEeeccceE--EEEee-------ceeEEEeehhh Confidence 456667899999999999985432211100 112332221 11110 11233333322 Q ss_pred chhhh--hhhhhhhhccceec-ccceEEEEec----CC Q lcl|NC_011085. 313 EYQAD--QIIARYAMGHGGLR-PEAAGALVFT----AG 343 (343) Q Consensus 313 ~~~~d--~i~~~~~~G~~v~r-pe~~~~i~~~----~g 343 (343) -+.-+ .+++..++.+++++ |++..++.+. 
.| T Consensus 396 ~~~~~~v~~~~~~r~Dgk~i~~~~~~~~~~~~~~~~~~ 433 (434) T protein:vir:62 396 FSRTNRVGFRIWNLLDAQLIHSPFEVPVYKYVLKAPTG 433 (434) T ss_pred hcccCceEEEEEeeecceeecCcccceEEEEEeccCCC Confidence 11123 36778889899775 8888877444 33 No 128 >protein:vir:1583 Length: 351 # NCBI annotation: minor capsid protein # Family: family:all:1522 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695165;swissprot:trembl:o03966;genbank:gi:23455804;uniprot:O03966;genbank:GeneID:955561 Probab=99.20 E-value=1.4e-12 Score=85.52 Aligned_cols=281 Identities=12% Similarity=0.090 Sum_probs=160.4 Q ss_pred CCCCCccccccccccccccccchhHHHH-HHHHHHHHHHHHHhhhhcc---Ccccccc-----ccceEEEEeccCcc--e Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQSGGDKLALFL-KVFGGEVLTAFARTSVTTN---RHIMRSI-----SSGKSAQFPVLGRT--R 69 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~i-e~~~g~V~~~f~~~s~~~~---~~~~~~i-----~~G~tv~i~~iG~~--t 69 (343) ||.+ +. + .+++ |+|...|.+.+.+.+.|.. +++...+ .+|+++.||..+.. . T Consensus 1 MA~T--------~l------s---d~i~PEvf~~yv~~~~~~~~~l~qSG~i~~~~~l~~~~~~~G~~it~P~~~~l~Gd 63 (351) T protein:vir:15 1 MAET--------HL------S---DLIVPEVFGNYVVNQIIKTNRFVQSGILTPDPDLGPHLLEAGTRITVPFLNDLTGD 63 (351) T ss_pred CCce--------ee------e---eeechhHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhcCCCEEEecccccCCCc Confidence 7632 21 1 2455 9999999988888776632 2322222 35999999998764 5 Q ss_pred eeeecCCCcCCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccc Q lcl|NC_011085. 70 AAYLQAGQSLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPA 149 (343) Q Consensus 70 ~~~~~~g~~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~ 149 (343) ..++..+..|.. ..+...+..-+|= ..-..+.+.|+...-+--|++.++.++.+...++..+..+|..|........ 
T Consensus 64 ~~~~~~~~~i~~--~kitt~~~~a~i~-~~~kg~~~tD~a~~~sg~dp~~~i~~q~a~~w~~~~q~~lla~l~gv~~~~~ 140 (351) T protein:vir:15 64 PDNWTDSDDIDV--NNLTSGKQQGIKF-YQTKAYGYTDLGTMISGAPVQETIGNRFAAFWQRADQKTLLSVLKGVMGVTK 140 (351) T ss_pred ccccCCCcccch--heecccceeEEEE-eeccceehhhhhHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhchh Confidence 778888888765 4576666666663 3345588999888878889999999999999999999988877653322111 Q ss_pred cccccccccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhccccc Q lcl|NC_011085. 150 ASNENIAGLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAAL 229 (343) Q Consensus 150 ~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~ 229 (343) .. .+.+......+. .+..-. .+.|.+|..+|-+..= ..-..++++|..|..|.+...+....+. . T Consensus 141 ~~--------~~~~~d~t~~~~-~~~~is----~~~l~~A~~~~GD~~~-~~~~~ivmhS~v~~~L~~~~li~~~~~s-~ 205 (351) T protein:vir:15 141 IA--------NSKVYDQTKVSP-SEPMFG----AKGFTGAIGLMGDLQD-TAFGAIAVNSATYSLMKVQGLIETIQPQ-N 205 (351) T ss_pred hc--------ccceeccccccc-cccccC----HHHHHHHHHHhccccc-cceEEEEEChHHHHHHHhhhhhhhcccc-c Confidence 10 111111111111 111111 2456667777755321 1236788999999999876422222221 1 Q ss_pred cchhcceeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEeee Q lcl|NC_011085. 230 IDPERGSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERA 309 (343) Q Consensus 230 ~~~~~G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~ 309 (343) .++.|+.++|.+|+.+..+|....+.... ......+-+-|++..+..+ .+|.. T Consensus 206 ---~~~~i~t~~G~~VivdD~~p~~~~~~~~~-----------------------~ytsyl~~~GAi~~~~~~~-~ve~~ 258 (351) T protein:vir:15 206 ---GATPFEAYNGLRIVLDDDIEIDLTDKTKP-----------------------VSTSYIFAPGAVRYSTNMR-STETK 258 (351) T ss_pred ---cCcccceecceEEEEcCCCccccCCCCCc-----------------------eeEEEEEecceeeeecCCc-Cccee Confidence 14568999999999999999653322110 1112344555666555443 56777 Q ss_pred eccchh--hhhhhhhh-----hhccceecccceEEEEecCC Q lcl|NC_011085. 
310 RRAEYQ--ADQIIARY-----AMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 310 ~~~~~~--~d~i~~~~-----~~G~~v~rpe~~~~i~~~~g 343 (343) |++... .|.+..+. .+|.+--.+........|.- T Consensus 259 rd~~~~~g~d~l~~r~~~~~hp~G~s~~~~~~~~~~~sPt~ 299 (351) T protein:vir:15 259 YDPLINGGQDVIVQKRVGTIHVAGTSIKASFSPSKASFPTI 299 (351) T ss_pred ecccCCCCceEEEEeeeeeeeeeeeeecccccccCcCCcCh Confidence 776542 23333333 33333322211111111111 No 129 >protein:vir:96762 Length: 632 # NCBI annotation: putative phage-related protein # Family: family:all:21 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039818;genbank:gi:126010917;genbank:GeneID:5076272 Probab=99.20 E-value=4.9e-12 Score=82.57 Aligned_cols=285 Identities=15% Similarity=0.110 Sum_probs=151.5 Q ss_pred CCCCCc----c-----ccccccccccccccchhHHHH-HHHHHHHHHHHHHhhhhccC-ccccccccceEEEEecc-Ccc Q lcl|NC_011085. 1 MADMKG----G-----QQLGKDQGKGQSGGDKLALFL-KVFGGEVLTAFARTSVTTNR-HIMRSISSGKSAQFPVL-GRT 68 (343) Q Consensus 1 ~~~~~~----~-----~~~~t~~g~~~~~~d~~al~i-e~~~g~V~~~f~~~s~~~~~-~~~~~i~~G~tv~i~~i-G~~ 68 (343) ++...+ + .....|.......++--.|.. ++++.++.+..+..++++.+ ++.-+...| .+.||+. +.+ T Consensus 334 ~a~~~G~~arg~~~~~~~l~~ra~~~~t~~~gg~lvp~~~~~~~iie~lr~~s~i~~l~~~~~~~~~g-~~~ip~~~~~~ 412 (632) T protein:vir:96 334 IADASGKEARGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVG-DVDIPKKTSGA 412 (632) T ss_pred HHHhhhhhhhhhhhhHHHHHHhhhhcccccccccccccccchHHHHHHHhhcchhhhhcceEeecCCc-ceEEEEEeCCc Confidence 110000 0 000011111111111112334 44577888888778887776 333333344 5778876 556 Q ss_pred eeeeecCCCcCCCccCCCccceEEEEeeeeeeee-eeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhc Q lcl|NC_011085. 69 RAAYLQAGQSLDDKRKDIKHTEKTIVIDGLLTAD-VLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNM 147 (343) Q Consensus 69 t~~~~~~g~~i~~~~~~~~~~~~~l~iD~~~~~~-~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~ 147 (343) ++.....|+.++.+ +++.+++++.. .++.. +.|.+-=-.++.+|+.+.+..+.+++|++..|+.+|.- ... 
T Consensus 413 ~a~wv~E~~~~~~s--~~~f~~i~l~~--~k~~~~v~iS~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l~G----~G~ 484 (632) T protein:vir:96 413 NFYWIGEDEDVQDS--DFDFTTLSFSP--KTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTG----TGL 484 (632) T ss_pred eeEeecCCcccccc--ccceeeEEeee--eEEEEehhhHHHHHhccchHHHHHHHHHHHHHHHHHHHHHhhcc----cCC Confidence 66667777777654 35566666655 34333 33432112246789999999999999999999988731 111 Q ss_pred cccccccccccCCceeec-ccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhcc Q lcl|NC_011085. 148 PAASNENIAGLGSASILE-VGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANY 226 (343) Q Consensus 148 ~~~~~~~~~g~~~~~~~~-~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~ 226 (343) . ..+.|.-...-+. ....+.. .-++.+.++...+...++....-..+++|..+..|..... .+. T Consensus 485 ~----~~p~Gi~~~~~~~~~~~~~~~--------~~~~~i~~~~~~i~~~~~~~~~~~~~~~~~~~~~l~~~~l---~d~ 549 (632) T protein:vir:96 485 A----NDPVGLLNMTGVPALTYPAGG--------VDWASVVDMETKISTFNADAGRLAYLTSVTQRGAAKKAQV---FDN 549 (632) T ss_pred C----Cccceeeecccccceeccccc--------CCHHHHHHHHHHHhhcccccCccEEEEchhHHHHHHHHhc---cCC Confidence 1 1112211100010 0001111 1134555666777777776555567789988877765321 122 Q ss_pred ccccchhcceeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEE Q lcl|NC_011085. 227 AALIDPERGSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSL 306 (343) Q Consensus 227 ~~~~~~~~G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~ 306 (343) +|.-.+..| .++|++|+.||.+|.... +-++|+... ++......+.+ T Consensus 550 ~G~~i~~~~---~l~G~pv~~s~~ip~~~~----------------------~~gd~s~~~--------i~~~~~~~i~~ 596 (632) T protein:vir:96 550 TGERIWQNN---EVNGYRAEASNQIPADTW----------------------IFGDWSQIV--------IAMWGVLDLKV 596 (632) T ss_pred CCceeecCC---eecccceEeccccccCcE----------------------EEeecceEE--------EEEecceEEEE Confidence 233334343 689999999999984321 012232221 11111122222 Q ss_pred eeeeccchhhhhhhhhhhhccceecccceEEEEecC Q lcl|NC_011085. 
307 ERARRAEYQADQIIARYAMGHGGLRPEAAGALVFTA 342 (343) Q Consensus 307 e~~~~~~~~~d~i~~~~~~G~~v~rpe~~~~i~~~~ 342 (343) ..+.....-.-.++..+.++.++++|++.+.++..+ T Consensus 597 ~~~~~~~~~~v~~~~~~~~d~~v~~~~af~~~k~~A 632 (632) T protein:vir:96 597 DPYTKAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) T ss_pred ccccccccCceEEEEEeecCceeechhhhhheeecC Confidence 222222233346778899999999999999999999 No 130 >protein:vir:1025 Length: 408 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076679;genbank:gi:13095788;genbank:GeneID:920362 Probab=99.20 E-value=6e-12 Score=82.11 Aligned_cols=284 Identities=12% Similarity=0.061 Sum_probs=155.5 Q ss_pred CCC-CCcc----ccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCcccccccc-ceEEEEeccCc--ceeee Q lcl|NC_011085. 1 MAD-MKGG----QQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISS-GKSAQFPVLGR--TRAAY 72 (343) Q Consensus 1 ~~~-~~~~----~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~-G~tv~i~~iG~--~t~~~ 72 (343) +++ +..+ .....+.-..+..++--.+..+.|+.++.+..+..+.++++++..++.+ ..++.++.... ..... T Consensus 97 ~~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~ 176 (408) T protein:vir:10 97 FVNMVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVM 176 (408) T ss_pred HHHHhhcchhhhhhhhhhhhhcccccCCceeccHhHHHHHHHHHHhhchhhhhcceeeccCCcceEEEeeccccccceee Confidence 000 0000 0000111111111111124558999999999999999999988877653 23345554433 33445 Q ss_pred ecCCCcCCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccc Q lcl|NC_011085. 73 LQAGQSLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASN 152 (343) Q Consensus 73 ~~~g~~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~ 152 (343) ...|+.++.+. .++..++++...+.. .-..|.+-=-.++.+|+.+.+.++.++++++..|+.|+.-.. 
T Consensus 177 v~E~~~~~~~~-~~~~~~i~~~~~k~~-~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g---------- 244 (408) T protein:vir:10 177 DAEDGKIPDLD-NPQLTIIKYLIKRYA-GIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVMK---------- 244 (408) T ss_pred ecCcccccccc-CcceeeEEeeeeeEE-eeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccc---------- Confidence 55566665321 234566666654442 223454321223568999999999999999999998873211 Q ss_pred ccccccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhccccccch Q lcl|NC_011085. 153 ENIAGLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAALIDP 232 (343) Q Consensus 153 ~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~ 232 (343) .+... .+ ...++.+++++ ...|+...- .+-.++++|..|..|.+-... +..|.-...+ T Consensus 245 -------~~~~~--~~-------~~~~~~l~~~~---~~~~~~~~~--~~a~~v~n~~~~~~l~~lkd~-~G~~i~~~~~ 302 (408) T protein:vir:10 245 -------AAPKK--PT-------IAKFDDVITMI---NTAVDPAII--ATSSLLTNQSGLNKLALVKTA-EGKYLLEPDP 302 (408) T ss_pred -------ccccc--cc-------cccHHHHHHHH---HHhhhhhhc--cCCEEEEcHHHHHHHHHhhcc-CCceEeccCc Confidence 11110 00 01123333332 123433322 233678999999998764322 2333333345 Q ss_pred hcceeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechh-hheeeeeeeeEEeeeec Q lcl|NC_011085. 233 ERGSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRS-AVGTVKLKDLSLERARR 311 (343) Q Consensus 233 ~~G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-Av~~~~~~~~~~e~~~~ 311 (343) .+|...+++|++|+.+++.+....++. ....++...+ ++..+....++++..+. T Consensus 303 ~~~~~~~l~G~PV~~~~~~~~~~~~~~-------------------------~~~i~~gd~~~~~~~~~~~~~~v~~~~~ 357 (408) T protein:vir:10 303 TKPNSYLIKGKQVIVVADRWLPNTGST-------------------------VYPLYYGDMSQAITLFDRENMSLLPTNI 357 (408) T ss_pred CCCCCceecceeeEEecccccCccCCC-------------------------ceEEEEEehhccEEEEEecceEEEEccc Confidence 677778999999999765432211110 0111233322 34444445556665543 Q ss_pred cc----hhhhhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 
312 AE----YQADQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 312 ~~----~~~d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) .. +....++..++++.++++|++.+.++++.- T Consensus 358 ~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~~~~~~~ 393 (408) T protein:vir:10 358 GAGAFETDTTKIRVIDRFDVKATDSEALVAGSFSAI 393 (408) T ss_pred ccchhhcCceEEEEEEeeccEEeccccEEEEEeecc Confidence 31 223467788899999999999999998885 No 131 >protein:vir:4226 Length: 326 # NCBI annotation: observed 35.2Kd protein # Family: family:all:507 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039681;swissprot:sw:q05223;genbank:gi:9625447;uniprot:Q05223;genbank:GeneID:2942929 Probab=99.19 E-value=1e-11 Score=80.78 Aligned_cols=295 Identities=11% Similarity=0.022 Sum_probs=152.6 Q ss_pred CCC------CCccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEecc-Ccceeeee Q lcl|NC_011085. 1 MAD------MKGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVL-GRTRAAYL 73 (343) Q Consensus 1 ~~~------~~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~i-G~~t~~~~ 73 (343) |+- -.-..+....... ..++.-.+..+.+..++.+..++.+.++.+.+...+. ++..++|+. +.+.+..+ T Consensus 1 ~~~~~~r~~~~~~~~e~~a~~~--~~~~~g~~ip~~~~~~ii~~~~~~s~i~~~~~~~~~~-~~~~~~p~~~~~~~a~~v 77 (326) T protein:vir:42 1 MAVNPDRTTPFLGVNDPKVAQT--GDSMFEGYLEPEQAQDYFAEAEKISIVQQFAQKIPMG-TTGQKIPHWTGDVSASWI 77 (326) T ss_pred CCCCccchhhhcCcchhhheec--cccCCcceechhhHHHHHHHHHhcchhhhhcceeecc-CCceEEEEEeCCcceEEe Confidence 110 0000000000000 0111112456888999999999999888887776654 556778775 45566777 Q ss_pred cCCCcCCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccc Q lcl|NC_011085. 74 QAGQSLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNE 153 (343) Q Consensus 74 ~~g~~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~ 153 (343) ..|+.++.+ +++..++++...+. ..-+.|.+-=-.++.+|+.+.+.++.++++++.+|+.++.- .....| .. 
T Consensus 78 ~Eg~~~~~~--~~~f~~i~~~~~k~-~~~v~iS~ell~~s~~~~~~~i~~~l~~a~~~~~d~a~l~G----~gs~~p-~g 149 (326) T protein:vir:42 78 GEGDMKPIT--KGNMTSQTIAPHKI-ATIFVASAETVRANPANYLGTMRTKVATAFAMAFDNAAING----TDSPFP-TF 149 (326) T ss_pred cCCcccccc--ccceeEEEEeeEEE-EEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHhhcc----cCCCcc-cc Confidence 788888764 46677777776554 33455654222345789999999999999999999988732 110000 00 Q ss_pred cccccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhccccccchh Q lcl|NC_011085. 154 NIAGLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAALIDPE 233 (343) Q Consensus 154 ~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~ 233 (343) ..........+.... +..+......+. .+..+...+. +.....-..+++|..+..|.+-..- +..+.-..... T Consensus 150 i~~~~~~~~~~~~~~-~~~~~~~~~~~~---~~~~~~~~~~--~~~~~~a~~v~n~~~~~~L~~lkd~-~G~~l~~~~~~ 222 (326) T protein:vir:42 150 LAQTTKEVSLVDPDG-TGSNADLTVYDA---VAVNALSLLV--NAGKKWTHTLLDDITEPILNGAKDK-SGRPLFIESTY 222 (326) T ss_pred ccccccccceeeccc-ccccccchhHHH---HHHHHHhhhh--hhccCccEEEEeHHHHHHHHHhhcc-CCceeeccccc Confidence 000000000110000 001100011111 1111222222 2223344567899999998753221 12222111122 Q ss_pred -----cceeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEee Q lcl|NC_011085. 234 -----RGSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLER 308 (343) Q Consensus 234 -----~G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~ 308 (343) ....+.+.|++|+.++.+|..... -+-++|+... ++.+ ..+.++. T Consensus 223 ~~~~~~~~~~~l~G~pv~~~~~~~~~~~~--------------------~~~Gd~s~~~--~~~~--------~~~~v~~ 272 (326) T protein:vir:42 223 TEENSPFRLGRIVARPTILSDHVASGTVV--------------------GYQGDFRQLV--WGQV--------GGLSFDV 272 (326) T ss_pred cCccccccCceeeeeeEEEcCCCCCCceE--------------------EEEeecceEE--EEEe--------cceEEEE Confidence 223457999999999999842210 0112333322 2222 2233333 Q ss_pred eeccc--------------hhh--hhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 
309 ARRAE--------------YQA--DQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 309 ~~~~~--------------~~~--d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) .++.. +.. -.++..++++.+++||++.+.|+...- T Consensus 273 ~~e~~~~~~~~~~~~~~~~~~~d~~~~r~~~~~d~~v~~~~a~~~l~~~~~ 323 (326) T protein:vir:42 273 TDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYAFHCNDKDAFVKLTNVDA 323 (326) T ss_pred eecceeeecccccccchhhhhcCcEEEEEEEEeccEEecccceEEEeeccc Confidence 32211 222 345788899999999999988876655 No 132 >protein:vir:100172 Length: 394 # NCBI annotation: putative major head protein # Family: family:all:21 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025031;genbank:gi:48697264;genbank:GeneID:2948270 Probab=99.19 E-value=7.5e-12 Score=81.58 Aligned_cols=278 Identities=11% Similarity=0.061 Sum_probs=150.5 Q ss_pred CCCCCccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEecc--CcceeeeecCCCc Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVL--GRTRAAYLQAGQS 78 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~i--G~~t~~~~~~g~~ 78 (343) |-......+... ....+++--.+..+.|..++.+..+..+.+++++++.++.+ .+.++|.. +...+.....+.. T Consensus 100 l~~~~~~~~~~~---~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~E~~~ 175 (394) T protein:vir:10 100 IHSHGKVIDNAA---GHVTSTEAGVLIPEEIIYDPTAEVNSVVDLSTLVTKTPVTT-PKGTYPILKRATDRFSSVAELAE 175 (394) T ss_pred Hhccchhhhhhh---cccccccCceeccHHHHHHHHHHHHhhhhhhhhceeeeccC-CceEEEEEecCCCcccccccccc Confidence 111110000000 00111111123458999999999999999999988777654 34555543 4445555555655 Q ss_pred CCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccccc Q lcl|NC_011085. 79 LDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENIAGL 158 (343) Q Consensus 79 i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~g~ 158 (343) .+.+ ..++..++++.+-+.- .-..|.+-=-.++.+|+.+.+.++.++++++..|+.|+.... 
T Consensus 176 ~~~~-~~~~~~~v~l~~~k~~-~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~il~g~g---------------- 237 (394) T protein:vir:10 176 NPAL-AEPEFEQVDWSVSTYR-GAIPLSEEAIADSAVDLTSLVGQSINEKSVNTYNAMIAPVLQ---------------- 237 (394) T ss_pred cccc-ccccceeEEeeeeeeE-eeehhHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhhccc---------------- Confidence 5432 2345566666664442 223454422234668999999999999999999998864321 Q ss_pred CCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhcccc----ccchhc Q lcl|NC_011085. 159 GSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAA----LIDPER 234 (343) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~----~~~~~~ 234 (343) .+... +..+. ..++.+.+.+ ...++... .-.+|++|..|..|.+-..- +..|.- ...... T Consensus 238 -~~~~~--~~~~~-----~~~d~l~~~~---~~~~~~~~----~a~~vmn~~~~~~l~~lkd~-~G~~i~~~~~~~~~~~ 301 (394) T protein:vir:10 238 -SFTAK--ATTTD-----TLVDSLKHIL---NVDLDPAY----SRALVVTQSLFNTLDTLKDK-NGRYLLHDASDSITDG 301 (394) T ss_pred -ccccc--ccccc-----ccHHHHHHHH---Hhhhhhhc----cCEEEecHHHHHHHHHhhcc-CCCeeeeccccccccC Confidence 11110 01111 1122232221 12233222 23678999999998753211 222211 112223 Q ss_pred ceeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeech-hhheeeeeeeeEEeeeeccc Q lcl|NC_011085. 235 GSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHR-SAVGTVKLKDLSLERARRAE 313 (343) Q Consensus 235 G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~-~Av~~~~~~~~~~e~~~~~~ 313 (343) |.-++++|.+|+.+++........ +.+.++... +++..+...+++++..++ . T Consensus 302 ~~~~~L~G~PV~~~~~~~~~~~~~--------------------------~~~i~~gd~s~~~~~~~~~~~~v~~~~~-~ 354 (394) T protein:vir:10 302 TAKGTVLGVPVYVVGDALLGSAAG--------------------------DQKAFVGDLKRGVLFADRQQVTLAWEDS-K 354 (394) T ss_pred CcccccccceeEEecccccCCCCC--------------------------ceEEEEeeccccEEEEeecceEEEEecc-c Confidence 445789999999876543211110 001122221 133334445556665544 3 Q ss_pred hhhhhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 
314 YQADQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 314 ~~~d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) .|...+++.+++++++++|++++.++.+.- T Consensus 355 ~~~~~~~~~~r~d~~~~~~~ai~~~~~~~~ 384 (394) T protein:vir:10 355 IYGRYLGAAFRFGVKQADSNAGYFVTNTDA 384 (394) T ss_pred ccceeEEEEEEeccEEeccccEEEEEeecc Confidence 455678889999999999999999887765 No 133 >protein:vir:81227 Length: 413 # NCBI annotation: gp6, major capsid protein # Family: family:all:585 # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456736;genbank:gi:157168379;hssp:P49861;interpro:IPR006444;uniprot:Q9MBJ9;genbank:GeneID:5580350 Probab=99.18 E-value=4.9e-12 Score=82.59 Aligned_cols=290 Identities=11% Similarity=0.025 Sum_probs=152.5 Q ss_pred CCCCCccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEeccCc-----ceeeeecC Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVLGR-----TRAAYLQA 75 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG~-----~t~~~~~~ 75 (343) +..... .....+-..++..++.-.+..+.|+.++.+.....+.+++++++.++. +.++.+++... ..+.-... T Consensus 105 ~~~~~~-~~~~~~~~~~~~~~~~~~~vp~~~~~~ii~~~~~~~~l~~~~~~~~~~-~~~~~~~~~~~~~~~~~~a~~v~E 182 (413) T protein:vir:81 105 YVAPRV-KAASDPASTATLTDEFQGGYGTTWNRNIIYRRREKLVVADLMDNLTMT-NTTIKYLMEKANRVVEGGFKTVAE 182 (413) T ss_pred hhhhHH-HhhhhhhhhcccccccccccchhhHHHHHHHHhhhhhHHhhcceeecc-CCceeEEEeccccccccccceecC Confidence 000000 000011112222334345567899999999999999999998888765 44566665432 22344555 Q ss_pred CCcCCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccc Q lcl|NC_011085. 76 GQSLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENI 155 (343) Q Consensus 76 g~~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~ 155 (343) |+.++.+. ....+++++.+.+.. 
.-+.|.+- -.+.+.++.+.+.++.++++++..|+.+|.- .....+ + T Consensus 183 g~~~~~~~-~~~f~~i~~~~~k~~-~~~~iS~e-ll~ds~~l~~~i~~~la~~~~~~~d~~~l~G----~G~~~~----~ 251 (413) T protein:vir:81 183 GGKKPYMR-FADFDIVTESLSKIA-GLTKITDE-MIEDYDFLVSYINARLLEELAIEEERQLLLG----DGTGNN----L 251 (413) T ss_pred cccccccC-cccceeeEeeeeeEE-EeehhhHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc----CCCCCc----c Confidence 66654321 123455666665442 22455542 2222345778888899999999999988731 111111 1 Q ss_pred cccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhcccc------- Q lcl|NC_011085. 156 AGLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAA------- 228 (343) Q Consensus 156 ~g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~------- 228 (343) .|. +...... +........+++.+.++...+..+..-.... +|++|..|..|.+-..- +..|.- T Consensus 252 ~Gi-----~~~~~~~--~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~-~vmn~~~~~~l~~lkd~-~G~~l~~~~~~~~ 322 (413) T protein:vir:81 252 TGL-----LKRDGIQ--TLAVSNKDELADSIYKAMTNISLATPFQADA-LVINPLDYQELRLAKDA-NGQYYGGGVFQGQ 322 (413) T ss_pred ccc-----ccccccc--cccccccchhHHHHHHHHHHhhhhccCCCcE-EEEcHHHHHHHHHhhcc-CCceecccccccc Confidence 111 1111000 0000011223455555554444443322233 67899999987543211 112211 Q ss_pred ccchhcceeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEee Q lcl|NC_011085. 229 LIDPERGSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLER 308 (343) Q Consensus 229 ~~~~~~G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~ 308 (343) ......+...+++|.+|+.|+.+|.+.. +-++|+... ..+....++++. 
T Consensus 323 ~~~~~~~~~~~l~G~pv~~s~~~~~~~~----------------------~~gd~~~~~---------~~~~~~~~~v~~ 371 (413) T protein:vir:81 323 YGSGGIMLDPAPWGLRTVQSQVVPVGKP----------------------VVGAFRSAA---------SVLRKGGVRIDS 371 (413) T ss_pred ccccccccCceecceeeEEcCCCCcccE----------------------EEEecccEE---------EEEEecceEEEE Confidence 1111222335799999999999984310 112333222 122233445555 Q ss_pred eeccc-hh-hh--hhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 309 ARRAE-YQ-AD--QIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 309 ~~~~~-~~-~d--~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) .+... +| .+ .+++.++++..+.+|++.+.++++.- T Consensus 372 ~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~~ 410 (413) T protein:vir:81 372 TNTNVDDFENNLITVRAEERVGLMVTFPEAIVQLDVAEV 410 (413) T ss_pred eccccchhhcCcEEEEEEEeeccEEecccceEEEEecCC Confidence 54432 22 23 66777889999999999999998777 No 134 >protein:vir:962 Length: 397 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076616;genbank:gi:13095724;genbank:GeneID:920264 Probab=99.16 E-value=2.4e-12 Score=84.25 Aligned_cols=275 Identities=15% Similarity=0.079 Sum_probs=146.2 Q ss_pred CCCCCccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccc-eEEEEeccCcceeeeecCCCcC Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSG-KSAQFPVLGRTRAAYLQAGQSL 79 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G-~tv~i~~iG~~t~~~~~~g~~i 79 (343) +....... .+.-.+....+...+..+.+..++...- ....++...+..++..+ -.+.++..+...+.....+... 
T Consensus 121 ~~~~~~~~---~~~~~~~~~~~~~~~vp~~~~~~i~~~~-~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~ 196 (397) T protein:vir:96 121 NAFVKSKG---AEKRDGFTSVEGGALIPQELLQPQLEPK-DIVDLSKYVRSVPVNSASGKFPVISKSGSKMATVQQLEKN 196 (397) T ss_pred HHHHHhhh---hhhhhcccccccccchhHHHHHHHHHhh-hhhhHHHhhhhccccccceeEEEEeccCCccccccccccc Confidence 00000000 0000111112222345577788877643 33344555555554432 2344444455555555555554 Q ss_pred CCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccccC Q lcl|NC_011085. 80 DDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENIAGLG 159 (343) Q Consensus 80 ~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~g~~ 159 (343) +.. ..+...++++.+.+. +.-..|.+---.++.+|+.+.+.++.++++++..|..|+.-. + T Consensus 197 ~~~-~~~~~~~i~~~~~~~-~~~~~~s~ell~ds~~~l~~~i~~~l~~~~~~~~~~~i~~g~-----------------g 257 (397) T protein:vir:96 197 PQL-ANPKMVEIDYSVATR-RGYIPISQEMIDDASYDVTGLIADEIQDQSLNTKNADIAAVL-----------------K 257 (397) T ss_pred ccc-ccccccceeecHhHh-hcchhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcc-----------------c Confidence 322 234566677766544 233344332222356789999999999999999998876321 0 Q ss_pred CceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhccccccchhcceeEE Q lcl|NC_011085. 160 SASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAALIDPERGSIRN 239 (343) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~V~~ 239 (343) .++ .+. ...++.+.+.+.. .++. . .+-..|++|..|..|..-.. 
.+..|.-...+.+|.-++ T Consensus 258 ~~~------~~~----~~~~d~~~~~~~~---~~~~--~--~~a~~v~n~~~~~~l~~lkd-~~G~~~~~~~~~~~~~~~ 319 (397) T protein:vir:96 258 TAT------AKS----VVGVDGLKDLINK---EIKK--V--YDVKLFISASMYSELDKLKD-KNGRYLLQDSITAASGKQ 319 (397) T ss_pred ccc------ccc----ccchHHHHHHHHH---hhhh--h--cCcEEEEcHHHHHHHHHhhc-cCCCeEeccCccCCCccc Confidence 000 000 1112333333211 1221 1 23367999999999876321 223343333566677789 Q ss_pred EeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechh-hheeeeeeeeEEeeeeccchhhhh Q lcl|NC_011085. 240 VMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRS-AVGTVKLKDLSLERARRAEYQADQ 318 (343) Q Consensus 240 i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-Av~~~~~~~~~~e~~~~~~~~~d~ 318 (343) ++|.+|+.+++.+...... ..+.++...+ ++.......++++...+ .++... T Consensus 320 l~G~pv~~~~~~~~~~~~~--------------------------~~~~~~gd~~~~~~~~~~~~~~~~~~~~-~~~~~~ 372 (397) T protein:vir:96 320 LLGKEVVVLDDDVIGKSVG--------------------------NVVGFIGDAKAFASFFDRKQVSVSWVDN-NIYGQL 372 (397) T ss_pred ccccceEEecccccCCCCC--------------------------ceEEEEeehhcceEeEeecceEEEEecc-ccccee Confidence 9999999988654322110 0111222222 22333344455555443 445667 Q ss_pred hhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 319 IIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 319 i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) +++.+++|.++++|++.+.|+++.+ T Consensus 373 ~~~~~r~d~~~~~~~a~~~~~~~~a 397 (397) T protein:vir:96 373 LAGIIRYDVKATDKKAGFYVTFTIG 397 (397) T ss_pred EEEEEEEccEEecccceEEEEeecC Confidence 8999999999999999999999999 No 135 >protein:vir:3845 Length: 395 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050151;swissprot:trembl:q9t1f6;genbank:gi:9633043;uniprot:Q9T1F6;genbank:GeneID:1262163 Probab=99.13 E-value=2.3e-11 Score=78.92 Aligned_cols=274 Identities=9% Similarity=0.009 Sum_probs=153.7 Q ss_pred CCCCCccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCcccccccc-ceEEEEeccCcc--eeeeecCCC Q lcl|NC_011085. 
1 MADMKGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISS-GKSAQFPVLGRT--RAAYLQAGQ 77 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~-G~tv~i~~iG~~--t~~~~~~g~ 77 (343) +..+..+ . +-.+ +| -.+..+.|+.++.+..+..+.++.+.+..++.+ ..++.++..... .+.....|+ T Consensus 102 ~~~~~~~-~--~~~~----~g--g~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~ 172 (395) T protein:vir:38 102 KNLVTSG-T--TGTG----NA--GLTIPEDIQLQIRTLTRSFTSLESLANVENVTTSHGSRVYEKLADITPLKDLDDESA 172 (395) T ss_pred HHHHhhc-c--CccC----CC--ceecchhHhhHHHHHHHhhcchhhhcceeeccCCcceEEEEeeccCCcccccccccc Confidence 1111000 0 1111 11 123558899999999999999999988776653 234444444332 223344566 Q ss_pred cCCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccc Q lcl|NC_011085. 78 SLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENIAG 157 (343) Q Consensus 78 ~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~g 157 (343) .++.+. .++..++++...+.- .-..|.+-=-..+.+|+.+.+.++.+++|++..|+.|+.-. T Consensus 173 ~~~~~~-~~~f~~v~~~~~k~~-~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~il~g~---------------- 234 (395) T protein:vir:38 173 LIGDND-DPELTVVKYLIHRYA-GITTVTNTLLKDTVDNIIQWLVNWAAKKDVVTRNAKILEVM---------------- 234 (395) T ss_pred cccccc-ccceeeEEeeeeeeE-eehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcc---------------- Confidence 554321 234455566554442 22344432122356889999999999999999999887321 Q ss_pred cCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhccccccchhccee Q lcl|NC_011085. 158 LGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAALIDPERGSI 237 (343) Q Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~V 237 (343) +.+.... +. ..++.+.+++. ..|+...- .+-.++++|..|..|.+-..- +..|.-...+.+|.. 
T Consensus 235 -g~~~~~~--~~-------~~~~~i~~~~~---~~l~~~~~--~~a~~v~n~~~~~~L~~lkd~-~G~~l~~~~~~~~~~ 298 (395) T protein:vir:38 235 -GKAPKKP--TI-------SQFDNIKDLEN---NTLDPAIE--STSSFITNQSGYNILSKVKDA-DGRYLMQPDVTSPDK 298 (395) T ss_pred -ccccccc--cc-------ccHHHHHHHHH---Hhhhhhhc--CCCEEEEcHHHHHHHHHhhcc-CCceeeccCcCCCCc Confidence 1111110 00 11223333221 12333322 234678999999998753221 233333345667777 Q ss_pred EEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechh-hheeeeeeeeEEeeeeccc--- Q lcl|NC_011085. 238 RNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRS-AVGTVKLKDLSLERARRAE--- 313 (343) Q Consensus 238 ~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-Av~~~~~~~~~~e~~~~~~--- 313 (343) .+++|++|+.+++.+....... .+.++...+ ++..+..+.++++..+... T Consensus 299 ~~l~G~pV~~~~~~~~~~~~~~--------------------------~~i~~gd~~~~~~i~~~~~~~i~~~~~~~~~~ 352 (395) T protein:vir:38 299 YLIDGKPVIRIADKWLPDVSGS--------------------------HPLYFGDLKQGITLFDRQQMQIDTTNVGAGSF 352 (395) T ss_pred ceeccceeEEecccccCcCCCc--------------------------ceEEEEeccccEEEEEecceEEEEeccccchh Confidence 8999999999987654321110 011233322 3444545566666665432 Q ss_pred -hhhhhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 314 -YQADQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 314 -~~~d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) +-...++...+||.++++|++.+.++++.- T Consensus 353 ~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~ 383 (395) T protein:vir:38 353 EHDTTKLRFIDRFDVQLIDDGAFAAASFKTV 383 (395) T ss_pred hcCceEEEEEEeeccEEecccceEEEEeecc Confidence 223567788889999999999999998866 No 136 >protein:vir:1084 Length: 437 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076738;genbank:gi:13095848;genbank:GeneID:920418 Probab=99.13 E-value=1.1e-11 Score=80.58 Aligned_cols=279 Identities=12% Similarity=0.022 Sum_probs=145.0 Q ss_pred CCC----CCccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEecc--Ccceeeeec Q lcl|NC_011085. 
1 MAD----MKGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVL--GRTRAAYLQ 74 (343) Q Consensus 1 ~~~----~~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~i--G~~t~~~~~ 74 (343) +.. +..+. .+..+....++.-.+..+.+...+... ...+.++.++++.++..+ +..+|.. +...+.... T Consensus 141 ~~~~~~~~~~~e---~~~~~~~~~~~~g~lvp~~~~~~i~~~-~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~ 215 (437) T protein:vir:10 141 VTAFADYLKTGE---VRDVTGIALKDGKVIIPETILTPEKEV-HQFPRLGSLVRTESVTTT-TGKLPIFNNSTDLLTAHT 215 (437) T ss_pred hhhhHHHHHhhh---hhhhhhcccccccccchHHHHHHHHHh-hhhhhhhhcceeEeeccC-ceeeEEeecccccccccc Confidence 000 00000 011111111111123446777777654 445566777776655443 3445443 334455555 Q ss_pred CCCcCCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccc Q lcl|NC_011085. 75 AGQSLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNEN 154 (343) Q Consensus 75 ~g~~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~ 154 (343) .+..++.. ..++..++++.+.+. +.-+.|.+-=-..+.+|+.+.+.++.+++|++..|..|+.-. T Consensus 216 e~~~~~e~-~~~~~~~v~~~~~k~-~~~~~is~ell~ds~~~~~~~i~~~l~~~~~~~~~~~i~~g~------------- 280 (437) T protein:vir:10 216 EYGQTTKN-ATPVITPILWDLKTY-TGGYVFSQELISDSSYDWQAELQSRLIELRDNTDDSLIITAL------------- 280 (437) T ss_pred cccccccc-ccccceeeeeehhhe-eeehhhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhh------------- Confidence 55555422 224455566655443 222344431122356789999999999999999998887422 Q ss_pred ccccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhccccccchhc Q lcl|NC_011085. 155 IAGLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAALIDPER 234 (343) Q Consensus 155 ~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~ 234 (343) +++.. ...++.. +..+.+.+ ...|+....+ +-..|++|..|..|..-.. .+..|.-...+.. 
T Consensus 281 ----g~~~~--~~~~~~~------~~~~~~~~---~~~l~~~~~~--~~~~~~~~~~~~~l~~lkd-~~g~~~~~~~~~~ 342 (437) T protein:vir:10 281 ----TDGIK--KTTSTYL------LGDLKKVL---NVTLKPQDSA--AASIVMSQSAYNLFDMATD-AMGRPLLQPNVTA 342 (437) T ss_pred ----ccccc--ccccccc------hhhHHHHH---Hhhhhhhhhc--CCEEEEcHHHHHHHHHhhc-cCCCeeeccCccC Confidence 11111 1111111 12223322 1234444332 3356999999998865321 1223333334567 Q ss_pred ceeEEEeceEEEEeccc--cccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEeeeecc Q lcl|NC_011085. 235 GSIRNVMGFEVVEVPHL--TAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERARRA 312 (343) Q Consensus 235 G~V~~i~Gf~V~~sn~l--p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~~~ 312 (343) |..++++|.+|+.+++. |..+.+.... +-++|+. ++..+..++++++...+- T Consensus 343 ~~~~~l~G~pv~~~~~~~~~~~~~~~~~~-----------------~~gd~~~---------~~~~~~r~~~~~~~~~~~ 396 (437) T protein:vir:10 343 ATGYTLLGKTVVIVDDKLFPSASAGDVNI-----------------VVAPLKK---------AVINFKLTEITGQFQDTY 396 (437) T ss_pred CCCcccccceeEEecccccCCcCCCceEE-----------------EEeeccc---------cEEEEeeeceEEEEeccc Confidence 77789999999998765 3221110000 1123322 233333445556555444 Q ss_pred chhhhhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 313 EYQADQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 313 ~~~~d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) ..+...+++.++|+.++++|++.+.|+...- T Consensus 397 ~~~~~~~~~~~r~d~~~~~~~a~~~l~~~~~ 427 (437) T protein:vir:10 397 DIWYKQLGIFLRQNVVQASKDLIVNLTGKLK 427 (437) T ss_pred ccccceeeEEEEEccEEecccceEEEEeecc Confidence 5566778888999999999999998875544 No 137 >protein:vir:7855 Length: 497 # NCBI annotation: gp12 # Family: family:all:585 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817462;genbank:gi:29565891;genbank:GeneID:1259081 Probab=99.11 E-value=1.7e-11 Score=79.56 Aligned_cols=299 Identities=13% Similarity=0.097 Sum_probs=150.1 Q ss_pred CCCCCccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEecc--CcceeeeecCCCc Q lcl|NC_011085. 
1 MADMKGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVL--GRTRAAYLQAGQS 78 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~i--G~~t~~~~~~g~~ 78 (343) +.+....... -+.-..+.+++--.+..+.|..++.+..++.+.+++++++..+.++ ++.||+. +...+.....|+. T Consensus 138 ~~~~~~~~~~-~~~~~~~~~~~gg~~vp~~~~~~ii~~~~~~~~i~~l~~~~~~~~~-~~~~~~~~~~~~~a~wv~E~~~ 215 (497) T protein:vir:78 138 FADGETAPAA-IGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSP-NLSYLTESAAHNNAAAVAEAGT 215 (497) T ss_pred HhhhhhhHHH-HHhhhcccCcccccccchhhhHHHHHHHHhhhhHHhhccccccCCC-ceEEEEEcCCCCcceeeccCcc Confidence 0000000000 0000011111111245689999999999999999999988777654 5888874 3456677777887 Q ss_pred CCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccccc Q lcl|NC_011085. 79 LDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENIAGL 158 (343) Q Consensus 79 i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~g~ 158 (343) ++.+ +++.+++++...+.-. -..|.+ +-.+.+.++.+.+.++.++++++..|+.+|.- .... .+.+.... T Consensus 216 ~~~s--~~~f~~i~~~~~k~a~-~~~iS~-ell~d~~~l~~~i~~~l~~~i~~~~d~~~l~G----~G~~-~p~Gil~~- 285 (497) T protein:vir:78 216 YPFS--SEEFARVYEQVGKVAN-ALTITD-EGLRDAPELFNFVQGRLLEGIQRKEEVQLLAG----GGYP-GVNGLLQR- 285 (497) T ss_pred cccc--cccceeeEeeeeeeEe-ecHhHH-HHHHhHHHHHHHHHHHHHHHHHHHHHHHhhcC----CCcc-cccccccc- Confidence 7653 3566666666554422 233432 22233456888899999999999999988731 0000 00000000 Q ss_pred CCceeecccccccc---------------cchHHHH------------------------------HHHHHHHHHHHHHH Q lcl|NC_011085. 159 GSASILEVGAKGDL---------------TSPVELG------------------------------KAVIAQLTIARAKL 193 (343) Q Consensus 159 ~~~~~~~~~~~~~~---------------~~~~~~~------------------------------~~i~~~l~~a~~~L 193 (343) ..+..+..+..... +...... 
..++..++.+...+ T Consensus 286 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 365 (497) T protein:vir:78 286 STGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDI 365 (497) T ss_pred cccccccccccchhhhhhhhhhhhhhcccccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhh Confidence 00000000000000 0000000 00111122222222 Q ss_pred hhcCCCcCCcEEEeCHHHHHHHhccchhhhhccc------cccchhcceeEEEeceEEEEeccccccccccccccccccc Q lcl|NC_011085. 194 TSNYVPSADRTFYTTPEVYSAILAALMPNAANYA------ALIDPERGSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQ 267 (343) Q Consensus 194 d~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~------~~~~~~~G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~ 267 (343) ..... ...-..|++|..|..|.+-..-. ..|. +......+...+++|.+|++++.+|.+.. T Consensus 366 ~~~~~-~~~~~~vmn~~~~~~l~~lkd~~-G~~i~~~~~~~~~~~~~~~~~~l~G~pV~~t~~~~~~~~----------- 432 (497) T protein:vir:78 366 QLTLF-QTPNAVVMNPRDWELLRLTKDAN-GQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTI----------- 432 (497) T ss_pred hhhcc-cCCCeEEEchHHHHHHHHhhcCC-CceeccCcccccccccccCCceeeceeeEecCCCCCCce----------- Confidence 11111 00114779999998875432211 1121 11111222335899999999999984310 Q ss_pred cccccccccccccccccceEeEeechhhheeeeeeeeEEeeeec--cchhh--hhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 268 KHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERARR--AEYQA--DQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 268 ~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~~--~~~~~--d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) +-++|... ++..+....++++.... ..+.. 
..|++..+++..+++|++.+.+.++++ T Consensus 433 -----------~~Gd~~~~--------~~~i~~r~~~~v~~~~~~~~~f~~n~v~~r~~~r~~~~v~~p~A~~~l~~~~~ 493 (497) T protein:vir:78 433 -----------LVGHFAPS--------VIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKG 493 (497) T ss_pred -----------EEeecccc--------eEEEEEecccEEEeecccchhhhcCcEEEEEEEeecceeeccccEEEEEecCC Confidence 11233222 22233334444444322 11222 347777889999999999999999999 No 138 >protein:vir:101650 Length: 497 # NCBI annotation: gp13 # Family: family:all:585 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654768;genbank:gi:109302766;genbank:GeneID:4156084 Probab=99.11 E-value=1.7e-11 Score=79.56 Aligned_cols=299 Identities=13% Similarity=0.097 Sum_probs=150.1 Q ss_pred CCCCCccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEecc--CcceeeeecCCCc Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVL--GRTRAAYLQAGQS 78 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~i--G~~t~~~~~~g~~ 78 (343) +.+....... -+.-..+.+++--.+..+.|..++.+..++.+.+++++++..+.++ ++.||+. +...+.....|+. T Consensus 138 ~~~~~~~~~~-~~~~~~~~~~~gg~~vp~~~~~~ii~~~~~~~~i~~l~~~~~~~~~-~~~~~~~~~~~~~a~wv~E~~~ 215 (497) T protein:vir:10 138 FADGETAPAA-IGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSP-NLSYLTESAAHNNAAAVAEAGT 215 (497) T ss_pred HhhhhhhHHH-HHhhhcccCcccccccchhhhHHHHHHHHhhhhHHhhccccccCCC-ceEEEEEcCCCCcceeeccCcc Confidence 0000000000 0000011111111245689999999999999999999988777654 5888874 3456677777887 Q ss_pred CCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccccc Q lcl|NC_011085. 79 LDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENIAGL 158 (343) Q Consensus 79 i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~g~ 158 (343) ++.+ +++.+++++...+.-. -..|.+ +-.+.+.++.+.+.++.++++++..|+.+|.- .... .+.+.... 
T Consensus 216 ~~~s--~~~f~~i~~~~~k~a~-~~~iS~-ell~d~~~l~~~i~~~l~~~i~~~~d~~~l~G----~G~~-~p~Gil~~- 285 (497) T protein:vir:10 216 YPFS--SEEFARVYEQVGKVAN-ALTITD-EGLRDAPELFNFVQGRLLEGIQRKEEVQLLAG----GGYP-GVNGLLQR- 285 (497) T ss_pred cccc--cccceeeEeeeeeeEe-ecHhHH-HHHHhHHHHHHHHHHHHHHHHHHHHHHHhhcC----CCcc-cccccccc- Confidence 7653 3566666666554422 233432 22233456888899999999999999988731 0000 00000000 Q ss_pred CCceeecccccccc---------------cchHHHH------------------------------HHHHHHHHHHHHHH Q lcl|NC_011085. 159 GSASILEVGAKGDL---------------TSPVELG------------------------------KAVIAQLTIARAKL 193 (343) Q Consensus 159 ~~~~~~~~~~~~~~---------------~~~~~~~------------------------------~~i~~~l~~a~~~L 193 (343) ..+..+..+..... +...... ..++..++.+...+ T Consensus 286 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 365 (497) T protein:vir:10 286 STGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDI 365 (497) T ss_pred cccccccccccchhhhhhhhhhhhhhcccccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhh Confidence 00000000000000 0000000 00111122222222 Q ss_pred hhcCCCcCCcEEEeCHHHHHHHhccchhhhhccc------cccchhcceeEEEeceEEEEeccccccccccccccccccc Q lcl|NC_011085. 194 TSNYVPSADRTFYTTPEVYSAILAALMPNAANYA------ALIDPERGSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQ 267 (343) Q Consensus 194 d~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~------~~~~~~~G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~ 267 (343) ..... ...-..|++|..|..|.+-..-. ..|. +......+...+++|.+|++++.+|.+.. T Consensus 366 ~~~~~-~~~~~~vmn~~~~~~l~~lkd~~-G~~i~~~~~~~~~~~~~~~~~~l~G~pV~~t~~~~~~~~----------- 432 (497) T protein:vir:10 366 QLTLF-QTPNAVVMNPRDWELLRLTKDAN-GQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTI----------- 432 (497) T ss_pred hhhcc-cCCCeEEEchHHHHHHHHhhcCC-CceeccCcccccccccccCCceeeceeeEecCCCCCCce----------- Confidence 11111 00114779999998875432211 1121 11111222335899999999999984310 Q ss_pred cccccccccccccccccceEeEeechhhheeeeeeeeEEeeeec--cchhh--hhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 
268 KHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERARR--AEYQA--DQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 268 ~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~~--~~~~~--d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) +-++|... ++..+....++++.... ..+.. ..|++..+++..+++|++.+.+.++++ T Consensus 433 -----------~~Gd~~~~--------~~~i~~r~~~~v~~~~~~~~~f~~n~v~~r~~~r~~~~v~~p~A~~~l~~~~~ 493 (497) T protein:vir:10 433 -----------LVGHFAPS--------VIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKG 493 (497) T ss_pred -----------EEeecccc--------eEEEEEecccEEEeecccchhhhcCcEEEEEEEeecceeeccccEEEEEecCC Confidence 11233222 22233334444444322 11222 347777889999999999999999999 No 139 >protein:vir:9704 Length: 394 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795466;genbank:gi:28876225;genbank:GeneID:1257769 Probab=99.11 E-value=1.7e-11 Score=79.60 Aligned_cols=273 Identities=12% Similarity=0.057 Sum_probs=152.4 Q ss_pred CCCCCcccc-ccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEecc--CcceeeeecCCC Q lcl|NC_011085. 1 MADMKGGQQ-LGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVL--GRTRAAYLQAGQ 77 (343) Q Consensus 1 ~~~~~~~~~-~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~i--G~~t~~~~~~g~ 77 (343) .+....... ..-+.|-...+|- .+..+.|..++.+..+..+.+++++++.++.+|+ .++|.. +..++..+..|. T Consensus 115 ~~~~~~~~~~~~~~~~~t~~~gg--~liP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~-~~~~~~~~~~~~~~~v~E~~ 191 (394) T protein:vir:97 115 LMPINETTPVEPQKDGIKKENAK--PVSSEEILYTPAREVKTVVDLKPFTTVYQAKKAS-GKYPVLQRATTKMVTVAELE 191 (394) T ss_pred HHHHHhhhhhhhhcccccccccc--ccChHHHHHHHHHHhhhhhhhhhhceeeeccCcc-eEEEEEecCCCccceecccc Confidence 000000000 0001011111111 2455889999999888889999998887765543 555654 445566666676 Q ss_pred cCCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccc Q lcl|NC_011085. 
78 SLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENIAG 157 (343) Q Consensus 78 ~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~g 157 (343) ..+.+ ..++..++++...+. +.-..|.+-=-.++.+|+.+.+..+.+++|++..|+.|+..+. T Consensus 192 ~~~~~-~~~~~~~v~l~~~k~-~~~i~is~ell~ds~~~~~~~i~~~la~~~~~~~~~~i~~g~~--------------- 254 (394) T protein:vir:97 192 KNPAL-AKPDFKDVAWNIDTY-RGAIPLSQESIDDADVDLVGIVSESISQIKVNTTNDAIAKVLK--------------- 254 (394) T ss_pred ccccc-ccccceeEEeehhhe-eeehhhHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhhccc--------------- Confidence 66432 234566677766543 2334444321223567899999999999999999998864211 Q ss_pred cCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhccccccchhccee Q lcl|NC_011085. 158 LGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAALIDPERGSI 237 (343) Q Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~V 237 (343) .+ +..... .++.+++++. ..++. ...-.+|++|..|..|..-..- +..|.-...+.+|.- T Consensus 255 --~~------~~~~~~----~~~~~~~~~~---~~~~~----~~~a~~v~n~~~~~~l~~lkd~-~G~~i~~~~~~~~~~ 314 (394) T protein:vir:97 255 --SF------TTKTVK----NLDEIKALLN---GGFDP----AYNVSLIVSQSFYQTLDTLKDG-NGRYLLQDDITAVSG 314 (394) T ss_pred --cc------cccccc----cHHHHHHHHH---hhhhh----hhCCEEEEcHHHHHHHHHhhcc-CCCeeeecCcCCCCC Confidence 00 000011 1223333221 11221 1233477999999988753211 222322234566767 Q ss_pred EEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEeeeeccchhhh Q lcl|NC_011085. 238 RNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERARRAEYQAD 317 (343) Q Consensus 238 ~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~~~~~~~d 317 (343) ++++|++|+.+++.+..... -+-++|+.. +..+..++++++...+ .++.. 
T Consensus 315 ~~l~G~pv~~~~~~~~~~~~--------------------~~~gd~~~~---------~~~~~~~~~~~~~~~~-~~~~~ 364 (394) T protein:vir:97 315 KVLLGKPVFVLSDEVLGANK--------------------AFIGDFKRG---------VLFADRKDLGLRWADN-EIYGQ 364 (394) T ss_pred ceeccceeEEecccccCCcc--------------------EEEeecccc---------EEEEEecceEEEEecc-cccce Confidence 89999999997765422110 011233322 2223334455554443 44566 Q ss_pred hhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 318 QIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 318 ~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) .+++.+++|.++.+|++.+.|+++.= T Consensus 365 ~~~~~~r~d~~v~~~~a~~~~~~~~~ 390 (394) T protein:vir:97 365 YLQAVLRFGVSKVDDKAGYYVTFTPE 390 (394) T ss_pred eEEEEEEEccEEecccceEEEEeccc Confidence 78999999999999999999999766 No 140 >protein:vir:93616 Length: 645 # NCBI annotation: putative major head protein/prohead protease # Family: family:all:21 # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449293;genbank:gi:157166041;goa:Q6H9U8;interpro:IPR006433;uniprot:Q6H9U8;genbank:GeneID:5580438 Probab=99.09 E-value=5.7e-11 Score=76.77 Aligned_cols=290 Identities=11% Similarity=0.060 Sum_probs=146.5 Q ss_pred CCCCCccc--------------cccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCcccc--cccc-ceEEEEe Q lcl|NC_011085. 1 MADMKGGQ--------------QLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMR--SISS-GKSAQFP 63 (343) Q Consensus 1 ~~~~~~~~--------------~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~--~i~~-G~tv~i~ 63 (343) +|...++. .+.+.++. +|.. +..+.|.+++.+..+..++++.+-... ...+ --.++|| T Consensus 315 ~a~~~~~~~~~~~~~~~~a~~~~~~~~~~~---~Gg~--~vp~~~~~~ii~~l~~~svv~~l~~~~~~~~~~~~~~~~ip 389 (645) T protein:vir:93 315 VARRQYPDDSRLHHVLKSAVGAGTTTDPQW---AGSL--SEYQEYAQDFIDYLRPQTIIGRFGQGGIPALRQVPFNIRVH 389 (645) T ss_pred HHHhhcccchhhhhhhhhhhhccccccccc---cCCc--cCchhhHHHHHHhhhhhhhHHhhccccccccccccCceeee Confidence 11111100 00011111 1211 234788999998888888887664322 1111 1246777 Q ss_pred cc-CcceeeeecCCCcCCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011085. 
64 VL-GRTRAAYLQAGQSLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELA 142 (343) Q Consensus 64 ~i-G~~t~~~~~~g~~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~ 142 (343) +. +.+++.....|+.++.+ +++.+++++..-+. +.-..|.+-=-.++.+|+.+.+..+.+++|++.+|+.+|..-. T Consensus 390 ~~t~~~~a~wv~Eg~~~~~s--~~~f~~v~l~~~kl-a~~~~iS~ell~ds~~~~~~~i~~~l~~aia~~~d~a~l~g~g 466 (645) T protein:vir:93 390 AQVSGGAAGWVGEGKTKPLT--KFDFESITFSHAKV-SAIAVLTEELIRFSSPAADALVRNALAEAVVARLDTDFVDPKK 466 (645) T ss_pred eeecCcceEEeccCcccccc--ccceeEEEEeeEEE-EEeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHhhcCCC Confidence 64 55667777778877654 35666666655322 2223343311124668999999999999999999998873110 Q ss_pred hhhhccccccccccccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhh Q lcl|NC_011085. 143 GLCNMPAASNENIAGLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPN 222 (343) Q Consensus 143 ~~a~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~ 222 (343) +......+.+.. .+..+.... . .....+..+...|..+++...+-+.|++|..+..|.+-..- T Consensus 467 -----~~~~~~~p~gi~------~~~~~~~~~-~----~~~~d~~~~~~~~~~a~~~~~~a~~vmn~~~~~~L~~lkd~- 529 (645) T protein:vir:93 467 -----AAVADVSPASIT------HDVKGTASS-G----NPDADAEAAFGQFVAANLQPTGAVWLMSSTNALALSMRKNA- 529 (645) T ss_pred -----cccCCcccccee------ccccccccc-c----chHHHHHHHHHHHHhcCCCccccEEEEcHHHHHHHHhcccc- Confidence 000000111110 010000000 0 11233455566677777765666778899999998764322 Q ss_pred hhccc-cccchhcceeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeee Q lcl|NC_011085. 223 AANYA-ALIDPERGSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKL 301 (343) Q Consensus 223 ~~~~~-~~~~~~~G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~ 301 (343) +..+. -.... .| ++++|++|+.|+++|..-. + ++|+.. ++.....+..... 
T Consensus 530 ~G~~~~~~~~~-~~--~tL~G~PV~~s~~vp~~~~--------------~---------gd~s~~--~ig~~~~v~i~~s 581 (645) T protein:vir:93 530 LGQKEYPDMTL-LG--GSFQGLPVIVSQYVGDQLV--------------L---------VNAPDI--YLADDGGVAVDMS 581 (645) T ss_pred CCceeecCCCC-CC--ceeeceeeEEeccCCccee--------------E---------eccccE--EEEEecceEEEee Confidence 11211 01111 22 4899999999999984210 0 111111 1111111111111 Q ss_pred eeeEEeeeecc--------------chhh--hhhhhhhhhccceecccceEEEEecC-C Q lcl|NC_011085. 302 KDLSLERARRA--------------EYQA--DQIIARYAMGHGGLRPEAAGALVFTA-G 343 (343) Q Consensus 302 ~~~~~e~~~~~--------------~~~~--d~i~~~~~~G~~v~rpe~~~~i~~~~-g 343 (343) .+-+++..-.+ .++. -.|+..++++.+++||+++++|+--. | T Consensus 582 ~~a~~~~~~~~~~~~~~~~~~~~v~lf~~d~vaira~~r~d~~~~~p~a~~~lt~~~~g 640 (645) T protein:vir:93 582 REASLEMQSEPTGDSTTPSPVELVSMFQTGSVAIRAERWINWRRRRTAAVAVITGVNYG 640 (645) T ss_pred cceeEEEeecccccccccccccchhHhhcCceEEEEEEEEcceeeCccceEEEecccCC Confidence 11122211111 1222 24677788899999999999887221 1 No 141 >protein:vir:102873 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338137;genbank:gi:77020198;genbank:GeneID:3703782 Probab=99.09 E-value=4.9e-11 Score=77.09 Aligned_cols=284 Identities=11% Similarity=0.057 Sum_probs=152.5 Q ss_pred CCCCCccc--------cccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccce-EEEEec-cCccee Q lcl|NC_011085. 1 MADMKGGQ--------QLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGK-SAQFPV-LGRTRA 70 (343) Q Consensus 1 ~~~~~~~~--------~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~-tv~i~~-iG~~t~ 70 (343) |.+..-.. ..-.+.......++--.+..+.|..++.+..+..+.++++++...+.++. +..++. 
.+...+ T Consensus 84 l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~~a 163 (392) T protein:vir:10 84 LRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPF 163 (392) T ss_pred HhcccccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCccc Confidence 11000000 00000000000011111345889999999999999999999988876432 334444 344566 Q ss_pred eeecCCCcCCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccc Q lcl|NC_011085. 71 AYLQAGQSLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAA 150 (343) Q Consensus 71 ~~~~~g~~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~ 150 (343) .....|..++.+ ..++.+++++..-+. +.-..|.+-=-.++.+|+.+.+.++.++++++..|..++.-.. T Consensus 164 ~~v~E~~~~~~~-~~~~~~~v~l~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g-------- 233 (392) T protein:vir:10 164 AEITEMGEIPET-DNPKFSNVQYAVKDR-AGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIE-------- 233 (392) T ss_pred eeeccccccccc-ccccceeEEeeeeeE-EEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccc-------- Confidence 677777766532 124566777766544 3334555422223568999999999999999999998863211 Q ss_pred ccccccccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhcccccc Q lcl|NC_011085. 151 SNENIAGLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAALI 230 (343) Q Consensus 151 ~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~ 230 (343) .+.. ... ..++.+++++ ...|+....+ +-..|++|..|..|.+-.. .+..|.-.. 
T Consensus 234 ---------~~~~------~~~----~~~d~i~~~~---~~~l~~~~~~--~a~~vm~~~~~~~L~~lkd-~~G~~l~~~ 288 (392) T protein:vir:10 234 ---------KLTK------QAI----KSLDDIKDVL---NVKLDPAISP--NAILLTNQDGFNYLDKLKD-KDGKYILQS 288 (392) T ss_pred ---------cccc------cCc----cCHHHHHHHH---HHhhhhhhcc--CCEEEEcHHHHHHHHHhhc-cCCCeEeec Confidence 0000 000 1123344333 1344444432 3457899999999965321 122332223 Q ss_pred chhcceeEEEeceEEEE-e-ccccccccccccccccccccccccccccccccccccceEeEeechh-hheeeeeeeeEEe Q lcl|NC_011085. 231 DPERGSIRNVMGFEVVE-V-PHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRS-AVGTVKLKDLSLE 307 (343) Q Consensus 231 ~~~~G~V~~i~Gf~V~~-s-n~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-Av~~~~~~~~~~e 307 (343) .+..|.-++++|.+++. + +++|...... ....+.++...+ ++..+....++++ T Consensus 289 ~~~~~~~~tllG~~~v~~~~~~~~~~~~~~------------------------~~~~~~~~gdfs~~~~i~~~~~~~~~ 344 (392) T protein:vir:10 289 DPTQKNKKLFAGTNPVVVVSNRFLKSKGTT------------------------AKKAPLIIGDLKEAIVLFKREDMELA 344 (392) T ss_pred CccCCccccccCcccEEEecccccCCCccc------------------------CCceEEEEEehhceEEEEeecceEEE Confidence 45567778899986554 3 3333211000 001112333322 2333444445555 Q ss_pred eeec--cchhhh--hhhhhhhhccceecccceEEEEe--------cCC Q lcl|NC_011085. 308 RARR--AEYQAD--QIIARYAMGHGGLRPEAAGALVF--------TAG 343 (343) Q Consensus 308 ~~~~--~~~~~d--~i~~~~~~G~~v~rpe~~~~i~~--------~~g 343 (343) ..+. 
..+..+ .+++.+++|.++++|++.+.+++ +|| T Consensus 345 ~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~a~~~~~~~ 392 (392) T protein:vir:10 345 STDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSAPVEQPQG 392 (392) T ss_pred EeccccchhhcCceEEEEEEeeccEEecccceEEEEecccccccCCCC Confidence 4432 222223 37788889999999999999877 556 No 142 >protein:vir:105004 Length: 392 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459969;genbank:gi:85701384;genbank:GeneID:3882145 Probab=99.09 E-value=4.9e-11 Score=77.09 Aligned_cols=284 Identities=11% Similarity=0.057 Sum_probs=152.5 Q ss_pred CCCCCccc--------cccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccce-EEEEec-cCccee Q lcl|NC_011085. 1 MADMKGGQ--------QLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGK-SAQFPV-LGRTRA 70 (343) Q Consensus 1 ~~~~~~~~--------~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~-tv~i~~-iG~~t~ 70 (343) |.+..-.. ..-.+.......++--.+..+.|..++.+..+..+.++++++...+.++. +..++. .+...+ T Consensus 84 l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~~a 163 (392) T protein:vir:10 84 LRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPF 163 (392) T ss_pred HhcccccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCccc Confidence 11000000 00000000000011111345889999999999999999999988876432 334444 344566 Q ss_pred eeecCCCcCCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccc Q lcl|NC_011085. 71 AYLQAGQSLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAA 150 (343) Q Consensus 71 ~~~~~g~~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~ 150 (343) .....|..++.+ ..++.+++++..-+. +.-..|.+-=-.++.+|+.+.+.++.++++++..|..++.-.. 
T Consensus 164 ~~v~E~~~~~~~-~~~~~~~v~l~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g-------- 233 (392) T protein:vir:10 164 AEITEMGEIPET-DNPKFSNVQYAVKDR-AGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIE-------- 233 (392) T ss_pred eeeccccccccc-ccccceeEEeeeeeE-EEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccc-------- Confidence 677777766532 124566777766544 3334555422223568999999999999999999998863211 Q ss_pred ccccccccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhcccccc Q lcl|NC_011085. 151 SNENIAGLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAALI 230 (343) Q Consensus 151 ~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~ 230 (343) .+.. ... ..++.+++++ ...|+....+ +-..|++|..|..|.+-.. .+..|.-.. T Consensus 234 ---------~~~~------~~~----~~~d~i~~~~---~~~l~~~~~~--~a~~vm~~~~~~~L~~lkd-~~G~~l~~~ 288 (392) T protein:vir:10 234 ---------KLTK------QAI----KSLDDIKDVL---NVKLDPAISP--NAILLTNQDGFNYLDKLKD-KDGKYILQS 288 (392) T ss_pred ---------cccc------cCc----cCHHHHHHHH---HHhhhhhhcc--CCEEEEcHHHHHHHHHhhc-cCCCeEeec Confidence 0000 000 1123344333 1344444432 3457899999999965321 122332223 Q ss_pred chhcceeEEEeceEEEE-e-ccccccccccccccccccccccccccccccccccccceEeEeechh-hheeeeeeeeEEe Q lcl|NC_011085. 231 DPERGSIRNVMGFEVVE-V-PHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRS-AVGTVKLKDLSLE 307 (343) Q Consensus 231 ~~~~G~V~~i~Gf~V~~-s-n~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-Av~~~~~~~~~~e 307 (343) .+..|.-++++|.+++. + +++|...... ....+.++...+ ++..+....++++ T Consensus 289 ~~~~~~~~tllG~~~v~~~~~~~~~~~~~~------------------------~~~~~~~~gdfs~~~~i~~~~~~~~~ 344 (392) T protein:vir:10 289 DPTQKNKKLFAGTNPVVVVSNRFLKSKGTT------------------------AKKAPLIIGDLKEAIVLFKREDMELA 344 (392) T ss_pred CccCCccccccCcccEEEecccccCCCccc------------------------CCceEEEEEehhceEEEEeecceEEE Confidence 45567778899986554 3 3333211000 001112333322 2333444445555 Q ss_pred eeec--cchhhh--hhhhhhhhccceecccceEEEEe--------cCC Q lcl|NC_011085. 
308 RARR--AEYQAD--QIIARYAMGHGGLRPEAAGALVF--------TAG 343 (343) Q Consensus 308 ~~~~--~~~~~d--~i~~~~~~G~~v~rpe~~~~i~~--------~~g 343 (343) ..+. ..+..+ .+++.+++|.++++|++.+.+++ +|| T Consensus 345 ~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~a~~~~~~~ 392 (392) T protein:vir:10 345 STDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSAPVEQPQG 392 (392) T ss_pred EeccccchhhcCceEEEEEEeeccEEecccceEEEEecccccccCCCC Confidence 4432 222223 37788889999999999999877 556 No 143 >protein:vir:107593 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338188;genbank:gi:77020144;genbank:GeneID:3703724 Probab=99.09 E-value=4.9e-11 Score=77.09 Aligned_cols=284 Identities=11% Similarity=0.057 Sum_probs=152.5 Q ss_pred CCCCCccc--------cccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccce-EEEEec-cCccee Q lcl|NC_011085. 1 MADMKGGQ--------QLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGK-SAQFPV-LGRTRA 70 (343) Q Consensus 1 ~~~~~~~~--------~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~-tv~i~~-iG~~t~ 70 (343) |.+..-.. ..-.+.......++--.+..+.|..++.+..+..+.++++++...+.++. +..++. .+...+ T Consensus 84 l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~~a 163 (392) T protein:vir:10 84 LRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPF 163 (392) T ss_pred HhcccccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCccc Confidence 11000000 00000000000011111345889999999999999999999988876432 334444 344566 Q ss_pred eeecCCCcCCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccc Q lcl|NC_011085. 71 AYLQAGQSLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAA 150 (343) Q Consensus 71 ~~~~~g~~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~ 150 (343) .....|..++.+ ..++.+++++..-+. +.-..|.+-=-.++.+|+.+.+.++.++++++..|..++.-.. 
T Consensus 164 ~~v~E~~~~~~~-~~~~~~~v~l~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g-------- 233 (392) T protein:vir:10 164 AEITEMGEIPET-DNPKFSNVQYAVKDR-AGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIE-------- 233 (392) T ss_pred eeeccccccccc-ccccceeEEeeeeeE-EEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccc-------- Confidence 677777766532 124566777766544 3334555422223568999999999999999999998863211 Q ss_pred ccccccccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhcccccc Q lcl|NC_011085. 151 SNENIAGLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAALI 230 (343) Q Consensus 151 ~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~ 230 (343) .+.. ... ..++.+++++ ...|+....+ +-..|++|..|..|.+-.. .+..|.-.. T Consensus 234 ---------~~~~------~~~----~~~d~i~~~~---~~~l~~~~~~--~a~~vm~~~~~~~L~~lkd-~~G~~l~~~ 288 (392) T protein:vir:10 234 ---------KLTK------QAI----KSLDDIKDVL---NVKLDPAISP--NAILLTNQDGFNYLDKLKD-KDGKYILQS 288 (392) T ss_pred ---------cccc------cCc----cCHHHHHHHH---HHhhhhhhcc--CCEEEEcHHHHHHHHHhhc-cCCCeEeec Confidence 0000 000 1123344333 1344444432 3457899999999965321 122332223 Q ss_pred chhcceeEEEeceEEEE-e-ccccccccccccccccccccccccccccccccccccceEeEeechh-hheeeeeeeeEEe Q lcl|NC_011085. 231 DPERGSIRNVMGFEVVE-V-PHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRS-AVGTVKLKDLSLE 307 (343) Q Consensus 231 ~~~~G~V~~i~Gf~V~~-s-n~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-Av~~~~~~~~~~e 307 (343) .+..|.-++++|.+++. + +++|...... ....+.++...+ ++..+....++++ T Consensus 289 ~~~~~~~~tllG~~~v~~~~~~~~~~~~~~------------------------~~~~~~~~gdfs~~~~i~~~~~~~~~ 344 (392) T protein:vir:10 289 DPTQKNKKLFAGTNPVVVVSNRFLKSKGTT------------------------AKKAPLIIGDLKEAIVLFKREDMELA 344 (392) T ss_pred CccCCccccccCcccEEEecccccCCCccc------------------------CCceEEEEEehhceEEEEeecceEEE Confidence 45567778899986554 3 3333211000 001112333322 2333444445555 Q ss_pred eeec--cchhhh--hhhhhhhhccceecccceEEEEe--------cCC Q lcl|NC_011085. 
308 RARR--AEYQAD--QIIARYAMGHGGLRPEAAGALVF--------TAG 343 (343) Q Consensus 308 ~~~~--~~~~~d--~i~~~~~~G~~v~rpe~~~~i~~--------~~g 343 (343) ..+. ..+..+ .+++.+++|.++++|++.+.+++ +|| T Consensus 345 ~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~a~~~~~~~ 392 (392) T protein:vir:10 345 STDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSAPVEQPQG 392 (392) T ss_pred EeccccchhhcCceEEEEEEeeccEEecccceEEEEecccccccCCCC Confidence 4432 222223 37788889999999999999877 556 No 144 >protein:vir:102082 Length: 392 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512315;genbank:gi:89152484;genbank:GeneID:3953075 Probab=99.09 E-value=4.9e-11 Score=77.09 Aligned_cols=284 Identities=11% Similarity=0.057 Sum_probs=152.5 Q ss_pred CCCCCccc--------cccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccce-EEEEec-cCccee Q lcl|NC_011085. 1 MADMKGGQ--------QLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGK-SAQFPV-LGRTRA 70 (343) Q Consensus 1 ~~~~~~~~--------~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~-tv~i~~-iG~~t~ 70 (343) |.+..-.. ..-.+.......++--.+..+.|..++.+..+..+.++++++...+.++. +..++. .+...+ T Consensus 84 l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~~a 163 (392) T protein:vir:10 84 LRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPF 163 (392) T ss_pred HhcccccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCccc Confidence 11000000 00000000000011111345889999999999999999999988876432 334444 344566 Q ss_pred eeecCCCcCCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccc Q lcl|NC_011085. 71 AYLQAGQSLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAA 150 (343) Q Consensus 71 ~~~~~g~~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~ 150 (343) .....|..++.+ ..++.+++++..-+. +.-..|.+-=-.++.+|+.+.+.++.++++++..|..++.-.. 
T Consensus 164 ~~v~E~~~~~~~-~~~~~~~v~l~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g-------- 233 (392) T protein:vir:10 164 AEITEMGEIPET-DNPKFSNVQYAVKDR-AGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIE-------- 233 (392) T ss_pred eeeccccccccc-ccccceeEEeeeeeE-EEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccc-------- Confidence 677777766532 124566777766544 3334555422223568999999999999999999998863211 Q ss_pred ccccccccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhcccccc Q lcl|NC_011085. 151 SNENIAGLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAALI 230 (343) Q Consensus 151 ~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~ 230 (343) .+.. ... ..++.+++++ ...|+....+ +-..|++|..|..|.+-.. .+..|.-.. T Consensus 234 ---------~~~~------~~~----~~~d~i~~~~---~~~l~~~~~~--~a~~vm~~~~~~~L~~lkd-~~G~~l~~~ 288 (392) T protein:vir:10 234 ---------KLTK------QAI----KSLDDIKDVL---NVKLDPAISP--NAILLTNQDGFNYLDKLKD-KDGKYILQS 288 (392) T ss_pred ---------cccc------cCc----cCHHHHHHHH---HHhhhhhhcc--CCEEEEcHHHHHHHHHhhc-cCCCeEeec Confidence 0000 000 1123344333 1344444432 3457899999999965321 122332223 Q ss_pred chhcceeEEEeceEEEE-e-ccccccccccccccccccccccccccccccccccccceEeEeechh-hheeeeeeeeEEe Q lcl|NC_011085. 231 DPERGSIRNVMGFEVVE-V-PHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRS-AVGTVKLKDLSLE 307 (343) Q Consensus 231 ~~~~G~V~~i~Gf~V~~-s-n~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-Av~~~~~~~~~~e 307 (343) .+..|.-++++|.+++. + +++|...... ....+.++...+ ++..+....++++ T Consensus 289 ~~~~~~~~tllG~~~v~~~~~~~~~~~~~~------------------------~~~~~~~~gdfs~~~~i~~~~~~~~~ 344 (392) T protein:vir:10 289 DPTQKNKKLFAGTNPVVVVSNRFLKSKGTT------------------------AKKAPLIIGDLKEAIVLFKREDMELA 344 (392) T ss_pred CccCCccccccCcccEEEecccccCCCccc------------------------CCceEEEEEehhceEEEEeecceEEE Confidence 45567778899986554 3 3333211000 001112333322 2333444445555 Q ss_pred eeec--cchhhh--hhhhhhhhccceecccceEEEEe--------cCC Q lcl|NC_011085. 
308 RARR--AEYQAD--QIIARYAMGHGGLRPEAAGALVF--------TAG 343 (343) Q Consensus 308 ~~~~--~~~~~d--~i~~~~~~G~~v~rpe~~~~i~~--------~~g 343 (343) ..+. ..+..+ .+++.+++|.++++|++.+.+++ +|| T Consensus 345 ~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~a~~~~~~~ 392 (392) T protein:vir:10 345 STDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSAPVEQPQG 392 (392) T ss_pred EeccccchhhcCceEEEEEEeeccEEecccceEEEEecccccccCCCC Confidence 4432 222223 37788889999999999999877 556 No 145 >protein:vir:100884 Length: 389 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358764;genbank:gi:78000028;genbank:GeneID:3726155 Probab=99.08 E-value=4.3e-11 Score=77.40 Aligned_cols=280 Identities=10% Similarity=0.059 Sum_probs=150.6 Q ss_pred CC-CCCccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEecc--CcceeeeecCCC Q lcl|NC_011085. 1 MA-DMKGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVL--GRTRAAYLQAGQ 77 (343) Q Consensus 1 ~~-~~~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~i--G~~t~~~~~~g~ 77 (343) +. -+.++... .+.-....+++-=.+..+.|..++.+..+..+.++++.++.++.++ +.+++.. +.........+. T Consensus 95 ~~~~lr~~~~~-~~~~~~~t~~~gg~~vP~~~~~~i~~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~E~~ 172 (389) T protein:vir:10 95 INDFIHSHGKV-IDATSKVTSTEAGVLIPEEIIYDPTAEVNSVVDLSTLVTKTPVTTP-KGTYPILKRATDRFSSVAELA 172 (389) T ss_pred HHHHhhcchhh-hhhhcccccCCcceeehHHHHHHHHHHHHhhhhHHhhcceeeccCC-eeEEEEEecCCCccccccccc Confidence 00 00001100 0000011111111134488999999999999999998888777543 3445443 334445555555 Q ss_pred cCCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccc Q lcl|NC_011085. 78 SLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENIAG 157 (343) Q Consensus 78 ~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~g 157 (343) ..+.. ..++..++++.+.+. +.-+.|.+-=-..+.+|+.+.+.++.+++|++..|..|+..+.. 
T Consensus 173 ~~~~~-~~~~~~~i~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~i~~g~~~-------------- 236 (389) T protein:vir:10 173 ENPKL-AEPEFNKVDWSVATY-RGAIPLSEEAIADSAVDLTALVGQSIKEKSVNTYNAMIAPVLQS-------------- 236 (389) T ss_pred ccccc-ccccceeeeeeheee-EeeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhhhhcc-------------- Confidence 55432 235566666666544 23344543222335688999999999999999999988643211 Q ss_pred cCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhccccc----cchh Q lcl|NC_011085. 158 LGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAAL----IDPE 233 (343) Q Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~----~~~~ 233 (343) +. +.+..+ ...++.+.+.+ ...++.. .+-.++++|..|..|..-..- +..|.-. .... T Consensus 237 ---~~--~~~~~~-----~~~~d~l~~~~---~~~~~~~----~~a~~~~n~~~~~~L~~lkd~-~G~~i~~~~~~~~~~ 298 (389) T protein:vir:10 237 ---FT--AKKTTT-----DTLVDSLKHIL---NVDLDPA----YSRALVVTQSLFNTLDTLKDK-NGRYLLHDASDSITD 298 (389) T ss_pred ---cc--cccccc-----cccHHHHHHHH---Hhhhhhh----hCcEEEecHHHHHHHHHhhcc-CCCeeeecCcccccc Confidence 00 001111 11123333322 1233322 234678999999998764321 2233211 1223 Q ss_pred cceeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechh-hheeeeeeeeEEeeeecc Q lcl|NC_011085. 234 RGSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRS-AVGTVKLKDLSLERARRA 312 (343) Q Consensus 234 ~G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~-Av~~~~~~~~~~e~~~~~ 312 (343) .|..++++|.+|+.+++........ +.+.++...+ ++.....++++++..++ T Consensus 299 ~~~~~~l~G~pV~~~~~~~~~~~~~--------------------------~~~~~~gd~~~~~~~~~~~~~~i~~~~~- 351 (389) T protein:vir:10 299 GTAKGTILGVPVYVVGDTLLGSLAG--------------------------DQKAFVGDLKRGVLFTDRQQVTLAWEDS- 351 (389) T ss_pred cccccccccceeEEecccccCCCCC--------------------------ceEEEEeeccccEEEEeecceEEEeecc- Confidence 4556789999999876543211110 0011222222 33344445566666554 Q ss_pred chhhhhhhhhhhhccceecccceEEEEec--CC Q lcl|NC_011085. 
313 EYQADQIIARYAMGHGGLRPEAAGALVFT--AG 343 (343) Q Consensus 313 ~~~~d~i~~~~~~G~~v~rpe~~~~i~~~--~g 343 (343) ..|...++..+++|.++++|++.+.+.++ .+ T Consensus 352 ~~~~~~~~~~~r~d~~~~~~~a~~~~~~~~~~~ 384 (389) T protein:vir:10 352 KIYGKYLGAAFRFGVQKADSKAGYFVTNTDVPG 384 (389) T ss_pred ccccceEEEEEEeccEEecccceEEEEeeccCC Confidence 45566788999999999999998877755 33 No 146 >protein:vir:4092 Length: 390 # NCBI annotation: major capsid protein a # Family: family:all:635 # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510986;swissprot:trembl:q8w604;genbank:gi:17488508;uniprot:Q8W604;genbank:GeneID:1260361 Probab=99.07 E-value=8.3e-11 Score=75.86 Aligned_cols=293 Identities=10% Similarity=-0.002 Sum_probs=147.9 Q ss_pred CCCCCccccc---cccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEecc-CcceeeeecCC Q lcl|NC_011085. 1 MADMKGGQQL---GKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVL-GRTRAAYLQAG 76 (343) Q Consensus 1 ~~~~~~~~~~---~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~i-G~~t~~~~~~g 76 (343) +-.++....- ..+. .+..++--.|..+.|..++.+..++.+.+++++++.++.+ ....||+. +..++.....+ T Consensus 69 ~~~l~~~~r~~~~~~~~--~~~~~~gg~lvP~~~~~~I~~~~~~~s~i~~~~~~~~~~~-~~~~i~~~~~~~~a~~~~E~ 145 (390) T protein:vir:40 69 ANALTSDESKYYNEVIA--GNGFAGVTALLPPTVFERVFEDLTVEHPLLSKINFVNTTA-TTEWIISVGDVATAWWGPLC 145 (390) T ss_pred chhccHHHHHHHHHHHh--ccCcccCcccccHHHHHHHHHHHHhhhhhhhhceeeecCC-ceeEEEEEcCCcceeeeccc Confidence 0000000000 0000 0111111224559999999999999999999998887655 44556654 44455555555 Q ss_pred CcCCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccc Q lcl|NC_011085. 77 QSLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENIA 156 (343) Q Consensus 77 ~~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~ 156 (343) ..+... .+++.+++++.+-++ +.-+.|.+-=-.++.+|+.+.+.++.++++++..|+.++.- .+. ..+.+... 
T Consensus 146 ~~~~~~-~~~~f~~i~l~~~k~-~~~i~iS~ell~ds~~~l~~~i~~~la~~i~~~~~~a~l~G--~G~---~~P~Gil~ 218 (390) T protein:vir:40 146 AEIKEV-LDNGFDKIQTGMYKL-SAYIPVCNAMLDLGPSWLDQYVRTILGEAMALGLEAGIVNG--SGK---DQPIGMMR 218 (390) T ss_pred cccCcc-ccccceeeEeeeeeE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhhcc--cCC---Cccceeee Confidence 555432 235666777766544 33345554323346788999999999999999999988731 111 11111111 Q ss_pred ccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCc-CCcEEEeCHHHHHHHhccchhhhhccccccchhcc Q lcl|NC_011085. 157 GLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPS-ADRTFYTTPEVYSAILAALMPNAANYAALIDPERG 235 (343) Q Consensus 157 g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~-~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G 235 (343) .....+ .+.....+...-.+..+.+++......+.....+. ..-+.+++|..+..+++..+.. .+-.|. +..+ T Consensus 219 ~~~~~~---~~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~~~~~a~~i~n~~t~~~~l~~~~~~-~d~~G~--~v~~ 292 (390) T protein:vir:40 219 DLNNVT---AGEHPVKTATPLTDLTPATLATKVMLPLTDNGKKSVSDAILVINPADYWSKIYAATSY-MTPQGV--WVTG 292 (390) T ss_pred cccccc---ccccccccccccchhhHHHHHHHHHHHhhcchhhhhcCceEEEcchhHHHHHHHHhhc-cCCCCc--cccc Confidence 000000 00000000111111222333333333443322211 2335678887665544432211 111111 1111 Q ss_pred eeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEeeeeccch- Q lcl|NC_011085. 236 SIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERARRAEY- 314 (343) Q Consensus 236 ~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~~~~~- 314 (343) ....|.+|+.|+++|.... . -++|+.. ..+...+++++...+..+ T Consensus 293 --~~~~g~pvv~~~~~p~~~i-------------~---------~Gd~s~~----------~i~~~~~~~v~~~~~~~f~ 338 (390) T protein:vir:40 293 --ILPVPLEIVQSVAVPVGKA-------------V---------AGRAKDY----------FMGIGSEQVIRTSTEYRLL 338 (390) T ss_pred --cCCCceeEEEcCCCCCCcE-------------E---------EEeeceE----------EEEeecceEEEecchhhhh Confidence 1246999999999985321 0 1233221 122334455655443322 Q ss_pred -hhhhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 
315 -QADQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 315 -~~d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) -...+++.++++.++++|++.+.+.+++= T Consensus 339 ~~~~~~r~~~r~dg~v~~~~A~~~l~~~~~ 368 (390) T protein:vir:40 339 DDETLYYAKQYANGRPKDNSSFLVFDITGL 368 (390) T ss_pred cCcEEEEEEEEeCCEEecccceEEEEeecc Confidence 23557899999999999999998876554 No 147 >protein:vir:8420 Length: 477 # NCBI annotation: gp15 # Family: family:all:21 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818316;genbank:gi:29566752;genbank:GeneID:1260033 Probab=99.05 E-value=1.1e-10 Score=75.14 Aligned_cols=296 Identities=10% Similarity=0.053 Sum_probs=141.4 Q ss_pred CCCCCccccccccccccccccchhHHHH-HHHHHHHHHHHHHhhhhccCcccccccc-ceEEEEeccCcce--eeeecCC Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQSGGDKLALFL-KVFGGEVLTAFARTSVTTNRHIMRSISS-GKSAQFPVLGRTR--AAYLQAG 76 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~i-e~~~g~V~~~f~~~s~~~~~~~~~~i~~-G~tv~i~~iG~~t--~~~~~~g 76 (343) ...........+-.+. +| .+.+ +...+++.+..+..++++++++..++.+ +.++.||++.... ..-...| T Consensus 148 ~~~~~~~~~~~~~~~~---gg---~lv~~~~~~~~ii~~l~~~~~i~~~~~~~~~~~~~~~~~ip~~~~~~~~a~~~~Eg 221 (477) T protein:vir:84 148 AKVGEEYRDLDRNGGT---GG---YAVPPLWMMNRFIELARAGRTYANLCPTEPLPGGTSSINIPKILTGTSTAIQAADN 221 (477) T ss_pred HHhhhhhccccccCCC---cc---eeeccchhHHHHHHHhhhcchHHHhhceeeecCCcceeEEEEEecCcceeeeeccC Confidence 0011011111111111 11 1334 4457888888888888888888887764 5679999863322 2223334 Q ss_pred CcCCCccC---CCccceEEEEeeeeeeee-eeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccc Q lcl|NC_011085. 77 QSLDDKRK---DIKHTEKTIVIDGLLTAD-VLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASN 152 (343) Q Consensus 77 ~~i~~~~~---~~~~~~~~l~iD~~~~~~-~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~ 152 (343) ..+..... ++...++++ +-.++.. +.|.+-=-.++.+|+.+.+.++.+++|++..|+.+|. +.... 
T Consensus 222 ~~~~~~~~~~s~~~f~~i~~--~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~~l~----G~Gt~---- 291 (477) T protein:vir:84 222 AALTAPSAHEVDLTDGFVQA--NVKTIAGQQGIAIQLLDQAAVSVDEFVFRDLAADYANKLNVQVIS----GTGSN---- 291 (477) T ss_pred cccccccccccccceeeEEE--eeeeEEeeeHHHHHHHhccchhHHHHHHHHHHHHHHHHHHHHHhc----cCCCC---- Confidence 43322111 122333444 4444444 3343321233578999999999999999999998762 11111 Q ss_pred ccccccCCc---eeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhh----- Q lcl|NC_011085. 153 ENIAGLGSA---SILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAA----- 224 (343) Q Consensus 153 ~~~~g~~~~---~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~----- 224 (343) ..+.|.... +.+..+. ...+.. ....+++.|.++...++.... ....+.+++|..|..|.+-..-... T Consensus 292 ~~p~Gi~~~~~~~~~~~~~--~~~t~~-~~~~~~~~i~~~~~~~~~~~~-~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~ 367 (477) T protein:vir:84 292 NQVVGVRATAGITQVTATS--AGSALE-KHQIIYQKIADAIQRVHTSRF-LEPEVIVMHPRRWASFHAIFAGDDRPLIVP 367 (477) T ss_pred Cccceeeeccccccccccc--cccchh-hHHHHHHHHHHHHhhcccccc-CCccEEEEcHHHHHHHHHhhccCCCeeeec Confidence 112221111 1111111 111111 112344555555554444332 2234678899998887653221111 Q ss_pred ccc-------cccchhcceeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhhe Q lcl|NC_011085. 225 NYA-------ALIDPERGSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVG 297 (343) Q Consensus 225 ~~~-------~~~~~~~G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~ 297 (343) ++. ....+.+|..++++|++|+.|+.+|....... . . ..-+-++|... . T Consensus 368 ~~~~~~~~~~~~~~~~~~~~~~l~G~pVv~s~~~p~~~~~~~-d------~-------~~i~~gd~~~~----------~ 423 (477) T protein:vir:84 368 SGPGFNNLGVLTEVASQRVVGQMHGLPVVTDPTLPTTLGTGT-D------Q-------DVIHVLRASDL----------A 423 (477) T ss_pred CcccccccccccccccccccchhcccceEecCcccccccccC-C------c-------ceEEEEEeceE----------E Confidence 110 11235566678999999999999995311000 0 0 00011222211 1 Q ss_pred eeeeeeeEEeeeeccchhhhhhhhhh-h---hccceec-ccceEEEEecCC Q lcl|NC_011085. 
298 TVKLKDLSLERARRAEYQADQIIARY-A---MGHGGLR-PEAAGALVFTAG 343 (343) Q Consensus 298 ~~~~~~~~~e~~~~~~~~~d~i~~~~-~---~G~~v~r-pe~~~~i~~~~g 343 (343) ..+ ..+.++ .++..+++.....+ + +..+.+| |++.+.++.++= T Consensus 424 i~~-~~~~~~--~~~~~~~~~~~~~~~v~~~~~~~~~r~~~afv~~t~~~~ 471 (477) T protein:vir:84 424 LFE-SSVRMR--ALQETRAENLSVLLQVYGYLAFTAARFPQSVVEIGGTAL 471 (477) T ss_pred EEe-eceeEE--eccccccccceeeeeehhhhhhhhhccccceEEeecccc Confidence 111 112333 33333333222221 1 2335666 999998887766 No 148 >protein:vir:105610 Length: 430 # NCBI annotation: virion structural protein # Family: family:all:974 # MgeID: mge:1540 # MgeName: F116 # Cross-refs: genbank:acc:YP_164307;genbank:gi:56692923;genbank:GeneID:3197221 Probab=99.02 E-value=2.2e-10 Score=73.56 Aligned_cols=326 Identities=11% Similarity=0.052 Sum_probs=172.3 Q ss_pred CccccccccccccccccchhHHHHHHHHHHHHHHHHHh-hh---hcc----------------------Cccccccc--c Q lcl|NC_011085. 5 KGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFART-SV---TTN----------------------RHIMRSIS--S 56 (343) Q Consensus 5 ~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~-s~---~~~----------------------~~~~~~i~--~ 56 (343) -+++. |..+ .+|+. -.++|+.-|...-.+. +. |.+ +++..++. . T Consensus 1 ~~~a~--T~~~----~~~p~--a~~~ws~~l~~~~~k~~~~~~kl~G~~~~~~~~~~~~~~~~ts~~~pI~r~~dL~K~~ 72 (430) T protein:vir:10 1 MTASK--TTMR----YGDPN--AMIQQAAGLFALCQGRNSTLNRLTGKMPSGTSDAEKKTKGQSSLELPIVQAQDLGRNK 72 (430) T ss_pred Cccee--eecc----cCChh--HHHHHHHHHHHHHhhhhhhHHHhhccccccccchhhhccCCCCCCccEEEeccCCCCC Confidence 23333 3333 33443 4678888876655443 22 222 44555553 5 Q ss_pred ceEEEEeccCcceeeeecCCCcCCCccCCCccceEEEEeeeeeeeeeecc-chHHHHhchhhHHHHHHHHHHHHHHHHHH Q lcl|NC_011085. 57 GKSAQFPVLGRTRAAYLQAGQSLDDKRKDIKHTEKTIVIDGLLTADVLIY-DIEDAMNHYDVRSEYTSQIGESLAMAADG 135 (343) Q Consensus 57 G~tv~i~~iG~~t~~~~~~g~~i~~~~~~~~~~~~~l~iD~~~~~~~~Id-d~D~~q~~~d~~~~~~~~~~~aLa~~~D~ 135 (343) |++|.|+-+...+-.....++.+.+.-+.++.....|.|||.. .++.+. 
.+++-.+.+|+|.+.-..++.=+++..|| T Consensus 73 GD~Vtf~L~~~L~g~gv~Gd~~lEGnee~L~~~~d~l~IDq~R-~~V~~gg~msqQRt~~dlR~~ar~~L~~w~~~~~Dq 151 (430) T protein:vir:10 73 GDEVRFHFVQPANAFPIMGSEYAEGKGTGLKIGSDQLRVNQAR-FPVDLGDVMSQIRNPYDLRRLGRPKAKWFMDAYLDQ 151 (430) T ss_pred ccEEEEeEeeccccCceecCceeeccccceEEEeeEEEEeeec-cccccCCchhhhhhhhHHHHHHHHHHHHHHHHHHHH Confidence 9999999988877666666788888778888889999999885 333333 34555678999999999999999999999 Q ss_pred HHHHHHHhhhh----------------ccc-cccccccccCCc--eeeccccccccc------chHHHHHH-HHHHHHHH Q lcl|NC_011085. 136 AVLAELAGLCN----------------MPA-ASNENIAGLGSA--SILEVGAKGDLT------SPVELGKA-VIAQLTIA 189 (343) Q Consensus 136 ~i~~~~~~~a~----------------~~~-~~~~~~~g~~~~--~~~~~~~~~~~~------~~~~~~~~-i~~~l~~a 189 (343) .+|.+++.+.- ++. ..+... .++. .+...+.+++.. ..-...+. -++.|.+| T Consensus 152 ~~~v~laGarg~~~~~~~~~~~~~~~~~~~~~~N~v~--aPt~nrh~~~~G~at~~~~~~~~~~sl~stD~~s~~~id~a 229 (430) T protein:vir:10 152 SMLVHLAGARGNHYNKEWCLPLETHPKLADMLVNRVK--APTKNRHFVASADAITGVAPNAGEYNITTADVLDVDVVDSI 229 (430) T ss_pred HHHHHHhhhhcccccccccccccCCcchhhhhccccC--CCCCceeEeecccccccccccccccchhhhcccCHHHHHHH Confidence 99999976411 000 000000 0111 111112111100 00011121 15666677 Q ss_pred HHHHhhcCCCc-------CC-------cEEEeCHHHHHHHhccchhhh----h-cc--cc-ccchhcceeEEEeceEEEE Q lcl|NC_011085. 190 RAKLTSNYVPS-------AD-------RTFYTTPEVYSAILAALMPNA----A-NY--AA-LIDPERGSIRNVMGFEVVE 247 (343) Q Consensus 190 ~~~Ld~~~VP~-------~g-------R~~vv~P~~~~~Ll~~~~~~~----~-~~--~~-~~~~~~G~V~~i~Gf~V~~ 247 (343) +..++..+.|- +. ++++++|.+|..|..+..+.. + .. 
.+ ...|-+|.++.++|+-|++ T Consensus 230 ~~~a~~~~~~i~Pv~v~gd~~~g~~~~yV~~~~p~q~~~Lr~dt~~~~wq~~~~a~a~~g~~nPlF~G~~gm~ngvii~~ 309 (430) T protein:vir:10 230 ATYMDQIELPPPPVKFEGDEAAEDSPIRVLLCSPAQYNSFAKQEKFRSWQAAALARASNAKQHPIFRVDAGLWSNTLIIK 309 (430) T ss_pred HHHHHhhCCCCcceEeecccccCCccEEEEEechHHHHHHhhCcchHHHHHHHHHhhcccccCCceecceeeecCeEEec Confidence 88888765332 22 678899999999999987641 1 11 12 3467799999999999998 Q ss_pred ecccc---ccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeE--Eeeeeccchh--h--hh Q lcl|NC_011085. 248 VPHLT---AGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLS--LERARRAEYQ--A--DQ 318 (343) Q Consensus 248 sn~lp---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~--~e~~~~~~~~--~--d~ 318 (343) -.+.- .+............ ..........+...+.-..+|++-..|++.+.+.... +.-.|.++.+ + -. T Consensus 310 ~~~virf~~g~~~~~~a~~~~~--~~~~~~~~a~~~~~~~v~RalllGaQA~~~A~g~~~~~g~~f~w~Ee~~D~g~~~~ 387 (430) T protein:vir:10 310 MPKPIRFYAGDTIKYCAAYNSE--AESSAVVSDSFGNQYAVDRALLLGGQALAQAWAASEHSGMPFFWSEKDMDHGDKLE 387 (430) T ss_pred CCceeeecCCCccccccCCccc--ccccccccccccccccchhhhhccchhheeeeeccCCCCcceeeeeeccccCchhh Confidence 76441 11111110000000 0000001111112222223444445555555554211 1112222111 1 23 Q ss_pred hhhhhhhccceeccc----------ceEEEEecCC Q lcl|NC_011085. 319 IIARYAMGHGGLRPE----------AAGALVFTAG 343 (343) Q Consensus 319 i~~~~~~G~~v~rpe----------~~~~i~~~~g 343 (343) |.....+|.+=.|-. 
=-++|.+.-- T Consensus 388 i~~~~i~G~kK~rF~~~~~~~~~~~DfGvi~idta 422 (430) T protein:vir:10 388 LLIGAILGCSKIRFAVEATNGLEYTDHGVMAIDTA 422 (430) T ss_pred hhhhHHhccceeeecCCCCCCceeeeeEEEEhhhh Confidence 444445555554432 1233333322 No 149 >protein:vir:78640 Length: 352 # NCBI annotation: phage capsid # Family: family:all:658 # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429943;genbank:gi:156603997;genbank:GeneID:5525386 Probab=98.99 E-value=2e-11 Score=79.29 Aligned_cols=269 Identities=14% Similarity=0.094 Sum_probs=141.4 Q ss_pred CCCC------------CccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEecc--C Q lcl|NC_011085. 1 MADM------------KGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVL--G 66 (343) Q Consensus 1 ~~~~------------~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~i--G 66 (343) |... .-+.+.++. ..|. -|..+.+..++.+..+..+.+++++++.++.+ . ++|.+ + T Consensus 64 ~~~~~~~~~~~~~~~~~~al~~~~~-----~~gG--~lIP~~~~~~Ii~~l~~~s~l~~~~~v~~~~~-~--~~p~~~~~ 133 (352) T protein:vir:78 64 ILPNEFEKPSMEAQRLLHALPTGND-----SGGD--KLLPKTLSKEIVSEPFAKNQLREKARLTNIKG-L--EIPRVSYT 133 (352) T ss_pred hhhhHHHHHHhhHHHHHHHhccCCC-----CCCc--eeccHhHHHHHHHHHHhhcchhhheeeEecCC-c--eEEEEecC Confidence 1000 000011111 1111 14458899999999999999999988776543 2 33332 2 Q ss_pred cceeeeecCCCcCCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhh Q lcl|NC_011085. 67 RTRAAYLQAGQSLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCN 146 (343) Q Consensus 67 ~~t~~~~~~g~~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~ 146 (343) ..++.-...|..++.+ +++.+++++.+.++- .-+.|.+-=-.++.+|+.+.+.++.++++++..++.++.. 
+ T Consensus 134 ~~~a~~v~E~~~~~~~--~~~f~~v~~~~~k~~-~~i~is~ell~Ds~~~l~~~i~~~la~~~~~~e~~~~~~~---g-- 205 (352) T protein:vir:78 134 LDDDDFITDVETAKEL--KLKGDTVKFTTNKFK-VFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAV---S-- 205 (352) T ss_pred CCcccccccccccccc--cccceeeeecceeEE-eechhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhhhhc---C-- Confidence 3345555566666553 356677777665442 2245544222235689999999999999998755544421 0 Q ss_pred ccccccccccccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhcc Q lcl|NC_011085. 147 MPAASNENIAGLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANY 226 (343) Q Consensus 147 ~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~ 226 (343) ...+.+.+.... .. ....+... .++.|.++...|+..... .. ..+++|..|..|+.-.+-.+ T Consensus 206 -------~g~~~~~g~l~~-~~-~~~~t~~~----~~d~i~~~~~~l~~~~~~-~a-~~~mn~~t~~~l~~~~~~~~--- 267 (352) T protein:vir:78 206 -------PKSGLEHMSFYN-GS-VKEVEGAN----MYDAIINALADLHEDYRD-NA-TIYMRYADYVKIISVLSNGT--- 267 (352) T ss_pred -------CCCcccccceec-cc-cccccccc----hHHHHHHHHhccChhhhc-CC-EEEEehHHHHHHHHHHhccC--- Confidence 011111111111 11 11111122 244555555556555432 23 45667777777665322111 Q ss_pred ccccchhcceeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEE Q lcl|NC_011085. 227 AALIDPERGSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSL 306 (343) Q Consensus 227 ~~~~~~~~G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~ 306 (343) ..+..|.-.+++|.+|+.++..+.- +-++|...... .+ .+.. T Consensus 268 ---~~~~~~~~~~llG~PV~~~~~~~~~------------------------~~Gdf~~~~~~---~~--------~~~~ 309 (352) T protein:vir:78 268 ---TNFFDTPAEKVFGKPVVFTDAAVKP------------------------IVGDFNYFGIN---YD--------GTTY 309 (352) T ss_pred ---CcccccCCccccccceEEecCCCce------------------------eEeehhhhhhh---hh--------hhee Confidence 1223344457999999998755410 01223221110 01 1234 Q ss_pred eeeeccchhhhhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 
307 ERARRAEYQADQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 307 e~~~~~~~~~d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) +.+++....-..+++.+++++++++|++.+.+++++- T Consensus 310 ~~~~~~~~g~~~f~~~~r~Dg~~~~~eA~~~l~~~a~ 346 (352) T protein:vir:78 310 DTDKDVKKGEYLFVLTAWYDQQRTLDSAFRIAKAKES 346 (352) T ss_pred eeeccccCCeeEEEEEeeeCceeechhheEEEEeecc Confidence 4444443333456667899999999999999888777 No 150 >protein:vir:2770 Length: 318 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:59 # MgeName: Stx2 converting bacteriophage I # Cross-refs: genbank:acc:NP_612887;genbank:gi:20065804;genbank:GeneID:935710 Probab=98.98 E-value=4.5e-11 Score=77.32 Aligned_cols=261 Identities=10% Similarity=0.044 Sum_probs=149.6 Q ss_pred CCCCCccccccccccccc----cccchhHHHHHHHHHHHHHHHHHhhhhc---------cCccccccc--cceEEEEecc Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQ----SGGDKLALFLKVFGGEVLTAFARTSVTT---------NRHIMRSIS--SGKSAQFPVL 65 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~----~~~d~~al~ie~~~g~V~~~f~~~s~~~---------~~~~~~~i~--~G~tv~i~~i 65 (343) |.++.+|+-.-..+ ++. ...| -.+++|++.+...-.+.+-+. .+++..++. .|++|.|.-+ T Consensus 1 mt~~~~~~~~~~~~-~~~ft~~~~~~---~~vk~ws~~l~~~~~~~~~~~~~~g~~~~~~I~r~~dL~K~~GD~Vtf~L~ 76 (318) T protein:vir:27 1 MTTVTSAQANKLFQ-VALFTAANRNR---SMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIM 76 (318) T ss_pred CCccCCCChHHHHH-HHHHHHHhcCC---hHHHHHHHhhhhHHHhhhhhhcccCCCCCceEEEeccCCCCCccEEEEeEe Confidence 88887776421110 000 1111 146789998866555544332 223333443 5999999998 Q ss_pred CcceeeeecCCCcCCCccCCCccceEEEEeeeeeeeeeec-cchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_011085. 66 GRTRAAYLQAGQSLDDKRKDIKHTEKTIVIDGLLTADVLI-YDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGL 144 (343) Q Consensus 66 G~~t~~~~~~g~~i~~~~~~~~~~~~~l~iD~~~~~~~~I-dd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~ 144 (343) ...+-.....++.+.+.-+.++....+|.||+.. .++.. 
..+++-.+.+|+|++.-..++.-+++..||.+|.+++.+ T Consensus 77 ~~L~g~gv~Gd~~lEGnee~L~~~~d~l~IDq~r-~~V~~gg~msqqRt~~dlR~~ar~~L~~w~~~~~Dq~~~v~laGa 155 (318) T protein:vir:27 77 HKLSKRPTMGDERVEGRGEDLSHADFSLKINQGR-HLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGA 155 (318) T ss_pred eccccCccccCceeeccccceEEEeeEEEEeeec-cccccccchhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc Confidence 8877666666778888778888888999999875 22222 344555677999999999999999999999999999765 Q ss_pred hhc-----------cccccc----cccccCC-ceeecccccccccchHHHHHH-HHHHHHHHHHHHhhcCCC-------c Q lcl|NC_011085. 145 CNM-----------PAASNE----NIAGLGS-ASILEVGAKGDLTSPVELGKA-VIAQLTIARAKLTSNYVP-------S 200 (343) Q Consensus 145 a~~-----------~~~~~~----~~~g~~~-~~~~~~~~~~~~~~~~~~~~~-i~~~l~~a~~~Ld~~~VP-------~ 200 (343) ... .++... +..-.++ ..++-.+.++...+- ...+. -++.|.++++.+++..-| . T Consensus 156 rg~~~n~~~~~p~~~~~~~~~~~~N~v~aPt~~r~~~~g~at~~~~l-~stD~~s~~lid~~~~~~~~~a~pi~PV~v~g 234 (318) T protein:vir:27 156 RGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQI-EAADIFSIGLVDNLSLFIDEMAHPLQPVRLSG 234 (318) T ss_pred ccccccccceEecccCccchhhhhcccCCCCCCcEEeccCccchhhh-hhcccccHHHHHHHHHHHHHhCCCCcceeecc Confidence 521 000000 0000000 111111111111100 01111 134455566777663222 1 Q ss_pred CC-------cEEEeCHHHHHHHhccch------hh-hhccc---cccchhcceeEEEeceEEEEeccccccccccccccc Q lcl|NC_011085. 201 AD-------RTFYTTPEVYSAILAALM------PN-AANYA---ALIDPERGSIRNVMGFEVVEVPHLTAGGAGDDREDE 263 (343) Q Consensus 201 ~g-------R~~vv~P~~~~~Ll~~~~------~~-~~~~~---~~~~~~~G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~ 263 (343) +. ++++++|.++..|..+.. +. ++... ....|-.|.+|.++|+=|.+.+++|-.=.. 
T Consensus 235 ~~~~~~~~~yV~~~~p~q~~~Lrtdt~~~~w~d~q~~A~~r~~g~knPLF~G~~gm~ngvil~~~~~vpIrf~~------ 308 (318) T protein:vir:27 235 DELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQ------ 308 (318) T ss_pred ccccCCcceEEEEechHHHHHHhhcCCCHHHHHHHHHHHhcccccCCCceecceeeecCEEEeecCCccEEEcC------ Confidence 12 678899999999998752 21 22222 234578999999999999999988732100 Q ss_pred cccccccccccccccccccccceE Q lcl|NC_011085. 264 TTNQKHAFPKTAEGDTKVALDNVV 287 (343) Q Consensus 264 ~~~~~~~~~~~~~~~~~~~~~~~~ 287 (343) -.+.+|. +.. T Consensus 309 ----------G~~v~~~----~~~ 318 (318) T protein:vir:27 309 ----------GQRFWYQ----RIT 318 (318) T ss_pred ----------CCeeeee----ecC Confidence 0000000 000 No 151 >protein:vir:9875 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795637;genbank:gi:28876404;genbank:GeneID:1257935 Probab=98.97 E-value=2.9e-10 Score=72.87 Aligned_cols=280 Identities=10% Similarity=0.014 Sum_probs=157.5 Q ss_pred CCCCCccccccccccccc-cccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEeccCc--ceeeeecCCC Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQ-SGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVLGR--TRAAYLQAGQ 77 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~-~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG~--~t~~~~~~g~ 77 (343) |-.-.+-.-.+.....-. ..-++| |++.|+..+.+.+ .+++..|..++..|++++++.-.. ...++...|+ T Consensus 1 ~~~~~~~~e~nlt~~~dl~~~~siD--f~~~f~~~i~~L~----~~LGv~r~~pla~GstIkt~k~~~y~gda~dVaEGe 74 (296) T protein:vir:98 1 MVTSRTYPEENLIKSTDLKYPITID--VTNKFQENISKLL----EMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGE 74 (296) T ss_pred CCCccccCcCCCcchhhhhhhhhhh--hHHHHhhhHHHHH----HHhhhcccccccCCCEEeeccceeeeeccccccCCc Confidence 433322222222211222 233455 8999998887665 356777777888899997764322 2346788899 Q ss_pred cCCCccCCCcc---ceEEEEeeeeeeeeeeccchHHH-Hh--chhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccc Q lcl|NC_011085. 
78 SLDDKRKDIKH---TEKTIVIDGLLTADVLIYDIEDA-MN--HYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAAS 151 (343) Q Consensus 78 ~i~~~~~~~~~---~~~~l~iD~~~~~~~~Idd~D~~-q~--~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~ 151 (343) .|+.+. ++. +..+++|.++.-. +. ||+ |. ..|...+..+++..++++++|..++..+..+.... T Consensus 75 ~Iplsk--vt~~~~~t~t~~ikK~rK~---tT--dEAIqlsGyg~aVgetd~qL~~~iq~kId~d~~t~LktaT~t~--- 144 (296) T protein:vir:98 75 VIPLSK--VERKIHSEKKIELKKYRKA---TT--GEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGTQ--- 144 (296) T ss_pred ccchhh--heeeecceEEEEeeccccc---cC--HHHHHhhcCCchhHHHHHHHHHHHHHhhhHHHHHHHhccccee--- Confidence 998753 333 3366777654322 43 455 53 35699999999999999999999997764322110 Q ss_pred cccccccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhccccccc Q lcl|NC_011085. 152 NENIAGLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAALID 231 (343) Q Consensus 152 ~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~ 231 (343) ..+....-.++...+.++..+|.+.+ ....+++|+|...+.+|.+..+.....-|..- T Consensus 145 --------------------~~t~~~lQ~Ala~~~~~l~~~feded--~~~~V~FVnP~D~a~ylg~a~it~qt~fG~ty 202 (296) T protein:vir:98 145 --------------------DALGAGLQGALASAWGKLQVLFEDYG--SERAIVFANSLDVAEYIAKAGITTQTAFGLTY 202 (296) T ss_pred --------------------eechhhHHHHHHHHhhhhhhhccccC--CCceEEEEehHHHHHHhcCCccchhheechhh Confidence 01122333445555666677777664 24679999999999999887765433222222 Q ss_pred hhcceeEEEeceEEEEecccccccccccccccccccccccccccccc------ccccccceEeEeechhhheeeeeeeeE Q lcl|NC_011085. 232 PERGSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGD------TKVALDNVVGLFQHRSAVGTVKLKDLS 305 (343) Q Consensus 232 ~~~G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~l~~~~~Av~~~~~~~~~ 305 (343) +. ++.|..|+.|+.+|.+..-..+... -..+..+..++. ...|-+..+|+. |... 
...++ T Consensus 203 l~-----nfLG~~II~S~kV~~G~~~~T~~~N---i~~ay~~~~~~~l~~~f~~~~d~tglIGv~-h~~~-----~~~~t 268 (296) T protein:vir:98 203 LV-----DFTGTVIISTNDVTKGEIWATVPEN---IIFAYINPNNSELAKEFNLYGDPTGYIGMN-HFQE-----NTTLT 268 (296) T ss_pred hh-----hccccEEEEcCcCCCceEEEeeecc---eEEEeecccccchhhhhccccccccceEEE-eccc-----cceee Confidence 22 4889999999999987544433221 111111111111 122233333321 1100 00111 Q ss_pred EeeeeccchhhhhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 306 LERARRAEYQADQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 306 ~e~~~~~~~~~d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) . ..-+..+... =+=|+|+++..+++.| T Consensus 269 ~--------eT~~~~~~~l---fpE~~dgiv~~tI~~~ 295 (296) T protein:vir:98 269 I--------QTLLVSGMLM---YPERIDGIVKVTLTPG 295 (296) T ss_pred e--------hhHhHhHHHh---cccccceEEEEEecCC Confidence 1 1111222222 2457789999999999 No 152 >protein:vir:9361 Length: 402 # NCBI annotation: SLT orf 37-like protein # Family: family:all:658 # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803339;genbank:gi:29028650;genbank:GeneID:1258088 Probab=98.95 E-value=2e-11 Score=79.22 Aligned_cols=271 Identities=12% Similarity=0.085 Sum_probs=142.0 Q ss_pred CCC---------CCccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEecc--Ccce Q lcl|NC_011085. 
1 MAD---------MKGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVL--GRTR 69 (343) Q Consensus 1 ~~~---------~~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~i--G~~t 69 (343) +.+ ........+ |....+| .+..+.|+.++.+..+..+.+++++++.++.+ .++|++ +..+ T Consensus 115 ~~~~~~~~~~~~~~~~~a~~~--~t~~~GG---~lIP~~~~~~Ii~~~~~~~~l~~~~~v~~~~~---~~~p~~~~~~~~ 186 (402) T protein:vir:93 115 LPNEFEKPSMEAQRLLHALPT--GNDSGGD---KLLPKTLSKEIVSEPFAKNQLREKARLTNIKG---LEIPRVSYTLDD 186 (402) T ss_pred hhhhHHHHHHhHHHHHhhhcc--CCCcCCc---cccchhHHHHHHHhHHhhhhhhhhceeeecCC---ceeeeeeccCCc Confidence 000 000000000 0001111 23458899999999998899999988877643 334443 3344 Q ss_pred eeeecCCCcCCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccc Q lcl|NC_011085. 70 AAYLQAGQSLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPA 149 (343) Q Consensus 70 ~~~~~~g~~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~ 149 (343) +.-...|...+.+ +++.+++++.+.+. +.-+.|.+-=-..+.+|+.+.+.++.++++++..++.++... T Consensus 187 a~~v~Eg~~~~~~--~~~f~~i~~~~~k~-~~~i~iS~ell~Ds~~~l~~~i~~~la~~~~~~e~~~~~~~g-------- 255 (402) T protein:vir:93 187 DDFITDVETAKEL--KAKGDTVKFTTNKF-KVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVS-------- 255 (402) T ss_pred ccccccccccccc--ccccceeeecceee-eeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcC-------- Confidence 5555667666543 35566666665444 222445432122357899999999999999998776655221 Q ss_pred cccccccccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhccccc Q lcl|NC_011085. 150 ASNENIAGLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAAL 229 (343) Q Consensus 150 ~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~ 229 (343) ...|.+.+.... .. ....+... .++.|.++...|+..... 
...|+ +++..|..|+.-.+-.+ T Consensus 256 ----~g~g~p~g~~~~-~~-~~~~~~~~----~~d~l~~~~~~l~~~y~~-na~~i-mn~~t~~~~~~~~~d~~------ 317 (402) T protein:vir:93 256 ----PKSGLEHMSFYN-GS-VKEVEGAD----MYDAIINALADLHEDYRD-NATIY-MRYADYVKIISVLSNGT------ 317 (402) T ss_pred ----CCccccceeeec-cc-cccccccc----hHHHHHHHHhccChhhhc-CCEEE-EechHHHHHHHHHhcCC------ Confidence 111112221111 11 11112222 245555555567666543 45564 55555544433211111 Q ss_pred cchhcceeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEeee Q lcl|NC_011085. 230 IDPERGSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERA 309 (343) Q Consensus 230 ~~~~~G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~ 309 (343) ..+..|.-.+++|.+|+.++..|.- +-++|......+ . .+.++.+ T Consensus 318 ~~~~~~~~~~llG~PV~~t~~~~~i------------------------~~GDf~~~~~~~-~----------~~~~~~~ 362 (402) T protein:vir:93 318 TNFFDTPAEKVFGKPVVFTDAAVKP------------------------IVGDFNYFGINY-D----------GTTYDTD 362 (402) T ss_pred CcccccCCccccccceEEecCCCce------------------------eeechhhhhhhh-h----------hhhhhhh Confidence 2223344467999999998865421 012232221111 1 1123444 Q ss_pred eccchhhhhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 310 RRAEYQADQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 310 ~~~~~~~d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) ++...-.-.+++..++++++++|++...+++++- T Consensus 363 ~~~~~~~~~~~~~~r~Dg~v~~~~A~~~l~ik~~ 396 (402) T protein:vir:93 363 KDVKKGEYLFVLTAWYDQQRTLDSAFRIAKAKEN 396 (402) T ss_pred hcccCCceEEEEEEEeCcEEechhheEEEEeecC Confidence 4443333456777889999999999998888766 No 153 >protein:vir:93881 Length: 387 # NCBI annotation: ORF011 # Family: family:all:658 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239938;genbank:gi:66395599;genbank:GeneID:5130947 Probab=98.95 E-value=9.2e-11 Score=75.60 Aligned_cols=272 Identities=12% Similarity=0.073 Sum_probs=140.4 Q ss_pred CCCCCcccccc-------ccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEec--cCcceee Q lcl|NC_011085. 
1 MADMKGGQQLG-------KDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPV--LGRTRAA 71 (343) Q Consensus 1 ~~~~~~~~~~~-------t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~--iG~~t~~ 71 (343) +.......... ...+....+| .+..+.|..++.+..+..+.+++++++.++.+. .+|+ .+..++. T Consensus 100 ~~~~~~~~~~~~~~~~~al~~~t~s~gG---~~IP~~~~~~Ii~~~~~~~~l~~~~~v~~~~~~---~~p~~~~~~~~a~ 173 (387) T protein:vir:93 100 LPNEFEKPSMEAQRLLHALPTGNDSGGD---KLLPKTLSKEIVSEPFAKNQLREKARLTNIKGL---EIPRVSYTLDDDD 173 (387) T ss_pred hhhhhhhhhhhhHHHHHhhccCcCCCCc---eeechhHHHHHHHHHHhhchhhhheeeeecCCc---eEEEEeecCCccc Confidence 00000000000 0001111111 134588899999999888888998888776432 3443 2344555 Q ss_pred eecCCCcCCCccCCCccceEEEEeeeeeeee-eeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccc Q lcl|NC_011085. 72 YLQAGQSLDDKRKDIKHTEKTIVIDGLLTAD-VLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAA 150 (343) Q Consensus 72 ~~~~g~~i~~~~~~~~~~~~~l~iD~~~~~~-~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~ 150 (343) -...|+..+.+ +++.+++++.. .++.. +.|.+---..+.+|+.+.+.++.++++++..++.++... T Consensus 174 ~v~E~~~~~~~--~~~f~~v~~~~--~k~~~~~~iS~ell~Ds~~~l~~~i~~~la~~~~~~e~~~~~~~g--------- 240 (387) T protein:vir:93 174 FITDVETAKEL--KLKGDTVKFTT--NKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVS--------- 240 (387) T ss_pred cccCccccccc--ccccceeeeeh--eeeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcC--------- Confidence 56667666543 35666666654 44444 445432122356899999999999999988776554211 Q ss_pred ccccccccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhcccccc Q lcl|NC_011085. 151 SNENIAGLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAALI 230 (343) Q Consensus 151 ~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~ 230 (343) ...+.+.+.... +. ....+.... ++.|.++...|+..... ...| ++++..|..|+.-.+-.+ . 
T Consensus 241 ---~g~g~p~g~l~~-~~-~~~v~~~~~----~d~i~~~~~~l~~~~~~-~a~~-~mn~~t~~~~~~~~~d~~------~ 303 (387) T protein:vir:93 241 ---PKSGLDHMSFYN-GS-VKEVEGADM----YDAIINALADLHEDYRD-NATI-YMRYADYVKIISVLSNGT------T 303 (387) T ss_pred ---CCccccceeeec-cc-cccccccch----HHHHHHHHhccChhhhc-CCEE-EEechHHHHHHHHHhcCC------C Confidence 111111111111 11 111122222 44455555566665543 3456 566665555443211111 2 Q ss_pred chhcceeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEeeee Q lcl|NC_011085. 231 DPERGSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERAR 310 (343) Q Consensus 231 ~~~~G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~ 310 (343) .+..|.-.+++|.+|+.++..|.- +-++|......+. .+..+.++ T Consensus 304 ~~~~~~~~~llG~PV~~~~~~~~~------------------------~~GDf~~~~~~~~-----------~~~~~~~~ 348 (387) T protein:vir:93 304 NFFDTPAEKVFGKPVVFTDAAVKP------------------------IVGDFNYFGINYD-----------GTTYDTDK 348 (387) T ss_pred cccccCCccccccceEEecCCCce------------------------eeeehhhhheehh-----------hheeeecc Confidence 222344468999999998755411 1123332211111 11233333 Q ss_pred ccchhhhhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 311 RAEYQADQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 311 ~~~~~~d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) +...-...+.+..+||+++++|++.+.+++++. T Consensus 349 ~~~~~~~~~~~~~r~d~~v~~~eA~~~l~~k~~ 381 (387) T protein:vir:93 349 DVKKGEYLFVLTAWYDQQRTLDSAFRIAKAKEN 381 (387) T ss_pred cccCCceeEEEEeeeCceeechhheEEEEeecC Confidence 333334456677799999999999988877666 No 154 >protein:vir:9927 Length: 295 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795689;genbank:gi:28876459;genbank:GeneID:1258000 Probab=98.90 E-value=3.4e-10 Score=72.46 Aligned_cols=271 Identities=13% Similarity=0.057 Sum_probs=149.1 Q ss_pred CCCCCccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEeccC-cceeeeecCCCcC Q lcl|NC_011085. 
1 MADMKGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVLG-RTRAAYLQAGQSL 79 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG-~~t~~~~~~g~~i 79 (343) ||....--. +..+ ..-+++ |++.|+..+.+.+ .+++..|..++..|+++++|... .....++..|+.| T Consensus 1 mAe~nlt~~--~dL~---~~~sid--fv~~f~~~i~~L~----~~Lgi~r~~p~a~G~tIt~pK~~~tgda~dVaEGe~I 69 (295) T protein:vir:99 1 MAEKNLNTM--ADLG---DIKSID--FVNKFSKNINDLL----KLLGVTRRETLTNDLKIQTYKWEVTLDQTDPGEGETI 69 (295) T ss_pred CCCcccccH--hhcc---Cceeeh--hhHHhhhhHHHHH----HHhccccccccccCCeEEeeeeeeecccccccCCccc Confidence 887411111 1111 233444 9999998776555 35667777788889999999865 2345789999999 Q ss_pred CCccCCCcc---ceEEEEeeeeeeeeeeccchHHH-Hh--chhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccc Q lcl|NC_011085. 80 DDKRKDIKH---TEKTIVIDGLLTADVLIYDIEDA-MN--HYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNE 153 (343) Q Consensus 80 ~~~~~~~~~---~~~~l~iD~~~~~~~~Idd~D~~-q~--~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~ 153 (343) +.+. ++. +..++++.++. . .+. ||+ |. ..|...+..+++..+|++++|..++..+..+.-.. T Consensus 70 plsk--vt~~~~~t~t~kikK~r--K-~tT--dEAIqlsGygdpvgead~qL~~~ia~kId~D~~~~lktat~t~----- 137 (295) T protein:vir:99 70 PLSK--VTRTKDKDYTVKWFKKR--R-ATT--AEAIARHGAARAITEADKRIMRELQNGIKDAFFTFLKTKPTKV----- 137 (295) T ss_pred chhh--heeeeeeeeEEEeeeec--c-ccc--HHHHHhcCCCchhHHHHHHHHHHHHHhhhHHHHHHhccCceee----- Confidence 8763 333 34566675543 2 344 455 43 35699999999999999999999987763211100 Q ss_pred cccccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhh-cCCCcCCcEEEeCHHHHHHHhccchhh--hhcccccc Q lcl|NC_011085. 154 NIAGLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTS-NYVPSADRTFYTTPEVYSAILAALMPN--AANYAALI 230 (343) Q Consensus 154 ~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~-~~VP~~gR~~vv~P~~~~~Ll~~~~~~--~~~~~~~~ 230 (343) +... -+..++.+..+...+.| .++ ..+++|+|..++.||.+-... .+...|.. 
T Consensus 138 --------------------tg~~-lq~a~a~~~~al~~f~Ee~~~---~~V~FVnP~D~a~yl~~A~~~~~~a~~fG~~ 193 (295) T protein:vir:99 138 --------------------KGVG-LQKALSASWAKLATFNEFEGS---PLVSFVSPLDVANYLGDTKVGADASNVFGMT 193 (295) T ss_pred --------------------ehhh-HHHHHHHhhhhhhhcccccCC---ceEEEEehHHHHHHHhccccccchhhhhhhh Confidence 0000 01112233222333333 333 469999999999999876543 22212333 Q ss_pred chhcceeEEEeceE-EEEecccccccccccccccccccccccccccccccc------ccccceEeEeechhhheeeeeee Q lcl|NC_011085. 231 DPERGSIRNVMGFE-VVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTK------VALDNVVGLFQHRSAVGTVKLKD 303 (343) Q Consensus 231 ~~~~G~V~~i~Gf~-V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~l~~~~~Av~~~~~~~ 303 (343) -+. ++.|++ |+.|+.+|.+..-..+... -..+-.+..++... .|-+..+|+. |-.. . T Consensus 194 ~L~-----nfLG~q~II~S~kv~~G~~~aT~~~N---i~~ay~~~~~g~l~~~f~~~~D~tglIg~~-h~~~-----~-- 257 (295) T protein:vir:99 194 LLK-----NFLGMQNVIVMPSVPEGKIYSTAVEN---LVFASLNVKGGDLGGLFADFTDETGLIAAA-RNRQ-----L-- 257 (295) T ss_pred hhh-----hhhccceEEEcccCCCceEEEeeccc---eEEEEecCCchhhhhhhhhccCcccceEEE-eccc-----c-- Confidence 333 499997 9999999987644333221 11111122212111 1222223211 1100 0 Q ss_pred eEEeeeeccchhhhhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 304 LSLERARRAEYQADQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 304 ~~~e~~~~~~~~~d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) ++-.+..-+..+... 
=+=|+|+++..+++.+ T Consensus 258 ------~~~t~et~~~~~~~l---fpE~~dgiv~~tI~~~ 288 (295) T protein:vir:99 258 ------SNLTYESVFFGANVL---FAEIPEGVVEATIEAA 288 (295) T ss_pred ------ceeeehhhhHhHHHh---cccccceEEEEEEecC Confidence 111111111122221 2447789999999777 No 155 >protein:vir:93696 Length: 364 # NCBI annotation: Bcep22gp55 # Family: family:all:974 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944284;genbank:gi:38640361;genbank:GeneID:2658350 Probab=98.90 E-value=1e-09 Score=69.90 Aligned_cols=303 Identities=12% Similarity=0.075 Sum_probs=166.5 Q ss_pred CCCCCccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhcc-Cc---------cccccc--cceEEEEeccCcc Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTN-RH---------IMRSIS--SGKSAQFPVLGRT 68 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~-~~---------~~~~i~--~G~tv~i~~iG~~ 68 (343) ||-+..+. +|+. -.++|+..+...-.+.|-|.+ ++ +..++. .|++|.|.-+... T Consensus 1 Ma~T~~~~------------~~p~--a~~~ws~~l~~~~~~~s~f~~~l~G~~~~~~I~~~~dL~k~~Gd~v~f~L~~~L 66 (364) T protein:vir:93 1 MSQTVIPF------------GDPK--AVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDRITFDLSVHL 66 (364) T ss_pred CceeccCc------------CCHH--HHHHHHHHHHHHHHhhCccccccccCCCCCcEEEeeecCCCCCceEEeeeeeec Confidence 76554433 3444 458999999887776664443 22 222332 4999999999888 Q ss_pred eeeeecCCCcCCCccCCCccceEEEEeeeeeeeeeecc---chHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhh Q lcl|NC_011085. 69 RAAYLQAGQSLDDKRKDIKHTEKTIVIDGLLTADVLIY---DIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLC 145 (343) Q Consensus 69 t~~~~~~g~~i~~~~~~~~~~~~~l~iD~~~~~~~~Id---d~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a 145 (343) +-.....++.+.+.-+.++....+|.||+.. +.|+ .+++-.+.+|+|.+.-...+.=+++..|+.++.+++.+. 
T Consensus 67 ~g~gv~Gd~~leGnee~L~~~~~~i~idq~r---~~V~~~g~ms~qRt~~dlr~~ar~~L~~w~~~~~d~~~f~~laGar 143 (364) T protein:vir:93 67 RGKPTYGDARVEGKEESLRFYQDEVRIDQVR---HSVSAGGRMSRKRTVHNIRRIARDRLGDYFYKFTDELLFIYLSGAR 143 (364) T ss_pred ccCCcccCceeeccccceeEEeeEEEEeecc---ccccccCchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccc Confidence 7666677788888778888899999999885 3453 466777899999999999999999999999998887532 Q ss_pred hcccccc--cccccc-------C-Cceeecccccc--cccchHHHHHHHHHHHHHHHHHHhhcCCC--c----------- Q lcl|NC_011085. 146 NMPAASN--ENIAGL-------G-SASILEVGAKG--DLTSPVELGKAVIAQLTIARAKLTSNYVP--S----------- 200 (343) Q Consensus 146 ~~~~~~~--~~~~g~-------~-~~~~~~~~~~~--~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP--~----------- 200 (343) ....+.. ....+. + ...++-.+.++ ...+... ..-++.|.+|...++....+ + T Consensus 144 g~~~~~~~~~~~~~~~~N~v~aPt~~r~~~~~~at~~~~l~stD--~~sl~~id~a~~~a~~~~~~~~~~~~~~Pv~~~g 221 (364) T protein:vir:93 144 GINLDFIETPDFTGYAGNPLDAPDVDHLLYGGVATSKASLAATD--IMAPLVIEKAVEKAAMMQAENPDVANMVPVSIDG 221 (364) T ss_pred ccccccccccCcccccccccCCCCCCcEEeccccCchhhccccc--cccHHHHHHHHHHHHHhCCCCCCCcccceeEecC Confidence 1110000 000000 0 01111111111 1111110 01256677777776655321 1 Q ss_pred CC-cEEEeCHHHHHHHhccc--h---hhhhcc--cc-ccchhcceeEEEeceEEEEeccccccccccccccccccccccc Q lcl|NC_011085. 201 AD-RTFYTTPEVYSAILAAL--M---PNAANY--AA-LIDPERGSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAF 271 (343) Q Consensus 201 ~g-R~~vv~P~~~~~Ll~~~--~---~~~~~~--~~-~~~~~~G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~ 271 (343) ++ -+++++|.++..|..+. . +.+.-. .| ...+-+|.+|.++|+-|++.++++..+..+.... 
T Consensus 222 ~~~yV~~l~p~q~~~Lr~~t~~~w~d~qk~A~~~~g~~nPlF~G~~gm~ngvii~~~~~vi~~~~~~~~~~--------- 292 (364) T protein:vir:93 222 DDHYVCVMSEYQATDMRTAAGGTWIDFQKAAAAAEGRNNPIFKGGLGMINNVVLHKHRNVIRFNDYGAGAN--------- 292 (364) T ss_pred cceeEEEEcchhhhhhhhcCCHHHHHHHHHhhhcccccCCceecCeeeEcCeEEeccCCcccccccccCcc--------- Confidence 12 26779999999998543 3 222111 22 2357789999999999999999975542221110 Q ss_pred cccccccccccccceEeEeechhhheeeeeee----eE-Eeeeeccchhhhhhhhhhhhccceeccc--ceEEEEecCC Q lcl|NC_011085. 272 PKTAEGDTKVALDNVVGLFQHRSAVGTVKLKD----LS-LERARRAEYQADQIIARYAMGHGGLRPE--AAGALVFTAG 343 (343) Q Consensus 272 ~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~----~~-~e~~~~~~~~~d~i~~~~~~G~~v~rpe--~~~~i~~~~g 343 (343) +.-..+|++=..|++.+.... .. .|-.+|-.+.- .|.....+|.+=.|=+ =-++|.+.-- T Consensus 293 -----------v~~~ralllGaQA~~~a~g~~~g~~~~w~Ee~~D~gn~~-~i~~~~i~G~kK~rF~~~DfGvi~idta 359 (364) T protein:vir:93 293 -----------VEAARALFMGRQAGVIAYGTANGLRFDWEETVKDYGNEP-AIAAGFIAGMKKARFNNKDFGVISIDTA 359 (364) T ss_pred -----------ccchhhheecceeeEEEeecCCCCCceeeecccCCCCch-hhhhhhHhhhhhcccCCccceEEEeccc Confidence 001112333333333333221 11 22222221111 1333333444433332 1233333222 No 156 >protein:vir:94424 Length: 387 # NCBI annotation: ORF010 # Family: family:all:658 # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240005;genbank:gi:66395666;genbank:GeneID:5133084 Probab=98.87 E-value=3.2e-11 Score=78.12 Aligned_cols=271 Identities=11% Similarity=0.049 Sum_probs=141.7 Q ss_pred CCCCCc----------cccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEecc--Ccc Q lcl|NC_011085. 1 MADMKG----------GQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVL--GRT 68 (343) Q Consensus 1 ~~~~~~----------~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~i--G~~ 68 (343) |..... .+...+ |....+| .+..+.|..++.+..+..+.+++++++.++.+. ++|++ +.. 
T Consensus 99 ~~~~~~~~~~~~~~~~~~a~~~--~~~~~gG---~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~---~~p~~~~~~~ 170 (387) T protein:vir:94 99 ILPNEFEKPSMEAQRLLHALPT--GNDSGGD---KLLPKTLSKEIVSEPFAKNQLREKARLTNIKGL---EIPRVSYTLD 170 (387) T ss_pred HhhhhHHHHHHHHHHHHhhhcc--CCCCCCc---eeechhHHHHHHHHHHhhchhhhhceeeecCCc---eeeeeeccCC Confidence 100000 000000 0001111 235588999999999988988998887776433 33432 334 Q ss_pred eeeeecCCCcCCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcc Q lcl|NC_011085. 69 RAAYLQAGQSLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMP 148 (343) Q Consensus 69 t~~~~~~g~~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~ 148 (343) ++.-...|...+.+ +++.+++++...++. .-+.|.+-=-..+.+|+.+.+.++.++++++..++.++... T Consensus 171 ~a~~v~Eg~~~~~~--~~~f~~v~l~~~k~~-~~i~iS~ell~ds~~~l~~~i~~~la~~~~~~e~~~~~~~g------- 240 (387) T protein:vir:94 171 DDDFITDVETAKEL--KAKGDTVKFTTNKFK-VFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVS------- 240 (387) T ss_pred cccccccccccccc--ccccceeeechheee-eechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcC------- Confidence 45555667766553 356677666665442 22445431122256889999999999999988776655221 Q ss_pred ccccccccccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhcccc Q lcl|NC_011085. 149 AASNENIAGLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAA 228 (343) Q Consensus 149 ~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~ 228 (343) ...|.+.+.....+ ....+.... 
++.|.++...|+....+ ...|+ +++..|..|+.-.+-.+ T Consensus 241 -----~g~g~~~g~~~~~~--~~~~~~~~~----~d~i~~~~~~l~~~y~~-na~~i-mn~~t~~~~~~~~~~~~----- 302 (387) T protein:vir:94 241 -----PKSGLEHMSFYNGS--VKEVEGADM----YDAIINALADLHEDYRD-NATIY-MRYADYVKIISVLSNGT----- 302 (387) T ss_pred -----CCccccceeeeccc--cccccccch----HHHHHHHHhccChhhhc-CCEEE-EechHHHHHHHHHhcCC----- Confidence 01111112111111 111122222 44455555566665443 34564 56555555543211111 Q ss_pred ccchhcceeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEee Q lcl|NC_011085. 229 LIDPERGSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLER 308 (343) Q Consensus 229 ~~~~~~G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~ 308 (343) ..+..|.-.+++|.+|+.++..|.- +-++|...... + ..+..+. T Consensus 303 -~~~~~~~~~~llG~PV~~~~~~~~~------------------------~~GDf~~~~~~-~----------~~~~~~~ 346 (387) T protein:vir:94 303 -TNFFDTPAEKVFGKPVVFTDAAVKP------------------------IVGDFNYFGIN-Y----------DGTTYDT 346 (387) T ss_pred -CcccccCCccccccceEEecCCCce------------------------eeechhhhhhh-h----------hhhhhee Confidence 2233344568999999998865421 01222221110 0 0112333 Q ss_pred eeccchhhhhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 309 ARRAEYQADQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 309 ~~~~~~~~d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) +++...-...++...+|++++++|++.+.++.++. 
T Consensus 347 ~~~~~~~~~~~~~~~r~Dg~v~~~~A~~~l~~ka~ 381 (387) T protein:vir:94 347 DKDVKKGEYLFVLTAWYDQQRTLDSAFRIAKAKEN 381 (387) T ss_pred cccccCCceEEEEEEEeCcEeechhheEEEEeecC Confidence 44433333456667789999999999999999888 No 157 >protein:vir:96978 Length: 387 # NCBI annotation: ORF009 # Family: family:all:658 # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239859;genbank:gi:66395517;genbank:GeneID:5133011 Probab=98.87 E-value=3.2e-11 Score=78.12 Aligned_cols=271 Identities=11% Similarity=0.049 Sum_probs=141.7 Q ss_pred CCCCCc----------cccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEecc--Ccc Q lcl|NC_011085. 1 MADMKG----------GQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVL--GRT 68 (343) Q Consensus 1 ~~~~~~----------~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~i--G~~ 68 (343) |..... .+...+ |....+| .+..+.|..++.+..+..+.+++++++.++.+. ++|++ +.. T Consensus 99 ~~~~~~~~~~~~~~~~~~a~~~--~~~~~gG---~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~---~~p~~~~~~~ 170 (387) T protein:vir:96 99 ILPNEFEKPSMEAQRLLHALPT--GNDSGGD---KLLPKTLSKEIVSEPFAKNQLREKARLTNIKGL---EIPRVSYTLD 170 (387) T ss_pred HhhhhHHHHHHHHHHHHhhhcc--CCCCCCc---eeechhHHHHHHHHHHhhchhhhhceeeecCCc---eeeeeeccCC Confidence 100000 000000 0001111 235588999999999988988998887776433 33432 334 Q ss_pred eeeeecCCCcCCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcc Q lcl|NC_011085. 69 RAAYLQAGQSLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMP 148 (343) Q Consensus 69 t~~~~~~g~~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~ 148 (343) ++.-...|...+.+ +++.+++++...++. .-+.|.+-=-..+.+|+.+.+.++.++++++..++.++... 
T Consensus 171 ~a~~v~Eg~~~~~~--~~~f~~v~l~~~k~~-~~i~iS~ell~ds~~~l~~~i~~~la~~~~~~e~~~~~~~g------- 240 (387) T protein:vir:96 171 DDDFITDVETAKEL--KAKGDTVKFTTNKFK-VFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVS------- 240 (387) T ss_pred cccccccccccccc--ccccceeeechheee-eechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcC------- Confidence 45555667766553 356677666665442 22445431122256889999999999999988776655221 Q ss_pred ccccccccccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhcccc Q lcl|NC_011085. 149 AASNENIAGLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAA 228 (343) Q Consensus 149 ~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~ 228 (343) ...|.+.+.....+ ....+.... ++.|.++...|+....+ ...|+ +++..|..|+.-.+-.+ T Consensus 241 -----~g~g~~~g~~~~~~--~~~~~~~~~----~d~i~~~~~~l~~~y~~-na~~i-mn~~t~~~~~~~~~~~~----- 302 (387) T protein:vir:96 241 -----PKSGLEHMSFYNGS--VKEVEGADM----YDAIINALADLHEDYRD-NATIY-MRYADYVKIISVLSNGT----- 302 (387) T ss_pred -----CCccccceeeeccc--cccccccch----HHHHHHHHhccChhhhc-CCEEE-EechHHHHHHHHHhcCC----- Confidence 01111112111111 111122222 44455555566665443 34564 56555555543211111 Q ss_pred ccchhcceeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEee Q lcl|NC_011085. 229 LIDPERGSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLER 308 (343) Q Consensus 229 ~~~~~~G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~ 308 (343) ..+..|.-.+++|.+|+.++..|.- +-++|...... + ..+..+. T Consensus 303 -~~~~~~~~~~llG~PV~~~~~~~~~------------------------~~GDf~~~~~~-~----------~~~~~~~ 346 (387) T protein:vir:96 303 -TNFFDTPAEKVFGKPVVFTDAAVKP------------------------IVGDFNYFGIN-Y----------DGTTYDT 346 (387) T ss_pred -CcccccCCccccccceEEecCCCce------------------------eeechhhhhhh-h----------hhhhhee Confidence 2233344568999999998865421 01222221110 0 0112333 Q ss_pred eeccchhhhhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 
309 ARRAEYQADQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 309 ~~~~~~~~d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) +++...-...++...+|++++++|++.+.++.++. T Consensus 347 ~~~~~~~~~~~~~~~r~Dg~v~~~~A~~~l~~ka~ 381 (387) T protein:vir:96 347 DKDVKKGEYLFVLTAWYDQQRTLDSAFRIAKAKEN 381 (387) T ss_pred cccccCCceEEEEEEEeCcEeechhheEEEEeecC Confidence 44433333456667789999999999999999888 No 158 >protein:vir:2685 Length: 387 # NCBI annotation: hypothetical protein # Family: family:all:658 # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075504;genbank:gi:12719433;genbank:GeneID:920169 Probab=98.87 E-value=3.2e-11 Score=78.12 Aligned_cols=271 Identities=11% Similarity=0.049 Sum_probs=141.7 Q ss_pred CCCCCc----------cccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEecc--Ccc Q lcl|NC_011085. 1 MADMKG----------GQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVL--GRT 68 (343) Q Consensus 1 ~~~~~~----------~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~i--G~~ 68 (343) |..... .+...+ |....+| .+..+.|..++.+..+..+.+++++++.++.+. ++|++ +.. T Consensus 99 ~~~~~~~~~~~~~~~~~~a~~~--~~~~~gG---~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~---~~p~~~~~~~ 170 (387) T protein:vir:26 99 ILPNEFEKPSMEAQRLLHALPT--GNDSGGD---KLLPKTLSKEIVSEPFAKNQLREKARLTNIKGL---EIPRVSYTLD 170 (387) T ss_pred HhhhhHHHHHHHHHHHHhhhcc--CCCCCCc---eeechhHHHHHHHHHHhhchhhhhceeeecCCc---eeeeeeccCC Confidence 100000 000000 0001111 235588999999999988988998887776433 33432 334 Q ss_pred eeeeecCCCcCCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcc Q lcl|NC_011085. 69 RAAYLQAGQSLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMP 148 (343) Q Consensus 69 t~~~~~~g~~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~ 148 (343) ++.-...|...+.+ +++.+++++...++. .-+.|.+-=-..+.+|+.+.+.++.++++++..++.++... 
T Consensus 171 ~a~~v~Eg~~~~~~--~~~f~~v~l~~~k~~-~~i~iS~ell~ds~~~l~~~i~~~la~~~~~~e~~~~~~~g------- 240 (387) T protein:vir:26 171 DDDFITDVETAKEL--KAKGDTVKFTTNKFK-VFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVS------- 240 (387) T ss_pred cccccccccccccc--ccccceeeechheee-eechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcC------- Confidence 45555667766553 356677666665442 22445431122256889999999999999988776655221 Q ss_pred ccccccccccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhcccc Q lcl|NC_011085. 149 AASNENIAGLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAA 228 (343) Q Consensus 149 ~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~ 228 (343) ...|.+.+.....+ ....+.... ++.|.++...|+....+ ...|+ +++..|..|+.-.+-.+ T Consensus 241 -----~g~g~~~g~~~~~~--~~~~~~~~~----~d~i~~~~~~l~~~y~~-na~~i-mn~~t~~~~~~~~~~~~----- 302 (387) T protein:vir:26 241 -----PKSGLEHMSFYNGS--VKEVEGADM----YDAIINALADLHEDYRD-NATIY-MRYADYVKIISVLSNGT----- 302 (387) T ss_pred -----CCccccceeeeccc--cccccccch----HHHHHHHHhccChhhhc-CCEEE-EechHHHHHHHHHhcCC----- Confidence 01111112111111 111122222 44455555566665443 34564 56555555543211111 Q ss_pred ccchhcceeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEee Q lcl|NC_011085. 229 LIDPERGSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLER 308 (343) Q Consensus 229 ~~~~~~G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~ 308 (343) ..+..|.-.+++|.+|+.++..|.- +-++|...... + ..+..+. T Consensus 303 -~~~~~~~~~~llG~PV~~~~~~~~~------------------------~~GDf~~~~~~-~----------~~~~~~~ 346 (387) T protein:vir:26 303 -TNFFDTPAEKVFGKPVVFTDAAVKP------------------------IVGDFNYFGIN-Y----------DGTTYDT 346 (387) T ss_pred -CcccccCCccccccceEEecCCCce------------------------eeechhhhhhh-h----------hhhhhee Confidence 2233344568999999998865421 01222221110 0 0112333 Q ss_pred eeccchhhhhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 
309 ARRAEYQADQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 309 ~~~~~~~~d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) +++...-...++...+|++++++|++.+.++.++. T Consensus 347 ~~~~~~~~~~~~~~~r~Dg~v~~~~A~~~l~~ka~ 381 (387) T protein:vir:26 347 DKDVKKGEYLFVLTAWYDQQRTLDSAFRIAKAKEN 381 (387) T ss_pred cccccCCceEEEEEEEeCcEeechhheEEEEeecC Confidence 44433333456667789999999999999999888 No 159 >protein:vir:9643 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795405;genbank:gi:28876178;genbank:GeneID:1257724 Probab=98.83 E-value=8.2e-10 Score=70.41 Aligned_cols=288 Identities=12% Similarity=0.033 Sum_probs=146.2 Q ss_pred CCCCCccccccc---------cccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEeccC-ccee Q lcl|NC_011085. 1 MADMKGGQQLGK---------DQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVLG-RTRA 70 (343) Q Consensus 1 ~~~~~~~~~~~t---------~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG-~~t~ 70 (343) +++. ....... ..+.+.+.+ -.|..+.|..++.+...+.|.++.++++.++. | .++|++-. ..++ T Consensus 59 ~~~~-~~~~lt~ee~~~~~~~~~~~~~~~g--g~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~-~-~~~i~~~~~~~~a 133 (377) T protein:vir:96 59 DLRD-KNRELTAEEIKFFNDIDKNVGGKDK--FKLLPEETMVQVFDDLVAEHPLLKVINFKNTS-L-RLKALTAETSGTA 133 (377) T ss_pred Hhcc-CCcccCHHHHHHHHHHHhcCCCCCC--ceecCHHHHHHHHHHHHhhhhhhhhceeEecC-C-ceEEEEecCCcce Confidence 1110 0000000 001111111 12455889999999999999999999888764 3 35566543 3344 Q ss_pred eeecCCCcCCCccCCCccceEEEEeeeeeeee-eeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccc Q lcl|NC_011085. 71 AYLQAGQSLDDKRKDIKHTEKTIVIDGLLTAD-VLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPA 149 (343) Q Consensus 71 ~~~~~g~~i~~~~~~~~~~~~~l~iD~~~~~~-~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~ 149 (343) .-...+..+..+ .+++..+++|.. .++.. ..|..-=-..+.+|+-+.+.++.+.++++..|+.++.- .+. . 
T Consensus 134 ~wv~e~~~~~~~-~~~~f~~i~l~~--~kl~~~~~is~~ll~ds~~~le~~i~~~l~~~~~~~~~~a~i~G--~G~---~ 205 (377) T protein:vir:96 134 VWGDIFGEIKGQ-LKQAFKEQDFSQ--FKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKG--NGL---L 205 (377) T ss_pred eEeecccccccc-cCccceeEeeee--eeEEeechhhHHHhhcchhhHHHHHHHHHHHHHHHHHhhceEec--cCC---C Confidence 444444444432 234555555554 44444 34443222236688999999999999999999988631 011 0 Q ss_pred ccccccc-----------ccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCC--C---cCCcEEEeCHHHHH Q lcl|NC_011085. 150 ASNENIA-----------GLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYV--P---SADRTFYTTPEVYS 213 (343) Q Consensus 150 ~~~~~~~-----------g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~V--P---~~gR~~vv~P~~~~ 213 (343) .+.+... +...+............+ ....+.+++.+..+...+....- | ...-+.+++|..|. T Consensus 206 ~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~a~~~mn~~t~~ 284 (377) T protein:vir:96 206 QPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLS-DLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRW 284 (377) T ss_pred cceeeeeccccccccccccccccceeeccccccccc-cCChhHHHHHHHHHHHhhccccccccccccCceEEEEchhhHH Confidence 1111111 000011111000000000 00123344444444444433211 1 11235778888877 Q ss_pred HHhccchhhhhccccccchhcceeEEEece--EEEEeccccccccccccccccccccccccccccccccccccceEeEee Q lcl|NC_011085. 214 AILAALMPNAANYAALIDPERGSIRNVMGF--EVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQ 291 (343) Q Consensus 214 ~Ll~~~~~~~~~~~~~~~~~~G~V~~i~Gf--~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~ 291 (343) .++......+ .+|.-.++.|+ +|++|+.+|.+.+. -++|+. T Consensus 285 ~~~~~~~~~~---------~~G~~~~~l~~p~~v~~s~~~p~~~i~----------------------fgdf~~------ 327 (377) T protein:vir:96 285 TLEAKFTSRN---------QFGEYVTVLPHGITILESLAVETGKAI----------------------AFVANR------ 327 (377) T ss_pred hccccccccC---------CCCCceeccCCCceEEecCCCCcccEE----------------------EEEcCc------ Confidence 6643211111 24554556554 57888888842110 012222 Q ss_pred chhhheeeeeeeeEEeeeeccch--hhhhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 
292 HRSAVGTVKLKDLSLERARRAEY--QADQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 292 ~~~Av~~~~~~~~~~e~~~~~~~--~~d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) ...+....++++.+.+... -...+++.+++++++++|++.++|.++-| T Consensus 328 ----Y~i~~r~~~~i~~~~~~~~~~d~~~f~~~~r~dG~~~d~~a~~vl~l~~~ 377 (377) T protein:vir:96 328 ----YDAFMATASTIEEYDQTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) T ss_pred ----EEEEEecccEEEeehhhhhhcCCeEEEEEEEEcCEEecCCcEEEEEEecC Confidence 2223344555665543221 22458899999999999999999999999 No 160 >protein:vir:3298 Length: 404 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:66 # MgeName: 933W # Cross-refs: genbank:acc:NP_049514;genbank:gi:9632520;genbank:GeneID:1262006 Probab=98.83 E-value=2.2e-09 Score=68.08 Aligned_cols=335 Identities=11% Similarity=0.111 Sum_probs=169.1 Q ss_pred CCCCCccccccccccccc-cccchhHHHHHHHHHHHHHHHHHhhhh---------ccCccccccc--cceEEEEeccCcc Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQ-SGGDKLALFLKVFGGEVLTAFARTSVT---------TNRHIMRSIS--SGKSAQFPVLGRT 68 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~-~~~d~~al~ie~~~g~V~~~f~~~s~~---------~~~~~~~~i~--~G~tv~i~~iG~~ 68 (343) |--...++- ...+-.+. ...-.+.-++++|.+.+...=+..+-+ +..++..++. .|++|.|.-+... T Consensus 1 ~~~~~~~~a-~~~~~~~lft~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~I~~~~dL~K~aGd~vtf~L~~~L 79 (404) T protein:vir:32 1 MTTVTSAQA-NKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKL 79 (404) T ss_pred CCCcCCcch-hhhHHHHHHHHHhcCChhHhhhhhhhhhhhhhccchhhccCCCCCccEEEeecCCCCCCcEEEEeEeeec Confidence 322211111 12211111 000011225678887754432222111 2233333443 5999999999888 Q ss_pred eeeeecCCCcCCCccCCCccceEEEEeeeeeeeeeec-cchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhc Q lcl|NC_011085. 69 RAAYLQAGQSLDDKRKDIKHTEKTIVIDGLLTADVLI-YDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNM 147 (343) Q Consensus 69 t~~~~~~g~~i~~~~~~~~~~~~~l~iD~~~~~~~~I-dd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~ 147 (343) +-.....++.+.+.-++++....+|.||+..-. +.. ..+++-.+.+|+|++.-..++.-+++..||.+|.+|+.+... 
T Consensus 80 ~g~gv~Gd~~lEGnee~L~~~s~~i~Idq~r~~-V~~~g~msqQRt~~dlr~~ar~~L~~w~~~~~d~~~~~~laG~rg~ 158 (404) T protein:vir:32 80 SKRPTMGDERVEGRGEDLSHADFSLKINQGRHL-VDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGD 158 (404) T ss_pred ccCCcccCceeeccccceeEEeeEEEEeeeccc-ccccCchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccc Confidence 766667678888887888999999999988522 222 355566678999999999999999999999999998854421 Q ss_pred ------cccc--ccccc-------ccCCce-eecccccccccchHHHHHH-HHHHHHHHHHHHhhcCCCcC-------C- Q lcl|NC_011085. 148 ------PAAS--NENIA-------GLGSAS-ILEVGAKGDLTSPVELGKA-VIAQLTIARAKLTSNYVPSA-------D- 202 (343) Q Consensus 148 ------~~~~--~~~~~-------g~~~~~-~~~~~~~~~~~~~~~~~~~-i~~~l~~a~~~Ld~~~VP~~-------g- 202 (343) ..|. +.... -.++.. ++-.+.++...+ -...+. -++.|.++++.+++..-|-. . T Consensus 159 ~~n~~~~vp~~~~~~~~~~~~N~v~APt~~r~~~~g~at~~~~-l~stD~~s~~~Id~~~~~~~~~~~pi~Pv~~~g~~~ 237 (404) T protein:vir:32 159 FVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQ-IEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDEL 237 (404) T ss_pred cccccceeeccccccccceeecccCCCCCCcEEeccCccchhh-hhhcccccHHHHHHHHHHHHHhCCCCcceEeccccc Confidence 0000 00000 000000 111111111100 001111 14566667777766433322 2 Q ss_pred ------cEEEeCHHHHHHHhccch------hhh-hcc---ccccchhcceeEEEeceEEEEecccccccccccccccccc Q lcl|NC_011085. 203 ------RTFYTTPEVYSAILAALM------PNA-ANY---AALIDPERGSIRNVMGFEVVEVPHLTAGGAGDDREDETTN 266 (343) Q Consensus 203 ------R~~vv~P~~~~~Ll~~~~------~~~-~~~---~~~~~~~~G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~ 266 (343) ++++++|.+|..|..+.. +.. +.. 
.....+-.|.++.++|+-|.+.++.|..-..........+ T Consensus 238 ~~~~~~yV~~~~p~q~~~Lr~dt~~~~w~d~q~~A~a~~rg~~nPlF~G~~gm~ngvii~~~~~~~Irf~~g~~~~~~~n 317 (404) T protein:vir:32 238 HGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSEN 317 (404) T ss_pred cCccceEEEEechHHHHHHhhCCCcHHHHHHHHHHhhccccccCCceecCeeEEcCEEEEecCCceeeecccceeeecCC Confidence 678899999999999852 211 211 1234678899999999999998887732111111111111 Q ss_pred ccccccccccccccccccceEeEeechhhheeeeeee----eE-Eeeeeccchhhhhhhhhhhhccceec-cc------c Q lcl|NC_011085. 267 QKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKD----LS-LERARRAEYQADQIIARYAMGHGGLR-PE------A 334 (343) Q Consensus 267 ~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~----~~-~e~~~~~~~~~d~i~~~~~~G~~v~r-pe------~ 334 (343) ...+.... ....+.-..+|++=..|++-|.++. .. .|-.+|-.+. -.|.....+|.+=.| |. - T Consensus 318 ~~~a~~~~----~aa~~~v~RallLGaQAl~~A~g~~~g~~~~w~Ee~~D~g~~-~~i~~~~i~G~kK~rF~~~~g~~~D 392 (404) T protein:vir:32 318 NLTATTKE----VAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNR-TEIAISWINGLKKIRFPEKSGKMQD 392 (404) T ss_pred cccccccc----ccccccchhheeecceeEEEEeeccCCCCceeEeeccccCch-hhhhhHHHhhhhhccccCCCCceee Confidence 11111110 0111111223444444444443331 11 2322332222 245555666777666 42 2 Q ss_pred eEEEEecCC Q lcl|NC_011085. 335 AGALVFTAG 343 (343) Q Consensus 335 ~~~i~~~~g 343 (343) -++|.+.-- T Consensus 393 fGvi~idta 401 (404) T protein:vir:32 393 HGVIAVDTA 401 (404) T ss_pred EEEEEeccc Confidence 333333333 No 161 >protein:vir:104439 Length: 404 # NCBI annotation: putative virion structural protein # Family: family:all:974 # MgeID: mge:1471 # MgeName: 86 # Cross-refs: genbank:acc:YP_794063;genbank:gi:116222008;genbank:GeneID:4397504 Probab=98.83 E-value=2.2e-09 Score=68.08 Aligned_cols=335 Identities=11% Similarity=0.111 Sum_probs=169.1 Q ss_pred CCCCCccccccccccccc-cccchhHHHHHHHHHHHHHHHHHhhhh---------ccCccccccc--cceEEEEeccCcc Q lcl|NC_011085. 
1 MADMKGGQQLGKDQGKGQ-SGGDKLALFLKVFGGEVLTAFARTSVT---------TNRHIMRSIS--SGKSAQFPVLGRT 68 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~-~~~d~~al~ie~~~g~V~~~f~~~s~~---------~~~~~~~~i~--~G~tv~i~~iG~~ 68 (343) |--...++- ...+-.+. ...-.+.-++++|.+.+...=+..+-+ +..++..++. .|++|.|.-+... T Consensus 1 ~~~~~~~~a-~~~~~~~lft~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~I~~~~dL~K~aGd~vtf~L~~~L 79 (404) T protein:vir:10 1 MTTVTSAQA-NKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKL 79 (404) T ss_pred CCCcCCcch-hhhHHHHHHHHHhcCChhHhhhhhhhhhhhhhccchhhccCCCCCccEEEeecCCCCCCcEEEEeEeeec Confidence 322211111 12211111 000011225678887754432222111 2233333443 5999999999888 Q ss_pred eeeeecCCCcCCCccCCCccceEEEEeeeeeeeeeec-cchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhc Q lcl|NC_011085. 69 RAAYLQAGQSLDDKRKDIKHTEKTIVIDGLLTADVLI-YDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNM 147 (343) Q Consensus 69 t~~~~~~g~~i~~~~~~~~~~~~~l~iD~~~~~~~~I-dd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~ 147 (343) +-.....++.+.+.-++++....+|.||+..-. +.. ..+++-.+.+|+|++.-..++.-+++..||.+|.+|+.+... T Consensus 80 ~g~gv~Gd~~lEGnee~L~~~s~~i~Idq~r~~-V~~~g~msqQRt~~dlr~~ar~~L~~w~~~~~d~~~~~~laG~rg~ 158 (404) T protein:vir:10 80 SKRPTMGDERVEGRGEDLSHADFSLKINQGRHL-VDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGD 158 (404) T ss_pred ccCCcccCceeeccccceeEEeeEEEEeeeccc-ccccCchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccc Confidence 766667678888887888999999999988522 222 355566678999999999999999999999999998854421 Q ss_pred ------cccc--ccccc-------ccCCce-eecccccccccchHHHHHH-HHHHHHHHHHHHhhcCCCcC-------C- Q lcl|NC_011085. 148 ------PAAS--NENIA-------GLGSAS-ILEVGAKGDLTSPVELGKA-VIAQLTIARAKLTSNYVPSA-------D- 202 (343) Q Consensus 148 ------~~~~--~~~~~-------g~~~~~-~~~~~~~~~~~~~~~~~~~-i~~~l~~a~~~Ld~~~VP~~-------g- 202 (343) ..|. +.... -.++.. ++-.+.++...+ -...+. -++.|.++++.+++..-|-. . 
T Consensus 159 ~~n~~~~vp~~~~~~~~~~~~N~v~APt~~r~~~~g~at~~~~-l~stD~~s~~~Id~~~~~~~~~~~pi~Pv~~~g~~~ 237 (404) T protein:vir:10 159 FVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQ-IEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDEL 237 (404) T ss_pred cccccceeeccccccccceeecccCCCCCCcEEeccCccchhh-hhhcccccHHHHHHHHHHHHHhCCCCcceEeccccc Confidence 0000 00000 000000 111111111100 001111 14566667777766433322 2 Q ss_pred ------cEEEeCHHHHHHHhccch------hhh-hcc---ccccchhcceeEEEeceEEEEecccccccccccccccccc Q lcl|NC_011085. 203 ------RTFYTTPEVYSAILAALM------PNA-ANY---AALIDPERGSIRNVMGFEVVEVPHLTAGGAGDDREDETTN 266 (343) Q Consensus 203 ------R~~vv~P~~~~~Ll~~~~------~~~-~~~---~~~~~~~~G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~ 266 (343) ++++++|.+|..|..+.. +.. +.. .....+-.|.++.++|+-|.+.++.|..-..........+ T Consensus 238 ~~~~~~yV~~~~p~q~~~Lr~dt~~~~w~d~q~~A~a~~rg~~nPlF~G~~gm~ngvii~~~~~~~Irf~~g~~~~~~~n 317 (404) T protein:vir:10 238 HGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSEN 317 (404) T ss_pred cCccceEEEEechHHHHHHhhCCCcHHHHHHHHHHhhccccccCCceecCeeEEcCEEEEecCCceeeecccceeeecCC Confidence 678899999999999852 211 211 1234678899999999999998887732111111111111 Q ss_pred ccccccccccccccccccceEeEeechhhheeeeeee----eE-Eeeeeccchhhhhhhhhhhhccceec-cc------c Q lcl|NC_011085. 267 QKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKD----LS-LERARRAEYQADQIIARYAMGHGGLR-PE------A 334 (343) Q Consensus 267 ~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~----~~-~e~~~~~~~~~d~i~~~~~~G~~v~r-pe------~ 334 (343) ...+.... ....+.-..+|++=..|++-|.++. .. .|-.+|-.+. -.|.....+|.+=.| |. - T Consensus 318 ~~~a~~~~----~aa~~~v~RallLGaQAl~~A~g~~~g~~~~w~Ee~~D~g~~-~~i~~~~i~G~kK~rF~~~~g~~~D 392 (404) T protein:vir:10 318 NLTATTKE----VAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNR-TEIAISWINGLKKIRFPEKSGKMQD 392 (404) T ss_pred cccccccc----ccccccchhheeecceeEEEEeeccCCCCceeEeeccccCch-hhhhhHHHhhhhhccccCCCCceee Confidence 11111110 0111111223444444444443331 11 2322332222 245555666777666 42 2 Q ss_pred eEEEEecCC Q lcl|NC_011085. 
335 AGALVFTAG 343 (343) Q Consensus 335 ~~~i~~~~g 343 (343) -++|.+.-- T Consensus 393 fGvi~idta 401 (404) T protein:vir:10 393 HGVIAVDTA 401 (404) T ss_pred EEEEEeccc Confidence 333333333 No 162 >protein:vir:10123 Length: 404 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:180 # MgeName: Stx2 converting bacteriophage II # Cross-refs: genbank:acc:NP_859253;genbank:gi:32171009;genbank:GeneID:2653345 Probab=98.83 E-value=2.2e-09 Score=68.08 Aligned_cols=335 Identities=11% Similarity=0.111 Sum_probs=169.1 Q ss_pred CCCCCccccccccccccc-cccchhHHHHHHHHHHHHHHHHHhhhh---------ccCccccccc--cceEEEEeccCcc Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQ-SGGDKLALFLKVFGGEVLTAFARTSVT---------TNRHIMRSIS--SGKSAQFPVLGRT 68 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~-~~~d~~al~ie~~~g~V~~~f~~~s~~---------~~~~~~~~i~--~G~tv~i~~iG~~ 68 (343) |--...++- ...+-.+. ...-.+.-++++|.+.+...=+..+-+ +..++..++. .|++|.|.-+... T Consensus 1 ~~~~~~~~a-~~~~~~~lft~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~I~~~~dL~K~aGd~vtf~L~~~L 79 (404) T protein:vir:10 1 MTTVTSAQA-NKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKL 79 (404) T ss_pred CCCcCCcch-hhhHHHHHHHHHhcCChhHhhhhhhhhhhhhhccchhhccCCCCCccEEEeecCCCCCCcEEEEeEeeec Confidence 322211111 12211111 000011225678887754432222111 2233333443 5999999999888 Q ss_pred eeeeecCCCcCCCccCCCccceEEEEeeeeeeeeeec-cchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhc Q lcl|NC_011085. 69 RAAYLQAGQSLDDKRKDIKHTEKTIVIDGLLTADVLI-YDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNM 147 (343) Q Consensus 69 t~~~~~~g~~i~~~~~~~~~~~~~l~iD~~~~~~~~I-dd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~ 147 (343) +-.....++.+.+.-++++....+|.||+..-. +.. ..+++-.+.+|+|++.-..++.-+++..||.+|.+|+.+... 
T Consensus 80 ~g~gv~Gd~~lEGnee~L~~~s~~i~Idq~r~~-V~~~g~msqQRt~~dlr~~ar~~L~~w~~~~~d~~~~~~laG~rg~ 158 (404) T protein:vir:10 80 SKRPTMGDERVEGRGEDLSHADFSLKINQGRHL-VDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGD 158 (404) T ss_pred ccCCcccCceeeccccceeEEeeEEEEeeeccc-ccccCchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccc Confidence 766667678888887888999999999988522 222 355566678999999999999999999999999998854421 Q ss_pred ------cccc--ccccc-------ccCCce-eecccccccccchHHHHHH-HHHHHHHHHHHHhhcCCCcC-------C- Q lcl|NC_011085. 148 ------PAAS--NENIA-------GLGSAS-ILEVGAKGDLTSPVELGKA-VIAQLTIARAKLTSNYVPSA-------D- 202 (343) Q Consensus 148 ------~~~~--~~~~~-------g~~~~~-~~~~~~~~~~~~~~~~~~~-i~~~l~~a~~~Ld~~~VP~~-------g- 202 (343) ..|. +.... -.++.. ++-.+.++...+ -...+. -++.|.++++.+++..-|-. . T Consensus 159 ~~n~~~~vp~~~~~~~~~~~~N~v~APt~~r~~~~g~at~~~~-l~stD~~s~~~Id~~~~~~~~~~~pi~Pv~~~g~~~ 237 (404) T protein:vir:10 159 FVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQ-IEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDEL 237 (404) T ss_pred cccccceeeccccccccceeecccCCCCCCcEEeccCccchhh-hhhcccccHHHHHHHHHHHHHhCCCCcceEeccccc Confidence 0000 00000 000000 111111111100 001111 14566667777766433322 2 Q ss_pred ------cEEEeCHHHHHHHhccch------hhh-hcc---ccccchhcceeEEEeceEEEEecccccccccccccccccc Q lcl|NC_011085. 203 ------RTFYTTPEVYSAILAALM------PNA-ANY---AALIDPERGSIRNVMGFEVVEVPHLTAGGAGDDREDETTN 266 (343) Q Consensus 203 ------R~~vv~P~~~~~Ll~~~~------~~~-~~~---~~~~~~~~G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~ 266 (343) ++++++|.+|..|..+.. +.. +.. 
.....+-.|.++.++|+-|.+.++.|..-..........+ T Consensus 238 ~~~~~~yV~~~~p~q~~~Lr~dt~~~~w~d~q~~A~a~~rg~~nPlF~G~~gm~ngvii~~~~~~~Irf~~g~~~~~~~n 317 (404) T protein:vir:10 238 HGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSEN 317 (404) T ss_pred cCccceEEEEechHHHHHHhhCCCcHHHHHHHHHHhhccccccCCceecCeeEEcCEEEEecCCceeeecccceeeecCC Confidence 678899999999999852 211 211 1234678899999999999998887732111111111111 Q ss_pred ccccccccccccccccccceEeEeechhhheeeeeee----eE-Eeeeeccchhhhhhhhhhhhccceec-cc------c Q lcl|NC_011085. 267 QKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKD----LS-LERARRAEYQADQIIARYAMGHGGLR-PE------A 334 (343) Q Consensus 267 ~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~----~~-~e~~~~~~~~~d~i~~~~~~G~~v~r-pe------~ 334 (343) ...+.... ....+.-..+|++=..|++-|.++. .. .|-.+|-.+. -.|.....+|.+=.| |. - T Consensus 318 ~~~a~~~~----~aa~~~v~RallLGaQAl~~A~g~~~g~~~~w~Ee~~D~g~~-~~i~~~~i~G~kK~rF~~~~g~~~D 392 (404) T protein:vir:10 318 NLTATTKE----VAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNR-TEIAISWINGLKKIRFPEKSGKMQD 392 (404) T ss_pred cccccccc----ccccccchhheeecceeEEEEeeccCCCCceeEeeccccCch-hhhhhHHHhhhhhccccCCCCceee Confidence 11111110 0111111223444444444443331 11 2322332222 245555666777666 42 2 Q ss_pred eEEEEecCC Q lcl|NC_011085. 335 AGALVFTAG 343 (343) Q Consensus 335 ~~~i~~~~g 343 (343) -++|.+.-- T Consensus 393 fGvi~idta 401 (404) T protein:vir:10 393 HGVIAVDTA 401 (404) T ss_pred EEEEEeccc Confidence 333333333 No 163 >protein:vir:819 Length: 404 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:16 # MgeName: VT2-Sa # Cross-refs: genbank:acc:NP_050552;genbank:gi:9633449;genbank:GeneID:1262254 Probab=98.83 E-value=2.2e-09 Score=68.08 Aligned_cols=335 Identities=11% Similarity=0.111 Sum_probs=169.1 Q ss_pred CCCCCccccccccccccc-cccchhHHHHHHHHHHHHHHHHHhhhh---------ccCccccccc--cceEEEEeccCcc Q lcl|NC_011085. 
1 MADMKGGQQLGKDQGKGQ-SGGDKLALFLKVFGGEVLTAFARTSVT---------TNRHIMRSIS--SGKSAQFPVLGRT 68 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~-~~~d~~al~ie~~~g~V~~~f~~~s~~---------~~~~~~~~i~--~G~tv~i~~iG~~ 68 (343) |--...++- ...+-.+. ...-.+.-++++|.+.+...=+..+-+ +..++..++. .|++|.|.-+... T Consensus 1 ~~~~~~~~a-~~~~~~~lft~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~I~~~~dL~K~aGd~vtf~L~~~L 79 (404) T protein:vir:81 1 MTTVTSAQA-NKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKL 79 (404) T ss_pred CCCcCCcch-hhhHHHHHHHHHhcCChhHhhhhhhhhhhhhhccchhhccCCCCCccEEEeecCCCCCCcEEEEeEeeec Confidence 322211111 12211111 000011225678887754432222111 2233333443 5999999999888 Q ss_pred eeeeecCCCcCCCccCCCccceEEEEeeeeeeeeeec-cchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhc Q lcl|NC_011085. 69 RAAYLQAGQSLDDKRKDIKHTEKTIVIDGLLTADVLI-YDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNM 147 (343) Q Consensus 69 t~~~~~~g~~i~~~~~~~~~~~~~l~iD~~~~~~~~I-dd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~ 147 (343) +-.....++.+.+.-++++....+|.||+..-. +.. ..+++-.+.+|+|++.-..++.-+++..||.+|.+|+.+... T Consensus 80 ~g~gv~Gd~~lEGnee~L~~~s~~i~Idq~r~~-V~~~g~msqQRt~~dlr~~ar~~L~~w~~~~~d~~~~~~laG~rg~ 158 (404) T protein:vir:81 80 SKRPTMGDERVEGRGEDLSHADFSLKINQGRHL-VDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGD 158 (404) T ss_pred ccCCcccCceeeccccceeEEeeEEEEeeeccc-ccccCchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccc Confidence 766667678888887888999999999988522 222 355566678999999999999999999999999998854421 Q ss_pred ------cccc--ccccc-------ccCCce-eecccccccccchHHHHHH-HHHHHHHHHHHHhhcCCCcC-------C- Q lcl|NC_011085. 148 ------PAAS--NENIA-------GLGSAS-ILEVGAKGDLTSPVELGKA-VIAQLTIARAKLTSNYVPSA-------D- 202 (343) Q Consensus 148 ------~~~~--~~~~~-------g~~~~~-~~~~~~~~~~~~~~~~~~~-i~~~l~~a~~~Ld~~~VP~~-------g- 202 (343) ..|. +.... -.++.. ++-.+.++...+ -...+. -++.|.++++.+++..-|-. . 
T Consensus 159 ~~n~~~~vp~~~~~~~~~~~~N~v~APt~~r~~~~g~at~~~~-l~stD~~s~~~Id~~~~~~~~~~~pi~Pv~~~g~~~ 237 (404) T protein:vir:81 159 FVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQ-IEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDEL 237 (404) T ss_pred cccccceeeccccccccceeecccCCCCCCcEEeccCccchhh-hhhcccccHHHHHHHHHHHHHhCCCCcceEeccccc Confidence 0000 00000 000000 111111111100 001111 14566667777766433322 2 Q ss_pred ------cEEEeCHHHHHHHhccch------hhh-hcc---ccccchhcceeEEEeceEEEEecccccccccccccccccc Q lcl|NC_011085. 203 ------RTFYTTPEVYSAILAALM------PNA-ANY---AALIDPERGSIRNVMGFEVVEVPHLTAGGAGDDREDETTN 266 (343) Q Consensus 203 ------R~~vv~P~~~~~Ll~~~~------~~~-~~~---~~~~~~~~G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~ 266 (343) ++++++|.+|..|..+.. +.. +.. .....+-.|.++.++|+-|.+.++.|..-..........+ T Consensus 238 ~~~~~~yV~~~~p~q~~~Lr~dt~~~~w~d~q~~A~a~~rg~~nPlF~G~~gm~ngvii~~~~~~~Irf~~g~~~~~~~n 317 (404) T protein:vir:81 238 HGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSEN 317 (404) T ss_pred cCccceEEEEechHHHHHHhhCCCcHHHHHHHHHHhhccccccCCceecCeeEEcCEEEEecCCceeeecccceeeecCC Confidence 678899999999999852 211 211 1234678899999999999998887732111111111111 Q ss_pred ccccccccccccccccccceEeEeechhhheeeeeee----eE-Eeeeeccchhhhhhhhhhhhccceec-cc------c Q lcl|NC_011085. 267 QKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKD----LS-LERARRAEYQADQIIARYAMGHGGLR-PE------A 334 (343) Q Consensus 267 ~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~----~~-~e~~~~~~~~~d~i~~~~~~G~~v~r-pe------~ 334 (343) ...+.... ....+.-..+|++=..|++-|.++. .. .|-.+|-.+. -.|.....+|.+=.| |. - T Consensus 318 ~~~a~~~~----~aa~~~v~RallLGaQAl~~A~g~~~g~~~~w~Ee~~D~g~~-~~i~~~~i~G~kK~rF~~~~g~~~D 392 (404) T protein:vir:81 318 NLTATTKE----VAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNR-TEIAISWINGLKKIRFPEKSGKMQD 392 (404) T ss_pred cccccccc----ccccccchhheeecceeEEEEeeccCCCCceeEeeccccCch-hhhhhHHHhhhhhccccCCCCceee Confidence 11111110 0111111223444444444443331 11 2322332222 245555666777666 42 2 Q ss_pred eEEEEecCC Q lcl|NC_011085. 
335 AGALVFTAG 343 (343) Q Consensus 335 ~~~i~~~~g 343 (343) -++|.+.-- T Consensus 393 fGvi~idta 401 (404) T protein:vir:81 393 HGVIAVDTA 401 (404) T ss_pred EEEEEeccc Confidence 333333333 No 164 >protein:vir:108211 Length: 318 # NCBI annotation: gp9 # Family: family:all:6420 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552338;genbank:gi:160700658;genbank:GeneID:5758931 Probab=98.79 E-value=2.1e-09 Score=68.17 Aligned_cols=293 Identities=12% Similarity=0.030 Sum_probs=158.7 Q ss_pred CCCCCccccccccccccccccchhHHHH-HHHHHHHHHHHHHhhhhccCcccc-ccccceEEEE----eccCcceeeeec Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQSGGDKLALFL-KVFGGEVLTAFARTSVTTNRHIMR-SISSGKSAQF----PVLGRTRAAYLQ 74 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~i-e~~~g~V~~~f~~~s~~~~~~~~~-~i~~G~tv~i----~~iG~~t~~~~~ 74 (343) |-+-+ .+ +....++ .=.++.|.= ..|-........+...+.+..-.+ .-+++-+|+| |........+.. T Consensus 1 ~~~~~---~i-~s~~~~~-~itv~~ll~~P~~I~~~i~e~~~~~~iad~lf~~~~a~~~~~v~f~~~~p~~~~~d~e~Va 75 (318) T protein:vir:10 1 MTAPT---GI-VSVSDGP-AITVRELVGNPLWIPTALKKMMVNQFISESLFRNGGANPNGVVAYNEGNPSFLEDDVADVA 75 (318) T ss_pred CCCCC---cc-eeeecCC-ceehHHhhCCchhHHHHHHHHHhccchhhhhhhcccccccceeEEEecccccccCcHhhcc Confidence 33331 11 1112221 112222211 122222222222333334333222 3455667888 445566777888 Q ss_pred CCCcCCCccCCCccceEEE-EeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccc Q lcl|NC_011085. 75 AGQSLDDKRKDIKHTEKTI-VIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNE 153 (343) Q Consensus 75 ~g~~i~~~~~~~~~~~~~l-~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~ 153 (343) +|.+++-.. ..+.+..+ .+.++ --.+.|.|--......|.++...++++.+++++.|+.++..+..+.-...+. 
T Consensus 76 EggEiP~~~--~~~G~~~ia~~~K~-G~~~~vS~Em~~~n~~~~v~r~~~~l~Nti~r~~d~~a~dal~sa~t~~~~~-- 150 (318) T protein:vir:10 76 EFGEIPVSA--GARGLPRTAFAVKK-ALGVRVSKEMIDENRVGAVNDQMLQLRNTFIRANDRSAKALLQSPIVPTLAV-- 150 (318) T ss_pred CcccccccC--CCCCchhhhhhehh-ccceeccHHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccC-- Confidence 898887543 33333333 33333 3568888877777899999999999999999999999887654322111111 Q ss_pred cccccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhcccccc--- Q lcl|NC_011085. 154 NIAGLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAALI--- 230 (343) Q Consensus 154 ~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~--- 230 (343) .+.+.++... ..+..+..+..+.....+..+...-.+.+-.=.--.+|++|..|..|++++.+... |.+.. T Consensus 151 -s~~w~~~~~~----~~d~~~A~e~v~~a~~~~~~a~~~~~~~~~GY~pdtIVlhP~~~~~l~~n~~~~~~-y~~~a~~~ 224 (318) T protein:vir:10 151 -PTAWDNGGKV----RTDIAIAIEQISTAAPTAYPAGVGSSDEYFGFIPDTIVMHYALLPILMDNENFMKV-YERNANYV 224 (318) T ss_pred -CcCCCCcccc----cccchhhhhhhhhhhhhhhhhhhhhhhhccCccceeeEECHHHHHHHhcchhhhhh-hhccchhh Confidence 1111111100 01111111111111111111111111111111123899999999999998776542 21111 Q ss_pred ---chhccee-EEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeee-eeeeE Q lcl|NC_011085. 231 ---DPERGSI-RNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVK-LKDLS 305 (343) Q Consensus 231 ---~~~~G~V-~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~-~~~~~ 305 (343) .-..|.+ ++++|++|+.|+++|... ++++++..+|+.. ..+++ T Consensus 225 ~~~~~~tg~~~g~~lGl~vi~s~~~p~~~--------------------------------alvlq~g~vG~~~d~~pl~ 272 (318) T protein:vir:10 225 STAPDWTGNFPGSVMGLNVIRSRTFPIDR--------------------------------VLIMERGTVGFYSDTRPLQ 272 (318) T ss_pred hhcccccccccceeeceEEeecCccCCCe--------------------------------eEEEecCCcceeeccccce Confidence 1113443 678999999999999421 2555666666542 45577 Q ss_pred Eeeeecc-------chhhhhhhhhhhhccceecccceEEEE---ec Q lcl|NC_011085. 
306 LERARRA-------EYQADQIIARYAMGHGGLRPEAAGALV---FT 341 (343) Q Consensus 306 ~e~~~~~-------~~~~d~i~~~~~~G~~v~rpe~~~~i~---~~ 341 (343) ++.+|.+ ....|.++.++.....|.+|.++..|+ .+ T Consensus 273 ~t~~~~egg~~~g~~~~s~~~~~~~~~~~~V~~PkA~~~itgi~~~ 318 (318) T protein:vir:10 273 FTALYPEGNGPNGGPTESYRADASHKRALAVDQPKAALWLTGIVTP 318 (318) T ss_pred eeecccCCCCCCCCcchhhheehheeeeeeeeCcceeEEEeeccCC Confidence 7888865 677899999999999999999988765 22 No 165 >protein:vir:100632 Length: 381 # NCBI annotation: 77ORF006 # Family: family:all:635 # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958606;genbank:gi:41189521;genbank:GeneID:2743778 Probab=98.74 E-value=3.5e-09 Score=66.97 Aligned_cols=288 Identities=11% Similarity=0.032 Sum_probs=143.6 Q ss_pred CCCCCccccccc---------cccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEeccCcceee Q lcl|NC_011085. 1 MADMKGGQQLGK---------DQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVLGRTRAA 71 (343) Q Consensus 1 ~~~~~~~~~~~t---------~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG~~t~~ 71 (343) .....+.+.... +.+. ..+|. -|..+.|..++.+...+.|.++.+.++.++ +|. .++++....... T Consensus 56 ~~~~~~~~~l~~~e~~~~~~~~~~t-~~~Gg--~lvP~~~~~~I~~~l~~~spir~~a~v~~~-~~~-~~i~~~~~~~~a 130 (381) T protein:vir:10 56 SSLPKSAQTLSANQRNFFMDINKSV-GYKEE--KLLPEETIDRIFEDLTTNHPLLADLGIKNA-GLR-LKFLKSETSGVA 130 (381) T ss_pred HHhcccccccCHHHHHHHHHHhhcC-CCCCc--eecCHHHHHHHHHHHHhhcceeeeeeeEec-Ccc-eEEEeecCCcce Confidence 000111111000 0111 11111 245599999999999999999999988776 343 455554333322 Q ss_pred ee-cCCCcCCCccCCCccceEEEEeeeeeeee-eeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccc Q lcl|NC_011085. 72 YL-QAGQSLDDKRKDIKHTEKTIVIDGLLTAD-VLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPA 149 (343) Q Consensus 72 ~~-~~g~~i~~~~~~~~~~~~~l~iD~~~~~~-~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~ 149 (343) .. ..+..+..+ .+++.++++| ...++.. ..|..-=-..+.+|+-+.+..+.++++++..|+.++.- .++ . 
T Consensus 131 ~W~~e~~~~~~~-~~~~f~~i~l--~~~kl~a~i~is~elL~Ds~~~le~~i~~~la~~~a~~~~~afi~G--dG~---~ 202 (381) T protein:vir:10 131 VWGKIYGEIKGQ-LDAAFSEETA--IQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKG--TGK---D 202 (381) T ss_pred EEeecccccccc-cCccceeEee--cceeEEeeccccHHHHhccHHHHHHHHHHHHHHHHHHHhhceeEec--ccC---C Confidence 22 222233322 1234444444 4444444 33432111225678999999999999999999887621 111 1 Q ss_pred cccccccccCCceeeccccccc--------ccchHHHHHHHHHHHHHHHHHHhhcCC-CcCCcEEEeCHHHHHHHhccch Q lcl|NC_011085. 150 ASNENIAGLGSASILEVGAKGD--------LTSPVELGKAVIAQLTIARAKLTSNYV-PSADRTFYTTPEVYSAILAALM 220 (343) Q Consensus 150 ~~~~~~~g~~~~~~~~~~~~~~--------~~~~~~~~~~i~~~l~~a~~~Ld~~~V-P~~gR~~vv~P~~~~~Ll~~~~ 220 (343) .+.+......++.....+...+ ..++...+..+.+.+.........+.. +..+.+++++|..+..|+.... T Consensus 203 qP~Gil~~~~~~~~~~~g~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~vmn~~t~~~l~~~~~ 282 (381) T protein:vir:10 203 QPIGLNRQVQKGVSVTDGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYT 282 (381) T ss_pred CceeeeecCCccccccccccccccccccccccchhhHHHHHHHHHHhhhhhhccccccccCceEEEEchhhHHhhccccc Confidence 1112211111111111111110 112222233333322222222222222 3445778899999888865432 Q ss_pred hhhhccccccchhcce-eEEE-eceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhhee Q lcl|NC_011085. 221 PNAANYAALIDPERGS-IRNV-MGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGT 298 (343) Q Consensus 221 ~~~~~~~~~~~~~~G~-V~~i-~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~ 298 (343) ..+ .+|. |..+ .|.+|++++.+|.+.+ .-++|+.. + - T Consensus 283 ~~~---------~~G~~v~~lp~g~~vv~~~~~p~~~i----------------------~fGDfs~Y--~--------i 321 (381) T protein:vir:10 283 HLN---------ANGVYVTALPFNLNVIESTVQEAGKV----------------------LTYVKGLY--D--------G 321 (381) T ss_pred cCC---------CCCceeecCCCCceeEEcCCCCcCcE----------------------EEEEcccE--E--------E Confidence 221 1222 1111 4778999999984321 01233321 1 1 Q ss_pred eeeeeeEEeeeeccchhh---hhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 
299 VKLKDLSLERARRAEYQA---DQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 299 ~~~~~~~~e~~~~~~~~~---d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) +....++++.+.+. +|. ..+++.+++++++++|++.+++.++.= T Consensus 322 ~~r~~~~i~~~~~~-~~~~d~~~f~a~~r~dG~~~~~~A~~v~~l~~~ 368 (381) T protein:vir:10 322 YLAGGINVQKFKET-LALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLK 368 (381) T ss_pred EEecccEEEeechh-hhhcCceEEEEEEEEcCEEecCCcEEEEEEeec Confidence 23344455544332 232 368888999999999999999888743 No 166 >protein:vir:4197 Length: 314 # NCBI annotation: putative structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071822;genbank:gi:11863105;genbank:GeneID:1257607 Probab=98.74 E-value=3.8e-09 Score=66.77 Aligned_cols=295 Identities=16% Similarity=0.141 Sum_probs=164.5 Q ss_pred CCCCCcccccc-----ccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEeccCcce--eeee Q lcl|NC_011085. 1 MADMKGGQQLG-----KDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVLGRTR--AAYL 73 (343) Q Consensus 1 ~~~~~~~~~~~-----t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG~~t--~~~~ 73 (343) |==++-..++. +..+. | -|-.++++ ++.+..++.|.++.+.++.+-.+..+..|+.+|... .... T Consensus 1 ~~~~~~~~~~~k~it~~d~~g----G---~L~P~~~~-~~i~~l~e~s~i~~~a~vi~t~~s~~~~i~~i~~g~~~~~~~ 72 (314) T protein:vir:41 1 MDFLNKPFQITPKIDVPDLGK----G---ILAVQRFG-EFVREVRENSAIIKDARVLNALKSYEVDISRISLGVELEPGR 72 (314) T ss_pred CchhhhHHHhhcccccccCCC----c---eeChHHHH-HHHHHHHhccchhhheeeecccCccceeecccccCccccccc Confidence 43333222221 11111 1 12337775 677888999999999886543344667888887431 1222 Q ss_pred cCCCc-CCCccCCCccceEEEEeeeeeeeeeeccchHHHHh---chhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccc Q lcl|NC_011085. 74 QAGQS-LDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMN---HYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPA 149 (343) Q Consensus 74 ~~g~~-i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~---~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~ 149 (343) ..++. -..+..+++.++.+|..-+... .+.|.+ |..+- ..|+.+.++.+.++++++.....++.- .++..+. 
T Consensus 73 ~~~~~~~~~~~~~~tf~~~~l~~~kl~~-~v~is~-e~L~D~a~~~~le~~i~~~~Ae~~g~~~~~~~~nG--dg~~~s~ 148 (314) T protein:vir:41 73 NTSGTKVAPTADEVTVSTNTLEMKELVT-KVVLED-EALEDNIEQSAFEQTITSLLASGVTYDLECFFLHA--DSSLTTG 148 (314) T ss_pred ccccCCccCCcccccccceeeeeEEEEE-eecccH-HHHHhhhchhhHHHHHHHHHHHHHHHHHHHHhhcc--ccCCcCc Confidence 21111 1112234566777777765543 355633 22222 248999999999999999888766532 1111111 Q ss_pred -cccccccccCC---ceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcC-CcEEEeCHHHHHHHhccchhhhh Q lcl|NC_011085. 150 -ASNENIAGLGS---ASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSA-DRTFYTTPEVYSAILAALMPNAA 224 (343) Q Consensus 150 -~~~~~~~g~~~---~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~-gR~~vv~P~~~~~Ll~~~~~~~~ 224 (343) |-...+.|+.. +.++..++. + +....+.+.++...|..+.--.. .-..+++++.+..+.+-. -.+. T Consensus 149 ~~~~~~p~G~l~~a~~~~~~~~~~-~-------~~~~~~~~~~l~~sl~~~yr~~~~~~~~~m~~~t~~~~r~~l-~~~~ 219 (314) T protein:vir:41 149 RELYRINDGWMKLAGNQYTDAEPE-D-------ENWPLNLFDGMMDELDTRYLQLKPRMKFYVSNEIYNGYRKQL-LVRE 219 (314) T ss_pred ccchhcchhhhhhcccceeecCcc-c-------cccHHHHHHHHHHhcCchhhcCCCceEEEecHHHHHHHHHHH-hccC Confidence 11112334321 111111111 1 11223444445555655432111 224556888887765421 1123 Q ss_pred ccccccchhcceeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeee Q lcl|NC_011085. 225 NYAALIDPERGSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDL 304 (343) Q Consensus 225 ~~~~~~~~~~G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~ 304 (343) .+.++..+..|....+.|++|+.++.+|..+... .+.++.+++-+..+....+ T Consensus 220 ~~l~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~---------------------------~~i~fgd~~nlv~~~~~~i 272 (314) T protein:vir:41 220 TGLGDSALIGATGLQYDGIPIQYVPALDALGDDK---------------------------ARALLTVPTNLVYGFWRNI 272 (314) T ss_pred CcccchhhhCCCCceecceeeEecccccccCCCC---------------------------ceEEEechhheEEEeecee Confidence 3456667778888899999999999998432211 1225556666555667778 Q ss_pred EEeeeeccchhhhhhhhhhhhccceecccceEEEEec---CC Q lcl|NC_011085. 
305 SLERARRAEYQADQIIARYAMGHGGLRPEAAGALVFT---AG 343 (343) Q Consensus 305 ~~e~~~~~~~~~d~i~~~~~~G~~v~rpe~~~~i~~~---~g 343 (343) +.+.+|+.+.....+...+++++.+..+++++...+. +| T Consensus 273 r~~~~~~a~~~~~~~~~~~r~d~~~~~~~aa~~~~~~~~~~~ 314 (314) T protein:vir:41 273 RIEPKRDAAMRRTEYIASLRADCNYEDENAAVAAVIDMSSGG 314 (314) T ss_pred EEeecccCcCCeEEEEEEEEeceEEEEcCcEEEEEeeccCCC Confidence 8888888877777888888899999888776665543 33 No 167 >protein:vir:9509 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835556;genbank:gi:30043951;genbank:GeneID:1260537 Probab=98.74 E-value=2.4e-09 Score=67.88 Aligned_cols=288 Identities=12% Similarity=0.052 Sum_probs=145.4 Q ss_pred CCCCCcccccc---------ccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEeccCc-cee Q lcl|NC_011085. 1 MADMKGGQQLG---------KDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVLGR-TRA 70 (343) Q Consensus 1 ~~~~~~~~~~~---------t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG~-~t~ 70 (343) ++.. .++... -..+. ...|. .|..+.+..++.+...+.|.++.++++.++. |+ .+|++... ..+ T Consensus 57 ~~~~-~~~~lt~~e~~~~~~~~~~~-~~~gg--~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~-~~-~~i~~~~~~~~a 130 (381) T protein:vir:95 57 SLPK-SAQSLSANQRSFFMDINKNV-NYKEE--KLLPEETIDRIFEDLTTNHPLLADLGIKNAG-LR-LKFLKSETSGVA 130 (381) T ss_pred Hhcc-CcccccHHHHHHHHHHhccc-CCCCc--eecCHHHHHHHHHHHHhhccceeheeeEecC-cc-eEEEEecCCcce Confidence 2111 111100 00011 11222 2456999999999999999999998887764 44 46666543 333 Q ss_pred eeecCCCcCCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccc Q lcl|NC_011085. 71 AYLQAGQSLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAA 150 (343) Q Consensus 71 ~~~~~g~~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~ 150 (343) .-...+..+..+ .+++..+++|..-++ +.-..|..-=-..+.+|+.+.+.++.++++++..|+.++.- ... .. 
T Consensus 131 ~w~~e~~~~~~~-~~~~f~~i~l~~~kl-~~~~~is~elL~Ds~~~ie~~i~~~la~~~a~~~~~a~i~G----~G~-~q 203 (381) T protein:vir:95 131 VWGKIYGEIKGQ-LDAAFSEETAIQNKL-TAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKG----TGK-DQ 203 (381) T ss_pred eeeccccccccc-ccccceeeeecceeE-EeechhhHHHhhcCHHHHHHHHHHHHHHHHHHHhhheeEec----cCC-CC Confidence 333333444332 134455555554433 33344443112225679999999999999999999887621 111 11 Q ss_pred ccccccccCCceeecccc------c--ccccchHHHHHHHHHHHHHHHHHHhhcCC-CcCCcEEEeCHHHHHHHhccchh Q lcl|NC_011085. 151 SNENIAGLGSASILEVGA------K--GDLTSPVELGKAVIAQLTIARAKLTSNYV-PSADRTFYTTPEVYSAILAALMP 221 (343) Q Consensus 151 ~~~~~~g~~~~~~~~~~~------~--~~~~~~~~~~~~i~~~l~~a~~~Ld~~~V-P~~gR~~vv~P~~~~~Ll~~~~~ 221 (343) +.+.......+.....+. . ....++...++.+.+.+............ +..+-+++++|..+..|+..... T Consensus 204 P~Gil~~~~~~~~~~~g~~~~~~~~~t~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~l~~~~~~ 283 (381) T protein:vir:95 204 PIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTH 283 (381) T ss_pred ceeeeeccCcccccccccccccccccccccccchhhHHHHHHHHHhhccccccccccccCceEEEEccccHHhhcccccc Confidence 111111111111111100 0 01112233333333333322222222222 34456778999988887643221 Q ss_pred hhhccccccchhcceeEEE--eceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheee Q lcl|NC_011085. 222 NAANYAALIDPERGSIRNV--MGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTV 299 (343) Q Consensus 222 ~~~~~~~~~~~~~G~V~~i--~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~ 299 (343) .+ .+|..... .|.+|++|+.+|.+.+ .-++|+.. ..+ T Consensus 284 ~~---------~~G~~v~~l~~g~~vv~s~~~p~~~i----------------------ifgDfs~Y----------~i~ 322 (381) T protein:vir:95 284 LN---------ANGVYVTALPFNLNVIESTVQEAGKV----------------------LTYVKGLY----------DGY 322 (381) T ss_pred CC---------CCCceeecCCCCceEEecCCCCcCcE----------------------EEEecccE----------EEE Confidence 11 13433233 3667899998884221 01233221 122 Q ss_pred eeeeeEEeeeeccchhh---hhhhhhhhhccceecccceEEEEecC--C Q lcl|NC_011085. 
300 KLKDLSLERARRAEYQA---DQIIARYAMGHGGLRPEAAGALVFTA--G 343 (343) Q Consensus 300 ~~~~~~~e~~~~~~~~~---d~i~~~~~~G~~v~rpe~~~~i~~~~--g 343 (343) ....++++.+.+. +|. ..+++.+++++++++|++.+++.++- + T Consensus 323 ~r~~~~i~~~~~~-~~~~d~~~f~a~~r~dg~~~~~~A~~v~~l~~~~~ 370 (381) T protein:vir:95 323 LAGGINVQKFKET-LALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGH 370 (381) T ss_pred EecccEEEeechh-HhhcCCeEEEEEEEEcCEEecCceEEEEEEEecCC Confidence 3344455544332 222 36888999999999999988866654 4 No 168 >protein:vir:101291 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908831;genbank:gi:118725095;genbank:GeneID:4555862 Probab=98.74 E-value=2.4e-09 Score=67.88 Aligned_cols=288 Identities=12% Similarity=0.052 Sum_probs=145.4 Q ss_pred CCCCCcccccc---------ccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEeccCc-cee Q lcl|NC_011085. 1 MADMKGGQQLG---------KDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVLGR-TRA 70 (343) Q Consensus 1 ~~~~~~~~~~~---------t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG~-~t~ 70 (343) ++.. .++... -..+. ...|. .|..+.+..++.+...+.|.++.++++.++. |+ .+|++... ..+ T Consensus 57 ~~~~-~~~~lt~~e~~~~~~~~~~~-~~~gg--~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~-~~-~~i~~~~~~~~a 130 (381) T protein:vir:10 57 SLPK-SAQSLSANQRSFFMDINKNV-NYKEE--KLLPEETIDRIFEDLTTNHPLLADLGIKNAG-LR-LKFLKSETSGVA 130 (381) T ss_pred Hhcc-CcccccHHHHHHHHHHhccc-CCCCc--eecCHHHHHHHHHHHHhhccceeheeeEecC-cc-eEEEEecCCcce Confidence 2111 111100 00011 11222 2456999999999999999999998887764 44 46666543 333 Q ss_pred eeecCCCcCCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccc Q lcl|NC_011085. 71 AYLQAGQSLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAA 150 (343) Q Consensus 71 ~~~~~g~~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~ 150 (343) .-...+..+..+ .+++..+++|..-++ +.-..|..-=-..+.+|+.+.+.++.++++++..|+.++.- ... .. 
T Consensus 131 ~w~~e~~~~~~~-~~~~f~~i~l~~~kl-~~~~~is~elL~Ds~~~ie~~i~~~la~~~a~~~~~a~i~G----~G~-~q 203 (381) T protein:vir:10 131 VWGKIYGEIKGQ-LDAAFSEETAIQNKL-TAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKG----TGK-DQ 203 (381) T ss_pred eeeccccccccc-ccccceeeeecceeE-EeechhhHHHhhcCHHHHHHHHHHHHHHHHHHHhhheeEec----cCC-CC Confidence 333333444332 134455555554433 33344443112225679999999999999999999887621 111 11 Q ss_pred ccccccccCCceeecccc------c--ccccchHHHHHHHHHHHHHHHHHHhhcCC-CcCCcEEEeCHHHHHHHhccchh Q lcl|NC_011085. 151 SNENIAGLGSASILEVGA------K--GDLTSPVELGKAVIAQLTIARAKLTSNYV-PSADRTFYTTPEVYSAILAALMP 221 (343) Q Consensus 151 ~~~~~~g~~~~~~~~~~~------~--~~~~~~~~~~~~i~~~l~~a~~~Ld~~~V-P~~gR~~vv~P~~~~~Ll~~~~~ 221 (343) +.+.......+.....+. . ....++...++.+.+.+............ +..+-+++++|..+..|+..... T Consensus 204 P~Gil~~~~~~~~~~~g~~~~~~~~~t~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~l~~~~~~ 283 (381) T protein:vir:10 204 PIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTH 283 (381) T ss_pred ceeeeeccCcccccccccccccccccccccccchhhHHHHHHHHHhhccccccccccccCceEEEEccccHHhhcccccc Confidence 111111111111111100 0 01112233333333333322222222222 34456778999988887643221 Q ss_pred hhhccccccchhcceeEEE--eceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheee Q lcl|NC_011085. 222 NAANYAALIDPERGSIRNV--MGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTV 299 (343) Q Consensus 222 ~~~~~~~~~~~~~G~V~~i--~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~ 299 (343) .+ .+|..... .|.+|++|+.+|.+.+ .-++|+.. ..+ T Consensus 284 ~~---------~~G~~v~~l~~g~~vv~s~~~p~~~i----------------------ifgDfs~Y----------~i~ 322 (381) T protein:vir:10 284 LN---------ANGVYVTALPFNLNVIESTVQEAGKV----------------------LTYVKGLY----------DGY 322 (381) T ss_pred CC---------CCCceeecCCCCceEEecCCCCcCcE----------------------EEEecccE----------EEE Confidence 11 13433233 3667899998884221 01233221 122 Q ss_pred eeeeeEEeeeeccchhh---hhhhhhhhhccceecccceEEEEecC--C Q lcl|NC_011085. 
300 KLKDLSLERARRAEYQA---DQIIARYAMGHGGLRPEAAGALVFTA--G 343 (343) Q Consensus 300 ~~~~~~~e~~~~~~~~~---d~i~~~~~~G~~v~rpe~~~~i~~~~--g 343 (343) ....++++.+.+. +|. ..+++.+++++++++|++.+++.++- + T Consensus 323 ~r~~~~i~~~~~~-~~~~d~~~f~a~~r~dg~~~~~~A~~v~~l~~~~~ 370 (381) T protein:vir:10 323 LAGGINVQKFKET-LALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGH 370 (381) T ss_pred EecccEEEeechh-HhhcCCeEEEEEEEEcCEEecCceEEEEEEEecCC Confidence 3344455544332 222 36888999999999999988866654 4 No 169 >protein:vir:106647 Length: 303 # NCBI annotation: ORF011 # Family: family:all:1178 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239493;genbank:gi:66395226;genbank:GeneID:4555801 Probab=98.73 E-value=2.3e-09 Score=67.89 Aligned_cols=277 Identities=11% Similarity=0.054 Sum_probs=147.6 Q ss_pred CCCCCccccccccccccc-cccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEeccC----cceeeeecC Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQ-SGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVLG----RTRAAYLQA 75 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~-~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG----~~t~~~~~~ 75 (343) |+ --.|+.+. .-. ..-+++ |++.|+..+.+.++ +++..|..++..|.+++++... ....++... T Consensus 1 M~---~e~nl~~~--~dL~~a~siD--F~~~f~~~i~~L~~----~LGv~r~~pla~Gt~iktyK~~~~~y~gda~dVaE 69 (303) T protein:vir:10 1 MS---AENNLINV--EALGKAKSID--FANKLGVGLNKLFE----ALAIQNKIPMNVGSALKQYRFKVEDSEKPNGDVAE 69 (303) T ss_pred CC---CCcCCcch--hhcccceeeh--hhhhhhhhHHHHHH----HhhhhccccccCCceeeeeeeeceeeccccccccC Confidence 43 33443211 122 344555 99999998876663 4555666666678888766542 122457788 Q ss_pred CCcCCCccCCCc-cceEEEEeeeeeeeeeeccchHHH-Hh--chhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccc Q lcl|NC_011085. 76 GQSLDDKRKDIK-HTEKTIVIDGLLTADVLIYDIEDA-MN--HYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAAS 151 (343) Q Consensus 76 g~~i~~~~~~~~-~~~~~l~iD~~~~~~~~Idd~D~~-q~--~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~ 151 (343) |+.|+.+.-..+ .+..++++.++. . .+. ||+ |. ..|...+.-+++..++++++|..++..+..+.-... 
T Consensus 70 Ge~Iplskvt~~~~~t~~~~~kK~r--K-~tT--dEAIqlsGyg~aVgetd~qL~~~Iq~kIdnd~~~~lktaT~t~~-- 142 (303) T protein:vir:10 70 GDVIPLTKVTREQVDITELQFAKYR--K-STS--AEAIQAHGYDLAINQTDNEMIKYVQKKFRAKFFETLKSAIENGK-- 142 (303) T ss_pred CcccchhhheeeecceEEEEeeccc--c-ccc--HHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhhcccccc-- Confidence 999987532111 234577776543 3 343 344 53 345999999999999999999999977654321100 Q ss_pred cccccccCCceeecccccccccchHHHHHHHHHHHHHHHHH---HhhcCCCcCCcEEEeCHHHHHHHhccchhhhh-ccc Q lcl|NC_011085. 152 NENIAGLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAK---LTSNYVPSADRTFYTTPEVYSAILAALMPNAA-NYA 227 (343) Q Consensus 152 ~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~---Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~-~~~ 227 (343) .+..++. + ++.+-.+|-....+ ++|.++ .-+++|+|.-.+.||.+-..... .-- T Consensus 143 --------------~t~~t~~-s----~~glq~Al~~~~~kl~~~~ed~~---~~V~FvNP~Daa~yl~~A~i~~~~t~f 200 (303) T protein:vir:10 143 --------------RTNKTKL-S----AENLQGALSKGRANLSVLLDDEI---TPIAFVNPNDTAEYLANGFINSTGAQF 200 (303) T ss_pred --------------cccceee-c----HHHHHHHHHhhhhhccccccccc---cEEEEEchHHHHHHhhcCCcchhhhhh Confidence 0000000 1 12222222222222 234332 24888999999999987665532 112 Q ss_pred cccchhcceeEEEeceEEEEeccccccccccccccccccccccccccccc------cccccccceEeEeechhhheeeee Q lcl|NC_011085. 228 ALIDPERGSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEG------DTKVALDNVVGLFQHRSAVGTVKL 301 (343) Q Consensus 228 ~~~~~~~G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~l~~~~~Av~~~~~ 301 (343) |..-+. ++.|+.|+.|+.+|.+..-..+... -..+..+. ++ ++..|-+..+|+.-++ . . T Consensus 201 G~n~L~-----nfLG~~II~S~kv~~G~~~~T~~~N---i~~ay~~~-~g~l~~~f~~t~D~tglIGv~h~~-~-----~ 265 (303) T protein:vir:10 201 GVNLLT-----PYVGVKIVEFADVPQGEVWMTVAEN---LNVAYANP-RGELSRAFAFATDATGFVGVLHDI-Q-----P 265 (303) T ss_pred hhhhhh-----hhhcceEEEeccCCCceEEEeeccc---eEEEEecC-chhhhhhhhhccccccceEEEecc-c-----c Confidence 333333 4999999999999987544333221 11111111 11 2333333444422111 0 0 Q ss_pred eeeEEeeeeccchhhhhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 
302 KDLSLERARRAEYQADQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 302 ~~~~~e~~~~~~~~~d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) +.++. ..-...+... =+=|+|+++..+++.| T Consensus 266 ~~~t~--------eT~~~~~~~l---fpE~~dgiv~~ti~~~ 296 (303) T protein:vir:10 266 QRLTS--------DTIYASAISM---FPENIDAVIKVTIKKD 296 (303) T ss_pred ceeee--------hhHhHhHHHh---cccccceEEEEEEecc Confidence 01111 1111122221 2447789999999888 No 170 >protein:vir:98635 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039923;genbank:gi:126011098;genbank:GeneID:4818471 Probab=98.71 E-value=3.1e-09 Score=67.23 Aligned_cols=284 Identities=12% Similarity=0.059 Sum_probs=139.9 Q ss_pred CCCCCcc----------ccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEecc-Ccce Q lcl|NC_011085. 1 MADMKGG----------QQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVL-GRTR 69 (343) Q Consensus 1 ~~~~~~~----------~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~i-G~~t 69 (343) +++.... +.. ... .+...+. -+..+.+..++.+...+.|.++.++++.++. |+ +++++- +..+ T Consensus 59 ~~~~~~~~lt~ee~~~~~~~-~~~-~~~~~gg--~~vP~~~~~~I~~~l~~~s~i~~~~~v~~~~-~~-~~~~~~~~~~~ 132 (377) T protein:vir:98 59 DLRDKNRELTAEEIKFFNDI-DKN-VGGKDKF--KLLPEETMVQVFDDLVAEHPLLKVINFKNTS-LR-LKALTAETSGT 132 (377) T ss_pred HhccCCcccCHHHHHHHHHH-Hhc-cCCCCCc--cccCHHHHHHHHHHHHHhhhhhhheeeEecC-cc-eEEEEecCCcc Confidence 1111000 000 011 1112222 2455889999999999999999998887764 44 466653 4444 Q ss_pred eeeecCCCcCCCccCCCccceEEEEeeeeeeee-eeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcc Q lcl|NC_011085. 70 AAYLQAGQSLDDKRKDIKHTEKTIVIDGLLTAD-VLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMP 148 (343) Q Consensus 70 ~~~~~~g~~i~~~~~~~~~~~~~l~iD~~~~~~-~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~ 148 (343) +.-...+..+..+ .+++..++ ++...++.. ..|..-=-..+.+|+-+.+.++.++++++..|+.++.- ... 
T Consensus 133 a~w~~e~~~~~~~-~~~~f~~i--~l~~~kl~a~~~is~elL~ds~~~ie~~i~~~la~~~a~~~~~a~i~G----~G~- 204 (377) T protein:vir:98 133 AVWGDIFGEIKGQ-LKQAFKEQ--DFSQFKLTAFVVIPKDALKFGPKWIKQFITEQLKEAIAVALELAIVKG----DGL- 204 (377) T ss_pred eeEeecccccCcc-cCccceeE--eecceeEEeeecccHHhhhccHhHHHHHHHHHHHHHHHHHHhhceEec----cCC- Confidence 4444444444332 12344444 445555444 34432112235678999999999999999999988621 111 Q ss_pred ccccccccccCCceeeccccccccc----chHHH--------------HHHHHHHHHHHHHHHhhcCCCcCCcEEE-eCH Q lcl|NC_011085. 149 AASNENIAGLGSASILEVGAKGDLT----SPVEL--------------GKAVIAQLTIARAKLTSNYVPSADRTFY-TTP 209 (343) Q Consensus 149 ~~~~~~~~g~~~~~~~~~~~~~~~~----~~~~~--------------~~~i~~~l~~a~~~Ld~~~VP~~gR~~v-v~P 209 (343) ..+.+.......+. +......... +.... +..++..+. ...+++.. -..||++. ++| T Consensus 205 ~qP~Gil~~~~~~~-~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~a~~~m~~~t--~~~~~klk-d~~G~~i~~~n~ 280 (377) T protein:vir:98 205 LQPVGLLKDLSQPT-VDQSTGRDITTYKTDKEAIADLSDLTPDNAPKKLVPVMKHLS--VNDKKRPL-KIAGQVKLILNP 280 (377) T ss_pred Ccceeeeecccccc-cccccccccccccchhhhHhhhhhhchhHHHHHHHHHHHHHH--HHHHhhhh-ccCCceEEEecc Confidence 11111111111111 1011111100 00001 011111110 11111111 12466544 677 Q ss_pred HHHHHHhccchhhhhccccccchhcceeEEEeceE--EEEeccccccccccccccccccccccccccccccccccccceE Q lcl|NC_011085. 210 EVYSAILAALMPNAANYAALIDPERGSIRNVMGFE--VVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVV 287 (343) Q Consensus 210 ~~~~~Ll~~~~~~~~~~~~~~~~~~G~V~~i~Gf~--V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 287 (343) .-|+.++.... ....+|.-.+++|++ |++|+.+|...+. -++|+. T Consensus 281 ~~~~~~~p~~~---------~~~~~G~~~t~lg~p~~vv~s~~~p~~~i~----------------------fgdf~~-- 327 (377) T protein:vir:98 281 EDRWALEAQFT---------SRNQFGEYVTVLPHGITILESLAVETGKAI----------------------AFVANR-- 327 (377) T ss_pred cchhhcccccc---------ccCCCCccccccCCCceEEecCCCCcccEE----------------------EEEecc-- Confidence 66655542211 111345445666654 7788888843210 012222 Q ss_pred eEeechhhheeeeeeeeEEeeeeccch--hhhhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 
288 GLFQHRSAVGTVKLKDLSLERARRAEY--QADQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 288 ~l~~~~~Av~~~~~~~~~~e~~~~~~~--~~d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) -..+....++++.+.+... -...+++.+++++++++|++.++|.++-| T Consensus 328 --------Y~i~~r~~~~i~~~~~~~~~~d~~~f~~~~r~dg~~~~~~a~~vl~i~~~ 377 (377) T protein:vir:98 328 --------YDAFMATASTIEEYDQTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) T ss_pred --------eeEEeecceEEEeechhhhhcCceEEEEEEEEcCEEeccCcEEEEEEecC Confidence 1223344455655433322 12458888999999999999999999999 No 171 >protein:vir:4159 Length: 315 # NCBI annotation: structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046968;genbank:gi:9630538;genbank:GeneID:1261712 Probab=98.69 E-value=5.3e-09 Score=65.94 Aligned_cols=298 Identities=15% Similarity=0.101 Sum_probs=155.7 Q ss_pred CCCCCccccccccccccccccchhHHHH--HHHHHHHHHHHHHhhhhccCccccccccceEEEEeccCcc--eeeeecCC Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQSGGDKLALFL--KVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVLGRT--RAAYLQAG 76 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~i--e~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG~~--t~~~~~~g 76 (343) |.+..- .+.-+.....|...-|+ ++++ ++.+..++.|.++.+.++.+..++.+..|+.+|.. ....++.+ T Consensus 7 ~~~~~~-----~~~~k~~t~~d~~Gg~l~P~~~~-~~i~~~~e~s~~l~~~~vi~~~~~~~~~i~~~g~~~~~~~g~~~~ 80 (315) T protein:vir:41 7 IRGGKP-----FEIVPKIDVPDLGRGVLSVDRFG-EFVKAVRDSAVIIPEARIDNALKSYEKDISRLSLVLDVGPGRDET 80 (315) T ss_pred hhcCCh-----hhhhhhcCCcCCCCceechHHHH-HHHHHHHhhhhhhhhceeeeccccccccccccccCcccccccccc Confidence 221111 11111111122222233 6665 46677888899999888755445566667766532 22222221 Q ss_pred CcC-CCccCCCccceEEEEeeeeeeeeeeccc--hHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccc Q lcl|NC_011085. 77 QSL-DDKRKDIKHTEKTIVIDGLLTADVLIYD--IEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNE 153 (343) Q Consensus 77 ~~i-~~~~~~~~~~~~~l~iD~~~~~~~~Idd--~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~ 153 (343) +.- ..+...++..+.+|..-+.. 
+...|.+ +|+..-..|+.+.++.+.++++++..+..++.- ..+ ...|.-. T Consensus 81 ~~~~~~~~~~~~f~~~~l~~~~l~-~~~~it~elL~D~~~~~~~e~~l~~~~a~~~a~~~~~~~~nG--dg~-s~~p~~~ 156 (315) T protein:vir:41 81 GQKLAPPESTAEVKTNTLYMREMV-TKVVIHEDAIEDNIEGKAFEQKIVTLLGEGISYVLEKYYLHG--DTS-SSDPLLR 156 (315) T ss_pred cCcCCCCCCccccceeeeceeeee-eeccccHHHHHhhhccccHHHHHHHHHHHHHHHHHHHHhhcc--CCc-CcCcccc Confidence 111 11112245555555554442 3344522 222211358999999999999999988776531 011 1111111 Q ss_pred cccccCC---ceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCc-CCcEEEeCHHHHHHHhccchhhhhccccc Q lcl|NC_011085. 154 NIAGLGS---ASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPS-ADRTFYTTPEVYSAILAALMPNAANYAAL 229 (343) Q Consensus 154 ~~~g~~~---~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~-~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~ 229 (343) .+.|+-. +.+.... .+..+. ....+.|.++...|..+.--. .+-..++++..+..|.+-. -.+..|.++ T Consensus 157 ~~~G~l~~a~~~~~~~~--~~~~a~----~~~~d~l~~l~~sl~~~yr~~~~~~~~imn~~t~~~~rklk-~~~g~~lw~ 229 (315) T protein:vir:41 157 MSDGWLKLASEKLTESD--VDPEAE----DWPMNLFDTMIESLPTPYRNNLPNMKFYVTWDIYRAYRDAL-KGRETGLGD 229 (315) T ss_pred ccccceecccccccccc--cccccc----cccHHHHHHHHHhcChHHhhcCCceEEEEcHHHHHHHHHHh-ccCCCcccc Confidence 2233211 1111000 000111 111233333344444432211 2335688888888775421 223456777 Q ss_pred cchhcceeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEeee Q lcl|NC_011085. 230 IDPERGSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERA 309 (343) Q Consensus 230 ~~~~~G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~ 309 (343) ..+..|....+.|.+|+.++++|..+..... 
.++.+++-+..+....++.+.+ T Consensus 230 ~~~~~g~~~tl~G~PV~~~~~m~~~~~~~~~---------------------------ilf~d~~nl~~~~~~~i~i~~~ 282 (315) T protein:vir:41 230 QALTGANSILYDGRPVQYVPALEALNDGKSR---------------------------ALFVVPTQLVYGFWRNIKVVPD 282 (315) T ss_pred chhhcCCCceecccceEecccccccCCCCcc---------------------------EEEecccceEEEeccccEEEee Confidence 7888899899999999999999854322111 1333444344455566788888 Q ss_pred eccchhhhhhhhhhhhccceecccceEEEEecC Q lcl|NC_011085. 310 RRAEYQADQIIARYAMGHGGLRPEAAGALVFTA 342 (343) Q Consensus 310 ~~~~~~~d~i~~~~~~G~~v~rpe~~~~i~~~~ 342 (343) |+.......+...++.|..+..++++++-.++- T Consensus 283 ~~a~~~~~~~~~~~r~d~~~~~~~~~a~~~~~v 315 (315) T protein:vir:41 283 YDAEMRLTKYVASLRTDNHYEDEEGAVSATITV 315 (315) T ss_pred ecCCCCceEEEEEEEeceeEEeccceeEeeeeC Confidence 887776677777888899888788866655555 No 172 >protein:vir:78350 Length: 383 # NCBI annotation: Cps # Family: family:all:635 # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468644;genbank:gi:157325222;genbank:GeneID:5601696 Probab=98.66 E-value=6.5e-09 Score=65.48 Aligned_cols=285 Identities=12% Similarity=0.075 Sum_probs=137.4 Q ss_pred CCCCCccc-----------cccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEeccCcce Q lcl|NC_011085. 1 MADMKGGQ-----------QLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVLGRTR 69 (343) Q Consensus 1 ~~~~~~~~-----------~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG~~t 69 (343) +.. .+.. ...+. ...++. .|..+.|..++.+...+.|.++.++++.++ +|+ .+||+..... T Consensus 64 ~~~-~g~~~lt~~e~~~~~~~~~~---~~~~gg--~lvP~~~~~~I~~~l~~~s~l~~~~~v~~~-~~~-~~i~~~~~~~ 135 (383) T protein:vir:78 64 SAS-RTDKNITNEEIKFFNDINKE---VGYKEE--TLLPQTVVDEIFEDLTTEHPFLASIGMRTT-GLR-TKFLKSETSG 135 (383) T ss_pred Hhc-CChhhhhHHHHHHHHHHhcc---CCCCCc--cccCHHHHHHHHHHHHhhccceeeeeeEec-CCc-eEEEEEcCCc Confidence 100 0000 00011 111221 245599999999999999999999988776 455 4777765544 Q ss_pred eeee-cCCCcCCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcc Q lcl|NC_011085. 
70 AAYL-QAGQSLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMP 148 (343) Q Consensus 70 ~~~~-~~g~~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~ 148 (343) .... ..+..+..+ .+++..+++|..-++ +.-+.|..-=-..+.+|+.+.+.++.++++++..|+.++.- .. . T Consensus 136 ~a~w~~e~~~~~~~-~~~~f~~i~l~~~kl-~~~i~is~ell~Ds~~~ie~~i~~~l~~~~a~~~~~a~i~G----~G-~ 208 (383) T protein:vir:78 136 VAVWGKIFGEIKGQ-LDATFSDEESIQNKL-TAFVVVPKDLEKFGPAWVKRFVVTQIEEAFAVALESAYIVG----DG-N 208 (383) T ss_pred ceEEeecccccccc-cCcceeeEeecceee-EeeccchHHHhhccHHHHHHHHHHHHHHHHHHHHhhheEec----cC-C Confidence 3333 333334322 234556666665433 34445543212235688999999999999999999988621 11 1 Q ss_pred ccccccccccCCceeeccccccccc--------chHHHHHHHHHHHHHHHHHHhhcCC-CcCC-cEEEeCHHHHHHHhcc Q lcl|NC_011085. 149 AASNENIAGLGSASILEVGAKGDLT--------SPVELGKAVIAQLTIARAKLTSNYV-PSAD-RTFYTTPEVYSAILAA 218 (343) Q Consensus 149 ~~~~~~~~g~~~~~~~~~~~~~~~~--------~~~~~~~~i~~~l~~a~~~Ld~~~V-P~~g-R~~vv~P~~~~~Ll~~ 218 (343) ..+.+.+...........+...+.+ +...... ++..+.+....+....- ...+ ...+++|.-|+.++.. T Consensus 209 ~qP~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~l~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~ 287 (383) T protein:vir:78 209 DKPIGLNRKVGKGSTVVDGVYAEKAATGTLTFANPKTTVN-ELTDVYKYHSVKENGHPLNVAGKVTLLVNPTDAWDVKKQ 287 (383) T ss_pred CCceeeeeccCCcccccccccccccccchhhhhhhHHHHH-HHHHHHhccchhcccchhhhcCceEEEEcCcchhhhccc Confidence 1111111111111111111000000 1111111 11111111111111111 0111 2345666555444322 Q ss_pred chhhhhccccccchhcceeEEEece--EEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhh Q lcl|NC_011085. 219 LMPNAANYAALIDPERGSIRNVMGF--EVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAV 296 (343) Q Consensus 219 ~~~~~~~~~~~~~~~~G~V~~i~Gf--~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av 296 (343) .... -.+|...++.|+ +|++|+++|...+. -++|+. . 
T Consensus 288 ~~~~---------~~~G~~~t~l~~~~~iv~s~~~p~~~ii----------------------fgdfs~----------Y 326 (383) T protein:vir:78 288 YTSL---------NANGVYVTALPFNLNIIESLFVPEKKAI----------------------SYVAER----------Y 326 (383) T ss_pred hhcc---------CCCCceeeecCCCceEEecCCCCcccEE----------------------Eeeccc----------e Confidence 1111 124554456554 58888888843210 012222 1 Q ss_pred eeeeeeeeEEeeeeccchhh---hhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 297 GTVKLKDLSLERARRAEYQA---DQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 297 ~~~~~~~~~~e~~~~~~~~~---d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) ..+....++++.+.+ .+|. ..+++.+++++++++|++.+++.++-- T Consensus 327 ~i~~r~~~~i~~~~~-~~f~~d~~~f~~~~r~dG~~~~~~A~~vl~~~~~ 375 (383) T protein:vir:78 327 DALIGGPLDIGTYDQ-TLAIEDLNLYAAKQFAYGKAKDDKAAAVWTLNIN 375 (383) T ss_pred EEEecccceEEecch-hhhhcCceEEEEEEEEcCEEecCCeEEEEEEEec Confidence 223344556665433 3333 468899999999999999888655432 No 173 >protein:vir:95963 Length: 395 # NCBI annotation: ORF009 # Family: family:all:635 # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239802;genbank:gi:66395459;genbank:GeneID:5132880 Probab=98.63 E-value=5.3e-09 Score=65.96 Aligned_cols=291 Identities=12% Similarity=0.091 Sum_probs=140.5 Q ss_pred CCCCCccccc---------cccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEeccCcce-e Q lcl|NC_011085. 1 MADMKGGQQL---------GKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVLGRTR-A 70 (343) Q Consensus 1 ~~~~~~~~~~---------~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG~~t-~ 70 (343) +....+.+.. .-+.+.+...| .|..+.+..++.+..++.|.++.++++.++. |+ +.|+...... 
+ T Consensus 66 ~~~~r~~~~l~~ee~~~~~~~~~~t~~~gG---~liP~~~~~~Ii~~l~~~s~i~~~~~v~~~~-~~-~~i~~~~~~~~a 140 (395) T protein:vir:95 66 ILAKRSQDPLTSEERKFFNDINYDVGYTDE---KILPETVVERVFDDLQKDHPLLSKINFQNAG-IK-TRVIKADPAGQA 140 (395) T ss_pred HHhhcCccccchHHHHHHHHHhhccCCCCc---eeccHHHHHHHHHHHHhhhhhhhhceeEecC-Cc-eEEEEecCCcce Confidence 0000000000 00111111111 1355899999999999999999999887763 43 5677654433 3 Q ss_pred eeecCCCcCCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccc Q lcl|NC_011085. 71 AYLQAGQSLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAA 150 (343) Q Consensus 71 ~~~~~g~~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~ 150 (343) .-...+..+..+ .+++.+++++..-+. +.-+.|.+-=-..+.+|+-+.+.++.++++++..|+.++.- .++.... T Consensus 141 ~w~~e~~~~~~~-~~~~f~~i~l~~~kl-~~~~~iS~ell~ds~~~ie~~i~~~la~~ia~~~~~a~i~G--~G~~~~q- 215 (395) T protein:vir:95 141 VWGKVFGEIKGQ-LDAAFREENFTQYKL-TCFVVLPDDLSTFGPAWIERFVRTQIQEAISVALESAIING--GGAAKTQ- 215 (395) T ss_pred EEeecccccCcc-ccccceeeeeceeeE-EEeecccHHHHhcchhHHHHHHHHHHHHHHHHHHhhheeec--cCCCCcC- Confidence 322222333322 234556666655333 33344543222235688999999999999999999988621 0100000 Q ss_pred ccccccccCCcee-ecccccccccchHHHHHHHHHHHHHHHHHHhh----cC-CCcCCcEEEeCHHHHHHHhccchhhhh Q lcl|NC_011085. 151 SNENIAGLGSASI-LEVGAKGDLTSPVELGKAVIAQLTIARAKLTS----NY-VPSADRTFYTTPEVYSAILAALMPNAA 224 (343) Q Consensus 151 ~~~~~~g~~~~~~-~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~----~~-VP~~gR~~vv~P~~~~~Ll~~~~~~~~ 224 (343) +.+........+. ...+..+...+... ....++.+..+...+.- .. ........+++|..+..+..... 
T Consensus 216 P~Gil~~~~~~~~~~~~~~~~~~~t~~~-~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~mn~~t~~~~~g~~~---- 290 (395) T protein:vir:95 216 PVGLMKDVNTNSGAVTDKASSGTLTFAD-ADTTILELNDVLKNLSVDEKGKELKIDGKVALVVNPRDSWDVQARYT---- 290 (395) T ss_pred ceeeeecccccccccccccccchhhhhh-hHhhHHHHHHHHHhhccccccchhhhcCceEEEEcchhhhhcCCcce---- Confidence 0111100000000 00001111111110 11122333222222211 11 11123355788877665432111 Q ss_pred ccccccchhcceeEEEe--ceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeee Q lcl|NC_011085. 225 NYAALIDPERGSIRNVM--GFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLK 302 (343) Q Consensus 225 ~~~~~~~~~~G~V~~i~--Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~ 302 (343) +. ...|...++. |.+|++++.+|...+ . -++|+.. ..+... T Consensus 291 -~~----~~~G~~~~~lg~g~~v~~~~~~p~~~i-------------~---------fgdfs~y----------~i~~r~ 333 (395) T protein:vir:95 291 -YL----TANGGFVTVLPYNVTIITSEFVPEGKL-------------V---------AFVTDRY----------NAVRGG 333 (395) T ss_pred -ec----cCCCcceeccCCcceEEEcCCCCCCcE-------------E---------EEecccE----------EEEEec Confidence 11 1245556665 556899999984211 0 1233221 112234 Q ss_pred eeEEeeeeccch--hhhhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 303 DLSLERARRAEY--QADQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 303 ~~~~e~~~~~~~--~~d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) .++++.+.+... 
-...+++..++|+++++|++.++|.++-- T Consensus 334 ~~~i~~~~~~~~~~d~~~f~~~~r~dg~~~~~~A~~~l~i~~~ 376 (395) T protein:vir:95 334 GLTVKKFDQTLALEDAVLFTAKTFAYGQPDDNKASAVYDLKVA 376 (395) T ss_pred ceEEEeccchhhhCCcEEEEEEEEECCEEeccccEEEEEeecc Confidence 445554433211 12447888899999999999999998855 No 174 >protein:vir:3158 Length: 321 # NCBI annotation: capsid protein gpE # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665929;genbank:gi:22091115;genbank:GeneID:951342 Probab=98.62 E-value=4.4e-09 Score=66.40 Aligned_cols=294 Identities=11% Similarity=0.020 Sum_probs=149.2 Q ss_pred CCCCCcccccccccc--ccccccchhHHH--HHHHHHHHHHHHHHhhhhccCccccccccceEEEEeccCcceeeeec-- Q lcl|NC_011085. 1 MADMKGGQQLGKDQG--KGQSGGDKLALF--LKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVLGRTRAAYLQ-- 74 (343) Q Consensus 1 ~~~~~~~~~~~t~~g--~~~~~~d~~al~--ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG~~t~~~~~-- 74 (343) ||.-.--+.. .+.- .....+|...=| ...+..++.+..++.|.++.+.++..+.+ .+.+|+.+|-......+ T Consensus 1 ~~~k~~~~~l-~~~~~~~~~~~~~~~~g~~v~~~~~~~l~~~i~e~s~~l~~i~v~~v~~-~~~~i~~~~~~~~~~~~~~ 78 (321) T protein:vir:31 1 MASRTINNDL-SRITEKNALTVDDLDAGGTLPDPLWDEFWTDMIEETPLLDAIRTETVGA-KKTRIPTLNIGERHRRPQD 78 (321) T ss_pred CchHHHHHHH-HHHHHhccccccccCCcceeCHHHHHHHHHHHHHhhhhhhhceeeeccC-cceeeeeeccCCccccccc Confidence 5544333321 1111 111222222212 36778888888888899998888877643 33556665432111111 Q ss_pred CCCcCCCccCCCccceEEEEeeeeeeeeeeccc--hHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccc Q lcl|NC_011085. 75 AGQSLDDKRKDIKHTEKTIVIDGLLTADVLIYD--IEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASN 152 (343) Q Consensus 75 ~g~~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd--~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~ 152 (343) .++.... ..+++.+++++..-+.. +...|.+ +|+.....|+.+.+....++++++..++..+.- ...+.++. 
T Consensus 79 e~~~~~~-~~~~~~~~~~~~~~k~~-~~~~it~e~L~d~a~~~d~e~~i~~~ia~~~a~~~~~~~~nG----d~~~~~~~ 152 (321) T protein:vir:31 79 EGEWNEN-ESDVSTGTIDISTEKAT-VAWDLPREVVQENPEGEALADRILNLMTDAWSADVEDLAANG----DEDAEDSF 152 (321) T ss_pred ccccccc-cccceeeeeeeeeEEEE-eehhccHHHHHhhhcchhHHHHHHHHHHHHHHHHHHhheeec----cccCCCcc Confidence 2221111 12234555566554443 3334432 222222468999999999999999988866521 11111111 Q ss_pred c-cccccCC---ceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhcccc Q lcl|NC_011085. 153 E-NIAGLGS---ASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAA 228 (343) Q Consensus 153 ~-~~~g~~~---~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~ 228 (343) . ...|+-. ........+++.. -++.|.++...|+++.--..+-+.+++++.+..++.-.. ......+ T Consensus 153 ~~~n~G~l~~a~~~~~~~~~~~~~~--------~~d~l~~l~~~l~~~yr~~~~~v~im~~~~~~~~~~~l~-~~~~~~~ 223 (321) T protein:vir:31 153 ENQNDGFITVAEGDVETIDAADDIL--------DNDLVIRTIAGLDSKYRARMNPALIVSEDQLLSYHYTLT-DRDTPLG 223 (321) T ss_pred cccchhhhhhhcccccccccccccc--------CHHHHHHHHHhccHhHhcCCCeEEEechHHHHHHHHHHh-cCCCccc Confidence 0 1122211 0010011111111 134455555666665432223467899988766543111 1112334 Q ss_pred ccchhcceeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEee Q lcl|NC_011085. 229 LIDPERGSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLER 308 (343) Q Consensus 229 ~~~~~~G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~ 308 (343) ...+..|...+++|++|+.++.+|...+ ++.+.+-+.....++++.++ T Consensus 224 ~~~l~~~~~~tl~G~pvv~~~~mP~~~i--------------------------------l~t~~~nl~~~~~~~~~~~~ 271 (321) T protein:vir:31 224 DNVIMGEADVNPFSFPIIGSGLWPDDKA--------------------------------MFTDPQNLIYALYRDLEIDV 271 (321) T ss_pred cchhhccccccccceeEEEcCCCCCCcE--------------------------------EEeccccEEEEEeeccEEEE Confidence 5557777778899999999999995321 33333433334445556666 Q ss_pred eeccchh---hhhhhhhhh--hccceecccceEEEE-ecCC Q lcl|NC_011085. 
309 ARRAEYQ---ADQIIARYA--MGHGGLRPEAAGALV-FTAG 343 (343) Q Consensus 309 ~~~~~~~---~d~i~~~~~--~G~~v~rpe~~~~i~-~~~g 343 (343) .++.+.. .+.+...++ ++..+-++++++.++ ++.. T Consensus 272 ~~~~~~~~~~~~~~~~~~~~~~~~~ve~~~a~a~~~~i~~~ 312 (321) T protein:vir:31 272 LTESDKVSERDLHARYFMRGDDDFAIENTEAVVLAEGLGDP 312 (321) T ss_pred eecCccccccceeeEeeeeeecceeEeccccEEEEecCCcc Confidence 6654332 233443332 577778888888877 3443 No 175 >protein:vir:95875 Length: 401 # NCBI annotation: major coat protein # Family: family:all:10944 # MgeID: mge:1586 # MgeName: N4 # Cross-refs: genbank:acc:YP_950534;genbank:gi:119952248;genbank:GeneID:5075702 Probab=98.61 E-value=3.4e-08 Score=61.52 Aligned_cols=321 Identities=12% Similarity=0.082 Sum_probs=158.4 Q ss_pred CCCCCccccc--cccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCcccccc--ccceEEEEeccCccee--eeec Q lcl|NC_011085. 1 MADMKGGQQL--GKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSI--SSGKSAQFPVLGRTRA--AYLQ 74 (343) Q Consensus 1 ~~~~~~~~~~--~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i--~~G~tv~i~~iG~~t~--~~~~ 74 (343) |-+-+...+. .|-+|.. + ..+...-|-..++..-++..++..+-..+++ .+|+|+++.+--...- .-.+ T Consensus 1 ~~~~~a~~~~~~~s~~g~~---~--~~~~t~y~~~k~L~~Aa~~lv~~~fA~~~piPkn~GkTIk~r~y~pl~~~~~pl~ 75 (401) T protein:vir:95 1 MLNYNAPTDGQKSSIDGAN---S--DQMQTFFWLKKAIITARKEQYFMPLASVTNMPKHYGKTIKVYEYVPLLDDRNIND 75 (401) T ss_pred CCccCCCcccccccccccc---c--ceeeehhhHHHHHhhhhhhhhhhhcccccccccccCCeEEEEecccccccccchh Confidence 5444332221 1111111 1 1233334566665555555677777777766 3699999887543211 1112 Q ss_pred CCCcCCCc----------cCC----------------------CccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHH Q lcl|NC_011085. 75 AGQSLDDK----------RKD----------------------IKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYT 122 (343) Q Consensus 75 ~g~~i~~~----------~~~----------------------~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~ 122 (343) .|.+..+. 
..+ .+-.++...|-|+-.|..+=|.++..-..-.+...++ T Consensus 76 eGv~a~G~~~~~g~~y~~~rdv~~it~~m~~~t~~~~rvn~v~~~~~d~~g~l~qyG~~~e~Td~~~dt~~D~~l~~h~s 155 (401) T protein:vir:95 76 QGIDASGATIVNGNLYGSSKDIGNITSKLPLLTENGGRVNRVGFTRIAREGSIHKFGFFYEFTQESIDFDSDDGLMEHLS 155 (401) T ss_pred cCCCcccccccCccccccccccceeecccccccccccccccccceeeeeeeeeeeccCccchhhhhhhhhcchHHHHHHH Confidence 23322221 001 1112233445566555443344443333344555444 Q ss_pred HHHHHHH-HHHHHHHHHHHHHhhhhccccccccccccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCc- Q lcl|NC_011085. 123 SQIGESL-AMAADGAVLAELAGLCNMPAASNENIAGLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPS- 200 (343) Q Consensus 123 ~~~~~aL-a~~~D~~i~~~~~~~a~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~- 200 (343) .+.-..= .+..|. +.+.+..++... .-+|...+ ....+..+..++ ...++.|.++...|+++..|. T Consensus 156 ~ell~g~~~~t~d~-i~~dll~ag~~v-----iyAg~ats-~At~~~~~~~~t-----~vt~~~l~rl~~~L~~nRapk~ 223 (401) T protein:vir:95 156 RELMNGATQITEAV-LQKDLLAAAGTV-----LYAGAATS-DATITGEGSTPS-----VVSYKNLMRLDQILTENRTPTQ 223 (401) T ss_pred HHHhhhhhhhHHHH-HHHHHHhhcCee-----ecCCccce-eeeccccccccc-----eechhHHHHHHHHHHhcccccc Confidence 4433322 223333 333333222000 00100000 000111111111 122677888899999877776 Q ss_pred ----------------CCcEEEeCH------HHHHHHhccchhhhh-ccccccchhcceeEEEeceEEEEecccc-cccc Q lcl|NC_011085. 201 ----------------ADRTFYTTP------EVYSAILAALMPNAA-NYAALIDPERGSIRNVMGFEVVEVPHLT-AGGA 256 (343) Q Consensus 201 ----------------~gR~~vv~P------~~~~~Ll~~~~~~~~-~~~~~~~~~~G~V~~i~Gf~V~~sn~lp-~~~~ 256 (343) .-|++++.| +...+|+.++.|+.. .|...+.+.+|.||++.+|++++++.+- +... 
T Consensus 224 t~~i~~s~~~dTk~i~~s~va~~h~~L~~di~a~~D~~~~~~fi~v~kYa~~~~i~~gEiG~i~~vR~i~~p~~~~w~~a 303 (401) T protein:vir:95 224 TTIITGSRMIDTKVIGATRVMYVGSELVPELKAMKDLFGNKAFIETQHYADAGTIMNGEVGSIDKFRIIQVPEMLHWAGA 303 (401) T ss_pred hhhhhhhhccCccccccceEEEEecCchhHHHHHHHhcCCCCceehhhcCCccccccccccccCceeEEecccceeecCC Confidence 126888888 455777888889975 5888889999999999999999988743 3332 Q ss_pred ccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeE--------E----e---eeeccchhhhhhhh Q lcl|NC_011085. 257 GDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLS--------L----E---RARRAEYQADQIIA 321 (343) Q Consensus 257 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~--------~----e---~~~~~~~~~d~i~~ 321 (343) +..... ....-........++|+. .-.|++-++|-++...+-.. + + ..-||.-+.=.+.= T Consensus 304 g~~a~~-~~~~y~~~~~~~gg~~dV----yp~lV~G~dAf~~~~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQ~g~vgw 378 (401) T protein:vir:95 304 GAQATG-ANPGYRTSMVSGQEHYDV----YPMLVVGDDSFTSIGFQTDGKSLKFTVMTKMPGKETADRNDPYGETGFSSI 378 (401) T ss_pred cccccc-cccccccccccCCCccee----eeeeEEccccceecccccCCccccceeEeecCCcCCCCCCCcccceehhhh Confidence 211110 000000000011122221 11244445554443322110 0 0 11355556656777 Q ss_pred hhhhccceecccceEEEEecCC Q lcl|NC_011085. 322 RYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 322 ~~~~G~~v~rpe~~~~i~~~~g 343 (343) ++.|++.++||+..+.|+..+= T Consensus 379 K~~~a~~vL~~e~m~~ies~a~ 400 (401) T protein:vir:95 379 KWYYGILVKRPERLALIKTVAP 400 (401) T ss_pred hhhhhhheeccceeEEEEeecC Confidence 7888999999999999999988 No 176 >protein:vir:80128 Length: 466 # NCBI annotation: Phage capsid protein # Family: family:all:635 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425603;genbank:gi:155042936;genbank:GeneID:5469556 Probab=98.55 E-value=4.8e-09 Score=66.20 Aligned_cols=292 Identities=12% Similarity=0.095 Sum_probs=140.0 Q ss_pred CCCCC---------------ccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEecc Q lcl|NC_011085. 
1 MADMK---------------GGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVL 65 (343) Q Consensus 1 ~~~~~---------------~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~i 65 (343) |.... ...+. .....+..++ ..+..+.+...+.+.....+.+++++++.++.+ .++++.- T Consensus 123 ~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~g~--~~~vP~~~~~~i~~~l~~~~~l~~~~~v~~~~g--~~~~~~~ 197 (466) T protein:vir:80 123 MPYEQRAALIARSEVKEFLAQVRTL-AQQKRAVSGA--ELTIPDVMLELLRDNMHRYSKLISKVRLRPLKG--TARQNIA 197 (466) T ss_pred hhhhhHHHHHHHHHHHHHHHHHHHH-hhhhhhhccc--cccccHHHHHHHHHhhhhhhhhhhheeeeecCc--eeEeeee Confidence 00000 00000 0001111111 124557888888888888888888888777643 3455554 Q ss_pred Ccce-eeeecCCCcCCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_011085. 66 GRTR-AAYLQAGQSLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGL 144 (343) Q Consensus 66 G~~t-~~~~~~g~~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~ 144 (343) +... +.-...|..++.. +++..++++.+.++ +.-+.|.+-=-..+..|+-+.+..+.+++++...|+.|+.- T Consensus 198 ~~~~~a~wv~E~~~~~~~--~~~f~~i~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~ail~G---- 270 (466) T protein:vir:80 198 GAIPEGVWTEAVANLNEL--SLSFSQIEVDGYKV-GGFIPIPNSTLEDSDLNLADEILDAIGQAIGFALDKAILYG---- 270 (466) T ss_pred cCCcceeecccccccccc--cccccceeecceee-eeehhhhHHHHhcchHHHHHHHHHHHHHHHHHHHhhheeec---- Confidence 4433 3333445555432 35566666665544 22244543222235578999999999999999999988631 Q ss_pred hhccccccccccccCCceeecccccc----cccchHHH---------HHHHHHHHHHHHHHHhhcCCCcCCc-EEEeCHH Q lcl|NC_011085. 145 CNMPAASNENIAGLGSASILEVGAKG----DLTSPVEL---------GKAVIAQLTIARAKLTSNYVPSADR-TFYTTPE 210 (343) Q Consensus 145 a~~~~~~~~~~~g~~~~~~~~~~~~~----~~~~~~~~---------~~~i~~~l~~a~~~Ld~~~VP~~gR-~~vv~P~ 210 (343) ... ..+.+........++....... ...++... +...+..+.. ...+.+.+. ..++ +.++++. 
T Consensus 271 ~G~-~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~-~~~~~~w~~~~~ 347 (466) T protein:vir:80 271 TGT-KMPVGIVTRLAQTTQPPNWGTKAPAWTNLSTTNLLKIDPTGKSAEEFFSELVL-KLSKARANY-SNGMKFWAMSSN 347 (466) T ss_pred cCC-CCcceeeecccccccccccccccccccccchhhhhhhhhhccchhhHHHHHHH-HHHhhhccc-cCCceeEEecch Confidence 110 1111111111110000000000 00000000 0011111111 111112222 2333 4567888 Q ss_pred HHHHHhccchhhhh--ccccccchhcceeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEe Q lcl|NC_011085. 211 VYSAILAALMPNAA--NYAALIDPERGSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVG 288 (343) Q Consensus 211 ~~~~Ll~~~~~~~~--~~~~~~~~~~G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 288 (343) .+..|+.-....+. .+.. ...++ ..++|.+|+.|+++|.+.. T Consensus 348 ~~~~l~~~~~~~~~~g~~~~--~~~~~--~~i~G~pvv~s~~~~~~~~-------------------------------- 391 (466) T protein:vir:80 348 THAVLMSKAITFNSAGALVA--SLNNT--MPIVGGDIVILDFIPDNDI-------------------------------- 391 (466) T ss_pred hHHHhhcccccccCCccccc--cCCCc--ccccccceeecCccCccce-------------------------------- Confidence 88877654322111 1111 11122 2589999999999985321 Q ss_pred EeechhhheeeeeeeeEEeeeeccchhh--hhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 
289 LFQHRSAVGTVKLKDLSLERARRAEYQA--DQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 289 l~~~~~Av~~~~~~~~~~e~~~~~~~~~--d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) ++...+....+..+.++++...+..+.- ..+++.+++++++++|++.+.+.++.= T Consensus 392 ~~g~~~~y~i~~r~~~~i~~~~~~~f~~d~~~~r~~~r~dg~~~~~~afv~~~~~~~ 448 (466) T protein:vir:80 392 IGGYGSLYLLAERADIKLAQSEHVRFIEDQTVFKGTARYDGKPVFGEGFVAVNIANA 448 (466) T ss_pred eeeccccEEEEeecceEEEechhhhhhcCcEEEEEEEEEccEEeccCceEEEEecCC Confidence 1111111122333445555554433223 357888999999999999998865433 No 177 >protein:vir:80446 Length: 367 # NCBI annotation: BcepGomrgp07 # Family: family:all:1522 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210227;genbank:gi:146329919;genbank:GeneID:5123555 Probab=98.15 E-value=5.7e-07 Score=54.82 Aligned_cols=294 Identities=12% Similarity=0.078 Sum_probs=153.6 Q ss_pred CCCCCccccccccccccccccchhHHHH-HHHHHHHHHHHHHhhhhc--cCcc-cccc-----ccceEEEEeccCccee- Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQSGGDKLALFL-KVFGGEVLTAFARTSVTT--NRHI-MRSI-----SSGKSAQFPVLGRTRA- 70 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~i-e~~~g~V~~~f~~~s~~~--~~~~-~~~i-----~~G~tv~i~~iG~~t~- 70 (343) |+.+ ++. |+. ..+|+ |+|...|.+...+.+-|. +.+. ...+ .+|+.|.+|..+...- T Consensus 1 M~~~---~~~-T~l---------~Dii~pEvF~~Yv~~~~~e~~~l~qSGiv~~d~~l~~~~~~gG~~v~iPf~~~L~g~ 67 (367) T protein:vir:80 1 MPDF---NNQ-VRL---------VDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSL 67 (367) T ss_pred Ccch---hhh-hhh---------hhccchhhhhHHHhhhhhhhhhhhhcceeecCHHHHHHhhcCCCEEEeeeeccCCCC Confidence 5554 333 331 23566 999999988777666543 3332 2222 5799999999987642 Q ss_pred -eeecCCCcCC-CccCCCccceEE-EEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhc Q lcl|NC_011085. 71 -AYLQAGQSLD-DKRKDIKHTEKT-IVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNM 147 (343) Q Consensus 71 -~~~~~g~~i~-~~~~~~~~~~~~-l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~ 147 (343) ..|..+.+.. -++..++..+.. .++ ..-.++...|+-..-+--|+|..+..+.+.--.|..-+.+|..|...-.. 
T Consensus 68 ~~n~~~d~~~~~~t~~kittg~~~a~v~--~r~kaw~~~Dla~~lsG~dpm~~Ia~qva~yW~r~~q~~Lla~L~Gvf~~ 145 (367) T protein:vir:80 68 EPNYGSDNPNVEAPIDGLGSGEMKTTKT--WLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKS 145 (367) T ss_pred ccccCCCCCcccccccccccchheeeee--hhcccchhhhHHHHhhCchHHHHHHHHHHHHhhhhhHHHHHHHHHHhhcc Confidence 1221111110 112223332221 222 23345778899888888899999999988777776555555444433221 Q ss_pred ccccc-----------ccccccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHh Q lcl|NC_011085. 148 PAASN-----------ENIAGLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAIL 216 (343) Q Consensus 148 ~~~~~-----------~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll 216 (343) ....+ +...+.....+++..+.+...+..-. .+.+.+|+..|-++. +.=-.++|.+.+|..|. T Consensus 146 ~~a~~~~~~~~~~~~~a~~~~~~~~~~~Dis~~t~~~~~~~s----~~~~~~A~~~lGD~~--~~l~~i~mHS~V~~~L~ 219 (367) T protein:vir:80 146 NLAGNFATIKTRGRVPAEVLGTAGDMVIDISGQTNPADAVFN----REAFVDAAFTMGDHV--GSIAAIAVHSMVYKRMT 219 (367) T ss_pred ccccchhhhhhhhccccccccccCceeeeeeccCCCccceec----HHHHHHHHHHhcccc--ccccEEEEchHHHHHHH Confidence 11110 11223344555555443322211111 234556777786653 33468899999999998 Q ss_pred ccchhhhhccccccchhcceeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhh Q lcl|NC_011085. 217 AALMPNAANYAALIDPERGSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAV 296 (343) Q Consensus 217 ~~~~~~~~~~~~~~~~~~G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av 296 (343) +...+.-..+. +. +..|+.++|.+|++...+|.....+. +.| ...+|-.-|+ T Consensus 220 ~~~li~~i~~s-d~---~~~i~ty~G~~VIvDD~~Pv~~~~a~-----------------~~y-------ttYlfg~GAi 271 (367) T protein:vir:80 220 NNDEIEFIPDS-KG---QLTIPTYMGKVVIVDDGMPVFGTGAD-----------------KTY-------LSILFGGAAF 271 (367) T ss_pred hccccccccCC-CC---ccccceecceeEEEeCCCcccccCCC-----------------ceE-------EEEEEeccee Confidence 76432221221 11 35689999999999999996543211 111 1123444455 Q ss_pred eeeeeeeeE-Eeeeeccchh----hhhhhhh-----hhhccceecccceEE-E-EecCC Q lcl|NC_011085. 
297 GTVKLKDLS-LERARRAEYQ----ADQIIAR-----YAMGHGGLRPEAAGA-L-VFTAG 343 (343) Q Consensus 297 ~~~~~~~~~-~e~~~~~~~~----~d~i~~~-----~~~G~~v~rpe~~~~-i-~~~~g 343 (343) +.....+.. +|..||+... .|.+..+ |.+|.+-....-+.- . -.+.| T Consensus 272 ~~~~~~~~~~~E~~Rd~~~~~~gG~d~L~~Rr~~~~hP~G~s~~~~~v~~~~~~~~~~~ 330 (367) T protein:vir:80 272 GYADGAPQVPVAVGRRELRGNGSGLEYILERKEWIVHPGGFNWLDADVTIPDNTGSPSG 330 (367) T ss_pred eecccCCccceecccchhhhcCCceEEEEeeeeEEeecceeeecccccccccccccccc Confidence 555544333 6888888653 1443332 334444332211100 0 00111 No 178 >protein:vir:79928 Length: 393 # NCBI annotation: major head protein # Family: family:all:30335 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429616;genbank:gi:156564106;genbank:GeneID:5525693 Probab=98.03 E-value=3e-07 Score=56.38 Aligned_cols=299 Identities=15% Similarity=0.186 Sum_probs=164.9 Q ss_pred CCCCCccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEeccCcceeeeecCCCcCC Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVLGRTRAAYLQAGQSLD 80 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG~~t~~~~~~g~~i~ 80 (343) |+.-.-+++-.+|-- ..+++..=|..++-++-|.++-.--.+-..++-.-.++.|.+-.|+.+|.....+...|+.++ T Consensus 59 m~G~~p~~eV~~~e~--mtt~~a~IliP~vis~v~~Eaaepl~~~~kl~qk~~L~~Grsm~F~~~g~~Ra~~IgEGgE~~ 136 (393) T protein:vir:79 59 MEGETPTNEVNLREF--MATPSAQILIPRVIVGTMREAAEPLYIGTKMLQKIRLKSGQSMIFPSIGIMRAYDVAEGQEIP 136 (393) T ss_pred hcCCCchhheehhhh--hcCCCcceechhhhhhhhhhcccchhHHHHHHHHHhhhcCcceeccchheeeecccccccccc Confidence 886666666445433 344444334558888888775433233333444446778999999999988888888888877 Q ss_pred CccCC-CccceEEEEeeeeeeeeeeccchHHH--HhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccc Q lcl|NC_011085. 81 DKRKD-IKHTEKTIVIDGLLTADVLIYDIEDA--MNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENIAG 157 (343) Q Consensus 81 ~~~~~-~~~~~~~l~iD~~~~~~~~Idd~D~~--q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~g 157 (343) ...-+ -++. .+++-+.++. ..|.=-|+. 
.+..|+++-..+.++.+|+|..|+-++++.-+-...+ ..+ T Consensus 137 ~~sld~~T~d--sv~~~~gK~G-~~Ia~SqEmIsDSg~Dvin~~l~aA~RaMaRkKee~a~n~fk~~ghtv------fDa 207 (393) T protein:vir:79 137 EDSIDWQTHE--SPEIRVGKSG-IRLRFTDEMISDSQWDLMSMMIKQAGRAMGRHKEQKAYHQFRSHGHTV------FDN 207 (393) T ss_pred ccchhhhcCC--ceeEEechhh-hhhhhHHHHhhcchHHHHHHHHHHHHHHHHhhhHHHHHhhhhccccee------eec Confidence 65433 2233 3445555543 344322332 2679999999999999999999999998865332211 112 Q ss_pred cCCceeeccccccc--ccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhh--h-ccc----- Q lcl|NC_011085. 158 LGSASILEVGAKGD--LTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNA--A-NYA----- 227 (343) Q Consensus 158 ~~~~~~~~~~~~~~--~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~--~-~~~----- 227 (343) +..++.....+-+- ...+.-..+.+.|.++.+ ...-- .+-++++.|=.|+..-+....-. . -|+ T Consensus 208 ~st~t~ahptGr~~~~~qNGTlSleDllDm~~av---~~~hy---t~svi~MHPLAWnv~AKna~me~~~~na~gN~~~~ 281 (393) T protein:vir:79 208 YSTNKLAHTTGLDKNGVQNDTFSAEDFLDLIIAV---MANEY---TPSDLMMHPLAWTVFAKNELMGSLQANPYGNYPAK 281 (393) T ss_pred cccCccceeecCCccccccccccHHHHHHHHHHH---hcccC---CcceEEEcCchhhhhhhhhhhcceeeccccccCcc Confidence 22222211111000 111111223344443322 22222 34588888888887765432111 1 111 Q ss_pred ---ccc----chhcceeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeee Q lcl|NC_011085. 228 ---ALI----DPERGSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVK 300 (343) Q Consensus 228 ---~~~----~~~~G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~ 300 (343) .+. .+.+|++ -+.|+|+.||-+|.-..... 
| .|..--.+++++..-++ T Consensus 282 ~~~ts~algp~~i~~~~--~~nlnv~~sPfvp~d~k~~r-----------F------d~~~Vd~NnvgvlLV~D------ 336 (393) T protein:vir:79 282 GAPSSMALGPDSIQGRL--PFNFNVNLSPFIPLDKKSRR-----------F------DVYAVDRNNVGVLLVRD------ 336 (393) T ss_pred ccchhhhhchhhhcccc--ccceeEEEecccccccccce-----------e------eEEEeecCCceEEEEec------ Confidence 000 0111111 14589999998885432110 0 00000123445444332 Q ss_pred eeeeEEeeeeccchhhhhhhhhhhhccceecccceEE----EEecCC Q lcl|NC_011085. 301 LKDLSLERARRAEYQADQIIARYAMGHGGLRPEAAGA----LVFTAG 343 (343) Q Consensus 301 ~~~~~~e~~~~~~~~~d~i~~~~~~G~~v~rpe~~~~----i~~~~g 343 (343) ++++|.+-|+.+--.-|+-.-+||.+++.-..++. |..+.- T Consensus 337 --~i~tdq~ddk~rdiq~iKl~ERYG~gvLn~gkaiavakNI~~~k~ 381 (393) T protein:vir:79 337 --DLKTDQWDEKARGLQNIKMIERYGIGILNEGKAIAVAKNISMDKS 381 (393) T ss_pred --CcceeccccccccceeeeeeeeeceeeeeCCceEEEEecceeecc Confidence 46788888887777788889999999998866554 233332 No 179 >protein:vir:78387 Length: 349 # NCBI annotation: putative coat protein # Family: family:all:1522 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110837;genbank:gi:134288598;genbank:GeneID:5179650 Probab=97.54 E-value=2.6e-05 Score=45.76 Aligned_cols=286 Identities=14% Similarity=0.108 Sum_probs=138.7 Q ss_pred CCCCCccccccccccccccccchhHHH--HHHHHHHHHHHHHHhhhhc--cCccc-ccc-----ccceEEEEeccCccee Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQSGGDKLALF--LKVFGGEVLTAFARTSVTT--NRHIM-RSI-----SSGKSAQFPVLGRTRA 70 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~--ie~~~g~V~~~f~~~s~~~--~~~~~-~~i-----~~G~tv~i~~iG~~t~ 70 (343) ||-+ +. .|. +. +|+|...|.+...+.+.|. +.+.. 
..+ .+|+.+.+|..+...- T Consensus 1 Ma~T--------~l------~D~--iipe~~vf~~Yv~~~~~e~~~l~qSGii~~d~~l~~~~~~gG~~~~iPf~~~L~g 64 (349) T protein:vir:78 1 MAIT--------TI------GDI--VTGNIPVLASYMTEDPVEKTAFFDSGILTSTPYAAEIANGPSNIANLPFWKAIDT 64 (349) T ss_pred CCce--------EE------eee--eccCHHHHHHHHHHhhHHhhhhhhccceeccHHHHHHhhcCCCEEEeeeeecCCC Confidence 5532 21 111 12 2479888888777666543 33332 222 4699999999987542 Q ss_pred e---eecCC---CcCCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_011085. 71 A---YLQAG---QSLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGL 144 (343) Q Consensus 71 ~---~~~~g---~~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~ 144 (343) . .|... ..+ ++..++..+..-. =..+-.++...|+-..-+--|+|..++.+.+.--.|...+.+|..|... T Consensus 65 ~~e~nv~~D~~~~~~--t~~kitt~~~~a~-~~~r~kaw~~~Dla~~lsG~dpm~~Ia~~va~yW~r~~q~~Lia~L~Gv 141 (349) T protein:vir:78 65 SIEPNYSNDVYQDIA--TPRAIQTGEMMAR-VAYLNEGFGQADLTVELTSQNPLQSVASRLDNFWQRQAQRRLIATALGL 141 (349) T ss_pred CcccccCCCCccccc--ccccccccceeee-eeeeccccchhHHHHHhhCchHHHHHHHHHHHHHhhHHHHHHHHHHHHh Confidence 1 11111 111 1222333332211 1234456778888777777799999999988877776665555555433 Q ss_pred hhccccccccccccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhc---CCCcCCcEEEeCHHHHHHHhccchh Q lcl|NC_011085. 145 CNMPAASNENIAGLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSN---YVPSADRTFYTTPEVYSAILAALMP 221 (343) Q Consensus 145 a~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~---~VP~~gR~~vv~P~~~~~Ll~~~~~ 221 (343) -........ ...+....+...++.+ ..+ ++.+.+ |..+|.+. 
+....=..+++.+..|..|.+...+ T Consensus 142 f~~~~~a~~-~~~~~~~~t~d~s~~a-~~~----~~~~~d----A~~~lgda~~Gd~~~~lt~i~mHS~v~~~L~~~~li 211 (349) T protein:vir:78 142 YNDNVSATD-AYHEQNDMVVDVSATL-GFD----AGAFID----ATQTMGDALMGNGGEVLGAIAMHSFVYAQARKAQLI 211 (349) T ss_pred hcccccccc-hhhhcccceeeecccc-CCC----hhhhhh----hHHHHHHHhccccccceeEEEEchHHHHHHHhhhhh Confidence 221100000 0011111222222111 111 233333 34444443 1111225788999999999865432 Q ss_pred hhhccccccchhcceeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeee Q lcl|NC_011085. 222 NAANYAALIDPERGSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKL 301 (343) Q Consensus 222 ~~~~~~~~~~~~~G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~ 301 (343) . |.- ..-.+..|..++|..|+++..+|....++.. ......|-.-|++.... T Consensus 212 ~---~i~-~s~~~~~i~ty~G~~VivDD~~Pv~~~g~~~------------------------~yttylfg~GAi~~~~~ 263 (349) T protein:vir:78 212 D---FIR-DAENNTMFATYQGYRVIVDDSMTVVGQGAQR------------------------KFISIIFGQGAIGYGEG 263 (349) T ss_pred h---hcc-CcccCcccceecCeEEEEeCCCccccCCCCc------------------------eEEEEEeecceEEEccC Confidence 2 210 1113446889999999999999965432110 11113333445555554 Q ss_pred eee-EEeeeeccchh----hhhhhh-----hhhhccceeccc-------------------------------c--eEEE Q lcl|NC_011085. 302 KDL-SLERARRAEYQ----ADQIIA-----RYAMGHGGLRPE-------------------------------A--AGAL 338 (343) Q Consensus 302 ~~~-~~e~~~~~~~~----~d~i~~-----~~~~G~~v~rpe-------------------------------~--~~~i 338 (343) .+. .+|..||+... .|.+.. +|.+|.+-..+. . ++.| T Consensus 264 ~~~~~~et~rd~~~g~~~G~d~l~~R~~~~~hp~G~s~~~a~v~~~~~~~~~~sPt~aeLa~~~NW~~v~~~K~I~iv~~ 343 (349) T protein:vir:78 264 NPVMPLEYEREASRANGGGVETLWTRKTWLLHPFGYRFTSAVITGNGTETIARSASWQDLANATNWNRVVDRKHVPIAFL 343 (349) T ss_pred CCccceeeecccccCCcceeEEEEEeeEEEeeeeeeeeccccccCCccccccCCCChHHhcCCcCcccccChhhcceEEE Confidence 432 25666676432 255554 333444433221 0 0111 Q ss_pred EecCC Q lcl|NC_011085. 
339 VFTAG 343 (343) Q Consensus 339 ~~~~g 343 (343) ++.-| T Consensus 344 ~~~~~ 348 (349) T protein:vir:78 344 VTGVG 348 (349) T ss_pred EeccC Confidence 11111 No 180 >protein:vir:107687 Length: 319 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003898;genbank:gi:45686314;genbank:GeneID:2773027 Probab=97.38 E-value=5e-05 Score=44.18 Aligned_cols=296 Identities=11% Similarity=0.033 Sum_probs=138.2 Q ss_pred CCCCCcccccc-------ccccccccccchhHHHH-HHHH---HHHHHHHHHhhhhccCccccc-cccc-eEEEEec--- Q lcl|NC_011085. 1 MADMKGGQQLG-------KDQGKGQSGGDKLALFL-KVFG---GEVLTAFARTSVTTNRHIMRS-ISSG-KSAQFPV--- 64 (343) Q Consensus 1 ~~~~~~~~~~~-------t~~g~~~~~~d~~al~i-e~~~---g~V~~~f~~~s~~~~~~~~~~-i~~G-~tv~i~~--- 64 (343) |-+|.+.-... -..|.-..+.+..++|+ ++|. ..|.+.....-+.+.++.+++ +--| .++.+.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~da~~~~g~~~~~ql~~id~~v~e~~~~~l~~~~~i~v~~~~~~~~~~~~~~~~~~ 80 (319) T protein:vir:10 1 MTTKKFDEADKSNVEMYLIQAGVKQDAAATMGIWTAQELHRIKSQSYEEDYPVGSALRVFPVTTELSPTDKTFEYMTFDK 80 (319) T ss_pred CCCcchhHHhhHHHHHHHhhccchhhhhhhhhhHHHHHHHHHHHHHHhhhhcceechhhcccccCCCCceEEEEeeeecc Confidence 76666551111 11122223333334565 5555 233333333345556666653 2223 3454443 Q ss_pred cCccee-eeecCCCcCCCccCCCccceEEEEeeee-eeeeeeccchHHHH-hchhhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011085. 65 LGRTRA-AYLQAGQSLDDKRKDIKHTEKTIVIDGL-LTADVLIYDIEDAM-NHYDVRSEYTSQIGESLAMAADGAVLAEL 141 (343) Q Consensus 65 iG~~t~-~~~~~g~~i~~~~~~~~~~~~~l~iD~~-~~~~~~Idd~D~~q-~~~d~~~~~~~~~~~aLa~~~D~~i~~~~ 141 (343) .|..+. .++. .+++.. +..-++....|-.. .-+.+.+.+++.++ ...++-.+-...++.++++..|+.++.-. 
T Consensus 81 ~G~a~~~~d~~--~dip~v--~~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i~f~G~ 156 (319) T protein:vir:10 81 VGTAQIIADYT--DDLPLV--DALGTSEFGKVFRLGNAYLISIDEIKAGQATGRPLSTRKASACQLAHDQLVNRLVFKGS 156 (319) T ss_pred ccceeeecCcc--ccccce--eccceeeEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEeec Confidence 455442 2332 222221 12233334333332 22334456666665 47788888888999999999999886321 Q ss_pred HhhhhccccccccccccCCceeecccccccccc-hHHHHHHHHHHHHHHHHHHhhc--CCCcCCcEEEeCHHHHHHHhcc Q lcl|NC_011085. 142 AGLCNMPAASNENIAGLGSASILEVGAKGDLTS-PVELGKAVIAQLTIARAKLTSN--YVPSADRTFYTTPEVYSAILAA 218 (343) Q Consensus 142 ~~~a~~~~~~~~~~~g~~~~~~~~~~~~~~~~~-~~~~~~~i~~~l~~a~~~Ld~~--~VP~~gR~~vv~P~~~~~Ll~~ 218 (343) . ...+.|+-...-++....+...+ ..+..+.++++|..+..+|..+ .+ ...-.++|+|+.|..|..- T Consensus 157 ~---------~~g~~GLlN~p~~~~~~~~~~~~~~t~t~~~i~~di~~~~~~l~~~s~g~-~~p~~L~L~p~~~~~L~~~ 226 (319) T protein:vir:10 157 A---------PHKIVSVFNHPNITKITSGKWIDVSTMKPETAEAELTQAIETIETITRGQ-HRATNILIPPSMRKVLAIR 226 (319) T ss_pred c---------cccceeEEeCCCceeeecCCCCCccccCHHHHHHHHHHHHHHHHHhcCce-eeceEEEecHHHHHhhhcc Confidence 1 11112221111111111111111 1123466788888887777654 32 1224788999999988531 Q ss_pred chhhhhccccccchhcceeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhhee Q lcl|NC_011085. 219 LMPNAANYAALIDPERGSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGT 298 (343) Q Consensus 219 ~~~~~~~~~~~~~~~~G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~ 298 (343) ..+.+..-..-+++ +.-+++|...+.|...+.. + .+.++...-+++-+.. T Consensus 227 --~~~~~~t~l~~lk~----~~~~l~I~~~pel~~ag~~-------------------g-----~~~~v~y~~~~~~~~~ 276 (319) T protein:vir:10 227 --MPETTMSYLDYFKS----QNSGIEIDSIAELEDIDGA-------------------G-----TKGVLVYEKNPMNMSI 276 (319) T ss_pred --cCCCCeeHHHHHHH----hcCCceEEEeeeecccCCC-------------------c-----ceEEEEEecCCceEEE Confidence 11111111111211 2245667776666532110 0 0111111222333334 Q ss_pred eeeeeeEEeeeeccchhhhhhhhhhhh-ccceecccceEEEEec Q lcl|NC_011085. 
299 VKLKDLSLERARRAEYQADQIIARYAM-GHGGLRPEAAGALVFT 341 (343) Q Consensus 299 ~~~~~~~~e~~~~~~~~~d~i~~~~~~-G~~v~rpe~~~~i~~~ 341 (343) +..+++++.. ..++.....+....+. |.-+.||++++.+.-= T Consensus 277 ~v~~~~~~~~-~e~~~l~~~~~~~~r~~Gv~i~~P~ai~~~dGI 319 (319) T protein:vir:10 277 EIPEAFNMLP-AQPKDLHFKVPCTSKCTGLTIYRPMTIVLITGV 319 (319) T ss_pred ecCcceeeee-eeecCceEEEeeeeeeEEEEEEccceeEeeecC Confidence 4344444332 2334455566666654 7899999998776433 No 181 >protein:vir:94989 Length: 349 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224029;genbank:gi:62327316;genbank:GeneID:5176817 Probab=97.26 E-value=0.00011 Score=42.24 Aligned_cols=288 Identities=14% Similarity=0.096 Sum_probs=139.4 Q ss_pred CCCCCccccccccccccccccchhHHH--HHHHHHHHHHHHHHhhhhc--cCcccc-cc-----ccceEEEEeccCccee Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQSGGDKLALF--LKVFGGEVLTAFARTSVTT--NRHIMR-SI-----SSGKSAQFPVLGRTRA 70 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~--ie~~~g~V~~~f~~~s~~~--~~~~~~-~i-----~~G~tv~i~~iG~~t~ 70 (343) ||-+ +. .|. +. +|+|...|.+...+.+.|. +.+..+ .+ .+|+.+.+|.++...- T Consensus 1 Ma~T--------~l------~D~--iipe~~vf~~Yv~~~~~e~~~l~qSGii~~d~~l~~~~~~gG~~~~iPf~~~l~g 64 (349) T protein:vir:94 1 MAIT--------TI------GNI--VTGNIPVLASYMTEDPVEKTAFFNSGILTPTPYAAEIARGPSNIANLPFWKAIDT 64 (349) T ss_pred CCce--------EE------eee--eccChHHHHHHHHHhHHHhhhhhhccceeccHHHHHHHhcCCCEEEeeeeecCCC Confidence 5532 21 111 12 2479888888777666543 333322 22 4699999998876431 Q ss_pred ---eeecCCCcCC-CccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhh Q lcl|NC_011085. 71 ---AYLQAGQSLD-DKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCN 146 (343) Q Consensus 71 ---~~~~~g~~i~-~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~ 146 (343) ..|...++.. .++..++..+.. -+=..+-.++...|+-..-+--|+|..++++.+.--.|...+.+|..|...-. 
T Consensus 65 ~~e~n~~~dt~~~~~t~~kit~~~~~-a~~~~r~kaw~~~Dla~~lsG~dpm~~Ia~~va~yW~r~~q~~Lia~L~Gvf~ 143 (349) T protein:vir:94 65 SIEPNYSNDVYQDIATPRAIQTGEMM-ARVAYLNEGFGQADLTVELTSQNPLQSVASRLDNFWQRQAQRRLIATALGLYN 143 (349) T ss_pred CcccccCCCCccccccccccccccee-eeeeeeccccchhHHHHHhhCchHHHHHHHHHHHHHhhHHHHHHHHHHHhhhc Confidence 1122112111 112223333222 11123444677888877777779999999999888887666655555543322 Q ss_pred ccccccccccccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCC--CcCC-cEEEeCHHHHHHHhccchhhh Q lcl|NC_011085. 147 MPAASNENIAGLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYV--PSAD-RTFYTTPEVYSAILAALMPNA 223 (343) Q Consensus 147 ~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~V--P~~g-R~~vv~P~~~~~Ll~~~~~~~ 223 (343) ........ .......+....+.+ ..+ ++.+.++ ..+|-++-- ..+. -.+++.+..|..|.+...+.. T Consensus 144 ~~~~~~~~-~~~~~~~~~d~~~~a-~~~----~~~~~~A----~~~~Gdaa~Gd~~~~lt~i~mHS~v~~~L~~~~li~~ 213 (349) T protein:vir:94 144 DNVSATDA-YHEQNDMVVDVSATS-GFD----AGAFIDA----TQTMGDALMGNGGEVLGAIAMHSFVYAQARKAQLIDF 213 (349) T ss_pred cccccccc-ccccCceeEEecccC-CCC----hhhHHHH----HHHHHHHhccccccceeEEEEchHHHHHHHhcchhhh Confidence 11111100 111111222222111 111 2333443 333333210 1122 478899999999887643221 Q ss_pred hccccccchhcceeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeee Q lcl|NC_011085. 224 ANYAALIDPERGSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKD 303 (343) Q Consensus 224 ~~~~~~~~~~~G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~ 303 (343) .. ..-.+..|..++|..|++...+|.....+.. .| ....|-.-|++...... 
T Consensus 214 i~----~s~~~~~i~ty~G~~VivDD~~Pv~~~g~~~-----------------~y-------ttylfg~GAi~~~~~~~ 265 (349) T protein:vir:94 214 IR----DAENNTMFATYQGYRVIVDDSMTVVGQDTSR-----------------KF-------ISIIFGQGAIGYGEGNP 265 (349) T ss_pred cc----CcccCcccceecCcEEEEeCCCccccCCCCc-----------------eE-------EEEEeecceEEeecCCC Confidence 11 1112445889999999999999965322110 11 11233344555555543 Q ss_pred e-EEeeeeccchh----hhhhhh-----hhhhccceecccc---------------------------------eEEEEe Q lcl|NC_011085. 304 L-SLERARRAEYQ----ADQIIA-----RYAMGHGGLRPEA---------------------------------AGALVF 340 (343) Q Consensus 304 ~-~~e~~~~~~~~----~d~i~~-----~~~~G~~v~rpe~---------------------------------~~~i~~ 340 (343) . -+|..|++... .|.+.. +|.+|.+-..+.- ++.|++ T Consensus 266 ~~~~E~~rd~~~g~~~G~d~L~~R~~~~~hp~G~s~~~a~v~~~~~~~~~~sPt~aeLa~~~NW~~v~~~K~I~iv~~~~ 345 (349) T protein:vir:94 266 EMPLEYEREASRANGGGVETLWTRKTWLLHPFGYSFTSAVITGNGTETIARSASWQDLANAANWNRVVDRKHVPIAFLVT 345 (349) T ss_pred CcceeeecccccCCcceeEEEEEeeEEEeeeeeeeecccccCCCccccccCCCChHHhcCCcCcccccChhhcceEEEEe Confidence 2 35666766542 255544 3344444433210 111111 Q ss_pred cCC Q lcl|NC_011085. 341 TAG 343 (343) Q Consensus 341 ~~g 343 (343) .-| T Consensus 346 ~~~ 348 (349) T protein:vir:94 346 GVG 348 (349) T ss_pred ccC Confidence 111 No 182 >protein:vir:80068 Length: 301 # NCBI annotation: gp8 # Family: family:all:463 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468712;genbank:gi:157325292;genbank:GeneID:5601759 Probab=97.17 E-value=0.00014 Score=41.66 Aligned_cols=284 Identities=14% Similarity=0.134 Sum_probs=133.7 Q ss_pred ccc-cccchhHHHHHHHHHHHHHHHHHhhhhccCccccc-cc-cceEEEEeccCcc-eeeeecCC-CcCCCccCCCccce Q lcl|NC_011085. 16 KGQ-SGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRS-IS-SGKSAQFPVLGRT-RAAYLQAG-QSLDDKRKDIKHTE 90 (343) Q Consensus 16 ~~~-~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~-i~-~G~tv~i~~iG~~-t~~~~~~g-~~i~~~~~~~~~~~ 90 (343) .++ .+|-..+=+++.+...|.+.....-+.+.++.+++ +- +..++.++....+ .++-|..+ .+++.. 
+..-.+ T Consensus 1 ~~~~~~g~f~~~~l~~id~~v~e~~~~~l~~r~l~~v~~~~~~~~~~~~~~~~~~~G~~~~~~~~~~dip~~--~~~~~~ 78 (301) T protein:vir:80 1 MQGKITATIEARDLQAIDNVIYEPKQEELTARSVFPQKFDVNEGAESYSFDVMTRSGAAKIIANGADDLPLV--DVDMVR 78 (301) T ss_pred CCccccchhhHHHHHHHHHHHHHhhhhhhhhhhhcccccCCCCceEEEEEeeeccceeEEEecCcccccccc--ccccee Confidence 111 11211222334455667677767777788877763 32 3445665544322 23333322 223221 122234 Q ss_pred EEEEeeee-eeeeeeccchHHHH-hchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccccCCceeecc-- Q lcl|NC_011085. 91 KTIVIDGL-LTADVLIYDIEDAM-NHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENIAGLGSASILEV-- 166 (343) Q Consensus 91 ~~l~iD~~-~~~~~~Idd~D~~q-~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~g~~~~~~~~~-- 166 (343) ....|-.. .-|.+.+.+++.++ ...++-.+-...++.++++..|+.++.-... ..+.|+-...-++. T Consensus 79 ~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~f~G~~~---------~g~~GLlN~p~~~~~~ 149 (301) T protein:vir:80 79 KSVPIYSIGIGLSYTIQDLRAARMQGTTVDAAKATTVRRAIAEKENSIAFRGEKK---------YAIKGAFEATGIQIDV 149 (301) T ss_pred EEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEeeeccc---------ccceeeecCCCccccc Confidence 44444332 22334455666665 4788888889999999999999988743211 11112111111100 Q ss_pred ---cccccccc-hHHHHHHHHHHHHHHHHHHhhc--CCCcCCcEEEeCHHHHHHHhccchhhh-hccccccchhcceeEE Q lcl|NC_011085. 167 ---GAKGDLTS-PVELGKAVIAQLTIARAKLTSN--YVPSADRTFYTTPEVYSAILAALMPNA-ANYAALIDPERGSIRN 239 (343) Q Consensus 167 ---~~~~~~~~-~~~~~~~i~~~l~~a~~~Ld~~--~VP~~gR~~vv~P~~~~~Ll~~~~~~~-~~~~~~~~~~~G~V~~ 239 (343) ...+...+ ..++.+.|++.|.++..+|.++ .+ ...-.++|+|+.|..|..-. ... 
....-..-+++ + T Consensus 150 ~~~~~~~~~~~w~~~t~~ei~~di~~~~~~l~~~s~g~-~~p~~L~L~p~~~~~L~~~~-~~~~~~~tvl~~l~~----~ 223 (301) T protein:vir:80 150 SPTTGVGNVSKWEKKTAEQIIDEIGEAHTKITVLPGYG-TASLKLCLPPKQFELINKKR-YSNEDSRSVLKVLQD----N 223 (301) T ss_pred ccCcccccccccccCCHHHHHHHHHHHHHHHHHhcCce-ecccEEEecHHHHHhhhhcc-ccCCCCeeHHHHHHH----H Confidence 11111111 1234567888888888888664 22 11247889999999986321 101 01100111211 1 Q ss_pred EeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEeeeeccchhhhhh Q lcl|NC_011085. 240 VMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERARRAEYQADQI 319 (343) Q Consensus 240 i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~~~~~~~d~i 319 (343) .-+.+|...+.|...+.. + .+.++...-.++-+.....++++.-.. .++.....+ T Consensus 224 ~~~~~I~~~p~L~~~g~~---------g---------------~~~~v~~~~~~d~~~~~v~~~~~~~~~-e~~~~~~~~ 278 (301) T protein:vir:80 224 AWFSAIVRVPDLAGMGTA---------G---------------SDSFAVIHDSNETAELIIPMDITRHPE-EYSFPRTKV 278 (301) T ss_pred cCcceEEEcceeccCCCC---------c---------------ccEEEEEecCCcEEEEEecCceeeecc-eecCceeEe Confidence 223566666666422110 0 011111111223333333333332111 111223344 Q ss_pred hhhhhh-ccceecccceEEEEec Q lcl|NC_011085. 320 IARYAM-GHGGLRPEAAGALVFT 341 (343) Q Consensus 320 ~~~~~~-G~~v~rpe~~~~i~~~ 341 (343) ....+. |..+.||++++.+.-= T Consensus 279 ~~~~r~~Gv~i~~P~ai~~~~GI 301 (301) T protein:vir:80 279 PFEERTAGVVVRFPAAIVRVDGI 301 (301) T ss_pred eeeeeeEEEEEEccceEEEEecC Confidence 455555 7899999998876533 No 183 >protein:vir:3969 Length: 287 # NCBI annotation: major capsid protein # Family: family:all:3269 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:NP_663677;genbank:gi:21716114;genbank:GeneID:951200 Probab=96.90 E-value=0.0002 Score=40.86 Aligned_cols=263 Identities=14% Similarity=0.134 Sum_probs=141.3 Q ss_pred ccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCcc-----ccccccceEEEEeccCcc--eeeeecCCCcCC-- Q lcl|NC_011085. 
10 LGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHI-----MRSISSGKSAQFPVLGRT--RAAYLQAGQSLD-- 80 (343) Q Consensus 10 ~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~-----~~~i~~G~tv~i~~iG~~--t~~~~~~g~~i~-- 80 (343) +..| .|-|+|.|.+.+-|++++.|++..- .+-++..++.---..... -++.|..+.+.- T Consensus 1 ~avr------------~y~Kq~~glL~~vf~~qa~F~~~FGg~lQ~~DGV~~N~taf~vKtsD~pVVi~~Y~Td~Nv~FG 68 (287) T protein:vir:39 1 MAIK------------YFTKQYAGMLPDLFAKKSAFLRAFGGVLQVKDGVTENDTFMELKVSDTDVVIQAYSTDANVGFG 68 (287) T ss_pred CCcc------------cccHHHHHHHHHHHHHHHhhhhhcccceeeecCCcccceEEEEEecCcceEEecccCCCCcccc Confidence 1122 4889999999999999999875542 223333333222222211 234454433211 Q ss_pred -CccCCCccceE--EEEeeee------eeeeeeccchHHHHhchh---hHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcc Q lcl|NC_011085. 81 -DKRKDIKHTEK--TIVIDGL------LTADVLIYDIEDAMNHYD---VRSEYTSQIGESLAMAADGAVLAELAGLCNMP 148 (343) Q Consensus 81 -~~~~~~~~~~~--~l~iD~~------~~~~~~Idd~D~~q~~~d---~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~ 148 (343) ++.......++ .+.+|+. +.++.-| |+...+-| ...+..+.++.|-++.+|..+=..|...+... T Consensus 69 tGTg~ssRFG~rkEi~y~dt~V~Y~~~~~ihEGi---D~~TVNnd~~aaVAdRL~Lqa~A~t~~~n~~~Gk~ls~~A~~t 145 (287) T protein:vir:39 69 SGTGNTSRFGQRKEVKSVNKQVSYDAPLAINEGI---DDFTVNDIKDQVVAERLALHGVAWAQHVDKLLGKLLSDSASET 145 (287) T ss_pred cCCCccccccceeEEEEecccccceecccccccc---ccccccCChhHHHHHHHHhHHHHHHHHHHHHHHHHHHhhcchh Confidence 00000001111 1222221 1222223 33333323 44566778899999999987654443332111 Q ss_pred ccccccccccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCC-cEEEeCHHHHHHHhccchhhhhccc Q lcl|NC_011085. 149 AASNENIAGLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSAD-RTFYTTPEVYSAILAALMPNAANYA 227 (343) Q Consensus 149 ~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~g-R~~vv~P~~~~~Ll~~~~~~~~~~~ 227 (343) . .. . .+ .+.+..++.+|.+++..++|-... 
..+.|+|++|.+|..++..+.+ -+ T Consensus 146 ~---------------~~---~--~t----~d~V~~LF~~a~~~yvNn~v~~~~~~~AyV~aevYnaiiD~~l~Tsa-K~ 200 (287) T protein:vir:39 146 L---------------TV---K--LD----EDSVTKLFSDAHKKFVNNNVSIAVPWVAYVNADIYDLLIDSKLATTA-KN 200 (287) T ss_pred e---------------ee---e--ec----ccchHHHHHHHHHHhhccceeeEEEEEEEEChhHHhHHhcccccccc-cc Confidence 0 00 0 00 122344566678888877775444 6777999999999988755543 33 Q ss_pred cccchhcceeEEEeceEEEEecccc-ccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEE Q lcl|NC_011085. 228 ALIDPERGSIRNVMGFEVVEVPHLT-AGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSL 306 (343) Q Consensus 228 ~~~~~~~G~V~~i~Gf~V~~sn~lp-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~ 306 (343) ++..+-+--|.++-||-+-+.|.-- +.+. ...|.++-|+.+..-.-.. T Consensus 201 SsaNiDen~i~kFkGf~l~e~P~~~~q~g~-------------------------------~a~fs~dnig~af~GI~va 249 (287) T protein:vir:39 201 SSANVDEQTLYKFKGFILSELPDEKFQLNE-------------------------------GAYFAADNVGVAGVGIQVT 249 (287) T ss_pred ceeeeccCCcceecceEEEecchHhhccCc-------------------------------EEEEccccceeecccceeE Confidence 4455555557789999998865221 1110 1234444444333211123 Q ss_pred eeeeccchhhhhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 
307 ERARRAEYQADQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 307 e~~~~~~~~~d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) .....+++-+-.+.|---||--++.-...++++.+.- T Consensus 250 R~i~sEdF~GvalQgAgK~G~~i~e~Nk~Ai~k~t~~ 286 (287) T protein:vir:39 250 RAMDSEDFAGTALQAAAKYGKYLPEKNKKAILKATVT 286 (287) T ss_pred EeeecccccceeeecccccccccccccceEEEEEecC Confidence 3334556667777777788888888888888877776 No 184 >protein:vir:98871 Length: 314 # NCBI annotation: major capsid protein # Family: family:all:3269 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164418;genbank:gi:56694908;genbank:GeneID:3197261 Probab=96.69 E-value=0.00029 Score=39.95 Aligned_cols=282 Identities=10% Similarity=0.083 Sum_probs=140.2 Q ss_pred CCCCCccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccc-----cccccceEEEEeccCcce--ee-e Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIM-----RSISSGKSAQFPVLGRTR--AA-Y 72 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~-----~~i~~G~tv~i~~iG~~t--~~-~ 72 (343) +-|.+-.++..+++.. . .-.|-|+|.+-+.+-|++++.|++..-- +-+.+.++.---....+. ++ . T Consensus 11 ~~~~~~~~~~t~N~n~-----a-vr~Y~Kqf~glL~~vf~~qa~F~~~FGg~lQalDGV~~N~tafsvKtsD~pVVig~~ 84 (314) T protein:vir:98 11 LNNIQFFASGTANQNK-----A-ARSYQKEFRQLLQAVFRSQAYFRDFFGGGIEALDGVQHNDTAFYVKTSDIPVVVGNE 84 (314) T ss_pred ccceeeeeeccccCcc-----c-eeeecHHHHHHHHHHHhhHhhhhhhcccceeeccCCCccceEEEEeecccceeecCc Confidence 3444433332222111 1 1248899999999999999998865432 223333332111111111 11 2 Q ss_pred ecCCCcCC---CccCCCccceE--EEEeeee-eeee-eec-cchHHHHhchh---hHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011085. 73 LQAGQSLD---DKRKDIKHTEK--TIVIDGL-LTAD-VLI-YDIEDAMNHYD---VRSEYTSQIGESLAMAADGAVLAEL 141 (343) Q Consensus 73 ~~~g~~i~---~~~~~~~~~~~--~l~iD~~-~~~~-~~I-dd~D~~q~~~d---~~~~~~~~~~~aLa~~~D~~i~~~~ 141 (343) |..+...- ++.......++ .+..|+. .|.. 
..| .-+|..-.+-| ...+..+.++.|-.+.+|..+=..| T Consensus 85 Y~TdeNvaFGtGTg~SsRFGprkEi~y~dtdVpY~~~~~iHEGiD~~TVNnd~~aaVAdRL~LQA~Akt~~~n~~~Gk~l 164 (314) T protein:vir:98 85 YNKDENVGFGEGTSRSTRFGPRREIIYQDTPVPYTWEWVYHEGIDKHTVNNDFQAAVADRLDLQANAKIKQFNAQHSKFI 164 (314) T ss_pred ccCCCCcccccCCccccccCceeEEEeecccccccccchhhhccccccccCChhHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 33332210 00000011111 1222221 1211 111 12333333333 3445667788888999997664444 Q ss_pred HhhhhccccccccccccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchh Q lcl|NC_011085. 142 AGLCNMPAASNENIAGLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMP 221 (343) Q Consensus 142 ~~~a~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~ 221 (343) ...+..+. .. ++.+ .+.+..++..+.+.....+|- ....+.|+|++|.+|..++.. T Consensus 165 S~~As~te----~l--------------td~~-----~d~V~~LF~~as~~yvn~ev~-~~~~AyV~~evYnaiiD~~l~ 220 (314) T protein:vir:98 165 SSIAEKTE----TL--------------TDYS-----ADNVLRLFNELSKYYVNIEAI-GTKAAKVSPELYNAIVDHPLT 220 (314) T ss_pred Hhhhhhhh----hh--------------hhcc-----hhhHHHHHHHHHhhhhcceee-EEEEEEEchhHHhHhhccccc Confidence 33221110 00 1111 123344555667777766663 347788999999999988755 Q ss_pred hhhccccccchhcceeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeee Q lcl|NC_011085. 222 NAANYAALIDPERGSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKL 301 (343) Q Consensus 222 ~~~~~~~~~~~~~G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~ 301 (343) +.+ -+++..+-+--|.++-||.|-+.|.-...+. .+ .+++.+-|+.+.. T Consensus 221 Tsa-K~SsaNIDengi~~FkGf~i~e~P~~~~q~g-----------------------------~i-a~~s~dnig~aft 269 (314) T protein:vir:98 221 TSA-KSSSANIDQNGIVNFKGFAIQEIPESMLQSG-----------------------------DV-AYTYITNIGKAFT 269 (314) T ss_pred ccc-ccceeeeccCCcceecceEEEecchhhcCCC-----------------------------cE-EEEccccceeecc Confidence 543 3344555555577899999988653322110 00 1222222222221 Q ss_pred eeeEEeeeeccchhhhhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 
302 KDLSLERARRAEYQADQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 302 ~~~~~e~~~~~~~~~d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) -.-.......+++-+-.+.|-=-||--++.-...+++++++- T Consensus 270 GIn~aR~IesEdF~GValQgAGK~G~~I~edNk~Ai~k~t~t 311 (314) T protein:vir:98 270 GINTSRIIESEDFDGVALQGAGKAGEFILDDNKKAVAKVTST 311 (314) T ss_pred cceeeeeeecccccceeeecccccccccccccceeeEEEecC Confidence 111122223445566677777778888888888888887766 No 185 >protein:vir:103285 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277465;genbank:gi:71834107;genbank:GeneID:3562396 Probab=96.66 E-value=0.00043 Score=39.05 Aligned_cols=275 Identities=12% Similarity=0.019 Sum_probs=133.1 Q ss_pred ccccccchhHHHH-HHHHHHHHHHHH----HhhhhccCccccc-cc-cceEEEEecc---CcceeeeecCC-CcCCCccC Q lcl|NC_011085. 16 KGQSGGDKLALFL-KVFGGEVLTAFA----RTSVTTNRHIMRS-IS-SGKSAQFPVL---GRTRAAYLQAG-QSLDDKRK 84 (343) Q Consensus 16 ~~~~~~d~~al~i-e~~~g~V~~~f~----~~s~~~~~~~~~~-i~-~G~tv~i~~i---G~~t~~~~~~g-~~i~~~~~ 84 (343) .+...+|.-..|+ ++|. .++.... ..-+.+.++..++ +- .-.++.++.. |..+ -|..+ .+++. . T Consensus 1 ~~~~~a~~~~~f~~~ql~-~id~~v~e~~~~~l~~~~~i~v~~~~~~~~~~~~~~~~~~~G~a~--~~~~~~~dip~--v 75 (296) T protein:vir:10 1 MGVDKADAAGIWTVKQLT-ASLNKAYETEYDQNSVVNLFPVSNEIPGYAKYFEYPVFDGVGIAQ--IVADYTDDLPL--V 75 (296) T ss_pred CcccchhhhHHHHHHHHH-HHHHHHHhhhhcccccceecccccCCCCceeEEEeeeeeccCcee--EeCCCccccce--e Confidence 3333334333454 7776 4444333 3345566666653 22 2345554443 4443 33322 22322 1 Q ss_pred CCccceEEEEeeee-eeeeeeccchHHHHh-chhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccccCCce Q lcl|NC_011085. 85 DIKHTEKTIVIDGL-LTADVLIYDIEDAMN-HYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENIAGLGSAS 162 (343) Q Consensus 85 ~~~~~~~~l~iD~~-~~~~~~Idd~D~~q~-~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~g~~~~~ 162 (343) +..-++....|-.. .-+...+.+++.++. ..++-......++.++++..|+.++.-- ....+.|+-... 
T Consensus 76 ~~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~ka~aA~~~~~~~~n~~~f~G~---------~~~g~~GLlN~p 146 (296) T protein:vir:10 76 DALATERQGKVFRFGNAFLISIDEIKVGQATGQSLSTRKQSLAFEAHDKLLDKLVWSGS---------TAHGIPSVFDYP 146 (296) T ss_pred eccceeEEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEeec---------ccccceeEeecC Confidence 22233344433332 123344566766654 6788888888999999999998876321 111122221111 Q ss_pred eecccccccccchHHHHHHHHHHHHHHHHHHhhc--CCCcCCcEEEeCHHHHHHHhccchhhhhccccccchhcceeEEE Q lcl|NC_011085. 163 ILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSN--YVPSADRTFYTTPEVYSAILAALMPNAANYAALIDPERGSIRNV 240 (343) Q Consensus 163 ~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~--~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~V~~i 240 (343) -++...+ ..+..+. ..+++.|.++...|.++ .+ ...-.++|+|+.|..|...- .+.+..-..-+++ +. T Consensus 147 ~v~~~~~--~~~W~~~-t~i~~Di~~~~~~l~~~s~g~-~~p~~l~L~p~~~~~L~~~~--~~~~~t~l~~ik~----~~ 216 (296) T protein:vir:10 147 NINNVVS--GGSWSQP-TTAVSDITSLLDIIETSTNGQ-HRATHLLLPTTARRIMQNLV--PGTSVSYGEFFRQ----NN 216 (296) T ss_pred CCccccc--cCCccCH-HHHHHHHHHHHHHHHHhhCce-ecceeEEeCHHHHHHHhhcc--CCCCccHHHHHHH----hc Confidence 1111111 1112111 25677777777766554 32 11236889999999886321 1111110111221 23 Q ss_pred eceEEEEeccccccccccccccccccccccccccccccccccccceEeEee--chhhheeeeeeeeEEeeeeccchhhhh Q lcl|NC_011085. 241 MGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQ--HRSAVGTVKLKDLSLERARRAEYQADQ 318 (343) Q Consensus 241 ~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~--~~~Av~~~~~~~~~~e~~~~~~~~~d~ 318 (343) .+.+|...+.|...+.. . +...+++ .++-+..+..++++... ..++...+. T Consensus 217 ~~l~i~~~~~l~~a~~~------------------------g--~~~~v~~~~~~~~~~~~v~~~~~~~~-~e~~~l~~~ 269 (296) T protein:vir:10 217 SGVTVEFVQYLNDYNGT------------------------G--TSAAIAYEKDPNNMAIEIPEATNALP-AQPKDLHFK 269 (296) T ss_pred CCceEEEeeeeccCCCC------------------------c--ceEEEEEEcCCceEEEEcCcceeeec-ccccCceEE Confidence 46666766666432110 0 0111222 23344444445544332 344556667 Q ss_pred hhhhhhh-ccceecccceEEE---Eec Q lcl|NC_011085. 
319 IIARYAM-GHGGLRPEAAGAL---VFT 341 (343) Q Consensus 319 i~~~~~~-G~~v~rpe~~~~i---~~~ 341 (343) +....+. |..+.||++++.+ ++. T Consensus 270 ~~~~~~~~Gv~i~~P~ai~~~dGI~~~ 296 (296) T protein:vir:10 270 IPVTSKATGLIVYRPLTMAVMKGITFA 296 (296) T ss_pred EeeEeeEEEEEEECCceeEEEeeeecC Confidence 7777766 7999999998886 555 No 186 >protein:vir:97397 Length: 517 # NCBI annotation: major capsid protein # Family: family:all:11745 # MgeID: mge:1675 # MgeName: Q54 # Cross-refs: genbank:acc:YP_762590;genbank:gi:115304291;genbank:GeneID:5130600 Probab=96.07 E-value=0.0011 Score=36.91 Aligned_cols=280 Identities=11% Similarity=0.027 Sum_probs=108.3 Q ss_pred CCCCCccccccccc--cccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEecc-CcceeeeecCCC Q lcl|NC_011085. 1 MADMKGGQQLGKDQ--GKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVL-GRTRAAYLQAGQ 77 (343) Q Consensus 1 ~~~~~~~~~~~t~~--g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~i-G~~t~~~~~~g~ 77 (343) ++......+..... ......+. +-...+...+...+...+.+...++..++. ...++.- ....+..+..|. T Consensus 226 ~~~~~~~~~~~~~~~~~~~~~~~~---~~p~~~~~~i~~~~~~~~~i~~~~~~~~i~---~~~~~~~~~~~~a~~~~eG~ 299 (517) T protein:vir:97 226 SASLTKDPKAAWTAELKERGISGM---PAPAGILKRIQDAVNDEGSLLPFIRHENLP---TLVVGGDNALTQGTGHTTGT 299 (517) T ss_pred Hhcccccccceeeeeccccccccc---ccchHHHHHHHHhhhhhccceeeeeecccc---ceeeecccccceeeeeecCC Confidence 11111111110000 00000000 011233344444555555555555544432 2333322 122334455565 Q ss_pred cCCCccCCCccceEEEEeeeeeeee-eeccchHHHHhchh----hHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccc Q lcl|NC_011085. 78 SLDDKRKDIKHTEKTIVIDGLLTAD-VLIYDIEDAMNHYD----VRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASN 152 (343) Q Consensus 78 ~i~~~~~~~~~~~~~l~iD~~~~~~-~~Idd~D~~q~~~d----~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~ 152 (343) ..+.+ +++..++++.+-+ +.. +.+..---..+.+| +.+.+..+.+++|+++.++.++.-- ++ . 
T Consensus 300 ~kp~s--~~tf~~~~~~~~~--ia~~~~~S~qll~Ds~~dd~~~l~s~i~~~l~~~l~~~ee~a~l~Gd--Gt---g--- 367 (517) T protein:vir:97 300 DKTES--NITLQTRVLTPQY--VYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGG--VT---G--- 367 (517) T ss_pred ccccc--ccceeeEEeeHhh--hhhhhhhhHHHHHHhhhccHHHHHHHHHHHHHHHHHHHHHHHHhccc--CC---C--- Confidence 54432 3445555554422 222 22221111112344 7778899999999999998886210 00 0 Q ss_pred ccccccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhccccccch Q lcl|NC_011085. 153 ENIAGLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAALIDP 232 (343) Q Consensus 153 ~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~ 232 (343) .+.. ..++........ .......+.+.+......+.+. .+-.+|++|..|..|.+-.. .+..|.=...+ T Consensus 368 ~~~~-----gi~~~a~~~~~~-~~~~~~~~~d~i~~l~~a~~~a----~~a~~vmn~~t~~~I~klKD-~~G~Yl~~~~~ 436 (517) T protein:vir:97 368 VSET-----QIYPVVGDAWAT-NVTGTTNIQELLEKLSVATPKA----ADSTLVIHRNDLAAIRFLKD-KNGNYVFPVGV 436 (517) T ss_pred cccc-----cccccccccccc-cccccchHHHHHHHHHHHhhhc----cCCEEEECHHHHHHHHHhhc-CCCCeeccCcC Confidence 0000 011111100000 0011122223322222223322 23356799999998865432 12334333344 Q ss_pred hcceeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeech-hhheeeeeeeeEEeeeec Q lcl|NC_011085. 233 ERGSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHR-SAVGTVKLKDLSLERARR 311 (343) Q Consensus 233 ~~G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~-~Av~~~~~~~~~~e~~~~ 311 (343) ..+.+..++|+.-+.. .++..... +++.. -.++..+. +..-..++ T Consensus 437 ~~~~~~~l~G~~~~~~-~~~~~~~~-------------------------------~~~~~~y~i~~~~g--~~~~~~fd 482 (517) T protein:vir:97 437 SNQTIATHFGFNRLVQ-SVAVDEKT-------------------------------AVSLSGYVTNGSRG--MEFEQGTI 482 (517) T ss_pred CcccccccCCcccccc-ccccCcee-------------------------------EeeccccEEEeecc--eeeeeeee Confidence 5666667777432221 12211000 00000 01111111 01111111 Q ss_pred cchhhhhhhhhhhhccceecccceEEEEec---CC Q lcl|NC_011085. 
312 AEYQADQIIARYAMGHGGLRPEAAGALVFT---AG 343 (343) Q Consensus 312 ~~~~~d~i~~~~~~G~~v~rpe~~~~i~~~---~g 343 (343) -.+-.+.+...++.|..++.|++++..+++ +| T Consensus 483 ~~~n~~~f~~~~~~~g~i~~~~r~a~~~~~p~~~~ 517 (517) T protein:vir:97 483 LVENNKEYLFEMPISGSLEYKGTTAYGTYTPPVAG 517 (517) T ss_pred cccCceeEeeeeeeccccccccceEEEEEcCCCCC Confidence 112233344445667777777776665543 34 No 187 >protein:vir:79548 Length: 652 # NCBI annotation: putative protease/scaffold protein # Family: family:all:62 # ACLAME annotation(s): go:0008236 - serine-type peptidase activity; phi:0000017 - phage prohead/capsid assembly # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272518;genbank:gi:148609387;genbank:GeneID:5204384 Probab=96.04 E-value=0.0011 Score=36.83 Aligned_cols=293 Identities=12% Similarity=0.070 Sum_probs=127.8 Q ss_pred CCC-------CCccccc-ccccccc--ccccchhHHHHHHHHHHHHHHHHHh-hhhccCccccccc---cceEEEEeccC Q lcl|NC_011085. 1 MAD-------MKGGQQL-GKDQGKG--QSGGDKLALFLKVFGGEVLTAFART-SVTTNRHIMRSIS---SGKSAQFPVLG 66 (343) Q Consensus 1 ~~~-------~~~~~~~-~t~~g~~--~~~~d~~al~ie~~~g~V~~~f~~~-s~~~~~~~~~~i~---~G~tv~i~~iG 66 (343) ||- ....... -...+.. ..++|-=.++...-...++..|+.. .-++.|.+..+++ ..+.+++-. T Consensus 336 lAr~~L~~~G~~~~~~~~~~~v~~A~~hsTsDFp~IL~~~~nk~l~~~y~~a~~t~~~~~~~~~~~DFk~~~~~~lg~-- 413 (652) T protein:vir:79 336 YARMSLTERGIGVSSYNPMQMVGAAFTHSTSDFGNILLDVANKAILQGWEDAPETYEQWTRKGQLSDFKIAHRVGMGG-- 413 (652) T ss_pred HHHHHHHhhccCCCCCCHHHHHHHHhhcCcchHHHHHHHHHHHHHHHHHhhhHHHHHHHhccCCCccccccceeecCC-- Confidence 221 1000000 0000111 2344422222233444455566644 3456666666544 444555433 Q ss_pred cceeeeecCCCcCCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhh Q lcl|NC_011085. 67 RTRAAYLQAGQSLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCN 146 (343) Q Consensus 67 ~~t~~~~~~g~~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~ 146 (343) .+++.....|.++.. ..+....-++.+.++- --|.|....-.=-..+....+....|.+-++.+++.+...+..-.. 
T Consensus 414 ~~~L~~V~E~gEyk~--~t~~e~~e~~~l~tyG-~~~~iTRqaiINDDL~a~~~ip~~~g~aA~~~~~~~vy~~l~~Np~ 490 (652) T protein:vir:79 414 FSALRQVREGAEYKY--VTTGDKQATIALATYG-ELFSITRQAIINDDLNMLTDVPMKLGRAAKSTIADLVYAILTSNPK 490 (652) T ss_pred CCCccccCCCCccce--eeecCccceeeeeccc-CeeeeehheeeccchhHHHHHHHHHHHHHHHHHHHHHHHHHhcCcc Confidence 234555555555533 2345555567776542 1122321000000133555677888888899999888876653221 Q ss_pred cccccccccccc-CCceeecccccccccchHHHHHHHHHHHHHHHHHHhhc-----CCCcCCcEEEeCHHHHHHHhccch Q lcl|NC_011085. 147 MPAASNENIAGL-GSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSN-----YVPSADRTFYTTPEVYSAILAALM 220 (343) Q Consensus 147 ~~~~~~~~~~g~-~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~-----~VP~~gR~~vv~P~~~~~Ll~~~~ 220 (343) ... .....+++ ..++..+.+ ..+-+ .|-.++..|.++ .+--..||++|||+......+ T Consensus 491 ~~~-DGk~LF~hA~H~Nl~~~a----a~~~~--------~l~~ar~aM~~Qk~g~~~l~i~P~~llvp~~le~~a~~--- 554 (652) T protein:vir:79 491 IST-DNVSLFDKAKHANVLESA----AMDVA--------SLDKARQLMRVQKEGERHLNIRPAFVLVPTAMESVANQ--- 554 (652) T ss_pred ccc-CCceeecccccccccccc----cCCHH--------HHHHHHHHHHHhccCCccccccccEEEecchhHHHHHH--- Confidence 110 11122222 223333211 12211 122222222222 222347899999987654332 Q ss_pred hhhhccccccchhcceeEEEece-EEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheee Q lcl|NC_011085. 221 PNAANYAALIDPERGSIRNVMGF-EVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTV 299 (343) Q Consensus 221 ~~~~~~~~~~~~~~G~V~~i~Gf-~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~ 299 (343) +++...........|.+.-+.|+ +|++.++|...+.+. .-+.... . .+.+..+ T Consensus 555 ll~s~~v~~a~~~~~~~Np~~~~~~~i~eprL~~~s~~~----------wylaa~~-~---------------~dtiev~ 608 (652) T protein:vir:79 555 VIRSSSVKGADINAGIINPVKDFATVIAEPRLDDNSQTT----------FYLAASK-G---------------SDTIEVA 608 (652) T ss_pred HhccCCCcccccccccccccccccccccccccCCCCccc----------EEEecCC-C---------------CCeEEEE Confidence 22222211122334555556665 888888886322111 0000000 0 0111111 Q ss_pred e---eeeeEEeeeeccchhhhhhhhhhhhccceecccceEEEEe Q lcl|NC_011085. 
300 K---LKDLSLERARRAEYQADQIIARYAMGHGGLRPEAAGALVF 340 (343) Q Consensus 300 ~---~~~~~~e~~~~~~~~~d~i~~~~~~G~~v~rpe~~~~i~~ 340 (343) . .+.+.+|....-..-+=.++-++-||+++++.-+++-.+. T Consensus 609 yL~G~~~P~ie~~~gf~~dG~~~kvrlD~G~~~iD~RG~~k~t~ 652 (652) T protein:vir:79 609 YLNGVDTPYIDQMEGFSVDGVTTKVRIDAGVAPVDHRGLVKCTA 652 (652) T ss_pred EecCCCCCeeeecCCCCcceEEEEEEEeccCceeeccceeeecC Confidence 1 1223444432222234445556778999998887765555 No 188 >protein:vir:104342 Length: 314 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398971;genbank:gi:81343955;genbank:GeneID:3778874 Probab=95.75 E-value=0.0013 Score=36.36 Aligned_cols=290 Identities=12% Similarity=0.026 Sum_probs=130.5 Q ss_pred CCCCCccccccc---cc-cccccccchhHHHH-HHHHHHHHHHHHH----hhhhccCccccc-ccc-ceEEEEec---cC Q lcl|NC_011085. 1 MADMKGGQQLGK---DQ-GKGQSGGDKLALFL-KVFGGEVLTAFAR----TSVTTNRHIMRS-ISS-GKSAQFPV---LG 66 (343) Q Consensus 1 ~~~~~~~~~~~t---~~-g~~~~~~d~~al~i-e~~~g~V~~~f~~----~s~~~~~~~~~~-i~~-G~tv~i~~---iG 66 (343) || |.+..+... +. -.+....|....|+ +++. .|+....+ .-..+.++..++ +-. -.++.+.. .| T Consensus 1 ~~-~~~~~~~~~~~~~~~~~~~~~~d~~~~fl~~ql~-~id~~v~e~~~~~~~~~~~i~v~~~~~~~~et~~~~~~e~~G 78 (314) T protein:vir:10 1 MA-IKFDAEQAKITTHLEQMGVEKADAAGIWAVSQLT-AALNRAYEKEYAENSVVNIFPVTNEIPGHAKYFEYPEFDGVG 78 (314) T ss_pred Cc-cchHHHHHHHHHHHHhhcccchhhhHHHHHHHHH-HHHHHHhhhhccccccceeeccccCCCCceeEEEeeeecccc Confidence 33 222222111 10 11223333322344 6655 44444433 234455666553 211 23455443 34 Q ss_pred ccee-eeecCCCcCCCccCCCccceEEEEeeee-eeeeeeccchHHHH-hchhhHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_011085. 67 RTRA-AYLQAGQSLDDKRKDIKHTEKTIVIDGL-LTADVLIYDIEDAM-NHYDVRSEYTSQIGESLAMAADGAVLAELAG 143 (343) Q Consensus 67 ~~t~-~~~~~g~~i~~~~~~~~~~~~~l~iD~~-~~~~~~Idd~D~~q-~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~ 143 (343) ..+. .++. .+++.. 
+..-.+....|-.+ .-+...+.++..++ ...++-.+-...+..++++..|+.++.-- T Consensus 79 ~a~~~~d~~--~dip~v--d~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i~f~G~-- 152 (314) T protein:vir:10 79 IAQIIADYS--DDLPLV--DAFMTEKQGKVFRFGNAFLISTDEIKAGAATGQSLSARKQALAFEAHDNLLDKLVWSGS-- 152 (314) T ss_pred ceeeeCCcc--ccccee--ecccceeEEEEEEEEeeEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEeec-- Confidence 4432 2332 223321 22233334433332 12223345555554 46777778888888899999998776221 Q ss_pred hhhccccccccccccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhc--CCCcCCcEEEeCHHHHHHHhccchh Q lcl|NC_011085. 144 LCNMPAASNENIAGLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSN--YVPSADRTFYTTPEVYSAILAALMP 221 (343) Q Consensus 144 ~a~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~--~VP~~gR~~vv~P~~~~~Ll~~~~~ 221 (343) ....+.|+-...-++.... ..+.. +.+.++++|.++..+|.++ .+ ...-.++|+|+.|..|.. . T Consensus 153 -------~~~g~~GLlN~p~v~~~~~--~~~Wa-T~~ei~~Di~~~~~~l~~~s~g~-~~p~~l~Lpp~~~~~L~~---~ 218 (314) T protein:vir:10 153 -------APHGIVSVFDQPNINNVVA--TPNWS-VPQNAIDDVTAMIDAVESSTQGL-HHVTDILLPASARRVMQG---L 218 (314) T ss_pred -------ccccceeEeecCCCccccC--CCCcc-cHHHHHHHHHHHHHHHHHhcCcc-ccceeEEecHHHHHhhcc---c Confidence 1111222222221221111 12222 3467788888888888775 21 112368899999977632 1 Q ss_pred hh-hccccccchhcceeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeee Q lcl|NC_011085. 222 NA-ANYAALIDPERGSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVK 300 (343) Q Consensus 222 ~~-~~~~~~~~~~~G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~ 300 (343) .. .+.+-..-+.+ +.-+++|...+.|...+.. +. ..++...-.++-+.... 
T Consensus 219 ~~~~~~tvl~~l~~----n~~~l~I~~~~el~~ag~~---------g~---------------~~~v~y~~~~~~~~~~v 270 (314) T protein:vir:10 219 VPQTNLSYGELFTR----NNPGLTIRFLQFLDNYDGA---------GG---------------KAALAFEKSPLNMSIEI 270 (314) T ss_pred ccCCCccHHHHHHH----hCCCcEEEEcccccccCCC---------cc---------------eEEEEEecCCcEEEEec Confidence 11 11111111221 1235667766665422100 00 01111111222233333 Q ss_pred eeeeEEeeeeccchhhhhhhhhhhh-ccceecccceE---EEEec Q lcl|NC_011085. 301 LKDLSLERARRAEYQADQIIARYAM-GHGGLRPEAAG---ALVFT 341 (343) Q Consensus 301 ~~~~~~e~~~~~~~~~d~i~~~~~~-G~~v~rpe~~~---~i~~~ 341 (343) .++++.-. ..++.....+....+. |..+.||.+++ -|++. T Consensus 271 p~~~~~l~-~e~~~~~~~~~~~~r~~Gv~i~~P~ai~~~dGI~~~ 314 (314) T protein:vir:10 271 PEVTNVLP-AQPKDLHFRYPVTSKATGLIVYRPLTMAVIKGITFA 314 (314) T ss_pred Cccceeec-ceecCceEEEcceeeeEEEEEECcceeEeeeeeecC Confidence 33333221 2334455666666665 79999999988 35555 No 189 >protein:vir:5942 Length: 523 # NCBI annotation: similar to major head protein # Family: family:all:364 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835728;genbank:gi:30044131 Probab=95.30 E-value=0.0023 Score=35.00 Aligned_cols=308 Identities=11% Similarity=-0.019 Sum_probs=132.7 Q ss_pred CCCC-----------------CccccccccccccccccchhHHHHHHHHHHH---HHHHHHhhhhccCccccccccce-- Q lcl|NC_011085. 1 MADM-----------------KGGQQLGKDQGKGQSGGDKLALFLKVFGGEV---LTAFARTSVTTNRHIMRSISSGK-- 58 (343) Q Consensus 1 ~~~~-----------------~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V---~~~f~~~s~~~~~~~~~~i~~G~-- 58 (343) |+.. +......+++.++. .+..-+..++.|.+.+ -++|........-.......+|. 
T Consensus 162 ~s~si~k~~vTa~s~agta~~~li~A~~~q~itg~-tga~fa~s~~~an~astAss~Al~gEA~t~~sTd~at~~~Gtt~ 240 (523) T protein:vir:59 162 SSGAVYYVDVPVASLPGVADVNTVRFWQYDDASGD-PENTVAYPLPRYNRIVGAVGSALYARLFFVTGSDFATVAGGTPS 240 (523) T ss_pred cccceeeeecccccccccccccccccccccccccc-ccccccchhhccccccccccccccccccccccccccccCCCccc Confidence 1111 00000011111110 0111111112221111 11111110000000000000000 Q ss_pred -----EEEEeccCcceeeeec-CCCcCCCccCCCccceEEEEeeeeeeee--------eeccchHHHHh---chhhHHHH Q lcl|NC_011085. 59 -----SAQFPVLGRTRAAYLQ-AGQSLDDKRKDIKHTEKTIVIDGLLTAD--------VLIYDIEDAMN---HYDVRSEY 121 (343) Q Consensus 59 -----tv~i~~iG~~t~~~~~-~g~~i~~~~~~~~~~~~~l~iD~~~~~~--------~~Idd~D~~q~---~~d~~~~~ 121 (343) .+.-...|..+..--+ .+....+ .....-.+.-++||+...-+ ..+.-..+.++ -.|.-.|+ T Consensus 241 t~~~~~lyt~~~g~~t~~~~~~~~~~~~~-~~~~~~~eM~FsIeK~tVtAkSRaLKAeYT~ELAQDLKAiH~GLDAE~EL 319 (523) T protein:vir:59 241 TQDLDLVYYIDARNDFEDQSTDPDYPDPG-FQSLDIPEINLELRSRPVATKTRKLRAAWTPEAMQDLAAYHKGVDLENEI 319 (523) T ss_pred ccccccccccccccchhhccccccccccc-cccccccceeeEEEeEEEeeecccccccccHHHHHHHHHHhcCCChhHHH Confidence 0000000110000000 0000000 11223456677787764432 44554555666 38888999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhhhccccccccccccCCceeecccccccccch--------HHHHHHHHHHHHH-HHHH Q lcl|NC_011085. 122 TSQIGESLAMAADGAVLAELAGLCNMPAASNENIAGLGSASILEVGAKGDLTSP--------VELGKAVIAQLTI-ARAK 192 (343) Q Consensus 122 ~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~--------~~~~~~i~~~l~~-a~~~ 192 (343) +.=.+..+...+.+-||+.+...+..-. ..+.....+.+....++.... .+.++.++-.|.+ +.+. 
T Consensus 320 anILStEImlEINR~ii~~~~~~a~~~~-----~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~n~i 394 (523) T protein:vir:59 320 VTLMSQYIAREIDLEILSTIMAHARRTD-----NYGFWSEVVGEYYDETSGNFVAGNFYGSKQEWLATLMIELNKVSNRI 394 (523) T ss_pred HHHHHHHHHHHhhHHHHHhHhhhheeee-----eccccccceeeecccccchhhhhhhhhhhHHHHHHHHHHHHHHHHHH Confidence 9999999999999999998875543211 122222222222222221111 1111222211111 1111 Q ss_pred HhhcCCCcCCcEEEeCHHHHHHHhccchhhhhccccccchhcc--eeEEE-eceEEEEeccccccccccccccccccccc Q lcl|NC_011085. 193 LTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAALIDPERG--SIRNV-MGFEVVEVPHLTAGGAGDDREDETTNQKH 269 (343) Q Consensus 193 Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G--~V~~i-~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~ 269 (343) ..+-. --.+-|+|++|++.+.|-..+-+...... ....+| .+|.+ .|++||.-++.|..=..-+ T Consensus 395 ~~~t~-~~~~~~~~~s~~v~~~l~~~~~~~~~~~~--~~~~~~~~~~g~l~~~~~vy~d~~~~~dy~~~g---------- 461 (523) T protein:vir:59 395 QQKTA-VAGANFLVTSPQVAALLESMPGFTPGNDN--RDGGTGIFYVGMVQGRYRLYKNIYQNQPVIIMG---------- 461 (523) T ss_pred HHhcc-cccccEEEEchhHHHHHHhccccccCCcc--ccccccceeEEEecCceEEEecCCCCcceEEEE---------- Confidence 11111 11456999999999998776655432221 111122 24554 4579998887664221111 Q ss_pred cccccccccccccc-cceEeEeechhhheeeeeeeeEEeeeeccchhhhhhhhhhhhccceecccceEEEEecC--C Q lcl|NC_011085. 270 AFPKTAEGDTKVAL-DNVVGLFQHRSAVGTVKLKDLSLERARRAEYQADQIIARYAMGHGGLRPEAAGALVFTA--G 343 (343) Q Consensus 270 ~~~~~~~~~~~~~~-~~~~~l~~~~~Av~~~~~~~~~~e~~~~~~~~~d~i~~~~~~G~~v~rpe~~~~i~~~~--g 343 (343) |+++. ..-.+++|.|-- .... 
+....||+.|--.|-.+.+||..|.+|...+.|.++- - T Consensus 462 ---------~k~~~~~~~~~~~y~Py~----~l~~--~~~~~dp~s~qp~~~~~tRY~l~v~nP~~~~~~~~~~~~~ 523 (523) T protein:vir:59 462 ---------NQDLNTPWQTGAVYAPYV----PLLF--TPTIVDPVNFSYRRGLMTRYALEVVRPEFYGLLYVKLLQP 523 (523) T ss_pred ---------ecccCCcccccceecccc----hhhc--ccccccCCcccceeeeeeehhheecchhHhhhhhhhhcCC Confidence 11211 111468888842 1111 2333599999999999999999999998877665432 2 No 190 >protein:vir:94528 Length: 286 # NCBI annotation: major head protein # Family: family:all:3269 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223889;genbank:gi:62327101;genbank:GeneID:5075544 Probab=95.10 E-value=0.0028 Score=34.61 Aligned_cols=268 Identities=16% Similarity=0.087 Sum_probs=136.5 Q ss_pred CCCCCccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCcc----ccccccceEEEEeccCc--ceeeeec Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHI----MRSISSGKSAQFPVLGR--TRAAYLQ 74 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~----~~~i~~G~tv~i~~iG~--~t~~~~~ 74 (343) |+.. -|++..| .|-|+|.+-+.+-|++++.|++..- .+-+.+.++.---.... +-++.|. T Consensus 1 m~t~--N~n~avr------------~Y~Kqf~glL~~vf~~qa~F~~~fgglQalDGV~~N~tafsvKt~D~pVVig~Y~ 66 (286) T protein:vir:94 1 MATT--NNDLPVR------------VYSKEFLQLLSTVYQAQSVFTPTFGALQALDGVPNNATAFSVKTNDMAVVVGEYS 66 (286) T ss_pred CCCC--cccccee------------ehhHHHHHHHHHHHhhHHHhhhhhcchhhhhCCCccceEEEEeecCcceEEeccc Confidence 4432 1222233 4889999999999999999875542 22333333322111111 1234454 Q ss_pred CCCcCC---CccCCCccceE--EEEeeee-eeee-eec-cchHHHHhchh---hHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_011085. 75 AGQSLD---DKRKDIKHTEK--TIVIDGL-LTAD-VLI-YDIEDAMNHYD---VRSEYTSQIGESLAMAADGAVLAELAG 143 (343) Q Consensus 75 ~g~~i~---~~~~~~~~~~~--~l~iD~~-~~~~-~~I-dd~D~~q~~~d---~~~~~~~~~~~aLa~~~D~~i~~~~~~ 143 (343) .+...- ++.......++ .+..|+. .|.. ..| .-+|....+-| ...+..+.++.|-.+.+|..+=..|.. 
T Consensus 67 TdeNv~FGtgTg~SsRFG~rkEi~y~dtdV~Y~~~~~iHEGiD~~TVNnd~~aaVAdRL~lQA~Akt~~~n~~~Gk~ls~ 146 (286) T protein:vir:94 67 TDANTAFGTGTSNSSRFGEMKEVIYADTDVPYTAGWAIHEGLDQMTVNNDLDAAVADRLNLQAQAKTRLFNVAMGEALAT 146 (286) T ss_pred CCCccccccCCccccccCceeeEEeecccccccccchhhhccccccccCChhHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 433221 00000011111 1222221 1111 111 12333333333 344566778888889998766443322 Q ss_pred hhhccccccccccccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhh Q lcl|NC_011085. 144 LCNMPAASNENIAGLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNA 223 (343) Q Consensus 144 ~a~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~ 223 (343) .+.. +. .-+.+..++..+.+.....+|-..-| +.|+|++|.+|..++..+. T Consensus 147 ~A~~----------------------t~------~~D~V~~LF~~as~~yvn~ev~~~~~-ayV~~evYnaiiD~~l~Ts 197 (286) T protein:vir:94 147 AGTD----------------------LG------AVDDVNALFESAVEKYTDLEVIAPVR-AYVTASVYNAIIDLANVTT 197 (286) T ss_pred hhhh----------------------hh------hhhhHHHHHHHHHHHhhhhheeeeeE-EEEchhHHHHHhccccccc Confidence 2210 00 01234455556777777777743334 8999999999998875554 Q ss_pred hccccccchhcceeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeee Q lcl|NC_011085. 224 ANYAALIDPERGSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKD 303 (343) Q Consensus 224 ~~~~~~~~~~~G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~ 303 (343) + -+++..+-+--|.++-||.|-+.|.-...+. .++|+++-|+.+..-. T Consensus 198 a-K~SsaNiDengi~~FkGf~i~e~P~~~~~g~-------------------------------~aifs~dnig~aftGI 245 (286) T protein:vir:94 198 A-KNSAVNIDTNGMLSFRGIAITKVPTQYMGGK-------------------------------AVIFAPDNVARVFTGI 245 (286) T ss_pred c-ccceeeeccCCcceecceEEeecchhhccCc-------------------------------eEEEccccceeeeccc Confidence 3 3344555555577999999998663221110 1234444444333211 Q ss_pred eEEeeeeccchhhhhhhhhhhhccceecccceEEEEec-CC Q lcl|NC_011085. 
304 LSLERARRAEYQADQIIARYAMGHGGLRPEAAGALVFT-AG 343 (343) Q Consensus 304 ~~~e~~~~~~~~~d~i~~~~~~G~~v~rpe~~~~i~~~-~g 343 (343) -.......+++-+-.+.|-=-||--++.-...++++.+ .| T Consensus 246 n~aR~IesEdF~GValQgAGK~G~~I~edNk~Ai~~~~~k~ 286 (286) T protein:vir:94 246 NIARTIQAIDFAGVELQGAGKYGTFILDDNKKAIFTATPKA 286 (286) T ss_pred eeeeeeeccccCceeeeccccccccccccCceeEEEeecCC Confidence 12222334455566677777778777777776666543 34 No 191 >protein:vir:4074 Length: 480 # NCBI annotation: major capsid (head) protein # Family: family:all:11745 # MgeID: mge:85 # MgeName: c2 # Cross-refs: genbank:acc:NP_043553;genbank:gi:9628687;genbank:GeneID:1261180 Probab=94.99 E-value=0.003 Score=34.40 Aligned_cols=274 Identities=11% Similarity=0.058 Sum_probs=104.3 Q ss_pred CCCCCccccccccccccccccchhHHHHHHHHHHHHHHH----HH----------hhhhccCcccccccc--------ce Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAF----AR----------TSVTTNRHIMRSISS--------GK 58 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f----~~----------~s~~~~~~~~~~i~~--------G~ 58 (343) |-+....+.. ... .+ .+..+ +-+.+.+.-...| +. .++...+.+...+.. .. T Consensus 171 ~~~~~~~~~~-~~~-~~---~e~r~-~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 244 (480) T protein:vir:40 171 REASIPSEKP-EDA-ER---KFMRE-LGSKMAEMPEQGFLREFANGADLNVVNSLGSITSKYARKSGIYDGAMKARFQGL 244 (480) T ss_pred hhhhccccch-hhh-hh---HHHHH-HHHHhccchhhhhhhhhhhhccccccccccccccchhhheeechhhhhhhhhcc Confidence 1111111100 000 00 00000 0000000000000 00 011111111111100 00 Q ss_pred EEEEeccCcc---eee-eecCCCcCCCccCCCccceEEEEeeee--eeee---eeccchHHHHhchhhHHHHHHHHHHHH Q lcl|NC_011085. 59 SAQFPVLGRT---RAA-YLQAGQSLDDKRKDIKHTEKTIVIDGL--LTAD---VLIYDIEDAMNHYDVRSEYTSQIGESL 129 (343) Q Consensus 59 tv~i~~iG~~---t~~-~~~~g~~i~~~~~~~~~~~~~l~iD~~--~~~~---~~Idd~D~~q~~~d~~~~~~~~~~~aL 129 (343) ++. ..|.. .+. ....+.... . ...++.++. ++. ++.. 
.....+|+ ..++.+.+..+.++.| T Consensus 245 ~~~--~~g~~~~~~~~e~~~~~~~~~--~--~~~~~~~~~-~~~v~~l~~~~k~t~~lLDD---a~~l~~~i~~~l~~~~ 314 (480) T protein:vir:40 245 TLA--EDGVDDTFISGTFKAGTDKNK--S--QTATKRSLR-PQMAEAYLQMDKATVRGVND---SGALSEYVMSEMVNRV 314 (480) T ss_pred eee--eccccceeeeeeeeccccccc--c--cccccchhh-HHHHHHHHHhHHHHHHHhhh---hHHHHHHHHHHHHHHH Confidence 111 11111 011 111111110 0 011111110 100 0100 01111121 2358888999999999 Q ss_pred HHHHHHHHHHHHHhhhhccccccccccccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCc-EEEeC Q lcl|NC_011085. 130 AMAADGAVLAELAGLCNMPAASNENIAGLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADR-TFYTT 208 (343) Q Consensus 130 a~~~D~~i~~~~~~~a~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR-~~vv~ 208 (343) +++.++.++.--. .. ..++ .+ ++....+ .+. ...+..+++.|..+..+-..+ +. .+|++ T Consensus 315 ~~~ee~a~l~G~g--~g--------~~~~-~g--~~~~~~~-~~~-~~~~~d~id~L~~al~~~y~~-----~a~~~vmn 374 (480) T protein:vir:40 315 IQKVEYNMILGSV--DG--------SNGF-YG--LKTATDG-WTK-QIEYTDLFEGITDAVAECSIS-----DAITIVMS 374 (480) T ss_pred HHHHHHHhhccCC--CC--------cccc-cc--ceeeccc-ccc-cchhHHHHHHHHHhhhHHhhC-----CCCEEEEC Confidence 9999987762200 00 0001 11 1111111 111 112233444444332222222 23 57899 Q ss_pred HHHHHHHhccchhhhhccccccchhcceeEEEeceEEEEec-cccccccccccccccccccccccccccccccccccceE Q lcl|NC_011085. 209 PEVYSAILAALMPNAANYAALIDPERGSIRNVMGFEVVEVP-HLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVV 287 (343) Q Consensus 209 P~~~~~Ll~~~~~~~~~~~~~~~~~~G~V~~i~Gf~V~~sn-~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 287 (343) |..+..|.+-..- +..|.=...+..|...+++|++|++++ .+|..... . +++.. . 
T Consensus 375 ~~t~~~I~klKD~-~G~Yi~q~~~~~~~~~~llG~pvv~~~~~~~~~~~~----------------~------~~~~~-~ 430 (480) T protein:vir:40 375 PQTFAELRKAKGT-DGHSRFNELATKEQIAQSFGAVNLETRVWMPKDEVA----------------V------YNHDE-Y 430 (480) T ss_pred HHHHHHHHHhhcC-CCCeeccCcccccCcceecccceeeeeccccCCcce----------------e------eeCCc-c Confidence 9999988654322 234654556778889999999988764 33321100 0 00011 1 Q ss_pred eEeechhhheeeeeeeeEEeeeeccchhhhhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 288 GLFQHRSAVGTVKLKDLSLERARRAEYQADQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 288 ~l~~~~~Av~~~~~~~~~~e~~~~~~~~~d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) .+++.++ .+....++-++-...+....+.|..+.+|+++..++...+ T Consensus 431 ~~~~d~~---------~~~~~~~~~~~~~~~~~~e~~v~g~~~~~~~~~~~~~~~~ 477 (480) T protein:vir:40 431 VLIGDLN---------VENYNDFDLRYNVEQWLSETLVGGSIRGKNRSAYLKKKGS 477 (480) T ss_pred EEEEecc---------cceecccccccchhhhhhhhhhceeeEccccEEEEEeccC Confidence 1233331 1111112223444566777788999999999999998888 No 192 >protein:vir:103181 Length: 457 # NCBI annotation: gp135 # Family: family:all:364 # MgeID: mge:1583 # MgeName: Syn9 # Cross-refs: genbank:acc:YP_717802;genbank:gi:113200639;genbank:GeneID:4239190 Probab=94.17 E-value=0.0052 Score=33.12 Aligned_cols=305 Identities=15% Similarity=0.066 Sum_probs=142.3 Q ss_pred CCCCCccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCc---cccccccceEE------------EEecc Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRH---IMRSISSGKSA------------QFPVL 65 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~---~~~~i~~G~tv------------~i~~i 65 (343) |-. ++|.=-.-|.-.++..+...+-.-|.|-.|.++.|.-..-..... ......+.... 
....- T Consensus 97 mTg-PTGLIFAmRsrY~~q~~~~~a~~~EAl~nEadt~fSg~~~~~~~~~~~~~~~~~gt~~~~~~~~~~~~~~~~~~~~ 175 (457) T protein:vir:10 97 MTG-PTGLIFAMRTNYGAERNPAAAGYDEAFFNEPNAGFSGGPGAYDPGATGVTNDAEGTNPALLNDSPAGTYEQADDAT 175 (457) T ss_pred CCC-cceeeeeeeeeecCccccccccccceeeeccCcccCcccccccccccccccccccccccccCcccccccccccccc Confidence 111 011100111111111110000112233333444443211100000 00000011000 00111 Q ss_pred CcceeeeecCCCcCCCccCCCccceEEEEeeeeeeee--------eeccchHHHHh-c-hhhHHHHHHHHHHHHHHHHHH Q lcl|NC_011085. 66 GRTRAAYLQAGQSLDDKRKDIKHTEKTIVIDGLLTAD--------VLIYDIEDAMN-H-YDVRSEYTSQIGESLAMAADG 135 (343) Q Consensus 66 G~~t~~~~~~g~~i~~~~~~~~~~~~~l~iD~~~~~~--------~~Idd~D~~q~-~-~d~~~~~~~~~~~aLa~~~D~ 135 (343) |..++. ++.+..........+.-++||+...-+ ..+.-..+.++ | .|.-.|++.=.+..+...+.+ T Consensus 176 gmsTA~----aE~lgd~~~n~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAiHGLDAEtELaNILStEImlEINR 251 (457) T protein:vir:10 176 GMSTAT----VEALDDSTANTAFREMGFSIEKVTVTARARALKAEYSIEMAQDLKAIHGLDAEQELANILSTEILAEINR 251 (457) T ss_pred chhhhh----hhccCCCCCccchhhheeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhH Confidence 111110 111111111122355667777664332 44555555566 3 888889999999999999999 Q ss_pred HHHHHHHhhhhccccccccccccCCceeecccccccccchHHHHHHH-HHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHH Q lcl|NC_011085. 
136 AVLAELAGLCNMPAASNENIAGLGSASILEVGAKGDLTSPVELGKAV-IAQLTIARAKLTSNYVPSADRTFYTTPEVYSA 214 (343) Q Consensus 136 ~i~~~~~~~a~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i-~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~ 214 (343) -|++.+...+..- ...+.....+.+.....+.....+.++.+ +....+|.....+--- -.+.|+|.+|++.++ T Consensus 252 eii~~l~~~a~~~-----~~~~~~~~gv~dl~~~~~g~~~~e~~k~L~~~i~~ean~i~~~T~r-g~gn~~i~S~~Va~~ 325 (457) T protein:vir:10 252 EVVRTIYTNAVAG-----AQNNTATAGVFDLDVDSNGRWSVEKFKGLLFQIERDANAIGHQTRR-GKGNILICSADVVSA 325 (457) T ss_pred HHHHhHhhhheee-----eccccccceeeeeeccccchhhHHHHHHHHHHHHHHHHHHHHhhcc-ccceEEEEchhHHHH Confidence 9999887555322 12222333344443333322233334444 3333444444333322 357899999999999 Q ss_pred Hhccch--hhhhc--ccc---ccchhcceeEEE-eceEEEEe----cccccccccccccccccccccccccccccccccc Q lcl|NC_011085. 215 ILAALM--PNAAN--YAA---LIDPERGSIRNV-MGFEVVEV----PHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVA 282 (343) Q Consensus 215 Ll~~~~--~~~~~--~~~---~~~~~~G~V~~i-~Gf~V~~s----n~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 282 (343) |-...- +..+. ..+ .++.....+|.+ .|++||.- +|-|+-=.. ..|+++ T Consensus 326 L~~sg~l~~~p~~~~~~~~~~~d~~~~~~~G~l~~r~~vy~D~Ya~~ns~~dy~~-------------------vG~KG~ 386 (457) T protein:vir:10 326 LGMAGVLDYTPALNGNNGLAGVDDTSSTLVGTLNGRIKVYVDPYSANVADKHFYV-------------------AGYKGT 386 (457) T ss_pred HhhcccccccchhhccccccccccccceeEEEecCCeEEEEecccccCCccceEE-------------------EEEeCC Confidence 877543 22111 111 123445567776 46888886 343321111 123333 Q ss_pred ccceEeEeechhhheeeeeeeeEEeeeeccchhhhhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 283 LDNVVGLFQHRSAVGTVKLKDLSLERARRAEYQADQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 283 ~~~~~~l~~~~~Av~~~~~~~~~~e~~~~~~~~~d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) .....++||.|-- .+.++. 
.-||+.|--.|-.+.+||- .++|.....=--+++ T Consensus 387 ~~~~~glfy~PYv----~l~~~~---~~dp~sfqP~~g~~tRY~l-~~NP~~~~~~~~~~~ 439 (457) T protein:vir:10 387 SPYDAGLFYCPYV----PLQQVR---AINPDTFQPKIGFKTRYGM-VSNPFAGGLTQGSGA 439 (457) T ss_pred cceecceeecccc----cccccC---ccCCccccceeeeeeeeee-eeccccccccccccc Confidence 4444678988842 233222 2399999999999999999 888986643322222 No 193 >protein:vir:97255 Length: 310 # NCBI annotation: hypothetical protein ORF017 # Family: family:all:1120 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294525;genbank:gi:149408246;genbank:GeneID:5237120 Probab=93.96 E-value=0.0058 Score=32.84 Aligned_cols=291 Identities=10% Similarity=0.035 Sum_probs=132.2 Q ss_pred CCCCCccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEeccC---cceeeeec-CC Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVLG---RTRAAYLQ-AG 76 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG---~~t~~~~~-~g 76 (343) |... |..-+.-.+.|.+ ...|.+.|.+.|.+..+.+-.++. |++.+.++.- ........ +- T Consensus 1 mpal-------tLaea~k~~~d~l-------~~~ViE~~~~~s~lL~~LpF~~ve-g~~~~ynR~~~~~~~~~~~v~~~~ 65 (310) T protein:vir:97 1 MASV-------TLAESAKLAQDEL-------VAGVIENIITVNRMFDVLPFDSIE-GNSLAYNRENVLGDVIMAGVGTTF 65 (310) T ss_pred Cccc-------chHHHhhcCcchH-------HHHHHHHHhccchHHHhCCccccc-CCcceeeEeeccCCcccccccccc Confidence 4322 2222333333322 456778888777766666666654 5567666552 22221110 00 Q ss_pred CcCCCccCCCccceEEEEeeeeeeeeeeccch-HHHH-h-chhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccc Q lcl|NC_011085. 77 QSLDDKRKDIKHTEKTIVIDGLLTADVLIYDI-EDAM-N-HYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNE 153 (343) Q Consensus 77 ~~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~-D~~q-~-~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~ 153 (343) ........+.+.++++..+--. .-.+.||.. .+.. . 
-+|.+.+..+...++|++++...+|.- -....+-.+ T Consensus 66 ~~~g~~~~~~t~~~~~~~L~i~-~g~~~Vd~~i~dl~~~~~~dq~~~Ql~~~iea~~~~~e~~lING----D~a~n~F~G 140 (310) T protein:vir:97 66 SGAGAGKAAATFTKVNSNLTTI-MGDAEVNGLIQATRSGDGNDQTAVQIASKAKSAGRKYQDQLING----NGAGNEFAG 140 (310) T ss_pred cCCCccccccccceeeeeeeee-eehhhhhhHHHhhhcCChHHHHHHHHHHHHHHHHHHHHHHhhcc----ccCCCcccc Confidence 0000001112334444332111 122333321 1211 2 356777778888999999988766531 000001101 Q ss_pred cccccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhc-CCCcCCcEEEeCHHHHHHHhccchhhhh--cccccc Q lcl|NC_011085. 154 NIAGLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSN-YVPSADRTFYTTPEVYSAILAALMPNAA--NYAALI 230 (343) Q Consensus 154 ~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~-~VP~~gR~~vv~P~~~~~Ll~~~~~~~~--~~~~~~ 230 (343) ...-...++.+..++.+...+|.. +|.| |+.. +-..+..+++.+|.++..+..--|--.. -|.-.. T Consensus 141 L~~~~~~~q~i~~~~~gg~~t~d~-----LDeL------l~~v~~~~g~p~~~l~~~~~~r~i~A~~R~~~~~g~~~~~~ 209 (310) T protein:vir:97 141 LIQLCASGQKATTGATGSAISFAI-----LDEL------MDLVVDKDGQVDYLTMHARTLRSYKALLRALGGASINEVVE 209 (310) T ss_pred hhhcCCccceeecCCCCCCCCHHH-----HHHH------HHHHhcCCCCCCEEEecHHHHHHHHHHHHHhcCCCCCCccc Confidence 111123345555444444555532 2222 2221 1122456999999876666544333221 222223 Q ss_pred chhcceeEEEeceEEEEeccccccccccccccccccccccccccccccccccc------cceEeEeechhhheeeeeeee Q lcl|NC_011085. 231 DPERGSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVAL------DNVVGLFQHRSAVGTVKLKDL 304 (343) Q Consensus 231 ~~~~G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~l~~~~~Av~~~~~~~~ 304 (343) ..---.|-.+.|++|+.++.+|.........+ .... |...| ...+|+...... . 
+ T Consensus 210 ~~~G~~v~~~~GiPi~~~d~ip~~~~~~~~~g----tTsI--------ya~r~Ge~~~~~Gv~Gl~~~~~~-----g--l 270 (310) T protein:vir:97 210 LPSGAEVPAYSGTPIFRNDYIPTNQTKGGTTG----CTTI--------FAGTLDDGSRTHGIAGLTATQAA-----G--I 270 (310) T ss_pred cCCCCEEeeeCCeEEEEeCccCCCccccccCC----ceeE--------EEEeeCccccccceeccccCCcc-----c--e Confidence 33333578999999999999997532211100 0000 11111 112232211111 1 2 Q ss_pred EEeeee---ccchhhhhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 305 SLERAR---RAEYQADQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 305 ~~e~~~---~~~~~~d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) .++..- ++--+.+.| .+-+|..++.|+++++|+---- T Consensus 271 sVr~~G~~~~~~v~~~~V--~~Y~~~av~~~~A~a~L~~V~~ 310 (310) T protein:vir:97 271 QVVDVGESEDSDEHIWRV--KWYCGLALFSEKGLACADGITN 310 (310) T ss_pred eEEeCCcccCCcceeEEE--EEeeeEEEecccceeeeccccC Confidence 222211 222233333 2237999999999999874333 No 194 >protein:vir:79642 Length: 329 # NCBI annotation: HsbB # Family: family:all:463 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285525;genbank:gi:148734508;genbank:GeneID:5220000 Probab=93.87 E-value=0.0061 Score=32.72 Aligned_cols=298 Identities=9% Similarity=0.009 Sum_probs=127.5 Q ss_pred CCCCCccccccccccccccccchhHHHHHHHH---HHHHHHHHHhhhhccCccccc-cc-cceEEEEecc---Ccceeee Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQSGGDKLALFLKVFG---GEVLTAFARTSVTTNRHIMRS-IS-SGKSAQFPVL---GRTRAAY 72 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~ie~~~---g~V~~~f~~~s~~~~~~~~~~-i~-~G~tv~i~~i---G~~t~~~ 72 (343) =++....++-.-.++.-..+.+..+++.++|. ..|.+.-...-+.+.++..++ +. 
+-.++.+..+ |..+ - T Consensus 14 ~~~~~~~a~~~~~~~~~~~~~~~~~f~~~ql~~id~~v~e~~~~~l~~~~~i~i~~~~~~~~~~~t~~~~~~~G~a~--~ 91 (329) T protein:vir:79 14 EFEANVIANHMQLRGAKNDASDMGIWTSQELHKIKAQAYEKEYPAGSALRVFPVTSELSDTDKTFEYQTFDKVGHAK--I 91 (329) T ss_pred hhhhhhHhhhcccccceeccchhhHHHHHHHHHHHHHHHhhhhcccchhhhcccccCCCCceeEEEeeeeecceeee--e Confidence 00000000000001111112222333334543 334333333344555666553 22 2335555544 4443 2 Q ss_pred ecCC-CcCCCccCCCccceEEEEeeee-eeeeeeccchHHHH-hchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccc Q lcl|NC_011085. 73 LQAG-QSLDDKRKDIKHTEKTIVIDGL-LTADVLIYDIEDAM-NHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPA 149 (343) Q Consensus 73 ~~~g-~~i~~~~~~~~~~~~~l~iD~~-~~~~~~Idd~D~~q-~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~ 149 (343) |..+ .+++.. +..-.+....|-.. .-+.+.+.++..++ ...++-.+-...+..++++..|+.++.--. T Consensus 92 ~~d~~~dip~v--d~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i~f~G~~------- 162 (329) T protein:vir:79 92 IADYTDDLSTV--DALMTSEFGKVFRLGNAFLISIDEIKAGQRTGKSLSTRKANAAQNAHDQLVNHLVFKGSK------- 162 (329) T ss_pred ecCccccccee--ecccceeEEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEeecc------- Confidence 3221 233221 12222222333222 12234456666665 477888888888999999999987763211 Q ss_pred cccccccccCCceeeccccccc--c-cchHHHHHHHHHHHHHHHHHHhhc--CCCcCCcEEEeCHHHHHHHhccchhhhh Q lcl|NC_011085. 150 ASNENIAGLGSASILEVGAKGD--L-TSPVELGKAVIAQLTIARAKLTSN--YVPSADRTFYTTPEVYSAILAALMPNAA 224 (343) Q Consensus 150 ~~~~~~~g~~~~~~~~~~~~~~--~-~~~~~~~~~i~~~l~~a~~~Ld~~--~VP~~gR~~vv~P~~~~~Ll~~~~~~~~ 224 (343) ...+.|+-...-++....++ . .-..++.+.|++.|.++..+|..+ .+ ...-.++|+|+.|..|..- ..+. 
T Consensus 163 --~~g~~GLlN~p~v~~~~~~~~~~~~w~~kt~~ei~~di~~~~~~l~~~s~g~-~~p~~L~Lpp~~~~~L~~~--~~~~ 237 (329) T protein:vir:79 163 --PHKIISVFEHPNLTTINSAGWNNAAGTGKKPETAQDELEQAIEKIETLTNGQ-HRANMILIPPSMRKVLMVR--MPET 237 (329) T ss_pred --cccceeeecCCCccccccCCCCCccccccCHHHHHHHHHHHHHHHHHhcCce-ecccEEEecHHHHHHhhcc--cCCC Confidence 11112221111111111111 1 111234677888888888888765 22 1123789999999888531 1111 Q ss_pred ccccccchhcceeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeee Q lcl|NC_011085. 225 NYAALIDPERGSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDL 304 (343) Q Consensus 225 ~~~~~~~~~~G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~ 304 (343) +..-..-+++ +...++|...+.|-..+. +..+.++.....++-+.....+++ T Consensus 238 ~~tvl~~lk~----~~~~l~I~~~~el~~ag~------------------------~g~~~~v~y~~~~~~~~~~vp~~~ 289 (329) T protein:vir:79 238 TMSYLDYFKQ----QNGGITIESISELEDIDG------------------------AGTKAALVYEKDPMNMSIEIPEAF 289 (329) T ss_pred CccHHHHHHH----hCCCcEEEEcccccccCC------------------------CCceEEEEEecCCceEEEecCcce Confidence 1111111211 112345555444421100 011122222223333333334444 Q ss_pred EEeeeeccchhhhhhhhhhhh-ccceecccceEEEE-ecCC Q lcl|NC_011085. 305 SLERARRAEYQADQIIARYAM-GHGGLRPEAAGALV-FTAG 343 (343) Q Consensus 305 ~~e~~~~~~~~~d~i~~~~~~-G~~v~rpe~~~~i~-~~~g 343 (343) +... ..++.....+....+. |.-+.||++++.+. 
+-.| T Consensus 290 ~~l~-~q~~~~~~~v~~~~r~~Gv~i~~P~ai~~~dGI~~~ 329 (329) T protein:vir:79 290 NMLT-AQPKDLHFKVPCTSKCTGLTIYRPLTLVLIKGLVVG 329 (329) T ss_pred eeee-ceecCceEEEceeeeEEEEEEECcceeeeeeeeeeC Confidence 4322 2333344556666665 78999999876542 2223 No 195 >protein:vir:95512 Length: 693 # NCBI annotation: Putative Clp protease # Family: family:all:62 # ACLAME annotation(s): go:0008236 - serine-type peptidase activity; phi:0000017 - phage prohead/capsid assembly # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293349;genbank:gi:148912770;genbank:GeneID:5228164 Probab=93.06 E-value=0.0089 Score=31.82 Aligned_cols=293 Identities=13% Similarity=0.107 Sum_probs=125.9 Q ss_pred CC-------CCCccc----cccccccccccccchhHHHHHHHHHHHHHHHHHh-hhhccCcccccc---ccceEEEEecc Q lcl|NC_011085. 1 MA-------DMKGGQ----QLGKDQGKGQSGGDKLALFLKVFGGEVLTAFART-SVTTNRHIMRSI---SSGKSAQFPVL 65 (343) Q Consensus 1 ~~-------~~~~~~----~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~-s~~~~~~~~~~i---~~G~tv~i~~i 65 (343) || +..... ++..|- -...++|==.++-..-...++..|+.. +-+..|....++ +..+.+.+-.. T Consensus 371 lAr~~L~~rg~~~~~~~~~~~~~~a-~~htTSDFp~IL~~~~nk~l~~~y~~a~~t~~~~~~~~~~~DFk~~~~~~lg~~ 449 (693) T protein:vir:95 371 LARASLVDRGIGVASLNAPQMVGLA-FTHTSSDFGLILLDVANKSVLAGWEEAEETFPLWTKSGILTDFKPARRVGLGEF 449 (693) T ss_pred HHHHHHHhcCCccCCCCHHHHHHHH-HhcCcchhHHHHHHHHHHHHHHHHHhhhhHHHHHhccCCCCcccccceeecCCC Confidence 11 111000 010110 002333322222244445566666644 445566665544 44444444333 Q ss_pred CcceeeeecCCCcCCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhh Q lcl|NC_011085. 66 GRTRAAYLQAGQSLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLC 145 (343) Q Consensus 66 G~~t~~~~~~g~~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a 145 (343) | ++.....|.++.. ..+....-++.+.++- --|.|..-.-.=-..+....+....|++-++.+++.+...+..-. 
T Consensus 450 ~--~L~~V~E~gEyk~--~t~~e~~e~~~l~tyG-~~~~iTRqaiINDDLga~~~ip~~~g~aA~~~~~~~vy~~L~~Np 524 (693) T protein:vir:95 450 S--SLRQVREGAEYKY--VTLGERGEQIILATYG-ELFSITRQAIINDDLQMLSDIPFKLGQAAKATIGDLVYAVLTGNP 524 (693) T ss_pred C--ChhhcCCCCceee--eecCCccceeehhhcC-CeeeecHHhhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHhcCc Confidence 3 3444445554432 2234444456665442 123333211000123455677888999999999999987765322 Q ss_pred hccccccccccccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcC----------CCcCCcEEEeCHHHHHHH Q lcl|NC_011085. 146 NMPAASNENIAGLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNY----------VPSADRTFYTTPEVYSAI 215 (343) Q Consensus 146 ~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~----------VP~~gR~~vv~P~~~~~L 215 (343) ..... +.++-...++.++.+. ...+ ++.|-.++..|..+. +--..+|+||||+..... T Consensus 525 ~m~DG--k~LFhadH~Nl~tga~--sals--------~~sl~~a~~am~~qk~~~~~~~g~~L~i~P~~llvP~~le~~a 592 (693) T protein:vir:95 525 AMSDG--KTLFHADHSNLLTGAA--SALS--------IDSLSKAKTQMATQKAQVEKGKGRTLNIRPGFVLTPVALEDKA 592 (693) T ss_pred cccCC--cceeeccccccccccc--cccC--------hHHHHHHHHHHHHhhcchhccCCceeecccceEEecchHHHHH Confidence 22211 1222222233322111 1111 223333333333322 112368999999876654 Q ss_pred hccchhhhhccccccchhcceeEEEece-EEEEeccccccccccccccccccccccccccccccccccccceEeEeechh Q lcl|NC_011085. 216 LAALMPNAANYAALIDPERGSIRNVMGF-EVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRS 294 (343) Q Consensus 216 l~~~~~~~~~~~~~~~~~~G~V~~i~Gf-~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~ 294 (343) .+ +++..+.-......|.+.-+.|| +|+..++|...+.+ .+- +... .+ . + T Consensus 593 ~~---l~~s~~~~~a~~~~~~~NP~~~~~~vi~~prL~~~s~~--~Wy--------l~a~-~~-----~----------d 643 (693) T protein:vir:95 593 NQ---IINSESVPGADVNSGIVNPIRAFAQVIGEPRLDDASAT--AWY--------MAAK-KG-----S----------D 643 (693) T ss_pred HH---HhccccccccccccccccchhccccccccceecCCCCC--ceE--------EecC-CC-----C----------C Confidence 43 33333322233445555556675 78888888532211 110 0000 00 0 1 Q ss_pred hheeeee---eeeEEeeeeccchhhhhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 
295 AVGTVKL---KDLSLERARRAEYQADQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 295 Av~~~~~---~~~~~e~~~~~~~~~d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) .+..+.+ +.+.+|....-..-+=.++-++-||+++++.-++. -..| T Consensus 644 tie~~yL~G~~~P~ie~~~gf~~dG~~~kvr~D~G~~~iD~Rg~~---kn~G 692 (693) T protein:vir:95 644 TIEVAYLDGVDTPYLEQQEGFTVDGVASKVRIDAGVAPLDFRGLQ---KSNG 692 (693) T ss_pred eEEEEEecCCCCCeEeecCCCCcceEEEEEEEeccCceeeccccc---cCCC Confidence 1111111 12233433222222333445566899888876543 2333 No 196 >protein:vir:4786 Length: 295 # NCBI annotation: hypothetical protein # Family: family:all:3269 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150166;swissprot:trembl:q94m45;genbank:gi:15088777;uniprot:Q94M45;genbank:GeneID:955980 Probab=92.97 E-value=0.0093 Score=31.73 Aligned_cols=256 Identities=15% Similarity=0.079 Sum_probs=119.5 Q ss_pred CCccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCcc----ccccccceEEEEeccCc--ceeeeecCCC Q lcl|NC_011085. 4 MKGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHI----MRSISSGKSAQFPVLGR--TRAAYLQAGQ 77 (343) Q Consensus 4 ~~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~----~~~i~~G~tv~i~~iG~--~t~~~~~~g~ 77 (343) |+.-|+...| .|-|+|.|-+.+-|++++.|++..- .+-+.+.++.---.... +-++.|..++ T Consensus 1 mp~N~n~avr------------~Y~Kqf~glL~~vf~~qa~F~~~FGglQalDGV~~N~tafsvKt~D~pVVig~Y~Tde 68 (295) T protein:vir:47 1 MPSNQNNAVR------------RYEKQYAGILETVFGVRAAFSNALAPIQILDGVQENSKAFSVKTNNTPVVIGEYKTGE 68 (295) T ss_pred CCCCCCccch------------hhhHHHHHHHHHHHhHHHHHhhhhcchhhhhCCCccceEEEEeecCcceEeecccCCC Confidence 4333332222 4889999999999999999875542 22333333322222211 1234566555 Q ss_pred cCC--C--ccCCCccceE--EEEeeee-eeee-eec-cchHHHHhchh---hHHHHHHHHHHHHHHHHHHHHHHHHHhhh Q lcl|NC_011085. 78 SLD--D--KRKDIKHTEK--TIVIDGL-LTAD-VLI-YDIEDAMNHYD---VRSEYTSQIGESLAMAADGAVLAELAGLC 145 (343) Q Consensus 78 ~i~--~--~~~~~~~~~~--~l~iD~~-~~~~-~~I-dd~D~~q~~~d---~~~~~~~~~~~aLa~~~D~~i~~~~~~~a 145 (343) ..- + +.......++ .+..|+. .|.. 
..| .-+|..-.+-| ...+..+.++.|-++.+|..+=..|...+ T Consensus 69 NvagFGtGTg~SsRFG~rkEi~y~dtdV~Y~~~~~iHEGiD~~TVNnd~~aaVAdRL~LQA~Akt~~~n~~~Gk~ls~~A 148 (295) T protein:vir:47 69 NDGGFGDNSGAQSRFGGVTEVKYENTDVNYDYTLTIHEGLDRYTVNNDLNAAVADRLKLQSEAQTRTVNKRIGKYLSDTA 148 (295) T ss_pred cccccccCCccccccCceeeEEeecccccccccchhhhccccccccCChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh Confidence 442 1 1111111111 1222221 1211 111 22333333333 34456677888889999976654443332 Q ss_pred hccccccccccccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhc Q lcl|NC_011085. 146 NMPAASNENIAGLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAAN 225 (343) Q Consensus 146 ~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~ 225 (343) ..+ ..+ ++.+ .+.+..++.++.+.....+|-..-| +.|+|++|.+|..++..+.+ T Consensus 149 ~~t----e~~--------------td~t-----~d~V~~LF~~as~~yvn~ev~~~~~-AyV~~evYnaiiD~~l~Tsa- 203 (295) T protein:vir:47 149 TKT----EAL--------------ADFT-----DDKVKALFNKLSAFYTNNEVTAPIT-VYLRSEFYNAIVDMASVTSA- 203 (295) T ss_pred hhh----hhh--------------hccc-----chhHHHHHHHHHHHhhhhheeeeeE-EEEchhHHHHHhcccccccc- Confidence 111 000 1111 1234455667788888888843334 99999999999988755543 Q ss_pred cccccchhcceeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeee----- Q lcl|NC_011085. 226 YAALIDPERGSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVK----- 300 (343) Q Consensus 226 ~~~~~~~~~G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~----- 300 (343) -+++..+-+--|.++-||.|-+.|.-.... +.. .+|+++-|+.+. T Consensus 204 K~SsaNiDengi~~FkGf~i~e~P~~~~q~-G~~-----------------------------aifs~dnig~aftGIn~ 253 (295) T protein:vir:47 204 KGATISLDENGLPKYKGFTLEETPAQYFET-GVI-----------------------------AIFSPNGIIIPFVGIST 253 (295) T ss_pred ccceeeeccCCcceecceEEEeccHhhccC-CcE-----------------------------EEEccccceeeccccee Confidence 334455555557789999998865433210 000 122222222211 Q ss_pred eeeeEEeeeeccchhhhhhhhhhh------hc----------cceec Q lcl|NC_011085. 
301 LKDLSLERARRAEYQADQIIARYA------MG----------HGGLR 331 (343) Q Consensus 301 ~~~~~~e~~~~~~~~~d~i~~~~~------~G----------~~v~r 331 (343) ++-++.|.+ -+-.+.-+++ |- --..| T Consensus 254 aR~IesEdF-----~GValQ~~~~~~~~~~~~~~~~~~~~~~~~~~~ 295 (295) T protein:vir:47 254 ARVIEAENF-----DGVNCKLLLRVVLTLLMTIRKQFTKLQELLYRR 295 (295) T ss_pred eeeeecccc-----cchHHHHHHHHHHHHHHHHHHHHHHHHHHhhcC Confidence 111122221 1111111111 00 00011 No 197 >protein:vir:8324 Length: 410 # NCBI annotation: gp41 # Family: family:all:30827 # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817892;genbank:gi:29566325;genbank:GeneID:1259520 Probab=89.26 E-value=0.027 Score=29.15 Aligned_cols=275 Identities=13% Similarity=0.076 Sum_probs=111.6 Q ss_pred CCCCCcccccccccc-------------cc---------------------ccccchhHHHHHHHHHHHHHHHHHhhhhc Q lcl|NC_011085. 1 MADMKGGQQLGKDQG-------------KG---------------------QSGGDKLALFLKVFGGEVLTAFARTSVTT 46 (343) Q Consensus 1 ~~~~~~~~~~~t~~g-------------~~---------------------~~~~d~~al~ie~~~g~V~~~f~~~s~~~ 46 (343) |.+..... +..|+ |+ ..++|........|-+.+.+-....-... T Consensus 85 ~~~~~r~~--p~~~~veyRSaGE~lkal~~~~~Gd~~A~~~~e~~r~a~~~~~Tgd~~~~i~~~~v~d~i~li~q~r~i~ 162 (410) T protein:vir:83 85 AISAMRGS--PVGTEVEYRSAGEYMLDMWNSAQGNASAADRLEVYARAADHQKTGDLQGVIPDPIVGPVIDFIDSARPLV 162 (410) T ss_pred hhccCcCC--CCCCCcccccHHHHHHHHhccCCchHHHHHHHHHHHHhhccCcccccccccchhHhhhHHHHHhhccchh Confidence 11110000 01111 11 11111111112234444444444332323 Q ss_pred cCccccccccceEEEEecc-Ccceeeee-------cCCCcCCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhH Q lcl|NC_011085. 47 NRHIMRSISSGKSAQFPVL-GRTRAAYL-------QAGQSLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVR 118 (343) Q Consensus 47 ~~~~~~~i~~G~tv~i~~i-G~~t~~~~-------~~g~~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~ 118 (343) ++...=.. .|.|..-+.+ .++++..+ ..|..++. ..+.....+-.|+.+--.. .+....--.++.... 
T Consensus 163 slf~tLP~-~g~T~eY~v~t~~~tV~~q~~~~kqa~EGd~L~~--gKl~~~t~tA~ikTyGGyt-~LSRQ~IERs~v~~L 238 (410) T protein:vir:83 163 STLGTLPL-NNATFYRPIVSQRPAVGLQGVAGGASDEKTELDS--QKMVIDRLTVNAKTLGGYV-NVSRQAIDFSSPSAL 238 (410) T ss_pred hhhhhCCC-CCCeeEEeeecccccccccccccccccccccccc--cceeeeeccceeehhcCcc-cccceeeecCChhhH Confidence 33222122 2667666544 22333322 13555543 2344444555566553222 111111112333333 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhc-- Q lcl|NC_011085. 119 SEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENIAGLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSN-- 196 (343) Q Consensus 119 ~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~-- 196 (343) +-.++-.+.+-|+.....+=..|. .+. ++ ....... +++.+..++.++....+.+ T Consensus 239 ~~~lraL~~AYA~atea~vra~L~-~t~---------t~---------~~a~~~~----Tad~~~~~i~da~~~v~da~~ 295 (410) T protein:vir:83 239 DLVVNGLGQQYAIETEALVGAALA-STS---------TG---------AVGYGNA----TADNVASAIWQAAGAVYTAVK 295 (410) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH-Hhh---------hh---------hhhhhhc----cHHHHHHHHHHHHHHHhhhhc Confidence 334444444444444433322221 110 00 0011122 2445566666778888876 Q ss_pred CCCcCCcEEEeCHHHHHHHhccchhhhhc---ccc--ccchhcceeEEEeceEEEEeccccccccccccccccccccccc Q lcl|NC_011085. 197 YVPSADRTFYTTPEVYSAILAALMPNAAN---YAA--LIDPERGSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAF 271 (343) Q Consensus 197 ~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~---~~~--~~~~~~G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~ 271 (343) ++ .=+++.|+|+.+..+..--.-.+.+ ..| ...+-.|.-|.+.|++|.+.+.+|.+. T Consensus 296 ~~--~~~~i~vS~DVl~~~~~~f~~~~~~~~dt~Gfg~~~lg~gi~G~~~~ipVvm~~~a~AgT---------------- 357 (410) T protein:vir:83 296 GM--GRLVIAIAPDVLGDFGPLFAPVNPTNAHSTGFEAGRFGQGVMGSISGIPVVMSAALGSGD---------------- 357 (410) T ss_pred cc--eeeeEEechhhhhhccceeeccCCCCcccccccccccccchhhhhcccceEEecCCCcCe---------------- Confidence 43 2368999999976665432222222 222 122336766899999999999887432 Q ss_pred cccccccccccccceEeEeechhhheeeeeee--eEEeeeeccchhhhhhhhhhhhccceecccceEEEEec Q lcl|NC_011085. 
272 PKTAEGDTKVALDNVVGLFQHRSAVGTVKLKD--LSLERARRAEYQADQIIARYAMGHGGLRPEAAGALVFT 341 (343) Q Consensus 272 ~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~--~~~e~~~~~~~~~d~i~~~~~~G~~v~rpe~~~~i~~~ 341 (343) +.++.+.||-.-+..- ++... .+.....-.+.|.+ +..+.-|++++=+.-+ T Consensus 358 ----------------A~f~~~~Ai~~~eS~~gp~qL~d-~~i~nLt~~ySgY~--a~a~~~~~gliPv~g~ 410 (410) T protein:vir:83 358 ----------------AYLFSTAAIECFEQRVGTLQVVE-PSVFGLQVAYAGYF--STLVVNEDAIVPLVGS 410 (410) T ss_pred ----------------eeEeccceeeeeecCCceeEeeC-Cchhhhhhhheeee--eeccccccceeeeccC Confidence 1233444432221110 11110 01111111111222 4555666666666555 No 198 >protein:vir:10324 Length: 320 # NCBI annotation: ORF26 # Family: family:all:570 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758919;genbank:gi:27311193;genbank:GeneID:956155 Probab=84.86 E-value=0.057 Score=27.40 Aligned_cols=292 Identities=12% Similarity=0.031 Sum_probs=99.1 Q ss_pred ccccccccccccchhHHHHHHHHHHHHHHH-HHhhhhccCccccccccceEEEEeccCcceeeeecCCCcCCCccCCCcc Q lcl|NC_011085. 10 LGKDQGKGQSGGDKLALFLKVFGGEVLTAF-ARTSVTTNRHIMRSISSGKSAQFPVLGRTRAAYLQAGQSLDDKRKDIKH 88 (343) Q Consensus 10 ~~t~~g~~~~~~d~~al~ie~~~g~V~~~f-~~~s~~~~~~~~~~i~~G~tv~i~~iG~~t~~~~~~g~~i~~~~~~~~~ 88 (343) ++..|+.-+ . ++..| .+..+....+..+. +.|.---+|.+-+. .||..... -+- T Consensus 1 i~~~P~~~g---~------------~~glff~~~~v~T~~V~ie~-~~~~l~lip~v~rg-----~~g~~~~~----~~~ 55 (320) T protein:vir:10 1 MNLLPVNYG---D------------SRALFAREKKVRTRTILVEE-KNGVLTLIQSREPG-----STENVAKR----GKR 55 (320) T ss_pred CCcCCchhh---h------------hhhhccCCCCcccceEEEEE-ecCceeeeeccCCC-----CCceeecC----Ccc Confidence 344443321 1 11222 11112111122211 22222222222211 11111110 011 Q ss_pred ceEEEEeeeeeeeeeeccchHHHH-----------hchhhHHHHHHHHHHHHHHHHHHH---HHHHHHhhhhcccccccc Q lcl|NC_011085. 89 TEKTIVIDGLLTADVLIYDIEDAM-----------NHYDVRSEYTSQIGESLAMAADGA---VLAELAGLCNMPAASNEN 154 (343) Q Consensus 89 ~~~~l~iD~~~~~~~~Idd~D~~q-----------~~~d~~~~~~~~~~~aLa~~~D~~---i~~~~~~~a~~~~~~~~~ 154 (343) ..+.+.+--.+.. 
..| .-++.| +--+++.+...+ |.+.+|.- +..++.++. ........ T Consensus 56 ~~~~f~~p~~~~~-d~i-~a~eiq~~Ra~G~~~~~~~~~~v~~~l~~----lr~~~~~T~E~m~~~AL~G~-ildadGtv 128 (320) T protein:vir:10 56 KVRSFVIPHLPLE-DVI-LPDEYEGLRGFGTTALAAKSELVKERXET----MKSSHDITHEHLRMGAKKGQ-ILDADGTV 128 (320) T ss_pred eEEEEecceeccC-Ccc-CHHHHcCcccCCCchHHHHHHHHHHHHHH----HHHHHHHHHHHHHHhhhcCe-EEcCCCcE Confidence 1111211111000 011 112222 211222222222 33444321 111111110 00000000 Q ss_pred c----cccCC-ceeecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhc--cc Q lcl|NC_011085. 155 I----AGLGS-ASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAAN--YA 227 (343) Q Consensus 155 ~----~g~~~-~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~--~~ 227 (343) + ..++- ...+.....+..++ ..+.+.+.+..+...|. ..+..+-+++++|++|..|+.++.+...- +. T Consensus 129 ~~d~y~~fGi~~~~i~~~l~~a~~d---v~~~~~~~~~~i~~~l~--g~~~t~v~al~g~~f~~al~~h~~Vke~y~~~~ 203 (320) T protein:vir:10 129 LYDLYAEFGITKKTIYFGLDNKDAN---VAESCRQVLRHVEDNLR--GDVMKDVSVDVSEEFFDKFIKHASVKEVFLNHE 203 (320) T ss_pred EEechhhhCCccceeEEecCCCCcc---HHHHHHHHHHHHHHHhc--cCCCCceEEEEChHHHHHHhcCHHHHHHHHhhh Confidence 0 00110 01111111111222 23334455555555664 45666778899999999999998765431 11 Q ss_pred -cccchhcc--eeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeee Q lcl|NC_011085. 228 -ALIDPERG--SIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDL 304 (343) Q Consensus 228 -~~~~~~~G--~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~ 304 (343) +...++.. .-..+.|+.+++..--.....+.....++....|.++... .+.|....+.+-..+++.+. 
++++ T Consensus 204 ~~~~~l~~~~~~~f~~gGi~~~~Y~g~~~d~~g~~~~~I~~~~~~~~p~g~----~~~f~~~~apad~~e~vnt~-g~p~ 278 (320) T protein:vir:10 204 AAVNRLGGDTRKGFKFGGLIFNENRARHVDEEGKETRFIKAGKGHAFPTGT----TNTFFTALAPADFNETAGTL-GKRY 278 (320) T ss_pred hhhhhccccccceEEecCEEEEEcccEEEcCCCCeeEeecCCeeEEEEecC----chhheeeecccCcHhhcCCc-cccc Confidence 11112211 1136789888885432111112222223444444443211 12222222222222222221 1222 Q ss_pred EEeeeeccchhhhhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 305 SLERARRAEYQADQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 305 ~~e~~~~~~~~~d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) -...+.++.-.+..+..-..-=.-..||++++.++..+= T Consensus 279 y~k~~~~~~~~g~~l~~qS~PLpi~~rP~~lv~~~~~a~ 317 (320) T protein:vir:10 279 YAKMEPRRMGRGFDLHSQSNVLPMCCRPGVLVELDAAAQ 317 (320) T ss_pred ccccccccCCCeEEEEeeecccccccCcceEEEEEecCC Confidence 122222222222222222222345679999987776543 No 199 >protein:vir:101811 Length: 529 # NCBI annotation: gp23 # Family: family:all:364 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: genbank:acc:YP_238888;genbank:gi:66391963;genbank:GeneID:3416638 Probab=84.06 E-value=0.063 Score=27.15 Aligned_cols=314 Identities=12% Similarity=0.040 Sum_probs=128.9 Q ss_pred CCCCCccccc------------------cccccccccccchhHHHH-------HHHHHHHHHH-HHHh-hhhccCccccc Q lcl|NC_011085. 1 MADMKGGQQL------------------GKDQGKGQSGGDKLALFL-------KVFGGEVLTA-FART-SVTTNRHIMRS 53 (343) Q Consensus 1 ~~~~~~~~~~------------------~t~~g~~~~~~d~~al~i-------e~~~g~V~~~-f~~~-s~~~~~~~~~~ 53 (343) |=. .|+++. .+..+..++.+-.....+ ....+...++ |.+. 
..+.....-.+ T Consensus 127 MRs-rY~~~~~~~~~~eaf~~~~~pda~~sga~~~ga~t~~~~t~~~~~ta~~~~a~g~g~ea~f~ea~t~fs~~~~g~~ 205 (529) T protein:vir:10 127 LRS-VYGKDPLAAGAKEAFHPMYAPDAWHSSLATKGATTTTDGTPFAKLTAGQAIAEGDIVGHFFYESGTAFLQNVSGAS 205 (529) T ss_pred eee-eecCCcccccccccccccccccccccccccccccccccccccccccccccccccccceeeecccCceeeccccccc Confidence 111 000000 000000000000000000 0011111111 1111 12211111011 Q ss_pred ccc--------ceEEEEeccCcceeeeecC------CCcCC--CccCCCccceEEEEeeeeeeee--------eeccchH Q lcl|NC_011085. 54 ISS--------GKSAQFPVLGRTRAAYLQA------GQSLD--DKRKDIKHTEKTIVIDGLLTAD--------VLIYDIE 109 (343) Q Consensus 54 i~~--------G~tv~i~~iG~~t~~~~~~------g~~i~--~~~~~~~~~~~~l~iD~~~~~~--------~~Idd~D 109 (343) +.. ++......++...+..+.. ++.+. +......-.+.-+.||+...-+ ..+.-.. T Consensus 206 ~~~g~~~t~~~~~~~~~~~~a~~~~~~~~~GmsTa~aEaL~~~ggss~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQ 285 (529) T protein:vir:10 206 VTVGTNETGEALDKLINAAIGEGKLAEIAEGMATSIAELRQGFNGSNDNPWNEMSFRIDKQTVEAKSRQLKAQYSIELAQ 285 (529) T ss_pred cccCccccCcccccccccccccccccccccchhhhhhhccccCCCcccccccceeeEEEEEEEeeeccceeccccHHHHH Confidence 100 0000000111111111111 11110 0011123456677787764432 4455555 Q ss_pred HHHh-c-hhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccccCCceeecccccccc---cchHHHHHHHHH Q lcl|NC_011085. 110 DAMN-H-YDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENIAGLGSASILEVGAKGDL---TSPVELGKAVIA 184 (343) Q Consensus 110 ~~q~-~-~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~g~~~~~~~~~~~~~~~---~~~~~~~~~i~~ 184 (343) +.++ | .|.-.|++.=.+..+...+++-|++.+...+..-........|. ...+.+.....+. 
--..+.++.++- T Consensus 286 DLKAVHGLDAEtELsNILStEImlEINReii~~l~~~a~~~~~~~~~~~~~-~~Gv~d~~~~~~~~~~~~~~e~~~~L~~ 364 (529) T protein:vir:10 286 DLRAVHGMDADSELNGILANEVMLEINREVIDWINYTAQVGKSGWTKTDGS-ASGVFDFQDPIDVRGARWAGESYKALLI 364 (529) T ss_pred HHHHhcCCChHHHHHHHHHHHHHHHhhHHHHHHHhhhhhhhcccccccccc-ccceeecccCccccccchHHHHHHHHHH Confidence 5566 3 88888999999999999999999998875553211110001111 1112222221111 001122233332 Q ss_pred HHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhcc----cc--ccchhcceeEEE-eceEEEEeccccccccc Q lcl|NC_011085. 185 QLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANY----AA--LIDPERGSIRNV-MGFEVVEVPHLTAGGAG 257 (343) Q Consensus 185 ~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~----~~--~~~~~~G~V~~i-~Gf~V~~sn~lp~~~~~ 257 (343) .+-+.....-.+---..+-|+|.+|++.++|-....+..... .+ .++-.....|.+ .|++||.-++.|..=.. T Consensus 365 ~i~~~an~I~~~T~rg~~n~vi~S~~Va~~L~~~~~~~~~~~~~~~sg~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~ 444 (529) T protein:vir:10 365 QIDKEANEIARQTGRGAGNFIIASRNVVSALALIDTNISPAAQGMASGLNADTTKGVFAGILGGRYKVYIDQYARQDYFT 444 (529) T ss_pred HHHHHHHHHHHhhccccceEEEEchHHHHHHHhhcccccccccccccccccccCCceEEEEecCceEEEecCCCCcceEE Confidence 222222222111111235699999999998865432222111 00 112222345554 45899988877642211 Q ss_pred cccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEeeeeccchhhhhhhhhhhhccceeccc---- Q lcl|NC_011085. 258 DDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERARRAEYQADQIIARYAMGHGGLRPE---- 333 (343) Q Consensus 258 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~~~~~~~d~i~~~~~~G~~v~rpe---- 333 (343) ..|+++-....++||.|- +.+++ -+..||+.|--.|-.+.+||..+ .|= T Consensus 445 -------------------vG~KG~~~~~~glfy~PY----v~l~~---~~~~dp~sfqP~~g~~tRY~l~~-NP~~~~~ 497 (529) T protein:vir:10 445 -------------------MGYRGANNLDAGIYYCPY----VALTP---LRGFDPKNFQPVMGFKTRYAIGV-NPFAESR 497 (529) T ss_pred -------------------EEEeCCcccccceeeccc----ccccc---ccccCCCcccceeeeeeeeceee-cCccccc Confidence 112333334456888883 23333 34579999999999999998765 341 Q ss_pred -ceEEEEecCC Q lcl|NC_011085. 
334 -AAGALVFTAG 343 (343) Q Consensus 334 -~~~~i~~~~g 343 (343) ....-++..| T Consensus 498 ~~~~~~r~~~g 508 (529) T protein:vir:10 498 TQAPQGRITSG 508 (529) T ss_pred cccccccccCC Confidence 1112234444 No 200 >protein:vir:101039 Length: 529 # NCBI annotation: major capsid protein # Family: family:all:364 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932516;genbank:gi:37651642;genbank:GeneID:2610532 Probab=83.59 E-value=0.067 Score=27.01 Aligned_cols=305 Identities=14% Similarity=0.023 Sum_probs=129.8 Q ss_pred CCCCCcccccc-----ccc-cccccccchhHH--HHHHHHHHHHHHHHHhhhhccCcccccccc--------c------- Q lcl|NC_011085. 1 MADMKGGQQLG-----KDQ-GKGQSGGDKLAL--FLKVFGGEVLTAFARTSVTTNRHIMRSISS--------G------- 57 (343) Q Consensus 1 ~~~~~~~~~~~-----t~~-g~~~~~~d~~al--~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~--------G------- 57 (343) ++..++....+ +++ +.........+. --|-|-.|..+.|. +...-.++.. + T Consensus 151 da~~sga~~~ga~~~~~~~~~~~~t~~~~~a~~~g~ea~f~ea~t~fs------~~~~g~~~~~g~~~~~~~~~~~~~~~ 224 (529) T protein:vir:10 151 DAWHSSLATKGATTTTDGTPFAKLTAGQAIAEGDIVGHFFYESGTAFL------QNVSGASVTVGTNETGEALDKLINAA 224 (529) T ss_pred ccccccccccccccccCccccccccccccccccCcceeeeecccceec------ccccccccccCccccCcccccccccc Confidence 11111111100 000 000000000000 00111112222221 1111000000 0 Q ss_pred ---eEEEEeccCcceeeeecCCCcCC--CccCCCccceEEEEeeeeeeee--------eeccchHHHHh-c-hhhHHHHH Q lcl|NC_011085. 58 ---KSAQFPVLGRTRAAYLQAGQSLD--DKRKDIKHTEKTIVIDGLLTAD--------VLIYDIEDAMN-H-YDVRSEYT 122 (343) Q Consensus 58 ---~tv~i~~iG~~t~~~~~~g~~i~--~~~~~~~~~~~~l~iD~~~~~~--------~~Idd~D~~q~-~-~d~~~~~~ 122 (343) ..+....-|..+.. ++.+. 
+......-.+.-+.||+...-+ ..+.-..+.++ | .|.-.|++ T Consensus 225 ~a~~~~~~~~~Gm~Ta~----aEaL~~~g~ss~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELs 300 (529) T protein:vir:10 225 IGEGKLAEIAEGMATSI----AELRQGFNGSNDNPWNEMSFRIDKQTVEAKSRQLKAQYSIELAQDLRAVHGMDADSELN 300 (529) T ss_pred cccccccccccccchhh----hhccccCCCcccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHH Confidence 11111111222211 11110 0011123456677787764332 44555555566 3 88888999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhhhccccccccccccCCceeeccccccccc---chHHHHHHHHHHHHHHHHHHhhcCCC Q lcl|NC_011085. 123 SQIGESLAMAADGAVLAELAGLCNMPAASNENIAGLGSASILEVGAKGDLT---SPVELGKAVIAQLTIARAKLTSNYVP 199 (343) Q Consensus 123 ~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~g~~~~~~~~~~~~~~~~---~~~~~~~~i~~~l~~a~~~Ld~~~VP 199 (343) .=.+..+...+++-|++.+...+..-........|. ...+.+.....+.. -..+.++.++-.+-+.....-.+--- T Consensus 301 NILStEImlEINReii~~l~~~a~~~k~~g~~~~~~-~~Gv~d~~~~~~~~~~~~~~e~~k~L~~~i~~~an~I~~~T~r 379 (529) T protein:vir:10 301 GILANEVMLEINREVIDWINYTAQVGKSGWTKTDGS-ASGVFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGR 379 (529) T ss_pred HHHHHHHHHHhhHHHHHhHhhhhhhhhccccccccc-ccceeecccCccccccchHHHHHHHHHHHHHHHHHHHHHhhcc Confidence 999999999999999998875553221111111111 11222222221110 01122233332222222222111111 Q ss_pred cCCcEEEeCHHHHHHHhccchhhhhcc----c--cccchhcceeEEE-eceEEEEecccccccccccccccccccccccc Q lcl|NC_011085. 200 SADRTFYTTPEVYSAILAALMPNAANY----A--ALIDPERGSIRNV-MGFEVVEVPHLTAGGAGDDREDETTNQKHAFP 272 (343) Q Consensus 200 ~~gR~~vv~P~~~~~Ll~~~~~~~~~~----~--~~~~~~~G~V~~i-~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~ 272 (343) ..+-|+|.+|++.++|-....+..... . ..++-.....|.+ .|++||.-++.|..=.. 
T Consensus 380 g~~n~vi~S~~Va~~L~~~~~~~~~~~~~~~sg~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~--------------- 444 (529) T protein:vir:10 380 GAGNFIIASRNVVSALALIDTNISPAAQGMASGLNADTTKGVFAGILGGRYKVYIDQYARQDYFT--------------- 444 (529) T ss_pred ccceEEEEchHHHHHHHhhhhhccccccccccccccccCCceEEEEecCceEEEecCCCCcceEE--------------- Confidence 235699999999998865433222110 0 0112222345554 45899988877642211 Q ss_pred ccccccccccccceEeEeechhhheeeeeeeeEEeeeeccchhhhhhhhhhhhccceeccc-----ceEEEEecCC Q lcl|NC_011085. 273 KTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERARRAEYQADQIIARYAMGHGGLRPE-----AAGALVFTAG 343 (343) Q Consensus 273 ~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~~~~~~~d~i~~~~~~G~~v~rpe-----~~~~i~~~~g 343 (343) ..|+++-....++||.|- +.+ +.-+..||+.|--.|-.+.+||..+ .|= ....-++..| T Consensus 445 ----vG~KG~~~~~~glfy~PY----v~l---~~~~~~dp~sfqP~~g~~tRY~l~~-NP~~~~~~~~~~~r~~~g 508 (529) T protein:vir:10 445 ----MGYRGANNLDAGIYYCPY----VAL---TPLRGSDPKNFQPVMGFKTRYAIGV-NPFAESRTQAPQGRITSG 508 (529) T ss_pred ----EEEeCCcccccceeeccc----ccc---ccccccCCCcccceeeeeeeeceee-cCccccccccccccccCC Confidence 112333334457888884 233 3334579999999999999998765 341 1112234444 No 201 >protein:vir:94070 Length: 339 # NCBI annotation: putative structural protein # Family: family:all:1653 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453625;genbank:gi:84662661;genbank:GeneID:5142580 Probab=83.06 E-value=0.072 Score=26.86 Aligned_cols=287 Identities=9% Similarity=-0.040 Sum_probs=122.1 Q ss_pred CCCCCccccccccccccccccchhHHHHHHHH-HHHH----HHHHHhhhhccCcccccccc--ceEEEEec---cCccee Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQSGGDKLALFLKVFG-GEVL----TAFARTSVTTNRHIMRSISS--GKSAQFPV---LGRTRA 70 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~ie~~~-g~V~----~~f~~~s~~~~~~~~~~i~~--G~tv~i~~---iG~~t~ 70 (343) ||=.. . . .-|..+ +.. ..+|..|. ..|+ +.-...-..+.++.+.+.-. -+++.+.. 
.|.+++ T Consensus 35 ~a~d~-~-~--~~~~~~--~~~--~~~i~a~~~~~i~~~vy~~~~~~~~~~~l~pv~t~g~w~~~t~~y~~~e~~G~a~~ 106 (339) T protein:vir:94 35 YAMDA-V-N--LTPTLQ--TTA--NAGIPAWMTTFVDRRVIDIQLAPMAAAKIFPEVKKGDWTTTYGVFIIAEPVGQVAT 106 (339) T ss_pred hhccc-c-c--cccccc--ccc--ccchhhhhhhhhchhheeecccccchhhhcccccCCCCcccEEEEeeeecccceEE Confidence 11110 0 0 001011 000 01332222 2222 22222234455666554321 25666654 455543 Q ss_pred eeecCCCcCCCccCCCccceEEEEeeeeeeeeeeccchHHHH---hchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhc Q lcl|NC_011085. 71 AYLQAGQSLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAM---NHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNM 147 (343) Q Consensus 71 ~~~~~g~~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q---~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~ 147 (343) |..+.+.+-...+.+-.++++.+=+ ..+.+..++... +..|+-..-.+.+..+|.+..|+..+.- T Consensus 107 --ygd~ad~Pl~~~~v~~~~~~v~~~~---~g~~y~~~E~~~A~~~g~~l~~~Ka~aA~~al~~~~N~i~~~G------- 174 (339) T protein:vir:94 107 --YSDWSANGMSKANVNFESRQNYRYQ---TWTEYGDLEMATYGEAGIDYVARQEISASLVMAKFANSSYLLG------- 174 (339) T ss_pred --cccccCCCcccccceeeEEeEEEEE---EEEeecHHHHHHHHhhCCChHHHHHHHHHHHHHHhhceEEeee------- Confidence 3333333221223344444554433 234455554333 4678888888888888888888755421 Q ss_pred cccccccccccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcC----CCcCCcEEEeCHHHHHHHhccchhhh Q lcl|NC_011085. 148 PAASNENIAGLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNY----VPSADRTFYTTPEVYSAILAALMPNA 223 (343) Q Consensus 148 ~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~----VP~~gR~~vv~P~~~~~Ll~~~~~~~ 223 (343) .....+.|+-....++...+...+=..++.+.|+++|..+...|.... 
-|.....++|||..|..|-.-..+ T Consensus 175 --d~~~~~~GLlN~P~l~~~v~~s~~Wa~kT~~eI~~Di~~~~~~l~~~s~g~~~~~~~~~L~LP~~~~~~L~~~n~~-- 250 (339) T protein:vir:94 175 --VAGIANYGLMNDPSLPAPVAATVNWATAAPEDIANDVVAMVGRLISQSGGLITGQERMVMALAPSALNNVNRTNNF-- 250 (339) T ss_pred --ecccceEEEEeCCCccccccCCCCcccCCHHHHHHHHHHHHHHHHHhcCCeeeeccCcEEEecHHHHHhcccCCcC-- Confidence 111112222221112111111111112345667888887777775553 244456899999999988643211 Q ss_pred hccccccchhcceeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeee Q lcl|NC_011085. 224 ANYAALIDPERGSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKD 303 (343) Q Consensus 224 ~~~~~~~~~~~G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~ 303 (343) ...-..-+++ +.-+++|...+.|-..+. .....+. .+.. .++-......+. T Consensus 251 -~~Tvl~~lk~----n~pnl~i~~~~el~~a~g---------~~~~~~~-----~~~~----------~~~~~~~~~p~~ 301 (339) T protein:vir:94 251 -GLSAGAKIAQ----TYPNIQFVAVPEFDTASG---------RLVQLWV-----PEVN----------GQPTGEVAFAEK 301 (339) T ss_pred -CccHHHHHHH----hcCCcEEEEccccccCCC---------ceEEEEE-----Eecc----------CCcceEEEcchh Confidence 0111112222 134566666555521110 0000000 0000 000000110111 Q ss_pred eEEeeeeccchhhhhhhhhhh-hccceecccceEEEEec Q lcl|NC_011085. 304 LSLERARRAEYQADQIIARYA-MGHGGLRPEAAGALVFT 341 (343) Q Consensus 304 ~~~e~~~~~~~~~d~i~~~~~-~G~~v~rpe~~~~i~~~ 341 (343) ++.=. 
..++...+.+....+ .|+-+.||.+++.+.-= T Consensus 302 ~~~lp-vq~~~~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 339 (339) T protein:vir:94 302 LRSHS-IERYSTTTRQKHSGATFGAVIYQPWAVTQELGV 339 (339) T ss_pred hhccc-cEEcCceEEecceeeeeeEEEEccceeeeeecC Confidence 11000 012334566666667 58999999997775544 No 202 >protein:vir:94933 Length: 330 # NCBI annotation: putative phage structural protein # Family: family:all:1120 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239278;genbank:gi:66392060;genbank:GeneID:5076578 Probab=82.83 E-value=0.074 Score=26.80 Aligned_cols=300 Identities=12% Similarity=0.078 Sum_probs=128.4 Q ss_pred CCCCCcccccc---ccccccc-------cccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEecc---Cc Q lcl|NC_011085. 1 MADMKGGQQLG---KDQGKGQ-------SGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVL---GR 67 (343) Q Consensus 1 ~~~~~~~~~~~---t~~g~~~-------~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~i---G~ 67 (343) |--.-+...-. |-.||-+ .-.+.-.|.-......|.+.|.+.+-+..+.+...+. |+..+.++. +. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~p~l~m~alTLaea~~l~~d~~~~~VIE~l~~~s~iL~~lpf~~ve-~~~~~~~r~~~lp~ 79 (330) T protein:vir:94 1 MVRICTPPLRGRWRTLTHQFPELKMPTVTLAESAKLSQDHLVSGLIETIVEVNPLYEMMPFTEIE-GNALAYNRENVLGD 79 (330) T ss_pred CceecCCccccceeehhccccccchhhhhhhHHhhcCchhhHHHHHHhhhccchHHhhccccccc-CCcceeeeeecCCc Confidence 11111100000 0000000 0000011122345677888888777666655555544 344555543 33 Q ss_pred ceeeeecCCCcCCCccCCCccceEEEEeeeeeeeeeeccchHHHHh-----chhhHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011085. 68 TRAAYLQAGQSLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMN-----HYDVRSEYTSQIGESLAMAADGAVLAELA 142 (343) Q Consensus 68 ~t~~~~~~g~~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~-----~~d~~~~~~~~~~~aLa~~~D~~i~~~~~ 142 (343) ++..+. +..++.. .+.+..+.+..+ ..+. 
.+-++|+.-+ ..|.+.+..+...++|++++...+|.- T Consensus 80 a~~r~~--n~~~~~~-~~~Tf~q~t~~l---~~l~-~~~~Vd~~iadl~g~~~d~~~~q~~~~ieal~~~~e~~linG-- 150 (330) T protein:vir:94 80 VQFLAV--GGTITAK-NPATFTKVTSEL---TTLI-GDAEVNGLIQATRSDFMDQTSVQVASKAKSIGRQYQASMITG-- 150 (330) T ss_pred ceeeec--ccccccc-Ccceeeeeeech---hhhh-hhHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHHHHhhcc-- Confidence 332222 3332211 111122333221 1111 1224444332 357888888889999999888766542 Q ss_pred hhhhccccccccccccCCceeecccccccccchHHHHHHHHHHHHHHHHHHhhcC-CCcCCcEEEeCHHHHHHHhccchh Q lcl|NC_011085. 143 GLCNMPAASNENIAGLGSASILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNY-VPSADRTFYTTPEVYSAILAALMP 221 (343) Q Consensus 143 ~~a~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~-VP~~gR~~vv~P~~~~~Ll~~~~~ 221 (343) ..+. ..-.+...-...++.+..++.+...++.. +|.| |+... -|-+.-+++++..+...+..-.+- T Consensus 151 Ds~~--~~F~GL~~~~~~~q~i~tg~~gg~~T~d~-----LDeL------l~~v~~~~g~~~~~l~n~a~~r~I~a~~R~ 217 (330) T protein:vir:94 151 DGTG--NSFQGMMGLVAASQTISAGANGGTLTFEL-----LDQL------LDLVKDKDGQVDYLMSSFAMRRKYFSLLRA 217 (330) T ss_pred CCCC--ccccchhhcCCcccEEecCCCCCCCCHHH-----HHHH------HHHhcCCCCCCcEEEechhHHHHHHHHHHh Confidence 0010 00001111123345555544455555533 2222 22221 122345888888877777664442 Q ss_pred hhh-c-cccccchhcceeEEEeceEEEEeccccccccccccccccccccccccccccccccccc------cceEeEeech Q lcl|NC_011085. 222 NAA-N-YAALIDPERGSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVAL------DNVVGLFQHR 293 (343) Q Consensus 222 ~~~-~-~~~~~~~~~G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~l~~~~ 293 (343) ... . +.-....---.|-.+.|++|+.++-+|.........+. ... |...| -..+||.... 
T Consensus 218 ~~~~~v~~~~~~~~G~~v~~~~GvPi~~~d~ip~~~~~~~~~~t----tsI--------yav~~G~~~~~qgV~Gl~~~g 285 (330) T protein:vir:94 218 LGGAAIGEVMTLPSGRQIPTYRGVPWFVNDFIPSNMTQGTATNA----TAI--------FAGTFDDGSNKYGIAGLTARG 285 (330) T ss_pred ccCCCCCCcccccCCCEEeeeCCeEEEecccccCCCCcccCCCc----eeE--------EEEeecccccccceEeecCCC Confidence 221 1 11112232335778999999999999865321100000 000 11110 0223332111 Q ss_pred hhheeeeeeeeEEeeeeccchhhhhhhh--hhhhccceecccceEEEEe-cCC Q lcl|NC_011085. 294 SAVGTVKLKDLSLERARRAEYQADQIIA--RYAMGHGGLRPEAAGALVF-TAG 343 (343) Q Consensus 294 ~Av~~~~~~~~~~e~~~~~~~~~d~i~~--~~~~G~~v~rpe~~~~i~~-~~g 343 (343) .. -+.++..-. ..-..+++. .+.+|..++.|+++++|+- .-| T Consensus 286 ~~-------glsVr~~G~-~~~k~v~~~~v~~y~~~av~~~~a~~~L~~V~~g 330 (330) T protein:vir:94 286 SA-------GLRVQNVGA-KENADETITRVKMYCGFANFSQLGLAAIKGLIPG 330 (330) T ss_pred CC-------cceeeeCCC-ccccceeeEEEEEeeeeEEechhheeeeccccCC Confidence 10 122222110 011122222 2346999999999999873 344 No 203 >protein:vir:95131 Length: 325 # NCBI annotation: hypothetical protein ORF010 # Family: family:all:47 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293417;genbank:gi:148912838;genbank:GeneID:5228206 Probab=82.01 E-value=0.081 Score=26.58 Aligned_cols=274 Identities=14% Similarity=0.064 Sum_probs=115.9 Q ss_pred ccccchhHHHHHHHHHHHHHHHHHh-----hhhc----c-CccccccccceEEEEeccCcc-----eeeeecCCCcCCCc Q lcl|NC_011085. 18 QSGGDKLALFLKVFGGEVLTAFART-----SVTT----N-RHIMRSISSGKSAQFPVLGRT-----RAAYLQAGQSLDDK 82 (343) Q Consensus 18 ~~~~d~~al~ie~~~g~V~~~f~~~-----s~~~----~-~~~~~~i~~G~tv~i~~iG~~-----t~~~~~~g~~i~~~ 82 (343) .+-+|. ++|..++..++-+. .+|- + ++.....-.|+-+..|..-.. +..++.....+. 
T Consensus 1 m~lsD~-----~vfN~~~~~a~~e~~~q~~~~fn~as~gai~l~~~~~~Gd~~~~pf~~~l~g~~~~~~~~~~~~~vt-- 73 (325) T protein:vir:95 1 MALSDL-----AVYSEYAYSAFSETLRQQVDLFNTATGGAIMLQSAAHQGDFSDVAFFAKVTGGLVRRRNAYGSGTVA-- 73 (325) T ss_pred Cchhhh-----hhhhhhhhhhhhhhhhhhHhhhhhcccceeEeccccccCceeeccccccccccccccccCCCCceec-- Confidence 222221 24555555444432 1111 1 111112224777777765432 222333222222 Q ss_pred cCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccccCCce Q lcl|NC_011085. 83 RKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENIAGLGSAS 162 (343) Q Consensus 83 ~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~g~~~~~ 162 (343) +..++..+..-++ -..-..+...|+...-...|.+++++++.|..+++...+.++..+..+...+-... ... T Consensus 74 ~~kitt~~~~av~-~~r~~g~~~~d~~~~~~g~~~~~~~~~~Ig~~~a~~~~~~~l~~~~~~l~~a~~~~-------~~~ 145 (325) T protein:vir:95 74 EKVLKHLVDTSVK-VAAGTPPVRLDPGQFRWIQQNPEVAGAAMGQQLAVDTMADMLNVGLGSVYSALSQV-------SDV 145 (325) T ss_pred cceeccccceeeE-EecccCcccccHHHHhhcCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccc-------ccc Confidence 2223332222111 11112233445544445577889999999999999887777666554332211111 111 Q ss_pred eecccccccccchHHHHHHHHHHHHHHHHHHhhcCCCcCC-cEEEeCHHHHHHHhccchhhhh-c-cccccchhcceeEE Q lcl|NC_011085. 163 ILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNYVPSAD-RTFYTTPEVYSAILAALMPNAA-N-YAALIDPERGSIRN 239 (343) Q Consensus 163 ~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~g-R~~vv~P~~~~~Ll~~~~~~~~-~-~~~~~~~~~G~V~~ 239 (343) +....+..+..+.... ...|.+|..+|-++. +. ..+++.+..|..|.+.. +++. . +..... . .|.. 
T Consensus 146 v~dis~~~~~~~~~~s----~~~l~~A~~klGD~~---~~l~~~~MHS~v~~~L~~~~-L~~~~~~~~~~g~-~--~i~t 214 (325) T protein:vir:95 146 VYDATANTDAADKLPT----WNNLNNGQAKFGDQS---SQIAAWIMHSTPMHKLYGSN-LTNGERLFTYGTV-N--VVRD 214 (325) T ss_pred eeeeecccCccccccc----HHHHHHHHHHhcccc---cceeEEEEchHHHHHHHHhh-ccccccccccCCc-c--cccc Confidence 2222222221111111 245666788887763 23 57789999999998753 3322 1 111111 1 3567 Q ss_pred EeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEeee---eccchhh Q lcl|NC_011085. 240 VMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERA---RRAEYQA 316 (343) Q Consensus 240 i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~---~~~~~~~ 316 (343) ++|-+|+++..+|.....+... |. .+.+-+-|++.....++..... +++ ... T Consensus 215 ~~G~~VIVdD~~p~~~~g~~~~-----------------yt-------ty~lg~GAi~~~~~~~~~~~~~~~~~~~-~~~ 269 (325) T protein:vir:95 215 PFGKLLVMTDSPNLFAAGTPNV-----------------YH-------ILGLVPGGVLIGQNNDFDANEETKNGDE-NII 269 (325) T ss_pred cCCcEEEEeCCCCCCCccCcee-----------------EE-------EEEEecCeEEecCCCCccccccccCccc-cee Confidence 8999999999999654332111 10 1222223333333333222111 111 111 Q ss_pred hhhh-----hhhhhcccee-----------------------cc--cceEEEEecC Q lcl|NC_011085. 317 DQII-----ARYAMGHGGL-----------------------RP--EAAGALVFTA 342 (343) Q Consensus 317 d~i~-----~~~~~G~~v~-----------------------rp--e~~~~i~~~~ 342 (343) .-++ .++.+|.+-. .| +.++++.-+. 
T Consensus 270 ~~~~~~~tf~lhp~G~sw~~s~~g~sPt~aeL~~~~NW~rv~~~~K~tagv~~~~~ 325 (325) T protein:vir:95 270 RTYQAEWSYNIGVKGFAWDKANGGKSPTDAALFTSTNWDKYATSHKDLAGVVVKTN 325 (325) T ss_pred eeeeeeeeEEeecceeeeecccccCCcChHhhcCCcCcceecCCCccccceeEeeC Confidence 1111 1222333221 01 0011111111 No 204 >protein:vir:78148 Length: 123 # NCBI annotation: hypothetical protein # Family: family:all:4955 # MgeID: mge:1847 # MgeName: Min1 # Cross-refs: genbank:acc:YP_001294802;genbank:gi:149882823;genbank:GeneID:5309176 Probab=80.05 E-value=0.024 Score=29.43 Aligned_cols=119 Identities=15% Similarity=-0.018 Sum_probs=62.1 Q ss_pred EeCHHHHHHHhccchhhhhcc--ccccchhcceeEEEeceEEEEeccccccccccccccccccccccccccccccccccc Q lcl|NC_011085. 206 YTTPEVYSAILAALMPNAANY--AALIDPERGSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVAL 283 (343) Q Consensus 206 vv~P~~~~~Ll~~~~~~~~~~--~~~~~~~~G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 283 (343) +|+--+|..++.+.-...+-. +..-.+..+.--+++|.+++.|+|||.. .+-....+..++.+.-+-....|... T Consensus 1 vvsdlqfA~~~g~~v~~~aLpRE~aNp~ltG~lpV~~~GltWl~tpnlpg~--~a~vlDst~lGgmaDE~l~~Pgya~~- 77 (123) T protein:vir:78 1 MLSGAQFAKLIGILVDDKALPREQANIVLTGSLPVSAYGLTWVTSRHITGT--DPWLFDVEQLGGMADEKLLSPEFAPA- 77 (123) T ss_pred CcchhhHHHHhcchhcccccccccCCceEecCcceeeeceeeeecCCCCCC--ccceeehhhhccccccccCCCcccCC- Confidence 555556887776542221111 1222333344457999999999999932 22222233333322222222222211 Q ss_pred cceEeEeechhhheeeeeeeeEEeeeeccc--hhhhhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 
284 DNVVGLFQHRSAVGTVKLKDLSLERARRAE--YQADQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 284 ~~~~~l~~~~~Av~~~~~~~~~~e~~~~~~--~~~d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) ....+++...|..+ .-.+.|+++-+-=.-++.|.+.+.|+-.-= T Consensus 78 ----------------~~~Gvevkt~Red~~~nD~yriRaRRvTvpiv~EP~Agv~ltg~g~ 123 (123) T protein:vir:78 78 ----------------GNTGVEASTERAHQGVKDGYLVRGRRNTVAVVTEPMAGVRLTGTGL 123 (123) T ss_pred ----------------CCcceeEEeeccccCCCCceEEeeeecceeEEecCccceEEeeecC Confidence 01113444455555 667888888888888888887776653211 No 205 >protein:vir:79078 Length: 307 # NCBI annotation: gp8 # Family: family:all:908 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111208;genbank:gi:134288798;genbank:GeneID:4960752 Probab=79.47 E-value=0.1 Score=25.97 Aligned_cols=290 Identities=9% Similarity=0.045 Sum_probs=105.9 Q ss_pred CCCCCccccccccccccccccchh--HHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEeccCcceee--e--ec Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQSGGDKL--ALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVLGRTRAA--Y--LQ 74 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~--al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG~~t~~--~--~~ 74 (343) |..+. + .|+ -|+. ++-+.-+ ...|-...+ .+.+++ ...+.+++.+|+-... + .. T Consensus 1 m~~~~---~--~~~------~dp~LT~~A~gy~----n~~~Iad~l-fP~vpV----~~~~~k~~~f~~e~f~~~~t~ra 60 (307) T protein:vir:79 1 MGRLS---K--LRI------VDPVLTNLAIGYT----NAEFIGQTL-MPVVEV----EKEGGKIPKFGKESFRLYQTERA 60 (307) T ss_pred CCCCC---C--Ccc------cCHHHHHHHhhcc----chhhhhhhc-CCcccc----cccccceeeeccccccccccccc Confidence 55442 1 222 1221 1111111 111211111 222222 2233444444432211 1 11 Q ss_pred CCCcCCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccc Q lcl|NC_011085. 75 AGQSLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNEN 154 (343) Q Consensus 75 ~g~~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~ 154 (343) ++...+- ...-..+..++.+++.. 
-...||+.+...+.||++...++.....+.+..+-.+...+-..++ T Consensus 61 ~~~~~~~-v~~~~~~~~~~~~~~~~-l~~~id~r~~~~~~~~~~~~Av~~l~d~I~l~~E~~~A~l~~~~~~-------- 130 (307) T protein:vir:79 61 LRAKSNR-MNPEDIDSVDVNLDEHD-LEYPIDYREDQESAFPLEQAAVQTATDAIQLRREKMIADLSQNPSS-------- 130 (307) T ss_pred cCCCcce-eeeeccccccccccccc-hhhcccchhcCCCCCCHHHHHHHHHHHHHHhHHHHHHHHHhccccc-------- Confidence 2221111 11011233455555543 2346777777788898877555544333333333222211111111 Q ss_pred ccccCCceeecccccc----cccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhc-cccc Q lcl|NC_011085. 155 IAGLGSASILEVGAKG----DLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAAN-YAAL 229 (343) Q Consensus 155 ~~g~~~~~~~~~~~~~----~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~-~~~~ 229 (343) ...++.+.++++. +.-|| +..|.++++.+.+..- ...-.+++++..|..|+.++++.+.- +.+. T Consensus 131 ---y~~~~k~tLsgt~~Wsd~~sDP-------i~di~~~~~ai~~~~g-~~Pn~~vlg~~a~~~l~~h~~i~~~lk~~~~ 199 (307) T protein:vir:79 131 ---YAAGNKKQLSATEKFTAANSDP-------VGVIEDGKEAIRTKIG-RRPNTMVIGASAYKTLKAHPQLIEKIKYSMK 199 (307) T ss_pred ---cCCCceEEEccCcccCCCCCCc-------HHHHHHHHHHHHHhhC-CccceEEeCHHHHHHHhcCHHHHHHhcCccc Confidence 1222233333221 12234 5556666666665432 34569999999999999999988753 3333 Q ss_pred cchhcceeEEEeceE-EEEeccccccccccccccccccccccccc-----cccccccccccceEeEeechhhheeeeeee Q lcl|NC_011085. 230 IDPERGSIRNVMGFE-VVEVPHLTAGGAGDDREDETTNQKHAFPK-----TAEGDTKVALDNVVGLFQHRSAVGTVKLKD 303 (343) Q Consensus 230 ~~~~~G~V~~i~Gf~-V~~sn~lp~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~ 303 (343) +.+.--.+..+.|++ |+.-....... .......++.+.... ...+. ..-+.-+.|-.+.. .. 
T Consensus 200 g~it~~~la~l~~v~~V~vg~a~y~~~---~~~~~~iw~~~~~l~y~~~~~~~~~-~~~~~ps~Gyt~~~--------~g 267 (307) T protein:vir:79 200 GIVTVDLLKEIFEVENIAVGEAIYADD---KDRFTDIWGANIVLAYVPLQRGGQQ-RTPYEPSYGYTLRK--------KG 267 (307) T ss_pred cccCHHHHHHHhCceeEEEeeeeeecc---cccchhcCCCceEEEecccccCCCC-CcccccccceeEEe--------cC Confidence 322222345667775 44333222111 111111111111000 00000 00000011111110 00 Q ss_pred eEEeeeeccchhhhhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 304 LSLERARRAEYQADQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 304 ~~~e~~~~~~~~~d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) -.....|.....+|.|+.....=-.++=||+---|.=.-| T Consensus 268 ~~~~d~~~~~~~~~~vrv~~~~~~~i~~~~~G~li~~~v~ 307 (307) T protein:vir:79 268 NPVVDTRIEDGKLELVRATDIFRPYLLGADAGYLISGING 307 (307) T ss_pred ceEEecccCCCceeEEeecccccceeeccccchhhccCCC Confidence 0011112222333443333332222333332222222222 No 206 >protein:vir:5670 Length: 514 # NCBI annotation: gp23 # Family: family:all:364 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899609;genbank:gi:34419596;genbank:GeneID:2546039 Probab=78.73 E-value=0.11 Score=25.81 Aligned_cols=302 Identities=15% Similarity=0.100 Sum_probs=126.8 Q ss_pred CCCCCccccccccc--cccccccchhHHHH-----HHHHHHHHH----------HHHHhhhhccCccccccccceEEE-- Q lcl|NC_011085. 1 MADMKGGQQLGKDQ--GKGQSGGDKLALFL-----KVFGGEVLT----------AFARTSVTTNRHIMRSISSGKSAQ-- 61 (343) Q Consensus 1 ~~~~~~~~~~~t~~--g~~~~~~d~~al~i-----e~~~g~V~~----------~f~~~s~~~~~~~~~~i~~G~tv~-- 61 (343) |-. ++|.=-.-|. +....++. -|+|. .-|+|..-. ++...+.+... -+..+|+... 
T Consensus 114 MTg-PTGLIFAMRsrY~~~~~tg~-EAf~~~nEadt~fSG~~~~~~~~~~~~~~~~~~G~~~~~~---~t~~~gd~~~~~ 188 (514) T protein:vir:56 114 MTG-PTSQVFTLRSVYGKDPLTGA-EAFHPTRQADASFSGQAAASTIADFPTTGAATDGTPYKAE---VTTSGGDVSMRY 188 (514) T ss_pred CCc-hhhhheeeeeeecCCCcccc-cccccccccCcCcccccccccccccccccccccccccccc---cccccccccccc Confidence 100 0000000000 00000000 01110 001111000 00000000000 0011111111 Q ss_pred ----------------------------EeccC--cceeeeecCCCc---CCCccCCCccceEEEEeeeeeeee------ Q lcl|NC_011085. 62 ----------------------------FPVLG--RTRAAYLQAGQS---LDDKRKDIKHTEKTIVIDGLLTAD------ 102 (343) Q Consensus 62 ----------------------------i~~iG--~~t~~~~~~g~~---i~~~~~~~~~~~~~l~iD~~~~~~------ 102 (343) +..+| ..+. .++. +.+. ....-.+.-+.||+...-+ T Consensus 189 ~~~~~~~~~~~~~~~~~t~~~~~~a~~~~y~~~~Gm~Ta----~aEal~~lggs-~~~~f~EMaFsIdK~tVtAKSRaLK 263 (514) T protein:vir:56 189 FLALGAVTLAVAGQMTATEYTDGVAGGLLVEIDAGMATS----QAELQENFNGS-SNNEWNEMSFRIDKQVVEAKSRQLK 263 (514) T ss_pred ccccccccccccccccccccccccccchhhhhhhhhhhh----hhhhcccCCCC-cccccceeeeEEEEEEEeeecccee Confidence 01111 1110 0111 1111 1123456677787764332 Q ss_pred --eeccchHHHHh-c-hhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccccCCceeecccccccccchHHH Q lcl|NC_011085. 103 --VLIYDIEDAMN-H-YDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENIAGLGSASILEVGAKGDLTSPVEL 178 (343) Q Consensus 103 --~~Idd~D~~q~-~-~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~ 178 (343) ..|.-..+.++ | .|.-.|++.=.+..+...+++-|++.+...+..... -...+.....+++.....+.....-. T Consensus 264 AEYTiELAQDLKAVHGLDAEtELsNILSTEImlEINReii~~l~~~atv~~~--~~~~~~~~~G~~d~~~~~d~~~~~~~ 341 (514) T protein:vir:56 264 AQYSIELAQDLRAVHGLDADAELSGILANEVMVELNREIVNLVNSQAQIGKS--GWTQGAGAAGVFDFSDAVDVKGARWA 341 (514) T ss_pred ccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHHHHHhheeehhc--ccccccccccccccccccccccchHH Confidence 44555556666 3 888899999999999999999998777544422111 12233333334444433332222222 Q ss_pred H---HHHHHHHHH-HHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhh--------hhcccc--ccchhcceeEEEeceE Q lcl|NC_011085. 
179 G---KAVIAQLTI-ARAKLTSNYVPSADRTFYTTPEVYSAILAALMPN--------AANYAA--LIDPERGSIRNVMGFE 244 (343) Q Consensus 179 ~---~~i~~~l~~-a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~--------~~~~~~--~~~~~~G~V~~i~Gf~ 244 (343) + +.++-.|.+ +.+...+-- --.+.|+|.+|++.++|-...-+. .+.... ...+--|.+. .|++ T Consensus 342 ~e~~~~l~~~i~~~an~i~~~T~-rg~gn~~i~S~~Va~~L~~sg~l~~~~~~g~~~~~~~~d~~~~~~aG~l~--~~~~ 418 (514) T protein:vir:56 342 GEAYKALLIQIEKEANEIGRQTG-RGNGNFIIASRNVVSALSMTDTLVGPAAQGMQDGSMNTDTNQTVFAGVLG--GRFK 418 (514) T ss_pred HHHHHHHHHHHHHHHHHHHhhcc-cccccEEEEchhHHHHHHhhhhhccccccCccccccccccCcceEEEEec--CceE Confidence 2 222222221 111111111 124789999999999986543221 111111 1112223332 6789 Q ss_pred EEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEeeeeccchhhhhhhhhhh Q lcl|NC_011085. 245 VVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERARRAEYQADQIIARYA 324 (343) Q Consensus 245 V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~~~~~~~d~i~~~~~ 324 (343) ||.-++.|..=.. ..|+++.....++||.|-. .+. .-+..||+.|--.|-.+.+ T Consensus 419 vy~D~y~~~dy~~-------------------vG~KG~~~~~~glfyaPYv----~l~---~~~~~dp~sfqP~~g~~tR 472 (514) T protein:vir:56 419 VYIDQYAVNDYFT-------------------VGFKGSTEMDAGVFYSPYV----PLT---PLRGSDSKNFQPVIGFKTR 472 (514) T ss_pred EEecCCCCcceEE-------------------EEEecCcceecceeecccc----ccc---cccccCCccccceeeeeee Confidence 9988877642111 1233333444578998853 222 2234699999999999999 Q ss_pred hcccee--cccceEEEEecCC Q lcl|NC_011085. 
325 MGHGGL--RPEAAGALVFTAG 343 (343) Q Consensus 325 ~G~~v~--rpe~~~~i~~~~g 343 (343) ||..+- -++....+....| T Consensus 473 Y~l~~NPy~~~~~~~~~~~~~ 493 (514) T protein:vir:56 473 YGVQVNPFADPTASATKVGNG 493 (514) T ss_pred eceeeCCCCCccccccccCCc Confidence 987643 2222223333222 No 207 >protein:vir:96079 Length: 382 # NCBI annotation: hypothetical protein ORF023 # Family: family:all:1653 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294440;genbank:gi:149408337;genbank:GeneID:5237198 Probab=77.84 E-value=0.12 Score=25.62 Aligned_cols=302 Identities=12% Similarity=0.015 Sum_probs=129.2 Q ss_pred CCC---------CCcc-ccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCcccccccc---ceEEEEec--- Q lcl|NC_011085. 1 MAD---------MKGG-QQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISS---GKSAQFPV--- 64 (343) Q Consensus 1 ~~~---------~~~~-~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~---G~tv~i~~--- 64 (343) |+. |-.. ....|.+ +.-+.+-|++-|...+.+....--+...++.+.+ ++ -+++.++. T Consensus 51 ~~~~~~~~~~~amDa~~~~~~t~~-----~~g~p~~~l~~~~p~~~~~~~~p~~~~~l~pv~t-~g~W~~~t~ty~~~e~ 124 (382) T protein:vir:96 51 LAKAGAFRSGSAMDSNFTAPVTTP-----SIPTPIQFLQTWLPGFVKVMTAARKIDEIIGIDT-VGSWEDQEIVQGIVEP 124 (382) T ss_pred hhhhhhhhhhcccccccCCccccC-----CccHHHHHHhhhhhhhhhhhhhhhhhhhhccccc-cCCccceEEEEeeeec Confidence 110 0000 0011221 2223566888888766555444445566666654 22 24666654 Q ss_pred cCcceeeeecCCCcCCCccCCCccceEEEEeeeeeeeeeeccchHHHH---hchhhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_011085. 65 LGRTRAAYLQAGQSLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAM---NHYDVRSEYTSQIGESLAMAADGAVLAEL 141 (343) Q Consensus 65 iG~~t~~~~~~g~~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q---~~~d~~~~~~~~~~~aLa~~~D~~i~~~~ 141 (343) .|.+++ |..+++.+-...+.+..++++..=+ ..+.+.++++.+ +.+|+-++-...+..+|.+..|+..+.-. 
T Consensus 125 ~G~A~~--ygd~~D~Pl~d~~~~~~~r~v~~~~---~g~~yg~lE~~rAa~~~~~l~~~Ka~aA~~ale~~~N~i~f~G~ 199 (382) T protein:vir:96 125 AGTAVE--YGDHTNIPLTSWNANFERRTIVRGE---LGLLVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGW 199 (382) T ss_pred ccceEE--eecccCCCccccccceeEEEEEEEE---EeeeecHHHHHHHHhhCCCcHHHHHHHHHHHHHHhhceEEEEee Confidence 476653 3333333222223344444444322 235566676655 47888888888888888888887554211 Q ss_pred HhhhhccccccccccccCCceeeccccccccc-chHHHHHHHHHHHHHHHHHHhhcCC----CcC-CcEEEeCHHHHHHH Q lcl|NC_011085. 142 AGLCNMPAASNENIAGLGSASILEVGAKGDLT-SPVELGKAVIAQLTIARAKLTSNYV----PSA-DRTFYTTPEVYSAI 215 (343) Q Consensus 142 ~~~a~~~~~~~~~~~g~~~~~~~~~~~~~~~~-~~~~~~~~i~~~l~~a~~~Ld~~~V----P~~-gR~~vv~P~~~~~L 215 (343) . ....+ .+.|+-....++...+.... -..++.+.|+++|..+...|....- |.. ...++|||..|..| T Consensus 200 ~-----~g~~~-~~yGllNdP~l~a~~t~a~~~Wa~kT~~eI~~Di~~l~~~i~~qt~G~~~~~~~~~~L~LP~~~~~~L 273 (382) T protein:vir:96 200 Q-----SGLGN-RTYGFLNDPNLPPFQTPPSQGWATADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITMALATSKVDYL 273 (382) T ss_pred e-----cCcCc-ceEEEEeCCCcccccccCCCCcccccHHHHHHHHHHHHHHHHhccCCeeeecccceEEeechHHHhhc Confidence 0 00000 01122111112211111111 1234467788888888888866542 433 34688999999888 Q ss_pred hccchhhhhcccc--ccchhcceeEEEeceEEEEecccccccccccccccccccccccccccccccc--ccccceEeEee Q lcl|NC_011085. 216 LAALMPNAANYAA--LIDPERGSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTK--VALDNVVGLFQ 291 (343) Q Consensus 216 l~~~~~~~~~~~~--~~~~~~G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~l~~ 291 (343) -... +|+- ..-+++ +.-+++|...+.|-..+..+... ....-....... 
.+....+...| T Consensus 274 s~~n-----~~g~Tvl~~lk~----n~Pnl~i~t~peL~~a~~~g~g~-------~~~~~~~~~e~~~~~~~s~~~p~~f 337 (382) T protein:vir:96 274 SVTT-----PYGISVSDWIEQ----TYPKMRIVSAPELSGVQMQGKTP-------EDALVLFVEEVDASVDGSTDGGSVF 337 (382) T ss_pred cccC-----ccCccHHHHHHH----hcCCcEEEEccccccccCCCccc-------eeEEEEecchhhhhcccccccCcce Confidence 5321 1210 111222 23455666655553221111000 000000000000 01111122222 Q ss_pred chhhheeeeeeeeEEeeeeccchhhhhhhhhhh-hccceecccceEEEEec Q lcl|NC_011085. 292 HRSAVGTVKLKDLSLERARRAEYQADQIIARYA-MGHGGLRPEAAGALVFT 341 (343) Q Consensus 292 ~~~Av~~~~~~~~~~e~~~~~~~~~d~i~~~~~-~G~~v~rpe~~~~i~~~ 341 (343) ...-- ++.+.+.+| + ..-++.+....+ .|+-+.||.+++-+.== T Consensus 338 ~q~~p--~~~~~l~ve--~--~~~~~~~~~s~~t~Gv~i~~P~ai~~~~GI 382 (382) T protein:vir:96 338 SQLVQ--SKFITLGVE--K--RAKSYVEDFSNGTAGALCKRPWAVVRYLGI 382 (382) T ss_pred ecccc--ceeeeccce--e--ecceeEeccccceeeeEEEcchhhhhccCC Confidence 21100 000001111 1 111223333333 47777777776554333 No 208 >protein:vir:99424 Length: 360 # NCBI annotation: hypothetical protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919080;genbank:gi:119757038;genbank:GeneID:4606077 Probab=75.25 E-value=0.15 Score=25.12 Aligned_cols=302 Identities=9% Similarity=0.009 Sum_probs=119.4 Q ss_pred CCCCCcccccc----ccccccc-cccchh--HHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEeccCcceeee- Q lcl|NC_011085. 1 MADMKGGQQLG----KDQGKGQ-SGGDKL--ALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVLGRTRAAY- 72 (343) Q Consensus 1 ~~~~~~~~~~~----t~~g~~~-~~~d~~--al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG~~t~~~- 72 (343) |.+..+..++- ++.-+.. ..+|.. -|=.+++...|...+. ++-+.+..++... 
..++.+|+++|-....- T Consensus 1 ~~~~~~~~~~~n~~~~~i~k~~it~~~l~~g~L~p~~a~~Fl~~v~~-~t~iL~~~r~~~~-~s~~~ei~kig~G~r~~r 78 (360) T protein:vir:99 1 MSSNSTIDSVRNQNMNSLSQKDIGLAELDGFQLPVDVTEEFLERMQK-GVQILGMADTMTL-ARLEMEVPQFGVPRLSGH 78 (360) T ss_pred CcchhHHHHHhhhHHHHHHhhhccccccCceeecHHHHHHHHHHHhh-ccchhhhcceeec-ccccccccccccceeecc Confidence 77776665532 1111111 111111 1223666666655554 4444566655432 45677777776543322 Q ss_pred -ecCCCcCCCccCCCccceEEE-EeeeeeeeeeeccchHHHHhc-------hhhHHHHHHHHHHHHHHHHHH---H---- Q lcl|NC_011085. 73 -LQAGQSLDDKRKDIKHTEKTI-VIDGLLTADVLIYDIEDAMNH-------YDVRSEYTSQIGESLAMAADG---A---- 136 (343) Q Consensus 73 -~~~g~~i~~~~~~~~~~~~~l-~iD~~~~~~~~Idd~D~~q~~-------~d~~~~~~~~~~~aLa~~~D~---~---- 136 (343) ++.+...+..+ +++...+.+ ..+...++...+.+-.+...+ -.+++.++++.++-|....-+ . T Consensus 79 ~~~e~~~~~~~~-~~~~~~v~~~~~~~~~~~~~i~~~~~~~n~~~~~~~f~~~i~~~~ae~~~~Dle~l~~~g~~ds~d~ 157 (360) T protein:vir:99 79 TRDEEGSRTENS-EAESGSVKFNATDKSYYILVEPKRDALKNTHYGPDQFGDYIVDQFIERYGNDLGLMGIRAGASSGNL 157 (360) T ss_pred ccccCCCCCcCC-cCccccCccccccceeeEeechHHHHHhhhhcccchhHHHHHHHHHHHHHHHHHHHHhhccchhccc Confidence 22211211111 122222222 344444444333221111111 124555555555543332111 0 Q ss_pred -----------HHHHHH-hhhhccccccccccccCCceeecccc-ccccc--chHHH--------HHHHHHHHHHHHHHH Q lcl|NC_011085. 137 -----------VLAELA-GLCNMPAASNENIAGLGSASILEVGA-KGDLT--SPVEL--------GKAVIAQLTIARAKL 193 (343) Q Consensus 137 -----------i~~~~~-~~a~~~~~~~~~~~g~~~~~~~~~~~-~~~~~--~~~~~--------~~~i~~~l~~a~~~L 193 (343) +..-|. ++..... .+...+.++-+.... ....+ .|... 
+..-.+++.++.+.| T Consensus 158 ~~~~~~d~fl~~~dGwlKka~~~~~----~id~a~d~t~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~lf~~~~~~L 233 (360) T protein:vir:99 158 QSIGGAAELDNTFKGWIARAEGDAQ----SVDDAGDSTRIGLEDTATADADSMPSIANTDGSGNPQPVDTSLFNETIQTL 233 (360) T ss_pred ccCcccchhhhhhHHHHHHhhcccc----hhhccccccccccccccccccccchhhhccccccccccchHHHHHHHHHhc Confidence 000011 1100000 000000000000000 00000 00000 000123344556666 Q ss_pred hhcCC--CcCCcEEEeCHHHHHHHhccchhhhhcc-ccccchhcceeEEEeceEEEEecccccccccccccccccccccc Q lcl|NC_011085. 194 TSNYV--PSADRTFYTTPEVYSAILAALMPNAANY-AALIDPERGSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHA 270 (343) Q Consensus 194 d~~~V--P~~gR~~vv~P~~~~~Ll~~~~~~~~~~-~~~~~~~~G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~ 270 (343) ..+.- |...-+.+++|..+..... .+.++.. .|+..+..+..-++.|++|...+++|... T Consensus 234 p~kyr~~~~~~~~~~~s~~~~~~yr~--~L~~R~t~LGd~~l~g~~~~~~~Gipi~~v~~~pd~~--------------- 296 (360) T protein:vir:99 234 DSRYRESDAYSPVLMTSPNQVQSYTM--SLTEREDPLGSAVIFGDSDITPFSYDLVGVNGFPDEY--------------- 296 (360) T ss_pred chhhhcCcccceEEEccCchHHHHHH--HHhccCcccchhheecccccccceeeeEEcCCCCCCc--------------- Confidence 66642 1112256677665544443 2223332 45555665555578999999999998321 Q ss_pred ccccccccccccccceEeEeechhhheeeeeeeeEEeeeeccchhhh----hhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 271 FPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERARRAEYQAD----QIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 271 ~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~~~~~~~d----~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) .++.+++-+..+...+++.+...++.+.++ .+..+.+.=--+++-+-++++.+--. 
T Consensus 297 -----------------~mlT~p~NLi~g~~~~iri~~~~e~~~~~~~~~~~~~~~~~~~D~~iee~~Av~~vt~~~ 356 (360) T protein:vir:99 297 -----------------MMFTDPNNLAFGLYEEMELDQSTDTDKVHEQRLHSRNWLEGQFDFQIKEQQAGVLVTDLE 356 (360) T ss_pred -----------------eEEeccCceeEEeeeeeEEeecccchhhhhhceeeeEEEEEEeeEEEEecccEEEEecCC Confidence 255667777677777777665444444333 11111111112233333333333222 No 209 >protein:vir:107882 Length: 307 # NCBI annotation: gp34 # Family: family:all:908 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024707;genbank:gi:48696944;genbank:GeneID:2845970 Probab=74.54 E-value=0.16 Score=24.99 Aligned_cols=294 Identities=11% Similarity=0.045 Sum_probs=106.2 Q ss_pred CCCCCccccccccccccccccchh--HHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEeccCcceeee-ecCCC Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQSGGDKL--ALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVLGRTRAAY-LQAGQ 77 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~--al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG~~t~~~-~~~g~ 77 (343) |..+ ++ .|+ -|+. ++-+--+.. .|-..++ .+.+++. .++|+-.+|+.-.-....+ +.++. T Consensus 1 m~~~---~~--~~~------~dp~LT~~A~gy~n~----~~ia~~l-~P~vpv~-~~~~k~~~f~~eaF~~~~t~r~~~~ 63 (307) T protein:vir:10 1 MGRL---SK--LRI------VDPVLTNLAIGYTNA----EFIGQSL-MPVVEVE-KEGGKIPKFGKESFRLYKTERALRA 63 (307) T ss_pred CCCC---CC--Ccc------cChhHHHHHHhhcch----hhhhhhc-CCccccc-ccccceeeECcccccchhhhcccCC Confidence 4433 22 222 1221 111111111 1211211 2223221 1234444443211000000 11111 Q ss_pred cCCCccCCCccceEEEEeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccc Q lcl|NC_011085. 78 SLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENIAG 157 (343) Q Consensus 78 ~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~g 157 (343) ... 
..+.-..+.....+-+.- -...||+-+...+.||++....+.....|.+..+-.+...+...++ T Consensus 64 ~~~-~v~~~~~~~~~~~~~~~~-L~~~id~r~~~~~~~~~~~~av~~l~d~I~l~~E~~~A~l~~~~~~----------- 130 (307) T protein:vir:10 64 RSN-RMNPEDLGSIDIVLDEHD-LEYPIDYREDQESAFPLEQAAVQTATEAIQLRREKMVADLAQNPNS----------- 130 (307) T ss_pred Ccc-eeeccccccccccccccc-ccccCChhhcCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCccc----------- Confidence 111 001001111222222221 1245776677778899877766665554444443322221111111 Q ss_pred cCCceeecccccc----cccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhh-ccccccch Q lcl|NC_011085. 158 LGSASILEVGAKG----DLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAA-NYAALIDP 232 (343) Q Consensus 158 ~~~~~~~~~~~~~----~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~-~~~~~~~~ 232 (343) .+.+..+.++++. +.-|| +..|.++++++.+..- .....+++++..|..|+.++++... .+.+.+.+ T Consensus 131 y~~~~k~tLsGt~~Wsd~~sDP-------i~di~~~~~ai~~~~g-~~Pn~~vlg~~a~~al~~hp~i~e~lk~~~~g~i 202 (307) T protein:vir:10 131 YAGGNKKQLSATEKFTAAGSDP-------VGVIEDGKEAIRTKIG-RRPNTMVIGASAYKTLKAHPQLIEKIKYSMKGIV 202 (307) T ss_pred cCCCceEEeccccccCCCCCCc-------HHHHHHHHHHHHhhhC-CccceEEeCHHHHHHHhcCHHHHHHhCCcccccc Confidence 1122223332221 12234 5566666666666433 3456999999999999999998865 34333323 Q ss_pred hcceeEEEeceEEEEeccccccccccccccccccccccccc-----cccccccccccceEeEeechhhheeeeeeeeEEe Q lcl|NC_011085. 233 ERGSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPK-----TAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLE 307 (343) Q Consensus 233 ~~G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e 307 (343) .--.+..+.|++.+....--.. .+.......++.+.... ...++ .....-+.|..+++ +.-... 
T Consensus 203 t~~~la~ll~v~~i~vg~a~~~--~~~~~~~~iw~~~~vl~yv~~~~~~~~-~~~~epsfGyT~~~--------~g~~~~ 271 (307) T protein:vir:10 203 TVDLLKEIFEVENIAVGEAIYA--DDKDRFTDIWGANIVLAYVPLQRGGQQ-RTPYEPSYGYTLRK--------KGNPVV 271 (307) T ss_pred CHHHHHHHhCceeEEEeeeeee--ccCCccceeCCCceEEEecccccCCCC-CcccccccceeEEE--------cCCeEe Confidence 2224466778766654322111 01111111111111100 00000 00000011211111 111111 Q ss_pred eeeccchhhhhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 308 RARRAEYQADQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 308 ~~~~~~~~~d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) ..+.....+|.|+..-..=--++=|++---|.=.-| T Consensus 272 d~~~~~~~~~~~r~~~~~~~~i~~~~~G~li~~~~~ 307 (307) T protein:vir:10 272 DTRIEDGKLELVRSTDIFRPYLLGADAGYLISGING 307 (307) T ss_pred eceecCCceeEEeccccccceeecccccceeccCCC Confidence 112223333333333222222333333223333333 No 210 >protein:vir:99576 Length: 388 # NCBI annotation: hypothetical protein # Family: family:all:1653 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039801;genbank:gi:126011051;genbank:GeneID:4818271 Probab=68.24 E-value=0.24 Score=23.98 Aligned_cols=308 Identities=10% Similarity=0.033 Sum_probs=118.2 Q ss_pred CCCCC---ccccccccccccc-cccchhHHHHHHHHHHHHHHHHHhhhhccCcccccccc---ceEEEEecc---Cccee Q lcl|NC_011085. 1 MADMK---GGQQLGKDQGKGQ-SGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISS---GKSAQFPVL---GRTRA 70 (343) Q Consensus 1 ~~~~~---~~~~~~t~~g~~~-~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~---G~tv~i~~i---G~~t~ 70 (343) +.++. .+.. ....|..- ++.-+.+-|+.-|...|.+....--+...++.+.+ ++ -+++.|+.. 
|.+.+ T Consensus 57 ~~~~~~~~~a~d-a~~~~~~t~~~~gip~~~~~~~~p~~~~~~~~p~~~~~l~pv~t-~g~W~~~~~~f~v~e~~G~A~~ 134 (388) T protein:vir:99 57 LHEGGVATQAFD-SAYVAPTTQASIPTPIQFLQQWLPGFVKVLTSARKIDEILGVKT-VGSWEDQEIVQGIVEPAGTAME 134 (388) T ss_pred hhhhhhhhcccC-cccccccccCcccHHHHHhhhhccceeeeeechhhhhhhccccc-cCCccceeEEEeeeecceeEEE Confidence 00000 0000 00001110 11112344666666555443333334455665554 22 245666553 66543 Q ss_pred eeecCCCcCCCccCCCccceEEEEeeeeeeeeeeccchHHHH---hchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhc Q lcl|NC_011085. 71 AYLQAGQSLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAM---NHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNM 147 (343) Q Consensus 71 ~~~~~g~~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q---~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~ 147 (343) |..+.+++-...+.+-.++++..=+ ..+.+.+.+..+ +.+|+-.+-.+.+..+|.+..++..|.-.. T Consensus 135 --ygd~~D~Pl~d~~~~~~~r~v~~~~---~g~~yg~~El~~A~~~g~~l~~~Ka~AA~~ale~~~N~i~f~G~~----- 204 (388) T protein:vir:99 135 --YGDLTNIPLSSWNVNFERRTIVRGE---MGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWE----- 204 (388) T ss_pred --eecccCCCceeccceeeeeeEEEEE---eeeeecHHHHHHHHhhCCCcHHHHHHHHHHHHHhhhceEEEEeec----- Confidence 3323333211122233333333222 224455444332 478888888888888888888876542211 Q ss_pred cccccccccccCC-ce----eecccccccccchHHHHHHHHHHHHHHHHHHhhcC--C--CcC-CcEEEeCHHHHHHHhc Q lcl|NC_011085. 148 PAASNENIAGLGS-AS----ILEVGAKGDLTSPVELGKAVIAQLTIARAKLTSNY--V--PSA-DRTFYTTPEVYSAILA 217 (343) Q Consensus 148 ~~~~~~~~~g~~~-~~----~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~--V--P~~-gR~~vv~P~~~~~Ll~ 217 (343) ......+.|+-. .+ +...+..+..+=..++.+.|+++|..+...|.... + |+. ...++|||..|..|-. 
T Consensus 205 -g~~~~~~yGllNdP~l~a~v~at~~~~~~~Wa~kT~~eI~~Di~~~~~~i~~qs~g~~~~~~~~~tL~LP~~~~~~Ls~ 283 (388) T protein:vir:99 205 -GKNGNRTFGFLNDPSLLPAIASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSV 283 (388) T ss_pred -CCCccceEEEeeCCCcccccccccCCcCcccccCCHHHHHHHHHHHHHHHHHhcCCeeeecccceEEEechHHHHhccc Confidence 000000111110 10 00111111111123346678888888777775442 2 322 3478899999999853 Q ss_pred cchhhhhccccccchhcceeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhhe Q lcl|NC_011085. 218 ALMPNAANYAALIDPERGSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVG 297 (343) Q Consensus 218 ~~~~~~~~~~~~~~~~~G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~ 297 (343) ...+ +.+-..-+++ +.-+++|...+.|-..+.+.+....--.... .-+...++-++..+..+.+-....++. T Consensus 284 ~n~~---g~Tvl~~lk~----n~Pnl~i~t~pEl~~a~~tgg~~~~~~~~~~-~~~~~~~~~~~~~t~~~~~p~~~~~l~ 355 (388) T protein:vir:99 284 VTDL---GISVRDWLKQ----TYPRVRVMSAPELQGGNPDDGKDIAYMFLDS-VDTAVDGSTDGGDTWAQLVQSKFVTLG 355 (388) T ss_pred cCcC---CccHHHHHHH----hcCCcEEEEecccccccccCCceeEEEEecc-cccccccCccCcceeEEeccccccccc Confidence 2111 1111111222 2445566665555322111100000000000 000000000000000000000001110 Q ss_pred eeeeeeeEEeeeeccchhhhhhhhhhh-hccceecccceEEEEec Q lcl|NC_011085. 298 TVKLKDLSLERARRAEYQADQIIARYA-MGHGGLRPEAAGALVFT 341 (343) Q Consensus 298 ~~~~~~~~~e~~~~~~~~~d~i~~~~~-~G~~v~rpe~~~~i~~~ 341 (343) ++. 
....+.+....+ .|+-+.||.+++.+.== T Consensus 356 -vq~-----------~~~~~~~~~~~rt~Gv~ir~P~Ai~~~~GI 388 (388) T protein:vir:99 356 -VEK-----------RVKNYVEAYSNATAGVMLKRPWAVVRLIGL 388 (388) T ss_pred -cee-----------cCceeEeccccceeeeEEeccchhheeccC Confidence 111 112233333334 37778888776654433 No 211 >protein:vir:107732 Length: 379 # NCBI annotation: gp23 # Family: family:all:1653 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024871;genbank:gi:48697513;genbank:GeneID:2948349 Probab=64.84 E-value=0.29 Score=23.50 Aligned_cols=300 Identities=13% Similarity=0.030 Sum_probs=124.5 Q ss_pred CCCC--CccccccccccccccccchhHHHHHHHHHHHHHHHHHhhhhccCcccccccc--ceEEEEec---cCcceeeee Q lcl|NC_011085. 1 MADM--KGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISS--GKSAQFPV---LGRTRAAYL 73 (343) Q Consensus 1 ~~~~--~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~--G~tv~i~~---iG~~t~~~~ 73 (343) |... ..++...+-......+|=+. |+.-|-..+.+..-.--+...++.+.+.-. -+++.|+. .|.++ -| T Consensus 56 md~~~~~~~~~~~~~l~~~~~~g~~~--~l~~~~p~~i~~~tap~~a~~l~pv~t~g~W~~~~~~~~v~e~~G~A~--~y 131 (379) T protein:vir:10 56 MDSNDIGPIPTPLSPLSPVSIPGLIQ--FLQNWLPGHVRILTAVREADEFLGLSTVGQWDDEQIVQRVLEGLGTAQ--PY 131 (379) T ss_pred hccccccccccccCccccccccchHH--HHHhhcchHHHHHhhhhhhhhhcccccCCCceeeeEEEeeeeeeeeeE--Ee Confidence 4322 22222111111112223232 777777655554444445566666655211 24555554 45554 33 Q ss_pred cCCCcCCCccCCCccceEEEEeeeeeeeeeeccchHHHH---hchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccc Q lcl|NC_011085. 74 QAGQSLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAM---NHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAA 150 (343) Q Consensus 74 ~~g~~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q---~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~ 150 (343) ..+.+.+-...+.+-.++.+..=+ ..+.+.+.+..+ +..|+-.+-.+.+..+|.+..|+..+.-... .... 
T Consensus 132 gd~~d~pl~d~~~~~~~r~v~~~~---~g~~yg~~El~~Aa~~g~~l~~~Ka~aA~~ale~~~N~i~f~G~~d---~~~~ 205 (379) T protein:vir:10 132 TDGGNMALMSWTPTFETRTVVRFE---AGLQVAPLEEARSSRVQVSSADEKRAMVGEALEVQRNRVAFYGYND---GSGR 205 (379) T ss_pred ccccCCCeeeeeeeeeeeeeEEEE---EEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecC---CCcc Confidence 323332111111122222222111 223444444222 4688888888888888888888765421100 0000 Q ss_pred ccccc--cccCCceeecccccccccch-HHHHHHHHHHHHHHHHHHhhc---C-CCcCC-cEEEeCHHHHHHHhccchhh Q lcl|NC_011085. 151 SNENI--AGLGSASILEVGAKGDLTSP-VELGKAVIAQLTIARAKLTSN---Y-VPSAD-RTFYTTPEVYSAILAALMPN 222 (343) Q Consensus 151 ~~~~~--~g~~~~~~~~~~~~~~~~~~-~~~~~~i~~~l~~a~~~Ld~~---~-VP~~g-R~~vv~P~~~~~Ll~~~~~~ 222 (343) .++.. ...........++ +..++. .++.+.|+++|..+-..|-.+ . .|.+- ..++++|..+..|-.-..+ T Consensus 206 ~yGllNdP~l~a~~t~atg~-~~~t~Wa~kT~~eI~~Di~~~~~~l~~qs~g~~~~~~~~~tL~LP~~~~~~L~~~n~~- 283 (379) T protein:vir:10 206 TFGFLNDPNLPAYVAVPNGA-GGSPLWAQKTTLEIIADLRNGLTALQVQSMGRIKSNKTPITIGIPNAYENYITTPTEL- 283 (379) T ss_pred eEEEEeCCCCcccccccCCc-ccccccccCCHHHHHHHHHHHHHHHHHhhCCeecccccceeEEecHHHHHhhcccccc- Confidence 00000 0111111111111 111111 234566777777766665433 2 25433 4789999999999643211 Q ss_pred hhccccccchhcceeEEEeceEEEEeccccccccccccccccccccccccccccccccccc---cceEeEeechh--hhe Q lcl|NC_011085. 223 AANYAALIDPERGSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVAL---DNVVGLFQHRS--AVG 297 (343) Q Consensus 223 ~~~~~~~~~~~~G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~l~~~~~--Av~ 297 (343) +..-..-+++ +.-+++|...+.|-..+.. +.... ....+..+.- ...+-+.+... ++. 
T Consensus 284 --g~Tvl~~lk~----n~Pnl~i~t~pEL~~aggg---------~~~~~--~~~~~~~~~~t~~~~~~~~~~p~k~~~l~ 346 (379) T protein:vir:10 284 --GYSVAQYMRE----SYPNVTFVSAPELNDANGG---------SSAIY--YYADAVENNGTDDGRTWLQVVPTKMFTLG 346 (379) T ss_pred --CccHHHHHHH----hcCCcEEEEcccccccCCC---------ccEEE--EEeeccCCCccCCcceEEEecchhhhhcc Confidence 0100111221 2445677777766321100 00000 0000000000 00111111111 110 Q ss_pred eeeeeeeEEeeeeccchhhhhhhhhhhh-ccceecccceEEEEec Q lcl|NC_011085. 298 TVKLKDLSLERARRAEYQADQIIARYAM-GHGGLRPEAAGALVFT 341 (343) Q Consensus 298 ~~~~~~~~~e~~~~~~~~~d~i~~~~~~-G~~v~rpe~~~~i~~~ 341 (343) + .+....+.+....+. |+-+.||-+++-+.-. T Consensus 347 -v-----------e~~~~~~~~~~~~rt~Gv~ir~P~Ai~~~~G~ 379 (379) T protein:vir:10 347 -V-----------EKKIKGYAEGYTNATAGAMLKRPFATYRQTGA 379 (379) T ss_pred -c-----------eecCceeEeccccceeeeeeecchhhheecCC Confidence 0 112334455555554 8888889887776666 No 212 >protein:vir:104549 Length: 462 # NCBI annotation: gp23 # Family: family:all:364 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214669;genbank:gi:61806310;genbank:GeneID:3294604 Probab=64.82 E-value=0.29 Score=23.50 Aligned_cols=299 Identities=18% Similarity=0.120 Sum_probs=129.4 Q ss_pred CCCCCccc------cccccccccccccchhHHHHHHHHHHHHHHHHHhhh--------hccCccccccccceE------- Q lcl|NC_011085. 1 MADMKGGQ------QLGKDQGKGQSGGDKLALFLKVFGGEVLTAFARTSV--------TTNRHIMRSISSGKS------- 59 (343) Q Consensus 1 ~~~~~~~~------~~~t~~g~~~~~~d~~al~ie~~~g~V~~~f~~~s~--------~~~~~~~~~i~~G~t------- 59 (343) |-.= +|. .+++....++.++. .+|| .|.+..|....- +..........+... 
T Consensus 97 MTgP-TGLIFAmRsrY~~~~~~~nq~gt-EAlf-----nEadt~fSg~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ 169 (462) T protein:vir:10 97 MTGP-TGLIFAMRSFYGSERRPANSDFR-EALF-----NEPNAGFSGGAGTGLSNYDPTASSSAVNDAEGANPGLLNDSP 169 (462) T ss_pred CCcc-hhhhheeeeeccCCccccccccc-hhhh-----ccCCcCccccccccccccccccccccccccccccceeecCCC Confidence 2110 000 11111111111221 1222 333333321100 000000000000000 Q ss_pred ---EEEe--ccCcceeeeecCCCcCCCccCCCccceEEEEeeeeeeee--------eeccchHHHHh-c-hhhHHHHHHH Q lcl|NC_011085. 60 ---AQFP--VLGRTRAAYLQAGQSLDDKRKDIKHTEKTIVIDGLLTAD--------VLIYDIEDAMN-H-YDVRSEYTSQ 124 (343) Q Consensus 60 ---v~i~--~iG~~t~~~~~~g~~i~~~~~~~~~~~~~l~iD~~~~~~--------~~Idd~D~~q~-~-~d~~~~~~~~ 124 (343) ..+. ..|..+. .++.+..........+.-++||+...-+ ..+.-..+.++ | .|.-.|++.= T Consensus 170 ~g~~~~~~~~~GM~Ta----~aE~lg~~s~n~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAIHGLDAEtELaNI 245 (462) T protein:vir:10 170 AGTYEVTGDATGMATA----TAEALDDSSASTAFREMGFSIEKVTVTAKSRALKAEYSIEMAQDLKAIHGLDAESELANI 245 (462) T ss_pred ccceecccccccccch----hccccCCccCCcchhhceeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChhHHHHHH Confidence 0010 0111111 0111111011123456677787764332 44555555666 3 8888899999 Q ss_pred HHHHHHHHHHHHHHHHHHhhhhccccccccccccCCceeecccccccccchHHHHHHHH-HHHHHHHHHHhhcCCCcCCc Q lcl|NC_011085. 125 IGESLAMAADGAVLAELAGLCNMPAASNENIAGLGSASILEVGAKGDLTSPVELGKAVI-AQLTIARAKLTSNYVPSADR 203 (343) Q Consensus 125 ~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~-~~l~~a~~~Ld~~~VP~~gR 203 (343) .+..+...+.+-|++.+...+.. 
....+.....+++.....+.....+.++.++ ....++.+...+- ---.+- T Consensus 246 LSTEImlEINReii~~l~~~a~~-----~k~~~~~~~Gv~dl~~~~~gr~~~e~~k~l~~qi~~ean~i~~~t-~r~~~n 319 (462) T protein:vir:10 246 LSTEILAEINREVVRTIYVNAVK-----GAIANTATDGIFDLDVDSNGRWSVEKFKGLLFQIERDSNAIGQET-RRGKGN 319 (462) T ss_pred HHHHHHHHhhHHHHhhhhhhhee-----eecccccccceeeeccccchHHHHHHHHHHHHHHHHHHHHHHHHh-ccccce Confidence 99999999999999988754432 1223333334444322222111122223222 2112222222222 123467 Q ss_pred EEEeCHHHHHHHhccchh--h---hhc-cc-cccchhcceeEEE-eceEEEEec----cccccccccccccccccccccc Q lcl|NC_011085. 204 TFYTTPEVYSAILAALMP--N---AAN-YA-ALIDPERGSIRNV-MGFEVVEVP----HLTAGGAGDDREDETTNQKHAF 271 (343) Q Consensus 204 ~~vv~P~~~~~Ll~~~~~--~---~~~-~~-~~~~~~~G~V~~i-~Gf~V~~sn----~lp~~~~~~~~~~~~~~~~~~~ 271 (343) |+|++|++.+.|-...-+ . +.+ .. ..++.-...+|.+ .|++||.-+ |-|.-=.. T Consensus 320 ~~i~S~~Va~~La~sG~l~~~p~~~~~~~~~~~d~~~~~~~G~l~~r~~vy~D~Y~~~ns~~dy~~-------------- 385 (462) T protein:vir:10 320 ILICSADVASALGMAGVLDYAPGLQGNSALTGVDDTSSTLVGTLNGRIKVYVDPYSSNVADKHFYV-------------- 385 (462) T ss_pred EEEEchhHHHHhhhccchhccccccccccccccccccceeEEEecCceEEEEecccCCCcccceEE-------------- Confidence 999999999998554422 1 111 11 1223344456765 457888753 33321111 Q ss_pred cccccccccccccceEeEeechhhheeeeeeeeEEeeeeccchhhhhhhhhhhhccceecccceEE----EEecCC Q lcl|NC_011085. 272 PKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERARRAEYQADQIIARYAMGHGGLRPEAAGA----LVFTAG 343 (343) Q Consensus 272 ~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~~~~~~~d~i~~~~~~G~~v~rpe~~~~----i~~~~g 343 (343) ..|+++-....++||.|- +.+.+ .+.-||+.|--.|-.+.+||-.+- |=.-.. 
-...+| T Consensus 386 -----vG~KG~~~~~~glfy~PY----v~l~~---~~~~dp~sfqP~~g~~tRY~l~~N-P~t~~~~~~~~~~~~~ 448 (462) T protein:vir:10 386 -----AGYKGTSPYDAGLFYCPY----VPLQQ---VRAINPNTFQPKIGFKTRYGMVSN-PFSGGLTQGSGALTAN 448 (462) T ss_pred -----EEEeCCcccccceeeccc----ccccc---ccccCCccccceeeeeeeeeeeec-CCCCCcCCcccccccc Confidence 112333334457899885 22332 223499999999999999986542 321111 112223 No 213 >protein:vir:106286 Length: 534 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944113;genbank:gi:38640157;genbank:GeneID:2658034 Probab=61.86 E-value=0.35 Score=23.11 Aligned_cols=309 Identities=15% Similarity=0.067 Sum_probs=127.3 Q ss_pred CCCCCccccccccccccccccc---hhHHHHH-------------------------------HHHHHHHHHHHHhhhhc Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQSGGD---KLALFLK-------------------------------VFGGEVLTAFARTSVTT 46 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d---~~al~ie-------------------------------~~~g~V~~~f~~~s~~~ 46 (343) |-. ++|.=-.-|.-.++.+++ .-++|-| .|..++.+.....+... T Consensus 125 MTg-PTGLIFAMRsrY~n~~~~~s~~EAf~ne~~adt~fSG~~~a~~~~~~~~~~a~~~g~~~~~~~~~~t~~~~Gt~~~ 203 (534) T protein:vir:10 125 MTS-STGQVFTLRAIYGGNSQDANAREAFHPTYGPDADFSGRGAAQDIAVFVRGTAVASGAFAKLHIEAATGVQAGTKTV 203 (534) T ss_pred CCc-hhhhheeeeeeecCCCCCcccccccccccccccccccccccccccccccccccccccccccccccccccccccccc Confidence 100 000000000000000000 0011111 11111111111111111 Q ss_pred cCccccccc--------cceEE---------EEeccCcceeeeecCCCcCC--CccCCCccceEEEEeeeeeeee----- Q lcl|NC_011085. 47 NRHIMRSIS--------SGKSA---------QFPVLGRTRAAYLQAGQSLD--DKRKDIKHTEKTIVIDGLLTAD----- 102 (343) Q Consensus 47 ~~~~~~~i~--------~G~tv---------~i~~iG~~t~~~~~~g~~i~--~~~~~~~~~~~~l~iD~~~~~~----- 102 (343) ..+....+- .|..+ -...-|..+. 
.++.+- +...+..-.+..+.||+...-+ T Consensus 204 ~~~~~~~v~~~~~~~~~ag~~~~~~~~~~~~y~~~~gm~Ta----~AE~lg~~ggs~~~~f~EMsFsIdKvtVtAKSRaL 279 (534) T protein:vir:10 204 QFIKDYAVDALPADQTEAGLAYKWLLANGYAVETSSAMATA----FAELQQGFNGSADNEWNEMSFRIDKQVVEAKSRQL 279 (534) T ss_pred ccccccccccccCCccccccccccccccccceecccccchh----hHhhhccCCCCcccchhhcceEEEEEEEeeeccce Confidence 110000000 00000 0000011110 011110 0011122345667777764332 Q ss_pred ---eeccchHHHHh-c-hhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccccCCceeecccccccccchHH Q lcl|NC_011085. 103 ---VLIYDIEDAMN-H-YDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENIAGLGSASILEVGAKGDLTSPVE 177 (343) Q Consensus 103 ---~~Idd~D~~q~-~-~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~ 177 (343) ..|.-..+.++ | .|.-.|++.=.+..+...+++-|++.+...+..-........+ ....+.+.....+.....- T Consensus 280 KAEYTiELAQDLKAIHGLDAEtELsNILSTEImlEINReii~~l~~~a~~~k~~~~~~~~-~~~G~~d~~~~~~~~~~~~ 358 (534) T protein:vir:10 280 KAQYSIEMAQDLRAVHGLDADSELSSILANEIMHEINREMVLWINATAKVGKTGWTNMHG-GKAGVFDFQDTKDIRGARW 358 (534) T ss_pred eccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHHHHhhhhheeecccccccc-cccceeeeeccccccchhH Confidence 44555556666 3 8888899999999999999999999887655332111000000 1112333333332222111 Q ss_pred HH---HHHHHHHHHH-HHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhcccc-----c-cchhcceeEEE-eceEEE Q lcl|NC_011085. 178 LG---KAVIAQLTIA-RAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAA-----L-IDPERGSIRNV-MGFEVV 246 (343) Q Consensus 178 ~~---~~i~~~l~~a-~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~-----~-~~~~~G~V~~i-~Gf~V~ 246 (343) .+ +.++-.|.+. .+...+-. --.+-|+|++|++.+.|-....+......+ . 
+....=.+|.+ .|++|| T Consensus 359 ~~e~~~~L~~~i~~~an~i~~~T~-rg~~n~~v~S~~Va~~L~~~g~l~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy 437 (534) T protein:vir:10 359 AGESYKALVVQIDKEANEIARQTG-RGQGNFIICSRNVAAALGHTDMLMTPAVMGANTTMNTDTTSSLFAGVLAGKYRVY 437 (534) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhc-cccccEEEEchhHHHHHhhccchhccccccccccccccCCCceEEEEecCceEEE Confidence 22 2222222211 11111111 013569999999999997766543221111 0 11111135665 468999 Q ss_pred EeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEeeeeccchhhhhhhhhhhhc Q lcl|NC_011085. 247 EVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERARRAEYQADQIIARYAMG 326 (343) Q Consensus 247 ~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~~~~~~~d~i~~~~~~G 326 (343) .-++.|..=.. ..|+++.....+++|.|-. .+ ...+..||+.|--.|-.+.+|| T Consensus 438 ~D~y~~~dy~~-------------------vG~KG~~~~~~glfyaPYv----~l---~~~~~~dp~sfqP~~g~~tRY~ 491 (534) T protein:vir:10 438 IDQYAVEDYFT-------------------VGYKGASEMDAGLYYCPYV----AL---TPLRGTDPKNFQPVLGFKTRYG 491 (534) T ss_pred ecCCCCcceEE-------------------EEEeCCcccccceeecccc----cc---ccccccCCccccceeeeeeeec Confidence 88877642111 1233444444678998853 23 3334579999999999999998 Q ss_pred cceecccce-----EEEEecCC Q lcl|NC_011085. 327 HGGLRPEAA-----GALVFTAG 343 (343) Q Consensus 327 ~~v~rpe~~-----~~i~~~~g 343 (343) ..+- |=.- ..-++..| T Consensus 492 l~~N-P~~~~~~~~~~~~i~~g 512 (534) T protein:vir:10 492 VKLH-PMADATQNKGFAKISNG 512 (534) T ss_pred eeec-CcccccCCccccccccC Confidence 7653 3110 00122222 No 214 >protein:vir:103886 Length: 302 # NCBI annotation: putative major head subunit protein # Family: family:all:776 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938242;genbank:gi:38229147;genbank:GeneID:2648201 Probab=61.15 E-value=0.36 Score=23.02 Aligned_cols=273 Identities=13% Similarity=0.070 Sum_probs=109.6 Q ss_pred ccccccchhHHHHHHHHHHHHHHHHHhh-hhccCccccccccceEEEEeccCcc-eeeeecCCCcCCCccCCCccceEEE Q lcl|NC_011085. 
16 KGQSGGDKLALFLKVFGGEVLTAFARTS-VTTNRHIMRSISSGKSAQFPVLGRT-RAAYLQAGQSLDDKRKDIKHTEKTI 93 (343) Q Consensus 16 ~~~~~~d~~al~ie~~~g~V~~~f~~~s-~~~~~~~~~~i~~G~tv~i~~iG~~-t~~~~~~g~~i~~~~~~~~~~~~~l 93 (343) +........+|+ +-+...+...|+... -...+.+ +.-+..++-+...+|.. .+.... |+-. . ..+....-+| T Consensus 1 m~it~~~l~~l~-~~~~~~~~~~y~~a~~~~~~~a~-~~~sdf~~~~~~~lg~~p~l~e~~-Ge~~-~--~~l~~~~~~i 74 (302) T protein:vir:10 1 MLINKQSLNAAF-VAIKTIFNNAFAAAPTTWQKIAM-EVPSNTSSNDYKWLSTFPKMRRWI-GAKV-V--KNLKAYKYVV 74 (302) T ss_pred CcccHHHHHHHH-HHHHHHHHHHHHhhhhhhhceee-ecCCCcceeeceecCCCCCccccc-ccee-e--ccccccceeE Confidence 111112223333 245555666666442 2233322 21123344444444432 121111 2111 1 1233344455 Q ss_pred EeeeeeeeeeeccchHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccc--cccCCc-----eee-- Q lcl|NC_011085. 94 VIDGLLTADVLIYDIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENI--AGLGSA-----SIL-- 164 (343) Q Consensus 94 ~iD~~~~~~~~Idd~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~--~g~~~~-----~~~-- 164 (343) .+.++- -.+.|+.-+-.-=++..-..+.+++|++-++..|+.++..|..+.+..-...... +.|..+ +.. T Consensus 75 ~~~~~g-~~v~i~R~~i~nDdlg~~~~~~~~~G~aaa~~~~~lv~~~L~~g~~~~~~DG~~fF~~dH~~g~~~~~N~g~~ 153 (302) T protein:vir:10 75 ENEDFE-ATVEVDRNDIEDDQIGIYSPQAKMAGYSAAQLPDELVYEAVNGAFTKPCFDGQYFIDTDHPVGDASVSNKGTA 153 (302) T ss_pred Eeeccc-ceecccHHhhcccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCCcccCCcceecccccccccccccccch Confidence 554432 2233432111111356777889999999999999999988764322110000000 111100 000 Q ss_pred cccccccccchHHHHHHHHHHHHHHHHHH-hhc--CCCcCCcEEEeCHHHHHH---HhccchhhhhccccccchhcceeE Q lcl|NC_011085. 165 EVGAKGDLTSPVELGKAVIAQLTIARAKL-TSN--YVPSADRTFYTTPEVYSA---ILAALMPNAANYAALIDPERGSIR 238 (343) Q Consensus 165 ~~~~~~~~~~~~~~~~~i~~~l~~a~~~L-d~~--~VP~~gR~~vv~P~~~~~---Ll~~~~~~~~~~~~~~~~~~G~V~ 238 (343) .........+++. ++..+.+..++ +.. .+--..+++||+|..... |+.+.+.. .+......|. 
T Consensus 154 ~~~~~~~~l~~~~-----~~aa~~am~~~k~~~G~~L~i~P~~LiVp~~le~~A~~ll~~~~~~----~g~~Np~~g~-- 222 (302) T protein:vir:10 154 PLSNASQAAAKAG-----YGAARTAMKKFKDEEGRSLNVSPNVLLVGPALEDVAKMLLTNPKLA----DNTPNPYVGT-- 222 (302) T ss_pred hhhhcccccchHH-----HHHHHHHHHHHhhhcccccccCCCEEEecchhHHHHHHHhhccccC----CCCcceeccc-- Confidence 0000011111111 22222222222 222 222236899999987554 34443322 1222222232 Q ss_pred EEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeee---eeeeEEeeeeccchh Q lcl|NC_011085. 239 NVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVK---LKDLSLERARRAEYQ 315 (343) Q Consensus 239 ~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~---~~~~~~e~~~~~~~~ 315 (343) +++++++.|...+ + .| |+..++.+-... .+.+++|..-+++.- T Consensus 223 ----~~~vv~p~L~s~~----a-----------------Wy---------L~a~~~~i~~~~l~g~~~P~~~~~~~~~~d 268 (302) T protein:vir:10 223 ----AELVVDGRIESDT----A-----------------WF---------LLDTTKPVKPFIFQPRKQPEFVSQVNLDSD 268 (302) T ss_pred ----eEEEEeeccCCCC----c-----------------eE---------EEecCCccceEEEcCccccEEEeccCCCCC Confidence 5788888774210 0 11 111111111111 223445554444444 Q ss_pred hhhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 316 ADQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 316 ~d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) +=.++-.+.||+ +.-+.+-+-+.|. T Consensus 269 gv~~k~~~d~Gv---d~R~~~G~~~wq~ 293 (302) T protein:vir:10 269 DVFNLRKLKFGA---EARAAAGYGFWQL 293 (302) T ss_pred ceEEEEEEEEee---eeeeecchhhhhh Confidence 445566666774 3333444434443 No 215 >protein:vir:95258 Length: 368 # NCBI annotation: Phage conserved protein # Family: family:all:570 # MgeID: mge:1561 # MgeName: Felix 01 # Cross-refs: genbank:acc:NP_944891;genbank:gi:38707831;genbank:GeneID:2744044 Probab=55.04 E-value=0.49 Score=22.29 Aligned_cols=317 Identities=11% Similarity=0.036 Sum_probs=120.5 Q ss_pred CCCCCccccccccccccccccchhHHHHHHHHHHHHH-HHHHhhhh--ccCccccccccceEEEEeccCc-cee-eeecC Q lcl|NC_011085. 
1 MADMKGGQQLGKDQGKGQSGGDKLALFLKVFGGEVLT-AFARTSVT--TNRHIMRSISSGKSAQFPVLGR-TRA-AYLQA 75 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~ie~~~g~V~~-~f~~~s~~--~~~~~~~~i~~G~tv~i~~iG~-~t~-~~~~~ 75 (343) |.|.- + .| +.-+-.-...|.+ -|... .+ +++...+.++ ..+|.|...+. .++ ....+ T Consensus 1 ~~d~f--~------------~d--~Fs~~~LT~ain~~p~~p~-~l~~lglF~~~~v~-t~~v~iE~~~~~l~Lvp~~~r 62 (368) T protein:vir:95 1 MLTNS--E------------KS--RFFLADLTGEVQSIPNTYG-YISNLGLFRSAPIT-QTTFLMDLTDWDVSLLDAVDR 62 (368) T ss_pred Ccccc--c------------CC--cccHHHHHHHHHhcCCCcc-eecccccccCCCcc-ceEEEEEEEcCeEEEccccCC Confidence 33320 0 00 0000001111111 11111 11 1445555443 46666655432 222 22233 Q ss_pred CCcCCCccCCCccceEEEEeeeeeeeeeeccchHHHH------------hchhhHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_011085. 76 GQSLDDKRKDIKHTEKTIVIDGLLTADVLIYDIEDAM------------NHYDVRSEYTSQIGESLAMAADGAVLAELAG 143 (343) Q Consensus 76 g~~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~~q------------~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~ 143 (343) |........+-.-+.+.+.+-...... .| .-|+.| +.-+++.+....+...+.....-..+..| + T Consensus 63 g~~~~~~~~~~~r~~~~f~~ph~~~~d-~I-~a~eiQg~RafG~~~~l~~v~~~v~~kl~~~r~~~d~T~E~~r~gAL-~ 139 (368) T protein:vir:95 63 DSRKAETSAPERVRQISFPMMYFKEVE-SI-TPDEIQGVRQPGTANELTTEAVVRAKKLMKIRTKFDITREFLFMQAL-K 139 (368) T ss_pred CCCCcccccCCceeEEEEecceecccc-cc-chHHHccccCCCChhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh-c Confidence 432211111111123333332221111 11 112222 22222223333333333332221112221 1 Q ss_pred hhhccccccccc----cccCC-ceeecccccccccchHHHHHHHHHHHHHHHHHHh-hcCCCcCCcEEEeCHHHHHHHhc Q lcl|NC_011085. 144 LCNMPAASNENI----AGLGS-ASILEVGAKGDLTSPVELGKAVIAQLTIARAKLT-SNYVPSADRTFYTTPEVYSAILA 217 (343) Q Consensus 144 ~a~~~~~~~~~~----~g~~~-~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld-~~~VP~~gR~~vv~P~~~~~Ll~ 217 (343) +- ...+....+ ..++- -.++...-.++.++. ...+-+.++.....|. ..-.+..+-.++++|++|..|.. 
T Consensus 140 G~-ilDadGtvl~dly~eFGit~~~v~f~l~~~~tdv---~~~~~~~~~~i~d~l~g~~~~~~~~v~alcg~~Ffd~L~~ 215 (368) T protein:vir:95 140 GK-VVDARGTLYADLYKQFDVEKKTIYFDLDNPNADI---DASIEELRMHMEDEAKTGTVINGEEIHVVVDRVFFSKLTK 215 (368) T ss_pred Ce-eECCCCcEEecchhhhCCccceEEEEeCCCCcCH---HHHHHHHHHHHHHhhcccccccccceEEEEChHHHHHhhc Confidence 11 000000000 00110 111222222222222 2333344444455554 34457778889999999999999 Q ss_pred cchhhhh--ccccc-------cchhcc---------eeEEEeceEEEEecc-ccccccc------ccccccccccccccc Q lcl|NC_011085. 218 ALMPNAA--NYAAL-------IDPERG---------SIRNVMGFEVVEVPH-LTAGGAG------DDREDETTNQKHAFP 272 (343) Q Consensus 218 ~~~~~~~--~~~~~-------~~~~~G---------~V~~i~Gf~V~~sn~-lp~~~~~------~~~~~~~~~~~~~~~ 272 (343) ++.+... .+... ..++.| ....+.|+.+.+..- .+..+.. .....++.+..|+++ T Consensus 216 h~~Vkeay~~~~~a~~~~~lr~~~r~g~~~~~~~~~~~F~fgGi~f~eYrg~~~~~~g~~~~~v~~d~v~I~~gea~~~P 295 (368) T protein:vir:95 216 HPKIRDAYLAQQTPLAWQQITGSLRTGGADGVQAHMNTFYYGGVKFVQYNGKFKDKRGKVHTLVSIDSVADTVGVGHAFP 295 (368) T ss_pred ChhHHHHHHHHHhhhhhhhhccccccccccccccccceeEecCEEEEEcceeecCCCcceeeeecCCceeeccCceEEEe Confidence 9875543 12111 122322 224678888877442 2211100 001123445566665 Q ss_pred ccc-cccccccccceEeEeechhhheeeeeeeeEEeeeeccchhhhhhhhhhhhccceecccceEEEEecCC Q lcl|NC_011085. 273 KTA-EGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERARRAEYQADQIIARYAMGHGGLRPEAAGALVFTAG 343 (343) Q Consensus 273 ~~~-~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~~~~~~~d~i~~~~~~G~~v~rpe~~~~i~~~~g 343 (343) .-. .+..++.|....+.+-+.+++.+- ++++-...+..+.-.+.-+..-...=.-..||+.++.++..+. 
T Consensus 296 ~G~~~~~~~~~F~~~~aPad~~e~vNt~-g~p~Ya~~~~~~~~~g~~le~qSnpLpic~RP~~lv~~~~~a~ 366 (368) T protein:vir:95 296 NVAMLGEANNIFEVAYGPCPKMGYANTL-GQELYVFEYEKDRDEGIDFEAHSYMLPYCTRPQLLVDVRADAK 366 (368) T ss_pred ecccccccCcceEEEecCCCcHhhcCCC-cccccceeeeccCCCeeEEEEeecccchhcccceeEEEEecCC Confidence 421 112333444444444444444432 2222222222222233334443444556789999999999888 No 216 >protein:vir:100603 Length: 529 # NCBI annotation: gp23 precursor of major head subunit # Family: family:all:364 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656387;genbank:gi:109290138;genbank:GeneID:4156581 Probab=54.92 E-value=0.49 Score=22.27 Aligned_cols=309 Identities=14% Similarity=0.053 Sum_probs=130.1 Q ss_pred CCCCCccccccccccccc-------------cccchh--HHHHHHHHHHHHHHHHHhhhhccCc----cc---------c Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQ-------------SGGDKL--ALFLKVFGGEVLTAFARTSVTTNRH----IM---------R 52 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~-------------~~~d~~--al~ie~~~g~V~~~f~~~s~~~~~~----~~---------~ 52 (343) .+.+..+....+-.+.++ ..++.. .-..+.|-.|....|-......... .. . T Consensus 144 f~~~~e~dt~~SG~~~~~~~~~~~~~~~~~~t~~~a~~~~~~~~~~~nea~t~~s~~~tg~~~~~g~~~tg~~~~~~~~~ 223 (529) T protein:vir:10 144 FHPMYAPDAWHSGLAAKGATTSSDGTPFAALTAGQAVATGDIVYHFFYESGSAYLQNVTGGNVTVGTNETGAALDALVSA 223 (529) T ss_pred ccccccccccccccccccccccccccccccccccceeeccccceeeecccccccccccccccccccccccCCcccccccc Confidence 111111111001111110 000000 0011222222222222111100000 00 0 Q ss_pred ccccceEEEEeccCcceeeeecCCCcC---CCccCCCccceEEEEeeeeeeee--------eeccchHHHHh-c-hhhHH Q lcl|NC_011085. 53 SISSGKSAQFPVLGRTRAAYLQAGQSL---DDKRKDIKHTEKTIVIDGLLTAD--------VLIYDIEDAMN-H-YDVRS 119 (343) Q Consensus 53 ~i~~G~tv~i~~iG~~t~~~~~~g~~i---~~~~~~~~~~~~~l~iD~~~~~~--------~~Idd~D~~q~-~-~d~~~ 119 (343) .+..| .+....-|..+.. ++.+ .++ ....-.+.-+.||+...-+ ..+.-..+.++ | .|.-. 
T Consensus 224 ~~a~~-~~~~~~~gmsTa~----aEal~~~g~s-s~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAvHGLDAEt 297 (529) T protein:vir:10 224 KIAAG-ELAEIAEGMATSI----AELRQGFNGT-TDNPWNEMSFRIDKQTVEAKSRQLKAQYSIELAQDLRAVHGMDADS 297 (529) T ss_pred ccccc-cccccccccchhh----hhccccCCCC-ccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHH Confidence 00011 1111112222221 1111 111 1123456677777764332 44555556666 3 88889 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccccCCceeecccccccccc---hHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_011085. 120 EYTSQIGESLAMAADGAVLAELAGLCNMPAASNENIAGLGSASILEVGAKGDLTS---PVELGKAVIAQLTIARAKLTSN 196 (343) Q Consensus 120 ~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~g~~~~~~~~~~~~~~~~~---~~~~~~~i~~~l~~a~~~Ld~~ 196 (343) |++.=.+..+...+++-||+.+-..++....-.....+. ...+.+.....+... ..+.++.++-.+-+.....-.+ T Consensus 298 ELsNILStEImlEINReii~~i~~~a~~~~~g~~~~~~~-~~gv~d~~~~~d~~~~~~~~e~~~~L~~~i~~~an~I~~~ 376 (529) T protein:vir:10 298 ELNGILANEVMLEINREVIDWINYTAQVGKSGWTQTVGS-AAGVFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQ 376 (529) T ss_pred HHHHHHHHHHHHHhhHHHHHHhhhhceeeeeeeeccccc-cccceeccccccccccchhHHHHHHHHHHHHHHHHHHHHh Confidence 999999999999999999986655444321100000000 111222222221110 1222333333332222222221 Q ss_pred CCCcCCcEEEeCHHHHHHHhccchhhhhccc--ccc---chh-cceeEEE-eceEEEEeccccccccccccccccccccc Q lcl|NC_011085. 197 YVPSADRTFYTTPEVYSAILAALMPNAANYA--ALI---DPE-RGSIRNV-MGFEVVEVPHLTAGGAGDDREDETTNQKH 269 (343) Q Consensus 197 ~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~--~~~---~~~-~G~V~~i-~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~ 269 (343) ---..+-|+|.+|++.++|-..+.+...... +++ +-. .=..|.+ .|++||.-++.|..=.. 
T Consensus 377 T~rg~~n~vi~S~~Va~~L~~~~~~~~~~~~~~~sg~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~------------ 444 (529) T protein:vir:10 377 TGRGAGNFIIASRNVVSALALVDAGITPAAQGMASGLNADTTKGVFAGVLGGRYKVYIDQYARQDYFT------------ 444 (529) T ss_pred hccccceEEEEchHHHHHHhhhccccccccccccccceeecCCceEEEEecCceEEEecCCCCcceEE------------ Confidence 1112366999999999988643222211110 011 111 1134554 56799988776642211 Q ss_pred cccccccccccccccceEeEeechhhheeeeeeeeEEeeeeccchhhhhhhhhhhhccceecccce-----EEEEecCC Q lcl|NC_011085. 270 AFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERARRAEYQADQIIARYAMGHGGLRPEAA-----GALVFTAG 343 (343) Q Consensus 270 ~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~~~~~~~d~i~~~~~~G~~v~rpe~~-----~~i~~~~g 343 (343) ..|+++-....+++|.|- +. +++-+..||+.|--.|-.+.+||..+ .|=.. -.-++..| T Consensus 445 -------vG~KG~~~~~~glfy~PY----v~---l~~~~~~dp~sfqP~~g~~tRY~l~~-NP~~~~~~~~~~~r~~~g 508 (529) T protein:vir:10 445 -------MGYRGANNLDAGIYYCPY----VA---LTPLRGSDPKNFQPVMGFKTRYAIGV-NPFAESRTQAPTSRISNG 508 (529) T ss_pred -------EEEeCCcccccceeeccc----cc---cccccccCCCcccceeeeeeeeceee-cCccccccccccccccCC Confidence 112333334457888885 22 34444579999999999999998765 44111 12234444 No 217 >protein:vir:6601 Length: 528 # NCBI annotation: major capsid protein # Family: family:all:364 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891732;genbank:gi:33620668;genbank:GeneID:1725275 Probab=54.31 E-value=0.51 Score=22.20 Aligned_cols=312 Identities=12% Similarity=0.054 Sum_probs=129.6 Q ss_pred CCC---------CCccccccccccccc---cccchhHHHHHHHHHHHHHHHHHhhhhccCccccccccceEEEEeccCc- Q lcl|NC_011085. 1 MAD---------MKGGQQLGKDQGKGQ---SGGDKLALFLKVFGGEVLTAFARTSVTTNRHIMRSISSGKSAQFPVLGR- 67 (343) Q Consensus 1 ~~~---------~~~~~~~~t~~g~~~---~~~d~~al~ie~~~g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG~- 67 (343) |-. -.|+++. +..+.+. ..-.++++|=|.-..+-.....-..+|+.+.+..+...|+.+.++.... 
T Consensus 116 MTgPTGlIFAmRs~Y~~~~-~~~~~~eAfh~~~g~ea~fsea~t~~a~~gGpTGliFAm~s~y~s~~~g~ea~~nea~t~ 194 (528) T protein:vir:66 116 MSTPTSQIFAIRSVYGGDP-LKSGAREAFHPMYAPDAFHSSLAAKEATVGSPTGTAFAKLTLSQAITAGDIVYHTFAETG 194 (528) T ss_pred CCchhhhheeeeeeecCCc-ccccccccccccccccccccccccccccccCCccceeecccccccccccceeeecccccc Confidence 211 0001110 0000000 0011122221110000000000111222222111222222222211000 Q ss_pred --------------------------------ceeeeecC------CCc---CCCccCCCccceEEEEeeeeeeee---- Q lcl|NC_011085. 68 --------------------------------TRAAYLQA------GQS---LDDKRKDIKHTEKTIVIDGLLTAD---- 102 (343) Q Consensus 68 --------------------------------~t~~~~~~------g~~---i~~~~~~~~~~~~~l~iD~~~~~~---- 102 (343) ..+..+.. ++. +.++ ....-.+.-+.||+...-+ T Consensus 195 fs~~~~~~~~~~~~~~~g~~~g~~~~~~~~a~~~~~~~~~Gm~Ta~aEale~lg~~-s~~~f~EMaFsIeK~tVtAKSRa 273 (528) T protein:vir:66 195 IAYLQNVTGDSVTPQKVGSESEDEVVMKLIEEGKLAEIAFGMATSIAEIQEGFNGS-SNNPWAEMSMRIDKQVVEAKSRQ 273 (528) T ss_pred eeeeccccccccccCcccccccccccccccccccceecccccchhhhhhhcccCCC-cccchhhcceEEEeEEEEeeccc Confidence 00000000 000 1010 0112345667777664332 Q ss_pred ----eeccchHHHHh-c-hhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccccc-CCceeecccccccccc- Q lcl|NC_011085. 103 ----VLIYDIEDAMN-H-YDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENIAGL-GSASILEVGAKGDLTS- 174 (343) Q Consensus 103 ----~~Idd~D~~q~-~-~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~g~-~~~~~~~~~~~~~~~~- 174 (343) ..+.-..+.++ | .|.-.|++.=.+..+...+.+-||..+--.++.-. .....+. ....+.+.....+... T Consensus 274 LKAEYTiELAQDLKAIHGLDAEtELsNILStEImlEINREii~~i~~~a~~~~--~~~t~~~~~~aG~~dl~~~~d~~g~ 351 (528) T protein:vir:66 274 LKARYSIEVAQDLRAVHGMDADAELNAILANEVLLEINREIVDVINFTAQVGK--TGMTQTVGSKAGVFDLQDPIDTRGA 351 (528) T ss_pred eeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhheeeeee--eeeeeccccccceeecccccccccc Confidence 34554555556 3 78888999889999999999999865422221110 0000000 0112233222222111 Q ss_pred --hHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhccc------cccchhcceeEEE-eceEE Q lcl|NC_011085. 
175 --PVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYA------ALIDPERGSIRNV-MGFEV 245 (343) Q Consensus 175 --~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~------~~~~~~~G~V~~i-~Gf~V 245 (343) ..+.++.++-.|-+.....-.+---..+-|+|++|++.+.|-........+.. ..+....=.+|.+ .|++| T Consensus 352 rw~~e~~k~L~~~i~~~an~I~~~T~r~~gn~vi~S~~Va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~v 431 (528) T protein:vir:66 352 RWAGESFKSLIYQIDKEAAEIARQTGRGAGNFVIASRNVVNILASADQGISLAMQGAAKGLNTDTTKAVFAGVLAGKYKV 431 (528) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHhhccccccEEEEchHHHHHHhhccccccccccccccccccCCCCceeEEEecCceEE Confidence 11223333333333222222221113457999999999999776532221111 1111112235665 46899 Q ss_pred EEeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEeeeeccchhhhhhhhhhhh Q lcl|NC_011085. 246 VEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERARRAEYQADQIIARYAM 325 (343) Q Consensus 246 ~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~~~~~~~d~i~~~~~~ 325 (343) |.-++.|..=.. ..|+++-....+++|.|-.- ++..+..||+.|--.|-.+.+| T Consensus 432 y~D~y~~~dy~~-------------------vG~KG~~~~~~glfyaPYv~-------l~~~~~~dp~sfqP~~g~~tRY 485 (528) T protein:vir:66 432 FIDQYARQDYFT-------------------VGYKGDNEMDAGIYYAPYVA-------LTPLRATDPQSFHPVLGFKTRY 485 (528) T ss_pred EecCCCCcceEE-------------------EEEeCCcccccceeeccccc-------ceeeEeeCCccccceeeeeeee Confidence 988876642211 12334444446789988632 3445678999999999999999 Q ss_pred ccceecccceE-----EEEecCC Q lcl|NC_011085. 326 GHGGLRPEAAG-----ALVFTAG 343 (343) Q Consensus 326 G~~v~rpe~~~-----~i~~~~g 343 (343) |..+ .|=... 
.-++..| T Consensus 486 ~l~v-NP~~~~~~~~~~~ri~~g 507 (528) T protein:vir:66 486 GIGI-NPFADSKSQEPSARITSG 507 (528) T ss_pred ceee-cCcccccCcccccccccc Confidence 8765 441111 1122233 No 218 >protein:vir:99888 Length: 309 # NCBI annotation: capsid protein # Family: family:all:908 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164075;genbank:gi:56692607;genbank:GeneID:3192616 Probab=53.72 E-value=0.52 Score=22.13 Aligned_cols=279 Identities=10% Similarity=-0.007 Sum_probs=98.8 Q ss_pred CCCCCccccccccccccccccchhHHHHHHHH-HHHHHHHHHhhhhccCccccccccceEEEEeccCcceeee-----ec Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQSGGDKLALFLKVFG-GEVLTAFARTSVTTNRHIMRSISSGKSAQFPVLGRTRAAY-----LQ 74 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~ie~~~-g~V~~~f~~~s~~~~~~~~~~i~~G~tv~i~~iG~~t~~~-----~~ 74 (343) |++-. |+ -|+. |..++ |.=...|-..++| +.+++ ...+.+++..|+..... .. T Consensus 1 ~~~~~-------~~------~dp~---LT~~A~gy~n~~~Ia~~l~-P~vpV----~~~~~~~~~f~~~e~F~~~~t~r~ 59 (309) T protein:vir:99 1 MSNAP-------FP------IDPE---LTAIAIAYRNGRMISDEVL-PRVPV----GKQEFKFWKYDLAQGFTVPETLVG 59 (309) T ss_pred CCCCC-------cC------cCHh---HHHHHhhccChhhhhhhcC-Ccccc----Cccccceeeechhhcccccchhhc Confidence 33321 11 0111 11111 1001112222222 33332 22344555555543211 11 Q ss_pred CCCcCCCccCCCccceEEEEeeeeeeeeeecc--chHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccc Q lcl|NC_011085. 75 AGQSLDDKRKDIKHTEKTIVIDGLLTADVLIY--DIEDAMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASN 152 (343) Q Consensus 75 ~g~~i~~~~~~~~~~~~~l~iD~~~~~~~~Id--d~D~~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~ 152 (343) ++.... .-+...++.++.+.+.-- ...|| ++.++...+|++....+.....|....+-.+...+-..++. 
T Consensus 60 ~~~~~~--~v~~~~~~~~~~~~~~~L-~~~i~~~~~~~a~~~~d~~~~Av~~l~~~i~l~rE~~~A~lv~~~a~y----- 131 (309) T protein:vir:99 60 RKSKPN--EVEFSATDETGSTEDHGL-DAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSY----- 131 (309) T ss_pred cCCCcc--eEeecccCceeeecccce-eecCCchhhhhccCCCCHHHHHHHHHHHHHHHHHHHHHHHHhcChhhc----- Confidence 222211 112233344444433321 23444 44456667998888777665555444443222111111111 Q ss_pred ccccccCCceeecccccc----cccchHHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhc-cc Q lcl|NC_011085. 153 ENIAGLGSASILEVGAKG----DLTSPVELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAAN-YA 227 (343) Q Consensus 153 ~~~~g~~~~~~~~~~~~~----~~~~~~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~-~~ 227 (343) +.+..+.++++. ..-|| +..|..++.++. ...-.++++...|..|+.++.+..+- +. T Consensus 132 ------~~~~k~~Lsgt~~wsd~~SDP-------i~~i~~~~~~~g-----~~PN~~vlg~~~~~~l~~hp~i~~~ik~~ 193 (309) T protein:vir:99 132 ------AAGNKTTLSGADQWSDPTSNP-------LPVITDALDSVI-----LRPNIGVLGRRTATILRRHPKIVKAYNGS 193 (309) T ss_pred ------CCCceEEecCccccCCCCCCc-------HHHHHHHHHhhC-----CCcceEEechHHHHHHhhCHHHHHHhcCC Confidence 112222222221 12233 444555544431 23358999999999999999988763 43 Q ss_pred cc--cchhcceeEEEece-EEEEecccccccc-ccccccccccccccccccccccccccccceEeEeechhhheeeeeee Q lcl|NC_011085. 228 AL--IDPERGSIRNVMGF-EVVEVPHLTAGGA-GDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKD 303 (343) Q Consensus 228 ~~--~~~~~G~V~~i~Gf-~V~~sn~lp~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~ 303 (343) +. +.+..-.+..++|+ +|+........+. .........++ +.+.|++......+.+.-. T Consensus 194 ~~~~g~it~~~la~l~~ve~V~vg~a~~n~a~~g~~~~~~~iwg-----------------~~~~L~y~~~~~~~~~~ps 256 (309) T protein:vir:99 194 LGDEGMVPMAFLQELLELDAIYIGEARLNIARPGQNPNLIRAWG-----------------PHASFIYRDRLADTRNGTT 256 (309) T ss_pred CccccccCHHHHHHHhCcceEEeecceeeccccccccccccccC-----------------CcEEEEEcCCCCCCccccc Confidence 32 22333345677888 4665433321110 00001111111 1111222221111111000 Q ss_pred eEEeeeeccchhhhhhhhhhhh-ccceecccceEE-EEe--cCC Q lcl|NC_011085. 
304 LSLERARRAEYQADQIIARYAM-GHGGLRPEAAGA-LVF--TAG 343 (343) Q Consensus 304 ~~~e~~~~~~~~~d~i~~~~~~-G~~v~rpe~~~~-i~~--~~g 343 (343) .-.-..|..+..+..+.-.+-- |...+|---.+. +.+ -.| T Consensus 257 ~G~t~~~~~r~~g~~~d~~~~~~g~~~vr~~~~~k~~i~~~d~G 300 (309) T protein:vir:99 257 FGLTAQWGDRVSGSIADPNIGLRGGQRVRVGESVKELVTAPDLG 300 (309) T ss_pred ccceeecccccCCceeeeeeccCCceEEEEeccccchhcchhcc Confidence 0000011212222111111111 122222110000 000 011 No 219 >protein:vir:5255 Length: 304 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852760;genbank:gi:31544035;uniprot:Q7Y5U0;genbank:GeneID:2753552 Probab=38.99 E-value=1 Score=20.49 Aligned_cols=273 Identities=13% Similarity=-0.002 Sum_probs=106.0 Q ss_pred cchhHHHHHHHHHHHHHHHHH----hhhhccCccccc-cccce-EEEEec---cCcceeeee---cCCCcCCCccCCCcc Q lcl|NC_011085. 21 GDKLALFLKVFGGEVLTAFAR----TSVTTNRHIMRS-ISSGK-SAQFPV---LGRTRAAYL---QAGQSLDDKRKDIKH 88 (343) Q Consensus 21 ~d~~al~ie~~~g~V~~~f~~----~s~~~~~~~~~~-i~~G~-tv~i~~---iG~~t~~~~---~~g~~i~~~~~~~~~ 88 (343) =+.+| |+..=.-.|+....+ .-..+.++.+.+ +--+. ++.+.. +|..+ +| ....+|+.-. ..- T Consensus 1 ~~~la-fl~~qL~~id~~vye~~~~~~~~~~lipv~t~~~~~~~~~~~~~~d~~G~a~--~~~i~~~a~dip~vd--~~~ 75 (304) T protein:vir:52 1 MSLLA-YVKNGLTAVSKDIAETKYPEIVFPQFVYVDQQTAVGITEKLHYGADEHGSLD--DGLITVGTSTLDQVE--VGF 75 (304) T ss_pred CchHH-HHHHHHHHHhhhhhccccccchhhhhccccCCCCcccceEEEeeeeccCccc--ccccCCcCCccceee--ccc Confidence 22233 332211222222211 122334555443 11122 344333 45554 22 1123343322 222 Q ss_pred ceEEEEeeee-eeeeeeccchHHHHh-chhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccccCCceeecc Q lcl|NC_011085. 89 TEKTIVIDGL-LTADVLIYDIEDAMN-HYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENIAGLGSASILEV 166 (343) Q Consensus 89 ~~~~l~iD~~-~~~~~~Idd~D~~q~-~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~g~~~~~~~~~ 166 (343) ++....|-.. .-+...+.++..++. ..++-..-.+-+..++.+..|+..+.- . .....+.|+-....++. 
T Consensus 76 ~~~~~~i~~~~~~~~y~~~El~~a~~~g~~l~~~ka~aa~~a~~~~~n~v~~~G-----d---~~~~g~~GllN~p~v~~ 147 (304) T protein:vir:52 76 TPTRSYIVPWAKSVTWTKPELEQGKLLGLALNTAKIMALNKNAQQTLQKVAFLG-----H---AKDSRLTGLLNNKSVEV 147 (304) T ss_pred ceeEEEEEEEeeeeeecHHHHHHHHHhCCCcHHHHHHHHHHHHHhhhceEEEEe-----e---ccccceEEEEeCCCcce Confidence 2223222211 122334556666664 455555555555566667666654321 0 01111223222222221 Q ss_pred ---cccccccc-hHHHHHHHHHHHHHHHHHHhhc----CCCcCCcEEEeCHHHHHHHhccchhhhhccccccch-hccee Q lcl|NC_011085. 167 ---GAKGDLTS-PVELGKAVIAQLTIARAKLTSN----YVPSADRTFYTTPEVYSAILAALMPNAANYAALIDP-ERGSI 237 (343) Q Consensus 167 ---~~~~~~~~-~~~~~~~i~~~l~~a~~~Ld~~----~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~-~~G~V 237 (343) ++.+..++ ..++.+.|++.|.++..++-.+ ..|. .++|+|..|..|..- +..+.+..-..-+ ++.-- T Consensus 148 ~~~~~~~a~~~w~~~T~~eI~~di~~~~~~i~~~s~~~~~p~---tl~Lpp~~~~~l~~~-~~~~~~~Tvl~~l~~n~~~ 223 (304) T protein:vir:52 148 YAIKGAAQNTKVQAMDFDKAVAFFKEIFLKGMEKTKRIEAPN---TFAIDSLDLAHLALV-QRANTDTTALEFLTKHLSA 223 (304) T ss_pred eeecCCccCCccccCCHHHHHHHHHHHHHHHHhccCceecCc---eEEeCHHHHHHHhhc-cCCCCCchHHHHHHHhccc Confidence 11111111 1224556777777777766544 2233 699999999999642 1111111101111 12110 Q ss_pred EEEeceEEEEecc-ccccccccccccccccccccccccccccccccccceEeEeechh--hheeeeeeeeEEeeeeccch Q lcl|NC_011085. 238 RNVMGFEVVEVPH-LTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRS--AVGTVKLKDLSLERARRAEY 314 (343) Q Consensus 238 ~~i~Gf~V~~sn~-lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~--Av~~~~~~~~~~e~~~~~~~ 314 (343) .+--+++|...+. +-.. +. +..+ +.+++.++ -+..-.-++ ..+-+-. T Consensus 224 ~~g~~l~I~~v~~~~~~~--------------------g~----~g~~--r~vvY~~d~~~~~~~vP~p----~~~l~~q 273 (304) T protein:vir:52 224 AAGRQVAIKALPSNYGTR--------------------VT----DGKT--RAMVYVNSKEHVIFDVPMS----PTVLDAQ 273 (304) T ss_pred ccCCcceEEEeccccccc--------------------CC----CCce--EEEEEecChhheEEecCcc----ccccchh Confidence 1111233433221 1100 00 0011 12344333 222211111 1222222 Q ss_pred hh----hhhhhhhhh-ccceecccceEEEEe Q lcl|NC_011085. 
315 QA----DQIIARYAM-GHGGLRPEAAGALVF 340 (343) Q Consensus 315 ~~----d~i~~~~~~-G~~v~rpe~~~~i~~ 340 (343) +. ..+-+..+. |.-+.||++++-+=. T Consensus 274 ~~~~~~~~vp~~~r~gGv~v~~P~a~~y~D~ 304 (304) T protein:vir:52 274 PKGLLAFESGLRMAFGGVTFMEPDSALYVDY 304 (304) T ss_pred hcCCceEEecceeeeeeEEEEccceeeeecC Confidence 22 223345544 888899998888777 No 220 >protein:vir:3643 Length: 336 # NCBI annotation: gp12 # Family: family:all:1653 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705638;genbank:gi:23752323;genbank:GeneID:955719 Probab=26.42 E-value=1.9 Score=19.00 Aligned_cols=283 Identities=13% Similarity=0.006 Sum_probs=111.7 Q ss_pred CCCCCccccccccccccc-cccchhHHHHHHHH--HHHHHHHHHhhhhccCcccccccc--ceEEEEec---cCcceeee Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQ-SGGDKLALFLKVFG--GEVLTAFARTSVTTNRHIMRSISS--GKSAQFPV---LGRTRAAY 72 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~-~~~d~~al~ie~~~--g~V~~~f~~~s~~~~~~~~~~i~~--G~tv~i~~---iG~~t~~~ 72 (343) .|+- ..|++.. ..+-+-+ |+.-|- +-++..+... +...++.+.+.-. -+++.|+. .|.+.+ T Consensus 34 da~d-------~~~~~~~~~~~~~~~-~l~~~i~p~~~~~~~~~~-~~~~l~pv~t~g~W~~~~~~~~~~e~~G~a~~-- 102 (336) T protein:vir:36 34 DAAD-------LSPHLSSTGSSGIPN-YLTTYVDPSVIDILVAPM-KAAELVGESKKGDWTTLVAAFITAEPTTKVAT-- 102 (336) T ss_pred hhhh-------ccCccccCCCcchHH-HHHHhhccceEeeecchh-hhhhhccccccCCccceeEEEeeeeceeeEEE-- Confidence 1211 1122211 1111111 333333 2222222222 2334444443111 24555554 455443 Q ss_pred ecCCCcCCCccCCCccceEEEEeeeeeeeeeeccchHH---HHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccc Q lcl|NC_011085. 73 LQAGQSLDDKRKDIKHTEKTIVIDGLLTADVLIYDIED---AMNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPA 149 (343) Q Consensus 73 ~~~g~~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D~---~q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~ 149 (343) |..+.+++-...+..-.++++..=+. .+.+...+. .++.+|+-.+-.+.+..+|.+..++..+.-.. 
T Consensus 103 ygd~~D~P~~d~~~~~~~~~v~~~~~---g~~yg~~E~~~Aa~~~~~l~~~Ka~aA~~ale~~~N~i~~~Gd~------- 172 (336) T protein:vir:36 103 YGDYSSDGDSGANINYPQRQSYFFQT---WTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVA------- 172 (336) T ss_pred eeccCCCceeecccceeeeeEEEEEe---eeeeCHHHHHHHHHhCCCcHHHHHHHHHHHHHHhhCcEEEEecc------- Confidence 33233332111112222223222111 233433332 23578888888888888888888865432111 Q ss_pred cccccccccCCceeec--ccccccccchHHHHHHHHHHHHHHHHHHhhcC---C-CcCCcEEEeCHHHHHHHhccchhhh Q lcl|NC_011085. 150 ASNENIAGLGSASILE--VGAKGDLTSPVELGKAVIAQLTIARAKLTSNY---V-PSADRTFYTTPEVYSAILAALMPNA 223 (343) Q Consensus 150 ~~~~~~~g~~~~~~~~--~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~---V-P~~gR~~vv~P~~~~~Ll~~~~~~~ 223 (343) ...+.|+-....++ .+..+.. -..++.+.|++.|.++...|.... + +...-.++|||..+..|-.-..+ T Consensus 173 --~~~~yGllNdP~l~a~~t~~t~~-~~~~t~~ei~~Di~~~~~~l~~qt~G~i~~~~~~tL~LP~~~~~~Ls~~n~~-- 247 (336) T protein:vir:36 173 --GLENYGLINDPSLSAPITATTPW-SGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSKTNQY-- 247 (336) T ss_pred --ccceEEEEecCCCccccccCCCc-ccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccEEEechHHHHhccCCCcc-- Confidence 11111211111111 1111111 012235677888888777776642 2 23346899999998888432111 Q ss_pred hccccccchhcceeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechh--hheeeee Q lcl|NC_011085. 224 ANYAALIDPERGSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRS--AVGTVKL 301 (343) Q Consensus 224 ~~~~~~~~~~~G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~--Av~~~~~ 301 (343) ...-..-+++ +.-+++|...+.+-..+ .. ....+. ....+. ..+-+.+... ++. ++. T Consensus 248 -g~Tvl~~lk~----n~Pnl~i~t~pEl~~a~----g~-----~~~l~~----~~~~~~--~t~~~~~p~~~~~l~-vq~ 306 (336) T protein:vir:36 248 -GLAAAAKLKD----IFPKLEFVTIPEYDTAS----GR-----LVQLWA----PRVEGK--DTATCGFTEKMRAHS-IER 306 (336) T ss_pred -CccHHHHHHH----hcCccEEEEccccccCC----Cc-----eEEEEE----EecCCC--cceeeecchhhhccc-eee Confidence 1110111221 23445666666552111 00 000000 000000 0001111111 111 111 Q ss_pred eeeEEeeeeccchhhhhhhhhhh-hccceecccceEEEEec Q lcl|NC_011085. 
302 KDLSLERARRAEYQADQIIARYA-MGHGGLRPEAAGALVFT 341 (343) Q Consensus 302 ~~~~~e~~~~~~~~~d~i~~~~~-~G~~v~rpe~~~~i~~~ 341 (343) ....+.+....+ .|+-+.||-+++.+.-= T Consensus 307 -----------~~~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 336 (336) T protein:vir:36 307 -----------YSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) T ss_pred -----------cCceeEeccccceeeeeeeccchheeeecC Confidence 122244444455 48888888887665444 No 221 >protein:vir:270 Length: 341 # NCBI annotation: putative major capsid protein # Family: family:all:201 # MgeID: mge:7 # MgeName: K139 # Cross-refs: genbank:acc:NP_536650;genbank:gi:17975128;genbank:GeneID:929084 Probab=25.59 E-value=2 Score=18.89 Aligned_cols=296 Identities=9% Similarity=0.010 Sum_probs=115.8 Q ss_pred CCCC-Cc-----cccccccccccccccchhHHHH--HHHHHHHHHHHHHhhhhccCccccccc--cceEEEEeccCccee Q lcl|NC_011085. 1 MADM-KG-----GQQLGKDQGKGQSGGDKLALFL--KVFGGEVLTAFARTSVTTNRHIMRSIS--SGKSAQFPVLGRTRA 70 (343) Q Consensus 1 ~~~~-~~-----~~~~~t~~g~~~~~~d~~al~i--e~~~g~V~~~f~~~s~~~~~~~~~~i~--~G~tv~i~~iG~~t~ 70 (343) |+++ +- .+++-.+..+.++-.|...-|- ---+-.+..+.+++|-|+..+++-.+. .|..|-+-.-|..+- T Consensus 1 m~~~m~~~tr~~~~~y~~~~A~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~Invv~V~e~~Ge~v~lg~~g~iag 80 (341) T protein:vir:27 1 MSQILTQSAREYMDNFAQQLAKSYGVSNVAELFNVSPQLETKLRAAITESAEFLKMITVTTVDQIEGQVVDVGVSGLYTG 80 (341) T ss_pred CcccccHHHHHHHHHHHHHHHHHcCcccccceEeecHHHHHHHHHHHHhhHHhhhcCccccccceeeeEeecccccceee Confidence 5542 10 0111111122222222221111 112345677888999999888876554 478887776665543 Q ss_pred eeecCCCcCCCccCCCccceEEEEeeeeeeeeee-ccchHHHHh---chhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhh Q lcl|NC_011085. 71 AYLQAGQSLDDKRKDIKHTEKTIVIDGLLTADVL-IYDIEDAMN---HYDVRSEYTSQIGESLAMAADGAVLAELAGLCN 146 (343) Q Consensus 71 ~~~~~g~~i~~~~~~~~~~~~~l~iD~~~~~~~~-Idd~D~~q~---~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~ 146 (343) ..-+ +.-+ .++..+...+.+-+.-+..+. =..+|.+-. ..|+...+......++| .|...++--...+. 
T Consensus 81 rtdt--~R~~---r~~~l~~~~Y~c~qtn~dt~i~y~~lDaWA~~g~~~dF~~r~~~~i~~~~A--LD~i~IGfnGts~A 153 (341) T protein:vir:27 81 RKAG--GRFT---KQVGVGGHKYKLAETDSCAAITWAMLCQWANQGGRDQFMKHLTEFSNQMFA--LDIMRIGWNGVSAE 153 (341) T ss_pred ccCC--Ccee---cccccCCcceEEEEeeeeeeecHHHHHHHHhcCCChHHHHHHHHHHHHHHh--hhhhhhcccceeec Confidence 3222 1111 111222233333333222211 124443332 35666666555555444 34433322111110 Q ss_pred cccccccccc-------------ccCCceeecccccccccchHHHHHHHHHHHHHHHH-HHhhcCCCcCCcEEEeCHHHH Q lcl|NC_011085. 147 MPAASNENIA-------------GLGSASILEVGAKGDLTSPVELGKAVIAQLTIARA-KLTSNYVPSADRTFYTTPEVY 212 (343) Q Consensus 147 ~~~~~~~~~~-------------g~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~a~~-~Ld~~~VP~~gR~~vv~P~~~ 212 (343) .......++- ......++..+.....+. ..|.++=+++.++.. .+++..--+.+.+++|. T Consensus 154 ~~Td~~anPllqDVNkGWlQ~~Re~a~~rVl~~~~~~~g~~--gdy~nLDAlV~D~~~~lI~~~~~~d~dLVvivG---- 227 (341) T protein:vir:27 154 ADTDPSANPLGQDVNEGWIAFVKNRKASQVVDVDVYFDETN--GDYRTLDAMASDIINNQIHPMFRNDPRLTVFVG---- 227 (341) T ss_pred cCCChhhcccccccchhHHHHHHhhcccceeccceeeccCC--CccccHHHHHHHHHhcccChHHhcCCCEEEEEc---- Confidence 0100111111 111223332221111111 113333233334444 34555543445677777 Q ss_pred HHHhccchh--hhhccccccchhcce-eEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeE Q lcl|NC_011085. 213 SAILAALMP--NAANYAALIDPERGS-IRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGL 289 (343) Q Consensus 213 ~~Ll~~~~~--~~~~~~~~~~~~~G~-V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l 289 (343) ..|+.++.+ ++..-..+..+.--. ..++.|.+.+..|.+|..+.- +|. +.|- .+ T Consensus 228 ~dLla~k~~~l~n~~~~ptE~~Aa~~i~k~iGGlpa~~~PffP~~~~l-----VT~-----L~NL-------------sI 284 (341) T protein:vir:27 228 SGLIGAAQAKLYDKADKPSEQIAAQKLDKTIAGRPAYVPPFLPDNAMV-----VTI-----PENL-------------QV 284 (341) T ss_pred hhhhhhhhhhhhccCCCCHHHHHHHHHHHhhCCCeEEEccccCCCceE-----Eee-----ccce-------------EE Confidence 456665543 332211111111111 247899999999999954321 111 1111 14 Q ss_pred eechhhheeeeeeeeEEeeeeccchhhhhhhhhhhhccceecccceE-----EEEecCC Q lcl|NC_011085. 
290 FQHRSAVGTVKLKDLSLERARRAEYQADQIIARYAMGHGGLRPEAAG-----ALVFTAG 343 (343) Q Consensus 290 ~~~~~Av~~~~~~~~~~e~~~~~~~~~d~i~~~~~~G~~v~rpe~~~-----~i~~~~g 343 (343) .|++.+.- + +.+...+-+++.++-.+ |+ |=.-.|.. -++++.| T Consensus 285 Y~Q~gs~R----R--~~~d~p~r~rie~yes~-Yv----VEdyg~~~~~~~~~vkl~~~ 332 (341) T protein:vir:27 285 LTQHGTAQ----R--KAKHESDRKRSKTHTGA-WK----VTQWVCWKRSPLTTQKKSTS 332 (341) T ss_pred EEecCcEE----E--EEEeccccccccchhhh-he----eehhhhhhhccccccccCcc Confidence 55554321 1 12222222233332121 11 11112222 2455555 No 222 >protein:vir:78558 Length: 336 # NCBI annotation: major capsid protein # Family: family:all:1653 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294848;genbank:gi:149882911;genbank:GeneID:5291029 Probab=24.97 E-value=2.1 Score=18.80 Aligned_cols=283 Identities=13% Similarity=0.005 Sum_probs=112.5 Q ss_pred CCCCCccccccccccccc-cccchhHHHHHHHH--HHHHHHHHHhhhhccCcccccccc---ceEEEEec---cCcceee Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQ-SGGDKLALFLKVFG--GEVLTAFARTSVTTNRHIMRSISS---GKSAQFPV---LGRTRAA 71 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~-~~~d~~al~ie~~~--g~V~~~f~~~s~~~~~~~~~~i~~---G~tv~i~~---iG~~t~~ 71 (343) .|+- +.|++.. +..-+-+ |+.-|- +.++..+... +...++.+.+. + -+++.++. .|.+.+ T Consensus 34 da~d-------~~~~~~t~~~~g~~~-~l~~~i~p~~~~~~~~~~-~~~~l~~v~t~-g~W~~~~~~~~~~e~~G~a~~- 102 (336) T protein:vir:78 34 DAAD-------LSPHLSSTGSSGIPN-YLTTYVDPSVIDILVAPM-KAAELVGESKK-GDWTTLVAAFITAEPTTTVAT- 102 (336) T ss_pred hhhh-------hccccccCCCcchHH-HHHHhcccceeeehhhhh-hhhhhcccccC-CCccccEEEEeeeecceeeEE- Confidence 1222 1122221 1111222 444444 2223333322 23344444331 1 14555543 455543 Q ss_pred eecCCCcCCCccCCCccceEEEEeeeeeeeeeeccchH--HHH-hchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcc Q lcl|NC_011085. 72 YLQAGQSLDDKRKDIKHTEKTIVIDGLLTADVLIYDIE--DAM-NHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMP 148 (343) Q Consensus 72 ~~~~g~~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D--~~q-~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~ 148 (343) |..+.+++-. 
+..-++.+-+|-..- ..+.+...+ .++ +..|+-.+-.+.+..+|.+..++..+.-- T Consensus 103 -ygd~~D~P~v--d~~~~~~~~~v~~~~-~g~~yg~~El~~A~~~g~~l~~~Ka~aA~~ale~~~N~~~~~Gd------- 171 (336) T protein:vir:78 103 -YGDYSSDGDS--GTNINYPQRQSYFFQ-TWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGV------- 171 (336) T ss_pred -eecccCCCee--ecceeeEEEEEEEEE-eeeeecHHHHHHHHHhCCCcHHHHHHHHHHHHHHhhCeEEEEec------- Confidence 3323333211 122222222221111 123333333 333 46888888888888888888886543211 Q ss_pred ccccccccccCCceeec--ccccccccchHHHHHHHHHHHHHHHHHHhhcC---C-CcCCcEEEeCHHHHHHHhccchhh Q lcl|NC_011085. 149 AASNENIAGLGSASILE--VGAKGDLTSPVELGKAVIAQLTIARAKLTSNY---V-PSADRTFYTTPEVYSAILAALMPN 222 (343) Q Consensus 149 ~~~~~~~~g~~~~~~~~--~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~---V-P~~gR~~vv~P~~~~~Ll~~~~~~ 222 (343) ....+.|+-....++ .+.++.. -..++.+.|+++|..+...|.... + |...-.+++||..+..|-.-..+ T Consensus 172 --~~~~~~GllN~P~l~a~~t~~~~~-w~~~T~~~I~~Di~~~~~~l~~qt~g~~~~~~~~tL~Lp~~~~~~L~~~n~~- 247 (336) T protein:vir:78 172 --AGLENYGLINDPSLSAPITATTPW-SGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSKTNQY- 247 (336) T ss_pred --cccceEEEEeCCCCCcccccCcCc-ccccCHHHHHHHHHHHHHHHHHhcCCeeeeccceEEEechHHHHhccCCCcc- Confidence 111112222111121 1111110 012335677888887777775553 2 33445799999999998543211 Q ss_pred hhccccccchhcceeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeech-hhheeeee Q lcl|NC_011085. 223 AANYAALIDPERGSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHR-SAVGTVKL 301 (343) Q Consensus 223 ~~~~~~~~~~~~G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~-~Av~~~~~ 301 (343) ...-..-+++ +.-+++|...+.|-..+ +... ..+-....+.-+..+. +-.+ .++ -++. 
T Consensus 248 --g~tv~~~lk~----n~Pnl~i~t~pel~~Ag-----------g~~~--~~~~~~~~~~~t~~~~-~p~~f~~l-pvq~ 306 (336) T protein:vir:78 248 --GLSAAAKLKE----IFPKLEFVTIPEYDTAS-----------GRLV--QLWAPRVEGKDTATCG-FTEKMRAH-SIER 306 (336) T ss_pred --CccHHHHHHH----hcCccEEEEcccccccC-----------cceE--EEEEeeccCCcceeee-cchhhhcc-ceee Confidence 1111112222 13345666655552111 0000 0000000000001111 1111 111 1111 Q ss_pred eeeEEeeeeccchhhhhhhhhhh-hccceecccceEEEEec Q lcl|NC_011085. 302 KDLSLERARRAEYQADQIIARYA-MGHGGLRPEAAGALVFT 341 (343) Q Consensus 302 ~~~~~e~~~~~~~~~d~i~~~~~-~G~~v~rpe~~~~i~~~ 341 (343) ....+.+....+ .|+-+.||-+++.+.-= T Consensus 307 -----------~~~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 336 (336) T protein:vir:78 307 -----------YSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) T ss_pred -----------cCceeEeccccceeeeeeeccchheeeccC Confidence 112334444444 47888888877665444 No 223 >protein:vir:103463 Length: 521 # NCBI annotation: major head subunit precursor # Family: family:all:364 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803115;genbank:gi:116326395;genbank:GeneID:4405492 Probab=24.88 E-value=2.1 Score=18.79 Aligned_cols=304 Identities=12% Similarity=0.026 Sum_probs=126.4 Q ss_pred CCCCCccccccccccccccccchhHHHHH-----HHHHHHH--------------------HHHH-HhhhhccCc----- Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQSGGDKLALFLK-----VFGGEVL--------------------TAFA-RTSVTTNRH----- 49 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~ie-----~~~g~V~--------------------~~f~-~~s~~~~~~----- 49 (343) |=.. |+++..+ .+ ++ -+.|.+ -|+|.-- ..|. .+.+..... 
T Consensus 127 MRsr-Y~~q~~~----~~--g~-eaf~~~~~ada~fSG~~~at~~s~~~~~~~~~~Gd~~~~~~~~~g~~~~~~~~~~t~ 198 (521) T protein:vir:10 127 LRAV-YGKDPIA----AG--AK-EAFHPMYGPDAMFSGQGAAKKFAALAASTQTTVGDIYTHFFQDTGTVYLQASAQVTI 198 (521) T ss_pred eeee-ccCCccc----cc--cc-cccchhccccccccccccccccccccccccccccccccccccccccceecccccccC Confidence 1111 1111000 00 00 000000 0111100 0000 000000000 Q ss_pred -ccc-ccccceEEEEeccCcceeeeecCC------CcC---CCccCCCccceEEEEeeeeeeee--------eeccchHH Q lcl|NC_011085. 50 -IMR-SISSGKSAQFPVLGRTRAAYLQAG------QSL---DDKRKDIKHTEKTIVIDGLLTAD--------VLIYDIED 110 (343) Q Consensus 50 -~~~-~i~~G~tv~i~~iG~~t~~~~~~g------~~i---~~~~~~~~~~~~~l~iD~~~~~~--------~~Idd~D~ 110 (343) ... +-..++......+....+..+..| +.+ .++ ....-.+.-+.||+...-+ ..+.-..+ T Consensus 199 ~~t~~d~~~~~~~~~~~~~~~~~y~~~~GmsTa~aEal~~~g~s-s~~~f~EMaFsIeKvtVtAKSRaLKAEYTiELAQD 277 (521) T protein:vir:10 199 SSTADDAAKLDAEIKKQMEAGALVEIAEGMATSIAELQESFNGS-TDNPWNEMGFRIDKQVIEAKSRQLKAAYSIELAQD 277 (521) T ss_pred CCcccccccccccccccccccceeecccccchhhHhhhccCCCC-ccccccceeeEEEEEEEeeeccceeccccHHHHHH Confidence 000 000011111111111111111111 111 111 1123456677787764432 44555556 Q ss_pred HHh-c-hhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccccCCceeecccccccccc---hHHHHHHHHHH Q lcl|NC_011085. 111 AMN-H-YDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENIAGLGSASILEVGAKGDLTS---PVELGKAVIAQ 185 (343) Q Consensus 111 ~q~-~-~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~g~~~~~~~~~~~~~~~~~---~~~~~~~i~~~ 185 (343) .++ | .|.-.|++.=.+..+...+++-|+..+--.++....-.....|+. ..+.+.....+... ....++.++-. 
T Consensus 278 LKAVHGLDAEtELaNILSTEImlEINReii~~i~~sa~~~~~g~t~~~~~~-~G~~d~~~~~d~~~~~~~~e~~k~L~~~ 356 (521) T protein:vir:10 278 LRAVHGMDADAELSGILATEIMLEINREVVDWINYSAQVGKSGMTLTPGSK-AGVFDFQDPIDIRGARWAGESFKALLFQ 356 (521) T ss_pred HHHhcCCChHHHHHHHHHHHHHHHhhHHHhhhhhheeeeeeeeeeeccCcc-ccceecccccccccchHHHHHHHHHHHH Confidence 666 3 888899999999999999999999765333222111000001111 12233322222111 11122222222 Q ss_pred HH-HHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhcccc--ccch--hcc--eeEEE-eceEEEEeccccccccc Q lcl|NC_011085. 186 LT-IARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAA--LIDP--ERG--SIRNV-MGFEVVEVPHLTAGGAG 257 (343) Q Consensus 186 l~-~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~--~~~~--~~G--~V~~i-~Gf~V~~sn~lp~~~~~ 257 (343) |. .+.+...+-.. -.+-|+|++|++.+.|-..+.+....-.+ .+.. .++ ..|.+ .|++||.-++.|..=.. T Consensus 357 i~~~an~i~~~T~r-~~~n~~i~S~~Va~~L~~~~~~~~~~~~~~~~g~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~ 435 (521) T protein:vir:10 357 IDKEAVEIARQTGR-GEGNFIIASRNVVNVLASVDTGISYAAQGLATGFNTDTTKSVFAGVLGGKYRVYIDQYAKQDYFT 435 (521) T ss_pred HHHHHHHHHHhccc-ccceEEEEchHHHHHHhhcccccccccccccccccccCCCceEEEEecCceEEEecCCCCcceEE Confidence 22 22222222211 24579999999999988765444332221 1110 122 23554 56799988776642211 Q ss_pred cccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEeeeeccchhhhhhhhhhhhccceecccceEE Q lcl|NC_011085. 258 DDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERARRAEYQADQIIARYAMGHGGLRPEAAGA 337 (343) Q Consensus 258 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~~~~~~~d~i~~~~~~G~~v~rpe~~~~ 337 (343) ..|+++.....+++|.|-. .+ ..-+..||+.|--.|-.+.+||..+ .| .+.. T Consensus 436 -------------------vG~KG~~~~~~glfyaPYv----~l---~~~~~~dp~sfqP~~g~~tRY~l~~-NP-~~~~ 487 (521) T protein:vir:10 436 -------------------VGYKGPNEMDAGIYYAPYV----AL---TPLRGSDPKNFQPVMGFKTRYGIGI-NP-FAES 487 (521) T ss_pred -------------------EEEeCCcccccceeecccc----cc---ccccccCCccccceeeeeeeeceee-cC-cccc Confidence 1233344444578998853 23 3334579999999999999998765 45 2221 Q ss_pred E------EecCC Q lcl|NC_011085. 
338 L------VFTAG 343 (343) Q Consensus 338 i------~~~~g 343 (343) . .+..| T Consensus 488 ~~~~~~~~i~~~ 499 (521) T protein:vir:10 488 AAQAPASRIQSG 499 (521) T ss_pred cCCccceeeccc Confidence 1 11222 No 224 >protein:vir:101557 Length: 336 # NCBI annotation: gp12 # Family: family:all:1653 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958117;genbank:gi:41057663;genbank:GeneID:2716814 Probab=24.58 E-value=2.2 Score=18.75 Aligned_cols=283 Identities=14% Similarity=0.014 Sum_probs=113.1 Q ss_pred CCC-CCccccccccccccccccchhHHHHHHHH-HHH-HHHHHHhhhhccCcccccccc--ceEEEEec---cCcceeee Q lcl|NC_011085. 1 MAD-MKGGQQLGKDQGKGQSGGDKLALFLKVFG-GEV-LTAFARTSVTTNRHIMRSISS--GKSAQFPV---LGRTRAAY 72 (343) Q Consensus 1 ~~~-~~~~~~~~t~~g~~~~~~d~~al~ie~~~-g~V-~~~f~~~s~~~~~~~~~~i~~--G~tv~i~~---iG~~t~~~ 72 (343) .|+ .+.+-... ..++-+ + |+.-|- ..+ +..+.- -+...++.+.+.-. -+++.|+. .|.+.+ T Consensus 34 da~d~~~~~~~~------~~~~i~-~-~l~~~i~p~~~~~~~~p-~~a~~l~pv~t~g~W~~~~~~~~~~e~~G~a~~-- 102 (336) T protein:vir:10 34 DAADLSPHLSST------GSSGIP-N-YLTTYVDPAVIDILVAP-MKAAELVGESKKGDWTTLVAAFITAEPTTKVAT-- 102 (336) T ss_pred hhhhccCccccC------CCchhH-H-HHHhhcccceeeehhhh-hhhhhhccccccCCccceeEEEeeeeceeeEEE-- Confidence 132 11111111 112222 1 554454 222 222221 23344454443111 24555554 455543 Q ss_pred ecCCCcCCCccCCCccceEEEEeeeeeeeeeeccchH--HH-HhchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccc Q lcl|NC_011085. 73 LQAGQSLDDKRKDIKHTEKTIVIDGLLTADVLIYDIE--DA-MNHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPA 149 (343) Q Consensus 73 ~~~g~~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D--~~-q~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~ 149 (343) |..+.+++-...+..-.++++..=+. .+.+...+ .+ ++.+|+-.+-.+.+..+|.+..++..+.-.. 
T Consensus 103 ygd~~D~P~~d~~~~~~~~~v~~~~~---g~~yg~~El~~A~~~g~~l~~~Ka~aA~~ale~~~N~i~~~Gd~------- 172 (336) T protein:vir:10 103 YGDYSSDGDSGANINYPQRQSYFFQT---WTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVA------- 172 (336) T ss_pred eeccCCCceeecccceeeeeEEEEEe---eeeeCHHHHHHHHHhCCCcHHHHHHHHHHHHHHhhCcEEEEecc------- Confidence 33233332111112222223222111 23344333 22 3468888888888888888888865432111 Q ss_pred cccccccccCCceeec--ccccccccchHHHHHHHHHHHHHHHHHHhhcC---C-CcCCcEEEeCHHHHHHHhccchhhh Q lcl|NC_011085. 150 ASNENIAGLGSASILE--VGAKGDLTSPVELGKAVIAQLTIARAKLTSNY---V-PSADRTFYTTPEVYSAILAALMPNA 223 (343) Q Consensus 150 ~~~~~~~g~~~~~~~~--~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~---V-P~~gR~~vv~P~~~~~Ll~~~~~~~ 223 (343) ...+.|+-....++ .+..+.. -..++.+.|++.|..+...|..+. + +...-.++|||..+..|-.-..+ T Consensus 173 --~~~~yGllN~P~l~a~~t~~t~~-~~~~t~eei~~Di~~~~~~l~~qs~G~i~~~~~~tL~LP~~~~~~Ls~~n~~-- 247 (336) T protein:vir:10 173 --GLENYGLINDPSLSAPITATTPW-SGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSKTNQY-- 247 (336) T ss_pred --ccceEEEEeCCCCccccccCCCc-ccccCHHHHHHHHHHHHHHHHHhcCCeecccCcceEEecHHHHHhccCCCcc-- Confidence 11111211111111 1111111 012234677888888777776643 2 23456899999998888432111 Q ss_pred hccccccchhcceeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeechh--hheeeee Q lcl|NC_011085. 224 ANYAALIDPERGSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRS--AVGTVKL 301 (343) Q Consensus 224 ~~~~~~~~~~~G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~--Av~~~~~ 301 (343) ...-..-+++ +.-+++|...+.+-..+ .. ....+. ....+.-+.. +.+... ++. ++. 
T Consensus 248 -g~Tvl~~lk~----n~Pnl~i~t~pEl~~a~----G~-----~~~l~~----~~~~~~~t~~--~~~p~~~~~l~-vq~ 306 (336) T protein:vir:10 248 -GLAAAAKLKD----IFPKLEFVTIPEYDTAS----GR-----LVQLWA----PRVEGKDTAT--CGFTEKMRAHS-IER 306 (336) T ss_pred -CccHHHHHHH----hcCccEEEEccccccCC----Cc-----eEEEEE----EecCCCccee--eecchhhhccc-eee Confidence 0110111221 23455666666552111 00 000000 0000000001 111111 111 111 Q ss_pred eeeEEeeeeccchhhhhhhhhhh-hccceecccceEEEEec Q lcl|NC_011085. 302 KDLSLERARRAEYQADQIIARYA-MGHGGLRPEAAGALVFT 341 (343) Q Consensus 302 ~~~~~e~~~~~~~~~d~i~~~~~-~G~~v~rpe~~~~i~~~ 341 (343) ....+.+....+ .|+-+.||-+++.+.-= T Consensus 307 -----------~~~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 336 (336) T protein:vir:10 307 -----------YSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) T ss_pred -----------cCceeEeccccceeeeeeeccchheeeecC Confidence 112234444445 48888888887665444 No 225 >protein:vir:7214 Length: 521 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049787;genbank:gi:9632597;genbank:GeneID:1258751 Probab=22.54 E-value=2.4 Score=18.47 Aligned_cols=304 Identities=13% Similarity=0.059 Sum_probs=126.1 Q ss_pred CCCCCcccccc----------------cccccc-------------ccccchhHHHHHH--HHHHHHHHHH-HhhhhccC Q lcl|NC_011085. 1 MADMKGGQQLG----------------KDQGKG-------------QSGGDKLALFLKV--FGGEVLTAFA-RTSVTTNR 48 (343) Q Consensus 1 ~~~~~~~~~~~----------------t~~g~~-------------~~~~d~~al~ie~--~~g~V~~~f~-~~s~~~~~ 48 (343) |=.. |+++.. +-.|.+ ...+|. |... .++.+..... ..+++.+. T Consensus 127 MRsr-Y~~q~~~~~g~ea~~~e~~~da~fSG~~~~~~~~~~~~~~~~a~Gd~---~~~~~~~~gt~~~~~~~~~~~~~g~ 202 (521) T protein:vir:72 127 LRAV-YGKDPVAAGAKEAFHPMYGPDAMFSGQGAAKKFPALAASTQTTVGDI---YTHFFQETGTVYLQASVQVTIDAGA 202 (521) T ss_pred eeee-ecCCCCCcccccccchhcccccccccccccccccccccccccccccc---cccccccccccccccccccccCCCC Confidence 1100 000000 000000 001110 0000 0000000000 00011111 Q ss_pred cccc--------ccccceEEEEeccCcceeeeecCCCc---CCCccCCCccceEEEEeeeeeeee--------eeccchH Q lcl|NC_011085. 
49 HIMR--------SISSGKSAQFPVLGRTRAAYLQAGQS---LDDKRKDIKHTEKTIVIDGLLTAD--------VLIYDIE 109 (343) Q Consensus 49 ~~~~--------~i~~G~tv~i~~iG~~t~~~~~~g~~---i~~~~~~~~~~~~~l~iD~~~~~~--------~~Idd~D 109 (343) .... .+..+. +....-|..|.. ++. +.++ .+....+.-+.||+...-+ ..|.-.. T Consensus 203 t~~~~t~~~v~~~~~a~~-~y~~g~gm~Ta~----aEal~~~g~s-s~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQ 276 (521) T protein:vir:72 203 TDAAKLDAEIKKQMEAGA-LVEIAEGMATSI----AELQEGFNGS-TDNPWNEMGFRIDKQVIEAKSRQLKAAYSIELAQ 276 (521) T ss_pred CCccccccccccccccCc-eeeeecccchhh----hhhhcccCCc-ccccccceeeEEEEEEEeeeccceeccccHHHHH Confidence 0000 000011 111111111111 111 1111 1123456677788764432 4455555 Q ss_pred HHHh-c-hhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccccCCceeecccccccccc---hHHHHHHHHH Q lcl|NC_011085. 110 DAMN-H-YDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENIAGLGSASILEVGAKGDLTS---PVELGKAVIA 184 (343) Q Consensus 110 ~~q~-~-~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~g~~~~~~~~~~~~~~~~~---~~~~~~~i~~ 184 (343) +.++ | .|.-.|++.=.+..+...+.+-|+..+--.++..........|+. ..+.+.....+... ....++.++- T Consensus 277 DLKAVHGLDAEtELaNILSTEImlEINReii~~i~~sa~~g~~g~t~~~~~~-~G~~d~~~~~d~~~~~~~~e~~k~L~~ 355 (521) T protein:vir:72 277 DLRAVHGMDADAELSGILATEIMLEINREVVDWINYSAQVGKSGMTLTPGSK-AGVFDFQDPIDIRGARWAGESFKALLF 355 (521) T ss_pred HHHHhcCCChHHHHHHHHHHHHHHHhhHHHhhhhhheeeeeeeeeeeccCcc-ccceecccccccccchHHHHHHHHHHH Confidence 6666 3 888889999999999999999999665332222111000001111 12233322222111 1112222222 Q ss_pred HHH-HHHHHHhhcCCCcCCcEEEeCHHHHHHHhccchhhhhcccc--cc----chhcceeEEE-eceEEEEecccccccc Q lcl|NC_011085. 185 QLT-IARAKLTSNYVPSADRTFYTTPEVYSAILAALMPNAANYAA--LI----DPERGSIRNV-MGFEVVEVPHLTAGGA 256 (343) Q Consensus 185 ~l~-~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~--~~----~~~~G~V~~i-~Gf~V~~sn~lp~~~~ 256 (343) .|. .+.+...+-.. -.+-|+|++|++.+.|-..+.+....-++ .+ +-..=..|.+ .|++||.-++.|..=. 
T Consensus 356 ~i~~~an~i~~~T~r-~~~n~~i~S~~Va~~L~~~~~~~~~~~~~~~~g~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~ 434 (521) T protein:vir:72 356 QIDKEAVEIARQTGR-GEGNFIIASRNVVNVLASVDTGISYAAQGLATGFSTDTTKSVFAGVLGGKYRVYIDQYAKQDYF 434 (521) T ss_pred HHHHHHHHHHHhccc-ccceEEEEchHHHHHHhhcccccccccccccccccccCCCceEEEEccCceEEEecCCCCcceE Confidence 222 22222222211 24579999999999988655444322111 11 0011123443 5689998877664221 Q ss_pred ccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEeeeeccchhhhhhhhhhhhccceecccceE Q lcl|NC_011085. 257 GDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERARRAEYQADQIIARYAMGHGGLRPEAAG 336 (343) Q Consensus 257 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~~~~~~~d~i~~~~~~G~~v~rpe~~~ 336 (343) . ..|+++.....+++|.|-. .+ ..-+..||+.|--.|-.+.+||..+ .|=.-. T Consensus 435 ~-------------------vG~KG~~~~~~glfyaPYv----~l---~~~~~~dp~sfqP~~g~~tRY~l~~-NP~~~~ 487 (521) T protein:vir:72 435 T-------------------VGYKGPNEMDAGIYYAPYV----AL---TPLRGSDPKNFQPVMGFKTRYGIGI-NPFAES 487 (521) T ss_pred E-------------------EEEeCCcccccceeecccc----cc---ccccccCCccccceeeeeeeeceee-cCcccc Confidence 1 1233344444578998853 23 3334579999999999999998765 441111 Q ss_pred -----EEEecCC Q lcl|NC_011085. 337 -----ALVFTAG 343 (343) Q Consensus 337 -----~i~~~~g 343 (343) .-++..| T Consensus 488 ~~~~~a~~i~~~ 499 (521) T protein:vir:72 488 AAQAPASRIQSG 499 (521) T ss_pred cCcccceeecCc Confidence 1122222 No 226 >protein:vir:106734 Length: 336 # NCBI annotation: gp13 # Family: family:all:1653 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944321;genbank:gi:38638620;genbank:GeneID:2657363 Probab=21.73 E-value=2.6 Score=18.35 Aligned_cols=283 Identities=13% Similarity=0.008 Sum_probs=110.1 Q ss_pred CCCCCccccccccccccc-cccchhHHHHHHHH-HH-HHHHHHHhhhhccCcccccccc---ceEEEEec---cCcceee Q lcl|NC_011085. 
1 MADMKGGQQLGKDQGKGQ-SGGDKLALFLKVFG-GE-VLTAFARTSVTTNRHIMRSISS---GKSAQFPV---LGRTRAA 71 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~-~~~d~~al~ie~~~-g~-V~~~f~~~s~~~~~~~~~~i~~---G~tv~i~~---iG~~t~~ 71 (343) .|+- +.|++.. +..-+-+ |+.-|- .. ++..+... +...++.+.+ ++ -+++.|+. .|.+. T Consensus 34 da~d-------~~~~~~t~~~~g~~~-~l~~~i~p~~~~~~~~~~-~~~~l~~v~t-~g~w~~~~~~~~~~e~~G~a~-- 101 (336) T protein:vir:10 34 DAAD-------LSPHLSSTGSSGIPN-YLTTYVDPSVIDILVAPM-KAAELVGESK-KGDWTTLVAAFITAEPTTKVA-- 101 (336) T ss_pred hhhh-------hccccccCCCcchHH-HHHhhcCcceeeeeechh-chhhhccccc-CCCcceeeEEEEeeeeeeeEE-- Confidence 1222 1122221 1111222 444444 22 22233222 2344444444 22 23444433 45543 Q ss_pred eecCCCcCCCccCCCccceEEEEeeeeeeeeeeccchH--HHH-hchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcc Q lcl|NC_011085. 72 YLQAGQSLDDKRKDIKHTEKTIVIDGLLTADVLIYDIE--DAM-NHYDVRSEYTSQIGESLAMAADGAVLAELAGLCNMP 148 (343) Q Consensus 72 ~~~~g~~i~~~~~~~~~~~~~l~iD~~~~~~~~Idd~D--~~q-~~~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~ 148 (343) -|-.+.+++-. +..-+..+-++--.- ..+.+...+ .++ +..|+-.+-.+.+..+|.+..++..+.-- T Consensus 102 ~ygd~~d~P~~--d~~~~~~~~~v~~~~-~g~~yg~~El~~A~~~g~~l~~~Ka~aA~~ale~~~N~~~~~Gd------- 171 (336) T protein:vir:10 102 TYGDYSSDGDS--GTNINYPQRQSYFFQ-TWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGV------- 171 (336) T ss_pred EccccCCCcce--eeeeeeeeeeEEEEE-EEEeeCHHHHHHHHHhCCCcHHHHHHHHHHHHHHhhCeEEEEee------- Confidence 23222233221 111111111111111 123333333 222 46778777777778888877776443211 Q ss_pred ccccccccccCCceeecc--cccccccchHHHHHHHHHHHHHHHHHHhhcC---C-CcCCcEEEeCHHHHHHHhccchhh Q lcl|NC_011085. 149 AASNENIAGLGSASILEV--GAKGDLTSPVELGKAVIAQLTIARAKLTSNY---V-PSADRTFYTTPEVYSAILAALMPN 222 (343) Q Consensus 149 ~~~~~~~~g~~~~~~~~~--~~~~~~~~~~~~~~~i~~~l~~a~~~Ld~~~---V-P~~gR~~vv~P~~~~~Ll~~~~~~ 222 (343) ....+.|+-....++. +.++.. -..++.+.|++.|..+...|.... 
+ |...-.++|||..+..|-.-..+ T Consensus 172 --~~~~~~GllN~P~l~a~~t~~~~~-w~~~T~~eI~~Di~~~~~~l~~qt~g~i~~~~~~tL~Lp~~~~~~L~~~n~~- 247 (336) T protein:vir:10 172 --AGLENYGLINDPSLSAPITATTPW-SGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSKTNQY- 247 (336) T ss_pred --cccceEEEeecCCCCcccccCcCc-ccccCHHHHHHHHHHHHHHHHHhcCCeeeeccceEEEechHHHHhccCCCcc- Confidence 1111122211111211 111110 012335677888887777775543 2 33445799999999998543211 Q ss_pred hhccccccchhcceeEEEeceEEEEeccccccccccccccccccccccccccccccccccccceEeEeech-hhheeeee Q lcl|NC_011085. 223 AANYAALIDPERGSIRNVMGFEVVEVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHR-SAVGTVKL 301 (343) Q Consensus 223 ~~~~~~~~~~~~G~V~~i~Gf~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~-~Av~~~~~ 301 (343) ...-..-+++ +.-+++|...+.|-..+ . .....+.. ...+.-+..+. +-.+ .++ -++. T Consensus 248 --g~tv~~~lk~----n~Pnl~i~t~pel~~Ag----g-----~~~~~~~~----~~~~~~t~~~~-~P~~f~~l-pvq~ 306 (336) T protein:vir:10 248 --GLSAAAKLKE----IFPKLEFVTIPEYDTAS----G-----RLVQLWAP----RVEGKDTATCG-FTEKMRAH-SIER 306 (336) T ss_pred --CccHHHHHHH----hCCccEEEEcccccccC----C-----ceEEEEEe----cccCCcceeee-cChhhhcc-ceee Confidence 0111112222 13345676655552111 0 00000000 00000001111 1111 111 1111 Q ss_pred eeeEEeeeeccchhhhhhhhhhh-hccceecccceEEEEec Q lcl|NC_011085. 
302 KDLSLERARRAEYQADQIIARYA-MGHGGLRPEAAGALVFT 341 (343) Q Consensus 302 ~~~~~e~~~~~~~~~d~i~~~~~-~G~~v~rpe~~~~i~~~ 341 (343) ....+.+....+ .|+-+.||-+++.+.-= T Consensus 307 -----------~~~~~~v~~~~rt~Gv~i~rP~ai~~~~GI 336 (336) T protein:vir:10 307 -----------YSSYFRQKKSAGTWGAVIFRPFAVAQMLGV 336 (336) T ss_pred -----------cCceeEeccccceeeeeeeccchheeeccC Confidence 112334444444 47788888776664433 No 227 >protein:vir:98143 Length: 524 # NCBI annotation: gp23 precursor of major head subunit # Family: family:all:364 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239203;genbank:gi:66391678;genbank:GeneID:3416245 Probab=20.46 E-value=2.8 Score=18.16 Aligned_cols=297 Identities=14% Similarity=0.085 Sum_probs=126.5 Q ss_pred CCCCCccccccccccccccccchhHHHHHH----------HHHHH-HHHHHHhhhh---------------ccCcccccc Q lcl|NC_011085. 1 MADMKGGQQLGKDQGKGQSGGDKLALFLKV----------FGGEV-LTAFARTSVT---------------TNRHIMRSI 54 (343) Q Consensus 1 ~~~~~~~~~~~t~~g~~~~~~d~~al~ie~----------~~g~V-~~~f~~~s~~---------------~~~~~~~~i 54 (343) |=.. |+++.. ..|. -++|-|- |+|.- .+.|...+.. .+.+...+. T Consensus 127 mRsr-Y~n~~~-~~gt-------eA~~nEAf~~~ye~dt~fSG~g~~t~~s~~~~g~~~~~g~~~~~~~~~~g~~~~~~~ 197 (524) T protein:vir:98 127 LRAV-YGKDPL-AGGT-------PADVREAFHPMFAPDTMYSGEGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNV 197 (524) T ss_pred hhee-ecCCCC-Cccc-------ccccccccccccccccccCCccccccccccccccccccccccccccccccceecccc Confidence 1111 111100 0000 0011111 11100 0000000000 000000000 Q ss_pred -------ccce---------------EEEEeccCcceeeeecCCCcC---CCccCCCccceEEEEeeeeeeee------- Q lcl|NC_011085. 55 -------SSGK---------------SAQFPVLGRTRAAYLQAGQSL---DDKRKDIKHTEKTIVIDGLLTAD------- 102 (343) Q Consensus 55 -------~~G~---------------tv~i~~iG~~t~~~~~~g~~i---~~~~~~~~~~~~~l~iD~~~~~~------- 102 (343) .+.+ ++.-...|..+.. 
++.+ .++ ....-.+..+.||+...-+ T Consensus 198 ~~g~~~~tgt~p~~~~~a~~~~~~~g~~~~~~~GmsTA~----aEaL~~~g~s-s~~~f~EMaFsIeKvtVtAKSRaLKA 272 (524) T protein:vir:98 198 TSGNVTVTGADPAALDAAVIAENEKGTLAEISVGMATSV----AELQENFNGS-SANPWNEMAFRIDKQVIEARSRQLKA 272 (524) T ss_pred ccCcccccccccccccccccccccccceeecccccchhh----hhhhccCCCC-ccccccceeeEEEEEEEeeecccccc Confidence 0000 0111111111110 1111 111 1123456677787764432 Q ss_pred -eeccchHHHHh-c-hhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccc-cCCceeecccccccccc---h Q lcl|NC_011085. 103 -VLIYDIEDAMN-H-YDVRSEYTSQIGESLAMAADGAVLAELAGLCNMPAASNENIAG-LGSASILEVGAKGDLTS---P 175 (343) Q Consensus 103 -~~Idd~D~~q~-~-~d~~~~~~~~~~~aLa~~~D~~i~~~~~~~a~~~~~~~~~~~g-~~~~~~~~~~~~~~~~~---~ 175 (343) ..+.-..+.++ | .|.-.|++.=.+..+...+++-|+..+...++.... ....+ .....+.......+... . T Consensus 273 EYTiELAQDLKAVHGLDAEtELsNILSTEImlEINReii~~i~~~a~~~~~--g~t~~~~~~~G~~dl~~~~d~~~~r~~ 350 (524) T protein:vir:98 273 QYSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDLINYTAQVGKS--GFTQTVGSKAGSFDFQDPVDIRGARWA 350 (524) T ss_pred cccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHHHHhhhheecee--ecccccccccceeeccccccccccchh Confidence 44555555666 3 888899999999999999999999776544433211 00000 00112233322222111 1 Q ss_pred HHHHHHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhcc-chhhhhc-cc-------cccchhcceeEEEeceEEE Q lcl|NC_011085. 176 VELGKAVIAQLTIARAKLTSNYVPSADRTFYTTPEVYSAILAA-LMPNAAN-YA-------ALIDPERGSIRNVMGFEVV 246 (343) Q Consensus 176 ~~~~~~i~~~l~~a~~~Ld~~~VP~~gR~~vv~P~~~~~Ll~~-~~~~~~~-~~-------~~~~~~~G~V~~i~Gf~V~ 246 (343) .+.++.++-.+-+.....-.+---..+-|+|++|++.++|-.. .-+.... .. ....+.-|.+. 
.|++|| T Consensus 351 ~e~~~~L~~~i~~~an~I~~~T~rg~~n~~i~S~~Va~~L~~~~~g~~~~s~~~~~~~~~d~~~~~~~G~l~--~~~~vy 428 (524) T protein:vir:98 351 GESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPASQGLQKTLNVDTTKAVFAGVLG--GTYKVY 428 (524) T ss_pred HHHHHHHHHHHHHHHHHHHHhhccccccEEEEchHHHHHHhhhhcccccccchhhcccccCCccceEEEEec--CceEEE Confidence 1223333333322222222211112367999999999988763 3332211 11 11123335554 478999 Q ss_pred EeccccccccccccccccccccccccccccccccccccceEeEeechhhheeeeeeeeEEeeeeccchhhhhhhhhhhhc Q lcl|NC_011085. 247 EVPHLTAGGAGDDREDETTNQKHAFPKTAEGDTKVALDNVVGLFQHRSAVGTVKLKDLSLERARRAEYQADQIIARYAMG 326 (343) Q Consensus 247 ~sn~lp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~Av~~~~~~~~~~e~~~~~~~~~d~i~~~~~~G 326 (343) .-++.|..=.. ..|+++-....+++|.|-. .+.+ -+..||+.|--.|-.+.+|| T Consensus 429 ~D~y~~~dy~~-------------------vG~KG~~~~~~glfyaPYv----~l~~---~~~~dp~sfqP~~g~~tRY~ 482 (524) T protein:vir:98 429 IDQYARQDYFT-------------------VGFKGDNEMDAGIYYAPYV----ALTP---LRGSDPKNFQPVMGFKTRYG 482 (524) T ss_pred ecCCCCcceEE-------------------EEeeCCcccccceeecccc----cccc---ccccCCccccceeeeeeeec Confidence 88877642211 1233333444578888853 3333 34579999999999999998 Q ss_pred cceecccceEEE------EecCC Q lcl|NC_011085. 327 HGGLRPEAAGAL------VFTAG 343 (343) Q Consensus 327 ~~v~rpe~~~~i------~~~~g 343 (343) ..+ .| ..... ++..| T Consensus 483 l~~-NP-~~~~~~~~~~~ri~~g 503 (524) T protein:vir:98 483 IGI-NP-FANSRSQAPADRITSG 503 (524) T ss_pred eee-cC-cccccCCccccccccC Confidence 765 44 22111 22333 Done!