Query lcl|NC_019506.1_cdsid_YP_007005028.1 [gene=F390_gp13] [protein=P22 coat protein] [protein_id=YP_007005028.1] [location=9261..10091] Match_columns 276 No_of_seqs 133 out of 339 Neff 9.4 Searched_HMMs 1612 Date Thu Nov 7 16:55:52 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_13 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_13_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:102605 Length: 273 100.0 1E-63 6.3E-67 366.0 28.8 272 1-276 1-273 (273) 2 protein:vir:105822 Length: 273 100.0 1E-63 6.3E-67 366.0 28.8 272 1-276 1-273 (273) 3 protein:vir:7990 Length: 273 # 100.0 2.1E-63 1.3E-66 364.2 28.5 272 1-276 1-273 (273) 4 protein:vir:94622 Length: 341 100.0 1.2E-56 7.2E-60 327.3 23.0 272 1-276 1-339 (341) 5 protein:vir:78739 Length: 332 100.0 2.1E-54 1.3E-57 314.9 19.5 269 1-274 7-332 (332) 6 protein:vir:3364 Length: 347 # 100.0 3E-52 1.8E-55 303.1 20.2 270 1-276 1-345 (347) 7 protein:vir:10450 Length: 344 100.0 6.4E-52 4E-55 301.2 19.4 270 1-276 1-344 (344) 8 protein:vir:1541 Length: 347 # 100.0 1.8E-51 1.1E-54 298.8 21.5 271 1-276 1-345 (347) 9 protein:vir:94711 Length: 347 100.0 8.9E-52 5.6E-55 300.4 18.5 270 1-276 1-346 (347) 10 protein:vir:2201 Length: 345 # 100.0 7.2E-51 4.5E-54 295.5 20.2 271 1-276 1-345 (345) 11 protein:vir:8885 Length: 347 # 100.0 1.2E-50 7.3E-54 294.3 19.4 271 1-276 1-346 (347) 12 protein:vir:99075 Length: 392 100.0 5.1E-49 3.2E-52 285.3 23.5 271 1-276 1-302 (392) 13 protein:vir:3136 Length: 322 # 100.0 9.4E-50 5.9E-53 289.4 15.4 268 1-276 1-318 (322) 14 protein:vir:174 Length: 423 # 100.0 1.1E-47 6.8E-51 278.0 25.8 271 1-276 1-423 (423) 15 protein:vir:80213 Length: 334 100.0 2.7E-48 1.7E-51 281.4 20.9 271 1-276 1-334 (334) 16 protein:vir:94576 Length: 347 100.0 2.8E-48 1.7E-51 281.3 19.8 271 1-276 1-347 (347) 17 protein:vir:100057 Length: 375 100.0 1.3E-47 8E-51 277.7 23.0 271 1-276 1-370 (375) 18 protein:vir:80930 Length: 278 100.0 5.9E-47 3.6E-50 274.0 25.1 267 1-276 1-277 (278) 19 protein:vir:108303 Length: 418 100.0 5.4E-47 3.3E-50 274.3 24.1 268 1-276 1-294 (418) 20 protein:vir:80180 Length: 381 100.0 2.8E-47 1.7E-50 275.8 20.5 271 1-276 15-381 (381) 21 protein:vir:105522 Length: 423 100.0 3.8E-46 2.3E-49 269.6 23.1 272 1-276 1-306 (423) 22 protein:vir:105374 Length: 423 100.0 3.9E-46 2.4E-49 269.5 22.4 272 1-276 1-315 (423) 23 protein:vir:3525 Length: 423 # 100.0 3.5E-46 2.2E-49 269.8 21.9 273 1-276 1-306 (423) 24 protein:vir:103323 Length: 364 100.0 2.1E-45 1.3E-48 265.6 23.3 271 1-276 1-339 (364) 25 protein:vir:1239 Length: 274 # 100.0 7.1E-45 4.4E-48 262.6 24.9 260 1-276 1-270 (274) 26 protein:vir:96123 Length: 274 100.0 9.9E-45 6.1E-48 261.8 25.4 260 1-276 1-270 (274) 27 protein:vir:3613 Length: 272 # 100.0 8.6E-45 5.3E-48 262.2 24.5 264 1-276 1-272 (272) 28 protein:vir:97433 Length: 274 100.0 1.8E-44 1.1E-47 260.4 25.4 260 1-276 1-270 (274) 29 protein:vir:94494 Length: 274 100.0 1.8E-44 1.1E-47 260.4 25.4 260 1-276 1-270 (274) 30 protein:vir:99675 Length: 324 100.0 1E-45 6.4E-49 267.2 18.3 242 28-276 1-303 (324) 31 protein:vir:96262 Length: 274 100.0 2E-44 1.3E-47 260.1 24.9 260 1-276 1-270 (274) 32 protein:vir:95898 Length: 274 100.0 2E-44 1.3E-47 260.1 24.9 260 1-276 1-270 (274) 33 protein:vir:93742 Length: 274 100.0 3.2E-44 2E-47 259.0 25.5 260 1-276 1-270 (274) 34 protein:vir:6324 Length: 335 # 100.0 1.8E-44 1.1E-47 260.4 21.1 270 1-276 1-329 (335) 35 protein:vir:97331 Length: 319 100.0 2.8E-43 1.8E-46 253.8 26.5 266 1-276 25-296 (319) 36 protein:vir:94800 Length: 319 100.0 2.8E-43 1.8E-46 253.8 26.5 266 1-276 25-296 (319) 37 protein:vir:96833 Length: 275 100.0 1.6E-43 9.7E-47 255.2 24.0 260 1-276 3-271 (275) 38 protein:vir:78935 Length: 335 100.0 6.1E-44 3.8E-47 257.5 21.0 270 1-276 1-329 (335) 39 protein:vir:107120 Length: 329 100.0 1.4E-42 8.9E-46 250.0 26.2 266 1-276 36-307 (329) 40 protein:vir:97031 Length: 402 100.0 1E-43 6.5E-47 256.2 18.8 271 1-276 1-340 (402) 41 protein:vir:105334 Length: 276 100.0 4.6E-41 2.8E-44 241.7 23.9 260 1-276 1-273 (276) 42 protein:vir:102655 Length: 322 100.0 2E-39 1.2E-42 232.8 21.5 271 1-276 13-321 (322) 43 protein:vir:3033 Length: 272 # 100.0 3.3E-38 2.1E-41 226.1 25.5 259 1-276 1-269 (272) 44 protein:vir:9820 Length: 272 # 100.0 3.3E-38 2.1E-41 226.1 25.5 259 1-276 1-269 (272) 45 protein:vir:79008 Length: 299 100.0 6.6E-38 4.1E-41 224.4 26.4 275 1-276 1-298 (299) 46 protein:vir:7019 Length: 401 # 100.0 2E-39 1.2E-42 232.7 17.4 271 1-276 1-339 (401) 47 protein:vir:105645 Length: 400 100.0 4E-39 2.5E-42 231.1 18.4 271 1-276 1-336 (400) 48 protein:vir:95107 Length: 270 100.0 1.6E-36 1E-39 216.8 23.3 258 1-276 1-264 (270) 49 protein:vir:78920 Length: 290 100.0 2E-35 1.2E-38 210.8 25.2 268 1-276 1-290 (290) 50 protein:vir:739 Length: 231 # 100.0 9E-36 5.6E-39 212.7 20.9 230 35-276 1-231 (231) 51 protein:vir:105464 Length: 346 100.0 6.6E-33 4.1E-36 197.0 25.3 270 1-276 1-299 (346) 52 protein:vir:102335 Length: 312 100.0 1.2E-32 7.2E-36 195.7 24.9 272 1-276 1-310 (312) 53 protein:vir:79712 Length: 285 100.0 4.6E-30 2.9E-33 181.4 22.8 267 1-276 1-285 (285) 54 protein:vir:100939 Length: 430 99.9 3.1E-30 1.9E-33 182.3 17.6 270 1-276 1-334 (430) 55 protein:vir:9265 Length: 430 # 99.9 3.1E-30 1.9E-33 182.3 17.6 270 1-276 1-334 (430) 56 protein:vir:99523 Length: 311 99.9 2.9E-28 1.8E-31 171.6 23.2 266 1-275 1-311 (311) 57 protein:vir:1781 Length: 221 # 99.9 7.9E-30 4.9E-33 180.1 14.2 186 78-276 1-202 (221) 58 protein:vir:2106 Length: 430 # 99.9 3.1E-29 1.9E-32 176.9 16.7 270 1-276 1-334 (430) 59 protein:vir:78090 Length: 302 99.9 8.8E-27 5.5E-30 163.4 23.1 267 1-276 1-302 (302) 60 protein:vir:95451 Length: 313 99.9 1.5E-24 9E-28 151.3 14.0 272 1-276 1-311 (313) 61 protein:vir:5974 Length: 324 # 99.8 1.1E-21 6.8E-25 135.5 21.5 264 1-276 1-293 (324) 62 protein:vir:102944 Length: 330 99.8 9.3E-22 5.8E-25 135.9 20.6 264 1-276 1-299 (330) 63 protein:vir:1583 Length: 351 # 99.8 6.8E-21 4.2E-24 131.2 20.6 265 1-276 1-297 (351) 64 protein:vir:79987 Length: 415 99.7 4.9E-17 3E-20 110.0 23.6 267 1-276 120-404 (415) 65 protein:vir:98339 Length: 415 99.7 4.9E-17 3E-20 110.0 23.6 267 1-276 120-404 (415) 66 protein:vir:81100 Length: 415 99.7 4.9E-17 3E-20 110.0 23.6 267 1-276 120-404 (415) 67 protein:vir:4700 Length: 415 # 99.7 5.2E-17 3.2E-20 109.9 23.5 266 1-276 120-404 (415) 68 protein:vir:4600 Length: 415 # 99.7 5.2E-17 3.2E-20 109.9 23.5 266 1-276 120-404 (415) 69 protein:vir:9410 Length: 415 # 99.7 7.8E-17 4.8E-20 108.9 23.1 267 1-276 127-404 (415) 70 protein:vir:41 Length: 299 # N 99.6 1.8E-16 1.1E-19 106.9 23.6 264 1-276 6-298 (299) 71 protein:vir:80684 Length: 315 99.6 1.6E-16 1E-19 107.1 22.0 267 1-276 1-307 (315) 72 protein:vir:94771 Length: 298 99.6 3.5E-16 2.1E-19 105.3 23.5 264 1-275 1-298 (298) 73 protein:vir:105905 Length: 304 99.6 3.7E-16 2.3E-19 105.2 23.4 259 1-275 1-304 (304) 74 protein:vir:94142 Length: 304 99.6 3.7E-16 2.3E-19 105.2 23.4 259 1-275 1-304 (304) 75 protein:vir:96223 Length: 324 99.6 3.4E-16 2.1E-19 105.4 22.7 259 1-276 30-315 (324) 76 protein:vir:78523 Length: 338 99.6 5.5E-16 3.4E-19 104.2 23.9 268 1-276 1-336 (338) 77 protein:vir:1638 Length: 298 # 99.6 4.3E-16 2.7E-19 104.8 23.1 264 1-275 1-298 (298) 78 protein:vir:9309 Length: 324 # 99.6 5.6E-16 3.5E-19 104.2 23.0 259 1-276 30-315 (324) 79 protein:vir:97148 Length: 324 99.6 6.7E-16 4.2E-19 103.8 23.3 259 1-276 31-315 (324) 80 protein:vir:485 Length: 407 # 99.6 5E-16 3.1E-19 104.5 22.6 265 1-276 106-400 (407) 81 protein:vir:99749 Length: 324 99.6 9.2E-16 5.7E-19 103.0 23.1 259 1-276 30-315 (324) 82 protein:vir:78223 Length: 333 99.6 1.2E-15 7.3E-19 102.4 23.6 269 1-276 20-333 (333) 83 protein:vir:9574 Length: 300 # 99.6 1.1E-15 7E-19 102.5 22.7 265 1-276 1-300 (300) 84 protein:vir:7771 Length: 330 # 99.6 2.1E-15 1.3E-18 101.0 24.2 265 1-276 1-323 (330) 85 protein:vir:96392 Length: 324 99.6 1.5E-15 9.5E-19 101.8 23.3 259 1-276 30-315 (324) 86 protein:vir:78830 Length: 324 99.6 1.5E-15 9.5E-19 101.8 23.3 259 1-276 30-315 (324) 87 protein:vir:100247 Length: 425 99.6 1.2E-15 7.5E-19 102.4 22.1 265 1-276 130-424 (425) 88 protein:vir:4339 Length: 395 # 99.6 3.3E-15 2E-18 100.0 24.1 260 1-276 117-395 (395) 89 protein:vir:6242 Length: 390 # 99.6 1.2E-15 7.2E-19 102.5 21.4 263 1-276 116-389 (390) 90 protein:vir:103955 Length: 324 99.6 2.6E-15 1.6E-18 100.5 23.3 259 1-276 30-315 (324) 91 protein:vir:100135 Length: 418 99.6 2.9E-15 1.8E-18 100.3 23.4 260 1-276 136-415 (418) 92 protein:vir:9759 Length: 303 # 99.6 2.1E-15 1.3E-18 101.1 22.3 265 1-276 1-303 (303) 93 protein:vir:4511 Length: 409 # 99.6 1.1E-15 6.5E-19 102.7 20.6 268 1-276 117-406 (409) 94 protein:vir:104085 Length: 320 99.6 4.1E-15 2.6E-18 99.4 23.8 263 1-276 14-318 (320) 95 protein:vir:108211 Length: 318 99.6 5.6E-16 3.5E-19 104.2 18.3 263 1-276 22-318 (318) 96 protein:vir:95763 Length: 297 99.6 5.1E-15 3.1E-18 99.0 23.3 259 1-276 9-296 (297) 97 protein:vir:8187 Length: 311 # 99.5 6.6E-15 4.1E-18 98.3 23.7 265 1-276 1-310 (311) 98 protein:vir:95376 Length: 425 99.5 4E-15 2.5E-18 99.5 22.3 263 1-276 141-421 (425) 99 protein:vir:3870 Length: 400 # 99.5 2.8E-15 1.7E-18 100.4 21.3 255 1-276 137-399 (400) 100 protein:vir:4456 Length: 401 # 99.5 3.8E-15 2.4E-18 99.6 21.3 265 1-276 107-401 (401) 101 protein:vir:100172 Length: 394 99.5 7E-15 4.3E-18 98.2 22.8 257 1-276 111-384 (394) 102 protein:vir:1328 Length: 392 # 99.5 8.2E-15 5.1E-18 97.8 22.4 262 1-276 114-391 (392) 103 protein:vir:101607 Length: 379 99.5 1.5E-14 9.6E-18 96.3 23.9 257 1-276 109-379 (379) 104 protein:vir:4856 Length: 293 # 99.5 1.4E-14 8.5E-18 96.6 23.6 260 1-276 5-281 (293) 105 protein:vir:94673 Length: 419 99.5 1.6E-14 9.9E-18 96.2 23.8 262 1-276 130-417 (419) 106 protein:vir:191 Length: 385 # 99.5 1.4E-14 8.7E-18 96.5 23.5 260 1-276 105-384 (385) 107 protein:vir:1886 Length: 385 # 99.5 1.4E-14 8.7E-18 96.5 23.5 260 1-276 105-384 (385) 108 protein:vir:97053 Length: 390 99.5 1.6E-14 9.6E-18 96.3 22.8 258 1-274 113-390 (390) 109 protein:vir:4953 Length: 397 # 99.5 2.9E-14 1.8E-17 94.8 23.9 260 1-276 109-385 (397) 110 protein:vir:1025 Length: 408 # 99.5 3.1E-14 1.9E-17 94.7 23.6 260 1-276 121-393 (408) 111 protein:vir:81070 Length: 390 99.5 2.6E-14 1.6E-17 95.0 23.0 258 1-274 113-390 (390) 112 protein:vir:4997 Length: 397 # 99.5 3.7E-14 2.3E-17 94.2 23.5 260 1-276 109-385 (397) 113 protein:vir:2430 Length: 318 # 99.5 3.7E-14 2.3E-17 94.2 23.4 263 1-276 14-313 (318) 114 protein:vir:99920 Length: 311 99.5 2.2E-14 1.4E-17 95.4 22.1 266 1-275 1-311 (311) 115 protein:vir:4830 Length: 397 # 99.5 4.8E-14 3E-17 93.6 22.9 260 1-276 109-385 (397) 116 protein:vir:9704 Length: 394 # 99.5 3.8E-14 2.4E-17 94.1 22.1 254 1-276 133-393 (394) 117 protein:vir:4226 Length: 326 # 99.5 6.1E-14 3.8E-17 93.0 22.6 262 1-276 22-323 (326) 118 protein:vir:10364 Length: 390 99.5 1E-13 6.4E-17 91.8 23.5 258 1-274 114-390 (390) 119 protein:vir:104256 Length: 458 99.4 1.3E-13 8.1E-17 91.2 23.6 268 1-276 165-458 (458) 120 protein:vir:2504 Length: 305 # 99.4 9.4E-14 5.8E-17 92.0 22.5 259 1-276 1-303 (305) 121 protein:vir:3991 Length: 404 # 99.4 2.1E-13 1.3E-16 90.0 24.0 260 1-276 116-393 (404) 122 protein:vir:81160 Length: 371 99.4 1.8E-13 1.1E-16 90.5 23.1 259 1-276 91-371 (371) 123 protein:vir:93616 Length: 645 99.4 1.7E-13 1.1E-16 90.5 22.7 268 1-276 344-638 (645) 124 protein:vir:1268 Length: 397 # 99.4 2E-13 1.2E-16 90.2 22.6 259 1-276 123-397 (397) 125 protein:vir:100884 Length: 389 99.4 2.4E-13 1.5E-16 89.8 22.7 257 1-276 109-383 (389) 126 protein:vir:2344 Length: 397 # 99.4 3E-13 1.9E-16 89.2 23.0 263 1-276 10-306 (397) 127 protein:vir:1433 Length: 435 # 99.4 3.5E-13 2.2E-16 88.9 23.3 264 1-276 130-434 (435) 128 protein:vir:8102 Length: 543 # 99.4 2.8E-13 1.7E-16 89.4 22.4 262 1-276 249-542 (543) 129 protein:vir:1084 Length: 437 # 99.4 2.2E-13 1.3E-16 90.0 21.1 258 1-276 156-427 (437) 130 protein:vir:1383 Length: 421 # 99.4 3.2E-13 2E-16 89.1 21.9 256 1-276 116-383 (421) 131 protein:vir:80446 Length: 367 99.4 8.4E-14 5.2E-17 92.3 18.5 265 1-276 1-338 (367) 132 protein:vir:80376 Length: 435 99.4 8.7E-13 5.4E-16 86.7 24.1 264 1-276 130-434 (435) 133 protein:vir:5739 Length: 366 # 99.4 8.3E-13 5.2E-16 86.8 23.9 264 1-276 64-366 (366) 134 protein:vir:102119 Length: 404 99.4 4.9E-13 3E-16 88.1 22.5 268 1-276 110-400 (404) 135 protein:vir:81227 Length: 413 99.4 8.7E-13 5.4E-16 86.7 23.8 263 1-276 118-410 (413) 136 protein:vir:96762 Length: 632 99.4 1.1E-13 6.9E-17 91.6 18.9 259 1-275 357-632 (632) 137 protein:vir:78640 Length: 352 99.4 1.1E-13 6.9E-17 91.6 18.9 252 1-276 83-346 (352) 138 protein:vir:7409 Length: 408 # 99.4 5.4E-13 3.3E-16 87.8 22.5 260 1-276 116-395 (408) 139 protein:vir:3845 Length: 395 # 99.4 8.1E-13 5E-16 86.9 23.1 260 1-276 105-383 (395) 140 protein:vir:6212 Length: 434 # 99.4 3.3E-13 2.1E-16 89.0 20.5 264 1-276 141-431 (434) 141 protein:vir:962 Length: 397 # 99.3 6E-13 3.7E-16 87.6 20.2 254 1-276 138-397 (397) 142 protein:vir:4092 Length: 390 # 99.3 2.7E-12 1.7E-15 84.0 23.8 259 1-276 84-368 (390) 143 protein:vir:9927 Length: 295 # 99.3 7.3E-14 4.5E-17 92.6 14.9 252 1-276 1-288 (295) 144 protein:vir:105038 Length: 428 99.3 3.8E-12 2.3E-15 83.2 23.9 263 1-276 125-428 (428) 145 protein:vir:93881 Length: 387 99.3 6.6E-13 4.1E-16 87.4 19.2 252 1-276 118-381 (387) 146 protein:vir:9361 Length: 402 # 99.3 5.3E-13 3.3E-16 87.9 18.1 252 1-276 133-396 (402) 147 protein:vir:96978 Length: 387 99.3 4E-13 2.5E-16 88.6 17.4 252 1-276 118-381 (387) 148 protein:vir:2685 Length: 387 # 99.3 4E-13 2.5E-16 88.6 17.4 252 1-276 118-381 (387) 149 protein:vir:94424 Length: 387 99.3 4E-13 2.5E-16 88.6 17.4 252 1-276 118-381 (387) 150 protein:vir:4197 Length: 314 # 99.3 6E-12 3.7E-15 82.1 23.0 268 1-276 19-311 (314) 151 protein:vir:102082 Length: 392 99.3 4.9E-12 3E-15 82.6 22.2 259 1-276 106-386 (392) 152 protein:vir:107593 Length: 392 99.3 4.9E-12 3E-15 82.6 22.2 259 1-276 106-386 (392) 153 protein:vir:105004 Length: 392 99.3 4.9E-12 3E-15 82.6 22.2 259 1-276 106-386 (392) 154 protein:vir:102873 Length: 392 99.3 4.9E-12 3E-15 82.6 22.2 259 1-276 106-386 (392) 155 protein:vir:7855 Length: 497 # 99.3 9.4E-12 5.8E-15 81.0 23.5 265 1-276 151-493 (497) 156 protein:vir:101650 Length: 497 99.3 9.4E-12 5.8E-15 81.0 23.5 265 1-276 151-493 (497) 157 protein:vir:95875 Length: 401 99.3 4E-13 2.5E-16 88.6 15.9 272 1-276 19-400 (401) 158 protein:vir:8420 Length: 477 # 99.2 7.2E-12 4.5E-15 81.7 19.8 269 1-276 157-474 (477) 159 protein:vir:78387 Length: 349 99.2 7.8E-12 4.8E-15 81.5 19.6 267 1-276 1-318 (349) 160 protein:vir:79928 Length: 393 99.2 2.7E-12 1.7E-15 84.0 16.3 266 1-276 74-385 (393) 161 protein:vir:9875 Length: 296 # 99.1 2E-12 1.2E-15 84.7 14.1 251 1-276 1-295 (296) 162 protein:vir:94989 Length: 349 99.1 2.8E-11 1.8E-14 78.4 20.2 263 1-276 1-318 (349) 163 protein:vir:106647 Length: 303 99.1 1.2E-11 7.5E-15 80.4 15.2 258 1-276 1-302 (303) 164 protein:vir:4159 Length: 315 # 99.1 1.7E-10 1.1E-13 74.1 20.8 266 1-275 19-315 (315) 165 protein:vir:95963 Length: 395 99.0 1.8E-10 1.1E-13 74.0 20.3 254 1-276 86-376 (395) 166 protein:vir:9643 Length: 377 # 99.0 2.3E-10 1.4E-13 73.4 20.5 255 1-276 82-377 (377) 167 protein:vir:9509 Length: 381 # 99.0 2.7E-10 1.7E-13 73.1 18.6 254 1-276 76-371 (381) 168 protein:vir:101291 Length: 381 99.0 2.7E-10 1.7E-13 73.1 18.6 254 1-276 76-371 (381) 169 protein:vir:3158 Length: 321 # 99.0 6.5E-10 4E-13 70.9 20.3 264 1-276 24-312 (321) 170 protein:vir:80128 Length: 466 98.9 4.9E-10 3E-13 71.6 18.9 259 1-276 154-448 (466) 171 protein:vir:100632 Length: 381 98.9 9E-10 5.6E-13 70.2 19.2 256 1-276 80-372 (381) 172 protein:vir:78350 Length: 383 98.8 2.7E-09 1.6E-12 67.6 19.3 254 1-276 83-376 (383) 173 protein:vir:98635 Length: 377 98.8 3.4E-09 2.1E-12 67.0 19.7 254 1-276 79-377 (377) 174 protein:vir:93696 Length: 364 98.7 1.2E-08 7.7E-12 63.9 19.9 273 1-276 1-361 (364) 175 protein:vir:10123 Length: 404 98.6 6.9E-08 4.3E-11 59.8 19.6 272 1-276 22-403 (404) 176 protein:vir:819 Length: 404 # 98.6 6.9E-08 4.3E-11 59.8 19.6 272 1-276 22-403 (404) 177 protein:vir:104439 Length: 404 98.6 6.9E-08 4.3E-11 59.8 19.6 272 1-276 22-403 (404) 178 protein:vir:3298 Length: 404 # 98.6 6.9E-08 4.3E-11 59.8 19.6 272 1-276 22-403 (404) 179 protein:vir:2770 Length: 318 # 98.5 1.4E-07 8.7E-11 58.1 19.7 227 1-237 22-318 (318) 180 protein:vir:95131 Length: 325 98.4 3.5E-07 2.2E-10 56.0 19.2 263 1-276 1-296 (325) 181 protein:vir:105610 Length: 430 98.4 2.3E-07 1.4E-10 57.0 17.8 272 1-276 1-424 (430) 182 protein:vir:3969 Length: 287 # 98.1 1.2E-06 7.2E-10 53.1 16.6 268 1-276 1-286 (287) 183 protein:vir:97255 Length: 310 98.0 6.1E-06 3.8E-09 49.2 21.0 259 1-275 1-310 (310) 184 protein:vir:97397 Length: 517 97.9 5.9E-06 3.7E-09 49.2 17.6 255 1-276 237-514 (517) 185 protein:vir:96792 Length: 315 97.8 1.6E-05 1E-08 46.9 20.3 260 1-276 1-283 (315) 186 protein:vir:8324 Length: 410 # 97.7 5.5E-06 3.4E-09 49.4 14.4 255 1-274 127-410 (410) 187 protein:vir:94933 Length: 330 97.7 2.7E-05 1.7E-08 45.7 19.5 260 1-276 25-329 (330) 188 protein:vir:79548 Length: 652 97.6 4E-05 2.5E-08 44.7 19.4 264 1-273 359-652 (652) 189 protein:vir:98871 Length: 314 97.5 2.3E-05 1.4E-08 46.0 15.4 268 1-276 21-311 (314) 190 protein:vir:94528 Length: 286 97.5 3.2E-05 2E-08 45.3 16.0 252 1-276 1-284 (286) 191 protein:vir:107687 Length: 319 97.3 8.9E-05 5.5E-08 42.8 19.5 260 1-274 24-319 (319) 192 protein:vir:95512 Length: 693 97.3 9.1E-05 5.6E-08 42.7 17.1 258 1-276 394-691 (693) 193 protein:vir:103285 Length: 296 96.9 0.00025 1.6E-07 40.3 19.9 260 1-274 1-296 (296) 194 protein:vir:80068 Length: 301 96.9 0.00029 1.8E-07 40.0 21.0 263 1-274 1-301 (301) 195 protein:vir:4074 Length: 480 # 96.7 9.6E-05 5.9E-08 42.6 11.2 260 1-276 184-477 (480) 196 protein:vir:107882 Length: 307 96.0 0.0011 6.7E-07 36.8 15.5 271 1-276 2-307 (307) 197 protein:vir:94070 Length: 339 95.9 0.0012 7.7E-07 36.5 16.1 259 1-274 49-339 (339) 198 protein:vir:104342 Length: 314 95.7 0.0016 9.9E-07 35.9 17.6 261 1-275 1-314 (314) 199 protein:vir:79078 Length: 307 95.4 0.0021 1.3E-06 35.2 14.3 269 1-276 1-307 (307) 200 protein:vir:79642 Length: 329 95.3 0.0024 1.5E-06 34.9 19.4 262 1-276 31-328 (329) 201 protein:vir:103886 Length: 302 92.6 0.011 6.7E-06 31.4 20.7 257 1-276 1-302 (302) 202 protein:vir:99424 Length: 360 92.2 0.012 7.6E-06 31.1 19.5 263 1-276 1-358 (360) 203 protein:vir:348 Length: 321 # 87.6 0.038 2.3E-05 28.4 17.5 265 1-274 1-321 (321) 204 protein:vir:99888 Length: 309 84.9 0.057 3.5E-05 27.4 13.0 265 1-276 1-297 (309) 205 protein:vir:3643 Length: 336 # 82.2 0.079 4.9E-05 26.6 13.1 259 1-274 34-336 (336) 206 protein:vir:78558 Length: 336 80.6 0.094 5.8E-05 26.2 13.4 258 1-274 45-336 (336) 207 protein:vir:96490 Length: 348 80.0 0.099 6.1E-05 26.1 19.9 268 1-276 1-347 (348) 208 protein:vir:101557 Length: 336 78.7 0.11 6.9E-05 25.8 13.4 258 1-274 45-336 (336) 209 protein:vir:78148 Length: 123 78.4 0.078 4.8E-05 26.7 7.5 107 167-276 1-123 (123) 210 protein:vir:107732 Length: 379 71.9 0.19 0.00012 24.5 15.0 259 1-274 74-379 (379) 211 protein:vir:4786 Length: 295 # 70.1 0.21 0.00013 24.3 14.2 245 1-258 1-295 (295) 212 protein:vir:99576 Length: 388 69.7 0.22 0.00014 24.2 12.1 260 1-274 76-388 (388) 213 protein:vir:8843 Length: 317 # 67.7 0.25 0.00015 23.9 18.8 257 1-276 1-316 (317) 214 protein:vir:106734 Length: 336 67.6 0.25 0.00015 23.9 13.6 258 1-274 45-336 (336) 215 protein:vir:5942 Length: 523 # 58.3 0.42 0.00026 22.7 14.5 268 1-276 162-523 (523) 216 protein:vir:5255 Length: 304 # 57.4 0.44 0.00027 22.6 18.0 267 1-273 1-304 (304) 217 protein:vir:106590 Length: 349 54.9 0.49 0.00031 22.3 19.0 267 1-274 1-349 (349) 218 protein:vir:4902 Length: 348 # 52.3 0.56 0.00035 22.0 19.8 267 1-276 1-347 (348) 219 protein:vir:270 Length: 341 # 50.9 0.6 0.00037 21.8 10.9 261 1-276 29-334 (341) 220 protein:vir:1991 Length: 305 # 47.3 0.71 0.00044 21.4 14.0 199 1-213 1-305 (305) 221 protein:vir:2736 Length: 348 # 45.9 0.75 0.00047 21.3 20.3 268 1-276 1-347 (348) 222 protein:vir:10324 Length: 320 36.0 1.2 0.00074 20.2 19.2 257 4-276 1-318 (320) 223 protein:vir:98480 Length: 348 31.8 1.5 0.00091 19.7 21.5 267 1-275 1-348 (348) 224 protein:vir:79399 Length: 455 27.4 1.9 0.0011 19.1 10.6 263 1-276 45-366 (455) 225 protein:vir:15 Length: 472 # N 26.7 1.9 0.0012 19.0 9.8 257 1-276 52-362 (472) 226 protein:vir:393 Length: 341 # 26.0 2 0.0012 18.9 21.8 266 3-274 1-341 (341) 227 protein:vir:96079 Length: 382 23.8 2.3 0.0014 18.6 17.1 261 1-274 73-382 (382) No 1 >protein:vir:102605 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_655002;genbank:gi:109392192;genbank:GeneID:4157227 Probab=100.00 E-value=1e-63 Score=365.95 Aligned_cols=272 Identities=31% Similarity=0.467 Sum_probs=247.9 Q ss_pred CccchhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCcccceeecCCCCCCCccccccceEEEEEEe Q lcl|NC_019506. 1 MAVTSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGAITVKEYTENSDIDAPEELSTTEKVLEINK 80 (276) Q Consensus 1 MA~~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~ld~ 80 (276) ||+++|+||+|++++++.|++.+++.++++++|+.+. ++||||+||+++.+++.||++.+++...+++++++++++||+ T Consensus 1 MA~~~~~pe~~~~~v~~~~~~~lv~~~l~~~~~~~~~-~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~tid~ 79 (273) T protein:vir:10 1 MAFNNFIPELWSDMLLEEWTAQTVFANLVNREYEGTA-SKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLLIDQ 79 (273) T ss_pred CcchhhhHHHHHHHHHHHHHhhhccchhhcccccccc-ccCceEEEeecccccccccccCCCccCccccccceEEEEEee Confidence 9999999999999999999999999999999999875 569999999999999999987776666688999999999999 Q ss_pred eeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccCCHHHHHHHHHHHHHHHhhcCCC Q lcl|NC_019506. 81 QKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSKLKPAATLDKTNIYEELIKVKVKLDEKNVP 160 (276) Q Consensus 81 ~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~~~l~~~~vP 160 (276) ++++++.|+|.|+.+.++++. +++++++++||+++|+++++++..+.... ..+++.++.++++.|.+|+++|++++|| T Consensus 80 ~~~~~~~i~d~d~~~~~~~~~-~~~~~~~~alA~~vD~~i~~~~~~a~~~~-~~~~~~~~~~~~~~i~~a~~~ld~~~vP 157 (273) T protein:vir:10 80 EKSIDFLVDDIDRVQVAGSLE-AYTRAGATALATDTDKFIADMLVDNGTAL-TGSAPTDADDAFDLIAKALKELTKANVP 157 (273) T ss_pred eeecceEeecHHHhhhhccHH-HHHHHHHHHHHHHHHHHHHHHHhcccccc-ccccccchhHHHHHHHHHHHHhhhcCCC Confidence 999999999999999999975 59999999999999999999998765443 3455667888999999999999999999 Q ss_pred ccCCEEEECHHHHHHHhhhHHhh-hhcccccccceeeeeeeEEeceEEEEeccccccccceEEEEEecceEEeeeeeeee Q lcl|NC_019506. 161 TIGRFLIIPPDVHGLLLAADLIV-GTGGAMAESITKNGFVGTILGFDVYLSNNMGSLTNGTGAIAGVKMACTFAEQIVQT 239 (276) Q Consensus 161 ~~~r~~vv~p~~~~~L~~~~~~~-~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp~~~~~~~~~~~~~~a~~~~~~~~~~ 239 (276) .++|++||+|++++.|+++++++ +.+..+++..+++|.||+++||+|++|+++|... +..++++|++|+++++|+.++ T Consensus 158 ~~~R~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~i~G~~v~~s~~lp~~~-~~~~~~~~~~A~~~a~q~~~~ 236 (273) T protein:vir:10 158 NVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTD-DEQFVAFHPSAAAYVSQIDTV 236 (273) T ss_pred cCCCEEEECHHHHHHHhcchhhhhhhhccccccceeeeeeeEEeceEEEEecccccCC-ccEEEEEeccceeeeeeeehh Confidence 99999999999999999988754 4566667778999999999999999999999754 466789999999999999999 Q ss_pred eeccCcccceeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 240 EAYRMEKRFADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 240 e~~~~~~~~~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) |..|++++|+|.|+++++||++++||+++++++++.- T Consensus 237 e~~r~~~~~~~~v~~~~~yg~~v~~~~~~~~l~~~g~ 273 (273) T protein:vir:10 237 EALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred hcccCCCcceeeeeeeeeeeeeEeccceEEEEeccCC Confidence 9999999999999999999999999999999998777 No 2 >protein:vir:105822 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655767;genbank:gi:109522090;genbank:GeneID:4157630 Probab=100.00 E-value=1e-63 Score=365.95 Aligned_cols=272 Identities=31% Similarity=0.467 Sum_probs=247.9 Q ss_pred CccchhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCcccceeecCCCCCCCccccccceEEEEEEe Q lcl|NC_019506. 1 MAVTSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGAITVKEYTENSDIDAPEELSTTEKVLEINK 80 (276) Q Consensus 1 MA~~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~ld~ 80 (276) ||+++|+||+|++++++.|++.+++.++++++|+.+. ++||||+||+++.+++.||++.+++...+++++++++++||+ T Consensus 1 MA~~~~~pe~~~~~v~~~~~~~lv~~~l~~~~~~~~~-~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~tid~ 79 (273) T protein:vir:10 1 MAFNNFIPELWSDMLLEEWTAQTVFANLVNREYEGTA-SKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLLIDQ 79 (273) T ss_pred CcchhhhHHHHHHHHHHHHHhhhccchhhcccccccc-ccCceEEEeecccccccccccCCCccCccccccceEEEEEee Confidence 9999999999999999999999999999999999875 569999999999999999987776666688999999999999 Q ss_pred eeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccCCHHHHHHHHHHHHHHHhhcCCC Q lcl|NC_019506. 81 QKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSKLKPAATLDKTNIYEELIKVKVKLDEKNVP 160 (276) Q Consensus 81 ~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~~~l~~~~vP 160 (276) ++++++.|+|.|+.+.++++. +++++++++||+++|+++++++..+.... ..+++.++.++++.|.+|+++|++++|| T Consensus 80 ~~~~~~~i~d~d~~~~~~~~~-~~~~~~~~alA~~vD~~i~~~~~~a~~~~-~~~~~~~~~~~~~~i~~a~~~ld~~~vP 157 (273) T protein:vir:10 80 EKSIDFLVDDIDRVQVAGSLE-AYTRAGATALATDTDKFIADMLVDNGTAL-TGSAPTDADDAFDLIAKALKELTKANVP 157 (273) T ss_pred eeecceEeecHHHhhhhccHH-HHHHHHHHHHHHHHHHHHHHHHhcccccc-ccccccchhHHHHHHHHHHHHhhhcCCC Confidence 999999999999999999975 59999999999999999999998765443 3455667888999999999999999999 Q ss_pred ccCCEEEECHHHHHHHhhhHHhh-hhcccccccceeeeeeeEEeceEEEEeccccccccceEEEEEecceEEeeeeeeee Q lcl|NC_019506. 161 TIGRFLIIPPDVHGLLLAADLIV-GTGGAMAESITKNGFVGTILGFDVYLSNNMGSLTNGTGAIAGVKMACTFAEQIVQT 239 (276) Q Consensus 161 ~~~r~~vv~p~~~~~L~~~~~~~-~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp~~~~~~~~~~~~~~a~~~~~~~~~~ 239 (276) .++|++||+|++++.|+++++++ +.+..+++..+++|.||+++||+|++|+++|... +..++++|++|+++++|+.++ T Consensus 158 ~~~R~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~i~G~~v~~s~~lp~~~-~~~~~~~~~~A~~~a~q~~~~ 236 (273) T protein:vir:10 158 NVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTD-DEQFVAFHPSAAAYVSQIDTV 236 (273) T ss_pred cCCCEEEECHHHHHHHhcchhhhhhhhccccccceeeeeeeEEeceEEEEecccccCC-ccEEEEEeccceeeeeeeehh Confidence 99999999999999999988754 4566667778999999999999999999999754 466789999999999999999 Q ss_pred eeccCcccceeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 240 EAYRMEKRFADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 240 e~~~~~~~~~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) |..|++++|+|.|+++++||++++||+++++++++.- T Consensus 237 e~~r~~~~~~~~v~~~~~yg~~v~~~~~~~~l~~~g~ 273 (273) T protein:vir:10 237 EALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred hcccCCCcceeeeeeeeeeeeeEeccceEEEEeccCC Confidence 9999999999999999999999999999999998777 No 3 >protein:vir:7990 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817344;genbank:gi:29565772;genbank:GeneID:1258978 Probab=100.00 E-value=2.1e-63 Score=364.20 Aligned_cols=272 Identities=30% Similarity=0.463 Sum_probs=247.0 Q ss_pred CccchhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCcccceeecCCCCCCCccccccceEEEEEEe Q lcl|NC_019506. 1 MAVTSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGAITVKEYTENSDIDAPEELSTTEKVLEINK 80 (276) Q Consensus 1 MA~~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~ld~ 80 (276) ||+++|+||+|++++++.|++.+++.++++++|+.. .++||||+||+++.+++.+|.+.+++...+++++++++++||+ T Consensus 1 MA~~~~~pei~~~~v~~~~~~~lv~~~l~~~~~~~~-~~~GdTv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~tid~ 79 (273) T protein:vir:79 1 MAFNNFIPELWSDMLLEEWTAQTVFANLVNREYEGI-ASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLLIDQ 79 (273) T ss_pred CcchhhhHHHHHHHHHHHHHhhccchhhhhcccccc-ccCCcEEEEeecCcccccccccCCCccCccccccceEEEEEee Confidence 999999999999999999999999999999999875 5789999999999999999987666556688999999999999 Q ss_pred eeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccCCHHHHHHHHHHHHHHHhhcCCC Q lcl|NC_019506. 81 QKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSKLKPAATLDKTNIYEELIKVKVKLDEKNVP 160 (276) Q Consensus 81 ~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~~~l~~~~vP 160 (276) ++++++.|+|+|+.+++++++ +++++++.+||+++|+++++++..+.... +.+++.++.++++.|.+|+.+|++++|| T Consensus 80 ~~~~~~~i~d~d~~~~~~~~~-~~~~~~~~ala~~vD~~i~~~~~~a~~~~-~~~~~~~~~~~~~~i~~a~~~ld~~~vP 157 (273) T protein:vir:79 80 EKSIDFLVDDIDRVQVAGSLE-AYTRAGATALATDTDKFIADMLVDNGTAL-TGSAPSDADDAFDLIASALKELTKANVP 157 (273) T ss_pred ecccceeeccHHHHhhcccHH-HHHHHHHHHHHHHHHHHHHHHHhhccccc-ccccccchhhHHHHHHHHHHHhhhccCC Confidence 999999999999999999985 59999999999999999999997765433 3455667788899999999999999999 Q ss_pred ccCCEEEECHHHHHHHhhhHH-hhhhcccccccceeeeeeeEEeceEEEEeccccccccceEEEEEecceEEeeeeeeee Q lcl|NC_019506. 161 TIGRFLIIPPDVHGLLLAADL-IVGTGGAMAESITKNGFVGTILGFDVYLSNNMGSLTNGTGAIAGVKMACTFAEQIVQT 239 (276) Q Consensus 161 ~~~r~~vv~p~~~~~L~~~~~-~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp~~~~~~~~~~~~~~a~~~~~~~~~~ 239 (276) .++|++||+|++++.|+++++ +.+.+..+.+..+++|.||+|+||+|++|+++|..+ +..++++|++|+++++|..++ T Consensus 158 ~~~R~lvv~p~~~~~Ll~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~i~~s~~lp~~~-~~~~~a~~~~A~~~a~~~~~~ 236 (273) T protein:vir:79 158 NVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTD-DEQFVAFHPSAAAYVSQIDTV 236 (273) T ss_pred ccCcEEEECHHHHHHHhhchhhhhhhhhcccccceeeeEeeEEeceEEEecccccccC-ceEEEEEeccceeeeeehhhh Confidence 999999999999999999876 555666666778999999999999999999999754 456789999999999999999 Q ss_pred eeccCcccceeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 240 EAYRMEKRFADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 240 e~~~~~~~~~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) |..|++++|+|+|+++++||++++||+++++++++.- T Consensus 237 e~~r~~~~~~~~v~~~~~yg~~v~~p~~vv~~~~~g~ 273 (273) T protein:vir:79 237 EALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred hcccCcccceeeeeeeeeeeeEEecCceEEEEeccCC Confidence 9999999999999999999999999999999998777 No 4 >protein:vir:94622 Length: 341 # NCBI annotation: PfWMP4_37 # Family: family:all:2203 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762667;genbank:gi:115304375;genbank:GeneID:5142322 Probab=100.00 E-value=1.2e-56 Score=327.26 Aligned_cols=272 Identities=16% Similarity=0.257 Sum_probs=231.8 Q ss_pred Cc--cc------------hhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCcccceeecCCCCCCCc Q lcl|NC_019506. 1 MA--VT------------SFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGAITVKEYTENSDIDAP 66 (276) Q Consensus 1 MA--~~------------~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~ 66 (276) || |+ .|+||+|++++++.|++.++|.+++ ++|+.++ .+||||+||++|.+++.||+++..+.. T Consensus 1 ~~~~~~~~~~~~~t~~v~~fipei~s~~i~~~l~~~~v~~~~~-~d~~~~~-~~Gdtv~ip~~g~~~~~d~~~~~~i~~- 77 (341) T protein:vir:94 1 MALGNTITGPSINTQRGQQFIPEQWLSEVQMFRKAKMLDTSVV-KTWGAQV-KKGDTFHVPRISELGVEDKATDVPVGV- 77 (341) T ss_pred CcchhhhccccccchhHHHHHHHHHHHHHHHHHHhhcchhhcc-ccccccc-cCCceEEEeccCcceeeeecCCCcccc- Confidence 43 43 2889999999999999999999987 6888775 459999999999999999999887764 Q ss_pred cccccceEEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc-----c---cccC Q lcl|NC_019506. 67 EELSTTEKVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSKLK-----P---AATL 138 (276) Q Consensus 67 ~~~~~~~~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~~~-----~---~~~~ 138 (276) +++++++++++||+++++++.|+|+|+.++++|++++++++++++||+++|+++++++...+..... . .+.. T Consensus 78 ~~~~~~~~~itiD~~~~~~~~i~d~d~~~~~~d~~~~~~~~~~~aLA~~~D~~i~~~~a~~~~~~~~~~~~~~~~~~t~~ 157 (341) T protein:vir:94 78 QPVNDTDFVITVDTDRTTAVALDDLLEIQASYDLRAPYLEAMGYALAKDMTGSILGLRAAVQNTASQNVFSSSNGAITGN 157 (341) T ss_pred ccccCceEEEEEeeeeecceeechHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCccccCccccccCc Confidence 7899999999999999999999999999999999999999999999999999999988665422110 0 1111 Q ss_pred CHHHHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEecccccccc Q lcl|NC_019506. 139 DKTNIYEELIKVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSNNMGSLTN 218 (276) Q Consensus 139 t~~~~~~~i~~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp~~~~ 218 (276) .....++.|.+|++.|++++||.++|++||+|++++.|+++++|.+++..+ +..+++|.||+++||+|++|+++|..+. T Consensus 158 ~~~~~~~~i~~a~~~Lde~~VP~~gR~lvv~P~~~~~Ll~~~~~~~~~~~g-~~~l~~G~ig~i~G~~V~~Sn~lp~~~~ 236 (341) T protein:vir:94 158 GQAFSFAVFLAARRLLLEADVPEEKIVLLISPGQESALFTIPQFISKDFIN-NAPIAQGQIGSLMGVRVIRTSLIGNNSA 236 (341) T ss_pred hhhhhHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHhhchhhhhhhccc-cchhheeeeeeEeceEEEEecccccccc Confidence 223457899999999999999999999999999999999999999987654 5678999999999999999999996432 Q ss_pred c---------------------------------eEEEEEecceEEeee------------eeeeeeeccCcccceeeEE Q lcl|NC_019506. 219 G---------------------------------TGAIAGVKMACTFAE------------QIVQTEAYRMEKRFADAVK 253 (276) Q Consensus 219 ~---------------------------------~~~~~~~~~a~~~~~------------~~~~~e~~~~~~~~~~~i~ 253 (276) . ...+.+|+++++.++ +...++..+.+.+++|+|. T Consensus 237 ~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~gl~~~~~av~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~ 316 (341) T protein:vir:94 237 TGWRNGAPTIAPAEATPGFTGSRYLPKQDSFTSLPATFTGNSRPVHTAVMCHMDWAAAVVSKAPRVTQSFENREQVWLMV 316 (341) T ss_pred ccccccccceecccccccccccccccccccccccEEEEEEecccccceeeecchhhhccccccccccccchhhhhhhhhh Confidence 1 124677888887765 3345677788899999999 Q ss_pred eeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 254 GLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 254 ~~~~yg~~v~~~~~vv~~~~~~p 276 (276) +++.||++++||+++|.|+..++ T Consensus 317 ~~~~~G~~~lrp~~~v~~~~~~~ 339 (341) T protein:vir:94 317 GRQAYGARLYRPLHAVNIHTTGD 339 (341) T ss_pred hhhhhcccccCcceeEEEecCcC Confidence 99999999999999999999999 No 5 >protein:vir:78739 Length: 332 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285448;genbank:gi:148724482;genbank:GeneID:5220210 Probab=100.00 E-value=2.1e-54 Score=314.90 Aligned_cols=269 Identities=16% Similarity=0.196 Sum_probs=233.0 Q ss_pred Ccc-----------------chhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCcccceeecCCCCC Q lcl|NC_019506. 1 MAV-----------------TSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGAITVKEYTENSDI 63 (276) Q Consensus 1 MA~-----------------~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~ 63 (276) |.+ ++++ |+|++++++.|.+.++|.++++.. ...+|+|++||.+|..++.+|++|+.+ T Consensus 7 ~~~~~~~~~~~~~~~~d~~~al~l-e~~~geV~~~f~~~s~~~~~~~~r----~i~~G~tv~i~~ig~~~~~~~~~g~~l 81 (332) T protein:vir:78 7 FSLPNQANGGARNADYDVRYATAL-KLFSGEVFTAFNNASIFKGLVRSY----DLRGGKSKQFMFTGKLSAGYHTPGTPI 81 (332) T ss_pred ccCCccccCCccccccccchhhhh-hhhhhhHHHHHHHHhhhhhccccc----cccccceEEEEeccceeEeeecCCCCC Confidence 211 2455 999999999999999999998742 124699999999999999999999998 Q ss_pred CCccccccceEEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc------------ Q lcl|NC_019506. 64 DAPEELSTTEKVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSK------------ 131 (276) Q Consensus 64 ~~~~~~~~~~~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~------------ 131 (276) ....++++++++++||+.+++++.|+|.|+.++.+|++.++.++++++||+.+|+.++..+..++... T Consensus 82 ~~~~~~~~~~~~l~ID~~ky~~~~VddiD~~q~~~dl~~~~~~~~g~aLA~~~D~~i~~~l~~aa~~~~~~~~~~g~~~~ 161 (332) T protein:vir:78 82 VGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGEPGGFHV 161 (332) T ss_pred CCCCCCCCceEEEEEehhhhhHHHHHhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccCccccccccccc Confidence 66556899999999999999999999999999999999999999999999999999998886543221 Q ss_pred -ccccccCCHHHHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhh--hHHhhhhcccccccceeeee-eeEEeceEE Q lcl|NC_019506. 132 -LKPAATLDKTNIYEELIKVKVKLDEKNVPTIGRFLIIPPDVHGLLLA--ADLIVGTGGAMAESITKNGF-VGTILGFDV 207 (276) Q Consensus 132 -~~~~~~~t~~~~~~~i~~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~--~~~~~~~~~~~~~~~~~~G~-i~~~~G~~v 207 (276) .+.+..+++.+++++|.+|++.|++++||.++||+||+|++|+.|++ ++.|.+.+..+.++.+++|. |++++||+| T Consensus 162 ~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~d~~~~n~~~~~~~~~~~~g~~i~~i~G~~V 241 (332) T protein:vir:78 162 NIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKGLYSIAGIRI 241 (332) T ss_pred ccCCccccCHHHHHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHHhhcCceeeeeeccccccceecceeeeEEeeeEE Confidence 23445667888999999999999999999999999999999999998 77888888877778899986 899999999 Q ss_pred EEecccccccc--------------------ceEEEEEecceEEeeeee----eeeeeccCcccceeeEEeeeeeeeEEE Q lcl|NC_019506. 208 YLSNNMGSLTN--------------------GTGAIAGVKMACTFAEQI----VQTEAYRMEKRFADAVKGLNVFGCKVI 263 (276) Q Consensus 208 ~~s~~lp~~~~--------------------~~~~~~~~~~a~~~~~~~----~~~e~~~~~~~~~~~i~~~~~yg~~v~ 263 (276) |+|+++|..+. ...++++|++|+++++.. ..++.+|++++|+|+|+++++||++++ T Consensus 242 ~~Sn~lp~~~g~~~~~~~~~~~~n~~~~~~~~~~~~~~h~~a~~~v~~~~~~~~~t~~~~~~~~~~d~i~~~~~~G~~v~ 321 (332) T protein:vir:78 242 LKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQGDLIVGKLAMGCGSL 321 (332) T ss_pred EecCccccCcccccccccccccccccccccccceEEeecccceeeeeeeccchhhhhcccchhhhHhhhhhhhhhcCcee Confidence 99999996432 233678999999998744 356788999999999999999999999 Q ss_pred cCCeEEEEEec Q lcl|NC_019506. 264 YPDALVCLKKT 274 (276) Q Consensus 264 ~~~~vv~~~~~ 274 (276) |||++++|+++ T Consensus 322 rPe~~v~l~~a 332 (332) T protein:vir:78 322 RTSVAGSFQAA 332 (332) T ss_pred cccceEEEeeC Confidence 99999999888 No 6 >protein:vir:3364 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523335;genbank:gi:17570826;genbank:GeneID:927448 Probab=100.00 E-value=3e-52 Score=303.09 Aligned_cols=270 Identities=18% Similarity=0.176 Sum_probs=227.0 Q ss_pred Cccc---------------------hhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCcccceeecC Q lcl|NC_019506. 1 MAVT---------------------SFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGAITVKEYTE 59 (276) Q Consensus 1 MA~~---------------------~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~ 59 (276) |||+ +++ |+|++++++.|++.++|.++++.. ....|++++||++|..++.+|++ T Consensus 1 ~~~~~~~~~~~t~~g~~~~~~~~~al~i-e~~~g~V~~~f~~~s~~~~~v~~r----~~~~G~sv~i~~iG~~t~~~~~~ 75 (347) T protein:vir:33 1 MANIQGGQQIGTNQGKGQSAADKLALFL-KVFGGEVLTAFARTSVTMPRHMLR----SIASGKSAQFPVIGRTKAAYLKP 75 (347) T ss_pred CCCCccCcccccccccCCcccchHHHHH-HHHHHHHHHHHHHHHhhhhhhccc----cccccceeEeeeccceeeeeecC Confidence 6642 344 999999999999999999998742 13469999999999999999999 Q ss_pred CCCCCC-ccccccceEEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccc----c----- Q lcl|NC_019506. 60 NSDIDA-PEELSTTEKVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNA----T----- 129 (276) Q Consensus 60 ~~~~~~-~~~~~~~~~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~----~----- 129 (276) ++.+.. +++++.++.+++||+.+++++.|+|.|+.++++|++.++.++++++||+.+|+.++..+.... . T Consensus 76 g~~l~~~~~~~~~~e~~ltiD~~~y~~~~VddiD~~q~~~D~~~~~~~~~g~aLA~~~D~~i~~~l~~~~~~~~~~~~~~ 155 (347) T protein:vir:33 76 GENLDDKRKDIKHTEKVIHIDGLLTADVLIYDIEDAMNHYDVRAEYTAQLGESLAMAADGAVLAELAGLVNLPDGSNENI 155 (347) T ss_pred CCCCCCCCCCCccceEEEEechhhhhhHHHhhHHHHhcCCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccc Confidence 988754 356888999999999999999999999999999999999999999999999999986653210 0 Q ss_pred -----------cccccccc----CCHHHHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccce Q lcl|NC_019506. 130 -----------SKLKPAAT----LDKTNIYEELIKVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESIT 194 (276) Q Consensus 130 -----------~~~~~~~~----~t~~~~~~~i~~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~ 194 (276) ...+.+.. .++.++++.|.+|++.|++++||.++||+||+|++|+.|+++++|++.++ ++.+.+ T Consensus 156 ~~~~~~~~~~~~~~~tg~~~d~~~~a~~i~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~~~d~-~~~~~~ 234 (347) T protein:vir:33 156 EGLGKPTVLTLVKPTTGSLTDPVELGKAIIAQLTIARASLTKNYVPAADRTFYTTPDNYSAILAALMPNAANY-QALLDP 234 (347) T ss_pred ccccccccccccccccccccchhhhHHHHHHHHHHHHHHHhhcCCCccCcEEEeCHHHHHHHhcccccccccc-cccccc Confidence 01111111 23467899999999999999999999999999999999999999988876 456789 Q ss_pred eeeeeeEEeceEEEEeccccccccc----------------------------eEEEEEecceEEeeeeee-eeeeccCc Q lcl|NC_019506. 195 KNGFVGTILGFDVYLSNNMGSLTNG----------------------------TGAIAGVKMACTFAEQIV-QTEAYRME 245 (276) Q Consensus 195 ~~G~i~~~~G~~v~~s~~lp~~~~~----------------------------~~~~~~~~~a~~~~~~~~-~~e~~~~~ 245 (276) .+|.|++++||+||+|+++|..+.. ..++.+|++|++.++... ++|..|++ T Consensus 235 ~~G~V~~i~G~~V~~Sn~lp~~~~~~~~~~~~ag~~~~~~~~~~~~~~~a~~~~~gl~~h~~A~g~v~~~~~~~e~~r~~ 314 (347) T protein:vir:33 235 ERGTIRNVMGFEVVEVPHLTAGGAGDTREDAPADQKHAFPATSSTTVKVALDNVVGLFQHRSAVGTVKLKDLALERARRA 314 (347) T ss_pred ccceeEEEeceeEEEecccccCccccccccccccccccccCCcccceeccccceeeeeecchhheeeeeeceeeeeccch Confidence 9999999999999999999964321 124678999999998765 89999999 Q ss_pred ccceeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 246 KRFADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 246 ~~~~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) ++|+|+|+++++||++++||+++|.|+..-= T Consensus 315 ~~~~d~i~~~~~~G~~vlrP~~av~i~~~~~ 345 (347) T protein:vir:33 315 NYQADQIIAKYAMGHGGLRPEAAGAIVLPKV 345 (347) T ss_pred hhhhHhhhhhhhcCCceecccceEEEecCCC Confidence 9999999999999999999999999953222 No 7 >protein:vir:10450 Length: 344 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848297;genbank:gi:30387487;genbank:GeneID:1733971 Probab=100.00 E-value=6.4e-52 Score=301.24 Aligned_cols=270 Identities=17% Similarity=0.175 Sum_probs=225.5 Q ss_pred Cccc----------------------hhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCcccceeec Q lcl|NC_019506. 1 MAVT----------------------SFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGAITVKEYT 58 (276) Q Consensus 1 MA~~----------------------~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~ 58 (276) |||. +++ |+|++++++.|.+.++|.++++.. ...+|++++||.+|..++.+++ T Consensus 1 ma~~~~~~~~n~~~~~~~~~~~~~~al~i-e~~~geV~~~f~~~s~~~~~~~~r----~i~~g~s~~~~~iG~~~~~~~~ 75 (344) T protein:vir:10 1 MANMTGGQQLGTNQGKDVMAAGDKLALFL-KVFGGEVLTAFARTSVTTSRHMVR----SISSGKSAQFPVLGRTQAAYLA 75 (344) T ss_pred CccccccccCCcccCCccCCccchhHHHH-HHHHHHHHHHHHHHhhhcccceee----eecccceEEEEeeceeEEEeee Confidence 7764 233 999999999999999999998642 2356999999999999999999 Q ss_pred CCCCCCCc-cccccceEEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccc-----c- Q lcl|NC_019506. 59 ENSDIDAP-EELSTTEKVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATS-----K- 131 (276) Q Consensus 59 ~~~~~~~~-~~~~~~~~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~-----~- 131 (276) +|+.+... +++..++++++||+.+++.+.|+|.|+.++++|++.++.++++++||+.+|+.++..+...... . T Consensus 76 ~G~~l~~t~~~~~~~e~~l~ID~~~y~~~~VdDiD~~q~~~D~r~~~~~~~G~aLA~~~D~~i~~~la~~a~~~~~~~~~ 155 (344) T protein:vir:10 76 PGENLDDIRKDIKHTEKVITIDGLLTADVLIYDIEDAMNHYDVRSEYTSQLGESLAMAADGAVLAEIAGLCNVESQYNEN 155 (344) T ss_pred cCCCCCCCCCCcccceEEEEEcchhhhhhhhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccc Confidence 99998764 5788899999999999999999999999999999999999999999999999998776432110 0 Q ss_pred -----------cc-c-----cccCCHHHHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccce Q lcl|NC_019506. 132 -----------LK-P-----AATLDKTNIYEELIKVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESIT 194 (276) Q Consensus 132 -----------~~-~-----~~~~t~~~~~~~i~~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~ 194 (276) .. . ....++..+++.|.+|++.|++++||.++||+||+|++|+.|++++.+.+.+ +++...+ T Consensus 156 ~~g~~~~~~~~~~~~~~~~t~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~~~~-~~~~~~~ 234 (344) T protein:vir:10 156 ITGLGTATVIETTQDKTTLTDQVALGKEIIAALTKARAALTKNYVPSSDRVFYCDPDSYSAILAALMPNAAN-YAALIDP 234 (344) T ss_pred cccccccceeecccccccccchhhhHHHHHHHHHHHHHHHhhcCCCccCCEEEeChHHHHHHhhcccccccc-cccccce Confidence 00 0 1112345678999999999999999999999999999999999998887665 4667789 Q ss_pred eeeeeeEEeceEEEEeccccccc---------------------------cceEEEEEecceEEeeeeee-eeeeccCcc Q lcl|NC_019506. 195 KNGFVGTILGFDVYLSNNMGSLT---------------------------NGTGAIAGVKMACTFAEQIV-QTEAYRMEK 246 (276) Q Consensus 195 ~~G~i~~~~G~~v~~s~~lp~~~---------------------------~~~~~~~~~~~a~~~~~~~~-~~e~~~~~~ 246 (276) ++|.|++++||+||+|+++|... ....++.+|+.|++.++... ++|.+|+++ T Consensus 235 ~~G~V~~v~G~~V~~Sn~lp~~~~~~~~~~~tg~~~~~~~~~~~~~~~~~s~~~~l~~h~~A~~~v~~~~~~~e~~r~~~ 314 (344) T protein:vir:10 235 EKGSIRNVMGFEVVEVPHLTAGGAGTSREGTTGQKHAFPATKSGNDKVAKDNVIGLFMHRSAVGTVKLRDLALERARRAN 314 (344) T ss_pred eeeEEEEEeceEEEeccccccccCCcccccccCccccccCCcccceeeecceeEEEeechhhhhhhhhccceeecccchh Confidence 99999999999999999998421 01124678999999998776 899999999 Q ss_pred cceeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 247 RFADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 247 ~~~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) +|+|+|.+++.||++++|||++++++-+-- T Consensus 315 ~~~d~i~g~~~~G~~vlRPe~a~~v~~~~~ 344 (344) T protein:vir:10 315 FQADQIIAKYAMGHGGLRPEAAGAVVFKTK 344 (344) T ss_pred HHHHHHHHHhhcccceecccceEEEEeecC Confidence 999999999999999999998866553333 No 8 >protein:vir:1541 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052109;swissprot:trembl:q9t107;genbank:gi:9634035;uniprot:Q9T107;genbank:GeneID:1262383 Probab=100.00 E-value=1.8e-51 Score=298.83 Aligned_cols=271 Identities=18% Similarity=0.168 Sum_probs=225.6 Q ss_pred Cccch--------------------hhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCcccceeecCC Q lcl|NC_019506. 1 MAVTS--------------------FIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGAITVKEYTEN 60 (276) Q Consensus 1 MA~~~--------------------l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~ 60 (276) |||+. +-=|+|+.++++.|++.++|.++++.. ....|++++||++|..++.+++++ T Consensus 1 ma~~~~~~~~~t~~~~~~~~~~~~a~~ie~f~g~V~~~f~~~s~~~~~~~~~----~~~~G~sv~i~~ig~~t~~~~~~g 76 (347) T protein:vir:15 1 MANIQGGQQIGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTMPRHMLR----SIASGKSAQFPVIGRTKAAYLKPG 76 (347) T ss_pred CCccccCCccccccccCCCcchHHHHHHHHHHHHHHHHHHHhhhhhhccccc----cccccceeEeeeccceeeeeeccC Confidence 66531 223899999999999999999998642 235699999999999999999999 Q ss_pred CCCCC-ccccccceEEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccc--------- Q lcl|NC_019506. 61 SDIDA-PEELSTTEKVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATS--------- 130 (276) Q Consensus 61 ~~~~~-~~~~~~~~~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~--------- 130 (276) +.+.. +++++.++.+++||+.+++++.|+|.|+.++++|++.++.++++++||+.+|+.++..+...... T Consensus 77 ~~l~~~~~~~~~~e~~ltID~~~~~~~~VddlD~~q~~~D~~~~~~~~~g~aLA~~~D~~i~~~l~~~~~~~~~~~~~~~ 156 (347) T protein:vir:15 77 ENLDDKRKDIKHTEKVIHIDGLLTADVLIYDIEDAMNHYDVRAEYTAQLGESLAMAADGAVLAELAGLVNLPDASNENIE 156 (347) T ss_pred CCCCCCCCCCccceEEEEechhhhhhHHhhhHHHHhcCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccc Confidence 88643 45688899999999999999999999999999999999999999999999999998776532110 Q ss_pred -----------cccccccC----CHHHHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhccccccccee Q lcl|NC_019506. 131 -----------KLKPAATL----DKTNIYEELIKVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITK 195 (276) Q Consensus 131 -----------~~~~~~~~----t~~~~~~~i~~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~ 195 (276) ..+.+... ....+++.|.+|+++|++++||.++||+||+|++|+.|++++.+.+.++ .+...++ T Consensus 157 ~~g~~~~~~~~~~~~~~~~~~~~~~~~i~d~~~~a~~~Lde~~VP~~gR~~vv~P~~y~~LL~~~~~~~~d~-~~~~~~~ 235 (347) T protein:vir:15 157 GLGKPTVLTLVKPTTGDLTDPVELGKAIIAQLTIARASLTKNYVPAADRTFYTTPDNYSAILAALMPNAANY-QALIDHE 235 (347) T ss_pred ccCccccccccccccccchhhhhHHHHHHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHhcccccccccc-ccccccc Confidence 00011111 2345688999999999999999999999999999999999999988776 4556799 Q ss_pred eeeeeEEeceEEEEeccccccccc----------------------------eEEEEEecceEEeeeeee-eeeeccCcc Q lcl|NC_019506. 196 NGFVGTILGFDVYLSNNMGSLTNG----------------------------TGAIAGVKMACTFAEQIV-QTEAYRMEK 246 (276) Q Consensus 196 ~G~i~~~~G~~v~~s~~lp~~~~~----------------------------~~~~~~~~~a~~~~~~~~-~~e~~~~~~ 246 (276) +|.|++++||+||+|+++|..+.+ ...+.+|++|++.++... ++|..|+++ T Consensus 236 ~G~Vg~i~G~~V~~Sn~lp~~~~t~~~~~~~~g~~~~~~~~~~~~~~~~f~~~~~l~~h~~A~g~v~~~~~~~e~~~~~~ 315 (347) T protein:vir:15 236 RGTIRNVMGFEVVEVPHLTAGGAGDTREDAPADQKHAFPATSSTTVKVALDNVVGLFQHRSAVGTVKLKDLALERARRAN 315 (347) T ss_pred ceEEEEEeceEEEecccccccccccccccccccccccccccccceeeeccccceeeeeccceeeeeEeeceeeeecccch Confidence 999999999999999999954321 124678999999998665 899999999 Q ss_pred cceeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 247 RFADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 247 ~~~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) +|+|+|.++++||++++||+++|.|+..-= T Consensus 316 ~~~d~i~~~~~~G~~vlrP~~av~~~~~~~ 345 (347) T protein:vir:15 316 YQADQIIAKYAMGHGGLRPEAAGAIVLPKV 345 (347) T ss_pred hhhhhhehhhhcCCceeccccEEEEecCCC Confidence 999999999999999999999999853222 No 9 >protein:vir:94711 Length: 347 # NCBI annotation: capsid # Family: family:all:975 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338120;genbank:gi:77118198;genbank:GeneID:3707734 Probab=100.00 E-value=8.9e-52 Score=300.45 Aligned_cols=270 Identities=19% Similarity=0.166 Sum_probs=225.6 Q ss_pred Cccc--------------------hhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCcccceeecCC Q lcl|NC_019506. 1 MAVT--------------------SFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGAITVKEYTEN 60 (276) Q Consensus 1 MA~~--------------------~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~ 60 (276) ||+. +++ |+|..++...|.+.++|.+++... ....|++++||.+|..++.++++| T Consensus 1 m~~~~~~~~~t~~g~~~~~~d~~al~i-k~f~~eV~~~f~~~s~~~~~~~~r----~i~~G~sv~i~~iG~~tv~~~t~G 75 (347) T protein:vir:94 1 MANVPGQKIGTDQGKGKSSSDALALFL-KVFAGEVLTAFTRRSVTADKHIVR----TIQNGKSAQFPVMGRTSGVYLAPG 75 (347) T ss_pred CCCCCccccccccccCCccccHHHHHH-HHHhHHHHHHHHHHHhhhcccccc----cccccceEEEecccceeeeeecCC Confidence 5543 232 788999999999999999988542 235799999999999999999999 Q ss_pred CCCCC-ccccccceEEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccc-----c--- Q lcl|NC_019506. 61 SDIDA-PEELSTTEKVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATS-----K--- 131 (276) Q Consensus 61 ~~~~~-~~~~~~~~~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~-----~--- 131 (276) +.+.. ++++++++++++||+++++.+.|+|.|+.++.+|++.++.++++++||+.+|+.++..+...+.. . T Consensus 76 ~~l~~~~~~~~~~e~~itID~~~~~~~~VddiD~~q~~~D~~~~~~~~~g~aLa~~~D~~i~~~~~~~aa~~~~~~~~~~ 155 (347) T protein:vir:94 76 ERLSDKRKGIKHTEKVITIDGLLTADVMIFDIEDAMNHYDVAGEYSNQLGEALAIAADGAVLAEMAILCNLPAASNENIA 155 (347) T ss_pred CCcCCCCCCCCcceEEEEecchhhhhHHhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccC Confidence 98743 35788899999999999999999999999999999999999999999999999998766421100 0 Q ss_pred ----------cccc----ccCCHHHHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeee Q lcl|NC_019506. 132 ----------LKPA----ATLDKTNIYEELIKVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNG 197 (276) Q Consensus 132 ----------~~~~----~~~t~~~~~~~i~~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G 197 (276) ...+ ...++..+++.|.+|++.|++++||.++||+||+|++|+.|+.++.+.+.+ +.++..+++| T Consensus 156 g~~~~s~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~~R~~vv~P~~~~~Ll~~~~~~~~~-~~~~~~~~~G 234 (347) T protein:vir:94 156 GLGTASVLEVGKKADLDTPAKLGEAIIGQLTIARAKLTSNYVPAGDRYFYTTPDNYSAILAALMPNAAN-YAALIDPETG 234 (347) T ss_pred CCcccceeeccccccccchhhhHHHHHHHHHHHHHHHhhcCCCCCCcEEEeCHHHHHHHhccchhhhhh-cccccccccc Confidence 0001 112356678999999999999999999999999999999999988877665 4556679999 Q ss_pred eeeEEeceEEEEecccccccc--------------------------------ceEEEEEecceEEeeeeee-eeeeccC Q lcl|NC_019506. 198 FVGTILGFDVYLSNNMGSLTN--------------------------------GTGAIAGVKMACTFAEQIV-QTEAYRM 244 (276) Q Consensus 198 ~i~~~~G~~v~~s~~lp~~~~--------------------------------~~~~~~~~~~a~~~~~~~~-~~e~~~~ 244 (276) .|++++||+||+|+++|..+. ...++.+|+.|++.+++.. ++|.+|+ T Consensus 235 ~Vg~i~G~~V~~Sn~lp~~~~t~~~~~~~~~~~aG~~~~~~~~~~~~~~~~~~~~~~l~~h~~A~~~v~~~~~~~e~~r~ 314 (347) T protein:vir:94 235 NIRNVMGFVVVEVPHLVQGGAGETRGDDGITIASGQKHAFPATASSDVKVTMDNVVGLFSHRSAVGTVKLRDLALERDRD 314 (347) T ss_pred ceEEEeceEEEecCcccccccccccccCcceecCcccccccccchhhhcccccceeEEEeehhhhhhhhcccccccchhc Confidence 999999999999999995211 1235678999999998887 8999999 Q ss_pred cccceeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 245 EKRFADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 245 ~~~~~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) +++|+|+|+++++||++++||+++++|+.++- T Consensus 315 ~~~~~d~i~~~~~~G~~~~rP~~a~~~~~~~A 346 (347) T protein:vir:94 315 VDAQGDLIVGKYAMGHGGLRPEAAGALVFSPA 346 (347) T ss_pred hhhHHHHhhhhhhhcCcccccceeEEEEecCC Confidence 99999999999999999999999999976644 No 10 >protein:vir:2201 Length: 345 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041998;swissprot:sw:p19726;genbank:gi:9627469;goa:P19726;uniprot:P19726;genbank:GeneID:1261026 Probab=100.00 E-value=7.2e-51 Score=295.48 Aligned_cols=271 Identities=17% Similarity=0.163 Sum_probs=227.8 Q ss_pred Cccc---------------------hhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCcccceeecC Q lcl|NC_019506. 1 MAVT---------------------SFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGAITVKEYTE 59 (276) Q Consensus 1 MA~~---------------------~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~ 59 (276) ||+. .|.-|+|+.++++.|.+.++|.++++. . ....|++++||++|..++.++++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~al~le~f~geV~~~f~~~s~~~~~~~~-r---~i~~gks~~~~~iG~~~~~~~~~ 76 (345) T protein:vir:22 1 MASMTGGQQMGTNQGKGVVAAGDKLALFLKVFGGEVLTAFARTSVTTSRHMV-R---SISSGKSAQFPVLGRTQAAYLAP 76 (345) T ss_pred CcccccchhcccccccccccCCchhHHHHHHHhHHHHHHHHHHhhhccccee-e---eccccceEEEeeecceEEEeeec Confidence 4331 133399999999999999999999864 1 23469999999999999999999 Q ss_pred CCCCCCc-cccccceEEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc------- Q lcl|NC_019506. 60 NSDIDAP-EELSTTEKVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSK------- 131 (276) Q Consensus 60 ~~~~~~~-~~~~~~~~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~------- 131 (276) |+.+... .++..++.+++||+.+++.+.|+|.|+.++++|++.++.++++++||+.+|+.++..+...+... T Consensus 77 G~~l~~~~~~~~~~e~~ltID~~~y~~~~VddiD~~q~~~D~r~~~s~~~G~aLA~~~D~~i~~~l~k~a~~~~~~~~~~ 156 (345) T protein:vir:22 77 GENLDDKRKDIKHTEKVITIDGLLTADVLIYDIEDAMNHYDVRSEYTSQLGESLAMAADGAVLAEIAGLCNVESKYNENI 156 (345) T ss_pred CCCCCCCCCCcccceEEEEecchhhhhhhHhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccc Confidence 9998653 45777899999999999999999999999999999999999999999999999987664321100 Q ss_pred -----------cccc-----ccCCHHHHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhccccccccee Q lcl|NC_019506. 132 -----------LKPA-----ATLDKTNIYEELIKVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITK 195 (276) Q Consensus 132 -----------~~~~-----~~~t~~~~~~~i~~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~ 195 (276) ...+ ...++.+++++|.+|++.|++++||.++||+||+|++|+.|++++.+.+.++ ++....+ T Consensus 157 ~~~~~~~~~~~~~~g~~~t~~~~~~~~~~~ai~~a~~~Lde~~VP~~~R~~vv~P~~y~~Ll~~~~~~~~~~-~~~~~~~ 235 (345) T protein:vir:22 157 EGLGTATVIETTQNKAALTDQVALGKEIIAALTKARAALTKNYVPAADRVFYCDPDSYSAILAALMPNAANY-AALIDPE 235 (345) T ss_pred cccccccccccccccccccccccCHHHHHHHHHHHHHHhhhcCCCccCCEEEeChHHHHHHhcccccccccc-ccccccc Confidence 0001 1234667899999999999999999999999999999999999998877664 5666789 Q ss_pred eeeeeEEeceEEEEecccccccc----------------------------ceEEEEEecceEEeeeeee-eeeeccCcc Q lcl|NC_019506. 196 NGFVGTILGFDVYLSNNMGSLTN----------------------------GTGAIAGVKMACTFAEQIV-QTEAYRMEK 246 (276) Q Consensus 196 ~G~i~~~~G~~v~~s~~lp~~~~----------------------------~~~~~~~~~~a~~~~~~~~-~~e~~~~~~ 246 (276) +|.|++++||+||+|+++|.... ...++.+|+.|++++++.. ++|.+|+++ T Consensus 236 ~G~V~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~l~~h~~A~~~v~~~~~~~e~~r~~~ 315 (345) T protein:vir:22 236 KGSIRNVMGFEVVEVPHLTAGGAGTAREGTTGQKHVFPANKGEGNVKVAKDNVIGLFMHRSAVGTVKLRDLALERARRAN 315 (345) T ss_pred cceEEEEeceEEEecccccccccCccccCcccccccccccccceeeeeccCceEEEEEehhheeeeeeecceeeeeechh Confidence 99999999999999999984211 1235778999999998876 899999999 Q ss_pred cceeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 247 RFADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 247 ~~~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) +|+|+|.+++.||++++||+++++++...- T Consensus 316 ~~~d~I~~~~a~G~~vlRPeaa~~i~~~~~ 345 (345) T protein:vir:22 316 FQADQIIAKYAMGHGGLRPEAAGAVVFKVE 345 (345) T ss_pred HHHHHHHHHHhcCCcccccceeEEEEEeeC Confidence 999999999999999999999999886666 No 11 >protein:vir:8885 Length: 347 # NCBI annotation: major capsid protein A # Family: family:all:975 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813774;genbank:gi:29366729;genbank:GeneID:1258837 Probab=100.00 E-value=1.2e-50 Score=294.30 Aligned_cols=271 Identities=18% Similarity=0.165 Sum_probs=226.8 Q ss_pred Cccc--------------------hhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCcccceeecCC Q lcl|NC_019506. 1 MAVT--------------------SFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGAITVKEYTEN 60 (276) Q Consensus 1 MA~~--------------------~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~ 60 (276) |||. .|.-|+|++++++.|++.++|.++++.. ...+|++++||++|..++.+++++ T Consensus 1 ~a~~~~~~~~~~~~g~~~~~~d~~al~ie~~~geV~~~f~~~s~~~~~~~~r----~i~~G~sv~~~~iG~~~~~~~~~g 76 (347) T protein:vir:88 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVR----TIQNGKSASFPVMGRTKGYYLAPG 76 (347) T ss_pred CCCcccchhhhccCCCCccccchHHHHHHHHHHHHHHHHHHHhhhhhccccc----cccCcceEEEeeecceeeeeeccc Confidence 7652 1223999999999999999999998642 235799999999999999999999 Q ss_pred CCCCCc-cccccceEEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc-------- Q lcl|NC_019506. 61 SDIDAP-EELSTTEKVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSK-------- 131 (276) Q Consensus 61 ~~~~~~-~~~~~~~~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~-------- 131 (276) ..+..+ +++.+++++++||+.+++++.|+|.|+.+.++|+++++.++++++||+.+|+.++..+...+... T Consensus 77 ~~l~~~~~~~~~~~~~i~ID~~~y~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLA~~~D~~i~~~l~~~a~~~~~~~~~~~ 156 (347) T protein:vir:88 77 ENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIA 156 (347) T ss_pred cCCCCCCCCCccceEEEEEechhhhhhhhhhHHHHhhcCCchHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccC Confidence 987654 56888999999999999999999999999999999999999999999999999987664332100 Q ss_pred ----------cccc----ccCCHHHHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeee Q lcl|NC_019506. 132 ----------LKPA----ATLDKTNIYEELIKVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNG 197 (276) Q Consensus 132 ----------~~~~----~~~t~~~~~~~i~~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G 197 (276) ++.+ ....+..+++.|.+|++.|++++||.++||+||+|++|+.|++++.+...++ .+...+++| T Consensus 157 g~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~~~~~-~~~~~~~~G 235 (347) T protein:vir:88 157 GLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANY-AALIDPETG 235 (347) T ss_pred CccccccccccccccccchhhhHHHHHHHHHHHHHHHhhcCCCCCCCEEEeCHHHHHHHhcchhhhhhhh-ccccchhcc Confidence 0000 1123455689999999999999999999999999999999999887766554 455678999 Q ss_pred eeeEEeceEEEEeccccccccc-------------------------------eEEEEEecceEEeeeeee-eeeeccCc Q lcl|NC_019506. 198 FVGTILGFDVYLSNNMGSLTNG-------------------------------TGAIAGVKMACTFAEQIV-QTEAYRME 245 (276) Q Consensus 198 ~i~~~~G~~v~~s~~lp~~~~~-------------------------------~~~~~~~~~a~~~~~~~~-~~e~~~~~ 245 (276) .|++++||+|++|+++|....+ ...+.+|++|++.++..+ ++|.+|++ T Consensus 236 ~vg~i~G~~V~~s~nlp~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~d~~~~~~l~~~~~a~g~v~~~d~~~e~~r~~ 315 (347) T protein:vir:88 236 NIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARRP 315 (347) T ss_pred eeeeeccceEEEeecccccccccccccccccccccccccccccccccccccCcEEEEEechhhhhheecccceeeeeech Confidence 9999999999999999953221 123668899999997766 79999999 Q ss_pred ccceeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 246 KRFADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 246 ~~~~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) ++|+|+|.+++.||++++|||++++++.+.+ T Consensus 316 ~~~~d~i~~~~~~G~~~~rPe~a~~~~~~~a 346 (347) T protein:vir:88 316 EFQADQIIGKYAMGHGGLRPEAAGALVFTPA 346 (347) T ss_pred hhHHHHhhhhhhhcCceeccceEEEEEeCCC Confidence 9999999999999999999999999988877 No 12 >protein:vir:99075 Length: 392 # NCBI annotation: gp30 # Family: family:all:10837 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655895;genbank:gi:109521467;genbank:GeneID:4158040 Probab=100.00 E-value=5.1e-49 Score=285.33 Aligned_cols=271 Identities=14% Similarity=0.104 Sum_probs=213.2 Q ss_pred CccchhhHHHHHHHHHHHHHHhhcchhhhcccccccccc-CCcEEEEeccCcccceeecCC----CCCCCccccccceEE Q lcl|NC_019506. 1 MAVTSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKA-YGDTVKINQIGAITVKEYTEN----SDIDAPEELSTTEKV 75 (276) Q Consensus 1 MA~~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~-~Gdtv~ip~~~~~~~~d~~~~----~~~~~~~~~~~~~~~ 75 (276) |||++|+||+|++++++.|++.++|.++++|+|++++.. .||||+||+++.+++.+|+.. ++....+++.+++++ T Consensus 1 Ma~~~~~p~~~a~~~l~~l~~~lv~~~lv~~~~~~~~~~~~GdtV~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (392) T protein:vir:99 1 MANAFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLTVSDFTEDSFP 80 (392) T ss_pred CccccccHHHHHHHHHHHHHhhccchhhhccccccccccCCCCeEEEeecccccceeeeccccccCCcccccccccceEE Confidence 999999999999999999999999999999999999874 699999999999999988642 222345789999999 Q ss_pred EEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccCCHHHHHHHHHHHHHHHh Q lcl|NC_019506. 76 LEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSKLKPAATLDKTNIYEELIKVKVKLD 155 (276) Q Consensus 76 ~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~~~l~ 155 (276) ++||+++++++.|+|+|..+.+.|++++++++++++||+++|+++++++..+...........++.+.|+.|.+|+++|+ T Consensus 81 ~~id~~k~~~~~i~d~e~~~~~~~~~~~~~~~a~~ala~~vd~~i~~~~~~a~~~~~~~~~~~~~~~~~~~i~~a~~~L~ 160 (392) T protein:vir:99 81 VTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEAAGAVHEVAPDEFFKGVNGARRALN 160 (392) T ss_pred EEEeeeeecceeechHHHhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccChhhhHHHHHHHHHHHh Confidence 99999999999999999999999999999999999999999999999998877777777777888899999999999999 Q ss_pred hcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccc--cceeeeeeeEEeceEEEEeccccccccceEEEEEecceEEee Q lcl|NC_019506. 156 EKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAE--SITKNGFVGTILGFDVYLSNNMGSLTNGTGAIAGVKMACTFA 233 (276) Q Consensus 156 ~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~--~~~~~G~i~~~~G~~v~~s~~lp~~~~~~~~~~~~~~a~~~~ 233 (276) +++||. ||+++++|+++..|+++++|.+.++.+.. ..+++|.||+++||+||+|+++|..+ ..++|+.++.++ T Consensus 161 ~~~vP~-~R~~vv~p~~~~~l~~~~~~~~~~~~g~~~~~~l~~G~vg~i~G~~v~~s~~~~~~t----~~a~~~~a~~~a 235 (392) T protein:vir:99 161 ELYIPQ-GRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGRIYGYEIVESTLIPHGD----AYLYHPTAFIMA 235 (392) T ss_pred hcCCCC-CCEEEEcHHHHHHHhcccceeecccccchhhhhhhcceeeeeeeeEEEeeccccccc----ceeeeccccccc Confidence 999996 89999999999999999999988776644 56899999999999999999998653 356777777665 Q ss_pred eeeeeeeec--------------------cCcccceeeEEeeeeeeeEEEcCC---eEEE-EEecCC Q lcl|NC_019506. 234 EQIVQTEAY--------------------RMEKRFADAVKGLNVFGCKVIYPD---ALVC-LKKTNP 276 (276) Q Consensus 234 ~~~~~~e~~--------------------~~~~~~~~~i~~~~~yg~~v~~~~---~vv~-~~~~~p 276 (276) ......... .+....++.......+|.+.+... +... .+.+.+ T Consensus 236 t~a~v~~~~~~~~~s~s~~~~v~~~~~~~~~~t~~s~~~~v~~~~g~~~v~~~~~~~~~~~~~~~~~ 302 (392) T protein:vir:99 236 TRAPAPPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKIHLI 302 (392) T ss_pred cccccccccccceeEEecccceecceeecccceeeccccccceeEEEEEEeeccccceeeeeeeeee Confidence 432211000 011111111222233444433211 1110 111111 No 13 >protein:vir:3136 Length: 322 # NCBI annotation: hypothetical protein # Family: family:all:11728 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640318;genbank:gi:21234405;genbank:GeneID:956058 Probab=100.00 E-value=9.4e-50 Score=289.36 Aligned_cols=268 Identities=16% Similarity=0.198 Sum_probs=215.1 Q ss_pred Ccc--c------hhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCcccceeecCCCCCCCccccccc Q lcl|NC_019506. 1 MAV--T------SFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGAITVKEYTENSDIDAPEELSTT 72 (276) Q Consensus 1 MA~--~------~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~ 72 (276) |+. + +|.||+|+++++..|++.+++..+.++... ..|||||||.+|++++.||++++.++. ++++++ T Consensus 1 ~~~~n~ts~~qafi~~EiWsa~il~~l~~~Lv~~~~~~~~d~----g~GDtV~InsIg~~tV~dY~~~~~i~~-d~ltt~ 75 (322) T protein:vir:31 1 MSTGNNTSNTQALIVSEIWADEIEDILHEKLLDVNIARVVDF----PDGDKLTIPSVGTPVVRSRPEQGDFTF-DNLDTG 75 (322) T ss_pred CCCCCCcccceEEeehhhhHHHHHHHhhhhhhhhhhhccccc----CCCCeEEeccccccccccccCCCCccc-ccCCCc Confidence 872 2 366999999999999999999998775332 359999999999999999999998865 889999 Q ss_pred eEEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccc---cccc-----------ccccC Q lcl|NC_019506. 73 EKVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNAT---SKLK-----------PAATL 138 (276) Q Consensus 73 ~~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~---~~~~-----------~~~~~ 138 (276) +.+++||+.||++|.|+| |..|...+++..+.++++++|+..+|+++...++.++. ..++ ..++. T Consensus 76 ~~~l~IDq~KYfaf~VdD-D~~Qa~~dl~~~~~~~aa~ala~~~D~fva~lL~~gA~~~~~~~~p~vin~~~~~iv~~gt 154 (322) T protein:vir:31 76 EISIILRDEVYAGNAISK-KLRQDSRWISNVGAMLPAEQARAIMERYQTDLLALGNAQFAGQNDPNVINGVPHRFVGTGT 154 (322) T ss_pred eEEEEEehhhhhccccch-hHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccCCcceecCCccceeccCC Confidence 999999999999999999 99999999999999999999999999999887765442 1111 12344 Q ss_pred CHHHHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHH---------hhhHHhhhhcccccccceeeeeeeEEeceEEEE Q lcl|NC_019506. 139 DKTNIYEELIKVKVKLDEKNVPTIGRFLIIPPDVHGLL---------LAADLIVGTGGAMAESITKNGFVGTILGFDVYL 209 (276) Q Consensus 139 t~~~~~~~i~~a~~~l~~~~vP~~~r~~vv~p~~~~~L---------~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~ 209 (276) ++...|+.|++++.+|++++||.+|||+||+|.++..| +++++|......+..+.++ .||+++||+|++ T Consensus 155 ~~~~ay~~lv~l~~kLdkanVP~~gR~vVV~P~~~~~L~~i~~~~~l~~D~rf~~i~~sG~a~g~~--~Vg~~~GF~V~~ 232 (322) T protein:vir:31 155 DQTMDVTDFSRVNYVMTQSKMPMGGMIGIIDPSVAHHLETITNISNISNNPRWEGIVESGIAPDMQ--FVRSVYGIDLFV 232 (322) T ss_pred CchhhHHHHHHHHHHhccccCCCCCeEEEeCchhhhhhhhhhhhhhhhccccccccccccchhhHH--HHHHHhceeeee Confidence 55678999999999999999999999999999997755 5566666544443322221 389999999999 Q ss_pred eccccccc-----cceE--EE-------E-----EecceEEeeeeeeeeeeccCcccceeeEEeeeeeeeEEEcCCeEEE Q lcl|NC_019506. 210 SNNMGSLT-----NGTG--AI-------A-----GVKMACTFAEQIVQTEAYRMEKRFADAVKGLNVFGCKVIYPDALVC 270 (276) Q Consensus 210 s~~lp~~~-----~~~~--~~-------~-----~~~~a~~~~~~~~~~e~~~~~~~~~~~i~~~~~yg~~v~~~~~vv~ 270 (276) ||+++... +... .. . .++..++..+++.+.|++|++++|+|.++++++||++++|||.+++ T Consensus 233 SN~l~~~~~~i~aG~d~~~t~ag~~n~f~~~~~~~~~~~~~~~~~l~~~e~~r~~~~~~d~~~~~~~~g~g~~r~e~l~~ 312 (322) T protein:vir:31 233 SNLLADANETINAGGDARSTTAGKCNMFMNVSDMGLLPFVVAWKEMPTTKSFIDDYNDDLNTATTARWGNGLVRDENLVC 312 (322) T ss_pred eccccccccccccCcccccccceeecccccccchhhhhhhhHhhhhhhhhcccCccccccceeeeeeecceeecccceEE Confidence 99996321 0000 11 1 2344455567788889999999999999999999999999999999 Q ss_pred EEecCC Q lcl|NC_019506. 271 LKKTNP 276 (276) Q Consensus 271 ~~~~~p 276 (276) +.+.+- T Consensus 313 ~~a~~~ 318 (322) T protein:vir:31 313 VLANAD 318 (322) T ss_pred EEeccc Confidence 886655 No 14 >protein:vir:174 Length: 423 # NCBI annotation: capsid protein # Family: family:all:1412 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112079;genbank:gi:13559869;genbank:GeneID:920999 Probab=100.00 E-value=1.1e-47 Score=278.03 Aligned_cols=271 Identities=13% Similarity=0.106 Sum_probs=214.0 Q ss_pred Cccchh--hHHHHHHHHHHHHHHhhcchhhhccccccccc--cCCcEEEEeccCcccceeecCCCCC-CCccccccceEE Q lcl|NC_019506. 1 MAVTSF--IPKLWSARLLAHLDKAHVVANLVNRDYEGEIK--AYGDTVKINQIGAITVKEYTENSDI-DAPEELSTTEKV 75 (276) Q Consensus 1 MA~~~l--~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~--~~Gdtv~ip~~~~~~~~d~~~~~~~-~~~~~~~~~~~~ 75 (276) |||+++ +|++|++++++.|++++++.++++|+|++|+. +.||||+||+++.+++.++...... ...+++.+++++ T Consensus 1 MaN~llT~ip~iia~~al~~l~~~lV~~~lVnr~y~~e~~~~k~GDTV~I~~p~~~~~~~~~~~~~~~~~~~~l~e~~v~ 80 (423) T protein:vir:17 1 MPNNLDSNVSQIVLKKFLPGFMSDLVLAKTVDRQLLAGEINSSTGDSVSFKRPHQFSSLRTPTGDISGQNKNNLISGKAT 80 (423) T ss_pred CccchhhhhHHHHHHHHHHHHHhhcccchhhcccCCcchhhcccCCEEEEeeCCcceeecccCcccCCcccCccccceeE Confidence 999986 59999999999999999999999999999984 5899999999999999998754322 235889999999 Q ss_pred EEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccc-cccccccCCHHHHHHHHHHHHHHH Q lcl|NC_019506. 76 LEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATS-KLKPAATLDKTNIYEELIKVKVKL 154 (276) Q Consensus 76 ~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~-~~~~~~~~t~~~~~~~i~~a~~~l 154 (276) ++||++++++++++|+|..+.+.++ ++++++++++||+.+|+++++++...+.. .++++++ .+.++.|.+++++| T Consensus 81 l~id~~k~va~~v~d~E~~~~i~~~-~~~l~~A~~aLA~~vd~~ia~~~~~~a~~~~gt~~t~---~~a~~~i~~a~~~L 156 (423) T protein:vir:17 81 GRVGNYITVAVEYQQLEEAIKLNQL-EEILAPVRQRIVTDLETELAHFMMNNGALSLGSPNTP---ITKWSDVAQTASFL 156 (423) T ss_pred EEeeceeeeeeeecHHHHhcChhHH-HHHHHHHHHHHHHHHHHHHHHHHhhccccccccCCcc---cccHHHHHHHHHHH Confidence 9999999999999999999888887 89999999999999999999997654433 3333333 34589999999999 Q ss_pred hhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeee-eEEeceEEEEeccccccccceEE----------- Q lcl|NC_019506. 155 DEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFV-GTILGFDVYLSNNMGSLTNGTGA----------- 222 (276) Q Consensus 155 ~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i-~~~~G~~v~~s~~lp~~~~~~~~----------- 222 (276) ++++||.++|++|++|+++..|++++.+......+..+.+++|.| |+++||+||+|+++|.++.+... T Consensus 157 d~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~~alr~g~i~G~i~GFdvy~Snnip~~T~gt~~~t~~~~~~~~v 236 (423) T protein:vir:17 157 KDLGVNEGENYAVMDPWSAQRLADAQTGLHASDQLVRTAWENAQIPTNFGGIRALMSNGLASRTQGAFGGTLTVKTQPTV 236 (423) T ss_pred HhccCCcCCCEEEeChHHHHHHhccccceecccccchHHHhhccceeeecceEEEEeCCCccccccceeceeeecccccc Confidence 999999999999999999999998877666656667788999987 89999999999999965332210 Q ss_pred -------------------------------------------------------------------------------- Q lcl|NC_019506. 223 -------------------------------------------------------------------------------- 222 (276) Q Consensus 223 -------------------------------------------------------------------------------- 222 (276) T Consensus 237 ~~~a~~~~~~~~~~~~~~~~~~~g~l~~GD~~t~aGv~~v~~~tk~v~~~~~t~~~~~~~v~~~~~~~a~~~~tv~i~p~ 316 (423) T protein:vir:17 237 TYNAVKDSYQFTVTLTGATTSVTGFLKAGDQVKFTNTYWLQQQTKQALYNGATPISFTATVTADANSDSSGDVTVTLSGV 316 (423) T ss_pred cccccccccceeeeeeeeeeeccCceeecceEEecceeeecccccccccccccccceEEEEEecccccccCceEEEecCc Confidence Q ss_pred ------------------------------------EEEecceEEeeeeeeee------------------eeccCcccc Q lcl|NC_019506. 223 ------------------------------------IAGVKMACTFAEQIVQT------------------EAYRMEKRF 248 (276) Q Consensus 223 ------------------------------------~~~~~~a~~~~~~~~~~------------------e~~~~~~~~ 248 (276) +++|++|++++.+-... ..+++.+.. T Consensus 317 ~i~~~~~~~~~~v~a~~a~~~~vT~~~~a~~t~~~nl~~~~~a~~l~~~pl~~~~~~~~~~~~~~g~s~r~~~~~d~~~~ 396 (423) T protein:vir:17 317 PIYDTTNPQYNSVSRQVAAGDAVSVVGTASQTMKPNLFYNKFFCGLGSIPLPKLHSIDSAVATYEGFSIRVHKYADGDAN 396 (423) T ss_pred cccccCCcccccceecccCCceeeccccccCCeeEEEEecCcceEEEEEcccCCCccceeecccCCcEEEEEEecccccc Confidence 12233333332221100 011111112 Q ss_pred eeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 249 ADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 249 ~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) .+.++.+..||++.+||+-.|.+ ..+| T Consensus 397 ~~~~r~d~l~g~~~~~p~~~~~~-~g~~ 423 (423) T protein:vir:17 397 VQKMRFDLLPAYVCFNPHMGGQF-FGNP 423 (423) T ss_pred eeEEEEEeecceeeeccceEEEE-EecC Confidence 34567777799999999999888 7888 No 15 >protein:vir:80213 Length: 334 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522884;genbank:gi:158345177;genbank:GeneID:5687476 Probab=100.00 E-value=2.7e-48 Score=281.37 Aligned_cols=271 Identities=13% Similarity=0.109 Sum_probs=227.2 Q ss_pred Cccc-----------------hhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCcccceeecCCCCC Q lcl|NC_019506. 1 MAVT-----------------SFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGAITVKEYTENSDI 63 (276) Q Consensus 1 MA~~-----------------~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~ 63 (276) |++- .+-=|+|+.++.+.|.+.++|.+++... ...+|+|++||.+|..++.++++++.+ T Consensus 1 m~~~~~~~~t~~~~~~~~~~~~l~le~~~geV~~af~~~s~~~~~~~~r----~i~~G~s~~~~~iG~~~~~~~~~g~~l 76 (334) T protein:vir:80 1 MTYPAANTHTRPGWGGANSDVSLHIEEHLGLVDASFMYSSKFASWMNVR----SLRGTNQLRVDRVGASTIAGRKAGEEL 76 (334) T ss_pred CCCCcCCCccccccccccchheehhhhhhhHHHHHHHHhhhhhccceee----eccccceEEEeeecceeeeeecCCCCC Confidence 6652 1112999999999999999999998642 246799999999999999999999998 Q ss_pred CCccccccceEEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc-----------c Q lcl|NC_019506. 64 DAPEELSTTEKVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSK-----------L 132 (276) Q Consensus 64 ~~~~~~~~~~~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~-----------~ 132 (276) .. +.+.+++++++||+.+++.+.|+|.|+.++++|++.++.++++++||+++|+.++..+..++... + T Consensus 77 ~~-~~~~~~~~~l~ID~~l~~~~~VddiD~~q~~~D~rse~~~~~G~aLA~~~D~~~~~~l~kaa~~~~~~~~~~~~~~G 155 (334) T protein:vir:80 77 VV-QKNVSDKLNLTVDTVLYARHFFDKFDEWTSNLDVRKETAREDGIALARQYDQACIIQLQKCGDFLAPAHLKPAFHDG 155 (334) T ss_pred CC-CCcccCceEEEEeeeeehhhhHhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccccCC Confidence 76 77888999999999999999999999999999999999999999999999998876655332110 0 Q ss_pred ----------cccccCCHHHHHHHHHHHHHHHhhcCCCc---cCCEEEECHHHHHHHhhhHHhhhhcccc--cccceeee Q lcl|NC_019506. 133 ----------KPAATLDKTNIYEELIKVKVKLDEKNVPT---IGRFLIIPPDVHGLLLAADLIVGTGGAM--AESITKNG 197 (276) Q Consensus 133 ----------~~~~~~t~~~~~~~i~~a~~~l~~~~vP~---~~r~~vv~p~~~~~L~~~~~~~~~~~~~--~~~~~~~G 197 (276) +.....++..+++++..|++.|++++||. .+||++|+|++|+.|+.+++|.+.++.+ +...+.+| T Consensus 156 ~~~~~~~~g~~~~~~~~~~~l~~a~~~a~~~L~e~dvp~~~~~~R~~vv~P~~y~~Ll~~~r~~n~d~~~s~~~~~~~~g 235 (334) T protein:vir:80 156 ILLPSTISGLAADAAADADVLVAAHRQGVEAMVFRDLGDQLMSEGVTLLDPVIFSFLLEHDRLMNVEFGAKEGGNSFVGG 235 (334) T ss_pred cceeecccccccchhhhHHHHHHHHHHHHHHHHhcCCCCCcCCceEEEeChHHHHHHhcccccccceeccccccccccce Confidence 11122345567789999999999999994 6799999999999999999999987643 33467899 Q ss_pred eeeEEeceEEEEeccccccccc-----------------eEEEEEecceEEeeeeee-eeeeccCcccceeeEEeeeeee Q lcl|NC_019506. 198 FVGTILGFDVYLSNNMGSLTNG-----------------TGAIAGVKMACTFAEQIV-QTEAYRMEKRFADAVKGLNVFG 259 (276) Q Consensus 198 ~i~~~~G~~v~~s~~lp~~~~~-----------------~~~~~~~~~a~~~~~~~~-~~e~~~~~~~~~~~i~~~~~yg 259 (276) .|++++||+|++|+++|..... ..+.++|++|++.++... ..|.+|++++|+|+|.+++.|| T Consensus 236 ~i~~v~G~~V~~Sn~~P~~~~t~~~~g~~~~~~agd~t~~~~~~~~~~Al~t~~~~~~~~e~~~~~~~~~d~i~~~~a~G 315 (334) T protein:vir:80 236 RIAMLNGVRVVETPRFPQSAITANALGADFNVTDAEVRRKMITFIPSMALISAQVHPVSAQFWEEKKDFGHYLDTFQSYN 315 (334) T ss_pred eEEEEeceEEEeecCCCCccccccccccccccccccccceEEEEEeCceEEEEEEeecceeeeechhhHHHHHHHHHHcC Confidence 9999999999999999954211 224678999999998775 6899999999999999999999 Q ss_pred eEEEcCCeEEE--EEecCC Q lcl|NC_019506. 260 CKVIYPDALVC--LKKTNP 276 (276) Q Consensus 260 ~~v~~~~~vv~--~~~~~p 276 (276) ++++|||++++ |+.++| T Consensus 316 ~g~lRPeaa~vv~~~~~~~ 334 (334) T protein:vir:80 316 IGQRRPDAVAVHDITVTNP 334 (334) T ss_pred CceeccceEEEEEEeeecC Confidence 99999999888 458899 No 16 >protein:vir:94576 Length: 347 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919012;genbank:gi:119637776;genbank:GeneID:5179336 Probab=100.00 E-value=2.8e-48 Score=281.32 Aligned_cols=271 Identities=17% Similarity=0.166 Sum_probs=223.0 Q ss_pred Cccch--------------------hhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCcccceeecCC Q lcl|NC_019506. 1 MAVTS--------------------FIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGAITVKEYTEN 60 (276) Q Consensus 1 MA~~~--------------------l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~ 60 (276) |||.. |.-|+|++++++.|.+.++|.++++.. -..+|++++||++|..++.++++| T Consensus 1 ma~~~~~~~~~t~~g~~~~~~d~~al~ie~~~geV~~~f~~~s~~~~~~~~r----ti~~G~sv~~~~iG~~~~~~~~~G 76 (347) T protein:vir:94 1 MANMNGGQQMGKDQGKGMSAGDKLALFLKVFGGEVLTAFTRTSVTMNKHLVR----SIQSGKSAQFPVLGRTKAAYLQPG 76 (347) T ss_pred CCccccccccccccccCCcccchHHHHHHHHhHHHHHHHHHHHhhhhhhhhe----eccccceEEeeeccceeEeeeecC Confidence 66420 233999999999999999999998641 135699999999999999999999 Q ss_pred CCCCCc-cccccceEEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccc-----c--- Q lcl|NC_019506. 61 SDIDAP-EELSTTEKVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATS-----K--- 131 (276) Q Consensus 61 ~~~~~~-~~~~~~~~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~-----~--- 131 (276) +.+..+ +++..++++++||+.+++.+.|+|.|+.++++|++.++.++++++||+.+|+.++..+...+.. . T Consensus 77 ~~l~~~~~~~~~~e~~ltID~~~y~~~~VddiD~~q~~~D~rs~~~~~~g~ALA~~~D~~i~~~l~~~a~~~~~~~~~~~ 156 (347) T protein:vir:94 77 ENLDDKRKDMKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTAQLGESLAMAADGAVLAEMAKLCNLPTANNENIA 156 (347) T ss_pred cCCCCCcCCccccceEEEEcchhhhhhhhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccc Confidence 988654 5688899999999999999999999999999999999999999999999999998765432110 0 Q ss_pred c---------------cccccCCHHHHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceee Q lcl|NC_019506. 132 L---------------KPAATLDKTNIYEELIKVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKN 196 (276) Q Consensus 132 ~---------------~~~~~~t~~~~~~~i~~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~ 196 (276) + ......++.++++.|.+|+.+|++++||+++||+|++|++|+.|++...+...+. .....+.+ T Consensus 157 g~~~~~~v~i~~~~~~~~~~~~~~~~~~d~i~~a~~~Lde~dVP~~~R~~vv~P~~y~~LLk~~~~~~~~~-~~~~~~~~ 235 (347) T protein:vir:94 157 GLGKAHVLEVGDQATLQGDQVKLGQAIIAQLTLARAKLTGNYVPSSDRVFYTTPDNYSAILAALMPNAANY-QALIDPST 235 (347) T ss_pred cCCcceeEeeeccccccccccccHHHHHHHHHHHHHHhhhcCCCCCCCEEEeChHHHHHHHHhhccccccc-cccccccc Confidence 0 0011235677899999999999999999999999999999999998654444433 44456889 Q ss_pred eeeeEEeceEEEEeccccccc-------------------------------cceEEEEEecceEEeeeeee-eeeeccC Q lcl|NC_019506. 197 GFVGTILGFDVYLSNNMGSLT-------------------------------NGTGAIAGVKMACTFAEQIV-QTEAYRM 244 (276) Q Consensus 197 G~i~~~~G~~v~~s~~lp~~~-------------------------------~~~~~~~~~~~a~~~~~~~~-~~e~~~~ 244 (276) |.|++++||+||+|+++|... .+..++.+|++|++.++... .+|.+|+ T Consensus 236 G~V~~v~G~~V~~Sn~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~d~~~~~~l~~~~~A~~tv~~~~~~~e~~~~ 315 (347) T protein:vir:94 236 GSIRNVMGFEVIEVPHLTAGGAGDNRAEEGVAPTNQKHAFPDTASGDTRVALDNVVGLFNHRSAVGTVKLKDMALERARR 315 (347) T ss_pred ceeEEeeceEEEEcCccccccCcccccccccccccccccccccccccccccccceEEEEechhhhhhhhhcccceeeeec Confidence 999999999999999999532 11135778999999887665 7899999 Q ss_pred cccceeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 245 EKRFADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 245 ~~~~~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) +++|+|+|.+++.||++++|||+++++..++- T Consensus 316 ~~~~~~~i~~~~a~G~g~~rPe~a~~i~~~~a 347 (347) T protein:vir:94 316 ANFQADQIIAKYAMGHGGLRPEACGALVFKKA 347 (347) T ss_pred hhhhhhhhhhhhhhcCcccccceeEEEEecCC Confidence 99999999999999999999999997764433 No 17 >protein:vir:100057 Length: 375 # NCBI annotation: T7-like capsid protein # Family: family:all:975 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214206;genbank:gi:61806429;genbank:GeneID:3294737 Probab=100.00 E-value=1.3e-47 Score=277.66 Aligned_cols=271 Identities=19% Similarity=0.234 Sum_probs=223.1 Q ss_pred Ccc---------------------c--hhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCcccceee Q lcl|NC_019506. 1 MAV---------------------T--SFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGAITVKEY 57 (276) Q Consensus 1 MA~---------------------~--~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~~~~~d~ 57 (276) |++ + .+.-|+|+.++++.|++.+++.++++.. ....|++++||.+|..++.+| T Consensus 1 ~~~~~~~~~~~~n~~t~~~~~~~~~~~al~le~f~geV~~~f~~~si~~~~~~~r----ti~~Gksv~f~~iG~~t~~~~ 76 (375) T protein:vir:10 1 MANANQVALGRSNLSTGTGYGGATDKYALYLKLFSGEMFKGFQHETIARDLVTKR----TLKNGKSLQFIYTGRMTSSFH 76 (375) T ss_pred CccccccccCccccCCccccccccchHHHHHHHHhHHHHHHHHHHHhhhcccccc----ccccCceEEEEeeeeeEEeee Confidence 221 1 1334999999999999999999998641 234699999999999999999 Q ss_pred cCCCCCCCcc--ccccceEEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc---- Q lcl|NC_019506. 58 TENSDIDAPE--ELSTTEKVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSK---- 131 (276) Q Consensus 58 ~~~~~~~~~~--~~~~~~~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~---- 131 (276) ++|+.+..+. +...++++++||+.+++++.|+|.|+.++.+|++.++.++++++||+.+|+.++..+..++... T Consensus 77 t~G~~i~~~~~~d~~~te~~l~ID~~~y~~~~VdDiD~aqa~~Dlr~e~s~~~G~aLA~~~D~~i~~~l~kaa~~~~p~~ 156 (375) T protein:vir:10 77 TPGTPILGNADKAPPVAEKTIVMDDLLISSAFVYDLDETLAHYELRGEISKKIGYALAEKYDRLIFRSITRGARSASPVS 156 (375) T ss_pred cCCcCcCCccccCCCCCceEEEecchhhhhhhHhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccc Confidence 9999875432 5566888999999999999999999999999999999999999999999999998775432110 Q ss_pred -----------------ccccccCCHHHHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhh---HHhhhhcccccc Q lcl|NC_019506. 132 -----------------LKPAATLDKTNIYEELIKVKVKLDEKNVPTIGRFLIIPPDVHGLLLAA---DLIVGTGGAMAE 191 (276) Q Consensus 132 -----------------~~~~~~~t~~~~~~~i~~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~---~~~~~~~~~~~~ 191 (276) .......++.+++++|.+++++|++++||.++||+||+|++|+.|+++ +.+.+.+. +++ T Consensus 157 ~~~~~~~Gg~~i~~~sg~~~~~~~ta~~~~~ai~~a~~~Lde~~VP~~~R~~vv~P~~y~~Ll~~~d~~~~~n~d~-~~~ 235 (375) T protein:vir:10 157 ATNFVEPGGTQIRVGSGTNESDAFTASALVNAFYDAAAAMDEKGVSSQGRCAVLNPRQYYALIQDIGSNGLVNRDV-QGS 235 (375) T ss_pred cccccccCcceeeeccccccccccCHHHHHHHHHHHHHHHhhcCCCCCCCEEEeChHHHHHHHhcCCccceeeecc-ccc Confidence 111233578889999999999999999999999999999999999976 45666554 566 Q ss_pred cceeeeeeeEEeceEEEEecccccccc----------------------------------------------ceEEEEE Q lcl|NC_019506. 192 SITKNGFVGTILGFDVYLSNNMGSLTN----------------------------------------------GTGAIAG 225 (276) Q Consensus 192 ~~~~~G~i~~~~G~~v~~s~~lp~~~~----------------------------------------------~~~~~~~ 225 (276) +...+|.+++++||+||+|+++|..+. ...++.+ T Consensus 236 ~~~~~g~v~~i~Gv~V~~Sn~lP~~~~~~~~~g~~~~~~a~~~~~~~~~~~~~~~~~~~g~~~~y~~d~~~~~~~~~~~~ 315 (375) T protein:vir:10 236 ALQSGNGVIEIAGIHIYKSMNIPFLGKYGVKYGGTTGETSPGNLGSHIGPTPENANATGGVNNDYGTNAELGAKSCGLIF 315 (375) T ss_pred ceeccceEEEEeceEEEEeccccccccccccccccccccchhhhhccccccCCcceeeccccccccccccccCceEEEEE Confidence 788899999999999999999995432 1235778 Q ss_pred ecceEEeeeeee-eee---eccCcccceeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 226 VKMACTFAEQIV-QTE---AYRMEKRFADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 226 ~~~a~~~~~~~~-~~e---~~~~~~~~~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) |++|.+.++.+. .+| ..+...+++|+|.+++.||++++||+++|.|+..+| T Consensus 316 ~~~A~g~v~~~~~~~~~~~~~~~~~~q~~~i~~~~a~G~~~lrp~~av~l~~~~~ 370 (375) T protein:vir:10 316 QKEAAGVVEAIGPQVQVTNGDVSVIYQGDVILGRMAMGADYLNPAAAVELYIGAT 370 (375) T ss_pred chhheeeeeeeccccccccchhhheeeeeeeeeeeeeccCccCceeEEEEecCcC Confidence 999998775443 333 447899999999999999999999999999998877 No 18 >protein:vir:80930 Length: 278 # NCBI annotation: Cps # Family: family:all:522 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468392;genbank:gi:157324966;genbank:GeneID:5601363 Probab=100.00 E-value=5.9e-47 Score=274.05 Aligned_cols=267 Identities=15% Similarity=0.116 Sum_probs=231.6 Q ss_pred Cccc------hhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCccc-ceeecCCCCCCCccccccce Q lcl|NC_019506. 1 MAVT------SFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGAIT-VKEYTENSDIDAPEELSTTE 73 (276) Q Consensus 1 MA~~------~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~~~-~~d~~~~~~~~~~~~~~~~~ 73 (276) ||+. +|+||+|+++++++|++.+++.+++..+++-+ +..|++|+||+++..+ +.++.+++.++. ++++.++ T Consensus 1 Ma~~~T~~~~~iiPev~s~~v~~~~~~~~v~~~~~~~~~~l~-g~~G~tv~ip~~~~~g~a~~~~~g~~i~~-~~lt~~~ 78 (278) T protein:vir:80 1 MADLTTKLANLIDPEVMGPMISAKLPKAIKFGKIAPIDNSLE-GQPGSEITVPKYKYIGDAQDVAEGAAIDY-SALETES 78 (278) T ss_pred CCCcceehhheecHHHHHHHHHHHHHHhhhhcccceeccccc-CCCCCEEEEeeeccCCcceeecCCCcCcc-cccccce Confidence 9983 49999999999999999999999987776544 4579999999998765 778998888764 7899999 Q ss_pred EEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccCCHHHHHHHHHHHHHH Q lcl|NC_019506. 74 KVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSKLKPAATLDKTNIYEELIKVKVK 153 (276) Q Consensus 74 ~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~~~ 153 (276) .+++|++ ++..+.++|++..++..|++++..+++++++++++|.++++.+..+........+..+..+.++.|.++..+ T Consensus 79 ~~~~i~~-~~~a~~v~D~~~~~~~~d~~~~~~~~~a~~~a~~~d~~l~~~l~~a~~~~~~~~t~~~~~~~~~~~~da~~~ 157 (278) T protein:vir:80 79 VKHGIKK-AGKGVKLTDESVLSGYGDPVEEAQKQIRMAIASKVDNDILEEALTTTLEVKGAINIGLIDKIENTFTDAPDA 157 (278) T ss_pred eeEeeeh-hhccccccHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccchhhhHHHHHHHHHHh Confidence 9999977 467899999999999999999999999999999999999999988766555444455566678999999999 Q ss_pred HhhcCCCccCCEEEECHHHHHHHhhhH--HhhhhcccccccceeeeeeeEEeceEEEEeccccccccceEEEEEecceEE Q lcl|NC_019506. 154 LDEKNVPTIGRFLIIPPDVHGLLLAAD--LIVGTGGAMAESITKNGFVGTILGFDVYLSNNMGSLTNGTGAIAGVKMACT 231 (276) Q Consensus 154 l~~~~vP~~~r~~vv~p~~~~~L~~~~--~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp~~~~~~~~~~~~~~a~~ 231 (276) |++.++|. .+++++||.++..|+++. .|+..... +++.+++|.||+++||+|++|+++|. +.++.++++|++ T Consensus 158 l~~~~~~~-~~~ivv~p~~~~~L~k~~~~~~~~~~~~-g~~~~~~G~ig~~~G~~Vi~s~~~p~----~t~~l~~~gAi~ 231 (278) T protein:vir:80 158 IEDESITT-TGVLFLNYKDTAKLREEAAGSWTKASQL-GDDLLVKGAFGELLGWEIVRTKKLAD----GNALAVKAGALK 231 (278) T ss_pred hcccCCCc-ccEEEECHHHHHHHHhhhhhhccccccc-cccceeeccceeecceeEEEcCCCCc----ceEEEEecccee Confidence 99999985 568999999999999875 45544443 45688999999999999999999985 347888999998 Q ss_pred ee-eeeeeeeeccCcccceeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 232 FA-EQIVQTEAYRMEKRFADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 232 ~~-~~~~~~e~~~~~~~~~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) +. ++...+|.+|++++++|.|+++++||++++||+++|++++.|= T Consensus 232 ~~~~~~~~vE~~Rd~~~~~d~i~~~~~yg~~v~~~~~~v~it~~a~ 277 (278) T protein:vir:80 232 TFLKRNLLAESGRDMDHKLTKFNADQHYAVALVDETKAVKVVPVAG 277 (278) T ss_pred eeecCCcccccccchhhccceeeeeeEEEEEEEcCcceEEEeeccC Confidence 76 5556899999999999999999999999999999999999999 No 19 >protein:vir:108303 Length: 418 # NCBI annotation: hypothetical protein # Family: family:all:1412 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552282;genbank:gi:160700607;genbank:GeneID:5758819 Probab=100.00 E-value=5.4e-47 Score=274.26 Aligned_cols=268 Identities=17% Similarity=0.178 Sum_probs=199.3 Q ss_pred Cc---cchhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCcccceeecCCCCCCCccccccceEEEE Q lcl|NC_019506. 1 MA---VTSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGAITVKEYTENSDIDAPEELSTTEKVLE 77 (276) Q Consensus 1 MA---~~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~ 77 (276) || |++++||+|++++++.|+++++|.++++|+|++|+.+.||||+||+++.+.++|+. ... ++++++++++++ T Consensus 1 m~~~~N~~ltp~iia~~~l~~l~~~lV~~~lv~r~y~~e~~~~GDTV~I~vp~~~~v~dg~---~~~-~~~~te~~v~l~ 76 (418) T protein:vir:10 1 MAVQDNNLLTDDVIAKEALRLLKNNLVMAKCVYRNYEKTFGKVGDTIRLKLPYRVKSASGR---TLV-KQPMVDQTIPFK 76 (418) T ss_pred CCccccccccHHHHHHHHHHHHHHhccchhhhcCCCchHHhhCCCEEEEeeCCceeecccC---Ccc-ccccccceEEEE Confidence 99 56788999999999999999999999999999999999999999999998888754 333 478899999999 Q ss_pred EEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccCCHHHHHHHHHHHHHHHhhc Q lcl|NC_019506. 78 INKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSKLKPAATLDKTNIYEELIKVKVKLDEK 157 (276) Q Consensus 78 ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~~~l~~~ 157 (276) ||+++++++.|+|+|+.+.+.+++++++++++++||+.+|+++++.+...+...++.++.. +.++.|.+++++|+++ T Consensus 77 id~~k~~~~~itD~e~a~~~~d~~~~~l~~A~~aLA~~vD~~ia~l~~~a~~~~gt~gt~~---~~~~~i~~a~~~Ld~~ 153 (418) T protein:vir:10 77 IAYQEHVGLEYTVKDKTLDIMQFSERYLKSGMVQIANQIDRSLALTLKKAFHSSGTPGVRP---GAFIDFANAGAKQTTY 153 (418) T ss_pred EecccccceeechHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccCCcCc---chHHHHHHHHHHHHhc Confidence 9999999999999999999999999999999999999999999999988776555444433 4589999999999999 Q ss_pred CCCcc-CCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEeccccccccceEEE-EEecceEEeeee Q lcl|NC_019506. 158 NVPTI-GRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSNNMGSLTNGTGAI-AGVKMACTFAEQ 235 (276) Q Consensus 158 ~vP~~-~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp~~~~~~~~~-~~~~~a~~~~~~ 235 (276) +||.+ +|++|++|+++..|+++..+.. ...+.++.+++|.||+++||+||+|+++|.++.+.... .++.++...... T Consensus 154 ~VP~~G~R~lVv~P~~~~~L~~~~~~~~-~~~~~~~~lr~G~IG~i~GF~V~~S~nip~~tag~~~~t~~v~ga~~~~~~ 232 (418) T protein:vir:10 154 AVPQDGMRHAVLDPFTCASLSDEVTKLF-KESMVEQAYKMGYRGNVAAYEVYESQNLPKHTVGDHGGTPLVNGTVVNGDT 232 (418) T ss_pred CCCCCCceEEEeCHHHHHHHhhhccccc-cccccchhhheeeeeeeeceEEEEecCCCcccccccccceeeeccccccee Confidence 99987 5999999999999998877654 44566778999999999999999999999877654221 222222211111 Q ss_pred eee---eeeccCcccceeeEEeeeeeeeEEE------cCCeEEEEEec------------CC Q lcl|NC_019506. 236 IVQ---TEAYRMEKRFADAVKGLNVFGCKVI------YPDALVCLKKT------------NP 276 (276) Q Consensus 236 ~~~---~e~~~~~~~~~~~i~~~~~yg~~v~------~~~~vv~~~~~------------~p 276 (276) ... .......-..||.+...-+++.-.+ ++...++..-. .| T Consensus 233 ~~~~~~t~s~~g~l~~Gd~~ti~gv~~v~~~t~~~~~~~~~f~V~~~~~~~~~~~~tv~i~p 294 (418) T protein:vir:10 233 VGFDGGTASTTGFLKAGDVITFGGVFGVNPQNYETTGLLQEFVVLEDVDTDAGGAGSIKISP 294 (418) T ss_pred EEEeecceeeccceeeccEEEECceeecccccccccccceEEEEEeeccccccCcceeEecc Confidence 100 0011111222333322222210000 11111111110 01 No 20 >protein:vir:80180 Length: 381 # NCBI annotation: capsid protein # Family: family:all:2203 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285797;genbank:gi:148747831;genbank:GeneID:5220456 Probab=100.00 E-value=2.8e-47 Score=275.79 Aligned_cols=271 Identities=24% Similarity=0.312 Sum_probs=213.3 Q ss_pred Ccc---chhhHHHHHHHHHHHHHHhhcchhhhcc-ccccccccCCcEEEEeccCcccceeecCCCCCCCccccccceEEE Q lcl|NC_019506. 1 MAV---TSFIPKLWSARLLAHLDKAHVVANLVNR-DYEGEIKAYGDTVKINQIGAITVKEYTENSDIDAPEELSTTEKVL 76 (276) Q Consensus 1 MA~---~~l~~e~~~~~~~~~l~~~~v~~~~~~~-~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~ 76 (276) |+. +.|+||+|++++++.|++.++|.+++++ +|+ ...||||+||++|.+++.+++++..+.. ++++++++++ T Consensus 15 ~~~t~~~~fiPev~s~~v~~~l~~~lv~~~l~~~~~~~---~~~GdTV~ip~~g~~~a~d~~~g~~i~~-~~~~~~~~~i 90 (381) T protein:vir:80 15 VDLSNVQVFIPEVWSSEVRMFRDQKFAALEATKKIPFE---GKKGDLIHIPNISRAAVYDKQPQTPVNL-QARTDSEFTF 90 (381) T ss_pred cchhhHHhhhhHHHHHHHHHHHHHhhhhhhccccccce---eecCceEEeeccCcceeeeecCCCcccc-cccCCceEEE Confidence 442 3588999999999999999999998765 444 3469999999999999999999987754 7889999999 Q ss_pred EEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc----------------ccccccCCH Q lcl|NC_019506. 77 EINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSK----------------LKPAATLDK 140 (276) Q Consensus 77 ~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~----------------~~~~~~~t~ 140 (276) +||+++++++.|+|+|+.++++|+++++.++++.+||+++|++++..+....... ....+..+. T Consensus 91 tID~~~~~~~~Idd~D~~~~~~D~~~~~~~~~~~aLA~~~D~~i~~~~~~~~~~~~~~~~t~~~~i~~~~~~~~~t~~~~ 170 (381) T protein:vir:80 91 TVTKYKESSFMIEDIVNTQASYTLRQYYTKEAGYALARDMDNFALAHRAVINAFPSQRIYSYDTTLGDGTVNAHLTGTPA 170 (381) T ss_pred EEeeeeecceeechHHHHhhccChHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccccccccccccccccchh Confidence 9999999999999999999999999999999999999999999998765432110 001123345 Q ss_pred HHHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEeccccccccce Q lcl|NC_019506. 141 TNIYEELIKVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSNNMGSLTNGT 220 (276) Q Consensus 141 ~~~~~~i~~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp~~~~~~ 220 (276) ..+++.|.+|++.|++++||.++|++||+|++++.|+++++|.++++ +++..+++|.|++++||+|++|+++|...... T Consensus 171 ~~t~~~i~~a~~~Lde~~VP~egR~lvv~P~~~~~Ll~~~~~~~ad~-~~~~~l~~G~Ig~i~G~~Vv~Sn~lp~~~~t~ 249 (381) T protein:vir:80 171 PLTYAALLLAKQKLDEADVPQEGRIVMVSPAQYIDLLSINQFISVDF-SQVKPVTSGVVGTILGMEVIVTTQIGINSLTG 249 (381) T ss_pred hHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhhchhhhhhhh-ccchhhhceeeeEEcceEEEeecccccccccc Confidence 66789999999999999999999999999999999999999998875 56678999999999999999999999643221 Q ss_pred EE-EEEecc--------------------eE----------------------------------Eee------------ Q lcl|NC_019506. 221 GA-IAGVKM--------------------AC----------------------------------TFA------------ 233 (276) Q Consensus 221 ~~-~~~~~~--------------------a~----------------------------------~~~------------ 233 (276) .. .++++. |+ +.. T Consensus 250 ~~~~agap~~~~~~~~~~~~~g~~s~~a~av~~~k~yd~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 329 (381) T protein:vir:80 250 YVNGQGAPTQPTPGVLGSPYLPDQAGTANVVNTGSASDLAVSLSYFGLPVFSGAGATAADGGQTLGSFGGANRWATAVVC 329 (381) T ss_pred eeeeccccccccccccccccccccccceeeeeeeeeeceeeeeeeccceeeecceeeecCCCceeeeehhhhhhhhhccc Confidence 11 111100 00 000 Q ss_pred --------eeee-eeeeccCcccceeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 234 --------EQIV-QTEAYRMEKRFADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 234 --------~~~~-~~e~~~~~~~~~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) .+.. ..+..+..-+.+|.+.+++.||++++||.++|.|..+-- T Consensus 330 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 381 (381) T protein:vir:80 330 HPDWLAVGVQQNVKSESSRETMYLADAFVTSCVYGAKVFRPDHCVLLHTSGI 381 (381) T ss_pred ccccccccceeEeecccchhheeehhhhhhhhhhccccccchhhhhhhhcCC Confidence 0000 001223344678999999999999999999999966555 No 21 >protein:vir:105522 Length: 423 # NCBI annotation: phage major head protein # Family: family:all:1412 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516191;genbank:gi:89885994;genbank:GeneID:3964382 Probab=100.00 E-value=3.8e-46 Score=269.61 Aligned_cols=272 Identities=12% Similarity=0.096 Sum_probs=203.2 Q ss_pred Cccch--hhHHHHHHHHHHHHHHhhcchhhhccccccccc--cCCcEEEEeccCcccceeecCCC-CCCCccccccceEE Q lcl|NC_019506. 1 MAVTS--FIPKLWSARLLAHLDKAHVVANLVNRDYEGEIK--AYGDTVKINQIGAITVKEYTENS-DIDAPEELSTTEKV 75 (276) Q Consensus 1 MA~~~--l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~--~~Gdtv~ip~~~~~~~~d~~~~~-~~~~~~~~~~~~~~ 75 (276) |||++ |+|++|++++++.|++++++.++++|+|++|+. +.||||+||+++.+.+++..... ....++++.+.+++ T Consensus 1 MANsl~~l~p~iia~~al~~l~~~lV~~~lV~r~y~~ef~~ak~GDTV~I~~P~~~~~~d~~~~~~t~~~~~~l~e~~v~ 80 (423) T protein:vir:10 1 MANNLDANVSQIVLKKFLPGFMSDLVLCKTVDRQLLAGEINSSTGDSVSFKRPHQFKSERTMDGDITGKSKNSLISAKAT 80 (423) T ss_pred CccccccccHHHHHHHHHHHHHhhcccchhhccCCCccccccccCCEEEEeeCCceeeecccCcccCcccccccccceEE Confidence 99998 999999999999999999999999999999975 47999999999998888754332 22234678888999 Q ss_pred EEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccc-cccccccccCCHHHHHHHHHHHHHHH Q lcl|NC_019506. 76 LEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNA-TSKLKPAATLDKTNIYEELIKVKVKL 154 (276) Q Consensus 76 ~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~-~~~~~~~~~~t~~~~~~~i~~a~~~l 154 (276) ++||++++++++++|+|..+.+.++ ++++++++++||..+|++++..+.... ...++++++. +.++.+.+++++| T Consensus 81 l~id~~k~~a~~v~d~E~~l~i~~~-~~~l~~A~~aLA~~vd~~ia~~~~~~~~~~vgt~~t~~---~a~~~~a~a~~~L 156 (423) T protein:vir:10 81 GEVGNYITVAVEYRQIEEALKLNQL-DQILVPINERMVTDLETELALFMMKHGALSLGSPNTPI---KKWSDVAQTASFL 156 (423) T ss_pred EEecceeeeeeeeChHHHhcChhHH-HHHHHHHHHHHHHHHHHHHHHHhhhccccccccccccc---ccHHHHHHHHHHH Confidence 9999999999999999999888888 789999999999999999986665544 3333334433 3478999999999 Q ss_pred hhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeee-eEEeceEEEEeccccccccceEEEEEecceEEee Q lcl|NC_019506. 155 DEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFV-GTILGFDVYLSNNMGSLTNGTGAIAGVKMACTFA 233 (276) Q Consensus 155 ~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i-~~~~G~~v~~s~~lp~~~~~~~~~~~~~~a~~~~ 233 (276) ++.++|..+|++|++|++++.|+++..+.......+.+.+++|.| |+++||+||+|+++|.++.+..+.++|..+.... T Consensus 157 ~~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~~alr~~~i~G~~~GFdi~~Sn~vp~~T~g~~~ga~~~~~~~~v 236 (423) T protein:vir:10 157 KDLGINSGENYAVMDPWAAQRLADAQSGLHVSEQLVRTAWENAQISGNFGGIRALMSNGLASRTQGAFGGKLTVKGTPEV 236 (423) T ss_pred hhccCCcCCCEEEeCHHHHHHHhhhhhhhccccccchHHHHhcccceeecceEEEEecCCcccccccccceeeeeeeeEE Confidence 999999999999999999999998776666656667778999977 9999999999999999999888888887766654 Q ss_pred eeeeeee--eccC------cccceeeEEeeee--ee---eEEEcCC-----------eEEEE-Ee--cCC Q lcl|NC_019506. 234 EQIVQTE--AYRM------EKRFADAVKGLNV--FG---CKVIYPD-----------ALVCL-KK--TNP 276 (276) Q Consensus 234 ~~~~~~e--~~~~------~~~~~~~i~~~~~--yg---~~v~~~~-----------~vv~~-~~--~~p 276 (276) .+....+ ..+. .-.-+.+..|+.. -| .-.+..+ ..+|. .. .++ T Consensus 237 t~a~~~~~~~~~~~~~~~T~s~~g~l~~GD~~t~aGv~~v~~~tk~~l~~~~~~~~~~~~V~~~~~~~a~ 306 (423) T protein:vir:10 237 NYDSVKDSYAFTATLTGATASKKGFLKVGDQLQFDDTHWLNQQSKQTLYNGASALSFTATVMEDANAHSS 306 (423) T ss_pred EecccccccccccceeeccceeceeEEecceEeecceeeecccccceeecccCCcceEEEEEeccccccc Confidence 3322110 0000 0011223333322 12 1112221 11111 00 122 No 22 >protein:vir:105374 Length: 423 # NCBI annotation: gene 5 protein # Family: family:all:1412 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958181;genbank:gi:41057283;genbank:GeneID:2716621 Probab=100.00 E-value=3.9e-46 Score=269.51 Aligned_cols=272 Identities=13% Similarity=0.071 Sum_probs=196.4 Q ss_pred Cccchh--hHHHHHHHHHHHHHHhhcchhhhccccccccc--cCCcEEEEeccCcccceeecCCCC-CCCccccccceEE Q lcl|NC_019506. 1 MAVTSF--IPKLWSARLLAHLDKAHVVANLVNRDYEGEIK--AYGDTVKINQIGAITVKEYTENSD-IDAPEELSTTEKV 75 (276) Q Consensus 1 MA~~~l--~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~--~~Gdtv~ip~~~~~~~~d~~~~~~-~~~~~~~~~~~~~ 75 (276) |||+++ +||+|++++++.|++++++.++++|+|++|+. +.||||+||+++.+++.+++.+.. ....+++++++++ T Consensus 1 MaN~llT~~p~iia~~aL~~l~~~lV~~~lVnr~y~~ef~~~k~GDTV~I~~p~~~~~~d~~~~~~~~~~~~dl~e~~v~ 80 (423) T protein:vir:10 1 MPNNLDSNVSQIVLKKFLPGFMSDLVLAKTVDRQLLAGEINSSTGDSVSFKRPHQFSSLRTPTGDISGQNKNNLISGKAT 80 (423) T ss_pred CccchhhhhHHHHHHHHHHHHHhhcccchhhcccCCCcccccccCCEEEEeeCCceeeeccCCccccccccCccccceeE Confidence 999986 49999999999999999999999999999984 589999999999999999986432 2245889999999 Q ss_pred EEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccc-cccccccCCHHHHHHHHHHHHHHH Q lcl|NC_019506. 76 LEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATS-KLKPAATLDKTNIYEELIKVKVKL 154 (276) Q Consensus 76 ~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~-~~~~~~~~t~~~~~~~i~~a~~~l 154 (276) ++||++++++++++|+|..+.+.++ ++++++++++||+.+|+++++++...... .++++++ .+.++.|.+++++| T Consensus 81 l~id~~k~va~~v~d~E~~~~i~~~-~~~l~~A~~aLA~~vd~~ia~~~~~~~~~~~gt~~t~---~~a~~~i~~a~~~L 156 (423) T protein:vir:10 81 GRVGNYITVAVEYQQLEEAIKLNQL-EEILAPVRQRIVTDLETELAHFMMNNGALSLGSPNTP---ITKWSDVAQTASFL 156 (423) T ss_pred EEeeceeeeeeeechHHHhcChhhH-HHHHHHHHHHHHHHHHHHHHHHHhhccccccccCCcc---cchHHHHHHHHHHH Confidence 9999999999999999999888887 89999999999999999999987765433 3333333 24589999999999 Q ss_pred hhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeee-eEEeceEEEEeccccccccceEEEEEecceEEee Q lcl|NC_019506. 155 DEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFV-GTILGFDVYLSNNMGSLTNGTGAIAGVKMACTFA 233 (276) Q Consensus 155 ~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i-~~~~G~~v~~s~~lp~~~~~~~~~~~~~~a~~~~ 233 (276) ++++||..+|++|++|+++..|++++.+......+..+.+++|.| |+++||+||+|+++|.++.+.........+...+ T Consensus 157 d~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~~alr~g~i~G~i~GFdv~~Snnip~~T~gt~~~t~~~~~~~~v 236 (423) T protein:vir:10 157 KDLGVNEGENYAVMDPWSAQRLADAQTGLHASDQLVRTAWENAQIPTNFGGIRALMSNGLASRTQGAFGGTLTVKTQPTV 236 (423) T ss_pred HhccCCcCCCEEEeChHHHHHHhccccceecccccchhhhhhccceeeecceEEEEeCCCccccccccccceeeeeccee Confidence 999999999999999999999998777666656667788999987 8999999999999998876654221111000111 Q ss_pred --------eeeeeeee-----ccCcccceeeEEeeeeeeeEEEcCC-----------eEEEE-Ee--cC---------C Q lcl|NC_019506. 234 --------EQIVQTEA-----YRMEKRFADAVKGLNVFGCKVIYPD-----------ALVCL-KK--TN---------P 276 (276) Q Consensus 234 --------~~~~~~e~-----~~~~~~~~~~i~~~~~yg~~v~~~~-----------~vv~~-~~--~~---------p 276 (276) .+.+..-+ ..+.-..||.+..--++..-.++.+ ..++. .. .+ | T Consensus 237 ~~~a~~~a~~~~~~~~~~~~~~~~~l~~GD~~t~aGv~~v~~~tk~~~~~~~t~~~~~~~v~a~~~~~~~g~~tv~i~p 315 (423) T protein:vir:10 237 TYNAVKDSYQFTVTLTGATASVTGFLKAGDQVKFTNTYWLQQQTKQALYNGATPISFTATVTADANSDSGGDVTVTLSG 315 (423) T ss_pred ccccccccceeeeeeeeccccccCceeecceEEecceeeecccccccccccccCcceEEEEEeeeeeccCCceeeeccC Confidence 11110000 0111122332211111111111111 11111 01 01 1 No 23 >protein:vir:3525 Length: 423 # NCBI annotation: major head protein # Family: family:all:1412 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050985;genbank:gi:9633571;genbank:GeneID:1262318 Probab=100.00 E-value=3.5e-46 Score=269.80 Aligned_cols=273 Identities=14% Similarity=0.091 Sum_probs=199.0 Q ss_pred Cccchhh--HHHHHHHHHHHHHHhhcchhhhccccccccc--cCCcEEEEeccCcccceeecCCC-CCCCccccccceEE Q lcl|NC_019506. 1 MAVTSFI--PKLWSARLLAHLDKAHVVANLVNRDYEGEIK--AYGDTVKINQIGAITVKEYTENS-DIDAPEELSTTEKV 75 (276) Q Consensus 1 MA~~~l~--~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~--~~Gdtv~ip~~~~~~~~d~~~~~-~~~~~~~~~~~~~~ 75 (276) |||+++. ||+|++++++.|+++++|.++++|+|++|+. +.||||+||+++.++++++..+. ....++++.+++++ T Consensus 1 MAN~llT~iP~iia~~al~~l~~~lV~~~lV~r~y~ge~~~a~~GDTV~I~~p~~~~v~d~~~~~~~~~~~~~~~e~~v~ 80 (423) T protein:vir:35 1 MANNLESNISQIVLKKFLPGFMSDIVLCKTVDRQLLSGEINSNTGDSVSFKRPHQFKSERTETGDITGKDKNGLFSAKAT 80 (423) T ss_pred CccchhhhhHHHHHHHHHHHHHhhcccchhcccCCCcccccccCCCEEEEeeCCcceeecccCcCCCCccccccccceee Confidence 9999865 9999999999999999999999999999984 67999999999999999997643 22345888999999 Q ss_pred EEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccCCHHHHHHHHHHHHHHHh Q lcl|NC_019506. 76 LEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSKLKPAATLDKTNIYEELIKVKVKLD 155 (276) Q Consensus 76 ~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~~~l~ 155 (276) ++||+++|+++.++|+|..+.+.++ ++++++++++|++++|+++++.+........ ++..+..+.++.|.+++++|+ T Consensus 81 l~id~~k~~a~~v~d~e~~l~i~~~-~~~l~~a~~ala~~vd~~l~~~l~~~a~~~v--gt~~t~~~~~~~i~~a~~~Ld 157 (423) T protein:vir:35 81 GKVGKYITVAVEWTQIEEALKLNQL-DQILSPIHERMVTDLETELAHFMMNNGALSL--GSPNTAIKKWADVAQTASFIK 157 (423) T ss_pred EEeccceeccceeCHHHHHhhHHHH-HHHHHHHHHHHHHHHHHHHHHHHhhcccccc--ccccCCcchHHHHHHHHHHHH Confidence 9999999999999999999999998 6889999999999999999987765443322 233333355899999999999 Q ss_pred hcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeee-eEEeceEEEEeccccccccceEEE-EEecceEEe- Q lcl|NC_019506. 156 EKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFV-GTILGFDVYLSNNMGSLTNGTGAI-AGVKMACTF- 232 (276) Q Consensus 156 ~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i-~~~~G~~v~~s~~lp~~~~~~~~~-~~~~~a~~~- 232 (276) +.+||..+|++|++|+++..|+++..+.......+.+.+++|.| |+++||+||+|+++|.++.+.... ..+..+... T Consensus 158 ~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~~alr~g~i~G~i~GFdv~~Snnvp~~T~gt~~~~~~v~~a~~v~ 237 (423) T protein:vir:35 158 DIGIKTGENYAIMDPWSAQRLADAQSGLHAADQLVRTAWENAQISGNFGGIRALMSNGLASRKQGDFDGAITVKTAPNVD 237 (423) T ss_pred HhcCCcCCCEEEeCHHHHHHHhccccceeccccchhHHHhhccceeeecceEEEEcCCCccccccccccceeeccccccc Confidence 99999999999999999999997665444444456678999876 999999999999999887765422 222221110 Q ss_pred --e----eeeeeeeeccCcccceeeEEeee--eeeeEEEcCCeEEEE-----------Eec------CC Q lcl|NC_019506. 233 --A----EQIVQTEAYRMEKRFADAVKGLN--VFGCKVIYPDALVCL-----------KKT------NP 276 (276) Q Consensus 233 --~----~~~~~~e~~~~~~~~~~~i~~~~--~yg~~v~~~~~vv~~-----------~~~------~p 276 (276) + .+....-..+.....+.+..|+. .-|.+.+.|..-.++ +.+ ++ T Consensus 238 ~~a~~~~~~~~~~~~~~~~~~~g~l~~GD~~t~aGv~~v~~~t~~~~~~~~t~~~~~~~V~~~~~~~a~ 306 (423) T protein:vir:35 238 YLSVKDSYQFTVALTGATPSKTGFLKAGDQLKFTSTHWLNQQSKQTLYNGSTAMSFTATVLEETNSTAS 306 (423) T ss_pred cccccccccceeeeeeeeeccCCcEEecceEEeeeeeeccccccceeecccCCceeEEEEecccccccc Confidence 0 01100001111111223333332 234444433322221 111 22 No 24 >protein:vir:103323 Length: 364 # NCBI annotation: major capsid-like protein # Family: family:all:2806 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039668;genbank:gi:125999997;genbank:GeneID:4818399 Probab=100.00 E-value=2.1e-45 Score=265.56 Aligned_cols=271 Identities=14% Similarity=0.099 Sum_probs=228.5 Q ss_pred Cccc---------------hhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCcccceeecCCCCCCC Q lcl|NC_019506. 1 MAVT---------------SFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGAITVKEYTENSDIDA 65 (276) Q Consensus 1 MA~~---------------~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~ 65 (276) |++- .|.=|+|..++.+.|.+.++|.+++.. . ...+|++++||.+|..++.++++|+.+.. T Consensus 1 ms~~n~~t~~~~~~~~~~~al~le~f~geV~taf~~~s~~~~~~~~-r---ti~~gkS~q~~~iG~~~~~~~~~G~~ld~ 76 (364) T protein:vir:10 1 MSNPNVLTQPAVSASGEVDSLLIEKFNNRVHEQYLKGENLLQWFDV-Q---EVVGTNSVSNKYIGETELQVLSPGKSPDA 76 (364) T ss_pred CCCcccccccccccccchhhhhhhhhhhhHHHHHHHHHhhcCccee-e---eecccceEEeeeeeeeEEeeeccCcccCC Confidence 6542 133499999999999999999998753 1 24679999999999999999999999864 Q ss_pred ccccccceEEEEEEeeeecceeechHHHHhhhhh-HHHHHHHHHHHHHHHHHHHHHHHHhhcccccc---------c--- Q lcl|NC_019506. 66 PEELSTTEKVLEINKQKYFNFQIDDVDAAQIRTP-LMDAAMQRAAYALADETEKILLKEMDTNATSK---------L--- 132 (276) Q Consensus 66 ~~~~~~~~~~~~ld~~~~~~~~v~d~d~~~~~~d-~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~---------~--- 132 (276) +.+...+.+++||+.+++.+.|+|+|+.++++| ++.++.++++++||+.+|+.++..+..+.... . T Consensus 77 -~~~~~~k~~itID~ll~a~~~V~diDe~q~~~D~vR~e~s~e~G~ALA~~~Dq~i~~~v~~aa~a~~~~~~~~~~~~~~ 155 (364) T protein:vir:10 77 -SPTEFDKNRLVVDTTVIARNTVAHFHDVQNDIDGLKSKLSVNQAKKLKKMEDSMVIQQLVLGGISNTEAIRKNPRVAGH 155 (364) T ss_pred -CCcccCcEEEEecceeeechhhhhHHHHhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccCCcccCC Confidence 778888999999999999999999999999999 89999999999999999999987765332100 0 Q ss_pred ---------cccccCCHHHHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhccc-ccccceeeeeeeEE Q lcl|NC_019506. 133 ---------KPAATLDKTNIYEELIKVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGA-MAESITKNGFVGTI 202 (276) Q Consensus 133 ---------~~~~~~t~~~~~~~i~~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~-~~~~~~~~G~i~~~ 202 (276) ......++..++++|.+|.+.|++++||.++|+++++|++|+.|+++++|.+.++. .+...+++|+|+++ T Consensus 156 g~~i~~~~~a~~~~~~~~~l~~ai~~a~~~LdEkdVP~~~R~~vv~P~~y~~Ll~~~~lvn~d~~~~~~~~~~~G~v~~v 235 (364) T protein:vir:10 156 GFSIHIVGLASSFLTSPQYMMAAIEMAMEQQTEQEVDTSELCGLMPWTAFNCLRDADRIVDKSYTIAASDNTVDGFVLKS 235 (364) T ss_pred cceeeecccCcchhhhHHHHHHHHHHHHHHHhhcCCCccccEEEeChHHHHHHhcCCccccccccccCCCccccceeEEE Confidence 01112345567889999999999999999999999999999999999999987753 24566899999999 Q ss_pred eceEEEEeccccccc-----------------------------cceEEEEEecceEEeeeee-eeeeeccCcccceeeE Q lcl|NC_019506. 203 LGFDVYLSNNMGSLT-----------------------------NGTGAIAGVKMACTFAEQI-VQTEAYRMEKRFADAV 252 (276) Q Consensus 203 ~G~~v~~s~~lp~~~-----------------------------~~~~~~~~~~~a~~~~~~~-~~~e~~~~~~~~~~~i 252 (276) +||+|++|+++|... ....+..||+.|++.++.. ..+|.++++.++++++ T Consensus 236 ~Gv~Vv~Sn~lP~~~~~~~~t~~~t~h~ls~~~~g~~y~v~~d~~~~~~~~f~~~Al~tv~~~~~t~e~~~~~~~~~~~i 315 (364) T protein:vir:10 236 WNTPIVPSNRFPKLSDNTEGTGNTKHHKLSNAGNGNRYDVTAGQTSAQAVLFTQDALLVGRTISITGDIFYEKKEKTWYI 315 (364) T ss_pred eceEEEeccccccccccccccccccccccccccCCcccccccccceeEEEEEecceEEEEEEecceeeeeeccceeeeee Confidence 999999999999421 1233677899999999766 4789999999999999 Q ss_pred EeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 253 KGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 253 ~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) .+++.||++++||+++++++..++ T Consensus 316 da~~a~G~g~lRPeaa~~i~~~~~ 339 (364) T protein:vir:10 316 DTFLAEGAIPDRWEAVAVVTAADT 339 (364) T ss_pred eeehcccCcccCccceEEEEecCC Confidence 999999999999999999998888 No 25 >protein:vir:1239 Length: 274 # NCBI annotation: similar to phage B1 major head protein # Family: family:all:522 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510938;genbank:gi:17426272;genbank:GeneID:927376 Probab=100.00 E-value=7.1e-45 Score=262.63 Aligned_cols=260 Identities=18% Similarity=0.124 Sum_probs=225.5 Q ss_pred Cccch------hhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCcc-cceeecCCCCCCCccccccce Q lcl|NC_019506. 1 MAVTS------FIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGAI-TVKEYTENSDIDAPEELSTTE 73 (276) Q Consensus 1 MA~~~------l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~~-~~~d~~~~~~~~~~~~~~~~~ 73 (276) |||.. ++||+|++++.+++.+.++|.+++.++++-+ +.+|+||+||.+... .+.++.+++.++ +++++.++ T Consensus 1 ma~~~T~l~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~-g~~G~tv~iP~~~~ig~a~~~~~g~~i~-~~~lt~~~ 78 (274) T protein:vir:12 1 MAQGLTKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQ-GQPGDTLTFPAFVYSGDAQVVAEGEKIP-TDILETKK 78 (274) T ss_pred CCcceeehhhhhchHHHHHHHHHHHHhhhhhcccceeccccc-CCCCCEEEEeeecCCCccccccCCCccc-hhhcccce Confidence 99964 8999999999999999999999999887654 467999999999865 588899888876 47899999 Q ss_pred EEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccCCHHHHHHHHHHHHHH Q lcl|NC_019506. 74 KVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSKLKPAATLDKTNIYEELIKVKVK 153 (276) Q Consensus 74 ~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~~~ 153 (276) .+++|++ +++++.++|++..++..|++.+..++++.++++++|++++..+.++...... ....++.|.+|..+ T Consensus 79 ~~~~i~~-~~~~~~i~D~~~~~~~~d~~~~~~~q~~~~~a~~vd~~~l~~~~~a~~~~~~------~a~~~d~i~dA~~~ 151 (274) T protein:vir:12 79 REAKIRK-IAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVNA------DITKLNGLQSAIDK 151 (274) T ss_pred eeEEeee-ecceeeecHHHHHhcccchHHHHHHHHHHHHHHHHHHHHHHHHhcccccccc------cccCHHHHHHHHHH Confidence 9999976 6899999999999999999999999999999999999999999876554321 12337889999999 Q ss_pred HhhcCCCccCCEEEECHHHHHHHhhhH--HhhhhcccccccceeeeeeeEEeceEEEEeccccccccceEEEEEecceEE Q lcl|NC_019506. 154 LDEKNVPTIGRFLIIPPDVHGLLLAAD--LIVGTGGAMAESITKNGFVGTILGFDVYLSNNMGSLTNGTGAIAGVKMACT 231 (276) Q Consensus 154 l~~~~vP~~~r~~vv~p~~~~~L~~~~--~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp~~~~~~~~~~~~~~a~~ 231 (276) |++++. .+|+++|||.++..|++++ +|+..... +.+.+++|.||+++||+|++|+.+|.. .++.++++|++ T Consensus 152 lgd~~~--~~~~ivv~p~~~~~L~k~~~~~fv~~s~~-g~~~~~~G~ig~~~G~~Vi~s~~~p~~----t~~l~~~gA~~ 224 (274) T protein:vir:12 152 FNDEDL--EPMVLFINPLDAGKLRGDASTNFTRATEL-GDDIIVKGAFGEALGAIIVRSNKLEAG----TAILAKKGAVK 224 (274) T ss_pred hccccc--cccEEEeCHHHHHHHHhhhhhhccccccc-cccceecccceeecCeeEEEeCCCCcc----eEEEEecccee Confidence 998763 7899999999999999985 66665544 457899999999999999999999853 46788999999 Q ss_pred ee-eeeeeeeeccCcccceeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 232 FA-EQIVQTEAYRMEKRFADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 232 ~~-~~~~~~e~~~~~~~~~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) +. ++...+|..|++++++|.++++++||++++||+++|+++.+.= T Consensus 225 ~~~~~~~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~ 270 (274) T protein:vir:12 225 LILKRDFFLEVARDASTKTTALYSDKHYVAYLYDESKAVKITKGSG 270 (274) T ss_pred eeecCCceeccccchhhcccEEEeeeEEEEEEEcCCceEEEEcCCc Confidence 86 5556899999999999999999999999999999999996665 No 26 >protein:vir:96123 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240078;genbank:gi:66395742;genbank:GeneID:5133103 Probab=100.00 E-value=9.9e-45 Score=261.83 Aligned_cols=260 Identities=17% Similarity=0.153 Sum_probs=224.7 Q ss_pred Cccc------hhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCcc-cceeecCCCCCCCccccccce Q lcl|NC_019506. 1 MAVT------SFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGAI-TVKEYTENSDIDAPEELSTTE 73 (276) Q Consensus 1 MA~~------~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~~-~~~d~~~~~~~~~~~~~~~~~ 73 (276) ||+. +++||+|+..+++.+++.+++.+++..+++.+ +.+|++|+||+++.. .+.++.+++.++. ++++.++ T Consensus 1 ma~~~T~~~d~i~Pev~s~~v~~~~~~~~~~~~~~~~~~~l~-g~~G~tv~ip~~~~~g~~~~~~~g~~i~~-~~it~~~ 78 (274) T protein:vir:96 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLV-GQPGDTLTFPAFTYSGDAQVIAEGEKIPV-DQIGTSK 78 (274) T ss_pred CCccccchhhhhhhHHHHHHHHHHHHhhhhhccccccccccc-CCCCCEEEEEeeccCCCccccCCCCcCch-hhcccce Confidence 9973 58999999999999999999999988876543 467999999999864 6888998888764 7899999 Q ss_pred EEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccCCHHHHHHHHHHHHHH Q lcl|NC_019506. 74 KVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSKLKPAATLDKTNIYEELIKVKVK 153 (276) Q Consensus 74 ~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~~~ 153 (276) .+++|++ +++.+.++|++..++..|++.+..+++++++++++|.++++.+..++.... + ....++.|.+|..+ T Consensus 79 ~~~~i~~-~~~~~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~~d~~i~~~l~~a~~~~~--~----~~~~~d~i~dA~~~ 151 (274) T protein:vir:96 79 REAKVRK-IGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATLTVE--A----DITKLDGLQTAIDK 151 (274) T ss_pred eEEEEEe-eeceeeecHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCcC--c----ccccHHHHHHHHHH Confidence 9999976 588999999999999999999999999999999999999999987654322 1 12237889999999 Q ss_pred HhhcCCCccCCEEEECHHHHHHHhhhH--HhhhhcccccccceeeeeeeEEeceEEEEeccccccccceEEEEEecceEE Q lcl|NC_019506. 154 LDEKNVPTIGRFLIIPPDVHGLLLAAD--LIVGTGGAMAESITKNGFVGTILGFDVYLSNNMGSLTNGTGAIAGVKMACT 231 (276) Q Consensus 154 l~~~~vP~~~r~~vv~p~~~~~L~~~~--~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp~~~~~~~~~~~~~~a~~ 231 (276) |+++++ .+|+++|||.++..|+++. +|...... +++.+++|.||+++||+|++|+++|.. .++.++++|++ T Consensus 152 l~d~~~--~~~~ivv~p~~~~~L~k~~~~~f~~~~~~-g~~~~~~g~ig~~~G~~Vi~s~~~p~~----t~~l~~~gA~~ 224 (274) T protein:vir:96 152 FNDEDL--EPMVLFVNPLDAGGLRTSASDNFTRPTQL-GDNIIVKGAFGEALGAVIVRSNKLNKG----EALLAKKGAVK 224 (274) T ss_pred hcccCC--CceEEEeCHHHHHHHHhcccccccccccc-cccceeecccceecCeeEEEcCCCCcc----eEEEEeCccee Confidence 998875 6799999999999999975 55555443 457899999999999999999999853 47889999999 Q ss_pred eeeee-eeeeeccCcccceeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 232 FAEQI-VQTEAYRMEKRFADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 232 ~~~~~-~~~e~~~~~~~~~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) +..+. ..+|.+|++++++|.|+++++||++++||+++|++++++- T Consensus 225 ~~~~~~~~vE~~Rd~~~~~d~i~~~~~yg~~~~~~~~vv~~t~~~~ 270 (274) T protein:vir:96 225 LITKRDFFLEKDRDASRKSTALYSDKHYVAYLYDESKVVKITKGAG 270 (274) T ss_pred eeecCCcccccccchhhcccEEEEeeEEEEEEEcCccEEEEEcCcc Confidence 98554 4799999999999999999999999999999999998888 No 27 >protein:vir:3613 Length: 272 # NCBI annotation: MHP # Family: family:all:522 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112699;genbank:gi:13786567;genbank:GeneID:921035 Probab=100.00 E-value=8.6e-45 Score=262.17 Aligned_cols=264 Identities=17% Similarity=0.146 Sum_probs=228.7 Q ss_pred Cccc------hhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCccc-ceeecCCCCCCCccccccce Q lcl|NC_019506. 1 MAVT------SFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGAIT-VKEYTENSDIDAPEELSTTE 73 (276) Q Consensus 1 MA~~------~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~~~-~~d~~~~~~~~~~~~~~~~~ 73 (276) |||+ +++||+|+.++.+.+.+.+++.+++..+.+-+ +..|+||+||+++.++ +.++.++..++ ++.++.++ T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~~~~~~~~~~~~~l~-g~~G~ti~iP~~~~~gda~~~~eg~~i~-~~~lt~~~ 78 (272) T protein:vir:36 1 MSKQKTTLADLVNPEVLAPIVSYELNKALRFAPLAQVDTTLQ-GQPGNTLKFPAFTYIGDAADVAEGGEIS-LDKIGTTT 78 (272) T ss_pred CCCcceehhhhhchHHHHHHHHHHHHhhhhhccccccccccc-cCCCCEEEEeeeccCccccccCCCCccC-hhhcCCcc Confidence 9973 58899999999999999999999987765543 4579999999998875 66788888776 47899999 Q ss_pred EEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccCCHHHHHHHHHHHHHH Q lcl|NC_019506. 74 KVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSKLKPAATLDKTNIYEELIKVKVK 153 (276) Q Consensus 74 ~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~~~ 153 (276) .++++++ +++++.++|++..++..|++.+..++++.++++++|+++++.+....... +....++.|.+|... T Consensus 79 ~~~~i~~-~~k~~~vtD~~~~~~~~d~~~~~~~~~a~~~a~~~d~~i~~~l~~~~~~~-------~~~~~~d~i~~A~~~ 150 (272) T protein:vir:36 79 KSVTIKK-AAKGTEITDEAALSGYGDPIGESNKQLGLSLANKVDDDLLSAAKTTSQTV-------STKANVDGVQAALDI 150 (272) T ss_pred eeEeeeh-hhccccccHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhccccccc-------cccccHHHHHHHHHH Confidence 9999966 57899999999999999999999999999999999999999887654332 222347889999999 Q ss_pred HhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEeccccccccceEEEEEecceEEee Q lcl|NC_019506. 154 LDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSNNMGSLTNGTGAIAGVKMACTFA 233 (276) Q Consensus 154 l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp~~~~~~~~~~~~~~a~~~~ 233 (276) |++.+.+ .|+++|||..+..|+++..+.......+++.+++|.||+++|++|++|+++|..+.....+.+.++|+++. T Consensus 151 lgd~~~~--~~~ivv~p~~~~~L~k~~~~~~~~~~~~~~~~~~G~ig~~~G~~Vv~s~~~p~~~~~~~~~~~~~gA~~~~ 228 (272) T protein:vir:36 151 FNDEDAQ--AYVLIVNPKDAAKIRKDANAKNIGSEVGANALINGTYADVLGAQIVRSKKLAEGSALMFKIVSNSPALKLV 228 (272) T ss_pred hhhcCCC--ceEEEEcHHHHHHHhcccccccccccccccceeeeccceecCeeEEEeCCCCCCceeEEEEEecccceeee Confidence 9999874 68999999999999999888877667778899999999999999999999998776677788999999876 Q ss_pred -eeeeeeeeccCcccceeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 234 -EQIVQTEAYRMEKRFADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 234 -~~~~~~e~~~~~~~~~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) ++...+|..|++++++|.++++.+||+++++|+++|+++.+-= T Consensus 229 ~~~~~~vE~~R~~~~~~d~i~~~~~y~~~v~~~~~vv~~t~~g~ 272 (272) T protein:vir:36 229 LKRGVQVETDRDIVTKTTVITADEHYAAYLYDLTKVVNITFTGV 272 (272) T ss_pred ecCCcccccccchhhcCcEEEEEEEEEEEEEcCccEEEEeecCC Confidence 5555899999999999999999999999999999999975555 No 28 >protein:vir:97433 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240749;genbank:gi:66396420;genbank:GeneID:5133789 Probab=100.00 E-value=1.8e-44 Score=260.38 Aligned_cols=260 Identities=17% Similarity=0.122 Sum_probs=225.2 Q ss_pred Cccch------hhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCcc-cceeecCCCCCCCccccccce Q lcl|NC_019506. 1 MAVTS------FIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGAI-TVKEYTENSDIDAPEELSTTE 73 (276) Q Consensus 1 MA~~~------l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~~-~~~d~~~~~~~~~~~~~~~~~ 73 (276) |||.. ++||+|++++++++++.++|.+++.++++.+ +.+|+||+||+++.. .+.++.+++.++ ++.++.++ T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~-g~~G~tv~iP~~~~~g~a~~~~~g~~i~-~~~lt~~~ 78 (274) T protein:vir:97 1 MPQGLTKTSDQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQ-GQPGDTLTFPAFVYSGDAQVVAEGEKIP-TDILETKK 78 (274) T ss_pred CCccceehhheechHHHHHHHHHhhhhhhhhcccceeccccc-CCCCCEEEEeeecCCCccccccCCCccc-ccccccce Confidence 99964 8999999999999999999999998887654 467999999999865 578899888876 47899999 Q ss_pred EEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccCCHHHHHHHHHHHHHH Q lcl|NC_019506. 74 KVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSKLKPAATLDKTNIYEELIKVKVK 153 (276) Q Consensus 74 ~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~~~ 153 (276) .++++++ .++++.++|++..++..|++.+..+++++++++++|+++++.+..+.....+ .. ..++.|.+|..+ T Consensus 79 ~~~~i~~-~~~~~~i~D~~~~~~~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a~~~~~~--~~----~~~d~i~dA~~~ 151 (274) T protein:vir:97 79 REAKIRK-IAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVNA--DI----TKLNGLQSAIDK 151 (274) T ss_pred eEEEeee-ecceecccHHHHHhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccCccccc--cc----cCHHHHHHHHHH Confidence 9999976 5789999999999999999999999999999999999999999887654322 12 237889999999 Q ss_pred HhhcCCCccCCEEEECHHHHHHHhhhH--HhhhhcccccccceeeeeeeEEeceEEEEeccccccccceEEEEEecceEE Q lcl|NC_019506. 154 LDEKNVPTIGRFLIIPPDVHGLLLAAD--LIVGTGGAMAESITKNGFVGTILGFDVYLSNNMGSLTNGTGAIAGVKMACT 231 (276) Q Consensus 154 l~~~~vP~~~r~~vv~p~~~~~L~~~~--~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp~~~~~~~~~~~~~~a~~ 231 (276) |++.+. .+|+++|||.++..|+++. +|++.... ++..+++|.||+++||+|++|+++|. +.++.++++|++ T Consensus 152 l~d~~~--~~~~ivv~p~~~~~L~k~~~~~f~~~s~~-g~~~~~~G~ig~~~G~~Vi~s~~~p~----~t~~l~~~gA~~ 224 (274) T protein:vir:97 152 FNDEDL--EPMVLFVNPLDAGKLRGDASTNFTRATEL-GDDIIVKGAFGEALGAIIVRTNKLEA----GTAILAKKGAVK 224 (274) T ss_pred hhccCC--CceEEEeCHHHHHHHHhhhhhhccccCcc-cccceeccccceecCeeEEEcCCCCc----ceEEEEeCcceE Confidence 998875 6799999999999999985 66665554 45688999999999999999999985 347889999999 Q ss_pred ee-eeeeeeeeccCcccceeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 232 FA-EQIVQTEAYRMEKRFADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 232 ~~-~~~~~~e~~~~~~~~~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) +. ++...+|..|+++.++|.++++.|||+++++|+++++++.+.= T Consensus 225 ~~~~~~~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~ 270 (274) T protein:vir:97 225 LILKRDFFLEVARDASTKTTALYSDKHYVAYLYDESKAVKITKGSG 270 (274) T ss_pred eeecCCceeccccchhhcccEEEEEEEEEEEEEcCCceEEEecCcc Confidence 87 4555899999999999999999999999999999999997666 No 29 >protein:vir:94494 Length: 274 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240676;genbank:gi:66396348;genbank:GeneID:5133758 Probab=100.00 E-value=1.8e-44 Score=260.38 Aligned_cols=260 Identities=17% Similarity=0.122 Sum_probs=225.2 Q ss_pred Cccch------hhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCcc-cceeecCCCCCCCccccccce Q lcl|NC_019506. 1 MAVTS------FIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGAI-TVKEYTENSDIDAPEELSTTE 73 (276) Q Consensus 1 MA~~~------l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~~-~~~d~~~~~~~~~~~~~~~~~ 73 (276) |||.. ++||+|++++++++++.++|.+++.++++.+ +.+|+||+||+++.. .+.++.+++.++ ++.++.++ T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~-g~~G~tv~iP~~~~~g~a~~~~~g~~i~-~~~lt~~~ 78 (274) T protein:vir:94 1 MPQGLTKTSDQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQ-GQPGDTLTFPAFVYSGDAQVVAEGEKIP-TDILETKK 78 (274) T ss_pred CCccceehhheechHHHHHHHHHhhhhhhhhcccceeccccc-CCCCCEEEEeeecCCCccccccCCCccc-ccccccce Confidence 99964 8999999999999999999999998887654 467999999999865 578899888876 47899999 Q ss_pred EEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccCCHHHHHHHHHHHHHH Q lcl|NC_019506. 74 KVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSKLKPAATLDKTNIYEELIKVKVK 153 (276) Q Consensus 74 ~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~~~ 153 (276) .++++++ .++++.++|++..++..|++.+..+++++++++++|+++++.+..+.....+ .. ..++.|.+|..+ T Consensus 79 ~~~~i~~-~~~~~~i~D~~~~~~~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a~~~~~~--~~----~~~d~i~dA~~~ 151 (274) T protein:vir:94 79 REAKIRK-IAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVNA--DI----TKLNGLQSAIDK 151 (274) T ss_pred eEEEeee-ecceecccHHHHHhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccCccccc--cc----cCHHHHHHHHHH Confidence 9999976 5789999999999999999999999999999999999999999887654322 12 237889999999 Q ss_pred HhhcCCCccCCEEEECHHHHHHHhhhH--HhhhhcccccccceeeeeeeEEeceEEEEeccccccccceEEEEEecceEE Q lcl|NC_019506. 154 LDEKNVPTIGRFLIIPPDVHGLLLAAD--LIVGTGGAMAESITKNGFVGTILGFDVYLSNNMGSLTNGTGAIAGVKMACT 231 (276) Q Consensus 154 l~~~~vP~~~r~~vv~p~~~~~L~~~~--~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp~~~~~~~~~~~~~~a~~ 231 (276) |++.+. .+|+++|||.++..|+++. +|++.... ++..+++|.||+++||+|++|+++|. +.++.++++|++ T Consensus 152 l~d~~~--~~~~ivv~p~~~~~L~k~~~~~f~~~s~~-g~~~~~~G~ig~~~G~~Vi~s~~~p~----~t~~l~~~gA~~ 224 (274) T protein:vir:94 152 FNDEDL--EPMVLFVNPLDAGKLRGDASTNFTRATEL-GDDIIVKGAFGEALGAIIVRTNKLEA----GTAILAKKGAVK 224 (274) T ss_pred hhccCC--CceEEEeCHHHHHHHHhhhhhhccccCcc-cccceeccccceecCeeEEEcCCCCc----ceEEEEeCcceE Confidence 998875 6799999999999999985 66665554 45688999999999999999999985 347889999999 Q ss_pred ee-eeeeeeeeccCcccceeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 232 FA-EQIVQTEAYRMEKRFADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 232 ~~-~~~~~~e~~~~~~~~~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) +. ++...+|..|+++.++|.++++.|||+++++|+++++++.+.= T Consensus 225 ~~~~~~~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~ 270 (274) T protein:vir:94 225 LILKRDFFLEVARDASTKTTALYSDKHYVAYLYDESKAVKITKGSG 270 (274) T ss_pred eeecCCceeccccchhhcccEEEEEEEEEEEEEcCCceEEEecCcc Confidence 87 4555899999999999999999999999999999999997666 No 30 >protein:vir:99675 Length: 324 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249589;genbank:gi:68299740;genbank:GeneID:3799990 Probab=100.00 E-value=1e-45 Score=267.21 Aligned_cols=242 Identities=18% Similarity=0.198 Sum_probs=200.2 Q ss_pred hhccccccccccCCcEEEEeccCcccceeecCCCCCCC-ccccccceEEEEEEeeeecceeechHHHHhhhhhHHHHHHH Q lcl|NC_019506. 28 LVNRDYEGEIKAYGDTVKINQIGAITVKEYTENSDIDA-PEELSTTEKVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQ 106 (276) Q Consensus 28 ~~~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~-~~~~~~~~~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~ 106 (276) ++ | ...+|++++||.+|..++.++++|+.+.. ++++...+..++||+.+++++.|+|.|+.++++|++.++.+ T Consensus 1 ~v-r-----~i~~g~s~~~~~iG~~~~~~~~~G~~l~~~~~~~~~~e~~itID~~l~~~~~VdDiD~~qa~~Dlr~e~s~ 74 (324) T protein:vir:99 1 MT-R-----TITSGKSAQFPVMGRTKARYLKQGQSLDDGREDIKHTEKVITIDGLLTTDVLIYDIEDAMNHYDVRSEYST 74 (324) T ss_pred Ce-e-----eeecCceEEEeeeeeeEeccccCCCCcCCCcCCcCcccEEEEecchhhhhhhhhhHHHHhcCccchhHHHH Confidence 22 2 23569999999999999999999999854 46788899999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHhhcccc--------cc-------------ccccccCCHHHHHHHHHHHHHHHhhcCCCccCCE Q lcl|NC_019506. 107 RAAYALADETEKILLKEMDTNAT--------SK-------------LKPAATLDKTNIYEELIKVKVKLDEKNVPTIGRF 165 (276) Q Consensus 107 ~~~~ala~~~d~~~~~~~~~~~~--------~~-------------~~~~~~~t~~~~~~~i~~a~~~l~~~~vP~~~r~ 165 (276) +++++||+.+|+.++..+..... +. .....+.++..++++|.+|++.|++++||.++|| T Consensus 75 ~~G~aLA~~~Dq~i~~~~a~~~~~~a~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~dai~~a~~~Lde~~VP~~gR~ 154 (324) T protein:vir:99 75 QMGEALAMAADVANYAEMAKLVNSRKETTNENIEGLGAASLVKITGKKEDPAKYGTQVIQALTYARAAFAKKYIPAGDRT 154 (324) T ss_pred HHHHHHHHHHHHHHHHHHHHhhhcccccccCCcccCCccceecccccccccccCHHHHHHHHHHHHHHHhhcCCCCCCCE Confidence 99999999999999877542110 00 0111234566789999999999999999999999 Q ss_pred EEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEeccccccc---------------------------- Q lcl|NC_019506. 166 LIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSNNMGSLT---------------------------- 217 (276) Q Consensus 166 ~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp~~~---------------------------- 217 (276) +||+|++|+.|+.++.+. +..+++.+.+++|.|++++||+||+|+++|... T Consensus 155 ~vv~P~~y~~Ll~~~~~~-~~~~~~~~~~~~G~V~~i~Gf~V~~Sn~lp~~~~t~~~~a~~~~~~~~~~~~~~~~~~ky~ 233 (324) T protein:vir:99 155 FYTDPDTYSAILAALMPN-AANYAALIDPETGNIRNVMGFEVVETPHMTAQMVTNPTDAFDGTGHIFPATGDSTTTGKMT 233 (324) T ss_pred EEeChHHHHHHhhccccc-ccccccccceecceEEEEeceEEEecCCccccccccccccccccccccccccccccccccc Confidence 999999999888765544 445567778999999999999999999999631 Q ss_pred ---cceEEEEEecceEEeeeeee-eeeeccCcccceeeEEeeeeeeeEEEcCCeEEEEE--ecC-----C Q lcl|NC_019506. 218 ---NGTGAIAGVKMACTFAEQIV-QTEAYRMEKRFADAVKGLNVFGCKVIYPDALVCLK--KTN-----P 276 (276) Q Consensus 218 ---~~~~~~~~~~~a~~~~~~~~-~~e~~~~~~~~~~~i~~~~~yg~~v~~~~~vv~~~--~~~-----p 276 (276) .+..++.+|+++++.++... ++|.+|++++|+|+|.+++.||++++|||++++++ ..+ | T Consensus 234 ~d~~~~~gl~~~~~a~~tv~~~~~~~e~~~~~~~~~d~i~~~~a~G~~~lRPe~a~~v~l~~~~~~~~~~ 303 (324) T protein:vir:99 234 VGADNVVGLFVHRSAVATLKLKDMALERARRPEYQADQIIAKYAMGHGGLRPEAVGAIIFEDGETPAVAP 303 (324) T ss_pred cccCceeEEEEehhheEEEeeecceecceechhhHHHhhhhhhhhcCcccccceEEEEEEccCccccccc Confidence 12334778999998887766 79999999999999999999999999999997665 322 3 No 31 >protein:vir:96262 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240311;genbank:gi:66395978;genbank:GeneID:5133339 Probab=100.00 E-value=2e-44 Score=260.14 Aligned_cols=260 Identities=17% Similarity=0.132 Sum_probs=221.8 Q ss_pred Cccc------hhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCcc-cceeecCCCCCCCccccccce Q lcl|NC_019506. 1 MAVT------SFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGAI-TVKEYTENSDIDAPEELSTTE 73 (276) Q Consensus 1 MA~~------~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~~-~~~d~~~~~~~~~~~~~~~~~ 73 (276) ||+. +++||+|++++++.+.+.++|.+++..+.+-+ +.+|+||+||++... .+.++.+++.++. +.++.++ T Consensus 1 m~~~~T~l~d~i~Pev~~~~v~~~~~~~l~~~~~~~~~~~l~-g~~G~tv~iP~~~~ig~a~~~~~g~~i~~-~~lt~~~ 78 (274) T protein:vir:96 1 MAQGMTKLTNQIVPEVLAPMMQAELEKKLRFASFAEIDNTLV-GQPGDTLTFPAFIYSGDAKVVAEGEKIPT-DILETKK 78 (274) T ss_pred CCcceeehhheechHHHHHHHHHHHHhhhhccccceeccccc-CCCCCEEEeeeecCCCccccccCCCccch-hhcccce Confidence 9984 58899999999999999999999964432221 346999999999875 5788988888764 7899999 Q ss_pred EEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccCCHHHHHHHHHHHHHH Q lcl|NC_019506. 74 KVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSKLKPAATLDKTNIYEELIKVKVK 153 (276) Q Consensus 74 ~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~~~ 153 (276) .+++|++ +++++.++|++..++..|++++..++++.++|+++|+++++.+..+...... .+ ..++.|.+|..+ T Consensus 79 ~~~~i~~-~~~a~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~~~~~~--~~----~~~d~i~~A~~~ 151 (274) T protein:vir:96 79 REAKIRK-IAKGTSISDEALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSAKLTVEA--DI----TKLTGLQTAIDK 151 (274) T ss_pred eEEEeee-eecceeehHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccc--cc----cCHHHHHHHHHH Confidence 9999976 6899999999999999999999999999999999999999999876654322 11 237889999999 Q ss_pred HhhcCCCccCCEEEECHHHHHHHhhhH--HhhhhcccccccceeeeeeeEEeceEEEEeccccccccceEEEEEecceEE Q lcl|NC_019506. 154 LDEKNVPTIGRFLIIPPDVHGLLLAAD--LIVGTGGAMAESITKNGFVGTILGFDVYLSNNMGSLTNGTGAIAGVKMACT 231 (276) Q Consensus 154 l~~~~vP~~~r~~vv~p~~~~~L~~~~--~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp~~~~~~~~~~~~~~a~~ 231 (276) |++.+. .+|+++|||.+++.|++++ +|+..... +.+.+++|.||+++||+|++|+++|. +.++.++++|++ T Consensus 152 lgd~~~--~~~~ivv~p~~~~~L~k~~~~~f~~~s~~-g~~~~~~G~ig~~~G~~Vi~s~~~~~----~t~~l~~~gA~~ 224 (274) T protein:vir:96 152 FNDEDL--EPMVLFISPLDAGKLRGDATTNFTRATEL-GDDVIVKGAFGEALGAVIVRSNKLEA----GTAILAKKGAVK 224 (274) T ss_pred hccccc--cccEEEeCHHHHHHHHhhccccccccccc-cccceeccccceecCeEEEEeCCCCC----ceEEEEecccee Confidence 998764 6899999999999999986 56655444 45789999999999999999999984 346788899999 Q ss_pred ee-eeeeeeeeccCcccceeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 232 FA-EQIVQTEAYRMEKRFADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 232 ~~-~~~~~~e~~~~~~~~~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) +. ++...+|..|++++++|.++++++||++++||+++|++++..= T Consensus 225 ~~~~~~~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~~v~~tk~~~ 270 (274) T protein:vir:96 225 LITKRDFFLETDRDPSTKTTALYSDKHYVAYLYDESKAVKITKGSG 270 (274) T ss_pred eeecCCcccccccccccccCEEEEeEEEEEEEEcCCcEEEEEcCCc Confidence 86 5666899999999999999999999999999999999997776 No 32 >protein:vir:95898 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240385;genbank:gi:66396054;genbank:GeneID:5133409 Probab=100.00 E-value=2e-44 Score=260.14 Aligned_cols=260 Identities=17% Similarity=0.132 Sum_probs=221.8 Q ss_pred Cccc------hhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCcc-cceeecCCCCCCCccccccce Q lcl|NC_019506. 1 MAVT------SFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGAI-TVKEYTENSDIDAPEELSTTE 73 (276) Q Consensus 1 MA~~------~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~~-~~~d~~~~~~~~~~~~~~~~~ 73 (276) ||+. +++||+|++++++.+.+.++|.+++..+.+-+ +.+|+||+||++... .+.++.+++.++. +.++.++ T Consensus 1 m~~~~T~l~d~i~Pev~~~~v~~~~~~~l~~~~~~~~~~~l~-g~~G~tv~iP~~~~ig~a~~~~~g~~i~~-~~lt~~~ 78 (274) T protein:vir:95 1 MAQGMTKLTNQIVPEVLAPMMQAELEKKLRFASFAEIDNTLV-GQPGDTLTFPAFIYSGDAKVVAEGEKIPT-DILETKK 78 (274) T ss_pred CCcceeehhheechHHHHHHHHHHHHhhhhccccceeccccc-CCCCCEEEeeeecCCCccccccCCCccch-hhcccce Confidence 9984 58899999999999999999999964432221 346999999999875 5788988888764 7899999 Q ss_pred EEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccCCHHHHHHHHHHHHHH Q lcl|NC_019506. 74 KVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSKLKPAATLDKTNIYEELIKVKVK 153 (276) Q Consensus 74 ~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~~~ 153 (276) .+++|++ +++++.++|++..++..|++++..++++.++|+++|+++++.+..+...... .+ ..++.|.+|..+ T Consensus 79 ~~~~i~~-~~~a~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~~~~~~--~~----~~~d~i~~A~~~ 151 (274) T protein:vir:95 79 REAKIRK-IAKGTSISDEALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSAKLTVEA--DI----TKLTGLQTAIDK 151 (274) T ss_pred eEEEeee-eecceeehHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccc--cc----cCHHHHHHHHHH Confidence 9999976 6899999999999999999999999999999999999999999876654322 11 237889999999 Q ss_pred HhhcCCCccCCEEEECHHHHHHHhhhH--HhhhhcccccccceeeeeeeEEeceEEEEeccccccccceEEEEEecceEE Q lcl|NC_019506. 154 LDEKNVPTIGRFLIIPPDVHGLLLAAD--LIVGTGGAMAESITKNGFVGTILGFDVYLSNNMGSLTNGTGAIAGVKMACT 231 (276) Q Consensus 154 l~~~~vP~~~r~~vv~p~~~~~L~~~~--~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp~~~~~~~~~~~~~~a~~ 231 (276) |++.+. .+|+++|||.+++.|++++ +|+..... +.+.+++|.||+++||+|++|+++|. +.++.++++|++ T Consensus 152 lgd~~~--~~~~ivv~p~~~~~L~k~~~~~f~~~s~~-g~~~~~~G~ig~~~G~~Vi~s~~~~~----~t~~l~~~gA~~ 224 (274) T protein:vir:95 152 FNDEDL--EPMVLFISPLDAGKLRGDATTNFTRATEL-GDDVIVKGAFGEALGAVIVRSNKLEA----GTAILAKKGAVK 224 (274) T ss_pred hccccc--cccEEEeCHHHHHHHHhhccccccccccc-cccceeccccceecCeEEEEeCCCCC----ceEEEEecccee Confidence 998764 6899999999999999986 56655444 45789999999999999999999984 346788899999 Q ss_pred ee-eeeeeeeeccCcccceeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 232 FA-EQIVQTEAYRMEKRFADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 232 ~~-~~~~~~e~~~~~~~~~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) +. ++...+|..|++++++|.++++++||++++||+++|++++..= T Consensus 225 ~~~~~~~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~~v~~tk~~~ 270 (274) T protein:vir:95 225 LITKRDFFLETDRDPSTKTTALYSDKHYVAYLYDESKAVKITKGSG 270 (274) T ss_pred eeecCCcccccccccccccCEEEEeEEEEEEEEcCCcEEEEEcCCc Confidence 86 5666899999999999999999999999999999999997776 No 33 >protein:vir:93742 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240459;genbank:gi:66396126;genbank:GeneID:5133511 Probab=100.00 E-value=3.2e-44 Score=259.05 Aligned_cols=260 Identities=17% Similarity=0.119 Sum_probs=224.3 Q ss_pred Cccch------hhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCcc-cceeecCCCCCCCccccccce Q lcl|NC_019506. 1 MAVTS------FIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGAI-TVKEYTENSDIDAPEELSTTE 73 (276) Q Consensus 1 MA~~~------l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~~-~~~d~~~~~~~~~~~~~~~~~ 73 (276) |||+. ++||+|++++++.+++.+++.+++.++++.+ +.+|+||+||++... .+.++.++..++ ++.++.++ T Consensus 1 ma~~~T~~~~~iiPev~~~~v~~~~~~~~~~~~~~~~~~~l~-g~~G~tv~ip~~~~~g~~~~~~eg~~i~-~~~it~~~ 78 (274) T protein:vir:93 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQ-GQPGDTLTFPAFVYSGDAQVVAEGEKIP-TDILETKK 78 (274) T ss_pred CCccceehhheechHHHHHHHHHHHHhhhhhccccccccccc-CCCCCEEEEEeeccCCCcccccCCCccc-ccccccce Confidence 99974 8999999999999999999999998877654 467999999999864 688899888876 47899999 Q ss_pred EEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccCCHHHHHHHHHHHHHH Q lcl|NC_019506. 74 KVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSKLKPAATLDKTNIYEELIKVKVK 153 (276) Q Consensus 74 ~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~~~ 153 (276) .++++++ .++.+.++|++..++..|++.+..+++++++++++|+++++.+..+.....+ ....++.|.+|..+ T Consensus 79 ~~~~i~~-~~~~~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~~d~~~~~~~~~a~~~~~~------~~~~~d~i~dA~~~ 151 (274) T protein:vir:93 79 REAKIRK-IAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVNA------DITKLNGLQSAIDK 151 (274) T ss_pred eEEEeee-ecccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhcccccccc------cccCHHHHHHHHHH Confidence 9999976 5789999999999999999999999999999999999999999876644322 11237889999999 Q ss_pred HhhcCCCccCCEEEECHHHHHHHhhhH--HhhhhcccccccceeeeeeeEEeceEEEEeccccccccceEEEEEecceEE Q lcl|NC_019506. 154 LDEKNVPTIGRFLIIPPDVHGLLLAAD--LIVGTGGAMAESITKNGFVGTILGFDVYLSNNMGSLTNGTGAIAGVKMACT 231 (276) Q Consensus 154 l~~~~vP~~~r~~vv~p~~~~~L~~~~--~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp~~~~~~~~~~~~~~a~~ 231 (276) |++.+. .+|+++|||.++..|+++. .|...... ++..+++|.||+++||+|++|+++|. +.+++++++|++ T Consensus 152 l~d~~~--~~~~ivv~p~~~~~L~k~~~~~f~~~s~~-g~~~~~~G~ig~~~G~~Vi~s~~~p~----~t~~l~~~gai~ 224 (274) T protein:vir:93 152 FNDEDL--EPMVLFINPLDAGKLRGDASTNFTRATEL-GDDIIVKGAFGEALGAIIVRTNKLEA----GTAILAKKGAVK 224 (274) T ss_pred hhhccC--CccEEEeCHHHHHHHHhhhhhcccccccc-cccceeecccceecCeeEEEcCCCCc----ceEEEEeCCeEE Confidence 998875 6799999999999999885 55555443 45678999999999999999999985 347899999999 Q ss_pred eee-eeeeeeeccCcccceeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 232 FAE-QIVQTEAYRMEKRFADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 232 ~~~-~~~~~e~~~~~~~~~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) +.. +...+|..|++++++|.++++.+||+++++|+++++++.++= T Consensus 225 ~~~~~~~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~~v~~t~~~~ 270 (274) T protein:vir:93 225 LILKRDFFLEVARDASTKTTALYSDKHYVAYLYDESKAVKITKGSG 270 (274) T ss_pred EEecCCcccccccchhhcccEEEEEEEEEEEEEcCCceEEEeeCcc Confidence 984 445899999999999999999999999999999999986666 No 34 >protein:vir:6324 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877471;genbank:gi:33300843;uniprot:Q7Y2D3;genbank:GeneID:1482613 Probab=100.00 E-value=1.8e-44 Score=260.39 Aligned_cols=270 Identities=13% Similarity=0.123 Sum_probs=226.7 Q ss_pred Cccc----------------hhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCcccceeecCCCCCC Q lcl|NC_019506. 1 MAVT----------------SFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGAITVKEYTENSDID 64 (276) Q Consensus 1 MA~~----------------~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~ 64 (276) |.|- +++ |+|+.++++.|.+.++|.+++... ...+|++++||.+|..++.++++|+.+. T Consensus 1 ms~~~~~tr~~~~~s~~d~al~l-e~f~geV~~af~~~s~~~~~~~~r----ti~~g~s~~~~~iG~~~~~~~~pG~~l~ 75 (335) T protein:vir:63 1 MSFLNDLTRPNYAGKNADVDIHL-EEHLGIVDKHFAYTSKFAPLMNIR----DLRGSNVVRLDRLGNVEAKGRRAGEELE 75 (335) T ss_pred CCCcccchhhhcccccchhheeh-hhhhhhHHHHHHhhhhhcccccee----eeccceeEEEeeeeeeeeecccCCcCcC Confidence 6542 233 999999999999999999987642 2467999999999999999999999987 Q ss_pred CccccccceEEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc-----------c- Q lcl|NC_019506. 65 APEELSTTEKVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSK-----------L- 132 (276) Q Consensus 65 ~~~~~~~~~~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~-----------~- 132 (276) . +.+...+..++||+.++....|+|.|+.++++|++.++.++++++||+..|+.++..+..++... + T Consensus 76 ~-~~~~~~k~~itVD~ll~a~~~I~dlDe~~~~yDvRse~s~e~G~aLA~~~D~~~~~~i~~aa~~~a~~~~~~~~~~G~ 154 (335) T protein:vir:63 76 R-SRVVNDKWNLTVDTLLYLRHQFDHQDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVDLEDAFSPGV 154 (335) T ss_pred C-CCccccceEEEecceeechhhhhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccCCCcCCCc Confidence 6 56777899999999999999999999999999999999999999999999999886655433210 0 Q ss_pred -------cccccCCHHHHHHHHHHHHHHHhhcCCCcc---CCEEEECHHHHHHHhhhHHhhhhcccc--cccceeeeeee Q lcl|NC_019506. 133 -------KPAATLDKTNIYEELIKVKVKLDEKNVPTI---GRFLIIPPDVHGLLLAADLIVGTGGAM--AESITKNGFVG 200 (276) Q Consensus 133 -------~~~~~~t~~~~~~~i~~a~~~l~~~~vP~~---~r~~vv~p~~~~~L~~~~~~~~~~~~~--~~~~~~~G~i~ 200 (276) +.....++..+++++.+|.+.|++++||++ +|+++|+|++|+.|+.+++|.+.++.. +...+.+|.|+ T Consensus 155 ~~~~~~tg~~~~~~~~~l~~a~~~a~~~L~e~dVP~~~~~dr~~vv~P~~y~~Ll~~~~l~n~~~~~s~~~~~~~~g~v~ 234 (335) T protein:vir:63 155 LEKLDLTGLTAKQAADKIVRMHRRVVETFIDRDLGDAVYSEGLTPMSPRVFSLLLEHDKLMNVEYQATGATNDYVKSRVA 234 (335) T ss_pred ceeeeeccCcccccHHHHHHHHHHHHHHHHhccCCCcccCceEEEeChHHHHHHhccccccccccccccccccccCceeE Confidence 111112466677889999999999999964 599999999999999999999987543 34568999999 Q ss_pred EEeceEEEEeccccccccc-----------------eEEEEEecceEEeeeeee-eeeeccCcccceeeEEeeeeeeeEE Q lcl|NC_019506. 201 TILGFDVYLSNNMGSLTNG-----------------TGAIAGVKMACTFAEQIV-QTEAYRMEKRFADAVKGLNVFGCKV 262 (276) Q Consensus 201 ~~~G~~v~~s~~lp~~~~~-----------------~~~~~~~~~a~~~~~~~~-~~e~~~~~~~~~~~i~~~~~yg~~v 262 (276) +++||+|++|+++|..+.. ..+.++|+.|++.++... ..|.++++++|+|+|.+++.||+++ T Consensus 235 ~v~Gv~V~~sn~lP~~~~t~~~lg~a~n~~~~d~~~~~~~~~~~~Al~t~~~~~vt~e~~~~~~~~~~~i~~~~a~G~g~ 314 (335) T protein:vir:63 235 ILNGVKVLETPRFATKAIAAHPLGRHFNVSAEESERQIALFLPSKTLITAQVAPVQAKLWEDNEKFSWVLDTFQMYNIGA 314 (335) T ss_pred EeeceEEEeeccCCCCCcccccccccCCccccccceeEEEEEecceEEEEEEeecccceeeccchhhHHhHHHHHcCCcc Confidence 9999999999999954322 246789999999998775 6788999999999999999999999 Q ss_pred EcCCeEEEEEec-CC Q lcl|NC_019506. 263 IYPDALVCLKKT-NP 276 (276) Q Consensus 263 ~~~~~vv~~~~~-~p 276 (276) +||+++++++.+ .| T Consensus 315 lRPe~a~~i~~tg~~ 329 (335) T protein:vir:63 315 RRPDTAGAIELKGIG 329 (335) T ss_pred cccceEEEEEEcCCC Confidence 999999999854 44 No 35 >protein:vir:97331 Length: 319 # NCBI annotation: ORF011 # Family: family:all:701 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240611;genbank:gi:66396278;genbank:GeneID:5133687 Probab=100.00 E-value=2.8e-43 Score=253.84 Aligned_cols=266 Identities=16% Similarity=0.166 Sum_probs=224.8 Q ss_pred CccchhhHHHHHHHHHHHHHHhhcchh-hhccccccccccCCcEEEEeccCcccceeecCCCCCCCccccccceEEEEEE Q lcl|NC_019506. 1 MAVTSFIPKLWSARLLAHLDKAHVVAN-LVNRDYEGEIKAYGDTVKINQIGAITVKEYTENSDIDAPEELSTTEKVLEIN 79 (276) Q Consensus 1 MA~~~l~~e~~~~~~~~~l~~~~v~~~-~~~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~ld 79 (276) =+|++...|.|++.+.+.+...+.... .+|++++. .+|++|+||+++..++.||+|++++.. ++++.++.+++|+ T Consensus 25 ~~nt~~l~~k~~~~LD~~~~~~~~s~~~~~N~~~e~---~gg~tVkIp~i~~~gl~DY~R~~g~~~-g~vt~~~~t~tid 100 (319) T protein:vir:97 25 EPGQTLLKNKHVGILERVTAVNAYSTPALISNDAIF---MEGRSFTVMKGDTTELKDYKRNATNEF-DHPKIEETTYFLD 100 (319) T ss_pred CcchHHHHHHHHHHHHHHHHHhhhhhhcccCcceEe---ccCcEEEEeeecccccccccCCCCccc-CCcccceeEEEee Confidence 456677889999999888877776654 46777765 469999999999999999999988765 7899999999999 Q ss_pred eeeecceeechHHHHhhhhhH--HHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccCCHHHHHHHHHHHHHHHhhc Q lcl|NC_019506. 80 KQKYFNFQIDDVDAAQIRTPL--MDAAMQRAAYALADETEKILLKEMDTNATSKLKPAATLDKTNIYEELIKVKVKLDEK 157 (276) Q Consensus 80 ~~~~~~~~v~d~d~~~~~~d~--~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~~~l~~~ 157 (276) +.+++.|.|++.|..++...+ .....+++...++.++|.+.++.+.+.+....+ .+.+..++|+.|.++..+|+++ T Consensus 101 qdR~~~F~VD~~D~~Etn~~l~a~~i~~~~~~~~v~PEiDay~~skla~~a~~~~~--~~~t~~n~y~~i~~a~~~Lde~ 178 (319) T protein:vir:97 101 QEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAKHLT--VGTGSDAQYDAVLDVSVELDEI 178 (319) T ss_pred cccccccccchhhHhhhhchhhHHHHHHHHHHHHhhhhhhHHHHHHHHhhcccccc--cccCHHHHHHHHHHHHHHHHhc Confidence 999999999999999987765 344566777889999999999999877655443 3468889999999999999999 Q ss_pred CCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEeccccccccceEEEEEecceEEeeeeee Q lcl|NC_019506. 158 NVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSNNMGSLTNGTGAIAGVKMACTFAEQIV 237 (276) Q Consensus 158 ~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp~~~~~~~~~~~~~~a~~~~~~~~ 237 (276) +|| ++|||+|+|+++..|++++.|......+ +..+++|.|++++||+|+++++.. ..+..++++|++|+.++.+.. T Consensus 179 ~VP-~~Rvl~Vtp~~~~~L~~~~~f~~~~~~~-~~~~~~g~Vg~idG~~Vi~vps~~--~k~in~i~~h~~A~~~~~k~~ 254 (319) T protein:vir:97 179 KAP-ENRVLFVSPTFYKGIKKFVIALPQGDTR-QQVLGKGVQGELDGFVIVKVPTKL--LQGLQAIAVVGEVLASPIQAD 254 (319) T ss_pred CCC-CCcEEEeCHHHHHHHHhhhhhhcccccc-ccceeeeeceeecCeEEEEecccc--cccceEEEEcCCeeeeeeeee Confidence 999 6999999999999999999998866654 567899999999999999864432 235568999999999999999 Q ss_pred eeeecc-CcccceeeEEeeeeeeeEEEcCC--eEEEEEecCC Q lcl|NC_019506. 238 QTEAYR-MEKRFADAVKGLNVFGCKVIYPD--ALVCLKKTNP 276 (276) Q Consensus 238 ~~e~~~-~~~~~~~~i~~~~~yg~~v~~~~--~vv~~~~~~p 276 (276) .++..+ .++.||++|+++.+||++|++|+ +|.+...++| T Consensus 255 ~~~~~~p~~~~~a~~v~gr~y~d~~V~~~k~~~Iy~~~~~~~ 296 (319) T protein:vir:97 255 LAKTNSNIPGMFGTLAEQLLYTGAFVPEHLQKYIFTIGGTEV 296 (319) T ss_pred eeeccCCCccccceeeeeeeeeeeEEeccccceEEEeecCCc Confidence 999876 58899999999999999999988 5666667777 No 36 >protein:vir:94800 Length: 319 # NCBI annotation: ORF012 # Family: family:all:701 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240536;genbank:gi:66396203;genbank:GeneID:5133580 Probab=100.00 E-value=2.8e-43 Score=253.84 Aligned_cols=266 Identities=16% Similarity=0.166 Sum_probs=224.8 Q ss_pred CccchhhHHHHHHHHHHHHHHhhcchh-hhccccccccccCCcEEEEeccCcccceeecCCCCCCCccccccceEEEEEE Q lcl|NC_019506. 1 MAVTSFIPKLWSARLLAHLDKAHVVAN-LVNRDYEGEIKAYGDTVKINQIGAITVKEYTENSDIDAPEELSTTEKVLEIN 79 (276) Q Consensus 1 MA~~~l~~e~~~~~~~~~l~~~~v~~~-~~~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~ld 79 (276) =+|++...|.|++.+.+.+...+.... .+|++++. .+|++|+||+++..++.||+|++++.. ++++.++.+++|+ T Consensus 25 ~~nt~~l~~k~~~~LD~~~~~~~~s~~~~~N~~~e~---~gg~tVkIp~i~~~gl~DY~R~~g~~~-g~vt~~~~t~tid 100 (319) T protein:vir:94 25 EPGQTLLKNKHVGILERVTAVNAYSTPALISNDAIF---MEGRSFTVMKGDTTELKDYKRNATNEF-DHPKIEETTYFLD 100 (319) T ss_pred CcchHHHHHHHHHHHHHHHHHhhhhhhcccCcceEe---ccCcEEEEeeecccccccccCCCCccc-CCcccceeEEEee Confidence 456677889999999888877776654 46777765 469999999999999999999988765 7899999999999 Q ss_pred eeeecceeechHHHHhhhhhH--HHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccCCHHHHHHHHHHHHHHHhhc Q lcl|NC_019506. 80 KQKYFNFQIDDVDAAQIRTPL--MDAAMQRAAYALADETEKILLKEMDTNATSKLKPAATLDKTNIYEELIKVKVKLDEK 157 (276) Q Consensus 80 ~~~~~~~~v~d~d~~~~~~d~--~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~~~l~~~ 157 (276) +.+++.|.|++.|..++...+ .....+++...++.++|.+.++.+.+.+....+ .+.+..++|+.|.++..+|+++ T Consensus 101 qdR~~~F~VD~~D~~Etn~~l~a~~i~~~~~~~~v~PEiDay~~skla~~a~~~~~--~~~t~~n~y~~i~~a~~~Lde~ 178 (319) T protein:vir:94 101 QEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAKHLT--VGTGSDAQYDAVLDVSVELDEI 178 (319) T ss_pred cccccccccchhhHhhhhchhhHHHHHHHHHHHHhhhhhhHHHHHHHHhhcccccc--cccCHHHHHHHHHHHHHHHHhc Confidence 999999999999999987765 344566777889999999999999877655443 3468889999999999999999 Q ss_pred CCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEeccccccccceEEEEEecceEEeeeeee Q lcl|NC_019506. 158 NVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSNNMGSLTNGTGAIAGVKMACTFAEQIV 237 (276) Q Consensus 158 ~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp~~~~~~~~~~~~~~a~~~~~~~~ 237 (276) +|| ++|||+|+|+++..|++++.|......+ +..+++|.|++++||+|+++++.. ..+..++++|++|+.++.+.. T Consensus 179 ~VP-~~Rvl~Vtp~~~~~L~~~~~f~~~~~~~-~~~~~~g~Vg~idG~~Vi~vps~~--~k~in~i~~h~~A~~~~~k~~ 254 (319) T protein:vir:94 179 KAP-ENRVLFVSPTFYKGIKKFVIALPQGDTR-QQVLGKGVQGELDGFVIVKVPTKL--LQGLQAIAVVGEVLASPIQAD 254 (319) T ss_pred CCC-CCcEEEeCHHHHHHHHhhhhhhcccccc-ccceeeeeceeecCeEEEEecccc--cccceEEEEcCCeeeeeeeee Confidence 999 6999999999999999999998866654 567899999999999999864432 235568999999999999999 Q ss_pred eeeecc-CcccceeeEEeeeeeeeEEEcCC--eEEEEEecCC Q lcl|NC_019506. 238 QTEAYR-MEKRFADAVKGLNVFGCKVIYPD--ALVCLKKTNP 276 (276) Q Consensus 238 ~~e~~~-~~~~~~~~i~~~~~yg~~v~~~~--~vv~~~~~~p 276 (276) .++..+ .++.||++|+++.+||++|++|+ +|.+...++| T Consensus 255 ~~~~~~p~~~~~a~~v~gr~y~d~~V~~~k~~~Iy~~~~~~~ 296 (319) T protein:vir:94 255 LAKTNSNIPGMFGTLAEQLLYTGAFVPEHLQKYIFTIGGTEV 296 (319) T ss_pred eeeccCCCccccceeeeeeeeeeeEEeccccceEEEeecCCc Confidence 999876 58899999999999999999988 5666667777 No 37 >protein:vir:96833 Length: 275 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240157;genbank:gi:66395822;genbank:GeneID:5133174 Probab=100.00 E-value=1.6e-43 Score=255.25 Aligned_cols=260 Identities=16% Similarity=0.113 Sum_probs=219.0 Q ss_pred Cccc-----hhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCcc-cceeecCCCCCCCccccccceE Q lcl|NC_019506. 1 MAVT-----SFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGAI-TVKEYTENSDIDAPEELSTTEK 74 (276) Q Consensus 1 MA~~-----~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~~-~~~d~~~~~~~~~~~~~~~~~~ 74 (276) ||+. +++||+|+.++++.+.+.++|.+++..+.+-+ +.+|++|+||+|... .+.++.++..++. +.++.++. T Consensus 3 ~~~~T~l~d~i~PEv~~~~v~~~~~~~~~~~~~~~~~~~l~-g~~G~tv~iP~~~~ig~a~~~~~g~~i~~-~~lt~~~~ 80 (275) T protein:vir:96 3 LENMTKLANMVNPEVLAPMMQAELDKKLKFAQFADIDNTLV-GQPGNTITFPAFVYSGDAKVVPEGEEIPI-DLIETKKR 80 (275) T ss_pred CcccchhhhhhchHHHHHHHHHHHHHhhhhcccceeccccc-CCCCCEEEeeeeccCCccccccCCCCcch-hhccccee Confidence 5553 58899999999999999999999976554322 456999999999876 4778888888764 78999999 Q ss_pred EEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccCCHHHHHHHHHHHHHHH Q lcl|NC_019506. 75 VLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSKLKPAATLDKTNIYEELIKVKVKL 154 (276) Q Consensus 75 ~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~~~l 154 (276) .+++.+ +++++.++|++..++..|++.+..++++.++++++|+++++.+..+.....+ ....++.|.+|..+| T Consensus 81 ~~~i~~-~~~~~~i~D~~~~~~~~d~~~~~~~~~a~~~a~~~d~~ll~~l~~a~~~~~~------~~~~~d~i~dA~~~l 153 (275) T protein:vir:96 81 QATIRK-IGKGTVLTDEALLSGYGDPKGEAVRQHGLAIANKVDNDVLEALQGATLKVEA------DITKLAGLQTAIDKF 153 (275) T ss_pred eEEeeh-hcccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhcccccccc------cccCHHHHHHHHHHh Confidence 999955 6999999999999999999999999999999999999999999876544321 122378899999999 Q ss_pred hhcCCCccCCEEEECHHHHHHHhhhH--HhhhhcccccccceeeeeeeEEeceEEEEeccccccccceEEEEEecceEEe Q lcl|NC_019506. 155 DEKNVPTIGRFLIIPPDVHGLLLAAD--LIVGTGGAMAESITKNGFVGTILGFDVYLSNNMGSLTNGTGAIAGVKMACTF 232 (276) Q Consensus 155 ~~~~vP~~~r~~vv~p~~~~~L~~~~--~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp~~~~~~~~~~~~~~a~~~ 232 (276) ++.+. .+|+++|||+++..|+++. +|......+ +..+++|.||+++|++|++|+++|.. .++.++++|+++ T Consensus 154 gd~~~--~~~~ivv~p~~~~~L~k~~~~~f~~~~~~g-~~~~~~G~ig~~~G~~Vi~s~~~p~~----t~~i~~~gA~~~ 226 (275) T protein:vir:96 154 NDEDL--EPMVLFVNPLDAGKLRASATDNFTRATLLG-DNVIVKGAFGEALGAIIVRSNKIKEG----EAILAKRGAVKL 226 (275) T ss_pred ccccC--CccEEEeCHHHHHHHHhccccccccccccc-ccceeccccceecCeeEEEeCCCCcc----eEEEEeccceee Confidence 87763 6799999999999998875 666665544 56789999999999999999999853 467889999998 Q ss_pred eee-eeeeeeccCcccceeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 233 AEQ-IVQTEAYRMEKRFADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 233 ~~~-~~~~e~~~~~~~~~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) ..+ ...+|..|++++++|.|+++++||+++++|+++|+++.+.= T Consensus 227 ~~~~~~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~ 271 (275) T protein:vir:96 227 ITKRDFFLETERHASHKSTALFSDKHYVAYLYDESKVVKITKSAS 271 (275) T ss_pred eecCCcccccccchhhcCcEEEEeEEEEEEEEcCccEEEEEeccc Confidence 754 45899999999999999999999999999999999966433 No 38 >protein:vir:78935 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522824;genbank:gi:158345059;genbank:GeneID:5687425 Probab=100.00 E-value=6.1e-44 Score=257.52 Aligned_cols=270 Identities=14% Similarity=0.121 Sum_probs=227.4 Q ss_pred Cccc----------------hhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCcccceeecCCCCCC Q lcl|NC_019506. 1 MAVT----------------SFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGAITVKEYTENSDID 64 (276) Q Consensus 1 MA~~----------------~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~ 64 (276) |.|- +++ |+|+.++++.|.+.++|.+++... ..++|++++||.+|..++.++++|..+. T Consensus 1 ms~~~~~t~~~~~~s~~d~al~l-e~f~geV~~af~~~s~~~~~~~~r----ti~~g~s~~~~~iG~~~~~~~~pG~~l~ 75 (335) T protein:vir:78 1 MSFLNDLTRPNYAGKNADVDIHL-EEHLGIVDKHFAYTSKFAPLMNIR----DLRGSNVVRLDRLGNVEAKGRRAGEELE 75 (335) T ss_pred CCccccccccccccccchhhhhh-hhhhhHHHHHHHHhhhhcccccee----eeccceeEEEeeeeeeeecccccCcccC Confidence 6542 233 999999999999999999998642 2467999999999999999999999987 Q ss_pred CccccccceEEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc-----------c- Q lcl|NC_019506. 65 APEELSTTEKVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSK-----------L- 132 (276) Q Consensus 65 ~~~~~~~~~~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~-----------~- 132 (276) . +.+..++..++||+.++..+.|+|.|+.++++|++.++.++++++||+..|+.++..+..++... + T Consensus 76 ~-~~~~~~k~~itID~ll~a~~~VddlDe~~~~yDvR~e~s~~~G~aLA~~~Dq~~~~~l~~aa~~~a~~~~~~~~~~G~ 154 (335) T protein:vir:78 76 R-SRVVNDKWNLTVDTLLYLRHQFDHQDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVDLEDAFSPGV 154 (335) T ss_pred C-CCcccCCeEEEecceeechhhHhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccCCCcCCCc Confidence 6 67888999999999999999999999999999999999999999999999998876655433210 0 Q ss_pred -------cccccCCHHHHHHHHHHHHHHHhhcCCCcc---CCEEEECHHHHHHHhhhHHhhhhcccc--cccceeeeeee Q lcl|NC_019506. 133 -------KPAATLDKTNIYEELIKVKVKLDEKNVPTI---GRFLIIPPDVHGLLLAADLIVGTGGAM--AESITKNGFVG 200 (276) Q Consensus 133 -------~~~~~~t~~~~~~~i~~a~~~l~~~~vP~~---~r~~vv~p~~~~~L~~~~~~~~~~~~~--~~~~~~~G~i~ 200 (276) +.....++..+.+++.+|.+.|++++||.. +|+++|+|++|+.|+.+++|.+.++.. +...+.+|.++ T Consensus 155 ~~~~~~tg~~~~~~~~~l~~a~~~a~~~l~ekdvP~~~~~~rv~vv~P~~y~~Ll~~~~l~n~~~~~s~~~~~~~~g~v~ 234 (335) T protein:vir:78 155 LEKLDLTGLTAKEAAEKIVRMHRRVVETFIERDLGDAVYSEGLTPMSPRVFSLLLEHDKLMSVEYQATGATNDYVKSRVA 234 (335) T ss_pred ceeeeeccccccccHHHHHHHHHHHHHHHHhccCCCCCCCccEEEeChHHHHHHhcccccccccccccccccccccceeE Confidence 011223566678899999999999999964 699999999999999999999987543 34568999999 Q ss_pred EEeceEEEEeccccccccc-----------------eEEEEEecceEEeeeeee-eeeeccCcccceeeEEeeeeeeeEE Q lcl|NC_019506. 201 TILGFDVYLSNNMGSLTNG-----------------TGAIAGVKMACTFAEQIV-QTEAYRMEKRFADAVKGLNVFGCKV 262 (276) Q Consensus 201 ~~~G~~v~~s~~lp~~~~~-----------------~~~~~~~~~a~~~~~~~~-~~e~~~~~~~~~~~i~~~~~yg~~v 262 (276) +++||+|++|+++|..+.. ..+..+|+.|++.++... ..|.++++++|+|+|.+++.||+++ T Consensus 235 ~v~Gv~V~~Sn~lP~~~~t~~~lg~a~n~~~~d~~~~~~~~~~~~Al~t~~~~~~~~e~~~~~~~~~~~i~~~~a~G~g~ 314 (335) T protein:vir:78 235 ILNGVKVLETPRFATKAISAHPLGRHFNVSAEEAERQIALFLPSKTLITAQVAPVQAKLWEDHDQFSWVLDTFQMYNIGA 314 (335) T ss_pred EeeceEEEeeccCCCCCCccccccccCCcccccccceEEEEEecceEEEEEEEecccceeeccchhhHhhhHHHHcCCcc Confidence 9999999999999954311 246779999999998776 5688999999999999999999999 Q ss_pred EcCCeEEEEEec-CC Q lcl|NC_019506. 263 IYPDALVCLKKT-NP 276 (276) Q Consensus 263 ~~~~~vv~~~~~-~p 276 (276) +|||++++++.+ .| T Consensus 315 lRPe~a~~i~~tg~~ 329 (335) T protein:vir:78 315 RRPDTAGAIELKGIE 329 (335) T ss_pred cCcceEEEEEecCCC Confidence 999999999854 44 No 39 >protein:vir:107120 Length: 329 # NCBI annotation: conserved phage protein # Family: family:all:701 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950606;genbank:gi:119953686;genbank:GeneID:4643129 Probab=100.00 E-value=1.4e-42 Score=250.00 Aligned_cols=266 Identities=14% Similarity=0.153 Sum_probs=222.7 Q ss_pred CccchhhHHHHHHHHHHHHHHhhc-chhhhccccccccccCCcEEEEeccCcccceeecCCCCCCCccccccceEEEEEE Q lcl|NC_019506. 1 MAVTSFIPKLWSARLLAHLDKAHV-VANLVNRDYEGEIKAYGDTVKINQIGAITVKEYTENSDIDAPEELSTTEKVLEIN 79 (276) Q Consensus 1 MA~~~l~~e~~~~~~~~~l~~~~v-~~~~~~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~ld 79 (276) =+|++...|+|.+.|.+.|...+. ...++|++++. .+|++|+||+++..++.||+|++++.. ++++.++.+++|+ T Consensus 36 ~~nt~~l~~k~~~~LD~~~~~~~~s~~~~~N~~~e~---~~g~tVkIp~i~~~gl~DY~R~~g~~~-g~vt~~~~t~tid 111 (329) T protein:vir:10 36 EPGDTLLKNKHVGILEKVTAANSYSAPAVISNDAIF---MQGRSFTVIKGDVTELKDYKRNATNEF-DHPQIQETTYFLD 111 (329) T ss_pred CCchhHHHHHHHHHHHHHHHhhceeeeeecccceee---ccCcEEEEeeecccccccccCCCCccc-cccccceeEEEee Confidence 567788899999999999976544 44467888763 479999999999999999999998764 7899999999999 Q ss_pred eeeecceeechHHHHhhhhhH--HHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccCCHHHHHHHHHHHHHHHhhc Q lcl|NC_019506. 80 KQKYFNFQIDDVDAAQIRTPL--MDAAMQRAAYALADETEKILLKEMDTNATSKLKPAATLDKTNIYEELIKVKVKLDEK 157 (276) Q Consensus 80 ~~~~~~~~v~d~d~~~~~~d~--~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~~~l~~~ 157 (276) +.+++.|.|++.|..++...+ .....+++...+++++|.+.++.+.+.+.... ..+.+..++|+.|.++..+|+++ T Consensus 112 qdR~~~F~VD~~D~dEtn~~l~a~~i~~~~~~~~v~pEiDay~~skla~~a~~~~--~~~~t~~nay~~i~~a~~~Lde~ 189 (329) T protein:vir:10 112 QEKYWGRFVDALDRRDTEGNIDINYVVAKQASEVVAPYLDNLRFATLARNKAKHL--TVGSGADAQYDAVLDVSVELDEI 189 (329) T ss_pred cccceeeecchhhHhhhhhhhhHHHHHHHHHHHHhhhHHHHHHHHHHHhhccccc--ccccCHHHHHHHHHHHHHHHHhc Confidence 999999999999999987655 34445677888999999999999987765543 34468889999999999999999 Q ss_pred CCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEeccccccccceEEEEEecceEEeeeeee Q lcl|NC_019506. 158 NVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSNNMGSLTNGTGAIAGVKMACTFAEQIV 237 (276) Q Consensus 158 ~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp~~~~~~~~~~~~~~a~~~~~~~~ 237 (276) +|| ++||++|+|+++..|++++.|..... ..++.+++|.|++++||+|+++++... .+..++++|++|++++.+.. T Consensus 190 ~vp-~~Rvl~VtP~~~~~Lk~~~~f~~~~~-~~~~~~~~g~Vg~idG~~Ii~vps~~~--k~in~ii~~~~A~~~~~K~~ 265 (329) T protein:vir:10 190 GAG-ASRILFVTPKFYKGIKKFVIELPQGD-NRQQVLGKGVQGELDGFTIVKVPSKML--QGVEAMAVIGEVMASPIQAN 265 (329) T ss_pred CCC-CCcEEEeCHHHHHHHHhhhhhhcccc-ccccceeeeeeeeecCeEEEEecCCcc--cceeEEEEcCCceeeeeeee Confidence 998 59999999999999999998876543 345678999999999999998654432 34567899999999999999 Q ss_pred eeeecc-CcccceeeEEeeeeeeeEEEcCCeE--EEEEecCC Q lcl|NC_019506. 238 QTEAYR-MEKRFADAVKGLNVFGCKVIYPDAL--VCLKKTNP 276 (276) Q Consensus 238 ~~e~~~-~~~~~~~~i~~~~~yg~~v~~~~~v--v~~~~~~p 276 (276) .++..+ .++.+|++|+++.+||++|++|++. .+...++| T Consensus 266 ~~~~~~p~~~~~a~~v~gr~yyd~~V~~~k~~~I~~~~~~a~ 307 (329) T protein:vir:10 266 EAKLNSNVPGMFGTLAEQMLYTGAFVPEHLQKYIFTIGGKEV 307 (329) T ss_pred eeeeeCCCCccchheeeeeeeeeeEEEccccCEEEEecccCc Confidence 999876 5889999999999999999999844 44445555 No 40 >protein:vir:97031 Length: 402 # NCBI annotation: 31 # Family: family:all:2806 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654132;genbank:gi:108862016;genbank:GeneID:5075980 Probab=100.00 E-value=1e-43 Score=256.23 Aligned_cols=271 Identities=15% Similarity=0.107 Sum_probs=226.8 Q ss_pred Cccc---------------hhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCcccceeecCCCCCCC Q lcl|NC_019506. 1 MAVT---------------SFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGAITVKEYTENSDIDA 65 (276) Q Consensus 1 MA~~---------------~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~ 65 (276) |++- .|.=|+|..++.+.|.+.++|.+++.. . ...+|++++||.+|..++.++++|+.+.. T Consensus 1 Ms~~n~~t~~~~~~s~~~~al~le~f~geV~taF~~~si~~~~~~v-r---ti~~GkS~qf~~iG~~~a~y~~~G~~ldg 76 (402) T protein:vir:97 1 MSTPNTLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDV-Q---TVTGTNTVSNKYLGETELQVLAPGQSPNA 76 (402) T ss_pred CCCcccccccccccccchhhhhhhhhhhhHHHHHHHHHhhcCccee-e---eecccceEEEEEEeeeEEeeeccccccCC Confidence 6542 133499999999999999999998753 1 24679999999999999999999999865 Q ss_pred ccccccceEEEEEEeeeecceeechHHHHhhhhh-HHHHHHHHHHHHHHHHHHHHHHHHhhccccc-----------c-- Q lcl|NC_019506. 66 PEELSTTEKVLEINKQKYFNFQIDDVDAAQIRTP-LMDAAMQRAAYALADETEKILLKEMDTNATS-----------K-- 131 (276) Q Consensus 66 ~~~~~~~~~~~~ld~~~~~~~~v~d~d~~~~~~d-~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~-----------~-- 131 (276) +.+...+..++||+.+++...|+|+|+.++++| ++.++.++++++||+.+|+.+++.+..+... . T Consensus 77 -~~~~~~k~~ItID~lL~a~~~V~diDeaq~~yD~vRse~s~e~G~ALA~~~Dq~ii~~i~~aa~a~t~~~~~~~~~~~~ 155 (402) T protein:vir:97 77 -TPTQADKNQLVIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKGH 155 (402) T ss_pred -CCcccccEEEEeCceeechhhhhhHHHHHhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccCccccc Confidence 678888999999999999999999999999999 8999999999999999999998876542210 0 Q ss_pred ----c----cccccCCHHHHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhccc-ccccceeeeeeeEE Q lcl|NC_019506. 132 ----L----KPAATLDKTNIYEELIKVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGA-MAESITKNGFVGTI 202 (276) Q Consensus 132 ----~----~~~~~~t~~~~~~~i~~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~-~~~~~~~~G~i~~~ 202 (276) . ......++..++++|.++.+.|++++||.++|+++++|++|+.|+++++|.+.++. .+.+.+.+|.|+++ T Consensus 156 g~s~~~~~t~~~a~~~~~~l~~ai~~a~~~LdEkdVP~~dRv~vv~P~~y~~Ll~~~rl~n~d~~~~~~g~~~~G~v~~v 235 (402) T protein:vir:97 156 GFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATINGFVLSS 235 (402) T ss_pred ccccccccccchhhcCHHHHHHHHHHHHHHHHhcCCCccccEEEeChHHHHHHhhcccccchhhccccCCccccceeEEE Confidence 0 01123567778899999999999999999999999999999999999999988763 44566899999999 Q ss_pred eceEEEEeccccccc-----------------------cceEEEEEecceEEeeeeee-eeeeccCcccceeeEEeeeee Q lcl|NC_019506. 203 LGFDVYLSNNMGSLT-----------------------NGTGAIAGVKMACTFAEQIV-QTEAYRMEKRFADAVKGLNVF 258 (276) Q Consensus 203 ~G~~v~~s~~lp~~~-----------------------~~~~~~~~~~~a~~~~~~~~-~~e~~~~~~~~~~~i~~~~~y 258 (276) +||+||+|+++|..+ ....++.||+.|++.++... ..+.++++++|+++|.+.+.| T Consensus 236 ~Gv~Vv~SnnlP~~a~~it~~~ls~a~~G~~y~~t~d~t~~~~~~f~~~Av~tvk~~~vT~~~~~d~r~~~~~id~~~a~ 315 (402) T protein:vir:97 236 YNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKEKTYYIDTFMAE 315 (402) T ss_pred eceEEEecCccccccccccccccccCCCCccCCcCcccceeEEEEEecceEEEEEeeccccchhhchhHHHHHHHHHHHh Confidence 999999999999632 11235778999999987654 568899999999999999999 Q ss_pred eeEEEcCCeEEEEEecC-------C Q lcl|NC_019506. 259 GCKVIYPDALVCLKKTN-------P 276 (276) Q Consensus 259 g~~v~~~~~vv~~~~~~-------p 276 (276) |++++|||++++++.-. | T Consensus 316 G~g~~RPeaa~vv~~~~~~t~~~~~ 340 (402) T protein:vir:97 316 GAIPDRWEAVSVVTTKRDATTGDAG 340 (402) T ss_pred CCcccCccceEEEEEecccccccCC Confidence 99999999999985433 2 No 41 >protein:vir:105334 Length: 276 # NCBI annotation: putative phage major capsid protein # Family: family:all:522 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950669;genbank:gi:119967839;genbank:GeneID:4643213 Probab=100.00 E-value=4.6e-41 Score=241.74 Aligned_cols=260 Identities=18% Similarity=0.140 Sum_probs=220.6 Q ss_pred Cccc------hhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCcc-cceeecCCCCCCCccccccce Q lcl|NC_019506. 1 MAVT------SFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGAI-TVKEYTENSDIDAPEELSTTE 73 (276) Q Consensus 1 MA~~------~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~~-~~~d~~~~~~~~~~~~~~~~~ 73 (276) |||. +++||+|++++.+.+++.++|.+++..+.+-+ +.+|++|+||.+... .+.++.++..++ ++.++.++ T Consensus 1 Ma~~~T~l~d~i~Pev~~~~v~~~~~~~~~~~~~~~~~~~l~-g~~G~ti~iP~~~~igda~~~~eg~~i~-~~~lt~~~ 78 (276) T protein:vir:10 1 MAQGTTTKSTQIVPEVLAPMMQAELDKKLRFAQFADIDSTLV-GQPGDTLTFPAFVYSGDATVVPEGQKIP-VDKIETNR 78 (276) T ss_pred CCcceeehhhhhchHHHHHHHHHHHHhhhhhcccceeccccc-CCCCCEEEeeeecCCCccccccCCCccC-ccccccce Confidence 9974 48999999999999999999999987765443 457999999999876 467788888876 47899999 Q ss_pred EEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccCCHHHHHHHHHHHHHH Q lcl|NC_019506. 74 KVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSKLKPAATLDKTNIYEELIKVKVK 153 (276) Q Consensus 74 ~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~~~ 153 (276) ...++.+ +++.+.++|++..++..|++.+.+++++.++|+++|+++++.+..+..... +..+ .++.|.+|..+ T Consensus 79 ~~a~i~~-~~k~~~~tD~a~~~~~~dp~~~~~~~~~~~~a~~~d~~~~~~l~~~~~~~~--~~~~----t~d~i~~A~~~ 151 (276) T protein:vir:10 79 REAKIHK-IGKGTDITDEALLSGYGDPQGEAVRQHGLAIANKVDNDVLEALRGTKLTVS--ADIG----TLAGLEAAIDT 151 (276) T ss_pred eeEEeeh-ccccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhccccccc--cccc----CHHHHHHHHHH Confidence 9999955 699999999999999999999999999999999999999999987654432 1122 26789999999 Q ss_pred HhhcCCCccCCEEEECHHHHHHHhhh--HHhhhhcccccccceeeeeeeEEeceEEEEeccccccccceEEEEEecceEE Q lcl|NC_019506. 154 LDEKNVPTIGRFLIIPPDVHGLLLAA--DLIVGTGGAMAESITKNGFVGTILGFDVYLSNNMGSLTNGTGAIAGVKMACT 231 (276) Q Consensus 154 l~~~~vP~~~r~~vv~p~~~~~L~~~--~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp~~~~~~~~~~~~~~a~~ 231 (276) |++++. ..++++|||..+..|+++ .+|......+ +..+++|.||+++|++|++|+++|. +.++.++++|++ T Consensus 152 lgd~~~--~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g-~~~~~~G~ig~~~G~~Vi~s~~~p~----~t~~l~~~gAi~ 224 (276) T protein:vir:10 152 FDDEDL--EPMVLFINPKDAGKLRSSASDNFTRATELG-DNIIVKGAFGEALGAVIVRSKKLDE----GEAILAKRGAVK 224 (276) T ss_pred hccccC--cccEEEEcHHHHHHHHHhcccccccccccc-ccceeccccceecceeEEEcCCCCc----ceEEEEecccee Confidence 998764 678999999999999875 5677665544 5678999999999999999999985 346788999999 Q ss_pred ee-eeeeeeeeccCcccceeeEEeeeeeeeEEEcCCeEEEEEecC---C Q lcl|NC_019506. 232 FA-EQIVQTEAYRMEKRFADAVKGLNVFGCKVIYPDALVCLKKTN---P 276 (276) Q Consensus 232 ~~-~~~~~~e~~~~~~~~~~~i~~~~~yg~~v~~~~~vv~~~~~~---p 276 (276) +. ++...+|..|++++++|.|+++.+||+++++|+++++++.+. | T Consensus 225 ~~~~~~~~vE~dRd~~~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~~~ 273 (276) T protein:vir:10 225 LITKRDFFLETDRDPSTKTTALYSDKHYVAYLYDESKAVKVTKGAGTTD 273 (276) T ss_pred eeecCCceeecccchhhcccEEEEeeEEEEEEEcCcceEEEecCCcCCc Confidence 87 445589999999999999999999999999999999998543 3 No 42 >protein:vir:102655 Length: 322 # NCBI annotation: Hypothetical protein # Family: family:all:6384 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_052979;genbank:gi:50282923;genbank:GeneID:2948122 Probab=100.00 E-value=2e-39 Score=232.76 Aligned_cols=271 Identities=14% Similarity=0.157 Sum_probs=204.3 Q ss_pred Cccch---hhHHHHHHHHHHHHH-HhhcchhhhccccccccccCCcEEEEeccCccc------ceeecCCCCCCCc-ccc Q lcl|NC_019506. 1 MAVTS---FIPKLWSARLLAHLD-KAHVVANLVNRDYEGEIKAYGDTVKINQIGAIT------VKEYTENSDIDAP-EEL 69 (276) Q Consensus 1 MA~~~---l~~e~~~~~~~~~l~-~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~~~------~~d~~~~~~~~~~-~~~ 69 (276) |+.++ |+ ++|++++...++ +...+.+.+... . ...++++++.+.+.... .....+.+.++.+ .+. T Consensus 13 Ms~~i~~~fv-~qy~~~v~~~~qq~~s~L~~tV~~~-~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~dtp~~~~ 88 (322) T protein:vir:10 13 IAGDIDQAFV-QTYETTLRILSQQKSAKLKQYCQHK-N--ESSESHNWETLASMDPDAVKRKRSRQQSADGTYPTPVNNK 88 (322) T ss_pred eechhhhHHH-HHHHHHHHHHHHHhhhhhhcccccc-c--ccccccceeecccccccccccccccccccCcccCCCcccc Confidence 77653 44 889999998884 566666665321 1 23456777776553322 2222222333222 344 Q ss_pred ccceEEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc--c--------cccCC Q lcl|NC_019506. 70 STTEKVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSKLK--P--------AATLD 139 (276) Q Consensus 70 ~~~~~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~~~--~--------~~~~t 139 (276) +.+...+.+++ +++++.|+|.|+.+..+|+++.+.++++.+|++++|+.+++.+...+..... + ....+ T Consensus 89 ~~~~r~~~~~d-~~~~~~VDd~D~~k~~~D~~~~~~~~~a~AL~R~~D~~I~~a~~g~a~~~~~gt~v~~~ss~~i~~g~ 167 (322) T protein:vir:10 89 PFAKRRTNVDT-YDTGHVVEQEDISQMLLDPNSALITSQAYAMARKTDDLIIAGAWKPASIKGTGQPVEFLATQEIGDGT 167 (322) T ss_pred ccceEEEeecc-cccceecchHHHHHhhcCchHHHHHHHHHHhhhHHHHHHHhhhhccccccccccccccCCCcccccCc Confidence 56667777755 4888999999999999999999999999999999999999766543321110 0 01112 Q ss_pred HHHHHHHHHHHHHHHhhcCCCcc-CCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEecccccccc Q lcl|NC_019506. 140 KTNIYEELIKVKVKLDEKNVPTI-GRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSNNMGSLTN 218 (276) Q Consensus 140 ~~~~~~~i~~a~~~l~~~~vP~~-~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp~~~~ 218 (276) ....++.|++|++.|++++||.+ +||++++|.++..|+.+++|+++++.+.+...++|.|++|+||+|++|++||.... T Consensus 168 ~g~t~~kl~~a~~~l~~~dvp~d~~R~~vv~p~~~~~LL~d~~~ts~D~~~~~~l~~~G~ig~~lGf~~i~s~~lp~~~~ 247 (322) T protein:vir:10 168 KPISFDYVTEITERFLENEIEPEVSKVIVIGPTQARKLLQITEATSADYTSAMDLQSKGIITNWMGYTWIVSTRLDKFDP 247 (322) T ss_pred cchhHHHHHHHHHHHHhcCCCCCCCeEEEeCHHHHHHHhcchhhhhhhcccchhhhhcCeeeeeeeEEEEEeccCCcccc Confidence 23457899999999999999976 59999999999999999999999998877777889999999999999999995432 Q ss_pred --------------ceEEEEEecceEEeeeeee-eee-eccCcccceeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 219 --------------GTGAIAGVKMACTFAEQIV-QTE-AYRMEKRFADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 219 --------------~~~~~~~~~~a~~~~~~~~-~~e-~~~~~~~~~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) ...|+++|++|++++.+.+ .++ .+++.+.+++.|++.+.||+++++|++|++|+..-- T Consensus 248 t~~~~~~~~~~~~~~~~~~a~~k~Av~~a~~~dv~~~i~~~~~~~~a~~I~~~~~~Ga~ri~~~gVv~i~~~e~ 321 (322) T protein:vir:10 248 TQWGMAAEDGPQGDEIWCIAMTDMALGYHSCKDIWTKVAEDPSASFAWRIYSAFTADCVRVEDEHIFKLRLKNS 321 (322) T ss_pred ccccccccCCCCccceeEEEEecCceeEEEeeeeeEEeeccCCcchhhhhhhhhhhCceEeccCcEEEEEEecc Confidence 2358999999999997543 344 566777789999999999999999999999997666 No 43 >protein:vir:3033 Length: 272 # NCBI annotation: major capsid protein # Family: family:all:522 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438146;genbank:gi:16271809;genbank:GeneID:929235 Probab=100.00 E-value=3.3e-38 Score=226.05 Aligned_cols=259 Identities=15% Similarity=0.140 Sum_probs=216.2 Q ss_pred Cccc------hhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCc-ccceeecCCCCCCCccccccce Q lcl|NC_019506. 1 MAVT------SFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGA-ITVKEYTENSDIDAPEELSTTE 73 (276) Q Consensus 1 MA~~------~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~-~~~~d~~~~~~~~~~~~~~~~~ 73 (276) ||++ +++||+|++.+.+.+.+.+++.+++..++..+ +..|++|+||+++. ..+.++.+|...+. ++++.++ T Consensus 1 MA~~~T~~~~~~iPev~s~~v~~~~~~~~~~~~~~~~~~~~~-g~~G~tv~iP~~~~~~~a~~v~eg~~i~~-~~~~~~~ 78 (272) T protein:vir:30 1 MAVGTTKMAQMLDPEVLADMIDAEVGKAIRFAPLAEVDTTLE-GQPGTTLTVPKWDYIGDAEDVAEGEAIPM-TQLGFKK 78 (272) T ss_pred CCCccccchheechHHHHHHHHHHHHHHhhhhcccccccccc-CCCCCEEEEEEecCCCCcccccCCCcccc-cccccce Confidence 9975 48999999999999999999999987765533 35799999999875 46788888887764 7899999 Q ss_pred EEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccCCHHHHHHHHHHHHHH Q lcl|NC_019506. 74 KVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSKLKPAATLDKTNIYEELIKVKVK 153 (276) Q Consensus 74 ~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~~~ 153 (276) +.+++++ .++.+.++|++..++..|+++++.+++++++++++|.++++.+..+..... ....++.|.+|... T Consensus 79 ~~~~~~~-~~~~~~itd~~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~a~~~~~-------~~~t~d~i~da~~~ 150 (272) T protein:vir:30 79 TTMTIKK-AGKGVEITDEAILSGYGDPVGQAAKQIVEAIDHKVDADVLDALSKSTQTVE-------ATATVDGVSKALDI 150 (272) T ss_pred EEEEeee-eeeeeeecHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccc-------cccCHHHHHHHHHH Confidence 9999977 477899999999999999999999999999999999999998876543322 12237889999999 Q ss_pred HhhcCCCccCCEEEECHHHHHHHhhhH--HhhhhcccccccceeeeeeeEEeceEEEEeccccccccceEEEEEecceEE Q lcl|NC_019506. 154 LDEKNVPTIGRFLIIPPDVHGLLLAAD--LIVGTGGAMAESITKNGFVGTILGFDVYLSNNMGSLTNGTGAIAGVKMACT 231 (276) Q Consensus 154 l~~~~vP~~~r~~vv~p~~~~~L~~~~--~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp~~~~~~~~~~~~~~a~~ 231 (276) |++.+ ...++++|||.++..|+++. .+...... +.+.+++|.+++++|++|++|+++|. +.++++++++++ T Consensus 151 l~~~~--~~~~~~vv~p~~~~~L~k~~~~~~~~~~~~-~~~~~~~g~ig~i~G~~Vi~s~~~p~----~t~~~~~~~a~~ 223 (272) T protein:vir:30 151 FNDED--DAETVIVMNPADASTLRLDAAKEWLGATEV-GANRVVSGVYGEVLGVQIVRSRKCPK----GTAYMVRKGALR 223 (272) T ss_pred HhccC--CCccEEEEcHHHHHHHHHhccccccccccc-cccccccccchhhcCeeEEEcCCCCc----ceEEEEcCCeEE Confidence 98776 45789999999999998874 33443333 34578899999999999999999984 336788999999 Q ss_pred eeee-eeeeeeccCcccceeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 232 FAEQ-IVQTEAYRMEKRFADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 232 ~~~~-~~~~e~~~~~~~~~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) ++.+ ...+|.+|++.++++.++++.+||+++++|++++.++.++= T Consensus 224 ~~~~~~~~ve~~r~~~~~~~~i~~~~~~~~~v~~~~~vv~~t~~~a 269 (272) T protein:vir:30 224 IMLKRNTMVETDRDITKAINQIVANKHYGVYLYKAEKAVKITLKDA 269 (272) T ss_pred EEecCCceeeeccccccceeEEEEEEEEEEEEEcCCceEEEEeccc Confidence 9854 44789999999999999999999999999999999976544 No 44 >protein:vir:9820 Length: 272 # NCBI annotation: putative major capsid/head protein # Family: family:all:522 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795582;genbank:gi:28876339;genbank:GeneID:1257858 Probab=100.00 E-value=3.3e-38 Score=226.05 Aligned_cols=259 Identities=15% Similarity=0.140 Sum_probs=216.2 Q ss_pred Cccc------hhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCc-ccceeecCCCCCCCccccccce Q lcl|NC_019506. 1 MAVT------SFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGA-ITVKEYTENSDIDAPEELSTTE 73 (276) Q Consensus 1 MA~~------~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~-~~~~d~~~~~~~~~~~~~~~~~ 73 (276) ||++ +++||+|++.+.+.+.+.+++.+++..++..+ +..|++|+||+++. ..+.++.+|...+. ++++.++ T Consensus 1 MA~~~T~~~~~~iPev~s~~v~~~~~~~~~~~~~~~~~~~~~-g~~G~tv~iP~~~~~~~a~~v~eg~~i~~-~~~~~~~ 78 (272) T protein:vir:98 1 MAVGTTKMAQMLDPEVLADMIDAEVGKAIRFAPLAEVDTTLE-GQPGTTLTVPKWDYIGDAEDVAEGEAIPM-TQLGFKK 78 (272) T ss_pred CCCccccchheechHHHHHHHHHHHHHHhhhhcccccccccc-CCCCCEEEEEEecCCCCcccccCCCcccc-cccccce Confidence 9975 48999999999999999999999987765533 35799999999875 46788888887764 7899999 Q ss_pred EEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccCCHHHHHHHHHHHHHH Q lcl|NC_019506. 74 KVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSKLKPAATLDKTNIYEELIKVKVK 153 (276) Q Consensus 74 ~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~~~ 153 (276) +.+++++ .++.+.++|++..++..|+++++.+++++++++++|.++++.+..+..... ....++.|.+|... T Consensus 79 ~~~~~~~-~~~~~~itd~~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~a~~~~~-------~~~t~d~i~da~~~ 150 (272) T protein:vir:98 79 TTMTIKK-AGKGVEITDEAILSGYGDPVGQAAKQIVEAIDHKVDADVLDALSKSTQTVE-------ATATVDGVSKALDI 150 (272) T ss_pred EEEEeee-eeeeeeecHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccc-------cccCHHHHHHHHHH Confidence 9999977 477899999999999999999999999999999999999998876543322 12237889999999 Q ss_pred HhhcCCCccCCEEEECHHHHHHHhhhH--HhhhhcccccccceeeeeeeEEeceEEEEeccccccccceEEEEEecceEE Q lcl|NC_019506. 154 LDEKNVPTIGRFLIIPPDVHGLLLAAD--LIVGTGGAMAESITKNGFVGTILGFDVYLSNNMGSLTNGTGAIAGVKMACT 231 (276) Q Consensus 154 l~~~~vP~~~r~~vv~p~~~~~L~~~~--~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp~~~~~~~~~~~~~~a~~ 231 (276) |++.+ ...++++|||.++..|+++. .+...... +.+.+++|.+++++|++|++|+++|. +.++++++++++ T Consensus 151 l~~~~--~~~~~~vv~p~~~~~L~k~~~~~~~~~~~~-~~~~~~~g~ig~i~G~~Vi~s~~~p~----~t~~~~~~~a~~ 223 (272) T protein:vir:98 151 FNDED--DAETVIVMNPADASTLRLDAAKEWLGATEV-GANRVVSGVYGEVLGVQIVRSRKCPK----GTAYMVRKGALR 223 (272) T ss_pred HhccC--CCccEEEEcHHHHHHHHHhccccccccccc-cccccccccchhhcCeeEEEcCCCCc----ceEEEEcCCeEE Confidence 98776 45789999999999998874 33443333 34578899999999999999999984 336788999999 Q ss_pred eeee-eeeeeeccCcccceeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 232 FAEQ-IVQTEAYRMEKRFADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 232 ~~~~-~~~~e~~~~~~~~~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) ++.+ ...+|.+|++.++++.++++.+||+++++|++++.++.++= T Consensus 224 ~~~~~~~~ve~~r~~~~~~~~i~~~~~~~~~v~~~~~vv~~t~~~a 269 (272) T protein:vir:98 224 IMLKRNTMVETDRDITKAINQIVANKHYGVYLYKAEKAVKITLKDA 269 (272) T ss_pred EEecCCceeeeccccccceeEEEEEEEEEEEEEcCCceEEEEeccc Confidence 9854 44789999999999999999999999999999999976544 No 45 >protein:vir:79008 Length: 299 # NCBI annotation: putative main capsid protein # Family: family:all:701 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110725;genbank:gi:134287342;genbank:GeneID:4955182 Probab=100.00 E-value=6.6e-38 Score=224.41 Aligned_cols=275 Identities=16% Similarity=0.064 Sum_probs=209.3 Q ss_pred CccchhhHHHHHHHHHHHHHHhhcchhhhccccccccc-cCCcEEEEeccCcccceeecCCCCCCCccccccceEEEEEE Q lcl|NC_019506. 1 MAVTSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIK-AYGDTVKINQIGAITVKEYTENSDIDAPEELSTTEKVLEIN 79 (276) Q Consensus 1 MA~~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~-~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~ld 79 (276) ||+-. ++++|++++++.|.+.+++..|.++.+.++.. .+|++|+||+++..+..||+|++......+++.++.+++|+ T Consensus 1 MA~~n-~a~~~~~~Ld~~~~~~l~~~~L~~~~~~~~v~~~gg~tVkI~~i~~~gl~DY~R~~~g~~~g~~~~~~~t~~ld 79 (299) T protein:vir:79 1 MAALN-YAKEYSNVLAQAYPYTLNFGDLYATPNNGRYRWTGSKTIEIPTISTTGRVDSNRDTIAVAQRNYDNAWEPKVLT 79 (299) T ss_pred Cccch-hHHHHHHHHHHHHHhhceeeeeccCcccceeeecCCCEEEEeccccccccccccCCCcccccccCcceeEEEee Confidence 99533 36999999999999999999988777666543 56899999999999999999976433446788899999999 Q ss_pred eeeecceeechHHHHhhhhhH--HHHHHHHHHHHHHHHHHHHHHHHhhccccccc--cccccCCHHHHHHHHHHHHHHHh Q lcl|NC_019506. 80 KQKYFNFQIDDVDAAQIRTPL--MDAAMQRAAYALADETEKILLKEMDTNATSKL--KPAATLDKTNIYEELIKVKVKLD 155 (276) Q Consensus 80 ~~~~~~~~v~d~d~~~~~~d~--~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~~--~~~~~~t~~~~~~~i~~a~~~l~ 155 (276) +.+++.|.|++.|..++.... .....+.+...++.++|.+.++.+.+.+...+ ...+.+|++++|+.|.++..+|+ T Consensus 80 qdr~~~f~vD~~Dvdet~~~~~~a~v~~~~~~~~v~pEiDay~~skl~~~a~~~g~~~~~~~~T~~n~y~~i~~~~~~ld 159 (299) T protein:vir:79 80 NQRKWSTLVHPADINQTNYVASIGNITKVYNEEQKFPEMDAYCISKIYADWTALGNTADTTVLTTTNVLEVFDKLMEKMT 159 (299) T ss_pred ccccceeccchhhHHHHhhhhHHHHHHHHHHHHHhhhHhhHHHHHHHHHhhhhcCCcccccccCHHHHHHHHHHHHHHHH Confidence 999999999977766654332 22223334456888999999998877665433 34556789999999999999999 Q ss_pred hcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEE--ecccccc---ccc---------eE Q lcl|NC_019506. 156 EKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYL--SNNMGSL---TNG---------TG 221 (276) Q Consensus 156 ~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~--s~~lp~~---~~~---------~~ 221 (276) +++||.++||++|+|.++..|+++++|.+..........++|.|++++||+|++ |+++++. +.| -. T Consensus 160 e~~vP~~~rvl~vtp~~~~~L~~~~~f~k~~~~~~~~~~~~g~Vg~idG~~Ii~Vps~r~~t~~~~~~G~~~~~~ak~in 239 (299) T protein:vir:79 160 EARVPENGRILYVTPVVNTLIKNAKEIQRTVNIKDAGTSLNRQTTDIDTVKIIKVPSNLMKTAYDFTTGWKVGAGAKQIF 239 (299) T ss_pred hcCCCCCCeEEEeCHHHHHHHhhchhhhcccccccccceeeeeeeeecceEEEEechhhcCccceeccCccccCcccccc Confidence 999999999999999999999999999887776666678899999999999987 5555531 111 23 Q ss_pred EEEEecceEEeeeeeeeeeeccCc-ccce-eeEEeeeeeeeEEEcCC--eEEEEEecCC Q lcl|NC_019506. 222 AIAGVKMACTFAEQIVQTEAYRME-KRFA-DAVKGLNVFGCKVIYPD--ALVCLKKTNP 276 (276) Q Consensus 222 ~~~~~~~a~~~~~~~~~~e~~~~~-~~~~-~~i~~~~~yg~~v~~~~--~vv~~~~~~p 276 (276) ++..|++|.........+....+. +..+ .++..+.++++.|++.. +|.+-..+|= T Consensus 240 ~ii~~~~a~~~~~K~~~~~~~~P~~~~~~~~~~~~r~y~d~~v~~nk~~~i~~~~~~a~ 298 (299) T protein:vir:79 240 MSLVHPSAIITPVSYQFSKLDEPTAVTEGKYFYFEESFEDVFILNKKADAIQFVVEGAG 298 (299) T ss_pred eEEEcCCeeeeeEeeeeEEeecCCCCCccceeeeeeeeeeeeeeccccCeEEEEeeecC Confidence 577899998877666555544331 2222 26678888999999754 4433333333 No 46 >protein:vir:7019 Length: 401 # NCBI annotation: major capsid protein # Family: family:all:2806 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853592;genbank:gi:31711674;genbank:GeneID:1481800 Probab=100.00 E-value=2e-39 Score=232.73 Aligned_cols=271 Identities=14% Similarity=0.108 Sum_probs=219.4 Q ss_pred Cccc---------------hhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCcccceeecCCCCCCC Q lcl|NC_019506. 1 MAVT---------------SFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGAITVKEYTENSDIDA 65 (276) Q Consensus 1 MA~~---------------~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~ 65 (276) |++- .|.=|+|..++++.|.+..+|.+++... ...+|++++||.+|..++.++++|+.+.. T Consensus 1 Ms~~n~~t~~~~~~sg~~~al~Le~f~GeV~taF~~~si~~~~~~vR----ti~~gkS~qf~~~G~s~~~~~~pG~~ld~ 76 (401) T protein:vir:70 1 MSTPNNLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENIMSYFDVQ----TVTGTNTVSNKYLGETELQVLAPGQSPAA 76 (401) T ss_pred CCCCccccccccccccchhHhHHhHhcchHHHHHHHHhhhcccceee----eecccceEEEEEeeeeEeeeecCCCCcCC Confidence 6542 1334999999999999999999887531 34689999999999999999999999875 Q ss_pred ccccccceEEEEEEeeeecceeechHHHHhhhhh-HHHHHHHHHHHHHHHHHHHHHHHHhhccccc-------c--c--- Q lcl|NC_019506. 66 PEELSTTEKVLEINKQKYFNFQIDDVDAAQIRTP-LMDAAMQRAAYALADETEKILLKEMDTNATS-------K--L--- 132 (276) Q Consensus 66 ~~~~~~~~~~~~ld~~~~~~~~v~d~d~~~~~~d-~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~-------~--~--- 132 (276) +.+...+..|+||..++..+.|+|+|+.++++| ++.++.++++++||+.+|+.++..+..+... . + T Consensus 77 -~~~~~dK~~ItID~lL~a~~~V~dlDe~q~~yD~vRse~s~e~G~ALA~~~Dq~iiq~i~~aa~ana~~~~~~p~~~~~ 155 (401) T protein:vir:70 77 -TSTQADKNQLVIDATVIARNTVAHLHDVQGDIDSLKPKLATNQAKQLKRMEDEMLIQQMMLGGIANTQAKRTNPRVKGH 155 (401) T ss_pred -CCcccccEEEEeCceeehhhhhhhHHHHHhcccccchHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccCCCcCCC Confidence 677888999999999999999999999999999 8999999999999999999998887533210 0 0 Q ss_pred ---------cccccCCHHHHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhccc-ccccceeeeeeeEE Q lcl|NC_019506. 133 ---------KPAATLDKTNIYEELIKVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGA-MAESITKNGFVGTI 202 (276) Q Consensus 133 ---------~~~~~~t~~~~~~~i~~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~-~~~~~~~~G~i~~~ 202 (276) ......++..+.++|.+|...|++++||.++++++++|..|+.|+..+.+.+.++. .+.+.+.+|+|.++ T Consensus 156 G~~i~v~~~~~~~~~~~~~l~~ai~dA~~~LdEkdVP~~r~vvl~pp~~Ys~Ll~~d~L~nrd~~~s~~g~~~~G~v~~v 235 (401) T protein:vir:70 156 GFSINVEVAEGEALVNPQYVMAAVEFALEQQLEQEVDISDVAILMPWRYFNVLRDADRIVDKTYTISQSGATIQGFTLSS 235 (401) T ss_pred ceEEeccccccccccCHHHHHHHHHHHHHHHHhcCCCccceEEEcCHHHHHHHHhcCcccchhhccccCCccccceEEEE Confidence 01123466778999999999999999996644445577777788887778777653 34567899999999 Q ss_pred eceEEEEeccccccc-----------------------cceEEEEEecceEEeeeeee-eeeeccCcccceeeEEeeeee Q lcl|NC_019506. 203 LGFDVYLSNNMGSLT-----------------------NGTGAIAGVKMACTFAEQIV-QTEAYRMEKRFADAVKGLNVF 258 (276) Q Consensus 203 ~G~~v~~s~~lp~~~-----------------------~~~~~~~~~~~a~~~~~~~~-~~e~~~~~~~~~~~i~~~~~y 258 (276) +||+||+|+++|..+ .+..++.||++|++.++... ..+.|++.+.|.++|.+.+.| T Consensus 236 aGv~Vv~SnnlP~~a~~it~~~ls~a~~G~~y~~~~d~s~~~~v~f~~~Av~tvk~~~lt~~~~~d~r~~~~~id~~~a~ 315 (401) T protein:vir:70 236 YNCPVIPSNRFPKYSQGQTHHLLSNEDNGYRYDPLPAMNGAIAVLFTADALLVGRSIDVTGDIFYEKKEKTYYIDTFMAE 315 (401) T ss_pred eceEEEeeccccccccccccccccccCCCccCCCCccccceeEEEEehhheEEEEeeccccchhhhhhhhHHHHHHHHHh Confidence 999999999999632 11235678999998887654 567899999999999999999 Q ss_pred eeEEEcCCeEEEEE--ec----CC Q lcl|NC_019506. 259 GCKVIYPDALVCLK--KT----NP 276 (276) Q Consensus 259 g~~v~~~~~vv~~~--~~----~p 276 (276) |++++|||++++++ -+ .| T Consensus 316 g~g~~RPeaa~vv~~k~~~~~~~~ 339 (401) T protein:vir:70 316 GAIPDRWEAVSVVTTKRNTTTGAV 339 (401) T ss_pred CCcccchhheEEEeecCccccccc Confidence 99999999999973 22 33 No 47 >protein:vir:105645 Length: 400 # NCBI annotation: putative major capsid protein # Family: family:all:2806 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425009;genbank:gi:83571757;uniprot:Q2WC43;genbank:GeneID:3837286 Probab=100.00 E-value=4e-39 Score=231.10 Aligned_cols=271 Identities=14% Similarity=0.099 Sum_probs=221.6 Q ss_pred Cccc---------------hhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCcccceeecCCCCCCC Q lcl|NC_019506. 1 MAVT---------------SFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGAITVKEYTENSDIDA 65 (276) Q Consensus 1 MA~~---------------~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~ 65 (276) |++- .|.=|+|..++++.|.+..+|.+++... ...+|+|++||.+|..++..+++|+.+.. T Consensus 1 Ms~~n~~t~p~~~gsg~~~aL~Le~f~GeV~taF~~~si~~~~~~vR----tI~~gkS~qf~~lG~s~a~y~~pG~~ldg 76 (400) T protein:vir:10 1 MSTPNNLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENIMSYFDVQ----TVTGTNTVSNKYLGETELQVLAPGQSPAA 76 (400) T ss_pred CCCCccccccccccccchhhhHHhHhcchHHHHHHHHhhhcccceee----eecccceEEEEEeeeeEEeeecCCCCcCC Confidence 6542 1345999999999999999999887531 34679999999999999999999999875 Q ss_pred ccccccceEEEEEEeeeecceeechHHHHhhhhh-HHHHHHHHHHHHHHHHHHHHHHHHhhccccc----------c--- Q lcl|NC_019506. 66 PEELSTTEKVLEINKQKYFNFQIDDVDAAQIRTP-LMDAAMQRAAYALADETEKILLKEMDTNATS----------K--- 131 (276) Q Consensus 66 ~~~~~~~~~~~~ld~~~~~~~~v~d~d~~~~~~d-~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~----------~--- 131 (276) +.+...+..|+||...+....|+|+|+.++++| ++.++.++++++||+.+|+.++..+..+... . T Consensus 77 -~~~~~dk~~ItIDtLL~a~~~V~dlDd~q~~yD~vRse~s~e~G~ALA~~~Dq~iiq~i~~a~~a~t~~~~~~~~g~~~ 155 (400) T protein:vir:10 77 -TSTQADKNQLVIDATVIARNTVAHLHDVQGDIDSLKPKLATNQAKQLKKMEDEMLIQQMLLGGIANTQAKRTNPRVKGH 155 (400) T ss_pred -CCcccCcEEEEeCceeeecchhhhHHHHhhccccccHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCCcccc Confidence 567888999999999999999999999999999 8999999999999999999998776443210 0 Q ss_pred --------ccccccCCHHHHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhccc-ccccceeeeeeeEE Q lcl|NC_019506. 132 --------LKPAATLDKTNIYEELIKVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGA-MAESITKNGFVGTI 202 (276) Q Consensus 132 --------~~~~~~~t~~~~~~~i~~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~-~~~~~~~~G~i~~~ 202 (276) .+.....++..+..+|.+|...|++++||.++++++++|..|+.|+..+.+.+.++. .+...+.+|+|.++ T Consensus 156 g~s~~v~~~~~~~~~~~~~l~~A~~~A~~~LdEkdVP~~d~vvl~pp~~Ys~Ll~~dkLvnrdf~~s~~g~~~~g~v~~v 235 (400) T protein:vir:10 156 GFSVNVEVNEGEALVNPQYVMAAVEFALEQQLEQEVDISDVAILMPWRYFNVLRDADRIVDKSYTISQSGATIQGFVLSS 235 (400) T ss_pred ccceeecccccccccCHHHHHHHHHHHHHHHHhcCCCccceEEEcCHHHHHHHHhCCcccchhccccCCCccccceEEEE Confidence 011222456677889999999999999997766677788888888887888877754 23466899999999 Q ss_pred eceEEEEeccccccc-----------------------cceEEEEEecceEEeeeeee-eeeeccCcccceeeEEeeeee Q lcl|NC_019506. 203 LGFDVYLSNNMGSLT-----------------------NGTGAIAGVKMACTFAEQIV-QTEAYRMEKRFADAVKGLNVF 258 (276) Q Consensus 203 ~G~~v~~s~~lp~~~-----------------------~~~~~~~~~~~a~~~~~~~~-~~e~~~~~~~~~~~i~~~~~y 258 (276) +||+|++|+++|... .+..++.||++|++.++... ..+.|+++++|+++|.+.+.| T Consensus 236 ~Gv~Iv~Sn~lP~~a~~~~~~~lS~a~~G~~y~~t~d~s~~~av~F~~sAv~tvk~~~lt~~~~~d~r~~~~~id~~~a~ 315 (400) T protein:vir:10 236 YNCPVIPSNRFPKYSQGQKHHLLSNEDNGYRYDPIAEMNGAIAVLFTADALLVGRSIDVIGDIFYEKKEKTYYIDTFMSE 315 (400) T ss_pred eceEEEeeCcCCcccCcccccccccCCCCccCCccccccceeEEEEehhheEEEEeeccccccccchhhHHHHHHHHHHh Confidence 999999999999532 11235778999999887654 568899999999999999999 Q ss_pred eeEEEcCCeEEEEEecC---C Q lcl|NC_019506. 259 GCKVIYPDALVCLKKTN---P 276 (276) Q Consensus 259 g~~v~~~~~vv~~~~~~---p 276 (276) |++++|||++++++..= | T Consensus 316 G~g~~RPeaa~vv~~~~~~~~ 336 (400) T protein:vir:10 316 GAIPDRWEAVSVVTTKRQSTG 336 (400) T ss_pred CCcccchhheEEEEecCCccc Confidence 99999999999987432 2 No 48 >protein:vir:95107 Length: 270 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240822;genbank:gi:66394683;genbank:GeneID:5133901 Probab=100.00 E-value=1.6e-36 Score=216.79 Aligned_cols=258 Identities=13% Similarity=0.112 Sum_probs=214.7 Q ss_pred Cccch----hhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCcc-cceeecCCCCCCCccccccceEE Q lcl|NC_019506. 1 MAVTS----FIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGAI-TVKEYTENSDIDAPEELSTTEKV 75 (276) Q Consensus 1 MA~~~----l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~~-~~~d~~~~~~~~~~~~~~~~~~~ 75 (276) ||.+- ++||+|+.++.+++.+.++|.+++..+.+-. +.+|++|+||.|... .+.++.++..++ ++.++.++.. T Consensus 1 Ma~T~~~d~I~Pev~~~~V~e~~~~~~~~~~~~~~d~~L~-g~~G~ti~~P~~~~igdae~~~eg~~i~-~~~lt~~~~~ 78 (270) T protein:vir:95 1 MTQTKKANLINPEVLANVVSAQMQNAIRFTPYAVTDDTLV-GQPGDTITRPKYAYIGAAEDLQEGVAMD-TTQMSMTTTK 78 (270) T ss_pred CCceehhhhcchHHHHHHHHHHHHhHHhhccccccccccC-CCCCCEEEeeeecCCCccccccCCCccc-hhhcccchhe Confidence 99874 5999999999999999999999988776643 568999999999875 477788888876 4789999999 Q ss_pred EEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccCCHHHHHHHHHHHHHHHh Q lcl|NC_019506. 76 LEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSKLKPAATLDKTNIYEELIKVKVKLD 155 (276) Q Consensus 76 ~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~~~l~ 155 (276) .+|.+ +++++.++|++...+..|++.+..+|++..+++++|+++++.++....... .. ..++.|.+|..+|+ T Consensus 79 a~i~~-~gk~~~itD~a~~~~~~dp~~~~~~q~a~~~a~~~d~~li~~l~~a~~~~~---~~----~t~~~~~dA~~~lg 150 (270) T protein:vir:95 79 VTVKE-TGKAVEVTQTAIITNVNGTLQEASRQLAMSLADKVEIDYIAELNKSKQTAT---VS----ADATGILDAIEVFN 150 (270) T ss_pred eeeeh-hhCcceecHHHHhhhccchHHHHHHHHHHHHHHHHHHHHHHHhcccccccc---cc----cCHHHHHHHHHHhc Confidence 99955 689999999999999999999999999999999999999999987654432 11 22567888999997 Q ss_pred hcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEeccccccccceEEEEEecceEEeeee Q lcl|NC_019506. 156 EKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSNNMGSLTNGTGAIAGVKMACTFAEQ 235 (276) Q Consensus 156 ~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp~~~~~~~~~~~~~~a~~~~~~ 235 (276) +.. ....+++|||..+..|+++.. ..... .+++.+++|.|+.++|++|+++++.|. .+.++.++++|+++..+ T Consensus 151 d~~--~~~~~i~vhs~~~~~Lrk~~~-~~~~~-~~~~~~~~G~ig~~~G~~Viv~s~~~~---~~~~~l~~~gAi~~~~~ 223 (270) T protein:vir:95 151 SEN--DEDYVLYVNPKDYNKLVKSLF-KVGGN-VQDRAISKGDLVEIVGVSDIVKSKRVS---ENTAFLQRYGAMEIVNK 223 (270) T ss_pred ccc--CCCcEEEEcHHHHHHHHhhhc-ccccc-cccchhcccccceecceeEEEeCCCCC---ceeEEEEeccceeeeec Confidence 654 345689999999999998753 33322 345678999999999999887776653 34578899999998854 Q ss_pred e-eeeeeccCcccceeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 236 I-VQTEAYRMEKRFADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 236 ~-~~~e~~~~~~~~~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) . ..+|..|++.++.|.+.++.|||+++++|.++|+++. +| T Consensus 224 ~~~~vEtdRd~~~~~d~i~~~~~y~v~~~~~skvv~~t~-~~ 264 (270) T protein:vir:95 224 KKPEAYTDFDILKRTHLLSTNYHYSVNLKDETGVVKVTF-KP 264 (270) T ss_pred CCceeeeccchhhcccEEEeeeEEEEEEEccceEEEEEe-cC Confidence 4 4799999999999999999999999999999999963 56 No 49 >protein:vir:78920 Length: 290 # NCBI annotation: Cps # Family: family:all:701 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468846;genbank:gi:157325479;genbank:GeneID:5601917 Probab=100.00 E-value=2e-35 Score=210.82 Aligned_cols=268 Identities=15% Similarity=0.080 Sum_probs=209.5 Q ss_pred CccchhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCcccceeecCCCCCCCccccccceEEEEEEe Q lcl|NC_019506. 1 MAVTSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGAITVKEYTENSDIDAPEELSTTEKVLEINK 80 (276) Q Consensus 1 MA~~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~ld~ 80 (276) ||.++ .++|++.|++.|.+.+++..+.+++++. .+|++|+||+++..+..||+|++++.. .+++.++.+++|++ T Consensus 1 Main~--a~~~~~~Ld~~~~~~~~t~~l~~~~~~~---~ggktVkI~~i~~~gl~DY~R~~g~~~-g~v~~~~et~tl~q 74 (290) T protein:vir:78 1 MAINY--VDKYGKELDQKLVFGTYTNELETPNLLW---LDAKTFKIQTITTTGLKAHTRNKGYNE-GSASNTNKSYTIDF 74 (290) T ss_pred CchhH--HHHHHHHHHHHHHhhheeeeccccceee---ccCCEEEEeeeccCcccccccCCCccc-CccccceeeEEeec Confidence 99987 5899999999999999999999887754 469999999999999999999998865 57888999999999 Q ss_pred eeecceeec--hHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc-ccccCCHHHHHHHHHHHHHHHhhc Q lcl|NC_019506. 81 QKYFNFQID--DVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSKLK-PAATLDKTNIYEELIKVKVKLDEK 157 (276) Q Consensus 81 ~~~~~~~v~--d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~~~-~~~~~t~~~~~~~i~~a~~~l~~~ 157 (276) .+++.|.|+ |.|+.+....+.....+.+...+++++|.+.++.+.+.+..... ...++|++++|+.|.++..+|++ T Consensus 75 dR~~~F~vD~~DvDEt~~~~~~~nv~~ef~~~~v~PEiDayr~skla~~a~~~~~~~~~t~t~~n~~~~i~~~~~~lde- 153 (290) T protein:vir:78 75 DRDVEFFVDVMDVDETGQALSAANVTKEFNSRHAGPEMDAYRFSKLATAAKTNSNSVAEEITKDNVFTKLKAAIRKVKK- 153 (290) T ss_pred cccceeeccccchhHHhhhhhHHHHHHHHHHHHhhhhhhHHHHHHHHhhhhccCcccccccCHHHHHHHHHHHHHHHHh- Confidence 999999999 77776555555555556666688999999999988766644332 33556899999999999999986 Q ss_pred CCCccCCEEEECHHHHHHHhhhHHhhhhccccc-ccceeeeeeeEEeceEEEEecc---cc-----------cc-ccceE Q lcl|NC_019506. 158 NVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMA-ESITKNGFVGTILGFDVYLSNN---MG-----------SL-TNGTG 221 (276) Q Consensus 158 ~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~-~~~~~~G~i~~~~G~~v~~s~~---lp-----------~~-~~~~~ 221 (276) ||.++|+|+++|.++..|++++.|.+....+. .....+|.|++++||.|++.+. +- .. +..-. T Consensus 154 -vp~~~rvl~vtp~~~~lL~~~~~f~r~~~~~~~~~~~i~~~V~~idG~~ii~vps~~r~~t~~~f~~G~~~~~~ak~in 232 (290) T protein:vir:78 154 -YGTQNLVMYVSPDVMAALELSDDFVRAINVQNIGPSSIETRITAIDGTRIVEVEAEDRFYDTFDFTDGYKPAAGAKKLN 232 (290) T ss_pred -cCCCCeEEEECHHHHHHHhhChhhhccccccccccccccceeeeecCcEEEEecccchhhhhhhhcccccccCCcccee Confidence 89999999999999999999999987654433 2334599999999999998431 11 11 11123 Q ss_pred EEEEecceEEeeeeeeeeeec---cCcccceeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 222 AIAGVKMACTFAEQIVQTEAY---RMEKRFADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 222 ~~~~~~~a~~~~~~~~~~e~~---~~~~~~~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) ++..|++|.........+... .+++..++++..+.++++.|++...-+++..++= T Consensus 233 ~ii~~~~a~i~~~K~~~~~~~~P~~~~~~d~~~~~~r~y~d~~v~~nk~~~i~~~~~~ 290 (290) T protein:vir:78 233 FLLVNKGSVVGGAKHASIYLHAPGSVGQGDGWLYQYRVYHDIFVLDQQKDGVIASTEV 290 (290) T ss_pred EEEEcCCceeeeeeeeEEEeeCCCCCcCcceeeeeeeeeeeeeeeccccCeeEEEeeC Confidence 567888888777555544333 3344567799999999999998776666543333 No 50 >protein:vir:739 Length: 231 # NCBI annotation: major structural protein 4 # Family: family:all:522 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108716;genbank:gi:13487838;genbank:GeneID:920884 Probab=100.00 E-value=9e-36 Score=212.73 Aligned_cols=230 Identities=14% Similarity=0.097 Sum_probs=195.6 Q ss_pred cccccCCcEEEEeccCcccceeecCCCCCCCccccccceEEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHH Q lcl|NC_019506. 35 GEIKAYGDTVKINQIGAITVKEYTENSDIDAPEELSTTEKVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALAD 114 (276) Q Consensus 35 ~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~ 114 (276) ..-.+.||||+||++ -..+.++.+|..++ ++.++.++.+.+|.+ .++++.|+|++...+.+|++.+..+|++.+||+ T Consensus 1 ~~~~~~Gdtit~P~~-iGda~~v~eG~~i~-~~~l~~t~~~atIk~-~gk~~~itD~a~l~~~gDp~~ea~~Q~~~~iA~ 77 (231) T protein:vir:73 1 ENGINLANLCEYPND-IGDAADVAEGGEIS-LDKIGTTTKSVTIKK-AAKGTEITDEAALSGYGDPIGESNKQLGLSLAN 77 (231) T ss_pred CccccCCceEEeccc-ccchhhhcCCCcCC-hhhccccceeeeEee-eccceeeeHHHHhhccCchHHHHHHHHHHHHHH Confidence 112367999999998 44688899999887 488999999999965 699999999999999999999999999999999 Q ss_pred HHHHHHHHHhhccccccccccccCCHHHHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccce Q lcl|NC_019506. 115 ETEKILLKEMDTNATSKLKPAATLDKTNIYEELIKVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESIT 194 (276) Q Consensus 115 ~~d~~~~~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~ 194 (276) ++|.++++.+..++.... +. ..++.|.+|...|++.+ ..+++++|||..++.|+++..+.......+++.+ T Consensus 78 kvD~di~~~~~~a~l~~~---~~----~t~d~i~~A~~~fgde~--~~~~vivv~p~~~~~Lrk~~~~~~~~~~~g~~i~ 148 (231) T protein:vir:73 78 KVDDDLLKAAKTTSQTVS---TK----ANVDGVQAALDIFNDED--AQAYVLIVNPKDAAKIRKDANAKNIGSEVGANAL 148 (231) T ss_pred hhhHHHHHhhcccccccc---cc----ccHHHHHHHHHHhcccc--ccceEEEEcchHHHhhhhccchhhhhhhhcccee Confidence 999999999887665432 12 23788999999998876 4568999999999999998877666556678899 Q ss_pred eeeeeeEEeceEEEEeccccccccceEEEEEecceEEeeee-eeeeeeccCcccceeeEEeeeeeeeEEEcCCeEEEEEe Q lcl|NC_019506. 195 KNGFVGTILGFDVYLSNNMGSLTNGTGAIAGVKMACTFAEQ-IVQTEAYRMEKRFADAVKGLNVFGCKVIYPDALVCLKK 273 (276) Q Consensus 195 ~~G~i~~~~G~~v~~s~~lp~~~~~~~~~~~~~~a~~~~~~-~~~~e~~~~~~~~~~~i~~~~~yg~~v~~~~~vv~~~~ 273 (276) ++|.||+++|++|+.|+++|..+.....+...++|+++..+ ...+|..|+++.++|.+++++||++++.+|+++|+++- T Consensus 149 ~~G~iG~i~G~~Vi~S~~~~~~~~~~~~~i~~~gAl~~~~k~~~~vEtdRd~~~k~~~i~~~~~y~v~l~~~~~vv~~t~ 228 (231) T protein:vir:73 149 INGTYADVLGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRDIVTKTTVITADEHYAAYLYDLTKVVNITF 228 (231) T ss_pred eecccceEcceEEEEcCCCCCCceeeeeEEeeccceeeeecccceeeccccccccccEEEEeEEEEEEEEcCccEEEEEe Confidence 99999999999999999999755544456678999998854 44899999999999999999999999999999999974 Q ss_pred cCC Q lcl|NC_019506. 274 TNP 276 (276) Q Consensus 274 ~~p 276 (276) +=- T Consensus 229 ~g~ 231 (231) T protein:vir:73 229 TGV 231 (231) T ss_pred ecC Confidence 444 No 51 >protein:vir:105464 Length: 346 # NCBI annotation: putative phage major capsid protein # Family: family:all:701 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529874;genbank:gi:90592614;genbank:GeneID:3974528 Probab=100.00 E-value=6.6e-33 Score=197.00 Aligned_cols=270 Identities=12% Similarity=0.132 Sum_probs=201.8 Q ss_pred CccchhhHHHHHHHHHHHHHHhhcchh-hhcccccccc-ccCCcEEEEeccC-cccceeecCCCCCCCccccccceEEEE Q lcl|NC_019506. 1 MAVTSFIPKLWSARLLAHLDKAHVVAN-LVNRDYEGEI-KAYGDTVKINQIG-AITVKEYTENSDIDAPEELSTTEKVLE 77 (276) Q Consensus 1 MA~~~l~~e~~~~~~~~~l~~~~v~~~-~~~~~~~~~~-~~~Gdtv~ip~~~-~~~~~d~~~~~~~~~~~~~~~~~~~~~ 77 (276) ||.++ .++|++.+++.|...++... +.+....+.. ..+|++|+||++. ..+..||+|++++....+++.++.+++ T Consensus 1 Mainy--a~~~~~~Ld~~~~~~~lts~~l~~~~~~~~v~~~ggktVkIp~is~tsGl~DY~R~~g~~~~g~v~~~~et~t 78 (346) T protein:vir:10 1 MTINY--AEKYQAAVQQAFYDGHLYSAELWNSPSNSIIKFDGAKHIKVPRLEITSGRKDRQRRTITTPVANYSNDWDSYE 78 (346) T ss_pred Ccchh--HHHHHHHHHHHHHhhhccchhhcccccccceEecCCCEEEEEEeeeecccccccccCCcccccccccceeEEE Confidence 99987 68999999999987765533 3322222221 2468999999996 468999999998865578899999999 Q ss_pred EEeeeecceeec--hHHHHh---hhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc---ccccccCCHHHHHHHHHH Q lcl|NC_019506. 78 INKQKYFNFQID--DVDAAQ---IRTPLMDAAMQRAAYALADETEKILLKEMDTNATSK---LKPAATLDKTNIYEELIK 149 (276) Q Consensus 78 ld~~~~~~~~v~--d~d~~~---~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~---~~~~~~~t~~~~~~~i~~ 149 (276) |++.+++.|.|+ |.|+.. ++.+++.++.+. .++.++|.+.++.+.+.+... .....+.|.+++|+.|.+ T Consensus 79 l~qDR~~~F~vD~mDvDETn~~~~~anv~~ef~r~---~vvPEiDayrfskLa~~a~~~~~~~~~~~a~T~~ni~~~i~~ 155 (346) T protein:vir:10 79 LKNERYWSTLVDPSDIDETNMVVSLANITKQFNLD---SKMPEKDRYMFSHLYSGKEAAHDGGITTNTLDEKNILPAFDN 155 (346) T ss_pred eeccccceecccccchHHHHHHhHHHHHHHHHHHH---hhcchhhHHHHHHHHHhhhhhccccccccccCHHHHHHHHHH Confidence 999999999999 555543 345555555555 567799999998887654332 223455789999999999 Q ss_pred HHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEE--ecccccc---c------- Q lcl|NC_019506. 150 VKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYL--SNNMGSL---T------- 217 (276) Q Consensus 150 a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~--s~~lp~~---~------- 217 (276) +...|++++||.++|+|+|+|+++..|++++.|.+....+.... .+|.|++++||.|++ |+++++. + T Consensus 156 ~~~~lde~~vp~~~rvl~vTp~~~~lLk~s~~f~k~~~v~~~~~-i~~~V~siDGv~Ii~VPs~r~~t~~~f~~G~~~~t 234 (346) T protein:vir:10 156 MMLDFDEARIPSTNRILYVTPKTNAILKRAEAMNRALTLKDPNN-IQRTVYSLDDVTIRVVPSDLMQTAYDFSDGSKIID 234 (346) T ss_pred HHHHHHHccCCCCCeEEEECHHHHHHHhhchhheeccccccccc-cceeeeeecCeEEEEcchhhcccchhhccCccccC Confidence 99999999999999999999999999999999987666554444 599999999999987 4555421 1 Q ss_pred --cceEEEEEecceEEeeeeeeeeeeccC-cccce-eeEEeeeeeeeEEEcC--CeEEEEEecCC Q lcl|NC_019506. 218 --NGTGAIAGVKMACTFAEQIVQTEAYRM-EKRFA-DAVKGLNVFGCKVIYP--DALVCLKKTNP 276 (276) Q Consensus 218 --~~~~~~~~~~~a~~~~~~~~~~e~~~~-~~~~~-~~i~~~~~yg~~v~~~--~~vv~~~~~~p 276 (276) ..--++..|++|.........+....+ ++..+ .++..+.++++.|++. ++|.+--.++| T Consensus 235 ~ak~INfiiv~~~A~ia~~K~~~~~if~P~~~~~g~~l~~~R~Y~D~fv~~nk~~~Iyv~~~~a~ 299 (346) T protein:vir:10 235 TAKQIEMFLIYNGVQIAPEKYSFVGFDQPSAATSGNYLYYEQSYDDVLLLNTKTKGIQFVVSDKP 299 (346) T ss_pred CccceeEEEECCceeeeeeeeeeeEeeCCCCCcccceeeeeeeeeeeeeeccccceEEEeeeccc Confidence 112356778988877765555544433 33344 4899999999999974 45555556778 No 52 >protein:vir:102335 Length: 312 # NCBI annotation: putative capsid protein # Family: family:all:701 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529560;genbank:gi:90592716;genbank:GeneID:3974467 Probab=100.00 E-value=1.2e-32 Score=195.66 Aligned_cols=272 Identities=14% Similarity=0.048 Sum_probs=200.8 Q ss_pred CccchhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCcccceeecCCCC--CCCccccccceEEEEE Q lcl|NC_019506. 1 MAVTSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGAITVKEYTENSD--IDAPEELSTTEKVLEI 78 (276) Q Consensus 1 MA~~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~--~~~~~~~~~~~~~~~l 78 (276) |||++-+.++|+++|++.+...+++..+...+-.-++ .+|++|+||++...+..||+|+++ +. ..+++.++.+++| T Consensus 1 Mantl~ya~~~~~~LD~~~~~~~~s~~l~~~~~~v~~-~ggktVkIp~i~~~gl~DY~R~~g~~~~-~g~v~~~~et~tl 78 (312) T protein:vir:10 1 MANTLAYGQVLQQGLDKQATQELLTGWMDSNAKQIKY-EGGKEVKIGKLSTDGLGDYSRGSANAYV-GGDVKFEYETKTM 78 (312) T ss_pred CCcchhHHHHHHHHHHHHHHhhhccccccCCCceEEE-ecCcEEEEEeeecccccccccccCCccc-cccccccceeEEe Confidence 9999988999999999999999988877533222223 678999999999999999999877 43 2478999999999 Q ss_pred Eeeeecceeec--hHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc-----cccccCCHHHHHHHHHHHH Q lcl|NC_019506. 79 NKQKYFNFQID--DVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSKL-----KPAATLDKTNIYEELIKVK 151 (276) Q Consensus 79 d~~~~~~~~v~--d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~~-----~~~~~~t~~~~~~~i~~a~ 151 (276) ++.+++.|.|+ |.|+.........-..+.+...++.++|.+.++.+.+.+.... +...++|.+++|+.|.++. T Consensus 79 ~qDR~~~F~vD~mDvDETn~~~s~anv~~ef~r~~vvPEiDayrfskla~~a~~~~~~~~~~~~~~~T~~ni~~~i~~~~ 158 (312) T protein:vir:10 79 TQDRGRKFTLDAMDVDETNFLVTATTVMGEFQRLKVIPEIDAYRLSRLATIAIGIKGDTNVEYSYSVNSSTIINKIKTGI 158 (312) T ss_pred eecccceeeccccchhhHhhHHHHHHHHHHHHHhhhcchhhHHHHHHHHhhhhccccccccccccccCHHHHHHHHHHHH Confidence 99999999999 6666543333333334444557788999999998886654332 2345579999999999999 Q ss_pred HHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEe--ccccc---cccc------- Q lcl|NC_019506. 152 VKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLS--NNMGS---LTNG------- 219 (276) Q Consensus 152 ~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s--~~lp~---~~~~------- 219 (276) ..|++++|| ++|+|+|+|.++..|.++..+....... .....+|+|++++|++|++. +++.+ .+.| T Consensus 159 ~~lde~~vp-~~rvl~vTp~~~~lLk~~~~~~~~~~~~-~~~~i~~~V~~iDgv~Ii~VPs~r~~t~~~f~dG~t~~~~~ 236 (312) T protein:vir:10 159 KIIRENGYN-GPLVCHLTYDSMFAIEEKVLEKLTAVTF-AQGGIQTQVPSIDGCALIKTPQNRMYSSILLNDGTTSNQTA 236 (312) T ss_pred HHHHHccCC-CceEEEeChHHHHHHhhhhhceeccccc-ccceeeeeeeeecccEEEEchhhhccceeeeccCccccccc Confidence 999999999 6999999999998777654333333222 33456999999999999973 22311 0101 Q ss_pred -----------eEEEEEecceEEeeeeeeeee---eccCcccceeeEEeeeeeeeEEEcC--CeE-EEEEecCC Q lcl|NC_019506. 220 -----------TGAIAGVKMACTFAEQIVQTE---AYRMEKRFADAVKGLNVFGCKVIYP--DAL-VCLKKTNP 276 (276) Q Consensus 220 -----------~~~~~~~~~a~~~~~~~~~~e---~~~~~~~~~~~i~~~~~yg~~v~~~--~~v-v~~~~~~p 276 (276) --++..|++|.........+. ..-++...+.++..+.++++.|++. ++| +-++.+-| T Consensus 237 gg~~~~~~ak~INfiiv~~~a~i~~~K~~~~~if~P~~~~~~d~~~~~~R~Y~D~fv~~nk~~~Iyv~~k~a~~ 310 (312) T protein:vir:10 237 GGYLKGTKALDTNFIIAPVDVPLAITKQDKMRIFDPETNQTANAWSMDYRRYHDLWVTDNKANSVYANFKDAKP 310 (312) T ss_pred CceeecCcccccceEEeCCceeeceeeeeeeeeeCCCCCCCcceeeeeeeeeeeeeeeccccCeEEEEeecccC Confidence 125667888776665444443 3344566678999999999999974 455 44778888 No 53 >protein:vir:79712 Length: 285 # NCBI annotation: major capsid protein gp34 # Family: family:all:701 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285883;genbank:gi:148750840;genbank:GeneID:5220414 Probab=99.95 E-value=4.6e-30 Score=181.41 Aligned_cols=267 Identities=13% Similarity=0.124 Sum_probs=198.4 Q ss_pred CccchhhHHHHHHHHHHHHHHhhcchhhhcccccccc-ccCCcEEEEeccC-cccceeecCCCCCCCccccccceEEEEE Q lcl|NC_019506. 1 MAVTSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEI-KAYGDTVKINQIG-AITVKEYTENSDIDAPEELSTTEKVLEI 78 (276) Q Consensus 1 MA~~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~-~~~Gdtv~ip~~~-~~~~~d~~~~~~~~~~~~~~~~~~~~~l 78 (276) ||+++ .++|.+.+.+.+...+.+..+.+....... ..+|++|+||++. ..+..||.|+.++.. .+++.++.+++| T Consensus 1 Main~--~~k~~~~ld~~~~~~~~~~~l~~~~n~~~~~~~gak~VkIp~ist~~gl~dY~R~~g~~~-g~v~~~~et~tl 77 (285) T protein:vir:79 1 MTVVL--DSKDLARIDEEYKADSQVWSYLTGGNGVTQRFRGHNEVRINKLSGFVDATAYKRGQDNAR-KTISVGKETVKL 77 (285) T ss_pred Ccchh--hHHHHHHHHHHHHHhhhhhhhcccCCcceeEecCCCEEEEeeecccccccccccccCccc-cccceeeeEEEe Confidence 99986 789999999999988888777654322222 2568999999996 468999999998754 789999999999 Q ss_pred EeeeecceeechHHHHh----hhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccCCHHHHHHHHHHHHHHH Q lcl|NC_019506. 79 NKQKYFNFQIDDVDAAQ----IRTPLMDAAMQRAAYALADETEKILLKEMDTNATSKLKPAATLDKTNIYEELIKVKVKL 154 (276) Q Consensus 79 d~~~~~~~~v~d~d~~~----~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~~~l 154 (276) ++++++.|.|+..|..+ ++.+.++++.+. .++.++|.+.++.+.+.+.... ..++|.+++|++|.++..+| T Consensus 78 ~~DR~~~f~iD~mDvdEn~~~~~~ni~~ef~~~---~vvPEiDayrfskla~~a~~~~--~~~~T~~nv~~~i~~~~~~l 152 (285) T protein:vir:79 78 THEDWFGYDLDQFDMDENGAYTVENVVREHNKM---ITIPHRDKVAVQKLFDSAAKKA--TDSITKDNALDAYDTAEAYM 152 (285) T ss_pred eccccceecccccchhhhhhhhHHHHHHHHHhh---hhcchhhHHHHHHHHhhccccc--ccccCHHHHHHHHHHHHHHH Confidence 99999999999555443 334555555554 5678999999999987765443 34578999999999999999 Q ss_pred hhcCCCccCCEEEECHHHHHHHhhhHHhhhhccccccc--ceeeeeeeEEec-eEEEE--eccccccc--cceEEEEEec Q lcl|NC_019506. 155 DEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAES--ITKNGFVGTILG-FDVYL--SNNMGSLT--NGTGAIAGVK 227 (276) Q Consensus 155 ~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~--~~~~G~i~~~~G-~~v~~--s~~lp~~~--~~~~~~~~~~ 227 (276) ++.+|| ++|+|+++|.++..|++++.|.+......+. .-.+++|+.++| ++|++ |.++.+.+ ..-.++..|+ T Consensus 153 de~~vp-~~rvl~vTp~~~~~Lk~s~~~~r~~~~~~~~~~~~i~~~V~~lDg~v~ii~Vps~r~kt~~~~k~Infiiv~~ 231 (285) T protein:vir:79 153 FDNEVP-GGFVMFVSSAYYTALKQSAAVTRTFSTDGTMVINGIDRRVAQLDGGVPIVRVSSDRLKGLGITNHVNFILTPL 231 (285) T ss_pred HHcCCC-CceEEEEChHHHHHHHhhhhhheecccccceeccceeeeeccccceeEEEEcchhhccCcCcchhccEEEecC Confidence 999999 6999999999999999999888754332211 124678999999 89987 34554322 2234677889 Q ss_pred ceEEeeeeeeeeeec---cCcccceeeEEeeeeeeeEEEcCCeEEE--EEecCC Q lcl|NC_019506. 228 MACTFAEQIVQTEAY---RMEKRFADAVKGLNVFGCKVIYPDALVC--LKKTNP 276 (276) Q Consensus 228 ~a~~~~~~~~~~e~~---~~~~~~~~~i~~~~~yg~~v~~~~~vv~--~~~~~p 276 (276) +|......-..+.-. -++...+.++..+.++++.|++...-++ -..++= T Consensus 232 ~a~i~~~K~~~~~~f~P~~~~~~d~~~~~~R~Y~d~fv~~nk~~~Iy~~~~a~~ 285 (285) T protein:vir:79 232 SAIAPIVKYDSVSVIDPSTDRSGNRWTIKGLSYYDAIVLDNAKKGIYVAATAGV 285 (285) T ss_pred ceeccceeeeeeEeECCCCCCCcceeeeeeeeeeeeeehhhccceeeeeecccC Confidence 887655444444333 3346667899999999999997554444 322222 No 54 >protein:vir:100939 Length: 430 # NCBI annotation: Gp5 # Family: family:all:1412 # MgeID: mge:1509 # MgeName: ST104 # Cross-refs: genbank:acc:YP_006408;genbank:gi:46358700;genbank:GeneID:2777089 Probab=99.95 E-value=3.1e-30 Score=182.33 Aligned_cols=270 Identities=15% Similarity=0.077 Sum_probs=185.7 Q ss_pred CccchhhH-HHHHHHHHHHHHHhhcchhh--hccccccccccCCcEEEEeccCcccceeecCCCCC-CCccccccceEEE Q lcl|NC_019506. 1 MAVTSFIP-KLWSARLLAHLDKAHVVANL--VNRDYEGEIKAYGDTVKINQIGAITVKEYTENSDI-DAPEELSTTEKVL 76 (276) Q Consensus 1 MA~~~l~~-e~~~~~~~~~l~~~~v~~~~--~~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~-~~~~~~~~~~~~~ 76 (276) |||+...- +++.+++++.|+..++|..+ ++++|+.++.+.||||.+|.+......+ |..+ ..++++.+.++++ T Consensus 1 MAn~l~~~~~ii~~eal~~l~n~~v~a~~~~~~r~~d~~~~r~Gdti~~p~~~~~~~~~---G~~~t~~~~~i~e~~v~~ 77 (430) T protein:vir:10 1 MALNEGQIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQE---GWDLTDKATGLLELNVAV 77 (430) T ss_pred CccchhhHHHHHHHHHHHHHhhhhhhhhhhcccCCchhhhhcccceEEecccccccccc---CcccCCCCCccccceEEE Confidence 99998764 89999999999999999997 5689999988999999999987766554 3222 1234677889999 Q ss_pred EEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc--ccccCCHHHHHHHHHHHHHHH Q lcl|NC_019506. 77 EINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSKLK--PAATLDKTNIYEELIKVKVKL 154 (276) Q Consensus 77 ~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~~~--~~~~~t~~~~~~~i~~a~~~l 154 (276) +++++++..+++++.|+ ...+..++++++++++||..+|.++++.+......+.+ .++...+.+.++++..+.+.| T Consensus 78 ~v~~~k~V~~~~~~kel--~~~~~~~~~i~~Am~~LA~~Vd~dl~~~~~~~~~~v~~~~~~t~~~~~~~~~~~A~a~~~L 155 (430) T protein:vir:10 78 NMGEPDNDFFQLRADDL--RDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVADAEELM 155 (430) T ss_pred EEeeeccceEEechhHh--cChhHHHHHhHHHHHHHHHHHHHHHHHHhhhcccccccccccCCCcCCcchhhHHHHHHHH Confidence 99999999999999884 56677789999999999999999999998766544432 233444445578899999999 Q ss_pred hhcCCCcc-CCEEEECHHHHHHHhhh-HHhhhhcccccccceeeeeeeE-EeceEE-EEeccccccccceEEEEEecceE Q lcl|NC_019506. 155 DEKNVPTI-GRFLIIPPDVHGLLLAA-DLIVGTGGAMAESITKNGFVGT-ILGFDV-YLSNNMGSLTNGTGAIAGVKMAC 230 (276) Q Consensus 155 ~~~~vP~~-~r~~vv~p~~~~~L~~~-~~~~~~~~~~~~~~~~~G~i~~-~~G~~v-~~s~~lp~~~~~~~~~~~~~~a~ 230 (276) ++.+||.+ +|.+|++|..+..|... ..+..++. ...+.+++|.|++ +.||++ +.++.+|.++++......+.+|. T Consensus 156 ~~~~vP~~~~R~~vldp~~~~~l~~~l~~l~~~~~-~~~~A~r~g~i~~~~~Gfd~~~~~~~~~~~t~g~~t~~tv~gA~ 234 (430) T protein:vir:10 156 FSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGR-IPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGITVSGAQ 234 (430) T ss_pred HHhcCCCCCCcEEEeChHHHHHHHhhhcccccccc-chhHHHhhccccccchhhhhhhhcCCcccccCccCcCceecccc Confidence 99999996 89999999999998753 22222222 2556799999997 999975 78999998887766544444443 Q ss_pred Ee---eeeee--------------eeeeccCcccceeeEEeeeeeeeEEE------cCCeEEEEEe-------------- Q lcl|NC_019506. 231 TF---AEQIV--------------QTEAYRMEKRFADAVKGLNVFGCKVI------YPDALVCLKK-------------- 273 (276) Q Consensus 231 ~~---~~~~~--------------~~e~~~~~~~~~~~i~~~~~yg~~v~------~~~~vv~~~~-------------- 273 (276) .+ +..++ .....-.--..||.+..--+|+.-.+ ++...+|... T Consensus 235 ~~~~~~~~v~~~g~~~~~d~~~~tit~s~tg~l~~GD~ftiaGV~~v~~~tkq~~~~l~~F~Vt~~~~atsv~I~paii~ 314 (430) T protein:vir:10 235 SFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVDGTHVEITPKPVA 314 (430) T ss_pred ccccccceecccccccccccccceeeeecccceecccEEEecceeeeccccccccCCccEEEEEEecCCceeEEeccccc Confidence 21 01000 00001111222333333332222222 1222222111 Q ss_pred -----------------cCC Q lcl|NC_019506. 274 -----------------TNP 276 (276) Q Consensus 274 -----------------~~p 276 (276) ++| T Consensus 315 ~~~~~~~~~~~~y~nVsasp 334 (430) T protein:vir:10 315 LDDVSLSPEQRAYANVNTSL 334 (430) T ss_pred cccccccccccccceecccc Confidence 111 No 55 >protein:vir:9265 Length: 430 # NCBI annotation: 5 # Family: family:all:1412 # MgeID: mge:164 # MgeName: ST64T # Cross-refs: genbank:acc:NP_720329;genbank:gi:24371587;genbank:GeneID:955820 Probab=99.95 E-value=3.1e-30 Score=182.33 Aligned_cols=270 Identities=15% Similarity=0.077 Sum_probs=185.7 Q ss_pred CccchhhH-HHHHHHHHHHHHHhhcchhh--hccccccccccCCcEEEEeccCcccceeecCCCCC-CCccccccceEEE Q lcl|NC_019506. 1 MAVTSFIP-KLWSARLLAHLDKAHVVANL--VNRDYEGEIKAYGDTVKINQIGAITVKEYTENSDI-DAPEELSTTEKVL 76 (276) Q Consensus 1 MA~~~l~~-e~~~~~~~~~l~~~~v~~~~--~~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~-~~~~~~~~~~~~~ 76 (276) |||+...- +++.+++++.|+..++|..+ ++++|+.++.+.||||.+|.+......+ |..+ ..++++.+.++++ T Consensus 1 MAn~l~~~~~ii~~eal~~l~n~~v~a~~~~~~r~~d~~~~r~Gdti~~p~~~~~~~~~---G~~~t~~~~~i~e~~v~~ 77 (430) T protein:vir:92 1 MALNEGQIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQE---GWDLTDKATGLLELNVAV 77 (430) T ss_pred CccchhhHHHHHHHHHHHHHhhhhhhhhhhcccCCchhhhhcccceEEecccccccccc---CcccCCCCCccccceEEE Confidence 99998764 89999999999999999997 5689999988999999999987766554 3222 1234677889999 Q ss_pred EEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc--ccccCCHHHHHHHHHHHHHHH Q lcl|NC_019506. 77 EINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSKLK--PAATLDKTNIYEELIKVKVKL 154 (276) Q Consensus 77 ~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~~~--~~~~~t~~~~~~~i~~a~~~l 154 (276) +++++++..+++++.|+ ...+..++++++++++||..+|.++++.+......+.+ .++...+.+.++++..+.+.| T Consensus 78 ~v~~~k~V~~~~~~kel--~~~~~~~~~i~~Am~~LA~~Vd~dl~~~~~~~~~~v~~~~~~t~~~~~~~~~~~A~a~~~L 155 (430) T protein:vir:92 78 NMGEPDNDFFQLRADDL--RDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVADAEELM 155 (430) T ss_pred EEeeeccceEEechhHh--cChhHHHHHhHHHHHHHHHHHHHHHHHHhhhcccccccccccCCCcCCcchhhHHHHHHHH Confidence 99999999999999884 56677789999999999999999999998766544432 233444445578899999999 Q ss_pred hhcCCCcc-CCEEEECHHHHHHHhhh-HHhhhhcccccccceeeeeeeE-EeceEE-EEeccccccccceEEEEEecceE Q lcl|NC_019506. 155 DEKNVPTI-GRFLIIPPDVHGLLLAA-DLIVGTGGAMAESITKNGFVGT-ILGFDV-YLSNNMGSLTNGTGAIAGVKMAC 230 (276) Q Consensus 155 ~~~~vP~~-~r~~vv~p~~~~~L~~~-~~~~~~~~~~~~~~~~~G~i~~-~~G~~v-~~s~~lp~~~~~~~~~~~~~~a~ 230 (276) ++.+||.+ +|.+|++|..+..|... ..+..++. ...+.+++|.|++ +.||++ +.++.+|.++++......+.+|. T Consensus 156 ~~~~vP~~~~R~~vldp~~~~~l~~~l~~l~~~~~-~~~~A~r~g~i~~~~~Gfd~~~~~~~~~~~t~g~~t~~tv~gA~ 234 (430) T protein:vir:92 156 FSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGR-IPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGITVSGAQ 234 (430) T ss_pred HHhcCCCCCCcEEEeChHHHHHHHhhhcccccccc-chhHHHhhccccccchhhhhhhhcCCcccccCccCcCceecccc Confidence 99999996 89999999999998753 22222222 2556799999997 999975 78999998887766544444443 Q ss_pred Ee---eeeee--------------eeeeccCcccceeeEEeeeeeeeEEE------cCCeEEEEEe-------------- Q lcl|NC_019506. 231 TF---AEQIV--------------QTEAYRMEKRFADAVKGLNVFGCKVI------YPDALVCLKK-------------- 273 (276) Q Consensus 231 ~~---~~~~~--------------~~e~~~~~~~~~~~i~~~~~yg~~v~------~~~~vv~~~~-------------- 273 (276) .+ +..++ .....-.--..||.+..--+|+.-.+ ++...+|... T Consensus 235 ~~~~~~~~v~~~g~~~~~d~~~~tit~s~tg~l~~GD~ftiaGV~~v~~~tkq~~~~l~~F~Vt~~~~atsv~I~paii~ 314 (430) T protein:vir:92 235 SFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVDGTHVEITPKPVA 314 (430) T ss_pred ccccccceecccccccccccccceeeeecccceecccEEEecceeeeccccccccCCccEEEEEEecCCceeEEeccccc Confidence 21 01000 00001111222333333332222222 1222222111 Q ss_pred -----------------cCC Q lcl|NC_019506. 274 -----------------TNP 276 (276) Q Consensus 274 -----------------~~p 276 (276) ++| T Consensus 315 ~~~~~~~~~~~~y~nVsasp 334 (430) T protein:vir:92 315 LDDVSLSPEQRAYANVNTSL 334 (430) T ss_pred cccccccccccccceecccc Confidence 111 No 56 >protein:vir:99523 Length: 311 # NCBI annotation: putative protein # Family: family:all:701 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958538;genbank:gi:41179320;genbank:GeneID:2717161 Probab=99.94 E-value=2.9e-28 Score=171.56 Aligned_cols=266 Identities=14% Similarity=0.092 Sum_probs=196.7 Q ss_pred Cc---cch--hhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCcccceeecCCCCCCCccccccceEE Q lcl|NC_019506. 1 MA---VTS--FIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGAITVKEYTENSDIDAPEELSTTEKV 75 (276) Q Consensus 1 MA---~~~--l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~ 75 (276) |+ |++ -+.++|.+.|++.|...++...+.+.++. +-.+|++|+||++...+..||+|++++. ..+++.++.+ T Consensus 1 ~~~~an~mAlnya~~~~~~Ld~~~~~~~~t~~l~~~~~~--~~~Gak~VkIp~i~~~gl~dY~R~~g~~-~g~v~~~~et 77 (311) T protein:vir:99 1 MPTDAETRGFNYVTKDGNLLDQKITAGLFTAALGTPEVD--LVNGGRSFTLKTISTSGLKDHTRGKGFN-SGTISDEKTI 77 (311) T ss_pred CCCcchhhHHHHHHHHHHHHHHHHHhhhcccceecCchh--eeecCCEEEEEeeeeccccccccccCcc-ccceeeeeeE Confidence 32 332 23899999999999999988888876654 4467999999999999999999999875 4789999999 Q ss_pred EEEEeeeecceeechHHHHh-----hhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc-------------cccccc Q lcl|NC_019506. 76 LEINKQKYFNFQIDDVDAAQ-----IRTPLMDAAMQRAAYALADETEKILLKEMDTNATSK-------------LKPAAT 137 (276) Q Consensus 76 ~~ld~~~~~~~~v~d~d~~~-----~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~-------------~~~~~~ 137 (276) ++|++++++.|.|+-.|..+ +..+++.++.+. ....++|.+.++.+.+.+... ...... T Consensus 78 ~tl~~DR~~~f~vD~mDvdETn~~~~~ani~~~f~r~---~vvPEiDayrfskla~~a~~~~~~~~~~~~~~~~~~~~~~ 154 (311) T protein:vir:99 78 YTMGQDRDVEFYLDRQDVDETDNELAMANISNVFITE---HVQPELDSYRFSKIATSFDNLDGTDTEGTLLAKTHKTEET 154 (311) T ss_pred EEeeeccceeeecchhchhhhhhhhHHHHHHHHHHHh---hhcchhhHHHHHHHHhhhhcccccccchhhhccccccccc Confidence 99999999999999444333 345666666665 566799999998887554322 122345 Q ss_pred CCHHHHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccc--cccceeeeeeeEEeceEEEEe---cc Q lcl|NC_019506. 138 LDKTNIYEELIKVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAM--AESITKNGFVGTILGFDVYLS---NN 212 (276) Q Consensus 138 ~t~~~~~~~i~~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~--~~~~~~~G~i~~~~G~~v~~s---~~ 212 (276) ++.+++++.|..+...|++ ||.++|+|+++|..+..|..++.|.+.-... +++ -.++.|+.++|++|++. ++ T Consensus 155 lt~~nvl~~l~~~~~~~~~--v~~~~rvl~vTp~~~~lLk~~~~~~r~~~~~~~~~~-~i~~~V~~lDgv~Ii~V~ps~r 231 (311) T protein:vir:99 155 LDETNAYSQLKTGIGKVRK--YGTQNLVGYVSSEVMDALERSKEFTRNITNQNVGTT-ALESRITSIDGVQLIEVYESNR 231 (311) T ss_pred cCHHHHHHHHHHHHHHHHh--cCCCCeEEEEChHHHHHHhhchhhheeeeccccccc-ccccccceecCeEEEEecCchh Confidence 7899999999999999986 7889999999999999998888887643322 223 35888999999998864 33 Q ss_pred cccc---ccc---------eEEEEEecceEEeeeeeeeee---eccCcccceeeEEeeeeeeeEEEcC--CeEEEEEecC Q lcl|NC_019506. 213 MGSL---TNG---------TGAIAGVKMACTFAEQIVQTE---AYRMEKRFADAVKGLNVFGCKVIYP--DALVCLKKTN 275 (276) Q Consensus 213 lp~~---~~~---------~~~~~~~~~a~~~~~~~~~~e---~~~~~~~~~~~i~~~~~yg~~v~~~--~~vv~~~~~~ 275 (276) +.+. +.| --++..|++|.........+. ..-++...+.++..+.++++.|++. ++|.+=..+| T Consensus 232 ~~t~~~ft~G~~~~~~ak~INfiiv~~~a~i~~~K~~~v~~f~P~~~~~gd~~l~~~R~Y~D~fv~~nk~~~Iyv~~k~A 311 (311) T protein:vir:99 232 FMTKYDFTDGAKPTEDAKAINFLVVAKPAVISIVKENAVFLFAPGQHTDGDGYLYQNRLYHDLFIKKHKRDGIFVSVKKA 311 (311) T ss_pred hcchhhhcCCccccCcccccceEEeCCCeeeeeeeeeeeeeeCCCCCCCcceeeeeeeeeeeeeeeccccCeEEEeeecC Confidence 3311 111 235677888877665444443 3334556688999999999999974 4554444555 No 57 >protein:vir:1781 Length: 221 # NCBI annotation: minor capsid protein # Family: family:all:975 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570347;genbank:gi:18640506;genbank:GeneID:932719 Probab=99.94 E-value=7.9e-30 Score=180.15 Aligned_cols=186 Identities=16% Similarity=0.155 Sum_probs=146.4 Q ss_pred EEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccc-------------cccccccCCHHHHH Q lcl|NC_019506. 78 INKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATS-------------KLKPAATLDKTNIY 144 (276) Q Consensus 78 ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~-------------~~~~~~~~t~~~~~ 144 (276) ||......+.|+|.|+.++++|++.++.++++++||+.+|+.++..+..++.. ...++.++++..++ T Consensus 1 iD~lL~a~~~VdDiD~aqa~~dvr~e~t~e~G~ALA~~~D~~i~~~~~~aA~~~~p~~~~~~g~~~~~~a~~t~~~~~l~ 80 (221) T protein:vir:17 1 MDDLLVASQFVYDLDEILAQWNTRSEISKQIGEALAIHYDERIARVLASASIAAAPVTGQDGGFSVNIGAGNTNNAQAIV 80 (221) T ss_pred CCcchhHHHHHHhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCcccccccCcceeccccccCCHHHHH Confidence 99999999999999999999999999999999999999999999887654321 12234567788899 Q ss_pred HHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhh--hHHhhhhcccccccceeeee-eeEEeceEEEEeccccccccceE Q lcl|NC_019506. 145 EELIKVKVKLDEKNVPTIGRFLIIPPDVHGLLLA--ADLIVGTGGAMAESITKNGF-VGTILGFDVYLSNNMGSLTNGTG 221 (276) Q Consensus 145 ~~i~~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~--~~~~~~~~~~~~~~~~~~G~-i~~~~G~~v~~s~~lp~~~~~~~ 221 (276) ++|.+|+.+|++++||.++||+|++|++|+.|++ ++.+.+.+..++++.+++|. |++++||+||+|+++|..+++. T Consensus 81 dai~~a~~~LdekdVP~~gR~~vv~P~~y~~LL~~~d~~~~n~d~~~s~g~~~~g~~i~~v~G~~V~~SnnlP~~~gt~- 159 (221) T protein:vir:17 81 DGFFEAAAVLDERSAPMDGRVAVLSPRQYYSLISSVDTNILNREIGNTQGDMNTGKGLYVNAGIRIYKSNVLASLYGTN- 159 (221) T ss_pred HHHHHHHHHHhhcCCCCCCCEEEeCcHHHHHHHHhcCcceeeeecccccccccccceeeeecCcEEEEeccCCcccccc- Confidence 9999999999999999999999999998888886 46677777766777788884 9999999999999999754432 Q ss_pred EEEEecceEEeeeeeeeeeeccCcccceeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 222 AIAGVKMACTFAEQIVQTEAYRMEKRFADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 222 ~~~~~~~a~~~~~~~~~~e~~~~~~~~~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) .+.++..++.+....+.+|.. |++.+ +.+..|+++++++.-.| T Consensus 160 ---~~~~ag~~~~~~~~~~~yr~~--fs~~~-------glv~~~~Avgtvkl~~~ 202 (221) T protein:vir:17 160 ---LVTDPGDATTSGENNGSYRPA--ITDRA-------GLVFHKEAADTVEVLLP 202 (221) T ss_pred ---cccCCcccccccccccccccc--ccceE-------EEEEcchheeeeeeecC Confidence 234444444444444444333 22222 45778999999998888 No 58 >protein:vir:2106 Length: 430 # NCBI annotation: coat protein # Family: family:all:1412 # MgeID: mge:46 # MgeName: P22 # Cross-refs: genbank:acc:NP_059630;genbank:gi:9635538;genbank:GeneID:1262831 Probab=99.94 E-value=3.1e-29 Score=176.90 Aligned_cols=270 Identities=14% Similarity=0.077 Sum_probs=180.6 Q ss_pred Cccch--hhHHHHHHHHHHHHHHhhcchhh--hccccccccccCCcEEEEeccCcccceeecCCCCCCCccccccceEEE Q lcl|NC_019506. 1 MAVTS--FIPKLWSARLLAHLDKAHVVANL--VNRDYEGEIKAYGDTVKINQIGAITVKEYTENSDIDAPEELSTTEKVL 76 (276) Q Consensus 1 MA~~~--l~~e~~~~~~~~~l~~~~v~~~~--~~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~ 76 (276) |||+. +.. +.-+++++.|+..++|.++ ++++|+.++.+.||||.+|.+......+-..-+ ..++++.+.++++ T Consensus 1 Ma~~~~~~lt-i~~~eal~~~~n~lV~a~~~~~~r~~d~~~~r~Gdti~ip~p~~~~~~~G~~~t--~~~~~~~e~~v~~ 77 (430) T protein:vir:21 1 MALNEGQIVT-LAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQEGWDLT--DKATGLLELNVAV 77 (430) T ss_pred CccccchhhH-HHHHHHHHHhhhhhhhhhhhhccCCchhhhhcccceEEeecccccccccccccc--CCCccceeeeEeE Confidence 99975 333 3339999999999999997 568999999999999999988665554421111 2235688899999 Q ss_pred EEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc--ccccCCHHHHHHHHHHHHHHH Q lcl|NC_019506. 77 EINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSKLK--PAATLDKTNIYEELIKVKVKL 154 (276) Q Consensus 77 ~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~~~--~~~~~t~~~~~~~i~~a~~~l 154 (276) +++++++..+++++.|. ...+..++++++++++||..+|.++++.+......+.+ .++..++.+.++++..+.+.| T Consensus 78 ~~~~~~~V~~~~~~kEl--~~~~~~er~l~pAm~~LA~~Vd~dl~~~~~~~~~~v~~~~~~t~~~~~~~~~~~A~a~~~L 155 (430) T protein:vir:21 78 NMGEPDNDFFQLRADDL--RDETAYRRRIQSAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVADAEEIM 155 (430) T ss_pred EEeeeccceEEeehhHh--cChhhHHHHHHHHHHHHHHHHHHHHHHHhhhhhhccccccCCCCCCCCcchhhHHHHHHHH Confidence 99999999999998873 57788899999999999999999999998776544432 233334444578899999999 Q ss_pred hhcCCCcc-CCEEEECHHHHHHHhhhH-HhhhhcccccccceeeeeeeE-EeceEE-EEeccccccccceEEEEEecceE Q lcl|NC_019506. 155 DEKNVPTI-GRFLIIPPDVHGLLLAAD-LIVGTGGAMAESITKNGFVGT-ILGFDV-YLSNNMGSLTNGTGAIAGVKMAC 230 (276) Q Consensus 155 ~~~~vP~~-~r~~vv~p~~~~~L~~~~-~~~~~~~~~~~~~~~~G~i~~-~~G~~v-~~s~~lp~~~~~~~~~~~~~~a~ 230 (276) ++.+||.+ +|.++++|..+..|...- .+...+ ....+.+++|.|++ +.||++ +.++++|.++.+......+.+|. T Consensus 156 ~~~~vP~~~~R~~~~~p~~~~~l~~~l~~~~~~~-~~~~~A~r~g~i~r~~~Gfd~~~~s~~~~~~t~gt~t~~tv~gA~ 234 (430) T protein:vir:21 156 FSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFG-RIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGITVSGAQ 234 (430) T ss_pred HHhcCCCCCCcEEEeChHHHHHHhhhhccccccc-cchhHHHhhcccccccchhhhhhhcCCcccccCccCcCceecccc Confidence 99999995 799999999999886642 232222 33556899999997 999985 77899999887776544444443 Q ss_pred Ee---eeeee--------------eeeeccCcccceeeEEeeeeeeeEEEcC------CeEEEEEe-------------- Q lcl|NC_019506. 231 TF---AEQIV--------------QTEAYRMEKRFADAVKGLNVFGCKVIYP------DALVCLKK-------------- 273 (276) Q Consensus 231 ~~---~~~~~--------------~~e~~~~~~~~~~~i~~~~~yg~~v~~~------~~vv~~~~-------------- 273 (276) .+ +..++ .....-.--..||.+..--+|..-.+.+ ...+|... T Consensus 235 ~~~~~~~tv~~~g~~~~~d~~~~~it~s~tg~l~~GD~ftiaGV~~v~~itk~~~~~l~qf~V~a~~~~ttv~I~Pai~~ 314 (430) T protein:vir:21 235 SFKPVAWQLDNDGNKVNVDNRFATVTLSATTGMKRGDKISFAGVKFLGQMAKNVLAQDATFSVVRVVDGTHVEITPKPVA 314 (430) T ss_pred ccccccceeccccccccccccceeeeeecccceecccEEEecceeeeccccccccCCcceEEEEEecCCceeEEeecccc Confidence 11 00000 0000011122233333222222111111 11111110 Q ss_pred -----------------cCC Q lcl|NC_019506. 274 -----------------TNP 276 (276) Q Consensus 274 -----------------~~p 276 (276) +.| T Consensus 315 ~~~~~~~~~~~~y~nVsasp 334 (430) T protein:vir:21 315 LDDVSLSPEQRAYANVNTSL 334 (430) T ss_pred cccccccccccccceecccc Confidence 111 No 59 >protein:vir:78090 Length: 302 # NCBI annotation: Cps # Family: family:all:701 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468790;genbank:gi:157325371;genbank:GeneID:5601852 Probab=99.92 E-value=8.8e-27 Score=163.42 Aligned_cols=267 Identities=12% Similarity=0.058 Sum_probs=193.3 Q ss_pred CccchhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccC-----cccceeecCCCCCCCccccccceEE Q lcl|NC_019506. 1 MAVTSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIG-----AITVKEYTENSDIDAPEELSTTEKV 75 (276) Q Consensus 1 MA~~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~-----~~~~~d~~~~~~~~~~~~~~~~~~~ 75 (276) |||++-+.++|.+.|++.|...+++..|...+-.- ...+|++|+||++. ..+..||+|++++.. .+++..+.+ T Consensus 1 Mantl~ya~~~~~~Ld~~~~~~~~t~~l~~~~~~v-~~~Gak~vkIp~is~~~~~TsGl~dy~R~~g~~~-g~v~~~~et 78 (302) T protein:vir:78 1 MANSLALAQIYQDNIDKAIAVNSKSAFLEANPNNV-QYNGGNTIKIADISFGSGTTGDLKAYNRSTGFTQ-GSVTLAWSD 78 (302) T ss_pred CCchhHHHHHHHHHHHHHHHhhhceeecccCCceE-EEecCcEEEEEEEEeeccccccccccccccCccc-cceeeeeee Confidence 99998889999999999999999888875432222 34678999999996 458999999998764 678999999 Q ss_pred EEEEeeeecceeechHHHHhh-----hhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc----cccccCCHHHHHHH Q lcl|NC_019506. 76 LEINKQKYFNFQIDDVDAAQI-----RTPLMDAAMQRAAYALADETEKILLKEMDTNATSKL----KPAATLDKTNIYEE 146 (276) Q Consensus 76 ~~ld~~~~~~~~v~d~d~~~~-----~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~~----~~~~~~t~~~~~~~ 146 (276) +++++++++.|.|+-+|..++ +.+++.++.+. .+++++|.+.++.+.+.+.... ......+.++++++ T Consensus 79 ~tlt~DR~~~f~vD~mDvdETn~~~~~ani~~ef~r~---~vvPEiDayrfskla~~a~~~~~~~~~~~~~~t~~nvl~~ 155 (302) T protein:vir:78 79 YTLDYDLAQSFQIDAMDVDETKNLATVGNVLSEYQRT---KIVPAIDKYRFTKLANDGTGVGGVIDLSKPDASAQALMGD 155 (302) T ss_pred EEeeeccceeeeccccchhhhhhhhHHHHHHHHHHHh---hhcchhhHHHHHHHHHhhhccCccccccccchhHHHHHHH Confidence 999999999999995544333 34555555554 5677999999988876544322 12334688999999 Q ss_pred HHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhccccc-ccceeeeeeeEEeceEEEEe--cccccc---cc-- Q lcl|NC_019506. 147 LIKVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMA-ESITKNGFVGTILGFDVYLS--NNMGSL---TN-- 218 (276) Q Consensus 147 i~~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~-~~~~~~G~i~~~~G~~v~~s--~~lp~~---~~-- 218 (276) |..+...|++. ++|+|+++|..+..|..++.+.+...... ...-.+++|+.++|++|++. +++.+. ++ T Consensus 156 i~~~~~~~~e~----~~~vl~vtp~~~~~Lk~a~~~~~~~~~~~~~~~~i~~~V~~lDgv~Ii~VPs~r~~t~~~f~~G~ 231 (302) T protein:vir:78 156 IATAMELVDDS----NQLILVTSPTTLAGLLNTALIRESKNTQVLRRGEVDTKITFIQDVEVLQVPSEYLYDKVAPKVGV 231 (302) T ss_pred HHHHHHHhhcc----CCeEEEEChHHHHHHhcchhhccceeccccccccccceeeeecccEEEEchhhhcccceeccCCc Confidence 99999999996 58999999999999988877765432221 12234889999999999873 233221 11 Q ss_pred -------ceEEEEEecceEEeeeeeeeeeeccC-cccc--eeeEEeeeeeeeEEEcCCeEEEE---EecCC Q lcl|NC_019506. 219 -------GTGAIAGVKMACTFAEQIVQTEAYRM-EKRF--ADAVKGLNVFGCKVIYPDALVCL---KKTNP 276 (276) Q Consensus 219 -------~~~~~~~~~~a~~~~~~~~~~e~~~~-~~~~--~~~i~~~~~yg~~v~~~~~vv~~---~~~~p 276 (276) .--++..|++|.........+....+ ++.. +.++..+.++++.|++...-+++ +.+.- T Consensus 232 ~~~~~ak~INfiiv~~~a~ia~~K~~~~~if~P~~~~~gd~~l~~~R~Y~D~fV~~nk~~gI~~~~~~~~~ 302 (302) T protein:vir:78 232 PDYTGAKKIPYMIFKRDAPTGIVKTDKVRVFEPDTNQSADAYKVDLRLYHDLIVPKNQRPGIIKASFGTIA 302 (302) T ss_pred cccCCccceeEEEECCCeeeeeeeeeeeEeeCCCCCCCcceeeeeeeeEeeeeeeccccCeEEEeeccccC Confidence 12367788888877765555544433 3444 45999999999999986544442 22222 No 60 >protein:vir:95451 Length: 313 # NCBI annotation: hypothetical protein ORF044 # Family: family:all:11728 # MgeID: mge:1570 # MgeName: PA11 # Cross-refs: genbank:acc:YP_001294637;genbank:gi:149408203;genbank:GeneID:5237018 Probab=99.86 E-value=1.5e-24 Score=151.26 Aligned_cols=272 Identities=20% Similarity=0.226 Sum_probs=201.6 Q ss_pred Cc---cc--hhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCcccceeecCCCCCCCccccccceEE Q lcl|NC_019506. 1 MA---VT--SFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGAITVKEYTENSDIDAPEELSTTEKV 75 (276) Q Consensus 1 MA---~~--~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~ 75 (276) |- |+ ++..|+|+++++..|.+.+ +...+.|+.++. ..|++.|||.+|.++++...+.+++.. ++++.++++ T Consensus 1 ~~~TSNT~A~I~SE~~s~~I~~~LH~~L-L~~~~~R~V~DF--~~G~~L~I~tiGs~~~~~~~E~~~~~~-~~i~TGEIt 76 (313) T protein:vir:95 1 MQLTSNTRAFIESEQYSKFILLNLHDGL-LPETFYRNVSDF--GSGETLHIKTIGSVTLQEAEEDTPLIY-NPIETGEIT 76 (313) T ss_pred CcccccchheehhhhHHHHHHHHhhccc-cchhhhhhhccC--CCCCEEEecccCceeeeccccCCCeee-cccccceEE Confidence 54 32 5789999999999998877 444455655543 359999999999999998888888765 889999999 Q ss_pred EEEEeeeecceeechHHH--HhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc-----------cccccCCHHH Q lcl|NC_019506. 76 LEINKQKYFNFQIDDVDA--AQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSKL-----------KPAATLDKTN 142 (276) Q Consensus 76 ~~ld~~~~~~~~v~d~d~--~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~~-----------~~~~~~t~~~ 142 (276) +.|..+++.+++|++.-. ...+.+++++...++.+++.+....+++++-......+. -.++.+++.. T Consensus 77 ~~i~~Y~G~A~~vt~~LR~D~~~I~~~~A~~~AE~~RAI~E~~~TD~L~~G~~~FA~~~~P~~vNG~PH~~V~~~T~~~~ 156 (313) T protein:vir:95 77 FQITEYKGDAWYVTDDLREDGTDIDRLMAERAAESTRAIQETFETDFLKTGAEYFAANPGPHNVNGFPHVIVSAETNGVF 156 (313) T ss_pred EEEEeecCChhhhhhhhhhcchhHHHHhhhcchhhHHHHHHHHhhHHHhhchhhhccCCCCcccccccceEEeccCCcee Confidence 999999998888987644 346778899999999999999999999876544322111 1245556666 Q ss_pred HHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeee------eeEEeceEEEEecccccc Q lcl|NC_019506. 143 IYEELIKVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGF------VGTILGFDVYLSNNMGSL 216 (276) Q Consensus 143 ~~~~i~~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~------i~~~~G~~v~~s~~lp~~ 216 (276) .+..|.+++-.|++.++|.+||+.+++|.....|-....+.+.-...+.-++.+|. |.+++|++++.|+.|.+. T Consensus 157 ~~~~~~~~~~~~~~a~~P~~G~v~IvDP~~~~~L~~l~~It~~vt~~~k~I~ESG~A~~~~Fi~~~YG~Di~~SN~L~~A 236 (313) T protein:vir:95 157 ALKHLIAMRLAFDKANVPAEGRVFIVDPVAEATLNGLVTITHDVTDFGKMILESGMARGQRFIMNLYGWDILTSNRLHVA 236 (313) T ss_pred hhhHHHHhhhhhhhccCCccceEEEEcchhhhhhhhhheeecccccccceeeeccCCchhHHHHHHhhhhhhhhhhhhhc Confidence 78899999999999999999999999999999998766555422222333566664 567999999999988643 Q ss_pred c--------cceEE--EEE-----ecceEEeeeeeeeeeeccCcccceeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 217 T--------NGTGA--IAG-----VKMACTFAEQIVQTEAYRMEKRFADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 217 ~--------~~~~~--~~~-----~~~a~~~~~~~~~~e~~~~~~~~~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) . ++.-. |.. .+.-++..+++.+.+.+++.++-.+--...++||.++.|.|.++.+-..|- T Consensus 237 N~~D~~tT~~G~~~NlFM~i~D~~~~P~~~AWr~MP~s~~~~~~~~~~~~~~~~~R~G~Gi~R~~~L~~~~~~A~ 311 (313) T protein:vir:95 237 NYNDGTTTGNGYVGNLFMCILDDQTKPIMGAWRRMPKSEGERNKDRARDEHVVRCRYGFGIQRLDTLGLLATSAT 311 (313) T ss_pred cccccccccCceeeeeeeeeecccccceeeeeccccccccccccccccccceeeeeecccceeecceeEEEeccc Confidence 2 11111 111 122233334555667777776666666778999999999999988866666 No 61 >protein:vir:5974 Length: 324 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690674;genbank:geneid:6329212;genbank:gi:22855068;goa:Q38582;uniprot:Q38582;genbank:GeneID:955303 Probab=99.83 E-value=1.1e-21 Score=135.51 Aligned_cols=264 Identities=16% Similarity=0.139 Sum_probs=188.8 Q ss_pred Cccc----hhhHHHHHHHHHHHHHHhhcchh--hhccc--ccccc--ccCCcEEEEeccCcc--cceeecCCCCCCCccc Q lcl|NC_019506. 1 MAVT----SFIPKLWSARLLAHLDKAHVVAN--LVNRD--YEGEI--KAYGDTVKINQIGAI--TVKEYTENSDIDAPEE 68 (276) Q Consensus 1 MA~~----~l~~e~~~~~~~~~l~~~~v~~~--~~~~~--~~~~~--~~~Gdtv~ip~~~~~--~~~d~~~~~~~~~~~~ 68 (276) ||.+ +++||+|+.++.+.+.+.+.|.. .+.++ ....+ ..+|++|++|.++.+ ..+++..++.+. ++. T Consensus 1 MA~T~lsd~i~peVf~~yv~~~~~~~~~l~qSg~i~~~a~i~~~l~~~~~G~~i~~P~~~~l~Gd~~~v~~~~~i~-~~~ 79 (324) T protein:vir:59 1 MAYTKISDVIVPELFNPYVINTTTQLSAFFQSGIAATDDELNALAKKAGGGSTLNMPYWNDLDGDSQVLNDTDDLV-PQK 79 (324) T ss_pred CCceeeeceechhHHHHHHHhhhHHHHHHhhcccccccHHHHHHhhccCCCCEEEecccccCCCcccccCCCcccc-hhh Confidence 9965 48999999999999988776633 11111 11111 246999999999886 477788777765 477 Q ss_pred cccceEEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc------ccccccCCHHH Q lcl|NC_019506. 69 LSTTEKVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSK------LKPAATLDKTN 142 (276) Q Consensus 69 ~~~~~~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~------~~~~~~~t~~~ 142 (276) ++.++...++. ++++++.++|+....+-.|++.+..+|.+..++++.+..+++.+....... ....++.+..- T Consensus 80 l~t~~~~a~i~-~~~k~~~~tD~a~~~sg~dp~~~i~~q~a~~~~~~~~~~lia~l~g~~~~~~~~~~~~dvsa~~~~~~ 158 (324) T protein:vir:59 80 INAGQDKAVLI-LRGNAWSSHDLAATLSGSDPMQAIGSRVAAYWAREMQKIVFAELAGVFSNDDMKDNKLDISGTADGIY 158 (324) T ss_pred cccceeeEEEE-eecCceeehhhhhhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccceeeeecccccee Confidence 88777777774 689999999999888889999999999999999999999999886432111 11111111112 Q ss_pred HHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEeccccccc----- Q lcl|NC_019506. 143 IYEELIKVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSNNMGSLT----- 217 (276) Q Consensus 143 ~~~~i~~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp~~~----- 217 (276) ..+.|.+|..+|.++. ..-..++|||..+..|.++. +.....+.. ..+.|+.++|.+|+++..+|+.. T Consensus 159 s~~~l~~A~~~~GD~~--~~~~~ivmhS~v~~~L~~~~-li~~~~~s~----~~~~i~~~~G~~VivdD~~p~~~~~~~~ 231 (324) T protein:vir:59 159 SAETFVDASYKLGDHE--SLLTAIGMHSATMASAVKQD-LIEFVKDSQ----SGIRFPTYMNKRVIVDDSMPVETLEDGT 231 (324) T ss_pred cHHHHHHHHHHhCCcc--cCcEEEEEchHHHHHHHHhh-hhhhccccc----cCceeeeecccEEEEeCCCCccccCCCC Confidence 3577899999998764 23357999999999999874 333222211 24568999999999999998642 Q ss_pred cceEEEEEecceEEeee--eeeeeeeccCcccceeeEEeeeeeeeEEEcCCeEEEEE----ecCC Q lcl|NC_019506. 218 NGTGAIAGVKMACTFAE--QIVQTEAYRMEKRFADAVKGLNVFGCKVIYPDALVCLK----KTNP 276 (276) Q Consensus 218 ~~~~~~~~~~~a~~~~~--~~~~~e~~~~~~~~~~~i~~~~~yg~~v~~~~~vv~~~----~~~p 276 (276) ..+.++.+.++|+++.. +...+|..|++....|.+..+.+|...+. |+--.. ..+| T Consensus 232 ~~y~s~l~~~GAi~~~~~~~~v~vE~dRd~~~g~~~l~~r~~~~~~p~---G~s~~~~~~~~~sP 293 (324) T protein:vir:59 232 KVFTSYLFGAGALGYAEGQPEVPTETARNALGSQDILINRKHFVLHPR---GVKFTENAMAGTTP 293 (324) T ss_pred ceEEEEEEecCeEEEeecCCCcceecccCccccceEEEEeeEEEeEee---eEEecccccCCCCC Confidence 24567889999999874 34567899999888888888988876553 322211 2457 No 62 >protein:vir:102944 Length: 330 # NCBI annotation: major head protein # Family: family:all:1522 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945286;genbank:gi:39653721;uniprot:Q708M6;genbank:GeneID:2672858 Probab=99.82 E-value=9.3e-22 Score=135.88 Aligned_cols=264 Identities=9% Similarity=0.064 Sum_probs=190.0 Q ss_pred Cccc------hhhHHHHHHHHHHHHHHhhcchhh--hccc--cccccccCCcEEEEeccCcc--cceeecCCC-CCCCcc Q lcl|NC_019506. 1 MAVT------SFIPKLWSARLLAHLDKAHVVANL--VNRD--YEGEIKAYGDTVKINQIGAI--TVKEYTENS-DIDAPE 67 (276) Q Consensus 1 MA~~------~l~~e~~~~~~~~~l~~~~v~~~~--~~~~--~~~~~~~~Gdtv~ip~~~~~--~~~d~~~~~-~~~~~~ 67 (276) ||++ +++||+|+.++.+.+.+.+.|... +..+ .......+|++|++|.++.+ ...++..+. .+. ++ T Consensus 1 Ma~~~T~l~d~i~pevf~~yv~~~~~~~~~l~qSG~i~~~~~i~~~~~~~G~~i~~P~~~~l~G~~~~~~dg~~~i~-~~ 79 (330) T protein:vir:10 1 MANELTKILDTITPQQYNAYMQQYTAAKSAFVQSGIAVSDERVSKNITSGGLLVNMPFWNDLTGDSEVLGNGDKALE-TG 79 (330) T ss_pred CCCCceEeeeeechhHHHHHHHHHhHHhhhhhhcccccccHHHHHHhhcCCCEEEecccccCCCcccccCCCccccc-hh Confidence 9974 489999999999999877666331 2121 22222247999999999876 355565554 444 47 Q ss_pred ccccceEEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc----------c--ccc Q lcl|NC_019506. 68 ELSTTEKVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSK----------L--KPA 135 (276) Q Consensus 68 ~~~~~~~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~----------~--~~~ 135 (276) .++.++...++ .++.+++.++|+....+-.|++.+..+|.+...+++.+..+++.+....... . ... T Consensus 80 ki~t~~~~a~i-~~~~k~~~~tD~a~~~~g~dp~~~i~~q~a~~w~~~~q~~lla~l~gvf~~~~~~~~~~~~~~~~~~~ 158 (330) T protein:vir:10 80 KITAGADIACV-LYRGRGWAANELTGVVAGSDPVRAILNRIGAYWLREDQKALIATLNGIFATGTAGEKGALEETHVSDQ 158 (330) T ss_pred hcccceeEEEE-EeecceeeehhhhhhhcchhHHHHHHHHHHHHhhhhHHHHHHHHHHhhhhhhhcccchhhhhhheecc Confidence 78877777777 4578899999999999999999999999999999999999998776432110 0 000 Q ss_pred ccCCHHHHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEeccccc Q lcl|NC_019506. 136 ATLDKTNIYEELIKVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSNNMGS 215 (276) Q Consensus 136 ~~~t~~~~~~~i~~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp~ 215 (276) +........+.|.+|..+|.++. ..-..++|||..+..|.++. +.+...+.. ..+.|+.++|..|+++..+|. T Consensus 159 ~~~~a~~s~~~l~~A~~~~GD~~--~~~~~ivmhS~v~~~L~~~~-li~~~~~s~----~~~~i~~~~G~~VivdD~~p~ 231 (330) T protein:vir:10 159 SKASTGIDAGMVLDAKQLLGDSA--DQVTAIAMHSAVYTKLQKDN-LIQYIQPTT----ATINIPTYLGYRVIIDDGIAP 231 (330) T ss_pred cccccccCHHHHHHHHHHhcccc--ccceEEEEcHHHHHHHHHhh-hhhhhcccc----cCcccccccceEEEEeCCCCC Confidence 11111122567899999998775 33457999999999998853 444333221 246789999999999999998 Q ss_pred cccceEEEEEecceEEeee----eeeeeeeccCcccceeeEEeeeeeeeEEEcCCeEEEE------EecCC Q lcl|NC_019506. 216 LTNGTGAIAGVKMACTFAE----QIVQTEAYRMEKRFADAVKGLNVFGCKVIYPDALVCL------KKTNP 276 (276) Q Consensus 216 ~~~~~~~~~~~~~a~~~~~----~~~~~e~~~~~~~~~~~i~~~~~yg~~v~~~~~vv~~------~~~~p 276 (276) ..+...++.+.++|+++.. +...+|..|+++...|.+..+.+|...+ -|+--- ....| T Consensus 232 ~~~~yt~yl~~~GAi~~~~~~~~~~v~~EtdRd~~~g~~~l~~r~~~~~hp---~G~s~~~~~~~~~~~sP 299 (330) T protein:vir:10 232 TGDIYTSYLFRTGSIGLNTGNPSGLTTFETSREAAKGNDMIYTRRALVMHP---YGVKWTGAEVDAGNITP 299 (330) T ss_pred CCCceeEEEEecCceeeecccCCccccccccCCccccceEEEEeeEEEeee---eeeeecccccccCcCCc Confidence 8888888999999999863 4457899999988889999999987654 333322 23457 No 63 >protein:vir:1583 Length: 351 # NCBI annotation: minor capsid protein # Family: family:all:1522 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695165;swissprot:trembl:o03966;genbank:gi:23455804;uniprot:O03966;genbank:GeneID:955561 Probab=99.80 E-value=6.8e-21 Score=131.16 Aligned_cols=265 Identities=12% Similarity=0.082 Sum_probs=184.7 Q ss_pred Cccc----hhhHHHHHHHHHHHHHHhhcchh--hhccc--cccccccCCcEEEEeccCcc--cceeecCCCCCCCccccc Q lcl|NC_019506. 1 MAVT----SFIPKLWSARLLAHLDKAHVVAN--LVNRD--YEGEIKAYGDTVKINQIGAI--TVKEYTENSDIDAPEELS 70 (276) Q Consensus 1 MA~~----~l~~e~~~~~~~~~l~~~~v~~~--~~~~~--~~~~~~~~Gdtv~ip~~~~~--~~~d~~~~~~~~~~~~~~ 70 (276) ||.+ +++||+|+.++.+.+.+.+.|.. .+..+ +......+|++|+||.++.+ ...++..+..+. ++.++ T Consensus 1 MA~T~lsd~i~PEvf~~yv~~~~~~~~~l~qSG~i~~~~~l~~~~~~~G~~it~P~~~~l~Gd~~~~~~~~~i~-~~kit 79 (351) T protein:vir:15 1 MAETHLSDLIVPEVFGNYVVNQIIKTNRFVQSGILTPDPDLGPHLLEAGTRITVPFLNDLTGDPDNWTDSDDID-VNNLT 79 (351) T ss_pred CCceeeeeeechhHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhcCCCEEEecccccCCCcccccCCCcccc-hheec Confidence 9976 48999999999999877776633 12121 22223357999999999876 477777777765 47788 Q ss_pred cceEEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc------c---cccccCCHH Q lcl|NC_019506. 71 TTEKVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSK------L---KPAATLDKT 141 (276) Q Consensus 71 ~~~~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~------~---~~~~~~t~~ 141 (276) .++...++ .++.+++.++|+....+..|++.++.+|.+...+++.+..+++.++...... . +..+..+.. T Consensus 80 t~~~~a~i-~~~~kg~~~tD~a~~~sg~dp~~~i~~q~a~~w~~~~q~~lla~l~gv~~~~~~~~~~~~d~t~~~~~~~~ 158 (351) T protein:vir:15 80 SGKQQGIK-FYQTKAYGYTDLGTMISGAPVQETIGNRFAAFWQRADQKTLLSVLKGVMGVTKIANSKVYDQTKVSPSEPM 158 (351) T ss_pred ccceeEEE-EeeccceehhhhhHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhchhhcccceeccccccccccc Confidence 77777777 5578899999999988889999999999999999999999999876431110 0 011111122 Q ss_pred HHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEecccccccc--- Q lcl|NC_019506. 142 NIYEELIKVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSNNMGSLTN--- 218 (276) Q Consensus 142 ~~~~~i~~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp~~~~--- 218 (276) -..+.|.+|..+|.+..- ..-..++|||..+..|.++. +.....+.. ..+.|+.+.|.+|+++..+|+... T Consensus 159 is~~~l~~A~~~~GD~~~-~~~~~ivmhS~v~~~L~~~~-li~~~~~s~----~~~~i~t~~G~~VivdD~~p~~~~~~~ 232 (351) T protein:vir:15 159 FGAKGFTGAIGLMGDLQD-TAFGAIAVNSATYSLMKVQG-LIETIQPQN----GATPFEAYNGLRIVLDDDIEIDLTDKT 232 (351) T ss_pred cCHHHHHHHHHHhccccc-cceEEEEEChHHHHHHHhhh-hhhhccccc----cCcccceecceEEEEcCCCccccCCCC Confidence 235779999999976421 11246889999999998864 333332211 245689999999999999997532 Q ss_pred --ceEEEEEecceEEeeeeeeeeeeccCcccc--eeeEEeeeeeeeEEEcCCeEEEE------EecCC Q lcl|NC_019506. 219 --GTGAIAGVKMACTFAEQIVQTEAYRMEKRF--ADAVKGLNVFGCKVIYPDALVCL------KKTNP 276 (276) Q Consensus 219 --~~~~~~~~~~a~~~~~~~~~~e~~~~~~~~--~~~i~~~~~yg~~v~~~~~vv~~------~~~~p 276 (276) .+.++.+.++|+++..+...+|..|++... .|.+..+.+|.. .|-|+--- ....| T Consensus 233 ~~~ytsyl~~~GAi~~~~~~~~ve~~rd~~~~~g~d~l~~r~~~~~---hp~G~s~~~~~~~~~~~sP 297 (351) T protein:vir:15 233 KPVSTSYIFAPGAVRYSTNMRSTETKYDPLINGGQDVIVQKRVGTI---HVAGTSIKASFSPSKASFP 297 (351) T ss_pred CceeEEEEEecceeeeecCCcCcceeecccCCCCceEEEEeeeeee---eeeeeeecccccccCcCCc Confidence 245788899999998877778888876643 355566666553 44444332 23346 No 64 >protein:vir:79987 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430002;genbank:gi:156604057;genbank:GeneID:5525447 Probab=99.67 E-value=4.9e-17 Score=110.01 Aligned_cols=267 Identities=11% Similarity=0.079 Sum_probs=174.5 Q ss_pred Cc-------cchhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEecc-CcccceeecCCCCCCCccccccc Q lcl|NC_019506. 1 MA-------VTSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQI-GAITVKEYTENSDIDAPEELSTT 72 (276) Q Consensus 1 MA-------~~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~-~~~~~~d~~~~~~~~~~~~~~~~ 72 (276) ++ ...++|+.|...+++.+++..++..+++.-. .... .-++.+++. +...+....++.........+.+ T Consensus 120 ~~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~--~~~~-~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~ 196 (415) T protein:vir:79 120 QGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKR--VTNG-SGKYPVVRQSEVAALEKVEELEENPELAVKPFF 196 (415) T ss_pred hhccccccccccccchHHHHHHHHHHHhhhhhhhheeeee--ccCC-ceeEEEEeecCCccceeeccccccCccccccee Confidence 11 2347899999999999999998888875321 1111 124445543 33445666666655433345667 Q ss_pred eEEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc-------cccccCCHHHHHH Q lcl|NC_019506. 73 EKVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSKL-------KPAATLDKTNIYE 145 (276) Q Consensus 73 ~~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~~-------~~~~~~t~~~~~~ 145 (276) .+++.+.+. +.-+.|+++-...+..++...+.+..++++++.+|..++.....+..... ......++...++ T Consensus 197 ~v~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~~~~~ 275 (415) T protein:vir:79 197 QLAYDINTH-RGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLD 275 (415) T ss_pred eEEeeeeee-EeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccccccccccccccccccccchh Confidence 778887655 44477888877677789999999999999999999999876654322111 1112223334578 Q ss_pred HHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEeccccccccceEE-EE Q lcl|NC_019506. 146 ELIKVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSNNMGSLTNGTGA-IA 224 (276) Q Consensus 146 ~i~~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp~~~~~~~~-~~ 224 (276) .|.++...+...... +-.+++||..+..|.+...- ...+.....+..|..++++|++|+.++.+|..+.+... ++ T Consensus 276 ~i~~~~~~~~~~~~~--~~~~v~n~~~~~~l~~lkd~--~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~ 351 (415) T protein:vir:79 276 DIKDAINLNVKPNYE--HNVAIVSQTMFAKLDKMKDK--LGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLII 351 (415) T ss_pred HHHHHHHhhhhhccC--CCEEEEcHHHHHHHHHhhcc--CCceeeccCcCCCCCceecceeeEEecccccCCCCccEEEE Confidence 888888888766653 33588999999998763221 11222233455677789999999999988865544433 33 Q ss_pred Ee-cceEEeee-eeeeeeeccCcccceeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 225 GV-KMACTFAE-QIVQTEAYRMEKRFADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 225 ~~-~~a~~~~~-~~~~~e~~~~~~~~~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) +. +.++.... ....++..+ ...+.+.+++..++|+.+.+|++++.++.+.+ T Consensus 352 Gd~~~~~~~~~~~~~~v~~~~-~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~ 404 (415) T protein:vir:79 352 GNLKDAIVLFDRSQYQASWTD-YMHFGECLMIAVRQDCRILDYKSAIVIEYDDS 404 (415) T ss_pred EehhccEEEEeecceEEEEec-cccCceEEEEEEEeccEEeccccEEEEEEecc Confidence 33 33443333 223344333 34456778899999999999999999998887 No 65 >protein:vir:98339 Length: 415 # NCBI annotation: putative capsid protein # Family: family:all:21 # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918931;genbank:gi:119443693;genbank:GeneID:4594501 Probab=99.67 E-value=4.9e-17 Score=110.01 Aligned_cols=267 Identities=11% Similarity=0.079 Sum_probs=174.5 Q ss_pred Cc-------cchhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEecc-CcccceeecCCCCCCCccccccc Q lcl|NC_019506. 1 MA-------VTSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQI-GAITVKEYTENSDIDAPEELSTT 72 (276) Q Consensus 1 MA-------~~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~-~~~~~~d~~~~~~~~~~~~~~~~ 72 (276) ++ ...++|+.|...+++.+++..++..+++.-. .... .-++.+++. +...+....++.........+.+ T Consensus 120 ~~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~--~~~~-~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~ 196 (415) T protein:vir:98 120 QGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKR--VTNG-SGKYPVVRQSEVAALEKVEELEENPELAVKPFF 196 (415) T ss_pred hhccccccccccccchHHHHHHHHHHHhhhhhhhheeeee--ccCC-ceeEEEEeecCCccceeeccccccCccccccee Confidence 11 2347899999999999999998888875321 1111 124445543 33445666666655433345667 Q ss_pred eEEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc-------cccccCCHHHHHH Q lcl|NC_019506. 73 EKVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSKL-------KPAATLDKTNIYE 145 (276) Q Consensus 73 ~~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~~-------~~~~~~t~~~~~~ 145 (276) .+++.+.+. +.-+.|+++-...+..++...+.+..++++++.+|..++.....+..... ......++...++ T Consensus 197 ~v~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~~~~~ 275 (415) T protein:vir:98 197 QLAYDINTH-RGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLD 275 (415) T ss_pred eEEeeeeee-EeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccccccccccccccccccccchh Confidence 778887655 44477888877677789999999999999999999999876654322111 1112223334578 Q ss_pred HHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEeccccccccceEE-EE Q lcl|NC_019506. 146 ELIKVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSNNMGSLTNGTGA-IA 224 (276) Q Consensus 146 ~i~~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp~~~~~~~~-~~ 224 (276) .|.++...+...... +-.+++||..+..|.+...- ...+.....+..|..++++|++|+.++.+|..+.+... ++ T Consensus 276 ~i~~~~~~~~~~~~~--~~~~v~n~~~~~~l~~lkd~--~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~ 351 (415) T protein:vir:98 276 DIKDAINLNVKPNYE--HNVAIVSQTMFAKLDKMKDK--LGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLII 351 (415) T ss_pred HHHHHHHhhhhhccC--CCEEEEcHHHHHHHHHhhcc--CCceeeccCcCCCCCceecceeeEEecccccCCCCccEEEE Confidence 888888888766653 33588999999998763221 11222233455677789999999999988865544433 33 Q ss_pred Ee-cceEEeee-eeeeeeeccCcccceeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 225 GV-KMACTFAE-QIVQTEAYRMEKRFADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 225 ~~-~~a~~~~~-~~~~~e~~~~~~~~~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) +. +.++.... ....++..+ ...+.+.+++..++|+.+.+|++++.++.+.+ T Consensus 352 Gd~~~~~~~~~~~~~~v~~~~-~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~ 404 (415) T protein:vir:98 352 GNLKDAIVLFDRSQYQASWTD-YMHFGECLMIAVRQDCRILDYKSAIVIEYDDS 404 (415) T ss_pred EehhccEEEEeecceEEEEec-cccCceEEEEEEEeccEEeccccEEEEEEecc Confidence 33 33443333 223344333 34456778899999999999999999998887 No 66 >protein:vir:81100 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429874;genbank:gi:156603927;genbank:GeneID:5525320 Probab=99.67 E-value=4.9e-17 Score=110.01 Aligned_cols=267 Identities=11% Similarity=0.079 Sum_probs=174.5 Q ss_pred Cc-------cchhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEecc-CcccceeecCCCCCCCccccccc Q lcl|NC_019506. 1 MA-------VTSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQI-GAITVKEYTENSDIDAPEELSTT 72 (276) Q Consensus 1 MA-------~~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~-~~~~~~d~~~~~~~~~~~~~~~~ 72 (276) ++ ...++|+.|...+++.+++..++..+++.-. .... .-++.+++. +...+....++.........+.+ T Consensus 120 ~~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~--~~~~-~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~ 196 (415) T protein:vir:81 120 QGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKR--VTNG-SGKYPVVRQSEVAALEKVEELEENPELAVKPFF 196 (415) T ss_pred hhccccccccccccchHHHHHHHHHHHhhhhhhhheeeee--ccCC-ceeEEEEeecCCccceeeccccccCccccccee Confidence 11 2347899999999999999998888875321 1111 124445543 33445666666655433345667 Q ss_pred eEEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc-------cccccCCHHHHHH Q lcl|NC_019506. 73 EKVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSKL-------KPAATLDKTNIYE 145 (276) Q Consensus 73 ~~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~~-------~~~~~~t~~~~~~ 145 (276) .+++.+.+. +.-+.|+++-...+..++...+.+..++++++.+|..++.....+..... ......++...++ T Consensus 197 ~v~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~~~~~ 275 (415) T protein:vir:81 197 QLAYDINTH-RGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLD 275 (415) T ss_pred eEEeeeeee-EeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccccccccccccccccccccchh Confidence 778887655 44477888877677789999999999999999999999876654322111 1112223334578 Q ss_pred HHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEeccccccccceEE-EE Q lcl|NC_019506. 146 ELIKVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSNNMGSLTNGTGA-IA 224 (276) Q Consensus 146 ~i~~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp~~~~~~~~-~~ 224 (276) .|.++...+...... +-.+++||..+..|.+...- ...+.....+..|..++++|++|+.++.+|..+.+... ++ T Consensus 276 ~i~~~~~~~~~~~~~--~~~~v~n~~~~~~l~~lkd~--~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~ 351 (415) T protein:vir:81 276 DIKDAINLNVKPNYE--HNVAIVSQTMFAKLDKMKDK--LGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLII 351 (415) T ss_pred HHHHHHHhhhhhccC--CCEEEEcHHHHHHHHHhhcc--CCceeeccCcCCCCCceecceeeEEecccccCCCCccEEEE Confidence 888888888766653 33588999999998763221 11222233455677789999999999988865544433 33 Q ss_pred Ee-cceEEeee-eeeeeeeccCcccceeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 225 GV-KMACTFAE-QIVQTEAYRMEKRFADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 225 ~~-~~a~~~~~-~~~~~e~~~~~~~~~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) +. +.++.... ....++..+ ...+.+.+++..++|+.+.+|++++.++.+.+ T Consensus 352 Gd~~~~~~~~~~~~~~v~~~~-~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~ 404 (415) T protein:vir:81 352 GNLKDAIVLFDRSQYQASWTD-YMHFGECLMIAVRQDCRILDYKSAIVIEYDDS 404 (415) T ss_pred EehhccEEEEeecceEEEEec-cccCceEEEEEEEeccEEeccccEEEEEEecc Confidence 33 33443333 223344333 34456778899999999999999999998887 No 67 >protein:vir:4700 Length: 415 # NCBI annotation: phi PVL ORF 7 homologue # Family: family:all:21 # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061632;genbank:gi:9635719;genbank:GeneID:1262976 Probab=99.67 E-value=5.2e-17 Score=109.88 Aligned_cols=266 Identities=11% Similarity=0.080 Sum_probs=171.2 Q ss_pred Cc-------cchhhHHHHHHHHHHHHHHhhcchhhhccccccccccCC-cEEEEecc-CcccceeecCCCCCCCcccccc Q lcl|NC_019506. 1 MA-------VTSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYG-DTVKINQI-GAITVKEYTENSDIDAPEELST 71 (276) Q Consensus 1 MA-------~~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~G-dtv~ip~~-~~~~~~d~~~~~~~~~~~~~~~ 71 (276) ++ ...++|+.|...+++.+++..++..+++.- +...| -++.++.. +...+..+.++.........+. T Consensus 120 ~~~~~~t~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~~~ 195 (415) T protein:vir:47 120 QGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVK----RVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPF 195 (415) T ss_pred hhccccccCCcccccHHHHHHHHHHHHhhhhhhhhccee----eccCCceeEEEEEecCCcceeecccccccccccccce Confidence 11 224789999999999999999998887532 11112 23333332 3334556666655543234566 Q ss_pred ceEEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc-------cccccCCHHHHH Q lcl|NC_019506. 72 TEKVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSKL-------KPAATLDKTNIY 144 (276) Q Consensus 72 ~~~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~~-------~~~~~~t~~~~~ 144 (276) ..+++...+. +.-+.|+++-...+..++...+.+..++++++.+|..++.....+..... ......+....+ T Consensus 196 ~~v~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~~~~ 274 (415) T protein:vir:47 196 FQLAYDINTH-RGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSL 274 (415) T ss_pred eeEEeeeeee-EeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCCccccccccccccceeccccccch Confidence 6777777554 44577888777677789999999999999999999999876654322111 111122233346 Q ss_pred HHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEeccccccccceE-EE Q lcl|NC_019506. 145 EELIKVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSNNMGSLTNGTG-AI 223 (276) Q Consensus 145 ~~i~~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp~~~~~~~-~~ 223 (276) +.|.++...+...... +-.+|+||..+..|.+... ....+.....+.+|..++++|++|+.++.+|..+.+.. ++ T Consensus 275 ~~i~~~~~~~~~~~~~--~~~~v~n~~~~~~L~~lkd--~~G~~i~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~ 350 (415) T protein:vir:47 275 DDIKDAINLNVKPNYE--HNVAIVSQTMFAKLDKMKD--KLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLI 350 (415) T ss_pred HHHHHHHHhhhhhccC--CCEEEEcHHHHHHHHHhhc--cCCCeeeccCcCCCCCccccceeeEEeccccccCCCccEEE Confidence 7788887777666542 3368999999999865321 11122233345677778999999999998886544433 23 Q ss_pred EEe-cceEEeee-eeeeeeeccCcccceeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 224 AGV-KMACTFAE-QIVQTEAYRMEKRFADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 224 ~~~-~~a~~~~~-~~~~~e~~~~~~~~~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) ++. +.++.... +...++.. +...+...+++..++|+++++|++++.++.+++ T Consensus 351 ~gd~~~~~~~~~~~~~~v~~~-~~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~ 404 (415) T protein:vir:47 351 IGNLKDAIVLFDRSQYQASWT-DYMHFGECLMIAVRQDCRILDYKSAIVIEYDDS 404 (415) T ss_pred EEehhccEEEEeecceEEEee-ccccCceEEEEEEEeccEEeccccEEEEEeecc Confidence 332 33443333 22333333 233445678899999999999999999998888 No 68 >protein:vir:4600 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058445;genbank:gi:9635171;genbank:GeneID:1262708 Probab=99.67 E-value=5.2e-17 Score=109.88 Aligned_cols=266 Identities=11% Similarity=0.080 Sum_probs=171.2 Q ss_pred Cc-------cchhhHHHHHHHHHHHHHHhhcchhhhccccccccccCC-cEEEEecc-CcccceeecCCCCCCCcccccc Q lcl|NC_019506. 1 MA-------VTSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYG-DTVKINQI-GAITVKEYTENSDIDAPEELST 71 (276) Q Consensus 1 MA-------~~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~G-dtv~ip~~-~~~~~~d~~~~~~~~~~~~~~~ 71 (276) ++ ...++|+.|...+++.+++..++..+++.- +...| -++.++.. +...+..+.++.........+. T Consensus 120 ~~~~~~t~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~~~ 195 (415) T protein:vir:46 120 QGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVK----RVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPF 195 (415) T ss_pred hhccccccCCcccccHHHHHHHHHHHHhhhhhhhhccee----eccCCceeEEEEEecCCcceeecccccccccccccce Confidence 11 224789999999999999999998887532 11112 23333332 3334556666655543234566 Q ss_pred ceEEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc-------cccccCCHHHHH Q lcl|NC_019506. 72 TEKVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSKL-------KPAATLDKTNIY 144 (276) Q Consensus 72 ~~~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~~-------~~~~~~t~~~~~ 144 (276) ..+++...+. +.-+.|+++-...+..++...+.+..++++++.+|..++.....+..... ......+....+ T Consensus 196 ~~v~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~~~~ 274 (415) T protein:vir:46 196 FQLAYDINTH-RGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSL 274 (415) T ss_pred eeEEeeeeee-EeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCCccccccccccccceeccccccch Confidence 6777777554 44577888777677789999999999999999999999876654322111 111122233346 Q ss_pred HHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEeccccccccceE-EE Q lcl|NC_019506. 145 EELIKVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSNNMGSLTNGTG-AI 223 (276) Q Consensus 145 ~~i~~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp~~~~~~~-~~ 223 (276) +.|.++...+...... +-.+|+||..+..|.+... ....+.....+.+|..++++|++|+.++.+|..+.+.. ++ T Consensus 275 ~~i~~~~~~~~~~~~~--~~~~v~n~~~~~~L~~lkd--~~G~~i~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~ 350 (415) T protein:vir:46 275 DDIKDAINLNVKPNYE--HNVAIVSQTMFAKLDKMKD--KLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLI 350 (415) T ss_pred HHHHHHHHhhhhhccC--CCEEEEcHHHHHHHHHhhc--cCCCeeeccCcCCCCCccccceeeEEeccccccCCCccEEE Confidence 7788887777666542 3368999999999865321 11122233345677778999999999998886544433 23 Q ss_pred EEe-cceEEeee-eeeeeeeccCcccceeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 224 AGV-KMACTFAE-QIVQTEAYRMEKRFADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 224 ~~~-~~a~~~~~-~~~~~e~~~~~~~~~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) ++. +.++.... +...++.. +...+...+++..++|+++++|++++.++.+++ T Consensus 351 ~gd~~~~~~~~~~~~~~v~~~-~~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~ 404 (415) T protein:vir:46 351 IGNLKDAIVLFDRSQYQASWT-DYMHFGECLMIAVRQDCRILDYKSAIVIEYDDS 404 (415) T ss_pred EEehhccEEEEeecceEEEee-ccccCceEEEEEEEeccEEeccccEEEEEeecc Confidence 332 33443333 22333333 233445678899999999999999999998888 No 69 >protein:vir:9410 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803388;genbank:gi:29028700;genbank:GeneID:1258136 Probab=99.66 E-value=7.8e-17 Score=108.90 Aligned_cols=267 Identities=11% Similarity=0.069 Sum_probs=174.2 Q ss_pred CccchhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccC-cccceeecCCCCCCCccccccceEEEEEE Q lcl|NC_019506. 1 MAVTSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIG-AITVKEYTENSDIDAPEELSTTEKVLEIN 79 (276) Q Consensus 1 MA~~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~-~~~~~d~~~~~~~~~~~~~~~~~~~~~ld 79 (276) -....++|+.|...+++.+++..++..+++.-. . .....++.++... ...+....++.........+.+.+++.+. T Consensus 127 ~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~--~-~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~~~~~i~~~~~ 203 (415) T protein:vir:94 127 DSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKR--V-TNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDIN 203 (415) T ss_pred ccccccCcHHHHHHHHHHHHhhhhhhhhcceee--c-cCCceeEEEEeecCCccceeccccccccccccccceeeEeehe Confidence 112347899999999999999999988875421 1 1112345555443 34466666666654323456677777776 Q ss_pred eeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc-------cccccCCHHHHHHHHHHHHH Q lcl|NC_019506. 80 KQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSKL-------KPAATLDKTNIYEELIKVKV 152 (276) Q Consensus 80 ~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~~-------~~~~~~t~~~~~~~i~~a~~ 152 (276) +. +.-+.|+++-...+..++.+.+.++.++++++.+|..++.....+..... ......++...++.|.++.. T Consensus 204 k~-~~~~~is~ell~ds~~~~~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~ 282 (415) T protein:vir:94 204 TH-RGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAIN 282 (415) T ss_pred ee-eeechhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccccccccccccccccccccchHHHHHHHH Confidence 54 44467888766667789999999999999999999998876654432111 11112223344778888887 Q ss_pred HHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEeccccccccceE-EEEEe-cceE Q lcl|NC_019506. 153 KLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSNNMGSLTNGTG-AIAGV-KMAC 230 (276) Q Consensus 153 ~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp~~~~~~~-~~~~~-~~a~ 230 (276) .+...... +-.+++||..+..|.+...- ...+.....+.+|..++++|++|+.++.+|..+.+.. ++++. +.++ T Consensus 283 ~~~~~~~~--~~~~vmn~~~~~~l~~lkd~--~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~i~~gd~~~~~ 358 (415) T protein:vir:94 283 LNVKPNYE--HNVAIVSQTMFAKLDKMKDK--LGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAI 358 (415) T ss_pred hhhhhccC--CCEEEEcHHHHHHHHHhhcc--CCCeeeccCcCCCCCceecceeeEEecccccCCCCccEEEEEehhccE Confidence 77766653 33689999999999764221 1122223345667778999999999999886554433 23332 4444 Q ss_pred Eeeeee-eeeeeccCcccceeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 231 TFAEQI-VQTEAYRMEKRFADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 231 ~~~~~~-~~~e~~~~~~~~~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) ....+. ..++.. +...+.+.+++..++|+.+++|++++.++.+.+ T Consensus 359 ~~~~~~~~~v~~~-~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~ 404 (415) T protein:vir:94 359 VLFDRSQYQASWT-DYMHFGECLMIAVRQDCRILDYKSAIVIEYDDS 404 (415) T ss_pred EEEeecceEEEEe-ccccCceEEEEEEEeccEEeccccEEEEEEecc Confidence 433322 233333 334456778899999999999999999997777 No 70 >protein:vir:41 Length: 299 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463467;swissprot:trembl:q9t1b7;genbank:gi:16798789;uniprot:Q9T1B7;genbank:GeneID:922353 Probab=99.64 E-value=1.8e-16 Score=106.86 Aligned_cols=264 Identities=10% Similarity=0.048 Sum_probs=174.7 Q ss_pred Cccc------hhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCcccceeecCCCCCCCccccccceE Q lcl|NC_019506. 1 MAVT------SFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGAITVKEYTENSDIDAPEELSTTEK 74 (276) Q Consensus 1 MA~~------~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~ 74 (276) |+.. .++|+.++.++.+.+++.+++.++++. .+ ..|.+.++|+.....+..+.++..... .+.+.+++ T Consensus 6 ~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~~~~~----~~-~~~~~~~~~~~~~~~a~~v~E~~~~~~-~~~~f~~v 79 (299) T protein:vir:41 6 DTTTMQSAKTGSIPINISEQIITGVKNGSAAMKLAKA----VP-MTKPEEEFTFMSGVGAFWVDEAERIQT-SKPTFTKA 79 (299) T ss_pred CcccccCCCceecchhHHHHHHHHHHhcchhhhhcee----ee-cCCCcEEEEEEcCCceeeeecCccccc-cccceeEE Confidence 3332 368999999999999999999988753 12 246778889887777777888777654 45777888 Q ss_pred EEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccc-c-----ccccccCCHHHHHHHHH Q lcl|NC_019506. 75 VLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATS-K-----LKPAATLDKTNIYEELI 148 (276) Q Consensus 75 ~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~-~-----~~~~~~~t~~~~~~~i~ 148 (276) ++...+. +.-+.|+++-...+..++.+.+.+..++++++++|+.++..-.+.... . ............++.|. T Consensus 80 ~l~~~k~-~~~~~is~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l~G~g~~~~~gil~~~~~~~~~~~~~~~~~~~l~ 158 (299) T protein:vir:41 80 KMRSKKM-GVIIPTTKENLNYSVTNFFSLMQAEIVEAFYKKFDQAVFTGVESPYNWNILKSATDASNLVEETANKYDDLN 158 (299) T ss_pred EEeeEEE-EEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccccccccccccceeeccccccHHHHH Confidence 8888654 555788887777778899999999999999999999888533221110 0 00011112223478888 Q ss_pred HHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEeccccccccceEEEEEecc Q lcl|NC_019506. 149 KVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSNNMGSLTNGTGAIAGVKM 228 (276) Q Consensus 149 ~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp~~~~~~~~~~~~~~ 228 (276) ++...+...+.+ +-.+++||..+..|.+...- ...+....... +-.++++|.+|+.++.+|..++....+.+.-+ T Consensus 159 ~~~~~l~~~~~~--~~~~v~n~~~~~~L~~lkd~--~G~~l~~~~~~-~~~~~l~G~PV~~~~~~~~~~~~~~~~~gdfs 233 (299) T protein:vir:41 159 EAIGLIEAEDLE--PNGIATIRKQRVKYRSTKDG--NGMPIFNTATS-NGVDDVLGLPIAYTPKYTFGDKDISELVGDWN 233 (299) T ss_pred HHHHhhhcccCC--cCEEEEcHHHHHHHHHhhcc--CCceeecCCcC-CCCceecceeeEEecccCCCCCceEEEEEecc Confidence 888888877764 33689999999999864321 11111222222 33468999999999999965544344444333 Q ss_pred eEEeee-eeeeeeeccCc--------c-----cc---eeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 229 ACTFAE-QIVQTEAYRME--------K-----RF---ADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 229 a~~~~~-~~~~~e~~~~~--------~-----~~---~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) -+.+.. +...++..++. + .| ...+++..++|.++.+|+++++++..+= T Consensus 234 ~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~A~~~l~~~aa 298 (299) T protein:vir:41 234 QAYYGILRGVEYEILTEATLTTVADETGKPLNLAERDMAAIKATFEVGFMVVKDEAFSAVQPKAG 298 (299) T ss_pred cEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeccC Confidence 222322 22233332221 1 11 2467888999999999999999986666 No 71 >protein:vir:80684 Length: 315 # NCBI annotation: gp6 # Family: family:all:966 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285582;genbank:gi:148727088;genbank:GeneID:5247055 Probab=99.63 E-value=1.6e-16 Score=107.14 Aligned_cols=267 Identities=14% Similarity=0.043 Sum_probs=165.3 Q ss_pred Cccc------hhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccC-cccceeecCCCCCCCccccccce Q lcl|NC_019506. 1 MAVT------SFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIG-AITVKEYTENSDIDAPEELSTTE 73 (276) Q Consensus 1 MA~~------~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~-~~~~~d~~~~~~~~~~~~~~~~~ 73 (276) ||.. .++|+.++.++++.+++.+++.+++.+- ...+..++||+.. ...+..+.++..... .+.+.++ T Consensus 1 Ma~~~~~~gg~~vP~~~~~~ii~~l~~~s~i~~l~~~i-----~~~~~~~~ip~~~~~~~a~wv~Eg~~~~~-s~~~f~~ 74 (315) T protein:vir:80 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQ-----PTIFGPVKGAVFSGVPRAKIVGEGEVKPS-ASVDVSA 74 (315) T ss_pred CCCCcCCcCceEcchHHHHHHHHHHHhhchhhhhccee-----ecCCCceEEEEEeCCcceEEeeCCccccc-cccceee Confidence 9964 4889999999999999999999987542 1234678899864 456777777776654 5667777 Q ss_pred EEEEEEeeeecceeechHHHHhhhhh----HHHHHHHHHHHHHHHHHHHHHHHHhhcccc--cc------cccc-ccCCH Q lcl|NC_019506. 74 KVLEINKQKYFNFQIDDVDAAQIRTP----LMDAAMQRAAYALADETEKILLKEMDTNAT--SK------LKPA-ATLDK 140 (276) Q Consensus 74 ~~~~ld~~~~~~~~v~d~d~~~~~~d----~~~~~~~~~~~ala~~~d~~~~~~~~~~~~--~~------~~~~-~~~t~ 140 (276) +++...+. +.-+.|+++-..++..+ +...+.++.++++++++|..++..-..... .. .... ..... T Consensus 75 v~l~~~kl-~~~~~iS~ell~~s~~~~~~~l~~~i~~~la~ai~~~~d~a~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~ 153 (315) T protein:vir:80 75 FTAQPIKV-VTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKAASAVHTSLNKTKNIVDAT 153 (315) T ss_pred eEeeeeeE-EeeehhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHhhheeeccCCCCCccccccccccccccceeecc Confidence 77776553 44467777755444433 557778899999999999988754221110 00 0000 01112 Q ss_pred HHHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhccccc--ccceeeeeeeEEeceEEEEecccccccc Q lcl|NC_019506. 141 TNIYEELIKVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMA--ESITKNGFVGTILGFDVYLSNNMGSLTN 218 (276) Q Consensus 141 ~~~~~~i~~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~--~~~~~~G~i~~~~G~~v~~s~~lp~~~~ 218 (276) ...+..|.++...+...+.-.. ...++||..+..|++.......+..+. ...+..|..++++|.+|+.++.+|.... T Consensus 154 ~~~~~d~~~~~~~~~~~~~~~~-~~~imn~~~~~~L~~l~~~~g~~~~g~~~~~~~~~g~~~tl~G~PV~~~~~~~~~~~ 232 (315) T protein:vir:80 154 DSATADLVKAVGLIAGAGLQVP-NGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPE 232 (315) T ss_pred ccchHHHHHHHHHHhhccCccc-eEEEEcHHHHHHHHHHhhccCCcccccccccccccCCCceecceeeEecCcCCcccc Confidence 2235677777777765544222 358899999999977543221111110 0124455667899999999999985432 Q ss_pred c-----eEEEEE--ecceEEeeeeeeeeeeccC--cc-----cc---eeeEEeeeeeeeEEEcCCeEEEEE-ecCC Q lcl|NC_019506. 219 G-----TGAIAG--VKMACTFAEQIVQTEAYRM--EK-----RF---ADAVKGLNVFGCKVIYPDALVCLK-KTNP 276 (276) Q Consensus 219 ~-----~~~~~~--~~~a~~~~~~~~~~e~~~~--~~-----~~---~~~i~~~~~yg~~v~~~~~vv~~~-~~~p 276 (276) . ...+++ .+-.++....+ .++..+. ++ .| ...+++..++|.+|++|+++++|+ +++| T Consensus 233 ~~~~~~~~~~~GDfs~~~~g~~~~~-~i~i~~~~~~~~~~~~~~~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~a~ 307 (315) T protein:vir:80 233 MSPASGVKAIVGDFSRVHWGFQRNF-PIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAP 307 (315) T ss_pred cccccccEEEEeecccEEEEEecCe-eEEEeccccccCcccchhhcCcEEEEEEEEecceeecccceEEEeeccCC Confidence 1 222222 22233343322 2222221 11 11 247788899999999999999999 5558 No 72 >protein:vir:94771 Length: 298 # NCBI annotation: major head protein # Family: family:all:966 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996706;genbank:gi:45597421;genbank:GeneID:2769044 Probab=99.63 E-value=3.5e-16 Score=105.35 Aligned_cols=264 Identities=12% Similarity=0.099 Sum_probs=173.0 Q ss_pred Cccc--hhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccC-cccceeecCCCCCCCccccccceEEEE Q lcl|NC_019506. 1 MAVT--SFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIG-AITVKEYTENSDIDAPEELSTTEKVLE 77 (276) Q Consensus 1 MA~~--~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~-~~~~~d~~~~~~~~~~~~~~~~~~~~~ 77 (276) ||.+ .++|+.+..++.+.+++.+++..++..- + ..+.+++||+.. ...+..+.++..... .+++.+.+++. T Consensus 1 ma~~gG~lip~~~~~~ii~~~~~~s~i~~~~~~~----~-~~~~~~~~p~~~~~~~a~~v~Eg~~~~~-~~~~f~~v~l~ 74 (298) T protein:vir:94 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQK----P-IPFNGEKVFTFTMDSEIDVVAESGKKTH-GGVTLAPQTMV 74 (298) T ss_pred CeeccccccChhHHHHHHHHHHhhchhhhhccee----e-ccCCceEEEEEecCcceEEeeCCccccc-cccceeEEEEe Confidence 9976 4889999999999999999998887532 1 223567888874 445777777766543 45666777777 Q ss_pred EEeeeecceeechHHHH---hhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccc----c----------ccccccCCH Q lcl|NC_019506. 78 INKQKYFNFQIDDVDAA---QIRTPLMDAAMQRAAYALADETEKILLKEMDTNATS----K----------LKPAATLDK 140 (276) Q Consensus 78 ld~~~~~~~~v~d~d~~---~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~----~----------~~~~~~~t~ 140 (276) ..+. +.-+.|+++-.. .+..++.+.+.++.++++++++|..++......... . ......... T Consensus 75 ~~k~-~~~~~iS~ell~~~~~~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~ 153 (298) T protein:vir:94 75 PIKV-EYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGI 153 (298) T ss_pred eeEE-EEeeehhHHHhccCCccHHHHHHHHHHHHHHHHHHHHHHHhhcccccCCCccccccccccccccccccccccccc Confidence 6554 444677777553 244678888999999999999999988653221100 0 000111223 Q ss_pred HHHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEecccccccc-- Q lcl|NC_019506. 141 TNIYEELIKVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSNNMGSLTN-- 218 (276) Q Consensus 141 ~~~~~~i~~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp~~~~-- 218 (276) ..+++.|.++...+...+... ..+++||..+..|.+...- ...+...+....|..++++|++|+.++.+|.... T Consensus 154 ~~~~~~i~~~~~~~~~~~~~~--~~~vmn~~~~~~l~~lkd~--~G~~l~~~~~~~~~~~tl~G~PV~~~~~v~~~~~~~ 229 (298) T protein:vir:94 154 ADPNGAIENAVELLTGVDADV--TGIAINPSFRSALAKQKDL--QGNALFPELKWGATPDTINGLPVDVNKTVSDMSLTQ 229 (298) T ss_pred ccHHHHHHHHHHhhhhcCCCc--cEEEEcHHHHHHHHHhhcc--CCCeeecCcccCCCCceecceeeEEecccccccCCC Confidence 345778999998888877533 3699999999999764321 2223334455667778999999999999985432 Q ss_pred ceEEEEEe-cceEEeee-eeeeeee--ccCcc-----cc---eeeEEeeeeeeeEEEcCCeEEEEEecC Q lcl|NC_019506. 219 GTGAIAGV-KMACTFAE-QIVQTEA--YRMEK-----RF---ADAVKGLNVFGCKVIYPDALVCLKKTN 275 (276) Q Consensus 219 ~~~~~~~~-~~a~~~~~-~~~~~e~--~~~~~-----~~---~~~i~~~~~yg~~v~~~~~vv~~~~~~ 275 (276) ....+.+. ..++.+.. +...++. +-+++ .| ...+++..++|..+++|++++.++.+. T Consensus 230 ~~~~~~Gdfs~~~~~~~~~~~~~~~~~~~~~d~~~~~~f~~~~v~~r~~~r~~~~~~~~~a~~~l~~~t 298 (298) T protein:vir:94 230 RDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) T ss_pred ccEEEEeeccceEEEEEecCceEEEeecCCCcCcchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 22333332 23333322 2222222 21221 12 136788899999999999999999777 No 73 >protein:vir:105905 Length: 304 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004375;genbank:gi:122891830;genbank:GeneID:4712376 Probab=99.62 E-value=3.7e-16 Score=105.21 Aligned_cols=259 Identities=11% Similarity=0.091 Sum_probs=172.6 Q ss_pred Cccc--------------hhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCc-ccceeecCCCCCCC Q lcl|NC_019506. 1 MAVT--------------SFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGA-ITVKEYTENSDIDA 65 (276) Q Consensus 1 MA~~--------------~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~-~~~~d~~~~~~~~~ 65 (276) ||.. .++|+.+...+.+.+++.+++.+++..- + -.+.+++||+... ..+..+.++..... T Consensus 1 ma~~~~~~~~~~~t~~gg~lip~~~~~~ii~~~~~~~~l~~~~~~~----~-~~~~~~~ip~~~~~~~a~~v~E~~~~~~ 75 (304) T protein:vir:10 1 MATPTYTPGNVILSDFKNGVIPAEQGTLIMKDIMANSAIMKLAKNE----P-MTAQKKKFTYLAKGVGAYWVSETERIQT 75 (304) T ss_pred CcccccccccccccCCCceecchhHHHHHHHHHHhccchhhhccee----e-ccCCceEEEEEeCCcceEEeecCccccc Confidence 5542 3689999999999999999998887542 1 2356688998743 45666777766544 Q ss_pred ccccccceEEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc----------cc-c Q lcl|NC_019506. 66 PEELSTTEKVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSK----------LK-P 134 (276) Q Consensus 66 ~~~~~~~~~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~----------~~-~ 134 (276) .+.+.+++++.+.+. +.-+.|+++-...+..++.+.+.++.++++++++|..++..-.+..... .. . T Consensus 76 -~~~~~~~i~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~ 153 (304) T protein:vir:10 76 -SKPEYAQAEMEAKKI-GVIIPLSKEFLKWTAKDFFNEVKPLIAEAFYKAFDQAVIFGTKSPYNTSTSGKPLVEGAEEKG 153 (304) T ss_pred -ccceeeEEEEEEEEE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhheeccCCCcccccccccccccccccc Confidence 457777778877664 4447888877777888999999999999999999999875433211100 00 1 Q ss_pred cccCCHHHHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEecccc Q lcl|NC_019506. 135 AATLDKTNIYEELIKVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSNNMG 214 (276) Q Consensus 135 ~~~~t~~~~~~~i~~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp 214 (276) .+..+....++.|.++...+....... ..+++||..+..|.+.. . .. + ..+..+..++++|.+|+.++++| T Consensus 154 ~~~~~~~~~~~~i~~~~~~l~~~~~~~--~~~v~~~~~~~~L~~lk---d--~~-G-~~l~~~~~~~l~G~PV~~~~~~~ 224 (304) T protein:vir:10 154 NVVTDTNNLYVDLSALMATIEDEELDP--NGVLTTRSFRSKMRNAL---D--AN-D-RPLFDANGNEIMGLPLSYTGADV 224 (304) T ss_pred cccccccchHHHHHHHHHHhhhccCCc--CEEEEcHHHHHHHHHhh---c--cC-C-cEeecCCCccccceeeEEecccc Confidence 111233445888999988888776533 36899999999997632 1 11 1 22344456789999999999999 Q ss_pred ccccceEEEEEecceEEeee-eeeeeeecc----------Ccc-----cc---eeeEEeeeeeeeEEEcCCeEEEEEecC Q lcl|NC_019506. 215 SLTNGTGAIAGVKMACTFAE-QIVQTEAYR----------MEK-----RF---ADAVKGLNVFGCKVIYPDALVCLKKTN 275 (276) Q Consensus 215 ~~~~~~~~~~~~~~a~~~~~-~~~~~e~~~----------~~~-----~~---~~~i~~~~~yg~~v~~~~~vv~~~~~~ 275 (276) ...+....+.+..+-+.+.. +...++..+ +.+ .| -..+++..++|..+++|+++++|+.+= T Consensus 225 ~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v~~~~a~~~l~~a~ 304 (304) T protein:vir:10 225 YDKKKSLALMGDWDYARYGILQGIEYAISEDATLTTLQASDASGQPVSLFERDMFALRATMHIAYMNVKPEAFATLKPTE 304 (304) T ss_pred cCCCCcEEEEEehhhEEEEEecceEEEEeecceeeeecccccCccchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 76555444444333222221 111222111 111 12 146788899999999999999999888 No 74 >protein:vir:94142 Length: 304 # NCBI annotation: ORF013 # Family: family:all:507 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240234;genbank:gi:66395898;genbank:GeneID:5133311 Probab=99.62 E-value=3.7e-16 Score=105.21 Aligned_cols=259 Identities=11% Similarity=0.091 Sum_probs=172.6 Q ss_pred Cccc--------------hhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCc-ccceeecCCCCCCC Q lcl|NC_019506. 1 MAVT--------------SFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGA-ITVKEYTENSDIDA 65 (276) Q Consensus 1 MA~~--------------~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~-~~~~d~~~~~~~~~ 65 (276) ||.. .++|+.+...+.+.+++.+++.+++..- + -.+.+++||+... ..+..+.++..... T Consensus 1 ma~~~~~~~~~~~t~~gg~lip~~~~~~ii~~~~~~~~l~~~~~~~----~-~~~~~~~ip~~~~~~~a~~v~E~~~~~~ 75 (304) T protein:vir:94 1 MATPTYTPGNVILSDFKNGVIPAEQGTLIMKDIMANSAIMKLAKNE----P-MTAQKKKFTYLAKGVGAYWVSETERIQT 75 (304) T ss_pred CcccccccccccccCCCceecchhHHHHHHHHHHhccchhhhccee----e-ccCCceEEEEEeCCcceEEeecCccccc Confidence 5542 3689999999999999999998887542 1 2356688998743 45666777766544 Q ss_pred ccccccceEEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc----------cc-c Q lcl|NC_019506. 66 PEELSTTEKVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSK----------LK-P 134 (276) Q Consensus 66 ~~~~~~~~~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~----------~~-~ 134 (276) .+.+.+++++.+.+. +.-+.|+++-...+..++.+.+.++.++++++++|..++..-.+..... .. . T Consensus 76 -~~~~~~~i~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~ 153 (304) T protein:vir:94 76 -SKPEYAQAEMEAKKI-GVIIPLSKEFLKWTAKDFFNEVKPLIAEAFYKAFDQAVIFGTKSPYNTSTSGKPLVEGAEEKG 153 (304) T ss_pred -ccceeeEEEEEEEEE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhheeccCCCcccccccccccccccccc Confidence 457777778877664 4447888877777888999999999999999999999875433211100 00 1 Q ss_pred cccCCHHHHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEecccc Q lcl|NC_019506. 135 AATLDKTNIYEELIKVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSNNMG 214 (276) Q Consensus 135 ~~~~t~~~~~~~i~~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp 214 (276) .+..+....++.|.++...+....... ..+++||..+..|.+.. . .. + ..+..+..++++|.+|+.++++| T Consensus 154 ~~~~~~~~~~~~i~~~~~~l~~~~~~~--~~~v~~~~~~~~L~~lk---d--~~-G-~~l~~~~~~~l~G~PV~~~~~~~ 224 (304) T protein:vir:94 154 NVVTDTNNLYVDLSALMATIEDEELDP--NGVLTTRSFRSKMRNAL---D--AN-D-RPLFDANGNEIMGLPLSYTGADV 224 (304) T ss_pred cccccccchHHHHHHHHHHhhhccCCc--CEEEEcHHHHHHHHHhh---c--cC-C-cEeecCCCccccceeeEEecccc Confidence 111233445888999988888776533 36899999999997632 1 11 1 22344456789999999999999 Q ss_pred ccccceEEEEEecceEEeee-eeeeeeecc----------Ccc-----cc---eeeEEeeeeeeeEEEcCCeEEEEEecC Q lcl|NC_019506. 215 SLTNGTGAIAGVKMACTFAE-QIVQTEAYR----------MEK-----RF---ADAVKGLNVFGCKVIYPDALVCLKKTN 275 (276) Q Consensus 215 ~~~~~~~~~~~~~~a~~~~~-~~~~~e~~~----------~~~-----~~---~~~i~~~~~yg~~v~~~~~vv~~~~~~ 275 (276) ...+....+.+..+-+.+.. +...++..+ +.+ .| -..+++..++|..+++|+++++|+.+= T Consensus 225 ~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v~~~~a~~~l~~a~ 304 (304) T protein:vir:94 225 YDKKKSLALMGDWDYARYGILQGIEYAISEDATLTTLQASDASGQPVSLFERDMFALRATMHIAYMNVKPEAFATLKPTE 304 (304) T ss_pred cCCCCcEEEEEehhhEEEEEecceEEEEeecceeeeecccccCccchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 76555444444333222221 111222111 111 12 146788899999999999999999888 No 75 >protein:vir:96223 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239571;genbank:gi:66395304;genbank:GeneID:5132771 Probab=99.62 E-value=3.4e-16 Score=105.42 Aligned_cols=259 Identities=11% Similarity=0.052 Sum_probs=171.1 Q ss_pred Cc-c--chhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCc-ccceeecCCCCCCCccccccceEEE Q lcl|NC_019506. 1 MA-V--TSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGA-ITVKEYTENSDIDAPEELSTTEKVL 76 (276) Q Consensus 1 MA-~--~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~-~~~~d~~~~~~~~~~~~~~~~~~~~ 76 (276) |+ . ..++|+.|+.++.+.+++.+++.+++..- + ..|.+++||+... ..+..+.++..... .+++.+.+++ T Consensus 30 ~~~~~~~~lip~~~~~~ii~~~~~~s~l~~l~~~~----~-~~~~~~~~p~~~~~~~a~~v~Eg~~~~~-~~~~f~~v~~ 103 (324) T protein:vir:96 30 MMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYE----P-MEGTEKKFTFWADKPGAYWVGEGQKIET-SKATWVNATM 103 (324) T ss_pred cccCCCcceechhHHHHHHHHHHhhchhhhhccee----e-ccCCceEEEEEecCcceeeecCCccccc-cccceeEEEE Confidence 33 2 24789999999999999999999987542 1 2356788998743 45666677766654 5677778888 Q ss_pred EEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc------cccccCCHHHHHHHHHHH Q lcl|NC_019506. 77 EINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSKL------KPAATLDKTNIYEELIKV 150 (276) Q Consensus 77 ~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~~------~~~~~~t~~~~~~~i~~a 150 (276) ...+. +.-+.|+++-...+..++.+.+.++.++++++++|+.++..-.+.....+ .......+...++.|.++ T Consensus 104 ~~~k~-~~~~~is~ell~ds~~~l~~~i~~~l~~aia~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~ 182 (324) T protein:vir:96 104 RAFKL-GVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIKKTNKVIKGDFTQDNIIDL 182 (324) T ss_pred EeEEE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCCcCccccccccccceecccccchHHHHHH Confidence 87654 44578888777777789999999999999999999988754322211111 001111122347788888 Q ss_pred HHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEeccccccccceEEEEEecceE Q lcl|NC_019506. 151 KVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSNNMGSLTNGTGAIAGVKMAC 230 (276) Q Consensus 151 ~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp~~~~~~~~~~~~~~a~ 230 (276) ...+....... -.+++||..+..|.+... ..+...+..|..++++|++|+.++..+.. ....+++..+.+ T Consensus 183 ~~~i~~~~~~~--~~~i~n~~~~~~L~~lkd------~~G~~~~~~~~~~~l~G~PV~~~~~~~~~--~~~~~~gd~s~~ 252 (324) T protein:vir:96 183 EALLEDDELEA--NAFISKTQNRSLLRKIVD------PETKERIYDRNSDSLDGLPVVNLKSSNLK--RGELITGDFDKL 252 (324) T ss_pred HHhhhhccCCC--CEEEEcHHHHHHHHHhhC------CCCCeeecCCCCCcccceeeEeecCCCCC--cceEEEEecceE Confidence 88887766432 368999999998875421 12233455666778999999987766542 223444433333 Q ss_pred Eeee-eeeeeeeccC--------cc-----cc---eeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 231 TFAE-QIVQTEAYRM--------EK-----RF---ADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 231 ~~~~-~~~~~e~~~~--------~~-----~~---~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) .+.. +...++..+. ++ .| ...+++..++|.++++|+++++|+.+.| T Consensus 253 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~n~v~~r~~~r~d~~v~~~~a~~~l~~a~~ 315 (324) T protein:vir:96 253 IYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADK 315 (324) T ss_pred EEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEecccc Confidence 3322 2222332222 11 12 2577888999999999999999999988 No 76 >protein:vir:78523 Length: 338 # NCBI annotation: Putative head structural protein # Family: family:all:507 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491585;genbank:gi:157786408;genbank:GeneID:5625675 Probab=99.62 E-value=5.5e-16 Score=104.24 Aligned_cols=268 Identities=13% Similarity=0.064 Sum_probs=168.2 Q ss_pred Ccc---------------------chhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCc-------- Q lcl|NC_019506. 1 MAV---------------------TSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGA-------- 51 (276) Q Consensus 1 MA~---------------------~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~-------- 51 (276) ||+ ..++|+.|+.++.+.+++.+++..++..- .-.+..+++|+... T Consensus 1 ~~~~~e~~~~~~~~~~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~l~~~~-----~~~~~~~~ip~~~~~~~a~~v~ 75 (338) T protein:vir:78 1 MATLNELAPNTAGSNHQGRLAHVPSDLLPKEIVGPIFDKAQESSLVLRLGENI-----PISYGETIIPTTVKRPEVGQVG 75 (338) T ss_pred CcchHHhhhhhcccccccceecccccccchHHHHHHHHHHHhhchhhhhccee-----eccCCceEEEEEecCccceeec Confidence 111 12789999999999999999999987541 23467888887632 Q ss_pred -ccceeecCCCCCCCccccccceEEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccc Q lcl|NC_019506. 52 -ITVKEYTENSDIDAPEELSTTEKVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATS 130 (276) Q Consensus 52 -~~~~d~~~~~~~~~~~~~~~~~~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~ 130 (276) ..+....++..... .+++...+++...+. +.-+.|+++-...+..++.+.+.++.++++++++|+.++..-.+.... T Consensus 76 ~~~~~~~~Eg~~~~~-~~~~f~~v~l~~~k~-~~~~~is~ell~ds~~~~~~~i~~~la~a~~~~~d~~~l~G~g~~~~~ 153 (338) T protein:vir:78 76 VGTSNEQREGGTKPL-SGTAWDTRSVAPIKL-ATIVTVSEEFARMNPSGLYTKLQADLAYAIGRGIDLAVFHGKSPLTGS 153 (338) T ss_pred ccccccccccccccc-cccceeEEEEEEEEE-EEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCccc Confidence 22333344444432 445666677776553 445678887777777899999999999999999999988644322110 Q ss_pred ------------cc--cccccCCHHHHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhc-cccccccee Q lcl|NC_019506. 131 ------------KL--KPAATLDKTNIYEELIKVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTG-GAMAESITK 195 (276) Q Consensus 131 ------------~~--~~~~~~t~~~~~~~i~~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~-~~~~~~~~~ 195 (276) .. ......+....++.|.++...+.... ......++++|..+..|.+...+.+.+ .+....... T Consensus 154 ~~~gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~m~~~~~~~L~~~~~l~d~~g~~l~~~~~~ 232 (338) T protein:vir:78 154 ALQGIDTNNVIVNTTNVDYLQTGTTPLLDRFLDGYDLVSANT-DVDFNGWAADPRYRARLLRSQAYRDANGNVDPTRINL 232 (338) T ss_pred cccccccccccccccccccccccchhhHHHHHHHHHHhhhhc-cccceEEEEchHHHHHHHHHhhhccCCCceeeccccc Confidence 00 01112223345778888877765433 223346899999999997755443322 233344456 Q ss_pred eeeeeEEeceEEEEeccccccc-----cceEEEEEecceEEeeee-eeeeeeccC--------c-----ccc---eeeEE Q lcl|NC_019506. 196 NGFVGTILGFDVYLSNNMGSLT-----NGTGAIAGVKMACTFAEQ-IVQTEAYRM--------E-----KRF---ADAVK 253 (276) Q Consensus 196 ~G~i~~~~G~~v~~s~~lp~~~-----~~~~~~~~~~~a~~~~~~-~~~~e~~~~--------~-----~~~---~~~i~ 253 (276) .|..++++|++|+.++.+|... ....++.+.-+.+.+..+ ...++..+. + ..| -..++ T Consensus 233 ~~~~~~l~G~PV~~~~~ip~~~~~~~~~~~~~~~gdfs~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r 312 (338) T protein:vir:78 233 AASAGDLLGLPVQFGKAVGGDLGAATDSKVRVVGGDFSQLKYGFADEIRVKMSDTATLTDNTSPTPQTVSMWQTNQIAIL 312 (338) T ss_pred CCCCceeeeeeEEEccccCccccccCCcccEEEEEecceEEEEeecccEEEEeecccccccccccccchhhhhcCcEEEE Confidence 6777899999999999998532 112233333222222211 112222211 1 112 14678 Q ss_pred eeeeeeeEEEcCCeEEEEE-ecCC Q lcl|NC_019506. 254 GLNVFGCKVIYPDALVCLK-KTNP 276 (276) Q Consensus 254 ~~~~yg~~v~~~~~vv~~~-~~~p 276 (276) +..++|.+++||+++++|+ .++| T Consensus 313 ~~~r~d~~v~~~~a~~~l~~~~~~ 336 (338) T protein:vir:78 313 IEVTFGWLLGDKQAFVKFVDDEDP 336 (338) T ss_pred EEEEeccEeecccceEEEecccCC Confidence 8899999999999998877 7788 No 77 >protein:vir:1638 Length: 298 # NCBI annotation: Structural protein # Family: family:all:966 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695059;genbank:gi:23455750;genbank:GeneID:955469 Probab=99.62 E-value=4.3e-16 Score=104.82 Aligned_cols=264 Identities=11% Similarity=0.092 Sum_probs=171.1 Q ss_pred Cccc--hhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccC-cccceeecCCCCCCCccccccceEEEE Q lcl|NC_019506. 1 MAVT--SFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIG-AITVKEYTENSDIDAPEELSTTEKVLE 77 (276) Q Consensus 1 MA~~--~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~-~~~~~d~~~~~~~~~~~~~~~~~~~~~ 77 (276) ||.+ .++|+.+..++.+.+++.+++.+++..- + ..+..++||+.. ...+..+.++..... .+++...+++. T Consensus 1 ma~~gG~lvp~~~~~~ii~~~~~~s~i~~l~~~~----~-~~~~~~~ip~~~~~~~a~~v~E~~~~~~-~~~~f~~v~l~ 74 (298) T protein:vir:16 1 MVLNKGTLFDPTLVTDLISKVAGKSSIARLSAQK----P-IPFNGEKVFTFTMDSEIDVVAESGKKTH-GGVTLAPQTMV 74 (298) T ss_pred CcccCcceechhHHHHHHHHHHhhhhhhhhccee----e-ccCCceEEEEEecCcceEEecCCccccc-cccceeEEEEe Confidence 9975 4788888999999999999999887532 1 123457788764 345777777766554 45666666666 Q ss_pred EEeeeecceeechHHHH---hhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccc---------c---c--cccccCCH Q lcl|NC_019506. 78 INKQKYFNFQIDDVDAA---QIRTPLMDAAMQRAAYALADETEKILLKEMDTNATS---------K---L--KPAATLDK 140 (276) Q Consensus 78 ld~~~~~~~~v~d~d~~---~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~---------~---~--~~~~~~t~ 140 (276) ..+. +.-+.|+++-.. .+..++.+.+.++.++++++++|..++......... . . ........ T Consensus 75 ~~k~-a~~~~iS~ell~~s~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~ 153 (298) T protein:vir:16 75 PIKV-EYGARISDEFMYASDEEKINILQEFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGI 153 (298) T ss_pred eeeE-EEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHHhhccccCCCCccccccccccccccccccccccccc Confidence 6543 334677777653 345688889999999999999999998653221100 0 0 00111122 Q ss_pred HHHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEecccccccc-- Q lcl|NC_019506. 141 TNIYEELIKVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSNNMGSLTN-- 218 (276) Q Consensus 141 ~~~~~~i~~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp~~~~-- 218 (276) .+.++.|.++...+...+.+.. .+++||..+..|.+...- ...+........|..++++|.+|+.++.+|.... T Consensus 154 ~~~~~~i~~~~~~~~~~~~~~~--~~vmn~~~~~~l~~lkd~--~G~~i~~~~~~~~~~~~l~G~PV~~~~~v~~~~~~~ 229 (298) T protein:vir:16 154 ADPNGAIENAVELLTGVDADVT--GIAINPSFRSALAKQKDL--QDNALFPELKWGATPDTINGLPVDVNKTVSDMSLTQ 229 (298) T ss_pred ccHHHHHHHHHHHhhhcCCCcc--EEEEcHHHHHHHHHhhcc--CCCeeecCcccCCCCceecceeeEEecccccccCCC Confidence 3346778888888887776433 588999999999765321 1222334455667778999999999999986432 Q ss_pred ceEEEEE-ecceEEee-eeeeeeeecc--Ccc-----cc---eeeEEeeeeeeeEEEcCCeEEEEEecC Q lcl|NC_019506. 219 GTGAIAG-VKMACTFA-EQIVQTEAYR--MEK-----RF---ADAVKGLNVFGCKVIYPDALVCLKKTN 275 (276) Q Consensus 219 ~~~~~~~-~~~a~~~~-~~~~~~e~~~--~~~-----~~---~~~i~~~~~yg~~v~~~~~vv~~~~~~ 275 (276) ....+++ -+.++.+. .+..+++..+ +++ .| -..+++..++|.++++|++++.|+.+. T Consensus 230 ~~~~~~GDfs~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~ra~~r~d~~v~~~~a~~~l~~at 298 (298) T protein:vir:16 230 RDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) T ss_pred ccEEEEeeccceEEEEEecCceEEEeeccCCcCcchhhhhcCcEEEEEEEEEccEeecccceEEEeecC Confidence 2233333 23333333 2222232222 221 12 146788899999999999999999877 No 78 >protein:vir:9309 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803287;genbank:gi:29028597;genbank:GeneID:1258044 Probab=99.61 E-value=5.6e-16 Score=104.19 Aligned_cols=259 Identities=10% Similarity=0.052 Sum_probs=171.9 Q ss_pred Cc-c--chhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccC-cccceeecCCCCCCCccccccceEEE Q lcl|NC_019506. 1 MA-V--TSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIG-AITVKEYTENSDIDAPEELSTTEKVL 76 (276) Q Consensus 1 MA-~--~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~-~~~~~d~~~~~~~~~~~~~~~~~~~~ 76 (276) |+ . ..++|+.|..++.+.+++.+++.+++..- + ..|.+++||+.. ...+..+.++..... ..++.+.+++ T Consensus 30 ~~~~~~~~liP~~~~~~ii~~~~~~s~l~~l~~~~----~-~~~~~~~ip~~~~~~~a~~v~Eg~~~~~-~~~~f~~i~~ 103 (324) T protein:vir:93 30 MMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYE----P-MEGTEKKFTFWADKPGAYWVGEGQKIET-SKATWVNATM 103 (324) T ss_pred cccCCCcceechhHHHHHHHHHHhhchhhhhccee----e-ccCCceEEEEEecCcceeeecCCccccc-cccceeEEEE Confidence 22 1 24789999999999999999999987542 1 235678898873 445666777777654 4677777888 Q ss_pred EEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc------cccccCCHHHHHHHHHHH Q lcl|NC_019506. 77 EINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSKL------KPAATLDKTNIYEELIKV 150 (276) Q Consensus 77 ~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~~------~~~~~~t~~~~~~~i~~a 150 (276) +..+. +.-+.|+++-...+..++.+.+.++.++++++++|+.++..-.+.....+ .......+...++.|.++ T Consensus 104 ~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~ 182 (324) T protein:vir:93 104 RAFKL-GVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDL 182 (324) T ss_pred EeEEE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCccccccccccceeccccccHHHHHHH Confidence 77554 45578888777777789999999999999999999988754332211111 001111122347788888 Q ss_pred HHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEeccccccccceEEEEEecceE Q lcl|NC_019506. 151 KVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSNNMGSLTNGTGAIAGVKMAC 230 (276) Q Consensus 151 ~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp~~~~~~~~~~~~~~a~ 230 (276) ...+...+.. ...++++|..+..|.+.. + ..+...+..|..++++|.+|+.++..+.. ....+.+..+-+ T Consensus 183 ~~~l~~~~~~--~~~~v~n~~~~~~L~~l~---d---~~G~~~~~~~~~~~l~G~PVv~~~~~~~~--~~~i~~gdfs~~ 252 (324) T protein:vir:93 183 EALLEDDELE--ANAFISKTQNRSLLRKIV---D---PETKERIYDRNSDSLDGLPVVNLKSSNLK--RGELITGDFDKL 252 (324) T ss_pred HHhhhhccCC--CCEEEEcHHHHHHHHHhh---C---CCCCeeecCCCCCcccceeeEeecCCCCC--cceEEEEecceE Confidence 8888877643 236899999999987532 1 12234455667788999999987765532 223333333333 Q ss_pred Eeee-eeeeeeeccCc--------c-----cc---eeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 231 TFAE-QIVQTEAYRME--------K-----RF---ADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 231 ~~~~-~~~~~e~~~~~--------~-----~~---~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) .+.. +...++..++. + .| ...+++..++|..+++|+++++|+.+.+ T Consensus 253 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~n~~~~r~~~r~d~~v~~~~a~~~l~~a~~ 315 (324) T protein:vir:93 253 IYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADK 315 (324) T ss_pred EEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEecccc Confidence 3322 12233332221 1 12 2678899999999999999999998888 No 79 >protein:vir:97148 Length: 324 # NCBI annotation: ORF010 # Family: family:all:507 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239726;genbank:gi:66394880;genbank:GeneID:5130881 Probab=99.61 E-value=6.7e-16 Score=103.76 Aligned_cols=259 Identities=10% Similarity=0.047 Sum_probs=170.9 Q ss_pred Ccc--chhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccC-cccceeecCCCCCCCccccccceEEEE Q lcl|NC_019506. 1 MAV--TSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIG-AITVKEYTENSDIDAPEELSTTEKVLE 77 (276) Q Consensus 1 MA~--~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~-~~~~~d~~~~~~~~~~~~~~~~~~~~~ 77 (276) ++. ..++|+.|...+.+.+++.+++.+++.+- + ..|.++++|+.. ...+..+.++..... .+.+...++++ T Consensus 31 ~~~~~~~~iP~~~~~~ii~~~~~~s~l~~~~~~~----~-~~~~~~~ip~~~~~~~a~~v~Eg~~~~~-~~~~f~~v~~~ 104 (324) T protein:vir:97 31 MHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYE----P-MEGTEKKFTFWADKPGAYWVGEGQKIET-SKATWVNATMR 104 (324) T ss_pred ccCCCcceechhHHHHHHHHHHhhcchhhhccee----e-ccCCceEEEEEecCcceeEeccCccccc-cccceeEEEEe Confidence 222 24789999999999999999999987542 1 235678999874 345667777776554 56777777777 Q ss_pred EEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc------cccccCCHHHHHHHHHHHH Q lcl|NC_019506. 78 INKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSKL------KPAATLDKTNIYEELIKVK 151 (276) Q Consensus 78 ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~~------~~~~~~t~~~~~~~i~~a~ 151 (276) ..+. +.-+.|+++-...+..++...+.++.++++++++|+.++..-.......+ .......+...++.|.++. T Consensus 105 ~~k~-~~~~~is~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l~G~g~~~~~~gi~~~~~~~~~~~~~~~~~~~i~~~~ 183 (324) T protein:vir:97 105 AFKL-GVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLE 183 (324) T ss_pred eEEE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccCccccccccccceeccccCCHHHHHHHH Confidence 7554 45578888767677789999999999999999999998865433221111 0011111223477888888 Q ss_pred HHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEeccccccccceEEEEEecceEE Q lcl|NC_019506. 152 VKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSNNMGSLTNGTGAIAGVKMACT 231 (276) Q Consensus 152 ~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp~~~~~~~~~~~~~~a~~ 231 (276) ..+...+... -.+++||..+..|.+... . .+...+..+.-+.++|++|+.++..+... ...+.+..+-+. T Consensus 184 ~~l~~~~~~~--~~~v~n~~~~~~L~~lkd---~---~g~~~~~~~~~~tl~G~PV~~~~~~~~~~--~~~~~gd~~~~~ 253 (324) T protein:vir:97 184 ALLEDDELEA--NAFISKTQNRSLLRKIVD---P---ETKERIYDRNSDTLDGLPVVNLKSSNLKR--GELITGDFDKLI 253 (324) T ss_pred HhhhhccCCC--CEEEEcHHHHHHHHHhhc---C---CCceeecCCCCccccceeeEeecCCCCCc--ceEEEEecccEE Confidence 8888776432 368999999998865321 1 12334445556789999999987765422 223333322233 Q ss_pred eeee-eeeeeeccCc--------c-----cc---eeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 232 FAEQ-IVQTEAYRME--------K-----RF---ADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 232 ~~~~-~~~~e~~~~~--------~-----~~---~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) +..+ ...++..++. + .| ...+++..++|+++.+|+++++|+.+.| T Consensus 254 i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~a~~~l~~~~~ 315 (324) T protein:vir:97 254 YGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADK 315 (324) T ss_pred EEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeccC Confidence 3322 2233332221 1 12 2577888999999999999999998888 No 80 >protein:vir:485 Length: 407 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543092;swissprot:trembl:q8w627;genbank:gi:18249904;uniprot:Q8W627;genbank:GeneID:929693 Probab=99.61 E-value=5e-16 Score=104.47 Aligned_cols=265 Identities=15% Similarity=0.045 Sum_probs=168.2 Q ss_pred Ccc------chhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEecc-CcccceeecCCCCCCCccccccce Q lcl|NC_019506. 1 MAV------TSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQI-GAITVKEYTENSDIDAPEELSTTE 73 (276) Q Consensus 1 MA~------~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~-~~~~~~d~~~~~~~~~~~~~~~~~ 73 (276) |.. -.++|+.|..++++.+++.+++.++++.- + ..+.++.+|+. +...+....++.........+... T Consensus 106 ~~~~t~~~gG~~iP~~~~~~I~~~~~~~~~l~~~~~~~----~-~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~f~~ 180 (407) T protein:vir:48 106 LQVGNDEDGGYAIPEELDRTILTLLKDEVVMRQEATVI----T-LGGSDYKKLVNLGGTTSGWVGETDARPETATSKLGL 180 (407) T ss_pred hhcccCCCCcccccHhHHHHHHHHHHhhhhhhhhceee----e-cCCCceEEEEecCCcceeeeccccccccccccccee Confidence 222 24789999999999999999988887531 1 22446777754 334566666666543323345666 Q ss_pred EEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcc---------ccccc----------cc Q lcl|NC_019506. 74 KVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTN---------ATSKL----------KP 134 (276) Q Consensus 74 ~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~---------~~~~~----------~~ 134 (276) +++.+.+. +.-+.|+++-...+..++.+.+.++.+++++.++|..++..=.+. ..... .. T Consensus 181 i~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~~~a~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~ 259 (407) T protein:vir:48 181 IEPFMGEI-YGNPQATQKMLDDAFFNVEDWINSELALEFAEQEEIAFTSGDGSKKPKGFLAYESTDEDDKTRAFGKLQHI 259 (407) T ss_pred EEeeeeee-EeehhhHHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhhccCCCCccceeeeccccccccccccccccccc Confidence 77777554 333678888777788899999999999999999999877431110 00000 00 Q ss_pred cccCCHHHHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEecccc Q lcl|NC_019506. 135 AATLDKTNIYEELIKVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSNNMG 214 (276) Q Consensus 135 ~~~~t~~~~~~~i~~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp 214 (276) .+...+...++.|.++...|..... .+-..++||..+..|.+...- ...+.....+..|..++++|.+|+.++.+| T Consensus 260 ~~~~~~~~~~d~i~~l~~~l~~~~~--~~a~~v~n~~~~~~L~~lkD~--~Gr~l~~~~~~~g~~~~l~G~PV~~~~~~p 335 (407) T protein:vir:48 260 ASGAASGVTADAIIKLIYTLRKAHR--SGAKFMMNNSSLFAIRLLKDN--DGNYLWRPGIELGQPSSLAGYGIVENEQMP 335 (407) T ss_pred ccccccccChHHHHHHHHhhchhhh--cCCEEEEcHHHHHHHHHhhcc--CCceeeccCcCCCCCceecceeeEEecCcC Confidence 0111122236778887777766543 233578999999988664321 112223344567788899999999999999 Q ss_pred ccccceEEEEE--ecceEEeeeeeeeeeeccCcc--cceeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 215 SLTNGTGAIAG--VKMACTFAEQIVQTEAYRMEK--RFADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 215 ~~~~~~~~~~~--~~~a~~~~~~~~~~e~~~~~~--~~~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) ....+..++.+ -+.++.+..+. .++..+++. .--..+++..++|+++++|+++++++.++. T Consensus 336 ~~~~~~~~i~~Gd~~~~~~i~~~~-~~~i~~d~~~~~~~~~~~~~~r~d~~v~~~~a~~~l~~~aa 400 (407) T protein:vir:48 336 DIAADAKAIAFGNFKRGYTIVDRI-GTRILRDPYTNKPFVGFYTTKRTGGMLVDSQAIKLMKIGAA 400 (407) T ss_pred CccCCccEEEEEeccccEEEEEee-ceEEEeeccccCCcEEEEEEEEeccEEecccceEEEEeecc Confidence 76655544332 22333333222 122222222 122467888999999999999999998888 No 81 >protein:vir:99749 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004307;genbank:gi:122891761;genbank:GeneID:4712304 Probab=99.60 E-value=9.2e-16 Score=103.02 Aligned_cols=259 Identities=11% Similarity=0.053 Sum_probs=171.1 Q ss_pred Cccc---hhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCc-ccceeecCCCCCCCccccccceEEE Q lcl|NC_019506. 1 MAVT---SFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGA-ITVKEYTENSDIDAPEELSTTEKVL 76 (276) Q Consensus 1 MA~~---~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~-~~~~d~~~~~~~~~~~~~~~~~~~~ 76 (276) |+.+ .++|+.|...+.+.+++.+++.+++..- + ..+.+++||+... ..+..+.++..... .+++...+++ T Consensus 30 ~~~~~~~~lip~~~~~~ii~~~~~~s~l~~~~~~~----~-~~~~~~~~p~~~~~~~a~~v~Eg~~~~~-~~~~~~~v~~ 103 (324) T protein:vir:99 30 MMHEKKDGTLLNDFTTPILQEVMENSKIMRLGKYE----P-MEGTEKKFTFWADKPGAYWVGEGQKIET-SKATWVNATM 103 (324) T ss_pred eccCCCcceechhHHHHHHHHHHhhchhhhhccee----e-ccCCceEEEEEecCcceeEeccCccccc-cccceeEEEE Confidence 3322 4789999999999999999999987542 1 2356788998743 45667777776654 5577777777 Q ss_pred EEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc------cccccCCHHHHHHHHHHH Q lcl|NC_019506. 77 EINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSKL------KPAATLDKTNIYEELIKV 150 (276) Q Consensus 77 ~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~~------~~~~~~t~~~~~~~i~~a 150 (276) ...+. +.-+.|+++-...+..++.+.+.++.++++++++|+.++..-..+....+ .......+...++.|.++ T Consensus 104 ~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~ 182 (324) T protein:vir:99 104 RAFKL-GVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDL 182 (324) T ss_pred eeEEE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCccCccccccccccceeccccCCHHHHHHH Confidence 77554 45578888777777789999999999999999999988854333221111 011111223347788888 Q ss_pred HHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEeccccccccceEEEEEecceE Q lcl|NC_019506. 151 KVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSNNMGSLTNGTGAIAGVKMAC 230 (276) Q Consensus 151 ~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp~~~~~~~~~~~~~~a~ 230 (276) ...|...+.... .+++||..+..|.+.. .. .+...+..+.-++++|.+|+.++..+... ...+.+..+-+ T Consensus 183 ~~~l~~~~~~~~--~~v~n~~~~~~L~~l~---d~---~g~~~~~~~~~~~l~G~PVv~~~~~~~~~--~~~i~gd~~~~ 252 (324) T protein:vir:99 183 EALLEDDELEAN--AFISKTQNRSLLRKIV---DP---ETKERIYDRNSDTLDGLPVVNLKSSNLKR--GELITGDFDKL 252 (324) T ss_pred HHhhhhccCCCC--EEEEcHHHHHHHHHhh---cC---CCceeecCCCCccccceeEEeecCCCCCc--ceEEEEecccE Confidence 888887764322 5899999999886532 11 22334445556789999999987766422 22333333323 Q ss_pred Eee-eeeeeeeeccCc--------c-----cc---eeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 231 TFA-EQIVQTEAYRME--------K-----RF---ADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 231 ~~~-~~~~~~e~~~~~--------~-----~~---~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) .+. .+...++..++. + .| ...+++..++|.++++|+++++++.+.| T Consensus 253 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~lt~a~~ 315 (324) T protein:vir:99 253 IYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADK 315 (324) T ss_pred EEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEEeccC Confidence 332 222233332221 1 12 2577888999999999999999998888 No 82 >protein:vir:78223 Length: 333 # NCBI annotation: Putative major head protein # Family: family:all:966 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491666;genbank:gi:157786490;genbank:GeneID:5625701 Probab=99.59 E-value=1.2e-15 Score=102.43 Aligned_cols=269 Identities=12% Similarity=0.056 Sum_probs=165.0 Q ss_pred Cc--cchhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCccc-ceeecCCCCCC-------Cccccc Q lcl|NC_019506. 1 MA--VTSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGAIT-VKEYTENSDID-------APEELS 70 (276) Q Consensus 1 MA--~~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~~~-~~d~~~~~~~~-------~~~~~~ 70 (276) |. ...++|+.+..++.+.+++.+++.+++..- ...+.++++|+..... +....++.... .....+ T Consensus 20 ~~~~~~~liP~~~~~~ii~~l~~~s~l~~~~~~~-----~~~~~~~~~p~~~~~~~a~~v~eg~~~~~~e~~~~~~~~~~ 94 (333) T protein:vir:78 20 LAHVPSDLLPKEIVGPIFDKAQESSLVLRMGEQI-----PISYGETIIPTTVKRPEVGQVGVGTSNEQREGGLKPLSGTA 94 (333) T ss_pred eecCCccccchhHHHHHHHHHHhhchhhhhccee-----eccCCceEEEEEeCCceeEeecCcccccccccccccccccc Confidence 21 112679999999999999999998887542 1235677888875433 33333332111 112344 Q ss_pred cceEEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccc--------------cccccc Q lcl|NC_019506. 71 TTEKVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATS--------------KLKPAA 136 (276) Q Consensus 71 ~~~~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~--------------~~~~~~ 136 (276) ...+++...+. +.-+.|+++-...+..++.+.+.++.++++++++|..++..-.+.... ...... T Consensus 95 f~~i~l~~~kl-~~~~~is~ell~~s~~~~~~~i~~~la~ai~~~~d~~~l~G~g~~~~~~~~g~~~~~~~~~~~~~~~~ 173 (333) T protein:vir:78 95 WDTRSVSPIKL-ATIVTVSEEFARMNPSGLYTKLQGDLAYAIGRGIDLAVFHGKSPLTGSALQGIDTDNVIANTTNVDYL 173 (333) T ss_pred eeEEEEeeEEE-EEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhcccCCCCCccccccccccccccccccccc Confidence 44455544332 344677776666788899999999999999999999988533321100 000111 Q ss_pred cCCHHHHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhc-ccccccceeeeeeeEEeceEEEEeccccc Q lcl|NC_019506. 137 TLDKTNIYEELIKVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTG-GAMAESITKNGFVGTILGFDVYLSNNMGS 215 (276) Q Consensus 137 ~~t~~~~~~~i~~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~-~~~~~~~~~~G~i~~~~G~~v~~s~~lp~ 215 (276) ...+...++.|.++...+..+.- .....++++|..+..|++.....+.+ .+........|..++++|++|+.++++|. T Consensus 174 ~~~~~~~~~~i~~~~~~~~~~~~-~~~~~~vmn~~~~~~L~~~~~~~d~~G~~i~~~~~~~~~~~~l~G~Pv~~~~~i~~ 252 (333) T protein:vir:78 174 QETGDPLLDRLLDGYDLVSANTD-VEFNGWAVDPRFRAHLLRAQAYRDANGNVDPSRINLAAQTGDVLGLPAQFGRAVGG 252 (333) T ss_pred ccccchhHHHHHHHHHhhccccc-cCceEEEEcchHHHHHHHHhhhcCCCCceeecCccccCCCceeeceeeEEccccCC Confidence 22233457788888777765432 23336888999999998765544332 23334455667788999999999999985 Q ss_pred ccc-----ceEEEEEecceEEeeee-eeeeeeccC-----c-----ccce---eeEEeeeeeeeEEEcCCeEEEEE-ecC Q lcl|NC_019506. 216 LTN-----GTGAIAGVKMACTFAEQ-IVQTEAYRM-----E-----KRFA---DAVKGLNVFGCKVIYPDALVCLK-KTN 275 (276) Q Consensus 216 ~~~-----~~~~~~~~~~a~~~~~~-~~~~e~~~~-----~-----~~~~---~~i~~~~~yg~~v~~~~~vv~~~-~~~ 275 (276) ... ....+++..+-+.+..+ ...++..+. . ..|. ..+++..++|+++++|+++++|+ .++ T Consensus 253 ~~~~~~~~~~~~~~gD~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~a 332 (333) T protein:vir:78 253 DLGAAVDSKTRIIGGDFSQLKFGFADEIRIKMSDTATLTDSGSATVSMWQTNQIAILIEVTFGWLLGDKQAFVKFVDDEQ 332 (333) T ss_pred CccccCCCccEEEEEecccEEEEEeeccEEEEeccccccccccceeehhhcCcEEEEEEEEEccEEecccceEEEeccCC Confidence 421 12233333222323221 112222211 1 1121 35788899999999999999998 899 Q ss_pred C Q lcl|NC_019506. 276 P 276 (276) Q Consensus 276 p 276 (276) | T Consensus 333 ~ 333 (333) T protein:vir:78 333 P 333 (333) T ss_pred C Confidence 9 No 83 >protein:vir:9574 Length: 300 # NCBI annotation: gp40 # Family: family:all:966 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862879;genbank:gi:32469471;genbank:GeneID:1461316 Probab=99.59 E-value=1.1e-15 Score=102.51 Aligned_cols=265 Identities=12% Similarity=0.107 Sum_probs=170.2 Q ss_pred Cccc-----hhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccC-cccceeecCCCCCCCccccccceE Q lcl|NC_019506. 1 MAVT-----SFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIG-AITVKEYTENSDIDAPEELSTTEK 74 (276) Q Consensus 1 MA~~-----~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~-~~~~~d~~~~~~~~~~~~~~~~~~ 74 (276) ||.+ .++|+.++.++++.++..+++..++..- ...+..+++|+.. ...+..+.++..... .+++.+.+ T Consensus 1 ma~~t~~~G~lip~~~~~~ii~~l~~~s~i~~l~~~~-----~~~~~~~~~p~~~~~~~a~wv~Eg~~~~~-s~~~f~~v 74 (300) T protein:vir:95 1 MSEAQLSKGNLFNPELVTKVINKVKGHSSIAKLSPQK-----PIPFNGQREFVFDFDSDIDIVAENGKKTH-GGVSLDPV 74 (300) T ss_pred CcccccCCcceechhhHHHHHHHHHhhhhhhhhccee-----eccCCceEEEEEecCcceEEeeCCccccc-ccccceee Confidence 9975 3678889999999999999888876431 1234567888753 345666677766543 45777777 Q ss_pred EEEEEeeeecceeechHHHH---hhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccc------------cccccccCC Q lcl|NC_019506. 75 VLEINKQKYFNFQIDDVDAA---QIRTPLMDAAMQRAAYALADETEKILLKEMDTNATS------------KLKPAATLD 139 (276) Q Consensus 75 ~~~ld~~~~~~~~v~d~d~~---~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~------------~~~~~~~~t 139 (276) +++..+. +.-+.|+++-.. .+..++...+.++.++++++++|+.++......... ........+ T Consensus 75 ~l~~~k~-~~~~~iS~ell~~~~d~~~~l~~~i~~~l~~aia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~ 153 (300) T protein:vir:95 75 TIVPLKV-EYGARVSDEFLHASEEAKVDMLTDFVEGFSKKLARGLDIMSIHGINPRTKQASTIIGDNCFDKKVTQTVPFK 153 (300) T ss_pred EeeeEEE-EEeehhhHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCCCCCcccccccccccccceeeccc Confidence 7777553 445677777553 345788899999999999999999998653221110 001111223 Q ss_pred HHHHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEeccccccccc Q lcl|NC_019506. 140 KTNIYEELIKVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSNNMGSLTNG 219 (276) Q Consensus 140 ~~~~~~~i~~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp~~~~~ 219 (276) +...++.|.++...+...+.. ....++||..+..|.+...- ...+........|..++++|++|+.++.+|..... T Consensus 154 ~~~~~~~i~~~~~~~~~~~~~--~~~~vmn~~~~~~L~~lkd~--~G~~i~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~ 229 (300) T protein:vir:95 154 DTNPDESMEDAVGMIDGSERD--ITGAILDPIFTTALSKMKNA--EGGKLYPELAWGGVPDAINGLAVDKNRTVSYSQTD 229 (300) T ss_pred ccchHHHHHHHHHHhhhcCCC--ccEEEECHHHHHHHHHhhcc--CCCeeccCccccCCCceecceeeEEecCCCCCCCC Confidence 445578888888888776642 23589999999998764321 11222234445667789999999999999865433 Q ss_pred eE--EEEEe-cceEEee-eeeeeee--eccCcc-----cce---eeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 220 TG--AIAGV-KMACTFA-EQIVQTE--AYRMEK-----RFA---DAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 220 ~~--~~~~~-~~a~~~~-~~~~~~e--~~~~~~-----~~~---~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) .. ++.+- ..++-+. .+...++ .+-+++ .|. ..+++..++|..+++|++++.|+..+= T Consensus 230 ~~~~~~~GDf~~~~~~~~~~~~~~~v~~~~~~d~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~g 300 (300) T protein:vir:95 230 PKNTAIVGDFETMFKWGYAKEVPMEIIKYGDPDNSGRDLKGYNQIYIRCEAYIGWGIMDAASFARIVKTGG 300 (300) T ss_pred CccEEEEeeccceEEEEEecccEEEEeeccCCCCcchhhhhcCcEEEEEEEeecceeecccceEEEecCCC Confidence 22 22221 2223222 1111222 121221 122 577888999999999999999987766 No 84 >protein:vir:7771 Length: 330 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817605;genbank:gi:29566035;genbank:GeneID:1259229 Probab=99.59 E-value=2.1e-15 Score=101.03 Aligned_cols=265 Identities=16% Similarity=0.068 Sum_probs=166.7 Q ss_pred Cccc--------------hhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCc-ccceeecCCCCCCC Q lcl|NC_019506. 1 MAVT--------------SFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGA-ITVKEYTENSDIDA 65 (276) Q Consensus 1 MA~~--------------~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~-~~~~d~~~~~~~~~ 65 (276) ||-. .++|+.+..++++.+++.+++.+++..- + ..+.++.+|+... ..+..+.++..... T Consensus 1 m~~~~~~a~~~~~t~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~----~-~~~~~~~~p~~~~~~~a~~v~Eg~~~~~ 75 (330) T protein:vir:77 1 MAGSTVPSTQVALTGDFSAFLTPEQSQDYFAEIEKTSIVQRIARKV----P-MGPTGISIPHWTGAVSASWTGEAERKPI 75 (330) T ss_pred CcccccchhhccccCCCcceechhHHHHHHHHHHhccchhhhccee----e-ccCCceEEEEEcCCcceeEecCCCcccc Confidence 3321 2445556778999999999998887541 1 2356688998744 45666777766654 Q ss_pred ccccccceEEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccc--------------- Q lcl|NC_019506. 66 PEELSTTEKVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATS--------------- 130 (276) Q Consensus 66 ~~~~~~~~~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~--------------- 130 (276) .+++..++++...+. +.-+.|+++-...+..++.+.+.++.++++++++|+.++..-.+.... T Consensus 76 -~~~~f~~i~~~~~k~-~~~~~is~ell~ds~~~~~~~i~~~l~~ai~~~~~~~~l~G~g~~~~~~g~~~~~~~~~~~~~ 153 (330) T protein:vir:77 76 -TKGSFGKQELEPVKI-TTIFAESAEVVRLNPLNYLNTMRTKIAEAIALKFDAAAIHGIDKPSAFKGYLAETTKVVSLAD 153 (330) T ss_pred -ccceeeEEEEeEEEE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCCccccccccccccceeec Confidence 456777777777553 445688887676777899999999999999999999988432221110 Q ss_pred cccccccCCHHHHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceee-----eeeeEEece Q lcl|NC_019506. 131 KLKPAATLDKTNIYEELIKVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKN-----GFVGTILGF 205 (276) Q Consensus 131 ~~~~~~~~t~~~~~~~i~~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~-----G~i~~~~G~ 205 (276) .............++.|.++...+...+.. ...+++||..+..|.+...- ...+.....+.. +.-++++|+ T Consensus 154 ~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~--~~~~vmn~~~~~~l~~lkd~--~G~~l~~~~~~~~~~~~~~~~~l~G~ 229 (330) T protein:vir:77 154 TNLTTASGPQGNAYLAVNNALSLLVNSGKK--WTGTLLDNVTEPILNTAVDG--NGRPLFVESTYTEQVGAIREGRILGR 229 (330) T ss_pred ccccccccccchhHHHHHHHHHhhhhcCCC--ccEEEEcHHHHHHHHHHhcc--CCceeecCccccccccccCCceecce Confidence 001111223344578888888888777653 33689999999988764211 111111111212 233579999 Q ss_pred EEEEeccccccccce--EEEEEecceEEeeee-eeeeeeccCc-----------------ccc---eeeEEeeeeeeeEE Q lcl|NC_019506. 206 DVYLSNNMGSLTNGT--GAIAGVKMACTFAEQ-IVQTEAYRME-----------------KRF---ADAVKGLNVFGCKV 262 (276) Q Consensus 206 ~v~~s~~lp~~~~~~--~~~~~~~~a~~~~~~-~~~~e~~~~~-----------------~~~---~~~i~~~~~yg~~v 262 (276) +|+.++.+|..+.+. ..+.+..+.+.+..+ ...++..++. +.| ...+++..++|+.+ T Consensus 230 PV~~~~~~p~~~~~~~~~~~~gd~s~~~i~~~~~~~i~~~~e~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v 309 (330) T protein:vir:77 230 PTYVADNVVNGTVGNRVVGVMGDFSQVIWGQIGGLSFDVTDQATLDFGEEQGGVWVPKLISLWQHNMVAVRCEAEFAFMV 309 (330) T ss_pred eeEEeccccCCCCCCccEEEEEecceEEEEEecCcEEEEeecceeeecccccccccccccchhhcCcEEEEEEEEeccEE Confidence 999999999654332 233333333333221 1122221110 011 35778889999999 Q ss_pred EcCCeEEEEEecCC Q lcl|NC_019506. 263 IYPDALVCLKKTNP 276 (276) Q Consensus 263 ~~~~~vv~~~~~~p 276 (276) ++|+++++++.++| T Consensus 310 ~~~~a~~~i~~~~~ 323 (330) T protein:vir:77 310 NDKDAFVKLTDQVA 323 (330) T ss_pred ecccceEEEEeccC Confidence 99999999998877 No 85 >protein:vir:96392 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239648;genbank:gi:66395381;genbank:GeneID:5132868 Probab=99.58 E-value=1.5e-15 Score=101.81 Aligned_cols=259 Identities=10% Similarity=0.046 Sum_probs=171.7 Q ss_pred Ccc---chhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccC-cccceeecCCCCCCCccccccceEEE Q lcl|NC_019506. 1 MAV---TSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIG-AITVKEYTENSDIDAPEELSTTEKVL 76 (276) Q Consensus 1 MA~---~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~-~~~~~d~~~~~~~~~~~~~~~~~~~~ 76 (276) |.. ..++|+.|...+++.+++.+++.+++.+- + ..|.++++|+.. ...+..+.++..... .+++.+.+++ T Consensus 30 ~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~l~~~~----~-~~~~~~~~p~~~~~~~a~~v~Eg~~~~~-~~~~~~~v~~ 103 (324) T protein:vir:96 30 MMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYE----P-MEGTEKKFTFWADKPGAYWVGEGQKIET-SKATWVNATM 103 (324) T ss_pred cccCcCccccchhHHHHHHHHHHhhchhhhhccee----e-ccCCceEEEEEecCcceeEecCCccccc-cccceeEEEE Confidence 322 24889999999999999999999987542 2 336678899874 345666677766654 5677777787 Q ss_pred EEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc------cccccCCHHHHHHHHHHH Q lcl|NC_019506. 77 EINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSKL------KPAATLDKTNIYEELIKV 150 (276) Q Consensus 77 ~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~~------~~~~~~t~~~~~~~i~~a 150 (276) ...+. +.-+.|+++-...+..++.+.+.++.++++++++|..++..-.+.....+ .......+...++.|.++ T Consensus 104 ~~~k~-~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~l~G~g~~~~~~gi~~~~~~~~~~~~~~~t~~~i~~~ 182 (324) T protein:vir:96 104 RAFKL-GVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDL 182 (324) T ss_pred eeEEE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCcCccccccccccceeccccccHHHHHHH Confidence 77554 44577888777777789999999999999999999988754333221111 011111233347888888 Q ss_pred HHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEeccccccccceEEEEEecceE Q lcl|NC_019506. 151 KVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSNNMGSLTNGTGAIAGVKMAC 230 (276) Q Consensus 151 ~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp~~~~~~~~~~~~~~a~ 230 (276) ...+...... ...+++||..+..|.+... . .+...+..|..++++|++|+.++..+.. ....+.+..+-+ T Consensus 183 ~~~l~~~~~~--~~~~vmn~~~~~~L~~l~d-----~-~G~~~~~~~~~~~l~G~PV~~~~~~~~~--~~~~~~gd~~~~ 252 (324) T protein:vir:96 183 EALLEDDELE--ANAFISKTQNRSLLRKIVD-----P-ETKERIYDRNSDSLDGLPVVNLKSSNLK--RGELITGDFDKL 252 (324) T ss_pred HHhhhhccCC--CCEEEEcHHHHHHHHHhhc-----c-CCCeeecCCCCCcccceeeEeeCCCCCC--cceEEEEecceE Confidence 8888877653 2368999999998865321 1 1233455677788999999987766532 223334333323 Q ss_pred Eeee-eeeeeeeccCc-------------ccc---eeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 231 TFAE-QIVQTEAYRME-------------KRF---ADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 231 ~~~~-~~~~~e~~~~~-------------~~~---~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) .++. +...++..++. ..| ...+++..++|..+++|+++++|+.+.+ T Consensus 253 ~~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~A~~~l~~a~~ 315 (324) T protein:vir:96 253 IYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADK 315 (324) T ss_pred EEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEecccc Confidence 2322 22233332221 112 3577888999999999999999998888 No 86 >protein:vir:78830 Length: 324 # NCBI annotation: major head protein # Family: family:all:507 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285361;genbank:gi:148717889;genbank:GeneID:5246961 Probab=99.58 E-value=1.5e-15 Score=101.81 Aligned_cols=259 Identities=10% Similarity=0.046 Sum_probs=171.7 Q ss_pred Ccc---chhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccC-cccceeecCCCCCCCccccccceEEE Q lcl|NC_019506. 1 MAV---TSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIG-AITVKEYTENSDIDAPEELSTTEKVL 76 (276) Q Consensus 1 MA~---~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~-~~~~~d~~~~~~~~~~~~~~~~~~~~ 76 (276) |.. ..++|+.|...+++.+++.+++.+++.+- + ..|.++++|+.. ...+..+.++..... .+++.+.+++ T Consensus 30 ~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~l~~~~----~-~~~~~~~~p~~~~~~~a~~v~Eg~~~~~-~~~~~~~v~~ 103 (324) T protein:vir:78 30 MMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYE----P-MEGTEKKFTFWADKPGAYWVGEGQKIET-SKATWVNATM 103 (324) T ss_pred cccCcCccccchhHHHHHHHHHHhhchhhhhccee----e-ccCCceEEEEEecCcceeEecCCccccc-cccceeEEEE Confidence 322 24889999999999999999999987542 2 336678899874 345666677766654 5677777787 Q ss_pred EEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc------cccccCCHHHHHHHHHHH Q lcl|NC_019506. 77 EINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSKL------KPAATLDKTNIYEELIKV 150 (276) Q Consensus 77 ~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~~------~~~~~~t~~~~~~~i~~a 150 (276) ...+. +.-+.|+++-...+..++.+.+.++.++++++++|..++..-.+.....+ .......+...++.|.++ T Consensus 104 ~~~k~-~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~l~G~g~~~~~~gi~~~~~~~~~~~~~~~t~~~i~~~ 182 (324) T protein:vir:78 104 RAFKL-GVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDL 182 (324) T ss_pred eeEEE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCcCccccccccccceeccccccHHHHHHH Confidence 77554 44577888777777789999999999999999999988754333221111 011111233347888888 Q ss_pred HHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEeccccccccceEEEEEecceE Q lcl|NC_019506. 151 KVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSNNMGSLTNGTGAIAGVKMAC 230 (276) Q Consensus 151 ~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp~~~~~~~~~~~~~~a~ 230 (276) ...+...... ...+++||..+..|.+... . .+...+..|..++++|++|+.++..+.. ....+.+..+-+ T Consensus 183 ~~~l~~~~~~--~~~~vmn~~~~~~L~~l~d-----~-~G~~~~~~~~~~~l~G~PV~~~~~~~~~--~~~~~~gd~~~~ 252 (324) T protein:vir:78 183 EALLEDDELE--ANAFISKTQNRSLLRKIVD-----P-ETKERIYDRNSDSLDGLPVVNLKSSNLK--RGELITGDFDKL 252 (324) T ss_pred HHhhhhccCC--CCEEEEcHHHHHHHHHhhc-----c-CCCeeecCCCCCcccceeeEeeCCCCCC--cceEEEEecceE Confidence 8888877653 2368999999998865321 1 1233455677788999999987766532 223334333323 Q ss_pred Eeee-eeeeeeeccCc-------------ccc---eeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 231 TFAE-QIVQTEAYRME-------------KRF---ADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 231 ~~~~-~~~~~e~~~~~-------------~~~---~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) .++. +...++..++. ..| ...+++..++|..+++|+++++|+.+.+ T Consensus 253 ~~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~A~~~l~~a~~ 315 (324) T protein:vir:78 253 IYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADK 315 (324) T ss_pred EEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEecccc Confidence 2322 22233332221 112 3577888999999999999999998888 No 87 >protein:vir:100247 Length: 425 # NCBI annotation: gp76 # Family: family:all:21 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355412;genbank:gi:77864702;genbank:GeneID:3725969 Probab=99.58 E-value=1.2e-15 Score=102.36 Aligned_cols=265 Identities=12% Similarity=0.054 Sum_probs=167.3 Q ss_pred Ccc------chhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEecc-CcccceeecCCCCCCCccccccce Q lcl|NC_019506. 1 MAV------TSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQI-GAITVKEYTENSDIDAPEELSTTE 73 (276) Q Consensus 1 MA~------~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~-~~~~~~d~~~~~~~~~~~~~~~~~ 73 (276) |.. -.++|+.|...+++.+++.+++.++++.- + ..+...++|.. +...+....++......+..+... T Consensus 130 l~~~t~~~gG~lvP~~~~~~ii~~~~~~s~l~~l~~~~----~-~~~~~~~~~~~~~~~~a~wv~E~~~~~~~~~~~f~~ 204 (425) T protein:vir:10 130 LNKGEDSEGGYLTPIEWDRTITNKLVLISPMRQLCRVQ----P-VSKAGFSKLFNMGGTTSGWVGEASQRPQTNAATFQP 204 (425) T ss_pred hhcCcCCCCceeccHhHHHHHHHHHHhhhhhhhhceee----e-ccCCceEEEEEcCCcceeeeccccccccccccccce Confidence 322 13889999999999999999999987531 1 12345666654 334566666665543323345566 Q ss_pred EEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcc---------cccccc----------c Q lcl|NC_019506. 74 KVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTN---------ATSKLK----------P 134 (276) Q Consensus 74 ~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~---------~~~~~~----------~ 134 (276) +++...+. +.-+.|+++-...+..++.+.+.++.++++++++|..++..=... +..... . T Consensus 205 v~~~~~k~-~~~i~iS~ell~ds~~~l~~~i~~~la~ai~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~ 283 (425) T protein:vir:10 205 LSFASGEI-YANPAATQQILDDAEIDLESWLATEVQTEFAKQEGKAFLAGDGTNKPNGLLTYIAGGANAAKHPFGAIEVV 283 (425) T ss_pred eeeeheee-EeehHhHHHHHhcchhHHHHHHHHHHHHHHHHHHHhhhhcccCCCCcceeeeccccccccccccccccccc Confidence 77776554 344677877777777899999999999999999999887531110 000000 0 Q ss_pred cccCCHHHHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEecccc Q lcl|NC_019506. 135 AATLDKTNIYEELIKVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSNNMG 214 (276) Q Consensus 135 ~~~~t~~~~~~~i~~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp 214 (276) .+..+....++.|.++...|..... .+-..++||..+..|.+... ....+.....+..|.-++++|.+|+.++.+| T Consensus 284 ~~~~~~~~~~d~l~~l~~~l~~~~~--~~a~~vmn~~~~~~L~~lkD--~~G~~l~~~~~~~g~~~~l~G~PV~~~~~~p 359 (425) T protein:vir:10 284 NSGAAADITSDGIIDLVYDLPSAFT--GNARFAMNRNTQRQVRKLKD--GQGNYLWQPSYVAGQPATLAGYPVTEVPDMP 359 (425) T ss_pred cccccccccHHHHHHHHhhhhhhhc--cCCEEEEchHHHHHHHHhhc--CCCceeeccCccCCCCceecceeeEEecCcC Confidence 0111222346677777766655432 33468999999998876432 1122333344567777899999999999999 Q ss_pred ccccceEEEEE--ecceEEeeeeeeeeeeccCcc--cceeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 215 SLTNGTGAIAG--VKMACTFAEQIVQTEAYRMEK--RFADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 215 ~~~~~~~~~~~--~~~a~~~~~~~~~~e~~~~~~--~~~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) ....+...+.+ .+.++....+. .++..+++. .--..+++..++|++|++|+++++++.+|. T Consensus 360 ~~~~~~~~i~~Gd~~~~~~i~~~~-~~~v~~d~~~~~~~~~~~~~~r~d~~v~~~~A~~~l~~~as 424 (425) T protein:vir:10 360 DVAANSTPILFGDFQQTYLIIDRI-GVRVLRDPYTAKPYVLFYTTKRVGGGLLNPEPMRAMKVAAS 424 (425) T ss_pred CccCCccEEEEEehhccEEEEEec-ceEEEecccccCCcEEEEEEEEeccEeecccceEEEEeecc Confidence 66554443332 23333333222 122222221 122467888999999999999999999999 No 88 >protein:vir:4339 Length: 395 # NCBI annotation: major head protein # Family: family:all:585 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061502;genbank:gi:9635591;genbank:GeneID:1262860 Probab=99.57 E-value=3.3e-15 Score=100.00 Aligned_cols=260 Identities=12% Similarity=0.084 Sum_probs=165.6 Q ss_pred Cccc--hhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCc--ccceeecCCCCCCCccccccceEEE Q lcl|NC_019506. 1 MAVT--SFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGA--ITVKEYTENSDIDAPEELSTTEKVL 76 (276) Q Consensus 1 MA~~--~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~--~~~~d~~~~~~~~~~~~~~~~~~~~ 76 (276) ...+ .++|+.|...+++.+++...+.++++... ..|.++++|+... ..+..+.+++.... .+++.+.+++ T Consensus 117 ~~~~~g~~vp~~~~~~ii~~~~~~~~l~~l~~~~~-----~~~~~~~~~~~~~~~~~a~~v~E~~~~~~-~~~~~~~i~~ 190 (395) T protein:vir:43 117 IDGSGGALVAPDRRPGVVAAPQRRLTIRDLVAPGT-----TESNSVEYVRETGFVNNAAPVSEGTQKPY-SDLTFELENA 190 (395) T ss_pred cCCCCccccchhhHHHHHHHHHhhhhHHhhcccee-----cCCCceEEEEEecCCCceeeecCCccccc-cccceeEEEE Confidence 1111 36788889999999999999999886531 2356788887533 34556666665543 4567777888 Q ss_pred EEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccc----------ccccccccCCHHHHHHH Q lcl|NC_019506. 77 EINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNAT----------SKLKPAATLDKTNIYEE 146 (276) Q Consensus 77 ~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~----------~~~~~~~~~t~~~~~~~ 146 (276) .+.+. +.-+.|++. .++...++...+.+..+.++++.+|..++..-.++.. .........+....++. T Consensus 191 ~~~k~-~~~~~is~e-ll~d~~~l~~~v~~~la~a~~~~~d~~~l~G~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~~~ 268 (395) T protein:vir:43 191 PVRTI-AHLFKASRQ-ILDDASALQSYIDARARYGLMLVEECQLLYGNGTGANLHGIIPQAQAYAPPSGVVVTAEQRIDR 268 (395) T ss_pred eeeeE-EEeehhhHH-HHHhHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccccccccccchhHHH Confidence 87665 444677776 4445567777778889999999999998864222111 01111222334456788 Q ss_pred HHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEeccccccccceEEEEEe Q lcl|NC_019506. 147 LIKVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSNNMGSLTNGTGAIAGV 226 (276) Q Consensus 147 i~~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp~~~~~~~~~~~~ 226 (276) +.++...+.....+ .-.+++||..+..|.+...- ...+... ...+|..++++|++|+.++.+|... ..+... T Consensus 269 i~~~~~~~~~~~~~--~~~~vmn~~~~~~l~~lkd~--~G~~i~~-~~~~~~~~~l~G~pVv~~~~~~~~~---~~~gd~ 340 (395) T protein:vir:43 269 IRLAILQAQLAEFP--ASGIVLNPIDWALIELNKDA--ENRYIIG-SPQNGTTPTLWRLPVVETQAITQDE---FLTGAF 340 (395) T ss_pred HHHHHHhhccccCC--CcEEEEcHHHHHHHHHhhcc--CCceecc-ccccCCCceecceeeEEcCCCCCCc---EEEEec Confidence 88888888776653 23689999999988654311 1112222 2346667789999999999998532 233333 Q ss_pred cceE-EeeeeeeeeeeccCcc-cc---eeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 227 KMAC-TFAEQIVQTEAYRMEK-RF---ADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 227 ~~a~-~~~~~~~~~e~~~~~~-~~---~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) +.++ .+......++..+... .| ...+++..++|+++++|++++.++.++= T Consensus 341 ~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~~~~taa 395 (395) T protein:vir:43 341 SLGAQIFDRMDIEVLVSTENDKDFENNMVTIRAEERLAFAVYRPEAFVTGSLTAS 395 (395) T ss_pred cceEEEEEecceEEEEeccccchhhcCcEEEEEEEeeccEEecccceEEEEeccC Confidence 3333 2222222333333222 12 3477888999999999999999986655 No 89 >protein:vir:6242 Length: 390 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813696;swissprot:trembl:q859c1;genbank:gi:29366756;interpro:IPR006444;uniprot:Q859C1;genbank:GeneID:1258897 Probab=99.57 E-value=1.2e-15 Score=102.46 Aligned_cols=263 Identities=10% Similarity=-0.004 Sum_probs=165.3 Q ss_pred Ccc-chhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCc-ccceeecCCCCCCCccccccceEEEEE Q lcl|NC_019506. 1 MAV-TSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGA-ITVKEYTENSDIDAPEELSTTEKVLEI 78 (276) Q Consensus 1 MA~-~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~-~~~~d~~~~~~~~~~~~~~~~~~~~~l 78 (276) -++ ..+.|+++...+.+.+++..++..+++.- +...|..+.||+... ..+..+.++..... .+++...+++.+ T Consensus 116 ~~~g~~~~~~~~~~~i~~~~~~~~~l~~~~~~~----~~~~~~~~~~p~~~~~~~a~wv~E~~~~~~-~~~~f~~i~~~~ 190 (390) T protein:vir:62 116 AGNPNVLSRTLYGQLIAQAVERSAIMRGGATTF----TTSDANPLDFTVITGRSSASIVGETAEIPE-SYPATAQRSMGG 190 (390) T ss_pred cCCCccccccchHHHHHHHHhhhhhhhhcceee----ecCCCceeEEEEEcCCcceeeecccccccc-cccceeeeEeee Confidence 111 24677888888888888887777776431 223467788997744 45666777776654 467778888888 Q ss_pred EeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHh-------hccccccccccccCCHHHHHHHHHHHH Q lcl|NC_019506. 79 NKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEM-------DTNATSKLKPAATLDKTNIYEELIKVK 151 (276) Q Consensus 79 d~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~-------~~~~~~~~~~~~~~t~~~~~~~i~~a~ 151 (276) .+. +.-+.|+++-...+..++.+.+.++.+++++.++|..++..- ........+..........++.|.++. T Consensus 191 ~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~l~G~G~p~Gi~~~~~~~~~~~~~~~~~~~~~~~l~~~~ 269 (390) T protein:vir:62 191 FKY-GFASVVSYEFATDQVLDLVGFLVSDAGPAIGDAMGRHFITGTGQPRGILTDASPATATFLATDTDSKVSDALIDLF 269 (390) T ss_pred eeE-EeehHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHhhhhccCCccccccccccccccceecccccccchHHHHHHH Confidence 655 445678888777788899999999999999999999887531 111111111111111223466777777 Q ss_pred HHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEeccccccccceEEEEEecceEE Q lcl|NC_019506. 152 VKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSNNMGSLTNGTGAIAGVKMACT 231 (276) Q Consensus 152 ~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp~~~~~~~~~~~~~~a~~ 231 (276) ..|+.... .+-..++||..+..|.+... ....+.....+..|..++++|++|+.++.+|... ..+..-+..+. T Consensus 270 ~~l~~~~~--~~a~~vmn~~~~~~L~~lkd--~~g~~l~~~~~~~g~~~~l~G~Pv~~~~~~p~~~---i~~gd~s~~~i 342 (390) T protein:vir:62 270 HEVPSAYR--ANAKYVVNDLRAAQMRKLKD--ANGQYLWQSGLTVGAPSLFNGKVVETDDGMPADK---ILFADLSKYRV 342 (390) T ss_pred Hhhhhhhh--cCCEEEEchHHHHHHHHhhc--cCCCeeecCCcCCCccceecccceEEecCCCCcc---EEEeeccceeE Confidence 77765443 34468999999998865321 1122333444566777789999999999998532 22221122221 Q ss_pred eeeeeeeeeeccCcccc--eeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 232 FAEQIVQTEAYRMEKRF--ADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 232 ~~~~~~~~e~~~~~~~~--~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) .......++...+.... ...+++..++|+++++|+++.+|+.++= T Consensus 343 ~~~~~~~v~~~~~~~~~~~~~~~~~~~r~d~~~~~~~A~~~l~~~~~ 389 (390) T protein:vir:62 343 RFAGSLRVDRSVDAKFSTDQIVYRFLQRADGLLVDARGAKVLTVTPG 389 (390) T ss_pred EeecceEEEeeccccccCCcEEEEEEEEeCcEeechhheEEEEeecC Confidence 11122233333332211 2467889999999999999999886655 No 90 >protein:vir:103955 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873992;genbank:gi:118430767;genbank:GeneID:4525449 Probab=99.57 E-value=2.6e-15 Score=100.53 Aligned_cols=259 Identities=11% Similarity=0.054 Sum_probs=169.7 Q ss_pred Ccc---chhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCc-ccceeecCCCCCCCccccccceEEE Q lcl|NC_019506. 1 MAV---TSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGA-ITVKEYTENSDIDAPEELSTTEKVL 76 (276) Q Consensus 1 MA~---~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~-~~~~d~~~~~~~~~~~~~~~~~~~~ 76 (276) |+- ..++|+.|...+.+.+++.+++.+++..- + ..+.++++|+... ..+..+.++..... .+++.+.+++ T Consensus 30 ~~~~~~~~liP~~~~~~ii~~~~~~s~l~~~~~~~----~-~~~~~~~~p~~~~~~~a~~v~Eg~~~~~-~~~~~~~v~~ 103 (324) T protein:vir:10 30 MMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYE----P-MEGTEKKFTFWADKPGAYWVGEGQKIET-SKATWVNATM 103 (324) T ss_pred eccCCCcceechhHHHHHHHHHHhhchhhhhccee----e-ccCCceEEEEEeCCcceeEeccCccccc-cccceeEEEE Confidence 332 24789999999999999999999987542 2 2356789998743 45677777777654 4567777777 Q ss_pred EEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc------cccccCCHHHHHHHHHHH Q lcl|NC_019506. 77 EINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSKL------KPAATLDKTNIYEELIKV 150 (276) Q Consensus 77 ~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~~------~~~~~~t~~~~~~~i~~a 150 (276) ...+. +.-+.|+++-...+..++.+.+.++.++++++++|..++..-..+....+ .......+...++.|.++ T Consensus 104 ~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~ai~~~~d~a~l~G~g~~~~~~~i~~~~~~~~~~~~~~~t~~~i~~~ 182 (324) T protein:vir:10 104 RAFKL-GVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDL 182 (324) T ss_pred eeEEE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCccCccccccccccceeccccCCHHHHHHH Confidence 77554 44577888777677789999999999999999999988754333221111 011111223347788888 Q ss_pred HHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEeccccccccceEEEEEecceE Q lcl|NC_019506. 151 KVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSNNMGSLTNGTGAIAGVKMAC 230 (276) Q Consensus 151 ~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp~~~~~~~~~~~~~~a~ 230 (276) ...+...+.... .+++||..+..|.+.. +. .+...+..|.-++++|.+|+.++..+.. ....+.+..+.+ T Consensus 183 ~~~l~~~~~~~~--~~v~n~~~~~~L~~l~---d~---~g~~~~~~~~~~~l~G~PV~~~~~~~~~--~~~~~~gd~~~~ 252 (324) T protein:vir:10 183 EALLEDDELEAN--AFISKTQNRSLLRKIV---DP---ETKERIYDRNSDTLDGLPVVNLKSSNLK--RGELITGDFDKL 252 (324) T ss_pred HHhhhhccCCCC--EEEEcHHHHHHHHHhh---cc---CCceeecCCCCccccceeEEeecCCCCC--cceEEEEecccE Confidence 888877664322 5899999999886532 11 2233455566678999999987766532 223333333333 Q ss_pred Eeee-eeeeeeeccC--------cc-----cc---eeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 231 TFAE-QIVQTEAYRM--------EK-----RF---ADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 231 ~~~~-~~~~~e~~~~--------~~-----~~---~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) .+.. +...++..++ ++ .| ...+++..++|..+++|+++++|+.+.| T Consensus 253 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~A~~~l~~a~~ 315 (324) T protein:vir:10 253 IYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADK 315 (324) T ss_pred EEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEEeccC Confidence 3322 2223332222 11 12 2577888999999999999999998777 No 91 >protein:vir:100135 Length: 418 # NCBI annotation: gp5 # Family: family:all:585 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945035;genbank:gi:38707895;genbank:GeneID:2744182 Probab=99.57 E-value=2.9e-15 Score=100.30 Aligned_cols=260 Identities=13% Similarity=0.087 Sum_probs=163.9 Q ss_pred Cc-----cchhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCc--ccceeecCCCCCCCccccccce Q lcl|NC_019506. 1 MA-----VTSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGA--ITVKEYTENSDIDAPEELSTTE 73 (276) Q Consensus 1 MA-----~~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~--~~~~d~~~~~~~~~~~~~~~~~ 73 (276) +. ...++|+.|+..+.+.+++...+.++++.- + ..|.++++|.... ..+..+.+++.... .+++... T Consensus 136 ~~~~~~~~g~lvp~~~~~~ii~~~~~~~~l~~~~~~~----~-~~~~~~~~~~~~~~~~~a~~v~E~~~~~~-~~~~f~~ 209 (418) T protein:vir:10 136 VGSGVSGSNSLVVADRQAGIIAPPQRKMTIRDLLMPG----Q-TSSSSIEYTVETGFTNNAAAVAEGAQKPT-SDLKFNL 209 (418) T ss_pred ccCCCCCCccccchhHHHHHHHHHhhhhhHHhhccee----e-ccCCceeEEEEecCCCceeeeccCccccc-cccceee Confidence 11 123789999999999999999999987542 1 2356788887543 34556666666543 4567777 Q ss_pred EEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc--------ccccccCCHHHHHH Q lcl|NC_019506. 74 KVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSK--------LKPAATLDKTNIYE 145 (276) Q Consensus 74 ~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~--------~~~~~~~t~~~~~~ 145 (276) +++.+.+. +.-+.|+++-. +...++.+.+.+..+.++++++|..++..-.++.... .......+....++ T Consensus 210 v~~~~~k~-~~~~~is~ell-~ds~~l~~~i~~~l~~a~~~~~d~a~l~G~g~~~~p~Gi~~~~~~~~~~~~~~~~~~~~ 287 (418) T protein:vir:10 210 KNQPVRTI-AHLFKASRQIL-DDAPALQSYIDGRARYGLQLTEEGQILKGDGTGANILGILPQASAFMPSITLANATPID 287 (418) T ss_pred EEEeeeeE-EEeehhhHHHH-HhHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccccccccccccHH Confidence 77777654 33467777644 4556888888889999999999999885432221110 11111122223467 Q ss_pred HHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEeccccccccceEEEEE Q lcl|NC_019506. 146 ELIKVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSNNMGSLTNGTGAIAG 225 (276) Q Consensus 146 ~i~~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp~~~~~~~~~~~ 225 (276) .|..+...+...+.+.. .+++||..+..|.+...- ...+... ....|..++++|++|+.++.+|.. ...+.. T Consensus 288 ~i~~~~~~~~~~~~~~~--~~v~n~~~~~~L~~lkd~--~G~~i~~-~~~~~~~~~l~G~pV~~~~~~p~~---~~~~gd 359 (418) T protein:vir:10 288 KIRLALLQAVLAEFPAT--GIVLNPIDWASIELTKDS--QGRYIVG-NPVNGTTPRLWNLPVVETQAMTAN---EFLVGA 359 (418) T ss_pred HHHHHHHhhccccCCCC--EEEEcHHHHHHHHHhhcC--CCceecc-ccccCCCceecceeeEEcCCCCCC---cEEEee Confidence 77777777665554322 588999999988654311 1112222 234566788999999999999853 223322 Q ss_pred ecceEEee-eeeeeeeeccCcc----cceeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 226 VKMACTFA-EQIVQTEAYRMEK----RFADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 226 ~~~a~~~~-~~~~~~e~~~~~~----~~~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) .+.++-.. .....++..+... +-...+++..++|+++++|++++.++.++| T Consensus 360 ~s~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~~~d~~~~~~~a~~~~~~~~~ 415 (418) T protein:vir:10 360 FSMAAQIFDRMEIEVLLSTENVDDFEKNMVSIRAEERLALAVYRPESFVTGALVEQ 415 (418) T ss_pred ccceEEEEEecceEEEEecccchhhhcCceEEEEEEeeccEEecccceEEEEeccC Confidence 33333222 2222233222211 122467788999999999999999999999 No 92 >protein:vir:9759 Length: 303 # NCBI annotation: putative structural protein # Family: family:all:966 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795521;genbank:gi:28876283;genbank:GeneID:1257824 Probab=99.56 E-value=2.1e-15 Score=101.09 Aligned_cols=265 Identities=11% Similarity=0.047 Sum_probs=166.1 Q ss_pred Cccc----hhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccC-cccceeecCCCCCCCccccccceEE Q lcl|NC_019506. 1 MAVT----SFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIG-AITVKEYTENSDIDAPEELSTTEKV 75 (276) Q Consensus 1 MA~~----~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~-~~~~~d~~~~~~~~~~~~~~~~~~~ 75 (276) ||.. .++|+.++.++++.+++.+++..++..-. -.+.++++|+.. ...+..+.++..... .+++.+.++ T Consensus 1 m~t~t~gg~liP~~~~~~ii~~l~~~s~i~~l~~~~~-----~~~~~~~ip~~~~~~~a~wv~E~~~~~~-s~~~f~~v~ 74 (303) T protein:vir:97 1 MGTETSKASLFDKHLVSDLINKVKGHSSLAKLSSQKP-----IPFNGSKEFTFTLDSDIDVVAENGKKTH-GGLSLEPVT 74 (303) T ss_pred CcccCCCCeEcchhHHHHHHHHHHhhchhhhhcceee-----cCCCceEEEEEecCcceEEeecCccccc-cccceeeEE Confidence 9864 48899999999999999999999885421 234678898864 345667777766543 456666777 Q ss_pred EEEEeeeecceeechHHHH---hhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc-------------ccccccCC Q lcl|NC_019506. 76 LEINKQKYFNFQIDDVDAA---QIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSK-------------LKPAATLD 139 (276) Q Consensus 76 ~~ld~~~~~~~~v~d~d~~---~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~-------------~~~~~~~t 139 (276) +...+. +.-+.++++-.. .+..++.+.+.++.++++++++|+.++....+..... .......+ T Consensus 75 l~~~kl-~~~~~iS~ell~~~~d~~~~l~~~i~~~la~a~~~~ld~a~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~ 153 (303) T protein:vir:97 75 IVPIKV-EYGARLSDEFLYATEEEKIDILKAFNEGFAKKLARGIDLMAMHGINPRTKKASDVIGTNHFDSKVTQVVKFTE 153 (303) T ss_pred eeeEEE-EEeehhhHHHhhcCccchHHHHHHHHHHHHHHHHHHHHhhhhcccccCCcccccccccccccccccccccccc Confidence 766443 444677776543 3456788999999999999999999886532211100 00111123 Q ss_pred HHHHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhccccccc-ceeeeeeeEEeceEEEEecccccccc Q lcl|NC_019506. 140 KTNIYEELIKVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAES-ITKNGFVGTILGFDVYLSNNMGSLTN 218 (276) Q Consensus 140 ~~~~~~~i~~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~-~~~~G~i~~~~G~~v~~s~~lp~~~~ 218 (276) ....++.|.++...+...+... ..+++||..+..|++...-.. .+.... .-..+..++++|.+++.|+++|.... T Consensus 154 ~~~~~~~i~~~~~~~~~~~~~~--~~~vmn~~~~~~L~~lkd~~g--~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~ 229 (303) T protein:vir:97 154 SEDADANIEAAVNLIQGAEGVV--TGLAMDTEFSTALAKVTNGEM--GPKMYPELAWGANPDSINGLKSSVNTTVGAGAD 229 (303) T ss_pred ccchHHHHHHHHHHHhhcCCCc--cEEEEcHHHHHHHHHhhccCC--CeEEecCccCCCCCceecceeeEEecccCCccc Confidence 3445788888888887665432 358999999999976421111 111111 12234556899999999999985321 Q ss_pred ----ceEEEEE-ecceEEeeee-eeeeeec--cCcc-----cce---eeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 219 ----GTGAIAG-VKMACTFAEQ-IVQTEAY--RMEK-----RFA---DAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 219 ----~~~~~~~-~~~a~~~~~~-~~~~e~~--~~~~-----~~~---~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) ....+++ ...++.+..+ ..+++.. .+++ .|. ..+++..++|.+|++|+++++|+.+-= T Consensus 230 ~~~~~~~~~~Gdf~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~n~~~~r~~~r~~~~v~~p~af~~l~~~~~ 303 (303) T protein:vir:97 230 EAESKDLVIIGDFESMFKWGYAKQIPMEIIKYGDPDNSGKDLKGYNQIYLRAEAYIGWGILDAKSFARVTKGEV 303 (303) T ss_pred cCCCccEEEEeeccccEEEEEecCcEEEEeeccCCCCcchhhhhcCcEEEEEEEEeccEeecccceEEeeCCCC Confidence 1112222 2333333322 2223221 1111 121 467888999999999999999964333 No 93 >protein:vir:4511 Length: 409 # NCBI annotation: capsid # Family: family:all:21 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599037;genbank:gi:19548995;genbank:GeneID:935211 Probab=99.56 E-value=1.1e-15 Score=102.69 Aligned_cols=268 Identities=10% Similarity=0.079 Sum_probs=162.6 Q ss_pred Cccc------hhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCcc--cceeecCCCCCCCccccccc Q lcl|NC_019506. 1 MAVT------SFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGAI--TVKEYTENSDIDAPEELSTT 72 (276) Q Consensus 1 MA~~------~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~~--~~~d~~~~~~~~~~~~~~~~ 72 (276) |... .++|+.|...+++.+++.+.+.++++.- +...+..+.+|..... ......++..... .+++.. T Consensus 117 ~~~~~~~~gg~liP~~~~~~ii~~~~~~~~l~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~-~~~~f~ 191 (409) T protein:vir:45 117 QGVAQDEKGGYTVPETFLAKVVEKMKSYGGIASVAQIL----TTSDGRTMEWATADGTSEVGVLLGENEEAGE-EDTDFG 191 (409) T ss_pred ccCccCcCCceeccHhHHHHHHHHHHhhhhhhhhceee----ecCCCceEEEEeeccCccccccccccccccc-cccccc Confidence 3221 3789999999999999999888877542 2234556667765432 2334445544332 344555 Q ss_pred eEEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccc----c---c--ccccccCCHHHH Q lcl|NC_019506. 73 EKVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNAT----S---K--LKPAATLDKTNI 143 (276) Q Consensus 73 ~~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~----~---~--~~~~~~~t~~~~ 143 (276) .+++...+....-+.|+++-...+..++...+.++.+++++.++|..++..-.+... . . .......+.... T Consensus 192 ~~~l~~~k~~~~~i~is~ell~ds~~~l~~~i~~~la~a~~~~~~~a~l~G~G~~~~~~p~Gil~~~~~~~~~~~~~~~~ 271 (409) T protein:vir:45 192 MGSLGALKMTSKIIRVSNELLQDSAIDMEAYLARRIAERIGRGEARYLIQGTGAGTPKQPKGLAASVTGTTQTAAANAVK 271 (409) T ss_pred eeeeeeeeeeeeehhhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCccccceeeeccccccccccccccc Confidence 445443222122246788777777789999999999999999999998853322110 0 0 001111122223 Q ss_pred HHHHHHHHHHHhhcCCCccCCE-EEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEeccccccccceEE Q lcl|NC_019506. 144 YEELIKVKVKLDEKNVPTIGRF-LIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSNNMGSLTNGTGA 222 (276) Q Consensus 144 ~~~i~~a~~~l~~~~vP~~~r~-~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp~~~~~~~~ 222 (276) ++.|.++...|..... ....| +++||..+..|.+... ....+.....+..|...+++|.+|+.++.+|..+.+... T Consensus 272 ~d~i~~l~~~l~~~~~-~~a~~~~~~n~~~~~~l~~lkd--~~G~~i~~~~~~~~~~~~l~G~PV~~~~~~p~~~~~~~~ 348 (409) T protein:vir:45 272 WQEILALKHSIDPAYR-RGPKFRLAFNDNTLKLISEMED--GQGRPLWLPDIVGVAPASVLNVPYVIDQEIDDIGAGKKF 348 (409) T ss_pred hHHHHHHHHhhhhhhc-cCCeEEEEECHHHHHHHHHhhc--CCCceeeccCcCCCCCceecceeeEEecCcCCccCCccE Confidence 5677777777765542 23344 5789999988865321 112222334455677778999999999999975554443 Q ss_pred EEE-e-cceEEeeeeeeeeeeccCcccc--eeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 223 IAG-V-KMACTFAEQIVQTEAYRMEKRF--ADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 223 ~~~-~-~~a~~~~~~~~~~e~~~~~~~~--~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) +++ . ...+........++..+++... -..|++..++|+++++|+++++++..++ T Consensus 349 i~~Gd~~~~~i~~~~~~~~~~~~d~~~~~~~~~~~~~~r~d~~~~~~~A~~~l~~k~s 406 (409) T protein:vir:45 349 MFCGDFDRFIIRRVRYMILKRLVERYAEYDQTGFLAFHRFDCILEDTSAIKALVGKGS 406 (409) T ss_pred EEEeehhhhheeeccceEEEEeecccccCCcEEEEEEEEeccEeechhheEEEEeccC Confidence 332 2 2222222222233434333221 2468899999999999999999998777 No 94 >protein:vir:104085 Length: 320 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655596;genbank:gi:109392467;genbank:GeneID:4156953 Probab=99.56 E-value=4.1e-15 Score=99.44 Aligned_cols=263 Identities=14% Similarity=0.038 Sum_probs=161.1 Q ss_pred Cccc------hhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCc-ccceeecCCCCCCCccccccce Q lcl|NC_019506. 1 MAVT------SFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGA-ITVKEYTENSDIDAPEELSTTE 73 (276) Q Consensus 1 MA~~------~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~-~~~~d~~~~~~~~~~~~~~~~~ 73 (276) |+.+ .++|+.|..++++.+++.+++.+++..- ...+.+++||+... ..+..+.++..... ..++.++ T Consensus 14 ~~~t~~~~~~~~ip~~~~~~ii~~~~~~s~l~~~~~~~-----~~~~~~~~~p~~~~~~~a~~v~E~~~~~~-~~~~f~~ 87 (320) T protein:vir:10 14 IAQTGDTMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKV-----PMGTTGQKIPHWIGDVSAQWIGEGDMKPI-TKGNMTS 87 (320) T ss_pred hhccccccccccccHHHHHHHHHHHHhccchhhhccee-----eccCCceEEEEEeCCcceEEecCCccccc-cccceeE Confidence 4432 3678889999999999999998887542 12356788998743 45666777766654 4567777 Q ss_pred EEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccc--------ccc-ccccCCHHH-- Q lcl|NC_019506. 74 KVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATS--------KLK-PAATLDKTN-- 142 (276) Q Consensus 74 ~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~--------~~~-~~~~~t~~~-- 142 (276) +++...+. +.-+.|+++-...+..++.+.+.++.++++++++|+.++..-.+.... ... .....+... T Consensus 88 v~~~~~k~-~~~~~is~ell~ds~~~l~~~i~~~l~~a~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 166 (320) T protein:vir:10 88 QNIAPHKI-ATIFVASAETVRANPANYLGTMRTKVATAFAMAFDSAALNGTDSPFPTYLAQTTKSVSLADPGGATASDLT 166 (320) T ss_pred EEEeeEEE-EEeehhhHHHHhcChHHHHHHHHHHHHHHHHHHHHHHhhcccCCCCCcccccccccccceecccccccccc Confidence 77777554 455788888777778899999999999999999999987532221110 000 011111111 Q ss_pred -HHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceee-----eeeeEEeceEEEEecccccc Q lcl|NC_019506. 143 -IYEELIKVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKN-----GFVGTILGFDVYLSNNMGSL 216 (276) Q Consensus 143 -~~~~i~~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~-----G~i~~~~G~~v~~s~~lp~~ 216 (276) ..+.+.++...+..... ..-++++||..+..|.+...-.. .+........ ..-++++|++++.++.+|.. T Consensus 167 ~~~~~~~~~~~~~~~~~~--~~~~~v~n~~~~~~L~~lkd~~G--~~l~~~~~~~~~~~~~~~~~i~g~pv~~~~~~~~~ 242 (320) T protein:vir:10 167 AYDAVAVNGLSLLVNAKK--KWTHTLLDDIVEPILNGAKDKNG--RPLFIESTYTDENSPFRAGRIVSRPTILSDHVADG 242 (320) T ss_pred cHHHHHHHHHhhhhcccC--CCcEEEEcHHHHHHHHHhhccCC--ceeeccccccCccccccCceeeeeeeEecCCCCCC Confidence 11235555555555443 34578999999999976432111 1111111111 11246899999999998753 Q ss_pred ccceEEEEEecceEEeeee-eeeeeeccC--------cc-----cc---eeeEEeeeeeeeEEEcCCeEEEEE-ecCC Q lcl|NC_019506. 217 TNGTGAIAGVKMACTFAEQ-IVQTEAYRM--------EK-----RF---ADAVKGLNVFGCKVIYPDALVCLK-KTNP 276 (276) Q Consensus 217 ~~~~~~~~~~~~a~~~~~~-~~~~e~~~~--------~~-----~~---~~~i~~~~~yg~~v~~~~~vv~~~-~~~p 276 (276) . ...+.+..+-+.++.+ ...++..++ ++ .| ...+++..++|.++++|+++++|+ ++|| T Consensus 243 ~--~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~~~d~~v~~~~a~~~l~~~~ap 318 (320) T protein:vir:10 243 T--TVGYMGDFRNVIWGQVGGLSFDVTDQATLNLGTPTEPNFVSLWQHNLVAVRVEAEYAFHNNDKDAFVKLTNVVTP 318 (320) T ss_pred c--eEEEEeecceEEEEEecCeEEEEeecceeeeccccccccchhhhcCcEEEEEEEeeccEEecccceEEEEeccCC Confidence 2 2222333222222221 112222211 11 11 256788899999999999999998 8899 No 95 >protein:vir:108211 Length: 318 # NCBI annotation: gp9 # Family: family:all:6420 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552338;genbank:gi:160700658;genbank:GeneID:5758931 Probab=99.55 E-value=5.6e-16 Score=104.20 Aligned_cols=263 Identities=12% Similarity=0.076 Sum_probs=171.1 Q ss_pred CccchhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEe----ccCcccceeecCCCCCCCccccccceEEE Q lcl|NC_019506. 1 MAVTSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKIN----QIGAITVKEYTENSDIDAPEELSTTEKVL 76 (276) Q Consensus 1 MA~~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip----~~~~~~~~d~~~~~~~~~~~~~~~~~~~~ 76 (276) |++ |+++..++.+.+++..+...++.+ .. .+.+-.+.+. ........++.+++.+... ........+ T Consensus 22 l~~----P~~I~~~i~e~~~~~~iad~lf~~-~~---a~~~~~v~f~~~~p~~~~~d~e~VaEggEiP~~-~~~~G~~~i 92 (318) T protein:vir:10 22 VGN----PLWIPTALKKMMVNQFISESLFRN-GG---ANPNGVVAYNEGNPSFLEDDVADVAEFGEIPVS-AGARGLPRT 92 (318) T ss_pred hCC----chhHHHHHHHHHhccchhhhhhhc-cc---ccccceeEEEecccccccCcHhhccCccccccc-CCCCCchhh Confidence 333 778888888878777766666643 22 2335567773 3444567777888887653 344444555 Q ss_pred EEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccc-CCHH----HH---HHHHH Q lcl|NC_019506. 77 EINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSKLKPAAT-LDKT----NI---YEELI 148 (276) Q Consensus 77 ~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~~~~~~~-~t~~----~~---~~~i~ 148 (276) -.-+..+..+.|+++....+..+.+++.+++++.+++++.|+.++..+.++........++ .+.. .+ .+.+. T Consensus 93 a~~~K~G~~~~vS~Em~~~n~~~~v~r~~~~l~Nti~r~~d~~a~dal~sa~t~~~~~s~~w~~~~~~~~d~~~A~e~v~ 172 (318) T protein:vir:10 93 AFAVKKALGVRVSKEMIDENRVGAVNDQMLQLRNTFIRANDRSAKALLQSPIVPTLAVPTAWDNGGKVRTDIAIAIEQIS 172 (318) T ss_pred hhhehhccceeccHHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccCCcCCCCcccccccchhhhhhhh Confidence 3323347889999999999999999999999999999999999999886654332222111 1111 11 12222 Q ss_pred HHHHHHhhcCC-------CccCCEEEECHHHHHHHhhhHHhhhhcccccccc----eeeeee-eEEeceEEEEecccccc Q lcl|NC_019506. 149 KVKVKLDEKNV-------PTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESI----TKNGFV-GTILGFDVYLSNNMGSL 216 (276) Q Consensus 149 ~a~~~l~~~~v-------P~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~----~~~G~i-~~~~G~~v~~s~~lp~~ 216 (276) .|...+..+.+ .-..-.+|+||..+..|++++.+...-....+.. -..|.+ ++++|++|+.|.++|.. T Consensus 173 ~a~~~~~~a~~~~~~~~~GY~pdtIVlhP~~~~~l~~n~~~~~~y~~~a~~~~~~~~~tg~~~g~~lGl~vi~s~~~p~~ 252 (318) T protein:vir:10 173 TAAPTAYPAGVGSSDEYFGFIPDTIVMHYALLPILMDNENFMKVYERNANYVSTAPDWTGNFPGSVMGLNVIRSRTFPID 252 (318) T ss_pred hhhhhhhhhhhhhhhhccCccceeeEECHHHHHHHhcchhhhhhhhccchhhhhcccccccccceeeceEEeecCccCCC Confidence 22222211111 1111269999999999999988765422112111 123555 67899999999999953 Q ss_pred ccceEEEEEecceEEeeeeee--eeeeccCc-------ccceeeEEeeeeeeeEEEcCCeEEEEE-ecCC Q lcl|NC_019506. 217 TNGTGAIAGVKMACTFAEQIV--QTEAYRME-------KRFADAVKGLNVFGCKVIYPDALVCLK-KTNP 276 (276) Q Consensus 217 ~~~~~~~~~~~~a~~~~~~~~--~~e~~~~~-------~~~~~~i~~~~~yg~~v~~~~~vv~~~-~~~p 276 (276) . ++.+.++.+|+-.... +.+.++.+ ...+..++........|.+|.+++.|+ .--| T Consensus 253 ~----alvlq~g~vG~~~d~~pl~~t~~~~egg~~~g~~~~s~~~~~~~~~~~~V~~PkA~~~itgi~~~ 318 (318) T protein:vir:10 253 R----VLIMERGTVGFYSDTRPLQFTALYPEGNGPNGGPTESYRADASHKRALAVDQPKAALWLTGIVTP 318 (318) T ss_pred e----eEEEecCCcceeeccccceeeecccCCCCCCCCcchhhheehheeeeeeeeCcceeEEEeeccCC Confidence 3 6777788888764332 45666654 334568888899999999999999998 5566 No 96 >protein:vir:95763 Length: 297 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950590;genbank:gi:119953785;genbank:GeneID:5076833 Probab=99.55 E-value=5.1e-15 Score=98.96 Aligned_cols=259 Identities=11% Similarity=0.051 Sum_probs=165.3 Q ss_pred Ccc------chhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccC-cccceeecCCCCCCCccccccce Q lcl|NC_019506. 1 MAV------TSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIG-AITVKEYTENSDIDAPEELSTTE 73 (276) Q Consensus 1 MA~------~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~-~~~~~d~~~~~~~~~~~~~~~~~ 73 (276) |.. ..++|+.|+.++.+.+.+.+++.+++.+- +...+..+.+|+.. ...+..+.++..... .+.+.+. T Consensus 9 ~~~~~t~~~~~lvP~~~~~~ii~~~~~~s~l~~~~~~~----~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~-~~~~f~~ 83 (297) T protein:vir:95 9 ENVLVSQKKDGTLHKEFTDIIMKEVAQNSLVMQLGQYQ----EMEGEQEKTVYVQTDGISAYWVNETEKIKT-DKPEVVP 83 (297) T ss_pred ccccccCCCcceechhHHHHHHHHHHhhchhhhhccee----ecCCCccEEEEEEcCCceeEEeecCccccc-cccceeE Confidence 211 13789999999999999999999987652 12223345666554 445677777776654 4567777 Q ss_pred EEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc-----ccccccCCHHHHHHHHH Q lcl|NC_019506. 74 KVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSK-----LKPAATLDKTNIYEELI 148 (276) Q Consensus 74 ~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~-----~~~~~~~t~~~~~~~i~ 148 (276) +++...+. +.-+.|+++-...+..++.+.+.++.++++++++|..++..-.+..... ............++.|. T Consensus 84 v~l~~~k~-~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~l~G~g~~~~~gi~~~~~~~~~~~~~~~t~~~i~ 162 (297) T protein:vir:95 84 VTLKAHKL-GIILVTSREALNYTWKKFFEDMKPQIVEAFYKKIDEAGLLGHDTPFANSVAKAAKDANKVIGGPINYDNIL 162 (297) T ss_pred EEEeeEEE-EEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhcccCCcccccccccccccceecccccCHHHHH Confidence 77777553 4457788876767778999999999999999999999885322211110 00011111122367788 Q ss_pred HHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEeccccccccceEEEEEecc Q lcl|NC_019506. 149 KVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSNNMGSLTNGTGAIAGVKM 228 (276) Q Consensus 149 ~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp~~~~~~~~~~~~~~ 228 (276) ++...|...+.+.. .++++|..+..|.+.. .. . ...+..+..++++|.+++.++..+... +..+++..+ T Consensus 163 ~~~~~l~~~~~~~~--~~v~~~~~~~~L~~l~---d~--~--G~~i~~~~~~~l~G~Pv~~~~~~~~~~--~~~~~gd~s 231 (297) T protein:vir:95 163 KLQDALYDADVEPN--AFVSKIQNRSALREAR---DG--N--KVSIYDKAANTIDGITTVDLKSARFEK--GDLLAGDFD 231 (297) T ss_pred HHHHHhhhccCCcC--EEEEcHHHHHHHHHhh---cc--C--CceeecCCCCcccceeeEeecCCCCCC--ceEEEEecc Confidence 88888887765433 5899999999987532 11 1 123445666789999999876655322 223333322 Q ss_pred eEEeee-eeeeeeeccC--------c-----ccc---eeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 229 ACTFAE-QIVQTEAYRM--------E-----KRF---ADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 229 a~~~~~-~~~~~e~~~~--------~-----~~~---~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) .+.+.. +...++..++ + +.| ...+++..++|.++++|+++++|+.+.| T Consensus 232 ~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~at~ 296 (297) T protein:vir:95 232 NLIYGVPYNITYKISEEGQISTITNADGTPINLFEQEMIAIRATMDIAVMITKTDAFAKLTPAER 296 (297) T ss_pred cEEEEEecCeEEEEeeccccccccccCccchhhhhcCcEEEEEEEEeccEeecccceEEEeecCC Confidence 222221 1112222211 1 111 3567888999999999999999999999 No 97 >protein:vir:8187 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817980;genbank:gi:29566414;genbank:GeneID:2700968 Probab=99.55 E-value=6.6e-15 Score=98.33 Aligned_cols=265 Identities=11% Similarity=0.043 Sum_probs=165.2 Q ss_pred Cccc----hhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccC-cccceeecCCCCCCCccccccceEE Q lcl|NC_019506. 1 MAVT----SFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIG-AITVKEYTENSDIDAPEELSTTEKV 75 (276) Q Consensus 1 MA~~----~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~-~~~~~d~~~~~~~~~~~~~~~~~~~ 75 (276) ||.. +++|+.++..+++.+++.+++..++..- + ..+..+++|+.. ...+..+.++..... .+++.++++ T Consensus 1 mat~~~gg~lvP~~~~~~ii~~~~~~s~i~~~~~~i----~-~~~~~~~~p~~~~~~~a~wv~Eg~~~~~-~~~~f~~v~ 74 (311) T protein:vir:81 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAE----P-QEFGEQQYMTLTAPPRGEVVGEGAQKSE-STATFAPVT 74 (311) T ss_pred CceecCCceEcchhHHHHHHHHHHhcchhhhhccee----e-cCCCceEEEEEeCCceeEEeecCccccc-ccceeeEEE Confidence 8753 5899999999999999999999987542 1 234568899874 455677777776654 456777777 Q ss_pred EEEEeeeecceeechHHHH---hhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc-------------ccccccCC Q lcl|NC_019506. 76 LEINKQKYFNFQIDDVDAA---QIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSK-------------LKPAATLD 139 (276) Q Consensus 76 ~~ld~~~~~~~~v~d~d~~---~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~-------------~~~~~~~t 139 (276) +...+. +.-+.|+++-.. .+..++.+.+.++.++++++++|..++....+..... ....+..+ T Consensus 75 l~~~kl-~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~a~l~G~~~~~~~~~~gi~~~~~~~~~~~~~~~~~ 153 (311) T protein:vir:81 75 AIPRKV-QVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGT 153 (311) T ss_pred EeeEEE-EEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHhhhccccCCCCcccccccccccccceeeeecccc Confidence 777554 444677776443 2345688889999999999999999886532211110 00111122 Q ss_pred HHHHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEecccccccc- Q lcl|NC_019506. 140 KTNIYEELIKVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSNNMGSLTN- 218 (276) Q Consensus 140 ~~~~~~~i~~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp~~~~- 218 (276) ....+..+.++...+...+... ...++||..+..|.+... ....+........|..++++|.+++.++.+|.... T Consensus 154 ~~~~~~~i~~~~~~~~~~~~~~--~~~vmn~~~~~~l~~lkd--~~G~~l~~~~~~~~~~~tl~G~Pv~~~~~i~~~~~~ 229 (311) T protein:vir:81 154 SATPDLAVEAAVGLVLGDNLSP--DGVALDNTFSFMLATQRD--SQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPEA 229 (311) T ss_pred cchHHHHHHHHHHHhhhcCCCc--eEEEEcHHHHHHHHhhhc--cCCCeeecCccccCCCceecceeEEecccccccccc Confidence 2334455666776666655422 358999999999976421 11222233444556778999999999999884321 Q ss_pred -------------ceEEEEEecceEEeee-eeeeeeeccC--cc----cc---eeeEEeeeeeeeEEEcCCeEEEEEecC Q lcl|NC_019506. 219 -------------GTGAIAGVKMACTFAE-QIVQTEAYRM--EK----RF---ADAVKGLNVFGCKVIYPDALVCLKKTN 275 (276) Q Consensus 219 -------------~~~~~~~~~~a~~~~~-~~~~~e~~~~--~~----~~---~~~i~~~~~yg~~v~~~~~vv~~~~~~ 275 (276) ....+++--+-+.... +...++..++ ++ .| ...+++..++|++|++|++++.++-++ T Consensus 230 ~~~~~~~~~~~~~~~~~~~gDfs~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~r~~~r~d~~v~~~~a~~~l~~a~ 309 (311) T protein:vir:81 230 VTASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDAD 309 (311) T ss_pred cccccchhcccCCccEEEEEecccEEEEEeccceEEEeccCCCCcchhhhhcCcEEEEEEEEeccEeecccceEEEEeec Confidence 1111222111122211 1112222221 11 12 247788899999999999999998555 Q ss_pred C Q lcl|NC_019506. 276 P 276 (276) Q Consensus 276 p 276 (276) = T Consensus 310 ~ 310 (311) T protein:vir:81 310 E 310 (311) T ss_pred c Confidence 5 No 98 >protein:vir:95376 Length: 425 # NCBI annotation: phage major capsid protein # Family: family:all:635 # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764476;genbank:gi:115334630;genbank:GeneID:5179263 Probab=99.55 E-value=4e-15 Score=99.53 Aligned_cols=263 Identities=15% Similarity=0.118 Sum_probs=161.7 Q ss_pred Cc---cchhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCcc-cceeecCCCCCCCccccccceEEE Q lcl|NC_019506. 1 MA---VTSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGAI-TVKEYTENSDIDAPEELSTTEKVL 76 (276) Q Consensus 1 MA---~~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~~-~~~d~~~~~~~~~~~~~~~~~~~~ 76 (276) .+ ..+++|+.+..++.+.+++.+.+.+++..- + ..|+ ..+|+.... .+..+.+++........+.+++++ T Consensus 141 ~~~~~gg~~vP~~~~~~Ii~~l~~~~~i~~~~~~~----~-~~g~-~~ip~~~~~~~a~~v~E~~~~~~~~~~~f~~i~l 214 (425) T protein:vir:95 141 RAVAGGELTIPEVVVNRIMDIMGDYTTLYPLVDKI----R-VKGT-TRILVDTDTSPATWIEQSGALPTGDVGTIASIDF 214 (425) T ss_pred cccccCceeccHHHHHHHHHHHHhhhhHHHhhcee----e-cCce-eEEEEecCCccccccccccccccccccccceeee Confidence 11 124789999999999999999999887531 1 2344 478876554 355556666553323345667777 Q ss_pred EEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccc---cc----cc--ccccCCHHHHHHHH Q lcl|NC_019506. 77 EINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNAT---SK----LK--PAATLDKTNIYEEL 147 (276) Q Consensus 77 ~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~---~~----~~--~~~~~t~~~~~~~i 147 (276) ...+. +.-+.|+++-...+..++...+.++.+++++.++|..++..-.++.. +. .. ..+.......++.+ T Consensus 215 ~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~il~G~G~~~~~p~Gil~~~~~~~~~~~~~~~~~~~~~ 293 (425) T protein:vir:95 215 DGFKV-GKVTFVDNYLLQDSIINLDDYVTKKIARAIAKALDLAIVKGTGAANKQPLGIIPSLPPENQVTVEADNNLLKNL 293 (425) T ss_pred eheee-eeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccceeecccccccccccccccchHHHH Confidence 76543 44468888877778889999999999999999999998864322110 00 00 01111223346777 Q ss_pred HHHHHHHhhcCCCccCCEEEECHHHHHH-HhhhHHhhh-hcccccccceeeeeeeEEeceEEEEeccccccccceEEEEE Q lcl|NC_019506. 148 IKVKVKLDEKNVPTIGRFLIIPPDVHGL-LLAADLIVG-TGGAMAESITKNGFVGTILGFDVYLSNNMGSLTNGTGAIAG 225 (276) Q Consensus 148 ~~a~~~l~~~~vP~~~r~~vv~p~~~~~-L~~~~~~~~-~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp~~~~~~~~~~~ 225 (276) .++...+.....+..+-++++++..+.. |.......+ ...+.+. .-.+..++++|.+|+.++.+|.. ...+.. T Consensus 294 ~~~~~~~~~~~~~~~~~~~v~~~~~~~~~l~~l~~~kd~~g~~i~~--~~~~~~~~l~G~pvv~~~~~~~~---~i~~Gd 368 (425) T protein:vir:95 294 VKQIGLIDTGDDSVGEIVAVMKRSTYYNRLVEFSIQVDSNGNVVGK--LPNLRTPDLLGLRVVFNNFLDDD---TVLFGE 368 (425) T ss_pred HHHHHhhhhhccccCceEEEEeChHHHHHHHHHHhhcCCCCceeec--cCCCCCccccceeeEEcCcCCCc---cEEEEe Confidence 7777766655544445456788776543 433222111 1112221 22455678999999999999853 222222 Q ss_pred ecceEEeeeeeeeeeeccCcccc---eeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 226 VKMACTFAEQIVQTEAYRMEKRF---ADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 226 ~~~a~~~~~~~~~~e~~~~~~~~---~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) .+..+....+...++..++. +| ...+++..++++++++|+++++++.+.| T Consensus 369 ~~~~~~~~~~~~~i~~~~~~-~f~~~~~~~~~~~r~d~~~~~~~a~~~~~i~~~ 421 (425) T protein:vir:95 369 FEQYTLVERENITIDSSTHV-KFTEDQTAFRGKGRFDGKPVKPEAFVLVTITDP 421 (425) T ss_pred cccEEEEeecceEEEeeccc-ccccCceEEEEEEeeCcEeecccceEEEEecCc Confidence 22222222222233333332 23 3578888999999999999999999999 No 99 >protein:vir:3870 Length: 400 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680487;swissprot:trembl:q8ltc0;genbank:gi:22296527;interpro:IPR006444;uniprot:Q8LTC0;genbank:GeneID:951713 Probab=99.54 E-value=2.8e-15 Score=100.39 Aligned_cols=255 Identities=12% Similarity=0.038 Sum_probs=162.7 Q ss_pred Cc---cchhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccC--cccceeecCCCCCCCccccccceEE Q lcl|NC_019506. 1 MA---VTSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIG--AITVKEYTENSDIDAPEELSTTEKV 75 (276) Q Consensus 1 MA---~~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~--~~~~~d~~~~~~~~~~~~~~~~~~~ 75 (276) +. ...++|+.|...+++.+++...+.++++.- ...+.++++|.+. ...+..+.+++.......++...++ T Consensus 137 ~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~i~ 211 (400) T protein:vir:38 137 VKAADAASTIPETISNTPQRELQTVVDLKPFTNVF-----QASTQKGTYPTVANATTKMVTVAELEKNPAMAKPEFKPVN 211 (400) T ss_pred ccccCCcccccHHHHHHHHHHHHhhhhhhhcceeE-----eccCcceEEEEEecCCCccccccccccccccccccceeeE Confidence 11 125889999999999999988888877531 1224456666553 3445666666555433456667777 Q ss_pred EEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccCCHHHHHHHHHHHHHHHh Q lcl|NC_019506. 76 LEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSKLKPAATLDKTNIYEELIKVKVKLD 155 (276) Q Consensus 76 ~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~~~l~ 155 (276) +.+.+. +.-+.|+++-...+..++...+.+..+.+++...|..++....+.. +.+..+ ++.+.++....- T Consensus 212 ~~~~k~-~~~~~is~ell~ds~~~~~~~i~~~l~~~~~~~~~~~i~~~~~~~~-----~~~~~~----~~~~~~~~~~~~ 281 (400) T protein:vir:38 212 WSVETY-RQALPVSQESIDDSAIDLVGLIAQNGQQIKVNTTNGAVATLLKGFT-----AKTISS----VDDLKHINNVDL 281 (400) T ss_pred eehhhe-eeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhhhhcccccc-----cccccc----HHHHHHHHHhhh Confidence 777554 4456788876667778899999999999999999998876554432 112222 334444332211 Q ss_pred hcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEeccccccccceEEEEE-e-cceEEee Q lcl|NC_019506. 156 EKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSNNMGSLTNGTGAIAG-V-KMACTFA 233 (276) Q Consensus 156 ~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp~~~~~~~~~~~-~-~~a~~~~ 233 (276) . +..+-..++||..+..|.+...- ...+.....+..|..++++|++|+.++++|..+.+...+.+ . ..++... T Consensus 282 ~---~~~~a~~v~~~~~~~~l~~lkd~--~G~~i~~~~~~~~~~~~l~G~pv~~~~~~~~~~~g~~~~~~gd~s~~~~~~ 356 (400) T protein:vir:38 282 D---PAYSRVIIASQSFYNFLDTVKDG--NGRYLLQDSILTPSGKSVLGMPIAVVSDDTLGAAGEAHAFLGDIKRAILFA 356 (400) T ss_pred h---hhhCcEEEEcHHHHHHHHHhhcc--CCCeeeecCcCCCCccccccceeEEecccccCCCCceEEEEEeccccEEEE Confidence 1 12245789999999998764321 12222333455677789999999999998865544433332 2 3333333 Q ss_pred -eeeeeeeeccCcccceeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 234 -EQIVQTEAYRMEKRFADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 234 -~~~~~~e~~~~~~~~~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) .+...+... +...+...+++..++|+++++|++++.++.++= T Consensus 357 ~~~~~~~~~~-~~~~~~~~~~~~~r~d~~~~~~~a~~~l~~~~~ 399 (400) T protein:vir:38 357 NRADFMVRWV-DDQIYGQFLQAGMRFGVSVADEKAGYFLTYTPK 399 (400) T ss_pred eecceEEEEe-cccccceeEEEEEEeccEEecccceEEEEeecC Confidence 333333333 345567789999999999999999999876433 No 100 >protein:vir:4456 Length: 401 # NCBI annotation: Major capsid protein precursor # Family: family:all:21 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700379;genbank:gi:23505451;genbank:GeneID:955658 Probab=99.54 E-value=3.8e-15 Score=99.64 Aligned_cols=265 Identities=14% Similarity=0.031 Sum_probs=166.5 Q ss_pred Ccc------chhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEecc-CcccceeecCCCCCCCccccccce Q lcl|NC_019506. 1 MAV------TSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQI-GAITVKEYTENSDIDAPEELSTTE 73 (276) Q Consensus 1 MA~------~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~-~~~~~~d~~~~~~~~~~~~~~~~~ 73 (276) |.. -+++|+.|..++++.+++.+++..++..- + ..|.+..+|.. +...+....++.........+.+. T Consensus 107 ~~~~~~~~GG~~iP~~~~~~ii~~~~~~~~l~~~~~~~----~-~~~~~~~~~~~~~~~~a~wv~E~~~~~~~~~~~~~~ 181 (401) T protein:vir:44 107 LQVGTDEDGGYAVPEELDRSILSLLKDEVVMRQEATVI----T-VGGSDYKKLVNLGGTASGWVGETDTRSQTATSRLGL 181 (401) T ss_pred hhcCCCCCCceeccHhHHHHHHHHHHhhhhhhhhceee----e-cCCCceEEEEecCCccceeeccccccCcccccccee Confidence 332 24889999999999999999888887531 1 23456666654 334455555655443323345666 Q ss_pred EEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccc-cc--------c-------ccc--- Q lcl|NC_019506. 74 KVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNA-TS--------K-------LKP--- 134 (276) Q Consensus 74 ~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~-~~--------~-------~~~--- 134 (276) +++.+.+. +.-+.|+++-...+..++.+.+.+..+.++++++|..++..=.+.. .. . +.. T Consensus 182 v~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~la~ai~~~~~~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~ 260 (401) T protein:vir:44 182 IEPFMGEI-YGNPQATQKMLDDAFFNVEAWINSELATEFAEQEEIAFTTGDGTKKPKGFLAYESTEESDKARAFGKLQHI 260 (401) T ss_pred eeeehhhe-eeehhhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhhccCCCCccceeecccccccccccccccccccc Confidence 77776554 3346778877777778999999999999999999998884311110 00 0 000 Q ss_pred cccCCHHHHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEecccc Q lcl|NC_019506. 135 AATLDKTNIYEELIKVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSNNMG 214 (276) Q Consensus 135 ~~~~t~~~~~~~i~~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp 214 (276) .+.......++.|.++...|..... .+-++++||..+..|.+...- ...+.....+..|..++++|.+|+.++.+| T Consensus 261 ~t~~~~~~~~d~i~~~~~~l~~~~~--~~a~~v~n~~~~~~L~~lkd~--~G~~l~~~~~~~g~~~~l~G~PVv~~~~~p 336 (401) T protein:vir:44 261 VSGEATAVTADAIIKLIYTLRKAHR--TGAKFMMNNNSLFAIRLLKDT--EGNYLWRPGLELGQPSSLAGYGIAENEQMP 336 (401) T ss_pred ccccccccCHHHHHHHHHhcchhhh--cCCEEEEcHHHHHHHHHhhcc--CCceeecCCcCCCCCceecceeeEEecCcC Confidence 0011111236778877777765432 344689999999998764321 112223344567888899999999999999 Q ss_pred ccccceEEEEE-e-cceEEeeeeeeeeeeccCccc--ceeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 215 SLTNGTGAIAG-V-KMACTFAEQIVQTEAYRMEKR--FADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 215 ~~~~~~~~~~~-~-~~a~~~~~~~~~~e~~~~~~~--~~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) ....+...+++ . +.++....+.. ++..+++.. --..+++..++|+.+++|++++.++.++= T Consensus 337 ~~~~~~~~i~~Gd~~~~~~i~~~~~-~~~~~~~~~~~~~v~~~a~~r~d~~~~~~~a~~~l~~~aa 401 (401) T protein:vir:44 337 DIAADAKAIAFGNFKRGYTIVDRIG-TRILRDPYTNKPFVGFYTTKRTGGMLVDSQAIKLLKIAAA 401 (401) T ss_pred CccCCccEEEEeehhccEEEEEecc-eEEeeeccccCCcEEEEEEEEeccEEecccceEEEEeecC Confidence 76655554432 2 33444333221 222222221 12457888999999999999999996666 No 101 >protein:vir:100172 Length: 394 # NCBI annotation: putative major head protein # Family: family:all:21 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025031;genbank:gi:48697264;genbank:GeneID:2948270 Probab=99.54 E-value=7e-15 Score=98.18 Aligned_cols=257 Identities=16% Similarity=0.151 Sum_probs=161.6 Q ss_pred Cc------cchhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEecc--CcccceeecCCCCCCCccccccc Q lcl|NC_019506. 1 MA------VTSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQI--GAITVKEYTENSDIDAPEELSTT 72 (276) Q Consensus 1 MA------~~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~--~~~~~~d~~~~~~~~~~~~~~~~ 72 (276) |. -..++|+.|...+++.+++..++.++++.- ...+.+.++|.. +...+....+++.......++.. T Consensus 111 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~ 185 (394) T protein:vir:10 111 AGHVTSTEAGVLIPEEIIYDPTAEVNSVVDLSTLVTKT-----PVTTPKGTYPILKRATDRFSSVAELAENPALAEPEFE 185 (394) T ss_pred hcccccccCceeccHHHHHHHHHHHHhhhhhhhhceee-----eccCCceEEEEEecCCCccccccccccccccccccce Confidence 11 125789999999999999999998887542 123455666654 33345555565554433456777 Q ss_pred eEEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccCCHHHHHHHHHHHHH Q lcl|NC_019506. 73 EKVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSKLKPAATLDKTNIYEELIKVKV 152 (276) Q Consensus 73 ~~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~~ 152 (276) .+++.+.+. +.-+.|+++-...+..++...+.+..+++++...|..++.....+..... . ....++.|.++.. T Consensus 186 ~v~l~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~il~g~g~~~~~~~--~----~~~~~d~l~~~~~ 258 (394) T protein:vir:10 186 QVDWSVSTY-RGAIPLSEEAIADSAVDLTSLVGQSINEKSVNTYNAMIAPVLQSFTAKAT--T----TDTLVDSLKHILN 258 (394) T ss_pred eEEeeeeee-EeeehhHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccc--c----ccccHHHHHHHHH Confidence 888888665 44467888877778889999999999999999999998877654332211 1 1122445555433 Q ss_pred -HHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhccccccc----ceeeeeeeEEeceEEEEecc--ccccccceEEEEE Q lcl|NC_019506. 153 -KLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAES----ITKNGFVGTILGFDVYLSNN--MGSLTNGTGAIAG 225 (276) Q Consensus 153 -~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~----~~~~G~i~~~~G~~v~~s~~--lp~~~~~~~~~~~ 225 (276) .++.. .+-.+|+||..+..|.+...-. ..+.... ....|.-++++|++|+.++. +|..++....+.+ T Consensus 259 ~~~~~~----~~a~~vmn~~~~~~l~~lkd~~--G~~i~~~~~~~~~~~~~~~~L~G~PV~~~~~~~~~~~~~~~~i~~g 332 (394) T protein:vir:10 259 VDLDPA----YSRALVVTQSLFNTLDTLKDKN--GRYLLHDASDSITDGTAKGTVLGVPVYVVGDALLGSAAGDQKAFVG 332 (394) T ss_pred hhhhhh----ccCEEEecHHHHHHHHHhhccC--CCeeeeccccccccCCcccccccceeEEecccccCCCCCceEEEEe Confidence 22222 1346899999999997643211 1111111 11224446899999987543 3433333222332 Q ss_pred -ecceEEeee-eeeeeeeccCcccceeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 226 -VKMACTFAE-QIVQTEAYRMEKRFADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 226 -~~~a~~~~~-~~~~~e~~~~~~~~~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) .+.++.... +...++. .+...|.+.+++..++|+++++|++++.++.+.+ T Consensus 333 d~s~~~~~~~~~~~~v~~-~~~~~~~~~~~~~~r~d~~~~~~~ai~~~~~~~~ 384 (394) T protein:vir:10 333 DLKRGVLFADRQQVTLAW-EDSKIYGRYLGAAFRFGVKQADSNAGYFVTNTDA 384 (394) T ss_pred eccccEEEEeecceEEEE-ecccccceeEEEEEEeccEEeccccEEEEEeecc Confidence 233343332 2333433 3345567788999999999999999999987777 No 102 >protein:vir:1328 Length: 392 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047927;swissprot:trembl:q9zwv6;genbank:gi:9631145;uniprot:Q9ZWV6;genbank:GeneID:2715889 Probab=99.53 E-value=8.2e-15 Score=97.81 Aligned_cols=262 Identities=11% Similarity=0.016 Sum_probs=161.8 Q ss_pred Ccc---chhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCc-ccceeecCCCCCCCccccccceEEE Q lcl|NC_019506. 1 MAV---TSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGA-ITVKEYTENSDIDAPEELSTTEKVL 76 (276) Q Consensus 1 MA~---~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~-~~~~d~~~~~~~~~~~~~~~~~~~~ 76 (276) ++. ..+.|+++...+.+.+.+..++..++.. .+...|..+.+|.... ..+..+.++..... .+++...+++ T Consensus 114 t~~~~g~~~~~~~~~~~i~~~~~~~~~l~~~~~~----~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~-~~~~f~~v~~ 188 (392) T protein:vir:13 114 TKAGNPNVLSRTLYGQLIAQAVERSAIMRGGAST----FTTSDANPMDFTVITGRATAGIVGETAEIPE-SYPATTQRSM 188 (392) T ss_pred cccCCCccccccchHHHHHHHHhhhhhhhhccee----eecCCCceeEEEEEcCCcceeeecccccccc-cccceeeEEe Confidence 221 1456777887777777777777666543 1234567788887644 45666677766544 4567777777 Q ss_pred EEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcc---------ccccccccccCCHHHHHHHH Q lcl|NC_019506. 77 EINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTN---------ATSKLKPAATLDKTNIYEEL 147 (276) Q Consensus 77 ~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~---------~~~~~~~~~~~t~~~~~~~i 147 (276) .+.+. +.-+.|+++-...+..++.+.+.++.+.++++.+|..++..=.++ +...............++.| T Consensus 189 ~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~l~G~Gt~~p~Gil~~~~~~~~~~~~~~~~~~~~d~l 267 (392) T protein:vir:13 189 GGFKY-GFASVVSYEFATDQVLDLVGFLVSDAGPAIGDAMGRHFLTGTGTGQPRGILTDATGANAAFGEADADSKVSDAL 267 (392) T ss_pred eeeeE-EeeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhcccCCccccccccccccccccccccccccccHHHH Confidence 77554 445678888777778899999999999999999999988521111 00000111111222346777 Q ss_pred HHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEeccccccccceEEEEEec Q lcl|NC_019506. 148 IKVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSNNMGSLTNGTGAIAGVK 227 (276) Q Consensus 148 ~~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp~~~~~~~~~~~~~ 227 (276) .++...|..... .+-..++||..+..|..... ....+.....+..|..++++|.+|+.++.+|..+ .+.+.- T Consensus 268 ~~~~~~l~~~~~--~~a~~v~n~~~~~~l~~lkd--~~G~~l~~~~~~~g~~~~l~G~Pv~~~~~~~~~~----i~~Gdf 339 (392) T protein:vir:13 268 IDLFHEVPSAYR--KNAKFVVNDLRAAQMRKLKD--ANGQYLWQSALTVGAPDTFNGKVVETDDGMPADK----VLFADL 339 (392) T ss_pred HHHHHhhhhhhh--cCCEEEEcHHHHHHHHHhhc--cCCceeecCCcCCCCCceecceeeEEcCCCCCCc----EEEeec Confidence 777766654432 23357889999998865321 1112223334556767789999999999998532 223332 Q ss_pred ceEEeee-eeeeeeeccCcccc--eeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 228 MACTFAE-QIVQTEAYRMEKRF--ADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 228 ~a~~~~~-~~~~~e~~~~~~~~--~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) +.+.... +...++...++... .+.+++..++|+++++|+++++++.++= T Consensus 340 ~~~~i~~~~~~~i~~~~~~~~~~~~~~~r~~~r~d~~~~~~~A~~~~~~~~a 391 (392) T protein:vir:13 340 SKYRVRFAGSLRVDRSVDAKFSTDQIVYRFLQRADGLLVDARGAKVLTVTPA 391 (392) T ss_pred cceeEEeecceEEEeeccccccCCcEEEEEEEEeccEEecccceEEEEeecc Confidence 3332222 22233333333221 2577899999999999999998775444 No 103 >protein:vir:101607 Length: 379 # NCBI annotation: major capsid protein precursor # Family: family:all:585 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112497;genbank:gi:53793597;uniprot:Q5ZGF6;genbank:GeneID:3101715 Probab=99.53 E-value=1.5e-14 Score=96.30 Aligned_cols=257 Identities=14% Similarity=0.090 Sum_probs=161.3 Q ss_pred Ccc----chhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCcc---cceeecCCCCCCCccccccce Q lcl|NC_019506. 1 MAV----TSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGAI---TVKEYTENSDIDAPEELSTTE 73 (276) Q Consensus 1 MA~----~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~~---~~~d~~~~~~~~~~~~~~~~~ 73 (276) |.. ..++|+.|...+++.+...+.+.++++. ....+.++.+|+.... ......+++.... .+++... T Consensus 109 ~~~~~~~~~~ip~~~~~~ii~~~~~~~~i~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~-~~~~f~~ 182 (379) T protein:vir:10 109 MTLPVNLTGAQPKDYNFDVVLNPSQMLNVSDIVGA-----VSISGGTYTFVRENGAGEGAIGAQVEGATKGQ-KDYDISM 182 (379) T ss_pred cccCCCCccccchhhhhHHHHhHHhhhhHHhhcee-----eeccCCceEEEEeecCCCcccccccCCccccc-cccceee Confidence 221 1267999999999999888888888753 1234667888875322 3344556655443 4567788 Q ss_pred EEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccCCHHHHHHHHHHHHHH Q lcl|NC_019506. 74 KVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSKLKPAATLDKTNIYEELIKVKVK 153 (276) Q Consensus 74 ~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~~~ 153 (276) +++.+.+... -+.|+++- ++...++...+.+..+++++.++|..++....+......... +....++.|.++... T Consensus 183 i~~~~~k~~~-~~~iS~el-l~D~~~l~~~i~~~la~~~~~~~~~~~~~g~~~~~~~~~~~~---~~~~~~d~i~~~~~~ 257 (379) T protein:vir:10 183 IDVNTDFIAG-FTRYSKKM-ANNLPFLTSFIPNALRRDYAKAENAAFNAVLAANATASTEII---TNKNKVEMLINEIAK 257 (379) T ss_pred eEeeeeeEEe-eehhhHHH-HhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccc---cCcccHHHHHHHHHh Confidence 8888866533 46777764 344456777777889999999999988876655432221111 222335677777777 Q ss_pred HhhcCCCccCCEEEECHHHHHHHhhhHHhhhhccccccc--ceeeeeeeEEeceEEEEeccccccccceEEEEEecc-eE Q lcl|NC_019506. 154 LDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAES--ITKNGFVGTILGFDVYLSNNMGSLTNGTGAIAGVKM-AC 230 (276) Q Consensus 154 l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~--~~~~G~i~~~~G~~v~~s~~lp~~~~~~~~~~~~~~-a~ 230 (276) +...+.+. ..+++||..+..|.+...-. ..+.... ....|...+++|++|+.++.+|. +...+...+. ++ T Consensus 258 ~~~~~~~~--~~~vmn~~~~~~l~~lkd~~--G~~l~~~~~~~~~~~~~~l~G~pvv~s~~~~a---g~~~~gdf~~~~~ 330 (379) T protein:vir:10 258 QENLDFPV--TAIVLRPTDYYDILVTQKSV--GAGYGLPGVVTQDNGVLRINGIPLFRATWLAA---NKYYVGDWTRVTK 330 (379) T ss_pred hhhccCCC--CEEEEcHHHHHHHHHhhccC--CceeccCCccCCCCCcceecceeeEecCCCCC---CceEEeecccEEE Confidence 76666532 35889999999886543211 1111211 22345556899999999999874 2232322222 23 Q ss_pred EeeeeeeeeeeccCc-ccc---eeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 231 TFAEQIVQTEAYRME-KRF---ADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 231 ~~~~~~~~~e~~~~~-~~~---~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) .+ .+...++..+.. +.| -..+++..|+|+.|++|+++|.++.++= T Consensus 331 ~~-~~~~~i~~~~~~~~~f~~~~~~~r~~~R~~~~v~~p~a~v~~~~~~~ 379 (379) T protein:vir:10 331 VT-TEGLSLEFSEVEGTNFVKNNITARIEAQVALAVEQPAALIFGDFTAV 379 (379) T ss_pred EE-EeceEEEEeecccccccCCcEEEEEEEEeccEEecCccEEEEEecCC Confidence 32 222234433332 223 2477888999999999999999997777 No 104 >protein:vir:4856 Length: 293 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049396;genbank:gi:9632424;genbank:GeneID:1258532 Probab=99.52 E-value=1.4e-14 Score=96.60 Aligned_cols=260 Identities=13% Similarity=0.066 Sum_probs=168.9 Q ss_pred Cccc------hhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCc--ccceeecCCCCCCCccccccc Q lcl|NC_019506. 1 MAVT------SFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGA--ITVKEYTENSDIDAPEELSTT 72 (276) Q Consensus 1 MA~~------~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~--~~~~d~~~~~~~~~~~~~~~~ 72 (276) |+.. .++|+.|+.++++.+++...+.++++.-. . ....-+..+|+... ..+..+.++.........+.. T Consensus 5 ~~~~t~~~gg~liP~~~~~~Ii~~~~~~~~l~~~~~~~~--~-~~~~g~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~ 81 (293) T protein:vir:48 5 KTDHSGSDAGLTIPQDIRTAINTLVRQYDSLQEYVNVEN--V-TTLTGSRVYEKWTDITGLANIDDEAGKIADIDDPKLS 81 (293) T ss_pred ecccccCcCceEechhHHHHHHHHHHhhhhhhhhceeee--c-cCCcceEEEEeecCCCcceeeecCCccccccccccee Confidence 5432 47899999999999999999988875321 1 11223566766543 235566666665433446677 Q ss_pred eEEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccCCHHHHHHHHHHHHH Q lcl|NC_019506. 73 EKVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSKLKPAATLDKTNIYEELIKVKV 152 (276) Q Consensus 73 ~~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~~ 152 (276) .+++...+. +.-+.|+++-...+..++.+.+.++.++++++..|+.++......+.. .+. ..++.|.++.. T Consensus 82 ~i~l~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~i~~g~~~~~~~----~~~----~~~d~i~~~~~ 152 (293) T protein:vir:48 82 LIKYTIKRY-AGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILGVVDKLPTK----PTL----TKWDDIIDLEA 152 (293) T ss_pred EEEEeeeEE-EEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHhHHhhcccccccc----ccc----cCHHHHHHHHH Confidence 778887654 445788888777788899999999999999999999988765543321 122 22667777777 Q ss_pred HHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEec--cccccccceEE-EEE-ecc Q lcl|NC_019506. 153 KLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSN--NMGSLTNGTGA-IAG-VKM 228 (276) Q Consensus 153 ~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~--~lp~~~~~~~~-~~~-~~~ 228 (276) .+..... .+-..++||..+..|.+...- ...+.....+.+|..++++|.+|+.+. .+|..+.+... +.+ .+. T Consensus 153 ~l~~~~~--~~a~~vmn~~~~~~L~~lkd~--~g~~l~~~~~~~~~~~~l~G~Pv~~~~~~~~~~~~~~~~~~~~gd~~~ 228 (293) T protein:vir:48 153 KVDPAIK--QTSFFLTNTSGFTALKKVKNA--LGDYLMERDVKSPTGYSIAGFAVKEISDRWLPNASSGVMPLYFGDLKQ 228 (293) T ss_pred hhhhhhc--CCCEEEEcHHHHHHHHHhhcc--CCceEeecCcCCCCCceecceeeEEecccccCCccCCceEEEEEeccc Confidence 7765543 334688999999998764321 122223334566777899999997744 44544433332 222 244 Q ss_pred eEEeee-eeeeeeeccCc----ccceeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 229 ACTFAE-QIVQTEAYRME----KRFADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 229 a~~~~~-~~~~~e~~~~~----~~~~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) ++.... +...++..+.. ......+++..++|+++.+|++++.++.++. T Consensus 229 ~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~~ 281 (293) T protein:vir:48 229 AVTLFDRQQMSLLSTNIGGGAFETDTTKVRVIDRFDVVATDTEAFVPASFKAI 281 (293) T ss_pred eEEEEEecceEEEEecccchhhhcCeEEEEEEEeeCcEEecccceEEEEeecc Confidence 444332 22234333221 1223578899999999999999999986655 No 105 >protein:vir:94673 Length: 419 # NCBI annotation: major capsid protein # Family: family:all:585 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579208;genbank:gi:93007444;genbank:GeneID:5076792 Probab=99.52 E-value=1.6e-14 Score=96.23 Aligned_cols=262 Identities=14% Similarity=0.102 Sum_probs=162.5 Q ss_pred CccchhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCcc---------cceeecCCCCCCCcccccc Q lcl|NC_019506. 1 MAVTSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGAI---------TVKEYTENSDIDAPEELST 71 (276) Q Consensus 1 MA~~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~~---------~~~d~~~~~~~~~~~~~~~ 71 (276) -+...+.|+.+...+....+..+++..+++.- ...+.++++|+.... .+..+.++..... .+++. T Consensus 130 ~~~~~~~p~~~~~~i~~~~~~~~~i~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~-~~~~~ 203 (419) T protein:vir:94 130 NPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQ-----NADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQ-STLSF 203 (419) T ss_pred CCcccccchhhhHHHHHHHhhhhhhhhcceee-----eccCCceeeeeeccccccccccCcccceecCCccccc-cccce Confidence 12224678999998888777777777766431 123566777764322 2334445555433 45667 Q ss_pred ceEEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcc---------c---cccccccccCC Q lcl|NC_019506. 72 TEKVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTN---------A---TSKLKPAATLD 139 (276) Q Consensus 72 ~~~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~---------~---~~~~~~~~~~t 139 (276) ..+++.+.+. +.-+.|+.+-. +...++.+.+.++.+++++.++|..++..=.+. . ..........+ T Consensus 204 ~~i~~~~~k~-~~~~~is~ell-~d~~~l~~~i~~~la~a~~~~~d~aii~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~t 281 (419) T protein:vir:94 204 DTITTTLKTV-AHWLPITRQAA-DDNSQLMGYIQGRLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPKPTAPAT 281 (419) T ss_pred eeEEeeeeeE-EEeehhhHHHH-HhHHHHHHHHHHHHHHHHHHHHHHHHHhccCcccccceecccccccccccccccccc Confidence 7777777654 44467777544 445677777888899999999999988521111 0 00111122334 Q ss_pred HHHHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEeccccccccc Q lcl|NC_019506. 140 KTNIYEELIKVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSNNMGSLTNG 219 (276) Q Consensus 140 ~~~~~~~i~~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp~~~~~ 219 (276) ....++.|.++...+.....+. -.+++||..+..|.+...-.. ..+........|..++++|++|+.++.+|.. T Consensus 282 ~~~~~~~l~~~~~~~~~~~~~~--~~~v~n~~~~~~l~~~k~~~~-~~~~~~~~~~~~~~~~l~G~pV~~~~~~~~~--- 355 (419) T protein:vir:94 282 DEPPLVDIRRAKTVAEIAGFPP--DGVVVHPQDWESIELDQAPGS-GVFRVIANVQGEATPRIWGLNVVSTVAIAQG--- 355 (419) T ss_pred cchhHHHHHHHHHhhhhccCCC--CEEEEcHHHHHHHHHHhhcCC-CceeecCCcccCCCccccceeeEEcCCCCCc--- Confidence 5556888999988888776532 368999999999876432111 1122233455667789999999999999842 Q ss_pred eEEEEEecceE-EeeeeeeeeeeccCcc-cc---eeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 220 TGAIAGVKMAC-TFAEQIVQTEAYRMEK-RF---ADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 220 ~~~~~~~~~a~-~~~~~~~~~e~~~~~~-~~---~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) ...+...+.++ .+......++..+... .| ...+++..++|.++++|+++++++.++. T Consensus 356 ~~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~~~~~aa 417 (419) T protein:vir:94 356 TALVGGFRQGATLWSRQGITVLMTDSHADFFTANTLVILAEFRANLAVYQPKAFVRVTFAAA 417 (419) T ss_pred cEEEeeccceEEEEEecceEEEEeccccchhhcCcEEEEEEEeeccEEeccccEEEEEeccC Confidence 33333344443 3333333343333221 22 3577899999999999999999995555 No 106 >protein:vir:191 Length: 385 # NCBI annotation: major head subunit precursor # Family: family:all:585 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037701;genbank:gi:9634158;genbank:GeneID:1262530 Probab=99.52 E-value=1.4e-14 Score=96.53 Aligned_cols=260 Identities=11% Similarity=0.073 Sum_probs=164.8 Q ss_pred Ccc-----chhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCc--ccceeecCCCCCCCccccccce Q lcl|NC_019506. 1 MAV-----TSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGA--ITVKEYTENSDIDAPEELSTTE 73 (276) Q Consensus 1 MA~-----~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~--~~~~d~~~~~~~~~~~~~~~~~ 73 (276) |.. ..++|+.+...+++.+.+.+.+..++..- + ..|.++++|+... ..+..+.+++.... .+++... T Consensus 105 ~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~----~-~~~~~~~~~~~~~~~~~a~~v~E~~~~~~-~~~~~~~ 178 (385) T protein:vir:19 105 LGSDADSAGSLIQPMQIPGIIMPGLRRLTIRDLLAQG----R-TSSNALEYVREEVFTNNADVVAEKALKPE-SDITFSK 178 (385) T ss_pred hccccccCCceecchhhhHHHHHhhhccchhhhccee----c-ccCcceEEEEEecCCcceeeeccCccccc-cccceeE Confidence 222 12567778888999999999998887542 1 2356788887643 34555566655443 4567777 Q ss_pred EEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc--------ccccccCCHHHHHH Q lcl|NC_019506. 74 KVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSK--------LKPAATLDKTNIYE 145 (276) Q Consensus 74 ~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~--------~~~~~~~t~~~~~~ 145 (276) +++++.+. +.-+.|+++ ..+...++...+.++.+.++++.+|+.++..-.++.... .......++...++ T Consensus 179 ~~~~~~k~-~~~~~is~e-ll~d~~~l~~~i~~~la~a~~~~~d~~~l~G~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~d 256 (385) T protein:vir:19 179 QTANVKTI-AHWVQASRQ-VMDDAPMLQSYINNRLMYGLALKEEGQLLNGDGTGDNLEGLNKVATAYDTSLNATGDTRAD 256 (385) T ss_pred EEEeeeeE-EEeehhhHH-HHhhHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCcccccccccccccccccccccchHH Confidence 77777664 444677875 444556788888999999999999998885432222110 11111223334578 Q ss_pred HHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEeccccccccceEEEEE Q lcl|NC_019506. 146 ELIKVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSNNMGSLTNGTGAIAG 225 (276) Q Consensus 146 ~i~~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp~~~~~~~~~~~ 225 (276) .|.++...|...... .-.+++||..+..|.+...- ...+... ....|..++++|.+|+.++.+|. +...+.. T Consensus 257 ~i~~~~~~l~~~~~~--~~~~~~~~~~~~~l~~lkd~--~G~~l~~-~~~~~~~~~l~G~pV~~~~~~p~---~~~~~gd 328 (385) T protein:vir:19 257 IIAHAIYQVTESEFS--ASGIVLNPRDWHNIALLKDN--EGRYIFG-GPQAFTSNIMWGLPVVPTKAQAA---GTFTVGG 328 (385) T ss_pred HHHHHHHhhccccCC--CCEEEEcHHHHHHHHHhhcC--CCceecc-CcccCCCceecceeeEEcCcCCC---CcEEEee Confidence 888888888766543 33689999999998764321 1111121 13466678899999999999984 2333333 Q ss_pred ecceEEeeee-eeeeeeccCc-ccc---eeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 226 VKMACTFAEQ-IVQTEAYRME-KRF---ADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 226 ~~~a~~~~~~-~~~~e~~~~~-~~~---~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) .+.++....+ ...++..+.. +.| ...+++..++|+.+.+|+++++++.++= T Consensus 329 ~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~aa 384 (385) T protein:vir:19 329 FDMASQVWDRMDATVEVSREDRDNFVKNMLTILCEERLALAHYRPTAIIKGTFSSG 384 (385) T ss_pred cccEEEEEEecceEEEEeccccchhhcCcEEEEEEEeeccEEecccceEEEEeccC Confidence 3444444432 2233332221 112 2467888999999999999999986666 No 107 >protein:vir:1886 Length: 385 # NCBI annotation: major capsid subunit precursor # Family: family:all:585 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037666;genbank:gi:9634124;genbank:GeneID:1262513 Probab=99.52 E-value=1.4e-14 Score=96.53 Aligned_cols=260 Identities=11% Similarity=0.073 Sum_probs=164.8 Q ss_pred Ccc-----chhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCc--ccceeecCCCCCCCccccccce Q lcl|NC_019506. 1 MAV-----TSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGA--ITVKEYTENSDIDAPEELSTTE 73 (276) Q Consensus 1 MA~-----~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~--~~~~d~~~~~~~~~~~~~~~~~ 73 (276) |.. ..++|+.+...+++.+.+.+.+..++..- + ..|.++++|+... ..+..+.+++.... .+++... T Consensus 105 ~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~----~-~~~~~~~~~~~~~~~~~a~~v~E~~~~~~-~~~~~~~ 178 (385) T protein:vir:18 105 LGSDADSAGSLIQPMQIPGIIMPGLRRLTIRDLLAQG----R-TSSNALEYVREEVFTNNADVVAEKALKPE-SDITFSK 178 (385) T ss_pred hccccccCCceecchhhhHHHHHhhhccchhhhccee----c-ccCcceEEEEEecCCcceeeeccCccccc-cccceeE Confidence 222 12567778888999999999998887542 1 2356788887643 34555566655443 4567777 Q ss_pred EEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc--------ccccccCCHHHHHH Q lcl|NC_019506. 74 KVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSK--------LKPAATLDKTNIYE 145 (276) Q Consensus 74 ~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~--------~~~~~~~t~~~~~~ 145 (276) +++++.+. +.-+.|+++ ..+...++...+.++.+.++++.+|+.++..-.++.... .......++...++ T Consensus 179 ~~~~~~k~-~~~~~is~e-ll~d~~~l~~~i~~~la~a~~~~~d~~~l~G~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~d 256 (385) T protein:vir:18 179 QTANVKTI-AHWVQASRQ-VMDDAPMLQSYINNRLMYGLALKEEGQLLNGDGTGDNLEGLNKVATAYDTSLNATGDTRAD 256 (385) T ss_pred EEEeeeeE-EEeehhhHH-HHhhHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCcccccccccccccccccccccchHH Confidence 77777664 444677875 444556788888999999999999998885432222110 11111223334578 Q ss_pred HHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEeccccccccceEEEEE Q lcl|NC_019506. 146 ELIKVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSNNMGSLTNGTGAIAG 225 (276) Q Consensus 146 ~i~~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp~~~~~~~~~~~ 225 (276) .|.++...|...... .-.+++||..+..|.+...- ...+... ....|..++++|.+|+.++.+|. +...+.. T Consensus 257 ~i~~~~~~l~~~~~~--~~~~~~~~~~~~~l~~lkd~--~G~~l~~-~~~~~~~~~l~G~pV~~~~~~p~---~~~~~gd 328 (385) T protein:vir:18 257 IIAHAIYQVTESEFS--ASGIVLNPRDWHNIALLKDN--EGRYIFG-GPQAFTSNIMWGLPVVPTKAQAA---GTFTVGG 328 (385) T ss_pred HHHHHHHhhccccCC--CCEEEEcHHHHHHHHHhhcC--CCceecc-CcccCCCceecceeeEEcCcCCC---CcEEEee Confidence 888888888766543 33689999999998764321 1111121 13466678899999999999984 2333333 Q ss_pred ecceEEeeee-eeeeeeccCc-ccc---eeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 226 VKMACTFAEQ-IVQTEAYRME-KRF---ADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 226 ~~~a~~~~~~-~~~~e~~~~~-~~~---~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) .+.++....+ ...++..+.. +.| ...+++..++|+.+.+|+++++++.++= T Consensus 329 ~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~aa 384 (385) T protein:vir:18 329 FDMASQVWDRMDATVEVSREDRDNFVKNMLTILCEERLALAHYRPTAIIKGTFSSG 384 (385) T ss_pred cccEEEEEEecceEEEEeccccchhhcCcEEEEEEEeeccEEecccceEEEEeccC Confidence 3444444432 2233332221 112 2467888999999999999999986666 No 108 >protein:vir:97053 Length: 390 # NCBI annotation: putative head protein # Family: family:all:585 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453565;genbank:gi:84662600;genbank:GeneID:5142468 Probab=99.51 E-value=1.6e-14 Score=96.29 Aligned_cols=258 Identities=12% Similarity=0.065 Sum_probs=165.4 Q ss_pred Cc-c-----chhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCc--ccceeecCCCCCCCccccccc Q lcl|NC_019506. 1 MA-V-----TSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGA--ITVKEYTENSDIDAPEELSTT 72 (276) Q Consensus 1 MA-~-----~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~--~~~~d~~~~~~~~~~~~~~~~ 72 (276) |. . -.++|+.+...+.+.+++...+.+++..- + ..+.++++|.... ..+..+.+++.... .+++.. T Consensus 113 ~~~~~~~~~g~lip~~~~~~ii~~~~~~~~i~~~~~~~----~-~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~-~~~~~~ 186 (390) T protein:vir:97 113 ASTDAAGSAGALTTPNRLPGFITPPDARLTVRDLIGSG----R-TDSALIEYVQETGFVNNAAIVAEGALKPE-SSLKFA 186 (390) T ss_pred hhcccccccccccchhhhHHHHHHHhhhhhhHhhccee----e-ccCCceEEEEEecCCcceeeecCCccccc-ccccee Confidence 11 1 13677788888999999988888887532 1 2356788887643 34666677766543 456777 Q ss_pred eEEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccc--------cccccccCCHHHHH Q lcl|NC_019506. 73 EKVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATS--------KLKPAATLDKTNIY 144 (276) Q Consensus 73 ~~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~--------~~~~~~~~t~~~~~ 144 (276) .+++.+.+. +.-+.|+++-. ....++.+.+.++.++++++++|+.++..-.++... ........++...+ T Consensus 187 ~i~~~~~k~-~~~~~is~ell-~ds~~l~~~i~~~la~a~~~~~d~a~l~G~g~~~~p~Gi~~~~~~~~~~~~~~~~~~~ 264 (390) T protein:vir:97 187 KKTDTTHVI-AHTMKATRQIL-SDAPQLASYMNNRLIRGLKVKEDAEILRGTGANDGLLGLIPQATTYAAPTTIAGATRV 264 (390) T ss_pred EEEEeeeeE-EEeehhhHHHH-HhHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCccccceeeccccccccccccccchH Confidence 888888664 44567887644 444678888888999999999999888542221110 01111222344457 Q ss_pred HHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEeccccccccceEEEE Q lcl|NC_019506. 145 EELIKVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSNNMGSLTNGTGAIA 224 (276) Q Consensus 145 ~~i~~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp~~~~~~~~~~ 224 (276) +.+.++...+.....+.. .+++||..+..|.+...-. ..+.... ...|..++++|.+|+.++.+|. +...+. T Consensus 265 d~~~~~~~~~~~~~~~~~--~~v~n~~~~~~L~~lkd~~--G~~l~~~-~~~~~~~~l~G~pV~~~~~~~~---~~~~~g 336 (390) T protein:vir:97 265 DQLRLAMLQASLAEYPAS--GIVINPIDWAAIELAKDAN--NQYLIGN-ARGTLTPTLWGLPVVATQAMAP---GEFLVG 336 (390) T ss_pred HHHHHHHHhhccccCCCC--EEEEcHHHHHHHHHhhcCC--CceeecC-ccCCCCceecceeeEEcCCCCC---CcEEEE Confidence 788888888887776433 5889999999987543111 1111111 2345557899999999999984 233333 Q ss_pred EecceEEeee-eeeeeeeccCcccc-e--eeEEeeeeeeeEEEcCCeEEEEEec Q lcl|NC_019506. 225 GVKMACTFAE-QIVQTEAYRMEKRF-A--DAVKGLNVFGCKVIYPDALVCLKKT 274 (276) Q Consensus 225 ~~~~a~~~~~-~~~~~e~~~~~~~~-~--~~i~~~~~yg~~v~~~~~vv~~~~~ 274 (276) ..+.++.+.. ....++..+....| . ..+++..+||..+++|++++++.-+ T Consensus 337 d~~~~~~~~~~~~~~i~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~v~~~~a 390 (390) T protein:vir:97 337 AFDLAAQIFDQWDARVEIGYVNDDFQRNMVTVLAEERLALVVYRPEALITGSFA 390 (390) T ss_pred eccceEEEEEecceEEEEeecccccccCcEEEEEEEeeccEEeccccEEEEEeC Confidence 3344443332 33344444433222 2 3577889999999999999999988 No 109 >protein:vir:4953 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049929;genbank:gi:9632900;genbank:GeneID:1262076 Probab=99.51 E-value=2.9e-14 Score=94.79 Aligned_cols=260 Identities=13% Similarity=0.058 Sum_probs=168.8 Q ss_pred Ccc------chhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCc--ccceeecCCCCCCCccccccc Q lcl|NC_019506. 1 MAV------TSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGA--ITVKEYTENSDIDAPEELSTT 72 (276) Q Consensus 1 MA~------~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~--~~~~d~~~~~~~~~~~~~~~~ 72 (276) |+. .+++|+.|...+++.+++..++.++++.-.. ....|+ +.+|.... ..+..+.++........++.. T Consensus 109 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~--~~~~~~-~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~ 185 (397) T protein:vir:49 109 KTDASGSDAGLTIPQDIQTAIHTLVSQYDSLQEYVNVENV--TTLTGS-RVYEKWTDITGLANIDDEAGKIADVDDPKLS 185 (397) T ss_pred hhccccccCcccccHhHHHHHHHHHHhhhhHHhhhceeec--ccCccc-eEEEeeccCCcceeeecCcccccccccccee Confidence 332 2478999999999999999999888754211 112233 44554433 335666666665433456777 Q ss_pred eEEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccCCHHHHHHHHHHHHH Q lcl|NC_019506. 73 EKVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSKLKPAATLDKTNIYEELIKVKV 152 (276) Q Consensus 73 ~~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~~ 152 (276) .+++.+.+. +.-+.|+++-...+..++...+.++.++++++.+|..++......... +.. ..++.|.++.. T Consensus 186 ~i~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~ai~~G~g~~~~~----~~~----~~~d~i~~~~~ 256 (397) T protein:vir:49 186 LIKYTIKRY-AGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILEAIAALPTK----PTL----TKWDDIIDLEA 256 (397) T ss_pred eEEeeeeeE-EeeehhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc----ccc----ccHHHHHHHHH Confidence 888888654 455678887776777899999999999999999999988765443221 111 22567777877 Q ss_pred HHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEe--ccccccccceEEEEE--ecc Q lcl|NC_019506. 153 KLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLS--NNMGSLTNGTGAIAG--VKM 228 (276) Q Consensus 153 ~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s--~~lp~~~~~~~~~~~--~~~ 228 (276) .+.....+ +-.+++||..+..|.+...- ...+.....+..|.-++++|++|+.. ..+|..+.+...+.+ -+. T Consensus 257 ~l~~~~~~--~a~~vmn~~~~~~l~~lkd~--~G~~l~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~i~~gd~~~ 332 (397) T protein:vir:49 257 KVDPAIKQ--TSFFLTNTSGFTALKKVKNA--LGDYLMERDVKSPTGYSIDGFAVKEVADRWLANGTGGAMPLYFGDLKQ 332 (397) T ss_pred hhhhhhcC--CCEEEEcHHHHHHHHHhhcC--CCceeeccCcCCCCCceecceeeEEecccccccccCCceeEEEeeccc Confidence 77766543 34789999999999765321 11222233355677789999999874 446655544433332 233 Q ss_pred eEEee-eeeeeeeeccCc----ccceeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 229 ACTFA-EQIVQTEAYRME----KRFADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 229 a~~~~-~~~~~~e~~~~~----~~~~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) ++... .....++..+.. ......+++..++|+++++|++++.++.++. T Consensus 333 ~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~ 385 (397) T protein:vir:49 333 AVTLFDRQHMSLLSTNIGGGAFETDTTKVRVIDRFDVVATDTEAFVPASFKAI 385 (397) T ss_pred eEEEEeecceEEEEeccccchhhcCceeEEEEeeeCcEEecccceEEEEeecc Confidence 44333 233334333221 1223578899999999999999999997665 No 110 >protein:vir:1025 Length: 408 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076679;genbank:gi:13095788;genbank:GeneID:920362 Probab=99.50 E-value=3.1e-14 Score=94.65 Aligned_cols=260 Identities=10% Similarity=0.019 Sum_probs=161.9 Q ss_pred Ccc-chhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCc--ccceeecCCCCCCCccccccceEEEE Q lcl|NC_019506. 1 MAV-TSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGA--ITVKEYTENSDIDAPEELSTTEKVLE 77 (276) Q Consensus 1 MA~-~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~--~~~~d~~~~~~~~~~~~~~~~~~~~~ 77 (276) .+. ..++|+.|+.++++.+++...+.++++.-.. .... -++.+++... ..+..+.++......+..+...+++. T Consensus 121 ~~~gg~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~--~~~~-~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~i~~~ 197 (408) T protein:vir:10 121 DSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESV--STSN-GSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYL 197 (408) T ss_pred ccCCceeccHhHHHHHHHHHHhhchhhhhcceeec--cCCc-ceEEEeeccccccceeeecCccccccccCcceeeEEee Confidence 111 2478999999999999999999888754211 1111 2344554433 23445556655443334566777777 Q ss_pred EEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccCCHHHHHHHHHHHH-HHHhh Q lcl|NC_019506. 78 INKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSKLKPAATLDKTNIYEELIKVK-VKLDE 156 (276) Q Consensus 78 ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~-~~l~~ 156 (276) +.+. +.-+.|+++-...+..++...+.+..+++++..+|+.++....++... ++..+ ++.+..+. ..++. T Consensus 198 ~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~~~~~----~~~~~----~~~l~~~~~~~~~~ 268 (408) T protein:vir:10 198 IKRY-AGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVMKAAPKK----PTIAK----FDDVITMINTAVDP 268 (408) T ss_pred eeeE-EeeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc----ccccc----HHHHHHHHHHhhhh Confidence 7655 445678887777778899999999999999999999988766543321 11122 44555443 33433 Q ss_pred cCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEec--cccccccceEEEE-Ee-cceEEe Q lcl|NC_019506. 157 KNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSN--NMGSLTNGTGAIA-GV-KMACTF 232 (276) Q Consensus 157 ~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~--~lp~~~~~~~~~~-~~-~~a~~~ 232 (276) .. ..+-.+++||..+..|.+...- ...+.....+.+|..++++|++|+.++ .+|..+.+...+. +. +.++.. T Consensus 269 ~~--~~~a~~v~n~~~~~~l~~lkd~--~G~~i~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~i~~gd~~~~~~~ 344 (408) T protein:vir:10 269 AI--IATSSLLTNQSGLNKLALVKTA--EGKYLLEPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITL 344 (408) T ss_pred hh--ccCCEEEEcHHHHHHHHHhhcc--CCceEeccCcCCCCCceecceeeEEecccccCccCCCceEEEEEehhccEEE Confidence 22 2334689999999999775322 222333334566777899999998854 4665444333322 22 333433 Q ss_pred e-eeeeeeeeccCcc----cceeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 233 A-EQIVQTEAYRMEK----RFADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 233 ~-~~~~~~e~~~~~~----~~~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) . +....++..+... .....+++..++|+++++|++++.++.+++ T Consensus 345 ~~~~~~~v~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~~~~~~~ 393 (408) T protein:vir:10 345 FDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFSAI 393 (408) T ss_pred EEecceEEEEcccccchhhcCceEEEEEEeeccEEeccccEEEEEeecc Confidence 3 2223343333221 224678899999999999999999996664 No 111 >protein:vir:81070 Length: 390 # NCBI annotation: p09 # Family: family:all:585 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285679;genbank:gi:148727187;genbank:GeneID:5247115 Probab=99.50 E-value=2.6e-14 Score=95.04 Aligned_cols=258 Identities=11% Similarity=0.038 Sum_probs=163.9 Q ss_pred Cc-c-----chhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCc--ccceeecCCCCCCCccccccc Q lcl|NC_019506. 1 MA-V-----TSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGA--ITVKEYTENSDIDAPEELSTT 72 (276) Q Consensus 1 MA-~-----~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~--~~~~d~~~~~~~~~~~~~~~~ 72 (276) +. . ..++|+.+...+.+.+++.+.+.+++..- + ..+.++++|+... ..+..+.+++.... .+++.+ T Consensus 113 ~~~~~~~~~g~~~~~~~~~~ii~~~~~~~~l~~~~~~~----~-~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~-~~~~~~ 186 (390) T protein:vir:81 113 ASTDAAGSAGALTTPNRLPGFITPPDARLTVRDLIGSG----R-TDSALIEYVQETGFVNNAAIVAEGALKPE-SSLKFA 186 (390) T ss_pred hccccccCCcceechhhhHHHHHHHhhhhhhhhhccee----e-ccCCceEEEEEecCCcceeeecCCccccc-ccceee Confidence 11 1 12556667788999999999998887542 1 2356788887643 34556667766543 456777 Q ss_pred eEEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc--------ccccccCCHHHHH Q lcl|NC_019506. 73 EKVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSK--------LKPAATLDKTNIY 144 (276) Q Consensus 73 ~~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~--------~~~~~~~t~~~~~ 144 (276) .+++.+.+. +.-+.|+++-. +...++...+.++.+.++++++|..++..-.++.... .......+....+ T Consensus 187 ~i~~~~~k~-~~~~~is~ell-~d~~~~~~~i~~~l~~~~~~~~d~a~l~G~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~ 264 (390) T protein:vir:81 187 KKTDTTHVI-AHTMKATRQIL-SDAPQLASYMNNRLIRGLKVKEDAEILRGTGANDGLLGLIPQATTYAAPTTIAGATRV 264 (390) T ss_pred EEEEeeeEE-EEeehhhHHHH-HhHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCcccceeecccccccccccccchhH Confidence 788888665 44567777644 4446788888889999999999998885432221110 0111122333457 Q ss_pred HHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEeccccccccceEEEE Q lcl|NC_019506. 145 EELIKVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSNNMGSLTNGTGAIA 224 (276) Q Consensus 145 ~~i~~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp~~~~~~~~~~ 224 (276) +.|..+...+...+.+.. .+++||..+..|.+...- ...+.... ...|..++++|.+|+.++.+|.. ...+. T Consensus 265 ~~~~~~~~~~~~~~~~~~--~~v~~~~~~~~l~~lkd~--~G~~l~~~-~~~~~~~~l~G~pv~~~~~~p~~---~~~~g 336 (390) T protein:vir:81 265 DQLRLAMLQASLAEYNPS--GIVINPIDWAAIELAKDA--NNQYLIGN-ARGTLTPTLWGLPVVATQAMAPG---EFLVG 336 (390) T ss_pred HHHHHHHHhhccccCCCC--EEEEcHHHHHHHHHhhcC--CCceeecC-cccccCceecceeeEEcCCCCCC---cEEEE Confidence 788888888887776433 588999999988764311 11111121 22444568999999999999843 33333 Q ss_pred EecceEEeee-eeeeeeeccCcccce---eeEEeeeeeeeEEEcCCeEEEEEec Q lcl|NC_019506. 225 GVKMACTFAE-QIVQTEAYRMEKRFA---DAVKGLNVFGCKVIYPDALVCLKKT 274 (276) Q Consensus 225 ~~~~a~~~~~-~~~~~e~~~~~~~~~---~~i~~~~~yg~~v~~~~~vv~~~~~ 274 (276) ..+.++.+.. ....++..+....|. ..+++..++|.++++|+++++++.+ T Consensus 337 d~~~~~~~~~~~~~~v~~~~~~~~~~~~~v~~r~~~r~d~~v~~~~a~v~~t~a 390 (390) T protein:vir:81 337 AFDLAAQIFDQWDARVEIGYVGEDFQRNMITVLAEERLALVVYRPEALISGSFA 390 (390) T ss_pred ehhceEEEEEecceEEEEecccchhhcCcEEEEEEEeeccEEecccceEEEEeC Confidence 3344444333 223444443333332 4678899999999999999999999 No 112 >protein:vir:4997 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049971;genbank:gi:9632943;genbank:GeneID:1262106 Probab=99.49 E-value=3.7e-14 Score=94.24 Aligned_cols=260 Identities=13% Similarity=0.058 Sum_probs=165.7 Q ss_pred Ccc------chhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCcc--cceeecCCCCCCCccccccc Q lcl|NC_019506. 1 MAV------TSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGAI--TVKEYTENSDIDAPEELSTT 72 (276) Q Consensus 1 MA~------~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~~--~~~d~~~~~~~~~~~~~~~~ 72 (276) |+. ..++|+.|...+++.+++..++..++..-.. .... -++.+|+.... .+....++.........+.. T Consensus 109 ~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~--~~~~-~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~ 185 (397) T protein:vir:49 109 KTDGSGSDAGLTIPQDIRTAINTLVRQFDSLQEYVNVENV--TTLT-GSRVYEKWADITGLAKLDDEGGQIGQNDDPKLS 185 (397) T ss_pred hhccCCccCcceecHHHHHHHHHHHHhhhhHhhhcceeec--cCCc-ceEEEEeeccCCcceeeecccccccccccccee Confidence 332 2578999999999999999998888754211 1111 23556654332 34445555554332334567 Q ss_pred eEEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccCCHHHHHHHHHHHHH Q lcl|NC_019506. 73 EKVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSKLKPAATLDKTNIYEELIKVKV 152 (276) Q Consensus 73 ~~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~~ 152 (276) .+++.+.+. +.-+.|+++-...+..++...+.+..++++++.+|..++....++.. .+....++.|.++.. T Consensus 186 ~v~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~ail~G~g~~~~--------~~~~~~~d~i~~~~~ 256 (397) T protein:vir:49 186 LIRYAIKRY-AGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILEAIGTLPN--------KPTLAKWDDIIDLQA 256 (397) T ss_pred eeEeeeeee-EeehhhHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhccccccc--------cccccCHHHHHHHHH Confidence 777777654 44467888777677789999999999999999999998865443221 111123667788888 Q ss_pred HHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEe--ccccccccceEEEE--Eecc Q lcl|NC_019506. 153 KLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLS--NNMGSLTNGTGAIA--GVKM 228 (276) Q Consensus 153 ~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s--~~lp~~~~~~~~~~--~~~~ 228 (276) .+.....+ +-.+++||..+..|.+...- ...+.....+..|.-++++|++|+.+ ..+|..+.+...+. ..+. T Consensus 257 ~l~~~~~~--~a~~v~n~~~~~~l~~lkd~--~g~~l~~~~~~~g~~~~l~G~pV~~~~~~~~~~~~~~~~~~~~gd~~~ 332 (397) T protein:vir:49 257 KVDPAIKQ--TSLFLTNTSGFTALKKVKNA--MGDYLMERDVKSPTGYSIDGFVVKEISDRFLPNGTGGAMPLYFGDLKQ 332 (397) T ss_pred hhhhhhcC--CCEEEEcHHHHHHHHHhhcc--CCceeecccccCCCCceecceeeEEecccccccccCCceeEEEeeccc Confidence 88776654 34789999999998764321 11222223345676778999999874 34565444433332 2344 Q ss_pred eEEeee-eeeeeeeccCc----ccceeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 229 ACTFAE-QIVQTEAYRME----KRFADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 229 a~~~~~-~~~~~e~~~~~----~~~~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) ++.+.. ....++..+.. ......+++..++|.++++|++++.++.+++ T Consensus 333 ~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~d~~~~~~~a~~~~~~~~~ 385 (397) T protein:vir:49 333 AVTLFDRQHLSLLSTNIGGGAFETDTTKVRVIDRFDVVSTDTEAFVPASFKAI 385 (397) T ss_pred eEEEEeecccEEEEeccccchhhcCeeeEEEEEeeccEEecccceEEEEeccc Confidence 444442 22234333221 1223578899999999999999999998887 No 113 >protein:vir:2430 Length: 318 # NCBI annotation: major head subunit # Family: family:all:507 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046832;genbank:gi:9630400;genbank:GeneID:1261582 Probab=99.49 E-value=3.7e-14 Score=94.24 Aligned_cols=263 Identities=13% Similarity=0.029 Sum_probs=160.4 Q ss_pred Cccc------hhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCc-ccceeecCCCCCCCccccccce Q lcl|NC_019506. 1 MAVT------SFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGA-ITVKEYTENSDIDAPEELSTTE 73 (276) Q Consensus 1 MA~~------~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~-~~~~d~~~~~~~~~~~~~~~~~ 73 (276) |++. .++|+.+..++++.+++..++.+++..- + ..+.+++||+... ..+..+.++..... .+++.++ T Consensus 14 ~~~~~~~~~~~~ip~~~~~~ii~~~~~~~~l~~~~~~~----~-~~~~~~~ip~~~~~~~a~~v~Eg~~~~~-~~~~f~~ 87 (318) T protein:vir:24 14 IAQTGDTMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKV----P-MGTTGQKIPHWVGDVSAQWIGEGDMKPI-TKGNMTS 87 (318) T ss_pred hhcccCcccceeechhHHHHHHHHHHhhchhhhhccee----e-ccCCceEEEEEeCCcceEEecCCccccc-cccceeE Confidence 4432 3679999999999999999999987542 1 2356788887654 45677777776654 4567777 Q ss_pred EEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccc--------ccccccccCCHHHHHH Q lcl|NC_019506. 74 KVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNAT--------SKLKPAATLDKTNIYE 145 (276) Q Consensus 74 ~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~--------~~~~~~~~~t~~~~~~ 145 (276) ++++..+. +.-+.++++-...+..++.+.+.+..++++++++|+.++..-.+... ...............+ T Consensus 88 i~~~~~k~-~~~~~iS~e~l~ds~~~~~~~i~~~l~~~~~~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 166 (318) T protein:vir:24 88 QTIAPHKI-ATIFVASAETVRANPANYLGTMRTKVATAFAMAFDGAAMHGTDSPFPTYIGQTTKAISIADTTGATTVYDQ 166 (318) T ss_pred EEEeeEEE-EEeehhhHHHhhcChHHHHHHHHHHHHHHHHHHHHHhhhcccCCCCCcccccccccccccccccccchHHH Confidence 77777553 44567888776677789999999999999999999998854332111 0011111111122223 Q ss_pred HHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeee-----eeeEEeceEEEEeccccccccce Q lcl|NC_019506. 146 ELIKVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNG-----FVGTILGFDVYLSNNMGSLTNGT 220 (276) Q Consensus 146 ~i~~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G-----~i~~~~G~~v~~s~~lp~~~~~~ 220 (276) .+.++...+.... ...-.+++||..+..|.+...-. ..+........| .-+++.|++++.++.+|... . T Consensus 167 ~~~~~~~~~~~~~--~~~~~~v~n~~~~~~L~~lkd~~--G~~l~~~~~~~~~~~~~~~~~i~g~pv~~~~~~~~~~--~ 240 (318) T protein:vir:24 167 VAVNGLSLLVNDG--KKWTHTLLDDITEPILNGAKDQN--GRPLFIESTYGEAASPFRSGRIVARPTILSDHVVEGT--T 240 (318) T ss_pred HHHHHHHhhcccc--CCCCEEEEcHHHHHHHHHhhccC--CceeecCccccCccccccCceEEEEeeEEeCCCCCCc--c Confidence 4455555554443 33447899999999997642211 111111111111 12478999999999887522 2 Q ss_pred EEEEEecceEEeee-eeeeeeeccCc-------------ccc---eeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 221 GAIAGVKMACTFAE-QIVQTEAYRME-------------KRF---ADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 221 ~~~~~~~~a~~~~~-~~~~~e~~~~~-------------~~~---~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) ..+.+.-+.+.+.. +...++..++. +.| ...+++..++|.++++|+++++|+..+. T Consensus 241 ~~~~gdfs~~~~~~~~~l~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~i~~~~a 313 (318) T protein:vir:24 241 VGFMGDFSQLIWGQIGGLSFDVTDQATLNLGTVESPNFVSLWQHNLVAVRVEAEYAFHCNDAEAFVALTNVVS 313 (318) T ss_pred EEEEeecceEEEEEecCeEEEEeeccceeccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEEeecc Confidence 22232222222221 11122222211 011 2578899999999999999999997666 No 114 >protein:vir:99920 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655524;genbank:gi:109392294;genbank:GeneID:4157089 Probab=99.49 E-value=2.2e-14 Score=95.44 Aligned_cols=266 Identities=14% Similarity=0.047 Sum_probs=161.9 Q ss_pred Cccc-----hhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEecc-CcccceeecCCCCCCCccccccceE Q lcl|NC_019506. 1 MAVT-----SFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQI-GAITVKEYTENSDIDAPEELSTTEK 74 (276) Q Consensus 1 MA~~-----~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~-~~~~~~d~~~~~~~~~~~~~~~~~~ 74 (276) ||.. .++|+.++.++++.+++.+++..++.+- + ..+..++||+. +...+..+.++..... .+++..++ T Consensus 1 Mat~tt~~g~~vP~~~~~~ii~~~~~~s~l~~~~~~i----~-~~~~~~~~p~~~~~~~a~wv~Eg~~~~~-~~~~f~~v 74 (311) T protein:vir:99 1 MATFGTGNLKNLPRNIADGMVKDVVQGSTVAVLSARK----P-QRFGNEDIITFNGRPKAEFVGEGQQKSS-TTGEFDFV 74 (311) T ss_pred CceecCCCceeccHHHHHHHHHHHHhhchhhhhccee----e-ccCCceEEEEEeCCceeEEeecCccccc-ccceeeEE Confidence 9963 4789999999999999999998887542 1 12345788887 4456777777776654 45677777 Q ss_pred EEEEEeeeecceeechHHHH---hhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccc--c--------ccc---cccC Q lcl|NC_019506. 75 VLEINKQKYFNFQIDDVDAA---QIRTPLMDAAMQRAAYALADETEKILLKEMDTNATS--K--------LKP---AATL 138 (276) Q Consensus 75 ~~~ld~~~~~~~~v~d~d~~---~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~--~--------~~~---~~~~ 138 (276) ++...+. +.-+.|+++-.. .+..++.+.+.++.++++++++|+.++......... . .+. .+.. T Consensus 75 ~l~~~k~-~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~g~~~g~~~~g~~~~~~~~~~~~~~~~~ 153 (311) T protein:vir:99 75 TSTPKKA-QVTMRFNEEVQWADEDYQLGVLQTLSEAGAEALARALDLGLYHRINPLTGTVIPGWSNYLGAASKRVELTAD 153 (311) T ss_pred EEeeEEE-EEeehhhHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccCccccccccccccccceeecccc Confidence 7776543 444677777553 345788999999999999999999988654322110 0 000 1111 Q ss_pred CHHHHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEecccccccc Q lcl|NC_019506. 139 DKTNIYEELIKVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSNNMGSLTN 218 (276) Q Consensus 139 t~~~~~~~i~~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp~~~~ 218 (276) +.......+..+...+.........-.+++||..+..|.+... ....+........+..++++|++++.++.+|.... T Consensus 154 ~~~~~~~~i~~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd--~~G~~l~~~~~~~~~~~~l~G~Pv~~s~~i~~~~~ 231 (311) T protein:vir:99 154 TIANPDLAIEAAVGLLVANGHPTPVNGLALHPSIAWGLSTARY--TDGRKKFPELGLGIGVSSFEGIDASVSDTVNGGDE 231 (311) T ss_pred ccchhHHHHHHHHHHHhhhccCCCccEEEEcHHHHHHHHhhhc--cCCCeeecCcccCCCCceecceeeEeecccccccc Confidence 2222344556666655554432221138999999999976432 11222333444456678899999999998874321 Q ss_pred c------------eEEEEEe-cceEEee-eeeeeeeecc--Ccc----cc---eeeEEeeeeeeeEEEcCCeEEEEEecC Q lcl|NC_019506. 219 G------------TGAIAGV-KMACTFA-EQIVQTEAYR--MEK----RF---ADAVKGLNVFGCKVIYPDALVCLKKTN 275 (276) Q Consensus 219 ~------------~~~~~~~-~~a~~~~-~~~~~~e~~~--~~~----~~---~~~i~~~~~yg~~v~~~~~vv~~~~~~ 275 (276) . ..++.+- ...+.+. .+...++..+ +.+ .| -..+++..++|..|++|+.+++.+++| T Consensus 232 ~~~~~~~~~~~~~~~~~~Gdf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~r~~~r~d~~v~~~~~v~~~~~~A 311 (311) T protein:vir:99 232 ADPDDEDLDAARAVRGIVGDFANGIHWGVQRDIPVELIKYGDPDGQGDLKRHNQIALRLEIVYGWYVFTDRFVVIENAVA 311 (311) T ss_pred cccccchhhccCcceEEEeeccccEEEEEecCceEEEeecCCCCcchhhhhcCcEEEEEEEeecceecChhHeeeecccC Confidence 1 1111111 1112121 1111222111 111 11 135788899999999999888888888 No 115 >protein:vir:4830 Length: 397 # NCBI annotation: MPL-7201 # Family: family:all:21 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038327;genbank:gi:9634653;genbank:GeneID:1262632 Probab=99.47 E-value=4.8e-14 Score=93.58 Aligned_cols=260 Identities=14% Similarity=0.080 Sum_probs=164.7 Q ss_pred Ccc------chhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCc--ccceeecCCCCCCCccccccc Q lcl|NC_019506. 1 MAV------TSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGA--ITVKEYTENSDIDAPEELSTT 72 (276) Q Consensus 1 MA~------~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~--~~~~d~~~~~~~~~~~~~~~~ 72 (276) |+. -.++|+.|...+++.+++.+++.++++.-.. ....|+.. ++.... ..+....++........++.. T Consensus 109 ~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~--~~~~~~~~-~~~~~~~~~~a~~v~E~~~~~~~~~~~~~ 185 (397) T protein:vir:48 109 KTDASGSDAGLTIPQDIQTAIHTLVRQYDSLQEYVNVENV--TTLTGSRV-YEKWADITGLAKLDDEAGSIGTNDDPKLY 185 (397) T ss_pred hhccCCccccccccHHHHHHHHHHHHHHHHHHhhhceeec--cCCcceEE-EEeecCCCcceeeecccccccccccccee Confidence 322 2478999999999999999999888754211 12223332 332222 224455566555433446677 Q ss_pred eEEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccCCHHHHHHHHHHHHH Q lcl|NC_019506. 73 EKVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSKLKPAATLDKTNIYEELIKVKV 152 (276) Q Consensus 73 ~~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~~ 152 (276) .+++.+.+. +.-+.|+++-...+..++...+.++.++++++.+|..++....+... .++...++.|.++.. T Consensus 186 ~v~~~~~k~-~~~~~iS~ell~ds~~~l~~~v~~~l~~~~~~~~d~~il~G~g~~~~--------~~~~~~~d~i~~~~~ 256 (397) T protein:vir:48 186 PIRYAIKRY-AGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILEAIATLPT--------KPTLTKWDDIIDLQA 256 (397) T ss_pred eEEeeheee-eeehhhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccccc--------ccccccHHHHHHHHH Confidence 778877654 44578888877777889999999999999999999998865433221 122233677888888 Q ss_pred HHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEec--cccccccceEE-EEEe-cc Q lcl|NC_019506. 153 KLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSN--NMGSLTNGTGA-IAGV-KM 228 (276) Q Consensus 153 ~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~--~lp~~~~~~~~-~~~~-~~ 228 (276) .|.....+ +-.+++||..+..|.+...-. ..+.....+..|.-++++|++|+.+. .+|..+.+... +.+. +. T Consensus 257 ~l~~~~~~--~a~~v~n~~~~~~L~~lkd~~--G~~i~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~~~~gd~~~ 332 (397) T protein:vir:48 257 KVDPAIKQ--TSFFLTNTSGFTALKKVKNAF--GDYLMERDVKSPTGYSIDGFAVKEVADRWLANASSGAMPLYFGDLKQ 332 (397) T ss_pred HhhhhhcC--CCEEEECHHHHHHHHHhhcCC--CceeeccCcCCCCCceeccceeEEecccccCCcCCCceEEEEEeccc Confidence 88766543 346889999999997643211 12222334556777899999998754 34543433332 2222 33 Q ss_pred eEEeee-eeeeeeeccCc----ccceeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 229 ACTFAE-QIVQTEAYRME----KRFADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 229 a~~~~~-~~~~~e~~~~~----~~~~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) ++.... ....++..+.. ..-...+++..++|..+++|++++.++.++. T Consensus 333 ~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~ 385 (397) T protein:vir:48 333 AVTLFDRQQMSLLSTNIGGGAFETDTTKIRVIDRFDVVATDTESFVPASFKAI 385 (397) T ss_pred eEEEEeecceEEEEeccchhhhhcCceeEEEEeeeccEEecccceEEEEeccc Confidence 443332 22234333322 2223578899999999999999999986655 No 116 >protein:vir:9704 Length: 394 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795466;genbank:gi:28876225;genbank:GeneID:1257769 Probab=99.47 E-value=3.8e-14 Score=94.15 Aligned_cols=254 Identities=9% Similarity=-0.003 Sum_probs=156.6 Q ss_pred Ccc-chhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEecc--CcccceeecCCCCCCCccccccceEEEE Q lcl|NC_019506. 1 MAV-TSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQI--GAITVKEYTENSDIDAPEELSTTEKVLE 77 (276) Q Consensus 1 MA~-~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~--~~~~~~d~~~~~~~~~~~~~~~~~~~~~ 77 (276) -++ ..++|+.|...+.+.+++..++.++++.- + ..+.+.++|.. +..++..+.++........++...+++. T Consensus 133 ~~~gg~liP~~~~~~ii~~~~~~~~l~~~~~~~----~-~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~v~l~ 207 (394) T protein:vir:97 133 KENAKPVSSEEILYTPAREVKTVVDLKPFTTVY----Q-AKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWN 207 (394) T ss_pred cccccccChHHHHHHHHHHhhhhhhhhhhceee----e-ccCcceEEEEEecCCCccceecccccccccccccceeEEee Confidence 111 14789999999999999989898887531 1 12334666655 3345666666655433344666777777 Q ss_pred EEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccCCHHHHHHHHHHHHHHHhhc Q lcl|NC_019506. 78 INKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSKLKPAATLDKTNIYEELIKVKVKLDEK 157 (276) Q Consensus 78 ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~~~l~~~ 157 (276) ..+. +.-+.|+++-...+..++...+.+..++++++..|..++....+.+ +.+..+ ++.+..+....-. T Consensus 208 ~~k~-~~~i~is~ell~ds~~~~~~~i~~~la~~~~~~~~~~i~~g~~~~~-----~~~~~~----~~~~~~~~~~~~~- 276 (394) T protein:vir:97 208 IDTY-RGAIPLSQESIDDADVDLVGIVSESISQIKVNTTNDAIAKVLKSFT-----TKTVKN----LDEIKALLNGGFD- 276 (394) T ss_pred hhhe-eeehhhHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhhcccccc-----cccccc----HHHHHHHHHhhhh- Confidence 7554 4456788877767778899999999999999999998887554322 122222 3444443322211 Q ss_pred CCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEeccccccccceEEEEEecceE-Eeeeee Q lcl|NC_019506. 158 NVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSNNMGSLTNGTGAIAGVKMAC-TFAEQI 236 (276) Q Consensus 158 ~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp~~~~~~~~~~~~~~a~-~~~~~~ 236 (276) |..+-.+++||..+..|.....- ...+.....+..|.-++++|++|+.++...... +...+.....++ .+..+. T Consensus 277 --~~~~a~~v~n~~~~~~l~~lkd~--~G~~i~~~~~~~~~~~~l~G~pv~~~~~~~~~~-~~~~~gd~~~~~~~~~~~~ 351 (394) T protein:vir:97 277 --PAYNVSLIVSQSFYQTLDTLKDG--NGRYLLQDDITAVSGKVLLGKPVFVLSDEVLGA-NKAFIGDFKRGVLFADRKD 351 (394) T ss_pred --hhhCCEEEEcHHHHHHHHHhhcc--CCCeeeecCcCCCCCceeccceeEEecccccCC-ccEEEeeccccEEEEEecc Confidence 12234588999999998764211 122223334556666799999999865543222 122222122323 232333 Q ss_pred eeeeeccCcccceeeEEeeeeeeeEEEcCCeEEEEEec---CC Q lcl|NC_019506. 237 VQTEAYRMEKRFADAVKGLNVFGCKVIYPDALVCLKKT---NP 276 (276) Q Consensus 237 ~~~e~~~~~~~~~~~i~~~~~yg~~v~~~~~vv~~~~~---~p 276 (276) ..++. .+...+...+++..++|+++.+|++++.++.+ +| T Consensus 352 ~~~~~-~~~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~~~p 393 (394) T protein:vir:97 352 LGLRW-ADNEIYGQYLQAVLRFGVSKVDDKAGYYVTFTPEPLP 393 (394) T ss_pred eEEEE-ecccccceeEEEEEEEccEEecccceEEEEecccccC Confidence 33333 33445667889999999999999999998843 34 No 117 >protein:vir:4226 Length: 326 # NCBI annotation: observed 35.2Kd protein # Family: family:all:507 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039681;swissprot:sw:q05223;genbank:gi:9625447;uniprot:Q05223;genbank:GeneID:2942929 Probab=99.46 E-value=6.1e-14 Score=93.04 Aligned_cols=262 Identities=12% Similarity=0.021 Sum_probs=154.3 Q ss_pred Ccc---chhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCc-ccceeecCCCCCCCccccccceEEE Q lcl|NC_019506. 1 MAV---TSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGA-ITVKEYTENSDIDAPEELSTTEKVL 76 (276) Q Consensus 1 MA~---~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~-~~~~d~~~~~~~~~~~~~~~~~~~~ 76 (276) ... ..++|+.+..++.+.+++..++..++.+- + ..+.+.++|+... ..+..+.++..... .+++..++++ T Consensus 22 ~~~~~~g~~ip~~~~~~ii~~~~~~s~i~~~~~~~----~-~~~~~~~~p~~~~~~~a~~v~Eg~~~~~-~~~~f~~i~~ 95 (326) T protein:vir:42 22 TGDSMFEGYLEPEQAQDYFAEAEKISIVQQFAQKI----P-MGTTGQKIPHWTGDVSASWIGEGDMKPI-TKGNMTSQTI 95 (326) T ss_pred ccccCCcceechhhHHHHHHHHHhcchhhhhccee----e-ccCCceEEEEEeCCcceEEecCCccccc-cccceeEEEE Confidence 111 12678889999999999999888887542 1 2356788887654 44666677766654 4677777777 Q ss_pred EEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccc-----------ccccccccCCHHHHHH Q lcl|NC_019506. 77 EINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNAT-----------SKLKPAATLDKTNIYE 145 (276) Q Consensus 77 ~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~-----------~~~~~~~~~t~~~~~~ 145 (276) ...+. +.-+.|+++-...+..++.+.+.++..+++++++|+.++..-.+... .....++..+...... T Consensus 96 ~~~k~-~~~v~iS~ell~~s~~~~~~~i~~~l~~a~~~~~d~a~l~G~gs~~p~gi~~~~~~~~~~~~~~~~~~~~~~~~ 174 (326) T protein:vir:42 96 APHKI-ATIFVASAETVRANPANYLGTMRTKVATAFAMAFDNAAINGTDSPFPTFLAQTTKEVSLVDPDGTGSNADLTVY 174 (326) T ss_pred eeEEE-EEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCccccccccccccceeecccccccccchhH Confidence 77554 55678888777778889999999999999999999998853221110 0111111111111122 Q ss_pred H--HHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhccccccccee-----eeeeeEEeceEEEEecccccccc Q lcl|NC_019506. 146 E--LIKVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITK-----NGFVGTILGFDVYLSNNMGSLTN 218 (276) Q Consensus 146 ~--i~~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~-----~G~i~~~~G~~v~~s~~lp~~~~ 218 (276) . +..+...+... ...+-..+++|..+..|.+...-.. .+....... ....++++|++++.++.+|... T Consensus 175 ~~~~~~~~~~~~~~--~~~~a~~v~n~~~~~~L~~lkd~~G--~~l~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~- 249 (326) T protein:vir:42 175 DAVAVNALSLLVNA--GKKWTHTLLDDITEPILNGAKDKSG--RPLFIESTYTEENSPFRLGRIVARPTILSDHVASGT- 249 (326) T ss_pred HHHHHHHHhhhhhh--ccCccEEEEeHHHHHHHHHhhccCC--ceeeccccccCccccccCceeeeeeEEEcCCCCCCc- Confidence 2 22222222222 2344468899999999976432111 111111122 2234579999999999998522 Q ss_pred ceEEEEEe-cce-EEeeeeeeeeeeccC--------c-----ccc---eeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 219 GTGAIAGV-KMA-CTFAEQIVQTEAYRM--------E-----KRF---ADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 219 ~~~~~~~~-~~a-~~~~~~~~~~e~~~~--------~-----~~~---~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) ...+.+- ..+ ++.... ..++..++ + ..| -..+++..++|+++++|++++.|+..+= T Consensus 250 -~~~~~Gd~s~~~~~~~~~-~~v~~~~e~~~~~~~~~~~~~~~~~~~d~~~~r~~~~~d~~v~~~~a~~~l~~~~~ 323 (326) T protein:vir:42 250 -VVGYQGDFRQLVWGQVGG-LSFDVTDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYAFHCNDKDAFVKLTNVDA 323 (326) T ss_pred -eEEEEeecceEEEEEecc-eEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEecccceEEEeeccc Confidence 2222211 111 222211 12221111 1 112 2577899999999999999999874443 No 118 >protein:vir:10364 Length: 390 # NCBI annotation: head protein; major capsid subunit precursor # Family: family:all:585 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858956;genbank:gi:32128421;genbank:GeneID:2648357 Probab=99.46 E-value=1e-13 Score=91.79 Aligned_cols=258 Identities=12% Similarity=0.054 Sum_probs=159.6 Q ss_pred Cccc-----hhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCc--ccceeecCCCCCCCccccccce Q lcl|NC_019506. 1 MAVT-----SFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGA--ITVKEYTENSDIDAPEELSTTE 73 (276) Q Consensus 1 MA~~-----~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~--~~~~d~~~~~~~~~~~~~~~~~ 73 (276) +... .++|..+...+++.+++.+.+.+++..- + ..+.++++|+... ..+..+.+++.... .+++... T Consensus 114 ~~~~~~~~g~~~~~~~~~~ii~~~~~~~~l~~~~~~~----~-~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~-~~~~~~~ 187 (390) T protein:vir:10 114 STDAAGSAGALTTPNRLPGFITQPDARLTVRDLIGSG----R-TDSALIEYVQETGFVNNAAIVAEGALKPE-SSLKFAK 187 (390) T ss_pred hcccccccccccchhHHHHHHHHHHhhchhhhhccee----e-ccCCceEEEEEecCCcceeeecCCccccc-cccceeE Confidence 1111 2445555667888888888888887531 1 2355788887643 34556666666543 4567778 Q ss_pred EEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc--------ccccccCCHHHHHH Q lcl|NC_019506. 74 KVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSK--------LKPAATLDKTNIYE 145 (276) Q Consensus 74 ~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~--------~~~~~~~t~~~~~~ 145 (276) +++.+.+. +.-+.|+++- ++...++.+.+.++.++++++++|..++..-.++.... .......++...++ T Consensus 188 i~~~~~k~-~~~~~is~el-l~d~~~l~~~i~~~l~~~~~~~~~~~il~G~G~~~~p~Gi~~~~~~~~~~~~~~~~~~~~ 265 (390) T protein:vir:10 188 KTDTTHVI-AHTMKATRQI-LSDAPQLASYMNNRLIRGLKVKEDAEILRGTGANDGLLGLIPQATTYAAPTTIAGATRVD 265 (390) T ss_pred EEEeeEEE-EEeehhhHHH-HHhHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCccccccccccccccccccccccchHH Confidence 88888665 4456778764 44456888888899999999999999885422221100 11111223334567 Q ss_pred HHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEeccccccccceEEEEE Q lcl|NC_019506. 146 ELIKVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSNNMGSLTNGTGAIAG 225 (276) Q Consensus 146 ~i~~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp~~~~~~~~~~~ 225 (276) .+..+...+.....+.. .+++||..+..|.+...- ...+..... ..+..++++|.+|+.++.+|.. ...+.. T Consensus 266 ~~~~~~~~l~~~~~~~~--~~v~n~~~~~~L~~lkd~--~g~~l~~~~-~~~~~~~l~G~pv~~~~~~p~~---~~~~gd 337 (390) T protein:vir:10 266 QLRLAMLQASLAEYPAS--GIVINPIDWAAIELAKDA--NNQYLIGNA-RGTLTPTLWGLPVVATQAMAPG---EFLVGA 337 (390) T ss_pred HHHHHHHhhccccCCCC--EEEEcHHHHHHHHHhhcC--CCceeecCC-cCcCCceecceeeEEcCCCCCC---cEEEEe Confidence 78888888887776433 588999999988764321 111112222 2334467999999999999842 233332 Q ss_pred ecceEEeee-eeeeeeeccCcccc---eeeEEeeeeeeeEEEcCCeEEEEEec Q lcl|NC_019506. 226 VKMACTFAE-QIVQTEAYRMEKRF---ADAVKGLNVFGCKVIYPDALVCLKKT 274 (276) Q Consensus 226 ~~~a~~~~~-~~~~~e~~~~~~~~---~~~i~~~~~yg~~v~~~~~vv~~~~~ 274 (276) ...++.... ....++..+....| ...+++..++|+++++|++++.++.+ T Consensus 338 f~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~~~~a 390 (390) T protein:vir:10 338 FDLAAQIFDQWDARVEIGYVNDDFQRNMVTVLAEERLALVVYRPEALISGSFA 390 (390) T ss_pred ccceEEEEEecceEEEEeecccccccCcEEEEEEEeeccEEeccccEEEEEeC Confidence 333433332 22234433322222 24677889999999999999999988 No 119 >protein:vir:104256 Length: 458 # NCBI annotation: major head protein precursor # Family: family:all:27070 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006977;genbank:gi:46401878;genbank:GeneID:2777673 Probab=99.45 E-value=1.3e-13 Score=91.21 Aligned_cols=268 Identities=15% Similarity=0.057 Sum_probs=158.9 Q ss_pred Ccc---chhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCc-ccceeecCCCCCCC-----cccccc Q lcl|NC_019506. 1 MAV---TSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGA-ITVKEYTENSDIDA-----PEELST 71 (276) Q Consensus 1 MA~---~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~-~~~~d~~~~~~~~~-----~~~~~~ 71 (276) +.+ ..++|+.+...+++.+++..++..+++.- + ..|....+|+... ..+..+.++..... ....+. T Consensus 165 ~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~----~-~~~~~~~~~~~~~~~~a~~v~e~~~~~~~~~~~~~~~~~ 239 (458) T protein:vir:10 165 SSVEVSSESYETIFSQRIIRDLQKELVVGALFEEL----P-MSSKILTMLVEPDAGKATWVAASTYGTDTTTGEEVKGAL 239 (458) T ss_pred ccCccccceehhhHhHHHHHHHHhhhhHHhhccee----e-cCCcceEEEEecCCcceeecccccccccccccccccccc Confidence 111 23789999999999999999988887542 2 2345566665433 34444444433221 112344 Q ss_pred ceEEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccc---------cc---cc-ccccC Q lcl|NC_019506. 72 TEKVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNAT---------SK---LK-PAATL 138 (276) Q Consensus 72 ~~~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~---------~~---~~-~~~~~ 138 (276) ..+++...+. +.-+.|+++-...+..++...+.+..++++++++|..++..-..+.. .. .. ..... T Consensus 240 ~~i~~~~~k~-~~~v~is~ell~ds~~~~~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~ 318 (458) T protein:vir:10 240 KEIHFSTYKL-AAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMTGDGSGKPKGLLTLASEDSAKVVTEAKADG 318 (458) T ss_pred eeeEeeeeeE-EeeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCccceeeecccccccceeecccccc Confidence 5555555333 33467788766667789999999999999999999988753211100 00 00 00111 Q ss_pred CHHHHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcc--cccccceeeeeeeEEeceEEEEecccccc Q lcl|NC_019506. 139 DKTNIYEELIKVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGG--AMAESITKNGFVGTILGFDVYLSNNMGSL 216 (276) Q Consensus 139 t~~~~~~~i~~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~--~~~~~~~~~G~i~~~~G~~v~~s~~lp~~ 216 (276) .....++.|.++...|..... .+-.+++||..+..|.....-....- .........|..++++|.+|+.++.+|.. T Consensus 319 ~~~~~~~~i~~~~~~l~~~~~--~~~~~v~~~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~l~G~pv~~~~~~p~~ 396 (458) T protein:vir:10 319 SVLVTAKTISKLRRKLGRHGL--KLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDSVKLQGQVGRIYGLPVVVSEYFPAK 396 (458) T ss_pred cccccHHHHHHHHHhhhhhhc--CCCEEEEcHHHHHHHHhhcccCCceeeccccccccccCcCceecceeeEEccccccc Confidence 112236778888887776654 33468999999998865322111100 01122344566778999999999999975 Q ss_pred ccceEE-EEEecceEEeeeee-eeeeeccCcccceeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 217 TNGTGA-IAGVKMACTFAEQI-VQTEAYRMEKRFADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 217 ~~~~~~-~~~~~~a~~~~~~~-~~~e~~~~~~~~~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) ++.... +..-.+++.+..+. ..++..+-...-...++...++|..|.+|+++|..+.+|- T Consensus 397 ~~~~~~~~~~f~~~~~~~~~~~~~v~~d~~~~~~~~~~~~~~r~~~~v~~~~a~v~~~~aa~ 458 (458) T protein:vir:10 397 ANSAEFAVIVYKDNFVMPRQRAVTVERERQAGKQRDAYYVTQRVNLQRYFANGVVSGTYAAS 458 (458) T ss_pred cCCcceEEEEecccEEEEEeeceEEEeecccCCCceEEEEEEEecceEecccceEEEeeccC Confidence 443332 22223333333221 2232211111112457788999999999999999987777 No 120 >protein:vir:2504 Length: 305 # NCBI annotation: major capsid subunit gp9 # Family: family:all:507 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569745;genbank:gi:18496895;genbank:GeneID:932268 Probab=99.45 E-value=9.4e-14 Score=91.99 Aligned_cols=259 Identities=10% Similarity=0.074 Sum_probs=158.7 Q ss_pred Cccc------hhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCcc-cceeecCCCCCCCc----ccc Q lcl|NC_019506. 1 MAVT------SFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGAI-TVKEYTENSDIDAP----EEL 69 (276) Q Consensus 1 MA~~------~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~~-~~~d~~~~~~~~~~----~~~ 69 (276) ||.. .++|+.++..+++.+++.+++.++++.- ...+.++++|+.... .+..+.++...... .+. T Consensus 1 ma~~t~~~gg~liP~~~~~~Ii~~~~~~s~l~~l~~~~-----~~~~~~~~~p~~~~~~~a~wv~E~~~~~~~~~~~s~~ 75 (305) T protein:vir:25 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNV-----NMGTKTTHLPVLATLPEADWVGESATDPKGVKPTSKV 75 (305) T ss_pred CCCccCCccceecCHHHHHHHHHHHHhhchhhhhccee-----eccCCcEEEEEEeCCcceEEeeccccccccccccccc Confidence 9864 4789999999999999999999987531 123567889876543 46666665543221 234 Q ss_pred ccceEEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccc--------c-------cccc Q lcl|NC_019506. 70 STTEKVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNAT--------S-------KLKP 134 (276) Q Consensus 70 ~~~~~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~--------~-------~~~~ 134 (276) +...+++...+. +.-+.|+++-...+..++...+.++.++++++++|+.++..-.+... . ..+. T Consensus 76 ~f~~i~~~~~k~-~~~~~is~ell~ds~~~~~~~i~~~l~~~~a~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~ 154 (305) T protein:vir:25 76 TWANRTLVAEEI-AVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVV 154 (305) T ss_pred ceeeEEeeeEEE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHhhhheeccCCCCCcccccccccccccccccccc Confidence 455556665443 44467888777777889999999999999999999998853221100 0 0011 Q ss_pred cccCCHHHHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEecccc Q lcl|NC_019506. 135 AATLDKTNIYEELIKVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSNNMG 214 (276) Q Consensus 135 ~~~~t~~~~~~~i~~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp 214 (276) .+..+..+.++.+..+...+........ -++++|..+..|.+... . .+...+.. +.++|.+++.++.+| T Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~v~~~~~~~~l~~lkd---~---~G~~i~~~---~~l~G~Pv~~~~~~~ 223 (305) T protein:vir:25 155 GGVANESDIVGATNRAAKAVASAGWAPD--TLLSSLALRYEVANIRD---A---NGNPVFRD---DSFAGFRTFFNRNGA 223 (305) T ss_pred ccchhhhHHHHHHHHHHHhhhhcccccc--eeEecHHHHHHHHHhhc---c---CCceeecC---CcccccceEEcCccC Confidence 1111223344555555555444332111 27889999999865321 1 12222322 368999999999998 Q ss_pred ccccceEEEEEecceEEeeee-eeeeeeccC--------c-ccc---eeeEEeeeeeeeEEEcCCeEEEEEec-----CC Q lcl|NC_019506. 215 SLTNGTGAIAGVKMACTFAEQ-IVQTEAYRM--------E-KRF---ADAVKGLNVFGCKVIYPDALVCLKKT-----NP 276 (276) Q Consensus 215 ~~~~~~~~~~~~~~a~~~~~~-~~~~e~~~~--------~-~~~---~~~i~~~~~yg~~v~~~~~vv~~~~~-----~p 276 (276) ...+....+++..+.+.+..+ ...++..+. + ..| ...++...++|..|++|++++.+..+ .| T Consensus 224 ~~~~~~~~~~gd~s~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~R~~~r~~~~v~~p~a~v~~~~~~~~~~~p 303 (305) T protein:vir:25 224 WDADAAIEVIADSSRVKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAVVAP 303 (305) T ss_pred CCCCccEEEEEecceEEEEEecCeEEEEeeeeeeecCCceeeeeecCcEEEEEEEeecceeeCcccEEEEccccccccCC Confidence 765554555444333333322 122222211 0 112 24677889999999999999998753 44 No 121 >protein:vir:3991 Length: 404 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116499;genbank:gi:14251132;genbank:GeneID:921252 Probab=99.44 E-value=2.1e-13 Score=90.04 Aligned_cols=260 Identities=11% Similarity=0.020 Sum_probs=159.5 Q ss_pred Ccc------chhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCc--ccceeecCCCCCCCccccccc Q lcl|NC_019506. 1 MAV------TSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGA--ITVKEYTENSDIDAPEELSTT 72 (276) Q Consensus 1 MA~------~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~--~~~~d~~~~~~~~~~~~~~~~ 72 (276) |.. ..++|+.+...+++.+++.+++..+++.- ......| ++.+++... ..+..+.+++.......++.. T Consensus 116 ~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~--~~~~~~~-~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~ 192 (404) T protein:vir:39 116 ETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVE--SVSTSNG-SRVYEKWTDVTPLTVMDAEDGKIPDLDNPRLT 192 (404) T ss_pred hhcccccCCceeccHHHHHHHHHHHHhhhhHHhhccee--eccCCcc-eEEEEeecCCccceeeecCcccccccccccee Confidence 211 24689999999999999999998887532 1111122 333444332 234556666654433456777 Q ss_pred eEEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccCCHHHHHHHHHHHHH Q lcl|NC_019506. 73 EKVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSKLKPAATLDKTNIYEELIKVKV 152 (276) Q Consensus 73 ~~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~~ 152 (276) .+++.+.+. +.-+.|+++-...+..++...+.++.++++++++|+.++.....+.. .+...+ ++.+.++.. T Consensus 193 ~i~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~il~g~g~~~~----~~~~~~----~~~i~~~~~ 263 (404) T protein:vir:39 193 IIKYLIKRY-AGIITATNTLLKDTAENILAWLSSWIAKKVVVTRNQAIIAAMGTVPK----KPTIAK----FDDVITMIN 263 (404) T ss_pred eEEeeeeeE-EeeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHhccccccc----cccccc----HHHHHHHHH Confidence 888888665 44578888877777889999999999999999999998865443221 111122 445554433 Q ss_pred -HHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEec--cccccccceE-EEEEe-c Q lcl|NC_019506. 153 -KLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSN--NMGSLTNGTG-AIAGV-K 227 (276) Q Consensus 153 -~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~--~lp~~~~~~~-~~~~~-~ 227 (276) .++.... .+-.+++||..+..|.....- ...+.....+..|..++++|++|+.+. .+|..+.+.. .+.+. + T Consensus 264 ~~~~~~~~--~~a~~v~n~~~~~~L~~lkd~--~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~~~gd~~ 339 (404) T protein:vir:39 264 TSVDPAII--ATSSLLTNQSGLNKLALVKTA--EGKYLLEPDPTKPNSYLIKGKKVIVVADRWLPNSGSTVYPLYYGDMS 339 (404) T ss_pred Hhhhhhhc--cCCEEEEcHHHHHHHHHhhcc--CCceeeccCcCCCCcceecceeEEEecccccCccCCCccEEEEEecc Confidence 3333221 234689999999999864321 112222233455666799999998854 3555443332 23332 3 Q ss_pred ceEEee-eeeeeeeeccCc----ccceeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 228 MACTFA-EQIVQTEAYRME----KRFADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 228 ~a~~~~-~~~~~~e~~~~~----~~~~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) .++.+. .+...++..+.. ......+++..++|+.+++|++++.++.++. T Consensus 340 ~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~ 393 (404) T protein:vir:39 340 QAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKTTDSEALVAGSFTAI 393 (404) T ss_pred ccEEEEeecceEEEEeccchhhhhhceeeEEEEeeeccEEecccceEEEEeecc Confidence 344333 222334333322 1224578899999999999999999986655 No 122 >protein:vir:81160 Length: 371 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285811;genbank:gi:148747732;genbank:GeneID:5247203 Probab=99.43 E-value=1.8e-13 Score=90.50 Aligned_cols=259 Identities=14% Similarity=0.118 Sum_probs=161.2 Q ss_pred Ccc------chhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCc-ccceeecCCCCCCCccccccce Q lcl|NC_019506. 1 MAV------TSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGA-ITVKEYTENSDIDAPEELSTTE 73 (276) Q Consensus 1 MA~------~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~-~~~~d~~~~~~~~~~~~~~~~~ 73 (276) |.. ..++|+.+...+++.+++.+++..++....- . ...-++.+++... ..+..+.++........++.+. T Consensus 91 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~i~~~~~~~~~--~-~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~ 167 (371) T protein:vir:81 91 MSEGSNQDGGYTVPQDIQTRINELRESKDALQNLITVEPV--T-TLSGSRVFKKRSQQTGFVEVAEGAAIGEKATPQFTL 167 (371) T ss_pred hccCCCccCceeecHhHHHHHHHHHHhhhhhhhhceeeec--c-CCceeEEEEeecCCcceeeeccccccccccccceee Confidence 332 2478999999999999999999888754211 1 1123455555443 4566677776654334567777 Q ss_pred EEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccCCHHHHHHHHHHHH-H Q lcl|NC_019506. 74 KVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSKLKPAATLDKTNIYEELIKVK-V 152 (276) Q Consensus 74 ~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~-~ 152 (276) +++...+. +.-+.|+++-...+..++...+.+..++++++.+|..++....+... ....+ ++.+..+. . T Consensus 168 i~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~a~~~~~~~~i~~g~g~~~~-----~~~~~----~~~i~~~~~~ 237 (371) T protein:vir:81 168 LQYQVKKY-AGFFRVTNELLNDSTEAIVNTLVRWIGDESRVTRNGLIINVLNTKAK-----TAIAD----LDGLKQIINV 237 (371) T ss_pred EEeeeeEE-EEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc-----ccccc----HHHHHHHHHh Confidence 78887665 44478888877677789999999999999999999988876543221 11112 33444332 2 Q ss_pred HHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEeccccccccc--------eEEEE Q lcl|NC_019506. 153 KLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSNNMGSLTNG--------TGAIA 224 (276) Q Consensus 153 ~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp~~~~~--------~~~~~ 224 (276) .|.... ..+-..++||..+..|.+...- ...+.....+..|..++++|.+|+.++++|..... ..++. T Consensus 238 ~l~~~~--~~~a~~vmn~~~~~~L~~lkd~--~g~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~~~~~i~~ 313 (371) T protein:vir:81 238 QLDPVF--RSTSSVIVNQDAFNWLDTLKDQ--NGQYLLQPSISSPTGRQLLGLPVVIVSNKVLANRVDGGTGAQFAPIIV 313 (371) T ss_pred hcchhh--hcCCEEEEcHHHHHHHHHhhcc--CCCeeeecccCCCCCceecceeEEEecccccCccccccccCCcceEEE Confidence 333322 2334689999999998764321 12223333456677789999999999988743221 11222 Q ss_pred Ee-cceEEee-eeeeeeeeccCc-ccc---eeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 225 GV-KMACTFA-EQIVQTEAYRME-KRF---ADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 225 ~~-~~a~~~~-~~~~~~e~~~~~-~~~---~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) +. +.++... .....++..+.. +.| ...+++..++|.++++|+++++++.++= T Consensus 314 Gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~v~~~~~~r~d~~~~~~~a~~~~~~~~A 371 (371) T protein:vir:81 314 GDLKEAVVMFDRQRTEIMSSNVAMDAFETDATLWRAIERMDVKMRDDEAFVFGEVQLA 371 (371) T ss_pred EehhceEEEEeecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEEecC Confidence 21 2222222 222223322221 122 3588899999999999999999995555 No 123 >protein:vir:93616 Length: 645 # NCBI annotation: putative major head protein/prohead protease # Family: family:all:21 # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449293;genbank:gi:157166041;goa:Q6H9U8;interpro:IPR006433;uniprot:Q6H9U8;genbank:GeneID:5580438 Probab=99.43 E-value=1.7e-13 Score=90.54 Aligned_cols=268 Identities=9% Similarity=-0.004 Sum_probs=158.8 Q ss_pred CccchhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccC-cccceeecCCCCCCCccccccceEEEEEE Q lcl|NC_019506. 1 MAVTSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIG-AITVKEYTENSDIDAPEELSTTEKVLEIN 79 (276) Q Consensus 1 MA~~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~-~~~~~d~~~~~~~~~~~~~~~~~~~~~ld 79 (276) -+-.+++|+.+++++++.+++.+++..+..+......... -.++||+.. ...+..+.++..... .+.+.+.+++... T Consensus 344 ~~Gg~~vp~~~~~~ii~~l~~~svv~~l~~~~~~~~~~~~-~~~~ip~~t~~~~a~wv~Eg~~~~~-s~~~f~~v~l~~~ 421 (645) T protein:vir:93 344 WAGSLSEYQEYAQDFIDYLRPQTIIGRFGQGGIPALRQVP-FNIRVHAQVSGGAAGWVGEGKTKPL-TKFDFESITFSHA 421 (645) T ss_pred ccCCccCchhhHHHHHHhhhhhhhHHhhcccccccccccc-CceeeeeeecCcceEEeccCccccc-cccceeEEEEeeE Confidence 1123578999999999999999988887644322222112 246788753 345666667766544 4567777777765 Q ss_pred eeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccc---cc---cccccCCHHHHHHHHHHHHHH Q lcl|NC_019506. 80 KQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATS---KL---KPAATLDKTNIYEELIKVKVK 153 (276) Q Consensus 80 ~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~---~~---~~~~~~t~~~~~~~i~~a~~~ 153 (276) +. +.-+.|+++-..++..++.+.+.+..+++++.++|..++..-.++... .+ ......+.......+..+... T Consensus 422 kl-a~~~~iS~ell~ds~~~~~~~i~~~l~~aia~~~d~a~l~g~g~~~~~~~p~gi~~~~~~~~~~~~~~~d~~~~~~~ 500 (645) T protein:vir:93 422 KV-SAIAVLTEELIRFSSPAADALVRNALAEAVVARLDTDFVDPKKAAVADVSPASITHDVKGTASSGNPDADAEAAFGQ 500 (645) T ss_pred EE-EEeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHhhcCCCcccCCccccceeccccccccccchHHHHHHHHHH Confidence 43 444677777666777888888889999999999999988543222110 00 011111222334567777777 Q ss_pred HhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEeccccccc---cceEEEEEecceE Q lcl|NC_019506. 154 LDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSNNMGSLT---NGTGAIAGVKMAC 230 (276) Q Consensus 154 l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp~~~---~~~~~~~~~~~a~ 230 (276) +..+++...+-+.++||..+..|.+...-.. .....+.-..| ++++|.+|+.|+.+|... .....+.+....+ T Consensus 501 ~~~a~~~~~~a~~vmn~~~~~~L~~lkd~~G--~~~~~~~~~~~--~tL~G~PV~~s~~vp~~~~~gd~s~~~ig~~~~v 576 (645) T protein:vir:93 501 FVAANLQPTGAVWLMSSTNALALSMRKNALG--QKEYPDMTLLG--GSFQGLPVIVSQYVGDQLVLVNAPDIYLADDGGV 576 (645) T ss_pred HHhcCCCccccEEEEcHHHHHHHHhccccCC--ceeecCCCCCC--ceeeceeeEEeccCCcceeEeccccEEEEEecce Confidence 7777765555678999999999876532111 11111111112 479999999999998521 0001111222222 Q ss_pred Eeee-eeeeeeeccCc-------------ccc---eeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 231 TFAE-QIVQTEAYRME-------------KRF---ADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 231 ~~~~-~~~~~e~~~~~-------------~~~---~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) .+.. +...++-...+ ..| -..|++..+++..+.+|+++++|+ .+. T Consensus 577 ~i~~s~~a~~~~~~~~~~~~~~~~~~~~v~lf~~d~vaira~~r~d~~~~~p~a~~~lt-~~~ 638 (645) T protein:vir:93 577 AVDMSREASLEMQSEPTGDSTTPSPVELVSMFQTGSVAIRAERWINWRRRRTAAVAVIT-GVN 638 (645) T ss_pred EEEeecceeEEEeecccccccccccccchhHhhcCceEEEEEEEEcceeeCccceEEEe-ccc Confidence 1111 11111111000 001 247888899999999999999996 444 No 124 >protein:vir:1268 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690760;genbank:gi:22855000;genbank:GeneID:955203 Probab=99.42 E-value=2e-13 Score=90.24 Aligned_cols=259 Identities=14% Similarity=0.053 Sum_probs=163.4 Q ss_pred Ccc------chhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEecc-CcccceeecCCCCCCCccccccce Q lcl|NC_019506. 1 MAV------TSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQI-GAITVKEYTENSDIDAPEELSTTE 73 (276) Q Consensus 1 MA~------~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~-~~~~~~d~~~~~~~~~~~~~~~~~ 73 (276) |+. ..++|+.|...+.+.+++.+++..+++.-.. ....| ++.+++. +...+..+.++.........+.+. T Consensus 123 ~~~~~~~~gg~lvP~~~~~~ii~~~~~~~~l~~~~~~~~~--~~~~~-~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~~ 199 (397) T protein:vir:12 123 MSGINDEDGGILIPEDIGRQIHEFKRQFEPLEQYVTVEPV--TTRSG-TRLLEKNADMVPFSPVEELGNLPEIDQPRFTK 199 (397) T ss_pred ccccccccCcccCchhHHHHHHHhhhhhhhHHhhcceeec--cCCce-eEEEEEecCCcceeeeccccccccccccccee Confidence 332 1478999999999999999998888754211 11122 4555543 445567777776654323456677 Q ss_pred EEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccCCHHHHHHHHHHHH-H Q lcl|NC_019506. 74 KVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSKLKPAATLDKTNIYEELIKVK-V 152 (276) Q Consensus 74 ~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~-~ 152 (276) +++...+. +.-+.|+++-...+..++...+.+..++++++++|..++....++. +.+..+ ++.+.++. . T Consensus 200 v~~~~~k~-~~~~~is~e~l~ds~~~l~~~i~~~l~~~~~~~~d~~il~G~g~~~-----~~g~~~----~~~i~~~~~~ 269 (397) T protein:vir:12 200 VSYSIIDY-GGIMTLSNSMLNDSDQAIMTYVAKWFAKKSVVTRNNLILAAIASLK-----KVDIDG----LDGIKKALNV 269 (397) T ss_pred EEeeheee-EeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHhcccccc-----cccccc----HHHHHHHHhh Confidence 77777554 4446788877777778999999999999999999999886544322 111122 44555543 2 Q ss_pred HHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEecc-ccccccce-EEEEEe-cce Q lcl|NC_019506. 153 KLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSNN-MGSLTNGT-GAIAGV-KMA 229 (276) Q Consensus 153 ~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~-lp~~~~~~-~~~~~~-~~a 229 (276) .+.... ..+-.+++||..+..|.+...- ...+.....+.+|..++++|.+|+.+++ .|..+.+. ..+.+. +.+ T Consensus 270 ~l~~~~--~~~a~~~~n~~~~~~L~~lkd~--~G~~l~~~~~~~g~~~~l~G~pv~~~~~~~~~~~~~~~~~~~gd~~~~ 345 (397) T protein:vir:12 270 TLDPMV--APGSIVLTNQDGYDWLDTLKDG--TGRYLLQPDPTNPTKKLLDGRPVVPFTNRVLKTQKGKAPLIIGNLKEA 345 (397) T ss_pred ccchhh--hCCCEEEEcHHHHHHHHHhhcc--CCceeecccccCCCCccccceeeEEecccccccCCCccEEEEEehhce Confidence 343322 2344689999999998764211 1222333345677778999999987665 34332222 233332 444 Q ss_pred EEee-eeeeeeeeccCcc----cceeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 230 CTFA-EQIVQTEAYRMEK----RFADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 230 ~~~~-~~~~~~e~~~~~~----~~~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) +... .+...++..+... .-...+++..+++.++++|+++++++.|+= T Consensus 346 ~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~t~~ 397 (397) T protein:vir:12 346 IVLFDREQQSIASTDTGAGAFETNSTKVRGIEREDVRKWDEDAVVFGQITVE 397 (397) T ss_pred EEEEeecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEEeeC Confidence 4333 2333344333222 224588999999999999999999999999 No 125 >protein:vir:100884 Length: 389 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358764;genbank:gi:78000028;genbank:GeneID:3726155 Probab=99.41 E-value=2.4e-13 Score=89.75 Aligned_cols=257 Identities=16% Similarity=0.147 Sum_probs=159.0 Q ss_pred Ccc------chhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEecc--CcccceeecCCCCCCCccccccc Q lcl|NC_019506. 1 MAV------TSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQI--GAITVKEYTENSDIDAPEELSTT 72 (276) Q Consensus 1 MA~------~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~--~~~~~~d~~~~~~~~~~~~~~~~ 72 (276) |+. ..++|+.|...+++.+++...+..+++.- + ..+.+.++|.. +...+..+.+++.......++.. T Consensus 109 ~~~~t~~~gg~~vP~~~~~~i~~~~~~~~~l~~~~~~~----~-~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~ 183 (389) T protein:vir:10 109 TSKVTSTEAGVLIPEEIIYDPTAEVNSVVDLSTLVTKT----P-VTTPKGTYPILKRATDRFSSVAELAENPKLAEPEFN 183 (389) T ss_pred hcccccCCcceeehHHHHHHHHHHHHhhhhHHhhccee----e-ccCCeeEEEEEecCCCccccccccccccccccccce Confidence 332 14789999999999999999888887532 1 12345666654 33334555555544333456677 Q ss_pred eEEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccCCHHHHHHHHHHHHH Q lcl|NC_019506. 73 EKVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSKLKPAATLDKTNIYEELIKVKV 152 (276) Q Consensus 73 ~~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~~ 152 (276) .+++.+.+. +.-+.|+++-...+..++...+.+..+++++...|..++.....+..... +....++.+.++.. T Consensus 184 ~i~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~i~~g~~~~~~~~~------~~~~~~d~l~~~~~ 256 (389) T protein:vir:10 184 KVDWSVATY-RGAIPLSEEAIADSAVDLTALVGQSIKEKSVNTYNAMIAPVLQSFTAKKT------TTDTLVDSLKHILN 256 (389) T ss_pred eeeeeheee-EeeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccc------cccccHHHHHHHHH Confidence 777777554 44567888777777789999999999999999999998877665432211 12223455555443 Q ss_pred -HHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhccccccc----ceeeeeeeEEeceEEEEecc--ccccccceEEEEE Q lcl|NC_019506. 153 -KLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAES----ITKNGFVGTILGFDVYLSNN--MGSLTNGTGAIAG 225 (276) Q Consensus 153 -~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~----~~~~G~i~~~~G~~v~~s~~--lp~~~~~~~~~~~ 225 (276) .++.. .+-.+++||..+..|.+...-. ..+.... ....|..++++|.+|+.++. +|..++....+.+ T Consensus 257 ~~~~~~----~~a~~~~n~~~~~~L~~lkd~~--G~~i~~~~~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~~g 330 (389) T protein:vir:10 257 VDLDPA----YSRALVVTQSLFNTLDTLKDKN--GRYLLHDASDSITDGTAKGTILGVPVYVVGDTLLGSLAGDQKAFVG 330 (389) T ss_pred hhhhhh----hCcEEEecHHHHHHHHHhhccC--CCeeeecCcccccccccccccccceeEEecccccCCCCCceEEEEe Confidence 23322 2346899999999998643211 1111111 12234456899999976543 3333322222222 Q ss_pred -ecceEEee-eeeeeeeeccCcccceeeEEeeeeeeeEEEcCCeEEEEEec-CC Q lcl|NC_019506. 226 -VKMACTFA-EQIVQTEAYRMEKRFADAVKGLNVFGCKVIYPDALVCLKKT-NP 276 (276) Q Consensus 226 -~~~a~~~~-~~~~~~e~~~~~~~~~~~i~~~~~yg~~v~~~~~vv~~~~~-~p 276 (276) -+.++... .+...++..+ ...|...+++.+++|+.+++|++++.++.+ +| T Consensus 331 d~~~~~~~~~~~~~~i~~~~-~~~~~~~~~~~~r~d~~~~~~~a~~~~~~~~~~ 383 (389) T protein:vir:10 331 DLKRGVLFTDRQQVTLAWED-SKIYGKYLGAAFRFGVQKADSKAGYFVTNTDVP 383 (389) T ss_pred eccccEEEEeecceEEEeec-cccccceEEEEEEeccEEecccceEEEEeeccC Confidence 23444333 3333444433 455667888999999999999999998844 44 No 126 >protein:vir:2344 Length: 397 # NCBI annotation: gp14 # Family: family:all:507 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075281;genbank:gi:12657868;genbank:GeneID:920118 Probab=99.41 E-value=3e-13 Score=89.24 Aligned_cols=263 Identities=14% Similarity=0.039 Sum_probs=157.6 Q ss_pred Cccc-------hhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCcc-cceeecCCCCCCCccccccc Q lcl|NC_019506. 1 MAVT-------SFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGAI-TVKEYTENSDIDAPEELSTT 72 (276) Q Consensus 1 MA~~-------~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~~-~~~d~~~~~~~~~~~~~~~~ 72 (276) |+.+ ++.|+ +...+++.+++.+++.+++..- + -.+.+++||+.... .+..+.++..... .+++.. T Consensus 10 ~~~~~t~~~~g~l~~~-~~~~ii~~l~~~s~i~~l~~~~----~-~~~~~~~ip~~~~~~~a~wv~Eg~~~~~-s~~~f~ 82 (397) T protein:vir:23 10 IAQTKDTMFTGYLDPV-QAKDYFAEAEKTSIVQRVAQKI----P-MGATGIVIPHWTGDVSAQWIGEGDMKPI-TKGNMT 82 (397) T ss_pred HhhccCCCCccccchh-HHHHHHHHHHhccchhhhccee----e-ccCCceEEEEEcCCcceEEecCCccccc-ccccee Confidence 3322 34555 5667788888888888887531 1 23567889987543 4566666666543 567777 Q ss_pred eEEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc-----cccccCCHHHHHHHH Q lcl|NC_019506. 73 EKVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSKL-----KPAATLDKTNIYEEL 147 (276) Q Consensus 73 ~~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~~-----~~~~~~t~~~~~~~i 147 (276) ++++.+.+. +.-+.|+++-...+..++...+.++.++++++++|+.++..-.+.....+ ...........++.+ T Consensus 83 ~v~l~~~k~-~~~v~iS~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l~G~gt~~~~~~~~~~~~~~~~~~~~~~~~~~ 161 (397) T protein:vir:23 83 KRDVHPAKI-ATIFVASAETVRANPANYLGTMRTKVATAIAMAFDNAALHGTNAPSAFQGYLDQSNKTQSISPNAYQGLG 161 (397) T ss_pred EEEEeeEEE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhcccCCcccccccccccceeeecccchhHHH Confidence 778877554 44578888877778899999999999999999999998854332211100 001111222234556 Q ss_pred HHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcc---cccccceeeeeeeEEeceEEEEeccccccccceEEEE Q lcl|NC_019506. 148 IKVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGG---AMAESITKNGFVGTILGFDVYLSNNMGSLTNGTGAIA 224 (276) Q Consensus 148 ~~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~---~~~~~~~~~G~i~~~~G~~v~~s~~lp~~~~~~~~~~ 224 (276) .++...|.....+ .-..++||..+..|.+...-....- ....+....+..++++|++++.++++|... ...+. T Consensus 162 ~~~~~~l~~~~~~--~a~~vmn~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~~~~~tl~G~Pv~~s~~~~~g~--~~~~~ 237 (397) T protein:vir:23 162 VSGLTKLVTDGKK--WTHTLLDDTVEPVLNGSVDANGRPLFVESTYESLTTPFREGRILGRPTILSDHVAEGD--VVGYA 237 (397) T ss_pred HHHHHhhhhcccC--CCEEEEcHHHHHHHHHhhccCCceeecccccccccccccCceeeeeeEEEeCCCCCCc--eEEEE Confidence 6666666666542 3468999999999986432111100 011111222344689999999999998422 22222 Q ss_pred E-ecceE-EeeeeeeeeeeccCc-------------ccc---eeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 225 G-VKMAC-TFAEQIVQTEAYRME-------------KRF---ADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 225 ~-~~~a~-~~~~~~~~~e~~~~~-------------~~~---~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) + ...++ +.... ..++..++. ..| -..++...++|+++++|++++.++.++- T Consensus 238 gDfs~~~i~~~~~-i~i~~~~e~~~~~~~~~~~~~~~lf~~d~v~~ra~~r~d~~v~~~~a~~~~~~~~~ 306 (397) T protein:vir:23 238 GDFSQIIWGQVGG-LSFDVTDQATLNLGSQESPNFVSLWQHNLVAVRVEAEYGLLINDVNAFVKLTFDPV 306 (397) T ss_pred eecceEEEEEEec-eEEEEeeeeeeeeccccccceeeeeeccceeEEEEeeeccceecccceEEEeeccc Confidence 2 12222 22111 122221111 112 2467888999999999999999985333 No 127 >protein:vir:1433 Length: 435 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536362;genbank:gi:17975167;genbank:GeneID:929171 Probab=99.41 E-value=3.5e-13 Score=88.86 Aligned_cols=264 Identities=13% Similarity=0.101 Sum_probs=156.2 Q ss_pred Ccc--------chhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCc-ccceeecCCCCCCCcccccc Q lcl|NC_019506. 1 MAV--------TSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGA-ITVKEYTENSDIDAPEELST 71 (276) Q Consensus 1 MA~--------~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~-~~~~d~~~~~~~~~~~~~~~ 71 (276) ++. -.++|+.|...+++.+++.+++..+..+.. +...| .+.+|+... ..+..+.+++.... .+++. T Consensus 130 ~~~~~~t~~~gg~~vP~~~~~~ii~~l~~~~~i~~~~~~~~---~~~~~-~~~~p~~~~~~~a~~v~E~~~~~~-~~~~f 204 (435) T protein:vir:14 130 MSLNTLSPGAGGVLVPENLSSEVIELLRPKSVVRKLGARTL---PLSNG-NITIPRLKGGAIVGYIGADTDIPT-TQQQF 204 (435) T ss_pred hhcccCCcCCCccccchhHHHHHHHHHhhhchhhhhcceee---ecCCC-ceEEEEEeCCcceeeeccCccccc-cccce Confidence 111 137899999999999998888877632221 22233 588888744 34555556655543 45666 Q ss_pred ceEEEEEEeeeecceeechHHHHhhhh--hHHHHHHHHHHHHHHHHHHHHHHHHhhcccc-----------ccccccccC Q lcl|NC_019506. 72 TEKVLEINKQKYFNFQIDDVDAAQIRT--PLMDAAMQRAAYALADETEKILLKEMDTNAT-----------SKLKPAATL 138 (276) Q Consensus 72 ~~~~~~ld~~~~~~~~v~d~d~~~~~~--d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~-----------~~~~~~~~~ 138 (276) ..+++...+. +.-+.|+++-...+.. ++.+.+.++..+++++++|+.++..-..+.. ......... T Consensus 205 ~~i~~~~~k~-~~~~~iS~ell~ds~~~~~l~~~i~~~l~~ai~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~~ 283 (435) T protein:vir:14 205 DDLKLTAKKM-AALVPIANDLIKYAGVNPNVDQIVVGDLTAAIGAREDKAFIRDDGTANTPKGLRFWALPSNVITASDAS 283 (435) T ss_pred eEEEeeeEEE-EEeehhhHHHHHhhccCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccceeecccccceecccccc Confidence 6777777554 4446777766555533 4667788899999999999988853222110 011112223 Q ss_pred CHHHHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEecccccccc Q lcl|NC_019506. 139 DKTNIYEELIKVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSNNMGSLTN 218 (276) Q Consensus 139 t~~~~~~~i~~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp~~~~ 218 (276) +...++..+.++...+.....-..+..+++||..+..|.+...- . +...+....-++++|++|+.++.+|.... T Consensus 284 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~--~----G~~l~~~~~~g~l~G~Pv~~~~~~p~~~~ 357 (435) T protein:vir:14 284 TLQKIETDLGKVILALENADANLTQPGWIMAPRTFRFLEGLRDG--N----GNKVYPELANGMLKGYPVGKTTQVPINLG 357 (435) T ss_pred chhhHHHHHHHHHHHhhhccccccCCEEEEcHHHHHHHHHhhcc--C----CceeccCCCCCeeecceeEeecccccccc Confidence 44445566777777676654433445689999999988664321 1 11112111224789999999999986421 Q ss_pred c----eEEEEEecceEEeee-eeeeeeeccCc----------ccc---eeeEEeeeeeeeEEEcCCeEEEEEe-cCC Q lcl|NC_019506. 219 G----TGAIAGVKMACTFAE-QIVQTEAYRME----------KRF---ADAVKGLNVFGCKVIYPDALVCLKK-TNP 276 (276) Q Consensus 219 ~----~~~~~~~~~a~~~~~-~~~~~e~~~~~----------~~~---~~~i~~~~~yg~~v~~~~~vv~~~~-~~p 276 (276) . ...+++.-+-+.+.. ....++..+.. ..| ...+++..+++.++.+|+++++++- +.| T Consensus 358 ~~~~~~~i~~gd~s~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~~~ 434 (435) T protein:vir:14 358 ETGKESEIYFTDFGDVFIGEEETLEIDYSKEATYKDADGHMVSAFQRDQTLIRVIAKNDFGPRHVESIAVLAGVAWG 434 (435) T ss_pred CCCccceEEEeecccEEEEEecccEEEEeccccccccccchhhhhhcChhheeeeeeeCceeecccceEEEecCCCC Confidence 1 122333322222222 22223222211 111 2578899999999999999999963 233 No 128 >protein:vir:8102 Length: 543 # NCBI annotation: gp6 # Family: family:all:21 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817683;genbank:gi:29566114;genbank:GeneID:1259308 Probab=99.40 E-value=2.8e-13 Score=89.40 Aligned_cols=262 Identities=13% Similarity=0.040 Sum_probs=155.6 Q ss_pred Cc--c-----chhhHHHHHHHHH-HHHHHhhcchhhhccccccccccCCcEEEEecc-CcccceeecCCCCCCCcccccc Q lcl|NC_019506. 1 MA--V-----TSFIPKLWSARLL-AHLDKAHVVANLVNRDYEGEIKAYGDTVKINQI-GAITVKEYTENSDIDAPEELST 71 (276) Q Consensus 1 MA--~-----~~l~~e~~~~~~~-~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~-~~~~~~d~~~~~~~~~~~~~~~ 71 (276) ++ . ..++|+.+...++ ..+....++..+++.- ...| .+.+|+. +...+..+.++..... ..++. T Consensus 249 ~~~~~t~~~gg~lip~~~~~~ii~~~~~~~~~l~~~~~~~-----~~~g-~~~~~~~~~~~~a~~v~Eg~~~~~-~~~~~ 321 (543) T protein:vir:81 249 RAMGLTKADGGYLVPFQLDPTVIITSNGSLNDIRRFARQV-----VATG-DVWHGVSSAAVQWSWDAEFEEVSD-DSPEF 321 (543) T ss_pred hhcccccccCcccCchhhhhHHHHHHHhhhchhhhhcccc-----cCCc-ceEEEEecCCcceeecccCccccc-ccccc Confidence 11 1 1467887776655 5567767777766431 1234 3556654 3445666667766543 56777 Q ss_pred ceEEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccc----------ccccccccCCHH Q lcl|NC_019506. 72 TEKVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNAT----------SKLKPAATLDKT 141 (276) Q Consensus 72 ~~~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~----------~~~~~~~~~t~~ 141 (276) ..+++.+.+. +.-+.|+..-. ....++...+.+..+.++++++|..++..-.++.. ......+..+.. T Consensus 322 ~~i~~~~~k~-~~~~~is~ell-~d~~~~~~~i~~~l~~~~~~~~d~ail~G~Gt~~~p~Gi~~~~~~~~~~~~~~~~~~ 399 (543) T protein:vir:81 322 GQPEIPVKKA-QGFVPISIEAL-QDEANVTETVALLFAEGKDELEAVTLTTGTGQGNQPTGIVTALAGTAAEIAPVTAET 399 (543) T ss_pred ceeeeeeeee-EeeehhhHHHH-hccHHHHHHHHHHHHHHHHHHHHHHHhccCCCCcccccchhhccccccccccccccc Confidence 7788887665 44467887644 44579999999999999999999988743221100 011111222333 Q ss_pred HHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEecccccccc--- Q lcl|NC_019506. 142 NIYEELIKVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSNNMGSLTN--- 218 (276) Q Consensus 142 ~~~~~i~~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp~~~~--- 218 (276) ..++.+.++...+..... .+-.+++||.++..|.+...-. ..+... .+..|..++++|.+|+.++.+|.... T Consensus 400 ~~~~~~~~~~~~l~~~~~--~~~~~v~n~~~~~~l~~lkd~~--G~~l~~-~~~~g~~~~l~G~pv~~~~~~~~~~~~~~ 474 (543) T protein:vir:81 400 FALADVYAVYEQLAARHR--RQGAWLANNLIYNKIRQFDTQG--GAGLWT-TIGNGEPSQLLGRPVGEAEAMDANWNTSA 474 (543) T ss_pred ccHHHHHHHHHhhhcccc--CCcEEEEcHHHHHHHHHhhcCC--Cceecc-CcCCCCCccccceeeEEeccccccccccc Confidence 457778888777765543 2336899999999997643211 111222 23456667899999999999986432 Q ss_pred --ce-EEEEEecceEEeeee-eeeeeecc------CcccceeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 219 --GT-GAIAGVKMACTFAEQ-IVQTEAYR------MEKRFADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 219 --~~-~~~~~~~~a~~~~~~-~~~~e~~~------~~~~~~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) +. ..+++.-+.+.+..+ ...++... +...-...+++..++|+.+++|++++.++.++= T Consensus 475 ~~~~~~i~~gd~~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~r~d~~v~~~~A~~~l~~~~~ 542 (543) T protein:vir:81 475 SADNFVLLYGNFQNYVIADRIGMTVEFIPHLFGTNRRPNGSRGWFAYYRMGADVVNPNAFRLLNVETA 542 (543) T ss_pred cCCcceEEEeeccceeEEeecccEEEEeccccccchhhcCceEEEEEEeeccEeecccceEEEEeccc Confidence 22 223333233333321 11222111 011113467888999999999999999885544 No 129 >protein:vir:1084 Length: 437 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076738;genbank:gi:13095848;genbank:GeneID:920418 Probab=99.39 E-value=2.2e-13 Score=90.01 Aligned_cols=258 Identities=12% Similarity=0.051 Sum_probs=153.5 Q ss_pred Ccc------chhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEecc--CcccceeecCCCCCCCccccccc Q lcl|NC_019506. 1 MAV------TSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQI--GAITVKEYTENSDIDAPEELSTT 72 (276) Q Consensus 1 MA~------~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~--~~~~~~d~~~~~~~~~~~~~~~~ 72 (276) +.. ..++|+.+...+.. +.....+..+++.- + ....++++|.. +...+..+.+++........+.. T Consensus 156 ~~~~~~~~~g~lvp~~~~~~i~~-~~~~~~l~~~~~~~----~-~~~~~~~~~~~~~~~~~~~~~~e~~~~~e~~~~~~~ 229 (437) T protein:vir:10 156 VTGIALKDGKVIIPETILTPEKE-VHQFPRLGSLVRTE----S-VTTTTGKLPIFNNSTDLLTAHTEYGQTTKNATPVIT 229 (437) T ss_pred hhhcccccccccchHHHHHHHHH-hhhhhhhhhcceeE----e-eccCceeeEEeeccccccccccccccccccccccce Confidence 111 14678888887654 44444455554321 1 11234556554 23345555555544323345556 Q ss_pred eEEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccCCHHHHHHHHHHHHH Q lcl|NC_019506. 73 EKVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSKLKPAATLDKTNIYEELIKVKV 152 (276) Q Consensus 73 ~~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~~ 152 (276) .+++...+. +.-+.|+.+-...+..++...+.+..+.+++...|..++....++..... +...++.+.++.. T Consensus 230 ~v~~~~~k~-~~~~~is~ell~ds~~~~~~~i~~~l~~~~~~~~~~~i~~g~g~~~~~~~-------~~~~~~~~~~~~~ 301 (437) T protein:vir:10 230 PILWDLKTY-TGGYVFSQELISDSSYDWQAELQSRLIELRDNTDDSLIITALTDGIKKTT-------STYLLGDLKKVLN 301 (437) T ss_pred eeeeehhhe-eeehhhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccc-------cccchhhHHHHHH Confidence 667766544 44467787766677788999999999999999999999887654332211 1111233444322 Q ss_pred -HHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEecc--ccccccceEEEEE--ec Q lcl|NC_019506. 153 -KLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSNN--MGSLTNGTGAIAG--VK 227 (276) Q Consensus 153 -~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~--lp~~~~~~~~~~~--~~ 227 (276) .|..... .+-..++||..+..|.+... ....+.....+..|..++++|.+|+.+++ +|..+.+...+.+ .. T Consensus 302 ~~l~~~~~--~~~~~~~~~~~~~~l~~lkd--~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~ 377 (437) T protein:vir:10 302 VTLKPQDS--AAASIVMSQSAYNLFDMATD--AMGRPLLQPNVTAATGYTLLGKTVVIVDDKLFPSASAGDVNIVVAPLK 377 (437) T ss_pred hhhhhhhh--cCCEEEEcHHHHHHHHHhhc--cCCCeeeccCccCCCCcccccceeEEecccccCCcCCCceEEEEeecc Confidence 3333322 23467999999998876422 11222333345567778999999998654 4654444433322 23 Q ss_pred ceEEee-eeeeeeeeccCcccceeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 228 MACTFA-EQIVQTEAYRMEKRFADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 228 ~a~~~~-~~~~~~e~~~~~~~~~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) .++... .....++..++-..+.+.+.+.++||+++++|++++.|+...| T Consensus 378 ~~~~~~~r~~~~~~~~~~~~~~~~~~~~~~r~d~~~~~~~a~~~l~~~~~ 427 (437) T protein:vir:10 378 KAVINFKLTEITGQFQDTYDIWYKQLGIFLRQNVVQASKDLIVNLTGKLK 427 (437) T ss_pred ccEEEEeeeceEEEEecccccccceeeEEEEEccEEecccceEEEEeecc Confidence 344333 2233444444445566788888999999999999999986666 No 130 >protein:vir:1383 Length: 421 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612835;genbank:gi:20065969;genbank:GeneID:935826 Probab=99.39 E-value=3.2e-13 Score=89.10 Aligned_cols=256 Identities=10% Similarity=0.020 Sum_probs=163.8 Q ss_pred Ccc----chhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCccc---ceeecCCCCCCCccccccce Q lcl|NC_019506. 1 MAV----TSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGAIT---VKEYTENSDIDAPEELSTTE 73 (276) Q Consensus 1 MA~----~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~~~---~~d~~~~~~~~~~~~~~~~~ 73 (276) +.. ..++|+.|...+++.+++..++.++++.- + ..+.++++|+..... +....++..... ..++... T Consensus 116 ~~t~~~gg~liP~~~~~~Ii~~~~~~~~l~~l~~~~----~-~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~-s~~~f~~ 189 (421) T protein:vir:13 116 IMSSTNNGAVIPQEFVNEFEKLKEGYPSLKEHCHVI----P-VNRNAGKMPVRAGASVDKLANLAKDTELVK-AMLKTQP 189 (421) T ss_pred ccccCCcceecchhhHHHHHHHHHhhhhhhhhceee----e-ccCCceEEEEeecCCccceeeccccccccc-cccceeE Confidence 111 24789999999999999998888887531 1 234567777654332 334445554433 4567777 Q ss_pred EEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccCCHHHHHHHHHHHHHH Q lcl|NC_019506. 74 KVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSKLKPAATLDKTNIYEELIKVKVK 153 (276) Q Consensus 74 ~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~~~ 153 (276) +++.+.+. +.-+.|+++-...+..++...+.+..+++++..+|..+++........ .+. ..++.|.++... T Consensus 190 i~~~~~k~-~~~v~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~i~~~~~g~~~~----~~~----~~~d~i~~~~~~ 260 (421) T protein:vir:13 190 MAYDIDDY-GLLAPIDNSLLEDSEINFLEFVNEEFAEFAVNTENAEIVKQAKAVLAE----ETI----NDYAGLVKTINS 260 (421) T ss_pred EEeeeeee-EeehhhhHHHHhhhHHHHHHHHHHHHHHHHHHHhhhhHhhhhhhcccc----ccc----cchHHHHHHHHH Confidence 77777655 444678887777777889999999999999999999888765433211 111 225677777777 Q ss_pred HhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEeccccccccceE-EEEEe-cceEE Q lcl|NC_019506. 154 LDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSNNMGSLTNGTG-AIAGV-KMACT 231 (276) Q Consensus 154 l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp~~~~~~~-~~~~~-~~a~~ 231 (276) +.....+. -.+|+||..+..|.....- ...+... ....|..++++|.+|+.++++|..+.+.. ++.+. +.++. T Consensus 261 l~~~~~~~--a~~v~n~~~~~~l~~lkd~--~G~~i~~-~~~~~~~~tl~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~ 335 (421) T protein:vir:13 261 LVPNARKR--AIIVTNSDGRAYLDGLMDK--QGRPLLK-ELSDGGDLVFKGRPVIELEESIFDVGDETKFIVSDFKTLIK 335 (421) T ss_pred hhhhhcCC--CEEEEcHHHHHHHHHhhcC--CCceeec-CcCCCCCceecceeeEEeccccccCCCceEEEEEeccccEE Confidence 77665432 3688999999998764221 1112221 13456667899999999998885443332 23332 33333 Q ss_pred ee-eeeeeeeeccCcccc--eeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 232 FA-EQIVQTEAYRMEKRF--ADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 232 ~~-~~~~~~e~~~~~~~~--~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) +. .+...++..+..... -..+++..++|.++++|+++..+...-| T Consensus 336 ~~~~~~~~v~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~ 383 (421) T protein:vir:13 336 FMDRKQYLIDQSKEAGYTKNETIARIIERFDVNSPLDKSSDAEKIRKF 383 (421) T ss_pred EEEecceEEEeecccccccCeeEEEEEeeecceeecchhhheeeeccc Confidence 33 333345544443211 2478889999999999999877766654 No 131 >protein:vir:80446 Length: 367 # NCBI annotation: BcepGomrgp07 # Family: family:all:1522 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210227;genbank:gi:146329919;genbank:GeneID:5123555 Probab=99.39 E-value=8.4e-14 Score=92.28 Aligned_cols=265 Identities=12% Similarity=0.032 Sum_probs=169.4 Q ss_pred Ccc---c-----hhhHHHHHHHHHHHHHHhhcchh--hhccc--cccccccCCcEEEEeccCcccc--eeecCCCCC--C Q lcl|NC_019506. 1 MAV---T-----SFIPKLWSARLLAHLDKAHVVAN--LVNRD--YEGEIKAYGDTVKINQIGAITV--KEYTENSDI--D 64 (276) Q Consensus 1 MA~---~-----~l~~e~~~~~~~~~l~~~~v~~~--~~~~~--~~~~~~~~Gdtv~ip~~~~~~~--~d~~~~~~~--~ 64 (276) ||- . +|+||+|...+.+.-.+.+-|.. .+.++ +......+|++|++|.++.+.- ..+...++. . T Consensus 1 M~~~~~~T~l~Dii~pEvF~~Yv~~~~~e~~~l~qSGiv~~d~~l~~~~~~gG~~v~iPf~~~L~g~~~n~~~d~~~~~~ 80 (367) T protein:vir:80 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNVEA 80 (367) T ss_pred CcchhhhhhhhhccchhhhhHHHhhhhhhhhhhhhcceeecCHHHHHHhhcCCCEEEeeeeccCCCCccccCCCCCcccc Confidence 992 2 48999999999988765544433 22222 1112236799999999988742 223222221 1 Q ss_pred CccccccceEEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc------------- Q lcl|NC_019506. 65 APEELSTTEKVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSK------------- 131 (276) Q Consensus 65 ~~~~~~~~~~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~------------- 131 (276) .+..++..+....+ .++.+++..+|+-...+-.|+++.+..+.+.--.+.....+++.++...... T Consensus 81 t~~kittg~~~a~v-~~r~kaw~~~Dla~~lsG~dpm~~Ia~qva~yW~r~~q~~Lla~L~Gvf~~~~a~~~~~~~~~~~ 159 (367) T protein:vir:80 81 PIDGLGSGEMKTTK-TWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTRGR 159 (367) T ss_pred cccccccchheeee-ehhcccchhhhHHHHhhCchHHHHHHHHHHHHhhhhhHHHHHHHHHHhhccccccchhhhhhhhc Confidence 23455554444444 5678888889998887778999999999888888888888887665322110 Q ss_pred -------------cc-ccccCC--HHHHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhccccccccee Q lcl|NC_019506. 132 -------------LK-PAATLD--KTNIYEELIKVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITK 195 (276) Q Consensus 132 -------------~~-~~~~~t--~~~~~~~i~~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~ 195 (276) .. .+.+.+ ..-..+.|.+|...|.+++ ..=..++|||..+..|.+.. ++..-.... . T Consensus 160 ~~a~~~~~~~~~~~Dis~~t~~~~~~~s~~~~~~A~~~lGD~~--~~l~~i~mHS~V~~~L~~~~-li~~i~~sd----~ 232 (367) T protein:vir:80 160 VPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHV--GSIAAIAVHSMVYKRMTNND-EIEFIPDSK----G 232 (367) T ss_pred cccccccccCceeeeeeccCCCccceecHHHHHHHHHHhcccc--ccccEEEEchHHHHHHHhcc-ccccccCCC----C Confidence 00 001111 1122567899999998764 33357899999999998874 433322221 1 Q ss_pred eeeeeEEeceEEEEeccccccc----cceEEEEEecceEEeeeeee--eeeeccCcccc-e---eeEEeeeeeeeEEEcC Q lcl|NC_019506. 196 NGFVGTILGFDVYLSNNMGSLT----NGTGAIAGVKMACTFAEQIV--QTEAYRMEKRF-A---DAVKGLNVFGCKVIYP 265 (276) Q Consensus 196 ~G~i~~~~G~~v~~s~~lp~~~----~~~~~~~~~~~a~~~~~~~~--~~e~~~~~~~~-~---~~i~~~~~yg~~v~~~ 265 (276) +..|+.+.|..|+++..+|+.. ..+.++.+-.+|+++..... .+|..|++... + |.+..+.+ .++.| T Consensus 233 ~~~i~ty~G~~VIvDD~~Pv~~~~a~~~yttYlfg~GAi~~~~~~~~~~~E~~Rd~~~~~~gG~d~L~~Rr~---~~~hP 309 (367) T protein:vir:80 233 QLTIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGGAAFGYADGAPQVPVAVGRRELRGNGSGLEYILERKE---WIVHP 309 (367) T ss_pred ccccceecceeEEEeCCCcccccCCCceEEEEEEecceeeecccCCccceecccchhhhcCCceEEEEeeee---EEeec Confidence 3468999999999999999753 34567888999999886543 45778887642 2 44444444 56677 Q ss_pred CeEEEEEec------------------CC Q lcl|NC_019506. 266 DALVCLKKT------------------NP 276 (276) Q Consensus 266 ~~vv~~~~~------------------~p 276 (276) -|+--...+ .| T Consensus 310 ~G~s~~~~~v~~~~~~~~~~~~~~~~~sP 338 (367) T protein:vir:80 310 GGFNWLDADVTIPDNTGSPSGITSGPPAI 338 (367) T ss_pred ceeeecccccccccccccccccccccCCC Confidence 777665443 23 No 132 >protein:vir:80376 Length: 435 # NCBI annotation: gp6, major capsid head protein # Family: family:all:21 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111085;genbank:gi:134288639;genbank:GeneID:4960624 Probab=99.39 E-value=8.7e-13 Score=86.71 Aligned_cols=264 Identities=13% Similarity=0.084 Sum_probs=157.0 Q ss_pred Cc--------cchhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCc-ccceeecCCCCCCCcccccc Q lcl|NC_019506. 1 MA--------VTSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGA-ITVKEYTENSDIDAPEELST 71 (276) Q Consensus 1 MA--------~~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~-~~~~d~~~~~~~~~~~~~~~ 71 (276) ++ .-+++|+.|...+++.+++.+++..+..+- .+...| .+.+|+... ..+..+.+++.... .+++. T Consensus 130 ~~~~~~~~~~gg~lvP~~~~~~ii~~l~~~~~i~~~~~~~---v~~~~~-~~~~p~~~~~~~a~~v~E~~~~~~-~~~~f 204 (435) T protein:vir:80 130 MSLNTLSPGAGGVLVPENLSSEVIELLRPKSVVRKLGART---LPLSNG-NITIPRLKGGAIVGYIGADTDIPT-TQQQF 204 (435) T ss_pred hhhcccCCCCCccccchhHHHHHHHHHhhhchhhhcccee---eecCCC-ceEEEEEeCCcceeeeccCccccc-cccce Confidence 21 113789999999999999888887762221 122223 588887743 34555666665543 45667 Q ss_pred ceEEEEEEeeeecceeechHHHHhhh--hhHHHHHHHHHHHHHHHHHHHHHHHHhhcccc-----------ccccccccC Q lcl|NC_019506. 72 TEKVLEINKQKYFNFQIDDVDAAQIR--TPLMDAAMQRAAYALADETEKILLKEMDTNAT-----------SKLKPAATL 138 (276) Q Consensus 72 ~~~~~~ld~~~~~~~~v~d~d~~~~~--~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~-----------~~~~~~~~~ 138 (276) ..+++.+.+. +.-+.|+++-...+. .++.+.+.++.++++++++|+.++..-..+.. ......... T Consensus 205 ~~i~~~~~k~-~~~~~is~ell~ds~~~~~l~~~i~~~l~~a~~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~~ 283 (435) T protein:vir:80 205 DDLKLTAKKM-AALVPIANDLIKYAGVNPNVDQIVVGDLTAAIGAREDKAFIRDDGTANTPKGLRFWALPGNVITASDGS 283 (435) T ss_pred eeEEEeeEEE-EEeehhhHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCcccceeecccccceeeccccc Confidence 7777777554 445677776554443 46778889999999999999988854222110 011112223 Q ss_pred CHHHHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEecccccccc Q lcl|NC_019506. 139 DKTNIYEELIKVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSNNMGSLTN 218 (276) Q Consensus 139 t~~~~~~~i~~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp~~~~ 218 (276) +...++..+.++...|........+-.+++||..+..|.+...- . +...+....-++++|++|+.++.+|.... T Consensus 284 ~~~~~~~d~~~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~--~----G~~l~~~~~~~~l~G~pv~~~~~~p~~~~ 357 (435) T protein:vir:80 284 TLQKIETDLGKAILALENADANLTQPGWIMAPRTFRFLEGLRDG--N----GNKVYPELANGMLKGYPVGKTTQVPINLG 357 (435) T ss_pred chhhHHHHHHHHHHHhhccccccccCEEEEcHHHHHHHHhhhcc--C----CceeccCCCCCeEeeeeeEEecccccccc Confidence 33344556777777776665444445679999999988654321 1 11111111124789999999999986421 Q ss_pred ----ceEEEEEecceEEeee-eeeeeeeccCcc----------cc---eeeEEeeeeeeeEEEcCCeEEEEEec-CC Q lcl|NC_019506. 219 ----GTGAIAGVKMACTFAE-QIVQTEAYRMEK----------RF---ADAVKGLNVFGCKVIYPDALVCLKKT-NP 276 (276) Q Consensus 219 ----~~~~~~~~~~a~~~~~-~~~~~e~~~~~~----------~~---~~~i~~~~~yg~~v~~~~~vv~~~~~-~p 276 (276) ....+++.-+-+.+.. +...++..+... .| ...+++..++|+++.+|++++.++-. -| T Consensus 358 ~~~~~~~i~~gd~s~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~f~~n~~~~r~~~r~d~~~~~~~a~~~l~~~~~~ 434 (435) T protein:vir:80 358 EAGKESEIYFTDFGDVFIGEEETLEIDYSKEATYKDADGHMVSAFQRDQTLIRVIAKNDFGPRHVESIAVLSGVAWG 434 (435) T ss_pred CCCCcceEEEEEcccEEEEeecceEEEEeccccccccccchhhhhhcCcceeeeeeeeCcEeecccceEEEeccCCC Confidence 1123333322222222 222333322211 11 35778999999999999999999733 33 No 133 >protein:vir:5739 Length: 366 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892050;genbank:gi:33770513;interpro:IPR006444;uniprot:Q7Y410;genbank:GeneID:1732928 Probab=99.39 E-value=8.3e-13 Score=86.81 Aligned_cols=264 Identities=14% Similarity=0.089 Sum_probs=155.3 Q ss_pred Cccc-------hhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCc-ccceeecCCCCCCCccccccc Q lcl|NC_019506. 1 MAVT-------SFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGA-ITVKEYTENSDIDAPEELSTT 72 (276) Q Consensus 1 MA~~-------~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~-~~~~d~~~~~~~~~~~~~~~~ 72 (276) |+.+ .++|+.+..++.+.+++.+++..+--+. .+...| .+++|+... ..+..+.++..... .+++.+ T Consensus 64 ~a~~~~~~~Gg~lvP~~~~~~ii~~l~~~s~l~~lg~~~---v~~~~g-~~~~p~~t~~~~a~wv~E~~~~~~-s~~~f~ 138 (366) T protein:vir:57 64 MAISTAAGSGGALIPQNMQNEVIELLRDRTVVRILGARS---IPLPNG-NLSMPRLSGGATAGYVGEGKDVVA-TGATFD 138 (366) T ss_pred hhccccccCCccccchhHHHHHHHHHhhhcchhhhceee---eecCCC-ceEEEEEeCCcceeeeccCccccc-ccccee Confidence 3321 3689999999999999988887762222 122334 588888744 45666677766654 456777 Q ss_pred eEEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc-------------ccccccCC Q lcl|NC_019506. 73 EKVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSK-------------LKPAATLD 139 (276) Q Consensus 73 ~~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~-------------~~~~~~~t 139 (276) .+++...+. +.-+.|+++-..++..++.+.+.++.++++++++|+.++..-..+.... ...++..+ T Consensus 139 ~i~~~~~k~-~~~~~iS~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~t~~~ 217 (366) T protein:vir:57 139 DVKLSAKTM-IALVPVSNQLIGRAGFNVEQLLLGDILSAIATREDKAFLRDDGTGDTPKGMKAVATAANRLVAWTGTAIN 217 (366) T ss_pred EEEEeeEEE-EEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccceeeccccccceeeccccccc Confidence 777777554 4456788877777778998999999999999999998875432211100 01111222 Q ss_pred HHHHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEecccccccc- Q lcl|NC_019506. 140 KTNIYEELIKVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSNNMGSLTN- 218 (276) Q Consensus 140 ~~~~~~~i~~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp~~~~- 218 (276) ...+.+.+..+............+-..+++|..+..|.+... . . +...+....-++++|++|+.++.+|.... T Consensus 218 ~~~~~~~~~~~~~~~~~~~~~~~~a~~vmn~~~~~~L~~lkd---~--~-G~~l~~~~~~g~l~G~Pvv~s~~ip~~~~~ 291 (366) T protein:vir:57 218 LTTIDEYLDSLILKHMDSNSNMIRCGWGLSNRTYMTLFGLRD---G--N-GNKVYPEMSQGILKGYPIQRTSAIPANLGD 291 (366) T ss_pred hhhHHHHHHHHHHhhhccccccccCEEEecHHHHHHHHhhhc---c--C-CceeccCCCCCeecceeeEEcccccccccc Confidence 222222222222222222222233357899999998876421 1 1 11122222235789999999999996432 Q ss_pred ---ceEEEEEecceEEeee-eeeeeeeccCc-----c--------cceeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 219 ---GTGAIAGVKMACTFAE-QIVQTEAYRME-----K--------RFADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 219 ---~~~~~~~~~~a~~~~~-~~~~~e~~~~~-----~--------~~~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) ....+++.-+-+.+.. ....++..+.. . .-...++...+++..+.||+++++++...= T Consensus 292 ~~~~~~i~~gdfs~~~i~~~~~i~i~~~~ea~~~~~~g~~~~~f~~~~~~iR~~~~~d~~v~~~~a~~~lt~~~~ 366 (366) T protein:vir:57 292 DGNESEIYFCDFNDVVIGEDGMMKVDFSTEATYKDADGQLVSAFARNQSLIRVVTEHDIGFRHPEGLVLGTGVIW 366 (366) T ss_pred CCCccEEEEEecceEEEEEecceEEEEeeccccccccccchhhhhcCceeEEeeeeeCcEeeccccEEEEecccC Confidence 1223333333332222 11223322221 1 112578899999999999999999976555 No 134 >protein:vir:102119 Length: 404 # NCBI annotation: phage major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699941;genbank:gi:110804052;genbank:GeneID:4206662 Probab=99.38 E-value=4.9e-13 Score=88.06 Aligned_cols=268 Identities=13% Similarity=0.053 Sum_probs=160.0 Q ss_pred Ccc------chhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEecc-CcccceeecCCCCCCCc-cccccc Q lcl|NC_019506. 1 MAV------TSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQI-GAITVKEYTENSDIDAP-EELSTT 72 (276) Q Consensus 1 MA~------~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~-~~~~~~d~~~~~~~~~~-~~~~~~ 72 (276) |.. ..++|+.|...+++.+++.+++..++....- . ...-++.+|+. +...+..+.++...... ..++.. T Consensus 110 ~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~l~~~~~~--~-~~~g~~~~~~~~~~~~~~~v~e~~~~~~~~~~~~f~ 186 (404) T protein:vir:10 110 ISENIDEDGGYAVPEDIQTKINTRLKDTTDLYNMVDYEPV--F-TRSGSRTYEKRSKQKPMKPLSENQQIPTNGDNGKLE 186 (404) T ss_pred hccccCCCCceeechhHHHHHHHHHhhhhhHhhhhceeec--c-CCccceEEEEecCCcceeecccccccccccccccee Confidence 321 1367999999999999999999888754211 1 11234556654 44456666666554321 234556 Q ss_pred eEEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc------cccccCCHHHHHHH Q lcl|NC_019506. 73 EKVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSKL------KPAATLDKTNIYEE 146 (276) Q Consensus 73 ~~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~~------~~~~~~t~~~~~~~ 146 (276) .++++..+. +.-+.|+++-...+..++.+.+.+..++++++.+|..++....++....+ ....+.+....++. T Consensus 187 ~i~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~il~G~g~~~~~~gi~~~~~~~~~~~~~~~~~~~ 265 (404) T protein:vir:10 187 RFNFKLKDL-ADFMSIPNDLLKFADKSLEDWIINWFVDKVRITRNAEILYGAGGDEHATGIMTANKFKKITLPKSPALKD 265 (404) T ss_pred eeEeeheee-EeeehhhHHHHhhcHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCcccceeeccccceeeccccccHHH Confidence 667776554 44467888766667789999999999999999999998865443221111 11111222233556 Q ss_pred HHHHHH-HHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEe-ccccccccceE-EE Q lcl|NC_019506. 147 LIKVKV-KLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLS-NNMGSLTNGTG-AI 223 (276) Q Consensus 147 i~~a~~-~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s-~~lp~~~~~~~-~~ 223 (276) +..+.. .+.... ..+-.+++||..+..|.+..... ..+.....+..|..++++|.+|+.. +.+|..+.+.. ++ T Consensus 266 ~~~~~~~~l~~~~--~~~~~~v~n~~~~~~L~~lkd~~--G~~l~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~~ 341 (404) T protein:vir:10 266 FKKCKNVELLNVF--KATSSWIVNQDGFNYLDSLEDKT--GRPYLQPDPKDPTQYRFLGLPVIELPNDLLLSTESAIPVL 341 (404) T ss_pred HHHHHHhhhhccc--cCCCEEEEcHHHHHHHHHhhccC--CceeeccCcCCCCCccccceeeEEecccccCCCCCccEEE Confidence 655543 233222 12336799999999987643211 1222233355677788999999853 44444333332 33 Q ss_pred EE-ecceEEee-eeeeeeeeccCcc----cceeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 224 AG-VKMACTFA-EQIVQTEAYRMEK----RFADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 224 ~~-~~~a~~~~-~~~~~~e~~~~~~----~~~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) ++ .+.++... .....++..+... .....+++..++|+.+++|+++++++.++- T Consensus 342 ~gd~s~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~a 400 (404) T protein:vir:10 342 LGDTKEAYKYVSDGAYELATTNIGAGAFETNTTKARIIMRIDGNVKDSEALLIAEIPVE 400 (404) T ss_pred EEeccccEEEEEecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEeecc Confidence 33 24344333 2222343333221 224578999999999999999999986655 No 135 >protein:vir:81227 Length: 413 # NCBI annotation: gp6, major capsid protein # Family: family:all:585 # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456736;genbank:gi:157168379;hssp:P49861;interpro:IPR006444;uniprot:Q9MBJ9;genbank:GeneID:5580350 Probab=99.38 E-value=8.7e-13 Score=86.71 Aligned_cols=263 Identities=17% Similarity=0.089 Sum_probs=155.3 Q ss_pred Cc------cchhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCcc-----cceeecCCCCCCCcccc Q lcl|NC_019506. 1 MA------VTSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGAI-----TVKEYTENSDIDAPEEL 69 (276) Q Consensus 1 MA------~~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~~-----~~~d~~~~~~~~~~~~~ 69 (276) ++ ...++|+.|...+++.+++.+.+..++..- ...|.++.+|+.... .+..+.+++.....+.. T Consensus 118 ~~~~~~~~~~~~vp~~~~~~ii~~~~~~~~l~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~ 192 (413) T protein:vir:81 118 STATLTDEFQGGYGTTWNRNIIYRRREKLVVADLMDNL-----TMTNTTIKYLMEKANRVVEGGFKTVAEGGKKPYMRFA 192 (413) T ss_pred hhcccccccccccchhhHHHHHHHHhhhhhHHhhccee-----eccCCceeEEEeccccccccccceecCcccccccCcc Confidence 11 124679999999999999999888887532 123556777764322 24455565554322223 Q ss_pred ccceEEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc-------ccccccCCHHH Q lcl|NC_019506. 70 STTEKVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSK-------LKPAATLDKTN 142 (276) Q Consensus 70 ~~~~~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~-------~~~~~~~t~~~ 142 (276) ..+.+++.+.+. +.-+.|+++-.. ....+...+.+..++++++++|..++..-.++.... ....+..+... T Consensus 193 ~f~~i~~~~~k~-~~~~~iS~ell~-ds~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~~~~Gi~~~~~~~~~~~~~~~~ 270 (413) T protein:vir:81 193 DFDIVTESLSKI-AGLTKITDEMIE-DYDFLVSYINARLLEELAIEEERQLLLGDGTGNNLTGLLKRDGIQTLAVSNKDE 270 (413) T ss_pred cceeeEeeeeeE-EEeehhhHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCcccccccccccccccccccch Confidence 456777777655 344678876443 345566777778899999999998885422211100 00111223344 Q ss_pred HHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccc-------eeeeeeeEEeceEEEEeccccc Q lcl|NC_019506. 143 IYEELIKVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESI-------TKNGFVGTILGFDVYLSNNMGS 215 (276) Q Consensus 143 ~~~~i~~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~-------~~~G~i~~~~G~~v~~s~~lp~ 215 (276) .++.+.++...+.....-..+ .+++||..+..|.+...-.. .+..... ...+..++++|.+|+.|+.+|. T Consensus 271 ~~~~i~~~~~~~~~~~~~~~~-~~vmn~~~~~~l~~lkd~~G--~~l~~~~~~~~~~~~~~~~~~~l~G~pv~~s~~~~~ 347 (413) T protein:vir:81 271 LADSIYKAMTNISLATPFQAD-ALVINPLDYQELRLAKDANG--QYYGGGVFQGQYGSGGIMLDPAPWGLRTVQSQVVPV 347 (413) T ss_pred hHHHHHHHHHHhhhhccCCCc-EEEEcHHHHHHHHHhhccCC--ceeccccccccccccccccCceecceeeEEcCCCCc Confidence 567777777666544332233 37899999998865432111 1111111 1112235799999999999984 Q ss_pred cccceEEEEEecceEEeee-eeeeeeeccCc-ccc---eeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 216 LTNGTGAIAGVKMACTFAE-QIVQTEAYRME-KRF---ADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 216 ~~~~~~~~~~~~~a~~~~~-~~~~~e~~~~~-~~~---~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) +...+...+.++-... ....++..+.. ..| ...+++..++++.+.+|+++++++.+.| T Consensus 348 ---~~~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~~ 410 (413) T protein:vir:81 348 ---GKPVVGAFRSAASVLRKGGVRIDSTNTNVDDFENNLITVRAEERVGLMVTFPEAIVQLDVAEV 410 (413) T ss_pred ---ccEEEEecccEEEEEEecceEEEEeccccchhhcCcEEEEEEEeeccEEecccceEEEEecCC Confidence 3334444444433332 22234433322 122 3477888999999999999999997776 No 136 >protein:vir:96762 Length: 632 # NCBI annotation: putative phage-related protein # Family: family:all:21 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039818;genbank:gi:126010917;genbank:GeneID:5076272 Probab=99.38 E-value=1.1e-13 Score=91.58 Aligned_cols=259 Identities=12% Similarity=0.075 Sum_probs=159.3 Q ss_pred Ccc------chhhH-HHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCc-ccceeecCCCCCCCccccccc Q lcl|NC_019506. 1 MAV------TSFIP-KLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGA-ITVKEYTENSDIDAPEELSTT 72 (276) Q Consensus 1 MA~------~~l~~-e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~-~~~~d~~~~~~~~~~~~~~~~ 72 (276) |.. ..++| +++++.+++.+++.+++..+-.+-. +...| .++||+... ..+..+.+++.... .+++.. T Consensus 357 ~~~~t~~~gg~lvp~~~~~~~iie~lr~~s~i~~l~~~~~---~~~~g-~~~ip~~~~~~~a~wv~E~~~~~~-s~~~f~ 431 (632) T protein:vir:96 357 LEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARML---PGLVG-DVDIPKKTSGANFYWIGEDEDVQD-SDFDFT 431 (632) T ss_pred hhcccccccccccccccchHHHHHHHhhcchhhhhcceEe---ecCCc-ceEEEEEeCCceeEeecCCccccc-ccccee Confidence 111 12555 5568889999998888877622221 22223 588887643 45666677766654 456777 Q ss_pred eEEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc------ccc-ccCCHHHHHH Q lcl|NC_019506. 73 EKVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSKL------KPA-ATLDKTNIYE 145 (276) Q Consensus 73 ~~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~~------~~~-~~~t~~~~~~ 145 (276) .+++...+. +.-+.|+.+-..++..++...+....+.+++.++|..++..-..+....+ ... +..+....++ T Consensus 432 ~i~l~~~k~-~~~v~iS~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~~~~~ 510 (632) T protein:vir:96 432 TLSFSPKTI-AGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTYPAGGVDWA 510 (632) T ss_pred eEEeeeeEE-EEehhhHHHHHhccchHHHHHHHHHHHHHHHHHHHHHhhcccCCCCccceeeecccccceecccccCCHH Confidence 778777554 44467777777777788888889999999999999998854332211100 000 1111222366 Q ss_pred HHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEeccccccccceEEEEE Q lcl|NC_019506. 146 ELIKVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSNNMGSLTNGTGAIAG 225 (276) Q Consensus 146 ~i~~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp~~~~~~~~~~~ 225 (276) .+.++...+...++...+-..+++|..+..|..... . +. .+...+.. +.+.|.+++.++.+|... ..+.. T Consensus 511 ~i~~~~~~i~~~~~~~~~~~~~~~~~~~~~l~~~~l-~--d~-~G~~i~~~---~~l~G~pv~~s~~ip~~~---~~~gd 580 (632) T protein:vir:96 511 SVVDMETKISTFNADAGRLAYLTSVTQRGAAKKAQV-F--DN-TGERIWQN---NEVNGYRAEASNQIPADT---WIFGD 580 (632) T ss_pred HHHHHHHHHhhcccccCccEEEEchhHHHHHHHHhc-c--CC-CCceeecC---CeecccceEeccccccCc---EEEee Confidence 788888888877765555567899998887765321 1 11 12223333 467999999999998532 22221 Q ss_pred ecce-EEeeeeee-eeeeccCcccceeeEEeeeeeeeEEEcCCeEEEEEecC Q lcl|NC_019506. 226 VKMA-CTFAEQIV-QTEAYRMEKRFADAVKGLNVFGCKVIYPDALVCLKKTN 275 (276) Q Consensus 226 ~~~a-~~~~~~~~-~~e~~~~~~~~~~~i~~~~~yg~~v~~~~~vv~~~~~~ 275 (276) .... ++....+. .+..+.....-...++...+++.++++|+++++++.+| T Consensus 581 ~s~~~i~~~~~~~i~~~~~~~~~~~~v~~~~~~~~d~~v~~~~af~~~k~~A 632 (632) T protein:vir:96 581 WSQIVIAMWGVLDLKVDPYTKAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) T ss_pred cceEEEEEecceEEEEccccccccCceEEEEEeecCceeechhhhhheeecC Confidence 1221 22222111 11112122223457889999999999999999999999 No 137 >protein:vir:78640 Length: 352 # NCBI annotation: phage capsid # Family: family:all:658 # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429943;genbank:gi:156603997;genbank:GeneID:5525386 Probab=99.38 E-value=1.1e-13 Score=91.59 Aligned_cols=252 Identities=10% Similarity=0.014 Sum_probs=153.3 Q ss_pred Cc------cchhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEecc--CcccceeecCCCCCCCccccccc Q lcl|NC_019506. 1 MA------VTSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQI--GAITVKEYTENSDIDAPEELSTT 72 (276) Q Consensus 1 MA------~~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~--~~~~~~d~~~~~~~~~~~~~~~~ 72 (276) |. .-+++|+-+..++++.++..+.+..+++. . ...|. ++|.. +..++....++..... .+++.+ T Consensus 83 l~~~~~~~gG~lIP~~~~~~Ii~~l~~~s~l~~~~~v--~---~~~~~--~~p~~~~~~~~a~~v~E~~~~~~-~~~~f~ 154 (352) T protein:vir:78 83 LPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARL--T---NIKGL--EIPRVSYTLDDDDFITDVETAKE-LKLKGD 154 (352) T ss_pred hccCCCCCCceeccHhHHHHHHHHHHhhcchhhheee--E---ecCCc--eEEEEecCCCccccccccccccc-ccccce Confidence 22 12488999999999999988888887753 1 12233 34443 2234566666665543 456777 Q ss_pred eEEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc----cccccCCHHHHHHHHH Q lcl|NC_019506. 73 EKVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSKL----KPAATLDKTNIYEELI 148 (276) Q Consensus 73 ~~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~~----~~~~~~t~~~~~~~i~ 148 (276) .+++...+. +.-+.|+++-...+..|+.+.+.+..+++++...++.++..........+ ......++.+.++.|. T Consensus 155 ~v~~~~~k~-~~~i~is~ell~Ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~~~~~g~l~~~~~~~~t~~~~~d~i~ 233 (352) T protein:vir:78 155 TVKFTTNKF-KVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKSGLEHMSFYNGSVKEVEGANMYDAII 233 (352) T ss_pred eeeecceeE-EeechhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhhhhcCCCCcccccceeccccccccccchHHHHH Confidence 777777655 33478888877777889999999999999988756656543222111111 1112234444578888 Q ss_pred HHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEeccccccccceEEEEEecc Q lcl|NC_019506. 149 KVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSNNMGSLTNGTGAIAGVKM 228 (276) Q Consensus 149 ~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp~~~~~~~~~~~~~~ 228 (276) ++...|..... .+-..++++..+..|++... + + +..+..|.-.+++|.+|+.++..+.. .+..-+. T Consensus 234 ~~~~~l~~~~~--~~a~~~mn~~t~~~l~~~~~--~----~-~~~~~~~~~~~llG~PV~~~~~~~~~-----~~Gdf~~ 299 (352) T protein:vir:78 234 NALADLHEDYR--DNATIYMRYADYVKIISVLS--N----G-TTNFFDTPAEKVFGKPVVFTDAAVKP-----IVGDFNY 299 (352) T ss_pred HHHhccChhhh--cCCEEEEehHHHHHHHHHHh--c----c-CCcccccCCccccccceEEecCCCce-----eEeehhh Confidence 88777665543 34467889988877765321 1 1 12234555668999999998766431 1111111 Q ss_pred eEEeeeeeeeeeeccCcccceeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 229 ACTFAEQIVQTEAYRMEKRFADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 229 a~~~~~~~~~~e~~~~~~~~~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) +.. ..+...++.+++...--..+.+..++|+++++|+++++++.++. T Consensus 300 ~~~-~~~~~~~~~~~~~~~g~~~f~~~~r~Dg~~~~~eA~~~l~~~a~ 346 (352) T protein:vir:78 300 FGI-NYDGTTYDTDKDVKKGEYLFVLTAWYDQQRTLDSAFRIAKAKES 346 (352) T ss_pred hhh-hhhhheeeeeccccCCeeEEEEEeeeCceeechhheEEEEeecc Confidence 111 11111233333333223567788999999999999999987776 No 138 >protein:vir:7409 Length: 408 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839926;genbank:gi:30089896;genbank:GeneID:1260683 Probab=99.38 E-value=5.4e-13 Score=87.85 Aligned_cols=260 Identities=11% Similarity=0.051 Sum_probs=159.2 Q ss_pred Ccc------chhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCcc-cce-eecCCCCCCCccccccc Q lcl|NC_019506. 1 MAV------TSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGAI-TVK-EYTENSDIDAPEELSTT 72 (276) Q Consensus 1 MA~------~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~~-~~~-d~~~~~~~~~~~~~~~~ 72 (276) |.. ..++|+.|...+++.+++.+.+..+++.-.. ....-++.+++.... ... ...+++.......++.. T Consensus 116 ~~~~~~~~gg~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~ 192 (408) T protein:vir:74 116 ETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESV---STSSGSRVYEKWTDVTPLKAMDEEDGKIPDLDNPRLT 192 (408) T ss_pred hcccccCCCceeechhHhhHHHHHHhhhcchhhhcceeec---cCCcceEEEEeecCCccccccccccccccccccccee Confidence 211 1468999999999999999999888754211 111234556655432 222 33344444322446777 Q ss_pred eEEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccCCHHHHHHHHHHHH- Q lcl|NC_019506. 73 EKVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSKLKPAATLDKTNIYEELIKVK- 151 (276) Q Consensus 73 ~~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~- 151 (276) .+++++.+. +.-+.|+++-...+..++...+.++.+++++.++|+.++........ .++..+ ++.+..+. T Consensus 193 ~i~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~il~G~G~~~~----~~~~~~----~~~i~~~~~ 263 (408) T protein:vir:74 193 IIKYLIKRY-AGIITATNTLLKDTAENILAWLSSWIAKKVVVTRNQAIIAAMGTVPK----KPTIAN----FDDVITMIN 263 (408) T ss_pred eEEeeeeeE-EeeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccccc----cccccc----HHHHHHHHH Confidence 788888664 45578888877778889999999999999999999988865433221 111222 44555443 Q ss_pred HHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEecc--ccccccceEEEE--Eec Q lcl|NC_019506. 152 VKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSNN--MGSLTNGTGAIA--GVK 227 (276) Q Consensus 152 ~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~--lp~~~~~~~~~~--~~~ 227 (276) ..+..... .+-..++||..+..|..... ....+.....+..|.-++++|++|+.+.+ +|..+.+...+. ..+ T Consensus 264 ~~l~~~~~--~~a~~v~n~~~~~~l~~lkd--~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~i~~gd~~ 339 (408) T protein:vir:74 264 TSVDPAII--ATSSLLTNQSGLNKLALVKT--AEGKYLLEPDPTKPNSYLIKGKQVIVVADRWLPNSGSTVYPLYYGDMS 339 (408) T ss_pred Hhhhhhhc--CCCEEEEcHHHHHHHHHhhc--CCCceEeccCcCCCCCceecceeeEEecCcccccccCCcceEEEEehh Confidence 34444433 23468899999999976432 11222233334556667999999987653 565444333332 224 Q ss_pred ceEEeee-eeeeeeeccCc----ccceeeEEeeeeeeeEEEcCCeEEEEEec--CC Q lcl|NC_019506. 228 MACTFAE-QIVQTEAYRME----KRFADAVKGLNVFGCKVIYPDALVCLKKT--NP 276 (276) Q Consensus 228 ~a~~~~~-~~~~~e~~~~~----~~~~~~i~~~~~yg~~v~~~~~vv~~~~~--~p 276 (276) .++.+.. +...++..+.. ......+++..++|+++++|++++.++.+ +| T Consensus 340 ~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~~ 395 (408) T protein:vir:74 340 QAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFTAIAD 395 (408) T ss_pred ccEEEEEecceEEEEeccccchhhcceeeEEEEEeeCcEEecccceEEEEeecccC Confidence 3443332 22233332221 23346788999999999999999998842 22 No 139 >protein:vir:3845 Length: 395 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050151;swissprot:trembl:q9t1f6;genbank:gi:9633043;uniprot:Q9T1F6;genbank:GeneID:1262163 Probab=99.37 E-value=8.1e-13 Score=86.88 Aligned_cols=260 Identities=12% Similarity=0.041 Sum_probs=157.6 Q ss_pred Cc--------cchhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCcc--cceeecCCCCCCCccccc Q lcl|NC_019506. 1 MA--------VTSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGAI--TVKEYTENSDIDAPEELS 70 (276) Q Consensus 1 MA--------~~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~~--~~~d~~~~~~~~~~~~~~ 70 (276) |+ ...++|+.|+..+++.+++..++..+++.- ......| ++.++..... .+....++........++ T Consensus 105 ~~~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~--~~~~~~~-~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~ 181 (395) T protein:vir:38 105 VTSGTTGTGNAGLTIPEDIQLQIRTLTRSFTSLESLANVE--NVTTSHG-SRVYEKLADITPLKDLDDESALIGDNDDPE 181 (395) T ss_pred HhhccCccCCCceecchhHhhHHHHHHHhhcchhhhccee--eccCCcc-eEEEEeeccCCccccccccccccccccccc Confidence 11 124789999999999999999998887531 1111222 3444444332 233344555443223456 Q ss_pred cceEEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccCCHHHHHHHHHHH Q lcl|NC_019506. 71 TTEKVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSKLKPAATLDKTNIYEELIKV 150 (276) Q Consensus 71 ~~~~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~~~~~~~~t~~~~~~~i~~a 150 (276) ...++++..+. +.-+.|+++-...+..++...+.++.++++++.+|..++....++... ++..+ ++.+.++ T Consensus 182 f~~v~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~il~g~g~~~~~----~~~~~----~~~i~~~ 252 (395) T protein:vir:38 182 LTVVKYLIHRY-AGITTVTNTLLKDTVDNIIQWLVNWAAKKDVVTRNAKILEVMGKAPKK----PTISQ----FDNIKDL 252 (395) T ss_pred eeeEEeeeeee-EeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc----ccccc----HHHHHHH Confidence 66777777554 344677877666677889999999999999999999988754443221 11112 3444444 Q ss_pred HH-HHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEecccc--ccccceEEEEEe- Q lcl|NC_019506. 151 KV-KLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSNNMG--SLTNGTGAIAGV- 226 (276) Q Consensus 151 ~~-~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp--~~~~~~~~~~~~- 226 (276) .. .+.... ..+-.+++||..+..|.+...- ...+.....+..|..++++|++|+.+.+.+ ..++....+.+. T Consensus 253 ~~~~l~~~~--~~~a~~v~n~~~~~~L~~lkd~--~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~i~~gd~ 328 (395) T protein:vir:38 253 ENNTLDPAI--ESTSSFITNQSGYNILSKVKDA--DGRYLMQPDVTSPDKYLIDGKPVIRIADKWLPDVSGSHPLYFGDL 328 (395) T ss_pred HHHhhhhhh--cCCCEEEEcHHHHHHHHHhhcc--CCceeeccCcCCCCcceeccceeEEecccccCcCCCcceEEEEec Confidence 32 333322 2344789999999999764221 122223334566777899999999987643 323222233333 Q ss_pred cceEEee-eeeeeeeeccCcc----cceeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 227 KMACTFA-EQIVQTEAYRMEK----RFADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 227 ~~a~~~~-~~~~~~e~~~~~~----~~~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) +.++... .....++..+... .-...+++..++|+++++|++++.++.+++ T Consensus 329 ~~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~ 383 (395) T protein:vir:38 329 KQGITLFDRQQMQIDTTNVGAGSFEHDTTKLRFIDRFDVQLIDDGAFAAASFKTV 383 (395) T ss_pred cccEEEEEecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEeecc Confidence 3333333 2333344443322 223578899999999999999999997766 No 140 >protein:vir:6212 Length: 434 # NCBI annotation: prohead protease # Family: family:all:21 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852592;genbank:gi:31415852;genbank:GeneID:1489210 Probab=99.36 E-value=3.3e-13 Score=88.99 Aligned_cols=264 Identities=13% Similarity=0.106 Sum_probs=157.6 Q ss_pred Ccc-------chhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCc-ccceee---cCCCCCCCcccc Q lcl|NC_019506. 1 MAV-------TSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGA-ITVKEY---TENSDIDAPEEL 69 (276) Q Consensus 1 MA~-------~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~-~~~~d~---~~~~~~~~~~~~ 69 (276) +|. -+++|+.|+..+++.+++.+++..+++.- + ..| .+.+|.... ..+... .++..... .++ T Consensus 141 ~a~~~~t~~GG~lvP~~~~~~Ii~~l~~~~~i~~~~~~~----~-~~~-~~~~p~~~~~~~a~~~~~~~e~~~~~~-~~~ 213 (434) T protein:vir:62 141 RALGLVTGNGSVTIPDFLSKEIITYAQEENFLRRLGTGV----K-TKE-NIKYPVLVKKAEAQGHKNERTNNEMPE-TDI 213 (434) T ss_pred hhhcccccccceecchhhHHHHHHhhhhhhhhhhhccee----c-cCC-ceEEEEEecCCcccceecccccccccc-ccc Confidence 221 14789999999999999999998887542 1 123 467776532 222222 22222222 345 Q ss_pred ccceEEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccc-----cccccccCCHHHHH Q lcl|NC_019506. 70 STTEKVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATS-----KLKPAATLDKTNIY 144 (276) Q Consensus 70 ~~~~~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~-----~~~~~~~~t~~~~~ 144 (276) +...+++...+. +.-+.|+++-...+..++.+.+.+..+++++.++|..++..-.++... ........+....+ T Consensus 214 ~f~~v~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~~~~g~~~~~~~~~~~~~~~~~ 292 (434) T protein:vir:62 214 EFDEIELSPTEF-DALATVTKKLLARTGLPIEQIVMDELKKAYVRKETQYMVNGDEANNINDGALAKKAVEFKTDEKNLY 292 (434) T ss_pred ceeeEEeeheee-EeehhhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccceeecccccccccccchh Confidence 566677776554 334677777777777899999999999999999999988533222111 11111222344457 Q ss_pred HHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhccccccc--ceeeeeeeEEeceEEEEeccccccccceEE Q lcl|NC_019506. 145 EELIKVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAES--ITKNGFVGTILGFDVYLSNNMGSLTNGTGA 222 (276) Q Consensus 145 ~~i~~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~--~~~~G~i~~~~G~~v~~s~~lp~~~~~~~~ 222 (276) +.|.++...|..... .+-..++||..+..|.+...- ...+.... ....|...+++|.+|+.++.+|....+... T Consensus 293 d~l~~l~~~l~~~~~--~~a~~v~n~~~~~~L~~lkd~--~G~~l~~~~~~~~~g~~~tl~G~pV~~~~~~~~~~~~~~~ 368 (434) T protein:vir:62 293 DALVKMKNTPVKEVR--KKARWVLNTAALTKIETMKTD--DGFPLLRPFNQAEGGIGYTLLGFPVEEEDAIDIPDSPDTP 368 (434) T ss_pred hHHHHHHhhcchhhh--cCCEEEEcHHHHHHHHHhhcc--CCCEeeccCCCccCCCCceecceeeEEecCccCccCCCce Confidence 888888877766543 233578999999998664221 11222221 233466668999999999999865443322 Q ss_pred -EE-EecceEEeeee--eeeeeeccCcc--cceeeEEeeeeeeeEEEc-CCeEEEEEe--cCC Q lcl|NC_019506. 223 -IA-GVKMACTFAEQ--IVQTEAYRMEK--RFADAVKGLNVFGCKVIY-PDALVCLKK--TNP 276 (276) Q Consensus 223 -~~-~~~~a~~~~~~--~~~~e~~~~~~--~~~~~i~~~~~yg~~v~~-~~~vv~~~~--~~p 276 (276) ++ +.-+.+....+ ...++...... .-...+++..+++++++. |+.+.+++. ..| T Consensus 369 ~i~~Gdfs~~~i~~~~g~~~i~~~~~~~~~~~~v~~~~~~r~Dgk~i~~~~~~~~~~~~~~~~ 431 (434) T protein:vir:62 369 VFYFGDFSKFYIQDVIGSLEVQKLVELFSRTNRVGFRIWNLLDAQLIHSPFEVPVYKYVLKAP 431 (434) T ss_pred EEEEeeccceEEEEeeceeEEEeehhhhcccCceEEEEEeeecceeecCcccceEEEEEeccC Confidence 22 22222222222 22333322221 112357888999999875 999888764 455 No 141 >protein:vir:962 Length: 397 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076616;genbank:gi:13095724;genbank:GeneID:920264 Probab=99.33 E-value=6e-13 Score=87.60 Aligned_cols=254 Identities=11% Similarity=0.019 Sum_probs=150.1 Q ss_pred CccchhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEec--cCcccceeecCCCCCCCccccccceEEEEE Q lcl|NC_019506. 1 MAVTSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQ--IGAITVKEYTENSDIDAPEELSTTEKVLEI 78 (276) Q Consensus 1 MA~~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~--~~~~~~~d~~~~~~~~~~~~~~~~~~~~~l 78 (276) +....++|+.+...+.+. .....+..+++.- + ..+....+|. .+...+..+.+++......+.....+++++ T Consensus 138 ~~~~~~vp~~~~~~i~~~-~~~~~l~~~~~~~----~-~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~i~~~~ 211 (397) T protein:vir:96 138 VEGGALIPQELLQPQLEP-KDIVDLSKYVRSV----P-VNSASGKFPVISKSGSKMATVQQLEKNPQLANPKMVEIDYSV 211 (397) T ss_pred cccccchhHHHHHHHHHh-hhhhhHHHhhhhc----c-ccccceeEEEEeccCCccccccccccccccccccccceeecH Confidence 333457788888888774 3333445554321 1 1123344443 334445555555554333456667777777 Q ss_pred EeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccCCHHHHHHHHHHHHHHHhhcC Q lcl|NC_019506. 79 NKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSKLKPAATLDKTNIYEELIKVKVKLDEKN 158 (276) Q Consensus 79 d~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~~~l~~~~ 158 (276) .+. +.-+.++..-...+..++.+.+.+..+.+++...|..++....... +.+..+ ++.|.++...+... T Consensus 212 ~~~-~~~~~~s~ell~ds~~~l~~~i~~~l~~~~~~~~~~~i~~g~g~~~-----~~~~~~----~d~~~~~~~~~~~~- 280 (397) T protein:vir:96 212 ATR-RGYIPISQEMIDDASYDVTGLIADEIQDQSLNTKNADIAAVLKTAT-----AKSVVG----VDGLKDLINKEIKK- 280 (397) T ss_pred hHh-hcchhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc-----cccccc----hHHHHHHHHHhhhh- Confidence 554 4446777766666778888888999999999999998886544322 112222 44454444332221 Q ss_pred CCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEeccc-cccccce-EEEEEe-cceEEeeee Q lcl|NC_019506. 159 VPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSNNM-GSLTNGT-GAIAGV-KMACTFAEQ 235 (276) Q Consensus 159 vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~l-p~~~~~~-~~~~~~-~~a~~~~~~ 235 (276) ..+-..|+||..+..|.+... ....+.....+..|..++++|.+|+.++.. +..+.+. ..+++. +.++.+..+ T Consensus 281 --~~~a~~v~n~~~~~~l~~lkd--~~G~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~ 356 (397) T protein:vir:96 281 --VYDVKLFISASMYSELDKLKD--KNGRYLLQDSITAASGKQLLGKEVVVLDDDVIGKSVGNVVGFIGDAKAFASFFDR 356 (397) T ss_pred --hcCcEEEEcHHHHHHHHHhhc--cCCCeEeccCccCCCcccccccceEEecccccCCCCCceEEEEeehhcceEeEee Confidence 123468999999999977432 112233334456677789999999876553 3222222 223322 323333322 Q ss_pred -eeeeeeccCcccceeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 236 -IVQTEAYRMEKRFADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 236 -~~~~e~~~~~~~~~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) ...+... +...+.+.+++.+++|+++++|++++.++.++= T Consensus 357 ~~~~~~~~-~~~~~~~~~~~~~r~d~~~~~~~a~~~~~~~~a 397 (397) T protein:vir:96 357 KQVSVSWV-DNNIYGQLLAGIIRYDVKATDKKAGFYVTFTIG 397 (397) T ss_pred cceEEEEe-cccccceeEEEEEEEccEEecccceEEEEeecC Confidence 2233332 345567788999999999999999999985544 No 142 >protein:vir:4092 Length: 390 # NCBI annotation: major capsid protein a # Family: family:all:635 # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510986;swissprot:trembl:q8w604;genbank:gi:17488508;uniprot:Q8W604;genbank:GeneID:1260361 Probab=99.33 E-value=2.7e-12 Score=83.98 Aligned_cols=259 Identities=13% Similarity=0.001 Sum_probs=154.6 Q ss_pred Cc------cchhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCc-ccceeecCCCCCCCccccccce Q lcl|NC_019506. 1 MA------VTSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGA-ITVKEYTENSDIDAPEELSTTE 73 (276) Q Consensus 1 MA------~~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~-~~~~d~~~~~~~~~~~~~~~~~ 73 (276) ++ ...++|+.+...+.+.+++.+.+..+++.- + ..+....||+... ..+....++.......+.+.+. T Consensus 84 ~~~~~~~~gg~lvP~~~~~~I~~~~~~~s~i~~~~~~~----~-~~~~~~~i~~~~~~~~a~~~~E~~~~~~~~~~~f~~ 158 (390) T protein:vir:40 84 IAGNGFAGVTALLPPTVFERVFEDLTVEHPLLSKINFV----N-TTATTEWIISVGDVATAWWGPLCAEIKEVLDNGFDK 158 (390) T ss_pred HhccCcccCcccccHHHHHHHHHHHHhhhhhhhhceee----e-cCCceeEEEEEcCCcceeeeccccccCcccccccee Confidence 11 124789999999999999999898887542 2 2345677777544 3455555555443324567778 Q ss_pred EEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccc------------cc--cccccccCC Q lcl|NC_019506. 74 KVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNA------------TS--KLKPAATLD 139 (276) Q Consensus 74 ~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~------------~~--~~~~~~~~t 139 (276) +++...+. +.-+.|+++-...+..++.+.+.+..+++++.++|+.++..-..+. .. ........+ T Consensus 159 i~l~~~k~-~~~i~iS~ell~ds~~~l~~~i~~~la~~i~~~~~~a~l~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~t 237 (390) T protein:vir:40 159 IQTGMYKL-SAYIPVCNAMLDLGPSWLDQYVRTILGEAMALGLEAGIVNGSGKDQPIGMMRDLNNVTAGEHPVKTATPLT 237 (390) T ss_pred eEeeeeeE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhhcccCCCccceeeeccccccccccccccccccc Confidence 88888655 4447888888878888999999999999999999998885321110 00 001112233 Q ss_pred HHHHHHHHHHHHHHHhhcCCC-ccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEecccccccc Q lcl|NC_019506. 140 KTNIYEELIKVKVKLDEKNVP-TIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSNNMGSLTN 218 (276) Q Consensus 140 ~~~~~~~i~~a~~~l~~~~vP-~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp~~~~ 218 (276) ..+..+.+......+...... ..+-++++||..+..++..-..... .. +. .+.. . ..+|.+|+.++.+|.. T Consensus 238 ~~~~~~~~~~l~~~~~~~~~~~~~~a~~i~n~~t~~~~l~~~~~~~d-~~-G~-~v~~-~--~~~g~pvv~~~~~p~~-- 309 (390) T protein:vir:40 238 DLTPATLATKVMLPLTDNGKKSVSDAILVINPADYWSKIYAATSYMT-PQ-GV-WVTG-I--LPVPLEIVQSVAVPVG-- 309 (390) T ss_pred hhhHHHHHHHHHHHhhcchhhhhcCceEEEcchhHHHHHHHHhhccC-CC-Cc-cccc-c--CCCceeEEEcCCCCCC-- Confidence 333334444444444433221 2344678999876655543222111 11 11 1111 1 2479999999999852 Q ss_pred ceEEEEEecceEEeee-eeeeeeeccCcccc---eeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 219 GTGAIAGVKMACTFAE-QIVQTEAYRMEKRF---ADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 219 ~~~~~~~~~~a~~~~~-~~~~~e~~~~~~~~---~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) ...+...+. +.... +...++..+ ..+| .+.+++..++|+++++|+++++++.++. T Consensus 310 -~i~~Gd~s~-~~i~~~~~~~v~~~~-~~~f~~~~~~~r~~~r~dg~v~~~~A~~~l~~~~~ 368 (390) T protein:vir:40 310 -KAVAGRAKD-YFMGIGSEQVIRTST-EYRLLDDETLYYAKQYANGRPKDNSSFLVFDITGL 368 (390) T ss_pred -cEEEEeece-EEEEeecceEEEecc-hhhhhcCcEEEEEEEEeCCEEecccceEEEEeecc Confidence 232322233 32222 222343332 2222 3678999999999999999999976555 No 143 >protein:vir:9927 Length: 295 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795689;genbank:gi:28876459;genbank:GeneID:1258000 Probab=99.33 E-value=7.3e-14 Score=92.59 Aligned_cols=252 Identities=11% Similarity=0.034 Sum_probs=143.8 Q ss_pred Cccchhh-------H------HHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCcc-cceeecCCCCCCCc Q lcl|NC_019506. 1 MAVTSFI-------P------KLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGAI-TVKEYTENSDIDAP 66 (276) Q Consensus 1 MA~~~l~-------~------e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~~-~~~d~~~~~~~~~~ 66 (276) ||-+.++ | +.|++-+.+ |.+ ++...+ . .+...|+||++|++... .+.++.+|..++. T Consensus 1 mAe~nlt~~~dL~~~~sidfv~~f~~~i~~-L~~---~Lgi~r--~--~p~a~G~tIt~pK~~~tgda~dVaEGe~Ipl- 71 (295) T protein:vir:99 1 MAEKNLNTMADLGDIKSIDFVNKFSKNIND-LLK---LLGVTR--R--ETLTNDLKIQTYKWEVTLDQTDPGEGETIPL- 71 (295) T ss_pred CCCcccccHhhccCceeehhhHHhhhhHHH-HHH---Hhcccc--c--cccccCCeEEeeeeeeecccccccCCcccch- Confidence 8866433 1 223322222 222 222221 1 24567999999999865 5888888888765 Q ss_pred cccccc---eEEEEEEeeeecceeechHHH-HhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccCCHHH Q lcl|NC_019506. 67 EELSTT---EKVLEINKQKYFNFQIDDVDA-AQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSKLKPAATLDKTN 142 (276) Q Consensus 67 ~~~~~~---~~~~~ld~~~~~~~~v~d~d~-~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~~~~~~~~t~~~ 142 (276) ..++.+ ..++++++++. .++|+.. .....+.+.+.-+|+..+|++++|.+++..++.++..... . +-.. T Consensus 72 skvt~~~~~t~t~kikK~rK---~tTdEAIqlsGygdpvgead~qL~~~ia~kId~D~~~~lktat~t~tg--~--~lq~ 144 (295) T protein:vir:99 72 SKVTRTKDKDYTVKWFKKRR---ATTAEAIARHGAARAITEADKRIMRELQNGIKDAFFTFLKTKPTKVKG--V--GLQK 144 (295) T ss_pred hhheeeeeeeeEEEeeeecc---cccHHHHHhcCCCchhHHHHHHHHHHHHHhhhHHHHHHhccCceeeeh--h--hHHH Confidence 455543 46777766533 3588886 6788899999999999999999999999999876654321 1 1122 Q ss_pred HHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhh--hhcccccccceeeeeeeEEeceE-EEEeccccccc-- Q lcl|NC_019506. 143 IYEELIKVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIV--GTGGAMAESITKNGFVGTILGFD-VYLSNNMGSLT-- 217 (276) Q Consensus 143 ~~~~i~~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~--~~~~~~~~~~~~~G~i~~~~G~~-v~~s~~lp~~~-- 217 (276) .++.+..+...+.+.+ ....++||||...+.|+++.... .+... +-..+. +++|++ |++|..+|... T Consensus 145 a~a~~~~al~~f~Ee~--~~~~V~FVnP~D~a~yl~~A~~~~~~a~~f-G~~~L~-----nfLG~q~II~S~kv~~G~~~ 216 (295) T protein:vir:99 145 ALSASWAKLATFNEFE--GSPLVSFVSPLDVANYLGDTKVGADASNVF-GMTLLK-----NFLGMQNVIVMPSVPEGKIY 216 (295) T ss_pred HHHHhhhhhhhccccc--CCceEEEEehHHHHHHHhccccccchhhhh-hhhhhh-----hhhccceEEEcccCCCceEE Confidence 2333333333333332 12358999999999999986543 22112 222333 499997 99999998421 Q ss_pred ---cceEEEEEecceEE-eeeeeeee-------eeccCcccceeeEEeeeeeeeE--EEcCCeEEEEEecCC Q lcl|NC_019506. 218 ---NGTGAIAGVKMACT-FAEQIVQT-------EAYRMEKRFADAVKGLNVFGCK--VIYPDALVCLKKTNP 276 (276) Q Consensus 218 ---~~~~~~~~~~~a~~-~~~~~~~~-------e~~~~~~~~~~~i~~~~~yg~~--v~~~~~vv~~~~~~p 276 (276) ..+..+++.+...+ +++.+..+ -...++....=-+.....-|.. .=++|||++.+..+| T Consensus 217 aT~~~Ni~~ay~~~~~g~l~~~f~~~~D~tglIg~~h~~~~~~~t~et~~~~~~~lfpE~~dgiv~~tI~~~ 288 (295) T protein:vir:99 217 STAVENLVFASLNVKGGDLGGLFADFTDETGLIAAARNRQLSNLTYESVFFGANVLFAEIPEGVVEATIEAA 288 (295) T ss_pred EeeccceEEEEecCCchhhhhhhhhccCcccceEEEeccccceeeehhhhHhHHHhcccccceEEEEEEecC Confidence 12223333322211 11111111 0011111111111222222322 238999999998888 No 144 >protein:vir:105038 Length: 428 # NCBI annotation: major capsid head protein precursor # Family: family:all:21 # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006586;genbank:gi:46402092;genbank:GeneID:2777903 Probab=99.32 E-value=3.8e-12 Score=83.20 Aligned_cols=263 Identities=13% Similarity=0.107 Sum_probs=153.3 Q ss_pred Ccc-------chhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCc-ccceeecCCCCCCCccccccc Q lcl|NC_019506. 1 MAV-------TSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGA-ITVKEYTENSDIDAPEELSTT 72 (276) Q Consensus 1 MA~-------~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~-~~~~d~~~~~~~~~~~~~~~~ 72 (276) +++ -+++|+.|...+.+.+++.+++..+..+-. +...| .+.+|+... ..+..+.+++.... .+.+.+ T Consensus 125 ~~~~~~~~~gg~liP~~~~~~ii~~l~~~~~l~~~~~~~~---~~~~g-~~~~p~~~~~~~a~~v~Eg~~~~~-~~~~f~ 199 (428) T protein:vir:10 125 MAISTAAGSGGVLIPQNIHSEVIELLRDRTIVRKLGARSI---PLPNG-NMSLPRLAGGATASYTGENQDAKV-SEARFD 199 (428) T ss_pred hhhcccccCCccccchhHHHHHHHHHhhhchhhhhcceee---ecCCc-ceEEEEEeCCcceeeeccCccccc-ccccee Confidence 221 147899999999999999888887733221 22223 378887643 34666666766554 456777 Q ss_pred eEEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc-----------c---ccccC Q lcl|NC_019506. 73 EKVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSKL-----------K---PAATL 138 (276) Q Consensus 73 ~~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~~-----------~---~~~~~ 138 (276) .+++...+. +.-+.|+++-...+..++...+.+..++++++++|..++..-.++....+ . ..... T Consensus 200 ~i~~~~~k~-~~~v~is~ell~ds~~~l~~~i~~~l~~ai~~~~d~~~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~~~~~ 278 (428) T protein:vir:10 200 DVKLTAKTM-IAMVPISNALIGRAGFNVEQLVLQDILTAISVREDKAFMRDDGTGDTPIGMKARATQWNRLLPWAADAAV 278 (428) T ss_pred eEEeeeEEE-EEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccccccccccccc Confidence 788877655 44578888877777789999999999999999999988753222111000 0 01111 Q ss_pred CHHHHHHHHHHHHHHHhh-cCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEeccccccc Q lcl|NC_019506. 139 DKTNIYEELIKVKVKLDE-KNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSNNMGSLT 217 (276) Q Consensus 139 t~~~~~~~i~~a~~~l~~-~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp~~~ 217 (276) +.. ..+.+.++...+.. ......+-..+++|..+..|.+... .. +...+....-++++|.+|+.++.+|... T Consensus 279 ~~~-~~~~~~~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd--~~----G~~i~~~~~~g~l~G~pv~~~~~~p~~~ 351 (428) T protein:vir:10 279 NLD-TIDTYLDSIILMSMDGNSNMISSGWGMSNRTYMKLFGLRD--GN----GNKVYPEMAQGMLKGYPIQRTSAIPANL 351 (428) T ss_pred cHH-HHHHHHHHHHHhhhccccccccCEEEEcHHHHHHHHHhhc--cC----CceeccCCCCCeeeceeeEEeccccccc Confidence 111 11222222211111 1111223357889999998866432 11 1112212222478999999999998643 Q ss_pred cc----eEEEEEecceEEeeee-eeeeeeccCc----------ccc---eeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 218 NG----TGAIAGVKMACTFAEQ-IVQTEAYRME----------KRF---ADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 218 ~~----~~~~~~~~~a~~~~~~-~~~~e~~~~~----------~~~---~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) .. ...+++..+.+.+..+ ...++..+.. ..| -..+++..++|..+.+|+++++++..+= T Consensus 352 ~~~~~~~~i~~gd~s~~~i~~~~~i~i~~~~~~~~~~~~~~~~~~f~~~~~~~R~~~r~d~~v~~p~a~~~~t~~~~ 428 (428) T protein:vir:10 352 GEGGKESEIYFADFNDVVIGEDGNMKVDFSKEASYIDTDGKLVSAFSRNQSLIRVVTEHDIGFRHPEGLVLGTGVLF 428 (428) T ss_pred cCCCccceEEEEecceEEEEEecceEEEeecccccccccccccchhhcchhheeeeeeeCceeeccceEEEEeccCC Confidence 21 1233333333333221 1122222211 111 2477899999999999999999986666 No 145 >protein:vir:93881 Length: 387 # NCBI annotation: ORF011 # Family: family:all:658 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239938;genbank:gi:66395599;genbank:GeneID:5130947 Probab=99.31 E-value=6.6e-13 Score=87.37 Aligned_cols=252 Identities=10% Similarity=0.036 Sum_probs=150.5 Q ss_pred Ccc------chhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEecc--CcccceeecCCCCCCCccccccc Q lcl|NC_019506. 1 MAV------TSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQI--GAITVKEYTENSDIDAPEELSTT 72 (276) Q Consensus 1 MA~------~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~--~~~~~~d~~~~~~~~~~~~~~~~ 72 (276) |.. -+++|+.+..++++.+++...+..+++.- ...| ..+|.. +..++..+.++..... .+++.+ T Consensus 118 l~~~t~s~gG~~IP~~~~~~Ii~~~~~~~~l~~~~~v~-----~~~~--~~~p~~~~~~~~a~~v~E~~~~~~-~~~~f~ 189 (387) T protein:vir:93 118 LPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLT-----NIKG--LEIPRVSYTLDDDDFITDVETAKE-LKLKGD 189 (387) T ss_pred hccCcCCCCceeechhHHHHHHHHHHhhchhhhheeee-----ecCC--ceEEEEeecCCccccccCcccccc-cccccc Confidence 221 14789999999999998888887776531 1223 334543 3334556666665443 456777 Q ss_pred eEEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc----cccccCCHHHHHHHHH Q lcl|NC_019506. 73 EKVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSKL----KPAATLDKTNIYEELI 148 (276) Q Consensus 73 ~~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~~----~~~~~~t~~~~~~~i~ 148 (276) .+++...+. +.-+.|+++-...+..|+.+.+.+..++++++..+..++..........+ ...+..++...++.|. T Consensus 190 ~v~~~~~k~-~~~~~iS~ell~Ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g~p~g~l~~~~~~~v~~~~~~d~i~ 268 (387) T protein:vir:93 190 TVKFTTNKF-KVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKSGLDHMSFYNGSVKEVEGADMYDAII 268 (387) T ss_pred eeeeeheee-eeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCccccceeeeccccccccccchHHHHH Confidence 777777554 33467887777677789999999999999998877776643332211111 1112234445578888 Q ss_pred HHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEeccccccccceEEEEEecc Q lcl|NC_019506. 149 KVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSNNMGSLTNGTGAIAGVKM 228 (276) Q Consensus 149 ~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp~~~~~~~~~~~~~~ 228 (276) ++...|+..... +-..++++..+..++.-. .+. +..+..|.-.+++|.+|+.++..+.. .+...+. T Consensus 269 ~~~~~l~~~~~~--~a~~~mn~~t~~~~~~~~--~d~-----~~~~~~~~~~~llG~PV~~~~~~~~~-----~~GDf~~ 334 (387) T protein:vir:93 269 NALADLHEDYRD--NATIYMRYADYVKIISVL--SNG-----TTNFFDTPAEKVFGKPVVFTDAAVKP-----IVGDFNY 334 (387) T ss_pred HHHhccChhhhc--CCEEEEechHHHHHHHHH--hcC-----CCcccccCCccccccceEEecCCCce-----eeeehhh Confidence 887777665432 224678887766654421 111 11233455568999999998765431 1111121 Q ss_pred eEEeeeeeeeeeeccCcccceeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 229 ACTFAEQIVQTEAYRMEKRFADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 229 a~~~~~~~~~~e~~~~~~~~~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) +.....+ ..+..++....--..+.+..++|+++++|+++++++.++| T Consensus 335 ~~~~~~~-~~~~~~~~~~~~~~~~~~~~r~d~~v~~~eA~~~l~~k~~ 381 (387) T protein:vir:93 335 FGINYDG-TTYDTDKDVKKGEYLFVLTAWYDQQRTLDSAFRIAKAKEN 381 (387) T ss_pred hheehhh-heeeecccccCCceeEEEEeeeCceeechhheEEEEeecC Confidence 1111111 1122222222223456778899999999999999998777 No 146 >protein:vir:9361 Length: 402 # NCBI annotation: SLT orf 37-like protein # Family: family:all:658 # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803339;genbank:gi:29028650;genbank:GeneID:1258088 Probab=99.30 E-value=5.3e-13 Score=87.89 Aligned_cols=252 Identities=10% Similarity=0.024 Sum_probs=152.8 Q ss_pred Ccc------chhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEecc--CcccceeecCCCCCCCccccccc Q lcl|NC_019506. 1 MAV------TSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQI--GAITVKEYTENSDIDAPEELSTT 72 (276) Q Consensus 1 MA~------~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~--~~~~~~d~~~~~~~~~~~~~~~~ 72 (276) |.. -+++|+-++.++++.++....+..+++.- ...| .++|.+ +..++..+.++..... .+++.. T Consensus 133 ~~~~t~~~GG~lIP~~~~~~Ii~~~~~~~~l~~~~~v~-----~~~~--~~~p~~~~~~~~a~~v~Eg~~~~~-~~~~f~ 204 (402) T protein:vir:93 133 LPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLT-----NIKG--LEIPRVSYTLDDDDFITDVETAKE-LKAKGD 204 (402) T ss_pred hccCCCcCCccccchhHHHHHHHhHHhhhhhhhhceee-----ecCC--ceeeeeeccCCccccccccccccc-cccccc Confidence 221 24789999999999998888888877531 1223 334543 2334555566655443 356677 Q ss_pred eEEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc----cccccCCHHHHHHHHH Q lcl|NC_019506. 73 EKVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSKL----KPAATLDKTNIYEELI 148 (276) Q Consensus 73 ~~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~~----~~~~~~t~~~~~~~i~ 148 (276) .+++.+.+. +.-+.|+.+-...+..++.+.+.+..+++++...++.++..........+ ...+..++.+.++.|. T Consensus 205 ~i~~~~~k~-~~~i~iS~ell~Ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g~p~g~~~~~~~~~~~~~~~~d~l~ 283 (402) T protein:vir:93 205 TVKFTTNKF-KVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKSGLEHMSFYNGSVKEVEGADMYDAII 283 (402) T ss_pred eeeecceee-eeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCccccceeeeccccccccccchHHHHH Confidence 777777554 33467887767677889999999999999999877766644332211111 1122234445578888 Q ss_pred HHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEeccccccccceEEEEEecc Q lcl|NC_019506. 149 KVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSNNMGSLTNGTGAIAGVKM 228 (276) Q Consensus 149 ~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp~~~~~~~~~~~~~~ 228 (276) ++...|+.... .+-..++++..+..|++-. .+. +..+..|.-.+++|.+|+.++..+.. .+..-+. T Consensus 284 ~~~~~l~~~y~--~na~~imn~~t~~~~~~~~--~d~-----~~~~~~~~~~~llG~PV~~t~~~~~i-----~~GDf~~ 349 (402) T protein:vir:93 284 NALADLHEDYR--DNATIYMRYADYVKIISVL--SNG-----TTNFFDTPAEKVFGKPVVFTDAAVKP-----IVGDFNY 349 (402) T ss_pred HHHhccChhhh--cCCEEEEechHHHHHHHHH--hcC-----CCcccccCCccccccceEEecCCCce-----eeechhh Confidence 88777765543 2334678877766665431 111 12334556668999999998766531 1111111 Q ss_pred eEEeeeeeeeeeeccCcccceeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 229 ACTFAEQIVQTEAYRMEKRFADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 229 a~~~~~~~~~~e~~~~~~~~~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) +.....+ ..+..+++...--..+++..++|++|++|+++++++.+++ T Consensus 350 ~~~~~~~-~~~~~~~~~~~~~~~~~~~~r~Dg~v~~~~A~~~l~ik~~ 396 (402) T protein:vir:93 350 FGINYDG-TTYDTDKDVKKGEYLFVLTAWYDQQRTLDSAFRIAKAKEN 396 (402) T ss_pred hhhhhhh-hhhhhhhcccCCceEEEEEEEeCcEEechhheEEEEeecC Confidence 1111111 1223333333323567888999999999999999998776 No 147 >protein:vir:96978 Length: 387 # NCBI annotation: ORF009 # Family: family:all:658 # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239859;genbank:gi:66395517;genbank:GeneID:5133011 Probab=99.30 E-value=4e-13 Score=88.56 Aligned_cols=252 Identities=10% Similarity=0.021 Sum_probs=152.8 Q ss_pred Ccc------chhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEecc--CcccceeecCCCCCCCccccccc Q lcl|NC_019506. 1 MAV------TSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQI--GAITVKEYTENSDIDAPEELSTT 72 (276) Q Consensus 1 MA~------~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~--~~~~~~d~~~~~~~~~~~~~~~~ 72 (276) |.. -+++|+.++.++++.++....+..+++.- ...| .++|.+ +..++..+.++..... .+++.. T Consensus 118 ~~~~~~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~-----~~~~--~~~p~~~~~~~~a~~v~Eg~~~~~-~~~~f~ 189 (387) T protein:vir:96 118 LPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLT-----NIKG--LEIPRVSYTLDDDDFITDVETAKE-LKAKGD 189 (387) T ss_pred hccCCCCCCceeechhHHHHHHHHHHhhchhhhhceee-----ecCC--ceeeeeeccCCccccccccccccc-cccccc Confidence 211 24789999999999998888888776531 1122 334543 2334555566665543 456777 Q ss_pred eEEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc----cccccCCHHHHHHHHH Q lcl|NC_019506. 73 EKVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSKL----KPAATLDKTNIYEELI 148 (276) Q Consensus 73 ~~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~~----~~~~~~t~~~~~~~i~ 148 (276) .+++...+. +.-+.|+++-...+..++.+.+.+..+++++...+..++..........+ ...+..++...++.|. T Consensus 190 ~v~l~~~k~-~~~i~iS~ell~ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g~~~g~~~~~~~~~~~~~~~~d~i~ 268 (387) T protein:vir:96 190 TVKFTTNKF-KVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKSGLEHMSFYNGSVKEVEGADMYDAII 268 (387) T ss_pred eeeechhee-eeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCccccceeeeccccccccccchHHHHH Confidence 777777655 33467888777777889999999999999998877776654332211111 1112234455678888 Q ss_pred HHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEeccccccccceEEEEEecc Q lcl|NC_019506. 149 KVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSNNMGSLTNGTGAIAGVKM 228 (276) Q Consensus 149 ~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp~~~~~~~~~~~~~~ 228 (276) ++...|...... +-..++++..+..|++-. .+. +..+..|.-.+++|.+|+.++..+.. .+...+. T Consensus 269 ~~~~~l~~~y~~--na~~imn~~t~~~~~~~~--~~~-----~~~~~~~~~~~llG~PV~~~~~~~~~-----~~GDf~~ 334 (387) T protein:vir:96 269 NALADLHEDYRD--NATIYMRYADYVKIISVL--SNG-----TTNFFDTPAEKVFGKPVVFTDAAVKP-----IVGDFNY 334 (387) T ss_pred HHHhccChhhhc--CCEEEEechHHHHHHHHH--hcC-----CCcccccCCccccccceEEecCCCce-----eeechhh Confidence 887777655432 224678887776665421 111 12344566678999999998765431 1111111 Q ss_pred eEEeeeeeeeeeeccCcccceeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 229 ACTFAEQIVQTEAYRMEKRFADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 229 a~~~~~~~~~~e~~~~~~~~~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) +.... +......+++...--..+++..++|+++++|+++++++.+++ T Consensus 335 ~~~~~-~~~~~~~~~~~~~~~~~~~~~~r~Dg~v~~~~A~~~l~~ka~ 381 (387) T protein:vir:96 335 FGINY-DGTTYDTDKDVKKGEYLFVLTAWYDQQRTLDSAFRIAKAKEN 381 (387) T ss_pred hhhhh-hhhhheecccccCCceEEEEEEEeCcEeechhheEEEEeecC Confidence 11111 111122233322223567788899999999999999998777 No 148 >protein:vir:2685 Length: 387 # NCBI annotation: hypothetical protein # Family: family:all:658 # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075504;genbank:gi:12719433;genbank:GeneID:920169 Probab=99.30 E-value=4e-13 Score=88.56 Aligned_cols=252 Identities=10% Similarity=0.021 Sum_probs=152.8 Q ss_pred Ccc------chhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEecc--CcccceeecCCCCCCCccccccc Q lcl|NC_019506. 1 MAV------TSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQI--GAITVKEYTENSDIDAPEELSTT 72 (276) Q Consensus 1 MA~------~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~--~~~~~~d~~~~~~~~~~~~~~~~ 72 (276) |.. -+++|+.++.++++.++....+..+++.- ...| .++|.+ +..++..+.++..... .+++.. T Consensus 118 ~~~~~~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~-----~~~~--~~~p~~~~~~~~a~~v~Eg~~~~~-~~~~f~ 189 (387) T protein:vir:26 118 LPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLT-----NIKG--LEIPRVSYTLDDDDFITDVETAKE-LKAKGD 189 (387) T ss_pred hccCCCCCCceeechhHHHHHHHHHHhhchhhhhceee-----ecCC--ceeeeeeccCCccccccccccccc-cccccc Confidence 211 24789999999999998888888776531 1122 334543 2334555566665543 456777 Q ss_pred eEEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc----cccccCCHHHHHHHHH Q lcl|NC_019506. 73 EKVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSKL----KPAATLDKTNIYEELI 148 (276) Q Consensus 73 ~~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~~----~~~~~~t~~~~~~~i~ 148 (276) .+++...+. +.-+.|+++-...+..++.+.+.+..+++++...+..++..........+ ...+..++...++.|. T Consensus 190 ~v~l~~~k~-~~~i~iS~ell~ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g~~~g~~~~~~~~~~~~~~~~d~i~ 268 (387) T protein:vir:26 190 TVKFTTNKF-KVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKSGLEHMSFYNGSVKEVEGADMYDAII 268 (387) T ss_pred eeeechhee-eeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCccccceeeeccccccccccchHHHHH Confidence 777777655 33467888777777889999999999999998877776654332211111 1112234455678888 Q ss_pred HHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEeccccccccceEEEEEecc Q lcl|NC_019506. 149 KVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSNNMGSLTNGTGAIAGVKM 228 (276) Q Consensus 149 ~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp~~~~~~~~~~~~~~ 228 (276) ++...|...... +-..++++..+..|++-. .+. +..+..|.-.+++|.+|+.++..+.. .+...+. T Consensus 269 ~~~~~l~~~y~~--na~~imn~~t~~~~~~~~--~~~-----~~~~~~~~~~~llG~PV~~~~~~~~~-----~~GDf~~ 334 (387) T protein:vir:26 269 NALADLHEDYRD--NATIYMRYADYVKIISVL--SNG-----TTNFFDTPAEKVFGKPVVFTDAAVKP-----IVGDFNY 334 (387) T ss_pred HHHhccChhhhc--CCEEEEechHHHHHHHHH--hcC-----CCcccccCCccccccceEEecCCCce-----eeechhh Confidence 887777655432 224678887776665421 111 12344566678999999998765431 1111111 Q ss_pred eEEeeeeeeeeeeccCcccceeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 229 ACTFAEQIVQTEAYRMEKRFADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 229 a~~~~~~~~~~e~~~~~~~~~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) +.... +......+++...--..+++..++|+++++|+++++++.+++ T Consensus 335 ~~~~~-~~~~~~~~~~~~~~~~~~~~~~r~Dg~v~~~~A~~~l~~ka~ 381 (387) T protein:vir:26 335 FGINY-DGTTYDTDKDVKKGEYLFVLTAWYDQQRTLDSAFRIAKAKEN 381 (387) T ss_pred hhhhh-hhhhheecccccCCceEEEEEEEeCcEeechhheEEEEeecC Confidence 11111 111122233322223567788899999999999999998777 No 149 >protein:vir:94424 Length: 387 # NCBI annotation: ORF010 # Family: family:all:658 # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240005;genbank:gi:66395666;genbank:GeneID:5133084 Probab=99.30 E-value=4e-13 Score=88.56 Aligned_cols=252 Identities=10% Similarity=0.021 Sum_probs=152.8 Q ss_pred Ccc------chhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEecc--CcccceeecCCCCCCCccccccc Q lcl|NC_019506. 1 MAV------TSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQI--GAITVKEYTENSDIDAPEELSTT 72 (276) Q Consensus 1 MA~------~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~--~~~~~~d~~~~~~~~~~~~~~~~ 72 (276) |.. -+++|+.++.++++.++....+..+++.- ...| .++|.+ +..++..+.++..... .+++.. T Consensus 118 ~~~~~~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~-----~~~~--~~~p~~~~~~~~a~~v~Eg~~~~~-~~~~f~ 189 (387) T protein:vir:94 118 LPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLT-----NIKG--LEIPRVSYTLDDDDFITDVETAKE-LKAKGD 189 (387) T ss_pred hccCCCCCCceeechhHHHHHHHHHHhhchhhhhceee-----ecCC--ceeeeeeccCCccccccccccccc-cccccc Confidence 211 24789999999999998888888776531 1122 334543 2334555566665543 456777 Q ss_pred eEEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc----cccccCCHHHHHHHHH Q lcl|NC_019506. 73 EKVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSKL----KPAATLDKTNIYEELI 148 (276) Q Consensus 73 ~~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~~----~~~~~~t~~~~~~~i~ 148 (276) .+++...+. +.-+.|+++-...+..++.+.+.+..+++++...+..++..........+ ...+..++...++.|. T Consensus 190 ~v~l~~~k~-~~~i~iS~ell~ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g~~~g~~~~~~~~~~~~~~~~d~i~ 268 (387) T protein:vir:94 190 TVKFTTNKF-KVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKSGLEHMSFYNGSVKEVEGADMYDAII 268 (387) T ss_pred eeeechhee-eeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCccccceeeeccccccccccchHHHHH Confidence 777777655 33467888777777889999999999999998877776654332211111 1112234455678888 Q ss_pred HHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEeccccccccceEEEEEecc Q lcl|NC_019506. 149 KVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSNNMGSLTNGTGAIAGVKM 228 (276) Q Consensus 149 ~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp~~~~~~~~~~~~~~ 228 (276) ++...|...... +-..++++..+..|++-. .+. +..+..|.-.+++|.+|+.++..+.. .+...+. T Consensus 269 ~~~~~l~~~y~~--na~~imn~~t~~~~~~~~--~~~-----~~~~~~~~~~~llG~PV~~~~~~~~~-----~~GDf~~ 334 (387) T protein:vir:94 269 NALADLHEDYRD--NATIYMRYADYVKIISVL--SNG-----TTNFFDTPAEKVFGKPVVFTDAAVKP-----IVGDFNY 334 (387) T ss_pred HHHhccChhhhc--CCEEEEechHHHHHHHHH--hcC-----CCcccccCCccccccceEEecCCCce-----eeechhh Confidence 887777655432 224678887776665421 111 12344566678999999998765431 1111111 Q ss_pred eEEeeeeeeeeeeccCcccceeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 229 ACTFAEQIVQTEAYRMEKRFADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 229 a~~~~~~~~~~e~~~~~~~~~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) +.... +......+++...--..+++..++|+++++|+++++++.+++ T Consensus 335 ~~~~~-~~~~~~~~~~~~~~~~~~~~~~r~Dg~v~~~~A~~~l~~ka~ 381 (387) T protein:vir:94 335 FGINY-DGTTYDTDKDVKKGEYLFVLTAWYDQQRTLDSAFRIAKAKEN 381 (387) T ss_pred hhhhh-hhhhheecccccCCceEEEEEEEeCcEeechhheEEEEeecC Confidence 11111 111122233322223567788899999999999999998777 No 150 >protein:vir:4197 Length: 314 # NCBI annotation: putative structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071822;genbank:gi:11863105;genbank:GeneID:1257607 Probab=99.28 E-value=6e-12 Score=82.09 Aligned_cols=268 Identities=9% Similarity=0.027 Sum_probs=169.5 Q ss_pred CccchhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCccc--ceeecCCCCC--CCccccccceEEE Q lcl|NC_019506. 1 MAVTSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGAIT--VKEYTENSDI--DAPEELSTTEKVL 76 (276) Q Consensus 1 MA~~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~~~--~~d~~~~~~~--~~~~~~~~~~~~~ 76 (276) ..--+|.|+.+. ++.+.+.+...+++++++.. ..+..+.+||+++... ......++.. ....+++.+.+++ T Consensus 19 ~~gG~L~P~~~~-~~i~~l~e~s~i~~~a~vi~----t~~s~~~~i~~i~~g~~~~~~~~~~~~~~~~~~~~~tf~~~~l 93 (314) T protein:vir:41 19 LGKGILAVQRFG-EFVREVRENSAIIKDARVLN----ALKSYEVDISRISLGVELEPGRNTSGTKVAPTADEVTVSTNTL 93 (314) T ss_pred CCCceeChHHHH-HHHHHHHhccchhhheeeec----ccCccceeecccccCcccccccccccCCccCCcccccccceee Confidence 333358899986 57889999999999986421 1123568888876421 1111111111 1124567788888 Q ss_pred EEEeeeecceeechHHHHhhhh--hHHHHHHHHHHHHHHHHHHHHHHHHhhccc----------------cccccccccC Q lcl|NC_019506. 77 EINKQKYFNFQIDDVDAAQIRT--PLMDAAMQRAAYALADETEKILLKEMDTNA----------------TSKLKPAATL 138 (276) Q Consensus 77 ~ld~~~~~~~~v~d~d~~~~~~--d~~~~~~~~~~~ala~~~d~~~~~~~~~~~----------------~~~~~~~~~~ 138 (276) ...+.. ..+.|+++....+.. |+...+....++++++..+..++..=.+.+ ....+..+.. T Consensus 94 ~~~kl~-~~v~is~e~L~D~a~~~~le~~i~~~~Ae~~g~~~~~~~~nGdg~~~s~~~~~~~p~G~l~~a~~~~~~~~~~ 172 (314) T protein:vir:41 94 EMKELV-TKVVLEDEALEDNIEQSAFEQTITSLLASGVTYDLECFFLHADSSLTTGRELYRINDGWMKLAGNQYTDAEPE 172 (314) T ss_pred eeEEEE-EeecccHHHHHhhhchhhHHHHHHHHHHHHHHHHHHHHhhccccCCcCcccchhcchhhhhhcccceeecCcc Confidence 886654 457888887777665 888888999999999988887765322110 0001111122 Q ss_pred CHHHHHHHHHHHHHHHhhcCCCc-cCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEeccccccc Q lcl|NC_019506. 139 DKTNIYEELIKVKVKLDEKNVPT-IGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSNNMGSLT 217 (276) Q Consensus 139 t~~~~~~~i~~a~~~l~~~~vP~-~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp~~~ 217 (276) +..+..+.|.++...|....--. .+-+.+++++.+..+++. +.......++..+..|...+++|++|+.++.+|... T Consensus 173 ~~~~~~~~~~~l~~sl~~~yr~~~~~~~~~m~~~t~~~~r~~--l~~~~~~l~~~~~~~~~~~~l~G~PV~~~~~~~~~~ 250 (314) T protein:vir:41 173 DENWPLNLFDGMMDELDTRYLQLKPRMKFYVSNEIYNGYRKQ--LLVRETGLGDSALIGATGLQYDGIPIQYVPALDALG 250 (314) T ss_pred ccccHHHHHHHHHHhcCchhhcCCCceEEEecHHHHHHHHHH--HhccCCcccchhhhCCCCceecceeeEecccccccC Confidence 33344556677776665432111 123577899988877653 223344566777778888889999999999998765 Q ss_pred cceE-EEEEecceEEeeeee-eeeeeccCcccceeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 218 NGTG-AIAGVKMACTFAEQI-VQTEAYRMEKRFADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 218 ~~~~-~~~~~~~a~~~~~~~-~~~e~~~~~~~~~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) .+.. .++++..-+.++... .+++.+++...-...+....+.++.+..++++|+.-.-.. T Consensus 251 ~~~~~i~fgd~~nlv~~~~~~ir~~~~~~a~~~~~~~~~~~r~d~~~~~~~aa~~~~~~~~ 311 (314) T protein:vir:41 251 DDKARALLTVPTNLVYGFWRNIRIEPKRDAAMRRTEYIASLRADCNYEDENAAVAAVIDMS 311 (314) T ss_pred CCCceEEEechhheEEEeeceeEEeecccCcCCeEEEEEEEEeceEEEEcCcEEEEEeecc Confidence 4444 444555555555332 3566666666556778888999999988888887655444 No 151 >protein:vir:102082 Length: 392 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512315;genbank:gi:89152484;genbank:GeneID:3953075 Probab=99.28 E-value=4.9e-12 Score=82.61 Aligned_cols=259 Identities=13% Similarity=0.069 Sum_probs=155.8 Q ss_pred Ccc------chhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccC-cccceeecCCCCCCCccccccce Q lcl|NC_019506. 1 MAV------TSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIG-AITVKEYTENSDIDAPEELSTTE 73 (276) Q Consensus 1 MA~------~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~-~~~~~d~~~~~~~~~~~~~~~~~ 73 (276) |.. ..++|+.+...+++.+++.+++.+++..- ......| +..+|+.. ...+..+.++........++... T Consensus 106 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~--~~~~~~~-~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~ 182 (392) T protein:vir:10 106 MSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVE--PVRTRSG-SRVLEKNSDMIPFAEITEMGEIPETDNPKFSN 182 (392) T ss_pred ccccccCCCceecchhHHHHHHHHHHhhhhhhhhceee--eccCCce-eEEEEeecCCccceeeccccccccccccccee Confidence 321 24789999999999999999888887531 1111122 34455443 34566777776654323456677 Q ss_pred EEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccCCHHHHHHHHHHHH-H Q lcl|NC_019506. 74 KVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSKLKPAATLDKTNIYEELIKVK-V 152 (276) Q Consensus 74 ~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~-~ 152 (276) +++...+. +.-+.|+++-...+..++...+.+..++++++.+|..++....+... .+..+ ++.|.++. . T Consensus 183 v~l~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~-----~~~~~----~d~i~~~~~~ 252 (392) T protein:vir:10 183 VQYAVKDR-AGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTK-----QAIKS----LDDIKDVLNV 252 (392) T ss_pred EEeeeeeE-EEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc-----cCccC----HHHHHHHHHH Confidence 77777554 55578888776667789999999999999999999998875544321 12222 44555543 3 Q ss_pred HHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEE-e-cccccc---ccceE-EEEEe Q lcl|NC_019506. 153 KLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYL-S-NNMGSL---TNGTG-AIAGV 226 (276) Q Consensus 153 ~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~-s-~~lp~~---~~~~~-~~~~~ 226 (276) .|..... .+-..++||..+..|.+...- ...+.....+..|..++++|.+++. + +..|.. ..+.. .+.+. T Consensus 253 ~l~~~~~--~~a~~vm~~~~~~~L~~lkd~--~G~~l~~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gd 328 (392) T protein:vir:10 253 KLDPAIS--PNAILLTNQDGFNYLDKLKDK--DGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGD 328 (392) T ss_pred hhhhhhc--cCCEEEEcHHHHHHHHHhhcc--CCCeEeecCccCCccccccCcccEEEecccccCCCcccCCceEEEEEe Confidence 4444432 334689999999999764211 1122223334567777899987654 2 333321 12222 22222 Q ss_pred -cceEEee-eeeeeeeeccCc-cc---ceeeEEeeeeeeeEEEcCCeEEEEE--ecCC Q lcl|NC_019506. 227 -KMACTFA-EQIVQTEAYRME-KR---FADAVKGLNVFGCKVIYPDALVCLK--KTNP 276 (276) Q Consensus 227 -~~a~~~~-~~~~~~e~~~~~-~~---~~~~i~~~~~yg~~v~~~~~vv~~~--~~~p 276 (276) +.++... .....++..+.. .. ....+++..++|.++++|++++.++ .++| T Consensus 329 fs~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~a~ 386 (392) T protein:vir:10 329 LKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSAP 386 (392) T ss_pred hhceEEEEeecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEeccccc Confidence 2333322 222233332211 11 2346889999999999999999965 5666 No 152 >protein:vir:107593 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338188;genbank:gi:77020144;genbank:GeneID:3703724 Probab=99.28 E-value=4.9e-12 Score=82.61 Aligned_cols=259 Identities=13% Similarity=0.069 Sum_probs=155.8 Q ss_pred Ccc------chhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccC-cccceeecCCCCCCCccccccce Q lcl|NC_019506. 1 MAV------TSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIG-AITVKEYTENSDIDAPEELSTTE 73 (276) Q Consensus 1 MA~------~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~-~~~~~d~~~~~~~~~~~~~~~~~ 73 (276) |.. ..++|+.+...+++.+++.+++.+++..- ......| +..+|+.. ...+..+.++........++... T Consensus 106 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~--~~~~~~~-~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~ 182 (392) T protein:vir:10 106 MSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVE--PVRTRSG-SRVLEKNSDMIPFAEITEMGEIPETDNPKFSN 182 (392) T ss_pred ccccccCCCceecchhHHHHHHHHHHhhhhhhhhceee--eccCCce-eEEEEeecCCccceeeccccccccccccccee Confidence 321 24789999999999999999888887531 1111122 34455443 34566777776654323456677 Q ss_pred EEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccCCHHHHHHHHHHHH-H Q lcl|NC_019506. 74 KVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSKLKPAATLDKTNIYEELIKVK-V 152 (276) Q Consensus 74 ~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~-~ 152 (276) +++...+. +.-+.|+++-...+..++...+.+..++++++.+|..++....+... .+..+ ++.|.++. . T Consensus 183 v~l~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~-----~~~~~----~d~i~~~~~~ 252 (392) T protein:vir:10 183 VQYAVKDR-AGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTK-----QAIKS----LDDIKDVLNV 252 (392) T ss_pred EEeeeeeE-EEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc-----cCccC----HHHHHHHHHH Confidence 77777554 55578888776667789999999999999999999998875544321 12222 44555543 3 Q ss_pred HHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEE-e-cccccc---ccceE-EEEEe Q lcl|NC_019506. 153 KLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYL-S-NNMGSL---TNGTG-AIAGV 226 (276) Q Consensus 153 ~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~-s-~~lp~~---~~~~~-~~~~~ 226 (276) .|..... .+-..++||..+..|.+...- ...+.....+..|..++++|.+++. + +..|.. ..+.. .+.+. T Consensus 253 ~l~~~~~--~~a~~vm~~~~~~~L~~lkd~--~G~~l~~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gd 328 (392) T protein:vir:10 253 KLDPAIS--PNAILLTNQDGFNYLDKLKDK--DGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGD 328 (392) T ss_pred hhhhhhc--cCCEEEEcHHHHHHHHHhhcc--CCCeEeecCccCCccccccCcccEEEecccccCCCcccCCceEEEEEe Confidence 4444432 334689999999999764211 1122223334567777899987654 2 333321 12222 22222 Q ss_pred -cceEEee-eeeeeeeeccCc-cc---ceeeEEeeeeeeeEEEcCCeEEEEE--ecCC Q lcl|NC_019506. 227 -KMACTFA-EQIVQTEAYRME-KR---FADAVKGLNVFGCKVIYPDALVCLK--KTNP 276 (276) Q Consensus 227 -~~a~~~~-~~~~~~e~~~~~-~~---~~~~i~~~~~yg~~v~~~~~vv~~~--~~~p 276 (276) +.++... .....++..+.. .. ....+++..++|.++++|++++.++ .++| T Consensus 329 fs~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~a~ 386 (392) T protein:vir:10 329 LKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSAP 386 (392) T ss_pred hhceEEEEeecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEeccccc Confidence 2333322 222233332211 11 2346889999999999999999965 5666 No 153 >protein:vir:105004 Length: 392 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459969;genbank:gi:85701384;genbank:GeneID:3882145 Probab=99.28 E-value=4.9e-12 Score=82.61 Aligned_cols=259 Identities=13% Similarity=0.069 Sum_probs=155.8 Q ss_pred Ccc------chhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccC-cccceeecCCCCCCCccccccce Q lcl|NC_019506. 1 MAV------TSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIG-AITVKEYTENSDIDAPEELSTTE 73 (276) Q Consensus 1 MA~------~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~-~~~~~d~~~~~~~~~~~~~~~~~ 73 (276) |.. ..++|+.+...+++.+++.+++.+++..- ......| +..+|+.. ...+..+.++........++... T Consensus 106 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~--~~~~~~~-~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~ 182 (392) T protein:vir:10 106 MSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVE--PVRTRSG-SRVLEKNSDMIPFAEITEMGEIPETDNPKFSN 182 (392) T ss_pred ccccccCCCceecchhHHHHHHHHHHhhhhhhhhceee--eccCCce-eEEEEeecCCccceeeccccccccccccccee Confidence 321 24789999999999999999888887531 1111122 34455443 34566777776654323456677 Q ss_pred EEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccCCHHHHHHHHHHHH-H Q lcl|NC_019506. 74 KVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSKLKPAATLDKTNIYEELIKVK-V 152 (276) Q Consensus 74 ~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~-~ 152 (276) +++...+. +.-+.|+++-...+..++...+.+..++++++.+|..++....+... .+..+ ++.|.++. . T Consensus 183 v~l~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~-----~~~~~----~d~i~~~~~~ 252 (392) T protein:vir:10 183 VQYAVKDR-AGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTK-----QAIKS----LDDIKDVLNV 252 (392) T ss_pred EEeeeeeE-EEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc-----cCccC----HHHHHHHHHH Confidence 77777554 55578888776667789999999999999999999998875544321 12222 44555543 3 Q ss_pred HHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEE-e-cccccc---ccceE-EEEEe Q lcl|NC_019506. 153 KLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYL-S-NNMGSL---TNGTG-AIAGV 226 (276) Q Consensus 153 ~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~-s-~~lp~~---~~~~~-~~~~~ 226 (276) .|..... .+-..++||..+..|.+...- ...+.....+..|..++++|.+++. + +..|.. ..+.. .+.+. T Consensus 253 ~l~~~~~--~~a~~vm~~~~~~~L~~lkd~--~G~~l~~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gd 328 (392) T protein:vir:10 253 KLDPAIS--PNAILLTNQDGFNYLDKLKDK--DGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGD 328 (392) T ss_pred hhhhhhc--cCCEEEEcHHHHHHHHHhhcc--CCCeEeecCccCCccccccCcccEEEecccccCCCcccCCceEEEEEe Confidence 4444432 334689999999999764211 1122223334567777899987654 2 333321 12222 22222 Q ss_pred -cceEEee-eeeeeeeeccCc-cc---ceeeEEeeeeeeeEEEcCCeEEEEE--ecCC Q lcl|NC_019506. 227 -KMACTFA-EQIVQTEAYRME-KR---FADAVKGLNVFGCKVIYPDALVCLK--KTNP 276 (276) Q Consensus 227 -~~a~~~~-~~~~~~e~~~~~-~~---~~~~i~~~~~yg~~v~~~~~vv~~~--~~~p 276 (276) +.++... .....++..+.. .. ....+++..++|.++++|++++.++ .++| T Consensus 329 fs~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~a~ 386 (392) T protein:vir:10 329 LKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSAP 386 (392) T ss_pred hhceEEEEeecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEeccccc Confidence 2333322 222233332211 11 2346889999999999999999965 5666 No 154 >protein:vir:102873 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338137;genbank:gi:77020198;genbank:GeneID:3703782 Probab=99.28 E-value=4.9e-12 Score=82.61 Aligned_cols=259 Identities=13% Similarity=0.069 Sum_probs=155.8 Q ss_pred Ccc------chhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccC-cccceeecCCCCCCCccccccce Q lcl|NC_019506. 1 MAV------TSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIG-AITVKEYTENSDIDAPEELSTTE 73 (276) Q Consensus 1 MA~------~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~-~~~~~d~~~~~~~~~~~~~~~~~ 73 (276) |.. ..++|+.+...+++.+++.+++.+++..- ......| +..+|+.. ...+..+.++........++... T Consensus 106 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~--~~~~~~~-~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~ 182 (392) T protein:vir:10 106 MSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVE--PVRTRSG-SRVLEKNSDMIPFAEITEMGEIPETDNPKFSN 182 (392) T ss_pred ccccccCCCceecchhHHHHHHHHHHhhhhhhhhceee--eccCCce-eEEEEeecCCccceeeccccccccccccccee Confidence 321 24789999999999999999888887531 1111122 34455443 34566777776654323456677 Q ss_pred EEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccCCHHHHHHHHHHHH-H Q lcl|NC_019506. 74 KVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSKLKPAATLDKTNIYEELIKVK-V 152 (276) Q Consensus 74 ~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~-~ 152 (276) +++...+. +.-+.|+++-...+..++...+.+..++++++.+|..++....+... .+..+ ++.|.++. . T Consensus 183 v~l~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~-----~~~~~----~d~i~~~~~~ 252 (392) T protein:vir:10 183 VQYAVKDR-AGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTK-----QAIKS----LDDIKDVLNV 252 (392) T ss_pred EEeeeeeE-EEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc-----cCccC----HHHHHHHHHH Confidence 77777554 55578888776667789999999999999999999998875544321 12222 44555543 3 Q ss_pred HHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEE-e-cccccc---ccceE-EEEEe Q lcl|NC_019506. 153 KLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYL-S-NNMGSL---TNGTG-AIAGV 226 (276) Q Consensus 153 ~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~-s-~~lp~~---~~~~~-~~~~~ 226 (276) .|..... .+-..++||..+..|.+...- ...+.....+..|..++++|.+++. + +..|.. ..+.. .+.+. T Consensus 253 ~l~~~~~--~~a~~vm~~~~~~~L~~lkd~--~G~~l~~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gd 328 (392) T protein:vir:10 253 KLDPAIS--PNAILLTNQDGFNYLDKLKDK--DGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGD 328 (392) T ss_pred hhhhhhc--cCCEEEEcHHHHHHHHHhhcc--CCCeEeecCccCCccccccCcccEEEecccccCCCcccCCceEEEEEe Confidence 4444432 334689999999999764211 1122223334567777899987654 2 333321 12222 22222 Q ss_pred -cceEEee-eeeeeeeeccCc-cc---ceeeEEeeeeeeeEEEcCCeEEEEE--ecCC Q lcl|NC_019506. 227 -KMACTFA-EQIVQTEAYRME-KR---FADAVKGLNVFGCKVIYPDALVCLK--KTNP 276 (276) Q Consensus 227 -~~a~~~~-~~~~~~e~~~~~-~~---~~~~i~~~~~yg~~v~~~~~vv~~~--~~~p 276 (276) +.++... .....++..+.. .. ....+++..++|.++++|++++.++ .++| T Consensus 329 fs~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~a~ 386 (392) T protein:vir:10 329 LKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSAP 386 (392) T ss_pred hhceEEEEeecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEeccccc Confidence 2333322 222233332211 11 2346889999999999999999965 5666 No 155 >protein:vir:7855 Length: 497 # NCBI annotation: gp12 # Family: family:all:585 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817462;genbank:gi:29565891;genbank:GeneID:1259081 Probab=99.27 E-value=9.4e-12 Score=81.04 Aligned_cols=265 Identities=12% Similarity=-0.002 Sum_probs=151.2 Q ss_pred Ccc------chhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccC--cccceeecCCCCCCCccccccc Q lcl|NC_019506. 1 MAV------TSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIG--AITVKEYTENSDIDAPEELSTT 72 (276) Q Consensus 1 MA~------~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~--~~~~~d~~~~~~~~~~~~~~~~ 72 (276) |.. -.++|+.|...+++.+++.+.+..+++.- + ..+.++.||+.. ...+..+.+++.... .+++.. T Consensus 151 ~~~~~~~~gg~~vp~~~~~~ii~~~~~~~~i~~l~~~~----~-~~~~~~~~~~~~~~~~~a~wv~E~~~~~~-s~~~f~ 224 (497) T protein:vir:78 151 NPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSR----P-VTSPNLSYLTESAAHNNAAAVAEAGTYPF-SSEEFA 224 (497) T ss_pred hhcccCcccccccchhhhHHHHHHHHhhhhHHhhcccc----c-cCCCceEEEEEcCCCCcceeeccCccccc-ccccce Confidence 222 14789999999999999999999887642 1 234568898753 345677777776654 457777 Q ss_pred eEEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccc-cc--------ccccc-------- Q lcl|NC_019506. 73 EKVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNA-TS--------KLKPA-------- 135 (276) Q Consensus 73 ~~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~-~~--------~~~~~-------- 135 (276) .+++...+... -+.|+++-.. ...++...+.+..++++++++|..++..-.... .. ....+ T Consensus 225 ~i~~~~~k~a~-~~~iS~ell~-d~~~l~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~ 302 (497) T protein:vir:78 225 RVYEQVGKVAN-ALTITDEGLR-DAPELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATS 302 (497) T ss_pred eeEeeeeeeEe-ecHhHHHHHH-hHHHHHHHHHHHHHHHHHHHHHHHhhcCCCcccccccccccccccccccccchhhhh Confidence 78888765533 3567766443 445677778889999999999998875311100 00 00000 Q ss_pred --------------c------------------------------cCCHHHHHHHHHHHHHHHhhcCCCccCCEEEECHH Q lcl|NC_019506. 136 --------------A------------------------------TLDKTNIYEELIKVKVKLDEKNVPTIGRFLIIPPD 171 (276) Q Consensus 136 --------------~------------------------------~~t~~~~~~~i~~a~~~l~~~~vP~~~r~~vv~p~ 171 (276) . ..+.......+..+...+..... ...-..++||. T Consensus 303 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~vmn~~ 381 (497) T protein:vir:78 303 ATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLF-QTPNAVVMNPR 381 (497) T ss_pred hhhhhhhhhcccccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhhcc-cCCCeEEEchH Confidence 0 00000111122222222222111 01115789999 Q ss_pred HHHHHhhhHHhhhhccc----ccccceeeeeeeEEeceEEEEeccccccccceEEEEEecceEEeeeeee-eeeeccC-c Q lcl|NC_019506. 172 VHGLLLAADLIVGTGGA----MAESITKNGFVGTILGFDVYLSNNMGSLTNGTGAIAGVKMACTFAEQIV-QTEAYRM-E 245 (276) Q Consensus 172 ~~~~L~~~~~~~~~~~~----~~~~~~~~G~i~~~~G~~v~~s~~lp~~~~~~~~~~~~~~a~~~~~~~~-~~e~~~~-~ 245 (276) .+..|.+...-...... +.......+.-.+++|.+|+.++.+|... ..+..+...++....+.. .++..+. . T Consensus 382 ~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~pV~~t~~~~~~~--~~~Gd~~~~~~~i~~r~~~~v~~~~~~~ 459 (497) T protein:vir:78 382 DWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGT--ILVGHFAPSVIQTARREGVTMQMTNSNG 459 (497) T ss_pred HHHHHHHhhcCCCceeccCcccccccccccCCceeeceeeEecCCCCCCc--eEEeecccceEEEEEecccEEEeecccc Confidence 99988654321111111 01111112223478999999999998422 222223334444443322 2222211 1 Q ss_pred ccc---eeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 246 KRF---ADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 246 ~~~---~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) ..| -..|++..++|..|.+|+++++++.+++ T Consensus 460 ~~f~~n~v~~r~~~r~~~~v~~p~A~~~l~~~~~ 493 (497) T protein:vir:78 460 TDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKG 493 (497) T ss_pred hhhhcCcEEEEEEEeecceeeccccEEEEEecCC Confidence 112 3568888999999999999999999888 No 156 >protein:vir:101650 Length: 497 # NCBI annotation: gp13 # Family: family:all:585 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654768;genbank:gi:109302766;genbank:GeneID:4156084 Probab=99.27 E-value=9.4e-12 Score=81.04 Aligned_cols=265 Identities=12% Similarity=-0.002 Sum_probs=151.2 Q ss_pred Ccc------chhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccC--cccceeecCCCCCCCccccccc Q lcl|NC_019506. 1 MAV------TSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIG--AITVKEYTENSDIDAPEELSTT 72 (276) Q Consensus 1 MA~------~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~--~~~~~d~~~~~~~~~~~~~~~~ 72 (276) |.. -.++|+.|...+++.+++.+.+..+++.- + ..+.++.||+.. ...+..+.+++.... .+++.. T Consensus 151 ~~~~~~~~gg~~vp~~~~~~ii~~~~~~~~i~~l~~~~----~-~~~~~~~~~~~~~~~~~a~wv~E~~~~~~-s~~~f~ 224 (497) T protein:vir:10 151 NPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSR----P-VTSPNLSYLTESAAHNNAAAVAEAGTYPF-SSEEFA 224 (497) T ss_pred hhcccCcccccccchhhhHHHHHHHHhhhhHHhhcccc----c-cCCCceEEEEEcCCCCcceeeccCccccc-ccccce Confidence 222 14789999999999999999999887642 1 234568898753 345677777776654 457777 Q ss_pred eEEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccc-cc--------ccccc-------- Q lcl|NC_019506. 73 EKVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNA-TS--------KLKPA-------- 135 (276) Q Consensus 73 ~~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~-~~--------~~~~~-------- 135 (276) .+++...+... -+.|+++-.. ...++...+.+..++++++++|..++..-.... .. ....+ T Consensus 225 ~i~~~~~k~a~-~~~iS~ell~-d~~~l~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~ 302 (497) T protein:vir:10 225 RVYEQVGKVAN-ALTITDEGLR-DAPELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATS 302 (497) T ss_pred eeEeeeeeeEe-ecHhHHHHHH-hHHHHHHHHHHHHHHHHHHHHHHHhhcCCCcccccccccccccccccccccchhhhh Confidence 78888765533 3567766443 445677778889999999999998875311100 00 00000 Q ss_pred --------------c------------------------------cCCHHHHHHHHHHHHHHHhhcCCCccCCEEEECHH Q lcl|NC_019506. 136 --------------A------------------------------TLDKTNIYEELIKVKVKLDEKNVPTIGRFLIIPPD 171 (276) Q Consensus 136 --------------~------------------------------~~t~~~~~~~i~~a~~~l~~~~vP~~~r~~vv~p~ 171 (276) . ..+.......+..+...+..... ...-..++||. T Consensus 303 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~vmn~~ 381 (497) T protein:vir:10 303 ATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLF-QTPNAVVMNPR 381 (497) T ss_pred hhhhhhhhhcccccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhhcc-cCCCeEEEchH Confidence 0 00000111122222222222111 01115789999 Q ss_pred HHHHHhhhHHhhhhccc----ccccceeeeeeeEEeceEEEEeccccccccceEEEEEecceEEeeeeee-eeeeccC-c Q lcl|NC_019506. 172 VHGLLLAADLIVGTGGA----MAESITKNGFVGTILGFDVYLSNNMGSLTNGTGAIAGVKMACTFAEQIV-QTEAYRM-E 245 (276) Q Consensus 172 ~~~~L~~~~~~~~~~~~----~~~~~~~~G~i~~~~G~~v~~s~~lp~~~~~~~~~~~~~~a~~~~~~~~-~~e~~~~-~ 245 (276) .+..|.+...-...... +.......+.-.+++|.+|+.++.+|... ..+..+...++....+.. .++..+. . T Consensus 382 ~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~pV~~t~~~~~~~--~~~Gd~~~~~~~i~~r~~~~v~~~~~~~ 459 (497) T protein:vir:10 382 DWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGT--ILVGHFAPSVIQTARREGVTMQMTNSNG 459 (497) T ss_pred HHHHHHHhhcCCCceeccCcccccccccccCCceeeceeeEecCCCCCCc--eEEeecccceEEEEEecccEEEeecccc Confidence 99988654321111111 01111112223478999999999998422 222223334444443322 2222211 1 Q ss_pred ccc---eeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 246 KRF---ADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 246 ~~~---~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) ..| -..|++..++|..|.+|+++++++.+++ T Consensus 460 ~~f~~n~v~~r~~~r~~~~v~~p~A~~~l~~~~~ 493 (497) T protein:vir:10 460 TDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKG 493 (497) T ss_pred hhhhcCcEEEEEEEeecceeeccccEEEEEecCC Confidence 112 3568888999999999999999999888 No 157 >protein:vir:95875 Length: 401 # NCBI annotation: major coat protein # Family: family:all:10944 # MgeID: mge:1586 # MgeName: N4 # Cross-refs: genbank:acc:YP_950534;genbank:gi:119952248;genbank:GeneID:5075702 Probab=99.27 E-value=4e-13 Score=88.57 Aligned_cols=272 Identities=15% Similarity=0.127 Sum_probs=154.2 Q ss_pred CccchhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCcccc-e-eecCCCCCCCc------------ Q lcl|NC_019506. 1 MAVTSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGAITV-K-EYTENSDIDAP------------ 66 (276) Q Consensus 1 MA~~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~~~~-~-d~~~~~~~~~~------------ 66 (276) |..+.- .--|-+..+.-..+.+++..+.+ ..+.+.+.|+||.++..-.+.. . -.++|.++... T Consensus 19 ~~~~~~-t~y~~~k~L~~Aa~~lv~~~fA~--~~piPkn~GkTIk~r~y~pl~~~~~pl~eGv~a~G~~~~~g~~y~~~r 95 (401) T protein:vir:95 19 NSDQMQ-TFFWLKKAIITARKEQYFMPLAS--VTNMPKHYGKTIKVYEYVPLLDDRNINDQGIDASGATIVNGNLYGSSK 95 (401) T ss_pred ccceee-ehhhHHHHHhhhhhhhhhhhccc--ccccccccCCeEEEEecccccccccchhcCCCcccccccCcccccccc Confidence 332221 11244445555556699999875 4567889999999987654432 1 12233322210 Q ss_pred ---------------------cccccceEEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHH-----HHH Q lcl|NC_019506. 67 ---------------------EELSTTEKVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETE-----KIL 120 (276) Q Consensus 67 ---------------------~~~~~~~~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d-----~~~ 120 (276) ...+-.++...|.++ ++=..++|+-+.-...+.+.+.+..-...-+.++. .++ T Consensus 96 dv~~it~~m~~~t~~~~rvn~v~~~~~d~~g~l~qy-G~~~e~Td~~~dt~~D~~l~~h~s~ell~g~~~~t~d~i~~dl 174 (401) T protein:vir:95 96 DIGNITSKLPLLTENGGRVNRVGFTRIAREGSIHKF-GFFYEFTQESIDFDSDDGLMEHLSRELMNGATQITEAVLQKDL 174 (401) T ss_pred ccceeecccccccccccccccccceeeeeeeeeeec-cCccchhhhhhhhhcchHHHHHHHHHHhhhhhhhHHHHHHHHH Confidence 001112244456555 33357788766555555555433222222222222 233 Q ss_pred HHHh----hccccc---cccccccCCHHHHHHHHHHHHHHHhhcCCCc-----------------cCCEEEECH------ Q lcl|NC_019506. 121 LKEM----DTNATS---KLKPAATLDKTNIYEELIKVKVKLDEKNVPT-----------------IGRFLIIPP------ 170 (276) Q Consensus 121 ~~~~----~~~~~~---~~~~~~~~t~~~~~~~i~~a~~~l~~~~vP~-----------------~~r~~vv~p------ 170 (276) ++.. .+.+.. ..+.....+..-.++.+.++...|+++..|. .-|+++|+| T Consensus 175 l~ag~~viyAg~ats~At~~~~~~~~t~vt~~~l~rl~~~L~~nRapk~t~~i~~s~~~dTk~i~~s~va~~h~~L~~di 254 (401) T protein:vir:95 175 LAAAGTVLYAGAATSDATITGEGSTPSVVSYKNLMRLDQILTENRTPTQTTIITGSRMIDTKVIGATRVMYVGSELVPEL 254 (401) T ss_pred HhhcCeeecCCccceeeeccccccccceechhHHHHHHHHHHhcccccchhhhhhhhccCccccccceEEEEecCchhHH Confidence 3221 111111 1111222233335788999999999877665 125789999 Q ss_pred HHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEecccc-------------------ccccceE-----EEEEe Q lcl|NC_019506. 171 DVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSNNMG-------------------SLTNGTG-----AIAGV 226 (276) Q Consensus 171 ~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp-------------------~~~~~~~-----~~~~~ 226 (276) ....+|..++.|.....++..+.+.+|.||.+.+|+|++++.+- .+++++. .+.+- T Consensus 255 ~a~~D~~~~~~fi~v~kYa~~~~i~~gEiG~i~~vR~i~~p~~~~w~~ag~~a~~~~~~y~~~~~~~gg~~dVyp~lV~G 334 (401) T protein:vir:95 255 KAMKDLFGNKAFIETQHYADAGTIMNGEVGSIDKFRIIQVPEMLHWAGAGAQATGANPGYRTSMVSGQEHYDVYPMLVVG 334 (401) T ss_pred HHHHHhcCCCCceehhhcCCccccccccccccCceeEEecccceeecCCcccccccccccccccccCCCcceeeeeeEEc Confidence 55567778889999999999899999999999999999887632 1111111 12233 Q ss_pred cceEEeee----e----eee---ee-----eccCcccceeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 227 KMACTFAE----Q----IVQ---TE-----AYRMEKRFADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 227 ~~a~~~~~----~----~~~---~e-----~~~~~~~~~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) +.|++... . +.. -. .-.+|..+.-.+.-++.|++.+++|+.+++|+..+| T Consensus 335 ~dAf~~~~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQ~g~vgwK~~~a~~vL~~e~m~~ies~a~ 400 (401) T protein:vir:95 335 DDSFTSIGFQTDGKSLKFTVMTKMPGKETADRNDPYGETGFSSIKWYYGILVKRPERLALIKTVAP 400 (401) T ss_pred cccceecccccCCccccceeEeecCCcCCCCCCCcccceehhhhhhhhhhheeccceeEEEEeecC Confidence 44444321 0 000 01 113344444456667899999999999999999999 No 158 >protein:vir:8420 Length: 477 # NCBI annotation: gp15 # Family: family:all:21 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818316;genbank:gi:29566752;genbank:GeneID:1260033 Probab=99.21 E-value=7.2e-12 Score=81.67 Aligned_cols=269 Identities=12% Similarity=0.087 Sum_probs=146.7 Q ss_pred Cc------cchhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCccc--ceeecCCCCCCC----ccc Q lcl|NC_019506. 1 MA------VTSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGAIT--VKEYTENSDIDA----PEE 68 (276) Q Consensus 1 MA------~~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~~~--~~d~~~~~~~~~----~~~ 68 (276) +. ..++.|+.+..++++.+++.+++.+++..-. ....+.++.||+....+ .....++..... ..+ T Consensus 157 ~~~~~~~gg~lv~~~~~~~~ii~~l~~~~~i~~~~~~~~---~~~~~~~~~ip~~~~~~~~a~~~~Eg~~~~~~~~~~s~ 233 (477) T protein:vir:84 157 LDRNGGTGGYAVPPLWMMNRFIELARAGRTYANLCPTEP---LPGGTSSINIPKILTGTSTAIQAADNAALTAPSAHEVD 233 (477) T ss_pred ccccCCCcceeeccchhHHHHHHHhhhcchHHHhhceee---ecCCcceeEEEEEecCcceeeeeccCcccccccccccc Confidence 11 1124466778889999999888888775421 11234678999764332 223344433221 123 Q ss_pred cccceEEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc-----------cccccc Q lcl|NC_019506. 69 LSTTEKVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSK-----------LKPAAT 137 (276) Q Consensus 69 ~~~~~~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~-----------~~~~~~ 137 (276) ++...+++...+. +.-+.|+++-...+..++.+.+.+..+++++.++|..++..-.++.... ....+. T Consensus 234 ~~f~~i~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~~l~G~Gt~~~p~Gi~~~~~~~~~~~~~~~ 312 (477) T protein:vir:84 234 LTDGFVQANVKTI-AGQQGIAIQLLDQAAVSVDEFVFRDLAADYANKLNVQVISGTGSNNQVVGVRATAGITQVTATSAG 312 (477) T ss_pred cceeeEEEeeeeE-EeeeHHHHHHHhccchhHHHHHHHHHHHHHHHHHHHHHhccCCCCCccceeeeccccccccccccc Confidence 4455566666554 3346777777777788999999999999999999998884322111100 111111 Q ss_pred CC---HHHHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhc-----ccc------cccceeeeeeeEEe Q lcl|NC_019506. 138 LD---KTNIYEELIKVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTG-----GAM------AESITKNGFVGTIL 203 (276) Q Consensus 138 ~t---~~~~~~~i~~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~-----~~~------~~~~~~~G~i~~~~ 203 (276) .+ ....++.|.++...++.... ......+++|..+..|.+...-.... ... ....+..|..++++ T Consensus 313 ~t~~~~~~~~~~i~~~~~~~~~~~~-~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~~~~~~~~~l~ 391 (477) T protein:vir:84 313 SALEKHQIIYQKIADAIQRVHTSRF-LEPEVIVMHPRRWASFHAIFAGDDRPLIVPSGPGFNNLGVLTEVASQRVVGQMH 391 (477) T ss_pred cchhhHHHHHHHHHHHHhhcccccc-CCccEEEEcHHHHHHHHHhhccCCCeeeecCcccccccccccccccccccchhc Confidence 12 12234556666555544332 22346899999998886643211110 000 01224455667899 Q ss_pred ceEEEEeccccccccc----eEEEEEecceEEeeeeeeeeeeccCcccceeeEEee----eeeeeEEEc-CCeEEEEE-- Q lcl|NC_019506. 204 GFDVYLSNNMGSLTNG----TGAIAGVKMACTFAEQIVQTEAYRMEKRFADAVKGL----NVFGCKVIY-PDALVCLK-- 272 (276) Q Consensus 204 G~~v~~s~~lp~~~~~----~~~~~~~~~a~~~~~~~~~~e~~~~~~~~~~~i~~~----~~yg~~v~~-~~~vv~~~-- 272 (276) |++|+.++.+|...+. ...+++.-+.+..... .+.....+..+++..... -.+++..+| |+++++++ T Consensus 392 G~pVv~s~~~p~~~~~~~d~~~i~~gd~~~~~i~~~--~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~r~~~afv~~t~~ 469 (477) T protein:vir:84 392 GLPVVTDPTLPTTLGTGTDQDVIHVLRASDLALFES--SVRMRALQETRAENLSVLLQVYGYLAFTAARFPQSVVEIGGT 469 (477) T ss_pred ccceEecCcccccccccCCcceEEEEEeceEEEEee--ceeEEeccccccccceeeeeehhhhhhhhhccccceEEeecc Confidence 9999999999964221 1222222222222221 122223333333322222 223344556 99999988 Q ss_pred -ecCC Q lcl|NC_019506. 273 -KTNP 276 (276) Q Consensus 273 -~~~p 276 (276) .|+| T Consensus 470 ~~~~~ 474 (477) T protein:vir:84 470 ALTAP 474 (477) T ss_pred ccccc Confidence 5677 No 159 >protein:vir:78387 Length: 349 # NCBI annotation: putative coat protein # Family: family:all:1522 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110837;genbank:gi:134288598;genbank:GeneID:5179650 Probab=99.20 E-value=7.8e-12 Score=81.49 Aligned_cols=267 Identities=12% Similarity=0.083 Sum_probs=163.4 Q ss_pred Cccc----hhhHH--HHHHHHHHHHHHhhcchh--hhccc--cccccccCCcEEEEeccCccc-cee--ecCC--CCCCC Q lcl|NC_019506. 1 MAVT----SFIPK--LWSARLLAHLDKAHVVAN--LVNRD--YEGEIKAYGDTVKINQIGAIT-VKE--YTEN--SDIDA 65 (276) Q Consensus 1 MA~~----~l~~e--~~~~~~~~~l~~~~v~~~--~~~~~--~~~~~~~~Gdtv~ip~~~~~~-~~d--~~~~--~~~~~ 65 (276) ||.+ .++|| +|...+.+.-.+.+.|.. .+.++ +......+|+++++|.++.+. ..+ +... .+... T Consensus 1 Ma~T~l~D~iipe~~vf~~Yv~~~~~e~~~l~qSGii~~d~~l~~~~~~gG~~~~iPf~~~L~g~~e~nv~~D~~~~~~t 80 (349) T protein:vir:78 1 MAITTIGDIVTGNIPVLASYMTEDPVEKTAFFDSGILTSTPYAAEIANGPSNIANLPFWKAIDTSIEPNYSNDVYQDIAT 80 (349) T ss_pred CCceEEeeeeccCHHHHHHHHHHhhHHhhhhhhccceeccHHHHHHhhcCCCEEEeeeeecCCCCcccccCCCCcccccc Confidence 9965 36787 799999888755544443 22222 211223579999999998753 222 2111 11222 Q ss_pred ccccccceEEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc----------- Q lcl|NC_019506. 66 PEELSTTEKVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSKLKP----------- 134 (276) Q Consensus 66 ~~~~~~~~~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~~~~----------- 134 (276) +..++..+....+ .++.+++..+|+-...+-.|+++++.++.+.--.+.....+++.++........+ T Consensus 81 ~~kitt~~~~a~~-~~r~kaw~~~Dla~~lsG~dpm~~Ia~~va~yW~r~~q~~Lia~L~Gvf~~~~~a~~~~~~~~~~t 159 (349) T protein:vir:78 81 PRAIQTGEMMARV-AYLNEGFGQADLTVELTSQNPLQSVASRLDNFWQRQAQRRLIATALGLYNDNVSATDAYHEQNDMV 159 (349) T ss_pred cccccccceeeee-eeeccccchhHHHHHhhCchHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcccccccchhhhcccce Confidence 3455544443333 5678888888887776667999999999998888888888887766432111000 Q ss_pred -cccCCHHHHHHHHHHHHHHHhhcCCC---ccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEe Q lcl|NC_019506. 135 -AATLDKTNIYEELIKVKVKLDEKNVP---TIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLS 210 (276) Q Consensus 135 -~~~~t~~~~~~~i~~a~~~l~~~~vP---~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s 210 (276) ....+.....+.|..|...|.+.-.- ..-..+++||..+..|.+... +..-. ..-....|+.+.|..|+++ T Consensus 160 ~d~s~~a~~~~~~~~dA~~~lgda~~Gd~~~~lt~i~mHS~v~~~L~~~~l-i~~i~----~s~~~~~i~ty~G~~VivD 234 (349) T protein:vir:78 160 VDVSATLGFDAGAFIDATQTMGDALMGNGGEVLGAIAMHSFVYAQARKAQL-IDFIR----DAENNTMFATYQGYRVIVD 234 (349) T ss_pred eeeccccCCChhhhhhhHHHHHHHhccccccceeEEEEchHHHHHHHhhhh-hhhcc----CcccCcccceecCeEEEEe Confidence 00001111245677777777765221 112368999999999987643 33221 1123446889999999999 Q ss_pred cccccccc----ceEEEEEecceEEeeeee--eeeeeccCcccc----eeeEEeeeeeeeEEEcCCeEEEEEec------ Q lcl|NC_019506. 211 NNMGSLTN----GTGAIAGVKMACTFAEQI--VQTEAYRMEKRF----ADAVKGLNVFGCKVIYPDALVCLKKT------ 274 (276) Q Consensus 211 ~~lp~~~~----~~~~~~~~~~a~~~~~~~--~~~e~~~~~~~~----~~~i~~~~~yg~~v~~~~~vv~~~~~------ 274 (276) ..+|+... .+.++.+..+|+++.... ..+|..|++... .|.+..+.+|. +.|.|+--.... T Consensus 235 D~~Pv~~~g~~~~yttylfg~GAi~~~~~~~~~~~et~rd~~~g~~~G~d~l~~R~~~~---~hp~G~s~~~a~v~~~~~ 311 (349) T protein:vir:78 235 DSMTVVGQGAQRKFISIIFGQGAIGYGEGNPVMPLEYEREASRANGGGVETLWTRKTWL---LHPFGYRFTSAVITGNGT 311 (349) T ss_pred CCCccccCCCCceEEEEEeecceEEEccCCCccceeeecccccCCcceeEEEEEeeEEE---eeeeeeeeccccccCCcc Confidence 99997653 346778899999987533 346777777543 36666666654 345555444332 Q ss_pred -----CC Q lcl|NC_019506. 275 -----NP 276 (276) Q Consensus 275 -----~p 276 (276) .| T Consensus 312 ~~~~~sP 318 (349) T protein:vir:78 312 ETIARSA 318 (349) T ss_pred ccccCCC Confidence 34 No 160 >protein:vir:79928 Length: 393 # NCBI annotation: major head protein # Family: family:all:30335 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429616;genbank:gi:156564106;genbank:GeneID:5525693 Probab=99.18 E-value=2.7e-12 Score=83.99 Aligned_cols=266 Identities=15% Similarity=0.212 Sum_probs=172.8 Q ss_pred Ccc---chhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCcccceeecCCCCCCCccccc-cceEEE Q lcl|NC_019506. 1 MAV---TSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGAITVKEYTENSDIDAPEELS-TTEKVL 76 (276) Q Consensus 1 MA~---~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~-~~~~~~ 76 (276) |+- ++++|.+++..+.+.-++-.+...++.. -..+.|.+..||.+|..-+.++.+|+.+.. ++++ .+.-.+ T Consensus 74 mtt~~a~IliP~vis~v~~Eaaepl~~~~kl~qk----~~L~~Grsm~F~~~g~~Ra~~IgEGgE~~~-~sld~~T~dsv 148 (393) T protein:vir:79 74 MATPSAQILIPRVIVGTMREAAEPLYIGTKMLQK----IRLKSGQSMIFPSIGIMRAYDVAEGQEIPE-DSIDWQTHESP 148 (393) T ss_pred hcCCCcceechhhhhhhhhhcccchhHHHHHHHH----HhhhcCcceeccchheeeeccccccccccc-cchhhhcCCce Confidence 663 5789999999999987776666666643 234579999999999888888888888754 4555 233344 Q ss_pred EEEeee-ecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc------cccc--------cCCHH Q lcl|NC_019506. 77 EINKQK-YFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSKL------KPAA--------TLDKT 141 (276) Q Consensus 77 ~ld~~~-~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~~------~~~~--------~~t~~ 141 (276) ++.+.+ +..+.++++-...+-+|++.-.+++++++|+++.|+.++....+....+. +.+. ...+. T Consensus 149 ~~~~gK~G~~Ia~SqEmIsDSg~Dvin~~l~aA~RaMaRkKee~a~n~fk~~ghtvfDa~st~t~ahptGr~~~~~qNGT 228 (393) T protein:vir:79 149 EIRVGKSGIRLRFTDEMISDSQWDLMSMMIKQAGRAMGRHKEQKAYHQFRSHGHTVFDNYSTNKLAHTTGLDKNGVQNDT 228 (393) T ss_pred eEEechhhhhhhhHHHHhhcchHHHHHHHHHHHHHHHHhhhHHHHHhhhhcccceeeeccccCccceeecCCcccccccc Confidence 444443 45578888888889999999999999999999999999998877554221 1111 11112 Q ss_pred HHHHHHHH-HHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhh--c---cccc---c------cceeeeeeeEEeceE Q lcl|NC_019506. 142 NIYEELIK-VKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGT--G---GAMA---E------SITKNGFVGTILGFD 206 (276) Q Consensus 142 ~~~~~i~~-a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~--~---~~~~---~------~~~~~G~i~~~~G~~ 206 (276) -.++.|.+ +.+.+...-.| -+++++|-+|..+.+....-.. . +++. + ..+++|++. +.++ T Consensus 229 lSleDllDm~~av~~~hyt~---svi~MHPLAWnv~AKna~me~~~~na~gN~~~~~~~ts~algp~~i~~~~~--~nln 303 (393) T protein:vir:79 229 FSAEDFLDLIIAVMANEYTP---SDLMMHPLAWTVFAKNELMGSLQANPYGNYPAKGAPSSMALGPDSIQGRLP--FNFN 303 (393) T ss_pred ccHHHHHHHHHHHhcccCCc---ceEEEcCchhhhhhhhhhhcceeeccccccCccccchhhhhchhhhccccc--ccee Confidence 22444444 33444444434 4699999999998886433111 0 1110 1 112222221 4599 Q ss_pred EEEeccccccccceEE--EEEecce--EEeeeeeeeeeeccCcccceeeEEeeeeeeeEEEc-CCeEEEEE-------ec Q lcl|NC_019506. 207 VYLSNNMGSLTNGTGA--IAGVKMA--CTFAEQIVQTEAYRMEKRFADAVKGLNVFGCKVIY-PDALVCLK-------KT 274 (276) Q Consensus 207 v~~s~~lp~~~~~~~~--~~~~~~a--~~~~~~~~~~e~~~~~~~~~~~i~~~~~yg~~v~~-~~~vv~~~-------~~ 274 (276) |+.|+-+|-.+....+ ++.-++. +-++.....++.++++-+--.-|+-..|||.+|++ ..+|.+.| +. T Consensus 304 v~~sPfvp~d~k~~rFd~~~Vd~NnvgvlLV~D~i~tdq~ddk~rdiq~iKl~ERYG~gvLn~gkaiavakNI~~~k~y~ 383 (393) T protein:vir:79 304 VNLSPFIPLDKKSRRFDVYAVDRNNVGVLLVRDDLKTDQWDEKARGLQNIKMIERYGIGILNEGKAIAVAKNISMDKSYA 383 (393) T ss_pred EEEecccccccccceeeEEEeecCCceEEEEecCcceeccccccccceeeeeeeeeceeeeeCCceEEEEecceeecccc Confidence 9999999977655443 2333333 33455555677777766555678888999999997 56676654 23 Q ss_pred CC Q lcl|NC_019506. 275 NP 276 (276) Q Consensus 275 ~p 276 (276) .| T Consensus 384 ~P 385 (393) T protein:vir:79 384 EP 385 (393) T ss_pred cc Confidence 44 No 161 >protein:vir:9875 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795637;genbank:gi:28876404;genbank:GeneID:1257935 Probab=99.15 E-value=2e-12 Score=84.74 Aligned_cols=251 Identities=12% Similarity=0.050 Sum_probs=140.6 Q ss_pred Ccc-------------------chhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEE-eccCcc-cceeecC Q lcl|NC_019506. 1 MAV-------------------TSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKI-NQIGAI-TVKEYTE 59 (276) Q Consensus 1 MA~-------------------~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~i-p~~~~~-~~~d~~~ 59 (276) |-. ++=..+.|++-+.+.+ + .+..+. + .+...|++|++ |.|... .+.++.+ T Consensus 1 ~~~~~~~~e~nlt~~~dl~~~~siDf~~~f~~~i~~L~-~---~LGv~r--~--~pla~GstIkt~k~~~y~gda~dVaE 72 (296) T protein:vir:98 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLL-E---MLGVTR--K--ISVSEGMTLKTYAGYDVTLAEGNVPE 72 (296) T ss_pred CCCccccCcCCCcchhhhhhhhhhhhHHHHhhhHHHHH-H---Hhhhcc--c--ccccCCCEEeeccceeeeeccccccC Confidence 211 1101234444433322 2 222221 2 24466999955 667765 4778888 Q ss_pred CCCCCCcccccc---ceEEEEEEeeeecceeechHHH-HhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccc Q lcl|NC_019506. 60 NSDIDAPEELST---TEKVLEINKQKYFNFQIDDVDA-AQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSKLKPA 135 (276) Q Consensus 60 ~~~~~~~~~~~~---~~~~~~ld~~~~~~~~v~d~d~-~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~~~~~ 135 (276) |..+.. ..++. +..++++++++ +. ++|+.. .....+...+.-+|+..+|++++|++++..++.++..... T Consensus 73 Ge~Ipl-skvt~~~~~t~t~~ikK~r-K~--tTdEAIqlsGyg~aVgetd~qL~~~iq~kId~d~~t~LktaT~t~~~-- 146 (296) T protein:vir:98 73 GEVIPL-SKVERKIHSEKKIELKKYR-KA--TTGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGTQDA-- 146 (296) T ss_pred Ccccch-hhheeeecceEEEEeeccc-cc--cCHHHHHhhcCCchhHHHHHHHHHHHHHhhhHHHHHHHhcccceeee-- Confidence 888754 45554 34778887763 33 588885 6788999999999999999999999999999877643221 Q ss_pred ccCCHHHH----HHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEec Q lcl|NC_019506. 136 ATLDKTNI----YEELIKVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSN 211 (276) Q Consensus 136 ~~~t~~~~----~~~i~~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~ 211 (276) ++..+ ...+.++...|++.+ ....++|++|...+.++++..+. .....+-..+. +++|..|++|. T Consensus 147 ---t~~~lQ~Ala~~~~~l~~~feded--~~~~V~FVnP~D~a~ylg~a~it-~qt~fG~tyl~-----nfLG~~II~S~ 215 (296) T protein:vir:98 147 ---LGAGLQGALASAWGKLQVLFEDYG--SERAIVFANSLDVAEYIAKAGIT-TQTAFGLTYLV-----DFTGTVIISTN 215 (296) T ss_pred ---chhhHHHHHHHHhhhhhhhccccC--CCceEEEEehHHHHHHhcCCccc-hhheechhhhh-----hccccEEEEcC Confidence 22222 233455556666654 23568999999999999987653 22222211222 48999999999 Q ss_pred cccccc-----cceEEEEEecceEE-eeeeeeee-------eeccCcccceeeEEeeeeeeeE--EEcCCeEEEEEecCC Q lcl|NC_019506. 212 NMGSLT-----NGTGAIAGVKMACT-FAEQIVQT-------EAYRMEKRFADAVKGLNVFGCK--VIYPDALVCLKKTNP 276 (276) Q Consensus 212 ~lp~~~-----~~~~~~~~~~~a~~-~~~~~~~~-------e~~~~~~~~~~~i~~~~~yg~~--v~~~~~vv~~~~~~p 276 (276) .+|... ..+..+++.+...+ ++..++-. -...++....=-+.....-|.. .=++|||++.+.++- T Consensus 216 kV~~G~~~~T~~~Ni~~ay~~~~~~~l~~~f~~~~d~tglIGv~h~~~~~~~t~eT~~~~~~~lfpE~~dgiv~~tI~~~ 295 (296) T protein:vir:98 216 DVTKGEIWATVPENIIFAYINPNNSELAKEFNLYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIVKVTLTPG 295 (296) T ss_pred cCCCceEEEeeecceEEEeecccccchhhhhccccccccceEEEeccccceeeehhHhHhHHHhcccccceEEEEEecCC Confidence 998421 22233444432111 11111100 0111111111111222222222 227899998776544 No 162 >protein:vir:94989 Length: 349 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224029;genbank:gi:62327316;genbank:GeneID:5176817 Probab=99.14 E-value=2.8e-11 Score=78.41 Aligned_cols=263 Identities=13% Similarity=0.087 Sum_probs=162.5 Q ss_pred Cccc----hhhHH--HHHHHHHHHHHHhhcchh--hhccc--cccccccCCcEEEEeccCccc-cee--ecCCCC--CCC Q lcl|NC_019506. 1 MAVT----SFIPK--LWSARLLAHLDKAHVVAN--LVNRD--YEGEIKAYGDTVKINQIGAIT-VKE--YTENSD--IDA 65 (276) Q Consensus 1 MA~~----~l~~e--~~~~~~~~~l~~~~v~~~--~~~~~--~~~~~~~~Gdtv~ip~~~~~~-~~d--~~~~~~--~~~ 65 (276) ||.+ .++|| +|...+.+.-.+...|.. .+.++ +......+|+.+++|.++.+. -.+ +...+. ... T Consensus 1 Ma~T~l~D~iipe~~vf~~Yv~~~~~e~~~l~qSGii~~d~~l~~~~~~gG~~~~iPf~~~l~g~~e~n~~~dt~~~~~t 80 (349) T protein:vir:94 1 MAITTIGNIVTGNIPVLASYMTEDPVEKTAFFNSGILTPTPYAAEIARGPSNIANLPFWKAIDTSIEPNYSNDVYQDIAT 80 (349) T ss_pred CCceEEeeeeccChHHHHHHHHHhHHHhhhhhhccceeccHHHHHHHhcCCCEEEeeeeecCCCCcccccCCCCcccccc Confidence 9965 36777 799999888755444444 23222 211223579999999997753 222 222221 111 Q ss_pred ccccccceEEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc------------- Q lcl|NC_019506. 66 PEELSTTEKVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSKL------------- 132 (276) Q Consensus 66 ~~~~~~~~~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~~------------- 132 (276) +..++..+... +..++.+++..+|+-...+-.|+++.+.++.+.--.+.....+++.++....... T Consensus 81 ~~kit~~~~~a-~~~~r~kaw~~~Dla~~lsG~dpm~~Ia~~va~yW~r~~q~~Lia~L~Gvf~~~~~~~~~~~~~~~~~ 159 (349) T protein:vir:94 81 PRAIQTGEMMA-RVAYLNEGFGQADLTVELTSQNPLQSVASRLDNFWQRQAQRRLIATALGLYNDNVSATDAYHEQNDMV 159 (349) T ss_pred cccccccceee-eeeeeccccchhHHHHHhhCchHHHHHHHHHHHHHhhHHHHHHHHHHHhhhcccccccccccccCcee Confidence 24444433333 3356778888888877766679999999999988888888888887764322110 Q ss_pred ---cccccCCHHHHHHHHHHHHHHHhhcCCC--ccC-CEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceE Q lcl|NC_019506. 133 ---KPAATLDKTNIYEELIKVKVKLDEKNVP--TIG-RFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFD 206 (276) Q Consensus 133 ---~~~~~~t~~~~~~~i~~a~~~l~~~~vP--~~~-r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~ 206 (276) ...+..+ .+.|..|...|.+...- .+. ..++||+..+..|.+...+ ..-+ ..-....|+.+.|.. T Consensus 160 ~d~~~~a~~~----~~~~~~A~~~~Gdaa~Gd~~~~lt~i~mHS~v~~~L~~~~li-~~i~----~s~~~~~i~ty~G~~ 230 (349) T protein:vir:94 160 VDVSATSGFD----AGAFIDATQTMGDALMGNGGEVLGAIAMHSFVYAQARKAQLI-DFIR----DAENNTMFATYQGYR 230 (349) T ss_pred EEecccCCCC----hhhHHHHHHHHHHHhccccccceeEEEEchHHHHHHHhcchh-hhcc----CcccCcccceecCcE Confidence 0111122 34566677777665321 111 3689999999999886543 2221 111344578999999 Q ss_pred EEEecccccccc----ceEEEEEecceEEeeeeee--eeeeccCcccc----eeeEEeeeeeeeEEEcCCeEEEEEec-- Q lcl|NC_019506. 207 VYLSNNMGSLTN----GTGAIAGVKMACTFAEQIV--QTEAYRMEKRF----ADAVKGLNVFGCKVIYPDALVCLKKT-- 274 (276) Q Consensus 207 v~~s~~lp~~~~----~~~~~~~~~~a~~~~~~~~--~~e~~~~~~~~----~~~i~~~~~yg~~v~~~~~vv~~~~~-- 274 (276) |+++..+|+... ...++.+.++|+++..... .+|..|++... .|.+..+.+| ++.|-|+--.... T Consensus 231 VivDD~~Pv~~~g~~~~yttylfg~GAi~~~~~~~~~~~E~~rd~~~g~~~G~d~L~~R~~~---~~hp~G~s~~~a~v~ 307 (349) T protein:vir:94 231 VIVDDSMTVVGQDTSRKFISIIFGQGAIGYGEGNPEMPLEYEREASRANGGGVETLWTRKTW---LLHPFGYSFTSAVIT 307 (349) T ss_pred EEEeCCCccccCCCCceEEEEEeecceEEeecCCCCcceeeecccccCCcceeEEEEEeeEE---EeeeeeeeecccccC Confidence 999999997543 4456788899999986543 46777777543 3666666655 3455555554432 Q ss_pred ---------CC Q lcl|NC_019506. 275 ---------NP 276 (276) Q Consensus 275 ---------~p 276 (276) .| T Consensus 308 ~~~~~~~~~sP 318 (349) T protein:vir:94 308 GNGTETIARSA 318 (349) T ss_pred CCccccccCCC Confidence 34 No 163 >protein:vir:106647 Length: 303 # NCBI annotation: ORF011 # Family: family:all:1178 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239493;genbank:gi:66395226;genbank:GeneID:4555801 Probab=99.07 E-value=1.2e-11 Score=80.42 Aligned_cols=258 Identities=11% Similarity=0.056 Sum_probs=139.2 Q ss_pred Cccc--hhh------------HHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccC---c-ccceeecCCCC Q lcl|NC_019506. 1 MAVT--SFI------------PKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIG---A-ITVKEYTENSD 62 (276) Q Consensus 1 MA~~--~l~------------~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~---~-~~~~d~~~~~~ 62 (276) |+.. +.. .+.|+.-+.+ |.+ .+..+. +. +...|.+|+++++. . ..+.++.+|.. T Consensus 1 M~~e~nl~~~~dL~~a~siDF~~~f~~~i~~-L~~---~LGv~r--~~--pla~Gt~iktyK~~~~~y~gda~dVaEGe~ 72 (303) T protein:vir:10 1 MSAENNLINVEALGKAKSIDFANKLGVGLNK-LFE---ALAIQN--KI--PMNVGSALKQYRFKVEDSEKPNGDVAEGDV 72 (303) T ss_pred CCCCcCCcchhhcccceeehhhhhhhhhHHH-HHH---Hhhhhc--cc--cccCCceeeeeeeeceeeccccccccCCcc Confidence 6642 221 2334443332 222 222221 22 33468888766653 3 23667888877 Q ss_pred CCCcccccc---ceEEEEEEeeeecceeechHHH-HhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc-cccccc Q lcl|NC_019506. 63 IDAPEELST---TEKVLEINKQKYFNFQIDDVDA-AQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSK-LKPAAT 137 (276) Q Consensus 63 ~~~~~~~~~---~~~~~~ld~~~~~~~~v~d~d~-~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~-~~~~~~ 137 (276) +.. ..+.. ...++++++++. .++++.. .....+...+.-+++.++|++++|.+++..++.++... .+..+. T Consensus 73 Ipl-skvt~~~~~t~~~~~kK~rK---~tTdEAIqlsGyg~aVgetd~qL~~~Iq~kIdnd~~~~lktaT~t~~~t~~t~ 148 (303) T protein:vir:10 73 IPL-TKVTREQVDITELQFAKYRK---STSAEAIQAHGYDLAINQTDNEMIKYVQKKFRAKFFETLKSAIENGKRTNKTK 148 (303) T ss_pred cch-hhheeeecceEEEEeecccc---cccHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhhccccccccccee Confidence 754 45553 356788877633 3488886 67888999999999999999999999999999876432 222233 Q ss_pred CCHHHHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEeccccccc Q lcl|NC_019506. 138 LDKTNIYEELIKVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSNNMGSLT 217 (276) Q Consensus 138 ~t~~~~~~~i~~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp~~~ 217 (276) .+.+.+-+++.....+|+...=-...-++|+||...+.++++..........+-..+. +++|+.|++|..+|... T Consensus 149 ~s~~glq~Al~~~~~kl~~~~ed~~~~V~FvNP~Daa~yl~~A~i~~~~t~fG~n~L~-----nfLG~~II~S~kv~~G~ 223 (303) T protein:vir:10 149 LSAENLQGALSKGRANLSVLLDDEITPIAFVNPNDTAEYLANGFINSTGAQFGVNLLT-----PYVGVKIVEFADVPQGE 223 (303) T ss_pred ecHHHHHHHHHhhhhhccccccccccEEEEEchHHHHHHhhcCCcchhhhhhhhhhhh-----hhhcceEEEeccCCCce Confidence 4444444444444334332221112248999999999999876554221122333343 48999999999998422 Q ss_pred -----cceEEEEEecceEE-eeeeeee-------eeeccCcccceeeEEeeeeeeeEE--EcCCeEEEEEecC------C Q lcl|NC_019506. 218 -----NGTGAIAGVKMACT-FAEQIVQ-------TEAYRMEKRFADAVKGLNVFGCKV--IYPDALVCLKKTN------P 276 (276) Q Consensus 218 -----~~~~~~~~~~~a~~-~~~~~~~-------~e~~~~~~~~~~~i~~~~~yg~~v--~~~~~vv~~~~~~------p 276 (276) ..+..+++.+.. | +++.+.- +-...++....=-+.....-|... =++|||++.+-++ | T Consensus 224 ~~~T~~~Ni~~ay~~~~-g~l~~~f~~t~D~tglIGv~h~~~~~~~t~eT~~~~~~~lfpE~~dgiv~~ti~~~e~~~~~ 302 (303) T protein:vir:10 224 VWMTVAENLNVAYANPR-GELSRAFAFATDATGFVGVLHDIQPQRLTSDTIYASAISMFPENIDAVIKVTIKKDEAGELP 302 (303) T ss_pred EEEeeccceEEEEecCc-hhhhhhhhhccccccceEEEeccccceeeehhHhHhHHHhcccccceEEEEEEeccccCCCC Confidence 122233333221 1 1111110 000111111111112222222222 2789999987632 3 No 164 >protein:vir:4159 Length: 315 # NCBI annotation: structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046968;genbank:gi:9630538;genbank:GeneID:1261712 Probab=99.05 E-value=1.7e-10 Score=74.13 Aligned_cols=266 Identities=8% Similarity=-0.041 Sum_probs=158.9 Q ss_pred Ccc-----chhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCcc--cc--ee-ecCCCCCCCccccc Q lcl|NC_019506. 1 MAV-----TSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGAI--TV--KE-YTENSDIDAPEELS 70 (276) Q Consensus 1 MA~-----~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~~--~~--~d-~~~~~~~~~~~~~~ 70 (276) |-. -++.|++++. +++.+.+.+.++++++.- ....+.+..|+.++.. .. .+ ..+..... ...++ T Consensus 19 ~t~~d~~Gg~l~P~~~~~-~i~~~~e~s~~l~~~~vi----~~~~~~~~~i~~~g~~~~~~~g~~~~~~~~~~~-~~~~~ 92 (315) T protein:vir:41 19 IDVPDLGRGVLSVDRFGE-FVKAVRDSAVIIPEARID----NALKSYEKDISRLSLVLDVGPGRDETGQKLAPP-ESTAE 92 (315) T ss_pred cCCcCCCCceechHHHHH-HHHHHHhhhhhhhhceee----eccccccccccccccCcccccccccccCcCCCC-CCccc Confidence 222 2478999865 667888889999987641 1112345555554322 11 11 11111211 13456 Q ss_pred cceEEEEEEeeeecceeechHHHHhhhh--hHHHHHHHHHHHHHHHHHHHHHHHHhhcc--c-------------ccccc Q lcl|NC_019506. 71 TTEKVLEINKQKYFNFQIDDVDAAQIRT--PLMDAAMQRAAYALADETEKILLKEMDTN--A-------------TSKLK 133 (276) Q Consensus 71 ~~~~~~~ld~~~~~~~~v~d~d~~~~~~--d~~~~~~~~~~~ala~~~d~~~~~~~~~~--~-------------~~~~~ 133 (276) .+.+++.+.+. +..+.|+++-...+.. |+.+.+....++++++..+..++..=.++ + ....+ T Consensus 93 f~~~~l~~~~l-~~~~~it~elL~D~~~~~~~e~~l~~~~a~~~a~~~~~~~~nGdg~s~~p~~~~~~G~l~~a~~~~~~ 171 (315) T protein:vir:41 93 VKTNTLYMREM-VTKVVIHEDAIEDNIEGKAFEQKIVTLLGEGISYVLEKYYLHGDTSSSDPLLRMSDGWLKLASEKLTE 171 (315) T ss_pred cceeeeceeee-eeeccccHHHHHhhhccccHHHHHHHHHHHHHHHHHHHHhhccCCcCcCccccccccceecccccccc Confidence 67777777554 3446788877766654 88999999999999999888777542211 0 00000 Q ss_pred c-cccCCHHHHHHHHHHHHHHHhhcCCC-ccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEec Q lcl|NC_019506. 134 P-AATLDKTNIYEELIKVKVKLDEKNVP-TIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSN 211 (276) Q Consensus 134 ~-~~~~t~~~~~~~i~~a~~~l~~~~vP-~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~ 211 (276) . .+........+.|.++...|....-- ..+-+.++++..+..+++.. .....+.++..+..|....++|.+|+..+ T Consensus 172 ~~~~~~a~~~~~d~l~~l~~sl~~~yr~~~~~~~~imn~~t~~~~rklk--~~~g~~lw~~~~~~g~~~tl~G~PV~~~~ 249 (315) T protein:vir:41 172 SDVDPEAEDWPMNLFDTMIESLPTPYRNNLPNMKFYVTWDIYRAYRDAL--KGRETGLGDQALTGANSILYDGRPVQYVP 249 (315) T ss_pred cccccccccccHHHHHHHHHhcChHHhhcCCceEEEEcHHHHHHHHHHh--ccCCCccccchhhcCCCceecccceEecc Confidence 0 11111112345566665555432210 12346899999998886642 23455678888888998999999999999 Q ss_pred cccccccceEE-EEEecceEEeeee-eeeeeeccCcccceeeEEeeeeeeeEEEcCCeEEEEEecC Q lcl|NC_019506. 212 NMGSLTNGTGA-IAGVKMACTFAEQ-IVQTEAYRMEKRFADAVKGLNVFGCKVIYPDALVCLKKTN 275 (276) Q Consensus 212 ~lp~~~~~~~~-~~~~~~a~~~~~~-~~~~e~~~~~~~~~~~i~~~~~yg~~v~~~~~vv~~~~~~ 275 (276) .+|....+... +++...-+.++.+ ...++.+++.......+....+.|+....++++++.-.++ T Consensus 250 ~m~~~~~~~~~ilf~d~~nl~~~~~~~i~i~~~~~a~~~~~~~~~~~r~d~~~~~~~~~a~~~~~v 315 (315) T protein:vir:41 250 ALEALNDGKSRALFVVPTQLVYGFWRNIKVVPDYDAEMRLTKYVASLRTDNHYEDEEGAVSATITV 315 (315) T ss_pred cccccCCCCccEEEecccceEEEeccccEEEeeecCCCCceEEEEEEEeceeEEeccceeEeeeeC Confidence 99876544333 3334433444432 2356666766555567788889999888777766554555 No 165 >protein:vir:95963 Length: 395 # NCBI annotation: ORF009 # Family: family:all:635 # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239802;genbank:gi:66395459;genbank:GeneID:5132880 Probab=99.04 E-value=1.8e-10 Score=74.04 Aligned_cols=254 Identities=11% Similarity=-0.030 Sum_probs=147.3 Q ss_pred Cc------cchhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCccc-ceeecCCCCCCCccccccce Q lcl|NC_019506. 1 MA------VTSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGAIT-VKEYTENSDIDAPEELSTTE 73 (276) Q Consensus 1 MA------~~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~~~-~~d~~~~~~~~~~~~~~~~~ 73 (276) |. .-.++|+.+..++.+.+++.+++.++++.- + ..| +..||+....+ +.............+.+... T Consensus 86 ~~~~t~~~gG~liP~~~~~~Ii~~l~~~s~i~~~~~v~----~-~~~-~~~i~~~~~~~~a~w~~e~~~~~~~~~~~f~~ 159 (395) T protein:vir:95 86 INYDVGYTDEKILPETVVERVFDDLQKDHPLLSKINFQ----N-AGI-KTRVIKADPAGQAVWGKVFGEIKGQLDAAFRE 159 (395) T ss_pred HhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeE----e-cCC-ceEEEEecCCcceEEeecccccCcccccccee Confidence 11 114789999999999999999999988641 1 234 45787765543 44433333332223456667 Q ss_pred EEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcc---cccc----------ccc---ccc Q lcl|NC_019506. 74 KVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTN---ATSK----------LKP---AAT 137 (276) Q Consensus 74 ~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~---~~~~----------~~~---~~~ 137 (276) +++...+. +.-+.|+.+-+..+..|+.+.+.+..+.+++.++|+.++..-..+ +... .+. ... T Consensus 160 i~l~~~kl-~~~~~iS~ell~ds~~~ie~~i~~~la~~ia~~~~~a~i~G~G~~~~qP~Gil~~~~~~~~~~~~~~~~~~ 238 (395) T protein:vir:95 160 ENFTQYKL-TCFVVLPDDLSTFGPAWIERFVRTQIQEAISVALESAIINGGGAAKTQPVGLMKDVNTNSGAVTDKASSGT 238 (395) T ss_pred eeeceeeE-EEeecccHHHHhcchhHHHHHHHHHHHHHHHHHHhhheeeccCCCCcCceeeeecccccccccccccccch Confidence 77776543 445678887777788899999999999999999999887432221 1100 000 001 Q ss_pred CCHHH---HHHHHHHHHHHHhhc----C-CCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEe--ceEE Q lcl|NC_019506. 138 LDKTN---IYEELIKVKVKLDEK----N-VPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTIL--GFDV 207 (276) Q Consensus 138 ~t~~~---~~~~i~~a~~~l~~~----~-vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~--G~~v 207 (276) .+..+ .+..+..+...+... . ....+..++++|..+..+.....+.. ..|...+++ |.+| T Consensus 239 ~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~mn~~t~~~~~g~~~~~~----------~~G~~~~~lg~g~~v 308 (395) T protein:vir:95 239 LTFADADTTILELNDVLKNLSVDEKGKELKIDGKVALVVNPRDSWDVQARYTYLT----------ANGGFVTVLPYNVTI 308 (395) T ss_pred hhhhhhHhhHHHHHHHHHhhccccccchhhhcCceEEEEcchhhhhcCCcceecc----------CCCcceeccCCcceE Confidence 11111 122333333322111 1 11223457899988876654433321 134444554 6678 Q ss_pred EEeccccccccceEEEEEecceEEeeee-eeeeeeccCcccc---eeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 208 YLSNNMGSLTNGTGAIAGVKMACTFAEQ-IVQTEAYRMEKRF---ADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 208 ~~s~~lp~~~~~~~~~~~~~~a~~~~~~-~~~~e~~~~~~~~---~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) ++++.+|.. ...+..-+. +....+ ...++..+. .+| ...+++..|+|+++++++++++++.+.. T Consensus 309 ~~~~~~p~~---~i~fgdfs~-y~i~~r~~~~i~~~~~-~~~~~d~~~f~~~~r~dg~~~~~~A~~~l~i~~~ 376 (395) T protein:vir:95 309 ITSEFVPEG---KLVAFVTDR-YNAVRGGGLTVKKFDQ-TLALEDAVLFTAKTFAYGQPDDNKASAVYDLKVA 376 (395) T ss_pred EEcCCCCCC---cEEEEeccc-EEEEEecceEEEeccc-hhhhCCcEEEEEEEEECCEEeccccEEEEEeecc Confidence 999999852 222222222 323222 223333322 122 2578999999999999999999987744 No 166 >protein:vir:9643 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795405;genbank:gi:28876178;genbank:GeneID:1257724 Probab=99.03 E-value=2.3e-10 Score=73.41 Aligned_cols=255 Identities=15% Similarity=0.005 Sum_probs=152.4 Q ss_pred Ccc---chhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCc-ccceeecCCCCCCCccccccceEEE Q lcl|NC_019506. 1 MAV---TSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGA-ITVKEYTENSDIDAPEELSTTEKVL 76 (276) Q Consensus 1 MA~---~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~-~~~~d~~~~~~~~~~~~~~~~~~~~ 76 (276) -.. -.++|+.+..++.+.+.+...+.++++.- + ..| ...||.... .++.+..++.......+.+...+++ T Consensus 82 ~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~----~-~~~-~~~i~~~~~~~~a~wv~e~~~~~~~~~~~f~~i~l 155 (377) T protein:vir:96 82 VGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFK----N-TSL-RLKALTAETSGTAVWGDIFGEIKGQLKQAFKEQDF 155 (377) T ss_pred CCCCCCceecCHHHHHHHHHHHHhhhhhhhhceeE----e-cCC-ceEEEEecCCcceeEeecccccccccCccceeEee Confidence 111 25889999999999999999999988642 1 123 456776544 3466666555443323556666677 Q ss_pred EEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcc------------cccccc----------- Q lcl|NC_019506. 77 EINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTN------------ATSKLK----------- 133 (276) Q Consensus 77 ~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~------------~~~~~~----------- 133 (276) ...+. +.-+.|+..-+..+..|+...+.+..+.+++..+|+.++..=... ...... T Consensus 156 ~~~kl-~~~~~is~~ll~ds~~~le~~i~~~l~~~~~~~~~~a~i~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~ 234 (377) T protein:vir:96 156 SQFKL-TAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLLQPVGLLKDLSQPTVDQSTGRDITTYKTDK 234 (377) T ss_pred eeeeE-EeechhhHHHhhcchhhHHHHHHHHHHHHHHHHHhhceEeccCCCcceeeeeccccccccccccccccceeecc Confidence 66444 444677877777788899999999999999999999877421110 000000 Q ss_pred ----ccccCCHHHHHHHHHHHHHHHhhcCC--C---ccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEe- Q lcl|NC_019506. 134 ----PAATLDKTNIYEELIKVKVKLDEKNV--P---TIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTIL- 203 (276) Q Consensus 134 ----~~~~~t~~~~~~~i~~a~~~l~~~~v--P---~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~- 203 (276) ..+..+...+.+.+..+...+..++- | ..+-++++||..+..+.....+.. .+|...+++ T Consensus 235 ~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~~~----------~~G~~~~~l~ 304 (377) T protein:vir:96 235 EAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAKFTSRN----------QFGEYVTVLP 304 (377) T ss_pred ccccccccCChhHHHHHHHHHHHhhccccccccccccCceEEEEchhhHHhccccccccC----------CCCCceeccC Confidence 00112333344444445444443321 1 123468899998887754332221 123444554 Q ss_pred -ceEEEEeccccccccceEEEEEecceEEeeeee-eeeeeccCcc--cceeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 204 -GFDVYLSNNMGSLTNGTGAIAGVKMACTFAEQI-VQTEAYRMEK--RFADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 204 -G~~v~~s~~lp~~~~~~~~~~~~~~a~~~~~~~-~~~e~~~~~~--~~~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) |+.+++|+.+|. +...+...+. +.+..+. ..++..++.. .-.+.+++..++++++++|++++++..+.- T Consensus 305 ~p~~v~~s~~~p~---~~i~fgdf~~-Y~i~~r~~~~i~~~~~~~~~~d~~~f~~~~r~dG~~~d~~a~~vl~l~~~ 377 (377) T protein:vir:96 305 HGITILESLAVET---GKAIAFVANR-YDAFMATASTIEEYDQTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) T ss_pred CCceEEecCCCCc---ccEEEEEcCc-EEEEEecccEEEeehhhhhhcCCeEEEEEEEEcCEEecCCcEEEEEEecC Confidence 456888888884 2223332333 4444332 2343333211 113578999999999999999999998888 No 167 >protein:vir:9509 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835556;genbank:gi:30043951;genbank:GeneID:1260537 Probab=98.97 E-value=2.7e-10 Score=73.06 Aligned_cols=254 Identities=13% Similarity=-0.006 Sum_probs=149.3 Q ss_pred Cc------cchhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCc-ccceeecCCCCCCCccccccce Q lcl|NC_019506. 1 MA------VTSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGA-ITVKEYTENSDIDAPEELSTTE 73 (276) Q Consensus 1 MA------~~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~-~~~~d~~~~~~~~~~~~~~~~~ 73 (276) |. .-+++|+.+..++.+.+.+.+.+.++++.- . ..|+ ..||+... ..+.+...+.......+.+... T Consensus 76 ~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~----~-~~~~-~~i~~~~~~~~a~w~~e~~~~~~~~~~~f~~ 149 (381) T protein:vir:95 76 INKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIK----N-AGLR-LKFLKSETSGVAVWGKIYGEIKGQLDAAFSE 149 (381) T ss_pred HhcccCCCCceecCHHHHHHHHHHHHhhccceeheeeE----e-cCcc-eEEEEecCCcceeeeccccccccccccccee Confidence 11 125789999999999999999999988542 1 2243 46776544 3455555554433223455666 Q ss_pred EEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcc-cc----------ccc-------ccc Q lcl|NC_019506. 74 KVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTN-AT----------SKL-------KPA 135 (276) Q Consensus 74 ~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~-~~----------~~~-------~~~ 135 (276) +++...+. +.-+.|+..-+..+..|+...+.+..+.+++..+|+.++..-... +. ... ... T Consensus 150 i~l~~~kl-~~~~~is~elL~Ds~~~ie~~i~~~la~~~a~~~~~a~i~G~G~~qP~Gil~~~~~~~~~~~g~~~~~~~~ 228 (381) T protein:vir:95 150 ETAIQNKL-TAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQ 228 (381) T ss_pred eeecceeE-EeechhhHHHhhcCHHHHHHHHHHHHHHHHHHHhhheeEeccCCCCceeeeeccCcccccccccccccccc Confidence 66666544 444677877777778899999999999999999999876432111 00 000 001 Q ss_pred ccC---CHHHHHHHHHHHHHHHhhcC-----CCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEE--ece Q lcl|NC_019506. 136 ATL---DKTNIYEELIKVKVKLDEKN-----VPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTI--LGF 205 (276) Q Consensus 136 ~~~---t~~~~~~~i~~a~~~l~~~~-----vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~--~G~ 205 (276) ... +....++.+......+.... .+..+.+++++|..+..|+....... .+|..... +|. T Consensus 229 ~t~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~l~~~~~~~~----------~~G~~v~~l~~g~ 298 (381) T protein:vir:95 229 GTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLN----------ANGVYVTALPFNL 298 (381) T ss_pred cccccccchhhHHHHHHHHHhhccccccccccccCceEEEEccccHHhhccccccCC----------CCCceeecCCCCc Confidence 111 12223344544444443221 22345568999999888865432221 12322222 467 Q ss_pred EEEEeccccccccceEEEEEecceEEeeeee-eeeeeccCcccc---eeeEEeeeeeeeEEEcCCeEEEEEecC---C Q lcl|NC_019506. 206 DVYLSNNMGSLTNGTGAIAGVKMACTFAEQI-VQTEAYRMEKRF---ADAVKGLNVFGCKVIYPDALVCLKKTN---P 276 (276) Q Consensus 206 ~v~~s~~lp~~~~~~~~~~~~~~a~~~~~~~-~~~e~~~~~~~~---~~~i~~~~~yg~~v~~~~~vv~~~~~~---p 276 (276) .|+.|+.+|. +. .+++.-+-+....+. ..++..+. ..| .+.+++..|+++++++|+++++++.+. | T Consensus 299 ~vv~s~~~p~---~~-iifgDfs~Y~i~~r~~~~i~~~~~-~~~~~d~~~f~a~~r~dg~~~~~~A~~v~~l~~~~~~ 371 (381) T protein:vir:95 299 NVIESTVQEA---GK-VLTYVKGLYDGYLAGGINVQKFKE-TLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHK 371 (381) T ss_pred eEEecCCCCc---Cc-EEEEecccEEEEEecccEEEeech-hHhhcCCeEEEEEEEEcCEEecCceEEEEEEEecCCC Confidence 7999998884 22 233332324333332 23333322 222 257899999999999999999977554 2 No 168 >protein:vir:101291 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908831;genbank:gi:118725095;genbank:GeneID:4555862 Probab=98.97 E-value=2.7e-10 Score=73.06 Aligned_cols=254 Identities=13% Similarity=-0.006 Sum_probs=149.3 Q ss_pred Cc------cchhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCc-ccceeecCCCCCCCccccccce Q lcl|NC_019506. 1 MA------VTSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGA-ITVKEYTENSDIDAPEELSTTE 73 (276) Q Consensus 1 MA------~~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~-~~~~d~~~~~~~~~~~~~~~~~ 73 (276) |. .-+++|+.+..++.+.+.+.+.+.++++.- . ..|+ ..||+... ..+.+...+.......+.+... T Consensus 76 ~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~----~-~~~~-~~i~~~~~~~~a~w~~e~~~~~~~~~~~f~~ 149 (381) T protein:vir:10 76 INKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIK----N-AGLR-LKFLKSETSGVAVWGKIYGEIKGQLDAAFSE 149 (381) T ss_pred HhcccCCCCceecCHHHHHHHHHHHHhhccceeheeeE----e-cCcc-eEEEEecCCcceeeeccccccccccccccee Confidence 11 125789999999999999999999988542 1 2243 46776544 3455555554433223455666 Q ss_pred EEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcc-cc----------ccc-------ccc Q lcl|NC_019506. 74 KVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTN-AT----------SKL-------KPA 135 (276) Q Consensus 74 ~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~-~~----------~~~-------~~~ 135 (276) +++...+. +.-+.|+..-+..+..|+...+.+..+.+++..+|+.++..-... +. ... ... T Consensus 150 i~l~~~kl-~~~~~is~elL~Ds~~~ie~~i~~~la~~~a~~~~~a~i~G~G~~qP~Gil~~~~~~~~~~~g~~~~~~~~ 228 (381) T protein:vir:10 150 ETAIQNKL-TAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQ 228 (381) T ss_pred eeecceeE-EeechhhHHHhhcCHHHHHHHHHHHHHHHHHHHhhheeEeccCCCCceeeeeccCcccccccccccccccc Confidence 66666544 444677877777778899999999999999999999876432111 00 000 001 Q ss_pred ccC---CHHHHHHHHHHHHHHHhhcC-----CCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEE--ece Q lcl|NC_019506. 136 ATL---DKTNIYEELIKVKVKLDEKN-----VPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTI--LGF 205 (276) Q Consensus 136 ~~~---t~~~~~~~i~~a~~~l~~~~-----vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~--~G~ 205 (276) ... +....++.+......+.... .+..+.+++++|..+..|+....... .+|..... +|. T Consensus 229 ~t~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~l~~~~~~~~----------~~G~~v~~l~~g~ 298 (381) T protein:vir:10 229 GTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLN----------ANGVYVTALPFNL 298 (381) T ss_pred cccccccchhhHHHHHHHHHhhccccccccccccCceEEEEccccHHhhccccccCC----------CCCceeecCCCCc Confidence 111 12223344544444443221 22345568999999888865432221 12322222 467 Q ss_pred EEEEeccccccccceEEEEEecceEEeeeee-eeeeeccCcccc---eeeEEeeeeeeeEEEcCCeEEEEEecC---C Q lcl|NC_019506. 206 DVYLSNNMGSLTNGTGAIAGVKMACTFAEQI-VQTEAYRMEKRF---ADAVKGLNVFGCKVIYPDALVCLKKTN---P 276 (276) Q Consensus 206 ~v~~s~~lp~~~~~~~~~~~~~~a~~~~~~~-~~~e~~~~~~~~---~~~i~~~~~yg~~v~~~~~vv~~~~~~---p 276 (276) .|+.|+.+|. +. .+++.-+-+....+. ..++..+. ..| .+.+++..|+++++++|+++++++.+. | T Consensus 299 ~vv~s~~~p~---~~-iifgDfs~Y~i~~r~~~~i~~~~~-~~~~~d~~~f~a~~r~dg~~~~~~A~~v~~l~~~~~~ 371 (381) T protein:vir:10 299 NVIESTVQEA---GK-VLTYVKGLYDGYLAGGINVQKFKE-TLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHK 371 (381) T ss_pred eEEecCCCCc---Cc-EEEEecccEEEEEecccEEEeech-hHhhcCCeEEEEEEEEcCEEecCceEEEEEEEecCCC Confidence 7999998884 22 233332324333332 23333322 222 257899999999999999999977554 2 No 169 >protein:vir:3158 Length: 321 # NCBI annotation: capsid protein gpE # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665929;genbank:gi:22091115;genbank:GeneID:951342 Probab=98.96 E-value=6.5e-10 Score=70.95 Aligned_cols=264 Identities=10% Similarity=0.043 Sum_probs=148.3 Q ss_pred CccchhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCccccee--ecCCCCCCCccccccceEEEEE Q lcl|NC_019506. 1 MAVTSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGAITVKE--YTENSDIDAPEELSTTEKVLEI 78 (276) Q Consensus 1 MA~~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~~~~~d--~~~~~~~~~~~~~~~~~~~~~l 78 (276) ...-..+|.-++.++.+.+.+...+.++++.-. ....+..||.++...... ..+++......+++.+.+++.+ T Consensus 24 ~~~g~~v~~~~~~~l~~~i~e~s~~l~~i~v~~-----v~~~~~~i~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~ 98 (321) T protein:vir:31 24 LDAGGTLPDPLWDEFWTDMIEETPLLDAIRTET-----VGAKKTRIPTLNIGERHRRPQDEGEWNENESDVSTGTIDIST 98 (321) T ss_pred cCCcceeCHHHHHHHHHHHHHhhhhhhhceeee-----ccCcceeeeeeccCCcccccccccccccccccceeeeeeeee Confidence 111234555666678888888888888876421 123345566654322111 1123332223445666777877 Q ss_pred EeeeecceeechHHHHhhh--hhHHHHHHHHHHHHHHHHHHHHHHHHhhccccc-----cc--------c-ccccCCHHH Q lcl|NC_019506. 79 NKQKYFNFQIDDVDAAQIR--TPLMDAAMQRAAYALADETEKILLKEMDTNATS-----KL--------K-PAATLDKTN 142 (276) Q Consensus 79 d~~~~~~~~v~d~d~~~~~--~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~-----~~--------~-~~~~~t~~~ 142 (276) .+. ...+.|+.+-...+. .|+.+.+....+++++..++..++..-..+... .+ . .....+... T Consensus 99 ~k~-~~~~~it~e~L~d~a~~~d~e~~i~~~ia~~~a~~~~~~~~nGd~~~~~~~~~~n~G~l~~a~~~~~~~~~~~~~~ 177 (321) T protein:vir:31 99 EKA-TVAWDLPREVVQENPEGEALADRILNLMTDAWSADVEDLAANGDEDAEDSFENQNDGFITVAEGDVETIDAADDIL 177 (321) T ss_pred EEE-EeehhccHHHHHhhhcchhHHHHHHHHHHHHHHHHHHhheeeccccCCCcccccchhhhhhhcccccccccccccc Confidence 554 455678876555443 488888999999999999888766432211110 00 0 001112222 Q ss_pred HHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEeccccccccceEE Q lcl|NC_019506. 143 IYEELIKVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSNNMGSLTNGTGA 222 (276) Q Consensus 143 ~~~~i~~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp~~~~~~~~ 222 (276) .++.|.++...|+...--..+-+.+|+++.+..++.- +.......++..+..|...+++|++++.++.+|.. .. T Consensus 178 ~~d~l~~l~~~l~~~yr~~~~~v~im~~~~~~~~~~~--l~~~~~~~~~~~l~~~~~~tl~G~pvv~~~~mP~~----~i 251 (321) T protein:vir:31 178 DNDLVIRTIAGLDSKYRARMNPALIVSEDQLLSYHYT--LTDRDTPLGDNVIMGEADVNPFSFPIIGSGLWPDD----KA 251 (321) T ss_pred CHHHHHHHHHhccHhHhcCCCeEEEechHHHHHHHHH--HhcCCCccccchhhccccccccceeEEEcCCCCCC----cE Confidence 3566777777776554212234678999987665432 12223334555677777778999999999999852 24 Q ss_pred EEEecceEEeee-eeeeeeeccCccc---ceeeEE--eeeeeeeEEEcCCeEEEEE-ecCC Q lcl|NC_019506. 223 IAGVKMACTFAE-QIVQTEAYRMEKR---FADAVK--GLNVFGCKVIYPDALVCLK-KTNP 276 (276) Q Consensus 223 ~~~~~~a~~~~~-~~~~~e~~~~~~~---~~~~i~--~~~~yg~~v~~~~~vv~~~-~~~p 276 (276) +++...-+.+.. +...++..++.+. ..+.++ ....+|+.|-++++++.++ ...| T Consensus 252 l~t~~~nl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ve~~~a~a~~~~i~~~ 312 (321) T protein:vir:31 252 MFTDPQNLIYALYRDLEIDVLTESDKVSERDLHARYFMRGDDDFAIENTEAVVLAEGLGDP 312 (321) T ss_pred EEeccccEEEEEeeccEEEEeecCccccccceeeEeeeeeecceeEeccccEEEEecCCcc Confidence 444444444332 2223333333221 122333 2345778788899999987 4666 No 170 >protein:vir:80128 Length: 466 # NCBI annotation: Phage capsid protein # Family: family:all:635 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425603;genbank:gi:155042936;genbank:GeneID:5469556 Probab=98.93 E-value=4.9e-10 Score=71.64 Aligned_cols=259 Identities=18% Similarity=0.210 Sum_probs=146.0 Q ss_pred Cc-cchhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCcc-cceeecCCCCCCCccccccceEEEEE Q lcl|NC_019506. 1 MA-VTSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGAI-TVKEYTENSDIDAPEELSTTEKVLEI 78 (276) Q Consensus 1 MA-~~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~~-~~~d~~~~~~~~~~~~~~~~~~~~~l 78 (276) -+ ...++|+.+...+.+.+.....+.++++.. . ..| ++++|+.+.. .+....++..... .+++...+++.+ T Consensus 154 ~~g~~~~vP~~~~~~i~~~l~~~~~l~~~~~v~--~---~~g-~~~~~~~~~~~~a~wv~E~~~~~~-~~~~f~~i~~~~ 226 (466) T protein:vir:80 154 VSGAELTIPDVMLELLRDNMHRYSKLISKVRLR--P---LKG-TARQNIAGAIPEGVWTEAVANLNE-LSLSFSQIEVDG 226 (466) T ss_pred hccccccccHHHHHHHHHhhhhhhhhhhheeee--e---cCc-eeEeeeecCCcceeeccccccccc-ccccccceeecc Confidence 11 125789999999999998888887776431 1 122 4566655443 3444555555433 356677777777 Q ss_pred EeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccc-cc-----------ccccc-----ccCCHH Q lcl|NC_019506. 79 NKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNA-TS-----------KLKPA-----ATLDKT 141 (276) Q Consensus 79 d~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~-~~-----------~~~~~-----~~~t~~ 141 (276) .+. +.-+.|+++-...+..++...+.+..+++++..+|..++..-..+. .+ ..... ...+.. T Consensus 227 ~k~-~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~ail~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~ 305 (466) T protein:vir:80 227 YKV-GGFIPIPNSTLEDSDLNLADEILDAIGQAIGFALDKAILYGTGTKMPVGIVTRLAQTTQPPNWGTKAPAWTNLSTT 305 (466) T ss_pred eee-eeehhhhHHHHhcchHHHHHHHHHHHHHHHHHHHhhheeeccCCCCcceeeecccccccccccccccccccccchh Confidence 554 4446788887777878999999999999999999998875311110 00 00000 000110 Q ss_pred HH----------HHHHHHHHHHHhhcCCC-ccC-CEEEECHHHHHHHhhhHHhhhh-cccccccceeeeeeeEEeceEEE Q lcl|NC_019506. 142 NI----------YEELIKVKVKLDEKNVP-TIG-RFLIIPPDVHGLLLAADLIVGT-GGAMAESITKNGFVGTILGFDVY 208 (276) Q Consensus 142 ~~----------~~~i~~a~~~l~~~~vP-~~~-r~~vv~p~~~~~L~~~~~~~~~-~~~~~~~~~~~G~i~~~~G~~v~ 208 (276) .. +..+.++...+.....+ ..+ .+.++++..+..|++....... ...... ..++ ..++|.+|+ T Consensus 306 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~~~~~l~~~~~~~~~~g~~~~~--~~~~--~~i~G~pvv 381 (466) T protein:vir:80 306 NLLKIDPTGKSAEEFFSELVLKLSKARANYSNGMKFWAMSSNTHAVLMSKAITFNSAGALVAS--LNNT--MPIVGGDIV 381 (466) T ss_pred hhhhhhhhccchhhHHHHHHHHHHhhhccccCCceeEEecchhHHHhhcccccccCCcccccc--CCCc--cccccccee Confidence 00 01111211111111111 123 3467899988888665322111 111111 1111 258999999 Q ss_pred EeccccccccceEEEEEecceEEeeee-eeeeeeccCcccc---eeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 209 LSNNMGSLTNGTGAIAGVKMACTFAEQ-IVQTEAYRMEKRF---ADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 209 ~s~~lp~~~~~~~~~~~~~~a~~~~~~-~~~~e~~~~~~~~---~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) .++.+|... .+++....+.+..+ ...++..+. .+| -+.+++..++|+++++|+++++++.+.. T Consensus 382 ~s~~~~~~~----~~~g~~~~y~i~~r~~~~i~~~~~-~~f~~d~~~~r~~~r~dg~~~~~~afv~~~~~~~ 448 (466) T protein:vir:80 382 ILDFIPDND----IIGGYGSLYLLAERADIKLAQSEH-VRFIEDQTVFKGTARYDGKPVFGEGFVAVNIANA 448 (466) T ss_pred ecCccCccc----eeeeccccEEEEeecceEEEechh-hhhhcCcEEEEEEEEEccEEeccCceEEEEecCC Confidence 999998532 33444444433322 223333322 222 2478999999999999999999976655 No 171 >protein:vir:100632 Length: 381 # NCBI annotation: 77ORF006 # Family: family:all:635 # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958606;genbank:gi:41189521;genbank:GeneID:2743778 Probab=98.90 E-value=9e-10 Score=70.17 Aligned_cols=256 Identities=11% Similarity=-0.039 Sum_probs=145.4 Q ss_pred Ccc--chhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCcc-cceeecCCCCCCCccccccceEEEE Q lcl|NC_019506. 1 MAV--TSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGAI-TVKEYTENSDIDAPEELSTTEKVLE 77 (276) Q Consensus 1 MA~--~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~~-~~~d~~~~~~~~~~~~~~~~~~~~~ 77 (276) ..- -+++|+.+..++.+.+.+.+.+.++++.- . ..| ...+|+.... .+.+...........+.+.+.+++. T Consensus 80 t~~~Gg~lvP~~~~~~I~~~l~~~spir~~a~v~----~-~~~-~~~i~~~~~~~~a~W~~e~~~~~~~~~~~f~~i~l~ 153 (381) T protein:vir:10 80 VGYKEEKLLPEETIDRIFEDLTTNHPLLADLGIK----N-AGL-RLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAI 153 (381) T ss_pred CCCCCceecCHHHHHHHHHHHHhhcceeeeeeeE----e-cCc-ceEEEeecCCcceEEeecccccccccCccceeEeec Confidence 221 25899999999999999999999988542 1 223 3466765543 3444444333322234566666666 Q ss_pred EEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcc-ccc---------cccc--------cccCC Q lcl|NC_019506. 78 INKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTN-ATS---------KLKP--------AATLD 139 (276) Q Consensus 78 ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~-~~~---------~~~~--------~~~~t 139 (276) ..+. +.-+.|+..-+..+..|+.+.+.+..++++++.+|+.++..=... +.+ .... ....+ T Consensus 154 ~~kl-~a~i~is~elL~Ds~~~le~~i~~~la~~~a~~~~~afi~GdG~~qP~Gil~~~~~~~~~~~g~~~~~~~~~~~t 232 (381) T protein:vir:10 154 QNKL-TAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTDGAYPEKEEQGTLT 232 (381) T ss_pred ceeE-EeeccccHHHHhccHHHHHHHHHHHHHHHHHHHhhceeEecccCCCceeeeecCCcccccccccccccccccccc Confidence 6444 455677877777778899999999999999999998776331111 000 0000 00111 Q ss_pred HH---HHHHHHHHHHHHHhhc----C-CCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEec Q lcl|NC_019506. 140 KT---NIYEELIKVKVKLDEK----N-VPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSN 211 (276) Q Consensus 140 ~~---~~~~~i~~a~~~l~~~----~-vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~ 211 (276) .. ..++.+......+... . ....+.+++++|..+..|.....+... ++.+.. . --+|.+|++++ T Consensus 233 ~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~vmn~~t~~~l~~~~~~~~~-----~G~~v~-~--lp~g~~vv~~~ 304 (381) T protein:vir:10 233 FANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNA-----NGVYVT-A--LPFNLNVIEST 304 (381) T ss_pred ccchhhHHHHHHHHHHhhhhhhccccccccCceEEEEchhhHHhhccccccCCC-----CCceee-c--CCCCceeEEcC Confidence 11 1122222222222111 1 123456789999999988765433221 111111 0 11588899999 Q ss_pred cccccccceEEEEEecceEEeeeee-eeeeeccCcccc---eeeEEeeeeeeeEEEcCCeEEEEEecC----C Q lcl|NC_019506. 212 NMGSLTNGTGAIAGVKMACTFAEQI-VQTEAYRMEKRF---ADAVKGLNVFGCKVIYPDALVCLKKTN----P 276 (276) Q Consensus 212 ~lp~~~~~~~~~~~~~~a~~~~~~~-~~~e~~~~~~~~---~~~i~~~~~yg~~v~~~~~vv~~~~~~----p 276 (276) .+|.. ...+...+. +....+. ..++..+. .+| .+.+++..++++++++|++++++..+. | T Consensus 305 ~~p~~---~i~fGDfs~-Y~i~~r~~~~i~~~~~-~~~~~d~~~f~a~~r~dG~~~~~~A~~v~~l~~~~~~~ 372 (381) T protein:vir:10 305 VQEAG---KVLTYVKGL-YDGYLAGGINVQKFKE-TLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKP 372 (381) T ss_pred CCCcC---cEEEEEccc-EEEEEecccEEEeech-hhhhcCceEEEEEEEEcCEEecCCcEEEEEEeecCCcc Confidence 99842 222222232 3333332 23333322 222 257899999999999999999977663 3 No 172 >protein:vir:78350 Length: 383 # NCBI annotation: Cps # Family: family:all:635 # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468644;genbank:gi:157325222;genbank:GeneID:5601696 Probab=98.82 E-value=2.7e-09 Score=67.60 Aligned_cols=254 Identities=10% Similarity=-0.030 Sum_probs=140.2 Q ss_pred Ccc------chhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCcc-cceeecCCCCCCCccccccce Q lcl|NC_019506. 1 MAV------TSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGAI-TVKEYTENSDIDAPEELSTTE 73 (276) Q Consensus 1 MA~------~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~~-~~~d~~~~~~~~~~~~~~~~~ 73 (276) |.. -+++|+.|...+.+.+.+...+.++++.- + ..|+ ..||+.... .+.+...+.......+.+... T Consensus 83 ~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~l~~~~~v~----~-~~~~-~~i~~~~~~~~a~w~~e~~~~~~~~~~~f~~ 156 (383) T protein:vir:78 83 INKEVGYKEETLLPQTVVDEIFEDLTTEHPFLASIGMR----T-TGLR-TKFLKSETSGVAVWGKIFGEIKGQLDATFSD 156 (383) T ss_pred HhccCCCCCccccCHHHHHHHHHHHHhhccceeeeeeE----e-cCCc-eEEEEEcCCcceEEeecccccccccCcceee Confidence 221 15889999999999999999999987531 1 2354 578876554 355555544433223456667 Q ss_pred EEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcc-ccc---------cccccc----cCC Q lcl|NC_019506. 74 KVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTN-ATS---------KLKPAA----TLD 139 (276) Q Consensus 74 ~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~-~~~---------~~~~~~----~~t 139 (276) +++...+. +.-+.|+.+-+..+..++.+.+.+..+++++..+|+.++..-... +.. ....+. ..+ T Consensus 157 i~l~~~kl-~~~i~is~ell~Ds~~~ie~~i~~~l~~~~a~~~~~a~i~G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~~ 235 (383) T protein:vir:78 157 EESIQNKL-TAFVVVPKDLEKFGPAWVKRFVVTQIEEAFAVALESAYIVGDGNDKPIGLNRKVGKGSTVVDGVYAEKAAT 235 (383) T ss_pred Eeecceee-EeeccchHHHhhccHHHHHHHHHHHHHHHHHHHHhhheEeccCCCCceeeeeccCCccccccccccccccc Confidence 77777554 445788887777788899999999999999999999887432110 000 000000 000 Q ss_pred HHHHHHHHHH---HHHHHhhcCC--------C-ccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEe--ce Q lcl|NC_019506. 140 KTNIYEELIK---VKVKLDEKNV--------P-TIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTIL--GF 205 (276) Q Consensus 140 ~~~~~~~i~~---a~~~l~~~~v--------P-~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~--G~ 205 (276) +......+.. ....+.+... . ......+++|..+..+....... + .+|....++ |. T Consensus 236 ~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~--~--------~~G~~~t~l~~~~ 305 (383) T protein:vir:78 236 GTLTFANPKTTVNELTDVYKYHSVKENGHPLNVAGKVTLLVNPTDAWDVKKQYTSL--N--------ANGVYVTALPFNL 305 (383) T ss_pred chhhhhhhHHHHHHHHHHHhccchhcccchhhhcCceEEEEcCcchhhhccchhcc--C--------CCCceeeecCCCc Confidence 0001111111 1111111110 0 11224678887665553321111 0 123333444 55 Q ss_pred EEEEeccccccccceEEEEEecceEEeeee-eeeeeeccCcccc---eeeEEeeeeeeeEEEcCCeEEEEEec-CC Q lcl|NC_019506. 206 DVYLSNNMGSLTNGTGAIAGVKMACTFAEQ-IVQTEAYRMEKRF---ADAVKGLNVFGCKVIYPDALVCLKKT-NP 276 (276) Q Consensus 206 ~v~~s~~lp~~~~~~~~~~~~~~a~~~~~~-~~~~e~~~~~~~~---~~~i~~~~~yg~~v~~~~~vv~~~~~-~p 276 (276) .|++|+.+|.. . .+++..+.+....+ ...++..+ ..+| .+.+++..|+|+++++|++++++..+ ++ T Consensus 306 ~iv~s~~~p~~---~-iifgdfs~Y~i~~r~~~~i~~~~-~~~f~~d~~~f~~~~r~dG~~~~~~A~~vl~~~~~~ 376 (383) T protein:vir:78 306 NIIESLFVPEK---K-AISYVAERYDALIGGPLDIGTYD-QTLAIEDLNLYAAKQFAYGKAKDDKAAAVWTLNINP 376 (383) T ss_pred eEEecCCCCcc---c-EEEeeccceEEEecccceEEecc-hhhhhcCceEEEEEEEEcCEEecCCeEEEEEEEecC Confidence 68889888842 2 22333232444332 22343332 2222 35789999999999999999996632 22 No 173 >protein:vir:98635 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039923;genbank:gi:126011098;genbank:GeneID:4818471 Probab=98.82 E-value=3.4e-09 Score=67.01 Aligned_cols=254 Identities=14% Similarity=-0.042 Sum_probs=142.0 Q ss_pred Ccc------chhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccC-cccceeecCCCCCCCccccccce Q lcl|NC_019506. 1 MAV------TSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIG-AITVKEYTENSDIDAPEELSTTE 73 (276) Q Consensus 1 MA~------~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~-~~~~~d~~~~~~~~~~~~~~~~~ 73 (276) |.. -.++|+.+..++.+.+.+...+.++++.- . ..|+ .++|... ..++.+..++.......+.+... T Consensus 79 ~~~~~~~~gg~~vP~~~~~~I~~~l~~~s~i~~~~~v~----~-~~~~-~~~~~~~~~~~a~w~~e~~~~~~~~~~~f~~ 152 (377) T protein:vir:98 79 DKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFK----N-TSLR-LKALTAETSGTAVWGDIFGEIKGQLKQAFKE 152 (377) T ss_pred HhccCCCCCccccCHHHHHHHHHHHHHhhhhhhheeeE----e-cCcc-eEEEEecCCcceeEeecccccCcccCcccee Confidence 221 24789999999999999999899987531 1 2344 4677543 34566666554433223445566 Q ss_pred EEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcc---------cccc---ccccccCC-- Q lcl|NC_019506. 74 KVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTN---------ATSK---LKPAATLD-- 139 (276) Q Consensus 74 ~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~---------~~~~---~~~~~~~t-- 139 (276) +++...+. +.-+.|+.+-+..+..|+.+.+.+..++++++.+|+.++..=... +... .+...+.+ T Consensus 153 i~l~~~kl-~a~~~is~elL~ds~~~ie~~i~~~la~~~a~~~~~a~i~G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~~ 231 (377) T protein:vir:98 153 QDFSQFKL-TAFVVIPKDALKFGPKWIKQFITEQLKEAIAVALELAIVKGDGLLQPVGLLKDLSQPTVDQSTGRDITTYK 231 (377) T ss_pred EeecceeE-EeeecccHHhhhccHhHHHHHHHHHHHHHHHHHHhhceEeccCCCcceeeeeccccccccccccccccccc Confidence 66666443 444677777777788899999999999999999999876422111 0000 00000011 Q ss_pred --HHHHHH--------------HHHHH--HHHHhhcCCCccCC-EEEECHHHHHHHhhhHHhhhhcccccccceeeeeee Q lcl|NC_019506. 140 --KTNIYE--------------ELIKV--KVKLDEKNVPTIGR-FLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVG 200 (276) Q Consensus 140 --~~~~~~--------------~i~~a--~~~l~~~~vP~~~r-~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~ 200 (276) ...+.+ ++... ...+++.+- ..|+ +++++|..+..+...... ....|... T Consensus 232 ~~~~~~~~l~~~~~~~~~~~a~~~m~~~t~~~~~klkd-~~G~~i~~~n~~~~~~~~p~~~~----------~~~~G~~~ 300 (377) T protein:vir:98 232 TDKEAIADLSDLTPDNAPKKLVPVMKHLSVNDKKRPLK-IAGQVKLILNPEDRWALEAQFTS----------RNQFGEYV 300 (377) T ss_pred chhhhHhhhhhhchhHHHHHHHHHHHHHHHHHHhhhhc-cCCceEEEecccchhhccccccc----------cCCCCccc Confidence 011100 11000 001111111 2445 456788776655322110 11234444 Q ss_pred EEece--EEEEeccccccccceEEEEEecceEEeeee-eeeeeeccCcc--cceeeEEeeeeeeeEEEcCCeEEEEEecC Q lcl|NC_019506. 201 TILGF--DVYLSNNMGSLTNGTGAIAGVKMACTFAEQ-IVQTEAYRMEK--RFADAVKGLNVFGCKVIYPDALVCLKKTN 275 (276) Q Consensus 201 ~~~G~--~v~~s~~lp~~~~~~~~~~~~~~a~~~~~~-~~~~e~~~~~~--~~~~~i~~~~~yg~~v~~~~~vv~~~~~~ 275 (276) +++|+ .|+.|+.+|.. ... ++..+.+....+ ...++..++.. .-.+.+++..++|+++++|+++++++.+. T Consensus 301 t~lg~p~~vv~s~~~p~~---~i~-fgdf~~Y~i~~r~~~~i~~~~~~~~~~d~~~f~~~~r~dg~~~~~~a~~vl~i~~ 376 (377) T protein:vir:98 301 TVLPHGITILESLAVETG---KAI-AFVANRYDAFMATASTIEEYDQTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAG 376 (377) T ss_pred cccCCCceEEecCCCCcc---cEE-EEEecceeEEeecceEEEeechhhhhcCceEEEEEEEEcCEEeccCcEEEEEEec Confidence 55654 57888888852 223 333232433322 22343332211 11367899999999999999999999888 Q ss_pred C Q lcl|NC_019506. 276 P 276 (276) Q Consensus 276 p 276 (276) = T Consensus 377 ~ 377 (377) T protein:vir:98 377 G 377 (377) T ss_pred C Confidence 8 No 174 >protein:vir:93696 Length: 364 # NCBI annotation: Bcep22gp55 # Family: family:all:974 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944284;genbank:gi:38640361;genbank:GeneID:2658350 Probab=98.72 E-value=1.2e-08 Score=63.93 Aligned_cols=273 Identities=12% Similarity=0.059 Sum_probs=160.4 Q ss_pred Cccch------hhHHHHHHHHHHHHHHhhcchh-hhcccccc-------ccccCCcEEEEeccCcccceeecCCCCCC-C Q lcl|NC_019506. 1 MAVTS------FIPKLWSARLLAHLDKAHVVAN-LVNRDYEG-------EIKAYGDTVKINQIGAITVKEYTENSDID-A 65 (276) Q Consensus 1 MA~~~------l~~e~~~~~~~~~l~~~~v~~~-~~~~~~~~-------~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~-~ 65 (276) ||.+. ....+|++.+...-.+.+.|.+ ++-++-.. .-...||+|+|+-...++-..+..+.... . T Consensus 1 Ma~T~~~~~~p~a~~~ws~~l~~~~~~~s~f~~~l~G~~~~~~I~~~~dL~k~~Gd~v~f~L~~~L~g~gv~Gd~~leGn 80 (364) T protein:vir:93 1 MSQTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDRITFDLSVHLRGKPTYGDARVEGK 80 (364) T ss_pred CceeccCcCCHHHHHHHHHHHHHHHHhhCccccccccCCCCCcEEEeeecCCCCCceEEeeeeeecccCCcccCceeecc Confidence 98764 3357899888888777776766 54332111 11246999999877665544444333332 2 Q ss_pred ccccccceEEEEEEeeeecceee-chHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccc-------------- Q lcl|NC_019506. 66 PEELSTTEKVLEINKQKYFNFQI-DDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATS-------------- 130 (276) Q Consensus 66 ~~~~~~~~~~~~ld~~~~~~~~v-~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~-------------- 130 (276) .+.++..+-.++||+.++ ++.. ..+..-.+..|++++..+.+..-+++..|+.++-.+..+... T Consensus 81 ee~L~~~~~~i~idq~r~-~V~~~g~ms~qRt~~dlr~~ar~~L~~w~~~~~d~~~f~~laGarg~~~~~~~~~~~~~~~ 159 (364) T protein:vir:93 81 EESLRFYQDEVRIDQVRH-SVSAGGRMSRKRTVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFIETPDFTGYA 159 (364) T ss_pred ccceeEEeeEEEEeeccc-cccccCchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccCccccc Confidence 356888889999998765 4544 345666788999999999999999999999877665432100 Q ss_pred --ccc--------------ccccCCHH--HHHHHHHHHHHHHhhcCCCc--------------cCCEEEECHHHHHHHhh Q lcl|NC_019506. 131 --KLK--------------PAATLDKT--NIYEELIKVKVKLDEKNVPT--------------IGRFLIIPPDVHGLLLA 178 (276) Q Consensus 131 --~~~--------------~~~~~t~~--~~~~~i~~a~~~l~~~~vP~--------------~~r~~vv~p~~~~~L~~ 178 (276) ... ....++.. ..++.|.+|+..++..+.+. +--+++++|.++..|+. T Consensus 160 ~N~v~aPt~~r~~~~~~at~~~~l~stD~~sl~~id~a~~~a~~~~~~~~~~~~~~Pv~~~g~~~yV~~l~p~q~~~Lr~ 239 (364) T protein:vir:93 160 GNPLDAPDVDHLLYGGVATSKASLAATDIMAPLVIEKAVEKAAMMQAENPDVANMVPVSIDGDDHYVCVMSEYQATDMRT 239 (364) T ss_pred ccccCCCCCCcEEeccccCchhhccccccccHHHHHHHHHHHHHhCCCCCCCcccceeEecCcceeEEEEcchhhhhhhh Confidence 000 00111111 23678888888887664320 11268999999999985 Q ss_pred --hHHhhhhcc-----cccccceeeeeeeEEeceEEEEeccccccccceE--------EEEEecce--EEeee----eee Q lcl|NC_019506. 179 --ADLIVGTGG-----AMAESITKNGFVGTILGFDVYLSNNMGSLTNGTG--------AIAGVKMA--CTFAE----QIV 237 (276) Q Consensus 179 --~~~~~~~~~-----~~~~~~~~~G~i~~~~G~~v~~s~~lp~~~~~~~--------~~~~~~~a--~~~~~----~~~ 237 (276) +++|..... .+.+..+.+|.+|+|.|+-+++..+++....... ++..-..| +++++ +.. T Consensus 240 ~t~~~w~d~qk~A~~~~g~~nPlF~G~~gm~ngvii~~~~~vi~~~~~~~~~~v~~~ralllGaQA~~~a~g~~~g~~~~ 319 (364) T protein:vir:93 240 AAGGTWIDFQKAAAAAEGRNNPIFKGGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMGRQAGVIAYGTANGLRFD 319 (364) T ss_pred cCCHHHHHHHHHhhhcccccCCceecCeeeEcCeEEeccCCcccccccccCccccchhhheecceeeEEEeecCCCCCce Confidence 445443322 2334567889999999999999888763321111 11111222 23322 111 Q ss_pred eeee-ccCcccceeeEEeeeeeeeEEE----cCCeEEEEEecCC Q lcl|NC_019506. 238 QTEA-YRMEKRFADAVKGLNVFGCKVI----YPDALVCLKKTNP 276 (276) Q Consensus 238 ~~e~-~~~~~~~~~~i~~~~~yg~~v~----~~~~vv~~~~~~p 276 (276) -.|. .+..+.. .|......|.+=. ..=|+.+|..+|| T Consensus 320 w~Ee~~D~gn~~--~i~~~~i~G~kK~rF~~~DfGvi~idtaa~ 361 (364) T protein:vir:93 320 WEETVKDYGNEP--AIAAGFIAGMKKARFNNKDFGVISIDTAAK 361 (364) T ss_pred eeecccCCCCch--hhhhhhHhhhhhcccCCccceEEEeccccc Confidence 1111 1111211 2223233332222 3668899999999 No 175 >protein:vir:10123 Length: 404 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:180 # MgeName: Stx2 converting bacteriophage II # Cross-refs: genbank:acc:NP_859253;genbank:gi:32171009;genbank:GeneID:2653345 Probab=98.55 E-value=6.9e-08 Score=59.84 Aligned_cols=272 Identities=13% Similarity=0.045 Sum_probs=148.2 Q ss_pred CccchhhHHHHHHHHHHHHHHhhcchhhhcccc-------ccccccCCcEEEEeccCcccceeecCCCCCC-Cccccccc Q lcl|NC_019506. 1 MAVTSFIPKLWSARLLAHLDKAHVVANLVNRDY-------EGEIKAYGDTVKINQIGAITVKEYTENSDID-APEELSTT 72 (276) Q Consensus 1 MA~~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~-------~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~-~~~~~~~~ 72 (276) +.|+..+ .+|+..+...-....-+..+..++. ...-...||+|+|.-...++-..+..+.... ..+.++.. T Consensus 22 ~~~~~~~-~~~~~~~~~~~~~~~~~~~~~g~~~~~~I~~~~dL~K~aGd~vtf~L~~~L~g~gv~Gd~~lEGnee~L~~~ 100 (404) T protein:vir:10 22 NRNRSMV-NILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLSKRPTMGDERVEGRGEDLSHA 100 (404) T ss_pred hcCChhH-hhhhhhhhhhhhhccchhhccCCCCCccEEEeecCCCCCCcEEEEeEeeecccCCcccCceeeccccceeEE Confidence 5555432 3444432222221211221111110 1111256999999877665544444333332 23568888 Q ss_pred eEEEEEEeeeecceeec-hHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccc---------------------- Q lcl|NC_019506. 73 EKVLEINKQKYFNFQID-DVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNAT---------------------- 129 (276) Q Consensus 73 ~~~~~ld~~~~~~~~v~-d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~---------------------- 129 (276) +-.++||+.++ ++... .+..-.+..|++++....+...+++..|+.++-.+..... T Consensus 101 s~~i~Idq~r~-~V~~~g~msqQRt~~dlr~~ar~~L~~w~~~~~d~~~~~~laG~rg~~~n~~~~vp~~~~~~~~~~~~ 179 (404) T protein:vir:10 101 DFSLKINQGRH-LVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFVADDTILPTAEHPEFKKIMI 179 (404) T ss_pred eeEEEEeeecc-cccccCchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccceeeccccccccceee Confidence 89999998765 34333 4556668899999999999999999999988755543221 Q ss_pred ccccc---------cccCCHHH-------HHHHHHHHHHHHhhcCCCcc--------------CCEEEECHHHHHHHhhh Q lcl|NC_019506. 130 SKLKP---------AATLDKTN-------IYEELIKVKVKLDEKNVPTI--------------GRFLIIPPDVHGLLLAA 179 (276) Q Consensus 130 ~~~~~---------~~~~t~~~-------~~~~i~~a~~~l~~~~vP~~--------------~r~~vv~p~~~~~L~~~ 179 (276) +...+ +.+++..+ .++.|..+++.+++..-|.. -++++++|.++.+|+.+ T Consensus 180 N~v~APt~~r~~~~g~at~~~~l~stD~~s~~~Id~~~~~~~~~~~pi~Pv~~~g~~~~~~~~~yV~~~~p~q~~~Lr~d 259 (404) T protein:vir:10 180 NDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGEDPYYVLYVTPRQWNDWYTS 259 (404) T ss_pred cccCCCCCCcEEeccCccchhhhhhcccccHHHHHHHHHHHHHhCCCCcceEeccccccCccceEEEEechHHHHHHhhC Confidence 00001 11111111 25567788888877554421 16789999999999998 Q ss_pred H---Hhhhhcc------cccccceeeeeeeEEeceEEEEecccccc--------------ccceE----------EEEEe Q lcl|NC_019506. 180 D---LIVGTGG------AMAESITKNGFVGTILGFDVYLSNNMGSL--------------TNGTG----------AIAGV 226 (276) Q Consensus 180 ~---~~~~~~~------~~~~~~~~~G~i~~~~G~~v~~s~~lp~~--------------~~~~~----------~~~~~ 226 (276) + +|..... -+.+..+.+|.+|+|.|+-+++..++|.- +.+.. ++..- T Consensus 260 t~~~~w~d~q~~A~a~~rg~~nPlF~G~~gm~ngvii~~~~~~~Irf~~g~~~~~~~n~~~a~~~~~aa~~~v~RallLG 339 (404) T protein:vir:10 260 TSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNLTATTKEVAAATNIDRAMLLG 339 (404) T ss_pred CCcHHHHHHHHHHhhccccccCCceecCeeEEcCEEEEecCCceeeecccceeeecCCccccccccccccccchhheeec Confidence 5 3433211 13456788999999999999987766520 10000 11111 Q ss_pred cceEEee--ee----eeee-eeccCcccceeeEEeeeeeeeEEEc---------CCeEEEEEecCC Q lcl|NC_019506. 227 KMACTFA--EQ----IVQT-EAYRMEKRFADAVKGLNVFGCKVIY---------PDALVCLKKTNP 276 (276) Q Consensus 227 ~~a~~~~--~~----~~~~-e~~~~~~~~~~~i~~~~~yg~~v~~---------~~~vv~~~~~~p 276 (276) ..|++.+ +. ..-. |..+..+. -.|......|.+=+| .=|+.+|..+|| T Consensus 340 aQAl~~A~g~~~g~~~~w~Ee~~D~g~~--~~i~~~~i~G~kK~rF~~~~g~~~DfGvi~idta~~ 403 (404) T protein:vir:10 340 AQALANAYGQKAGGHFNMVEKKTDMDNR--TEIAISWINGLKKIRFPEKSGKMQDHGVIAVDTAVK 403 (404) T ss_pred ceeEEEEeeccCCCCceeEeeccccCch--hhhhhHHHhhhhhccccCCCCceeeEEEEEeccccc Confidence 2222222 21 1101 11111111 133333444443333 447888999999 No 176 >protein:vir:819 Length: 404 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:16 # MgeName: VT2-Sa # Cross-refs: genbank:acc:NP_050552;genbank:gi:9633449;genbank:GeneID:1262254 Probab=98.55 E-value=6.9e-08 Score=59.84 Aligned_cols=272 Identities=13% Similarity=0.045 Sum_probs=148.2 Q ss_pred CccchhhHHHHHHHHHHHHHHhhcchhhhcccc-------ccccccCCcEEEEeccCcccceeecCCCCCC-Cccccccc Q lcl|NC_019506. 1 MAVTSFIPKLWSARLLAHLDKAHVVANLVNRDY-------EGEIKAYGDTVKINQIGAITVKEYTENSDID-APEELSTT 72 (276) Q Consensus 1 MA~~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~-------~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~-~~~~~~~~ 72 (276) +.|+..+ .+|+..+...-....-+..+..++. ...-...||+|+|.-...++-..+..+.... ..+.++.. T Consensus 22 ~~~~~~~-~~~~~~~~~~~~~~~~~~~~~g~~~~~~I~~~~dL~K~aGd~vtf~L~~~L~g~gv~Gd~~lEGnee~L~~~ 100 (404) T protein:vir:81 22 NRNRSMV-NILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLSKRPTMGDERVEGRGEDLSHA 100 (404) T ss_pred hcCChhH-hhhhhhhhhhhhhccchhhccCCCCCccEEEeecCCCCCCcEEEEeEeeecccCCcccCceeeccccceeEE Confidence 5555432 3444432222221211221111110 1111256999999877665544444333332 23568888 Q ss_pred eEEEEEEeeeecceeec-hHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccc---------------------- Q lcl|NC_019506. 73 EKVLEINKQKYFNFQID-DVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNAT---------------------- 129 (276) Q Consensus 73 ~~~~~ld~~~~~~~~v~-d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~---------------------- 129 (276) +-.++||+.++ ++... .+..-.+..|++++....+...+++..|+.++-.+..... T Consensus 101 s~~i~Idq~r~-~V~~~g~msqQRt~~dlr~~ar~~L~~w~~~~~d~~~~~~laG~rg~~~n~~~~vp~~~~~~~~~~~~ 179 (404) T protein:vir:81 101 DFSLKINQGRH-LVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFVADDTILPTAEHPEFKKIMI 179 (404) T ss_pred eeEEEEeeecc-cccccCchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccceeeccccccccceee Confidence 89999998765 34333 4556668899999999999999999999988755543221 Q ss_pred ccccc---------cccCCHHH-------HHHHHHHHHHHHhhcCCCcc--------------CCEEEECHHHHHHHhhh Q lcl|NC_019506. 130 SKLKP---------AATLDKTN-------IYEELIKVKVKLDEKNVPTI--------------GRFLIIPPDVHGLLLAA 179 (276) Q Consensus 130 ~~~~~---------~~~~t~~~-------~~~~i~~a~~~l~~~~vP~~--------------~r~~vv~p~~~~~L~~~ 179 (276) +...+ +.+++..+ .++.|..+++.+++..-|.. -++++++|.++.+|+.+ T Consensus 180 N~v~APt~~r~~~~g~at~~~~l~stD~~s~~~Id~~~~~~~~~~~pi~Pv~~~g~~~~~~~~~yV~~~~p~q~~~Lr~d 259 (404) T protein:vir:81 180 NDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGEDPYYVLYVTPRQWNDWYTS 259 (404) T ss_pred cccCCCCCCcEEeccCccchhhhhhcccccHHHHHHHHHHHHHhCCCCcceEeccccccCccceEEEEechHHHHHHhhC Confidence 00001 11111111 25567788888877554421 16789999999999998 Q ss_pred H---Hhhhhcc------cccccceeeeeeeEEeceEEEEecccccc--------------ccceE----------EEEEe Q lcl|NC_019506. 180 D---LIVGTGG------AMAESITKNGFVGTILGFDVYLSNNMGSL--------------TNGTG----------AIAGV 226 (276) Q Consensus 180 ~---~~~~~~~------~~~~~~~~~G~i~~~~G~~v~~s~~lp~~--------------~~~~~----------~~~~~ 226 (276) + +|..... -+.+..+.+|.+|+|.|+-+++..++|.- +.+.. ++..- T Consensus 260 t~~~~w~d~q~~A~a~~rg~~nPlF~G~~gm~ngvii~~~~~~~Irf~~g~~~~~~~n~~~a~~~~~aa~~~v~RallLG 339 (404) T protein:vir:81 260 TSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNLTATTKEVAAATNIDRAMLLG 339 (404) T ss_pred CCcHHHHHHHHHHhhccccccCCceecCeeEEcCEEEEecCCceeeecccceeeecCCccccccccccccccchhheeec Confidence 5 3433211 13456788999999999999987766520 10000 11111 Q ss_pred cceEEee--ee----eeee-eeccCcccceeeEEeeeeeeeEEEc---------CCeEEEEEecCC Q lcl|NC_019506. 227 KMACTFA--EQ----IVQT-EAYRMEKRFADAVKGLNVFGCKVIY---------PDALVCLKKTNP 276 (276) Q Consensus 227 ~~a~~~~--~~----~~~~-e~~~~~~~~~~~i~~~~~yg~~v~~---------~~~vv~~~~~~p 276 (276) ..|++.+ +. ..-. |..+..+. -.|......|.+=+| .=|+.+|..+|| T Consensus 340 aQAl~~A~g~~~g~~~~w~Ee~~D~g~~--~~i~~~~i~G~kK~rF~~~~g~~~DfGvi~idta~~ 403 (404) T protein:vir:81 340 AQALANAYGQKAGGHFNMVEKKTDMDNR--TEIAISWINGLKKIRFPEKSGKMQDHGVIAVDTAVK 403 (404) T ss_pred ceeEEEEeeccCCCCceeEeeccccCch--hhhhhHHHhhhhhccccCCCCceeeEEEEEeccccc Confidence 2222222 21 1101 11111111 133333444443333 447888999999 No 177 >protein:vir:104439 Length: 404 # NCBI annotation: putative virion structural protein # Family: family:all:974 # MgeID: mge:1471 # MgeName: 86 # Cross-refs: genbank:acc:YP_794063;genbank:gi:116222008;genbank:GeneID:4397504 Probab=98.55 E-value=6.9e-08 Score=59.84 Aligned_cols=272 Identities=13% Similarity=0.045 Sum_probs=148.2 Q ss_pred CccchhhHHHHHHHHHHHHHHhhcchhhhcccc-------ccccccCCcEEEEeccCcccceeecCCCCCC-Cccccccc Q lcl|NC_019506. 1 MAVTSFIPKLWSARLLAHLDKAHVVANLVNRDY-------EGEIKAYGDTVKINQIGAITVKEYTENSDID-APEELSTT 72 (276) Q Consensus 1 MA~~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~-------~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~-~~~~~~~~ 72 (276) +.|+..+ .+|+..+...-....-+..+..++. ...-...||+|+|.-...++-..+..+.... ..+.++.. T Consensus 22 ~~~~~~~-~~~~~~~~~~~~~~~~~~~~~g~~~~~~I~~~~dL~K~aGd~vtf~L~~~L~g~gv~Gd~~lEGnee~L~~~ 100 (404) T protein:vir:10 22 NRNRSMV-NILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLSKRPTMGDERVEGRGEDLSHA 100 (404) T ss_pred hcCChhH-hhhhhhhhhhhhhccchhhccCCCCCccEEEeecCCCCCCcEEEEeEeeecccCCcccCceeeccccceeEE Confidence 5555432 3444432222221211221111110 1111256999999877665544444333332 23568888 Q ss_pred eEEEEEEeeeecceeec-hHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccc---------------------- Q lcl|NC_019506. 73 EKVLEINKQKYFNFQID-DVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNAT---------------------- 129 (276) Q Consensus 73 ~~~~~ld~~~~~~~~v~-d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~---------------------- 129 (276) +-.++||+.++ ++... .+..-.+..|++++....+...+++..|+.++-.+..... T Consensus 101 s~~i~Idq~r~-~V~~~g~msqQRt~~dlr~~ar~~L~~w~~~~~d~~~~~~laG~rg~~~n~~~~vp~~~~~~~~~~~~ 179 (404) T protein:vir:10 101 DFSLKINQGRH-LVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFVADDTILPTAEHPEFKKIMI 179 (404) T ss_pred eeEEEEeeecc-cccccCchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccceeeccccccccceee Confidence 89999998765 34333 4556668899999999999999999999988755543221 Q ss_pred ccccc---------cccCCHHH-------HHHHHHHHHHHHhhcCCCcc--------------CCEEEECHHHHHHHhhh Q lcl|NC_019506. 130 SKLKP---------AATLDKTN-------IYEELIKVKVKLDEKNVPTI--------------GRFLIIPPDVHGLLLAA 179 (276) Q Consensus 130 ~~~~~---------~~~~t~~~-------~~~~i~~a~~~l~~~~vP~~--------------~r~~vv~p~~~~~L~~~ 179 (276) +...+ +.+++..+ .++.|..+++.+++..-|.. -++++++|.++.+|+.+ T Consensus 180 N~v~APt~~r~~~~g~at~~~~l~stD~~s~~~Id~~~~~~~~~~~pi~Pv~~~g~~~~~~~~~yV~~~~p~q~~~Lr~d 259 (404) T protein:vir:10 180 NDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGEDPYYVLYVTPRQWNDWYTS 259 (404) T ss_pred cccCCCCCCcEEeccCccchhhhhhcccccHHHHHHHHHHHHHhCCCCcceEeccccccCccceEEEEechHHHHHHhhC Confidence 00001 11111111 25567788888877554421 16789999999999998 Q ss_pred H---Hhhhhcc------cccccceeeeeeeEEeceEEEEecccccc--------------ccceE----------EEEEe Q lcl|NC_019506. 180 D---LIVGTGG------AMAESITKNGFVGTILGFDVYLSNNMGSL--------------TNGTG----------AIAGV 226 (276) Q Consensus 180 ~---~~~~~~~------~~~~~~~~~G~i~~~~G~~v~~s~~lp~~--------------~~~~~----------~~~~~ 226 (276) + +|..... -+.+..+.+|.+|+|.|+-+++..++|.- +.+.. ++..- T Consensus 260 t~~~~w~d~q~~A~a~~rg~~nPlF~G~~gm~ngvii~~~~~~~Irf~~g~~~~~~~n~~~a~~~~~aa~~~v~RallLG 339 (404) T protein:vir:10 260 TSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNLTATTKEVAAATNIDRAMLLG 339 (404) T ss_pred CCcHHHHHHHHHHhhccccccCCceecCeeEEcCEEEEecCCceeeecccceeeecCCccccccccccccccchhheeec Confidence 5 3433211 13456788999999999999987766520 10000 11111 Q ss_pred cceEEee--ee----eeee-eeccCcccceeeEEeeeeeeeEEEc---------CCeEEEEEecCC Q lcl|NC_019506. 227 KMACTFA--EQ----IVQT-EAYRMEKRFADAVKGLNVFGCKVIY---------PDALVCLKKTNP 276 (276) Q Consensus 227 ~~a~~~~--~~----~~~~-e~~~~~~~~~~~i~~~~~yg~~v~~---------~~~vv~~~~~~p 276 (276) ..|++.+ +. ..-. |..+..+. -.|......|.+=+| .=|+.+|..+|| T Consensus 340 aQAl~~A~g~~~g~~~~w~Ee~~D~g~~--~~i~~~~i~G~kK~rF~~~~g~~~DfGvi~idta~~ 403 (404) T protein:vir:10 340 AQALANAYGQKAGGHFNMVEKKTDMDNR--TEIAISWINGLKKIRFPEKSGKMQDHGVIAVDTAVK 403 (404) T ss_pred ceeEEEEeeccCCCCceeEeeccccCch--hhhhhHHHhhhhhccccCCCCceeeEEEEEeccccc Confidence 2222222 21 1101 11111111 133333444443333 447888999999 No 178 >protein:vir:3298 Length: 404 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:66 # MgeName: 933W # Cross-refs: genbank:acc:NP_049514;genbank:gi:9632520;genbank:GeneID:1262006 Probab=98.55 E-value=6.9e-08 Score=59.84 Aligned_cols=272 Identities=13% Similarity=0.045 Sum_probs=148.2 Q ss_pred CccchhhHHHHHHHHHHHHHHhhcchhhhcccc-------ccccccCCcEEEEeccCcccceeecCCCCCC-Cccccccc Q lcl|NC_019506. 1 MAVTSFIPKLWSARLLAHLDKAHVVANLVNRDY-------EGEIKAYGDTVKINQIGAITVKEYTENSDID-APEELSTT 72 (276) Q Consensus 1 MA~~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~-------~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~-~~~~~~~~ 72 (276) +.|+..+ .+|+..+...-....-+..+..++. ...-...||+|+|.-...++-..+..+.... ..+.++.. T Consensus 22 ~~~~~~~-~~~~~~~~~~~~~~~~~~~~~g~~~~~~I~~~~dL~K~aGd~vtf~L~~~L~g~gv~Gd~~lEGnee~L~~~ 100 (404) T protein:vir:32 22 NRNRSMV-NILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLSKRPTMGDERVEGRGEDLSHA 100 (404) T ss_pred hcCChhH-hhhhhhhhhhhhhccchhhccCCCCCccEEEeecCCCCCCcEEEEeEeeecccCCcccCceeeccccceeEE Confidence 5555432 3444432222221211221111110 1111256999999877665544444333332 23568888 Q ss_pred eEEEEEEeeeecceeec-hHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccc---------------------- Q lcl|NC_019506. 73 EKVLEINKQKYFNFQID-DVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNAT---------------------- 129 (276) Q Consensus 73 ~~~~~ld~~~~~~~~v~-d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~---------------------- 129 (276) +-.++||+.++ ++... .+..-.+..|++++....+...+++..|+.++-.+..... T Consensus 101 s~~i~Idq~r~-~V~~~g~msqQRt~~dlr~~ar~~L~~w~~~~~d~~~~~~laG~rg~~~n~~~~vp~~~~~~~~~~~~ 179 (404) T protein:vir:32 101 DFSLKINQGRH-LVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFVADDTILPTAEHPEFKKIMI 179 (404) T ss_pred eeEEEEeeecc-cccccCchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccceeeccccccccceee Confidence 89999998765 34333 4556668899999999999999999999988755543221 Q ss_pred ccccc---------cccCCHHH-------HHHHHHHHHHHHhhcCCCcc--------------CCEEEECHHHHHHHhhh Q lcl|NC_019506. 130 SKLKP---------AATLDKTN-------IYEELIKVKVKLDEKNVPTI--------------GRFLIIPPDVHGLLLAA 179 (276) Q Consensus 130 ~~~~~---------~~~~t~~~-------~~~~i~~a~~~l~~~~vP~~--------------~r~~vv~p~~~~~L~~~ 179 (276) +...+ +.+++..+ .++.|..+++.+++..-|.. -++++++|.++.+|+.+ T Consensus 180 N~v~APt~~r~~~~g~at~~~~l~stD~~s~~~Id~~~~~~~~~~~pi~Pv~~~g~~~~~~~~~yV~~~~p~q~~~Lr~d 259 (404) T protein:vir:32 180 NDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGEDPYYVLYVTPRQWNDWYTS 259 (404) T ss_pred cccCCCCCCcEEeccCccchhhhhhcccccHHHHHHHHHHHHHhCCCCcceEeccccccCccceEEEEechHHHHHHhhC Confidence 00001 11111111 25567788888877554421 16789999999999998 Q ss_pred H---Hhhhhcc------cccccceeeeeeeEEeceEEEEecccccc--------------ccceE----------EEEEe Q lcl|NC_019506. 180 D---LIVGTGG------AMAESITKNGFVGTILGFDVYLSNNMGSL--------------TNGTG----------AIAGV 226 (276) Q Consensus 180 ~---~~~~~~~------~~~~~~~~~G~i~~~~G~~v~~s~~lp~~--------------~~~~~----------~~~~~ 226 (276) + +|..... -+.+..+.+|.+|+|.|+-+++..++|.- +.+.. ++..- T Consensus 260 t~~~~w~d~q~~A~a~~rg~~nPlF~G~~gm~ngvii~~~~~~~Irf~~g~~~~~~~n~~~a~~~~~aa~~~v~RallLG 339 (404) T protein:vir:32 260 TSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNLTATTKEVAAATNIDRAMLLG 339 (404) T ss_pred CCcHHHHHHHHHHhhccccccCCceecCeeEEcCEEEEecCCceeeecccceeeecCCccccccccccccccchhheeec Confidence 5 3433211 13456788999999999999987766520 10000 11111 Q ss_pred cceEEee--ee----eeee-eeccCcccceeeEEeeeeeeeEEEc---------CCeEEEEEecCC Q lcl|NC_019506. 227 KMACTFA--EQ----IVQT-EAYRMEKRFADAVKGLNVFGCKVIY---------PDALVCLKKTNP 276 (276) Q Consensus 227 ~~a~~~~--~~----~~~~-e~~~~~~~~~~~i~~~~~yg~~v~~---------~~~vv~~~~~~p 276 (276) ..|++.+ +. ..-. |..+..+. -.|......|.+=+| .=|+.+|..+|| T Consensus 340 aQAl~~A~g~~~g~~~~w~Ee~~D~g~~--~~i~~~~i~G~kK~rF~~~~g~~~DfGvi~idta~~ 403 (404) T protein:vir:32 340 AQALANAYGQKAGGHFNMVEKKTDMDNR--TEIAISWINGLKKIRFPEKSGKMQDHGVIAVDTAVK 403 (404) T ss_pred ceeEEEEeeccCCCCceeEeeccccCch--hhhhhHHHhhhhhccccCCCCceeeEEEEEeccccc Confidence 2222222 21 1101 11111111 133333444443333 447888999999 No 179 >protein:vir:2770 Length: 318 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:59 # MgeName: Stx2 converting bacteriophage I # Cross-refs: genbank:acc:NP_612887;genbank:gi:20065804;genbank:GeneID:935710 Probab=98.49 E-value=1.4e-07 Score=58.14 Aligned_cols=227 Identities=11% Similarity=0.042 Sum_probs=133.6 Q ss_pred CccchhhHHHHHHHHHHHHHHhhcchhhhccccc-------cccccCCcEEEEeccCcccceeecCCCCCC-Cccccccc Q lcl|NC_019506. 1 MAVTSFIPKLWSARLLAHLDKAHVVANLVNRDYE-------GEIKAYGDTVKINQIGAITVKEYTENSDID-APEELSTT 72 (276) Q Consensus 1 MA~~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~-------~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~-~~~~~~~~ 72 (276) ++|+.. -.+|++.+...-.+...+..+..++.. +.-+..||+|+|.-...++-..+..+.... ..+.++.. T Consensus 22 ~~~~~~-vk~ws~~l~~~~~~~~~~~~~~g~~~~~~I~r~~dL~K~~GD~Vtf~L~~~L~g~gv~Gd~~lEGnee~L~~~ 100 (318) T protein:vir:27 22 NRNRSM-VNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLSKRPTMGDERVEGRGEDLSHA 100 (318) T ss_pred hcCChH-HHHHHHhhhhHHHhhhhhhcccCCCCCceEEEeccCCCCCccEEEEeEeeccccCccccCceeeccccceEEE Confidence 444442 357877665544444444443322111 111357999999877555443333333332 23567888 Q ss_pred eEEEEEEeeeecceee-chHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccc---------------------- Q lcl|NC_019506. 73 EKVLEINKQKYFNFQI-DDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNAT---------------------- 129 (276) Q Consensus 73 ~~~~~ld~~~~~~~~v-~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~---------------------- 129 (276) +-.++||+.++ ++.. ..++.-.+..|++++....+...+++..|+-++-.+..+.. T Consensus 101 ~d~l~IDq~r~-~V~~gg~msqqRt~~dlR~~ar~~L~~w~~~~~Dq~~~v~laGarg~~~n~~~~~p~~~~~~~~~~~~ 179 (318) T protein:vir:27 101 DFSLKINQGRH-LVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFVADDTILPTAEHPEFKKIMI 179 (318) T ss_pred eeEEEEeeecc-ccccccchhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccceEecccCccchhhhh Confidence 88999988765 4433 34566678899999999999999999999987655532211 Q ss_pred ccccc---------ccc-----CCHHH--HHHHHHHHHHHHhhcCCCc-------cC-------CEEEECHHHHHHHhhh Q lcl|NC_019506. 130 SKLKP---------AAT-----LDKTN--IYEELIKVKVKLDEKNVPT-------IG-------RFLIIPPDVHGLLLAA 179 (276) Q Consensus 130 ~~~~~---------~~~-----~t~~~--~~~~i~~a~~~l~~~~vP~-------~~-------r~~vv~p~~~~~L~~~ 179 (276) +...+ +.+ ++..+ .++.|..++..+++..-|. +. ++|+++|.++..|+.+ T Consensus 180 N~v~aPt~~r~~~~g~at~~~~l~stD~~s~~lid~~~~~~~~~a~pi~PV~v~g~~~~~~~~~yV~~~~p~q~~~Lrtd 259 (318) T protein:vir:27 180 NDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGEDPYYVLYVTPRQWNDWYTS 259 (318) T ss_pred cccCCCCCCcEEeccCccchhhhhhcccccHHHHHHHHHHHHHhCCCCcceeeccccccCCcceEEEEechHHHHHHhhc Confidence 00011 111 11111 2456777888887744331 11 5789999999999987 Q ss_pred H---Hhhhh----ccc--ccccceeeeeeeEEeceEEEEeccccccccceEEEEEecceEEeeeeee Q lcl|NC_019506. 180 D---LIVGT----GGA--MAESITKNGFVGTILGFDVYLSNNMGSLTNGTGAIAGVKMACTFAEQIV 237 (276) Q Consensus 180 ~---~~~~~----~~~--~~~~~~~~G~i~~~~G~~v~~s~~lp~~~~~~~~~~~~~~a~~~~~~~~ 237 (276) . +|... ... +.+..+..|.+|+|.|+=+++..++|.-= ..+...-+.+++ T Consensus 260 t~~~~w~d~q~~A~~r~~g~knPLF~G~~gm~ngvil~~~~~vpIrf--------~~G~~v~~~~~~ 318 (318) T protein:vir:27 260 TSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRF--------YQGQRFWYQRIT 318 (318) T ss_pred CCCHHHHHHHHHHHhcccccCCCceecceeeecCEEEeecCCccEEE--------cCCCeeeeeecC Confidence 5 34432 111 33456889999999999999999886310 011111122222 No 180 >protein:vir:95131 Length: 325 # NCBI annotation: hypothetical protein ORF010 # Family: family:all:47 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293417;genbank:gi:148912838;genbank:GeneID:5228206 Probab=98.37 E-value=3.5e-07 Score=55.99 Aligned_cols=263 Identities=10% Similarity=-0.002 Sum_probs=136.9 Q ss_pred Cccc---hhhHHHHHHHHHHHHHHhhcchhhhccc---cccccccCCcEEEEeccCcccc-----eeecCCCCCCCcccc Q lcl|NC_019506. 1 MAVT---SFIPKLWSARLLAHLDKAHVVANLVNRD---YEGEIKAYGDTVKINQIGAITV-----KEYTENSDIDAPEEL 69 (276) Q Consensus 1 MA~~---~l~~e~~~~~~~~~l~~~~v~~~~~~~~---~~~~~~~~Gdtv~ip~~~~~~~-----~d~~~~~~~~~~~~~ 69 (276) ||.. .|.|+++...+...-+...+|-.. ... +..+ ...||.++.|-+..+.- .++...+.+. +..+ T Consensus 1 m~lsD~~vfN~~~~~a~~e~~~q~~~~fn~a-s~gai~l~~~-~~~Gd~~~~pf~~~l~g~~~~~~~~~~~~~vt-~~ki 77 (325) T protein:vir:95 1 MALSDLAVYSEYAYSAFSETLRQQVDLFNTA-TGGAIMLQSA-AHQGDFSDVAFFAKVTGGLVRRRNAYGSGTVA-EKVL 77 (325) T ss_pred CchhhhhhhhhhhhhhhhhhhhhhHhhhhhc-ccceeEeccc-cccCceeeccccccccccccccccCCCCceec-ccee Confidence 8864 366777766544422222223221 110 0111 12589999998865422 2333333332 2344 Q ss_pred ccceEEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHh----hccccc----cccccccCCHH Q lcl|NC_019506. 70 STTEKVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEM----DTNATS----KLKPAATLDKT 141 (276) Q Consensus 70 ~~~~~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~----~~~~~~----~~~~~~~~t~~ 141 (276) +. ...+.+.-.+..++...|+.....-.+.++++....+..+++...++++..+ .++-.. +....+..+.. T Consensus 78 tt-~~~~av~~~r~~g~~~~d~~~~~~g~~~~~~~~~~Ig~~~a~~~~~~~l~~~~~~l~~a~~~~~~~v~dis~~~~~~ 156 (325) T protein:vir:95 78 KH-LVDTSVKVAAGTPPVRLDPGQFRWIQQNPEVAGAAMGQQLAVDTMADMLNVGLGSVYSALSQVSDVVYDATANTDAA 156 (325) T ss_pred cc-ccceeeEEecccCcccccHHHHhhcCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccceeeeecccCcc Confidence 42 3333333455666666777766666677777777777777776666544433 221111 11111111111 Q ss_pred ---HHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEecccccccc Q lcl|NC_019506. 142 ---NIYEELIKVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSNNMGSLTN 218 (276) Q Consensus 142 ---~~~~~i~~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp~~~~ 218 (276) ...+.|.+|..+|.++.- .=..++||+..|..|.++ .+.+..+....+... .|..++|-.|+++..+|+... T Consensus 157 ~~~~s~~~l~~A~~klGD~~~--~l~~~~MHS~v~~~L~~~-~L~~~~~~~~~~g~~--~i~t~~G~~VIVdD~~p~~~~ 231 (325) T protein:vir:95 157 DKLPTWNNLNNGQAKFGDQSS--QIAAWIMHSTPMHKLYGS-NLTNGERLFTYGTVN--VVRDPFGKLLVMTDSPNLFAA 231 (325) T ss_pred cccccHHHHHHHHHHhccccc--ceeEEEEchHHHHHHHHh-hccccccccccCCcc--cccccCCcEEEEeCCCCCCCc Confidence 135789999999977641 113589999999999886 344332211111111 356789999999999997543 Q ss_pred ----ceEEEEEecceEEeeeeee----eeeeccCcccceeeEEeeeeeeeEEEcCCeEEEEE---ecCC Q lcl|NC_019506. 219 ----GTGAIAGVKMACTFAEQIV----QTEAYRMEKRFADAVKGLNVFGCKVIYPDALVCLK---KTNP 276 (276) Q Consensus 219 ----~~~~~~~~~~a~~~~~~~~----~~e~~~~~~~~~~~i~~~~~yg~~v~~~~~vv~~~---~~~p 276 (276) ....+.+.++|+++...-. ..+..+. ...+..++.++ ..++.|-|+---+ -..| T Consensus 232 g~~~~ytty~lg~GAi~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~---tf~lhp~G~sw~~s~~g~sP 296 (325) T protein:vir:95 232 GTPNVYHILGLVPGGVLIGQNNDFDANEETKNGD-ENIIRTYQAEW---SYNIGVKGFAWDKANGGKSP 296 (325) T ss_pred cCceeEEEEEEecCeEEecCCCCccccccccCcc-cceeeeeeeee---eEEeecceeeeecccccCCc Confidence 3446778889988764322 1122222 22223333222 2344454444421 1356 No 181 >protein:vir:105610 Length: 430 # NCBI annotation: virion structural protein # Family: family:all:974 # MgeID: mge:1540 # MgeName: F116 # Cross-refs: genbank:acc:YP_164307;genbank:gi:56692923;genbank:GeneID:3197221 Probab=98.35 E-value=2.3e-07 Score=56.98 Aligned_cols=272 Identities=11% Similarity=0.071 Sum_probs=149.2 Q ss_pred Cc--------cchhhHHHHHHHHHHHHHHhhcc-hhhhc------------------------c--ccccccccCCcEEE Q lcl|NC_019506. 1 MA--------VTSFIPKLWSARLLAHLDKAHVV-ANLVN------------------------R--DYEGEIKAYGDTVK 45 (276) Q Consensus 1 MA--------~~~l~~e~~~~~~~~~l~~~~v~-~~~~~------------------------~--~~~~~~~~~Gdtv~ 45 (276) |. ++.....+|++.+...-.+...| ..++. + |++ ...||+|+ T Consensus 1 ~~~a~T~~~~~~p~a~~~ws~~l~~~~~k~~~~~~kl~G~~~~~~~~~~~~~~~~ts~~~pI~r~~dL~---K~~GD~Vt 77 (430) T protein:vir:10 1 MTASKTTMRYGDPNAMIQQAAGLFALCQGRNSTLNRLTGKMPSGTSDAEKKTKGQSSLELPIVQAQDLG---RNKGDEVR 77 (430) T ss_pred CcceeeecccCChhHHHHHHHHHHHHHhhhhhhHHHhhccccccccchhhhccCCCCCCccEEEeccCC---CCCccEEE Confidence 43 22344678988876655443223 22222 1 222 34799999 Q ss_pred EeccCcccceeecCCCCCC-CccccccceEEEEEEeeeecceeec-hHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019506. 46 INQIGAITVKEYTENSDID-APEELSTTEKVLEINKQKYFNFQID-DVDAAQIRTPLMDAAMQRAAYALADETEKILLKE 123 (276) Q Consensus 46 ip~~~~~~~~d~~~~~~~~-~~~~~~~~~~~~~ld~~~~~~~~v~-d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~ 123 (276) |.-...++-..+..+.... ..+.++..+-.++||+.++ ++... .+..-.+..|++++....+..-+++..|+-++-. T Consensus 78 f~L~~~L~g~gv~Gd~~lEGnee~L~~~~d~l~IDq~R~-~V~~gg~msqQRt~~dlR~~ar~~L~~w~~~~~Dq~~~v~ 156 (430) T protein:vir:10 78 FHFVQPANAFPIMGSEYAEGKGTGLKIGSDQLRVNQARF-PVDLGDVMSQIRNPYDLRRLGRPKAKWFMDAYLDQSMLVH 156 (430) T ss_pred EeEeeccccCceecCceeeccccceEEEeeEEEEeeecc-ccccCCchhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 9877655443333333332 2356888889999998765 55555 3455567899999999999999999999877654 Q ss_pred hhccc-----------------------ccccc---------cccc------------CCHHH--HHHHHHHHHHHHhhc Q lcl|NC_019506. 124 MDTNA-----------------------TSKLK---------PAAT------------LDKTN--IYEELIKVKVKLDEK 157 (276) Q Consensus 124 ~~~~~-----------------------~~~~~---------~~~~------------~t~~~--~~~~i~~a~~~l~~~ 157 (276) +..+. +.... .+.+ ++..+ .++.|.+|+..++.+ T Consensus 157 laGarg~~~~~~~~~~~~~~~~~~~~~~N~v~aPt~nrh~~~~G~at~~~~~~~~~~sl~stD~~s~~~id~a~~~a~~~ 236 (430) T protein:vir:10 157 LAGARGNHYNKEWCLPLETHPKLADMLVNRVKAPTKNRHFVASADAITGVAPNAGEYNITTADVLDVDVVDSIATYMDQI 236 (430) T ss_pred HhhhhcccccccccccccCCcchhhhhccccCCCCCceeEeecccccccccccccccchhhhcccCHHHHHHHHHHHHhh Confidence 43210 00000 0111 11122 256788888888886 Q ss_pred CCCc-------cC-------CEEEECHHHHHHHhhhHHhhh-------hcccccccceeeeeeeEEeceEEEEecccc-- Q lcl|NC_019506. 158 NVPT-------IG-------RFLIIPPDVHGLLLAADLIVG-------TGGAMAESITKNGFVGTILGFDVYLSNNMG-- 214 (276) Q Consensus 158 ~vP~-------~~-------r~~vv~p~~~~~L~~~~~~~~-------~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp-- 214 (276) ..|. +. ++|+++|.++..|+.++.+.. +...+.+..+.+|.+|+|.|+-+++...+- T Consensus 237 ~~~i~Pv~v~gd~~~g~~~~yV~~~~p~q~~~Lr~dt~~~~wq~~~~a~a~~g~~nPlF~G~~gm~ngvii~~~~~virf 316 (430) T protein:vir:10 237 ELPPPPVKFEGDEAAEDSPIRVLLCSPAQYNSFAKQEKFRSWQAAALARASNAKQHPIFRVDAGLWSNTLIIKMPKPIRF 316 (430) T ss_pred CCCCcceEeecccccCCccEEEEEechHHHHHHhhCcchHHHHHHHHHhhcccccCCceecceeeecCeEEecCCceeee Confidence 5331 12 578999999999999987632 112233456789999999999999865331 Q ss_pred ------cccc----ce-----------------EEEEEecce--EEeeeee------eeee-eccCcccc---eeeEEee Q lcl|NC_019506. 215 ------SLTN----GT-----------------GAIAGVKMA--CTFAEQI------VQTE-AYRMEKRF---ADAVKGL 255 (276) Q Consensus 215 ------~~~~----~~-----------------~~~~~~~~a--~~~~~~~------~~~e-~~~~~~~~---~~~i~~~ 255 (276) ..++ .. -++..-..| .++++.. .=.| ..+..+.. ...|.|. T Consensus 317 ~~g~~~~~~a~~~~~~~~~~~~~a~~~~~~~v~RalllGaQA~~~A~g~~~~~g~~f~w~Ee~~D~g~~~~i~~~~i~G~ 396 (430) T protein:vir:10 317 YAGDTIKYCAAYNSEAESSAVVSDSFGNQYAVDRALLLGGQALAQAWAASEHSGMPFFWSEKDMDHGDKLELLIGAILGC 396 (430) T ss_pred cCCCccccccCCcccccccccccccccccccchhhhhccchhheeeeeccCCCCcceeeeeeccccCchhhhhhhHHhcc Confidence 0000 00 000011111 2222210 0011 11111111 1111111 Q ss_pred --eeee-----eEEEcCCeEEEEEecCC Q lcl|NC_019506. 256 --NVFG-----CKVIYPDALVCLKKTNP 276 (276) Q Consensus 256 --~~yg-----~~v~~~~~vv~~~~~~p 276 (276) .+|- .+..+.=|+.+|..+|| T Consensus 397 kK~rF~~~~~~~~~~~DfGvi~idtaa~ 424 (430) T protein:vir:10 397 SKIRFAVEATNGLEYTDHGVMAIDTAVK 424 (430) T ss_pred ceeeecCCCCCCceeeeeEEEEhhhhhh Confidence 2222 12235668888999998 No 182 >protein:vir:3969 Length: 287 # NCBI annotation: major capsid protein # Family: family:all:3269 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:NP_663677;genbank:gi:21716114;genbank:GeneID:951200 Probab=98.10 E-value=1.2e-06 Score=53.13 Aligned_cols=268 Identities=17% Similarity=0.161 Sum_probs=152.5 Q ss_pred CccchhhHHHHHHHHHHHHHHhhcchhhhccccccccc-cCCcEEEEeccCcc--cceeec--CCCCCCCc---cc-ccc Q lcl|NC_019506. 1 MAVTSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIK-AYGDTVKINQIGAI--TVKEYT--ENSDIDAP---EE-LST 71 (276) Q Consensus 1 MA~~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~-~~Gdtv~ip~~~~~--~~~d~~--~~~~~~~~---~~-~~~ 71 (276) ||...+ .++|+..+.+.|.+...|.+.+--..+..-+ ...+|.---+.... .+..|. ++.++... +. --. T Consensus 1 ~avr~y-~Kq~~glL~~vf~~qa~F~~~FGg~lQ~~DGV~~N~taf~vKtsD~pVVi~~Y~Td~Nv~FGtGTg~ssRFG~ 79 (287) T protein:vir:39 1 MAIKYF-TKQYAGMLPDLFAKKSAFLRAFGGVLQVKDGVTENDTFMELKVSDTDVVIQAYSTDANVGFGSGTGNTSRFGQ 79 (287) T ss_pred CCcccc-cHHHHHHHHHHHHHHHhhhhhcccceeeecCCcccceEEEEEecCcceEEecccCCCCcccccCCCccccccc Confidence 998754 6789999999999999998876321221111 12233211111111 122332 22222110 00 000 Q ss_pred ceEEEEEEee-e-ecceeec-hHHHHhhh---hhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccCCHHHHHH Q lcl|NC_019506. 72 TEKVLEINKQ-K-YFNFQID-DVDAAQIR---TPLMDAAMQRAAYALADETEKILLKEMDTNATSKLKPAATLDKTNIYE 145 (276) Q Consensus 72 ~~~~~~ld~~-~-~~~~~v~-d~d~~~~~---~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~~~~~~~~t~~~~~~ 145 (276) -+.-+-.|+. . .+++.|. -.|...-+ ...+++.++.++.+-++.+|..+-..+...+....+. ..+.+.+.. T Consensus 80 rkEi~y~dt~V~Y~~~~~ihEGiD~~TVNnd~~aaVAdRL~Lqa~A~t~~~n~~~Gk~ls~~A~~t~~~--~~t~d~V~~ 157 (287) T protein:vir:39 80 RKEVKSVNKQVSYDAPLAINEGIDDFTVNDIKDQVVAERLALHGVAWAQHVDKLLGKLLSDSASETLTV--KLDEDSVTK 157 (287) T ss_pred eeEEEEecccccceeccccccccccccccCChhHHHHHHHHhHHHHHHHHHHHHHHHHHHhhcchheee--eecccchHH Confidence 0001111111 0 1222221 23433333 3446667788888888999887766666554432222 367777778 Q ss_pred HHHHHHHHHhhcCCCcc-CCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEeccccc-cccceEEE Q lcl|NC_019506. 146 ELIKVKVKLDEKNVPTI-GRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSNNMGS-LTNGTGAI 223 (276) Q Consensus 146 ~i~~a~~~l~~~~vP~~-~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp~-~~~~~~~~ 223 (276) .|.++.+.+-.+++... .-...|+|+.|..|...+..+.+..... . +-+--|.++-||.+.+ +|. .-..+... T Consensus 158 LF~~a~~~yvNn~v~~~~~~~AyV~aevYnaiiD~~l~TsaK~Ssa-N-iDen~i~kFkGf~l~e---~P~~~~q~g~~a 232 (287) T protein:vir:39 158 LFSDAHKKFVNNNVSIAVPWVAYVNADIYDLLIDSKLATTAKNSSA-N-VDEQTLYKFKGFILSE---LPDEKFQLNEGA 232 (287) T ss_pred HHHHHHHHhhccceeeEEEEEEEEChhHHhHHhcccccccccccee-e-eccCCcceecceEEEe---cchHhhccCcEE Confidence 88899999988777433 3468999999999998875554433221 1 3333456899999988 552 11122223 Q ss_pred EEecceEEee-eeeeeeeeccCcccceeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 224 AGVKMACTFA-EQIVQTEAYRMEKRFADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 224 ~~~~~a~~~~-~~~~~~e~~~~~~~~~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) .+.+..++.+ -.+........++.-|..+.|--.||-.+++....++++.+.| T Consensus 233 ~fs~dnig~af~GI~vaR~i~sEdF~GvalQgAgK~G~~i~e~Nk~Ai~k~t~~ 286 (287) T protein:vir:39 233 YFAADNVGVAGVGIQVTRAMDSEDFAGTALQAAAKYGKYLPEKNKKAILKATVT 286 (287) T ss_pred EEccccceeecccceeEEeeecccccceeeecccccccccccccceEEEEEecC Confidence 3333334333 1223333455567778889999999999999999999999999 No 183 >protein:vir:97255 Length: 310 # NCBI annotation: hypothetical protein ORF017 # Family: family:all:1120 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294525;genbank:gi:149408246;genbank:GeneID:5237120 Probab=98.04 E-value=6.1e-06 Score=49.16 Aligned_cols=259 Identities=12% Similarity=0.057 Sum_probs=127.2 Q ss_pred Cc-cch-----hhHHHHHHHHHHHHHHhhcchhh-hccccccccccCCcEEEEecc---CcccceeecCC-C-CCCCccc Q lcl|NC_019506. 1 MA-VTS-----FIPKLWSARLLAHLDKAHVVANL-VNRDYEGEIKAYGDTVKINQI---GAITVKEYTEN-S-DIDAPEE 68 (276) Q Consensus 1 MA-~~~-----l~~e~~~~~~~~~l~~~~v~~~~-~~~~~~~~~~~~Gdtv~ip~~---~~~~~~d~~~~-~-~~~~~~~ 68 (276) |+ .++ +.+......+++.|.+.+-+... -..+. .|.+.+..+. +..+..++... + ....... T Consensus 1 mpaltLaea~k~~~d~l~~~ViE~~~~~s~lL~~LpF~~v------eg~~~~ynR~~~~~~~~~~~v~~~~~~~g~~~~~ 74 (310) T protein:vir:97 1 MASVTLAESAKLAQDELVAGVIENIITVNRMFDVLPFDSI------EGNSLAYNRENVLGDVIMAGVGTTFSGAGAGKAA 74 (310) T ss_pred CcccchHHHhhcCcchHHHHHHHHHhccchHHHhCCcccc------cCCcceeeEeeccCCcccccccccccCCCccccc Confidence 88 432 44566777788888654444332 22221 2445554433 22233322211 0 0001122 Q ss_pred cccceEEEEEEeeeecceeechH--HHH-hhhhhHHHHHHHHHHHHHHHHHHHHHHHH---------hhcc-cccc---- Q lcl|NC_019506. 69 LSTTEKVLEINKQKYFNFQIDDV--DAA-QIRTPLMDAAMQRAAYALADETEKILLKE---------MDTN-ATSK---- 131 (276) Q Consensus 69 ~~~~~~~~~ld~~~~~~~~v~d~--d~~-~~~~d~~~~~~~~~~~ala~~~d~~~~~~---------~~~~-~~~~---- 131 (276) .+...++..|.-. ...+.|+.. |.. ....|.+.+.+++..++|.++.+..++.. +... .... T Consensus 75 ~t~~~~~~~L~i~-~g~~~Vd~~i~dl~~~~~~dq~~~Ql~~~iea~~~~~e~~lINGD~a~n~F~GL~~~~~~~q~i~~ 153 (310) T protein:vir:97 75 ATFTKVNSNLTTI-MGDAEVNGLIQATRSGDGNDQTAVQIASKAKSAGRKYQDQLINGNGAGNEFAGLIQLCASGQKATT 153 (310) T ss_pred cccceeeeeeeee-eehhhhhhHHHhhhcCChHHHHHHHHHHHHHHHHHHHHHHhhccccCCCcccchhhcCCccceeec Confidence 3334444555322 333455542 222 22567888889999999999999887651 1111 1111 Q ss_pred ccccccCCHHHHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhh-hhcccccccceeeee-eeEEeceEEEE Q lcl|NC_019506. 132 LKPAATLDKTNIYEELIKVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIV-GTGGAMAESITKNGF-VGTILGFDVYL 209 (276) Q Consensus 132 ~~~~~~~t~~~~~~~i~~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~-~~~~~~~~~~~~~G~-i~~~~G~~v~~ 209 (276) ++.+..+|. +.+.++....-+.+ -+..++++||.++..+..--+-. ....+-.. ...-|+ +-.|.|++|+. T Consensus 154 ~~~gg~~t~----d~LDeLl~~v~~~~--g~p~~~l~~~~~~r~i~A~~R~~~~~g~~~~~-~~~~G~~v~~~~GiPi~~ 226 (310) T protein:vir:97 154 GATGSAISF----AILDELMDLVVDKD--GQVDYLTMHARTLRSYKALLRALGGASINEVV-ELPSGAEVPAYSGTPIFR 226 (310) T ss_pred CCCCCCCCH----HHHHHHHHHHhcCC--CCCCEEEecHHHHHHHHHHHHHhcCCCCCCcc-ccCCCCEEeeeCCeEEEE Confidence 112233343 33333322221111 23458999998766654432211 11221111 223444 56899999999 Q ss_pred eccccccc------cceEEEEEecc-------eEEeee---eeeeeeec---cCcccceeeEEeeeeeeeEEEcCCeEEE Q lcl|NC_019506. 210 SNNMGSLT------NGTGAIAGVKM-------ACTFAE---QIVQTEAY---RMEKRFADAVKGLNVFGCKVIYPDALVC 270 (276) Q Consensus 210 s~~lp~~~------~~~~~~~~~~~-------a~~~~~---~~~~~e~~---~~~~~~~~~i~~~~~yg~~v~~~~~vv~ 270 (276) ++.+|... +.+..++..-+ -.|+-. ....++.. .+...+.+ ...+++|+.++.|+++++ T Consensus 227 ~d~ip~~~~~~~~~gtTsIya~r~Ge~~~~~Gv~Gl~~~~~~glsVr~~G~~~~~~v~~~--~V~~Y~~~av~~~~A~a~ 304 (310) T protein:vir:97 227 NDYIPTNQTKGGTTGCTTIFAGTLDDGSRTHGIAGLTATQAAGIQVVDVGESEDSDEHIW--RVKWYCGLALFSEKGLAC 304 (310) T ss_pred eCccCCCccccccCCceeEEEEeeCccccccceeccccCCccceeEEeCCcccCCcceeE--EEEEeeeEEEecccceee Confidence 99999742 22223332211 122210 11122222 22233333 346789999999999999 Q ss_pred EE-ecC Q lcl|NC_019506. 271 LK-KTN 275 (276) Q Consensus 271 ~~-~~~ 275 (276) |+ +++ T Consensus 305 L~~V~~ 310 (310) T protein:vir:97 305 ADGITN 310 (310) T ss_pred eccccC Confidence 98 666 No 184 >protein:vir:97397 Length: 517 # NCBI annotation: major capsid protein # Family: family:all:11745 # MgeID: mge:1675 # MgeName: Q54 # Cross-refs: genbank:acc:YP_762590;genbank:gi:115304291;genbank:GeneID:5130600 Probab=97.93 E-value=5.9e-06 Score=49.24 Aligned_cols=255 Identities=13% Similarity=0.040 Sum_probs=124.8 Q ss_pred Cc--------cchhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCc-ccceeecCCCCCCCcccccc Q lcl|NC_019506. 1 MA--------VTSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGA-ITVKEYTENSDIDAPEELST 71 (276) Q Consensus 1 MA--------~~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~-~~~~d~~~~~~~~~~~~~~~ 71 (276) +. -.++.|..+...+...+.....+...+... ......+|.... ..+..+.++..... .+++. T Consensus 237 ~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~i~~~~~~~-------~i~~~~~~~~~~~~~a~~~~eG~~kp~-s~~tf 308 (517) T protein:vir:97 237 WTAELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHE-------NLPTLVVGGDNALTQGTGHTTGTDKTE-SNITL 308 (517) T ss_pred eeeecccccccccccchHHHHHHHHhhhhhccceeeeeec-------cccceeeecccccceeeeeecCCcccc-cccce Confidence 11 012345556666665555544444433211 112333432222 22333444443322 45666 Q ss_pred ceEEEEEEeeeecceeechHHHHhhhhh----HHHHHHHHHHHHHHHHHHHHHHHHhhcccccc-------cccc-ccCC Q lcl|NC_019506. 72 TEKVLEINKQKYFNFQIDDVDAAQIRTP----LMDAAMQRAAYALADETEKILLKEMDTNATSK-------LKPA-ATLD 139 (276) Q Consensus 72 ~~~~~~ld~~~~~~~~v~d~d~~~~~~d----~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~-------~~~~-~~~t 139 (276) ..+++.+.+. +.-+.++......+..| +.+-+..+..+.|+++.++.++..=..+.... .... +... T Consensus 309 ~~~~~~~~~i-a~~~~~S~qll~Ds~~dd~~~l~s~i~~~l~~~l~~~ee~a~l~GdGtg~~~~gi~~~a~~~~~~~~~~ 387 (517) T protein:vir:97 309 QTRVLTPQYV-YKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTGVSETQIYPVVGDAWATNVTG 387 (517) T ss_pred eeEEeeHhhh-hhhhhhhHHHHHHhhhccHHHHHHHHHHHHHHHHHHHHHHHHhcccCCCcccccccccccccccccccc Confidence 6677766443 44456776655555444 55566778899999999988875322221110 0000 1111 Q ss_pred HHHHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEeccccccccc Q lcl|NC_019506. 140 KTNIYEELIKVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSNNMGSLTNG 219 (276) Q Consensus 140 ~~~~~~~i~~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp~~~~~ 219 (276) .....+.+......+.+ ..+-.+||||..+..|.+... ....+.++.....+....++|+.-.+.. ++ .+ T Consensus 388 ~~~~~d~i~~l~~a~~~----a~~a~~vmn~~t~~~I~klKD--~~G~Yl~~~~~~~~~~~~l~G~~~~~~~-~~---~~ 457 (517) T protein:vir:97 388 TTNIQELLEKLSVATPK----AADSTLVIHRNDLAAIRFLKD--KNGNYVFPVGVSNQTIATHFGFNRLVQS-VA---VD 457 (517) T ss_pred cchHHHHHHHHHHHhhh----ccCCEEEECHHHHHHHHHhhc--CCCCeeccCcCCcccccccCCccccccc-cc---cC Confidence 11222222222222222 223468899999999977543 2334555666666777778885433221 22 12 Q ss_pred eEEEEEecceEEeeeeeeeeeeccCc--ccceeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 220 TGAIAGVKMACTFAEQIVQTEAYRME--KRFADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 220 ~~~~~~~~~a~~~~~~~~~~e~~~~~--~~~~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) .....+..+ +..+.+.. ++..++- ..-.+.+....+.|..|+.|+..+....+-| T Consensus 458 ~~~~~~~~~-y~i~~~~g-~~~~~~fd~~~n~~~f~~~~~~~g~i~~~~r~a~~~~~p~ 514 (517) T protein:vir:97 458 EKTAVSLSG-YVTNGSRG-MEFEQGTILVENNKEYLFEMPISGSLEYKGTTAYGTYTPP 514 (517) T ss_pred ceeEeeccc-cEEEeecc-eeeeeeeecccCceeEeeeeeeccccccccceEEEEEcCC Confidence 222222221 22221111 1111111 1123445666778888999999999888888 No 185 >protein:vir:96792 Length: 315 # NCBI annotation: major capsid protein # Family: family:all:47 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224246;genbank:gi:62362381;genbank:GeneID:3345731 Probab=97.83 E-value=1.6e-05 Score=46.87 Aligned_cols=260 Identities=14% Similarity=0.066 Sum_probs=116.4 Q ss_pred Cccc------hhhHHHHHHHHHHHHHHhhcchhhhccc--cccccccCCcEEEEeccC---cccceeecCCCCCCCcccc Q lcl|NC_019506. 1 MAVT------SFIPKLWSARLLAHLDKAHVVANLVNRD--YEGEIKAYGDTVKINQIG---AITVKEYTENSDIDAPEEL 69 (276) Q Consensus 1 MA~~------~l~~e~~~~~~~~~l~~~~v~~~~~~~~--~~~~~~~~Gdtv~ip~~~---~~~~~d~~~~~~~~~~~~~ 69 (276) ||.+ .|.+.+....+...-+.-.+|-....-- +..+ ...||=...+.+. ...-+++...+.... ..+ T Consensus 1 ~~~t~~sdl~vfn~~~~~a~~e~~~~~~~~Fnaas~Gai~l~~~-~~~GDf~~~~ff~i~~~~~~rnv~~~~~~t~-~ki 78 (315) T protein:vir:96 1 MATTVNSDLVIYNDTAQTAYLERNMDNLAVFNENSRAAIGLNSE-LIEGDLKLRSFYKVGGAIADRDVNSTATVAG-TKI 78 (315) T ss_pred CceeeecceeeehhhhhhhHHhhhHHHHHHhhhhcCCccccccc-ccccccccccccccccchhhcccCCCccccc-eec Confidence 8864 2455555554433223333333322100 0000 1135544444332 222334433333322 333 Q ss_pred cc-ceEEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHH----HHHhhccccccccc-cccCCHHHH Q lcl|NC_019506. 70 ST-TEKVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKIL----LKEMDTNATSKLKP-AATLDKTNI 143 (276) Q Consensus 70 ~~-~~~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~----~~~~~~~~~~~~~~-~~~~t~~~~ 143 (276) +. ..+.+.+ -+.+.++..+.......-.| ..++....+..++...-+.+ ++.+.+.-...... .+..+.... T Consensus 79 t~~~dvaVk~-~~~~~~~~~~~~~~a~~g~d-p~~~~~~i~~~~~~~~l~~~l~~~l~~~~aai~~~t~~~~~~~~a~~~ 156 (315) T protein:vir:96 79 AADEMVSVKV-PWKYGPYETTEEAFKRRARS-PEEFSMLIGQDMADATMAGWIGYALNALQGAIGSNAGMNVSGELATEG 156 (315) T ss_pred ccccceeEEE-eecCCchhccHHHHHHhhcC-HHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhcccccccccccccccC Confidence 32 3344444 33444455555544433223 23333333333333333333 33332221111100 001111122 Q ss_pred HHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEeccccccccceEEE Q lcl|NC_019506. 144 YEELIKVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSNNMGSLTNGTGAI 223 (276) Q Consensus 144 ~~~i~~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp~~~~~~~~~ 223 (276) .+.|.+|..+|.++.- .=-.++||+..|..|.+ ..+.+.-+..++...+.+..+.+ |-.|+++..+|+. ..+ T Consensus 157 ~~~l~dA~~klGD~~~--~l~~~vMHS~v~~~L~~-q~L~~~~~~~~~~~~~~~~~~~l-GkrViVdD~~P~~----~~~ 228 (315) T protein:vir:96 157 KKVLTKGLRTMGDKAS--SIAIWVMDSTSYFDIVD-EAIDNKLYEEAGVVVYGGTPGTL-GKPVLVTDQCPAT----KIF 228 (315) T ss_pred HHHHHHHHHHhccccc--CeeEEEEchHHHHHHHH-hhhhhhcccccceeEecCcCccc-ccEEEEECCCCcc----eee Confidence 4678889999976641 11247899999999998 45554433333334444445544 9999999999963 456 Q ss_pred EEecceEEeeeeee----eeeeccCcccceeeEEeeeeeeeEEEcCCeEEEE--EecCC Q lcl|NC_019506. 224 AGVKMACTFAEQIV----QTEAYRMEKRFADAVKGLNVFGCKVIYPDALVCL--KKTNP 276 (276) Q Consensus 224 ~~~~~a~~~~~~~~----~~e~~~~~~~~~~~i~~~~~yg~~v~~~~~vv~~--~~~~p 276 (276) .+..+|+++...-. ..+..+. -+++.....-+..++.|.|+-=- ...+| T Consensus 229 gl~~GAi~~~~~~~~~~~~~~~~g~----e~l~~~~r~e~tf~l~p~G~sw~~~~~~sP 283 (315) T protein:vir:96 229 GLVAGAVMITESQAPGMRSYQIDDQ----ENLAIGFRAEGTANVEVLGYKWKTKTNVNP 283 (315) T ss_pred eeecceeeecCCCccccccccCCCc----ceeEEEEeeeeEeeeeeeeEEeecCCCcCC Confidence 67788887764221 1122211 12222222222233444443331 22467 No 186 >protein:vir:8324 Length: 410 # NCBI annotation: gp41 # Family: family:all:30827 # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817892;genbank:gi:29566325;genbank:GeneID:1259520 Probab=97.72 E-value=5.5e-06 Score=49.41 Aligned_cols=255 Identities=15% Similarity=0.131 Sum_probs=147.5 Q ss_pred Cc---------cc--hhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCc-ccceee-------cCCC Q lcl|NC_019506. 1 MA---------VT--SFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGA-ITVKEY-------TENS 61 (276) Q Consensus 1 MA---------~~--~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~-~~~~d~-------~~~~ 61 (276) |+ +. .+.++ |-+-+++.+++.-...+++.+ .| ..|.|+.-|.... .++..+ ++|. T Consensus 127 ~r~a~~~~~Tgd~~~~i~~~-~v~d~i~li~q~r~i~slf~t----LP-~~g~T~eY~v~t~~~tV~~q~~~~kqa~EGd 200 (410) T protein:vir:83 127 YARAADHQKTGDLQGVIPDP-IVGPVIDFIDSARPLVSTLGT----LP-LNNATFYRPIVSQRPAVGLQGVAGGASDEKT 200 (410) T ss_pred HHHhhccCcccccccccchh-HhhhHHHHHhhccchhhhhhh----CC-CCCCeeEEeeecccccccccccccccccccc Confidence 11 11 24444 877788888888878887754 23 3488998876533 233222 2555 Q ss_pred CCCCccccccceEEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccCCHH Q lcl|NC_019506. 62 DIDAPEELSTTEKVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSKLKPAATLDKT 141 (276) Q Consensus 62 ~~~~~~~~~~~~~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~~~~~~~~t~~ 141 (276) .+.. ..+...+.+..|+.+-+ ...++......+.....+-.++-+..+-|+..++..-+.+...... ..+...+|+. T Consensus 201 ~L~~-gKl~~~t~tA~ikTyGG-yt~LSRQ~IERs~v~~L~~~lraL~~AYA~atea~vra~L~~t~t~-~~a~~~~Tad 277 (410) T protein:vir:83 201 ELDS-QKMVIDRLTVNAKTLGG-YVNVSRQAIDFSSPSALDLVVNGLGQQYAIETEALVGAALASTSTG-AVGYGNATAD 277 (410) T ss_pred cccc-cceeeeeccceeehhcC-cccccceeeecCChhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh-hhhhhhccHH Confidence 5544 67888888999988744 4556766666666666677777666666766666544444332221 1234455888 Q ss_pred HHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhh---cccc-cccceeeeeeeEEeceEEEEeccccccc Q lcl|NC_019506. 142 NIYEELIKVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGT---GGAM-AESITKNGFVGTILGFDVYLSNNMGSLT 217 (276) Q Consensus 142 ~~~~~i~~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~---~~~~-~~~~~~~G~i~~~~G~~v~~s~~lp~~~ 217 (276) ++...+.++....+.+.--..-+++.|+|+.+..+.....-.+. +..| +-.-+-.|.-|+++|++|++...++ T Consensus 278 ~~~~~i~da~~~v~da~~~~~~~~i~vS~DVl~~~~~~f~~~~~~~~dt~Gfg~~~lg~gi~G~~~~ipVvm~~~a~--- 354 (410) T protein:vir:83 278 NVASAIWQAAGAVYTAVKGMGRLVIAIAPDVLGDFGPLFAPVNPTNAHSTGFEAGRFGQGVMGSISGIPVVMSAALG--- 354 (410) T ss_pred HHHHHHHHHHHHHhhhhccceeeeEEechhhhhhccceeeccCCCCcccccccccccccchhhhhcccceEEecCCC--- Confidence 88888888988888862112335899999997666543211111 1111 1112336767899999999966664 Q ss_pred cceEEEEEecceEEee-eee--eeeee---ccCcccceeeEEeeeeeeeEEEcCCeEEEEEec Q lcl|NC_019506. 218 NGTGAIAGVKMACTFA-EQI--VQTEA---YRMEKRFADAVKGLNVFGCKVIYPDALVCLKKT 274 (276) Q Consensus 218 ~~~~~~~~~~~a~~~~-~~~--~~~e~---~~~~~~~~~~i~~~~~yg~~v~~~~~vv~~~~~ 274 (276) +++.+| +.+.|+.+- ... .++.. ......|+ -+|+..+..|+|++=+.-. T Consensus 355 AgTA~f-~~~~Ai~~~eS~~gp~qL~d~~i~nLt~~yS------gY~a~a~~~~~gliPv~g~ 410 (410) T protein:vir:83 355 SGDAYL-FSTAAIECFEQRVGTLQVVEPSVFGLQVAYA------GYFSTLVVNEDAIVPLVGS 410 (410) T ss_pred cCeeeE-eccceeeeeecCCceeEeeCCchhhhhhhhe------eeeeeccccccceeeeccC Confidence 444444 355555432 221 11111 12223333 3445556678888766444 No 187 >protein:vir:94933 Length: 330 # NCBI annotation: putative phage structural protein # Family: family:all:1120 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239278;genbank:gi:66392060;genbank:GeneID:5076578 Probab=97.70 E-value=2.7e-05 Score=45.66 Aligned_cols=260 Identities=12% Similarity=0.095 Sum_probs=126.6 Q ss_pred Cccch------hhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccC---cccceeecCCCCCCCcccccc Q lcl|NC_019506. 1 MAVTS------FIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIG---AITVKEYTENSDIDAPEELST 71 (276) Q Consensus 1 MA~~~------l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~---~~~~~d~~~~~~~~~~~~~~~ 71 (276) |+.-+ +.|......+++.|.+.+-+.... .|.. ..|.+.+.++.. ..+..+. +.++......+. T Consensus 25 m~alTLaea~~l~~d~~~~~VIE~l~~~s~iL~~l--pf~~---ve~~~~~~~r~~~lp~a~~r~~--n~~~~~~~~~Tf 97 (330) T protein:vir:94 25 MPTVTLAESAKLSQDHLVSGLIETIVEVNPLYEMM--PFTE---IEGNALAYNRENVLGDVQFLAV--GGTITAKNPATF 97 (330) T ss_pred hhhhhhhHHhhcCchhhHHHHHHhhhccchHHhhc--cccc---ccCCcceeeeeecCCcceeeec--cccccccCccee Confidence 55211 345666777888886655444432 1111 123444444433 3333443 333221111122 Q ss_pred ceEEEEEEeeeecceeechHH--HHhhhhhHHHHHHHHHHHHHHHHHHHHHHHH---------hhcc-cccc----cccc Q lcl|NC_019506. 72 TEKVLEINKQKYFNFQIDDVD--AAQIRTPLMDAAMQRAAYALADETEKILLKE---------MDTN-ATSK----LKPA 135 (276) Q Consensus 72 ~~~~~~ld~~~~~~~~v~d~d--~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~---------~~~~-~~~~----~~~~ 135 (276) ..++..+.-. .-.+.|+..- ......|.+.+..+...++|+++....++.. +... ...+ +..+ T Consensus 98 ~q~t~~l~~l-~~~~~Vd~~iadl~g~~~d~~~~q~~~~ieal~~~~e~~linGDs~~~~F~GL~~~~~~~q~i~tg~~g 176 (330) T protein:vir:94 98 TKVTSELTTL-IGDAEVNGLIQATRSDFMDQTSVQVASKAKSIGRQYQASMITGDGTGNSFQGMMGLVAASQTISAGANG 176 (330) T ss_pred eeeeechhhh-hhhHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccchhhcCCcccEEecCCCC Confidence 2333333221 1223444332 2234568889999999999999999887752 1110 1011 1122 Q ss_pred ccCCHHHHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhccccc--ccceeeee-eeEEeceEEEEecc Q lcl|NC_019506. 136 ATLDKTNIYEELIKVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMA--ESITKNGF-VGTILGFDVYLSNN 212 (276) Q Consensus 136 ~~~t~~~~~~~i~~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~--~~~~~~G~-i~~~~G~~v~~s~~ 212 (276) .+.|.++ ++.+..+.. +. +-+.-+++++..+...+..-.+- ...++- .....-|+ |-.|.|++|+.++. T Consensus 177 g~~T~d~-LDeLl~~v~---~~--~g~~~~~l~n~a~~r~I~a~~R~--~~~~~v~~~~~~~~G~~v~~~~GvPi~~~d~ 248 (330) T protein:vir:94 177 GTLTFEL-LDQLLDLVK---DK--DGQVDYLMSSFAMRRKYFSLLRA--LGGAAIGEVMTLPSGRQIPTYRGVPWFVNDF 248 (330) T ss_pred CCCCHHH-HHHHHHHhc---CC--CCCCcEEEechhHHHHHHHHHHh--ccCCCCCCcccccCCCEEeeeCCeEEEeccc Confidence 3344322 333333321 11 11234788888877777554321 112221 11223465 46799999999999 Q ss_pred cccccc-----ceE-EEEEec-------ceEEeeee---eeeeeecc-CcccceeeEEeeeeeeeEEEcCCeEEEEEecC Q lcl|NC_019506. 213 MGSLTN-----GTG-AIAGVK-------MACTFAEQ---IVQTEAYR-MEKRFADAVKGLNVFGCKVIYPDALVCLKKTN 275 (276) Q Consensus 213 lp~~~~-----~~~-~~~~~~-------~a~~~~~~---~~~~e~~~-~~~~~~~~i~~~~~yg~~v~~~~~vv~~~~~~ 275 (276) +|.... ++. .++..- +-.|+-.. ...++..- ....-..-....+++|+.++.|+++++|+.-. T Consensus 249 ip~~~~~~~~~~ttsIyav~~G~~~~~qgV~Gl~~~g~~glsVr~~G~~~~k~v~~~~v~~y~~~av~~~~a~~~L~~V~ 328 (330) T protein:vir:94 249 IPSNMTQGTATNATAIFAGTFDDGSNKYGIAGLTARGSAGLRVQNVGAKENADETITRVKMYCGFANFSQLGLAAIKGLI 328 (330) T ss_pred ccCCCCcccCCCceeEEEEeecccccccceEeecCCCCCcceeeeCCCccccceeeEEEEEeeeeEEechhheeeecccc Confidence 987421 222 222220 11222111 11222211 01111122345689999999999999999999 Q ss_pred C Q lcl|NC_019506. 276 P 276 (276) Q Consensus 276 p 276 (276) | T Consensus 329 ~ 329 (330) T protein:vir:94 329 P 329 (330) T ss_pred C Confidence 9 No 188 >protein:vir:79548 Length: 652 # NCBI annotation: putative protease/scaffold protein # Family: family:all:62 # ACLAME annotation(s): go:0008236 - serine-type peptidase activity; phi:0000017 - phage prohead/capsid assembly # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272518;genbank:gi:148609387;genbank:GeneID:5204384 Probab=97.60 E-value=4e-05 Score=44.71 Aligned_cols=264 Identities=9% Similarity=0.038 Sum_probs=144.9 Q ss_pred Cc--cch-hhHHHHHHHHHHHHHHhh-----cchhhhccccccccccCCcEEEEeccCcccceeecCCCCCCCccccccc Q lcl|NC_019506. 1 MA--VTS-FIPKLWSARLLAHLDKAH-----VVANLVNRDYEGEIKAYGDTVKINQIGAITVKEYTENSDIDAPEELSTT 72 (276) Q Consensus 1 MA--~~~-l~~e~~~~~~~~~l~~~~-----v~~~~~~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~ 72 (276) +| ++. =.|-++...+.+.|.+.- .|..++.+.--.+| +..+.+.+- +.+....+.+++.+.. ..+.+. T Consensus 359 ~A~~hsTsDFp~IL~~~~nk~l~~~y~~a~~t~~~~~~~~~~~DF-k~~~~~~lg--~~~~L~~V~E~gEyk~-~t~~e~ 434 (652) T protein:vir:79 359 AAFTHSTSDFGNILLDVANKAILQGWEDAPETYEQWTRKGQLSDF-KIAHRVGMG--GFSALRQVREGAEYKY-VTTGDK 434 (652) T ss_pred HHhhcCcchHHHHHHHHHHHHHHHHHhhhHHHHHHHhccCCCccc-cccceeecC--CCCCccccCCCCccce-eeecCc Confidence 22 111 124444444444433221 23333332111122 223444443 2334666678888865 567788 Q ss_pred eEEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc-c-------cc-cc-cCCHHH Q lcl|NC_019506. 73 EKVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSK-L-------KP-AA-TLDKTN 142 (276) Q Consensus 73 ~~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~-~-------~~-~~-~~t~~~ 142 (276) ..++.|.++ +.-|.||.....-.--+....+....+++-++.++..+++.+..++.-. . .. +. ..++.- T Consensus 435 ~e~~~l~ty-G~~~~iTRqaiINDDL~a~~~ip~~~g~aA~~~~~~~vy~~l~~Np~~~~DGk~LF~hA~H~Nl~~~aa~ 513 (652) T protein:vir:79 435 QATIALATY-GELFSITRQAIINDDLNMLTDVPMKLGRAAKSTIADLVYAILTSNPKISTDNVSLFDKAKHANVLESAAM 513 (652) T ss_pred cceeeeecc-cCeeeeehheeeccchhHHHHHHHHHHHHHHHHHHHHHHHHHhcCcccccCCceeecccccccccccccC Confidence 889999886 6678888877665555667777888888888899988888876654211 0 00 00 001111 Q ss_pred HHHHHHHHHHHHhhcCCC-----ccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEece-EEEEecccccc Q lcl|NC_019506. 143 IYEELIKVKVKLDEKNVP-----TIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGF-DVYLSNNMGSL 216 (276) Q Consensus 143 ~~~~i~~a~~~l~~~~vP-----~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~-~v~~s~~lp~~ 216 (276) ..+.|..++..|.+++=. -..++++++|+......+ ++.+....+ ...-.|.+--+.|+ +++.+.+|... T Consensus 514 ~~~~l~~ar~aM~~Qk~g~~~l~i~P~~llvp~~le~~a~~---ll~s~~v~~-a~~~~~~~Np~~~~~~~i~eprL~~~ 589 (652) T protein:vir:79 514 DVASLDKARQLMRVQKEGERHLNIRPAFVLVPTAMESVANQ---VIRSSSVKG-ADINAGIINPVKDFATVIAEPRLDDN 589 (652) T ss_pred CHHHHHHHHHHHHHhccCCccccccccEEEecchhHHHHHH---HhccCCCcc-cccccccccccccccccccccccCCC Confidence 245667777666555421 134689999987765533 332222111 11234555556675 77888888654 Q ss_pred ccceEEEEEecce--E--Eee--eeeeeeeeccCcccceeeEEeeeeeeeEEEcCCeEEEEEe Q lcl|NC_019506. 217 TNGTGAIAGVKMA--C--TFA--EQIVQTEAYRMEKRFADAVKGLNVFGCKVIYPDALVCLKK 273 (276) Q Consensus 217 ~~~~~~~~~~~~a--~--~~~--~~~~~~e~~~~~~~~~~~i~~~~~yg~~v~~~~~vv~~~~ 273 (276) ......++-.+.. + +|- .+...+|...+-+.-|-.++.++-||+++++.-|++..++ T Consensus 590 s~~~wylaa~~~~dtiev~yL~G~~~P~ie~~~gf~~dG~~~kvrlD~G~~~iD~RG~~k~t~ 652 (652) T protein:vir:79 590 SQTTFYLAASKGSDTIEVAYLNGVDTPYIDQMEGFSVDGVTTKVRIDAGVAPVDHRGLVKCTA 652 (652) T ss_pred CcccEEEecCCCCCeEEEEEecCCCCCeeeecCCCCcceEEEEEEEeccCceeeccceeeecC Confidence 4333333333321 2 222 1222333333333335677888889999999999887766 No 189 >protein:vir:98871 Length: 314 # NCBI annotation: major capsid protein # Family: family:all:3269 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164418;genbank:gi:56694908;genbank:GeneID:3197261 Probab=97.53 E-value=2.3e-05 Score=46.03 Aligned_cols=268 Identities=13% Similarity=0.076 Sum_probs=142.7 Q ss_pred Cccch----hhHHHHHHHHHHHHHHhhcchhhhccccccccc-cCCcEE-EEeccCccc-c-eeec--CCCCCCCc-cc- Q lcl|NC_019506. 1 MAVTS----FIPKLWSARLLAHLDKAHVVANLVNRDYEGEIK-AYGDTV-KINQIGAIT-V-KEYT--ENSDIDAP-EE- 68 (276) Q Consensus 1 MA~~~----l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~-~~Gdtv-~ip~~~~~~-~-~d~~--~~~~~~~~-~~- 68 (276) -+|+. .+.++|...+.+.|....+|.+.+--..+..-+ ...+|. .+..-..+- + +.|. +++++... .+ T Consensus 21 t~N~n~avr~Y~Kqf~glL~~vf~~qa~F~~~FGg~lQalDGV~~N~tafsvKtsD~pVVig~~Y~TdeNvaFGtGTg~S 100 (314) T protein:vir:98 21 TANQNKAARSYQKEFRQLLQAVFRSQAYFRDFFGGGIEALDGVQHNDTAFYVKTSDIPVVVGNEYNKDENVGFGEGTSRS 100 (314) T ss_pred cccCccceeeecHHHHHHHHHHHhhHhhhhhhcccceeeccCCCccceEEEEeecccceeecCcccCCCCcccccCCccc Confidence 22322 357889999999999999998876332221111 112221 111111110 1 1222 22222110 00 Q ss_pred cccceE-EE-EEEee-e-ecceeec-hHHHHhhhh---hHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccCCH Q lcl|NC_019506. 69 LSTTEK-VL-EINKQ-K-YFNFQID-DVDAAQIRT---PLMDAAMQRAAYALADETEKILLKEMDTNATSKLKPAATLDK 140 (276) Q Consensus 69 ~~~~~~-~~-~ld~~-~-~~~~~v~-d~d~~~~~~---d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~~~~~~~~t~ 140 (276) .-.+.. ++ -.|.. . .+++.|. -.|...-+. ..+++.++.++.+-++.+|..+-..+...+ .....-+..+. T Consensus 101 sRFGprkEi~y~dtdVpY~~~~~iHEGiD~~TVNnd~~aaVAdRL~LQA~Akt~~~n~~~Gk~lS~~A-s~te~ltd~~~ 179 (314) T protein:vir:98 101 TRFGPRREIIYQDTPVPYTWEWVYHEGIDKHTVNNDFQAAVADRLDLQANAKIKQFNAQHSKFISSIA-EKTETLTDYSA 179 (314) T ss_pred cccCceeEEEeecccccccccchhhhccccccccCChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh-hhhhhhhhcch Confidence 000111 11 11110 0 1122221 234443333 445666777888888888876655543332 22233344566 Q ss_pred HHHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEeccccccc--c Q lcl|NC_019506. 141 TNIYEELIKVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSNNMGSLT--N 218 (276) Q Consensus 141 ~~~~~~i~~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp~~~--~ 218 (276) +.+...|..+.+.+-...| .......|+|+.|..|...+..+.+..... . +-+--|.++-||.+.+ +|..- . T Consensus 180 d~V~~LF~~as~~yvn~ev-~~~~~AyV~~evYnaiiD~~l~TsaK~Ssa-N-IDengi~~FkGf~i~e---~P~~~~q~ 253 (314) T protein:vir:98 180 DNVLRLFNELSKYYVNIEA-IGTKAAKVSPELYNAIVDHPLTTSAKSSSA-N-IDQNGIVNFKGFAIQE---IPESMLQS 253 (314) T ss_pred hhHHHHHHHHHhhhhccee-eEEEEEEEchhHHhHhhcccccccccccee-e-eccCCcceecceEEEe---cchhhcCC Confidence 6677778888888877776 334578999999999998875554433221 1 3333456899999988 55322 2 Q ss_pred ceEEEEEecceEEee-eeeeeeeeccCcccceeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 219 GTGAIAGVKMACTFA-EQIVQTEAYRMEKRFADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 219 ~~~~~~~~~~a~~~~-~~~~~~e~~~~~~~~~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) +..+++. ...++-+ -.+........++.-|-.+.|--.||-.+++....++++.|+- T Consensus 254 g~ia~~s-~dnig~aftGIn~aR~IesEdF~GValQgAGK~G~~I~edNk~Ai~k~t~t 311 (314) T protein:vir:98 254 GDVAYTY-ITNIGKAFTGINTSRIIESEDFDGVALQGAGKAGEFILDDNKKAVAKVTST 311 (314) T ss_pred CcEEEEc-cccceeecccceeeeeeecccccceeeecccccccccccccceeeEEEecC Confidence 2222222 1333332 1222233445566667888999999999999999999997765 No 190 >protein:vir:94528 Length: 286 # NCBI annotation: major head protein # Family: family:all:3269 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223889;genbank:gi:62327101;genbank:GeneID:5075544 Probab=97.52 E-value=3.2e-05 Score=45.25 Aligned_cols=252 Identities=12% Similarity=0.074 Sum_probs=138.2 Q ss_pred Cccch------hhHHHHHHHHHHHHHHhhcchhhhccccccccc-cCCcEE------EEeccCcccceeec--CCCCCCC Q lcl|NC_019506. 1 MAVTS------FIPKLWSARLLAHLDKAHVVANLVNRDYEGEIK-AYGDTV------KINQIGAITVKEYT--ENSDIDA 65 (276) Q Consensus 1 MA~~~------l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~-~~Gdtv------~ip~~~~~~~~d~~--~~~~~~~ 65 (276) |+.+. .+.++|...+.+.|+...+|.+.+-- .+..-+ ...+|. .+|+. +..|. ++.++.. T Consensus 1 m~t~N~n~avr~Y~Kqf~glL~~vf~~qa~F~~~fgg-lQalDGV~~N~tafsvKt~D~pVV----ig~Y~TdeNv~FGt 75 (286) T protein:vir:94 1 MATTNNDLPVRVYSKEFLQLLSTVYQAQSVFTPTFGA-LQALDGVPNNATAFSVKTNDMAVV----VGEYSTDANTAFGT 75 (286) T ss_pred CCCCccccceeehhHHHHHHHHHHHhhHHHhhhhhcc-hhhhhCCCccceEEEEeecCcceE----EecccCCCcccccc Confidence 66321 45788999999999999998886532 221111 112221 12221 22232 1222211 Q ss_pred c---c-------c--cccceEEEEEEeeeecceeec-hHHHHhhhh---hHHHHHHHHHHHHHHHHHHHHHHHHhhcccc Q lcl|NC_019506. 66 P---E-------E--LSTTEKVLEINKQKYFNFQID-DVDAAQIRT---PLMDAAMQRAAYALADETEKILLKEMDTNAT 129 (276) Q Consensus 66 ~---~-------~--~~~~~~~~~ld~~~~~~~~v~-d~d~~~~~~---d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~ 129 (276) . + + ..++.++++ +++.|. -.|...-+. ..+++.++.++.+-++.+|..+-..+...+. T Consensus 76 gTg~SsRFG~rkEi~y~dtdV~Y~------~~~~iHEGiD~~TVNnd~~aaVAdRL~lQA~Akt~~~n~~~Gk~ls~~A~ 149 (286) T protein:vir:94 76 GTSNSSRFGEMKEVIYADTDVPYT------AGWAIHEGLDQMTVNNDLDAAVADRLNLQAQAKTRLFNVAMGEALATAGT 149 (286) T ss_pred CCccccccCceeeEEeeccccccc------ccchhhhccccccccCChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh Confidence 0 0 0 011112211 122221 234443333 4456667778888888888766555543332 Q ss_pred ccccccccCCHHHHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEE Q lcl|NC_019506. 130 SKLKPAATLDKTNIYEELIKVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYL 209 (276) Q Consensus 130 ~~~~~~~~~t~~~~~~~i~~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~ 209 (276) . +.+-+.+...|..+.+.+-...|- ...-..|+|+.|..|...+..+.+..... . +-+--|.++-||.+.+ T Consensus 150 ~------t~~~D~V~~LF~~as~~yvn~ev~-~~~~ayV~~evYnaiiD~~l~TsaK~Ssa-N-iDengi~~FkGf~i~e 220 (286) T protein:vir:94 150 D------LGAVDDVNALFESAVEKYTDLEVI-APVRAYVTASVYNAIIDLANVTTAKNSAV-N-IDTNGMLSFRGIAITK 220 (286) T ss_pred h------hhhhhhHHHHHHHHHHHhhhhhee-eeeEEEEchhHHHHHhcccccccccccee-e-eccCCcceecceEEee Confidence 2 111145556677777777777773 33348999999999998875554433221 1 3333456899999998 Q ss_pred eccccccccceEEEEEecceEEeee-eeeeeeeccCcccceeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 210 SNNMGSLTNGTGAIAGVKMACTFAE-QIVQTEAYRMEKRFADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 210 s~~lp~~~~~~~~~~~~~~a~~~~~-~~~~~e~~~~~~~~~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) . |..--.+....+.+..++-+- .+........++.-|-.+.|--.||-.+++....++++. .| T Consensus 221 ~---P~~~~~g~~aifs~dnig~aftGIn~aR~IesEdF~GValQgAGK~G~~I~edNk~Ai~~~-~~ 284 (286) T protein:vir:94 221 V---PTQYMGGKAVIFAPDNVARVFTGINIARTIQAIDFAGVELQGAGKYGTFILDDNKKAIFTA-TP 284 (286) T ss_pred c---chhhccCceEEEccccceeeeccceeeeeeeccccCceeeeccccccccccccCceeEEEe-ec Confidence 4 422222333444444444432 222233444556667788888999999999999888754 45 No 191 >protein:vir:107687 Length: 319 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003898;genbank:gi:45686314;genbank:GeneID:2773027 Probab=97.34 E-value=8.9e-05 Score=42.78 Aligned_cols=260 Identities=11% Similarity=0.090 Sum_probs=130.5 Q ss_pred Cccc-----hhhHHHHH---HHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCccc-ceeecCCCC-CCCccccc Q lcl|NC_019506. 1 MAVT-----SFIPKLWS---ARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGAIT-VKEYTENSD-IDAPEELS 70 (276) Q Consensus 1 MA~~-----~l~~e~~~---~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~~~-~~d~~~~~~-~~~~~~~~ 70 (276) |..+ .|+.+.|. .++.+...+.++...++.- ..+....-.++.++.....+ +..+..+.. +.. .+.. T Consensus 24 ~~da~~~~g~~~~~ql~~id~~v~e~~~~~l~~~~~i~v--~~~~~~~~~~~~~~~~~~~G~a~~~~d~~~dip~-v~~~ 100 (319) T protein:vir:10 24 KQDAAATMGIWTAQELHRIKSQSYEEDYPVGSALRVFPV--TTELSPTDKTFEYMTFDKVGTAQIIADYTDDLPL-VDAL 100 (319) T ss_pred hhhhhhhhhhHHHHHHHHHHHHHHhhhhcceechhhccc--ccCCCCceEEEEeeeeccccceeeecCccccccc-eecc Confidence 1111 35554444 3444445555556665532 11111223567777664433 444443222 211 2444 Q ss_pred cceEEEEEEeeeecceeechHHHHhh---hhhHHHHHHHHHHHHHHHHHHHHHHHHhh---------cccccc-----cc Q lcl|NC_019506. 71 TTEKVLEINKQKYFNFQIDDVDAAQI---RTPLMDAAMQRAAYALADETEKILLKEMD---------TNATSK-----LK 133 (276) Q Consensus 71 ~~~~~~~ld~~~~~~~~v~d~d~~~~---~~d~~~~~~~~~~~ala~~~d~~~~~~~~---------~~~~~~-----~~ 133 (276) .+.....+..+ +.++.++..|...+ -.++-.+....+++++++..|+.++-.-. ...... .. T Consensus 101 ~~~~~~~i~~~-~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i~f~G~~~~g~~GLlN~p~~~~~~~~~~~ 179 (319) T protein:vir:10 101 GTSEFGKVFRL-GNAYLISIDEIKAGQATGRPLSTRKASACQLAHDQLVNRLVFKGSAPHKIVSVFNHPNITKITSGKWI 179 (319) T ss_pred ceeeEEEEEEE-EeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEeecccccceeEEeCCCceeeecCCCC Confidence 55666666443 66677776665433 45777777888888999999986552211 100001 11 Q ss_pred ccccCCHHHHHHHHHHHHHHHhhc--CCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEec Q lcl|NC_019506. 134 PAATLDKTNIYEELIKVKVKLDEK--NVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSN 211 (276) Q Consensus 134 ~~~~~t~~~~~~~i~~a~~~l~~~--~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~ 211 (276) ...+.++++++++|..+...|... ++- ..-.++++|+.+..|..- . .+.+..-. +-+.+ +..+.+|.... T Consensus 180 ~~~t~t~~~i~~di~~~~~~l~~~s~g~~-~p~~L~L~p~~~~~L~~~-~-~~~~~t~l-~~lk~----~~~~l~I~~~p 251 (319) T protein:vir:10 180 DVSTMKPETAEAELTQAIETIETITRGQH-RATNILIPPSMRKVLAIR-M-PETTMSYL-DYFKS----QNSGIEIDSIA 251 (319) T ss_pred CccccCHHHHHHHHHHHHHHHHHhcCcee-eceEEEecHHHHHhhhcc-c-CCCCeeHH-HHHHH----hcCCceEEEee Confidence 122346778899999998888644 431 223699999999988431 1 11110000 11111 12355566655 Q ss_pred cccccccce--EEEEEe--cceEEe--eeeeeeeeeccCcccceeeEEeeeee-eeEEEcCCeEEEEEec Q lcl|NC_019506. 212 NMGSLTNGT--GAIAGV--KMACTF--AEQIVQTEAYRMEKRFADAVKGLNVF-GCKVIYPDALVCLKKT 274 (276) Q Consensus 212 ~lp~~~~~~--~~~~~~--~~a~~~--~~~~~~~e~~~~~~~~~~~i~~~~~y-g~~v~~~~~vv~~~~~ 274 (276) .+.....++ ..+.+. +.-+.+ +..+..... .++.....+.+..++ |+.+.+|.+++.++== T Consensus 252 el~~ag~~g~~~~v~y~~~~~~~~~~v~~~~~~~~~--e~~~l~~~~~~~~r~~Gv~i~~P~ai~~~dGI 319 (319) T protein:vir:10 252 ELEDIDGAGTKGVLVYEKNPMNMSIEIPEAFNMLPA--QPKDLHFKVPCTSKCTGLTIYRPMTIVLITGV 319 (319) T ss_pred eecccCCCcceEEEEEecCCceEEEecCcceeeeee--eecCceEEEeeeeeeEEEEEEccceeEeeecC Confidence 554433221 122222 222222 222222222 233445566666655 4777789999998744 No 192 >protein:vir:95512 Length: 693 # NCBI annotation: Putative Clp protease # Family: family:all:62 # ACLAME annotation(s): go:0008236 - serine-type peptidase activity; phi:0000017 - phage prohead/capsid assembly # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293349;genbank:gi:148912770;genbank:GeneID:5228164 Probab=97.34 E-value=9.1e-05 Score=42.74 Aligned_cols=258 Identities=13% Similarity=0.081 Sum_probs=141.2 Q ss_pred Ccc--ch-hhHHHHHHHH----HHHHHH-hhcchhhhcc-ccccccccCCcEEEEeccCcccceeecCCCCCCCcccccc Q lcl|NC_019506. 1 MAV--TS-FIPKLWSARL----LAHLDK-AHVVANLVNR-DYEGEIKAYGDTVKINQIGAITVKEYTENSDIDAPEELST 71 (276) Q Consensus 1 MA~--~~-l~~e~~~~~~----~~~l~~-~~v~~~~~~~-~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~ 71 (276) ||- +. =.|-++...+ ++.|+. ...|..++.+ .++ +| +..+.+.+-.+ +....+.+++.+.. ..+.+ T Consensus 394 ~a~~htTSDFp~IL~~~~nk~l~~~y~~a~~t~~~~~~~~~~~-DF-k~~~~~~lg~~--~~L~~V~E~gEyk~-~t~~e 468 (693) T protein:vir:95 394 LAFTHTSSDFGLILLDVANKSVLAGWEEAEETFPLWTKSGILT-DF-KPARRVGLGEF--SSLRQVREGAEYKY-VTLGE 468 (693) T ss_pred HHHhcCcchhHHHHHHHHHHHHHHHHHhhhhHHHHHhccCCCC-cc-cccceeecCCC--CChhhcCCCCceee-eecCC Confidence 222 11 1244444433 333321 1223333322 222 22 12344444222 24556677877754 56777 Q ss_pred ceEEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccc------------c-c-ccccc Q lcl|NC_019506. 72 TEKVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATS------------K-L-KPAAT 137 (276) Q Consensus 72 ~~~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~------------~-~-~~~~~ 137 (276) ...++.|.++ +.-|.||.....-.--+....+....+++-++.++..+++.+..++.- + . ++++. T Consensus 469 ~~e~~~l~ty-G~~~~iTRqaiINDDLga~~~ip~~~g~aA~~~~~~~vy~~L~~Np~m~DGk~LFhadH~Nl~tga~sa 547 (693) T protein:vir:95 469 RGEQIILATY-GELFSITRQAIINDDLQMLSDIPFKLGQAAKATIGDLVYAVLTGNPAMSDGKTLFHADHSNLLTGAASA 547 (693) T ss_pred ccceeehhhc-CCeeeecHHhhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHhcCccccCCcceeeccccccccccccc Confidence 8889999876 777899988877766677778888899999999999999888765321 1 1 11222 Q ss_pred CCHHHHHHHHHHHHHHHhhcCCC--------c--cCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEece-E Q lcl|NC_019506. 138 LDKTNIYEELIKVKVKLDEKNVP--------T--IGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGF-D 206 (276) Q Consensus 138 ~t~~~~~~~i~~a~~~l~~~~vP--------~--~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~-~ 206 (276) ++ .+.+.+++..|..++.. - ..+++++.|+......+ ++++...-+ .....|.+--+.|+ + T Consensus 548 ls----~~sl~~a~~am~~qk~~~~~~~g~~L~i~P~~llvP~~le~~a~~---l~~s~~~~~-a~~~~~~~NP~~~~~~ 619 (693) T protein:vir:95 548 LS----IDSLSKAKTQMATQKAQVEKGKGRTLNIRPGFVLTPVALEDKANQ---IINSESVPG-ADVNSGIVNPIRAFAQ 619 (693) T ss_pred cC----hHHHHHHHHHHHHhhcchhccCCceeecccceEEecchHHHHHHH---Hhccccccc-cccccccccchhcccc Confidence 33 45566666666554321 1 34688888887775543 333322111 11334555556675 6 Q ss_pred EEEeccccccccceEEEEEecc--eE--Eeee--eeeeeeeccCcccceeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 207 VYLSNNMGSLTNGTGAIAGVKM--AC--TFAE--QIVQTEAYRMEKRFADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 207 v~~s~~lp~~~~~~~~~~~~~~--a~--~~~~--~~~~~e~~~~~~~~~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) ++.+.+|...+.....++..+. .+ +|-. +...+|....-+.-|-.++.++-||+++++.-|++. +| T Consensus 620 vi~~prL~~~s~~~Wyl~a~~~~dtie~~yL~G~~~P~ie~~~gf~~dG~~~kvr~D~G~~~iD~Rg~~k----n~ 691 (693) T protein:vir:95 620 VIGEPRLDDASATAWYMAAKKGSDTIEVAYLDGVDTPYLEQQEGFTVDGVASKVRIDAGVAPLDFRGLQK----SN 691 (693) T ss_pred ccccceecCCCCCceEEecCCCCCeEEEEEecCCCCCeEeecCCCCcceEEEEEEEeccCceeecccccc----CC Confidence 7777888644433333332222 12 2221 222333333333335577788889999999888854 44 No 193 >protein:vir:103285 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277465;genbank:gi:71834107;genbank:GeneID:3562396 Probab=96.93 E-value=0.00025 Score=40.31 Aligned_cols=260 Identities=14% Similarity=0.092 Sum_probs=132.3 Q ss_pred Cccc------hhhHHHHH---HHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCcc-cceeecCCCC-CCCcccc Q lcl|NC_019506. 1 MAVT------SFIPKLWS---ARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGAI-TVKEYTENSD-IDAPEEL 69 (276) Q Consensus 1 MA~~------~l~~e~~~---~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~~-~~~d~~~~~~-~~~~~~~ 69 (276) |-.. .|..+.|. .++.+.....++...++.- ..+....-.++..++.... .+..+...+. ... .+. T Consensus 1 ~~~~~a~~~~~f~~~ql~~id~~v~e~~~~~l~~~~~i~v--~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip~-v~~ 77 (296) T protein:vir:10 1 MGVDKADAAGIWTVKQLTASLNKAYETEYDQNSVVNLFPV--SNEIPGYAKYFEYPVFDGVGIAQIVADYTDDLPL-VDA 77 (296) T ss_pred CcccchhhhHHHHHHHHHHHHHHHHhhhhcccccceeccc--ccCCCCceeEEEeeeeeccCceeEeCCCccccce-eec Confidence 5432 34454444 4444444455556655532 1111222357777766443 2444443322 211 244 Q ss_pred ccceEEEEEEeeeecceeechHHHHhh---hhhHHHHHHHHHHHHHHHHHHHHHHHHhhcc---------c-cccccccc Q lcl|NC_019506. 70 STTEKVLEINKQKYFNFQIDDVDAAQI---RTPLMDAAMQRAAYALADETEKILLKEMDTN---------A-TSKLKPAA 136 (276) Q Consensus 70 ~~~~~~~~ld~~~~~~~~v~d~d~~~~---~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~---------~-~~~~~~~~ 136 (276) ..++....+..+ +.++.++..|...+ -.++-.+....++.++++..|+.++-.-..- . ......++ T Consensus 78 ~~~~~~~~i~~~-~~~~~~~~~El~~a~~~g~~l~~~ka~aA~~~~~~~~n~~~f~G~~~~g~~GLlN~p~v~~~~~~~~ 156 (296) T protein:vir:10 78 LATERQGKVFRF-GNAFLISIDEIKVGQATGQSLSTRKQSLAFEAHDKLLDKLVWSGSTAHGIPSVFDYPNINNVVSGGS 156 (296) T ss_pred cceeEEEEEEEE-EeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEeecccccceeEeecCCCccccccCC Confidence 555666666443 66677776655432 3567777788888889999988665321110 0 01111222 Q ss_pred cCCHHHHHHHHHHHHHHHhhc--CCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEecccc Q lcl|NC_019506. 137 TLDKTNIYEELIKVKVKLDEK--NVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSNNMG 214 (276) Q Consensus 137 ~~t~~~~~~~i~~a~~~l~~~--~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp 214 (276) ..++.+++++|.++...+... ++- ..-.++++|..+..|..- .. ..+ .. +.+-.--+..+.+|.....+. T Consensus 157 W~~~t~i~~Di~~~~~~l~~~s~g~~-~p~~l~L~p~~~~~L~~~---~~--~~~-~t-~l~~ik~~~~~l~i~~~~~l~ 228 (296) T protein:vir:10 157 WSQPTTAVSDITSLLDIIETSTNGQH-RATHLLLPTTARRIMQNL---VP--GTS-VS-YGEFFRQNNSGVTVEFVQYLN 228 (296) T ss_pred ccCHHHHHHHHHHHHHHHHHhhCcee-cceeEEeCHHHHHHHhhc---cC--CCC-cc-HHHHHHHhcCCceEEEeeeec Confidence 345667889999888766543 431 223688999999888532 11 111 01 111000123455666655554 Q ss_pred ccccc--eEEEEEe--cceEEee--eeeeeeeeccCcccceeeEEeeeee-eeEEEcCCeEEEEE---ec Q lcl|NC_019506. 215 SLTNG--TGAIAGV--KMACTFA--EQIVQTEAYRMEKRFADAVKGLNVF-GCKVIYPDALVCLK---KT 274 (276) Q Consensus 215 ~~~~~--~~~~~~~--~~a~~~~--~~~~~~e~~~~~~~~~~~i~~~~~y-g~~v~~~~~vv~~~---~~ 274 (276) ....+ ...+.+. +.-+.++ ..+.... ..++.....+....+. |+.+.+|++++.++ -+ T Consensus 229 ~a~~~g~~~~v~~~~~~~~~~~~v~~~~~~~~--~e~~~l~~~~~~~~~~~Gv~i~~P~ai~~~dGI~~~ 296 (296) T protein:vir:10 229 DYNGTGTSAAIAYEKDPNNMAIEIPEATNALP--AQPKDLHFKIPVTSKATGLIVYRPLTMAVMKGITFA 296 (296) T ss_pred cCCCCcceEEEEEEcCCceEEEEcCcceeeec--ccccCceEEEeeEeeEEEEEEECCceeEEEeeeecC Confidence 33222 1223332 2222222 2222211 1234455677777776 58888999999974 34 No 194 >protein:vir:80068 Length: 301 # NCBI annotation: gp8 # Family: family:all:463 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468712;genbank:gi:157325292;genbank:GeneID:5601759 Probab=96.87 E-value=0.00029 Score=40.02 Aligned_cols=263 Identities=10% Similarity=0.039 Sum_probs=132.1 Q ss_pred Cccc---hhhHHH---HHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCccc-ceeecCCCCCCCccccccce Q lcl|NC_019506. 1 MAVT---SFIPKL---WSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGAIT-VKEYTENSDIDAPEELSTTE 73 (276) Q Consensus 1 MA~~---~l~~e~---~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~~~-~~d~~~~~~~~~~~~~~~~~ 73 (276) |=+. .|..+. |..++.+.+.+.+..+.++.- .........++.+++....+ +..+..+..-....+..-+. T Consensus 1 ~~~~~~g~f~~~~l~~id~~v~e~~~~~l~~r~l~~v--~~~~~~~~~~~~~~~~~~~G~~~~~~~~~~dip~~~~~~~~ 78 (301) T protein:vir:80 1 MQGKITATIEARDLQAIDNVIYEPKQEELTARSVFPQ--KFDVNEGAESYSFDVMTRSGAAKIIANGADDLPLVDVDMVR 78 (301) T ss_pred CCccccchhhHHHHHHHHHHHHHhhhhhhhhhhhccc--ccCCCCceEEEEEeeeccceeEEEecCccccccccccccee Confidence 6554 244444 455566666666666665422 11122233667777765544 33343322211113444456 Q ss_pred EEEEEEeeeecceeechHHHHh---hhhhHHHHHHHHHHHHHHHHHHHHHHHHhhc---------ccccc---------- Q lcl|NC_019506. 74 KVLEINKQKYFNFQIDDVDAAQ---IRTPLMDAAMQRAAYALADETEKILLKEMDT---------NATSK---------- 131 (276) Q Consensus 74 ~~~~ld~~~~~~~~v~d~d~~~---~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~---------~~~~~---------- 131 (276) ....+... +.++.++..|... ...++-.+....+++++++..|+.++-.-.. ..... T Consensus 79 ~~~~i~~~-~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~f~G~~~~g~~GLlN~p~~~~~~~~~~~~~~ 157 (301) T protein:vir:80 79 KSVPIYSI-GIGLSYTIQDLRAARMQGTTVDAAKATTVRRAIAEKENSIAFRGEKKYAIKGAFEATGIQIDVSPTTGVGN 157 (301) T ss_pred EEEEEEEE-EeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEeeecccccceeeecCCCcccccccCccccc Confidence 66666543 6667777665543 3457778888888899999999876532111 00000 Q ss_pred ccccccCCHHHHHHHHHHHHHHHhhc--CCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEE Q lcl|NC_019506. 132 LKPAATLDKTNIYEELIKVKVKLDEK--NVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYL 209 (276) Q Consensus 132 ~~~~~~~t~~~~~~~i~~a~~~l~~~--~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~ 209 (276) .......|++.++++|.++...+... ++ ...-.++++|+.|..|..- ..... . +... .+-.--+..+.+|+. T Consensus 158 ~~~w~~~t~~ei~~di~~~~~~l~~~s~g~-~~p~~L~L~p~~~~~L~~~-~~~~~--~-~~tv-l~~l~~~~~~~~I~~ 231 (301) T protein:vir:80 158 VSKWEKKTAEQIIDEIGEAHTKITVLPGYG-TASLKLCLPPKQFELINKK-RYSNE--D-SRSV-LKVLQDNAWFSAIVR 231 (301) T ss_pred ccccccCCHHHHHHHHHHHHHHHHHhcCce-ecccEEEecHHHHHhhhhc-cccCC--C-CeeH-HHHHHHHcCcceEEE Confidence 01112346778899999999888654 32 1123699999999988531 11111 1 1111 010001122345555 Q ss_pred ecccccccc--ceEEEEEecc----eEEeeeeeeeeeeccCcccceeeEEeeeee-eeEEEcCCeEEEEEec Q lcl|NC_019506. 210 SNNMGSLTN--GTGAIAGVKM----ACTFAEQIVQTEAYRMEKRFADAVKGLNVF-GCKVIYPDALVCLKKT 274 (276) Q Consensus 210 s~~lp~~~~--~~~~~~~~~~----a~~~~~~~~~~e~~~~~~~~~~~i~~~~~y-g~~v~~~~~vv~~~~~ 274 (276) .+.+..... ...++.+.++ .+.++..+.....+ ++.....+....++ |+.+.||++++.++== T Consensus 232 ~p~L~~~g~~g~~~~v~~~~~~d~~~~~v~~~~~~~~~e--~~~~~~~~~~~~r~~Gv~i~~P~ai~~~~GI 301 (301) T protein:vir:80 232 VPDLAGMGTAGSDSFAVIHDSNETAELIIPMDITRHPEE--YSFPRTKVPFEERTAGVVVRFPAAIVRVDGI 301 (301) T ss_pred cceeccCCCCcccEEEEEecCCcEEEEEecCceeeecce--ecCceeEeeeeeeeEEEEEEccceEEEEecC Confidence 444443221 1122233221 11222222211111 12223445555555 5677789999998744 No 195 >protein:vir:4074 Length: 480 # NCBI annotation: major capsid (head) protein # Family: family:all:11745 # MgeID: mge:85 # MgeName: c2 # Cross-refs: genbank:acc:NP_043553;genbank:gi:9628687;genbank:GeneID:1261180 Probab=96.65 E-value=9.6e-05 Score=42.62 Aligned_cols=260 Identities=11% Similarity=0.023 Sum_probs=105.4 Q ss_pred Cccchhh-HHHHHHH----HHHHHHHhhcchhhhccccccccccCCcEEEEec--------------cCcccceeecCCC Q lcl|NC_019506. 1 MAVTSFI-PKLWSAR----LLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQ--------------IGAITVKEYTENS 61 (276) Q Consensus 1 MA~~~l~-~e~~~~~----~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~--------------~~~~~~~d~~~~~ 61 (276) +....-. ...+... ..+..+....... .+......+ .....+.++. .+........... T Consensus 184 ~~~e~r~~~~~~~~~~e~~~~~~~~~~~~~~~-~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~e~~ 261 (480) T protein:vir:40 184 ERKFMRELGSKMAEMPEQGFLREFANGADLNV-VNSLGSITS-KYARKSGIYDGAMKARFQGLTLAEDGVDDTFISGTFK 261 (480) T ss_pred hhHHHHHHHHHhccchhhhhhhhhhhhccccc-ccccccccc-chhhheeechhhhhhhhhcceeeeccccceeeeeeee Confidence 1111000 0000000 0000111000000 000000000 0000000100 0100000000000 Q ss_pred CCCCccccccceE-EEEEEee-eecceeech--HHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccc------- Q lcl|NC_019506. 62 DIDAPEELSTTEK-VLEINKQ-KYFNFQIDD--VDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATS------- 130 (276) Q Consensus 62 ~~~~~~~~~~~~~-~~~ld~~-~~~~~~v~d--~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~------- 130 (276) . .....+.... ...+..+ .+.-..+.. .+.+....++.+-+..+.++.++++.++.++..-..+... T Consensus 262 ~--~~~~~~~~~~~~~~~~~~~v~~l~~~~k~t~~lLDDa~~l~~~i~~~l~~~~~~~ee~a~l~G~g~g~~~~~g~~~~ 339 (480) T protein:vir:40 262 A--GTDKNKSQTATKRSLRPQMAEAYLQMDKATVRGVNDSGALSEYVMSEMVNRVIQKVEYNMILGSVDGSNGFYGLKTA 339 (480) T ss_pred c--ccccccccccccchhhHHHHHHHHHhHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccccceee Confidence 0 0000000000 1111110 011111111 1111223356666788888899999998887652222110 Q ss_pred cccccccCCHHHHHHHHHHHHHHHhhcCCCccCC-EEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEE Q lcl|NC_019506. 131 KLKPAATLDKTNIYEELIKVKVKLDEKNVPTIGR-FLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYL 209 (276) Q Consensus 131 ~~~~~~~~t~~~~~~~i~~a~~~l~~~~vP~~~r-~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~ 209 (276) ....+...++.+.++.|..+...-... +. .+|+||..+..|.+... ....+.++..+..|...+++|+++++ T Consensus 340 ~~~~~~~~~~~d~id~L~~al~~~y~~-----~a~~~vmn~~t~~~I~klKD--~~G~Yi~q~~~~~~~~~~llG~pvv~ 412 (480) T protein:vir:40 340 TDGWTKQIEYTDLFEGITDAVAECSIS-----DAITIVMSPQTFAELRKAKG--TDGHSRFNELATKEQIAQSFGAVNLE 412 (480) T ss_pred cccccccchhHHHHHHHHHhhhHHhhC-----CCCEEEECHHHHHHHHHhhc--CCCCeeccCcccccCcceecccceee Confidence 011122234444444444333222222 33 47899999999877543 23456777888899999999999887 Q ss_pred ec-cccccccceEEEEEecceEEeeeeeeeeeeccCc--ccceeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 210 SN-NMGSLTNGTGAIAGVKMACTFAEQIVQTEAYRME--KRFADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 210 s~-~lp~~~~~~~~~~~~~~a~~~~~~~~~~e~~~~~--~~~~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) ++ .+|... ...+....++.+..+- ++..++. +.-...|....+-|..+.+|+++..++...- T Consensus 413 ~~~~~~~~~---~~~~~~~~~~~~~d~~--~~~~~~~~~~~~~~~~~~e~~v~g~~~~~~~~~~~~~~~~ 477 (480) T protein:vir:40 413 TRVWMPKDE---VAVYNHDEYVLIGDLN--VENYNDFDLRYNVEQWLSETLVGGSIRGKNRSAYLKKKGS 477 (480) T ss_pred eeccccCCc---ceeeeCCccEEEEecc--cceecccccccchhhhhhhhhhceeeEccccEEEEEeccC Confidence 53 333211 1111122233333322 2222221 1223455666778888999999999887777 No 196 >protein:vir:107882 Length: 307 # NCBI annotation: gp34 # Family: family:all:908 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024707;genbank:gi:48696944;genbank:GeneID:2845970 Probab=96.04 E-value=0.0011 Score=36.84 Aligned_cols=271 Identities=13% Similarity=0.070 Sum_probs=122.6 Q ss_pred Cccc-hhhHH-HHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCcccceeec--CCCCCCCccccccceEEE Q lcl|NC_019506. 1 MAVT-SFIPK-LWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGAITVKEYT--ENSDIDAPEELSTTEKVL 76 (276) Q Consensus 1 MA~~-~l~~e-~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~--~~~~~~~~~~~~~~~~~~ 76 (276) |..+ .++.. +....++.-..+..+...++-+ -+.....|+-.+|++ ..+...+-. +++.+.-.+--..+..+. T Consensus 2 ~~~~~~~~~dp~LT~~A~gy~n~~~ia~~l~P~--vpv~~~~~k~~~f~~-eaF~~~~t~r~~~~~~~~v~~~~~~~~~~ 78 (307) T protein:vir:10 2 GRLSKLRIVDPVLTNLAIGYTNAEFIGQSLMPV--VEVEKEGGKIPKFGK-ESFRLYKTERALRARSNRMNPEDLGSIDI 78 (307) T ss_pred CCCCCCcccChhHHHHHHhhcchhhhhhhcCCc--ccccccccceeeECc-ccccchhhhcccCCCcceeeccccccccc Confidence 4332 33333 4555555444444444444311 111122344444442 233222211 222221111111233455 Q ss_pred EEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccc----ccc-cccc---cCCHHHHHHHHH Q lcl|NC_019506. 77 EINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNAT----SKL-KPAA---TLDKTNIYEELI 148 (276) Q Consensus 77 ~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~----~~~-~~~~---~~t~~~~~~~i~ 148 (276) .+.++ .....++..+...+.+++.+...+.....|....+..+...+..... ... ..++ .-...+.+..|. T Consensus 79 ~~~~~-~L~~~id~r~~~~~~~~~~~~av~~l~d~I~l~~E~~~A~l~~~~~~y~~~~k~tLsGt~~Wsd~~sDPi~di~ 157 (307) T protein:vir:10 79 VLDEH-DLEYPIDYREDQESAFPLEQAAVQTATEAIQLRREKMVADLAQNPNSYAGGNKKQLSATEKFTAAGSDPVGVIE 157 (307) T ss_pred ccccc-cccccCChhhcCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCccccCCCceEEeccccccCCCCCCcHHHHH Confidence 55443 44566777777778888888888877777766666655554433221 111 1111 012445677888 Q ss_pred HHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEe-ccccccccce------- Q lcl|NC_019506. 149 KVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLS-NNMGSLTNGT------- 220 (276) Q Consensus 149 ~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s-~~lp~~~~~~------- 220 (276) +++..+.+..- ...-.|++++..+..|+.++.+.+.-.+...+.+..-.+..++|++.+.. ...-....+. T Consensus 158 ~~~~ai~~~~g-~~Pn~~vlg~~a~~al~~hp~i~e~lk~~~~g~it~~~la~ll~v~~i~vg~a~~~~~~~~~~~iw~~ 236 (307) T protein:vir:10 158 DGKEAIRTKIG-RRPNTMVIGASAYKTLKAHPQLIEKIKYSMKGIVTVDLLKEIFEVENIAVGEAIYADDKDRFTDIWGA 236 (307) T ss_pred HHHHHHHhhhC-CccceEEeCHHHHHHHhcCHHHHHHhCCccccccCHHHHHHHhCceeEEEeeeeeeccCCccceeCCC Confidence 88887776533 22337999999999999999998876544333332223456777664432 1111111111 Q ss_pred -EEEEEecc------------eEEeeeee--eeeeeccCcccceeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 221 -GAIAGVKM------------ACTFAEQI--VQTEAYRMEKRFADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 221 -~~~~~~~~------------a~~~~~~~--~~~e~~~~~~~~~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) ..+++++. ++|+--+. ..+...+.....+..|+....+--.++=|++=..|.-++= T Consensus 237 ~~vl~yv~~~~~~~~~~~~epsfGyT~~~~g~~~~d~~~~~~~~~~~r~~~~~~~~i~~~~~G~li~~~~~ 307 (307) T protein:vir:10 237 NIVLAYVPLQRGGQQRTPYEPSYGYTLRKKGNPVVDTRIEDGKLELVRSTDIFRPYLLGADAGYLISGING 307 (307) T ss_pred ceEEEecccccCCCCCcccccccceeEEEcCCeEeeceecCCceeEEeccccccceeecccccceeccCCC Confidence 11222111 12322221 1111222233334444443333333333332222322222 No 197 >protein:vir:94070 Length: 339 # NCBI annotation: putative structural protein # Family: family:all:1653 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453625;genbank:gi:84662661;genbank:GeneID:5142580 Probab=95.93 E-value=0.0012 Score=36.53 Aligned_cols=259 Identities=12% Similarity=0.042 Sum_probs=132.4 Q ss_pred Cccchh---hHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCccc-ceeecCCCCCCCccccccceEEE Q lcl|NC_019506. 1 MAVTSF---IPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGAIT-VKEYTENSDIDAPEELSTTEKVL 76 (276) Q Consensus 1 MA~~~l---~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~~~-~~d~~~~~~~~~~~~~~~~~~~~ 76 (276) +++..+ ..+.+..++.+..........++-..-.++ ..-.++.+++....+ ++.|..++.... .+..-..... T Consensus 49 ~~~~~i~a~~~~~i~~~vy~~~~~~~~~~~l~pv~t~g~--w~~~t~~y~~~e~~G~a~~ygd~ad~Pl-~~~~v~~~~~ 125 (339) T protein:vir:94 49 TANAGIPAWMTTFVDRRVIDIQLAPMAAAKIFPEVKKGD--WTTTYGVFIIAEPVGQVATYSDWSANGM-SKANVNFESR 125 (339) T ss_pred ccccchhhhhhhhhchhheeecccccchhhhcccccCCC--CcccEEEEeeeecccceEEcccccCCCc-ccccceeeEE Confidence 333322 233344555555666666666654321111 112588898875543 555544333211 2223333344 Q ss_pred EEEeeeecceeechHHHHhh---hhhHHHHHHHHHHHHHHHHHHHHHHHH---------hhccc----cccccccccCCH Q lcl|NC_019506. 77 EINKQKYFNFQIDDVDAAQI---RTPLMDAAMQRAAYALADETEKILLKE---------MDTNA----TSKLKPAATLDK 140 (276) Q Consensus 77 ~ld~~~~~~~~v~d~d~~~~---~~d~~~~~~~~~~~ala~~~d~~~~~~---------~~~~~----~~~~~~~~~~t~ 140 (276) ++-. ...++.++..|+..+ -.++..+....+.+++.+..|+..+-. +..-. .+..+..+..|. T Consensus 126 ~v~~-~~~g~~y~~~E~~~A~~~g~~l~~~Ka~aA~~al~~~~N~i~~~Gd~~~~~~GLlN~P~l~~~v~~s~~Wa~kT~ 204 (339) T protein:vir:94 126 QNYR-YQTWTEYGDLEMATYGEAGIDYVARQEISASLVMAKFANSSYLLGVAGIANYGLMNDPSLPAPVAATVNWATAAP 204 (339) T ss_pred eEEE-EEEEEeecHHHHHHHHhhCCChHHHHHHHHHHHHHHhhceEEeeeecccceEEEEeCCCccccccCCCCcccCCH Confidence 4422 245667787776543 356677777777788888887743311 11000 001112234578 Q ss_pred HHHHHHHHHHHHHHhhcC----CCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEecccccc Q lcl|NC_019506. 141 TNIYEELIKVKVKLDEKN----VPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSNNMGSL 216 (276) Q Consensus 141 ~~~~~~i~~a~~~l~~~~----vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp~~ 216 (276) +.++++|..+...+.... -|...-.|+++|..+..|-.- +.....--+-+.+ ++-+++|.....+... T Consensus 205 ~eI~~Di~~~~~~l~~~s~g~~~~~~~~~L~LP~~~~~~L~~~----n~~~~Tvl~~lk~----n~pnl~i~~~~el~~a 276 (339) T protein:vir:94 205 EDIANDVVAMVGRLISQSGGLITGQERMVMALAPSALNNVNRT----NNFGLSAGAKIAQ----TYPNIQFVAVPEFDTA 276 (339) T ss_pred HHHHHHHHHHHHHHHHhcCCeeeeccCcEEEecHHHHHhcccC----CcCCccHHHHHHH----hcCCcEEEEccccccC Confidence 888999988888875552 122334699999999988532 1111000011211 2345677776666543 Q ss_pred ccceEEEEEecc-------eEEeeeeeeeeeeccCcccceeeEEeeee-eeeEEEcCCeEEEEEec Q lcl|NC_019506. 217 TNGTGAIAGVKM-------ACTFAEQIVQTEAYRMEKRFADAVKGLNV-FGCKVIYPDALVCLKKT 274 (276) Q Consensus 217 ~~~~~~~~~~~~-------a~~~~~~~~~~e~~~~~~~~~~~i~~~~~-yg~~v~~~~~vv~~~~~ 274 (276) +++ ....+... .+.++..+.....+ ++..+..+.+..+ .|+.+.+|.+++.++== T Consensus 277 ~g~-~~~~~~~~~~~~~~~~~~~p~~~~~lpvq--~~~~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 339 (339) T protein:vir:94 277 SGR-LVQLWVPEVNGQPTGEVAFAEKLRSHSIE--RYSTTTRQKHSGATFGAVIYQPWAVTQELGV 339 (339) T ss_pred CCc-eEEEEEEeccCCcceEEEcchhhhccccE--EcCceEEecceeeeeeEEEEccceeeeeecC Confidence 322 22222211 23334433333222 2334556677766 55667789999887633 No 198 >protein:vir:104342 Length: 314 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398971;genbank:gi:81343955;genbank:GeneID:3778874 Probab=95.70 E-value=0.0016 Score=35.91 Aligned_cols=261 Identities=13% Similarity=0.074 Sum_probs=125.8 Q ss_pred Cc---------------------cc---hhhHHHHH---HHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCccc Q lcl|NC_019506. 1 MA---------------------VT---SFIPKLWS---ARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGAIT 53 (276) Q Consensus 1 MA---------------------~~---~l~~e~~~---~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~~~ 53 (276) || +. .|..+.|. .++.+.....+....++.-. .+....-.++.++.....+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~d~~~~fl~~ql~~id~~v~e~~~~~~~~~~~i~v~--~~~~~~~et~~~~~~e~~G 78 (314) T protein:vir:10 1 MAIKFDAEQAKITTHLEQMGVEKADAAGIWAVSQLTAALNRAYEKEYAENSVVNIFPVT--NEIPGHAKYFEYPEFDGVG 78 (314) T ss_pred CccchHHHHHHHHHHHHhhcccchhhhHHHHHHHHHHHHHHHhhhhccccccceeeccc--cCCCCceeEEEeeeecccc Confidence 11 11 23333333 33333333444444444211 1111112477777765443 Q ss_pred -ceeecCCCC-CCCccccccceEEEEEEeeeecceeechHHHHhh---hhhHHHHHHHHHHHHHHHHHHHHHHHH----- Q lcl|NC_019506. 54 -VKEYTENSD-IDAPEELSTTEKVLEINKQKYFNFQIDDVDAAQI---RTPLMDAAMQRAAYALADETEKILLKE----- 123 (276) Q Consensus 54 -~~d~~~~~~-~~~~~~~~~~~~~~~ld~~~~~~~~v~d~d~~~~---~~d~~~~~~~~~~~ala~~~d~~~~~~----- 123 (276) +..+...+. +.. .+....+....+..+ +.++.++..|...+ ..++-.+....+..++++..|+.++-. T Consensus 79 ~a~~~~d~~~dip~-vd~~~~~~~~~i~~~-~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i~f~G~~~~g 156 (314) T protein:vir:10 79 IAQIIADYSDDLPL-VDAFMTEKQGKVFRF-GNAFLISTDEIKAGAATGQSLSARKQALAFEAHDNLLDKLVWSGSAPHG 156 (314) T ss_pred ceeeeCCcccccce-eecccceeEEEEEEE-EeeEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEeeccccc Confidence 444443322 221 345556667777544 66788877666543 346667777777788888888755422 Q ss_pred ----hhccc-cccccccccCCHHHHHHHHHHHHHHHhhc--CCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceee Q lcl|NC_019506. 124 ----MDTNA-TSKLKPAATLDKTNIYEELIKVKVKLDEK--NVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKN 196 (276) Q Consensus 124 ----~~~~~-~~~~~~~~~~t~~~~~~~i~~a~~~l~~~--~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~ 196 (276) +..-. ......+...|.+.++++|.++...+.+. ++- ..-.++++|..+..|.. . ..+....--+.+.+ T Consensus 157 ~~GLlN~p~v~~~~~~~~WaT~~ei~~Di~~~~~~l~~~s~g~~-~p~~l~Lpp~~~~~L~~-~--~~~~~~tvl~~l~~ 232 (314) T protein:vir:10 157 IVSVFDQPNINNVVATPNWSVPQNAIDDVTAMIDAVESSTQGLH-HVTDILLPASARRVMQG-L--VPQTNLSYGELFTR 232 (314) T ss_pred ceeEeecCCCccccCCCCcccHHHHHHHHHHHHHHHHHhcCccc-cceeEEecHHHHHhhcc-c--ccCCCccHHHHHHH Confidence 11000 00111123357788899999999998865 321 11258999999876632 1 11100000011111 Q ss_pred eeeeEEeceEEEEeccccccccceE--EEEEecc--eEE--eeeeeeeeeeccCcccceeeEEeeeee-eeEEEcCCeEE Q lcl|NC_019506. 197 GFVGTILGFDVYLSNNMGSLTNGTG--AIAGVKM--ACT--FAEQIVQTEAYRMEKRFADAVKGLNVF-GCKVIYPDALV 269 (276) Q Consensus 197 G~i~~~~G~~v~~s~~lp~~~~~~~--~~~~~~~--a~~--~~~~~~~~e~~~~~~~~~~~i~~~~~y-g~~v~~~~~vv 269 (276) +.-+++|.....+.....+.. .+.+.++ -+. ++..+..... .++..+..+.+..++ |+.+.+|.+++ T Consensus 233 ----n~~~l~I~~~~el~~ag~~g~~~~v~y~~~~~~~~~~vp~~~~~l~~--e~~~~~~~~~~~~r~~Gv~i~~P~ai~ 306 (314) T protein:vir:10 233 ----NNPGLTIRFLQFLDNYDGAGGKAALAFEKSPLNMSIEIPEVTNVLPA--QPKDLHFRYPVTSKATGLIVYRPLTMA 306 (314) T ss_pred ----hCCCcEEEEcccccccCCCcceEEEEEecCCcEEEEecCccceeecc--eecCceEEEcceeeeEEEEEECcceeE Confidence 123555665555543332211 2222222 111 2222222211 223345566666666 57777899999 Q ss_pred EEE--ecC Q lcl|NC_019506. 270 CLK--KTN 275 (276) Q Consensus 270 ~~~--~~~ 275 (276) .++ .=| T Consensus 307 ~~dGI~~~ 314 (314) T protein:vir:10 307 VIKGITFA 314 (314) T ss_pred eeeeeecC Confidence 855 222 No 199 >protein:vir:79078 Length: 307 # NCBI annotation: gp8 # Family: family:all:908 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111208;genbank:gi:134288798;genbank:GeneID:4960752 Probab=95.41 E-value=0.0021 Score=35.24 Aligned_cols=269 Identities=13% Similarity=0.099 Sum_probs=119.6 Q ss_pred Ccc-c-hhhHH-HHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCc--ccceee--cCCCCCCCccccccce Q lcl|NC_019506. 1 MAV-T-SFIPK-LWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGA--ITVKEY--TENSDIDAPEELSTTE 73 (276) Q Consensus 1 MA~-~-~l~~e-~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~--~~~~d~--~~~~~~~~~~~~~~~~ 73 (276) |.. + .++.. +++..++.-.+...+-..++=+ -+. ...+.++++++. +..-+- .+++..........+. T Consensus 1 m~~~~~~~~~dp~LT~~A~gy~n~~~Iad~lfP~--vpV---~~~~~k~~~f~~e~f~~~~t~ra~~~~~~~v~~~~~~~ 75 (307) T protein:vir:79 1 MGRLSKLRIVDPVLTNLAIGYTNAEFIGQTLMPV--VEV---EKEGGKIPKFGKESFRLYQTERALRAKSNRMNPEDIDS 75 (307) T ss_pred CCCCCCCcccCHHHHHHHhhccchhhhhhhcCCc--ccc---cccccceeeeccccccccccccccCCCcceeeeecccc Confidence 442 2 33333 4444444323333333333211 111 122333444332 222111 1222221111111234 Q ss_pred EEEEEEeeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccc----cc-cccc---cCCHHHHHH Q lcl|NC_019506. 74 KVLEINKQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATS----KL-KPAA---TLDKTNIYE 145 (276) Q Consensus 74 ~~~~ld~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~----~~-~~~~---~~t~~~~~~ 145 (276) .++.++++ .....|+..+...+.+++.+...+.....|....+.-+...+...... .. ..++ .-...+.+. T Consensus 76 ~~~~~~~~-~l~~~id~r~~~~~~~~~~~~Av~~l~d~I~l~~E~~~A~l~~~~~~y~~~~k~tLsgt~~Wsd~~sDPi~ 154 (307) T protein:vir:79 76 VDVNLDEH-DLEYPIDYREDQESAFPLEQAAVQTATDAIQLRREKMIADLSQNPSSYAAGNKKQLSATEKFTAANSDPVG 154 (307) T ss_pred cccccccc-chhhcccchhcCCCCCCHHHHHHHHHHHHHHhHHHHHHHHHhccccccCCCceEEEccCcccCCCCCCcHH Confidence 56666554 444567777766777888777777766666666666555555433221 11 1111 112345677 Q ss_pred HHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceE-EEEeccccccccc----- Q lcl|NC_019506. 146 ELIKVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFD-VYLSNNMGSLTNG----- 219 (276) Q Consensus 146 ~i~~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~-v~~s~~lp~~~~~----- 219 (276) .|.+++..+.+..- .-.-.|++++..+..|+.++.+.+.-.....+.+..-.+..++|++ |+.-...-...++ T Consensus 155 di~~~~~ai~~~~g-~~Pn~~vlg~~a~~~l~~h~~i~~~lk~~~~g~it~~~la~l~~v~~V~vg~a~y~~~~~~~~~i 233 (307) T protein:vir:79 155 VIEDGKEAIRTKIG-RRPNTMVIGASAYKTLKAHPQLIEKIKYSMKGIVTVDLLKEIFEVENIAVGEAIYADDKDRFTDI 233 (307) T ss_pred HHHHHHHHHHHhhC-CccceEEeCHHHHHHHhcCHHHHHHhcCccccccCHHHHHHHhCceeEEEeeeeeecccccchhc Confidence 88888887776533 2223799999999999999999887655443333322345577776 4432222111111 Q ss_pred ---eEEEEEecc------------eEEeeeee--eeeeeccCcccceeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 220 ---TGAIAGVKM------------ACTFAEQI--VQTEAYRMEKRFADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 220 ---~~~~~~~~~------------a~~~~~~~--~~~e~~~~~~~~~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) ...+++++. ++|+-.+. ......+.....+..|++....--.++=|++=..|+.++= T Consensus 234 w~~~~~l~y~~~~~~~~~~~~~~ps~Gyt~~~~g~~~~d~~~~~~~~~~vrv~~~~~~~i~~~~~G~li~~~v~ 307 (307) T protein:vir:79 234 WGANIVLAYVPLQRGGQQRTPYEPSYGYTLRKKGNPVVDTRIEDGKLELVRATDIFRPYLLGADAGYLISGING 307 (307) T ss_pred CCCceEEEecccccCCCCCcccccccceeEEecCceEEecccCCCceeEEeecccccceeeccccchhhccCCC Confidence 112222111 12222211 1111122223334444444433333333332222222222 No 200 >protein:vir:79642 Length: 329 # NCBI annotation: HsbB # Family: family:all:463 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285525;genbank:gi:148734508;genbank:GeneID:5220000 Probab=95.27 E-value=0.0024 Score=34.95 Aligned_cols=262 Identities=13% Similarity=0.098 Sum_probs=126.1 Q ss_pred Cccc---hhhHHH---HHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCccc-ceeecCCC-CCCCccccccc Q lcl|NC_019506. 1 MAVT---SFIPKL---WSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGAIT-VKEYTENS-DIDAPEELSTT 72 (276) Q Consensus 1 MA~~---~l~~e~---~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~~~-~~d~~~~~-~~~~~~~~~~~ 72 (276) |.++ .|..+. +...+.+.....+....++.-. .+....-.+++++.....+ +..+.... .... .+.... T Consensus 31 ~~~~~~~~f~~~ql~~id~~v~e~~~~~l~~~~~i~i~--~~~~~~~~~~t~~~~~~~G~a~~~~d~~~dip~-vd~~~~ 107 (329) T protein:vir:79 31 NDASDMGIWTSQELHKIKAQAYEKEYPAGSALRVFPVT--SELSDTDKTFEYQTFDKVGHAKIIADYTDDLST-VDALMT 107 (329) T ss_pred eccchhhHHHHHHHHHHHHHHHhhhhcccchhhhcccc--cCCCCceeEEEeeeeecceeeeeecCcccccce-eecccc Confidence 3232 233422 4455555555556666655321 1111222577777765443 44444322 2211 244445 Q ss_pred eEEEEEEeeeecceeechHHHHhh---hhhHHHHHHHHHHHHHHHHHHHHHHHH---------hhcccc-------cccc Q lcl|NC_019506. 73 EKVLEINKQKYFNFQIDDVDAAQI---RTPLMDAAMQRAAYALADETEKILLKE---------MDTNAT-------SKLK 133 (276) Q Consensus 73 ~~~~~ld~~~~~~~~v~d~d~~~~---~~d~~~~~~~~~~~ala~~~d~~~~~~---------~~~~~~-------~~~~ 133 (276) +....+..+ +.++.++..|...+ -.++-.+....+.+++++..|+.++-. +..-.. ...+ T Consensus 108 ~~~~~i~~~-~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i~f~G~~~~g~~GLlN~p~v~~~~~~~~~~~ 186 (329) T protein:vir:79 108 SEFGKVFRL-GNAFLISIDEIKAGQRTGKSLSTRKANAAQNAHDQLVNHLVFKGSKPHKIISVFEHPNLTTINSAGWNNA 186 (329) T ss_pred eeEEEEEEE-EEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEeecccccceeeecCCCccccccCCCCCc Confidence 555666443 56677776665443 356677777888888888888865421 110000 0011 Q ss_pred ccccCCHHHHHHHHHHHHHHHhhc--CCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEec Q lcl|NC_019506. 134 PAATLDKTNIYEELIKVKVKLDEK--NVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSN 211 (276) Q Consensus 134 ~~~~~t~~~~~~~i~~a~~~l~~~--~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~ 211 (276) ..+..|.+.++++|.++...+... ++ ...-.++++|..+..|..- . .+.+....+-..+++ .+++|...+ T Consensus 187 ~w~~kt~~ei~~di~~~~~~l~~~s~g~-~~p~~L~Lpp~~~~~L~~~-~-~~~~~tvl~~lk~~~-----~~l~I~~~~ 258 (329) T protein:vir:79 187 AGTGKKPETAQDELEQAIEKIETLTNGQ-HRANMILIPPSMRKVLMVR-M-PETTMSYLDYFKQQN-----GGITIESIS 258 (329) T ss_pred cccccCHHHHHHHHHHHHHHHHHhcCce-ecccEEEecHHHHHHhhcc-c-CCCCccHHHHHHHhC-----CCcEEEEcc Confidence 123346778899999988888764 32 1123699999999877431 0 111111111111111 234455444 Q ss_pred ccccccc--ceEEEEEecc--e--EEeeeeeeeeeeccCcccceeeEEeeeee-eeEEEcCCeEEEEEecCC Q lcl|NC_019506. 212 NMGSLTN--GTGAIAGVKM--A--CTFAEQIVQTEAYRMEKRFADAVKGLNVF-GCKVIYPDALVCLKKTNP 276 (276) Q Consensus 212 ~lp~~~~--~~~~~~~~~~--a--~~~~~~~~~~e~~~~~~~~~~~i~~~~~y-g~~v~~~~~vv~~~~~~p 276 (276) .+-.... ...++.+..+ - +.++..+.....+ ++.....+....++ |+.+.+|.+++.+.==+- T Consensus 259 el~~ag~~g~~~~v~y~~~~~~~~~~vp~~~~~l~~q--~~~~~~~v~~~~r~~Gv~i~~P~ai~~~dGI~~ 328 (329) T protein:vir:79 259 ELEDIDGAGTKAALVYEKDPMNMSIEIPEAFNMLTAQ--PKDLHFKVPCTSKCTGLTIYRPLTLVLIKGLVV 328 (329) T ss_pred cccccCCCCceEEEEEecCCceEEEecCcceeeeece--ecCceEEEceeeeEEEEEEECcceeeeeeeeee Confidence 4432221 1222333222 1 1122222222222 23334555556565 477778999988762222 No 201 >protein:vir:103886 Length: 302 # NCBI annotation: putative major head subunit protein # Family: family:all:776 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938242;genbank:gi:38229147;genbank:GeneID:2648201 Probab=92.58 E-value=0.011 Score=31.36 Aligned_cols=257 Identities=9% Similarity=0.048 Sum_probs=122.3 Q ss_pred CccchhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCccc-ceeecCCCCCCCccccccceEEEEEE Q lcl|NC_019506. 1 MAVTSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGAIT-VKEYTENSDIDAPEELSTTEKVLEIN 79 (276) Q Consensus 1 MA~~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~~~-~~d~~~~~~~~~~~~~~~~~~~~~ld 79 (276) |..+--.+..+...+...|.+..-..+-.++.+-.....-..+-+-...|.++ ..... ++.. ...+.+...++++. T Consensus 1 m~it~~~l~~l~~~~~~~~~~~y~~a~~~~~~~a~~~~sdf~~~~~~~lg~~p~l~e~~--Ge~~-~~~l~~~~~~i~~~ 77 (302) T protein:vir:10 1 MLINKQSLNAAFVAIKTIFNNAFAAAPTTWQKIAMEVPSNTSSNDYKWLSTFPKMRRWI--GAKV-VKNLKAYKYVVENE 77 (302) T ss_pred CcccHHHHHHHHHHHHHHHHHHHHhhhhhhhceeeecCCCcceeeceecCCCCCccccc--ccee-eccccccceeEEee Confidence 88764333444433444444333222211111111100111222223333332 33322 3332 25677788889987 Q ss_pred eeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccc------------------------ Q lcl|NC_019506. 80 KQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSKLKPA------------------------ 135 (276) Q Consensus 80 ~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~~~~~------------------------ 135 (276) ++ +..+.|+..+.............+..+++.++..|+.++.++.++....-..+ T Consensus 78 ~~-g~~v~i~R~~i~nDdlg~~~~~~~~~G~aaa~~~~~lv~~~L~~g~~~~~~DG~~fF~~dH~~g~~~~~N~g~~~~~ 156 (302) T protein:vir:10 78 DF-EATVEVDRNDIEDDQIGIYSPQAKMAGYSAAQLPDELVYEAVNGAFTKPCFDGQYFIDTDHPVGDASVSNKGTAPLS 156 (302) T ss_pred cc-cceecccHHhhcccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCCcccCCcceecccccccccccccccchhhh Confidence 76 66789999888877778888889999999999999999999887533211000 Q ss_pred ---ccCCHHHHHHHHHHHHHHH----hhcCCCc--cCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEec-e Q lcl|NC_019506. 136 ---ATLDKTNIYEELIKVKVKL----DEKNVPT--IGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILG-F 205 (276) Q Consensus 136 ---~~~t~~~~~~~i~~a~~~l----~~~~vP~--~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G-~ 205 (276) ..++. +.+.+++..| +..+-|. ..++||+.|......... ..+.+.. .|..--+.| + T Consensus 157 ~~~~~l~~----~~~~aa~~am~~~k~~~G~~L~i~P~~LiVp~~le~~A~~l---l~~~~~~------~g~~Np~~g~~ 223 (302) T protein:vir:10 157 NASQAAAK----AGYGAARTAMKKFKDEEGRSLNVSPNVLLVGPALEDVAKML---LTNPKLA------DNTPNPYVGTA 223 (302) T ss_pred hcccccch----HHHHHHHHHHHHHhhhcccccccCCCEEEecchhHHHHHHH---hhccccC------CCCcceeccce Confidence 01112 2233344433 3333332 246899998877655432 2211111 111111223 5 Q ss_pred EEEEeccccccccceEEEEEecceE---Eee-eeeeeeeeccCcccceeeEEeeeeeeeEE------EcCCeEEEEEecC Q lcl|NC_019506. 206 DVYLSNNMGSLTNGTGAIAGVKMAC---TFA-EQIVQTEAYRMEKRFADAVKGLNVFGCKV------IYPDALVCLKKTN 275 (276) Q Consensus 206 ~v~~s~~lp~~~~~~~~~~~~~~a~---~~~-~~~~~~e~~~~~~~~~~~i~~~~~yg~~v------~~~~~vv~~~~~~ 275 (276) +++.+..+... ....++..+..+ -+- .+...++...+++.-+-.++..+.||+.- ..+....-=+.++ T Consensus 224 ~~vv~p~L~s~--~aWyL~a~~~~i~~~~l~g~~~P~~~~~~~~~~dgv~~k~~~d~Gvd~R~~~G~~~wq~a~~s~g~~ 301 (302) T protein:vir:10 224 ELVVDGRIESD--TAWFLLDTTKPVKPFIFQPRKQPEFVSQVNLDSDDVFNLRKLKFGAEARAAAGYGFWQLAYGSTGTG 301 (302) T ss_pred EEEEeeccCCC--CceEEEecCCccceEEEcCccccEEEeccCCCCCceEEEEEEEEeeeeeeecchhhhhhhhccCccC Confidence 77777777532 222333333322 111 12234444444444445566666677522 1121111111111 Q ss_pred C Q lcl|NC_019506. 276 P 276 (276) Q Consensus 276 p 276 (276) - T Consensus 302 ~ 302 (302) T protein:vir:10 302 A 302 (302) T ss_pred C Confidence 1 No 202 >protein:vir:99424 Length: 360 # NCBI annotation: hypothetical protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919080;genbank:gi:119757038;genbank:GeneID:4606077 Probab=92.23 E-value=0.012 Score=31.06 Aligned_cols=263 Identities=10% Similarity=0.060 Sum_probs=113.5 Q ss_pred Cccc---------------------------hhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCccc Q lcl|NC_019506. 1 MAVT---------------------------SFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGAIT 53 (276) Q Consensus 1 MA~~---------------------------~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~~~ 53 (276) |.++ .|.|+++.. +.+...+...|.+.+..- ...-++.+|++++... T Consensus 1 ~~~~~~~~~~~n~~~~~i~k~~it~~~l~~g~L~p~~a~~-Fl~~v~~~t~iL~~~r~~-----~~~s~~~ei~kig~G~ 74 (360) T protein:vir:99 1 MSSNSTIDSVRNQNMNSLSQKDIGLAELDGFQLPVDVTEE-FLERMQKGVQILGMADTM-----TLARLEMEVPQFGVPR 74 (360) T ss_pred CcchhHHHHHhhhHHHHHHhhhccccccCceeecHHHHHH-HHHHHhhccchhhhccee-----ecccccccccccccce Confidence 2221 355777776 444556666677776432 1223556666665432 Q ss_pred --ceeecCCCCCCCccccccceEEE-EEEeeeecceeechHHHHhhhh----hHHHHHHHHHHHHHHHHHHHHHH----- Q lcl|NC_019506. 54 --VKEYTENSDIDAPEELSTTEKVL-EINKQKYFNFQIDDVDAAQIRT----PLMDAAMQRAAYALADETEKILL----- 121 (276) Q Consensus 54 --~~d~~~~~~~~~~~~~~~~~~~~-~ld~~~~~~~~v~d~d~~~~~~----d~~~~~~~~~~~ala~~~d~~~~----- 121 (276) .+...+++.......++..++++ ..+.... .+.+...+..+... ++...+....+..+++.+....+ T Consensus 75 r~~r~~~e~~~~~~~~~~~~~~v~~~~~~~~~~-~~~i~~~~~~~n~~~~~~~f~~~i~~~~ae~~~~Dle~l~~~g~~d 153 (360) T protein:vir:99 75 LSGHTRDEEGSRTENSEAESGSVKFNATDKSYY-ILVEPKRDALKNTHYGPDQFGDYIVDQFIERYGNDLGLMGIRAGAS 153 (360) T ss_pred eeccccccCCCCCcCCcCccccCccccccceee-EeechHHHHHhhhhcccchhHHHHHHHHHHHHHHHHHHHHhhccch Confidence 22222222211112233334443 2333323 23454444433322 11222223333333332221111 Q ss_pred ---------------------HHhhcccccccccc-----------ccC-----------C----HHHHHHHHHHHHHHH Q lcl|NC_019506. 122 ---------------------KEMDTNATSKLKPA-----------ATL-----------D----KTNIYEELIKVKVKL 154 (276) Q Consensus 122 ---------------------~~~~~~~~~~~~~~-----------~~~-----------t----~~~~~~~i~~a~~~l 154 (276) ..+.+.......++ ... + .....+.|.++.+.| T Consensus 154 s~d~~~~~~~d~fl~~~dGwlKka~~~~~~id~a~d~t~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~lf~~~~~~L 233 (360) T protein:vir:99 154 SGNLQSIGGAAELDNTFKGWIARAEGDAQSVDDAGDSTRIGLEDTATADADSMPSIANTDGSGNPQPVDTSLFNETIQTL 233 (360) T ss_pred hcccccCcccchhhhhhHHHHHHhhcccchhhccccccccccccccccccccchhhhccccccccccchHHHHHHHHHhc Confidence 11110000000000 000 0 000123355666666 Q ss_pred hhcCC--CccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEeccccccccceEEEEEecceEEe Q lcl|NC_019506. 155 DEKNV--PTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSNNMGSLTNGTGAIAGVKMACTF 232 (276) Q Consensus 155 ~~~~v--P~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp~~~~~~~~~~~~~~a~~~ 232 (276) ...-- |...-+.+++|..+...... +.......+..++..+..-...|++++..+.+|. ...++.++.-+.+ T Consensus 234 p~kyr~~~~~~~~~~~s~~~~~~yr~~--L~~R~t~LGd~~l~g~~~~~~~Gipi~~v~~~pd----~~~mlT~p~NLi~ 307 (360) T protein:vir:99 234 DSRYRESDAYSPVLMTSPNQVQSYTMS--LTEREDPLGSAVIFGDSDITPFSYDLVGVNGFPD----EYMMFTDPNNLAF 307 (360) T ss_pred chhhhcCcccceEEEccCchHHHHHHH--HhccCcccchhheecccccccceeeeEEcCCCCC----CceEEeccCceeE Confidence 54421 11122678888876666442 3444444555566655444688999999888874 2366667766666 Q ss_pred ee-eeeeeeeccCcccce----eeEEee-eeeeeEEEcCCeEEEEE-ecCC Q lcl|NC_019506. 233 AE-QIVQTEAYRMEKRFA----DAVKGL-NVFGCKVIYPDALVCLK-KTNP 276 (276) Q Consensus 233 ~~-~~~~~e~~~~~~~~~----~~i~~~-~~yg~~v~~~~~vv~~~-~~~p 276 (276) +. +..+++....+.+.+ ..++.+ ..+.+.+-+++++|++. .-.| T Consensus 308 g~~~~iri~~~~e~~~~~~~~~~~~~~~~~~~D~~iee~~Av~~vt~~~~~ 358 (360) T protein:vir:99 308 GLYEEMELDQSTDTDKVHEQRLHSRNWLEGQFDFQIKEQQAGVLVTDLETP 358 (360) T ss_pred EeeeeeEEeecccchhhhhhceeeeEEEEEEeeEEEEecccEEEEecCCCC Confidence 52 223332222222211 122332 33445445567777765 4445 No 203 >protein:vir:348 Length: 321 # NCBI annotation: major virion structural protein # Family: family:all:3198 # MgeID: mge:9 # MgeName: Mx8 # Cross-refs: genbank:acc:NP_203462;genbank:gi:15320618;genbank:GeneID:921734 Probab=87.58 E-value=0.038 Score=28.38 Aligned_cols=265 Identities=13% Similarity=0.097 Sum_probs=136.6 Q ss_pred CccchhhHHHHHHHHHHH-------HHHhhcchh-hhccccccccccCCcEEEEecc--CcccceeecCCCCCCCccccc Q lcl|NC_019506. 1 MAVTSFIPKLWSARLLAH-------LDKAHVVAN-LVNRDYEGEIKAYGDTVKINQI--GAITVKEYTENSDIDAPEELS 70 (276) Q Consensus 1 MA~~~l~~e~~~~~~~~~-------l~~~~v~~~-~~~~~~~~~~~~~Gdtv~ip~~--~~~~~~d~~~~~~~~~~~~~~ 70 (276) |+-..| .|+....+.+. ..++..+.. |..+. ..++..+|.+|..|.- ...+...|..-..+.....-. T Consensus 1 mp~~~l-sel~t~tl~~rs~~~~D~v~~~n~LL~~L~~kG-~~~~~~gg~~I~~~l~y~~~s~~~wy~Gyd~l~~~p~d~ 78 (321) T protein:vir:34 1 MPFPNI-SDIITTTIESRSGVIADNVTKNNAILARLAKRG-KPRLVSGGYTILEELSFSGNSNGGWYSGYDVLPTAPQDV 78 (321) T ss_pred CCCchH-HHHHHHHHHhhcchhhhhhhcccHHHHHHHhcC-cccccCCCeeEEEEEeeccCcceeEEEeeeeeccchhhh Confidence 887543 45555443332 222332332 33331 2223356788888743 345566666554443322223 Q ss_pred cceEEEEEEeeeecceeechHHHHhh-----hhhHHHHHHHHHHHHHHHHHHHHHHHHhhc--------------ccccc Q lcl|NC_019506. 71 TTEKVLEINKQKYFNFQIDDVDAAQI-----RTPLMDAAMQRAAYALADETEKILLKEMDT--------------NATSK 131 (276) Q Consensus 71 ~~~~~~~ld~~~~~~~~v~d~d~~~~-----~~d~~~~~~~~~~~ala~~~d~~~~~~~~~--------------~~~~~ 131 (276) .+..++...+ -..++.|+-.|..+. ..|++++.++.+-+.+++.+|.+|.+-..+ ...+. T Consensus 79 ~~~Aef~wk~-aa~~~~isg~e~l~n~g~~~~idll~~~~~~ae~t~~n~l~~~l~sdGTa~g~~~i~GL~~lv~~~p~t 157 (321) T protein:vir:34 79 ISSAEYALKQ-YAVPVVISGLEMLQNSGKEAQLDLLEARMNVAEATMANDISAALYGDGTAFGGRAINGLDGAVPVDPTV 157 (321) T ss_pred ccccccchhh-eeEeeEEehhHHhhccchHHHHHHHHHHHHHHHHHHHhhhhHhhhccccccccchhhhhhhhcccCCCC Confidence 3456677644 367788998887754 469999999999999999999988753222 00000 Q ss_pred c------------------cccccCCHHHHHHHHHHHHHHHhhcC-CCccCCEEEECHHHHHHHhhhHHhhhhccccccc Q lcl|NC_019506. 132 L------------------KPAATLDKTNIYEELIKVKVKLDEKN-VPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAES 192 (276) Q Consensus 132 ~------------------~~~~~~t~~~~~~~i~~a~~~l~~~~-vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~ 192 (276) + ..++..|..++......+..++-+.+ -| -.++.+.++|..-.+.-+-.. -+...+ T Consensus 158 GtvGGIdra~~~~WRn~~~d~~~~~t~~tl~~~m~~~w~~~~Rg~~~P---Dlii~~~~~y~~y~~s~q~~q--R~~~~~ 232 (321) T protein:vir:34 158 GTYGGINRALWPFWRSQVEDMAAVATINTIQPAMTKLWSRCVRGADMP---DLIMSGNDAWTTYSNSLQVLQ--RFTSAE 232 (321) T ss_pred ceeccccccchhhhhhhhhhhhhcccHHHHHHHHHHHHHhhccCCCCc---cEEEechHHHHHHHHhhheee--eecccc Confidence 1 11122234443344444444333322 22 267777777776655433222 222222 Q ss_pred ceeeeeee-EEeceEEEEeccccccccceEEEEEecceEEeee-eeeeeeeccCcccce----eeEEee-eeeeeEEE-c Q lcl|NC_019506. 193 ITKNGFVG-TILGFDVYLSNNMGSLTNGTGAIAGVKMACTFAE-QIVQTEAYRMEKRFA----DAVKGL-NVFGCKVI-Y 264 (276) Q Consensus 193 ~~~~G~i~-~~~G~~v~~s~~lp~~~~~~~~~~~~~~a~~~~~-~~~~~e~~~~~~~~~----~~i~~~-~~yg~~v~-~ 264 (276) .-.-|..+ .+.|.+|+-..++.......++++...+.+.+.. ....+....+.. ++ |.+... .-.|.-++ + T Consensus 233 ~a~~Gf~~Lky~~~div~D~~~g~~~pan~~yfiNT~yl~~r~h~~~~~~pi~p~r-~~~~NqdA~~q~I~~~GnL~~sn 311 (321) T protein:vir:34 233 EANLGFRSLKFLSTDVVLDGGIGGFAGANTMYFLNTKYLHFRPHKDRNMVPLSPSR-RAAFNQDAEAQILAWAGNLTCSG 311 (321) T ss_pred cccccceeeeeeeEEEEEeCCCCCCccccceeeeecceEEEEEcCCCceeecCccc-ccccchhHHhhhhhhhheeeeec Confidence 23334443 5789999998876655555667777777666652 111222222211 11 222111 22333333 4 Q ss_pred CCeEEEEEec Q lcl|NC_019506. 265 PDALVCLKKT 274 (276) Q Consensus 265 ~~~vv~~~~~ 274 (276) +..=++++.. T Consensus 312 ~~~~~vL~~~ 321 (321) T protein:vir:34 312 AQFQGRLIAE 321 (321) T ss_pred ccceeEEeeC Confidence 5555555555 No 204 >protein:vir:99888 Length: 309 # NCBI annotation: capsid protein # Family: family:all:908 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164075;genbank:gi:56692607;genbank:GeneID:3192616 Probab=84.86 E-value=0.057 Score=27.40 Aligned_cols=265 Identities=12% Similarity=0.040 Sum_probs=116.5 Q ss_pred CccchhhHHHHHHHHHHHHH-HhhcchhhhccccccccccCCcEEEEeccCccccee--ecCCCCCCCccccccceEEEE Q lcl|NC_019506. 1 MAVTSFIPKLWSARLLAHLD-KAHVVANLVNRDYEGEIKAYGDTVKINQIGAITVKE--YTENSDIDAPEELSTTEKVLE 77 (276) Q Consensus 1 MA~~~l~~e~~~~~~~~~l~-~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~~~~~d--~~~~~~~~~~~~~~~~~~~~~ 77 (276) |+|..+.+.-.-..+...++ ..++-..++-+ -+.....|+-..+++-..+...+ ..+++.+.. -+...+..++. T Consensus 1 ~~~~~~~~dp~LT~~A~gy~n~~~Ia~~l~P~--vpV~~~~~~~~~f~~~e~F~~~~t~r~~~~~~~~-v~~~~~~~~~~ 77 (309) T protein:vir:99 1 MSNAPFPIDPELTAIAIAYRNGRMISDEVLPR--VPVGKQEFKFWKYDLAQGFTVPETLVGRKSKPNE-VEFSATDETGS 77 (309) T ss_pred CCCCCcCcCHhHHHHHhhccChhhhhhhcCCc--cccCccccceeeechhhcccccchhhccCCCcce-EeecccCceee Confidence 99988776643333444443 44433333311 11111223333444333333322 223433322 13344556677 Q ss_pred EEeeeecceeechHHHH--hhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccc-----ccccccc---CCHHHHHHHH Q lcl|NC_019506. 78 INKQKYFNFQIDDVDAA--QIRTPLMDAAMQRAAYALADETEKILLKEMDTNATS-----KLKPAAT---LDKTNIYEEL 147 (276) Q Consensus 78 ld~~~~~~~~v~d~d~~--~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~-----~~~~~~~---~t~~~~~~~i 147 (276) +..+ .....|+..+.. ...+|+.+...+.....|....+..+...+...++. ....++. ....+++..| T Consensus 78 ~~~~-~L~~~i~~~~~~~a~~~~d~~~~Av~~l~~~i~l~rE~~~A~lv~~~a~y~~~~k~~Lsgt~~wsd~~SDPi~~i 156 (309) T protein:vir:99 78 TEDH-GLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAAGNKTTLSGADQWSDPTSNPLPVI 156 (309) T ss_pred eccc-ceeecCCchhhhhccCCCCHHHHHHHHHHHHHHHHHHHHHHHHhcChhhcCCCceEEecCccccCCCCCCcHHHH Confidence 6544 555566666543 446788888888777777666665555544333221 1111211 1234556777 Q ss_pred HHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccc--cceeeeeeeEEeceE-EEEeccccccc----cce Q lcl|NC_019506. 148 IKVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAE--SITKNGFVGTILGFD-VYLSNNMGSLT----NGT 220 (276) Q Consensus 148 ~~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~--~~~~~G~i~~~~G~~-v~~s~~lp~~~----~~~ 220 (276) .+++..+ +. ..-.|+++...+..|+.++.+....++... +.+..-.+..++|++ |+.....-..+ ++. T Consensus 157 ~~~~~~~---g~--~PN~~vlg~~~~~~l~~hp~i~~~ik~~~~~~g~it~~~la~l~~ve~V~vg~a~~n~a~~g~~~~ 231 (309) T protein:vir:99 157 TDALDSV---IL--RPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPMAFLQELLELDAIYIGEARLNIARPGQNPN 231 (309) T ss_pred HHHHHhh---CC--CcceEEechHHHHHHhhCHHHHHHhcCCCccccccCHHHHHHHhCcceEEeecceeeccccccccc Confidence 7776654 21 223799999999999999999887654432 123233455678884 55433221111 011 Q ss_pred EEEEE-ecceEEeeeeee-eeee-------ccCcccceeeEEeeeeee---eEEEcCCeEEEEEecCC Q lcl|NC_019506. 221 GAIAG-VKMACTFAEQIV-QTEA-------YRMEKRFADAVKGLNVFG---CKVIYPDALVCLKKTNP 276 (276) Q Consensus 221 ~~~~~-~~~a~~~~~~~~-~~e~-------~~~~~~~~~~i~~~~~yg---~~v~~~~~vv~~~~~~p 276 (276) ..-.| ...++.+..... .++. .......+.++ +-.|| ..++|-.-.+.-..++| T Consensus 232 ~~~iwg~~~~L~y~~~~~~~~~~ps~G~t~~~~~r~~g~~~--d~~~~~~g~~~vr~~~~~k~~i~~~ 297 (309) T protein:vir:99 232 LIRAWGPHASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIA--DPNIGLRGGQRVRVGESVKELVTAP 297 (309) T ss_pred cccccCCcEEEEEcCCCCCCcccccccceeecccccCCcee--eeeeccCCceEEEEeccccchhcch Confidence 11111 111222221111 0100 00001111111 11122 22222111111112444 No 205 >protein:vir:3643 Length: 336 # NCBI annotation: gp12 # Family: family:all:1653 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705638;genbank:gi:23752323;genbank:GeneID:955719 Probab=82.22 E-value=0.079 Score=26.63 Aligned_cols=259 Identities=12% Similarity=0.010 Sum_probs=123.9 Q ss_pred Ccc----------chhhHHHHHHHHH----HHHHHhhcchhhhccccccccccCCcEEEEeccCccc-ceeecCCCCCCC Q lcl|NC_019506. 1 MAV----------TSFIPKLWSARLL----AHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGAIT-VKEYTENSDIDA 65 (276) Q Consensus 1 MA~----------~~l~~e~~~~~~~----~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~~~-~~d~~~~~~~~~ 65 (276) .|+ +.-+|..++.-+. +.+........++--. ......-+++.++.....+ +.-|...++... T Consensus 34 da~d~~~~~~~~~~~~~~~~l~~~i~p~~~~~~~~~~~~~~l~pv~--t~g~W~~~~~~~~~~e~~G~a~~ygd~~D~P~ 111 (336) T protein:vir:36 34 DAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELVGES--KKGDWTTLVAAFITAEPTTKVATYGDYSSDGD 111 (336) T ss_pred hhhhccCccccCCCcchHHHHHHhhccceEeeecchhhhhhhcccc--ccCCccceeEEEeeeeceeeEEEeeccCCCce Confidence 011 1113455444332 2222222233332110 0111112567777765432 444433333211 Q ss_pred ccccccceEEEEEEeeeecceeechHHHHhh---hhhHHHHHHHHHHHHHHHHHHHHHH---------HHhhccccc--- Q lcl|NC_019506. 66 PEELSTTEKVLEINKQKYFNFQIDDVDAAQI---RTPLMDAAMQRAAYALADETEKILL---------KEMDTNATS--- 130 (276) Q Consensus 66 ~~~~~~~~~~~~ld~~~~~~~~v~d~d~~~~---~~d~~~~~~~~~~~ala~~~d~~~~---------~~~~~~~~~--- 130 (276) .+..-...+.++... ..++.++..|...+ -.++..+....+.+++.++.++..+ +.+..-... T Consensus 112 -~d~~~~~~~~~v~~~-~~g~~yg~~E~~~Aa~~~~~l~~~Ka~aA~~ale~~~N~i~~~Gd~~~~~yGllNdP~l~a~~ 189 (336) T protein:vir:36 112 -SGANINYPQRQSYFF-QTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPI 189 (336) T ss_pred -eecccceeeeeEEEE-EeeeeeCHHHHHHHHHhCCCcHHHHHHHHHHHHHHhhCcEEEEeccccceEEEEecCCCcccc Confidence 234445566666443 55678887776543 3567777777777888888776432 111110000 Q ss_pred --cccccccCCHHHHHHHHHHHHHHHhhcC--C-C-ccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEec Q lcl|NC_019506. 131 --KLKPAATLDKTNIYEELIKVKVKLDEKN--V-P-TIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILG 204 (276) Q Consensus 131 --~~~~~~~~t~~~~~~~i~~a~~~l~~~~--v-P-~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G 204 (276) .+......+.+.++++|.++...+.... . . ...-.|+++|..+..|-.- + ..+ . .+.+-.--++-+ T Consensus 190 t~~t~~~~~~t~~ei~~Di~~~~~~l~~qt~G~i~~~~~~tL~LP~~~~~~Ls~~----n--~~g-~-Tvl~~lk~n~Pn 261 (336) T protein:vir:36 190 TATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSKT----N--QYG-L-AAAAKLKDIFPK 261 (336) T ss_pred ccCCCcccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccEEEechHHHHhccCC----C--ccC-c-cHHHHHHHhcCc Confidence 0111123466788999998888887642 1 1 1223689999988877431 1 111 1 111111112445 Q ss_pred eEEEEeccccccccceEEEEEecc-------eEEeeeeeeeeeeccCcccceeeEEeeee-eeeEEEcCCeEEEEEec Q lcl|NC_019506. 205 FDVYLSNNMGSLTNGTGAIAGVKM-------ACTFAEQIVQTEAYRMEKRFADAVKGLNV-FGCKVIYPDALVCLKKT 274 (276) Q Consensus 205 ~~v~~s~~lp~~~~~~~~~~~~~~-------a~~~~~~~~~~e~~~~~~~~~~~i~~~~~-yg~~v~~~~~vv~~~~~ 274 (276) ++|+..+.+.... +..+..+.+. -+.++.++.....+ ++.....+.+..+ .|+.+.||-+++.+.== T Consensus 262 l~i~t~pEl~~a~-g~~~~l~~~~~~~~~t~~~~~p~~~~~l~vq--~~~~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 336 (336) T protein:vir:36 262 LEFVTIPEYDTAS-GRLVQLWAPRVEGKDTATCGFTEKMRAHSIE--RYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) T ss_pred cEEEEccccccCC-CceEEEEEEecCCCcceeeecchhhhcccee--ecCceeEeccccceeeeeeeccchheeeecC Confidence 6777766664333 3333333221 12333333332232 2333445566655 45666679888886533 No 206 >protein:vir:78558 Length: 336 # NCBI annotation: major capsid protein # Family: family:all:1653 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294848;genbank:gi:149882911;genbank:GeneID:5291029 Probab=80.57 E-value=0.094 Score=26.22 Aligned_cols=258 Identities=14% Similarity=0.057 Sum_probs=123.3 Q ss_pred CccchhhHHHHHHHHH----HHHHHhhcchhhhccccccccccCCcEEEEeccCccc-ceeecCCCCCCCccccccceEE Q lcl|NC_019506. 1 MAVTSFIPKLWSARLL----AHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGAIT-VKEYTENSDIDAPEELSTTEKV 75 (276) Q Consensus 1 MA~~~l~~e~~~~~~~----~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~~~-~~d~~~~~~~~~~~~~~~~~~~ 75 (276) +++. -+|..++.-+. +.+........++-.+-.+ ...-+++.++.....+ +.-|...++.. -.+..-.... T Consensus 45 ~~~~-g~~~~l~~~i~p~~~~~~~~~~~~~~l~~v~t~g--~W~~~~~~~~~~e~~G~a~~ygd~~D~P-~vd~~~~~~~ 120 (336) T protein:vir:78 45 TGSS-GIPNYLTTYVDPSVIDILVAPMKAAELVGESKKG--DWTTLVAAFITAEPTTTVATYGDYSSDG-DSGTNINYPQ 120 (336) T ss_pred CCCc-chHHHHHHhcccceeeehhhhhhhhhhcccccCC--CccccEEEEeeeecceeeEEeecccCCC-eeecceeeEE Confidence 2221 12444443332 2222222233333211011 1112578887765443 44444333321 1344455666 Q ss_pred EEEEeeeecceeechHHHHhh---hhhHHHHHHHHHHHHHHHHHHHHHH-H--------Hhhccccc-cccc----cccC Q lcl|NC_019506. 76 LEINKQKYFNFQIDDVDAAQI---RTPLMDAAMQRAAYALADETEKILL-K--------EMDTNATS-KLKP----AATL 138 (276) Q Consensus 76 ~~ld~~~~~~~~v~d~d~~~~---~~d~~~~~~~~~~~ala~~~d~~~~-~--------~~~~~~~~-~~~~----~~~~ 138 (276) .++... ..++.++..|...+ -.++..+....+.+++.+.+++..+ + .+..-... ..++ .... T Consensus 121 ~~v~~~-~~g~~yg~~El~~A~~~g~~l~~~Ka~aA~~ale~~~N~~~~~Gd~~~~~~GllN~P~l~a~~t~~~~~w~~~ 199 (336) T protein:vir:78 121 RQSYFF-QTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSP 199 (336) T ss_pred EEEEEE-EeeeeecHHHHHHHHHhCCCcHHHHHHHHHHHHHHhhCeEEEEeccccceEEEEeCCCCCcccccCcCccccc Confidence 666443 56678887776543 3566777777777777777775322 1 11100000 0111 1235 Q ss_pred CHHHHHHHHHHHHHHHhhcC---C-CccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEecccc Q lcl|NC_019506. 139 DKTNIYEELIKVKVKLDEKN---V-PTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSNNMG 214 (276) Q Consensus 139 t~~~~~~~i~~a~~~l~~~~---v-P~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp 214 (276) |.+.++++|..+...+...- + +...-.++++|..+..|..- +.....--+-+.+ ++-+++|+..+.+. T Consensus 200 T~~~I~~Di~~~~~~l~~qt~g~~~~~~~~tL~Lp~~~~~~L~~~----n~~g~tv~~~lk~----n~Pnl~i~t~pel~ 271 (336) T protein:vir:78 200 AVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSKT----NQYGLSAAAKLKE----IFPKLEFVTIPEYD 271 (336) T ss_pred CHHHHHHHHHHHHHHHHHhcCCeeeeccceEEEechHHHHhccCC----CccCccHHHHHHH----hcCccEEEEccccc Confidence 67888999998888875543 1 12223689999999988432 1110000011111 23456777766664 Q ss_pred ccccceEEEEEecc-------eEEeeeeeeeeeeccCcccceeeEEeeeee-eeEEEcCCeEEEEEec Q lcl|NC_019506. 215 SLTNGTGAIAGVKM-------ACTFAEQIVQTEAYRMEKRFADAVKGLNVF-GCKVIYPDALVCLKKT 274 (276) Q Consensus 215 ~~~~~~~~~~~~~~-------a~~~~~~~~~~e~~~~~~~~~~~i~~~~~y-g~~v~~~~~vv~~~~~ 274 (276) ..+ +.....+.+. -+.++.++.....+. +.....+.+..+. |+.+.||-+++.+.== T Consensus 272 ~Ag-g~~~~~~~~~~~~~~t~~~~~p~~f~~lpvq~--~~~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 336 (336) T protein:vir:78 272 TAS-GRLVQLWAPRVEGKDTATCGFTEKMRAHSIER--YSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) T ss_pred ccC-cceEEEEEeeccCCcceeeecchhhhccceee--cCceeEeccccceeeeeeeccchheeeccC Confidence 433 3333333222 123333443333332 2334455666554 5556678888876533 No 207 >protein:vir:96490 Length: 348 # NCBI annotation: head protein # Family: family:all:1083 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238492;genbank:gi:66391768;genbank:GeneID:5176912 Probab=80.00 E-value=0.099 Score=26.09 Aligned_cols=268 Identities=10% Similarity=0.003 Sum_probs=100.1 Q ss_pred Ccc--chhhHHHHHHHHHHHHHHh--hcchhhhccccccccccCCcEEEEecc--Ccccceee-cCCCCCCCccccccce Q lcl|NC_019506. 1 MAV--TSFIPKLWSARLLAHLDKA--HVVANLVNRDYEGEIKAYGDTVKINQI--GAITVKEY-TENSDIDAPEELSTTE 73 (276) Q Consensus 1 MA~--~~l~~e~~~~~~~~~l~~~--~v~~~~~~~~~~~~~~~~Gdtv~ip~~--~~~~~~d~-~~~~~~~~~~~~~~~~ 73 (276) ||+ .+|.+..++..+.+.-.+. .+...++- .....+-++.+-+. +..-+..+ .++.+......-.... T Consensus 1 M~~i~d~f~~~~l~~~i~~~~~~~~~~l~~~~Fp-----~~~~~~~~~~~~~~~~~~~~~a~~v~~~~~~~~~~r~~~~~ 75 (348) T protein:vir:96 1 MGLIYDKVTASNIAGYFNTLQENVDSTLGESIFP-----ARKQLGTKLSYIKGASGQSVALKAAAFDTNVTIRDRVSAEI 75 (348) T ss_pred CcchhhccCHHHHHHHHHhcccchhhhhhhhcCC-----CccccceeEEEEeecCCceeEeeeecCCCCcceecccceee Confidence 995 4677777777554432221 11122221 11111222221111 11111222 2222211111111223 Q ss_pred EEEEEEeeeecceeechHHHHh--hh------------hhHHHHHHHHHHHHHHHHHHHHHHHHhhccccc--------- Q lcl|NC_019506. 74 KVLEINKQKYFNFQIDDVDAAQ--IR------------TPLMDAAMQRAAYALADETEKILLKEMDTNATS--------- 130 (276) Q Consensus 74 ~~~~ld~~~~~~~~v~d~d~~~--~~------------~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~--------- 130 (276) ..+.+-.. .-...++..|... .. .+.+.+-...+...+.+.++--++..+..+... T Consensus 76 ~~~~~p~i-~~~~~i~~~d~~~l~~~~~~~~~~~~~~~~~~i~~d~~~l~~~i~~r~E~m~~qal~~Gki~~~~~~~~~~ 154 (348) T protein:vir:96 76 HDEQMPFF-KEALLVKENDRQQLNLVKDTGNEALINTIVAGIFNDDVTLINGARARLEAMRMQVLATGKIAFTSDGVNKD 154 (348) T ss_pred eeeecCcc-ccccccCHHHHHHHHhhhccCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCeeEeecCCeeEE Confidence 33333222 1123344433211 11 111111122333444444444444444322110 Q ss_pred ---------cccc-cc-cCCHHHHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccc--cccc---- Q lcl|NC_019506. 131 ---------KLKP-AA-TLDKTNIYEELIKVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAM--AESI---- 193 (276) Q Consensus 131 ---------~~~~-~~-~~t~~~~~~~i~~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~--~~~~---- 193 (276) ..+. +. ..+..+.++.|.++...+.+.+.. ..+++++++++..|++++.+.+.-... .... T Consensus 155 vdfg~~~~~~~t~~~~W~~~~adp~~di~~~~~~~~~~G~~--~~~~i~~~~~~~~l~~~~~v~~~~~~~~~~~~~~~~~ 232 (348) T protein:vir:96 155 IDYGVKADHKKQVSKSWAEPGATPLADLEDAIETARELGLN--PERAIMNAKTFGLIRKAASTVKAIKPLAGDGSSVTKA 232 (348) T ss_pred EeccCCcccceeeccccCCCCCCHHHHHHHHHHHHHhcCCc--ccEEEeCHHHHHHHhcCHHHHHHHhccCCccccccHH Confidence 0010 11 112345678888888888776652 347999999999999998887643211 1111 Q ss_pred eeeeeeeEEeceEEEEeccccccccceE--------EEEEecceEE---eeee---eeee------------------ee Q lcl|NC_019506. 194 TKNGFVGTILGFDVYLSNNMGSLTNGTG--------AIAGVKMACT---FAEQ---IVQT------------------EA 241 (276) Q Consensus 194 ~~~G~i~~~~G~~v~~s~~lp~~~~~~~--------~~~~~~~a~~---~~~~---~~~~------------------e~ 241 (276) .....++...|++|+..+.--...+|.. .++...+..| ++.- ...+ .. T Consensus 233 ~~~~~~~~~~g~~i~~y~~~y~d~~G~~~~~~p~~~v~l~~~~~~G~~~yg~~~e~~~~~~~~~~~~~~~~~~~~~~~~~ 312 (348) T protein:vir:96 233 ELQNYVADNYGVEIVLENGTYRNEKGEVSKFFPDGHLTLIPNGPLGNTVFGTTPEESDLFADNTVNADVEIVDSGIAVTT 312 (348) T ss_pred HHHHHHhhhcCceEEEEccEEEecCCcEeccccCCeEEEEcCCCceeEEeccChhhhhhhhcccccccceecCCeeEEEe Confidence 1122344567888776433211111211 1222222111 1100 0000 00 Q ss_pred ccCcccceeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 242 YRMEKRFADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 242 ~~~~~~~~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) +...+-.+..+.+-.+-=-.+.+|+++.++++-+= T Consensus 313 ~~~~dP~~~~~~~~s~plPv~~~~~~~~~a~Vl~~ 347 (348) T protein:vir:96 313 TKTTDPVNVQTKVSMVALPSFERLGDVYMLTVIPG 347 (348) T ss_pred eecCCCceEEEEEeeeeeccccCCCcEEEEEEecC Confidence 00000001111111110011125666666654333 No 208 >protein:vir:101557 Length: 336 # NCBI annotation: gp12 # Family: family:all:1653 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958117;genbank:gi:41057663;genbank:GeneID:2716814 Probab=78.70 E-value=0.11 Score=25.80 Aligned_cols=258 Identities=12% Similarity=0.007 Sum_probs=124.3 Q ss_pred CccchhhHHHHHHHHHHHH----HHhhcchhhhccccccccccCCcEEEEeccCccc-ceeecCCCCCCCccccccceEE Q lcl|NC_019506. 1 MAVTSFIPKLWSARLLAHL----DKAHVVANLVNRDYEGEIKAYGDTVKINQIGAIT-VKEYTENSDIDAPEELSTTEKV 75 (276) Q Consensus 1 MA~~~l~~e~~~~~~~~~l----~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~~~-~~d~~~~~~~~~~~~~~~~~~~ 75 (276) +++ .-+|..+..-+...+ ........++--. ......-+++.++.....+ +.-|...++... .+..-...+ T Consensus 45 ~~~-~~i~~~l~~~i~p~~~~~~~~p~~a~~l~pv~--t~g~W~~~~~~~~~~e~~G~a~~ygd~~D~P~-~d~~~~~~~ 120 (336) T protein:vir:10 45 TGS-SGIPNYLTTYVDPAVIDILVAPMKAAELVGES--KKGDWTTLVAAFITAEPTTKVATYGDYSSDGD-SGANINYPQ 120 (336) T ss_pred CCC-chhHHHHHhhcccceeeehhhhhhhhhhcccc--ccCCccceeEEEeeeeceeeEEEeeccCCCce-eecccceee Confidence 112 223444443332222 2222222232110 0111112567777765432 444443333211 234445566 Q ss_pred EEEEeeeecceeechHHHHhh---hhhHHHHHHHHHHHHHHHHHHHHHH---------HHhhccccc-----cccccccC Q lcl|NC_019506. 76 LEINKQKYFNFQIDDVDAAQI---RTPLMDAAMQRAAYALADETEKILL---------KEMDTNATS-----KLKPAATL 138 (276) Q Consensus 76 ~~ld~~~~~~~~v~d~d~~~~---~~d~~~~~~~~~~~ala~~~d~~~~---------~~~~~~~~~-----~~~~~~~~ 138 (276) .++... ..++.++..|+..+ -.++..+....+.+++.++.++..+ +.+..-... .+...... T Consensus 121 ~~v~~~-~~g~~yg~~El~~A~~~g~~l~~~Ka~aA~~ale~~~N~i~~~Gd~~~~~yGllN~P~l~a~~t~~t~~~~~~ 199 (336) T protein:vir:10 121 RQSYFF-QTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSP 199 (336) T ss_pred eeEEEE-EeeeeeCHHHHHHHHHhCCCcHHHHHHHHHHHHHHhhCcEEEEeccccceEEEEeCCCCccccccCCCccccc Confidence 666443 55678887776543 3577777777788888888876433 111110000 01111234 Q ss_pred CHHHHHHHHHHHHHHHhhcC--C-C-ccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEecccc Q lcl|NC_019506. 139 DKTNIYEELIKVKVKLDEKN--V-P-TIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSNNMG 214 (276) Q Consensus 139 t~~~~~~~i~~a~~~l~~~~--v-P-~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp 214 (276) +.+.++++|.++...|.... + . ...-.++++|..+..|-.- + ..+ . .+.+-.--++-+++|+..+.+. T Consensus 200 t~eei~~Di~~~~~~l~~qs~G~i~~~~~~tL~LP~~~~~~Ls~~----n--~~g-~-Tvl~~lk~n~Pnl~i~t~pEl~ 271 (336) T protein:vir:10 200 AVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSKT----N--QYG-L-AAAAKLKDIFPKLEFVTIPEYD 271 (336) T ss_pred CHHHHHHHHHHHHHHHHHhcCCeecccCcceEEecHHHHHhccCC----C--ccC-c-cHHHHHHHhcCccEEEEccccc Confidence 66788999998888887643 2 2 1234689999988877431 1 111 1 1111111124466777766664 Q ss_pred ccccceEEEEEecc-------eEEeeeeeeeeeeccCcccceeeEEeeee-eeeEEEcCCeEEEEEec Q lcl|NC_019506. 215 SLTNGTGAIAGVKM-------ACTFAEQIVQTEAYRMEKRFADAVKGLNV-FGCKVIYPDALVCLKKT 274 (276) Q Consensus 215 ~~~~~~~~~~~~~~-------a~~~~~~~~~~e~~~~~~~~~~~i~~~~~-yg~~v~~~~~vv~~~~~ 274 (276) .. ++..+..+.+. -+.++.++.....+ ++.....+.+..+ .|+.+.||-+++.+.== T Consensus 272 ~a-~G~~~~l~~~~~~~~~t~~~~~p~~~~~l~vq--~~~~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 336 (336) T protein:vir:10 272 TA-SGRLVQLWAPRVEGKDTATCGFTEKMRAHSIE--RYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) T ss_pred cC-CCceEEEEEEecCCCcceeeecchhhhcccee--ecCceeEeccccceeeeeeeccchheeeecC Confidence 33 23333333221 12333333332232 2333445566655 45666679888886533 No 209 >protein:vir:78148 Length: 123 # NCBI annotation: hypothetical protein # Family: family:all:4955 # MgeID: mge:1847 # MgeName: Min1 # Cross-refs: genbank:acc:YP_001294802;genbank:gi:149882823;genbank:GeneID:5309176 Probab=78.38 E-value=0.078 Score=26.65 Aligned_cols=107 Identities=17% Similarity=0.071 Sum_probs=61.4 Q ss_pred EECHHHHHHHhhhHHhhhhcccccccceeeeee-eEEeceEEEEeccccccccceEEEEEec----------ceEEeee- Q lcl|NC_019506. 167 IIPPDVHGLLLAADLIVGTGGAMAESITKNGFV-GTILGFDVYLSNNMGSLTNGTGAIAGVK----------MACTFAE- 234 (276) Q Consensus 167 vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i-~~~~G~~v~~s~~lp~~~~~~~~~~~~~----------~a~~~~~- 234 (276) +++-.+++.++++.....+----......+|.+ -+.+|..|+.+.++|-.+ ..+.... .+=+|+. T Consensus 1 vvsdlqfA~~~g~~v~~~aLpRE~aNp~ltG~lpV~~~GltWl~tpnlpg~~---a~vlDst~lGgmaDE~l~~Pgya~~ 77 (123) T protein:vir:78 1 MLSGAQFAKLIGILVDDKALPREQANIVLTGSLPVSAYGLTWVTSRHITGTD---PWLFDVEQLGGMADEKLLSPEFAPA 77 (123) T ss_pred CcchhhHHHHhcchhcccccccccCCceEecCcceeeeceeeeecCCCCCCc---cceeehhhhccccccccCCCcccCC Confidence 555566777777654332211111233445655 579999999999998322 1111111 1112221 Q ss_pred --eeeeeeeccCcc--cceeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 235 --QIVQTEAYRMEK--RFADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 235 --~~~~~e~~~~~~--~~~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) .-.++..+|..+ .-+..++++-..=.-|++|.+.+.|+-+-- T Consensus 78 ~~~Gvevkt~Red~~~nD~yriRaRRvTvpiv~EP~Agv~ltg~g~ 123 (123) T protein:vir:78 78 GNTGVEASTERAHQGVKDGYLVRGRRNTVAVVTEPMAGVRLTGTGL 123 (123) T ss_pred CCcceeEEeeccccCCCCceEEeeeecceeEEecCccceEEeeecC Confidence 112444555544 335678888888888899999888875544 No 210 >protein:vir:107732 Length: 379 # NCBI annotation: gp23 # Family: family:all:1653 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024871;genbank:gi:48697513;genbank:GeneID:2948349 Probab=71.90 E-value=0.19 Score=24.54 Aligned_cols=259 Identities=9% Similarity=0.055 Sum_probs=123.4 Q ss_pred CccchhhHHH---HHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCccc-ceeecCCCCCCCccccccceEEE Q lcl|NC_019506. 1 MAVTSFIPKL---WSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGAIT-VKEYTENSDIDAPEELSTTEKVL 76 (276) Q Consensus 1 MA~~~l~~e~---~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~~~-~~d~~~~~~~~~~~~~~~~~~~~ 76 (276) ++|..+ |.. |-..+.+.+-.-.....++--+-.+ ...-.++.++.....+ +..|...+.... .+..-..... T Consensus 74 ~~~~g~-~~~l~~~~p~~i~~~tap~~a~~l~pv~t~g--~W~~~~~~~~v~e~~G~A~~ygd~~d~pl-~d~~~~~~~r 149 (379) T protein:vir:10 74 VSIPGL-IQFLQNWLPGHVRILTAVREADEFLGLSTVG--QWDDEQIVQRVLEGLGTAQPYTDGGNMAL-MSWTPTFETR 149 (379) T ss_pred ccccch-HHHHHhhcchHHHHHhhhhhhhhhcccccCC--CceeeeEEEeeeeeeeeeEEeccccCCCe-eeeeeeeeee Confidence 222211 222 2233444443333333333211011 1112577777765443 444543332211 2233334444 Q ss_pred EEEeeeecceeechHHHHhh---hhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccc---------c-------cc----- Q lcl|NC_019506. 77 EINKQKYFNFQIDDVDAAQI---RTPLMDAAMQRAAYALADETEKILLKEMDTNAT---------S-------KL----- 132 (276) Q Consensus 77 ~ld~~~~~~~~v~d~d~~~~---~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~---------~-------~~----- 132 (276) ++-.. ..++.+.+.|+..+ -.++..+....+.+++.+.+++..+-....... + .+ T Consensus 150 ~v~~~-~~g~~yg~~El~~Aa~~g~~l~~~Ka~aA~~ale~~~N~i~f~G~~d~~~~~yGllNdP~l~a~~t~atg~~~~ 228 (379) T protein:vir:10 150 TVVRF-EAGLQVAPLEEARSSRVQVSSADEKRAMVGEALEVQRNRVAFYGYNDGSGRTFGFLNDPNLPAYVAVPNGAGGS 228 (379) T ss_pred eeEEE-EEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCCCcceEEEEeCCCCcccccccCCcccc Confidence 44332 45677777776543 357777777777777777777754322110000 0 00 Q ss_pred cccccCCHHHHHHHHHHHHHHHhhc--CC--CccCC-EEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEE Q lcl|NC_019506. 133 KPAATLDKTNIYEELIKVKVKLDEK--NV--PTIGR-FLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDV 207 (276) Q Consensus 133 ~~~~~~t~~~~~~~i~~a~~~l~~~--~v--P~~~r-~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v 207 (276) +.....|.+.++++|..+...+... ++ |.+-+ .+++.|..+..|..-. .++ ..+.+-.--++-+++| T Consensus 229 t~Wa~kT~~eI~~Di~~~~~~l~~qs~g~~~~~~~~~tL~LP~~~~~~L~~~n------~~g--~Tvl~~lk~n~Pnl~i 300 (379) T protein:vir:10 229 PLWAQKTTLEIIADLRNGLTALQVQSMGRIKSNKTPITIGIPNAYENYITTPT------ELG--YSVAQYMRESYPNVTF 300 (379) T ss_pred cccccCCHHHHHHHHHHHHHHHHHhhCCeecccccceeEEecHHHHHhhcccc------ccC--ccHHHHHHHhcCCcEE Confidence 1122346777888888877766533 22 43323 6899999999885421 111 1111111112456777 Q ss_pred EEeccccccccc-eEEEEEecceE------------EeeeeeeeeeeccCcccceeeEEeeee-eeeEEEcCCeEEEEEe Q lcl|NC_019506. 208 YLSNNMGSLTNG-TGAIAGVKMAC------------TFAEQIVQTEAYRMEKRFADAVKGLNV-FGCKVIYPDALVCLKK 273 (276) Q Consensus 208 ~~s~~lp~~~~~-~~~~~~~~~a~------------~~~~~~~~~e~~~~~~~~~~~i~~~~~-yg~~v~~~~~vv~~~~ 273 (276) +..+.+-..+++ ..++.+.++.. .++.++.....+ ++..+..+.+..+ .|+.+.+|-+++.+.= T Consensus 301 ~t~pEL~~aggg~~~~~~~~~~~~~~~t~~~~~~~~~~p~k~~~l~ve--~~~~~~~~~~~~rt~Gv~ir~P~Ai~~~~G 378 (379) T protein:vir:10 301 VSAPELNDANGGSSAIYYYADAVENNGTDDGRTWLQVVPTKMFTLGVE--KKIKGYAEGYTNATAGAMLKRPFATYRQTG 378 (379) T ss_pred EEcccccccCCCccEEEEEeeccCCCccCCcceEEEecchhhhhccce--ecCceeEeccccceeeeeeecchhhheecC Confidence 777767543332 23444444322 222222222222 2333445555555 4566667999988876 Q ss_pred c Q lcl|NC_019506. 274 T 274 (276) Q Consensus 274 ~ 274 (276) + T Consensus 379 ~ 379 (379) T protein:vir:10 379 A 379 (379) T ss_pred C Confidence 6 No 211 >protein:vir:4786 Length: 295 # NCBI annotation: hypothetical protein # Family: family:all:3269 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150166;swissprot:trembl:q94m45;genbank:gi:15088777;uniprot:Q94M45;genbank:GeneID:955980 Probab=70.14 E-value=0.21 Score=24.26 Aligned_cols=245 Identities=12% Similarity=0.082 Sum_probs=116.5 Q ss_pred Cc-cch----hhHHHHHHHHHHHHHHhhcchhhhccccccccc-cCCcEE------EEeccCcccceeecC--CC-CCCC Q lcl|NC_019506. 1 MA-VTS----FIPKLWSARLLAHLDKAHVVANLVNRDYEGEIK-AYGDTV------KINQIGAITVKEYTE--NS-DIDA 65 (276) Q Consensus 1 MA-~~~----l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~-~~Gdtv------~ip~~~~~~~~d~~~--~~-~~~~ 65 (276) |+ |+. .+.++|...+.+.|.+..+|.+.+-- .+..-+ ...+|. .+|+. +..|.. +. ++.. T Consensus 1 mp~N~n~avr~Y~Kqf~glL~~vf~~qa~F~~~FGg-lQalDGV~~N~tafsvKt~D~pVV----ig~Y~TdeNvagFGt 75 (295) T protein:vir:47 1 MPSNQNNAVRRYEKQYAGILETVFGVRAAFSNALAP-IQILDGVQENSKAFSVKTNNTPVV----IGEYKTGENDGGFGD 75 (295) T ss_pred CCCCCCccchhhhHHHHHHHHHHHhHHHHHhhhhcc-hhhhhCCCccceEEEEeecCcceE----eecccCCCccccccc Confidence 87 332 46788999999999999988886532 221111 112221 12221 222321 11 1111 Q ss_pred c-cc-cccceE-EE-EEEee-e-ecceeec-hHHHHhhhh---hHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccc Q lcl|NC_019506. 66 P-EE-LSTTEK-VL-EINKQ-K-YFNFQID-DVDAAQIRT---PLMDAAMQRAAYALADETEKILLKEMDTNATSKLKPA 135 (276) Q Consensus 66 ~-~~-~~~~~~-~~-~ld~~-~-~~~~~v~-d~d~~~~~~---d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~~~~~ 135 (276) . .+ .-.+.. ++ -.|.. . .+++.|. -.|...-+. ..+++.++.++.+-++.+|..+-..+...+. ....- T Consensus 76 GTg~SsRFG~rkEi~y~dtdV~Y~~~~~iHEGiD~~TVNnd~~aaVAdRL~LQA~Akt~~~n~~~Gk~ls~~A~-~te~~ 154 (295) T protein:vir:47 76 NSGAQSRFGGVTEVKYENTDVNYDYTLTIHEGLDRYTVNNDLNAAVADRLKLQSEAQTRTVNKRIGKYLSDTAT-KTEAL 154 (295) T ss_pred CCccccccCceeeEEeecccccccccchhhhccccccccCChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh-hhhhh Confidence 0 00 000111 11 11110 0 1122221 234443333 3455667778888888888766555544332 33444 Q ss_pred ccCCHHHHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEeccccc Q lcl|NC_019506. 136 ATLDKTNIYEELIKVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSNNMGS 215 (276) Q Consensus 136 ~~~t~~~~~~~i~~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp~ 215 (276) +..+.+++.+.|..+.+.+-...|- ...-..++|+.|..|...+..+.+..... . +-+--|.++-||.+.+ +|. T Consensus 155 td~t~d~V~~LF~~as~~yvn~ev~-~~~~AyV~~evYnaiiD~~l~TsaK~Ssa-N-iDengi~~FkGf~i~e---~P~ 228 (295) T protein:vir:47 155 ADFTDDKVKALFNKLSAFYTNNEVT-APITVYLRSEFYNAIVDMASVTSAKGATI-S-LDENGLPKYKGFTLEE---TPA 228 (295) T ss_pred hcccchhHHHHHHHHHHHhhhhhee-eeeEEEEchhHHHHHhcccccccccccee-e-eccCCcceecceEEEe---ccH Confidence 5566777778888899999888883 33348999999999998875554433221 2 3333456899999988 443 Q ss_pred cc--cceEEEEEecceEEee---e---eeeeeeeccCc-----------------ccceeeEEeeeee Q lcl|NC_019506. 216 LT--NGTGAIAGVKMACTFA---E---QIVQTEAYRME-----------------KRFADAVKGLNVF 258 (276) Q Consensus 216 ~~--~~~~~~~~~~~a~~~~---~---~~~~~e~~~~~-----------------~~~~~~i~~~~~y 258 (276) .- .+.. ..+.+..++-+ . |.-+-|..... +.|..+-.-+++- T Consensus 229 ~~~q~G~~-aifs~dnig~aftGIn~aR~IesEdF~GValQ~~~~~~~~~~~~~~~~~~~~~~~~~~~ 295 (295) T protein:vir:47 229 QYFETGVI-AIFSPNGIIIPFVGISTARVIEAENFDGVNCKLLLRVVLTLLMTIRKQFTKLQELLYRR 295 (295) T ss_pred hhccCCcE-EEEccccceeecccceeeeeeecccccchHHHHHHHHHHHHHHHHHHHHHHHHHHhhcC Confidence 21 1222 22222222221 1 11111111110 0000000000000 No 212 >protein:vir:99576 Length: 388 # NCBI annotation: hypothetical protein # Family: family:all:1653 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039801;genbank:gi:126011051;genbank:GeneID:4818271 Probab=69.71 E-value=0.22 Score=24.20 Aligned_cols=260 Identities=12% Similarity=0.045 Sum_probs=119.7 Q ss_pred CccchhhHHHHHH----HHHHHHHHhhcchhhhccccccccccCCcEEEEeccCccc-ceeecCCCCCCCccccccceEE Q lcl|NC_019506. 1 MAVTSFIPKLWSA----RLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGAIT-VKEYTENSDIDAPEELSTTEKV 75 (276) Q Consensus 1 MA~~~l~~e~~~~----~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~~~-~~d~~~~~~~~~~~~~~~~~~~ 75 (276) |..++=+|-.+.. .+.+.+..-.....++-.+-.++ ..-+++.++.....+ +..|...++... .+..-+... T Consensus 76 t~~~~gip~~~~~~~~p~~~~~~~~p~~~~~l~pv~t~g~--W~~~~~~f~v~e~~G~A~~ygd~~D~Pl-~d~~~~~~~ 152 (388) T protein:vir:99 76 TQASIPTPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGS--WEDQEIVQGIVEPAGTAMEYGDLTNIPL-SSWNVNFER 152 (388) T ss_pred ccCcccHHHHHhhhhccceeeeeechhhhhhhccccccCC--ccceeEEEeeeecceeEEEeecccCCCc-eeccceeee Confidence 2222223433333 33333333333333332111111 112578888764432 444443333211 223334444 Q ss_pred EEEEeeeecceeechHHHHhh---hhhHHHHHHHHHHHHHHHHHHHHHHHH------------hhc-ccc--------cc Q lcl|NC_019506. 76 LEINKQKYFNFQIDDVDAAQI---RTPLMDAAMQRAAYALADETEKILLKE------------MDT-NAT--------SK 131 (276) Q Consensus 76 ~~ld~~~~~~~~v~d~d~~~~---~~d~~~~~~~~~~~ala~~~d~~~~~~------------~~~-~~~--------~~ 131 (276) .++-.. ..++.+.+.|+..+ ..++..+-...+.++|.+..++..+-. +.. +.. +. T Consensus 153 r~v~~~-~~g~~yg~~El~~A~~~g~~l~~~Ka~AA~~ale~~~N~i~f~G~~g~~~~~~yGllNdP~l~a~v~at~~~~ 231 (388) T protein:vir:99 153 RTIVRG-EMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLPAIASTTPGG 231 (388) T ss_pred eeEEEE-EeeeeecHHHHHHHHhhCCCcHHHHHHHHHHHHHhhhceEEEEeecCCCccceEEEeeCCCcccccccccCCc Confidence 554332 45577787776543 457777777888888888877754411 110 000 00 Q ss_pred ccccccCCHHHHHHHHHHHHHHHhhcC--C--Ccc-CCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceE Q lcl|NC_019506. 132 LKPAATLDKTNIYEELIKVKVKLDEKN--V--PTI-GRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFD 206 (276) Q Consensus 132 ~~~~~~~t~~~~~~~i~~a~~~l~~~~--v--P~~-~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~ 206 (276) .......|.+.++++|..+...+...- + |.. ...|++.|..+..|-.- + .++ . .+.+-.--++-+++ T Consensus 232 ~~~Wa~kT~~eI~~Di~~~~~~i~~qs~g~~~~~~~~~tL~LP~~~~~~Ls~~----n--~~g-~-Tvl~~lk~n~Pnl~ 303 (388) T protein:vir:99 232 WVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVV----T--DLG-I-SVRDWLKQTYPRVR 303 (388) T ss_pred CcccccCCHHHHHHHHHHHHHHHHHhcCCeeeecccceEEEechHHHHhcccc----C--cCC-c-cHHHHHHHhcCCcE Confidence 111223477888999988888875442 2 221 22688999999888432 1 111 1 11111111244666 Q ss_pred EEEecccccc---ccceEEEEEecceE---------------EeeeeeeeeeeccCcccceeeEEeeee-eeeEEEcCCe Q lcl|NC_019506. 207 VYLSNNMGSL---TNGTGAIAGVKMAC---------------TFAEQIVQTEAYRMEKRFADAVKGLNV-FGCKVIYPDA 267 (276) Q Consensus 207 v~~s~~lp~~---~~~~~~~~~~~~a~---------------~~~~~~~~~e~~~~~~~~~~~i~~~~~-yg~~v~~~~~ 267 (276) |+....+-.. .++.....+++.-- .++.++.....++ +.....+.+..+ .|+.+.+|.+ T Consensus 304 i~t~pEl~~a~~tgg~~~~~~~~~~~~~~~~~~~~~~~t~~~~~p~~~~~l~vq~--~~~~~~~~~~~rt~Gv~ir~P~A 381 (388) T protein:vir:99 304 VMSAPELQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKFVTLGVEK--RVKNYVEAYSNATAGVMLKRPWA 381 (388) T ss_pred EEEecccccccccCCceeEEEEecccccccccCccCcceeEEeccccccccccee--cCceeEeccccceeeeEEeccch Confidence 6665444321 12222333333210 1122222222222 222344455544 4566667988 Q ss_pred EEEEEec Q lcl|NC_019506. 268 LVCLKKT 274 (276) Q Consensus 268 vv~~~~~ 274 (276) ++.+.== T Consensus 382 i~~~~GI 388 (388) T protein:vir:99 382 VVRLIGL 388 (388) T ss_pred hheeccC Confidence 8876533 No 213 >protein:vir:8843 Length: 317 # NCBI annotation: major head protein # Family: family:all:3919 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775251;genbank:gi:27476049;genbank:GeneID:2700597 Probab=67.75 E-value=0.25 Score=23.91 Aligned_cols=257 Identities=7% Similarity=-0.037 Sum_probs=124.8 Q ss_pred Cccch--h-------hHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCcccc-----eeecCCCCCCCc Q lcl|NC_019506. 1 MAVTS--F-------IPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGAITV-----KEYTENSDIDAP 66 (276) Q Consensus 1 MA~~~--l-------~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~~~~-----~d~~~~~~~~~~ 66 (276) ||+.. + .-|=++.+|-..=.....|.+++... ......+ .|..-.. ....+|.+... T Consensus 1 ma~~~~~~~t~~~~g~~~dl~~~I~~isp~dTPf~S~i~~~-----~a~~~~~---~W~~d~l~~~~~~~~~EG~da~~- 71 (317) T protein:vir:88 1 MATPTNAVSTVEINGKREDLIDIIYNIAPYDTPFMSAIGKG-----VATAITH---EWQTDELRQPGKNTRVEGEDATI- 71 (317) T ss_pred CCccccceEeeeeeeeeechhhhheecCCccCcceeeecCc-----eecccEE---EEEeeecCCccccccccCccccc- Confidence 88632 1 12334554444445666666665431 0111122 2221111 11223443322 Q ss_pred cccccceEEEEEEeeeecceeechHHHHhhh---hhHHHHHHHHHHHHHHHHHHHHHHHHhhccccc------------- Q lcl|NC_019506. 67 EELSTTEKVLEINKQKYFNFQIDDVDAAQIR---TPLMDAAMQRAAYALADETEKILLKEMDTNATS------------- 130 (276) Q Consensus 67 ~~~~~~~~~~~ld~~~~~~~~v~d~d~~~~~---~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~------------- 130 (276) .........-...|--...+.|+.-...... .|..+......+..|++.++..++..-++.... T Consensus 72 ~~~~~r~~~~N~tQIf~k~v~VSgTa~av~~~G~~~ela~q~~kk~~EikrdmE~~li~g~~a~~~~~~t~~r~~~Gl~~ 151 (317) T protein:vir:88 72 KAGSFTTMLNNYCQISDETLQVTGTADRVKKAGRKNELAYQLAKKSKELKLDMEYALVGAPQAKVQRNTTTPGQMANIFA 151 (317) T ss_pred ccccCCEEeccEEEEEEeEEEEeehhhhhhhcCccchhHHHHHHHHHHHHHHHHHHHhcCeeeccCCCCccchhhhhHHH Confidence 2223344444455556677888877665433 355666677778888999999887654321000 Q ss_pred -------cccc-----------cccCCHH-HHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhccc--- Q lcl|NC_019506. 131 -------KLKP-----------AATLDKT-NIYEELIKVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGA--- 188 (276) Q Consensus 131 -------~~~~-----------~~~~t~~-~~~~~i~~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~--- 188 (276) .... .+..+.. -..+.|.++.+++=+++.. ...++++|.....|-+--. .+.... T Consensus 152 ~i~t~~~~~~~g~~~~~~~~~~~t~~t~~~lte~~l~~~l~~i~~~Gg~--~~~i~v~a~~k~~i~~~~~-~~~~~i~~~ 228 (317) T protein:vir:88 152 YYKTNGSLGANGVAPVGDGSNTGTAGDLRLLTEDMLLNASESIWRNGGQ--ANSIQTSSSIKKAISKNMK-GRATEITLD 228 (317) T ss_pred HhccCceeccCccccccCCCccccccccccccHHHHHHHHHHHHhcCCC--CCEEEeChHHHHHHHHHhc-CCceeEEEc Confidence 0000 0011111 1245678888888888763 3357899988887733211 011110 Q ss_pred cccc---ceeeeeeeEEeceEEEEeccccccccceEEEEEecceEEeee-eeeeeeeccCcccceeeEEee--eeeeeEE Q lcl|NC_019506. 189 MAES---ITKNGFVGTILGFDVYLSNNMGSLTNGTGAIAGVKMACTFAE-QIVQTEAYRMEKRFADAVKGL--NVFGCKV 262 (276) Q Consensus 189 ~~~~---~~~~G~i~~~~G~~v~~s~~lp~~~~~~~~~~~~~~a~~~~~-~~~~~e~~~~~~~~~~~i~~~--~~yg~~v 262 (276) .... ...+-.+..+-=++++.+..+|. ...+.+-++.+.++. +-...|.. -.-||..++. .-||..+ T Consensus 229 ~~~~~~g~~v~~~~tdfG~v~ii~~r~lp~----~~~~~~D~~~~~l~~Lr~~~~e~l---aKtGd~~k~~i~~E~tLe~ 301 (317) T protein:vir:88 229 ASDNRIAQTVDVYESDFGKYTIRANRWFHE----NTLFVFDPKMHSLCYLRPFFQHEL---AKTGDSEKRQLLVEYTFRV 301 (317) T ss_pred ccCeEEEEEEEEEEeCCeEEEEEeCCCCCC----CeEEEEcccccceeecccceeecc---CCCcccceeEEEEEEEEEE Confidence 1100 01111122222256777777763 334444444443331 11111211 1123444444 3489999 Q ss_pred EcCCeEEEEE-ecCC Q lcl|NC_019506. 263 IYPDALVCLK-KTNP 276 (276) Q Consensus 263 ~~~~~vv~~~-~~~p 276 (276) ..|.+.+++. .++| T Consensus 302 ~N~~a~a~i~~l~~~ 316 (317) T protein:vir:88 302 NNEKSGALIRDVVAQ 316 (317) T ss_pred cCccceeEEEEeccc Confidence 9999999988 6677 No 214 >protein:vir:106734 Length: 336 # NCBI annotation: gp13 # Family: family:all:1653 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944321;genbank:gi:38638620;genbank:GeneID:2657363 Probab=67.59 E-value=0.25 Score=23.89 Aligned_cols=258 Identities=13% Similarity=0.057 Sum_probs=119.4 Q ss_pred CccchhhHHHHHHHHHHHHH----HhhcchhhhccccccccccCCcEEEEeccCccc-ceeecCCCCCCCccccccceEE Q lcl|NC_019506. 1 MAVTSFIPKLWSARLLAHLD----KAHVVANLVNRDYEGEIKAYGDTVKINQIGAIT-VKEYTENSDIDAPEELSTTEKV 75 (276) Q Consensus 1 MA~~~l~~e~~~~~~~~~l~----~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~~~-~~d~~~~~~~~~~~~~~~~~~~ 75 (276) +++. -+|..++.-+.-.+. .......++--+-.+ ...-+++.++.....+ +..|...+...- .+..-.... T Consensus 45 ~~~~-g~~~~l~~~i~p~~~~~~~~~~~~~~l~~v~t~g--~w~~~~~~~~~~e~~G~a~~ygd~~d~P~-~d~~~~~~~ 120 (336) T protein:vir:10 45 TGSS-GIPNYLTTYVDPSVIDILVAPMKAAELVGESKKG--DWTTLVAAFITAEPTTKVATYGDYSSDGD-SGTNINYPQ 120 (336) T ss_pred CCCc-chHHHHHhhcCcceeeeeechhchhhhcccccCC--CcceeeEEEEeeeeeeeEEEccccCCCcc-eeeeeeeee Confidence 1221 124444433322221 111222222110000 0112567777654432 334433222221 233444555 Q ss_pred EEEEeeeecceeechHHHHhh---hhhHHHHHHHHHHHHHHHHHHHHHH-H--------Hhhccccc-cccc----cccC Q lcl|NC_019506. 76 LEINKQKYFNFQIDDVDAAQI---RTPLMDAAMQRAAYALADETEKILL-K--------EMDTNATS-KLKP----AATL 138 (276) Q Consensus 76 ~~ld~~~~~~~~v~d~d~~~~---~~d~~~~~~~~~~~ala~~~d~~~~-~--------~~~~~~~~-~~~~----~~~~ 138 (276) .++-.. ..++.++..|...+ -.++..+....+.+++.+++++..+ + .+..-... ..++ .... T Consensus 121 ~~v~~~-~~g~~yg~~El~~A~~~g~~l~~~Ka~aA~~ale~~~N~~~~~Gd~~~~~~GllN~P~l~a~~t~~~~~w~~~ 199 (336) T protein:vir:10 121 RQSYFF-QTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSP 199 (336) T ss_pred eeEEEE-EEEEeeCHHHHHHHHHhCCCcHHHHHHHHHHHHHHhhCeEEEEeecccceEEEeecCCCCcccccCcCccccc Confidence 565443 55678887776543 3466667777777777777775322 1 11100000 0111 1235 Q ss_pred CHHHHHHHHHHHHHHHhhcC---C-CccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEecccc Q lcl|NC_019506. 139 DKTNIYEELIKVKVKLDEKN---V-PTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSNNMG 214 (276) Q Consensus 139 t~~~~~~~i~~a~~~l~~~~---v-P~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~~lp 214 (276) |.+.++++|..+...+...- + +...-.++++|..+..|..- +.....--+-+.+ ++-+++|+..+.+. T Consensus 200 T~~eI~~Di~~~~~~l~~qt~g~i~~~~~~tL~Lp~~~~~~L~~~----n~~g~tv~~~lk~----n~Pnl~i~t~pel~ 271 (336) T protein:vir:10 200 AVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSKT----NQYGLSAAAKLKE----IFPKLEFVTIPEYD 271 (336) T ss_pred CHHHHHHHHHHHHHHHHHhcCCeeeeccceEEEechHHHHhccCC----CccCccHHHHHHH----hCCccEEEEccccc Confidence 67888999998888885543 1 12223689999999988432 1110000011211 23456777766664 Q ss_pred ccccceEEEEEecc-------eEEeeeeeeeeeeccCcccceeeEEeeeee-eeEEEcCCeEEEEEec Q lcl|NC_019506. 215 SLTNGTGAIAGVKM-------ACTFAEQIVQTEAYRMEKRFADAVKGLNVF-GCKVIYPDALVCLKKT 274 (276) Q Consensus 215 ~~~~~~~~~~~~~~-------a~~~~~~~~~~e~~~~~~~~~~~i~~~~~y-g~~v~~~~~vv~~~~~ 274 (276) ... +.....+.+. .+.++.++.....+. +..+..+.+..+. |+.+.||-+++.+.== T Consensus 272 ~Ag-g~~~~~~~~~~~~~~t~~~~~P~~f~~lpvq~--~~~~~~v~~~~rt~Gv~i~rP~ai~~~~GI 336 (336) T protein:vir:10 272 TAS-GRLVQLWAPRVEGKDTATCGFTEKMRAHSIER--YSSYFRQKKSAGTWGAVIFRPFAVAQMLGV 336 (336) T ss_pred ccC-CceEEEEEecccCCcceeeecChhhhccceee--cCceeEeccccceeeeeeeccchheeeccC Confidence 433 3333333322 123333443333332 2334455666554 5556678888876533 No 215 >protein:vir:5942 Length: 523 # NCBI annotation: similar to major head protein # Family: family:all:364 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835728;genbank:gi:30044131 Probab=58.33 E-value=0.42 Score=22.68 Aligned_cols=268 Identities=13% Similarity=0.091 Sum_probs=110.5 Q ss_pred CccchhhHHH-----------------------------HHHHHHHHHHHhhcc-hhhhccccccccccCCcEE------ Q lcl|NC_019506. 1 MAVTSFIPKL-----------------------------WSARLLAHLDKAHVV-ANLVNRDYEGEIKAYGDTV------ 44 (276) Q Consensus 1 MA~~~l~~e~-----------------------------~~~~~~~~l~~~~v~-~~~~~~~~~~~~~~~Gdtv------ 44 (276) |+....-.++ +.-. ...+...... ...+..++.........++ T Consensus 162 ~s~si~k~~vTa~s~agta~~~li~A~~~q~itg~tga~fa~s-~~~an~astAss~Al~gEA~t~~sTd~at~~~Gtt~ 240 (523) T protein:vir:59 162 SSGAVYYVDVPVASLPGVADVNTVRFWQYDDASGDPENTVAYP-LPRYNRIVGAVGSALYARLFFVTGSDFATVAGGTPS 240 (523) T ss_pred cccceeeeeccccccccccccccccccccccccccccccccch-hhccccccccccccccccccccccccccccCCCccc Confidence 2222111100 0000 0000000000 0000000000000000000 Q ss_pred --------EEeccCcccceeecCCCCCCCccccccceEEEEEEeeeecc--eee---chHHHH---hhh---hhHHHHHH Q lcl|NC_019506. 45 --------KINQIGAITVKEYTENSDIDAPEELSTTEKVLEINKQKYFN--FQI---DDVDAA---QIR---TPLMDAAM 105 (276) Q Consensus 45 --------~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~ld~~~~~~--~~v---~d~d~~---~~~---~d~~~~~~ 105 (276) .+..-.......-..........+....+..++|+++.--+ -.+ ...|+. .++ -|.-.++. T Consensus 241 t~~~~~lyt~~~g~~t~~~~~~~~~~~~~~~~~~~~eM~FsIeK~tVtAkSRaLKAeYT~ELAQDLKAiH~GLDAE~ELa 320 (523) T protein:vir:59 241 TQDLDLVYYIDARNDFEDQSTDPDYPDPGFQSLDIPEINLELRSRPVATKTRKLRAAWTPEAMQDLAAYHKGVDLENEIV 320 (523) T ss_pred ccccccccccccccchhhccccccccccccccccccceeeEEEeEEEeeecccccccccHHHHHHHHHHhcCCChhHHHH Confidence 00000000000000000000112233456677776652211 111 122332 231 24556666 Q ss_pred HHHHHHHHHHHHHHHHHHhhcccccc-----cccc-----ccCCHH--------HHHHHHHHHHHHHhhcC--C----Cc Q lcl|NC_019506. 106 QRAAYALADETEKILLKEMDTNATSK-----LKPA-----ATLDKT--------NIYEELIKVKVKLDEKN--V----PT 161 (276) Q Consensus 106 ~~~~~ala~~~d~~~~~~~~~~~~~~-----~~~~-----~~~t~~--------~~~~~i~~a~~~l~~~~--v----P~ 161 (276) .=+...|..+|+++++..+...+..- ...+ ...++. ...+++.....++++.. + -. T Consensus 321 nILStEImlEINR~ii~~~~~~a~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~n~i~~~t~~ 400 (523) T protein:vir:59 321 TLMSQYIAREIDLEILSTIMAHARRTDNYGFWSEVVGEYYDETSGNFVAGNFYGSKQEWLATLMIELNKVSNRIQQKTAV 400 (523) T ss_pred HHHHHHHHHHhhHHHHHhHhhhheeeeeccccccceeeecccccchhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHhccc Confidence 66677788899998888776543210 0011 011110 11234443333332211 1 11 Q ss_pred -cCCEEEECHHHHHHHhhhHHhhhhcc--cccccceeeeeeeEE-eceEEEEeccccccccceEEEEEecce-------E Q lcl|NC_019506. 162 -IGRFLIIPPDVHGLLLAADLIVGTGG--AMAESITKNGFVGTI-LGFDVYLSNNMGSLTNGTGAIAGVKMA-------C 230 (276) Q Consensus 162 -~~r~~vv~p~~~~~L~~~~~~~~~~~--~~~~~~~~~G~i~~~-~G~~v~~s~~lp~~~~~~~~~~~~~~a-------~ 230 (276) .+-++|++|++.+.|-.++.+..... ....+.+. .|.+ .|+.||..++-|. .....+.++. + T Consensus 401 ~~~~~~~~s~~v~~~l~~~~~~~~~~~~~~~~~~~~~---~g~l~~~~~vy~d~~~~~----dy~~~g~k~~~~~~~~~~ 473 (523) T protein:vir:59 401 AGANFLVTSPQVAALLESMPGFTPGNDNRDGGTGIFY---VGMVQGRYRLYKNIYQNQ----PVIIMGNQDLNTPWQTGA 473 (523) T ss_pred ccccEEEEchhHHHHHHhccccccCCcccccccccee---EEEecCceEEEecCCCCc----ceEEEEecccCCcccccc Confidence 24589999999999877765543221 11112222 3443 4578998877553 2222333322 1 Q ss_pred Eeee--eeeeeeeccCcccceeeEEeeeeeeeEEEcCCeEEEEEec--CC Q lcl|NC_019506. 231 TFAE--QIVQTEAYRMEKRFADAVKGLNVFGCKVIYPDALVCLKKT--NP 276 (276) Q Consensus 231 ~~~~--~~~~~e~~~~~~~~~~~i~~~~~yg~~v~~~~~vv~~~~~--~p 276 (276) -|+- .........+|+.|.-.+-.+.|||..|.+|-.++.+.+. -| T Consensus 474 ~y~Py~~l~~~~~~~dp~s~qp~~~~~tRY~l~v~nP~~~~~~~~~~~~~ 523 (523) T protein:vir:59 474 VYAPYVPLLFTPTIVDPVNFSYRRGLMTRYALEVVRPEFYGLLYVKLLQP 523 (523) T ss_pred eecccchhhcccccccCCcccceeeeeeehhheecchhHhhhhhhhhcCC Confidence 1221 1111233458899999999999999999999988886543 45 No 216 >protein:vir:5255 Length: 304 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852760;genbank:gi:31544035;uniprot:Q7Y5U0;genbank:GeneID:2753552 Probab=57.40 E-value=0.44 Score=22.56 Aligned_cols=267 Identities=14% Similarity=0.014 Sum_probs=116.5 Q ss_pred CccchhhHHH---HHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCccc-ceee--c-CCCCCCCccccccce Q lcl|NC_019506. 1 MAVTSFIPKL---WSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGAIT-VKEY--T-ENSDIDAPEELSTTE 73 (276) Q Consensus 1 MA~~~l~~e~---~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~~~-~~d~--~-~~~~~~~~~~~~~~~ 73 (276) |+-=.|.... +..++.+.-.+.+.+..++..+.+.. ..-.++..+.....+ ++++ . ..+++.. -+...++ T Consensus 1 ~~~lafl~~qL~~id~~vye~~~~~~~~~~lipv~t~~~--~~~~~~~~~~~d~~G~a~~~~i~~~a~dip~-vd~~~~~ 77 (304) T protein:vir:52 1 MSLLAYVKNGLTAVSKDIAETKYPEIVFPQFVYVDQQTA--VGITEKLHYGADEHGSLDDGLITVGTSTLDQ-VEVGFTP 77 (304) T ss_pred CchHHHHHHHHHHHhhhhhccccccchhhhhccccCCCC--cccceEEEeeeeccCcccccccCCcCCccce-eecccce Confidence 7753333333 33333333335555666664332221 122477777775544 4433 2 2233222 2455556 Q ss_pred EEEEEEeeeecceeechHHHHhhh---hhHHHHHHHHHHHHHHHHHHHHHHHHhhc--cc------ccc----------c Q lcl|NC_019506. 74 KVLEINKQKYFNFQIDDVDAAQIR---TPLMDAAMQRAAYALADETEKILLKEMDT--NA------TSK----------L 132 (276) Q Consensus 74 ~~~~ld~~~~~~~~v~d~d~~~~~---~d~~~~~~~~~~~ala~~~d~~~~~~~~~--~~------~~~----------~ 132 (276) ....|..+ +.++.++-.|+..+. .++-.+..+.+.+++.+.+|+..+-.-.. +. .+. + T Consensus 78 ~~~~i~~~-~~~~~y~~~El~~a~~~g~~l~~~ka~aa~~a~~~~~n~v~~~Gd~~~~g~~GllN~p~v~~~~~~~~~a~ 156 (304) T protein:vir:52 78 TRSYIVPW-AKSVTWTKPELEQGKLLGLALNTAKIMALNKNAQQTLQKVAFLGHAKDSRLTGLLNNKSVEVYAIKGAAQN 156 (304) T ss_pred eEEEEEEE-eeeeeecHHHHHHHHHhCCCcHHHHHHHHHHHHHhhhceEEEEeeccccceEEEEeCCCcceeeecCCccC Confidence 66666443 566777655544322 24444455555556666666543311110 00 000 1 Q ss_pred cccccCCHHHHHHHHHHHHHHHhhcCCC-ccCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEec Q lcl|NC_019506. 133 KPAATLDKTNIYEELIKVKVKLDEKNVP-TIGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLSN 211 (276) Q Consensus 133 ~~~~~~t~~~~~~~i~~a~~~l~~~~vP-~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s~ 211 (276) ....+.|++.+++.|.++...+....-- ...-.++++|..+..|..- ...+.+....+=..++..-.+-.+++|..-. T Consensus 157 ~~w~~~T~~eI~~di~~~~~~i~~~s~~~~~p~tl~Lpp~~~~~l~~~-~~~~~~~Tvl~~l~~n~~~~~g~~l~I~~v~ 235 (304) T protein:vir:52 157 TKVQAMDFDKAVAFFKEIFLKGMEKTKRIEAPNTFAIDSLDLAHLALV-QRANTDTTALEFLTKHLSAAAGRQVAIKALP 235 (304) T ss_pred CccccCCHHHHHHHHHHHHHHHHhccCceecCceEEeCHHHHHHHhhc-cCCCCCchHHHHHHHhcccccCCcceEEEec Confidence 1122347778888888887777544210 1112599999999988531 1111110000000111111111233443321 Q ss_pred -cccccc--cceEEEEEecce--EEe--eeeeeeeeeccCcccceeeEEeeeeee-eEEEcCCeEEEEEe Q lcl|NC_019506. 212 -NMGSLT--NGTGAIAGVKMA--CTF--AEQIVQTEAYRMEKRFADAVKGLNVFG-CKVIYPDALVCLKK 273 (276) Q Consensus 212 -~lp~~~--~~~~~~~~~~~a--~~~--~~~~~~~e~~~~~~~~~~~i~~~~~yg-~~v~~~~~vv~~~~ 273 (276) .+-..+ +....+++.++- +.+ +...+....+. .+.....+-+..++| +.+.+|++++-+.+ T Consensus 236 ~~~~~~g~~g~~r~vvY~~d~~~~~~~vP~p~~~l~~q~-~~~~~~~vp~~~r~gGv~v~~P~a~~y~D~ 304 (304) T protein:vir:52 236 SNYGTRVTDGKTRAMVYVNSKEHVIFDVPMSPTVLDAQP-KGLLAFESGLRMAFGGVTFMEPDSALYVDY 304 (304) T ss_pred ccccccCCCCceEEEEEecChhheEEecCccccccchhh-cCCceEEecceeeeeeEEEEccceeeeecC Confidence 121111 111223333221 111 11111111111 111223344555555 66668999999999 No 217 >protein:vir:106590 Length: 349 # NCBI annotation: putative major head protein # Family: family:all:1083 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958585;genbank:gi:41179245;genbank:GeneID:2717126 Probab=54.87 E-value=0.49 Score=22.27 Aligned_cols=267 Identities=9% Similarity=-0.003 Sum_probs=92.0 Q ss_pred CccchhhHH--HHHHHHHHHH--------HHhhcchhhhccccccccccCCcEEEEeccC--cccceeec-CCCCCCCcc Q lcl|NC_019506. 1 MAVTSFIPK--LWSARLLAHL--------DKAHVVANLVNRDYEGEIKAYGDTVKINQIG--AITVKEYT-ENSDIDAPE 67 (276) Q Consensus 1 MA~~~l~~e--~~~~~~~~~l--------~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~--~~~~~d~~-~~~~~~~~~ 67 (276) |.|+.+... .+...+++.| .+.....+.+...|-+.....+-++.+.+.. ...+..+. ++.+.. .. T Consensus 1 ~~~~~~~~~~~~~~~~~~d~~~~~~l~~~~~~~~~~~~l~~~~Fp~~~~~~~~~~~~~~~~~~~~~a~~v~~~~~~~-~~ 79 (349) T protein:vir:10 1 MKNQKLQLDLQRFATPILDMFSQNTVLDYTRNRQYPEMLGDTLFPAVKVPTLEVDILKAGSRVPTIASVSAFDAEAE-IG 79 (349) T ss_pred CCcchhhHHHHHHHHHhhcccCHHHHHHHHHhcCcchhhHhhcCCccccccceeEEEeeccCcceeeeeecCCCCcc-ee Confidence 888754322 2222222222 1111111111111111111222223222221 11122222 222211 12 Q ss_pred ccccceEEEEEEeeeecceeechHHHHh--h---------hhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccc------- Q lcl|NC_019506. 68 ELSTTEKVLEINKQKYFNFQIDDVDAAQ--I---------RTPLMDAAMQRAAYALADETEKILLKEMDTNAT------- 129 (276) Q Consensus 68 ~~~~~~~~~~ld~~~~~~~~v~d~d~~~--~---------~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~------- 129 (276) ..........+-.. .-...++..|... . ..+.+.+-...+...+.+.++--++.++..+.. T Consensus 80 ~r~~~~~~~~~p~i-k~~~~i~e~dl~~~~~~~~~~~~~~~~~~i~~d~~~l~~~i~~r~E~m~~q~l~~Gki~~~~~g~ 158 (349) T protein:vir:10 80 TREASKMTAELAYV-KRKMQITEEMLIKLQSPRNTAEENYLKQYVFDDIDAMVQAVKARGEKMTMEMFATGKITDKKNGI 158 (349) T ss_pred cccceeEEeecccc-ccccccCHHHHHHHhhccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCeeEEcCCcE Confidence 22222223332211 1224455544331 1 112222223344444555555445555543210 Q ss_pred ----------ccc-cccc--cCCHHHHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccccccce-- Q lcl|NC_019506. 130 ----------SKL-KPAA--TLDKTNIYEELIKVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAMAESIT-- 194 (276) Q Consensus 130 ----------~~~-~~~~--~~t~~~~~~~i~~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~-- 194 (276) ... ++++ .....+.++.|.+....+ +. ...+++++++++..|++++.+...-.....+.. T Consensus 159 ~vD~g~~~~~~~~lt~~~~Ws~~~adpi~Di~~~~~~~---g~--~p~~~vm~~~~~~~l~~~~~i~~~~~~~~~~~~~~ 233 (349) T protein:vir:10 159 AIDYGVPKKHQETLSGTKTWDKSDASIIDNLQDWSDSL---DV--TPTRALTSKKVLRILMRSTEIKEAIFGKDTGRVVG 233 (349) T ss_pred EEecccCccceeEecCcccCCCCCCCHHHHHHHHHHHh---CC--CccEEEeCHHHHHHHhcCHHHHHHhcccccccccC Confidence 000 1110 012344566666655443 33 224799999999999999887665332222111 Q ss_pred ---eeeeeeEEeceEEEEecccccc--cc----------ceEEEEEecceEE---eeee--eeeee-----ec------- Q lcl|NC_019506. 195 ---KNGFVGTILGFDVYLSNNMGSL--TN----------GTGAIAGVKMACT---FAEQ--IVQTE-----AY------- 242 (276) Q Consensus 195 ---~~G~i~~~~G~~v~~s~~lp~~--~~----------~~~~~~~~~~a~~---~~~~--~~~~e-----~~------- 242 (276) ..+.++.+.|++|+..+.--.. .. ...++.......| ++.- ..... .. T Consensus 234 ~~~~~~~l~~~~~~~i~~yd~~y~d~~~~~~~t~~~~~p~~~v~l~~~~~~G~~~yG~~~e~~~~~~g~~~~~~~~~~~~ 313 (349) T protein:vir:10 234 QADLDQWMTAQGLPIIRAYDGKYRDEDSRGNLTTNSYFPEDRIVLFNDEVPGQKIYGPTPEENRLISSNAQVSNVGNIMA 313 (349) T ss_pred HHHHHHHHHhcCCceEEEEeeEEEeecCCCceeecccccCCeEEEecCCCceeEEeeccchhhhhcccccceeeccceEE Confidence 1234556677777654321100 00 0111111111111 1110 00000 00 Q ss_pred ----cCcccceeeEEeeeeeeeEEEcCCeEEEEEec Q lcl|NC_019506. 243 ----RMEKRFADAVKGLNVFGCKVIYPDALVCLKKT 274 (276) Q Consensus 243 ----~~~~~~~~~i~~~~~yg~~v~~~~~vv~~~~~ 274 (276) ...+-....+.+..+-=-.+.+|++++++++= T Consensus 314 ~~~~~~~dP~~~~~~~~s~~lPv~~~~~~~~~a~Vl 349 (349) T protein:vir:10 314 KIYETSEDPIGTWILASATMLPSFASADDVFQAKVL 349 (349) T ss_pred EeeeecCCCceEEEEEeeeeeeeecCCCcEEEEEeC Confidence 00000011111111111112355555555544 No 218 >protein:vir:4902 Length: 348 # NCBI annotation: gp348 # Family: family:all:1083 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056680;genbank:gi:9635015;genbank:GeneID:1262657 Probab=52.28 E-value=0.56 Score=21.97 Aligned_cols=267 Identities=11% Similarity=0.026 Sum_probs=97.9 Q ss_pred Ccc--chhhHHHHHHHHHHHHHHh--hcchhhhccccccccccCC-cEEEEec-cCcccceee-cCCCCCCCccccccce Q lcl|NC_019506. 1 MAV--TSFIPKLWSARLLAHLDKA--HVVANLVNRDYEGEIKAYG-DTVKINQ-IGAITVKEY-TENSDIDAPEELSTTE 73 (276) Q Consensus 1 MA~--~~l~~e~~~~~~~~~l~~~--~v~~~~~~~~~~~~~~~~G-dtv~ip~-~~~~~~~d~-~~~~~~~~~~~~~~~~ 73 (276) ||+ .+|.+..+...+.+.-.+. .....++-. ....+ +.+.+.. -+...+..+ .++.+......-.-.. T Consensus 1 M~~l~d~f~~~~l~~~v~~~~~~~~~~l~~~~Fp~-----~~~~~~~~~~~~~~~~~~~~a~~v~~~~~~~~~~r~~~~~ 75 (348) T protein:vir:49 1 MGLIYDKVTASNIAGYFNALQENVDSTLGESIFPA-----RKQLGTKLSYITGASGQSVALKAAAFDTNVTVRDRVSAEM 75 (348) T ss_pred CcchhhhcCHHHHHHHHHhccccchhhhHhhcCCC-----ccccCceeEEEEeecCceeeeeeecCCCCcceecccceee Confidence 995 4677777777665432221 111112210 00111 1111111 011111111 1222211111111122 Q ss_pred EEEEEEeeeecceeechHHH--Hhhhh------------hHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc-------- Q lcl|NC_019506. 74 KVLEINKQKYFNFQIDDVDA--AQIRT------------PLMDAAMQRAAYALADETEKILLKEMDTNATSK-------- 131 (276) Q Consensus 74 ~~~~ld~~~~~~~~v~d~d~--~~~~~------------d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~-------- 131 (276) ..+.+-..+ -...++..|. .+... +.+.+-.+.+...+.+.++--+...+..+.... T Consensus 76 ~~~~~p~i~-~~~~i~~~d~~~l~~~~~~~~~~~~~~~~~~i~~d~~~l~~~i~~r~E~m~~qal~~Gki~i~~~g~~~~ 154 (348) T protein:vir:49 76 HDEQMPFFK-EAMLVKENDRQQLNLVKDSGNAALVNTIVAGIFNDNLTLVNGARARLEAMRMQVLATGKIAFTSDGVNKD 154 (348) T ss_pred eeeecCccc-cccccCHHHHHHHHHHhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCeEEEecCCceEE Confidence 333332221 1234444442 11111 111111233334444455544455554321100 Q ss_pred ----------cc---ccccCCHHHHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccc--cccceee Q lcl|NC_019506. 132 ----------LK---PAATLDKTNIYEELIKVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAM--AESITKN 196 (276) Q Consensus 132 ----------~~---~~~~~t~~~~~~~i~~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~--~~~~~~~ 196 (276) .+ ..+ .++.+++.+|.++...+.+.+.. ..+++++++.+..|++++.+.+.-... ....+.. T Consensus 155 vdyg~~~~~~~t~~~~W~-~~~adp~~di~~~~~~~~~~G~~--~~~ii~~~~~~~~l~~~~~v~~~~~~~~~~~~~i~~ 231 (348) T protein:vir:49 155 IDYGVKPDHKKQVSKSWA-EPGATPLADLEDAIETARELGLN--PERAVMNAKTFGLIRKAASTVKVIKPLAGDGSSVTK 231 (348) T ss_pred EeecCCcccceeeeeccC-CCCCCHHHHHHHHHHHHHhcCCc--ccEEEeCHHHHHHHhcCHHHHHHhhccCcccccccH Confidence 00 011 12345678888888888776652 347899999999999998877643211 1111111 Q ss_pred e----eeeEEeceEEEEeccccccccce--------EEEEEecceEE---eeeeeeeeee-------------------- Q lcl|NC_019506. 197 G----FVGTILGFDVYLSNNMGSLTNGT--------GAIAGVKMACT---FAEQIVQTEA-------------------- 241 (276) Q Consensus 197 G----~i~~~~G~~v~~s~~lp~~~~~~--------~~~~~~~~a~~---~~~~~~~~e~-------------------- 241 (276) . ..+.+.|++|+..+.--....|. ..++...+.+| ++.-....+. T Consensus 232 ~~~~~~~~~~~g~~i~~y~~~y~d~dG~~~~~~p~~~v~l~~~~~~G~~~yg~~~e~~~~~~~~~~~~~~~~~~~~~~~~ 311 (348) T protein:vir:49 232 AELDNYIADNFGVTVVLENGTYRNEKGEVSKFFPDGHLTLIPNGPLGNTVFGTTPEESDLFADNTVNADVEIVDNGIAVT 311 (348) T ss_pred HHHHHHHHhhcCceEEEEeeEEEecCCcEeeeecCCeEEEecCCCcceeEEecChhhhhhccccccccceeecCCeEEEe Confidence 1 23356677776533221111111 11222221111 1100000000 Q ss_pred -ccCcccceeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 242 -YRMEKRFADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 242 -~~~~~~~~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) +...+-.+..+.+-..-=-.+.+|+++.++++.+= T Consensus 312 ~~~~~dP~~~~~~~~s~~lPv~~~~~~~~~a~Vl~~ 347 (348) T protein:vir:49 312 TTKTTDPVNVQTKVSMVALPSFERLDDVYMLTVIPA 347 (348) T ss_pred eeecCCCceEEEEEeeeccccccCCCcEEEEEEecC Confidence 00000000111111110011125666666554433 No 219 >protein:vir:270 Length: 341 # NCBI annotation: putative major capsid protein # Family: family:all:201 # MgeID: mge:7 # MgeName: K139 # Cross-refs: genbank:acc:NP_536650;genbank:gi:17975128;genbank:GeneID:929084 Probab=50.93 E-value=0.6 Score=21.82 Aligned_cols=261 Identities=9% Similarity=-0.010 Sum_probs=101.5 Q ss_pred CccchhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCcccceeecCCCCCCCccccccceEEEEEEe Q lcl|NC_019506. 1 MAVTSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGAITVKEYTENSDIDAPEELSTTEKVLEINK 80 (276) Q Consensus 1 MA~~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~ld~ 80 (276) ++..+-+..-.+..+..++.+..-|++.+|--.-+ -..|..|.+-..|..+-+.-+ +.. +.++......+.+.+ T Consensus 29 ~~~~FsV~P~v~q~L~~~i~ess~FL~~Invv~V~--e~~Ge~v~lg~~g~iagrtdt--~R~--~r~~~l~~~~Y~c~q 102 (341) T protein:vir:27 29 VAELFNVSPQLETKLRAAITESAEFLKMITVTTVD--QIEGQVVDVGVSGLYTGRKAG--GRF--TKQVGVGGHKYKLAE 102 (341) T ss_pred ccceEeecHHHHHHHHHHHHhhHHhhhcCcccccc--ceeeeEeecccccceeeccCC--Cce--ecccccCCcceEEEE Confidence 22222222234566777777777788887653222 246888887665554443322 111 122233445566643 Q ss_pred eeecceeec--hHHHHhh---hhhHHHHHHHHHHHHHHHHHHHHHHHHhhcc-------------------------cc- Q lcl|NC_019506. 81 QKYFNFQID--DVDAAQI---RTPLMDAAMQRAAYALADETEKILLKEMDTN-------------------------AT- 129 (276) Q Consensus 81 ~~~~~~~v~--d~d~~~~---~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~-------------------------~~- 129 (276) ..+...|+ .+|.... ..||...+.+...+.+|...-.--+.....+ +. T Consensus 103 -tn~dt~i~y~~lDaWA~~g~~~dF~~r~~~~i~~~~ALD~i~IGfnGts~A~~Td~~anPllqDVNkGWlQ~~Re~a~~ 181 (341) T protein:vir:27 103 -TDSCAAITWAMLCQWANQGGRDQFMKHLTEFSNQMFALDIMRIGWNGVSAEADTDPSANPLGQDVNEGWIAFVKNRKAS 181 (341) T ss_pred -eeeeeeecHHHHHHHHhcCCChHHHHHHHHHHHHHHhhhhhhhcccceeeccCCChhhcccccccchhHHHHHHhhccc Confidence 24444454 4444332 3566666666555544432221111111100 00 Q ss_pred cc-----ccccccCCHHHHHHH-HHHHHH-HHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccc-cccceeeee-ee Q lcl|NC_019506. 130 SK-----LKPAATLDKTNIYEE-LIKVKV-KLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAM-AESITKNGF-VG 200 (276) Q Consensus 130 ~~-----~~~~~~~t~~~~~~~-i~~a~~-~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~-~~~~~~~G~-i~ 200 (276) .+ ...++..+=.+ +++ +.++.. .++..---..+.+++|+ .+|+.+..|.-..... ..+.+.--. .. T Consensus 182 rVl~~~~~~~g~~gdy~n-LDAlV~D~~~~lI~~~~~~d~dLVvivG----~dLla~k~~~l~n~~~~ptE~~Aa~~i~k 256 (341) T protein:vir:27 182 QVVDVDVYFDETNGDYRT-LDAMASDIINNQIHPMFRNDPRLTVFVG----SGLIGAAQAKLYDKADKPSEQIAAQKLDK 256 (341) T ss_pred ceeccceeeccCCCcccc-HHHHHHHHHhcccChHHhcCCCEEEEEc----hhhhhhhhhhhhccCCCCHHHHHHHHHHH Confidence 00 01111111112 222 333332 22333221224678888 4555555442211111 111111001 24 Q ss_pred EEeceEEEEeccccccccceEEEEEecceEEee---eeeeeeeeccCcccceeeE--EeeeeeeeEEEcCCeEEEEEecC Q lcl|NC_019506. 201 TILGFDVYLSNNMGSLTNGTGAIAGVKMACTFA---EQIVQTEAYRMEKRFADAV--KGLNVFGCKVIYPDALVCLKKTN 275 (276) Q Consensus 201 ~~~G~~v~~s~~lp~~~~~~~~~~~~~~a~~~~---~~~~~~e~~~~~~~~~~~i--~~~~~yg~~v~~~~~vv~~~~~~ 275 (276) ++.|.+.+.-+.+|. +...+..-++--.|- .+...++.....+++.+.- +.-.-||+.---+-.-|.+...+ T Consensus 257 ~iGGlpa~~~PffP~---~~~lVT~L~NLsIY~Q~gs~RR~~~d~p~r~rie~yes~YvVEdyg~~~~~~~~~vkl~~~~ 333 (341) T protein:vir:27 257 TIAGRPAYVPPFLPD---NAMVVTIPENLQVLTQHGTAQRKAKHESDRKRSKTHTGAWKVTQWVCWKRSPLTTQKKSTSA 333 (341) T ss_pred hhCCCeEEEccccCC---CceEEeeccceEEEEecCcEEEEEEeccccccccchhhhheeehhhhhhhccccccccCccc Confidence 789999998777764 222222112211121 1111222221112222210 11122444322222222222222 Q ss_pred C Q lcl|NC_019506. 276 P 276 (276) Q Consensus 276 p 276 (276) = T Consensus 334 ~ 334 (341) T protein:vir:27 334 L 334 (341) T ss_pred c Confidence 2 No 220 >protein:vir:1991 Length: 305 # NCBI annotation: major head subunit # Family: family:all:776 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050638;genbank:gi:9633525;genbank:GeneID:2636267 Probab=47.33 E-value=0.71 Score=21.42 Aligned_cols=199 Identities=10% Similarity=0.057 Sum_probs=97.3 Q ss_pred CccchhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCcc-cceeecCCCCCCCccccccceEEEEEE Q lcl|NC_019506. 1 MAVTSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGAI-TVKEYTENSDIDAPEELSTTEKVLEIN 79 (276) Q Consensus 1 MA~~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~~-~~~d~~~~~~~~~~~~~~~~~~~~~ld 79 (276) |..+--.=+-+...+...|++.+-..+-.++..--+.-..+.+=+-.-.|.+ .++... |.. ..++++...-+|+-. T Consensus 1 M~i~~~~l~~l~~~~~~~f~~~~~~a~~~~~~iA~~vpSt~~~~tY~wLg~fP~lrewi-Ger--~i~~l~~~~y~i~Nk 77 (305) T protein:vir:19 1 MIVTPASIKALMTSWRKDFQGGLEDAPSQYNKIAMVVNSSTRSNTYGWLGKFPTLKEWV-GKR--TIQQMEAHGYSIANK 77 (305) T ss_pred CccCHHHHHHHHHHHHHHHHHHHhhcCcccceEEeEecCCCCcccccccccCCccchhh-cce--eeeeccccceeEeec Confidence 8765211011222234444444322221111000000001111111112222 122222 211 235677666777765 Q ss_pred eeeecceeechHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc------------------ccc------- Q lcl|NC_019506. 80 KQKYFNFQIDDVDAAQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATSK------------------LKP------- 134 (276) Q Consensus 80 ~~~~~~~~v~d~d~~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~------------------~~~------- 134 (276) .+ ...+.|+..|........-....++.+++.+..-|..++.++.++.... ... T Consensus 78 ~f-e~tV~V~R~dIeDD~lG~y~p~~~~~G~~aa~~pd~lv~~lL~~Gf~~~cyDGq~FFdtDHpv~~~~~~tg~~~~vs 156 (305) T protein:vir:19 78 TF-EGTVGISRDDFEDDNLGIYAPIFQEMGRSAAVQPDELIFKLLKDGFTQPCYDGQNFFDKEHPVYPNVDGTGSAVNTS 156 (305) T ss_pred cc-cceeccchhhccccccCchHHHHHHHHHHHhhchhhHHHHHHHhcCCccCCCCCcccCCCCCcccCCcccccccchh Confidence 54 4567888888777777888888999999999999998888876532110 000 Q ss_pred -------------------------------------------------------------------------cccCCHH Q lcl|NC_019506. 135 -------------------------------------------------------------------------AATLDKT 141 (276) Q Consensus 135 -------------------------------------------------------------------------~~~~t~~ 141 (276) ..+++ T Consensus 157 n~~~~~~~~g~~w~Lld~~~~ikP~I~Q~Rk~~~~~~~~~~~d~~vf~~~e~~ygvd~R~n~Gygfwq~a~gS~~~Ls-- 234 (305) T protein:vir:19 157 NIVEQDSFSGLPFYLLDCSRAVKPLIFQERRKPELVARTRIDDDHVFMDNEFLFGASTRRAAGYGFWQMAVAVKGDLT-- 234 (305) T ss_pred hhhcCCCCCCceeeeeecCCcceeEEEecccccceeeccCCCchhhhhhceeeeeeeeeeeccccchhheecCCCCCC-- Confidence 01111 Q ss_pred HHHHHHHHHHHHHhhcCC----Cc--cCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEec-eEEEEeccc Q lcl|NC_019506. 142 NIYEELIKVKVKLDEKNV----PT--IGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILG-FDVYLSNNM 213 (276) Q Consensus 142 ~~~~~i~~a~~~l~~~~v----P~--~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G-~~v~~s~~l 213 (276) .+.|.+|+..|...+- |- ..++|||.|..+..-.+ +..+....+.. .|.+--+.| ++++.++.| T Consensus 235 --~~nl~aar~aM~~qk~d~G~pL~I~P~~LvVPp~LE~~A~q---ll~s~~i~~g~---~~~~Np~~g~~eliV~P~L 305 (305) T protein:vir:19 235 --LDNLWKGWQLMRSFEGDGGKKLGLKPTHIVVPVGLEKAAEQ---LLNRELFADGN---TTVSNEMKGKLQLVVADYL 305 (305) T ss_pred --HHHHHHHHHHHHhhcCCCCceeeeecCeEEeCchhHHHHHH---HHhhcccCCcc---ccccceecceEEEEecccC Confidence 2345666666655543 22 13588999987776543 33332211110 112223566 688888888 No 221 >protein:vir:2736 Length: 348 # NCBI annotation: putative structural protein # Family: family:all:1083 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695109;genbank:gi:23455878;genbank:GeneID:955608 Probab=45.86 E-value=0.75 Score=21.25 Aligned_cols=268 Identities=9% Similarity=0.006 Sum_probs=100.2 Q ss_pred Ccc--chhhHHHHHHHHHHHHHHhh-cc-hhhhccccccccccCCcEEEEec-c-Ccccceee-cCCCCCCCccccccce Q lcl|NC_019506. 1 MAV--TSFIPKLWSARLLAHLDKAH-VV-ANLVNRDYEGEIKAYGDTVKINQ-I-GAITVKEY-TENSDIDAPEELSTTE 73 (276) Q Consensus 1 MA~--~~l~~e~~~~~~~~~l~~~~-v~-~~~~~~~~~~~~~~~Gdtv~ip~-~-~~~~~~d~-~~~~~~~~~~~~~~~~ 73 (276) ||+ .+|.+..+...+.+.-.+.. .+ ..++- .....+-++..-+ . +...+.++ .++.+......-.-.. T Consensus 1 M~~i~d~f~~~~l~~~v~~~~~~~~~~l~~~~Fp-----~~~~~~~~~~~~~~~~~~~~~a~~v~~~~~~~~~~r~~~~~ 75 (348) T protein:vir:27 1 MGLIYDKVTASNIAGYFNALQENVSSTLGESIFP-----ARKQLGTKLSYIKGASGQSVALKAAAFDTNVTIRDRVSAEM 75 (348) T ss_pred CcchhhhcCHHHHHHHHHhccchhhhhhHhhcCC-----CccccceeEEEEeeccCceeEeeeecCCCCcceecccceee Confidence 995 46777777776554322221 11 12221 1111121221111 0 11111111 1222211111111122 Q ss_pred EEEEEEeeeecceeechHHHH--hhhh------------hHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc-------- Q lcl|NC_019506. 74 KVLEINKQKYFNFQIDDVDAA--QIRT------------PLMDAAMQRAAYALADETEKILLKEMDTNATSK-------- 131 (276) Q Consensus 74 ~~~~ld~~~~~~~~v~d~d~~--~~~~------------d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~-------- 131 (276) ....+-.. .-...++..|.. +... +.+..-.+.+...+.+.++--+...+..+.... T Consensus 76 ~~~~~p~i-~~~~~i~~~d~~~~~~~~~~~~~~~~~~~~~~i~~d~~~l~~~i~~r~E~m~~~al~~Gki~i~~~~~~~~ 154 (348) T protein:vir:27 76 HDEQMPFF-KEAMLVKENDRQQLNLVKDSGNAVLVNTIVAGIFNDNLTLVNGARARLEAMRMQVLATGKIAFTSDGVNKD 154 (348) T ss_pred eeeecCcc-ccccccCHHHHHHHHHhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCeeEEecCCeeEE Confidence 23333211 112344443321 1111 111222333444444454544445443321100 Q ss_pred ----------ccc-cc-cCCHHHHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcccc--ccccee-- Q lcl|NC_019506. 132 ----------LKP-AA-TLDKTNIYEELIKVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAM--AESITK-- 195 (276) Q Consensus 132 ----------~~~-~~-~~t~~~~~~~i~~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~--~~~~~~-- 195 (276) .+. +. ..+..+++++|.++.+.+++.+. ...+++++++.+..|++++.+.+.-... ....+. T Consensus 155 vdfg~~~~~~~t~~~~W~~~~adp~~di~~~~~~~~~~G~--~~~~ii~~~~~~~~l~~~~~v~~~~~~~~~~~~~i~~~ 232 (348) T protein:vir:27 155 IDYGVKPDHKKQVSKSWAEPGATPLADLEDAIETARELGL--NPERAVMNAKTFGLIRKAASTVKVIKPLAGDGSAVTKA 232 (348) T ss_pred EeecCCcccceeeeeccCCCCCCHHHHHHHHHHHHHhcCC--cccEEEECHHHHHHHhcCHHHHHHhcccCccccccCHH Confidence 000 00 01234567888888888877665 2347899999999999998877653211 111111 Q ss_pred --eeeeeEEeceEEEEeccccccccc--------eEEEEEecceEEeeeeeeeeee------------------------ Q lcl|NC_019506. 196 --NGFVGTILGFDVYLSNNMGSLTNG--------TGAIAGVKMACTFAEQIVQTEA------------------------ 241 (276) Q Consensus 196 --~G~i~~~~G~~v~~s~~lp~~~~~--------~~~~~~~~~a~~~~~~~~~~e~------------------------ 241 (276) .-.++.+.|++|+..+.--...++ ..+++...+..|.-.-....|. T Consensus 233 ~~~~~~~~~~g~~i~~yd~~y~d~~G~~~~~~p~~~vvl~~~~~~G~~~yG~~~e~~~~~~~~~~~~~~~~~~~~~~~~~ 312 (348) T protein:vir:27 233 ELENYIADNFGVSIVLENGTYRNDKGEVSKFYPDGHLTLIPNGPLGNTVFGTTPEESDLFADNTVNAEVEIVDNGIAVTT 312 (348) T ss_pred HHHHHHHhhcCceEEEEeeEEEcCCCcCcccccCCeEEEEcCCcceeEEeccCcchhhhhhccccccceeeeCCeeEEEe Confidence 112345678877664322111111 1122222222221100000000 Q ss_pred ccCcccceeeEEeeeeeeeEEEcCCeEEEEEecCC Q lcl|NC_019506. 242 YRMEKRFADAVKGLNVFGCKVIYPDALVCLKKTNP 276 (276) Q Consensus 242 ~~~~~~~~~~i~~~~~yg~~v~~~~~vv~~~~~~p 276 (276) +...+-.+..+.+-.+-=-.+.+|+++.++++.+= T Consensus 313 ~~~~dP~~~~~~~~s~~lPv~~~~~~~~~a~Vl~~ 347 (348) T protein:vir:27 313 TKTTDPVNVQTKVSMVALPSFERLDDVYMLTVIPA 347 (348) T ss_pred eecCCCceEEEEEeeeeeccccCCCcEEEEEEecC Confidence 00000000111111100011125666666654333 No 222 >protein:vir:10324 Length: 320 # NCBI annotation: ORF26 # Family: family:all:570 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758919;genbank:gi:27311193;genbank:GeneID:956155 Probab=35.99 E-value=1.2 Score=20.15 Aligned_cols=257 Identities=10% Similarity=-0.040 Sum_probs=100.5 Q ss_pred chhhHHHHHHHHHHHH-HHhhcchhhhccccccccccCCcEEEEeccCcccceeecCCCCCCCccccccceEEEEEEeee Q lcl|NC_019506. 4 TSFIPKLWSARLLAHL-DKAHVVANLVNRDYEGEIKAYGDTVKINQIGAITVKEYTENSDIDAPEELSTTEKVLEINKQK 82 (276) Q Consensus 4 ~~l~~e~~~~~~~~~l-~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~ld~~~ 82 (276) -.++|..|.. ++..| ++.-+....+ .-+. ..|.---+|... |+........-.-....+.+-..+ T Consensus 1 i~~~P~~~g~-~~glff~~~~v~T~~V----~ie~-~~~~l~lip~v~--------rg~~g~~~~~~~~~~~~f~~p~~~ 66 (320) T protein:vir:10 1 MNLLPVNYGD-SRALFAREKKVRTRTI----LVEE-KNGVLTLIQSRE--------PGSTENVAKRGKRKVRSFVIPHLP 66 (320) T ss_pred CCcCCchhhh-hhhhccCCCCcccceE----EEEE-ecCceeeeeccC--------CCCCceeecCCcceEEEEecceec Confidence 2245777765 23333 2211111111 1111 122222233221 222211111111123333332222 Q ss_pred ecceeechHHH----------HhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccc----c--------c-ccc---- Q lcl|NC_019506. 83 YFNFQIDDVDA----------AQIRTPLMDAAMQRAAYALADETEKILLKEMDTNATS----K--------L-KPA---- 135 (276) Q Consensus 83 ~~~~~v~d~d~----------~~~~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~----~--------~-~~~---- 135 (276) .. ..|+-.|. .++..+.+.+.+..+.+.+....+..++.++...-.. . + +.. T Consensus 67 ~~-d~i~a~eiq~~Ra~G~~~~~~~~~~v~~~l~~lr~~~~~T~E~m~~~AL~G~ildadGtv~~d~y~~fGi~~~~i~~ 145 (320) T protein:vir:10 67 LE-DVILPDEYEGLRGFGTTALAAKSELVKERXETMKSSHDITHEHLRMGAKKGQILDADGTVLYDLYAEFGITKKTIYF 145 (320) T ss_pred cC-CccCHHHHcCcccCCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCeEEcCCCcEEEechhhhCCccceeEE Confidence 21 22332221 2233344555555556666666655666665421000 0 0 000 Q ss_pred -ccCCHHHHHHHHHHHHHHHhh--cCCCccCCEEEECHHHHHHHhhhHHhhhhcccc--cccceeeeee--eEEeceEEE Q lcl|NC_019506. 136 -ATLDKTNIYEELIKVKVKLDE--KNVPTIGRFLIIPPDVHGLLLAADLIVGTGGAM--AESITKNGFV--GTILGFDVY 208 (276) Q Consensus 136 -~~~t~~~~~~~i~~a~~~l~~--~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~--~~~~~~~G~i--~~~~G~~v~ 208 (276) -.....++.+.+.+..+.+.+ .+.+..+-+++++|+++..|..++.+..+.... +...++...- -.+.|+.|. T Consensus 146 ~l~~a~~dv~~~~~~~~~~i~~~l~g~~~t~v~al~g~~f~~al~~h~~Vke~y~~~~~~~~~l~~~~~~~f~~gGi~~~ 225 (320) T protein:vir:10 146 GLDNKDANVAESCRQVLRHVEDNLRGDVMKDVSVDVSEEFFDKFIKHASVKEVFLNHEAAVNRLGGDTRKGFKFGGLIFN 225 (320) T ss_pred ecCCCCccHHHHHHHHHHHHHHHhccCCCCceEEEEChHHHHHHhcCHHHHHHHHhhhhhhhhccccccceEEecCEEEE Confidence 000111233333444444422 244556667899999999999988765542211 1112222111 257888887 Q ss_pred Eeccccccc----------cceEEEE-EecceEE--eeeeeeeee------------eccCcccceeeEEeeeeeeeEEE Q lcl|NC_019506. 209 LSNNMGSLT----------NGTGAIA-GVKMACT--FAEQIVQTE------------AYRMEKRFADAVKGLNVFGCKVI 263 (276) Q Consensus 209 ~s~~lp~~~----------~~~~~~~-~~~~a~~--~~~~~~~~e------------~~~~~~~~~~~i~~~~~yg~~v~ 263 (276) +...--... ....++. +.++.+. ++.. +..| .+..++..+-.+..-..-=.... T Consensus 226 ~Y~g~~~d~~g~~~~~I~~~~~~~~p~g~~~~f~~~~apa-d~~e~vnt~g~p~y~k~~~~~~~~g~~l~~qS~PLpi~~ 304 (320) T protein:vir:10 226 ENRARHVDEEGKETRFIKAGKGHAFPTGTTNTFFTALAPA-DFNETAGTLGKRYYAKMEPRRMGRGFDLHSQSNVLPMCC 304 (320) T ss_pred EcccEEEcCCCCeeEeecCCeeEEEEecCchhheeeeccc-CcHhhcCCcccccccccccccCCCeEEEEeeeccccccc Confidence 753210001 1111111 1222211 1111 1011 11222323333333333333446 Q ss_pred cCCeEEEEE-ecCC Q lcl|NC_019506. 264 YPDALVCLK-KTNP 276 (276) Q Consensus 264 ~~~~vv~~~-~~~p 276 (276) ||+.++.++ .++| T Consensus 305 rP~~lv~~~~~a~~ 318 (320) T protein:vir:10 305 RPGVLVELDAAAQP 318 (320) T ss_pred CcceEEEEEecCCC Confidence 899999988 4556 No 223 >protein:vir:98480 Length: 348 # NCBI annotation: ORFp38 # Family: family:all:1083 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958280;genbank:gi:41057254;uniprot:Q38595;genbank:GeneID:2732864 Probab=31.75 E-value=1.5 Score=19.66 Aligned_cols=267 Identities=8% Similarity=0.039 Sum_probs=100.6 Q ss_pred Cccch----hhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEec--c--Ccccceee-cCCCCCCCcccccc Q lcl|NC_019506. 1 MAVTS----FIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQ--I--GAITVKEY-TENSDIDAPEELST 71 (276) Q Consensus 1 MA~~~----l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~--~--~~~~~~d~-~~~~~~~~~~~~~~ 71 (276) |++++ +.|..++..+.+...+ .....+....|-+.... +.+.|-. . +...+..+ .++.+......-.. T Consensus 1 M~~~~~~d~~~~~~l~~~i~~~~~~-~~~~~~l~~~~fp~~~~--~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~r~g~ 77 (348) T protein:vir:98 1 MSWTLDTEFIEPTQLTGLIREALRD-LQVNRFRLARWLPNVDV--DDITFEFLRGGGGLAETASYRSWDTESKIGRREGL 77 (348) T ss_pred CcchhhhhccCHHHHHHHHHHHhhc-cCcchhhHHhcCCCccc--cceEEEEEeccCCceeeeeeecCCCccceeecccc Confidence 99864 5666666655543321 11111111111111111 1222211 1 11111222 22222111111112 Q ss_pred ceEEEEEEeeeecceeechHHHHhh-------hhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccc--------------- Q lcl|NC_019506. 72 TEKVLEINKQKYFNFQIDDVDAAQI-------RTPLMDAAMQRAAYALADETEKILLKEMDTNAT--------------- 129 (276) Q Consensus 72 ~~~~~~ld~~~~~~~~v~d~d~~~~-------~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~--------------- 129 (276) ......+-.. .-...++..|.... ..+.+.+-.+.+...+.+.++--+..++..+.. T Consensus 78 ~~~~~~~~~i-~~~~~i~~~d~~~~~~~~~~~~~~~i~~d~~~l~~~i~~r~E~m~~qal~~Gki~~~g~~~~vDyg~~~ 156 (348) T protein:vir:98 78 AKVMGELPPI-SEKIPLNEYDRLRLRKLSRDEALPFIARDAQRLARNIGARFEVARGSALVNATVPVTELQQTVDFGRIG 156 (348) T ss_pred eeeeeecccc-ccccccCHHHHHHhcCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCeEEEecCceEEccccCc Confidence 3333443222 22345555554421 112222233445555555555545555443211 Q ss_pred -cccccc---ccCCHHHHHHHHHHHHHHHhhc-CCCccCCEEEECHHHHHHHhhhHHhhhhcccc----cccceeeee-- Q lcl|NC_019506. 130 -SKLKPA---ATLDKTNIYEELIKVKVKLDEK-NVPTIGRFLIIPPDVHGLLLAADLIVGTGGAM----AESITKNGF-- 198 (276) Q Consensus 130 -~~~~~~---~~~t~~~~~~~i~~a~~~l~~~-~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~~~----~~~~~~~G~-- 198 (276) ...+.. +.....+.++.|.++...+.+. +.. ..+++++++.+..|++++.+.+..... ....+..+. T Consensus 157 ~~~~t~~~~Ws~~~~adp~~di~~~~~~~~~~~G~~--p~~~vm~~~~~~~l~~~~~i~~~~~~~~~~~~~~~~~~~~~~ 234 (348) T protein:vir:98 157 SHSVVAAVLWSVHATATPISDLESWVATYEDTNGQS--PGVILMPKAAVSHMRQCEEVIRQVFPLAPSGTAPMVSVEQLN 234 (348) T ss_pred ccccccccccCCCCCCCHHHHHHHHHHHHHHccCCc--ceEEEeCHHHHHHHhcCHHHHHHHhccCccccccccCHHHHH Confidence 001111 1112345678888887777654 432 247999999999999998877643211 111111122 Q ss_pred --eeEEeceEEEEecc-ccccccceEEEEEecceEEeee----------------------eee----eeee-------- Q lcl|NC_019506. 199 --VGTILGFDVYLSNN-MGSLTNGTGAIAGVKMACTFAE----------------------QIV----QTEA-------- 241 (276) Q Consensus 199 --i~~~~G~~v~~s~~-lp~~~~~~~~~~~~~~a~~~~~----------------------~~~----~~e~-------- 241 (276) .+.+.+..|+.... ... .++.. -.+-.+.+.+.. ... ..+. T Consensus 235 ~~~~~~g~~~i~~~d~~~~~-~g~~~-~~~p~~~i~l~p~~~~~~~~~~~~~G~t~~G~~~e~~~~~~~~~~~~~~~i~~ 312 (348) T protein:vir:98 235 TVLSSMGLPPIEVYDAKVAV-DGVST-RITPANAIALLPEPGATDAAQPTELGATLLGTTAESLEDDYALAPGEQPGIVA 312 (348) T ss_pred HHHHhhCCeEEEEeeeEEEc-CCcee-ceecCCeEEEEecCCcccccccccccceecccchhhhccccccceeccCceee Confidence 12233444444222 111 11111 011111111100 000 0000 Q ss_pred --ccCcccceeeEEeeeeeeeEEEcCCeEEEEEecC Q lcl|NC_019506. 242 --YRMEKRFADAVKGLNVFGCKVIYPDALVCLKKTN 275 (276) Q Consensus 242 --~~~~~~~~~~i~~~~~yg~~v~~~~~vv~~~~~~ 275 (276) +...+-.+..+.+..+-=-.+.+|++++++++-+ T Consensus 313 ~~~~~~dP~~~~~~~~s~~lPv~~~~~~~~~a~Vl~ 348 (348) T protein:vir:98 313 ATWKTKDPVRLWTHAAAVGIPVLREPNLTFKAQVLA 348 (348) T ss_pred eeeeecCCcEEEEEEeeeeeccccCCCcEEEEEEeC Confidence 0001111111111111112223677777777766 No 224 >protein:vir:79399 Length: 455 # NCBI annotation: head protein # Family: family:all:4054 # MgeID: mge:1869 # MgeName: Av-1 # Cross-refs: genbank:acc:YP_001333662;genbank:gi:151266299;genbank:GeneID:5329881 Probab=27.35 E-value=1.9 Score=19.12 Aligned_cols=263 Identities=14% Similarity=0.133 Sum_probs=110.4 Q ss_pred CccchhhHHHHHHHHHHHHHHhhcchhhhccc----cccccccCCcEEEEeccCcccceeecC-----CCCCCCcccccc Q lcl|NC_019506. 1 MAVTSFIPKLWSARLLAHLDKAHVVANLVNRD----YEGEIKAYGDTVKINQIGAITVKEYTE-----NSDIDAPEELST 71 (276) Q Consensus 1 MA~~~l~~e~~~~~~~~~l~~~~v~~~~~~~~----~~~~~~~~Gdtv~ip~~~~~~~~d~~~-----~~~~~~~~~~~~ 71 (276) |+++.+.-|.+.+-+.+. ...+ ...+.+++ |..+....|+||+=--..-.....|.+ ...+-....++- T Consensus 45 i~d~~~qnEf~~sLI~RI-gs~L-~~d~S~~NPLa~FK~g~~~fGdtIeei~~d~ak~~~yd~~~~~aev~pFk~e~P~I 122 (455) T protein:vir:79 45 MSDNITRNEFMSALINRI-GSTL-IRDLSWKNPLAVFKQGMMNFGDTIEEVHMDYIKPTIYEEQRDYLERDVFGQAPPPV 122 (455) T ss_pred hhhhhHHHHHHHHHHhcc-ccEE-EecccccCchHHhccccchhhhhhhhhhhccccccccCcchhhhhccccccCCCce Confidence 667666555555533322 1111 11111122 222222346665421111111222222 122212233333 Q ss_pred ceEEEEEEeeeecceeechHHHHh--hhhhHHHHHHHHHHHHHH--HHHHHHH-----HHHhh-ccccc---cccccccC Q lcl|NC_019506. 72 TEKVLEINKQKYFNFQIDDVDAAQ--IRTPLMDAAMQRAAYALA--DETEKIL-----LKEMD-TNATS---KLKPAATL 138 (276) Q Consensus 72 ~~~~~~ld~~~~~~~~v~d~d~~~--~~~d~~~~~~~~~~~ala--~~~d~~~-----~~~~~-~~~~~---~~~~~~~~ 138 (276) ...-.+.+++-+.-..|++..... .+..-.++++.+...+|. .++|++. +..+. +.-.+ ...+++.- T Consensus 123 kA~~H~~nR~~~y~~TI~dd~i~~AF~S~~gldefi~~i~~si~sSde~dEY~ylk~Li~~~~~~~~f~~~~I~D~~t~~ 202 (455) T protein:vir:79 123 KSAFHTINRKEKFKITVNRDVLRRAFLSDNGLSEMLSQTMAVAASSDQWSEFLYMTRLFKTYEDSFGFYRMQISDMNTFE 202 (455) T ss_pred eEEEeeccccceeeeeeeHHHHHHhhcChhhHHHHHHHHHHHHhcccchHHHHHHHHHHHHhhhhccceEEEeccccccc Confidence 444555555555556666655433 344556777777777775 3555543 22221 12212 12222211 Q ss_pred CHHHHH----HHHHHHHHHH-------hhcCCCc----cCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEe Q lcl|NC_019506. 139 DKTNIY----EELIKVKVKL-------DEKNVPT----IGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTIL 203 (276) Q Consensus 139 t~~~~~----~~i~~a~~~l-------~~~~vP~----~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~ 203 (276) ..+..+ +.++.+..+| +..++|. ++.+++++|++-..|-- ..+.++.+-. .+-.-+.+-.+. T Consensus 203 ~d~~~~~~~iK~lr~aA~kM~lPTR~yN~~gv~~~tdi~DL~lI~~~dtq~evdv-~~LA~AFN~d--~vd~~~~~i~Vd 279 (455) T protein:vir:79 203 PDKNKVDAALKALRVAANKMQYPTPAFNSAGVHSFARPEDLVLITTPEFKANVDV-TSLSAAFNRS--DAEAPSHIITVP 279 (455) T ss_pred cchhHHHHHHHHHHHHHHHhcCCCcccccccCcccccceeeEEEeCCCceeeecH-HHHHHHhCcc--chhcCceeEEec Confidence 222333 3444443333 2223332 45689999998777632 2233332211 111223344455 Q ss_pred ceEEEEeccccccccceEEEEEecceEEeeeeeeeeeeccCcccceeeEEe-------eeee---eeEEEcCCeEEEE-- Q lcl|NC_019506. 204 GFDVYLSNNMGSLTNGTGAIAGVKMACTFAEQIVQTEAYRMEKRFADAVKG-------LNVF---GCKVIYPDALVCL-- 271 (276) Q Consensus 204 G~~v~~s~~lp~~~~~~~~~~~~~~a~~~~~~~~~~e~~~~~~~~~~~i~~-------~~~y---g~~v~~~~~vv~~-- 271 (276) ||-. ..++..+++.-+.++..-....++|..|+|...-+=|.. ..+| -+++-.|+.|+|- T Consensus 280 ~f~f--------a~~~~~a~~~sk~~~~i~D~l~~~~si~np~~l~~Ny~~H~w~ils~S~F~~a~af~~~~~~~~vtp~ 351 (455) T protein:vir:79 280 GETL--------GMDDTSAILTSKQFFVIKDILLENRTISNPEGLYDNYWLHHWSILSASPFTPAIAFGTKPNTIVVTPK 351 (455) T ss_pred cccc--------ccCCceEEEeehhhhhhhhhhhhcccccCcccceeehhhhhhhhhhhccccceeeeecCCceEEEccc Confidence 5521 123444666666666655556677787777543221110 1111 1333344443331 Q ss_pred ----------EecCC Q lcl|NC_019506. 272 ----------KKTNP 276 (276) Q Consensus 272 ----------~~~~p 276 (276) +.+-| T Consensus 352 ~~~~~~~~~l~~~~~ 366 (455) T protein:vir:79 352 AETNAEITNLTVTRP 366 (455) T ss_pred ccccccccccccccc Confidence 11112 No 225 >protein:vir:15 Length: 472 # NCBI annotation: major head protein # Family: family:all:4054 # MgeID: mge:323 # MgeName: GA-1 # Cross-refs: genbank:acc:NP_073691;swissprot:sw:q9fzw7;genbank:gi:12248115;uniprot:Q9FZW7;genbank:GeneID:919909 Probab=26.65 E-value=1.9 Score=19.03 Aligned_cols=257 Identities=12% Similarity=0.119 Sum_probs=113.8 Q ss_pred CccchhhHHHHHHHHHHHHHHhhcchhhhccccccccc----cCCcEEEEeccCcccceeecC---CCCCCCccccccce Q lcl|NC_019506. 1 MAVTSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIK----AYGDTVKINQIGAITVKEYTE---NSDIDAPEELSTTE 73 (276) Q Consensus 1 MA~~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~----~~Gdtv~ip~~~~~~~~d~~~---~~~~~~~~~~~~~~ 73 (276) |+++.+.-|.+.+-+. ...-.. ...+.+++.-.+|. ..|++|.=.........-|.+ ...+-....++-.. T Consensus 52 ~~d~~~QnEf~~sLv~-RIgst~-V~~~s~~NPLa~Fk~~~~~fG~~Ieei~~D~a~~~~yd~~k~Ev~pFk~~~P~IkA 129 (472) T protein:vir:15 52 LADKTLQNDFIHTLVD-RIGLVV-VHHKLMQNPLKIFKKGTLEYGRKIEEIFTDLTREHVYDPEKAETEVFKREIPNVKT 129 (472) T ss_pred hhhhhhHHHHHHHHHh-hhcchh-hhhhhccChHHHHhhcCccchhhhhhhhcccccccccchhhhhccccccCCCccee Confidence 7777666555555332 222111 22333333333333 235554422221111112222 11111112333334 Q ss_pred EEEEEEeeeecceeechHHHHh--hhhhHHHHHHHHHHHHHH--HHHHHHHH-----HHh-hcccccc---c-ccc-ccC Q lcl|NC_019506. 74 KVLEINKQKYFNFQIDDVDAAQ--IRTPLMDAAMQRAAYALA--DETEKILL-----KEM-DTNATSK---L-KPA-ATL 138 (276) Q Consensus 74 ~~~~ld~~~~~~~~v~d~d~~~--~~~d~~~~~~~~~~~ala--~~~d~~~~-----~~~-~~~~~~~---~-~~~-~~~ 138 (276) .-.+.+++-+.-..|++..... .+..-.++++.+...+|. .++|++.. ..+ ...-..+ . .+. ... T Consensus 130 ~~H~~nR~~~y~~Ti~~d~i~~AF~S~~gld~fi~~i~~si~sSde~dEY~~~k~li~~~~~k~lf~v~~i~~d~~~~~v 209 (472) T protein:vir:15 130 LFHERDRQVFYKQTISDQQLKTAFTNAQKFDEFLSTIVTSIYNSAEVDEFRYTKLLIDNYFSKNLFKIVPVSVDPATGIV 209 (472) T ss_pred EEeeccccceeeeeeeHHHHHHhhcChhhHHHHHHHHHHHHhccccHHHHHHHHHHHHHhhhccceEEEecCCCcccccc Confidence 4555555555556666654433 344556677777777775 35555432 111 1111111 1 111 112 Q ss_pred CHHHHHHHHHHHHHHHhhcCCCc----------------cCCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEE Q lcl|NC_019506. 139 DKTNIYEELIKVKVKLDEKNVPT----------------IGRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTI 202 (276) Q Consensus 139 t~~~~~~~i~~a~~~l~~~~vP~----------------~~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~ 202 (276) +..+..+.++.+. .+..+|+ ++.+++++|++-..|-- ..+.++.+-. .+-.-+.+-.+ T Consensus 210 ~~kd~~K~lr~aa---~kM~lP~gT~~yN~~gv~~~td~~DL~lI~~~dtq~evdv-~~LA~AFN~d--~vd~~~~~i~V 283 (472) T protein:vir:15 210 NTKEFLAKTRATA---TKMTLPMGTRDFNSMAVHTRTDMDDLYIIMDADTQAEVDV-NELASAFNLN--KADFIGRRILI 283 (472) T ss_pred cHHHHHHHHHHHH---HHhcCCCCCCCCCccccceeccceeeeEEeCCCceEeecH-HHHHHHhCcc--hhhcCceeEEe Confidence 2223334444433 4445552 45588999988776622 2333332211 11122334445 Q ss_pred eceEEEEeccccccccceEEEEEecceEEeeeeeeeeeeccCcccc-------------eeeEEeeeeeeeEEEcC---C Q lcl|NC_019506. 203 LGFDVYLSNNMGSLTNGTGAIAGVKMACTFAEQIVQTEAYRMEKRF-------------ADAVKGLNVFGCKVIYP---D 266 (276) Q Consensus 203 ~G~~v~~s~~lp~~~~~~~~~~~~~~a~~~~~~~~~~e~~~~~~~~-------------~~~i~~~~~yg~~v~~~---~ 266 (276) .|| + .++..+++.-+.+++.-.+..+++..|+|... .........||++-.=| . T Consensus 284 d~F--------a--~~d~~a~l~sk~~f~i~D~l~~m~s~rnprgL~~Ny~lHv~q~~s~s~F~naiaF~~g~~v~~~~~ 353 (472) T protein:vir:15 284 DGF--------A--STGLKAVMVDKDFFMLYDQVFRMESQRNAQGMYWNYYLHVWQVLSTSRFANAVAFVDSALIDGDVS 353 (472) T ss_pred ccc--------C--CCCceeeeehhhHHHHHHHHHhcccccCcccchhHHHHHHHHHHHhccccceEEEeccccCCCccc Confidence 554 2 24455667777777766666778888877532 22334445566655322 2 Q ss_pred eEEEEEecCC Q lcl|NC_019506. 267 ALVCLKKTNP 276 (276) Q Consensus 267 ~vv~~~~~~p 276 (276) .+++ ..++- T Consensus 354 ~~iv-~p~~~ 362 (472) T protein:vir:15 354 QVIV-TPTVG 362 (472) T ss_pred eEEE-eeccc Confidence 2222 11111 No 226 >protein:vir:393 Length: 341 # NCBI annotation: gp8 # Family: family:all:1021 # MgeID: mge:325 # MgeName: N15 # Cross-refs: genbank:acc:NP_046903;genbank:gi:9630472;genbank:GeneID:1261647 Probab=25.98 E-value=2 Score=18.94 Aligned_cols=266 Identities=9% Similarity=0.043 Sum_probs=100.4 Q ss_pred cchhhHHHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEecc-Cccccee-ecCCCCCCCccccccceEEEEEEe Q lcl|NC_019506. 3 VTSFIPKLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQI-GAITVKE-YTENSDIDAPEELSTTEKVLEINK 80 (276) Q Consensus 3 ~~~l~~e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~-~~~~~~d-~~~~~~~~~~~~~~~~~~~~~ld~ 80 (276) ..+|.+..+.+.+.+.-.....+..++ |..+....-++|.+-.. +...... ..++...............+++-. T Consensus 1 ~d~f~~~~L~~~i~~~~~~~~~l~~~~---Fp~~~~~~t~~v~~~~~~~~~~lap~v~~~~~~~~~~~~~~~~~~~~~p~ 77 (341) T protein:vir:39 1 MSVYTTAQLLAVNEKKFKFDPLFLRIF---FRETYPFSTEKVYLSQIPGLVNMALYVSPIVSGKVIRSRGGSTSEFTPGY 77 (341) T ss_pred CCccCHHHHHHHHHhhcCccchhHhhc---CCcccccCcceEEEEEecCCceeeEEecCCCCcceecccceeeeeEeccc Confidence 666777666665555443332233322 11111112245555322 2222222 223322221122222334444432 Q ss_pred eeecceeechHHHHh-----h----------hhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc-------------- Q lcl|NC_019506. 81 QKYFNFQIDDVDAAQ-----I----------RTPLMDAAMQRAAYALADETEKILLKEMDTNATSK-------------- 131 (276) Q Consensus 81 ~~~~~~~v~d~d~~~-----~----------~~d~~~~~~~~~~~ala~~~d~~~~~~~~~~~~~~-------------- 131 (276) .+ -...++-.|... . ..+.+.+-...+...+...++--++..+..+.... T Consensus 78 i~-~~~~i~~~d~~~r~~g~~~~~~~~~~~~~~~~i~~~~~~l~~~i~~r~E~m~~qaL~~Gki~i~~~g~~~~~vDfg~ 156 (341) T protein:vir:39 78 VK-PKHEVNPLMTLRRLPDEDPQNLADPVYRRRRIILQNMKDEELAIAQVEEKQAVAAVLSGKYTMTGEAFEPVEVDMGR 156 (341) T ss_pred cC-cccccCHHHHHHHhhcccccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCceEEEcCCCcEEEEeccC Confidence 22 223444444321 0 01112222233444444455544555553321100 Q ss_pred ------cccccc--CC-HHHHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHhhhhcc--cccccce------ Q lcl|NC_019506. 132 ------LKPAAT--LD-KTNIYEELIKVKVKLDEKNVPTIGRFLIIPPDVHGLLLAADLIVGTGG--AMAESIT------ 194 (276) Q Consensus 132 ------~~~~~~--~t-~~~~~~~i~~a~~~l~~~~vP~~~r~~vv~p~~~~~L~~~~~~~~~~~--~~~~~~~------ 194 (276) ...++. .+ +..+.+-+.+....+++.+.. ..+++++++++..|+.++.+..... .+..+.+ T Consensus 157 ~~~~~~~lt~~~~W~~~~~~~~d~l~di~~~~~~~g~~--~~~ii~~~~~~~~l~~~~~v~~~~~~~~~~~~~~~~~~~~ 234 (341) T protein:vir:39 157 SAGNNIVQAGAAAWSSRDKETYDPTDDIEAYALNASGV--VNIIVFDPKGWALFRSFKAVKEKLDTRRGSNSELETALKD 234 (341) T ss_pred CccceeEecCCccCCCCCCchHHHHHHHHHHHHhcCCc--eEEEEeChHHHHHHhcCHHHHHHHhhcccccccccchhhh Confidence 000110 11 112234444445555555542 3478999999999999887765432 1111111 Q ss_pred -eeee--eeEEeceEEEEeccccccccce-------EEEEEecceE---Eeee--eeee-----e--ee----c-cCccc Q lcl|NC_019506. 195 -KNGF--VGTILGFDVYLSNNMGSLTNGT-------GAIAGVKMAC---TFAE--QIVQ-----T--EA----Y-RMEKR 247 (276) Q Consensus 195 -~~G~--i~~~~G~~v~~s~~lp~~~~~~-------~~~~~~~~a~---~~~~--~~~~-----~--e~----~-~~~~~ 247 (276) .+|. ++++.|++|++.+.--...++. .+.++..+.. .++. .... . +. + ...+- T Consensus 235 ~~~~~~~~~~~~g~~i~~y~~~y~d~g~~~~~ip~~~~~l~p~~~~g~~~yg~~~d~~~~~~~~~~~~~~~~~~~~~~dp 314 (341) T protein:vir:39 235 LGKAVSYKGMYGDVAIVVYSGQYIENDVKKNYLPDLTMVLGNTQARGLRTYGCILDADAQREGINASTRYPKNWVQTGDP 314 (341) T ss_pred hhhHhhhhhhhcCceEEEEccEEEecCcEEeeecCCeEEEeeCCCcceEEEecccchhhcccceeeeeeeeeeeeecCCC Confidence 1232 3467788887754322111111 1111111111 1110 0000 0 00 0 00011 Q ss_pred ceeeEEeeeeeeeEEEcCCeEEEEEec Q lcl|NC_019506. 248 FADAVKGLNVFGCKVIYPDALVCLKKT 274 (276) Q Consensus 248 ~~~~i~~~~~yg~~v~~~~~vv~~~~~ 274 (276) .+..+.+-..-=-...+|+++++++++ T Consensus 315 ~~~~~~~~s~plPv~~~p~~~~~a~V~ 341 (341) T protein:vir:39 315 AREFTMIQSAPLMLLADPDEFVSVKLA 341 (341) T ss_pred cEEEEEEeccccceeeCCCcEEEEEeC Confidence 111111111111222466666666666 No 227 >protein:vir:96079 Length: 382 # NCBI annotation: hypothetical protein ORF023 # Family: family:all:1653 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294440;genbank:gi:149408337;genbank:GeneID:5237198 Probab=23.80 E-value=2.3 Score=18.64 Aligned_cols=261 Identities=13% Similarity=0.063 Sum_probs=118.6 Q ss_pred CccchhhH----HHHHHHHHHHHHHhhcchhhhccccccccccCCcEEEEeccCcc-cceeecCCCCCCCccccccceEE Q lcl|NC_019506. 1 MAVTSFIP----KLWSARLLAHLDKAHVVANLVNRDYEGEIKAYGDTVKINQIGAI-TVKEYTENSDIDAPEELSTTEKV 75 (276) Q Consensus 1 MA~~~l~~----e~~~~~~~~~l~~~~v~~~~~~~~~~~~~~~~Gdtv~ip~~~~~-~~~d~~~~~~~~~~~~~~~~~~~ 75 (276) ++|.- +| +-|...+.+.+........++-.+-.++ ..-+++.++..... .+..|...++... -+..-+... T Consensus 73 ~~~~g-~p~~~l~~~~p~~~~~~~~p~~~~~l~pv~t~g~--W~~~t~ty~~~e~~G~A~~ygd~~D~Pl-~d~~~~~~~ 148 (382) T protein:vir:96 73 TPSIP-TPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGS--WEDQEIVQGIVEPAGTAVEYGDHTNIPL-TSWNANFER 148 (382) T ss_pred cCCcc-HHHHHHhhhhhhhhhhhhhhhhhhhhccccccCC--ccceEEEEeeeecccceEEeecccCCCc-cccccceeE Confidence 44432 23 3444555555554444555543211111 11257888876443 3445543333211 223333444 Q ss_pred EEEEeeeecceeechHHHHhh---hhhHHHHHHHHHHHHHHHHHHHHHHHHhhc-------c----c------ccccccc Q lcl|NC_019506. 76 LEINKQKYFNFQIDDVDAAQI---RTPLMDAAMQRAAYALADETEKILLKEMDT-------N----A------TSKLKPA 135 (276) Q Consensus 76 ~~ld~~~~~~~~v~d~d~~~~---~~d~~~~~~~~~~~ala~~~d~~~~~~~~~-------~----~------~~~~~~~ 135 (276) .++-. ...++.+.+.|+..+ ..++..+-...+.+++.+..++..+-...+ + + ....+.. T Consensus 149 r~v~~-~~~g~~yg~lE~~rAa~~~~~l~~~Ka~aA~~ale~~~N~i~f~G~~~g~~~~~yGllNdP~l~a~~t~a~~~W 227 (382) T protein:vir:96 149 RTIVR-GELGLLVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPPFQTPPSQGW 227 (382) T ss_pred EEEEE-EEEeeeecHHHHHHHHhhCCCcHHHHHHHHHHHHHHhhceEEEEeeecCcCcceEEEEeCCCcccccccCCCCc Confidence 44422 245577887777654 457777777777778888877654411101 0 0 0011122 Q ss_pred ccCCHHHHHHHHHHHHHHHhhcCC----Ccc-CCEEEECHHHHHHHhhhHHhhhhcccccccceeeeeeeEEeceEEEEe Q lcl|NC_019506. 136 ATLDKTNIYEELIKVKVKLDEKNV----PTI-GRFLIIPPDVHGLLLAADLIVGTGGAMAESITKNGFVGTILGFDVYLS 210 (276) Q Consensus 136 ~~~t~~~~~~~i~~a~~~l~~~~v----P~~-~r~~vv~p~~~~~L~~~~~~~~~~~~~~~~~~~~G~i~~~~G~~v~~s 210 (276) ...|.+.++++|..+...+...-- |.. ...|++.|..+..|-.- + .++ .. +.+-.--++-+++|+.. T Consensus 228 a~kT~~eI~~Di~~l~~~i~~qt~G~~~~~~~~~~L~LP~~~~~~Ls~~----n--~~g-~T-vl~~lk~n~Pnl~i~t~ 299 (382) T protein:vir:96 228 ATADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITMALATSKVDYLSVT----T--PYG-IS-VSDWIEQTYPKMRIVSA 299 (382) T ss_pred ccccHHHHHHHHHHHHHHHHhccCCeeeecccceEEeechHHHhhcccc----C--ccC-cc-HHHHHHHhcCCcEEEEc Confidence 345778889999988888865531 222 23588999988877431 1 111 00 11100012345566654 Q ss_pred ccccccc---cceE--EEEEecce-------EEeeeeeeeeee----ccCc--ccceeeEEeeee-eeeEEEcCCeEEEE Q lcl|NC_019506. 211 NNMGSLT---NGTG--AIAGVKMA-------CTFAEQIVQTEA----YRME--KRFADAVKGLNV-FGCKVIYPDALVCL 271 (276) Q Consensus 211 ~~lp~~~---~~~~--~~~~~~~a-------~~~~~~~~~~e~----~~~~--~~~~~~i~~~~~-yg~~v~~~~~vv~~ 271 (276) ..+-... ++.. ...+.+.. ......+++... ..+. ...+..+....+ .|+.+.+|.+++.+ T Consensus 300 peL~~a~~~g~g~~~~~~~~~~e~~~~~~~s~~~p~~f~q~~p~~~~~l~ve~~~~~~~~~~s~~t~Gv~i~~P~ai~~~ 379 (382) T protein:vir:96 300 PELSGVQMQGKTPEDALVLFVEEVDASVDGSTDGGSVFSQLVQSKFITLGVEKRAKSYVEDFSNGTAGALCKRPWAVVRY 379 (382) T ss_pred cccccccCCCccceeEEEEecchhhhhcccccccCcceeccccceeeeccceeecceeEeccccceeeeEEEcchhhhhc Confidence 4442211 1111 11111110 000111111000 0011 112223333333 56767788888876 Q ss_pred Eec Q lcl|NC_019506. 272 KKT 274 (276) Q Consensus 272 ~~~ 274 (276) .== T Consensus 380 ~GI 382 (382) T protein:vir:96 380 LGI 382 (382) T ss_pred cCC Confidence 533 Done!