Query lcl|NC_011288.1_cdsid_YP_002241691.1 [gene=6] [protein=gp6] [protein_id=YP_002241691.1] [location=5063..5884] Match_columns 273 No_of_seqs 126 out of 305 Neff 9.1 Searched_HMMs 1612 Date Thu Nov 7 13:56:07 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_6 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_6_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:105822 Length: 273 100.0 4.6E-74 2.9E-77 422.6 29.4 273 1-273 1-273 (273) 2 protein:vir:102605 Length: 273 100.0 4.6E-74 2.9E-77 422.6 29.4 273 1-273 1-273 (273) 3 protein:vir:7990 Length: 273 # 100.0 3.6E-73 2.3E-76 417.7 28.9 273 1-273 1-273 (273) 4 protein:vir:94622 Length: 341 100.0 3.3E-58 2E-61 335.7 22.0 269 1-273 3-339 (341) 5 protein:vir:78739 Length: 332 100.0 4.4E-54 2.7E-57 313.1 19.2 266 1-271 7-332 (332) 6 protein:vir:3364 Length: 347 # 100.0 1.7E-53 1E-56 309.9 19.9 267 1-273 1-345 (347) 7 protein:vir:1541 Length: 347 # 100.0 9.2E-52 5.7E-55 300.4 21.7 268 1-273 1-345 (347) 8 protein:vir:94711 Length: 347 100.0 1.6E-52 1E-55 304.5 16.9 267 1-273 1-346 (347) 9 protein:vir:10450 Length: 344 100.0 3.3E-52 2E-55 302.8 18.4 265 1-271 1-344 (344) 10 protein:vir:8885 Length: 347 # 100.0 7.2E-51 4.4E-54 295.5 19.2 267 1-273 1-346 (347) 11 protein:vir:2201 Length: 345 # 100.0 2.8E-50 1.7E-53 292.2 19.4 268 1-273 1-345 (345) 12 protein:vir:3136 Length: 322 # 100.0 7.2E-51 4.5E-54 295.5 15.2 264 1-273 1-318 (322) 13 protein:vir:99075 Length: 392 100.0 4E-49 2.5E-52 285.9 23.3 268 1-273 1-307 (392) 14 protein:vir:80180 Length: 381 100.0 1.5E-49 9.4E-53 288.2 19.6 269 1-273 15-381 (381) 15 protein:vir:100057 Length: 375 100.0 1.1E-48 6.6E-52 283.6 22.8 269 1-273 1-370 (375) 16 protein:vir:94576 Length: 347 100.0 2.2E-49 1.4E-52 287.3 18.7 268 1-273 1-347 (347) 17 protein:vir:80930 Length: 278 100.0 5.5E-48 3.4E-51 279.7 24.9 266 1-273 1-277 (278) 18 protein:vir:108303 Length: 418 100.0 8.4E-48 5.2E-51 278.7 25.3 265 1-273 1-417 (418) 19 protein:vir:3525 Length: 423 # 100.0 3.5E-48 2.1E-51 280.8 21.2 268 1-273 1-303 (423) 20 protein:vir:96123 Length: 274 100.0 5.1E-47 3.2E-50 274.4 25.5 259 1-273 1-270 (274) 21 protein:vir:96262 Length: 274 100.0 1.1E-46 6.7E-50 272.6 24.9 258 1-273 1-269 (274) 22 protein:vir:95898 Length: 274 100.0 1.1E-46 6.7E-50 272.6 24.9 258 1-273 1-269 (274) 23 protein:vir:174 Length: 423 # 100.0 3.2E-47 2E-50 275.5 21.9 267 1-273 1-302 (423) 24 protein:vir:80213 Length: 334 100.0 1.3E-47 8.2E-51 277.6 19.6 267 1-273 1-332 (334) 25 protein:vir:1239 Length: 274 # 100.0 2.3E-46 1.4E-49 270.8 25.3 259 1-273 1-271 (274) 26 protein:vir:93742 Length: 274 100.0 3.2E-46 2E-49 270.0 25.5 259 1-273 1-271 (274) 27 protein:vir:94494 Length: 274 100.0 3.3E-46 2E-49 269.9 25.6 259 1-273 1-270 (274) 28 protein:vir:97433 Length: 274 100.0 3.3E-46 2E-49 269.9 25.6 259 1-273 1-270 (274) 29 protein:vir:105374 Length: 423 100.0 8.6E-47 5.4E-50 273.1 22.3 267 1-273 1-302 (423) 30 protein:vir:103323 Length: 364 100.0 1.3E-46 7.9E-50 272.2 22.8 269 1-273 1-339 (364) 31 protein:vir:3613 Length: 272 # 100.0 1.8E-46 1.1E-49 271.4 23.5 263 1-273 1-272 (272) 32 protein:vir:96833 Length: 275 100.0 5.2E-46 3.2E-49 268.9 23.9 260 1-273 3-271 (275) 33 protein:vir:105522 Length: 423 100.0 5.3E-46 3.3E-49 268.8 23.0 271 1-273 1-308 (423) 34 protein:vir:97331 Length: 319 100.0 7.6E-45 4.7E-48 262.5 25.8 265 1-273 25-294 (319) 35 protein:vir:94800 Length: 319 100.0 7.6E-45 4.7E-48 262.5 25.8 265 1-273 25-294 (319) 36 protein:vir:78935 Length: 335 100.0 9.4E-46 5.8E-49 267.4 20.3 267 1-273 1-328 (335) 37 protein:vir:99675 Length: 324 100.0 7.1E-46 4.4E-49 268.1 18.1 239 28-273 1-296 (324) 38 protein:vir:107120 Length: 329 100.0 2.5E-44 1.5E-47 259.6 25.9 265 1-273 36-305 (329) 39 protein:vir:6324 Length: 335 # 100.0 7.3E-45 4.6E-48 262.5 20.4 267 1-273 1-328 (335) 40 protein:vir:105334 Length: 276 100.0 9.1E-43 5.6E-46 251.1 23.9 260 1-273 1-271 (276) 41 protein:vir:97031 Length: 402 100.0 1.8E-43 1.1E-46 254.9 18.4 269 1-273 1-335 (402) 42 protein:vir:3033 Length: 272 # 100.0 2.9E-40 1.8E-43 237.4 24.9 260 1-273 1-269 (272) 43 protein:vir:9820 Length: 272 # 100.0 2.9E-40 1.8E-43 237.4 24.9 260 1-273 1-269 (272) 44 protein:vir:79008 Length: 299 100.0 8.8E-40 5.5E-43 234.7 26.9 271 1-273 1-299 (299) 45 protein:vir:7019 Length: 401 # 100.0 1.1E-39 6.8E-43 234.2 17.9 267 1-273 1-333 (401) 46 protein:vir:102655 Length: 322 100.0 6.5E-39 4E-42 229.9 21.0 268 1-273 13-321 (322) 47 protein:vir:95107 Length: 270 100.0 9E-39 5.6E-42 229.2 21.4 259 1-273 1-267 (270) 48 protein:vir:105645 Length: 400 100.0 8.1E-39 5E-42 229.4 18.3 268 1-273 1-333 (400) 49 protein:vir:78920 Length: 290 100.0 2.7E-37 1.7E-40 221.0 24.8 266 1-273 1-290 (290) 50 protein:vir:739 Length: 231 # 100.0 9.6E-36 6E-39 212.6 19.9 228 34-273 1-231 (231) 51 protein:vir:102335 Length: 312 100.0 3E-34 1.9E-37 204.4 25.1 270 1-273 1-310 (312) 52 protein:vir:105464 Length: 346 100.0 4.2E-34 2.6E-37 203.6 23.4 269 1-273 1-299 (346) 53 protein:vir:79712 Length: 285 100.0 2.1E-31 1.3E-34 188.8 22.9 267 1-273 1-283 (285) 54 protein:vir:99523 Length: 311 100.0 1.7E-30 1.1E-33 183.8 22.7 266 1-272 8-311 (311) 55 protein:vir:78090 Length: 302 100.0 4.6E-30 2.8E-33 181.4 22.4 268 1-273 1-300 (302) 56 protein:vir:9265 Length: 430 # 99.9 3.1E-29 1.9E-32 176.9 15.7 269 1-273 1-304 (430) 57 protein:vir:100939 Length: 430 99.9 3.1E-29 1.9E-32 176.9 15.7 269 1-273 1-304 (430) 58 protein:vir:2106 Length: 430 # 99.9 2.5E-28 1.5E-31 171.9 15.8 268 1-273 1-304 (430) 59 protein:vir:1781 Length: 221 # 99.9 3.5E-28 2.2E-31 171.1 13.8 185 77-273 1-202 (221) 60 protein:vir:5974 Length: 324 # 99.8 6.1E-22 3.8E-25 136.9 20.5 263 1-273 1-296 (324) 61 protein:vir:95451 Length: 313 99.8 2.4E-23 1.5E-26 144.6 12.8 269 1-273 1-311 (313) 62 protein:vir:102944 Length: 330 99.8 1.4E-20 8.7E-24 129.4 20.8 260 1-273 1-302 (330) 63 protein:vir:1583 Length: 351 # 99.8 2.8E-20 1.7E-23 127.8 20.1 262 1-273 1-300 (351) 64 protein:vir:94142 Length: 304 99.5 4.7E-15 2.9E-18 99.1 21.9 257 1-272 1-304 (304) 65 protein:vir:105905 Length: 304 99.5 4.7E-15 2.9E-18 99.1 21.9 257 1-272 1-304 (304) 66 protein:vir:41 Length: 299 # N 99.5 3.6E-15 2.2E-18 99.8 21.0 259 1-273 6-298 (299) 67 protein:vir:9309 Length: 324 # 99.5 7.8E-15 4.9E-18 97.9 20.9 258 1-273 30-315 (324) 68 protein:vir:96223 Length: 324 99.5 7.8E-15 4.8E-18 97.9 20.9 258 1-273 30-315 (324) 69 protein:vir:97148 Length: 324 99.5 1.1E-14 7.1E-18 97.0 21.1 258 1-273 31-315 (324) 70 protein:vir:99749 Length: 324 99.5 1.6E-14 9.8E-18 96.2 21.0 258 1-273 30-315 (324) 71 protein:vir:78830 Length: 324 99.5 1.9E-14 1.2E-17 95.8 21.3 258 1-273 30-315 (324) 72 protein:vir:96392 Length: 324 99.5 1.9E-14 1.2E-17 95.8 21.3 258 1-273 30-315 (324) 73 protein:vir:94771 Length: 298 99.5 3.2E-14 2E-17 94.5 22.0 262 1-272 1-298 (298) 74 protein:vir:1638 Length: 298 # 99.5 3.1E-14 1.9E-17 94.6 21.8 262 1-272 1-298 (298) 75 protein:vir:104085 Length: 320 99.5 3.6E-14 2.2E-17 94.3 21.7 262 1-273 14-317 (320) 76 protein:vir:9759 Length: 303 # 99.5 2.3E-14 1.4E-17 95.3 20.4 264 1-273 1-303 (303) 77 protein:vir:4339 Length: 395 # 99.5 6.1E-14 3.8E-17 93.0 22.4 258 1-273 117-395 (395) 78 protein:vir:2430 Length: 318 # 99.5 3.5E-14 2.2E-17 94.4 21.0 263 1-273 14-313 (318) 79 protein:vir:103955 Length: 324 99.5 3.7E-14 2.3E-17 94.2 21.0 258 1-273 30-315 (324) 80 protein:vir:7771 Length: 330 # 99.4 7.7E-14 4.8E-17 92.5 22.0 265 1-273 1-323 (330) 81 protein:vir:98339 Length: 415 99.4 7E-14 4.4E-17 92.7 21.7 264 1-273 120-404 (415) 82 protein:vir:79987 Length: 415 99.4 7E-14 4.4E-17 92.7 21.7 264 1-273 120-404 (415) 83 protein:vir:81100 Length: 415 99.4 7E-14 4.4E-17 92.7 21.7 264 1-273 120-404 (415) 84 protein:vir:78523 Length: 338 99.4 8.5E-14 5.3E-17 92.2 22.1 266 1-273 10-335 (338) 85 protein:vir:9410 Length: 415 # 99.4 5E-14 3.1E-17 93.5 20.8 264 1-273 127-404 (415) 86 protein:vir:9574 Length: 300 # 99.4 8.3E-14 5.2E-17 92.3 22.0 263 1-273 1-300 (300) 87 protein:vir:80684 Length: 315 99.4 4.6E-14 2.8E-17 93.7 20.5 265 1-273 1-306 (315) 88 protein:vir:1886 Length: 385 # 99.4 1.1E-13 6.9E-17 91.6 22.5 258 1-273 105-384 (385) 89 protein:vir:191 Length: 385 # 99.4 1.1E-13 6.9E-17 91.6 22.5 258 1-273 105-384 (385) 90 protein:vir:78223 Length: 333 99.4 1E-13 6.5E-17 91.8 21.9 266 1-273 20-332 (333) 91 protein:vir:95763 Length: 297 99.4 9.3E-14 5.8E-17 92.0 21.6 258 1-273 9-296 (297) 92 protein:vir:4226 Length: 326 # 99.4 9.5E-14 5.9E-17 92.0 21.3 265 1-273 22-323 (326) 93 protein:vir:4700 Length: 415 # 99.4 1.4E-13 8.6E-17 91.1 21.7 261 1-273 120-404 (415) 94 protein:vir:4600 Length: 415 # 99.4 1.4E-13 8.6E-17 91.1 21.7 261 1-273 120-404 (415) 95 protein:vir:8187 Length: 311 # 99.4 2.4E-13 1.5E-16 89.8 22.2 263 1-273 1-310 (311) 96 protein:vir:100135 Length: 418 99.4 2.1E-13 1.3E-16 90.1 21.8 258 1-273 136-415 (418) 97 protein:vir:97053 Length: 390 99.4 2.9E-13 1.8E-16 89.3 21.5 254 1-271 113-390 (390) 98 protein:vir:94673 Length: 419 99.4 3.4E-13 2.1E-16 88.9 21.8 260 1-273 130-417 (419) 99 protein:vir:2344 Length: 397 # 99.4 2E-13 1.3E-16 90.2 20.4 264 1-273 10-306 (397) 100 protein:vir:6242 Length: 390 # 99.4 1.5E-13 9.4E-17 90.9 19.5 260 1-273 116-389 (390) 101 protein:vir:485 Length: 407 # 99.4 2.7E-13 1.7E-16 89.5 19.9 262 1-273 106-400 (407) 102 protein:vir:1328 Length: 392 # 99.4 4E-13 2.5E-16 88.5 20.4 260 1-273 114-391 (392) 103 protein:vir:81070 Length: 390 99.4 6.9E-13 4.3E-16 87.3 21.7 256 1-271 113-390 (390) 104 protein:vir:100247 Length: 425 99.3 4E-13 2.5E-16 88.5 19.9 262 1-273 130-424 (425) 105 protein:vir:4511 Length: 409 # 99.3 2.6E-13 1.6E-16 89.6 18.9 266 1-273 117-406 (409) 106 protein:vir:4456 Length: 401 # 99.3 3E-13 1.9E-16 89.3 19.1 262 1-273 107-401 (401) 107 protein:vir:104256 Length: 458 99.3 9.2E-13 5.7E-16 86.6 21.6 266 1-273 165-458 (458) 108 protein:vir:8102 Length: 543 # 99.3 8E-13 5E-16 86.9 20.8 260 1-273 251-542 (543) 109 protein:vir:80446 Length: 367 99.3 3.2E-13 2E-16 89.1 17.9 261 1-273 1-339 (367) 110 protein:vir:99920 Length: 311 99.3 1.1E-12 6.9E-16 86.1 20.4 264 1-272 1-311 (311) 111 protein:vir:93616 Length: 645 99.3 1.1E-12 6.5E-16 86.3 20.1 267 1-273 344-639 (645) 112 protein:vir:80376 Length: 435 99.3 3.4E-12 2.1E-15 83.5 22.2 262 1-273 130-433 (435) 113 protein:vir:10364 Length: 390 99.3 3.5E-12 2.1E-15 83.4 22.0 256 1-271 114-390 (390) 114 protein:vir:101607 Length: 379 99.3 2.7E-12 1.7E-15 84.0 21.3 259 1-273 109-379 (379) 115 protein:vir:3870 Length: 400 # 99.3 1.3E-12 7.8E-16 85.8 19.0 253 1-273 140-399 (400) 116 protein:vir:4856 Length: 293 # 99.3 3.5E-12 2.2E-15 83.4 21.2 258 1-273 5-281 (293) 117 protein:vir:100172 Length: 394 99.3 3.4E-12 2.1E-15 83.4 21.1 259 1-273 111-384 (394) 118 protein:vir:5739 Length: 366 # 99.3 8.1E-12 5E-15 81.4 22.5 262 1-273 64-366 (366) 119 protein:vir:1433 Length: 435 # 99.3 5.4E-12 3.4E-15 82.3 21.5 262 1-273 130-433 (435) 120 protein:vir:95376 Length: 425 99.3 3.3E-12 2E-15 83.5 19.9 259 1-273 141-421 (425) 121 protein:vir:4830 Length: 397 # 99.2 4.8E-12 3E-15 82.6 20.7 259 1-273 109-385 (397) 122 protein:vir:9704 Length: 394 # 99.2 7.5E-12 4.7E-15 81.6 20.9 251 1-273 133-390 (394) 123 protein:vir:9927 Length: 295 # 99.2 4E-13 2.5E-16 88.6 13.8 251 1-273 1-288 (295) 124 protein:vir:6212 Length: 434 # 99.2 2.7E-12 1.7E-15 84.0 17.8 264 1-273 141-429 (434) 125 protein:vir:2504 Length: 305 # 99.2 9.3E-12 5.8E-15 81.1 20.7 256 1-273 1-298 (305) 126 protein:vir:4997 Length: 397 # 99.2 1.4E-11 8.5E-15 80.1 21.1 258 1-273 109-385 (397) 127 protein:vir:4953 Length: 397 # 99.2 2.1E-11 1.3E-14 79.2 21.7 258 1-273 109-385 (397) 128 protein:vir:108211 Length: 318 99.2 2.8E-12 1.7E-15 83.9 16.9 260 1-273 22-317 (318) 129 protein:vir:96762 Length: 632 99.2 6.3E-12 3.9E-15 82.0 18.7 257 1-272 357-632 (632) 130 protein:vir:81160 Length: 371 99.2 1.9E-11 1.2E-14 79.4 21.3 257 1-273 91-371 (371) 131 protein:vir:3991 Length: 404 # 99.2 2.5E-11 1.5E-14 78.7 21.2 258 1-273 116-393 (404) 132 protein:vir:1383 Length: 421 # 99.2 2E-11 1.2E-14 79.2 20.1 255 1-273 116-385 (421) 133 protein:vir:105038 Length: 428 99.2 4.8E-11 3E-14 77.1 22.0 262 1-273 125-428 (428) 134 protein:vir:1025 Length: 408 # 99.2 2.7E-11 1.7E-14 78.5 20.4 258 1-273 116-393 (408) 135 protein:vir:100884 Length: 389 99.1 3.1E-11 1.9E-14 78.2 20.6 257 1-273 109-382 (389) 136 protein:vir:81227 Length: 413 99.1 8.7E-11 5.4E-14 75.7 22.2 262 1-273 118-410 (413) 137 protein:vir:7409 Length: 408 # 99.1 5.2E-11 3.2E-14 77.0 20.5 258 1-273 116-393 (408) 138 protein:vir:4197 Length: 314 # 99.1 1.1E-10 7.1E-14 75.1 22.3 265 1-273 14-313 (314) 139 protein:vir:1268 Length: 397 # 99.1 6.2E-11 3.8E-14 76.6 20.8 257 1-273 123-397 (397) 140 protein:vir:4092 Length: 390 # 99.1 1.3E-10 8.2E-14 74.7 21.8 258 1-273 84-368 (390) 141 protein:vir:3845 Length: 395 # 99.1 1.6E-10 9.8E-14 74.3 21.6 258 1-273 105-383 (395) 142 protein:vir:78640 Length: 352 99.1 2.6E-11 1.6E-14 78.6 17.2 251 1-273 83-346 (352) 143 protein:vir:78387 Length: 349 99.0 9.5E-11 5.9E-14 75.5 19.2 263 1-273 1-319 (349) 144 protein:vir:102119 Length: 404 99.0 9.8E-11 6.1E-14 75.5 19.0 265 1-273 110-400 (404) 145 protein:vir:9875 Length: 296 # 99.0 1.2E-11 7.5E-15 80.4 13.6 248 1-273 1-295 (296) 146 protein:vir:962 Length: 397 # 99.0 1.4E-10 8.6E-14 74.6 18.6 252 1-273 138-397 (397) 147 protein:vir:94989 Length: 349 99.0 3E-10 1.9E-13 72.8 19.5 263 1-273 1-319 (349) 148 protein:vir:93881 Length: 387 99.0 1.4E-10 8.8E-14 74.6 17.4 251 1-273 118-381 (387) 149 protein:vir:9361 Length: 402 # 99.0 8.7E-11 5.4E-14 75.7 16.2 250 1-273 133-396 (402) 150 protein:vir:2685 Length: 387 # 99.0 9.7E-11 6E-14 75.5 16.1 250 1-273 118-381 (387) 151 protein:vir:96978 Length: 387 99.0 9.7E-11 6E-14 75.5 16.1 250 1-273 118-381 (387) 152 protein:vir:94424 Length: 387 99.0 9.7E-11 6E-14 75.5 16.1 250 1-273 118-381 (387) 153 protein:vir:1084 Length: 437 # 98.9 4.5E-10 2.8E-13 71.8 19.2 256 1-273 156-427 (437) 154 protein:vir:8420 Length: 477 # 98.9 2.7E-10 1.7E-13 73.1 17.9 267 1-273 157-471 (477) 155 protein:vir:106647 Length: 303 98.9 2.5E-10 1.6E-13 73.2 15.5 252 1-273 1-296 (303) 156 protein:vir:102873 Length: 392 98.9 1.6E-09 1E-12 68.8 19.7 257 1-273 106-384 (392) 157 protein:vir:102082 Length: 392 98.9 1.6E-09 1E-12 68.8 19.7 257 1-273 106-384 (392) 158 protein:vir:105004 Length: 392 98.9 1.6E-09 1E-12 68.8 19.7 257 1-273 106-384 (392) 159 protein:vir:107593 Length: 392 98.9 1.6E-09 1E-12 68.8 19.7 257 1-273 106-384 (392) 160 protein:vir:101650 Length: 497 98.8 6.8E-09 4.2E-12 65.4 21.4 262 1-273 151-493 (497) 161 protein:vir:7855 Length: 497 # 98.8 6.8E-09 4.2E-12 65.4 21.4 262 1-273 151-493 (497) 162 protein:vir:80128 Length: 466 98.7 2.7E-09 1.7E-12 67.6 16.1 259 1-273 154-448 (466) 163 protein:vir:95875 Length: 401 98.7 3.9E-09 2.4E-12 66.7 16.1 266 1-273 19-361 (401) 164 protein:vir:3158 Length: 321 # 98.6 3.6E-08 2.2E-11 61.4 19.9 262 1-273 19-311 (321) 165 protein:vir:4159 Length: 315 # 98.6 3.3E-08 2.1E-11 61.6 19.3 264 1-272 19-315 (315) 166 protein:vir:9509 Length: 381 # 98.6 2.6E-08 1.6E-11 62.2 17.2 252 1-273 76-368 (381) 167 protein:vir:101291 Length: 381 98.6 2.6E-08 1.6E-11 62.2 17.2 252 1-273 76-368 (381) 168 protein:vir:95963 Length: 395 98.5 9.3E-08 5.8E-11 59.1 19.0 252 1-273 86-376 (395) 169 protein:vir:100632 Length: 381 98.5 5.4E-08 3.3E-11 60.4 17.5 253 1-273 80-370 (381) 170 protein:vir:9643 Length: 377 # 98.5 6.9E-08 4.3E-11 59.8 18.0 253 1-273 82-377 (377) 171 protein:vir:93696 Length: 364 98.4 1.5E-07 9.5E-11 58.0 18.8 268 1-273 1-361 (364) 172 protein:vir:2770 Length: 318 # 98.4 5.4E-07 3.3E-10 55.0 19.7 224 1-234 22-318 (318) 173 protein:vir:78350 Length: 383 98.3 3.2E-07 2E-10 56.2 17.3 252 1-273 83-375 (383) 174 protein:vir:95131 Length: 325 98.3 7.6E-07 4.7E-10 54.1 19.0 259 1-273 1-299 (325) 175 protein:vir:3969 Length: 287 # 98.3 2.3E-07 1.4E-10 57.0 15.9 267 1-273 1-286 (287) 176 protein:vir:98635 Length: 377 98.2 9.2E-07 5.7E-10 53.7 17.2 252 1-273 79-377 (377) 177 protein:vir:79928 Length: 393 98.2 4.7E-07 2.9E-10 55.3 15.5 263 1-273 74-377 (393) 178 protein:vir:8324 Length: 410 # 98.1 5.7E-07 3.5E-10 54.8 14.3 251 1-271 136-410 (410) 179 protein:vir:96792 Length: 315 98.1 5.8E-06 3.6E-09 49.3 20.0 258 1-273 1-281 (315) 180 protein:vir:98871 Length: 314 97.9 3.4E-06 2.1E-09 50.6 15.4 268 1-273 21-311 (314) 181 protein:vir:79548 Length: 652 97.7 2.3E-05 1.4E-08 46.0 19.1 259 1-270 359-652 (652) 182 protein:vir:819 Length: 404 # 97.7 2.6E-05 1.6E-08 45.7 19.1 262 1-273 22-389 (404) 183 protein:vir:104439 Length: 404 97.7 2.6E-05 1.6E-08 45.7 19.1 262 1-273 22-389 (404) 184 protein:vir:10123 Length: 404 97.7 2.6E-05 1.6E-08 45.7 19.1 262 1-273 22-389 (404) 185 protein:vir:3298 Length: 404 # 97.7 2.6E-05 1.6E-08 45.7 19.1 262 1-273 22-389 (404) 186 protein:vir:105610 Length: 430 97.5 5.3E-05 3.3E-08 44.0 17.7 264 1-273 1-406 (430) 187 protein:vir:94528 Length: 286 97.5 2.3E-05 1.4E-08 46.0 14.7 262 1-273 1-285 (286) 188 protein:vir:95512 Length: 693 97.4 7.5E-05 4.7E-08 43.2 18.0 260 1-273 394-693 (693) 189 protein:vir:97397 Length: 517 97.2 0.00011 6.7E-08 42.3 15.8 254 1-273 237-516 (517) 190 protein:vir:97255 Length: 310 97.0 0.00024 1.5E-07 40.4 21.2 263 1-273 1-310 (310) 191 protein:vir:94933 Length: 330 96.5 0.00059 3.6E-07 38.3 17.1 261 1-273 25-329 (330) 192 protein:vir:4074 Length: 480 # 95.4 0.00064 4E-07 38.1 10.0 257 1-273 184-477 (480) 193 protein:vir:80068 Length: 301 94.8 0.0034 2.1E-06 34.1 18.5 258 1-271 1-301 (301) 194 protein:vir:107687 Length: 319 94.4 0.0046 2.8E-06 33.4 17.9 258 1-271 24-319 (319) 195 protein:vir:99424 Length: 360 92.2 0.012 7.7E-06 31.0 18.5 261 1-273 1-357 (360) 196 protein:vir:4786 Length: 295 # 91.1 0.018 1.1E-05 30.2 14.9 247 1-252 1-295 (295) 197 protein:vir:94070 Length: 339 88.9 0.029 1.8E-05 29.0 16.3 256 1-271 49-339 (339) 198 protein:vir:103285 Length: 296 87.3 0.04 2.5E-05 28.3 19.3 259 1-273 1-294 (296) 199 protein:vir:103886 Length: 302 87.2 0.04 2.5E-05 28.2 20.2 254 1-273 1-302 (302) 200 protein:vir:79642 Length: 329 84.2 0.063 3.9E-05 27.2 18.1 259 1-273 31-328 (329) 201 protein:vir:78148 Length: 123 82.2 0.027 1.7E-05 29.2 6.4 108 164-273 1-123 (123) 202 protein:vir:104342 Length: 314 75.8 0.14 8.8E-05 25.2 17.3 259 1-273 1-312 (314) 203 protein:vir:10324 Length: 320 72.4 0.18 0.00011 24.6 14.2 253 4-273 1-317 (320) 204 protein:vir:79399 Length: 455 72.2 0.19 0.00012 24.6 10.5 261 1-273 45-354 (455) 205 protein:vir:79078 Length: 307 72.1 0.19 0.00012 24.6 13.7 268 1-273 1-307 (307) 206 protein:vir:15 Length: 472 # N 62.9 0.33 0.0002 23.2 11.7 257 1-273 52-362 (472) 207 protein:vir:107882 Length: 307 50.5 0.61 0.00038 21.8 15.0 268 1-273 1-307 (307) 208 protein:vir:101557 Length: 336 50.4 0.61 0.00038 21.8 13.7 254 1-271 45-336 (336) 209 protein:vir:3643 Length: 336 # 50.2 0.62 0.00038 21.7 13.2 254 1-271 45-336 (336) 210 protein:vir:2736 Length: 348 # 32.0 1.5 0.0009 19.7 14.5 260 1-273 1-329 (348) 211 protein:vir:78558 Length: 336 31.2 1.5 0.00094 19.6 13.1 255 1-271 45-336 (336) 212 protein:vir:5942 Length: 523 # 29.4 1.7 0.001 19.4 14.1 260 1-273 193-521 (523) 213 protein:vir:96490 Length: 348 23.9 2.2 0.0014 18.7 16.2 266 1-273 1-329 (348) 214 protein:vir:99888 Length: 309 23.5 2.3 0.0014 18.6 12.7 265 1-273 1-308 (309) No 1 >protein:vir:105822 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655767;genbank:gi:109522090;genbank:GeneID:4157630 Probab=100.00 E-value=4.6e-74 Score=422.63 Aligned_cols=273 Identities=99% Similarity=1.356 Sum_probs=263.2 Q ss_pred CccchhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeecCCCcccCCCCCccceEEEEEeee Q lcl|NC_011288. 1 MAFNNFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLLIDQE 80 (273) Q Consensus 1 MA~~~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~id~~ 80 (273) ||+++|+||+|++++++.|++.+++.++++++|++++++||||+||+++.+++++|++.++++.++++++++++++||++ T Consensus 1 MA~~~~~pe~~~~~v~~~~~~~lv~~~l~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~tid~~ 80 (273) T protein:vir:10 1 MAFNNFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLLIDQE 80 (273) T ss_pred CcchhhhHHHHHHHHHHHHHhhhccchhhccccccccccCceEEEeecccccccccccCCCccCccccccceEEEEEeee Confidence 99999999999999999999999999999999999999999999999999999999988888889999999999999999 Q ss_pred eecceEEchHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccCCCHHHHHHHHHHHHHHHhhcCCCccC Q lcl|NC_011288. 81 KSIDFLVDDIDRVQVAGSLEAYTRAGATALATDTDKFIADLLVDNGTALSGSAPTDADDAFDLIATALKELTKANVPNVG 160 (273) Q Consensus 81 ~~~~~~i~d~d~~~~~~~~~~~~~~~~~ala~~~D~~i~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~~~l~~~~vP~~~ 160 (273) +++++.|+|+|+.+..++++++++|++++||+++|+++++++..++....+++++++.++++.|.+|+++|++++||.++ T Consensus 81 ~~~~~~i~d~d~~~~~~~~~~~~~~~~~alA~~vD~~i~~~~~~a~~~~~~~~~~~~~~~~~~i~~a~~~ld~~~vP~~~ 160 (273) T protein:vir:10 81 KSIDFLVDDIDRVQVAGSLEAYTRAGATALATDTDKFIADMLVDNGTALTGSAPTDADDAFDLIAKALKELTKANVPNVG 160 (273) T ss_pred eecceEeecHHHhhhhccHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccchhHHHHHHHHHHHHhhhcCCCcCC Confidence 99999999999999999988899999999999999999999998887777777888899999999999999999999999 Q ss_pred CEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCccccCCCcEEEEEcCceeEEeeeeeeehhhc Q lcl|NC_011288. 161 RVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYVSQIDTVEALR 240 (273) Q Consensus 161 r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~~~~~~~~~~~~a~~~a~~~~~~e~~~ 240 (273) |++||+|+++..|+++++++.+.+..++...+++|.||+++||+|++|+++|.+++..++++|++|+++++|++++|..| T Consensus 161 R~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~i~G~~v~~s~~lp~~~~~~~~~~~~~A~~~a~q~~~~e~~r 240 (273) T protein:vir:10 161 RVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYVSQIDTVEALR 240 (273) T ss_pred CEEEECHHHHHHHhcchhhhhhhhccccccceeeeeeeEEeceEEEEecccccCCccEEEEEeccceeeeeeeehhhccc Confidence 99999999999999998889888988888899999999999999999999999888899999999999999999999999 Q ss_pred CCCceeeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 241 DQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 241 ~~~~~~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) ++++|+|.|+|+++||++++|||++++|+++|| T Consensus 241 ~~~~~~~~v~~~~~yg~~v~~~~~~~~l~~~g~ 273 (273) T protein:vir:10 241 DQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred CCCcceeeeeeeeeeeeeEeccceEEEEeccCC Confidence 999999999999999999999999999999999 No 2 >protein:vir:102605 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_655002;genbank:gi:109392192;genbank:GeneID:4157227 Probab=100.00 E-value=4.6e-74 Score=422.63 Aligned_cols=273 Identities=99% Similarity=1.356 Sum_probs=263.2 Q ss_pred CccchhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeecCCCcccCCCCCccceEEEEEeee Q lcl|NC_011288. 1 MAFNNFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLLIDQE 80 (273) Q Consensus 1 MA~~~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~id~~ 80 (273) ||+++|+||+|++++++.|++.+++.++++++|++++++||||+||+++.+++++|++.++++.++++++++++++||++ T Consensus 1 MA~~~~~pe~~~~~v~~~~~~~lv~~~l~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~tid~~ 80 (273) T protein:vir:10 1 MAFNNFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLLIDQE 80 (273) T ss_pred CcchhhhHHHHHHHHHHHHHhhhccchhhccccccccccCceEEEeecccccccccccCCCccCccccccceEEEEEeee Confidence 99999999999999999999999999999999999999999999999999999999988888889999999999999999 Q ss_pred eecceEEchHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccCCCHHHHHHHHHHHHHHHhhcCCCccC Q lcl|NC_011288. 81 KSIDFLVDDIDRVQVAGSLEAYTRAGATALATDTDKFIADLLVDNGTALSGSAPTDADDAFDLIATALKELTKANVPNVG 160 (273) Q Consensus 81 ~~~~~~i~d~d~~~~~~~~~~~~~~~~~ala~~~D~~i~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~~~l~~~~vP~~~ 160 (273) +++++.|+|+|+.+..++++++++|++++||+++|+++++++..++....+++++++.++++.|.+|+++|++++||.++ T Consensus 81 ~~~~~~i~d~d~~~~~~~~~~~~~~~~~alA~~vD~~i~~~~~~a~~~~~~~~~~~~~~~~~~i~~a~~~ld~~~vP~~~ 160 (273) T protein:vir:10 81 KSIDFLVDDIDRVQVAGSLEAYTRAGATALATDTDKFIADMLVDNGTALTGSAPTDADDAFDLIAKALKELTKANVPNVG 160 (273) T ss_pred eecceEeecHHHhhhhccHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccchhHHHHHHHHHHHHhhhcCCCcCC Confidence 99999999999999999988899999999999999999999998887777777888899999999999999999999999 Q ss_pred CEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCccccCCCcEEEEEcCceeEEeeeeeeehhhc Q lcl|NC_011288. 161 RVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYVSQIDTVEALR 240 (273) Q Consensus 161 r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~~~~~~~~~~~~a~~~a~~~~~~e~~~ 240 (273) |++||+|+++..|+++++++.+.+..++...+++|.||+++||+|++|+++|.+++..++++|++|+++++|++++|..| T Consensus 161 R~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~i~G~~v~~s~~lp~~~~~~~~~~~~~A~~~a~q~~~~e~~r 240 (273) T protein:vir:10 161 RVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYVSQIDTVEALR 240 (273) T ss_pred CEEEECHHHHHHHhcchhhhhhhhccccccceeeeeeeEEeceEEEEecccccCCccEEEEEeccceeeeeeeehhhccc Confidence 99999999999999998889888988888899999999999999999999999888899999999999999999999999 Q ss_pred CCCceeeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 241 DQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 241 ~~~~~~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) ++++|+|.|+|+++||++++|||++++|+++|| T Consensus 241 ~~~~~~~~v~~~~~yg~~v~~~~~~~~l~~~g~ 273 (273) T protein:vir:10 241 DQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred CCCcceeeeeeeeeeeeeEeccceEEEEeccCC Confidence 999999999999999999999999999999999 No 3 >protein:vir:7990 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817344;genbank:gi:29565772;genbank:GeneID:1258978 Probab=100.00 E-value=3.6e-73 Score=417.72 Aligned_cols=273 Identities=98% Similarity=1.353 Sum_probs=262.1 Q ss_pred CccchhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeecCCCcccCCCCCccceEEEEEeee Q lcl|NC_011288. 1 MAFNNFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLLIDQE 80 (273) Q Consensus 1 MA~~~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~id~~ 80 (273) ||+++|+||+|++++++.|++.+++.++++++|++++.+||||+||+++.+++.+|++.++++.++++++++++++||++ T Consensus 1 MA~~~~~pei~~~~v~~~~~~~lv~~~l~~~~~~~~~~~GdTv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~tid~~ 80 (273) T protein:vir:79 1 MAFNNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLLIDQE 80 (273) T ss_pred CcchhhhHHHHHHHHHHHHHhhccchhhhhccccccccCCcEEEEeecCcccccccccCCCccCccccccceEEEEEeee Confidence 99999999999999999999999999999999999889999999999999999999988888889999999999999999 Q ss_pred eecceEEchHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccCCCHHHHHHHHHHHHHHHhhcCCCccC Q lcl|NC_011288. 81 KSIDFLVDDIDRVQVAGSLEAYTRAGATALATDTDKFIADLLVDNGTALSGSAPTDADDAFDLIATALKELTKANVPNVG 160 (273) Q Consensus 81 ~~~~~~i~d~d~~~~~~~~~~~~~~~~~ala~~~D~~i~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~~~l~~~~vP~~~ 160 (273) +++++.|+|+|+.+.+++++++++|++++||+++|+++++++..++.....++..++.++++.|.+++++|++++||.++ T Consensus 81 ~~~~~~i~d~d~~~~~~~~~~~~~~~~~ala~~vD~~i~~~~~~a~~~~~~~~~~~~~~~~~~i~~a~~~ld~~~vP~~~ 160 (273) T protein:vir:79 81 KSIDFLVDDIDRVQVAGSLEAYTRAGATALATDTDKFIADMLVDNGTALTGSAPSDADDAFDLIASALKELTKANVPNVG 160 (273) T ss_pred cccceeeccHHHHhhcccHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccchhhHHHHHHHHHHHhhhccCCccC Confidence 99999999999999999998899999999999999999999988877777777888889999999999999999999999 Q ss_pred CEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCccccCCCcEEEEEcCceeEEeeeeeeehhhc Q lcl|NC_011288. 161 RVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYVSQIDTVEALR 240 (273) Q Consensus 161 r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~~~~~~~~~~~~a~~~a~~~~~~e~~~ 240 (273) |++||+|+++..|+++++++.+.+..++...+++|.||+|+||+|++|+++|.+++..++++|++|+++++|++++|..| T Consensus 161 R~lvv~p~~~~~Ll~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~i~~s~~lp~~~~~~~~a~~~~A~~~a~~~~~~e~~r 240 (273) T protein:vir:79 161 RVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYVSQIDTVEALR 240 (273) T ss_pred cEEEECHHHHHHHhhchhhhhhhhhcccccceeeeEeeEEeceEEEecccccccCceEEEEEeccceeeeeehhhhhccc Confidence 99999999999999998888888888888899999999999999999999999888889999999999999999999999 Q ss_pred CCCceeeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 241 DQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 241 ~~~~~~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) ++++|+|+|+|+++||++++|||++++|+++|| T Consensus 241 ~~~~~~~~v~~~~~yg~~v~~p~~vv~~~~~g~ 273 (273) T protein:vir:79 241 DQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred CcccceeeeeeeeeeeeEEecCceEEEEeccCC Confidence 999999999999999999999999999999999 No 4 >protein:vir:94622 Length: 341 # NCBI annotation: PfWMP4_37 # Family: family:all:2203 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762667;genbank:gi:115304375;genbank:GeneID:5142322 Probab=100.00 E-value=3.3e-58 Score=335.74 Aligned_cols=269 Identities=19% Similarity=0.266 Sum_probs=228.1 Q ss_pred Cccc------------hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeecCCCcccCCCCC Q lcl|NC_011288. 1 MAFN------------NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSADAI 68 (273) Q Consensus 1 MA~~------------~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~~~~~~~ 68 (273) |+|+ +|+||+|++++++.|++.++|.+++ ++|+.++.+||||+||+++.+++.+|+++ .+++++++ T Consensus 3 ~~~~~~~~~~~t~~v~~fipei~s~~i~~~l~~~~v~~~~~-~d~~~~~~~Gdtv~ip~~g~~~~~d~~~~-~~i~~~~~ 80 (341) T protein:vir:94 3 LGNTITGPSINTQRGQQFIPEQWLSEVQMFRKAKMLDTSVV-KTWGAQVKKGDTFHVPRISELGVEDKATD-VPVGVQPV 80 (341) T ss_pred chhhhccccccchhHHHHHHHHHHHHHHHHHHhhcchhhcc-ccccccccCCceEEEeccCcceeeeecCC-Cccccccc Confidence 3333 3899999999999999999999987 68988889999999999999999999864 46788999 Q ss_pred ccceEEEEEeeeeecceEEchHHHHhhhHHH-HHHHHHHHHHHHHHHHHHHHHHHhhccccccc---------ccCCCHH Q lcl|NC_011288. 69 SDTGVDLLIDQEKSIDFLVDDIDRVQVAGSL-EAYTRAGATALATDTDKFIADLLVDNGTALSG---------SAPTDAD 138 (273) Q Consensus 69 ~~~~~~~~id~~~~~~~~i~d~d~~~~~~~~-~~~~~~~~~ala~~~D~~i~~~~~~~~~~~~~---------~~~~t~~ 138 (273) ++++++++||+++++++.|+|+|+.+.++++ .+++++++++||+++|+++++++...+..... .++.... T Consensus 81 ~~~~~~itiD~~~~~~~~i~d~d~~~~~~d~~~~~~~~~~~aLA~~~D~~i~~~~a~~~~~~~~~~~~~~~~~~t~~~~~ 160 (341) T protein:vir:94 81 NDTDFVITVDTDRTTAVALDDLLEIQASYDLRAPYLEAMGYALAKDMTGSILGLRAAVQNTASQNVFSSSNGAITGNGQA 160 (341) T ss_pred cCceEEEEEeeeeecceeechHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCccccCccccccCchhh Confidence 9999999999999999999999999999985 67999999999999999999988654322111 1111234 Q ss_pred HHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCccccCCCc- Q lcl|NC_011288. 139 DAFDLIATALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDE- 217 (273) Q Consensus 139 ~~~~~i~~a~~~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~~~~- 217 (273) ..++.|.++++.|++++||.++|++||+|+++..|++++++.. .+..+ +..+++|.||+++||+|++|+++|.+++. T Consensus 161 ~~~~~i~~a~~~Lde~~VP~~gR~lvv~P~~~~~Ll~~~~~~~-~~~~g-~~~l~~G~ig~i~G~~V~~Sn~lp~~~~~~ 238 (341) T protein:vir:94 161 FSFAVFLAARRLLLEADVPEEKIVLLISPGQESALFTIPQFIS-KDFIN-NAPIAQGQIGSLMGVRVIRTSLIGNNSATG 238 (341) T ss_pred hhHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHhhchhhhh-hhccc-cchhheeeeeeEeceEEEEecccccccccc Confidence 5789999999999999999999999999999999999987654 45554 45789999999999999999999864422 Q ss_pred ---------------------------------EEEEEcCceeEEeee------------eeeehhhcCCCceeeeEEee Q lcl|NC_011288. 218 ---------------------------------QFVAFHPSAAAYVSQ------------IDTVEALRDQDSFSDRIRAL 252 (273) Q Consensus 218 ---------------------------------~~~~~~~~a~~~a~~------------~~~~e~~~~~~~~~~~v~~~ 252 (273) ..+++|++|++.++- ...+|..|++.+++|+|.|+ T Consensus 239 ~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~gl~~~~~av~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~ 318 (341) T protein:vir:94 239 WRNGAPTIAPAEATPGFTGSRYLPKQDSFTSLPATFTGNSRPVHTAVMCHMDWAAAVVSKAPRVTQSFENREQVWLMVGR 318 (341) T ss_pred ccccccceecccccccccccccccccccccccEEEEEEecccccceeeecchhhhccccccccccccchhhhhhhhhhhh Confidence 237789999887762 23567778899999999999 Q ss_pred eeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 253 HVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 253 ~~~g~~v~~~~~~v~~~~~~s 273 (273) ++|||+++||||+|.|+.++- T Consensus 319 ~~~G~~~lrp~~~v~~~~~~~ 339 (341) T protein:vir:94 319 QAYGARLYRPLHAVNIHTTGD 339 (341) T ss_pred hhhcccccCcceeEEEecCcC Confidence 999999999999999999888 No 5 >protein:vir:78739 Length: 332 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285448;genbank:gi:148724482;genbank:GeneID:5220210 Probab=100.00 E-value=4.4e-54 Score=313.13 Aligned_cols=266 Identities=20% Similarity=0.231 Sum_probs=225.2 Q ss_pred Ccc-----------------chhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeecCCCccc Q lcl|NC_011288. 1 MAF-----------------NNFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQT 63 (273) Q Consensus 1 MA~-----------------~~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~~ 63 (273) |++ ..|+ |+|++++++.|++.++|.++++.. +++.|+|++||+++..++.+|+++. .+ T Consensus 7 ~~~~~~~~~~~~~~~~d~~~al~l-e~~~geV~~~f~~~s~~~~~~~~r---~i~~G~tv~i~~ig~~~~~~~~~g~-~l 81 (332) T protein:vir:78 7 FSLPNQANGGARNADYDVRYATAL-KLFSGEVFTAFNNASIFKGLVRSY---DLRGGKSKQFMFTGKLSAGYHTPGT-PI 81 (332) T ss_pred ccCCccccCCccccccccchhhhh-hhhhhhHHHHHHHHhhhhhccccc---cccccceEEEEeccceeEeeecCCC-CC Confidence 221 2455 999999999999999999998753 4567999999999999999998754 45 Q ss_pred CCC-CCccceEEEEEeeeeecceEEchHHHHhhhHHH-HHHHHHHHHHHHHHHHHHHHHHHhhcccc------------- Q lcl|NC_011288. 64 SAD-AISDTGVDLLIDQEKSIDFLVDDIDRVQVAGSL-EAYTRAGATALATDTDKFIADLLVDNGTA------------- 128 (273) Q Consensus 64 ~~~-~~~~~~~~~~id~~~~~~~~i~d~d~~~~~~~~-~~~~~~~~~ala~~~D~~i~~~~~~~~~~------------- 128 (273) .++ ++++++++++||+.+++.+.|+|+|+.+.++++ .++.++++++||+++|+.++..+..++.. T Consensus 82 ~~~~~~~~~~~~l~ID~~ky~~~~VddiD~~q~~~dl~~~~~~~~g~aLA~~~D~~i~~~l~~aa~~~~~~~~~~g~~~~ 161 (332) T protein:vir:78 82 VGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGEPGGFHV 161 (332) T ss_pred CCCCCCCCceEEEEEehhhhhHHHHHhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccCccccccccccc Confidence 554 589999999999999999999999999999986 56999999999999999999988654321 Q ss_pred -cccccCCCHHHHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhh-HHHHhhhhcccccceeeeee-eeeEeceEE Q lcl|NC_011288. 129 -LSGSAPTDADDAFDLIATALKELTKANVPNVGRVVVVNAEMAFWLRSS-GSKLTSADTSGDAAGLRAGT-IGNLLGARI 205 (273) Q Consensus 129 -~~~~~~~t~~~~~~~i~~a~~~l~~~~vP~~~r~lvv~p~~~~~L~~~-~~~~~~~~~~~~~~~l~~G~-ig~~~G~~v 205 (273) ...+..+++.++++.|.++++.|++++||.+|||+||+|++|..|+++ +..+.+.+..+....+++|. |++++||+| T Consensus 162 ~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~d~~~~n~~~~~~~~~~~~g~~i~~i~G~~V 241 (332) T protein:vir:78 162 NIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKGLYSIAGIRI 241 (332) T ss_pred ccCCccccCHHHHHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHHhhcCceeeeeeccccccceecceeeeEEeeeEE Confidence 122335677889999999999999999999999999999999999983 22355666666667888886 999999999 Q ss_pred EeeCccccCC---------------------CcEEEEEcCceeEEeeeee----eehhhcCCCceeeeEEeeeeeeeEEe Q lcl|NC_011288. 206 VESNNLRDTD---------------------DEQFVAFHPSAAAYVSQID----TVEALRDQDSFSDRIRALHVYGGKVV 260 (273) Q Consensus 206 ~~s~~l~~~~---------------------~~~~~~~~~~a~~~a~~~~----~~e~~~~~~~~~~~v~~~~~~g~~v~ 260 (273) |+||++|..+ ...++++|++|++++++.+ .+|.+|++++|+|.|.|+++||++++ T Consensus 242 ~~Sn~lp~~~g~~~~~~~~~~~~n~~~~~~~~~~~~~~h~~a~~~v~~~~~~~~~t~~~~~~~~~~d~i~~~~~~G~~v~ 321 (332) T protein:vir:78 242 LKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQGDLIVGKLAMGCGSL 321 (332) T ss_pred EecCccccCcccccccccccccccccccccccceEEeecccceeeeeeeccchhhhhcccchhhhHhhhhhhhhhcCcee Confidence 9999999543 2246889999999998654 56789999999999999999999999 Q ss_pred cCceEEEEecC Q lcl|NC_011288. 261 RPTGVVVFNKT 271 (273) Q Consensus 261 ~~~~~v~~~~~ 271 (273) |||++++|+++ T Consensus 322 rPe~~v~l~~a 332 (332) T protein:vir:78 322 RTSVAGSFQAA 332 (332) T ss_pred cccceEEEeeC Confidence 99999999999 No 6 >protein:vir:3364 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523335;genbank:gi:17570826;genbank:GeneID:927448 Probab=100.00 E-value=1.7e-53 Score=309.93 Aligned_cols=267 Identities=19% Similarity=0.155 Sum_probs=223.3 Q ss_pred Cccc---------------------hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeecCC Q lcl|NC_011288. 1 MAFN---------------------NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAA 59 (273) Q Consensus 1 MA~~---------------------~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~ 59 (273) |||+ +|+ |+|++++++.|++.++|.++++.. +++.|++++||+++..++.+|+++ T Consensus 1 ~~~~~~~~~~~t~~g~~~~~~~~~al~i-e~~~g~V~~~f~~~s~~~~~v~~r---~~~~G~sv~i~~iG~~t~~~~~~g 76 (347) T protein:vir:33 1 MANIQGGQQIGTNQGKGQSAADKLALFL-KVFGGEVLTAFARTSVTMPRHMLR---SIASGKSAQFPVIGRTKAAYLKPG 76 (347) T ss_pred CCCCccCcccccccccCCcccchHHHHH-HHHHHHHHHHHHHHHhhhhhhccc---cccccceeEeeeccceeeeeecCC Confidence 6644 255 999999999999999999999853 456799999999999999999875 Q ss_pred Cc-ccCCCCCccceEEEEEeeeeecceEEchHHHHhhhHHH-HHHHHHHHHHHHHHHHHHHHHHHhhccc---------- Q lcl|NC_011288. 60 GR-QTSADAISDTGVDLLIDQEKSIDFLVDDIDRVQVAGSL-EAYTRAGATALATDTDKFIADLLVDNGT---------- 127 (273) Q Consensus 60 ~~-~~~~~~~~~~~~~~~id~~~~~~~~i~d~d~~~~~~~~-~~~~~~~~~ala~~~D~~i~~~~~~~~~---------- 127 (273) .. +.+++++.+++.+++||+.+++.+.|+|+|+.++++++ .++.++++++||+++|+.++..+..... T Consensus 77 ~~l~~~~~~~~~~e~~ltiD~~~y~~~~VddiD~~q~~~D~~~~~~~~~g~aLA~~~D~~i~~~l~~~~~~~~~~~~~~~ 156 (347) T protein:vir:33 77 ENLDDKRKDIKHTEKVIHIDGLLTADVLIYDIEDAMNHYDVRAEYTAQLGESLAMAADGAVLAELAGLVNLPDGSNENIE 156 (347) T ss_pred CCCCCCCCCCccceEEEEechhhhhhHHHhhHHHHhcCCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccc Confidence 43 23456688999999999999999999999999999986 5699999999999999999866532100 Q ss_pred --------c-ccccc------CCCHHHHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhccccccee Q lcl|NC_011288. 128 --------A-LSGSA------PTDADDAFDLIATALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGL 192 (273) Q Consensus 128 --------~-~~~~~------~~t~~~~~~~i~~a~~~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l 192 (273) . ...++ ..++.++++.|.++++.|++++||.++||+||+|++|..|++++++ .+.++. +...+ T Consensus 157 ~~~~~~~~~~~~~~tg~~~d~~~~a~~i~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~-~~~d~~-~~~~~ 234 (347) T protein:vir:33 157 GLGKPTVLTLVKPTTGSLTDPVELGKAIIAQLTIARASLTKNYVPAADRTFYTTPDNYSAILAALMP-NAANYQ-ALLDP 234 (347) T ss_pred cccccccccccccccccccchhhhHHHHHHHHHHHHHHHhhcCCCccCcEEEeCHHHHHHHhccccc-cccccc-ccccc Confidence 0 00011 1124567999999999999999999999999999999999999864 455654 45578 Q ss_pred eeeeeeeEeceEEEeeCccccCCC-----------------------------cEEEEEcCceeEEeeeee-eehhhcCC Q lcl|NC_011288. 193 RAGTIGNLLGARIVESNNLRDTDD-----------------------------EQFVAFHPSAAAYVSQID-TVEALRDQ 242 (273) Q Consensus 193 ~~G~ig~~~G~~v~~s~~l~~~~~-----------------------------~~~~~~~~~a~~~a~~~~-~~e~~~~~ 242 (273) ++|.|++++||+||+||++|.... ..++++|++|++.+++++ ++|..|++ T Consensus 235 ~~G~V~~i~G~~V~~Sn~lp~~~~~~~~~~~~ag~~~~~~~~~~~~~~~a~~~~~gl~~h~~A~g~v~~~~~~~e~~r~~ 314 (347) T protein:vir:33 235 ERGTIRNVMGFEVVEVPHLTAGGAGDTREDAPADQKHAFPATSSTTVKVALDNVVGLFQHRSAVGTVKLKDLALERARRA 314 (347) T ss_pred ccceeEEEeceeEEEecccccCccccccccccccccccccCCcccceeccccceeeeeecchhheeeeeeceeeeeccch Confidence 999999999999999999986422 124688999999999776 89999999 Q ss_pred CceeeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 243 DSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 243 ~~~~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) .+|+|+|+|+++||++++|||++|+|+..+- T Consensus 315 ~~~~d~i~~~~~~G~~vlrP~~av~i~~~~~ 345 (347) T protein:vir:33 315 NYQADQIIAKYAMGHGGLRPEAAGAIVLPKV 345 (347) T ss_pred hhhhHhhhhhhhcCCceecccceEEEecCCC Confidence 9999999999999999999999999998887 No 7 >protein:vir:1541 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052109;swissprot:trembl:q9t107;genbank:gi:9634035;uniprot:Q9T107;genbank:GeneID:1262383 Probab=100.00 E-value=9.2e-52 Score=300.38 Aligned_cols=268 Identities=19% Similarity=0.139 Sum_probs=222.1 Q ss_pred Cccch--------------------hhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeecCCC Q lcl|NC_011288. 1 MAFNN--------------------FIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAG 60 (273) Q Consensus 1 MA~~~--------------------~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~ 60 (273) ||++. +.=|+|++++++.|++.++|.++++.. +++.|++++||+++..++.+|+++. T Consensus 1 ma~~~~~~~~~t~~~~~~~~~~~~a~~ie~f~g~V~~~f~~~s~~~~~~~~~---~~~~G~sv~i~~ig~~t~~~~~~g~ 77 (347) T protein:vir:15 1 MANIQGGQQIGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTMPRHMLR---SIASGKSAQFPVIGRTKAAYLKPGE 77 (347) T ss_pred CCccccCCccccccccCCCcchHHHHHHHHHHHHHHHHHHHhhhhhhccccc---cccccceeEeeeccceeeeeeccCC Confidence 66551 223889999999999999999999764 5667999999999999999998754 Q ss_pred c-ccCCCCCccceEEEEEeeeeecceEEchHHHHhhhHHH-HHHHHHHHHHHHHHHHHHHHHHHhhcccc---------c Q lcl|NC_011288. 61 R-QTSADAISDTGVDLLIDQEKSIDFLVDDIDRVQVAGSL-EAYTRAGATALATDTDKFIADLLVDNGTA---------L 129 (273) Q Consensus 61 ~-~~~~~~~~~~~~~~~id~~~~~~~~i~d~d~~~~~~~~-~~~~~~~~~ala~~~D~~i~~~~~~~~~~---------~ 129 (273) . +.+++++.+++++++||+.+++.+.|+|+|+.+.++++ .++.++++++||+++|+.++..+...... . T Consensus 78 ~l~~~~~~~~~~e~~ltID~~~~~~~~VddlD~~q~~~D~~~~~~~~~g~aLA~~~D~~i~~~l~~~~~~~~~~~~~~~~ 157 (347) T protein:vir:15 78 NLDDKRKDIKHTEKVIHIDGLLTADVLIYDIEDAMNHYDVRAEYTAQLGESLAMAADGAVLAELAGLVNLPDASNENIEG 157 (347) T ss_pred CCCCCCCCCccceEEEEechhhhhhHHhhhHHHHhcCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccc Confidence 3 33456688999999999999999999999999999986 56999999999999999999876532100 0 Q ss_pred ----------cc--ccCC----CHHHHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceee Q lcl|NC_011288. 130 ----------SG--SAPT----DADDAFDLIATALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLR 193 (273) Q Consensus 130 ----------~~--~~~~----t~~~~~~~i~~a~~~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~ 193 (273) .. +... ....+++.|.+|++.|++++||.++||+||+|++|..|++++.+ .+.+.. +...++ T Consensus 158 ~g~~~~~~~~~~~~~~~~~~~~~~~~i~d~~~~a~~~Lde~~VP~~gR~~vv~P~~y~~LL~~~~~-~~~d~~-~~~~~~ 235 (347) T protein:vir:15 158 LGKPTVLTLVKPTTGDLTDPVELGKAIIAQLTIARASLTKNYVPAADRTFYTTPDNYSAILAALMP-NAANYQ-ALIDHE 235 (347) T ss_pred cCccccccccccccccchhhhhHHHHHHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHhccccc-cccccc-cccccc Confidence 00 0011 12456889999999999999999999999999999999999865 455554 445689 Q ss_pred eeeeeeEeceEEEeeCccccCCC-----------------------------cEEEEEcCceeEEeeeee-eehhhcCCC Q lcl|NC_011288. 194 AGTIGNLLGARIVESNNLRDTDD-----------------------------EQFVAFHPSAAAYVSQID-TVEALRDQD 243 (273) Q Consensus 194 ~G~ig~~~G~~v~~s~~l~~~~~-----------------------------~~~~~~~~~a~~~a~~~~-~~e~~~~~~ 243 (273) +|.|++++||+||+||++|.... ...+++|++|++.+++.+ ++|..|++. T Consensus 236 ~G~Vg~i~G~~V~~Sn~lp~~~~t~~~~~~~~g~~~~~~~~~~~~~~~~f~~~~~l~~h~~A~g~v~~~~~~~e~~~~~~ 315 (347) T protein:vir:15 236 RGTIRNVMGFEVVEVPHLTAGGAGDTREDAPADQKHAFPATSSTTVKVALDNVVGLFQHRSAVGTVKLKDLALERARRAN 315 (347) T ss_pred ceEEEEEeceEEEecccccccccccccccccccccccccccccceeeeccccceeeeeccceeeeeEeeceeeeecccch Confidence 99999999999999999985322 124678999999999765 899999999 Q ss_pred ceeeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 244 SFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 244 ~~~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) +|+|+|.++++||++++||||+|+|+..+- T Consensus 316 ~~~d~i~~~~~~G~~vlrP~~av~~~~~~~ 345 (347) T protein:vir:15 316 YQADQIIAKYAMGHGGLRPEAAGAIVLPKV 345 (347) T ss_pred hhhhhhehhhhcCCceeccccEEEEecCCC Confidence 999999999999999999999999998887 No 8 >protein:vir:94711 Length: 347 # NCBI annotation: capsid # Family: family:all:975 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338120;genbank:gi:77118198;genbank:GeneID:3707734 Probab=100.00 E-value=1.6e-52 Score=304.53 Aligned_cols=267 Identities=19% Similarity=0.149 Sum_probs=218.8 Q ss_pred Cccch--------------------hhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeecCCC Q lcl|NC_011288. 1 MAFNN--------------------FIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAG 60 (273) Q Consensus 1 MA~~~--------------------~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~ 60 (273) ||+.+ |+ |.|..+++..|++.++|.+++... +++.|++++||++|..++.+|+++. T Consensus 1 m~~~~~~~~~t~~g~~~~~~d~~al~i-k~f~~eV~~~f~~~s~~~~~~~~r---~i~~G~sv~i~~iG~~tv~~~t~G~ 76 (347) T protein:vir:94 1 MANVPGQKIGTDQGKGKSSSDALALFL-KVFAGEVLTAFTRRSVTADKHIVR---TIQNGKSAQFPVMGRTSGVYLAPGE 76 (347) T ss_pred CCCCCccccccccccCCccccHHHHHH-HHHhHHHHHHHHHHHhhhcccccc---cccccceEEEecccceeeeeecCCC Confidence 44431 22 567777777899999999988664 5678999999999999999999854 Q ss_pred cc-cCCCCCccceEEEEEeeeeecceEEchHHHHhhhHHH-HHHHHHHHHHHHHHHHHHHHHHHhhcccc---c------ Q lcl|NC_011288. 61 RQ-TSADAISDTGVDLLIDQEKSIDFLVDDIDRVQVAGSL-EAYTRAGATALATDTDKFIADLLVDNGTA---L------ 129 (273) Q Consensus 61 ~~-~~~~~~~~~~~~~~id~~~~~~~~i~d~d~~~~~~~~-~~~~~~~~~ala~~~D~~i~~~~~~~~~~---~------ 129 (273) .. .+++++.+++++++||+.+++.+.|+|+|+.+.++++ .++.++++++|++.+|+.++..+...+.. . T Consensus 77 ~l~~~~~~~~~~e~~itID~~~~~~~~VddiD~~q~~~D~~~~~~~~~g~aLa~~~D~~i~~~~~~~aa~~~~~~~~~~g 156 (347) T protein:vir:94 77 RLSDKRKGIKHTEKVITIDGLLTADVMIFDIEDAMNHYDVAGEYSNQLGEALAIAADGAVLAEMAILCNLPAASNENIAG 156 (347) T ss_pred CcCCCCCCCCcceEEEEecchhhhhHHhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccCC Confidence 32 2456789999999999999999999999999999986 56999999999999999998766421100 0 Q ss_pred --------------ccccCCCHHHHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeee Q lcl|NC_011288. 130 --------------SGSAPTDADDAFDLIATALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAG 195 (273) Q Consensus 130 --------------~~~~~~t~~~~~~~i~~a~~~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G 195 (273) ......++..+++.|.+|++.|++++||.++||+||+|++|..|+.+.. +.+.+.. +...+++| T Consensus 157 ~~~~s~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~~R~~vv~P~~~~~Ll~~~~-~~~~~~~-~~~~~~~G 234 (347) T protein:vir:94 157 LGTASVLEVGKKADLDTPAKLGEAIIGQLTIARAKLTSNYVPAGDRYFYTTPDNYSAILAALM-PNAANYA-ALIDPETG 234 (347) T ss_pred CcccceeeccccccccchhhhHHHHHHHHHHHHHHHhhcCCCCCCcEEEeCHHHHHHHhccch-hhhhhcc-cccccccc Confidence 0011123456789999999999999999999999999999999998754 4444444 44568999 Q ss_pred eeeeEeceEEEeeCccccCC---------------------------------CcEEEEEcCceeEEeeeee-eehhhcC Q lcl|NC_011288. 196 TIGNLLGARIVESNNLRDTD---------------------------------DEQFVAFHPSAAAYVSQID-TVEALRD 241 (273) Q Consensus 196 ~ig~~~G~~v~~s~~l~~~~---------------------------------~~~~~~~~~~a~~~a~~~~-~~e~~~~ 241 (273) .|++++||+||+||++|... ....+++|+.|++.+++++ ++|.+|+ T Consensus 235 ~Vg~i~G~~V~~Sn~lp~~~~t~~~~~~~~~~~aG~~~~~~~~~~~~~~~~~~~~~~l~~h~~A~~~v~~~~~~~e~~r~ 314 (347) T protein:vir:94 235 NIRNVMGFVVVEVPHLVQGGAGETRGDDGITIASGQKHAFPATASSDVKVTMDNVVGLFSHRSAVGTVKLRDLALERDRD 314 (347) T ss_pred ceEEEeceEEEecCcccccccccccccCcceecCcccccccccchhhhcccccceeEEEeehhhhhhhhcccccccchhc Confidence 99999999999999998421 1134678999999999887 8999999 Q ss_pred CCceeeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 242 QDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 242 ~~~~~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) +++|+|+|.|+++||++++|||++++|+++.+ T Consensus 315 ~~~~~d~i~~~~~~G~~~~rP~~a~~~~~~~A 346 (347) T protein:vir:94 315 VDAQGDLIVGKYAMGHGGLRPEAAGALVFSPA 346 (347) T ss_pred hhhHHHHhhhhhhhcCcccccceeEEEEecCC Confidence 99999999999999999999999999999977 No 9 >protein:vir:10450 Length: 344 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848297;genbank:gi:30387487;genbank:GeneID:1733971 Probab=100.00 E-value=3.3e-52 Score=302.84 Aligned_cols=265 Identities=20% Similarity=0.195 Sum_probs=219.4 Q ss_pred Cccc----------------------hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeecC Q lcl|NC_011288. 1 MAFN----------------------NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKA 58 (273) Q Consensus 1 MA~~----------------------~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~ 58 (273) |||. .|+ |+|++++++.|++.++|.+++++. +++.|++++||++|..++..+++ T Consensus 1 ma~~~~~~~~n~~~~~~~~~~~~~~al~i-e~~~geV~~~f~~~s~~~~~~~~r---~i~~g~s~~~~~iG~~~~~~~~~ 76 (344) T protein:vir:10 1 MANMTGGQQLGTNQGKDVMAAGDKLALFL-KVFGGEVLTAFARTSVTTSRHMVR---SISSGKSAQFPVLGRTQAAYLAP 76 (344) T ss_pred CccccccccCCcccCCccCCccchhHHHH-HHHHHHHHHHHHHHhhhcccceee---eecccceEEEEeeceeEEEeeec Confidence 8765 144 999999999999999999999864 56779999999999999999887 Q ss_pred CCccc-CCCCCccceEEEEEeeeeecceEEchHHHHhhhHHH-HHHHHHHHHHHHHHHHHHHHHHHhhcccc-------- Q lcl|NC_011288. 59 AGRQT-SADAISDTGVDLLIDQEKSIDFLVDDIDRVQVAGSL-EAYTRAGATALATDTDKFIADLLVDNGTA-------- 128 (273) Q Consensus 59 ~~~~~-~~~~~~~~~~~~~id~~~~~~~~i~d~d~~~~~~~~-~~~~~~~~~ala~~~D~~i~~~~~~~~~~-------- 128 (273) +.... +.+++.+++++++||+.+++.+.|+|+|+.+.++++ .++.++++++||+.+|+.++..+...... T Consensus 77 G~~l~~t~~~~~~~e~~l~ID~~~y~~~~VdDiD~~q~~~D~r~~~~~~~G~aLA~~~D~~i~~~la~~a~~~~~~~~~~ 156 (344) T protein:vir:10 77 GENLDDIRKDIKHTEKVITIDGLLTADVLIYDIEDAMNHYDVRSEYTSQLGESLAMAADGAVLAEIAGLCNVESQYNENI 156 (344) T ss_pred CCCCCCCCCCcccceEEEEEcchhhhhhhhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccc Confidence 65433 236789999999999999999999999999999986 56999999999999999998776431110 Q ss_pred ------------ccc----ccCCCHHHHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhccccccee Q lcl|NC_011288. 129 ------------LSG----SAPTDADDAFDLIATALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGL 192 (273) Q Consensus 129 ------------~~~----~~~~t~~~~~~~i~~a~~~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l 192 (273) ... ....++..+++.|.++++.|++++||.++||+||+|++|..|++++.+ .+.+ .++...+ T Consensus 157 ~g~~~~~~~~~~~~~~~~t~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~-~~~~-~~~~~~~ 234 (344) T protein:vir:10 157 TGLGTATVIETTQDKTTLTDQVALGKEIIAALTKARAALTKNYVPSSDRVFYCDPDSYSAILAALMP-NAAN-YAALIDP 234 (344) T ss_pred ccccccceeecccccccccchhhhHHHHHHHHHHHHHHHhhcCCCccCCEEEeChHHHHHHhhcccc-cccc-cccccce Confidence 000 011233567899999999999999999999999999999999988754 3434 4456678 Q ss_pred eeeeeeeEeceEEEeeCccccCC----------------------------CcEEEEEcCceeEEeeeee-eehhhcCCC Q lcl|NC_011288. 193 RAGTIGNLLGARIVESNNLRDTD----------------------------DEQFVAFHPSAAAYVSQID-TVEALRDQD 243 (273) Q Consensus 193 ~~G~ig~~~G~~v~~s~~l~~~~----------------------------~~~~~~~~~~a~~~a~~~~-~~e~~~~~~ 243 (273) ++|.|++++||+||+||++|.+. ...++++||.|++.+++++ ++|..|+++ T Consensus 235 ~~G~V~~v~G~~V~~Sn~lp~~~~~~~~~~~tg~~~~~~~~~~~~~~~~~s~~~~l~~h~~A~~~v~~~~~~~e~~r~~~ 314 (344) T protein:vir:10 235 EKGSIRNVMGFEVVEVPHLTAGGAGTSREGTTGQKHAFPATKSGNDKVAKDNVIGLFMHRSAVGTVKLRDLALERARRAN 314 (344) T ss_pred eeeEEEEEeceEEEeccccccccCCcccccccCccccccCCcccceeeecceeEEEeechhhhhhhhhccceeecccchh Confidence 99999999999999999998421 1124678999999999887 899999999 Q ss_pred ceeeeEEeeeeeeeEEecCceE--EEEecC Q lcl|NC_011288. 244 SFSDRIRALHVYGGKVVRPTGV--VVFNKT 271 (273) Q Consensus 244 ~~~~~v~~~~~~g~~v~~~~~~--v~~~~~ 271 (273) +|+|+|.|+++||++++||||+ |+|+.. T Consensus 315 ~~~d~i~g~~~~G~~vlRPe~a~~v~~~~~ 344 (344) T protein:vir:10 315 FQADQIIAKYAMGHGGLRPEAAGAVVFKTK 344 (344) T ss_pred HHHHHHHHHhhcccceecccceEEEEeecC Confidence 9999999999999999999988 566555 No 10 >protein:vir:8885 Length: 347 # NCBI annotation: major capsid protein A # Family: family:all:975 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813774;genbank:gi:29366729;genbank:GeneID:1258837 Probab=100.00 E-value=7.2e-51 Score=295.49 Aligned_cols=267 Identities=18% Similarity=0.139 Sum_probs=221.0 Q ss_pred Cccc---------------------hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeecCC Q lcl|NC_011288. 1 MAFN---------------------NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAA 59 (273) Q Consensus 1 MA~~---------------------~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~ 59 (273) |||. .|+ |+|++++++.|++.++|.++++.. +++.|++++||++|..++.+++++ T Consensus 1 ~a~~~~~~~~~~~~g~~~~~~d~~al~i-e~~~geV~~~f~~~s~~~~~~~~r---~i~~G~sv~~~~iG~~~~~~~~~g 76 (347) T protein:vir:88 1 MANATGGQQIGANQGKGQSAADKLALFL-KVFGGEVLTAFVRRSVTMDKHMVR---TIQNGKSASFPVMGRTKGYYLAPG 76 (347) T ss_pred CCCcccchhhhccCCCCccccchHHHHH-HHHHHHHHHHHHHHhhhhhccccc---cccCcceEEEeeecceeeeeeccc Confidence 7754 244 999999999999999999999764 567899999999999999888765 Q ss_pred Cccc-CCCCCccceEEEEEeeeeecceEEchHHHHhhhHHH-HHHHHHHHHHHHHHHHHHHHHHHhhcccc--------- Q lcl|NC_011288. 60 GRQT-SADAISDTGVDLLIDQEKSIDFLVDDIDRVQVAGSL-EAYTRAGATALATDTDKFIADLLVDNGTA--------- 128 (273) Q Consensus 60 ~~~~-~~~~~~~~~~~~~id~~~~~~~~i~d~d~~~~~~~~-~~~~~~~~~ala~~~D~~i~~~~~~~~~~--------- 128 (273) .... +.+++.+++++++||+.+++.+.|+|+|+.+.++|+ .++.++++++||+++|+.++..+...+.. T Consensus 77 ~~l~~~~~~~~~~~~~i~ID~~~y~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLA~~~D~~i~~~l~~~a~~~~~~~~~~~ 156 (347) T protein:vir:88 77 ENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIA 156 (347) T ss_pred cCCCCCCCCCccceEEEEEechhhhhhhhhhHHHHhhcCCchHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccC Confidence 4433 236789999999999999999999999999999985 67999999999999999998776432210 Q ss_pred -------cccc-------cCCCHHHHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeee Q lcl|NC_011288. 129 -------LSGS-------APTDADDAFDLIATALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRA 194 (273) Q Consensus 129 -------~~~~-------~~~t~~~~~~~i~~a~~~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~ 194 (273) ...+ ....+..+++.|.+++++|++++||.++||+||+|++|..|++++.+ .+.+.. +...+++ T Consensus 157 g~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~-~~~~~~-~~~~~~~ 234 (347) T protein:vir:88 157 GLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMP-NAANYA-ALIDPET 234 (347) T ss_pred CccccccccccccccccchhhhHHHHHHHHHHHHHHHhhcCCCCCCCEEEeCHHHHHHHhcchhh-hhhhhc-cccchhc Confidence 0000 01123456899999999999999999999999999999999987753 344443 4556899 Q ss_pred eeeeeEeceEEEeeCccccCCC--------------------------------cEEEEEcCceeEEeeeee-eehhhcC Q lcl|NC_011288. 195 GTIGNLLGARIVESNNLRDTDD--------------------------------EQFVAFHPSAAAYVSQID-TVEALRD 241 (273) Q Consensus 195 G~ig~~~G~~v~~s~~l~~~~~--------------------------------~~~~~~~~~a~~~a~~~~-~~e~~~~ 241 (273) |.|++++||+|++|+++|.+.. ...+.+|++|++.+++++ ++|..|+ T Consensus 235 G~vg~i~G~~V~~s~nlp~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~d~~~~~~l~~~~~a~g~v~~~d~~~e~~r~ 314 (347) T protein:vir:88 235 GNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARR 314 (347) T ss_pred ceeeeeccceEEEeecccccccccccccccccccccccccccccccccccccCcEEEEEechhhhhheecccceeeeeec Confidence 9999999999999999985211 123678999999999887 7999999 Q ss_pred CCceeeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 242 QDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 242 ~~~~~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) +++|+|+|.|+++||++++|||++++|+.+.+ T Consensus 315 ~~~~~d~i~~~~~~G~~~~rPe~a~~~~~~~a 346 (347) T protein:vir:88 315 PEFQADQIIGKYAMGHGGLRPEAAGALVFTPA 346 (347) T ss_pred hhhHHHHhhhhhhhcCceeccceEEEEEeCCC Confidence 99999999999999999999999999999888 No 11 >protein:vir:2201 Length: 345 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041998;swissprot:sw:p19726;genbank:gi:9627469;goa:P19726;uniprot:P19726;genbank:GeneID:1261026 Probab=100.00 E-value=2.8e-50 Score=292.23 Aligned_cols=268 Identities=18% Similarity=0.154 Sum_probs=220.8 Q ss_pred Cccc---------------------hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeecCC Q lcl|NC_011288. 1 MAFN---------------------NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAA 59 (273) Q Consensus 1 MA~~---------------------~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~ 59 (273) ||+. .+.=|+|+.++++.|++.++|.++++.. +++.|++++||+.|..++..++++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~al~le~f~geV~~~f~~~s~~~~~~~~r---~i~~gks~~~~~iG~~~~~~~~~G 77 (345) T protein:vir:22 1 MASMTGGQQMGTNQGKGVVAAGDKLALFLKVFGGEVLTAFARTSVTTSRHMVR---SISSGKSAQFPVLGRTQAAYLAPG 77 (345) T ss_pred CcccccchhcccccccccccCCchhHHHHHHHhHHHHHHHHHHhhhcccceee---eccccceEEEeeecceEEEeeecC Confidence 3332 1233999999999999999999999753 567799999999999999999876 Q ss_pred Cccc-CCCCCccceEEEEEeeeeecceEEchHHHHhhhHHH-HHHHHHHHHHHHHHHHHHHHHHHhhccccc-------- Q lcl|NC_011288. 60 GRQT-SADAISDTGVDLLIDQEKSIDFLVDDIDRVQVAGSL-EAYTRAGATALATDTDKFIADLLVDNGTAL-------- 129 (273) Q Consensus 60 ~~~~-~~~~~~~~~~~~~id~~~~~~~~i~d~d~~~~~~~~-~~~~~~~~~ala~~~D~~i~~~~~~~~~~~-------- 129 (273) .... +..++..++.+++||+.+++.+.|+|+|+.+.++++ .++.++++++||+.+|+.++..+...+... T Consensus 78 ~~l~~~~~~~~~~e~~ltID~~~y~~~~VddiD~~q~~~D~r~~~s~~~G~aLA~~~D~~i~~~l~k~a~~~~~~~~~~~ 157 (345) T protein:vir:22 78 ENLDDKRKDIKHTEKVITIDGLLTADVLIYDIEDAMNHYDVRSEYTSQLGESLAMAADGAVLAEIAGLCNVESKYNENIE 157 (345) T ss_pred CCCCCCCCCcccceEEEEecchhhhhhhHhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccc Confidence 4422 234577889999999999999999999999999996 569999999999999999987664321100 Q ss_pred ------------cc----ccCCCHHHHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceee Q lcl|NC_011288. 130 ------------SG----SAPTDADDAFDLIATALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLR 193 (273) Q Consensus 130 ------------~~----~~~~t~~~~~~~i~~a~~~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~ 193 (273) .+ ...+++..+++.|.+|++.|++++||.++||+||+|++|..|++++.+ .+.++ ++...++ T Consensus 158 ~~~~~~~~~~~~~g~~~t~~~~~~~~~~~ai~~a~~~Lde~~VP~~~R~~vv~P~~y~~Ll~~~~~-~~~~~-~~~~~~~ 235 (345) T protein:vir:22 158 GLGTATVIETTQNKAALTDQVALGKEIIAALTKARAALTKNYVPAADRVFYCDPDSYSAILAALMP-NAANY-AALIDPE 235 (345) T ss_pred ccccccccccccccccccccccCHHHHHHHHHHHHHHhhhcCCCccCCEEEeChHHHHHHhccccc-ccccc-ccccccc Confidence 00 112345678999999999999999999999999999999999988764 44444 4556688 Q ss_pred eeeeeeEeceEEEeeCccccCC-----------------------------CcEEEEEcCceeEEeeeee-eehhhcCCC Q lcl|NC_011288. 194 AGTIGNLLGARIVESNNLRDTD-----------------------------DEQFVAFHPSAAAYVSQID-TVEALRDQD 243 (273) Q Consensus 194 ~G~ig~~~G~~v~~s~~l~~~~-----------------------------~~~~~~~~~~a~~~a~~~~-~~e~~~~~~ 243 (273) +|.|++++||+|++||++|.+. +..++.+|++|++++++++ ++|..|+++ T Consensus 236 ~G~V~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~l~~h~~A~~~v~~~~~~~e~~r~~~ 315 (345) T protein:vir:22 236 KGSIRNVMGFEVVEVPHLTAGGAGTAREGTTGQKHVFPANKGEGNVKVAKDNVIGLFMHRSAVGTVKLRDLALERARRAN 315 (345) T ss_pred cceEEEEeceEEEecccccccccCccccCcccccccccccccceeeeeccCceEEEEEehhheeeeeeecceeeeeechh Confidence 9999999999999999987421 1234688999999999887 899999999 Q ss_pred ceeeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 244 SFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 244 ~~~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) +|+|+|.|+++||++++||||+++|+...- T Consensus 316 ~~~d~I~~~~a~G~~vlRPeaa~~i~~~~~ 345 (345) T protein:vir:22 316 FQADQIIAKYAMGHGGLRPEAAGAVVFKVE 345 (345) T ss_pred HHHHHHHHHHhcCCcccccceeEEEEEeeC Confidence 999999999999999999999998887777 No 12 >protein:vir:3136 Length: 322 # NCBI annotation: hypothetical protein # Family: family:all:11728 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640318;genbank:gi:21234405;genbank:GeneID:956058 Probab=100.00 E-value=7.2e-51 Score=295.48 Aligned_cols=264 Identities=16% Similarity=0.177 Sum_probs=214.0 Q ss_pred Cccc--------hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeecCCCcccCCCCCccce Q lcl|NC_011288. 1 MAFN--------NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTG 72 (273) Q Consensus 1 MA~~--------~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~~~~~~~~~~~ 72 (273) ||.- +++||+|+++++..|++.+++..++++.. ...|||||||.++++++.||...+ +++++++++++ T Consensus 1 ~~~~n~ts~~qafi~~EiWsa~il~~l~~~Lv~~~~~~~~d---~g~GDtV~InsIg~~tV~dY~~~~-~i~~d~ltt~~ 76 (322) T protein:vir:31 1 MSTGNNTSNTQALIVSEIWADEIEDILHEKLLDVNIARVVD---FPDGDKLTIPSVGTPVVRSRPEQG-DFTFDNLDTGE 76 (322) T ss_pred CCCCCCcccceEEeehhhhHHHHHHHhhhhhhhhhhhcccc---cCCCCeEEeccccccccccccCCC-CcccccCCCce Confidence 7743 35699999999999999999988877543 346999999999999999998754 57899999999 Q ss_pred EEEEEeeeeecceEEchHHHHhhhHHH-HHHHHHHHHHHHHHHHHHHHHHHhhccc---cc------c------cccCCC Q lcl|NC_011288. 73 VDLLIDQEKSIDFLVDDIDRVQVAGSL-EAYTRAGATALATDTDKFIADLLVDNGT---AL------S------GSAPTD 136 (273) Q Consensus 73 ~~~~id~~~~~~~~i~d~d~~~~~~~~-~~~~~~~~~ala~~~D~~i~~~~~~~~~---~~------~------~~~~~t 136 (273) ++++|||.||++|.|+| |+.+...++ ....++++++|+..+|+++.++++..+. +. . ..++++ T Consensus 77 ~~l~IDq~KYfaf~VdD-D~~Qa~~dl~~~~~~~aa~ala~~~D~fva~lL~~gA~~~~~~~~p~vin~~~~~iv~~gt~ 155 (322) T protein:vir:31 77 ISIILRDEVYAGNAISK-KLRQDSRWISNVGAMLPAEQARAIMERYQTDLLALGNAQFAGQNDPNVINGVPHRFVGTGTD 155 (322) T ss_pred EEEEEehhhhhccccch-hHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccCCcceecCCccceeccCCC Confidence 99999999999999999 999999997 5588999999999999999987764331 11 0 124556 Q ss_pred HHHHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHH---------hhhHHHHhhhhcccccceeeeeeeeeEeceEEEe Q lcl|NC_011288. 137 ADDAFDLIATALKELTKANVPNVGRVVVVNAEMAFWL---------RSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVE 207 (273) Q Consensus 137 ~~~~~~~i~~a~~~l~~~~vP~~~r~lvv~p~~~~~L---------~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~ 207 (273) +.+.|+.|++++.+|++++||.+|||+||+|.++..| +++++|. ....+|..+.++ .||+++||+|++ T Consensus 156 ~~~ay~~lv~l~~kLdkanVP~~gR~vVV~P~~~~~L~~i~~~~~l~~D~rf~-~i~~sG~a~g~~--~Vg~~~GF~V~~ 232 (322) T protein:vir:31 156 QTMDVTDFSRVNYVMTQSKMPMGGMIGIIDPSVAHHLETITNISNISNNPRWE-GIVESGIAPDMQ--FVRSVYGIDLFV 232 (322) T ss_pred chhhHHHHHHHHHHhccccCCCCCeEEEeCchhhhhhhhhhhhhhhhcccccc-ccccccchhhHH--HHHHHhceeeee Confidence 6789999999999999999999999999999997755 5555433 234444333322 389999999999 Q ss_pred eCccccCCCcEEEEEcC---------------------ceeEEeeeeeeehhhcCCCceeeeEEeeeeeeeEEecCceEE Q lcl|NC_011288. 208 SNNLRDTDDEQFVAFHP---------------------SAAAYVSQIDTVEALRDQDSFSDRIRALHVYGGKVVRPTGVV 266 (273) Q Consensus 208 s~~l~~~~~~~~~~~~~---------------------~a~~~a~~~~~~e~~~~~~~~~~~v~~~~~~g~~v~~~~~~v 266 (273) ||+++. +++.+++|+. ..++..+|+.+.|.+|++++|+|.+.++++||++++|||+++ T Consensus 233 SN~l~~-~~~~i~aG~d~~~t~ag~~n~f~~~~~~~~~~~~~~~~~l~~~e~~r~~~~~~d~~~~~~~~g~g~~r~e~l~ 311 (322) T protein:vir:31 233 SNLLAD-ANETINAGGDARSTTAGKCNMFMNVSDMGLLPFVVAWKEMPTTKSFIDDYNDDLNTATTARWGNGLVRDENLV 311 (322) T ss_pred eccccc-cccccccCcccccccceeecccccccchhhhhhhhHhhhhhhhhcccCccccccceeeeeeecceeecccceE Confidence 999974 3444444444 333444567778999999999999999999999999999999 Q ss_pred EEecCCC Q lcl|NC_011288. 267 VFNKTGS 273 (273) Q Consensus 267 ~~~~~~s 273 (273) ++.+++. T Consensus 312 ~~~a~~~ 318 (322) T protein:vir:31 312 CVLANAD 318 (322) T ss_pred EEEeccc Confidence 9999988 No 13 >protein:vir:99075 Length: 392 # NCBI annotation: gp30 # Family: family:all:10837 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655895;genbank:gi:109521467;genbank:GeneID:4158040 Probab=100.00 E-value=4e-49 Score=285.90 Aligned_cols=268 Identities=19% Similarity=0.215 Sum_probs=204.2 Q ss_pred CccchhhHHHHHHHHHHHHHHhhccchhhcccccccc--cCCceEEEeecCcccceeecC----CCcccCCCCCccceEE Q lcl|NC_011288. 1 MAFNNFIPELWSDMLLEEWTAQTVFANLVNREYEGTA--SKGNVVHIAGVVAPTVKDYKA----AGRQTSADAISDTGVD 74 (273) Q Consensus 1 MA~~~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~--~~Gdtv~ip~~~~~~~~~~~~----~~~~~~~~~~~~~~~~ 74 (273) |||++|+||+|++++++.|++.|+|++++||+|+.++ ++||||+||+++.+++.+|+. .++++.++++.+++++ T Consensus 1 Ma~~~~~p~~~a~~~l~~l~~~lv~~~lv~~~~~~~~~~~~GdtV~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (392) T protein:vir:99 1 MANAFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLTVSDFTEDSFP 80 (392) T ss_pred CccccccHHHHHHHHHHHHHhhccchhhhccccccccccCCCCeEEEeecccccceeeeccccccCCcccccccccceEE Confidence 9999999999999999999999999999999998776 579999999999999988863 3556788899999999 Q ss_pred EEEeeeeecceEEchHHHHhhhHHH-HHHHHHHHHHHHHHHHHHHHHHHhhccccccc-ccCCCHHHHHHHHHHHHHHHh Q lcl|NC_011288. 75 LLIDQEKSIDFLVDDIDRVQVAGSL-EAYTRAGATALATDTDKFIADLLVDNGTALSG-SAPTDADDAFDLIATALKELT 152 (273) Q Consensus 75 ~~id~~~~~~~~i~d~d~~~~~~~~-~~~~~~~~~ala~~~D~~i~~~~~~~~~~~~~-~~~~t~~~~~~~i~~a~~~l~ 152 (273) ++||++++++|.|+|+|+.+...++ ++++++++++|++++|.++++++.++...... ....++.+.|+.|.+++++|+ T Consensus 81 ~~id~~k~~~~~i~d~e~~~~~~~~~~~~~~~a~~ala~~vd~~i~~~~~~a~~~~~~~~~~~~~~~~~~~i~~a~~~L~ 160 (392) T protein:vir:99 81 VTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEAAGAVHEVAPDEFFKGVNGARRALN 160 (392) T ss_pred EEEeeeeecceeechHHHhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccChhhhHHHHHHHHHHHh Confidence 9999999999999999999988885 67999999999999999999998876554432 345677889999999999999 Q ss_pred hcCCCccCCEEEECHHHHHHHhhhHHHHhhhhccccc--ceeeeeeeeeEeceEEEeeCccccCCCcEEEEEcCceeEEe Q lcl|NC_011288. 153 KANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDA--AGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYV 230 (273) Q Consensus 153 ~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~--~~l~~G~ig~~~G~~v~~s~~l~~~~~~~~~~~~~~a~~~a 230 (273) +++||. ||+++++|+++..|+++++|.. .+..+.. ..+++|+||+++||+||+|+++|.++. +++|++++.++ T Consensus 161 ~~~vP~-~R~~vv~p~~~~~l~~~~~~~~-~~~~g~~~~~~l~~G~vg~i~G~~v~~s~~~~~~t~---~a~~~~a~~~a 235 (392) T protein:vir:99 161 ELYIPQ-GRVLVVGTAVTEQILNDDRFIK-YESQGQSAVSALQEARLGRIYGYEIVESTLIPHGDA---YLYHPTAFIMA 235 (392) T ss_pred hcCCCC-CCEEEEcHHHHHHHhcccceee-cccccchhhhhhhcceeeeeeeeEEEeecccccccc---eeeeccccccc Confidence 999996 8999999999999999987543 3444433 469999999999999999999987643 56788777665 Q ss_pred eeeeeeh--h----------------h--cCCCceeeeEEeeeeeeeEEecCc---eEE------EEecCCC Q lcl|NC_011288. 231 SQIDTVE--A----------------L--RDQDSFSDRIRALHVYGGKVVRPT---GVV------VFNKTGS 273 (273) Q Consensus 231 ~~~~~~e--~----------------~--~~~~~~~~~v~~~~~~g~~v~~~~---~~v------~~~~~~s 273 (273) ....... . . .+....++.......+|.+.+... +.. ....+.+ T Consensus 236 t~a~v~~~~~~~~~s~s~~~~v~~~~~~~~~~t~~s~~~~v~~~~g~~~v~~~~~~~~~~~~~~~~~~~~v~ 307 (392) T protein:vir:99 236 TRAPAPPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKIHLIPGSIE 307 (392) T ss_pred cccccccccccceeEEecccceecceeecccceeeccccccceeEEEEEEeeccccceeeeeeeeeecceee Confidence 5321110 0 0 011111122222333444433211 110 0000000 No 14 >protein:vir:80180 Length: 381 # NCBI annotation: capsid protein # Family: family:all:2203 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285797;genbank:gi:148747831;genbank:GeneID:5220456 Probab=100.00 E-value=1.5e-49 Score=288.23 Aligned_cols=269 Identities=23% Similarity=0.320 Sum_probs=213.6 Q ss_pred Cccc---hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeecCCCcccCCCCCccceEEEEE Q lcl|NC_011288. 1 MAFN---NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLLI 77 (273) Q Consensus 1 MA~~---~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i 77 (273) |+.+ +|+||+|++++++.|++.++|.+++++ .+.++..||||+||+++.+++.+++++ .+++++++.+++++++| T Consensus 15 ~~~t~~~~fiPev~s~~v~~~l~~~lv~~~l~~~-~~~~~~~GdTV~ip~~g~~~a~d~~~g-~~i~~~~~~~~~~~itI 92 (381) T protein:vir:80 15 VDLSNVQVFIPEVWSSEVRMFRDQKFAALEATKK-IPFEGKKGDLIHIPNISRAAVYDKQPQ-TPVNLQARTDSEFTFTV 92 (381) T ss_pred cchhhHHhhhhHHHHHHHHHHHHHhhhhhhcccc-ccceeecCceEEeeccCcceeeeecCC-CcccccccCCceEEEEE Confidence 4433 689999999999999999999998764 344667899999999999999998874 57788999999999999 Q ss_pred eeeeecceEEchHHHHhhhHHH-HHHHHHHHHHHHHHHHHHHHHHHhhccccc-----------------ccccCCCHHH Q lcl|NC_011288. 78 DQEKSIDFLVDDIDRVQVAGSL-EAYTRAGATALATDTDKFIADLLVDNGTAL-----------------SGSAPTDADD 139 (273) Q Consensus 78 d~~~~~~~~i~d~d~~~~~~~~-~~~~~~~~~ala~~~D~~i~~~~~~~~~~~-----------------~~~~~~t~~~ 139 (273) |+++++++.|+|+|+.+.++++ +++.++++.+||+++|+++++.+....... ...+..+... T Consensus 93 D~~~~~~~~Idd~D~~~~~~D~~~~~~~~~~~aLA~~~D~~i~~~~~~~~~~~~~~~~t~~~~i~~~~~~~~~t~~~~~~ 172 (381) T protein:vir:80 93 TKYKESSFMIEDIVNTQASYTLRQYYTKEAGYALARDMDNFALAHRAVINAFPSQRIYSYDTTLGDGTVNAHLTGTPAPL 172 (381) T ss_pred eeeeecceeechHHHHhhccChHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccccccccccccccccchhhH Confidence 9999999999999999999985 669999999999999999998875432211 0112334567 Q ss_pred HHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCccccCCCcE- Q lcl|NC_011288. 140 AFDLIATALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQ- 218 (273) Q Consensus 140 ~~~~i~~a~~~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~~~~~- 218 (273) +++.|.+|+++|++++||.++|++||+|+++..|+++++++ +.+. ++...+++|.||+++||+|++|+++|...... T Consensus 173 t~~~i~~a~~~Lde~~VP~egR~lvv~P~~~~~Ll~~~~~~-~ad~-~~~~~l~~G~Ig~i~G~~Vv~Sn~lp~~~~t~~ 250 (381) T protein:vir:80 173 TYAALLLAKQKLDEADVPQEGRIVMVSPAQYIDLLSINQFI-SVDF-SQVKPVTSGVVGTILGMEVIVTTQIGINSLTGY 250 (381) T ss_pred HHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhhchhhh-hhhh-ccchhhhceeeeEEcceEEEeecccccccccce Confidence 89999999999999999999999999999999999998754 4454 45568999999999999999999998643211 Q ss_pred -EEEE--------------------------------------------------------------------------c Q lcl|NC_011288. 219 -FVAF--------------------------------------------------------------------------H 223 (273) Q Consensus 219 -~~~~--------------------------------------------------------------------------~ 223 (273) ..++ | T Consensus 251 ~~~agap~~~~~~~~~~~~~g~~s~~a~av~~~k~yd~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 330 (381) T protein:vir:80 251 VNGQGAPTQPTPGVLGSPYLPDQAGTANVVNTGSASDLAVSLSYFGLPVFSGAGATAADGGQTLGSFGGANRWATAVVCH 330 (381) T ss_pred eeeccccccccccccccccccccccceeeeeeeeeeceeeeeeeccceeeecceeeecCCCceeeeehhhhhhhhhcccc Confidence 0011 1 Q ss_pred CceeEEeeeee-eehhhcCCCceeeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 224 PSAAAYVSQID-TVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 224 ~~a~~~a~~~~-~~e~~~~~~~~~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) +++.+.+.|+. ..+.-+..-+++|.+.|++.||++++||.++|.|..+|- T Consensus 331 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 381 (381) T protein:vir:80 331 PDWLAVGVQQNVKSESSRETMYLADAFVTSCVYGAKVFRPDHCVLLHTSGI 381 (381) T ss_pred cccccccceeEeecccchhheeehhhhhhhhhhccccccchhhhhhhhcCC Confidence 11111111111 112334555789999999999999999999999999988 No 15 >protein:vir:100057 Length: 375 # NCBI annotation: T7-like capsid protein # Family: family:all:975 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214206;genbank:gi:61806429;genbank:GeneID:3294737 Probab=100.00 E-value=1.1e-48 Score=283.58 Aligned_cols=269 Identities=18% Similarity=0.162 Sum_probs=220.4 Q ss_pred Cccc-----------------------hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeec Q lcl|NC_011288. 1 MAFN-----------------------NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYK 57 (273) Q Consensus 1 MA~~-----------------------~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~ 57 (273) ||+. .+.=|+|++++++.|++.+++.++++.. +++.|++++||++|..++.+|+ T Consensus 1 ~~~~~~~~~~~~n~~t~~~~~~~~~~~al~le~f~geV~~~f~~~si~~~~~~~r---ti~~Gksv~f~~iG~~t~~~~t 77 (375) T protein:vir:10 1 MANANQVALGRSNLSTGTGYGGATDKYALYLKLFSGEMFKGFQHETIARDLVTKR---TLKNGKSLQFIYTGRMTSSFHT 77 (375) T ss_pred CccccccccCccccCCccccccccchHHHHHHHHhHHHHHHHHHHHhhhcccccc---ccccCceEEEEeeeeeEEeeec Confidence 2221 1233999999999999999999998753 6677999999999999999998 Q ss_pred CCCcccCC--CCCccceEEEEEeeeeecceEEchHHHHhhhHHH-HHHHHHHHHHHHHHHHHHHHHHHhhcccc------ Q lcl|NC_011288. 58 AAGRQTSA--DAISDTGVDLLIDQEKSIDFLVDDIDRVQVAGSL-EAYTRAGATALATDTDKFIADLLVDNGTA------ 128 (273) Q Consensus 58 ~~~~~~~~--~~~~~~~~~~~id~~~~~~~~i~d~d~~~~~~~~-~~~~~~~~~ala~~~D~~i~~~~~~~~~~------ 128 (273) ++...... .+...++++++||+.+++.+.|+|+|+.+.++++ .++.++++++||+++|+.++..+..++.. T Consensus 78 ~G~~i~~~~~~d~~~te~~l~ID~~~y~~~~VdDiD~aqa~~Dlr~e~s~~~G~aLA~~~D~~i~~~l~kaa~~~~p~~~ 157 (375) T protein:vir:10 78 PGTPILGNADKAPPVAEKTIVMDDLLISSAFVYDLDETLAHYELRGEISKKIGYALAEKYDRLIFRSITRGARSASPVSA 157 (375) T ss_pred CCcCcCCccccCCCCCceEEEecchhhhhhhHhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccc Confidence 76543322 2456788899999999999999999999999986 56999999999999999999887643211 Q ss_pred ----------------cccccCCCHHHHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhh--HHHHhhhhcccccc Q lcl|NC_011288. 129 ----------------LSGSAPTDADDAFDLIATALKELTKANVPNVGRVVVVNAEMAFWLRSS--GSKLTSADTSGDAA 190 (273) Q Consensus 129 ----------------~~~~~~~t~~~~~~~i~~a~~~l~~~~vP~~~r~lvv~p~~~~~L~~~--~~~~~~~~~~~~~~ 190 (273) ......+++.++++.|.++++.|++++||.++||+||+|++|..|+++ ..++.+.+.. +.. T Consensus 158 ~~~~~~Gg~~i~~~sg~~~~~~~ta~~~~~ai~~a~~~Lde~~VP~~~R~~vv~P~~y~~Ll~~~d~~~~~n~d~~-~~~ 236 (375) T protein:vir:10 158 TNFVEPGGTQIRVGSGTNESDAFTASALVNAFYDAAAAMDEKGVSSQGRCAVLNPRQYYALIQDIGSNGLVNRDVQ-GSA 236 (375) T ss_pred ccccccCcceeeeccccccccccCHHHHHHHHHHHHHHHhhcCCCCCCCEEEeChHHHHHHHhcCCccceeeeccc-ccc Confidence 111223568889999999999999999999999999999999999875 3456676664 445 Q ss_pred eeeeeeeeeEeceEEEeeCccccCCC-----------------------------------------------cEEEEEc Q lcl|NC_011288. 191 GLRAGTIGNLLGARIVESNNLRDTDD-----------------------------------------------EQFVAFH 223 (273) Q Consensus 191 ~l~~G~ig~~~G~~v~~s~~l~~~~~-----------------------------------------------~~~~~~~ 223 (273) ...+|.+++++||+|++||++|..++ ..++.+| T Consensus 237 ~~~~g~v~~i~Gv~V~~Sn~lP~~~~~~~~~g~~~~~~a~~~~~~~~~~~~~~~~~~~g~~~~y~~d~~~~~~~~~~~~~ 316 (375) T protein:vir:10 237 LQSGNGVIEIAGIHIYKSMNIPFLGKYGVKYGGTTGETSPGNLGSHIGPTPENANATGGVNNDYGTNAELGAKSCGLIFQ 316 (375) T ss_pred eeccceEEEEeceEEEEeccccccccccccccccccccchhhhhccccccCCcceeeccccccccccccccCceEEEEEc Confidence 67789999999999999999995321 2357889 Q ss_pred CceeEEeeeee-eeh---hhcCCCceeeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 224 PSAAAYVSQID-TVE---ALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 224 ~~a~~~a~~~~-~~e---~~~~~~~~~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) ++|.+.++.++ .+| ..++..+++|+|.+++.|||+++|||++|+|+.+++ T Consensus 317 ~~A~g~v~~~~~~~~~~~~~~~~~~q~~~i~~~~a~G~~~lrp~~av~l~~~~~ 370 (375) T protein:vir:10 317 KEAAGVVEAIGPQVQVTNGDVSVIYQGDVILGRMAMGADYLNPAAAVELYIGAT 370 (375) T ss_pred hhheeeeeeeccccccccchhhheeeeeeeeeeeeeccCccCceeEEEEecCcC Confidence 99999887665 444 447999999999999999999999999999999999 No 16 >protein:vir:94576 Length: 347 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919012;genbank:gi:119637776;genbank:GeneID:5179336 Probab=100.00 E-value=2.2e-49 Score=287.32 Aligned_cols=268 Identities=18% Similarity=0.162 Sum_probs=220.4 Q ss_pred Cccch--------------------hhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeecCCC Q lcl|NC_011288. 1 MAFNN--------------------FIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAG 60 (273) Q Consensus 1 MA~~~--------------------~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~ 60 (273) |||.. +.=|+|++++++.|.+.++|.+++++. +++.|++++||++|..++.+++++. T Consensus 1 ma~~~~~~~~~t~~g~~~~~~d~~al~ie~~~geV~~~f~~~s~~~~~~~~r---ti~~G~sv~~~~iG~~~~~~~~~G~ 77 (347) T protein:vir:94 1 MANMNGGQQMGKDQGKGMSAGDKLALFLKVFGGEVLTAFTRTSVTMNKHLVR---SIQSGKSAQFPVLGRTKAAYLQPGE 77 (347) T ss_pred CCccccccccccccccCCcccchHHHHHHHHhHHHHHHHHHHHhhhhhhhhe---eccccceEEeeeccceeEeeeecCc Confidence 66441 233999999999999999999999764 5677999999999999999888765 Q ss_pred cccC-CCCCccceEEEEEeeeeecceEEchHHHHhhhHHH-HHHHHHHHHHHHHHHHHHHHHHHhhcccc---------- Q lcl|NC_011288. 61 RQTS-ADAISDTGVDLLIDQEKSIDFLVDDIDRVQVAGSL-EAYTRAGATALATDTDKFIADLLVDNGTA---------- 128 (273) Q Consensus 61 ~~~~-~~~~~~~~~~~~id~~~~~~~~i~d~d~~~~~~~~-~~~~~~~~~ala~~~D~~i~~~~~~~~~~---------- 128 (273) .... .+++.+++++++||+.+++.+.|+|+|+.++++++ .++.++++++||+++|+.++..+...+.. T Consensus 78 ~l~~~~~~~~~~e~~ltID~~~y~~~~VddiD~~q~~~D~rs~~~~~~g~ALA~~~D~~i~~~l~~~a~~~~~~~~~~~g 157 (347) T protein:vir:94 78 NLDDKRKDMKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTAQLGESLAMAADGAVLAEMAKLCNLPTANNENIAG 157 (347) T ss_pred CCCCCcCCccccceEEEEcchhhhhhhhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccc Confidence 4433 36789999999999999999999999999999986 56999999999999999998765432110 Q ss_pred --------------cccccCCCHHHHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeee Q lcl|NC_011288. 129 --------------LSGSAPTDADDAFDLIATALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRA 194 (273) Q Consensus 129 --------------~~~~~~~t~~~~~~~i~~a~~~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~ 194 (273) ...+...++.++++.|.++.+.|++++||+++||+|++|++|..|++...+ ... .......+++ T Consensus 158 ~~~~~~v~i~~~~~~~~~~~~~~~~~~d~i~~a~~~Lde~dVP~~~R~~vv~P~~y~~LLk~~~~-~~~-~~~~~~~~~~ 235 (347) T protein:vir:94 158 LGKAHVLEVGDQATLQGDQVKLGQAIIAQLTLARAKLTGNYVPSSDRVFYTTPDNYSAILAALMP-NAA-NYQALIDPST 235 (347) T ss_pred CCcceeEeeeccccccccccccHHHHHHHHHHHHHHhhhcCCCCCCCEEEeChHHHHHHHHhhcc-ccc-cccccccccc Confidence 000112345678999999999999999999999999999999999986432 222 2334457889 Q ss_pred eeeeeEeceEEEeeCccccCC--------------------------------CcEEEEEcCceeEEeeeee-eehhhcC Q lcl|NC_011288. 195 GTIGNLLGARIVESNNLRDTD--------------------------------DEQFVAFHPSAAAYVSQID-TVEALRD 241 (273) Q Consensus 195 G~ig~~~G~~v~~s~~l~~~~--------------------------------~~~~~~~~~~a~~~a~~~~-~~e~~~~ 241 (273) |.|++++||+||+||++|... +...+++|++|++.++.++ .+|..|+ T Consensus 236 G~V~~v~G~~V~~Sn~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~d~~~~~~l~~~~~A~~tv~~~~~~~e~~~~ 315 (347) T protein:vir:94 236 GSIRNVMGFEVIEVPHLTAGGAGDNRAEEGVAPTNQKHAFPDTASGDTRVALDNVVGLFNHRSAVGTVKLKDMALERARR 315 (347) T ss_pred ceeEEeeceEEEEcCccccccCcccccccccccccccccccccccccccccccceEEEEechhhhhhhhhcccceeeeec Confidence 999999999999999998532 1134788999999998776 6899999 Q ss_pred CCceeeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 242 QDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 242 ~~~~~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) +.+|+|+|.+++.||++++||||++++..+.+ T Consensus 316 ~~~~~~~i~~~~a~G~g~~rPe~a~~i~~~~a 347 (347) T protein:vir:94 316 ANFQADQIIAKYAMGHGGLRPEACGALVFKKA 347 (347) T ss_pred hhhhhhhhhhhhhhcCcccccceeEEEEecCC Confidence 99999999999999999999999998877777 No 17 >protein:vir:80930 Length: 278 # NCBI annotation: Cps # Family: family:all:522 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468392;genbank:gi:157324966;genbank:GeneID:5601363 Probab=100.00 E-value=5.5e-48 Score=279.66 Aligned_cols=266 Identities=19% Similarity=0.168 Sum_probs=225.1 Q ss_pred Cccc------hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCccc-ceeecCCCcccCCCCCccceE Q lcl|NC_011288. 1 MAFN------NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPT-VKDYKAAGRQTSADAISDTGV 73 (273) Q Consensus 1 MA~~------~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~-~~~~~~~~~~~~~~~~~~~~~ 73 (273) ||+. .|+||+|+++++++|.+.+++.+++..+++.++.+|++|+||++...+ +.++.. +..+++++++.++. T Consensus 1 Ma~~~T~~~~~iiPev~s~~v~~~~~~~~v~~~~~~~~~~l~g~~G~tv~ip~~~~~g~a~~~~~-g~~i~~~~lt~~~~ 79 (278) T protein:vir:80 1 MADLTTKLANLIDPEVMGPMISAKLPKAIKFGKIAPIDNSLEGQPGSEITVPKYKYIGDAQDVAE-GAAIDYSALETESV 79 (278) T ss_pred CCCcceehhheecHHHHHHHHHHHHHHhhhhcccceecccccCCCCCEEEEeeeccCCcceeecC-CCcCccccccccee Confidence 9983 589999999999999999999999988888888899999999998776 455554 55788999999999 Q ss_pred EEEEeeeeecceEEchHHHHhhhHH-HHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc-CCCHHHHHHHHHHHHHHH Q lcl|NC_011288. 74 DLLIDQEKSIDFLVDDIDRVQVAGS-LEAYTRAGATALATDTDKFIADLLVDNGTALSGSA-PTDADDAFDLIATALKEL 151 (273) Q Consensus 74 ~~~id~~~~~~~~i~d~d~~~~~~~-~~~~~~~~~~ala~~~D~~i~~~~~~~~~~~~~~~-~~t~~~~~~~i~~a~~~l 151 (273) +++|++ ++..|.++|++..+...+ ++++.+++++++++++|+++++.+..+........ ..+....++.|.++..+| T Consensus 80 ~~~i~~-~~~a~~v~D~~~~~~~~d~~~~~~~~~a~~~a~~~d~~l~~~l~~a~~~~~~~~t~~~~~~~~~~~~da~~~l 158 (278) T protein:vir:80 80 KHGIKK-AGKGVKLTDESVLSGYGDPVEEAQKQIRMAIASKVDNDILEEALTTTLEVKGAINIGLIDKIENTFTDAPDAI 158 (278) T ss_pred eEeeeh-hhccccccHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccchhhhHHHHHHHHHHhh Confidence 999977 567899999998888766 68899999999999999999999987765554332 234456789999999999 Q ss_pred hhcCCCccCCEEEECHHHHHHHhhhHHH-HhhhhcccccceeeeeeeeeEeceEEEeeCccccCCCcEEEEEcCceeEEe Q lcl|NC_011288. 152 TKANVPNVGRVVVVNAEMAFWLRSSGSK-LTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYV 230 (273) Q Consensus 152 ~~~~vP~~~r~lvv~p~~~~~L~~~~~~-~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~~~~~~~~~~~~a~~~a 230 (273) +++++|. .++++++|++++.|+++... +......+ .+.+++|.||++.||+|++|+++|.. .+++.|++|+++. T Consensus 159 ~~~~~~~-~~~ivv~p~~~~~L~k~~~~~~~~~~~~g-~~~~~~G~ig~~~G~~Vi~s~~~p~~---t~~l~~~gAi~~~ 233 (278) T protein:vir:80 159 EDESITT-TGVLFLNYKDTAKLREEAAGSWTKASQLG-DDLLVKGAFGELLGWEIVRTKKLADG---NALAVKAGALKTF 233 (278) T ss_pred cccCCCc-ccEEEECHHHHHHHHhhhhhhcccccccc-ccceeeccceeecceeEEEcCCCCcc---eEEEEeccceeee Confidence 9999995 67899999999999887522 22233333 45789999999999999999999853 5678899999876 Q ss_pred e-eeeeehhhcCCCceeeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 231 S-QIDTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 231 ~-~~~~~e~~~~~~~~~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) . +...+|..|++++++|.|+++++||+++++|+++|++++.+- T Consensus 234 ~~~~~~vE~~Rd~~~~~d~i~~~~~yg~~v~~~~~~v~it~~a~ 277 (278) T protein:vir:80 234 LKRNLLAESGRDMDHKLTKFNADQHYAVALVDETKAVKVVPVAG 277 (278) T ss_pred ecCCcccccccchhhccceeeeeeEEEEEEEcCcceEEEeeccC Confidence 4 556899999999999999999999999999999999988877 No 18 >protein:vir:108303 Length: 418 # NCBI annotation: hypothetical protein # Family: family:all:1412 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552282;genbank:gi:160700607;genbank:GeneID:5758819 Probab=100.00 E-value=8.4e-48 Score=278.68 Aligned_cols=265 Identities=20% Similarity=0.184 Sum_probs=212.3 Q ss_pred Ccc---chhhHHHHHHHHHHHHHHhhccchhhcccccccc-cCCceEEEeecCcccceeecCCCcccCCCCCccceEEEE Q lcl|NC_011288. 1 MAF---NNFIPELWSDMLLEEWTAQTVFANLVNREYEGTA-SKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLL 76 (273) Q Consensus 1 MA~---~~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~-~~Gdtv~ip~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 76 (273) ||. ++++||+|++++++.|++++||.++|||+|+.++ +.||||+||+++.+.+.++. .++++++.++.++++ T Consensus 1 m~~~~N~~ltp~iia~~~l~~l~~~lV~~~lv~r~y~~e~~~~GDTV~I~vp~~~~v~dg~----~~~~~~~te~~v~l~ 76 (418) T protein:vir:10 1 MAVQDNNLLTDDVIAKEALRLLKNNLVMAKCVYRNYEKTFGKVGDTIRLKLPYRVKSASGR----TLVKQPMVDQTIPFK 76 (418) T ss_pred CCccccccccHHHHHHHHHHHHHHhccchhhhcCCCchHHhhCCCEEEEeeCCceeecccC----CccccccccceEEEE Confidence 986 4667999999999999999999999999999885 56999999999999887643 467889999999999 Q ss_pred EeeeeecceEEchHHHHhhhHHH-HHHHHHHHHHHHHHHHHHHHHHHhhcccccccccCCCHHHHHHHHHHHHHHHhhcC Q lcl|NC_011288. 77 IDQEKSIDFLVDDIDRVQVAGSL-EAYTRAGATALATDTDKFIADLLVDNGTALSGSAPTDADDAFDLIATALKELTKAN 155 (273) Q Consensus 77 id~~~~~~~~i~d~d~~~~~~~~-~~~~~~~~~ala~~~D~~i~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~~~l~~~~ 155 (273) ||++++++|.|+|+|+++...++ ++++++++++||+++|+++++++...+.... +..+..+.|++|.+++++|++++ T Consensus 77 id~~k~~~~~itD~e~a~~~~d~~~~~l~~A~~aLA~~vD~~ia~l~~~a~~~~g--t~gt~~~~~~~i~~a~~~Ld~~~ 154 (418) T protein:vir:10 77 IAYQEHVGLEYTVKDKTLDIMQFSERYLKSGMVQIANQIDRSLALTLKKAFHSSG--TPGVRPGAFIDFANAGAKQTTYA 154 (418) T ss_pred EecccccceeechHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc--cCCcCcchHHHHHHHHHHHHhcC Confidence 99999999999999999988886 6799999999999999999999887765442 23344467999999999999999 Q ss_pred CCccC-CEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCccccCCCc----------------- Q lcl|NC_011288. 156 VPNVG-RVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDE----------------- 217 (273) Q Consensus 156 vP~~~-r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~~~~----------------- 217 (273) ||.+| |++|++|++|..|+++..++. +..+....+|+|+||+++||+||+|+++|..+.. T Consensus 155 VP~~G~R~lVv~P~~~~~L~~~~~~~~--~~~~~~~~lr~G~IG~i~GF~V~~S~nip~~tag~~~~t~~v~ga~~~~~~ 232 (418) T protein:vir:10 155 VPQDGMRHAVLDPFTCASLSDEVTKLF--KESMVEQAYKMGYRGNVAAYEVYESQNLPKHTVGDHGGTPLVNGTVVNGDT 232 (418) T ss_pred CCCCCceEEEeCHHHHHHHhhhccccc--cccccchhhheeeeeeeeceEEEEecCCCcccccccccceeeeccccccee Confidence 99875 999999999999998876554 4556667899999999999999999999832100 Q ss_pred ------------------E------------------------------------------------------------- Q lcl|NC_011288. 218 ------------------Q------------------------------------------------------------- 218 (273) Q Consensus 218 ------------------~------------------------------------------------------------- 218 (273) . T Consensus 233 ~~~~~~t~s~~g~l~~Gd~~ti~gv~~v~~~t~~~~~~~~~f~V~~~~~~~~~~~~tv~i~p~~~~~~~~~~~~~~~~~~ 312 (418) T protein:vir:10 233 VGFDGGTASTTGFLKAGDVITFGGVFGVNPQNYETTGLLQEFVVLEDVDTDAGGAGSIKISPSLNDGTATINNENGDPVS 312 (418) T ss_pred EEEeecceeeccceeeccEEEECceeecccccccccccceEEEEEeeccccccCcceeEecccccccccccccccccccc Confidence 0 Q ss_pred --------------------------E---EEEcCceeEEeeeeeee-----------h----------hhcCCCceeee Q lcl|NC_011288. 219 --------------------------F---VAFHPSAAAYVSQIDTV-----------E----------ALRDQDSFSDR 248 (273) Q Consensus 219 --------------------------~---~~~~~~a~~~a~~~~~~-----------e----------~~~~~~~~~~~ 248 (273) + +++|++|++++..-..+ + ..++++..-+. T Consensus 313 ~~~~~~v~a~~a~~~~it~~~~a~~~~~~nl~f~~~a~~l~~~~l~~p~g~~~~~~~~~~~~G~s~r~~~~~d~~~~~~~ 392 (418) T protein:vir:10 313 LTAYQNVTALPADNAPITVLGAANTTYEQNYLFHRDAIALAMIDLELPQSAVIKSRAADPETGLSLTLTGAYDINEQSEI 392 (418) T ss_pred ccCCCcccccccCcceeeeecccccceeeeeeeecceEEEEEeeccCCCCCCcceEEEeccCCeEEEEEEcccccccceE Confidence 0 23455555554432111 0 11233334456 Q ss_pred EEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 249 IRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 249 v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) ++-+..||++.++||-.+.+--.++ T Consensus 393 ~r~d~l~g~~~~~p~~~~~~~g~~~ 417 (418) T protein:vir:10 393 HRIDAVWGADMIYGELALRLWGAAS 417 (418) T ss_pred EEEEeecCceeecccceEEEEeecC Confidence 6778899999999998777766666 No 19 >protein:vir:3525 Length: 423 # NCBI annotation: major head protein # Family: family:all:1412 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050985;genbank:gi:9633571;genbank:GeneID:1262318 Probab=100.00 E-value=3.5e-48 Score=280.78 Aligned_cols=268 Identities=15% Similarity=0.144 Sum_probs=198.1 Q ss_pred Cccchhh--HHHHHHHHHHHHHHhhccchhhcccccccc---cCCceEEEeecCcccceeecCC-CcccCCCCCccceEE Q lcl|NC_011288. 1 MAFNNFI--PELWSDMLLEEWTAQTVFANLVNREYEGTA---SKGNVVHIAGVVAPTVKDYKAA-GRQTSADAISDTGVD 74 (273) Q Consensus 1 MA~~~~~--pev~~~~~~~~~~~~lv~~~~v~~~~~~~~---~~Gdtv~ip~~~~~~~~~~~~~-~~~~~~~~~~~~~~~ 74 (273) |||+++. |++|++++++.|+++|||+++|||+|+.++ +.||||+||+++.+++.+|... +..+.++++.+.+++ T Consensus 1 MAN~llT~iP~iia~~al~~l~~~lV~~~lV~r~y~ge~~~a~~GDTV~I~~p~~~~v~d~~~~~~~~~~~~~~~e~~v~ 80 (423) T protein:vir:35 1 MANNLESNISQIVLKKFLPGFMSDIVLCKTVDRQLLSGEINSNTGDSVSFKRPHQFKSERTETGDITGKDKNGLFSAKAT 80 (423) T ss_pred CccchhhhhHHHHHHHHHHHHHhhcccchhcccCCCcccccccCCCEEEEeeCCcceeecccCcCCCCccccccccceee Confidence 9999754 999999999999999999999999999886 4699999999999999999654 456788999999999 Q ss_pred EEEeeeeecceEEchHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccCCCHHHHHHHHHHHHHHHhhc Q lcl|NC_011288. 75 LLIDQEKSIDFLVDDIDRVQVAGSLEAYTRAGATALATDTDKFIADLLVDNGTALSGSAPTDADDAFDLIATALKELTKA 154 (273) Q Consensus 75 ~~id~~~~~~~~i~d~d~~~~~~~~~~~~~~~~~ala~~~D~~i~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~~~l~~~ 154 (273) ++||++++++|.++|+|+++...+++.++++++++|++++|.+++..+........ ++..++...|++|.+++++|+++ T Consensus 81 l~id~~k~~a~~v~d~e~~l~i~~~~~~l~~a~~ala~~vd~~l~~~l~~~a~~~v-gt~~t~~~~~~~i~~a~~~Ld~~ 159 (423) T protein:vir:35 81 GKVGKYITVAVEWTQIEEALKLNQLDQILSPIHERMVTDLETELAHFMMNNGALSL-GSPNTAIKKWADVAQTASFIKDI 159 (423) T ss_pred EEeccceeccceeCHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc-ccccCCcchHHHHHHHHHHHHHh Confidence 99999999999999999998888888899999999999999999987765443332 33445556799999999999999 Q ss_pred CCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeee-eeEeceEEEeeCccccCCCcEE--EEEcCceeE--- Q lcl|NC_011288. 155 NVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTI-GNLLGARIVESNNLRDTDDEQF--VAFHPSAAA--- 228 (273) Q Consensus 155 ~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~i-g~~~G~~v~~s~~l~~~~~~~~--~~~~~~a~~--- 228 (273) +||.++||+|++|+++..|++++.++.+.+ ...+..+++|+| |+++||+||+||++|..+...+ ...+..+.. T Consensus 160 ~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~-~~~~~alr~g~i~G~i~GFdv~~Snnvp~~T~gt~~~~~~v~~a~~v~~ 238 (423) T protein:vir:35 160 GIKTGENYAIMDPWSAQRLADAQSGLHAAD-QLVRTAWENAQISGNFGGIRALMSNGLASRKQGDFDGAITVKTAPNVDY 238 (423) T ss_pred cCCcCCCEEEeCHHHHHHHhccccceeccc-cchhHHHhhccceeeecceEEEEcCCCccccccccccceeecccccccc Confidence 999999999999999999998887776544 345578999987 9999999999999996544321 111111110 Q ss_pred Eee----ee-eeeh----hhcCCCceeeeEEeeeeeeeEEecCc--------------eEEEEecCCC Q lcl|NC_011288. 229 YVS----QI-DTVE----ALRDQDSFSDRIRALHVYGGKVVRPT--------------GVVVFNKTGS 273 (273) Q Consensus 229 ~a~----~~-~~~e----~~~~~~~~~~~v~~~~~~g~~v~~~~--------------~~v~~~~~~s 273 (273) .+. +. ..+. ...+....||.+ ..-|.+.+.|- ..+++.++.+ T Consensus 239 ~a~~~~~~~~~~~~~~~~~~~g~l~~GD~~---t~aGv~~v~~~t~~~~~~~~t~~~~~~~V~~~~~~ 303 (423) T protein:vir:35 239 LSVKDSYQFTVALTGATPSKTGFLKAGDQL---KFTSTHWLNQQSKQTLYNGSTAMSFTATVLEETNS 303 (423) T ss_pred ccccccccceeeeeeeeeccCCcEEecceE---EeeeeeeccccccceeecccCCceeEEEEeccccc Confidence 000 00 0000 011222334433 22243333221 2222222211 No 20 >protein:vir:96123 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240078;genbank:gi:66395742;genbank:GeneID:5133103 Probab=100.00 E-value=5.1e-47 Score=274.38 Aligned_cols=259 Identities=19% Similarity=0.198 Sum_probs=221.9 Q ss_pred Cccc------hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcc-cceeecCCCcccCCCCCccceE Q lcl|NC_011288. 1 MAFN------NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAP-TVKDYKAAGRQTSADAISDTGV 73 (273) Q Consensus 1 MA~~------~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~-~~~~~~~~~~~~~~~~~~~~~~ 73 (273) ||+. .++||+|++.+++++.+.+++.+++.+++++++.+|++|+||.|... .+.+|.. +..+++++++.++. T Consensus 1 ma~~~T~~~d~i~Pev~s~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~ip~~~~~g~~~~~~~-g~~i~~~~it~~~~ 79 (274) T protein:vir:96 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAE-GEKIPVDQIGTSKR 79 (274) T ss_pred CCccccchhhhhhhHHHHHHHHHHHHhhhhhcccccccccccCCCCCEEEEEeeccCCCccccCC-CCcCchhhccccee Confidence 9974 57899999999999999999999999998888889999999999864 5666654 55788999999999 Q ss_pred EEEEeeeeecceEEchHHHHhhhHH-HHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccCCCHHHHHHHHHHHHHHHh Q lcl|NC_011288. 74 DLLIDQEKSIDFLVDDIDRVQVAGS-LEAYTRAGATALATDTDKFIADLLVDNGTALSGSAPTDADDAFDLIATALKELT 152 (273) Q Consensus 74 ~~~id~~~~~~~~i~d~d~~~~~~~-~~~~~~~~~~ala~~~D~~i~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~~~l~ 152 (273) +++|++ +++.+.++|++..+...+ +.++.+++++++++++|.++++.+..++.... +...+++.|.+|...|+ T Consensus 80 ~~~i~~-~~~~~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~~d~~i~~~l~~a~~~~~-----~~~~~~d~i~dA~~~l~ 153 (274) T protein:vir:96 80 EAKVRK-IGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATLTVE-----ADITKLDGLQTAIDKFN 153 (274) T ss_pred EEEEEe-eeceeeecHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCcC-----cccccHHHHHHHHHHhc Confidence 999976 689999999998887776 68899999999999999999999876554332 22345889999999999 Q ss_pred hcCCCccCCEEEECHHHHHHHhhhH--HHHhhhhcccccceeeeeeeeeEeceEEEeeCccccCCCcEEEEEcCceeEEe Q lcl|NC_011288. 153 KANVPNVGRVVVVNAEMAFWLRSSG--SKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYV 230 (273) Q Consensus 153 ~~~vP~~~r~lvv~p~~~~~L~~~~--~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~~~~~~~~~~~~a~~~a 230 (273) ++++ ++|+++|+|+++..|+++. .|+. .... +.+.+++|.||++.||+|++|+++|.. .+++++++|+++. T Consensus 154 d~~~--~~~~ivv~p~~~~~L~k~~~~~f~~-~~~~-g~~~~~~g~ig~~~G~~Vi~s~~~p~~---t~~l~~~gA~~~~ 226 (274) T protein:vir:96 154 DEDL--EPMVLFVNPLDAGGLRTSASDNFTR-PTQL-GDNIIVKGAFGEALGAVIVRSNKLNKG---EALLAKKGAVKLI 226 (274) T ss_pred ccCC--CceEEEeCHHHHHHHHhcccccccc-cccc-cccceeecccceecCeeEEEcCCCCcc---eEEEEeCcceeee Confidence 9875 6899999999999998875 3333 2233 346799999999999999999999864 4678899999998 Q ss_pred eeee-eehhhcCCCceeeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 231 SQID-TVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 231 ~~~~-~~e~~~~~~~~~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) .+.. .+|..|++++++|.++++++||+++++|+++|+++.... T Consensus 227 ~~~~~~vE~~Rd~~~~~d~i~~~~~yg~~~~~~~~vv~~t~~~~ 270 (274) T protein:vir:96 227 TKRDFFLEKDRDASRKSTALYSDKHYVAYLYDESKVVKITKGAG 270 (274) T ss_pred ecCCcccccccchhhcccEEEEeeEEEEEEEcCccEEEEEcCcc Confidence 7655 899999999999999999999999999999999988777 No 21 >protein:vir:96262 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240311;genbank:gi:66395978;genbank:GeneID:5133339 Probab=100.00 E-value=1.1e-46 Score=272.60 Aligned_cols=258 Identities=17% Similarity=0.186 Sum_probs=217.0 Q ss_pred Cccc------hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCccc-ceeecCCCcccCCCCCccceE Q lcl|NC_011288. 1 MAFN------NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPT-VKDYKAAGRQTSADAISDTGV 73 (273) Q Consensus 1 MA~~------~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~-~~~~~~~~~~~~~~~~~~~~~ 73 (273) ||+. .++||+|++++++++.+.++|.+++..+.+.++.+|+||+||.|...+ +.++.. +..+++++++.++. T Consensus 1 m~~~~T~l~d~i~Pev~~~~v~~~~~~~l~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~-g~~i~~~~lt~~~~ 79 (274) T protein:vir:96 1 MAQGMTKLTNQIVPEVLAPMMQAELEKKLRFASFAEIDNTLVGQPGDTLTFPAFIYSGDAKVVAE-GEKIPTDILETKKR 79 (274) T ss_pred CCcceeehhheechHHHHHHHHHHHHhhhhccccceecccccCCCCCEEEeeeecCCCccccccC-CCccchhhccccee Confidence 9995 578999999999999999999999877766677889999999998764 555654 56788999999999 Q ss_pred EEEEeeeeecceEEchHHHHhhhHH-HHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccCCCHHHHHHHHHHHHHHHh Q lcl|NC_011288. 74 DLLIDQEKSIDFLVDDIDRVQVAGS-LEAYTRAGATALATDTDKFIADLLVDNGTALSGSAPTDADDAFDLIATALKELT 152 (273) Q Consensus 74 ~~~id~~~~~~~~i~d~d~~~~~~~-~~~~~~~~~~ala~~~D~~i~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~~~l~ 152 (273) +++|++ ++++|.++|++..+...+ ++++.++++.++++++|+++++.+..+...... ....++.|.+|..+|+ T Consensus 80 ~~~i~~-~~~a~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~~~~~~-----~~~~~d~i~~A~~~lg 153 (274) T protein:vir:96 80 EAKIRK-IAKGTSISDEALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSAKLTVEA-----DITKLTGLQTAIDKFN 153 (274) T ss_pred EEEeee-eecceeehHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccc-----cccCHHHHHHHHHHhc Confidence 999976 689999999999888766 688999999999999999999998776544322 2234789999999999 Q ss_pred hcCCCccCCEEEECHHHHHHHhhhH--HHHhhhhcccccceeeeeeeeeEeceEEEeeCccccCCCcEEEEEcCceeEEe Q lcl|NC_011288. 153 KANVPNVGRVVVVNAEMAFWLRSSG--SKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYV 230 (273) Q Consensus 153 ~~~vP~~~r~lvv~p~~~~~L~~~~--~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~~~~~~~~~~~~a~~~a 230 (273) +++. .+|+++|+|++++.|+++. +|+. .... +.+.+++|.||++.||+|++|+++|. ..+++++++|+++. T Consensus 154 d~~~--~~~~ivv~p~~~~~L~k~~~~~f~~-~s~~-g~~~~~~G~ig~~~G~~Vi~s~~~~~---~t~~l~~~gA~~~~ 226 (274) T protein:vir:96 154 DEDL--EPMVLFISPLDAGKLRGDATTNFTR-ATEL-GDDVIVKGAFGEALGAVIVRSNKLEA---GTAILAKKGAVKLI 226 (274) T ss_pred cccc--cccEEEeCHHHHHHHHhhccccccc-cccc-cccceeccccceecCeEEEEeCCCCC---ceEEEEeccceeee Confidence 8874 7899999999999999985 3333 2233 34689999999999999999999985 35577788999875 Q ss_pred e-eeeeehhhcCCCceeeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 231 S-QIDTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 231 ~-~~~~~e~~~~~~~~~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) . +...+|..|++++++|.++++++||+++++|+++|+++ ++| T Consensus 227 ~~~~~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~~v~~t-k~~ 269 (274) T protein:vir:96 227 TKRDFFLETDRDPSTKTTALYSDKHYVAYLYDESKAVKIT-KGS 269 (274) T ss_pred ecCCcccccccccccccCEEEEeEEEEEEEEcCCcEEEEE-cCC Confidence 4 55689999999999999999999999999999999886 445 No 22 >protein:vir:95898 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240385;genbank:gi:66396054;genbank:GeneID:5133409 Probab=100.00 E-value=1.1e-46 Score=272.60 Aligned_cols=258 Identities=17% Similarity=0.186 Sum_probs=217.0 Q ss_pred Cccc------hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCccc-ceeecCCCcccCCCCCccceE Q lcl|NC_011288. 1 MAFN------NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPT-VKDYKAAGRQTSADAISDTGV 73 (273) Q Consensus 1 MA~~------~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~-~~~~~~~~~~~~~~~~~~~~~ 73 (273) ||+. .++||+|++++++++.+.++|.+++..+.+.++.+|+||+||.|...+ +.++.. +..+++++++.++. T Consensus 1 m~~~~T~l~d~i~Pev~~~~v~~~~~~~l~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~-g~~i~~~~lt~~~~ 79 (274) T protein:vir:95 1 MAQGMTKLTNQIVPEVLAPMMQAELEKKLRFASFAEIDNTLVGQPGDTLTFPAFIYSGDAKVVAE-GEKIPTDILETKKR 79 (274) T ss_pred CCcceeehhheechHHHHHHHHHHHHhhhhccccceecccccCCCCCEEEeeeecCCCccccccC-CCccchhhccccee Confidence 9995 578999999999999999999999877766677889999999998764 555654 56788999999999 Q ss_pred EEEEeeeeecceEEchHHHHhhhHH-HHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccCCCHHHHHHHHHHHHHHHh Q lcl|NC_011288. 74 DLLIDQEKSIDFLVDDIDRVQVAGS-LEAYTRAGATALATDTDKFIADLLVDNGTALSGSAPTDADDAFDLIATALKELT 152 (273) Q Consensus 74 ~~~id~~~~~~~~i~d~d~~~~~~~-~~~~~~~~~~ala~~~D~~i~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~~~l~ 152 (273) +++|++ ++++|.++|++..+...+ ++++.++++.++++++|+++++.+..+...... ....++.|.+|..+|+ T Consensus 80 ~~~i~~-~~~a~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~~~~~~-----~~~~~d~i~~A~~~lg 153 (274) T protein:vir:95 80 EAKIRK-IAKGTSISDEALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSAKLTVEA-----DITKLTGLQTAIDKFN 153 (274) T ss_pred EEEeee-eecceeehHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccc-----cccCHHHHHHHHHHhc Confidence 999976 689999999999888766 688999999999999999999998776544322 2234789999999999 Q ss_pred hcCCCccCCEEEECHHHHHHHhhhH--HHHhhhhcccccceeeeeeeeeEeceEEEeeCccccCCCcEEEEEcCceeEEe Q lcl|NC_011288. 153 KANVPNVGRVVVVNAEMAFWLRSSG--SKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYV 230 (273) Q Consensus 153 ~~~vP~~~r~lvv~p~~~~~L~~~~--~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~~~~~~~~~~~~a~~~a 230 (273) +++. .+|+++|+|++++.|+++. +|+. .... +.+.+++|.||++.||+|++|+++|. ..+++++++|+++. T Consensus 154 d~~~--~~~~ivv~p~~~~~L~k~~~~~f~~-~s~~-g~~~~~~G~ig~~~G~~Vi~s~~~~~---~t~~l~~~gA~~~~ 226 (274) T protein:vir:95 154 DEDL--EPMVLFISPLDAGKLRGDATTNFTR-ATEL-GDDVIVKGAFGEALGAVIVRSNKLEA---GTAILAKKGAVKLI 226 (274) T ss_pred cccc--cccEEEeCHHHHHHHHhhccccccc-cccc-cccceeccccceecCeEEEEeCCCCC---ceEEEEeccceeee Confidence 8874 7899999999999999985 3333 2233 34689999999999999999999985 35577788999875 Q ss_pred e-eeeeehhhcCCCceeeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 231 S-QIDTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 231 ~-~~~~~e~~~~~~~~~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) . +...+|..|++++++|.++++++||+++++|+++|+++ ++| T Consensus 227 ~~~~~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~~v~~t-k~~ 269 (274) T protein:vir:95 227 TKRDFFLETDRDPSTKTTALYSDKHYVAYLYDESKAVKIT-KGS 269 (274) T ss_pred ecCCcccccccccccccCEEEEeEEEEEEEEcCCcEEEEE-cCC Confidence 4 55689999999999999999999999999999999886 445 No 23 >protein:vir:174 Length: 423 # NCBI annotation: capsid protein # Family: family:all:1412 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112079;genbank:gi:13559869;genbank:GeneID:920999 Probab=100.00 E-value=3.2e-47 Score=275.47 Aligned_cols=267 Identities=13% Similarity=0.115 Sum_probs=198.0 Q ss_pred Cccchh--hHHHHHHHHHHHHHHhhccchhhcccccccc---cCCceEEEeecCcccceeecCCC-cccCCCCCccceEE Q lcl|NC_011288. 1 MAFNNF--IPELWSDMLLEEWTAQTVFANLVNREYEGTA---SKGNVVHIAGVVAPTVKDYKAAG-RQTSADAISDTGVD 74 (273) Q Consensus 1 MA~~~~--~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~---~~Gdtv~ip~~~~~~~~~~~~~~-~~~~~~~~~~~~~~ 74 (273) |||+++ +|++|++++++.|+++|||+++|||+|+.++ +.||||+||+|+.+.+.+|.... ...+++++.+.+++ T Consensus 1 MaN~llT~ip~iia~~al~~l~~~lV~~~lVnr~y~~e~~~~k~GDTV~I~~p~~~~~~~~~~~~~~~~~~~~l~e~~v~ 80 (423) T protein:vir:17 1 MPNNLDSNVSQIVLKKFLPGFMSDLVLAKTVDRQLLAGEINSSTGDSVSFKRPHQFSSLRTPTGDISGQNKNNLISGKAT 80 (423) T ss_pred CccchhhhhHHHHHHHHHHHHHhhcccchhhcccCCcchhhcccCCEEEEeeCCcceeecccCcccCCcccCccccceeE Confidence 999975 5999999999999999999999999998886 47999999999999999987533 33567899999999 Q ss_pred EEEeeeeecceEEchHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccCCCHHHHHHHHHHHHHHHhhc Q lcl|NC_011288. 75 LLIDQEKSIDFLVDDIDRVQVAGSLEAYTRAGATALATDTDKFIADLLVDNGTALSGSAPTDADDAFDLIATALKELTKA 154 (273) Q Consensus 75 ~~id~~~~~~~~i~d~d~~~~~~~~~~~~~~~~~ala~~~D~~i~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~~~l~~~ 154 (273) ++||++++++|.++|+|+++...++++++++++++||+++|.++++++...+....+ +..+..+.|++|.+++++|+++ T Consensus 81 l~id~~k~va~~v~d~E~~~~i~~~~~~l~~A~~aLA~~vd~~ia~~~~~~a~~~~g-t~~t~~~a~~~i~~a~~~Ld~~ 159 (423) T protein:vir:17 81 GRVGNYITVAVEYQQLEEAIKLNQLEEILAPVRQRIVTDLETELAHFMMNNGALSLG-SPNTPITKWSDVAQTASFLKDL 159 (423) T ss_pred EEeeceeeeeeeecHHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc-cCCcccccHHHHHHHHHHHHhc Confidence 999999999999999999987788888999999999999999999998765544333 3334445689999999999999 Q ss_pred CCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeee-eeEeceEEEeeCccccCCCcEE--EEEcCce--e-E Q lcl|NC_011288. 155 NVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTI-GNLLGARIVESNNLRDTDDEQF--VAFHPSA--A-A 228 (273) Q Consensus 155 ~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~i-g~~~G~~v~~s~~l~~~~~~~~--~~~~~~a--~-~ 228 (273) +||.++|++|++|++++.|++++.++... ..+.+..+|+|+| |+++||+||+||++|..++..+ -+.+..+ + + T Consensus 160 ~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~-~~~~~~alr~g~i~G~i~GFdvy~Snnip~~T~gt~~~t~~~~~~~~v~~ 238 (423) T protein:vir:17 160 GVNEGENYAVMDPWSAQRLADAQTGLHAS-DQLVRTAWENAQIPTNFGGIRALMSNGLASRTQGAFGGTLTVKTQPTVTY 238 (423) T ss_pred cCCcCCCEEEeChHHHHHHhccccceecc-cccchHHHhhccceeeecceEEEEeCCCccccccceeceeeecccccccc Confidence 99999999999999999999988766543 4556678999988 9999999999999996544332 0111000 0 0 Q ss_pred Ee-----eeeeeeh----hhcCCCceeeeEEeeeeeeeEEecC--------------ceEEEEecCCC Q lcl|NC_011288. 229 YV-----SQIDTVE----ALRDQDSFSDRIRALHVYGGKVVRP--------------TGVVVFNKTGS 273 (273) Q Consensus 229 ~a-----~~~~~~e----~~~~~~~~~~~v~~~~~~g~~v~~~--------------~~~v~~~~~~s 273 (273) .+ .+...+. ...+....||.+.- -|.+.+.| ...+ +++.++ T Consensus 239 ~a~~~~~~~~~~~~~~~~~~~g~l~~GD~~t~---aGv~~v~~~tk~v~~~~~t~~~~~~~-v~~~~~ 302 (423) T protein:vir:17 239 NAVKDSYQFTVTLTGATTSVTGFLKAGDQVKF---TNTYWLQQQTKQALYNGATPISFTAT-VTADAN 302 (423) T ss_pred cccccccceeeeeeeeeeeccCceeecceEEe---cceeeecccccccccccccccceEEE-EEeccc Confidence 00 0011110 11122334554322 23333322 1222 222222 No 24 >protein:vir:80213 Length: 334 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522884;genbank:gi:158345177;genbank:GeneID:5687476 Probab=100.00 E-value=1.3e-47 Score=277.60 Aligned_cols=267 Identities=17% Similarity=0.205 Sum_probs=224.0 Q ss_pred Cccc------------------hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeecCCCcc Q lcl|NC_011288. 1 MAFN------------------NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQ 62 (273) Q Consensus 1 MA~~------------------~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~ 62 (273) |++- .|+ |+|+.++++.|++.++|.+++.+. +++.|+|++||++|..++.++++ +.+ T Consensus 1 m~~~~~~~~t~~~~~~~~~~~~l~l-e~~~geV~~af~~~s~~~~~~~~r---~i~~G~s~~~~~iG~~~~~~~~~-g~~ 75 (334) T protein:vir:80 1 MTYPAANTHTRPGWGGANSDVSLHI-EEHLGLVDASFMYSSKFASWMNVR---SLRGTNQLRVDRVGASTIAGRKA-GEE 75 (334) T ss_pred CCCCcCCCccccccccccchheehh-hhhhhHHHHHHHHhhhhhccceee---eccccceEEEeeecceeeeeecC-CCC Confidence 6654 122 999999999999999999998764 67889999999999999998887 556 Q ss_pred cCCCCCccceEEEEEeeeeecceEEchHHHHhhhHHH-HHHHHHHHHHHHHHHHHHHHHHHhhccccc----------c- Q lcl|NC_011288. 63 TSADAISDTGVDLLIDQEKSIDFLVDDIDRVQVAGSL-EAYTRAGATALATDTDKFIADLLVDNGTAL----------S- 130 (273) Q Consensus 63 ~~~~~~~~~~~~~~id~~~~~~~~i~d~d~~~~~~~~-~~~~~~~~~ala~~~D~~i~~~~~~~~~~~----------~- 130 (273) +..+.+.+++++++||+.+++.+.|+|+|+.+.++|+ .++.++++++||+++|+.++..+..++... . T Consensus 76 l~~~~~~~~~~~l~ID~~l~~~~~VddiD~~q~~~D~rse~~~~~G~aLA~~~D~~~~~~l~kaa~~~~~~~~~~~~~~G 155 (334) T protein:vir:80 76 LVVQKNVSDKLNLTVDTVLYARHFFDKFDEWTSNLDVRKETAREDGIALARQYDQACIIQLQKCGDFLAPAHLKPAFHDG 155 (334) T ss_pred CCCCCcccCceEEEEeeeeehhhhHhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccccCC Confidence 7788899999999999999999999999999999996 569999999999999999887665322110 0 Q ss_pred -----------cccCCCHHHHHHHHHHHHHHHhhcCCCc---cCCEEEECHHHHHHHhhhHHHHhhhhccc--ccceeee Q lcl|NC_011288. 131 -----------GSAPTDADDAFDLIATALKELTKANVPN---VGRVVVVNAEMAFWLRSSGSKLTSADTSG--DAAGLRA 194 (273) Q Consensus 131 -----------~~~~~t~~~~~~~i~~a~~~l~~~~vP~---~~r~lvv~p~~~~~L~~~~~~~~~~~~~~--~~~~l~~ 194 (273) ....+++...+.++..|++.|++++||+ .+||+||+|++|..|+.+++++ +.++.+ +...+.+ T Consensus 156 ~~~~~~~~g~~~~~~~~~~~l~~a~~~a~~~L~e~dvp~~~~~~R~~vv~P~~y~~Ll~~~r~~-n~d~~~s~~~~~~~~ 234 (334) T protein:vir:80 156 ILLPSTISGLAADAAADADVLVAAHRQGVEAMVFRDLGDQLMSEGVTLLDPVIFSFLLEHDRLM-NVEFGAKEGGNSFVG 234 (334) T ss_pred cceeecccccccchhhhHHHHHHHHHHHHHHHHhcCCCCCcCCceEEEeChHHHHHHhcccccc-cceeccccccccccc Confidence 0112344556788999999999999995 5799999999999999998754 555433 3456889 Q ss_pred eeeeeEeceEEEeeCccccCCC------------------cEEEEEcCceeEEeeeee-eehhhcCCCceeeeEEeeeee Q lcl|NC_011288. 195 GTIGNLLGARIVESNNLRDTDD------------------EQFVAFHPSAAAYVSQID-TVEALRDQDSFSDRIRALHVY 255 (273) Q Consensus 195 G~ig~~~G~~v~~s~~l~~~~~------------------~~~~~~~~~a~~~a~~~~-~~e~~~~~~~~~~~v~~~~~~ 255 (273) |.|++++||+|++||++|.... ..++.+|++|+++++.++ .+|..|++++|+|+|.+++.| T Consensus 235 g~i~~v~G~~V~~Sn~~P~~~~t~~~~g~~~~~~agd~t~~~~~~~~~~Al~t~~~~~~~~e~~~~~~~~~d~i~~~~a~ 314 (334) T protein:vir:80 235 GRIAMLNGVRVVETPRFPQSAITANALGADFNVTDAEVRRKMITFIPSMALISAQVHPVSAQFWEEKKDFGHYLDTFQSY 314 (334) T ss_pred eeEEEEeceEEEeecCCCCccccccccccccccccccccceEEEEEeCceEEEEEEeecceeeeechhhHHHHHHHHHHc Confidence 9999999999999999996521 123677999999999875 789999999999999999999 Q ss_pred eeEEecCceEEEEecCCC Q lcl|NC_011288. 256 GGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 256 g~~v~~~~~~v~~~~~~s 273 (273) |++++||||+++++-+.+ T Consensus 315 G~g~lRPeaa~vv~~~~~ 332 (334) T protein:vir:80 315 NIGQRRPDAVAVHDITVT 332 (334) T ss_pred CCceeccceEEEEEEeee Confidence 999999999999999999 No 25 >protein:vir:1239 Length: 274 # NCBI annotation: similar to phage B1 major head protein # Family: family:all:522 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510938;genbank:gi:17426272;genbank:GeneID:927376 Probab=100.00 E-value=2.3e-46 Score=270.75 Aligned_cols=259 Identities=18% Similarity=0.187 Sum_probs=220.2 Q ss_pred Cccch------hhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcc-cceeecCCCcccCCCCCccceE Q lcl|NC_011288. 1 MAFNN------FIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAP-TVKDYKAAGRQTSADAISDTGV 73 (273) Q Consensus 1 MA~~~------~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~-~~~~~~~~~~~~~~~~~~~~~~ 73 (273) ||+.. ++||+|++++++++.+.++|.+++.+++++++.+|+||+||.|... .+.++.. +..+++++++.++. T Consensus 1 ma~~~T~l~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~ig~a~~~~~-g~~i~~~~lt~~~~ 79 (274) T protein:vir:12 1 MAQGLTKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAE-GEKIPTDILETKKR 79 (274) T ss_pred CCcceeehhhhhchHHHHHHHHHHHHhhhhhcccceecccccCCCCCEEEEeeecCCCccccccC-CCccchhhccccee Confidence 99974 8999999999999999999999999999988889999999999876 4556654 55788999999999 Q ss_pred EEEEeeeeecceEEchHHHHhhhHH-HHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccCCCHHHHHHHHHHHHHHHh Q lcl|NC_011288. 74 DLLIDQEKSIDFLVDDIDRVQVAGS-LEAYTRAGATALATDTDKFIADLLVDNGTALSGSAPTDADDAFDLIATALKELT 152 (273) Q Consensus 74 ~~~id~~~~~~~~i~d~d~~~~~~~-~~~~~~~~~~ala~~~D~~i~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~~~l~ 152 (273) +++|++ +++.|.++|++..+...+ +.++.++++.++++++|+++++.+..+...... ....++.|.+|..+|+ T Consensus 80 ~~~i~~-~~~~~~i~D~~~~~~~~d~~~~~~~q~~~~~a~~vd~~~l~~~~~a~~~~~~-----~a~~~d~i~dA~~~lg 153 (274) T protein:vir:12 80 EAKIRK-IAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVNA-----DITKLNGLQSAIDKFN 153 (274) T ss_pred eEEeee-ecceeeecHHHHHhcccchHHHHHHHHHHHHHHHHHHHHHHHHhcccccccc-----cccCHHHHHHHHHHhc Confidence 999976 789999999999888776 688999999999999999999998876554322 2345889999999999 Q ss_pred hcCCCccCCEEEECHHHHHHHhhhH--HHHhhhhcccccceeeeeeeeeEeceEEEeeCccccCCCcEEEEEcCceeEEe Q lcl|NC_011288. 153 KANVPNVGRVVVVNAEMAFWLRSSG--SKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYV 230 (273) Q Consensus 153 ~~~vP~~~r~lvv~p~~~~~L~~~~--~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~~~~~~~~~~~~a~~~a 230 (273) +++. .+|+++|+|+++..|+++. +|+ +.... +.+.+++|.||++.|++|++|+.+|.. ++++++++|+++. T Consensus 154 d~~~--~~~~ivv~p~~~~~L~k~~~~~fv-~~s~~-g~~~~~~G~ig~~~G~~Vi~s~~~p~~---t~~l~~~gA~~~~ 226 (274) T protein:vir:12 154 DEDL--EPMVLFINPLDAGKLRGDASTNFT-RATEL-GDDIIVKGAFGEALGAIIVRSNKLEAG---TAILAKKGAVKLI 226 (274) T ss_pred cccc--cccEEEeCHHHHHHHHhhhhhhcc-ccccc-cccceecccceeecCeeEEEeCCCCcc---eEEEEeccceeee Confidence 8874 7899999999999999975 333 33333 346799999999999999999999854 4578889999886 Q ss_pred e-eeeeehhhcCCCceeeeEEeeeeeeeEEecCceEEEEecC-CC Q lcl|NC_011288. 231 S-QIDTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKT-GS 273 (273) Q Consensus 231 ~-~~~~~e~~~~~~~~~~~v~~~~~~g~~v~~~~~~v~~~~~-~s 273 (273) . +...+|..|+++++.|.++++++||+++++|+++|+++.. +| T Consensus 227 ~~~~~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~ 271 (274) T protein:vir:12 227 LKRDFFLEVARDASTKTTALYSDKHYVAYLYDESKAVKITKGSGS 271 (274) T ss_pred ecCCceeccccchhhcccEEEeeeEEEEEEEcCCceEEEEcCCcc Confidence 5 5568999999999999999999999999999999987644 44 No 26 >protein:vir:93742 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240459;genbank:gi:66396126;genbank:GeneID:5133511 Probab=100.00 E-value=3.2e-46 Score=270.04 Aligned_cols=259 Identities=17% Similarity=0.183 Sum_probs=219.5 Q ss_pred Cccch------hhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcc-cceeecCCCcccCCCCCccceE Q lcl|NC_011288. 1 MAFNN------FIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAP-TVKDYKAAGRQTSADAISDTGV 73 (273) Q Consensus 1 MA~~~------~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~-~~~~~~~~~~~~~~~~~~~~~~ 73 (273) ||++. ++||+|++++++++++.+++.+++.++++.++.+|+||+||+|... .+.+|.. +..+++++++.++. T Consensus 1 ma~~~T~~~~~iiPev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~ip~~~~~g~~~~~~e-g~~i~~~~it~~~~ 79 (274) T protein:vir:93 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAE-GEKIPTDILETKKR 79 (274) T ss_pred CCccceehhheechHHHHHHHHHHHHhhhhhcccccccccccCCCCCEEEEEeeccCCCcccccC-CCccccccccccee Confidence 99994 8899999999999999999999999998888889999999999865 4666654 55788999999999 Q ss_pred EEEEeeeeecceEEchHHHHhhhHH-HHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccCCCHHHHHHHHHHHHHHHh Q lcl|NC_011288. 74 DLLIDQEKSIDFLVDDIDRVQVAGS-LEAYTRAGATALATDTDKFIADLLVDNGTALSGSAPTDADDAFDLIATALKELT 152 (273) Q Consensus 74 ~~~id~~~~~~~~i~d~d~~~~~~~-~~~~~~~~~~ala~~~D~~i~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~~~l~ 152 (273) +++|++ +++.+.++|++..+...+ +..+.+++++++++++|+++++.+..+..... +....++.|.+|..+|+ T Consensus 80 ~~~i~~-~~~~~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~~d~~~~~~~~~a~~~~~-----~~~~~~d~i~dA~~~l~ 153 (274) T protein:vir:93 80 EAKIRK-IAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVN-----ADITKLNGLQSAIDKFN 153 (274) T ss_pred EEEeee-ecccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhccccccc-----ccccCHHHHHHHHHHhh Confidence 999966 678999999999888776 67899999999999999999999876654332 22335788999999999 Q ss_pred hcCCCccCCEEEECHHHHHHHhhhH--HHHhhhhcccccceeeeeeeeeEeceEEEeeCccccCCCcEEEEEcCceeEEe Q lcl|NC_011288. 153 KANVPNVGRVVVVNAEMAFWLRSSG--SKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYV 230 (273) Q Consensus 153 ~~~vP~~~r~lvv~p~~~~~L~~~~--~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~~~~~~~~~~~~a~~~a 230 (273) +++. ++|+++|+|++++.|+++. .|+.. ... +...+++|.||++.||+|++|+++|. +++++++++|+++. T Consensus 154 d~~~--~~~~ivv~p~~~~~L~k~~~~~f~~~-s~~-g~~~~~~G~ig~~~G~~Vi~s~~~p~---~t~~l~~~gai~~~ 226 (274) T protein:vir:93 154 DEDL--EPMVLFINPLDAGKLRGDASTNFTRA-TEL-GDDIIVKGAFGEALGAIIVRTNKLEA---GTAILAKKGAVKLI 226 (274) T ss_pred hccC--CccEEEeCHHHHHHHHhhhhhccccc-ccc-cccceeecccceecCeeEEEcCCCCc---ceEEEEeCCeEEEE Confidence 9875 6899999999999999875 33332 222 34578999999999999999999985 45788899999988 Q ss_pred eee-eeehhhcCCCceeeeEEeeeeeeeEEecCceEEEEecC-CC Q lcl|NC_011288. 231 SQI-DTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKT-GS 273 (273) Q Consensus 231 ~~~-~~~e~~~~~~~~~~~v~~~~~~g~~v~~~~~~v~~~~~-~s 273 (273) .+. ..+|..|++++++|.++++++||+++++|+++++++.. +| T Consensus 227 ~~~~~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~~v~~t~~~~s 271 (274) T protein:vir:93 227 LKRDFFLEVARDASTKTTALYSDKHYVAYLYDESKAVKITKGSGS 271 (274) T ss_pred ecCCcccccccchhhcccEEEEEEEEEEEEEcCCceEEEeeCccc Confidence 654 48999999999999999999999999999999987654 44 No 27 >protein:vir:94494 Length: 274 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240676;genbank:gi:66396348;genbank:GeneID:5133758 Probab=100.00 E-value=3.3e-46 Score=269.94 Aligned_cols=259 Identities=17% Similarity=0.179 Sum_probs=220.7 Q ss_pred Cccch------hhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCccc-ceeecCCCcccCCCCCccceE Q lcl|NC_011288. 1 MAFNN------FIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPT-VKDYKAAGRQTSADAISDTGV 73 (273) Q Consensus 1 MA~~~------~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~-~~~~~~~~~~~~~~~~~~~~~ 73 (273) ||+.. ++||+|++++++++++.++|.+++.++++.++.+|+||+||+|+..+ +.++.. +..+++++++.++. T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~~g~a~~~~~-g~~i~~~~lt~~~~ 79 (274) T protein:vir:94 1 MPQGLTKTSDQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAE-GEKIPTDILETKKR 79 (274) T ss_pred CCccceehhheechHHHHHHHHHhhhhhhhhcccceecccccCCCCCEEEEeeecCCCccccccC-CCccccccccccee Confidence 99974 89999999999999999999999999998888899999999998754 556654 55788999999999 Q ss_pred EEEEeeeeecceEEchHHHHhhhHH-HHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccCCCHHHHHHHHHHHHHHHh Q lcl|NC_011288. 74 DLLIDQEKSIDFLVDDIDRVQVAGS-LEAYTRAGATALATDTDKFIADLLVDNGTALSGSAPTDADDAFDLIATALKELT 152 (273) Q Consensus 74 ~~~id~~~~~~~~i~d~d~~~~~~~-~~~~~~~~~~ala~~~D~~i~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~~~l~ 152 (273) +++|++ +++.+.++|++..+...+ +.++.+++++++++++|+++++.+..++..... ....++.|.+|...|+ T Consensus 80 ~~~i~~-~~~~~~i~D~~~~~~~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a~~~~~~-----~~~~~d~i~dA~~~l~ 153 (274) T protein:vir:94 80 EAKIRK-IAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVNA-----DITKLNGLQSAIDKFN 153 (274) T ss_pred EEEeee-ecceecccHHHHHhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccCccccc-----cccCHHHHHHHHHHhh Confidence 999976 678999999999888776 688999999999999999999998776654432 2234789999999999 Q ss_pred hcCCCccCCEEEECHHHHHHHhhhH--HHHhhhhcccccceeeeeeeeeEeceEEEeeCccccCCCcEEEEEcCceeEEe Q lcl|NC_011288. 153 KANVPNVGRVVVVNAEMAFWLRSSG--SKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYV 230 (273) Q Consensus 153 ~~~vP~~~r~lvv~p~~~~~L~~~~--~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~~~~~~~~~~~~a~~~a 230 (273) +++. .+|+++|+|+++..|+++. .|+.. ...+ ...+++|.||++.||+|++|+++|. +.+++++++|+++. T Consensus 154 d~~~--~~~~ivv~p~~~~~L~k~~~~~f~~~-s~~g-~~~~~~G~ig~~~G~~Vi~s~~~p~---~t~~l~~~gA~~~~ 226 (274) T protein:vir:94 154 DEDL--EPMVLFVNPLDAGKLRGDASTNFTRA-TELG-DDIIVKGAFGEALGAIIVRTNKLEA---GTAILAKKGAVKLI 226 (274) T ss_pred ccCC--CceEEEeCHHHHHHHHhhhhhhcccc-Cccc-ccceeccccceecCeeEEEcCCCCc---ceEEEEeCcceEee Confidence 8875 6799999999999999875 44433 3333 4578999999999999999999985 45678889999986 Q ss_pred ee-eeeehhhcCCCceeeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 231 SQ-IDTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 231 ~~-~~~~e~~~~~~~~~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) .+ ...+|..|++++++|.++++++||+++++|+++++++.++. T Consensus 227 ~~~~~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~ 270 (274) T protein:vir:94 227 LKRDFFLEVARDASTKTTALYSDKHYVAYLYDESKAVKITKGSG 270 (274) T ss_pred ecCCceeccccchhhcccEEEEEEEEEEEEEcCCceEEEecCcc Confidence 54 45899999999999999999999999999999998776655 No 28 >protein:vir:97433 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240749;genbank:gi:66396420;genbank:GeneID:5133789 Probab=100.00 E-value=3.3e-46 Score=269.94 Aligned_cols=259 Identities=17% Similarity=0.179 Sum_probs=220.7 Q ss_pred Cccch------hhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCccc-ceeecCCCcccCCCCCccceE Q lcl|NC_011288. 1 MAFNN------FIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPT-VKDYKAAGRQTSADAISDTGV 73 (273) Q Consensus 1 MA~~~------~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~-~~~~~~~~~~~~~~~~~~~~~ 73 (273) ||+.. ++||+|++++++++++.++|.+++.++++.++.+|+||+||+|+..+ +.++.. +..+++++++.++. T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~~g~a~~~~~-g~~i~~~~lt~~~~ 79 (274) T protein:vir:97 1 MPQGLTKTSDQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAE-GEKIPTDILETKKR 79 (274) T ss_pred CCccceehhheechHHHHHHHHHhhhhhhhhcccceecccccCCCCCEEEEeeecCCCccccccC-CCccccccccccee Confidence 99974 89999999999999999999999999998888899999999998754 556654 55788999999999 Q ss_pred EEEEeeeeecceEEchHHHHhhhHH-HHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccCCCHHHHHHHHHHHHHHHh Q lcl|NC_011288. 74 DLLIDQEKSIDFLVDDIDRVQVAGS-LEAYTRAGATALATDTDKFIADLLVDNGTALSGSAPTDADDAFDLIATALKELT 152 (273) Q Consensus 74 ~~~id~~~~~~~~i~d~d~~~~~~~-~~~~~~~~~~ala~~~D~~i~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~~~l~ 152 (273) +++|++ +++.+.++|++..+...+ +.++.+++++++++++|+++++.+..++..... ....++.|.+|...|+ T Consensus 80 ~~~i~~-~~~~~~i~D~~~~~~~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a~~~~~~-----~~~~~d~i~dA~~~l~ 153 (274) T protein:vir:97 80 EAKIRK-IAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVNA-----DITKLNGLQSAIDKFN 153 (274) T ss_pred EEEeee-ecceecccHHHHHhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccCccccc-----cccCHHHHHHHHHHhh Confidence 999976 678999999999888776 688999999999999999999998776654432 2234789999999999 Q ss_pred hcCCCccCCEEEECHHHHHHHhhhH--HHHhhhhcccccceeeeeeeeeEeceEEEeeCccccCCCcEEEEEcCceeEEe Q lcl|NC_011288. 153 KANVPNVGRVVVVNAEMAFWLRSSG--SKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYV 230 (273) Q Consensus 153 ~~~vP~~~r~lvv~p~~~~~L~~~~--~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~~~~~~~~~~~~a~~~a 230 (273) +++. .+|+++|+|+++..|+++. .|+.. ...+ ...+++|.||++.||+|++|+++|. +.+++++++|+++. T Consensus 154 d~~~--~~~~ivv~p~~~~~L~k~~~~~f~~~-s~~g-~~~~~~G~ig~~~G~~Vi~s~~~p~---~t~~l~~~gA~~~~ 226 (274) T protein:vir:97 154 DEDL--EPMVLFVNPLDAGKLRGDASTNFTRA-TELG-DDIIVKGAFGEALGAIIVRTNKLEA---GTAILAKKGAVKLI 226 (274) T ss_pred ccCC--CceEEEeCHHHHHHHHhhhhhhcccc-Cccc-ccceeccccceecCeeEEEcCCCCc---ceEEEEeCcceEee Confidence 8875 6799999999999999875 44433 3333 4578999999999999999999985 45678889999986 Q ss_pred ee-eeeehhhcCCCceeeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 231 SQ-IDTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 231 ~~-~~~~e~~~~~~~~~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) .+ ...+|..|++++++|.++++++||+++++|+++++++.++. T Consensus 227 ~~~~~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~ 270 (274) T protein:vir:97 227 LKRDFFLEVARDASTKTTALYSDKHYVAYLYDESKAVKITKGSG 270 (274) T ss_pred ecCCceeccccchhhcccEEEEEEEEEEEEEcCCceEEEecCcc Confidence 54 45899999999999999999999999999999998776655 No 29 >protein:vir:105374 Length: 423 # NCBI annotation: gene 5 protein # Family: family:all:1412 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958181;genbank:gi:41057283;genbank:GeneID:2716621 Probab=100.00 E-value=8.6e-47 Score=273.12 Aligned_cols=267 Identities=15% Similarity=0.129 Sum_probs=197.3 Q ss_pred Cccchh--hHHHHHHHHHHHHHHhhccchhhcccccccc---cCCceEEEeecCcccceeecCC-CcccCCCCCccceEE Q lcl|NC_011288. 1 MAFNNF--IPELWSDMLLEEWTAQTVFANLVNREYEGTA---SKGNVVHIAGVVAPTVKDYKAA-GRQTSADAISDTGVD 74 (273) Q Consensus 1 MA~~~~--~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~---~~Gdtv~ip~~~~~~~~~~~~~-~~~~~~~~~~~~~~~ 74 (273) |||+++ +|++|++++++.|+++||++++|||+|+.++ +.||||+||+++.+++.+|... +..++++++.+++++ T Consensus 1 MaN~llT~~p~iia~~aL~~l~~~lV~~~lVnr~y~~ef~~~k~GDTV~I~~p~~~~~~d~~~~~~~~~~~~dl~e~~v~ 80 (423) T protein:vir:10 1 MPNNLDSNVSQIVLKKFLPGFMSDLVLAKTVDRQLLAGEINSSTGDSVSFKRPHQFSSLRTPTGDISGQNKNNLISGKAT 80 (423) T ss_pred CccchhhhhHHHHHHHHHHHHHhhcccchhhcccCCCcccccccCCEEEEeeCCceeeeccCCccccccccCccccceeE Confidence 999975 5999999999999999999999999998886 4799999999999999999853 334678999999999 Q ss_pred EEEeeeeecceEEchHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccCCCHHHHHHHHHHHHHHHhhc Q lcl|NC_011288. 75 LLIDQEKSIDFLVDDIDRVQVAGSLEAYTRAGATALATDTDKFIADLLVDNGTALSGSAPTDADDAFDLIATALKELTKA 154 (273) Q Consensus 75 ~~id~~~~~~~~i~d~d~~~~~~~~~~~~~~~~~ala~~~D~~i~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~~~l~~~ 154 (273) ++||++++++|.++|+|+++...++++++++++++||+++|.++++++...+....++ ..+..+.|++|.+++++|+++ T Consensus 81 l~id~~k~va~~v~d~E~~~~i~~~~~~l~~A~~aLA~~vd~~ia~~~~~~~~~~~gt-~~t~~~a~~~i~~a~~~Ld~~ 159 (423) T protein:vir:10 81 GRVGNYITVAVEYQQLEEAIKLNQLEEILAPVRQRIVTDLETELAHFMMNNGALSLGS-PNTPITKWSDVAQTASFLKDL 159 (423) T ss_pred EEeeceeeeeeeechHHHhcChhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc-CCcccchHHHHHHHHHHHHhc Confidence 9999999999999999998777788889999999999999999999987765544433 333445689999999999999 Q ss_pred CCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeee-eeEeceEEEeeCccccCCCcEEEEEcCceeE----E Q lcl|NC_011288. 155 NVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTI-GNLLGARIVESNNLRDTDDEQFVAFHPSAAA----Y 229 (273) Q Consensus 155 ~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~i-g~~~G~~v~~s~~l~~~~~~~~~~~~~~a~~----~ 229 (273) +||.++|++||+|+++..|++++.++...+ .+.++.+|+|+| |+++||+||+||++|..+...+ |.++.. . T Consensus 160 ~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~-~~~~~alr~g~i~G~i~GFdv~~Snnip~~T~gt~---~~t~~~~~~~~ 235 (423) T protein:vir:10 160 GVNEGENYAVMDPWSAQRLADAQTGLHASD-QLVRTAWENAQIPTNFGGIRALMSNGLASRTQGAF---GGTLTVKTQPT 235 (423) T ss_pred cCCcCCCEEEeChHHHHHHhccccceeccc-ccchhhhhhccceeeecceEEEEeCCCcccccccc---ccceeeeecce Confidence 999999999999999999999887666543 455678999987 9999999999999997544321 111111 0 Q ss_pred e--------eeeee-e--hhh--cCCCceeeeEEeeeeeeeEEe-----------cCceEEEEecCCC Q lcl|NC_011288. 230 V--------SQIDT-V--EAL--RDQDSFSDRIRALHVYGGKVV-----------RPTGVVVFNKTGS 273 (273) Q Consensus 230 a--------~~~~~-~--e~~--~~~~~~~~~v~~~~~~g~~v~-----------~~~~~v~~~~~~s 273 (273) + .+... + ... ......||.+.--=++..-.+ ++...++. +.++ T Consensus 236 v~~~a~~~a~~~~~~~~~~~~~~~~~l~~GD~~t~aGv~~v~~~tk~~~~~~~t~~~~~~~v~-a~~~ 302 (423) T protein:vir:10 236 VTYNAVKDSYQFTVTLTGATASVTGFLKAGDQVKFTNTYWLQQQTKQALYNGATPISFTATVT-ADAN 302 (423) T ss_pred eccccccccceeeeeeeeccccccCceeecceEEecceeeecccccccccccccCcceEEEEE-eeee Confidence 0 01000 0 001 112223443322111111111 11222222 2221 No 30 >protein:vir:103323 Length: 364 # NCBI annotation: major capsid-like protein # Family: family:all:2806 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039668;genbank:gi:125999997;genbank:GeneID:4818399 Probab=100.00 E-value=1.3e-46 Score=272.19 Aligned_cols=269 Identities=16% Similarity=0.112 Sum_probs=223.4 Q ss_pred Cccch---------------hhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeecCCCcccCC Q lcl|NC_011288. 1 MAFNN---------------FIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSA 65 (273) Q Consensus 1 MA~~~---------------~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~~~~ 65 (273) |+.-| +.=|+|..++++.|.+.++|.++++.. +++.|+|++||.+|..++.+++++ ..+.+ T Consensus 1 ms~~n~~t~~~~~~~~~~~al~le~f~geV~taf~~~s~~~~~~~~r---ti~~gkS~q~~~iG~~~~~~~~~G-~~ld~ 76 (364) T protein:vir:10 1 MSNPNVLTQPAVSASGEVDSLLIEKFNNRVHEQYLKGENLLQWFDVQ---EVVGTNSVSNKYIGETELQVLSPG-KSPDA 76 (364) T ss_pred CCCcccccccccccccchhhhhhhhhhhhHHHHHHHHHhhcCcceee---eecccceEEeeeeeeeEEeeeccC-cccCC Confidence 66542 223899999999999999999888653 678899999999999999888764 45678 Q ss_pred CCCccceEEEEEeeeeecceEEchHHHHhhhHH-HH-HHHHHHHHHHHHHHHHHHHHHHhhccccc-------------- Q lcl|NC_011288. 66 DAISDTGVDLLIDQEKSIDFLVDDIDRVQVAGS-LE-AYTRAGATALATDTDKFIADLLVDNGTAL-------------- 129 (273) Q Consensus 66 ~~~~~~~~~~~id~~~~~~~~i~d~d~~~~~~~-~~-~~~~~~~~ala~~~D~~i~~~~~~~~~~~-------------- 129 (273) +.+..++.+++||+.+++.+.|+|+|+.+.+++ ++ ++.++++++||+.+|+.++..+..++... T Consensus 77 ~~~~~~k~~itID~ll~a~~~V~diDe~q~~~D~vR~e~s~e~G~ALA~~~Dq~i~~~v~~aa~a~~~~~~~~~~~~~~g 156 (364) T protein:vir:10 77 SPTEFDKNRLVVDTTVIARNTVAHFHDVQNDIDGLKSKLSVNQAKKLKKMEDSMVIQQLVLGGISNTEAIRKNPRVAGHG 156 (364) T ss_pred CCcccCcEEEEecceeeechhhhhHHHHhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccCCcccCCc Confidence 889999999999999999999999999999998 65 58899999999999999987664332000 Q ss_pred --------ccccCCCHHHHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEe Q lcl|NC_011288. 130 --------SGSAPTDADDAFDLIATALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLL 201 (273) Q Consensus 130 --------~~~~~~t~~~~~~~i~~a~~~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~ 201 (273) .....+++...+++|.++.+.|+|++||.++|+++|+|++|..|+++++++.+.....+...+++|+|++++ T Consensus 157 ~~i~~~~~a~~~~~~~~~l~~ai~~a~~~LdEkdVP~~~R~~vv~P~~y~~Ll~~~~lvn~d~~~~~~~~~~~G~v~~v~ 236 (364) T protein:vir:10 157 FSIHIVGLASSFLTSPQYMMAAIEMAMEQQTEQEVDTSELCGLMPWTAFNCLRDADRIVDKSYTIAASDNTVDGFVLKSW 236 (364) T ss_pred ceeeecccCcchhhhHHHHHHHHHHHHHHHhhcCCCccccEEEeChHHHHHHhcCCccccccccccCCCccccceeEEEe Confidence 011123445678889999999999999999999999999999999988765432222234568899999999 Q ss_pred ceEEEeeCccccCC------------------------------CcEEEEEcCceeEEeeeee-eehhhcCCCceeeeEE Q lcl|NC_011288. 202 GARIVESNNLRDTD------------------------------DEQFVAFHPSAAAYVSQID-TVEALRDQDSFSDRIR 250 (273) Q Consensus 202 G~~v~~s~~l~~~~------------------------------~~~~~~~~~~a~~~a~~~~-~~e~~~~~~~~~~~v~ 250 (273) ||+|++||++|... ...++++||.|++.+++++ .+|..|++.+|++++. T Consensus 237 Gv~Vv~Sn~lP~~~~~~~~t~~~t~h~ls~~~~g~~y~v~~d~~~~~~~~f~~~Al~tv~~~~~t~e~~~~~~~~~~~id 316 (364) T protein:vir:10 237 NTPIVPSNRFPKLSDNTEGTGNTKHHKLSNAGNGNRYDVTAGQTSAQAVLFTQDALLVGRTISITGDIFYEKKEKTWYID 316 (364) T ss_pred ceEEEeccccccccccccccccccccccccccCCcccccccccceeEEEEEecceEEEEEEecceeeeeeccceeeeeee Confidence 99999999998421 1235789999999999774 7899999999999999 Q ss_pred eeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 251 ALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 251 ~~~~~g~~v~~~~~~v~~~~~~s 273 (273) +.+.||++++||||++++++..+ T Consensus 317 a~~a~G~g~lRPeaa~~i~~~~~ 339 (364) T protein:vir:10 317 TFLAEGAIPDRWEAVAVVTAADT 339 (364) T ss_pred eehcccCcccCccceEEEEecCC Confidence 99999999999999999999888 No 31 >protein:vir:3613 Length: 272 # NCBI annotation: MHP # Family: family:all:522 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112699;genbank:gi:13786567;genbank:GeneID:921035 Probab=100.00 E-value=1.8e-46 Score=271.36 Aligned_cols=263 Identities=21% Similarity=0.183 Sum_probs=225.3 Q ss_pred Cccc------hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeecCCCcccCCCCCccceEE Q lcl|NC_011288. 1 MAFN------NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVD 74 (273) Q Consensus 1 MA~~------~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 74 (273) ||++ .++||+|++.+++++.+.+++.+++..+.+.++.+|+||+||+|+.++..+....+..+++++++.++.+ T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~ti~iP~~~~~gda~~~~eg~~i~~~~lt~~~~~ 80 (272) T protein:vir:36 1 MSKQKTTLADLVNPEVLAPIVSYELNKALRFAPLAQVDTTLQGQPGNTLKFPAFTYIGDAADVAEGGEISLDKIGTTTKS 80 (272) T ss_pred CCCcceehhhhhchHHHHHHHHHHHHhhhhhccccccccccccCCCCEEEEeeeccCccccccCCCCccChhhcCCccee Confidence 9974 5679999999999999999999999888888888999999999998765554455677889999999999 Q ss_pred EEEeeeeecceEEchHHHHhhhHH-HHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccCCCHHHHHHHHHHHHHHHhh Q lcl|NC_011288. 75 LLIDQEKSIDFLVDDIDRVQVAGS-LEAYTRAGATALATDTDKFIADLLVDNGTALSGSAPTDADDAFDLIATALKELTK 153 (273) Q Consensus 75 ~~id~~~~~~~~i~d~d~~~~~~~-~~~~~~~~~~ala~~~D~~i~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~~~l~~ 153 (273) ++|++ +++.+.++|++..+...+ +.++.++++.++++++|+++++.+....... +....++.|.+|+..|++ T Consensus 81 ~~i~~-~~k~~~vtD~~~~~~~~d~~~~~~~~~a~~~a~~~d~~i~~~l~~~~~~~------~~~~~~d~i~~A~~~lgd 153 (272) T protein:vir:36 81 VTIKK-AAKGTEITDEAALSGYGDPIGESNKQLGLSLANKVDDDLLSAAKTTSQTV------STKANVDGVQAALDIFND 153 (272) T ss_pred Eeeeh-hhccccccHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhccccccc------cccccHHHHHHHHHHhhh Confidence 99965 578999999998888776 6889999999999999999998886544332 334567899999999999 Q ss_pred cCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCccccCCCc-EEEEEcCceeEEee- Q lcl|NC_011288. 154 ANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDE-QFVAFHPSAAAYVS- 231 (273) Q Consensus 154 ~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~~~~-~~~~~~~~a~~~a~- 231 (273) ++.+ .|+++|+|+.+..|+++..+.. .....+.+.+++|.||+++|++|++|+++|.+++. .+++++++|+++.. T Consensus 154 ~~~~--~~~ivv~p~~~~~L~k~~~~~~-~~~~~~~~~~~~G~ig~~~G~~Vv~s~~~p~~~~~~~~~~~~~gA~~~~~~ 230 (272) T protein:vir:36 154 EDAQ--AYVLIVNPKDAAKIRKDANAKN-IGSEVGANALINGTYADVLGAQIVRSKKLAEGSALMFKIVSNSPALKLVLK 230 (272) T ss_pred cCCC--ceEEEEcHHHHHHHhccccccc-ccccccccceeeeccceecCeeEEEeCCCCCCceeEEEEEecccceeeeec Confidence 9875 6899999999999999876443 33344566899999999999999999999987663 45788899998665 Q ss_pred eeeeehhhcCCCceeeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 232 QIDTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 232 ~~~~~e~~~~~~~~~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) +..++|..|++++++|.++++++||+++++|+++|+++-+|- T Consensus 231 ~~~~vE~~R~~~~~~d~i~~~~~y~~~v~~~~~vv~~t~~g~ 272 (272) T protein:vir:36 231 RGVQVETDRDIVTKTTVITADEHYAAYLYDLTKVVNITFTGV 272 (272) T ss_pred CCcccccccchhhcCcEEEEEEEEEEEEEcCccEEEEeecCC Confidence 555899999999999999999999999999999999988888 No 32 >protein:vir:96833 Length: 275 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240157;genbank:gi:66395822;genbank:GeneID:5133174 Probab=100.00 E-value=5.2e-46 Score=268.86 Aligned_cols=260 Identities=20% Similarity=0.202 Sum_probs=219.2 Q ss_pred Cccc-----hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCccc-ceeecCCCcccCCCCCccceEE Q lcl|NC_011288. 1 MAFN-----NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPT-VKDYKAAGRQTSADAISDTGVD 74 (273) Q Consensus 1 MA~~-----~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~-~~~~~~~~~~~~~~~~~~~~~~ 74 (273) ||++ .++||+|++.+++.+.+.++|.+++..+.+.++.+|+||+||.|...+ +.++. .+..+++++++.++.+ T Consensus 3 ~~~~T~l~d~i~PEv~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~-~g~~i~~~~lt~~~~~ 81 (275) T protein:vir:96 3 LENMTKLANMVNPEVLAPMMQAELDKKLKFAQFADIDNTLVGQPGNTITFPAFVYSGDAKVVP-EGEEIPIDLIETKKRQ 81 (275) T ss_pred CcccchhhhhhchHHHHHHHHHHHHHhhhhcccceecccccCCCCCEEEeeeeccCCcccccc-CCCCcchhhcccceee Confidence 6664 467999999999999999999999988877788889999999998764 44454 4567889999999999 Q ss_pred EEEeeeeecceEEchHHHHhhhHH-HHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccCCCHHHHHHHHHHHHHHHhh Q lcl|NC_011288. 75 LLIDQEKSIDFLVDDIDRVQVAGS-LEAYTRAGATALATDTDKFIADLLVDNGTALSGSAPTDADDAFDLIATALKELTK 153 (273) Q Consensus 75 ~~id~~~~~~~~i~d~d~~~~~~~-~~~~~~~~~~ala~~~D~~i~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~~~l~~ 153 (273) ++|.+ ++++|.++|++..+...+ +.+++++++.++++++|+++++.+..+..... +....++.|.+|..+|++ T Consensus 82 ~~i~~-~~~~~~i~D~~~~~~~~d~~~~~~~~~a~~~a~~~d~~ll~~l~~a~~~~~-----~~~~~~d~i~dA~~~lgd 155 (275) T protein:vir:96 82 ATIRK-IGKGTVLTDEALLSGYGDPKGEAVRQHGLAIANKVDNDVLEALQGATLKVE-----ADITKLAGLQTAIDKFND 155 (275) T ss_pred EEeeh-hcccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhccccccc-----ccccCHHHHHHHHHHhcc Confidence 99954 799999999998888766 78899999999999999999998877654432 223458899999999988 Q ss_pred cCCCccCCEEEECHHHHHHHhhhHH-HHhhhhcccccceeeeeeeeeEeceEEEeeCccccCCCcEEEEEcCceeEEeee Q lcl|NC_011288. 154 ANVPNVGRVVVVNAEMAFWLRSSGS-KLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYVSQ 232 (273) Q Consensus 154 ~~vP~~~r~lvv~p~~~~~L~~~~~-~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~~~~~~~~~~~~a~~~a~~ 232 (273) ++. ++|+++|+|+++..|+++.. .|......+ .+.+++|.||++.|++|++|+++|.+ .+++.+++|+++..+ T Consensus 156 ~~~--~~~~ivv~p~~~~~L~k~~~~~f~~~~~~g-~~~~~~G~ig~~~G~~Vi~s~~~p~~---t~~i~~~gA~~~~~~ 229 (275) T protein:vir:96 156 EDL--EPMVLFVNPLDAGKLRASATDNFTRATLLG-DNVIVKGAFGEALGAIIVRSNKIKEG---EAILAKRGAVKLITK 229 (275) T ss_pred ccC--CccEEEeCHHHHHHHHhccccccccccccc-ccceeccccceecCeeEEEeCCCCcc---eEEEEeccceeeeec Confidence 764 68999999999999988741 233333333 45789999999999999999999854 457778999998775 Q ss_pred e-eeehhhcCCCceeeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 233 I-DTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 233 ~-~~~e~~~~~~~~~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) . ..+|..|++++++|.++++++||+++++|+++|+++.+.| T Consensus 230 ~~~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~ 271 (275) T protein:vir:96 230 RDFFLETERHASHKSTALFSDKHYVAYLYDESKVVKITKSAS 271 (275) T ss_pred CCcccccccchhhcCcEEEEeEEEEEEEEcCccEEEEEeccc Confidence 5 4899999999999999999999999999999999999999 No 33 >protein:vir:105522 Length: 423 # NCBI annotation: phage major head protein # Family: family:all:1412 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516191;genbank:gi:89885994;genbank:GeneID:3964382 Probab=100.00 E-value=5.3e-46 Score=268.79 Aligned_cols=271 Identities=14% Similarity=0.096 Sum_probs=194.0 Q ss_pred Cccch--hhHHHHHHHHHHHHHHhhccchhhcccccccc---cCCceEEEeecCcccceeecCCC-cccCCCCCccceEE Q lcl|NC_011288. 1 MAFNN--FIPELWSDMLLEEWTAQTVFANLVNREYEGTA---SKGNVVHIAGVVAPTVKDYKAAG-RQTSADAISDTGVD 74 (273) Q Consensus 1 MA~~~--~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~---~~Gdtv~ip~~~~~~~~~~~~~~-~~~~~~~~~~~~~~ 74 (273) |||++ |+|++|++++++.|++++||+++|||+|+.++ +.||||+||+|+...+.+..... .+..++++.+.+++ T Consensus 1 MANsl~~l~p~iia~~al~~l~~~lV~~~lV~r~y~~ef~~ak~GDTV~I~~P~~~~~~d~~~~~~t~~~~~~l~e~~v~ 80 (423) T protein:vir:10 1 MANNLDANVSQIVLKKFLPGFMSDLVLCKTVDRQLLAGEINSSTGDSVSFKRPHQFKSERTMDGDITGKSKNSLISAKAT 80 (423) T ss_pred CccccccccHHHHHHHHHHHHHhhcccchhhccCCCccccccccCCEEEEeeCCceeeecccCcccCcccccccccceEE Confidence 99997 99999999999999999999999999998885 36999999999998887643322 23345788899999 Q ss_pred EEEeeeeecceEEchHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccCCCHHHHHHHHHHHHHHHhhc Q lcl|NC_011288. 75 LLIDQEKSIDFLVDDIDRVQVAGSLEAYTRAGATALATDTDKFIADLLVDNGTALSGSAPTDADDAFDLIATALKELTKA 154 (273) Q Consensus 75 ~~id~~~~~~~~i~d~d~~~~~~~~~~~~~~~~~ala~~~D~~i~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~~~l~~~ 154 (273) ++||++++++|.++|+|+++...++++++++++++||+++|.+|+..+........++ +.+....++++.+++++|+++ T Consensus 81 l~id~~k~~a~~v~d~E~~l~i~~~~~~l~~A~~aLA~~vd~~ia~~~~~~~~~~vgt-~~t~~~a~~~~a~a~~~L~~~ 159 (423) T protein:vir:10 81 GEVGNYITVAVEYRQIEEALKLNQLDQILVPINERMVTDLETELALFMMKHGALSLGS-PNTPIKKWSDVAQTASFLKDL 159 (423) T ss_pred EEecceeeeeeeeChHHHhcChhHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccc-cccccccHHHHHHHHHHHhhc Confidence 9999999999999999999777888889999999999999999987665544433333 333345689999999999999 Q ss_pred CCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeee-eeEeceEEEeeCccccC-CCcEEEEEcCceeEEeee Q lcl|NC_011288. 155 NVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTI-GNLLGARIVESNNLRDT-DDEQFVAFHPSAAAYVSQ 232 (273) Q Consensus 155 ~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~i-g~~~G~~v~~s~~l~~~-~~~~~~~~~~~a~~~a~~ 232 (273) ++|.++|++||+|++++.|++++.++...+. +.+..+|+|.| |+++||+||+||++|.. .+....++|.++...+.+ T Consensus 160 ~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~-~~~~alr~~~i~G~~~GFdi~~Sn~vp~~T~g~~~ga~~~~~~~~vt~ 238 (423) T protein:vir:10 160 GINSGENYAVMDPWAAQRLADAQSGLHVSEQ-LVRTAWENAQISGNFGGIRALMSNGLASRTQGAFGGKLTVKGTPEVNY 238 (423) T ss_pred cCCcCCCEEEeCHHHHHHHhhhhhhhccccc-cchHHHHhcccceeecceEEEEecCCcccccccccceeeeeeeeEEEe Confidence 9999999999999999999988877766544 45567999987 99999999999999953 444445555554443322 Q ss_pred eee--ehhhcC------CCceeeeEEeeeeeeeE--Ee---cC-----------ceEEEEecC-----CC Q lcl|NC_011288. 233 IDT--VEALRD------QDSFSDRIRALHVYGGK--VV---RP-----------TGVVVFNKT-----GS 273 (273) Q Consensus 233 ~~~--~e~~~~------~~~~~~~v~~~~~~g~~--v~---~~-----------~~~v~~~~~-----~s 273 (273) -.- .+..+. ...-+.+..|+..-=++ .+ .- -..++...+ +. T Consensus 239 a~~~~~~~~~~~~~~~T~s~~g~l~~GD~~t~aGv~~v~~~tk~~l~~~~~~~~~~~~V~~~~~~~a~~~ 308 (423) T protein:vir:10 239 DSVKDSYAFTATLTGATASKKGFLKVGDQLQFDDTHWLNQQSKQTLYNGASALSFTATVMEDANAHSSGD 308 (423) T ss_pred cccccccccccceeeccceeceeEEecceEeecceeeecccccceeecccCCcceEEEEEecccccccCc Confidence 110 000000 00111222222111111 11 11 112221110 00 No 34 >protein:vir:97331 Length: 319 # NCBI annotation: ORF011 # Family: family:all:701 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240611;genbank:gi:66396278;genbank:GeneID:5133687 Probab=100.00 E-value=7.6e-45 Score=262.45 Aligned_cols=265 Identities=20% Similarity=0.183 Sum_probs=221.6 Q ss_pred CccchhhHHHHHHHHHHHHHHhhccch-hhcccccccccCCceEEEeecCcccceeecCCCcccCCCCCccceEEEEEee Q lcl|NC_011288. 1 MAFNNFIPELWSDMLLEEWTAQTVFAN-LVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLLIDQ 79 (273) Q Consensus 1 MA~~~~~pev~~~~~~~~~~~~lv~~~-~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~id~ 79 (273) =+|+....|.|++.|.+.+...++... ++|+++++ ..|++|+||+++..++.||+|.++ .++++++.++.+++|+| T Consensus 25 ~~nt~~l~~k~~~~LD~~~~~~~~s~~~~~N~~~e~--~gg~tVkIp~i~~~gl~DY~R~~g-~~~g~vt~~~~t~tidq 101 (319) T protein:vir:97 25 EPGQTLLKNKHVGILERVTAVNAYSTPALISNDAIF--MEGRSFTVMKGDTTELKDYKRNAT-NEFDHPKIEETTYFLDQ 101 (319) T ss_pred CcchHHHHHHHHHHHHHHHHHhhhhhhcccCcceEe--ccCcEEEEeeecccccccccCCCC-cccCCcccceeEEEeec Confidence 445566788899998887777776554 57877765 469999999999999999998764 67789999999999999 Q ss_pred eeecceEEchHHHHhhhHHH--HHH-HHHHHHHHHHHHHHHHHHHHhhcccccccccCCCHHHHHHHHHHHHHHHhhcCC Q lcl|NC_011288. 80 EKSIDFLVDDIDRVQVAGSL--EAY-TRAGATALATDTDKFIADLLVDNGTALSGSAPTDADDAFDLIATALKELTKANV 156 (273) Q Consensus 80 ~~~~~~~i~d~d~~~~~~~~--~~~-~~~~~~ala~~~D~~i~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~~~l~~~~v 156 (273) ++++.|.||+.|..++...+ ..+ .+++...+++++|.+.++.+.+.+... .+.+.|+.++|+.|.++++.|++++| T Consensus 102 dR~~~F~VD~~D~~Etn~~l~a~~i~~~~~~~~v~PEiDay~~skla~~a~~~-~~~~~t~~n~y~~i~~a~~~Lde~~V 180 (319) T protein:vir:97 102 EKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAKH-LTVGTGSDAQYDAVLDVSVELDEIKA 180 (319) T ss_pred ccccccccchhhHhhhhchhhHHHHHHHHHHHHhhhhhhHHHHHHHHhhcccc-cccccCHHHHHHHHHHHHHHHHhcCC Confidence 99999999999998887765 344 456777899999999999998766543 34457889999999999999999999 Q ss_pred CccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCccccCCCcEEEEEcCceeEEeeeeeee Q lcl|NC_011288. 157 PNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYVSQIDTV 236 (273) Q Consensus 157 P~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~~~~~~~~~~~~a~~~a~~~~~~ 236 (273) | ++|||+|+|+++..|++++.|..+.+. ....+++|.||+++||+|+++++... .+..++++|++|+.++.|.+++ T Consensus 181 P-~~Rvl~Vtp~~~~~L~~~~~f~~~~~~--~~~~~~~g~Vg~idG~~Vi~vps~~~-k~in~i~~h~~A~~~~~k~~~~ 256 (319) T protein:vir:97 181 P-ENRVLFVSPTFYKGIKKFVIALPQGDT--RQQVLGKGVQGELDGFVIVKVPTKLL-QGLQAIAVVGEVLASPIQADLA 256 (319) T ss_pred C-CCcEEEeCHHHHHHHHhhhhhhccccc--cccceeeeeceeecCeEEEEeccccc-ccceEEEEcCCeeeeeeeeeee Confidence 9 699999999999999999987765544 34578899999999999999865543 3566899999999999999999 Q ss_pred hhhcC-CCceeeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 237 EALRD-QDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 237 e~~~~-~~~~~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) +.+++ ++++|+.|++|++||++|++|+...++....+ T Consensus 257 ~~~~p~~~~~a~~v~gr~y~d~~V~~~k~~~Iy~~~~~ 294 (319) T protein:vir:97 257 KTNSNIPGMFGTLAEQLLYTGAFVPEHLQKYIFTIGGT 294 (319) T ss_pred eccCCCccccceeeeeeeeeeeEEeccccceEEEeecC Confidence 98874 88999999999999999999995555543333 No 35 >protein:vir:94800 Length: 319 # NCBI annotation: ORF012 # Family: family:all:701 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240536;genbank:gi:66396203;genbank:GeneID:5133580 Probab=100.00 E-value=7.6e-45 Score=262.45 Aligned_cols=265 Identities=20% Similarity=0.183 Sum_probs=221.6 Q ss_pred CccchhhHHHHHHHHHHHHHHhhccch-hhcccccccccCCceEEEeecCcccceeecCCCcccCCCCCccceEEEEEee Q lcl|NC_011288. 1 MAFNNFIPELWSDMLLEEWTAQTVFAN-LVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLLIDQ 79 (273) Q Consensus 1 MA~~~~~pev~~~~~~~~~~~~lv~~~-~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~id~ 79 (273) =+|+....|.|++.|.+.+...++... ++|+++++ ..|++|+||+++..++.||+|.++ .++++++.++.+++|+| T Consensus 25 ~~nt~~l~~k~~~~LD~~~~~~~~s~~~~~N~~~e~--~gg~tVkIp~i~~~gl~DY~R~~g-~~~g~vt~~~~t~tidq 101 (319) T protein:vir:94 25 EPGQTLLKNKHVGILERVTAVNAYSTPALISNDAIF--MEGRSFTVMKGDTTELKDYKRNAT-NEFDHPKIEETTYFLDQ 101 (319) T ss_pred CcchHHHHHHHHHHHHHHHHHhhhhhhcccCcceEe--ccCcEEEEeeecccccccccCCCC-cccCCcccceeEEEeec Confidence 445566788899998887777776554 57877765 469999999999999999998764 67789999999999999 Q ss_pred eeecceEEchHHHHhhhHHH--HHH-HHHHHHHHHHHHHHHHHHHHhhcccccccccCCCHHHHHHHHHHHHHHHhhcCC Q lcl|NC_011288. 80 EKSIDFLVDDIDRVQVAGSL--EAY-TRAGATALATDTDKFIADLLVDNGTALSGSAPTDADDAFDLIATALKELTKANV 156 (273) Q Consensus 80 ~~~~~~~i~d~d~~~~~~~~--~~~-~~~~~~ala~~~D~~i~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~~~l~~~~v 156 (273) ++++.|.||+.|..++...+ ..+ .+++...+++++|.+.++.+.+.+... .+.+.|+.++|+.|.++++.|++++| T Consensus 102 dR~~~F~VD~~D~~Etn~~l~a~~i~~~~~~~~v~PEiDay~~skla~~a~~~-~~~~~t~~n~y~~i~~a~~~Lde~~V 180 (319) T protein:vir:94 102 EKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAKH-LTVGTGSDAQYDAVLDVSVELDEIKA 180 (319) T ss_pred ccccccccchhhHhhhhchhhHHHHHHHHHHHHhhhhhhHHHHHHHHhhcccc-cccccCHHHHHHHHHHHHHHHHhcCC Confidence 99999999999998887765 344 456777899999999999998766543 34457889999999999999999999 Q ss_pred CccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCccccCCCcEEEEEcCceeEEeeeeeee Q lcl|NC_011288. 157 PNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYVSQIDTV 236 (273) Q Consensus 157 P~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~~~~~~~~~~~~a~~~a~~~~~~ 236 (273) | ++|||+|+|+++..|++++.|..+.+. ....+++|.||+++||+|+++++... .+..++++|++|+.++.|.+++ T Consensus 181 P-~~Rvl~Vtp~~~~~L~~~~~f~~~~~~--~~~~~~~g~Vg~idG~~Vi~vps~~~-k~in~i~~h~~A~~~~~k~~~~ 256 (319) T protein:vir:94 181 P-ENRVLFVSPTFYKGIKKFVIALPQGDT--RQQVLGKGVQGELDGFVIVKVPTKLL-QGLQAIAVVGEVLASPIQADLA 256 (319) T ss_pred C-CCcEEEeCHHHHHHHHhhhhhhccccc--cccceeeeeceeecCeEEEEeccccc-ccceEEEEcCCeeeeeeeeeee Confidence 9 699999999999999999987765544 34578899999999999999865543 3566899999999999999999 Q ss_pred hhhcC-CCceeeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 237 EALRD-QDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 237 e~~~~-~~~~~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) +.+++ ++++|+.|++|++||++|++|+...++....+ T Consensus 257 ~~~~p~~~~~a~~v~gr~y~d~~V~~~k~~~Iy~~~~~ 294 (319) T protein:vir:94 257 KTNSNIPGMFGTLAEQLLYTGAFVPEHLQKYIFTIGGT 294 (319) T ss_pred eccCCCccccceeeeeeeeeeeEEeccccceEEEeecC Confidence 98874 88999999999999999999995555543333 No 36 >protein:vir:78935 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522824;genbank:gi:158345059;genbank:GeneID:5687425 Probab=100.00 E-value=9.4e-46 Score=267.44 Aligned_cols=267 Identities=13% Similarity=0.167 Sum_probs=224.7 Q ss_pred Cccch----------------hhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeecCCCcccC Q lcl|NC_011288. 1 MAFNN----------------FIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTS 64 (273) Q Consensus 1 MA~~~----------------~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~~~ 64 (273) |++-+ |+ |+|+.++++.|.+.++|.+++++. +++.|++++||.+|..++..+++ |.++. T Consensus 1 ms~~~~~t~~~~~~s~~d~al~l-e~f~geV~~af~~~s~~~~~~~~r---ti~~g~s~~~~~iG~~~~~~~~p-G~~l~ 75 (335) T protein:vir:78 1 MSFLNDLTRPNYAGKNADVDIHL-EEHLGIVDKHFAYTSKFAPLMNIR---DLRGSNVVRLDRLGNVEAKGRRA-GEELE 75 (335) T ss_pred CCccccccccccccccchhhhhh-hhhhhHHHHHHHHhhhhcccccee---eeccceeEEEeeeeeeeeccccc-CcccC Confidence 66543 44 999999999999999999998765 67889999999999999877766 55677 Q ss_pred CCCCccceEEEEEeeeeecceEEchHHHHhhhHHH-HHHHHHHHHHHHHHHHHHHHHHHhhccccc-------------- Q lcl|NC_011288. 65 ADAISDTGVDLLIDQEKSIDFLVDDIDRVQVAGSL-EAYTRAGATALATDTDKFIADLLVDNGTAL-------------- 129 (273) Q Consensus 65 ~~~~~~~~~~~~id~~~~~~~~i~d~d~~~~~~~~-~~~~~~~~~ala~~~D~~i~~~~~~~~~~~-------------- 129 (273) .+.+..++..++||+.++..+.|+|+|+.+.++|+ .++.++++++||+.+|+.++..+..++... T Consensus 76 ~~~~~~~k~~itID~ll~a~~~VddlDe~~~~yDvR~e~s~~~G~aLA~~~Dq~~~~~l~~aa~~~a~~~~~~~~~~G~~ 155 (335) T protein:vir:78 76 RSRVVNDKWNLTVDTLLYLRHQFDHQDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVDLEDAFSPGVL 155 (335) T ss_pred CCCcccCCeEEEecceeechhhHhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccCCCcCCCcc Confidence 88899999999999999999999999999999996 569999999999999999887664433110 Q ss_pred ------ccccCCCHHHHHHHHHHHHHHHhhcCCCcc---CCEEEECHHHHHHHhhhHHHHhhhhc--ccccceeeeeeee Q lcl|NC_011288. 130 ------SGSAPTDADDAFDLIATALKELTKANVPNV---GRVVVVNAEMAFWLRSSGSKLTSADT--SGDAAGLRAGTIG 198 (273) Q Consensus 130 ------~~~~~~t~~~~~~~i~~a~~~l~~~~vP~~---~r~lvv~p~~~~~L~~~~~~~~~~~~--~~~~~~l~~G~ig 198 (273) ..+...++..+.+++.++.+.|++++||+. +|+++|+|++|..|+.+++++.+ ++ +++...+.+|.|+ T Consensus 156 ~~~~~tg~~~~~~~~~l~~a~~~a~~~l~ekdvP~~~~~~rv~vv~P~~y~~Ll~~~~l~n~-~~~~s~~~~~~~~g~v~ 234 (335) T protein:vir:78 156 EKLDLTGLTAKEAAEKIVRMHRRVVETFIERDLGDAVYSEGLTPMSPRVFSLLLEHDKLMSV-EYQATGATNDYVKSRVA 234 (335) T ss_pred eeeeeccccccccHHHHHHHHHHHHHHHHhccCCCCCCCccEEEeChHHHHHHhcccccccc-cccccccccccccceeE Confidence 011223455678889999999999999975 69999999999999999876544 43 2334568899999 Q ss_pred eEeceEEEeeCccccCCC------------------cEEEEEcCceeEEeeeee-eehhhcCCCceeeeEEeeeeeeeEE Q lcl|NC_011288. 199 NLLGARIVESNNLRDTDD------------------EQFVAFHPSAAAYVSQID-TVEALRDQDSFSDRIRALHVYGGKV 259 (273) Q Consensus 199 ~~~G~~v~~s~~l~~~~~------------------~~~~~~~~~a~~~a~~~~-~~e~~~~~~~~~~~v~~~~~~g~~v 259 (273) +++||+|++||++|.... ..++.+|+.|++.+++++ ..|..|++++|+|+|.+.+.||+++ T Consensus 235 ~v~Gv~V~~Sn~lP~~~~t~~~lg~a~n~~~~d~~~~~~~~~~~~Al~t~~~~~~~~e~~~~~~~~~~~i~~~~a~G~g~ 314 (335) T protein:vir:78 235 ILNGVKVLETPRFATKAISAHPLGRHFNVSAEEAERQIALFLPSKTLITAQVAPVQAKLWEDHDQFSWVLDTFQMYNIGA 314 (335) T ss_pred EeeceEEEeeccCCCCCCccccccccCCcccccccceEEEEEecceEEEEEEEecccceeeccchhhHhhhHHHHcCCcc Confidence 999999999999995431 246789999999999887 5689999999999999999999999 Q ss_pred ecCceEEEEecCCC Q lcl|NC_011288. 260 VRPTGVVVFNKTGS 273 (273) Q Consensus 260 ~~~~~~v~~~~~~s 273 (273) +||||+++++.+|. T Consensus 315 lRPe~a~~i~~tg~ 328 (335) T protein:vir:78 315 RRPDTAGAIELKGI 328 (335) T ss_pred cCcceEEEEEecCC Confidence 99999999999998 No 37 >protein:vir:99675 Length: 324 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249589;genbank:gi:68299740;genbank:GeneID:3799990 Probab=100.00 E-value=7.1e-46 Score=268.10 Aligned_cols=239 Identities=17% Similarity=0.155 Sum_probs=194.8 Q ss_pred hhcccccccccCCceEEEeecCcccceeecCCCcc-cCCCCCccceEEEEEeeeeecceEEchHHHHhhhHHH-HHHHHH Q lcl|NC_011288. 28 LVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQ-TSADAISDTGVDLLIDQEKSIDFLVDDIDRVQVAGSL-EAYTRA 105 (273) Q Consensus 28 ~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~id~~~~~~~~i~d~d~~~~~~~~-~~~~~~ 105 (273) ++ .+++.|++++||++|..++..|+++... .+++++..++.+++||+.+++.+.|+|+|+.+.++++ .++.++ T Consensus 1 ~v-----r~i~~g~s~~~~~iG~~~~~~~~~G~~l~~~~~~~~~~e~~itID~~l~~~~~VdDiD~~qa~~Dlr~e~s~~ 75 (324) T protein:vir:99 1 MT-----RTITSGKSAQFPVMGRTKARYLKQGQSLDDGREDIKHTEKVITIDGLLTTDVLIYDIEDAMNHYDVRSEYSTQ 75 (324) T ss_pred Ce-----eeeecCceEEEeeeeeeEeccccCCCCcCCCcCCcCcccEEEEecchhhhhhhhhhHHHHhcCccchhHHHHH Confidence 22 3577899999999999999999875543 2457799999999999999999999999999999996 569999 Q ss_pred HHHHHHHHHHHHHHHHHhhcccc----------------------cccccCCCHHHHHHHHHHHHHHHhhcCCCccCCEE Q lcl|NC_011288. 106 GATALATDTDKFIADLLVDNGTA----------------------LSGSAPTDADDAFDLIATALKELTKANVPNVGRVV 163 (273) Q Consensus 106 ~~~ala~~~D~~i~~~~~~~~~~----------------------~~~~~~~t~~~~~~~i~~a~~~l~~~~vP~~~r~l 163 (273) ++++||+.+|+.++..+...... .......++..+++.|.++++.|++++||.++||+ T Consensus 76 ~G~aLA~~~Dq~i~~~~a~~~~~~a~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~dai~~a~~~Lde~~VP~~gR~~ 155 (324) T protein:vir:99 76 MGEALAMAADVANYAEMAKLVNSRKETTNENIEGLGAASLVKITGKKEDPAKYGTQVIQALTYARAAFAKKYIPAGDRTF 155 (324) T ss_pred HHHHHHHHHHHHHHHHHHHhhhcccccccCCcccCCccceecccccccccccCHHHHHHHHHHHHHHHhhcCCCCCCCEE Confidence 99999999999998776421100 00011234567899999999999999999999999 Q ss_pred EECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCccccCC---------------------------- Q lcl|NC_011288. 164 VVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTD---------------------------- 215 (273) Q Consensus 164 vv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~~---------------------------- 215 (273) ||+|++|..|+.+. ++.+.+. ++.+.+++|.|++++||+||+||++|... T Consensus 156 vv~P~~y~~Ll~~~-~~~~~~~-~~~~~~~~G~V~~i~Gf~V~~Sn~lp~~~~t~~~~a~~~~~~~~~~~~~~~~~~ky~ 233 (324) T protein:vir:99 156 YTDPDTYSAILAAL-MPNAANY-AALIDPETGNIRNVMGFEVVETPHMTAQMVTNPTDAFDGTGHIFPATGDSTTTGKMT 233 (324) T ss_pred EeChHHHHHHhhcc-ccccccc-ccccceecceEEEEeceEEEecCCccccccccccccccccccccccccccccccccc Confidence 99999999887654 4444444 45567999999999999999999998531 Q ss_pred ----CcEEEEEcCceeEEeeeee-eehhhcCCCceeeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 216 ----DEQFVAFHPSAAAYVSQID-TVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 216 ----~~~~~~~~~~a~~~a~~~~-~~e~~~~~~~~~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) ...++.+|+++.+..++++ ++|..|++++|+|+|.|+++||++++||||+++++..++ T Consensus 234 ~d~~~~~gl~~~~~a~~tv~~~~~~~e~~~~~~~~~d~i~~~~a~G~~~lRPe~a~~v~l~~~ 296 (324) T protein:vir:99 234 VGADNVVGLFVHRSAVATLKLKDMALERARRPEYQADQIIAKYAMGHGGLRPEAVGAIIFEDG 296 (324) T ss_pred cccCceeEEEEehhheEEEeeecceecceechhhHHHhhhhhhhhcCcccccceEEEEEEccC Confidence 1224788999999888776 799999999999999999999999999999975554444 No 38 >protein:vir:107120 Length: 329 # NCBI annotation: conserved phage protein # Family: family:all:701 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950606;genbank:gi:119953686;genbank:GeneID:4643129 Probab=100.00 E-value=2.5e-44 Score=259.63 Aligned_cols=265 Identities=17% Similarity=0.166 Sum_probs=221.5 Q ss_pred CccchhhHHHHHHHHHHHHHHhhc-cchhhcccccccccCCceEEEeecCcccceeecCCCcccCCCCCccceEEEEEee Q lcl|NC_011288. 1 MAFNNFIPELWSDMLLEEWTAQTV-FANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLLIDQ 79 (273) Q Consensus 1 MA~~~~~pev~~~~~~~~~~~~lv-~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~id~ 79 (273) =.|++...|.|++.|++.|...+. ...++|++++ +..|++|+||+++..++.||+|.++ .++++++.++.+++|+| T Consensus 36 ~~nt~~l~~k~~~~LD~~~~~~~~s~~~~~N~~~e--~~~g~tVkIp~i~~~gl~DY~R~~g-~~~g~vt~~~~t~tidq 112 (329) T protein:vir:10 36 EPGDTLLKNKHVGILEKVTAANSYSAPAVISNDAI--FMQGRSFTVIKGDVTELKDYKRNAT-NEFDHPQIQETTYFLDQ 112 (329) T ss_pred CCchhHHHHHHHHHHHHHHHhhceeeeeeccccee--eccCcEEEEeeecccccccccCCCC-ccccccccceeEEEeec Confidence 455667789999999999987654 4457888876 4569999999999999999998765 56788999999999999 Q ss_pred eeecceEEchHHHHhhhHHH--HHH-HHHHHHHHHHHHHHHHHHHHhhcccccccccCCCHHHHHHHHHHHHHHHhhcCC Q lcl|NC_011288. 80 EKSIDFLVDDIDRVQVAGSL--EAY-TRAGATALATDTDKFIADLLVDNGTALSGSAPTDADDAFDLIATALKELTKANV 156 (273) Q Consensus 80 ~~~~~~~i~d~d~~~~~~~~--~~~-~~~~~~ala~~~D~~i~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~~~l~~~~v 156 (273) ++++.|.||+.|..++...+ ..+ .+++...+++++|.+.++.+.+.+... ...+.++.++|+.|.++++.|+++++ T Consensus 113 dR~~~F~VD~~D~dEtn~~l~a~~i~~~~~~~~v~pEiDay~~skla~~a~~~-~~~~~t~~nay~~i~~a~~~Lde~~v 191 (329) T protein:vir:10 113 EKYWGRFVDALDRRDTEGNIDINYVVAKQASEVVAPYLDNLRFATLARNKAKH-LTVGSGADAQYDAVLDVSVELDEIGA 191 (329) T ss_pred ccceeeecchhhHhhhhhhhhHHHHHHHHHHHHhhhHHHHHHHHHHHhhcccc-cccccCHHHHHHHHHHHHHHHHhcCC Confidence 99999999999988887654 344 456778899999999999998766543 34567889999999999999999999 Q ss_pred CccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCccccCCCcEEEEEcCceeEEeeeeeee Q lcl|NC_011288. 157 PNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYVSQIDTV 236 (273) Q Consensus 157 P~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~~~~~~~~~~~~a~~~a~~~~~~ 236 (273) | ++||++|+|+++..|++++.|...... ..+.+++|.||+++||+|+++++.+. .+..++++|++|+.++.|.+++ T Consensus 192 p-~~Rvl~VtP~~~~~Lk~~~~f~~~~~~--~~~~~~~g~Vg~idG~~Ii~vps~~~-k~in~ii~~~~A~~~~~K~~~~ 267 (329) T protein:vir:10 192 G-ASRILFVTPKFYKGIKKFVIELPQGDN--RQQVLGKGVQGELDGFTIVKVPSKML-QGVEAMAVIGEVMASPIQANEA 267 (329) T ss_pred C-CCcEEEeCHHHHHHHHhhhhhhccccc--cccceeeeeeeeecCeEEEEecCCcc-cceeEEEEcCCceeeeeeeeee Confidence 9 599999999999999998877654333 34578899999999999999876554 3556899999999999999999 Q ss_pred hhhcC-CCceeeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 237 EALRD-QDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 237 e~~~~-~~~~~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) |.+++ ++++|+.|++|++||++|++|++..++....+ T Consensus 268 ~~~~p~~~~~a~~v~gr~yyd~~V~~~k~~~I~~~~~~ 305 (329) T protein:vir:10 268 KLNSNVPGMFGTLAEQMLYTGAFVPEHLQKYIFTIGGK 305 (329) T ss_pred eeeCCCCccchheeeeeeeeeeEEEccccCEEEEeccc Confidence 98875 88999999999999999999995554443322 No 39 >protein:vir:6324 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877471;genbank:gi:33300843;uniprot:Q7Y2D3;genbank:GeneID:1482613 Probab=100.00 E-value=7.3e-45 Score=262.54 Aligned_cols=267 Identities=13% Similarity=0.154 Sum_probs=223.4 Q ss_pred Cccch----------------hhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeecCCCcccC Q lcl|NC_011288. 1 MAFNN----------------FIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTS 64 (273) Q Consensus 1 MA~~~----------------~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~~~ 64 (273) |++-+ |+ |+|+.++++.|.+.++|.++++.. +++.|++++||.+|..++..++++ .++. T Consensus 1 ms~~~~~tr~~~~~s~~d~al~l-e~f~geV~~af~~~s~~~~~~~~r---ti~~g~s~~~~~iG~~~~~~~~pG-~~l~ 75 (335) T protein:vir:63 1 MSFLNDLTRPNYAGKNADVDIHL-EEHLGIVDKHFAYTSKFAPLMNIR---DLRGSNVVRLDRLGNVEAKGRRAG-EELE 75 (335) T ss_pred CCCcccchhhhcccccchhheeh-hhhhhhHHHHHHhhhhhcccccee---eeccceeEEEeeeeeeeeecccCC-cCcC Confidence 66542 33 999999999999999999998765 678899999999999999988874 5567 Q ss_pred CCCCccceEEEEEeeeeecceEEchHHHHhhhHHH-HHHHHHHHHHHHHHHHHHHHHHHhhcccccc------------- Q lcl|NC_011288. 65 ADAISDTGVDLLIDQEKSIDFLVDDIDRVQVAGSL-EAYTRAGATALATDTDKFIADLLVDNGTALS------------- 130 (273) Q Consensus 65 ~~~~~~~~~~~~id~~~~~~~~i~d~d~~~~~~~~-~~~~~~~~~ala~~~D~~i~~~~~~~~~~~~------------- 130 (273) .+.+..++.+++||+..+..+.|+|+|+.+.++|+ .++.++++++||+..|+.++..+..++.... T Consensus 76 ~~~~~~~k~~itVD~ll~a~~~I~dlDe~~~~yDvRse~s~e~G~aLA~~~D~~~~~~i~~aa~~~a~~~~~~~~~~G~~ 155 (335) T protein:vir:63 76 RSRVVNDKWNLTVDTLLYLRHQFDHQDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVDLEDAFSPGVL 155 (335) T ss_pred CCCccccceEEEecceeechhhhhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccCCCcCCCcc Confidence 77788899999999999999999999999999996 5699999999999999999876654332110 Q ss_pred -------cccCCCHHHHHHHHHHHHHHHhhcCCCccC---CEEEECHHHHHHHhhhHHHHhhhhcc--cccceeeeeeee Q lcl|NC_011288. 131 -------GSAPTDADDAFDLIATALKELTKANVPNVG---RVVVVNAEMAFWLRSSGSKLTSADTS--GDAAGLRAGTIG 198 (273) Q Consensus 131 -------~~~~~t~~~~~~~i~~a~~~l~~~~vP~~~---r~lvv~p~~~~~L~~~~~~~~~~~~~--~~~~~l~~G~ig 198 (273) .+...++...+.++..|.++|++++||+++ |+++|+|++|..|+.+++++. .++. ++...+.+|.|+ T Consensus 156 ~~~~~tg~~~~~~~~~l~~a~~~a~~~L~e~dVP~~~~~dr~~vv~P~~y~~Ll~~~~l~n-~~~~~s~~~~~~~~g~v~ 234 (335) T protein:vir:63 156 EKLDLTGLTAKQAADKIVRMHRRVVETFIDRDLGDAVYSEGLTPMSPRVFSLLLEHDKLMN-VEYQATGATNDYVKSRVA 234 (335) T ss_pred eeeeeccCcccccHHHHHHHHHHHHHHHHhccCCCcccCceEEEeChHHHHHHhccccccc-cccccccccccccCceeE Confidence 011123455667888999999999999754 999999999999999987654 3432 333568899999 Q ss_pred eEeceEEEeeCccccCCC------------------cEEEEEcCceeEEeeeee-eehhhcCCCceeeeEEeeeeeeeEE Q lcl|NC_011288. 199 NLLGARIVESNNLRDTDD------------------EQFVAFHPSAAAYVSQID-TVEALRDQDSFSDRIRALHVYGGKV 259 (273) Q Consensus 199 ~~~G~~v~~s~~l~~~~~------------------~~~~~~~~~a~~~a~~~~-~~e~~~~~~~~~~~v~~~~~~g~~v 259 (273) +++||+|++||++|.... ..++++|++|++.+++.+ ..|..++..+|+|+|.+.+.||+++ T Consensus 235 ~v~Gv~V~~sn~lP~~~~t~~~lg~a~n~~~~d~~~~~~~~~~~~Al~t~~~~~vt~e~~~~~~~~~~~i~~~~a~G~g~ 314 (335) T protein:vir:63 235 ILNGVKVLETPRFATKAIAAHPLGRHFNVSAEESERQIALFLPSKTLITAQVAPVQAKLWEDNEKFSWVLDTFQMYNIGA 314 (335) T ss_pred EeeceEEEeeccCCCCCcccccccccCCccccccceeEEEEEecceEEEEEEeecccceeeccchhhHHhHHHHHcCCcc Confidence 999999999999995432 146789999999999876 6789999999999999999999999 Q ss_pred ecCceEEEEecCCC Q lcl|NC_011288. 260 VRPTGVVVFNKTGS 273 (273) Q Consensus 260 ~~~~~~v~~~~~~s 273 (273) +||||+++++.+|. T Consensus 315 lRPe~a~~i~~tg~ 328 (335) T protein:vir:63 315 RRPDTAGAIELKGI 328 (335) T ss_pred cccceEEEEEEcCC Confidence 99999999999888 No 40 >protein:vir:105334 Length: 276 # NCBI annotation: putative phage major capsid protein # Family: family:all:522 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950669;genbank:gi:119967839;genbank:GeneID:4643213 Probab=100.00 E-value=9.1e-43 Score=251.07 Aligned_cols=260 Identities=18% Similarity=0.194 Sum_probs=218.2 Q ss_pred Cccc------hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeecCCCcccCCCCCccceEE Q lcl|NC_011288. 1 MAFN------NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVD 74 (273) Q Consensus 1 MA~~------~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 74 (273) ||+. .++||+|++.+.+.+.+.++|.+++..+.+.++.+|++|+||.|...+..+....+..+++++++.++.+ T Consensus 1 Ma~~~T~l~d~i~Pev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~ti~iP~~~~igda~~~~eg~~i~~~~lt~~~~~ 80 (276) T protein:vir:10 1 MAQGTTTKSTQIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFVYSGDATVVPEGQKIPVDKIETNRRE 80 (276) T ss_pred CCcceeehhhhhchHHHHHHHHHHHHhhhhhcccceecccccCCCCCEEEeeeecCCCccccccCCCccCccccccceee Confidence 9975 4789999999999999999999999998888888999999999988765544455677889999999999 Q ss_pred EEEeeeeecceEEchHHHHhhhHH-HHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccCCCHHHHHHHHHHHHHHHhh Q lcl|NC_011288. 75 LLIDQEKSIDFLVDDIDRVQVAGS-LEAYTRAGATALATDTDKFIADLLVDNGTALSGSAPTDADDAFDLIATALKELTK 153 (273) Q Consensus 75 ~~id~~~~~~~~i~d~d~~~~~~~-~~~~~~~~~~ala~~~D~~i~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~~~l~~ 153 (273) ++|. ++++.+.++|++......+ +.+++++++.++++++|+++++.+......... ....++.|.+|...|++ T Consensus 81 a~i~-~~~k~~~~tD~a~~~~~~dp~~~~~~~~~~~~a~~~d~~~~~~l~~~~~~~~~-----~~~t~d~i~~A~~~lgd 154 (276) T protein:vir:10 81 AKIH-KIGKGTDITDEALLSGYGDPQGEAVRQHGLAIANKVDNDVLEALRGTKLTVSA-----DIGTLAGLEAAIDTFDD 154 (276) T ss_pred EEee-hccccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhcccccccc-----cccCHHHHHHHHHHhcc Confidence 9995 5799999999998888777 688999999999999999999998776544322 12347889999999998 Q ss_pred cCCCccCCEEEECHHHHHHHhhh--HHHHhhhhcccccceeeeeeeeeEeceEEEeeCccccCCCcEEEEEcCceeEEee Q lcl|NC_011288. 154 ANVPNVGRVVVVNAEMAFWLRSS--GSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYVS 231 (273) Q Consensus 154 ~~vP~~~r~lvv~p~~~~~L~~~--~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~~~~~~~~~~~~a~~~a~ 231 (273) ++. +.++++|+|+.++.|+++ ..|+. .... +.+.+++|+||++.|++|++|+++|. ..+++++++|+++.. T Consensus 155 ~~~--~~~~ivv~p~~~~~L~k~~~~~f~~-~s~~-g~~~~~~G~ig~~~G~~Vi~s~~~p~---~t~~l~~~gAi~~~~ 227 (276) T protein:vir:10 155 EDL--EPMVLFINPKDAGKLRSSASDNFTR-ATEL-GDNIIVKGAFGEALGAVIVRSKKLDE---GEAILAKRGAVKLIT 227 (276) T ss_pred ccC--cccEEEEcHHHHHHHHHhccccccc-cccc-cccceeccccceecceeEEEcCCCCc---ceEEEEeccceeeee Confidence 875 679999999999999775 34433 2333 34578999999999999999999985 356788899999766 Q ss_pred e-eeeehhhcCCCceeeeEEeeeeeeeEEecCceEEEEe-cCCC Q lcl|NC_011288. 232 Q-IDTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFN-KTGS 273 (273) Q Consensus 232 ~-~~~~e~~~~~~~~~~~v~~~~~~g~~v~~~~~~v~~~-~~~s 273 (273) + ...+|..|++++++|.|+++++||+++++|+++++++ +++| T Consensus 228 ~~~~~vE~dRd~~~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~ 271 (276) T protein:vir:10 228 KRDFFLETDRDPSTKTTALYSDKHYVAYLYDESKAVKVTKGAGT 271 (276) T ss_pred cCCceeecccchhhcccEEEEeeEEEEEEEcCcceEEEecCCcC Confidence 4 4589999999999999999999999999999999776 3444 No 41 >protein:vir:97031 Length: 402 # NCBI annotation: 31 # Family: family:all:2806 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654132;genbank:gi:108862016;genbank:GeneID:5075980 Probab=100.00 E-value=1.8e-43 Score=254.92 Aligned_cols=269 Identities=15% Similarity=0.113 Sum_probs=221.6 Q ss_pred Cccch---------------hhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeecCCCcccCC Q lcl|NC_011288. 1 MAFNN---------------FIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSA 65 (273) Q Consensus 1 MA~~~---------------~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~~~~ 65 (273) |++-| +.=|+|+.++++.|.+.++|.++++.. +++.|++++||..|..++..++++ ..+.. T Consensus 1 Ms~~n~~t~~~~~~s~~~~al~le~f~geV~taF~~~si~~~~~~vr---ti~~GkS~qf~~iG~~~a~y~~~G-~~ldg 76 (402) T protein:vir:97 1 MSTPNTLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDVQ---TVTGTNTVSNKYLGETELQVLAPG-QSPNA 76 (402) T ss_pred CCCcccccccccccccchhhhhhhhhhhhHHHHHHHHHhhcCcceee---eecccceEEEEEEeeeEEeeeccc-cccCC Confidence 66543 234899999999999999999888653 678899999999999999887764 45677 Q ss_pred CCCccceEEEEEeeeeecceEEchHHHHhhhHH-HH-HHHHHHHHHHHHHHHHHHHHHHhhccccc-------------- Q lcl|NC_011288. 66 DAISDTGVDLLIDQEKSIDFLVDDIDRVQVAGS-LE-AYTRAGATALATDTDKFIADLLVDNGTAL-------------- 129 (273) Q Consensus 66 ~~~~~~~~~~~id~~~~~~~~i~d~d~~~~~~~-~~-~~~~~~~~ala~~~D~~i~~~~~~~~~~~-------------- 129 (273) +.+..++..++||+..+..+.|+|+|+.+.+++ ++ ++.++++++||+.+|+.++..+...+... T Consensus 77 ~~~~~~k~~ItID~lL~a~~~V~diDeaq~~yD~vRse~s~e~G~ALA~~~Dq~ii~~i~~aa~a~t~~~~~~~~~~~~g 156 (402) T protein:vir:97 77 TPTQADKNQLVIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKGHG 156 (402) T ss_pred CCcccccEEEEeCceeechhhhhhHHHHHhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccCcccccc Confidence 888999999999999999999999999999998 65 58899999999999999988775422100 Q ss_pred -----cc---ccCCCHHHHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEe Q lcl|NC_011288. 130 -----SG---SAPTDADDAFDLIATALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLL 201 (273) Q Consensus 130 -----~~---~~~~t~~~~~~~i~~a~~~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~ 201 (273) .+ .+.+++..+++.|.++.+.|++++||.++|+++++|++|..|+++++++.+.....+...+.+|.|++++ T Consensus 157 ~s~~~~~t~~~a~~~~~~l~~ai~~a~~~LdEkdVP~~dRv~vv~P~~y~~Ll~~~rl~n~d~~~~~~g~~~~G~v~~v~ 236 (402) T protein:vir:97 157 FSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATINGFVLSSY 236 (402) T ss_pred cccccccccchhhcCHHHHHHHHHHHHHHHHhcCCCccccEEEeChHHHHHHhhcccccchhhccccCCccccceeEEEe Confidence 00 1124667788999999999999999999999999999999999988765443223444568899999999 Q ss_pred ceEEEeeCccccCC------------------------CcEEEEEcCceeEEeeeee-eehhhcCCCceeeeEEeeeeee Q lcl|NC_011288. 202 GARIVESNNLRDTD------------------------DEQFVAFHPSAAAYVSQID-TVEALRDQDSFSDRIRALHVYG 256 (273) Q Consensus 202 G~~v~~s~~l~~~~------------------------~~~~~~~~~~a~~~a~~~~-~~e~~~~~~~~~~~v~~~~~~g 256 (273) ||+||+||++|... ...++.+|+.|++.++.++ ..|..|++++|+++|.+.+.|| T Consensus 237 Gv~Vv~SnnlP~~a~~it~~~ls~a~~G~~y~~t~d~t~~~~~~f~~~Av~tvk~~~vT~~~~~d~r~~~~~id~~~a~G 316 (402) T protein:vir:97 237 NCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKEKTYYIDTFMAEG 316 (402) T ss_pred ceEEEecCccccccccccccccccCCCCccCCcCcccceeEEEEEecceEEEEEeeccccchhhchhHHHHHHHHHHHhC Confidence 99999999998532 0135788999999988665 6688999999999999999999 Q ss_pred eEEecCceEEEEecCC--C Q lcl|NC_011288. 257 GKVVRPTGVVVFNKTG--S 273 (273) Q Consensus 257 ~~v~~~~~~v~~~~~~--s 273 (273) ++++|||++.++.... | T Consensus 317 ~g~~RPeaa~vv~~~~~~t 335 (402) T protein:vir:97 317 AIPDRWEAVSVVTTKRDAT 335 (402) T ss_pred CcccCccceEEEEEecccc Confidence 9999999988885444 2 No 42 >protein:vir:3033 Length: 272 # NCBI annotation: major capsid protein # Family: family:all:522 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438146;genbank:gi:16271809;genbank:GeneID:929235 Probab=100.00 E-value=2.9e-40 Score=237.36 Aligned_cols=260 Identities=19% Similarity=0.189 Sum_probs=215.1 Q ss_pred Cccc------hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeecCCCcccCCCCCccceEE Q lcl|NC_011288. 1 MAFN------NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVD 74 (273) Q Consensus 1 MA~~------~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 74 (273) ||++ .++||+|++.+.+.+.+.+++.+++.++++.++.+|++|+||++...+......++..++.++++.++++ T Consensus 1 MA~~~T~~~~~~iPev~s~~v~~~~~~~~~~~~~~~~~~~~~g~~G~tv~iP~~~~~~~a~~v~eg~~i~~~~~~~~~~~ 80 (272) T protein:vir:30 1 MAVGTTKMAQMLDPEVLADMIDAEVGKAIRFAPLAEVDTTLEGQPGTTLTVPKWDYIGDAEDVAEGEAIPMTQLGFKKTT 80 (272) T ss_pred CCCccccchheechHHHHHHHHHHHHHHhhhhccccccccccCCCCCEEEEEEecCCCCcccccCCCcccccccccceEE Confidence 9976 4899999999999999999999999988888888999999999976543433445667888999999999 Q ss_pred EEEeeeeecceEEchHHHHhhhHH-HHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccCCCHHHHHHHHHHHHHHHhh Q lcl|NC_011288. 75 LLIDQEKSIDFLVDDIDRVQVAGS-LEAYTRAGATALATDTDKFIADLLVDNGTALSGSAPTDADDAFDLIATALKELTK 153 (273) Q Consensus 75 ~~id~~~~~~~~i~d~d~~~~~~~-~~~~~~~~~~ala~~~D~~i~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~~~l~~ 153 (273) +++++ .+..+.++|++..+...+ +..+.+++++++++++|.++++.+..+.... +....++.|.+|...|++ T Consensus 81 ~~~~~-~~~~~~itd~~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~a~~~~------~~~~t~d~i~da~~~l~~ 153 (272) T protein:vir:30 81 MTIKK-AGKGVEITDEAILSGYGDPVGQAAKQIVEAIDHKVDADVLDALSKSTQTV------EATATVDGVSKALDIFND 153 (272) T ss_pred EEeee-eeeeeeecHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHhccccccc------ccccCHHHHHHHHHHHhc Confidence 99976 567899999998887777 4678999999999999999998876554332 223457889999999988 Q ss_pred cCCCccCCEEEECHHHHHHHhhhHHH-HhhhhcccccceeeeeeeeeEeceEEEeeCccccCCCcEEEEEcCceeEEeee Q lcl|NC_011288. 154 ANVPNVGRVVVVNAEMAFWLRSSGSK-LTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYVSQ 232 (273) Q Consensus 154 ~~vP~~~r~lvv~p~~~~~L~~~~~~-~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~~~~~~~~~~~~a~~~a~~ 232 (273) .+. +.++++++|+++..|+++... +...... ..+.+++|.+|++.|++|++|+++|.+ .++++++++++++.+ T Consensus 154 ~~~--~~~~~vv~p~~~~~L~k~~~~~~~~~~~~-~~~~~~~g~ig~i~G~~Vi~s~~~p~~---t~~~~~~~a~~~~~~ 227 (272) T protein:vir:30 154 EDD--AETVIVMNPADASTLRLDAAKEWLGATEV-GANRVVSGVYGEVLGVQIVRSRKCPKG---TAYMVRKGALRIMLK 227 (272) T ss_pred cCC--CccEEEEcHHHHHHHHHhccccccccccc-cccccccccchhhcCeeEEEcCCCCcc---eEEEEcCCeEEEEec Confidence 864 578999999999999876421 1111222 234688999999999999999999853 467889999998875 Q ss_pred ee-eehhhcCCCceeeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 233 ID-TVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 233 ~~-~~e~~~~~~~~~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) .+ .+|..|++.++.+.++++++||+++++|+++|.++-..+ T Consensus 228 ~~~~ve~~r~~~~~~~~i~~~~~~~~~v~~~~~vv~~t~~~a 269 (272) T protein:vir:30 228 RNTMVETDRDITKAINQIVANKHYGVYLYKAEKAVKITLKDA 269 (272) T ss_pred CCceeeeccccccceeEEEEEEEEEEEEEcCCceEEEEeccc Confidence 54 789999999999999999999999999999999988888 No 43 >protein:vir:9820 Length: 272 # NCBI annotation: putative major capsid/head protein # Family: family:all:522 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795582;genbank:gi:28876339;genbank:GeneID:1257858 Probab=100.00 E-value=2.9e-40 Score=237.36 Aligned_cols=260 Identities=19% Similarity=0.189 Sum_probs=215.1 Q ss_pred Cccc------hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeecCCCcccCCCCCccceEE Q lcl|NC_011288. 1 MAFN------NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVD 74 (273) Q Consensus 1 MA~~------~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 74 (273) ||++ .++||+|++.+.+.+.+.+++.+++.++++.++.+|++|+||++...+......++..++.++++.++++ T Consensus 1 MA~~~T~~~~~~iPev~s~~v~~~~~~~~~~~~~~~~~~~~~g~~G~tv~iP~~~~~~~a~~v~eg~~i~~~~~~~~~~~ 80 (272) T protein:vir:98 1 MAVGTTKMAQMLDPEVLADMIDAEVGKAIRFAPLAEVDTTLEGQPGTTLTVPKWDYIGDAEDVAEGEAIPMTQLGFKKTT 80 (272) T ss_pred CCCccccchheechHHHHHHHHHHHHHHhhhhccccccccccCCCCCEEEEEEecCCCCcccccCCCcccccccccceEE Confidence 9976 4899999999999999999999999988888888999999999976543433445667888999999999 Q ss_pred EEEeeeeecceEEchHHHHhhhHH-HHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccCCCHHHHHHHHHHHHHHHhh Q lcl|NC_011288. 75 LLIDQEKSIDFLVDDIDRVQVAGS-LEAYTRAGATALATDTDKFIADLLVDNGTALSGSAPTDADDAFDLIATALKELTK 153 (273) Q Consensus 75 ~~id~~~~~~~~i~d~d~~~~~~~-~~~~~~~~~~ala~~~D~~i~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~~~l~~ 153 (273) +++++ .+..+.++|++..+...+ +..+.+++++++++++|.++++.+..+.... +....++.|.+|...|++ T Consensus 81 ~~~~~-~~~~~~itd~~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~a~~~~------~~~~t~d~i~da~~~l~~ 153 (272) T protein:vir:98 81 MTIKK-AGKGVEITDEAILSGYGDPVGQAAKQIVEAIDHKVDADVLDALSKSTQTV------EATATVDGVSKALDIFND 153 (272) T ss_pred EEeee-eeeeeeecHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHhccccccc------ccccCHHHHHHHHHHHhc Confidence 99976 567899999998887777 4678999999999999999998876554332 223457889999999988 Q ss_pred cCCCccCCEEEECHHHHHHHhhhHHH-HhhhhcccccceeeeeeeeeEeceEEEeeCccccCCCcEEEEEcCceeEEeee Q lcl|NC_011288. 154 ANVPNVGRVVVVNAEMAFWLRSSGSK-LTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYVSQ 232 (273) Q Consensus 154 ~~vP~~~r~lvv~p~~~~~L~~~~~~-~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~~~~~~~~~~~~a~~~a~~ 232 (273) .+. +.++++++|+++..|+++... +...... ..+.+++|.+|++.|++|++|+++|.+ .++++++++++++.+ T Consensus 154 ~~~--~~~~~vv~p~~~~~L~k~~~~~~~~~~~~-~~~~~~~g~ig~i~G~~Vi~s~~~p~~---t~~~~~~~a~~~~~~ 227 (272) T protein:vir:98 154 EDD--AETVIVMNPADASTLRLDAAKEWLGATEV-GANRVVSGVYGEVLGVQIVRSRKCPKG---TAYMVRKGALRIMLK 227 (272) T ss_pred cCC--CccEEEEcHHHHHHHHHhccccccccccc-cccccccccchhhcCeeEEEcCCCCcc---eEEEEcCCeEEEEec Confidence 864 578999999999999876421 1111222 234688999999999999999999853 467889999998875 Q ss_pred ee-eehhhcCCCceeeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 233 ID-TVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 233 ~~-~~e~~~~~~~~~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) .+ .+|..|++.++.+.++++++||+++++|+++|.++-..+ T Consensus 228 ~~~~ve~~r~~~~~~~~i~~~~~~~~~v~~~~~vv~~t~~~a 269 (272) T protein:vir:98 228 RNTMVETDRDITKAINQIVANKHYGVYLYKAEKAVKITLKDA 269 (272) T ss_pred CCceeeeccccccceeEEEEEEEEEEEEEcCCceEEEEeccc Confidence 54 789999999999999999999999999999999988888 No 44 >protein:vir:79008 Length: 299 # NCBI annotation: putative main capsid protein # Family: family:all:701 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110725;genbank:gi:134287342;genbank:GeneID:4955182 Probab=100.00 E-value=8.8e-40 Score=234.69 Aligned_cols=271 Identities=14% Similarity=0.169 Sum_probs=211.2 Q ss_pred CccchhhHHHHHHHHHHHHHHhhccchhhcccccccc--cCCceEEEeecCcccceeecCCCcccCCCCCccceEEEEEe Q lcl|NC_011288. 1 MAFNNFIPELWSDMLLEEWTAQTVFANLVNREYEGTA--SKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLLID 78 (273) Q Consensus 1 MA~~~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~--~~Gdtv~ip~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~id 78 (273) ||+.++ +|+|+++|++.|.+.+++..|.++.+..+. ..|++|+||+++..+..||+|.+......+++.++.+++|+ T Consensus 1 MA~~n~-a~~~~~~Ld~~~~~~l~~~~L~~~~~~~~v~~~gg~tVkI~~i~~~gl~DY~R~~~g~~~g~~~~~~~t~~ld 79 (299) T protein:vir:79 1 MAALNY-AKEYSNVLAQAYPYTLNFGDLYATPNNGRYRWTGSKTIEIPTISTTGRVDSNRDTIAVAQRNYDNAWEPKVLT 79 (299) T ss_pred Cccchh-HHHHHHHHHHHHHhhceeeeeccCcccceeeecCCCEEEEeccccccccccccCCCcccccccCcceeEEEee Confidence 997665 699999999999999999998887766554 45899999999999999999865434455789999999999 Q ss_pred eeeecceEEchHHHHhh--hHHHHHHH-HHHHHHHHHHHHHHHHHHHhhccccc---ccccCCCHHHHHHHHHHHHHHHh Q lcl|NC_011288. 79 QEKSIDFLVDDIDRVQV--AGSLEAYT-RAGATALATDTDKFIADLLVDNGTAL---SGSAPTDADDAFDLIATALKELT 152 (273) Q Consensus 79 ~~~~~~~~i~d~d~~~~--~~~~~~~~-~~~~~ala~~~D~~i~~~~~~~~~~~---~~~~~~t~~~~~~~i~~a~~~l~ 152 (273) |++++.|.||+.|..++ ......++ +.+...+++++|++.++.+.+.+... ...+..|+.++|+.|.++.+.|+ T Consensus 80 qdr~~~f~vD~~Dvdet~~~~~~a~v~~~~~~~~v~pEiDay~~skl~~~a~~~g~~~~~~~~T~~n~y~~i~~~~~~ld 159 (299) T protein:vir:79 80 NQRKWSTLVHPADINQTNYVASIGNITKVYNEEQKFPEMDAYCISKIYADWTALGNTADTTVLTTTNVLEVFDKLMEKMT 159 (299) T ss_pred ccccceeccchhhHHHHhhhhHHHHHHHHHHHHHhhhHhhHHHHHHHHHhhhhcCCcccccccCHHHHHHHHHHHHHHHH Confidence 99999999995554443 33345544 44556789999999999887655432 23345688999999999999999 Q ss_pred hcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEe--eCccccC----CC---------c Q lcl|NC_011288. 153 KANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVE--SNNLRDT----DD---------E 217 (273) Q Consensus 153 ~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~--s~~l~~~----~~---------~ 217 (273) +++||.++|+++|+|+++..|++++.|.+..+. .+....++|.||+++||+|++ |+.++.. .+ - T Consensus 160 e~~vP~~~rvl~vtp~~~~~L~~~~~f~k~~~~-~~~~~~~~g~Vg~idG~~Ii~Vps~r~~t~~~~~~G~~~~~~ak~i 238 (299) T protein:vir:79 160 EARVPENGRILYVTPVVNTLIKNAKEIQRTVNI-KDAGTSLNRQTTDIDTVKIIKVPSNLMKTAYDFTTGWKVGAGAKQI 238 (299) T ss_pred hcCCCCCCeEEEeCHHHHHHHhhchhhhccccc-ccccceeeeeeeeecceEEEEechhhcCccceeccCccccCccccc Confidence 999999999999999999999999876655443 344567899999999999997 4444421 11 2 Q ss_pred EEEEEcCceeEEeeeeeeehhhcCCCc-eee-eEEeeeeeeeEEecCc--eE-EEEecCCC Q lcl|NC_011288. 218 QFVAFHPSAAAYVSQIDTVEALRDQDS-FSD-RIRALHVYGGKVVRPT--GV-VVFNKTGS 273 (273) Q Consensus 218 ~~~~~~~~a~~~a~~~~~~e~~~~~~~-~~~-~v~~~~~~g~~v~~~~--~~-v~~~~~~s 273 (273) .+++.|++|+....+.+.++...|.-. .++ ++.++.++++.|++.. +| +..++.++ T Consensus 239 n~ii~~~~a~~~~~K~~~~~~~~P~~~~~~~~~~~~r~y~d~~v~~nk~~~i~~~~~~a~~ 299 (299) T protein:vir:79 239 FMSLVHPSAIITPVSYQFSKLDEPTAVTEGKYFYFEESFEDVFILNKKADAIQFVVEGAGA 299 (299) T ss_pred ceEEEcCCeeeeeEeeeeEEeecCCCCCccceeeeeeeeeeeeeeccccCeEEEEeeecCC Confidence 368889999998888888877665333 233 6778999999999866 44 45666666 No 45 >protein:vir:7019 Length: 401 # NCBI annotation: major capsid protein # Family: family:all:2806 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853592;genbank:gi:31711674;genbank:GeneID:1481800 Probab=100.00 E-value=1.1e-39 Score=234.18 Aligned_cols=267 Identities=16% Similarity=0.141 Sum_probs=216.2 Q ss_pred Cccch---------------hhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeecCCCcccCC Q lcl|NC_011288. 1 MAFNN---------------FIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSA 65 (273) Q Consensus 1 MA~~~---------------~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~~~~ 65 (273) |++-+ +.=|+|..++++.|.+..+|.+++... +++.|+|++||+.|..++..++++. .+.. T Consensus 1 Ms~~n~~t~~~~~~sg~~~al~Le~f~GeV~taF~~~si~~~~~~vR---ti~~gkS~qf~~~G~s~~~~~~pG~-~ld~ 76 (401) T protein:vir:70 1 MSTPNNLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENIMSYFDVQ---TVTGTNTVSNKYLGETELQVLAPGQ-SPAA 76 (401) T ss_pred CCCCccccccccccccchhHhHHhHhcchHHHHHHHHhhhcccceee---eecccceEEEEEeeeeEeeeecCCC-CcCC Confidence 65542 334889999999999999999888754 6788999999999999999988755 4667 Q ss_pred CCCccceEEEEEeeeeecceEEchHHHHhhhHH-HH-HHHHHHHHHHHHHHHHHHHHHHhhcccc--------------- Q lcl|NC_011288. 66 DAISDTGVDLLIDQEKSIDFLVDDIDRVQVAGS-LE-AYTRAGATALATDTDKFIADLLVDNGTA--------------- 128 (273) Q Consensus 66 ~~~~~~~~~~~id~~~~~~~~i~d~d~~~~~~~-~~-~~~~~~~~ala~~~D~~i~~~~~~~~~~--------------- 128 (273) +.+..+++.++||...+..+.|+|+|+.+.+++ ++ ++.++++++||+.+|+.++.++..++.. T Consensus 77 ~~~~~dK~~ItID~lL~a~~~V~dlDe~q~~yD~vRse~s~e~G~ALA~~~Dq~iiq~i~~aa~ana~~~~~~p~~~~~G 156 (401) T protein:vir:70 77 TSTQADKNQLVIDATVIARNTVAHLHDVQGDIDSLKPKLATNQAKQLKRMEDEMLIQQMMLGGIANTQAKRTNPRVKGHG 156 (401) T ss_pred CCcccccEEEEeCceeehhhhhhhHHHHHhcccccchHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccCCCcCCCc Confidence 888999999999999999999999999999998 75 6899999999999999998887543210 Q ss_pred ----cc---cccCCCHHHHHHHHHHHHHHHhhcCCCccCCEEEE-CHHHHHHHhhhHHHHhhhhcc-cccceeeeeeeee Q lcl|NC_011288. 129 ----LS---GSAPTDADDAFDLIATALKELTKANVPNVGRVVVV-NAEMAFWLRSSGSKLTSADTS-GDAAGLRAGTIGN 199 (273) Q Consensus 129 ----~~---~~~~~t~~~~~~~i~~a~~~l~~~~vP~~~r~lvv-~p~~~~~L~~~~~~~~~~~~~-~~~~~l~~G~ig~ 199 (273) .. ..+.+++..+.++|.+|+..|++++||.+ |++++ +|.+|..|+..++ +.+.++. .+...+.+|.|.+ T Consensus 157 ~~i~v~~~~~~~~~~~~~l~~ai~dA~~~LdEkdVP~~-r~vvl~pp~~Ys~Ll~~d~-L~nrd~~~s~~g~~~~G~v~~ 234 (401) T protein:vir:70 157 FSINVEVAEGEALVNPQYVMAAVEFALEQQLEQEVDIS-DVAILMPWRYFNVLRDADR-IVDKTYTISQSGATIQGFTLS 234 (401) T ss_pred eEEeccccccccccCHHHHHHHHHHHHHHHHhcCCCcc-ceEEEcCHHHHHHHHhcCc-ccchhhccccCCccccceEEE Confidence 00 01234556788999999999999999965 66555 6667777777665 4455544 3345688999999 Q ss_pred EeceEEEeeCccccCC-------------C-----------cEEEEEcCceeEEeeeee-eehhhcCCCceeeeEEeeee Q lcl|NC_011288. 200 LLGARIVESNNLRDTD-------------D-----------EQFVAFHPSAAAYVSQID-TVEALRDQDSFSDRIRALHV 254 (273) Q Consensus 200 ~~G~~v~~s~~l~~~~-------------~-----------~~~~~~~~~a~~~a~~~~-~~e~~~~~~~~~~~v~~~~~ 254 (273) ++||+|++||++|..+ + ..++++|++|++.++.++ ..|..++.++|++++.+.+. T Consensus 235 vaGv~Vv~SnnlP~~a~~it~~~ls~a~~G~~y~~~~d~s~~~~v~f~~~Av~tvk~~~lt~~~~~d~r~~~~~id~~~a 314 (401) T protein:vir:70 235 SYNCPVIPSNRFPKYSQGQTHHLLSNEDNGYRYDPLPAMNGAIAVLFTADALLVGRSIDVTGDIFYEKKEKTYYIDTFMA 314 (401) T ss_pred EeceEEEeeccccccccccccccccccCCCccCCCCccccceeEEEEehhheEEEEeeccccchhhhhhhhHHHHHHHHH Confidence 9999999999998632 0 124788999999987665 56889999999999999999 Q ss_pred eeeEEecCceEEEEecCCC Q lcl|NC_011288. 255 YGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 255 ~g~~v~~~~~~v~~~~~~s 273 (273) ||++++|||++++++...+ T Consensus 315 ~g~g~~RPeaa~vv~~k~~ 333 (401) T protein:vir:70 315 EGAIPDRWEAVSVVTTKRN 333 (401) T ss_pred hCCcccchhheEEEeecCc Confidence 9999999999999866665 No 46 >protein:vir:102655 Length: 322 # NCBI annotation: Hypothetical protein # Family: family:all:6384 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_052979;genbank:gi:50282923;genbank:GeneID:2948122 Probab=100.00 E-value=6.5e-39 Score=229.94 Aligned_cols=268 Identities=17% Similarity=0.133 Sum_probs=200.4 Q ss_pred Cccc---hhhHHHHHHHHHHHHH-HhhccchhhcccccccccCCceEEEeecCcccc------eeecCCCcccCC-CCCc Q lcl|NC_011288. 1 MAFN---NFIPELWSDMLLEEWT-AQTVFANLVNREYEGTASKGNVVHIAGVVAPTV------KDYKAAGRQTSA-DAIS 69 (273) Q Consensus 1 MA~~---~~~pev~~~~~~~~~~-~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~------~~~~~~~~~~~~-~~~~ 69 (273) ||.+ .|+ +.|++++...++ +...|.+.|.. ......+++++.+....... ....+.+.+.++ .+.+ T Consensus 13 Ms~~i~~~fv-~qy~~~v~~~~qq~~s~L~~tV~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~dtp~~~~~ 89 (322) T protein:vir:10 13 IAGDIDQAFV-QTYETTLRILSQQKSAKLKQYCQH--KNESSESHNWETLASMDPDAVKRKRSRQQSADGTYPTPVNNKP 89 (322) T ss_pred eechhhhHHH-HHHHHHHHHHHHHhhhhhhccccc--ccccccccceeecccccccccccccccccccCcccCCCccccc Confidence 7776 244 668888888775 44566666542 22455677777777543322 222222222222 3445 Q ss_pred cceEEEEEeeeeecceEEchHHHHhhhHHH-HHHHHHHHHHHHHHHHHHHHHHHhhccccccc-----------ccCCCH Q lcl|NC_011288. 70 DTGVDLLIDQEKSIDFLVDDIDRVQVAGSL-EAYTRAGATALATDTDKFIADLLVDNGTALSG-----------SAPTDA 137 (273) Q Consensus 70 ~~~~~~~id~~~~~~~~i~d~d~~~~~~~~-~~~~~~~~~ala~~~D~~i~~~~~~~~~~~~~-----------~~~~t~ 137 (273) .+...+.+ +++++++.|+|.|+.++.+|. ..++++++.+|+++.|+.|++.+...+..... ....+. T Consensus 90 ~~~r~~~~-~d~~~~~~VDd~D~~k~~~D~~~~~~~~~a~AL~R~~D~~I~~a~~g~a~~~~~gt~v~~~ss~~i~~g~~ 168 (322) T protein:vir:10 90 FAKRRTNV-DTYDTGHVVEQEDISQMLLDPNSALITSQAYAMARKTDDLIIAGAWKPASIKGTGQPVEFLATQEIGDGTK 168 (322) T ss_pred cceEEEee-cccccceecchHHHHHhhcCchHHHHHHHHHHhhhHHHHHHHhhhhccccccccccccccCCCcccccCcc Confidence 56666666 556889999999999999985 66999999999999999998766543321111 011233 Q ss_pred HHHHHHHHHHHHHHhhcCCCcc-CCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCccccCCC Q lcl|NC_011288. 138 DDAFDLIATALKELTKANVPNV-GRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDD 216 (273) Q Consensus 138 ~~~~~~i~~a~~~l~~~~vP~~-~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~~~ 216 (273) ..+++.|.+|++.|++++||++ +||+|++|++|..||+++++ .+.++.+.....++|.||+|+||+|++|++||..+. T Consensus 169 g~t~~kl~~a~~~l~~~dvp~d~~R~~vv~p~~~~~LL~d~~~-ts~D~~~~~~l~~~G~ig~~lGf~~i~s~~lp~~~~ 247 (322) T protein:vir:10 169 PISFDYVTEITERFLENEIEPEVSKVIVIGPTQARKLLQITEA-TSADYTSAMDLQSKGIITNWMGYTWIVSTRLDKFDP 247 (322) T ss_pred chhHHHHHHHHHHHHhcCCCCCCCeEEEeCHHHHHHHhcchhh-hhhhcccchhhhhcCeeeeeeeEEEEEeccCCcccc Confidence 5678899999999999999976 49999999999999998865 578888766666789999999999999999985322 Q ss_pred ---------------cEEEEEcCceeEEeeeee-eeh-hhcCCCceeeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 217 ---------------EQFVAFHPSAAAYVSQID-TVE-ALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 217 ---------------~~~~~~~~~a~~~a~~~~-~~e-~~~~~~~~~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) ..|+++|++|++++++.+ +++ .+++...+++.|++.+.||+++++|++||+|+..-| T Consensus 248 t~~~~~~~~~~~~~~~~~~a~~k~Av~~a~~~dv~~~i~~~~~~~~a~~I~~~~~~Ga~ri~~~gVv~i~~~e~ 321 (322) T protein:vir:10 248 TQWGMAAEDGPQGDEIWCIAMTDMALGYHSCKDIWTKVAEDPSASFAWRIYSAFTADCVRVEDEHIFKLRLKNS 321 (322) T ss_pred ccccccccCCCCccceeEEEEecCceeEEEeeeeeEEeeccCCcchhhhhhhhhhhCceEeccCcEEEEEEecc Confidence 358999999999998754 455 666777889999999999999999999999999999 No 47 >protein:vir:95107 Length: 270 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240822;genbank:gi:66394683;genbank:GeneID:5133901 Probab=100.00 E-value=9e-39 Score=229.16 Aligned_cols=259 Identities=14% Similarity=0.081 Sum_probs=214.3 Q ss_pred Cccchh----hHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeecCCCcccCCCCCccceEEEE Q lcl|NC_011288. 1 MAFNNF----IPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLL 76 (273) Q Consensus 1 MA~~~~----~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 76 (273) ||.+.+ +||+|++.+.+++.+.++|.+++..+.++.+.+|++|+||.|...+..+....+..+++++++.++...+ T Consensus 1 Ma~T~~~d~I~Pev~~~~V~e~~~~~~~~~~~~~~d~~L~g~~G~ti~~P~~~~igdae~~~eg~~i~~~~lt~~~~~a~ 80 (270) T protein:vir:95 1 MTQTKKANLINPEVLANVVSAQMQNAIRFTPYAVTDDTLVGQPGDTITRPKYAYIGAAEDLQEGVAMDTTQMSMTTTKVT 80 (270) T ss_pred CCceehhhhcchHHHHHHHHHHHHhHHhhccccccccccCCCCCCEEEeeeecCCCccccccCCCccchhhcccchheee Confidence 999854 8999999999999999999999999998888999999999998776555455677888999999999999 Q ss_pred EeeeeecceEEchHHHHhhhHH-HHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccCCCHHHHHHHHHHHHHHHhhcC Q lcl|NC_011288. 77 IDQEKSIDFLVDDIDRVQVAGS-LEAYTRAGATALATDTDKFIADLLVDNGTALSGSAPTDADDAFDLIATALKELTKAN 155 (273) Q Consensus 77 id~~~~~~~~i~d~d~~~~~~~-~~~~~~~~~~ala~~~D~~i~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~~~l~~~~ 155 (273) |. +++.++.++|++.....++ +.++.+|++..+++++|+++++.+..+..... ....++.|.+|...|++.. T Consensus 81 i~-~~gk~~~itD~a~~~~~~dp~~~~~~q~a~~~a~~~d~~li~~l~~a~~~~~------~~~t~~~~~dA~~~lgd~~ 153 (270) T protein:vir:95 81 VK-ETGKAVEVTQTAIITNVNGTLQEASRQLAMSLADKVEIDYIAELNKSKQTAT------VSADATGILDAIEVFNSEN 153 (270) T ss_pred ee-hhhCcceecHHHHhhhccchHHHHHHHHHHHHHHHHHHHHHHHhcccccccc------cccCHHHHHHHHHHhcccc Confidence 94 5689999999988777655 78999999999999999999998876544332 2234578899999997764 Q ss_pred CCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCccccCCCcEEEEEcCceeEEeeeee- Q lcl|NC_011288. 156 VPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYVSQID- 234 (273) Q Consensus 156 vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~~~~~~~~~~~~a~~~a~~~~- 234 (273) ....+++|+|..++.|+++. ++. . .....+.+++|.||.+.|++|+.+++.+. ...++.++++|+++..+.+ T Consensus 154 --~~~~~i~vhs~~~~~Lrk~~-~~~-~-~~~~~~~~~~G~ig~~~G~~Viv~s~~~~--~~~~~l~~~gAi~~~~~~~~ 226 (270) T protein:vir:95 154 --DEDYVLYVNPKDYNKLVKSL-FKV-G-GNVQDRAISKGDLVEIVGVSDIVKSKRVS--ENTAFLQRYGAMEIVNKKKP 226 (270) T ss_pred --CCCcEEEEcHHHHHHHHhhh-ccc-c-cccccchhcccccceecceeEEEeCCCCC--ceeEEEEeccceeeeecCCc Confidence 34578999999999998764 332 2 22344578999999999999887666543 3467788999999888665 Q ss_pred eehhhcCCCceeeeEEeeeeeeeEEecCceEEEEe--cCCC Q lcl|NC_011288. 235 TVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFN--KTGS 273 (273) Q Consensus 235 ~~e~~~~~~~~~~~v~~~~~~g~~v~~~~~~v~~~--~~~s 273 (273) .+|..|+++++.|.+++++|||+++++|+++|+++ +++| T Consensus 227 ~vEtdRd~~~~~d~i~~~~~y~v~~~~~skvv~~t~~~a~~ 267 (270) T protein:vir:95 227 EAYTDFDILKRTHLLSTNYHYSVNLKDETGVVKVTFKPSGS 267 (270) T ss_pred eeeeccchhhcccEEEeeeEEEEEEEccceEEEEEecCCCC Confidence 89999999999999999999999999999999855 5555 No 48 >protein:vir:105645 Length: 400 # NCBI annotation: putative major capsid protein # Family: family:all:2806 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425009;genbank:gi:83571757;uniprot:Q2WC43;genbank:GeneID:3837286 Probab=100.00 E-value=8.1e-39 Score=229.42 Aligned_cols=268 Identities=15% Similarity=0.126 Sum_probs=216.6 Q ss_pred Cccch---------------hhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeecCCCcccCC Q lcl|NC_011288. 1 MAFNN---------------FIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSA 65 (273) Q Consensus 1 MA~~~---------------~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~~~~ 65 (273) |++-+ +.=|+|..++++.|.+..+|.+++... +++.|+|++||+.|..++..++++. .+.. T Consensus 1 Ms~~n~~t~p~~~gsg~~~aL~Le~f~GeV~taF~~~si~~~~~~vR---tI~~gkS~qf~~lG~s~a~y~~pG~-~ldg 76 (400) T protein:vir:10 1 MSTPNNLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENIMSYFDVQ---TVTGTNTVSNKYLGETELQVLAPGQ-SPAA 76 (400) T ss_pred CCCCccccccccccccchhhhHHhHhcchHHHHHHHHhhhcccceee---eecccceEEEEEeeeeEEeeecCCC-CcCC Confidence 65542 334899999999999999999888754 6788999999999999999888754 5677 Q ss_pred CCCccceEEEEEeeeeecceEEchHHHHhhhHH-HH-HHHHHHHHHHHHHHHHHHHHHHhhcccc--------------- Q lcl|NC_011288. 66 DAISDTGVDLLIDQEKSIDFLVDDIDRVQVAGS-LE-AYTRAGATALATDTDKFIADLLVDNGTA--------------- 128 (273) Q Consensus 66 ~~~~~~~~~~~id~~~~~~~~i~d~d~~~~~~~-~~-~~~~~~~~ala~~~D~~i~~~~~~~~~~--------------- 128 (273) +++..++..++||...+....|+|+|+.+.++| ++ ++.++++++||+.+|+.++..+..++.. T Consensus 77 ~~~~~dk~~ItIDtLL~a~~~V~dlDd~q~~yD~vRse~s~e~G~ALA~~~Dq~iiq~i~~a~~a~t~~~~~~~~g~~~g 156 (400) T protein:vir:10 77 TSTQADKNQLVIDATVIARNTVAHLHDVQGDIDSLKPKLATNQAKQLKKMEDEMLIQQMLLGGIANTQAKRTNPRVKGHG 156 (400) T ss_pred CCcccCcEEEEeCceeeecchhhhHHHHhhccccccHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCCccccc Confidence 888999999999999999999999999999998 65 5899999999999999998766443210 Q ss_pred -------cccccCCCHHHHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcc-cccceeeeeeeeeE Q lcl|NC_011288. 129 -------LSGSAPTDADDAFDLIATALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTS-GDAAGLRAGTIGNL 200 (273) Q Consensus 129 -------~~~~~~~t~~~~~~~i~~a~~~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~-~~~~~l~~G~ig~~ 200 (273) ....+.+++..+..+|.+|...|++++||.++++++++|.+|..|+..++.+ +.++. .++..+..|.|.++ T Consensus 157 ~s~~v~~~~~~~~~~~~~l~~A~~~A~~~LdEkdVP~~d~vvl~pp~~Ys~Ll~~dkLv-nrdf~~s~~g~~~~g~v~~v 235 (400) T protein:vir:10 157 FSVNVEVNEGEALVNPQYVMAAVEFALEQQLEQEVDISDVAILMPWRYFNVLRDADRIV-DKSYTISQSGATIQGFVLSS 235 (400) T ss_pred cceeecccccccccCHHHHHHHHHHHHHHHHhcCCCccceEEEcCHHHHHHHHhCCccc-chhccccCCCccccceEEEE Confidence 0011123555677889999999999999976555667777777887766544 55543 22356788999999 Q ss_pred eceEEEeeCccccCC------------------------CcEEEEEcCceeEEeeeee-eehhhcCCCceeeeEEeeeee Q lcl|NC_011288. 201 LGARIVESNNLRDTD------------------------DEQFVAFHPSAAAYVSQID-TVEALRDQDSFSDRIRALHVY 255 (273) Q Consensus 201 ~G~~v~~s~~l~~~~------------------------~~~~~~~~~~a~~~a~~~~-~~e~~~~~~~~~~~v~~~~~~ 255 (273) +||.|++||++|... ...++++|++|++.++.++ ..|..|++++|++++.+.+.| T Consensus 236 ~Gv~Iv~Sn~lP~~a~~~~~~~lS~a~~G~~y~~t~d~s~~~av~F~~sAv~tvk~~~lt~~~~~d~r~~~~~id~~~a~ 315 (400) T protein:vir:10 236 YNCPVIPSNRFPKYSQGQKHHLLSNEDNGYRYDPIAEMNGAIAVLFTADALLVGRSIDVIGDIFYEKKEKTYYIDTFMSE 315 (400) T ss_pred eceEEEeeCcCCcccCcccccccccCCCCccCCccccccceeEEEEehhheEEEEeeccccccccchhhHHHHHHHHHHh Confidence 999999999998532 0124788999999987665 568899999999999999999 Q ss_pred eeEEecCceEEEEecCCC Q lcl|NC_011288. 256 GGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 256 g~~v~~~~~~v~~~~~~s 273 (273) |++++|||++++++...+ T Consensus 316 G~g~~RPeaa~vv~~~~~ 333 (400) T protein:vir:10 316 GAIPDRWEAVSVVTTKRQ 333 (400) T ss_pred CCcccchhheEEEEecCC Confidence 999999999999988776 No 49 >protein:vir:78920 Length: 290 # NCBI annotation: Cps # Family: family:all:701 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468846;genbank:gi:157325479;genbank:GeneID:5601917 Probab=100.00 E-value=2.7e-37 Score=221.03 Aligned_cols=266 Identities=15% Similarity=0.113 Sum_probs=211.7 Q ss_pred CccchhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeecCCCcccCCCCCccceEEEEEeee Q lcl|NC_011288. 1 MAFNNFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLLIDQE 80 (273) Q Consensus 1 MA~~~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~id~~ 80 (273) ||++. .++|++.|++.|...+++..+.+.++++ ..|++|+||+++..+..||+|.++. ...+++.++.+++|+|+ T Consensus 1 Main~--a~~~~~~Ld~~~~~~~~t~~l~~~~~~~--~ggktVkI~~i~~~gl~DY~R~~g~-~~g~v~~~~et~tl~qd 75 (290) T protein:vir:78 1 MAINY--VDKYGKELDQKLVFGTYTNELETPNLLW--LDAKTFKIQTITTTGLKAHTRNKGY-NEGSASNTNKSYTIDFD 75 (290) T ss_pred CchhH--HHHHHHHHHHHHHhhheeeeccccceee--ccCCEEEEeeeccCcccccccCCCc-ccCccccceeeEEeecc Confidence 99984 4899999999999999999998888765 4599999999999999999997754 45678999999999999 Q ss_pred eecceEEc--hHHHHhhhHHHHHH-HHHHHHHHHHHHHHHHHHHHhhccccc--ccccCCCHHHHHHHHHHHHHHHhhcC Q lcl|NC_011288. 81 KSIDFLVD--DIDRVQVAGSLEAY-TRAGATALATDTDKFIADLLVDNGTAL--SGSAPTDADDAFDLIATALKELTKAN 155 (273) Q Consensus 81 ~~~~~~i~--d~d~~~~~~~~~~~-~~~~~~ala~~~D~~i~~~~~~~~~~~--~~~~~~t~~~~~~~i~~a~~~l~~~~ 155 (273) +++.|.|| |+|+++....+.++ .+.+.+.+++++|.+.++.+...+... ......|++++|+.|.++...|++ T Consensus 76 R~~~F~vD~~DvDEt~~~~~~~nv~~ef~~~~v~PEiDayr~skla~~a~~~~~~~~~t~t~~n~~~~i~~~~~~lde-- 153 (290) T protein:vir:78 76 RDVEFFVDVMDVDETGQALSAANVTKEFNSRHAGPEMDAYRFSKLATAAKTNSNSVAEEITKDNVFTKLKAAIRKVKK-- 153 (290) T ss_pred ccceeeccccchhHHhhhhhHHHHHHHHHHHHhhhhhhHHHHHHHHhhhhccCcccccccCHHHHHHHHHHHHHHHHh-- Confidence 99999999 88888777776664 456667899999999999887655332 223456889999999999999986 Q ss_pred CCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCc---c-----------ccCCC--cEE Q lcl|NC_011288. 156 VPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNN---L-----------RDTDD--EQF 219 (273) Q Consensus 156 vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~---l-----------~~~~~--~~~ 219 (273) ||.++|+|+++|+++..|++++.|.+..+....+....+|.|++++||+|++.+. + +.+.+ -.+ T Consensus 154 vp~~~rvl~vtp~~~~lL~~~~~f~r~~~~~~~~~~~i~~~V~~idG~~ii~vps~~r~~t~~~f~~G~~~~~~ak~in~ 233 (290) T protein:vir:78 154 YGTQNLVMYVSPDVMAALELSDDFVRAINVQNIGPSSIETRITAIDGTRIVEVEAEDRFYDTFDFTDGYKPAAGAKKLNF 233 (290) T ss_pred cCCCCeEEEECHHHHHHHhhChhhhccccccccccccccceeeeecCcEEEEecccchhhhhhhhcccccccCCccceeE Confidence 8999999999999999999998877665554444455699999999999998552 1 11111 246 Q ss_pred EEEcCceeEEeeeeeeehhhcCCC---ceeeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 220 VAFHPSAAAYVSQIDTVEALRDQD---SFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 220 ~~~~~~a~~~a~~~~~~e~~~~~~---~~~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) ++.|++|.....+.+.+....|.- ..+..+.+|.++++.|++...-.++....- T Consensus 234 ii~~~~a~i~~~K~~~~~~~~P~~~~~~d~~~~~~r~y~d~~v~~nk~~~i~~~~~~ 290 (290) T protein:vir:78 234 LLVNKGSVVGGAKHASIYLHAPGSVGQGDGWLYQYRVYHDIFVLDQQKDGVIASTEV 290 (290) T ss_pred EEEcCCceeeeeeeeEEEeeCCCCCcCcceeeeeeeeeeeeeeeccccCeeEEEeeC Confidence 788999998888888776665533 346799999999999998774443322222 No 50 >protein:vir:739 Length: 231 # NCBI annotation: major structural protein 4 # Family: family:all:522 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108716;genbank:gi:13487838;genbank:GeneID:920884 Probab=100.00 E-value=9.6e-36 Score=212.56 Aligned_cols=228 Identities=21% Similarity=0.199 Sum_probs=189.9 Q ss_pred cccccCCceEEEeecCcccceeecCCCcccCCCCCccceEEEEEeeeeecceEEchHHHHhhhHH-HHHHHHHHHHHHHH Q lcl|NC_011288. 34 EGTASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLLIDQEKSIDFLVDDIDRVQVAGS-LEAYTRAGATALAT 112 (273) Q Consensus 34 ~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~id~~~~~~~~i~d~d~~~~~~~-~~~~~~~~~~ala~ 112 (273) +--...||||+||++ ++..+....+..++++.++.++.+++|.+ .+.+|.|+|++.....++ +.+..+|++.+||+ T Consensus 1 ~~~~~~Gdtit~P~~--iGda~~v~eG~~i~~~~l~~t~~~atIk~-~gk~~~itD~a~l~~~gDp~~ea~~Q~~~~iA~ 77 (231) T protein:vir:73 1 ENGINLANLCEYPND--IGDAADVAEGGEISLDKIGTTTKSVTIKK-AAKGTEITDEAALSGYGDPIGESNKQLGLSLAN 77 (231) T ss_pred CccccCCceEEeccc--ccchhhhcCCCcCChhhccccceeeeEee-eccceeeeHHHHhhccCchHHHHHHHHHHHHHH Confidence 323467999999987 44444445677888999999999999955 699999999999888777 68899999999999 Q ss_pred HHHHHHHHHHhhcccccccccCCCHHHHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhccccccee Q lcl|NC_011288. 113 DTDKFIADLLVDNGTALSGSAPTDADDAFDLIATALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGL 192 (273) Q Consensus 113 ~~D~~i~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~~~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l 192 (273) ++|.++++.+..+..... ...+++.|.+|...|++.+ ...++++|+|..+..|+++..++... ...+.+.+ T Consensus 78 kvD~di~~~~~~a~l~~~------~~~t~d~i~~A~~~fgde~--~~~~vivv~p~~~~~Lrk~~~~~~~~-~~~g~~i~ 148 (231) T protein:vir:73 78 KVDDDLLKAAKTTSQTVS------TKANVDGVQAALDIFNDED--AQAYVLIVNPKDAAKIRKDANAKNIG-SEVGANAL 148 (231) T ss_pred hhhHHHHHhhcccccccc------ccccHHHHHHHHHHhcccc--ccceEEEEcchHHHhhhhccchhhhh-hhhcccee Confidence 999999988876554332 2345889999999999887 35789999999999999987654433 33456789 Q ss_pred eeeeeeeEeceEEEeeCccccCCCcEE-EEEcCceeEEeeeee-eehhhcCCCceeeeEEeeeeeeeEEecCceEEEEec Q lcl|NC_011288. 193 RAGTIGNLLGARIVESNNLRDTDDEQF-VAFHPSAAAYVSQID-TVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNK 270 (273) Q Consensus 193 ~~G~ig~~~G~~v~~s~~l~~~~~~~~-~~~~~~a~~~a~~~~-~~e~~~~~~~~~~~v~~~~~~g~~v~~~~~~v~~~~ 270 (273) ++|.||+++|++|+.|+++|.+++... ++..++|+++..|.+ .+|..|++++++|.+++++||++++++|+++|+++- T Consensus 149 ~~G~iG~i~G~~Vi~S~~~~~~~~~~~~~i~~~gAl~~~~k~~~~vEtdRd~~~k~~~i~~~~~y~v~l~~~~~vv~~t~ 228 (231) T protein:vir:73 149 INGTYADVLGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRDIVTKTTVITADEHYAAYLYDLTKVVNITF 228 (231) T ss_pred eecccceEcceEEEEcCCCCCCceeeeeEEeeccceeeeecccceeeccccccccccEEEEeEEEEEEEEcCccEEEEEe Confidence 999999999999999999997665432 566799999888765 899999999999999999999999999999999988 Q ss_pred CCC Q lcl|NC_011288. 271 TGS 273 (273) Q Consensus 271 ~~s 273 (273) +|- T Consensus 229 ~g~ 231 (231) T protein:vir:73 229 TGV 231 (231) T ss_pred ecC Confidence 888 No 51 >protein:vir:102335 Length: 312 # NCBI annotation: putative capsid protein # Family: family:all:701 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529560;genbank:gi:90592716;genbank:GeneID:3974467 Probab=100.00 E-value=3e-34 Score=204.35 Aligned_cols=270 Identities=12% Similarity=0.093 Sum_probs=204.9 Q ss_pred CccchhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeecCCCccc-CCCCCccceEEEEEee Q lcl|NC_011288. 1 MAFNNFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQT-SADAISDTGVDLLIDQ 79 (273) Q Consensus 1 MA~~~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~id~ 79 (273) |||++-+.+.|++.|++.+...+++..+...+...+...|++|+||++...+..||+|.++.. +..+++.++.+++|++ T Consensus 1 Mantl~ya~~~~~~LD~~~~~~~~s~~l~~~~~~v~~~ggktVkIp~i~~~gl~DY~R~~g~~~~~g~v~~~~et~tl~q 80 (312) T protein:vir:10 1 MANTLAYGQVLQQGLDKQATQELLTGWMDSNAKQIKYEGGKEVKIGKLSTDGLGDYSRGSANAYVGGDVKFEYETKTMTQ 80 (312) T ss_pred CCcchhHHHHHHHHHHHHHHhhhccccccCCCceEEEecCcEEEEEeeecccccccccccCCccccccccccceeEEeee Confidence 999888899999999999999998887743322223466999999999999999999976522 3456999999999999 Q ss_pred eeecceEEc--hHHHHhhhHHHHHHHH-HHHHHHHHHHHHHHHHHHhhcccccc------cccCCCHHHHHHHHHHHHHH Q lcl|NC_011288. 80 EKSIDFLVD--DIDRVQVAGSLEAYTR-AGATALATDTDKFIADLLVDNGTALS------GSAPTDADDAFDLIATALKE 150 (273) Q Consensus 80 ~~~~~~~i~--d~d~~~~~~~~~~~~~-~~~~ala~~~D~~i~~~~~~~~~~~~------~~~~~t~~~~~~~i~~a~~~ 150 (273) ++++.|.|| |+|+++....+.+++. .+...+++++|++.++.+...+.... .....|++++|+.|..+.+. T Consensus 81 DR~~~F~vD~mDvDETn~~~s~anv~~ef~r~~vvPEiDayrfskla~~a~~~~~~~~~~~~~~~T~~ni~~~i~~~~~~ 160 (312) T protein:vir:10 81 DRGRKFTLDAMDVDETNFLVTATTVMGEFQRLKVIPEIDAYRLSRLATIAIGIKGDTNVEYSYSVNSSTIINKIKTGIKI 160 (312) T ss_pred cccceeeccccchhhHhhHHHHHHHHHHHHHhhhcchhhHHHHHHHHhhhhccccccccccccccCHHHHHHHHHHHHHH Confidence 999999999 8888877767777654 46677899999999999886543322 12346889999999999999 Q ss_pred HhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCc--ccc------C-------- Q lcl|NC_011288. 151 LTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNN--LRD------T-------- 214 (273) Q Consensus 151 l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~--l~~------~-------- 214 (273) |++++|| ++|+|+|+|+++..|.++. ..............+|+|++++|++|++.+. +.. + T Consensus 161 lde~~vp-~~rvl~vTp~~~~lLk~~~--~~~~~~~~~~~~~i~~~V~~iDgv~Ii~VPs~r~~t~~~f~dG~t~~~~~g 237 (312) T protein:vir:10 161 IRENGYN-GPLVCHLTYDSMFAIEEKV--LEKLTAVTFAQGGIQTQVPSIDGCALIKTPQNRMYSSILLNDGTTSNQTAG 237 (312) T ss_pred HHHccCC-CceEEEeChHHHHHHhhhh--hceecccccccceeeeeeeeecccEEEEchhhhccceeeeccCcccccccC Confidence 9999999 6999999999997777653 2222222333445689999999999998543 310 0 Q ss_pred --------CCcEEEEEcCceeEEeeeeeeehhhcC---CCceeeeEEeeeeeeeEEecCc--eE-EEEecCCC Q lcl|NC_011288. 215 --------DDEQFVAFHPSAAAYVSQIDTVEALRD---QDSFSDRIRALHVYGGKVVRPT--GV-VVFNKTGS 273 (273) Q Consensus 215 --------~~~~~~~~~~~a~~~a~~~~~~e~~~~---~~~~~~~v~~~~~~g~~v~~~~--~~-v~~~~~~s 273 (273) ..-.+++.|++|.....+.+.+....| +...+.++..|.++++.|++.. +| +-++.+.. T Consensus 238 g~~~~~~ak~INfiiv~~~a~i~~~K~~~~~if~P~~~~~~d~~~~~~R~Y~D~fv~~nk~~~Iyv~~k~a~~ 310 (312) T protein:vir:10 238 GYLKGTKALDTNFIIAPVDVPLAITKQDKMRIFDPETNQTANAWSMDYRRYHDLWVTDNKANSVYANFKDAKP 310 (312) T ss_pred ceeecCcccccceEEeCCceeeceeeeeeeeeeCCCCCCCcceeeeeeeeeeeeeeeccccCeEEEEeecccC Confidence 011257889999888887777765544 4445679999999999999876 44 23333222 No 52 >protein:vir:105464 Length: 346 # NCBI annotation: putative phage major capsid protein # Family: family:all:701 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529874;genbank:gi:90592614;genbank:GeneID:3974528 Probab=100.00 E-value=4.2e-34 Score=203.57 Aligned_cols=269 Identities=13% Similarity=0.116 Sum_probs=203.0 Q ss_pred CccchhhHHHHHHHHHHHHHHhhcc-chhhccccc--ccccCCceEEEeecC-cccceeecCCCcccCCCCCccceEEEE Q lcl|NC_011288. 1 MAFNNFIPELWSDMLLEEWTAQTVF-ANLVNREYE--GTASKGNVVHIAGVV-APTVKDYKAAGRQTSADAISDTGVDLL 76 (273) Q Consensus 1 MA~~~~~pev~~~~~~~~~~~~lv~-~~~v~~~~~--~~~~~Gdtv~ip~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~ 76 (273) ||.+. .+.|++.|+++|...++. ..+.+.... .++..|++|+||++. ..+..||+|.++.....+++.++.+++ T Consensus 1 Mainy--a~~~~~~Ld~~~~~~~lts~~l~~~~~~~~v~~~ggktVkIp~is~tsGl~DY~R~~g~~~~g~v~~~~et~t 78 (346) T protein:vir:10 1 MTINY--AEKYQAAVQQAFYDGHLYSAELWNSPSNSIIKFDGAKHIKVPRLEITSGRKDRQRRTITTPVANYSNDWDSYE 78 (346) T ss_pred Ccchh--HHHHHHHHHHHHHhhhccchhhcccccccceEecCCCEEEEEEeeeecccccccccCCcccccccccceeEEE Confidence 99974 578999999999888654 344333222 223569999999997 568999999887765678999999999 Q ss_pred EeeeeecceEEc--hHHHHhhhHHHHHHH-HHHHHHHHHHHHHHHHHHHhhccccc----ccccCCCHHHHHHHHHHHHH Q lcl|NC_011288. 77 IDQEKSIDFLVD--DIDRVQVAGSLEAYT-RAGATALATDTDKFIADLLVDNGTAL----SGSAPTDADDAFDLIATALK 149 (273) Q Consensus 77 id~~~~~~~~i~--d~d~~~~~~~~~~~~-~~~~~ala~~~D~~i~~~~~~~~~~~----~~~~~~t~~~~~~~i~~a~~ 149 (273) |+|++++.|.|| |+|++.....+..++ +.+....++++|.+.++.+.+.+... ..+...|++++|+.|..+.+ T Consensus 79 l~qDR~~~F~vD~mDvDETn~~~~~anv~~ef~r~~vvPEiDayrfskLa~~a~~~~~~~~~~~a~T~~ni~~~i~~~~~ 158 (346) T protein:vir:10 79 LKNERYWSTLVDPSDIDETNMVVSLANITKQFNLDSKMPEKDRYMFSHLYSGKEAAHDGGITTNTLDEKNILPAFDNMML 158 (346) T ss_pred eeccccceecccccchHHHHHHhHHHHHHHHHHHHhhcchhhHHHHHHHHHhhhhhccccccccccCHHHHHHHHHHHHH Confidence 999999999999 777776555566655 34556788999999999887544322 22345689999999999999 Q ss_pred HHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEe--eCcccc-----------CC- Q lcl|NC_011288. 150 ELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVE--SNNLRD-----------TD- 215 (273) Q Consensus 150 ~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~--s~~l~~-----------~~- 215 (273) .|++++||.++|+|+++|+++..|++++.|.++.+.. +... .+|.|++++||+|++ |+.++. ++ T Consensus 159 ~lde~~vp~~~rvl~vTp~~~~lLk~s~~f~k~~~v~-~~~~-i~~~V~siDGv~Ii~VPs~r~~t~~~f~~G~~~~t~a 236 (346) T protein:vir:10 159 DFDEARIPSTNRILYVTPKTNAILKRAEAMNRALTLK-DPNN-IQRTVYSLDDVTIRVVPSDLMQTAYDFSDGSKIIDTA 236 (346) T ss_pred HHHHccCCCCCeEEEECHHHHHHHhhchhheeccccc-cccc-cceeeeeecCeEEEEcchhhcccchhhccCccccCCc Confidence 9999999999999999999999999988776655543 3333 489999999999987 444431 11 Q ss_pred -CcEEEEEcCceeEEeeeeeeehhhcCCCc-ee-eeEEeeeeeeeEEecCceEEE--EecCCC Q lcl|NC_011288. 216 -DEQFVAFHPSAAAYVSQIDTVEALRDQDS-FS-DRIRALHVYGGKVVRPTGVVV--FNKTGS 273 (273) Q Consensus 216 -~~~~~~~~~~a~~~a~~~~~~e~~~~~~~-~~-~~v~~~~~~g~~v~~~~~~v~--~~~~~s 273 (273) .-.+++.|++|.....+.+.+....|... .+ ..+.+|.++++.|++...-.+ --.++- T Consensus 237 k~INfiiv~~~A~ia~~K~~~~~if~P~~~~~g~~l~~~R~Y~D~fv~~nk~~~Iyv~~~~a~ 299 (346) T protein:vir:10 237 KQIEMFLIYNGVQIAPEKYSFVGFDQPSAATSGNYLYYEQSYDDVLLLNTKTKGIQFVVSDKP 299 (346) T ss_pred cceeEEEECCceeeeeeeeeeeEeeCCCCCcccceeeeeeeeeeeeeeccccceEEEeeeccc Confidence 12357889999998888888877766443 34 389999999999998763332 211111 No 53 >protein:vir:79712 Length: 285 # NCBI annotation: major capsid protein gp34 # Family: family:all:701 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285883;genbank:gi:148750840;genbank:GeneID:5220414 Probab=99.96 E-value=2.1e-31 Score=188.80 Aligned_cols=267 Identities=19% Similarity=0.185 Sum_probs=201.8 Q ss_pred CccchhhHHHHHHHHHHHHHHhhccchhhccccc--ccccCCceEEEeecC-cccceeecCCCcccCCCCCccceEEEEE Q lcl|NC_011288. 1 MAFNNFIPELWSDMLLEEWTAQTVFANLVNREYE--GTASKGNVVHIAGVV-APTVKDYKAAGRQTSADAISDTGVDLLI 77 (273) Q Consensus 1 MA~~~~~pev~~~~~~~~~~~~lv~~~~v~~~~~--~~~~~Gdtv~ip~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~i 77 (273) ||++ +.+.|.+.|+++|...+++..+.+.... ..+..|++|+||++. ..+..+|+|+.+ ....+++.++.+++| T Consensus 1 Main--~~~k~~~~ld~~~~~~~~~~~l~~~~n~~~~~~~gak~VkIp~ist~~gl~dY~R~~g-~~~g~v~~~~et~tl 77 (285) T protein:vir:79 1 MTVV--LDSKDLARIDEEYKADSQVWSYLTGGNGVTQRFRGHNEVRINKLSGFVDATAYKRGQD-NARKTISVGKETVKL 77 (285) T ss_pred Ccch--hhHHHHHHHHHHHHHhhhhhhhcccCCcceeEecCCCEEEEeeecccccccccccccC-ccccccceeeeEEEe Confidence 9998 4688999999999999888777654332 334568999999996 468999999765 456789999999999 Q ss_pred eeeeecceEEc--hHHHHhhhHHHHHHHHH-HHHHHHHHHHHHHHHHHhhcccccccccCCCHHHHHHHHHHHHHHHhhc Q lcl|NC_011288. 78 DQEKSIDFLVD--DIDRVQVAGSLEAYTRA-GATALATDTDKFIADLLVDNGTALSGSAPTDADDAFDLIATALKELTKA 154 (273) Q Consensus 78 d~~~~~~~~i~--d~d~~~~~~~~~~~~~~-~~~ala~~~D~~i~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~~~l~~~ 154 (273) ++++++.|.|| |+|+. ....+..++++ +...+++++|++.++.+.+.+.... +.+.|++++|+.|..+.+.|+++ T Consensus 78 ~~DR~~~f~iD~mDvdEn-~~~~~~ni~~ef~~~~vvPEiDayrfskla~~a~~~~-~~~~T~~nv~~~i~~~~~~lde~ 155 (285) T protein:vir:79 78 THEDWFGYDLDQFDMDEN-GAYTVENVVREHNKMITIPHRDKVAVQKLFDSAAKKA-TDSITKDNALDAYDTAEAYMFDN 155 (285) T ss_pred eccccceecccccchhhh-hhhhHHHHHHHHHhhhhcchhhHHHHHHHHhhccccc-ccccCHHHHHHHHHHHHHHHHHc Confidence 99999999999 55552 23335666655 4456789999999999987765543 45678999999999999999999 Q ss_pred CCCccCCEEEECHHHHHHHhhhHHHHhhhhccccc-ceeeeeeeeeEec-eEEEeeC--ccccCC---CcEEEEEcCcee Q lcl|NC_011288. 155 NVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDA-AGLRAGTIGNLLG-ARIVESN--NLRDTD---DEQFVAFHPSAA 227 (273) Q Consensus 155 ~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~-~~l~~G~ig~~~G-~~v~~s~--~l~~~~---~~~~~~~~~~a~ 227 (273) +|| ++|+|+++|+++..|++++.+.+..+..+.. ..=.++.|++++| ++|++.+ .+...+ .-.+++.|++|. T Consensus 156 ~vp-~~rvl~vTp~~~~~Lk~s~~~~r~~~~~~~~~~~~i~~~V~~lDg~v~ii~Vps~r~kt~~~~k~Infiiv~~~a~ 234 (285) T protein:vir:79 156 EVP-GGFVMFVSSAYYTALKQSAAVTRTFSTDGTMVINGIDRRVAQLDGGVPIVRVSSDRLKGLGITNHVNFILTPLSAI 234 (285) T ss_pred CCC-CceEEEEChHHHHHHHhhhhhheecccccceeccceeeeeccccceeEEEEcchhhccCcCcchhccEEEecCcee Confidence 999 6999999999999999998876655443221 1123578999999 8998753 443221 224688999998 Q ss_pred EEeeeeeeehhhcCC---CceeeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 228 AYVSQIDTVEALRDQ---DSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 228 ~~a~~~~~~e~~~~~---~~~~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) ....+-+.+....|+ +..+.++.+|.++++.|++...-.+.-...+ T Consensus 235 i~~~K~~~~~~f~P~~~~~~d~~~~~~R~Y~d~fv~~nk~~~Iy~~~~a 283 (285) T protein:vir:79 235 APIVKYDSVSVIDPSTDRSGNRWTIKGLSYYDAIVLDNAKKGIYVAATA 283 (285) T ss_pred ccceeeeeeEeECCCCCCCcceeeeeeeeeeeeeehhhccceeeeeecc Confidence 777777766555443 5557899999999999998764333322222 No 54 >protein:vir:99523 Length: 311 # NCBI annotation: putative protein # Family: family:all:701 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958538;genbank:gi:41179320;genbank:GeneID:2717161 Probab=99.96 E-value=1.7e-30 Score=183.77 Aligned_cols=266 Identities=16% Similarity=0.156 Sum_probs=201.3 Q ss_pred CccchhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeecCCCcccCCCCCccceEEEEEeee Q lcl|NC_011288. 1 MAFNNFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLLIDQE 80 (273) Q Consensus 1 MA~~~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~id~~ 80 (273) ||.+ +.+.|++.|+++|...++...+.+.++.. ...|++|+||++...+..||+|.++. ...+++.++.+++|+++ T Consensus 8 mAln--ya~~~~~~Ld~~~~~~~~t~~l~~~~~~~-~~Gak~VkIp~i~~~gl~dY~R~~g~-~~g~v~~~~et~tl~~D 83 (311) T protein:vir:99 8 RGFN--YVTKDGNLLDQKITAGLFTAALGTPEVDL-VNGGRSFTLKTISTSGLKDHTRGKGF-NSGTISDEKTIYTMGQD 83 (311) T ss_pred hHHH--HHHHHHHHHHHHHHhhhcccceecCchhe-eecCCEEEEEeeeeccccccccccCc-cccceeeeeeEEEeeec Confidence 5533 57999999999999999988888877653 34699999999999999999998764 56789999999999999 Q ss_pred eecceEEc--hHHHHhhhHHHHHHHH-HHHHHHHHHHHHHHHHHHhhcccccc--------------cccCCCHHHHHHH Q lcl|NC_011288. 81 KSIDFLVD--DIDRVQVAGSLEAYTR-AGATALATDTDKFIADLLVDNGTALS--------------GSAPTDADDAFDL 143 (273) Q Consensus 81 ~~~~~~i~--d~d~~~~~~~~~~~~~-~~~~ala~~~D~~i~~~~~~~~~~~~--------------~~~~~t~~~~~~~ 143 (273) +++.|.|| |+|++.....+.+++. .+....++++|.+.++.+.+.+.... .....+.++.++. T Consensus 84 R~~~f~vD~mDvdETn~~~~~ani~~~f~r~~vvPEiDayrfskla~~a~~~~~~~~~~~~~~~~~~~~~~lt~~nvl~~ 163 (311) T protein:vir:99 84 RDVEFYLDRQDVDETDNELAMANISNVFITEHVQPELDSYRFSKIATSFDNLDGTDTEGTLLAKTHKTEETLDETNAYSQ 163 (311) T ss_pred cceeeecchhchhhhhhhhHHHHHHHHHHHhhhcchhhHHHHHHHHhhhhcccccccchhhhccccccccccCHHHHHHH Confidence 99999999 7777765555666654 44456889999999998875443211 2234688899999 Q ss_pred HHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEee-C--cccc----CC- Q lcl|NC_011288. 144 IATALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVES-N--NLRD----TD- 215 (273) Q Consensus 144 i~~a~~~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s-~--~l~~----~~- 215 (273) |..+...|++ +|.++|+|+++|+++..|.+++.+.+.......+..-.++.|++++|++|++. + .++. +. T Consensus 164 l~~~~~~~~~--v~~~~rvl~vTp~~~~lLk~~~~~~r~~~~~~~~~~~i~~~V~~lDgv~Ii~V~ps~r~~t~~~ft~G 241 (311) T protein:vir:99 164 LKTGIGKVRK--YGTQNLVGYVSSEVMDALERSKEFTRNITNQNVGTTALESRITSIDGVQLIEVYESNRFMTKYDFTDG 241 (311) T ss_pred HHHHHHHHHh--cCCCCeEEEEChHHHHHHhhchhhheeeecccccccccccccceecCeEEEEecCchhhcchhhhcCC Confidence 9999999987 68899999999999998888776654444332222334788999999999876 3 2321 01 Q ss_pred --------CcEEEEEcCceeEEeeeeeeehhhc---CCCceeeeEEeeeeeeeEEecCc--eEEEEecCC Q lcl|NC_011288. 216 --------DEQFVAFHPSAAAYVSQIDTVEALR---DQDSFSDRIRALHVYGGKVVRPT--GVVVFNKTG 272 (273) Q Consensus 216 --------~~~~~~~~~~a~~~a~~~~~~e~~~---~~~~~~~~v~~~~~~g~~v~~~~--~~v~~~~~~ 272 (273) .-.+++.|++|.....+.+.+.... +++..+.++.+|.++++.|++.. +|.+=..++ T Consensus 242 ~~~~~~ak~INfiiv~~~a~i~~~K~~~v~~f~P~~~~~gd~~l~~~R~Y~D~fv~~nk~~~Iyv~~k~A 311 (311) T protein:vir:99 242 AKPTEDAKAINFLVVAKPAVISIVKENAVFLFAPGQHTDGDGYLYQNRLYHDLFIKKHKRDGIFVSVKKA 311 (311) T ss_pred ccccCcccccceEEeCCCeeeeeeeeeeeeeeCCCCCCCcceeeeeeeeeeeeeeeccccCeEEEeeecC Confidence 1236788999988777777665443 34456889999999999999876 443333333 No 55 >protein:vir:78090 Length: 302 # NCBI annotation: Cps # Family: family:all:701 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468790;genbank:gi:157325371;genbank:GeneID:5601852 Probab=99.95 E-value=4.6e-30 Score=181.44 Aligned_cols=268 Identities=16% Similarity=0.182 Sum_probs=201.3 Q ss_pred CccchhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecC-----cccceeecCCCcccCCCCCccceEEE Q lcl|NC_011288. 1 MAFNNFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVV-----APTVKDYKAAGRQTSADAISDTGVDL 75 (273) Q Consensus 1 MA~~~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~ 75 (273) |||+.-+.+.|++.|++.|...+++..|...+....+..|++|+||++. +.+..||+|.++.. ..+++.++.++ T Consensus 1 Mantl~ya~~~~~~Ld~~~~~~~~t~~l~~~~~~v~~~Gak~vkIp~is~~~~~TsGl~dy~R~~g~~-~g~v~~~~et~ 79 (302) T protein:vir:78 1 MANSLALAQIYQDNIDKAIAVNSKSAFLEANPNNVQYNGGNTIKIADISFGSGTTGDLKAYNRSTGFT-QGSVTLAWSDY 79 (302) T ss_pred CCchhHHHHHHHHHHHHHHHhhhceeecccCCceEEEecCcEEEEEEEEeeccccccccccccccCcc-ccceeeeeeeE Confidence 9998778899999999999999988887544333345679999999996 56899999987654 56799999999 Q ss_pred EEeeeeecceEEc--hHHHHhhhHHHHHHHHH-HHHHHHHHHHHHHHHHHhhcccccc-----cccCCCHHHHHHHHHHH Q lcl|NC_011288. 76 LIDQEKSIDFLVD--DIDRVQVAGSLEAYTRA-GATALATDTDKFIADLLVDNGTALS-----GSAPTDADDAFDLIATA 147 (273) Q Consensus 76 ~id~~~~~~~~i~--d~d~~~~~~~~~~~~~~-~~~ala~~~D~~i~~~~~~~~~~~~-----~~~~~t~~~~~~~i~~a 147 (273) +|++++++.|.|| |+|++.....+.+++.+ +....++++|++.++.+...+.... .++..+..+++++|..+ T Consensus 80 tlt~DR~~~f~vD~mDvdETn~~~~~ani~~ef~r~~vvPEiDayrfskla~~a~~~~~~~~~~~~~~t~~nvl~~i~~~ 159 (302) T protein:vir:78 80 TLDYDLAQSFQIDAMDVDETKNLATVGNVLSEYQRTKIVPAIDKYRFTKLANDGTGVGGVIDLSKPDASAQALMGDIATA 159 (302) T ss_pred EeeeccceeeeccccchhhhhhhhHHHHHHHHHHHhhhcchhhHHHHHHHHHhhhccCccccccccchhHHHHHHHHHHH Confidence 9999999999999 67776655556676544 5567889999999998875443222 22346889999999999 Q ss_pred HHHHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCc--ccc----C------- Q lcl|NC_011288. 148 LKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNN--LRD----T------- 214 (273) Q Consensus 148 ~~~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~--l~~----~------- 214 (273) ...|+++ ++|+|+++|.++..|.+++.+.+..+.......-.++.|++++|++|++.+. ++. . T Consensus 160 ~~~~~e~----~~~vl~vtp~~~~~Lk~a~~~~~~~~~~~~~~~~i~~~V~~lDgv~Ii~VPs~r~~t~~~f~~G~~~~~ 235 (302) T protein:vir:78 160 MELVDDS----NQLILVTSPTTLAGLLNTALIRESKNTQVLRRGEVDTKITFIQDVEVLQVPSEYLYDKVAPKVGVPDYT 235 (302) T ss_pred HHHhhcc----CCeEEEEChHHHHHHhcchhhccceeccccccccccceeeeecccEEEEchhhhcccceeccCCccccC Confidence 9999996 5899999999999998876544333332222223478899999999997553 321 0 Q ss_pred --CCcEEEEEcCceeEEeeeeeeehhhcC-CCceee--eEEeeeeeeeEEecCceEE-EEecCCC Q lcl|NC_011288. 215 --DDEQFVAFHPSAAAYVSQIDTVEALRD-QDSFSD--RIRALHVYGGKVVRPTGVV-VFNKTGS 273 (273) Q Consensus 215 --~~~~~~~~~~~a~~~a~~~~~~e~~~~-~~~~~~--~v~~~~~~g~~v~~~~~~v-~~~~~~s 273 (273) ..-.+++.|++|.....+.+.+....| ....+| ++.+|.++++.|++...-. ....+++ T Consensus 236 ~ak~INfiiv~~~a~ia~~K~~~~~if~P~~~~~gd~~l~~~R~Y~D~fV~~nk~~gI~~~~~~~ 300 (302) T protein:vir:78 236 GAKKIPYMIFKRDAPTGIVKTDKVRVFEPDTNQSADAYKVDLRLYHDLIVPKNQRPGIIKASFGT 300 (302) T ss_pred CccceeEEEECCCeeeeeeeeeeeEeeCCCCCCCcceeeeeeeeEeeeeeeccccCeEEEeeccc Confidence 112368889999988888888876655 444554 9999999999999977322 2333333 No 56 >protein:vir:9265 Length: 430 # NCBI annotation: 5 # Family: family:all:1412 # MgeID: mge:164 # MgeName: ST64T # Cross-refs: genbank:acc:NP_720329;genbank:gi:24371587;genbank:GeneID:955820 Probab=99.93 E-value=3.1e-29 Score=176.91 Aligned_cols=269 Identities=16% Similarity=0.092 Sum_probs=180.0 Q ss_pred Cccchhh-HHHHHHHHHHHHHHhhccchh--hcccccccc-cCCceEEEeecCcccceeecCCCcccCCCCCccceEEEE Q lcl|NC_011288. 1 MAFNNFI-PELWSDMLLEEWTAQTVFANL--VNREYEGTA-SKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLL 76 (273) Q Consensus 1 MA~~~~~-pev~~~~~~~~~~~~lv~~~~--v~~~~~~~~-~~Gdtv~ip~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 76 (273) |||+... -+++.+++++.|++.++++++ ++|+|+.++ +.||||.+|.+......+ +......++++.+.+++++ T Consensus 1 MAn~l~~~~~ii~~eal~~l~n~~v~a~~~~~~r~~d~~~~r~Gdti~~p~~~~~~~~~--G~~~t~~~~~i~e~~v~~~ 78 (430) T protein:vir:92 1 MALNEGQIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQE--GWDLTDKATGLLELNVAVN 78 (430) T ss_pred CccchhhHHHHHHHHHHHHHhhhhhhhhhhcccCCchhhhhcccceEEecccccccccc--CcccCCCCCccccceEEEE Confidence 9999665 489999999999999999996 557776553 689999999998876544 1111223456889999999 Q ss_pred EeeeeecceEEchHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc---cccCCCHHHHHHHHHHHHHHHhh Q lcl|NC_011288. 77 IDQEKSIDFLVDDIDRVQVAGSLEAYTRAGATALATDTDKFIADLLVDNGTALS---GSAPTDADDAFDLIATALKELTK 153 (273) Q Consensus 77 id~~~~~~~~i~d~d~~~~~~~~~~~~~~~~~ala~~~D~~i~~~~~~~~~~~~---~~~~~t~~~~~~~i~~a~~~l~~ 153 (273) +++++.+.|.+++.|+ ....+.++++++++++||.+||.++++++...+..+. .+++....+.+.++..+++.|++ T Consensus 79 v~~~k~V~~~~~~kel-~~~~~~~~~i~~Am~~LA~~Vd~dl~~~~~~~~~~v~~~~~~t~~~~~~~~~~~A~a~~~L~~ 157 (430) T protein:vir:92 79 MGEPDNDFFQLRADDL-RDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVADAEELMFS 157 (430) T ss_pred EeeeccceEEechhHh-cChhHHHHHhHHHHHHHHHHHHHHHHHHhhhcccccccccccCCCcCCcchhhHHHHHHHHHH Confidence 9999999999999884 3334457788999999999999999999876554332 23344445668999999999999 Q ss_pred cCCCcc-CCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeee-EeceEE-EeeCccccCCCcEE----EEEcCce Q lcl|NC_011288. 154 ANVPNV-GRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGN-LLGARI-VESNNLRDTDDEQF----VAFHPSA 226 (273) Q Consensus 154 ~~vP~~-~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~-~~G~~v-~~s~~l~~~~~~~~----~~~~~~a 226 (273) +++|.+ +|.+|++|+.+..|...-..+...+.. .++++|+|+|++ +.||++ |+++.+|.+++..+ +.+..-- T Consensus 158 ~~vP~~~~R~~vldp~~~~~l~~~l~~l~~~~~~-~~~A~r~g~i~~~~~Gfd~~~~~~~~~~~t~g~~t~~tv~gA~~~ 236 (430) T protein:vir:92 158 RELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRI-PEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGITVSGAQSF 236 (430) T ss_pred hcCCCCCCcEEEeChHHHHHHHhhhccccccccc-hhHHHhhccccccchhhhhhhhcCCcccccCccCcCceecccccc Confidence 999995 799999999999987653333332222 456899999997 999975 67888886544322 1111100 Q ss_pred eEEeeeee--ee------------hhhcCCCceeeeEEeeeeeeeEEe------cCceEEEEe-cCCC Q lcl|NC_011288. 227 AAYVSQID--TV------------EALRDQDSFSDRIRALHVYGGKVV------RPTGVVVFN-KTGS 273 (273) Q Consensus 227 ~~~a~~~~--~~------------e~~~~~~~~~~~v~~~~~~g~~v~------~~~~~v~~~-~~~s 273 (273) -..+.+++ -. -..-.-...||.+.-.=+|+.-.+ ++..-++.. ..++ T Consensus 237 ~~~~~~v~~~g~~~~~d~~~~tit~s~tg~l~~GD~ftiaGV~~v~~~tkq~~~~l~~F~Vt~~~~at 304 (430) T protein:vir:92 237 KPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVDGT 304 (430) T ss_pred ccccceecccccccccccccceeeeecccceecccEEEecceeeeccccccccCCccEEEEEEecCCc Confidence 00111111 00 000111233444444333333333 334455544 3333 No 57 >protein:vir:100939 Length: 430 # NCBI annotation: Gp5 # Family: family:all:1412 # MgeID: mge:1509 # MgeName: ST104 # Cross-refs: genbank:acc:YP_006408;genbank:gi:46358700;genbank:GeneID:2777089 Probab=99.93 E-value=3.1e-29 Score=176.91 Aligned_cols=269 Identities=16% Similarity=0.092 Sum_probs=180.0 Q ss_pred Cccchhh-HHHHHHHHHHHHHHhhccchh--hcccccccc-cCCceEEEeecCcccceeecCCCcccCCCCCccceEEEE Q lcl|NC_011288. 1 MAFNNFI-PELWSDMLLEEWTAQTVFANL--VNREYEGTA-SKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLL 76 (273) Q Consensus 1 MA~~~~~-pev~~~~~~~~~~~~lv~~~~--v~~~~~~~~-~~Gdtv~ip~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 76 (273) |||+... -+++.+++++.|++.++++++ ++|+|+.++ +.||||.+|.+......+ +......++++.+.+++++ T Consensus 1 MAn~l~~~~~ii~~eal~~l~n~~v~a~~~~~~r~~d~~~~r~Gdti~~p~~~~~~~~~--G~~~t~~~~~i~e~~v~~~ 78 (430) T protein:vir:10 1 MALNEGQIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQE--GWDLTDKATGLLELNVAVN 78 (430) T ss_pred CccchhhHHHHHHHHHHHHHhhhhhhhhhhcccCCchhhhhcccceEEecccccccccc--CcccCCCCCccccceEEEE Confidence 9999665 489999999999999999996 557776553 689999999998876544 1111223456889999999 Q ss_pred EeeeeecceEEchHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc---cccCCCHHHHHHHHHHHHHHHhh Q lcl|NC_011288. 77 IDQEKSIDFLVDDIDRVQVAGSLEAYTRAGATALATDTDKFIADLLVDNGTALS---GSAPTDADDAFDLIATALKELTK 153 (273) Q Consensus 77 id~~~~~~~~i~d~d~~~~~~~~~~~~~~~~~ala~~~D~~i~~~~~~~~~~~~---~~~~~t~~~~~~~i~~a~~~l~~ 153 (273) +++++.+.|.+++.|+ ....+.++++++++++||.+||.++++++...+..+. .+++....+.+.++..+++.|++ T Consensus 79 v~~~k~V~~~~~~kel-~~~~~~~~~i~~Am~~LA~~Vd~dl~~~~~~~~~~v~~~~~~t~~~~~~~~~~~A~a~~~L~~ 157 (430) T protein:vir:10 79 MGEPDNDFFQLRADDL-RDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVADAEELMFS 157 (430) T ss_pred EeeeccceEEechhHh-cChhHHHHHhHHHHHHHHHHHHHHHHHHhhhcccccccccccCCCcCCcchhhHHHHHHHHHH Confidence 9999999999999884 3334457788999999999999999999876554332 23344445668999999999999 Q ss_pred cCCCcc-CCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeee-EeceEE-EeeCccccCCCcEE----EEEcCce Q lcl|NC_011288. 154 ANVPNV-GRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGN-LLGARI-VESNNLRDTDDEQF----VAFHPSA 226 (273) Q Consensus 154 ~~vP~~-~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~-~~G~~v-~~s~~l~~~~~~~~----~~~~~~a 226 (273) +++|.+ +|.+|++|+.+..|...-..+...+.. .++++|+|+|++ +.||++ |+++.+|.+++..+ +.+..-- T Consensus 158 ~~vP~~~~R~~vldp~~~~~l~~~l~~l~~~~~~-~~~A~r~g~i~~~~~Gfd~~~~~~~~~~~t~g~~t~~tv~gA~~~ 236 (430) T protein:vir:10 158 RELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRI-PEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGITVSGAQSF 236 (430) T ss_pred hcCCCCCCcEEEeChHHHHHHHhhhccccccccc-hhHHHhhccccccchhhhhhhhcCCcccccCccCcCceecccccc Confidence 999995 799999999999987653333332222 456899999997 999975 67888886544322 1111100 Q ss_pred eEEeeeee--ee------------hhhcCCCceeeeEEeeeeeeeEEe------cCceEEEEe-cCCC Q lcl|NC_011288. 227 AAYVSQID--TV------------EALRDQDSFSDRIRALHVYGGKVV------RPTGVVVFN-KTGS 273 (273) Q Consensus 227 ~~~a~~~~--~~------------e~~~~~~~~~~~v~~~~~~g~~v~------~~~~~v~~~-~~~s 273 (273) -..+.+++ -. -..-.-...||.+.-.=+|+.-.+ ++..-++.. ..++ T Consensus 237 ~~~~~~v~~~g~~~~~d~~~~tit~s~tg~l~~GD~ftiaGV~~v~~~tkq~~~~l~~F~Vt~~~~at 304 (430) T protein:vir:10 237 KPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVDGT 304 (430) T ss_pred ccccceecccccccccccccceeeeecccceecccEEEecceeeeccccccccCCccEEEEEEecCCc Confidence 00111111 00 000111233444444333333333 334455544 3333 No 58 >protein:vir:2106 Length: 430 # NCBI annotation: coat protein # Family: family:all:1412 # MgeID: mge:46 # MgeName: P22 # Cross-refs: genbank:acc:NP_059630;genbank:gi:9635538;genbank:GeneID:1262831 Probab=99.93 E-value=2.5e-28 Score=171.91 Aligned_cols=268 Identities=16% Similarity=0.113 Sum_probs=176.6 Q ss_pred Cccch--hhHHHHHHHHHHHHHHhhccchh--hcccccccc-cCCceEEEeecCcccceeecCCCcccCCCCCccceEEE Q lcl|NC_011288. 1 MAFNN--FIPELWSDMLLEEWTAQTVFANL--VNREYEGTA-SKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDL 75 (273) Q Consensus 1 MA~~~--~~pev~~~~~~~~~~~~lv~~~~--v~~~~~~~~-~~Gdtv~ip~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 75 (273) ||++. +. ++--+++++.|+..+||+++ ++|+|+.++ +.||||.+|.+......+ .......++++.+.++++ T Consensus 1 Ma~~~~~~l-ti~~~eal~~~~n~lV~a~~~~~~r~~d~~~~r~Gdti~ip~p~~~~~~~--G~~~t~~~~~~~e~~v~~ 77 (430) T protein:vir:21 1 MALNEGQIV-TLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQE--GWDLTDKATGLLELNVAV 77 (430) T ss_pred Cccccchhh-HHHHHHHHHHhhhhhhhhhhhhccCCchhhhhcccceEEeeccccccccc--cccccCCCccceeeeEeE Confidence 99993 43 33339999999999999997 567776554 689999999987766543 211123456789999999 Q ss_pred EEeeeeecceEEchHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc---cccCCCHHHHHHHHHHHHHHHh Q lcl|NC_011288. 76 LIDQEKSIDFLVDDIDRVQVAGSLEAYTRAGATALATDTDKFIADLLVDNGTALS---GSAPTDADDAFDLIATALKELT 152 (273) Q Consensus 76 ~id~~~~~~~~i~d~d~~~~~~~~~~~~~~~~~ala~~~D~~i~~~~~~~~~~~~---~~~~~t~~~~~~~i~~a~~~l~ 152 (273) ++++++.+.|.+++.|. ......++++++++++||.+||.++++++...+..+. .+++.++.+.+.++..+++.|+ T Consensus 78 ~~~~~~~V~~~~~~kEl-~~~~~~er~l~pAm~~LA~~Vd~dl~~~~~~~~~~v~~~~~~t~~~~~~~~~~~A~a~~~L~ 156 (430) T protein:vir:21 78 NMGEPDNDFFQLRADDL-RDETAYRRRIQSAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVADAEEIMF 156 (430) T ss_pred EEeeeccceEEeehhHh-cChhhHHHHHHHHHHHHHHHHHHHHHHHhhhhhhccccccCCCCCCCCcchhhHHHHHHHHH Confidence 99999999999998774 3333458899999999999999999999876553332 2334444556899999999999 Q ss_pred hcCCCcc-CCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeee-EeceEE-EeeCccccCCCcEE----EEEcCc Q lcl|NC_011288. 153 KANVPNV-GRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGN-LLGARI-VESNNLRDTDDEQF----VAFHPS 225 (273) Q Consensus 153 ~~~vP~~-~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~-~~G~~v-~~s~~l~~~~~~~~----~~~~~~ 225 (273) ++++|.+ +|.++++|+.+..|......+...+.. .++++|+|+|++ +.||++ |.|+++|.+++..+ +.+..- T Consensus 157 ~~~vP~~~~R~~~~~p~~~~~l~~~l~~~~~~~~~-~~~A~r~g~i~r~~~Gfd~~~~s~~~~~~t~gt~t~~tv~gA~~ 235 (430) T protein:vir:21 157 SRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRI-PEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGITVSGAQS 235 (430) T ss_pred HhcCCCCCCcEEEeChHHHHHHhhhhccccccccc-hhHHHhhcccccccchhhhhhhcCCcccccCccCcCceeccccc Confidence 9999995 799999999999887654333333222 456899999997 999985 67888886544322 111110 Q ss_pred eeEEeeeee---------e-----ehhhcCCCceeeeEEeeeeeeeEEec------CceEEEEe-cCCC Q lcl|NC_011288. 226 AAAYVSQID---------T-----VEALRDQDSFSDRIRALHVYGGKVVR------PTGVVVFN-KTGS 273 (273) Q Consensus 226 a~~~a~~~~---------~-----~e~~~~~~~~~~~v~~~~~~g~~v~~------~~~~v~~~-~~~s 273 (273) --..+.+++ . .-..-.....||.+.-.=++..-.+. +.--+|+. .+++ T Consensus 236 ~~~~~~tv~~~g~~~~~d~~~~~it~s~tg~l~~GD~ftiaGV~~v~~itk~~~~~l~qf~V~a~~~~t 304 (430) T protein:vir:21 236 FKPVAWQLDNDGNKVNVDNRFATVTLSATTGMKRGDKISFAGVKFLGQMAKNVLAQDATFSVVRVVDGT 304 (430) T ss_pred cccccceeccccccccccccceeeeeecccceecccEEEecceeeeccccccccCCcceEEEEEecCCc Confidence 000111111 0 00011122344544443333333333 22334444 3333 No 59 >protein:vir:1781 Length: 221 # NCBI annotation: minor capsid protein # Family: family:all:975 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570347;genbank:gi:18640506;genbank:GeneID:932719 Probab=99.92 E-value=3.5e-28 Score=171.08 Aligned_cols=185 Identities=19% Similarity=0.204 Sum_probs=131.9 Q ss_pred EeeeeecceEEchHHHHhhhHHH-HHHHHHHHHHHHHHHHHHHHHHHhhcccc--------------cccccCCCHHHHH Q lcl|NC_011288. 77 IDQEKSIDFLVDDIDRVQVAGSL-EAYTRAGATALATDTDKFIADLLVDNGTA--------------LSGSAPTDADDAF 141 (273) Q Consensus 77 id~~~~~~~~i~d~d~~~~~~~~-~~~~~~~~~ala~~~D~~i~~~~~~~~~~--------------~~~~~~~t~~~~~ 141 (273) ||......+.|+|+|++++++++ .++.++++++||+++|+.++..+..++.. ...+..+++..++ T Consensus 1 iD~lL~a~~~VdDiD~aqa~~dvr~e~t~e~G~ALA~~~D~~i~~~~~~aA~~~~p~~~~~~g~~~~~~a~~t~~~~~l~ 80 (221) T protein:vir:17 1 MDDLLVASQFVYDLDEILAQWNTRSEISKQIGEALAIHYDERIARVLASASIAAAPVTGQDGGFSVNIGAGNTNNAQAIV 80 (221) T ss_pred CCcchhHHHHHHhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCcccccccCcceeccccccCCHHHHH Confidence 99999999999999999999996 56999999999999999999887644321 1122345677889 Q ss_pred HHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhh-hHHHHhhhhcccccceeeee-eeeeEeceEEEeeCccccCCCcEE Q lcl|NC_011288. 142 DLIATALKELTKANVPNVGRVVVVNAEMAFWLRS-SGSKLTSADTSGDAAGLRAG-TIGNLLGARIVESNNLRDTDDEQF 219 (273) Q Consensus 142 ~~i~~a~~~l~~~~vP~~~r~lvv~p~~~~~L~~-~~~~~~~~~~~~~~~~l~~G-~ig~~~G~~v~~s~~l~~~~~~~~ 219 (273) +.|.+++++|++++||.++||+||+|++|..|++ ++.++.+.+..++...+++| .|++++||+||+||++|..++... T Consensus 81 dai~~a~~~LdekdVP~~gR~~vv~P~~y~~LL~~~d~~~~n~d~~~s~g~~~~g~~i~~v~G~~V~~SnnlP~~~gt~~ 160 (221) T protein:vir:17 81 DGFFEAAAVLDERSAPMDGRVAVLSPRQYYSLISSVDTNILNREIGNTQGDMNTGKGLYVNAGIRIYKSNVLASLYGTNL 160 (221) T ss_pred HHHHHHHHHHhhcCCCCCCCEEEeCcHHHHHHHHhcCcceeeeecccccccccccceeeeecCcEEEEeccCCccccccc Confidence 9999999999999999999999999987777775 34567777777777778999 499999999999999997544322 Q ss_pred EEEcCceeEEeeeeeeehhhcCCCceeeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 220 VAFHPSAAAYVSQIDTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 220 ~~~~~~a~~~a~~~~~~e~~~~~~~~~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) +.++-.++.+..+.+.+|.. |+..+ +.+..|+++.++|.=+- T Consensus 161 ---~~~ag~~~~~~~~~~~yr~~--fs~~~-------glv~~~~Avgtvkl~~~ 202 (221) T protein:vir:17 161 ---VTDPGDATTSGENNGSYRPA--ITDRA-------GLVFHKEAADTVEVLLP 202 (221) T ss_pred ---ccCCcccccccccccccccc--ccceE-------EEEEcchheeeeeeecC Confidence 12222222222222222222 11111 23445554444443333 No 60 >protein:vir:5974 Length: 324 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690674;genbank:geneid:6329212;genbank:gi:22855068;goa:Q38582;uniprot:Q38582;genbank:GeneID:955303 Probab=99.83 E-value=6.1e-22 Score=136.89 Aligned_cols=263 Identities=11% Similarity=0.070 Sum_probs=184.0 Q ss_pred Cccc----hhhHHHHHHHHHHHHHHhhccch--hhcccc----cc-cccCCceEEEeecCcc-c-ceeecCCCcccCCCC Q lcl|NC_011288. 1 MAFN----NFIPELWSDMLLEEWTAQTVFAN--LVNREY----EG-TASKGNVVHIAGVVAP-T-VKDYKAAGRQTSADA 67 (273) Q Consensus 1 MA~~----~~~pev~~~~~~~~~~~~lv~~~--~v~~~~----~~-~~~~Gdtv~ip~~~~~-~-~~~~~~~~~~~~~~~ 67 (273) ||.+ .++||+|++.+.+.+.+.+.|.. .+.++- .+ ...+|++|++|.|+.+ + .+++. ++..++++. T Consensus 1 MA~T~lsd~i~peVf~~yv~~~~~~~~~l~qSg~i~~~a~i~~~l~~~~~G~~i~~P~~~~l~Gd~~~v~-~~~~i~~~~ 79 (324) T protein:vir:59 1 MAYTKISDVIVPELFNPYVINTTTQLSAFFQSGIAATDDELNALAKKAGGGSTLNMPYWNDLDGDSQVLN-DTDDLVPQK 79 (324) T ss_pred CCceeeeceechhHHHHHHHhhhHHHHHHhhcccccccHHHHHHhhccCCCCEEEecccccCCCcccccC-CCcccchhh Confidence 9966 47899999999999988876622 222211 11 2347999999999987 3 44444 466788889 Q ss_pred CccceEEEEEeeeeecceEEchHHHHhhhHH-HHHHHHHHHHHHHHHHHHHHHHHHhhccc-------ccccccCCCHHH Q lcl|NC_011288. 68 ISDTGVDLLIDQEKSIDFLVDDIDRVQVAGS-LEAYTRAGATALATDTDKFIADLLVDNGT-------ALSGSAPTDADD 139 (273) Q Consensus 68 ~~~~~~~~~id~~~~~~~~i~d~d~~~~~~~-~~~~~~~~~~ala~~~D~~i~~~~~~~~~-------~~~~~~~~t~~~ 139 (273) ++.++...++ +.++.++.++|+.......+ ++++.+|.+..++++.+.++++.+.+.-. ....++..+... T Consensus 80 l~t~~~~a~i-~~~~k~~~~tD~a~~~sg~dp~~~i~~q~a~~~~~~~~~~lia~l~g~~~~~~~~~~~~dvsa~~~~~~ 158 (324) T protein:vir:59 80 INAGQDKAVL-ILRGNAWSSHDLAATLSGSDPMQAIGSRVAAYWAREMQKIVFAELAGVFSNDDMKDNKLDISGTADGIY 158 (324) T ss_pred cccceeeEEE-EeecCceeehhhhhhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccceeeeecccccee Confidence 9999988888 46899999999877665555 78899999999999999999988764211 111122222223 Q ss_pred HHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCccccC----- Q lcl|NC_011288. 140 AFDLIATALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDT----- 214 (273) Q Consensus 140 ~~~~i~~a~~~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~----- 214 (273) ..+.|.+|..+|.++. ..-..++++|..+..|.++. .+ +.-.... .++.|+.+.|..|+++..+|.. T Consensus 159 s~~~l~~A~~~~GD~~--~~~~~ivmhS~v~~~L~~~~-li-~~~~~s~----~~~~i~~~~G~~VivdD~~p~~~~~~~ 230 (324) T protein:vir:59 159 SAETFVDASYKLGDHE--SLLTAIGMHSATMASAVKQD-LI-EFVKDSQ----SGIRFPTYMNKRVIVDDSMPVETLEDG 230 (324) T ss_pred cHHHHHHHHHHhCCcc--cCcEEEEEchHHHHHHHHhh-hh-hhccccc----cCceeeeecccEEEEeCCCCccccCCC Confidence 4678999999998875 34467899999999998763 22 2111111 1367899999999999998842 Q ss_pred -CCcEEEEEcCceeEEeeee--eeehhhcCCCceeeeEEeeeeeeeEEe----cCceEEEEecCCC Q lcl|NC_011288. 215 -DDEQFVAFHPSAAAYVSQI--DTVEALRDQDSFSDRIRALHVYGGKVV----RPTGVVVFNKTGS 273 (273) Q Consensus 215 -~~~~~~~~~~~a~~~a~~~--~~~e~~~~~~~~~~~v~~~~~~g~~v~----~~~~~v~~~~~~s 273 (273) ..+.++...++|+++..+. ..+|..|++.+..|.++.+.+|...+. ....+.-..++.+ T Consensus 231 ~~~y~s~l~~~GAi~~~~~~~~v~vE~dRd~~~g~~~l~~r~~~~~~p~G~s~~~~~~~~~sPt~~ 296 (324) T protein:vir:59 231 TKVFTSYLFGAGALGYAEGQPEVPTETARNALGSQDILINRKHFVLHPRGVKFTENAMAGTTPTDE 296 (324) T ss_pred CceEEEEEEecCeEEEeecCCCcceecccCccccceEEEEeeEEEeEeeeEEecccccCCCCCChh Confidence 2345778889999987643 467999999888888888888775553 2222222223222 No 61 >protein:vir:95451 Length: 313 # NCBI annotation: hypothetical protein ORF044 # Family: family:all:11728 # MgeID: mge:1570 # MgeName: PA11 # Cross-refs: genbank:acc:YP_001294637;genbank:gi:149408203;genbank:GeneID:5237018 Probab=99.83 E-value=2.4e-23 Score=144.59 Aligned_cols=269 Identities=18% Similarity=0.170 Sum_probs=192.7 Q ss_pred Cccc-----hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeecCCCcccCCCCCccceEEE Q lcl|NC_011288. 1 MAFN-----NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDL 75 (273) Q Consensus 1 MA~~-----~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 75 (273) |-.+ ++..|+|+++++..|.+.|. +.-+.|+.. ++..|++.|||.+|.+++++. ++.++..++++.++++++ T Consensus 1 ~~~TSNT~A~I~SE~~s~~I~~~LH~~LL-~~~~~R~V~-DF~~G~~L~I~tiGs~~~~~~-~E~~~~~~~~i~TGEIt~ 77 (313) T protein:vir:95 1 MQLTSNTRAFIESEQYSKFILLNLHDGLL-PETFYRNVS-DFGSGETLHIKTIGSVTLQEA-EEDTPLIYNPIETGEITF 77 (313) T ss_pred CcccccchheehhhhHHHHHHHHhhcccc-chhhhhhhc-cCCCCCEEEecccCceeeecc-ccCCCeeecccccceEEE Confidence 5433 46799999999999999874 444445433 577899999999999998874 567889999999999999 Q ss_pred EEeeeeecceEEchHH---HHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHhh----cc--ccc------ccccCCCHHHH Q lcl|NC_011288. 76 LIDQEKSIDFLVDDID---RVQVAGSLEAYTRAGATALATDTDKFIADLLVD----NG--TAL------SGSAPTDADDA 140 (273) Q Consensus 76 ~id~~~~~~~~i~d~d---~~~~~~~~~~~~~~~~~ala~~~D~~i~~~~~~----~~--~~~------~~~~~~t~~~~ 140 (273) .|..+++-+..|++.- -..++..+.+...++.+++.+.+..+++++-.+ .+ ..+ -.++++++... T Consensus 78 ~i~~Y~G~A~~vt~~LR~D~~~I~~~~A~~~AE~~RAI~E~~~TD~L~~G~~~FA~~~~P~~vNG~PH~~V~~~T~~~~~ 157 (313) T protein:vir:95 78 QITEYKGDAWYVTDDLREDGTDIDRLMAERAAESTRAIQETFETDFLKTGAEYFAANPGPHNVNGFPHVIVSAETNGVFA 157 (313) T ss_pred EEEeecCChhhhhhhhhhcchhHHHHhhhcchhhHHHHHHHHhhHHHhhchhhhccCCCCcccccccceEEeccCCceeh Confidence 9999888888888642 233344456667788899999999998875432 11 111 12466777788 Q ss_pred HHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeee------eeeeEeceEEEeeCccccC Q lcl|NC_011288. 141 FDLIATALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAG------TIGNLLGARIVESNNLRDT 214 (273) Q Consensus 141 ~~~i~~a~~~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G------~ig~~~G~~v~~s~~l~~~ 214 (273) +..|.+++-.|++.++|.+||+.+++|.....|.............+. -.+.+| .|.++||++++.||.+.+. T Consensus 158 ~~~~~~~~~~~~~a~~P~~G~v~IvDP~~~~~L~~l~~It~~vt~~~k-~I~ESG~A~~~~Fi~~~YG~Di~~SN~L~~A 236 (313) T protein:vir:95 158 LKHLIAMRLAFDKANVPAEGRVFIVDPVAEATLNGLVTITHDVTDFGK-MILESGMARGQRFIMNLYGWDILTSNRLHVA 236 (313) T ss_pred hhHHHHhhhhhhhccCCccceEEEEcchhhhhhhhhheeecccccccc-eeeeccCCchhHHHHHHhhhhhhhhhhhhhc Confidence 999999999999999999999999999999888665432221111111 133344 2678999999999988643 Q ss_pred CCc--------EE------EE--EcCceeEEeeeeeeehhhcCCCceeeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 215 DDE--------QF------VA--FHPSAAAYVSQIDTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 215 ~~~--------~~------~~--~~~~a~~~a~~~~~~e~~~~~~~~~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) ..+ .| +. +.+.-++.=+.+.+.|.+++..+-.+.-..+++||.++.|.|.++++-++++ T Consensus 237 N~~D~~tT~~G~~~NlFM~i~D~~~~P~~~AWr~MP~s~~~~~~~~~~~~~~~~~R~G~Gi~R~~~L~~~~~~A~ 311 (313) T protein:vir:95 237 NYNDGTTTGNGYVGNLFMCILDDQTKPIMGAWRRMPKSEGERNKDRARDEHVVRCRYGFGIQRLDTLGLLATSAT 311 (313) T ss_pred cccccccccCceeeeeeeeeecccccceeeeeccccccccccccccccccceeeeeecccceeecceeEEEeccc Confidence 211 11 00 0111112223345667888877777777778999999999999999989888 No 62 >protein:vir:102944 Length: 330 # NCBI annotation: major head protein # Family: family:all:1522 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945286;genbank:gi:39653721;uniprot:Q708M6;genbank:GeneID:2672858 Probab=99.79 E-value=1.4e-20 Score=129.44 Aligned_cols=260 Identities=12% Similarity=0.092 Sum_probs=180.9 Q ss_pred Cccc------hhhHHHHHHHHHHHHHHhhccch--hhcccc----cccccCCceEEEeecCcc-cceeecCCC-cccCCC Q lcl|NC_011288. 1 MAFN------NFIPELWSDMLLEEWTAQTVFAN--LVNREY----EGTASKGNVVHIAGVVAP-TVKDYKAAG-RQTSAD 66 (273) Q Consensus 1 MA~~------~~~pev~~~~~~~~~~~~lv~~~--~v~~~~----~~~~~~Gdtv~ip~~~~~-~~~~~~~~~-~~~~~~ 66 (273) ||++ .++||+|+..+.+.+.+.+.|.. .+..+- ..++ +|+++++|.|+.+ +..+....+ +.++++ T Consensus 1 Ma~~~T~l~d~i~pevf~~yv~~~~~~~~~l~qSG~i~~~~~i~~~~~~-~G~~i~~P~~~~l~G~~~~~~dg~~~i~~~ 79 (330) T protein:vir:10 1 MANELTKILDTITPQQYNAYMQQYTAAKSAFVQSGIAVSDERVSKNITS-GGLLVNMPFWNDLTGDSEVLGNGDKALETG 79 (330) T ss_pred CCCCceEeeeeechhHHHHHHHHHhHHhhhhhhcccccccHHHHHHhhc-CCCEEEecccccCCCcccccCCCccccchh Confidence 9974 57899999999999987765522 222221 1222 7999999999977 333333333 468888 Q ss_pred CCccceEEEEEeeeeecceEEchHHHHhhhHH-HHHHHHHHHHHHHHHHHHHHHHHHhhcccc-------------cccc Q lcl|NC_011288. 67 AISDTGVDLLIDQEKSIDFLVDDIDRVQVAGS-LEAYTRAGATALATDTDKFIADLLVDNGTA-------------LSGS 132 (273) Q Consensus 67 ~~~~~~~~~~id~~~~~~~~i~d~d~~~~~~~-~~~~~~~~~~ala~~~D~~i~~~~~~~~~~-------------~~~~ 132 (273) .++.++...++ +.+..++.++|+.......+ +.++.+|.+...+++.+..+++.+...-.. .... T Consensus 80 ki~t~~~~a~i-~~~~k~~~~tD~a~~~~g~dp~~~i~~q~a~~w~~~~q~~lla~l~gvf~~~~~~~~~~~~~~~~~~~ 158 (330) T protein:vir:10 80 KITAGADIACV-LYRGRGWAANELTGVVAGSDPVRAILNRIGAYWLREDQKALIATLNGIFATGTAGEKGALEETHVSDQ 158 (330) T ss_pred hcccceeEEEE-EeecceeeehhhhhhhcchhHHHHHHHHHHHHhhhhHHHHHHHHHHhhhhhhhcccchhhhhhheecc Confidence 89999888888 55788999999886655555 788999999999999999988877532110 0001 Q ss_pred cCCCHHHHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCccc Q lcl|NC_011288. 133 APTDADDAFDLIATALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLR 212 (273) Q Consensus 133 ~~~t~~~~~~~i~~a~~~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~ 212 (273) +.......++.|.+|..+|.++. ..-..++++|..+..|.+.. .+ +.-.... .++.|+.+.|..|+++..+| T Consensus 159 ~~~~a~~s~~~l~~A~~~~GD~~--~~~~~ivmhS~v~~~L~~~~-li-~~~~~s~----~~~~i~~~~G~~VivdD~~p 230 (330) T protein:vir:10 159 SKASTGIDAGMVLDAKQLLGDSA--DQVTAIAMHSAVYTKLQKDN-LI-QYIQPTT----ATINIPTYLGYRVIIDDGIA 230 (330) T ss_pred cccccccCHHHHHHHHHHhcccc--ccceEEEEcHHHHHHHHHhh-hh-hhhcccc----cCcccccccceEEEEeCCCC Confidence 11122234578999999998876 34468899999999998742 22 2222111 14679999999999999998 Q ss_pred cCC-CcEEEEEcCceeEEeee----eeeehhhcCCCceeeeEEeeeeeeeEEecCceEEEEe---------cCCC Q lcl|NC_011288. 213 DTD-DEQFVAFHPSAAAYVSQ----IDTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFN---------KTGS 273 (273) Q Consensus 213 ~~~-~~~~~~~~~~a~~~a~~----~~~~e~~~~~~~~~~~v~~~~~~g~~v~~~~~~v~~~---------~~~s 273 (273) ... .+.++...++|+++..+ ...+|..|++++-.|.+..+.+|...+ -|.---. ++.+ T Consensus 231 ~~~~~yt~yl~~~GAi~~~~~~~~~~v~~EtdRd~~~g~~~l~~r~~~~~hp---~G~s~~~~~~~~~~~sPt~~ 302 (330) T protein:vir:10 231 PTGDIYTSYLFRTGSIGLNTGNPSGLTTFETSREAAKGNDMIYTRRALVMHP---YGVKWTGAEVDAGNITPSNA 302 (330) T ss_pred CCCCceeEEEEecCceeeecccCCccccccccCCccccceEEEEeeEEEeee---eeeeecccccccCcCCcChH Confidence 644 35567788999988653 357899999998888999999877654 3333222 1111 No 63 >protein:vir:1583 Length: 351 # NCBI annotation: minor capsid protein # Family: family:all:1522 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695165;swissprot:trembl:o03966;genbank:gi:23455804;uniprot:O03966;genbank:GeneID:955561 Probab=99.78 E-value=2.8e-20 Score=127.79 Aligned_cols=262 Identities=13% Similarity=0.103 Sum_probs=176.8 Q ss_pred Cccc----hhhHHHHHHHHHHHHHHhhccch--hhccccccc---ccCCceEEEeecCcc-cceeecCCCcccCCCCCcc Q lcl|NC_011288. 1 MAFN----NFIPELWSDMLLEEWTAQTVFAN--LVNREYEGT---ASKGNVVHIAGVVAP-TVKDYKAAGRQTSADAISD 70 (273) Q Consensus 1 MA~~----~~~pev~~~~~~~~~~~~lv~~~--~v~~~~~~~---~~~Gdtv~ip~~~~~-~~~~~~~~~~~~~~~~~~~ 70 (273) ||.+ .++||+|++.+.+.+.+.+.|.. .+..+-+.. -..|++|+||.|+.+ +..+...++..+++++++. T Consensus 1 MA~T~lsd~i~PEvf~~yv~~~~~~~~~l~qSG~i~~~~~l~~~~~~~G~~it~P~~~~l~Gd~~~~~~~~~i~~~kitt 80 (351) T protein:vir:15 1 MAETHLSDLIVPEVFGNYVVNQIIKTNRFVQSGILTPDPDLGPHLLEAGTRITVPFLNDLTGDPDNWTDSDDIDVNNLTS 80 (351) T ss_pred CCceeeeeeechhHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhcCCCEEEecccccCCCcccccCCCcccchheecc Confidence 9988 47899999999999887775522 222221111 136999999999986 3333334566788899999 Q ss_pred ceEEEEEeeeeecceEEchHHHHhhhHH-HHHHHHHHHHHHHHHHHHHHHHHHhhcccc----------cccccCCCHHH Q lcl|NC_011288. 71 TGVDLLIDQEKSIDFLVDDIDRVQVAGS-LEAYTRAGATALATDTDKFIADLLVDNGTA----------LSGSAPTDADD 139 (273) Q Consensus 71 ~~~~~~id~~~~~~~~i~d~d~~~~~~~-~~~~~~~~~~ala~~~D~~i~~~~~~~~~~----------~~~~~~~t~~~ 139 (273) ++...++ +.+..++.++|+.......+ ++++.+|.+...+++.+..+++.+.+.-.. ....++.+... T Consensus 81 ~~~~a~i-~~~~kg~~~tD~a~~~sg~dp~~~i~~q~a~~w~~~~q~~lla~l~gv~~~~~~~~~~~~d~t~~~~~~~~i 159 (351) T protein:vir:15 81 GKQQGIK-FYQTKAYGYTDLGTMISGAPVQETIGNRFAAFWQRADQKTLLSVLKGVMGVTKIANSKVYDQTKVSPSEPMF 159 (351) T ss_pred cceeEEE-EeeccceehhhhhHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhchhhcccceecccccccccccc Confidence 9988888 66788899999877666556 788999999999999999999887642110 11111222334 Q ss_pred HHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCccccC----- Q lcl|NC_011288. 140 AFDLIATALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDT----- 214 (273) Q Consensus 140 ~~~~i~~a~~~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~----- 214 (273) .++.|.+|..+|.+..- ..-..++++|..+..|.+.. .+ +.-.... .++.|+.+.|..|+++..+|.. T Consensus 160 s~~~l~~A~~~~GD~~~-~~~~~ivmhS~v~~~L~~~~-li-~~~~~s~----~~~~i~t~~G~~VivdD~~p~~~~~~~ 232 (351) T protein:vir:15 160 GAKGFTGAIGLMGDLQD-TAFGAIAVNSATYSLMKVQG-LI-ETIQPQN----GATPFEAYNGLRIVLDDDIEIDLTDKT 232 (351) T ss_pred CHHHHHHHHHHhccccc-cceEEEEEChHHHHHHHhhh-hh-hhccccc----cCcccceecceEEEEcCCCccccCCCC Confidence 56889999999977531 12356889999999998753 22 2111111 1356999999999999999852 Q ss_pred -CCcEEEEEcCceeEEeeeeeeehhhcCCCce--eeeEEeeeeeeeEEecCceEEEEe---------cCCC Q lcl|NC_011288. 215 -DDEQFVAFHPSAAAYVSQIDTVEALRDQDSF--SDRIRALHVYGGKVVRPTGVVVFN---------KTGS 273 (273) Q Consensus 215 -~~~~~~~~~~~a~~~a~~~~~~e~~~~~~~~--~~~v~~~~~~g~~v~~~~~~v~~~---------~~~s 273 (273) ..+.++...++|+++..+...+|..|++... .|.++.+.+|. +-|-|+---+ ++.+ T Consensus 233 ~~~ytsyl~~~GAi~~~~~~~~ve~~rd~~~~~g~d~l~~r~~~~---~hp~G~s~~~~~~~~~~~sPt~~ 300 (351) T protein:vir:15 233 KPVSTSYIFAPGAVRYSTNMRSTETKYDPLINGGQDVIVQKRVGT---IHVAGTSIKASFSPSKASFPTID 300 (351) T ss_pred CceeEEEEEecceeeeecCCcCcceeecccCCCCceEEEEeeeee---eeeeeeeecccccccCcCCcChH Confidence 1245778889999998887778888876643 35555555544 3333333221 1111 No 64 >protein:vir:94142 Length: 304 # NCBI annotation: ORF013 # Family: family:all:507 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240234;genbank:gi:66395898;genbank:GeneID:5133311 Probab=99.54 E-value=4.7e-15 Score=99.12 Aligned_cols=257 Identities=11% Similarity=0.036 Sum_probs=163.7 Q ss_pred Cccc--------------hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeecCCCcccCCC Q lcl|NC_011288. 1 MAFN--------------NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSAD 66 (273) Q Consensus 1 MA~~--------------~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~~~~~ 66 (273) ||.. .++|+.+...+.+.+++.+++.+++..- .-.+.+++||+...........++...... T Consensus 1 ma~~~~~~~~~~~t~~gg~lip~~~~~~ii~~~~~~~~l~~~~~~~----~~~~~~~~ip~~~~~~~a~~v~E~~~~~~~ 76 (304) T protein:vir:94 1 MATPTYTPGNVILSDFKNGVIPAEQGTLIMKDIMANSAIMKLAKNE----PMTAQKKKFTYLAKGVGAYWVSETERIQTS 76 (304) T ss_pred CcccccccccccccCCCceecchhHHHHHHHHHHhccchhhhccee----eccCCceEEEEEeCCcceEEeecCcccccc Confidence 6554 3679999999999999999888877542 223567889988654444444555555556 Q ss_pred CCccceEEEEEeeeeecceEEchHHHHhhhHHHHH-HHHHHHHHHHHHHHHHHHHHHhhc-c--------c---cccccc Q lcl|NC_011288. 67 AISDTGVDLLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADLLVDN-G--------T---ALSGSA 133 (273) Q Consensus 67 ~~~~~~~~~~id~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~~D~~i~~~~~~~-~--------~---~~~~~~ 133 (273) +++.+.+++++.+. +.-+.|++.-..++..+++. +.++..+++++++|..++.--... + . ...... T Consensus 77 ~~~~~~i~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~ 155 (304) T protein:vir:94 77 KPEYAQAEMEAKKI-GVIIPLSKEFLKWTAKDFFNEVKPLIAEAFYKAFDQAVIFGTKSPYNTSTSGKPLVEGAEEKGNV 155 (304) T ss_pred cceeeEEEEEEEEE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhheeccCCCcccccccccccccccccccc Confidence 67777777777553 33456666434445556766 668888999999998886311100 0 0 001111 Q ss_pred CCCHHHHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCcccc Q lcl|NC_011288. 134 PTDADDAFDLIATALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRD 213 (273) Q Consensus 134 ~~t~~~~~~~i~~a~~~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~ 213 (273) ..+....+++|.++...+...+.. ...++++|..+..|.+- ++ . .+..+..+..++++|.+|+.++++|. T Consensus 156 ~~~~~~~~~~i~~~~~~l~~~~~~--~~~~v~~~~~~~~L~~l----kd--~--~G~~l~~~~~~~l~G~PV~~~~~~~~ 225 (304) T protein:vir:94 156 VTDTNNLYVDLSALMATIEDEELD--PNGVLTTRSFRSKMRNA----LD--A--NDRPLFDANGNEIMGLPLSYTGADVY 225 (304) T ss_pred cccccchHHHHHHHHHHhhhccCC--cCEEEEcHHHHHHHHHh----hc--c--CCcEeecCCCccccceeeEEeccccc Confidence 223345689999999888877654 33688999999998652 21 1 22345555668999999999999986 Q ss_pred CC-CcEEEEEcCceeEEeee-eeeehhhc----------CCC-----cee---eeEEeeeeeeeEEecCceEEEEecCC Q lcl|NC_011288. 214 TD-DEQFVAFHPSAAAYVSQ-IDTVEALR----------DQD-----SFS---DRIRALHVYGGKVVRPTGVVVFNKTG 272 (273) Q Consensus 214 ~~-~~~~~~~~~~a~~~a~~-~~~~e~~~----------~~~-----~~~---~~v~~~~~~g~~v~~~~~~v~~~~~~ 272 (273) .. ...++.+..+-+.+... -..++..+ +.+ .|. ..+++.+++|..+++|+++++|+.+- T Consensus 226 ~~~~~~~~~gd~~~~~~~~~~~~~i~~~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v~~~~a~~~l~~a~ 304 (304) T protein:vir:94 226 DKKKSLALMGDWDYARYGILQGIEYAISEDATLTTLQASDASGQPVSLFERDMFALRATMHIAYMNVKPEAFATLKPTE 304 (304) T ss_pred CCCCcEEEEEehhhEEEEEecceEEEEeecceeeeecccccCccchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 43 23344554333323221 11111111 111 122 56788999999999999999999999 No 65 >protein:vir:105905 Length: 304 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004375;genbank:gi:122891830;genbank:GeneID:4712376 Probab=99.54 E-value=4.7e-15 Score=99.12 Aligned_cols=257 Identities=11% Similarity=0.036 Sum_probs=163.7 Q ss_pred Cccc--------------hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeecCCCcccCCC Q lcl|NC_011288. 1 MAFN--------------NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSAD 66 (273) Q Consensus 1 MA~~--------------~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~~~~~ 66 (273) ||.. .++|+.+...+.+.+++.+++.+++..- .-.+.+++||+...........++...... T Consensus 1 ma~~~~~~~~~~~t~~gg~lip~~~~~~ii~~~~~~~~l~~~~~~~----~~~~~~~~ip~~~~~~~a~~v~E~~~~~~~ 76 (304) T protein:vir:10 1 MATPTYTPGNVILSDFKNGVIPAEQGTLIMKDIMANSAIMKLAKNE----PMTAQKKKFTYLAKGVGAYWVSETERIQTS 76 (304) T ss_pred CcccccccccccccCCCceecchhHHHHHHHHHHhccchhhhccee----eccCCceEEEEEeCCcceEEeecCcccccc Confidence 6554 3679999999999999999888877542 223567889988654444444555555556 Q ss_pred CCccceEEEEEeeeeecceEEchHHHHhhhHHHHH-HHHHHHHHHHHHHHHHHHHHHhhc-c--------c---cccccc Q lcl|NC_011288. 67 AISDTGVDLLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADLLVDN-G--------T---ALSGSA 133 (273) Q Consensus 67 ~~~~~~~~~~id~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~~D~~i~~~~~~~-~--------~---~~~~~~ 133 (273) +++.+.+++++.+. +.-+.|++.-..++..+++. +.++..+++++++|..++.--... + . ...... T Consensus 77 ~~~~~~i~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~ 155 (304) T protein:vir:10 77 KPEYAQAEMEAKKI-GVIIPLSKEFLKWTAKDFFNEVKPLIAEAFYKAFDQAVIFGTKSPYNTSTSGKPLVEGAEEKGNV 155 (304) T ss_pred cceeeEEEEEEEEE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhheeccCCCcccccccccccccccccccc Confidence 67777777777553 33456666434445556766 668888999999998886311100 0 0 001111 Q ss_pred CCCHHHHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCcccc Q lcl|NC_011288. 134 PTDADDAFDLIATALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRD 213 (273) Q Consensus 134 ~~t~~~~~~~i~~a~~~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~ 213 (273) ..+....+++|.++...+...+.. ...++++|..+..|.+- ++ . .+..+..+..++++|.+|+.++++|. T Consensus 156 ~~~~~~~~~~i~~~~~~l~~~~~~--~~~~v~~~~~~~~L~~l----kd--~--~G~~l~~~~~~~l~G~PV~~~~~~~~ 225 (304) T protein:vir:10 156 VTDTNNLYVDLSALMATIEDEELD--PNGVLTTRSFRSKMRNA----LD--A--NDRPLFDANGNEIMGLPLSYTGADVY 225 (304) T ss_pred cccccchHHHHHHHHHHhhhccCC--cCEEEEcHHHHHHHHHh----hc--c--CCcEeecCCCccccceeeEEeccccc Confidence 223345689999999888877654 33688999999998652 21 1 22345555668999999999999986 Q ss_pred CC-CcEEEEEcCceeEEeee-eeeehhhc----------CCC-----cee---eeEEeeeeeeeEEecCceEEEEecCC Q lcl|NC_011288. 214 TD-DEQFVAFHPSAAAYVSQ-IDTVEALR----------DQD-----SFS---DRIRALHVYGGKVVRPTGVVVFNKTG 272 (273) Q Consensus 214 ~~-~~~~~~~~~~a~~~a~~-~~~~e~~~----------~~~-----~~~---~~v~~~~~~g~~v~~~~~~v~~~~~~ 272 (273) .. ...++.+..+-+.+... -..++..+ +.+ .|. ..+++.+++|..+++|+++++|+.+- T Consensus 226 ~~~~~~~~~gd~~~~~~~~~~~~~i~~~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v~~~~a~~~l~~a~ 304 (304) T protein:vir:10 226 DKKKSLALMGDWDYARYGILQGIEYAISEDATLTTLQASDASGQPVSLFERDMFALRATMHIAYMNVKPEAFATLKPTE 304 (304) T ss_pred CCCCcEEEEEehhhEEEEEecceEEEEeecceeeeecccccCccchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 43 23344554333323221 11111111 111 122 56788999999999999999999999 No 66 >protein:vir:41 Length: 299 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463467;swissprot:trembl:q9t1b7;genbank:gi:16798789;uniprot:Q9T1B7;genbank:GeneID:922353 Probab=99.53 E-value=3.6e-15 Score=99.79 Aligned_cols=259 Identities=10% Similarity=0.004 Sum_probs=163.2 Q ss_pred Cccc------hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeecCCCcccCCCCCccceEE Q lcl|NC_011288. 1 MAFN------NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVD 74 (273) Q Consensus 1 MA~~------~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 74 (273) |+.. .++|+.++..+++.+++.+++.++++. ..-.|.+.++|......+.- ..++......+++.+.++ T Consensus 6 ~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~~~~~----~~~~~~~~~~~~~~~~~a~~-v~E~~~~~~~~~~f~~v~ 80 (299) T protein:vir:41 6 DTTTMQSAKTGSIPINISEQIITGVKNGSAAMKLAKA----VPMTKPEEEFTFMSGVGAFW-VDEAERIQTSKPTFTKAK 80 (299) T ss_pred CcccccCCCceecchhHHHHHHHHHHhcchhhhhcee----eecCCCcEEEEEEcCCceee-eecCccccccccceeEEE Confidence 4433 368999999999999999998888743 22346778888887655443 444555555567777777 Q ss_pred EEEeeeeecceEEchHHHHhhhHHHHH-HHHHHHHHHHHHHHHHHHH---------HHhhcccccccccCCCHHHHHHHH Q lcl|NC_011288. 75 LLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIAD---------LLVDNGTALSGSAPTDADDAFDLI 144 (273) Q Consensus 75 ~~id~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~~D~~i~~---------~~~~~~~~~~~~~~~t~~~~~~~i 144 (273) +...+ .+.-+.|++.-..++..++++ +.+..++++++++|..++. .+....... .........+++| T Consensus 81 l~~~k-~~~~~~is~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l~G~g~~~~~gil~~~~~~~--~~~~~~~~~~~~l 157 (299) T protein:vir:41 81 MRSKK-MGVIIPTTKENLNYSVTNFFSLMQAEIVEAFYKKFDQAVFTGVESPYNWNILKSATDAS--NLVEETANKYDDL 157 (299) T ss_pred EeeEE-EEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccccccccccccc--eeeccccccHHHH Confidence 77754 344566776444444566765 6788899999999998773 111111111 1111223467889 Q ss_pred HHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCccccCCCcE-EEEEc Q lcl|NC_011288. 145 ATALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQ-FVAFH 223 (273) Q Consensus 145 ~~a~~~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~~~~~-~~~~~ 223 (273) .++...|..++.+ +..++++|..+..|.+-... +.... ....+. +..+++.|++|+.++.+|..++.. ++.++ T Consensus 158 ~~~~~~l~~~~~~--~~~~v~n~~~~~~L~~lkd~--~G~~l-~~~~~~-~~~~~l~G~PV~~~~~~~~~~~~~~~~~gd 231 (299) T protein:vir:41 158 NEAIGLIEAEDLE--PNGIATIRKQRVKYRSTKDG--NGMPI-FNTATS-NGVDDVLGLPIAYTPKYTFGDKDISELVGD 231 (299) T ss_pred HHHHHhhhcccCC--cCEEEEcHHHHHHHHHhhcc--CCcee-ecCCcC-CCCceecceeeEEecccCCCCCceEEEEEe Confidence 9999888887764 34689999999998753211 00000 011122 234689999999999999655432 34444 Q ss_pred CceeEEeee-eeeehhhcCC--------C-----ce---eeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 224 PSAAAYVSQ-IDTVEALRDQ--------D-----SF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 224 ~~a~~~a~~-~~~~e~~~~~--------~-----~~---~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) -+-+.+... -..++..++. . .| ...++..+++|.++.+|+++++++..++ T Consensus 232 fs~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~A~~~l~~~aa 298 (299) T protein:vir:41 232 WNQAYYGILRGVEYEILTEATLTTVADETGKPLNLAERDMAAIKATFEVGFMVVKDEAFSAVQPKAG 298 (299) T ss_pred cccEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeccC Confidence 333323222 1223322221 1 11 2467888999999999999999999999 No 67 >protein:vir:9309 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803287;genbank:gi:29028597;genbank:GeneID:1258044 Probab=99.51 E-value=7.8e-15 Score=97.92 Aligned_cols=258 Identities=10% Similarity=-0.005 Sum_probs=162.6 Q ss_pred Cccc---hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeecCCCcccCCCCCccceEEEEE Q lcl|NC_011288. 1 MAFN---NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLLI 77 (273) Q Consensus 1 MA~~---~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i 77 (273) |+.+ .++|+.|...+.+.+++.+++.+++..- ...|.+++||+...........++......+++.+.++++. T Consensus 30 ~~~~~~~~liP~~~~~~ii~~~~~~s~l~~l~~~~----~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~f~~i~~~~ 105 (324) T protein:vir:93 30 MMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYE----PMEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRA 105 (324) T ss_pred cccCCCcceechhHHHHHHHHHHhhchhhhhccee----eccCCceEEEEEecCcceeeecCCccccccccceeEEEEEe Confidence 3222 3789999999999999999988877431 23366788998754444444555666666667777777777 Q ss_pred eeeeecceEEchHHHHhhhHHHHH-HHHHHHHHHHHHHHHHHHHHHhhcccc--c-----ccccCCCHHHHHHHHHHHHH Q lcl|NC_011288. 78 DQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADLLVDNGTA--L-----SGSAPTDADDAFDLIATALK 149 (273) Q Consensus 78 d~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~~D~~i~~~~~~~~~~--~-----~~~~~~t~~~~~~~i~~a~~ 149 (273) .+ .+.-+.|++.-..++..++.. +.++.++++++++|..++.--.+.... . ..........++++|.++.. T Consensus 106 ~k-~~~~~~iS~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~ 184 (324) T protein:vir:93 106 FK-LGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLEA 184 (324) T ss_pred EE-EEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCccccccccccceeccccccHHHHHHHHH Confidence 54 344567776444445556655 667888999999999876321111000 0 00111122345888999998 Q ss_pred HHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCccccCCCcEEEEEcCceeEE Q lcl|NC_011288. 150 ELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAY 229 (273) Q Consensus 150 ~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~~~~~~~~~~~~a~~~ 229 (273) .+..++.. ...++++|..+..|.+. .+ ..| ...+..+.-+++.|.+|+.++..+.+.+ .++.+..+.+.+ T Consensus 185 ~l~~~~~~--~~~~v~n~~~~~~L~~l----~d--~~G-~~~~~~~~~~~l~G~PVv~~~~~~~~~~-~i~~gdfs~~~~ 254 (324) T protein:vir:93 185 LLEDDELE--ANAFISKTQNRSLLRKI----VD--PET-KERIYDRNSDSLDGLPVVNLKSSNLKRG-ELITGDFDKLIY 254 (324) T ss_pred hhhhccCC--CCEEEEcHHHHHHHHHh----hC--CCC-CeeecCCCCCcccceeeEeecCCCCCcc-eEEEEecceEEE Confidence 88887653 34689999999988653 21 122 2334456667899999999887665444 455555444433 Q ss_pred eee-eeeehhhcCC-------------Cce---eeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 230 VSQ-IDTVEALRDQ-------------DSF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 230 a~~-~~~~e~~~~~-------------~~~---~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) ... ...++..++. +.| ...+++.+++|..+++|+++++|+...- T Consensus 255 ~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~n~~~~r~~~r~d~~v~~~~a~~~l~~a~~ 315 (324) T protein:vir:93 255 GIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADK 315 (324) T ss_pred EEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEecccc Confidence 322 1223222221 112 3688999999999999999999974333 No 68 >protein:vir:96223 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239571;genbank:gi:66395304;genbank:GeneID:5132771 Probab=99.51 E-value=7.8e-15 Score=97.94 Aligned_cols=258 Identities=10% Similarity=-0.008 Sum_probs=162.7 Q ss_pred Cccc---hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeecCCCcccCCCCCccceEEEEE Q lcl|NC_011288. 1 MAFN---NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLLI 77 (273) Q Consensus 1 MA~~---~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i 77 (273) |+.+ .++|+.|+.++++.+++.+++.+++.+- ...|.+++||+....+......++......+++.+.+++.. T Consensus 30 ~~~~~~~~lip~~~~~~ii~~~~~~s~l~~l~~~~----~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~f~~v~~~~ 105 (324) T protein:vir:96 30 MMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYE----PMEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRA 105 (324) T ss_pred cccCCCcceechhHHHHHHHHHHhhchhhhhccee----eccCCceEEEEEecCcceeeecCCccccccccceeEEEEEe Confidence 4322 3789999999999999999888877542 23366789998754444455556666666667777777777 Q ss_pred eeeeecceEEchHHHHhhhHHHHH-HHHHHHHHHHHHHHHHHHHHHhhcccc-------cccccCCCHHHHHHHHHHHHH Q lcl|NC_011288. 78 DQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADLLVDNGTA-------LSGSAPTDADDAFDLIATALK 149 (273) Q Consensus 78 d~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~~D~~i~~~~~~~~~~-------~~~~~~~t~~~~~~~i~~a~~ 149 (273) .+. +.-+.|++.-..++..++.. +.+++++++++++|..++.--.+.... ...........++++|.++.. T Consensus 106 ~k~-~~~~~is~ell~ds~~~l~~~i~~~l~~aia~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~ 184 (324) T protein:vir:96 106 FKL-GVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIKKTNKVIKGDFTQDNIIDLEA 184 (324) T ss_pred EEE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCCcCccccccccccceecccccchHHHHHHHH Confidence 553 34466776444444556655 678889999999999877321111000 000111122345788999988 Q ss_pred HHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCccccCCCcEEEEEcCceeEE Q lcl|NC_011288. 150 ELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAY 229 (273) Q Consensus 150 ~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~~~~~~~~~~~~a~~~ 229 (273) .+..+... ...++++|..+..|.+. ++ .. +...+..|.-+++.|++|+.++..+.+.+ .++.++.+.+.+ T Consensus 185 ~i~~~~~~--~~~~i~n~~~~~~L~~l----kd--~~-G~~~~~~~~~~~l~G~PV~~~~~~~~~~~-~~~~gd~s~~~~ 254 (324) T protein:vir:96 185 LLEDDELE--ANAFISKTQNRSLLRKI----VD--PE-TKERIYDRNSDSLDGLPVVNLKSSNLKRG-ELITGDFDKLIY 254 (324) T ss_pred hhhhccCC--CCEEEEcHHHHHHHHHh----hC--CC-CCeeecCCCCCcccceeeEeecCCCCCcc-eEEEEecceEEE Confidence 88777653 33689999999988653 21 11 12234456667899999999887765544 455555444433 Q ss_pred eee-eeeehhhcC--------C-----Cce---eeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 230 VSQ-IDTVEALRD--------Q-----DSF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 230 a~~-~~~~e~~~~--------~-----~~~---~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) ... ...++..+. . +.| ...+++.+++|.++.+|++++.|+...- T Consensus 255 ~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~n~v~~r~~~r~d~~v~~~~a~~~l~~a~~ 315 (324) T protein:vir:96 255 GIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADK 315 (324) T ss_pred EEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEecccc Confidence 321 112222221 1 112 2578899999999999999999985433 No 69 >protein:vir:97148 Length: 324 # NCBI annotation: ORF010 # Family: family:all:507 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239726;genbank:gi:66394880;genbank:GeneID:5130881 Probab=99.50 E-value=1.1e-14 Score=97.02 Aligned_cols=258 Identities=10% Similarity=-0.024 Sum_probs=162.7 Q ss_pred Cccc--hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeecCCCcccCCCCCccceEEEEEe Q lcl|NC_011288. 1 MAFN--NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLLID 78 (273) Q Consensus 1 MA~~--~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~id 78 (273) ++.+ .++|+.|...+++.+++.+++.+++.+ ....|.++++|+....+......++......+...+.++++.. T Consensus 31 ~~~~~~~~iP~~~~~~ii~~~~~~s~l~~~~~~----~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~f~~v~~~~~ 106 (324) T protein:vir:97 31 MHEKKDGTLMNEFTTPILQEVMENSKIMQLGKY----EPMEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAF 106 (324) T ss_pred ccCCCcceechhHHHHHHHHHHhhcchhhhcce----eeccCCceEEEEEecCcceeEeccCccccccccceeEEEEeeE Confidence 2222 478999999999999999998888743 1234678999998655444555566666666677777777775 Q ss_pred eeeecceEEchHHHHhhhHHHHH-HHHHHHHHHHHHHHHHHHHHHhhcccc-------cccccCCCHHHHHHHHHHHHHH Q lcl|NC_011288. 79 QEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADLLVDNGTA-------LSGSAPTDADDAFDLIATALKE 150 (273) Q Consensus 79 ~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~~D~~i~~~~~~~~~~-------~~~~~~~t~~~~~~~i~~a~~~ 150 (273) +. +.-+.|++.-..+...++.. +.++.++++++++|+.++.--...... ............+++|.++... T Consensus 107 k~-~~~~~is~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l~G~g~~~~~~gi~~~~~~~~~~~~~~~~~~~i~~~~~~ 185 (324) T protein:vir:97 107 KL-GVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLEAL 185 (324) T ss_pred EE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccCccccccccccceeccccCCHHHHHHHHHh Confidence 43 44456666433444556655 668888999999999887422111100 0001111223458889999988 Q ss_pred HhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCccccCCCcEEEEEcCceeEEe Q lcl|NC_011288. 151 LTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYV 230 (273) Q Consensus 151 l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~~~~~~~~~~~~a~~~a 230 (273) +...+.. ...++++|..+..|.+. .+ ..+ ...+..+.-+.+.|++|+.++..+.+.+ .++.+..+.+... T Consensus 186 l~~~~~~--~~~~v~n~~~~~~L~~l----kd--~~g-~~~~~~~~~~tl~G~PV~~~~~~~~~~~-~~~~gd~~~~~i~ 255 (324) T protein:vir:97 186 LEDDELE--ANAFISKTQNRSLLRKI----VD--PET-KERIYDRNSDTLDGLPVVNLKSSNLKRG-ELITGDFDKLIYG 255 (324) T ss_pred hhhccCC--CCEEEEcHHHHHHHHHh----hc--CCC-ceeecCCCCccccceeeEeecCCCCCcc-eEEEEecccEEEE Confidence 8877653 33679999999988653 21 111 2233345557899999999887766544 3445554433333 Q ss_pred eee-eeehhhcCC--------C-----ce---eeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 231 SQI-DTVEALRDQ--------D-----SF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 231 ~~~-~~~e~~~~~--------~-----~~---~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) ... ..++..++. + .| ...+++.+++|.++.+|+++++|+...- T Consensus 256 ~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~a~~~l~~~~~ 315 (324) T protein:vir:97 256 IPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADK 315 (324) T ss_pred EecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeccC Confidence 322 223322221 1 12 2578888999999999999998876444 No 70 >protein:vir:99749 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004307;genbank:gi:122891761;genbank:GeneID:4712304 Probab=99.49 E-value=1.6e-14 Score=96.24 Aligned_cols=258 Identities=10% Similarity=-0.011 Sum_probs=163.0 Q ss_pred Cccc---hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeecCCCcccCCCCCccceEEEEE Q lcl|NC_011288. 1 MAFN---NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLLI 77 (273) Q Consensus 1 MA~~---~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i 77 (273) |+.+ .++|+.|...+++.+++.+++.+++..- ...+.+++||+....+......++......++..+.++++. T Consensus 30 ~~~~~~~~lip~~~~~~ii~~~~~~s~l~~~~~~~----~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~ 105 (324) T protein:vir:99 30 MMHEKKDGTLLNDFTTPILQEVMENSKIMRLGKYE----PMEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRA 105 (324) T ss_pred eccCCCcceechhHHHHHHHHHHhhchhhhhccee----eccCCceEEEEEecCcceeEeccCccccccccceeEEEEee Confidence 3333 3789999999999999999888877431 22366789999765444455556666666667777777777 Q ss_pred eeeeecceEEchHHHHhhhHHHHH-HHHHHHHHHHHHHHHHHHHHHhhcccc-------cccccCCCHHHHHHHHHHHHH Q lcl|NC_011288. 78 DQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADLLVDNGTA-------LSGSAPTDADDAFDLIATALK 149 (273) Q Consensus 78 d~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~~D~~i~~~~~~~~~~-------~~~~~~~t~~~~~~~i~~a~~ 149 (273) .+. +.-+.|++.-..++..++.. +.+++++++++++|..++.--...... ............+++|.++.. T Consensus 106 ~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~ 184 (324) T protein:vir:99 106 FKL-GVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLEA 184 (324) T ss_pred EEE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCccCccccccccccceeccccCCHHHHHHHHH Confidence 543 44456666444444556654 678899999999999887321111100 001111223445888999999 Q ss_pred HHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCccccCCCcEEEEEcCceeEE Q lcl|NC_011288. 150 ELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAY 229 (273) Q Consensus 150 ~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~~~~~~~~~~~~a~~~ 229 (273) .|...+... -.++++|..+..|.+. .+ ..+ ...+..+.-+++.|.+|+.++..+.+.+ .++.++.+.+.+ T Consensus 185 ~l~~~~~~~--~~~v~n~~~~~~L~~l----~d--~~g-~~~~~~~~~~~l~G~PVv~~~~~~~~~~-~~i~gd~~~~~~ 254 (324) T protein:vir:99 185 LLEDDELEA--NAFISKTQNRSLLRKI----VD--PET-KERIYDRNSDTLDGLPVVNLKSSNLKRG-ELITGDFDKLIY 254 (324) T ss_pred hhhhccCCC--CEEEEcHHHHHHHHHh----hc--CCC-ceeecCCCCccccceeEEeecCCCCCcc-eEEEEecccEEE Confidence 998776532 3578999999988653 21 111 2233344457899999999988776544 455555444433 Q ss_pred eee-eeeehhhcCC--------C-----ce---eeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 230 VSQ-IDTVEALRDQ--------D-----SF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 230 a~~-~~~~e~~~~~--------~-----~~---~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) ... -..++..++. + .| ...+++.+++|..+.+|+++++|+...- T Consensus 255 ~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~lt~a~~ 315 (324) T protein:vir:99 255 GIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADK 315 (324) T ss_pred EEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEEeccC Confidence 321 1223222221 1 12 3578888999999999999998876544 No 71 >protein:vir:78830 Length: 324 # NCBI annotation: major head protein # Family: family:all:507 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285361;genbank:gi:148717889;genbank:GeneID:5246961 Probab=99.48 E-value=1.9e-14 Score=95.82 Aligned_cols=258 Identities=10% Similarity=-0.004 Sum_probs=164.1 Q ss_pred Cccc---hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeecCCCcccCCCCCccceEEEEE Q lcl|NC_011288. 1 MAFN---NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLLI 77 (273) Q Consensus 1 MA~~---~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i 77 (273) |... .++|+-|...+++.+++.+++.+++.+ ....|.+++||+....+......++...+..+++.+.++++. T Consensus 30 ~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~l~~~----~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~ 105 (324) T protein:vir:78 30 MMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKY----EPMEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRA 105 (324) T ss_pred cccCcCccccchhHHHHHHHHHHhhchhhhhcce----eeccCCceEEEEEecCcceeEecCCccccccccceeEEEEee Confidence 3222 478999999999999999998888754 223467788999765544555566666666677777777777 Q ss_pred eeeeecceEEchHHHHhhhHHHHH-HHHHHHHHHHHHHHHHHHHHHhhcccc--c-----ccccCCCHHHHHHHHHHHHH Q lcl|NC_011288. 78 DQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADLLVDNGTA--L-----SGSAPTDADDAFDLIATALK 149 (273) Q Consensus 78 d~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~~D~~i~~~~~~~~~~--~-----~~~~~~t~~~~~~~i~~a~~ 149 (273) .+. +.-+.|++.-..++..++.. +.++.++++++++|..++.--...... . ...........+++|.++.. T Consensus 106 ~k~-~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~l~G~g~~~~~~gi~~~~~~~~~~~~~~~t~~~i~~~~~ 184 (324) T protein:vir:78 106 FKL-GVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLEA 184 (324) T ss_pred EEE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCcCccccccccccceeccccccHHHHHHHHH Confidence 542 44456666434445556755 668888999999999877321111100 0 00111223346889999998 Q ss_pred HHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCccccCCCcEEEEEcCceeEE Q lcl|NC_011288. 150 ELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAY 229 (273) Q Consensus 150 ~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~~~~~~~~~~~~a~~~ 229 (273) .|...... ...++++|..+..|.+. .+ .. +...+..|.-++++|++|+.++..+.+.+ .++.++.+.+.+ T Consensus 185 ~l~~~~~~--~~~~vmn~~~~~~L~~l----~d--~~-G~~~~~~~~~~~l~G~PV~~~~~~~~~~~-~~~~gd~~~~~~ 254 (324) T protein:vir:78 185 LLEDDELE--ANAFISKTQNRSLLRKI----VD--PE-TKERIYDRNSDSLDGLPVVNLKSSNLKRG-ELITGDFDKLIY 254 (324) T ss_pred hhhhccCC--CCEEEEcHHHHHHHHHh----hc--cC-CCeeecCCCCCcccceeeEeeCCCCCCcc-eEEEEecceEEE Confidence 88877653 34689999999988653 21 11 22334456677899999999887765444 355555443333 Q ss_pred ee-eeeeehhhcCC-------------Cce---eeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 230 VS-QIDTVEALRDQ-------------DSF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 230 a~-~~~~~e~~~~~-------------~~~---~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) .. +...++..++. ..| ...+++.+++|..+.+|+++++|+...- T Consensus 255 g~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~A~~~l~~a~~ 315 (324) T protein:vir:78 255 GIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADK 315 (324) T ss_pred EEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEecccc Confidence 32 11222222211 112 3578889999999999999999985333 No 72 >protein:vir:96392 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239648;genbank:gi:66395381;genbank:GeneID:5132868 Probab=99.48 E-value=1.9e-14 Score=95.82 Aligned_cols=258 Identities=10% Similarity=-0.004 Sum_probs=164.1 Q ss_pred Cccc---hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeecCCCcccCCCCCccceEEEEE Q lcl|NC_011288. 1 MAFN---NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLLI 77 (273) Q Consensus 1 MA~~---~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i 77 (273) |... .++|+-|...+++.+++.+++.+++.+ ....|.+++||+....+......++...+..+++.+.++++. T Consensus 30 ~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~l~~~----~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~ 105 (324) T protein:vir:96 30 MMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKY----EPMEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRA 105 (324) T ss_pred cccCcCccccchhHHHHHHHHHHhhchhhhhcce----eeccCCceEEEEEecCcceeEecCCccccccccceeEEEEee Confidence 3222 478999999999999999998888754 223467788999765544555566666666677777777777 Q ss_pred eeeeecceEEchHHHHhhhHHHHH-HHHHHHHHHHHHHHHHHHHHHhhcccc--c-----ccccCCCHHHHHHHHHHHHH Q lcl|NC_011288. 78 DQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADLLVDNGTA--L-----SGSAPTDADDAFDLIATALK 149 (273) Q Consensus 78 d~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~~D~~i~~~~~~~~~~--~-----~~~~~~t~~~~~~~i~~a~~ 149 (273) .+. +.-+.|++.-..++..++.. +.++.++++++++|..++.--...... . ...........+++|.++.. T Consensus 106 ~k~-~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~l~G~g~~~~~~gi~~~~~~~~~~~~~~~t~~~i~~~~~ 184 (324) T protein:vir:96 106 FKL-GVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLEA 184 (324) T ss_pred EEE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCcCccccccccccceeccccccHHHHHHHHH Confidence 542 44456666434445556755 668888999999999877321111100 0 00111223346889999998 Q ss_pred HHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCccccCCCcEEEEEcCceeEE Q lcl|NC_011288. 150 ELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAY 229 (273) Q Consensus 150 ~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~~~~~~~~~~~~a~~~ 229 (273) .|...... ...++++|..+..|.+. .+ .. +...+..|.-++++|++|+.++..+.+.+ .++.++.+.+.+ T Consensus 185 ~l~~~~~~--~~~~vmn~~~~~~L~~l----~d--~~-G~~~~~~~~~~~l~G~PV~~~~~~~~~~~-~~~~gd~~~~~~ 254 (324) T protein:vir:96 185 LLEDDELE--ANAFISKTQNRSLLRKI----VD--PE-TKERIYDRNSDSLDGLPVVNLKSSNLKRG-ELITGDFDKLIY 254 (324) T ss_pred hhhhccCC--CCEEEEcHHHHHHHHHh----hc--cC-CCeeecCCCCCcccceeeEeeCCCCCCcc-eEEEEecceEEE Confidence 88877653 34689999999988653 21 11 22334456677899999999887765444 355555443333 Q ss_pred ee-eeeeehhhcCC-------------Cce---eeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 230 VS-QIDTVEALRDQ-------------DSF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 230 a~-~~~~~e~~~~~-------------~~~---~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) .. +...++..++. ..| ...+++.+++|..+.+|+++++|+...- T Consensus 255 g~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~A~~~l~~a~~ 315 (324) T protein:vir:96 255 GIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADK 315 (324) T ss_pred EEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEecccc Confidence 32 11222222211 112 3578889999999999999999985333 No 73 >protein:vir:94771 Length: 298 # NCBI annotation: major head protein # Family: family:all:966 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996706;genbank:gi:45597421;genbank:GeneID:2769044 Probab=99.48 E-value=3.2e-14 Score=94.54 Aligned_cols=262 Identities=13% Similarity=0.048 Sum_probs=161.5 Q ss_pred Cccc--hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeecCCCcccCCCCCccceEEEEEe Q lcl|NC_011288. 1 MAFN--NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLLID 78 (273) Q Consensus 1 MA~~--~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~id 78 (273) ||.+ .++|+.+..++++.+++.+++..++..- ...+.+++||+....+......++......++..+.+++... T Consensus 1 ma~~gG~lip~~~~~~ii~~~~~~s~i~~~~~~~----~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~f~~v~l~~~ 76 (298) T protein:vir:94 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQK----PIPFNGEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQTMVPI 76 (298) T ss_pred CeeccccccChhHHHHHHHHHHhhchhhhhccee----eccCCceEEEEEecCcceEEeeCCccccccccceeEEEEeee Confidence 9999 5789999999999999998887776431 123456789987544444455566656556677777777764 Q ss_pred eeeecceEEchHHHHhh---hHHHH-HHHHHHHHHHHHHHHHHHHHHHhh-ccc--------------ccccccCCCHHH Q lcl|NC_011288. 79 QEKSIDFLVDDIDRVQV---AGSLE-AYTRAGATALATDTDKFIADLLVD-NGT--------------ALSGSAPTDADD 139 (273) Q Consensus 79 ~~~~~~~~i~d~d~~~~---~~~~~-~~~~~~~~ala~~~D~~i~~~~~~-~~~--------------~~~~~~~~t~~~ 139 (273) +. +.-+.|++.-..+. ..++. .+.++.++++++++|..++.-... .+. ............ T Consensus 77 k~-~~~~~iS~ell~~~~~~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 155 (298) T protein:vir:94 77 KV-EYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIAD 155 (298) T ss_pred EE-EEeeehhHHHhccCCccHHHHHHHHHHHHHHHHHHHHHHHhhcccccCCCccccccccccccccccccccccccccc Confidence 43 34456665432222 23444 467888999999999888743110 000 000011222344 Q ss_pred HHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCccccCCC--- Q lcl|NC_011288. 140 AFDLIATALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDD--- 216 (273) Q Consensus 140 ~~~~i~~a~~~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~~~--- 216 (273) .+++|.++...+..++.. ....+++|..+..|.+-..... .....+....|.-+++.|++|+.++.+|...+ T Consensus 156 ~~~~i~~~~~~~~~~~~~--~~~~vmn~~~~~~l~~lkd~~G---~~l~~~~~~~~~~~tl~G~PV~~~~~v~~~~~~~~ 230 (298) T protein:vir:94 156 PNGAIENAVELLTGVDAD--VTGIAINPSFRSALAKQKDLQG---NALFPELKWGATPDTINGLPVDVNKTVSDMSLTQR 230 (298) T ss_pred HHHHHHHHHHhhhhcCCC--ccEEEEcHHHHHHHHHhhccCC---CeeecCcccCCCCceecceeeEEecccccccCCCc Confidence 678899999999887764 3469999999999865321110 11111223346667899999999999985432 Q ss_pred cEEEEEcC-ceeEEee-eeeeehhhc--CCC-----cee---eeEEeeeeeeeEEecCceEEEEecCC Q lcl|NC_011288. 217 EQFVAFHP-SAAAYVS-QIDTVEALR--DQD-----SFS---DRIRALHVYGGKVVRPTGVVVFNKTG 272 (273) Q Consensus 217 ~~~~~~~~-~a~~~a~-~~~~~e~~~--~~~-----~~~---~~v~~~~~~g~~v~~~~~~v~~~~~~ 272 (273) ..++.|.- .++.+.. +...++..+ +++ .|. ..+++.+++|..+.+|++++.|+..- T Consensus 231 ~~~~~Gdfs~~~~~~~~~~~~~~~~~~~~~d~~~~~~f~~~~v~~r~~~r~~~~~~~~~a~~~l~~~t 298 (298) T protein:vir:94 231 DRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) T ss_pred cEEEEeeccceEEEEEecCceEEEeecCCCcCcchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 34555543 2332322 122222211 111 122 36888999999999999999996555 No 74 >protein:vir:1638 Length: 298 # NCBI annotation: Structural protein # Family: family:all:966 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695059;genbank:gi:23455750;genbank:GeneID:955469 Probab=99.47 E-value=3.1e-14 Score=94.62 Aligned_cols=262 Identities=12% Similarity=0.049 Sum_probs=159.9 Q ss_pred Cccc--hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeecCCCcccCCCCCccceEEEEEe Q lcl|NC_011288. 1 MAFN--NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLLID 78 (273) Q Consensus 1 MA~~--~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~id 78 (273) ||.+ .++|+-+..++++.+++.+++.+++.+- .-.+..++||+....+......++......++.-+.+++... T Consensus 1 ma~~gG~lvp~~~~~~ii~~~~~~s~i~~l~~~~----~~~~~~~~ip~~~~~~~a~~v~E~~~~~~~~~~f~~v~l~~~ 76 (298) T protein:vir:16 1 MVLNKGTLFDPTLVTDLISKVAGKSSIARLSAQK----PIPFNGEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQTMVPI 76 (298) T ss_pred CcccCcceechhHHHHHHHHHHhhhhhhhhccee----eccCCceEEEEEecCcceEEecCCccccccccceeEEEEeee Confidence 9988 5777778889999999988888877432 123455788887655445555666666556666666666664 Q ss_pred eeeecceEEchHHHHhh---hHHHH-HHHHHHHHHHHHHHHHHHHHHHh---hcccc------------cccccCCCHHH Q lcl|NC_011288. 79 QEKSIDFLVDDIDRVQV---AGSLE-AYTRAGATALATDTDKFIADLLV---DNGTA------------LSGSAPTDADD 139 (273) Q Consensus 79 ~~~~~~~~i~d~d~~~~---~~~~~-~~~~~~~~ala~~~D~~i~~~~~---~~~~~------------~~~~~~~t~~~ 139 (273) +. +.-+.|+++-..++ ..++. .+.+++++++++++|..++.-.. ..+.. ........... T Consensus 77 k~-a~~~~iS~ell~~s~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 155 (298) T protein:vir:16 77 KV-EYGARISDEFMYASDEEKINILQEFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIAD 155 (298) T ss_pred eE-EEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHHhhccccCCCCccccccccccccccccccccccccccc Confidence 43 33355655433222 23454 46788999999999999874311 00000 00011112234 Q ss_pred HHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCccccCC---C Q lcl|NC_011288. 140 AFDLIATALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTD---D 216 (273) Q Consensus 140 ~~~~i~~a~~~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~~---~ 216 (273) .+++|.++...+..++.+. ...+++|..+..|.+...... .....+....|.-+++.|.+|+.++.+|... . T Consensus 156 ~~~~i~~~~~~~~~~~~~~--~~~vmn~~~~~~l~~lkd~~G---~~i~~~~~~~~~~~~l~G~PV~~~~~v~~~~~~~~ 230 (298) T protein:vir:16 156 PNGAIENAVELLTGVDADV--TGIAINPSFRSALAKQKDLQD---NALFPELKWGATPDTINGLPVDVNKTVSDMSLTQR 230 (298) T ss_pred HHHHHHHHHHHhhhcCCCc--cEEEEcHHHHHHHHHhhccCC---CeeecCcccCCCCceecceeeEEecccccccCCCc Confidence 5778889988888877643 358899999998866321111 1111123345666899999999999998532 2 Q ss_pred cEEEEEc-CceeEEee-eeeeehhhcC--CC-----cee---eeEEeeeeeeeEEecCceEEEEecCC Q lcl|NC_011288. 217 EQFVAFH-PSAAAYVS-QIDTVEALRD--QD-----SFS---DRIRALHVYGGKVVRPTGVVVFNKTG 272 (273) Q Consensus 217 ~~~~~~~-~~a~~~a~-~~~~~e~~~~--~~-----~~~---~~v~~~~~~g~~v~~~~~~v~~~~~~ 272 (273) ..++.|. +.++.+.. +..+++..+. ++ .|. ..+++.+++|.++++|++++.|+..- T Consensus 231 ~~~~~GDfs~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~ra~~r~d~~v~~~~a~~~l~~at 298 (298) T protein:vir:16 231 DRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) T ss_pred cEEEEeeccceEEEEEecCceEEEeeccCCcCcchhhhhcCcEEEEEEEEEccEeecccceEEEeecC Confidence 3355553 33333322 2222222221 11 122 46888999999999999999986555 No 75 >protein:vir:104085 Length: 320 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655596;genbank:gi:109392467;genbank:GeneID:4156953 Probab=99.47 E-value=3.6e-14 Score=94.32 Aligned_cols=262 Identities=9% Similarity=0.016 Sum_probs=153.8 Q ss_pred Cccc------hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeecCCCcccCCCCCccceEE Q lcl|NC_011288. 1 MAFN------NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVD 74 (273) Q Consensus 1 MA~~------~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 74 (273) |+.+ -++|+.|..++++.+++.+++.+++.+- ...+.+++||+...........++......+++.+.++ T Consensus 14 ~~~t~~~~~~~~ip~~~~~~ii~~~~~~s~l~~~~~~~----~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~f~~v~ 89 (320) T protein:vir:10 14 IAQTGDTMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKV----PMGTTGQKIPHWIGDVSAQWIGEGDMKPITKGNMTSQN 89 (320) T ss_pred hhccccccccccccHHHHHHHHHHHHhccchhhhccee----eccCCceEEEEEeCCcceEEecCCccccccccceeEEE Confidence 4443 2678889999999999999888876532 22366789998765544555566666666667777777 Q ss_pred EEEeeeeecceEEchHHHHhhhHHHHH-HHHHHHHHHHHHHHHHHHHHHhh--------ccccc--ccccCCCHHH---H Q lcl|NC_011288. 75 LLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADLLVD--------NGTAL--SGSAPTDADD---A 140 (273) Q Consensus 75 ~~id~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~~D~~i~~~~~~--------~~~~~--~~~~~~t~~~---~ 140 (273) ++..+ .+.-+.|++.-..++..++++ +.++..+++++++|+.++.--.. ..... ......+.+. . T Consensus 90 ~~~~k-~~~~~~is~ell~ds~~~l~~~i~~~l~~a~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 168 (320) T protein:vir:10 90 IAPHK-IATIFVASAETVRANPANYLGTMRTKVATAFAMAFDSAALNGTDSPFPTYLAQTTKSVSLADPGGATASDLTAY 168 (320) T ss_pred EeeEE-EEEeehhhHHHHhcChHHHHHHHHHHHHHHHHHHHHHHhhcccCCCCCcccccccccccceecccccccccccH Confidence 77644 344466666544445566765 66788899999999998631100 00000 0011111111 1 Q ss_pred HHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHHHhh-----hhcccccceeeeeeeeeEeceEEEeeCccccCC Q lcl|NC_011288. 141 FDLIATALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTS-----ADTSGDAAGLRAGTIGNLLGARIVESNNLRDTD 215 (273) Q Consensus 141 ~~~i~~a~~~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~-----~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~~ 215 (273) .+.+.++...+..... +...++++|..+..|.+-.....+ ....+.... ..-+++.|++|+.++++|... T Consensus 169 ~~~~~~~~~~~~~~~~--~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~---~~~~~i~g~pv~~~~~~~~~~ 243 (320) T protein:vir:10 169 DAVAVNGLSLLVNAKK--KWTHTLLDDIVEPILNGAKDKNGRPLFIESTYTDENSP---FRAGRIVSRPTILSDHVADGT 243 (320) T ss_pred HHHHHHHHhhhhcccC--CCcEEEEcHHHHHHHHHhhccCCceeeccccccCcccc---ccCceeeeeeeEecCCCCCCc Confidence 2245556666665543 355889999999999653211100 000111111 112578999999999997643 Q ss_pred CcEEEEEcCceeEEeee-eeeehhhcC--------C-----Cce---eeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 216 DEQFVAFHPSAAAYVSQ-IDTVEALRD--------Q-----DSF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 216 ~~~~~~~~~~a~~~a~~-~~~~e~~~~--------~-----~~~---~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) . .++.++.+-+.++.. -..++..++ . ..| ...+++.+++|.++++|+++++|+..++ T Consensus 244 ~-~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~~~d~~v~~~~a~~~l~~~~a 317 (320) T protein:vir:10 244 T-VGYMGDFRNVIWGQVGGLSFDVTDQATLNLGTPTEPNFVSLWQHNLVAVRVEAEYAFHNNDKDAFVKLTNVVT 317 (320) T ss_pred e-EEEEeecceEEEEEecCeEEEEeecceeeeccccccccchhhhcCcEEEEEEEeeccEEecccceEEEEeccC Confidence 2 234444333323221 112222211 1 112 2468888999999999999999986666 No 76 >protein:vir:9759 Length: 303 # NCBI annotation: putative structural protein # Family: family:all:966 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795521;genbank:gi:28876283;genbank:GeneID:1257824 Probab=99.46 E-value=2.3e-14 Score=95.34 Aligned_cols=264 Identities=13% Similarity=0.018 Sum_probs=159.6 Q ss_pred Cccc----hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeecCCCcccCCCCCccceEEEE Q lcl|NC_011288. 1 MAFN----NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLL 76 (273) Q Consensus 1 MA~~----~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 76 (273) ||.. .++|+.++..+++.+++.+++.+++.+- .-.+.++++|+...........++......++.-+.+++. T Consensus 1 m~t~t~gg~liP~~~~~~ii~~l~~~s~i~~l~~~~----~~~~~~~~ip~~~~~~~a~wv~E~~~~~~s~~~f~~v~l~ 76 (303) T protein:vir:97 1 MGTETSKASLFDKHLVSDLINKVKGHSSLAKLSSQK----PIPFNGSKEFTFTLDSDIDVVAENGKKTHGGLSLEPVTIV 76 (303) T ss_pred CcccCCCCeEcchhHHHHHHHHHHhhchhhhhccee----ecCCCceEEEEEecCcceEEeecCccccccccceeeEEee Confidence 9986 5789999999999999999988887542 2235678899975554555666666666666666677766 Q ss_pred EeeeeecceEEchHHHHh---hhHHH-HHHHHHHHHHHHHHHHHHHHHHHhhcc------cc--------cccccCCCHH Q lcl|NC_011288. 77 IDQEKSIDFLVDDIDRVQ---VAGSL-EAYTRAGATALATDTDKFIADLLVDNG------TA--------LSGSAPTDAD 138 (273) Q Consensus 77 id~~~~~~~~i~d~d~~~---~~~~~-~~~~~~~~~ala~~~D~~i~~~~~~~~------~~--------~~~~~~~t~~ 138 (273) ..+ .+.-+.++++=..+ ...++ +.+.+++++++++++|..++.-..... .. ......++.. T Consensus 77 ~~k-l~~~~~iS~ell~~~~d~~~~l~~~i~~~la~a~~~~ld~a~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~ 155 (303) T protein:vir:97 77 PIK-VEYGARLSDEFLYATEEEKIDILKAFNEGFAKKLARGIDLMAMHGINPRTKKASDVIGTNHFDSKVTQVVKFTESE 155 (303) T ss_pred eEE-EEEeehhhHHHhhcCccchHHHHHHHHHHHHHHHHHHHHhhhhcccccCCcccccccccccccccccccccccccc Confidence 633 23344555532211 12344 446788899999999998874321000 00 0001122334 Q ss_pred HHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCccccCCC-- Q lcl|NC_011288. 139 DAFDLIATALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDD-- 216 (273) Q Consensus 139 ~~~~~i~~a~~~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~~~-- 216 (273) ..+++|.++...+...+.. ....+++|..+..|++......+ ... ....-..+..+++.|++++.|+++|.... T Consensus 156 ~~~~~i~~~~~~~~~~~~~--~~~~vmn~~~~~~L~~lkd~~g~-~~~-~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~ 231 (303) T protein:vir:97 156 DADANIEAAVNLIQGAEGV--VTGLAMDTEFSTALAKVTNGEMG-PKM-YPELAWGANPDSINGLKSSVNTTVGAGADEA 231 (303) T ss_pred chHHHHHHHHHHHhhcCCC--ccEEEEcHHHHHHHHHhhccCCC-eEE-ecCccCCCCCceecceeeEEecccCCccccC Confidence 5688999998888776643 23589999999988653211000 000 00011123456899999999999985321 Q ss_pred ---cEEEEEc-CceeEEeeee-eeehh--hcCCC-----cee---eeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 217 ---EQFVAFH-PSAAAYVSQI-DTVEA--LRDQD-----SFS---DRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 217 ---~~~~~~~-~~a~~~a~~~-~~~e~--~~~~~-----~~~---~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) ..++.|. ..++.+..+. .++|. +.+++ .|. ..+++.+++|.++++|++++.|+..-= T Consensus 232 ~~~~~~~~Gdf~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~n~~~~r~~~r~~~~v~~p~af~~l~~~~~ 303 (303) T protein:vir:97 232 ESKDLVIIGDFESMFKWGYAKQIPMEIIKYGDPDNSGKDLKGYNQIYLRAEAYIGWGILDAKSFARVTKGEV 303 (303) T ss_pred CCccEEEEeeccccEEEEEecCcEEEEeeccCCCCcchhhhhcCcEEEEEEEEeccEeecccceEEeeCCCC Confidence 2234432 3333333221 12221 11111 122 478889999999999999998876666 No 77 >protein:vir:4339 Length: 395 # NCBI annotation: major head protein # Family: family:all:585 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061502;genbank:gi:9635591;genbank:GeneID:1262860 Probab=99.46 E-value=6.1e-14 Score=93.03 Aligned_cols=258 Identities=14% Similarity=0.056 Sum_probs=159.1 Q ss_pred Cccc--hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcc-cceeecCCCcccCCCCCccceEEEEE Q lcl|NC_011288. 1 MAFN--NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAP-TVKDYKAAGRQTSADAISDTGVDLLI 77 (273) Q Consensus 1 MA~~--~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~i 77 (273) ...+ .++|+.|...+++.+++...+.++++.. ...|.++++|+.... .......++...+..+++.+.+++++ T Consensus 117 ~~~~~g~~vp~~~~~~ii~~~~~~~~l~~l~~~~----~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~i~~~~ 192 (395) T protein:vir:43 117 IDGSGGALVAPDRRPGVVAAPQRRLTIRDLVAPG----TTESNSVEYVRETGFVNNAAPVSEGTQKPYSDLTFELENAPV 192 (395) T ss_pred cCCCCccccchhhHHHHHHHHHhhhhHHhhccce----ecCCCceEEEEEecCCCceeeecCCccccccccceeEEEEee Confidence 1211 3567778999999999999988887653 123567888886432 23334445555555567777777777 Q ss_pred eeeeecceEEchHHHHhhhHHHHH-HHHHHHHHHHHHHHHHHHHHH----------hhccc-ccccccCCCHHHHHHHHH Q lcl|NC_011288. 78 DQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADLL----------VDNGT-ALSGSAPTDADDAFDLIA 145 (273) Q Consensus 78 d~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~~D~~i~~~~----------~~~~~-~~~~~~~~t~~~~~~~i~ 145 (273) .+. +.-+.|++. ..+...++.. +.+...++++..+|..++.-- ..... ....+...+....+++|. T Consensus 193 ~k~-~~~~~is~e-ll~d~~~l~~~v~~~la~a~~~~~d~~~l~G~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~~~i~ 270 (395) T protein:vir:43 193 RTI-AHLFKASRQ-ILDDASALQSYIDARARYGLMLVEECQLLYGNGTGANLHGIIPQAQAYAPPSGVVVTAEQRIDRIR 270 (395) T ss_pred eeE-EEeehhhHH-HHHhHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccccccccccchhHHHHH Confidence 554 334566654 3333445666 456788899999999887421 10000 011112233445688899 Q ss_pred HHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCccccCCCcEEEEEc-C Q lcl|NC_011288. 146 TALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFH-P 224 (273) Q Consensus 146 ~a~~~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~~~~~~~~~~-~ 224 (273) ++...+...+.+ .-.++++|..+..|.+...... ... .. ...+|.-+.++|++|+.++.+|.+. ++.+. + T Consensus 271 ~~~~~~~~~~~~--~~~~vmn~~~~~~l~~lkd~~G-~~i--~~-~~~~~~~~~l~G~pVv~~~~~~~~~---~~~gd~~ 341 (395) T protein:vir:43 271 LAILQAQLAEFP--ASGIVLNPIDWALIELNKDAEN-RYI--IG-SPQNGTTPTLWRLPVVETQAITQDE---FLTGAFS 341 (395) T ss_pred HHHHhhccccCC--CcEEEEcHHHHHHHHHhhccCC-cee--cc-ccccCCCceecceeeEEcCCCCCCc---EEEEecc Confidence 998888777653 3468999999988865321100 001 11 1335666789999999999998542 34443 3 Q ss_pred ceeEEee-eeeeehhhcCC-Cce---eeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 225 SAAAYVS-QIDTVEALRDQ-DSF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 225 ~a~~~a~-~~~~~e~~~~~-~~~---~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) .+.-... +...++..+.. ..| ...+++.+++|.++.+|++++.++-++| T Consensus 342 ~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~~~~taa 395 (395) T protein:vir:43 342 LGAQIFDRMDIEVLVSTENDKDFENNMVTIRAEERLAFAVYRPEAFVTGSLTAS 395 (395) T ss_pred ceEEEEEecceEEEEeccccchhhcCcEEEEEEEeeccEEecccceEEEEeccC Confidence 3222222 12223333322 122 3478889999999999999999999999 No 78 >protein:vir:2430 Length: 318 # NCBI annotation: major head subunit # Family: family:all:507 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046832;genbank:gi:9630400;genbank:GeneID:1261582 Probab=99.46 E-value=3.5e-14 Score=94.36 Aligned_cols=263 Identities=10% Similarity=0.004 Sum_probs=153.4 Q ss_pred Cccc------hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeecCCCcccCCCCCccceEE Q lcl|NC_011288. 1 MAFN------NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVD 74 (273) Q Consensus 1 MA~~------~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 74 (273) |++. .++|+.+..++++.+++..++.+++.+- ...+.+++||+....+......++......++..+.++ T Consensus 14 ~~~~~~~~~~~~ip~~~~~~ii~~~~~~~~l~~~~~~~----~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~f~~i~ 89 (318) T protein:vir:24 14 IAQTGDTMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKV----PMGTTGQKIPHWVGDVSAQWIGEGDMKPITKGNMTSQT 89 (318) T ss_pred hhcccCcccceeechhHHHHHHHHHHhhchhhhhccee----eccCCceEEEEEeCCcceEEecCCccccccccceeEEE Confidence 4433 3578889999999999999888887542 12366788998765544455556666666667777777 Q ss_pred EEEeeeeecceEEchHHHHhhhHHHHH-HHHHHHHHHHHHHHHHHHHHHhh--------cccccccc-cCCCHHHHHHHH Q lcl|NC_011288. 75 LLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADLLVD--------NGTALSGS-APTDADDAFDLI 144 (273) Q Consensus 75 ~~id~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~~D~~i~~~~~~--------~~~~~~~~-~~~t~~~~~~~i 144 (273) ++..+. +.-+.+++.-..++..++.. +.+..++++++++|..++.--.. .......+ .........+.+ T Consensus 90 ~~~~k~-~~~~~iS~e~l~ds~~~~~~~i~~~l~~~~~~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 168 (318) T protein:vir:24 90 IAPHKI-ATIFVASAETVRANPANYLGTMRTKVATAFAMAFDGAAMHGTDSPFPTYIGQTTKAISIADTTGATTVYDQVA 168 (318) T ss_pred EeeEEE-EEeehhhHHHhhcChHHHHHHHHHHHHHHHHHHHHHhhhcccCCCCCcccccccccccccccccccchHHHHH Confidence 776442 34456666433445556655 66888899999999988732110 00000000 111112223445 Q ss_pred HHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHHHhh----hhcccccceeeeeeeeeEeceEEEeeCccccCCCcEEE Q lcl|NC_011288. 145 ATALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTS----ADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFV 220 (273) Q Consensus 145 ~~a~~~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~----~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~~~~~~~ 220 (273) .++...+..... ..-.++++|..+..|.+....-.+ ....+.. .....-+++.|++++.++++|.+.. .++ T Consensus 169 ~~~~~~~~~~~~--~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~--~~~~~~~~i~g~pv~~~~~~~~~~~-~~~ 243 (318) T protein:vir:24 169 VNGLSLLVNDGK--KWTHTLLDDITEPILNGAKDQNGRPLFIESTYGEA--ASPFRSGRIVARPTILSDHVVEGTT-VGF 243 (318) T ss_pred HHHHHhhccccC--CCCEEEEcHHHHHHHHHhhccCCceeecCccccCc--cccccCceEEEEeeEEeCCCCCCcc-EEE Confidence 555555554443 345789999999988653211000 0000010 1111225789999999999976433 334 Q ss_pred EEcCceeEEeee-eeeehhhcCC-------------Cce---eeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 221 AFHPSAAAYVSQ-IDTVEALRDQ-------------DSF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 221 ~~~~~a~~~a~~-~~~~e~~~~~-------------~~~---~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) .+..+.+.+... ...++..++. +.| ...+++.+++|.++++|++++.|+...+ T Consensus 244 ~gdfs~~~~~~~~~l~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~i~~~~a 313 (318) T protein:vir:24 244 MGDFSQLIWGQIGGLSFDVTDQATLNLGTVESPNFVSLWQHNLVAVRVEAEYAFHCNDAEAFVALTNVVS 313 (318) T ss_pred EeecceEEEEEecCeEEEEeeccceeccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEEeecc Confidence 444333333321 1122221111 112 2578999999999999999999988666 No 79 >protein:vir:103955 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873992;genbank:gi:118430767;genbank:GeneID:4525449 Probab=99.46 E-value=3.7e-14 Score=94.23 Aligned_cols=258 Identities=10% Similarity=-0.018 Sum_probs=161.5 Q ss_pred Cccc---hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeecCCCcccCCCCCccceEEEEE Q lcl|NC_011288. 1 MAFN---NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLLI 77 (273) Q Consensus 1 MA~~---~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i 77 (273) |+.+ .++|+.|...+++.+++.+++.+++.. ....+.++++|+....+......++......++..+.+++.. T Consensus 30 ~~~~~~~~liP~~~~~~ii~~~~~~s~l~~~~~~----~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~ 105 (324) T protein:vir:10 30 MMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKY----EPMEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRA 105 (324) T ss_pred eccCCCcceechhHHHHHHHHHHhhchhhhhcce----eeccCCceEEEEEeCCcceeEeccCccccccccceeEEEEee Confidence 3333 378999999999999999988887743 122356789999765444555556666655667777777776 Q ss_pred eeeeecceEEchHHHHhhhHHHHH-HHHHHHHHHHHHHHHHHHHHHhhcccc-------cccccCCCHHHHHHHHHHHHH Q lcl|NC_011288. 78 DQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADLLVDNGTA-------LSGSAPTDADDAFDLIATALK 149 (273) Q Consensus 78 d~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~~D~~i~~~~~~~~~~-------~~~~~~~t~~~~~~~i~~a~~ 149 (273) .+. +.-+.|+..-..++..++.. +.+++.+++++++|..++.--...... ...........++++|.++.. T Consensus 106 ~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~ai~~~~d~a~l~G~g~~~~~~~i~~~~~~~~~~~~~~~t~~~i~~~~~ 184 (324) T protein:vir:10 106 FKL-GVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLEA 184 (324) T ss_pred EEE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCccCccccccccccceeccccCCHHHHHHHHH Confidence 443 34456666434444556655 678888999999999877422111100 001111223345888999998 Q ss_pred HHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCccccCCCcEEEEEcCceeEE Q lcl|NC_011288. 150 ELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAY 229 (273) Q Consensus 150 ~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~~~~~~~~~~~~a~~~ 229 (273) .+..++.. .-.++++|..+..|.+. .+ ..+ ...+..|.-++++|.+|+.++..+.+.+ .++.++.+.+.+ T Consensus 185 ~l~~~~~~--~~~~v~n~~~~~~L~~l----~d--~~g-~~~~~~~~~~~l~G~PV~~~~~~~~~~~-~~~~gd~~~~~~ 254 (324) T protein:vir:10 185 LLEDDELE--ANAFISKTQNRSLLRKI----VD--PET-KERIYDRNSDTLDGLPVVNLKSSNLKRG-ELITGDFDKLIY 254 (324) T ss_pred hhhhccCC--CCEEEEcHHHHHHHHHh----hc--cCC-ceeecCCCCccccceeEEeecCCCCCcc-eEEEEecccEEE Confidence 88877653 23578999999988653 21 111 2233345557899999999887765444 355555444433 Q ss_pred eeee-eeehhhcC--------CC-----ce---eeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 230 VSQI-DTVEALRD--------QD-----SF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 230 a~~~-~~~e~~~~--------~~-----~~---~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) .... ..++..++ ++ .| ...+++.+++|..+++|+++++|+...- T Consensus 255 ~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~A~~~l~~a~~ 315 (324) T protein:vir:10 255 GIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADK 315 (324) T ss_pred EEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEEeccC Confidence 3221 12222221 11 12 2578889999999999999998876444 No 80 >protein:vir:7771 Length: 330 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817605;genbank:gi:29566035;genbank:GeneID:1259229 Probab=99.45 E-value=7.7e-14 Score=92.49 Aligned_cols=265 Identities=11% Similarity=-0.019 Sum_probs=156.5 Q ss_pred Cccc--------------hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeecCCCcccCCC Q lcl|NC_011288. 1 MAFN--------------NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSAD 66 (273) Q Consensus 1 MA~~--------------~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~~~~~ 66 (273) ||-. .++|+.+..++++.+++.+++.+++.. ....+..+++|+....+......++...+.. T Consensus 1 m~~~~~~a~~~~~t~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~----~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~ 76 (330) T protein:vir:77 1 MAGSTVPSTQVALTGDFSAFLTPEQSQDYFAEIEKTSIVQRIARK----VPMGPTGISIPHWTGAVSASWTGEAERKPIT 76 (330) T ss_pred CcccccchhhccccCCCcceechhHHHHHHHHHHhccchhhhcce----eeccCCceEEEEEcCCcceeEecCCCccccc Confidence 4433 234455667889999999988887754 2233566889987655445555666666666 Q ss_pred CCccceEEEEEeeeeecceEEchHHHHhhhHHHHH-HHHHHHHHHHHHHHHHHHHH----------Hhhccc------cc Q lcl|NC_011288. 67 AISDTGVDLLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADL----------LVDNGT------AL 129 (273) Q Consensus 67 ~~~~~~~~~~id~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~~D~~i~~~----------~~~~~~------~~ 129 (273) +++.+.+++++.+ .+.-+.|++.-..++..++++ +.++.++++++++|+.++.= +..... .. T Consensus 77 ~~~f~~i~~~~~k-~~~~~~is~ell~ds~~~~~~~i~~~l~~ai~~~~~~~~l~G~g~~~~~~g~~~~~~~~~~~~~~~ 155 (330) T protein:vir:77 77 KGSFGKQELEPVK-ITTIFAESAEVVRLNPLNYLNTMRTKIAEAIALKFDAAAIHGIDKPSAFKGYLAETTKVVSLADTN 155 (330) T ss_pred cceeeEEEEeEEE-EEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCCccccccccccccceeeccc Confidence 6777777777744 344456666433444566765 67888899999999988721 111000 00 Q ss_pred ccccCCCHHHHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhccc---ccceeeeeeeeeEeceEEE Q lcl|NC_011288. 130 SGSAPTDADDAFDLIATALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSG---DAAGLRAGTIGNLLGARIV 206 (273) Q Consensus 130 ~~~~~~t~~~~~~~i~~a~~~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~---~~~~l~~G~ig~~~G~~v~ 206 (273) ...........+++|.++...+..++.+ ....+++|..+..|.+-.....+ .... .......+.-++++|++|+ T Consensus 156 ~~~~~~~~~~~~~~l~~~~~~~~~~~~~--~~~~vmn~~~~~~l~~lkd~~G~-~l~~~~~~~~~~~~~~~~~l~G~PV~ 232 (330) T protein:vir:77 156 LTTASGPQGNAYLAVNNALSLLVNSGKK--WTGTLLDNVTEPILNTAVDGNGR-PLFVESTYTEQVGAIREGRILGRPTY 232 (330) T ss_pred ccccccccchhHHHHHHHHHhhhhcCCC--ccEEEEcHHHHHHHHHHhccCCc-eeecCccccccccccCCceecceeeE Confidence 0111223345678899888888877753 34689999999988653211000 0000 0001111223579999999 Q ss_pred eeCccccCCC---cEEEEEcCceeEEeeee-eeehhhcC-----------------CCce---eeeEEeeeeeeeEEecC Q lcl|NC_011288. 207 ESNNLRDTDD---EQFVAFHPSAAAYVSQI-DTVEALRD-----------------QDSF---SDRIRALHVYGGKVVRP 262 (273) Q Consensus 207 ~s~~l~~~~~---~~~~~~~~~a~~~a~~~-~~~e~~~~-----------------~~~~---~~~v~~~~~~g~~v~~~ 262 (273) .++.+|..+. ..++.++.+...+..+. ..++..++ .+.| ...+++.+++|+.+.+| T Consensus 233 ~~~~~p~~~~~~~~~~~~gd~s~~~i~~~~~~~i~~~~e~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~ 312 (330) T protein:vir:77 233 VADNVVNGTVGNRVVGVMGDFSQVIWGQIGGLSFDVTDQATLDFGEEQGGVWVPKLISLWQHNMVAVRCEAEFAFMVNDK 312 (330) T ss_pred EeccccCCCCCCccEEEEEecceEEEEEecCcEEEEeecceeeecccccccccccccchhhcCcEEEEEEEEeccEEecc Confidence 9999986443 22445554443333221 11211111 1111 35788999999999999 Q ss_pred ceEEEEecCCC Q lcl|NC_011288. 263 TGVVVFNKTGS 273 (273) Q Consensus 263 ~~~v~~~~~~s 273 (273) +++++|+.... T Consensus 313 ~a~~~i~~~~~ 323 (330) T protein:vir:77 313 DAFVKLTDQVA 323 (330) T ss_pred cceEEEEeccC Confidence 99887754444 No 81 >protein:vir:98339 Length: 415 # NCBI annotation: putative capsid protein # Family: family:all:21 # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918931;genbank:gi:119443693;genbank:GeneID:4594501 Probab=99.44 E-value=7e-14 Score=92.70 Aligned_cols=264 Identities=13% Similarity=0.023 Sum_probs=158.3 Q ss_pred Cc-------cchhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeecCCCcccC-CCCCccce Q lcl|NC_011288. 1 MA-------FNNFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTS-ADAISDTG 72 (273) Q Consensus 1 MA-------~~~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~~~-~~~~~~~~ 72 (273) ++ ...++|+.|...+++.++...++..+++.-. .....-++.+|+...........++.... .+....+. T Consensus 120 ~~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~ 197 (415) T protein:vir:98 120 QGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKR--VTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQ 197 (415) T ss_pred hhccccccccccccchHHHHHHHHHHHhhhhhhhheeeee--ccCCceeEEEEeecCCccceeeccccccCcccccceee Confidence 11 1247899999999999999888877765321 11112244455543333333333333332 23345667 Q ss_pred EEEEEeeeeecceEEchHHHHhhhHHHHH-HHHHHHHHHHHHHHHHHHHHHhhccc--------ccccccCCCHHHHHHH Q lcl|NC_011288. 73 VDLLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADLLVDNGT--------ALSGSAPTDADDAFDL 143 (273) Q Consensus 73 ~~~~id~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~~D~~i~~~~~~~~~--------~~~~~~~~t~~~~~~~ 143 (273) +++++.+. +.-+.|++.=..++..++.. +.+..+++++..+|..++.-...... ........+....+++ T Consensus 198 v~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 276 (415) T protein:vir:98 198 LAYDINTH-RGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDD 276 (415) T ss_pred EEeeeeee-EeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccccccccccccccccccccchhH Confidence 77776543 33356665433444556766 56778889999999988764432111 0111222334456888 Q ss_pred HHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCccccCCC--cEEEE Q lcl|NC_011288. 144 IATALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDD--EQFVA 221 (273) Q Consensus 144 i~~a~~~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~~~--~~~~~ 221 (273) |.++...+...... +-.+|++|..+..|.+-... +... -....+.+|.-++|.|++|+.++.+|.++. ..++. T Consensus 277 i~~~~~~~~~~~~~--~~~~v~n~~~~~~l~~lkd~--~G~~-l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~ 351 (415) T protein:vir:98 277 IKDAINLNVKPNYE--HNVAIVSQTMFAKLDKMKDK--LGNY-LIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLII 351 (415) T ss_pred HHHHHHhhhhhccC--CCEEEEcHHHHHHHHHhhcc--CCce-eeccCcCCCCCceecceeeEEecccccCCCCccEEEE Confidence 99988888776653 33578999999988652211 1011 111224456677999999999988875432 33455 Q ss_pred Ec-CceeEEee-eeeeehhhcCCCceeeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 222 FH-PSAAAYVS-QIDTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 222 ~~-~~a~~~a~-~~~~~e~~~~~~~~~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) |+ +.++.... +...++..+ ...+.+.+++-+++|..+.+|++++.++.+.+ T Consensus 352 Gd~~~~~~~~~~~~~~v~~~~-~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~ 404 (415) T protein:vir:98 352 GNLKDAIVLFDRSQYQASWTD-YMHFGECLMIAVRQDCRILDYKSAIVIEYDDS 404 (415) T ss_pred EehhccEEEEeecceEEEEec-cccCceEEEEEEEeccEEeccccEEEEEEecc Confidence 54 33333332 223333332 34556778899999999999999998877777 No 82 >protein:vir:79987 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430002;genbank:gi:156604057;genbank:GeneID:5525447 Probab=99.44 E-value=7e-14 Score=92.70 Aligned_cols=264 Identities=13% Similarity=0.023 Sum_probs=158.3 Q ss_pred Cc-------cchhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeecCCCcccC-CCCCccce Q lcl|NC_011288. 1 MA-------FNNFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTS-ADAISDTG 72 (273) Q Consensus 1 MA-------~~~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~~~-~~~~~~~~ 72 (273) ++ ...++|+.|...+++.++...++..+++.-. .....-++.+|+...........++.... .+....+. T Consensus 120 ~~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~ 197 (415) T protein:vir:79 120 QGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKR--VTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQ 197 (415) T ss_pred hhccccccccccccchHHHHHHHHHHHhhhhhhhheeeee--ccCCceeEEEEeecCCccceeeccccccCcccccceee Confidence 11 1247899999999999999888877765321 11112244455543333333333333332 23345667 Q ss_pred EEEEEeeeeecceEEchHHHHhhhHHHHH-HHHHHHHHHHHHHHHHHHHHHhhccc--------ccccccCCCHHHHHHH Q lcl|NC_011288. 73 VDLLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADLLVDNGT--------ALSGSAPTDADDAFDL 143 (273) Q Consensus 73 ~~~~id~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~~D~~i~~~~~~~~~--------~~~~~~~~t~~~~~~~ 143 (273) +++++.+. +.-+.|++.=..++..++.. +.+..+++++..+|..++.-...... ........+....+++ T Consensus 198 v~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 276 (415) T protein:vir:79 198 LAYDINTH-RGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDD 276 (415) T ss_pred EEeeeeee-EeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccccccccccccccccccccchhH Confidence 77776543 33356665433444556766 56778889999999988764432111 0111222334456888 Q ss_pred HHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCccccCCC--cEEEE Q lcl|NC_011288. 144 IATALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDD--EQFVA 221 (273) Q Consensus 144 i~~a~~~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~~~--~~~~~ 221 (273) |.++...+...... +-.+|++|..+..|.+-... +... -....+.+|.-++|.|++|+.++.+|.++. ..++. T Consensus 277 i~~~~~~~~~~~~~--~~~~v~n~~~~~~l~~lkd~--~G~~-l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~ 351 (415) T protein:vir:79 277 IKDAINLNVKPNYE--HNVAIVSQTMFAKLDKMKDK--LGNY-LIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLII 351 (415) T ss_pred HHHHHHhhhhhccC--CCEEEEcHHHHHHHHHhhcc--CCce-eeccCcCCCCCceecceeeEEecccccCCCCccEEEE Confidence 99988888776653 33578999999988652211 1011 111224456677999999999988875432 33455 Q ss_pred Ec-CceeEEee-eeeeehhhcCCCceeeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 222 FH-PSAAAYVS-QIDTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 222 ~~-~~a~~~a~-~~~~~e~~~~~~~~~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) |+ +.++.... +...++..+ ...+.+.+++-+++|..+.+|++++.++.+.+ T Consensus 352 Gd~~~~~~~~~~~~~~v~~~~-~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~ 404 (415) T protein:vir:79 352 GNLKDAIVLFDRSQYQASWTD-YMHFGECLMIAVRQDCRILDYKSAIVIEYDDS 404 (415) T ss_pred EehhccEEEEeecceEEEEec-cccCceEEEEEEEeccEEeccccEEEEEEecc Confidence 54 33333332 223333332 34556778899999999999999998877777 No 83 >protein:vir:81100 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429874;genbank:gi:156603927;genbank:GeneID:5525320 Probab=99.44 E-value=7e-14 Score=92.70 Aligned_cols=264 Identities=13% Similarity=0.023 Sum_probs=158.3 Q ss_pred Cc-------cchhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeecCCCcccC-CCCCccce Q lcl|NC_011288. 1 MA-------FNNFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTS-ADAISDTG 72 (273) Q Consensus 1 MA-------~~~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~~~-~~~~~~~~ 72 (273) ++ ...++|+.|...+++.++...++..+++.-. .....-++.+|+...........++.... .+....+. T Consensus 120 ~~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~ 197 (415) T protein:vir:81 120 QGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKR--VTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQ 197 (415) T ss_pred hhccccccccccccchHHHHHHHHHHHhhhhhhhheeeee--ccCCceeEEEEeecCCccceeeccccccCcccccceee Confidence 11 1247899999999999999888877765321 11112244455543333333333333332 23345667 Q ss_pred EEEEEeeeeecceEEchHHHHhhhHHHHH-HHHHHHHHHHHHHHHHHHHHHhhccc--------ccccccCCCHHHHHHH Q lcl|NC_011288. 73 VDLLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADLLVDNGT--------ALSGSAPTDADDAFDL 143 (273) Q Consensus 73 ~~~~id~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~~D~~i~~~~~~~~~--------~~~~~~~~t~~~~~~~ 143 (273) +++++.+. +.-+.|++.=..++..++.. +.+..+++++..+|..++.-...... ........+....+++ T Consensus 198 v~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 276 (415) T protein:vir:81 198 LAYDINTH-RGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDD 276 (415) T ss_pred EEeeeeee-EeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccccccccccccccccccccchhH Confidence 77776543 33356665433444556766 56778889999999988764432111 0111222334456888 Q ss_pred HHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCccccCCC--cEEEE Q lcl|NC_011288. 144 IATALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDD--EQFVA 221 (273) Q Consensus 144 i~~a~~~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~~~--~~~~~ 221 (273) |.++...+...... +-.+|++|..+..|.+-... +... -....+.+|.-++|.|++|+.++.+|.++. ..++. T Consensus 277 i~~~~~~~~~~~~~--~~~~v~n~~~~~~l~~lkd~--~G~~-l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~ 351 (415) T protein:vir:81 277 IKDAINLNVKPNYE--HNVAIVSQTMFAKLDKMKDK--LGNY-LIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLII 351 (415) T ss_pred HHHHHHhhhhhccC--CCEEEEcHHHHHHHHHhhcc--CCce-eeccCcCCCCCceecceeeEEecccccCCCCccEEEE Confidence 99988888776653 33578999999988652211 1011 111224456677999999999988875432 33455 Q ss_pred Ec-CceeEEee-eeeeehhhcCCCceeeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 222 FH-PSAAAYVS-QIDTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 222 ~~-~~a~~~a~-~~~~~e~~~~~~~~~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) |+ +.++.... +...++..+ ...+.+.+++-+++|..+.+|++++.++.+.+ T Consensus 352 Gd~~~~~~~~~~~~~~v~~~~-~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~ 404 (415) T protein:vir:81 352 GNLKDAIVLFDRSQYQASWTD-YMHFGECLMIAVRQDCRILDYKSAIVIEYDDS 404 (415) T ss_pred EehhccEEEEeecceEEEEec-cccCceEEEEEEEeccEEeccccEEEEEEecc Confidence 54 33333332 223333332 34556778899999999999999998877777 No 84 >protein:vir:78523 Length: 338 # NCBI annotation: Putative head structural protein # Family: family:all:507 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491585;genbank:gi:157786408;genbank:GeneID:5625675 Probab=99.44 E-value=8.5e-14 Score=92.24 Aligned_cols=266 Identities=11% Similarity=-0.021 Sum_probs=155.1 Q ss_pred Cccc------------hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccc--------eeecCCC Q lcl|NC_011288. 1 MAFN------------NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTV--------KDYKAAG 60 (273) Q Consensus 1 MA~~------------~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~--------~~~~~~~ 60 (273) |+.. .++|+-|+.++++.+++.+++.+++.. ..-.+..+++|+...... .....++ T Consensus 10 ~~~~~~~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~l~~~----~~~~~~~~~ip~~~~~~~a~~v~~~~~~~~~Eg 85 (338) T protein:vir:78 10 NTAGSNHQGRLAHVPSDLLPKEIVGPIFDKAQESSLVLRLGEN----IPISYGETIIPTTVKRPEVGQVGVGTSNEQREG 85 (338) T ss_pred hhcccccccceecccccccchHHHHHHHHHHHhhchhhhhcce----eeccCCceEEEEEecCccceeeccccccccccc Confidence 1111 268999999999999999998888753 223467888888643211 1122333 Q ss_pred cccCCCCCccceEEEEEeeeeecceEEchHHHHhhhHHHHH-HHHHHHHHHHHHHHHHHHHHHhh-----------c-cc Q lcl|NC_011288. 61 RQTSADAISDTGVDLLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADLLVD-----------N-GT 127 (273) Q Consensus 61 ~~~~~~~~~~~~~~~~id~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~~D~~i~~~~~~-----------~-~~ 127 (273) ...+..++..+.++++..+ .+.-+.|++.=..++..++++ +.+++++++++++|..++.--.. . .. T Consensus 86 ~~~~~~~~~f~~v~l~~~k-~~~~~~is~ell~ds~~~~~~~i~~~la~a~~~~~d~~~l~G~g~~~~~~~~gi~~~~~~ 164 (338) T protein:vir:78 86 GTKPLSGTAWDTRSVAPIK-LATIVTVSEEFARMNPSGLYTKLQADLAYAIGRGIDLAVFHGKSPLTGSALQGIDTNNVI 164 (338) T ss_pred ccccccccceeEEEEEEEE-EEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccccccc Confidence 4444455666666666644 234456666433445566765 66888899999999988741110 0 00 Q ss_pred ---ccccccCCCHHHHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhh-cccccceeeeeeeeeEece Q lcl|NC_011288. 128 ---ALSGSAPTDADDAFDLIATALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSAD-TSGDAAGLRAGTIGNLLGA 203 (273) Q Consensus 128 ---~~~~~~~~t~~~~~~~i~~a~~~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~-~~~~~~~l~~G~ig~~~G~ 203 (273) .............++.|.++...+..+. .......+++|..+..|.+... +++.+ ..........|.-++++|+ T Consensus 165 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~m~~~~~~~L~~~~~-l~d~~g~~l~~~~~~~~~~~~l~G~ 242 (338) T protein:vir:78 165 VNTTNVDYLQTGTTPLLDRFLDGYDLVSANT-DVDFNGWAADPRYRARLLRSQA-YRDANGNVDPTRINLAASAGDLLGL 242 (338) T ss_pred ccccccccccccchhhHHHHHHHHHHhhhhc-cccceEEEEchHHHHHHHHHhh-hccCCCceeecccccCCCCceeeee Confidence 0001112233456788888877775433 2234468999999998865432 22211 1111122344566789999 Q ss_pred EEEeeCccccCC------CcEEEEEcCceeEEeeee-eeehhhcC-------------CCcee---eeEEeeeeeeeEEe Q lcl|NC_011288. 204 RIVESNNLRDTD------DEQFVAFHPSAAAYVSQI-DTVEALRD-------------QDSFS---DRIRALHVYGGKVV 260 (273) Q Consensus 204 ~v~~s~~l~~~~------~~~~~~~~~~a~~~a~~~-~~~e~~~~-------------~~~~~---~~v~~~~~~g~~v~ 260 (273) +|+.++++|... ...++.+.-+.+.+.... ..++..+. .+.|. ..++..+++|..++ T Consensus 243 PV~~~~~ip~~~~~~~~~~~~~~~gdfs~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~ 322 (338) T protein:vir:78 243 PVQFGKAVGGDLGAATDSKVRVVGGDFSQLKYGFADEIRVKMSDTATLTDNTSPTPQTVSMWQTNQIAILIEVTFGWLLG 322 (338) T ss_pred eEEEccccCccccccCCcccEEEEEecceEEEEeecccEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEee Confidence 999999988421 123444544333222211 11221111 11121 46788999999999 Q ss_pred cCceEEEEecCCC Q lcl|NC_011288. 261 RPTGVVVFNKTGS 273 (273) Q Consensus 261 ~~~~~v~~~~~~s 273 (273) +|+++++|+...- T Consensus 323 ~~~a~~~l~~~~~ 335 (338) T protein:vir:78 323 DKQAFVKFVDDED 335 (338) T ss_pred cccceEEEecccC Confidence 9999998876444 No 85 >protein:vir:9410 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803388;genbank:gi:29028700;genbank:GeneID:1258136 Probab=99.44 E-value=5e-14 Score=93.52 Aligned_cols=264 Identities=13% Similarity=0.028 Sum_probs=159.3 Q ss_pred CccchhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeecCCCcccC-CCCCccceEEEEEee Q lcl|NC_011288. 1 MAFNNFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTS-ADAISDTGVDLLIDQ 79 (273) Q Consensus 1 MA~~~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~id~ 79 (273) -....++|+.|...+++.+++..++..+++.-. ......++.+|+...........++.... .+....+.+++.+.+ T Consensus 127 ~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~~~~~i~~~~~k 204 (415) T protein:vir:94 127 DSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKR--VTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINT 204 (415) T ss_pred ccccccCcHHHHHHHHHHHHhhhhhhhhcceee--ccCCceeEEEEeecCCccceeccccccccccccccceeeEeehee Confidence 112247899999999999999998888775421 11122345555544433333334443333 233456677777644 Q ss_pred eeecceEEchHHHHhhhHHHHH-HHHHHHHHHHHHHHHHHHHHHhhcccc--------cccccCCCHHHHHHHHHHHHHH Q lcl|NC_011288. 80 EKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADLLVDNGTA--------LSGSAPTDADDAFDLIATALKE 150 (273) Q Consensus 80 ~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~~D~~i~~~~~~~~~~--------~~~~~~~t~~~~~~~i~~a~~~ 150 (273) . +.-+.|++.-..++..++.+ +.++.+++++..+|..++.-....... .......+....+++|.++... T Consensus 205 ~-~~~~~is~ell~ds~~~~~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ 283 (415) T protein:vir:94 205 H-RGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINL 283 (415) T ss_pred e-eeechhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccccccccccccccccccccchHHHHHHHHh Confidence 3 33356665433344556766 567788899999999887644322110 1111223334568888888888 Q ss_pred HhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCccccCCC--cEEEEEc-Ccee Q lcl|NC_011288. 151 LTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDD--EQFVAFH-PSAA 227 (273) Q Consensus 151 l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~~~--~~~~~~~-~~a~ 227 (273) +...... +-.+|++|..+..|.+-.....+ . -....+.+|..++|.|++|+.++.+|.++. ..++.+. +.++ T Consensus 284 ~~~~~~~--~~~~vmn~~~~~~l~~lkd~~G~--~-l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~i~~gd~~~~~ 358 (415) T protein:vir:94 284 NVKPNYE--HNVAIVSQTMFAKLDKMKDKLGN--Y-LIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAI 358 (415) T ss_pred hhhhccC--CCEEEEcHHHHHHHHHhhccCCC--e-eeccCcCCCCCceecceeeEEecccccCCCCccEEEEEehhccE Confidence 8776653 34688999999988653211111 1 111234456678999999999998885443 2345554 3334 Q ss_pred EEeeee-eeehhhcCCCceeeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 228 AYVSQI-DTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 228 ~~a~~~-~~~e~~~~~~~~~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) ....+. ..++..+ ...+.+.+++-+++|..+.+|++++.++.+.+ T Consensus 359 ~~~~~~~~~v~~~~-~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~ 404 (415) T protein:vir:94 359 VLFDRSQYQASWTD-YMHFGECLMIAVRQDCRILDYKSAIVIEYDDS 404 (415) T ss_pred EEEeecceEEEEec-cccCceEEEEEEEeccEEeccccEEEEEEecc Confidence 333322 2333222 34456778899999999999999998877666 No 86 >protein:vir:9574 Length: 300 # NCBI annotation: gp40 # Family: family:all:966 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862879;genbank:gi:32469471;genbank:GeneID:1461316 Probab=99.44 E-value=8.3e-14 Score=92.29 Aligned_cols=263 Identities=11% Similarity=0.009 Sum_probs=162.0 Q ss_pred Cccc-----hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeecCCCcccCCCCCccceEEE Q lcl|NC_011288. 1 MAFN-----NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDL 75 (273) Q Consensus 1 MA~~-----~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 75 (273) ||.+ .++|+-+...+++.++..+++..++.. ...++..+++|+...........++......++..+.+++ T Consensus 1 ma~~t~~~G~lip~~~~~~ii~~l~~~s~i~~l~~~----~~~~~~~~~~p~~~~~~~a~wv~Eg~~~~~s~~~f~~v~l 76 (300) T protein:vir:95 1 MSEAQLSKGNLFNPELVTKVINKVKGHSSIAKLSPQ----KPIPFNGQREFVFDFDSDIDIVAENGKKTHGGVSLDPVTI 76 (300) T ss_pred CcccccCCcceechhhHHHHHHHHHhhhhhhhhcce----eeccCCceEEEEEecCcceEEeeCCcccccccccceeeEe Confidence 9988 467888999999999999888776543 2233456788886544445566666666666677777777 Q ss_pred EEeeeeecceEEchHHHHh---hhHHHH-HHHHHHHHHHHHHHHHHHHHHHh---hccc----------ccccccCCCHH Q lcl|NC_011288. 76 LIDQEKSIDFLVDDIDRVQ---VAGSLE-AYTRAGATALATDTDKFIADLLV---DNGT----------ALSGSAPTDAD 138 (273) Q Consensus 76 ~id~~~~~~~~i~d~d~~~---~~~~~~-~~~~~~~~ala~~~D~~i~~~~~---~~~~----------~~~~~~~~t~~ 138 (273) +..+ .+.-+.|+++=..+ ...++. .+.++.++++++++|..++.-.. ..+. ........+.. T Consensus 77 ~~~k-~~~~~~iS~ell~~~~d~~~~l~~~i~~~l~~aia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 155 (300) T protein:vir:95 77 VPLK-VEYGARVSDEFLHASEEAKVDMLTDFVEGFSKKLARGLDIMSIHGINPRTKQASTIIGDNCFDKKVTQTVPFKDT 155 (300) T ss_pred eeEE-EEEeehhhHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCCCCCcccccccccccccceeeccccc Confidence 7644 23445566542221 223454 46788899999999999884311 0000 00011123345 Q ss_pred HHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCccccCCC-- Q lcl|NC_011288. 139 DAFDLIATALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDD-- 216 (273) Q Consensus 139 ~~~~~i~~a~~~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~~~-- 216 (273) ..++.|.++...+...+.. ....+++|..+..|.+....-. ..-.......|.-++++|++|+.|+.+|.... T Consensus 156 ~~~~~i~~~~~~~~~~~~~--~~~~vmn~~~~~~L~~lkd~~G---~~i~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~ 230 (300) T protein:vir:95 156 NPDESMEDAVGMIDGSERD--ITGAILDPIFTTALSKMKNAEG---GKLYPELAWGGVPDAINGLAVDKNRTVSYSQTDP 230 (300) T ss_pred chHHHHHHHHHHhhhcCCC--ccEEEECHHHHHHHHHhhccCC---CeeccCccccCCCceecceeeEEecCCCCCCCCC Confidence 5678899999888876643 2358999999998865321110 11111223345678999999999999986432 Q ss_pred -cEEEEEcC-ceeEEee-eeeeeh--hhcCCC-----cee---eeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 217 -EQFVAFHP-SAAAYVS-QIDTVE--ALRDQD-----SFS---DRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 217 -~~~~~~~~-~a~~~a~-~~~~~e--~~~~~~-----~~~---~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) ..++++.- .++-+.. +..+++ .+.+.+ .|. ..+++.+++|..+.+|++++.|+..+= T Consensus 231 ~~~~~~GDf~~~~~~~~~~~~~~~v~~~~~~d~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~g 300 (300) T protein:vir:95 231 KNTAIVGDFETMFKWGYAKEVPMEIIKYGDPDNSGRDLKGYNQIYIRCEAYIGWGIMDAASFARIVKTGG 300 (300) T ss_pred ccEEEEeeccceEEEEEecccEEEEeeccCCCCcchhhhhcCcEEEEEEEeecceeecccceEEEecCCC Confidence 23444432 2232221 111221 111111 232 578899999999999999999987777 No 87 >protein:vir:80684 Length: 315 # NCBI annotation: gp6 # Family: family:all:966 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285582;genbank:gi:148727088;genbank:GeneID:5247055 Probab=99.44 E-value=4.6e-14 Score=93.72 Aligned_cols=265 Identities=14% Similarity=0.052 Sum_probs=157.7 Q ss_pred Cccc------hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeecCCCcccCCCCCccceEE Q lcl|NC_011288. 1 MAFN------NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVD 74 (273) Q Consensus 1 MA~~------~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 74 (273) ||.. .++|+.++.++++.+++.+++.+++.+ ....+..++||+....+......++......+..-+.++ T Consensus 1 Ma~~~~~~gg~~vP~~~~~~ii~~l~~~s~i~~l~~~----i~~~~~~~~ip~~~~~~~a~wv~Eg~~~~~s~~~f~~v~ 76 (315) T protein:vir:80 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPE----QPTIFGPVKGAVFSGVPRAKIVGEGEVKPSASVDVSAFT 76 (315) T ss_pred CCCCcCCcCceEcchHHHHHHHHHHHhhchhhhhcce----eecCCCceEEEEEeCCcceEEeeCCccccccccceeeeE Confidence 9977 468999999999999999998887643 122355788999765544555666666666667777777 Q ss_pred EEEeeeeecceEEchHHHHhhhH----HHHH-HHHHHHHHHHHHHHHHHHHHHhh-cccccc-------c-c-cCCCHHH Q lcl|NC_011288. 75 LLIDQEKSIDFLVDDIDRVQVAG----SLEA-YTRAGATALATDTDKFIADLLVD-NGTALS-------G-S-APTDADD 139 (273) Q Consensus 75 ~~id~~~~~~~~i~d~d~~~~~~----~~~~-~~~~~~~ala~~~D~~i~~~~~~-~~~~~~-------~-~-~~~t~~~ 139 (273) +...+. +.-+.|+++=..+... .++. +.+..++++++++|..++.--.. .+.... . + ....... T Consensus 77 l~~~kl-~~~~~iS~ell~~s~~~~~~~l~~~i~~~la~ai~~~~d~a~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 155 (315) T protein:vir:80 77 AQPIKV-VTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKAASAVHTSLNKTKNIVDATDS 155 (315) T ss_pred eeeeeE-EeeehhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHhhheeeccCCCCCccccccccccccccceeecccc Confidence 776443 3334566542222222 2544 56788889999999887732100 000000 0 0 0011223 Q ss_pred HHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhccccc--ceeeeeeeeeEeceEEEeeCccccCCC- Q lcl|NC_011288. 140 AFDLIATALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDA--AGLRAGTIGNLLGARIVESNNLRDTDD- 216 (273) Q Consensus 140 ~~~~i~~a~~~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~--~~l~~G~ig~~~G~~v~~s~~l~~~~~- 216 (273) .+++|.++...+..++.-. ....+++|..+..|.+....- ..+..+.. ..+..|.-++++|.+|+.++.+|.... T Consensus 156 ~~~d~~~~~~~~~~~~~~~-~~~~imn~~~~~~L~~l~~~~-g~~~~g~~~~~~~~~g~~~tl~G~PV~~~~~~~~~~~~ 233 (315) T protein:vir:80 156 ATADLVKAVGLIAGAGLQV-PNGVALDPAFSFALSTEVYPK-GSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEM 233 (315) T ss_pred chHHHHHHHHHHhhccCcc-ceEEEEcHHHHHHHHHHhhcc-CCcccccccccccccCCCceecceeeEecCcCCccccc Confidence 4677888887776554422 235789999999997643211 01111110 123345567899999999999975321 Q ss_pred -----cEEEEEcCc--eeEEeeeeeeehhhcCC-------Cce---eeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 217 -----EQFVAFHPS--AAAYVSQIDTVEALRDQ-------DSF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 217 -----~~~~~~~~~--a~~~a~~~~~~e~~~~~-------~~~---~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) ..++.|.-+ .++....+ .++..+.. +.| ...+++.+++|.++.+|+++++|+..++ T Consensus 234 ~~~~~~~~~~GDfs~~~~g~~~~~-~i~i~~~~~~~~~~~~~~~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~a 306 (315) T protein:vir:80 234 SPASGVKAIVGDFSRVHWGFQRNF-PIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAA 306 (315) T ss_pred ccccccEEEEeecccEEEEEecCe-eEEEeccccccCcccchhhcCcEEEEEEEEecceeecccceEEEeeccC Confidence 223333221 22332221 22222211 112 2478888999999999999999997776 No 88 >protein:vir:1886 Length: 385 # NCBI annotation: major capsid subunit precursor # Family: family:all:585 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037666;genbank:gi:9634124;genbank:GeneID:1262513 Probab=99.44 E-value=1.1e-13 Score=91.60 Aligned_cols=258 Identities=15% Similarity=0.069 Sum_probs=158.0 Q ss_pred Cccc-----hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCc-ccceeecCCCcccCCCCCccceEE Q lcl|NC_011288. 1 MAFN-----NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVA-PTVKDYKAAGRQTSADAISDTGVD 74 (273) Q Consensus 1 MA~~-----~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~-~~~~~~~~~~~~~~~~~~~~~~~~ 74 (273) |... .++|+.+...+++.+...+.+..++..- ...|.++++|+... .........++..+..+++.+.++ T Consensus 105 ~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~----~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~ 180 (385) T protein:vir:18 105 LGSDADSAGSLIQPMQIPGIIMPGLRRLTIRDLLAQG----RTSSNALEYVREEVFTNNADVVAEKALKPESDITFSKQT 180 (385) T ss_pred hccccccCCceecchhhhHHHHHhhhccchhhhccee----cccCcceEEEEEecCCcceeeeccCccccccccceeEEE Confidence 2222 2567778888999999988887776541 12356788888643 233444455555555667777777 Q ss_pred EEEeeeeecceEEchHHHHhhhHHHHH-HHHHHHHHHHHHHHHHHHHHHhh---------cccccccccCCCHHHHHHHH Q lcl|NC_011288. 75 LLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADLLVD---------NGTALSGSAPTDADDAFDLI 144 (273) Q Consensus 75 ~~id~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~~D~~i~~~~~~---------~~~~~~~~~~~t~~~~~~~i 144 (273) +++.+. +.-+.|++. ..+....+.+ +.++.+++++.++|..++.--.. .......+...+....+++| T Consensus 181 ~~~~k~-~~~~~is~e-ll~d~~~l~~~i~~~la~a~~~~~d~~~l~G~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~d~i 258 (385) T protein:vir:18 181 ANVKTI-AHWVQASRQ-VMDDAPMLQSYINNRLMYGLALKEEGQLLNGDGTGDNLEGLNKVATAYDTSLNATGDTRADII 258 (385) T ss_pred EeeeeE-EEeehhhHH-HHhhHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCcccccccccccccccccccccchHHHH Confidence 777554 344566653 3333345655 56778889999999887732111 01111111222344568889 Q ss_pred HHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCccccCCCcEEEEEc- Q lcl|NC_011288. 145 ATALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFH- 223 (273) Q Consensus 145 ~~a~~~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~~~~~~~~~~- 223 (273) .++...|...... .-.++++|..+..|.+...... ... . .....|.-++++|.+|+.++.+|.+ .++.+. T Consensus 259 ~~~~~~l~~~~~~--~~~~~~~~~~~~~l~~lkd~~G--~~l-~-~~~~~~~~~~l~G~pV~~~~~~p~~---~~~~gd~ 329 (385) T protein:vir:18 259 AHAIYQVTESEFS--ASGIVLNPRDWHNIALLKDNEG--RYI-F-GGPQAFTSNIMWGLPVVPTKAQAAG---TFTVGGF 329 (385) T ss_pred HHHHHhhccccCC--CCEEEEcHHHHHHHHHhhcCCC--cee-c-cCcccCCCceecceeeEEcCcCCCC---cEEEeec Confidence 9998888776643 3468999999998865321100 000 0 1123566788999999999999854 244443 Q ss_pred CceeEEeeee-eeehhhcCC-Cce---eeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 224 PSAAAYVSQI-DTVEALRDQ-DSF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 224 ~~a~~~a~~~-~~~e~~~~~-~~~---~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) +.+.....+. ..++..+.. ..| ...+++.+++|..+.+|+++++++-+.+ T Consensus 330 ~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~aa 384 (385) T protein:vir:18 330 DMASQVWDRMDATVEVSREDRDNFVKNMLTILCEERLALAHYRPTAIIKGTFSSG 384 (385) T ss_pred ccEEEEEEecceEEEEeccccchhhcCcEEEEEEEeeccEEecccceEEEEeccC Confidence 3344433322 223322221 122 2468899999999999999999887777 No 89 >protein:vir:191 Length: 385 # NCBI annotation: major head subunit precursor # Family: family:all:585 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037701;genbank:gi:9634158;genbank:GeneID:1262530 Probab=99.44 E-value=1.1e-13 Score=91.60 Aligned_cols=258 Identities=15% Similarity=0.069 Sum_probs=158.0 Q ss_pred Cccc-----hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCc-ccceeecCCCcccCCCCCccceEE Q lcl|NC_011288. 1 MAFN-----NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVA-PTVKDYKAAGRQTSADAISDTGVD 74 (273) Q Consensus 1 MA~~-----~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~-~~~~~~~~~~~~~~~~~~~~~~~~ 74 (273) |... .++|+.+...+++.+...+.+..++..- ...|.++++|+... .........++..+..+++.+.++ T Consensus 105 ~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~----~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~ 180 (385) T protein:vir:19 105 LGSDADSAGSLIQPMQIPGIIMPGLRRLTIRDLLAQG----RTSSNALEYVREEVFTNNADVVAEKALKPESDITFSKQT 180 (385) T ss_pred hccccccCCceecchhhhHHHHHhhhccchhhhccee----cccCcceEEEEEecCCcceeeeccCccccccccceeEEE Confidence 2222 2567778888999999988887776541 12356788888643 233444455555555667777777 Q ss_pred EEEeeeeecceEEchHHHHhhhHHHHH-HHHHHHHHHHHHHHHHHHHHHhh---------cccccccccCCCHHHHHHHH Q lcl|NC_011288. 75 LLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADLLVD---------NGTALSGSAPTDADDAFDLI 144 (273) Q Consensus 75 ~~id~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~~D~~i~~~~~~---------~~~~~~~~~~~t~~~~~~~i 144 (273) +++.+. +.-+.|++. ..+....+.+ +.++.+++++.++|..++.--.. .......+...+....+++| T Consensus 181 ~~~~k~-~~~~~is~e-ll~d~~~l~~~i~~~la~a~~~~~d~~~l~G~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~d~i 258 (385) T protein:vir:19 181 ANVKTI-AHWVQASRQ-VMDDAPMLQSYINNRLMYGLALKEEGQLLNGDGTGDNLEGLNKVATAYDTSLNATGDTRADII 258 (385) T ss_pred EeeeeE-EEeehhhHH-HHhhHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCcccccccccccccccccccccchHHHH Confidence 777554 344566653 3333345655 56778889999999887732111 01111111222344568889 Q ss_pred HHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCccccCCCcEEEEEc- Q lcl|NC_011288. 145 ATALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFH- 223 (273) Q Consensus 145 ~~a~~~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~~~~~~~~~~- 223 (273) .++...|...... .-.++++|..+..|.+...... ... . .....|.-++++|.+|+.++.+|.+ .++.+. T Consensus 259 ~~~~~~l~~~~~~--~~~~~~~~~~~~~l~~lkd~~G--~~l-~-~~~~~~~~~~l~G~pV~~~~~~p~~---~~~~gd~ 329 (385) T protein:vir:19 259 AHAIYQVTESEFS--ASGIVLNPRDWHNIALLKDNEG--RYI-F-GGPQAFTSNIMWGLPVVPTKAQAAG---TFTVGGF 329 (385) T ss_pred HHHHHhhccccCC--CCEEEEcHHHHHHHHHhhcCCC--cee-c-cCcccCCCceecceeeEEcCcCCCC---cEEEeec Confidence 9998888776643 3468999999998865321100 000 0 1123566788999999999999854 244443 Q ss_pred CceeEEeeee-eeehhhcCC-Cce---eeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 224 PSAAAYVSQI-DTVEALRDQ-DSF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 224 ~~a~~~a~~~-~~~e~~~~~-~~~---~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) +.+.....+. ..++..+.. ..| ...+++.+++|..+.+|+++++++-+.+ T Consensus 330 ~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~aa 384 (385) T protein:vir:19 330 DMASQVWDRMDATVEVSREDRDNFVKNMLTILCEERLALAHYRPTAIIKGTFSSG 384 (385) T ss_pred ccEEEEEEecceEEEEeccccchhhcCcEEEEEEEeeccEEecccceEEEEeccC Confidence 3344433322 223322221 122 2468899999999999999999887777 No 90 >protein:vir:78223 Length: 333 # NCBI annotation: Putative major head protein # Family: family:all:966 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491666;genbank:gi:157786490;genbank:GeneID:5625701 Probab=99.43 E-value=1e-13 Score=91.75 Aligned_cols=266 Identities=9% Similarity=-0.019 Sum_probs=154.8 Q ss_pred Cccc--hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeecCCCcc--------cCCCCCcc Q lcl|NC_011288. 1 MAFN--NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQ--------TSADAISD 70 (273) Q Consensus 1 MA~~--~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~--------~~~~~~~~ 70 (273) |... .++|+.+..++++.+++.+++..++..- ...+.++++|+........+..++.. .+...+.. T Consensus 20 ~~~~~~~liP~~~~~~ii~~l~~~s~l~~~~~~~----~~~~~~~~~p~~~~~~~a~~v~eg~~~~~~e~~~~~~~~~~f 95 (333) T protein:vir:78 20 LAHVPSDLLPKEIVGPIFDKAQESSLVLRMGEQI----PISYGETIIPTTVKRPEVGQVGVGTSNEQREGGLKPLSGTAW 95 (333) T ss_pred eecCCccccchhHHHHHHHHHHhhchhhhhccee----eccCCceEEEEEeCCceeEeecCcccccccccccccccccce Confidence 2111 2689999999999999999888877541 12356778888765544444333221 22233444 Q ss_pred ceEEEEEeeeeecceEEchHHHHhhhHHHHH-HHHHHHHHHHHHHHHHHHHHHhhc-cc--------------ccccccC Q lcl|NC_011288. 71 TGVDLLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADLLVDN-GT--------------ALSGSAP 134 (273) Q Consensus 71 ~~~~~~id~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~~D~~i~~~~~~~-~~--------------~~~~~~~ 134 (273) +.+++...+. +.-+.|++.-..+...++.. +.+++++++++++|..++.--... +. ....... T Consensus 96 ~~i~l~~~kl-~~~~~is~ell~~s~~~~~~~i~~~la~ai~~~~d~~~l~G~g~~~~~~~~g~~~~~~~~~~~~~~~~~ 174 (333) T protein:vir:78 96 DTRSVSPIKL-ATIVTVSEEFARMNPSGLYTKLQGDLAYAIGRGIDLAVFHGKSPLTGSALQGIDTDNVIANTTNVDYLQ 174 (333) T ss_pred eEEEEeeEEE-EEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhcccCCCCCcccccccccccccccccccccc Confidence 4455544322 33345555333345556755 667888999999999987311100 00 0001112 Q ss_pred CCHHHHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhh-cccccceeeeeeeeeEeceEEEeeCcccc Q lcl|NC_011288. 135 TDADDAFDLIATALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSAD-TSGDAAGLRAGTIGNLLGARIVESNNLRD 213 (273) Q Consensus 135 ~t~~~~~~~i~~a~~~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~-~~~~~~~l~~G~ig~~~G~~v~~s~~l~~ 213 (273) ......+++|.++...+..+.- ......+++|..+..|++.. .+.+.+ .......+..|.-++++|++|+.++++|. T Consensus 175 ~~~~~~~~~i~~~~~~~~~~~~-~~~~~~vmn~~~~~~L~~~~-~~~d~~G~~i~~~~~~~~~~~~l~G~Pv~~~~~i~~ 252 (333) T protein:vir:78 175 ETGDPLLDRLLDGYDLVSANTD-VEFNGWAVDPRFRAHLLRAQ-AYRDANGNVDPSRINLAAQTGDVLGLPAQFGRAVGG 252 (333) T ss_pred cccchhHHHHHHHHHhhccccc-cCceEEEEcchHHHHHHHHh-hhcCCCCceeecCccccCCCceeeceeeEEccccCC Confidence 2334567888888877765432 23346888999999887643 222211 11112234456678999999999999985 Q ss_pred CC------CcEEEEEcCceeEEeeee-eeehhhcC----------CCcee---eeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 214 TD------DEQFVAFHPSAAAYVSQI-DTVEALRD----------QDSFS---DRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 214 ~~------~~~~~~~~~~a~~~a~~~-~~~e~~~~----------~~~~~---~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) .. ...++.+..+-+.+.... ..++..+. .+.|. ..+++.+++|.++++|+++++|+.... T Consensus 253 ~~~~~~~~~~~~~~gD~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~a 332 (333) T protein:vir:78 253 DLGAAVDSKTRIIGGDFSQLKFGFADEIRIKMSDTATLTDSGSATVSMWQTNQIAILIEVTFGWLLGDKQAFVKFVDDEQ 332 (333) T ss_pred CccccCCCccEEEEEecccEEEEEeeccEEEEeccccccccccceeehhhcCcEEEEEEEEEccEEecccceEEEeccCC Confidence 42 123555554433333221 12221111 11222 357889999999999999999986666 No 91 >protein:vir:95763 Length: 297 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950590;genbank:gi:119953785;genbank:GeneID:5076833 Probab=99.43 E-value=9.3e-14 Score=92.02 Aligned_cols=258 Identities=10% Similarity=0.016 Sum_probs=157.7 Q ss_pred Ccc---c---hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeecCCCcccCCCCCccceEE Q lcl|NC_011288. 1 MAF---N---NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVD 74 (273) Q Consensus 1 MA~---~---~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 74 (273) |.+ + .++|+.|+.++.+.+.+.+++.+++.+-. . ..+..+.+|+...........++......+.+.+.++ T Consensus 9 ~~~~~t~~~~~lvP~~~~~~ii~~~~~~s~l~~~~~~~~--~-~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~f~~v~ 85 (297) T protein:vir:95 9 ENVLVSQKKDGTLHKEFTDIIMKEVAQNSLVMQLGQYQE--M-EGEQEKTVYVQTDGISAYWVNETEKIKTDKPEVVPVT 85 (297) T ss_pred ccccccCCCcceechhHHHHHHHHHHhhchhhhhcceee--c-CCCccEEEEEEcCCceeEEeecCccccccccceeEEE Confidence 211 1 36899999999999999998888875421 1 1122456676554433444455555555566777777 Q ss_pred EEEeeeeecceEEchHHHHhhhHHHHH-HHHHHHHHHHHHHHHHHHHHHhh-ccccc-----ccccCCCHHHHHHHHHHH Q lcl|NC_011288. 75 LLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADLLVD-NGTAL-----SGSAPTDADDAFDLIATA 147 (273) Q Consensus 75 ~~id~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~~D~~i~~~~~~-~~~~~-----~~~~~~t~~~~~~~i~~a 147 (273) +...+ .+.-+.|++.-..++..++.+ +.+++++++++++|..++.--.+ .+... ...........+++|.++ T Consensus 86 l~~~k-~~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~l~G~g~~~~~gi~~~~~~~~~~~~~~~t~~~i~~~ 164 (297) T protein:vir:95 86 LKAHK-LGIILVTSREALNYTWKKFFEDMKPQIVEAFYKKIDEAGLLGHDTPFANSVAKAAKDANKVIGGPINYDNILKL 164 (297) T ss_pred EeeEE-EEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhcccCCcccccccccccccceecccccCHHHHHHH Confidence 77644 344456666434445556655 67888999999999998731100 00000 000111122357889999 Q ss_pred HHHHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCccccCCCcEEEEEcCcee Q lcl|NC_011288. 148 LKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAA 227 (273) Q Consensus 148 ~~~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~~~~~~~~~~~~a~ 227 (273) ...|..++.+. -.++++|..+..|.+ +++ . .+..+.++..+.++|++++.++..+...+ .++++..+.+ T Consensus 165 ~~~l~~~~~~~--~~~v~~~~~~~~L~~----l~d--~--~G~~i~~~~~~~l~G~Pv~~~~~~~~~~~-~~~~gd~s~~ 233 (297) T protein:vir:95 165 QDALYDADVEP--NAFVSKIQNRSALRE----ARD--G--NKVSIYDKAANTIDGITTVDLKSARFEKG-DLLAGDFDNL 233 (297) T ss_pred HHHhhhccCCc--CEEEEcHHHHHHHHH----hhc--c--CCceeecCCCCcccceeeEeecCCCCCCc-eEEEEecccE Confidence 99998877543 357899999998865 222 1 22345667778899999998876654444 3555544433 Q ss_pred EEeeee-eeehhhcCC-------------Cce---eeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 228 AYVSQI-DTVEALRDQ-------------DSF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 228 ~~a~~~-~~~e~~~~~-------------~~~---~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) .+.... ..++..++. +.| ...+++.+++|.++++|+++++|+.+-. T Consensus 234 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~at~ 296 (297) T protein:vir:95 234 IYGVPYNITYKISEEGQISTITNADGTPINLFEQEMIAIRATMDIAVMITKTDAFAKLTPAER 296 (297) T ss_pred EEEEecCeEEEEeeccccccccccCccchhhhhcCcEEEEEEEEeccEeecccceEEEeecCC Confidence 332211 122222111 112 3578888999999999999999976655 No 92 >protein:vir:4226 Length: 326 # NCBI annotation: observed 35.2Kd protein # Family: family:all:507 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039681;swissprot:sw:q05223;genbank:gi:9625447;uniprot:Q05223;genbank:GeneID:2942929 Probab=99.43 E-value=9.5e-14 Score=91.98 Aligned_cols=265 Identities=9% Similarity=-0.032 Sum_probs=147.2 Q ss_pred Cccc---hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeecCCCcccCCCCCccceEEEEE Q lcl|NC_011288. 1 MAFN---NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLLI 77 (273) Q Consensus 1 MA~~---~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i 77 (273) .... -++|+-+...+++.+++..++.+++.+- ...+.+.++|+...........++...+..++..+.+++.. T Consensus 22 ~~~~~~g~~ip~~~~~~ii~~~~~~s~i~~~~~~~----~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~f~~i~~~~ 97 (326) T protein:vir:42 22 TGDSMFEGYLEPEQAQDYFAEAEKISIVQQFAQKI----PMGTTGQKIPHWTGDVSASWIGEGDMKPITKGNMTSQTIAP 97 (326) T ss_pred ccccCCcceechhhHHHHHHHHHhcchhhhhccee----eccCCceEEEEEeCCcceEEecCCccccccccceeEEEEee Confidence 1111 2578888999999999998887776541 12356788888765444445556666666677777777777 Q ss_pred eeeeecceEEchHHHHhhhHHHHH-HHHHHHHHHHHHHHHHHHHHHh---------hc---ccccccccCCCHHHHHHHH Q lcl|NC_011288. 78 DQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADLLV---------DN---GTALSGSAPTDADDAFDLI 144 (273) Q Consensus 78 d~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~~D~~i~~~~~---------~~---~~~~~~~~~~t~~~~~~~i 144 (273) .+ .+.-+.|++.-..++..++.+ +.++..+++++++|+.++.=-. .. ..........+......++ T Consensus 98 ~k-~~~~v~iS~ell~~s~~~~~~~i~~~l~~a~~~~~d~a~l~G~gs~~p~gi~~~~~~~~~~~~~~~~~~~~~~~~~~ 176 (326) T protein:vir:42 98 HK-IATIFVASAETVRANPANYLGTMRTKVATAFAMAFDNAAINGTDSPFPTFLAQTTKEVSLVDPDGTGSNADLTVYDA 176 (326) T ss_pred EE-EEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCccccccccccccceeecccccccccchhHHH Confidence 44 345567776444445566766 6678889999999998873100 00 0001111111222222222 Q ss_pred --HHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcc--cccceeeeeeeeeEeceEEEeeCccccCCCcEEE Q lcl|NC_011288. 145 --ATALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTS--GDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFV 220 (273) Q Consensus 145 --~~a~~~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~--~~~~~l~~G~ig~~~G~~v~~s~~l~~~~~~~~~ 220 (273) ..+...+... ...+-..+++|..+..|.+-.....+.-.. ...........+++.|++++.++.+|.+... .+ T Consensus 177 ~~~~~~~~~~~~--~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~-~~ 253 (326) T protein:vir:42 177 VAVNALSLLVNA--GKKWTHTLLDDITEPILNGAKDKSGRPLFIESTYTEENSPFRLGRIVARPTILSDHVASGTVV-GY 253 (326) T ss_pred HHHHHHhhhhhh--ccCccEEEEeHHHHHHHHHhhccCCceeeccccccCccccccCceeeeeeEEEcCCCCCCceE-EE Confidence 2222233222 234456789999999986532111000000 0000111123467999999999999864332 22 Q ss_pred EEcCceeEEeee-eeeehhhc--------CC-----Cce---eeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 221 AFHPSAAAYVSQ-IDTVEALR--------DQ-----DSF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 221 ~~~~~a~~~a~~-~~~~e~~~--------~~-----~~~---~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) .|.-+.+.+... ...++..+ ++ ..| ...+++.+++|+++.+|++++.|+...+ T Consensus 254 ~Gd~s~~~~~~~~~~~v~~~~e~~~~~~~~~~~~~~~~~~~d~~~~r~~~~~d~~v~~~~a~~~l~~~~~ 323 (326) T protein:vir:42 254 QGDFRQLVWGQVGGLSFDVTDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYAFHCNDKDAFVKLTNVDA 323 (326) T ss_pred EeecceEEEEEecceEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEecccceEEEeeccc Confidence 222111111111 11111111 11 112 2578999999999999999998887777 No 93 >protein:vir:4700 Length: 415 # NCBI annotation: phi PVL ORF 7 homologue # Family: family:all:21 # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061632;genbank:gi:9635719;genbank:GeneID:1262976 Probab=99.42 E-value=1.4e-13 Score=91.07 Aligned_cols=261 Identities=12% Similarity=0.008 Sum_probs=155.1 Q ss_pred Cc-------cchhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEee--cCcccceeecCCCcccC-CCCCcc Q lcl|NC_011288. 1 MA-------FNNFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAG--VVAPTVKDYKAAGRQTS-ADAISD 70 (273) Q Consensus 1 MA-------~~~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~--~~~~~~~~~~~~~~~~~-~~~~~~ 70 (273) ++ ...++|+.|...+++.+++.+++.++++.-. ..+.+.++|. ...........++.... .+.... T Consensus 120 ~~~~~~t~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~~~ 195 (415) T protein:vir:47 120 QGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKR----VTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPF 195 (415) T ss_pred hhccccccCCcccccHHHHHHHHHHHHhhhhhhhhcceee----ccCCceeEEEEEecCCcceeecccccccccccccce Confidence 11 1247899999999999999998888775321 1122333343 33322222333333332 233455 Q ss_pred ceEEEEEeeeeecceEEchHHHHhhhHHHHH-HHHHHHHHHHHHHHHHHHHHHhhcccc--------cccccCCCHHHHH Q lcl|NC_011288. 71 TGVDLLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADLLVDNGTA--------LSGSAPTDADDAF 141 (273) Q Consensus 71 ~~~~~~id~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~~D~~i~~~~~~~~~~--------~~~~~~~t~~~~~ 141 (273) +.++++..+. +.-+.|++.-..++..++.. +.+.+++++++++|..++.-..+.... .......+....+ T Consensus 196 ~~v~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~~~~ 274 (415) T protein:vir:47 196 FQLAYDINTH-RGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSL 274 (415) T ss_pred eeEEeeeeee-EeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCCccccccccccccceeccccccch Confidence 6666666443 34456666444444556766 667888999999999988644321110 1111222334567 Q ss_pred HHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhh-cccccceeeeeeeeeEeceEEEeeCccccCCC--cE Q lcl|NC_011288. 142 DLIATALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSAD-TSGDAAGLRAGTIGNLLGARIVESNNLRDTDD--EQ 218 (273) Q Consensus 142 ~~i~~a~~~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~-~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~~~--~~ 218 (273) ++|.++...+...... +-.+|++|..+..|.+- ++.+ .......+.+|.-++|.|++|+.++.+|.++. .. T Consensus 275 ~~i~~~~~~~~~~~~~--~~~~v~n~~~~~~L~~l----kd~~G~~i~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~ 348 (415) T protein:vir:47 275 DDIKDAINLNVKPNYE--HNVAIVSQTMFAKLDKM----KDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNT 348 (415) T ss_pred HHHHHHHHhhhhhccC--CCEEEEcHHHHHHHHHh----hccCCCeeeccCcCCCCCccccceeeEEeccccccCCCccE Confidence 8888888888766643 34688999999988542 2111 11111234466678999999999988885443 33 Q ss_pred EEEEc-CceeEEee-eeeeehhhcCCCceeeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 219 FVAFH-PSAAAYVS-QIDTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 219 ~~~~~-~~a~~~a~-~~~~~e~~~~~~~~~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) ++.+. +.++.... +...++..+ ...+.+.+++-+++|+++++|++++.++.+.+ T Consensus 349 ~~~gd~~~~~~~~~~~~~~v~~~~-~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~ 404 (415) T protein:vir:47 349 LIIGNLKDAIVLFDRSQYQASWTD-YMHFGECLMIAVRQDCRILDYKSAIVIEYDDS 404 (415) T ss_pred EEEEehhccEEEEeecceEEEeec-cccCceEEEEEEEeccEEeccccEEEEEeecc Confidence 45554 33333333 222333222 33445678899999999999999998876655 No 94 >protein:vir:4600 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058445;genbank:gi:9635171;genbank:GeneID:1262708 Probab=99.42 E-value=1.4e-13 Score=91.07 Aligned_cols=261 Identities=12% Similarity=0.008 Sum_probs=155.1 Q ss_pred Cc-------cchhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEee--cCcccceeecCCCcccC-CCCCcc Q lcl|NC_011288. 1 MA-------FNNFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAG--VVAPTVKDYKAAGRQTS-ADAISD 70 (273) Q Consensus 1 MA-------~~~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~--~~~~~~~~~~~~~~~~~-~~~~~~ 70 (273) ++ ...++|+.|...+++.+++.+++.++++.-. ..+.+.++|. ...........++.... .+.... T Consensus 120 ~~~~~~t~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~~~ 195 (415) T protein:vir:46 120 QGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKR----VTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPF 195 (415) T ss_pred hhccccccCCcccccHHHHHHHHHHHHhhhhhhhhcceee----ccCCceeEEEEEecCCcceeecccccccccccccce Confidence 11 1247899999999999999998888775321 1122333343 33322222333333332 233455 Q ss_pred ceEEEEEeeeeecceEEchHHHHhhhHHHHH-HHHHHHHHHHHHHHHHHHHHHhhcccc--------cccccCCCHHHHH Q lcl|NC_011288. 71 TGVDLLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADLLVDNGTA--------LSGSAPTDADDAF 141 (273) Q Consensus 71 ~~~~~~id~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~~D~~i~~~~~~~~~~--------~~~~~~~t~~~~~ 141 (273) +.++++..+. +.-+.|++.-..++..++.. +.+.+++++++++|..++.-..+.... .......+....+ T Consensus 196 ~~v~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~~~~ 274 (415) T protein:vir:46 196 FQLAYDINTH-RGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSL 274 (415) T ss_pred eeEEeeeeee-EeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCCccccccccccccceeccccccch Confidence 6666666443 34456666444444556766 667888999999999988644321110 1111222334567 Q ss_pred HHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhh-cccccceeeeeeeeeEeceEEEeeCccccCCC--cE Q lcl|NC_011288. 142 DLIATALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSAD-TSGDAAGLRAGTIGNLLGARIVESNNLRDTDD--EQ 218 (273) Q Consensus 142 ~~i~~a~~~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~-~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~~~--~~ 218 (273) ++|.++...+...... +-.+|++|..+..|.+- ++.+ .......+.+|.-++|.|++|+.++.+|.++. .. T Consensus 275 ~~i~~~~~~~~~~~~~--~~~~v~n~~~~~~L~~l----kd~~G~~i~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~ 348 (415) T protein:vir:46 275 DDIKDAINLNVKPNYE--HNVAIVSQTMFAKLDKM----KDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNT 348 (415) T ss_pred HHHHHHHHhhhhhccC--CCEEEEcHHHHHHHHHh----hccCCCeeeccCcCCCCCccccceeeEEeccccccCCCccE Confidence 8888888888766643 34688999999988542 2111 11111234466678999999999988885443 33 Q ss_pred EEEEc-CceeEEee-eeeeehhhcCCCceeeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 219 FVAFH-PSAAAYVS-QIDTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 219 ~~~~~-~~a~~~a~-~~~~~e~~~~~~~~~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) ++.+. +.++.... +...++..+ ...+.+.+++-+++|+++++|++++.++.+.+ T Consensus 349 ~~~gd~~~~~~~~~~~~~~v~~~~-~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~ 404 (415) T protein:vir:46 349 LIIGNLKDAIVLFDRSQYQASWTD-YMHFGECLMIAVRQDCRILDYKSAIVIEYDDS 404 (415) T ss_pred EEEEehhccEEEEeecceEEEeec-cccCceEEEEEEEeccEEeccccEEEEEeecc Confidence 45554 33333333 222333222 33445678899999999999999998876655 No 95 >protein:vir:8187 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817980;genbank:gi:29566414;genbank:GeneID:2700968 Probab=99.41 E-value=2.4e-13 Score=89.77 Aligned_cols=263 Identities=14% Similarity=0.050 Sum_probs=157.0 Q ss_pred Cccc----hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeecCCCcccCCCCCccceEEEE Q lcl|NC_011288. 1 MAFN----NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLL 76 (273) Q Consensus 1 MA~~----~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 76 (273) ||.. .++|+-+...+++.+++.+++..++..- .-.+..+++|+....+......++...+..+++.+.+++. T Consensus 1 mat~~~gg~lvP~~~~~~ii~~~~~~s~i~~~~~~i----~~~~~~~~~p~~~~~~~a~wv~Eg~~~~~~~~~f~~v~l~ 76 (311) T protein:vir:81 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAE----PQEFGEQQYMTLTAPPRGEVVGEGAQKSESTATFAPVTAI 76 (311) T ss_pred CceecCCceEcchhHHHHHHHHHHhcchhhhhccee----ecCCCceEEEEEeCCceeEEeecCcccccccceeeEEEEe Confidence 8776 5889999999999999999888877542 1234568899975544455556666666666777777777 Q ss_pred EeeeeecceEEchHHHHhh---hHHHHH-HHHHHHHHHHHHHHHHHHHHHhhcc-c-------------ccccccCCCHH Q lcl|NC_011288. 77 IDQEKSIDFLVDDIDRVQV---AGSLEA-YTRAGATALATDTDKFIADLLVDNG-T-------------ALSGSAPTDAD 138 (273) Q Consensus 77 id~~~~~~~~i~d~d~~~~---~~~~~~-~~~~~~~ala~~~D~~i~~~~~~~~-~-------------~~~~~~~~t~~ 138 (273) ..+. +.-+.|+++=..+. ..++.+ +.+++++++++++|..++.--.+.. . .....+..+.. T Consensus 77 ~~kl-~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~a~l~G~~~~~~~~~~gi~~~~~~~~~~~~~~~~~~~ 155 (311) T protein:vir:81 77 PRKV-QVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTSA 155 (311) T ss_pred eEEE-EEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHhhhccccCCCCcccccccccccccceeeeecccccc Confidence 6443 33345555422212 233544 6788999999999998874321000 0 00011122223 Q ss_pred HHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCccccCCC-- Q lcl|NC_011288. 139 DAFDLIATALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDD-- 216 (273) Q Consensus 139 ~~~~~i~~a~~~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~~~-- 216 (273) ..+..|.++...+...+.. ....+++|..+..|.+...... ..........|.-++++|++|+.++.+|.... T Consensus 156 ~~~~~i~~~~~~~~~~~~~--~~~~vmn~~~~~~l~~lkd~~G---~~l~~~~~~~~~~~tl~G~Pv~~~~~i~~~~~~~ 230 (311) T protein:vir:81 156 TPDLAVEAAVGLVLGDNLS--PDGVALDNTFSFMLATQRDSQG---RKLYPELGFGTDVASFAGLNAAVSDTVRGGPEAV 230 (311) T ss_pred hHHHHHHHHHHHhhhcCCC--ceEEEEcHHHHHHHHhhhccCC---CeeecCccccCCCceecceeEEeccccccccccc Confidence 4456677777777666542 2358999999998865321100 00111122345568899999999998874321 Q ss_pred -------------cEEEEEcCceeEEeee-eeeehhhcCC------Cce---eeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 217 -------------EQFVAFHPSAAAYVSQ-IDTVEALRDQ------DSF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 217 -------------~~~~~~~~~a~~~a~~-~~~~e~~~~~------~~~---~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) ..++.+.-+-+.+..+ -..++..+.. +.| ...+++.+++|.++++|++++.|+...+ T Consensus 231 ~~~~~~~~~~~~~~~~~~gDfs~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~r~~~r~d~~v~~~~a~~~l~~a~~ 310 (311) T protein:vir:81 231 TASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADE 310 (311) T ss_pred ccccchhcccCCccEEEEEecccEEEEEeccceEEEeccCCCCcchhhhhcCcEEEEEEEEeccEeecccceEEEEeecc Confidence 1223333222222211 1122222221 112 2478888999999999999999987777 No 96 >protein:vir:100135 Length: 418 # NCBI annotation: gp5 # Family: family:all:585 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945035;genbank:gi:38707895;genbank:GeneID:2744182 Probab=99.41 E-value=2.1e-13 Score=90.11 Aligned_cols=258 Identities=14% Similarity=0.022 Sum_probs=154.9 Q ss_pred Cc-----cchhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcc-cceeecCCCcccCCCCCccceEE Q lcl|NC_011288. 1 MA-----FNNFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAP-TVKDYKAAGRQTSADAISDTGVD 74 (273) Q Consensus 1 MA-----~~~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~-~~~~~~~~~~~~~~~~~~~~~~~ 74 (273) +. ...++|+.|+..+++.++....+.++++.- ...|.++++|+.... .......++......++..+.++ T Consensus 136 ~~~~~~~~g~lvp~~~~~~ii~~~~~~~~l~~~~~~~----~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~f~~v~ 211 (418) T protein:vir:10 136 VGSGVSGSNSLVVADRQAGIIAPPQRKMTIRDLLMPG----QTSSSSIEYTVETGFTNNAAAVAEGAQKPTSDLKFNLKN 211 (418) T ss_pred ccCCCCCCccccchhHHHHHHHHHhhhhhHHhhccee----eccCCceeEEEEecCCCceeeeccCccccccccceeeEE Confidence 11 113789999999999999998888877532 223567888886442 33334455555555566777777 Q ss_pred EEEeeeeecceEEchHHHHhhhHHHHH-HHHHHHHHHHHHHHHHHHHHHhhc--c-------cccccccCCCHHHHHHHH Q lcl|NC_011288. 75 LLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADLLVDN--G-------TALSGSAPTDADDAFDLI 144 (273) Q Consensus 75 ~~id~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~~D~~i~~~~~~~--~-------~~~~~~~~~t~~~~~~~i 144 (273) +...+. +.-+.|++. ..+...++.. +.+...++++.++|..++.--... + .........+....+++| T Consensus 212 ~~~~k~-~~~~~is~e-ll~ds~~l~~~i~~~l~~a~~~~~d~a~l~G~g~~~~p~Gi~~~~~~~~~~~~~~~~~~~~~i 289 (418) T protein:vir:10 212 QPVRTI-AHLFKASRQ-ILDDAPALQSYIDGRARYGLQLTEEGQILKGDGTGANILGILPQASAFMPSITLANATPIDKI 289 (418) T ss_pred EeeeeE-EEeehhhHH-HHHhHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccccccccccccHHHH Confidence 777553 333556654 3333446766 556788899999999887311100 0 011111122333457788 Q ss_pred HHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCccccCCCcEEEEEcC Q lcl|NC_011288. 145 ATALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHP 224 (273) Q Consensus 145 ~~a~~~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~~~~~~~~~~~ 224 (273) ..+...+...+.+. -.++++|..+..|.+-..... ..... ...+|.-++++|++|+.++.+|.+. ++.+.. T Consensus 290 ~~~~~~~~~~~~~~--~~~v~n~~~~~~L~~lkd~~G---~~i~~-~~~~~~~~~l~G~pV~~~~~~p~~~---~~~gd~ 360 (418) T protein:vir:10 290 RLALLQAVLAEFPA--TGIVLNPIDWASIELTKDSQG---RYIVG-NPVNGTTPRLWNLPVVETQAMTANE---FLVGAF 360 (418) T ss_pred HHHHHhhccccCCC--CEEEEcHHHHHHHHHhhcCCC---ceecc-ccccCCCceecceeeEEcCCCCCCc---EEEeec Confidence 88887777665432 258899999988865321100 01111 1234666889999999999998542 444443 Q ss_pred c-eeEEee-eeeeehhhcCCC----ceeeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 225 S-AAAYVS-QIDTVEALRDQD----SFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 225 ~-a~~~a~-~~~~~e~~~~~~----~~~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) + ++-... +...++..+... +-...+++.+++|..+.+|++++.++.+.+ T Consensus 361 s~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~~~d~~~~~~~a~~~~~~~~~ 415 (418) T protein:vir:10 361 SMAAQIFDRMEIEVLLSTENVDDFEKNMVSIRAEERLALAVYRPESFVTGALVEQ 415 (418) T ss_pred cceEEEEEecceEEEEecccchhhhcCceEEEEEEeeccEEecccceEEEEeccC Confidence 3 332222 222233222221 223478888999999999999987776666 No 97 >protein:vir:97053 Length: 390 # NCBI annotation: putative head protein # Family: family:all:585 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453565;genbank:gi:84662600;genbank:GeneID:5142468 Probab=99.39 E-value=2.9e-13 Score=89.31 Aligned_cols=254 Identities=13% Similarity=0.049 Sum_probs=157.9 Q ss_pred Cccc------hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcc-cceeecCCCcccCCCCCccceE Q lcl|NC_011288. 1 MAFN------NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAP-TVKDYKAAGRQTSADAISDTGV 73 (273) Q Consensus 1 MA~~------~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~-~~~~~~~~~~~~~~~~~~~~~~ 73 (273) |... .++|+-+...+++.+.....+.+++..- ...+.++++|..... .......++......+++.+.+ T Consensus 113 ~~~~~~~~~g~lip~~~~~~ii~~~~~~~~i~~~~~~~----~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i 188 (390) T protein:vir:97 113 ASTDAAGSAGALTTPNRLPGFITPPDARLTVRDLIGSG----RTDSALIEYVQETGFVNNAAIVAEGALKPESSLKFAKK 188 (390) T ss_pred hhcccccccccccchhhhHHHHHHHhhhhhhHhhccee----eccCCceEEEEEecCCcceeeecCCccccccccceeEE Confidence 2111 3567778888999999888887776431 123567888886542 2344455565555566777788 Q ss_pred EEEEeeeeecceEEchHHHHhhhHHHHH-HHHHHHHHHHHHHHHHHHHHHhhc---------ccccccccCCCHHHHHHH Q lcl|NC_011288. 74 DLLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADLLVDN---------GTALSGSAPTDADDAFDL 143 (273) Q Consensus 74 ~~~id~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~~D~~i~~~~~~~---------~~~~~~~~~~t~~~~~~~ 143 (273) ++++.+. +.-+.|++. ......++.+ +.++.++++++++|..++.--... +.........+....+++ T Consensus 189 ~~~~~k~-~~~~~is~e-ll~ds~~l~~~i~~~la~a~~~~~d~a~l~G~g~~~~p~Gi~~~~~~~~~~~~~~~~~~~d~ 266 (390) T protein:vir:97 189 TDTTHVI-AHTMKATRQ-ILSDAPQLASYMNNRLIRGLKVKEDAEILRGTGANDGLLGLIPQATTYAAPTTIAGATRVDQ 266 (390) T ss_pred EEeeeeE-EEeehhhHH-HHHhHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCccccceeeccccccccccccccchHHH Confidence 8887653 344566663 3333345666 567788999999999877421000 001111122234456788 Q ss_pred HHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhccccc--ceeeeeeeeeEeceEEEeeCccccCCCcEEEE Q lcl|NC_011288. 144 IATALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDA--AGLRAGTIGNLLGARIVESNNLRDTDDEQFVA 221 (273) Q Consensus 144 i~~a~~~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~--~~l~~G~ig~~~G~~v~~s~~l~~~~~~~~~~ 221 (273) +.++...+.....+.. .++++|..+..|.+- ++ ..|.. .....|.-++++|.+|+.++.+|.+ .++. T Consensus 267 ~~~~~~~~~~~~~~~~--~~v~n~~~~~~L~~l----kd--~~G~~l~~~~~~~~~~~l~G~pV~~~~~~~~~---~~~~ 335 (390) T protein:vir:97 267 LRLAMLQASLAEYPAS--GIVINPIDWAAIELA----KD--ANNQYLIGNARGTLTPTLWGLPVVATQAMAPG---EFLV 335 (390) T ss_pred HHHHHHhhccccCCCC--EEEEcHHHHHHHHHh----hc--CCCceeecCccCCCCceecceeeEEcCCCCCC---cEEE Confidence 8889888888877533 578999999988653 21 11110 0112344578999999999999854 2444 Q ss_pred Ec-CceeEEee-eeeeehhhcCCCce-e--eeEEeeeeeeeEEecCceEEEEecC Q lcl|NC_011288. 222 FH-PSAAAYVS-QIDTVEALRDQDSF-S--DRIRALHVYGGKVVRPTGVVVFNKT 271 (273) Q Consensus 222 ~~-~~a~~~a~-~~~~~e~~~~~~~~-~--~~v~~~~~~g~~v~~~~~~v~~~~~ 271 (273) +. +.+.-.+. +...++..+....| . ..+++.++||..+.+|++++++.-+ T Consensus 336 gd~~~~~~~~~~~~~~i~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~v~~~~a 390 (390) T protein:vir:97 336 GAFDLAAQIFDQWDARVEIGYVNDDFQRNMVTVLAEERLALVVYRPEALITGSFA 390 (390) T ss_pred EeccceEEEEEecceEEEEeecccccccCcEEEEEEEeeccEEeccccEEEEEeC Confidence 43 33443333 23345544443333 2 3588889999999999999999888 No 98 >protein:vir:94673 Length: 419 # NCBI annotation: major capsid protein # Family: family:all:585 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579208;genbank:gi:93007444;genbank:GeneID:5076792 Probab=99.39 E-value=3.4e-13 Score=88.91 Aligned_cols=260 Identities=15% Similarity=0.060 Sum_probs=153.2 Q ss_pred CccchhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCccc--------ceeecCCCcccCCCCCccce Q lcl|NC_011288. 1 MAFNNFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPT--------VKDYKAAGRQTSADAISDTG 72 (273) Q Consensus 1 MA~~~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~--------~~~~~~~~~~~~~~~~~~~~ 72 (273) -....+.|+.+...+....+..+++..+++. ....+.++++|+....+ ......++...+..++..+. T Consensus 130 ~~~~~~~p~~~~~~i~~~~~~~~~i~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~ 205 (419) T protein:vir:94 130 NPNVPHLPQLVPGIVPTTPDLPLLVADLLDQ----QNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDT 205 (419) T ss_pred CCcccccchhhhHHHHHHHhhhhhhhhccee----eeccCCceeeeeeccccccccccCcccceecCCccccccccceee Confidence 1122356888888888777777666666643 12235567777643221 12223344444445566777 Q ss_pred EEEEEeeeeecceEEchHHHHhhhHHHHH-HHHHHHHHHHHHHHHHHHHH---------Hhhccc----ccccccCCCHH Q lcl|NC_011288. 73 VDLLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADL---------LVDNGT----ALSGSAPTDAD 138 (273) Q Consensus 73 ~~~~id~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~~D~~i~~~---------~~~~~~----~~~~~~~~t~~ 138 (273) +++++.+. +.-+.|+.. ..+...++++ +.++++++++.++|..++.= +...+. ........+.. T Consensus 206 i~~~~~k~-~~~~~is~e-ll~d~~~l~~~i~~~la~a~~~~~d~aii~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~t~~ 283 (419) T protein:vir:94 206 ITTTLKTV-AHWLPITRQ-AADDNSQLMGYIQGRLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPKPTAPATDE 283 (419) T ss_pred EEeeeeeE-EEeehhhHH-HHHhHHHHHHHHHHHHHHHHHHHHHHHHHhccCcccccceecccccccccccccccccccc Confidence 77777554 334566653 3333345766 45678899999999998731 000000 00112233445 Q ss_pred HHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCccccCCCcE Q lcl|NC_011288. 139 DAFDLIATALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQ 218 (273) Q Consensus 139 ~~~~~i~~a~~~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~~~~~ 218 (273) ..+++|.++...+.....+ .-.++++|..+..|......-... .. ....+..|..++++|++|+.++.+|.+. T Consensus 284 ~~~~~l~~~~~~~~~~~~~--~~~~v~n~~~~~~l~~~k~~~~~~-~~-~~~~~~~~~~~~l~G~pV~~~~~~~~~~--- 356 (419) T protein:vir:94 284 PPLVDIRRAKTVAEIAGFP--PDGVVVHPQDWESIELDQAPGSGV-FR-VIANVQGEATPRIWGLNVVSTVAIAQGT--- 356 (419) T ss_pred hhHHHHHHHHHhhhhccCC--CCEEEEcHHHHHHHHHHhhcCCCc-ee-ecCCcccCCCccccceeeEEcCCCCCcc--- Confidence 6788999999888877654 336899999999886542211110 00 1112345667799999999999998542 Q ss_pred EEEE-cCceeEEee-eeeeehhhcCC-Cce---eeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 219 FVAF-HPSAAAYVS-QIDTVEALRDQ-DSF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 219 ~~~~-~~~a~~~a~-~~~~~e~~~~~-~~~---~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) ++.| .+.+..... +...++..+.. +.| ...+++.+++|.++++|++++.++-+++ T Consensus 357 ~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~~~~~aa 417 (419) T protein:vir:94 357 ALVGGFRQGATLWSRQGITVLMTDSHADFFTANTLVILAEFRANLAVYQPKAFVRVTFAAA 417 (419) T ss_pred EEEeeccceEEEEEecceEEEEeccccchhhcCcEEEEEEEeeccEEeccccEEEEEeccC Confidence 3333 333332222 22233333222 122 3578999999999999999998877777 No 99 >protein:vir:2344 Length: 397 # NCBI annotation: gp14 # Family: family:all:507 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075281;genbank:gi:12657868;genbank:GeneID:920118 Probab=99.38 E-value=2e-13 Score=90.19 Aligned_cols=264 Identities=11% Similarity=-0.005 Sum_probs=153.4 Q ss_pred Cccc-------hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeecCCCcccCCCCCccceE Q lcl|NC_011288. 1 MAFN-------NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGV 73 (273) Q Consensus 1 MA~~-------~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~~~~~~~~~~~~ 73 (273) |+.+ .+.|+ +...+++.+++.+++.+++.+- .-.+.+++||+....+......++......++..+.+ T Consensus 10 ~~~~~t~~~~g~l~~~-~~~~ii~~l~~~s~i~~l~~~~----~~~~~~~~ip~~~~~~~a~wv~Eg~~~~~s~~~f~~v 84 (397) T protein:vir:23 10 IAQTKDTMFTGYLDPV-QAKDYFAEAEKTSIVQRVAQKI----PMGATGIVIPHWTGDVSAQWIGEGDMKPITKGNMTKR 84 (397) T ss_pred HhhccCCCCccccchh-HHHHHHHHHHhccchhhhccee----eccCCceEEEEEcCCcceEEecCCccccccccceeEE Confidence 4433 34455 5567788888888887776431 1235678999876655555566666666667777778 Q ss_pred EEEEeeeeecceEEchHHHHhhhHHHHH-HHHHHHHHHHHHHHHHHHHHHhhcc------cccccccCCCHHHHHHHHHH Q lcl|NC_011288. 74 DLLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADLLVDNG------TALSGSAPTDADDAFDLIAT 146 (273) Q Consensus 74 ~~~id~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~~D~~i~~~~~~~~------~~~~~~~~~t~~~~~~~i~~ 146 (273) ++++.+ .+.-+.|++.-..+...+++. +.++..+++++++|+.++.--.... ..............++.+.+ T Consensus 85 ~l~~~k-~~~~v~iS~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l~G~gt~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 163 (397) T protein:vir:23 85 DVHPAK-IATIFVASAETVRANPANYLGTMRTKVATAIAMAFDNAALHGTNAPSAFQGYLDQSNKTQSISPNAYQGLGVS 163 (397) T ss_pred EEeeEE-EEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhcccCCcccccccccccceeeecccchhHHHHH Confidence 877744 344466666544455566765 5678889999999998873111000 00001111222334566777 Q ss_pred HHHHHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhc--ccccceeeeeeeeeEeceEEEeeCccccCCCcEEEEEcC Q lcl|NC_011288. 147 ALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADT--SGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHP 224 (273) Q Consensus 147 a~~~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~--~~~~~~l~~G~ig~~~G~~v~~s~~l~~~~~~~~~~~~~ 224 (273) +...|.....+ .-..+++|..+..|.+......+.-+ .........+..+++.|++++.++++|.+.. .++.+.. T Consensus 164 ~~~~l~~~~~~--~a~~vmn~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~~~~~tl~G~Pv~~s~~~~~g~~-~~~~gDf 240 (397) T protein:vir:23 164 GLTKLVTDGKK--WTHTLLDDTVEPVLNGSVDANGRPLFVESTYESLTTPFREGRILGRPTILSDHVAEGDV-VGYAGDF 240 (397) T ss_pred HHHhhhhcccC--CCEEEEcHHHHHHHHHhhccCCceeecccccccccccccCceeeeeeEEEeCCCCCCce-EEEEeec Confidence 77777766643 34689999999988763211100000 0001111122346899999999999986432 2233322 Q ss_pred ceeEEeee-eeeehhhcC-------------CCce---eeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 225 SAAAYVSQ-IDTVEALRD-------------QDSF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 225 ~a~~~a~~-~~~~e~~~~-------------~~~~---~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) +-+-+... -..++..++ .+.| ...+++.+++|.++++|++++.++.+.. T Consensus 241 s~~~i~~~~~i~i~~~~e~~~~~~~~~~~~~~~lf~~d~v~~ra~~r~d~~v~~~~a~~~~~~~~~ 306 (397) T protein:vir:23 241 SQIIWGQVGGLSFDVTDQATLNLGSQESPNFVSLWQHNLVAVRVEAEYGLLINDVNAFVKLTFDPV 306 (397) T ss_pred ceEEEEEEeceEEEEeeeeeeeeccccccceeeeeeccceeEEEEeeeccceecccceEEEeeccc Confidence 22212211 111221111 1122 2467888999999999999998887555 No 100 >protein:vir:6242 Length: 390 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813696;swissprot:trembl:q859c1;genbank:gi:29366756;interpro:IPR006444;uniprot:Q859C1;genbank:GeneID:1258897 Probab=99.38 E-value=1.5e-13 Score=90.85 Aligned_cols=260 Identities=13% Similarity=0.050 Sum_probs=154.2 Q ss_pred Cccc-hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeecCCCcccCCCCCccceEEEEEee Q lcl|NC_011288. 1 MAFN-NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLLIDQ 79 (273) Q Consensus 1 MA~~-~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~id~ 79 (273) -++- .+.|+++...+.+.++...++..++++- ....|..+.||+....+......++...+..++..+.++++..+ T Consensus 116 ~~~g~~~~~~~~~~~i~~~~~~~~~l~~~~~~~---~~~~~~~~~~p~~~~~~~a~wv~E~~~~~~~~~~f~~i~~~~~k 192 (390) T protein:vir:62 116 AGNPNVLSRTLYGQLIAQAVERSAIMRGGATTF---TTSDANPLDFTVITGRSSASIVGETAEIPESYPATAQRSMGGFK 192 (390) T ss_pred cCCCccccccchHHHHHHHHhhhhhhhhcceee---ecCCCceeEEEEEcCCcceeeecccccccccccceeeeEeeeee Confidence 1111 3567778777777777777776665431 12346678899876554455555666565566777778777754 Q ss_pred eeecceEEchHHHHhhhHHHHH-HHHHHHHHHHHHHHHHHHH-------HHhhcccc-cccccCCCHHHHHHHHHHHHHH Q lcl|NC_011288. 80 EKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIAD-------LLVDNGTA-LSGSAPTDADDAFDLIATALKE 150 (273) Q Consensus 80 ~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~~D~~i~~-------~~~~~~~~-~~~~~~~t~~~~~~~i~~a~~~ 150 (273) . +.-+.|++.-..++..++.+ +.+..+++++.++|..++. .+...... ............+++|.++... T Consensus 193 ~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~l~G~G~p~Gi~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~ 271 (390) T protein:vir:62 193 Y-GFASVVSYEFATDQVLDLVGFLVSDAGPAIGDAMGRHFITGTGQPRGILTDASPATATFLATDTDSKVSDALIDLFHE 271 (390) T ss_pred E-EeehHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHhhhhccCCccccccccccccccceecccccccchHHHHHHHHh Confidence 3 34455665444445556766 5567888999999998774 11111110 0011111223457788888777 Q ss_pred HhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhh-cccccceeeeeeeeeEeceEEEeeCccccCCCcEEEEEcCceeEE Q lcl|NC_011288. 151 LTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSAD-TSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAY 229 (273) Q Consensus 151 l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~-~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~~~~~~~~~~~~a~~~ 229 (273) |+.... .+-..|++|..+..|.+- ++.+ ..-....+..|.-+.+.|++|+.++.+|.. .++.|.-+.... T Consensus 272 l~~~~~--~~a~~vmn~~~~~~L~~l----kd~~g~~l~~~~~~~g~~~~l~G~Pv~~~~~~p~~---~i~~gd~s~~~i 342 (390) T protein:vir:62 272 VPSAYR--ANAKYVVNDLRAAQMRKL----KDANGQYLWQSGLTVGAPSLFNGKVVETDDGMPAD---KILFADLSKYRV 342 (390) T ss_pred hhhhhh--cCCEEEEchHHHHHHHHh----hccCCCeeecCCcCCCccceecccceEEecCCCCc---cEEEeeccceeE Confidence 765543 344678999999888542 2111 011112244566678999999999999853 233343322222 Q ss_pred eee-eeeehhhcCCCc--eeeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 230 VSQ-IDTVEALRDQDS--FSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 230 a~~-~~~~e~~~~~~~--~~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) ... -..++...+... -...+++.+++|+++++|+++++|+-+++ T Consensus 343 ~~~~~~~v~~~~~~~~~~~~~~~~~~~r~d~~~~~~~A~~~l~~~~~ 389 (390) T protein:vir:62 343 RFAGSLRVDRSVDAKFSTDQIVYRFLQRADGLLVDARGAKVLTVTPG 389 (390) T ss_pred EeecceEEEeeccccccCCcEEEEEEEEeCcEeechhheEEEEeecC Confidence 111 112222222211 13467899999999999999998888877 No 101 >protein:vir:485 Length: 407 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543092;swissprot:trembl:q8w627;genbank:gi:18249904;uniprot:Q8W627;genbank:GeneID:929693 Probab=99.36 E-value=2.7e-13 Score=89.51 Aligned_cols=262 Identities=15% Similarity=0.067 Sum_probs=152.5 Q ss_pred Cccc------hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeecCCCcccCC-CCCccceE Q lcl|NC_011288. 1 MAFN------NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSA-DAISDTGV 73 (273) Q Consensus 1 MA~~------~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~~~~-~~~~~~~~ 73 (273) |... .++|+-|..++++.+++.+++.++++.- ...+.++.+|.....+......++..... .......+ T Consensus 106 ~~~~t~~~gG~~iP~~~~~~I~~~~~~~~~l~~~~~~~----~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~f~~i 181 (407) T protein:vir:48 106 LQVGNDEDGGYAIPEELDRTILTLLKDEVVMRQEATVI----TLGGSDYKKLVNLGGTTSGWVGETDARPETATSKLGLI 181 (407) T ss_pred hhcccCCCCcccccHhHHHHHHHHHHhhhhhhhhceee----ecCCCceEEEEecCCcceeeecccccccccccccceeE Confidence 3222 3789999999999999998887776531 12244677776544333344444443322 23455666 Q ss_pred EEEEeeeeecceEEchHHHHhhhHHHHH-HHHHHHHHHHHHHHHHHHHH---------Hhhccccc-----------ccc Q lcl|NC_011288. 74 DLLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADL---------LVDNGTAL-----------SGS 132 (273) Q Consensus 74 ~~~id~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~~D~~i~~~---------~~~~~~~~-----------~~~ 132 (273) ++.+.+. ..-+.|++.-..++..++.. +.++.+++++.++|..++.= +....... ... T Consensus 182 ~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~~~a~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~ 260 (407) T protein:vir:48 182 EPFMGEI-YGNPQATQKMLDDAFFNVEDWINSELALEFAEQEEIAFTSGDGSKKPKGFLAYESTDEDDKTRAFGKLQHIA 260 (407) T ss_pred Eeeeeee-EeehhhHHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhhccCCCCccceeeecccccccccccccccccccc Confidence 6666443 22345555444445556766 66788899999999886631 10000000 001 Q ss_pred cCCCHHHHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCccc Q lcl|NC_011288. 133 APTDADDAFDLIATALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLR 212 (273) Q Consensus 133 ~~~t~~~~~~~i~~a~~~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~ 212 (273) +.......+++|.++...|.....+ +-..+++|..+..|.+....-. ...-...+..|..++|+|.+|+.++.+| T Consensus 261 ~~~~~~~~~d~i~~l~~~l~~~~~~--~a~~v~n~~~~~~L~~lkD~~G---r~l~~~~~~~g~~~~l~G~PV~~~~~~p 335 (407) T protein:vir:48 261 SGAASGVTADAIIKLIYTLRKAHRS--GAKFMMNNSSLFAIRLLKDNDG---NYLWRPGIELGQPSSLAGYGIVENEQMP 335 (407) T ss_pred cccccccChHHHHHHHHhhchhhhc--CCEEEEcHHHHHHHHHhhccCC---ceeeccCcCCCCCceecceeeEEecCcC Confidence 1122233578888888888766543 3357899999988855321110 1111122445777899999999999998 Q ss_pred cCC-CcEE-EEEcC-ceeEEeeeeeeehhhcCCC--ceeeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 213 DTD-DEQF-VAFHP-SAAAYVSQIDTVEALRDQD--SFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 213 ~~~-~~~~-~~~~~-~a~~~a~~~~~~e~~~~~~--~~~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) ..+ +..+ +.|.- .++...... .++..++.- .--..+++.+++|+++++|++++.++-+++ T Consensus 336 ~~~~~~~~i~~Gd~~~~~~i~~~~-~~~i~~d~~~~~~~~~~~~~~r~d~~v~~~~a~~~l~~~aa 400 (407) T protein:vir:48 336 DIAADAKAIAFGNFKRGYTIVDRI-GTRILRDPYTNKPFVGFYTTKRTGGMLVDSQAIKLMKIGAA 400 (407) T ss_pred CccCCccEEEEEeccccEEEEEee-ceEEEeeccccCCcEEEEEEEEeccEEecccceEEEEeecc Confidence 633 2333 33432 233222211 122223211 223468899999999999999999887777 No 102 >protein:vir:1328 Length: 392 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047927;swissprot:trembl:q9zwv6;genbank:gi:9631145;uniprot:Q9ZWV6;genbank:GeneID:2715889 Probab=99.36 E-value=4e-13 Score=88.54 Aligned_cols=260 Identities=13% Similarity=0.045 Sum_probs=152.2 Q ss_pred Cccc---hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeecCCCcccCCCCCccceEEEEE Q lcl|NC_011288. 1 MAFN---NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLLI 77 (273) Q Consensus 1 MA~~---~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i 77 (273) ++.. .+.|++|...+.+.+....++..+++.- ....|..+.+|............++......++..+.+++.. T Consensus 114 t~~~~g~~~~~~~~~~~i~~~~~~~~~l~~~~~~~---~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~f~~v~~~~ 190 (392) T protein:vir:13 114 TKAGNPNVLSRTLYGQLIAQAVERSAIMRGGASTF---TTSDANPMDFTVITGRATAGIVGETAEIPESYPATTQRSMGG 190 (392) T ss_pred cccCCCccccccchHHHHHHHHhhhhhhhhcceee---ecCCCceeEEEEEcCCcceeeecccccccccccceeeEEeee Confidence 2221 3457777777777667666665555321 123466788888765444444555555555567777777777 Q ss_pred eeeeecceEEchHHHHhhhHHHHH-HHHHHHHHHHHHHHHHHHH---------HHhhccccc-ccccCCCHHHHHHHHHH Q lcl|NC_011288. 78 DQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIAD---------LLVDNGTAL-SGSAPTDADDAFDLIAT 146 (273) Q Consensus 78 d~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~~D~~i~~---------~~~~~~~~~-~~~~~~t~~~~~~~i~~ 146 (273) .+. +.-+.|++.-..+...++.. +.+..+++++.++|..++. .+....... ..+........+++|.+ T Consensus 191 ~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~l~G~Gt~~p~Gil~~~~~~~~~~~~~~~~~~~~d~l~~ 269 (392) T protein:vir:13 191 FKY-GFASVVSYEFATDQVLDLVGFLVSDAGPAIGDAMGRHFLTGTGTGQPRGILTDATGANAAFGEADADSKVSDALID 269 (392) T ss_pred eeE-EeeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhcccCCccccccccccccccccccccccccccHHHHHH Confidence 543 33445665444445556766 5577889999999998873 111111000 01111222345778888 Q ss_pred HHHHHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhh-cccccceeeeeeeeeEeceEEEeeCccccCCCcEEEEEcCc Q lcl|NC_011288. 147 ALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSAD-TSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPS 225 (273) Q Consensus 147 a~~~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~-~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~~~~~~~~~~~~ 225 (273) +...|..... .+-..|++|..+..|..- ++.+ ..-....+..|.-++|.|++|+.++.+|.. .++.|+-+ T Consensus 270 ~~~~l~~~~~--~~a~~v~n~~~~~~l~~l----kd~~G~~l~~~~~~~g~~~~l~G~Pv~~~~~~~~~---~i~~Gdf~ 340 (392) T protein:vir:13 270 LFHEVPSAYR--KNAKFVVNDLRAAQMRKL----KDANGQYLWQSALTVGAPDTFNGKVVETDDGMPAD---KVLFADLS 340 (392) T ss_pred HHHhhhhhhh--cCCEEEEcHHHHHHHHHh----hccCCceeecCCcCCCCCceecceeeEEcCCCCCC---cEEEeecc Confidence 7777765432 233568899999888642 2110 010112234455568999999999999853 34455544 Q ss_pred eeEEeee-eeeehhhcCCCce--eeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 226 AAAYVSQ-IDTVEALRDQDSF--SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 226 a~~~a~~-~~~~e~~~~~~~~--~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) .+..... -..++..++.... .+.+++..++|+++.+|+++++++-+.+ T Consensus 341 ~~~i~~~~~~~i~~~~~~~~~~~~~~~r~~~r~d~~~~~~~A~~~~~~~~a 391 (392) T protein:vir:13 341 KYRVRFAGSLRVDRSVDAKFSTDQIVYRFLQRADGLLVDARGAKVLTVTPA 391 (392) T ss_pred ceeEEeecceEEEeeccccccCCcEEEEEEEEeccEEecccceEEEEeecc Confidence 3332221 1123222222221 2578999999999999999998887777 No 103 >protein:vir:81070 Length: 390 # NCBI annotation: p09 # Family: family:all:585 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285679;genbank:gi:148727187;genbank:GeneID:5247115 Probab=99.35 E-value=6.9e-13 Score=87.26 Aligned_cols=256 Identities=12% Similarity=0.016 Sum_probs=155.3 Q ss_pred Cccc------hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcc-cceeecCCCcccCCCCCccceE Q lcl|NC_011288. 1 MAFN------NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAP-TVKDYKAAGRQTSADAISDTGV 73 (273) Q Consensus 1 MA~~------~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~-~~~~~~~~~~~~~~~~~~~~~~ 73 (273) +... .++|+-+...+++.+.+.+++.+++..- ...+.++++|+.... .......++......++..+.+ T Consensus 113 ~~~~~~~~~g~~~~~~~~~~ii~~~~~~~~l~~~~~~~----~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i 188 (390) T protein:vir:81 113 ASTDAAGSAGALTTPNRLPGFITPPDARLTVRDLIGSG----RTDSALIEYVQETGFVNNAAIVAEGALKPESSLKFAKK 188 (390) T ss_pred hccccccCCcceechhhhHHHHHHHhhhhhhhhhccee----eccCCceEEEEEecCCcceeeecCCcccccccceeeEE Confidence 1111 2455557788899999988888877542 223567888886542 2333444555555556777777 Q ss_pred EEEEeeeeecceEEchHHHHhhhHHHHH-HHHHHHHHHHHHHHHHHHHHHhhc---------ccccccccCCCHHHHHHH Q lcl|NC_011288. 74 DLLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADLLVDN---------GTALSGSAPTDADDAFDL 143 (273) Q Consensus 74 ~~~id~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~~D~~i~~~~~~~---------~~~~~~~~~~t~~~~~~~ 143 (273) ++++.+. +.-+.|++. ......++.+ +.++.++++++++|..++.--... +.........+....+++ T Consensus 189 ~~~~~k~-~~~~~is~e-ll~d~~~~~~~i~~~l~~~~~~~~d~a~l~G~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~ 266 (390) T protein:vir:81 189 TDTTHVI-AHTMKATRQ-ILSDAPQLASYMNNRLIRGLKVKEDAEILRGTGANDGLLGLIPQATTYAAPTTIAGATRVDQ 266 (390) T ss_pred EEeeeEE-EEeehhhHH-HHHhHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCcccceeecccccccccccccchhHHH Confidence 7777554 344566663 3333345666 556788999999999877321100 001111122333456788 Q ss_pred HHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCccccCCCcEEEEEc Q lcl|NC_011288. 144 IATALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFH 223 (273) Q Consensus 144 i~~a~~~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~~~~~~~~~~ 223 (273) |..+...+...+.+.. .+|++|..+..|.+-...-. .... .. ...|.-++++|.+|+.++.+|.+. ++.|. T Consensus 267 ~~~~~~~~~~~~~~~~--~~v~~~~~~~~l~~lkd~~G-~~l~--~~-~~~~~~~~l~G~pv~~~~~~p~~~---~~~gd 337 (390) T protein:vir:81 267 LRLAMLQASLAEYNPS--GIVINPIDWAAIELAKDANN-QYLI--GN-ARGTLTPTLWGLPVVATQAMAPGE---FLVGA 337 (390) T ss_pred HHHHHHhhccccCCCC--EEEEcHHHHHHHHHhhcCCC-ceee--cC-cccccCceecceeeEEcCCCCCCc---EEEEe Confidence 9888888887776433 57899999998865321100 0011 11 123445689999999999998542 34443 Q ss_pred -CceeEEee-eeeeehhhcCCCcee---eeEEeeeeeeeEEecCceEEEEecC Q lcl|NC_011288. 224 -PSAAAYVS-QIDTVEALRDQDSFS---DRIRALHVYGGKVVRPTGVVVFNKT 271 (273) Q Consensus 224 -~~a~~~a~-~~~~~e~~~~~~~~~---~~v~~~~~~g~~v~~~~~~v~~~~~ 271 (273) +.++.... +...++..+....|. ..+++.+++|.++.+|+++|+++-+ T Consensus 338 ~~~~~~~~~~~~~~v~~~~~~~~~~~~~v~~r~~~r~d~~v~~~~a~v~~t~a 390 (390) T protein:vir:81 338 FDLAAQIFDQWDARVEIGYVGEDFQRNMITVLAEERLALVVYRPEALISGSFA 390 (390) T ss_pred hhceEEEEEecceEEEEecccchhhcCcEEEEEEEeeccEEecccceEEEEeC Confidence 33443333 222344444333332 4688999999999999999999988 No 104 >protein:vir:100247 Length: 425 # NCBI annotation: gp76 # Family: family:all:21 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355412;genbank:gi:77864702;genbank:GeneID:3725969 Probab=99.35 E-value=4e-13 Score=88.54 Aligned_cols=262 Identities=13% Similarity=0.051 Sum_probs=153.3 Q ss_pred Cccc------hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeecCCCcccCCCC-CccceE Q lcl|NC_011288. 1 MAFN------NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSADA-ISDTGV 73 (273) Q Consensus 1 MA~~------~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~~~~~~-~~~~~~ 73 (273) |... .++|+-|...+++.++..+++.++++.- ...+...++|.....+......++......+ ...+.+ T Consensus 130 l~~~t~~~gG~lvP~~~~~~ii~~~~~~s~l~~l~~~~----~~~~~~~~~~~~~~~~~a~wv~E~~~~~~~~~~~f~~v 205 (425) T protein:vir:10 130 LNKGEDSEGGYLTPIEWDRTITNKLVLISPMRQLCRVQ----PVSKAGFSKLFNMGGTTSGWVGEASQRPQTNAATFQPL 205 (425) T ss_pred hhcCcCCCCceeccHhHHHHHHHHHHhhhhhhhhceee----eccCCceEEEEEcCCcceeeecccccccccccccccee Confidence 3222 3789999999999999999888887532 1224456777654443344444444332222 345566 Q ss_pred EEEEeeeeecceEEchHHHHhhhHHHHH-HHHHHHHHHHHHHHHHHHHH---------Hhhccccc-----------ccc Q lcl|NC_011288. 74 DLLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADL---------LVDNGTAL-----------SGS 132 (273) Q Consensus 74 ~~~id~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~~D~~i~~~---------~~~~~~~~-----------~~~ 132 (273) ++...+. +.-+.|+..-..++..++.+ +.++++++++.++|..++.= +....... ... T Consensus 206 ~~~~~k~-~~~i~iS~ell~ds~~~l~~~i~~~la~ai~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~ 284 (425) T protein:vir:10 206 SFASGEI-YANPAATQQILDDAEIDLESWLATEVQTEFAKQEGKAFLAGDGTNKPNGLLTYIAGGANAAKHPFGAIEVVN 284 (425) T ss_pred eeeheee-EeehHhHHHHHhcchhHHHHHHHHHHHHHHHHHHHhhhhcccCCCCcceeeecccccccccccccccccccc Confidence 6665332 33345555433444566766 56888999999999987731 11100000 001 Q ss_pred cCCCHHHHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCccc Q lcl|NC_011288. 133 APTDADDAFDLIATALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLR 212 (273) Q Consensus 133 ~~~t~~~~~~~i~~a~~~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~ 212 (273) +..+....+++|.++...|..... .+-..+++|..+..|.+....-. .......+..|.-++++|.+|+.++++| T Consensus 285 ~~~~~~~~~d~l~~l~~~l~~~~~--~~a~~vmn~~~~~~L~~lkD~~G---~~l~~~~~~~g~~~~l~G~PV~~~~~~p 359 (425) T protein:vir:10 285 SGAAADITSDGIIDLVYDLPSAFT--GNARFAMNRNTQRQVRKLKDGQG---NYLWQPSYVAGQPATLAGYPVTEVPDMP 359 (425) T ss_pred ccccccccHHHHHHHHhhhhhhhc--cCCEEEEchHHHHHHHHhhcCCC---ceeeccCccCCCCceecceeeEEecCcC Confidence 112333467778887777765443 34467999999988865321110 1111123445666789999999999998 Q ss_pred cCCC-cE-EEEEc-CceeEEeeeeeeehhhcCC--CceeeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 213 DTDD-EQ-FVAFH-PSAAAYVSQIDTVEALRDQ--DSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 213 ~~~~-~~-~~~~~-~~a~~~a~~~~~~e~~~~~--~~~~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) .... .. ++.|. +.++...... .++..++. .+--..+++.+++|.++++|+++++++-.+| T Consensus 360 ~~~~~~~~i~~Gd~~~~~~i~~~~-~~~v~~d~~~~~~~~~~~~~~r~d~~v~~~~A~~~l~~~as 424 (425) T protein:vir:10 360 DVAANSTPILFGDFQQTYLIIDRI-GVRVLRDPYTAKPYVLFYTTKRVGGGLLNPEPMRAMKVAAS 424 (425) T ss_pred CccCCccEEEEEehhccEEEEEec-ceEEEecccccCCcEEEEEEEEeccEeecccceEEEEeecc Confidence 5332 22 33343 3333222211 12222221 1223578899999999999999999999999 No 105 >protein:vir:4511 Length: 409 # NCBI annotation: capsid # Family: family:all:21 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599037;genbank:gi:19548995;genbank:GeneID:935211 Probab=99.35 E-value=2.6e-13 Score=89.56 Aligned_cols=266 Identities=11% Similarity=0.024 Sum_probs=150.7 Q ss_pred Cccc------hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcc-cceeecCCCcccCCCCCccceE Q lcl|NC_011288. 1 MAFN------NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAP-TVKDYKAAGRQTSADAISDTGV 73 (273) Q Consensus 1 MA~~------~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~-~~~~~~~~~~~~~~~~~~~~~~ 73 (273) |... .++|+.|...+++.+++.+++.++++.- ....+..+.+|..... .......++......++....+ T Consensus 117 ~~~~~~~~gg~liP~~~~~~ii~~~~~~~~l~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~f~~~ 193 (409) T protein:vir:45 117 QGVAQDEKGGYTVPETFLAKVVEKMKSYGGIASVAQIL---TTSDGRTMEWATADGTSEVGVLLGENEEAGEEDTDFGMG 193 (409) T ss_pred ccCccCcCCceeccHhHHHHHHHHHHhhhhhhhhceee---ecCCCceEEEEeeccCcccccccccccccccccccccee Confidence 3322 3789999999999999888887776542 1223556666665432 2333445555444445555555 Q ss_pred EEEEeeeeecceEEchHHHHhhhHHHHH-HHHHHHHHHHHHHHHHHHHHHhh----cccc------cccccCCCHHHHHH Q lcl|NC_011288. 74 DLLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADLLVD----NGTA------LSGSAPTDADDAFD 142 (273) Q Consensus 74 ~~~id~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~~D~~i~~~~~~----~~~~------~~~~~~~t~~~~~~ 142 (273) ++...+....-+.|++.-..++..++++ +.++.+++++.++|..++.--.. .+.. .......+....++ T Consensus 194 ~l~~~k~~~~~i~is~ell~ds~~~l~~~i~~~la~a~~~~~~~a~l~G~G~~~~~~p~Gil~~~~~~~~~~~~~~~~~d 273 (409) T protein:vir:45 194 SLGALKMTSKIIRVSNELLQDSAIDMEAYLARRIAERIGRGEARYLIQGTGAGTPKQPKGLAASVTGTTQTAAANAVKWQ 273 (409) T ss_pred eeeeeeeeeeehhhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCccccceeeeccccccccccccccchH Confidence 5443222122245666444444556766 56778899999999887731100 0000 00111222334577 Q ss_pred HHHHHHHHHhhcCCCccCCE-EEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCccccCC-CcE-E Q lcl|NC_011288. 143 LIATALKELTKANVPNVGRV-VVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTD-DEQ-F 219 (273) Q Consensus 143 ~i~~a~~~l~~~~vP~~~r~-lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~~-~~~-~ 219 (273) +|.++...|..... ....| +++++..+..|.+-...-. .......+.+|.-+++.|.+|+.++.+|..+ +.. + T Consensus 274 ~i~~l~~~l~~~~~-~~a~~~~~~n~~~~~~l~~lkd~~G---~~i~~~~~~~~~~~~l~G~PV~~~~~~p~~~~~~~~i 349 (409) T protein:vir:45 274 EILALKHSIDPAYR-RGPKFRLAFNDNTLKLISEMEDGQG---RPLWLPDIVGVAPASVLNVPYVIDQEIDDIGAGKKFM 349 (409) T ss_pred HHHHHHHhhhhhhc-cCCeEEEEECHHHHHHHHHhhcCCC---ceeeccCcCCCCCceecceeeEEecCcCCccCCccEE Confidence 88888888866542 23355 5679999888754211100 1111122344556789999999999998632 222 3 Q ss_pred EEEcCceeEEee-eeeeehhhcCCCc--eeeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 220 VAFHPSAAAYVS-QIDTVEALRDQDS--FSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 220 ~~~~~~a~~~a~-~~~~~e~~~~~~~--~~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) +.|.-+...... +...++..++... -...+++..++|.++.+|+++++++..+| T Consensus 350 ~~Gd~~~~~i~~~~~~~~~~~~d~~~~~~~~~~~~~~r~d~~~~~~~A~~~l~~k~s 406 (409) T protein:vir:45 350 FCGDFDRFIIRRVRYMILKRLVERYAEYDQTGFLAFHRFDCILEDTSAIKALVGKGS 406 (409) T ss_pred EEeehhhhheeeccceEEEEeecccccCCcEEEEEEEEeccEeechhheEEEEeccC Confidence 334322221111 1112222232221 12468999999999999999999888777 No 106 >protein:vir:4456 Length: 401 # NCBI annotation: Major capsid protein precursor # Family: family:all:21 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700379;genbank:gi:23505451;genbank:GeneID:955658 Probab=99.34 E-value=3e-13 Score=89.25 Aligned_cols=262 Identities=15% Similarity=0.072 Sum_probs=151.9 Q ss_pred Cccc------hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeecCCCcccC-CCCCccceE Q lcl|NC_011288. 1 MAFN------NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTS-ADAISDTGV 73 (273) Q Consensus 1 MA~~------~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~~~-~~~~~~~~~ 73 (273) |+.. .++|+-|..++++.++..+++..+++.- ...|.+..+|.....+......++.... .+....+.+ T Consensus 107 ~~~~~~~~GG~~iP~~~~~~ii~~~~~~~~l~~~~~~~----~~~~~~~~~~~~~~~~~a~wv~E~~~~~~~~~~~~~~v 182 (401) T protein:vir:44 107 LQVGTDEDGGYAVPEELDRSILSLLKDEVVMRQEATVI----TVGGSDYKKLVNLGGTASGWVGETDTRSQTATSRLGLI 182 (401) T ss_pred hhcCCCCCCceeccHhHHHHHHHHHHhhhhhhhhceee----ecCCCceEEEEecCCccceeeccccccCccccccceee Confidence 4432 4789999999999999988887776531 1235566677654433333334443322 223455666 Q ss_pred EEEEeeeeecceEEchHHHHhhhHHHHH-HHHHHHHHHHHHHHHHHHHH---------Hhhccc-c----------cccc Q lcl|NC_011288. 74 DLLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADL---------LVDNGT-A----------LSGS 132 (273) Q Consensus 74 ~~~id~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~~D~~i~~~---------~~~~~~-~----------~~~~ 132 (273) ++...+. +.-+.|+..-..++..++.. +.+..+++++.++|..++.= +..... . .... T Consensus 183 ~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~la~ai~~~~~~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~ 261 (401) T protein:vir:44 183 EPFMGEI-YGNPQATQKMLDDAFFNVEAWINSELATEFAEQEEIAFTTGDGTKKPKGFLAYESTEESDKARAFGKLQHIV 261 (401) T ss_pred eeehhhe-eeehhhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhhccCCCCccceeeccccccccccccccccccccc Confidence 6666432 23345555433444556766 56778889999999887731 000000 0 0001 Q ss_pred cCCCHHHHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCccc Q lcl|NC_011288. 133 APTDADDAFDLIATALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLR 212 (273) Q Consensus 133 ~~~t~~~~~~~i~~a~~~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~ 212 (273) +.......+++|.++...|..... .+-..+++|..+..|.+....-. ...-...+.+|.-++++|.+|+.++++| T Consensus 262 t~~~~~~~~d~i~~~~~~l~~~~~--~~a~~v~n~~~~~~L~~lkd~~G---~~l~~~~~~~g~~~~l~G~PVv~~~~~p 336 (401) T protein:vir:44 262 SGEATAVTADAIIKLIYTLRKAHR--TGAKFMMNNNSLFAIRLLKDTEG---NYLWRPGLELGQPSSLAGYGIAENEQMP 336 (401) T ss_pred cccccccCHHHHHHHHHhcchhhh--cCCEEEEcHHHHHHHHHhhccCC---ceeecCCcCCCCCceecceeeEEecCcC Confidence 112223347888888887765442 34467899999998865321100 0111122345777789999999999998 Q ss_pred cCCC-cEE-EEEcC-ceeEEeeeeeeehhhcCCCc--eeeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 213 DTDD-EQF-VAFHP-SAAAYVSQIDTVEALRDQDS--FSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 213 ~~~~-~~~-~~~~~-~a~~~a~~~~~~e~~~~~~~--~~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) ..+. ..+ +.|.- .++....+.. ++..++... --..+++.+++|+.+++|++++.|+-++| T Consensus 337 ~~~~~~~~i~~Gd~~~~~~i~~~~~-~~~~~~~~~~~~~v~~~a~~r~d~~~~~~~a~~~l~~~aa 401 (401) T protein:vir:44 337 DIAADAKAIAFGNFKRGYTIVDRIG-TRILRDPYTNKPFVGFYTTKRTGGMLVDSQAIKLLKIAAA 401 (401) T ss_pred CccCCccEEEEeehhccEEEEEecc-eEEeeeccccCCcEEEEEEEEeccEEecccceEEEEeecC Confidence 6332 233 33442 3333332221 222222211 12467888999999999999999999888 No 107 >protein:vir:104256 Length: 458 # NCBI annotation: major head protein precursor # Family: family:all:27070 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006977;genbank:gi:46401878;genbank:GeneID:2777673 Probab=99.34 E-value=9.2e-13 Score=86.56 Aligned_cols=266 Identities=12% Similarity=0.050 Sum_probs=150.0 Q ss_pred Cccc---hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeecCCCcccC------CCCCccc Q lcl|NC_011288. 1 MAFN---NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTS------ADAISDT 71 (273) Q Consensus 1 MA~~---~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~~~------~~~~~~~ 71 (273) +... .++|+.|...+++.++...++..+++. ....|....+|+....+.......+.... ..+...+ T Consensus 165 ~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~----~~~~~~~~~~~~~~~~~~a~~v~e~~~~~~~~~~~~~~~~~~ 240 (458) T protein:vir:10 165 SSVEVSSESYETIFSQRIIRDLQKELVVGALFEE----LPMSSKILTMLVEPDAGKATWVAASTYGTDTTTGEEVKGALK 240 (458) T ss_pred ccCccccceehhhHhHHHHHHHHhhhhHHhhcce----eecCCcceEEEEecCCcceeecccccccccccccccccccce Confidence 1111 378999999999999999888777654 12235566777654443333333322221 1223344 Q ss_pred eEEEEEeeeeecceEEchHHHHhhhHHHHH-HHHHHHHHHHHHHHHHHHHH---------Hhhcccc-----cccccCCC Q lcl|NC_011288. 72 GVDLLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADL---------LVDNGTA-----LSGSAPTD 136 (273) Q Consensus 72 ~~~~~id~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~~D~~i~~~---------~~~~~~~-----~~~~~~~t 136 (273) .++++..+. +.-+.|++.-..++..++.+ +.+.+.++++.++|..++.= +...... ...+.... T Consensus 241 ~i~~~~~k~-~~~v~is~ell~ds~~~~~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~ 319 (458) T protein:vir:10 241 EIHFSTYKL-AAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMTGDGSGKPKGLLTLASEDSAKVVTEAKADGS 319 (458) T ss_pred eeEeeeeeE-EeeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCccceeeecccccccceeeccccccc Confidence 455444332 23346665433344556766 56788899999999988731 0000000 00011111 Q ss_pred HHHHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhh-hcccccceeeeeeeeeEeceEEEeeCccccCC Q lcl|NC_011288. 137 ADDAFDLIATALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSA-DTSGDAAGLRAGTIGNLLGARIVESNNLRDTD 215 (273) Q Consensus 137 ~~~~~~~i~~a~~~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~-~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~~ 215 (273) ....+++|.++...|..... .+-..+++|..+..|..-.....+. ...........|..+++.|++|+.++.+|..+ T Consensus 320 ~~~~~~~i~~~~~~l~~~~~--~~~~~v~~~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~l~G~pv~~~~~~p~~~ 397 (458) T protein:vir:10 320 VLVTAKTISKLRRKLGRHGL--KLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDSVKLQGQVGRIYGLPVVVSEYFPAKA 397 (458) T ss_pred ccccHHHHHHHHHhhhhhhc--CCCEEEEcHHHHHHHHhhcccCCceeeccccccccccCcCceecceeeEEcccccccc Confidence 22357888988888877664 3446799999998885422110000 00001122334566789999999999999754 Q ss_pred CcE--EEEEcCceeEEeeee-eeehhhcCCCceeeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 216 DEQ--FVAFHPSAAAYVSQI-DTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 216 ~~~--~~~~~~~a~~~a~~~-~~~e~~~~~~~~~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) +.. ++..-.+++...... ..++..+-...-...++...++|..+.+|+++|..+.++| T Consensus 398 ~~~~~~~~~f~~~~~~~~~~~~~v~~d~~~~~~~~~~~~~~r~~~~v~~~~a~v~~~~aa~ 458 (458) T protein:vir:10 398 NSAEFAVIVYKDNFVMPRQRAVTVERERQAGKQRDAYYVTQRVNLQRYFANGVVSGTYAAS 458 (458) T ss_pred CCcceEEEEecccEEEEEeeceEEEeecccCCCceEEEEEEEecceEecccceEEEeeccC Confidence 322 222222333233221 2222111111223468888999999999999999999999 No 108 >protein:vir:8102 Length: 543 # NCBI annotation: gp6 # Family: family:all:21 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817683;genbank:gi:29566114;genbank:GeneID:1259308 Probab=99.33 E-value=8e-13 Score=86.89 Aligned_cols=260 Identities=13% Similarity=0.015 Sum_probs=151.4 Q ss_pred Cccc-----hhhHHHHHHHHH-HHHHHhhccchhhcccccccccCCceEEEeecCcccceeecCCCcccCCCCCccceEE Q lcl|NC_011288. 1 MAFN-----NFIPELWSDMLL-EEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVD 74 (273) Q Consensus 1 MA~~-----~~~pev~~~~~~-~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 74 (273) +..+ .++|+-|...++ ..+....++..+++.- ...| .+.+|+............+...+...++.+.++ T Consensus 251 ~~~t~~~gg~lip~~~~~~ii~~~~~~~~~l~~~~~~~----~~~g-~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~ 325 (543) T protein:vir:81 251 MGLTKADGGYLVPFQLDPTVIITSNGSLNDIRRFARQV----VATG-DVWHGVSSAAVQWSWDAEFEEVSDDSPEFGQPE 325 (543) T ss_pred cccccccCcccCchhhhhHHHHHHHhhhchhhhhcccc----cCCc-ceEEEEecCCcceeecccCccccccccccceee Confidence 1111 467877776655 4456666666665431 1234 456676554444555566666666677777777 Q ss_pred EEEeeeeecceEEchHHHHhhhHHHHH-HHHHHHHHHHHHHHHHHHHH----------Hhhccc-ccccccCCCHHHHHH Q lcl|NC_011288. 75 LLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADL----------LVDNGT-ALSGSAPTDADDAFD 142 (273) Q Consensus 75 ~~id~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~~D~~i~~~----------~~~~~~-~~~~~~~~t~~~~~~ 142 (273) ++..+. +.-+.|+.. ......++.+ +.+.+..+++.++|..++.= +..... ....++..+...+++ T Consensus 326 ~~~~k~-~~~~~is~e-ll~d~~~~~~~i~~~l~~~~~~~~d~ail~G~Gt~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~ 403 (543) T protein:vir:81 326 IPVKKA-QGFVPISIE-ALQDEANVTETVALLFAEGKDELEAVTLTTGTGQGNQPTGIVTALAGTAAEIAPVTAETFALA 403 (543) T ss_pred eeeeee-EeeehhhHH-HHhccHHHHHHHHHHHHHHHHHHHHHHHhccCCCCcccccchhhcccccccccccccccccHH Confidence 777553 334566653 3333346655 66788899999999987631 111000 111122233345678 Q ss_pred HHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCccccCCC------ Q lcl|NC_011288. 143 LIATALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDD------ 216 (273) Q Consensus 143 ~i~~a~~~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~~~------ 216 (273) ++.++...+..... .+-.++++|..+..|.+....-. .... ..+..|.-++++|.+|+.++.+|.... T Consensus 404 ~~~~~~~~l~~~~~--~~~~~v~n~~~~~~l~~lkd~~G-~~l~---~~~~~g~~~~l~G~pv~~~~~~~~~~~~~~~~~ 477 (543) T protein:vir:81 404 DVYAVYEQLAARHR--RQGAWLANNLIYNKIRQFDTQGG-AGLW---TTIGNGEPSQLLGRPVGEAEAMDANWNTSASAD 477 (543) T ss_pred HHHHHHHhhhcccc--CCcEEEEcHHHHHHHHHhhcCCC-ceec---cCcCCCCCccccceeeEEeccccccccccccCC Confidence 88888888866553 23468999999999865321100 0111 123345567899999999999886431 Q ss_pred -cEEEEEcCceeEEeeee-eeehh--hcC----CCceeeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 217 -EQFVAFHPSAAAYVSQI-DTVEA--LRD----QDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 217 -~~~~~~~~~a~~~a~~~-~~~e~--~~~----~~~~~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) ..++.|+.+.+.+.... ..++. +.. ...-...+++.+++|..+++|++++.++-+.+ T Consensus 478 ~~~i~~gd~~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~r~d~~v~~~~A~~~l~~~~~ 542 (543) T protein:vir:81 478 NFVLLYGNFQNYVIADRIGMTVEFIPHLFGTNRRPNGSRGWFAYYRMGADVVNPNAFRLLNVETA 542 (543) T ss_pred cceEEEeeccceeEEeecccEEEEeccccccchhhcCceEEEEEEeeccEeecccceEEEEeccc Confidence 22445554443333221 12211 111 11123468889999999999999998888877 No 109 >protein:vir:80446 Length: 367 # NCBI annotation: BcepGomrgp07 # Family: family:all:1522 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210227;genbank:gi:146329919;genbank:GeneID:5123555 Probab=99.32 E-value=3.2e-13 Score=89.07 Aligned_cols=261 Identities=12% Similarity=0.061 Sum_probs=163.2 Q ss_pred Cccch--------hhHHHHHHHHHHHHHHhhccc--hhhcccccccc---cCCceEEEeecCcccceeec-CCC---ccc Q lcl|NC_011288. 1 MAFNN--------FIPELWSDMLLEEWTAQTVFA--NLVNREYEGTA---SKGNVVHIAGVVAPTVKDYK-AAG---RQT 63 (273) Q Consensus 1 MA~~~--------~~pev~~~~~~~~~~~~lv~~--~~v~~~~~~~~---~~Gdtv~ip~~~~~~~~~~~-~~~---~~~ 63 (273) |+--+ ++||+|.+.+.+.-.+.+-|. +.+.++-+... ..|++|++|.++.+...+.+ .++ ... T Consensus 1 M~~~~~~T~l~Dii~pEvF~~Yv~~~~~e~~~l~qSGiv~~d~~l~~~~~~gG~~v~iPf~~~L~g~~~n~~~d~~~~~~ 80 (367) T protein:vir:80 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNVEA 80 (367) T ss_pred CcchhhhhhhhhccchhhhhHHHhhhhhhhhhhhhcceeecCHHHHHHhhcCCCEEEeeeeccCCCCccccCCCCCcccc Confidence 99443 889999999998876554432 23434433221 46999999999887432221 111 134 Q ss_pred CCCCCccceEEEEEeeeeecceEEchHHHHhhhHH-HHHHHHHHHHHHHHHHHHHHHHHHhhccc--------------- Q lcl|NC_011288. 64 SADAISDTGVDLLIDQEKSIDFLVDDIDRVQVAGS-LEAYTRAGATALATDTDKFIADLLVDNGT--------------- 127 (273) Q Consensus 64 ~~~~~~~~~~~~~id~~~~~~~~i~d~d~~~~~~~-~~~~~~~~~~ala~~~D~~i~~~~~~~~~--------------- 127 (273) +++.++.++....+ .++..++..+|+....+-.+ |+.+..|.+.--.+...+.+++.+...-. T Consensus 81 t~~kittg~~~a~v-~~r~kaw~~~Dla~~lsG~dpm~~Ia~qva~yW~r~~q~~Lla~L~Gvf~~~~a~~~~~~~~~~~ 159 (367) T protein:vir:80 81 PIDGLGSGEMKTTK-TWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTRGR 159 (367) T ss_pred cccccccchheeee-ehhcccchhhhHHHHhhCchHHHHHHHHHHHHhhhhhHHHHHHHHHHhhccccccchhhhhhhhc Confidence 55666666655554 56778888888876655445 67777777766666666666766543210 Q ss_pred ------------ccccccC---CCHHHHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhccccccee Q lcl|NC_011288. 128 ------------ALSGSAP---TDADDAFDLIATALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGL 192 (273) Q Consensus 128 ------------~~~~~~~---~t~~~~~~~i~~a~~~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l 192 (273) ....++. .+.....+.|.+|+..|.++. ..-..++++|..+..|.+.. .+.-. .. ++. T Consensus 160 ~~a~~~~~~~~~~~Dis~~t~~~~~~~s~~~~~~A~~~lGD~~--~~l~~i~mHS~V~~~L~~~~-li~~i-~~--sd~- 232 (367) T protein:vir:80 160 VPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHV--GSIAAIAVHSMVYKRMTNND-EIEFI-PD--SKG- 232 (367) T ss_pred cccccccccCceeeeeeccCCCccceecHHHHHHHHHHhcccc--ccccEEEEchHHHHHHHhcc-ccccc-cC--CCC- Confidence 0000111 112234677999999998875 34467899999999998753 22211 11 111 Q ss_pred eeeeeeeEeceEEEeeCccccC-----CCcEEEEEcCceeEEeeee--eeehhhcCCCce----eeeEEeeeeeeeEEec Q lcl|NC_011288. 193 RAGTIGNLLGARIVESNNLRDT-----DDEQFVAFHPSAAAYVSQI--DTVEALRDQDSF----SDRIRALHVYGGKVVR 261 (273) Q Consensus 193 ~~G~ig~~~G~~v~~s~~l~~~-----~~~~~~~~~~~a~~~a~~~--~~~e~~~~~~~~----~~~v~~~~~~g~~v~~ 261 (273) +..|+.+.|..|++++.+|.. ..++++++-++|+++...- ..+|..|++... -|.+..|.+ +++- T Consensus 233 -~~~i~ty~G~~VIvDD~~Pv~~~~a~~~yttYlfg~GAi~~~~~~~~~~~E~~Rd~~~~~~gG~d~L~~Rr~---~~~h 308 (367) T protein:vir:80 233 -QLTIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGGAAFGYADGAPQVPVAVGRRELRGNGSGLEYILERKE---WIVH 308 (367) T ss_pred -ccccceecceeEEEeCCCcccccCCCceEEEEEEecceeeecccCCccceecccchhhhcCCceEEEEeeee---EEee Confidence 346999999999999999962 2356778889999988754 345888888753 144444433 4566 Q ss_pred CceEEEEecCC-------------------C Q lcl|NC_011288. 262 PTGVVVFNKTG-------------------S 273 (273) Q Consensus 262 ~~~~v~~~~~~-------------------s 273 (273) |-|+--.+++. | T Consensus 309 P~G~s~~~~~v~~~~~~~~~~~~~~~~~sPt 339 (367) T protein:vir:80 309 PGGFNWLDADVTIPDNTGSPSGITSGPPAIT 339 (367) T ss_pred cceeeecccccccccccccccccccccCCCC Confidence 66665544321 1 No 110 >protein:vir:99920 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655524;genbank:gi:109392294;genbank:GeneID:4157089 Probab=99.31 E-value=1.1e-12 Score=86.13 Aligned_cols=264 Identities=16% Similarity=0.064 Sum_probs=151.0 Q ss_pred Cccc-----hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeecCCCcccCCCCCccceEEE Q lcl|NC_011288. 1 MAFN-----NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDL 75 (273) Q Consensus 1 MA~~-----~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 75 (273) ||.. .++|+.++.++++.+++.+++..++.+- ...+..++||+....+......++...+..+++.+.+++ T Consensus 1 Mat~tt~~g~~vP~~~~~~ii~~~~~~s~l~~~~~~i----~~~~~~~~~p~~~~~~~a~wv~Eg~~~~~~~~~f~~v~l 76 (311) T protein:vir:99 1 MATFGTGNLKNLPRNIADGMVKDVVQGSTVAVLSARK----PQRFGNEDIITFNGRPKAEFVGEGQQKSSTTGEFDFVTS 76 (311) T ss_pred CceecCCCceeccHHHHHHHHHHHHhhchhhhhccee----eccCCceEEEEEeCCceeEEeecCcccccccceeeEEEE Confidence 9975 4679999999999999999888877541 122346789987554444555566666556677777777 Q ss_pred EEeeeeecceEEchHHHHh---hhHHHHH-HHHHHHHHHHHHHHHHHHHHHhhc-cccc------------c-cccCCCH Q lcl|NC_011288. 76 LIDQEKSIDFLVDDIDRVQ---VAGSLEA-YTRAGATALATDTDKFIADLLVDN-GTAL------------S-GSAPTDA 137 (273) Q Consensus 76 ~id~~~~~~~~i~d~d~~~---~~~~~~~-~~~~~~~ala~~~D~~i~~~~~~~-~~~~------------~-~~~~~t~ 137 (273) ...+. +.-+.|+++=..+ ...++.. +.+++++++++++|+.++.-.... +... . .....+. T Consensus 77 ~~~k~-~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~g~~~g~~~~g~~~~~~~~~~~~~~~~~~~ 155 (311) T protein:vir:99 77 TPKKA-QVTMRFNEEVQWADEDYQLGVLQTLSEAGAEALARALDLGLYHRINPLTGTVIPGWSNYLGAASKRVELTADTI 155 (311) T ss_pred eeEEE-EEeehhhHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccCccccccccccccccceeecccccc Confidence 76433 3345666542222 2344554 667889999999999887432110 0000 0 0111122 Q ss_pred HHHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCccccCCC- Q lcl|NC_011288. 138 DDAFDLIATALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDD- 216 (273) Q Consensus 138 ~~~~~~i~~a~~~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~~~- 216 (273) ...+.++..+...+..++.....-.++++|..+..|.+...... ..........+..+++.|++++.++.+|.... T Consensus 156 ~~~~~~i~~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~~G---~~l~~~~~~~~~~~~l~G~Pv~~s~~i~~~~~~ 232 (311) T protein:vir:99 156 ANPDLAIEAAVGLLVANGHPTPVNGLALHPSIAWGLSTARYTDG---RKKFPELGLGIGVSSFEGIDASVSDTVNGGDEA 232 (311) T ss_pred chhHHHHHHHHHHHhhhccCCCccEEEEcHHHHHHHHhhhccCC---CeeecCcccCCCCceecceeeEeeccccccccc Confidence 33445666676666555432222237999999998865321100 01111122234567899999999998873221 Q ss_pred ------------cEEEEEcC-ceeEEee-eeeeehhhcC--CC----ce---eeeEEeeeeeeeEEecCceEEEEecCC Q lcl|NC_011288. 217 ------------EQFVAFHP-SAAAYVS-QIDTVEALRD--QD----SF---SDRIRALHVYGGKVVRPTGVVVFNKTG 272 (273) Q Consensus 217 ------------~~~~~~~~-~a~~~a~-~~~~~e~~~~--~~----~~---~~~v~~~~~~g~~v~~~~~~v~~~~~~ 272 (273) ..++.|.- ..+.+.. +...++..+. .+ .| ...+++.+++|..+++|+.+++.++++ T Consensus 233 ~~~~~~~~~~~~~~~~~Gdf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~r~~~r~d~~v~~~~~v~~~~~~A 311 (311) T protein:vir:99 233 DPDDEDLDAARAVRGIVGDFANGIHWGVQRDIPVELIKYGDPDGQGDLKRHNQIALRLEIVYGWYVFTDRFVVIENAVA 311 (311) T ss_pred ccccchhhccCcceEEEeeccccEEEEEecCceEEEeecCCCCcchhhhhcCcEEEEEEEeecceecChhHeeeecccC Confidence 11222221 1222211 1112221111 11 12 236788999999999998888888888 No 111 >protein:vir:93616 Length: 645 # NCBI annotation: putative major head protein/prohead protease # Family: family:all:21 # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449293;genbank:gi:157166041;goa:Q6H9U8;interpro:IPR006433;uniprot:Q6H9U8;genbank:GeneID:5580438 Probab=99.31 E-value=1.1e-12 Score=86.26 Aligned_cols=267 Identities=13% Similarity=0.088 Sum_probs=151.7 Q ss_pred CccchhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeecCCCcccCCCCCccceEEEEEeee Q lcl|NC_011288. 1 MAFNNFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLLIDQE 80 (273) Q Consensus 1 MA~~~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~id~~ 80 (273) -+-..++|+.+.+.+++.+++.+++..+..+-....-..--.++||+....+...+..++......+.+.+.++++..+. T Consensus 344 ~~Gg~~vp~~~~~~ii~~l~~~svv~~l~~~~~~~~~~~~~~~~ip~~t~~~~a~wv~Eg~~~~~s~~~f~~v~l~~~kl 423 (645) T protein:vir:93 344 WAGSLSEYQEYAQDFIDYLRPQTIIGRFGQGGIPALRQVPFNIRVHAQVSGGAAGWVGEGKTKPLTKFDFESITFSHAKV 423 (645) T ss_pred ccCCccCchhhHHHHHHhhhhhhhHHhhccccccccccccCceeeeeeecCcceEEeccCccccccccceeEEEEeeEEE Confidence 01224689999999999999998887765432221111112567888654445556666666666667777777776442 Q ss_pred eecceEEchHHHHhhhHHHHH-HHHHHHHHHHHHHHHHHHHHHhhc-----ccccc--cccCCCHHHHHHHHHHHHHHHh Q lcl|NC_011288. 81 KSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADLLVDN-----GTALS--GSAPTDADDAFDLIATALKELT 152 (273) Q Consensus 81 ~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~~D~~i~~~~~~~-----~~~~~--~~~~~t~~~~~~~i~~a~~~l~ 152 (273) +.-+.|++.=..++..++++ +.+.+.++++.++|..++.--... +.... .....+......++..+...+. T Consensus 424 -a~~~~iS~ell~ds~~~~~~~i~~~l~~aia~~~d~a~l~g~g~~~~~~~p~gi~~~~~~~~~~~~~~~d~~~~~~~~~ 502 (645) T protein:vir:93 424 -SAIAVLTEELIRFSSPAADALVRNALAEAVVARLDTDFVDPKKAAVADVSPASITHDVKGTASSGNPDADAEAAFGQFV 502 (645) T ss_pred -EEeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHhhcCCCcccCCccccceeccccccccccchHHHHHHHHHHHH Confidence 33345554323344556777 457788999999999887421111 11111 0111122234567778888887 Q ss_pred hcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCccccC----CCcEEEEEcCceeE Q lcl|NC_011288. 153 KANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDT----DDEQFVAFHPSAAA 228 (273) Q Consensus 153 ~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~----~~~~~~~~~~~a~~ 228 (273) .+++...+-..+++|..+..|.+......+..+ ..... .| +++.|++|+.|+.+|.. ....+++++...+. T Consensus 503 ~a~~~~~~a~~vmn~~~~~~L~~lkd~~G~~~~--~~~~~-~~--~tL~G~PV~~s~~vp~~~~~gd~s~~~ig~~~~v~ 577 (645) T protein:vir:93 503 AANLQPTGAVWLMSSTNALALSMRKNALGQKEY--PDMTL-LG--GSFQGLPVIVSQYVGDQLVLVNAPDIYLADDGGVA 577 (645) T ss_pred hcCCCccccEEEEcHHHHHHHHhccccCCceee--cCCCC-CC--ceeeceeeEEeccCCcceeEeccccEEEEEecceE Confidence 777755566788999999988654221111000 00011 12 58999999999999842 11112233332222 Q ss_pred Eeeee-eeeh--hhcCC-----------Cce---eeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 229 YVSQI-DTVE--ALRDQ-----------DSF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 229 ~a~~~-~~~e--~~~~~-----------~~~---~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) +.... ..++ ..... +.| -..|++-++++..+.+|+++++|+...= T Consensus 578 i~~s~~a~~~~~~~~~~~~~~~~~~~~v~lf~~d~vaira~~r~d~~~~~p~a~~~lt~~~~ 639 (645) T protein:vir:93 578 VDMSREASLEMQSEPTGDSTTPSPVELVSMFQTGSVAIRAERWINWRRRRTAAVAVITGVNY 639 (645) T ss_pred EEeecceeEEEeecccccccccccccchhHhhcCceEEEEEEEEcceeeCccceEEEecccC Confidence 21110 0111 00000 012 2468899999999999999999974321 No 112 >protein:vir:80376 Length: 435 # NCBI annotation: gp6, major capsid head protein # Family: family:all:21 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111085;genbank:gi:134288639;genbank:GeneID:4960624 Probab=99.29 E-value=3.4e-12 Score=83.47 Aligned_cols=262 Identities=12% Similarity=0.071 Sum_probs=153.0 Q ss_pred Cccc--------hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeecCCCcccCCCCCccce Q lcl|NC_011288. 1 MAFN--------NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTG 72 (273) Q Consensus 1 MA~~--------~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~~~~~~~~~~~ 72 (273) ++.+ .++|+.|...+++.+++.+++..+..+-. ....| .+.+|+....+......++...+..++..+. T Consensus 130 ~~~~~~~~~~gg~lvP~~~~~~ii~~l~~~~~i~~~~~~~v--~~~~~-~~~~p~~~~~~~a~~v~E~~~~~~~~~~f~~ 206 (435) T protein:vir:80 130 MSLNTLSPGAGGVLVPENLSSEVIELLRPKSVVRKLGARTL--PLSNG-NITIPRLKGGAIVGYIGADTDIPTTQQQFDD 206 (435) T ss_pred hhhcccCCCCCccccchhHHHHHHHHHhhhchhhhccceee--ecCCC-ceEEEEEeCCcceeeeccCccccccccceee Confidence 2111 37899999999999988887766522211 11223 5788887554444455555555555666677 Q ss_pred EEEEEeeeeecceEEchHHHHhh--hHHHHH-HHHHHHHHHHHHHHHHHHHHHhhc--ccc----------cccccCCCH Q lcl|NC_011288. 73 VDLLIDQEKSIDFLVDDIDRVQV--AGSLEA-YTRAGATALATDTDKFIADLLVDN--GTA----------LSGSAPTDA 137 (273) Q Consensus 73 ~~~~id~~~~~~~~i~d~d~~~~--~~~~~~-~~~~~~~ala~~~D~~i~~~~~~~--~~~----------~~~~~~~t~ 137 (273) +++...+. +.-+.|++.-..++ ..++++ +.++.+++++.++|..++.--... +.. .......+. T Consensus 207 i~~~~~k~-~~~~~is~ell~ds~~~~~l~~~i~~~l~~a~~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~~~~ 285 (435) T protein:vir:80 207 LKLTAKKM-AALVPIANDLIKYAGVNPNVDQIVVGDLTAAIGAREDKAFIRDDGTANTPKGLRFWALPGNVITASDGSTL 285 (435) T ss_pred EEEeeEEE-EEeehhhHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCcccceeecccccceeecccccch Confidence 77776443 33456665433333 235666 567888999999999877421100 000 011122233 Q ss_pred HHHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCccccCCC- Q lcl|NC_011288. 138 DDAFDLIATALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDD- 216 (273) Q Consensus 138 ~~~~~~i~~a~~~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~~~- 216 (273) ...+.++.++...|........+-..+++|..+..|.+-. + ..|. ..+....=++++|++|+.++.+|...+ T Consensus 286 ~~~~~d~~~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lk----d--~~G~-~l~~~~~~~~l~G~pv~~~~~~p~~~~~ 358 (435) T protein:vir:80 286 QKIETDLGKAILALENADANLTQPGWIMAPRTFRFLEGLR----D--GNGN-KVYPELANGMLKGYPVGKTTQVPINLGE 358 (435) T ss_pred hhHHHHHHHHHHHhhccccccccCEEEEcHHHHHHHHhhh----c--cCCc-eeccCCCCCeEeeeeeEEeccccccccC Confidence 4455677777777776665444556789999998885432 1 1111 111111124799999999999985322 Q ss_pred ----cEEEEEcCceeEEeee-eeeehhhcCCC----------ce---eeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 217 ----EQFVAFHPSAAAYVSQ-IDTVEALRDQD----------SF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 217 ----~~~~~~~~~a~~~a~~-~~~~e~~~~~~----------~~---~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) ..++.++-+-+.+... ...++..+... .| ...+++..++|.++.+|++++.|+..+= T Consensus 359 ~~~~~~i~~gd~s~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~f~~n~~~~r~~~r~d~~~~~~~a~~~l~~~~~ 433 (435) T protein:vir:80 359 AGKESEIYFTDFGDVFIGEEETLEIDYSKEATYKDADGHMVSAFQRDQTLIRVIAKNDFGPRHVESIAVLSGVAW 433 (435) T ss_pred CCCcceEEEEEcccEEEEeecceEEEEeccccccccccchhhhhhcCcceeeeeeeeCcEeecccceEEEeccCC Confidence 2244554433333221 12232222221 11 3578899999999999999999987765 No 113 >protein:vir:10364 Length: 390 # NCBI annotation: head protein; major capsid subunit precursor # Family: family:all:585 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858956;genbank:gi:32128421;genbank:GeneID:2648357 Probab=99.29 E-value=3.5e-12 Score=83.42 Aligned_cols=256 Identities=13% Similarity=0.027 Sum_probs=152.5 Q ss_pred Cccc-----hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcc-cceeecCCCcccCCCCCccceEE Q lcl|NC_011288. 1 MAFN-----NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAP-TVKDYKAAGRQTSADAISDTGVD 74 (273) Q Consensus 1 MA~~-----~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~-~~~~~~~~~~~~~~~~~~~~~~~ 74 (273) +... .++|+-+...+++.+...+.+.+++..- ...+.++++|+.... .......++...+..+++.+.++ T Consensus 114 ~~~~~~~~g~~~~~~~~~~ii~~~~~~~~l~~~~~~~----~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~ 189 (390) T protein:vir:10 114 STDAAGSAGALTTPNRLPGFITQPDARLTVRDLIGSG----RTDSALIEYVQETGFVNNAAIVAEGALKPESSLKFAKKT 189 (390) T ss_pred hcccccccccccchhHHHHHHHHHHhhchhhhhccee----eccCCceEEEEEecCCcceeeecCCccccccccceeEEE Confidence 1111 2344445677888888888777776431 123557888886543 23344455555555667777888 Q ss_pred EEEeeeeecceEEchHHHHhhhHHHHH-HHHHHHHHHHHHHHHHHHHHHhhc---------ccccccccCCCHHHHHHHH Q lcl|NC_011288. 75 LLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADLLVDN---------GTALSGSAPTDADDAFDLI 144 (273) Q Consensus 75 ~~id~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~~D~~i~~~~~~~---------~~~~~~~~~~t~~~~~~~i 144 (273) +++.+. +.-+.|++. ......++.+ +.++.+++++.++|..++.--... ..........+....++.+ T Consensus 190 ~~~~k~-~~~~~is~e-ll~d~~~l~~~i~~~l~~~~~~~~~~~il~G~G~~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~ 267 (390) T protein:vir:10 190 DTTHVI-AHTMKATRQ-ILSDAPQLASYMNNRLIRGLKVKEDAEILRGTGANDGLLGLIPQATTYAAPTTIAGATRVDQL 267 (390) T ss_pred EeeEEE-EEeehhhHH-HHHhHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCccccccccccccccccccccccchHHHH Confidence 887654 344566663 3333346666 557788899999999877321000 0011111222334567888 Q ss_pred HHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCccccCCCcEEEEEcC Q lcl|NC_011288. 145 ATALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHP 224 (273) Q Consensus 145 ~~a~~~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~~~~~~~~~~~ 224 (273) ..+...|.....+. -.++++|..+..|.+-.....+ ... ... ..+.-+++.|.+|+.++.+|.+. ++.|.- T Consensus 268 ~~~~~~l~~~~~~~--~~~v~n~~~~~~L~~lkd~~g~-~l~--~~~-~~~~~~~l~G~pv~~~~~~p~~~---~~~gdf 338 (390) T protein:vir:10 268 RLAMLQASLAEYPA--SGIVINPIDWAAIELAKDANNQ-YLI--GNA-RGTLTPTLWGLPVVATQAMAPGE---FLVGAF 338 (390) T ss_pred HHHHHhhccccCCC--CEEEEcHHHHHHHHHhhcCCCc-eee--cCC-cCcCCceecceeeEEcCCCCCCc---EEEEec Confidence 88888888877653 3578999999988653211000 010 111 12334679999999999998542 344432 Q ss_pred -ceeEEee-eeeeehhhcCCCce---eeeEEeeeeeeeEEecCceEEEEecC Q lcl|NC_011288. 225 -SAAAYVS-QIDTVEALRDQDSF---SDRIRALHVYGGKVVRPTGVVVFNKT 271 (273) Q Consensus 225 -~a~~~a~-~~~~~e~~~~~~~~---~~~v~~~~~~g~~v~~~~~~v~~~~~ 271 (273) .++.... +...++..+....| ...+++.+++|+++.+|++++.++-+ T Consensus 339 ~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~~~~a 390 (390) T protein:vir:10 339 DLAAQIFDQWDARVEIGYVNDDFQRNMVTVLAEERLALVVYRPEALISGSFA 390 (390) T ss_pred cceEEEEEecceEEEEeecccccccCcEEEEEEEeeccEEeccccEEEEEeC Confidence 3333322 22234433333332 34778889999999999999999888 No 114 >protein:vir:101607 Length: 379 # NCBI annotation: major capsid protein precursor # Family: family:all:585 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112497;genbank:gi:53793597;uniprot:Q5ZGF6;genbank:GeneID:3101715 Probab=99.29 E-value=2.7e-12 Score=83.99 Aligned_cols=259 Identities=10% Similarity=0.058 Sum_probs=154.5 Q ss_pred Cccc----hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCccc--ceeecCCCcccCCCCCccceEE Q lcl|NC_011288. 1 MAFN----NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPT--VKDYKAAGRQTSADAISDTGVD 74 (273) Q Consensus 1 MA~~----~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~--~~~~~~~~~~~~~~~~~~~~~~ 74 (273) |... .++|+-|...+++.+...+.+.++++. ....+.++.+|+....+ ......++...+..++..+.++ T Consensus 109 ~~~~~~~~~~ip~~~~~~ii~~~~~~~~i~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~f~~i~ 184 (379) T protein:vir:10 109 MTLPVNLTGAQPKDYNFDVVLNPSQMLNVSDIVGA----VSISGGTYTFVRENGAGEGAIGAQVEGATKGQKDYDISMID 184 (379) T ss_pred cccCCCCccccchhhhhHHHHhHHhhhhHHhhcee----eeccCCceEEEEeecCCCcccccccCCccccccccceeeeE Confidence 2221 357899999999998888877777643 12246678888864321 1122344444444567778888 Q ss_pred EEEeeeeecceEEchHHHHhhhHHHHHH-HHHHHHHHHHHHHHHHHHHHhhcccccccccCCCHHHHHHHHHHHHHHHhh Q lcl|NC_011288. 75 LLIDQEKSIDFLVDDIDRVQVAGSLEAY-TRAGATALATDTDKFIADLLVDNGTALSGSAPTDADDAFDLIATALKELTK 153 (273) Q Consensus 75 ~~id~~~~~~~~i~d~d~~~~~~~~~~~-~~~~~~ala~~~D~~i~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~~~l~~ 153 (273) +.+.+... -+.|++. ..+....+.++ .+.++++++.+.|..+++-+...+. ......+....++.|.++...+.. T Consensus 185 ~~~~k~~~-~~~iS~e-ll~D~~~l~~~i~~~la~~~~~~~~~~~~~g~~~~~~--~~~~~~~~~~~~d~i~~~~~~~~~ 260 (379) T protein:vir:10 185 VNTDFIAG-FTRYSKK-MANNLPFLTSFIPNALRRDYAKAENAAFNAVLAANAT--ASTEIITNKNKVEMLINEIAKQEN 260 (379) T ss_pred eeeeeEEe-eehhhHH-HHhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccccc--cccccccCcccHHHHHHHHHhhhh Confidence 88866533 3456653 33333346664 4677888999999888765443321 112223334456788888877877 Q ss_pred cCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCccccCCCcEEEEEcCceeEEe-ee Q lcl|NC_011288. 154 ANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYV-SQ 232 (273) Q Consensus 154 ~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~~~~~~~~~~~~a~~~a-~~ 232 (273) .+.+. ..+|++|..+..|.+...... ............|.-.+++|++|+.|+.+|.+ .++.+.-+..... .+ T Consensus 261 ~~~~~--~~~vmn~~~~~~l~~lkd~~G-~~l~~~~~~~~~~~~~~l~G~pvv~s~~~~ag---~~~~gdf~~~~~~~~~ 334 (379) T protein:vir:10 261 LDFPV--TAIVLRPTDYYDILVTQKSVG-AGYGLPGVVTQDNGVLRINGIPLFRATWLAAN---KYYVGDWTRVTKVTTE 334 (379) T ss_pred ccCCC--CEEEEcHHHHHHHHHhhccCC-ceeccCCccCCCCCcceecceeeEecCCCCCC---ceEEeecccEEEEEEe Confidence 76543 357889999988865321111 01111111112344458999999999998753 2344433222221 22 Q ss_pred eeeehhhcCC-Cce---eeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 233 IDTVEALRDQ-DSF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 233 ~~~~e~~~~~-~~~---~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) ...++.-+.. ..| -..+++.+++|+.+.+|+++|.++-++= T Consensus 335 ~~~i~~~~~~~~~f~~~~~~~r~~~R~~~~v~~p~a~v~~~~~~~ 379 (379) T protein:vir:10 335 GLSLEFSEVEGTNFVKNNITARIEAQVALAVEQPAALIFGDFTAV 379 (379) T ss_pred ceEEEEeecccccccCCcEEEEEEEEeccEEecCccEEEEEecCC Confidence 2233333332 223 3478888999999999999999877777 No 115 >protein:vir:3870 Length: 400 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680487;swissprot:trembl:q8ltc0;genbank:gi:22296527;interpro:IPR006444;uniprot:Q8LTC0;genbank:GeneID:951713 Probab=99.28 E-value=1.3e-12 Score=85.81 Aligned_cols=253 Identities=13% Similarity=0.025 Sum_probs=146.0 Q ss_pred CccchhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcc-cceeecCCCccc-CCCCCccceEEEEEe Q lcl|NC_011288. 1 MAFNNFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAP-TVKDYKAAGRQT-SADAISDTGVDLLID 78 (273) Q Consensus 1 MA~~~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~-~~~~~~~~~~~~-~~~~~~~~~~~~~id 78 (273) -....++|+-|...+++.++....+.++++.- ...+.++++|.+... +......+++.. ..+.+..+.+++++. T Consensus 140 ~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~i~~~~~ 215 (400) T protein:vir:38 140 ADAASTIPETISNTPQRELQTVVDLKPFTNVF----QASTQKGTYPTVANATTKMVTVAELEKNPAMAKPEFKPVNWSVE 215 (400) T ss_pred cCCcccccHHHHHHHHHHHHhhhhhhhcceeE----eccCcceEEEEEecCCCccccccccccccccccccceeeEeehh Confidence 11125789999999999999888877776431 123446677765422 222233333333 233456666776664 Q ss_pred eeeecceEEchHHHHhhhHHHHH-HHHHHHHHHHHHHHHHHHHHHhhcccccccccCCCHHHHHHHHHHHHHHHhhcCCC Q lcl|NC_011288. 79 QEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADLLVDNGTALSGSAPTDADDAFDLIATALKELTKANVP 157 (273) Q Consensus 79 ~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~~D~~i~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~~~l~~~~vP 157 (273) +. +.-+.|++.=..++..++.. +.+..+++++.+.|..++....... .. +...++++.++....-+ | T Consensus 216 k~-~~~~~is~ell~ds~~~~~~~i~~~l~~~~~~~~~~~i~~~~~~~~----~~----~~~~~~~~~~~~~~~~~---~ 283 (400) T protein:vir:38 216 TY-RQALPVSQESIDDSAIDLVGLIAQNGQQIKVNTTNGAVATLLKGFT----AK----TISSVDDLKHINNVDLD---P 283 (400) T ss_pred he-eeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhhhhcccccc----cc----ccccHHHHHHHHHhhhh---h Confidence 43 34445555433344556766 5567778888888887764332211 11 11224455544332211 2 Q ss_pred ccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCccccCC--CcEEEEEc-CceeEEe-eee Q lcl|NC_011288. 158 NVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTD--DEQFVAFH-PSAAAYV-SQI 233 (273) Q Consensus 158 ~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~~--~~~~~~~~-~~a~~~a-~~~ 233 (273) ..+...+++|..+..|.+...... .......+..|.-++|.|++|+.+++.|..+ ...++.+. +.++... .+. T Consensus 284 ~~~a~~v~~~~~~~~l~~lkd~~G---~~i~~~~~~~~~~~~l~G~pv~~~~~~~~~~~g~~~~~~gd~s~~~~~~~~~~ 360 (400) T protein:vir:38 284 AYSRVIIASQSFYNFLDTVKDGNG---RYLLQDSILTPSGKSVLGMPIAVVSDDTLGAAGEAHAFLGDIKRAILFANRAD 360 (400) T ss_pred hhCcEEEEcHHHHHHHHHhhccCC---CeeeecCcCCCCccccccceeEEecccccCCCCceEEEEEeccccEEEEeecc Confidence 234578899999998865321100 1111122445666789999999999887543 22344544 3333333 222 Q ss_pred eeehhhcCCCceeeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 234 DTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 234 ~~~e~~~~~~~~~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) ..++.. +...+.+.+++.+++|+++.+|++++.|+-+.. T Consensus 361 ~~~~~~-~~~~~~~~~~~~~r~d~~~~~~~a~~~l~~~~~ 399 (400) T protein:vir:38 361 FMVRWV-DDQIYGQFLQAGMRFGVSVADEKAGYFLTYTPK 399 (400) T ss_pred eEEEEe-cccccceeEEEEEEeccEEecccceEEEEeecC Confidence 333332 235567789999999999999999888777666 No 116 >protein:vir:4856 Length: 293 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049396;genbank:gi:9632424;genbank:GeneID:1258532 Probab=99.27 E-value=3.5e-12 Score=83.38 Aligned_cols=258 Identities=10% Similarity=-0.018 Sum_probs=156.5 Q ss_pred Cccc------hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcc-cceeecCCCcccC-CCCCccce Q lcl|NC_011288. 1 MAFN------NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAP-TVKDYKAAGRQTS-ADAISDTG 72 (273) Q Consensus 1 MA~~------~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~-~~~~~~~~~~~~~-~~~~~~~~ 72 (273) |+.. .++|+.|...+++.+++..++.++++.- .......+..||+.... .......++.... .+.+..+. T Consensus 5 ~~~~t~~~gg~liP~~~~~~Ii~~~~~~~~l~~~~~~~--~~~~~~g~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~~ 82 (293) T protein:vir:48 5 KTDHSGSDAGLTIPQDIRTAINTLVRQYDSLQEYVNVE--NVTTLTGSRVYEKWTDITGLANIDDEAGKIADIDDPKLSL 82 (293) T ss_pred ecccccCcCceEechhHHHHHHHHHHhhhhhhhhceee--eccCCcceEEEEeecCCCcceeeecCCcccccccccceeE Confidence 5544 4789999999999999999887776431 11122335666665432 3344455554443 23466677 Q ss_pred EEEEEeeeeecceEEchHHHHhhhHHHHH-HHHHHHHHHHHHHHHHHHHHHhhcccccccccCCCHHHHHHHHHHHHHHH Q lcl|NC_011288. 73 VDLLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADLLVDNGTALSGSAPTDADDAFDLIATALKEL 151 (273) Q Consensus 73 ~~~~id~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~~D~~i~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~~~l 151 (273) ++++..+. +.-+.|+++-..++..++++ +.++.+++++.+.|..++.-...... ......+++|.++...+ T Consensus 83 i~l~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~i~~g~~~~~~-------~~~~~~~d~i~~~~~~l 154 (293) T protein:vir:48 83 IKYTIKRY-AGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILGVVDKLPT-------KPTLTKWDDIIDLEAKV 154 (293) T ss_pred EEEeeeEE-EEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHhHHhhccccccc-------cccccCHHHHHHHHHhh Confidence 77777553 34456666544455566766 66788899999999888864432221 12233477888888888 Q ss_pred hhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCc--cccCCC-c-EEEEEc-Cce Q lcl|NC_011288. 152 TKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNN--LRDTDD-E-QFVAFH-PSA 226 (273) Q Consensus 152 ~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~--l~~~~~-~-~~~~~~-~~a 226 (273) ..... .+-..+++|..+..|.+...... ..-....+.+|.-++++|.+|+.+.. +|..+. . .++.+. +.+ T Consensus 155 ~~~~~--~~a~~vmn~~~~~~L~~lkd~~g---~~l~~~~~~~~~~~~l~G~Pv~~~~~~~~~~~~~~~~~~~~gd~~~~ 229 (293) T protein:vir:48 155 DPAIK--QTSFFLTNTSGFTALKKVKNALG---DYLMERDVKSPTGYSIAGFAVKEISDRWLPNASSGVMPLYFGDLKQA 229 (293) T ss_pred hhhhc--CCCEEEEcHHHHHHHHHhhccCC---ceEeecCcCCCCCceecceeeEEecccccCCccCCceEEEEEeccce Confidence 76653 34467899999998865321111 11111224456667999999987544 443222 2 234443 344 Q ss_pred eEEeee-eeeehhhcCC-C---ceeeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 227 AAYVSQ-IDTVEALRDQ-D---SFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 227 ~~~a~~-~~~~e~~~~~-~---~~~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) +....+ ...++..+.. + +-...+++.+++|.++.+|++++.++-+.+ T Consensus 230 ~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~~ 281 (293) T protein:vir:48 230 VTLFDRQQMSLLSTNIGGGAFETDTTKVRVIDRFDVVATDTEAFVPASFKAI 281 (293) T ss_pred EEEEEecceEEEEecccchhhhcCeEEEEEEEeeCcEEecccceEEEEeecc Confidence 433332 2233333221 2 234679999999999999999998875554 No 117 >protein:vir:100172 Length: 394 # NCBI annotation: putative major head protein # Family: family:all:21 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025031;genbank:gi:48697264;genbank:GeneID:2948270 Probab=99.27 E-value=3.4e-12 Score=83.45 Aligned_cols=259 Identities=14% Similarity=0.040 Sum_probs=148.7 Q ss_pred Cc------cchhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcc-cceeecCCCcccC-CCCCccce Q lcl|NC_011288. 1 MA------FNNFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAP-TVKDYKAAGRQTS-ADAISDTG 72 (273) Q Consensus 1 MA------~~~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~-~~~~~~~~~~~~~-~~~~~~~~ 72 (273) |. -..++|+-|...+++.++...++.++++.- ...+.+.++|..... +......+++... .+.+..+. T Consensus 111 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~ 186 (394) T protein:vir:10 111 AGHVTSTEAGVLIPEEIIYDPTAEVNSVVDLSTLVTKT----PVTTPKGTYPILKRATDRFSSVAELAENPALAEPEFEQ 186 (394) T ss_pred hcccccccCceeccHHHHHHHHHHHHhhhhhhhhceee----eccCCceEEEEEecCCCcccccccccccccccccccee Confidence 11 114789999999999999999888887542 223556777765432 2223333333332 34567777 Q ss_pred EEEEEeeeeecceEEchHHHHhhhHHHHH-HHHHHHHHHHHHHHHHHHHHHhhcccccccccCCCHHHHHHHHHHHHHHH Q lcl|NC_011288. 73 VDLLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADLLVDNGTALSGSAPTDADDAFDLIATALKEL 151 (273) Q Consensus 73 ~~~~id~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~~D~~i~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~~~l 151 (273) +++.+.+. +.-+.|++.-..++..++.+ +.+.++++++.+.|..++....... ....+....+++|.++...+ T Consensus 187 v~l~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~il~g~g~~~-----~~~~~~~~~~d~l~~~~~~~ 260 (394) T protein:vir:10 187 VDWSVSTY-RGAIPLSEEAIADSAVDLTSLVGQSINEKSVNTYNAMIAPVLQSFT-----AKATTTDTLVDSLKHILNVD 260 (394) T ss_pred EEeeeeee-EeeehhHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhhcccccc-----cccccccccHHHHHHHHHhh Confidence 88877554 33356666544455566766 5677888999999998875543221 11122233455666554322 Q ss_pred hhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccc-cceeeeeeeeeEeceEEEeeCc--cccCCCcEE-EEEc-Cce Q lcl|NC_011288. 152 TKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGD-AAGLRAGTIGNLLGARIVESNN--LRDTDDEQF-VAFH-PSA 226 (273) Q Consensus 152 ~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~-~~~l~~G~ig~~~G~~v~~s~~--l~~~~~~~~-~~~~-~~a 226 (273) -... .+-.+|++|..+..|.+......+--...+ ......|.-++++|++|+.++. ++...+..+ +.+. +.+ T Consensus 261 ~~~~---~~a~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~L~G~PV~~~~~~~~~~~~~~~~i~~gd~s~~ 337 (394) T protein:vir:10 261 LDPA---YSRALVVTQSLFNTLDTLKDKNGRYLLHDASDSITDGTAKGTVLGVPVYVVGDALLGSAAGDQKAFVGDLKRG 337 (394) T ss_pred hhhh---ccCEEEecHHHHHHHHHhhccCCCeeeeccccccccCCcccccccceeEEecccccCCCCCceEEEEeecccc Confidence 2211 234689999999998763211110000000 0111123336899999987654 343334333 3333 333 Q ss_pred eEEee-eeeeehhhcCCCceeeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 227 AAYVS-QIDTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 227 ~~~a~-~~~~~e~~~~~~~~~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) +.... +...++.. +...|.+.+++-+++|+++.+|++++.++-+.+ T Consensus 338 ~~~~~~~~~~v~~~-~~~~~~~~~~~~~r~d~~~~~~~ai~~~~~~~~ 384 (394) T protein:vir:10 338 VLFADRQQVTLAWE-DSKIYGRYLGAAFRFGVKQADSNAGYFVTNTDA 384 (394) T ss_pred EEEEeecceEEEEe-cccccceeEEEEEEeccEEeccccEEEEEeecc Confidence 33332 22233332 344567788999999999999999988776666 No 118 >protein:vir:5739 Length: 366 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892050;genbank:gi:33770513;interpro:IPR006444;uniprot:Q7Y410;genbank:GeneID:1732928 Probab=99.26 E-value=8.1e-12 Score=81.38 Aligned_cols=262 Identities=14% Similarity=0.073 Sum_probs=146.1 Q ss_pred Cccc-------hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeecCCCcccCCCCCccceE Q lcl|NC_011288. 1 MAFN-------NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGV 73 (273) Q Consensus 1 MA~~-------~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~~~~~~~~~~~~ 73 (273) |+.+ .++|+.+..++++.+++.+++..+-.+.. ....| .+++|+...........++...+..+++.+.+ T Consensus 64 ~a~~~~~~~Gg~lvP~~~~~~ii~~l~~~s~l~~lg~~~v--~~~~g-~~~~p~~t~~~~a~wv~E~~~~~~s~~~f~~i 140 (366) T protein:vir:57 64 MAISTAAGSGGALIPQNMQNEVIELLRDRTVVRILGARSI--PLPNG-NLSMPRLSGGATAGYVGEGKDVVATGATFDDV 140 (366) T ss_pred hhccccccCCccccchhHHHHHHHHHhhhcchhhhceeee--ecCCC-ceEEEEEeCCcceeeeccCccccccccceeEE Confidence 3322 46899999999999998888766522211 12234 58888876554445555666565566777777 Q ss_pred EEEEeeeeecceEEchHHHHhhhHHHHH-HHHHHHHHHHHHHHHHHHHHHhh--ccccc-------cc-----ccCCCHH Q lcl|NC_011288. 74 DLLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADLLVD--NGTAL-------SG-----SAPTDAD 138 (273) Q Consensus 74 ~~~id~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~~D~~i~~~~~~--~~~~~-------~~-----~~~~t~~ 138 (273) ++...+. +.-+.|++.=..++..++++ +.++..+++++++|..++.=-.. .+... .. .+..+.. T Consensus 141 ~~~~~k~-~~~~~iS~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~t~~~~~ 219 (366) T protein:vir:57 141 KLSAKTM-IALVPVSNQLIGRAGFNVEQLLLGDILSAIATREDKAFLRDDGTGDTPKGMKAVATAANRLVAWTGTAINLT 219 (366) T ss_pred EEeeEEE-EEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccceeeccccccceeeccccccchh Confidence 7776443 33456665433445556766 56788899999999887631100 00000 00 0111111 Q ss_pred HHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCccccCCC-- Q lcl|NC_011288. 139 DAFDLIATALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDD-- 216 (273) Q Consensus 139 ~~~~~i~~a~~~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~~~-- 216 (273) .....+..+.......+....+-..+++|..+..|.+- ++ ..| ...+....-|.+.|++|+.|+.+|...+ T Consensus 220 ~~~~~~~~~~~~~~~~~~~~~~a~~vmn~~~~~~L~~l----kd--~~G-~~l~~~~~~g~l~G~Pvv~s~~ip~~~~~~ 292 (366) T protein:vir:57 220 TIDEYLDSLILKHMDSNSNMIRCGWGLSNRTYMTLFGL----RD--GNG-NKVYPEMSQGILKGYPIQRTSAIPANLGDD 292 (366) T ss_pred hHHHHHHHHHHhhhccccccccCEEEecHHHHHHHHhh----hc--cCC-ceeccCCCCCeecceeeEEccccccccccC Confidence 11122222222222222212334568999999888653 21 111 1112222236799999999999986321 Q ss_pred ---cEEEEEcCceeEEeeee-eeehhhcC-----CC-----ce---eeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 217 ---EQFVAFHPSAAAYVSQI-DTVEALRD-----QD-----SF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 217 ---~~~~~~~~~a~~~a~~~-~~~e~~~~-----~~-----~~---~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) ..++.+.-+-+-..... ..++..+. .. .| ...++..++++..+.+|+++++++..+= T Consensus 293 ~~~~~i~~gdfs~~~i~~~~~i~i~~~~ea~~~~~~g~~~~~f~~~~~~iR~~~~~d~~v~~~~a~~~lt~~~~ 366 (366) T protein:vir:57 293 GNESEIYFCDFNDVVIGEDGMMKVDFSTEATYKDADGQLVSAFARNQSLIRVVTEHDIGFRHPEGLVLGTGVIW 366 (366) T ss_pred CCccEEEEEecceEEEEEecceEEEEeeccccccccccchhhhhcCceeEEeeeeeCcEeeccccEEEEecccC Confidence 22444554433332211 12222221 11 11 2578999999999999999999987777 No 119 >protein:vir:1433 Length: 435 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536362;genbank:gi:17975167;genbank:GeneID:929171 Probab=99.26 E-value=5.4e-12 Score=82.34 Aligned_cols=262 Identities=11% Similarity=0.071 Sum_probs=151.3 Q ss_pred Cccc--------hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeecCCCcccCCCCCccce Q lcl|NC_011288. 1 MAFN--------NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTG 72 (273) Q Consensus 1 MA~~--------~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~~~~~~~~~~~ 72 (273) ++.+ .++|+.|...+++.++..+++..+..+.. ....| .+++|+....+......++...+..++..+. T Consensus 130 ~~~~~~t~~~gg~~vP~~~~~~ii~~l~~~~~i~~~~~~~~--~~~~~-~~~~p~~~~~~~a~~v~E~~~~~~~~~~f~~ 206 (435) T protein:vir:14 130 MSLNTLSPGAGGVLVPENLSSEVIELLRPKSVVRKLGARTL--PLSNG-NITIPRLKGGAIVGYIGADTDIPTTQQQFDD 206 (435) T ss_pred hhcccCCcCCCccccchhHHHHHHHHHhhhchhhhhcceee--ecCCC-ceEEEEEeCCcceeeeccCccccccccceeE Confidence 2111 37899999999999988887766532221 12233 5788887554444455555555555666666 Q ss_pred EEEEEeeeeecceEEchHHHHhhh--HHHHH-HHHHHHHHHHHHHHHHHHHHHhhc--ccc----------cccccCCCH Q lcl|NC_011288. 73 VDLLIDQEKSIDFLVDDIDRVQVA--GSLEA-YTRAGATALATDTDKFIADLLVDN--GTA----------LSGSAPTDA 137 (273) Q Consensus 73 ~~~~id~~~~~~~~i~d~d~~~~~--~~~~~-~~~~~~~ala~~~D~~i~~~~~~~--~~~----------~~~~~~~t~ 137 (273) +++...+. +.-+.|++.=..++. ..++. +.+.+.+++++++|..++.--... +.. .......+. T Consensus 207 i~~~~~k~-~~~~~iS~ell~ds~~~~~l~~~i~~~l~~ai~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~~~~ 285 (435) T protein:vir:14 207 LKLTAKKM-AALVPIANDLIKYAGVNPNVDQIVVGDLTAAIGAREDKAFIRDDGTANTPKGLRFWALPSNVITASDASTL 285 (435) T ss_pred EEeeeEEE-EEeehhhHHHHHhhccCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccceeecccccceeccccccch Confidence 76666443 334556654333332 24666 457788999999999877311000 000 011112334 Q ss_pred HHHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCccccCCC- Q lcl|NC_011288. 138 DDAFDLIATALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDD- 216 (273) Q Consensus 138 ~~~~~~i~~a~~~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~~~- 216 (273) ...+.++.++...+.....-..+...+++|..+..|.+-.. ..|. ..+....=|.++|++|+.++.+|...+ T Consensus 286 ~~~~~~~~~l~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd------~~G~-~l~~~~~~g~l~G~Pv~~~~~~p~~~~~ 358 (435) T protein:vir:14 286 QKIETDLGKVILALENADANLTQPGWIMAPRTFRFLEGLRD------GNGN-KVYPELANGMLKGYPVGKTTQVPINLGE 358 (435) T ss_pred hhHHHHHHHHHHHhhhccccccCCEEEEcHHHHHHHHHhhc------cCCc-eeccCCCCCeeecceeEeeccccccccC Confidence 44566777777777665443345568999999988855321 1111 111111125789999999999886321 Q ss_pred ----cEEEEEcCceeEEeee-eeeehhhcCC----------Cce---eeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 217 ----EQFVAFHPSAAAYVSQ-IDTVEALRDQ----------DSF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 217 ----~~~~~~~~~a~~~a~~-~~~~e~~~~~----------~~~---~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) ..++.+.-+.+.+... ...++..+.. ..| ...+++.+++|.++.+|++++.|+..+- T Consensus 359 ~~~~~~i~~gd~s~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~~ 433 (435) T protein:vir:14 359 TGKESEIYFTDFGDVFIGEEETLEIDYSKEATYKDADGHMVSAFQRDQTLIRVIAKNDFGPRHVESIAVLAGVAW 433 (435) T ss_pred CCccceEEEeecccEEEEEecccEEEEeccccccccccchhhhhhcChhheeeeeeeCceeecccceEEEecCCC Confidence 2344554433333221 1122222211 112 2578999999999999999999987665 No 120 >protein:vir:95376 Length: 425 # NCBI annotation: phage major capsid protein # Family: family:all:635 # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764476;genbank:gi:115334630;genbank:GeneID:5179263 Probab=99.25 E-value=3.3e-12 Score=83.54 Aligned_cols=259 Identities=17% Similarity=0.114 Sum_probs=151.0 Q ss_pred Cccc---hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeecCCCcccCCCC-CccceEEEE Q lcl|NC_011288. 1 MAFN---NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSADA-ISDTGVDLL 76 (273) Q Consensus 1 MA~~---~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~ 76 (273) .+.+ .++|+.+...+++.++..+.+.+++..- ...|+ ..+|+....+......++......+ ...+.++++ T Consensus 141 ~~~~~gg~~vP~~~~~~Ii~~l~~~~~i~~~~~~~----~~~g~-~~ip~~~~~~~a~~v~E~~~~~~~~~~~f~~i~l~ 215 (425) T protein:vir:95 141 RAVAGGELTIPEVVVNRIMDIMGDYTTLYPLVDKI----RVKGT-TRILVDTDTSPATWIEQSGALPTGDVGTIASIDFD 215 (425) T ss_pred cccccCceeccHHHHHHHHHHHHhhhhHHHhhcee----ecCce-eEEEEecCCccccccccccccccccccccceeeee Confidence 1111 3789999999999999998888877431 12354 5788877666666566555543333 345666666 Q ss_pred EeeeeecceEEchHHHHhhhHHHHH-HHHHHHHHHHHHHHHHHHHHH-----------hhcccccccccCCCHHHHHHHH Q lcl|NC_011288. 77 IDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADLL-----------VDNGTALSGSAPTDADDAFDLI 144 (273) Q Consensus 77 id~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~~D~~i~~~~-----------~~~~~~~~~~~~~t~~~~~~~i 144 (273) ..+. +.-+.|++.-..++..++.+ +.++.+++++.++|..++.-- ...+.. ..........+++.+ T Consensus 216 ~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~il~G~G~~~~~p~Gil~~~~~~-~~~~~~~~~~~~~~~ 293 (425) T protein:vir:95 216 GFKV-GKVTFVDNYLLQDSIINLDDYVTKKIARAIAKALDLAIVKGTGAANKQPLGIIPSLPPE-NQVTVEADNNLLKNL 293 (425) T ss_pred heee-eeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccceeecccccc-cccccccccchHHHH Confidence 6432 34456666444445556776 557788899999999887421 110000 011112234467888 Q ss_pred HHHHHHHhhcCCCccCCEEEECHHHH-HHHhhhHHHHhhhhccccc-ceeeeeeeeeEeceEEEeeCccccCCCcEEEEE Q lcl|NC_011288. 145 ATALKELTKANVPNVGRVVVVNAEMA-FWLRSSGSKLTSADTSGDA-AGLRAGTIGNLLGARIVESNNLRDTDDEQFVAF 222 (273) Q Consensus 145 ~~a~~~l~~~~vP~~~r~lvv~p~~~-~~L~~~~~~~~~~~~~~~~-~~l~~G~ig~~~G~~v~~s~~l~~~~~~~~~~~ 222 (273) .++...+.....+..+-..++++..+ ..|..- ..+.+ ..|.- ...-.+..+++.|.+|+.++++|.. .++.| T Consensus 294 ~~~~~~~~~~~~~~~~~~~v~~~~~~~~~l~~l-~~~kd--~~g~~i~~~~~~~~~~l~G~pvv~~~~~~~~---~i~~G 367 (425) T protein:vir:95 294 VKQIGLIDTGDDSVGEIVAVMKRSTYYNRLVEF-SIQVD--SNGNVVGKLPNLRTPDLLGLRVVFNNFLDDD---TVLFG 367 (425) T ss_pred HHHHHhhhhhccccCceEEEEeChHHHHHHHHH-HhhcC--CCCceeeccCCCCCccccceeeEEcCcCCCc---cEEEE Confidence 88877776655444444556776654 434321 12221 11110 0012455678999999999999854 23334 Q ss_pred cCceeEEee-eeeeehhhcCCCce---eeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 223 HPSAAAYVS-QIDTVEALRDQDSF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 223 ~~~a~~~a~-~~~~~e~~~~~~~~---~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) .-+-..... +...++..++ .+| .+.+++.+++++++.+|+++++++-+-+ T Consensus 368 d~~~~~~~~~~~~~i~~~~~-~~f~~~~~~~~~~~r~d~~~~~~~a~~~~~i~~~ 421 (425) T protein:vir:95 368 EFEQYTLVERENITIDSSTH-VKFTEDQTAFRGKGRFDGKPVKPEAFVLVTITDP 421 (425) T ss_pred ecccEEEEeecceEEEeecc-cccccCceEEEEEEeeCcEeecccceEEEEecCc Confidence 322222222 2122322222 233 3578999999999999999999887776 No 121 >protein:vir:4830 Length: 397 # NCBI annotation: MPL-7201 # Family: family:all:21 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038327;genbank:gi:9634653;genbank:GeneID:1262632 Probab=99.25 E-value=4.8e-12 Score=82.64 Aligned_cols=259 Identities=11% Similarity=-0.008 Sum_probs=152.4 Q ss_pred Cccc------hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeecCCCcccC-CCCCccceE Q lcl|NC_011288. 1 MAFN------NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTS-ADAISDTGV 73 (273) Q Consensus 1 MA~~------~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~~~-~~~~~~~~~ 73 (273) |+.. .++|+-|...+++.+++.+++.++++.-.. ....|+....+.....+......++.... .+.+..+.+ T Consensus 109 ~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~-~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v 187 (397) T protein:vir:48 109 KTDASGSDAGLTIPQDIQTAIHTLVRQYDSLQEYVNVENV-TTLTGSRVYEKWADITGLAKLDDEAGSIGTNDDPKLYPI 187 (397) T ss_pred hhccCCccccccccHHHHHHHHHHHHHHHHHHhhhceeec-cCCcceEEEEeecCCCcceeeeccccccccccccceeeE Confidence 3222 478999999999999999988888754211 12223332222222222333344444332 334566777 Q ss_pred EEEEeeeeecceEEchHHHHhhhHHHHH-HHHHHHHHHHHHHHHHHHHHHhhcccccccccCCCHHHHHHHHHHHHHHHh Q lcl|NC_011288. 74 DLLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADLLVDNGTALSGSAPTDADDAFDLIATALKELT 152 (273) Q Consensus 74 ~~~id~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~~D~~i~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~~~l~ 152 (273) ++++.+. +.-+.|++.-..++..++.+ +.++..++++.++|..++.-... . ...+....+++|.++...|. T Consensus 188 ~~~~~k~-~~~~~iS~ell~ds~~~l~~~v~~~l~~~~~~~~d~~il~G~g~---~----~~~~~~~~~d~i~~~~~~l~ 259 (397) T protein:vir:48 188 RYAIKRY-AGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILEAIAT---L----PTKPTLTKWDDIIDLQAKVD 259 (397) T ss_pred Eeeheee-eeehhhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhcccc---c----ccccccccHHHHHHHHHHhh Confidence 7777543 34456666544455566766 66788899999999988743211 1 12223345778888888888 Q ss_pred hcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCc--cccCC--CcEEEEEc-Ccee Q lcl|NC_011288. 153 KANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNN--LRDTD--DEQFVAFH-PSAA 227 (273) Q Consensus 153 ~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~--l~~~~--~~~~~~~~-~~a~ 227 (273) ....+ +-.++++|..+..|.+-..... .......+..|.-+.|.|++|+.+.+ ++..+ ...++.|. +.++ T Consensus 260 ~~~~~--~a~~v~n~~~~~~L~~lkd~~G---~~i~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~~~~gd~~~~~ 334 (397) T protein:vir:48 260 PAIKQ--TSFFLTNTSGFTALKKVKNAFG---DYLMERDVKSPTGYSIDGFAVKEVADRWLANASSGAMPLYFGDLKQAV 334 (397) T ss_pred hhhcC--CCEEEECHHHHHHHHHhhcCCC---ceeeccCcCCCCCceeccceeEEecccccCCcCCCceEEEEEeccceE Confidence 77653 4578899999998865321111 11111224456678999999987543 33322 22344454 3344 Q ss_pred EEeee-eeeehhhcCC----CceeeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 228 AYVSQ-IDTVEALRDQ----DSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 228 ~~a~~-~~~~e~~~~~----~~~~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) ....+ ...++..+.. ..-...+++.+++|..+++|++++.++-+++ T Consensus 335 ~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~ 385 (397) T protein:vir:48 335 TLFDRQQMSLLSTNIGGGAFETDTTKIRVIDRFDVVATDTESFVPASFKAI 385 (397) T ss_pred EEEeecceEEEEeccchhhhhcCceeEEEEeeeccEEecccceEEEEeccc Confidence 33322 2233333322 2223588899999999999999988775555 No 122 >protein:vir:9704 Length: 394 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795466;genbank:gi:28876225;genbank:GeneID:1257769 Probab=99.23 E-value=7.5e-12 Score=81.57 Aligned_cols=251 Identities=11% Similarity=-0.037 Sum_probs=143.9 Q ss_pred Ccc-chhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCc--ccceeecCCCcccC-CCCCccceEEEE Q lcl|NC_011288. 1 MAF-NNFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVA--PTVKDYKAAGRQTS-ADAISDTGVDLL 76 (273) Q Consensus 1 MA~-~~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~--~~~~~~~~~~~~~~-~~~~~~~~~~~~ 76 (273) -++ ..++|+-|...+++.++...++.++++.- ...+.+.++|.+.. .++. ...++.... .+.+..+.+++. T Consensus 133 ~~~gg~liP~~~~~~ii~~~~~~~~l~~~~~~~----~~~~~~~~~~~~~~~~~~~~-~v~E~~~~~~~~~~~~~~v~l~ 207 (394) T protein:vir:97 133 KENAKPVSSEEILYTPAREVKTVVDLKPFTTVY----QAKKASGKYPVLQRATTKMV-TVAELEKNPALAKPDFKDVAWN 207 (394) T ss_pred cccccccChHHHHHHHHHHhhhhhhhhhhceee----eccCcceEEEEEecCCCccc-eecccccccccccccceeEEee Confidence 111 14689999999999998888887776531 11233566776532 2222 233333332 234566677777 Q ss_pred EeeeeecceEEchHHHHhhhHHHHH-HHHHHHHHHHHHHHHHHHHHHhhcccccccccCCCHHHHHHHHHHHHHHHhhcC Q lcl|NC_011288. 77 IDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADLLVDNGTALSGSAPTDADDAFDLIATALKELTKAN 155 (273) Q Consensus 77 id~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~~D~~i~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~~~l~~~~ 155 (273) ..+. +.-+.|+..=..++..++.. +.+..+++++...|..++.-..+.. ..+ ...++++..+...+-. T Consensus 208 ~~k~-~~~i~is~ell~ds~~~~~~~i~~~la~~~~~~~~~~i~~g~~~~~----~~~----~~~~~~~~~~~~~~~~-- 276 (394) T protein:vir:97 208 IDTY-RGAIPLSQESIDDADVDLVGIVSESISQIKVNTTNDAIAKVLKSFT----TKT----VKNLDEIKALLNGGFD-- 276 (394) T ss_pred hhhe-eeehhhHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhhcccccc----ccc----cccHHHHHHHHHhhhh-- Confidence 6443 34455665433344556766 5577888899998888775432211 111 1224555554433211 Q ss_pred CCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCccccCCCcEEEEEc-CceeEEe-eee Q lcl|NC_011288. 156 VPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFH-PSAAAYV-SQI 233 (273) Q Consensus 156 vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~~~~~~~~~~-~~a~~~a-~~~ 233 (273) |.-+-.+|++|..+..|..-.... +.......+.+|.-++|.|++|+.+++...+.+ .++.|. ..+.... .+. T Consensus 277 -~~~~a~~v~n~~~~~~l~~lkd~~---G~~i~~~~~~~~~~~~l~G~pv~~~~~~~~~~~-~~~~gd~~~~~~~~~~~~ 351 (394) T protein:vir:97 277 -PAYNVSLIVSQSFYQTLDTLKDGN---GRYLLQDDITAVSGKVLLGKPVFVLSDEVLGAN-KAFIGDFKRGVLFADRKD 351 (394) T ss_pred -hhhCCEEEEcHHHHHHHHHhhccC---CCeeeecCcCCCCCceeccceeEEecccccCCc-cEEEeeccccEEEEEecc Confidence 222345889999999886532110 011111223455567899999998766544433 344454 2223222 222 Q ss_pred eeehhhcCCCceeeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 234 DTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 234 ~~~e~~~~~~~~~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) ..++.. +..++...+++-+++|+++.+|++++.++-+.+ T Consensus 352 ~~~~~~-~~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~ 390 (394) T protein:vir:97 352 LGLRWA-DNEIYGQYLQAVLRFGVSKVDDKAGYYVTFTPE 390 (394) T ss_pred eEEEEe-cccccceeEEEEEEEccEEecccceEEEEeccc Confidence 233322 234566788999999999999999998888777 No 123 >protein:vir:9927 Length: 295 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795689;genbank:gi:28876459;genbank:GeneID:1258000 Probab=99.23 E-value=4e-13 Score=88.57 Aligned_cols=251 Identities=13% Similarity=0.074 Sum_probs=136.1 Q ss_pred Cccchhh-------HHH------HHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeecCCCcccCCCC Q lcl|NC_011288. 1 MAFNNFI-------PEL------WSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSADA 67 (273) Q Consensus 1 MA~~~~~-------pev------~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~~~~~~ 67 (273) ||-++++ |+. |++-+.+ |.+.| +.. | .. ....|+||++|+|...+......+|..+..+. T Consensus 1 mAe~nlt~~~dL~~~~sidfv~~f~~~i~~-L~~~L---gi~-r-~~-p~a~G~tIt~pK~~~tgda~dVaEGe~Iplsk 73 (295) T protein:vir:99 1 MAEKNLNTMADLGDIKSIDFVNKFSKNIND-LLKLL---GVT-R-RE-TLTNDLKIQTYKWEVTLDQTDPGEGETIPLSK 73 (295) T ss_pred CCCcccccHhhccCceeehhhHHhhhhHHH-HHHHh---ccc-c-cc-ccccCCeEEeeeeeeecccccccCCcccchhh Confidence 9998764 221 2222111 22221 111 1 11 34569999999999877666677788888888 Q ss_pred Cccc---eEEEEEeeeeecceEEchHHHHhh-hHH-HHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccCCCHHHHHH Q lcl|NC_011288. 68 ISDT---GVDLLIDQEKSIDFLVDDIDRVQV-AGS-LEAYTRAGATALATDTDKFIADLLVDNGTALSGSAPTDADDAFD 142 (273) Q Consensus 68 ~~~~---~~~~~id~~~~~~~~i~d~d~~~~-~~~-~~~~~~~~~~ala~~~D~~i~~~~~~~~~~~~~~~~~t~~~~~~ 142 (273) ++.+ ..++++++++. .++|+..... .++ +.+..+|+.++|++++|++++..+..+.....+ .+-...++ T Consensus 74 vt~~~~~t~t~kikK~rK---~tTdEAIqlsGygdpvgead~qL~~~ia~kId~D~~~~lktat~t~tg---~~lq~a~a 147 (295) T protein:vir:99 74 VTRTKDKDYTVKWFKKRR---ATTAEAIARHGAARAITEADKRIMRELQNGIKDAFFTFLKTKPTKVKG---VGLQKALS 147 (295) T ss_pred heeeeeeeeEEEeeeecc---cccHHHHHhcCCCchhHHHHHHHHHHHHHhhhHHHHHHhccCceeeeh---hhHHHHHH Confidence 8865 57777876533 3466553333 334 678889999999999999999999765544321 11122344 Q ss_pred HHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHHHhh-hhcccccceeeeeeeeeEeceE-EEeeCccccCCCc--- Q lcl|NC_011288. 143 LIATALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTS-ADTSGDAAGLRAGTIGNLLGAR-IVESNNLRDTDDE--- 217 (273) Q Consensus 143 ~i~~a~~~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~-~~~~~~~~~l~~G~ig~~~G~~-v~~s~~l~~~~~~--- 217 (273) .+..+...+.+.+- ...+++|+|...+.|+++...... +... +.+.| -++.|++ |++|+.+|.+... T Consensus 148 ~~~~al~~f~Ee~~--~~~V~FVnP~D~a~yl~~A~~~~~~a~~f-G~~~L-----~nfLG~q~II~S~kv~~G~~~aT~ 219 (295) T protein:vir:99 148 ASWAKLATFNEFEG--SPLVSFVSPLDVANYLGDTKVGADASNVF-GMTLL-----KNFLGMQNVIVMPSVPEGKIYSTA 219 (295) T ss_pred HhhhhhhhcccccC--CceEEEEehHHHHHHHhccccccchhhhh-hhhhh-----hhhhccceEEEcccCCCceEEEee Confidence 44444444444321 236899999999999987643222 1112 22233 3599997 9999999865431 Q ss_pred ---EEEEEcCceeE-Eeeee----ee---ehhhcCCCceeeeEEeeeeeeeEEe---cCceEEEEecCCC Q lcl|NC_011288. 218 ---QFVAFHPSAAA-YVSQI----DT---VEALRDQDSFSDRIRALHVYGGKVV---RPTGVVVFNKTGS 273 (273) Q Consensus 218 ---~~~~~~~~a~~-~a~~~----~~---~e~~~~~~~~~~~v~~~~~~g~~v~---~~~~~v~~~~~~s 273 (273) -.+++-+...+ ++... |+ +-...+.....--+.- ..+.+-++ ++++||+.+-++. T Consensus 220 ~~Ni~~ay~~~~~g~l~~~f~~~~D~tglIg~~h~~~~~~~t~et-~~~~~~~lfpE~~dgiv~~tI~~~ 288 (295) T protein:vir:99 220 VENLVFASLNVKGGDLGGLFADFTDETGLIAAARNRQLSNLTYES-VFFGANVLFAEIPEGVVEATIEAA 288 (295) T ss_pred ccceEEEEecCCchhhhhhhhhccCcccceEEEeccccceeeehh-hhHhHHHhcccccceEEEEEEecC Confidence 12222111101 11111 10 0011111111111111 22222223 5678886555443 No 124 >protein:vir:6212 Length: 434 # NCBI annotation: prohead protease # Family: family:all:21 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852592;genbank:gi:31415852;genbank:GeneID:1489210 Probab=99.22 E-value=2.7e-12 Score=83.97 Aligned_cols=264 Identities=12% Similarity=0.044 Sum_probs=145.2 Q ss_pred Ccc--c-----hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccce---eecCCCcccCCCCCcc Q lcl|NC_011288. 1 MAF--N-----NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVK---DYKAAGRQTSADAISD 70 (273) Q Consensus 1 MA~--~-----~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~---~~~~~~~~~~~~~~~~ 70 (273) +|. . .++|+.|...+++.++..+++..+++.- ..+..+.+|.....+.. ....++...+..++.. T Consensus 141 ~a~~~~t~~GG~lvP~~~~~~Ii~~l~~~~~i~~~~~~~-----~~~~~~~~p~~~~~~~a~~~~~~~e~~~~~~~~~~f 215 (434) T protein:vir:62 141 RALGLVTGNGSVTIPDFLSKEIITYAQEENFLRRLGTGV-----KTKENIKYPVLVKKAEAQGHKNERTNNEMPETDIEF 215 (434) T ss_pred hhhcccccccceecchhhHHHHHHhhhhhhhhhhhccee-----ccCCceEEEEEecCCcccceecccccccccccccce Confidence 221 1 4689999999999999998887777541 11234777775322211 1112233333344555 Q ss_pred ceEEEEEeeeeecceEEchHHHHhhhHHHHH-HHHHHHHHHHHHHHHHHHHHHhhc-c-----cccccccCCCHHHHHHH Q lcl|NC_011288. 71 TGVDLLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADLLVDN-G-----TALSGSAPTDADDAFDL 143 (273) Q Consensus 71 ~~~~~~id~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~~D~~i~~~~~~~-~-----~~~~~~~~~t~~~~~~~ 143 (273) +.+++...+. +.-+.|++.=..++..++.+ +.+..+++++.++|..++.=-... + .....+...+....+++ T Consensus 216 ~~v~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~~~~g~~~~~~~~~~~~~~~~~d~ 294 (434) T protein:vir:62 216 DEIELSPTEF-DALATVTKKLLARTGLPIEQIVMDELKKAYVRKETQYMVNGDEANNINDGALAKKAVEFKTDEKNLYDA 294 (434) T ss_pred eeEEeeheee-EeehhhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccceeecccccccccccchhhH Confidence 6666666443 23345555433445556766 567888999999999887311000 0 00111222344457889 Q ss_pred HHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCccccCCCc--EEE- Q lcl|NC_011288. 144 IATALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDE--QFV- 220 (273) Q Consensus 144 i~~a~~~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~~~~--~~~- 220 (273) |.++...|.....+ +-..|++|..+..|.+....-.+ ...........|.-..+.|.+|+.++.+|...+. ..+ T Consensus 295 l~~l~~~l~~~~~~--~a~~v~n~~~~~~L~~lkd~~G~-~l~~~~~~~~~g~~~tl~G~pV~~~~~~~~~~~~~~~~i~ 371 (434) T protein:vir:62 295 LVKMKNTPVKEVRK--KARWVLNTAALTKIETMKTDDGF-PLLRPFNQAEGGIGYTLLGFPVEEEDAIDIPDSPDTPVFY 371 (434) T ss_pred HHHHHhhcchhhhc--CCEEEEcHHHHHHHHHhhccCCC-EeeccCCCccCCCCceecceeeEEecCccCccCCCceEEE Confidence 99888888765533 33568899999988553211000 0000011122344457999999999988754432 222 Q ss_pred EEcCceeEEeeeeeeehhhcCCCce----eeeEEeeeeeeeEEec-CceEEEEecCCC Q lcl|NC_011288. 221 AFHPSAAAYVSQIDTVEALRDQDSF----SDRIRALHVYGGKVVR-PTGVVVFNKTGS 273 (273) Q Consensus 221 ~~~~~a~~~a~~~~~~e~~~~~~~~----~~~v~~~~~~g~~v~~-~~~~v~~~~~~s 273 (273) .|.-+......+...++..+....| ...+.+..+++++++. |+...+++-.+. T Consensus 372 ~Gdfs~~~i~~~~g~~~i~~~~~~~~~~~~v~~~~~~r~Dgk~i~~~~~~~~~~~~~~ 429 (434) T protein:vir:62 372 FGDFSKFYIQDVIGSLEVQKLVELFSRTNRVGFRIWNLLDAQLIHSPFEVPVYKYVLK 429 (434) T ss_pred EeeccceEEEEeeceeEEEeehhhhcccCceEEEEEeeecceeecCcccceEEEEEec Confidence 3333332222221112222222222 2357888999999775 887777754433 No 125 >protein:vir:2504 Length: 305 # NCBI annotation: major capsid subunit gp9 # Family: family:all:507 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569745;genbank:gi:18496895;genbank:GeneID:932268 Probab=99.21 E-value=9.3e-12 Score=81.06 Aligned_cols=256 Identities=14% Similarity=0.058 Sum_probs=146.2 Q ss_pred Cccc------hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeecCCCcccC-----CCCCc Q lcl|NC_011288. 1 MAFN------NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTS-----ADAIS 69 (273) Q Consensus 1 MA~~------~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~~~-----~~~~~ 69 (273) ||.. .++|+.++..+++.+++.+++.++++.- ...+.++++|+....+......++...+ ..++. T Consensus 1 ma~~t~~~gg~liP~~~~~~Ii~~~~~~s~l~~l~~~~----~~~~~~~~~p~~~~~~~a~wv~E~~~~~~~~~~~s~~~ 76 (305) T protein:vir:25 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNV----NMGTKTTHLPVLATLPEADWVGESATDPKGVKPTSKVT 76 (305) T ss_pred CCCccCCccceecCHHHHHHHHHHHHhhchhhhhccee----eccCCcEEEEEEeCCcceEEeecccccccccccccccc Confidence 9887 4689999999999999999888887531 2235678899876544444443433222 22344 Q ss_pred cceEEEEEeeeeecceEEchHHHHhhhHHHHH-HHHHHHHHHHHHHHHHHHHHHh------------hccccccc-c--- Q lcl|NC_011288. 70 DTGVDLLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADLLV------------DNGTALSG-S--- 132 (273) Q Consensus 70 ~~~~~~~id~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~~D~~i~~~~~------------~~~~~~~~-~--- 132 (273) .+.+++...+. +.-+.|++.-..++..+++. +.+..++++++++|..++.=-. ........ . T Consensus 77 f~~i~~~~~k~-~~~~~is~ell~ds~~~~~~~i~~~l~~~~a~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~ 155 (305) T protein:vir:25 77 WANRTLVAEEI-AVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVG 155 (305) T ss_pred eeeEEeeeEEE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHhhhheeccCCCCCccccccccccccccccccccc Confidence 55555555332 34456666433445566766 5677889999999998873100 00000000 0 Q ss_pred cCCCHHHHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCccc Q lcl|NC_011288. 133 APTDADDAFDLIATALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLR 212 (273) Q Consensus 133 ~~~t~~~~~~~i~~a~~~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~ 212 (273) ...+....++.+.++...+....... .-++++|..+..|.+- ++ .. +...+.. +.+.|++++.++.+| T Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~v~~~~~~~~l~~l----kd--~~-G~~i~~~---~~l~G~Pv~~~~~~~ 223 (305) T protein:vir:25 156 GVANESDIVGATNRAAKAVASAGWAP--DTLLSSLALRYEVANI----RD--AN-GNPVFRD---DSFAGFRTFFNRNGA 223 (305) T ss_pred cchhhhHHHHHHHHHHHhhhhccccc--ceeEecHHHHHHHHHh----hc--cC-CceeecC---CcccccceEEcCccC Confidence 11111223444555555444433211 1278899999988642 21 11 1122222 478999999999887 Q ss_pred cCCC-cEEEEEcCceeEEeeee-eeehhhcC---------CCce---eeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 213 DTDD-EQFVAFHPSAAAYVSQI-DTVEALRD---------QDSF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 213 ~~~~-~~~~~~~~~a~~~a~~~-~~~e~~~~---------~~~~---~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) ...+ ..++.+..+.+.+..+. ..++..+. .+.| ...+++..++|..+++|++++.+..+.. T Consensus 224 ~~~~~~~~~~gd~s~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~R~~~r~~~~v~~p~a~v~~~~~~~ 298 (305) T protein:vir:25 224 WDADAAIEVIADSSRVKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPV 298 (305) T ss_pred CCCCccEEEEEecceEEEEEecCeEEEEeeeeeeecCCceeeeeecCcEEEEEEEeecceeeCcccEEEEccccc Confidence 5433 23445544433333221 12221111 1112 2467888999999999999998877644 No 126 >protein:vir:4997 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049971;genbank:gi:9632943;genbank:GeneID:1262106 Probab=99.20 E-value=1.4e-11 Score=80.13 Aligned_cols=258 Identities=11% Similarity=-0.014 Sum_probs=150.7 Q ss_pred Cccc------hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcc-cceeecCCCcccCC-CCCccce Q lcl|NC_011288. 1 MAFN------NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAP-TVKDYKAAGRQTSA-DAISDTG 72 (273) Q Consensus 1 MA~~------~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~-~~~~~~~~~~~~~~-~~~~~~~ 72 (273) |+.. .++|+.|...+++.+++..++.+++..-.. ....-++.+|+.... +......++..... +....+. T Consensus 109 ~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~--~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~ 186 (397) T protein:vir:49 109 KTDGSGSDAGLTIPQDIRTAINTLVRQFDSLQEYVNVENV--TTLTGSRVYEKWADITGLAKLDDEGGQIGQNDDPKLSL 186 (397) T ss_pred hhccCCccCcceecHHHHHHHHHHHHhhhhHhhhcceeec--cCCcceEEEEeeccCCcceeeeccccccccccccceee Confidence 3322 568999999999999999888777654211 111123455554332 33444555444332 2345567 Q ss_pred EEEEEeeeeecceEEchHHHHhhhHHHHH-HHHHHHHHHHHHHHHHHHHHHhhcccccccccCCCHHHHHHHHHHHHHHH Q lcl|NC_011288. 73 VDLLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADLLVDNGTALSGSAPTDADDAFDLIATALKEL 151 (273) Q Consensus 73 ~~~~id~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~~D~~i~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~~~l 151 (273) +++++.+. +.-+.|+..=..++..++.. +.+.+.++++.++|..++.-. +.... .+....+++|.++...+ T Consensus 187 v~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~ail~G~---g~~~~----~~~~~~~d~i~~~~~~l 258 (397) T protein:vir:49 187 IRYAIKRY-AGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILEAI---GTLPN----KPTLAKWDDIIDLQAKV 258 (397) T ss_pred eEeeeeee-EeehhhHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcc---ccccc----cccccCHHHHHHHHHhh Confidence 77777543 33345665433444556765 667888999999998876322 11111 12223467888888888 Q ss_pred hhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeC--ccccCCC--cEEEEEc-Cce Q lcl|NC_011288. 152 TKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESN--NLRDTDD--EQFVAFH-PSA 226 (273) Q Consensus 152 ~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~--~l~~~~~--~~~~~~~-~~a 226 (273) +....+ .-.++++|..+..|.+......+ ..-...+..|.-++++|++|+.+. .+|..+. ..++.+. +.+ T Consensus 259 ~~~~~~--~a~~v~n~~~~~~l~~lkd~~g~---~l~~~~~~~g~~~~l~G~pV~~~~~~~~~~~~~~~~~~~~gd~~~~ 333 (397) T protein:vir:49 259 DPAIKQ--TSLFLTNTSGFTALKKVKNAMGD---YLMERDVKSPTGYSIDGFVVKEISDRFLPNGTGGAMPLYFGDLKQA 333 (397) T ss_pred hhhhcC--CCEEEEcHHHHHHHHHhhccCCc---eeecccccCCCCceecceeeEEecccccccccCCceeEEEeeccce Confidence 877654 35789999999988653211110 001112345656789999998754 3454322 2233443 444 Q ss_pred eEEeee-eeeehhhcCC----CceeeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 227 AAYVSQ-IDTVEALRDQ----DSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 227 ~~~a~~-~~~~e~~~~~----~~~~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) +....+ -..++..+.. ..-...+++.+++|.++++|++++.++-++. T Consensus 334 ~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~d~~~~~~~a~~~~~~~~~ 385 (397) T protein:vir:49 334 VTLFDRQHLSLLSTNIGGGAFETDTTKVRVIDRFDVVSTDTEAFVPASFKAI 385 (397) T ss_pred EEEEeecccEEEEeccccchhhcCeeeEEEEEeeccEEecccceEEEEeccc Confidence 444432 2233332221 1224578999999999999999988864444 No 127 >protein:vir:4953 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049929;genbank:gi:9632900;genbank:GeneID:1262076 Probab=99.20 E-value=2.1e-11 Score=79.16 Aligned_cols=258 Identities=11% Similarity=-0.016 Sum_probs=153.4 Q ss_pred Cccc------hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcc-cceeecCCCcccC-CCCCccce Q lcl|NC_011288. 1 MAFN------NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAP-TVKDYKAAGRQTS-ADAISDTG 72 (273) Q Consensus 1 MA~~------~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~-~~~~~~~~~~~~~-~~~~~~~~ 72 (273) |+.. .++|+-|...+++.+++..++.++++..- .....|+ +.+|..... +......++.... .+.+..+. T Consensus 109 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~-~~~~~~~-~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~ 186 (397) T protein:vir:49 109 KTDASGSDAGLTIPQDIQTAIHTLVSQYDSLQEYVNVEN-VTTLTGS-RVYEKWTDITGLANIDDEAGKIADVDDPKLSL 186 (397) T ss_pred hhccccccCcccccHhHHHHHHHHHHhhhhHHhhhceee-cccCccc-eEEEeeccCCcceeeecCccccccccccceee Confidence 3322 46899999999999999998888875421 1112233 445554432 2234444444443 24566777 Q ss_pred EEEEEeeeeecceEEchHHHHhhhHHHHH-HHHHHHHHHHHHHHHHHHHHHhhcccccccccCCCHHHHHHHHHHHHHHH Q lcl|NC_011288. 73 VDLLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADLLVDNGTALSGSAPTDADDAFDLIATALKEL 151 (273) Q Consensus 73 ~~~~id~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~~D~~i~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~~~l 151 (273) +++++.+. +.-+.|++.=..++..++.. +.++.++++++.+|..++.-.... . .......+++|.++...+ T Consensus 187 i~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~ai~~G~g~~---~----~~~~~~~~d~i~~~~~~l 258 (397) T protein:vir:49 187 IKYTIKRY-AGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILEAIAAL---P----TKPTLTKWDDIIDLEAKV 258 (397) T ss_pred EEeeeeeE-EeeehhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhhcccc---c----cccccccHHHHHHHHHhh Confidence 77777443 44456666444444566766 567888999999999877532211 1 111223467888888888 Q ss_pred hhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeC--ccccCC-Cc-EEEEEc-Cce Q lcl|NC_011288. 152 TKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESN--NLRDTD-DE-QFVAFH-PSA 226 (273) Q Consensus 152 ~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~--~l~~~~-~~-~~~~~~-~~a 226 (273) ..+..+ +-.++++|..+..|.+...... .......+..|.-++|.|++|+.+. .+|..+ +. .++.+. +.+ T Consensus 259 ~~~~~~--~a~~vmn~~~~~~l~~lkd~~G---~~l~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~i~~gd~~~~ 333 (397) T protein:vir:49 259 DPAIKQ--TSFFLTNTSGFTALKKVKNALG---DYLMERDVKSPTGYSIDGFAVKEVADRWLANGTGGAMPLYFGDLKQA 333 (397) T ss_pred hhhhcC--CCEEEEcHHHHHHHHHhhcCCC---ceeeccCcCCCCCceecceeeEEecccccccccCCceeEEEeeccce Confidence 777653 4578999999999866321111 1111112345566789999998744 345432 22 234443 334 Q ss_pred eEEee-eeeeehhhcC----CCceeeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 227 AAYVS-QIDTVEALRD----QDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 227 ~~~a~-~~~~~e~~~~----~~~~~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) +.... +...++..+. -......+++.+++|.++++|++++.++-+++ T Consensus 334 ~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~ 385 (397) T protein:vir:49 334 VTLFDRQHMSLLSTNIGGGAFETDTTKVRVIDRFDVVATDTEAFVPASFKAI 385 (397) T ss_pred EEEEeecceEEEEeccccchhhcCceeEEEEeeeCcEEecccceEEEEeecc Confidence 43332 2223333222 12234578999999999999999998886555 No 128 >protein:vir:108211 Length: 318 # NCBI annotation: gp9 # Family: family:all:6420 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552338;genbank:gi:160700658;genbank:GeneID:5758931 Probab=99.20 E-value=2.8e-12 Score=83.94 Aligned_cols=260 Identities=15% Similarity=0.086 Sum_probs=154.2 Q ss_pred CccchhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEee----cCcccceeecCCCcccCCCCCccceEEEE Q lcl|NC_011288. 1 MAFNNFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAG----VVAPTVKDYKAAGRQTSADAISDTGVDLL 76 (273) Q Consensus 1 MA~~~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 76 (273) |++ |+++..++.+.+++..+...++ ++.. -..+-.+.+.. .......+..+++. ........+...+- T Consensus 22 l~~----P~~I~~~i~e~~~~~~iad~lf-~~~~--a~~~~~v~f~~~~p~~~~~d~e~VaEggE-iP~~~~~~G~~~ia 93 (318) T protein:vir:10 22 VGN----PLWIPTALKKMMVNQFISESLF-RNGG--ANPNGVVAYNEGNPSFLEDDVADVAEFGE-IPVSAGARGLPRTA 93 (318) T ss_pred hCC----chhHHHHHHHHHhccchhhhhh-hccc--ccccceeEEEecccccccCcHhhccCccc-ccccCCCCCchhhh Confidence 333 6777777777665555444444 3321 12355777744 22233344333333 33344555455443 Q ss_pred EeeeeecceEEchHHHHhhhHH-HHHHHHHHHHHHHHHHHHHHHHHHhhccccc-ccccC-C-------CHHHHHHHHHH Q lcl|NC_011288. 77 IDQEKSIDFLVDDIDRVQVAGS-LEAYTRAGATALATDTDKFIADLLVDNGTAL-SGSAP-T-------DADDAFDLIAT 146 (273) Q Consensus 77 id~~~~~~~~i~d~d~~~~~~~-~~~~~~~~~~ala~~~D~~i~~~~~~~~~~~-~~~~~-~-------t~~~~~~~i~~ 146 (273) .-+..+..+.|+++.......+ ++..+++++.+++++.|+..+..+..+.... ..+++ . +.....+.+.. T Consensus 94 ~~~K~G~~~~vS~Em~~~n~~~~v~r~~~~l~Nti~r~~d~~a~dal~sa~t~~~~~s~~w~~~~~~~~d~~~A~e~v~~ 173 (318) T protein:vir:10 94 FAVKKALGVRVSKEMIDENRVGAVNDQMLQLRNTFIRANDRSAKALLQSPIVPTLAVPTAWDNGGKVRTDIAIAIEQIST 173 (318) T ss_pred hhehhccceeccHHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccCCcCCCCcccccccchhhhhhhhh Confidence 3334578899999776655555 6889999999999999999998886543211 11111 0 11112222222 Q ss_pred HHHHHhhcCC-------CccCCEEEECHHHHHHHhhhHHHHhhhhcccccce----e-eeeee-eeEeceEEEeeCcccc Q lcl|NC_011288. 147 ALKELTKANV-------PNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAG----L-RAGTI-GNLLGARIVESNNLRD 213 (273) Q Consensus 147 a~~~l~~~~v-------P~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~----l-~~G~i-g~~~G~~v~~s~~l~~ 213 (273) +..-+..+.. .-.--.+|++|..+..|++++.+ ..... +..+. + ..|-+ |+++|++|+.|.++|. T Consensus 174 a~~~~~~a~~~~~~~~~GY~pdtIVlhP~~~~~l~~n~~~-~~~y~-~~a~~~~~~~~~tg~~~g~~lGl~vi~s~~~p~ 251 (318) T protein:vir:10 174 AAPTAYPAGVGSSDEYFGFIPDTIVMHYALLPILMDNENF-MKVYE-RNANYVSTAPDWTGNFPGSVMGLNVIRSRTFPI 251 (318) T ss_pred hhhhhhhhhhhhhhhccCccceeeEECHHHHHHHhcchhh-hhhhh-ccchhhhhcccccccccceeeceEEeecCccCC Confidence 2222211111 11113799999999999988754 22221 11111 1 13555 6789999999999996 Q ss_pred CCCcEEEEEcCceeEEee-ee-eeehhhcCC-------CceeeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 214 TDDEQFVAFHPSAAAYVS-QI-DTVEALRDQ-------DSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 214 ~~~~~~~~~~~~a~~~a~-~~-~~~e~~~~~-------~~~~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) .. +++..++.+|+-. .. .+++..|++ ...+..++.++.....|.+|.+++.|+-=+| T Consensus 252 ~~---alvlq~g~vG~~~d~~pl~~t~~~~egg~~~g~~~~s~~~~~~~~~~~~V~~PkA~~~itgi~~ 317 (318) T protein:vir:10 252 DR---VLIMERGTVGFYSDTRPLQFTALYPEGNGPNGGPTESYRADASHKRALAVDQPKAALWLTGIVT 317 (318) T ss_pred Ce---eEEEecCCcceeeccccceeeecccCCCCCCCCcchhhheehheeeeeeeeCcceeEEEeeccC Confidence 43 5666677777543 11 234556654 3345788999999999999999999999999 No 129 >protein:vir:96762 Length: 632 # NCBI annotation: putative phage-related protein # Family: family:all:21 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039818;genbank:gi:126010917;genbank:GeneID:5076272 Probab=99.19 E-value=6.3e-12 Score=82.00 Aligned_cols=257 Identities=11% Similarity=0.033 Sum_probs=152.4 Q ss_pred Cccc------hhhH-HHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeecCCCcccCCCCCccceE Q lcl|NC_011288. 1 MAFN------NFIP-ELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGV 73 (273) Q Consensus 1 MA~~------~~~p-ev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~~~~~~~~~~~~ 73 (273) |... .++| +++++.+++.+++.+++..+-.+-. ....| .++||+....+......++......++..+.+ T Consensus 357 ~~~~t~~~gg~lvp~~~~~~~iie~lr~~s~i~~l~~~~~--~~~~g-~~~ip~~~~~~~a~wv~E~~~~~~s~~~f~~i 433 (632) T protein:vir:96 357 LEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARML--PGLVG-DVDIPKKTSGANFYWIGEDEDVQDSDFDFTTL 433 (632) T ss_pred hhcccccccccccccccchHHHHHHHhhcchhhhhcceEe--ecCCc-ceEEEEEeCCceeEeecCCccccccccceeeE Confidence 1111 2455 5567889999988887766522211 12223 58889876554455556666666666777777 Q ss_pred EEEEeeeeecceEEchHHHHhhhHHHHH-HHHHHHHHHHHHHHHHHHHHHhhc--ccccc------cccCCCHHHHHHHH Q lcl|NC_011288. 74 DLLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADLLVDN--GTALS------GSAPTDADDAFDLI 144 (273) Q Consensus 74 ~~~id~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~~D~~i~~~~~~~--~~~~~------~~~~~t~~~~~~~i 144 (273) ++...+. +.-+.|+..=..++..+++. +.+.+..+++.++|..++.--... +.... ..+..+....++.| T Consensus 434 ~l~~~k~-~~~v~iS~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~~~~~~i 512 (632) T protein:vir:96 434 SFSPKTI-AGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTYPAGGVDWASV 512 (632) T ss_pred EeeeeEE-EEehhhHHHHHhccchHHHHHHHHHHHHHHHHHHHHHhhcccCCCCccceeeecccccceecccccCCHHHH Confidence 7777443 23345554323344556766 557788999999999877321101 11110 00111223457788 Q ss_pred HHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCccccCCCcEEEEEcC Q lcl|NC_011288. 145 ATALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHP 224 (273) Q Consensus 145 ~~a~~~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~~~~~~~~~~~ 224 (273) .++...+...++...+-..+++|..+..|.... +. +.. +..+..+ +.+.|++++.|+.+|... ++.|.- T Consensus 513 ~~~~~~i~~~~~~~~~~~~~~~~~~~~~l~~~~--l~--d~~--G~~i~~~--~~l~G~pv~~s~~ip~~~---~~~gd~ 581 (632) T protein:vir:96 513 VDMETKISTFNADAGRLAYLTSVTQRGAAKKAQ--VF--DNT--GERIWQN--NEVNGYRAEASNQIPADT---WIFGDW 581 (632) T ss_pred HHHHHHHhhcccccCccEEEEchhHHHHHHHHh--cc--CCC--CceeecC--CeecccceEeccccccCc---EEEeec Confidence 888888888876545556788998887775432 21 222 2223322 578899999999998542 333332 Q ss_pred ceeEEeee-ee--eehhhcCCCceeeeEEeeeeeeeEEecCceEEEEecCC Q lcl|NC_011288. 225 SAAAYVSQ-ID--TVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTG 272 (273) Q Consensus 225 ~a~~~a~~-~~--~~e~~~~~~~~~~~v~~~~~~g~~v~~~~~~v~~~~~~ 272 (273) +-+-++.. .. .+..+.....-...+++.++++.++.+|+++++++..+ T Consensus 582 s~~~i~~~~~~~i~~~~~~~~~~~~v~~~~~~~~d~~v~~~~af~~~k~~A 632 (632) T protein:vir:96 582 SQIVIAMWGVLDLKVDPYTKAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) T ss_pred ceEEEEEecceEEEEccccccccCceEEEEEeecCceeechhhhhheeecC Confidence 22212211 11 22222222333468899999999999999999999999 No 130 >protein:vir:81160 Length: 371 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285811;genbank:gi:148747732;genbank:GeneID:5247203 Probab=99.19 E-value=1.9e-11 Score=79.41 Aligned_cols=257 Identities=14% Similarity=0.058 Sum_probs=149.0 Q ss_pred Cccc------hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeecCCCcccC-CCCCccceE Q lcl|NC_011288. 1 MAFN------NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTS-ADAISDTGV 73 (273) Q Consensus 1 MA~~------~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~~~-~~~~~~~~~ 73 (273) |+.. .++|+-+...+++.+++.+++.+++.... .....-++.+++....+......++.... ...++.+.+ T Consensus 91 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~i~~~~~~~~--~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~i 168 (371) T protein:vir:81 91 MSEGSNQDGGYTVPQDIQTRINELRESKDALQNLITVEP--VTTLSGSRVFKKRSQQTGFVEVAEGAAIGEKATPQFTLL 168 (371) T ss_pred hccCCCccCceeecHhHHHHHHHHHHhhhhhhhhceeee--ccCCceeEEEEeecCCcceeeeccccccccccccceeeE Confidence 4322 46899999999999999998888775321 11122345555554433343444444332 344666777 Q ss_pred EEEEeeeeecceEEchHHHHhhhHHHHH-HHHHHHHHHHHHHHHHHHHHHhhcccccccccCCCHHHHHHHHHHHH-HHH Q lcl|NC_011288. 74 DLLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADLLVDNGTALSGSAPTDADDAFDLIATAL-KEL 151 (273) Q Consensus 74 ~~~id~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~~D~~i~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~-~~l 151 (273) +++..+. +.-+.|++.-..++..++.+ +.+..++++++.+|..++.-..... .++...++++..+. ..| T Consensus 169 ~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~a~~~~~~~~i~~g~g~~~--------~~~~~~~~~i~~~~~~~l 239 (371) T protein:vir:81 169 QYQVKKY-AGFFRVTNELLNDSTEAIVNTLVRWIGDESRVTRNGLIINVLNTKA--------KTAIADLDGLKQIINVQL 239 (371) T ss_pred EeeeeEE-EEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccc--------ccccccHHHHHHHHHhhc Confidence 7777553 33456666544444556766 5677888999999988775322111 11122344555443 234 Q ss_pred hhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCccccCCC---------cEEEEE Q lcl|NC_011288. 152 TKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDD---------EQFVAF 222 (273) Q Consensus 152 ~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~~~---------~~~~~~ 222 (273) ..... .+-..+++|..+..|.+-... + +.......+..|.-++++|.+|+.++++|.+.. ..++.| T Consensus 240 ~~~~~--~~a~~vmn~~~~~~L~~lkd~--~-g~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~~~~~i~~G 314 (371) T protein:vir:81 240 DPVFR--STSSVIVNQDAFNWLDTLKDQ--N-GQYLLQPSISSPTGRQLLGLPVVIVSNKVLANRVDGGTGAQFAPIIVG 314 (371) T ss_pred chhhh--cCCEEEEcHHHHHHHHHhhcc--C-CCeeeecccCCCCCceecceeEEEecccccCccccccccCCcceEEEE Confidence 33332 344789999999988653211 0 011111223456668999999999998874321 123344 Q ss_pred c-CceeEEeee-eeeehhhcCC-Cce---eeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 223 H-PSAAAYVSQ-IDTVEALRDQ-DSF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 223 ~-~~a~~~a~~-~~~~e~~~~~-~~~---~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) . +.++....+ ...++..+.. +.| ...+++.+++|.++.+|+++++++-+.| T Consensus 315 d~~~~~~~~~~~~~~i~~~~~~~~~f~~~~v~~~~~~r~d~~~~~~~a~~~~~~~~A 371 (371) T protein:vir:81 315 DLKEAVVMFDRQRTEIMSSNVAMDAFETDATLWRAIERMDVKMRDDEAFVFGEVQLA 371 (371) T ss_pred ehhceEEEEeecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEEecC Confidence 3 222222221 1122222221 122 3588999999999999999999999999 No 131 >protein:vir:3991 Length: 404 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116499;genbank:gi:14251132;genbank:GeneID:921252 Probab=99.17 E-value=2.5e-11 Score=78.73 Aligned_cols=258 Identities=12% Similarity=0.014 Sum_probs=146.0 Q ss_pred Cccc------hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcc-cceeecCCCcccC-CCCCccce Q lcl|NC_011288. 1 MAFN------NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAP-TVKDYKAAGRQTS-ADAISDTG 72 (273) Q Consensus 1 MA~~------~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~-~~~~~~~~~~~~~-~~~~~~~~ 72 (273) |... .++|+.|...+++.+++.+++..+++.- . .....-++.+++.... .......++.... .+.+..+. T Consensus 116 ~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~-~-~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~ 193 (404) T protein:vir:39 116 ETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVE-S-VSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPRLTI 193 (404) T ss_pred hhcccccCCceeccHHHHHHHHHHHHhhhhHHhhccee-e-ccCCcceEEEEeecCCccceeeecCccccccccccceee Confidence 2221 3689999999999999999888877431 1 1111123334433322 2233344444332 34466777 Q ss_pred EEEEEeeeeecceEEchHHHHhhhHHHHH-HHHHHHHHHHHHHHHHHHHHHhhcccccccccCCCHHHHHHHHHHHHH-H Q lcl|NC_011288. 73 VDLLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADLLVDNGTALSGSAPTDADDAFDLIATALK-E 150 (273) Q Consensus 73 ~~~~id~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~~D~~i~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~~-~ 150 (273) +++++.+. +.-+.|++.-..++..++.+ +.+...+++++++|..++.-. +.....+ ....++++..+.. . T Consensus 194 i~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~il~g~---g~~~~~~----~~~~~~~i~~~~~~~ 265 (404) T protein:vir:39 194 IKYLIKRY-AGIITATNTLLKDTAENILAWLSSWIAKKVVVTRNQAIIAAM---GTVPKKP----TIAKFDDVITMINTS 265 (404) T ss_pred EEeeeeeE-EeeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHhcc---ccccccc----ccccHHHHHHHHHHh Confidence 77777554 34456666444445566766 567888999999999877432 1111111 1223556665543 3 Q ss_pred HhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCc--cccCCC--cEEEEEcC-c Q lcl|NC_011288. 151 LTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNN--LRDTDD--EQFVAFHP-S 225 (273) Q Consensus 151 l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~--l~~~~~--~~~~~~~~-~ 225 (273) ++.... .+-.++++|..+..|........+ ..-...+..|.-++|.|++|+.+.+ +|..+. ..++.+.- . T Consensus 266 ~~~~~~--~~a~~v~n~~~~~~L~~lkd~~G~---~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~~~gd~~~ 340 (404) T protein:vir:39 266 VDPAII--ATSSLLTNQSGLNKLALVKTAEGK---YLLEPDPTKPNSYLIKGKKVIVVADRWLPNSGSTVYPLYYGDMSQ 340 (404) T ss_pred hhhhhc--cCCEEEEcHHHHHHHHHhhccCCc---eeeccCcCCCCcceecceeEEEecccccCccCCCccEEEEEeccc Confidence 433322 234689999999998753211110 1011123345567899999998654 443322 23455543 3 Q ss_pred eeEEee-eeeeehhhcCC----CceeeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 226 AAAYVS-QIDTVEALRDQ----DSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 226 a~~~a~-~~~~~e~~~~~----~~~~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) ++.... +...++..+.. .+....+++.+++|+.+++|++++.++-+++ T Consensus 341 ~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~ 393 (404) T protein:vir:39 341 AITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKTTDSEALVAGSFTAI 393 (404) T ss_pred cEEEEeecceEEEEeccchhhhhhceeeEEEEeeeccEEecccceEEEEeecc Confidence 343332 22333333322 1234678899999999999999999885555 No 132 >protein:vir:1383 Length: 421 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612835;genbank:gi:20065969;genbank:GeneID:935826 Probab=99.16 E-value=2e-11 Score=79.22 Aligned_cols=255 Identities=8% Similarity=0.003 Sum_probs=152.5 Q ss_pred Cccc----hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccce--eecCCCcccCCCCCccceEE Q lcl|NC_011288. 1 MAFN----NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVK--DYKAAGRQTSADAISDTGVD 74 (273) Q Consensus 1 MA~~----~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~--~~~~~~~~~~~~~~~~~~~~ 74 (273) +... .++|+-|...+++.++...++.++++.- ...+.++++|........ ....++......++..+.++ T Consensus 116 ~~t~~~gg~liP~~~~~~Ii~~~~~~~~l~~l~~~~----~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~s~~~f~~i~ 191 (421) T protein:vir:13 116 IMSSTNNGAVIPQEFVNEFEKLKEGYPSLKEHCHVI----PVNRNAGKMPVRAGASVDKLANLAKDTELVKAMLKTQPMA 191 (421) T ss_pred ccccCCcceecchhhHHHHHHHHHhhhhhhhhceee----eccCCceEEEEeecCCccceeeccccccccccccceeEEE Confidence 2111 4789999999999999888887777531 123456777765443222 12333444444556667777 Q ss_pred EEEeeeeecceEEchHHHHhhhHHHHH-HHHHHHHHHHHHHHHHHHHHHhhcccccccccCCCHHHHHHHHHHHHHHHhh Q lcl|NC_011288. 75 LLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADLLVDNGTALSGSAPTDADDAFDLIATALKELTK 153 (273) Q Consensus 75 ~~id~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~~D~~i~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~~~l~~ 153 (273) +++.+. +.-+.|++.=..++..++.. +.+..+++++..+|..+++....... .+....+++|.++...|.. T Consensus 192 ~~~~k~-~~~v~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~i~~~~~g~~~-------~~~~~~~d~i~~~~~~l~~ 263 (421) T protein:vir:13 192 YDIDDY-GLLAPIDNSLLEDSEINFLEFVNEEFAEFAVNTENAEIVKQAKAVLA-------EETINDYAGLVKTINSLVP 263 (421) T ss_pred eeeeee-EeehhhhHHHHhhhHHHHHHHHHHHHHHHHHHHhhhhHhhhhhhccc-------cccccchHHHHHHHHHhhh Confidence 777543 33445665444445556766 55778888999999888865543221 1122346788888888877 Q ss_pred cCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCccccCCC--cEEEEEcCc-eeEEe Q lcl|NC_011288. 154 ANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDD--EQFVAFHPS-AAAYV 230 (273) Q Consensus 154 ~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~~~--~~~~~~~~~-a~~~a 230 (273) +..+ +-.+|++|..+..|......-.+ ... .....|.-+++.|.+|+.++++|..++ ..++.|.-+ ++... T Consensus 264 ~~~~--~a~~v~n~~~~~~l~~lkd~~G~-~i~---~~~~~~~~~tl~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~ 337 (421) T protein:vir:13 264 NARK--RAIIVTNSDGRAYLDGLMDKQGR-PLL---KELSDGGDLVFKGRPVIELEESIFDVGDETKFIVSDFKTLIKFM 337 (421) T ss_pred hhcC--CCEEEEcHHHHHHHHHhhcCCCc-eee---cCcCCCCCceecceeeEEeccccccCCCceEEEEEeccccEEEE Confidence 6654 34678999999988653211000 011 112345567899999999998875433 334555433 33333 Q ss_pred e-eeeeehhhcCCCce--eeeEEeeeeeeeEEecCceEEEEecCC--C Q lcl|NC_011288. 231 S-QIDTVEALRDQDSF--SDRIRALHVYGGKVVRPTGVVVFNKTG--S 273 (273) Q Consensus 231 ~-~~~~~e~~~~~~~~--~~~v~~~~~~g~~v~~~~~~v~~~~~~--s 273 (273) . +...++..+...+. -..+++.+++|.++.+|+++..+...- . T Consensus 338 ~~~~~~v~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~a 385 (421) T protein:vir:13 338 DRKQYLIDQSKEAGYTKNETIARIIERFDVNSPLDKSSDAEKIRKFGV 385 (421) T ss_pred EecceEEEeecccccccCeeEEEEEeeecceeecchhhheeeecccce Confidence 2 33344444443322 247889999999999999865443321 1 No 133 >protein:vir:105038 Length: 428 # NCBI annotation: major capsid head protein precursor # Family: family:all:21 # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006586;genbank:gi:46402092;genbank:GeneID:2777903 Probab=99.16 E-value=4.8e-11 Score=77.14 Aligned_cols=262 Identities=14% Similarity=0.078 Sum_probs=145.5 Q ss_pred Cccc-------hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeecCCCcccCCCCCccceE Q lcl|NC_011288. 1 MAFN-------NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGV 73 (273) Q Consensus 1 MA~~-------~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~~~~~~~~~~~~ 73 (273) ++.. .++|+-|...+++.+++.+++.++..+-. ....| .+.+|+....+......++...+..++..+.+ T Consensus 125 ~~~~~~~~~gg~liP~~~~~~ii~~l~~~~~l~~~~~~~~--~~~~g-~~~~p~~~~~~~a~~v~Eg~~~~~~~~~f~~i 201 (428) T protein:vir:10 125 MAISTAAGSGGVLIPQNIHSEVIELLRDRTIVRKLGARSI--PLPNG-NMSLPRLAGGATASYTGENQDAKVSEARFDDV 201 (428) T ss_pred hhhcccccCCccccchhHHHHHHHHHhhhchhhhhcceee--ecCCc-ceEEEEEeCCcceeeeccCccccccccceeeE Confidence 2211 46899999999999998888766632211 11223 37888875544444555666666666777777 Q ss_pred EEEEeeeeecceEEchHHHHhhhHHHHH-HHHHHHHHHHHHHHHHHHHHHhhc--cccc---------ccccCCCHHHHH Q lcl|NC_011288. 74 DLLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADLLVDN--GTAL---------SGSAPTDADDAF 141 (273) Q Consensus 74 ~~~id~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~~D~~i~~~~~~~--~~~~---------~~~~~~t~~~~~ 141 (273) ++...+. +.-+.|++.-..++..++.. +.+..+++++.++|..++.--... +... ...........+ T Consensus 202 ~~~~~k~-~~~v~is~ell~ds~~~l~~~i~~~l~~ai~~~~d~~~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~~~~~~~ 280 (428) T protein:vir:10 202 KLTAKTM-IAMVPISNALIGRAGFNVEQLVLQDILTAISVREDKAFMRDDGTGDTPIGMKARATQWNRLLPWAADAAVNL 280 (428) T ss_pred EeeeEEE-EEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccccccccccccccH Confidence 7777443 34456666544455566766 557888999999999876310000 0000 000011111122 Q ss_pred HHHHHHHHHH----hhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCccccCCC- Q lcl|NC_011288. 142 DLIATALKEL----TKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDD- 216 (273) Q Consensus 142 ~~i~~a~~~l----~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~~~- 216 (273) +.+......+ ........+-..+++|..+..|.+... ..| ...+....=|++.|.+|+.++.+|...+ T Consensus 281 ~~~~~~~~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd------~~G-~~i~~~~~~g~l~G~pv~~~~~~p~~~~~ 353 (428) T protein:vir:10 281 DTIDTYLDSIILMSMDGNSNMISSGWGMSNRTYMKLFGLRD------GNG-NKVYPEMAQGMLKGYPIQRTSAIPANLGE 353 (428) T ss_pred HHHHHHHHHHHHhhhccccccccCEEEEcHHHHHHHHHhhc------cCC-ceeccCCCCCeeeceeeEEeccccccccC Confidence 2222222222 111111223456889999988855321 111 1111111225799999999999885322 Q ss_pred ----cEEEEEcCceeEEeee-eeeehhhcCC----------Cce---eeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 217 ----EQFVAFHPSAAAYVSQ-IDTVEALRDQ----------DSF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 217 ----~~~~~~~~~a~~~a~~-~~~~e~~~~~----------~~~---~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) ..++.|+.+.+-+... ...++..+.. ..| ...+++-+++|..+.+|+++++++...= T Consensus 354 ~~~~~~i~~gd~s~~~i~~~~~i~i~~~~~~~~~~~~~~~~~~f~~~~~~~R~~~r~d~~v~~p~a~~~~t~~~~ 428 (428) T protein:vir:10 354 GGKESEIYFADFNDVVIGEDGNMKVDFSKEASYIDTDGKLVSAFSRNQSLIRVVTEHDIGFRHPEGLVLGTGVLF 428 (428) T ss_pred CCccceEEEEecceEEEEEecceEEEeecccccccccccccchhhcchhheeeeeeeCceeeccceEEEEeccCC Confidence 2234555444333321 1122222221 112 2478999999999999999999977777 No 134 >protein:vir:1025 Length: 408 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076679;genbank:gi:13095788;genbank:GeneID:920362 Probab=99.15 E-value=2.7e-11 Score=78.52 Aligned_cols=258 Identities=11% Similarity=0.005 Sum_probs=146.5 Q ss_pred Ccc------chhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcc-cceeecCCCcccC-CCCCccce Q lcl|NC_011288. 1 MAF------NNFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAP-TVKDYKAAGRQTS-ADAISDTG 72 (273) Q Consensus 1 MA~------~~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~-~~~~~~~~~~~~~-~~~~~~~~ 72 (273) |.. -.++|+.|+..+++.+++..++.++++.-. .....-++.+|+.... .......++.... .+.+.-+. T Consensus 116 ~~~~t~~~gg~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~--~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~ 193 (408) T protein:vir:10 116 ETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVES--VSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTI 193 (408) T ss_pred hhcccccCCceeccHhHHHHHHHHHHhhchhhhhcceee--ccCCcceEEEeeccccccceeeecCccccccccCcceee Confidence 211 146899999999999999998888775421 1111223444444332 2233344443332 23356677 Q ss_pred EEEEEeeeeecceEEchHHHHhhhHHHHH-HHHHHHHHHHHHHHHHHHHHHhhcccccccccCCCHHHHHHHHHHHH-HH Q lcl|NC_011288. 73 VDLLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADLLVDNGTALSGSAPTDADDAFDLIATAL-KE 150 (273) Q Consensus 73 ~~~~id~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~~D~~i~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~-~~ 150 (273) +++...+. +.-+.|+..-..++..++.. +.+..+++++.+.|..++.-..... . ......++++..+. .. T Consensus 194 i~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~~~---~----~~~~~~~~~l~~~~~~~ 265 (408) T protein:vir:10 194 IKYLIKRY-AGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVMKAAP---K----KPTIAKFDDVITMINTA 265 (408) T ss_pred EEeeeeeE-EeeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhcccccc---c----ccccccHHHHHHHHHHh Confidence 77777543 33456666444445566766 5677888999999988775433221 1 11123355666554 34 Q ss_pred HhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeC--ccccCCCc--EEEEEc-Cc Q lcl|NC_011288. 151 LTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESN--NLRDTDDE--QFVAFH-PS 225 (273) Q Consensus 151 l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~--~l~~~~~~--~~~~~~-~~ 225 (273) ++.... .+-..+++|..+..|.+....-. .......+.+|.-++|.|++|+.++ .+|..+.. .++.+. +. T Consensus 266 ~~~~~~--~~a~~v~n~~~~~~l~~lkd~~G---~~i~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~i~~gd~~~ 340 (408) T protein:vir:10 266 VDPAII--ATSSLLTNQSGLNKLALVKTAEG---KYLLEPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQ 340 (408) T ss_pred hhhhhc--cCCEEEEcHHHHHHHHHhhccCC---ceEeccCcCCCCCceecceeeEEecccccCccCCCceEEEEEehhc Confidence 443322 34468899999999876432111 1111122445666799999999865 35543332 234444 33 Q ss_pred eeEEee-eeeeehhhcCC-C---ceeeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 226 AAAYVS-QIDTVEALRDQ-D---SFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 226 a~~~a~-~~~~~e~~~~~-~---~~~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) ++.... +...++..+.. . +-...+++.+++|+++++|++++.++-+.. T Consensus 341 ~~~~~~~~~~~v~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~~~~~~~ 393 (408) T protein:vir:10 341 AITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFSAI 393 (408) T ss_pred cEEEEEecceEEEEcccccchhhcCceEEEEEEeeccEEeccccEEEEEeecc Confidence 333333 22233322221 2 234689999999999999999998885554 No 135 >protein:vir:100884 Length: 389 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358764;genbank:gi:78000028;genbank:GeneID:3726155 Probab=99.15 E-value=3.1e-11 Score=78.18 Aligned_cols=257 Identities=14% Similarity=0.051 Sum_probs=146.5 Q ss_pred Cccc------hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCc--ccceeecCCCccc-CCCCCccc Q lcl|NC_011288. 1 MAFN------NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVA--PTVKDYKAAGRQT-SADAISDT 71 (273) Q Consensus 1 MA~~------~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~--~~~~~~~~~~~~~-~~~~~~~~ 71 (273) |+.. .++|+-|...+++.+++...+..+++.- ...+.+.++|.... .... ...+++.. ..+.+..+ T Consensus 109 ~~~~t~~~gg~~vP~~~~~~i~~~~~~~~~l~~~~~~~----~~~~~~~~~~~~~~~~~~~~-~~~E~~~~~~~~~~~~~ 183 (389) T protein:vir:10 109 TSKVTSTEAGVLIPEEIIYDPTAEVNSVVDLSTLVTKT----PVTTPKGTYPILKRATDRFS-SVAELAENPKLAEPEFN 183 (389) T ss_pred hcccccCCcceeehHHHHHHHHHHHHhhhhHHhhccee----eccCCeeEEEEEecCCCccc-cccccccccccccccce Confidence 3332 4689999999999999998887776431 12244566666532 2222 23333323 23456667 Q ss_pred eEEEEEeeeeecceEEchHHHHhhhHHHHH-HHHHHHHHHHHHHHHHHHHHHhhcccccccccCCCHHHHHHHHHHHHH- Q lcl|NC_011288. 72 GVDLLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADLLVDNGTALSGSAPTDADDAFDLIATALK- 149 (273) Q Consensus 72 ~~~~~id~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~~D~~i~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~~- 149 (273) .+++.+.+. +.-+.|++.-..++..++.+ +.+..+++++...|..++.-..... ..+.+....++++.++.. T Consensus 184 ~i~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~i~~g~~~~~-----~~~~~~~~~~d~l~~~~~~ 257 (389) T protein:vir:10 184 KVDWSVATY-RGAIPLSEEAIADSAVDLTALVGQSIKEKSVNTYNAMIAPVLQSFT-----AKKTTTDTLVDSLKHILNV 257 (389) T ss_pred eeeeeheee-EeeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhhhhcccc-----cccccccccHHHHHHHHHh Confidence 777777443 34456665434445566766 5567778899998988876554322 122233345666666543 Q ss_pred HHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcc-cccceeeeeeeeeEeceEEEeeCc--cccCCCcE-EEEEc-C Q lcl|NC_011288. 150 ELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTS-GDAAGLRAGTIGNLLGARIVESNN--LRDTDDEQ-FVAFH-P 224 (273) Q Consensus 150 ~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~-~~~~~l~~G~ig~~~G~~v~~s~~--l~~~~~~~-~~~~~-~ 224 (273) .++.. .+-.++++|..+..|.+......+--.. +.......|.-++|+|++|+.++. .+..++.. ++.|. + T Consensus 258 ~~~~~----~~a~~~~n~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~~gd~~ 333 (389) T protein:vir:10 258 DLDPA----YSRALVVTQSLFNTLDTLKDKNGRYLLHDASDSITDGTAKGTILGVPVYVVGDTLLGSLAGDQKAFVGDLK 333 (389) T ss_pred hhhhh----hCcEEEecHHHHHHHHHhhccCCCeeeecCcccccccccccccccceeEEecccccCCCCCceEEEEeecc Confidence 33322 2456899999999887633111100000 000111123346899999987543 34333333 33343 3 Q ss_pred ceeEEee-eeeeehhhcCCCceeeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 225 SAAAYVS-QIDTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 225 ~a~~~a~-~~~~~e~~~~~~~~~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) .++.... +...++..+ ...|.+.+.+-+++|+.+++|++++.++-+.+ T Consensus 334 ~~~~~~~~~~~~i~~~~-~~~~~~~~~~~~r~d~~~~~~~a~~~~~~~~~ 382 (389) T protein:vir:10 334 RGVLFTDRQQVTLAWED-SKIYGKYLGAAFRFGVQKADSKAGYFVTNTDV 382 (389) T ss_pred ccEEEEeecceEEEeec-cccccceEEEEEEeccEEecccceEEEEeecc Confidence 3333332 333343333 45567788899999999999999998875544 No 136 >protein:vir:81227 Length: 413 # NCBI annotation: gp6, major capsid protein # Family: family:all:585 # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456736;genbank:gi:157168379;hssp:P49861;interpro:IPR006444;uniprot:Q9MBJ9;genbank:GeneID:5580350 Probab=99.13 E-value=8.7e-11 Score=75.73 Aligned_cols=262 Identities=16% Similarity=0.049 Sum_probs=144.3 Q ss_pred Ccc------chhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccc----eeecCCCcccCCCC-Cc Q lcl|NC_011288. 1 MAF------NNFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTV----KDYKAAGRQTSADA-IS 69 (273) Q Consensus 1 MA~------~~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~----~~~~~~~~~~~~~~-~~ 69 (273) ++. ..++|+.|...+++.++..+.+..++..- ...|.++.+|+...... .....++......+ .. T Consensus 118 ~~~~~~~~~~~~vp~~~~~~ii~~~~~~~~l~~~~~~~----~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~ 193 (413) T protein:vir:81 118 STATLTDEFQGGYGTTWNRNIIYRRREKLVVADLMDNL----TMTNTTIKYLMEKANRVVEGGFKTVAEGGKKPYMRFAD 193 (413) T ss_pred hhcccccccccccchhhHHHHHHHHhhhhhHHhhccee----eccCCceeEEEeccccccccccceecCcccccccCccc Confidence 111 13579999999999999988887776531 23355677776543221 12233333332223 23 Q ss_pred cceEEEEEeeeeecceEEchHHHHhhhHHHHHH-HHHHHHHHHHHHHHHHHHHH---------hhcccccccccCCCHHH Q lcl|NC_011288. 70 DTGVDLLIDQEKSIDFLVDDIDRVQVAGSLEAY-TRAGATALATDTDKFIADLL---------VDNGTALSGSAPTDADD 139 (273) Q Consensus 70 ~~~~~~~id~~~~~~~~i~d~d~~~~~~~~~~~-~~~~~~ala~~~D~~i~~~~---------~~~~~~~~~~~~~t~~~ 139 (273) -+.+++.+.+. +.-+.|++. .......+.++ .+..+++++.++|..++.-- ....... ..+..+... T Consensus 194 f~~i~~~~~k~-~~~~~iS~e-ll~ds~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~~~~Gi~~~~~~~-~~~~~~~~~ 270 (413) T protein:vir:81 194 FDIVTESLSKI-AGLTKITDE-MIEDYDFLVSYINARLLEELAIEEERQLLLGDGTGNNLTGLLKRDGIQ-TLAVSNKDE 270 (413) T ss_pred ceeeEeeeeeE-EEeehhhHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCcccccccccccc-cccccccch Confidence 56666666543 333566664 33223346664 46678899999999877411 0000011 111223345 Q ss_pred HHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHHH----hhhhcccccceeeeeeeeeEeceEEEeeCccccCC Q lcl|NC_011288. 140 AFDLIATALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKL----TSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTD 215 (273) Q Consensus 140 ~~~~i~~a~~~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~----~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~~ 215 (273) .++++.++...+........+ .+|++|..+..|.+-...- ......+.......+.-++++|.+|+.|+.+|.+ T Consensus 271 ~~~~i~~~~~~~~~~~~~~~~-~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~~~l~G~pv~~s~~~~~~- 348 (413) T protein:vir:81 271 LADSIYKAMTNISLATPFQAD-ALVINPLDYQELRLAKDANGQYYGGGVFQGQYGSGGIMLDPAPWGLRTVQSQVVPVG- 348 (413) T ss_pred hHHHHHHHHHHhhhhccCCCc-EEEEcHHHHHHHHHhhccCCceeccccccccccccccccCceecceeeEEcCCCCcc- Confidence 677777777666544332223 3789999999885432110 0000000000001112357999999999999853 Q ss_pred CcEEEEE-cCceeEEee-eeeeehhhcCC-Cce---eeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 216 DEQFVAF-HPSAAAYVS-QIDTVEALRDQ-DSF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 216 ~~~~~~~-~~~a~~~a~-~~~~~e~~~~~-~~~---~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) .++.+ .+.+.-... +...++..+.. ..| ...+++.++|+..+.+|++++.++-+.+ T Consensus 349 --~~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~~ 410 (413) T protein:vir:81 349 --KPVVGAFRSAASVLRKGGVRIDSTNTNVDDFENNLITVRAEERVGLMVTFPEAIVQLDVAEV 410 (413) T ss_pred --cEEEEecccEEEEEEecceEEEEeccccchhhcCcEEEEEEEeeccEEecccceEEEEecCC Confidence 23334 333333222 22234333332 222 3478899999999999999998877666 No 137 >protein:vir:7409 Length: 408 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839926;genbank:gi:30089896;genbank:GeneID:1260683 Probab=99.12 E-value=5.2e-11 Score=76.97 Aligned_cols=258 Identities=12% Similarity=0.012 Sum_probs=148.6 Q ss_pred Cccc------hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcc-cceeecCCCcccC-CCCCccce Q lcl|NC_011288. 1 MAFN------NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAP-TVKDYKAAGRQTS-ADAISDTG 72 (273) Q Consensus 1 MA~~------~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~-~~~~~~~~~~~~~-~~~~~~~~ 72 (273) |... .++|+-|...+++.+++.+.+.++++.-. ......++.+++.... .......++.... .+.+..+. T Consensus 116 ~~~~~~~~gg~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~ 193 (408) T protein:vir:74 116 ETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVES--VSTSSGSRVYEKWTDVTPLKAMDEEDGKIPDLDNPRLTI 193 (408) T ss_pred hcccccCCCceeechhHhhHHHHHHhhhcchhhhcceee--ccCCcceEEEEeecCCcccccccccccccccccccceee Confidence 2221 36899999999999999988888775421 1112234556655432 3334444444433 24466677 Q ss_pred EEEEEeeeeecceEEchHHHHhhhHHHHH-HHHHHHHHHHHHHHHHHHHHHhhcccccccccCCCHHHHHHHHHHHH-HH Q lcl|NC_011288. 73 VDLLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADLLVDNGTALSGSAPTDADDAFDLIATAL-KE 150 (273) Q Consensus 73 ~~~~id~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~~D~~i~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~-~~ 150 (273) +++++.+. +.-+.|++.-..++..++.. +.++..++++.++|..++.- .+.....+ ....++++..+. .. T Consensus 194 i~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~il~G---~G~~~~~~----~~~~~~~i~~~~~~~ 265 (408) T protein:vir:74 194 IKYLIKRY-AGIITATNTLLKDTAENILAWLSSWIAKKVVVTRNQAIIAA---MGTVPKKP----TIANFDDVITMINTS 265 (408) T ss_pred EEeeeeeE-EeeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhc---cccccccc----ccccHHHHHHHHHHh Confidence 77777543 44456666544455556766 56788899999999987642 11111111 222355666554 45 Q ss_pred HhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCc--cccCCC--cEEEEEc-Cc Q lcl|NC_011288. 151 LTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNN--LRDTDD--EQFVAFH-PS 225 (273) Q Consensus 151 l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~--l~~~~~--~~~~~~~-~~ 225 (273) +..... .+-..+++|..+..|......- +.......+..|.-++|+|++|+.+.+ +|..+. ..++.|. +. T Consensus 266 l~~~~~--~~a~~v~n~~~~~~l~~lkd~~---G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~i~~gd~~~ 340 (408) T protein:vir:74 266 VDPAII--ATSSLLTNQSGLNKLALVKTAE---GKYLLEPDPTKPNSYLIKGKQVIVVADRWLPNSGSTVYPLYYGDMSQ 340 (408) T ss_pred hhhhhc--CCCEEEEcHHHHHHHHHhhcCC---CceEeccCcCCCCCceecceeeEEecCcccccccCCcceEEEEehhc Confidence 555443 2446889999999887532110 011111123345557899999987654 454332 2234443 33 Q ss_pred eeEEee-eeeeehhhcCC----CceeeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 226 AAAYVS-QIDTVEALRDQ----DSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 226 a~~~a~-~~~~~e~~~~~----~~~~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) ++.... +...++..+.. .+....+++.+++|+++++|++++.++-++. T Consensus 341 ~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~ 393 (408) T protein:vir:74 341 AITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFTAI 393 (408) T ss_pred cEEEEEecceEEEEeccccchhhcceeeEEEEEeeCcEEecccceEEEEeecc Confidence 443332 22233322221 2345678999999999999999998886555 No 138 >protein:vir:4197 Length: 314 # NCBI annotation: putative structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071822;genbank:gi:11863105;genbank:GeneID:1257607 Probab=99.12 E-value=1.1e-10 Score=75.09 Aligned_cols=265 Identities=14% Similarity=0.080 Sum_probs=158.4 Q ss_pred Ccc-----chhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCccc----ceeecCCCcccCCCCCccc Q lcl|NC_011288. 1 MAF-----NNFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPT----VKDYKAAGRQTSADAISDT 71 (273) Q Consensus 1 MA~-----~~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~----~~~~~~~~~~~~~~~~~~~ 71 (273) |-. -.+.|+.+. ++++.+++...+.+++++-.. ....+..||+++... ..+-..+.+..+..+++.+ T Consensus 14 it~~d~~gG~L~P~~~~-~~i~~l~e~s~i~~~a~vi~t---~~s~~~~i~~i~~g~~~~~~~~~~~~~~~~~~~~~tf~ 89 (314) T protein:vir:41 14 IDVPDLGKGILAVQRFG-EFVREVRENSAIIKDARVLNA---LKSYEVDISRISLGVELEPGRNTSGTKVAPTADEVTVS 89 (314) T ss_pred cccccCCCceeChHHHH-HHHHHHHhccchhhheeeecc---cCccceeecccccCcccccccccccCCccCCccccccc Confidence 322 247899875 688899999999888865311 123467788765321 1111122222334567777 Q ss_pred eEEEEEeeeeecceEEchHHHHhhhH--HHHH-HHHHHHHHHHHHHHHHHHH-----------------HHhhccccccc Q lcl|NC_011288. 72 GVDLLIDQEKSIDFLVDDIDRVQVAG--SLEA-YTRAGATALATDTDKFIAD-----------------LLVDNGTALSG 131 (273) Q Consensus 72 ~~~~~id~~~~~~~~i~d~d~~~~~~--~~~~-~~~~~~~ala~~~D~~i~~-----------------~~~~~~~~~~~ 131 (273) .+++...+.. ..+.|++.-..+... +++. +....++++++..+...++ ++..+...... T Consensus 90 ~~~l~~~kl~-~~v~is~e~L~D~a~~~~le~~i~~~~Ae~~g~~~~~~~~nGdg~~~s~~~~~~~p~G~l~~a~~~~~~ 168 (314) T protein:vir:41 90 TNTLEMKELV-TKVVLEDEALEDNIEQSAFEQTITSLLASGVTYDLECFFLHADSSLTTGRELYRINDGWMKLAGNQYTD 168 (314) T ss_pred ceeeeeEEEE-EeecccHHHHHhhhchhhHHHHHHHHHHHHHHHHHHHHhhccccCCcCcccchhcchhhhhhcccceee Confidence 8888876653 356777655444432 5766 5567778888877665542 12211222222 Q ss_pred ccCCCHHHHHHHHHHHHHHHhhcCCCcc-CCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCc Q lcl|NC_011288. 132 SAPTDADDAFDLIATALKELTKANVPNV-GRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNN 210 (273) Q Consensus 132 ~~~~t~~~~~~~i~~a~~~l~~~~vP~~-~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~ 210 (273) .++.+..+..+.|.++...|....--.. +-..+++++.+..+++. +.+.....+...+..|.-.++.|++|+.++. T Consensus 169 ~~~~~~~~~~~~~~~l~~sl~~~yr~~~~~~~~~m~~~t~~~~r~~---l~~~~~~l~~~~~~~~~~~~l~G~PV~~~~~ 245 (314) T protein:vir:41 169 AEPEDENWPLNLFDGMMDELDTRYLQLKPRMKFYVSNEIYNGYRKQ---LLVRETGLGDSALIGATGLQYDGIPIQYVPA 245 (314) T ss_pred cCccccccHHHHHHHHHHhcCchhhcCCCceEEEecHHHHHHHHHH---HhccCCcccchhhhCCCCceecceeeEeccc Confidence 2223344556667777777755332111 22456799988777542 2222333344456666667799999999998 Q ss_pred cccCC--CcEEEEEcCceeEEeeee-eeehhhcCCCceeeeEEeeeeeeeEEecCceEEE--EecCCC Q lcl|NC_011288. 211 LRDTD--DEQFVAFHPSAAAYVSQI-DTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVV--FNKTGS 273 (273) Q Consensus 211 l~~~~--~~~~~~~~~~a~~~a~~~-~~~e~~~~~~~~~~~v~~~~~~g~~v~~~~~~v~--~~~~~s 273 (273) +|... ...++.+++.-+.++... ..++.+|+...-...++..++.++.+.+++++|+ ++.+.+ T Consensus 246 ~~~~~~~~~~i~fgd~~nlv~~~~~~ir~~~~~~a~~~~~~~~~~~r~d~~~~~~~aa~~~~~~~~~~ 313 (314) T protein:vir:41 246 LDALGDDKARALLTVPTNLVYGFWRNIRIEPKRDAAMRRTEYIASLRADCNYEDENAAVAAVIDMSSG 313 (314) T ss_pred ccccCCCCceEEEechhheEEEeeceeEEeecccCcCCeEEEEEEEEeceEEEEcCcEEEEEeeccCC Confidence 87533 345567777776665533 3566777776667788999999999988876664 444444 No 139 >protein:vir:1268 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690760;genbank:gi:22855000;genbank:GeneID:955203 Probab=99.12 E-value=6.2e-11 Score=76.55 Aligned_cols=257 Identities=15% Similarity=0.028 Sum_probs=149.1 Q ss_pred Cccc------hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeecCCCcccC-CCCCccceE Q lcl|NC_011288. 1 MAFN------NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTS-ADAISDTGV 73 (273) Q Consensus 1 MA~~------~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~~~-~~~~~~~~~ 73 (273) |+.. .++|+.|...+++.+...+++..+++.-.. ....| ++.+|+....+......++.... .+....+.+ T Consensus 123 ~~~~~~~~gg~lvP~~~~~~ii~~~~~~~~l~~~~~~~~~-~~~~~-~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~~v 200 (397) T protein:vir:12 123 MSGINDEDGGILIPEDIGRQIHEFKRQFEPLEQYVTVEPV-TTRSG-TRLLEKNADMVPFSPVEELGNLPEIDQPRFTKV 200 (397) T ss_pred ccccccccCcccCchhHHHHHHHhhhhhhhHHhhcceeec-cCCce-eEEEEEecCCcceeeecccccccccccccceeE Confidence 4332 478999999999999999888777653211 11122 45566544433333444444333 234566777 Q ss_pred EEEEeeeeecceEEchHHHHhhhHHHHH-HHHHHHHHHHHHHHHHHHHHHhhcccccccccCCCHHHHHHHHHHHH-HHH Q lcl|NC_011288. 74 DLLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADLLVDNGTALSGSAPTDADDAFDLIATAL-KEL 151 (273) Q Consensus 74 ~~~id~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~~D~~i~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~-~~l 151 (273) ++...+. +.-+.|++.-..++..++.+ +.+..++++++++|..++.-... . .. .+...++++.++. ..+ T Consensus 201 ~~~~~k~-~~~~~is~e~l~ds~~~l~~~i~~~l~~~~~~~~d~~il~G~g~---~--~~---~g~~~~~~i~~~~~~~l 271 (397) T protein:vir:12 201 SYSIIDY-GGIMTLSNSMLNDSDQAIMTYVAKWFAKKSVVTRNNLILAAIAS---L--KK---VDIDGLDGIKKALNVTL 271 (397) T ss_pred Eeeheee-EeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHhcccc---c--cc---cccccHHHHHHHHhhcc Confidence 7776443 33456666444445556766 56778899999999887743211 1 11 1122356666654 344 Q ss_pred hhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCc-cccCC--CcEEEEEc-Ccee Q lcl|NC_011288. 152 TKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNN-LRDTD--DEQFVAFH-PSAA 227 (273) Q Consensus 152 ~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~-l~~~~--~~~~~~~~-~~a~ 227 (273) +... ..+-..+++|..+..|.+.... + +.......+.+|.-++++|++|+.+++ .|..+ ...++.|. +.+. T Consensus 272 ~~~~--~~~a~~~~n~~~~~~L~~lkd~--~-G~~l~~~~~~~g~~~~l~G~pv~~~~~~~~~~~~~~~~~~~gd~~~~~ 346 (397) T protein:vir:12 272 DPMV--APGSIVLTNQDGYDWLDTLKDG--T-GRYLLQPDPTNPTKKLLDGRPVVPFTNRVLKTQKGKAPLIIGNLKEAI 346 (397) T ss_pred chhh--hCCCEEEEcHHHHHHHHHhhcc--C-CceeecccccCCCCccccceeeEEecccccccCCCccEEEEEehhceE Confidence 4333 2345689999999988653211 1 111111224456667899999988765 33222 22244554 3344 Q ss_pred EEee-eeeeehhhcCCC----ceeeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 228 AYVS-QIDTVEALRDQD----SFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 228 ~~a~-~~~~~e~~~~~~----~~~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) .... +-..++..+... .-...+++.+++|.++++|+++++++-|+= T Consensus 347 ~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~t~~ 397 (397) T protein:vir:12 347 VLFDREQQSIASTDTGAGAFETNSTKVRGIEREDVRKWDEDAVVFGQITVE 397 (397) T ss_pred EEEeecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEEeeC Confidence 3333 222333332221 224689999999999999999998888777 No 140 >protein:vir:4092 Length: 390 # NCBI annotation: major capsid protein a # Family: family:all:635 # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510986;swissprot:trembl:q8w604;genbank:gi:17488508;uniprot:Q8W604;genbank:GeneID:1260361 Probab=99.09 E-value=1.3e-10 Score=74.73 Aligned_cols=258 Identities=12% Similarity=0.011 Sum_probs=145.7 Q ss_pred Cccc------hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeecCCCcccC-CCCCccceE Q lcl|NC_011288. 1 MAFN------NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTS-ADAISDTGV 73 (273) Q Consensus 1 MA~~------~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~~~-~~~~~~~~~ 73 (273) ++.. .++|+-+...+++.+++...+..+++. ....+....||+....+......++.... ..++.-+.+ T Consensus 84 ~~~~~~~~gg~lvP~~~~~~I~~~~~~~s~i~~~~~~----~~~~~~~~~i~~~~~~~~a~~~~E~~~~~~~~~~~f~~i 159 (390) T protein:vir:40 84 IAGNGFAGVTALLPPTVFERVFEDLTVEHPLLSKINF----VNTTATTEWIISVGDVATAWWGPLCAEIKEVLDNGFDKI 159 (390) T ss_pred HhccCcccCcccccHHHHHHHHHHHHhhhhhhhhcee----eecCCceeEEEEEcCCcceeeeccccccCccccccceee Confidence 1111 468999999999999998888777754 22345667788865554444444444332 345667777 Q ss_pred EEEEeeeeecceEEchHHHHhhhHHHHH-HHHHHHHHHHHHHHHHHHHH---------Hhhccccc------ccccCCCH Q lcl|NC_011288. 74 DLLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADL---------LVDNGTAL------SGSAPTDA 137 (273) Q Consensus 74 ~~~id~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~~D~~i~~~---------~~~~~~~~------~~~~~~t~ 137 (273) ++.+.+. +.-+.|+..-..++..+++. +.+..+++++.++|..++.= +...+... ......+. T Consensus 160 ~l~~~k~-~~~i~iS~ell~ds~~~l~~~i~~~la~~i~~~~~~a~l~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~t~ 238 (390) T protein:vir:40 160 QTGMYKL-SAYIPVCNAMLDLGPSWLDQYVRTILGEAMALGLEAGIVNGSGKDQPIGMMRDLNNVTAGEHPVKTATPLTD 238 (390) T ss_pred EeeeeeE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhhcccCCCccceeeeccccccccccccccccccch Confidence 7777544 33456666544555666766 66888899999999987741 11100000 00111222 Q ss_pred HHHHHHHHHHHHHHhhcCCC-ccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCccccCCC Q lcl|NC_011288. 138 DDAFDLIATALKELTKANVP-NVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDD 216 (273) Q Consensus 138 ~~~~~~i~~a~~~l~~~~vP-~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~~~ 216 (273) ....+.+......+....-+ ..+-..+++|..+..++.....+.+ . ++..+..+ ...|.+|+.++.+|... T Consensus 239 ~~~~~~~~~l~~~~~~~~~~~~~~a~~i~n~~t~~~~l~~~~~~~d--~--~G~~v~~~---~~~g~pvv~~~~~p~~~- 310 (390) T protein:vir:40 239 LTPATLATKVMLPLTDNGKKSVSDAILVINPADYWSKIYAATSYMT--P--QGVWVTGI---LPVPLEIVQSVAVPVGK- 310 (390) T ss_pred hhHHHHHHHHHHHhhcchhhhhcCceEEEcchhHHHHHHHHhhccC--C--CCcccccc---CCCceeEEEcCCCCCCc- Confidence 23333444444444333221 1234678998876555443222221 1 11122211 24699999999998542 Q ss_pred cEEEEEcCceeEEeee-eeeehhhcCC--CceeeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 217 EQFVAFHPSAAAYVSQ-IDTVEALRDQ--DSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 217 ~~~~~~~~~a~~~a~~-~~~~e~~~~~--~~~~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) ++.|..+-.....+ ...++..+.. .+-.+.+++.+++|+++.+|+++++|+-++- T Consensus 311 --i~~Gd~s~~~i~~~~~~~v~~~~~~~f~~~~~~~r~~~r~dg~v~~~~A~~~l~~~~~ 368 (390) T protein:vir:40 311 --AVAGRAKDYFMGIGSEQVIRTSTEYRLLDDETLYYAKQYANGRPKDNSSFLVFDITGL 368 (390) T ss_pred --EEEEeeceEEEEeecceEEEecchhhhhcCcEEEEEEEEeCCEEecccceEEEEeecc Confidence 44444333322222 2233332221 1223679999999999999999998865444 No 141 >protein:vir:3845 Length: 395 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050151;swissprot:trembl:q9t1f6;genbank:gi:9633043;uniprot:Q9T1F6;genbank:GeneID:1262163 Probab=99.08 E-value=1.6e-10 Score=74.32 Aligned_cols=258 Identities=12% Similarity=0.007 Sum_probs=143.1 Q ss_pred Ccc--------chhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCc-ccceeecCCCcccC-CCCCcc Q lcl|NC_011288. 1 MAF--------NNFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVA-PTVKDYKAAGRQTS-ADAISD 70 (273) Q Consensus 1 MA~--------~~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~-~~~~~~~~~~~~~~-~~~~~~ 70 (273) |+. ..++|+-|...+++.+++.+++..+++.- ......| ++.++.... .+......++.... .+.+.. T Consensus 105 ~~~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~-~~~~~~~-~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~f 182 (395) T protein:vir:38 105 VTSGTTGTGNAGLTIPEDIQLQIRTLTRSFTSLESLANVE-NVTTSHG-SRVYEKLADITPLKDLDDESALIGDNDDPEL 182 (395) T ss_pred HhhccCccCCCceecchhHhhHHHHHHHhhcchhhhccee-eccCCcc-eEEEEeeccCCccccccccccccccccccce Confidence 111 14689999999999999999888876431 1111123 334444332 22333344443332 223455 Q ss_pred ceEEEEEeeeeecceEEchHHHHhhhHHHHH-HHHHHHHHHHHHHHHHHHHHHhhcccccccccCCCHHHHHHHHHHHHH Q lcl|NC_011288. 71 TGVDLLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADLLVDNGTALSGSAPTDADDAFDLIATALK 149 (273) Q Consensus 71 ~~~~~~id~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~~D~~i~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~~ 149 (273) ..++++..+. +.-+.|++.=..++..++.+ +.++++++++..+|..++.-.... .. ......++++.++.. T Consensus 183 ~~v~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~il~g~g~~---~~----~~~~~~~~~i~~~~~ 254 (395) T protein:vir:38 183 TVVKYLIHRY-AGITTVTNTLLKDTVDNIIQWLVNWAAKKDVVTRNAKILEVMGKA---PK----KPTISQFDNIKDLEN 254 (395) T ss_pred eeEEeeeeee-EeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccc---cc----ccccccHHHHHHHHH Confidence 6666666443 23345555433344556665 667888999999998877532211 11 111223455555442 Q ss_pred -HHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCccccC--CC-cEEEEEcCc Q lcl|NC_011288. 150 -ELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDT--DD-EQFVAFHPS 225 (273) Q Consensus 150 -~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~--~~-~~~~~~~~~ 225 (273) .+.... ..+-.++++|..+..|.+-... + +.......+.+|.-++|+|++|+.+.+.+.+ .+ ..++.+.-+ T Consensus 255 ~~l~~~~--~~~a~~v~n~~~~~~L~~lkd~--~-G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~i~~gd~~ 329 (395) T protein:vir:38 255 NTLDPAI--ESTSSFITNQSGYNILSKVKDA--D-GRYLMQPDVTSPDKYLIDGKPVIRIADKWLPDVSGSHPLYFGDLK 329 (395) T ss_pred Hhhhhhh--cCCCEEEEcHHHHHHHHHhhcc--C-CceeeccCcCCCCcceeccceeEEecccccCcCCCcceEEEEecc Confidence 343332 2345689999999998653211 0 0111112244566678999999998764432 22 234455432 Q ss_pred -eeEEee-eeeeehhhcCC-C---ceeeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 226 -AAAYVS-QIDTVEALRDQ-D---SFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 226 -a~~~a~-~~~~~e~~~~~-~---~~~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) ++-... +...++..+.. . +-...+++..++|+++++|++++.++-+.+ T Consensus 330 ~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~ 383 (395) T protein:vir:38 330 QGITLFDRQQMQIDTTNVGAGSFEHDTTKLRFIDRFDVQLIDDGAFAAASFKTV 383 (395) T ss_pred ccEEEEEecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEeecc Confidence 333332 22234333332 2 224578899999999999999998876655 No 142 >protein:vir:78640 Length: 352 # NCBI annotation: phage capsid # Family: family:all:658 # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429943;genbank:gi:156603997;genbank:GeneID:5525386 Probab=99.08 E-value=2.6e-11 Score=78.58 Aligned_cols=251 Identities=12% Similarity=0.050 Sum_probs=141.4 Q ss_pred Ccc------chhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecC-cccceeecCCCcccCCCCCccceE Q lcl|NC_011288. 1 MAF------NNFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVV-APTVKDYKAAGRQTSADAISDTGV 73 (273) Q Consensus 1 MA~------~~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~-~~~~~~~~~~~~~~~~~~~~~~~~ 73 (273) |.. -.++|+-+..++++.++..+.+.++++.- ...| .++|... ..+......++...+..+++.+.+ T Consensus 83 l~~~~~~~gG~lIP~~~~~~Ii~~l~~~s~l~~~~~v~----~~~~--~~~p~~~~~~~~a~~v~E~~~~~~~~~~f~~v 156 (352) T protein:vir:78 83 LPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLT----NIKG--LEIPRVSYTLDDDDFITDVETAKELKLKGDTV 156 (352) T ss_pred hccCCCCCCceeccHhHHHHHHHHHHhhcchhhheeeE----ecCC--ceEEEEecCCCcccccccccccccccccceee Confidence 321 14789999999999998888877776531 1122 2345432 222233344444454555666777 Q ss_pred EEEEeeeeecceEEchHHHHhhhHHHHH-HHHHHHHHHHHHHHHHHHHHHhhcccc-----cccccCCCHHHHHHHHHHH Q lcl|NC_011288. 74 DLLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADLLVDNGTA-----LSGSAPTDADDAFDLIATA 147 (273) Q Consensus 74 ~~~id~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~~D~~i~~~~~~~~~~-----~~~~~~~t~~~~~~~i~~a 147 (273) ++...+. +.-+.|+..-..++..++.+ +.+..+++++.+-+..++..-...+.. .......++...++.|.++ T Consensus 157 ~~~~~k~-~~~i~is~ell~Ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~~~~~g~l~~~~~~~~t~~~~~d~i~~~ 235 (352) T protein:vir:78 157 KFTTNKF-KVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKSGLEHMSFYNGSVKEVEGANMYDAIINA 235 (352) T ss_pred eecceeE-EeechhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhhhhcCCCCcccccceeccccccccccchHHHHHHH Confidence 7777544 22356666544445567766 556777888766444444322111110 1112223444567888888 Q ss_pred HHHHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCccccCCCcEEEEEcCcee Q lcl|NC_011288. 148 LKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAA 227 (273) Q Consensus 148 ~~~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~~~~~~~~~~~~a~ 227 (273) ...|..... .+-..++++..+..|++... + ++..+..|.=.+++|.+|+.++..+. .+.|.-+-. T Consensus 236 ~~~l~~~~~--~~a~~~mn~~t~~~l~~~~~---~-----~~~~~~~~~~~~llG~PV~~~~~~~~-----~~~Gdf~~~ 300 (352) T protein:vir:78 236 LADLHEDYR--DNATIYMRYADYVKIISVLS---N-----GTTNFFDTPAEKVFGKPVVFTDAAVK-----PIVGDFNYF 300 (352) T ss_pred HhccChhhh--cCCEEEEehHHHHHHHHHHh---c-----cCCcccccCCccccccceEEecCCCc-----eeEeehhhh Confidence 877765543 34456888888877765321 1 11233445556799999999875532 233332111 Q ss_pred EEeeeeeeehhhcCCCceeeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 228 AYVSQIDTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 228 ~~a~~~~~~e~~~~~~~~~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) -.......++..++...--..+++.+++|+++++|++++.++.+.| T Consensus 301 ~~~~~~~~~~~~~~~~~g~~~f~~~~r~Dg~~~~~eA~~~l~~~a~ 346 (352) T protein:vir:78 301 GINYDGTTYDTDKDVKKGEYLFVLTAWYDQQRTLDSAFRIAKAKES 346 (352) T ss_pred hhhhhhheeeeeccccCCeeEEEEEeeeCceeechhheEEEEeecc Confidence 0000111223333333334678889999999999999998876666 No 143 >protein:vir:78387 Length: 349 # NCBI annotation: putative coat protein # Family: family:all:1522 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110837;genbank:gi:134288598;genbank:GeneID:5179650 Probab=99.05 E-value=9.5e-11 Score=75.53 Aligned_cols=263 Identities=11% Similarity=0.065 Sum_probs=156.2 Q ss_pred Cccc----hhhHH--HHHHHHHHHHHHhhccc--hhhcccccccc---cCCceEEEeecCcccc-ee--ecC--CCcccC Q lcl|NC_011288. 1 MAFN----NFIPE--LWSDMLLEEWTAQTVFA--NLVNREYEGTA---SKGNVVHIAGVVAPTV-KD--YKA--AGRQTS 64 (273) Q Consensus 1 MA~~----~~~pe--v~~~~~~~~~~~~lv~~--~~v~~~~~~~~---~~Gdtv~ip~~~~~~~-~~--~~~--~~~~~~ 64 (273) ||.+ ..+|| +|.+.+.+.-.+...|. +.+.++-+... ..|+.+++|.|+.+.. .+ +.. .....+ T Consensus 1 Ma~T~l~D~iipe~~vf~~Yv~~~~~e~~~l~qSGii~~d~~l~~~~~~gG~~~~iPf~~~L~g~~e~nv~~D~~~~~~t 80 (349) T protein:vir:78 1 MAITTIGDIVTGNIPVLASYMTEDPVEKTAFFDSGILTSTPYAAEIANGPSNIANLPFWKAIDTSIEPNYSNDVYQDIAT 80 (349) T ss_pred CCceEEeeeeccCHHHHHHHHHHhhHHhhhhhhccceeccHHHHHHhhcCCCEEEeeeeecCCCCcccccCCCCcccccc Confidence 9977 35677 79999988875554432 23434433221 3599999999988642 22 211 122445 Q ss_pred CCCCccceEEEEEeeeeecceEEchHHHHhhhHH-HHHHHHHHHHHHHHHHHHHHHHHHhhccc-------cc----ccc Q lcl|NC_011288. 65 ADAISDTGVDLLIDQEKSIDFLVDDIDRVQVAGS-LEAYTRAGATALATDTDKFIADLLVDNGT-------AL----SGS 132 (273) Q Consensus 65 ~~~~~~~~~~~~id~~~~~~~~i~d~d~~~~~~~-~~~~~~~~~~ala~~~D~~i~~~~~~~~~-------~~----~~~ 132 (273) ++.++..+....+ ..+..++...|+....+-.+ |+.+.++.+.--.+.....+++.+...-. .. ..+ T Consensus 81 ~~kitt~~~~a~~-~~r~kaw~~~Dla~~lsG~dpm~~Ia~~va~yW~r~~q~~Lia~L~Gvf~~~~~a~~~~~~~~~~t 159 (349) T protein:vir:78 81 PRAIQTGEMMARV-AYLNEGFGQADLTVELTSQNPLQSVASRLDNFWQRQAQRRLIATALGLYNDNVSATDAYHEQNDMV 159 (349) T ss_pred cccccccceeeee-eeeccccchhHHHHHhhCchHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcccccccchhhhcccce Confidence 6666666665554 66778888888766554444 67788887777777766777776653211 00 000 Q ss_pred --cCCCHHHHHHHHHHHHHHHhhcCC---CccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEe Q lcl|NC_011288. 133 --APTDADDAFDLIATALKELTKANV---PNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVE 207 (273) Q Consensus 133 --~~~t~~~~~~~i~~a~~~l~~~~v---P~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~ 207 (273) ...+.....+.+..|...|.+.-. ...-..++++|..+..|.+.. .+.. .... -++..|+.+.|..|+. T Consensus 160 ~d~s~~a~~~~~~~~dA~~~lgda~~Gd~~~~lt~i~mHS~v~~~L~~~~-li~~---i~~s--~~~~~i~ty~G~~Viv 233 (349) T protein:vir:78 160 VDVSATLGFDAGAFIDATQTMGDALMGNGGEVLGAIAMHSFVYAQARKAQ-LIDF---IRDA--ENNTMFATYQGYRVIV 233 (349) T ss_pred eeeccccCCChhhhhhhHHHHHHHhccccccceeEEEEchHHHHHHHhhh-hhhh---ccCc--ccCcccceecCeEEEE Confidence 001111234567777777766521 122256899999999997643 2321 1111 1244689999999999 Q ss_pred eCccccCC-----CcEEEEEcCceeEEeeee--eeehhhcCCCce----eeeEEeeeeeeeEEecCceEEEEecC----- Q lcl|NC_011288. 208 SNNLRDTD-----DEQFVAFHPSAAAYVSQI--DTVEALRDQDSF----SDRIRALHVYGGKVVRPTGVVVFNKT----- 271 (273) Q Consensus 208 s~~l~~~~-----~~~~~~~~~~a~~~a~~~--~~~e~~~~~~~~----~~~v~~~~~~g~~v~~~~~~v~~~~~----- 271 (273) ++.+|..+ .++++.+-++|+++...- ..+|..|++... .|.+..+.+|. +-|-|+--..+. T Consensus 234 DD~~Pv~~~g~~~~yttylfg~GAi~~~~~~~~~~~et~rd~~~g~~~G~d~l~~R~~~~---~hp~G~s~~~a~v~~~~ 310 (349) T protein:vir:78 234 DDSMTVVGQGAQRKFISIIFGQGAIGYGEGNPVMPLEYEREASRANGGGVETLWTRKTWL---LHPFGYRFTSAVITGNG 310 (349) T ss_pred eCCCccccCCCCceEEEEEeecceEEEccCCCccceeeecccccCCcceeEEEEEeeEEE---eeeeeeeeccccccCCc Confidence 99999643 345677889999987643 346777877542 36666655543 233333332221 Q ss_pred -------CC Q lcl|NC_011288. 272 -------GS 273 (273) Q Consensus 272 -------~s 273 (273) -| T Consensus 311 ~~~~~~sPt 319 (349) T protein:vir:78 311 TETIARSAS 319 (349) T ss_pred cccccCCCC Confidence 11 No 144 >protein:vir:102119 Length: 404 # NCBI annotation: phage major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699941;genbank:gi:110804052;genbank:GeneID:4206662 Probab=99.04 E-value=9.8e-11 Score=75.46 Aligned_cols=265 Identities=12% Similarity=0.006 Sum_probs=145.2 Q ss_pred Cccc------hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeecCCCcccCCC--CCccce Q lcl|NC_011288. 1 MAFN------NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSAD--AISDTG 72 (273) Q Consensus 1 MA~~------~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~~~~~--~~~~~~ 72 (273) |... .++|+-|...+++.+++.+++..++.... .....-++.+|+....+......++.....+ .+..+. T Consensus 110 ~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~l~~~~~--~~~~~g~~~~~~~~~~~~~~~v~e~~~~~~~~~~~~f~~ 187 (404) T protein:vir:10 110 ISENIDEDGGYAVPEDIQTKINTRLKDTTDLYNMVDYEP--VFTRSGSRTYEKRSKQKPMKPLSENQQIPTNGDNGKLER 187 (404) T ss_pred hccccCCCCceeechhHHHHHHHHHhhhhhHhhhhceee--ccCCccceEEEEecCCcceeeccccccccccccccceee Confidence 3221 36799999999999999988877764421 1122235556664333333333333332222 344556 Q ss_pred EEEEEeeeeecceEEchHHHHhhhHHHHH-HHHHHHHHHHHHHHHHHHHHHhhccc-------ccccccCCCHHHHHHHH Q lcl|NC_011288. 73 VDLLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADLLVDNGT-------ALSGSAPTDADDAFDLI 144 (273) Q Consensus 73 ~~~~id~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~~D~~i~~~~~~~~~-------~~~~~~~~t~~~~~~~i 144 (273) ++++..+. +.-+.|++.=..++..++.+ +.+.++++++.++|..++.--..... ....+...+....++++ T Consensus 188 i~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~il~G~g~~~~~~gi~~~~~~~~~~~~~~~~~~~~ 266 (404) T protein:vir:10 188 FNFKLKDL-ADFMSIPNDLLKFADKSLEDWIINWFVDKVRITRNAEILYGAGGDEHATGIMTANKFKKITLPKSPALKDF 266 (404) T ss_pred eEeeheee-EeeehhhHHHHhhcHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCcccceeeccccceeeccccccHHHH Confidence 66666443 33456666433344456665 66788899999999987732111100 00011122333456677 Q ss_pred HHHHH-HHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeC-cccc-CCC-cEEE Q lcl|NC_011288. 145 ATALK-ELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESN-NLRD-TDD-EQFV 220 (273) Q Consensus 145 ~~a~~-~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~-~l~~-~~~-~~~~ 220 (273) ..+.. .+....- .+-.++++|..+..|.+-.....+ . .....+..|.-+++.|.+|+..+ .++. +.+ ..++ T Consensus 267 ~~~~~~~l~~~~~--~~~~~v~n~~~~~~L~~lkd~~G~--~-l~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~~ 341 (404) T protein:vir:10 267 KKCKNVELLNVFK--ATSSWIVNQDGFNYLDSLEDKTGR--P-YLQPDPKDPTQYRFLGLPVIELPNDLLLSTESAIPVL 341 (404) T ss_pred HHHHHhhhhcccc--CCCEEEEcHHHHHHHHHhhccCCc--e-eeccCcCCCCCccccceeeEEecccccCCCCCccEEE Confidence 66554 3433322 234679999999988663211111 1 11112345666789999998643 3433 222 2344 Q ss_pred EEc-CceeEEee-eeeeehhhcCC----CceeeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 221 AFH-PSAAAYVS-QIDTVEALRDQ----DSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 221 ~~~-~~a~~~a~-~~~~~e~~~~~----~~~~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) .|+ +.++.... ....++..+.. ..-...+++.+++|..+.+|+++++++-+.+ T Consensus 342 ~gd~s~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~a 400 (404) T protein:vir:10 342 LGDTKEAYKYVSDGAYELATTNIGAGAFETNTTKARIIMRIDGNVKDSEALLIAEIPVE 400 (404) T ss_pred EEeccccEEEEEecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEeecc Confidence 554 33443332 22233322221 1234579999999999999999998877776 No 145 >protein:vir:9875 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795637;genbank:gi:28876404;genbank:GeneID:1257935 Probab=99.03 E-value=1.2e-11 Score=80.43 Aligned_cols=248 Identities=12% Similarity=0.078 Sum_probs=134.2 Q ss_pred Cccc------hh-------------hHHHHHHHHHHHHHHhhccchhhcccccccccCCceE-EEeecCcccceeecCCC Q lcl|NC_011288. 1 MAFN------NF-------------IPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVV-HIAGVVAPTVKDYKAAG 60 (273) Q Consensus 1 MA~~------~~-------------~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv-~ip~~~~~~~~~~~~~~ 60 (273) |-.+ ++ ..+.|++-+.+ |.+.| +.. | .. ....|.+| ++|.|..........+| T Consensus 1 ~~~~~~~~e~nlt~~~dl~~~~siDf~~~f~~~i~~-L~~~L---Gv~-r-~~-pla~GstIkt~k~~~y~gda~dVaEG 73 (296) T protein:vir:98 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISK-LLEML---GVT-R-KI-SVSEGMTLKTYAGYDVTLAEGNVPEG 73 (296) T ss_pred CCCccccCcCCCcchhhhhhhhhhhhHHHHhhhHHH-HHHHh---hhc-c-cc-cccCCCEEeeccceeeeeccccccCC Confidence 2221 11 12334443332 22332 111 1 11 23459999 55678877766666778 Q ss_pred cccCCCCCccc---eEEEEEeeeeecceEEchHHHHhh-hHH-HHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccCC Q lcl|NC_011288. 61 RQTSADAISDT---GVDLLIDQEKSIDFLVDDIDRVQV-AGS-LEAYTRAGATALATDTDKFIADLLVDNGTALSGSAPT 135 (273) Q Consensus 61 ~~~~~~~~~~~---~~~~~id~~~~~~~~i~d~d~~~~-~~~-~~~~~~~~~~ala~~~D~~i~~~~~~~~~~~~~~~~~ 135 (273) ..+..+.++.+ ..++++++++.. ++|+..... .++ +.+.-+|+.++|++++|++++..+..+..... . T Consensus 74 e~Iplskvt~~~~~t~t~~ikK~rK~---tTdEAIqlsGyg~aVgetd~qL~~~iq~kId~d~~t~LktaT~t~~----~ 146 (296) T protein:vir:98 74 EVIPLSKVERKIHSEKKIELKKYRKA---TTGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGTQD----A 146 (296) T ss_pred cccchhhheeeecceEEEEeeccccc---cCHHHHHhhcCCchhHHHHHHHHHHHHHhhhHHHHHHHhcccceee----e Confidence 88888888865 478888775333 466543222 233 67888999999999999999999976543322 1 Q ss_pred CHHH----HHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeee-eeEeceEEEeeCc Q lcl|NC_011288. 136 DADD----AFDLIATALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTI-GNLLGARIVESNN 210 (273) Q Consensus 136 t~~~----~~~~i~~a~~~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~i-g~~~G~~v~~s~~ 210 (273) +++. ....+.++...|++.+ ....+++++|...+.+|++.. +...... .+.. -++.|..|++|+. T Consensus 147 t~~~lQ~Ala~~~~~l~~~feded--~~~~V~FVnP~D~a~ylg~a~-it~qt~f-------G~tyl~nfLG~~II~S~k 216 (296) T protein:vir:98 147 LGAGLQGALASAWGKLQVLFEDYG--SERAIVFANSLDVAEYIAKAG-ITTQTAF-------GLTYLVDFTGTVIISTND 216 (296) T ss_pred chhhHHHHHHHHhhhhhhhccccC--CCceEEEEehHHHHHHhcCCc-cchhhee-------chhhhhhccccEEEEcCc Confidence 2222 1234455666776664 235789999999999998764 3221111 2222 2488999999999 Q ss_pred cccCCCcEE------EEEcCce---eE--Eeeeeee---ehhhcCCCceeeeEEeeeeeeeEEe---cCceEEEEecCCC Q lcl|NC_011288. 211 LRDTDDEQF------VAFHPSA---AA--YVSQIDT---VEALRDQDSFSDRIRALHVYGGKVV---RPTGVVVFNKTGS 273 (273) Q Consensus 211 l~~~~~~~~------~~~~~~a---~~--~a~~~~~---~e~~~~~~~~~~~v~~~~~~g~~v~---~~~~~v~~~~~~s 273 (273) +|.+.-..+ +++.+.. ++ |..-.|+ +-...+.....--+.- ..+.+-++ +++++|+.+-+++ T Consensus 217 V~~G~~~~T~~~Ni~~ay~~~~~~~l~~~f~~~~d~tglIGv~h~~~~~~~t~eT-~~~~~~~lfpE~~dgiv~~tI~~~ 295 (296) T protein:vir:98 217 VTKGEIWATVPENIIFAYINPNNSELAKEFNLYGDPTGYIGMNHFQENTTLTIQT-LLVSGMLMYPERIDGIVKVTLTPG 295 (296) T ss_pred CCCceEEEeeecceEEEeecccccchhhhhccccccccceEEEeccccceeeehh-HhHhHHHhcccccceEEEEEecCC Confidence 986543222 2222211 11 1000011 0011111111111111 22222223 5779998777777 No 146 >protein:vir:962 Length: 397 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076616;genbank:gi:13095724;genbank:GeneID:920264 Probab=99.01 E-value=1.4e-10 Score=74.63 Aligned_cols=252 Identities=10% Similarity=0.010 Sum_probs=134.3 Q ss_pred CccchhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeec--CcccceeecCCCcccCCCCCccceEEEEEe Q lcl|NC_011288. 1 MAFNNFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGV--VAPTVKDYKAAGRQTSADAISDTGVDLLID 78 (273) Q Consensus 1 MA~~~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~id 78 (273) +.....+|+-+...+.+. .....+...++.- ...+....+|.. ..........++......++..+.+++++. T Consensus 138 ~~~~~~vp~~~~~~i~~~-~~~~~l~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~i~~~~~ 212 (397) T protein:vir:96 138 VEGGALIPQELLQPQLEP-KDIVDLSKYVRSV----PVNSASGKFPVISKSGSKMATVQQLEKNPQLANPKMVEIDYSVA 212 (397) T ss_pred cccccchhHHHHHHHHHh-hhhhhHHHhhhhc----cccccceeEEEEeccCCccccccccccccccccccccceeecHh Confidence 333346788788877764 3333333333221 112234444443 333333222222222234566677777774 Q ss_pred eeeecceEEchHHHHhhhHHHHH-HHHHHHHHHHHHHHHHHHHHHhhcccccccccCCCHHHHHHHHHHHHHHHhhcCCC Q lcl|NC_011288. 79 QEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADLLVDNGTALSGSAPTDADDAFDLIATALKELTKANVP 157 (273) Q Consensus 79 ~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~~D~~i~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~~~l~~~~vP 157 (273) +. +.-+.++..-..++..++.. +.+..+++++...|..++.-.... ..++...+++|.++....... T Consensus 213 ~~-~~~~~~s~ell~ds~~~l~~~i~~~l~~~~~~~~~~~i~~g~g~~--------~~~~~~~~d~~~~~~~~~~~~--- 280 (397) T protein:vir:96 213 TR-RGYIPISQEMIDDASYDVTGLIADEIQDQSLNTKNADIAAVLKTA--------TAKSVVGVDGLKDLINKEIKK--- 280 (397) T ss_pred Hh-hcchhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccc--------ccccccchHHHHHHHHHhhhh--- Confidence 43 33345554333344556766 456677888888888877433211 111223356666555433222 Q ss_pred ccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCcccc--CCCc-EEEEEcCc-eeEEeee- Q lcl|NC_011288. 158 NVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRD--TDDE-QFVAFHPS-AAAYVSQ- 232 (273) Q Consensus 158 ~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~--~~~~-~~~~~~~~-a~~~a~~- 232 (273) .-+-..|++|..+..|.+..... +.......+.+|.-++|.|.+|+.++.... ..+. .++.|.-+ +.....+ T Consensus 281 ~~~a~~v~n~~~~~~l~~lkd~~---G~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~ 357 (397) T protein:vir:96 281 VYDVKLFISASMYSELDKLKDKN---GRYLLQDSITAASGKQLLGKEVVVLDDDVIGKSVGNVVGFIGDAKAFASFFDRK 357 (397) T ss_pred hcCcEEEEcHHHHHHHHHhhccC---CCeEeccCccCCCcccccccceEEecccccCCCCCceEEEEeehhcceEeEeec Confidence 22446899999999986632111 011111234455567899999998765322 2222 23444322 3322222 Q ss_pred eeeehhhcCCCceeeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 233 IDTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 233 ~~~~e~~~~~~~~~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) ...++.. +...+.+.+++-+++|+++.+|++++.++-+.. T Consensus 358 ~~~~~~~-~~~~~~~~~~~~~r~d~~~~~~~a~~~~~~~~a 397 (397) T protein:vir:96 358 QVSVSWV-DNNIYGQLLAGIIRYDVKATDKKAGFYVTFTIG 397 (397) T ss_pred ceEEEEe-cccccceeEEEEEEEccEEecccceEEEEeecC Confidence 2223222 245567788999999999999999999986666 No 147 >protein:vir:94989 Length: 349 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224029;genbank:gi:62327316;genbank:GeneID:5176817 Probab=98.98 E-value=3e-10 Score=72.76 Aligned_cols=263 Identities=11% Similarity=0.069 Sum_probs=155.2 Q ss_pred Cccc----hhhHH--HHHHHHHHHHHHhhccc--hhhcccccccc---cCCceEEEeecCccc-cee--ecCCC--cccC Q lcl|NC_011288. 1 MAFN----NFIPE--LWSDMLLEEWTAQTVFA--NLVNREYEGTA---SKGNVVHIAGVVAPT-VKD--YKAAG--RQTS 64 (273) Q Consensus 1 MA~~----~~~pe--v~~~~~~~~~~~~lv~~--~~v~~~~~~~~---~~Gdtv~ip~~~~~~-~~~--~~~~~--~~~~ 64 (273) ||.+ ..+|| +|.+.+.+.-.+..-|. +.+.+|-+... ..|+.+++|.|+.+. ..+ +.... ..++ T Consensus 1 Ma~T~l~D~iipe~~vf~~Yv~~~~~e~~~l~qSGii~~d~~l~~~~~~gG~~~~iPf~~~l~g~~e~n~~~dt~~~~~t 80 (349) T protein:vir:94 1 MAITTIGNIVTGNIPVLASYMTEDPVEKTAFFNSGILTPTPYAAEIARGPSNIANLPFWKAIDTSIEPNYSNDVYQDIAT 80 (349) T ss_pred CCceEEeeeeccChHHHHHHHHHhHHHhhhhhhccceeccHHHHHHHhcCCCEEEeeeeecCCCCcccccCCCCcccccc Confidence 9977 35677 79999988875554432 34444433221 359999999998764 222 22211 1345 Q ss_pred CCCCccceEEEEEeeeeecceEEchHHHHhhhHH-HHHHHHHHHHHHHHHHHHHHHHHHhhcccc----c---------c Q lcl|NC_011288. 65 ADAISDTGVDLLIDQEKSIDFLVDDIDRVQVAGS-LEAYTRAGATALATDTDKFIADLLVDNGTA----L---------S 130 (273) Q Consensus 65 ~~~~~~~~~~~~id~~~~~~~~i~d~d~~~~~~~-~~~~~~~~~~ala~~~D~~i~~~~~~~~~~----~---------~ 130 (273) +..++..+....+ .++..++...|+-...+-.+ |+.+.++.+.--.+.....+++.+...-.. . . T Consensus 81 ~~kit~~~~~a~~-~~r~kaw~~~Dla~~lsG~dpm~~Ia~~va~yW~r~~q~~Lia~L~Gvf~~~~~~~~~~~~~~~~~ 159 (349) T protein:vir:94 81 PRAIQTGEMMARV-AYLNEGFGQADLTVELTSQNPLQSVASRLDNFWQRQAQRRLIATALGLYNDNVSATDAYHEQNDMV 159 (349) T ss_pred cccccccceeeee-eeeccccchhHHHHHhhCchHHHHHHHHHHHHHhhHHHHHHHHHHHhhhcccccccccccccCcee Confidence 5666655554444 56778888888765554444 777888887777777777777766532110 0 0 Q ss_pred cccCCCHHHHHHHHHHHHHHHhhcCC---CccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEe Q lcl|NC_011288. 131 GSAPTDADDAFDLIATALKELTKANV---PNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVE 207 (273) Q Consensus 131 ~~~~~t~~~~~~~i~~a~~~l~~~~v---P~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~ 207 (273) .....+.....+.|..|...|.+.-. ...-..++++|..+..|.+.. .+.. ....+ ++..|+.+.|..|+. T Consensus 160 ~d~~~~a~~~~~~~~~A~~~~Gdaa~Gd~~~~lt~i~mHS~v~~~L~~~~-li~~---i~~s~--~~~~i~ty~G~~Viv 233 (349) T protein:vir:94 160 VDVSATSGFDAGAFIDATQTMGDALMGNGGEVLGAIAMHSFVYAQARKAQ-LIDF---IRDAE--NNTMFATYQGYRVIV 233 (349) T ss_pred EEecccCCCChhhHHHHHHHHHHHhccccccceeEEEEchHHHHHHHhcc-hhhh---ccCcc--cCcccceecCcEEEE Confidence 00011111234466667777766522 112246899999999997753 2221 11111 234589999999999 Q ss_pred eCccccCC-----CcEEEEEcCceeEEeeeee--eehhhcCCCce----eeeEEeeeeeeeEEecCceEEEEecCC---- Q lcl|NC_011288. 208 SNNLRDTD-----DEQFVAFHPSAAAYVSQID--TVEALRDQDSF----SDRIRALHVYGGKVVRPTGVVVFNKTG---- 272 (273) Q Consensus 208 s~~l~~~~-----~~~~~~~~~~a~~~a~~~~--~~e~~~~~~~~----~~~v~~~~~~g~~v~~~~~~v~~~~~~---- 272 (273) ++.+|... .++++.+-++|+++..+-. .+|..|++... .|.+..+.+| ++-|-|+--..+.. T Consensus 234 DD~~Pv~~~g~~~~yttylfg~GAi~~~~~~~~~~~E~~rd~~~g~~~G~d~L~~R~~~---~~hp~G~s~~~a~v~~~~ 310 (349) T protein:vir:94 234 DDSMTVVGQDTSRKFISIIFGQGAIGYGEGNPEMPLEYEREASRANGGGVETLWTRKTW---LLHPFGYSFTSAVITGNG 310 (349) T ss_pred eCCCccccCCCCceEEEEEeecceEEeecCCCCcceeeecccccCCcceeEEEEEeeEE---EeeeeeeeecccccCCCc Confidence 99999632 2456777899999987653 46777777543 3566665544 33344443332211 Q ss_pred --------C Q lcl|NC_011288. 273 --------S 273 (273) Q Consensus 273 --------s 273 (273) | T Consensus 311 ~~~~~~sPt 319 (349) T protein:vir:94 311 TETIARSAS 319 (349) T ss_pred cccccCCCC Confidence 1 No 148 >protein:vir:93881 Length: 387 # NCBI annotation: ORF011 # Family: family:all:658 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239938;genbank:gi:66395599;genbank:GeneID:5130947 Probab=98.97 E-value=1.4e-10 Score=74.57 Aligned_cols=251 Identities=12% Similarity=0.068 Sum_probs=138.2 Q ss_pred Cccc------hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecC-cccceeecCCCcccCCCCCccceE Q lcl|NC_011288. 1 MAFN------NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVV-APTVKDYKAAGRQTSADAISDTGV 73 (273) Q Consensus 1 MA~~------~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~-~~~~~~~~~~~~~~~~~~~~~~~~ 73 (273) |... .++|+-+..++++.++....+..+++.-- ..| .++|... .........++...+..++..+.+ T Consensus 118 l~~~t~s~gG~~IP~~~~~~Ii~~~~~~~~l~~~~~v~~----~~~--~~~p~~~~~~~~a~~v~E~~~~~~~~~~f~~v 191 (387) T protein:vir:93 118 LPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTN----IKG--LEIPRVSYTLDDDDFITDVETAKELKLKGDTV 191 (387) T ss_pred hccCcCCCCceeechhHHHHHHHHHHhhchhhhheeeee----cCC--ceEEEEeecCCccccccCccccccccccccee Confidence 2221 37899999999999988877766664311 112 3355432 112222334444444455666777 Q ss_pred EEEEeeeeecceEEchHHHHhhhHHHHH-HHHHHHHHHHHHHHHHHHHHHhhcccc-----cccccCCCHHHHHHHHHHH Q lcl|NC_011288. 74 DLLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADLLVDNGTA-----LSGSAPTDADDAFDLIATA 147 (273) Q Consensus 74 ~~~id~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~~D~~i~~~~~~~~~~-----~~~~~~~t~~~~~~~i~~a 147 (273) ++...+. +.-+.|+..-..++..+++. +.+..+++++.+.+..++..-...+.. .......+....+++|.++ T Consensus 192 ~~~~~k~-~~~~~iS~ell~Ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g~p~g~l~~~~~~~v~~~~~~d~i~~~ 270 (387) T protein:vir:93 192 KFTTNKF-KVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKSGLDHMSFYNGSVKEVEGADMYDAIINA 270 (387) T ss_pred eeeheee-eeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCccccceeeeccccccccccchHHHHHHH Confidence 7766443 23356665444455567776 556777888887666665332221111 1112233455568888888 Q ss_pred HHHHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCccccCCCcEEEEEcCcee Q lcl|NC_011288. 148 LKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAA 227 (273) Q Consensus 148 ~~~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~~~~~~~~~~~~a~ 227 (273) ...|+..... +-..++++..+..+++- +.+ ++ ..+..|.=.++.|.+|+.++..+. .++|.-+.. T Consensus 271 ~~~l~~~~~~--~a~~~mn~~t~~~~~~~---~~d----~~-~~~~~~~~~~llG~PV~~~~~~~~-----~~~GDf~~~ 335 (387) T protein:vir:93 271 LADLHEDYRD--NATIYMRYADYVKIISV---LSN----GT-TNFFDTPAEKVFGKPVVFTDAAVK-----PIVGDFNYF 335 (387) T ss_pred HhccChhhhc--CCEEEEechHHHHHHHH---Hhc----CC-CcccccCCccccccceEEecCCCc-----eeeeehhhh Confidence 8777766542 22457787766665442 221 11 123334446899999999875432 233322111 Q ss_pred EEeeeeeeehhhcCCCceeeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 228 AYVSQIDTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 228 ~~a~~~~~~e~~~~~~~~~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) ........++..+....--..+++..++|+++++|++++.++.+++ T Consensus 336 ~~~~~~~~~~~~~~~~~~~~~~~~~~r~d~~v~~~eA~~~l~~k~~ 381 (387) T protein:vir:93 336 GINYDGTTYDTDKDVKKGEYLFVLTAWYDQQRTLDSAFRIAKAKEN 381 (387) T ss_pred heehhhheeeecccccCCceeEEEEeeeCceeechhheEEEEeecC Confidence 0000111122222222223457788999999999999987766444 No 149 >protein:vir:9361 Length: 402 # NCBI annotation: SLT orf 37-like protein # Family: family:all:658 # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803339;genbank:gi:29028650;genbank:GeneID:1258088 Probab=98.97 E-value=8.7e-11 Score=75.73 Aligned_cols=250 Identities=12% Similarity=0.054 Sum_probs=141.0 Q ss_pred Cccc------hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCc-ccceeecCCCcccCCCCCccceE Q lcl|NC_011288. 1 MAFN------NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVA-PTVKDYKAAGRQTSADAISDTGV 73 (273) Q Consensus 1 MA~~------~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~-~~~~~~~~~~~~~~~~~~~~~~~ 73 (273) |... .++|+-++..+++.+.....+..+++.- ...| .++|.... .+......++......+++.+.+ T Consensus 133 ~~~~t~~~GG~lIP~~~~~~Ii~~~~~~~~l~~~~~v~----~~~~--~~~p~~~~~~~~a~~v~Eg~~~~~~~~~f~~i 206 (402) T protein:vir:93 133 LPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLT----NIKG--LEIPRVSYTLDDDDFITDVETAKELKAKGDTV 206 (402) T ss_pred hccCCCcCCccccchhHHHHHHHhHHhhhhhhhhceee----ecCC--ceeeeeeccCCcccccccccccccccccccee Confidence 2221 4789999999999998888777766531 1112 34555422 12223344444444455666677 Q ss_pred EEEEeeeeecceEEchHHHHhhhHHHHH-HHHHHHHHHHHHHHHHHHHHHhhcccc-----cccccCCCHHHHHHHHHHH Q lcl|NC_011288. 74 DLLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADLLVDNGTA-----LSGSAPTDADDAFDLIATA 147 (273) Q Consensus 74 ~~~id~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~~D~~i~~~~~~~~~~-----~~~~~~~t~~~~~~~i~~a 147 (273) ++...+. +.-+.|+..-..++..++.+ +.+..+++++.+.+..++..-...+.. ..+....++...+++|.++ T Consensus 207 ~~~~~k~-~~~i~iS~ell~Ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g~p~g~~~~~~~~~~~~~~~~d~l~~~ 285 (402) T protein:vir:93 207 KFTTNKF-KVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKSGLEHMSFYNGSVKEVEGADMYDAIINA 285 (402) T ss_pred eecceee-eeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCccccceeeeccccccccccchHHHHHHH Confidence 7666443 23345665433445566766 557778888887666655332211111 1112233445668888888 Q ss_pred HHHHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCccccCCCcEEEEEcC-ce Q lcl|NC_011288. 148 LKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHP-SA 226 (273) Q Consensus 148 ~~~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~~~~~~~~~~~-~a 226 (273) ...|+..... +-..++++..+..|++- +.+ . +..+..|.=.++.|.+|+.++..+. .+.|.- .+ T Consensus 286 ~~~l~~~y~~--na~~imn~~t~~~~~~~---~~d---~--~~~~~~~~~~~llG~PV~~t~~~~~-----i~~GDf~~~ 350 (402) T protein:vir:93 286 LADLHEDYRD--NATIYMRYADYVKIISV---LSN---G--TTNFFDTPAEKVFGKPVVFTDAAVK-----PIVGDFNYF 350 (402) T ss_pred HhccChhhhc--CCEEEEechHHHHHHHH---Hhc---C--CCcccccCCccccccceEEecCCCc-----eeeechhhh Confidence 8777665432 33457777776666542 211 1 1223345556799999999876542 223321 11 Q ss_pred eEEeeeeeeehhhcCCCceeeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 227 AAYVSQIDTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 227 ~~~a~~~~~~e~~~~~~~~~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) .. ......++..++...--..+++.+++|+++++|++++.++-++. T Consensus 351 ~~-~~~~~~~~~~~~~~~~~~~~~~~~r~Dg~v~~~~A~~~l~ik~~ 396 (402) T protein:vir:93 351 GI-NYDGTTYDTDKDVKKGEYLFVLTAWYDQQRTLDSAFRIAKAKEN 396 (402) T ss_pred hh-hhhhhhhhhhhcccCCceEEEEEEEeCcEEechhheEEEEeecC Confidence 11 01111223444443334578899999999999999997776555 No 150 >protein:vir:2685 Length: 387 # NCBI annotation: hypothetical protein # Family: family:all:658 # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075504;genbank:gi:12719433;genbank:GeneID:920169 Probab=98.96 E-value=9.7e-11 Score=75.48 Aligned_cols=250 Identities=12% Similarity=0.064 Sum_probs=141.6 Q ss_pred Cccc------hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcc-cceeecCCCcccCCCCCccceE Q lcl|NC_011288. 1 MAFN------NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAP-TVKDYKAAGRQTSADAISDTGV 73 (273) Q Consensus 1 MA~~------~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~-~~~~~~~~~~~~~~~~~~~~~~ 73 (273) |... .++|+-|+.++++.++....+..+++.- . ..| .++|++... .......++...+..++..+.+ T Consensus 118 ~~~~~~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~--~--~~~--~~~p~~~~~~~~a~~v~Eg~~~~~~~~~f~~v 191 (387) T protein:vir:26 118 LPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLT--N--IKG--LEIPRVSYTLDDDDFITDVETAKELKAKGDTV 191 (387) T ss_pred hccCCCCCCceeechhHHHHHHHHHHhhchhhhhceee--e--cCC--ceeeeeeccCCcccccccccccccccccccee Confidence 2221 4789999999999998888776666431 1 112 345553321 2223334444454455666777 Q ss_pred EEEEeeeeecceEEchHHHHhhhHHHHH-HHHHHHHHHHHHHHHHHHHHHhhcccc-----cccccCCCHHHHHHHHHHH Q lcl|NC_011288. 74 DLLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADLLVDNGTA-----LSGSAPTDADDAFDLIATA 147 (273) Q Consensus 74 ~~~id~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~~D~~i~~~~~~~~~~-----~~~~~~~t~~~~~~~i~~a 147 (273) ++...+. +.-+.|+..-..++..+++. +.+..+++++.+.+..++......+.. ..+....++...+++|.++ T Consensus 192 ~l~~~k~-~~~i~iS~ell~ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g~~~g~~~~~~~~~~~~~~~~d~i~~~ 270 (387) T protein:vir:26 192 KFTTNKF-KVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKSGLEHMSFYNGSVKEVEGADMYDAIINA 270 (387) T ss_pred eechhee-eeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCccccceeeeccccccccccchHHHHHHH Confidence 7776544 23356665434445566766 556777888887666665332221111 1112233445668888888 Q ss_pred HHHHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCccccCCCcEEEEEcCcee Q lcl|NC_011288. 148 LKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAA 227 (273) Q Consensus 148 ~~~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~~~~~~~~~~~~a~ 227 (273) ...|+....+ +-..++++..+..|++- +.+ . +..+..|.-.++.|.+|+.++..+. .+.|.-+-. T Consensus 271 ~~~l~~~y~~--na~~imn~~t~~~~~~~---~~~---~--~~~~~~~~~~~llG~PV~~~~~~~~-----~~~GDf~~~ 335 (387) T protein:vir:26 271 LADLHEDYRD--NATIYMRYADYVKIISV---LSN---G--TTNFFDTPAEKVFGKPVVFTDAAVK-----PIVGDFNYF 335 (387) T ss_pred HhccChhhhc--CCEEEEechHHHHHHHH---Hhc---C--CCcccccCCccccccceEEecCCCc-----eeeechhhh Confidence 8777665432 22456787777666542 211 1 1234445556899999999876532 233321110 Q ss_pred EEee-eeeeehhhcCCCceeeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 228 AYVS-QIDTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 228 ~~a~-~~~~~e~~~~~~~~~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) +.. .....+..++...--..+++.+++|+++++|++++.++-+++ T Consensus 336 -~~~~~~~~~~~~~~~~~~~~~~~~~~r~Dg~v~~~~A~~~l~~ka~ 381 (387) T protein:vir:26 336 -GINYDGTTYDTDKDVKKGEYLFVLTAWYDQQRTLDSAFRIAKAKEN 381 (387) T ss_pred -hhhhhhhhheecccccCCceEEEEEEEeCcEeechhheEEEEeecC Confidence 111 111122233333234578889999999999999998887666 No 151 >protein:vir:96978 Length: 387 # NCBI annotation: ORF009 # Family: family:all:658 # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239859;genbank:gi:66395517;genbank:GeneID:5133011 Probab=98.96 E-value=9.7e-11 Score=75.48 Aligned_cols=250 Identities=12% Similarity=0.064 Sum_probs=141.6 Q ss_pred Cccc------hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcc-cceeecCCCcccCCCCCccceE Q lcl|NC_011288. 1 MAFN------NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAP-TVKDYKAAGRQTSADAISDTGV 73 (273) Q Consensus 1 MA~~------~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~-~~~~~~~~~~~~~~~~~~~~~~ 73 (273) |... .++|+-|+.++++.++....+..+++.- . ..| .++|++... .......++...+..++..+.+ T Consensus 118 ~~~~~~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~--~--~~~--~~~p~~~~~~~~a~~v~Eg~~~~~~~~~f~~v 191 (387) T protein:vir:96 118 LPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLT--N--IKG--LEIPRVSYTLDDDDFITDVETAKELKAKGDTV 191 (387) T ss_pred hccCCCCCCceeechhHHHHHHHHHHhhchhhhhceee--e--cCC--ceeeeeeccCCcccccccccccccccccccee Confidence 2221 4789999999999998888776666431 1 112 345553321 2223334444454455666777 Q ss_pred EEEEeeeeecceEEchHHHHhhhHHHHH-HHHHHHHHHHHHHHHHHHHHHhhcccc-----cccccCCCHHHHHHHHHHH Q lcl|NC_011288. 74 DLLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADLLVDNGTA-----LSGSAPTDADDAFDLIATA 147 (273) Q Consensus 74 ~~~id~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~~D~~i~~~~~~~~~~-----~~~~~~~t~~~~~~~i~~a 147 (273) ++...+. +.-+.|+..-..++..+++. +.+..+++++.+.+..++......+.. ..+....++...+++|.++ T Consensus 192 ~l~~~k~-~~~i~iS~ell~ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g~~~g~~~~~~~~~~~~~~~~d~i~~~ 270 (387) T protein:vir:96 192 KFTTNKF-KVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKSGLEHMSFYNGSVKEVEGADMYDAIINA 270 (387) T ss_pred eechhee-eeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCccccceeeeccccccccccchHHHHHHH Confidence 7776544 23356665434445566766 556777888887666665332221111 1112233445668888888 Q ss_pred HHHHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCccccCCCcEEEEEcCcee Q lcl|NC_011288. 148 LKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAA 227 (273) Q Consensus 148 ~~~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~~~~~~~~~~~~a~ 227 (273) ...|+....+ +-..++++..+..|++- +.+ . +..+..|.-.++.|.+|+.++..+. .+.|.-+-. T Consensus 271 ~~~l~~~y~~--na~~imn~~t~~~~~~~---~~~---~--~~~~~~~~~~~llG~PV~~~~~~~~-----~~~GDf~~~ 335 (387) T protein:vir:96 271 LADLHEDYRD--NATIYMRYADYVKIISV---LSN---G--TTNFFDTPAEKVFGKPVVFTDAAVK-----PIVGDFNYF 335 (387) T ss_pred HhccChhhhc--CCEEEEechHHHHHHHH---Hhc---C--CCcccccCCccccccceEEecCCCc-----eeeechhhh Confidence 8777665432 22456787777666542 211 1 1234445556899999999876532 233321110 Q ss_pred EEee-eeeeehhhcCCCceeeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 228 AYVS-QIDTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 228 ~~a~-~~~~~e~~~~~~~~~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) +.. .....+..++...--..+++.+++|+++++|++++.++-+++ T Consensus 336 -~~~~~~~~~~~~~~~~~~~~~~~~~~r~Dg~v~~~~A~~~l~~ka~ 381 (387) T protein:vir:96 336 -GINYDGTTYDTDKDVKKGEYLFVLTAWYDQQRTLDSAFRIAKAKEN 381 (387) T ss_pred -hhhhhhhhheecccccCCceEEEEEEEeCcEeechhheEEEEeecC Confidence 111 111122233333234578889999999999999998887666 No 152 >protein:vir:94424 Length: 387 # NCBI annotation: ORF010 # Family: family:all:658 # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240005;genbank:gi:66395666;genbank:GeneID:5133084 Probab=98.96 E-value=9.7e-11 Score=75.48 Aligned_cols=250 Identities=12% Similarity=0.064 Sum_probs=141.6 Q ss_pred Cccc------hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcc-cceeecCCCcccCCCCCccceE Q lcl|NC_011288. 1 MAFN------NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAP-TVKDYKAAGRQTSADAISDTGV 73 (273) Q Consensus 1 MA~~------~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~-~~~~~~~~~~~~~~~~~~~~~~ 73 (273) |... .++|+-|+.++++.++....+..+++.- . ..| .++|++... .......++...+..++..+.+ T Consensus 118 ~~~~~~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~--~--~~~--~~~p~~~~~~~~a~~v~Eg~~~~~~~~~f~~v 191 (387) T protein:vir:94 118 LPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLT--N--IKG--LEIPRVSYTLDDDDFITDVETAKELKAKGDTV 191 (387) T ss_pred hccCCCCCCceeechhHHHHHHHHHHhhchhhhhceee--e--cCC--ceeeeeeccCCcccccccccccccccccccee Confidence 2221 4789999999999998888776666431 1 112 345553321 2223334444454455666777 Q ss_pred EEEEeeeeecceEEchHHHHhhhHHHHH-HHHHHHHHHHHHHHHHHHHHHhhcccc-----cccccCCCHHHHHHHHHHH Q lcl|NC_011288. 74 DLLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADLLVDNGTA-----LSGSAPTDADDAFDLIATA 147 (273) Q Consensus 74 ~~~id~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~~D~~i~~~~~~~~~~-----~~~~~~~t~~~~~~~i~~a 147 (273) ++...+. +.-+.|+..-..++..+++. +.+..+++++.+.+..++......+.. ..+....++...+++|.++ T Consensus 192 ~l~~~k~-~~~i~iS~ell~ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g~~~g~~~~~~~~~~~~~~~~d~i~~~ 270 (387) T protein:vir:94 192 KFTTNKF-KVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPKSGLEHMSFYNGSVKEVEGADMYDAIINA 270 (387) T ss_pred eechhee-eeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCccccceeeeccccccccccchHHHHHHH Confidence 7776544 23356665434445566766 556777888887666665332221111 1112233445668888888 Q ss_pred HHHHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCccccCCCcEEEEEcCcee Q lcl|NC_011288. 148 LKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAA 227 (273) Q Consensus 148 ~~~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~~~~~~~~~~~~a~ 227 (273) ...|+....+ +-..++++..+..|++- +.+ . +..+..|.-.++.|.+|+.++..+. .+.|.-+-. T Consensus 271 ~~~l~~~y~~--na~~imn~~t~~~~~~~---~~~---~--~~~~~~~~~~~llG~PV~~~~~~~~-----~~~GDf~~~ 335 (387) T protein:vir:94 271 LADLHEDYRD--NATIYMRYADYVKIISV---LSN---G--TTNFFDTPAEKVFGKPVVFTDAAVK-----PIVGDFNYF 335 (387) T ss_pred HhccChhhhc--CCEEEEechHHHHHHHH---Hhc---C--CCcccccCCccccccceEEecCCCc-----eeeechhhh Confidence 8777665432 22456787777666542 211 1 1234445556899999999876532 233321110 Q ss_pred EEee-eeeeehhhcCCCceeeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 228 AYVS-QIDTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 228 ~~a~-~~~~~e~~~~~~~~~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) +.. .....+..++...--..+++.+++|+++++|++++.++-+++ T Consensus 336 -~~~~~~~~~~~~~~~~~~~~~~~~~~r~Dg~v~~~~A~~~l~~ka~ 381 (387) T protein:vir:94 336 -GINYDGTTYDTDKDVKKGEYLFVLTAWYDQQRTLDSAFRIAKAKEN 381 (387) T ss_pred -hhhhhhhhheecccccCCceEEEEEEEeCcEeechhheEEEEeecC Confidence 111 111122233333234578889999999999999998887666 No 153 >protein:vir:1084 Length: 437 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076738;genbank:gi:13095848;genbank:GeneID:920418 Probab=98.95 E-value=4.5e-10 Score=71.80 Aligned_cols=256 Identities=12% Similarity=0.006 Sum_probs=134.9 Q ss_pred Cccc------hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcc-cceeecCCCccc-CCCCCccce Q lcl|NC_011288. 1 MAFN------NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAP-TVKDYKAAGRQT-SADAISDTG 72 (273) Q Consensus 1 MA~~------~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~-~~~~~~~~~~~~-~~~~~~~~~ 72 (273) ++.. .++|+-+...+.. +.....+..+++.- .....++++|..... ........+... ..++...+. T Consensus 156 ~~~~~~~~~g~lvp~~~~~~i~~-~~~~~~l~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~e~~~~~e~~~~~~~~ 230 (437) T protein:vir:10 156 VTGIALKDGKVIIPETILTPEKE-VHQFPRLGSLVRTE----SVTTTTGKLPIFNNSTDLLTAHTEYGQTTKNATPVITP 230 (437) T ss_pred hhhcccccccccchHHHHHHHHH-hhhhhhhhhcceeE----eeccCceeeEEeecccccccccccccccccccccccee Confidence 2211 3678888776654 44443444444321 112334556655322 222223333322 223345566 Q ss_pred EEEEEeeeeecceEEchHHHHhhhHHHHH-HHHHHHHHHHHHHHHHHHHHHhhcccccccccCCCHHHHHHHHHHHHH-H Q lcl|NC_011288. 73 VDLLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADLLVDNGTALSGSAPTDADDAFDLIATALK-E 150 (273) Q Consensus 73 ~~~~id~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~~D~~i~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~~-~ 150 (273) +++...+. +.-+.|+..-..++..++.. +.+..+++++...|..++.-...... ..+....++++.++.. . T Consensus 231 v~~~~~k~-~~~~~is~ell~ds~~~~~~~i~~~l~~~~~~~~~~~i~~g~g~~~~------~~~~~~~~~~~~~~~~~~ 303 (437) T protein:vir:10 231 ILWDLKTY-TGGYVFSQELISDSSYDWQAELQSRLIELRDNTDDSLIITALTDGIK------KTTSTYLLGDLKKVLNVT 303 (437) T ss_pred eeeehhhe-eeehhhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccc------ccccccchhhHHHHHHhh Confidence 66665433 33356665433444556766 55678889999999888764432211 1122223344444432 3 Q ss_pred HhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCc--cccCC-Cc-EEEEEc-Cc Q lcl|NC_011288. 151 LTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNN--LRDTD-DE-QFVAFH-PS 225 (273) Q Consensus 151 l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~--l~~~~-~~-~~~~~~-~~ 225 (273) |+.... .+-..+++|..+..|.+...... .......+..|.-++|+|.+|+.+++ +|..+ +. .++.|. +. T Consensus 304 l~~~~~--~~~~~~~~~~~~~~l~~lkd~~g---~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~ 378 (437) T protein:vir:10 304 LKPQDS--AAASIVMSQSAYNLFDMATDAMG---RPLLQPNVTAATGYTLLGKTVVIVDDKLFPSASAGDVNIVVAPLKK 378 (437) T ss_pred hhhhhh--cCCEEEEcHHHHHHHHHhhccCC---CeeeccCccCCCCcccccceeEEecccccCCcCCCceEEEEeeccc Confidence 443332 23467999999998865321100 11111223456567899999998654 45433 22 234443 33 Q ss_pred eeEEee-eeeeehhhcCCCceeeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 226 AAAYVS-QIDTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 226 a~~~a~-~~~~~e~~~~~~~~~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) ++.... +...++...+-..+.+.+.+-+++|+++++|++++.|+.... T Consensus 379 ~~~~~~r~~~~~~~~~~~~~~~~~~~~~~r~d~~~~~~~a~~~l~~~~~ 427 (437) T protein:vir:10 379 AVINFKLTEITGQFQDTYDIWYKQLGIFLRQNVVQASKDLIVNLTGKLK 427 (437) T ss_pred cEEEEeeeceEEEEecccccccceeeEEEEEccEEecccceEEEEeecc Confidence 443332 233344334444566778888999999999999999874433 No 154 >protein:vir:8420 Length: 477 # NCBI annotation: gp15 # Family: family:all:21 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818316;genbank:gi:29566752;genbank:GeneID:1260033 Probab=98.95 E-value=2.7e-10 Score=73.06 Aligned_cols=267 Identities=16% Similarity=0.090 Sum_probs=135.6 Q ss_pred Cccc------hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCccc-ceeecCCCcccC-----CCCC Q lcl|NC_011288. 1 MAFN------NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPT-VKDYKAAGRQTS-----ADAI 68 (273) Q Consensus 1 MA~~------~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~-~~~~~~~~~~~~-----~~~~ 68 (273) +..+ ...|+.+...+++.++..+++..++..-. ....+.++.||+....+ ......++...+ ..++ T Consensus 157 ~~~~~~~gg~lv~~~~~~~~ii~~l~~~~~i~~~~~~~~--~~~~~~~~~ip~~~~~~~~a~~~~Eg~~~~~~~~~~s~~ 234 (477) T protein:vir:84 157 LDRNGGTGGYAVPPLWMMNRFIELARAGRTYANLCPTEP--LPGGTSSINIPKILTGTSTAIQAADNAALTAPSAHEVDL 234 (477) T ss_pred ccccCCCcceeeccchhHHHHHHHhhhcchHHHhhceee--ecCCcceeEEEEEecCcceeeeeccCccccccccccccc Confidence 1111 23467678889999988887777665421 11235578899864332 222333333221 1234 Q ss_pred ccceEEEEEeeeeecceEEchHHHHhhhHHHHH-HHHHHHHHHHHHHHHHHHHHHhhc--cccc---------cc-ccCC Q lcl|NC_011288. 69 SDTGVDLLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADLLVDN--GTAL---------SG-SAPT 135 (273) Q Consensus 69 ~~~~~~~~id~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~~D~~i~~~~~~~--~~~~---------~~-~~~~ 135 (273) ..+.+++...+. +.-+.|+..-..++..++.. +.+++.++++.++|..++.=--.. +... .. .+.. T Consensus 235 ~f~~i~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~~l~G~Gt~~~p~Gi~~~~~~~~~~~~~~~~ 313 (477) T protein:vir:84 235 TDGFVQANVKTI-AGQQGIAIQLLDQAAVSVDEFVFRDLAADYANKLNVQVISGTGSNNQVVGVRATAGITQVTATSAGS 313 (477) T ss_pred ceeeEEEeeeeE-EeeeHHHHHHHhccchhHHHHHHHHHHHHHHHHHHHHHhccCCCCCccceeeecccccccccccccc Confidence 445555555442 23344555433444557766 557888999999998877211000 0000 00 0011 Q ss_pred C---HHHHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHHHhh----hhccc------ccceeeeeeeeeEec Q lcl|NC_011288. 136 D---ADDAFDLIATALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTS----ADTSG------DAAGLRAGTIGNLLG 202 (273) Q Consensus 136 t---~~~~~~~i~~a~~~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~----~~~~~------~~~~l~~G~ig~~~G 202 (273) + ....++.|.++...++.... ......+++|..+..|.+....-.+ .+..+ ....+.+|..|++.| T Consensus 314 t~~~~~~~~~~i~~~~~~~~~~~~-~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~~~~~~~~~l~G 392 (477) T protein:vir:84 314 ALEKHQIIYQKIADAIQRVHTSRF-LEPEVIVMHPRRWASFHAIFAGDDRPLIVPSGPGFNNLGVLTEVASQRVVGQMHG 392 (477) T ss_pred chhhHHHHHHHHHHHHhhcccccc-CCccEEEEcHHHHHHHHHhhccCCCeeeecCcccccccccccccccccccchhcc Confidence 1 12345556666655544332 1234678999999888654321110 00000 001234455678999 Q ss_pred eEEEeeCccccCCC-----cEEEEEcCceeEEeeeeeeehhhcCCCceee----eEEeeeeeeeEEec-CceEEEEecCC Q lcl|NC_011288. 203 ARIVESNNLRDTDD-----EQFVAFHPSAAAYVSQIDTVEALRDQDSFSD----RIRALHVYGGKVVR-PTGVVVFNKTG 272 (273) Q Consensus 203 ~~v~~s~~l~~~~~-----~~~~~~~~~a~~~a~~~~~~e~~~~~~~~~~----~v~~~~~~g~~v~~-~~~~v~~~~~~ 272 (273) ++|+.++.+|...+ ..++.+.-+.+-... ..++...+...+.+ .++..-.+++..+| |+++|+++-++ T Consensus 393 ~pVv~s~~~p~~~~~~~d~~~i~~gd~~~~~i~~--~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~r~~~afv~~t~~~ 470 (477) T protein:vir:84 393 LPVVTDPTLPTTLGTGTDQDVIHVLRASDLALFE--SSVRMRALQETRAENLSVLLQVYGYLAFTAARFPQSVVEIGGTA 470 (477) T ss_pred cceEecCcccccccccCCcceEEEEEeceEEEEe--eceeEEeccccccccceeeeeehhhhhhhhhccccceEEeeccc Confidence 99999999996322 123444333222111 11111222222222 22222233445666 99999998888 Q ss_pred C Q lcl|NC_011288. 273 S 273 (273) Q Consensus 273 s 273 (273) . T Consensus 471 ~ 471 (477) T protein:vir:84 471 L 471 (477) T ss_pred c Confidence 8 No 155 >protein:vir:106647 Length: 303 # NCBI annotation: ORF011 # Family: family:all:1178 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239493;genbank:gi:66395226;genbank:GeneID:4555801 Probab=98.88 E-value=2.5e-10 Score=73.23 Aligned_cols=252 Identities=10% Similarity=0.003 Sum_probs=128.8 Q ss_pred Cccc-hhh-------------HHHHHHHHHHHHHHhh---ccchhhcccccccccCCceEEEeec---CcccceeecCCC Q lcl|NC_011288. 1 MAFN-NFI-------------PELWSDMLLEEWTAQT---VFANLVNREYEGTASKGNVVHIAGV---VAPTVKDYKAAG 60 (273) Q Consensus 1 MA~~-~~~-------------pev~~~~~~~~~~~~l---v~~~~v~~~~~~~~~~Gdtv~ip~~---~~~~~~~~~~~~ 60 (273) |+.+ +++ .+.|++-+.+ |.+.| .+.++ ..|.+|+++++ ...+......+| T Consensus 1 M~~e~nl~~~~dL~~a~siDF~~~f~~~i~~-L~~~LGv~r~~pl---------a~Gt~iktyK~~~~~y~gda~dVaEG 70 (303) T protein:vir:10 1 MSAENNLINVEALGKAKSIDFANKLGVGLNK-LFEALAIQNKIPM---------NVGSALKQYRFKVEDSEKPNGDVAEG 70 (303) T ss_pred CCCCcCCcchhhcccceeehhhhhhhhhHHH-HHHHhhhhccccc---------cCCceeeeeeeeceeeccccccccCC Confidence 6654 221 2334444333 33333 22222 24666655554 343444445677 Q ss_pred cccCCCCCccc---eEEEEEeeeeecceEEchHHHHhh-hHH-HHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc--c Q lcl|NC_011288. 61 RQTSADAISDT---GVDLLIDQEKSIDFLVDDIDRVQV-AGS-LEAYTRAGATALATDTDKFIADLLVDNGTALSGS--A 133 (273) Q Consensus 61 ~~~~~~~~~~~---~~~~~id~~~~~~~~i~d~d~~~~-~~~-~~~~~~~~~~ala~~~D~~i~~~~~~~~~~~~~~--~ 133 (273) ..+..+.+... ..+++++|++. .++|+..... .++ +.+.-+|+.++|++++|++++..+..+......+ + T Consensus 71 e~Iplskvt~~~~~t~~~~~kK~rK---~tTdEAIqlsGyg~aVgetd~qL~~~Iq~kIdnd~~~~lktaT~t~~~t~~t 147 (303) T protein:vir:10 71 DVIPLTKVTREQVDITELQFAKYRK---STSAEAIQAHGYDLAINQTDNEMIKYVQKKFRAKFFETLKSAIENGKRTNKT 147 (303) T ss_pred cccchhhheeeecceEEEEeecccc---cccHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhhcccccccccce Confidence 77888888754 57888877644 3366543322 233 6778899999999999999999998765443322 2 Q ss_pred CCCHHHHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCcccc Q lcl|NC_011288. 134 PTDADDAFDLIATALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRD 213 (273) Q Consensus 134 ~~t~~~~~~~i~~a~~~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~ 213 (273) ..+.+..-.++......|+...=-...-+++++|...+.++++.......... +.+.| -++.|+.|++|+.+|. T Consensus 148 ~~s~~glq~Al~~~~~kl~~~~ed~~~~V~FvNP~Daa~yl~~A~i~~~~t~f-G~n~L-----~nfLG~~II~S~kv~~ 221 (303) T protein:vir:10 148 KLSAENLQGALSKGRANLSVLLDDEITPIAFVNPNDTAEYLANGFINSTGAQF-GVNLL-----TPYVGVKIVEFADVPQ 221 (303) T ss_pred eecHHHHHHHHHhhhhhccccccccccEEEEEchHHHHHHhhcCCcchhhhhh-hhhhh-----hhhhcceEEEeccCCC Confidence 23333333333333223222110012348899999999998876543222222 22333 3599999999999986 Q ss_pred CCCc------EEEEEcCceeE-Eee----eeeee---hhhcCCCceeeeEEeeeeeeeEEe---cCceEEEEecCCC Q lcl|NC_011288. 214 TDDE------QFVAFHPSAAA-YVS----QIDTV---EALRDQDSFSDRIRALHVYGGKVV---RPTGVVVFNKTGS 273 (273) Q Consensus 214 ~~~~------~~~~~~~~a~~-~a~----~~~~~---e~~~~~~~~~~~v~~~~~~g~~v~---~~~~~v~~~~~~s 273 (273) +.-. -.+++-+.. | .+. -.|++ -...+.....--+.- ..+.+-++ ++++||+.+-++. T Consensus 222 G~~~~T~~~Ni~~ay~~~~-g~l~~~f~~t~D~tglIGv~h~~~~~~~t~eT-~~~~~~~lfpE~~dgiv~~ti~~~ 296 (303) T protein:vir:10 222 GEVWMTVAENLNVAYANPR-GELSRAFAFATDATGFVGVLHDIQPQRLTSDT-IYASAISMFPENIDAVIKVTIKKD 296 (303) T ss_pred ceEEEeeccceEEEEecCc-hhhhhhhhhccccccceEEEeccccceeeehh-HhHhHHHhcccccceEEEEEEecc Confidence 5432 122221110 1 000 00110 011111111111111 22222223 5678887776655 No 156 >protein:vir:102873 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338137;genbank:gi:77020198;genbank:GeneID:3703782 Probab=98.87 E-value=1.6e-09 Score=68.76 Aligned_cols=257 Identities=11% Similarity=-0.031 Sum_probs=141.2 Q ss_pred Cccc------hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeecCCCcccC-CCCCccceE Q lcl|NC_011288. 1 MAFN------NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTS-ADAISDTGV 73 (273) Q Consensus 1 MA~~------~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~~~-~~~~~~~~~ 73 (273) |+.. .++|+.+...+++.+++.+++.+++..-. ......+..+|+...........++.... .+.+..+.+ T Consensus 106 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~--~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v 183 (392) T protein:vir:10 106 MSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEP--VRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNV 183 (392) T ss_pred ccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeee--ccCCceeEEEEeecCCccceeecccccccccccccceeE Confidence 3321 47899999999999999988877764311 11111234455544333333344443332 233566677 Q ss_pred EEEEeeeeecceEEchHHHHhhhHHHHH-HHHHHHHHHHHHHHHHHHHHHhhcccccccccCCCHHHHHHHHHHHH-HHH Q lcl|NC_011288. 74 DLLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADLLVDNGTALSGSAPTDADDAFDLIATAL-KEL 151 (273) Q Consensus 74 ~~~id~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~~D~~i~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~-~~l 151 (273) +++..+. +.-+.|++.-..++..++.. +.+..+++++...|..++.-..+.. ......+++|.++. ..| T Consensus 184 ~l~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~--------~~~~~~~d~i~~~~~~~l 254 (392) T protein:vir:10 184 QYAVKDR-AGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLT--------KQAIKSLDDIKDVLNVKL 254 (392) T ss_pred EeeeeeE-EEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc--------ccCccCHHHHHHHHHHhh Confidence 7776443 44456666444444556665 5677888999999988875332211 11122345666654 345 Q ss_pred hhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEe-e-Ccccc----CCCcE-EEEEcC Q lcl|NC_011288. 152 TKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVE-S-NNLRD----TDDEQ-FVAFHP 224 (273) Q Consensus 152 ~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~-s-~~l~~----~~~~~-~~~~~~ 224 (273) ..... .+-..|++|..+..|.+-...-. ...-...+..|.-+.++|.+++. + +..+. ..+.. ++.+.- T Consensus 255 ~~~~~--~~a~~vm~~~~~~~L~~lkd~~G---~~l~~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gdf 329 (392) T protein:vir:10 255 DPAIS--PNAILLTNQDGFNYLDKLKDKDG---KYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDL 329 (392) T ss_pred hhhhc--cCCEEEEcHHHHHHHHHhhccCC---CeEeecCccCCccccccCcccEEEecccccCCCcccCCceEEEEEeh Confidence 54443 34568999999999865321100 00011123345567899987654 2 32221 12222 334432 Q ss_pred -ceeEEee-eeeeehhhcC-CCc---eeeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 225 -SAAAYVS-QIDTVEALRD-QDS---FSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 225 -~a~~~a~-~~~~~e~~~~-~~~---~~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) .++.... +...++..+. ... ....+++.+++|..+++|++++.++.+.+ T Consensus 330 s~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~ 384 (392) T protein:vir:10 330 KEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLS 384 (392) T ss_pred hceEEEEeecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEeccc Confidence 2333222 1222222221 112 23468899999999999999999887777 No 157 >protein:vir:102082 Length: 392 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512315;genbank:gi:89152484;genbank:GeneID:3953075 Probab=98.87 E-value=1.6e-09 Score=68.76 Aligned_cols=257 Identities=11% Similarity=-0.031 Sum_probs=141.2 Q ss_pred Cccc------hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeecCCCcccC-CCCCccceE Q lcl|NC_011288. 1 MAFN------NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTS-ADAISDTGV 73 (273) Q Consensus 1 MA~~------~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~~~-~~~~~~~~~ 73 (273) |+.. .++|+.+...+++.+++.+++.+++..-. ......+..+|+...........++.... .+.+..+.+ T Consensus 106 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~--~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v 183 (392) T protein:vir:10 106 MSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEP--VRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNV 183 (392) T ss_pred ccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeee--ccCCceeEEEEeecCCccceeecccccccccccccceeE Confidence 3321 47899999999999999988877764311 11111234455544333333344443332 233566677 Q ss_pred EEEEeeeeecceEEchHHHHhhhHHHHH-HHHHHHHHHHHHHHHHHHHHHhhcccccccccCCCHHHHHHHHHHHH-HHH Q lcl|NC_011288. 74 DLLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADLLVDNGTALSGSAPTDADDAFDLIATAL-KEL 151 (273) Q Consensus 74 ~~~id~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~~D~~i~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~-~~l 151 (273) +++..+. +.-+.|++.-..++..++.. +.+..+++++...|..++.-..+.. ......+++|.++. ..| T Consensus 184 ~l~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~--------~~~~~~~d~i~~~~~~~l 254 (392) T protein:vir:10 184 QYAVKDR-AGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLT--------KQAIKSLDDIKDVLNVKL 254 (392) T ss_pred EeeeeeE-EEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc--------ccCccCHHHHHHHHHHhh Confidence 7776443 44456666444444556665 5677888999999988875332211 11122345666654 345 Q ss_pred hhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEe-e-Ccccc----CCCcE-EEEEcC Q lcl|NC_011288. 152 TKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVE-S-NNLRD----TDDEQ-FVAFHP 224 (273) Q Consensus 152 ~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~-s-~~l~~----~~~~~-~~~~~~ 224 (273) ..... .+-..|++|..+..|.+-...-. ...-...+..|.-+.++|.+++. + +..+. ..+.. ++.+.- T Consensus 255 ~~~~~--~~a~~vm~~~~~~~L~~lkd~~G---~~l~~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gdf 329 (392) T protein:vir:10 255 DPAIS--PNAILLTNQDGFNYLDKLKDKDG---KYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDL 329 (392) T ss_pred hhhhc--cCCEEEEcHHHHHHHHHhhccCC---CeEeecCccCCccccccCcccEEEecccccCCCcccCCceEEEEEeh Confidence 54443 34568999999999865321100 00011123345567899987654 2 32221 12222 334432 Q ss_pred -ceeEEee-eeeeehhhcC-CCc---eeeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 225 -SAAAYVS-QIDTVEALRD-QDS---FSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 225 -~a~~~a~-~~~~~e~~~~-~~~---~~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) .++.... +...++..+. ... ....+++.+++|..+++|++++.++.+.+ T Consensus 330 s~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~ 384 (392) T protein:vir:10 330 KEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLS 384 (392) T ss_pred hceEEEEeecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEeccc Confidence 2333222 1222222221 112 23468899999999999999999887777 No 158 >protein:vir:105004 Length: 392 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459969;genbank:gi:85701384;genbank:GeneID:3882145 Probab=98.87 E-value=1.6e-09 Score=68.76 Aligned_cols=257 Identities=11% Similarity=-0.031 Sum_probs=141.2 Q ss_pred Cccc------hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeecCCCcccC-CCCCccceE Q lcl|NC_011288. 1 MAFN------NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTS-ADAISDTGV 73 (273) Q Consensus 1 MA~~------~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~~~-~~~~~~~~~ 73 (273) |+.. .++|+.+...+++.+++.+++.+++..-. ......+..+|+...........++.... .+.+..+.+ T Consensus 106 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~--~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v 183 (392) T protein:vir:10 106 MSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEP--VRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNV 183 (392) T ss_pred ccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeee--ccCCceeEEEEeecCCccceeecccccccccccccceeE Confidence 3321 47899999999999999988877764311 11111234455544333333344443332 233566677 Q ss_pred EEEEeeeeecceEEchHHHHhhhHHHHH-HHHHHHHHHHHHHHHHHHHHHhhcccccccccCCCHHHHHHHHHHHH-HHH Q lcl|NC_011288. 74 DLLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADLLVDNGTALSGSAPTDADDAFDLIATAL-KEL 151 (273) Q Consensus 74 ~~~id~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~~D~~i~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~-~~l 151 (273) +++..+. +.-+.|++.-..++..++.. +.+..+++++...|..++.-..+.. ......+++|.++. ..| T Consensus 184 ~l~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~--------~~~~~~~d~i~~~~~~~l 254 (392) T protein:vir:10 184 QYAVKDR-AGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLT--------KQAIKSLDDIKDVLNVKL 254 (392) T ss_pred EeeeeeE-EEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc--------ccCccCHHHHHHHHHHhh Confidence 7776443 44456666444444556665 5677888999999988875332211 11122345666654 345 Q ss_pred hhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEe-e-Ccccc----CCCcE-EEEEcC Q lcl|NC_011288. 152 TKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVE-S-NNLRD----TDDEQ-FVAFHP 224 (273) Q Consensus 152 ~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~-s-~~l~~----~~~~~-~~~~~~ 224 (273) ..... .+-..|++|..+..|.+-...-. ...-...+..|.-+.++|.+++. + +..+. ..+.. ++.+.- T Consensus 255 ~~~~~--~~a~~vm~~~~~~~L~~lkd~~G---~~l~~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gdf 329 (392) T protein:vir:10 255 DPAIS--PNAILLTNQDGFNYLDKLKDKDG---KYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDL 329 (392) T ss_pred hhhhc--cCCEEEEcHHHHHHHHHhhccCC---CeEeecCccCCccccccCcccEEEecccccCCCcccCCceEEEEEeh Confidence 54443 34568999999999865321100 00011123345567899987654 2 32221 12222 334432 Q ss_pred -ceeEEee-eeeeehhhcC-CCc---eeeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 225 -SAAAYVS-QIDTVEALRD-QDS---FSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 225 -~a~~~a~-~~~~~e~~~~-~~~---~~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) .++.... +...++..+. ... ....+++.+++|..+++|++++.++.+.+ T Consensus 330 s~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~ 384 (392) T protein:vir:10 330 KEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLS 384 (392) T ss_pred hceEEEEeecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEeccc Confidence 2333222 1222222221 112 23468899999999999999999887777 No 159 >protein:vir:107593 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338188;genbank:gi:77020144;genbank:GeneID:3703724 Probab=98.87 E-value=1.6e-09 Score=68.76 Aligned_cols=257 Identities=11% Similarity=-0.031 Sum_probs=141.2 Q ss_pred Cccc------hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeecCCCcccC-CCCCccceE Q lcl|NC_011288. 1 MAFN------NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTS-ADAISDTGV 73 (273) Q Consensus 1 MA~~------~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~~~-~~~~~~~~~ 73 (273) |+.. .++|+.+...+++.+++.+++.+++..-. ......+..+|+...........++.... .+.+..+.+ T Consensus 106 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~--~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v 183 (392) T protein:vir:10 106 MSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEP--VRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNV 183 (392) T ss_pred ccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeee--ccCCceeEEEEeecCCccceeecccccccccccccceeE Confidence 3321 47899999999999999988877764311 11111234455544333333344443332 233566677 Q ss_pred EEEEeeeeecceEEchHHHHhhhHHHHH-HHHHHHHHHHHHHHHHHHHHHhhcccccccccCCCHHHHHHHHHHHH-HHH Q lcl|NC_011288. 74 DLLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADLLVDNGTALSGSAPTDADDAFDLIATAL-KEL 151 (273) Q Consensus 74 ~~~id~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~~D~~i~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~-~~l 151 (273) +++..+. +.-+.|++.-..++..++.. +.+..+++++...|..++.-..+.. ......+++|.++. ..| T Consensus 184 ~l~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~--------~~~~~~~d~i~~~~~~~l 254 (392) T protein:vir:10 184 QYAVKDR-AGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLT--------KQAIKSLDDIKDVLNVKL 254 (392) T ss_pred EeeeeeE-EEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc--------ccCccCHHHHHHHHHHhh Confidence 7776443 44456666444444556665 5677888999999988875332211 11122345666654 345 Q ss_pred hhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEe-e-Ccccc----CCCcE-EEEEcC Q lcl|NC_011288. 152 TKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVE-S-NNLRD----TDDEQ-FVAFHP 224 (273) Q Consensus 152 ~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~-s-~~l~~----~~~~~-~~~~~~ 224 (273) ..... .+-..|++|..+..|.+-...-. ...-...+..|.-+.++|.+++. + +..+. ..+.. ++.+.- T Consensus 255 ~~~~~--~~a~~vm~~~~~~~L~~lkd~~G---~~l~~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gdf 329 (392) T protein:vir:10 255 DPAIS--PNAILLTNQDGFNYLDKLKDKDG---KYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDL 329 (392) T ss_pred hhhhc--cCCEEEEcHHHHHHHHHhhccCC---CeEeecCccCCccccccCcccEEEecccccCCCcccCCceEEEEEeh Confidence 54443 34568999999999865321100 00011123345567899987654 2 32221 12222 334432 Q ss_pred -ceeEEee-eeeeehhhcC-CCc---eeeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 225 -SAAAYVS-QIDTVEALRD-QDS---FSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 225 -~a~~~a~-~~~~~e~~~~-~~~---~~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) .++.... +...++..+. ... ....+++.+++|..+++|++++.++.+.+ T Consensus 330 s~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~ 384 (392) T protein:vir:10 330 KEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLS 384 (392) T ss_pred hceEEEEeecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEeccc Confidence 2333222 1222222221 112 23468899999999999999999887777 No 160 >protein:vir:101650 Length: 497 # NCBI annotation: gp13 # Family: family:all:585 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654768;genbank:gi:109302766;genbank:GeneID:4156084 Probab=98.82 E-value=6.8e-09 Score=65.36 Aligned_cols=262 Identities=14% Similarity=0.067 Sum_probs=139.9 Q ss_pred Cccc------hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcc-cceeecCCCcccCCCCCccceE Q lcl|NC_011288. 1 MAFN------NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAP-TVKDYKAAGRQTSADAISDTGV 73 (273) Q Consensus 1 MA~~------~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~-~~~~~~~~~~~~~~~~~~~~~~ 73 (273) |... .++|+.|...+++.+++.+.+.+++++- ...+.++.||+.... .......++......+++.+.+ T Consensus 151 ~~~~~~~~gg~~vp~~~~~~ii~~~~~~~~i~~l~~~~----~~~~~~~~~~~~~~~~~~a~wv~E~~~~~~s~~~f~~i 226 (497) T protein:vir:10 151 NPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSR----PVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARV 226 (497) T ss_pred hhcccCcccccccchhhhHHHHHHHHhhhhHHhhcccc----ccCCCceEEEEEcCCCCcceeeccCcccccccccceee Confidence 2222 4689999999999999988888887542 223457889886432 2334455565555566777777 Q ss_pred EEEEeeeeecceEEchHHHHhhhHHHHH-HHHHHHHHHHHHHHHHHHHH---------Hhhccc-cccccc--------- Q lcl|NC_011288. 74 DLLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADL---------LVDNGT-ALSGSA--------- 133 (273) Q Consensus 74 ~~~id~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~~D~~i~~~---------~~~~~~-~~~~~~--------- 133 (273) ++...+... -+.|+.. ......+++. +.+.++++++.++|..++.= +..... ...... T Consensus 227 ~~~~~k~a~-~~~iS~e-ll~d~~~l~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~ 304 (497) T protein:vir:10 227 YEQVGKVAN-ALTITDE-GLRDAPELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSAT 304 (497) T ss_pred EeeeeeeEe-ecHhHHH-HHHhHHHHHHHHHHHHHHHHHHHHHHHhhcCCCcccccccccccccccccccccchhhhhhh Confidence 777755422 3455553 3333345666 55778899999999887631 110000 000000 Q ss_pred -------------------------------------------CCCHHHHHHHHHHHHHHHhhcCCCccCCEEEECHHHH Q lcl|NC_011288. 134 -------------------------------------------PTDADDAFDLIATALKELTKANVPNVGRVVVVNAEMA 170 (273) Q Consensus 134 -------------------------------------------~~t~~~~~~~i~~a~~~l~~~~vP~~~r~lvv~p~~~ 170 (273) ..+.......+..+...+..... ...-..+++|..+ T Consensus 305 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~vmn~~~~ 383 (497) T protein:vir:10 305 VSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLF-QTPNAVVMNPRDW 383 (497) T ss_pred hhhhhhhcccccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhhcc-cCCCeEEEchHHH Confidence 00000111122222222221111 0111478999999 Q ss_pred HHHhhhHHHHhhhhccccccee----eeeeeeeEeceEEEeeCccccCCCcEEEEEc--CceeEEeee-eeeehhhc-CC Q lcl|NC_011288. 171 FWLRSSGSKLTSADTSGDAAGL----RAGTIGNLLGARIVESNNLRDTDDEQFVAFH--PSAAAYVSQ-IDTVEALR-DQ 242 (273) Q Consensus 171 ~~L~~~~~~~~~~~~~~~~~~l----~~G~ig~~~G~~v~~s~~l~~~~~~~~~~~~--~~a~~~a~~-~~~~e~~~-~~ 242 (273) ..|.+......+ ...+..... ..+.-.++.|.+|+.++.+|.+. ++.|. ..++....+ ...++... .. T Consensus 384 ~~l~~lkd~~G~-~i~~~~~~~~~~~~~~~~~~l~G~pV~~t~~~~~~~---~~~Gd~~~~~~~i~~r~~~~v~~~~~~~ 459 (497) T protein:vir:10 384 ELLRLTKDANGQ-YMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGT---ILVGHFAPSVIQTARREGVTMQMTNSNG 459 (497) T ss_pred HHHHHhhcCCCc-eeccCcccccccccccCCceeeceeeEecCCCCCCc---eEEeecccceEEEEEecccEEEeecccc Confidence 887543211111 000010000 11123479999999999998643 33333 223333322 11222211 12 Q ss_pred Cce---eeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 243 DSF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 243 ~~~---~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) ..| -..|++.+++|..|.+|++++.++-+.+ T Consensus 460 ~~f~~n~v~~r~~~r~~~~v~~p~A~~~l~~~~~ 493 (497) T protein:vir:10 460 TDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKG 493 (497) T ss_pred hhhhcCcEEEEEEEeecceeeccccEEEEEecCC Confidence 222 3468889999999999999998876666 No 161 >protein:vir:7855 Length: 497 # NCBI annotation: gp12 # Family: family:all:585 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817462;genbank:gi:29565891;genbank:GeneID:1259081 Probab=98.82 E-value=6.8e-09 Score=65.36 Aligned_cols=262 Identities=14% Similarity=0.067 Sum_probs=139.9 Q ss_pred Cccc------hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcc-cceeecCCCcccCCCCCccceE Q lcl|NC_011288. 1 MAFN------NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAP-TVKDYKAAGRQTSADAISDTGV 73 (273) Q Consensus 1 MA~~------~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~-~~~~~~~~~~~~~~~~~~~~~~ 73 (273) |... .++|+.|...+++.+++.+.+.+++++- ...+.++.||+.... .......++......+++.+.+ T Consensus 151 ~~~~~~~~gg~~vp~~~~~~ii~~~~~~~~i~~l~~~~----~~~~~~~~~~~~~~~~~~a~wv~E~~~~~~s~~~f~~i 226 (497) T protein:vir:78 151 NPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSR----PVTSPNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARV 226 (497) T ss_pred hhcccCcccccccchhhhHHHHHHHHhhhhHHhhcccc----ccCCCceEEEEEcCCCCcceeeccCcccccccccceee Confidence 2222 4689999999999999988888887542 223457889886432 2334455565555566777777 Q ss_pred EEEEeeeeecceEEchHHHHhhhHHHHH-HHHHHHHHHHHHHHHHHHHH---------Hhhccc-cccccc--------- Q lcl|NC_011288. 74 DLLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADL---------LVDNGT-ALSGSA--------- 133 (273) Q Consensus 74 ~~~id~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~~D~~i~~~---------~~~~~~-~~~~~~--------- 133 (273) ++...+... -+.|+.. ......+++. +.+.++++++.++|..++.= +..... ...... T Consensus 227 ~~~~~k~a~-~~~iS~e-ll~d~~~l~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~ 304 (497) T protein:vir:78 227 YEQVGKVAN-ALTITDE-GLRDAPELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSAT 304 (497) T ss_pred EeeeeeeEe-ecHhHHH-HHHhHHHHHHHHHHHHHHHHHHHHHHHhhcCCCcccccccccccccccccccccchhhhhhh Confidence 777755422 3455553 3333345666 55778899999999887631 110000 000000 Q ss_pred -------------------------------------------CCCHHHHHHHHHHHHHHHhhcCCCccCCEEEECHHHH Q lcl|NC_011288. 134 -------------------------------------------PTDADDAFDLIATALKELTKANVPNVGRVVVVNAEMA 170 (273) Q Consensus 134 -------------------------------------------~~t~~~~~~~i~~a~~~l~~~~vP~~~r~lvv~p~~~ 170 (273) ..+.......+..+...+..... ...-..+++|..+ T Consensus 305 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~vmn~~~~ 383 (497) T protein:vir:78 305 VSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLF-QTPNAVVMNPRDW 383 (497) T ss_pred hhhhhhhcccccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhhcc-cCCCeEEEchHHH Confidence 00000111122222222221111 0111478999999 Q ss_pred HHHhhhHHHHhhhhccccccee----eeeeeeeEeceEEEeeCccccCCCcEEEEEc--CceeEEeee-eeeehhhc-CC Q lcl|NC_011288. 171 FWLRSSGSKLTSADTSGDAAGL----RAGTIGNLLGARIVESNNLRDTDDEQFVAFH--PSAAAYVSQ-IDTVEALR-DQ 242 (273) Q Consensus 171 ~~L~~~~~~~~~~~~~~~~~~l----~~G~ig~~~G~~v~~s~~l~~~~~~~~~~~~--~~a~~~a~~-~~~~e~~~-~~ 242 (273) ..|.+......+ ...+..... ..+.-.++.|.+|+.++.+|.+. ++.|. ..++....+ ...++... .. T Consensus 384 ~~l~~lkd~~G~-~i~~~~~~~~~~~~~~~~~~l~G~pV~~t~~~~~~~---~~~Gd~~~~~~~i~~r~~~~v~~~~~~~ 459 (497) T protein:vir:78 384 ELLRLTKDANGQ-YMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGT---ILVGHFAPSVIQTARREGVTMQMTNSNG 459 (497) T ss_pred HHHHHhhcCCCc-eeccCcccccccccccCCceeeceeeEecCCCCCCc---eEEeecccceEEEEEecccEEEeecccc Confidence 887543211111 000010000 11123479999999999998643 33333 223333322 11222211 12 Q ss_pred Cce---eeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 243 DSF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 243 ~~~---~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) ..| -..|++.+++|..|.+|++++.++-+.+ T Consensus 460 ~~f~~n~v~~r~~~r~~~~v~~p~A~~~l~~~~~ 493 (497) T protein:vir:78 460 TDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKG 493 (497) T ss_pred hhhhcCcEEEEEEEeecceeeccccEEEEEecCC Confidence 222 3468889999999999999998876666 No 162 >protein:vir:80128 Length: 466 # NCBI annotation: Phage capsid protein # Family: family:all:635 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425603;genbank:gi:155042936;genbank:GeneID:5469556 Probab=98.71 E-value=2.7e-09 Score=67.56 Aligned_cols=259 Identities=14% Similarity=0.075 Sum_probs=133.0 Q ss_pred Ccc-chhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeecCCCcccCCCCCccceEEEEEee Q lcl|NC_011288. 1 MAF-NNFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLLIDQ 79 (273) Q Consensus 1 MA~-~~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~id~ 79 (273) -+. ..++|+.+...+.+.+.....+.++++.. ..+.++++|..+..+.......+......++..+.+++.+.+ T Consensus 154 ~~g~~~~vP~~~~~~i~~~l~~~~~l~~~~~v~-----~~~g~~~~~~~~~~~~a~wv~E~~~~~~~~~~f~~i~~~~~k 228 (466) T protein:vir:80 154 VSGAELTIPDVMLELLRDNMHRYSKLISKVRLR-----PLKGTARQNIAGAIPEGVWTEAVANLNELSLSFSQIEVDGYK 228 (466) T ss_pred hccccccccHHHHHHHHHhhhhhhhhhhheeee-----ecCceeEeeeecCCcceeecccccccccccccccceeeccee Confidence 111 13679999898888888777776665421 112245666655544443344444444445666667766644 Q ss_pred eeecceEEchHHHHhhhHHHHH-HHHHHHHHHHHHHHHHHHH---------HHhhccccc----cccc-----CCCHHH- Q lcl|NC_011288. 80 EKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIAD---------LLVDNGTAL----SGSA-----PTDADD- 139 (273) Q Consensus 80 ~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~~D~~i~~---------~~~~~~~~~----~~~~-----~~t~~~- 139 (273) . +.-+.|++.=..++..++.. +.+..+++++...|..++. .+...+... .... ..+... T Consensus 229 ~-~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~ail~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~ 307 (466) T protein:vir:80 229 V-GGFIPIPNSTLEDSDLNLADEILDAIGQAIGFALDKAILYGTGTKMPVGIVTRLAQTTQPPNWGTKAPAWTNLSTTNL 307 (466) T ss_pred e-eeehhhhHHHHhcchHHHHHHHHHHHHHHHHHHHhhheeeccCCCCcceeeecccccccccccccccccccccchhhh Confidence 3 33356665444445566776 5577888999999988764 111100000 0000 000000 Q ss_pred ---------HHHHHHHHHHHHhhcCCC-ccC-CEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEee Q lcl|NC_011288. 140 ---------AFDLIATALKELTKANVP-NVG-RVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVES 208 (273) Q Consensus 140 ---------~~~~i~~a~~~l~~~~vP-~~~-r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s 208 (273) .+..+.++...+.....+ ..+ .+.++++..+..|+.-.-.. + ..+.-...-+.-..+.|.+|+.+ T Consensus 308 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~~~~~l~~~~~~~-~---~~g~~~~~~~~~~~i~G~pvv~s 383 (466) T protein:vir:80 308 LKIDPTGKSAEEFFSELVLKLSKARANYSNGMKFWAMSSNTHAVLMSKAITF-N---SAGALVASLNNTMPIVGGDIVIL 383 (466) T ss_pred hhhhhhccchhhHHHHHHHHHHhhhccccCCceeEEecchhHHHhhcccccc-c---CCccccccCCCcccccccceeec Confidence 011111111111111111 123 34578888887776542110 0 00000000011134889999999 Q ss_pred CccccCCCcEEEEEcCceeEEeee-eeeehhhcCCCce---eeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 209 NNLRDTDDEQFVAFHPSAAAYVSQ-IDTVEALRDQDSF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 209 ~~l~~~~~~~~~~~~~~a~~~a~~-~~~~e~~~~~~~~---~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) +.+|... ++.+.........+ ...++.... .+| .+.+++.+++|+++.+|++++.++-+.- T Consensus 384 ~~~~~~~---~~~g~~~~y~i~~r~~~~i~~~~~-~~f~~d~~~~r~~~r~dg~~~~~~afv~~~~~~~ 448 (466) T protein:vir:80 384 DFIPDND---IIGGYGSLYLLAERADIKLAQSEH-VRFIEDQTVFKGTARYDGKPVFGEGFVAVNIANA 448 (466) T ss_pred CccCccc---eeeeccccEEEEeecceEEEechh-hhhhcCcEEEEEEEEEccEEeccCceEEEEecCC Confidence 9998643 45554444433332 122222222 222 2579999999999999999998854433 No 163 >protein:vir:95875 Length: 401 # NCBI annotation: major coat protein # Family: family:all:10944 # MgeID: mge:1586 # MgeName: N4 # Cross-refs: genbank:acc:YP_950534;genbank:gi:119952248;genbank:GeneID:5075702 Probab=98.68 E-value=3.9e-09 Score=66.66 Aligned_cols=266 Identities=14% Similarity=0.117 Sum_probs=139.3 Q ss_pred CccchhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccc-eeecCCCcccCCC------------- Q lcl|NC_011288. 1 MAFNNFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTV-KDYKAAGRQTSAD------------- 66 (273) Q Consensus 1 MA~~~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~-~~~~~~~~~~~~~------------- 66 (273) |..++- .--|-...+.-..+.++|.++.+. +..--..|+||.+++...+.. ......|-+.... T Consensus 19 ~~~~~~-t~y~~~k~L~~Aa~~lv~~~fA~~-~piPkn~GkTIk~r~y~pl~~~~~pl~eGv~a~G~~~~~g~~y~~~rd 96 (401) T protein:vir:95 19 NSDQMQ-TFFWLKKAIITARKEQYFMPLASV-TNMPKHYGKTIKVYEYVPLLDDRNINDQGIDASGATIVNGNLYGSSKD 96 (401) T ss_pred ccceee-ehhhHHHHHhhhhhhhhhhhcccc-cccccccCCeEEEEecccccccccchhcCCCcccccccCccccccccc Confidence 333321 112555566666667888888753 222235699999988655432 1111111111111 Q ss_pred ---------------------CCccceEEEEEeeeeecceEEch-HHHHhhhHHHHHHH-HHHHHHH-HHHHHHHHHHHH Q lcl|NC_011288. 67 ---------------------AISDTGVDLLIDQEKSIDFLVDD-IDRVQVAGSLEAYT-RAGATAL-ATDTDKFIADLL 122 (273) Q Consensus 67 ---------------------~~~~~~~~~~id~~~~~~~~i~d-~d~~~~~~~~~~~~-~~~~~al-a~~~D~~i~~~~ 122 (273) ...-..+..+|.|+-.| ..++| .+.......+.+.+ ..+...- ....|.....++ T Consensus 97 v~~it~~m~~~t~~~~rvn~v~~~~~d~~g~l~qyG~~-~e~Td~~~dt~~D~~l~~h~s~ell~g~~~~t~d~i~~dll 175 (401) T protein:vir:95 97 IGNITSKLPLLTENGGRVNRVGFTRIAREGSIHKFGFF-YEFTQESIDFDSDDGLMEHLSRELMNGATQITEAVLQKDLL 175 (401) T ss_pred cceeecccccccccccccccccceeeeeeeeeeeccCc-cchhhhhhhhhcchHHHHHHHHHHhhhhhhhHHHHHHHHHH Confidence 11112344456554333 24444 33333333455432 2222211 112222222333 Q ss_pred hhcc------cc--c---ccccCCCHHHHHHHHHHHHHHHhhcCCCc-----------------cCCEEEECH------H Q lcl|NC_011288. 123 VDNG------TA--L---SGSAPTDADDAFDLIATALKELTKANVPN-----------------VGRVVVVNA------E 168 (273) Q Consensus 123 ~~~~------~~--~---~~~~~~t~~~~~~~i~~a~~~l~~~~vP~-----------------~~r~lvv~p------~ 168 (273) .+.. .. . ......+...+++++.++...|+++..|. .-|+++|.| . T Consensus 176 ~ag~~viyAg~ats~At~~~~~~~~t~vt~~~l~rl~~~L~~nRapk~t~~i~~s~~~dTk~i~~s~va~~h~~L~~di~ 255 (401) T protein:vir:95 176 AAAGTVLYAGAATSDATITGEGSTPSVVSYKNLMRLDQILTENRTPTQTTIITGSRMIDTKVIGATRVMYVGSELVPELK 255 (401) T ss_pred hhcCeeecCCccceeeeccccccccceechhHHHHHHHHHHhcccccchhhhhhhhccCccccccceEEEEecCchhHHH Confidence 2221 00 0 11122333457889999999999877765 126789999 4 Q ss_pred HHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCccccCCCcEEEEEcCceeEEeeeeeeehhhcCCCceeee Q lcl|NC_011288. 169 MAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYVSQIDTVEALRDQDSFSDR 248 (273) Q Consensus 169 ~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~~~~~~~~~~~~a~~~a~~~~~~e~~~~~~~~~~~ 248 (273) ...+|+.++.|. ....+++...+.+|+||++.+|.|+.++......+..|-+ .+..-+.|..-.+.--+.+-|..+ T Consensus 256 a~~D~~~~~~fi-~v~kYa~~~~i~~gEiG~i~~vR~i~~p~~~~w~~ag~~a---~~~~~~y~~~~~~~gg~~dVyp~l 331 (401) T protein:vir:95 256 AMKDLFGNKAFI-ETQHYADAGTIMNGEVGSIDKFRIIQVPEMLHWAGAGAQA---TGANPGYRTSMVSGQEHYDVYPML 331 (401) T ss_pred HHHHhcCCCCce-ehhhcCCccccccccccccCceeEEecccceeecCCcccc---cccccccccccccCCCcceeeeee Confidence 445566666654 4556667778899999999999999887764333322211 111112222223344455568899 Q ss_pred EEeeeeeeeEEecCce-----EEEEecCCC Q lcl|NC_011288. 249 IRALHVYGGKVVRPTG-----VVVFNKTGS 273 (273) Q Consensus 249 v~~~~~~g~~v~~~~~-----~v~~~~~~s 273 (273) |-|.+.||.--+...+ =+.++..|= T Consensus 332 V~G~dAf~~~~l~g~g~~~~~~~ivk~pG~ 361 (401) T protein:vir:95 332 VVGDDSFTSIGFQTDGKSLKFTVMTKMPGK 361 (401) T ss_pred EEccccceecccccCCccccceeEeecCCc Confidence 9999999987776554 234444431 No 164 >protein:vir:3158 Length: 321 # NCBI annotation: capsid protein gpE # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665929;genbank:gi:22091115;genbank:GeneID:951342 Probab=98.63 E-value=3.6e-08 Score=61.40 Aligned_cols=262 Identities=13% Similarity=0.113 Sum_probs=132.6 Q ss_pred Ccc-----chhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeec--CCCcccCCCCCccceE Q lcl|NC_011288. 1 MAF-----NNFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYK--AAGRQTSADAISDTGV 73 (273) Q Consensus 1 MA~-----~~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~--~~~~~~~~~~~~~~~~ 73 (273) |.. -..+|.-+++++++.+.+...+.+.++.- .....+..||.++..+..... .++......++..+.+ T Consensus 19 ~~~~~~~~g~~v~~~~~~~l~~~i~e~s~~l~~i~v~----~v~~~~~~i~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~ 94 (321) T protein:vir:31 19 LTVDDLDAGGTLPDPLWDEFWTDMIEETPLLDAIRTE----TVGAKKTRIPTLNIGERHRRPQDEGEWNENESDVSTGTI 94 (321) T ss_pred ccccccCCcceeCHHHHHHHHHHHHHhhhhhhhceee----eccCcceeeeeeccCCcccccccccccccccccceeeee Confidence 211 12344446667777788777777776542 122334455665432211111 1122233345566667 Q ss_pred EEEEeeeeecceEEchHHHHhhh--HHHHH-HHHHHHHHHHHHHHHHHHHHHhh-c--------------ccccccccCC Q lcl|NC_011288. 74 DLLIDQEKSIDFLVDDIDRVQVA--GSLEA-YTRAGATALATDTDKFIADLLVD-N--------------GTALSGSAPT 135 (273) Q Consensus 74 ~~~id~~~~~~~~i~d~d~~~~~--~~~~~-~~~~~~~ala~~~D~~i~~~~~~-~--------------~~~~~~~~~~ 135 (273) ++.+.+. .....|+..-..... .++++ +....+++++..++...+.=-.. . .......... T Consensus 95 ~~~~~k~-~~~~~it~e~L~d~a~~~d~e~~i~~~ia~~~a~~~~~~~~nGd~~~~~~~~~~n~G~l~~a~~~~~~~~~~ 173 (321) T protein:vir:31 95 DISTEKA-TVAWDLPREVVQENPEGEALADRILNLMTDAWSADVEDLAANGDEDAEDSFENQNDGFITVAEGDVETIDAA 173 (321) T ss_pred eeeeEEE-EeehhccHHHHHhhhcchhHHHHHHHHHHHHHHHHHHhheeeccccCCCcccccchhhhhhhcccccccccc Confidence 7777543 344556653332322 35665 55666677887776654421000 0 0000001112 Q ss_pred CHHHHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCccccCC Q lcl|NC_011288. 136 DADDAFDLIATALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTD 215 (273) Q Consensus 136 t~~~~~~~i~~a~~~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~~ 215 (273) +....++.|.++...|+...--..+...+++++.+..++.. +.+.........+..|...++.|++++.++.+|.. T Consensus 174 ~~~~~~d~l~~l~~~l~~~yr~~~~~v~im~~~~~~~~~~~---l~~~~~~~~~~~l~~~~~~tl~G~pvv~~~~mP~~- 249 (321) T protein:vir:31 174 DDILDNDLVIRTIAGLDSKYRARMNPALIVSEDQLLSYHYT---LTDRDTPLGDNVIMGEADVNPFSFPIIGSGLWPDD- 249 (321) T ss_pred ccccCHHHHHHHHHhccHhHhcCCCeEEEechHHHHHHHHH---HhcCCCccccchhhccccccccceeEEEcCCCCCC- Confidence 22334567777777776543211234568999987665431 22222222334455666678999999999999863 Q ss_pred CcEEEEEcCceeEEeeeee-eehhhcCCCc---eeeeEE--eeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 216 DEQFVAFHPSAAAYVSQID-TVEALRDQDS---FSDRIR--ALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 216 ~~~~~~~~~~a~~~a~~~~-~~e~~~~~~~---~~~~v~--~~~~~g~~v~~~~~~v~~~~~~s 273 (273) .+++++..-+.+..+.. .++..++... ....++ .+..+|+.|-++++++.+.--.- T Consensus 250 --~il~t~~~nl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ve~~~a~a~~~~i~~ 311 (321) T protein:vir:31 250 --KAMFTDPQNLIYALYRDLEIDVLTESDKVSERDLHARYFMRGDDDFAIENTEAVVLAEGLGD 311 (321) T ss_pred --cEEEeccccEEEEEeeccEEEEeecCccccccceeeEeeeeeecceeEeccccEEEEecCCc Confidence 35666665555443222 3333222221 112222 33446776777888877763222 No 165 >protein:vir:4159 Length: 315 # NCBI annotation: structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046968;genbank:gi:9630538;genbank:GeneID:1261712 Probab=98.61 E-value=3.3e-08 Score=61.59 Aligned_cols=264 Identities=11% Similarity=0.034 Sum_probs=143.6 Q ss_pred Cccc-----hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcc----cceeecCCCcccCCCCCccc Q lcl|NC_011288. 1 MAFN-----NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAP----TVKDYKAAGRQTSADAISDT 71 (273) Q Consensus 1 MA~~-----~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~----~~~~~~~~~~~~~~~~~~~~ 71 (273) |-.. .+.|+.++. +++.+.+.+.+.++++.- ....+.+..|+..+.. ...+........+-..++.+ T Consensus 19 ~t~~d~~Gg~l~P~~~~~-~i~~~~e~s~~l~~~~vi---~~~~~~~~~i~~~g~~~~~~~g~~~~~~~~~~~~~~~~f~ 94 (315) T protein:vir:41 19 IDVPDLGRGVLSVDRFGE-FVKAVRDSAVIIPEARID---NALKSYEKDISRLSLVLDVGPGRDETGQKLAPPESTAEVK 94 (315) T ss_pred cCCcCCCCceechHHHHH-HHHHHHhhhhhhhhceee---eccccccccccccccCcccccccccccCcCCCCCCccccc Confidence 3222 367998754 777888888888877642 1112334445444321 11111122222222345666 Q ss_pred eEEEEEeeeeecceEEchHHHHhhh--HHHHH-HHHHHHHHHHHHHHHHHHHHHhh---------------ccccc-cc- Q lcl|NC_011288. 72 GVDLLIDQEKSIDFLVDDIDRVQVA--GSLEA-YTRAGATALATDTDKFIADLLVD---------------NGTAL-SG- 131 (273) Q Consensus 72 ~~~~~id~~~~~~~~i~d~d~~~~~--~~~~~-~~~~~~~ala~~~D~~i~~~~~~---------------~~~~~-~~- 131 (273) .+++.+.+. ...+.|++.-..... .+++. +....+++++++.+...+.==.. +.... .. T Consensus 95 ~~~l~~~~l-~~~~~it~elL~D~~~~~~~e~~l~~~~a~~~a~~~~~~~~nGdg~s~~p~~~~~~G~l~~a~~~~~~~~ 173 (315) T protein:vir:41 95 TNTLYMREM-VTKVVIHEDAIEDNIEGKAFEQKIVTLLGEGISYVLEKYYLHGDTSSSDPLLRMSDGWLKLASEKLTESD 173 (315) T ss_pred eeeeceeee-eeeccccHHHHHhhhccccHHHHHHHHHHHHHHHHHHHHhhccCCcCcCccccccccceecccccccccc Confidence 666666543 334566664444443 25766 55677788888776665532000 00000 00 Q ss_pred ccCCCHHHHHHHHHHHHHHHhhcCCCc-cCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCc Q lcl|NC_011288. 132 SAPTDADDAFDLIATALKELTKANVPN-VGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNN 210 (273) Q Consensus 132 ~~~~t~~~~~~~i~~a~~~l~~~~vP~-~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~ 210 (273) ....+.....+.|.++...|...---. .+-..++++..+..+++- ........+...+..|.-..+.|++|+.++. T Consensus 174 ~~~~a~~~~~d~l~~l~~sl~~~yr~~~~~~~~imn~~t~~~~rkl---k~~~g~~lw~~~~~~g~~~tl~G~PV~~~~~ 250 (315) T protein:vir:41 174 VDPEAEDWPMNLFDTMIESLPTPYRNNLPNMKFYVTWDIYRAYRDA---LKGRETGLGDQALTGANSILYDGRPVQYVPA 250 (315) T ss_pred cccccccccHHHHHHHHHhcChHHhhcCCceEEEEcHHHHHHHHHH---hccCCCccccchhhcCCCceecccceEeccc Confidence 011111223455666665554432111 233578999988877552 2222333445566677777899999999998 Q ss_pred cccCC--CcEEEEEcCceeEEeeee-eeehhhcCCCceeeeEEeeeeeeeEEecCceEEEEecCC Q lcl|NC_011288. 211 LRDTD--DEQFVAFHPSAAAYVSQI-DTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTG 272 (273) Q Consensus 211 l~~~~--~~~~~~~~~~a~~~a~~~-~~~e~~~~~~~~~~~v~~~~~~g~~v~~~~~~v~~~~~~ 272 (273) +|... ...++.++..-+.+.... ..++..|+.......++.+++.|+.+.+++++++..-+. T Consensus 251 m~~~~~~~~~ilf~d~~nl~~~~~~~i~i~~~~~a~~~~~~~~~~~r~d~~~~~~~~~a~~~~~v 315 (315) T protein:vir:41 251 LEALNDGKSRALFVVPTQLVYGFWRNIKVVPDYDAEMRLTKYVASLRTDNHYEDEEGAVSATITV 315 (315) T ss_pred ccccCCCCccEEEecccceEEEeccccEEEeeecCCCCceEEEEEEEeceeEEeccceeEeeeeC Confidence 87643 234556666555555433 356666776655677888899999888777655444444 No 166 >protein:vir:9509 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835556;genbank:gi:30043951;genbank:GeneID:1260537 Probab=98.55 E-value=2.6e-08 Score=62.21 Aligned_cols=252 Identities=11% Similarity=0.016 Sum_probs=136.8 Q ss_pred Cccc------hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeecCCCcccC-CCCCccceE Q lcl|NC_011288. 1 MAFN------NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTS-ADAISDTGV 73 (273) Q Consensus 1 MA~~------~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~~~-~~~~~~~~~ 73 (273) |... .++|+-+...+.+.+.+...+.++++.- ...| ...||+....+.......+.... ..+..-+.+ T Consensus 76 ~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~----~~~~-~~~i~~~~~~~~a~w~~e~~~~~~~~~~~f~~i 150 (381) T protein:vir:95 76 INKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIK----NAGL-RLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEE 150 (381) T ss_pred HhcccCCCCceecCHHHHHHHHHHHHhhccceeheeeE----ecCc-ceEEEEecCCcceeeecccccccccccccceee Confidence 2111 4689999999999999999888887542 1224 35677765544333333332222 223445556 Q ss_pred EEEEeeeeecceEEchHHHHhhhHHHHH-HHHHHHHHHHHHHHHHHHH---------HHhhcccccccc----------- Q lcl|NC_011288. 74 DLLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIAD---------LLVDNGTALSGS----------- 132 (273) Q Consensus 74 ~~~id~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~~D~~i~~---------~~~~~~~~~~~~----------- 132 (273) ++...+. +.-+.|+..=..++..+++. +.+..+++++..+|..++. .+.........+ T Consensus 151 ~l~~~kl-~~~~~is~elL~Ds~~~ie~~i~~~la~~~a~~~~~a~i~G~G~~qP~Gil~~~~~~~~~~~g~~~~~~~~~ 229 (381) T protein:vir:95 151 TAIQNKL-TAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQG 229 (381) T ss_pred eecceeE-EeechhhHHHhhcCHHHHHHHHHHHHHHHHHHHhhheeEeccCCCCceeeeeccCccccccccccccccccc Confidence 5555333 33345665433445567877 4577888999999887653 111110000000 Q ss_pred --cCCCHHHHHHHHHHHHHHHhhcC-----CCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeE--ece Q lcl|NC_011288. 133 --APTDADDAFDLIATALKELTKAN-----VPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNL--LGA 203 (273) Q Consensus 133 --~~~t~~~~~~~i~~a~~~l~~~~-----vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~--~G~ 203 (273) +..+....++.+......+.... .+..+.+++++|..+..|+.... ..+ . +|..-.. +|. T Consensus 230 t~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~l~~~~~-~~~--~--------~G~~v~~l~~g~ 298 (381) T protein:vir:95 230 TLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYT-HLN--A--------NGVYVTALPFNL 298 (381) T ss_pred ccccccchhhHHHHHHHHHhhccccccccccccCceEEEEccccHHhhccccc-cCC--C--------CCceeecCCCCc Confidence 01112223444444444443221 23345567899998887754321 111 1 1222122 477 Q ss_pred EEEeeCccccCCCcEEEEEcCceeEEeeee-eeehhhcCCCce---eeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 204 RIVESNNLRDTDDEQFVAFHPSAAAYVSQI-DTVEALRDQDSF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 204 ~v~~s~~l~~~~~~~~~~~~~~a~~~a~~~-~~~e~~~~~~~~---~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) .|+.|+.+|.+. ++.|.-+......+. ..++...+ .+| .+.+++.+++|+++++|++++++.-+.. T Consensus 299 ~vv~s~~~p~~~---iifgDfs~Y~i~~r~~~~i~~~~~-~~~~~d~~~f~a~~r~dg~~~~~~A~~v~~l~~~ 368 (381) T protein:vir:95 299 NVIESTVQEAGK---VLTYVKGLYDGYLAGGINVQKFKE-TLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLK 368 (381) T ss_pred eEEecCCCCcCc---EEEEecccEEEEEecccEEEeech-hHhhcCCeEEEEEEEEcCEEecCceEEEEEEEec Confidence 799999888532 344433333222221 22322222 222 3589999999999999999998666654 No 167 >protein:vir:101291 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908831;genbank:gi:118725095;genbank:GeneID:4555862 Probab=98.55 E-value=2.6e-08 Score=62.21 Aligned_cols=252 Identities=11% Similarity=0.016 Sum_probs=136.8 Q ss_pred Cccc------hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeecCCCcccC-CCCCccceE Q lcl|NC_011288. 1 MAFN------NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTS-ADAISDTGV 73 (273) Q Consensus 1 MA~~------~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~~~-~~~~~~~~~ 73 (273) |... .++|+-+...+.+.+.+...+.++++.- ...| ...||+....+.......+.... ..+..-+.+ T Consensus 76 ~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~----~~~~-~~~i~~~~~~~~a~w~~e~~~~~~~~~~~f~~i 150 (381) T protein:vir:10 76 INKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIK----NAGL-RLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEE 150 (381) T ss_pred HhcccCCCCceecCHHHHHHHHHHHHhhccceeheeeE----ecCc-ceEEEEecCCcceeeecccccccccccccceee Confidence 2111 4689999999999999999888887542 1224 35677765544333333332222 223445556 Q ss_pred EEEEeeeeecceEEchHHHHhhhHHHHH-HHHHHHHHHHHHHHHHHHH---------HHhhcccccccc----------- Q lcl|NC_011288. 74 DLLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIAD---------LLVDNGTALSGS----------- 132 (273) Q Consensus 74 ~~~id~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~~D~~i~~---------~~~~~~~~~~~~----------- 132 (273) ++...+. +.-+.|+..=..++..+++. +.+..+++++..+|..++. .+.........+ T Consensus 151 ~l~~~kl-~~~~~is~elL~Ds~~~ie~~i~~~la~~~a~~~~~a~i~G~G~~qP~Gil~~~~~~~~~~~g~~~~~~~~~ 229 (381) T protein:vir:10 151 TAIQNKL-TAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQG 229 (381) T ss_pred eecceeE-EeechhhHHHhhcCHHHHHHHHHHHHHHHHHHHhhheeEeccCCCCceeeeeccCccccccccccccccccc Confidence 5555333 33345665433445567877 4577888999999887653 111110000000 Q ss_pred --cCCCHHHHHHHHHHHHHHHhhcC-----CCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeE--ece Q lcl|NC_011288. 133 --APTDADDAFDLIATALKELTKAN-----VPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNL--LGA 203 (273) Q Consensus 133 --~~~t~~~~~~~i~~a~~~l~~~~-----vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~--~G~ 203 (273) +..+....++.+......+.... .+..+.+++++|..+..|+.... ..+ . +|..-.. +|. T Consensus 230 t~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~l~~~~~-~~~--~--------~G~~v~~l~~g~ 298 (381) T protein:vir:10 230 TLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYT-HLN--A--------NGVYVTALPFNL 298 (381) T ss_pred ccccccchhhHHHHHHHHHhhccccccccccccCceEEEEccccHHhhccccc-cCC--C--------CCceeecCCCCc Confidence 01112223444444444443221 23345567899998887754321 111 1 1222122 477 Q ss_pred EEEeeCccccCCCcEEEEEcCceeEEeeee-eeehhhcCCCce---eeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 204 RIVESNNLRDTDDEQFVAFHPSAAAYVSQI-DTVEALRDQDSF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 204 ~v~~s~~l~~~~~~~~~~~~~~a~~~a~~~-~~~e~~~~~~~~---~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) .|+.|+.+|.+. ++.|.-+......+. ..++...+ .+| .+.+++.+++|+++++|++++++.-+.. T Consensus 299 ~vv~s~~~p~~~---iifgDfs~Y~i~~r~~~~i~~~~~-~~~~~d~~~f~a~~r~dg~~~~~~A~~v~~l~~~ 368 (381) T protein:vir:10 299 NVIESTVQEAGK---VLTYVKGLYDGYLAGGINVQKFKE-TLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLK 368 (381) T ss_pred eEEecCCCCcCc---EEEEecccEEEEEecccEEEeech-hHhhcCCeEEEEEEEEcCEEecCceEEEEEEEec Confidence 799999888532 344433333222221 22322222 222 3589999999999999999998666654 No 168 >protein:vir:95963 Length: 395 # NCBI annotation: ORF009 # Family: family:all:635 # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239802;genbank:gi:66395459;genbank:GeneID:5132880 Probab=98.50 E-value=9.3e-08 Score=59.14 Aligned_cols=252 Identities=12% Similarity=0.045 Sum_probs=134.2 Q ss_pred Cccc------hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeecCCCcccC-CCCCccceE Q lcl|NC_011288. 1 MAFN------NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTS-ADAISDTGV 73 (273) Q Consensus 1 MA~~------~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~~~-~~~~~~~~~ 73 (273) |... .++|+-+...+++.+++.+++.+++++- ...| ++.||+....+............ ..++.-+.+ T Consensus 86 ~~~~t~~~gG~liP~~~~~~Ii~~l~~~s~i~~~~~v~----~~~~-~~~i~~~~~~~~a~w~~e~~~~~~~~~~~f~~i 160 (395) T protein:vir:95 86 INYDVGYTDEKILPETVVERVFDDLQKDHPLLSKINFQ----NAGI-KTRVIKADPAGQAVWGKVFGEIKGQLDAAFREE 160 (395) T ss_pred HhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeE----ecCC-ceEEEEecCCcceEEeecccccCccccccceee Confidence 1111 3689999999999999999888887542 1224 46788765544443332222222 234555666 Q ss_pred EEEEeeeeecceEEchHHHHhhhHHHHH-HHHHHHHHHHHHHHHHHHHHHhh---ccccc-------cc--c----c-CC Q lcl|NC_011288. 74 DLLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADLLVD---NGTAL-------SG--S----A-PT 135 (273) Q Consensus 74 ~~~id~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~~D~~i~~~~~~---~~~~~-------~~--~----~-~~ 135 (273) ++...+. +.-+.|+..=..++..++++ +.+..+++++.++|..++.=--. .+... .. . + .. T Consensus 161 ~l~~~kl-~~~~~iS~ell~ds~~~ie~~i~~~la~~ia~~~~~a~i~G~G~~~~qP~Gil~~~~~~~~~~~~~~~~~~~ 239 (395) T protein:vir:95 161 NFTQYKL-TCFVVLPDDLSTFGPAWIERFVRTQIQEAISVALESAIINGGGAAKTQPVGLMKDVNTNSGAVTDKASSGTL 239 (395) T ss_pred eeceeeE-EEeecccHHHHhcchhHHHHHHHHHHHHHHHHHHhhheeeccCCCCcCceeeeecccccccccccccccchh Confidence 6666332 34456665444445567776 56778899999999877631100 01000 00 0 0 00 Q ss_pred CHH---HHHHHHHHHHHHHhhc----C-CCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEe--ceEE Q lcl|NC_011288. 136 DAD---DAFDLIATALKELTKA----N-VPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLL--GARI 205 (273) Q Consensus 136 t~~---~~~~~i~~a~~~l~~~----~-vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~--G~~v 205 (273) +.. ..+..+..+...+.-. . ........+++|..+..+.... +.. . ..|...++. |.+| T Consensus 240 t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~mn~~t~~~~~g~~--~~~-~--------~~G~~~~~lg~g~~v 308 (395) T protein:vir:95 240 TFADADTTILELNDVLKNLSVDEKGKELKIDGKVALVVNPRDSWDVQARY--TYL-T--------ANGGFVTVLPYNVTI 308 (395) T ss_pred hhhhhHhhHHHHHHHHHhhccccccchhhhcCceEEEEcchhhhhcCCcc--eec-c--------CCCcceeccCCcceE Confidence 111 1122233322222100 0 1112345678888766554321 111 0 124444554 6678 Q ss_pred EeeCccccCCCcEEEEEcCceeEEeee-eeeehhhcCCCce---eeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 206 VESNNLRDTDDEQFVAFHPSAAAYVSQ-IDTVEALRDQDSF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 206 ~~s~~l~~~~~~~~~~~~~~a~~~a~~-~~~~e~~~~~~~~---~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) +.++.+|... ++.|.-+-.....+ ...++..+ +.+| .+.+++.+++|+++++|+++++|+-+.+ T Consensus 309 ~~~~~~p~~~---i~fgdfs~y~i~~r~~~~i~~~~-~~~~~~d~~~f~~~~r~dg~~~~~~A~~~l~i~~~ 376 (395) T protein:vir:95 309 ITSEFVPEGK---LVAFVTDRYNAVRGGGLTVKKFD-QTLALEDAVLFTAKTFAYGQPDDNKASAVYDLKVA 376 (395) T ss_pred EEcCCCCCCc---EEEEecccEEEEEecceEEEecc-chhhhCCcEEEEEEEEECCEEeccccEEEEEeecc Confidence 9999998542 33443332222221 12232222 2222 3579999999999999999999988866 No 169 >protein:vir:100632 Length: 381 # NCBI annotation: 77ORF006 # Family: family:all:635 # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958606;genbank:gi:41189521;genbank:GeneID:2743778 Probab=98.49 E-value=5.4e-08 Score=60.44 Aligned_cols=253 Identities=11% Similarity=0.044 Sum_probs=131.6 Q ss_pred Cccc--hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeecCCCcccC-CCCCccceEEEEE Q lcl|NC_011288. 1 MAFN--NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTS-ADAISDTGVDLLI 77 (273) Q Consensus 1 MA~~--~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~i 77 (273) ..-. .++|+-+..++.+.+...+.+.++++.- . ..| ...+|+....+............ ..++.-+.+++.. T Consensus 80 t~~~Gg~lvP~~~~~~I~~~l~~~spir~~a~v~--~--~~~-~~~i~~~~~~~~a~W~~e~~~~~~~~~~~f~~i~l~~ 154 (381) T protein:vir:10 80 VGYKEEKLLPEETIDRIFEDLTTNHPLLADLGIK--N--AGL-RLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQ 154 (381) T ss_pred CCCCCceecCHHHHHHHHHHHHhhcceeeeeeeE--e--cCc-ceEEEeecCCcceEEeecccccccccCccceeEeecc Confidence 1111 4689999999999999998888887542 1 123 45677765544333322222221 2234455555555 Q ss_pred eeeeecceEEchHHHHhhhHHHHH-HHHHHHHHHHHHHHHHHHH---------HHhhcccccccccC----------C-- Q lcl|NC_011288. 78 DQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIAD---------LLVDNGTALSGSAP----------T-- 135 (273) Q Consensus 78 d~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~~D~~i~~---------~~~~~~~~~~~~~~----------~-- 135 (273) .+. +.-+.|+..=..++..+++. +....+++++...|..++. .+.+.+.......+ . T Consensus 155 ~kl-~a~i~is~elL~Ds~~~le~~i~~~la~~~a~~~~~afi~GdG~~qP~Gil~~~~~~~~~~~g~~~~~~~~~~~t~ 233 (381) T protein:vir:10 155 NKL-TAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTDGAYPEKEEQGTLTF 233 (381) T ss_pred eeE-EeeccccHHHHhccHHHHHHHHHHHHHHHHHHHhhceeEecccCCCceeeeecCCccccccccccccccccccccc Confidence 333 33356665433444556777 5567888999999887652 11100000000000 0 Q ss_pred -CHHHHHHHHHHHHHHHhhc----C-CCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeE-eceEEEee Q lcl|NC_011288. 136 -DADDAFDLIATALKELTKA----N-VPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNL-LGARIVES 208 (273) Q Consensus 136 -t~~~~~~~i~~a~~~l~~~----~-vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~-~G~~v~~s 208 (273) +....+..+......+... . .+..+.+++++|..+..|..... +.+ . ++. + +..+ +|..|+.+ T Consensus 234 ~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~vmn~~t~~~l~~~~~-~~~--~--~G~-~----v~~lp~g~~vv~~ 303 (381) T protein:vir:10 234 ANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYT-HLN--A--NGV-Y----VTALPFNLNVIES 303 (381) T ss_pred cchhhHHHHHHHHHHhhhhhhccccccccCceEEEEchhhHHhhccccc-cCC--C--CCc-e----eecCCCCceeEEc Confidence 1111122222222222111 1 12345678999998888765332 111 1 111 1 1112 58889999 Q ss_pred CccccCCCcEEEEEcCceeEEeeee-eeehhhcCCCce---eeeEEeeeeeeeEEecCceEEE--EecCCC Q lcl|NC_011288. 209 NNLRDTDDEQFVAFHPSAAAYVSQI-DTVEALRDQDSF---SDRIRALHVYGGKVVRPTGVVV--FNKTGS 273 (273) Q Consensus 209 ~~l~~~~~~~~~~~~~~a~~~a~~~-~~~e~~~~~~~~---~~~v~~~~~~g~~v~~~~~~v~--~~~~~s 273 (273) +.+|... ++.|.-+......+. ..++... +.+| .+.+++.+++|.++++|+++++ |+..++ T Consensus 304 ~~~p~~~---i~fGDfs~Y~i~~r~~~~i~~~~-~~~~~~d~~~f~a~~r~dG~~~~~~A~~v~~l~~~~~ 370 (381) T protein:vir:10 304 TVQEAGK---VLTYVKGLYDGYLAGGINVQKFK-ETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGH 370 (381) T ss_pred CCCCcCc---EEEEEcccEEEEEecccEEEeec-hhhhhcCceEEEEEEEEcCEEecCCcEEEEEEeecCC Confidence 9998532 334433322222221 2232222 2222 3589999999999999999888 444444 No 170 >protein:vir:9643 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795405;genbank:gi:28876178;genbank:GeneID:1257724 Probab=98.49 E-value=6.9e-08 Score=59.83 Aligned_cols=253 Identities=14% Similarity=0.103 Sum_probs=137.6 Q ss_pred Cccc---hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeecCCCcccC-CCCCccceEEEE Q lcl|NC_011288. 1 MAFN---NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTS-ADAISDTGVDLL 76 (273) Q Consensus 1 MA~~---~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~ 76 (273) -... .++|+-|..++.+.+.+...+.+++++- . . +....||+....+.......+.... ..++.-+.+++. T Consensus 82 ~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~--~--~-~~~~~i~~~~~~~~a~wv~e~~~~~~~~~~~f~~i~l~ 156 (377) T protein:vir:96 82 VGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFK--N--T-SLRLKALTAETSGTAVWGDIFGEIKGQLKQAFKEQDFS 156 (377) T ss_pred CCCCCCceecCHHHHHHHHHHHHhhhhhhhhceeE--e--c-CCceEEEEecCCcceeEeecccccccccCccceeEeee Confidence 1111 4789999999999999988888887642 1 1 2346777765444333333332222 224455555555 Q ss_pred EeeeeecceEEchHHHHhhhHHHHH-HHHHHHHHHHHHHHHHHHH---------HHhhcccc-c---------------- Q lcl|NC_011288. 77 IDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIAD---------LLVDNGTA-L---------------- 129 (273) Q Consensus 77 id~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~~D~~i~~---------~~~~~~~~-~---------------- 129 (273) ..+. +.-+.|+..=..++..+++. +.+..+++++..+|..++. .+...... . T Consensus 157 ~~kl-~~~~~is~~ll~ds~~~le~~i~~~l~~~~~~~~~~a~i~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~ 235 (377) T protein:vir:96 157 QFKL-TAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLLQPVGLLKDLSQPTVDQSTGRDITTYKTDKE 235 (377) T ss_pred eeeE-EeechhhHHHhhcchhhHHHHHHHHHHHHHHHHHhhceEeccCCCcceeeeeccccccccccccccccceeeccc Confidence 5332 23345555433445566877 5577888999999988763 11110000 0 Q ss_pred --ccccCCCHHHHHHHHHHHHHHHhhcC--CC---ccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEe- Q lcl|NC_011288. 130 --SGSAPTDADDAFDLIATALKELTKAN--VP---NVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLL- 201 (273) Q Consensus 130 --~~~~~~t~~~~~~~i~~a~~~l~~~~--vP---~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~- 201 (273) ...+..++....+.+..+...+..++ -| ..+-+.+++|..+..+..... +.+ .+|.-..+. T Consensus 236 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~-~~~----------~~G~~~~~l~ 304 (377) T protein:vir:96 236 AIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAKFT-SRN----------QFGEYVTVLP 304 (377) T ss_pred cccccccCChhHHHHHHHHHHHhhccccccccccccCceEEEEchhhHHhcccccc-ccC----------CCCCceeccC Confidence 00011223334444445555554332 12 123467899988776643211 111 123333454 Q ss_pred -ceEEEeeCccccCCCcEEEEEcCceeEEeee-eeeehhhcCC--CceeeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 202 -GARIVESNNLRDTDDEQFVAFHPSAAAYVSQ-IDTVEALRDQ--DSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 202 -G~~v~~s~~l~~~~~~~~~~~~~~a~~~a~~-~~~~e~~~~~--~~~~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) |+.++.|+.+|... ++.+..+......+ -..++..++- .+-.+.+++.+++|.++++|+++++|.-++- T Consensus 305 ~p~~v~~s~~~p~~~---i~fgdf~~Y~i~~r~~~~i~~~~~~~~~~d~~~f~~~~r~dG~~~d~~a~~vl~l~~~ 377 (377) T protein:vir:96 305 HGITILESLAVETGK---AIAFVANRYDAFMATASTIEEYDQTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) T ss_pred CCceEEecCCCCccc---EEEEEcCcEEEEEecccEEEeehhhhhhcCCeEEEEEEEEcCEEecCCcEEEEEEecC Confidence 45678888888532 33343333332222 2223322221 1224579999999999999999999988888 No 171 >protein:vir:93696 Length: 364 # NCBI annotation: Bcep22gp55 # Family: family:all:974 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944284;genbank:gi:38640361;genbank:GeneID:2658350 Probab=98.44 E-value=1.5e-07 Score=57.95 Aligned_cols=268 Identities=16% Similarity=0.149 Sum_probs=144.1 Q ss_pred Cccchh------hHHHHHHHHHHHHHHhhccch-hhcccc--------cccccCCceEEEeecCcccceeecCCCc--cc Q lcl|NC_011288. 1 MAFNNF------IPELWSDMLLEEWTAQTVFAN-LVNREY--------EGTASKGNVVHIAGVVAPTVKDYKAAGR--QT 63 (273) Q Consensus 1 MA~~~~------~pev~~~~~~~~~~~~lv~~~-~v~~~~--------~~~~~~Gdtv~ip~~~~~~~~~~~~~~~--~~ 63 (273) ||.+++ ...+|++.+...-.+.+.|.+ ++-++. +++-..||+|+|+....++..- +.++. .. T Consensus 1 Ma~T~~~~~~p~a~~~ws~~l~~~~~~~s~f~~~l~G~~~~~~I~~~~dL~k~~Gd~v~f~L~~~L~g~g-v~Gd~~leG 79 (364) T protein:vir:93 1 MSQTVIPFGDPKAVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDRITFDLSVHLRGKP-TYGDARVEG 79 (364) T ss_pred CceeccCcCCHHHHHHHHHHHHHHHHhhCccccccccCCCCCcEEEeeecCCCCCceEEeeeeeecccCC-cccCceeec Confidence 998853 356799888887766665554 554332 3334569999998876664222 22222 23 Q ss_pred CCCCCccceEEEEEeeeeecceEEc-hHHHHhhhHHHHHHHH-HHHHHHHHHHHHHHHHHHhhccc-------------- Q lcl|NC_011288. 64 SADAISDTGVDLLIDQEKSIDFLVD-DIDRVQVAGSLEAYTR-AGATALATDTDKFIADLLVDNGT-------------- 127 (273) Q Consensus 64 ~~~~~~~~~~~~~id~~~~~~~~i~-d~d~~~~~~~~~~~~~-~~~~ala~~~D~~i~~~~~~~~~-------------- 127 (273) ..+.+.-.+.+++||+.+. ++... ..++.-...++++..+ .+..=+++..|+.++-.+..+.. T Consensus 80 nee~L~~~~~~i~idq~r~-~V~~~g~ms~qRt~~dlr~~ar~~L~~w~~~~~d~~~f~~laGarg~~~~~~~~~~~~~~ 158 (364) T protein:vir:93 80 KEESLRFYQDEVRIDQVRH-SVSAGGRMSRKRTVHNIRRIARDRLGDYFYKFTDELLFIYLSGARGINLDFIETPDFTGY 158 (364) T ss_pred cccceeEEeeEEEEeeccc-cccccCchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccCcccc Confidence 3456788889999988644 44333 3344445566666544 44455777778877755543110 Q ss_pred ccc-----------------cc--cCCCHHHHHHHHHHHHHHHhhcCCC--------------ccCCEEEECHHHHHHHh Q lcl|NC_011288. 128 ALS-----------------GS--APTDADDAFDLIATALKELTKANVP--------------NVGRVVVVNAEMAFWLR 174 (273) Q Consensus 128 ~~~-----------------~~--~~~t~~~~~~~i~~a~~~l~~~~vP--------------~~~r~lvv~p~~~~~L~ 174 (273) ..+ .. ...+....++.|.++...++..+.+ ++-.++++.|.++..|+ T Consensus 159 ~~N~v~aPt~~r~~~~~~at~~~~l~stD~~sl~~id~a~~~a~~~~~~~~~~~~~~Pv~~~g~~~yV~~l~p~q~~~Lr 238 (364) T protein:vir:93 159 AGNPLDAPDVDHLLYGGVATSKASLAATDIMAPLVIEKAVEKAAMMQAENPDVANMVPVSIDGDDHYVCVMSEYQATDMR 238 (364) T ss_pred cccccCCCCCCcEEeccccCchhhccccccccHHHHHHHHHHHHHhCCCCCCCcccceeEecCcceeEEEEcchhhhhhh Confidence 000 00 0111224678899999888766431 12236899999999998 Q ss_pred hhH--HH--H-hhh-hcccccceeeeeeeeeEeceEEEeeCccccCCCcE----------EEEEcCce--eEEee----e Q lcl|NC_011288. 175 SSG--SK--L-TSA-DTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQ----------FVAFHPSA--AAYVS----Q 232 (273) Q Consensus 175 ~~~--~~--~-~~~-~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~~~~~----------~~~~~~~a--~~~a~----~ 232 (273) .+. ++ + +++ ...+..++|.+|.+|+|.|+-+++..+++...... .+.| ..| +++++ + T Consensus 239 ~~t~~~w~d~qk~A~~~~g~~nPlF~G~~gm~ngvii~~~~~vi~~~~~~~~~~v~~~ralllG-aQA~~~a~g~~~g~~ 317 (364) T protein:vir:93 239 TAAGGTWIDFQKAAAAAEGRNNPIFKGGLGMINNVVLHKHRNVIRFNDYGAGANVEAARALFMG-RQAGVIAYGTANGLR 317 (364) T ss_pred hcCCHHHHHHHHHhhhcccccCCceecCeeeEcCeEEeccCCcccccccccCccccchhhheec-ceeeEEEeecCCCCC Confidence 533 22 2 221 23445578999999999999999988775322111 1111 122 23332 1 Q ss_pred eeeehhhcC-CCceeeeEEeeeeeeeEEec----CceEEEEecCCC Q lcl|NC_011288. 233 IDTVEALRD-QDSFSDRIRALHVYGGKVVR----PTGVVVFNKTGS 273 (273) Q Consensus 233 ~~~~e~~~~-~~~~~~~v~~~~~~g~~v~~----~~~~v~~~~~~s 273 (273) ..-.|...+ ++.. .|......|.+=.| .=|+++|-..+- T Consensus 318 ~~w~Ee~~D~gn~~--~i~~~~i~G~kK~rF~~~DfGvi~idtaa~ 361 (364) T protein:vir:93 318 FDWEETVKDYGNEP--AIAAGFIAGMKKARFNNKDFGVISIDTAAK 361 (364) T ss_pred ceeeecccCCCCch--hhhhhhHhhhhhcccCCccceEEEeccccc Confidence 111121111 1111 13333333332222 114554433333 No 172 >protein:vir:2770 Length: 318 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:59 # MgeName: Stx2 converting bacteriophage I # Cross-refs: genbank:acc:NP_612887;genbank:gi:20065804;genbank:GeneID:935710 Probab=98.35 E-value=5.4e-07 Score=54.96 Aligned_cols=224 Identities=11% Similarity=0.003 Sum_probs=123.8 Q ss_pred CccchhhHHHHHHHHHHHHHHhhccchhhccc--------ccccccCCceEEEeecCcccceeecCCCc--ccCCCCCcc Q lcl|NC_011288. 1 MAFNNFIPELWSDMLLEEWTAQTVFANLVNRE--------YEGTASKGNVVHIAGVVAPTVKDYKAAGR--QTSADAISD 70 (273) Q Consensus 1 MA~~~~~pev~~~~~~~~~~~~lv~~~~v~~~--------~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~--~~~~~~~~~ 70 (273) ++++.. -.+|++.+...-.+...+..+..++ .+++...||+|+|+....++..- +.++. ....+.+.. T Consensus 22 ~~~~~~-vk~ws~~l~~~~~~~~~~~~~~g~~~~~~I~r~~dL~K~~GD~Vtf~L~~~L~g~g-v~Gd~~lEGnee~L~~ 99 (318) T protein:vir:27 22 NRNRSM-VNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLSKRP-TMGDERVEGRGEDLSH 99 (318) T ss_pred hcCChH-HHHHHHhhhhHHHhhhhhhcccCCCCCceEEEeccCCCCCccEEEEeEeeccccCc-cccCceeeccccceEE Confidence 555543 3578887655544444443333222 23444579999998876554221 11111 223456777 Q ss_pred ceEEEEEeeeeecceEEc-hHHHHhhhHHHHHHHH-HHHHHHHHHHHHHHHHHHhhccc--------------------- Q lcl|NC_011288. 71 TGVDLLIDQEKSIDFLVD-DIDRVQVAGSLEAYTR-AGATALATDTDKFIADLLVDNGT--------------------- 127 (273) Q Consensus 71 ~~~~~~id~~~~~~~~i~-d~d~~~~~~~~~~~~~-~~~~ala~~~D~~i~~~~~~~~~--------------------- 127 (273) .+..++||+.+ +++... ..+..-+..++++..+ .+..-+++..|+-++--+..... T Consensus 100 ~~d~l~IDq~r-~~V~~gg~msqqRt~~dlR~~ar~~L~~w~~~~~Dq~~~v~laGarg~~~n~~~~~p~~~~~~~~~~~ 178 (318) T protein:vir:27 100 ADFSLKINQGR-HLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFVADDTILPTAEHPEFKKIM 178 (318) T ss_pred EeeEEEEeeec-cccccccchhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccceEecccCccchhhh Confidence 88889998764 344333 2333444566666544 44455778888777654432110 Q ss_pred --cccc----------cc------CCCHHHHHHHHHHHHHHHhhcCCCc-----c--C-------CEEEECHHHHHHHhh Q lcl|NC_011288. 128 --ALSG----------SA------PTDADDAFDLIATALKELTKANVPN-----V--G-------RVVVVNAEMAFWLRS 175 (273) Q Consensus 128 --~~~~----------~~------~~t~~~~~~~i~~a~~~l~~~~vP~-----~--~-------r~lvv~p~~~~~L~~ 175 (273) .+.. .+ ..+....++.|.++...+++..-|- + . ++++++|.++..|+. T Consensus 179 ~N~v~aPt~~r~~~~g~at~~~~l~stD~~s~~lid~~~~~~~~~a~pi~PV~v~g~~~~~~~~~yV~~~~p~q~~~Lrt 258 (318) T protein:vir:27 179 INDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGEDPYYVLYVTPRQWNDWYT 258 (318) T ss_pred hcccCCCCCCcEEeccCccchhhhhhcccccHHHHHHHHHHHHHhCCCCcceeeccccccCCcceEEEEechHHHHHHhh Confidence 0000 00 0111224666778888887744331 1 1 678999999999998 Q ss_pred hHH------HHhhhhcc--cccceeeeeeeeeEeceEEEeeCccccCCCcEEEEEcCceeEEeeeee Q lcl|NC_011288. 176 SGS------KLTSADTS--GDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYVSQID 234 (273) Q Consensus 176 ~~~------~~~~~~~~--~~~~~l~~G~ig~~~G~~v~~s~~l~~~~~~~~~~~~~~a~~~a~~~~ 234 (273) +.. +..++... +..++|..|.+|+|.|+=+++..++|-- +.+|. ...+..+. T Consensus 259 dt~~~~w~d~q~~A~~r~~g~knPLF~G~~gm~ngvil~~~~~vpIr----f~~G~---~v~~~~~~ 318 (318) T protein:vir:27 259 STSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIR----FYQGQ---RFWYQRIT 318 (318) T ss_pred cCCCHHHHHHHHHHHhcccccCCCceecceeeecCEEEeecCCccEE----EcCCC---eeeeeecC Confidence 752 22333333 4567899999999999999999876521 11111 11111111 No 173 >protein:vir:78350 Length: 383 # NCBI annotation: Cps # Family: family:all:635 # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468644;genbank:gi:157325222;genbank:GeneID:5601696 Probab=98.30 E-value=3.2e-07 Score=56.20 Aligned_cols=252 Identities=12% Similarity=0.030 Sum_probs=130.1 Q ss_pred Cccc------hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeecCCCccc-CCCCCccceE Q lcl|NC_011288. 1 MAFN------NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQT-SADAISDTGV 73 (273) Q Consensus 1 MA~~------~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~~-~~~~~~~~~~ 73 (273) |... .++|+-|...+.+.+.+...+.++++.- ...|. ..||+....+.......+... ...+..-+.+ T Consensus 83 ~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~l~~~~~v~----~~~~~-~~i~~~~~~~~a~w~~e~~~~~~~~~~~f~~i 157 (383) T protein:vir:78 83 INKEVGYKEETLLPQTVVDEIFEDLTTEHPFLASIGMR----TTGLR-TKFLKSETSGVAVWGKIFGEIKGQLDATFSDE 157 (383) T ss_pred HhccCCCCCccccCHHHHHHHHHHHHhhccceeeeeeE----ecCCc-eEEEEEcCCcceEEeecccccccccCcceeeE Confidence 2222 4689999999999999998888877542 12354 578887665544444333322 2234555666 Q ss_pred EEEEeeeeecceEEchHHHHhhhHHHHH-HHHHHHHHHHHHHHHHHHH---------HHhhccccccccc------CCCH Q lcl|NC_011288. 74 DLLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIAD---------LLVDNGTALSGSA------PTDA 137 (273) Q Consensus 74 ~~~id~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~~D~~i~~---------~~~~~~~~~~~~~------~~t~ 137 (273) ++...+. +.-+.|+..=..++..+++. +.+..+++++..+|..++. .+.+.+....... ..+. T Consensus 158 ~l~~~kl-~~~i~is~ell~Ds~~~ie~~i~~~l~~~~a~~~~~a~i~G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~~~ 236 (383) T protein:vir:78 158 ESIQNKL-TAFVVVPKDLEKFGPAWVKRFVVTQIEEAFAVALESAYIVGDGNDKPIGLNRKVGKGSTVVDGVYAEKAATG 236 (383) T ss_pred eecceee-EeeccchHHHhhccHHHHHHHHHHHHHHHHHHHHhhheEeccCCCCceeeeeccCCcccccccccccccccc Confidence 6666443 34456665434445567776 5577888999999988762 1111000000000 0011 Q ss_pred HHHHHHHHHHHHHH---hhc------C--CCc-cCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEe--ce Q lcl|NC_011288. 138 DDAFDLIATALKEL---TKA------N--VPN-VGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLL--GA 203 (273) Q Consensus 138 ~~~~~~i~~a~~~l---~~~------~--vP~-~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~--G~ 203 (273) .....++......+ .++ + ... .....+++|..+..+..... .. + .+|....+. |. T Consensus 237 ~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~-~~--~--------~~G~~~t~l~~~~ 305 (383) T protein:vir:78 237 TLTFANPKTTVNELTDVYKYHSVKENGHPLNVAGKVTLLVNPTDAWDVKKQYT-SL--N--------ANGVYVTALPFNL 305 (383) T ss_pred hhhhhhhHHHHHHHHHHHhccchhcccchhhhcCceEEEEcCcchhhhccchh-cc--C--------CCCceeeecCCCc Confidence 11111211111111 111 0 001 12245677765443322110 00 0 123333444 55 Q ss_pred EEEeeCccccCCCcEEEEEcCceeEEeee-eeeehhhcCCCce---eeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 204 RIVESNNLRDTDDEQFVAFHPSAAAYVSQ-IDTVEALRDQDSF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 204 ~v~~s~~l~~~~~~~~~~~~~~a~~~a~~-~~~~e~~~~~~~~---~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) .|++|+.+|... ++.+..+......+ ...++... +.+| .+.+++.+++|.++++|+++++|.-+-. T Consensus 306 ~iv~s~~~p~~~---iifgdfs~Y~i~~r~~~~i~~~~-~~~f~~d~~~f~~~~r~dG~~~~~~A~~vl~~~~~ 375 (383) T protein:vir:78 306 NIIESLFVPEKK---AISYVAERYDALIGGPLDIGTYD-QTLAIEDLNLYAAKQFAYGKAKDDKAAAVWTLNIN 375 (383) T ss_pred eEEecCCCCccc---EEEeeccceEEEecccceEEecc-hhhhhcCceEEEEEEEEcCEEecCCeEEEEEEEec Confidence 688898888532 34444333333322 22333222 2223 3689999999999999999998765544 No 174 >protein:vir:95131 Length: 325 # NCBI annotation: hypothetical protein ORF010 # Family: family:all:47 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293417;genbank:gi:148912838;genbank:GeneID:5228206 Probab=98.27 E-value=7.6e-07 Score=54.13 Aligned_cols=259 Identities=12% Similarity=0.079 Sum_probs=130.4 Q ss_pred Cccch---hhHHHHHHHHHHHHHHhh-ccchhhccc---ccccccCCceEEEeecCcccc-----eeecCCCcccCCCCC Q lcl|NC_011288. 1 MAFNN---FIPELWSDMLLEEWTAQT-VFANLVNRE---YEGTASKGNVVHIAGVVAPTV-----KDYKAAGRQTSADAI 68 (273) Q Consensus 1 MA~~~---~~pev~~~~~~~~~~~~l-v~~~~v~~~---~~~~~~~Gdtv~ip~~~~~~~-----~~~~~~~~~~~~~~~ 68 (273) ||... |-|+++...+.. +.+.+ +|.. .... .......||.+..|-+..+.. .++. ....+++..+ T Consensus 1 m~lsD~~vfN~~~~~a~~e~-~~q~~~~fn~-as~gai~l~~~~~~Gd~~~~pf~~~l~g~~~~~~~~~-~~~~vt~~ki 77 (325) T protein:vir:95 1 MALSDLAVYSEYAYSAFSET-LRQQVDLFNT-ATGGAIMLQSAAHQGDFSDVAFFAKVTGGLVRRRNAY-GSGTVAEKVL 77 (325) T ss_pred Cchhhhhhhhhhhhhhhhhh-hhhhHhhhhh-cccceeEeccccccCceeeccccccccccccccccCC-CCceecccee Confidence 88884 446666554443 33332 2221 1110 111234599999999986532 2232 2334555555 Q ss_pred ccceEEEEEeeeeecceEEchHHHHh-hhHHHHHHHHHHHHHHHHHHHHHHHHHH----hhcccc-----cccccCCCH- Q lcl|NC_011288. 69 SDTGVDLLIDQEKSIDFLVDDIDRVQ-VAGSLEAYTRAGATALATDTDKFIADLL----VDNGTA-----LSGSAPTDA- 137 (273) Q Consensus 69 ~~~~~~~~id~~~~~~~~i~d~d~~~-~~~~~~~~~~~~~~ala~~~D~~i~~~~----~~~~~~-----~~~~~~~t~- 137 (273) +..+. +.+...+..++...|+.... ...++..+.++.+..+++...+++++.+ .++-.. ....+..+. T Consensus 78 tt~~~-~av~~~r~~g~~~~d~~~~~~g~~~~~~~~~~Ig~~~a~~~~~~~l~~~~~~l~~a~~~~~~~v~dis~~~~~~ 156 (325) T protein:vir:95 78 KHLVD-TSVKVAAGTPPVRLDPGQFRWIQQNPEVAGAAMGQQLAVDTMADMLNVGLGSVYSALSQVSDVVYDATANTDAA 156 (325) T ss_pred ccccc-eeeEEecccCcccccHHHHhhcCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccceeeeecccCcc Confidence 54433 22224455555555554433 2345676666666666665555544333 211111 111111221 Q ss_pred --HHHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCccccCC Q lcl|NC_011288. 138 --DDAFDLIATALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTD 215 (273) Q Consensus 138 --~~~~~~i~~a~~~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~~ 215 (273) ....+.|.+|+++|.++. ..=..+++++..|..|.+.. +.+.-..-..... -.|+.++|-.|+.++.+|..+ T Consensus 157 ~~~~s~~~l~~A~~klGD~~--~~l~~~~MHS~v~~~L~~~~--L~~~~~~~~~~g~--~~i~t~~G~~VIVdD~~p~~~ 230 (325) T protein:vir:95 157 DKLPTWNNLNNGQAKFGDQS--SQIAAWIMHSTPMHKLYGSN--LTNGERLFTYGTV--NVVRDPFGKLLVMTDSPNLFA 230 (325) T ss_pred cccccHHHHHHHHHHhcccc--cceeEEEEchHHHHHHHHhh--ccccccccccCCc--ccccccCCcEEEEeCCCCCCC Confidence 124578999999998875 22246789999999998742 3221111111111 136778999999999988643 Q ss_pred -----CcEEEEEcCceeEEeeeee----eehhhcCCCceeeeEEeeeeeeeEEecCceEEEEec------CCC Q lcl|NC_011288. 216 -----DEQFVAFHPSAAAYVSQID----TVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNK------TGS 273 (273) Q Consensus 216 -----~~~~~~~~~~a~~~a~~~~----~~e~~~~~~~~~~~v~~~~~~g~~v~~~~~~v~~~~------~~s 273 (273) .+.++...++|+++...-+ ..|..+.+ ..+..++.+. ++++-|-|+---++ +.+ T Consensus 231 ~g~~~~ytty~lg~GAi~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~---tf~lhp~G~sw~~s~~g~sPt~a 299 (325) T protein:vir:95 231 AGTPNVYHILGLVPGGVLIGQNNDFDANEETKNGDE-NIIRTYQAEW---SYNIGVKGFAWDKANGGKSPTDA 299 (325) T ss_pred ccCceeEEEEEEecCeEEecCCCCccccccccCccc-ceeeeeeeee---eEEeecceeeeecccccCCcChH Confidence 3445677788888765432 12222222 2222222221 23444444443222 211 No 175 >protein:vir:3969 Length: 287 # NCBI annotation: major capsid protein # Family: family:all:3269 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:NP_663677;genbank:gi:21716114;genbank:GeneID:951200 Probab=98.26 E-value=2.3e-07 Score=57.02 Aligned_cols=267 Identities=17% Similarity=0.200 Sum_probs=155.0 Q ss_pred CccchhhHHHHHHHHHHHHHHhhccchhhccccc-ccc-cCCceEEEeecCc--ccceeecCC--CcccCC----CCCcc Q lcl|NC_011288. 1 MAFNNFIPELWSDMLLEEWTAQTVFANLVNREYE-GTA-SKGNVVHIAGVVA--PTVKDYKAA--GRQTSA----DAISD 70 (273) Q Consensus 1 MA~~~~~pev~~~~~~~~~~~~lv~~~~v~~~~~-~~~-~~Gdtv~ip~~~~--~~~~~~~~~--~~~~~~----~~~~~ 70 (273) ||.-.+.+| |.+.|...|.....|.+.+--..+ .++ .+.+|.---+... ..+++|... .+..+. +.... T Consensus 1 ~avr~y~Kq-~~glL~~vf~~qa~F~~~FGg~lQ~~DGV~~N~taf~vKtsD~pVVi~~Y~Td~Nv~FGtGTg~ssRFG~ 79 (287) T protein:vir:39 1 MAIKYFTKQ-YAGMLPDLFAKKSAFLRAFGGVLQVKDGVTENDTFMELKVSDTDVVIQAYSTDANVGFGSGTGNTSRFGQ 79 (287) T ss_pred CCcccccHH-HHHHHHHHHHHHHhhhhhcccceeeecCCcccceEEEEEecCcceEEecccCCCCcccccCCCccccccc Confidence 999976655 889999999999988776532111 111 2344432222221 123344311 111111 11111 Q ss_pred ceEEEEEeee-e-ecceEEc-hHHHHhhhHHHH----HHHHHHHHHHHHHHHHHHHHHHhhcccccccccCCCHHHHHHH Q lcl|NC_011288. 71 TGVDLLIDQE-K-SIDFLVD-DIDRVQVAGSLE----AYTRAGATALATDTDKFIADLLVDNGTALSGSAPTDADDAFDL 143 (273) Q Consensus 71 ~~~~~~id~~-~-~~~~~i~-d~d~~~~~~~~~----~~~~~~~~ala~~~D~~i~~~~~~~~~~~~~~~~~t~~~~~~~ 143 (273) -+.-+..|.+ . .++..|. .+|....+++++ ..+..++++-++.+|..+-..+...+...-.. ..+.+..... T Consensus 80 rkEi~y~dt~V~Y~~~~~ihEGiD~~TVNnd~~aaVAdRL~Lqa~A~t~~~n~~~Gk~ls~~A~~t~~~-~~t~d~V~~L 158 (287) T protein:vir:39 80 RKEVKSVNKQVSYDAPLAINEGIDDFTVNDIKDQVVAERLALHGVAWAQHVDKLLGKLLSDSASETLTV-KLDEDSVTKL 158 (287) T ss_pred eeEEEEecccccceeccccccccccccccCChhHHHHHHHHhHHHHHHHHHHHHHHHHHHhhcchheee-eecccchHHH Confidence 1111122211 1 1222222 345544444443 34566777778888888777776655443322 3666777788 Q ss_pred HHHHHHHHhhcCCCccC-CEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCccccCCCcEEEEE Q lcl|NC_011288. 144 IATALKELTKANVPNVG-RVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAF 222 (273) Q Consensus 144 i~~a~~~l~~~~vP~~~-r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~~~~~~~~~ 222 (273) |.++.+.+-++++..-. -..+|+|+.|..|...+ +.......+.+ +-+--|-++-||.+.+.+.-.--.+. ...+ T Consensus 159 F~~a~~~yvNn~v~~~~~~~AyV~aevYnaiiD~~--l~TsaK~SsaN-iDen~i~kFkGf~l~e~P~~~~q~g~-~a~f 234 (287) T protein:vir:39 159 FSDAHKKFVNNNVSIAVPWVAYVNADIYDLLIDSK--LATTAKNSSAN-VDEQTLYKFKGFILSELPDEKFQLNE-GAYF 234 (287) T ss_pred HHHHHHHhhccceeeEEEEEEEEChhHHhHHhccc--cccccccceee-eccCCcceecceEEEecchHhhccCc-EEEE Confidence 89999999888875433 56889999999998765 22222222222 33333778999999887532222222 2333 Q ss_pred cCceeEEee-eeeeehhhcCCCceeeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 223 HPSAAAYVS-QIDTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 223 ~~~a~~~a~-~~~~~e~~~~~~~~~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) .++.++.+- .+........++.-|..+.|---||-.+++..+..++|++.. T Consensus 235 s~dnig~af~GI~vaR~i~sEdF~GvalQgAgK~G~~i~e~Nk~Ai~k~t~~ 286 (287) T protein:vir:39 235 AADNVGVAGVGIQVTRAMDSEDFAGTALQAAAKYGKYLPEKNKKAILKATVT 286 (287) T ss_pred ccccceeecccceeEEeeecccccceeeecccccccccccccceEEEEEecC Confidence 444444332 333445567788889999999999999999999999999888 No 176 >protein:vir:98635 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039923;genbank:gi:126011098;genbank:GeneID:4818471 Probab=98.16 E-value=9.2e-07 Score=53.67 Aligned_cols=252 Identities=13% Similarity=0.053 Sum_probs=127.4 Q ss_pred Cccc------hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeecCCCcccC-CCCCccceE Q lcl|NC_011288. 1 MAFN------NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTS-ADAISDTGV 73 (273) Q Consensus 1 MA~~------~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~~~-~~~~~~~~~ 73 (273) |... .++|+-+..++.+.+.....+..+++.- ...|. +++|+....+.......+.... ..+..-+.+ T Consensus 79 ~~~~~~~~gg~~vP~~~~~~I~~~l~~~s~i~~~~~v~----~~~~~-~~~~~~~~~~~a~w~~e~~~~~~~~~~~f~~i 153 (377) T protein:vir:98 79 DKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFK----NTSLR-LKALTAETSGTAVWGDIFGEIKGQLKQAFKEQ 153 (377) T ss_pred HhccCCCCCccccCHHHHHHHHHHHHHhhhhhhheeeE----ecCcc-eEEEEecCCcceeEeecccccCcccCccceeE Confidence 2222 3689999999999999888887777542 12344 5777754433333322222222 223344555 Q ss_pred EEEEeeeeecceEEchHHHHhhhHHHHH-HHHHHHHHHHHHHHHHHHH---------HHhhcccccc----cccCCCH-- Q lcl|NC_011288. 74 DLLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIAD---------LLVDNGTALS----GSAPTDA-- 137 (273) Q Consensus 74 ~~~id~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~~D~~i~~---------~~~~~~~~~~----~~~~~t~-- 137 (273) ++...+. +.-+.|+..=..++..++++ +.+..+++++..+|..++. .+...+.... .....+. T Consensus 154 ~l~~~kl-~a~~~is~elL~ds~~~ie~~i~~~la~~~a~~~~~a~i~G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~~~ 232 (377) T protein:vir:98 154 DFSQFKL-TAFVVIPKDALKFGPKWIKQFITEQLKEAIAVALELAIVKGDGLLQPVGLLKDLSQPTVDQSTGRDITTYKT 232 (377) T ss_pred eecceeE-EeeecccHHhhhccHhHHHHHHHHHHHHHHHHHHhhceEeccCCCcceeeeecccccccccccccccccccc Confidence 5555332 23345555333345566777 5577888999999988763 1111000000 0000010 Q ss_pred --HHH--------------HHHHHHHHH--HHhhcCCCccCC-EEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeee Q lcl|NC_011288. 138 --DDA--------------FDLIATALK--ELTKANVPNVGR-VVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIG 198 (273) Q Consensus 138 --~~~--------------~~~i~~a~~--~l~~~~vP~~~r-~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig 198 (273) ... ..++..... .+++.+-. .|+ +++++|..+..+..... . ...+|... T Consensus 233 ~~~~~~~l~~~~~~~~~~~a~~~m~~~t~~~~~klkd~-~G~~i~~~n~~~~~~~~p~~~-------~----~~~~G~~~ 300 (377) T protein:vir:98 233 DKEAIADLSDLTPDNAPKKLVPVMKHLSVNDKKRPLKI-AGQVKLILNPEDRWALEAQFT-------S----RNQFGEYV 300 (377) T ss_pred hhhhHhhhhhhchhHHHHHHHHHHHHHHHHHHhhhhcc-CCceEEEecccchhhcccccc-------c----cCCCCccc Confidence 000 011111111 11111112 344 45678776544432110 0 01234444 Q ss_pred eEece--EEEeeCccccCCCcEEEEEcCceeEEeee-eeeehhhcCC--CceeeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 199 NLLGA--RIVESNNLRDTDDEQFVAFHPSAAAYVSQ-IDTVEALRDQ--DSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 199 ~~~G~--~v~~s~~l~~~~~~~~~~~~~~a~~~a~~-~~~~e~~~~~--~~~~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) .++|+ .|+.|+.+|... ++.+..+......+ -..++...+- ..-.+.+++.+++|.++++|+++++|.-++= T Consensus 301 t~lg~p~~vv~s~~~p~~~---i~fgdf~~Y~i~~r~~~~i~~~~~~~~~~d~~~f~~~~r~dg~~~~~~a~~vl~i~~~ 377 (377) T protein:vir:98 301 TVLPHGITILESLAVETGK---AIAFVANRYDAFMATASTIEEYDQTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) T ss_pred cccCCCceEEecCCCCccc---EEEEEecceeEEeecceEEEeechhhhhcCceEEEEEEEEcCEEeccCcEEEEEEecC Confidence 56654 577888887532 33443333322222 2233322211 1224689999999999999999999988877 No 177 >protein:vir:79928 Length: 393 # NCBI annotation: major head protein # Family: family:all:30335 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429616;genbank:gi:156564106;genbank:GeneID:5525693 Probab=98.15 E-value=4.7e-07 Score=55.26 Aligned_cols=263 Identities=11% Similarity=0.089 Sum_probs=144.6 Q ss_pred Cccc---hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeecCCCcccCCCCCc---cceEE Q lcl|NC_011288. 1 MAFN---NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSADAIS---DTGVD 74 (273) Q Consensus 1 MA~~---~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~~~~~~~~---~~~~~ 74 (273) |+.- +++|.+++..+.+.-+.-.+-.+++.. ...+.|.+..||.++..-..+... ++...-+.++ .+. T Consensus 74 mtt~~a~IliP~vis~v~~Eaaepl~~~~kl~qk---~~L~~Grsm~F~~~g~~Ra~~IgE-GgE~~~~sld~~T~ds-- 147 (393) T protein:vir:79 74 MATPSAQILIPRVIVGTMREAAEPLYIGTKMLQK---IRLKSGQSMIFPSIGIMRAYDVAE-GQEIPEDSIDWQTHES-- 147 (393) T ss_pred hcCCCcceechhhhhhhhhhcccchhHHHHHHHH---HhhhcCcceeccchheeeeccccc-cccccccchhhhcCCc-- Confidence 5543 688999999888866555444445432 123568999999999665444333 3333333333 334 Q ss_pred EEEeeee-ecceEEchHHHHhhhHH-HHHHHHHHHHHHHHHHHHHHHHHHhhccccc---------ccccC------CCH Q lcl|NC_011288. 75 LLIDQEK-SIDFLVDDIDRVQVAGS-LEAYTRAGATALATDTDKFIADLLVDNGTAL---------SGSAP------TDA 137 (273) Q Consensus 75 ~~id~~~-~~~~~i~d~d~~~~~~~-~~~~~~~~~~ala~~~D~~i~~~~~~~~~~~---------~~~~~------~t~ 137 (273) +++.+.+ +..+.+++.-...+--+ +.-.+++++++|+++.|...+....+.+..+ ...++ -.+ T Consensus 148 v~~~~gK~G~~Ia~SqEmIsDSg~Dvin~~l~aA~RaMaRkKee~a~n~fk~~ghtvfDa~st~t~ahptGr~~~~~qNG 227 (393) T protein:vir:79 148 PEIRVGKSGIRLRFTDEMISDSQWDLMSMMIKQAGRAMGRHKEQKAYHQFRSHGHTVFDNYSTNKLAHTTGLDKNGVQND 227 (393) T ss_pred eeEEechhhhhhhhHHHHhhcchHHHHHHHHHHHHHHHHhhhHHHHHhhhhcccceeeeccccCccceeecCCccccccc Confidence 4443333 34456666433333335 4568899999999999999999887655411 11111 112 Q ss_pred HHHHHHHHHHHH-HHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeee--------e--ee-eeE-eceE Q lcl|NC_011288. 138 DDAFDLIATALK-ELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRA--------G--TI-GNL-LGAR 204 (273) Q Consensus 138 ~~~~~~i~~a~~-~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~--------G--~i-g~~-~G~~ 204 (273) ...+++|.++.- .+...-. +-.+++.|-.|..+.+... +.......-++.-.. | .| +++ +.++ T Consensus 228 TlSleDllDm~~av~~~hyt---~svi~MHPLAWnv~AKna~-me~~~~na~gN~~~~~~~ts~algp~~i~~~~~~nln 303 (393) T protein:vir:79 228 TFSAEDFLDLIIAVMANEYT---PSDLMMHPLAWTVFAKNEL-MGSLQANPYGNYPAKGAPSSMALGPDSIQGRLPFNFN 303 (393) T ss_pred cccHHHHHHHHHHHhcccCC---cceEEEcCchhhhhhhhhh-hcceeeccccccCccccchhhhhchhhhcccccccee Confidence 234566666443 3444433 4569999999888866532 111110000010000 1 11 122 3589 Q ss_pred EEeeCccccCCCc---EEEEEcCceeEE--eeeeeeehhhcCCCceeeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 205 IVESNNLRDTDDE---QFVAFHPSAAAY--VSQIDTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 205 v~~s~~l~~~~~~---~~~~~~~~a~~~--a~~~~~~e~~~~~~~~~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) |+.|+.+|-.... .+++..++.++. +.-.-+++...++-+-=+-++=+.+||.+||+..+.+..-..-| T Consensus 304 v~~sPfvp~d~k~~rFd~~~Vd~NnvgvlLV~D~i~tdq~ddk~rdiq~iKl~ERYG~gvLn~gkaiavakNI~ 377 (393) T protein:vir:79 304 VNLSPFIPLDKKSRRFDVYAVDRNNVGVLLVRDDLKTDQWDEKARGLQNIKMIERYGIGILNEGKAIAVAKNIS 377 (393) T ss_pred EEEecccccccccceeeEEEeecCCceEEEEecCcceeccccccccceeeeeeeeeceeeeeCCceEEEEecce Confidence 9999998853332 233444554442 22222344444444444567788999999998775554433333 No 178 >protein:vir:8324 Length: 410 # NCBI annotation: gp41 # Family: family:all:30827 # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817892;genbank:gi:29566325;genbank:GeneID:1259520 Probab=98.06 E-value=5.7e-07 Score=54.82 Aligned_cols=251 Identities=14% Similarity=0.127 Sum_probs=143.1 Q ss_pred Ccc--chhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecC-cccceee------cCCCcccCCCCCccc Q lcl|NC_011288. 1 MAF--NNFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVV-APTVKDY------KAAGRQTSADAISDT 71 (273) Q Consensus 1 MA~--~~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~-~~~~~~~------~~~~~~~~~~~~~~~ 71 (273) =++ ..+.|+ |-+-+++.+.+.-....++.+ -+.+|.|+..|... +.++..+ ..++..+.+..+..+ T Consensus 136 Tgd~~~~i~~~-~v~d~i~li~q~r~i~slf~t----LP~~g~T~eY~v~t~~~tV~~q~~~~kqa~EGd~L~~gKl~~~ 210 (410) T protein:vir:83 136 TGDLQGVIPDP-IVGPVIDFIDSARPLVSTLGT----LPLNNATFYRPIVSQRPAVGLQGVAGGASDEKTELDSQKMVID 210 (410) T ss_pred ccccccccchh-HhhhHHHHHhhccchhhhhhh----CCCCCCeeEEeeecccccccccccccccccccccccccceeee Confidence 111 123455 777777777766656665543 24568899987753 3333322 234556777888888 Q ss_pred eEEEEEeeeeecceEEc-hHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccCCCHHHHHHHHHHHHHH Q lcl|NC_011288. 72 GVDLLIDQEKSIDFLVD-DIDRVQVAGSLEAYTRAGATALATDTDKFIADLLVDNGTALSGSAPTDADDAFDLIATALKE 150 (273) Q Consensus 72 ~~~~~id~~~~~~~~i~-d~d~~~~~~~~~~~~~~~~~ala~~~D~~i~~~~~~~~~~~~~~~~~t~~~~~~~i~~a~~~ 150 (273) ..+..|+.+.+....-- .+|. .....++-.++-+..+-|+..++...+.+...-........+|+.++...|.++... T Consensus 211 t~tA~ikTyGGyt~LSRQ~IER-s~v~~L~~~lraL~~AYA~atea~vra~L~~t~t~~~a~~~~Tad~~~~~i~da~~~ 289 (410) T protein:vir:83 211 RLTVNAKTLGGYVNVSRQAIDF-SSPSALDLVVNGLGQQYAIETEALVGAALASTSTGAVGYGNATADNVASAIWQAAGA 289 (410) T ss_pred eccceeehhcCcccccceeeec-CChhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhccHHHHHHHHHHHHHH Confidence 88888887644332211 2221 222334445555555556666655555554333333345566888888888899998 Q ss_pred Hhhc--CCCccCCEEEECHHHHHHHhhhHHHHhh-----hhccc-ccceeeeeeeeeEeceEEEeeCccccCCCcEEEEE Q lcl|NC_011288. 151 LTKA--NVPNVGRVVVVNAEMAFWLRSSGSKLTS-----ADTSG-DAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAF 222 (273) Q Consensus 151 l~~~--~vP~~~r~lvv~p~~~~~L~~~~~~~~~-----~~~~~-~~~~l~~G~ig~~~G~~v~~s~~l~~~~~~~~~~~ 222 (273) .+++ ++ .-+++.|+|+....+... |+. .+..| +...+-.|.-|+++|++|++...++.++ ++.. T Consensus 290 v~da~~~~--~~~~i~vS~DVl~~~~~~---f~~~~~~~~dt~Gfg~~~lg~gi~G~~~~ipVvm~~~a~AgT---A~f~ 361 (410) T protein:vir:83 290 VYTAVKGM--GRLVIAIAPDVLGDFGPL---FAPVNPTNAHSTGFEAGRFGQGVMGSISGIPVVMSAALGSGD---AYLF 361 (410) T ss_pred Hhhhhccc--eeeeEEechhhhhhccce---eeccCCCCcccccccccccccchhhhhcccceEEecCCCcCe---eeEe Confidence 8887 44 447899999997665442 221 12222 1122336766999999999998876433 3333 Q ss_pred cCceeE-Eee-----eeeeehhhcCCCceeeeEEeeeeeeeEEecCceEEEEecC Q lcl|NC_011288. 223 HPSAAA-YVS-----QIDTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKT 271 (273) Q Consensus 223 ~~~a~~-~a~-----~~~~~e~~~~~~~~~~~v~~~~~~g~~v~~~~~~v~~~~~ 271 (273) .+.|+. +.. |+.+=....-..-|+ -+|+.-+..|++++=+.-+ T Consensus 362 ~~~Ai~~~eS~~gp~qL~d~~i~nLt~~yS------gY~a~a~~~~~gliPv~g~ 410 (410) T protein:vir:83 362 STAAIECFEQRVGTLQVVEPSVFGLQVAYA------GYFSTLVVNEDAIVPLVGS 410 (410) T ss_pred ccceeeeeecCCceeEeeCCchhhhhhhhe------eeeeeccccccceeeeccC Confidence 555553 222 222111222233333 3445667778888876444 No 179 >protein:vir:96792 Length: 315 # NCBI annotation: major capsid protein # Family: family:all:47 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224246;genbank:gi:62362381;genbank:GeneID:3345731 Probab=98.06 E-value=5.8e-06 Score=49.31 Aligned_cols=258 Identities=14% Similarity=0.066 Sum_probs=121.3 Q ss_pred Cccc------hhhHHHHHHHHHHHHHHhh-ccchhhcc--cccccccCCceEEEeecC---cccceeecCCCcccCCCCC Q lcl|NC_011288. 1 MAFN------NFIPELWSDMLLEEWTAQT-VFANLVNR--EYEGTASKGNVVHIAGVV---APTVKDYKAAGRQTSADAI 68 (273) Q Consensus 1 MA~~------~~~pev~~~~~~~~~~~~l-v~~~~v~~--~~~~~~~~Gdtv~ip~~~---~~~~~~~~~~~~~~~~~~~ 68 (273) ||.+ .|-+.+....+ |.+.+.+ +|.....- -....+-.||=...+.+. ....+++.. ++.+++..+ T Consensus 1 ~~~t~~sdl~vfn~~~~~a~~-e~~~~~~~~Fnaas~Gai~l~~~~~~GDf~~~~ff~i~~~~~~rnv~~-~~~~t~~ki 78 (315) T protein:vir:96 1 MATTVNSDLVIYNDTAQTAYL-ERNMDNLAVFNENSRAAIGLNSELIEGDLKLRSFYKVGGAIADRDVNS-TATVAGTKI 78 (315) T ss_pred CceeeecceeeehhhhhhhHH-hhhHHHHHHhhhhcCCcccccccccccccccccccccccchhhcccCC-Cccccceec Confidence 8887 23455544433 3344332 33322110 001122347666666443 222334432 334555554 Q ss_pred ccc-eEEEEEeeeeecceEEchHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHH----HHhhc--ccccccccCCCHHHHH Q lcl|NC_011288. 69 SDT-GVDLLIDQEKSIDFLVDDIDRVQVAGSLEAYTRAGATALATDTDKFIAD----LLVDN--GTALSGSAPTDADDAF 141 (273) Q Consensus 69 ~~~-~~~~~id~~~~~~~~i~d~d~~~~~~~~~~~~~~~~~ala~~~D~~i~~----~~~~~--~~~~~~~~~~t~~~~~ 141 (273) +.. .+.+++ -.++-++.++.........+..++.....+.++...-+.++. .+.+. .......+..+..... T Consensus 79 t~~~dvaVk~-~~~~~~~~~~~~~~a~~g~dp~~~~~~i~~~~~~~~l~~~l~~~l~~~~aai~~~t~~~~~~~~a~~~~ 157 (315) T protein:vir:96 79 AADEMVSVKV-PWKYGPYETTEEAFKRRARSPEEFSMLIGQDMADATMAGWIGYALNALQGAIGSNAGMNVSGELATEGK 157 (315) T ss_pred ccccceeEEE-eecCCchhccHHHHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhcccccccccccccccCH Confidence 433 344444 333444555544444333344444434444443333333332 22111 1111111122233445 Q ss_pred HHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCccccCCCcEEEE Q lcl|NC_011288. 142 DLIATALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVA 221 (273) Q Consensus 142 ~~i~~a~~~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~~~~~~~~ 221 (273) +.|.+|.++|.++. ..=-.+++.+..|..|.+ .. +.+.-+.-.....+.+..+.+ |-.|+.+..+|.. ..+. T Consensus 158 ~~l~dA~~klGD~~--~~l~~~vMHS~v~~~L~~-q~-L~~~~~~~~~~~~~~~~~~~l-GkrViVdD~~P~~---~~~g 229 (315) T protein:vir:96 158 KVLTKGLRTMGDKA--SSIAIWVMDSTSYFDIVD-EA-IDNKLYEEAGVVVYGGTPGTL-GKPVLVTDQCPAT---KIFG 229 (315) T ss_pred HHHHHHHHHhcccc--cCeeEEEEchHHHHHHHH-hh-hhhhcccccceeEecCcCccc-ccEEEEECCCCcc---eeee Confidence 77899999998774 111347899999999988 33 444333222223333444544 9999999999963 3445 Q ss_pred EcCceeEEeeeee----eehhhcCCCceeeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 222 FHPSAAAYVSQID----TVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 222 ~~~~a~~~a~~~~----~~e~~~~~~~~~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) ..++|+++...-+ ..|.. +.-+++..+.+-+++++-|.|+---++.+. T Consensus 230 l~~GAi~~~~~~~~~~~~~~~~----g~e~l~~~~r~e~tf~l~p~G~sw~~~~~~ 281 (315) T protein:vir:96 230 LVAGAVMITESQAPGMRSYQID----DQENLAIGFRAEGTANVEVLGYKWKTKTNV 281 (315) T ss_pred eecceeeecCCCccccccccCC----CcceeEEEEeeeeEeeeeeeeEEeecCCCc Confidence 5577777654222 11222 223334443333445555555544322222 No 180 >protein:vir:98871 Length: 314 # NCBI annotation: major capsid protein # Family: family:all:3269 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164418;genbank:gi:56694908;genbank:GeneID:3197261 Probab=97.87 E-value=3.4e-06 Score=50.55 Aligned_cols=268 Identities=13% Similarity=0.096 Sum_probs=149.1 Q ss_pred Cccch----hhHHHHHHHHHHHHHHhhccchhhccccc-ccc-cCCceEEEeecCccc--c-eeecCC--CcccC--C-- Q lcl|NC_011288. 1 MAFNN----FIPELWSDMLLEEWTAQTVFANLVNREYE-GTA-SKGNVVHIAGVVAPT--V-KDYKAA--GRQTS--A-- 65 (273) Q Consensus 1 MA~~~----~~pev~~~~~~~~~~~~lv~~~~v~~~~~-~~~-~~Gdtv~ip~~~~~~--~-~~~~~~--~~~~~--~-- 65 (273) -+|++ .+.+.|.+-|.+.|....+|.+.+--..+ .++ .+.+|.---+....+ + ++|... ....+ . T Consensus 21 t~N~n~avr~Y~Kqf~glL~~vf~~qa~F~~~FGg~lQalDGV~~N~tafsvKtsD~pVVig~~Y~TdeNvaFGtGTg~S 100 (314) T protein:vir:98 21 TANQNKAARSYQKEFRQLLQAVFRSQAYFRDFFGGGIEALDGVQHNDTAFYVKTSDIPVVVGNEYNKDENVGFGEGTSRS 100 (314) T ss_pred cccCccceeeecHHHHHHHHHHHhhHhhhhhhcccceeeccCCCccceEEEEeecccceeecCcccCCCCcccccCCccc Confidence 33332 24556888888889999888776533111 112 233332222221111 1 223211 11111 1 Q ss_pred CCCccceEEEEEeee-ee-cceEEc-hHHHHhhhHHHH----HHHHHHHHHHHHHHHHHHHHHHhhcccccccccCCCHH Q lcl|NC_011288. 66 DAISDTGVDLLIDQE-KS-IDFLVD-DIDRVQVAGSLE----AYTRAGATALATDTDKFIADLLVDNGTALSGSAPTDAD 138 (273) Q Consensus 66 ~~~~~~~~~~~id~~-~~-~~~~i~-d~d~~~~~~~~~----~~~~~~~~ala~~~D~~i~~~~~~~~~~~~~~~~~t~~ 138 (273) +....-..-+..|.. .| ++..|. .+|....+++++ ..++.++++-.+.+|..+-..+...+......+..+.+ T Consensus 101 sRFGprkEi~y~dtdVpY~~~~~iHEGiD~~TVNnd~~aaVAdRL~LQA~Akt~~~n~~~Gk~lS~~As~te~ltd~~~d 180 (314) T protein:vir:98 101 TRFGPRREIIYQDTPVPYTWEWVYHEGIDKHTVNNDFQAAVADRLDLQANAKIKQFNAQHSKFISSIAEKTETLTDYSAD 180 (314) T ss_pred cccCceeEEEeecccccccccchhhhccccccccCChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhcchh Confidence 111111111111111 11 222222 345544444443 34556677777888887766665555443334455667 Q ss_pred HHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCccccCCCcE Q lcl|NC_011288. 139 DAFDLIATALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQ 218 (273) Q Consensus 139 ~~~~~i~~a~~~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~~~~~ 218 (273) .....|.++.+..-+..+- ..-..+|+|+.|..|...+ +.......+.+ +-+--|-++-||.+.+.+.-...++.. T Consensus 181 ~V~~LF~~as~~yvn~ev~-~~~~AyV~~evYnaiiD~~--l~TsaK~SsaN-IDengi~~FkGf~i~e~P~~~~q~g~i 256 (314) T protein:vir:98 181 NVLRLFNELSKYYVNIEAI-GTKAAKVSPELYNAIVDHP--LTTSAKSSSAN-IDQNGIVNFKGFAIQEIPESMLQSGDV 256 (314) T ss_pred hHHHHHHHHHhhhhcceee-EEEEEEEchhHHhHhhccc--cccccccceee-eccCCcceecceEEEecchhhcCCCcE Confidence 7777888888888877763 3467899999999998764 22222222222 333337789999998876544444443 Q ss_pred EEEEcCceeEEee-eeeeehhhcCCCceeeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 219 FVAFHPSAAAYVS-QIDTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 219 ~~~~~~~a~~~a~-~~~~~e~~~~~~~~~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) ++.. .+.++.+- .+........++.-|..+.|.=-||-.+++..+.+++|-+.+ T Consensus 257 a~~s-~dnig~aftGIn~aR~IesEdF~GValQgAGK~G~~I~edNk~Ai~k~t~t 311 (314) T protein:vir:98 257 AYTY-ITNIGKAFTGINTSRIIESEDFDGVALQGAGKAGEFILDDNKKAVAKVTST 311 (314) T ss_pred EEEc-cccceeecccceeeeeeecccccceeeecccccccccccccceeeEEEecC Confidence 3332 23343332 333345566778889999999999999999998888888888 No 181 >protein:vir:79548 Length: 652 # NCBI annotation: putative protease/scaffold protein # Family: family:all:62 # ACLAME annotation(s): go:0008236 - serine-type peptidase activity; phi:0000017 - phage prohead/capsid assembly # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272518;genbank:gi:148609387;genbank:GeneID:5204384 Probab=97.74 E-value=2.3e-05 Score=46.01 Aligned_cols=259 Identities=12% Similarity=0.079 Sum_probs=143.5 Q ss_pred Cc--cch-hhHHHHHHHHHHHHHHhh-----ccchhhcccccccccCCceEEEeecCcccceeecCCCcccCCCCCccce Q lcl|NC_011288. 1 MA--FNN-FIPELWSDMLLEEWTAQT-----VFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTG 72 (273) Q Consensus 1 MA--~~~-~~pev~~~~~~~~~~~~l-----v~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~~~~~~~~~~~ 72 (273) +| ++. =.|-++...+-+.|...- -|..++.+..-.++++...+.+--.+.+. ..++++......+.++. T Consensus 359 ~A~~hsTsDFp~IL~~~~nk~l~~~y~~a~~t~~~~~~~~~~~DFk~~~~~~lg~~~~L~---~V~E~gEyk~~t~~e~~ 435 (652) T protein:vir:79 359 AAFTHSTSDFGNILLDVANKAILQGWEDAPETYEQWTRKGQLSDFKIAHRVGMGGFSALR---QVREGAEYKYVTTGDKQ 435 (652) T ss_pred HHhhcCcchHHHHHHHHHHHHHHHHHhhhHHHHHHHhccCCCccccccceeecCCCCCcc---ccCCCCccceeeecCcc Confidence 22 221 125555554444443332 23444544433456776676664444432 34556666677788888 Q ss_pred EEEEEeeeeecceEEchHHHHhhhHHHH---HHHHHHHHHHHHHHHHHHHHHHhhccccc-cccc----------CCCHH Q lcl|NC_011288. 73 VDLLIDQEKSIDFLVDDIDRVQVAGSLE---AYTRAGATALATDTDKFIADLLVDNGTAL-SGSA----------PTDAD 138 (273) Q Consensus 73 ~~~~id~~~~~~~~i~d~d~~~~~~~~~---~~~~~~~~ala~~~D~~i~~~~~~~~~~~-~~~~----------~~t~~ 138 (273) .++.+.++ +.-|.|+- ++..++|+. .+....+++-++.++..+++.+..++.-. .+.+ ..+++ T Consensus 436 e~~~l~ty-G~~~~iTR--qaiINDDL~a~~~ip~~~g~aA~~~~~~~vy~~l~~Np~~~~DGk~LF~hA~H~Nl~~~aa 512 (652) T protein:vir:79 436 ATIALATY-GELFSITR--QAIINDDLNMLTDVPMKLGRAAKSTIADLVYAILTSNPKISTDNVSLFDKAKHANVLESAA 512 (652) T ss_pred ceeeeecc-cCeeeeeh--heeeccchhHHHHHHHHHHHHHHHHHHHHHHHHHhcCcccccCCceeeccccccccccccc Confidence 88999765 44455554 234455554 45567778888888888888886654321 1111 01123 Q ss_pred HHHHHHHHHHHHHhhcCCC-----ccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEece-EEEeeCccc Q lcl|NC_011288. 139 DAFDLIATALKELTKANVP-----NVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGA-RIVESNNLR 212 (273) Q Consensus 139 ~~~~~i~~a~~~l~~~~vP-----~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~-~v~~s~~l~ 212 (273) ...+.+.+++..|..++-. -..+|++++|+......+ .+...... ......|.+--+.|+ +++....|. T Consensus 513 ~~~~~l~~ar~aM~~Qk~g~~~l~i~P~~llvp~~le~~a~~---ll~s~~v~--~a~~~~~~~Np~~~~~~~i~eprL~ 587 (652) T protein:vir:79 513 MDVASLDKARQLMRVQKEGERHLNIRPAFVLVPTAMESVANQ---VIRSSSVK--GADINAGIINPVKDFATVIAEPRLD 587 (652) T ss_pred CCHHHHHHHHHHHHHhccCCccccccccEEEecchhHHHHHH---HhccCCCc--ccccccccccccccccccccccccC Confidence 3456677777777665521 124789999997654433 22111111 111223445556664 777787776 Q ss_pred cCCCcEE-EEEcCc--eeEEe--e--eeeeehhhcCCCceeeeEEeeeeeeeEEecCceEEEEec Q lcl|NC_011288. 213 DTDDEQF-VAFHPS--AAAYV--S--QIDTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNK 270 (273) Q Consensus 213 ~~~~~~~-~~~~~~--a~~~a--~--~~~~~e~~~~~~~~~~~v~~~~~~g~~v~~~~~~v~~~~ 270 (273) ....... ++..+. .+-++ . +...+|....-+..|-.++.++-||++++|--|++..++ T Consensus 588 ~~s~~~wylaa~~~~dtiev~yL~G~~~P~ie~~~gf~~dG~~~kvrlD~G~~~iD~RG~~k~t~ 652 (652) T protein:vir:79 588 DNSQTTFYLAASKGSDTIEVAYLNGVDTPYIDQMEGFSVDGVTTKVRIDAGVAPVDHRGLVKCTA 652 (652) T ss_pred CCCcccEEEecCCCCCeEEEEEecCCCCCeeeecCCCCcceEEEEEEEeccCceeeccceeeecC Confidence 5444333 333332 23222 2 112334433333346688999999999999999998888 No 182 >protein:vir:819 Length: 404 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:16 # MgeName: VT2-Sa # Cross-refs: genbank:acc:NP_050552;genbank:gi:9633449;genbank:GeneID:1262254 Probab=97.71 E-value=2.6e-05 Score=45.70 Aligned_cols=262 Identities=14% Similarity=0.046 Sum_probs=127.9 Q ss_pred CccchhhHHHHHHHHHHHHHHhhccchh--------hcccccccccCCceEEEeecCcccceeecCCCc--ccCCCCCcc Q lcl|NC_011288. 1 MAFNNFIPELWSDMLLEEWTAQTVFANL--------VNREYEGTASKGNVVHIAGVVAPTVKDYKAAGR--QTSADAISD 70 (273) Q Consensus 1 MA~~~~~pev~~~~~~~~~~~~lv~~~~--------v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~--~~~~~~~~~ 70 (273) +.|+..+ .+|...+...-....-+... +.+--+++...||+|+|+....++..- +.++. ....+.+.. T Consensus 22 ~~~~~~~-~~~~~~~~~~~~~~~~~~~~~g~~~~~~I~~~~dL~K~aGd~vtf~L~~~L~g~g-v~Gd~~lEGnee~L~~ 99 (404) T protein:vir:81 22 NRNRSMV-NILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLSKRP-TMGDERVEGRGEDLSH 99 (404) T ss_pred hcCChhH-hhhhhhhhhhhhhccchhhccCCCCCccEEEeecCCCCCCcEEEEeEeeecccCC-cccCceeeccccceeE Confidence 5565533 33444322222222111111 122223444679999998876654222 11222 233456888 Q ss_pred ceEEEEEeeeeecceEEc-hHHHHhhhHHHHHHHH-HHHHHHHHHHHHHHHHHHhhcccc-------------------- Q lcl|NC_011288. 71 TGVDLLIDQEKSIDFLVD-DIDRVQVAGSLEAYTR-AGATALATDTDKFIADLLVDNGTA-------------------- 128 (273) Q Consensus 71 ~~~~~~id~~~~~~~~i~-d~d~~~~~~~~~~~~~-~~~~ala~~~D~~i~~~~~~~~~~-------------------- 128 (273) .+.+++||+.+. ++... ..+..-+..++++..+ .+..-+++..|+.++--+...... T Consensus 100 ~s~~i~Idq~r~-~V~~~g~msqQRt~~dlr~~ar~~L~~w~~~~~d~~~~~~laG~rg~~~n~~~~vp~~~~~~~~~~~ 178 (404) T protein:vir:81 100 ADFSLKINQGRH-LVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFVADDTILPTAEHPEFKKIM 178 (404) T ss_pred EeeEEEEeeecc-cccccCchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccceeecccccccccee Confidence 889999988644 34333 3344445566666554 444557888888877554421110 Q ss_pred -ccccc-----------CC-------CHHHHHHHHHHHHHHHhhcCCCc-------cC-------CEEEECHHHHHHHhh Q lcl|NC_011288. 129 -LSGSA-----------PT-------DADDAFDLIATALKELTKANVPN-------VG-------RVVVVNAEMAFWLRS 175 (273) Q Consensus 129 -~~~~~-----------~~-------t~~~~~~~i~~a~~~l~~~~vP~-------~~-------r~lvv~p~~~~~L~~ 175 (273) +...+ .+ +-...++.|.++.+.+++..-|. +. ++++++|.++..|+. T Consensus 179 ~N~v~APt~~r~~~~g~at~~~~l~stD~~s~~~Id~~~~~~~~~~~pi~Pv~~~g~~~~~~~~~yV~~~~p~q~~~Lr~ 258 (404) T protein:vir:81 179 INDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGEDPYYVLYVTPRQWNDWYT 258 (404) T ss_pred ecccCCCCCCcEEeccCccchhhhhhcccccHHHHHHHHHHHHHhCCCCcceEeccccccCccceEEEEechHHHHHHhh Confidence 00000 01 11124667888888887755442 11 678999999999999 Q ss_pred hHHH--H----hhhhc--ccccceeeeeeeeeEeceEEEeeCcccc--CCCcEE------------------------EE Q lcl|NC_011288. 176 SGSK--L----TSADT--SGDAAGLRAGTIGNLLGARIVESNNLRD--TDDEQF------------------------VA 221 (273) Q Consensus 176 ~~~~--~----~~~~~--~~~~~~l~~G~ig~~~G~~v~~s~~l~~--~~~~~~------------------------~~ 221 (273) +... | .++.. .+..++|..|.+|+|.|+-+++..+.|- ..+..+ +. T Consensus 259 dt~~~~w~d~q~~A~a~~rg~~nPlF~G~~gm~ngvii~~~~~~~Irf~~g~~~~~~~n~~~a~~~~~aa~~~v~RallL 338 (404) T protein:vir:81 259 STSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNLTATTKEVAAATNIDRAMLL 338 (404) T ss_pred CCCcHHHHHHHHHHhhccccccCCceecCeeEEcCEEEEecCCceeeecccceeeecCCccccccccccccccchhheee Confidence 8521 2 22211 2566889999999999999998776541 111111 11 Q ss_pred EcCcee--EEeee----eeeehhhcC-CCceeeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 222 FHPSAA--AYVSQ----IDTVEALRD-QDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 222 ~~~~a~--~~a~~----~~~~e~~~~-~~~~~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) | ..|+ ++++. ..-.|-..| ++.. .|....+.|.+=+|-. ...++ T Consensus 339 G-aQAl~~A~g~~~g~~~~w~Ee~~D~g~~~--~i~~~~i~G~kK~rF~-----~~~g~ 389 (404) T protein:vir:81 339 G-AQALANAYGQKAGGHFNMVEKKTDMDNRT--EIAISWINGLKKIRFP-----EKSGK 389 (404) T ss_pred c-ceeEEEEeeccCCCCceeEeeccccCchh--hhhhHHHhhhhhcccc-----CCCCc Confidence 1 1121 12110 000011111 1111 2333334443322210 00112 No 183 >protein:vir:104439 Length: 404 # NCBI annotation: putative virion structural protein # Family: family:all:974 # MgeID: mge:1471 # MgeName: 86 # Cross-refs: genbank:acc:YP_794063;genbank:gi:116222008;genbank:GeneID:4397504 Probab=97.71 E-value=2.6e-05 Score=45.70 Aligned_cols=262 Identities=14% Similarity=0.046 Sum_probs=127.9 Q ss_pred CccchhhHHHHHHHHHHHHHHhhccchh--------hcccccccccCCceEEEeecCcccceeecCCCc--ccCCCCCcc Q lcl|NC_011288. 1 MAFNNFIPELWSDMLLEEWTAQTVFANL--------VNREYEGTASKGNVVHIAGVVAPTVKDYKAAGR--QTSADAISD 70 (273) Q Consensus 1 MA~~~~~pev~~~~~~~~~~~~lv~~~~--------v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~--~~~~~~~~~ 70 (273) +.|+..+ .+|...+...-....-+... +.+--+++...||+|+|+....++..- +.++. ....+.+.. T Consensus 22 ~~~~~~~-~~~~~~~~~~~~~~~~~~~~~g~~~~~~I~~~~dL~K~aGd~vtf~L~~~L~g~g-v~Gd~~lEGnee~L~~ 99 (404) T protein:vir:10 22 NRNRSMV-NILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLSKRP-TMGDERVEGRGEDLSH 99 (404) T ss_pred hcCChhH-hhhhhhhhhhhhhccchhhccCCCCCccEEEeecCCCCCCcEEEEeEeeecccCC-cccCceeeccccceeE Confidence 5565533 33444322222222111111 122223444679999998876654222 11222 233456888 Q ss_pred ceEEEEEeeeeecceEEc-hHHHHhhhHHHHHHHH-HHHHHHHHHHHHHHHHHHhhcccc-------------------- Q lcl|NC_011288. 71 TGVDLLIDQEKSIDFLVD-DIDRVQVAGSLEAYTR-AGATALATDTDKFIADLLVDNGTA-------------------- 128 (273) Q Consensus 71 ~~~~~~id~~~~~~~~i~-d~d~~~~~~~~~~~~~-~~~~ala~~~D~~i~~~~~~~~~~-------------------- 128 (273) .+.+++||+.+. ++... ..+..-+..++++..+ .+..-+++..|+.++--+...... T Consensus 100 ~s~~i~Idq~r~-~V~~~g~msqQRt~~dlr~~ar~~L~~w~~~~~d~~~~~~laG~rg~~~n~~~~vp~~~~~~~~~~~ 178 (404) T protein:vir:10 100 ADFSLKINQGRH-LVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFVADDTILPTAEHPEFKKIM 178 (404) T ss_pred EeeEEEEeeecc-cccccCchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccceeecccccccccee Confidence 889999988644 34333 3344445566666554 444557888888877554421110 Q ss_pred -ccccc-----------CC-------CHHHHHHHHHHHHHHHhhcCCCc-------cC-------CEEEECHHHHHHHhh Q lcl|NC_011288. 129 -LSGSA-----------PT-------DADDAFDLIATALKELTKANVPN-------VG-------RVVVVNAEMAFWLRS 175 (273) Q Consensus 129 -~~~~~-----------~~-------t~~~~~~~i~~a~~~l~~~~vP~-------~~-------r~lvv~p~~~~~L~~ 175 (273) +...+ .+ +-...++.|.++.+.+++..-|. +. ++++++|.++..|+. T Consensus 179 ~N~v~APt~~r~~~~g~at~~~~l~stD~~s~~~Id~~~~~~~~~~~pi~Pv~~~g~~~~~~~~~yV~~~~p~q~~~Lr~ 258 (404) T protein:vir:10 179 INDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGEDPYYVLYVTPRQWNDWYT 258 (404) T ss_pred ecccCCCCCCcEEeccCccchhhhhhcccccHHHHHHHHHHHHHhCCCCcceEeccccccCccceEEEEechHHHHHHhh Confidence 00000 01 11124667888888887755442 11 678999999999999 Q ss_pred hHHH--H----hhhhc--ccccceeeeeeeeeEeceEEEeeCcccc--CCCcEE------------------------EE Q lcl|NC_011288. 176 SGSK--L----TSADT--SGDAAGLRAGTIGNLLGARIVESNNLRD--TDDEQF------------------------VA 221 (273) Q Consensus 176 ~~~~--~----~~~~~--~~~~~~l~~G~ig~~~G~~v~~s~~l~~--~~~~~~------------------------~~ 221 (273) +... | .++.. .+..++|..|.+|+|.|+-+++..+.|- ..+..+ +. T Consensus 259 dt~~~~w~d~q~~A~a~~rg~~nPlF~G~~gm~ngvii~~~~~~~Irf~~g~~~~~~~n~~~a~~~~~aa~~~v~RallL 338 (404) T protein:vir:10 259 STSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNLTATTKEVAAATNIDRAMLL 338 (404) T ss_pred CCCcHHHHHHHHHHhhccccccCCceecCeeEEcCEEEEecCCceeeecccceeeecCCccccccccccccccchhheee Confidence 8521 2 22211 2566889999999999999998776541 111111 11 Q ss_pred EcCcee--EEeee----eeeehhhcC-CCceeeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 222 FHPSAA--AYVSQ----IDTVEALRD-QDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 222 ~~~~a~--~~a~~----~~~~e~~~~-~~~~~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) | ..|+ ++++. ..-.|-..| ++.. .|....+.|.+=+|-. ...++ T Consensus 339 G-aQAl~~A~g~~~g~~~~w~Ee~~D~g~~~--~i~~~~i~G~kK~rF~-----~~~g~ 389 (404) T protein:vir:10 339 G-AQALANAYGQKAGGHFNMVEKKTDMDNRT--EIAISWINGLKKIRFP-----EKSGK 389 (404) T ss_pred c-ceeEEEEeeccCCCCceeEeeccccCchh--hhhhHHHhhhhhcccc-----CCCCc Confidence 1 1121 12110 000011111 1111 2333334443322210 00112 No 184 >protein:vir:10123 Length: 404 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:180 # MgeName: Stx2 converting bacteriophage II # Cross-refs: genbank:acc:NP_859253;genbank:gi:32171009;genbank:GeneID:2653345 Probab=97.71 E-value=2.6e-05 Score=45.70 Aligned_cols=262 Identities=14% Similarity=0.046 Sum_probs=127.9 Q ss_pred CccchhhHHHHHHHHHHHHHHhhccchh--------hcccccccccCCceEEEeecCcccceeecCCCc--ccCCCCCcc Q lcl|NC_011288. 1 MAFNNFIPELWSDMLLEEWTAQTVFANL--------VNREYEGTASKGNVVHIAGVVAPTVKDYKAAGR--QTSADAISD 70 (273) Q Consensus 1 MA~~~~~pev~~~~~~~~~~~~lv~~~~--------v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~--~~~~~~~~~ 70 (273) +.|+..+ .+|...+...-....-+... +.+--+++...||+|+|+....++..- +.++. ....+.+.. T Consensus 22 ~~~~~~~-~~~~~~~~~~~~~~~~~~~~~g~~~~~~I~~~~dL~K~aGd~vtf~L~~~L~g~g-v~Gd~~lEGnee~L~~ 99 (404) T protein:vir:10 22 NRNRSMV-NILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLSKRP-TMGDERVEGRGEDLSH 99 (404) T ss_pred hcCChhH-hhhhhhhhhhhhhccchhhccCCCCCccEEEeecCCCCCCcEEEEeEeeecccCC-cccCceeeccccceeE Confidence 5565533 33444322222222111111 122223444679999998876654222 11222 233456888 Q ss_pred ceEEEEEeeeeecceEEc-hHHHHhhhHHHHHHHH-HHHHHHHHHHHHHHHHHHhhcccc-------------------- Q lcl|NC_011288. 71 TGVDLLIDQEKSIDFLVD-DIDRVQVAGSLEAYTR-AGATALATDTDKFIADLLVDNGTA-------------------- 128 (273) Q Consensus 71 ~~~~~~id~~~~~~~~i~-d~d~~~~~~~~~~~~~-~~~~ala~~~D~~i~~~~~~~~~~-------------------- 128 (273) .+.+++||+.+. ++... ..+..-+..++++..+ .+..-+++..|+.++--+...... T Consensus 100 ~s~~i~Idq~r~-~V~~~g~msqQRt~~dlr~~ar~~L~~w~~~~~d~~~~~~laG~rg~~~n~~~~vp~~~~~~~~~~~ 178 (404) T protein:vir:10 100 ADFSLKINQGRH-LVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFVADDTILPTAEHPEFKKIM 178 (404) T ss_pred EeeEEEEeeecc-cccccCchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccceeecccccccccee Confidence 889999988644 34333 3344445566666554 444557888888877554421110 Q ss_pred -ccccc-----------CC-------CHHHHHHHHHHHHHHHhhcCCCc-------cC-------CEEEECHHHHHHHhh Q lcl|NC_011288. 129 -LSGSA-----------PT-------DADDAFDLIATALKELTKANVPN-------VG-------RVVVVNAEMAFWLRS 175 (273) Q Consensus 129 -~~~~~-----------~~-------t~~~~~~~i~~a~~~l~~~~vP~-------~~-------r~lvv~p~~~~~L~~ 175 (273) +...+ .+ +-...++.|.++.+.+++..-|. +. ++++++|.++..|+. T Consensus 179 ~N~v~APt~~r~~~~g~at~~~~l~stD~~s~~~Id~~~~~~~~~~~pi~Pv~~~g~~~~~~~~~yV~~~~p~q~~~Lr~ 258 (404) T protein:vir:10 179 INDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGEDPYYVLYVTPRQWNDWYT 258 (404) T ss_pred ecccCCCCCCcEEeccCccchhhhhhcccccHHHHHHHHHHHHHhCCCCcceEeccccccCccceEEEEechHHHHHHhh Confidence 00000 01 11124667888888887755442 11 678999999999999 Q ss_pred hHHH--H----hhhhc--ccccceeeeeeeeeEeceEEEeeCcccc--CCCcEE------------------------EE Q lcl|NC_011288. 176 SGSK--L----TSADT--SGDAAGLRAGTIGNLLGARIVESNNLRD--TDDEQF------------------------VA 221 (273) Q Consensus 176 ~~~~--~----~~~~~--~~~~~~l~~G~ig~~~G~~v~~s~~l~~--~~~~~~------------------------~~ 221 (273) +... | .++.. .+..++|..|.+|+|.|+-+++..+.|- ..+..+ +. T Consensus 259 dt~~~~w~d~q~~A~a~~rg~~nPlF~G~~gm~ngvii~~~~~~~Irf~~g~~~~~~~n~~~a~~~~~aa~~~v~RallL 338 (404) T protein:vir:10 259 STSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNLTATTKEVAAATNIDRAMLL 338 (404) T ss_pred CCCcHHHHHHHHHHhhccccccCCceecCeeEEcCEEEEecCCceeeecccceeeecCCccccccccccccccchhheee Confidence 8521 2 22211 2566889999999999999998776541 111111 11 Q ss_pred EcCcee--EEeee----eeeehhhcC-CCceeeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 222 FHPSAA--AYVSQ----IDTVEALRD-QDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 222 ~~~~a~--~~a~~----~~~~e~~~~-~~~~~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) | ..|+ ++++. ..-.|-..| ++.. .|....+.|.+=+|-. ...++ T Consensus 339 G-aQAl~~A~g~~~g~~~~w~Ee~~D~g~~~--~i~~~~i~G~kK~rF~-----~~~g~ 389 (404) T protein:vir:10 339 G-AQALANAYGQKAGGHFNMVEKKTDMDNRT--EIAISWINGLKKIRFP-----EKSGK 389 (404) T ss_pred c-ceeEEEEeeccCCCCceeEeeccccCchh--hhhhHHHhhhhhcccc-----CCCCc Confidence 1 1121 12110 000011111 1111 2333334443322210 00112 No 185 >protein:vir:3298 Length: 404 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:66 # MgeName: 933W # Cross-refs: genbank:acc:NP_049514;genbank:gi:9632520;genbank:GeneID:1262006 Probab=97.71 E-value=2.6e-05 Score=45.70 Aligned_cols=262 Identities=14% Similarity=0.046 Sum_probs=127.9 Q ss_pred CccchhhHHHHHHHHHHHHHHhhccchh--------hcccccccccCCceEEEeecCcccceeecCCCc--ccCCCCCcc Q lcl|NC_011288. 1 MAFNNFIPELWSDMLLEEWTAQTVFANL--------VNREYEGTASKGNVVHIAGVVAPTVKDYKAAGR--QTSADAISD 70 (273) Q Consensus 1 MA~~~~~pev~~~~~~~~~~~~lv~~~~--------v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~--~~~~~~~~~ 70 (273) +.|+..+ .+|...+...-....-+... +.+--+++...||+|+|+....++..- +.++. ....+.+.. T Consensus 22 ~~~~~~~-~~~~~~~~~~~~~~~~~~~~~g~~~~~~I~~~~dL~K~aGd~vtf~L~~~L~g~g-v~Gd~~lEGnee~L~~ 99 (404) T protein:vir:32 22 NRNRSMV-NILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLSKRP-TMGDERVEGRGEDLSH 99 (404) T ss_pred hcCChhH-hhhhhhhhhhhhhccchhhccCCCCCccEEEeecCCCCCCcEEEEeEeeecccCC-cccCceeeccccceeE Confidence 5565533 33444322222222111111 122223444679999998876654222 11222 233456888 Q ss_pred ceEEEEEeeeeecceEEc-hHHHHhhhHHHHHHHH-HHHHHHHHHHHHHHHHHHhhcccc-------------------- Q lcl|NC_011288. 71 TGVDLLIDQEKSIDFLVD-DIDRVQVAGSLEAYTR-AGATALATDTDKFIADLLVDNGTA-------------------- 128 (273) Q Consensus 71 ~~~~~~id~~~~~~~~i~-d~d~~~~~~~~~~~~~-~~~~ala~~~D~~i~~~~~~~~~~-------------------- 128 (273) .+.+++||+.+. ++... ..+..-+..++++..+ .+..-+++..|+.++--+...... T Consensus 100 ~s~~i~Idq~r~-~V~~~g~msqQRt~~dlr~~ar~~L~~w~~~~~d~~~~~~laG~rg~~~n~~~~vp~~~~~~~~~~~ 178 (404) T protein:vir:32 100 ADFSLKINQGRH-LVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGDFVADDTILPTAEHPEFKKIM 178 (404) T ss_pred EeeEEEEeeecc-cccccCchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccceeecccccccccee Confidence 889999988644 34333 3344445566666554 444557888888877554421110 Q ss_pred -ccccc-----------CC-------CHHHHHHHHHHHHHHHhhcCCCc-------cC-------CEEEECHHHHHHHhh Q lcl|NC_011288. 129 -LSGSA-----------PT-------DADDAFDLIATALKELTKANVPN-------VG-------RVVVVNAEMAFWLRS 175 (273) Q Consensus 129 -~~~~~-----------~~-------t~~~~~~~i~~a~~~l~~~~vP~-------~~-------r~lvv~p~~~~~L~~ 175 (273) +...+ .+ +-...++.|.++.+.+++..-|. +. ++++++|.++..|+. T Consensus 179 ~N~v~APt~~r~~~~g~at~~~~l~stD~~s~~~Id~~~~~~~~~~~pi~Pv~~~g~~~~~~~~~yV~~~~p~q~~~Lr~ 258 (404) T protein:vir:32 179 INDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHGEDPYYVLYVTPRQWNDWYT 258 (404) T ss_pred ecccCCCCCCcEEeccCccchhhhhhcccccHHHHHHHHHHHHHhCCCCcceEeccccccCccceEEEEechHHHHHHhh Confidence 00000 01 11124667888888887755442 11 678999999999999 Q ss_pred hHHH--H----hhhhc--ccccceeeeeeeeeEeceEEEeeCcccc--CCCcEE------------------------EE Q lcl|NC_011288. 176 SGSK--L----TSADT--SGDAAGLRAGTIGNLLGARIVESNNLRD--TDDEQF------------------------VA 221 (273) Q Consensus 176 ~~~~--~----~~~~~--~~~~~~l~~G~ig~~~G~~v~~s~~l~~--~~~~~~------------------------~~ 221 (273) +... | .++.. .+..++|..|.+|+|.|+-+++..+.|- ..+..+ +. T Consensus 259 dt~~~~w~d~q~~A~a~~rg~~nPlF~G~~gm~ngvii~~~~~~~Irf~~g~~~~~~~n~~~a~~~~~aa~~~v~RallL 338 (404) T protein:vir:32 259 STSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKVLVSENNLTATTKEVAAATNIDRAMLL 338 (404) T ss_pred CCCcHHHHHHHHHHhhccccccCCceecCeeEEcCEEEEecCCceeeecccceeeecCCccccccccccccccchhheee Confidence 8521 2 22211 2566889999999999999998776541 111111 11 Q ss_pred EcCcee--EEeee----eeeehhhcC-CCceeeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 222 FHPSAA--AYVSQ----IDTVEALRD-QDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 222 ~~~~a~--~~a~~----~~~~e~~~~-~~~~~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) | ..|+ ++++. ..-.|-..| ++.. .|....+.|.+=+|-. ...++ T Consensus 339 G-aQAl~~A~g~~~g~~~~w~Ee~~D~g~~~--~i~~~~i~G~kK~rF~-----~~~g~ 389 (404) T protein:vir:32 339 G-AQALANAYGQKAGGHFNMVEKKTDMDNRT--EIAISWINGLKKIRFP-----EKSGK 389 (404) T ss_pred c-ceeEEEEeeccCCCCceeEeeccccCchh--hhhhHHHhhhhhcccc-----CCCCc Confidence 1 1121 12110 000011111 1111 2333334443322210 00112 No 186 >protein:vir:105610 Length: 430 # NCBI annotation: virion structural protein # Family: family:all:974 # MgeID: mge:1540 # MgeName: F116 # Cross-refs: genbank:acc:YP_164307;genbank:gi:56692923;genbank:GeneID:3197221 Probab=97.51 E-value=5.3e-05 Score=44.04 Aligned_cols=264 Identities=13% Similarity=0.049 Sum_probs=126.1 Q ss_pred Cc--------cchhhHHHHHHHHHHHHHH-hhccchhh------------------------cccccccccCCceEEEee Q lcl|NC_011288. 1 MA--------FNNFIPELWSDMLLEEWTA-QTVFANLV------------------------NREYEGTASKGNVVHIAG 47 (273) Q Consensus 1 MA--------~~~~~pev~~~~~~~~~~~-~lv~~~~v------------------------~~~~~~~~~~Gdtv~ip~ 47 (273) |- ++.....+|++.+...-.+ ...+..++ .|-.+++...||+|+|+. T Consensus 1 ~~~a~T~~~~~~p~a~~~ws~~l~~~~~k~~~~~~kl~G~~~~~~~~~~~~~~~~ts~~~pI~r~~dL~K~~GD~Vtf~L 80 (430) T protein:vir:10 1 MTASKTTMRYGDPNAMIQQAAGLFALCQGRNSTLNRLTGKMPSGTSDAEKKTKGQSSLELPIVQAQDLGRNKGDEVRFHF 80 (430) T ss_pred CcceeeecccCChhHHHHHHHHHHHHHhhhhhhHHHhhccccccccchhhhccCCCCCCccEEEeccCCCCCccEEEEeE Confidence 32 2233467898877655433 22222222 222234445799999988 Q ss_pred cCcccceeecCCCc--ccCCCCCccceEEEEEeeeeecceEEch-HHHHhhhHHHHHHHHHHH-HHHHHHHHHHHHHHHh Q lcl|NC_011288. 48 VVAPTVKDYKAAGR--QTSADAISDTGVDLLIDQEKSIDFLVDD-IDRVQVAGSLEAYTRAGA-TALATDTDKFIADLLV 123 (273) Q Consensus 48 ~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~id~~~~~~~~i~d-~d~~~~~~~~~~~~~~~~-~ala~~~D~~i~~~~~ 123 (273) ...++..-. .++. ....+.+.-.+..++||+.+. ++.+.. .+..-...++++..+... .=+++..|+-++--+. T Consensus 81 ~~~L~g~gv-~Gd~~lEGnee~L~~~~d~l~IDq~R~-~V~~gg~msqQRt~~dlR~~ar~~L~~w~~~~~Dq~~~v~la 158 (430) T protein:vir:10 81 VQPANAFPI-MGSEYAEGKGTGLKIGSDQLRVNQARF-PVDLGDVMSQIRNPYDLRRLGRPKAKWFMDAYLDQSMLVHLA 158 (430) T ss_pred eeccccCce-ecCceeeccccceEEEeeEEEEeeecc-ccccCCchhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 766542211 1111 223356778888999988644 555543 233334556666554433 3466666766554332 Q ss_pred hc-----------------------------cccc-----cccc-------------CCCHHHHHHHHHHHHHHHhhcCC Q lcl|NC_011288. 124 DN-----------------------------GTAL-----SGSA-------------PTDADDAFDLIATALKELTKANV 156 (273) Q Consensus 124 ~~-----------------------------~~~~-----~~~~-------------~~t~~~~~~~i~~a~~~l~~~~v 156 (273) .+ +... ++.+ ..+-...++.|.+++..++.... T Consensus 159 Garg~~~~~~~~~~~~~~~~~~~~~~N~v~aPt~nrh~~~~G~at~~~~~~~~~~sl~stD~~s~~~id~a~~~a~~~~~ 238 (430) T protein:vir:10 159 GARGNHYNKEWCLPLETHPKLADMLVNRVKAPTKNRHFVASADAITGVAPNAGEYNITTADVLDVDVVDSIATYMDQIEL 238 (430) T ss_pred hhhcccccccccccccCCcchhhhhccccCCCCCceeEeecccccccccccccccchhhhcccCHHHHHHHHHHHHhhCC Confidence 11 0100 0000 01112357788889999988753 Q ss_pred Cc-------cC-------CEEEECHHHHHHHhhhHHHH--h-h---hhcccccceeeeeeeeeEeceEEEeeCcc-ccCC Q lcl|NC_011288. 157 PN-------VG-------RVVVVNAEMAFWLRSSGSKL--T-S---ADTSGDAAGLRAGTIGNLLGARIVESNNL-RDTD 215 (273) Q Consensus 157 P~-------~~-------r~lvv~p~~~~~L~~~~~~~--~-~---~~~~~~~~~l~~G~ig~~~G~~v~~s~~l-~~~~ 215 (273) |- +. ++++++|.++..|+.+..+- . + ....+..++|..|.+|+|.|+-+++..++ +... T Consensus 239 ~i~Pv~v~gd~~~g~~~~yV~~~~p~q~~~Lr~dt~~~~wq~~~~a~a~~g~~nPlF~G~~gm~ngvii~~~~~virf~~ 318 (430) T protein:vir:10 239 PPPPVKFEGDEAAEDSPIRVLLCSPAQYNSFAKQEKFRSWQAAALARASNAKQHPIFRVDAGLWSNTLIIKMPKPIRFYA 318 (430) T ss_pred CCcceEeecccccCCccEEEEEechHHHHHHhhCcchHHHHHHHHHhhcccccCCceecceeeecCeEEecCCceeeecC Confidence 31 12 67899999999999997642 1 1 23334567899999999999999987533 1100 Q ss_pred C---cEEEEEcC--------------------------cee--EEeeeeeeehhhcCCCceeeeEEeeeeeeeEE----e Q lcl|NC_011288. 216 D---EQFVAFHP--------------------------SAA--AYVSQIDTVEALRDQDSFSDRIRALHVYGGKV----V 260 (273) Q Consensus 216 ~---~~~~~~~~--------------------------~a~--~~a~~~~~~e~~~~~~~~~~~v~~~~~~g~~v----~ 260 (273) + ..+..... .|+ ++++ ..++..+|. ...-+.=||-++ - T Consensus 319 g~~~~~~a~~~~~~~~~~~~~a~~~~~~~v~RalllGaQA~~~A~g~------~~~~g~~f~-w~Ee~~D~g~~~~i~~~ 391 (430) T protein:vir:10 319 GDTIKYCAAYNSEAESSAVVSDSFGNQYAVDRALLLGGQALAQAWAA------SEHSGMPFF-WSEKDMDHGDKLELLIG 391 (430) T ss_pred CCccccccCCcccccccccccccccccccchhhhhccchhheeeeec------cCCCCccee-eeeeccccCchhhhhhh Confidence 0 00000000 011 1111 001111110 111111111100 0 Q ss_pred cCceEEEEecCC--C Q lcl|NC_011288. 261 RPTGVVVFNKTG--S 273 (273) Q Consensus 261 ~~~~~v~~~~~~--s 273 (273) .=-|+..++=.. + T Consensus 392 ~i~G~kK~rF~~~~~ 406 (430) T protein:vir:10 392 AILGCSKIRFAVEAT 406 (430) T ss_pred HHhccceeeecCCCC Confidence 000111111111 1 No 187 >protein:vir:94528 Length: 286 # NCBI annotation: major head protein # Family: family:all:3269 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223889;genbank:gi:62327101;genbank:GeneID:5075544 Probab=97.47 E-value=2.3e-05 Score=46.04 Aligned_cols=262 Identities=16% Similarity=0.168 Sum_probs=140.6 Q ss_pred Cccch------hhHHHHHHHHHHHHHHhhccchhhcccccccc-cCCceEEEeecCc--ccceeecCC--Cccc--CC-- Q lcl|NC_011288. 1 MAFNN------FIPELWSDMLLEEWTAQTVFANLVNREYEGTA-SKGNVVHIAGVVA--PTVKDYKAA--GRQT--SA-- 65 (273) Q Consensus 1 MA~~~------~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~-~~Gdtv~ip~~~~--~~~~~~~~~--~~~~--~~-- 65 (273) |+.+| .+.+.|.+-|.+.|....+|.+.+--=...++ .+.+|.---+... ..+.+|... .... +. T Consensus 1 m~t~N~n~avr~Y~Kqf~glL~~vf~~qa~F~~~fgglQalDGV~~N~tafsvKt~D~pVVig~Y~TdeNv~FGtgTg~S 80 (286) T protein:vir:94 1 MATTNNDLPVRVYSKEFLQLLSTVYQAQSVFTPTFGALQALDGVPNNATAFSVKTNDMAVVVGEYSTDANTAFGTGTSNS 80 (286) T ss_pred CCCCccccceeehhHHHHHHHHHHHhhHHHhhhhhcchhhhhCCCccceEEEEeecCcceEEecccCCCccccccCCccc Confidence 55442 24555888888889999888765532001112 2333322212111 122334311 1111 11 Q ss_pred CCCccceEEEEEeee-ee-cceEEc-hHHHHhhhHHHH----HHHHHHHHHHHHHHHHHHHHHHhhcccccccccCCCHH Q lcl|NC_011288. 66 DAISDTGVDLLIDQE-KS-IDFLVD-DIDRVQVAGSLE----AYTRAGATALATDTDKFIADLLVDNGTALSGSAPTDAD 138 (273) Q Consensus 66 ~~~~~~~~~~~id~~-~~-~~~~i~-d~d~~~~~~~~~----~~~~~~~~ala~~~D~~i~~~~~~~~~~~~~~~~~t~~ 138 (273) +....-..-+..|.. .| ++..|. .+|....+++++ ..++.++++-.+.+|..+-..+...+... .+-+ T Consensus 81 sRFG~rkEi~y~dtdV~Y~~~~~iHEGiD~~TVNnd~~aaVAdRL~lQA~Akt~~~n~~~Gk~ls~~A~~t-----~~~D 155 (286) T protein:vir:94 81 SRFGEMKEVIYADTDVPYTAGWAIHEGLDQMTVNNDLDAAVADRLNLQAQAKTRLFNVAMGEALATAGTDL-----GAVD 155 (286) T ss_pred cccCceeeEEeecccccccccchhhhccccccccCChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhh-----hhhh Confidence 111111111111111 11 222222 345544444443 34556667777888877665554433221 1124 Q ss_pred HHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCccccCCCcE Q lcl|NC_011288. 139 DAFDLIATALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQ 218 (273) Q Consensus 139 ~~~~~i~~a~~~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~~~~~ 218 (273) .....|.++.+..-+..|- ...-.+|+|+.|..|...+ +.......+.+ +-+--|-++-||.+.+.+.-... .. T Consensus 156 ~V~~LF~~as~~yvn~ev~-~~~~ayV~~evYnaiiD~~--l~TsaK~SsaN-iDengi~~FkGf~i~e~P~~~~~--g~ 229 (286) T protein:vir:94 156 DVNALFESAVEKYTDLEVI-APVRAYVTASVYNAIIDLA--NVTTAKNSAVN-IDTNGMLSFRGIAITKVPTQYMG--GK 229 (286) T ss_pred hHHHHHHHHHHHhhhhhee-eeeEEEEchhHHHHHhccc--cccccccceee-eccCCcceecceEEeecchhhcc--Cc Confidence 5566777788888777774 3334899999999998765 22222222222 33333778999999887743222 33 Q ss_pred EEEEcCceeEEee-eeeeehhhcCCCceeeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 219 FVAFHPSAAAYVS-QIDTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 219 ~~~~~~~a~~~a~-~~~~~e~~~~~~~~~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) ..++.++.++.+- .+........++.-|....|.=-||-.+++..+..+++++-- T Consensus 230 ~aifs~dnig~aftGIn~aR~IesEdF~GValQgAGK~G~~I~edNk~Ai~~~~~k 285 (286) T protein:vir:94 230 AVIFAPDNVARVFTGINIARTIQAIDFAGVELQGAGKYGTFILDDNKKAIFTATPK 285 (286) T ss_pred eEEEccccceeeeccceeeeeeeccccCceeeeccccccccccccCceeEEEeecC Confidence 4455555555443 334445567788889999999999999999887777766554 No 188 >protein:vir:95512 Length: 693 # NCBI annotation: Putative Clp protease # Family: family:all:62 # ACLAME annotation(s): go:0008236 - serine-type peptidase activity; phi:0000017 - phage prohead/capsid assembly # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293349;genbank:gi:148912770;genbank:GeneID:5228164 Probab=97.40 E-value=7.5e-05 Score=43.18 Aligned_cols=260 Identities=11% Similarity=0.092 Sum_probs=140.4 Q ss_pred Ccc--c-hhhHHHHHHHHHHHHHHhh-----ccchhhcccccccccCCceEEEeecCcccceeecCCCcccCCCCCccce Q lcl|NC_011288. 1 MAF--N-NFIPELWSDMLLEEWTAQT-----VFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTG 72 (273) Q Consensus 1 MA~--~-~~~pev~~~~~~~~~~~~l-----v~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~~~~~~~~~~~ 72 (273) ||- + .=.|-++...+.+.+...- .|..++.+..-.++++...+.+-..+.+. ..++++......+.+.. T Consensus 394 ~a~~htTSDFp~IL~~~~nk~l~~~y~~a~~t~~~~~~~~~~~DFk~~~~~~lg~~~~L~---~V~E~gEyk~~t~~e~~ 470 (693) T protein:vir:95 394 LAFTHTSSDFGLILLDVANKSVLAGWEEAEETFPLWTKSGILTDFKPARRVGLGEFSSLR---QVREGAEYKYVTLGERG 470 (693) T ss_pred HHHhcCcchhHHHHHHHHHHHHHHHHHhhhhHHHHHhccCCCCcccccceeecCCCCChh---hcCCCCceeeeecCCcc Confidence 222 1 1125555544444333321 23444444333456766666654444432 23555666666778888 Q ss_pred EEEEEeeeeecceEEchHHHHhhhHHHHH---HHHHHHHHHHHHHHHHHHHHHhhcccccccc-----------cCCCHH Q lcl|NC_011288. 73 VDLLIDQEKSIDFLVDDIDRVQVAGSLEA---YTRAGATALATDTDKFIADLLVDNGTALSGS-----------APTDAD 138 (273) Q Consensus 73 ~~~~id~~~~~~~~i~d~d~~~~~~~~~~---~~~~~~~ala~~~D~~i~~~~~~~~~~~~~~-----------~~~t~~ 138 (273) .++.+..+ +.-|.|+-. +..++|+.. +....+++-++.++..+++.+..++.-..+. ++.... T Consensus 471 e~~~l~ty-G~~~~iTRq--aiINDDLga~~~ip~~~g~aA~~~~~~~vy~~L~~Np~m~DGk~LFhadH~Nl~tga~sa 547 (693) T protein:vir:95 471 EQIILATY-GELFSITRQ--AIINDDLQMLSDIPFKLGQAAKATIGDLVYAVLTGNPAMSDGKTLFHADHSNLLTGAASA 547 (693) T ss_pred ceeehhhc-CCeeeecHH--hhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHhcCccccCCcceeeccccccccccccc Confidence 88888665 455666543 344455544 5567888888999999998887654221111 111123 Q ss_pred HHHHHHHHHHHHHhhcCCC----------ccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEece-EEEe Q lcl|NC_011288. 139 DAFDLIATALKELTKANVP----------NVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGA-RIVE 207 (273) Q Consensus 139 ~~~~~i~~a~~~l~~~~vP----------~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~-~v~~ 207 (273) ...+.+.+++..|..++.+ -..++++++|+......+ .+......+ .....|.+--+.|+ +++. T Consensus 548 ls~~sl~~a~~am~~qk~~~~~~~g~~L~i~P~~llvP~~le~~a~~---l~~s~~~~~--a~~~~~~~NP~~~~~~vi~ 622 (693) T protein:vir:95 548 LSIDSLSKAKTQMATQKAQVEKGKGRTLNIRPGFVLTPVALEDKANQ---IINSESVPG--ADVNSGIVNPIRAFAQVIG 622 (693) T ss_pred cChHHHHHHHHHHHHhhcchhccCCceeecccceEEecchHHHHHHH---Hhccccccc--cccccccccchhccccccc Confidence 3456677777777655422 134688888887665543 222111111 11223445556664 6777 Q ss_pred eCccccCCCc-EEEEEcCc--eeEEe--ee--eeeehhhcCCCceeeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 208 SNNLRDTDDE-QFVAFHPS--AAAYV--SQ--IDTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 208 s~~l~~~~~~-~~~~~~~~--a~~~a--~~--~~~~e~~~~~~~~~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) .+.|....+. .+++..+. .+-++ .. ...+|....-+.-|-.++.++-||++++|--+++ |.+|. T Consensus 623 ~prL~~~s~~~Wyl~a~~~~dtie~~yL~G~~~P~ie~~~gf~~dG~~~kvr~D~G~~~iD~Rg~~--kn~GA 693 (693) T protein:vir:95 623 EPRLDDASATAWYMAAKKGSDTIEVAYLDGVDTPYLEQQEGFTVDGVASKVRIDAGVAPLDFRGLQ--KSNGA 693 (693) T ss_pred cceecCCCCCceEEecCCCCCeEEEEEecCCCCCeEeecCCCCcceEEEEEEEeccCceeeccccc--cCCCC Confidence 7777654333 34444433 23222 21 1233333333344668889999999999999875 45666 No 189 >protein:vir:97397 Length: 517 # NCBI annotation: major capsid protein # Family: family:all:11745 # MgeID: mge:1675 # MgeName: Q54 # Cross-refs: genbank:acc:YP_762590;genbank:gi:115304291;genbank:GeneID:5130600 Probab=97.22 E-value=0.00011 Score=42.33 Aligned_cols=254 Identities=10% Similarity=-0.009 Sum_probs=109.1 Q ss_pred Cc-------c-chhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeecCCCcccCCCCCccce Q lcl|NC_011288. 1 MA-------F-NNFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTG 72 (273) Q Consensus 1 MA-------~-~~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~~~~~~~~~~~ 72 (273) +. . ....|.-+...+...+.....+...+... ......+|............++......+++... T Consensus 237 ~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~i~~~~~~~------~i~~~~~~~~~~~~~a~~~~eG~~kp~s~~tf~~ 310 (517) T protein:vir:97 237 WTAELKERGISGMPAPAGILKRIQDAVNDEGSLLPFIRHE------NLPTLVVGGDNALTQGTGHTTGTDKTESNITLQT 310 (517) T ss_pred eeeecccccccccccchHHHHHHHHhhhhhccceeeeeec------cccceeeecccccceeeeeecCCcccccccceee Confidence 00 0 01235555555555554444333333221 1123334433322222233334433344555566 Q ss_pred EEEEEeeeeecceEEchHHHHhhhHH----HHH-HHHHHHHHHHHHHHHHHHHHHhhccc-----cccc---ccC-CCHH Q lcl|NC_011288. 73 VDLLIDQEKSIDFLVDDIDRVQVAGS----LEA-YTRAGATALATDTDKFIADLLVDNGT-----ALSG---SAP-TDAD 138 (273) Q Consensus 73 ~~~~id~~~~~~~~i~d~d~~~~~~~----~~~-~~~~~~~ala~~~D~~i~~~~~~~~~-----~~~~---~~~-~t~~ 138 (273) +++.+.+ .+.-+.++..-......+ +++ +..++.+.|+++.+..++.=-..... .... +.. .... T Consensus 311 ~~~~~~~-ia~~~~~S~qll~Ds~~dd~~~l~s~i~~~l~~~l~~~ee~a~l~GdGtg~~~~gi~~~a~~~~~~~~~~~~ 389 (517) T protein:vir:97 311 RVLTPQY-VYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTGVSETQIYPVVGDAWATNVTGTT 389 (517) T ss_pred EEeeHhh-hhhhhhhhHHHHHHhhhccHHHHHHHHHHHHHHHHHHHHHHHHhcccCCCcccccccccccccccccccccc Confidence 6665533 233344444322222222 666 45678889999999887631000000 0000 001 1111 Q ss_pred HHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCccccCCCcE Q lcl|NC_011288. 139 DAFDLIATALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQ 218 (273) Q Consensus 139 ~~~~~i~~a~~~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~~~~~ 218 (273) ...+.+......+. +..+-.+|++|..+..|.+-..... .+-....+..+....++|+.-.++. ++. +.. T Consensus 390 ~~~d~i~~l~~a~~----~a~~a~~vmn~~t~~~I~klKD~~G---~Yl~~~~~~~~~~~~l~G~~~~~~~-~~~--~~~ 459 (517) T protein:vir:97 390 NIQELLEKLSVATP----KAADSTLVIHRNDLAAIRFLKDKNG---NYVFPVGVSNQTIATHFGFNRLVQS-VAV--DEK 459 (517) T ss_pred hHHHHHHHHHHHhh----hccCCEEEECHHHHHHHHHhhcCCC---CeeccCcCCcccccccCCccccccc-ccc--Cce Confidence 22222222222222 2234567899999998865432111 1112223344555666775333321 221 222 Q ss_pred EEEEcCceeEEeeeeeeehhhcCC--CceeeeEEeeeeeeeEEecCceEE--EEecCCC Q lcl|NC_011288. 219 FVAFHPSAAAYVSQIDTVEALRDQ--DSFSDRIRALHVYGGKVVRPTGVV--VFNKTGS 273 (273) Q Consensus 219 ~~~~~~~a~~~a~~~~~~e~~~~~--~~~~~~v~~~~~~g~~v~~~~~~v--~~~~~~s 273 (273) .+.+..+- ..+... .++..++- ..-.+.+...++.|..|..|+.++ +++++.+ T Consensus 460 ~~~~~~~y-~i~~~~-g~~~~~~fd~~~n~~~f~~~~~~~g~i~~~~r~a~~~~~p~~~ 516 (517) T protein:vir:97 460 TAVSLSGY-VTNGSR-GMEFEQGTILVENNKEYLFEMPISGSLEYKGTTAYGTYTPPVA 516 (517) T ss_pred eEeecccc-EEEeec-ceeeeeeeecccCceeEeeeeeeccccccccceEEEEEcCCCC Confidence 22332221 111110 01112221 112345666778888899999555 7777777 No 190 >protein:vir:97255 Length: 310 # NCBI annotation: hypothetical protein ORF017 # Family: family:all:1120 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294525;genbank:gi:149408246;genbank:GeneID:5237120 Probab=96.95 E-value=0.00024 Score=40.43 Aligned_cols=263 Identities=12% Similarity=0.066 Sum_probs=121.4 Q ss_pred Cc-cc-----hhhHHHHHHHHHHHHHHhhc-cchhhcccccccccCCceEEEeecCc---ccceeecC--CCcccCCCCC Q lcl|NC_011288. 1 MA-FN-----NFIPELWSDMLLEEWTAQTV-FANLVNREYEGTASKGNVVHIAGVVA---PTVKDYKA--AGRQTSADAI 68 (273) Q Consensus 1 MA-~~-----~~~pev~~~~~~~~~~~~lv-~~~~v~~~~~~~~~~Gdtv~ip~~~~---~~~~~~~~--~~~~~~~~~~ 68 (273) |+ .+ .+.+......+++.|.+.+- +..+ .+.-..|.+.+..+... .+..+..- ...+...... T Consensus 1 mpaltLaea~k~~~d~l~~~ViE~~~~~s~lL~~L-----pF~~veg~~~~ynR~~~~~~~~~~~v~~~~~~~g~~~~~~ 75 (310) T protein:vir:97 1 MASVTLAESAKLAQDELVAGVIENIITVNRMFDVL-----PFDSIEGNSLAYNRENVLGDVIMAGVGTTFSGAGAGKAAA 75 (310) T ss_pred CcccchHHHhhcCcchHHHHHHHHHhccchHHHhC-----CcccccCCcceeeEeeccCCcccccccccccCCCcccccc Confidence 77 43 23455567777777765442 2222 22223355666655432 22222110 0111112233 Q ss_pred ccceEEEEEeeeeecceEEch-HHHHh-h-hHH-HHHHHHHHHHHHHHHHHHHHHH----------HHhhcccccccccC Q lcl|NC_011288. 69 SDTGVDLLIDQEKSIDFLVDD-IDRVQ-V-AGS-LEAYTRAGATALATDTDKFIAD----------LLVDNGTALSGSAP 134 (273) Q Consensus 69 ~~~~~~~~id~~~~~~~~i~d-~d~~~-~-~~~-~~~~~~~~~~ala~~~D~~i~~----------~~~~~~~~~~~~~~ 134 (273) +.+.++..+. -..-.+.|+. +..+. . ..+ +...+++..+++.++.+..+++ +...........+. T Consensus 76 t~~~~~~~L~-i~~g~~~Vd~~i~dl~~~~~~dq~~~Ql~~~iea~~~~~e~~lINGD~a~n~F~GL~~~~~~~q~i~~~ 154 (310) T protein:vir:97 76 TFTKVNSNLT-TIMGDAEVNGLIQATRSGDGNDQTAVQIASKAKSAGRKYQDQLINGNGAGNEFAGLIQLCASGQKATTG 154 (310) T ss_pred ccceeeeeee-eeeehhhhhhHHHhhhcCChHHHHHHHHHHHHHHHHHHHHHHhhccccCCCcccchhhcCCccceeecC Confidence 3444444442 1222334432 11121 1 223 4556888889999998877764 22211111100111 Q ss_pred -CCHHHHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCcccc Q lcl|NC_011288. 135 -TDADDAFDLIATALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRD 213 (273) Q Consensus 135 -~t~~~~~~~i~~a~~~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~ 213 (273) ..+..+++++.+.....-+. .-+..+++.+|.++..+..--+...............--.+-.|.|++|+.++.+|. T Consensus 155 ~~gg~~t~d~LDeLl~~v~~~--~g~p~~~l~~~~~~r~i~A~~R~~~~~g~~~~~~~~~G~~v~~~~GiPi~~~d~ip~ 232 (310) T protein:vir:97 155 ATGSAISFAILDELMDLVVDK--DGQVDYLTMHARTLRSYKALLRALGGASINEVVELPSGAEVPAYSGTPIFRNDYIPT 232 (310) T ss_pred CCCCCCCHHHHHHHHHHHhcC--CCCCCEEEecHHHHHHHHHHHHHhcCCCCCCccccCCCCEEeeeCCeEEEEeCccCC Confidence 11222344444443333211 124468999998766554322222211222222222222467899999999999986 Q ss_pred CC------C-cEEEEEcC--c----e-eEEee---eeeeehhhc---CCCceeeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 214 TD------D-EQFVAFHP--S----A-AAYVS---QIDTVEALR---DQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 214 ~~------~-~~~~~~~~--~----a-~~~a~---~~~~~e~~~---~~~~~~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) +. + ..+++..- + + +|.-. -...++... +..-+.+.| .++||.-++.|+++++|+.--- T Consensus 233 ~~~~~~~~gtTsIya~r~Ge~~~~~Gv~Gl~~~~~~glsVr~~G~~~~~~v~~~~V--~~Y~~~av~~~~A~a~L~~V~~ 310 (310) T protein:vir:97 233 NQTKGGTTGCTTIFAGTLDDGSRTHGIAGLTATQAAGIQVVDVGESEDSDEHIWRV--KWYCGLALFSEKGLACADGITN 310 (310) T ss_pred CccccccCCceeEEEEeeCccccccceeccccCCccceeEEeCCcccCCcceeEEE--EEeeeEEEecccceeeeccccC Confidence 42 1 22222211 1 1 12110 012333222 233344444 6799999999999999954322 No 191 >protein:vir:94933 Length: 330 # NCBI annotation: putative phage structural protein # Family: family:all:1120 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239278;genbank:gi:66392060;genbank:GeneID:5076578 Probab=96.48 E-value=0.00059 Score=38.30 Aligned_cols=261 Identities=11% Similarity=-0.007 Sum_probs=114.7 Q ss_pred Cccch------hhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcc---cceeecCCCcccCCCC-Ccc Q lcl|NC_011288. 1 MAFNN------FIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAP---TVKDYKAAGRQTSADA-ISD 70 (273) Q Consensus 1 MA~~~------~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~---~~~~~~~~~~~~~~~~-~~~ 70 (273) |+.-+ +.|.-....+++.|.+.+-+.... .+.-..|.+.+.++.... +..+.+.+ ..++. .+. T Consensus 25 m~alTLaea~~l~~d~~~~~VIE~l~~~s~iL~~l----pf~~ve~~~~~~~r~~~lp~a~~r~~n~~---~~~~~~~Tf 97 (330) T protein:vir:94 25 MPTVTLAESAKLSQDHLVSGLIETIVEVNPLYEMM----PFTEIEGNALAYNRENVLGDVQFLAVGGT---ITAKNPATF 97 (330) T ss_pred hhhhhhhHHhhcCchhhHHHHHHhhhccchHHhhc----ccccccCCcceeeeeecCCcceeeecccc---ccccCccee Confidence 55332 234455666777776553222111 111123445555554333 33332221 11111 111 Q ss_pred ceEEEEEeeeeecceEEchHHHHhhh---HH-HHHHHHHHHHHHHHHHHHHHHH----------HHhhccccccccc-CC Q lcl|NC_011288. 71 TGVDLLIDQEKSIDFLVDDIDRVQVA---GS-LEAYTRAGATALATDTDKFIAD----------LLVDNGTALSGSA-PT 135 (273) Q Consensus 71 ~~~~~~id~~~~~~~~i~d~d~~~~~---~~-~~~~~~~~~~ala~~~D~~i~~----------~~~~~~~~~~~~~-~~ 135 (273) .+++..+.-. .-.+.|+. ..++.. .+ +.+..+...++|+++.+..++. ++..........+ +. T Consensus 98 ~q~t~~l~~l-~~~~~Vd~-~iadl~g~~~d~~~~q~~~~ieal~~~~e~~linGDs~~~~F~GL~~~~~~~q~i~tg~~ 175 (330) T protein:vir:94 98 TKVTSELTTL-IGDAEVNG-LIQATRSDFMDQTSVQVASKAKSIGRQYQASMITGDGTGNSFQGMMGLVAASQTISAGAN 175 (330) T ss_pred eeeeechhhh-hhhHHHHH-HHHHhcCCHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccchhhcCCcccEEecCCC Confidence 2333332111 11122221 122222 24 3456778888898888777664 1111110000001 11 Q ss_pred CHHHHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeee-eeeeEeceEEEeeCccccC Q lcl|NC_011288. 136 DADDAFDLIATALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAG-TIGNLLGARIVESNNLRDT 214 (273) Q Consensus 136 t~~~~~~~i~~a~~~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G-~ig~~~G~~v~~s~~l~~~ 214 (273) ++..+++++.++....-.. +-+.-+++++..+...+..-.+.............+ -| .|-.|.|++|+.++.+|.+ T Consensus 176 gg~~T~d~LDeLl~~v~~~--~g~~~~~l~n~a~~r~I~a~~R~~~~~~v~~~~~~~-~G~~v~~~~GvPi~~~d~ip~~ 252 (330) T protein:vir:94 176 GGTLTFELLDQLLDLVKDK--DGQVDYLMSSFAMRRKYFSLLRALGGAAIGEVMTLP-SGRQIPTYRGVPWFVNDFIPSN 252 (330) T ss_pred CCCCCHHHHHHHHHHhcCC--CCCCcEEEechhHHHHHHHHHHhccCCCCCCccccc-CCCEEeeeCCeEEEecccccCC Confidence 2222334444333332111 123458888888877775532222211111111112 24 3677999999999999874 Q ss_pred CC-------cEEEEEc--C-----ceeEEee---eeeeehhhc-CCCceeeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 215 DD-------EQFVAFH--P-----SAAAYVS---QIDTVEALR-DQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 215 ~~-------~~~~~~~--~-----~a~~~a~---~~~~~e~~~-~~~~~~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) .. ..+++.. . .-+|.-. ....++... ...+......-.++||..++.|+++.+|+.-.- T Consensus 253 ~~~~~~~~ttsIyav~~G~~~~~qgV~Gl~~~g~~glsVr~~G~~~~k~v~~~~v~~y~~~av~~~~a~~~L~~V~~ 329 (330) T protein:vir:94 253 MTQGTATNATAIFAGTFDDGSNKYGIAGLTARGSAGLRVQNVGAKENADETITRVKMYCGFANFSQLGLAAIKGLIP 329 (330) T ss_pred CCcccCCCceeEEEEeecccccccceEeecCCCCCcceeeeCCCccccceeeEEEEEeeeeEEechhheeeeccccC Confidence 32 1222221 0 1112110 012232221 111112223447899999999999999976655 No 192 >protein:vir:4074 Length: 480 # NCBI annotation: major capsid (head) protein # Family: family:all:11745 # MgeID: mge:85 # MgeName: c2 # Cross-refs: genbank:acc:NP_043553;genbank:gi:9628687;genbank:GeneID:1261180 Probab=95.38 E-value=0.00064 Score=38.10 Aligned_cols=257 Identities=16% Similarity=0.113 Sum_probs=98.7 Q ss_pred Cccchh-hHHHHHHH----HHHHHHHhhccch-----hhcccccc-cccC---------CceEEEeec-CcccceeecCC Q lcl|NC_011288. 1 MAFNNF-IPELWSDM----LLEEWTAQTVFAN-----LVNREYEG-TASK---------GNVVHIAGV-VAPTVKDYKAA 59 (273) Q Consensus 1 MA~~~~-~pev~~~~----~~~~~~~~lv~~~-----~v~~~~~~-~~~~---------Gdtv~ip~~-~~~~~~~~~~~ 59 (273) +..+.- ....+... .++..++.+.... ..-+.+.. .... +-++..... ......+.... T Consensus 184 ~~~e~r~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~e~~~~ 263 (480) T protein:vir:40 184 ERKFMRELGSKMAEMPEQGFLREFANGADLNVVNSLGSITSKYARKSGIYDGAMKARFQGLTLAEDGVDDTFISGTFKAG 263 (480) T ss_pred hhHHHHHHHHHhccchhhhhhhhhhhhccccccccccccccchhhheeechhhhhhhhhcceeeeccccceeeeeeeecc Confidence 111100 00001110 0111111110000 00000000 0000 001100000 00000010000 Q ss_pred CcccCCCCCccceEEEEEeeee-ecceEEchHHHHhh--hHHHHH-HHHHHHHHHHHHHHHHHHHHHhhcccc------- Q lcl|NC_011288. 60 GRQTSADAISDTGVDLLIDQEK-SIDFLVDDIDRVQV--AGSLEA-YTRAGATALATDTDKFIADLLVDNGTA------- 128 (273) Q Consensus 60 ~~~~~~~~~~~~~~~~~id~~~-~~~~~i~d~d~~~~--~~~~~~-~~~~~~~ala~~~D~~i~~~~~~~~~~------- 128 (273) +...+.. +..+..+..+. +.-..+........ ..++++ +..++++.++.+.+..++.--...... T Consensus 264 ~~~~~~~----~~~~~~~~~~~v~~l~~~~k~t~~lLDDa~~l~~~i~~~l~~~~~~~ee~a~l~G~g~g~~~~~g~~~~ 339 (480) T protein:vir:40 264 TDKNKSQ----TATKRSLRPQMAEAYLQMDKATVRGVNDSGALSEYVMSEMVNRVIQKVEYNMILGSVDGSNGFYGLKTA 339 (480) T ss_pred ccccccc----ccccchhhHHHHHHHHHhHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccccceee Confidence 0000000 00011110000 00011111111111 123666 456777788888777665321010000 Q ss_pred -cccccCCCHHHHHHHHHHHHHHHhhcCCCccCC-EEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEE Q lcl|NC_011288. 129 -LSGSAPTDADDAFDLIATALKELTKANVPNVGR-VVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIV 206 (273) Q Consensus 129 -~~~~~~~t~~~~~~~i~~a~~~l~~~~vP~~~r-~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~ 206 (273) ...+...+..+.++.|..+. .+.. ..+. .+|++|..++.|.+-...- +.+...+.+..|....+.|++++ T Consensus 340 ~~~~~~~~~~~d~id~L~~al---~~~y--~~~a~~~vmn~~t~~~I~klKD~~---G~Yi~q~~~~~~~~~~llG~pvv 411 (480) T protein:vir:40 340 TDGWTKQIEYTDLFEGITDAV---AECS--ISDAITIVMSPQTFAELRKAKGTD---GHSRFNELATKEQIAQSFGAVNL 411 (480) T ss_pred cccccccchhHHHHHHHHHhh---hHHh--hCCCCEEEECHHHHHHHHHhhcCC---CCeeccCcccccCcceeccccee Confidence 00112233444444443333 2222 1233 5789999999886533211 22334455667888999999988 Q ss_pred eeC-ccccCCCcEEEEEcCceeEEeeeeeeehhhcCC--CceeeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 207 ESN-NLRDTDDEQFVAFHPSAAAYVSQIDTVEALRDQ--DSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 207 ~s~-~l~~~~~~~~~~~~~~a~~~a~~~~~~e~~~~~--~~~~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) +++ ..|. +.-.+.....++.+..+ .++.+++- .+-...+....+.|..+.+|+++..++..|| T Consensus 412 ~~~~~~~~--~~~~~~~~~~~~~~~d~--~~~~~~~~~~~~~~~~~~~e~~v~g~~~~~~~~~~~~~~~~ 477 (480) T protein:vir:40 412 ETRVWMPK--DEVAVYNHDEYVLIGDL--NVENYNDFDLRYNVEQWLSETLVGGSIRGKNRSAYLKKKGS 477 (480) T ss_pred eeeccccC--CcceeeeCCccEEEEec--ccceecccccccchhhhhhhhhhceeeEccccEEEEEeccC Confidence 754 3332 11122222333333332 23323221 1223456667788899999999999999999 No 193 >protein:vir:80068 Length: 301 # NCBI annotation: gp8 # Family: family:all:463 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468712;genbank:gi:157325292;genbank:GeneID:5601759 Probab=94.83 E-value=0.0034 Score=34.12 Aligned_cols=258 Identities=11% Similarity=0.080 Sum_probs=121.1 Q ss_pred Cccc---hhhHHHH---HHHHHHHHHHhhccchhhccccccccc-CCceEEEeecCccccee-ecCCCcccCCCCCccce Q lcl|NC_011288. 1 MAFN---NFIPELW---SDMLLEEWTAQTVFANLVNREYEGTAS-KGNVVHIAGVVAPTVKD-YKAAGRQTSADAISDTG 72 (273) Q Consensus 1 MA~~---~~~pev~---~~~~~~~~~~~lv~~~~v~~~~~~~~~-~Gdtv~ip~~~~~~~~~-~~~~~~~~~~~~~~~~~ 72 (273) |=+. .|..+.| -.++.+.+...++...++.... .+. ...++.++.....+... +......+...+..-+. T Consensus 1 ~~~~~~g~f~~~~l~~id~~v~e~~~~~l~~r~l~~v~~--~~~~~~~~~~~~~~~~~G~~~~~~~~~~dip~~~~~~~~ 78 (301) T protein:vir:80 1 MQGKITATIEARDLQAIDNVIYEPKQEELTARSVFPQKF--DVNEGAESYSFDVMTRSGAAKIIANGADDLPLVDVDMVR 78 (301) T ss_pred CCccccchhhHHHHHHHHHHHHHhhhhhhhhhhhccccc--CCCCceEEEEEeeeccceeEEEecCccccccccccccee Confidence 5444 3444444 4445555555555555443221 222 24566777765554333 22222223333455566 Q ss_pred EEEEEeeeeecceEEchHH--HHhhh-HHHH-HHHHHHHHHHHHHHHHHHHH---------HHhhccc--cccc------ Q lcl|NC_011288. 73 VDLLIDQEKSIDFLVDDID--RVQVA-GSLE-AYTRAGATALATDTDKFIAD---------LLVDNGT--ALSG------ 131 (273) Q Consensus 73 ~~~~id~~~~~~~~i~d~d--~~~~~-~~~~-~~~~~~~~ala~~~D~~i~~---------~~~~~~~--~~~~------ 131 (273) ....|-. .+.+|.+...| ..... .++. +....+++++++..|+.+|- ++...+. ...+ T Consensus 79 ~~~~i~~-~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~f~G~~~~g~~GLlN~p~~~~~~~~~~~~~~ 157 (301) T protein:vir:80 79 KSVPIYS-IGIGLSYTIQDLRAARMQGTTVDAAKATTVRRAIAEKENSIAFRGEKKYAIKGAFEATGIQIDVSPTTGVGN 157 (301) T ss_pred EEEEEEE-EEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEeeecccccceeeecCCCcccccccCccccc Confidence 6666633 45666666444 44333 3454 35666777888888776552 1111100 0000 Q ss_pred ---ccCCCHHHHHHHHHHHHHHHhhc--CCCccCCEEEECHHHHHHHhhhHHHHhh-hhcccccceeeeeeeeeEeceEE Q lcl|NC_011288. 132 ---SAPTDADDAFDLIATALKELTKA--NVPNVGRVVVVNAEMAFWLRSSGSKLTS-ADTSGDAAGLRAGTIGNLLGARI 205 (273) Q Consensus 132 ---~~~~t~~~~~~~i~~a~~~l~~~--~vP~~~r~lvv~p~~~~~L~~~~~~~~~-~~~~~~~~~l~~G~ig~~~G~~v 205 (273) -...|+..++++|.++...+..+ ++ ...-.|+|+|+.|..|..- ...+ .+...- +-+++ +.-+..| T Consensus 158 ~~~w~~~t~~ei~~di~~~~~~l~~~s~g~-~~p~~L~L~p~~~~~L~~~--~~~~~~~~tvl-~~l~~----~~~~~~I 229 (301) T protein:vir:80 158 VSKWEKKTAEQIIDEIGEAHTKITVLPGYG-TASLKLCLPPKQFELINKK--RYSNEDSRSVL-KVLQD----NAWFSAI 229 (301) T ss_pred ccccccCCHHHHHHHHHHHHHHHHHhcCce-ecccEEEecHHHHHhhhhc--cccCCCCeeHH-HHHHH----HcCcceE Confidence 01236778899999999998654 32 1224699999999988431 0100 000000 01110 1122344 Q ss_pred EeeCccccC---CCcEEEEEcC--ceeEEe--eeeeeehhhcCCCceeeeEEeeeee-eeEEecCceEEEEecC Q lcl|NC_011288. 206 VESNNLRDT---DDEQFVAFHP--SAAAYV--SQIDTVEALRDQDSFSDRIRALHVY-GGKVVRPTGVVVFNKT 271 (273) Q Consensus 206 ~~s~~l~~~---~~~~~~~~~~--~a~~~a--~~~~~~e~~~~~~~~~~~v~~~~~~-g~~v~~~~~~v~~~~~ 271 (273) +..+.+... ....+++..+ .-+.++ ..+...-.++... ...+...... |+-+.+|++++.+.-= T Consensus 230 ~~~p~L~~~g~~g~~~~v~~~~~~d~~~~~v~~~~~~~~~e~~~~--~~~~~~~~r~~Gv~i~~P~ai~~~~GI 301 (301) T protein:vir:80 230 VRVPDLAGMGTAGSDSFAVIHDSNETAELIIPMDITRHPEEYSFP--RTKVPFEERTAGVVVRFPAAIVRVDGI 301 (301) T ss_pred EEcceeccCCCCcccEEEEEecCCcEEEEEecCceeeecceecCc--eeEeeeeeeeEEEEEEccceEEEEecC Confidence 444444321 1122333332 212221 1111111112222 2334344444 6788899999988666 No 194 >protein:vir:107687 Length: 319 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003898;genbank:gi:45686314;genbank:GeneID:2773027 Probab=94.39 E-value=0.0046 Score=33.42 Aligned_cols=258 Identities=10% Similarity=0.020 Sum_probs=118.0 Q ss_pred Cccc-----hhhHHHHH---HHHHHHHHHhhccchhhcccccccccC-CceEEEeecCcccc-eeecCCCcccCCCCCcc Q lcl|NC_011288. 1 MAFN-----NFIPELWS---DMLLEEWTAQTVFANLVNREYEGTASK-GNVVHIAGVVAPTV-KDYKAAGRQTSADAISD 70 (273) Q Consensus 1 MA~~-----~~~pev~~---~~~~~~~~~~lv~~~~v~~~~~~~~~~-Gdtv~ip~~~~~~~-~~~~~~~~~~~~~~~~~ 70 (273) |... .|..+.|. .++.+.....++...++.... .+.. -.+++++.....+. .-+......+..-+..- T Consensus 24 ~~da~~~~g~~~~~ql~~id~~v~e~~~~~l~~~~~i~v~~--~~~~~~~~~~~~~~~~~G~a~~~~d~~~dip~v~~~~ 101 (319) T protein:vir:10 24 KQDAAATMGIWTAQELHRIKSQSYEEDYPVGSALRVFPVTT--ELSPTDKTFEYMTFDKVGTAQIIADYTDDLPLVDALG 101 (319) T ss_pred hhhhhhhhhhHHHHHHHHHHHHHHhhhhcceechhhccccc--CCCCceEEEEeeeeccccceeeecCccccccceeccc Confidence 1110 23333332 233333334444444443221 2222 33566666544332 22222222222334555 Q ss_pred ceEEEEEeeeeecceEEchHHH--Hhhh-HHHH-HHHHHHHHHHHHHHHHHHHH---------HHhhcccc---c---cc Q lcl|NC_011288. 71 TGVDLLIDQEKSIDFLVDDIDR--VQVA-GSLE-AYTRAGATALATDTDKFIAD---------LLVDNGTA---L---SG 131 (273) Q Consensus 71 ~~~~~~id~~~~~~~~i~d~d~--~~~~-~~~~-~~~~~~~~ala~~~D~~i~~---------~~~~~~~~---~---~~ 131 (273) +.....|-. .+..+.++..|. .... .++. +....+++++++..|+.++- ++...+.. . .. T Consensus 102 ~~~~~~i~~-~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i~f~G~~~~g~~GLlN~p~~~~~~~~~~~~ 180 (319) T protein:vir:10 102 TSEFGKVFR-LGNAYLISIDEIKAGQATGRPLSTRKASACQLAHDQLVNRLVFKGSAPHKIVSVFNHPNITKITSGKWID 180 (319) T ss_pred eeeEEEEEE-EEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEeecccccceeEEeCCCceeeecCCCCC Confidence 566666633 456666665444 3322 3454 35566677788887766541 11111000 0 01 Q ss_pred ccCCCHHHHHHHHHHHHHHHhhc--CCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeC Q lcl|NC_011288. 132 SAPTDADDAFDLIATALKELTKA--NVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESN 209 (273) Q Consensus 132 ~~~~t~~~~~~~i~~a~~~l~~~--~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~ 209 (273) ....++..++++|.++...|..+ ++ ...-.|+|+|+.|..|..- ..+.+...- +-+++ +.-+.+|...+ T Consensus 181 ~~t~t~~~i~~di~~~~~~l~~~s~g~-~~p~~L~L~p~~~~~L~~~---~~~~~~t~l-~~lk~----~~~~l~I~~~p 251 (319) T protein:vir:10 181 VSTMKPETAEAELTQAIETIETITRGQ-HRATNILIPPSMRKVLAIR---MPETTMSYL-DYFKS----QNSGIEIDSIA 251 (319) T ss_pred ccccCHHHHHHHHHHHHHHHHHhcCce-eeceEEEecHHHHHhhhcc---cCCCCeeHH-HHHHH----hcCCceEEEee Confidence 12235678899999998888644 43 1234799999999888431 111111000 01110 12345566655 Q ss_pred ccccCCC---cEEEEE--cCceeEEeeeeeeehhhc-CCCceeeeEEeeeee-eeEEecCceEEEEecC Q lcl|NC_011288. 210 NLRDTDD---EQFVAF--HPSAAAYVSQIDTVEALR-DQDSFSDRIRALHVY-GGKVVRPTGVVVFNKT 271 (273) Q Consensus 210 ~l~~~~~---~~~~~~--~~~a~~~a~~~~~~e~~~-~~~~~~~~v~~~~~~-g~~v~~~~~~v~~~~~ 271 (273) .+....+ ...++. .+.-+.++...+ +.... ........+...... |+-+.+|++++.+.-= T Consensus 252 el~~ag~~g~~~~v~y~~~~~~~~~~v~~~-~~~~~~e~~~l~~~~~~~~r~~Gv~i~~P~ai~~~dGI 319 (319) T protein:vir:10 252 ELEDIDGAGTKGVLVYEKNPMNMSIEIPEA-FNMLPAQPKDLHFKVPCTSKCTGLTIYRPMTIVLITGV 319 (319) T ss_pred eecccCCCcceEEEEEecCCceEEEecCcc-eeeeeeeecCceEEEeeeeeeEEEEEEccceeEeeecC Confidence 5543221 122333 233333332111 11111 112234455555544 5778899999987655 No 195 >protein:vir:99424 Length: 360 # NCBI annotation: hypothetical protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919080;genbank:gi:119757038;genbank:GeneID:4606077 Probab=92.21 E-value=0.012 Score=31.04 Aligned_cols=261 Identities=10% Similarity=0.053 Sum_probs=101.7 Q ss_pred Cccc---------------------------hhhHHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCccc- Q lcl|NC_011288. 1 MAFN---------------------------NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPT- 52 (273) Q Consensus 1 MA~~---------------------------~~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~- 52 (273) |+++ .|-|+++.. +++...+...|.+.++.- ...-.+..|++++... T Consensus 1 ~~~~~~~~~~~n~~~~~i~k~~it~~~l~~g~L~p~~a~~-Fl~~v~~~t~iL~~~r~~----~~~s~~~ei~kig~G~r 75 (360) T protein:vir:99 1 MSSNSTIDSVRNQNMNSLSQKDIGLAELDGFQLPVDVTEE-FLERMQKGVQILGMADTM----TLARLEMEVPQFGVPRL 75 (360) T ss_pred CcchhHHHHHhhhHHHHHHhhhccccccCceeecHHHHHH-HHHHHhhccchhhhccee----eccccccccccccccee Confidence 3332 234776654 444555555555555331 1123344555544321 Q ss_pred -ceeecCCCcccCCCCCccceEEE-EEeeeeecceEEchHHHH-hhhHH---HHH-HHHHHHHHHHHHHH-------HH- Q lcl|NC_011288. 53 -VKDYKAAGRQTSADAISDTGVDL-LIDQEKSIDFLVDDIDRV-QVAGS---LEA-YTRAGATALATDTD-------KF- 117 (273) Q Consensus 53 -~~~~~~~~~~~~~~~~~~~~~~~-~id~~~~~~~~i~d~d~~-~~~~~---~~~-~~~~~~~ala~~~D-------~~- 117 (273) .+.+..++......++....++. ..++... ...+...+.. ...+. ++. +....+..+++.+. .+ T Consensus 76 ~~r~~~e~~~~~~~~~~~~~~v~~~~~~~~~~-~~~i~~~~~~~n~~~~~~~f~~~i~~~~ae~~~~Dle~l~~~g~~ds 154 (360) T protein:vir:99 76 SGHTRDEEGSRTENSEAESGSVKFNATDKSYY-ILVEPKRDALKNTHYGPDQFGDYIVDQFIERYGNDLGLMGIRAGASS 154 (360) T ss_pred eccccccCCCCCcCCcCccccCccccccceee-EeechHHHHHhhhhcccchhHHHHHHHHHHHHHHHHHHHHhhccchh Confidence 11111111111111233333333 2222222 2233222211 11111 112 22222233333221 11 Q ss_pred ------------------HHHHHhhcccccc--c-ccC------------CC------------HHHHHHHHHHHHHHHh Q lcl|NC_011288. 118 ------------------IADLLVDNGTALS--G-SAP------------TD------------ADDAFDLIATALKELT 152 (273) Q Consensus 118 ------------------i~~~~~~~~~~~~--~-~~~------------~t------------~~~~~~~i~~a~~~l~ 152 (273) .+.++.+...... + .+. .+ .......|.++.+.|. T Consensus 155 ~d~~~~~~~d~fl~~~dGwlKka~~~~~~id~a~d~t~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~lf~~~~~~Lp 234 (360) T protein:vir:99 155 GNLQSIGGAAELDNTFKGWIARAEGDAQSVDDAGDSTRIGLEDTATADADSMPSIANTDGSGNPQPVDTSLFNETIQTLD 234 (360) T ss_pred cccccCcccchhhhhhHHHHHHhhcccchhhccccccccccccccccccccchhhhccccccccccchHHHHHHHHHhcc Confidence 1111111100000 0 000 00 0012233556666665 Q ss_pred hcCC--CccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCccccCCCcEEEEEcCceeEEe Q lcl|NC_011288. 153 KANV--PNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYV 230 (273) Q Consensus 153 ~~~v--P~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~~~~~~~~~~~~a~~~a 230 (273) ..-- |...-+.+++|..+..... .+.+.....+..++..+..-.+.|+++...+.+|.. .++..++.-+.+. T Consensus 235 ~kyr~~~~~~~~~~~s~~~~~~yr~---~L~~R~t~LGd~~l~g~~~~~~~Gipi~~v~~~pd~---~~mlT~p~NLi~g 308 (360) T protein:vir:99 235 SRYRESDAYSPVLMTSPNQVQSYTM---SLTEREDPLGSAVIFGDSDITPFSYDLVGVNGFPDE---YMMFTDPNNLAFG 308 (360) T ss_pred hhhhcCcccceEEEccCchHHHHHH---HHhccCcccchhheecccccccceeeeEEcCCCCCC---ceEEeccCceeEE Confidence 4431 1112256788776555543 344444444445565444445789999999999853 4667777777665 Q ss_pred eeee-ee----hhhcCCCceeeeEEe-eeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 231 SQID-TV----EALRDQDSFSDRIRA-LHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 231 ~~~~-~~----e~~~~~~~~~~~v~~-~~~~g~~v~~~~~~v~~~~~~s 273 (273) .-.+ ++ |..|-..+.-..++- +..+.+-+-+++++|+++--.. T Consensus 309 ~~~~iri~~~~e~~~~~~~~~~~~~~~~~~~D~~iee~~Av~~vt~~~~ 357 (360) T protein:vir:99 309 LYEEMELDQSTDTDKVHEQRLHSRNWLEGQFDFQIKEQQAGVLVTDLET 357 (360) T ss_pred eeeeeEEeecccchhhhhhceeeeEEEEEEeeEEEEecccEEEEecCCC Confidence 4221 22 222221111112222 2233333334556655543222 No 196 >protein:vir:4786 Length: 295 # NCBI annotation: hypothetical protein # Family: family:all:3269 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150166;swissprot:trembl:q94m45;genbank:gi:15088777;uniprot:Q94M45;genbank:GeneID:955980 Probab=91.09 E-value=0.018 Score=30.21 Aligned_cols=247 Identities=13% Similarity=0.103 Sum_probs=119.2 Q ss_pred Cccch-----hhHHHHHHHHHHHHHHhhccchhhcccccccc-cCCceEEEeecCc--ccceeecCCCc---cc--CCC- Q lcl|NC_011288. 1 MAFNN-----FIPELWSDMLLEEWTAQTVFANLVNREYEGTA-SKGNVVHIAGVVA--PTVKDYKAAGR---QT--SAD- 66 (273) Q Consensus 1 MA~~~-----~~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~-~~Gdtv~ip~~~~--~~~~~~~~~~~---~~--~~~- 66 (273) |..++ .+.+.|.+-+.+.|....+|.+.+--=...++ .+.+|.---+... ..+.+|..... .. +.. T Consensus 1 mp~N~n~avr~Y~Kqf~glL~~vf~~qa~F~~~FGglQalDGV~~N~tafsvKt~D~pVVig~Y~TdeNvagFGtGTg~S 80 (295) T protein:vir:47 1 MPSNQNNAVRRYEKQYAGILETVFGVRAAFSNALAPIQILDGVQENSKAFSVKTNNTPVVIGEYKTGENDGGFGDNSGAQ 80 (295) T ss_pred CCCCCCccchhhhHHHHHHHHHHHhHHHHHhhhhcchhhhhCCCccceEEEEeecCcceEeecccCCCcccccccCCccc Confidence 76552 34566888888889888888765432001112 2233322112111 12233331110 11 111 Q ss_pred -CCccceEEEEEeee-ee-cceEEc-hHHHHhhhHHHH----HHHHHHHHHHHHHHHHHHHHHHhhcccccccccCCCHH Q lcl|NC_011288. 67 -AISDTGVDLLIDQE-KS-IDFLVD-DIDRVQVAGSLE----AYTRAGATALATDTDKFIADLLVDNGTALSGSAPTDAD 138 (273) Q Consensus 67 -~~~~~~~~~~id~~-~~-~~~~i~-d~d~~~~~~~~~----~~~~~~~~ala~~~D~~i~~~~~~~~~~~~~~~~~t~~ 138 (273) ....-+.-+..|.. .| ++..|. .+|....+++++ ..++.++++-++.+|..+-..+...+......+..+.+ T Consensus 81 sRFG~rkEi~y~dtdV~Y~~~~~iHEGiD~~TVNnd~~aaVAdRL~LQA~Akt~~~n~~~Gk~ls~~A~~te~~td~t~d 160 (295) T protein:vir:47 81 SRFGGVTEVKYENTDVNYDYTLTIHEGLDRYTVNNDLNAAVADRLKLQSEAQTRTVNKRIGKYLSDTATKTEALADFTDD 160 (295) T ss_pred cccCceeeEEeecccccccccchhhhccccccccCChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccch Confidence 11111111111111 11 222222 345544444443 34556777777888888777776666554445566777 Q ss_pred HHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCccccCCCcE Q lcl|NC_011288. 139 DAFDLIATALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQ 218 (273) Q Consensus 139 ~~~~~i~~a~~~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~~~~~~ 218 (273) .....|.++.+..-+..|-. ..-.+|+|+.|..|...+- .......+.+ +-+--|-++-||.+.+.+.-...++.. T Consensus 161 ~V~~LF~~as~~yvn~ev~~-~~~AyV~~evYnaiiD~~l--~TsaK~SsaN-iDengi~~FkGf~i~e~P~~~~q~G~~ 236 (295) T protein:vir:47 161 KVKALFNKLSAFYTNNEVTA-PITVYLRSEFYNAIVDMAS--VTSAKGATIS-LDENGLPKYKGFTLEETPAQYFETGVI 236 (295) T ss_pred hHHHHHHHHHHHhhhhheee-eeEEEEchhHHHHHhcccc--ccccccceee-eccCCcceecceEEEeccHhhccCCcE Confidence 78888999999998888843 3348999999999987652 2222222222 333337789999998876544433333 Q ss_pred EEEEcCceeEEee-eeeeehhhcCCCcee-------------------------eeEEee Q lcl|NC_011288. 219 FVAFHPSAAAYVS-QIDTVEALRDQDSFS-------------------------DRIRAL 252 (273) Q Consensus 219 ~~~~~~~a~~~a~-~~~~~e~~~~~~~~~-------------------------~~v~~~ 252 (273) + .+.++.++-+- .+........++.-| .+.+-+ T Consensus 237 a-ifs~dnig~aftGIn~aR~IesEdF~GValQ~~~~~~~~~~~~~~~~~~~~~~~~~~~ 295 (295) T protein:vir:47 237 A-IFSPNGIIIPFVGISTARVIEAENFDGVNCKLLLRVVLTLLMTIRKQFTKLQELLYRR 295 (295) T ss_pred E-EEccccceeecccceeeeeeecccccchHHHHHHHHHHHHHHHHHHHHHHHHHHhhcC Confidence 3 22222222111 011001111111111 111111 No 197 >protein:vir:94070 Length: 339 # NCBI annotation: putative structural protein # Family: family:all:1653 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453625;genbank:gi:84662661;genbank:GeneID:5142580 Probab=88.95 E-value=0.029 Score=28.99 Aligned_cols=256 Identities=8% Similarity=-0.001 Sum_probs=114.9 Q ss_pred Cccchh---hHHHHHHHHHHHHHHhhccchhhcccccccccCC-ceEEEeecCcccceeecCCCcccCCCCCccceEEEE Q lcl|NC_011288. 1 MAFNNF---IPELWSDMLLEEWTAQTVFANLVNREYEGTASKG-NVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLL 76 (273) Q Consensus 1 MA~~~~---~pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~G-dtv~ip~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 76 (273) +++..+ ..+.+..++.+.....+....++.... .+.-+ +|++++.....+.+..-.........+..-+...-+ T Consensus 49 ~~~~~i~a~~~~~i~~~vy~~~~~~~~~~~l~pv~t--~g~w~~~t~~y~~~e~~G~a~~ygd~ad~Pl~~~~v~~~~~~ 126 (339) T protein:vir:94 49 TANAGIPAWMTTFVDRRVIDIQLAPMAAAKIFPEVK--KGDWTTTYGVFIIAEPVGQVATYSDWSANGMSKANVNFESRQ 126 (339) T ss_pred ccccchhhhhhhhhchhheeecccccchhhhccccc--CCCCcccEEEEeeeecccceEEcccccCCCcccccceeeEEe Confidence 333221 122222333334444444444544321 12223 488898875554322222222221222333333333 Q ss_pred EeeeeecceEEchHHHHhhh---HHHH-HHHHHHHHHHHHHHHHHHHH---------HHhhcccc---cccc---cCCCH Q lcl|NC_011288. 77 IDQEKSIDFLVDDIDRVQVA---GSLE-AYTRAGATALATDTDKFIAD---------LLVDNGTA---LSGS---APTDA 137 (273) Q Consensus 77 id~~~~~~~~i~d~d~~~~~---~~~~-~~~~~~~~ala~~~D~~i~~---------~~~~~~~~---~~~~---~~~t~ 137 (273) + .....++.+...|...+. .++. .....+.+++.++.|+-.+- ++. .++- .+.+ +..|+ T Consensus 127 v-~~~~~g~~y~~~E~~~A~~~g~~l~~~Ka~aA~~al~~~~N~i~~~Gd~~~~~~GLlN-~P~l~~~v~~s~~Wa~kT~ 204 (339) T protein:vir:94 127 N-YRYQTWTEYGDLEMATYGEAGIDYVARQEISASLVMAKFANSSYLLGVAGIANYGLMN-DPSLPAPVAATVNWATAAP 204 (339) T ss_pred E-EEEEEEEeecHHHHHHHHhhCCChHHHHHHHHHHHHHHhhceEEeeeecccceEEEEe-CCCccccccCCCCcccCCH Confidence 3 223456777776664332 2343 34445556666666553221 111 1110 1100 23467 Q ss_pred HHHHHHHHHHHHHHhhcC----CCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCcccc Q lcl|NC_011288. 138 DDAFDLIATALKELTKAN----VPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRD 213 (273) Q Consensus 138 ~~~~~~i~~a~~~l~~~~----vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~~ 213 (273) ..++++|.++...+..+- -|.....|+++|..+..|-.-.. .+.+.. +-+++ ++-+++|...+.+-. T Consensus 205 ~eI~~Di~~~~~~l~~~s~g~~~~~~~~~L~LP~~~~~~L~~~n~----~~~Tvl-~~lk~----n~pnl~i~~~~el~~ 275 (339) T protein:vir:94 205 EDIANDVVAMVGRLISQSGGLITGQERMVMALAPSALNNVNRTNN----FGLSAG-AKIAQ----TYPNIQFVAVPEFDT 275 (339) T ss_pred HHHHHHHHHHHHHHHHhcCCeeeeccCcEEEecHHHHHhcccCCc----CCccHH-HHHHH----hcCCcEEEEcccccc Confidence 788999998888885553 13334579999999988743211 000000 01111 123456666555543 Q ss_pred CCCcEEEEEcCc-------eeEEeeeeeeehhhcCCCceeeeEEeeee-eeeEEecCceEEEEecC Q lcl|NC_011288. 214 TDDEQFVAFHPS-------AAAYVSQIDTVEALRDQDSFSDRIRALHV-YGGKVVRPTGVVVFNKT 271 (273) Q Consensus 214 ~~~~~~~~~~~~-------a~~~a~~~~~~e~~~~~~~~~~~v~~~~~-~g~~v~~~~~~v~~~~~ 271 (273) .++.....+... .+.++......-.++ ...+..+.+..+ .|+-+.+|.+++.+.-= T Consensus 276 a~g~~~~~~~~~~~~~~~~~~~~p~~~~~lpvq~--~~~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 339 (339) T protein:vir:94 276 ASGRLVQLWVPEVNGQPTGEVAFAEKLRSHSIER--YSTTTRQKHSGATFGAVIYQPWAVTQELGV 339 (339) T ss_pred CCCceEEEEEEeccCCcceEEEcchhhhccccEE--cCceEEecceeeeeeEEEEccceeeeeecC Confidence 333322222111 122222221111112 223445555555 56777789988877554 No 198 >protein:vir:103285 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277465;genbank:gi:71834107;genbank:GeneID:3562396 Probab=87.26 E-value=0.04 Score=28.25 Aligned_cols=259 Identities=14% Similarity=0.057 Sum_probs=120.2 Q ss_pred Cccc------hhhHHHH---HHHHHHHHHHhhccchhhcccccccccC-CceEEEeecCcccc-eeecCCCcccCCCCCc Q lcl|NC_011288. 1 MAFN------NFIPELW---SDMLLEEWTAQTVFANLVNREYEGTASK-GNVVHIAGVVAPTV-KDYKAAGRQTSADAIS 69 (273) Q Consensus 1 MA~~------~~~pev~---~~~~~~~~~~~lv~~~~v~~~~~~~~~~-Gdtv~ip~~~~~~~-~~~~~~~~~~~~~~~~ 69 (273) |-.. .|..+-| -.++.+.....++...++.... .+.. -++++++.....+. ..|...+..+..-+.. T Consensus 1 ~~~~~a~~~~~f~~~ql~~id~~v~e~~~~~l~~~~~i~v~~--~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip~v~~~ 78 (296) T protein:vir:10 1 MGVDKADAAGIWTVKQLTASLNKAYETEYDQNSVVNLFPVSN--EIPGYAKYFEYPVFDGVGIAQIVADYTDDLPLVDAL 78 (296) T ss_pred CcccchhhhHHHHHHHHHHHHHHHHhhhhcccccceeccccc--CCCCceeEEEeeeeeccCceeEeCCCccccceeecc Confidence 4333 2333333 3334444444455555544322 1222 34666666544332 2233222223233455 Q ss_pred cceEEEEEeeeeecceEEchHHH--Hhhh-HHHHH-HHHHHHHHHHHHHHHHHHH---------HHhhcccc--cccccC Q lcl|NC_011288. 70 DTGVDLLIDQEKSIDFLVDDIDR--VQVA-GSLEA-YTRAGATALATDTDKFIAD---------LLVDNGTA--LSGSAP 134 (273) Q Consensus 70 ~~~~~~~id~~~~~~~~i~d~d~--~~~~-~~~~~-~~~~~~~ala~~~D~~i~~---------~~~~~~~~--~~~~~~ 134 (273) -++....+. ..+.++.++..|. .... .++.. ....+++++++..|+-++- ++...+.. ...+.= T Consensus 79 ~~~~~~~i~-~~~~~~~~~~~El~~a~~~g~~l~~~ka~aA~~~~~~~~n~~~f~G~~~~g~~GLlN~p~v~~~~~~~~W 157 (296) T protein:vir:10 79 ATERQGKVF-RFGNAFLISIDEIKVGQATGQSLSTRKQSLAFEAHDKLLDKLVWSGSTAHGIPSVFDYPNINNVVSGGSW 157 (296) T ss_pred ceeEEEEEE-EEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEeecccccceeEeecCCCccccccCCc Confidence 556666663 3456666664443 3333 34543 5566667788887765541 11110100 001111 Q ss_pred CCHHHHHHHHHHHHHHHhhc--CCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCccc Q lcl|NC_011288. 135 TDADDAFDLIATALKELTKA--NVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLR 212 (273) Q Consensus 135 ~t~~~~~~~i~~a~~~l~~~--~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l~ 212 (273) .++..++++|.++...+..+ ++ ...-.++|+|..+..|.+- ..+.+..-- +-++ -+..+.+|.....+. T Consensus 158 ~~~t~i~~Di~~~~~~l~~~s~g~-~~p~~l~L~p~~~~~L~~~---~~~~~~t~l-~~ik----~~~~~l~i~~~~~l~ 228 (296) T protein:vir:10 158 SQPTTAVSDITSLLDIIETSTNGQ-HRATHLLLPTTARRIMQNL---VPGTSVSYG-EFFR----QNNSGVTVEFVQYLN 228 (296) T ss_pred cCHHHHHHHHHHHHHHHHHhhCce-ecceeEEeCHHHHHHHhhc---cCCCCccHH-HHHH----HhcCCceEEEeeeec Confidence 24557899999999877554 33 1224689999999888532 111111000 0111 112355555555443 Q ss_pred cCCC---cEEEEEc--CceeEEeeeeeeehhh-cCCCceeeeEEeeeee-eeEEecCceEEEEecCCC Q lcl|NC_011288. 213 DTDD---EQFVAFH--PSAAAYVSQIDTVEAL-RDQDSFSDRIRALHVY-GGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 213 ~~~~---~~~~~~~--~~a~~~a~~~~~~e~~-~~~~~~~~~v~~~~~~-g~~v~~~~~~v~~~~~~s 273 (273) ...+ ...++.. +.-+.++...+ +... .........+...... |+-+.+|++++.+. --| T Consensus 229 ~a~~~g~~~~v~~~~~~~~~~~~v~~~-~~~~~~e~~~l~~~~~~~~~~~Gv~i~~P~ai~~~d-GI~ 294 (296) T protein:vir:10 229 DYNGTGTSAAIAYEKDPNNMAIEIPEA-TNALPAQPKDLHFKIPVTSKATGLIVYRPLTMAVMK-GIT 294 (296) T ss_pred cCCCCcceEEEEEEcCCceEEEEcCcc-eeeecccccCceEEEeeEeeEEEEEEECCceeEEEe-eee Confidence 3221 2233332 33333332111 1111 1222345566667766 58888999999772 112 No 199 >protein:vir:103886 Length: 302 # NCBI annotation: putative major head subunit protein # Family: family:all:776 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938242;genbank:gi:38229147;genbank:GeneID:2648201 Probab=87.22 E-value=0.04 Score=28.24 Aligned_cols=254 Identities=11% Similarity=-0.022 Sum_probs=119.3 Q ss_pred CccchhhHHHHHHHHHHHHHHhhccc-----hhhcccccccccCCceEEEeecCccc-ceeecCCCcccCCCCCccceEE Q lcl|NC_011288. 1 MAFNNFIPELWSDMLLEEWTAQTVFA-----NLVNREYEGTASKGNVVHIAGVVAPT-VKDYKAAGRQTSADAISDTGVD 74 (273) Q Consensus 1 MA~~~~~pev~~~~~~~~~~~~lv~~-----~~v~~~~~~~~~~Gdtv~ip~~~~~~-~~~~~~~~~~~~~~~~~~~~~~ 74 (273) |..+.-.+......+...|.+..-.. .++.+.. .+++ +-+....+.++ +.+.. +......+.++..+ T Consensus 1 m~it~~~l~~l~~~~~~~~~~~y~~a~~~~~~~a~~~~-sdf~---~~~~~~lg~~p~l~e~~---Ge~~~~~l~~~~~~ 73 (302) T protein:vir:10 1 MLINKQSLNAAFVAIKTIFNNAFAAAPTTWQKIAMEVP-SNTS---SNDYKWLSTFPKMRRWI---GAKVVKNLKAYKYV 73 (302) T ss_pred CcccHHHHHHHHHHHHHHHHHHHHhhhhhhhceeeecC-CCcc---eeeceecCCCCCccccc---cceeecccccccee Confidence 98884444444444444444443222 2222211 1222 22222333322 22221 23445567888888 Q ss_pred EEEeeeeecceEEchHHHHhh-hHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccc--cc------------cc--CC-- Q lcl|NC_011288. 75 LLIDQEKSIDFLVDDIDRVQV-AGSLEAYTRAGATALATDTDKFIADLLVDNGTAL--SG------------SA--PT-- 135 (273) Q Consensus 75 ~~id~~~~~~~~i~d~d~~~~-~~~~~~~~~~~~~ala~~~D~~i~~~~~~~~~~~--~~------------~~--~~-- 135 (273) +++.++ +..+.|+-.+.... .+-+..+.+.++++.++..|+.+++++.++.... .+ .. .. T Consensus 74 i~~~~~-g~~v~i~R~~i~nDdlg~~~~~~~~~G~aaa~~~~~lv~~~L~~g~~~~~~DG~~fF~~dH~~g~~~~~N~g~ 152 (302) T protein:vir:10 74 VENEDF-EATVEVDRNDIEDDQIGIYSPQAKMAGYSAAQLPDELVYEAVNGAFTKPCFDGQYFIDTDHPVGDASVSNKGT 152 (302) T ss_pred EEeecc-cceecccHHhhcccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCCcccCCcceecccccccccccccccc Confidence 888654 44556654333221 2234567788889999999999999887632210 00 00 00 Q ss_pred ------CHHHHHHHHHHHHHHH----hhcCCCc--cCCEEEECHHHHHHHhhhHHHHhh-hhcccccceeeeeeeeeEec Q lcl|NC_011288. 136 ------DADDAFDLIATALKEL----TKANVPN--VGRVVVVNAEMAFWLRSSGSKLTS-ADTSGDAAGLRAGTIGNLLG 202 (273) Q Consensus 136 ------t~~~~~~~i~~a~~~l----~~~~vP~--~~r~lvv~p~~~~~L~~~~~~~~~-~~~~~~~~~l~~G~ig~~~G 202 (273) ......+.+.+++..| +..+-|- ..++|||+|.......+- +.. ....+..++++ | - T Consensus 153 ~~~~~~~~~l~~~~~~aa~~am~~~k~~~G~~L~i~P~~LiVp~~le~~A~~l---l~~~~~~~g~~Np~~-g------~ 222 (302) T protein:vir:10 153 APLSNASQAAAKAGYGAARTAMKKFKDEEGRSLNVSPNVLLVGPALEDVAKML---LTNPKLADNTPNPYV-G------T 222 (302) T ss_pred hhhhhcccccchHHHHHHHHHHHHHhhhcccccccCCCEEEecchhHHHHHHH---hhccccCCCCcceec-c------c Confidence 0011122233344433 3333332 247899999876654331 111 01122333433 2 1 Q ss_pred eEEEeeCccccCCCcEEEEEcCceeEEee----eeeeehhhcCCCceeeeEEeeeeeee------EEecCceEEEEecCC Q lcl|NC_011288. 203 ARIVESNNLRDTDDEQFVAFHPSAAAYVS----QIDTVEALRDQDSFSDRIRALHVYGG------KVVRPTGVVVFNKTG 272 (273) Q Consensus 203 ~~v~~s~~l~~~~~~~~~~~~~~a~~~a~----~~~~~e~~~~~~~~~~~v~~~~~~g~------~v~~~~~~v~~~~~~ 272 (273) ++++.++.+... ...+++..+..+-... +...++..-+.+.-+-.++..+.||+ +-..+..+..=+.++ T Consensus 223 ~~~vv~p~L~s~-~aWyL~a~~~~i~~~~l~g~~~P~~~~~~~~~~dgv~~k~~~d~Gvd~R~~~G~~~wq~a~~s~g~~ 301 (302) T protein:vir:10 223 AELVVDGRIESD-TAWFLLDTTKPVKPFIFQPRKQPEFVSQVNLDSDDVFNLRKLKFGAEARAAAGYGFWQLAYGSTGTG 301 (302) T ss_pred eEEEEeeccCCC-CceEEEecCCccceEEEcCccccEEEeccCCCCCceEEEEEEEEeeeeeeecchhhhhhhhccCccC Confidence 577777766432 2344444554432221 11233333334444445666666664 444555555555555 Q ss_pred C Q lcl|NC_011288. 273 S 273 (273) Q Consensus 273 s 273 (273) | T Consensus 302 ~ 302 (302) T protein:vir:10 302 A 302 (302) T ss_pred C Confidence 5 No 200 >protein:vir:79642 Length: 329 # NCBI annotation: HsbB # Family: family:all:463 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285525;genbank:gi:148734508;genbank:GeneID:5220000 Probab=84.16 E-value=0.063 Score=27.18 Aligned_cols=259 Identities=10% Similarity=0.020 Sum_probs=111.8 Q ss_pred Cccc---hhhH---HHHHHHHHHHHHHhhccchhhcccccccccC-CceEEEeecCccccee-ecCCCcccCCCCCccce Q lcl|NC_011288. 1 MAFN---NFIP---ELWSDMLLEEWTAQTVFANLVNREYEGTASK-GNVVHIAGVVAPTVKD-YKAAGRQTSADAISDTG 72 (273) Q Consensus 1 MA~~---~~~p---ev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~-Gdtv~ip~~~~~~~~~-~~~~~~~~~~~~~~~~~ 72 (273) |.+. .|.. +.+...+.+.....++...++.... .+.. -++++++.....+... |......+..-+..-++ T Consensus 31 ~~~~~~~~f~~~ql~~id~~v~e~~~~~l~~~~~i~i~~--~~~~~~~~~t~~~~~~~G~a~~~~d~~~dip~vd~~~~~ 108 (329) T protein:vir:79 31 NDASDMGIWTSQELHKIKAQAYEKEYPAGSALRVFPVTS--ELSDTDKTFEYQTFDKVGHAKIIADYTDDLSTVDALMTS 108 (329) T ss_pred eccchhhHHHHHHHHHHHHHHHhhhhcccchhhhccccc--CCCCceeEEEeeeeecceeeeeecCcccccceeecccce Confidence 2222 1222 1233344444444444444443221 1222 3366666665443322 22222223233444555 Q ss_pred EEEEEeeeeecceEEchHHH--Hhhh-HHHH-HHHHHHHHHHHHHHHHHHHH---------HHhhcccc--------ccc Q lcl|NC_011288. 73 VDLLIDQEKSIDFLVDDIDR--VQVA-GSLE-AYTRAGATALATDTDKFIAD---------LLVDNGTA--------LSG 131 (273) Q Consensus 73 ~~~~id~~~~~~~~i~d~d~--~~~~-~~~~-~~~~~~~~ala~~~D~~i~~---------~~~~~~~~--------~~~ 131 (273) ....+.. ....+.++..|. .... .++. +....+++++++..|+-++- ++...+.. ... T Consensus 109 ~~~~i~~-~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i~f~G~~~~g~~GLlN~p~v~~~~~~~~~~~~ 187 (329) T protein:vir:79 109 EFGKVFR-LGNAFLISIDEIKAGQRTGKSLSTRKANAAQNAHDQLVNHLVFKGSKPHKIISVFEHPNLTTINSAGWNNAA 187 (329) T ss_pred eEEEEEE-EEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEeecccccceeeecCCCccccccCCCCCcc Confidence 5555533 455666664444 3322 3454 35556667777777765431 11110000 001 Q ss_pred ccCCCHHHHHHHHHHHHHHHhhc--CCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeC Q lcl|NC_011288. 132 SAPTDADDAFDLIATALKELTKA--NVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESN 209 (273) Q Consensus 132 ~~~~t~~~~~~~i~~a~~~l~~~--~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~ 209 (273) -+..|+..++++|.++...+..+ ++ ...-.|+|+|..+..|..-. .+.+.... +-+++ +.-+++|...+ T Consensus 188 w~~kt~~ei~~di~~~~~~l~~~s~g~-~~p~~L~Lpp~~~~~L~~~~---~~~~~tvl-~~lk~----~~~~l~I~~~~ 258 (329) T protein:vir:79 188 GTGKKPETAQDELEQAIEKIETLTNGQ-HRANMILIPPSMRKVLMVRM---PETTMSYL-DYFKQ----QNGGITIESIS 258 (329) T ss_pred ccccCHHHHHHHHHHHHHHHHHhcCce-ecccEEEecHHHHHHhhccc---CCCCccHH-HHHHH----hCCCcEEEEcc Confidence 12236678899999998888764 22 12246999999998884310 01111000 01110 01233444444 Q ss_pred ccccCC---CcEEEEEcC--ceeEEe--eeeeeehhhcCCCceeeeEEeeeee-eeEEecCceEEEEecCCC Q lcl|NC_011288. 210 NLRDTD---DEQFVAFHP--SAAAYV--SQIDTVEALRDQDSFSDRIRALHVY-GGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 210 ~l~~~~---~~~~~~~~~--~a~~~a--~~~~~~e~~~~~~~~~~~v~~~~~~-g~~v~~~~~~v~~~~~~s 273 (273) .+-... ...+++... .-+.++ ......-.++.. ....+...... |+-+.+|.+++.+.-=.- T Consensus 259 el~~ag~~g~~~~v~y~~~~~~~~~~vp~~~~~l~~q~~~--~~~~v~~~~r~~Gv~i~~P~ai~~~dGI~~ 328 (329) T protein:vir:79 259 ELEDIDGAGTKAALVYEKDPMNMSIEIPEAFNMLTAQPKD--LHFKVPCTSKCTGLTIYRPLTLVLIKGLVV 328 (329) T ss_pred cccccCCCCceEEEEEecCCceEEEecCcceeeeeceecC--ceEEEceeeeEEEEEEECcceeeeeeeeee Confidence 332211 122333322 222222 111111111222 23344444444 577788998876533222 No 201 >protein:vir:78148 Length: 123 # NCBI annotation: hypothetical protein # Family: family:all:4955 # MgeID: mge:1847 # MgeName: Min1 # Cross-refs: genbank:acc:YP_001294802;genbank:gi:149882823;genbank:GeneID:5309176 Probab=82.21 E-value=0.027 Score=29.15 Aligned_cols=108 Identities=14% Similarity=0.060 Sum_probs=66.5 Q ss_pred EECHHHHHHHhhhHHHHhhhhcccccceeeeeeee-eEeceEEEeeCccccCCCcEEEEEc---------CceeEEee-- Q lcl|NC_011288. 164 VVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIG-NLLGARIVESNNLRDTDDEQFVAFH---------PSAAAYVS-- 231 (273) Q Consensus 164 vv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig-~~~G~~v~~s~~l~~~~~~~~~~~~---------~~a~~~a~-- 231 (273) +++-.+++.++.+...-.... --+.+.+.+|.+. +++|..|..+.++|.+.. ..+..+ -.+-+|+. T Consensus 1 vvsdlqfA~~~g~~v~~~aLp-RE~aNp~ltG~lpV~~~GltWl~tpnlpg~~a-~vlDst~lGgmaDE~l~~Pgya~~~ 78 (123) T protein:vir:78 1 MLSGAQFAKLIGILVDDKALP-REQANIVLTGSLPVSAYGLTWVTSRHITGTDP-WLFDVEQLGGMADEKLLSPEFAPAG 78 (123) T ss_pred CcchhhHHHHhcchhcccccc-cccCCceEecCcceeeeceeeeecCCCCCCcc-ceeehhhhccccccccCCCcccCCC Confidence 556666777777642211100 0112556677764 699999999999994432 111100 01112221 Q ss_pred -eeeeehhhcCCC--ceeeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 232 -QIDTVEALRDQD--SFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 232 -~~~~~e~~~~~~--~~~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) --.+++..|... .-+..++++-..-.-+++|.+.+.|+-.|- T Consensus 79 ~~Gvevkt~Red~~~nD~yriRaRRvTvpiv~EP~Agv~ltg~g~ 123 (123) T protein:vir:78 79 NTGVEASTERAHQGVKDGYLVRGRRNTVAVVTEPMAGVRLTGTGL 123 (123) T ss_pred CcceeEEeeccccCCCCceEEeeeecceeEEecCccceEEeeecC Confidence 113455666655 556789999999999999999999988888 No 202 >protein:vir:104342 Length: 314 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398971;genbank:gi:81343955;genbank:GeneID:3778874 Probab=75.85 E-value=0.14 Score=25.23 Aligned_cols=259 Identities=12% Similarity=0.028 Sum_probs=112.5 Q ss_pred Cccc------------------------hhhHHHH---HHHHHHHHHHhhccchhhcccccccccC-CceEEEeecCccc Q lcl|NC_011288. 1 MAFN------------------------NFIPELW---SDMLLEEWTAQTVFANLVNREYEGTASK-GNVVHIAGVVAPT 52 (273) Q Consensus 1 MA~~------------------------~~~pev~---~~~~~~~~~~~lv~~~~v~~~~~~~~~~-Gdtv~ip~~~~~~ 52 (273) ||-+ .|..+-| -.++.+.....+....++.... .+.. -.+++++.....+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~d~~~~fl~~ql~~id~~v~e~~~~~~~~~~~i~v~~--~~~~~~et~~~~~~e~~G 78 (314) T protein:vir:10 1 MAIKFDAEQAKITTHLEQMGVEKADAAGIWAVSQLTAALNRAYEKEYAENSVVNIFPVTN--EIPGHAKYFEYPEFDGVG 78 (314) T ss_pred CccchHHHHHHHHHHHHhhcccchhhhHHHHHHHHHHHHHHHhhhhccccccceeecccc--CCCCceeEEEeeeecccc Confidence 2211 1222212 2222222233333333433221 1222 2377777765444 Q ss_pred c-eeecCCCcccCCCCCccceEEEEEeeeeecceEEchHHHHh--hh-HHHHH-HHHHHHHHHHHHHHHHHH-------- Q lcl|NC_011288. 53 V-KDYKAAGRQTSADAISDTGVDLLIDQEKSIDFLVDDIDRVQ--VA-GSLEA-YTRAGATALATDTDKFIA-------- 119 (273) Q Consensus 53 ~-~~~~~~~~~~~~~~~~~~~~~~~id~~~~~~~~i~d~d~~~--~~-~~~~~-~~~~~~~ala~~~D~~i~-------- 119 (273) . .-|...+..+..-+..-++....+. ..+..+.++..|... .. .++.. ....+..++++..|+-++ T Consensus 79 ~a~~~~d~~~dip~vd~~~~~~~~~i~-~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i~f~G~~~~g~ 157 (314) T protein:vir:10 79 IAQIIADYSDDLPLVDAFMTEKQGKVF-RFGNAFLISTDEIKAGAATGQSLSARKQALAFEAHDNLLDKLVWSGSAPHGI 157 (314) T ss_pred ceeeeCCcccccceeecccceeEEEEE-EEEeeEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEeecccccc Confidence 3 2233222323333455566666663 346677776555433 22 23533 445555667777665443 Q ss_pred -HHHhhcc--cccccccCCCHHHHHHHHHHHHHHHhhc--CCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeee Q lcl|NC_011288. 120 -DLLVDNG--TALSGSAPTDADDAFDLIATALKELTKA--NVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRA 194 (273) Q Consensus 120 -~~~~~~~--~~~~~~~~~t~~~~~~~i~~a~~~l~~~--~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~ 194 (273) +++.... .....+.-.|+..++++|.++...|.++ ++- ..-.|+|+|..+..|..- ..+.+.+.- +-+++ T Consensus 158 ~GLlN~p~v~~~~~~~~WaT~~ei~~Di~~~~~~l~~~s~g~~-~p~~l~Lpp~~~~~L~~~---~~~~~~tvl-~~l~~ 232 (314) T protein:vir:10 158 VSVFDQPNINNVVATPNWSVPQNAIDDVTAMIDAVESSTQGLH-HVTDILLPASARRVMQGL---VPQTNLSYG-ELFTR 232 (314) T ss_pred eeEeecCCCccccCCCCcccHHHHHHHHHHHHHHHHHhcCccc-cceeEEecHHHHHhhccc---ccCCCccHH-HHHHH Confidence 1111100 0111111236778899999999999765 321 123689999998766321 111111000 01111 Q ss_pred eeeeeEeceEEEeeCccccCCC---cEEEEEcC--ceeEEeeeeeeehhhc-CCCceeeeEEeeeee-eeEEecCceEEE Q lcl|NC_011288. 195 GTIGNLLGARIVESNNLRDTDD---EQFVAFHP--SAAAYVSQIDTVEALR-DQDSFSDRIRALHVY-GGKVVRPTGVVV 267 (273) Q Consensus 195 G~ig~~~G~~v~~s~~l~~~~~---~~~~~~~~--~a~~~a~~~~~~e~~~-~~~~~~~~v~~~~~~-g~~v~~~~~~v~ 267 (273) +.-+++|...+.+....+ ...++..+ .-+.++.... +.... ........+...... |+.+.+|.+++. T Consensus 233 ----n~~~l~I~~~~el~~ag~~g~~~~v~y~~~~~~~~~~vp~~-~~~l~~e~~~~~~~~~~~~r~~Gv~i~~P~ai~~ 307 (314) T protein:vir:10 233 ----NNPGLTIRFLQFLDNYDGAGGKAALAFEKSPLNMSIEIPEV-TNVLPAQPKDLHFRYPVTSKATGLIVYRPLTMAV 307 (314) T ss_pred ----hCCCcEEEEcccccccCCCcceEEEEEecCCcEEEEecCcc-ceeecceecCceEEEcceeeeEEEEEECcceeEe Confidence 112455555554432221 11233322 2222221111 11111 112234455455555 577889999885 Q ss_pred EecCCC Q lcl|NC_011288. 268 FNKTGS 273 (273) Q Consensus 268 ~~~~~s 273 (273) +. --| T Consensus 308 ~d-GI~ 312 (314) T protein:vir:10 308 IK-GIT 312 (314) T ss_pred ee-eee Confidence 52 112 No 203 >protein:vir:10324 Length: 320 # NCBI annotation: ORF26 # Family: family:all:570 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758919;genbank:gi:27311193;genbank:GeneID:956155 Probab=72.40 E-value=0.18 Score=24.62 Aligned_cols=253 Identities=11% Similarity=0.058 Sum_probs=95.1 Q ss_pred chhhHHHHHHHHHHHH-HHhhccchhhcccccccccCCceEEEeecCcccceeecCCCcccCCCCCccceEEEEEeeeee Q lcl|NC_011288. 4 NNFIPELWSDMLLEEW-TAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLLIDQEKS 82 (273) Q Consensus 4 ~~~~pev~~~~~~~~~-~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~id~~~~ 82 (273) -+++|..|.. ++..| .+.-+....+ ..+...|.---+|...+... +.....+ .-...++.+-+.+ T Consensus 1 i~~~P~~~g~-~~glff~~~~v~T~~V----~ie~~~~~l~lip~v~rg~~------g~~~~~~--~~~~~~f~~p~~~- 66 (320) T protein:vir:10 1 MNLLPVNYGD-SRALFAREKKVRTRTI----LVEEKNGVLTLIQSREPGST------ENVAKRG--KRKVRSFVIPHLP- 66 (320) T ss_pred CCcCCchhhh-hhhhccCCCCcccceE----EEEEecCceeeeeccCCCCC------ceeecCC--cceEEEEecceec- Confidence 4457888875 33333 2222211111 22233444444444333211 1111111 1122233332211 Q ss_pred cceEEchHHHHhh--hH-----HHHHHHHHHHHHHHHHHH----HHHHHHHhh-----ccc---------cccc------ Q lcl|NC_011288. 83 IDFLVDDIDRVQV--AG-----SLEAYTRAGATALATDTD----KFIADLLVD-----NGT---------ALSG------ 131 (273) Q Consensus 83 ~~~~i~d~d~~~~--~~-----~~~~~~~~~~~ala~~~D----~~i~~~~~~-----~~~---------~~~~------ 131 (273) ....|+-.|.... .+ .++....+....|.+++| -..+..+.. .++ .... T Consensus 67 ~~d~i~a~eiq~~Ra~G~~~~~~~~~~v~~~l~~lr~~~~~T~E~m~~~AL~G~ildadGtv~~d~y~~fGi~~~~i~~~ 146 (320) T protein:vir:10 67 LEDVILPDEYEGLRGFGTTALAAKSELVKERXETMKSSHDITHEHLRMGAKKGQILDADGTVLYDLYAEFGITKKTIYFG 146 (320) T ss_pred cCCccCHHHHcCcccCCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCeEEcCCCcEEEechhhhCCccceeEEe Confidence 1122222221110 00 111111122222333333 222222211 000 0000 Q ss_pred --ccCCCH-HHHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccc-c-ceeeeeee--eeEeceE Q lcl|NC_011288. 132 --SAPTDA-DDAFDLIATALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGD-A-AGLRAGTI--GNLLGAR 204 (273) Q Consensus 132 --~~~~t~-~~~~~~i~~a~~~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~-~-~~l~~G~i--g~~~G~~ 204 (273) .+.++. ....+.+..+...|. +.+..+-+++++|+++..|.+.+. +........ . ..++.... -.+.|+. T Consensus 147 l~~a~~dv~~~~~~~~~~i~~~l~--g~~~t~v~al~g~~f~~al~~h~~-Vke~y~~~~~~~~~l~~~~~~~f~~gGi~ 223 (320) T protein:vir:10 147 LDNKDANVAESCRQVLRHVEDNLR--GDVMKDVSVDVSEEFFDKFIKHAS-VKEVFLNHEAAVNRLGGDTRKGFKFGGLI 223 (320) T ss_pred cCCCCccHHHHHHHHHHHHHHHhc--cCCCCceEEEEChHHHHHHhcCHH-HHHHHHhhhhhhhhccccccceEEecCEE Confidence 001111 112223333333343 445566678999999999998775 333322111 1 12332111 2467888 Q ss_pred EEeeCc------------cccCCCcEEEEEcCcee----EEeeeeeee---------hhhcCCCceeeeEEeeeeeeeEE Q lcl|NC_011288. 205 IVESNN------------LRDTDDEQFVAFHPSAA----AYVSQIDTV---------EALRDQDSFSDRIRALHVYGGKV 259 (273) Q Consensus 205 v~~s~~------------l~~~~~~~~~~~~~~a~----~~a~~~~~~---------e~~~~~~~~~~~v~~~~~~g~~v 259 (273) |.+=.. +|......+-.+.++.+ +-+.....+ +....++..+-.+..-..-=.-+ T Consensus 224 ~~~Y~g~~~d~~g~~~~~I~~~~~~~~p~g~~~~f~~~~apad~~e~vnt~g~p~y~k~~~~~~~~g~~l~~qS~PLpi~ 303 (320) T protein:vir:10 224 FNENRARHVDEEGKETRFIKAGKGHAFPTGTTNTFFTALAPADFNETAGTLGKRYYAKMEPRRMGRGFDLHSQSNVLPMC 303 (320) T ss_pred EEEcccEEEcCCCCeeEeecCCeeEEEEecCchhheeeecccCcHhhcCCcccccccccccccCCCeEEEEeeecccccc Confidence 866221 22222111112333221 111111100 11122223333444444444566 Q ss_pred ecCceEEEEecCCC Q lcl|NC_011288. 260 VRPTGVVVFNKTGS 273 (273) Q Consensus 260 ~~~~~~v~~~~~~s 273 (273) .||+.++.++++++ T Consensus 304 ~rP~~lv~~~~~a~ 317 (320) T protein:vir:10 304 CRPGVLVELDAAAQ 317 (320) T ss_pred cCcceEEEEEecCC Confidence 79999999999999 No 204 >protein:vir:79399 Length: 455 # NCBI annotation: head protein # Family: family:all:4054 # MgeID: mge:1869 # MgeName: Av-1 # Cross-refs: genbank:acc:YP_001333662;genbank:gi:151266299;genbank:GeneID:5329881 Probab=72.22 E-value=0.19 Score=24.60 Aligned_cols=261 Identities=13% Similarity=0.106 Sum_probs=112.4 Q ss_pred CccchhhHHHHHHHHHHHHHHhhccc-hhhcc--ccc-ccccCCceEEEeecCcccceeecC-----CCcccCCCCCccc Q lcl|NC_011288. 1 MAFNNFIPELWSDMLLEEWTAQTVFA-NLVNR--EYE-GTASKGNVVHIAGVVAPTVKDYKA-----AGRQTSADAISDT 71 (273) Q Consensus 1 MA~~~~~pev~~~~~~~~~~~~lv~~-~~v~~--~~~-~~~~~Gdtv~ip~~~~~~~~~~~~-----~~~~~~~~~~~~~ 71 (273) |++..+--|.+.+- +......++-. .+.|. .|. +...-|+||.=--..-.....|++ +..+.....++-. T Consensus 45 i~d~~~qnEf~~sL-I~RIgs~L~~d~S~~NPLa~FK~g~~~fGdtIeei~~d~ak~~~yd~~~~~aev~pFk~e~P~Ik 123 (455) T protein:vir:79 45 MSDNITRNEFMSAL-INRIGSTLIRDLSWKNPLAVFKQGMMNFGDTIEEVHMDYIKPTIYEEQRDYLERDVFGQAPPPVK 123 (455) T ss_pred hhhhhHHHHHHHHH-HhccccEEEecccccCchHHhccccchhhhhhhhhhhccccccccCcchhhhhccccccCCCcee Confidence 66665544544332 22221111110 01111 111 112236776432222222222222 2223334455555 Q ss_pred eEEEEEeeeeecceEEchHHHHhh---hHHHHHHHHHHHHHHH--HHHHHHHH-----HHHhhccc-c---cc--cccCC Q lcl|NC_011288. 72 GVDLLIDQEKSIDFLVDDIDRVQV---AGSLEAYTRAGATALA--TDTDKFIA-----DLLVDNGT-A---LS--GSAPT 135 (273) Q Consensus 72 ~~~~~id~~~~~~~~i~d~d~~~~---~~~~~~~~~~~~~ala--~~~D~~i~-----~~~~~~~~-~---~~--~~~~~ 135 (273) ..-.+.+.+-.....|++-....+ ...+++++.+...+|. .++|++.. ..+..... . +. ++... T Consensus 124 A~~H~~nR~~~y~~TI~dd~i~~AF~S~~gldefi~~i~~si~sSde~dEY~ylk~Li~~~~~~~~f~~~~I~D~~t~~~ 203 (455) T protein:vir:79 124 SAFHTINRKEKFKITVNRDVLRRAFLSDNGLSEMLSQTMAVAASSDQWSEFLYMTRLFKTYEDSFGFYRMQISDMNTFEP 203 (455) T ss_pred EEEeeccccceeeeeeeHHHHHHhhcChhhHHHHHHHHHHHHhcccchHHHHHHHHHHHHhhhhccceEEEecccccccc Confidence 566666666666677776544332 3447788888777775 46676632 22221111 1 11 01111 Q ss_pred CH---HHHHHHHHHHHHHH-------hhcCCCc----cCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEe Q lcl|NC_011288. 136 DA---DDAFDLIATALKEL-------TKANVPN----VGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLL 201 (273) Q Consensus 136 t~---~~~~~~i~~a~~~l-------~~~~vP~----~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~ 201 (273) +. ...+..++.+..+| +..++|. ++.+++++|++-..| +..+|.++-. .+. +-.-+.+-.+. T Consensus 204 d~~~~~~~iK~lr~aA~kM~lPTR~yN~~gv~~~tdi~DL~lI~~~dtq~ev--dv~~LA~AFN-~d~-vd~~~~~i~Vd 279 (455) T protein:vir:79 204 DKNKVDAALKALRVAANKMQYPTPAFNSAGVHSFARPEDLVLITTPEFKANV--DVTSLSAAFN-RSD-AEAPSHIITVP 279 (455) T ss_pred chhHHHHHHHHHHHHHHHhcCCCcccccccCcccccceeeEEEeCCCceeee--cHHHHHHHhC-ccc-hhcCceeEEec Confidence 11 12233444444443 2233332 456889999876655 2222322111 111 11114455566 Q ss_pred ceEEEeeCccccCCCcEEEEEcCceeEEeeeeeeehhhcCCCceeeeEEee-------eeee---eEEecCceEEEEecC Q lcl|NC_011288. 202 GARIVESNNLRDTDDEQFVAFHPSAAAYVSQIDTVEALRDQDSFSDRIRAL-------HVYG---GKVVRPTGVVVFNKT 271 (273) Q Consensus 202 G~~v~~s~~l~~~~~~~~~~~~~~a~~~a~~~~~~e~~~~~~~~~~~v~~~-------~~~g---~~v~~~~~~v~~~~~ 271 (273) ||-. ..++..++...+.++..-.+..++|..|++..-.+-++-- ..|- +++-.|+.+++--.. T Consensus 280 ~f~f-------a~~~~~a~~~sk~~~~i~D~l~~~~si~np~~l~~Ny~~H~w~ils~S~F~~a~af~~~~~~~~vtp~~ 352 (455) T protein:vir:79 280 GETL-------GMDDTSAILTSKQFFVIKDILLENRTISNPEGLYDNYWLHHWSILSASPFTPAIAFGTKPNTIVVTPKA 352 (455) T ss_pred cccc-------ccCCceEEEeehhhhhhhhhhhhcccccCcccceeehhhhhhhhhhhccccceeeeecCCceEEEcccc Confidence 6622 2233445666666666666667778887776433322211 1111 233344444443333 Q ss_pred CC Q lcl|NC_011288. 272 GS 273 (273) Q Consensus 272 ~s 273 (273) +| T Consensus 353 ~~ 354 (455) T protein:vir:79 353 ET 354 (455) T ss_pred cc Confidence 33 No 205 >protein:vir:79078 Length: 307 # NCBI annotation: gp8 # Family: family:all:908 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111208;genbank:gi:134288798;genbank:GeneID:4960752 Probab=72.13 E-value=0.19 Score=24.58 Aligned_cols=268 Identities=13% Similarity=0.147 Sum_probs=103.2 Q ss_pred Cccc--hhh-HHHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceee--cCCCcccCCCCCccceEEE Q lcl|NC_011288. 1 MAFN--NFI-PELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDY--KAAGRQTSADAISDTGVDL 75 (273) Q Consensus 1 MA~~--~~~-pev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~ 75 (273) |+.. .+. --+.+..++.-.....+-..++- ...-....++-.+|++ ..+..-+- .+.+.+...+.-..+..++ T Consensus 1 m~~~~~~~~~dp~LT~~A~gy~n~~~Iad~lfP-~vpV~~~~~k~~~f~~-e~f~~~~t~ra~~~~~~~v~~~~~~~~~~ 78 (307) T protein:vir:79 1 MGRLSKLRIVDPVLTNLAIGYTNAEFIGQTLMP-VVEVEKEGGKIPKFGK-ESFRLYQTERALRAKSNRMNPEDIDSVDV 78 (307) T ss_pred CCCCCCCcccCHHHHHHHhhccchhhhhhhcCC-cccccccccceeeecc-ccccccccccccCCCcceeeeeccccccc Confidence 5443 222 22344444432222222222221 1110111133333332 11111111 1111111111112234455 Q ss_pred EEeeeeecceEEchHHHHhhhHHHHH-HHHHHHHHHHHHHHHHHHHHHhhccc-----c--cccc--cCCCHHHHHHHHH Q lcl|NC_011288. 76 LIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADLLVDNGT-----A--LSGS--APTDADDAFDLIA 145 (273) Q Consensus 76 ~id~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~~D~~i~~~~~~~~~-----~--~~~~--~~~t~~~~~~~i~ 145 (273) .++++ .....|+..+.....++.+. .++.....|....+..+..++.+... . .+++ .....++.+.+|. T Consensus 79 ~~~~~-~l~~~id~r~~~~~~~~~~~~Av~~l~d~I~l~~E~~~A~l~~~~~~y~~~~k~tLsgt~~Wsd~~sDPi~di~ 157 (307) T protein:vir:79 79 NLDEH-DLEYPIDYREDQESAFPLEQAAVQTATDAIQLRREKMIADLSQNPSSYAAGNKKQLSATEKFTAANSDPVGVIE 157 (307) T ss_pred ccccc-chhhcccchhcCCCCCCHHHHHHHHHHHHHHhHHHHHHHHHhccccccCCCceEEEccCcccCCCCCCcHHHHH Confidence 55543 33344555555555555433 34444444444434444444432221 1 1111 0112245567788 Q ss_pred HHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceE-EEeeCccccC---------C Q lcl|NC_011288. 146 TALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGAR-IVESNNLRDT---------D 215 (273) Q Consensus 146 ~a~~~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~-v~~s~~l~~~---------~ 215 (273) ++++.+.+..- .....+++++..+..|++.+..+....... ...+..-++..+.|++ |+.-...+.. . T Consensus 158 ~~~~ai~~~~g-~~Pn~~vlg~~a~~~l~~h~~i~~~lk~~~-~g~it~~~la~l~~v~~V~vg~a~y~~~~~~~~~iw~ 235 (307) T protein:vir:79 158 DGKEAIRTKIG-RRPNTMVIGASAYKTLKAHPQLIEKIKYSM-KGIVTVDLLKEIFEVENIAVGEAIYADDKDRFTDIWG 235 (307) T ss_pred HHHHHHHHhhC-CccceEEeCHHHHHHHhcCHHHHHHhcCcc-ccccCHHHHHHHhCceeEEEeeeeeecccccchhcCC Confidence 88887766532 234579999999999999998776554432 2222222345566766 4333322211 1 Q ss_pred CcEEEEEcCc------------eeEEeeeee--eehhhcCCCceeeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 216 DEQFVAFHPS------------AAAYVSQID--TVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 216 ~~~~~~~~~~------------a~~~a~~~~--~~e~~~~~~~~~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) +..++++.+. ++|+-.+.. .....+.+...+..|......--.++=|++=..|+-..- T Consensus 236 ~~~~l~y~~~~~~~~~~~~~~ps~Gyt~~~~g~~~~d~~~~~~~~~~vrv~~~~~~~i~~~~~G~li~~~v~ 307 (307) T protein:vir:79 236 ANIVLAYVPLQRGGQQRTPYEPSYGYTLRKKGNPVVDTRIEDGKLELVRATDIFRPYLLGADAGYLISGING 307 (307) T ss_pred CceEEEecccccCCCCCcccccccceeEEecCceEEecccCCCceeEEeecccccceeeccccchhhccCCC Confidence 1112222111 122221211 111112223334444333333333333332222211111 No 206 >protein:vir:15 Length: 472 # NCBI annotation: major head protein # Family: family:all:4054 # MgeID: mge:323 # MgeName: GA-1 # Cross-refs: genbank:acc:NP_073691;swissprot:sw:q9fzw7;genbank:gi:12248115;uniprot:Q9FZW7;genbank:GeneID:919909 Probab=62.92 E-value=0.33 Score=23.25 Aligned_cols=257 Identities=14% Similarity=0.099 Sum_probs=119.1 Q ss_pred CccchhhHHHHHHHHHHHHHHhhccchhhccc----cc-ccccCCceEEEeecCcccceeecCC---CcccCCCCCccce Q lcl|NC_011288. 1 MAFNNFIPELWSDMLLEEWTAQTVFANLVNRE----YE-GTASKGNVVHIAGVVAPTVKDYKAA---GRQTSADAISDTG 72 (273) Q Consensus 1 MA~~~~~pev~~~~~~~~~~~~lv~~~~v~~~----~~-~~~~~Gdtv~ip~~~~~~~~~~~~~---~~~~~~~~~~~~~ 72 (273) ||+.++--|.+.+ |+......++. .+-+++ |. ....-|++|.=-........-|.+. ..+.....++-.. T Consensus 52 ~~d~~~QnEf~~s-Lv~RIgst~V~-~~s~~NPLa~Fk~~~~~fG~~Ieei~~D~a~~~~yd~~k~Ev~pFk~~~P~IkA 129 (472) T protein:vir:15 52 LADKTLQNDFIHT-LVDRIGLVVVH-HKLMQNPLKIFKKGTLEYGRKIEEIFTDLTREHVYDPEKAETEVFKREIPNVKT 129 (472) T ss_pred hhhhhhHHHHHHH-HHhhhcchhhh-hhhccChHHHHhhcCccchhhhhhhhcccccccccchhhhhccccccCCCccee Confidence 7777655554433 22222222221 111111 11 1123466654333322222223331 1222334445555 Q ss_pred EEEEEeeeeecceEEchHHHHhh---hHHHHHHHHHHHHHHH--HHHHHHHHH-HH----hhc-cc---ccccccCCCHH Q lcl|NC_011288. 73 VDLLIDQEKSIDFLVDDIDRVQV---AGSLEAYTRAGATALA--TDTDKFIAD-LL----VDN-GT---ALSGSAPTDAD 138 (273) Q Consensus 73 ~~~~id~~~~~~~~i~d~d~~~~---~~~~~~~~~~~~~ala--~~~D~~i~~-~~----~~~-~~---~~~~~~~~t~~ 138 (273) .-.+.+.+-.....|++-....+ ...+++++.+...+|. .++|++..- .+ ... -. .......++.. T Consensus 130 ~~H~~nR~~~y~~Ti~~d~i~~AF~S~~gld~fi~~i~~si~sSde~dEY~~~k~li~~~~~k~lf~v~~i~~d~~~~~v 209 (472) T protein:vir:15 130 LFHERDRQVFYKQTISDQQLKTAFTNAQKFDEFLSTIVTSIYNSAEVDEFRYTKLLIDNYFSKNLFKIVPVSVDPATGIV 209 (472) T ss_pred EEeeccccceeeeeeeHHHHHHhhcChhhHHHHHHHHHHHHhccccHHHHHHHHHHHHHhhhccceEEEecCCCcccccc Confidence 55566666566667776444332 3447788888777775 466766321 11 111 10 11112222333 Q ss_pred HHHHHHHHHHHHHhhcCCCc----------------cCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEec Q lcl|NC_011288. 139 DAFDLIATALKELTKANVPN----------------VGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLG 202 (273) Q Consensus 139 ~~~~~i~~a~~~l~~~~vP~----------------~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G 202 (273) .+.+.+..++..-.+..+|. ++.+++++|++-..| +.++|.++-. .+. +-.-+.+-.+.| T Consensus 210 ~~kd~~K~lr~aa~kM~lP~gT~~yN~~gv~~~td~~DL~lI~~~dtq~ev--dv~~LA~AFN-~d~-vd~~~~~i~Vd~ 285 (472) T protein:vir:15 210 NTKEFLAKTRATATKMTLPMGTRDFNSMAVHTRTDMDDLYIIMDADTQAEV--DVNELASAFN-LNK-ADFIGRRILIDG 285 (472) T ss_pred cHHHHHHHHHHHHHHhcCCCCCCCCCccccceeccceeeeEEeCCCceEee--cHHHHHHHhC-cch-hhcCceeEEecc Confidence 33333333444445555662 456888999876655 2223322111 111 111244555667 Q ss_pred eEEEeeCccccCCCcEEEEEcCceeEEeeeeeeehhhcCCCce-------------eeeEEeeeeeeeEEecC---ceEE Q lcl|NC_011288. 203 ARIVESNNLRDTDDEQFVAFHPSAAAYVSQIDTVEALRDQDSF-------------SDRIRALHVYGGKVVRP---TGVV 266 (273) Q Consensus 203 ~~v~~s~~l~~~~~~~~~~~~~~a~~~a~~~~~~e~~~~~~~~-------------~~~v~~~~~~g~~v~~~---~~~v 266 (273) |. .++..++...+.+++.-.++.++|..|++..- ........+||++..=| ..++ T Consensus 286 Fa---------~~d~~a~l~sk~~f~i~D~l~~m~s~rnprgL~~Ny~lHv~q~~s~s~F~naiaF~~g~~v~~~~~~~i 356 (472) T protein:vir:15 286 FA---------STGLKAVMVDKDFFMLYDQVFRMESQRNAQGMYWNYYLHVWQVLSTSRFANAVAFVDSALIDGDVSQVI 356 (472) T ss_pred cC---------CCCceeeeehhhHHHHHHHHHhcccccCcccchhHHHHHHHHHHHhccccceEEEeccccCCCccceEE Confidence 62 23444666677777776677778888887631 23333446667766432 2333 Q ss_pred EEecCCC Q lcl|NC_011288. 267 VFNKTGS 273 (273) Q Consensus 267 ~~~~~~s 273 (273) + .++.+ T Consensus 357 v-~p~~~ 362 (472) T protein:vir:15 357 V-TPTVG 362 (472) T ss_pred E-eeccc Confidence 3 44443 No 207 >protein:vir:107882 Length: 307 # NCBI annotation: gp34 # Family: family:all:908 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024707;genbank:gi:48696944;genbank:GeneID:2845970 Probab=50.52 E-value=0.61 Score=21.77 Aligned_cols=268 Identities=12% Similarity=0.128 Sum_probs=102.5 Q ss_pred Cccc--hhhH-HHHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceeec--CCCcccCCCCCccceEEE Q lcl|NC_011288. 1 MAFN--NFIP-ELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYK--AAGRQTSADAISDTGVDL 75 (273) Q Consensus 1 MA~~--~~~p-ev~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~ 75 (273) |... .++. -+.+..++.-.....+-..++- ...-....|+-.+|++ ..+...+-. +.+.+...+.-..+..+. T Consensus 1 m~~~~~~~~~dp~LT~~A~gy~n~~~ia~~l~P-~vpv~~~~~k~~~f~~-eaF~~~~t~r~~~~~~~~v~~~~~~~~~~ 78 (307) T protein:vir:10 1 MGRLSKLRIVDPVLTNLAIGYTNAEFIGQSLMP-VVEVEKEGGKIPKFGK-ESFRLYKTERALRARSNRMNPEDLGSIDI 78 (307) T ss_pred CCCCCCCcccChhHHHHHHhhcchhhhhhhcCC-cccccccccceeeECc-ccccchhhhcccCCCcceeeccccccccc Confidence 4433 2222 2355555543333333222221 1111112244455543 222221111 111111111111223334 Q ss_pred EEeeeeecceEEchHHHHhhhHHHHH-HHHHHHHHHHHHHHHHHHHHHhhccc-c------cccc--cCCCHHHHHHHHH Q lcl|NC_011288. 76 LIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADLLVDNGT-A------LSGS--APTDADDAFDLIA 145 (273) Q Consensus 76 ~id~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~~D~~i~~~~~~~~~-~------~~~~--~~~t~~~~~~~i~ 145 (273) .+.+ ..-...++..+.....++.+. ..+.....|....+.....++..... . .+++ -.....+.+.+|. T Consensus 79 ~~~~-~~L~~~id~r~~~~~~~~~~~~av~~l~d~I~l~~E~~~A~l~~~~~~y~~~~k~tLsGt~~Wsd~~sDPi~di~ 157 (307) T protein:vir:10 79 VLDE-HDLEYPIDYREDQESAFPLEQAAVQTATEAIQLRREKMVADLAQNPNSYAGGNKKQLSATEKFTAAGSDPVGVIE 157 (307) T ss_pred cccc-ccccccCChhhcCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCccccCCCceEEeccccccCCCCCCcHHHHH Confidence 4422 233345555555555566533 44544444443334344444332221 0 1111 0112344567788 Q ss_pred HHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEee-Ccccc-CCCcEE-EEE Q lcl|NC_011288. 146 TALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVES-NNLRD-TDDEQF-VAF 222 (273) Q Consensus 146 ~a~~~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s-~~l~~-~~~~~~-~~~ 222 (273) +++..+.+..- .....+++++..+..|++.++.+...+..+ ...+..-++..+.|++.+.. ..... ..+... +++ T Consensus 158 ~~~~ai~~~~g-~~Pn~~vlg~~a~~al~~hp~i~e~lk~~~-~g~it~~~la~ll~v~~i~vg~a~~~~~~~~~~~iw~ 235 (307) T protein:vir:10 158 DGKEAIRTKIG-RRPNTMVIGASAYKTLKAHPQLIEKIKYSM-KGIVTVDLLKEIFEVENIAVGEAIYADDKDRFTDIWG 235 (307) T ss_pred HHHHHHHhhhC-CccceEEeCHHHHHHHhcCHHHHHHhCCcc-ccccCHHHHHHHhCceeEEEeeeeeeccCCccceeCC Confidence 88887765532 234579999999999999998776654432 22222223455677655442 22211 111111 111 Q ss_pred cCce-------------------eEEeeeee--eehhhcCCCceeeeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 223 HPSA-------------------AAYVSQID--TVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 223 ~~~a-------------------~~~a~~~~--~~e~~~~~~~~~~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) ..-. +|+--|.. .+...+.++..+..|.....+--.++=|++=..|+-..- T Consensus 236 ~~~vl~yv~~~~~~~~~~~~epsfGyT~~~~g~~~~d~~~~~~~~~~~r~~~~~~~~i~~~~~G~li~~~~~ 307 (307) T protein:vir:10 236 ANIVLAYVPLQRGGQQRTPYEPSYGYTLRKKGNPVVDTRIEDGKLELVRSTDIFRPYLLGADAGYLISGING 307 (307) T ss_pred CceEEEecccccCCCCCcccccccceeEEEcCCeEeeceecCCceeEEeccccccceeecccccceeccCCC Confidence 1111 22211110 111111122223233222222222222222222211111 No 208 >protein:vir:101557 Length: 336 # NCBI annotation: gp12 # Family: family:all:1653 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958117;genbank:gi:41057663;genbank:GeneID:2716814 Probab=50.43 E-value=0.61 Score=21.76 Aligned_cols=254 Identities=11% Similarity=0.013 Sum_probs=110.2 Q ss_pred CccchhhHHHHHHHHHHHHHHhh----ccchhhccccccccc-CCceEEEeecCccc-ceeecCCCcccCCCCCccceEE Q lcl|NC_011288. 1 MAFNNFIPELWSDMLLEEWTAQT----VFANLVNREYEGTAS-KGNVVHIAGVVAPT-VKDYKAAGRQTSADAISDTGVD 74 (273) Q Consensus 1 MA~~~~~pev~~~~~~~~~~~~l----v~~~~v~~~~~~~~~-~Gdtv~ip~~~~~~-~~~~~~~~~~~~~~~~~~~~~~ 74 (273) +++ .-+|..++.-+...+.+.+ ....++-... .+. .-+++.++.....+ +.-| .........+..-+..+ T Consensus 45 ~~~-~~i~~~l~~~i~p~~~~~~~~p~~a~~l~pv~t--~g~W~~~~~~~~~~e~~G~a~~y-gd~~D~P~~d~~~~~~~ 120 (336) T protein:vir:10 45 TGS-SGIPNYLTTYVDPAVIDILVAPMKAAELVGESK--KGDWTTLVAAFITAEPTTKVATY-GDYSSDGDSGANINYPQ 120 (336) T ss_pred CCC-chhHHHHHhhcccceeeehhhhhhhhhhccccc--cCCccceeEEEeeeeceeeEEEe-eccCCCceeecccceee Confidence 211 1244444333322222221 1222222111 111 12477777755433 2223 22222223344455555 Q ss_pred EEEeeeeecceEEchHHHHhhh---HHH-HHHHHHHHHHHHHHHHHHHH---------HHHhhcccc---cc----cccC Q lcl|NC_011288. 75 LLIDQEKSIDFLVDDIDRVQVA---GSL-EAYTRAGATALATDTDKFIA---------DLLVDNGTA---LS----GSAP 134 (273) Q Consensus 75 ~~id~~~~~~~~i~d~d~~~~~---~~~-~~~~~~~~~ala~~~D~~i~---------~~~~~~~~~---~~----~~~~ 134 (273) .++- ....++.++..|..... .++ .+....+++++.++.++..+ +++. .++- .+ .... T Consensus 121 ~~v~-~~~~g~~yg~~El~~A~~~g~~l~~~Ka~aA~~ale~~~N~i~~~Gd~~~~~yGllN-~P~l~a~~t~~t~~~~~ 198 (336) T protein:vir:10 121 RQSY-FFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLIN-DPSLSAPITATTPWSGS 198 (336) T ss_pred eeEE-EEEeeeeeCHHHHHHHHHhCCCcHHHHHHHHHHHHHHhhCcEEEEeccccceEEEEe-CCCCccccccCCCcccc Confidence 5653 33556777766654322 234 33445555667777664322 1111 1110 11 1123 Q ss_pred CCHHHHHHHHHHHHHHHhhcC--C-C-ccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCc Q lcl|NC_011288. 135 TDADDAFDLIATALKELTKAN--V-P-NVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNN 210 (273) Q Consensus 135 ~t~~~~~~~i~~a~~~l~~~~--v-P-~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~ 210 (273) +++..++++|.++...|..+. + . ...-.|+++|..+..|-+-.+ .+.+.. +-++ .++=++.|...+. T Consensus 199 ~t~eei~~Di~~~~~~l~~qs~G~i~~~~~~tL~LP~~~~~~Ls~~n~----~g~Tvl-~~lk----~n~Pnl~i~t~pE 269 (336) T protein:vir:10 199 PAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSKTNQ----YGLAAA-AKLK----DIFPKLEFVTIPE 269 (336) T ss_pred cCHHHHHHHHHHHHHHHHHhcCCeecccCcceEEecHHHHHhccCCCc----cCccHH-HHHH----HhcCccEEEEccc Confidence 456778999999888887643 2 2 234579999999887733110 000000 0011 1133556666665 Q ss_pred cccCCCcEEEEEcCc-------eeEEeeeeeeehhhcCCCceeeeEEeeeee-eeEEecCceEEEEecC Q lcl|NC_011288. 211 LRDTDDEQFVAFHPS-------AAAYVSQIDTVEALRDQDSFSDRIRALHVY-GGKVVRPTGVVVFNKT 271 (273) Q Consensus 211 l~~~~~~~~~~~~~~-------a~~~a~~~~~~e~~~~~~~~~~~v~~~~~~-g~~v~~~~~~v~~~~~ 271 (273) +-..++..+....+. -+.+..+....-.++.. ....+.+..+. |+-+.+|-+++.+.-= T Consensus 270 l~~a~G~~~~l~~~~~~~~~t~~~~~p~~~~~l~vq~~~--~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 336 (336) T protein:vir:10 270 YDTASGRLVQLWAPRVEGKDTATCGFTEKMRAHSIERYS--SYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) T ss_pred cccCCCceEEEEEEecCCCcceeeecchhhhccceeecC--ceeEeccccceeeeeeeccchheeeecC Confidence 544333333222111 11222222111111222 22344444444 5666678887766544 No 209 >protein:vir:3643 Length: 336 # NCBI annotation: gp12 # Family: family:all:1653 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705638;genbank:gi:23752323;genbank:GeneID:955719 Probab=50.20 E-value=0.62 Score=21.74 Aligned_cols=254 Identities=11% Similarity=0.013 Sum_probs=109.4 Q ss_pred CccchhhHHHHHHHHHHHHHHhh----ccchhhccccccccc-CCceEEEeecCccc-ceeecCCCcccCCCCCccceEE Q lcl|NC_011288. 1 MAFNNFIPELWSDMLLEEWTAQT----VFANLVNREYEGTAS-KGNVVHIAGVVAPT-VKDYKAAGRQTSADAISDTGVD 74 (273) Q Consensus 1 MA~~~~~pev~~~~~~~~~~~~l----v~~~~v~~~~~~~~~-~Gdtv~ip~~~~~~-~~~~~~~~~~~~~~~~~~~~~~ 74 (273) +++. =+|.+++.-+...+.+.+ ....++-... .+. .-+++.++.....+ +.-| .........+..-+..+ T Consensus 45 ~~~~-~~~~~l~~~i~p~~~~~~~~~~~~~~l~pv~t--~g~W~~~~~~~~~~e~~G~a~~y-gd~~D~P~~d~~~~~~~ 120 (336) T protein:vir:36 45 TGSS-GIPNYLTTYVDPSVIDILVAPMKAAELVGESK--KGDWTTLVAAFITAEPTTKVATY-GDYSSDGDSGANINYPQ 120 (336) T ss_pred CCCc-chHHHHHHhhccceEeeecchhhhhhhccccc--cCCccceeEEEeeeeceeeEEEe-eccCCCceeecccceee Confidence 1111 134444433322222221 1222222111 111 12477777755433 2223 22222222344455555 Q ss_pred EEEeeeeecceEEchHHHHhhh---HHH-HHHHHHHHHHHHHHHHHHHH---------HHHhhcccc---cc----cccC Q lcl|NC_011288. 75 LLIDQEKSIDFLVDDIDRVQVA---GSL-EAYTRAGATALATDTDKFIA---------DLLVDNGTA---LS----GSAP 134 (273) Q Consensus 75 ~~id~~~~~~~~i~d~d~~~~~---~~~-~~~~~~~~~ala~~~D~~i~---------~~~~~~~~~---~~----~~~~ 134 (273) .++- ....++.+...|..... .++ .+....++++|.++.++..+ +++. .++- .+ .... T Consensus 121 ~~v~-~~~~g~~yg~~E~~~Aa~~~~~l~~~Ka~aA~~ale~~~N~i~~~Gd~~~~~yGllN-dP~l~a~~t~~t~~~~~ 198 (336) T protein:vir:36 121 RQSY-FFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLIN-DPSLSAPITATTPWSGS 198 (336) T ss_pred eeEE-EEEeeeeeCHHHHHHHHHhCCCcHHHHHHHHHHHHHHhhCcEEEEeccccceEEEEe-cCCCccccccCCCcccc Confidence 5653 33556777766654322 233 33445555667666664322 1111 1110 01 1123 Q ss_pred CCHHHHHHHHHHHHHHHhhcC--C--CccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCc Q lcl|NC_011288. 135 TDADDAFDLIATALKELTKAN--V--PNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNN 210 (273) Q Consensus 135 ~t~~~~~~~i~~a~~~l~~~~--v--P~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~ 210 (273) +++..++++|.++...|..+- + +...-.|+++|..+..|-+-.+ .+.+.. +-++ .++=++.|+..+. T Consensus 199 ~t~~ei~~Di~~~~~~l~~qt~G~i~~~~~~tL~LP~~~~~~Ls~~n~----~g~Tvl-~~lk----~n~Pnl~i~t~pE 269 (336) T protein:vir:36 199 PAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSKTNQ----YGLAAA-AKLK----DIFPKLEFVTIPE 269 (336) T ss_pred cCHHHHHHHHHHHHHHHHHhcCCeeeeccccEEEechHHHHhccCCCc----cCccHH-HHHH----HhcCccEEEEccc Confidence 456788999999888886642 1 1223579999999887733110 000000 0111 1123456666666 Q ss_pred cccCCCcEEEEEcCc-------eeEEeeeeeeehhhcCCCceeeeEEeeeee-eeEEecCceEEEEecC Q lcl|NC_011288. 211 LRDTDDEQFVAFHPS-------AAAYVSQIDTVEALRDQDSFSDRIRALHVY-GGKVVRPTGVVVFNKT 271 (273) Q Consensus 211 l~~~~~~~~~~~~~~-------a~~~a~~~~~~e~~~~~~~~~~~v~~~~~~-g~~v~~~~~~v~~~~~ 271 (273) +-..++..+....+. -+.+..+....-.++.. ....+.+..+. |+-+.+|-+++.+.-= T Consensus 270 l~~a~g~~~~l~~~~~~~~~t~~~~~p~~~~~l~vq~~~--~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 336 (336) T protein:vir:36 270 YDTASGRLVQLWAPRVEGKDTATCGFTEKMRAHSIERYS--SYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) T ss_pred cccCCCceEEEEEEecCCCcceeeecchhhhccceeecC--ceeEeccccceeeeeeeccchheeeecC Confidence 644333333222111 11222222111111222 23344444444 5666678887766544 No 210 >protein:vir:2736 Length: 348 # NCBI annotation: putative structural protein # Family: family:all:1083 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695109;genbank:gi:23455878;genbank:GeneID:955608 Probab=31.98 E-value=1.5 Score=19.69 Aligned_cols=260 Identities=11% Similarity=0.050 Sum_probs=89.9 Q ss_pred Cccc--hhhHHHHHHHHHHHHHHh-hcc-chhhcc----ccccc---ccCCceEEEeecCcccceeecCCCcccCCCC-C Q lcl|NC_011288. 1 MAFN--NFIPELWSDMLLEEWTAQ-TVF-ANLVNR----EYEGT---ASKGNVVHIAGVVAPTVKDYKAAGRQTSADA-I 68 (273) Q Consensus 1 MA~~--~~~pev~~~~~~~~~~~~-lv~-~~~v~~----~~~~~---~~~Gdtv~ip~~~~~~~~~~~~~~~~~~~~~-~ 68 (273) ||+- .|.+.-+.+.+.+.-... ..+ ..++-. +.+.. +..|..+- ..+...+.+..... - T Consensus 1 M~~i~d~f~~~~l~~~v~~~~~~~~~~l~~~~Fp~~~~~~~~~~~~~~~~~~~~~---------a~~v~~~~~~~~~~r~ 71 (348) T protein:vir:27 1 MGLIYDKVTASNIAGYFNALQENVSSTLGESIFPARKQLGTKLSYIKGASGQSVA---------LKAAAFDTNVTIRDRV 71 (348) T ss_pred CcchhhhcCHHHHHHHHHhccchhhhhhHhhcCCCccccceeEEEEeeccCceeE---------eeeecCCCCcceeccc Confidence 9964 556666666544321111 111 112211 11111 11122111 12221111111111 1 Q ss_pred ccceEEEEEeeeeecceEEchHHHHhh--hHH-H-HHHHHHHHHHHH-----------HHHHHHHHHHHhhccc------ Q lcl|NC_011288. 69 SDTGVDLLIDQEKSIDFLVDDIDRVQV--AGS-L-EAYTRAGATALA-----------TDTDKFIADLLVDNGT------ 127 (273) Q Consensus 69 ~~~~~~~~id~~~~~~~~i~d~d~~~~--~~~-~-~~~~~~~~~ala-----------~~~D~~i~~~~~~~~~------ 127 (273) .-+..+..+-. ......++..|.... ... . ....++....|+ +.++--....+..+.. T Consensus 72 ~~~~~~~~~p~-i~~~~~i~~~d~~~~~~~~~~~~~~~~~~~~~~i~~d~~~l~~~i~~r~E~m~~~al~~Gki~i~~~~ 150 (348) T protein:vir:27 72 SAEMHDEQMPF-FKEAMLVKENDRQQLNLVKDSGNAVLVNTIVAGIFNDNLTLVNGARARLEAMRMQVLATGKIAFTSDG 150 (348) T ss_pred ceeeeeeecCc-cccccccCHHHHHHHHHhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCeeEEecCC Confidence 11222222211 111233443332211 100 0 111122222222 2222222233321110 Q ss_pred -------------cc--ccccCCCHHHHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhccccc-ce Q lcl|NC_011288. 128 -------------AL--SGSAPTDADDAFDLIATALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDA-AG 191 (273) Q Consensus 128 -------------~~--~~~~~~t~~~~~~~i~~a~~~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~-~~ 191 (273) .. ++....+..+.+++|.++...+++.+. ...+++++++.+..|++++.+....+..... .. T Consensus 151 ~~~~vdfg~~~~~~~t~~~~W~~~~adp~~di~~~~~~~~~~G~--~~~~ii~~~~~~~~l~~~~~v~~~~~~~~~~~~~ 228 (348) T protein:vir:27 151 VNKDIDYGVKPDHKKQVSKSWAEPGATPLADLEDAIETARELGL--NPERAVMNAKTFGLIRKAASTVKVIKPLAGDGSA 228 (348) T ss_pred eeEEEeecCCcccceeeeeccCCCCCCHHHHHHHHHHHHHhcCC--cccEEEECHHHHHHHhcCHHHHHHhcccCccccc Confidence 00 001111234567888888888877665 3347899999999999987654333222111 11 Q ss_pred eeee----eeeeEeceEEEeeCccc-cCCC--------cEEEEEcCceeEEeeeeeeehhh-----cCCCceeeeEEeee Q lcl|NC_011288. 192 LRAG----TIGNLLGARIVESNNLR-DTDD--------EQFVAFHPSAAAYVSQIDTVEAL-----RDQDSFSDRIRALH 253 (273) Q Consensus 192 l~~G----~ig~~~G~~v~~s~~l~-~~~~--------~~~~~~~~~a~~~a~~~~~~e~~-----~~~~~~~~~v~~~~ 253 (273) +... .++.+.|++|+.-+.-. ...+ ..++....+..|.-.-....|.. .........+ +.. T Consensus 229 i~~~~~~~~~~~~~g~~i~~yd~~y~d~~G~~~~~~p~~~vvl~~~~~~G~~~yG~~~e~~~~~~~~~~~~~~~~~-~~~ 307 (348) T protein:vir:27 229 VTKAELENYIADNFGVSIVLENGTYRNDKGEVSKFYPDGHLTLIPNGPLGNTVFGTTPEESDLFADNTVNAEVEIV-DNG 307 (348) T ss_pred cCHHHHHHHHHhhcCceEEEEeeEEEcCCCcCcccccCCeEEEEcCCcceeEEeccCcchhhhhhccccccceeee-CCe Confidence 1111 23456788776533222 1111 22333333333321110111110 0111111111 111 Q ss_pred eeeeEE--ecCceEEEEecCCC Q lcl|NC_011288. 254 VYGGKV--VRPTGVVVFNKTGS 273 (273) Q Consensus 254 ~~g~~v--~~~~~~v~~~~~~s 273 (273) .+-..- -+|.+..+...+.. T Consensus 308 ~~~~~~~~~dP~~~~~~~~s~~ 329 (348) T protein:vir:27 308 IAVTTTKTTDPVNVQTKVSMVA 329 (348) T ss_pred eEEEeeecCCCceEEEEEeeee Confidence 111111 24444433322222 No 211 >protein:vir:78558 Length: 336 # NCBI annotation: major capsid protein # Family: family:all:1653 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294848;genbank:gi:149882911;genbank:GeneID:5291029 Probab=31.25 E-value=1.5 Score=19.60 Aligned_cols=255 Identities=11% Similarity=0.037 Sum_probs=109.5 Q ss_pred CccchhhHHHHHHHHHHHHHHh----hccchhhcccccccccCCceEEEeecCcccc-eeecCCCcccCCCCCccceEEE Q lcl|NC_011288. 1 MAFNNFIPELWSDMLLEEWTAQ----TVFANLVNREYEGTASKGNVVHIAGVVAPTV-KDYKAAGRQTSADAISDTGVDL 75 (273) Q Consensus 1 MA~~~~~pev~~~~~~~~~~~~----lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~ 75 (273) +++.- +|..++.-+...+.+. .....++-.+..++ =.-+++.++.....+. .-| .........+..-++.+. T Consensus 45 ~~~~g-~~~~l~~~i~p~~~~~~~~~~~~~~l~~v~t~g~-W~~~~~~~~~~e~~G~a~~y-gd~~D~P~vd~~~~~~~~ 121 (336) T protein:vir:78 45 TGSSG-IPNYLTTYVDPSVIDILVAPMKAAELVGESKKGD-WTTLVAAFITAEPTTTVATY-GDYSSDGDSGTNINYPQR 121 (336) T ss_pred CCCcc-hHHHHHHhcccceeeehhhhhhhhhhcccccCCC-ccccEEEEeeeecceeeEEe-ecccCCCeeecceeeEEE Confidence 22221 3444433332222222 12222332221111 0125788877554432 223 222222233455555555 Q ss_pred EEeeeeecceEEchHHHHhhh---HHH-HHHHHHHHHHHHHHHHHHHH-H--------HHhhccc---ccccc----cCC Q lcl|NC_011288. 76 LIDQEKSIDFLVDDIDRVQVA---GSL-EAYTRAGATALATDTDKFIA-D--------LLVDNGT---ALSGS----APT 135 (273) Q Consensus 76 ~id~~~~~~~~i~d~d~~~~~---~~~-~~~~~~~~~ala~~~D~~i~-~--------~~~~~~~---~~~~~----~~~ 135 (273) ++ +....++.++..|..... .++ .+....+++++.++.+...+ + ++. .++ ..+.+ ... T Consensus 122 ~v-~~~~~g~~yg~~El~~A~~~g~~l~~~Ka~aA~~ale~~~N~~~~~Gd~~~~~~GllN-~P~l~a~~t~~~~~w~~~ 199 (336) T protein:vir:78 122 QS-YFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLIN-DPSLSAPITATTPWSGSP 199 (336) T ss_pred EE-EEEEeeeeecHHHHHHHHHhCCCcHHHHHHHHHHHHHHhhCeEEEEeccccceEEEEe-CCCCCcccccCcCccccc Confidence 66 334566777766654332 234 33444555666666653211 1 111 111 01111 124 Q ss_pred CHHHHHHHHHHHHHHHhhcC---C-CccCCEEEECHHHHHHHhhhHHHHhhhhcccccceeeeeeeeeEeceEEEeeCcc Q lcl|NC_011288. 136 DADDAFDLIATALKELTKAN---V-PNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNL 211 (273) Q Consensus 136 t~~~~~~~i~~a~~~l~~~~---v-P~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~v~~s~~l 211 (273) |+..++++|..+...+..+- + +.....|+++|..+..|-.-.+ .+.+.. +-+++ ++=+++|...+.+ T Consensus 200 T~~~I~~Di~~~~~~l~~qt~g~~~~~~~~tL~Lp~~~~~~L~~~n~----~g~tv~-~~lk~----n~Pnl~i~t~pel 270 (336) T protein:vir:78 200 AVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSKTNQ----YGLSAA-AKLKE----IFPKLEFVTIPEY 270 (336) T ss_pred CHHHHHHHHHHHHHHHHHhcCCeeeeccceEEEechHHHHhccCCCc----cCccHH-HHHHH----hcCccEEEEcccc Confidence 66788999998888885543 1 2233479999999988843211 000000 01111 1224556665555 Q ss_pred ccCCCcEEEEEcCc-------eeEEeeeeeeehhhcCCCceeeeEEeeeee-eeEEecCceEEEEecC Q lcl|NC_011288. 212 RDTDDEQFVAFHPS-------AAAYVSQIDTVEALRDQDSFSDRIRALHVY-GGKVVRPTGVVVFNKT 271 (273) Q Consensus 212 ~~~~~~~~~~~~~~-------a~~~a~~~~~~e~~~~~~~~~~~v~~~~~~-g~~v~~~~~~v~~~~~ 271 (273) -..++.....+.+. -+.++.+....-.++.. ....+.+..+. |+-+.+|-+++.+.-= T Consensus 271 ~~Agg~~~~~~~~~~~~~~t~~~~~p~~f~~lpvq~~~--~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 336 (336) T protein:vir:78 271 DTASGRLVQLWAPRVEGKDTATCGFTEKMRAHSIERYS--SYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) T ss_pred cccCcceEEEEEeeccCCcceeeecchhhhccceeecC--ceeEeccccceeeeeeeccchheeeccC Confidence 43333333222211 11222222111111222 23334444444 4566678877766444 No 212 >protein:vir:5942 Length: 523 # NCBI annotation: similar to major head protein # Family: family:all:364 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835728;genbank:gi:30044131 Probab=29.40 E-value=1.7 Score=19.37 Aligned_cols=260 Identities=14% Similarity=0.100 Sum_probs=106.0 Q ss_pred CccchhhHHHHHHH----------HHHHHHHhhccchhhcccccccccCCc------e--EEEeecCcccceeecCCCcc Q lcl|NC_011288. 1 MAFNNFIPELWSDM----------LLEEWTAQTVFANLVNREYEGTASKGN------V--VHIAGVVAPTVKDYKAAGRQ 62 (273) Q Consensus 1 MA~~~~~pev~~~~----------~~~~~~~~lv~~~~v~~~~~~~~~~Gd------t--v~ip~~~~~~~~~~~~~~~~ 62 (273) |+... +..+.-. .-+.+..+..... ..+.. ....|. . ..+..-......+-...... T Consensus 193 itg~t--ga~fa~s~~~an~astAss~Al~gEA~t~~--sTd~a-t~~~Gtt~t~~~~~lyt~~~g~~t~~~~~~~~~~~ 267 (523) T protein:vir:59 193 ASGDP--ENTVAYPLPRYNRIVGAVGSALYARLFFVT--GSDFA-TVAGGTPSTQDLDLVYYIDARNDFEDQSTDPDYPD 267 (523) T ss_pred ccccc--cccccchhhccccccccccccccccccccc--ccccc-ccCCCcccccccccccccccccchhhccccccccc Confidence 11110 0001000 0000100000000 00000 000000 0 00000000000000000000 Q ss_pred cCCCCCccceEEEEEeeeeecc------eEEchHHHH----hhh-H-HHH-HHHHHHHHHHHHHHHHHHHHHHhhccccc Q lcl|NC_011288. 63 TSADAISDTGVDLLIDQEKSID------FLVDDIDRV----QVA-G-SLE-AYTRAGATALATDTDKFIADLLVDNGTAL 129 (273) Q Consensus 63 ~~~~~~~~~~~~~~id~~~~~~------~~i~d~d~~----~~~-~-~~~-~~~~~~~~ala~~~D~~i~~~~~~~~~~~ 129 (273) .........+..++|+|...-+ -.+ .+|.+ -.| + |-+ +...=+...|..+|+++|+..+...+... T Consensus 268 ~~~~~~~~~eM~FsIeK~tVtAkSRaLKAeY-T~ELAQDLKAiH~GLDAE~ELanILStEImlEINR~ii~~~~~~a~~~ 346 (523) T protein:vir:59 268 PGFQSLDIPEINLELRSRPVATKTRKLRAAW-TPEAMQDLAAYHKGVDLENEIVTLMSQYIAREIDLEILSTIMAHARRT 346 (523) T ss_pred cccccccccceeeEEEeEEEeeecccccccc-cHHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHhHhhhheee Confidence 0111233445667776642111 011 12222 233 2 333 34455667788999999999887654321 Q ss_pred -------ccc----cCCCH----H----HHHHHHHHHHHHHhhc-C-CC-----ccCCEEEECHHHHHHHhhhHHHHhhh Q lcl|NC_011288. 130 -------SGS----APTDA----D----DAFDLIATALKELTKA-N-VP-----NVGRVVVVNAEMAFWLRSSGSKLTSA 183 (273) Q Consensus 130 -------~~~----~~~t~----~----~~~~~i~~a~~~l~~~-~-vP-----~~~r~lvv~p~~~~~L~~~~~~~~~~ 183 (273) .+. ...++ . ...+++......+++. + +- -.+-|+|++|+..+.|-.++.+-... T Consensus 347 ~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~n~i~~~t~~~~~~~~~~s~~v~~~l~~~~~~~~~~ 426 (523) T protein:vir:59 347 DNYGFWSEVVGEYYDETSGNFVAGNFYGSKQEWLATLMIELNKVSNRIQQKTAVAGANFLVTSPQVAALLESMPGFTPGN 426 (523) T ss_pred eeccccccceeeecccccchhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHhcccccccEEEEchhHHHHHHhccccccCC Confidence 111 01111 0 1133333332222211 1 11 13568999999988886655321111 Q ss_pred hcccccceeeee--eeeeEe-ceEEEeeCccccCCCcEEEEEcCceeE-------Eee--eeeeehhhcCCCceeeeEEe Q lcl|NC_011288. 184 DTSGDAAGLRAG--TIGNLL-GARIVESNNLRDTDDEQFVAFHPSAAA-------YVS--QIDTVEALRDQDSFSDRIRA 251 (273) Q Consensus 184 ~~~~~~~~l~~G--~ig~~~-G~~v~~s~~l~~~~~~~~~~~~~~a~~-------~a~--~~~~~e~~~~~~~~~~~v~~ 251 (273) + ......| ..|.+. |+.||+.+.-+ ...+++|.++..+ ++= -+..+....++..|.-.+=- T Consensus 427 ~----~~~~~~~~~~~g~l~~~~~vy~d~~~~---~dy~~~g~k~~~~~~~~~~~y~Py~~l~~~~~~~dp~s~qp~~~~ 499 (523) T protein:vir:59 427 D----NRDGGTGIFYVGMVQGRYRLYKNIYQN---QPVIIMGNQDLNTPWQTGAVYAPYVPLLFTPTIVDPVNFSYRRGL 499 (523) T ss_pred c----cccccccceeEEEecCceEEEecCCCC---cceEEEEecccCCcccccceecccchhhcccccccCCcccceeee Confidence 1 1111122 145554 57899887643 2345666665332 111 11123444678889989999 Q ss_pred eeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 252 LHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 252 ~~~~g~~v~~~~~~v~~~~~~s 273 (273) +.+||-.|.+|....+|.-.-- T Consensus 500 ~tRY~l~v~nP~~~~~~~~~~~ 521 (523) T protein:vir:59 500 MTRYALEVVRPEFYGLLYVKLL 521 (523) T ss_pred eeehhheecchhHhhhhhhhhc Confidence 9999998888885554432211 No 213 >protein:vir:96490 Length: 348 # NCBI annotation: head protein # Family: family:all:1083 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238492;genbank:gi:66391768;genbank:GeneID:5176912 Probab=23.93 E-value=2.2 Score=18.66 Aligned_cols=266 Identities=12% Similarity=0.053 Sum_probs=89.9 Q ss_pred Cccc--hhhHHHHHHHHHHHHHHh--hccchhhcccccccccCCceEEEee-c-CcccceeecCCCcccCCCC-CccceE Q lcl|NC_011288. 1 MAFN--NFIPELWSDMLLEEWTAQ--TVFANLVNREYEGTASKGNVVHIAG-V-VAPTVKDYKAAGRQTSADA-ISDTGV 73 (273) Q Consensus 1 MA~~--~~~pev~~~~~~~~~~~~--lv~~~~v~~~~~~~~~~Gdtv~ip~-~-~~~~~~~~~~~~~~~~~~~-~~~~~~ 73 (273) ||+- .|.+.-+++.+.+.-... .+...++-.. ...+-++.+-. . +...+..+...+.+..... -.-+.. T Consensus 1 M~~i~d~f~~~~l~~~i~~~~~~~~~~l~~~~Fp~~----~~~~~~~~~~~~~~~~~~~a~~v~~~~~~~~~~r~~~~~~ 76 (348) T protein:vir:96 1 MGLIYDKVTASNIAGYFNTLQENVDSTLGESIFPAR----KQLGTKLSYIKGASGQSVALKAAAFDTNVTIRDRVSAEIH 76 (348) T ss_pred CcchhhccCHHHHHHHHHhcccchhhhhhhhcCCCc----cccceeEEEEeecCCceeEeeeecCCCCcceecccceeee Confidence 9954 556665655544322111 1111222111 01111111101 0 1111122222121111111 111222 Q ss_pred EEEEeeeeecceEEchHHHHhh--hHH--HHHHHHHHHHHHH-----------HHHHHHHHHHHhhccc----------- Q lcl|NC_011288. 74 DLLIDQEKSIDFLVDDIDRVQV--AGS--LEAYTRAGATALA-----------TDTDKFIADLLVDNGT----------- 127 (273) Q Consensus 74 ~~~id~~~~~~~~i~d~d~~~~--~~~--~~~~~~~~~~ala-----------~~~D~~i~~~~~~~~~----------- 127 (273) ++.+-.. .....++..|.... ... .....++....++ +..+--.+..+..+.. T Consensus 77 ~~~~p~i-~~~~~i~~~d~~~l~~~~~~~~~~~~~~~~~~i~~d~~~l~~~i~~r~E~m~~qal~~Gki~~~~~~~~~~v 155 (348) T protein:vir:96 77 DEQMPFF-KEALLVKENDRQQLNLVKDTGNEALINTIVAGIFNDDVTLINGARARLEAMRMQVLATGKIAFTSDGVNKDI 155 (348) T ss_pred eeecCcc-ccccccCHHHHHHHHhhhccCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCeeEeecCCeeEEE Confidence 2232111 11223333332211 000 0011122222222 2222222333322110 Q ss_pred --------ccc--cccCCCHHHHHHHHHHHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhccccc-ceee--- Q lcl|NC_011288. 128 --------ALS--GSAPTDADDAFDLIATALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDA-AGLR--- 193 (273) Q Consensus 128 --------~~~--~~~~~t~~~~~~~i~~a~~~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~-~~l~--- 193 (273) ... .......++.+++|.++...+++.+.+ ..+++++++.+..|++++.+....+..... ..+. T Consensus 156 dfg~~~~~~~t~~~~W~~~~adp~~di~~~~~~~~~~G~~--~~~~i~~~~~~~~l~~~~~v~~~~~~~~~~~~~~~~~~ 233 (348) T protein:vir:96 156 DYGVKADHKKQVSKSWAEPGATPLADLEDAIETARELGLN--PERAIMNAKTFGLIRKAASTVKAIKPLAGDGSSVTKAE 233 (348) T ss_pred eccCCcccceeeccccCCCCCCHHHHHHHHHHHHHhcCCc--ccEEEeCHHHHHHHhcCHHHHHHHhccCCccccccHHH Confidence 000 111112345678888888888776653 457999999999999987655433322111 1111 Q ss_pred -eeeeeeEeceEEEeeCccc-cCCC--------cEEEEEcCceeEEeeeeeeeh---hhcCCCceee-eEEeeeeeeeEE Q lcl|NC_011288. 194 -AGTIGNLLGARIVESNNLR-DTDD--------EQFVAFHPSAAAYVSQIDTVE---ALRDQDSFSD-RIRALHVYGGKV 259 (273) Q Consensus 194 -~G~ig~~~G~~v~~s~~l~-~~~~--------~~~~~~~~~a~~~a~~~~~~e---~~~~~~~~~~-~v~~~~~~g~~v 259 (273) ...++.+.|++|+.-+.-. ...+ ..++....+..|.-.-....| .........+ .......+-... T Consensus 234 ~~~~~~~~~g~~i~~y~~~y~d~~G~~~~~~p~~~v~l~~~~~~G~~~yg~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~ 313 (348) T protein:vir:96 234 LQNYVADNYGVEIVLENGTYRNEKGEVSKFFPDGHLTLIPNGPLGNTVFGTTPEESDLFADNTVNADVEIVDSGIAVTTT 313 (348) T ss_pred HHHHHhhhcCceEEEEccEEEecCCcEeccccCCeEEEEcCCCceeEEeccChhhhhhhhcccccccceecCCeeEEEee Confidence 1224456688887633222 1112 122222222222110000000 0000011111 011111222222 Q ss_pred e--cCceEEEEecCCC Q lcl|NC_011288. 260 V--RPTGVVVFNKTGS 273 (273) Q Consensus 260 ~--~~~~~v~~~~~~s 273 (273) . +|.+..+...+.. T Consensus 314 ~~~dP~~~~~~~~s~p 329 (348) T protein:vir:96 314 KTTDPVNVQTKVSMVA 329 (348) T ss_pred ecCCCceEEEEEeeee Confidence 2 3444444322222 No 214 >protein:vir:99888 Length: 309 # NCBI annotation: capsid protein # Family: family:all:908 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164075;genbank:gi:56692607;genbank:GeneID:3192616 Probab=23.54 E-value=2.3 Score=18.61 Aligned_cols=265 Identities=14% Similarity=0.091 Sum_probs=102.0 Q ss_pred CccchhhHH-HHHHHHHHHHHHhhccchhhcccccccccCCceEEEeecCcccceee--cCCCcccCCCCCccceEEEEE Q lcl|NC_011288. 1 MAFNNFIPE-LWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDY--KAAGRQTSADAISDTGVDLLI 77 (273) Q Consensus 1 MA~~~~~pe-v~~~~~~~~~~~~lv~~~~v~~~~~~~~~~Gdtv~ip~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~i 77 (273) |++-.+.+. +.+..++.--...++-..++-+ ..-....|+-..|++-..+...+- .+.+.+... ++..+..++.+ T Consensus 1 ~~~~~~~~dp~LT~~A~gy~n~~~Ia~~l~P~-vpV~~~~~~~~~f~~~e~F~~~~t~r~~~~~~~~v-~~~~~~~~~~~ 78 (309) T protein:vir:99 1 MSNAPFPIDPELTAIAIAYRNGRMISDEVLPR-VPVGKQEFKFWKYDLAQGFTVPETLVGRKSKPNEV-EFSATDETGST 78 (309) T ss_pred CCCCCcCcCHhHHHHHhhccChhhhhhhcCCc-cccCccccceeeechhhcccccchhhccCCCcceE-eecccCceeee Confidence 999987755 3444444222222222222211 110111233333443222222111 121221111 23344456666 Q ss_pred eeeeecceEEchHHH--HhhhHHHHH-HHHHHHHHHHHHHHHHHHHHHhhccc-c------ccccc--CCCHHHHHHHHH Q lcl|NC_011288. 78 DQEKSIDFLVDDIDR--VQVAGSLEA-YTRAGATALATDTDKFIADLLVDNGT-A------LSGSA--PTDADDAFDLIA 145 (273) Q Consensus 78 d~~~~~~~~i~d~d~--~~~~~~~~~-~~~~~~~ala~~~D~~i~~~~~~~~~-~------~~~~~--~~t~~~~~~~i~ 145 (273) ..+ .....|+..|. +...++.+. ..+.+...|....+.....++...++ . .+++. ....++.+.+|. T Consensus 79 ~~~-~L~~~i~~~~~~~a~~~~d~~~~Av~~l~~~i~l~rE~~~A~lv~~~a~y~~~~k~~Lsgt~~wsd~~SDPi~~i~ 157 (309) T protein:vir:99 79 EDH-GLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAAGNKTTLSGADQWSDPTSNPLPVIT 157 (309) T ss_pred ccc-ceeecCCchhhhhccCCCCHHHHHHHHHHHHHHHHHHHHHHHHhcChhhcCCCceEEecCccccCCCCCCcHHHHH Confidence 432 44445554443 333345433 44555444443333333333322221 1 11110 012234566677 Q ss_pred HHHHHHhhcCCCccCCEEEECHHHHHHHhhhHHHHhhhhcccccc-eeeeeeeeeEece-EEEeeCccccCC-----Cc- Q lcl|NC_011288. 146 TALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAA-GLRAGTIGNLLGA-RIVESNNLRDTD-----DE- 217 (273) Q Consensus 146 ~a~~~l~~~~vP~~~r~lvv~p~~~~~L~~~~~~~~~~~~~~~~~-~l~~G~ig~~~G~-~v~~s~~l~~~~-----~~- 217 (273) .++..+. . ....++++...+..|+..+..+.......... .+..-++..+.|+ .|+.......++ +. T Consensus 158 ~~~~~~g---~--~PN~~vlg~~~~~~l~~hp~i~~~ik~~~~~~g~it~~~la~l~~ve~V~vg~a~~n~a~~g~~~~~ 232 (309) T protein:vir:99 158 DALDSVI---L--RPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPMAFLQELLELDAIYIGEARLNIARPGQNPNL 232 (309) T ss_pred HHHHhhC---C--CcceEEechHHHHHHhhCHHHHHHhcCCCccccccCHHHHHHHhCcceEEeecceeecccccccccc Confidence 7765552 1 23479999999999999988766554433221 1222335567787 455543332111 11 Q ss_pred EEEEEcCceeEEeeee-eeehh----------hcCCCcee---------eeEEeeeeeeeEEecCceEEEEecCCC Q lcl|NC_011288. 218 QFVAFHPSAAAYVSQI-DTVEA----------LRDQDSFS---------DRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) Q Consensus 218 ~~~~~~~~a~~~a~~~-~~~e~----------~~~~~~~~---------~~v~~~~~~g~~v~~~~~~v~~~~~~s 273 (273) .-+++...++.+.... +.++. .|..+.+- ..|+....+--.+.=+++=..|+-..| T Consensus 233 ~~iwg~~~~L~y~~~~~~~~~~ps~G~t~~~~~r~~g~~~d~~~~~~g~~~vr~~~~~k~~i~~~d~G~li~~~va 308 (309) T protein:vir:99 233 IRAWGPHASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIADPNIGLRGGQRVRVGESVKELVTAPDLGFFFENAVA 308 (309) T ss_pred ccccCCcEEEEEcCCCCCCcccccccceeecccccCCceeeeeeccCCceEEEEeccccchhcchhcchhhhhccc Confidence 1122222222222111 11110 01111100 111111111111222222222222222 Done!