Query lcl|Aclame:protein:vir:97031|NCBI_annot:31|genbank:acc:YP_654132;genbank:gi:108862016;genbank:GeneID:5075980
Match_columns 402
No_of_seqs 106 out of 124
Neff 6.0
Searched_HMMs 1612
Date Mon Dec 2 11:37:51 2013
Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_49 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_49_vs_rec_db.hhr

No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 protein:vir:97031 Length: 402 100.0 3E-153 2E-156 857.1 34.0 402 1-402 1-402 (402)
2 protein:vir:105645 Length: 400 100.0 2E-149 1E-152 835.7 34.6 400 1-402 1-400 (400)
3 protein:vir:7019 Length: 401 # 100.0 6E-145 4E-148 811.4 33.1 399 1-402 1-400 (401)
4 protein:vir:103323 Length: 364 100.0 1E-130 9E-134 732.4 26.6 358 1-364 1-364 (364)
5 protein:vir:6324 Length: 335 # 100.0 2E-118 2E-121 665.3 26.4 331 1-343 1-335 (335)
6 protein:vir:78935 Length: 335 100.0 3E-117 2E-120 659.5 27.2 331 1-343 1-335 (335)
7 protein:vir:80213 Length: 334 100.0 1E-115 7E-119 650.8 28.3 327 1-335 1-334 (334)
8 protein:vir:10450 Length: 344 100.0 1E-109 6E-113 618.1 30.4 325 1-333 1-344 (344)
9 protein:vir:100057 Length: 375 100.0 2E-110 1E-113 621.7 24.5 333 1-340 9-375 (375)
10 protein:vir:2201 Length: 345 # 100.0 4E-109 2E-112 614.9 29.7 326 1-333 7-345 (345)
11 protein:vir:94576 Length: 347 100.0 7E-105 4E-108 591.7 29.2 329 1-333 1-347 (347)
12 protein:vir:94711 Length: 347 100.0 2E-101 1E-104 572.2 28.7 328 1-334 1-347 (347)
13 protein:vir:8885 Length: 347 # 100.0 6E-101 4E-104 570.1 28.6 326 1-334 1-347 (347)
14 protein:vir:78739 Length: 332 100.0 8E-101 5E-104 569.2 25.7 318 1-331 1-332 (332)
15 protein:vir:3364 Length: 347 # 100.0 1E-99 6E-103 563.3 28.0 330 1-335 1-347 (347)
16 protein:vir:1541 Length: 347 # 100.0 5.3E-98 3E-101 553.9 26.8 328 1-335 5-347 (347)
17 protein:vir:99675 Length: 324 100.0 5.4E-98 3E-101 553.9 25.2 314 45-366 1-324 (324)
18 protein:vir:94622 Length: 341 100.0 5.2E-75 3.2E-78 427.8 25.3 316 1-335 1-341 (341)
19 protein:vir:80180 Length: 381 100.0 1.7E-62 1E-65 359.3 21.2 321 1-333 1-381 (381)
20 protein:vir:105822 Length: 273 100.0 4.3E-59 2.7E-62 340.5 17.7 267 1-333 1-273 (273)
21 protein:vir:102605 Length: 273 100.0 4.3E-59 2.7E-62 340.5 17.7 267 1-333 1-273 (273)
22 protein:vir:1781 Length: 221 # 100.0 1E-58 6.4E-62 338.5 15.2 216 88-325 1-221 (221)
23 protein:vir:7990 Length: 273 # 100.0 8.5E-57 5.3E-60 328.0 17.5 267 1-333 1-273 (273)
24 protein:vir:3136 Length: 322 # 100.0 6E-57 3.8E-60 328.8 13.1 309 1-337 1-322 (322)
25 protein:vir:102655 Length: 322 100.0 6.6E-52 4.1E-55 301.2 21.0 308 1-336 1-322 (322)
26 protein:vir:80930 Length: 278 100.0 1.8E-43 1.1E-46 255.0 17.0 271 1-334 1-278 (278)
27 protein:vir:107120 Length: 329 100.0 1.4E-40 8.6E-44 239.1 18.0 305 1-369 12-329 (329)
28 protein:vir:94800 Length: 319 100.0 1.1E-40 7.1E-44 239.5 17.5 304 1-359 1-319 (319)
29 protein:vir:97331 Length: 319 100.0 1.1E-40 7.1E-44 239.5 17.5 304 1-359 1-319 (319)
30 protein:vir:96123 Length: 274 100.0 3.9E-40 2.4E-43 236.6 15.6 267 1-339 1-274 (274)
31 protein:vir:3613 Length: 272 # 100.0 5.6E-40 3.5E-43 235.8 15.9 267 1-333 1-272 (272)
32 protein:vir:99075 Length: 392 100.0 5.4E-39 3.4E-42 230.4 15.8 346 1-402 1-369 (392)
33 protein:vir:93742 Length: 274 100.0 3.7E-39 2.3E-42 231.3 14.8 267 1-337 1-274 (274)
34 protein:vir:94494 Length: 274 100.0 1.9E-38 1.2E-41 227.4 15.9 267 1-337 1-274 (274)
35 protein:vir:97433 Length: 274 100.0 1.9E-38 1.2E-41 227.4 15.9 267 1-337 1-274 (274)
36 protein:vir:1239 Length: 274 # 100.0 2.8E-38 1.7E-41 226.5 16.1 267 1-337 1-274 (274)
37 protein:vir:96833 Length: 275 100.0 2.4E-38 1.5E-41 226.8 15.6 268 1-338 1-275 (275)
38 protein:vir:95898 Length: 274 100.0 1.5E-38 9.5E-42 227.9 14.5 267 1-337 1-274 (274)
39 protein:vir:96262 Length: 274 100.0 1.5E-38 9.5E-42 227.9 14.5 267 1-337 1-274 (274)
40 protein:vir:108303 Length: 418 100.0 9.1E-38 5.6E-41 223.7 16.6 291 1-336 1-418 (418)
41 protein:vir:79008 Length: 299 100.0 1.4E-36 8.6E-40 217.2 18.0 288 1-337 1-299 (299)
42 protein:vir:105334 Length: 276 100.0 1.5E-35 9.1E-39 211.5 16.2 269 1-343 1-276 (276)
43 protein:vir:3525 Length: 423 # 100.0 2.4E-35 1.5E-38 210.3 17.2 291 1-334 1-423 (423)
44 protein:vir:105374 Length: 423 100.0 6.3E-35 3.9E-38 208.1 16.7 292 1-334 1-423 (423)
45 protein:vir:174 Length: 423 # 100.0 5.1E-35 3.2E-38 208.6 15.7 292 1-334 1-423 (423)
46 protein:vir:9820 Length: 272 # 100.0 7.1E-34 4.4E-37 202.3 13.6 267 1-338 1-272 (272)
47 protein:vir:3033 Length: 272 # 100.0 7.1E-34 4.4E-37 202.3 13.6 267 1-338 1-272 (272)
48 protein:vir:105522 Length: 423 100.0 1.4E-31 8.9E-35 189.7 15.2 291 1-334 1-423 (423)
49 protein:vir:78920 Length: 290 100.0 4.2E-31 2.6E-34 187.1 17.1 277 1-332 1-290 (290)
50 protein:vir:739 Length: 231 # 99.9 2E-30 1.2E-33 183.4 11.3 230 46-333 1-231 (231)
51 protein:vir:105464 Length: 346 99.9 8.8E-28 5.5E-31 168.9 15.7 330 1-386 1-346 (346)
52 protein:vir:102335 Length: 312 99.9 3.1E-27 1.9E-30 165.9 16.5 299 1-337 1-312 (312)
53 protein:vir:95107 Length: 270 99.9 7.9E-27 4.9E-30 163.7 14.2 264 1-337 1-270 (270)
54 protein:vir:99523 Length: 311 99.8 2.6E-23 1.6E-26 144.4 16.1 297 1-335 1-311 (311)
55 protein:vir:79712 Length: 285 99.8 1.8E-23 1.1E-26 145.2 14.3 270 1-334 1-285 (285)
56 protein:vir:78090 Length: 302 99.7 9.3E-19 5.8E-22 119.4 15.2 284 1-335 1-302 (302)
57 protein:vir:78523 Length: 338 99.6 1.5E-16 9.4E-20 107.3 15.6 303 1-339 1-338 (338)
58 protein:vir:78223 Length: 333 99.5 2E-15 1.2E-18 101.2 15.9 302 1-336 1-333 (333)
59 protein:vir:7771 Length: 330 # 99.5 4.7E-15 2.9E-18 99.1 15.9 301 1-343 1-330 (330)
60 protein:vir:41 Length: 299 # N 99.4 6.3E-15 3.9E-18 98.5 16.1 282 1-334 1-299 (299)
61 protein:vir:100939 Length: 430 99.4 5.5E-15 3.4E-18 98.8 13.7 293 1-334 1-430 (430)
62 protein:vir:9265 Length: 430 # 99.4 5.5E-15 3.4E-18 98.8 13.7 293 1-334 1-430 (430)
63 protein:vir:2106 Length: 430 # 99.4 7.4E-15 4.6E-18 98.1 12.8 293 1-334 1-430 (430)
64 protein:vir:94771 Length: 298 99.4 3.9E-14 2.4E-17 94.1 15.8 282 1-334 1-298 (298)
65 protein:vir:105905 Length: 304 99.3 4E-14 2.5E-17 94.1 14.6 292 1-332 1-304 (304)
66 protein:vir:94142 Length: 304 99.3 4E-14 2.5E-17 94.1 14.6 292 1-332 1-304 (304)
67 protein:vir:80684 Length: 315 99.3 9.9E-14 6.1E-17 91.9 15.7 294 1-342 1-315 (315)
68 protein:vir:191 Length: 385 # 99.3 3.1E-14 1.9E-17 94.7 12.1 285 1-334 93-385 (385)
69 protein:vir:1886 Length: 385 # 99.3 3.1E-14 1.9E-17 94.7 12.1 285 1-334 93-385 (385)
70 protein:vir:96392 Length: 324 99.3 7.1E-14 4.4E-17 92.7 13.9 288 1-342 18-324 (324)
71 protein:vir:78830 Length: 324 99.3 7.1E-14 4.4E-17 92.7 13.9 288 1-342 18-324 (324)
72 protein:vir:104085 Length: 320 99.3 1.5E-13 9.1E-17 90.9 15.2 294 1-339 1-320 (320)
73 protein:vir:97053 Length: 390 99.3 5E-14 3.1E-17 93.5 12.5 280 1-331 102-390 (390)
74 protein:vir:10364 Length: 390 99.3 1.2E-13 7.7E-17 91.3 14.5 280 1-331 104-390 (390)
75 protein:vir:96223 Length: 324 99.3 1E-13 6.4E-17 91.8 14.1 285 1-342 21-324 (324)
76 protein:vir:4511 Length: 409 # 99.3 2.8E-13 1.8E-16 89.4 16.3 292 1-336 99-409 (409)
77 protein:vir:9309 Length: 324 # 99.3 2E-13 1.2E-16 90.2 15.4 284 1-342 21-324 (324)
78 protein:vir:9574 Length: 300 # 99.3 3.2E-13 2E-16 89.1 16.3 282 1-333 1-300 (300)
79 protein:vir:1638 Length: 298 # 99.3 3.1E-13 1.9E-16 89.1 16.2 284 1-334 1-298 (298)
80 protein:vir:1328 Length: 392 # 99.3 1.4E-13 8.6E-17 91.1 13.9 282 1-334 104-392 (392)
81 protein:vir:8187 Length: 311 # 99.3 3.2E-13 2E-16 89.1 15.0 293 1-336 1-311 (311)
82 protein:vir:103955 Length: 324 99.3 2.9E-13 1.8E-16 89.3 14.2 287 1-346 18-324 (324)
83 protein:vir:6242 Length: 390 # 99.2 4.7E-13 2.9E-16 88.2 15.0 283 1-334 93-390 (390)
84 protein:vir:99749 Length: 324 99.2 3.9E-13 2.4E-16 88.6 14.2 288 1-342 18-324 (324)
85 protein:vir:81070 Length: 390 99.2 4.4E-13 2.8E-16 88.3 14.4 280 1-331 104-390 (390)
86 protein:vir:9759 Length: 303 # 99.2 5.4E-13 3.4E-16 87.8 14.3 288 1-334 1-303 (303)
87 protein:vir:4339 Length: 395 # 99.2 2.2E-12 1.3E-15 84.5 17.1 281 1-333 98-395 (395)
88 protein:vir:96762 Length: 632 99.2 3.7E-13 2.3E-16 88.8 12.7 281 1-332 347-632 (632)
89 protein:vir:9410 Length: 415 # 99.2 9E-13 5.6E-16 86.6 14.4 298 1-343 107-415 (415)
90 protein:vir:97148 Length: 324 99.2 5.9E-13 3.6E-16 87.6 13.2 283 1-346 23-324 (324)
91 protein:vir:81100 Length: 415 99.2 1.2E-12 7.1E-16 86.0 14.5 297 1-343 113-415 (415)
92 protein:vir:98339 Length: 415 99.2 1.2E-12 7.1E-16 86.0 14.5 297 1-343 113-415 (415)
93 protein:vir:79987 Length: 415 99.2 1.2E-12 7.1E-16 86.0 14.5 297 1-343 113-415 (415)
94 protein:vir:100135 Length: 418 99.2 1.4E-12 8.8E-16 85.6 14.8 282 1-336 121-418 (418)
95 protein:vir:94673 Length: 419 99.2 2.3E-12 1.4E-15 84.4 15.1 291 1-335 112-419 (419)
96 protein:vir:4600 Length: 415 # 99.2 2E-12 1.2E-15 84.7 14.7 299 1-343 110-415 (415)
97 protein:vir:4700 Length: 415 # 99.2 2E-12 1.2E-15 84.7 14.7 299 1-343 110-415 (415)
98 protein:vir:8102 Length: 543 # 99.1 2E-12 1.2E-15 84.8 13.5 287 1-334 243-543 (543)
99 protein:vir:99920 Length: 311 99.1 8.8E-12 5.4E-15 81.2 17.0 296 1-339 1-311 (311)
100 protein:vir:102119 Length: 404 99.1 2.4E-12 1.5E-15 84.2 13.8 291 1-337 100-404 (404)
101 protein:vir:4856 Length: 293 # 99.1 5.8E-12 3.6E-15 82.2 15.3 277 1-344 5-293 (293)
102 protein:vir:95763 Length: 297 99.1 3.5E-12 2.1E-15 83.4 13.9 279 1-334 1-297 (297)
103 protein:vir:81227 Length: 413 99.1 4.9E-12 3.1E-15 82.6 14.6 283 1-336 113-413 (413)
104 protein:vir:6212 Length: 434 # 99.1 7.1E-12 4.4E-15 81.7 14.1 294 1-338 131-434 (434)
105 protein:vir:485 Length: 407 # 99.1 4.7E-12 2.9E-15 82.7 12.8 296 1-340 90-407 (407)
106 protein:vir:100247 Length: 425 99.1 6.5E-12 4.1E-15 81.9 13.6 291 1-334 121-425 (425)
107 protein:vir:2344 Length: 397 # 99.1 1E-11 6.3E-15 80.9 14.2 342 1-402 1-369 (397)
108 protein:vir:80376 Length: 435 99.1 4E-11 2.5E-14 77.6 17.3 296 1-338 105-435 (435)
109 protein:vir:104256 Length: 458 99.1 1.1E-11 6.8E-15 80.7 14.2 292 1-338 153-458 (458)
110 protein:vir:2430 Length: 318 # 99.1 2.2E-11 1.4E-14 79.0 15.8 290 1-340 6-318 (318)
111 protein:vir:101607 Length: 379 99.0 1.1E-11 6.7E-15 80.7 13.3 274 1-333 98-379 (379)
112 protein:vir:4997 Length: 397 # 99.0 2.7E-11 1.7E-14 78.5 15.5 283 1-344 98-397 (397)
113 protein:vir:4830 Length: 397 # 99.0 8.7E-11 5.4E-14 75.7 18.1 285 1-344 97-397 (397)
114 protein:vir:95451 Length: 313 99.0 2.3E-12 1.4E-15 84.4 9.2 296 1-335 1-313 (313)
115 protein:vir:108211 Length: 318 99.0 2.7E-11 1.7E-14 78.5 15.0 292 1-336 1-318 (318)
116 protein:vir:1433 Length: 435 # 99.0 7.8E-11 4.9E-14 76.0 17.4 291 1-338 126-435 (435)
117 protein:vir:7409 Length: 408 # 99.0 2.8E-11 1.8E-14 78.4 14.9 290 1-349 105-408 (408)
118 protein:vir:4953 Length: 397 # 99.0 9.8E-11 6.1E-14 75.4 17.7 275 1-344 109-397 (397)
119 protein:vir:100172 Length: 394 99.0 5E-11 3.1E-14 77.1 15.8 282 1-343 103-394 (394)
120 protein:vir:7855 Length: 497 # 99.0 1.9E-11 1.2E-14 79.3 13.2 299 1-337 138-497 (497)
121 protein:vir:101650 Length: 497 99.0 1.9E-11 1.2E-14 79.3 13.2 299 1-337 138-497 (497)
122 protein:vir:95376 Length: 425 99.0 5.1E-11 3.2E-14 77.0 15.3 286 1-337 119-425 (425)
123 protein:vir:81160 Length: 371 99.0 3.8E-11 2.4E-14 77.7 14.6 271 1-333 91-371 (371)
124 protein:vir:3991 Length: 404 # 99.0 1E-10 6.3E-14 75.4 16.7 287 1-346 105-404 (404)
125 protein:vir:8420 Length: 477 # 99.0 8.5E-11 5.3E-14 75.8 16.3 304 1-339 151-477 (477)
126 protein:vir:9361 Length: 402 # 99.0 5.1E-12 3.2E-15 82.5 9.4 273 1-341 124-402 (402)
127 protein:vir:4226 Length: 326 # 99.0 1.3E-10 8E-14 74.8 16.4 302 1-338 1-326 (326)
128 protein:vir:2504 Length: 305 # 98.9 6.2E-11 3.9E-14 76.5 14.5 283 1-342 1-305 (305)
129 protein:vir:5974 Length: 324 # 98.9 8E-11 4.9E-14 75.9 14.1 304 1-366 1-324 (324)
130 protein:vir:93616 Length: 645 98.9 1.2E-10 7.4E-14 75.0 14.6 296 1-342 332-645 (645)
131 protein:vir:105004 Length: 392 98.9 1.9E-10 1.2E-13 73.9 15.7 284 1-342 84-392 (392)
132 protein:vir:102873 Length: 392 98.9 1.9E-10 1.2E-13 73.9 15.7 284 1-342 84-392 (392)
133 protein:vir:107593 Length: 392 98.9 1.9E-10 1.2E-13 73.9 15.7 284 1-342 84-392 (392)
134 protein:vir:102082 Length: 392 98.9 1.9E-10 1.2E-13 73.9 15.7 284 1-342 84-392 (392)
135 protein:vir:4456 Length: 401 # 98.9 2.5E-11 1.5E-14 78.7 10.7 293 1-333 95-401 (401)
136 protein:vir:9927 Length: 295 # 98.9 6.7E-10 4.2E-13 70.9 18.4 280 1-340 1-295 (295)
137 protein:vir:5739 Length: 366 # 98.9 3.6E-10 2.2E-13 72.4 16.8 289 1-333 52-366 (366)
138 protein:vir:93881 Length: 387 98.9 4.2E-11 2.6E-14 77.5 11.4 276 1-341 100-387 (387)
139 protein:vir:3870 Length: 400 # 98.9 2.6E-10 1.6E-13 73.1 15.4 266 1-334 123-400 (400)
140 protein:vir:1084 Length: 437 # 98.9 2.5E-10 1.5E-13 73.3 14.9 284 1-343 145-437 (437)
141 protein:vir:100884 Length: 389 98.9 2.5E-10 1.5E-13 73.3 14.7 281 1-342 99-389 (389)
142 protein:vir:105038 Length: 428 98.8 7.6E-10 4.7E-13 70.6 17.1 288 1-333 113-428 (428)
143 protein:vir:1025 Length: 408 # 98.8 2.2E-10 1.4E-13 73.5 14.1 289 1-349 105-408 (408)
144 protein:vir:1268 Length: 397 # 98.8 9.2E-11 5.7E-14 75.6 12.0 274 1-333 112-397 (397)
145 protein:vir:95875 Length: 401 98.8 1.3E-09 7.9E-13 69.3 17.6 316 1-334 1-401 (401)
146 protein:vir:94424 Length: 387 98.8 4.9E-11 3E-14 77.1 9.0 276 1-341 99-387 (387)
147 protein:vir:2685 Length: 387 # 98.8 4.9E-11 3E-14 77.1 9.0 276 1-341 99-387 (387)
148 protein:vir:96978 Length: 387 98.8 4.9E-11 3E-14 77.1 9.0 276 1-341 99-387 (387)
149 protein:vir:78640 Length: 352 98.8 1.1E-10 6.6E-14 75.3 10.6 275 1-341 73-352 (352)
150 protein:vir:3845 Length: 395 # 98.8 6.1E-10 3.8E-13 71.1 14.4 279 1-346 105-395 (395)
151 protein:vir:9704 Length: 394 # 98.7 6.4E-10 4E-13 71.0 13.5 263 1-337 121-394 (394)
152 protein:vir:102944 Length: 330 98.7 3E-10 1.9E-13 72.8 11.3 307 1-366 1-330 (330)
153 protein:vir:4092 Length: 390 # 98.7 9.3E-10 5.8E-13 70.1 13.6 308 1-363 68-390 (390)
154 protein:vir:1383 Length: 421 # 98.7 1.2E-09 7.6E-13 69.4 13.3 297 1-371 105-421 (421)
155 protein:vir:93696 Length: 364 98.6 6E-09 3.7E-12 65.7 16.1 303 1-347 1-364 (364)
156 protein:vir:105610 Length: 430 98.6 9.1E-09 5.6E-12 64.7 16.5 328 1-356 1-430 (430)
157 protein:vir:1583 Length: 351 # 98.5 3.4E-09 2.1E-12 67.0 11.8 325 1-402 1-344 (351)
158 protein:vir:3158 Length: 321 # 98.5 2.8E-09 1.7E-12 67.5 11.3 303 1-343 1-321 (321)
159 protein:vir:962 Length: 397 # 98.5 6.5E-09 4E-12 65.5 12.6 272 1-333 121-397 (397)
160 protein:vir:2770 Length: 318 # 98.5 3.7E-08 2.3E-11 61.4 15.8 251 1-274 1-318 (318)
161 protein:vir:4197 Length: 314 # 98.4 3.5E-08 2.1E-11 61.5 15.4 298 1-337 1-314 (314)
162 protein:vir:9875 Length: 296 # 98.4 1.1E-07 6.6E-11 58.8 18.1 273 1-334 1-296 (296)
163 protein:vir:101291 Length: 381 98.4 1.8E-08 1.1E-11 63.1 13.0 292 1-346 57-381 (381)
164 protein:vir:9509 Length: 381 # 98.4 1.8E-08 1.1E-11 63.1 13.0 292 1-346 57-381 (381)
165 protein:vir:10123 Length: 404 98.4 1.2E-07 7.5E-11 58.5 16.6 328 1-346 1-404 (404)
166 protein:vir:104439 Length: 404 98.4 1.2E-07 7.5E-11 58.5 16.6 328 1-346 1-404 (404)
167 protein:vir:3298 Length: 404 # 98.4 1.2E-07 7.5E-11 58.5 16.6 328 1-346 1-404 (404)
168 protein:vir:819 Length: 404 # 98.4 1.2E-07 7.5E-11 58.5 16.6 328 1-346 1-404 (404)
169 protein:vir:106647 Length: 303 98.4 1.7E-07 1.1E-10 57.7 17.1 281 1-340 1-303 (303)
170 protein:vir:9643 Length: 377 # 98.3 6.1E-08 3.8E-11 60.1 12.7 283 1-333 59-377 (377)
171 protein:vir:100632 Length: 381 98.2 5E-08 3.1E-11 60.6 11.6 293 1-346 57-381 (381)
172 protein:vir:95963 Length: 395 98.1 2E-07 1.2E-10 57.3 12.5 295 1-351 61-395 (395)
173 protein:vir:4159 Length: 315 # 98.1 1.7E-07 1.1E-10 57.6 11.9 296 1-332 1-315 (315)
174 protein:vir:80128 Length: 466 97.9 9.3E-08 5.7E-11 59.1 7.3 298 1-349 123-466 (466)
175 protein:vir:98635 Length: 377 97.8 9.7E-07 6E-10 53.5 11.9 279 1-333 59-377 (377)
176 protein:vir:78350 Length: 383 97.8 1.9E-06 1.2E-09 52.0 13.3 291 1-341 64-383 (383)
177 protein:vir:79928 Length: 393 97.3 9.2E-06 5.7E-09 48.2 10.7 313 1-348 59-393 (393)
178 protein:vir:4074 Length: 480 # 97.0 6.9E-05 4.3E-08 43.4 12.9 273 1-338 171-480 (480)
179 protein:vir:80446 Length: 367 97.0 0.00012 7.2E-08 42.2 14.0 318 1-362 1-367 (367)
180 protein:vir:97397 Length: 517 95.9 0.0013 7.8E-07 36.5 13.8 274 1-340 229-517 (517)
181 protein:vir:80068 Length: 301 95.6 0.0018 1.1E-06 35.6 16.5 287 1-333 1-301 (301)
182 protein:vir:97255 Length: 310 95.5 0.0019 1.2E-06 35.5 13.2 290 1-333 1-310 (310)
183 protein:vir:94933 Length: 330 93.2 0.0085 5.3E-06 31.9 12.8 293 1-337 25-330 (330)
184 protein:vir:8324 Length: 410 # 91.4 0.0078 4.8E-06 32.2 9.0 274 1-340 85-410 (410)
185 protein:vir:107687 Length: 319 84.2 0.062 3.8E-05 27.2 13.6 290 1-334 4-319 (319)
186 protein:vir:103285 Length: 296 82.5 0.076 4.7E-05 26.7 14.3 277 1-337 1-296 (296)
187 protein:vir:96442 Length: 418 73.0 0.15 9.4E-05 25.1 7.6 301 1-345 61-418 (418)
188 protein:vir:4786 Length: 295 # 72.1 0.19 0.00012 24.6 11.7 262 9-321 1-295 (295)
189 protein:vir:94989 Length: 349 70.1 0.21 0.00013 24.3 12.5 316 1-366 1-349 (349)
190 protein:vir:79548 Length: 652 67.7 0.25 0.00015 23.9 9.3 291 1-332 348-652 (652)
191 protein:vir:99424 Length: 360 67.6 0.25 0.00016 23.9 9.7 302 1-336 1-360 (360)
192 protein:vir:95512 Length: 693 65.9 0.28 0.00017 23.6 12.2 290 1-324 366-693 (693)
193 protein:vir:104342 Length: 314 65.0 0.29 0.00018 23.5 12.5 286 1-337 3-314 (314)
194 protein:vir:78387 Length: 349 62.8 0.33 0.0002 23.2 13.7 314 1-366 1-349 (349)
195 protein:vir:103370 Length: 418 56.6 0.45 0.00028 22.5 7.1 297 1-345 61-418 (418)
196 protein:vir:5942 Length: 523 # 52.6 0.55 0.00034 22.0 12.4 283 1-335 219-523 (523)
197 protein:vir:94528 Length: 286 50.7 0.6 0.00037 21.8 15.2 264 1-334 1-286 (286)
198 protein:vir:3969 Length: 287 # 35.0 1.3 0.00078 20.0 14.5 262 17-334 1-287 (287)
199 protein:vir:96079 Length: 382 32.0 1.5 0.0009 19.7 15.7 298 1-332 61-382 (382)
200 protein:vir:103181 Length: 457 26.7 1.9 0.0012 19.0 11.0 307
1-359 97-457 (457)
201 protein:vir:107732 Length: 379 25.6 2 0.0013 18.9 12.7 292 1-331 52-379 (379)
202 protein:vir:5670 Length: 514 # 25.0 2.1 0.0013 18.8 12.9 319 1-363 142-514 (514)

No 1
>protein:vir:97031 Length: 402 # NCBI annotation: 31 # Family: family:all:2806 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654132;genbank:gi:108862016;genbank:GeneID:5075980
Probab=100.00  E-value=2.6e-153  Score=857.13  Aligned_cols=402  Identities=100%  Similarity=1.362  Sum_probs=389.0

Q ss_pred             CCCCcccccccccccccHHHHHHHHHhHHHHHHHHHHhhhcccceeeeccccceEEeeeccceeeeeecCCCCCCCCCcc
Q lcl|Aclame:pro    1 MSTPNTLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYLGETELQVLAPGQSPNATPTQ   80 (402)
Q Consensus         1 Ms~~n~~t~~~~~~~~d~~alfle~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~~iG~~t~~~~~~G~~i~~~~~~   80 (402)
                      ||++|++|||+|++++++++||||+|+|||+|+|+|+|+|+++|++|+|++|||+|||++|+++++||+||++|++++++
T Consensus         1 Ms~~n~~t~~~~~~s~~~~al~le~f~geV~taF~~~si~~~~~~vrti~~GkS~qf~~iG~~~a~y~~~G~~ldg~~~~   80 (402)
T protein:vir:97    1 MSTPNTLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYLGETELQVLAPGQSPNATPTQ   80 (402)
T ss_pred             CCCcccccccccccccchhhhhhhhhhhhHHHHHHHHHhhcCcceeeeecccceEEEEEEeeeEEeeeccccccCCCCcc
Confidence            99999999999999999999999999999999999999999999999999999999999999999999999999999999

Q ss_pred             ccceeEeecceeeccchhhhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccccccccccccc
Q lcl|Aclame:pro   81 ADKNQLVIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKGHGFSIN  160 (402)
Q Consensus        81 ~~e~~itID~~lya~~~IddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~g~~~~~~  160 (402)
                      ++|++|+||++||++++||||||+|+|||+||+||++|+|++||+++||++++++..++++.......++...+.+....
T Consensus 81 ~~k~~ItID~lL~a~~~V~diDeaq~~yD~vRse~s~e~G~ALA~~~Dq~ii~~i~~aa~a~t~~~~~~~~~~~~g~s~~ 160 (402) T protein:vir:97 81 ADKNQLVIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKGHGFSIN 160 (402) T ss_pred cccEEEEeCceeechhhhhhHHHHHhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccCcccccccccc Confidence 99999999999999999999999999999889999999999999999999999999988877666666666666666666 Q ss_pred cccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccccCcccccceEEEEeccEE Q lcl|Aclame:pro 161 VNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATINGFVLSSYNCPV 240 (402) Q Consensus 161 v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~~~~G~V~~iaG~~V 240 (402) +....+..++++++|+++|++++++|||+|||.+|||++|+|+|||+|++++||+|++|+.++++.+.+|+|++++||+| T Consensus 161 ~~~t~~~a~~~~~~l~~ai~~a~~~LdEkdVP~~dRv~vv~P~~y~~Ll~~~rl~n~d~~~~~~g~~~~G~v~~v~Gv~V 240 (402) T protein:vir:97 161 VNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATINGFVLSSYNCPV 240 (402) T ss_pred cccccchhhcCHHHHHHHHHHHHHHHHhcCCCccccEEEeChHHHHHHhhcccccchhhccccCCccccceeEEEeceEE Confidence 66666777889999999999999999999999999999999999999999999999999888888899999999999999 Q ss_pred EecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccchhHHHHHHHHHHHhcCccc Q lcl|Aclame:pro 241 IPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKEKTYYIDTFMAEGAIPD 320 (402) Q Consensus 241 ~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~~~~d~i~~~~a~Ga~vl 320 (402) |+|||||++++.+++|.++++++|++|+|++|+++++|++|||+|++++|++++++|.|||++||+|+|+++|+|||+++ T Consensus 241 v~SnnlP~~a~~it~~~ls~a~~G~~y~~t~d~t~~~~~~f~~~Av~tvk~~~vT~~~~~d~r~~~~~id~~~a~G~g~~ 320 (402) T protein:vir:97 241 IPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKEKTYYIDTFMAEGAIPD 320 (402) T ss_pred EecCccccccccccccccccCCCCccCCcCcccceeEEEEEecceEEEEEeeccccchhhchhHHHHHHHHHHHhCCccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred 
ccceEEEEEEeeccCccccccchhhHHHhhhcccceEEEeecchhhhhhhhcccccchhHHHHHHHHHHhhcccccccCC Q lcl|Aclame:pro 321 RWEAVSVVTTKRDATTGDAGGPGDDHATVLARAQRKAVYVKTEGAAAAFSAAPAGIQAEDLVAAVRAVMANDIKPTAMKP 400 (402) Q Consensus 321 RPeaa~vv~~~~~~t~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 400 (402) ||||+++|+++.+.|++++++.+++|++++||+|||+++|++++.+++|+++|+|||+||||++||++|+|+||||+|+| T Consensus 321 RPeaa~vv~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 400 (402) T protein:vir:97 321 RWEAVSVVTTKRDATTGDAGGPGDDHATVLARAQRKAVYVKTEGAAAAFSAAPAGIQAEDLVAAVRAVMANDIKPTAMKP 400 (402) T ss_pred CccceEEEEEecccccccCCccccchhhhhcccccceEEEeccccchhccccccccchHHHHHHHHHHHhccccccccCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CC Q lcl|Aclame:pro 401 TE 402 (402) Q Consensus 401 ~~ 402 (402) || T Consensus 401 ~~ 402 (402) T protein:vir:97 401 TE 402 (402) T ss_pred CC Confidence 99 No 2 >protein:vir:105645 Length: 400 # NCBI annotation: putative major capsid protein # Family: family:all:2806 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425009;genbank:gi:83571757;uniprot:Q2WC43;genbank:GeneID:3837286 Probab=100.00 E-value=2.1e-149 Score=835.74 Aligned_cols=400 Identities=84% Similarity=1.204 Sum_probs=384.5 Q ss_pred CCCCcccccccccccccHHHHHHHHHhHHHHHHHHHHhhhcccceeeeccccceEEeeeccceeeeeecCCCCCCCCCcc Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYLGETELQVLAPGQSPNATPTQ 80 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~alfle~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~~iG~~t~~~~~~G~~i~~~~~~ 80 (402) ||++|++|||+|+++|++++||||+|+|||+|+|+|+|||+++|++|+|++|||+||+++|+++++||+||++|++++++ T Consensus 1 Ms~~n~~t~p~~~gsg~~~aL~Le~f~GeV~taF~~~si~~~~~~vRtI~~gkS~qf~~lG~s~a~y~~pG~~ldg~~~~ 80 (400) T protein:vir:10 1 MSTPNNLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENIMSYFDVQTVTGTNTVSNKYLGETELQVLAPGQSPAATSTQ 80 (400) T ss_pred CCCCccccccccccccchhhhHHhHhcchHHHHHHHHhhhcccceeeeecccceEEEEEeeeeEEeeecCCCCcCCCCcc Confidence 
99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccceeEeecceeeccchhhhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccccccccccccc Q lcl|Aclame:pro 81 ADKNQLVIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKGHGFSIN 160 (402) Q Consensus 81 ~~e~~itID~~lya~~~IddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~g~~~~~~ 160 (402) ++|++|+||++||++++||||||||+|||.||+||++|+|++||++|||++++++.+++++.......++.+.+++.+.+ T Consensus 81 ~dk~~ItIDtLL~a~~~V~dlDd~q~~yD~vRse~s~e~G~ALA~~~Dq~iiq~i~~a~~a~t~~~~~~~~g~~~g~s~~ 160 (400) T protein:vir:10 81 ADKNQLVIDATVIARNTVAHLHDVQGDIDSLKPKLATNQAKQLKKMEDEMLIQQMLLGGIANTQAKRTNPRVKGHGFSVN 160 (400) T ss_pred cCcEEEEeCceeeecchhhhHHHHhhccccccHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCCcccccccee Confidence 99999999999999999999999999999789999999999999999999999999998776666667777777788888 Q ss_pred cccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccccCcccccceEEEEeccEE Q lcl|Aclame:pro 161 VNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATINGFVLSSYNCPV 240 (402) Q Consensus 161 v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~~~~G~V~~iaG~~V 240 (402) +.........++++|+++|++++++|+||+||.+++++++||.+|++|++++||+|++|+.+++|.+.+|+|.+++||+| T Consensus 161 v~~~~~~~~~~~~~l~~A~~~A~~~LdEkdVP~~d~vvl~pp~~Ys~Ll~~dkLvnrdf~~s~~g~~~~g~v~~v~Gv~I 240 (400) T protein:vir:10 161 VEVNEGEALVNPQYVMAAVEFALEQQLEQEVDISDVAILMPWRYFNVLRDADRIVDKSYTISQSGATIQGFVLSSYNCPV 240 (400) T ss_pred ecccccccccCHHHHHHHHHHHHHHHHhcCCCccceEEEcCHHHHHHHHhCCcccchhccccCCCccccceEEEEeceEE Confidence 87777778889999999999999999999999877777778888889999999999999988888899999999999999 Q ss_pred EecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccchhHHHHHHHHHHHhcCccc Q lcl|Aclame:pro 241 IPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKEKTYYIDTFMAEGAIPD 320 (402) Q Consensus 241 ~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~~~~d~i~~~~a~Ga~vl 320 (402) 
|+|||||++++.+++|.++++++|++|+|++|+++++|++|||+|+++||++||++|.|||+++|+|+|+++|+|||+++ T Consensus 241 v~Sn~lP~~a~~~~~~~lS~a~~G~~y~~t~d~s~~~av~F~~sAv~tvk~~~lt~~~~~d~r~~~~~id~~~a~G~g~~ 320 (400) T protein:vir:10 241 IPSNRFPKYSQGQKHHLLSNEDNGYRYDPIAEMNGAIAVLFTADALLVGRSIDVIGDIFYEKKEKTYYIDTFMSEGAIPD 320 (400) T ss_pred EeeCcCCcccCcccccccccCCCCccCCccccccceeEEEEehhheEEEEeeccccccccchhhHHHHHHHHHHhCCccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccceEEEEEEeeccCccccccchhhHHHhhhcccceEEEeecchhhhhhhhcccccchhHHHHHHHHHHhhcccccccCC Q lcl|Aclame:pro 321 RWEAVSVVTTKRDATTGDAGGPGDDHATVLARAQRKAVYVKTEGAAAAFSAAPAGIQAEDLVAAVRAVMANDIKPTAMKP 400 (402) Q Consensus 321 RPeaa~vv~~~~~~t~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 400 (402) ||||+++|+++.+.|+++++.++++|+.|++|+|||++|+|.+. .+|+.+|+++++||||+|||++|+|+||||+|+| T Consensus 321 RPeaa~vv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 398 (400) T protein:vir:10 321 RWEAVSVVTTKRQSTGAVDSGNAAQHTQVLNRAQRKAVYVKNAA--PAGAFAAASLSAEDLVAAVRAVMANDIKPTAMKP 398 (400) T ss_pred chhheEEEEecCCcccccccCcchhHHHHHhhcccceEEEeccc--ccccccccccchHHHHHHHHHHHhccccccccCC Confidence 99999999999999999999999999999999999999999987 5688899999999999999999999999999999 Q ss_pred CC Q lcl|Aclame:pro 401 TE 402 (402) Q Consensus 401 ~~ 402 (402) || T Consensus 399 ~~ 400 (400) T protein:vir:10 399 TE 400 (400) T ss_pred CC Confidence 99 No 3 >protein:vir:7019 Length: 401 # NCBI annotation: major capsid protein # Family: family:all:2806 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853592;genbank:gi:31711674;genbank:GeneID:1481800 Probab=100.00 E-value=5.7e-145 Score=811.38 Aligned_cols=399 Identities=84% Similarity=1.212 Sum_probs=379.1 Q ss_pred CCCCcccccccccccccHHHHHHHHHhHHHHHHHHHHhhhcccceeeeccccceEEeeeccceeeeeecCCCCCCCCCcc Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYLGETELQVLAPGQSPNATPTQ 80 (402) Q Consensus 1 
Ms~~n~~t~~~~~~~~d~~alfle~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~~iG~~t~~~~~~G~~i~~~~~~ 80 (402) ||++|++|||+|+++|++++||||+|+|||+|+|+|+|||+++|++|+|++|||+|||++|+++++||+||++|++++++ T Consensus 1 Ms~~n~~t~~~~~~sg~~~al~Le~f~GeV~taF~~~si~~~~~~vRti~~gkS~qf~~~G~s~~~~~~pG~~ld~~~~~ 80 (401) T protein:vir:70 1 MSTPNNLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENIMSYFDVQTVTGTNTVSNKYLGETELQVLAPGQSPAATSTQ 80 (401) T ss_pred CCCCccccccccccccchhHhHHhHhcchHHHHHHHHhhhcccceeeeecccceEEEEEeeeeEeeeecCCCCcCCCCcc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccceeEeecceeeccchhhhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccccccccccccc Q lcl|Aclame:pro 81 ADKNQLVIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKGHGFSIN 160 (402) Q Consensus 81 ~~e~~itID~~lya~~~IddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~g~~~~~~ 160 (402) ++|++|+||++||++++||||||+|+|||.||+||++|+|++||++|||+++|++..++++.+.+...++.+.+++...+ T Consensus 81 ~dK~~ItID~lL~a~~~V~dlDe~q~~yD~vRse~s~e~G~ALA~~~Dq~iiq~i~~aa~ana~~~~~~p~~~~~G~~i~ 160 (401) T protein:vir:70 81 ADKNQLVIDATVIARNTVAHLHDVQGDIDSLKPKLATNQAKQLKRMEDEMLIQQMMLGGIANTQAKRTNPRVKGHGFSIN 160 (401) T ss_pred cccEEEEeCceeehhhhhhhHHHHHhcccccchHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccCCCcCCCceEEe Confidence 99999999999999999999999999999889999999999999999999999999999877777777788888888888 Q ss_pred cccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccccCcccccceEEEEeccEE Q lcl|Aclame:pro 161 VNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATINGFVLSSYNCPV 240 (402) Q Consensus 161 v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~~~~G~V~~iaG~~V 240 (402) +.....+.+.++++|+++|++++.+|||||||.+++++++||.+|++|+++++++|++|+.+++|.+.+|+|++++||+| T Consensus 161 v~~~~~~~~~~~~~l~~ai~dA~~~LdEkdVP~~r~vvl~pp~~Ys~Ll~~d~L~nrd~~~s~~g~~~~G~v~~vaGv~V 240 (401) T protein:vir:70 161 VEVAEGEALVNPQYVMAAVEFALEQQLEQEVDISDVAILMPWRYFNVLRDADRIVDKTYTISQSGATIQGFTLSSYNCPV 240 (401) T ss_pred 
ccccccccccCHHHHHHHHHHHHHHHHhcCCCccceEEEcCHHHHHHHHhcCcccchhhccccCCccccceEEEEeceEE Confidence 88777778899999999999999999999999664445557788889999999999999988888999999999999999 Q ss_pred EecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccchhHHHHHHHHHHHhcCccc Q lcl|Aclame:pro 241 IPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKEKTYYIDTFMAEGAIPD 320 (402) Q Consensus 241 ~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~~~~d~i~~~~a~Ga~vl 320 (402) |+|||||+++..+++|.++++++|++|+|++|+++++|++|||+|+++||++||++|.|||+++|+|+|++||+|||+++ T Consensus 241 v~SnnlP~~a~~it~~~ls~a~~G~~y~~~~d~s~~~~v~f~~~Av~tvk~~~lt~~~~~d~r~~~~~id~~~a~g~g~~ 320 (401) T protein:vir:70 241 IPSNRFPKYSQGQTHHLLSNEDNGYRYDPLPAMNGAIAVLFTADALLVGRSIDVTGDIFYEKKEKTYYIDTFMAEGAIPD 320 (401) T ss_pred EeeccccccccccccccccccCCCccCCCCccccceeEEEEehhheEEEEeeccccchhhhhhhhHHHHHHHHHhCCccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccceEEEEEEeeccCc-cccccchhhHHHhhhcccceEEEeecchhhhhhhhcccccchhHHHHHHHHHHhhcccccccC Q lcl|Aclame:pro 321 RWEAVSVVTTKRDATT-GDAGGPGDDHATVLARAQRKAVYVKTEGAAAAFSAAPAGIQAEDLVAAVRAVMANDIKPTAMK 399 (402) Q Consensus 321 RPeaa~vv~~~~~~t~-~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 399 (402) ||||+++|+++.+.++ ...+.+..+|+.+++|+|||+++|+.+. 
++.++|+|++|||||+|||++|+|+||||+|+ T Consensus 321 RPeaa~vv~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 397 (401) T protein:vir:70 321 RWEAVSVVTTKRNTTTGAVEGTDGAQHTIVKNRAQRKAVYVKNAA---PVAAAAASLSAEDLVAAVRAVMANDIKPTALK 397 (401) T ss_pred chhheEEEeecCcccccccccCCcchhhhhhhhccceeEEecccc---chhhhccccchHHHHHHHHHHHhccccccccC Confidence 9999999999986544 4558888999999999999999999887 78899999999999999999999999999999 Q ss_pred CCC Q lcl|Aclame:pro 400 PTE 402 (402) Q Consensus 400 ~~~ 402 (402) ||| T Consensus 398 ~~~ 400 (401) T protein:vir:70 398 PTE 400 (401) T ss_pred cCC Confidence 999 No 4 >protein:vir:103323 Length: 364 # NCBI annotation: major capsid-like protein # Family: family:all:2806 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039668;genbank:gi:125999997;genbank:GeneID:4818399 Probab=100.00 E-value=1.5e-130 Score=732.35 Aligned_cols=358 Identities=66% Similarity=1.010 Sum_probs=337.3 Q ss_pred CCCCcccccccccccccHHHHHHHHHhHHHHHHHHHHhhhcccceeeeccccceEEeeeccceeeeeecCCCCCCCCCcc Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYLGETELQVLAPGQSPNATPTQ 80 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~alfle~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~~iG~~t~~~~~~G~~i~~~~~~ 80 (402) ||++|++|||+|++++++++||||+|+|||+|+|+|+|+|+++|++|+|++|||+|||++|+++++||+||++|++++++ T Consensus 1 ms~~n~~t~~~~~~~~~~~al~le~f~geV~taf~~~s~~~~~~~~rti~~gkS~q~~~iG~~~~~~~~~G~~ld~~~~~ 80 (364) T protein:vir:10 1 MSNPNVLTQPAVSASGEVDSLLIEKFNNRVHEQYLKGENLLQWFDVQEVVGTNSVSNKYIGETELQVLSPGKSPDASPTE 80 (364) T ss_pred CCCcccccccccccccchhhhhhhhhhhhHHHHHHHHHhhcCcceeeeecccceEEeeeeeeeEEeeeccCcccCCCCcc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccceeEeecceeeccchhhhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccccccccccccc Q lcl|Aclame:pro 81 ADKNQLVIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKGHGFSIN 160 (402) Q Consensus 81 
~~e~~itID~~lya~~~IddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~g~~~~~~ 160 (402) ++|++|+||++||++++||||||+|+|||.||+||++|+||+||+++||+|++++.+++.++.......+...+++.... T Consensus 81 ~~k~~itID~ll~a~~~V~diDe~q~~~D~vR~e~s~e~G~ALA~~~Dq~i~~~v~~aa~a~~~~~~~~~~~~~~g~~i~ 160 (364) T protein:vir:10 81 FDKNRLVVDTTVIARNTVAHFHDVQNDIDGLKSKLSVNQAKKLKKMEDSMVIQQLVLGGISNTEAIRKNPRVAGHGFSIH 160 (364) T ss_pred cCcEEEEecceeeechhhhhHHHHhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccCCcccCCcceee Confidence 99999999999999999999999999999889999999999999999999999998888777666555555555555555 Q ss_pred cccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccccCcccccceEEEEeccEE Q lcl|Aclame:pro 161 VNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATINGFVLSSYNCPV 240 (402) Q Consensus 161 v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~~~~G~V~~iaG~~V 240 (402) +.......+++++.|+++|++++++|||+|||.+|||+||+|++||+||+++||+|++|+.++++.+.+|+|++++||+| T Consensus 161 ~~~~a~~~~~~~~~l~~ai~~a~~~LdEkdVP~~~R~~vv~P~~y~~Ll~~~~lvn~d~~~~~~~~~~~G~v~~v~Gv~V 240 (364) T protein:vir:10 161 IVGLASSFLTSPQYMMAAIEMAMEQQTEQEVDTSELCGLMPWTAFNCLRDADRIVDKSYTIAASDNTVDGFVLKSWNTPI 240 (364) T ss_pred ecccCcchhhhHHHHHHHHHHHHHHHhhcCCCccccEEEeChHHHHHHhcCCccccccccccCCCccccceeEEEeceEE Confidence 55556667889999999999999999999999999999999999999999999999999988888899999999999999 Q ss_pred EecCcccccc------CccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccchhHHHHHHHHHHH Q lcl|Aclame:pro 241 IPSNRFPTFA------QDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKEKTYYIDTFMA 314 (402) Q Consensus 241 ~~SNnlP~~~------~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~~~~d~i~~~~a 314 (402) |+|||||+.+ +.+++|+++++++|++|++.+|+++++|++|||+|+++||++++++|.||++++|+|+|+++|+ T Consensus 241 v~Sn~lP~~~~~~~~t~~~t~h~ls~~~~g~~y~v~~d~~~~~~~~f~~~Al~tv~~~~~t~e~~~~~~~~~~~ida~~a 320 (364) T protein:vir:10 241 VPSNRFPKLSDNTEGTGNTKHHKLSNAGNGNRYDVTAGQTSAQAVLFTQDALLVGRTISITGDIFYEKKEKTWYIDTFLA 320 (364) T ss_pred 
EeccccccccccccccccccccccccccCCcccccccccceeEEEEEecceEEEEEEecceeeeeeccceeeeeeeeehc Confidence 9999999754 3578999999999999999999999999999999999999999999999999999999999999 Q ss_pred hcCcccccceEEEEEEeeccCccccccchhhHHHhhhcccceEEEeecch Q lcl|Aclame:pro 315 EGAIPDRWEAVSVVTTKRDATTGDAGGPGDDHATVLARAQRKAVYVKTEG 364 (402) Q Consensus 315 ~Ga~vlRPeaa~vv~~~~~~t~~~a~~~~~~~~~~~~~~~~~~~~~~~~~ 364 (402) |||+++|||||++| +.+.+.++++||++|++|+|||++++|+-- T Consensus 321 ~G~g~lRPeaa~~i------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 364 (364) T protein:vir:10 321 EGAIPDRWEAVAVV------TAADTAELATDHNAILARANRKVTLTKSVN 364 (364) T ss_pred ccCcccCccceEEE------EecCCCCCccchhhhhhhccccEEEEEecC Confidence 99999999999999 557778899999999999999999999877 No 5 >protein:vir:6324 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877471;genbank:gi:33300843;uniprot:Q7Y2D3;genbank:GeneID:1482613 Probab=100.00 E-value=2.5e-118 Score=665.34 Aligned_cols=331 Identities=18% Similarity=0.234 Sum_probs=303.2 Q ss_pred CCCCcccccccccccccHHHHHHHHHhHHHHHHHHHHhhhcccceeeeccccceEEeeeccceeeeeecCCCCCCCCCcc Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYLGETELQVLAPGQSPNATPTQ 80 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~alfle~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~~iG~~t~~~~~~G~~i~~~~~~ 80 (402) ||+||++|||+|+|++++++||||+|+|||+|+|+|+|||++++++|+|++|||+|||++|+++++||+||++|++++++ T Consensus 1 ms~~~~~tr~~~~~s~~d~al~le~f~geV~~af~~~s~~~~~~~~rti~~g~s~~~~~iG~~~~~~~~pG~~l~~~~~~ 80 (335) T protein:vir:63 1 MSFLNDLTRPNYAGKNADVDIHLEEHLGIVDKHFAYTSKFAPLMNIRDLRGSNVVRLDRLGNVEAKGRRAGEELERSRVV 80 (335) T ss_pred CCCcccchhhhcccccchhheehhhhhhhHHHHHHhhhhhccccceeeeccceeEEEeeeeeeeeecccCCcCcCCCCcc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccceeEeecceeeccchhhhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccccccccccccc Q lcl|Aclame:pro 81 
ADKNQLVIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKGHGFSIN 160 (402) Q Consensus 81 ~~e~~itID~~lya~~~IddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~g~~~~~~ 160 (402) ++|++|+||++||++++||||||||+||| +|+||++|+||+||+++||+++|+++|||++.++....+.+.+|++.... T Consensus 81 ~~k~~itVD~ll~a~~~I~dlDe~~~~yD-vRse~s~e~G~aLA~~~D~~~~~~i~~aa~~~a~~~~~~~~~~G~~~~~~ 159 (335) T protein:vir:63 81 NDKWNLTVDTLLYLRHQFDHQDEWTQSFD-MRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVDLEDAFSPGVLEKLD 159 (335) T ss_pred ccceEEEecceeechhhhhhHHHHhcCch-hHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccCCCcCCCcceeee Confidence 99999999999999999999999999999 99999999999999999999999999999999999988877777655555 Q ss_pred cccCCccccccHHHHHHHHHHHHHHHHhhcCCcc---CcEEEeChHHHHHHhcccchhhcccccccC-cccccceEEEEe Q lcl|Aclame:pro 161 VNVTESEALANPQYVMAAVEYALEQQLEQEVDIS---DVAIMMPWKFFNALRDADRIVDKTYTISQS-GATINGFVLSSY 236 (402) Q Consensus 161 v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~---gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~-g~~~~G~V~~ia 236 (402) ++ +.+...++++|+++|++++++|+|+|||++ |||+||+|++||+|++++||+|++|+.+++ +.+.+|+|++++ T Consensus 160 ~t--g~~~~~~~~~l~~a~~~a~~~L~e~dVP~~~~~dr~~vv~P~~y~~Ll~~~~l~n~~~~~s~~~~~~~~g~v~~v~ 237 (335) T protein:vir:63 160 LT--GLTAKQAADKIVRMHRRVVETFIDRDLGDAVYSEGLTPMSPRVFSLLLEHDKLMNVEYQATGATNDYVKSRVAILN 237 (335) T ss_pred ec--cCcccccHHHHHHHHHHHHHHHHhccCCCcccCceEEEeChHHHHHHhccccccccccccccccccccCceeEEee Confidence 44 344456899999999999999999999964 599999999999999999999999986654 458999999999 Q ss_pred ccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccchhHHHHHHHHHHHhc Q lcl|Aclame:pro 237 NCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKEKTYYIDTFMAEG 316 (402) Q Consensus 237 G~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~~~~d~i~~~~a~G 316 (402) ||+|++|||||+++ +++|.+++++++ |++|+++.++++||++|++++|++++++|.||++++|+|+|+++|+|| T Consensus 238 Gv~V~~sn~lP~~~--~t~~~lg~a~n~----~~~d~~~~~~~~~~~~Al~t~~~~~vt~e~~~~~~~~~~~i~~~~a~G 311 (335) T protein:vir:63 238 
GVKVLETPRFATKA--IAAHPLGRHFNV----SAEESERQIALFLPSKTLITAQVAPVQAKLWEDNEKFSWVLDTFQMYN 311 (335) T ss_pred ceEEEeeccCCCCC--cccccccccCCc----cccccceeEEEEEecceEEEEEEeecccceeeccchhhHHhHHHHHcC Confidence 99999999999865 788888766655 556788999999999999999999999999999999999999999999 Q ss_pred CcccccceEEEEEEeeccCccccccch Q lcl|Aclame:pro 317 AIPDRWEAVSVVTTKRDATTGDAGGPG 343 (402) Q Consensus 317 a~vlRPeaa~vv~~~~~~t~~~a~~~~ 343 (402) |+++|||||++|++++- |+-+-+| T Consensus 312 ~g~lRPe~a~~i~~tg~---~~~~~~~ 335 (335) T protein:vir:63 312 IGARRPDTAGAIELKGI---GAFDITA 335 (335) T ss_pred CcccccceEEEEEEcCC---CceeecC Confidence 99999999999998641 2222222 No 6 >protein:vir:78935 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522824;genbank:gi:158345059;genbank:GeneID:5687425 Probab=100.00 E-value=2.9e-117 Score=659.49 Aligned_cols=331 Identities=17% Similarity=0.225 Sum_probs=304.7 Q ss_pred CCCCcccccccccccccHHHHHHHHHhHHHHHHHHHHhhhcccceeeeccccceEEeeeccceeeeeecCCCCCCCCCcc Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYLGETELQVLAPGQSPNATPTQ 80 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~alfle~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~~iG~~t~~~~~~G~~i~~~~~~ 80 (402) ||+||++|||+|+|++++++||||+|+|||+|+|+|+|||++++++|+|++|||+|||++|+++++||+||++|++++++ T Consensus 1 ms~~~~~t~~~~~~s~~d~al~le~f~geV~~af~~~s~~~~~~~~rti~~g~s~~~~~iG~~~~~~~~pG~~l~~~~~~ 80 (335) T protein:vir:78 1 MSFLNDLTRPNYAGKNADVDIHLEEHLGIVDKHFAYTSKFAPLMNIRDLRGSNVVRLDRLGNVEAKGRRAGEELERSRVV 80 (335) T ss_pred CCccccccccccccccchhhhhhhhhhhHHHHHHHHhhhhccccceeeeccceeEEEeeeeeeeecccccCcccCCCCcc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccceeEeecceeeccchhhhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccccccccccccc Q lcl|Aclame:pro 81 ADKNQLVIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKGHGFSIN 160 (402) Q Consensus 81 
~~e~~itID~~lya~~~IddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~g~~~~~~ 160 (402) ++|++|+||++||++++||||||||+||| +|+||++|+|++||+++||+++|++++|+++.+|...++.+.+|+..... T Consensus 81 ~~k~~itID~ll~a~~~VddlDe~~~~yD-vR~e~s~~~G~aLA~~~Dq~~~~~l~~aa~~~a~~~~~~~~~~G~~~~~~ 159 (335) T protein:vir:78 81 NDKWNLTVDTLLYLRHQFDHQDEWTQSFD-MRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVDLEDAFSPGVLEKLD 159 (335) T ss_pred cCCeEEEecceeechhhHhhHHHhhcCch-hHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccCCCcCCCcceeee Confidence 99999999999999999999999999999 89999999999999999999999999999999999998888877666555 Q ss_pred cccCCccccccHHHHHHHHHHHHHHHHhhcCCcc---CcEEEeChHHHHHHhcccchhhcccccccC-cccccceEEEEe Q lcl|Aclame:pro 161 VNVTESEALANPQYVMAAVEYALEQQLEQEVDIS---DVAIMMPWKFFNALRDADRIVDKTYTISQS-GATINGFVLSSY 236 (402) Q Consensus 161 v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~---gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~-g~~~~G~V~~ia 236 (402) ++ +.+...+++.|+++++++.++|+|+|||+. |||+||+|++||+|++++||+|++|+.+++ +.+.+|+|++++ T Consensus 160 ~t--g~~~~~~~~~l~~a~~~a~~~l~ekdvP~~~~~~rv~vv~P~~y~~Ll~~~~l~n~~~~~s~~~~~~~~g~v~~v~ 237 (335) T protein:vir:78 160 LT--GLTAKEAAEKIVRMHRRVVETFIERDLGDAVYSEGLTPMSPRVFSLLLEHDKLMSVEYQATGATNDYVKSRVAILN 237 (335) T ss_pred ec--cccccccHHHHHHHHHHHHHHHHhccCCCCCCCccEEEeChHHHHHHhcccccccccccccccccccccceeEEee Confidence 54 345567899999999999999999999964 799999999999999999999999986654 458999999999 Q ss_pred ccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccchhHHHHHHHHHHHhc Q lcl|Aclame:pro 237 NCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKEKTYYIDTFMAEG 316 (402) Q Consensus 237 G~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~~~~d~i~~~~a~G 316 (402) ||+|++|||||+++ +++|.+++++++.+ .|+++.++++||++|+++||++++++|.||++++|+|+|+++|+|| T Consensus 238 Gv~V~~Sn~lP~~~--~t~~~lg~a~n~~~----~d~~~~~~~~~~~~Al~t~~~~~~~~e~~~~~~~~~~~i~~~~a~G 311 (335) T protein:vir:78 238 GVKVLETPRFATKA--ISAHPLGRHFNVSA----EEAERQIALFLPSKTLITAQVAPVQAKLWEDHDQFSWVLDTFQMYN 311 (335) T ss_pred 
ceEEEeeccCCCCC--CccccccccCCccc----ccccceEEEEEecceEEEEEEEecccceeeccchhhHhhhHHHHcC Confidence 99999999999865 88898887777655 4778999999999999999999999999999999999999999999 Q ss_pred CcccccceEEEEEEeeccCccccccch Q lcl|Aclame:pro 317 AIPDRWEAVSVVTTKRDATTGDAGGPG 343 (402) Q Consensus 317 a~vlRPeaa~vv~~~~~~t~~~a~~~~ 343 (402) |+++|||||++|++++-. +-+-+| T Consensus 312 ~g~lRPe~a~~i~~tg~~---~~~~~~ 335 (335) T protein:vir:78 312 IGARRPDTAGAIELKGIE---AFDITA 335 (335) T ss_pred CcccCcceEEEEEecCCC---cccccC Confidence 999999999999987632 111111 No 7 >protein:vir:80213 Length: 334 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522884;genbank:gi:158345177;genbank:GeneID:5687476 Probab=100.00 E-value=1.1e-115 Score=650.79 Aligned_cols=327 Identities=22% Similarity=0.274 Sum_probs=307.4 Q ss_pred CCCC--cccccccccccccHHHHHHHHHhHHHHHHHHHHhhhcccceeeeccccceEEeeeccceeeeeecCCCCCCCCC Q lcl|Aclame:pro 1 MSTP--NTLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYLGETELQVLAPGQSPNATP 78 (402) Q Consensus 1 Ms~~--n~~t~~~~~~~~d~~alfle~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~~iG~~t~~~~~~G~~i~~~~ 78 (402) ||++ |+++||+|+|++++++||||+|+|||+++|+|+|||++++++|+|++|||+|||+||+++++||+||++|++++ T Consensus 1 m~~~~~~~~t~~~~~~~~~~~~l~le~~~geV~~af~~~s~~~~~~~~r~i~~G~s~~~~~iG~~~~~~~~~g~~l~~~~ 80 (334) T protein:vir:80 1 MTYPAANTHTRPGWGGANSDVSLHIEEHLGLVDASFMYSSKFASWMNVRSLRGTNQLRVDRVGASTIAGRKAGEELVVQK 80 (334) T ss_pred CCCCcCCCccccccccccchheehhhhhhhHHHHHHHHhhhhhccceeeeccccceEEEeeecceeeeeecCCCCCCCCC Confidence 9999 78999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccceeEeecceeeccchhhhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccccccccccc Q lcl|Aclame:pro 79 TQADKNQLVIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKGHGFS 158 (402) Q Consensus 79 ~~~~e~~itID~~lya~~~IddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~g~~~~ 158 (402) ++++|++|+||++||++++||||||||+||| 
+|+||++|+||+||+++||+|+++++||+++.+|.+..+.+++|++.. T Consensus 81 ~~~~~~~l~ID~~l~~~~~VddiD~~q~~~D-~rse~~~~~G~aLA~~~D~~~~~~l~kaa~~~~~~~~~~~~~~G~~~~ 159 (334) T protein:vir:80 81 NVSDKLNLTVDTVLYARHFFDKFDEWTSNLD-VRKETAREDGIALARQYDQACIIQLQKCGDFLAPAHLKPAFHDGILLP 159 (334) T ss_pred cccCceEEEEeeeeehhhhHhhHHHHhcCcc-hHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccccCCccee Confidence 9999999999999999999999999999999 899999999999999999999999999999999999999999998888 Q ss_pred cccccCCccccccHHHHHHHHHHHHHHHHhhcCCc---cCcEEEeChHHHHHHhcccchhhcccccccC-cccccceEEE Q lcl|Aclame:pro 159 INVNVTESEALANPQYVMAAVEYALEQQLEQEVDI---SDVAIMMPWKFFNALRDADRIVDKTYTISQS-GATINGFVLS 234 (402) Q Consensus 159 ~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~---~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~-g~~~~G~V~~ 234 (402) ..++...+..+++++.|+++++++++.|+|+|||+ .|||+||+|++||+||+++||+|+||+.+++ ..+.+|.|++ T Consensus 160 ~~~~g~~~~~~~~~~~l~~a~~~a~~~L~e~dvp~~~~~~R~~vv~P~~y~~Ll~~~r~~n~d~~~s~~~~~~~~g~i~~ 239 (334) T protein:vir:80 160 STISGLAADAAADADVLVAAHRQGVEAMVFRDLGDQLMSEGVTLLDPVIFSFLLEHDRLMNVEFGAKEGGNSFVGGRIAM 239 (334) T ss_pred ecccccccchhhhHHHHHHHHHHHHHHHHhcCCCCCcCCceEEEeChHHHHHHhcccccccceeccccccccccceeEEE Confidence 87777777788999999999999999999999994 6799999999999999999999999987654 4578999999 Q ss_pred EeccEEEecCccccccCccccccccccCCcccc-ceeeeccceeEEeecHHHhhhhhhcccceeeccchhHHHHHHHHHH Q lcl|Aclame:pro 235 SYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRY-DPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKEKTYYIDTFM 313 (402) Q Consensus 235 iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~-~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~~~~d~i~~~~ 313 (402) ++||+||+|||+|+.+ ++.|.+ |..| .|++||+++++++||++|++++|++++++|.||+++||+|+|+++| T Consensus 240 v~G~~V~~Sn~~P~~~--~t~~~~-----g~~~~~~agd~t~~~~~~~~~~Al~t~~~~~~~~e~~~~~~~~~d~i~~~~ 312 (334) T protein:vir:80 240 LNGVRVVETPRFPQSA--ITANAL-----GADFNVTDAEVRRKMITFIPSMALISAQVHPVSAQFWEEKKDFGHYLDTFQ 312 (334) T ss_pred EeceEEEeecCCCCcc--cccccc-----ccccccccccccceEEEEEeCceEEEEEEeecceeeeechhhHHHHHHHHH Confidence 9999999999999865 444433 4444 
5778999999999999999999999999999999999999999999 Q ss_pred HhcCcccccceEEEEEEeeccC Q lcl|Aclame:pro 314 AEGAIPDRWEAVSVVTTKRDAT 335 (402) Q Consensus 314 a~Ga~vlRPeaa~vv~~~~~~t 335 (402) +||||++||||+++|+++.... T Consensus 313 a~G~g~lRPeaa~vv~~~~~~~ 334 (334) T protein:vir:80 313 SYNIGQRRPDAVAVHDITVTNP 334 (334) T ss_pred HcCCceeccceEEEEEEeeecC Confidence 9999999999999999987653 No 8 >protein:vir:10450 Length: 344 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848297;genbank:gi:30387487;genbank:GeneID:1733971 Probab=100.00 E-value=1e-109 Score=618.12 Aligned_cols=325 Identities=15% Similarity=0.118 Sum_probs=285.3 Q ss_pred CC------CCcccccccccccccHHHHHHHHHhHHHHHHHHHHhhhcccceeeeccccceEEeeeccceeeeeecCCCCC Q lcl|Aclame:pro 1 MS------TPNTLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYLGETELQVLAPGQSP 74 (402) Q Consensus 1 Ms------~~n~~t~~~~~~~~d~~alfle~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~~iG~~t~~~~~~G~~i 74 (402) |+ .+|++++|+|++++++++||||+|+|||+++|+|+|+|+++|++|+|++|||+|||++|+++++||+||++| T Consensus 1 ma~~~~~~~~n~~~~~~~~~~~~~~al~ie~~~geV~~~f~~~s~~~~~~~~r~i~~g~s~~~~~iG~~~~~~~~~G~~l 80 (344) T protein:vir:10 1 MANMTGGQQLGTNQGKDVMAAGDKLALFLKVFGGEVLTAFARTSVTTSRHMVRSISSGKSAQFPVLGRTQAAYLAPGENL 80 (344) T ss_pred CccccccccCCcccCCccCCccchhHHHHHHHHHHHHHHHHHHhhhcccceeeeecccceEEEEeeceeEEEeeecCCCC Confidence 44 448999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCC--CccccceeEeecceeeccchhhhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccccc Q lcl|Aclame:pro 75 NAT--PTQADKNQLVIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRV 152 (402) Q Consensus 75 ~~~--~~~~~e~~itID~~lya~~~IddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~ 152 (402) ++. .++++|++|+||++||++++|||||+||+||| +|+|+++++||+||+++||+|++++++++.+..|....... 
T Consensus 81 ~~t~~~~~~~e~~l~ID~~~y~~~~VdDiD~~q~~~D-~r~~~~~~~G~aLA~~~D~~i~~~la~~a~~~~~~~~~~~g- 158 (344) T protein:vir:10 81 DDIRKDIKHTEKVITIDGLLTADVLIYDIEDAMNHYD-VRSEYTSQLGESLAMAADGAVLAEIAGLCNVESQYNENITG- 158 (344) T ss_pred CCCCCCcccceEEEEEcchhhhhhhhhhHHHHhcCcc-hHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccc- Confidence 874 68999999999999999999999999999999 89999999999999999999999999999887776554332 Q ss_pred cccccccccccC-----CccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccccCccc Q lcl|Aclame:pro 153 KGHGFSINVNVT-----ESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGAT 227 (402) Q Consensus 153 ~g~~~~~~v~~~-----~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~~ 227 (402) .+.+.++... ....+.+++++|++|++++++|||+|||++|||+||+||+|++|+++++|++.+|+ +++.+ T Consensus 159 --~~~~~~~~~~~~~~~~t~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~~~~~~--~~~~~ 234 (344) T protein:vir:10 159 --LGTATVIETTQDKTTLTDQVALGKEIIAALTKARAALTKNYVPSSDRVFYCDPDSYSAILAALMPNAANYA--ALIDP 234 (344) T ss_pred --ccccceeecccccccccchhhhHHHHHHHHHHHHHHHhhcCCCccCCEEEeChHHHHHHhhcccccccccc--cccce Confidence 2222222211 11233456789999999999999999999999999999999999999999999986 44457 Q ss_pred ccceEEEEeccEEEecCccccccCccccccccccCCc------cccceeeeccceeEEeecHHHhhhhhhcccceeeccc Q lcl|Aclame:pro 228 INGFVLSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNG------YRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYE 301 (402) Q Consensus 228 ~~G~V~~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G------~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d 301 (402) .+|.|++++||+||+|||||.+. ++.|.+..+++. 
..+.+..+++++|||+|||+|+++++++++++|.||+ T Consensus 235 ~~G~V~~v~G~~V~~Sn~lp~~~--~~~~~~~~tg~~~~~~~~~~~~~~~~~s~~~~l~~h~~A~~~v~~~~~~~e~~r~ 312 (344) T protein:vir:10 235 EKGSIRNVMGFEVVEVPHLTAGG--AGTSREGTTGQKHAFPATKSGNDKVAKDNVIGLFMHRSAVGTVKLRDLALERARR 312 (344) T ss_pred eeeEEEEEeceEEEecccccccc--CCcccccccCccccccCCcccceeeecceeEEEeechhhhhhhhhccceeecccc Confidence 89999999999999999999753 333444333333 3345667899999999999999999999999999999 Q ss_pred hhHHHHHHHHHHHhcCcccccceEEEEEEeec Q lcl|Aclame:pro 302 KKEKTYYIDTFMAEGAIPDRWEAVSVVTTKRD 333 (402) Q Consensus 302 ~~~~~d~i~~~~a~Ga~vlRPeaa~vv~~~~~ 333 (402) +++|+|+|+++|+||||++||||+++|+++.. T Consensus 313 ~~~~~d~i~g~~~~G~~vlRPe~a~~v~~~~~ 344 (344) T protein:vir:10 313 ANFQADQIIAKYAMGHGGLRPEAAGAVVFKTK 344 (344) T ss_pred hhHHHHHHHHHhhcccceecccceEEEEeecC Confidence 99999999999999999999999999999877 No 9 >protein:vir:100057 Length: 375 # NCBI annotation: T7-like capsid protein # Family: family:all:975 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214206;genbank:gi:61806429;genbank:GeneID:3294737 Probab=100.00 E-value=2.2e-110 Score=621.75 Aligned_cols=333 Identities=14% Similarity=0.096 Sum_probs=293.0 Q ss_pred CCCCcccccccccccccHHHHHHHHHhHHHHHHHHHHhhhcccceeeeccccceEEeeeccceeeeeecCCCCCCCC--- Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYLGETELQVLAPGQSPNAT--- 77 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~alfle~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~~iG~~t~~~~~~G~~i~~~--- 77 (402) |+-+|.+|||+|++++++++||||+|+|||+++|+|+|++++++++|+|++|||+||++||+++++||+||++|+++ T Consensus 9 ~~~~n~~t~~~~~~~~~~~al~le~f~geV~~~f~~~si~~~~~~~rti~~Gksv~f~~iG~~t~~~~t~G~~i~~~~~~ 88 (375) T protein:vir:10 9 LGRSNLSTGTGYGGATDKYALYLKLFSGEMFKGFQHETIARDLVTKRTLKNGKSLQFIYTGRMTSSFHTPGTPILGNADK 88 (375) T ss_pred cCccccCCccccccccchHHHHHHHHhHHHHHHHHHHHhhhccccccccccCceEEEEeeeeeEEeeecCCcCcCCcccc Confidence 66677888999999999999999999999999999999999999999999999999999999999999999999887 Q ss_pred 
CccccceeEeecceeeccchhhhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccccccccccc Q lcl|Aclame:pro 78 PTQADKNQLVIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKGHGF 157 (402) Q Consensus 78 ~~~~~e~~itID~~lya~~~IddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~g~~~ 157 (402) .+++++++|+||++||++|+|||||+||+||| +|+|+++|+||+||+++||+|+++++|||++.+|....+...+|+.. T Consensus 89 d~~~te~~l~ID~~~y~~~~VdDiD~aqa~~D-lr~e~s~~~G~aLA~~~D~~i~~~l~kaa~~~~p~~~~~~~~~Gg~~ 167 (375) T protein:vir:10 89 APPVAEKTIVMDDLLISSAFVYDLDETLAHYE-LRGEISKKIGYALAEKYDRLIFRSITRGARSASPVSATNFVEPGGTQ 167 (375) T ss_pred CCCCCceEEEecchhhhhhhHhhHHHHhcCch-hHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccccCcce Confidence 46789999999999999999999999999999 89999999999999999999999999999999998877766666544 Q ss_pred ccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcc---cchhhcccccccCcccccceEEE Q lcl|Aclame:pro 158 SINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDA---DRIVDKTYTISQSGATINGFVLS 234 (402) Q Consensus 158 ~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~---~r~~n~d~~~~~~g~~~~G~V~~ 234 (402) ....+......+.+++++|++|++++++|||++||++|||+||+||+|++||++ ++|+|+||+ +++.+.+|.|++ T Consensus 168 i~~~sg~~~~~~~ta~~~~~ai~~a~~~Lde~~VP~~~R~~vv~P~~y~~Ll~~~d~~~~~n~d~~--~~~~~~~g~v~~ 245 (375) T protein:vir:10 168 IRVGSGTNESDAFTASALVNAFYDAAAAMDEKGVSSQGRCAVLNPRQYYALIQDIGSNGLVNRDVQ--GSALQSGNGVIE 245 (375) T ss_pred eeeccccccccccCHHHHHHHHHHHHHHHhhcCCCCCCCEEEeChHHHHHHHhcCCccceeeeccc--ccceeccceEEE Confidence 444444555667889999999999999999999999999999999999999986 789999985 566778999999 Q ss_pred EeccEEEecCccccccCccccccc----------------------cccCCccccceeeec---cceeEEeecHHHhhhh Q lcl|Aclame:pro 235 SYNCPVIPSNRFPTFAQDQAHHLL----------------------SNEDNGYRYDPIAEM---NGAVAVLFTSDALLVG 289 (402) Q Consensus 235 iaG~~V~~SNnlP~~~~~~t~~~l----------------------s~a~~G~~~~~~ad~---~~~~al~fh~~Av~tv 289 (402) ++||+||+|||+|+.+. 
++|.+ +....|...+|.++| +|++||+|||+|+++| T Consensus 246 i~Gv~V~~Sn~lP~~~~--~~~~~g~~~~~~a~~~~~~~~~~~~~~~~~~~g~~~~y~~d~~~~~~~~~~~~~~~A~g~v 323 (375) T protein:vir:10 246 IAGIHIYKSMNIPFLGK--YGVKYGGTTGETSPGNLGSHIGPTPENANATGGVNNDYGTNAELGAKSCGLIFQKEAAGVV 323 (375) T ss_pred EeceEEEEecccccccc--ccccccccccccchhhhhccccccCCcceeeccccccccccccccCceEEEEEchhheeee Confidence 99999999999998652 22221 012334455677777 9999999999999999 Q ss_pred hhcccceeec---cchhHHHHHHHHHHHhcCcccccceEEEEEEeeccCccccc Q lcl|Aclame:pro 290 RTIEVTGDIF---YEKKEKTYYIDTFMAEGAIPDRWEAVSVVTTKRDATTGDAG 340 (402) Q Consensus 290 ~~~dl~~e~~---~d~~~~~d~i~~~~a~Ga~vlRPeaa~vv~~~~~~t~~~a~ 340 (402) |++++++|.+ |+.++|+|+|+++|+|||++||||||++|++.. +...+= T Consensus 324 ~~~~~~~~~~~~~~~~~~q~~~i~~~~a~G~~~lrp~~av~l~~~~--~~~~~~ 375 (375) T protein:vir:10 324 EAIGPQVQVTNGDVSVIYQGDVILGRMAMGADYLNPAAAVELYIGA--TAPSAF 375 (375) T ss_pred eeeccccccccchhhheeeeeeeeeeeeeccCccCceeEEEEecCc--CccccC Confidence 9999999998 699999999999999999999999999986653 111111 No 10 >protein:vir:2201 Length: 345 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041998;swissprot:sw:p19726;genbank:gi:9627469;goa:P19726;uniprot:P19726;genbank:GeneID:1261026 Probab=100.00 E-value=3.9e-109 Score=614.93 Aligned_cols=326 Identities=16% Similarity=0.128 Sum_probs=283.1 Q ss_pred CCCCcccccccccccccHHHHHHHHHhHHHHHHHHHHhhhcccceeeeccccceEEeeeccceeeeeecCCCCCCCC--C Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYLGETELQVLAPGQSPNAT--P 78 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~alfle~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~~iG~~t~~~~~~G~~i~~~--~ 78 (402) =+.+|+++||+|++++++++||||+|+|||+++|+|+|+|+++|++|+|++|||+|||++|+++++||+||++|+++ . 
T Consensus 7 ~~~~~~~~~~~~~~~~~~~al~le~f~geV~~~f~~~s~~~~~~~~r~i~~gks~~~~~iG~~~~~~~~~G~~l~~~~~~ 86 (345) T protein:vir:22 7 GQQMGTNQGKGVVAAGDKLALFLKVFGGEVLTAFARTSVTTSRHMVRSISSGKSAQFPVLGRTQAAYLAPGENLDDKRKD 86 (345) T ss_pred chhcccccccccccCCchhHHHHHHHhHHHHHHHHHHhhhcccceeeeccccceEEEeeecceEEEeeecCCCCCCCCCC Confidence 13568899999999999999999999999999999999999999999999999999999999999999999999886 4 Q ss_pred ccccceeEeecceeeccchhhhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccccccccccc Q lcl|Aclame:pro 79 TQADKNQLVIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKGHGFS 158 (402) Q Consensus 79 ~~~~e~~itID~~lya~~~IddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~g~~~~ 158 (402) ++++|++|+||+++|++++||||||||+||| +|+|+++|+||+||+++||+|+++++|++++.+|.... +...+.+.. T Consensus 87 ~~~~e~~ltID~~~y~~~~VddiD~~q~~~D-~r~~~s~~~G~aLA~~~D~~i~~~l~k~a~~~~~~~~~-~~~~~~~~~ 164 (345) T protein:vir:22 87 IKHTEKVITIDGLLTADVLIYDIEDAMNHYD-VRSEYTSQLGESLAMAADGAVLAEIAGLCNVESKYNEN-IEGLGTATV 164 (345) T ss_pred cccceEEEEecchhhhhhhHhhHHHHhcCch-hHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc-ccccccccc Confidence 8899999999999999999999999999999 89999999999999999999999999999987766433 333333333 Q ss_pred cccccCCcc---ccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccccCcccccceEEEE Q lcl|Aclame:pro 159 INVNVTESE---ALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATINGFVLSS 235 (402) Q Consensus 159 ~~v~~~~a~---~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~~~~G~V~~i 235 (402) ..++..+.. 
...++.++|++|++|+++|||+|||.+|||+||+||+|++|+++++|++.+|+ +++.+.+|.|+++ T Consensus 165 ~~~~~~g~~~t~~~~~~~~~~~ai~~a~~~Lde~~VP~~~R~~vv~P~~y~~Ll~~~~~~~~~~~--~~~~~~~G~V~~i 242 (345) T protein:vir:22 165 IETTQNKAALTDQVALGKEIIAALTKARAALTKNYVPAADRVFYCDPDSYSAILAALMPNAANYA--ALIDPEKGSIRNV 242 (345) T ss_pred cccccccccccccccCHHHHHHHHHHHHHHhhhcCCCccCCEEEeChHHHHHHhccccccccccc--cccccccceEEEE Confidence 332222221 23456789999999999999999999999999999999999999999999986 3444679999999 Q ss_pred eccEEEecCccccccCc--------cccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccchhHHHH Q lcl|Aclame:pro 236 YNCPVIPSNRFPTFAQD--------QAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKEKTY 307 (402) Q Consensus 236 aG~~V~~SNnlP~~~~~--------~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~~~~d 307 (402) +||+||+|||+|..... ..++.+ .+.|+.+ +..+.++++||+|||+|+++||++++++|.||++++|+| T Consensus 243 ~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~--~~~g~~~-~~~~~~~~~~l~~h~~A~~~v~~~~~~~e~~r~~~~~~d 319 (345) T protein:vir:22 243 MGFEVVEVPHLTAGGAGTAREGTTGQKHVFP--ANKGEGN-VKVAKDNVIGLFMHRSAVGTVKLRDLALERARRANFQAD 319 (345) T ss_pred eceEEEecccccccccCccccCccccccccc--cccccee-eeeccCceEEEEEehhheeeeeeecceeeeeechhHHHH Confidence 99999999999964322 111222 2334443 445568999999999999999999999999999999999 Q ss_pred HHHHHHHhcCcccccceEEEEEEeec Q lcl|Aclame:pro 308 YIDTFMAEGAIPDRWEAVSVVTTKRD 333 (402) Q Consensus 308 ~i~~~~a~Ga~vlRPeaa~vv~~~~~ 333 (402) +|+++|+||||++||||+++|++|.. 
T Consensus 320 ~I~~~~a~G~~vlRPeaa~~i~~~~~ 345 (345) T protein:vir:22 320 QIIAKYAMGHGGLRPEAAGAVVFKVE 345 (345) T ss_pred HHHHHHhcCCcccccceeEEEEEeeC Confidence 99999999999999999999999988 No 11 >protein:vir:94576 Length: 347 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919012;genbank:gi:119637776;genbank:GeneID:5179336 Probab=100.00 E-value=6.8e-105 Score=591.67 Aligned_cols=329 Identities=15% Similarity=0.121 Sum_probs=278.7 Q ss_pred CCCCc----ccccccccc-cccHHHHHHHHHhHHHHHHHHHHhhhcccceeeeccccceEEeeeccceeeeeecCCCCCC Q lcl|Aclame:pro 1 MSTPN----TLTNVAVSA-SGEVDSLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYLGETELQVLAPGQSPN 75 (402) Q Consensus 1 Ms~~n----~~t~~~~~~-~~d~~alfle~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~~iG~~t~~~~~~G~~i~ 75 (402) |++-. -.||++|+| ++|+++||||+|+|||+++|+|+|+|++++++|+|++|||+|||+||+++++||+||++|+ T Consensus 1 ma~~~~~~~~~t~~g~~~~~~d~~al~ie~~~geV~~~f~~~s~~~~~~~~rti~~G~sv~~~~iG~~~~~~~~~G~~l~ 80 (347) T protein:vir:94 1 MANMNGGQQMGKDQGKGMSAGDKLALFLKVFGGEVLTAFTRTSVTMNKHLVRSIQSGKSAQFPVLGRTKAAYLQPGENLD 80 (347) T ss_pred CCccccccccccccccCCcccchHHHHHHHHhHHHHHHHHHHHhhhhhhhheeccccceEEeeeccceeEeeeecCcCCC Confidence 77553 258999985 5799999999999999999999999999999999999999999999999999999999998 Q ss_pred C--CCccccceeEeecceeeccchhhhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccccccc Q lcl|Aclame:pro 76 A--TPTQADKNQLVIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVK 153 (402) Q Consensus 76 ~--~~~~~~e~~itID~~lya~~~IddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~ 153 (402) + +.++++|++|+||+++|++++|||||+||+||| +|+||++++||+||+++||+|++++++++.+..+ ..+++... 
T Consensus 81 ~~~~~~~~~e~~ltID~~~y~~~~VddiD~~q~~~D-~rs~~~~~~g~ALA~~~D~~i~~~l~~~a~~~~~-~~~~~~g~ 158 (347) T protein:vir:94 81 DKRKDMKHTEKTINIDGLLTADVLIYDIEDAMNHYD-VRSEYTAQLGESLAMAADGAVLAEMAKLCNLPTA-NNENIAGL 158 (347) T ss_pred CCcCCccccceEEEEcchhhhhhhhhhHHHHhcCcc-hHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccc-cccccccC Confidence 7 468999999999999999999999999999999 8999999999999999999999999999987554 33333332 Q ss_pred cccccccccc---CCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccccCcccccc Q lcl|Aclame:pro 154 GHGFSINVNV---TESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATING 230 (402) Q Consensus 154 g~~~~~~v~~---~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~~~~G 230 (402) +++....+.. ...+.+.+++++|++|+++.++|||+|||++|||+||+|++|+.||+..++...++... ..+.+| T Consensus 159 ~~~~~v~i~~~~~~~~~~~~~~~~~~d~i~~a~~~Lde~dVP~~~R~~vv~P~~y~~LLk~~~~~~~~~~~~--~~~~~G 236 (347) T protein:vir:94 159 GKAHVLEVGDQATLQGDQVKLGQAIIAQLTLARAKLTGNYVPSSDRVFYTTPDNYSAILAALMPNAANYQAL--IDPSTG 236 (347) T ss_pred CcceeEeeeccccccccccccHHHHHHHHHHHHHHhhhcCCCCCCCEEEeChHHHHHHHHhhcccccccccc--cccccc Confidence 2222222211 11223457889999999999999999999999999999999999999877776666432 346789 Q ss_pred eEEEEeccEEEecCccccccCcccccc--ccccC------CccccceeeeccceeEEeecHHHhhhhhhcccceeeccch Q lcl|Aclame:pro 231 FVLSSYNCPVIPSNRFPTFAQDQAHHL--LSNED------NGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEK 302 (402) Q Consensus 231 ~V~~iaG~~V~~SNnlP~~~~~~t~~~--ls~a~------~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~ 302 (402) .|++++||+||+|||+|.......... 
.+.++ .+...+|+.+|+++++|+|||+|+++||++++++|.||++ T Consensus 237 ~V~~v~G~~V~~Sn~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~d~~~~~~l~~~~~A~~tv~~~~~~~e~~~~~ 316 (347) T protein:vir:94 237 SIRNVMGFEVIEVPHLTAGGAGDNRAEEGVAPTNQKHAFPDTASGDTRVALDNVVGLFNHRSAVGTVKLKDMALERARRA 316 (347) T ss_pred eeEEeeceEEEEcCccccccCcccccccccccccccccccccccccccccccceEEEEechhhhhhhhhcccceeeeech Confidence 999999999999999998653221111 11111 2334578899999999999999999999999999999999 Q ss_pred hHHHHHHHHHHHhcCcccccceEEEEEEeec Q lcl|Aclame:pro 303 KEKTYYIDTFMAEGAIPDRWEAVSVVTTKRD 333 (402) Q Consensus 303 ~~~~d~i~~~~a~Ga~vlRPeaa~vv~~~~~ 333 (402) ++|+|+|+++|+||||++|||||++|.++.- T Consensus 317 ~~~~~~i~~~~a~G~g~~rPe~a~~i~~~~a 347 (347) T protein:vir:94 317 NFQADQIIAKYAMGHGGLRPEACGALVFKKA 347 (347) T ss_pred hhhhhhhhhhhhhcCcccccceeEEEEecCC Confidence 9999999999999999999999999988755 No 12 >protein:vir:94711 Length: 347 # NCBI annotation: capsid # Family: family:all:975 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338120;genbank:gi:77118198;genbank:GeneID:3707734 Probab=100.00 E-value=2.4e-101 Score=572.20 Aligned_cols=328 Identities=14% Similarity=0.095 Sum_probs=277.4 Q ss_pred CCCCc---cccccccccc-ccHHHHHHHHHhHHHHHHHHHHhhhcccceeeeccccceEEeeeccceeeeeecCCCCCCC Q lcl|Aclame:pro 1 MSTPN---TLTNVAVSAS-GEVDSLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYLGETELQVLAPGQSPNA 76 (402) Q Consensus 1 Ms~~n---~~t~~~~~~~-~d~~alfle~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~~iG~~t~~~~~~G~~i~~ 76 (402) |++.+ ..|+|||+++ +|.++||||+|.|||+++|+|+|+|++++++|+|++|||+|||+||+++++||+||++|++ T Consensus 1 m~~~~~~~~~t~~g~~~~~~d~~al~ik~f~~eV~~~f~~~s~~~~~~~~r~i~~G~sv~i~~iG~~tv~~~t~G~~l~~ 80 (347) T protein:vir:94 1 MANVPGQKIGTDQGKGKSSSDALALFLKVFAGEVLTAFTRRSVTADKHIVRTIQNGKSAQFPVMGRTSGVYLAPGERLSD 80 (347) T ss_pred CCCCCccccccccccCCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccccceEEEecccceeeeeecCCCCcCC Confidence 66664 3589999876 5779999999999999999999999999999999999999999999999999999999976 Q ss_pred 
C--CccccceeEeecceeeccchhhhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccccccc Q lcl|Aclame:pro 77 T--PTQADKNQLVIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKG 154 (402) Q Consensus 77 ~--~~~~~e~~itID~~lya~~~IddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~g 154 (402) + .++++|++|+||+++|++++|||||+||+||| +|+||++++|++||+++|++|++++.+.+.+..+.... ..| T Consensus 81 ~~~~~~~~e~~itID~~~~~~~~VddiD~~q~~~D-~~~~~~~~~g~aLa~~~D~~i~~~~~~~aa~~~~~~~~---~~g 156 (347) T protein:vir:94 81 KRKGIKHTEKVITIDGLLTADVMIFDIEDAMNHYD-VAGEYSNQLGEALAIAADGAVLAEMAILCNLPAASNEN---IAG 156 (347) T ss_pred CCCCCCcceEEEEecchhhhhHHhhhHHHHhcCcc-hHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccc---cCC Confidence 4 68999999999999999999999999999999 89999999999999999999999998766443332211 223 Q ss_pred cccccccccCCccc----cccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccccCcccccc Q lcl|Aclame:pro 155 HGFSINVNVTESEA----LANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATING 230 (402) Q Consensus 155 ~~~~~~v~~~~a~~----~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~~~~G 230 (402) .+.+.++....... ..+++++|++|++++++|||+|||++|||+||+||+|++||++.+|.+.+|.. ++.+.+| T Consensus 157 ~~~~s~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~~R~~vv~P~~~~~Ll~~~~~~~~~~~~--~~~~~~G 234 (347) T protein:vir:94 157 LGTASVLEVGKKADLDTPAKLGEAIIGQLTIARAKLTSNYVPAGDRYFYTTPDNYSAILAALMPNAANYAA--LIDPETG 234 (347) T ss_pred CcccceeeccccccccchhhhHHHHHHHHHHHHHHHhhcCCCCCCcEEEeCHHHHHHHhccchhhhhhccc--ccccccc Confidence 33344443333333 34568899999999999999999999999999999999999999999888864 3456789 Q ss_pred eEEEEeccEEEecCccccccCccccc---cccccCCcc------ccceeeeccceeEEeecHHHhhhhhhcccceeeccc Q lcl|Aclame:pro 231 FVLSSYNCPVIPSNRFPTFAQDQAHH---LLSNEDNGY------RYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYE 301 (402) Q Consensus 231 ~V~~iaG~~V~~SNnlP~~~~~~t~~---~ls~a~~G~------~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d 301 (402) .|++++||+||+|||||+.+...+.. 
....+|+.+ ..+|.++|+++++|+|||+|+++||++++++|.||+ T Consensus 235 ~Vg~i~G~~V~~Sn~lp~~~~t~~~~~~~~~~~aG~~~~~~~~~~~~~~~~~~~~~~l~~h~~A~~~v~~~~~~~e~~r~ 314 (347) T protein:vir:94 235 NIRNVMGFVVVEVPHLVQGGAGETRGDDGITIASGQKHAFPATASSDVKVTMDNVVGLFSHRSAVGTVKLRDLALERDRD 314 (347) T ss_pred ceEEEeceEEEecCcccccccccccccCcceecCcccccccccchhhhcccccceeEEEeehhhhhhhhcccccccchhc Confidence 99999999999999999754321111 111222222 236889999999999999999999999999999999 Q ss_pred hhHHHHHHHHHHHhcCcccccceEEEEEEeecc Q lcl|Aclame:pro 302 KKEKTYYIDTFMAEGAIPDRWEAVSVVTTKRDA 334 (402) Q Consensus 302 ~~~~~d~i~~~~a~Ga~vlRPeaa~vv~~~~~~ 334 (402) +++|+|+|+++|+||||++||||+++|++..-+ T Consensus 315 ~~~~~d~i~~~~~~G~~~~rP~~a~~~~~~~A~ 347 (347) T protein:vir:94 315 VDAQGDLIVGKYAMGHGGLRPEAAGALVFSPAE 347 (347) T ss_pred hhhHHHHhhhhhhhcCcccccceeEEEEecCCC Confidence 999999999999999999999999999887333 No 13 >protein:vir:8885 Length: 347 # NCBI annotation: major capsid protein A # Family: family:all:975 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813774;genbank:gi:29366729;genbank:GeneID:1258837 Probab=100.00 E-value=5.8e-101 Score=570.13 Aligned_cols=326 Identities=15% Similarity=0.113 Sum_probs=283.3 Q ss_pred CCCC---c-cccccccccc-ccHHHHHHHHHhHHHHHHHHHHhhhcccceeeeccccceEEeeeccceeeeeecCCCCCC Q lcl|Aclame:pro 1 MSTP---N-TLTNVAVSAS-GEVDSLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYLGETELQVLAPGQSPN 75 (402) Q Consensus 1 Ms~~---n-~~t~~~~~~~-~d~~alfle~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~~iG~~t~~~~~~G~~i~ 75 (402) |++. 
+ ..+|+||+++ +|+++||||+|+|||+++|+|+|+|++++++|+|++|||+|||+||+++++||+||++|+ T Consensus 1 ~a~~~~~~~~~~~~g~~~~~~d~~al~ie~~~geV~~~f~~~s~~~~~~~~r~i~~G~sv~~~~iG~~~~~~~~~g~~l~ 80 (347) T protein:vir:88 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVMGRTKGYYLAPGENLD 80 (347) T ss_pred CCCcccchhhhccCCCCccccchHHHHHHHHHHHHHHHHHHHhhhhhccccccccCcceEEEeeecceeeeeeccccCCC Confidence 7754 3 3489999876 577999999999999999999999999999999999999999999999999999999998 Q ss_pred C--CCccccceeEeecceeeccchhhhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccccccc Q lcl|Aclame:pro 76 A--TPTQADKNQLVIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVK 153 (402) Q Consensus 76 ~--~~~~~~e~~itID~~lya~~~IddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~ 153 (402) + +.++++|++|+||+++|++++|||+|+||+||| +|+|+++++|++||+++|++|+++++++++...+ ...+.. T Consensus 81 ~~~~~~~~~~~~i~ID~~~y~~~~Vdd~D~~q~~~D-~r~~~~~~~g~aLA~~~D~~i~~~l~~~a~~~~~---~~~~~~ 156 (347) T protein:vir:88 81 DKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYD-VRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAA---SNENIA 156 (347) T ss_pred CCCCCCccceEEEEEechhhhhhhhhhHHHHhhcCC-chHHHHHHHHHHHHHHHHHHHHHHHHHhhccccc---cccccC Confidence 6 468999999999999999999999999999999 8999999999999999999999999999876443 234444 Q ss_pred ccccccccccCCccc----cccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccccCccccc Q lcl|Aclame:pro 154 GHGFSINVNVTESEA----LANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATIN 229 (402) Q Consensus 154 g~~~~~~v~~~~a~~----~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~~~~ 229 (402) |++.+..+..+.+.. ..+++.+|++|++++++|||++||.+|||+||+|++|++||+++++.+.+|.. 
.+.+.+ T Consensus 157 g~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~~~~~~~--~~~~~~ 234 (347) T protein:vir:88 157 GLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAA--LIDPET 234 (347) T ss_pred CccccccccccccccccchhhhHHHHHHHHHHHHHHHhhcCCCCCCCEEEeCHHHHHHHhcchhhhhhhhcc--ccchhc Confidence 555555444443332 34567889999999999999999999999999999999999999999888863 345678 Q ss_pred ceEEEEeccEEEecCccccccCccccccccc----------cCCccccceeeeccceeEEeecHHHhhhhhhcccceeec Q lcl|Aclame:pro 230 GFVLSSYNCPVIPSNRFPTFAQDQAHHLLSN----------EDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIF 299 (402) Q Consensus 230 G~V~~iaG~~V~~SNnlP~~~~~~t~~~ls~----------a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~ 299 (402) |.|++++||+||+|||+|.+..+. +.... ...+...+|+.+++++++|+||++|+++||+||+++|.+ T Consensus 235 G~vg~i~G~~V~~s~nlp~~~~~~--~~~~~~~~~t~~~~~~~~~~~~~~~~d~~~~~~l~~~~~a~g~v~~~d~~~e~~ 312 (347) T protein:vir:88 235 GNIRNVMGFEVIEVPHLTVGGAGD--NNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERA 312 (347) T ss_pred ceeeeeccceEEEeeccccccccc--ccccccccccccccccccccccccccccCcEEEEEechhhhhheecccceeeee Confidence 999999999999999999754321 22111 134456689999999999999999999999999999999 Q ss_pred cchhHHHHHHHHHHHhcCcccccceEEEEEEeecc Q lcl|Aclame:pro 300 YEKKEKTYYIDTFMAEGAIPDRWEAVSVVTTKRDA 334 (402) Q Consensus 300 ~d~~~~~d~i~~~~a~Ga~vlRPeaa~vv~~~~~~ 334 (402) |+++||+|+|+++++||||++|||||++|++...+ T Consensus 313 r~~~~~~d~i~~~~~~G~~~~rPe~a~~~~~~~a~ 347 (347) T protein:vir:88 313 RRPEFQADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) T ss_pred echhhHHHHhhhhhhhcCceeccceEEEEEeCCCC Confidence 99999999999999999999999999999887655 No 14 >protein:vir:78739 Length: 332 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285448;genbank:gi:148724482;genbank:GeneID:5220210 Probab=100.00 E-value=8.4e-101 Score=569.25 Aligned_cols=318 Identities=13% Similarity=0.121 Sum_probs=273.1 Q ss_pred 
CCCCccccccc-----c-cccccH-HHHHHHHHhHHHHHHHHHHhhhcccceeeeccccceEEeeeccceeeeeecCCCC Q lcl|Aclame:pro 1 MSTPNTLTNVA-----V-SASGEV-DSLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYLGETELQVLAPGQS 73 (402) Q Consensus 1 Ms~~n~~t~~~-----~-~~~~d~-~alfle~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~~iG~~t~~~~~~G~~ 73 (402) |++.++.++|+ | +.++|. ++||||+|+|||+++|+|+|+|++++++|+|++|||++||+||+++++||+||++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~d~~~al~le~~~geV~~~f~~~s~~~~~~~~r~i~~G~tv~i~~ig~~~~~~~~~g~~ 80 (332) T protein:vir:78 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFTGKLSAGYHTPGTP 80 (332) T ss_pred CcccccccCCccccCCccccccccchhhhhhhhhhhHHHHHHHHhhhhhccccccccccceEEEEeccceeEeeecCCCC Confidence 55444444333 3 334554 5999999999999999999999999999999999999999999999999999999 Q ss_pred CCCC-CccccceeEeecceeeccchhhhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccccc Q lcl|Aclame:pro 74 PNAT-PTQADKNQLVIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRV 152 (402) Q Consensus 74 i~~~-~~~~~e~~itID~~lya~~~IddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~ 152 (402) |+++ .+++++++|+||+++|++++|||||++|+|+| +|+|+++++||+||+++|++|++++++|+++.++....+ T Consensus 81 l~~~~~~~~~~~~l~ID~~ky~~~~VddiD~~q~~~d-l~~~~~~~~g~aLA~~~D~~i~~~l~~aa~~~~~~~~~~--- 156 (332) T protein:vir:78 81 IVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYS-TRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGEP--- 156 (332) T ss_pred CCCCCCCCCceEEEEEehhhhhHHHHHhHHHHhcCcc-hHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccCcccccc--- Confidence 9987 48999999999999999999999999999999 899999999999999999999999999998776654432 Q ss_pred cccccccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhc--ccchhhcccccccCcccccc Q lcl|Aclame:pro 153 KGHGFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRD--ADRIVDKTYTISQSGATING 230 (402) Q Consensus 153 ~g~~~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~--~~r~~n~d~~~~~~g~~~~G 230 (402) ++..+. 
.+++.+++++++|++|++++++|||+|||.+|||+||+||+|++||+ |+||+|++++.+ ++...+| T Consensus 157 ----g~~~~~-~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~d~~~~n~~~~~~-~~~~~~g 230 (332) T protein:vir:78 157 ----GGFHVN-IGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNS-QGDMNSG 230 (332) T ss_pred ----cccccc-cCCccccCHHHHHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHHhhcCceeeeeecccc-ccceecc Confidence 122222 23345678999999999999999999999999999999999999998 899999999654 4455676 Q ss_pred e-EEEEeccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceee---ccchhHHH Q lcl|Aclame:pro 231 F-VLSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDI---FYEKKEKT 306 (402) Q Consensus 231 ~-V~~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~---~~d~~~~~ 306 (402) + |++++||+||+|||||+.+ ++.|..+ +..|....|.++|+++++++|||+|+++|+++++++|. +|++++|+ T Consensus 231 ~~i~~i~G~~V~~Sn~lp~~~--g~~~~~~-~~~~~~n~~~~~~~~~~~~~~h~~a~~~v~~~~~~~~~t~~~~~~~~~~ 307 (332) T protein:vir:78 231 KGLYSIAGIRILKSNNLAGLY--GQDLSSA-AVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQG 307 (332) T ss_pred eeeeEEeeeEEEecCccccCc--ccccccc-cccccccccccccccceEEeecccceeeeeeeccchhhhhcccchhhhH Confidence 5 9999999999999999755 3333322 23344456789999999999999999999999997775 77899999 Q ss_pred HHHHHHHHhcCcccccceEEEEEEe Q lcl|Aclame:pro 307 YYIDTFMAEGAIPDRWEAVSVVTTK 331 (402) Q Consensus 307 d~i~~~~a~Ga~vlRPeaa~vv~~~ 331 (402) |+|+++|+|||+++||||+++|++- T Consensus 308 d~i~~~~~~G~~v~rPe~~v~l~~a 332 (332) T protein:vir:78 308 DLIVGKLAMGCGSLRTSVAGSFQAA 332 (332) T ss_pred hhhhhhhhhcCceecccceEEEeeC Confidence 9999999999999999999999876 No 15 >protein:vir:3364 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523335;genbank:gi:17570826;genbank:GeneID:927448 Probab=100.00 E-value=1e-99 Score=563.25 Aligned_cols=330 Identities=13% Similarity=0.097 Sum_probs=280.6 Q ss_pred 
CCCC---c-cccccccccc-ccHHHHHHHHHhHHHHHHHHHHhhhcccceeeeccccceEEeeeccceeeeeecCCCCCC Q lcl|Aclame:pro 1 MSTP---N-TLTNVAVSAS-GEVDSLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYLGETELQVLAPGQSPN 75 (402) Q Consensus 1 Ms~~---n-~~t~~~~~~~-~d~~alfle~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~~iG~~t~~~~~~G~~i~ 75 (402) |++- + -.|||||+|+ +|+++||||+|+|||+++|+|+|+|++++++|+|++|||++||+||+++++||+||++|+ T Consensus 1 ~~~~~~~~~~~t~~g~~~~~~~~~al~ie~~~g~V~~~f~~~s~~~~~v~~r~~~~G~sv~i~~iG~~t~~~~~~g~~l~ 80 (347) T protein:vir:33 1 MANIQGGQQIGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTMPRHMLRSIASGKSAQFPVIGRTKAAYLKPGENLD 80 (347) T ss_pred CCCCccCcccccccccCCcccchHHHHHHHHHHHHHHHHHHHHhhhhhhccccccccceeEeeeccceeeeeecCCCCCC Confidence 6643 3 2599999865 788999999999999999999999999999999999999999999999999999999998 Q ss_pred CC--CccccceeEeecceeeccchhhhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhh-ccccccccccc Q lcl|Aclame:pro 76 AT--PTQADKNQLVIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIA-NTKAERNKPRV 152 (402) Q Consensus 76 ~~--~~~~~e~~itID~~lya~~~IddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~-~a~~~~~~~~~ 152 (402) ++ .++++|++|+||+++|++++|||||++|+||| +|+++++++|++||+++|++|+++++++... ..++.....++ T Consensus 81 ~~~~~~~~~e~~ltiD~~~y~~~~VddiD~~q~~~D-~~~~~~~~~g~aLA~~~D~~i~~~l~~~~~~~~~~~~~~~~~~ 159 (347) T protein:vir:33 81 DKRKDIKHTEKVIHIDGLLTADVLIYDIEDAMNHYD-VRAEYTAQLGESLAMAADGAVLAELAGLVNLPDGSNENIEGLG 159 (347) T ss_pred CCCCCCccceEEEEechhhhhhHHHhhHHHHhcCCc-hhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccccc Confidence 75 48999999999999999999999999999999 8999999999999999999999999876543 34444444443 Q ss_pred cccccccccccCCc--cccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccccCcccccc Q lcl|Aclame:pro 153 KGHGFSINVNVTES--EALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATING 230 (402) Q Consensus 153 ~g~~~~~~v~~~~a--~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~~~~G 230 (402) .+.+.......++. 
....+++++|++|++++++|||+|||++|||+||+||+|++||++++|++++|++ ++.+.+| T Consensus 160 ~~~~~~~~~~~tg~~~d~~~~a~~i~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~~~d~~~--~~~~~~G 237 (347) T protein:vir:33 160 KPTVLTLVKPTTGSLTDPVELGKAIIAQLTIARASLTKNYVPAADRTFYTTPDNYSAILAALMPNAANYQA--LLDPERG 237 (347) T ss_pred ccccccccccccccccchhhhHHHHHHHHHHHHHHHhhcCCCccCcEEEeCHHHHHHHhcccccccccccc--ccccccc Confidence 33333332222211 1234578999999999999999999999999999999999999999999999963 4457899 Q ss_pred eEEEEeccEEEecCccccccCcccccccc-ccCCcccc------ceeeeccceeEEeecHHHhhhhhhcccceeeccchh Q lcl|Aclame:pro 231 FVLSSYNCPVIPSNRFPTFAQDQAHHLLS-NEDNGYRY------DPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKK 303 (402) Q Consensus 231 ~V~~iaG~~V~~SNnlP~~~~~~t~~~ls-~a~~G~~~------~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~ 303 (402) .|++++||+||+|||||+.+ +++|..+ .++.++.| .++..|.++++|+||++|+++|+++++++|.+|+++ T Consensus 238 ~V~~i~G~~V~~Sn~lp~~~--~~~~~~~~~ag~~~~~~~~~~~~~~~a~~~~~gl~~h~~A~g~v~~~~~~~e~~r~~~ 315 (347) T protein:vir:33 238 TIRNVMGFEVVEVPHLTAGG--AGDTREDAPADQKHAFPATSSTTVKVALDNVVGLFQHRSAVGTVKLKDLALERARRAN 315 (347) T ss_pred eeEEEeceeEEEecccccCc--cccccccccccccccccCCcccceeccccceeeeeecchhheeeeeeceeeeeccchh Confidence 99999999999999999864 4444433 23444443 366778889999999999999999999999999999 Q ss_pred HHHHHHHHHHHhcCcccccceEEEEEEeeccC Q lcl|Aclame:pro 304 EKTYYIDTFMAEGAIPDRWEAVSVVTTKRDAT 335 (402) Q Consensus 304 ~~~d~i~~~~a~Ga~vlRPeaa~vv~~~~~~t 335 (402) +|+|+|+++|+|||+++||||+++|+++.=.- T Consensus 316 ~~~d~i~~~~~~G~~vlrP~~av~i~~~~~~~ 347 (347) T protein:vir:33 316 YQADQIIAKYAMGHGGLRPEAAGAIVLPKVSE 347 (347) T ss_pred hhhHhhhhhhhcCCceecccceEEEecCCCCC Confidence 99999999999999999999999998865221 No 16 >protein:vir:1541 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052109;swissprot:trembl:q9t107;genbank:gi:9634035;uniprot:Q9T107;genbank:GeneID:1262383 Probab=100.00 E-value=5.3e-98 Score=553.90 Aligned_cols=328 Identities=12% 
Similarity=0.090 Sum_probs=277.5 Q ss_pred CCCCcccccccccc-cccHHHHHHHHHhHHHHHHHHHHhhhcccceeeeccccceEEeeeccceeeeeecCCCCCCCC-- Q lcl|Aclame:pro 1 MSTPNTLTNVAVSA-SGEVDSLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYLGETELQVLAPGQSPNAT-- 77 (402) Q Consensus 1 Ms~~n~~t~~~~~~-~~d~~alfle~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~~iG~~t~~~~~~G~~i~~~-- 77 (402) |+-++-+||++|++ ++|.++||||+|+|||+++|+++|+|++++++|++++|||++||+||+++++||++|++|+++ T Consensus 5 ~~~~~~~t~~~~~~~~~~~~a~~ie~f~g~V~~~f~~~s~~~~~~~~~~~~~G~sv~i~~ig~~t~~~~~~g~~l~~~~~ 84 (347) T protein:vir:15 5 QGGQQIGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTMPRHMLRSIASGKSAQFPVIGRTKAAYLKPGENLDDKRK 84 (347) T ss_pred ccCCccccccccCCCcchHHHHHHHHHHHHHHHHHHHhhhhhhccccccccccceeEeeeccceeeeeeccCCCCCCCCC Confidence 44444569999975 478899999999999999999999999999999999999999999999999999999999774 Q ss_pred CccccceeEeecceeeccchhhhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccccccccccc Q lcl|Aclame:pro 78 PTQADKNQLVIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKGHGF 157 (402) Q Consensus 78 ~~~~~e~~itID~~lya~~~IddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~g~~~ 157 (402) .++++|++|+||+++|++++|||||++|++|| +|+++++++||+||+++|++|++++++++.+. +....+...+|+.. 
T Consensus 85 ~~~~~e~~ltID~~~~~~~~VddlD~~q~~~D-~~~~~~~~~g~aLA~~~D~~i~~~l~~~~~~~-~~~~~~~~~~g~~~ 162 (347) T protein:vir:15 85 DIKHTEKVIHIDGLLTADVLIYDIEDAMNHYD-VRAEYTAQLGESLAMAADGAVLAELAGLVNLP-DASNENIEGLGKPT 162 (347) T ss_pred CCccceEEEEechhhhhhHHhhhHHHHhcCCc-chHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc-ccccccccccCccc Confidence 48999999999999999999999999999999 89999999999999999999999999876543 33444444444444 Q ss_pred ccccccCCcccccc----HHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccccCcccccceEE Q lcl|Aclame:pro 158 SINVNVTESEALAN----PQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATINGFVL 233 (402) Q Consensus 158 ~~~v~~~~a~~~~~----~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~~~~G~V~ 233 (402) ..........+..+ +++++++|++++++|||+|||.+|||+||+|++|++||++++|++++|+++ +.+.+|.|+ T Consensus 163 ~~~~~~~~~~~~~~~~~~~~~i~d~~~~a~~~Lde~~VP~~gR~~vv~P~~y~~LL~~~~~~~~d~~~~--~~~~~G~Vg 240 (347) T protein:vir:15 163 VLTLVKPTTGDLTDPVELGKAIIAQLTIARASLTKNYVPAADRTFYTTPDNYSAILAALMPNAANYQAL--IDHERGTIR 240 (347) T ss_pred cccccccccccchhhhhHHHHHHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHhccccccccccccc--ccccceEEE Confidence 33333333333333 578899999999999999999999999999999999999999999999643 346899999 Q ss_pred EEeccEEEecCccccccCccccccccccCCccccc--------eeeeccceeEEeecHHHhhhhhhcccceeeccchhHH Q lcl|Aclame:pro 234 SSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYD--------PIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKEK 305 (402) Q Consensus 234 ~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~--------~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~~~ 305 (402) +++||+||+|||||..+ ++.+.. ++..|..|. 
.+..|++.++|+||++|+++||++++++|.+|++++| T Consensus 241 ~i~G~~V~~Sn~lp~~~--~t~~~~-~~~~g~~~~~~~~~~~~~~~~f~~~~~l~~h~~A~g~v~~~~~~~e~~~~~~~~ 317 (347) T protein:vir:15 241 NVMGFEVVEVPHLTAGG--AGDTRE-DAPADQKHAFPATSSTTVKVALDNVVGLFQHRSAVGTVKLKDLALERARRANYQ 317 (347) T ss_pred EEeceEEEecccccccc--cccccc-cccccccccccccccceeeeccccceeeeeccceeeeeEeeceeeeecccchhh Confidence 99999999999999754 233221 122333332 3556778899999999999999999999999999999 Q ss_pred HHHHHHHHHhcCcccccceEEEEEEeeccC Q lcl|Aclame:pro 306 TYYIDTFMAEGAIPDRWEAVSVVTTKRDAT 335 (402) Q Consensus 306 ~d~i~~~~a~Ga~vlRPeaa~vv~~~~~~t 335 (402) +|+|+++|+|||+++||||+++|+++.=.- T Consensus 318 ~d~i~~~~~~G~~vlrP~~av~~~~~~~~~ 347 (347) T protein:vir:15 318 ADQIIAKYAMGHGGLRPEAAGAIVLPKVSE 347 (347) T ss_pred hhhhehhhhcCCceeccccEEEEecCCCCC Confidence 999999999999999999999998864221 No 17 >protein:vir:99675 Length: 324 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249589;genbank:gi:68299740;genbank:GeneID:3799990 Probab=100.00 E-value=5.4e-98 Score=553.86 Aligned_cols=314 Identities=14% Similarity=0.101 Sum_probs=286.1 Q ss_pred eeeeccccceEEeeeccceeeeeecCCCCCCC--CCccccceeEeecceeeccchhhhHHHhhcCccchhHHHHHHHHHH Q lcl|Aclame:pro 45 DVQTVTGTNTVSNKYLGETELQVLAPGQSPNA--TPTQADKNQLVIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQ 122 (402) Q Consensus 45 ~~rti~~Gksv~f~~iG~~t~~~~~~G~~i~~--~~~~~~e~~itID~~lya~~~IddlDe~q~~~D~vrse~s~~~G~a 122 (402) ++|+|++|||+|||+||+++++||+||++|++ +.++++|++|+||++||++|+|||||+||+||| +|+||++|+||+ T Consensus 1 ~vr~i~~g~s~~~~~iG~~~~~~~~~G~~l~~~~~~~~~~e~~itID~~l~~~~~VdDiD~~qa~~D-lr~e~s~~~G~a 79 (324) T protein:vir:99 1 MTRTITSGKSAQFPVMGRTKARYLKQGQSLDDGREDIKHTEKVITIDGLLTTDVLIYDIEDAMNHYD-VRSEYSTQMGEA 79 (324) T ss_pred CeeeeecCceEEEeeeeeeEeccccCCCCcCCCcCCcCcccEEEEecchhhhhhhhhhHHHHhcCcc-chhHHHHHHHHH Confidence 99999999999999999999999999999977 669999999999999999999999999999999 899999999999 Q ss_pred 
HHHHHHHHHHHHHHhhhhhccccccccccccccccccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeCh Q lcl|Aclame:pro 123 LKRLEDQMAIQQMLLGGIANTKAERNKPRVKGHGFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPW 202 (402) Q Consensus 123 LA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~g~~~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P 202 (402) ||+++||+|++++++++++.++...++....|++............+.+++++|++|++++++|||+|||++|||+||+| T Consensus 80 LA~~~Dq~i~~~~a~~~~~~a~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~dai~~a~~~Lde~~VP~~gR~~vv~P 159 (324) T protein:vir:99 80 LAMAADVANYAEMAKLVNSRKETTNENIEGLGAASLVKITGKKEDPAKYGTQVIQALTYARAAFAKKYIPAGDRTFYTDP 159 (324) T ss_pred HHHHHHHHHHHHHHHhhhcccccccCCcccCCccceecccccccccccCHHHHHHHHHHHHHHHhhcCCCCCCCEEEeCh Confidence 99999999999999999999999999999988888877777777778889999999999999999999999999999999 Q ss_pred HHHHHHhcccchhhcccccccCcccccceEEEEeccEEEecCccccccC-------ccccccccccCCccc-cceeeecc Q lcl|Aclame:pro 203 KFFNALRDADRIVDKTYTISQSGATINGFVLSSYNCPVIPSNRFPTFAQ-------DQAHHLLSNEDNGYR-YDPIAEMN 274 (402) Q Consensus 203 ~~y~~Ll~~~r~~n~d~~~~~~g~~~~G~V~~iaG~~V~~SNnlP~~~~-------~~t~~~ls~a~~G~~-~~~~ad~~ 274 (402) |+|++|+++.++++.+|+ +.+.+.+|.|++++||+||+|||+|+... +.++|.+++++.++. 
.+|+.+++ T Consensus 160 ~~y~~Ll~~~~~~~~~~~--~~~~~~~G~V~~i~Gf~V~~Sn~lp~~~~t~~~~a~~~~~~~~~~~~~~~~~~ky~~d~~ 237 (324) T protein:vir:99 160 DTYSAILAALMPNAANYA--ALIDPETGNIRNVMGFEVVETPHMTAQMVTNPTDAFDGTGHIFPATGDSTTTGKMTVGAD 237 (324) T ss_pred HHHHHHhhcccccccccc--cccceecceEEEEeceEEEecCCccccccccccccccccccccccccccccccccccccC Confidence 999999999999988886 34568899999999999999999998542 234455655555543 37899999 Q ss_pred ceeEEeecHHHhhhhhhcccceeeccchhHHHHHHHHHHHhcCcccccceEEEEEEeeccCccccccchhhHHHhhhccc Q lcl|Aclame:pro 275 GAVAVLFTSDALLVGRTIEVTGDIFYEKKEKTYYIDTFMAEGAIPDRWEAVSVVTTKRDATTGDAGGPGDDHATVLARAQ 354 (402) Q Consensus 275 ~~~al~fh~~Av~tv~~~dl~~e~~~d~~~~~d~i~~~~a~Ga~vlRPeaa~vv~~~~~~t~~~a~~~~~~~~~~~~~~~ 354 (402) +++||+||++|++++|++++++|.||++++|+|+|+++|+|||+++||||+++|+++.++|++++|+...+++..++++- T Consensus 238 ~~~gl~~~~~a~~tv~~~~~~~e~~~~~~~~~d~i~~~~a~G~~~lRPe~a~~v~l~~~~~~~~~~~~~~~~~~~~~~~~ 317 (324) T protein:vir:99 238 NVVGLFVHRSAVATLKLKDMALERARRPEYQADQIIAKYAMGHGGLRPEAVGAIIFEDGETPAVAPDVITGVASFAAPAS 317 (324) T ss_pred ceeEEEEehhheEEEeeecceecceechhhHHHhhhhhhhhcCcccccceEEEEEEccCccccccchhhhhhccccCccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999988875 Q ss_pred ceEEEeecchhh Q lcl|Aclame:pro 355 RKAVYVKTEGAA 366 (402) Q Consensus 355 ~~~~~~~~~~~~ 366 (402) -.+. 
+++ T Consensus 318 ~~~~-----~~~ 324 (324) T protein:vir:99 318 TRAK-----SSA 324 (324) T ss_pred ceee-----ecC Confidence 4433 333 No 18 >protein:vir:94622 Length: 341 # NCBI annotation: PfWMP4_37 # Family: family:all:2203 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762667;genbank:gi:115304375;genbank:GeneID:5142322 Probab=100.00 E-value=5.2e-75 Score=427.84 Aligned_cols=316 Identities=14% Similarity=0.057 Sum_probs=263.0 Q ss_pred CCCCcccccccccccccHHHHHH-HHHhHHHHHHHHHHhhhcccceeee--ccccceEEeeeccceeeeeecCCCCCCCC Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDSLLI-EKFNGKVNEQYLKGENILSYFDVQT--VTGTNTVSNKYLGETELQVLAPGQSPNAT 77 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~alfl-e~f~geV~t~f~~~sv~~~~~~~rt--i~~Gksv~f~~iG~~t~~~~~~G~~i~~~ 77 (402) ||+.|++|++.++ +...+.|| |+|+++|++.|+++++|+++++-++ +++|+||+||++|+.++++|++|.+|+++ T Consensus 1 ~~~~~~~~~~~~~--t~~v~~fipei~s~~i~~~l~~~~v~~~~~~d~~~~~~~Gdtv~ip~~g~~~~~d~~~~~~i~~~ 78 (341) T protein:vir:94 1 MALGNTITGPSIN--TQRGQQFIPEQWLSEVQMFRKAKMLDTSVVKTWGAQVKKGDTFHVPRISELGVEDKATDVPVGVQ 78 (341) T ss_pred Ccchhhhcccccc--chhHHHHHHHHHHHHHHHHHHhhcchhhccccccccccCCceEEEeccCcceeeeecCCCccccc Confidence 9999999999997 66778889 9999999999999999999987665 46799999999999999999999999999 Q ss_pred CccccceeEeecceeeccchhhhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccccccccccc Q lcl|Aclame:pro 78 PTQADKNQLVIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKGHGF 157 (402) Q Consensus 78 ~~~~~e~~itID~~lya~~~IddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~g~~~ 157 (402) .+.+++++|+||+.+|+++.|+|+|+.|+++| +|+++.++++++||+++|+.++..+..++....+ T Consensus 79 ~~~~~~~~itiD~~~~~~~~i~d~d~~~~~~d-~~~~~~~~~~~aLA~~~D~~i~~~~a~~~~~~~~------------- 144 (341) T protein:vir:94 79 PVNDTDFVITVDTDRTTAVALDDLLEIQASYD-LRAPYLEAMGYALAKDMTGSILGLRAAVQNTASQ------------- 144 (341) T ss_pred cccCceEEEEEeeeeecceeechHHHHhhccc-hHHHHHHHHHHHHHHHHHHHHHHHhhhccccccC------------- Confidence 99999999999999999999999999999999 8999999999999999999999887654422111 Q ss_pred 
ccccccCCccccccHH-HHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccccCcccccceEEEEe Q lcl|Aclame:pro 158 SINVNVTESEALANPQ-YVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATINGFVLSSY 236 (402) Q Consensus 158 ~~~v~~~~a~~~~~~~-~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~~~~G~V~~ia 236 (402) ..+.........+++ ..|+.|.+++++|||++||.+|||+||+|++|+.|+++++|+++++.+ ++.+.+|.|++++ T Consensus 145 -~~~~~~~~~~t~~~~~~~~~~i~~a~~~Lde~~VP~~gR~lvv~P~~~~~Ll~~~~~~~~~~~g--~~~l~~G~ig~i~ 221 (341) T protein:vir:94 145 -NVFSSSNGAITGNGQAFSFAVFLAARRLLLEADVPEEKIVLLISPGQESALFTIPQFISKDFIN--NAPIAQGQIGSLM 221 (341) T ss_pred -ccccCccccccCchhhhhHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHhhchhhhhhhccc--cchhheeeeeeEe Confidence 011111111222333 348999999999999999999999999999999999999999999864 3457899999999 Q ss_pred ccEEEecCccccccCccccccc----------cccCCccccceeeeccceeEEeecHHHhhhhhhcc-----------cc Q lcl|Aclame:pro 237 NCPVIPSNRFPTFAQDQAHHLL----------SNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIE-----------VT 295 (402) Q Consensus 237 G~~V~~SNnlP~~~~~~t~~~l----------s~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~d-----------l~ 295 (402) ||+||+|||+|........... ...+...--.+.+++..+++|+||++|++++|+++ +. 
T Consensus 222 G~~V~~Sn~lp~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~gl~~~~~av~~~k~~~~~~~~~~~~~~~~ 301 (341) T protein:vir:94 222 GVRVIRTSLIGNNSATGWRNGAPTIAPAEATPGFTGSRYLPKQDSFTSLPATFTGNSRPVHTAVMCHMDWAAAVVSKAPR 301 (341) T ss_pred ceEEEEeccccccccccccccccceecccccccccccccccccccccccEEEEEEecccccceeeecchhhhcccccccc Confidence 9999999999975533211110 01122222356778899999999999999999777 66 Q ss_pred eeeccchhHHHHHHHHHHHhcCcccccceEEEEEEeeccC Q lcl|Aclame:pro 296 GDIFYEKKEKTYYIDTFMAEGAIPDRWEAVSVVTTKRDAT 335 (402) Q Consensus 296 ~e~~~d~~~~~d~i~~~~a~Ga~vlRPeaa~vv~~~~~~t 335 (402) ++..|+.+||+|+|+++|+||||+||||||+.|++...+- T Consensus 302 ~~~~~~~~~~~~~i~~~~~~G~~~lrp~~~v~~~~~~~~~ 341 (341) T protein:vir:94 302 VTQSFENREQVWLMVGRQAYGARLYRPLHAVNIHTTGDTV 341 (341) T ss_pred ccccchhhhhhhhhhhhhhhcccccCcceeEEEecCcCCC Confidence 7788999999999999999999999999998776655543 No 19 >protein:vir:80180 Length: 381 # NCBI annotation: capsid protein # Family: family:all:2203 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285797;genbank:gi:148747831;genbank:GeneID:5220456 Probab=100.00 E-value=1.7e-62 Score=359.27 Aligned_cols=321 Identities=12% Similarity=0.013 Sum_probs=244.6 Q ss_pred CCCCcccccccccc---cccHHHHHH-HHHhHHHHHHHHHHhhhcccceeeec--cccceEEeeeccceeeeeecCCCCC Q lcl|Aclame:pro 1 MSTPNTLTNVAVSA---SGEVDSLLI-EKFNGKVNEQYLKGENILSYFDVQTV--TGTNTVSNKYLGETELQVLAPGQSP 74 (402) Q Consensus 1 Ms~~n~~t~~~~~~---~~d~~alfl-e~f~geV~t~f~~~sv~~~~~~~rti--~~Gksv~f~~iG~~t~~~~~~G~~i 74 (402) |++--.. .++.+ +....+.|+ |+|++||++.|++..++.++++.+.. 
+.|+|++||++|++++.+|++|.++ T Consensus 1 ~~~~~~~--~~~~~~~~~~t~~~~fiPev~s~~v~~~l~~~lv~~~l~~~~~~~~~~GdTV~ip~~g~~~a~d~~~g~~i 78 (381) T protein:vir:80 1 MATIQGT--GGYKGSAVDLSNVQVFIPEVWSSEVRMFRDQKFAALEATKKIPFEGKKGDLIHIPNISRAAVYDKQPQTPV 78 (381) T ss_pred Cceeccc--ccccCcccchhhHHhhhhHHHHHHHHHHHHHhhhhhhccccccceeecCceEEeeccCcceeeeecCCCcc Confidence 8876411 22222 245567777 99999999999999999998876654 6799999999999999999999999 Q ss_pred CCCCccccceeEeecceeeccchhhhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccccccc Q lcl|Aclame:pro 75 NATPTQADKNQLVIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKG 154 (402) Q Consensus 75 ~~~~~~~~e~~itID~~lya~~~IddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~g 154 (402) .++.+.+++++++||+.+|++++|+|+|++|.++| +|+++.++++++||+++|+.++..+.+......+....... + T Consensus 79 ~~~~~~~~~~~itID~~~~~~~~Idd~D~~~~~~D-~~~~~~~~~~~aLA~~~D~~i~~~~~~~~~~~~~~~~t~~~--~ 155 (381) T protein:vir:80 79 NLQARTDSEFTFTVTKYKESSFMIEDIVNTQASYT-LRQYYTKEAGYALARDMDNFALAHRAVINAFPSQRIYSYDT--T 155 (381) T ss_pred cccccCCceEEEEEeeeeecceeechHHHHhhccC-hHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccc--c Confidence 99999999999999999999999999999999999 89999999999999999999998887655433322111110 0 Q ss_pred cccccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccccCcccccceEEE Q lcl|Aclame:pro 155 HGFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATINGFVLS 234 (402) Q Consensus 155 ~~~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~~~~G~V~~ 234 (402) ..... 
........+....|+.|++++++|||++||.++||+||+|++|+.|+++++|+|++|.+ ...+.+|.|++ T Consensus 156 i~~~~---~~~~~t~~~~~~t~~~i~~a~~~Lde~~VP~egR~lvv~P~~~~~Ll~~~~~~~ad~~~--~~~l~~G~Ig~ 230 (381) T protein:vir:80 156 LGDGT---VNAHLTGTPAPLTYAALLLAKQKLDEADVPQEGRIVMVSPAQYIDLLSINQFISVDFSQ--VKPVTSGVVGT 230 (381) T ss_pred ccccc---cccccccchhhHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhhchhhhhhhhcc--chhhhceeeeE Confidence 00000 01111223456679999999999999999999999999999999999999999999864 33578999999 Q ss_pred EeccEEEecCccccccCccccccccc-cCCc-----cccceeeec----------------------------------- Q lcl|Aclame:pro 235 SYNCPVIPSNRFPTFAQDQAHHLLSN-EDNG-----YRYDPIAEM----------------------------------- 273 (402) Q Consensus 235 iaG~~V~~SNnlP~~~~~~t~~~ls~-a~~G-----~~~~~~ad~----------------------------------- 273 (402) ++||+||+|||+|.... +.+.... +... +...|.++| T Consensus 231 i~G~~Vv~Sn~lp~~~~--t~~~~~agap~~~~~~~~~~~~~g~~s~~a~av~~~k~yd~~~~~~~~~~~~~~g~~~~~~ 308 (381) T protein:vir:80 231 ILGMEVIVTTQIGINSL--TGYVNGQGAPTQPTPGVLGSPYLPDQAGTANVVNTGSASDLAVSLSYFGLPVFSGAGATAA 308 (381) T ss_pred EcceEEEeecccccccc--cceeeeccccccccccccccccccccccceeeeeeeeeeceeeeeeeccceeeecceeeec Confidence 99999999999997432 2222110 0000 111222222 Q ss_pred --cceeEEe--ecHHHhhhh-----hhccccee----eccchhHHHHHHHHHHHhcCcccccceEEEEEEeec Q lcl|Aclame:pro 274 --NGAVAVL--FTSDALLVG-----RTIEVTGD----IFYEKKEKTYYIDTFMAEGAIPDRWEAVSVVTTKRD 333 (402) Q Consensus 274 --~~~~al~--fh~~Av~tv-----~~~dl~~e----~~~d~~~~~d~i~~~~a~Ga~vlRPeaa~vv~~~~~ 333 (402) ..++|++ |.+++.+.+ +.+.++.+ .-+...+|+|+|+++++||++++||.+|+.|++.+- T Consensus 309 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 381 (381) T protein:vir:80 309 DGGQTLGSFGGANRWATAVVCHPDWLAVGVQQNVKSESSRETMYLADAFVTSCVYGAKVFRPDHCVLLHTSGI 381 (381) T ss_pred CCCceeeeehhhhhhhhhcccccccccccceeEeecccchhheeehhhhhhhhhhccccccchhhhhhhhcCC Confidence 2345666 567777777 55554444 445667799999999999999999999999998766 No 20 >protein:vir:105822 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: 
mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655767;genbank:gi:109522090;genbank:GeneID:4157630 Probab=100.00 E-value=4.3e-59 Score=340.54 Aligned_cols=267 Identities=15% Similarity=0.120 Sum_probs=227.1 Q ss_pred CCCCcccccccccccccHHHHHH-HHHhHHHHHHHHHHhhhcccceee---eccccceEEeeeccceeeeeec-CCCCCC Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDSLLI-EKFNGKVNEQYLKGENILSYFDVQ---TVTGTNTVSNKYLGETELQVLA-PGQSPN 75 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~alfl-e~f~geV~t~f~~~sv~~~~~~~r---ti~~Gksv~f~~iG~~t~~~~~-~G~~i~ 75 (402) |+. ..|+ |+|+++|++.|++.+++.++++.. +++.|+|++||++|++++.+|+ .|.++. T Consensus 1 MA~----------------~~~~pe~~~~~v~~~~~~~lv~~~l~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~ 64 (273) T protein:vir:10 1 MAF----------------NNFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTS 64 (273) T ss_pred Ccc----------------hhhhHHHHHHHHHHHHHhhhccchhhccccccccccCceEEEeecccccccccccCCCccC Confidence 443 4577 999999999999999999988643 5788999999999999998776 467788 Q ss_pred CCCccccceeEeecceeeccchhhhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccccccccc Q lcl|Aclame:pro 76 ATPTQADKNQLVIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKGH 155 (402) Q Consensus 76 ~~~~~~~e~~itID~~lya~~~IddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~g~ 155 (402) .+.+..++.+++||+.+|+.+.|+|+|+.|.++| +++ +.++++++||+++|+.++..+..++... 
T Consensus 65 ~~~~~~~~~~~tid~~~~~~~~i~d~d~~~~~~~-~~~-~~~~~~~alA~~vD~~i~~~~~~a~~~~------------- 129 (273) T protein:vir:10 65 ADAISDTGVDLLIDQEKSIDFLVDDIDRVQVAGS-LEA-YTRAGATALATDTDKFIADMLVDNGTAL------------- 129 (273) T ss_pred ccccccceEEEEEeeeeecceEeecHHHhhhhcc-HHH-HHHHHHHHHHHHHHHHHHHHHhcccccc------------- Confidence 8999999999999999999999999999999999 775 8899999999999999998876433110 Q ss_pred ccccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccc-hhhcccccccCcccccceEEE Q lcl|Aclame:pro 156 GFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADR-IVDKTYTISQSGATINGFVLS 234 (402) Q Consensus 156 ~~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r-~~n~d~~~~~~g~~~~G~V~~ 234 (402) ......+++.+|+.|++++.+|||++||.++||+||+|++|+.|++++. +.+.++.+ +.+.+++|.|++ T Consensus 130 ---------~~~~~~~~~~~~~~i~~a~~~ld~~~vP~~~R~lvv~p~~~~~L~~~~~~~~~~~~~~-~~~~l~~G~ig~ 199 (273) T protein:vir:10 130 ---------TGSAPTDADDAFDLIAKALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSG-DAAGLRAGTIGN 199 (273) T ss_pred ---------ccccccchhHHHHHHHHHHHHhhhcCCCcCCCEEEECHHHHHHHhcchhhhhhhhccc-cccceeeeeeeE Confidence 1112345678899999999999999999999999999999999999876 54556543 344578999999 Q ss_pred EeccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccchhHHHHHHHHHHH Q lcl|Aclame:pro 235 SYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKEKTYYIDTFMA 314 (402) Q Consensus 235 iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~~~~d~i~~~~a 314 (402) ++||+||+|||+|.... 
+ .++.||++|++.+++++ ++|..|++++|+|+|+++++ T Consensus 200 i~G~~v~~s~~lp~~~~---------------~---------~~~~~~~~A~~~a~q~~-~~e~~r~~~~~~~~v~~~~~ 254 (273) T protein:vir:10 200 LLGARIVESNNLRDTDD---------------E---------QFVAFHPSAAAYVSQID-TVEALRDQDSFSDRIRALHV 254 (273) T ss_pred EeceEEEEecccccCCc---------------c---------EEEEEeccceeeeeeee-hhhcccCCCcceeeeeeeee Confidence 99999999999995321 0 14788999999999776 89999999999999999999 Q ss_pred hcCcccccceEEEEEEeec Q lcl|Aclame:pro 315 EGAIPDRWEAVSVVTTKRD 333 (402) Q Consensus 315 ~Ga~vlRPeaa~vv~~~~~ 333 (402) ||++++|||++++|+..+. T Consensus 255 yg~~v~~~~~~~~l~~~g~ 273 (273) T protein:vir:10 255 YGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred eeeeEeccceEEEEeccCC Confidence 9999999999999876665 No 21 >protein:vir:102605 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_655002;genbank:gi:109392192;genbank:GeneID:4157227 Probab=100.00 E-value=4.3e-59 Score=340.54 Aligned_cols=267 Identities=15% Similarity=0.120 Sum_probs=227.1 Q ss_pred CCCCcccccccccccccHHHHHH-HHHhHHHHHHHHHHhhhcccceee---eccccceEEeeeccceeeeeec-CCCCCC Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDSLLI-EKFNGKVNEQYLKGENILSYFDVQ---TVTGTNTVSNKYLGETELQVLA-PGQSPN 75 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~alfl-e~f~geV~t~f~~~sv~~~~~~~r---ti~~Gksv~f~~iG~~t~~~~~-~G~~i~ 75 (402) |+. ..|+ |+|+++|++.|++.+++.++++.. +++.|+|++||++|++++.+|+ .|.++. 
T Consensus 1 MA~----------------~~~~pe~~~~~v~~~~~~~lv~~~l~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~ 64 (273) T protein:vir:10 1 MAF----------------NNFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTS 64 (273) T ss_pred Ccc----------------hhhhHHHHHHHHHHHHHhhhccchhhccccccccccCceEEEeecccccccccccCCCccC Confidence 443 4577 999999999999999999988643 5788999999999999998776 467788 Q ss_pred CCCccccceeEeecceeeccchhhhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccccccccc Q lcl|Aclame:pro 76 ATPTQADKNQLVIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKGH 155 (402) Q Consensus 76 ~~~~~~~e~~itID~~lya~~~IddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~g~ 155 (402) .+.+..++.+++||+.+|+.+.|+|+|+.|.++| +++ +.++++++||+++|+.++..+..++... T Consensus 65 ~~~~~~~~~~~tid~~~~~~~~i~d~d~~~~~~~-~~~-~~~~~~~alA~~vD~~i~~~~~~a~~~~------------- 129 (273) T protein:vir:10 65 ADAISDTGVDLLIDQEKSIDFLVDDIDRVQVAGS-LEA-YTRAGATALATDTDKFIADMLVDNGTAL------------- 129 (273) T ss_pred ccccccceEEEEEeeeeecceEeecHHHhhhhcc-HHH-HHHHHHHHHHHHHHHHHHHHHhcccccc------------- Confidence 8999999999999999999999999999999999 775 8899999999999999998876433110 Q ss_pred ccccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccc-hhhcccccccCcccccceEEE Q lcl|Aclame:pro 156 GFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADR-IVDKTYTISQSGATINGFVLS 234 (402) Q Consensus 156 ~~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r-~~n~d~~~~~~g~~~~G~V~~ 234 (402) ......+++.+|+.|++++.+|||++||.++||+||+|++|+.|++++. 
+.+.++.+ +.+.+++|.|++ T Consensus 130 ---------~~~~~~~~~~~~~~i~~a~~~ld~~~vP~~~R~lvv~p~~~~~L~~~~~~~~~~~~~~-~~~~l~~G~ig~ 199 (273) T protein:vir:10 130 ---------TGSAPTDADDAFDLIAKALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSG-DAAGLRAGTIGN 199 (273) T ss_pred ---------ccccccchhHHHHHHHHHHHHhhhcCCCcCCCEEEECHHHHHHHhcchhhhhhhhccc-cccceeeeeeeE Confidence 1112345678899999999999999999999999999999999999876 54556543 344578999999 Q ss_pred EeccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccchhHHHHHHHHHHH Q lcl|Aclame:pro 235 SYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKEKTYYIDTFMA 314 (402) Q Consensus 235 iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~~~~d~i~~~~a 314 (402) ++||+||+|||+|.... + .++.||++|++.+++++ ++|..|++++|+|+|+++++ T Consensus 200 i~G~~v~~s~~lp~~~~---------------~---------~~~~~~~~A~~~a~q~~-~~e~~r~~~~~~~~v~~~~~ 254 (273) T protein:vir:10 200 LLGARIVESNNLRDTDD---------------E---------QFVAFHPSAAAYVSQID-TVEALRDQDSFSDRIRALHV 254 (273) T ss_pred EeceEEEEecccccCCc---------------c---------EEEEEeccceeeeeeee-hhhcccCCCcceeeeeeeee Confidence 99999999999995321 0 14788999999999776 89999999999999999999 Q ss_pred hcCcccccceEEEEEEeec Q lcl|Aclame:pro 315 EGAIPDRWEAVSVVTTKRD 333 (402) Q Consensus 315 ~Ga~vlRPeaa~vv~~~~~ 333 (402) ||++++|||++++|+..+. 
T Consensus 255 yg~~v~~~~~~~~l~~~g~ 273 (273) T protein:vir:10 255 YGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred eeeeEeccceEEEEeccCC Confidence 9999999999999876665 No 22 >protein:vir:1781 Length: 221 # NCBI annotation: minor capsid protein # Family: family:all:975 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570347;genbank:gi:18640506;genbank:GeneID:932719 Probab=100.00 E-value=1e-58 Score=338.48 Aligned_cols=216 Identities=10% Similarity=0.036 Sum_probs=166.6 Q ss_pred ecceeeccchhhhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccccccccccccccccCCcc Q lcl|Aclame:pro 88 IDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKGHGFSINVNVTESE 167 (402) Q Consensus 88 ID~~lya~~~IddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~g~~~~~~v~~~~a~ 167 (402) ||++|+++++|||||++|+||| +|+|+++|+||+||+++||+|++++++||++..|....+ +++.... .+. T Consensus 1 iD~lL~a~~~VdDiD~aqa~~d-vr~e~t~e~G~ALA~~~D~~i~~~~~~aA~~~~p~~~~~----~g~~~~~----~a~ 71 (221) T protein:vir:17 1 MDDLLVASQFVYDLDEILAQWN-TRSEISKQIGEALAIHYDERIARVLASASIAAAPVTGQD----GGFSVNI----GAG 71 (221) T ss_pred CCcchhHHHHHHhHHHHHhhhH-HHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCcccccc----cCcceec----ccc Confidence 9999999999999999999999 899999999999999999999999999998877644322 2222222 334 Q ss_pred ccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhc--ccchhhcccccccCcccccc-eEEEEeccEEEecC Q lcl|Aclame:pro 168 ALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRD--ADRIVDKTYTISQSGATING-FVLSSYNCPVIPSN 244 (402) Q Consensus 168 ~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~--~~r~~n~d~~~~~~g~~~~G-~V~~iaG~~V~~SN 244 (402) .+.+++++|++|++++++|||+|||.+|||+||+|++||.||+ +++++|+|+++++.+ ..+| .|++++||+||+|| T Consensus 72 ~t~~~~~l~dai~~a~~~LdekdVP~~gR~~vv~P~~y~~LL~~~d~~~~n~d~~~s~g~-~~~g~~i~~v~G~~V~~Sn 150 (221) T protein:vir:17 72 NTNNAQAIVDGFFEAAAVLDERSAPMDGRVAVLSPRQYYSLISSVDTNILNREIGNTQGD-MNTGKGLYVNAGIRIYKSN 150 (221) T ss_pred ccCCHHHHHHHHHHHHHHHhhcCCCCCCCEEEeCcHHHHHHHHhcCcceeeeeccccccc-ccccceeeeecCcEEEEec Confidence 5678999999999999999999999999999999999999997 
588999999765444 5666 59999999999999 Q ss_pred ccccccCcccccc-ccc-cCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccchhHHHHHHHHHHHhcCccccc Q lcl|Aclame:pro 245 RFPTFAQDQAHHL-LSN-EDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKEKTYYIDTFMAEGAIPDRW 322 (402) Q Consensus 245 nlP~~~~~~t~~~-ls~-a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~~~~d~i~~~~a~Ga~vlRP 322 (402) |+|+..+...... ... ...++..+|+++|+|++||+|||+|+||||++.+-..- +.-.+ . ..+.|| T Consensus 151 nlP~~~gt~~~~~ag~~~~~~~~~~~yr~~fs~~~glv~~~~Avgtvkl~~~~~~~---~~~~~-----~----~~~~~~ 218 (221) T protein:vir:17 151 VLASLYGTNLVTDPGDATTSGENNGSYRPAITDRAGLVFHKEAADTVEVLLPPSRP---PLVIS-----M----FSIRRP 218 (221) T ss_pred cCCcccccccccCCccccccccccccccccccceEEEEEcchheeeeeeecCCCCC---ceeee-----e----eeccCC Confidence 9998553311100 111 12233457899999999999999999999999853321 11000 0 013333 Q ss_pred ceE Q lcl|Aclame:pro 323 EAV 325 (402) Q Consensus 323 eaa 325 (402) +-- T Consensus 219 ~~~ 221 (221) T protein:vir:17 219 DRR 221 (221) T ss_pred CCC Confidence 322 No 23 >protein:vir:7990 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817344;genbank:gi:29565772;genbank:GeneID:1258978 Probab=100.00 E-value=8.5e-57 Score=327.97 Aligned_cols=267 Identities=15% Similarity=0.124 Sum_probs=225.2 Q ss_pred CCCCcccccccccccccHHHHHH-HHHhHHHHHHHHHHhhhcccceee---eccccceEEeeeccceeeeee-cCCCCCC Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDSLLI-EKFNGKVNEQYLKGENILSYFDVQ---TVTGTNTVSNKYLGETELQVL-APGQSPN 75 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~alfl-e~f~geV~t~f~~~sv~~~~~~~r---ti~~Gksv~f~~iG~~t~~~~-~~G~~i~ 75 (402) |++ ..|+ |+|+++|++.|++.+++.++++.. ..+.|+|++||++|.+++.+| ..|.++. 
T Consensus 1 MA~----------------~~~~pei~~~~v~~~~~~~lv~~~l~~~~~~~~~~~GdTv~ip~~~~~~~~d~~~~~~~~~ 64 (273) T protein:vir:79 1 MAF----------------NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTS 64 (273) T ss_pred Ccc----------------hhhhHHHHHHHHHHHHHhhccchhhhhccccccccCCcEEEEeecCcccccccccCCCccC Confidence 665 3377 999999999999999999887543 335699999999999998855 4688899 Q ss_pred CCCccccceeEeecceeeccchhhhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccccccccc Q lcl|Aclame:pro 76 ATPTQADKNQLVIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKGH 155 (402) Q Consensus 76 ~~~~~~~e~~itID~~lya~~~IddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~g~ 155 (402) .+.+..++.+++||+.+++.+.|+|+|+.|.++| +++ +.++++++||+++|+.++..+..++... T Consensus 65 ~~~~~~~~~~~tid~~~~~~~~i~d~d~~~~~~~-~~~-~~~~~~~ala~~vD~~i~~~~~~a~~~~------------- 129 (273) T protein:vir:79 65 ADAISDTGVDLLIDQEKSIDFLVDDIDRVQVAGS-LEA-YTRAGATALATDTDKFIADMLVDNGTAL------------- 129 (273) T ss_pred ccccccceEEEEEeeecccceeeccHHHHhhccc-HHH-HHHHHHHHHHHHHHHHHHHHHhhccccc------------- Confidence 9999999999999999999999999999999999 775 8899999999999999988775432110 Q ss_pred ccccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhccc-chhhcccccccCcccccceEEE Q lcl|Aclame:pro 156 GFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDAD-RIVDKTYTISQSGATINGFVLS 234 (402) Q Consensus 156 ~~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~-r~~n~d~~~~~~g~~~~G~V~~ 234 (402) ......+++.+++.|.+++.+|||++||.+|||+||+|++|+.|++++ +|.+.++.+ +++.+.+|.|++ T Consensus 130 ---------~~~~~~~~~~~~~~i~~a~~~ld~~~vP~~~R~lvv~p~~~~~Ll~~~~~~~~~~~~~-~~~~l~~G~ig~ 199 (273) T protein:vir:79 130 ---------TGSAPSDADDAFDLIASALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSG-DAAGLRAGTIGN 199 (273) T ss_pred ---------ccccccchhhHHHHHHHHHHHhhhccCCccCcEEEECHHHHHHHhhchhhhhhhhhcc-cccceeeeEeeE Confidence 011223566789999999999999999999999999999999999976 467777654 344578999999 Q ss_pred 
EeccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccchhHHHHHHHHHHH Q lcl|Aclame:pro 235 SYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKEKTYYIDTFMA 314 (402) Q Consensus 235 iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~~~~d~i~~~~a 314 (402) ++||+|++||++|...+ + ..+.||++|++.++.+. ++|..|++++|+++|.++++ T Consensus 200 ~~G~~i~~s~~lp~~~~---------------~---------~~~a~~~~A~~~a~~~~-~~e~~r~~~~~~~~v~~~~~ 254 (273) T protein:vir:79 200 LLGARIVESNNLRDTDD---------------E---------QFVAFHPSAAAYVSQID-TVEALRDQDSFSDRIRALHV 254 (273) T ss_pred EeceEEEecccccccCc---------------e---------EEEEEeccceeeeeehh-hhhcccCcccceeeeeeeee Confidence 99999999999996321 0 13678999999998775 89999999999999999999 Q ss_pred hcCcccccceEEEEEEeec Q lcl|Aclame:pro 315 EGAIPDRWEAVSVVTTKRD 333 (402) Q Consensus 315 ~Ga~vlRPeaa~vv~~~~~ 333 (402) ||++++|||++++|+..+. T Consensus 255 yg~~v~~p~~vv~~~~~g~ 273 (273) T protein:vir:79 255 YGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred eeeEEecCceEEEEeccCC Confidence 9999999999998876655 No 24 >protein:vir:3136 Length: 322 # NCBI annotation: hypothetical protein # Family: family:all:11728 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640318;genbank:gi:21234405;genbank:GeneID:956058 Probab=100.00 E-value=6e-57 Score=328.79 Aligned_cols=309 Identities=12% Similarity=0.074 Sum_probs=235.2 Q ss_pred CCCCcccccccccccccHHHHHH-HHHhHHHHHHHHHHhhhcccceeeeccccceEEeeeccceeeeeecCCCCCCCCCc Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDSLLI-EKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYLGETELQVLAPGQSPNATPT 79 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~alfl-e~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~~iG~~t~~~~~~G~~i~~~~~ 79 (402) ||+.|+.++. .++|. 
|+|+.+++--+.++.+...+.++.....|+||+||+||++++++|+.+++|..+++ T Consensus 1 ~~~~n~ts~~--------qafi~~EiWsa~il~~l~~~Lv~~~~~~~~d~g~GDtV~InsIg~~tV~dY~~~~~i~~d~l 72 (322) T protein:vir:31 1 MSTGNNTSNT--------QALIVSEIWADEIEDILHEKLLDVNIARVVDFPDGDKLTIPSVGTPVVRSRPEQGDFTFDNL 72 (322) T ss_pred CCCCCCcccc--------eEEeehhhhHHHHHHHhhhhhhhhhhhcccccCCCCeEEeccccccccccccCCCCcccccC Confidence 9998843332 34554 99999999999999998888776566789999999999999999999999999999 Q ss_pred cccceeEeecceeeccchhhhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccccccccccccc Q lcl|Aclame:pro 80 QADKNQLVIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKGHGFSI 159 (402) Q Consensus 80 ~~~e~~itID~~lya~~~IddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~g~~~~~ 159 (402) .+.+.+|+|||.+|+.|.|+| |.+|..+| ++.++.+++||+||+.+|+++...|..+|...+. ...+....+.. .. T Consensus 73 tt~~~~l~IDq~KYfaf~VdD-D~~Qa~~d-l~~~~~~~aa~ala~~~D~fva~lL~~gA~~~~~-~~~p~vin~~~-~~ 148 (322) T protein:vir:31 73 DTGEISIILRDEVYAGNAISK-KLRQDSRW-ISNVGAMLPAEQARAIMERYQTDLLALGNAQFAG-QNDPNVINGVP-HR 148 (322) T ss_pred CCceEEEEEehhhhhccccch-hHHHhhhh-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhc-cCCcceecCCc-cc Confidence 999999999999999999999 99999999 8999999999999999999999988877643221 00111111111 11 Q ss_pred ccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHH---------HHhcccchhhcccccccCcccccc Q lcl|Aclame:pro 160 NVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFN---------ALRDADRIVDKTYTISQSGATING 230 (402) Q Consensus 160 ~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~---------~Ll~~~r~~n~d~~~~~~g~~~~G 230 (402) . ....+++...|+.|++++.+|||+|||.+|||+||+|++|. +|++|+||+..+-+++..|. 
+ T Consensus 149 i-----v~~gt~~~~ay~~lv~l~~kLdkanVP~~gR~vVV~P~~~~~L~~i~~~~~l~~D~rf~~i~~sG~a~g~---~ 220 (322) T protein:vir:31 149 F-----VGTGTDQTMDVTDFSRVNYVMTQSKMPMGGMIGIIDPSVAHHLETITNISNISNNPRWEGIVESGIAPDM---Q 220 (322) T ss_pred e-----eccCCCchhhHHHHHHHHHHhccccCCCCCeEEEeCchhhhhhhhhhhhhhhhccccccccccccchhhH---H Confidence 1 12234666789999999999999999999999999999865 56889999854433333322 2 Q ss_pred eEEEEeccEEEecCccccccCccccccccccCCccccceeeeccceeEEee---cHHHhhhhhhcccceeeccchhHHHH Q lcl|Aclame:pro 231 FVLSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLF---TSDALLVGRTIEVTGDIFYEKKEKTY 307 (402) Q Consensus 231 ~V~~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~f---h~~Av~tv~~~dl~~e~~~d~~~~~d 307 (402) .|++++||+||+||++|...-. . ++|.-...+.++.+ |..+.+. |...++..+.++ +.|.||++.||.| T Consensus 221 ~Vg~~~GF~V~~SN~l~~~~~~----i--~aG~d~~~t~ag~~-n~f~~~~~~~~~~~~~~~~~l~-~~e~~r~~~~~~d 292 (322) T protein:vir:31 221 FVRSVYGIDLFVSNLLADANET----I--NAGGDARSTTAGKC-NMFMNVSDMGLLPFVVAWKEMP-TTKSFIDDYNDDL 292 (322) T ss_pred HHHHHhceeeeeeccccccccc----c--ccCcccccccceee-cccccccchhhhhhhhHhhhhh-hhhcccCcccccc Confidence 5999999999999999843311 1 12222222333333 3233322 344555666665 8899999999999 Q ss_pred HHHHHHHhcCcccccceEEEEEEeeccCcc Q lcl|Aclame:pro 308 YIDTFMAEGAIPDRWEAVSVVTTKRDATTG 337 (402) Q Consensus 308 ~i~~~~a~Ga~vlRPeaa~vv~~~~~~t~~ 337 (402) .+.+++.||+|++|||-+++|.+..+.++= T Consensus 293 ~~~~~~~~g~g~~r~e~l~~~~a~~~~~~~ 322 (322) T protein:vir:31 293 NTATTARWGNGLVRDENLVCVLANADKVTF 322 (322) T ss_pred ceeeeeeecceeecccceEEEEeccccccC Confidence 999999999999999999999887654433 No 25 >protein:vir:102655 Length: 322 # NCBI annotation: Hypothetical protein # Family: family:all:6384 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_052979;genbank:gi:50282923;genbank:GeneID:2948122 Probab=100.00 E-value=6.6e-52 Score=301.17 Aligned_cols=308 Identities=13% Similarity=0.101 Sum_probs=222.5 Q ss_pred CCCCccccc-ccccccccHHHHHHHHHhHHHHHHHHH-Hhhhcccceeeecc-ccc------eEEeeeccceeeeeecCC Q lcl|Aclame:pro 1 
MSTPNTLTN-VAVSASGEVDSLLIEKFNGKVNEQYLK-GENILSYFDVQTVT-GTN------TVSNKYLGETELQVLAPG 71 (402) Q Consensus 1 Ms~~n~~t~-~~~~~~~d~~alfle~f~geV~t~f~~-~sv~~~~~~~rti~-~Gk------sv~f~~iG~~t~~~~~~G 71 (402) |...+..+. |--+ .+..+.|+|+|..+|+..||+ +++|++.++.++-. ++. ++.++.+|+..+..+.+. T Consensus 1 ~~~~~~~~~~~~Ms--~~i~~~fv~qy~~~v~~~~qq~~s~L~~tV~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d 78 (322) T protein:vir:10 1 MKLNAIMSMLPLIA--GDIDQAFVQTYETTLRILSQQKSAKLKQYCQHKNESSESHNWETLASMDPDAVKRKRSRQQSAD 78 (322) T ss_pred Ccccceeeeeeeee--chhhhHHHHHHHHHHHHHHHHhhhhhhcccccccccccccceeecccccccccccccccccccC Confidence 988887776 4444 467899999999999999998 99999999888733 333 344455555555544443 Q ss_pred CC--CCCCCccccceeEeecceeeccchhhhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccc Q lcl|Aclame:pro 72 QS--PNATPTQADKNQLVIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNK 149 (402) Q Consensus 72 ~~--i~~~~~~~~e~~itID~~lya~~~IddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~ 149 (402) +. ++.+..++..+.+++++. |+.++|||+|++|.++| .|++|.++++++|+|.+|+.|+..+...|.. + T Consensus 79 ~~~dtp~~~~~~~~r~~~~~d~-~~~~~VDd~D~~k~~~D-~~~~~~~~~a~AL~R~~D~~I~~a~~g~a~~------~- 149 (322) T protein:vir:10 79 GTYPTPVNNKPFAKRRTNVDTY-DTGHVVEQEDISQMLLD-PNSALITSQAYAMARKTDDLIIAGAWKPASI------K- 149 (322) T ss_pred cccCCCccccccceEEEeeccc-ccceecchHHHHHhhcC-chHHHHHHHHHHhhhHHHHHHHhhhhccccc------c- Confidence 32 223344566666666555 78899999999999999 8999999999999999999887666543311 1 Q ss_pred ccccccccccccccCCcccc-ccHHHHHHHHHHHHHHHHhhcCCcc-CcEEEeChHHHHHHhcccchhhcccccccCccc Q lcl|Aclame:pro 150 PRVKGHGFSINVNVTESEAL-ANPQYVMAAVEYALEQQLEQEVDIS-DVAIMMPWKFFNALRDADRIVDKTYTISQSGAT 227 (402) Q Consensus 150 ~~~~g~~~~~~v~~~~a~~~-~~~~~l~dai~~a~~~LdekdVP~~-gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~~ 227 (402) +.+...... ...... .+..-.+++|++|++.|+|++||++ +||+||+|++|+.||++++|+++||.+ ..... 
T Consensus 150 ----~~gt~v~~~-ss~~i~~g~~g~t~~kl~~a~~~l~~~dvp~d~~R~~vv~p~~~~~LL~d~~~ts~D~~~-~~~l~ 223 (322) T protein:vir:10 150 ----GTGQPVEFL-ATQEIGDGTKPISFDYVTEITERFLENEIEPEVSKVIVIGPTQARKLLQITEATSADYTS-AMDLQ 223 (322) T ss_pred ----ccccccccC-CCcccccCccchhHHHHHHHHHHHHhcCCCCCCCeEEEeCHHHHHHHhcchhhhhhhccc-chhhh Confidence 111111100 000000 0001126789999999999999976 599999999999999999999999975 34455 Q ss_pred ccceEEEEeccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeecc-chhHHH Q lcl|Aclame:pro 228 INGFVLSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFY-EKKEKT 306 (402) Q Consensus 228 ~~G~V~~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~-d~~~~~ 306 (402) .+|.|++++||.+++|||||..+. +.... | ..+..+...+. ++++|++|+++++.+++.++.++ +.+.+. T Consensus 224 ~~G~ig~~lGf~~i~s~~lp~~~~--t~~~~-----~-~~~~~~~~~~~-~~a~~k~Av~~a~~~dv~~~i~~~~~~~~a 294 (322) T protein:vir:10 224 SKGIITNWMGYTWIVSTRLDKFDP--TQWGM-----A-AEDGPQGDEIW-CIAMTDMALGYHSCKDIWTKVAEDPSASFA 294 (322) T ss_pred hcCeeeeeeeEEEEEeccCCcccc--ccccc-----c-ccCCCCcccee-EEEEecCceeEEEeeeeeEEeeccCCcchh Confidence 789999999999999999996442 22211 1 22233332332 56899999999999999999665 666779 Q ss_pred HHHHHHHHhcCcccccceEEEEEEeeccCc Q lcl|Aclame:pro 307 YYIDTFMAEGAIPDRWEAVSVVTTKRDATT 336 (402) Q Consensus 307 d~i~~~~a~Ga~vlRPeaa~vv~~~~~~t~ 336 (402) |.|.++++||+++++|+.+++|+. 
++.- T Consensus 295 ~~I~~~~~~Ga~ri~~~gVv~i~~--~e~~ 322 (322) T protein:vir:10 295 WRIYSAFTADCVRVEDEHIFKLRL--KNSL 322 (322) T ss_pred hhhhhhhhhCceEeccCcEEEEEE--eccC Confidence 999999999999999997665554 4433 No 26 >protein:vir:80930 Length: 278 # NCBI annotation: Cps # Family: family:all:522 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468392;genbank:gi:157324966;genbank:GeneID:5601363 Probab=100.00 E-value=1.8e-43 Score=254.97 Aligned_cols=271 Identities=8% Similarity=0.009 Sum_probs=223.4 Q ss_pred CCCCcccccccccccccHHHHHH-HHHhHHHHHHHHHHhhhcccce-eeec--cccceEEeeeccce-eeeeecCCCCCC Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDSLLI-EKFNGKVNEQYLKGENILSYFD-VQTV--TGTNTVSNKYLGET-ELQVLAPGQSPN 75 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~alfl-e~f~geV~t~f~~~sv~~~~~~-~rti--~~Gksv~f~~iG~~-t~~~~~~G~~i~ 75 (402) |++. ++..-++|+ |+|+..|...|.+..++.++.. .+++ +.|++++||+++.+ .++++..|+.|+ T Consensus 1 Ma~~----------~T~~~~~iiPev~s~~v~~~~~~~~v~~~~~~~~~~l~g~~G~tv~ip~~~~~g~a~~~~~g~~i~ 70 (278) T protein:vir:80 1 MADL----------TTKLANLIDPEVMGPMISAKLPKAIKFGKIAPIDNSLEGQPGSEITVPKYKYIGDAQDVAEGAAID 70 (278) T ss_pred CCCc----------ceehhheecHHHHHHHHHHHHHHhhhhcccceecccccCCCCCEEEEeeeccCCcceeecCCCcCc Confidence 6653 234456788 9999999999999999998874 3344 45999999998765 367899999999 Q ss_pred CCCccccceeEeecceeeccchhhhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccccccccc Q lcl|Aclame:pro 76 ATPTQADKNQLVIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKGH 155 (402) Q Consensus 76 ~~~~~~~e~~itID~~lya~~~IddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~g~ 155 (402) .+.+.+++.+++|++.. ..+.++|+|..++..| +..++.++++++|++.+|..++..+..+... 
T Consensus 71 ~~~lt~~~~~~~i~~~~-~a~~v~D~~~~~~~~d-~~~~~~~~~a~~~a~~~d~~l~~~l~~a~~~-------------- 134 (278) T protein:vir:80 71 YSALETESVKHGIKKAG-KGVKLTDESVLSGYGD-PVEEAQKQIRMAIASKVDNDILEEALTTTLE-------------- 134 (278) T ss_pred ccccccceeeEeeehhh-ccccccHHHHhhcccc-HHHHHHHHHHHHHHHHHHHHHHHHHhccccc-------------- Confidence 99999999999999965 4799999999999998 7889999999999999999998777532210 Q ss_pred ccccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhccc--chhhcccccccCcccccceEE Q lcl|Aclame:pro 156 GFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDAD--RIVDKTYTISQSGATINGFVL 233 (402) Q Consensus 156 ~~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~--r~~n~d~~~~~~g~~~~G~V~ 233 (402) + .++.........|+.|.++..+|+++++|. .|+++|+|++|+.|+++. +|+.. +..+++...+|.|+ T Consensus 135 -----~--~~~~t~~~~~~~~~~~~da~~~l~~~~~~~-~~~ivv~p~~~~~L~k~~~~~~~~~--~~~g~~~~~~G~ig 204 (278) T protein:vir:80 135 -----V--KGAINIGLIDKIENTFTDAPDAIEDESITT-TGVLFLNYKDTAKLREEAAGSWTKA--SQLGDDLLVKGAFG 204 (278) T ss_pred -----c--ccccccchhhhHHHHHHHHHHhhcccCCCc-ccEEEECHHHHHHHHhhhhhhcccc--ccccccceeeccce Confidence 0 011122234456899999999999999995 677999999999999875 66643 33456677899999 Q ss_pred EEeccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccchhHHHHHHHHHH Q lcl|Aclame:pro 234 SSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKEKTYYIDTFM 313 (402) Q Consensus 234 ~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~~~~d~i~~~~ 313 (402) +++||+||+||++|.+ .+++||+.|+++.+.+++..|.+|+++++.|.|.+++ T Consensus 205 ~~~G~~Vi~s~~~p~~---------------------------t~~l~~~gAi~~~~~~~~~vE~~Rd~~~~~d~i~~~~ 257 (278) T protein:vir:80 205 ELLGWEIVRTKKLADG---------------------------NALAVKAGALKTFLKRNLLAESGRDMDHKLTKFNADQ 257 (278) T ss_pred eecceeEEEcCCCCcc---------------------------eEEEEeccceeeeecCCcccccccchhhccceeeeee Confidence 9999999999999842 1468899999999999999999999999999999999 Q ss_pred HhcCcccccceEEEEEEeecc Q lcl|Aclame:pro 314 AEGAIPDRWEAVSVVTTKRDA 334 (402) Q 
Consensus 314 a~Ga~vlRPeaa~vv~~~~~~ 334 (402) .||++++||+++++|+...+. T Consensus 258 ~yg~~v~~~~~~v~it~~a~~ 278 (278) T protein:vir:80 258 HYAVALVDETKAVKVVPVAGN 278 (278) T ss_pred EEEEEEEcCcceEEEeeccCC Confidence 999999999999988766665 No 27 >protein:vir:107120 Length: 329 # NCBI annotation: conserved phage protein # Family: family:all:701 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950606;genbank:gi:119953686;genbank:GeneID:4643129 Probab=100.00 E-value=1.4e-40 Score=239.11 Aligned_cols=305 Identities=10% Similarity=0.023 Sum_probs=223.9 Q ss_pred CCCC--c-----ccccccc-cccccHHHHHH-HHHhHHHHHHHHHHhhhcccceeee--ccccceEEeeeccceeeeeec Q lcl|Aclame:pro 1 MSTP--N-----TLTNVAV-SASGEVDSLLI-EKFNGKVNEQYLKGENILSYFDVQT--VTGTNTVSNKYLGETELQVLA 69 (402) Q Consensus 1 Ms~~--n-----~~t~~~~-~~~~d~~alfl-e~f~geV~t~f~~~sv~~~~~~~rt--i~~Gksv~f~~iG~~t~~~~~ 69 (402) |--- | -+..-|. +-+-..+.+-+ |.|++++++.|...+..-....-+. -.+|++|+||+|+...+++|+ T Consensus 12 ~~~~~~~~~~~~~~~~~~~~~~~~~~nt~~l~~k~~~~LD~~~~~~~~s~~~~~N~~~e~~~g~tVkIp~i~~~gl~DY~ 91 (329) T protein:vir:10 12 MNKEIKNATGKLKLNLQHFANKSVEPGDTLLKNKHVGILEKVTAANSYSAPAVISNDAIFMQGRSFTVIKGDVTELKDYK 91 (329) T ss_pred hhhhhhcccceeEEehhhhcCCccCCchhHHHHHHHHHHHHHHHhhceeeeeecccceeeccCcEEEEeeeccccccccc Confidence 3211 1 0112222 23345566555 9999999999998877654422233 358999999999999999999 Q ss_pred CCCCCCCCCccccceeEeecceeeccchhhhHHHhhcCccc-hhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccc Q lcl|Aclame:pro 70 PGQSPNATPTQADKNQLVIDTTVIARNTVAHIHDVQGDIDS-LKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERN 148 (402) Q Consensus 70 ~G~~i~~~~~~~~e~~itID~~lya~~~IddlDe~q~~~D~-vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~ 148 (402) +++....+.+..+..+++||+.+|+.|.||++|..|++.+. +-..+.+.+.+.++.++|.+.+..++..+.. 
T Consensus 92 R~~g~~~g~vt~~~~t~tidqdR~~~F~VD~~D~dEtn~~l~a~~i~~~~~~~~v~pEiDay~~skla~~a~~------- 164 (329) T protein:vir:10 92 RNATNEFDHPQIQETTYFLDQEKYWGRFVDALDRRDTEGNIDINYVVAKQASEVVAPYLDNLRFATLARNKAK------- 164 (329) T ss_pred CCCCccccccccceeEEEeecccceeeecchhhHhhhhhhhhHHHHHHHHHHHHhhhHHHHHHHHHHHhhccc------- Confidence 99999889999999999999999999999999999998762 2355778889999999999999888643311 Q ss_pred cccccccccccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccccCcccc Q lcl|Aclame:pro 149 KPRVKGHGFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATI 228 (402) Q Consensus 149 ~~~~~g~~~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~~~ 228 (402) ..+...+++++|+.|.++..+|||++|| ++||++|+|++|.+|+++++|+... .. ...... T Consensus 165 ----------------~~~~~~t~~nay~~i~~a~~~Lde~~vp-~~Rvl~VtP~~~~~Lk~~~~f~~~~-~~-~~~~~~ 225 (329) T protein:vir:10 165 ----------------HLTVGSGADAQYDAVLDVSVELDEIGAG-ASRILFVTPKFYKGIKKFVIELPQG-DN-RQQVLG 225 (329) T ss_pred ----------------ccccccCHHHHHHHHHHHHHHHHhcCCC-CCcEEEeCHHHHHHHHhhhhhhccc-cc-ccccee Confidence 1122346778999999999999999999 5999999999999999999998442 22 233457 Q ss_pred cceEEEEeccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeecc-chhHHHH Q lcl|Aclame:pro 229 NGFVLSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFY-EKKEKTY 307 (402) Q Consensus 229 ~G~V~~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~-d~~~~~d 307 (402) +|.|+++.||+|+++|+..- .+.-.+++|++|+..+..++ .++.++ .+.+++| T Consensus 226 ~g~Vg~idG~~Ii~vps~~~-------------------------k~in~ii~~~~A~~~~~K~~-~~~~~~p~~~~~a~ 279 (329) T protein:vir:10 226 KGVQGELDGFTIVKVPSKML-------------------------QGVEAMAVIGEVMASPIQAN-EAKLNSNVPGMFGT 279 (329) T ss_pred eeeeeeecCeEEEEecCCcc-------------------------cceeEEEEcCCceeeeeeee-eeeeeCCCCccchh Confidence 99999999999999865321 01224788999999988887 778776 4888999 Q ss_pred HHHHHHHhcCcccccceEEEEEEeeccCccccccchhhHHHhhhcccceEEEeecchhhhhh Q lcl|Aclame:pro 308 
YIDTFMAEGAIPDRWEAVSVVTTKRDATTGDAGGPGDDHATVLARAQRKAVYVKTEGAAAAF 369 (402) Q Consensus 308 ~i~~~~a~Ga~vlRPeaa~vv~~~~~~t~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 369 (402) ++++++.||+.|+||++.+++.....+.+.+.+.. ..|+ .+++++..+.. T Consensus 280 ~v~gr~yyd~~V~~~k~~~I~~~~~~a~~~~~~~~-~~~~-----------~~~~~~~~~~~ 329 (329) T protein:vir:10 280 LAEQMLYTGAFVPEHLQKYIFTIGGKEVETNRDGV-DAHA-----------DETNASADTGA 329 (329) T ss_pred eeeeeeeeeeEEEccccCEEEEecccCcccCCCCC-Cccc-----------cccccccccCC Confidence 99999999999999998887665443333332221 1111 11111111110 No 28 >protein:vir:94800 Length: 319 # NCBI annotation: ORF012 # Family: family:all:701 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240536;genbank:gi:66396203;genbank:GeneID:5133580 Probab=100.00 E-value=1.1e-40 Score=239.55 Aligned_cols=304 Identities=11% Similarity=0.017 Sum_probs=221.5 Q ss_pred CCCC---------cccccccccccccHHHHHH-HHHhHHHHHHHHHHhhhcccceeee--ccccceEEeeeccceeeeee Q lcl|Aclame:pro 1 MSTP---------NTLTNVAVSASGEVDSLLI-EKFNGKVNEQYLKGENILSYFDVQT--VTGTNTVSNKYLGETELQVL 68 (402) Q Consensus 1 Ms~~---------n~~t~~~~~~~~d~~alfl-e~f~geV~t~f~~~sv~~~~~~~rt--i~~Gksv~f~~iG~~t~~~~ 68 (402) |--- -+++...+. +=+.+.+-| |.|++.++..|...+..-.+..-+. -.+|++|+||+|+...+++| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~-~~~~nt~~l~~k~~~~LD~~~~~~~~s~~~~~N~~~e~~gg~tVkIp~i~~~gl~DY 79 (319) T protein:vir:94 1 MNKTIKNATGMLKLNLQHFANK-SVEPGQTLLKNKHVGILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTTELKDY 79 (319) T ss_pred CCcccccccceeEeehhhhhcc-CCCcchHHHHHHHHHHHHHHHHHhhhhhhcccCcceEeccCcEEEEeeecccccccc Confidence 3221 123333332 334444444 9999999988887776654322233 35899999999999999999 Q ss_pred cCCCCCCCCCccccceeEeecceeeccchhhhHHHhhcCccchh--HHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccc Q lcl|Aclame:pro 69 APGQSPNATPTQADKNQLVIDTTVIARNTVAHIHDVQGDIDSLK--PKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAE 146 (402) Q Consensus 69 ~~G~~i~~~~~~~~e~~itID~~lya~~~IddlDe~q~~~D~vr--se~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~ 146 (402) ++++....+.+..+..+++||+.+|+.|.||++|..|++.+ +. ..+.+.+.+.++.++|...+..++..+.. 
T Consensus 80 ~R~~g~~~g~vt~~~~t~tidqdR~~~F~VD~~D~~Etn~~-l~a~~i~~~~~~~~v~PEiDay~~skla~~a~~----- 153 (319) T protein:vir:94 80 KRNATNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEGN-IDINYVVARQGAEVVAPYLDNLRFATLARNKAK----- 153 (319) T ss_pred cCCCCcccCCcccceeEEEeecccccccccchhhHhhhhch-hhHHHHHHHHHHHHhhhhhhHHHHHHHHhhccc----- Confidence 99999999999999999999999999999999999999876 43 45678888999999999988888653311 Q ss_pred cccccccccccccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccccCcc Q lcl|Aclame:pro 147 RNKPRVKGHGFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGA 226 (402) Q Consensus 147 ~~~~~~~g~~~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~ 226 (402) ..+...+++++|+.|.++..+|||++|| ++||++|+|++|.+|+++++|+. ...... .. T Consensus 154 ------------------~~~~~~t~~n~y~~i~~a~~~Lde~~VP-~~Rvl~Vtp~~~~~L~~~~~f~~-~~~~~~-~~ 212 (319) T protein:vir:94 154 ------------------HLTVGTGSDAQYDAVLDVSVELDEIKAP-ENRVLFVSPTFYKGIKKFVIALP-QGDTRQ-QV 212 (319) T ss_pred ------------------ccccccCHHHHHHHHHHHHHHHHhcCCC-CCcEEEeCHHHHHHHHhhhhhhc-cccccc-cc Confidence 1112346778999999999999999999 79999999999999999999984 433333 34 Q ss_pred cccceEEEEeccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeecc-chhHH Q lcl|Aclame:pro 227 TINGFVLSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFY-EKKEK 305 (402) Q Consensus 227 ~~~G~V~~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~-d~~~~ 305 (402) ..+|.|+++.||+|+++|+-.- .+.-.++.|++|+..+..++ .++.++ .+.++ T Consensus 213 ~~~g~Vg~idG~~Vi~vps~~~-------------------------k~in~i~~h~~A~~~~~k~~-~~~~~~p~~~~~ 266 (319) T protein:vir:94 213 LGKGVQGELDGFVIVKVPTKLL-------------------------QGLQAIAVVGEVLASPIQAD-LAKTNSNIPGMF 266 (319) T ss_pred eeeeeceeecCeEEEEeccccc-------------------------ccceEEEEcCCeeeeeeeee-eeeccCCCcccc Confidence 5799999999999999864210 01225788999999888876 567776 58889 Q ss_pred HHHHHHHHHhcCcccccceEEEEEEeeccCccccccchhhHHHhhhcccceEEE Q lcl|Aclame:pro 306 
TYYIDTFMAEGAIPDRWEAVSVVTTKRDATTGDAGGPGDDHATVLARAQRKAVY 359 (402) Q Consensus 306 ~d~i~~~~a~Ga~vlRPeaa~vv~~~~~~t~~~a~~~~~~~~~~~~~~~~~~~~ 359 (402) +|++++++.||+.|+||+..+++..... .+....+.-..|+.=-|.+.-.... T Consensus 267 a~~v~gr~y~d~~V~~~k~~~Iy~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~ 319 (319) T protein:vir:94 267 GTLAEQLLYTGAFVPEHLQKYIFTIGGT-EVATKRDGVDAHADNVAKPSGSLEM 319 (319) T ss_pred ceeeeeeeeeeeEEeccccceEEEeecC-CcccCCCccccccccccCCcccccC Confidence 9999999999999999998887654322 2222222222333322222211111 No 29 >protein:vir:97331 Length: 319 # NCBI annotation: ORF011 # Family: family:all:701 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240611;genbank:gi:66396278;genbank:GeneID:5133687 Probab=100.00 E-value=1.1e-40 Score=239.55 Aligned_cols=304 Identities=11% Similarity=0.017 Sum_probs=221.5 Q ss_pred CCCC---------cccccccccccccHHHHHH-HHHhHHHHHHHHHHhhhcccceeee--ccccceEEeeeccceeeeee Q lcl|Aclame:pro 1 MSTP---------NTLTNVAVSASGEVDSLLI-EKFNGKVNEQYLKGENILSYFDVQT--VTGTNTVSNKYLGETELQVL 68 (402) Q Consensus 1 Ms~~---------n~~t~~~~~~~~d~~alfl-e~f~geV~t~f~~~sv~~~~~~~rt--i~~Gksv~f~~iG~~t~~~~ 68 (402) |--- -+++...+. +=+.+.+-| |.|++.++..|...+..-.+..-+. -.+|++|+||+|+...+++| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~-~~~~nt~~l~~k~~~~LD~~~~~~~~s~~~~~N~~~e~~gg~tVkIp~i~~~gl~DY 79 (319) T protein:vir:97 1 MNKTIKNATGMLKLNLQHFANK-SVEPGQTLLKNKHVGILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTTELKDY 79 (319) T ss_pred CCcccccccceeEeehhhhhcc-CCCcchHHHHHHHHHHHHHHHHHhhhhhhcccCcceEeccCcEEEEeeecccccccc Confidence 3221 123333332 334444444 9999999988887776654322233 35899999999999999999 Q ss_pred cCCCCCCCCCccccceeEeecceeeccchhhhHHHhhcCccchh--HHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccc Q lcl|Aclame:pro 69 APGQSPNATPTQADKNQLVIDTTVIARNTVAHIHDVQGDIDSLK--PKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAE 146 (402) Q Consensus 69 ~~G~~i~~~~~~~~e~~itID~~lya~~~IddlDe~q~~~D~vr--se~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~ 146 (402) ++++....+.+..+..+++||+.+|+.|.||++|..|++.+ +. ..+.+.+.+.++.++|...+..++..+.. 
T Consensus 80 ~R~~g~~~g~vt~~~~t~tidqdR~~~F~VD~~D~~Etn~~-l~a~~i~~~~~~~~v~PEiDay~~skla~~a~~----- 153 (319) T protein:vir:97 80 KRNATNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEGN-IDINYVVARQGAEVVAPYLDNLRFATLARNKAK----- 153 (319) T ss_pred cCCCCcccCCcccceeEEEeecccccccccchhhHhhhhch-hhHHHHHHHHHHHHhhhhhhHHHHHHHHhhccc----- Confidence 99999999999999999999999999999999999999876 43 45678888999999999988888653311 Q ss_pred cccccccccccccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccccCcc Q lcl|Aclame:pro 147 RNKPRVKGHGFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGA 226 (402) Q Consensus 147 ~~~~~~~g~~~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~ 226 (402) ..+...+++++|+.|.++..+|||++|| ++||++|+|++|.+|+++++|+. ...... .. T Consensus 154 ------------------~~~~~~t~~n~y~~i~~a~~~Lde~~VP-~~Rvl~Vtp~~~~~L~~~~~f~~-~~~~~~-~~ 212 (319) T protein:vir:97 154 ------------------HLTVGTGSDAQYDAVLDVSVELDEIKAP-ENRVLFVSPTFYKGIKKFVIALP-QGDTRQ-QV 212 (319) T ss_pred ------------------ccccccCHHHHHHHHHHHHHHHHhcCCC-CCcEEEeCHHHHHHHHhhhhhhc-cccccc-cc Confidence 1112346778999999999999999999 79999999999999999999984 433333 34 Q ss_pred cccceEEEEeccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeecc-chhHH Q lcl|Aclame:pro 227 TINGFVLSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFY-EKKEK 305 (402) Q Consensus 227 ~~~G~V~~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~-d~~~~ 305 (402) ..+|.|+++.||+|+++|+-.- .+.-.++.|++|+..+..++ .++.++ .+.++ T Consensus 213 ~~~g~Vg~idG~~Vi~vps~~~-------------------------k~in~i~~h~~A~~~~~k~~-~~~~~~p~~~~~ 266 (319) T protein:vir:97 213 LGKGVQGELDGFVIVKVPTKLL-------------------------QGLQAIAVVGEVLASPIQAD-LAKTNSNIPGMF 266 (319) T ss_pred eeeeeceeecCeEEEEeccccc-------------------------ccceEEEEcCCeeeeeeeee-eeeccCCCcccc Confidence 5799999999999999864210 01225788999999888876 567776 58889 Q ss_pred HHHHHHHHHhcCcccccceEEEEEEeeccCccccccchhhHHHhhhcccceEEE Q lcl|Aclame:pro 306 
TYYIDTFMAEGAIPDRWEAVSVVTTKRDATTGDAGGPGDDHATVLARAQRKAVY 359 (402) Q Consensus 306 ~d~i~~~~a~Ga~vlRPeaa~vv~~~~~~t~~~a~~~~~~~~~~~~~~~~~~~~ 359 (402) +|++++++.||+.|+||+..+++..... .+....+.-..|+.=-|.+.-.... T Consensus 267 a~~v~gr~y~d~~V~~~k~~~Iy~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~ 319 (319) T protein:vir:97 267 GTLAEQLLYTGAFVPEHLQKYIFTIGGT-EVATKRDGVDAHADNVAKPSGSLEM 319 (319) T ss_pred ceeeeeeeeeeeEEeccccceEEEeecC-CcccCCCccccccccccCCcccccC Confidence 9999999999999999998887654322 2222222222333322222211111 No 30 >protein:vir:96123 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240078;genbank:gi:66395742;genbank:GeneID:5133103 Probab=100.00 E-value=3.9e-40 Score=236.61 Aligned_cols=267 Identities=11% Similarity=0.079 Sum_probs=218.4 Q ss_pred CCCCcccccccccccccHHHHHH-HHHhHHHHHHHHHHhhhcccceeee-c--cccceEEeeeccce-eeeeecCCCCCC Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDSLLI-EKFNGKVNEQYLKGENILSYFDVQT-V--TGTNTVSNKYLGET-ELQVLAPGQSPN 75 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~alfl-e~f~geV~t~f~~~sv~~~~~~~rt-i--~~Gksv~f~~iG~~-t~~~~~~G~~i~ 75 (402) |++.+ +..-++++ |+|+..|...|.+..++.++....+ + ++|++++||+.+.+ .++.+..|+.|+ T Consensus 1 ma~~~----------T~~~d~i~Pev~s~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~ip~~~~~g~~~~~~~g~~i~ 70 (274) T protein:vir:96 1 MAQGT----------TKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIP 70 (274) T ss_pred CCccc----------cchhhhhhhHHHHHHHHHHHHhhhhhcccccccccccCCCCCEEEEEeeccCCCccccCCCCcCc Confidence 76655 23346777 9999999999999999999886543 3 35999999998753 677899999999 Q ss_pred CCCccccceeEeecceeeccchhhhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccccccccc Q lcl|Aclame:pro 76 ATPTQADKNQLVIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKGH 155 (402) Q Consensus 76 ~~~~~~~e~~itID~~lya~~~IddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~g~ 155 (402) .+.+..++..++|++ .+..+.++|++..++..| ...+++++++++||+.+|..++..+..+... 
T Consensus 71 ~~~it~~~~~~~i~~-~~~~~~i~D~~~~~~~~d-~~~~~~~~~~~~~a~~~d~~i~~~l~~a~~~-------------- 134 (274) T protein:vir:96 71 VDQIGTSKREAKVRK-IGKGTELTDEAVLSGFGD-PQGEAVRQHGLAIANKVDNDVLEALKGATLT-------------- 134 (274) T ss_pred hhhcccceeEEEEEe-eeceeeecHHHHHhhcch-HHHHHHHHHHHHHHHHHHHHHHHHHhcCCCC-------------- Confidence 999999999999988 578899999999999998 7889999999999999999998766432110 Q ss_pred ccccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhccc--chhhcccccccCcccccceEE Q lcl|Aclame:pro 156 GFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDAD--RIVDKTYTISQSGATINGFVL 233 (402) Q Consensus 156 ~~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~--r~~n~d~~~~~~g~~~~G~V~ 233 (402) ...... -|+.|++|..+|+++++ .+||++|+|++|+.|+++. +|+.. +..+++..++|.|+ T Consensus 135 ---------~~~~~~----~~d~i~dA~~~l~d~~~--~~~~ivv~p~~~~~L~k~~~~~f~~~--~~~g~~~~~~g~ig 197 (274) T protein:vir:96 135 ---------VEADIT----KLDGLQTAIDKFNDEDL--EPMVLFVNPLDAGGLRTSASDNFTRP--TQLGDNIIVKGAFG 197 (274) T ss_pred ---------cCcccc----cHHHHHHHHHHhcccCC--CceEEEeCHHHHHHHHhccccccccc--ccccccceeecccc Confidence 000111 27889999999999876 6899999999999999974 67643 23456677899999 Q ss_pred EEeccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccchhHHHHHHHHHH Q lcl|Aclame:pro 234 SSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKEKTYYIDTFM 313 (402) Q Consensus 234 ~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~~~~d~i~~~~ 313 (402) +++||+|++||++|.. 
.+++||+.|+++++.+++.+|.+|+++++.|.|.+++ T Consensus 198 ~~~G~~Vi~s~~~p~~---------------------------t~~l~~~gA~~~~~~~~~~vE~~Rd~~~~~d~i~~~~ 250 (274) T protein:vir:96 198 EALGAVIVRSNKLNKG---------------------------EALLAKKGAVKLITKRDFFLEKDRDASRKSTALYSDK 250 (274) T ss_pred eecCeeEEEcCCCCcc---------------------------eEEEEeCcceeeeecCCcccccccchhhcccEEEEee Confidence 9999999999999842 1478899999999999999999999999999999999 Q ss_pred HhcCcccccceEEEEEEeeccCcccc Q lcl|Aclame:pro 314 AEGAIPDRWEAVSVVTTKRDATTGDA 339 (402) Q Consensus 314 a~Ga~vlRPeaa~vv~~~~~~t~~~a 339 (402) .||++++||+++++|+..... .++ T Consensus 251 ~yg~~~~~~~~vv~~t~~~~~--~~~ 274 (274) T protein:vir:96 251 HYVAYLYDESKVVKITKGAGD--EVM 274 (274) T ss_pred EEEEEEEcCccEEEEEcCccc--ccC Confidence 999999999998887543211 122 No 31 >protein:vir:3613 Length: 272 # NCBI annotation: MHP # Family: family:all:522 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112699;genbank:gi:13786567;genbank:GeneID:921035 Probab=100.00 E-value=5.6e-40 Score=235.79 Aligned_cols=267 Identities=14% Similarity=0.043 Sum_probs=219.8 Q ss_pred CCCCcccccccccccccHHHHHH-HHHhHHHHHHHHHHhhhcccceee-ecc--ccceEEeeeccce-eeeeecCCCCCC Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDSLLI-EKFNGKVNEQYLKGENILSYFDVQ-TVT--GTNTVSNKYLGET-ELQVLAPGQSPN 75 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~alfl-e~f~geV~t~f~~~sv~~~~~~~r-ti~--~Gksv~f~~iG~~-t~~~~~~G~~i~ 75 (402) |++-. +..-++++ |+|+..|.+.|.+..++.++..+- ++. 
.|++++||+.+.+ ...++..|.+|+ T Consensus 1 ma~~~----------T~~~d~iiPev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~ti~iP~~~~~gda~~~~eg~~i~ 70 (272) T protein:vir:36 1 MSKQK----------TTLADLVNPEVLAPIVSYELNKALRFAPLAQVDTTLQGQPGNTLKFPAFTYIGDAADVAEGGEIS 70 (272) T ss_pred CCCcc----------eehhhhhchHHHHHHHHHHHHhhhhhccccccccccccCCCCEEEEeeeccCccccccCCCCccC Confidence 55432 33456666 999999999999999999988654 354 4999999997665 346788999999 Q ss_pred CCCccccceeEeecceeeccchhhhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccccccccc Q lcl|Aclame:pro 76 ATPTQADKNQLVIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKGH 155 (402) Q Consensus 76 ~~~~~~~e~~itID~~lya~~~IddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~g~ 155 (402) .+.+..++.+++|++. ...+.++|+|..++..| +.+++.++++++||+.+|+.++..+..+.. T Consensus 71 ~~~lt~~~~~~~i~~~-~k~~~vtD~~~~~~~~d-~~~~~~~~~a~~~a~~~d~~i~~~l~~~~~--------------- 133 (272) T protein:vir:36 71 LDKIGTTTKSVTIKKA-AKGTEITDEAALSGYGD-PIGESNKQLGLSLANKVDDDLLSAAKTTSQ--------------- 133 (272) T ss_pred hhhcCCcceeEeeehh-hccccccHHHHhhccch-HHHHHHHHHHHHHHHHHHHHHHHHhccccc--------------- Confidence 9999999999999885 56899999999999998 788999999999999999988766532110 Q ss_pred ccccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccccCcccccceEEEE Q lcl|Aclame:pro 156 GFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATINGFVLSS 235 (402) Q Consensus 156 ~~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~~~~G~V~~i 235 (402) ..+...-+|.|.+|..+|.++++| .||++|+|++|+.|+++.+|.+.. 
...+++.+.+|.|+++ T Consensus 134 -------------~~~~~~~~d~i~~A~~~lgd~~~~--~~~ivv~p~~~~~L~k~~~~~~~~-~~~~~~~~~~G~ig~~ 197 (272) T protein:vir:36 134 -------------TVSTKANVDGVQAALDIFNDEDAQ--AYVLIVNPKDAAKIRKDANAKNIG-SEVGANALINGTYADV 197 (272) T ss_pred -------------cccccccHHHHHHHHHHhhhcCCC--ceEEEEcHHHHHHHhccccccccc-ccccccceeeecccee Confidence 001112367899999999999886 689999999999999999988663 2334556789999999 Q ss_pred eccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccchhHHHHHHHHHHHh Q lcl|Aclame:pro 236 YNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKEKTYYIDTFMAE 315 (402) Q Consensus 236 aG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~~~~d~i~~~~a~ 315 (402) +|++|++||++|.... ....++|++.|+++...+++.+|..|++.+|.|.|.+++.| T Consensus 198 ~G~~Vv~s~~~p~~~~-----------------------~~~~~~~~~gA~~~~~~~~~~vE~~R~~~~~~d~i~~~~~y 254 (272) T protein:vir:36 198 LGAQIVRSKKLAEGSA-----------------------LMFKIVSNSPALKLVLKRGVQVETDRDIVTKTTVITADEHY 254 (272) T ss_pred cCeeEEEeCCCCCCce-----------------------eEEEEEecccceeeeecCCcccccccchhhcCcEEEEEEEE Confidence 9999999999995321 01246788999999999999999999999999999999999 Q ss_pred cCcccccceEEEEEEeec Q lcl|Aclame:pro 316 GAIPDRWEAVSVVTTKRD 333 (402) Q Consensus 316 Ga~vlRPeaa~vv~~~~~ 333 (402) |++++||++++.+++++= T Consensus 255 ~~~v~~~~~vv~~t~~g~ 272 (272) T protein:vir:36 255 AAYLYDLTKVVNITFTGV 272 (272) T ss_pred EEEEEcCccEEEEeecCC Confidence 999999999999887765 No 32 >protein:vir:99075 Length: 392 # NCBI annotation: gp30 # Family: family:all:10837 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655895;genbank:gi:109521467;genbank:GeneID:4158040 Probab=100.00 E-value=5.4e-39 Score=230.37 Aligned_cols=346 Identities=8% Similarity=-0.047 Sum_probs=198.4 Q ss_pred CCCCcccccccccccccHHHHHH-HHHhHHHHHHHHHHhhhcccceee---ecc--ccceEEeeeccceeeeeec----- Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDSLLI-EKFNGKVNEQYLKGENILSYFDVQ---TVT--GTNTVSNKYLGETELQVLA----- 69 (402) Q Consensus 1 
Ms~~n~~t~~~~~~~~d~~alfl-e~f~geV~t~f~~~sv~~~~~~~r---ti~--~Gksv~f~~iG~~t~~~~~----- 69 (402) |++ .+|+ |+|+.|++..|++..+|..+++.. .++ .|++|+|++.+..++.+++ T Consensus 1 Ma~----------------~~~~p~~~a~~~l~~l~~~lv~~~lv~~~~~~~~~~~~GdtV~i~~~~~~~~~~~~~~~~~ 64 (392) T protein:vir:99 1 MAN----------------AFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAG 64 (392) T ss_pred Ccc----------------ccccHHHHHHHHHHHHHhhccchhhhccccccccccCCCCeEEEeecccccceeeeccccc Confidence 653 4577 899999999999999999988532 454 4999999999999998875 Q ss_pred CCCCCCCCCccccceeEeecceeeccchhhhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccc Q lcl|Aclame:pro 70 PGQSPNATPTQADKNQLVIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNK 149 (402) Q Consensus 70 ~G~~i~~~~~~~~e~~itID~~lya~~~IddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~ 149 (402) +|.++..+.+...+..++||+.+|+.+.|+|.|+.|...| ++.++.++++++||+.+|+.++..+..+.... T Consensus 65 ~~~~~~~~~~~~~~~~~~id~~k~~~~~i~d~e~~~~~~~-~~~~~~~~a~~ala~~vd~~i~~~~~~a~~~~------- 136 (392) T protein:vir:99 65 AERNLTVSDFTEDSFPVTLTDVAYHLGVLTDEELTFDLES-FATQILPRQVRGVADILEEGVRDMIVGAPYEA------- 136 (392) T ss_pred cCCcccccccccceEEEEEeeeeecceeechHHHhhhhhh-hHHHHHHHHHHHHHHHHHHHHHHHHhcccccc------- Confidence 3667888899999999999999999999999999999999 79999999999999999999987665322110 Q ss_pred ccccccccccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccccC-cccc Q lcl|Aclame:pro 150 PRVKGHGFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQS-GATI 228 (402) Q Consensus 150 ~~~~g~~~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~-g~~~ 228 (402) .......++...|+.|.+++++|||++||. |||+||+|++|+.|+++++|++.++.+... 
..++ T Consensus 137 --------------~~~~~~~~~~~~~~~i~~a~~~L~~~~vP~-~R~~vv~p~~~~~l~~~~~~~~~~~~g~~~~~~l~ 201 (392) T protein:vir:99 137 --------------AGAVHEVAPDEFFKGVNGARRALNELYIPQ-GRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQ 201 (392) T ss_pred --------------cccccccChhhhHHHHHHHHHHHhhcCCCC-CCEEEEcHHHHHHHhcccceeecccccchhhhhhh Confidence 012233567788999999999999999996 899999999999999999999887754332 3477 Q ss_pred cceEEEEeccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhccccee--eccchhHHH Q lcl|Aclame:pro 229 NGFVLSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGD--IFYEKKEKT 306 (402) Q Consensus 229 ~G~V~~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e--~~~d~~~~~ 306 (402) +|.|++++||+||+||++|.... ...|.......-.......+.... .... +.. .+... ..++..... T Consensus 202 ~G~vg~i~G~~v~~s~~~~~~t~-~a~~~~a~~~at~a~v~~~~~~~~--~s~s----~~~---~v~~~~~~~~~~t~~s 271 (392) T protein:vir:99 202 EARLGRIYGYEIVESTLIPHGDA-YLYHPTAFIMATRAPAPPMGAVRS--TAIS----GDQ---RIAMRWLVDYDSTITS 271 (392) T ss_pred cceeeeeeeeEEEeecccccccc-eeeeccccccccccccccccccce--eEEe----ccc---ceecceeecccceeec Confidence 99999999999999999997532 222211000000000000000000 0000 000 00000 011111222 Q ss_pred HHHHHHHHhcCcccccceEEEEEEeeccCccccccchhhHHHhhhcccceEEEeecchhhhhhhhcccccchhHHHHH-- Q lcl|Aclame:pro 307 YYIDTFMAEGAIPDRWEAVSVVTTKRDATTGDAGGPGDDHATVLARAQRKAVYVKTEGAAAAFSAAPAGIQAEDLVAA-- 384 (402) Q Consensus 307 d~i~~~~a~Ga~vlRPeaa~vv~~~~~~t~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-- 384 (402) +...-.-..|.+.+.-.+...+......+....+-....+.. +.-...+..+....+..++.|+... +.-.. 
T Consensus 272 ~~~~v~~~~g~~~v~~~~~~~~~~~~~~~~~~~~v~v~~v~~----~~~~~~~~~~~~~~~~~t~~~~~~~--~~~~~vt 345 (392) T protein:vir:99 272 NRSLIDTYFGLKVVEDPNGVGFVRARKIHLIPGSIEVAPEAG----ANATITAAAGEDHTVQLKVTDANGD--DVTALCD 345 (392) T ss_pred cccccceeEEEEEEeeccccceeeeeeeeeecceeeeeeeec----ccceeEeeeccceeEEEEEEecCCc--cccceEE Confidence 211111122222221111111111111100000000000000 1122233333333333344333211 00000 Q ss_pred -------HHHHHhhcccccccCCCC Q lcl|Aclame:pro 385 -------VRAVMANDIKPTAMKPTE 402 (402) Q Consensus 385 -------~~~~~~~~~~~~~~~~~~ 402 (402) |-.|=+ +=+=|+..+-+ T Consensus 346 w~Ssn~~vAtV~~-~G~Vt~v~~G~ 369 (392) T protein:vir:99 346 FESSATDKATVAA-GGLVTGVAAGT 369 (392) T ss_pred EEEcCCeeEEEcC-CceEEEEecce Confidence 000000 00011111111 No 33 >protein:vir:93742 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240459;genbank:gi:66396126;genbank:GeneID:5133511 Probab=100.00 E-value=3.7e-39 Score=231.26 Aligned_cols=267 Identities=12% Similarity=0.087 Sum_probs=219.0 Q ss_pred CCCCcccccccccccccHHHHHH-HHHhHHHHHHHHHHhhhcccceee-ecc--ccceEEeeeccce-eeeeecCCCCCC Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDSLLI-EKFNGKVNEQYLKGENILSYFDVQ-TVT--GTNTVSNKYLGET-ELQVLAPGQSPN 75 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~alfl-e~f~geV~t~f~~~sv~~~~~~~r-ti~--~Gksv~f~~iG~~-t~~~~~~G~~i~ 75 (402) |++.. +..-++++ |+|+..|.+.+.+..++.++...- ++. 
+|++++||+++.+ .++.+..|+.|+ T Consensus 1 ma~~~----------T~~~~~iiPev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~ip~~~~~g~~~~~~eg~~i~ 70 (274) T protein:vir:93 1 MPQGI----------TKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIP 70 (274) T ss_pred CCccc----------eehhheechHHHHHHHHHHHHhhhhhcccccccccccCCCCCEEEEEeeccCCCcccccCCCccc Confidence 66533 33345677 999999999999999999998653 443 5999999997653 677899999999 Q ss_pred CCCccccceeEeecceeeccchhhhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccccccccc Q lcl|Aclame:pro 76 ATPTQADKNQLVIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKGH 155 (402) Q Consensus 76 ~~~~~~~e~~itID~~lya~~~IddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~g~ 155 (402) .+.+..++.+++|++ .+..+.++|++..++..| ...++.++++++|++++|+.++..+..+... T Consensus 71 ~~~it~~~~~~~i~~-~~~~~~i~D~~~~~~~~d-~~~~~~~~~~~~~a~~~d~~~~~~~~~a~~~-------------- 134 (274) T protein:vir:93 71 TDILETKKREAKIRK-IAKGTSITDEALLSGYGD-PQGEQVRQHGLAHANKVDNDVLEALMGAKLT-------------- 134 (274) T ss_pred ccccccceeEEEeee-ecccccccHHHHHhhccc-hHHHHHHHHHHHHHHHHHHHHHHHHhccccc-------------- Confidence 999999999999988 568899999999999998 6889999999999999999998776442211 Q ss_pred ccccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhccc--chhhcccccccCcccccceEE Q lcl|Aclame:pro 156 GFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDAD--RIVDKTYTISQSGATINGFVL 233 (402) Q Consensus 156 ~~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~--r~~n~d~~~~~~g~~~~G~V~ 233 (402) + ..... -++.|++|..+|+++++ .+||++|+|++|+.|+++. +|+.. 
+..+++...+|.|+ T Consensus 135 -----~----~~~~~----~~d~i~dA~~~l~d~~~--~~~~ivv~p~~~~~L~k~~~~~f~~~--s~~g~~~~~~G~ig 197 (274) T protein:vir:93 135 -----V----NADIT----KLNGLQSAIDKFNDEDL--EPMVLFINPLDAGKLRGDASTNFTRA--TELGDDIIVKGAFG 197 (274) T ss_pred -----c----ccccc----CHHHHHHHHHHhhhccC--CccEEEeCHHHHHHHHhhhhhccccc--ccccccceeecccc Confidence 0 00111 26789999999998875 6899999999999999986 56643 34456667899999 Q ss_pred EEeccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccchhHHHHHHHHHH Q lcl|Aclame:pro 234 SSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKEKTYYIDTFM 313 (402) Q Consensus 234 ~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~~~~d~i~~~~ 313 (402) +++||+|++||++|.. .+++||+.|+++++.+++..|..|+++++.|.|.+++ T Consensus 198 ~~~G~~Vi~s~~~p~~---------------------------t~~l~~~gai~~~~~~~~~vE~~Rd~~~~~d~i~~~~ 250 (274) T protein:vir:93 198 EALGAIIVRTNKLEAG---------------------------TAILAKKGAVKLILKRDFFLEVARDASTKTTALYSDK 250 (274) T ss_pred eecCeeEEEcCCCCcc---------------------------eEEEEeCCeEEEEecCCcccccccchhhcccEEEEEE Confidence 9999999999999831 1468899999999999999999999999999999999 Q ss_pred HhcCcccccceEEEEEEeeccCcc Q lcl|Aclame:pro 314 AEGAIPDRWEAVSVVTTKRDATTG 337 (402) Q Consensus 314 a~Ga~vlRPeaa~vv~~~~~~t~~ 337 (402) .||++++||+.+++++...+-++- T Consensus 251 ~y~~~~~~~~~~v~~t~~~~s~~~ 274 (274) T protein:vir:93 251 HYVAYLYDESKAVKITKGSGSLEM 274 (274) T ss_pred EEEEEEEcCCceEEEeeCccccCC Confidence 999999999999888765444333 No 34 >protein:vir:94494 Length: 274 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240676;genbank:gi:66396348;genbank:GeneID:5133758 Probab=100.00 E-value=1.9e-38 Score=227.36 Aligned_cols=267 Identities=12% Similarity=0.085 Sum_probs=218.7 Q ss_pred CCCCcccccccccccccHHHHHH-HHHhHHHHHHHHHHhhhcccceee-ecc--ccceEEeeeccce-eeeeecCCCCCC Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDSLLI-EKFNGKVNEQYLKGENILSYFDVQ-TVT--GTNTVSNKYLGET-ELQVLAPGQSPN 75 (402) Q 
Consensus 1 Ms~~n~~t~~~~~~~~d~~alfl-e~f~geV~t~f~~~sv~~~~~~~r-ti~--~Gksv~f~~iG~~-t~~~~~~G~~i~ 75 (402) |++.. +-.-++++ |+|+..|...+.+..++.++...- ++. +|++++||+.+.+ .++.+..|+.|+ T Consensus 1 ma~~~----------T~~~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~~g~a~~~~~g~~i~ 70 (274) T protein:vir:94 1 MPQGL----------TKTSDQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIP 70 (274) T ss_pred CCccc----------eehhheechHHHHHHHHHhhhhhhhhcccceecccccCCCCCEEEEeeecCCCccccccCCCccc Confidence 66543 33345677 999999999999999999998764 343 5999999996543 567889999999 Q ss_pred CCCccccceeEeecceeeccchhhhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccccccccc Q lcl|Aclame:pro 76 ATPTQADKNQLVIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKGH 155 (402) Q Consensus 76 ~~~~~~~e~~itID~~lya~~~IddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~g~ 155 (402) .+.+...+.+++|++ ....+.++|++..++..| ...++.++++++||+++|+.++..+..+.... T Consensus 71 ~~~lt~~~~~~~i~~-~~~~~~i~D~~~~~~~~d-p~~~~~~~~a~a~a~~vd~~~~~~l~~a~~~~------------- 135 (274) T protein:vir:94 71 TDILETKKREAKIRK-IAKGTSITDEALLSGYGD-PQGEQVRQHGLAHANKVDNDVLEALMGAKLTV------------- 135 (274) T ss_pred ccccccceeEEEeee-ecceecccHHHHHhccch-HHHHHHHHHHHHHHHHHHHHHHHHHhccCccc------------- Confidence 999999999999988 467899999999999998 67889999999999999999987775432110 Q ss_pred ccccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhccc--chhhcccccccCcccccceEE Q lcl|Aclame:pro 156 GFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDAD--RIVDKTYTISQSGATINGFVL 233 (402) Q Consensus 156 ~~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~--r~~n~d~~~~~~g~~~~G~V~ 233 (402) . ....+ |+.|++|..+|++++. .+||++|+|++|+.|+++. +|++. 
+..+++...+|.|+ T Consensus 136 ------~----~~~~~----~d~i~dA~~~l~d~~~--~~~~ivv~p~~~~~L~k~~~~~f~~~--s~~g~~~~~~G~ig 197 (274) T protein:vir:94 136 ------N----ADITK----LNGLQSAIDKFNDEDL--EPMVLFVNPLDAGKLRGDASTNFTRA--TELGDDIIVKGAFG 197 (274) T ss_pred ------c----ccccC----HHHHHHHHHHhhccCC--CceEEEeCHHHHHHHHhhhhhhcccc--Ccccccceeccccc Confidence 0 01111 6789999999998765 6799999999999999985 67754 33455667899999 Q ss_pred EEeccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccchhHHHHHHHHHH Q lcl|Aclame:pro 234 SSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKEKTYYIDTFM 313 (402) Q Consensus 234 ~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~~~~d~i~~~~ 313 (402) +++||+||+||++|.. .+++|++.|++.++.+++..|.+|+++++.|.|.+++ T Consensus 198 ~~~G~~Vi~s~~~p~~---------------------------t~~l~~~gA~~~~~~~~~~vE~~Rd~~~~~d~i~~~~ 250 (274) T protein:vir:94 198 EALGAIIVRTNKLEAG---------------------------TAILAKKGAVKLILKRDFFLEVARDASTKTTALYSDK 250 (274) T ss_pred eecCeeEEEcCCCCcc---------------------------eEEEEeCcceEeeecCCceeccccchhhcccEEEEEE Confidence 9999999999999831 1468899999999999999999999999999999999 Q ss_pred HhcCcccccceEEEEEEeeccCcc Q lcl|Aclame:pro 314 AEGAIPDRWEAVSVVTTKRDATTG 337 (402) Q Consensus 314 a~Ga~vlRPeaa~vv~~~~~~t~~ 337 (402) .||++++||+.+++++....-++- T Consensus 251 ~y~~~~~~~~~vv~~t~~~~~~~~ 274 (274) T protein:vir:94 251 HYVAYLYDESKAVKITKGSGSLEM 274 (274) T ss_pred EEEEEEEcCCceEEEecCcccccC Confidence 999999999998888765443333 No 35 >protein:vir:97433 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240749;genbank:gi:66396420;genbank:GeneID:5133789 Probab=100.00 E-value=1.9e-38 Score=227.36 Aligned_cols=267 Identities=12% Similarity=0.085 Sum_probs=218.7 Q ss_pred CCCCcccccccccccccHHHHHH-HHHhHHHHHHHHHHhhhcccceee-ecc--ccceEEeeeccce-eeeeecCCCCCC Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDSLLI-EKFNGKVNEQYLKGENILSYFDVQ-TVT--GTNTVSNKYLGET-ELQVLAPGQSPN 75 (402) Q 
Consensus 1 Ms~~n~~t~~~~~~~~d~~alfl-e~f~geV~t~f~~~sv~~~~~~~r-ti~--~Gksv~f~~iG~~-t~~~~~~G~~i~ 75 (402) |++.. +-.-++++ |+|+..|...+.+..++.++...- ++. +|++++||+.+.+ .++.+..|+.|+ T Consensus 1 ma~~~----------T~~~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~~g~a~~~~~g~~i~ 70 (274) T protein:vir:97 1 MPQGL----------TKTSDQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIP 70 (274) T ss_pred CCccc----------eehhheechHHHHHHHHHhhhhhhhhcccceecccccCCCCCEEEEeeecCCCccccccCCCccc Confidence 66543 33345677 999999999999999999998764 343 5999999996543 567889999999 Q ss_pred CCCccccceeEeecceeeccchhhhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccccccccc Q lcl|Aclame:pro 76 ATPTQADKNQLVIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKGH 155 (402) Q Consensus 76 ~~~~~~~e~~itID~~lya~~~IddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~g~ 155 (402) .+.+...+.+++|++ ....+.++|++..++..| ...++.++++++||+++|+.++..+..+.... T Consensus 71 ~~~lt~~~~~~~i~~-~~~~~~i~D~~~~~~~~d-p~~~~~~~~a~a~a~~vd~~~~~~l~~a~~~~------------- 135 (274) T protein:vir:97 71 TDILETKKREAKIRK-IAKGTSITDEALLSGYGD-PQGEQVRQHGLAHANKVDNDVLEALMGAKLTV------------- 135 (274) T ss_pred ccccccceeEEEeee-ecceecccHHHHHhccch-HHHHHHHHHHHHHHHHHHHHHHHHHhccCccc------------- Confidence 999999999999988 467899999999999998 67889999999999999999987775432110 Q ss_pred ccccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhccc--chhhcccccccCcccccceEE Q lcl|Aclame:pro 156 GFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDAD--RIVDKTYTISQSGATINGFVL 233 (402) Q Consensus 156 ~~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~--r~~n~d~~~~~~g~~~~G~V~ 233 (402) . ....+ |+.|++|..+|++++. .+||++|+|++|+.|+++. +|++. 
+..+++...+|.|+ T Consensus 136 ------~----~~~~~----~d~i~dA~~~l~d~~~--~~~~ivv~p~~~~~L~k~~~~~f~~~--s~~g~~~~~~G~ig 197 (274) T protein:vir:97 136 ------N----ADITK----LNGLQSAIDKFNDEDL--EPMVLFVNPLDAGKLRGDASTNFTRA--TELGDDIIVKGAFG 197 (274) T ss_pred ------c----ccccC----HHHHHHHHHHhhccCC--CceEEEeCHHHHHHHHhhhhhhcccc--Ccccccceeccccc Confidence 0 01111 6789999999998765 6799999999999999985 67754 33455667899999 Q ss_pred EEeccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccchhHHHHHHHHHH Q lcl|Aclame:pro 234 SSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKEKTYYIDTFM 313 (402) Q Consensus 234 ~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~~~~d~i~~~~ 313 (402) +++||+||+||++|.. .+++|++.|++.++.+++..|.+|+++++.|.|.+++ T Consensus 198 ~~~G~~Vi~s~~~p~~---------------------------t~~l~~~gA~~~~~~~~~~vE~~Rd~~~~~d~i~~~~ 250 (274) T protein:vir:97 198 EALGAIIVRTNKLEAG---------------------------TAILAKKGAVKLILKRDFFLEVARDASTKTTALYSDK 250 (274) T ss_pred eecCeeEEEcCCCCcc---------------------------eEEEEeCcceEeeecCCceeccccchhhcccEEEEEE Confidence 9999999999999831 1468899999999999999999999999999999999 Q ss_pred HhcCcccccceEEEEEEeeccCcc Q lcl|Aclame:pro 314 AEGAIPDRWEAVSVVTTKRDATTG 337 (402) Q Consensus 314 a~Ga~vlRPeaa~vv~~~~~~t~~ 337 (402) .||++++||+.+++++....-++- T Consensus 251 ~y~~~~~~~~~vv~~t~~~~~~~~ 274 (274) T protein:vir:97 251 HYVAYLYDESKAVKITKGSGSLEM 274 (274) T ss_pred EEEEEEEcCCceEEEecCcccccC Confidence 999999999998888765443333 No 36 >protein:vir:1239 Length: 274 # NCBI annotation: similar to phage B1 major head protein # Family: family:all:522 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510938;genbank:gi:17426272;genbank:GeneID:927376 Probab=100.00 E-value=2.8e-38 Score=226.49 Aligned_cols=267 Identities=12% Similarity=0.080 Sum_probs=217.9 Q ss_pred CCCCcccccccccccccHHHHHH-HHHhHHHHHHHHHHhhhcccceeee-c--cccceEEeeeccce-eeeeecCCCCCC Q lcl|Aclame:pro 1 
MSTPNTLTNVAVSASGEVDSLLI-EKFNGKVNEQYLKGENILSYFDVQT-V--TGTNTVSNKYLGET-ELQVLAPGQSPN 75 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~alfl-e~f~geV~t~f~~~sv~~~~~~~rt-i--~~Gksv~f~~iG~~-t~~~~~~G~~i~ 75 (402) |++.. +-.-++++ |+|+..|...|.+..++.++..+-. + ++|++++||+.+.+ .++.+..|+.|+ T Consensus 1 ma~~~----------T~l~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~ 70 (274) T protein:vir:12 1 MAQGL----------TKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIP 70 (274) T ss_pred CCcce----------eehhhhhchHHHHHHHHHHHHhhhhhcccceecccccCCCCCEEEEeeecCCCccccccCCCccc Confidence 65543 33346677 9999999999999999999987643 4 45999999985543 467889999999 Q ss_pred CCCccccceeEeecceeeccchhhhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccccccccc Q lcl|Aclame:pro 76 ATPTQADKNQLVIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKGH 155 (402) Q Consensus 76 ~~~~~~~e~~itID~~lya~~~IddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~g~ 155 (402) .+.+...+.+++|++ .+..+.++|+|..++..| ...++.++++++||+++|+.++..+..+.... T Consensus 71 ~~~lt~~~~~~~i~~-~~~~~~i~D~~~~~~~~d-~~~~~~~q~~~~~a~~vd~~~l~~~~~a~~~~------------- 135 (274) T protein:vir:12 71 TDILETKKREAKIRK-IAKGTSITDEALLSGYGD-PQGEQVRQHGLAHANKVDNDVLEALMGAKLTV------------- 135 (274) T ss_pred hhhcccceeeEEeee-ecceeeecHHHHHhcccc-hHHHHHHHHHHHHHHHHHHHHHHHHhcccccc------------- Confidence 999999999999998 588999999999999998 67889999999999999999987765432110 Q ss_pred ccccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhccc--chhhcccccccCcccccceEE Q lcl|Aclame:pro 156 GFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDAD--RIVDKTYTISQSGATINGFVL 233 (402) Q Consensus 156 ~~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~--r~~n~d~~~~~~g~~~~G~V~ 233 (402) ..... -|+.|++|..+|++++. .+||++|+|++|+.|+++. +|++. 
+..+++..++|.|+ T Consensus 136 ----------~~~a~----~~d~i~dA~~~lgd~~~--~~~~ivv~p~~~~~L~k~~~~~fv~~--s~~g~~~~~~G~ig 197 (274) T protein:vir:12 136 ----------NADIT----KLNGLQSAIDKFNDEDL--EPMVLFINPLDAGKLRGDASTNFTRA--TELGDDIIVKGAFG 197 (274) T ss_pred ----------ccccc----CHHHHHHHHHHhccccc--cccEEEeCHHHHHHHHhhhhhhcccc--ccccccceecccce Confidence 00011 27889999999998764 7899999999999999985 77754 33345667899999 Q ss_pred EEeccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccchhHHHHHHHHHH Q lcl|Aclame:pro 234 SSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKEKTYYIDTFM 313 (402) Q Consensus 234 ~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~~~~d~i~~~~ 313 (402) +++|++||+||++|.. .+++|++-|++.+..+++..|.+|+++++.|.|.+++ T Consensus 198 ~~~G~~Vi~s~~~p~~---------------------------t~~l~~~gA~~~~~~~~~~vE~~Rd~~~~~d~i~~~~ 250 (274) T protein:vir:12 198 EALGAIIVRSNKLEAG---------------------------TAILAKKGAVKLILKRDFFLEVARDASTKTTALYSDK 250 (274) T ss_pred eecCeeEEEeCCCCcc---------------------------eEEEEeccceeeeecCCceeccccchhhcccEEEeee Confidence 9999999999999841 1468899999999999999999999999999999999 Q ss_pred HhcCcccccceEEEEEEeeccCcc Q lcl|Aclame:pro 314 AEGAIPDRWEAVSVVTTKRDATTG 337 (402) Q Consensus 314 a~Ga~vlRPeaa~vv~~~~~~t~~ 337 (402) .||++++||+.+++|+...+-++- T Consensus 251 ~y~~~~~~~~~vv~~t~~~~~~~~ 274 (274) T protein:vir:12 251 HYVAYLYDESKAVKITKGSGSLEM 274 (274) T ss_pred EEEEEEEcCCceEEEEcCCccccC Confidence 999999999998888644333222 No 37 >protein:vir:96833 Length: 275 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240157;genbank:gi:66395822;genbank:GeneID:5133174 Probab=100.00 E-value=2.4e-38 Score=226.80 Aligned_cols=268 Identities=13% Similarity=0.095 Sum_probs=218.0 Q ss_pred CCCCcccccccccccccHHHHHH-HHHhHHHHHHHHHHhhhcccceee-ecc--ccceEEeeeccce-eeeeecCCCCCC Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDSLLI-EKFNGKVNEQYLKGENILSYFDVQ-TVT--GTNTVSNKYLGET-ELQVLAPGQSPN 75 (402) Q 
Consensus 1 Ms~~n~~t~~~~~~~~d~~alfl-e~f~geV~t~f~~~sv~~~~~~~r-ti~--~Gksv~f~~iG~~-t~~~~~~G~~i~ 75 (402) |+.+|..+ .-++++ |+|+-.|...+.+..+|.++..+- ++. .|++++||+...+ .++.+..|++|+ T Consensus 1 ~~~~~~T~---------l~d~i~PEv~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~ 71 (275) T protein:vir:96 1 MALENMTK---------LANMVNPEVLAPMMQAELDKKLKFAQFADIDNTLVGQPGNTITFPAFVYSGDAKVVPEGEEIP 71 (275) T ss_pred CCCcccch---------hhhhhchHHHHHHHHHHHHHhhhhcccceecccccCCCCCEEEeeeeccCCccccccCCCCcc Confidence 87777322 223666 999999999999999999998654 354 4999999986554 566889999999 Q ss_pred CCCccccceeEeecceeeccchhhhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccccccccc Q lcl|Aclame:pro 76 ATPTQADKNQLVIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKGH 155 (402) Q Consensus 76 ~~~~~~~e~~itID~~lya~~~IddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~g~ 155 (402) .+.+..++.+.+|.+ .+..+.++|++..++..| ...++.+++|++||+++|+.++..+..+... T Consensus 72 ~~~lt~~~~~~~i~~-~~~~~~i~D~~~~~~~~d-~~~~~~~~~a~~~a~~~d~~ll~~l~~a~~~-------------- 135 (275) T protein:vir:96 72 IDLIETKKRQATIRK-IGKGTVLTDEALLSGYGD-PKGEAVRQHGLAIANKVDNDVLEALQGATLK-------------- 135 (275) T ss_pred hhhcccceeeEEeeh-hcccccccHHHHHhhccc-hHHHHHHHHHHHHHHHHHHHHHHHHhccccc-------------- Confidence 999999999999977 599999999999999988 6889999999999999999988766432210 Q ss_pred ccccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhccc--chhhcccccccCcccccceEE Q lcl|Aclame:pro 156 GFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDAD--RIVDKTYTISQSGATINGFVL 233 (402) Q Consensus 156 ~~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~--r~~n~d~~~~~~g~~~~G~V~ 233 (402) + ..... -||.|++|..+|.+.+. .+||++|+|++|+.|+++. 
+|+..+ ..+++...+|.|+ T Consensus 136 -----~----~~~~~----~~d~i~dA~~~lgd~~~--~~~~ivv~p~~~~~L~k~~~~~f~~~~--~~g~~~~~~G~ig 198 (275) T protein:vir:96 136 -----V----EADIT----KLAGLQTAIDKFNDEDL--EPMVLFVNPLDAGKLRASATDNFTRAT--LLGDNVIVKGAFG 198 (275) T ss_pred -----c----ccccc----CHHHHHHHHHHhccccC--CccEEEeCHHHHHHHHhcccccccccc--cccccceeccccc Confidence 0 00111 27889999999987654 7899999999999998874 787543 2345667899999 Q ss_pred EEeccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccchhHHHHHHHHHH Q lcl|Aclame:pro 234 SSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKEKTYYIDTFM 313 (402) Q Consensus 234 ~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~~~~d~i~~~~ 313 (402) +++|++||+||++|.. .+++|++.|++++...++.+|.+|++.++.|.|.+++ T Consensus 199 ~~~G~~Vi~s~~~p~~---------------------------t~~i~~~gA~~~~~~~~~~vE~~Rd~~~~~d~i~~~~ 251 (275) T protein:vir:96 199 EALGAIIVRSNKIKEG---------------------------EAILAKRGAVKLITKRDFFLETERHASHKSTALFSDK 251 (275) T ss_pred eecCeeEEEeCCCCcc---------------------------eEEEEeccceeeeecCCcccccccchhhcCcEEEEeE Confidence 9999999999999842 1478899999999999999999999999999999999 Q ss_pred HhcCcccccceEEEEEEeeccCccc Q lcl|Aclame:pro 314 AEGAIPDRWEAVSVVTTKRDATTGD 338 (402) Q Consensus 314 a~Ga~vlRPeaa~vv~~~~~~t~~~ 338 (402) .||++++||+.+++++++.... 
|+ T Consensus 252 ~y~~~~~~~~~vv~~t~~~~~~-~~ 275 (275) T protein:vir:96 252 HYVAYLYDESKVVKITKSASGL-GV 275 (275) T ss_pred EEEEEEEcCccEEEEEeccccc-CC Confidence 9999999999998887654332 33 No 38 >protein:vir:95898 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240385;genbank:gi:66396054;genbank:GeneID:5133409 Probab=100.00 E-value=1.5e-38 Score=227.90 Aligned_cols=267 Identities=11% Similarity=0.083 Sum_probs=216.5 Q ss_pred CCCCcccccccccccccHHHHHH-HHHhHHHHHHHHHHhhhcccceee-ecc--ccceEEeeeccce-eeeeecCCCCCC Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDSLLI-EKFNGKVNEQYLKGENILSYFDVQ-TVT--GTNTVSNKYLGET-ELQVLAPGQSPN 75 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~alfl-e~f~geV~t~f~~~sv~~~~~~~r-ti~--~Gksv~f~~iG~~-t~~~~~~G~~i~ 75 (402) |++.. +---++++ |+|+..|...+.+..++.++..+- ++. .|++++||+...+ .++.+..|+.|. T Consensus 1 m~~~~----------T~l~d~i~Pev~~~~v~~~~~~~l~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~ 70 (274) T protein:vir:95 1 MAQGM----------TKLTNQIVPEVLAPMMQAELEKKLRFASFAEIDNTLVGQPGDTLTFPAFIYSGDAKVVAEGEKIP 70 (274) T ss_pred CCcce----------eehhheechHHHHHHHHHHHHhhhhccccceecccccCCCCCEEEeeeecCCCccccccCCCccc Confidence 66543 23345666 999999999999999999987543 454 4999999986543 566788999999 Q ss_pred CCCccccceeEeecceeeccchhhhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccccccccc Q lcl|Aclame:pro 76 ATPTQADKNQLVIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKGH 155 (402) Q Consensus 76 ~~~~~~~e~~itID~~lya~~~IddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~g~ 155 (402) .+.+...+.+++|++ .+..+.++|+|..++..| +..++.++++++||+.+|+.++..+.++... 
T Consensus 71 ~~~lt~~~~~~~i~~-~~~a~~i~D~~~~~~~~d-~~~~~~~~~~~~~a~~vd~~i~~~l~~a~~~-------------- 134 (274) T protein:vir:95 71 TDILETKKREAKIRK-IAKGTSISDEALLSGYGD-PQGEQVRQHGLAHANKVDDDVLEALKSAKLT-------------- 134 (274) T ss_pred hhhcccceeEEEeee-eecceeehHHHHhhccch-HHHHHHHHHHHHHHHHHHHHHHHHHhccccc-------------- Confidence 999999999999998 588999999999999888 7889999999999999999998766543211 Q ss_pred ccccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhccc--chhhcccccccCcccccceEE Q lcl|Aclame:pro 156 GFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDAD--RIVDKTYTISQSGATINGFVL 233 (402) Q Consensus 156 ~~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~--r~~n~d~~~~~~g~~~~G~V~ 233 (402) +. .... -|+.|.+|..+|++++. .+||++|+|++|+.|+++. +|+.. +..+++..++|.|+ T Consensus 135 -----~~----~~~~----~~d~i~~A~~~lgd~~~--~~~~ivv~p~~~~~L~k~~~~~f~~~--s~~g~~~~~~G~ig 197 (274) T protein:vir:95 135 -----VE----ADIT----KLTGLQTAIDKFNDEDL--EPMVLFISPLDAGKLRGDATTNFTRA--TELGDDVIVKGAFG 197 (274) T ss_pred -----cc----cccc----CHHHHHHHHHHhccccc--cccEEEeCHHHHHHHHhhcccccccc--ccccccceeccccc Confidence 00 0011 27889999999997764 7899999999999999986 67743 23345667899999 Q ss_pred EEeccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccchhHHHHHHHHHH Q lcl|Aclame:pro 234 SSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKEKTYYIDTFM 313 (402) Q Consensus 234 ~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~~~~d~i~~~~ 313 (402) +++||+||+||++|.. 
.+++|++.|+++...+++.+|..|++.++.|.|.+++ T Consensus 198 ~~~G~~Vi~s~~~~~~---------------------------t~~l~~~gA~~~~~~~~~~vE~~Rd~~~~~d~i~~~~ 250 (274) T protein:vir:95 198 EALGAVIVRSNKLEAG---------------------------TAILAKKGAVKLITKRDFFLETDRDPSTKTTALYSDK 250 (274) T ss_pred eecCeEEEEeCCCCCc---------------------------eEEEEeccceeeeecCCcccccccccccccCEEEEeE Confidence 9999999999999831 1468889999999999999999999999999999999 Q ss_pred HhcCcccccceEEEEEEeeccCcc Q lcl|Aclame:pro 314 AEGAIPDRWEAVSVVTTKRDATTG 337 (402) Q Consensus 314 a~Ga~vlRPeaa~vv~~~~~~t~~ 337 (402) .||++++||+.+++++.-.+..+- T Consensus 251 ~y~~~~~~~~~~v~~tk~~~~~~~ 274 (274) T protein:vir:95 251 HYVAYLYDESKAVKITKGSGSLEM 274 (274) T ss_pred EEEEEEEcCCcEEEEEcCCccccC Confidence 999999999999888654443332 No 39 >protein:vir:96262 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240311;genbank:gi:66395978;genbank:GeneID:5133339 Probab=100.00 E-value=1.5e-38 Score=227.90 Aligned_cols=267 Identities=11% Similarity=0.083 Sum_probs=216.5 Q ss_pred CCCCcccccccccccccHHHHHH-HHHhHHHHHHHHHHhhhcccceee-ecc--ccceEEeeeccce-eeeeecCCCCCC Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDSLLI-EKFNGKVNEQYLKGENILSYFDVQ-TVT--GTNTVSNKYLGET-ELQVLAPGQSPN 75 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~alfl-e~f~geV~t~f~~~sv~~~~~~~r-ti~--~Gksv~f~~iG~~-t~~~~~~G~~i~ 75 (402) |++.. +---++++ |+|+..|...+.+..++.++..+- ++. .|++++||+...+ .++.+..|+.|. 
T Consensus 1 m~~~~----------T~l~d~i~Pev~~~~v~~~~~~~l~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~ 70 (274) T protein:vir:96 1 MAQGM----------TKLTNQIVPEVLAPMMQAELEKKLRFASFAEIDNTLVGQPGDTLTFPAFIYSGDAKVVAEGEKIP 70 (274) T ss_pred CCcce----------eehhheechHHHHHHHHHHHHhhhhccccceecccccCCCCCEEEeeeecCCCccccccCCCccc Confidence 66543 23345666 999999999999999999987543 454 4999999986543 566788999999 Q ss_pred CCCccccceeEeecceeeccchhhhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccccccccc Q lcl|Aclame:pro 76 ATPTQADKNQLVIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKGH 155 (402) Q Consensus 76 ~~~~~~~e~~itID~~lya~~~IddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~g~ 155 (402) .+.+...+.+++|++ .+..+.++|+|..++..| +..++.++++++||+.+|+.++..+.++... T Consensus 71 ~~~lt~~~~~~~i~~-~~~a~~i~D~~~~~~~~d-~~~~~~~~~~~~~a~~vd~~i~~~l~~a~~~-------------- 134 (274) T protein:vir:96 71 TDILETKKREAKIRK-IAKGTSISDEALLSGYGD-PQGEQVRQHGLAHANKVDDDVLEALKSAKLT-------------- 134 (274) T ss_pred hhhcccceeEEEeee-eecceeehHHHHhhccch-HHHHHHHHHHHHHHHHHHHHHHHHHhccccc-------------- Confidence 999999999999998 588999999999999888 7889999999999999999998766543211 Q ss_pred ccccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhccc--chhhcccccccCcccccceEE Q lcl|Aclame:pro 156 GFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDAD--RIVDKTYTISQSGATINGFVL 233 (402) Q Consensus 156 ~~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~--r~~n~d~~~~~~g~~~~G~V~ 233 (402) +. .... -|+.|.+|..+|++++. .+||++|+|++|+.|+++. +|+.. 
+..+++..++|.|+ T Consensus 135 -----~~----~~~~----~~d~i~~A~~~lgd~~~--~~~~ivv~p~~~~~L~k~~~~~f~~~--s~~g~~~~~~G~ig 197 (274) T protein:vir:96 135 -----VE----ADIT----KLTGLQTAIDKFNDEDL--EPMVLFISPLDAGKLRGDATTNFTRA--TELGDDVIVKGAFG 197 (274) T ss_pred -----cc----cccc----CHHHHHHHHHHhccccc--cccEEEeCHHHHHHHHhhcccccccc--ccccccceeccccc Confidence 00 0011 27889999999997764 7899999999999999986 67743 23345667899999 Q ss_pred EEeccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccchhHHHHHHHHHH Q lcl|Aclame:pro 234 SSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKEKTYYIDTFM 313 (402) Q Consensus 234 ~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~~~~d~i~~~~ 313 (402) +++||+||+||++|.. .+++|++.|+++...+++.+|..|++.++.|.|.+++ T Consensus 198 ~~~G~~Vi~s~~~~~~---------------------------t~~l~~~gA~~~~~~~~~~vE~~Rd~~~~~d~i~~~~ 250 (274) T protein:vir:96 198 EALGAVIVRSNKLEAG---------------------------TAILAKKGAVKLITKRDFFLETDRDPSTKTTALYSDK 250 (274) T ss_pred eecCeEEEEeCCCCCc---------------------------eEEEEeccceeeeecCCcccccccccccccCEEEEeE Confidence 9999999999999831 1468889999999999999999999999999999999 Q ss_pred HhcCcccccceEEEEEEeeccCcc Q lcl|Aclame:pro 314 AEGAIPDRWEAVSVVTTKRDATTG 337 (402) Q Consensus 314 a~Ga~vlRPeaa~vv~~~~~~t~~ 337 (402) .||++++||+.+++++.-.+..+- T Consensus 251 ~y~~~~~~~~~~v~~tk~~~~~~~ 274 (274) T protein:vir:96 251 HYVAYLYDESKAVKITKGSGSLEM 274 (274) T ss_pred EEEEEEEcCCcEEEEEcCCccccC Confidence 999999999999888654443332 No 40 >protein:vir:108303 Length: 418 # NCBI annotation: hypothetical protein # Family: family:all:1412 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552282;genbank:gi:160700607;genbank:GeneID:5758819 Probab=100.00 E-value=9.1e-38 Score=223.65 Aligned_cols=291 Identities=11% Similarity=-0.016 Sum_probs=201.7 Q ss_pred CCCCcccccccccccccHHHHH-HHHHhHHHHHHHHHHhhhcccceee---ec-cccceEEeeeccceeeeeecCCCCCC Q lcl|Aclame:pro 1 
MSTPNTLTNVAVSASGEVDSLL-IEKFNGKVNEQYLKGENILSYFDVQ---TV-TGTNTVSNKYLGETELQVLAPGQSPN 75 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~alf-le~f~geV~t~f~~~sv~~~~~~~r---ti-~~Gksv~f~~iG~~t~~~~~~G~~i~ 75 (402) |+.-+| ++. -|+|+.|.+..|++..+|.++++.. .+ +.|+||+||+.+..++++ |..+. T Consensus 1 m~~~~N-------------~~ltp~iia~~~l~~l~~~lV~~~lv~r~y~~e~~~~GDTV~I~vp~~~~v~d---g~~~~ 64 (418) T protein:vir:10 1 MAVQDN-------------NLLTDDVIAKEALRLLKNNLVMAKCVYRNYEKTFGKVGDTIRLKLPYRVKSAS---GRTLV 64 (418) T ss_pred CCcccc-------------ccccHHHHHHHHHHHHHHhccchhhhcCCCchHHhhCCCEEEEeeCCceeecc---cCCcc Confidence 665431 111 2799999999999999999888632 22 459999999999999986 55688 Q ss_pred CCCccccceeEeecceeeccchhhhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccccccccc Q lcl|Aclame:pro 76 ATPTQADKNQLVIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKGH 155 (402) Q Consensus 76 ~~~~~~~e~~itID~~lya~~~IddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~g~ 155 (402) ++.+...+..|+||+.+|+.+.|+|.|..|...| ++.++.++++++||+.+|+.++..+..++... T Consensus 65 ~~~~te~~v~l~id~~k~~~~~itD~e~a~~~~d-~~~~~l~~A~~aLA~~vD~~ia~l~~~a~~~~------------- 130 (418) T protein:vir:10 65 KQPMVDQTIPFKIAYQEHVGLEYTVKDKTLDIMQ-FSERYLKSGMVQIANQIDRSLALTLKKAFHSS------------- 130 (418) T ss_pred ccccccceEEEEEecccccceeechHHHhhhhhH-HHHHHHHHHHHHHHHHHHHHHHHHHhhccccc------------- Confidence 8899999999999999999999999999999998 89999999999999999999987665432210 Q ss_pred ccccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccC-cEEEeChHHHHHHhcccchhhcccccccCcccccceEEE Q lcl|Aclame:pro 156 GFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISD-VAIMMPWKFFNALRDADRIVDKTYTISQSGATINGFVLS 234 (402) Q Consensus 156 ~~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~g-R~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~~~~G~V~~ 234 (402) +. +. ++...|+.|.+++.+|||++||.+| ||+||+|++|+.|+++.++. 
.+.+ ..+..+++|.|++ T Consensus 131 gt--------~g---t~~~~~~~i~~a~~~Ld~~~VP~~G~R~lVv~P~~~~~L~~~~~~~-~~~~-~~~~~lr~G~IG~ 197 (418) T protein:vir:10 131 GT--------PG---VRPGAFIDFANAGAKQTTYAVPQDGMRHAVLDPFTCASLSDEVTKL-FKES-MVEQAYKMGYRGN 197 (418) T ss_pred cc--------CC---cCcchHHHHHHHHHHHHhcCCCCCCceEEEeCHHHHHHHhhhcccc-cccc-ccchhhheeeeee Confidence 00 00 1112488999999999999999985 99999999999999988875 3332 2334588999999 Q ss_pred EeccEEEecCccccccCccccc------------cccc-------cC---Cccccc----------------------ee Q lcl|Aclame:pro 235 SYNCPVIPSNRFPTFAQDQAHH------------LLSN-------ED---NGYRYD----------------------PI 270 (402) Q Consensus 235 iaG~~V~~SNnlP~~~~~~t~~------------~ls~-------a~---~G~~~~----------------------~~ 270 (402) ++||+||+|||+|....+..+. ..+. .+ .|..+. +. T Consensus 198 i~GF~V~~S~nip~~tag~~~~t~~v~ga~~~~~~~~~~~~t~s~~g~l~~Gd~~ti~gv~~v~~~t~~~~~~~~~f~V~ 277 (418) T protein:vir:10 198 VAAYEVYESQNLPKHTVGDHGGTPLVNGTVVNGDTVGFDGGTASTTGFLKAGDVITFGGVFGVNPQNYETTGLLQEFVVL 277 (418) T ss_pred eeceEEEEecCCCcccccccccceeeecccccceeEEEeecceeeccceeeccEEEECceeecccccccccccceEEEEE Confidence 9999999999999533221100 0000 00 011111 11 Q ss_pred eec---------------------------------------------------------cceeEEeecHHHhhhh--hh Q lcl|Aclame:pro 271 AEM---------------------------------------------------------NGAVAVLFTSDALLVG--RT 291 (402) Q Consensus 271 ad~---------------------------------------------------------~~~~al~fh~~Av~tv--~~ 291 (402) ++. 
+-..-|+||++|+..+ .+ T Consensus 278 ~~~~~~~~~~~tv~i~p~~~~~~~~~~~~~~~~~~~~~~~~v~a~~a~~~~it~~~~a~~~~~~nl~f~~~a~~l~~~~l 357 (418) T protein:vir:10 278 EDVDTDAGGAGSIKISPSLNDGTATINNENGDPVSLTAYQNVTALPADNAPITVLGAANTTYEQNYLFHRDAIALAMIDL 357 (418) T ss_pred eeccccccCcceeEeccccccccccccccccccccccCCCcccccccCcceeeeecccccceeeeeeeecceEEEEEeec Confidence 110 0112389999976533 22 Q ss_pred --------cc--------cce--eeccchhHHHHHHHHHHHhcCcccccceEEEEEEeeccCc Q lcl|Aclame:pro 292 --------IE--------VTG--DIFYEKKEKTYYIDTFMAEGAIPDRWEAVSVVTTKRDATT 336 (402) Q Consensus 292 --------~d--------l~~--e~~~d~~~~~d~i~~~~a~Ga~vlRPeaa~vv~~~~~~t~ 336 (402) +. ++. -.+||.+.+.+.+.=-..||.+.+|||.++ ++-+...+ T Consensus 358 ~~p~g~~~~~~~~~~~~G~s~r~~~~~d~~~~~~~~r~d~l~g~~~~~p~~~~--~~~g~~~~ 418 (418) T protein:vir:10 358 ELPQSAVIKSRAADPETGLSLTLTGAYDINEQSEIHRIDAVWGADMIYGELAL--RLWGAASS 418 (418) T ss_pred cCCCCCCcceEEEeccCCeEEEEEEcccccccceEEEEEeecCceeecccceE--EEEeecCC Confidence 00 111 123555554444333347999999999863 34444322 No 41 >protein:vir:79008 Length: 299 # NCBI annotation: putative main capsid protein # Family: family:all:701 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110725;genbank:gi:134287342;genbank:GeneID:4955182 Probab=100.00 E-value=1.4e-36 Score=217.17 Aligned_cols=288 Identities=12% Similarity=0.049 Sum_probs=198.3 Q ss_pred CCCCcccccccccccccHHHHHHHHHhHHHHHHHHHHhhhcccceee---ec--cccceEEeeeccceeeeeecCCCC-C Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDVQ---TV--TGTNTVSNKYLGETELQVLAPGQS-P 74 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~alfle~f~geV~t~f~~~sv~~~~~~~r---ti--~~Gksv~f~~iG~~t~~~~~~G~~-i 74 (402) |+..| |.|+|+.++++.|.+.+++..+.+.. .+ .|||+|+||+|+...+++|++++. . 
T Consensus 1 MA~~n----------------~a~~~~~~Ld~~~~~~l~~~~L~~~~~~~~v~~~gg~tVkI~~i~~~gl~DY~R~~~g~ 64 (299) T protein:vir:79 1 MAALN----------------YAKEYSNVLAQAYPYTLNFGDLYATPNNGRYRWTGSKTIEIPTISTTGRVDSNRDTIAV 64 (299) T ss_pred Cccch----------------hHHHHHHHHHHHHHhhceeeeeccCcccceeeecCCCEEEEeccccccccccccCCCcc Confidence 65322 56999999999999999988765532 34 478999999999999999999774 4 Q ss_pred CCCCccccceeEeecceeeccchhhhHHHhhcCccc-hhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccccccc Q lcl|Aclame:pro 75 NATPTQADKNQLVIDTTVIARNTVAHIHDVQGDIDS-LKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVK 153 (402) Q Consensus 75 ~~~~~~~~e~~itID~~lya~~~IddlDe~q~~~D~-vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~ 153 (402) ....+..+..+++||+.+|+.|.||++|.-+++... +-..+.+.+-+.++-++|...+..|+.++... T Consensus 65 ~~g~~~~~~~t~~ldqdr~~~f~vD~~Dvdet~~~~~~a~v~~~~~~~~v~pEiDay~~skl~~~a~~~----------- 133 (299) T protein:vir:79 65 AQRNYDNAWEPKVLTNQRKWSTLVHPADINQTNYVASIGNITKVYNEEQKFPEMDAYCISKIYADWTAL----------- 133 (299) T ss_pred cccccCcceeEEEeeccccceeccchhhHHHHhhhhHHHHHHHHHHHHHhhhHhhHHHHHHHHHhhhhc----------- Confidence 555788899999999999999999966655555431 34446666778888899999988887544210 Q ss_pred ccccccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccccCcccccceEE Q lcl|Aclame:pro 154 GHGFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATINGFVL 233 (402) Q Consensus 154 g~~~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~~~~G~V~ 233 (402) .......+.+++++|++|.++..+|||++||.++||++|+|++|.+|.++++|... 
......+...+|.|+ T Consensus 134 --------g~~~~~~~~T~~n~y~~i~~~~~~lde~~vP~~~rvl~vtp~~~~~L~~~~~f~k~-~~~~~~~~~~~g~Vg 204 (299) T protein:vir:79 134 --------GNTADTTVLTTTNVLEVFDKLMEKMTEARVPENGRILYVTPVVNTLIKNAKEIQRT-VNIKDAGTSLNRQTT 204 (299) T ss_pred --------CCcccccccCHHHHHHHHHHHHHHHHhcCCCCCCeEEEeCHHHHHHHhhchhhhcc-cccccccceeeeeee Confidence 01122334578899999999999999999999999999999999999999999843 334444456799999 Q ss_pred EEeccEEEe--cCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccchhHH-HHHHH Q lcl|Aclame:pro 234 SSYNCPVIP--SNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKEK-TYYID 310 (402) Q Consensus 234 ~iaG~~V~~--SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~~~-~d~i~ 310 (402) ++.||+|++ ||+|++.-....+... + .+-.+.-.+++|++|+..+...+ .++.|.+...+ +|++. T Consensus 205 ~idG~~Ii~Vps~r~~t~~~~~~G~~~-----~------~~ak~in~ii~~~~a~~~~~K~~-~~~~~~P~~~~~~~~~~ 272 (299) T protein:vir:79 205 DIDTVKIIKVPSNLMKTAYDFTTGWKV-----G------AGAKQIFMSLVHPSAIITPVSYQ-FSKLDEPTAVTEGKYFY 272 (299) T ss_pred eecceEEEEechhhcCccceeccCccc-----c------CcccccceEEEcCCeeeeeEeee-eEEeecCCCCCccceee Confidence 999999987 7888853322111111 1 11112335888999998877776 45566554443 33322 Q ss_pred HHHHhc-CcccccceEEEEEEeeccCcc Q lcl|Aclame:pro 311 TFMAEG-AIPDRWEAVSVVTTKRDATTG 337 (402) Q Consensus 311 ~~~a~G-a~vlRPeaa~vv~~~~~~t~~ 337 (402) -...|+ .-++.... 
-.|.+-..+.-+ T Consensus 273 ~~r~y~d~~v~~nk~-~~i~~~~~~a~~ 299 (299) T protein:vir:79 273 FEESFEDVFILNKKA-DAIQFVVEGAGA 299 (299) T ss_pred eeeeeeeeeeecccc-CeEEEEeeecCC Confidence 233333 33332222 222222222111 No 42 >protein:vir:105334 Length: 276 # NCBI annotation: putative phage major capsid protein # Family: family:all:522 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950669;genbank:gi:119967839;genbank:GeneID:4643213 Probab=100.00 E-value=1.5e-35 Score=211.54 Aligned_cols=269 Identities=12% Similarity=0.097 Sum_probs=217.9 Q ss_pred CCCCcccccccccccccHHHHHH-HHHhHHHHHHHHHHhhhcccceeee-c--cccceEEeeeccce-eeeeecCCCCCC Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDSLLI-EKFNGKVNEQYLKGENILSYFDVQT-V--TGTNTVSNKYLGET-ELQVLAPGQSPN 75 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~alfl-e~f~geV~t~f~~~sv~~~~~~~rt-i--~~Gksv~f~~iG~~-t~~~~~~G~~i~ 75 (402) |++-. +-.-++++ |+|+..|...+.+..+|.++..+-+ + +.|++++||+.+.+ .++.+.-|++|+ T Consensus 1 Ma~~~----------T~l~d~i~Pev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~ti~iP~~~~igda~~~~eg~~i~ 70 (276) T protein:vir:10 1 MAQGT----------TTKSTQIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFVYSGDATVVPEGQKIP 70 (276) T ss_pred CCcce----------eehhhhhchHHHHHHHHHHHHhhhhhcccceecccccCCCCCEEEeeeecCCCccccccCCCccC Confidence 55432 33445666 9999999999999999999987654 4 36999999987554 456788899999 Q ss_pred CCCccccceeEeecceeeccchhhhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccccccccc Q lcl|Aclame:pro 76 ATPTQADKNQLVIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKGH 155 (402) Q Consensus 76 ~~~~~~~e~~itID~~lya~~~IddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~g~ 155 (402) .+.+..++.+.+|.+ .+..+.++|++..++..| ...++.+++|++||+++|+.++..+..+... 
T Consensus 71 ~~~lt~~~~~a~i~~-~~k~~~~tD~a~~~~~~d-p~~~~~~~~~~~~a~~~d~~~~~~l~~~~~~-------------- 134 (276) T protein:vir:10 71 VDKIETNRREAKIHK-IGKGTDITDEALLSGYGD-PQGEAVRQHGLAIANKVDNDVLEALRGTKLT-------------- 134 (276) T ss_pred ccccccceeeEEeeh-ccccccccHHHHHhhccc-hHHHHHHHHHHHHHHHHHHHHHHHHhccccc-------------- Confidence 999999999999965 689999999999999998 7889999999999999999998776532211 Q ss_pred ccccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcc--cchhhcccccccCcccccceEE Q lcl|Aclame:pro 156 GFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDA--DRIVDKTYTISQSGATINGFVL 233 (402) Q Consensus 156 ~~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~--~r~~n~d~~~~~~g~~~~G~V~ 233 (402) .. ....+ ++.|.+|..+|+++++ ..++++|.|++|..|+++ .+|++.. ..+++...+|.|+ T Consensus 135 -----~~----~~~~t----~d~i~~A~~~lgd~~~--~~~~ivv~p~~~~~L~k~~~~~f~~~s--~~g~~~~~~G~ig 197 (276) T protein:vir:10 135 -----VS----ADIGT----LAGLEAAIDTFDDEDL--EPMVLFINPKDAGKLRSSASDNFTRAT--ELGDNIIVKGAFG 197 (276) T ss_pred -----cc----ccccC----HHHHHHHHHHhccccC--cccEEEEcHHHHHHHHHhccccccccc--cccccceeccccc Confidence 00 00111 6789999999988765 689999999999999764 6888543 3356667899999 Q ss_pred EEeccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccchhHHHHHHHHHH Q lcl|Aclame:pro 234 SSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKEKTYYIDTFM 313 (402) Q Consensus 234 ~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~~~~d~i~~~~ 313 (402) +++|++|+.|+++|.. 
.+++|++.|++.+...++.+|..|++.++.|.|.+.+ T Consensus 198 ~~~G~~Vi~s~~~p~~---------------------------t~~l~~~gAi~~~~~~~~~vE~dRd~~~~~d~i~~~~ 250 (276) T protein:vir:10 198 EALGAVIVRSKKLDEG---------------------------EAILAKRGAVKLITKRDFFLETDRDPSTKTTALYSDK 250 (276) T ss_pred eecceeEEEcCCCCcc---------------------------eEEEEeccceeeeecCCceeecccchhhcccEEEEee Confidence 9999999999999831 1468899999999999999999999999999999999 Q ss_pred HhcCcccccceEEEEEEeeccCccccccch Q lcl|Aclame:pro 314 AEGAIPDRWEAVSVVTTKRDATTGDAGGPG 343 (402) Q Consensus 314 a~Ga~vlRPeaa~vv~~~~~~t~~~a~~~~ 343 (402) .||+++++|+.++.|+.-. +|.++.| T Consensus 251 ~y~~~~~~~~~vv~~t~~~----~~~~~~~ 276 (276) T protein:vir:10 251 HYVAYLYDESKAVKVTKGA----GTTDSGA 276 (276) T ss_pred EEEEEEEcCcceEEEecCC----cCCcCCC Confidence 9999999999887776433 4444444 No 43 >protein:vir:3525 Length: 423 # NCBI annotation: major head protein # Family: family:all:1412 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050985;genbank:gi:9633571;genbank:GeneID:1262318 Probab=100.00 E-value=2.4e-35 Score=210.34 Aligned_cols=291 Identities=11% Similarity=-0.018 Sum_probs=197.4 Q ss_pred CCCCcccccccccccccHHHHHH-HHHhHHHHHHHHHHhhhcccceee---ec---cccceEEeeeccceeeeeecCC-- Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDSLLI-EKFNGKVNEQYLKGENILSYFDVQ---TV---TGTNTVSNKYLGETELQVLAPG-- 71 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~alfl-e~f~geV~t~f~~~sv~~~~~~~r---ti---~~Gksv~f~~iG~~t~~~~~~G-- 71 (402) |+ | ...-|| ++|+.+.+..|++..+|.++++.. 
.+ +.|+||+|++.+..+++++.++ T Consensus 1 MA--N------------~llT~iP~iia~~al~~l~~~lV~~~lV~r~y~ge~~~a~~GDTV~I~~p~~~~v~d~~~~~~ 66 (423) T protein:vir:35 1 MA--N------------NLESNISQIVLKKFLPGFMSDIVLCKTVDRQLLSGEINSNTGDSVSFKRPHQFKSERTETGDI 66 (423) T ss_pred Cc--c------------chhhhhHHHHHHHHHHHHHhhcccchhcccCCCcccccccCCCEEEEeeCCcceeecccCcCC Confidence 54 1 233565 999999999999999999998632 23 3499999999999999999774 Q ss_pred CCCCCCCccccceeEeecceeeccchhhhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccccc Q lcl|Aclame:pro 72 QSPNATPTQADKNQLVIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPR 151 (402) Q Consensus 72 ~~i~~~~~~~~e~~itID~~lya~~~IddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~ 151 (402) ..+.++.+...+..|+||+.+|+.+.++|.|..|.--| ++.. .+.++++|++.+|+.++..++.++.. T Consensus 67 ~~~~~~~~~e~~v~l~id~~k~~a~~v~d~e~~l~i~~-~~~~-l~~a~~ala~~vd~~l~~~l~~~a~~---------- 134 (423) T protein:vir:35 67 TGKDKNGLFSAKATGKVGKYITVAVEWTQIEEALKLNQ-LDQI-LSPIHERMVTDLETELAHFMMNNGAL---------- 134 (423) T ss_pred CCccccccccceeeEEeccceeccceeCHHHHHhhHHH-HHHH-HHHHHHHHHHHHHHHHHHHHhhcccc---------- Confidence 67888999888999999999999999999999997766 6654 46778999999999998877654310 Q ss_pred ccccccccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccc-hhhcccccccCcccccc Q lcl|Aclame:pro 152 VKGHGFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADR-IVDKTYTISQSGATING 230 (402) Q Consensus 152 ~~g~~~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r-~~n~d~~~~~~g~~~~G 230 (402) .. + .+. ++...|+.|.+++.+|||++||..|||+||+||+|..|+++++ |.+.+ +. 
....+++| T Consensus 135 ~v--g--------t~~---t~~~~~~~i~~a~~~Ld~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~-~~-~~~alr~g 199 (423) T protein:vir:35 135 SL--G--------SPN---TAIKKWADVAQTASFIKDIGIKTGENYAIMDPWSAQRLADAQSGLHAAD-QL-VRTAWENA 199 (423) T ss_pred cc--c--------ccc---CCcchHHHHHHHHHHHHHhcCCcCCCEEEeCHHHHHHHhccccceeccc-cc-hhHHHhhc Confidence 00 0 000 1112378999999999999999999999999999999997554 55433 22 33347788 Q ss_pred eE-EEEeccEEEecCccccccCccccccc----------------------------cccC---Cccccceee------- Q lcl|Aclame:pro 231 FV-LSSYNCPVIPSNRFPTFAQDQAHHLL----------------------------SNED---NGYRYDPIA------- 271 (402) Q Consensus 231 ~V-~~iaG~~V~~SNnlP~~~~~~t~~~l----------------------------s~a~---~G~~~~~~a------- 271 (402) .| ++++||+||+|||+|....+..+... +..+ -|-.++|++ T Consensus 200 ~i~G~i~GFdv~~Snnvp~~T~gt~~~~~~v~~a~~v~~~a~~~~~~~~~~~~~~~~~~~g~l~~GD~~t~aGv~~v~~~ 279 (423) T protein:vir:35 200 QISGNFGGIRALMSNGLASRKQGDFDGAITVKTAPNVDYLSVKDSYQFTVALTGATPSKTGFLKAGDQLKFTSTHWLNQQ 279 (423) T ss_pred cceeeecceEEEEcCCCccccccccccceeeccccccccccccccccceeeeeeeeeccCCcEEecceEEeeeeeecccc Confidence 76 89999999999999963222111000 0000 000001111 Q ss_pred --------------------ec----------------------------------------------cceeEEeecHHH Q lcl|Aclame:pro 272 --------------------EM----------------------------------------------NGAVAVLFTSDA 285 (402) Q Consensus 272 --------------------d~----------------------------------------------~~~~al~fh~~A 285 (402) +- ....-|+||++| T Consensus 280 t~~~~~~~~t~~~~~~~V~~~~~~~a~g~~~v~i~p~~~~~~~~~~~~~v~a~~a~~~~vt~~~~a~~~~~~nl~~~~~a 359 (423) T protein:vir:35 280 SKQTLYNGSTAMSFTATVLEETNSTASGDVTVKLSGVPIYDEKNSQYNAVDAKVKAGDAVSIIGTAKQQMKPNLFYNKFF 359 (423) T ss_pred ccceeecccCCceeEEEEeccccccccCceeEEccccccccCCCcccccccccccCCceeeeeecCCCceeEEEeecCce Confidence 00 011457999998 Q ss_pred hhhhhhcc-----------------cceeeccchhHHHHHHHHHHHhcCcccccceEEEEEEeecc Q lcl|Aclame:pro 286 LLVGRTIE-----------------VTGDIFYEKKEKTYYIDTFMAEGAIPDRWEAVSVVTTKRDA 334 (402) Q Consensus 286 
v~tv~~~d-----------------l~~e~~~d~~~~~d~i~~~~a~Ga~vlRPeaa~vv~~~~~~ 334 (402) +..+..-. +.+-.+||.+..-..+.==..||.+.+|||.++-|- +.. T Consensus 360 ~~l~~~~l~~~~~~~~~~~~~~g~s~r~~~~~d~~~~~~~~r~d~l~g~~~~~p~~~~~~~--g~~ 423 (423) T protein:vir:35 360 CGLGTIPLPKLHSLDSAVATYEGFSIRVHKYADGDANKQMMRFDLLPAYVCFNPHMGGQFF--GNP 423 (423) T ss_pred eEEEEEccccCCccceeeccccCceEEEEEeeccccCceEEEEEeecceeeecccceEEEE--ecC Confidence 76543211 122234444433322222235999999999875442 222 No 44 >protein:vir:105374 Length: 423 # NCBI annotation: gene 5 protein # Family: family:all:1412 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958181;genbank:gi:41057283;genbank:GeneID:2716621 Probab=100.00 E-value=6.3e-35 Score=208.07 Aligned_cols=292 Identities=10% Similarity=-0.010 Sum_probs=196.6 Q ss_pred CCCCcccccccccccccHHHHHH-HHHhHHHHHHHHHHhhhcccceee---ec---cccceEEeeeccceeeeeecCC-- Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDSLLI-EKFNGKVNEQYLKGENILSYFDVQ---TV---TGTNTVSNKYLGETELQVLAPG-- 71 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~alfl-e~f~geV~t~f~~~sv~~~~~~~r---ti---~~Gksv~f~~iG~~t~~~~~~G-- 71 (402) |+ +....|+ ++|..+++..|++..++.++++.. .+ +.|+||+|++.+..++++++++ T Consensus 1 Ma--------------N~llT~~p~iia~~aL~~l~~~lV~~~lVnr~y~~ef~~~k~GDTV~I~~p~~~~~~d~~~~~~ 66 (423) T protein:vir:10 1 MP--------------NNLDSNVSQIVLKKFLPGFMSDLVLAKTVDRQLLAGEINSSTGDSVSFKRPHQFSSLRTPTGDI 66 (423) T ss_pred Cc--------------cchhhhhHHHHHHHHHHHHHhhcccchhhcccCCCcccccccCCEEEEeeCCceeeeccCCccc Confidence 43 1223354 999999999999999999998642 23 3599999999999999999865 Q ss_pred CCCCCCCccccceeEeecceeeccchhhhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccccc Q lcl|Aclame:pro 72 QSPNATPTQADKNQLVIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPR 151 (402) Q Consensus 72 ~~i~~~~~~~~e~~itID~~lya~~~IddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~ 151 (402) ..++.+.+...+..|+||+.+|+.+.++|.|..+.-- ++ .++.+.+.++||+.+|+.++..+...+-. 
T Consensus 67 ~~~~~~dl~e~~v~l~id~~k~va~~v~d~E~~~~i~-~~-~~~l~~A~~aLA~~vd~~ia~~~~~~~~~---------- 134 (423) T protein:vir:10 67 SGQNKNNLISGKATGRVGNYITVAVEYQQLEEAIKLN-QL-EEILAPVRQRIVTDLETELAHFMMNNGAL---------- 134 (423) T ss_pred cccccCccccceeEEEeeceeeeeeeechHHHhcChh-hH-HHHHHHHHHHHHHHHHHHHHHHHhhcccc---------- Confidence 4578889999999999999999999999999986554 47 56778889999999999998776543210 Q ss_pred ccccccccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccccCcccccce Q lcl|Aclame:pro 152 VKGHGFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATINGF 231 (402) Q Consensus 152 ~~g~~~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~~~~G~ 231 (402) . ++ .+.. +...|+.|.+++.+|+|++||..|||+||+||+|..|++++++...+-.+ ....+++|. T Consensus 135 ~--------~g--t~~t---~~~a~~~i~~a~~~Ld~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~-~~~alr~g~ 200 (423) T protein:vir:10 135 S--------LG--SPNT---PITKWSDVAQTASFLKDLGVNEGENYAVMDPWSAQRLADAQTGLHASDQL-VRTAWENAQ 200 (423) T ss_pred c--------cc--cCCc---ccchHHHHHHHHHHHHhccCCcCCCEEEeChHHHHHHhccccceeccccc-chhhhhhcc Confidence 0 00 0000 11237889999999999999999999999999999999766544333233 334478888 Q ss_pred E-EEEeccEEEecCccccccCcccccc------cc---cc--CC--------------------ccccceee-------- Q lcl|Aclame:pro 232 V-LSSYNCPVIPSNRFPTFAQDQAHHL------LS---NE--DN--------------------GYRYDPIA-------- 271 (402) Q Consensus 232 V-~~iaG~~V~~SNnlP~~~~~~t~~~------ls---~a--~~--------------------G~~~~~~a-------- 271 (402) | ++++||+||+|||+|....+..+.. +. 
.+ ++ |-.+++++ T Consensus 201 i~G~i~GFdv~~Snnip~~T~gt~~~t~~~~~~~~v~~~a~~~a~~~~~~~~~~~~~~~~~l~~GD~~t~aGv~~v~~~t 280 (423) T protein:vir:10 201 IPTNFGGIRALMSNGLASRTQGAFGGTLTVKTQPTVTYNAVKDSYQFTVTLTGATASVTGFLKAGDQVKFTNTYWLQQQT 280 (423) T ss_pred ceeeecceEEEEeCCCccccccccccceeeeecceeccccccccceeeeeeeeccccccCceeecceEEecceeeecccc Confidence 7 8999999999999996322211100 00 00 00 11111111 Q ss_pred -------------------ec----------------------------------------------cceeEEeecHHHh Q lcl|Aclame:pro 272 -------------------EM----------------------------------------------NGAVAVLFTSDAL 286 (402) Q Consensus 272 -------------------d~----------------------------------------------~~~~al~fh~~Av 286 (402) +- ....-|+||++|+ T Consensus 281 k~~~~~~~t~~~~~~~v~a~~~~~~~g~~tv~i~p~~i~~~~~~~~~~v~a~~a~~~~vT~~~~a~~t~~~nl~~~~~a~ 360 (423) T protein:vir:10 281 KQALYNGATPISFTATVTADANSDSGGDVTVTLSGVPIYDTTNPQYNSVSRQVEAGDAVSVVGTASQTMKPNLFYNKFFC 360 (423) T ss_pred cccccccccCcceEEEEEeeeeeccCCceeeeccCccccccCCcccccccccccCCceeeccccccCCeeEEEEecCcce Confidence 10 0113379999987 Q ss_pred hhhhh-----------------cccceeeccchhHHHHHHHHHHHhcCcccccceEEEEEEeecc Q lcl|Aclame:pro 287 LVGRT-----------------IEVTGDIFYEKKEKTYYIDTFMAEGAIPDRWEAVSVVTTKRDA 334 (402) Q Consensus 287 ~tv~~-----------------~dl~~e~~~d~~~~~d~i~~~~a~Ga~vlRPeaa~vv~~~~~~ 334 (402) ..+.. +.+.+-.+||.+..-..+.==..||.+.+|||.++-|- +.. 
T Consensus 361 ~l~~~pl~~~~~~~~~~~~~~g~s~r~~~~~d~~~~~~~~r~d~l~g~~~~~p~~~~~~~--g~~ 423 (423) T protein:vir:10 361 GLGSIPLPKLHSIDSAVATYEGFSIRVHKYADGDANVQKMRFDLLPAYVCFNPHMGGQFF--GNP 423 (423) T ss_pred EEEEEcccCCCccceeeccccCceEEEEEeeeccccceEEEEEeecceeeeccceEEEEE--ecC Confidence 75432 11222234444433322222234999999999875442 222 No 45 >protein:vir:174 Length: 423 # NCBI annotation: capsid protein # Family: family:all:1412 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112079;genbank:gi:13559869;genbank:GeneID:920999 Probab=100.00 E-value=5.1e-35 Score=208.57 Aligned_cols=292 Identities=10% Similarity=-0.027 Sum_probs=195.2 Q ss_pred CCCCcccccccccccccHHHHHH-HHHhHHHHHHHHHHhhhcccceee---ec---cccceEEeeeccceeeeeecCC-- Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDSLLI-EKFNGKVNEQYLKGENILSYFDVQ---TV---TGTNTVSNKYLGETELQVLAPG-- 71 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~alfl-e~f~geV~t~f~~~sv~~~~~~~r---ti---~~Gksv~f~~iG~~t~~~~~~G-- 71 (402) |+ | ....|+ ++|+.+.+..|++..+|.++++.. .+ +.|+||+|++.+..+++.+... T Consensus 1 Ma--N------------~llT~ip~iia~~al~~l~~~lV~~~lVnr~y~~e~~~~k~GDTV~I~~p~~~~~~~~~~~~~ 66 (423) T protein:vir:17 1 MP--N------------NLDSNVSQIVLKKFLPGFMSDLVLAKTVDRQLLAGEINSSTGDSVSFKRPHQFSSLRTPTGDI 66 (423) T ss_pred Cc--c------------chhhhhHHHHHHHHHHHHHhhcccchhhcccCCcchhhcccCCEEEEeeCCcceeecccCccc Confidence 44 1 123454 999999999999999999988643 23 3599999999999999988654 Q ss_pred CCCCCCCccccceeEeecceeeccchhhhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccccc Q lcl|Aclame:pro 72 QSPNATPTQADKNQLVIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPR 151 (402) Q Consensus 72 ~~i~~~~~~~~e~~itID~~lya~~~IddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~ 151 (402) ..+..+.+...+..|+||+.+|+.+.++|.|..+.--| + .++.+.++++||+.+|+.++..+++.+.. 
T Consensus 67 ~~~~~~~l~e~~v~l~id~~k~va~~v~d~E~~~~i~~-~-~~~l~~A~~aLA~~vd~~ia~~~~~~a~~---------- 134 (423) T protein:vir:17 67 SGQNKNNLISGKATGRVGNYITVAVEYQQLEEAIKLNQ-L-EEILAPVRQRIVTDLETELAHFMMNNGAL---------- 134 (423) T ss_pred CCcccCccccceeEEEeeceeeeeeeecHHHHhcChhH-H-HHHHHHHHHHHHHHHHHHHHHHHhhcccc---------- Confidence 34677888888999999999999999999999865544 7 56778889999999999998877653210 Q ss_pred ccccccccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccccCcccccce Q lcl|Aclame:pro 152 VKGHGFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATINGF 231 (402) Q Consensus 152 ~~g~~~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~~~~G~ 231 (402) . .+. .+.+ ...|+.|.+++.+|+|++||..|||+||+||+|..|++++++...+.+. ....+++|. T Consensus 135 ~--~gt-----~~t~------~~a~~~i~~a~~~Ld~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~-~~~alr~g~ 200 (423) T protein:vir:17 135 S--LGS-----PNTP------ITKWSDVAQTASFLKDLGVNEGENYAVMDPWSAQRLADAQTGLHASDQL-VRTAWENAQ 200 (423) T ss_pred c--ccc-----CCcc------cccHHHHHHHHHHHHhccCCcCCCEEEeChHHHHHHhccccceeccccc-chHHHhhcc Confidence 0 000 0011 1237889999999999999999999999999999999866544343333 333478888 Q ss_pred E-EEEeccEEEecCccccccCcccccc----------------------------------------ccccC-------- Q lcl|Aclame:pro 232 V-LSSYNCPVIPSNRFPTFAQDQAHHL----------------------------------------LSNED-------- 262 (402) Q Consensus 232 V-~~iaG~~V~~SNnlP~~~~~~t~~~----------------------------------------ls~a~-------- 262 (402) | ++++||+||+|||+|....+..+.. 
.+.+| T Consensus 201 i~G~i~GFdvy~Snnip~~T~gt~~~t~~~~~~~~v~~~a~~~~~~~~~~~~~~~~~~~g~l~~GD~~t~aGv~~v~~~t 280 (423) T protein:vir:17 201 IPTNFGGIRALMSNGLASRTQGAFGGTLTVKTQPTVTYNAVKDSYQFTVTLTGATTSVTGFLKAGDQVKFTNTYWLQQQT 280 (423) T ss_pred ceeeecceEEEEeCCCccccccceeceeeecccccccccccccccceeeeeeeeeeeccCceeecceEEecceeeecccc Confidence 7 8999999999999996432221100 00000 Q ss_pred ----------Cccccceeeec----------------------------------------------cceeEEeecHHHh Q lcl|Aclame:pro 263 ----------NGYRYDPIAEM----------------------------------------------NGAVAVLFTSDAL 286 (402) Q Consensus 263 ----------~G~~~~~~ad~----------------------------------------------~~~~al~fh~~Av 286 (402) ....|.+.++- ....-|+||++|+ T Consensus 281 k~v~~~~~t~~~~~~~v~~~~~~~a~~~~tv~i~p~~i~~~~~~~~~~v~a~~a~~~~vT~~~~a~~t~~~nl~~~~~a~ 360 (423) T protein:vir:17 281 KQALYNGATPISFTATVTADANSDSSGDVTVTLSGVPIYDTTNPQYNSVSRQVAAGDAVSVVGTASQTMKPNLFYNKFFC 360 (423) T ss_pred cccccccccccceEEEEEecccccccCceEEEecCccccccCCcccccceecccCCceeeccccccCCeeEEEEecCcce Confidence 00111111110 0113379999987 Q ss_pred hhhhh-----------------cccceeeccchhHHHHHHHHHHHhcCcccccceEEEEEEeecc Q lcl|Aclame:pro 287 LVGRT-----------------IEVTGDIFYEKKEKTYYIDTFMAEGAIPDRWEAVSVVTTKRDA 334 (402) Q Consensus 287 ~tv~~-----------------~dl~~e~~~d~~~~~d~i~~~~a~Ga~vlRPeaa~vv~~~~~~ 334 (402) ..+.. +.+.+-.+||.+..-..+.==..||.+.+|||.++-|- +.. 
T Consensus 361 ~l~~~pl~~~~~~~~~~~~~~g~s~r~~~~~d~~~~~~~~r~d~l~g~~~~~p~~~~~~~--g~~ 423 (423) T protein:vir:17 361 GLGSIPLPKLHSIDSAVATYEGFSIRVHKYADGDANVQKMRFDLLPAYVCFNPHMGGQFF--GNP 423 (423) T ss_pred EEEEEcccCCCccceeecccCCcEEEEEEecccccceeEEEEEeecceeeeccceEEEEE--ecC Confidence 75432 11112223444332211222234999999999875442 222 No 46 >protein:vir:9820 Length: 272 # NCBI annotation: putative major capsid/head protein # Family: family:all:522 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795582;genbank:gi:28876339;genbank:GeneID:1257858 Probab=100.00 E-value=7.1e-34 Score=202.30 Aligned_cols=267 Identities=14% Similarity=0.055 Sum_probs=212.5 Q ss_pred CCCCcccccccccccccHHHHHH-HHHhHHHHHHHHHHhhhcccceee-ecc--ccceEEeeeccc-eeeeeecCCCCCC Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDSLLI-EKFNGKVNEQYLKGENILSYFDVQ-TVT--GTNTVSNKYLGE-TELQVLAPGQSPN 75 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~alfl-e~f~geV~t~f~~~sv~~~~~~~r-ti~--~Gksv~f~~iG~-~t~~~~~~G~~i~ 75 (402) |++-++. .-++++ |+|+-.|...+.+.+++.++..+- ++. .|++++||+.+. ..+..+.-|+.++ T Consensus 1 MA~~~T~----------~~~~~iPev~s~~v~~~~~~~~~~~~~~~~~~~~~g~~G~tv~iP~~~~~~~a~~v~eg~~i~ 70 (272) T protein:vir:98 1 MAVGTTK----------MAQMLDPEVLADMIDAEVGKAIRFAPLAEVDTTLEGQPGTTLTVPKWDYIGDAEDVAEGEAIP 70 (272) T ss_pred CCCcccc----------chheechHHHHHHHHHHHHHHhhhhccccccccccCCCCCEEEEEEecCCCCcccccCCCccc Confidence 8766532 224667 999999999999999999888754 343 599999999763 4677888899999 Q ss_pred CCCccccceeEeecceeeccchhhhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccccccccc Q lcl|Aclame:pro 76 ATPTQADKNQLVIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKGH 155 (402) Q Consensus 76 ~~~~~~~e~~itID~~lya~~~IddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~g~ 155 (402) .+.+..++.++++++ ....+.+.|++..++..| +.+++.+++++++++++|+.++..+..+... 
T Consensus 71 ~~~~~~~~~~~~~~~-~~~~~~itd~~~~~s~~d-~~~~~~~~~~~~~a~~~d~~i~~~~~~a~~~-------------- 134 (272) T protein:vir:98 71 MTQLGFKKTTMTIKK-AGKGVEITDEAILSGYGD-PVGQAAKQIVEAIDHKVDADVLDALSKSTQT-------------- 134 (272) T ss_pred ccccccceEEEEeee-eeeeeeecHHHHhhcccc-HHHHHHHHHHHHHHHHHHHHHHHHhcccccc-------------- Confidence 999999999999988 456789999999999999 7899999999999999999988665432110 Q ss_pred ccccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccccCcccccceEEEE Q lcl|Aclame:pro 156 GFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATINGFVLSS 235 (402) Q Consensus 156 ~~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~~~~G~V~~i 235 (402) ++ ...-++.|.++..+|++.+ ...|+++|+|++|..|+++..+...+.+..+++...+|.++++ T Consensus 135 -----~~---------~~~t~d~i~da~~~l~~~~--~~~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~~g~ig~i 198 (272) T protein:vir:98 135 -----VE---------ATATVDGVSKALDIFNDED--DAETVIVMNPADASTLRLDAAKEWLGATEVGANRVVSGVYGEV 198 (272) T ss_pred -----cc---------cccCHHHHHHHHHHHhccC--CCccEEEEcHHHHHHHHHhccccccccccccccccccccchhh Confidence 00 0012678999999998775 4578999999999999987533222333445566778999999 Q ss_pred eccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccchhHHHHHHHHHHHh Q lcl|Aclame:pro 236 YNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKEKTYYIDTFMAE 315 (402) Q Consensus 236 aG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~~~~d~i~~~~a~ 315 (402) +|++|++||++|... 
+++|++.|++.+...++..|.+|++.++.|.|.+++.| T Consensus 199 ~G~~Vi~s~~~p~~t---------------------------~~~~~~~a~~~~~~~~~~ve~~r~~~~~~~~i~~~~~~ 251 (272) T protein:vir:98 199 LGVQIVRSRKCPKGT---------------------------AYMVRKGALRIMLKRNTMVETDRDITKAINQIVANKHY 251 (272) T ss_pred cCeeEEEcCCCCcce---------------------------EEEEcCCeEEEEecCCceeeeccccccceeEEEEEEEE Confidence 999999999998311 46788899999999999999999999999999999999 Q ss_pred cCcccccceEEEEEEeeccCccc Q lcl|Aclame:pro 316 GAIPDRWEAVSVVTTKRDATTGD 338 (402) Q Consensus 316 Ga~vlRPeaa~vv~~~~~~t~~~ 338 (402) |.+++||++.+.++++. .... T Consensus 252 ~~~v~~~~~vv~~t~~~--a~~~ 272 (272) T protein:vir:98 252 GVYLYKAEKAVKITLKD--AAKK 272 (272) T ss_pred EEEEEcCCceEEEEecc--cccC Confidence 99999999888887652 1111 No 47 >protein:vir:3033 Length: 272 # NCBI annotation: major capsid protein # Family: family:all:522 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438146;genbank:gi:16271809;genbank:GeneID:929235 Probab=100.00 E-value=7.1e-34 Score=202.30 Aligned_cols=267 Identities=14% Similarity=0.055 Sum_probs=212.5 Q ss_pred CCCCcccccccccccccHHHHHH-HHHhHHHHHHHHHHhhhcccceee-ecc--ccceEEeeeccc-eeeeeecCCCCCC Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDSLLI-EKFNGKVNEQYLKGENILSYFDVQ-TVT--GTNTVSNKYLGE-TELQVLAPGQSPN 75 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~alfl-e~f~geV~t~f~~~sv~~~~~~~r-ti~--~Gksv~f~~iG~-~t~~~~~~G~~i~ 75 (402) |++-++. .-++++ |+|+-.|...+.+.+++.++..+- ++. .|++++||+.+. 
..+..+.-|+.++ T Consensus 1 MA~~~T~----------~~~~~iPev~s~~v~~~~~~~~~~~~~~~~~~~~~g~~G~tv~iP~~~~~~~a~~v~eg~~i~ 70 (272) T protein:vir:30 1 MAVGTTK----------MAQMLDPEVLADMIDAEVGKAIRFAPLAEVDTTLEGQPGTTLTVPKWDYIGDAEDVAEGEAIP 70 (272) T ss_pred CCCcccc----------chheechHHHHHHHHHHHHHHhhhhccccccccccCCCCCEEEEEEecCCCCcccccCCCccc Confidence 8766532 224667 999999999999999999888754 343 599999999763 4677888899999 Q ss_pred CCCccccceeEeecceeeccchhhhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccccccccc Q lcl|Aclame:pro 76 ATPTQADKNQLVIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKGH 155 (402) Q Consensus 76 ~~~~~~~e~~itID~~lya~~~IddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~g~ 155 (402) .+.+..++.++++++ ....+.+.|++..++..| +.+++.+++++++++++|+.++..+..+... T Consensus 71 ~~~~~~~~~~~~~~~-~~~~~~itd~~~~~s~~d-~~~~~~~~~~~~~a~~~d~~i~~~~~~a~~~-------------- 134 (272) T protein:vir:30 71 MTQLGFKKTTMTIKK-AGKGVEITDEAILSGYGD-PVGQAAKQIVEAIDHKVDADVLDALSKSTQT-------------- 134 (272) T ss_pred ccccccceEEEEeee-eeeeeeecHHHHhhcccc-HHHHHHHHHHHHHHHHHHHHHHHHhcccccc-------------- Confidence 999999999999988 456789999999999999 7899999999999999999988665432110 Q ss_pred ccccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccccCcccccceEEEE Q lcl|Aclame:pro 156 GFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATINGFVLSS 235 (402) Q Consensus 156 ~~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~~~~G~V~~i 235 (402) ++ ...-++.|.++..+|++.+ ...|+++|+|++|..|+++..+...+.+..+++...+|.++++ T Consensus 135 -----~~---------~~~t~d~i~da~~~l~~~~--~~~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~~g~ig~i 198 (272) T protein:vir:30 135 -----VE---------ATATVDGVSKALDIFNDED--DAETVIVMNPADASTLRLDAAKEWLGATEVGANRVVSGVYGEV 198 (272) T ss_pred -----cc---------cccCHHHHHHHHHHHhccC--CCccEEEEcHHHHHHHHHhccccccccccccccccccccchhh Confidence 00 0012678999999998775 4578999999999999987533222333445566778999999 Q ss_pred 
eccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccchhHHHHHHHHHHHh Q lcl|Aclame:pro 236 YNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKEKTYYIDTFMAE 315 (402) Q Consensus 236 aG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~~~~d~i~~~~a~ 315 (402) +|++|++||++|... +++|++.|++.+...++..|.+|++.++.|.|.+++.| T Consensus 199 ~G~~Vi~s~~~p~~t---------------------------~~~~~~~a~~~~~~~~~~ve~~r~~~~~~~~i~~~~~~ 251 (272) T protein:vir:30 199 LGVQIVRSRKCPKGT---------------------------AYMVRKGALRIMLKRNTMVETDRDITKAINQIVANKHY 251 (272) T ss_pred cCeeEEEcCCCCcce---------------------------EEEEcCCeEEEEecCCceeeeccccccceeEEEEEEEE Confidence 999999999998311 46788899999999999999999999999999999999 Q ss_pred cCcccccceEEEEEEeeccCccc Q lcl|Aclame:pro 316 GAIPDRWEAVSVVTTKRDATTGD 338 (402) Q Consensus 316 Ga~vlRPeaa~vv~~~~~~t~~~ 338 (402) |.+++||++.+.++++. .... T Consensus 252 ~~~v~~~~~vv~~t~~~--a~~~ 272 (272) T protein:vir:30 252 GVYLYKAEKAVKITLKD--AAKK 272 (272) T ss_pred EEEEEcCCceEEEEecc--cccC Confidence 99999999888887652 1111 No 48 >protein:vir:105522 Length: 423 # NCBI annotation: phage major head protein # Family: family:all:1412 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516191;genbank:gi:89885994;genbank:GeneID:3964382 Probab=99.95 E-value=1.4e-31 Score=189.68 Aligned_cols=291 Identities=9% Similarity=-0.023 Sum_probs=189.4 Q ss_pred CCCCcccccccccccccHHHHHH-HHHhHHHHHHHHHHhhhcccceee---ec---cccceEEeeeccceeeeeecCCCC Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDSLLI-EKFNGKVNEQYLKGENILSYFDVQ---TV---TGTNTVSNKYLGETELQVLAPGQS 73 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~alfl-e~f~geV~t~f~~~sv~~~~~~~r---ti---~~Gksv~f~~iG~~t~~~~~~G~~ 73 (402) |+ |+ ..-|+ ++|+.|.+..|++..+|.++++.. .+ +.|+||+|++.+..++.+. ++.. 
T Consensus 1 MA--Ns------------l~~l~p~iia~~al~~l~~~lV~~~lV~r~y~~ef~~ak~GDTV~I~~P~~~~~~d~-~~~~ 65 (423) T protein:vir:10 1 MA--NN------------LDANVSQIVLKKFLPGFMSDLVLCKTVDRQLLAGEINSSTGDSVSFKRPHQFKSERT-MDGD 65 (423) T ss_pred Cc--cc------------cccccHHHHHHHHHHHHHhhcccchhhccCCCccccccccCCEEEEeeCCceeeecc-cCcc Confidence 54 11 11144 899999999999999999988632 22 3599999999999999764 3444 Q ss_pred CCC---CCccccceeEeecceeeccchhhhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccc Q lcl|Aclame:pro 74 PNA---TPTQADKNQLVIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKP 150 (402) Q Consensus 74 i~~---~~~~~~e~~itID~~lya~~~IddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~ 150 (402) +.+ +.+...+..++||+.+|+.+.++|.|..+.--| + .++.+.+.++||..+|+.++..+.+.+- T Consensus 66 ~t~~~~~~l~e~~v~l~id~~k~~a~~v~d~E~~l~i~~-~-~~~l~~A~~aLA~~vd~~ia~~~~~~~~---------- 133 (423) T protein:vir:10 66 ITGKSKNSLISAKATGEVGNYITVAVEYRQIEEALKLNQ-L-DQILVPINERMVTDLETELALFMMKHGA---------- 133 (423) T ss_pred cCcccccccccceEEEEecceeeeeeeeChHHHhcChhH-H-HHHHHHHHHHHHHHHHHHHHHHhhhccc---------- Confidence 433 456667899999999999999999999865544 7 4677888999999999999866654221 Q ss_pred cccccccccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccccCcccccc Q lcl|Aclame:pro 151 RVKGHGFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATING 230 (402) Q Consensus 151 ~~~g~~~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~~~~G 230 (402) ...| .. +... ..|+.+.+++.+|++.+||..+||+||+||+|..|++++++.....+. 
....+++| T Consensus 134 ~~vg-t~------~t~~------~a~~~~a~a~~~L~~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~-~~~alr~~ 199 (423) T protein:vir:10 134 LSLG-SP------NTPI------KKWSDVAQTASFLKDLGINSGENYAVMDPWAAQRLADAQSGLHVSEQL-VRTAWENA 199 (423) T ss_pred cccc-cc------cccc------ccHHHHHHHHHHHhhccCCcCCCEEEeCHHHHHHHhhhhhhhcccccc-chHHHHhc Confidence 0000 00 0111 137889999999999999999999999999999999876655443332 33346788 Q ss_pred eE-EEEeccEEEecCccccccCc---cccc----c------c---------------------------cccC------- Q lcl|Aclame:pro 231 FV-LSSYNCPVIPSNRFPTFAQD---QAHH----L------L---------------------------SNED------- 262 (402) Q Consensus 231 ~V-~~iaG~~V~~SNnlP~~~~~---~t~~----~------l---------------------------s~a~------- 262 (402) .| ++++||+||+|||+|....+ ++.+ . . +.+| T Consensus 200 ~i~G~~~GFdi~~Sn~vp~~T~g~~~ga~~~~~~~~vt~a~~~~~~~~~~~~~~~T~s~~g~l~~GD~~t~aGv~~v~~~ 279 (423) T protein:vir:10 200 QISGNFGGIRALMSNGLASRTQGAFGGKLTVKGTPEVNYDSVKDSYAFTATLTGATASKKGFLKVGDQLQFDDTHWLNQQ 279 (423) T ss_pred ccceeecceEEEEecCCcccccccccceeeeeeeeEEEecccccccccccceeeccceeceeEEecceEeecceeeeccc Confidence 76 89999999999999952211 1000 0 0 0000 Q ss_pred ---------Cc--cccceeeec----------------------------------------------cceeEEeecHHH Q lcl|Aclame:pro 263 ---------NG--YRYDPIAEM----------------------------------------------NGAVAVLFTSDA 285 (402) Q Consensus 263 ---------~G--~~~~~~ad~----------------------------------------------~~~~al~fh~~A 285 (402) .| ..|.+.++- ....-|+||++| T Consensus 280 tk~~l~~~~~~~~~~~~V~~~~~~~a~~~~tv~i~p~~~~~~~~~~~~~V~a~~a~~~~vT~~~~~~~t~~~nl~~~~~a 359 (423) T protein:vir:10 280 SKQTLYNGASALSFTATVMEDANAHSSGDVTVKISGVPIFDAGYPQYNAVDRLLAEGDTVSVIGTSKQAMKPNLFYNKLF 359 (423) T ss_pred ccceeecccCCcceEEEEEecccccccCceEEEeccccccccCcccccceeccccCCceeEEeeccCCceeEEEEecCcc Confidence 00 011111110 012347999998 Q ss_pred hhhhhh-----------------cccceeeccchhHHHHHHHHHHHhcCcccccceEEEEEEeecc Q lcl|Aclame:pro 286 LLVGRT-----------------IEVTGDIFYEKKEKTYYIDTFMAEGAIPDRWEAVSVVTTKRDA 334 (402) Q Consensus 286 
v~tv~~-----------------~dl~~e~~~d~~~~~d~i~~~~a~Ga~vlRPeaa~vv~~~~~~ 334 (402) +..+.. +.+.+-.+||.+..-..+.==..||.+.+|||.++-|- +.. T Consensus 360 ~~l~~~pl~~~~~~~~~~~~~~g~s~r~~~~~d~~~~~~~~r~d~l~g~~~~~p~~~~~~~--g~~ 423 (423) T protein:vir:10 360 CGLGTIPLPKLHSIDSAVATYEGFSIRVHKYADGDANKQMMRFDLLPAYVCYNPHMGGQFF--GNP 423 (423) T ss_pred eEEEEEcccCCCccceeecccccceEEEEEeeeccccceEEEEEeecceeeeccceEEEEE--ecC Confidence 775432 11222234444433322222234999999999875442 222 No 49 >protein:vir:78920 Length: 290 # NCBI annotation: Cps # Family: family:all:701 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468846;genbank:gi:157325479;genbank:GeneID:5601917 Probab=99.95 E-value=4.2e-31 Score=187.12 Aligned_cols=277 Identities=10% Similarity=0.095 Sum_probs=203.6 Q ss_pred CCCCcccccccccccccHHHHHHHHHhHHHHHHHHHHhhhcccceeeec--cccceEEeeeccceeeeeecCCCCCCCCC Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDVQTV--TGTNTVSNKYLGETELQVLAPGQSPNATP 78 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~alfle~f~geV~t~f~~~sv~~~~~~~rti--~~Gksv~f~~iG~~t~~~~~~G~~i~~~~ 78 (402) |+ ..++ ++|++++++.|.+.+++-.+.+ +.+ .+||+|+||+|+...+++|++++...... T Consensus 1 Ma----------------in~a-~~~~~~Ld~~~~~~~~t~~l~~-~~~~~~ggktVkI~~i~~~gl~DY~R~~g~~~g~ 62 (290) T protein:vir:78 1 MA----------------INYV-DKYGKELDQKLVFGTYTNELET-PNLLWLDAKTFKIQTITTTGLKAHTRNKGYNEGS 62 (290) T ss_pred Cc----------------hhHH-HHHHHHHHHHHHhhheeeeccc-cceeeccCCEEEEeeeccCcccccccCCCcccCc Confidence 43 2222 8999999999999999887764 344 58999999999999999999999888888 Q ss_pred ccccceeEeecceeeccchhh--hHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccccccccc Q lcl|Aclame:pro 79 TQADKNQLVIDTTVIARNTVA--HIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKGHG 156 (402) Q Consensus 79 ~~~~e~~itID~~lya~~~Id--dlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~g~~ 156 (402) +..+..++++|+..++.|.|| |+||.+.... +-..+.+.+.+.++-++|...+..|+..|.... 
T Consensus 63 v~~~~et~tl~qdR~~~F~vD~~DvDEt~~~~~-~~nv~~ef~~~~v~PEiDayr~skla~~a~~~~------------- 128 (290) T protein:vir:78 63 ASNTNKSYTIDFDRDVEFFVDVMDVDETGQALS-AANVTKEFNSRHAGPEMDAYRFSKLATAAKTNS------------- 128 (290) T ss_pred cccceeeEEeeccccceeeccccchhHHhhhhh-HHHHHHHHHHHHhhhhhhHHHHHHHHhhhhccC------------- Confidence 889999999999999999999 9999998876 677888889999999999999988876553110 Q ss_pred cccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccc--cCcccccceEEE Q lcl|Aclame:pro 157 FSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTIS--QSGATINGFVLS 234 (402) Q Consensus 157 ~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~--~~g~~~~G~V~~ 234 (402) . ....+.+++++|++|.++..+||| ||.++||++|+|++|.+|.++++|. +..... +.+ ..+|.|++ T Consensus 129 --~-----~~~~t~t~~n~~~~i~~~~~~lde--vp~~~rvl~vtp~~~~lL~~~~~f~-r~~~~~~~~~~-~i~~~V~~ 197 (290) T protein:vir:78 129 --N-----SVAEEITKDNVFTKLKAAIRKVKK--YGTQNLVMYVSPDVMAALELSDDFV-RAINVQNIGPS-SIETRITA 197 (290) T ss_pred --c-----ccccccCHHHHHHHHHHHHHHHHh--cCCCCeEEEECHHHHHHHhhChhhh-ccccccccccc-cccceeee Confidence 0 011235788999999999999997 8999999999999999999999997 333222 222 34899999 Q ss_pred EeccEEEecC---ccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccchhHH---HHH Q lcl|Aclame:pro 235 SYNCPVIPSN---RFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKEK---TYY 308 (402) Q Consensus 235 iaG~~V~~SN---nlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~~~---~d~ 308 (402) +.||+|++.+ ++-+.-....| |....+-.+.-.++.|+.|+......+ .+..|.+...+ +|+ T Consensus 198 idG~~ii~vps~~r~~t~~~f~~G-----------~~~~~~ak~in~ii~~~~a~i~~~K~~-~~~~~~P~~~~~~d~~~ 265 (290) T protein:vir:78 198 IDGTRIVEVEAEDRFYDTFDFTDG-----------YKPAAGAKKLNFLLVNKGSVVGGAKHA-SIYLHAPGSVGQGDGWL 265 (290) T ss_pred ecCcEEEEecccchhhhhhhhccc-----------ccccCCccceeEEEEcCCceeeeeeee-EEEeeCCCCCcCcceee Confidence 9999999865 22211110111 111122223346888999988877777 56666555432 467 Q ss_pred HHHHHHhcCcccccceEEEE-EEee Q lcl|Aclame:pro 309 
IDTFMAEGAIPDRWEAVSVV-TTKR 332 (402) Q Consensus 309 i~~~~a~Ga~vlRPeaa~vv-~~~~ 332 (402) +..+.-+..-++.....++. .+.. T Consensus 266 ~~~r~y~d~~v~~nk~~~i~~~~~~ 290 (290) T protein:vir:78 266 YQYRVYHDIFVLDQQKDGVIASTEV 290 (290) T ss_pred eeeeeeeeeeeeccccCeeEEEeeC Confidence 77777777777655433322 1111 No 50 >protein:vir:739 Length: 231 # NCBI annotation: major structural protein 4 # Family: family:all:522 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108716;genbank:gi:13487838;genbank:GeneID:920884 Probab=99.94 E-value=2e-30 Score=183.45 Aligned_cols=230 Identities=13% Similarity=0.055 Sum_probs=190.7 Q ss_pred eeeccccceEEee-eccceeeeeecCCCCCCCCCccccceeEeecceeeccchhhhHHHhhcCccchhHHHHHHHHHHHH Q lcl|Aclame:pro 46 VQTVTGTNTVSNK-YLGETELQVLAPGQSPNATPTQADKNQLVIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLK 124 (402) Q Consensus 46 ~rti~~Gksv~f~-~iG~~t~~~~~~G~~i~~~~~~~~e~~itID~~lya~~~IddlDe~q~~~D~vrse~s~~~G~aLA 124 (402) .--+..|++++|| +|| .++.+.-|++|+.+.+..++.+.+|.+. .-.+.|+|++..++..| .-.|.++|+|.+|| T Consensus 1 ~~~~~~Gdtit~P~~iG--da~~v~eG~~i~~~~l~~t~~~atIk~~-gk~~~itD~a~l~~~gD-p~~ea~~Q~~~~iA 76 (231) T protein:vir:73 1 ENGINLANLCEYPNDIG--DAADVAEGGEISLDKIGTTTKSVTIKKA-AKGTEITDEAALSGYGD-PIGESNKQLGLSLA 76 (231) T ss_pred CccccCCceEEeccccc--chhhhcCCCcCChhhccccceeeeEeee-ccceeeeHHHHhhccCc-hHHHHHHHHHHHHH Confidence 2336789999998 566 4578999999999999999999999764 78999999999999888 78999999999999 Q ss_pred HHHHHHHHHHHHhhhhhccccccccccccccccccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHH Q lcl|Aclame:pro 125 RLEDQMAIQQMLLGGIANTKAERNKPRVKGHGFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKF 204 (402) Q Consensus 125 ~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~g~~~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~ 204 (402) +++|..++..+.++.... .. 
..+ ++.|.+|..+|.+++ ..++|+||+|++ T Consensus 77 ~kvD~di~~~~~~a~l~~---------------------~~---~~t----~d~i~~A~~~fgde~--~~~~vivv~p~~ 126 (231) T protein:vir:73 77 NKVDDDLLKAAKTTSQTV---------------------ST---KAN----VDGVQAALDIFNDED--AQAYVLIVNPKD 126 (231) T ss_pred HhhhHHHHHhhccccccc---------------------cc---ccc----HHHHHHHHHHhcccc--ccceEEEEcchH Confidence 999999887665433211 00 111 678899999998876 367899999999 Q ss_pred HHHHhcccchhhcccccccCcccccceEEEEeccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHH Q lcl|Aclame:pro 205 FNALRDADRIVDKTYTISQSGATINGFVLSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSD 284 (402) Q Consensus 205 y~~Ll~~~r~~n~d~~~~~~g~~~~G~V~~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~ 284 (402) |+.|.++.++.+.. +..+++..++|.|+++.|++|+.|+++|.+.. ...+ +++.+. T Consensus 127 ~~~Lrk~~~~~~~~-~~~g~~i~~~G~iG~i~G~~Vi~S~~~~~~~~-------------~~~~----------~i~~~g 182 (231) T protein:vir:73 127 AAKIRKDANAKNIG-SEVGANALINGTYADVLGAQIVRSKKLAEGSA-------------LMFK----------IVSNSP 182 (231) T ss_pred HHhhhhccchhhhh-hhhccceeeecccceEcceEEEEcCCCCCCce-------------eeee----------EEeecc Confidence 99999988877542 34567778999999999999999999995331 1111 345688 Q ss_pred HhhhhhhcccceeeccchhHHHHHHHHHHHhcCcccccceEEEEEEeec Q lcl|Aclame:pro 285 ALLVGRTIEVTGDIFYEKKEKTYYIDTFMAEGAIPDRWEAVSVVTTKRD 333 (402) Q Consensus 285 Av~tv~~~dl~~e~~~d~~~~~d~i~~~~a~Ga~vlRPeaa~vv~~~~~ 333 (402) |++....+++..|.+||+.++.+.|.+.+.|++++.+|+.++.+++++- T Consensus 183 Al~~~~k~~~~vEtdRd~~~k~~~i~~~~~y~v~l~~~~~vv~~t~~g~ 231 (231) T protein:vir:73 183 ALKLVLKRGVQVETDRDIVTKTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) T ss_pred ceeeeecccceeeccccccccccEEEEeEEEEEEEEcCccEEEEEeecC Confidence 9999999999999999999999999999999999999999999988866 No 51 >protein:vir:105464 Length: 346 # NCBI annotation: putative phage major capsid protein # Family: family:all:701 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529874;genbank:gi:90592614;genbank:GeneID:3974528 Probab=99.92 E-value=8.8e-28 Score=168.91 Aligned_cols=330 
Identities=11% Similarity=0.062 Sum_probs=209.4 Q ss_pred CCCCcccccccccccccHHHHHHHHHhHHHHHHHHHHhhhcccc-e---eeec--cccceEEeeecc-ceeeeeecCCCC Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYF-D---VQTV--TGTNTVSNKYLG-ETELQVLAPGQS 73 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~alfle~f~geV~t~f~~~sv~~~~~-~---~rti--~~Gksv~f~~iG-~~t~~~~~~G~~ 73 (402) |+.. +-++|+.++++.|..+++.-... + ...+ .|||+|+||.|. .+..++|++..- T Consensus 1 Main-----------------ya~~~~~~Ld~~~~~~~lts~~l~~~~~~~~v~~~ggktVkIp~is~tsGl~DY~R~~g 63 (346) T protein:vir:10 1 MTIN-----------------YAEKYQAAVQQAFYDGHLYSAELWNSPSNSIIKFDGAKHIKVPRLEITSGRKDRQRRTI 63 (346) T ss_pred Ccch-----------------hHHHHHHHHHHHHHhhhccchhhcccccccceEecCCCEEEEEEeeeecccccccccCC Confidence 4431 13889999999998887663222 1 2223 489999999995 567899988776 Q ss_pred CCC-CCccccceeEeecceeeccchhh--hHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccc Q lcl|Aclame:pro 74 PNA-TPTQADKNQLVIDTTVIARNTVA--HIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKP 150 (402) Q Consensus 74 i~~-~~~~~~e~~itID~~lya~~~Id--dlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~ 150 (402) ... ..+..+..++++|+..++.|.|| |+||.+.... +-..+.+.+-...+-++|...|..|+..+.... T Consensus 64 ~~~~g~v~~~~et~tl~qDR~~~F~vD~mDvDETn~~~~-~anv~~ef~r~~vvPEiDayrfskLa~~a~~~~------- 135 (346) T protein:vir:10 64 TTPVANYSNDWDSYELKNERYWSTLVDPSDIDETNMVVS-LANITKQFNLDSKMPEKDRYMFSHLYSGKEAAH------- 135 (346) T ss_pred cccccccccceeEEEeeccccceecccccchHHHHHHhH-HHHHHHHHHHHhhcchhhHHHHHHHHHhhhhhc------- Confidence 654 56888999999999999999999 8888776554 455555556667777889998888875543211 Q ss_pred cccccccccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccccCcccccc Q lcl|Aclame:pro 151 RVKGHGFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATING 230 (402) Q Consensus 151 ~~~g~~~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~~~~G 230 (402) . ....+.+.+++++|++|.++..+|+|+.||.++||++|+|++|.+|.++++|. +.....+.+. 
.+| T Consensus 136 ------~-----~~~~~~a~T~~ni~~~i~~~~~~lde~~vp~~~rvl~vTp~~~~lLk~s~~f~-k~~~v~~~~~-i~~ 202 (346) T protein:vir:10 136 ------D-----GGITTNTLDEKNILPAFDNMMLDFDEARIPSTNRILYVTPKTNAILKRAEAMN-RALTLKDPNN-IQR 202 (346) T ss_pred ------c-----ccccccccCHHHHHHHHHHHHHHHHHccCCCCCeEEEECHHHHHHHhhchhhe-eccccccccc-cce Confidence 0 01112345788999999999999999999999999999999999999999986 6555544444 599 Q ss_pred eEEEEeccEEEe--cCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccch-hHH-H Q lcl|Aclame:pro 231 FVLSSYNCPVIP--SNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEK-KEK-T 306 (402) Q Consensus 231 ~V~~iaG~~V~~--SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~-~~~-~ 306 (402) .|++++||+|++ |++|++.-.-..| ++....-...-.++.|+.|+......+ .+..|.+. .+. . T Consensus 203 ~V~siDGv~Ii~VPs~r~~t~~~f~~G-----------~~~~t~ak~INfiiv~~~A~ia~~K~~-~~~if~P~~~~~g~ 270 (346) T protein:vir:10 203 TVYSLDDVTIRVVPSDLMQTAYDFSDG-----------SKIIDTAKQIEMFLIYNGVQIAPEKYS-FVGFDQPSAATSGN 270 (346) T ss_pred eeeeecCeEEEEcchhhcccchhhccC-----------ccccCCccceeEEEECCceeeeeeeee-eeEeeCCCCCcccc Confidence 999999999977 7788743221111 111122223446888999888776665 34444433 222 2 Q ss_pred HHHHHHHHhcCcccccceEEEEEE--eeccCccccccchhhHHHhhhcccceEEEeecchhhhhhhhcccccchhHHHHH Q lcl|Aclame:pro 307 YYIDTFMAEGAIPDRWEAVSVVTT--KRDATTGDAGGPGDDHATVLARAQRKAVYVKTEGAAAAFSAAPAGIQAEDLVAA 384 (402) Q Consensus 307 d~i~~~~a~Ga~vlRPeaa~vv~~--~~~~t~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 384 (402) |++..+.-+..-|+.....++..- +..++..+.++.-+.-+.=.--..-|+-|++--+-=+.-| .-.||.+- T Consensus 271 ~l~~~R~Y~D~fv~~nk~~~Iyv~~~~a~~~~~~~~~~~~kpt~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~ 344 (346) T protein:vir:10 271 YLYYEQSYDDVLLLNTKTKGIQFVVSDKPKKDQEQSGQDAKPTAESTLEEIKAYLDKNHIDYTGKT------KKDELLAL 344 (346) T ss_pred eeeeeeeeeeeeeeccccceEEEeeecccccCccCcccccCcccccchHHHHHHhccccccccccc------chhhHHhh Confidence 677777777777776554433211 2233333333221111110000112333333322211111 23455555 Q ss_pred HH Q lcl|Aclame:pro 385 VR 386 (402) Q 
Consensus 385 ~~ 386 (402) |. T Consensus 345 ~~ 346 (346) T protein:vir:10 345 VK 346 (346) T ss_pred cC Confidence 44 No 52 >protein:vir:102335 Length: 312 # NCBI annotation: putative capsid protein # Family: family:all:701 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529560;genbank:gi:90592716;genbank:GeneID:3974467 Probab=99.91 E-value=3.1e-27 Score=165.92 Aligned_cols=299 Identities=13% Similarity=0.039 Sum_probs=206.0 Q ss_pred CCCCcccccccccccccHHHHHHHHHhHHHHHHHHHHhhhccccee-ee--ccccceEEeeeccceeeeeecCCCC--CC Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDV-QT--VTGTNTVSNKYLGETELQVLAPGQS--PN 75 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~alfle~f~geV~t~f~~~sv~~~~~~~-rt--i~~Gksv~f~~iG~~t~~~~~~G~~--i~ 75 (402) |+ |+ . -+.++|+.++++.|...+++-.+..- .. ..|||+|+||+|.....++|++++. .+ T Consensus 1 Ma--nt------------l-~ya~~~~~~LD~~~~~~~~s~~l~~~~~~v~~~ggktVkIp~i~~~gl~DY~R~~g~~~~ 65 (312) T protein:vir:10 1 MA--NT------------L-AYGQVLQQGLDKQATQELLTGWMDSNAKQIKYEGGKEVKIGKLSTDGLGDYSRGSANAYV 65 (312) T ss_pred CC--cc------------h-hHHHHHHHHHHHHHHhhhccccccCCCceEEEecCcEEEEEeeecccccccccccCCccc Confidence 54 11 1 24489999999999999987766421 22 4689999999999999999999877 44 Q ss_pred CCCccccceeEeecceeeccchhh--hHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccccccc Q lcl|Aclame:pro 76 ATPTQADKNQLVIDTTVIARNTVA--HIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVK 153 (402) Q Consensus 76 ~~~~~~~e~~itID~~lya~~~Id--dlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~ 153 (402) ...+..+..+.++++..++.|.|| |+||.+.... +-..+.+.+-+...-++|...|..|+..|... 
T Consensus 66 ~g~v~~~~et~tl~qDR~~~F~vD~mDvDETn~~~s-~anv~~ef~r~~vvPEiDayrfskla~~a~~~----------- 133 (312) T protein:vir:10 66 GGDVKFEYETKTMTQDRGRKFTLDAMDVDETNFLVT-ATTVMGEFQRLKVIPEIDAYRLSRLATIAIGI----------- 133 (312) T ss_pred cccccccceeEEeeecccceeeccccchhhHhhHHH-HHHHHHHHHHhhhcchhhHHHHHHHHhhhhcc----------- Confidence 446889999999999999999999 9999987776 67777777888999999999988887655321 Q ss_pred ccccccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccccCcccccceEE Q lcl|Aclame:pro 154 GHGFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATINGFVL 233 (402) Q Consensus 154 g~~~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~~~~G~V~ 233 (402) +.... ...+.+.+.+++|++|.++.++|||..|| .+|+++|+|++|.+|-++..+. ..-...+.+. .+|.|+ T Consensus 134 --~~~~~---~~~~~~~T~~ni~~~i~~~~~~lde~~vp-~~rvl~vTp~~~~lLk~~~~~~-~~~~~~~~~~-i~~~V~ 205 (312) T protein:vir:10 134 --KGDTN---VEYSYSVNSSTIINKIKTGIKIIRENGYN-GPLVCHLTYDSMFAIEEKVLEK-LTAVTFAQGG-IQTQVP 205 (312) T ss_pred --ccccc---cccccccCHHHHHHHHHHHHHHHHHccCC-CceEEEeChHHHHHHhhhhhce-ecccccccce-eeeeee Confidence 01011 12233457889999999999999999999 6999999999986666543222 2212223333 589999 Q ss_pred EEeccEEEec--CccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccchhH-H--HHH Q lcl|Aclame:pro 234 SSYNCPVIPS--NRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKE-K--TYY 308 (402) Q Consensus 234 ~iaG~~V~~S--NnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~~-~--~d~ 308 (402) ++.||+|++. ++|.+.-.-..|. .+. ...+.|....+-.+.-.++.|+.|+......+ .+..|.+... . 
+|+ T Consensus 206 ~iDgv~Ii~VPs~r~~t~~~f~dG~-t~~-~~~gg~~~~~~ak~INfiiv~~~a~i~~~K~~-~~~if~P~~~~~~d~~~ 282 (312) T protein:vir:10 206 SIDGCALIKTPQNRMYSSILLNDGT-TSN-QTAGGYLKGTKALDTNFIIAPVDVPLAITKQD-KMRIFDPETNQTANAWS 282 (312) T ss_pred eecccEEEEchhhhccceeeeccCc-ccc-cccCceeecCcccccceEEeCCceeeceeeee-eeeeeCCCCCCCcceee Confidence 9999999874 3343211100000 000 01123444444445557899999888777666 4555544333 2 488 Q ss_pred HHHHHHhcCcccccceEEE-EEEeeccCcc Q lcl|Aclame:pro 309 IDTFMAEGAIPDRWEAVSV-VTTKRDATTG 337 (402) Q Consensus 309 i~~~~a~Ga~vlRPeaa~v-v~~~~~~t~~ 337 (402) +..+.-+..-|+.....++ +..+...+.| T Consensus 283 ~~~R~Y~D~fv~~nk~~~Iyv~~k~a~~~~ 312 (312) T protein:vir:10 283 MDYRRYHDLWVTDNKANSVYANFKDAKPVG 312 (312) T ss_pred eeeeeeeeeeeeccccCeEEEEeecccCCC Confidence 8888888888887776665 3333322222 No 53 >protein:vir:95107 Length: 270 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240822;genbank:gi:66394683;genbank:GeneID:5133901 Probab=99.90 E-value=7.9e-27 Score=163.69 Aligned_cols=264 Identities=14% Similarity=0.027 Sum_probs=203.5 Q ss_pred CCCCcccccccccccccHHHHHH-HHHhHHHHHHHHHHhhhcccceeeec---cccceEEeeeccce-eeeeecCCCCCC Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDSLLI-EKFNGKVNEQYLKGENILSYFDVQTV---TGTNTVSNKYLGET-ELQVLAPGQSPN 75 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~alfl-e~f~geV~t~f~~~sv~~~~~~~rti---~~Gksv~f~~iG~~-t~~~~~~G~~i~ 75 (402) |+- +.--++.+ |+|+..|.+++.+..+|.++..+.+. 
++|++++||...-+ .++.+.-|+.|+ T Consensus 1 Ma~------------T~~~d~I~Pev~~~~V~e~~~~~~~~~~~~~~d~~L~g~~G~ti~~P~~~~igdae~~~eg~~i~ 68 (270) T protein:vir:95 1 MTQ------------TKKANLINPEVLANVVSAQMQNAIRFTPYAVTDDTLVGQPGDTITRPKYAYIGAAEDLQEGVAMD 68 (270) T ss_pred CCc------------eehhhhcchHHHHHHHHHHHHhHHhhccccccccccCCCCCCEEEeeeecCCCccccccCCCccc Confidence 442 12223434 99999999999999999999887653 56999999875432 456788899999 Q ss_pred CCCccccceeEeecceeeccchhhhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccccccccc Q lcl|Aclame:pro 76 ATPTQADKNQLVIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKGH 155 (402) Q Consensus 76 ~~~~~~~e~~itID~~lya~~~IddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~g~ 155 (402) .+.+..++.+.+|-+. --.+.++|++...+.-| .-.+.++++|.++|+++|..++..+..+.... T Consensus 69 ~~~lt~~~~~a~i~~~-gk~~~itD~a~~~~~~d-p~~~~~~q~a~~~a~~~d~~li~~l~~a~~~~------------- 133 (270) T protein:vir:95 69 TTQMSMTTTKVTVKET-GKAVEVTQTAIITNVNG-TLQEASRQLAMSLADKVEIDYIAELNKSKQTA------------- 133 (270) T ss_pred hhhcccchheeeeehh-hCcceecHHHHhhhccc-hHHHHHHHHHHHHHHHHHHHHHHHhccccccc------------- Confidence 9999999999999664 67888999999888777 67889999999999999998876664321110 Q ss_pred ccccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccccCcccccceEEEE Q lcl|Aclame:pro 156 GFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATINGFVLSS 235 (402) Q Consensus 156 ~~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~~~~G~V~~i 235 (402) +...+ ++.|.++..+|.+. +....+++|+|+.|..|.++..+. 
+...+++...+|.++.+ T Consensus 134 -----------~~~~t----~~~~~dA~~~lgd~--~~~~~~i~vhs~~~~~Lrk~~~~~---~~~~~~~~~~~G~ig~~ 193 (270) T protein:vir:95 134 -----------TVSAD----ATGILDAIEVFNSE--NDEDYVLYVNPKDYNKLVKSLFKV---GGNVQDRAISKGDLVEI 193 (270) T ss_pred -----------ccccC----HHHHHHHHHHhccc--cCCCcEEEEcHHHHHHHHhhhccc---ccccccchhccccccee Confidence 01112 45677788888543 233468999999999999887554 33445666789999999 Q ss_pred eccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccchhHHHHHHHHHHHh Q lcl|Aclame:pro 236 YNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKEKTYYIDTFMAE 315 (402) Q Consensus 236 aG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~~~~d~i~~~~a~ 315 (402) .|++|+.+.+.|.. +.+.+|++.|++.+...++..|..||+.++.|.|.+.+.| T Consensus 194 ~G~~Viv~s~~~~~--------------------------~~~~l~~~gAi~~~~~~~~~vEtdRd~~~~~d~i~~~~~y 247 (270) T protein:vir:95 194 VGVSDIVKSKRVSE--------------------------NTAFLQRYGAMEIVNKKKPEAYTDFDILKRTHLLSTNYHY 247 (270) T ss_pred cceeEEEeCCCCCc--------------------------eeEEEEeccceeeeecCCceeeeccchhhcccEEEeeeEE Confidence 99998776665531 1257889999999999999999999999999999999999 Q ss_pred cCcccccceEEEEEEee-ccCcc Q lcl|Aclame:pro 316 GAIPDRWEAVSVVTTKR-DATTG 337 (402) Q Consensus 316 Ga~vlRPeaa~vv~~~~-~~t~~ 337 (402) |.++.+|+.++.++++. 
+.|.- T Consensus 248 ~v~~~~~skvv~~t~~~a~~~~~ 270 (270) T protein:vir:95 248 SVNLKDETGVVKVTFKPSGSLEM 270 (270) T ss_pred EEEEEccceEEEEEecCCCCcCC Confidence 99999999988888753 22222 No 54 >protein:vir:99523 Length: 311 # NCBI annotation: putative protein # Family: family:all:701 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958538;genbank:gi:41179320;genbank:GeneID:2717161 Probab=99.84 E-value=2.6e-23 Score=144.40 Aligned_cols=297 Identities=9% Similarity=0.069 Sum_probs=195.4 Q ss_pred CCCCcccccccccccccHHHH-HHHHHhHHHHHHHHHHhhhcccceeee--c-cccceEEeeeccceeeeeecCCCCCCC Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDSL-LIEKFNGKVNEQYLKGENILSYFDVQT--V-TGTNTVSNKYLGETELQVLAPGQSPNA 76 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~al-fle~f~geV~t~f~~~sv~~~~~~~rt--i-~~Gksv~f~~iG~~t~~~~~~G~~i~~ 76 (402) |-. ..+..+| +.++|+.++++.|...++.-.+.+ .. + .|||+|+||.|....+++|++++.... T Consensus 1 ~~~-----------~an~mAlnya~~~~~~Ld~~~~~~~~t~~l~~-~~~~~~~Gak~VkIp~i~~~gl~dY~R~~g~~~ 68 (311) T protein:vir:99 1 MPT-----------DAETRGFNYVTKDGNLLDQKITAGLFTAALGT-PEVDLVNGGRSFTLKTISTSGLKDHTRGKGFNS 68 (311) T ss_pred CCC-----------cchhhHHHHHHHHHHHHHHHHHhhhcccceec-CchheeecCCEEEEEeeeeccccccccccCccc Confidence 211 1233455 679999999999999887655543 22 3 589999999999999999999987777 Q ss_pred CCccccceeEeecceeeccchhh--hHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccccccc Q lcl|Aclame:pro 77 TPTQADKNQLVIDTTVIARNTVA--HIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKG 154 (402) Q Consensus 77 ~~~~~~e~~itID~~lya~~~Id--dlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~g 154 (402) ..+..+..+.++++..++.|.|| |+||....+. +-.-+.+.+-....=++|..-|..|+..+...... .. 
T Consensus 69 g~v~~~~et~tl~~DR~~~f~vD~mDvdETn~~~~-~ani~~~f~r~~vvPEiDayrfskla~~a~~~~~~------~~- 140 (311) T protein:vir:99 69 GTISDEKTIYTMGQDRDVEFYLDRQDVDETDNELA-MANISNVFITEHVQPELDSYRFSKIATSFDNLDGT------DT- 140 (311) T ss_pred cceeeeeeEEEeeeccceeeecchhchhhhhhhhH-HHHHHHHHHHhhhcchhhHHHHHHHHhhhhccccc------cc- Confidence 78899999999999999999999 7777665543 34444444445666778998888887554321100 00 Q ss_pred cccccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccc--cCcccccceE Q lcl|Aclame:pro 155 HGFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTIS--QSGATINGFV 232 (402) Q Consensus 155 ~~~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~--~~g~~~~G~V 232 (402) +..........+...+.+++++.|..+..+|+| ||.++|+++|+|++|.+|.+.++|. |..+.. +.+ ..++.| T Consensus 141 -~~~~~~~~~~~~~~lt~~nvl~~l~~~~~~~~~--v~~~~rvl~vTp~~~~lLk~~~~~~-r~~~~~~~~~~-~i~~~V 215 (311) T protein:vir:99 141 -EGTLLAKTHKTEETLDETNAYSQLKTGIGKVRK--YGTQNLVGYVSSEVMDALERSKEFT-RNITNQNVGTT-ALESRI 215 (311) T ss_pred -chhhhccccccccccCHHHHHHHHHHHHHHHHh--cCCCCeEEEEChHHHHHHhhchhhh-eeeeccccccc-cccccc Confidence 000001111223457888999999999999987 7999999999999999887777765 333222 222 258899 Q ss_pred EEEeccEEE---ecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccchhH---HH Q lcl|Aclame:pro 233 LSSYNCPVI---PSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKE---KT 306 (402) Q Consensus 233 ~~iaG~~V~---~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~~---~~ 306 (402) +++.|++|+ .|++|.+.-....|... ..+-.+.-.++.|+.|+..+...+ .+..|.+... 
-+ T Consensus 216 ~~lDgv~Ii~V~ps~r~~t~~~ft~G~~~-----------~~~ak~INfiiv~~~a~i~~~K~~-~v~~f~P~~~~~gd~ 283 (311) T protein:vir:99 216 TSIDGVQLIEVYESNRFMTKYDFTDGAKP-----------TEDAKAINFLVVAKPAVISIVKEN-AVFLFAPGQHTDGDG 283 (311) T ss_pred ceecCeEEEEecCchhhcchhhhcCCccc-----------cCcccccceEEeCCCeeeeeeeee-eeeeeCCCCCCCcce Confidence 999999987 55667643221111111 111123346888999888766655 4444543322 35 Q ss_pred HHHHHHHHhcCcccccceEEEEEEeeccC Q lcl|Aclame:pro 307 YYIDTFMAEGAIPDRWEAVSVVTTKRDAT 335 (402) Q Consensus 307 d~i~~~~a~Ga~vlRPeaa~vv~~~~~~t 335 (402) |++..+.-+..-|+.....++ .+-.... T Consensus 284 ~l~~~R~Y~D~fv~~nk~~~I-yv~~k~A 311 (311) T protein:vir:99 284 YLYQNRLYHDLFIKKHKRDGI-FVSVKKA 311 (311) T ss_pred eeeeeeeeeeeeeeccccCeE-EEeeecC Confidence 777777777777776654443 2222111 No 55 >protein:vir:79712 Length: 285 # NCBI annotation: major capsid protein gp34 # Family: family:all:701 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285883;genbank:gi:148750840;genbank:GeneID:5220414 Probab=99.84 E-value=1.8e-23 Score=145.23 Aligned_cols=270 Identities=13% Similarity=0.150 Sum_probs=183.1 Q ss_pred CCCCcccccccccccccHHHHHHHHHhHHHHHHHHHHhhhccccee-----eeccccceEEeeecc-ceeeeeecCCCCC Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDV-----QTVTGTNTVSNKYLG-ETELQVLAPGQSP 74 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~alfle~f~geV~t~f~~~sv~~~~~~~-----rti~~Gksv~f~~iG-~~t~~~~~~G~~i 74 (402) |+.. +.++|+..+++.|..++.+.++..- ..-.|||+|+||.|. ...+++|+++... 
T Consensus 1 Main-----------------~~~k~~~~ld~~~~~~~~~~~l~~~~n~~~~~~~gak~VkIp~ist~~gl~dY~R~~g~ 63 (285) T protein:vir:79 1 MTVV-----------------LDSKDLARIDEEYKADSQVWSYLTGGNGVTQRFRGHNEVRINKLSGFVDATAYKRGQDN 63 (285) T ss_pred Ccch-----------------hhHHHHHHHHHHHHHhhhhhhhcccCCcceeEecCCCEEEEeeecccccccccccccCc Confidence 3211 2389999999999998888766543 223589999999996 5689999999888 Q ss_pred CCCCccccceeEeecceeeccchhhhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccccccc Q lcl|Aclame:pro 75 NATPTQADKNQLVIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKG 154 (402) Q Consensus 75 ~~~~~~~~e~~itID~~lya~~~IddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~g 154 (402) ....+..+..++++++..++.|.||.+|.-.+..=.+-..+.+.+-....-++|..-|..|+..+. T Consensus 64 ~~g~v~~~~et~tl~~DR~~~f~iD~mDvdEn~~~~~~ni~~ef~~~~vvPEiDayrfskla~~a~-------------- 129 (285) T protein:vir:79 64 ARKTISVGKETVKLTHEDWFGYDLDQFDMDENGAYTVENVVREHNKMITIPHRDKVAVQKLFDSAA-------------- 129 (285) T ss_pred cccccceeeeEEEeeccccceecccccchhhhhhhhHHHHHHHHHhhhhcchhhHHHHHHHHhhcc-------------- Confidence 888899999999999999999999944433322111333333333445556888888877764321 Q ss_pred cccccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccccC--cccccceE Q lcl|Aclame:pro 155 HGFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQS--GATINGFV 232 (402) Q Consensus 155 ~~~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~--g~~~~G~V 232 (402) .. .+.+.+.+++|++|.++..+|||..|| ++||++|+|++|.+|.++++|. +.....+. 
..-.++.| T Consensus 130 ----~~-----~~~~~T~~nv~~~i~~~~~~lde~~vp-~~rvl~vTp~~~~~Lk~s~~~~-r~~~~~~~~~~~~i~~~V 198 (285) T protein:vir:79 130 ----KK-----ATDSITKDNALDAYDTAEAYMFDNEVP-GGFVMFVSSAYYTALKQSAAVT-RTFSTDGTMVINGIDRRV 198 (285) T ss_pred ----cc-----cccccCHHHHHHHHHHHHHHHHHcCCC-CceEEEEChHHHHHHHhhhhhh-eecccccceeccceeeee Confidence 00 112356789999999999999999999 7999999999999999998887 43322121 11246789 Q ss_pred EEEec-cEEEe--cCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccch-hHH--H Q lcl|Aclame:pro 233 LSSYN-CPVIP--SNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEK-KEK--T 306 (402) Q Consensus 233 ~~iaG-~~V~~--SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~-~~~--~ 306 (402) .++.| ++|++ |++|++.. .+ .+.-.++.|+.|+......+ .+..|.++ .+. + T Consensus 199 ~~lDg~v~ii~Vps~r~kt~~------------------~~---k~Infiiv~~~a~i~~~K~~-~~~~f~P~~~~~~d~ 256 (285) T protein:vir:79 199 AQLDGGVPIVRVSSDRLKGLG------------------IT---NHVNFILTPLSAIAPIVKYD-SVSVIDPSTDRSGNR 256 (285) T ss_pred ccccceeEEEEcchhhccCcC------------------cc---hhccEEEecCceeccceeee-eeEeECCCCCCCcce Confidence 99998 99988 46664311 00 12335888999888776666 44455444 222 4 Q ss_pred HHHHHHHHhcCcccccceEEEEE-Eeecc Q lcl|Aclame:pro 307 YYIDTFMAEGAIPDRWEAVSVVT-TKRDA 334 (402) Q Consensus 307 d~i~~~~a~Ga~vlRPeaa~vv~-~~~~~ 334 (402) |++..+.-+..-|+.-...++.. .+.+. 
T Consensus 257 ~~~~~R~Y~d~fv~~nk~~~Iy~~~~a~~ 285 (285) T protein:vir:79 257 WTIKGLSYYDAIVLDNAKKGIYVAATAGV 285 (285) T ss_pred eeeeeeeeeeeeehhhccceeeeeecccC Confidence 67777777777777555444321 11222 No 56 >protein:vir:78090 Length: 302 # NCBI annotation: Cps # Family: family:all:701 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468790;genbank:gi:157325371;genbank:GeneID:5601852 Probab=99.68 E-value=9.3e-19 Score=119.43 Aligned_cols=284 Identities=11% Similarity=0.102 Sum_probs=191.8 Q ss_pred CCCCcccccccccccccHHHHHHHHHhHHHHHHHHHHhhhccccee-e--eccccceEEeeecc-----ceeeeeecCCC Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDV-Q--TVTGTNTVSNKYLG-----ETELQVLAPGQ 72 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~alfle~f~geV~t~f~~~sv~~~~~~~-r--ti~~Gksv~f~~iG-----~~t~~~~~~G~ 72 (402) |+ |+ . -+.++|+.++++.|...+++-.+... . ...|||+|+||.|- .+..++|++++ T Consensus 1 Ma--nt------------l-~ya~~~~~~Ld~~~~~~~~t~~l~~~~~~v~~~Gak~vkIp~is~~~~~TsGl~dy~R~~ 65 (302) T protein:vir:78 1 MA--NS------------L-ALAQIYQDNIDKAIAVNSKSAFLEANPNNVQYNGGNTIKIADISFGSGTTGDLKAYNRST 65 (302) T ss_pred CC--ch------------h-HHHHHHHHHHHHHHHhhhceeecccCCceEEEecCcEEEEEEEEeecccccccccccccc Confidence 54 11 2 24589999999999999987766432 1 25689999999994 55788999988 Q ss_pred CCCCCCccccceeEeecceeeccchhh--hHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccc Q lcl|Aclame:pro 73 SPNATPTQADKNQLVIDTTVIARNTVA--HIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKP 150 (402) Q Consensus 73 ~i~~~~~~~~e~~itID~~lya~~~Id--dlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~ 150 (402) ......+..+..+.++++..++.|.|| |+||...-.. +-..+.+.+-....=++|..-|..|+..|... 
T Consensus 66 g~~~g~v~~~~et~tlt~DR~~~f~vD~mDvdETn~~~~-~ani~~ef~r~~vvPEiDayrfskla~~a~~~-------- 136 (302) T protein:vir:78 66 GFTQGSVTLAWSDYTLDYDLAQSFQIDAMDVDETKNLAT-VGNVLSEYQRTKIVPAIDKYRFTKLANDGTGV-------- 136 (302) T ss_pred CccccceeeeeeeEEeeeccceeeeccccchhhhhhhhH-HHHHHHHHHHhhhcchhhHHHHHHHHHhhhcc-------- Confidence 766666888999999999999999999 7777665553 45555555666777889999888886543211 Q ss_pred cccccccccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccchhhccccc--ccCcccc Q lcl|Aclame:pro 151 RVKGHGFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTI--SQSGATI 228 (402) Q Consensus 151 ~~~g~~~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~--~~~g~~~ 228 (402) +.... ......+.+++++.|..+..+|+|. ++|+++|+|++|.+|.+++.|. +..+. .+.+. . T Consensus 137 -------~~~~~--~~~~~~t~~nvl~~i~~~~~~~~e~----~~~vl~vtp~~~~~Lk~a~~~~-~~~~~~~~~~~~-i 201 (302) T protein:vir:78 137 -------GGVID--LSKPDASAQALMGDIATAMELVDDS----NQLILVTSPTTLAGLLNTALIR-ESKNTQVLRRGE-V 201 (302) T ss_pred -------Ccccc--ccccchhHHHHHHHHHHHHHHhhcc----CCeEEEEChHHHHHHhcchhhc-cceecccccccc-c Confidence 00100 1112346789999999999999995 5999999999999888776664 33222 12222 4 Q ss_pred cceEEEEeccEEEecC--ccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccch-hHH Q lcl|Aclame:pro 229 NGFVLSSYNCPVIPSN--RFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEK-KEK 305 (402) Q Consensus 229 ~G~V~~iaG~~V~~SN--nlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~-~~~ 305 (402) ++.|.++.|++|++.+ +|.+.-.-.. | +....+-.+.-.++.|+.|+......+ .+..|.+. .+. 
T Consensus 202 ~~~V~~lDgv~Ii~VPs~r~~t~~~f~~---------G--~~~~~~ak~INfiiv~~~a~ia~~K~~-~~~if~P~~~~~ 269 (302) T protein:vir:78 202 DTKITFIQDVEVLQVPSEYLYDKVAPKV---------G--VPDYTGAKKIPYMIFKRDAPTGIVKTD-KVRVFEPDTNQS 269 (302) T ss_pred cceeeeecccEEEEchhhhcccceeccC---------C--ccccCCccceeEEEECCCeeeeeeeee-eeEeeCCCCCCC Confidence 8899999999998753 4443211111 1 122222234557899999888776666 45555443 444 Q ss_pred H--HHHHHHHHhcCcccccceEEEE-EEeeccC Q lcl|Aclame:pro 306 T--YYIDTFMAEGAIPDRWEAVSVV-TTKRDAT 335 (402) Q Consensus 306 ~--d~i~~~~a~Ga~vlRPeaa~vv-~~~~~~t 335 (402) + |++..+.-+..-|+.....++. ..+.+.. T Consensus 270 gd~~l~~~R~Y~D~fV~~nk~~gI~~~~~~~~~ 302 (302) T protein:vir:78 270 ADAYKVDLRLYHDLIVPKNQRPGIIKASFGTIA 302 (302) T ss_pred cceeeeeeeeEeeeeeeccccCeEEEeeccccC Confidence 4 6888888888888776644443 2233322 No 57 >protein:vir:78523 Length: 338 # NCBI annotation: Putative head structural protein # Family: family:all:507 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491585;genbank:gi:157786408;genbank:GeneID:5625675 Probab=99.56 E-value=1.5e-16 Score=107.32 Aligned_cols=303 Identities=11% Similarity=-0.027 Sum_probs=165.5 Q ss_pred CCCCcccccc-----cccccc-cHHHHHHHHHhHHHHHHHHHHhhhcccceeeeccccceEEeeec---------cceee Q lcl|Aclame:pro 1 MSTPNTLTNV-----AVSASG-EVDSLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYL---------GETEL 65 (402) Q Consensus 1 Ms~~n~~t~~-----~~~~~~-d~~alfle~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~~i---------G~~t~ 65 (402) |+.-|-+... ..++.. ..-+++-++|..++++..++.+.++.+.++.++.+ +++++|+. 
|..++ T Consensus 1 ~~~~~e~~~~~~~~~~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~l~~~~~~~~-~~~~ip~~~~~~~a~~v~~~~~ 79 (338) T protein:vir:78 1 MATLNELAPNTAGSNHQGRLAHVPSDLLPKEIVGPIFDKAQESSLVLRLGENIPISY-GETIIPTTVKRPEVGQVGVGTS 79 (338) T ss_pred CcchHHhhhhhcccccccceecccccccchHHHHHHHHHHHhhchhhhhcceeeccC-CceEEEEEecCccceeeccccc Confidence 6665432211 112211 22236669999999999999999999999888764 57777764 33344 Q ss_pred eeecCCCCCCCCCccccceeEeecceeeccchhhhHHHh--hcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcc Q lcl|Aclame:pro 66 QVLAPGQSPNATPTQADKNQLVIDTTVIARNTVAHIHDV--QGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANT 143 (402) Q Consensus 66 ~~~~~G~~i~~~~~~~~e~~itID~~lya~~~IddlDe~--q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a 143 (402) .....|+.+....+..++.++..-.+ .....|.+ |+ ++.+| +.+.+.+++++++++.+|+.++.-- .... T Consensus 80 ~~~~Eg~~~~~~~~~f~~v~l~~~k~-~~~~~is~--ell~ds~~~-~~~~i~~~la~a~~~~~d~~~l~G~----g~~~ 151 (338) T protein:vir:78 80 NEQREGGTKPLSGTAWDTRSVAPIKL-ATIVTVSE--EFARMNPSG-LYTKLQADLAYAIGRGIDLAVFHGK----SPLT 151 (338) T ss_pred ccccccccccccccceeEEEEEEEEE-EEeehhhH--HHHhcCHHH-HHHHHHHHHHHHHHHHHHHHhhccc----CCCc Confidence 45555666665556666666655432 22222322 22 24466 7889999999999999999886211 0000 Q ss_pred ccccccccccccccccccc--cCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccchhhccccc Q lcl|Aclame:pro 144 KAERNKPRVKGHGFSINVN--VTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTI 221 (402) Q Consensus 144 ~~~~~~~~~~g~~~~~~v~--~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~ 221 (402) + ..+ .+........ ........+...+|+.|.++...+. 
++........+++|..|..|.+...+.|.+..- T Consensus 152 ~---~~~--~gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~m~~~~~~~L~~~~~l~d~~g~~ 225 (338) T protein:vir:78 152 G---SAL--QGIDTNNVIVNTTNVDYLQTGTTPLLDRFLDGYDLVS-ANTDVDFNGWAADPRYRARLLRSQAYRDANGNV 225 (338) T ss_pred c---ccc--cccccccccccccccccccccchhhHHHHHHHHHHhh-hhccccceEEEEchHHHHHHHHHhhhccCCCce Confidence 0 000 0000000000 0111122334566888888877664 344444556799999999997765554433111 Q ss_pred ccCcccccceEEEEeccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccc Q lcl|Aclame:pro 222 SQSGATINGFVLSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYE 301 (402) Q Consensus 222 ~~~g~~~~G~V~~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d 301 (402) .-.....+|.-.++.|+||+.|+++|.......+ ....=+-+||++.. +..+ .++..+..++ T Consensus 226 l~~~~~~~~~~~~l~G~PV~~~~~ip~~~~~~~~--------~~~~~~~gdfs~~~--~~~~--------~~~~i~~~~~ 287 (338) T protein:vir:78 226 DPTRINLAASAGDLLGLPVQFGKAVGGDLGAATD--------SKVRVVGGDFSQLK--YGFA--------DEIRVKMSDT 287 (338) T ss_pred eecccccCCCCceeeeeeEEEccccCccccccCC--------cccEEEEEecceEE--EEee--------cccEEEEeec Confidence 1122233566688999999999999953321110 11111335665421 2111 1222222221 Q ss_pred hh--------------HHH--HHHHHHHHhcCcccccceEEEEEEeeccCcccc Q lcl|Aclame:pro 302 KK--------------EKT--YYIDTFMAEGAIPDRWEAVSVVTTKRDATTGDA 339 (402) Q Consensus 302 ~~--------------~~~--d~i~~~~a~Ga~vlRPeaa~vv~~~~~~t~~~a 339 (402) .- .+. 
..+++.+-+|.+++||++.+.|+- .+.++| T Consensus 288 ~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~l~~---~~~~~~ 338 (338) T protein:vir:78 288 ATLTDNTSPTPQTVSMWQTNQIAILIEVTFGWLLGDKQAFVKFVD---DEDPDA 338 (338) T ss_pred ccccccccccccchhhhhcCcEEEEEEEEeccEeecccceEEEec---ccCCCC Confidence 10 111 224566678999999998776532 333444 No 58 >protein:vir:78223 Length: 333 # NCBI annotation: Putative major head protein # Family: family:all:966 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491666;genbank:gi:157786490;genbank:GeneID:5625701 Probab=99.48 E-value=2e-15 Score=101.18 Aligned_cols=302 Identities=12% Similarity=-0.030 Sum_probs=160.6 Q ss_pred CCCCccccc-----ccccccc-cHHHHHHHHHhHHHHHHHHHHhhhcccceeeeccccceEEeeec-cceeeeeecCCCC Q lcl|Aclame:pro 1 MSTPNTLTN-----VAVSASG-EVDSLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYL-GETELQVLAPGQS 73 (402) Q Consensus 1 Ms~~n~~t~-----~~~~~~~-d~~alfle~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~~i-G~~t~~~~~~G~~ 73 (402) |+.-|-+.. ...++.. ..-+++-+++..++++..++.++++.+.++.++.+|. .++|+. +..++..+..|+. T Consensus 1 ~a~l~el~~~~~~~~~~g~~~~~~~~liP~~~~~~ii~~l~~~s~l~~~~~~~~~~~~~-~~~p~~~~~~~a~~v~eg~~ 79 (333) T protein:vir:78 1 MATLNELLPNSAGSNHQGRLAHVPSDLLPKEIVGPIFDKAQESSLVLRMGEQIPISYGE-TIIPTTVKRPEVGQVGVGTS 79 (333) T ss_pred CchhHHhhhhcccccccCceecCCccccchhHHHHHHHHHHhhchhhhhcceeeccCCc-eEEEEEeCCceeEeecCccc Confidence 444432211 1111111 1123667999999999999999999999988887644 456654 6667766666654 Q ss_pred CC--------CCCccccceeEeecceeeccchhhhHHHh-hcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccc Q lcl|Aclame:pro 74 PN--------ATPTQADKNQLVIDTTVIARNTVAHIHDV-QGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTK 144 (402) Q Consensus 74 i~--------~~~~~~~e~~itID~~lya~~~IddlDe~-q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~ 144 (402) .. ...+...+. ++...++.....-.-+=. 
++..| +.+.+.+++++++++.+|+.++.-- ....+ T Consensus 80 ~~~~e~~~~~~~~~~f~~i--~l~~~kl~~~~~is~ell~~s~~~-~~~~i~~~la~ai~~~~d~~~l~G~----g~~~~ 152 (333) T protein:vir:78 80 NEQREGGLKPLSGTAWDTR--SVSPIKLATIVTVSEEFARMNPSG-LYTKLQGDLAYAIGRGIDLAVFHGK----SPLTG 152 (333) T ss_pred ccccccccccccccceeEE--EEeeEEEEEeehhhHHHHhcCHHH-HHHHHHHHHHHHHHHHHHHHHhccc----CCCCC Confidence 32 222333333 344444444333222222 34566 7889999999999999999886311 11110 Q ss_pred ccccccccccccccccccc--CCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccc Q lcl|Aclame:pro 145 AERNKPRVKGHGFSINVNV--TESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTIS 222 (402) Q Consensus 145 ~~~~~~~~~g~~~~~~v~~--~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~ 222 (402) . ...|......+.. ............++.|+++...+..+. .......+++|..|..|++...+.|.+..-. T Consensus 153 ~-----~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~-~~~~~~~vmn~~~~~~L~~~~~~~d~~G~~i 226 (333) T protein:vir:78 153 S-----ALQGIDTDNVIANTTNVDYLQETGDPLLDRLLDGYDLVSANT-DVEFNGWAVDPRFRAHLLRAQAYRDANGNVD 226 (333) T ss_pred c-----ccccccccccccccccccccccccchhHHHHHHHHHhhcccc-ccCceEEEEcchHHHHHHHHhhhcCCCCcee Confidence 0 0001111000000 001111222335777888877765443 3333457889999999987666554431111 Q ss_pred cCcccccceEEEEeccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccch Q lcl|Aclame:pro 223 QSGATINGFVLSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEK 302 (402) Q Consensus 223 ~~g~~~~G~V~~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~ 302 (402) -......|..++++|+||+.|+++|...... ..+...=+-+||++.. +. .-.+++.+..++. 
T Consensus 227 ~~~~~~~~~~~~l~G~Pv~~~~~i~~~~~~~--------~~~~~~~~~gD~~~~~-~g---------~~~~~~i~~~~~~ 288 (333) T protein:vir:78 227 PSRINLAAQTGDVLGLPAQFGRAVGGDLGAA--------VDSKTRIIGGDFSQLK-FG---------FADEIRIKMSDTA 288 (333) T ss_pred ecCccccCCCceeeceeeEEccccCCCcccc--------CCCccEEEEEecccEE-EE---------EeeccEEEEeccc Confidence 1222234556789999999999999543211 1111122446665422 11 1122333332221 Q ss_pred ----------hHH-H--HHHHHHHHhcCcccccceEEEEEEeeccCc Q lcl|Aclame:pro 303 ----------KEK-T--YYIDTFMAEGAIPDRWEAVSVVTTKRDATT 336 (402) Q Consensus 303 ----------~~~-~--d~i~~~~a~Ga~vlRPeaa~vv~~~~~~t~ 336 (402) ..| . -.+++.+-++.++++|++.+.| +..+.| T Consensus 289 ~~~~~~~~~~~~~~~~~v~~r~~~r~d~~v~~~~a~~~l--~~~~a~ 333 (333) T protein:vir:78 289 TLTDSGSATVSMWQTNQIAILIEVTFGWLLGDKQAFVKF--VDDEQP 333 (333) T ss_pred cccccccceeehhhcCcEEEEEEEEEccEEecccceEEE--eccCCC Confidence 011 1 1234556689999999987765 333333 No 59 >protein:vir:7771 Length: 330 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817605;genbank:gi:29566035;genbank:GeneID:1259229 Probab=99.45 E-value=4.7e-15 Score=99.13 Aligned_cols=301 Identities=10% Similarity=0.005 Sum_probs=167.8 Q ss_pred CCCCcc-cccccccccccHHHHHHHHHhHHHHHHHHHHhhhcccceeeeccccceEEeeec-cceeeeeecCCCCCCCCC Q lcl|Aclame:pro 1 MSTPNT-LTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYL-GETELQVLAPGQSPNATP 78 (402) Q Consensus 1 Ms~~n~-~t~~~~~~~~d~~alfle~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~~i-G~~t~~~~~~G~~i~~~~ 78 (402) |+.... .... ..+++.-.+..+++..++++..+..++++++.++.++.+ ..+++|+. +...+..+.-|+.+.... 
T Consensus 1 m~~~~~~a~~~--~~t~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~-~~~~~p~~~~~~~a~~v~Eg~~~~~~~ 77 (330) T protein:vir:77 1 MAGSTVPSTQV--ALTGDFSAFLTPEQSQDYFAEIEKTSIVQRIARKVPMGP-TGISIPHWTGAVSASWTGEAERKPITK 77 (330) T ss_pred Ccccccchhhc--cccCCCcceechhHHHHHHHHHHhccchhhhcceeeccC-CceEEEEEcCCcceeEecCCCcccccc Confidence 877641 1111 112222234446778899999999999999998877665 44778876 777888888888888777 Q ss_pred ccccceeEeecceeeccc-hhhhHHHhh-cCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccccccccc Q lcl|Aclame:pro 79 TQADKNQLVIDTTVIARN-TVAHIHDVQ-GDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKGHG 156 (402) Q Consensus 79 ~~~~e~~itID~~lya~~-~IddlDe~q-~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~g~~ 156 (402) +..++.++..-. +... .|.+ +-.+ +.+| +.+.+.+++++++++++|+.++ .+.....+...-- .+.. T Consensus 78 ~~f~~i~~~~~k--~~~~~~is~-ell~ds~~~-~~~~i~~~l~~ai~~~~~~~~l----~G~g~~~~~~g~~---~~~~ 146 (330) T protein:vir:77 78 GSFGKQELEPVK--ITTIFAESA-EVVRLNPLN-YLNTMRTKIAEAIALKFDAAAI----HGIDKPSAFKGYL---AETT 146 (330) T ss_pred ceeeEEEEeEEE--EEEeehhhH-HHHhcchHH-HHHHHHHHHHHHHHHHHHHHhh----cccCCCCcccccc---cccc Confidence 777777666643 3332 3322 1122 3456 7889999999999999998776 1111111100000 0000 Q ss_pred ccccc-ccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhc--c--cchhhcccccccCcccccce Q lcl|Aclame:pro 157 FSINV-NVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRD--A--DRIVDKTYTISQSGATINGF 231 (402) Q Consensus 157 ~~~~v-~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~--~--~r~~n~d~~~~~~g~~~~G~ 231 (402) ..... .....+........|+.|.++...+...+.+.. ..|++|..|..|.+ | .|.+-.. ....+...... 
T Consensus 147 ~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~--~~vmn~~~~~~l~~lkd~~G~~l~~~--~~~~~~~~~~~ 222 (330) T protein:vir:77 147 KVVSLADTNLTTASGPQGNAYLAVNNALSLLVNSGKKWT--GTLLDNVTEPILNTAVDGNGRPLFVE--STYTEQVGAIR 222 (330) T ss_pred ccceeecccccccccccchhHHHHHHHHHhhhhcCCCcc--EEEEcHHHHHHHHHHhccCCceeecC--ccccccccccC Confidence 00010 011112222334567888888888887776543 45899999998874 1 2222111 01111112233 Q ss_pred EEEEeccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccch--------- Q lcl|Aclame:pro 232 VLSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEK--------- 302 (402) Q Consensus 232 V~~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~--------- 302 (402) -.++.|+||+.|+++|......... =+-+||++.. + +...++..+..++. T Consensus 223 ~~~l~G~PV~~~~~~p~~~~~~~~~-----------~~~gd~s~~~--i--------~~~~~~~i~~~~e~~~~~~~~~~ 281 (330) T protein:vir:77 223 EGRILGRPTYVADNVVNGTVGNRVV-----------GVMGDFSQVI--W--------GQIGGLSFDVTDQATLDFGEEQG 281 (330) T ss_pred CceecceeeEEeccccCCCCCCccE-----------EEEEecceEE--E--------EEecCcEEEEeecceeeeccccc Confidence 4578999999999999643211111 1334444421 1 11122233322211 Q ss_pred -----------hHHHHHHHHHHHhcCcccccceEEEEEEeeccCccccccch Q lcl|Aclame:pro 303 -----------KEKTYYIDTFMAEGAIPDRWEAVSVVTTKRDATTGDAGGPG 343 (402) Q Consensus 303 -----------~~~~d~i~~~~a~Ga~vlRPeaa~vv~~~~~~t~~~a~~~~ 343 (402) .+=...+++..-+|..++||+|.+.|+.+. 
+++.|+-- T Consensus 282 ~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~i~~~~---~~~~~~~~ 330 (330) T protein:vir:77 282 GVWVPKLISLWQHNMVAVRCEAEFAFMVNDKDAFVKLTDQV---AGTDPEEE 330 (330) T ss_pred ccccccccchhhcCcEEEEEEEEeccEEecccceEEEEecc---CCcCCCCC Confidence 001133466677899999999988775543 34444321 No 60 >protein:vir:41 Length: 299 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463467;swissprot:trembl:q9t1b7;genbank:gi:16798789;uniprot:Q9T1B7;genbank:GeneID:922353 Probab=99.45 E-value=6.3e-15 Score=98.45 Aligned_cols=282 Identities=10% Similarity=0.033 Sum_probs=165.8 Q ss_pred CCCCcccccccccccccHHHHHHHHHhHHHHHHHHHHhhhcccceeeeccccceEEeeeccceeeeeecCCCCCCCCCcc Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYLGETELQVLAPGQSPNATPTQ 80 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~alfle~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~~iG~~t~~~~~~G~~i~~~~~~ 80 (402) |.+....+.....+.+ +.-++++.++++...+.++++++.++.++.+ ++.++++.....+..+..|++++...+. T Consensus 1 ~g~~a~~~~~~~~~~~----~iP~~~~~~ii~~~~~~s~l~~~~~~~~~~~-~~~~~~~~~~~~a~~v~E~~~~~~~~~~ 75 (299) T protein:vir:41 1 MGFNPDTTTMQSAKTG----SIPINISEQIITGVKNGSAAMKLAKAVPMTK-PEEEFTFMSGVGAFWVDEAERIQTSKPT 75 (299) T ss_pred CCcCCCcccccCCCce----ecchhHHHHHHHHHHhcchhhhhceeeecCC-CcEEEEEEcCCceeeeecCccccccccc Confidence 6665432222222111 3448899999999999999999998888754 6678998888888999999988877788 Q ss_pred ccceeEeecceeeccchhhhHHHhh-cCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccccccccccccc Q lcl|Aclame:pro 81 ADKNQLVIDTTVIARNTVAHIHDVQ-GDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKGHGFSI 159 (402) Q Consensus 81 ~~e~~itID~~lya~~~IddlDe~q-~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~g~~~~~ 159 (402) .++.++.... +..+.--..+-.+ +..| +.+.+.+++++++++.+|+.++. +.....+ .|.... 
T Consensus 76 f~~v~l~~~k--~~~~~~is~ell~ds~~~-~~~~i~~~l~~a~~~~~d~a~l~----G~g~~~~--------~gil~~- 139 (299) T protein:vir:41 76 FTKAKMRSKK--MGVIIPTTKENLNYSVTN-FFSLMQAEIVEAFYKKFDQAVFT----GVESPYN--------WNILKS- 139 (299) T ss_pred eeEEEEeeEE--EEEeehhhHHHHhcCHHH-HHHHHHHHHHHHHHHHHHHHHhh----cccCccc--------cccccc- Confidence 7777776655 3333222222222 3355 78899999999999999988762 1111000 011110 Q ss_pred ccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccccCcccccceEEEEeccE Q lcl|Aclame:pro 160 NVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATINGFVLSSYNCP 239 (402) Q Consensus 160 ~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~~~~G~V~~iaG~~ 239 (402) ..... +.......-++.|.++..+|.+.+.+.. .++++|..|..|.+-.. .+..|-- ... . .+...++.|+| T Consensus 140 -~~~~~-~~~~~~~~~~~~l~~~~~~l~~~~~~~~--~~v~n~~~~~~L~~lkd-~~G~~l~-~~~-~-~~~~~~l~G~P 211 (299) T protein:vir:41 140 -ATDAS-NLVEETANKYDDLNEAIGLIEAEDLEPN--GIATIRKQRVKYRSTKD-GNGMPIF-NTA-T-SNGVDDVLGLP 211 (299) T ss_pred -ccccc-eeeccccccHHHHHHHHHhhhcccCCcC--EEEEcHHHHHHHHHhhc-cCCceee-cCC-c-CCCCceeccee Confidence 00000 1111112236778888888888877633 46999999999985211 1111100 111 1 12335789999 Q ss_pred EEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccchh--------------HH Q lcl|Aclame:pro 240 VIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKK--------------EK 305 (402) Q Consensus 240 V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~--------------~~ 305 (402) |+.++++|...+ ...-+-+||++.. +.. 
-.++..+..++.- .+ T Consensus 212 V~~~~~~~~~~~-------------~~~~~~gdfs~~~-i~~---------~~~~~i~~~~~~~~~~~~~~~~~~~~~~~ 268 (299) T protein:vir:41 212 IAYTPKYTFGDK-------------DISELVGDWNQAY-YGI---------LRGVEYEILTEATLTTVADETGKPLNLAE 268 (299) T ss_pred eEEecccCCCCC-------------ceEEEEEecccEE-EEE---------ecCcEEEEeecccccccccccccchhhhh Confidence 999999995321 1112345555432 111 1222333322211 12 Q ss_pred HH--HHHHHHHhcCcccccceEEEEEEeecc Q lcl|Aclame:pro 306 TY--YIDTFMAEGAIPDRWEAVSVVTTKRDA 334 (402) Q Consensus 306 ~d--~i~~~~a~Ga~vlRPeaa~vv~~~~~~ 334 (402) .+ .+++..-+|.++++|+|.+.|+.+..- T Consensus 269 ~~~~~~r~~~~~d~~v~~~~A~~~l~~~aa~ 299 (299) T protein:vir:41 269 RDMAAIKATFEVGFMVVKDEAFSAVQPKAGN 299 (299) T ss_pred cCcEEEEEEEEeccEEecccceEEEEeccCC Confidence 22 234455689999999999888665433 No 61 >protein:vir:100939 Length: 430 # NCBI annotation: Gp5 # Family: family:all:1412 # MgeID: mge:1509 # MgeName: ST104 # Cross-refs: genbank:acc:YP_006408;genbank:gi:46358700;genbank:GeneID:2777089 Probab=99.41 E-value=5.5e-15 Score=98.77 Aligned_cols=293 Identities=12% Similarity=0.007 Sum_probs=167.0 Q ss_pred CCCCcccccccccccccHHHHHHHHHhHHHHHHHHHHhhhccccee-ee-----ccccceEEeeeccceeeeeecCCCCC Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDV-QT-----VTGTNTVSNKYLGETELQVLAPGQSP 74 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~alfle~f~geV~t~f~~~sv~~~~~~~-rt-----i~~Gksv~f~~iG~~t~~~~~~G~~i 74 (402) |++- +.-.+++-.-|.+..|+...+|...+.+ |. -|.|+++.+|.-=..... 
.|..+ T Consensus 1 MAn~--------------l~~~~~ii~~eal~~l~n~~v~a~~~~~~r~~d~~~~r~Gdti~~p~~~~~~~~---~G~~~ 63 (430) T protein:vir:10 1 MALN--------------EGQIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQ---EGWDL 63 (430) T ss_pred Cccc--------------hhhHHHHHHHHHHHHHhhhhhhhhhhcccCCchhhhhcccceEEeccccccccc---cCccc Confidence 5543 2223344556778888888888764332 21 267899887765333333 36655 Q ss_pred CCC--CccccceeEeecceeeccchhhhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccccc Q lcl|Aclame:pro 75 NAT--PTQADKNQLVIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRV 152 (402) Q Consensus 75 ~~~--~~~~~e~~itID~~lya~~~IddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~ 152 (402) .++ .+...+..++||..+--.+.+.+-+ +...+..+ ++-+....+||..+|..++.++..-+.+... T Consensus 64 t~~~~~i~e~~v~~~v~~~k~V~~~~~~ke--l~~~~~~~-~~i~~Am~~LA~~Vd~dl~~~~~~~~~~v~~-------- 132 (430) T protein:vir:10 64 TDKATGLLELNVAVNMGEPDNDFFQLRADD--LRDETAYR-HRIQSAARKLANNVELKVANMAAEMGSLVIT-------- 132 (430) T ss_pred CCCCCccccceEEEEEeeeccceEEechhH--hcChhHHH-HHhHHHHHHHHHHHHHHHHHHhhhccccccc-------- Confidence 544 3445678999999987777777643 45555444 4447778999999999998776532211100 Q ss_pred cccccccccccCCccccccHHHHHHHHHHHHHHHHhhcCCcc-CcEEEeChHHHHHHhcc-cchhhcccccccCcccccc Q lcl|Aclame:pro 153 KGHGFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDIS-DVAIMMPWKFFNALRDA-DRIVDKTYTISQSGATING 230 (402) Q Consensus 153 ~g~~~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~-gR~~VV~P~~y~~Ll~~-~r~~n~d~~~~~~g~~~~G 230 (402) ...+ ..+... ..+..+-.+.+.|++..||.+ +|.+|++|+.+..|... 
.++-+.+ ......+++| T Consensus 133 --~~~~------t~~~~~---~~~~~~A~a~~~L~~~~vP~~~~R~~vldp~~~~~l~~~l~~l~~~~--~~~~~A~r~g 199 (430) T protein:vir:10 133 --SPDA------IGTNTA---DAWNFVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFG--RIPEEAYRDG 199 (430) T ss_pred --cccc------CCCcCC---cchhhHHHHHHHHHHhcCCCCCCcEEEeChHHHHHHHhhhccccccc--cchhHHHhhc Confidence 0000 000001 124567778899999999995 89999999999998752 3333222 1223347899 Q ss_pred eEEE-Eecc-EEEecCccccccCccccccc-c-------------------------------ccC---Cccccceee-- Q lcl|Aclame:pro 231 FVLS-SYNC-PVIPSNRFPTFAQDQAHHLL-S-------------------------------NED---NGYRYDPIA-- 271 (402) Q Consensus 231 ~V~~-iaG~-~V~~SNnlP~~~~~~t~~~l-s-------------------------------~a~---~G~~~~~~a-- 271 (402) .|++ +.|| .+++|+++|....+..+... + ..+ .|-.+++++ T Consensus 200 ~i~~~~~Gfd~~~~~~~~~~~t~g~~t~~tv~gA~~~~~~~~~v~~~g~~~~~d~~~~tit~s~tg~l~~GD~ftiaGV~ 279 (430) T protein:vir:10 200 TIQRQVAGFDDVLRSPKLPVLTKSTATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVK 279 (430) T ss_pred cccccchhhhhhhhcCCcccccCccCcCceeccccccccccceecccccccccccccceeeeecccceecccEEEeccee Confidence 9997 8999 59999999963322111000 0 000 011111221 Q ss_pred --------------ecc---------------------------------------------------ceeEEeecHHHh Q lcl|Aclame:pro 272 --------------EMN---------------------------------------------------GAVAVLFTSDAL 286 (402) Q Consensus 272 --------------d~~---------------------------------------------------~~~al~fh~~Av 286 (402) +|. 
-+..++||++|+ T Consensus 280 ~v~~~tkq~~~~l~~F~Vt~~~~atsv~I~paii~~~~~~~~~~~~~y~nVsaspa~~aavTvv~~a~~~~Nl~fhr~A~ 359 (430) T protein:vir:10 280 FLGQMAKNVLAQDATFSVVRVVDGTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAI 359 (430) T ss_pred eeccccccccCCccEEEEEEecCCceeEEeccccccccccccccccccceeccccccCceeEEeccCCcccceeEcccce Confidence 010 023489999976 Q ss_pred hhh--hh-------------c------ccceee--ccchhHHHHHHHHHHHhcCcccccceEEEEEEeecc Q lcl|Aclame:pro 287 LVG--RT-------------I------EVTGDI--FYEKKEKTYYIDTFMAEGAIPDRWEAVSVVTTKRDA 334 (402) Q Consensus 287 ~tv--~~-------------~------dl~~e~--~~d~~~~~d~i~~~~a~Ga~vlRPeaa~vv~~~~~~ 334 (402) .-+ .+ . .+.+.. +||.+.......==..||.+.+|||.++++-.-+.+ T Consensus 360 aLa~~pL~~~~~~~~~~~~~~~~~~~~Glsirv~~~yd~~~~~~~~r~DvLyG~~~v~Pe~a~v~l~g~~~ 430 (430) T protein:vir:10 360 RIVSQPIPANHELFAGMKTTSFSIPDVGLNGIFATQGDISTLSGLCRIALWYGVNATRPEAIGVGLPGQTA 430 (430) T ss_pred EEEEecccCCCCHHHhhhhheeccccceEEEEEEEecccccCceEEEEeeeccceecCcceEEEEcCCCCC Confidence 532 22 0 122222 234333221222223599999999988766332222 No 62 >protein:vir:9265 Length: 430 # NCBI annotation: 5 # Family: family:all:1412 # MgeID: mge:164 # MgeName: ST64T # Cross-refs: genbank:acc:NP_720329;genbank:gi:24371587;genbank:GeneID:955820 Probab=99.41 E-value=5.5e-15 Score=98.77 Aligned_cols=293 Identities=12% Similarity=0.007 Sum_probs=167.0 Q ss_pred CCCCcccccccccccccHHHHHHHHHhHHHHHHHHHHhhhccccee-ee-----ccccceEEeeeccceeeeeecCCCCC Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDV-QT-----VTGTNTVSNKYLGETELQVLAPGQSP 74 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~alfle~f~geV~t~f~~~sv~~~~~~~-rt-----i~~Gksv~f~~iG~~t~~~~~~G~~i 74 (402) |++- +.-.+++-.-|.+..|+...+|...+.+ |. -|.|+++.+|.-=..... 
.|..+ T Consensus 1 MAn~--------------l~~~~~ii~~eal~~l~n~~v~a~~~~~~r~~d~~~~r~Gdti~~p~~~~~~~~---~G~~~ 63 (430) T protein:vir:92 1 MALN--------------EGQIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQ---EGWDL 63 (430) T ss_pred Cccc--------------hhhHHHHHHHHHHHHHhhhhhhhhhhcccCCchhhhhcccceEEeccccccccc---cCccc Confidence 5543 2223344556778888888888764332 21 267899887765333333 36655 Q ss_pred CCC--CccccceeEeecceeeccchhhhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccccc Q lcl|Aclame:pro 75 NAT--PTQADKNQLVIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRV 152 (402) Q Consensus 75 ~~~--~~~~~e~~itID~~lya~~~IddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~ 152 (402) .++ .+...+..++||..+--.+.+.+-+ +...+..+ ++-+....+||..+|..++.++..-+.+... T Consensus 64 t~~~~~i~e~~v~~~v~~~k~V~~~~~~ke--l~~~~~~~-~~i~~Am~~LA~~Vd~dl~~~~~~~~~~v~~-------- 132 (430) T protein:vir:92 64 TDKATGLLELNVAVNMGEPDNDFFQLRADD--LRDETAYR-HRIQSAARKLANNVELKVANMAAEMGSLVIT-------- 132 (430) T ss_pred CCCCCccccceEEEEEeeeccceEEechhH--hcChhHHH-HHhHHHHHHHHHHHHHHHHHHhhhccccccc-------- Confidence 544 3445678999999987777777643 45555444 4447778999999999998776532211100 Q ss_pred cccccccccccCCccccccHHHHHHHHHHHHHHHHhhcCCcc-CcEEEeChHHHHHHhcc-cchhhcccccccCcccccc Q lcl|Aclame:pro 153 KGHGFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDIS-DVAIMMPWKFFNALRDA-DRIVDKTYTISQSGATING 230 (402) Q Consensus 153 ~g~~~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~-gR~~VV~P~~y~~Ll~~-~r~~n~d~~~~~~g~~~~G 230 (402) ...+ ..+... ..+..+-.+.+.|++..||.+ +|.+|++|+.+..|... 
.++-+.+ ......+++| T Consensus 133 --~~~~------t~~~~~---~~~~~~A~a~~~L~~~~vP~~~~R~~vldp~~~~~l~~~l~~l~~~~--~~~~~A~r~g 199 (430) T protein:vir:92 133 --SPDA------IGTNTA---DAWNFVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFG--RIPEEAYRDG 199 (430) T ss_pred --cccc------CCCcCC---cchhhHHHHHHHHHHhcCCCCCCcEEEeChHHHHHHHhhhccccccc--cchhHHHhhc Confidence 0000 000001 124567778899999999995 89999999999998752 3333222 1223347899 Q ss_pred eEEE-Eecc-EEEecCccccccCccccccc-c-------------------------------ccC---Cccccceee-- Q lcl|Aclame:pro 231 FVLS-SYNC-PVIPSNRFPTFAQDQAHHLL-S-------------------------------NED---NGYRYDPIA-- 271 (402) Q Consensus 231 ~V~~-iaG~-~V~~SNnlP~~~~~~t~~~l-s-------------------------------~a~---~G~~~~~~a-- 271 (402) .|++ +.|| .+++|+++|....+..+... + ..+ .|-.+++++ T Consensus 200 ~i~~~~~Gfd~~~~~~~~~~~t~g~~t~~tv~gA~~~~~~~~~v~~~g~~~~~d~~~~tit~s~tg~l~~GD~ftiaGV~ 279 (430) T protein:vir:92 200 TIQRQVAGFDDVLRSPKLPVLTKSTATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVK 279 (430) T ss_pred cccccchhhhhhhhcCCcccccCccCcCceeccccccccccceecccccccccccccceeeeecccceecccEEEeccee Confidence 9997 8999 59999999963322111000 0 000 011111221 Q ss_pred --------------ecc---------------------------------------------------ceeEEeecHHHh Q lcl|Aclame:pro 272 --------------EMN---------------------------------------------------GAVAVLFTSDAL 286 (402) Q Consensus 272 --------------d~~---------------------------------------------------~~~al~fh~~Av 286 (402) +|. 
-+..++||++|+ T Consensus 280 ~v~~~tkq~~~~l~~F~Vt~~~~atsv~I~paii~~~~~~~~~~~~~y~nVsaspa~~aavTvv~~a~~~~Nl~fhr~A~ 359 (430) T protein:vir:92 280 FLGQMAKNVLAQDATFSVVRVVDGTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAI 359 (430) T ss_pred eeccccccccCCccEEEEEEecCCceeEEeccccccccccccccccccceeccccccCceeEEeccCCcccceeEcccce Confidence 010 023489999976 Q ss_pred hhh--hh-------------c------ccceee--ccchhHHHHHHHHHHHhcCcccccceEEEEEEeecc Q lcl|Aclame:pro 287 LVG--RT-------------I------EVTGDI--FYEKKEKTYYIDTFMAEGAIPDRWEAVSVVTTKRDA 334 (402) Q Consensus 287 ~tv--~~-------------~------dl~~e~--~~d~~~~~d~i~~~~a~Ga~vlRPeaa~vv~~~~~~ 334 (402) .-+ .+ . .+.+.. +||.+.......==..||.+.+|||.++++-.-+.+ T Consensus 360 aLa~~pL~~~~~~~~~~~~~~~~~~~~Glsirv~~~yd~~~~~~~~r~DvLyG~~~v~Pe~a~v~l~g~~~ 430 (430) T protein:vir:92 360 RIVSQPIPANHELFAGMKTTSFSIPDVGLNGIFATQGDISTLSGLCRIALWYGVNATRPEAIGVGLPGQTA 430 (430) T ss_pred EEEEecccCCCCHHHhhhhheeccccceEEEEEEEecccccCceEEEEeeeccceecCcceEEEEcCCCCC Confidence 532 22 0 122222 234333221222223599999999988766332222 No 63 >protein:vir:2106 Length: 430 # NCBI annotation: coat protein # Family: family:all:1412 # MgeID: mge:46 # MgeName: P22 # Cross-refs: genbank:acc:NP_059630;genbank:gi:9635538;genbank:GeneID:1262831 Probab=99.39 E-value=7.4e-15 Score=98.05 Aligned_cols=293 Identities=12% Similarity=0.031 Sum_probs=165.5 Q ss_pred CCCCcccccccccccccHHHHHHHHHhHHHHHHHHHHhhhccccee-ee--c---cccceEEeeeccceeeeeecCCCCC Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDV-QT--V---TGTNTVSNKYLGETELQVLAPGQSP 74 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~alfle~f~geV~t~f~~~sv~~~~~~~-rt--i---~~Gksv~f~~iG~~t~~~~~~G~~i 74 (402) |++-- +. ++++=--|++.-|+...+|..++.+ |. . |.|+++.+|.-=.... 
..|..+ T Consensus 1 Ma~~~----------~~----~lti~~~eal~~~~n~lV~a~~~~~~r~~d~~~~r~Gdti~ip~p~~~~~---~~G~~~ 63 (430) T protein:vir:21 1 MALNE----------GQ----IVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPT---QEGWDL 63 (430) T ss_pred Ccccc----------ch----hhHHHHHHHHHHhhhhhhhhhhhhccCCchhhhhcccceEEeeccccccc---cccccc Confidence 65431 11 2222117888899999888875332 22 2 6799998874422222 224444 Q ss_pred CCC--CccccceeEeecceeeccchhhhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccccc Q lcl|Aclame:pro 75 NAT--PTQADKNQLVIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRV 152 (402) Q Consensus 75 ~~~--~~~~~e~~itID~~lya~~~IddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~ 152 (402) .++ .+...+..++||+.+--.+.+. -+| +...| ...++-+....+||..+|+.++.++..-.-.... T Consensus 64 t~~~~~~~e~~v~~~~~~~~~V~~~~~-~kE-l~~~~-~~er~l~pAm~~LA~~Vd~dl~~~~~~~~~~v~~-------- 132 (430) T protein:vir:21 64 TDKATGLLELNVAVNMGEPDNDFFQLR-ADD-LRDET-AYRRRIQSAARKLANNVELKVANMAAEMGSLVIT-------- 132 (430) T ss_pred cCCCccceeeeEeEEEeeeccceEEee-hhH-hcChh-hHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcccc-------- Confidence 433 3556778899999876555554 444 34444 3456778888999999999998887542211100 Q ss_pred cccccccccccCCccccccHHHHHHHHHHHHHHHHhhcCCcc-CcEEEeChHHHHHHhc-ccchhhcccccccCcccccc Q lcl|Aclame:pro 153 KGHGFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDIS-DVAIMMPWKFFNALRD-ADRIVDKTYTISQSGATING 230 (402) Q Consensus 153 ~g~~~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~-gR~~VV~P~~y~~Ll~-~~r~~n~d~~~~~~g~~~~G 230 (402) ...+ ..+...+ .+..+-++.+.|++..||.+ +|.++++|+.|..|.. 
-.++-+.+ ......+++| T Consensus 133 --~~~~------t~~~~~~---~~~~~A~a~~~L~~~~vP~~~~R~~~~~p~~~~~l~~~l~~~~~~~--~~~~~A~r~g 199 (430) T protein:vir:21 133 --SPDA------IGTNTAD---AWNFVADAEEIMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFG--RIPEEAYRDG 199 (430) T ss_pred --ccCC------CCCCCCc---chhhHHHHHHHHHHhcCCCCCCcEEEeChHHHHHHhhhhccccccc--cchhHHHhhc Confidence 0000 0000111 24667778889999999995 7999999999998865 34444332 1223357899 Q ss_pred eEEE-EeccE-EEecCccccccCccccccc-cc-------------cC---------------------Cccccceee-- Q lcl|Aclame:pro 231 FVLS-SYNCP-VIPSNRFPTFAQDQAHHLL-SN-------------ED---------------------NGYRYDPIA-- 271 (402) Q Consensus 231 ~V~~-iaG~~-V~~SNnlP~~~~~~t~~~l-s~-------------a~---------------------~G~~~~~~a-- 271 (402) .|++ ++||+ |++|+++|....+..+... ++ .+ .|-.+++++ T Consensus 200 ~i~r~~~Gfd~~~~s~~~~~~t~gt~t~~tv~gA~~~~~~~~tv~~~g~~~~~d~~~~~it~s~tg~l~~GD~ftiaGV~ 279 (430) T protein:vir:21 200 TIQRQVAGFDDVLRSPKLPVLTKSTATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGMKRGDKISFAGVK 279 (430) T ss_pred ccccccchhhhhhhcCCcccccCccCcCceeccccccccccceeccccccccccccceeeeeecccceecccEEEeccee Confidence 9997 89996 9999999963221111000 00 00 011112222 Q ss_pred --------------------ecc---------------------------------------------ceeEEeecHHHh Q lcl|Aclame:pro 272 --------------------EMN---------------------------------------------GAVAVLFTSDAL 286 (402) Q Consensus 272 --------------------d~~---------------------------------------------~~~al~fh~~Av 286 (402) +.+ -+..++||++|+ T Consensus 280 ~v~~itk~~~~~l~qf~V~a~~~~ttv~I~Pai~~~~~~~~~~~~~~y~nVsaspa~~aavT~v~~a~~~~Nl~fh~~A~ 359 (430) T protein:vir:21 280 FLGQMAKNVLAQDATFSVVRVVDGTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAI 359 (430) T ss_pred eeccccccccCCcceEEEEEecCCceeEEeecccccccccccccccccceeccccccCceeEEeccCCcccceeEcccee Confidence 000 012389999976 Q ss_pred hhh--hh-------------------cccceeec--cchhHHHHHHHHHHHhcCcccccceEEEEEEeecc Q lcl|Aclame:pro 287 LVG--RT-------------------IEVTGDIF--YEKKEKTYYIDTFMAEGAIPDRWEAVSVVTTKRDA 334 (402) Q Consensus 
287 ~tv--~~-------------------~dl~~e~~--~d~~~~~d~i~~~~a~Ga~vlRPeaa~vv~~~~~~ 334 (402) .-+ .+ ..+.+..+ ||.+.......==..||.+.+|||.++++-.-+.+ T Consensus 360 ~La~~pl~~p~~~~~~~~~~~~~~~~~Glsirv~~~yd~~~~~~~~r~DilyG~~~l~Pe~a~v~l~g~~~ 430 (430) T protein:vir:21 360 RIVSQPIPANHELFAGMKTTSFSIPDVGLNGIFATQGDISTLSGLCRIALWYGVNATRPEAIGVGLPGQTA 430 (430) T ss_pred EEEEecccCCCChhHhhheeeeeccccceEEEEEEccccccCceEEEEEeecCccccCcceEEEEcCCCCC Confidence 532 22 12333333 33222222222223599999999988766332222 No 64 >protein:vir:94771 Length: 298 # NCBI annotation: major head protein # Family: family:all:966 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996706;genbank:gi:45597421;genbank:GeneID:2769044 Probab=99.37 E-value=3.9e-14 Score=94.10 Aligned_cols=282 Identities=12% Similarity=0.029 Sum_probs=161.4 Q ss_pred CCCCcccccccccccccHHHHHHHHHhHHHHHHHHHHhhhcccceeeeccccceEEeeec-cceeeeeecCCCCCCCCCc Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYL-GETELQVLAPGQSPNATPT 79 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~alfle~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~~i-G~~t~~~~~~G~~i~~~~~ 79 (402) |+..... +.-++|..++++..++.|+++.+.++.++.+| +.+||++ |...+..+.-|+++....+ T Consensus 1 ma~~gG~-------------lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~-~~~~p~~~~~~~a~~v~Eg~~~~~~~~ 66 (298) T protein:vir:94 1 MVLNKGT-------------LFDPELVTDLISKVAGKSSIARLSAQKPIPFN-GEKVFTFTMDSEIDVVAESGKKTHGGV 66 (298) T ss_pred Ceecccc-------------ccChhHHHHHHHHHHhhchhhhhcceeeccCC-ceEEEEEecCcceEEeeCCcccccccc Confidence 7664422 22378899999999999999999988877664 4678876 7788888888888887777 Q ss_pred cccceeEeecceeeccchhhhHHHhh-----cCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccccccc Q lcl|Aclame:pro 80 QADKNQLVIDTTVIARNTVAHIHDVQ-----GDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKG 154 (402) Q Consensus 80 ~~~e~~itID~~lya~~~IddlDe~q-----~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~g 154 (402) ..++.++..-.+- ....|. +|++ +..+ +.+.+.+++++++++.+|+.++.-... ....+.. 
+ T Consensus 67 ~f~~v~l~~~k~~-~~~~iS--~ell~~~~~~~~~-l~~~i~~~la~ai~~~~d~~~l~G~~~-------~~g~~~~--~ 133 (298) T protein:vir:94 67 TLAPQTMVPIKVE-YGARIS--DEFMYASDEEKIN-ILQAFNDGFAKKVARGIDLMAFHGVNP-------RLGTASA--V 133 (298) T ss_pred ceeEEEEeeeEEE-Eeeehh--HHHhccCCccHHH-HHHHHHHHHHHHHHHHHHHHhhccccc-------CCCcccc--c Confidence 7777777654332 222222 2222 1223 567888999999999999988643110 0001100 1 Q ss_pred cccccccccC--CccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccccCcccccceE Q lcl|Aclame:pro 155 HGFSINVNVT--ESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATINGFV 232 (402) Q Consensus 155 ~~~~~~v~~~--~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~~~~G~V 232 (402) .+........ ..........+++.|.++..+|...+.... ..+++|..|..|.+-.. .|..|-- .....+|.. T Consensus 134 ~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~--~~vmn~~~~~~l~~lkd-~~G~~l~--~~~~~~~~~ 208 (298) T protein:vir:94 134 IGTNHFDSKVTQKVEAPRGIADPNGAIENAVELLTGVDADVT--GIAINPSFRSALAKQKD-LQGNALF--PELKWGATP 208 (298) T ss_pred ccccccccccccccccccccccHHHHHHHHHHhhhhcCCCcc--EEEEcHHHHHHHHHhhc-cCCCeee--cCcccCCCC Confidence 1110111100 111112233467788899999988877644 47999999999865111 0111110 111224556 Q ss_pred EEEeccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccchh------HHH Q lcl|Aclame:pro 233 LSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKK------EKT 306 (402) Q Consensus 233 ~~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~------~~~ 306 (402) ++++|+||+.|+++|...+... ..=+.+||++...+... +-+.++...+-++. .+. 
T Consensus 209 ~tl~G~PV~~~~~v~~~~~~~~-----------~~~~~Gdfs~~~~~~~~-------~~~~~~~~~~~~~d~~~~~~f~~ 270 (298) T protein:vir:94 209 DTINGLPVDVNKTVSDMSLTQR-----------DRAIIGDFANGFKWGYA-------KEVPLEVIQYGDPDNSGLDLKGY 270 (298) T ss_pred ceecceeeEEecccccccCCCc-----------cEEEEeeccceEEEEEe-------cCceEEEeecCCCcCcchhhhhc Confidence 7899999999999995432111 11244666654322211 11222111111111 111 Q ss_pred H--HHHHHHHhcCcccccceEEEEEEeecc Q lcl|Aclame:pro 307 Y--YIDTFMAEGAIPDRWEAVSVVTTKRDA 334 (402) Q Consensus 307 d--~i~~~~a~Ga~vlRPeaa~vv~~~~~~ 334 (402) | .+++.+-+|..++||++.+.| +.-+ T Consensus 271 ~~v~~r~~~r~~~~~~~~~a~~~l--~~~t 298 (298) T protein:vir:94 271 NQVYIRAELFLGWGILDATKFARV--TEAN 298 (298) T ss_pred CcEEEEEEEEeccEeecccceEEE--EecC Confidence 1 234556689999999986665 3322 No 65 >protein:vir:105905 Length: 304 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004375;genbank:gi:122891830;genbank:GeneID:4712376 Probab=99.35 E-value=4e-14 Score=94.06 Aligned_cols=292 Identities=10% Similarity=0.061 Sum_probs=163.8 Q ss_pred CCCCcccccccccccccHHHHHHHHHhHHHHHHHHHHhhhcccceeeeccccceEEeeec-cceeeeeecCCCCCCCCCc Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYL-GETELQVLAPGQSPNATPT 79 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~alfle~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~~i-G~~t~~~~~~G~~i~~~~~ 79 (402) |+-.. .++.+...+.+--.++=+.+..++.+...+.++++++.++.++.+ ++.+||+. 
+...+..+.-|++++...+ T Consensus 1 ma~~~-~~~~~~~~t~~gg~lip~~~~~~ii~~~~~~~~l~~~~~~~~~~~-~~~~ip~~~~~~~a~~v~E~~~~~~~~~ 78 (304) T protein:vir:10 1 MATPT-YTPGNVILSDFKNGVIPAEQGTLIMKDIMANSAIMKLAKNEPMTA-QKKKFTYLAKGVGAYWVSETERIQTSKP 78 (304) T ss_pred Ccccc-cccccccccCCCceecchhHHHHHHHHHHhccchhhhcceeeccC-CceEEEEEeCCcceEEeecCcccccccc Confidence 87664 122222222222235558899999999999999999998888765 55778876 6777888887888877777 Q ss_pred cccceeEeecceeeccchhhhHHHhh-cCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccccccccccc Q lcl|Aclame:pro 80 QADKNQLVIDTTVIARNTVAHIHDVQ-GDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKGHGFS 158 (402) Q Consensus 80 ~~~e~~itID~~lya~~~IddlDe~q-~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~g~~~~ 158 (402) ..++.++..-. +.....-.-+-.+ +.+| +.+.+.+++++++++.+|+.++.- .....+. .....+.. T Consensus 79 ~~~~i~~~~~k--~~~~~~iS~ell~ds~~~-l~~~i~~~l~~~ia~~~d~~~l~G----~g~~~~~-----~~~~~~~~ 146 (304) T protein:vir:10 79 EYAQAEMEAKK--IGVIIPLSKEFLKWTAKD-FFNEVKPLIAEAFYKAFDQAVIFG----TKSPYNT-----STSGKPLV 146 (304) T ss_pred eeeEEEEEEEE--EEEeehhhHHHHhcchHH-HHHHHHHHHHHHHHHHHHhhheec----cCCCccc-----cccccccc Confidence 77777776654 3333222222222 3456 788899999999999999887521 1111000 00000000 Q ss_pred cccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccccCcccccceEEEEecc Q lcl|Aclame:pro 159 INVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATINGFVLSSYNC 238 (402) Q Consensus 159 ~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~~~~G~V~~iaG~ 238 (402) ... ........+....|+.|.++..++...+.... .++++|..|..|.+- .+. .+.. +.+...++++|. 
T Consensus 147 ~~~-~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~--~~v~~~~~~~~L~~l---kd~----~G~~-l~~~~~~~l~G~ 215 (304) T protein:vir:10 147 EGA-EEKGNVVTDTNNLYVDLSALMATIEDEELDPN--GVLTTRSFRSKMRNA---LDA----NDRP-LFDANGNEIMGL 215 (304) T ss_pred ccc-cccccccccccchHHHHHHHHHHhhhccCCcC--EEEEcHHHHHHHHHh---hcc----CCcE-eecCCCccccce Confidence 000 01111122333458889999888888776644 468999999999752 111 1111 123344679999 Q ss_pred EEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccchhH-------HH---HH Q lcl|Aclame:pro 239 PVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKE-------KT---YY 308 (402) Q Consensus 239 ~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~~-------~~---d~ 308 (402) ||+.++++|...+.. .-+-+||++.. ++-+..+..--..+.+.+..+..+- |. .. T Consensus 216 PV~~~~~~~~~~~~~-------------~~~~gd~~~~~--~~~~~~~~i~~~~e~~~~~~~~~~~~g~~~~~f~~~~~~ 280 (304) T protein:vir:10 216 PLSYTGADVYDKKKS-------------LALMGDWDYAR--YGILQGIEYAISEDATLTTLQASDASGQPVSLFERDMFA 280 (304) T ss_pred eeEEecccccCCCCc-------------EEEEEehhhEE--EEEecceEEEEeecceeeeecccccCccchhhhhcCcEE Confidence 999999999533211 11335555431 1111111000000001111111111 11 22 Q ss_pred HHHHHHhcCcccccceEEEEEEee Q lcl|Aclame:pro 309 IDTFMAEGAIPDRWEAVSVVTTKR 332 (402) Q Consensus 309 i~~~~a~Ga~vlRPeaa~vv~~~~ 332 (402) +++.+-+|..++||++.+.|+... 
T Consensus 281 ~r~~~r~~~~v~~~~a~~~l~~a~ 304 (304) T protein:vir:10 281 LRATMHIAYMNVKPEAFATLKPTE 304 (304) T ss_pred EEEEEEeccEeecccceEEEEecC Confidence 344456899999999988776544 No 66 >protein:vir:94142 Length: 304 # NCBI annotation: ORF013 # Family: family:all:507 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240234;genbank:gi:66395898;genbank:GeneID:5133311 Probab=99.35 E-value=4e-14 Score=94.06 Aligned_cols=292 Identities=10% Similarity=0.061 Sum_probs=163.8 Q ss_pred CCCCcccccccccccccHHHHHHHHHhHHHHHHHHHHhhhcccceeeeccccceEEeeec-cceeeeeecCCCCCCCCCc Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYL-GETELQVLAPGQSPNATPT 79 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~alfle~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~~i-G~~t~~~~~~G~~i~~~~~ 79 (402) |+-.. .++.+...+.+--.++=+.+..++.+...+.++++++.++.++.+ ++.+||+. +...+..+.-|++++...+ T Consensus 1 ma~~~-~~~~~~~~t~~gg~lip~~~~~~ii~~~~~~~~l~~~~~~~~~~~-~~~~ip~~~~~~~a~~v~E~~~~~~~~~ 78 (304) T protein:vir:94 1 MATPT-YTPGNVILSDFKNGVIPAEQGTLIMKDIMANSAIMKLAKNEPMTA-QKKKFTYLAKGVGAYWVSETERIQTSKP 78 (304) T ss_pred Ccccc-cccccccccCCCceecchhHHHHHHHHHHhccchhhhcceeeccC-CceEEEEEeCCcceEEeecCcccccccc Confidence 87664 122222222222235558899999999999999999998888765 55778876 6777888887888877777 Q ss_pred cccceeEeecceeeccchhhhHHHhh-cCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccccccccccc Q lcl|Aclame:pro 80 QADKNQLVIDTTVIARNTVAHIHDVQ-GDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKGHGFS 158 (402) Q Consensus 80 ~~~e~~itID~~lya~~~IddlDe~q-~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~g~~~~ 158 (402) ..++.++..-. +.....-.-+-.+ +.+| +.+.+.+++++++++.+|+.++.- .....+. .....+.. 
T Consensus 79 ~~~~i~~~~~k--~~~~~~iS~ell~ds~~~-l~~~i~~~l~~~ia~~~d~~~l~G----~g~~~~~-----~~~~~~~~ 146 (304) T protein:vir:94 79 EYAQAEMEAKK--IGVIIPLSKEFLKWTAKD-FFNEVKPLIAEAFYKAFDQAVIFG----TKSPYNT-----STSGKPLV 146 (304) T ss_pred eeeEEEEEEEE--EEEeehhhHHHHhcchHH-HHHHHHHHHHHHHHHHHHhhheec----cCCCccc-----cccccccc Confidence 77777776654 3333222222222 3456 788899999999999999887521 1111000 00000000 Q ss_pred cccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccccCcccccceEEEEecc Q lcl|Aclame:pro 159 INVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATINGFVLSSYNC 238 (402) Q Consensus 159 ~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~~~~G~V~~iaG~ 238 (402) ... ........+....|+.|.++..++...+.... .++++|..|..|.+- .+. .+.. +.+...++++|. T Consensus 147 ~~~-~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~--~~v~~~~~~~~L~~l---kd~----~G~~-l~~~~~~~l~G~ 215 (304) T protein:vir:94 147 EGA-EEKGNVVTDTNNLYVDLSALMATIEDEELDPN--GVLTTRSFRSKMRNA---LDA----NDRP-LFDANGNEIMGL 215 (304) T ss_pred ccc-cccccccccccchHHHHHHHHHHhhhccCCcC--EEEEcHHHHHHHHHh---hcc----CCcE-eecCCCccccce Confidence 000 01111122333458889999888888776644 468999999999752 111 1111 123344679999 Q ss_pred EEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccchhH-------HH---HH Q lcl|Aclame:pro 239 PVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKE-------KT---YY 308 (402) Q Consensus 239 ~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~~-------~~---d~ 308 (402) ||+.++++|...+.. .-+-+||++.. ++-+..+..--..+.+.+..+..+- |. .. 
T Consensus 216 PV~~~~~~~~~~~~~-------------~~~~gd~~~~~--~~~~~~~~i~~~~e~~~~~~~~~~~~g~~~~~f~~~~~~ 280 (304) T protein:vir:94 216 PLSYTGADVYDKKKS-------------LALMGDWDYAR--YGILQGIEYAISEDATLTTLQASDASGQPVSLFERDMFA 280 (304) T ss_pred eeEEecccccCCCCc-------------EEEEEehhhEE--EEEecceEEEEeecceeeeecccccCccchhhhhcCcEE Confidence 999999999533211 11335555431 1111111000000001111111111 11 22 Q ss_pred HHHHHHhcCcccccceEEEEEEee Q lcl|Aclame:pro 309 IDTFMAEGAIPDRWEAVSVVTTKR 332 (402) Q Consensus 309 i~~~~a~Ga~vlRPeaa~vv~~~~ 332 (402) +++.+-+|..++||++.+.|+... T Consensus 281 ~r~~~r~~~~v~~~~a~~~l~~a~ 304 (304) T protein:vir:94 281 LRATMHIAYMNVKPEAFATLKPTE 304 (304) T ss_pred EEEEEEeccEeecccceEEEEecC Confidence 344456899999999988776544 No 67 >protein:vir:80684 Length: 315 # NCBI annotation: gp6 # Family: family:all:966 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285582;genbank:gi:148727088;genbank:GeneID:5247055 Probab=99.33 E-value=9.9e-14 Score=91.88 Aligned_cols=294 Identities=10% Similarity=0.009 Sum_probs=159.5 Q ss_pred CCCCcccccccccccccHHHHHHHHHhHHHHHHHHHHhhhcccceeeeccccceEEeee-ccceeeeeecCCCCCCCCCc Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKY-LGETELQVLAPGQSPNATPT 79 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~alfle~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~~-iG~~t~~~~~~G~~i~~~~~ 79 (402) |+...+.+... 
+.-++|++++++..++.|+++.+.++..+.+ ..++||+ .|...+..+.-|+.+....+ T Consensus 1 Ma~~~~~~gg~---------~vP~~~~~~ii~~l~~~s~i~~l~~~i~~~~-~~~~ip~~~~~~~a~wv~Eg~~~~~s~~ 70 (315) T protein:vir:80 1 MADDFLSAGKL---------ELPGSMIGAVRDRAIDSGVLAKLSPEQPTIF-GPVKGAVFSGVPRAKIVGEGEVKPSASV 70 (315) T ss_pred CCCCcCCcCce---------EcchHHHHHHHHHHHhhchhhhhcceeecCC-CceEEEEEeCCcceEEeeCCcccccccc Confidence 88765433211 2238899999999999999999988776654 5677887 47788888888888887777 Q ss_pred cccceeEeecceeeccc-hhhhHHHhh-c-Ccc---chhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccccccc Q lcl|Aclame:pro 80 QADKNQLVIDTTVIARN-TVAHIHDVQ-G-DID---SLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVK 153 (402) Q Consensus 80 ~~~e~~itID~~lya~~-~IddlDe~q-~-~~D---~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~ 153 (402) ..++.++..-. ++.. .|. +|.. . ..| .+++.+.++++++|++.+|+.++.-- .+....+ .. T Consensus 71 ~f~~v~l~~~k--l~~~~~iS--~ell~~s~~~~~~~l~~~i~~~la~ai~~~~d~a~~~G~-------~~~~~~~--~~ 137 (315) T protein:vir:80 71 DVSAFTAQPIK--VVTQQRVS--DEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGI-------DPATGKA--AS 137 (315) T ss_pred ceeeeEeeeee--EEeeehhh--HHHhhcCchhHHHHHHHHHHHHHHHHHHHHHhhheeecc-------CCCCCcc--cc Confidence 77777765433 3222 221 2222 1 111 26788999999999999998876210 0000000 00 Q ss_pred ccccccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhccc----chhhcccccccCccccc Q lcl|Aclame:pro 154 GHGFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDAD----RIVDKTYTISQSGATIN 229 (402) Q Consensus 154 g~~~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~----r~~n~d~~~~~~g~~~~ 229 (402) +...... . ...........++-|.++...+...+.-.... .+++|..+..|.+-. +-.+..+-. ..... 
T Consensus 138 ~~~~~~~--~-~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~-~imn~~~~~~L~~l~~~~g~~~~g~~~~---~~~~~ 210 (315) T protein:vir:80 138 AVHTSLN--K-TKNIVDATDSATADLVKAVGLIAGAGLQVPNG-VALDPAFSFALSTEVYPKGSPLAGQPMY---PAAGF 210 (315) T ss_pred ccccccc--c-ccceeeccccchHHHHHHHHHHhhccCccceE-EEEcHHHHHHHHHHhhccCCcccccccc---ccccc Confidence 1111100 0 11111111123455666666665444433333 578999999996532 222222211 11224 Q ss_pred ceEEEEeccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccc--hh---- Q lcl|Aclame:pro 230 GFVLSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYE--KK---- 303 (402) Q Consensus 230 G~V~~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d--~~---- 303 (402) |..++++|.||+.|+++|........ ....=+.+||++..- .+ .+-+. .+..++ +. T Consensus 211 g~~~tl~G~PV~~~~~~~~~~~~~~~--------~~~~~~~GDfs~~~~-g~-------~~~~~--i~i~~~~~~~~~~~ 272 (315) T protein:vir:80 211 AGLDNWRGLNVGASSTVSGAPEMSPA--------SGVKAIVGDFSRVHW-GF-------QRNFP--IELIEYGDPDQTGR 272 (315) T ss_pred CCCceecceeeEecCcCCcccccccc--------cccEEEEeecccEEE-EE-------ecCee--EEEeccccccCccc Confidence 45578999999999999964422111 000114467776321 11 11122 222111 11 Q ss_pred --HHH--HHHHHHHHhcCcccccceEEEEEEeeccCccccccc Q lcl|Aclame:pro 304 --EKT--YYIDTFMAEGAIPDRWEAVSVVTTKRDATTGDAGGP 342 (402) Q Consensus 304 --~~~--d~i~~~~a~Ga~vlRPeaa~vv~~~~~~t~~~a~~~ 342 (402) .+. 
-.+++.+-+|.+++||++.+.|+-+....+....++ T Consensus 273 ~~~~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~a~~~~~~~~~ 315 (315) T protein:vir:80 273 DLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAPKPNPPAEN 315 (315) T ss_pred chhhcCcEEEEEEEEecceeecccceEEEeeccCCCCCCCCCC Confidence 111 123444568999999999877654433222222233 No 68 >protein:vir:191 Length: 385 # NCBI annotation: major head subunit precursor # Family: family:all:585 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037701;genbank:gi:9634158;genbank:GeneID:1262530 Probab=99.32 E-value=3.1e-14 Score=94.68 Aligned_cols=285 Identities=14% Similarity=0.052 Sum_probs=159.7 Q ss_pred CCCCcccc-c-ccccccccHHHHHHHHHhHHHHHHHHHHhhhcccceeeeccccceEEeeec-c-ceeeeeecCCCCCCC Q lcl|Aclame:pro 1 MSTPNTLT-N-VAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYL-G-ETELQVLAPGQSPNA 76 (402) Q Consensus 1 Ms~~n~~t-~-~~~~~~~d~~alfle~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~~i-G-~~t~~~~~~G~~i~~ 76 (402) +....... + ....++++.-.+..+.+..+++......+.++++.++.++.+ .++++++. + ..++..+.-|+.+.. T Consensus 93 ~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~-~~~~~~~~~~~~~~a~~v~E~~~~~~ 171 (385) T protein:vir:19 93 QGTFGAKTFNKSLGSDADSAGSLIQPMQIPGIIMPGLRRLTIRDLLAQGRTSS-NALEYVREEVFTNNADVVAEKALKPE 171 (385) T ss_pred hccchhhHHHhhhccccccCCceecchhhhHHHHHhhhccchhhhcceecccC-cceEEEEEecCCcceeeeccCccccc Confidence 10000000 0 011112222234557888999999999999999998887754 57888876 3 456667777888877 Q ss_pred CCccccceeEeecceeeccchhhhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccccccccc Q lcl|Aclame:pro 77 TPTQADKNQLVIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKGHG 156 (402) Q Consensus 77 ~~~~~~e~~itID~~lya~~~IddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~g~~ 156 (402) ..+...+.++.+..+- ....|.+ ++..+...+.+.+.+++++++++.+|+.++. +.....+ + .|.. 
T Consensus 172 ~~~~~~~~~~~~~k~~-~~~~is~--ell~d~~~l~~~i~~~la~a~~~~~d~~~l~----G~g~~~~-----~--~Gi~ 237 (385) T protein:vir:19 172 SDITFSKQTANVKTIA-HWVQASR--QVMDDAPMLQSYINNRLMYGLALKEEGQLLN----GDGTGDN-----L--EGLN 237 (385) T ss_pred cccceeEEEEeeeeEE-EeehhhH--HHHhhHHHHHHHHHHHHHHHHHHHHHHHHHh----ccCCCCc-----c--cccc Confidence 7777777777777643 1122221 1222222367788889999999999988762 2111111 0 0111 Q ss_pred cccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccccCcccccceEEEEe Q lcl|Aclame:pro 157 FSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATINGFVLSSY 236 (402) Q Consensus 157 ~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~~~~G~V~~ia 236 (402) .... ........+....++.|.++..+|...+.... .++++|..|..|.+-.. .+..|-. .+ ...|...+++ T Consensus 238 ~~~~--~~~~~~~~~~~~~~d~i~~~~~~l~~~~~~~~--~~~~~~~~~~~l~~lkd-~~G~~l~--~~-~~~~~~~~l~ 309 (385) T protein:vir:19 238 KVAT--AYDTSLNATGDTRADIIAHAIYQVTESEFSAS--GIVLNPRDWHNIALLKD-NEGRYIF--GG-PQAFTSNIMW 309 (385) T ss_pred cccc--cccccccccccchHHHHHHHHHhhccccCCCC--EEEEcHHHHHHHHHhhc-CCCceec--cC-cccCCCceec Confidence 0000 01111112233457888888888877665533 56899999999865221 1111110 11 1234556789 Q ss_pred ccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccchh----HHHHHHHHH Q lcl|Aclame:pro 237 NCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKK----EKTYYIDTF 312 (402) Q Consensus 237 G~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~----~~~d~i~~~ 312 (402) |.||+.|+++|... -+-+||++..- ++. -.+++.+..+... +-...+++. 
T Consensus 310 G~pV~~~~~~p~~~-----------------~~~gd~~~~~~-~~~--------~~~~~v~~~~~~~~~~~~~~~~~~~~ 363 (385) T protein:vir:19 310 GLPVVPTKAQAAGT-----------------FTVGGFDMASQ-VWD--------RMDATVEVSREDRDNFVKNMLTILCE 363 (385) T ss_pred ceeeEEcCcCCCCc-----------------EEEeecccEEE-EEE--------ecceEEEEeccccchhhcCcEEEEEE Confidence 99999999999532 12234433221 221 1233333332221 111234556 Q ss_pred HHhcCcccccceEEEEEEeecc Q lcl|Aclame:pro 313 MAEGAIPDRWEAVSVVTTKRDA 334 (402) Q Consensus 313 ~a~Ga~vlRPeaa~vv~~~~~~ 334 (402) +-||..+++|++.+.++++..+ T Consensus 364 ~r~~~~v~~~~a~~~~~~~aa~ 385 (385) T protein:vir:19 364 ERLALAHYRPTAIIKGTFSSGS 385 (385) T ss_pred EeeccEEecccceEEEEeccCC Confidence 6689999999998888776544 No 69 >protein:vir:1886 Length: 385 # NCBI annotation: major capsid subunit precursor # Family: family:all:585 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037666;genbank:gi:9634124;genbank:GeneID:1262513 Probab=99.32 E-value=3.1e-14 Score=94.68 Aligned_cols=285 Identities=14% Similarity=0.052 Sum_probs=159.7 Q ss_pred CCCCcccc-c-ccccccccHHHHHHHHHhHHHHHHHHHHhhhcccceeeeccccceEEeeec-c-ceeeeeecCCCCCCC Q lcl|Aclame:pro 1 MSTPNTLT-N-VAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYL-G-ETELQVLAPGQSPNA 76 (402) Q Consensus 1 Ms~~n~~t-~-~~~~~~~d~~alfle~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~~i-G-~~t~~~~~~G~~i~~ 76 (402) +....... + ....++++.-.+..+.+..+++......+.++++.++.++.+ .++++++. + ..++..+.-|+.+.. 
T Consensus 93 ~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~-~~~~~~~~~~~~~~a~~v~E~~~~~~ 171 (385) T protein:vir:18 93 QGTFGAKTFNKSLGSDADSAGSLIQPMQIPGIIMPGLRRLTIRDLLAQGRTSS-NALEYVREEVFTNNADVVAEKALKPE 171 (385) T ss_pred hccchhhHHHhhhccccccCCceecchhhhHHHHHhhhccchhhhcceecccC-cceEEEEEecCCcceeeeccCccccc Confidence 10000000 0 011112222234557888999999999999999998887754 57888876 3 456667777888877 Q ss_pred CCccccceeEeecceeeccchhhhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccccccccc Q lcl|Aclame:pro 77 TPTQADKNQLVIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKGHG 156 (402) Q Consensus 77 ~~~~~~e~~itID~~lya~~~IddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~g~~ 156 (402) ..+...+.++.+..+- ....|.+ ++..+...+.+.+.+++++++++.+|+.++. +.....+ + .|.. T Consensus 172 ~~~~~~~~~~~~~k~~-~~~~is~--ell~d~~~l~~~i~~~la~a~~~~~d~~~l~----G~g~~~~-----~--~Gi~ 237 (385) T protein:vir:18 172 SDITFSKQTANVKTIA-HWVQASR--QVMDDAPMLQSYINNRLMYGLALKEEGQLLN----GDGTGDN-----L--EGLN 237 (385) T ss_pred cccceeEEEEeeeeEE-EeehhhH--HHHhhHHHHHHHHHHHHHHHHHHHHHHHHHh----ccCCCCc-----c--cccc Confidence 7777777777777643 1122221 1222222367788889999999999988762 2111111 0 0111 Q ss_pred cccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccccCcccccceEEEEe Q lcl|Aclame:pro 157 FSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATINGFVLSSY 236 (402) Q Consensus 157 ~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~~~~G~V~~ia 236 (402) .... ........+....++.|.++..+|...+.... .++++|..|..|.+-.. .+..|-. 
.+ ...|...+++ T Consensus 238 ~~~~--~~~~~~~~~~~~~~d~i~~~~~~l~~~~~~~~--~~~~~~~~~~~l~~lkd-~~G~~l~--~~-~~~~~~~~l~ 309 (385) T protein:vir:18 238 KVAT--AYDTSLNATGDTRADIIAHAIYQVTESEFSAS--GIVLNPRDWHNIALLKD-NEGRYIF--GG-PQAFTSNIMW 309 (385) T ss_pred cccc--cccccccccccchHHHHHHHHHhhccccCCCC--EEEEcHHHHHHHHHhhc-CCCceec--cC-cccCCCceec Confidence 0000 01111112233457888888888877665533 56899999999865221 1111110 11 1234556789 Q ss_pred ccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccchh----HHHHHHHHH Q lcl|Aclame:pro 237 NCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKK----EKTYYIDTF 312 (402) Q Consensus 237 G~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~----~~~d~i~~~ 312 (402) |.||+.|+++|... -+-+||++..- ++. -.+++.+..+... +-...+++. T Consensus 310 G~pV~~~~~~p~~~-----------------~~~gd~~~~~~-~~~--------~~~~~v~~~~~~~~~~~~~~~~~~~~ 363 (385) T protein:vir:18 310 GLPVVPTKAQAAGT-----------------FTVGGFDMASQ-VWD--------RMDATVEVSREDRDNFVKNMLTILCE 363 (385) T ss_pred ceeeEEcCcCCCCc-----------------EEEeecccEEE-EEE--------ecceEEEEeccccchhhcCcEEEEEE Confidence 99999999999532 12234433221 221 1233333332221 111234556 Q ss_pred HHhcCcccccceEEEEEEeecc Q lcl|Aclame:pro 313 MAEGAIPDRWEAVSVVTTKRDA 334 (402) Q Consensus 313 ~a~Ga~vlRPeaa~vv~~~~~~ 334 (402) +-||..+++|++.+.++++..+ T Consensus 364 ~r~~~~v~~~~a~~~~~~~aa~ 385 (385) T protein:vir:18 364 ERLALAHYRPTAIIKGTFSSGS 385 (385) T ss_pred EeeccEEecccceEEEEeccCC Confidence 6689999999998888776544 No 70 >protein:vir:96392 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239648;genbank:gi:66395381;genbank:GeneID:5132868 Probab=99.31 E-value=7.1e-14 Score=92.67 Aligned_cols=288 Identities=10% Similarity=0.017 Sum_probs=160.4 Q ss_pred CCCCcccccccccccccHHHHHHHHHhHHHHHHHHHHhhhcccceeeeccccceEEeeec-cceeeeeecCCCCCCCCCc Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYL-GETELQVLAPGQSPNATPT 79 (402) Q Consensus 
1 Ms~~n~~t~~~~~~~~d~~alfle~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~~i-G~~t~~~~~~G~~i~~~~~ 79 (402) |-.......-+...+.+...+.=+.|..++++.....+.++++.++.++.+ .+++||+. +...+..+..|+.++...+ T Consensus 18 ~~~~~~~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~l~~~~~~~~-~~~~~p~~~~~~~a~~v~Eg~~~~~~~~ 96 (324) T protein:vir:96 18 NVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEG-TEKKFTFWADKPGAYWVGEGQKIETSKA 96 (324) T ss_pred hhhhhhhccccccccCcCccccchhHHHHHHHHHHhhchhhhhcceeeccC-CceEEEEEecCcceeEecCCcccccccc Confidence 110000000000001111123338899999999999999999988877664 56788876 7778888888888888778 Q ss_pred cccceeEeecceeeccchhhh--HHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccccccccccc Q lcl|Aclame:pro 80 QADKNQLVIDTTVIARNTVAH--IHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKGHGF 157 (402) Q Consensus 80 ~~~e~~itID~~lya~~~Idd--lDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~g~~~ 157 (402) ...+.++..-.+- .-..|.+ +++ +.+| +.+.+.+++++++++.+|+.++.- .... ....+. T Consensus 97 ~~~~v~~~~~k~~-~~~~is~ell~d--s~~~-l~~~i~~~la~ai~~~~d~a~l~G----~g~~---------~~~~gi 159 (324) T protein:vir:96 97 TWVNATMRAFKLG-VILPVTKEFLNY--TYSQ-FFEEMKPMIAEAFYKKFDEAGILN----QGNN---------PFGKSI 159 (324) T ss_pred ceeEEEEeeEEEE-EeehhhHHHHhc--chHH-HHHHHHHHHHHHHHHHHHHHHhcc----CCCC---------CcCccc Confidence 8888777765432 2222322 222 2356 788999999999999999988631 1100 000111 Q ss_pred ccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccccCcccccceEEEEec Q lcl|Aclame:pro 158 SINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATINGFVLSSYN 237 (402) Q Consensus 158 ~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~~~~G~V~~iaG 237 (402) ....... .........++.|.++..+|...+.... .++++|..|..|.+-. 
+.+ +...+.++...+++| T Consensus 160 ~~~~~~~--~~~~~~~~t~~~i~~~~~~l~~~~~~~~--~~vmn~~~~~~L~~l~---d~~----G~~~~~~~~~~~l~G 228 (324) T protein:vir:96 160 AQSIEKT--NKVIKGDFTQDNIIDLEALLEDDELEAN--AFISKTQNRSLLRKIV---DPE----TKERIYDRNSDSLDG 228 (324) T ss_pred ccccccc--ceeccccccHHHHHHHHHhhhhccCCCC--EEEEcHHHHHHHHHhh---ccC----CCeeecCCCCCcccc Confidence 1111110 1111112237778888888887776433 4689999999986521 111 122233455567899 Q ss_pred cEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccchh-------------H Q lcl|Aclame:pro 238 CPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKK-------------E 304 (402) Q Consensus 238 ~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~-------------~ 304 (402) +||+.++..+...+ .-+-+||++.. +.. -.++..+..++.. . T Consensus 229 ~PV~~~~~~~~~~~---------------~~~~gd~~~~~-~g~---------~~~~~i~~~~~~~~~~~~~~~~~~~~~ 283 (324) T protein:vir:96 229 LPVVNLKSSNLKRG---------------ELITGDFDKLI-YGI---------PQLIEYKIDETAQLSTVKNEDGTPVNL 283 (324) T ss_pred eeeEeeCCCCCCcc---------------eEEEEecceEE-EEE---------ecCcEEEEeecccccccccccccchhh Confidence 99998876543211 11335555422 111 1222333322211 0 Q ss_pred H---HHHHHHHHHhcCcccccceEEEEEEeeccCccccccc Q lcl|Aclame:pro 305 K---TYYIDTFMAEGAIPDRWEAVSVVTTKRDATTGDAGGP 342 (402) Q Consensus 305 ~---~d~i~~~~a~Ga~vlRPeaa~vv~~~~~~t~~~a~~~ 342 (402) | ...+++.+-||.+++||++.+.|+.....+++..++. 
T Consensus 284 f~~d~~~~r~~~r~d~~v~~~~A~~~l~~a~~~~~~~~~~~ 324 (324) T protein:vir:96 284 FEQDMVALRATMHVALHIADDKAFAKLVPADKRTDSVPGEV 324 (324) T ss_pred hhcCcEEEEEEEEEccEEecccceEEEecccccCCCCCCCC Confidence 1 1233445568999999999888766443333322222 No 71 >protein:vir:78830 Length: 324 # NCBI annotation: major head protein # Family: family:all:507 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285361;genbank:gi:148717889;genbank:GeneID:5246961 Probab=99.31 E-value=7.1e-14 Score=92.67 Aligned_cols=288 Identities=10% Similarity=0.017 Sum_probs=160.4 Q ss_pred CCCCcccccccccccccHHHHHHHHHhHHHHHHHHHHhhhcccceeeeccccceEEeeec-cceeeeeecCCCCCCCCCc Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYL-GETELQVLAPGQSPNATPT 79 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~alfle~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~~i-G~~t~~~~~~G~~i~~~~~ 79 (402) |-.......-+...+.+...+.=+.|..++++.....+.++++.++.++.+ .+++||+. +...+..+..|+.++...+ T Consensus 18 ~~~~~~~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~l~~~~~~~~-~~~~~p~~~~~~~a~~v~Eg~~~~~~~~ 96 (324) T protein:vir:78 18 NVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEG-TEKKFTFWADKPGAYWVGEGQKIETSKA 96 (324) T ss_pred hhhhhhhccccccccCcCccccchhHHHHHHHHHHhhchhhhhcceeeccC-CceEEEEEecCcceeEecCCcccccccc Confidence 110000000000001111123338899999999999999999988877664 56788876 7778888888888888778 Q ss_pred cccceeEeecceeeccchhhh--HHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccccccccccc Q lcl|Aclame:pro 80 QADKNQLVIDTTVIARNTVAH--IHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKGHGF 157 (402) Q Consensus 80 ~~~e~~itID~~lya~~~Idd--lDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~g~~~ 157 (402) ...+.++..-.+- .-..|.+ +++ +.+| +.+.+.+++++++++.+|+.++.- .... ....+. 
T Consensus 97 ~~~~v~~~~~k~~-~~~~is~ell~d--s~~~-l~~~i~~~la~ai~~~~d~a~l~G----~g~~---------~~~~gi 159 (324) T protein:vir:78 97 TWVNATMRAFKLG-VILPVTKEFLNY--TYSQ-FFEEMKPMIAEAFYKKFDEAGILN----QGNN---------PFGKSI 159 (324) T ss_pred ceeEEEEeeEEEE-EeehhhHHHHhc--chHH-HHHHHHHHHHHHHHHHHHHHHhcc----CCCC---------CcCccc Confidence 8888777765432 2222322 222 2356 788999999999999999988631 1100 000111 Q ss_pred ccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccccCcccccceEEEEec Q lcl|Aclame:pro 158 SINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATINGFVLSSYN 237 (402) Q Consensus 158 ~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~~~~G~V~~iaG 237 (402) ....... .........++.|.++..+|...+.... .++++|..|..|.+-. +.+ +...+.++...+++| T Consensus 160 ~~~~~~~--~~~~~~~~t~~~i~~~~~~l~~~~~~~~--~~vmn~~~~~~L~~l~---d~~----G~~~~~~~~~~~l~G 228 (324) T protein:vir:78 160 AQSIEKT--NKVIKGDFTQDNIIDLEALLEDDELEAN--AFISKTQNRSLLRKIV---DPE----TKERIYDRNSDSLDG 228 (324) T ss_pred ccccccc--ceeccccccHHHHHHHHHhhhhccCCCC--EEEEcHHHHHHHHHhh---ccC----CCeeecCCCCCcccc Confidence 1111110 1111112237778888888887776433 4689999999986521 111 122233455567899 Q ss_pred cEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccchh-------------H Q lcl|Aclame:pro 238 CPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKK-------------E 304 (402) Q Consensus 238 ~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~-------------~ 304 (402) +||+.++..+...+ .-+-+||++.. +.. -.++..+..++.. . 
T Consensus 229 ~PV~~~~~~~~~~~---------------~~~~gd~~~~~-~g~---------~~~~~i~~~~~~~~~~~~~~~~~~~~~ 283 (324) T protein:vir:78 229 LPVVNLKSSNLKRG---------------ELITGDFDKLI-YGI---------PQLIEYKIDETAQLSTVKNEDGTPVNL 283 (324) T ss_pred eeeEeeCCCCCCcc---------------eEEEEecceEE-EEE---------ecCcEEEEeecccccccccccccchhh Confidence 99998876543211 11335555422 111 1222333322211 0 Q ss_pred H---HHHHHHHHHhcCcccccceEEEEEEeeccCccccccc Q lcl|Aclame:pro 305 K---TYYIDTFMAEGAIPDRWEAVSVVTTKRDATTGDAGGP 342 (402) Q Consensus 305 ~---~d~i~~~~a~Ga~vlRPeaa~vv~~~~~~t~~~a~~~ 342 (402) | ...+++.+-||.+++||++.+.|+.....+++..++. T Consensus 284 f~~d~~~~r~~~r~d~~v~~~~A~~~l~~a~~~~~~~~~~~ 324 (324) T protein:vir:78 284 FEQDMVALRATMHVALHIADDKAFAKLVPADKRTDSVPGEV 324 (324) T ss_pred hhcCcEEEEEEEEEccEEecccceEEEecccccCCCCCCCC Confidence 1 1233445568999999999888766443333322222 No 72 >protein:vir:104085 Length: 320 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655596;genbank:gi:109392467;genbank:GeneID:4156953 Probab=99.30 E-value=1.5e-13 Score=90.94 Aligned_cols=294 Identities=11% Similarity=0.001 Sum_probs=156.0 Q ss_pred CCCCcc---cccccccc-cccHHHHHHHHHhHHHHHHHHHHhhhcccceeeeccccceEEeeec-cceeeeeecCCCCCC Q lcl|Aclame:pro 1 MSTPNT---LTNVAVSA-SGEVDSLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYL-GETELQVLAPGQSPN 75 (402) Q Consensus 1 Ms~~n~---~t~~~~~~-~~d~~alfle~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~~i-G~~t~~~~~~G~~i~ 75 (402) |.-... ..+..... +++.-.+.-..+..++++...+.++++++.++.++.+ .+.+||+. +...+..+.-|+++. 
T Consensus 1 ~~~~~~~~~~~~~~~~t~~~~~~~~ip~~~~~~ii~~~~~~s~l~~~~~~~~~~~-~~~~~p~~~~~~~a~~v~E~~~~~ 79 (320) T protein:vir:10 1 MAAGTAFQVDHAQIAQTGDTMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKVPMGT-TGQKIPHWIGDVSAQWIGEGDMKP 79 (320) T ss_pred CCCCccCCHHHHHhhccccccccccccHHHHHHHHHHHHhccchhhhcceeeccC-CceEEEEEeCCcceEEecCCcccc Confidence 655432 11222211 1121224458899999999999999999988777754 56788875 677888888888888 Q ss_pred CCCccccceeEeecceeeccchhhhHHHhh-cCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccccccc Q lcl|Aclame:pro 76 ATPTQADKNQLVIDTTVIARNTVAHIHDVQ-GDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKG 154 (402) Q Consensus 76 ~~~~~~~e~~itID~~lya~~~IddlDe~q-~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~g 154 (402) ...++..+.++.+-. +.....-.-+-.+ +..| +.+.+.+++++++++.+|+.++. +.... .+....+ T Consensus 80 ~~~~~f~~v~~~~~k--~~~~~~is~ell~ds~~~-l~~~i~~~l~~a~a~~~d~a~l~----G~g~~-----~~~~~~~ 147 (320) T protein:vir:10 80 ITKGNMTSQNIAPHK--IATIFVASAETVRANPAN-YLGTMRTKVATAFAMAFDSAALN----GTDSP-----FPTYLAQ 147 (320) T ss_pred ccccceeEEEEeeEE--EEEeehhhHHHHhcChHH-HHHHHHHHHHHHHHHHHHHHhhc----ccCCC-----CCccccc Confidence 777777777666654 3333222222222 3456 78899999999999999998752 11100 0000011 Q ss_pred cccccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhc--c--cchhhcccccccCcccccc Q lcl|Aclame:pro 155 HGFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRD--A--DRIVDKTYTISQSGATING 230 (402) Q Consensus 155 ~~~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~--~--~r~~n~d~~~~~~g~~~~G 230 (402) ..........+.....+...+-+.+.++...+...+.+ .-+.|++|..|..|.+ | .+.+..+....+...... 
T Consensus 148 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~~~- 224 (320) T protein:vir:10 148 TTKSVSLADPGGATASDLTAYDAVAVNGLSLLVNAKKK--WTHTLLDDIVEPILNGAKDKNGRPLFIESTYTDENSPFR- 224 (320) T ss_pred ccccccceecccccccccccHHHHHHHHHhhhhcccCC--CcEEEEcHHHHHHHHHhhccCCceeeccccccCcccccc- Confidence 11111111111111111122223455566666655444 3366889999999964 2 122211111111111112 Q ss_pred eEEEEeccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccch-------- Q lcl|Aclame:pro 231 FVLSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEK-------- 302 (402) Q Consensus 231 ~V~~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~-------- 302 (402) -+++.|++|+.|+++|..... -+-+||++.. +. .-.++..+..++. T Consensus 225 -~~~i~g~pv~~~~~~~~~~~~---------------~~~gd~~~~~-~~---------~~~~~~i~~~~~~~~~~~~~~ 278 (320) T protein:vir:10 225 -AGRIVSRPTILSDHVADGTTV---------------GYMGDFRNVI-WG---------QVGGLSFDVTDQATLNLGTPT 278 (320) T ss_pred -CceeeeeeeEecCCCCCCceE---------------EEEeecceEE-EE---------EecCeEEEEeecceeeecccc Confidence 246899999999999853210 1234554432 11 1112222222111 Q ss_pred --------hHHHHHHHHHHHhcCcccccceEEEEEEeeccCcccc Q lcl|Aclame:pro 303 --------KEKTYYIDTFMAEGAIPDRWEAVSVVTTKRDATTGDA 339 (402) Q Consensus 303 --------~~~~d~i~~~~a~Ga~vlRPeaa~vv~~~~~~t~~~a 339 (402) .+-.-.+++.+-+|.+++||+|.+.|+- .+ + +.| T Consensus 279 ~~~~~~~f~~~~~~~r~~~~~d~~v~~~~a~~~l~~-~~-a-p~~ 320 (320) T protein:vir:10 279 EPNFVSLWQHNLVAVRVEAEYAFHNNDKDAFVKLTN-VV-T-PDA 320 (320) T ss_pred ccccchhhhcCcEEEEEEEeeccEEecccceEEEEe-cc-C-CCC Confidence 0111224555668999999999877641 12 2 222 No 73 >protein:vir:97053 Length: 390 # NCBI annotation: putative head protein # Family: family:all:585 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453565;genbank:gi:84662600;genbank:GeneID:5142468 Probab=99.30 E-value=5e-14 Score=93.52 Aligned_cols=280 Identities=13% Similarity=0.046 Sum_probs=165.1 Q ss_pred 
CCCC--cccccccccccccHHHHHHHHHhHHHHHHHHHHhhhcccceeeeccccceEEeeec-c-ceeeeeecCCCCCCC Q lcl|Aclame:pro 1 MSTP--NTLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYL-G-ETELQVLAPGQSPNA 76 (402) Q Consensus 1 Ms~~--n~~t~~~~~~~~d~~alfle~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~~i-G-~~t~~~~~~G~~i~~ 76 (402) +... ....+.....+++.-.+..+.+..+++......+.+++++++.++.+ .+.++++. + ..++..+.-|+.+.. T Consensus 102 ~~~~~~~~~~~~~~~~~~~~g~lip~~~~~~ii~~~~~~~~i~~~~~~~~~~~-~~~~~~~~~~~~~~a~~v~Eg~~~~~ 180 (390) T protein:vir:97 102 ATMNIKAALNTASTDAAGSAGALTTPNRLPGFITPPDARLTVRDLIGSGRTDS-ALIEYVQETGFVNNAAIVAEGALKPE 180 (390) T ss_pred hhhHHHHHHHhhhcccccccccccchhhhHHHHHHHhhhhhhHhhcceeeccC-CceEEEEEecCCcceeeecCCccccc Confidence 1100 00111111222333346668899999999999999999988887765 46778876 3 356778888888877 Q ss_pred CCccccceeEeecceeeccchhhh--HHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccccccc Q lcl|Aclame:pro 77 TPTQADKNQLVIDTTVIARNTVAH--IHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKG 154 (402) Q Consensus 77 ~~~~~~e~~itID~~lya~~~Idd--lDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~g 154 (402) ..+...+.++.+..+- .-..|.+ +++. .+ +.+.+.+++++++++++|+.++. +..... .+ .| T Consensus 181 ~~~~~~~i~~~~~k~~-~~~~is~ell~ds---~~-l~~~i~~~la~a~~~~~d~a~l~----G~g~~~-----~p--~G 244 (390) T protein:vir:97 181 SSLKFAKKTDTTHVIA-HTMKATRQILSDA---PQ-LASYMNNRLIRGLKVKEDAEILR----GTGAND-----GL--LG 244 (390) T ss_pred cccceeEEEEeeeeEE-EeehhhHHHHHhH---HH-HHHHHHHHHHHHHHHHHHHHHhh----cCCCCc-----cc--cc Confidence 7788888888888643 2222322 2222 23 67889999999999999998762 111110 01 11 Q ss_pred cccccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccccCcccccceEEE Q lcl|Aclame:pro 155 HGFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATINGFVLS 234 (402) Q Consensus 155 ~~~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~~~~G~V~~ 234 (402) .... .+........+....++.|.++..++...+.+.. .+|++|..|..|.+-.. 
.+..|-- .++ .++...+ T Consensus 245 i~~~--~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~--~~v~n~~~~~~L~~lkd-~~G~~l~-~~~--~~~~~~~ 316 (390) T protein:vir:97 245 LIPQ--ATTYAAPTTIAGATRVDQLRLAMLQASLAEYPAS--GIVINPIDWAAIELAKD-ANNQYLI-GNA--RGTLTPT 316 (390) T ss_pred eeec--cccccccccccccchHHHHHHHHHhhccccCCCC--EEEEcHHHHHHHHHhhc-CCCceee-cCc--cCCCCce Confidence 1100 0011111112233457888888899998888754 35889999999874211 1111100 111 1334468 Q ss_pred EeccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccch-hHHHHH--HHH Q lcl|Aclame:pro 235 SYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEK-KEKTYY--IDT 311 (402) Q Consensus 235 iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~-~~~~d~--i~~ 311 (402) +.|.||+.|+.+|... -+-+||++..- + +.-.+++.+..++. ..+.++ +++ T Consensus 317 l~G~pV~~~~~~~~~~-----------------~~~gd~~~~~~-~--------~~~~~~~i~~~~~~~~f~~~~~~~r~ 370 (390) T protein:vir:97 317 LWGLPVVATQAMAPGE-----------------FLVGAFDLAAQ-I--------FDQWDARVEIGYVNDDFQRNMVTVLA 370 (390) T ss_pred ecceeeEEcCCCCCCc-----------------EEEEeccceEE-E--------EEecceEEEEeecccccccCcEEEEE Confidence 8999999999998521 12344443211 1 22334455554432 223333 445 Q ss_pred HHHhcCcccccceEEEEEEe Q lcl|Aclame:pro 312 FMAEGAIPDRWEAVSVVTTK 331 (402) Q Consensus 312 ~~a~Ga~vlRPeaa~vv~~~ 331 (402) ..-||..+++|++.+.+++- T Consensus 371 ~~r~d~~v~~~~a~v~~~~a 390 (390) T protein:vir:97 371 EERLALVVYRPEALITGSFA 390 (390) T ss_pred EEeeccEEeccccEEEEEeC Confidence 56799999999999998877 No 74 >protein:vir:10364 Length: 390 # NCBI annotation: head protein; major capsid subunit precursor # Family: family:all:585 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858956;genbank:gi:32128421;genbank:GeneID:2648357 Probab=99.30 E-value=1.2e-13 Score=91.33 Aligned_cols=280 Identities=13% Similarity=0.037 Sum_probs=160.1 Q ss_pred CCCCcccccccccccccHHHHHHHHHhHHHHHHHHHHhhhcccceeeeccccceEEeeec--cceeeeeecCCCCCCCCC Q lcl|Aclame:pro 1 
MSTPNTLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYL--GETELQVLAPGQSPNATP 78 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~alfle~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~~i--G~~t~~~~~~G~~i~~~~ 78 (402) +-......+....++++.-.+....+..+++......+.+++++++.++.+ .++.+++. +..++..+..|+.+.... T Consensus 104 ~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ii~~~~~~~~l~~~~~~~~~~~-~~~~~~~~~~~~~~a~~v~Eg~~~~~~~ 182 (390) T protein:vir:10 104 MNIKAALNTASTDAAGSAGALTTPNRLPGFITQPDARLTVRDLIGSGRTDS-ALIEYVQETGFVNNAAIVAEGALKPESS 182 (390) T ss_pred hHHHHHHHhhhcccccccccccchhHHHHHHHHHHhhchhhhhcceeeccC-CceEEEEEecCCcceeeecCCccccccc Confidence 111111111111222333345667777788888888888899988887755 46778865 335677778888887777 Q ss_pred ccccceeEeecceeeccchhhh--HHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccccccccc Q lcl|Aclame:pro 79 TQADKNQLVIDTTVIARNTVAH--IHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKGHG 156 (402) Q Consensus 79 ~~~~e~~itID~~lya~~~Idd--lDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~g~~ 156 (402) +...+.++.+.++- ....|.+ |++. .+ +.+.+.+++++++++..|+.++ .+..... .+ .|.. T Consensus 183 ~~~~~i~~~~~k~~-~~~~is~ell~d~---~~-l~~~i~~~l~~~~~~~~~~~il----~G~G~~~-----~p--~Gi~ 246 (390) T protein:vir:10 183 LKFAKKTDTTHVIA-HTMKATRQILSDA---PQ-LASYMNNRLIRGLKVKEDAEIL----RGTGAND-----GL--LGLI 246 (390) T ss_pred cceeEEEEeeEEEE-EeehhhHHHHHhH---HH-HHHHHHHHHHHHHHHHHHHHHh----hcCCCCc-----cc--cccc Confidence 77888888877643 2222332 3332 24 6788889999999999998775 2211110 01 1111 Q ss_pred cccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccccCcccccceEEEEe Q lcl|Aclame:pro 157 FSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATINGFVLSSY 236 (402) Q Consensus 157 ~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~~~~G~V~~ia 236 (402) .... .............++.+.++...|...+.+.. .+|++|..|..|.+-.. .+..|-. 
..+ ..+...+++ T Consensus 247 ~~~~--~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~--~~v~n~~~~~~L~~lkd-~~g~~l~-~~~--~~~~~~~l~ 318 (390) T protein:vir:10 247 PQAT--TYAAPTTIAGATRVDQLRLAMLQASLAEYPAS--GIVINPIDWAAIELAKD-ANNQYLI-GNA--RGTLTPTLW 318 (390) T ss_pred cccc--cccccccccccchHHHHHHHHHhhccccCCCC--EEEEcHHHHHHHHHhhc-CCCceee-cCC--cCcCCceec Confidence 1100 01111112223357788888889988887744 46899999999874211 1111110 111 123345789 Q ss_pred ccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccch-hHHHH--HHHHHH Q lcl|Aclame:pro 237 NCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEK-KEKTY--YIDTFM 313 (402) Q Consensus 237 G~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~-~~~~d--~i~~~~ 313 (402) |+||+.|+.+|.+. -+-+||+...-++. -.++..+..++. ....+ .+.+.+ T Consensus 319 G~pv~~~~~~p~~~-----------------~~~gdf~~~~~~~~---------~~~~~i~~~~~~~~~~~~~~~~r~~~ 372 (390) T protein:vir:10 319 GLPVVATQAMAPGE-----------------FLVGAFDLAAQIFD---------QWDARVEIGYVNDDFQRNMVTVLAEE 372 (390) T ss_pred ceeeEEcCCCCCCc-----------------EEEEeccceEEEEE---------ecceEEEEeecccccccCcEEEEEEE Confidence 99999999999532 13355554322221 123333433322 22223 334556 Q ss_pred HhcCcccccceEEEEEEe Q lcl|Aclame:pro 314 AEGAIPDRWEAVSVVTTK 331 (402) Q Consensus 314 a~Ga~vlRPeaa~vv~~~ 331 (402) -|+.++++|++.+.+++- T Consensus 373 r~d~~v~~~~a~~~~~~a 390 (390) T protein:vir:10 373 RLALVVYRPEALISGSFA 390 (390) T ss_pred eeccEEeccccEEEEEeC Confidence 799999999999988877 No 75 >protein:vir:96223 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239571;genbank:gi:66395304;genbank:GeneID:5132771 Probab=99.30 E-value=1e-13 Score=91.77 Aligned_cols=285 Identities=9% Similarity=0.010 Sum_probs=161.2 Q ss_pred CCCCcccccccccccccHHHHHHHHHhHHHHHHHHHHhhhcccceeeeccccceEEeeec-cceeeeeecCCCCCCCCCc Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYL-GETELQVLAPGQSPNATPT 79 (402) Q Consensus 1 
Ms~~n~~t~~~~~~~~d~~alfle~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~~i-G~~t~~~~~~G~~i~~~~~ 79 (402) +...+..+ ...+.+...+.-+.+..++++.....+.++++.++.++.+ .+++||+. +...+..+..|+.++...+ T Consensus 21 ~~~~~a~~---~~~~~~~~~lip~~~~~~ii~~~~~~s~l~~l~~~~~~~~-~~~~~p~~~~~~~a~~v~Eg~~~~~~~~ 96 (324) T protein:vir:96 21 PQVFNPDN---VMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEG-TEKKFTFWADKPGAYWVGEGQKIETSKA 96 (324) T ss_pred hhhccccc---ccccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccC-CceEEEEEecCcceeeecCCcccccccc Confidence 11111110 0001111124448899999999999999999998888765 56888876 6677888888888887777 Q ss_pred cccceeEeecceeeccchhhh--HHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccccccccccc Q lcl|Aclame:pro 80 QADKNQLVIDTTVIARNTVAH--IHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKGHGF 157 (402) Q Consensus 80 ~~~e~~itID~~lya~~~Idd--lDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~g~~~ 157 (402) ...+.++..-.+- .-..|.+ +++ +..| +.+.+.+++++++++.+|+.++.- .... ... .+. T Consensus 97 ~f~~v~~~~~k~~-~~~~is~ell~d--s~~~-l~~~i~~~l~~aia~~~d~~~l~G----~g~~-------~~~--~~~ 159 (324) T protein:vir:96 97 TWVNATMRAFKLG-VILPVTKEFLNY--TYSQ-FFEEMKPMIAEAFYKKFDEAGILN----QGNN-------PFG--KSI 159 (324) T ss_pred ceeEEEEEeEEEE-EeehhhHHHHhc--chHH-HHHHHHHHHHHHHHHHHHHHhhhc----CCCC-------CcC--ccc Confidence 7777777665432 2233322 222 3355 788999999999999999988631 1100 000 000 Q ss_pred ccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccccCcccccceEEEEec Q lcl|Aclame:pro 158 SINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATINGFVLSSYN 237 (402) Q Consensus 158 ~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~~~~G~V~~iaG 237 (402) ..... ...........++.|.++..++.+.+.... .++++|..|..|.+-.. 
.+ +...+.++...+++| T Consensus 160 ~~~~~--~~~~~~~~~~~~~~i~~~~~~i~~~~~~~~--~~i~n~~~~~~L~~lkd---~~----G~~~~~~~~~~~l~G 228 (324) T protein:vir:96 160 AQSIK--KTNKVIKGDFTQDNIIDLEALLEDDELEAN--AFISKTQNRSLLRKIVD---PE----TKERIYDRNSDSLDG 228 (324) T ss_pred ccccc--ccceecccccchHHHHHHHHhhhhccCCCC--EEEEcHHHHHHHHHhhC---CC----CCeeecCCCCCcccc Confidence 00000 011111112236778888888887766433 46899999999865211 11 122223444567899 Q ss_pred cEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccchh-------------- Q lcl|Aclame:pro 238 CPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKK-------------- 303 (402) Q Consensus 238 ~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~-------------- 303 (402) +||+.++..+...+ . =+-+||++.. +.. ..++..+..++.. T Consensus 229 ~PV~~~~~~~~~~~----~-----------~~~gd~s~~~-~~~---------~~~~~i~~~~~~~~~~~~~~~~~~~~~ 283 (324) T protein:vir:96 229 LPVVNLKSSNLKRG----E-----------LITGDFDKLI-YGI---------PQLIEYKIDETAQLSTVKNEDGTPVNL 283 (324) T ss_pred eeeEeecCCCCCcc----e-----------EEEEecceEE-EEE---------ecCcEEEEeecccccccccccccchhh Confidence 99998876653211 0 1234444421 111 1122222222210 Q ss_pred HH--HHHHHHHHHhcCcccccceEEEEEEeeccCccccccc Q lcl|Aclame:pro 304 EK--TYYIDTFMAEGAIPDRWEAVSVVTTKRDATTGDAGGP 342 (402) Q Consensus 304 ~~--~d~i~~~~a~Ga~vlRPeaa~vv~~~~~~t~~~a~~~ 342 (402) .+ .-.+++.+-+|.+++||++.+.|+.....++...+.. 
T Consensus 284 ~~~n~v~~r~~~r~d~~v~~~~a~~~l~~a~~~~~~~~~~~ 324 (324) T protein:vir:96 284 FEQDMVALRATMHVALHIADDKAFAKLVPADKRTDSVPGEV 324 (324) T ss_pred hhcCcEEEEEEEEeccEEecccceEEEecccccCCCCCCCC Confidence 01 1234555668999999999988876554444444444 No 76 >protein:vir:4511 Length: 409 # NCBI annotation: capsid # Family: family:all:21 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599037;genbank:gi:19548995;genbank:GeneID:935211 Probab=99.30 E-value=2.8e-13 Score=89.38 Aligned_cols=292 Identities=15% Similarity=0.067 Sum_probs=152.5 Q ss_pred CCCC----c----ccccccccccccHHHHHH-HHHhHHHHHHHHHHhhhcccceeeeccccceEEeeecccee--eeeec Q lcl|Aclame:pro 1 MSTP----N----TLTNVAVSASGEVDSLLI-EKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYLGETE--LQVLA 69 (402) Q Consensus 1 Ms~~----n----~~t~~~~~~~~d~~alfl-e~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~~iG~~t--~~~~~ 69 (402) |... . ...+....+++..-...| ++|.++++......+.++++.++.++.++..+.++..+... ..... T Consensus 99 ~~~~~~~~e~~~~~~~~a~~~~~~~~gg~liP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~ 178 (409) T protein:vir:45 99 GASELTSEERKALRELRAQGVAQDEKGGYTVPETFLAKVVEKMKSYGGIASVAQILTTSDGRTMEWATADGTSEVGVLLG 178 (409) T ss_pred hhhhccHHHHHHHHHHhhccCccCcCCceeccHhHHHHHHHHHHhhhhhhhhceeeecCCCceEEEEeeccCcccccccc Confidence 1000 0 000000001111112233 88999999999999999999999999888888888775432 33444 Q ss_pred CCCCCCCCCccccceeEeecce--eeccchhhhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccc Q lcl|Aclame:pro 70 PGQSPNATPTQADKNQLVIDTT--VIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAER 147 (402) Q Consensus 70 ~G~~i~~~~~~~~e~~itID~~--lya~~~IddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~ 147 (402) -|+.+....+...+.++.--.+ .+....-.-+++ +.+| +.+.+.+++++++++..|+.++. 
+.....+ T Consensus 179 E~~~~~~~~~~f~~~~l~~~k~~~~~i~is~ell~d--s~~~-l~~~i~~~la~a~~~~~~~a~l~----G~G~~~~--- 248 (409) T protein:vir:45 179 ENEEAGEEDTDFGMGSLGALKMTSKIIRVSNELLQD--SAID-MEAYLARRIAERIGRGEARYLIQ----GTGAGTP--- 248 (409) T ss_pred ccccccccccccceeeeeeeeeeeeehhhhHHHHhc--cHHH-HHHHHHHHHHHHHHHHHHHHhhc----cCCCCCc--- Confidence 5556655666666555543222 111222223333 3456 78899999999999999998752 1110000 Q ss_pred cccccccccccccccc-CCccccccHHHHHHHHHHHHHHHHhhcCCccCcE-EEeChHHHHHHh--cccchhhccccccc Q lcl|Aclame:pro 148 NKPRVKGHGFSINVNV-TESEALANPQYVMAAVEYALEQQLEQEVDISDVA-IMMPWKFFNALR--DADRIVDKTYTISQ 223 (402) Q Consensus 148 ~~~~~~g~~~~~~v~~-~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~-~VV~P~~y~~Ll--~~~r~~n~d~~~~~ 223 (402) ..+ .|+........ ....... -++.|.++...|...+. ....| ++++|..|..|. +|.. ..|- - T Consensus 249 ~~p--~Gil~~~~~~~~~~~~~~~----~~d~i~~l~~~l~~~~~-~~a~~~~~~n~~~~~~l~~lkd~~---G~~i--~ 316 (409) T protein:vir:45 249 KQP--KGLAASVTGTTQTAAANAV----KWQEILALKHSIDPAYR-RGPKFRLAFNDNTLKLISEMEDGQ---GRPL--W 316 (409) T ss_pred ccc--ceeeecccccccccccccc----chHHHHHHHHhhhhhhc-cCCeEEEEECHHHHHHHHHhhcCC---Ccee--e Confidence 000 11111100000 0111111 25667777777766553 34567 467999988874 3322 1110 0 Q ss_pred CcccccceEEEEeccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccchh Q lcl|Aclame:pro 224 SGATINGFVLSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKK 303 (402) Q Consensus 224 ~g~~~~G~V~~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~ 303 (402) .+...+|.-.+++|.||+.++++|..+.+ ...=+-+||++. +++. 
...+..+...+.- T Consensus 317 ~~~~~~~~~~~l~G~PV~~~~~~p~~~~~------------~~~i~~Gd~~~~--~i~~--------~~~~~~~~~~d~~ 374 (409) T protein:vir:45 317 LPDIVGVAPASVLNVPYVIDQEIDDIGAG------------KKFMFCGDFDRF--IIRR--------VRYMILKRLVERY 374 (409) T ss_pred ccCcCCCCCceecceeeEEecCcCCccCC------------ccEEEEeehhhh--heee--------ccceEEEEeeccc Confidence 11122344468999999999999963311 111122555442 1221 1122223222221 Q ss_pred HHHH--HHHHHHHhcCcccccceEEEEEEeeccCc Q lcl|Aclame:pro 304 EKTY--YIDTFMAEGAIPDRWEAVSVVTTKRDATT 336 (402) Q Consensus 304 ~~~d--~i~~~~a~Ga~vlRPeaa~vv~~~~~~t~ 336 (402) ..-+ .+++.+=||.++.+|++.+.++.+..+.+ T Consensus 375 ~~~~~~~~~~~~r~d~~~~~~~A~~~l~~k~s~~~ 409 (409) T protein:vir:45 375 AEYDQTGFLAFHRFDCILEDTSAIKALVGKGSVGG 409 (409) T ss_pred ccCCcEEEEEEEEeccEeechhheEEEEeccCCCC Confidence 1112 25666678999999999888776654432 No 77 >protein:vir:9309 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803287;genbank:gi:29028597;genbank:GeneID:1258044 Probab=99.29 E-value=2e-13 Score=90.24 Aligned_cols=284 Identities=10% Similarity=0.026 Sum_probs=160.5 Q ss_pred CCC--CcccccccccccccHHHHHHHHHhHHHHHHHHHHhhhcccceeeeccccceEEeeec-cceeeeeecCCCCCCCC Q lcl|Aclame:pro 1 MST--PNTLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYL-GETELQVLAPGQSPNAT 77 (402) Q Consensus 1 Ms~--~n~~t~~~~~~~~d~~alfle~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~~i-G~~t~~~~~~G~~i~~~ 77 (402) |.. +++.+-.. +...+.-+++..++.+.....++++++.++.++.+ .+++||+. |...+.-+..|+.++.. 
T Consensus 21 ~~~~~a~~~~~~~-----~~~~liP~~~~~~ii~~~~~~s~l~~l~~~~~~~~-~~~~ip~~~~~~~a~~v~Eg~~~~~~ 94 (324) T protein:vir:93 21 PQVFNPDNVMMHE-----KKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEG-TEKKFTFWADKPGAYWVGEGQKIETS 94 (324) T ss_pred hhhcccccccccC-----CCcceechhHHHHHHHHHHhhchhhhhcceeeccC-CceEEEEEecCcceeeecCCcccccc Confidence 111 11111111 11124458999999999999999999998877665 45678765 77888888889888887 Q ss_pred CccccceeEeecceeeccchhhhHHHh-hcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccccccccc Q lcl|Aclame:pro 78 PTQADKNQLVIDTTVIARNTVAHIHDV-QGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKGHG 156 (402) Q Consensus 78 ~~~~~e~~itID~~lya~~~IddlDe~-q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~g~~ 156 (402) .+..++.++..-.+ ..-..|.+ +-. ++.+| +.+.+.+++++++++.+|+.++.- ... +....+. T Consensus 95 ~~~f~~i~~~~~k~-~~~~~iS~-ell~ds~~~-l~~~i~~~l~~aia~~~d~a~l~G----~g~-------~~~~~~~- 159 (324) T protein:vir:93 95 KATWVNATMRAFKL-GVILPVTK-EFLNYTYSQ-FFEEMKPMIAEAFYKKFDEAGILN----QGN-------NPFGKSI- 159 (324) T ss_pred ccceeEEEEEeEEE-EEeehhhH-HHHhcchHH-HHHHHHHHHHHHHHHHHHHHHhcC----CCC-------CCcCccc- Confidence 78777777766442 22233322 111 23456 678999999999999999987532 110 0000010 Q ss_pred cccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccccCcccccceEEEEe Q lcl|Aclame:pro 157 FSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATINGFVLSSY 236 (402) Q Consensus 157 ~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~~~~G~V~~ia 236 (402) ..... ...........++.|.++...|...+.... .++++|..|..|.+- .+. 
.+...+.++.-.+++ T Consensus 160 -~~~~~--~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~--~~v~n~~~~~~L~~l---~d~----~G~~~~~~~~~~~l~ 227 (324) T protein:vir:93 160 -AQSIE--KTNKVIKGDFTQDNIIDLEALLEDDELEAN--AFISKTQNRSLLRKI---VDP----ETKERIYDRNSDSLD 227 (324) T ss_pred -ccccc--ccceeccccccHHHHHHHHHhhhhccCCCC--EEEEcHHHHHHHHHh---hCC----CCCeeecCCCCCccc Confidence 00000 001111112236778888888888776433 568999999998752 111 122223344456789 Q ss_pred ccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccchh------------- Q lcl|Aclame:pro 237 NCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKK------------- 303 (402) Q Consensus 237 G~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~------------- 303 (402) |+||+.+++.+...+ .=+.+||++.. +.+ ..++..+..++.. T Consensus 228 G~PVv~~~~~~~~~~---------------~i~~gdfs~~~-~~~---------~~~~~i~~~~~~~~~~~~~~~~~~~~ 282 (324) T protein:vir:93 228 GLPVVNLKSSNLKRG---------------ELITGDFDKLI-YGI---------PQLIEYKIDETAQLSTVKNEDGTPVN 282 (324) T ss_pred ceeeEeecCCCCCcc---------------eEEEEecceEE-EEE---------ecCcEEEEeecccccccccccccchh Confidence 999998876553211 01335555421 111 1222333332210 Q ss_pred ---HHHHHHHHHHHhcCcccccceEEEEEEeeccCccccccc Q lcl|Aclame:pro 304 ---EKTYYIDTFMAEGAIPDRWEAVSVVTTKRDATTGDAGGP 342 (402) Q Consensus 304 ---~~~d~i~~~~a~Ga~vlRPeaa~vv~~~~~~t~~~a~~~ 342 (402) +-.-.+++.+-||.+++||++.+.|+.....+++..+.. 
T Consensus 283 ~f~~n~~~~r~~~r~d~~v~~~~a~~~l~~a~~~~~~~~~~~ 324 (324) T protein:vir:93 283 LFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDSVPGEV 324 (324) T ss_pred hhhcCcEEEEEEEEeccEEecccceEEEecccccCCCCCCCC Confidence 011334555668999999999988864333332222222 No 78 >protein:vir:9574 Length: 300 # NCBI annotation: gp40 # Family: family:all:966 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862879;genbank:gi:32469471;genbank:GeneID:1461316 Probab=99.29 E-value=3.2e-13 Score=89.09 Aligned_cols=282 Identities=12% Similarity=0.024 Sum_probs=165.7 Q ss_pred CCCCcccccccccccccHHHHHHHHHhHHHHHHHHHHhhhcccceeeeccccceEEeee-ccceeeeeecCCCCCCCCCc Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKY-LGETELQVLAPGQSPNATPT 79 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~alfle~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~~-iG~~t~~~~~~G~~i~~~~~ 79 (402) |+...+.++. +.-.+++.++++...+.|.++.+.+++++.+| .++||+ .|...+..+.-|+.+....+ T Consensus 1 ma~~t~~~G~----------lip~~~~~~ii~~l~~~s~i~~l~~~~~~~~~-~~~~p~~~~~~~a~wv~Eg~~~~~s~~ 69 (300) T protein:vir:95 1 MSEAQLSKGN----------LFNPELVTKVINKVKGHSSIAKLSPQKPIPFN-GQREFVFDFDSDIDIVAENGKKTHGGV 69 (300) T ss_pred CcccccCCcc----------eechhhHHHHHHHHHhhhhhhhhcceeeccCC-ceEEEEEecCcceEEeeCCcccccccc Confidence 8877654321 34478999999999999999999888877765 456776 47778888888888877777 Q ss_pred cccceeEeecceeeccchhhhHHHhh-----cCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccccccc Q lcl|Aclame:pro 80 QADKNQLVIDTTVIARNTVAHIHDVQ-----GDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKG 154 (402) Q Consensus 80 ~~~e~~itID~~lya~~~IddlDe~q-----~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~g 154 (402) ..++.++..-. +..-..|. +|.+ +..| +-+++.++++++++++.|+.++.-.- +....+.... 
T Consensus 70 ~f~~v~l~~~k-~~~~~~iS--~ell~~~~d~~~~-l~~~i~~~l~~aia~~~d~~~l~G~~-------~~~g~~~~~~- 137 (300) T protein:vir:95 70 SLDPVTIVPLK-VEYGARVS--DEFLHASEEAKVD-MLTDFVEGFSKKLARGLDIMSIHGIN-------PRTKQASTII- 137 (300) T ss_pred cceeeEeeeEE-EEEeehhh--HHHhccCCCCHHH-HHHHHHHHHHHHHHHHHHHhhhhccc-------CCCCCCcccc- Confidence 77777776543 22222232 2222 1234 56788899999999999998873211 0000110000 Q ss_pred cccccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcc----cchhhcccccccCcccccc Q lcl|Aclame:pro 155 HGFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDA----DRIVDKTYTISQSGATING 230 (402) Q Consensus 155 ~~~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~----~r~~n~d~~~~~~g~~~~G 230 (402) +..............+....++.|.++..++...+.... ..+++|..+..|.+- .+.+.. ....+| T Consensus 138 -~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~--~~vmn~~~~~~L~~lkd~~G~~i~~-------~~~~~~ 207 (300) T protein:vir:95 138 -GDNCFDKKVTQTVPFKDTNPDESMEDAVGMIDGSERDIT--GAILDPIFTTALSKMKNAEGGKLYP-------ELAWGG 207 (300) T ss_pred -cccccccccceeecccccchHHHHHHHHHHhhhcCCCcc--EEEECHHHHHHHHHhhccCCCeecc-------CccccC Confidence 000000011111122334457888888888887665433 468999999998642 122211 111245 Q ss_pred eEEEEeccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccchh------H Q lcl|Aclame:pro 231 FVLSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKK------E 304 (402) Q Consensus 231 ~V~~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~------~ 304 (402) ..++++|+||+.|+++|...+. ....-+.+||++..-+... +-+.++...+.++. . 
T Consensus 208 ~~~~l~G~Pv~~s~~v~~~~~~-----------~~~~~~~GDf~~~~~~~~~-------~~~~~~v~~~~~~d~~~~~~f 269 (300) T protein:vir:95 208 VPDAINGLAVDKNRTVSYSQTD-----------PKNTAIVGDFETMFKWGYA-------KEVPMEIIKYGDPDNSGRDLK 269 (300) T ss_pred CCceecceeeEEecCCCCCCCC-----------CccEEEEeeccceEEEEEe-------cccEEEEeeccCCCCcchhhh Confidence 5678999999999999864321 1111244777654332221 11222222221211 1 Q ss_pred HH--HHHHHHHHhcCcccccceEEEEEEeec Q lcl|Aclame:pro 305 KT--YYIDTFMAEGAIPDRWEAVSVVTTKRD 333 (402) Q Consensus 305 ~~--d~i~~~~a~Ga~vlRPeaa~vv~~~~~ 333 (402) +. -.+++.+-+|.+++||++.+.|+=..| T Consensus 270 ~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~g 300 (300) T protein:vir:95 270 GYNQIYIRCEAYIGWGIMDAASFARIVKTGG 300 (300) T ss_pred hcCcEEEEEEEeecceeecccceEEEecCCC Confidence 11 223455568999999999988876666 No 79 >protein:vir:1638 Length: 298 # NCBI annotation: Structural protein # Family: family:all:966 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695059;genbank:gi:23455750;genbank:GeneID:955469 Probab=99.29 E-value=3.1e-13 Score=89.15 Aligned_cols=284 Identities=13% Similarity=0.048 Sum_probs=159.4 Q ss_pred CCCCcccccccccccccHHHHHHHHHhHHHHHHHHHHhhhcccceeeeccccceEEeee-ccceeeeeecCCCCCCCCCc Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKY-LGETELQVLAPGQSPNATPT 79 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~alfle~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~~-iG~~t~~~~~~G~~i~~~~~ 79 (402) |+... 
| .+..+++..+++...+..++++++.++.++.+|+ ++||+ .|...+..+.-|+++....+ T Consensus 1 ma~~g----------G---~lvp~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~-~~ip~~~~~~~a~~v~E~~~~~~~~~ 66 (298) T protein:vir:16 1 MVLNK----------G---TLFDPTLVTDLISKVAGKSSIARLSAQKPIPFNG-EKVFTFTMDSEIDVVAESGKKTHGGV 66 (298) T ss_pred CcccC----------c---ceechhHHHHHHHHHHhhhhhhhhcceeeccCCc-eEEEEEecCcceEEecCCcccccccc Confidence 55433 1 1444788899999999999999999988877655 56776 57788888888888877767 Q ss_pred cccceeEeecceeeccchhhhHHHhh-----cCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccccccc Q lcl|Aclame:pro 80 QADKNQLVIDTTVIARNTVAHIHDVQ-----GDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKG 154 (402) Q Consensus 80 ~~~e~~itID~~lya~~~IddlDe~q-----~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~g 154 (402) ...+.++..- +++.... ==+|.+ +..+ +-.++.+++++++++.+|+.++.-.-. ....+....+ T Consensus 67 ~f~~v~l~~~--k~a~~~~-iS~ell~~s~d~~~~-l~~~i~~~la~ai~~~~d~~~l~G~~~-------~~g~~~~~~~ 135 (298) T protein:vir:16 67 TLAPQTMVPI--KVEYGAR-ISDEFMYASDEEKIN-ILQEFNDGFAKKVARGIDLMAFHGVNP-------RLGTASAVIG 135 (298) T ss_pred ceeEEEEeee--eEEEeeh-hhHHHhhcCcccHHH-HHHHHHHHHHHHHHHHHHHHhhccccC-------CCCccccccc Confidence 6666655543 3333221 112332 2234 567888999999999999988632110 0111111100 Q ss_pred cccccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccccCcccccceEEE Q lcl|Aclame:pro 155 HGFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATINGFVLS 234 (402) Q Consensus 155 ~~~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~~~~G~V~~ 234 (402) ....................+++.|.++..++...+.+.. ..+++|..+..|.+-..- |..|-- . 
....+|..++ T Consensus 136 ~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~--~~vmn~~~~~~l~~lkd~-~G~~i~-~-~~~~~~~~~~ 210 (298) T protein:vir:16 136 TNHFDSKVTQKVEAPRGIADPNGAIENAVELLTGVDADVT--GIAINPSFRSALAKQKDL-QDNALF-P-ELKWGATPDT 210 (298) T ss_pred ccccccccccccccccccccHHHHHHHHHHHhhhcCCCcc--EEEEcHHHHHHHHHhhcc-CCCeee-c-CcccCCCCce Confidence 0000000000111112223457788888888888777644 368899999988652111 111110 1 1123455578 Q ss_pred EeccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccchh-----HH-H-- Q lcl|Aclame:pro 235 SYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKK-----EK-T-- 306 (402) Q Consensus 235 iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~-----~~-~-- 306 (402) ++|.||+.++++|..... +...-+.+||++...+..... .+++..+ +.++. .| . T Consensus 211 l~G~PV~~~~~v~~~~~~-----------~~~~~~~GDfs~~~~~~~~~~--~~~~~~~-----~~~~~~~~~~~f~~~~ 272 (298) T protein:vir:16 211 INGLPVDVNKTVSDMSLT-----------QRDRAIIGDFANGFKWGYAKE--VPLEVIQ-----YGDPDNSGLDLKGYNQ 272 (298) T ss_pred ecceeeEEecccccccCC-----------CccEEEEeeccceEEEEEecC--ceEEEee-----ccCCcCcchhhhhcCc Confidence 999999999999964321 111224577766443322211 1111111 11111 11 1 Q ss_pred HHHHHHHHhcCcccccceEEEEEEeecc Q lcl|Aclame:pro 307 YYIDTFMAEGAIPDRWEAVSVVTTKRDA 334 (402) Q Consensus 307 d~i~~~~a~Ga~vlRPeaa~vv~~~~~~ 334 (402) -.+++..-+|.+++||++.+.| +..+ T Consensus 273 v~~ra~~r~d~~v~~~~a~~~l--~~at 298 (298) T protein:vir:16 273 VYIRAELFLGWGILDATKFARV--TEAN 298 (298) T ss_pred EEEEEEEEEccEeecccceEEE--eecC Confidence 2245556699999999987665 3322 No 80 >protein:vir:1328 Length: 392 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047927;swissprot:trembl:q9zwv6;genbank:gi:9631145;uniprot:Q9ZWV6;genbank:GeneID:2715889 Probab=99.28 E-value=1.4e-13 Score=91.07 Aligned_cols=282 Identities=9% Similarity=-0.023 Sum_probs=156.4 Q ss_pred CCCCcccccccccccccHHHHHH-HHHhHHHHHHHHHHhhhcccceeeeccccceEEeeec-cceeeeeecCCCCCCCCC Q lcl|Aclame:pro 1 
MSTPNTLTNVAVSASGEVDSLLI-EKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYL-GETELQVLAPGQSPNATP 78 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~alfl-e~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~~i-G~~t~~~~~~G~~i~~~~ 78 (402) +......+..-..+.+ .+.. +++...+.....+.++++.+.++.+..+++.+.||+. |..++..+.-|+.+.... T Consensus 104 ~~~~~~~~~~t~~~~g---~~~~~~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~ 180 (392) T protein:vir:13 104 FEFAPEKRDGTKAGNP---NVLSRTLYGQLIAQAVERSAIMRGGASTFTTSDANPMDFTVITGRATAGIVGETAEIPESY 180 (392) T ss_pred HHhhhhhhcccccCCC---ccccccchHHHHHHHHhhhhhhhhcceeeecCCCceeEEEEEcCCcceeeecccccccccc Confidence 1111100000011111 1233 5566667777788889999988888878888888865 667787888888887777 Q ss_pred ccccceeEeecceeeccchhhhHHHh-hcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccccccccccc Q lcl|Aclame:pro 79 TQADKNQLVIDTTVIARNTVAHIHDV-QGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKGHGF 157 (402) Q Consensus 79 ~~~~e~~itID~~lya~~~IddlDe~-q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~g~~~ 157 (402) +..++.++..-. ++....-.-+-. ++.+| +.+.+.+++++++++..|+.++. +.....| .|... T Consensus 181 ~~f~~v~~~~~k--~~~~~~iS~ell~ds~~~-l~~~i~~~l~~~i~~~~d~~~l~----G~Gt~~p--------~Gil~ 245 (392) T protein:vir:13 181 PATTQRSMGGFK--YGFASVVSYEFATDQVLD-LVGFLVSDAGPAIGDAMGRHFLT----GTGTGQP--------RGILT 245 (392) T ss_pred cceeeEEeeeee--EEeeehhHHHHHhcchHH-HHHHHHHHHHHHHHHHHHHHHhc----ccCCccc--------ccccc Confidence 777777776644 333322121111 24555 77889999999999999998762 1111111 01111 Q ss_pred ccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhc--ccchhhcccccccCcccccceEEEE Q lcl|Aclame:pro 158 SINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRD--ADRIVDKTYTISQSGATINGFVLSS 235 (402) Q Consensus 158 ~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~--~~r~~n~d~~~~~~g~~~~G~V~~i 235 (402) .........+........|+.|.++...|+..... ...| |++|..|..|.+ |.. ..|-- . 
+....|.-.++ T Consensus 246 ~~~~~~~~~~~~~~~~~~~d~l~~~~~~l~~~~~~-~a~~-v~n~~~~~~l~~lkd~~---G~~l~-~-~~~~~g~~~~l 318 (392) T protein:vir:13 246 DATGANAAFGEADADSKVSDALIDLFHEVPSAYRK-NAKF-VVNDLRAAQMRKLKDAN---GQYLW-Q-SALTVGAPDTF 318 (392) T ss_pred ccccccccccccccccccHHHHHHHHHhhhhhhhc-CCEE-EEcHHHHHHHHHhhccC---Cceee-c-CCcCCCCCcee Confidence 11100000011111122377777877777665433 3445 789999998864 211 11100 1 11223445689 Q ss_pred eccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccchhHH--HHHHHHHH Q lcl|Aclame:pro 236 YNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKEK--TYYIDTFM 313 (402) Q Consensus 236 aG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~~~--~d~i~~~~ 313 (402) +|.||+.|+++|... . +-+||+.. +++.+ .++..+...+.... ...+++.+ T Consensus 319 ~G~Pv~~~~~~~~~~------i-----------~~Gdf~~~--~i~~~--------~~~~i~~~~~~~~~~~~~~~r~~~ 371 (392) T protein:vir:13 319 NGKVVETDDGMPADK------V-----------LFADLSKY--RVRFA--------GSLRVDRSVDAKFSTDQIVYRFLQ 371 (392) T ss_pred cceeeEEcCCCCCCc------E-----------EEeeccce--eEEee--------cceEEEeeccccccCCcEEEEEEE Confidence 999999999998521 1 22455432 12111 12233333222211 13446677 Q ss_pred HhcCcccccceEEEEEEeecc Q lcl|Aclame:pro 314 AEGAIPDRWEAVSVVTTKRDA 334 (402) Q Consensus 314 a~Ga~vlRPeaa~vv~~~~~~ 334 (402) =+|.++.+|+|..+++.+..+ T Consensus 372 r~d~~~~~~~A~~~~~~~~aa 392 (392) T protein:vir:13 372 RADGLLVDARGAKVLTVTPAA 392 (392) T ss_pred EeccEEecccceEEEEeeccC Confidence 789999999998888776555 No 81 >protein:vir:8187 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817980;genbank:gi:29566414;genbank:GeneID:2700968 Probab=99.26 E-value=3.2e-13 Score=89.11 Aligned_cols=293 Identities=11% Similarity=-0.024 Sum_probs=158.7 Q ss_pred CCCCcccccccccccccHHHHHH-HHHhHHHHHHHHHHhhhcccceeeeccccceEEeeec-cceeeeeecCCCCCCCCC Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDSLLI-EKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYL-GETELQVLAPGQSPNATP 78 (402) Q Consensus 1 
Ms~~n~~t~~~~~~~~d~~alfl-e~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~~i-G~~t~~~~~~G~~i~~~~ 78 (402) |+...+. ..++ ++|..++++..+..|+++.+.++.++.+| .+++|+. |...+..+.-|+.+.... T Consensus 1 mat~~~g------------g~lvP~~~~~~ii~~~~~~s~i~~~~~~i~~~~~-~~~~p~~~~~~~a~wv~Eg~~~~~~~ 67 (311) T protein:vir:81 1 MVALATG------------TFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFG-EQQYMTLTAPPRGEVVGEGAQKSEST 67 (311) T ss_pred CceecCC------------ceEcchhHHHHHHHHHHhcchhhhhcceeecCCC-ceEEEEEeCCceeEEeecCccccccc Confidence 6665431 1233 88999999999999999999988877665 4778875 788888888898888777 Q ss_pred ccccceeEeecceeeccchhhhHHHhh-----cCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccccccc Q lcl|Aclame:pro 79 TQADKNQLVIDTTVIARNTVAHIHDVQ-----GDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVK 153 (402) Q Consensus 79 ~~~~e~~itID~~lya~~~IddlDe~q-----~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~ 153 (402) +..++.+|..-.+ ..-..|. +|+. +..+ +.+.+.++++++|++.+|+.++.---.+ .+.... T Consensus 68 ~~f~~v~l~~~kl-~~~~~iS--~ell~~~~d~~~~-l~~~i~~~la~ai~~~~d~a~l~G~~~~---------~~~~~~ 134 (311) T protein:vir:81 68 ATFAPVTAIPRKV-QVTQRFS--QEVKWADESRQLG-VLQTMADLSGVALGRALDLIGIHGINPL---------TGAALS 134 (311) T ss_pred ceeeEEEEeeEEE-EEeehhh--HHHhhcCcccHHH-HHHHHHHHHHHHHHHHHHHhhhccccCC---------CCcccc Confidence 7777777766443 2222222 2322 2233 5678889999999999999886332100 000000 Q ss_pred ccccccccccC-CccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccccCcccccceE Q lcl|Aclame:pro 154 GHGFSINVNVT-ESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATINGFV 232 (402) Q Consensus 154 g~~~~~~v~~~-~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~~~~G~V 232 (402) +......-... ......+....++.|..+..++...+... ...+++|..|..|.+-.. .|..|-- ......+.. 
T Consensus 135 gi~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~--~~~vmn~~~~~~l~~lkd-~~G~~l~--~~~~~~~~~ 209 (311) T protein:vir:81 135 GSPAKILDTTNIVELTTGTSATPDLAVEAAVGLVLGDNLSP--DGVALDNTFSFMLATQRD-SQGRKLY--PELGFGTDV 209 (311) T ss_pred cccccccccceeeeecccccchHHHHHHHHHHHhhhcCCCc--eEEEEcHHHHHHHHhhhc-cCCCeee--cCccccCCC Confidence 11111000000 00111122233444555555555444433 346999999999964110 0111100 011123556 Q ss_pred EEEeccEEEecCccccccCcccccccccc-CCccccceeeeccceeEEeecHHHhhhhhhcccceeeccchh------HH Q lcl|Aclame:pro 233 LSSYNCPVIPSNRFPTFAQDQAHHLLSNE-DNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKK------EK 305 (402) Q Consensus 233 ~~iaG~~V~~SNnlP~~~~~~t~~~ls~a-~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~------~~ 305 (402) +++.|.||+.++++|.............. +.+...=+-+||++..- .. ..++..+..++.. .| T Consensus 210 ~tl~G~Pv~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~gDfs~~~i-~~---------~~~~~~~~~~~~~~~~~~~~~ 279 (311) T protein:vir:81 210 ASFAGLNAAVSDTVRGGPEAVTASTGVYRTTNPNVKAIAGDFSAFRW-GV---------QVSIPLELIEFGDPDGLGDLK 279 (311) T ss_pred ceecceeEEecccccccccccccccchhcccCCccEEEEEecccEEE-EE---------eccceEEEeccCCCCcchhhh Confidence 78999999999999964432221111111 11111224566665221 11 1222333332211 11 Q ss_pred H---HHHHHHHHhcCcccccceEEEEEEeeccCc Q lcl|Aclame:pro 306 T---YYIDTFMAEGAIPDRWEAVSVVTTKRDATT 336 (402) Q Consensus 306 ~---d~i~~~~a~Ga~vlRPeaa~vv~~~~~~t~ 336 (402) . 
-.+++.+-+|.++++|++.+.| +..+.+ T Consensus 280 ~~~~v~~r~~~r~d~~v~~~~a~~~l--~~a~~~ 311 (311) T protein:vir:81 280 RQNQIAIRAEVVYGIGIMSTDAFAVV--RDADES 311 (311) T ss_pred hcCcEEEEEEEEeccEeecccceEEE--EeeccC Confidence 1 1334456799999999987665 333322 No 82 >protein:vir:103955 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873992;genbank:gi:118430767;genbank:GeneID:4525449 Probab=99.25 E-value=2.9e-13 Score=89.30 Aligned_cols=287 Identities=10% Similarity=0.039 Sum_probs=157.8 Q ss_pred CCCCcccccccccccccHHHHHHHHHhHHHHHHHHHHhhhcccceeeeccccceEEeeec-cceeeeeecCCCCCCCCCc Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYL-GETELQVLAPGQSPNATPT 79 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~alfle~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~~i-G~~t~~~~~~G~~i~~~~~ 79 (402) |-........+.....+...+.-+.|..++++.....+.++++.++.++.+ .+++||+. +...+..+.-|+.++...+ T Consensus 18 ~~~~~~~~a~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~-~~~~~p~~~~~~~a~~v~Eg~~~~~~~~ 96 (324) T protein:vir:10 18 NVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEG-TEKKFTFWADKPGAYWVGEGQKIETSKA 96 (324) T ss_pred hhccceecccceeccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccC-CceEEEEEeCCcceeEeccCcccccccc Confidence 222211111111111111124448899999999999999999998887765 45788876 6777888888888887777 Q ss_pred cccceeEeecceeeccc-hhhh--HHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccccccccc Q lcl|Aclame:pro 80 QADKNQLVIDTTVIARN-TVAH--IHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKGHG 156 (402) Q Consensus 80 ~~~e~~itID~~lya~~-~Idd--lDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~g~~ 156 (402) ...+.++..-. +... .|.+ +++ +..| +.+.+.+++++++++.+|+.++.-- .. +....+.. 
T Consensus 97 ~~~~v~~~~~k--~~~~~~iS~ell~d--s~~~-l~~~i~~~l~~ai~~~~d~a~l~G~----g~-------~~~~~~i~ 160 (324) T protein:vir:10 97 TWVNATMRAFK--LGVILPVTKEFLNY--TYSQ-FFEEMKPMIAEAFYKKFDEAGILNQ----GN-------NPFGKSIA 160 (324) T ss_pred ceeEEEEeeEE--EEEeehhhHHHHhc--chHH-HHHHHHHHHHHHHHHHHHHHhhhcC----CC-------CccCcccc Confidence 77777776544 3322 2322 222 3355 7889999999999999999886311 10 00000110 Q ss_pred cccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccccCcccccceEEEEe Q lcl|Aclame:pro 157 FSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATINGFVLSSY 236 (402) Q Consensus 157 ~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~~~~G~V~~ia 236 (402) . .... .+.......-++.|.++...|...+.... .++++|..|..|.+- .+.+ +...+..+.-.++. T Consensus 161 ~--~~~~--~~~~~~~~~t~~~i~~~~~~l~~~~~~~~--~~v~n~~~~~~L~~l---~d~~----g~~~~~~~~~~~l~ 227 (324) T protein:vir:10 161 Q--SIEK--TNKVIKGDFTQDNIIDLEALLEDDELEAN--AFISKTQNRSLLRKI---VDPE----TKERIYDRNSDTLD 227 (324) T ss_pred c--cccc--cceeccccCCHHHHHHHHHhhhhccCCCC--EEEEcHHHHHHHHHh---hccC----CceeecCCCCcccc Confidence 0 0100 01111111236778888888887765433 468999999988652 1111 12222233345789 Q ss_pred ccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccch-------------- Q lcl|Aclame:pro 237 NCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEK-------------- 302 (402) Q Consensus 237 G~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~-------------- 302 (402) |+||+.++..+.... . =+-+||++.. +.. ..++..+..++. 
T Consensus 228 G~PV~~~~~~~~~~~----~-----------~~~gd~~~~~-~~~---------~~~~~i~~~~~~~~~~~~~~~~~~~~ 282 (324) T protein:vir:10 228 GLPVVNLKSSNLKRG----E-----------LITGDFDKLI-YGI---------PQLIEYKIDETAQLSTVKNEDGTPVN 282 (324) T ss_pred ceeEEeecCCCCCcc----e-----------EEEEecccEE-EEE---------ecCcEEEEeecccccccccccccchh Confidence 999998876553211 0 1234554421 111 112222322221 Q ss_pred --hHHHHHHHHHHHhcCcccccceEEEEEEeeccCccccccchhhH Q lcl|Aclame:pro 303 --KEKTYYIDTFMAEGAIPDRWEAVSVVTTKRDATTGDAGGPGDDH 346 (402) Q Consensus 303 --~~~~d~i~~~~a~Ga~vlRPeaa~vv~~~~~~t~~~a~~~~~~~ 346 (402) .+-...+++.+-||.++++|++.+.|+-....++...++ | T Consensus 283 ~~~~~~~~~r~~~r~d~~v~~~~A~~~l~~a~~~~~~~~~~----~ 324 (324) T protein:vir:10 283 LFEQDMVALRATMHVALHIADDKAFAKLVPADKKTDSVPGE----V 324 (324) T ss_pred hhhcCcEEEEEEEEEccEEecccceEEEEeccCCCCCCCCC----C Confidence 011123345566899999999988875543333222222 2 No 83 >protein:vir:6242 Length: 390 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813696;swissprot:trembl:q859c1;genbank:gi:29366756;interpro:IPR006444;uniprot:Q859C1;genbank:GeneID:1258897 Probab=99.25 E-value=4.7e-13 Score=88.17 Aligned_cols=283 Identities=10% Similarity=-0.016 Sum_probs=154.4 Q ss_pred CCCCcc------ccccc-ccc-cccHHHHHHH-HHhHHHHHHHHHHhhhcccceeeeccccceEEeeec-cceeeeeecC Q lcl|Aclame:pro 1 MSTPNT------LTNVA-VSA-SGEVDSLLIE-KFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYL-GETELQVLAP 70 (402) Q Consensus 1 Ms~~n~------~t~~~-~~~-~~d~~alfle-~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~~i-G~~t~~~~~~ 70 (402) |-..+. ...+. ..+ ....-.+.++ ++...+.......++++.+.++.+..+++.++||+. 
|...+..+.- T Consensus 93 ~r~~~~~~~r~~~~~~~~~~~t~~~~g~~~~~~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~p~~~~~~~a~wv~E 172 (390) T protein:vir:62 93 LRAGNLGEARSFEFAPEKRDGTKAGNPNVLSRTLYGQLIAQAVERSAIMRGGATTFTTSDANPLDFTVITGRSSASIVGE 172 (390) T ss_pred HhhhhhhhhHHHHhhhhhhcccccCCCccccccchHHHHHHHHhhhhhhhhcceeeecCCCceeEEEEEcCCcceeeecc Confidence 000000 00000 000 1111224454 444445555666788888988888778888889866 7678888888 Q ss_pred CCCCCCCCccccceeEeeccee-eccchhhhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccc Q lcl|Aclame:pro 71 GQSPNATPTQADKNQLVIDTTV-IARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNK 149 (402) Q Consensus 71 G~~i~~~~~~~~e~~itID~~l-ya~~~IddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~ 149 (402) |+.++...+...+.++.+-.+- +..+.-.-|++ +.+| +.+.+.+++++++++..|+.++. +.. .+ T Consensus 173 ~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~d--s~~~-l~~~i~~~l~~~i~~~~d~~~l~----G~G-------~p 238 (390) T protein:vir:62 173 TAEIPESYPATAQRSMGGFKYGFASVVSYEFATD--QVLD-LVGFLVSDAGPAIGDAMGRHFIT----GTG-------QP 238 (390) T ss_pred cccccccccceeeeEeeeeeEEeehHHHHHHHhh--hhHH-HHHHHHHHHHHHHHHHHHhhhhc----cCC-------cc Confidence 8888887888888888776543 11112222222 4455 77889999999999999998762 110 00 Q ss_pred ccccccccccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHh--cccchhhcccccccCccc Q lcl|Aclame:pro 150 PRVKGHGFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALR--DADRIVDKTYTISQSGAT 227 (402) Q Consensus 150 ~~~~g~~~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll--~~~r~~n~d~~~~~~g~~ 227 (402) .|....................-++.|.++...|+..+.. .-..|++|..|..|. +|.. ..|- .. +.. 
T Consensus 239 ---~Gi~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~l~~~~~~--~a~~vmn~~~~~~L~~lkd~~---g~~l-~~-~~~ 308 (390) T protein:vir:62 239 ---RGILTDASPATATFLATDTDSKVSDALIDLFHEVPSAYRA--NAKYVVNDLRAAQMRKLKDAN---GQYL-WQ-SGL 308 (390) T ss_pred ---ccccccccccccceecccccccchHHHHHHHHhhhhhhhc--CCEEEEchHHHHHHHHhhccC---CCee-ec-CCc Confidence 0110100000000011111112366677777778766543 224588999999984 3321 1121 01 112 Q ss_pred ccceEEEEeccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccchhHHH- Q lcl|Aclame:pro 228 INGFVLSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKEKT- 306 (402) Q Consensus 228 ~~G~V~~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~~~~- 306 (402) .+|.-..+.|.||+.++++|... . +-+||+.. ++..+ .++..+...+....- T Consensus 309 ~~g~~~~l~G~Pv~~~~~~p~~~------i-----------~~gd~s~~--~i~~~--------~~~~v~~~~~~~~~~~ 361 (390) T protein:vir:62 309 TVGAPSLFNGKVVETDDGMPADK------I-----------LFADLSKY--RVRFA--------GSLRVDRSVDAKFSTD 361 (390) T ss_pred CCCccceecccceEEecCCCCcc------E-----------EEeeccce--eEEee--------cceEEEeeccccccCC Confidence 34555689999999999998521 0 12455432 12211 122223222222111 Q ss_pred -HHHHHHHHhcCcccccceEEEEEEeecc Q lcl|Aclame:pro 307 -YYIDTFMAEGAIPDRWEAVSVVTTKRDA 334 (402) Q Consensus 307 -d~i~~~~a~Ga~vlRPeaa~vv~~~~~~ 334 (402) ..+++.+=+|.++++|+|..+|+.+..+ T Consensus 362 ~~~~~~~~r~d~~~~~~~A~~~l~~~~~a 390 (390) T protein:vir:62 362 QIVYRFLQRADGLLVDARGAKVLTVTPGA 390 (390) T ss_pred cEEEEEEEEeCcEeechhheEEEEeecCC Confidence 2235666789999999999888876655 No 84 >protein:vir:99749 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004307;genbank:gi:122891761;genbank:GeneID:4712304 Probab=99.24 E-value=3.9e-13 Score=88.63 Aligned_cols=288 Identities=9% Similarity=0.031 Sum_probs=160.3 Q ss_pred CCCCcccccccccccccHHHHHHHHHhHHHHHHHHHHhhhcccceeeeccccceEEeeec-cceeeeeecCCCCCCCCCc Q lcl|Aclame:pro 1 
MSTPNTLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYL-GETELQVLAPGQSPNATPT 79 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~alfle~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~~i-G~~t~~~~~~G~~i~~~~~ 79 (402) |-........+...+.+...+.-+.|..++++.....+.++++.++.++.+ .+++||+. |...+..+.-|+.++...+ T Consensus 18 ~~~~~~~~a~~~~~~~~~~~lip~~~~~~ii~~~~~~s~l~~~~~~~~~~~-~~~~~p~~~~~~~a~~v~Eg~~~~~~~~ 96 (324) T protein:vir:99 18 NVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMRLGKYEPMEG-TEKKFTFWADKPGAYWVGEGQKIETSKA 96 (324) T ss_pred hhhhhhccccceeccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccC-CceEEEEEecCcceeEeccCcccccccc Confidence 111111111111101111124448899999999999999999998888665 46788876 6677888888888888778 Q ss_pred cccceeEeecceeeccchhhh--HHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccccccccccc Q lcl|Aclame:pro 80 QADKNQLVIDTTVIARNTVAH--IHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKGHGF 157 (402) Q Consensus 80 ~~~e~~itID~~lya~~~Idd--lDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~g~~~ 157 (402) ...+.++..-.+- .-..|.+ +++ +..| +.+.+.+++++++++.+|+.++. +... +... .+. T Consensus 97 ~~~~v~~~~~k~~-~~~~iS~ell~d--s~~~-l~~~i~~~l~~ai~~~~d~~~l~----G~g~-------~~~~--~~~ 159 (324) T protein:vir:99 97 TWVNATMRAFKLG-VILPVTKEFLNY--TYSQ-FFEEMKPMIAEAFYKKFDEAGIL----NQGN-------NPFG--KSI 159 (324) T ss_pred ceeEEEEeeEEEE-EeehhhHHHHhc--chHH-HHHHHHHHHHHHHHHHHHHHhhh----cCCC-------CccC--ccc Confidence 7777777765432 2222322 332 2355 67899999999999999998862 1110 0000 010 Q ss_pred ccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccccCcccccceEEEEec Q lcl|Aclame:pro 158 SINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATINGFVLSSYN 237 (402) Q Consensus 158 ~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~~~~G~V~~iaG 237 (402) ..... ..+........++.|.++...|...+.... 
.++++|..|..|.+- .+.+ ++..+..+.-.+++| T Consensus 160 ~~~~~--~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~--~~v~n~~~~~~L~~l---~d~~----g~~~~~~~~~~~l~G 228 (324) T protein:vir:99 160 AQSIE--KTNKVIKGDFTQDNIIDLEALLEDDELEAN--AFISKTQNRSLLRKI---VDPE----TKERIYDRNSDTLDG 228 (324) T ss_pred ccccc--ccceeccccCCHHHHHHHHHhhhhccCCCC--EEEEcHHHHHHHHHh---hcCC----CceeecCCCCccccc Confidence 11111 111111112236778888888887765433 468999999988642 1111 122222333457899 Q ss_pred cEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccchh-------------- Q lcl|Aclame:pro 238 CPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKK-------------- 303 (402) Q Consensus 238 ~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~-------------- 303 (402) .||+.++..+.... .=+-+||++.. +.. ..+++.+..++.. T Consensus 229 ~PVv~~~~~~~~~~---------------~~i~gd~~~~~-~~~---------~~~~~i~~~~~~~~~~~~~~~~~~~~~ 283 (324) T protein:vir:99 229 LPVVNLKSSNLKRG---------------ELITGDFDKLI-YGI---------PQLIEYKIDETAQLSTVKNEDGTPVNL 283 (324) T ss_pred eeEEeecCCCCCcc---------------eEEEEecccEE-EEE---------ecCcEEEEeecccccccccccccchhh Confidence 99999887663221 01334554421 111 1222233222210 Q ss_pred --HHHHHHHHHHHhcCcccccceEEEEEEeeccCccccccc Q lcl|Aclame:pro 304 --EKTYYIDTFMAEGAIPDRWEAVSVVTTKRDATTGDAGGP 342 (402) Q Consensus 304 --~~~d~i~~~~a~Ga~vlRPeaa~vv~~~~~~t~~~a~~~ 342 (402) +-.-.+++.+-||.+++||++.+.|+.....++...++. 
T Consensus 284 f~~~~~~~r~~~r~d~~v~~~~a~~~lt~a~~~~~~~~~~~ 324 (324) T protein:vir:99 284 FEQDMVALRATMHVALHIADDKAFAKLVPADKKTDSVPGEV 324 (324) T ss_pred hhcCcEEEEEEEEEccEEecccceEEEEeccCCCCCCCCCC Confidence 111234455668999999999888765433333322222 No 85 >protein:vir:81070 Length: 390 # NCBI annotation: p09 # Family: family:all:585 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285679;genbank:gi:148727187;genbank:GeneID:5247115 Probab=99.23 E-value=4.4e-13 Score=88.31 Aligned_cols=280 Identities=13% Similarity=0.065 Sum_probs=160.6 Q ss_pred CCCCcccccccccccccHHHHHHHHHhHHHHHHHHHHhhhcccceeeeccccceEEeeec-c-ceeeeeecCCCCCCCCC Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYL-G-ETELQVLAPGQSPNATP 78 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~alfle~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~~i-G-~~t~~~~~~G~~i~~~~ 78 (402) +-......+.....+++.-.+..++|..+++......+.+++++++.++.+ .++++++. + ..++..+..|+.+.... T Consensus 104 ~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ii~~~~~~~~l~~~~~~~~~~~-~~~~~~~~~~~~~~a~~v~Eg~~~~~~~ 182 (390) T protein:vir:81 104 MNIKAALNTASTDAAGSAGALTTPNRLPGFITPPDARLTVRDLIGSGRTDS-ALIEYVQETGFVNNAAIVAEGALKPESS 182 (390) T ss_pred hHHHHHHHhhccccccCCcceechhhhHHHHHHHhhhhhhhhhcceeeccC-CceEEEEEecCCcceeeecCCccccccc Confidence 000000000011122333345567888889999999999999988777654 56778875 3 34677788888887777 Q ss_pred ccccceeEeecceeeccchhh--hHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccccccccc Q lcl|Aclame:pro 79 TQADKNQLVIDTTVIARNTVA--HIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKGHG 156 (402) Q Consensus 79 ~~~~e~~itID~~lya~~~Id--dlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~g~~ 156 (402) +...+.++.+..+-. -..|. -+++. .+ +.+.+.++++.++++..|++++. +..... .+ .|.. 
T Consensus 183 ~~~~~i~~~~~k~~~-~~~is~ell~d~---~~-~~~~i~~~l~~~~~~~~d~a~l~----G~g~~~-----~~--~Gi~ 246 (390) T protein:vir:81 183 LKFAKKTDTTHVIAH-TMKATRQILSDA---PQ-LASYMNNRLIRGLKVKEDAEILR----GTGAND-----GL--LGLI 246 (390) T ss_pred ceeeEEEEeeeEEEE-eehhhHHHHHhH---HH-HHHHHHHHHHHHHHHHHHHHHHh----cCCCCC-----cc--ccee Confidence 777887777775431 12222 23332 24 67888899999999999987752 111110 00 1111 Q ss_pred cccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccccCcccccceEEEEe Q lcl|Aclame:pro 157 FSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATINGFVLSSY 236 (402) Q Consensus 157 ~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~~~~G~V~~ia 236 (402) .... .............|+.|.++..++...+.+.. .+|++|..|..|.+-.. .+..|-- .+. ..+.-.+++ T Consensus 247 ~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~v~~~~~~~~l~~lkd-~~G~~l~-~~~--~~~~~~~l~ 318 (390) T protein:vir:81 247 PQAT--TYAAPTTIAGATRVDQLRLAMLQASLAEYNPS--GIVINPIDWAAIELAKD-ANNQYLI-GNA--RGTLTPTLW 318 (390) T ss_pred eccc--ccccccccccchhHHHHHHHHHhhccccCCCC--EEEEcHHHHHHHHHhhc-CCCceee-cCc--ccccCceec Confidence 1000 00111112223357788888889988877644 35889999998864211 1111100 111 123345889 Q ss_pred ccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccchhH-HHH--HHHHHH Q lcl|Aclame:pro 237 NCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKE-KTY--YIDTFM 313 (402) Q Consensus 237 G~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~~-~~d--~i~~~~ 313 (402) |+||+.|+++|.+. -+-+||++..- ++ .-.+++.+..+.... 
..+ .+++.+ T Consensus 319 G~pv~~~~~~p~~~-----------------~~~gd~~~~~~-~~--------~~~~~~v~~~~~~~~~~~~~v~~r~~~ 372 (390) T protein:vir:81 319 GLPVVATQAMAPGE-----------------FLVGAFDLAAQ-IF--------DQWDARVEIGYVGEDFQRNMITVLAEE 372 (390) T ss_pred ceeeEEcCCCCCCc-----------------EEEEehhceEE-EE--------EecceEEEEecccchhhcCcEEEEEEE Confidence 99999999999532 13355544221 22 122444454433222 223 345677 Q ss_pred HhcCcccccceEEEEEEe Q lcl|Aclame:pro 314 AEGAIPDRWEAVSVVTTK 331 (402) Q Consensus 314 a~Ga~vlRPeaa~vv~~~ 331 (402) -|+.++++|++.+.+++- T Consensus 373 r~d~~v~~~~a~v~~t~a 390 (390) T protein:vir:81 373 RLALVVYRPEALISGSFA 390 (390) T ss_pred eeccEEecccceEEEEeC Confidence 789999999999888776 No 86 >protein:vir:9759 Length: 303 # NCBI annotation: putative structural protein # Family: family:all:966 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795521;genbank:gi:28876283;genbank:GeneID:1257824 Probab=99.22 E-value=5.4e-13 Score=87.82 Aligned_cols=288 Identities=13% Similarity=0.037 Sum_probs=158.7 Q ss_pred CCCCcccccccccccccHHHHHHHHHhHHHHHHHHHHhhhcccceeeeccccceEEeee-ccceeeeeecCCCCCCCCCc Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKY-LGETELQVLAPGQSPNATPT 79 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~alfle~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~~-iG~~t~~~~~~G~~i~~~~~ 79 (402) |++..+. 
| -+.=++|+.++++..+..|.++.+.++.++.+| +++||+ .+...+..+.-|+.+....+ T Consensus 1 m~t~t~g--------g---~liP~~~~~~ii~~l~~~s~i~~l~~~~~~~~~-~~~ip~~~~~~~a~wv~E~~~~~~s~~ 68 (303) T protein:vir:97 1 MGTETSK--------A---SLFDKHLVSDLINKVKGHSSLAKLSSQKPIPFN-GSKEFTFTLDSDIDVVAENGKKTHGGL 68 (303) T ss_pred CcccCCC--------C---eEcchhHHHHHHHHHHhhchhhhhcceeecCCC-ceEEEEEecCcceEEeecCcccccccc Confidence 6644310 1 122288899999999999999999988887654 567776 47778888888888877777 Q ss_pred cccceeEeecceeeccchhhhHHHhh-----cCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccccccc Q lcl|Aclame:pro 80 QADKNQLVIDTTVIARNTVAHIHDVQ-----GDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKG 154 (402) Q Consensus 80 ~~~e~~itID~~lya~~~IddlDe~q-----~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~g 154 (402) ..++.++..-.+ .....|. +|+. ...+ +.+++.+++++++++..|+.++.-.-. ....+ ... T Consensus 69 ~f~~v~l~~~kl-~~~~~iS--~ell~~~~d~~~~-l~~~i~~~la~a~~~~ld~a~l~G~~~-------~~g~~--~~~ 135 (303) T protein:vir:97 69 SLEPVTIVPIKV-EYGARLS--DEFLYATEEEKID-ILKAFNEGFAKKLARGIDLMAMHGINP-------RTKKA--SDV 135 (303) T ss_pred ceeeEEeeeEEE-EEeehhh--HHHhhcCccchHH-HHHHHHHHHHHHHHHHHHhhhhccccc-------CCccc--ccc Confidence 777666654322 2222222 2222 1233 567899999999999999988643210 00000 000 Q ss_pred cccccccc-cCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccccCcccccceEE Q lcl|Aclame:pro 155 HGFSINVN-VTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATINGFVL 233 (402) Q Consensus 155 ~~~~~~v~-~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~~~~G~V~ 233 (402) .+...... .+......+....|+.|.++..++...+.... .++++|..+..|.+-..-. ..|-- ......++... 
T Consensus 136 ~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~--~~vmn~~~~~~L~~lkd~~-g~~~~-~~~~~~~~~~~ 211 (303) T protein:vir:97 136 IGTNHFDSKVTQVVKFTESEDADANIEAAVNLIQGAEGVVT--GLAMDTEFSTALAKVTNGE-MGPKM-YPELAWGANPD 211 (303) T ss_pred ccccccccccccccccccccchHHHHHHHHHHHhhcCCCcc--EEEEcHHHHHHHHHhhccC-CCeEE-ecCccCCCCCc Confidence 01001000 01111112233457888888888877665443 3789999999886411100 00000 00011133456 Q ss_pred EEeccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccchh------HHHH Q lcl|Aclame:pro 234 SSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKK------EKTY 307 (402) Q Consensus 234 ~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~------~~~d 307 (402) +++|+||+.|+++|...... .+...-+-+||.+.+.+..... ..++..+ +.+.. .+.| T Consensus 212 ~l~G~Pv~~s~~v~~~~~~~---------~~~~~~~~Gdf~~~~~~~~~~~--~~~~~~~-----~~~~d~~~~~~~~~n 275 (303) T protein:vir:97 212 SINGLKSSVNTTVGAGADEA---------ESKDLVIIGDFESMFKWGYAKQ--IPMEIIK-----YGDPDNSGKDLKGYN 275 (303) T ss_pred eecceeeEEecccCCccccC---------CCccEEEEeeccccEEEEEecC--cEEEEee-----ccCCCCcchhhhhcC Confidence 89999999999999643211 1112235577766444332211 1111111 11111 1111 Q ss_pred --HHHHHHHhcCcccccceEEEEEEeecc Q lcl|Aclame:pro 308 --YIDTFMAEGAIPDRWEAVSVVTTKRDA 334 (402) Q Consensus 308 --~i~~~~a~Ga~vlRPeaa~vv~~~~~~ 334 (402) .+++..-++.+++||+|.+.|+ +..+ T Consensus 276 ~~~~r~~~r~~~~v~~p~af~~l~-~~~~ 303 (303) T protein:vir:97 276 QIYLRAEAYIGWGILDAKSFARVT-KGEV 303 (303) T ss_pred cEEEEEEEEeccEeecccceEEee-CCCC Confidence 3445566899999999876653 2222 No 87 >protein:vir:4339 Length: 395 # NCBI annotation: major head protein # Family: family:all:585 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061502;genbank:gi:9635591;genbank:GeneID:1262860 Probab=99.21 E-value=2.2e-12 Score=84.55 Aligned_cols=281 Identities=12% Similarity=0.060 Sum_probs=160.4 Q ss_pred CCC----C-ccc-ccccccccccHHHHHHHHHhHHHHHHHHHHhhhcccceeeeccccceEEeeec-c-ceeeeeecCCC Q lcl|Aclame:pro 1 
MST----P-NTL-TNVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYL-G-ETELQVLAPGQ 72 (402) Q Consensus 1 Ms~----~-n~~-t~~~~~~~~d~~alfle~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~~i-G-~~t~~~~~~G~ 72 (402) |.. . ... .+.....+++.-.+..+.|+.+++......+.+++++++.++.+ .++.+++. + ..++..+.-|+ T Consensus 98 ~~~~~~~~~~~~~~~~~~~~~~~~g~~vp~~~~~~ii~~~~~~~~l~~l~~~~~~~~-~~~~~~~~~~~~~~a~~v~E~~ 176 (395) T protein:vir:43 98 TSSLRGSHRVSMPRSAITSIDGSGGALVAPDRRPGVVAAPQRRLTIRDLVAPGTTES-NSVEYVRETGFVNNAAPVSEGT 176 (395) T ss_pred HHHhhhhhhhhhhhhhhcccCCCCccccchhhHHHHHHHHHhhhhHHhhccceecCC-CceEEEEEecCCCceeeecCCc Confidence 000 0 000 01111112223346678899999999999999999999888765 46788874 4 35666777787 Q ss_pred CCCCCCccccceeEeecceee-ccchhhhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccccc Q lcl|Aclame:pro 73 SPNATPTQADKNQLVIDTTVI-ARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPR 151 (402) Q Consensus 73 ~i~~~~~~~~e~~itID~~ly-a~~~IddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~ 151 (402) .+....+...+.++.+..+-. ..+.-.-|++. .+ +.+.+.+++++++++..|..++. +.....++ T Consensus 177 ~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~---~~-l~~~v~~~la~a~~~~~d~~~l~----G~g~~~~~------ 242 (395) T protein:vir:43 177 QKPYSDLTFELENAPVRTIAHLFKASRQILDDA---SA-LQSYIDARARYGLMLVEECQLLY----GNGTGANL------ 242 (395) T ss_pred cccccccceeEEEEeeeeEEEeehhhHHHHHhH---HH-HHHHHHHHHHHHHHHHHHHHHHh----ccCCCCcc------ Confidence 777777777887777776542 12222222222 23 66788888999999999988752 22111111 Q ss_pred ccccccccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcc----cchhhcccccccCccc Q lcl|Aclame:pro 152 VKGHGFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDA----DRIVDKTYTISQSGAT 227 (402) Q Consensus 152 ~~g~~~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~----~r~~n~d~~~~~~g~~ 227 (402) .|......+.........+....++.|.++...+...+.+.. .+|++|..|..|.+- .+++..+ . 
T Consensus 243 -~Gi~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~--~~vmn~~~~~~l~~lkd~~G~~i~~~--------~ 311 (395) T protein:vir:43 243 -HGIIPQAQAYAPPSGVVVTAEQRIDRIRLAILQAQLAEFPAS--GIVLNPIDWALIELNKDAENRYIIGS--------P 311 (395) T ss_pred -ccccccccccccccccccccchhHHHHHHHHHhhccccCCCc--EEEEcHHHHHHHHHhhccCCceeccc--------c Confidence 111111111111111223334568889999888888776533 468999999988642 1222111 1 Q ss_pred ccceEEEEeccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccchh-HH- Q lcl|Aclame:pro 228 INGFVLSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKK-EK- 305 (402) Q Consensus 228 ~~G~V~~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~-~~- 305 (402) .+|...+++|+||+.|+++|... -+-+||+... +++-+ .+++.+..+... .| T Consensus 312 ~~~~~~~l~G~pVv~~~~~~~~~-----------------~~~gd~~~~~-~~~~~--------~~~~i~~~~~~~~~f~ 365 (395) T protein:vir:43 312 QNGTTPTLWRLPVVETQAITQDE-----------------FLTGAFSLGA-QIFDR--------MDIEVLVSTENDKDFE 365 (395) T ss_pred ccCCCceecceeeEEcCCCCCCc-----------------EEEEeccceE-EEEEe--------cceEEEEeccccchhh Confidence 23445678999999999998532 0224444321 11111 122333332221 11 Q ss_pred --HHHHHHHHHhcCcccccceEEEEEEeec Q lcl|Aclame:pro 306 --TYYIDTFMAEGAIPDRWEAVSVVTTKRD 333 (402) Q Consensus 306 --~d~i~~~~a~Ga~vlRPeaa~vv~~~~~ 333 (402) ...+++.+-+|.++++|++.+.|+.+.. 
T Consensus 366 ~~~~~~r~~~r~d~~v~~~~a~~~~~~taa 395 (395) T protein:vir:43 366 NNMVTIRAEERLAFAVYRPEAFVTGSLTAS 395 (395) T ss_pred cCcEEEEEEEeeccEEecccceEEEEeccC Confidence 2234455668999999999877755433 No 88 >protein:vir:96762 Length: 632 # NCBI annotation: putative phage-related protein # Family: family:all:21 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039818;genbank:gi:126010917;genbank:GeneID:5076272 Probab=99.21 E-value=3.7e-13 Score=88.77 Aligned_cols=281 Identities=12% Similarity=0.015 Sum_probs=150.9 Q ss_pred CCCCcccccccccccccHH-HHHH-HHHhHHHHHHHHHHhhhccc-ceeeeccccceEEeeec-cceeeeeecCCCCCCC Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVD-SLLI-EKFNGKVNEQYLKGENILSY-FDVQTVTGTNTVSNKYL-GETELQVLAPGQSPNA 76 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~-alfl-e~f~geV~t~f~~~sv~~~~-~~~rti~~Gksv~f~~i-G~~t~~~~~~G~~i~~ 76 (402) |.......+....++...- .|-. +.++.+++......++++.+ .++-+...| .++||+. |..++..+.-|+.+.. T Consensus 347 ~~~~~l~~ra~~~~t~~~gg~lvp~~~~~~~iie~lr~~s~i~~l~~~~~~~~~g-~~~ip~~~~~~~a~wv~E~~~~~~ 425 (632) T protein:vir:96 347 MPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVG-DVDIPKKTSGANFYWIGEDEDVQD 425 (632) T ss_pred hhHHHHHHhhhhcccccccccccccccchHHHHHHHhhcchhhhhcceEeecCCc-ceEEEEEeCCceeEeecCCccccc Confidence 1111111111111111111 1223 33467777777777887776 333333344 5778865 7777777777888877 Q ss_pred CCccccceeEeecceeeccchhhhHHH-hhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccccccccc Q lcl|Aclame:pro 77 TPTQADKNQLVIDTTVIARNTVAHIHD-VQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKGH 155 (402) Q Consensus 77 ~~~~~~e~~itID~~lya~~~IddlDe-~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~g~ 155 (402) ..+..++.++..- +++....-.-+= -++.+| +.+.+..++++++++..|+.++. +.... ..|. |. 
T Consensus 426 s~~~f~~i~l~~~--k~~~~v~iS~ell~ds~~~-~~~~i~~~l~~a~~~~~d~a~l~----G~G~~-----~~p~--Gi 491 (632) T protein:vir:96 426 SDFDFTTLSFSPK--TIAGAVPVTRKLRKQSSIH-VENLIREDLIEGIGVALDLAMLT----GTGLA-----NDPV--GL 491 (632) T ss_pred cccceeeEEeeee--EEEEehhhHHHHHhccchH-HHHHHHHHHHHHHHHHHHHHhhc----ccCCC-----Cccc--ee Confidence 7777776666653 333332222111 134566 78889999999999999998752 11110 0010 11 Q ss_pred ccccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccccCcccccceEEEE Q lcl|Aclame:pro 156 GFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATINGFVLSS 235 (402) Q Consensus 156 ~~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~~~~G~V~~i 235 (402) .....+ ..........-|+.|.++..++...++....-..+++|..+..|.... +.+ ..+...+.+ +++ T Consensus 492 ~~~~~~---~~~~~~~~~~~~~~i~~~~~~i~~~~~~~~~~~~~~~~~~~~~l~~~~-l~d----~~G~~i~~~---~~l 560 (632) T protein:vir:96 492 LNMTGV---PALTYPAGGVDWASVVDMETKISTFNADAGRLAYLTSVTQRGAAKKAQ-VFD----NTGERIWQN---NEV 560 (632) T ss_pred eecccc---cceecccccCCHHHHHHHHHHHhhcccccCccEEEEchhHHHHHHHHh-ccC----CCCceeecC---Cee Confidence 100000 000000111125677888888888887755556688998887776532 211 112223323 367 Q ss_pred eccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccchhHHHHHHHHHHHh Q lcl|Aclame:pro 236 YNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKEKTYYIDTFMAE 315 (402) Q Consensus 236 aG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~~~~d~i~~~~a~ 315 (402) .|.+|+.||++|... -+-+||+... 
+ +...-+.+....+....+-.-.+.+++-+ T Consensus 561 ~G~pv~~s~~ip~~~-----------------~~~gd~s~~~--i------~~~~~~~i~~~~~~~~~~~~v~~~~~~~~ 615 (632) T protein:vir:96 561 NGYRAEASNQIPADT-----------------WIFGDWSQIV--I------AMWGVLDLKVDPYTKAASDGLVLRVFQDV 615 (632) T ss_pred cccceEeccccccCc-----------------EEEeecceEE--E------EEecceEEEEccccccccCceEEEEEeec Confidence 999999999999532 1224444321 1 11111222222222233334466677789 Q ss_pred cCcccccceEEEEEEee Q lcl|Aclame:pro 316 GAIPDRWEAVSVVTTKR 332 (402) Q Consensus 316 Ga~vlRPeaa~vv~~~~ 332 (402) +.+++||++.++++-+. T Consensus 616 d~~v~~~~af~~~k~~A 632 (632) T protein:vir:96 616 DAGVRRKEAFCIAKKGA 632 (632) T ss_pred CceeechhhhhheeecC Confidence 99999999988776665 No 89 >protein:vir:9410 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803388;genbank:gi:29028700;genbank:GeneID:1258136 Probab=99.20 E-value=9e-13 Score=86.62 Aligned_cols=298 Identities=9% Similarity=0.019 Sum_probs=163.0 Q ss_pred CCCCcccc-----cccccccccHHHHHH-HHHhHHHHHHHHHHhhhcccceeeeccccc-eEEeee-ccceeeeeecCCC Q lcl|Aclame:pro 1 MSTPNTLT-----NVAVSASGEVDSLLI-EKFNGKVNEQYLKGENILSYFDVQTVTGTN-TVSNKY-LGETELQVLAPGQ 72 (402) Q Consensus 1 Ms~~n~~t-----~~~~~~~~d~~alfl-e~f~geV~t~f~~~sv~~~~~~~rti~~Gk-sv~f~~-iG~~t~~~~~~G~ 72 (402) ..+.+.+. +.+..+ .+.-...| +.+.++++......+.+++++++.++.+++ ++.++. .+...+..+..|. 
T Consensus 107 ~~~~~~~~~~~~~~~~~~~-~~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~Eg~ 185 (415) T protein:vir:94 107 RDFTEYLETRNDIQGGSLK-TDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELE 185 (415) T ss_pred HHHHHHhhhhhhhhhhccc-cccccccCcHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEeecCCccceeccccc Confidence 00000000 000000 11112223 789999999999999999999999987543 444443 4556677777777 Q ss_pred CCCC-CCccccceeEeecceeeccchhhhHHHh-hcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccc Q lcl|Aclame:pro 73 SPNA-TPTQADKNQLVIDTTVIARNTVAHIHDV-QGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKP 150 (402) Q Consensus 73 ~i~~-~~~~~~e~~itID~~lya~~~IddlDe~-q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~ 150 (402) .+.. ..+...+.++.+-.+ +....-.-+=. ++.+| +.+.+.+++++++++.+|+.++.-.-.+. +. T Consensus 186 ~~~~~~~~~~~~i~~~~~k~--~~~~~is~ell~ds~~~-~~~~i~~~l~~~~~~~~~~~il~g~g~g~---------~~ 253 (415) T protein:vir:94 186 ENPELAVKPFFQLAYDINTH--RGYFRISREAIEDAKVN-VLQELKLWMARTIAATRNKAIIDVITKGS---------TG 253 (415) T ss_pred cccccccccceeeEeeheee--eeechhhHHHHhhchHH-HHHHHHHHHHHHHHHHHHHHHhhccccCc---------cc Confidence 7654 345556666655543 33332222112 23456 78899999999999999998864432110 00 Q ss_pred cccccccccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccccCcccccc Q lcl|Aclame:pro 151 RVKGHGFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATING 230 (402) Q Consensus 151 ~~~g~~~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~~~~G 230 (402) . .+... .........+....|+.|.++...+...+... . .+|++|..|..|.+-.. 
.+..|- -.....+| T Consensus 254 ~---~~~~~--~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~-~-~~vmn~~~~~~l~~lkd-~~G~~l--~~~~~~~~ 323 (415) T protein:vir:94 254 S---TSSGF--EKEGKKLEVKKAKSLDDIKDAINLNVKPNYEH-N-VAIVSQTMFAKLDKMKD-KLGNYL--IQPDVKEK 323 (415) T ss_pred c---ccccc--cccccccccccccchHHHHHHHHhhhhhccCC-C-EEEEcHHHHHHHHHhhc-cCCCee--eccCcCCC Confidence 0 00000 00011111222234778888888887777652 2 45889999999965211 011111 01122356 Q ss_pred eEEEEeccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccchhHHHHHHH Q lcl|Aclame:pro 231 FVLSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKEKTYYID 310 (402) Q Consensus 231 ~V~~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~~~~d~i~ 310 (402) ...+++|+||+.++++|.+..+.. .-+-+||++...+ + .-.+++.+..+ .......++ T Consensus 324 ~~~~l~G~pV~~~~~~~~~~~~~~------------~i~~gd~~~~~~~-~--------~~~~~~v~~~~-~~~~~~~~r 381 (415) T protein:vir:94 324 TQQRLLGAKIEILPDEVLGQKGNN------------TLIIGNLKDAIVL-F--------DRSQYQASWTD-YMHFGECLM 381 (415) T ss_pred CCceecceeeEEecccccCCCCcc------------EEEEEehhccEEE-E--------eecceEEEEec-cccCceEEE Confidence 667899999999999986432111 1133555543221 1 22233333332 233334566 Q ss_pred HHHHhcCcccccceEEEEEEeec-cCccccccch Q lcl|Aclame:pro 311 TFMAEGAIPDRWEAVSVVTTKRD-ATTGDAGGPG 343 (402) Q Consensus 311 ~~~a~Ga~vlRPeaa~vv~~~~~-~t~~~a~~~~ 343 (402) +.+-++..+++|+|.+.+++... 
..+|+-+-.+ T Consensus 382 ~~~r~d~~~~~~~a~~~~~~~~~~~~~~~~~~~~ 415 (415) T protein:vir:94 382 IAVRQDCRILDYKSAIVIEYDDSERGEGDLGLEA 415 (415) T ss_pred EEEEeccEEeccccEEEEEEeccCCCCCccccCC Confidence 77789999999999988887643 3333433222 No 90 >protein:vir:97148 Length: 324 # NCBI annotation: ORF010 # Family: family:all:507 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239726;genbank:gi:66394880;genbank:GeneID:5130881 Probab=99.19 E-value=5.9e-13 Score=87.64 Aligned_cols=283 Identities=8% Similarity=0.007 Sum_probs=158.6 Q ss_pred CCCCcccccccccccccHHHHHH-HHHhHHHHHHHHHHhhhcccceeeeccccceEEeeec-cceeeeeecCCCCCCCCC Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDSLLI-EKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYL-GETELQVLAPGQSPNATP 78 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~alfl-e~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~~i-G~~t~~~~~~G~~i~~~~ 78 (402) +..+.+.+ ..+.-...| +.|..++++.....++++++.++.++. +.+++||+. |...+.-+.-|+.++... T Consensus 23 ~~~a~~~~------~~~~~~~~iP~~~~~~ii~~~~~~s~l~~~~~~~~~~-~~~~~ip~~~~~~~a~~v~Eg~~~~~~~ 95 (324) T protein:vir:97 23 VFNPDNVM------MHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPME-GTEKKFTFWADKPGAYWVGEGQKIETSK 95 (324) T ss_pred hhcccccc------ccCCCcceechhHHHHHHHHHHhhcchhhhcceeecc-CCceEEEEEecCcceeEeccCccccccc Confidence 11111111 111122234 889999999999999999998777765 456888876 677777888888888777 Q ss_pred ccccceeEeecceeeccchhhhHHHhh-cCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccccccccccc Q lcl|Aclame:pro 79 TQADKNQLVIDTTVIARNTVAHIHDVQ-GDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKGHGF 157 (402) Q Consensus 79 ~~~~e~~itID~~lya~~~IddlDe~q-~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~g~~~ 157 (402) +..++.++..-.+- .-..|.+ +-.+ +.++ +.+.+.+++++++++..|+.++.- ... +. ...+. 
T Consensus 96 ~~f~~v~~~~~k~~-~~~~is~-ell~ds~~~-l~~~i~~~l~~aia~~~d~a~l~G----~g~-------~~--~~~gi 159 (324) T protein:vir:97 96 ATWVNATMRAFKLG-VILPVTK-EFLNYTYSQ-FFEEMKPMIAEAFYKKFDEAGILN----QGN-------NP--FGKSI 159 (324) T ss_pred cceeEEEEeeEEEE-EeehhhH-HHHhcchHH-HHHHHHHHHHHHHHHHHHHHhhcc----CCC-------Cc--cCccc Confidence 77777777665532 2222322 2122 2355 678999999999999999988631 100 00 00000 Q ss_pred ccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccccCcccccceEEEEec Q lcl|Aclame:pro 158 SINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATINGFVLSSYN 237 (402) Q Consensus 158 ~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~~~~G~V~~iaG 237 (402) ..... ..+........|+.|.++...|.+.+.... .++++|..|..|.+- .+.+ +...+..+.-+++.| T Consensus 160 ~~~~~--~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~--~~v~n~~~~~~L~~l---kd~~----g~~~~~~~~~~tl~G 228 (324) T protein:vir:97 160 AQSIE--KTNKVIKGDFTQDNIIDLEALLEDDELEAN--AFISKTQNRSLLRKI---VDPE----TKERIYDRNSDTLDG 228 (324) T ss_pred ccccc--ccceeccccCCHHHHHHHHHhhhhccCCCC--EEEEcHHHHHHHHHh---hcCC----CceeecCCCCccccc Confidence 00100 011111112236778888888887765533 468999999988642 1111 112222333457899 Q ss_pred cEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccchh-------------H Q lcl|Aclame:pro 238 CPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKK-------------E 304 (402) Q Consensus 238 ~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~-------------~ 304 (402) .||+.|+..+...+ .-+-+||++.. +. ...++..+..++.. . 
T Consensus 229 ~PV~~~~~~~~~~~---------------~~~~gd~~~~~-i~---------~~~~~~i~~~~~~~~~~~~~~~~~~~~~ 283 (324) T protein:vir:97 229 LPVVNLKSSNLKRG---------------ELITGDFDKLI-YG---------IPQLIEYKIDETAQLSTVKNEDGTPVNL 283 (324) T ss_pred eeeEeecCCCCCcc---------------eEEEEecccEE-EE---------EecCcEEEEeecccccccccccccchhh Confidence 99999887653221 11335555432 11 12233333333211 0 Q ss_pred H---HHHHHHHHHhcCcccccceEEEEEEeeccCccccccchhhH Q lcl|Aclame:pro 305 K---TYYIDTFMAEGAIPDRWEAVSVVTTKRDATTGDAGGPGDDH 346 (402) Q Consensus 305 ~---~d~i~~~~a~Ga~vlRPeaa~vv~~~~~~t~~~a~~~~~~~ 346 (402) | .-.+++.+-+|.++++|++.+.|+..-..++... .++ T Consensus 284 f~~d~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~~----~~~ 324 (324) T protein:vir:97 284 FEQDMVALRATMHVALHIADDKAFAKLVPADKKTDSVP----GEV 324 (324) T ss_pred hhcCcEEEEEEEEeccEEecccceEEEEeccCCCCCCC----CCC Confidence 1 1223444568999999999888776544333322 222 No 91 >protein:vir:81100 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429874;genbank:gi:156603927;genbank:GeneID:5525320 Probab=99.19 E-value=1.2e-12 Score=86.04 Aligned_cols=297 Identities=10% Similarity=0.041 Sum_probs=163.8 Q ss_pred CCCCcccccccccccccHHHHHH-HHHhHHHHHHHHHHhhhcccceeeeccccc-eEEee-eccceeeeeecCCCCCCCC Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDSLLI-EKFNGKVNEQYLKGENILSYFDVQTVTGTN-TVSNK-YLGETELQVLAPGQSPNAT 77 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~alfl-e~f~geV~t~f~~~sv~~~~~~~rti~~Gk-sv~f~-~iG~~t~~~~~~G~~i~~~ 77 (402) +...+..... ....+.-...+ +.|..+++......+.++++.++.++.+++ ++.++ ..+...+..+..|..+... 
T Consensus 113 ~~~~~~~~~~--~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~ 190 (415) T protein:vir:81 113 LETRNDIQGG--SLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPEL 190 (415) T ss_pred Hhhhhhhhhc--cccccccccccchHHHHHHHHHHHhhhhhhhheeeeeccCCceeEEEEeecCCccceeeccccccCcc Confidence 0000000000 00011112233 788999999999999999999998887543 34344 4566677777777777643 Q ss_pred -CccccceeEeeccee-eccchhhhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccccccccc Q lcl|Aclame:pro 78 -PTQADKNQLVIDTTV-IARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKGH 155 (402) Q Consensus 78 -~~~~~e~~itID~~l-ya~~~IddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~g~ 155 (402) .+...+.++.+..+- +..+.-.-++ ++.+| +.+.+.+++++++++..|+.++.-.-.+. +.. . T Consensus 191 ~~~~~~~v~~~~~k~~~~~~iS~ell~--ds~~~-l~~~i~~~l~~~~~~~~~~~il~g~g~g~---------~~~---~ 255 (415) T protein:vir:81 191 AVKPFFQLAYDINTHRGYFRISREAIE--DAKVN-VLQELKLWMARTIAATRNKAIIDVITKGS---------TGS---T 255 (415) T ss_pred cccceeeEEeeeeeeEeeehhhHHHHh--hchHH-HHHHHHHHHHHHHHHHHHHHHhhccccCc---------ccc---c Confidence 456677777666543 1111112222 24566 78899999999999999998864332110 000 0 Q ss_pred ccccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccccCcccccceEEEE Q lcl|Aclame:pro 156 GFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATINGFVLSS 235 (402) Q Consensus 156 ~~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~~~~G~V~~i 235 (402) +... .........+....|+.|.++..++...+... .. +|++|..|..|.+-.. .|.+|-. 
.....+|...++ T Consensus 256 ~~~~--~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~-~~-~v~n~~~~~~l~~lkd-~~G~~l~--~~~~~~~~~~~l 328 (415) T protein:vir:81 256 SSGF--EKEGKKLEVKKAKSLDDIKDAINLNVKPNYEH-NV-AIVSQTMFAKLDKMKD-KLGNYLI--QPDVKEKTQQRL 328 (415) T ss_pred cccc--cccccccccccccchhHHHHHHHhhhhhccCC-CE-EEEcHHHHHHHHHhhc-cCCceee--ccCcCCCCCcee Confidence 0000 00011111222234788888888888777652 33 5889999999964111 1111110 111234556789 Q ss_pred eccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccchhHHHHHHHHHHHh Q lcl|Aclame:pro 236 YNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKEKTYYIDTFMAE 315 (402) Q Consensus 236 aG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~~~~d~i~~~~a~ 315 (402) +|+||+.++++|.+..+. ..-+-+||++..- ...-.+++.+..+. ..+...+.+.+-+ T Consensus 329 ~G~pV~~~~~~~~~~~~~------------~~~~~Gd~~~~~~---------~~~~~~~~v~~~~~-~~~~~~~~~~~r~ 386 (415) T protein:vir:81 329 LGAKIEILPDEVLGQKGN------------NTLIIGNLKDAIV---------LFDRSQYQASWTDY-MHFGECLMIAVRQ 386 (415) T ss_pred cceeeEEecccccCCCCc------------cEEEEEehhccEE---------EEeecceEEEEecc-ccCceEEEEEEEe Confidence 999999999998643211 1113345544221 12223344443322 2233445677789 Q ss_pred cCcccccceEEEEEEee-ccCccccccch Q lcl|Aclame:pro 316 GAIPDRWEAVSVVTTKR-DATTGDAGGPG 343 (402) Q Consensus 316 Ga~vlRPeaa~vv~~~~-~~t~~~a~~~~ 343 (402) +..+++|++.+.+++.. 
...+|+-+-.+ T Consensus 387 d~~v~~~~a~~~~~~~~~~~~~~~~~~~~ 415 (415) T protein:vir:81 387 DCRILDYKSAIVIEYDDSERGEGDLGLEA 415 (415) T ss_pred ccEEeccccEEEEEEeccCCCCCccccCC Confidence 99999999998888765 33334444222 No 92 >protein:vir:98339 Length: 415 # NCBI annotation: putative capsid protein # Family: family:all:21 # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918931;genbank:gi:119443693;genbank:GeneID:4594501 Probab=99.19 E-value=1.2e-12 Score=86.04 Aligned_cols=297 Identities=10% Similarity=0.041 Sum_probs=163.8 Q ss_pred CCCCcccccccccccccHHHHHH-HHHhHHHHHHHHHHhhhcccceeeeccccc-eEEee-eccceeeeeecCCCCCCCC Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDSLLI-EKFNGKVNEQYLKGENILSYFDVQTVTGTN-TVSNK-YLGETELQVLAPGQSPNAT 77 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~alfl-e~f~geV~t~f~~~sv~~~~~~~rti~~Gk-sv~f~-~iG~~t~~~~~~G~~i~~~ 77 (402) +...+..... ....+.-...+ +.|..+++......+.++++.++.++.+++ ++.++ ..+...+..+..|..+... T Consensus 113 ~~~~~~~~~~--~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~ 190 (415) T protein:vir:98 113 LETRNDIQGG--SLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPEL 190 (415) T ss_pred Hhhhhhhhhc--cccccccccccchHHHHHHHHHHHhhhhhhhheeeeeccCCceeEEEEeecCCccceeeccccccCcc Confidence 0000000000 00011112233 788999999999999999999998887543 34344 4566677777777777643 Q ss_pred -CccccceeEeeccee-eccchhhhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccccccccc Q lcl|Aclame:pro 78 -PTQADKNQLVIDTTV-IARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKGH 155 (402) Q Consensus 78 -~~~~~e~~itID~~l-ya~~~IddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~g~ 155 (402) .+...+.++.+..+- +..+.-.-++ ++.+| +.+.+.+++++++++..|+.++.-.-.+. +.. . 
T Consensus 191 ~~~~~~~v~~~~~k~~~~~~iS~ell~--ds~~~-l~~~i~~~l~~~~~~~~~~~il~g~g~g~---------~~~---~ 255 (415) T protein:vir:98 191 AVKPFFQLAYDINTHRGYFRISREAIE--DAKVN-VLQELKLWMARTIAATRNKAIIDVITKGS---------TGS---T 255 (415) T ss_pred cccceeeEEeeeeeeEeeehhhHHHHh--hchHH-HHHHHHHHHHHHHHHHHHHHHhhccccCc---------ccc---c Confidence 456677777666543 1111112222 24566 78899999999999999998864332110 000 0 Q ss_pred ccccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccccCcccccceEEEE Q lcl|Aclame:pro 156 GFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATINGFVLSS 235 (402) Q Consensus 156 ~~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~~~~G~V~~i 235 (402) +... .........+....|+.|.++..++...+... .. +|++|..|..|.+-.. .|.+|-. .....+|...++ T Consensus 256 ~~~~--~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~-~~-~v~n~~~~~~l~~lkd-~~G~~l~--~~~~~~~~~~~l 328 (415) T protein:vir:98 256 SSGF--EKEGKKLEVKKAKSLDDIKDAINLNVKPNYEH-NV-AIVSQTMFAKLDKMKD-KLGNYLI--QPDVKEKTQQRL 328 (415) T ss_pred cccc--cccccccccccccchhHHHHHHHhhhhhccCC-CE-EEEcHHHHHHHHHhhc-cCCceee--ccCcCCCCCcee Confidence 0000 00011111222234788888888888777652 33 5889999999964111 1111110 111234556789 Q ss_pred eccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccchhHHHHHHHHHHHh Q lcl|Aclame:pro 236 YNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKEKTYYIDTFMAE 315 (402) Q Consensus 236 aG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~~~~d~i~~~~a~ 315 (402) +|+||+.++++|.+..+. ..-+-+||++..- ...-.+++.+..+. 
..+...+.+.+-+ T Consensus 329 ~G~pV~~~~~~~~~~~~~------------~~~~~Gd~~~~~~---------~~~~~~~~v~~~~~-~~~~~~~~~~~r~ 386 (415) T protein:vir:98 329 LGAKIEILPDEVLGQKGN------------NTLIIGNLKDAIV---------LFDRSQYQASWTDY-MHFGECLMIAVRQ 386 (415) T ss_pred cceeeEEecccccCCCCc------------cEEEEEehhccEE---------EEeecceEEEEecc-ccCceEEEEEEEe Confidence 999999999998643211 1113345544221 12223344443322 2233445677789 Q ss_pred cCcccccceEEEEEEee-ccCccccccch Q lcl|Aclame:pro 316 GAIPDRWEAVSVVTTKR-DATTGDAGGPG 343 (402) Q Consensus 316 Ga~vlRPeaa~vv~~~~-~~t~~~a~~~~ 343 (402) +..+++|++.+.+++.. ...+|+-+-.+ T Consensus 387 d~~v~~~~a~~~~~~~~~~~~~~~~~~~~ 415 (415) T protein:vir:98 387 DCRILDYKSAIVIEYDDSERGEGDLGLEA 415 (415) T ss_pred ccEEeccccEEEEEEeccCCCCCccccCC Confidence 99999999998888765 33334444222 No 93 >protein:vir:79987 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430002;genbank:gi:156604057;genbank:GeneID:5525447 Probab=99.19 E-value=1.2e-12 Score=86.04 Aligned_cols=297 Identities=10% Similarity=0.041 Sum_probs=163.8 Q ss_pred CCCCcccccccccccccHHHHHH-HHHhHHHHHHHHHHhhhcccceeeeccccc-eEEee-eccceeeeeecCCCCCCCC Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDSLLI-EKFNGKVNEQYLKGENILSYFDVQTVTGTN-TVSNK-YLGETELQVLAPGQSPNAT 77 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~alfl-e~f~geV~t~f~~~sv~~~~~~~rti~~Gk-sv~f~-~iG~~t~~~~~~G~~i~~~ 77 (402) +...+..... ....+.-...+ +.|..+++......+.++++.++.++.+++ ++.++ ..+...+..+..|..+... 
T Consensus 113 ~~~~~~~~~~--~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~ 190 (415) T protein:vir:79 113 LETRNDIQGG--SLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPEL 190 (415) T ss_pred Hhhhhhhhhc--cccccccccccchHHHHHHHHHHHhhhhhhhheeeeeccCCceeEEEEeecCCccceeeccccccCcc Confidence 0000000000 00011112233 788999999999999999999998887543 34344 4566677777777777643 Q ss_pred -CccccceeEeeccee-eccchhhhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccccccccc Q lcl|Aclame:pro 78 -PTQADKNQLVIDTTV-IARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKGH 155 (402) Q Consensus 78 -~~~~~e~~itID~~l-ya~~~IddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~g~ 155 (402) .+...+.++.+..+- +..+.-.-++ ++.+| +.+.+.+++++++++..|+.++.-.-.+. +.. . T Consensus 191 ~~~~~~~v~~~~~k~~~~~~iS~ell~--ds~~~-l~~~i~~~l~~~~~~~~~~~il~g~g~g~---------~~~---~ 255 (415) T protein:vir:79 191 AVKPFFQLAYDINTHRGYFRISREAIE--DAKVN-VLQELKLWMARTIAATRNKAIIDVITKGS---------TGS---T 255 (415) T ss_pred cccceeeEEeeeeeeEeeehhhHHHHh--hchHH-HHHHHHHHHHHHHHHHHHHHHhhccccCc---------ccc---c Confidence 456677777666543 1111112222 24566 78899999999999999998864332110 000 0 Q ss_pred ccccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccccCcccccceEEEE Q lcl|Aclame:pro 156 GFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATINGFVLSS 235 (402) Q Consensus 156 ~~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~~~~G~V~~i 235 (402) +... .........+....|+.|.++..++...+... .. +|++|..|..|.+-.. .|.+|-. 
.....+|...++ T Consensus 256 ~~~~--~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~-~~-~v~n~~~~~~l~~lkd-~~G~~l~--~~~~~~~~~~~l 328 (415) T protein:vir:79 256 SSGF--EKEGKKLEVKKAKSLDDIKDAINLNVKPNYEH-NV-AIVSQTMFAKLDKMKD-KLGNYLI--QPDVKEKTQQRL 328 (415) T ss_pred cccc--cccccccccccccchhHHHHHHHhhhhhccCC-CE-EEEcHHHHHHHHHhhc-cCCceee--ccCcCCCCCcee Confidence 0000 00011111222234788888888888777652 33 5889999999964111 1111110 111234556789 Q ss_pred eccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccchhHHHHHHHHHHHh Q lcl|Aclame:pro 236 YNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKEKTYYIDTFMAE 315 (402) Q Consensus 236 aG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~~~~d~i~~~~a~ 315 (402) +|+||+.++++|.+..+. ..-+-+||++..- ...-.+++.+..+. ..+...+.+.+-+ T Consensus 329 ~G~pV~~~~~~~~~~~~~------------~~~~~Gd~~~~~~---------~~~~~~~~v~~~~~-~~~~~~~~~~~r~ 386 (415) T protein:vir:79 329 LGAKIEILPDEVLGQKGN------------NTLIIGNLKDAIV---------LFDRSQYQASWTDY-MHFGECLMIAVRQ 386 (415) T ss_pred cceeeEEecccccCCCCc------------cEEEEEehhccEE---------EEeecceEEEEecc-ccCceEEEEEEEe Confidence 999999999998643211 1113345544221 12223344443322 2233445677789 Q ss_pred cCcccccceEEEEEEee-ccCccccccch Q lcl|Aclame:pro 316 GAIPDRWEAVSVVTTKR-DATTGDAGGPG 343 (402) Q Consensus 316 Ga~vlRPeaa~vv~~~~-~~t~~~a~~~~ 343 (402) +..+++|++.+.+++.. 
...+|+-+-.+ T Consensus 387 d~~v~~~~a~~~~~~~~~~~~~~~~~~~~ 415 (415) T protein:vir:79 387 DCRILDYKSAIVIEYDDSERGEGDLGLEA 415 (415) T ss_pred ccEEeccccEEEEEEeccCCCCCccccCC Confidence 99999999998888765 33334444222 No 94 >protein:vir:100135 Length: 418 # NCBI annotation: gp5 # Family: family:all:585 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945035;genbank:gi:38707895;genbank:GeneID:2744182 Probab=99.18 E-value=1.4e-12 Score=85.55 Aligned_cols=282 Identities=12% Similarity=0.061 Sum_probs=154.4 Q ss_pred CCC-----CcccccccccccccHHHHHHHHHhHHHHHHHHHHhhhcccceeeeccccceEEeeec-c-ceeeeeecCCCC Q lcl|Aclame:pro 1 MST-----PNTLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYL-G-ETELQVLAPGQS 73 (402) Q Consensus 1 Ms~-----~n~~t~~~~~~~~d~~alfle~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~~i-G-~~t~~~~~~G~~ 73 (402) |.. ..........+..+...+.-+.|+.+++......+.+++++++.++. |.++.+++. + ..++..+.-|+. T Consensus 121 ~~~~~~~~~~~~~~~~~~~~~~~g~lvp~~~~~~ii~~~~~~~~l~~~~~~~~~~-~~~~~~~~~~~~~~~a~~v~E~~~ 199 (418) T protein:vir:10 121 RVRVDRKSIMNVPATVGSGVSGSNSLVVADRQAGIIAPPQRKMTIRDLLMPGQTS-SSSIEYTVETGFTNNAAAVAEGAQ 199 (418) T ss_pred hhhhHHHHHHHhhhhccCCCCCCccccchhHHHHHHHHHhhhhhHHhhcceeecc-CCceeEEEEecCCCceeeeccCcc Confidence 000 00011111111122233556899999999999999999999888775 456778874 3 346667777777 Q ss_pred CCCCCccccceeEeecceee-ccchhhhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccccc Q lcl|Aclame:pro 74 PNATPTQADKNQLVIDTTVI-ARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRV 152 (402) Q Consensus 74 i~~~~~~~~e~~itID~~ly-a~~~IddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~ 152 (402) +....++.++.++....+-. ....-.-+++. -| +.+.+.+++++++++..|.+++. +..... .+. 
T Consensus 200 ~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds---~~-l~~~i~~~l~~a~~~~~d~a~l~----G~g~~~-----~p~- 265 (418) T protein:vir:10 200 KPTSDLKFNLKNQPVRTIAHLFKASRQILDDA---PA-LQSYIDGRARYGLQLTEEGQILK----GDGTGA-----NIL- 265 (418) T ss_pred ccccccceeeEEEeeeeEEEeehhhHHHHHhH---HH-HHHHHHHHHHHHHHHHHHHHHhc----cCCCCc-----ccc- Confidence 76666777777766665331 11111222222 24 67888899999999999998862 111110 011 Q ss_pred cccccccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhc--c--cchhhcccccccCcccc Q lcl|Aclame:pro 153 KGHGFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRD--A--DRIVDKTYTISQSGATI 228 (402) Q Consensus 153 ~g~~~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~--~--~r~~n~d~~~~~~g~~~ 228 (402) |....... .......+....|+.|+++...+...+.+.. .+|++|..|..|.+ | .+++.. + .. T Consensus 266 -Gi~~~~~~--~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~--~~v~n~~~~~~L~~lkd~~G~~i~~-------~-~~ 332 (418) T protein:vir:10 266 -GILPQASA--FMPSITLANATPIDKIRLALLQAVLAEFPAT--GIVLNPIDWASIELTKDSQGRYIVG-------N-PV 332 (418) T ss_pred -cccccccc--ccccccccccccHHHHHHHHHhhccccCCCC--EEEEcHHHHHHHHHhhcCCCceecc-------c-cc Confidence 11100000 0111111122246777777777776665433 36789999998864 2 222211 1 12 Q ss_pred cceEEEEeccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccchh-HHH- Q lcl|Aclame:pro 229 NGFVLSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKK-EKT- 306 (402) Q Consensus 229 ~G~V~~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~-~~~- 306 (402) +|.-++++|+||+.|+++|.+. -+-+||+... +++. -.+++.++.+... .|. 
T Consensus 333 ~~~~~~l~G~pV~~~~~~p~~~-----------------~~~gd~s~~~-~~~~--------~~~~~i~~~~~~~~~f~~ 386 (418) T protein:vir:10 333 NGTTPRLWNLPVVETQAMTANE-----------------FLVGAFSMAA-QIFD--------RMEIEVLLSTENVDDFEK 386 (418) T ss_pred cCCCceecceeeEEcCCCCCCc-----------------EEEeeccceE-EEEE--------ecceEEEEecccchhhhc Confidence 3445678999999999999532 1234444321 1221 1233333332221 122 Q ss_pred --HHHHHHHHhcCcccccceEEEEEEeeccCc Q lcl|Aclame:pro 307 --YYIDTFMAEGAIPDRWEAVSVVTTKRDATT 336 (402) Q Consensus 307 --d~i~~~~a~Ga~vlRPeaa~vv~~~~~~t~ 336 (402) ..+++.+-++.++++|++.+.++.+..+.. T Consensus 387 ~~~~~r~~~~~d~~~~~~~a~~~~~~~~~~~g 418 (418) T protein:vir:10 387 NMVSIRAEERLALAVYRPESFVTGALVEQAGG 418 (418) T ss_pred CceEEEEEEeeccEEecccceEEEEeccCCCC Confidence 234455568999999999887766533322 No 95 >protein:vir:94673 Length: 419 # NCBI annotation: major capsid protein # Family: family:all:585 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579208;genbank:gi:93007444;genbank:GeneID:5076792 Probab=99.17 E-value=2.3e-12 Score=84.40 Aligned_cols=291 Identities=11% Similarity=0.001 Sum_probs=156.9 Q ss_pred CCCCcccc-ccccccc-ccHHHHHH-HHHhHHHHHHHHHHhhhcccceeeeccccceEEeeecccee---------eeee Q lcl|Aclame:pro 1 MSTPNTLT-NVAVSAS-GEVDSLLI-EKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYLGETE---------LQVL 68 (402) Q Consensus 1 Ms~~n~~t-~~~~~~~-~d~~alfl-e~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~~iG~~t---------~~~~ 68 (402) +...+..+ .....++ .......+ +.+.+++.........++++.++.+.. 
+++++|++....+ +..+ T Consensus 112 ~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~i~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~a~~v 190 (419) T protein:vir:94 112 DIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNAD-YNVLEYIRDTSGTAGAGSTWNKAAVV 190 (419) T ss_pred HHHHHHhhccccccccccCCcccccchhhhHHHHHHHhhhhhhhhcceeeecc-CCceeeeeeccccccccccCccccee Confidence 00000001 1111111 12222333 666777777777777888888877764 4667777653322 2333 Q ss_pred cCCCCCCCCCccccceeEeeccee-eccchhhhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccc Q lcl|Aclame:pro 69 APGQSPNATPTQADKNQLVIDTTV-IARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAER 147 (402) Q Consensus 69 ~~G~~i~~~~~~~~e~~itID~~l-ya~~~IddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~ 147 (402) .-|+.++...++..++++.+..+- +..+.-.-|++. .+ +.+.+.+++++++++..|+.++. +.....|. T Consensus 191 ~Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~---~~-l~~~i~~~la~a~~~~~d~aii~----G~G~~~p~-- 260 (419) T protein:vir:94 191 PEGTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDN---SQ-LMGYIQGRLTYGLRFLRDRQLLN----GNGSTEMQ-- 260 (419) T ss_pred cCCccccccccceeeEEeeeeeEEEeehhhHHHHHhH---HH-HHHHHHHHHHHHHHHHHHHHHHh----ccCccccc-- Confidence 345555545555666666655443 112222233333 23 67888889999999999998862 21111110 Q ss_pred ccccccccccccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccccCccc Q lcl|Aclame:pro 148 NKPRVKGHGFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGAT 227 (402) Q Consensus 148 ~~~~~~g~~~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~~ 227 (402) ..........+.........+....|+.|.++...+...+.+.. .+|++|..|..|++-..-.++.|-. .. .. 
T Consensus 261 ---Gi~~~~~~~~~~~~~~~~~~t~~~~~~~l~~~~~~~~~~~~~~~--~~v~n~~~~~~l~~~k~~~~~~~~~-~~-~~ 333 (419) T protein:vir:94 261 ---GILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFPPD--GVVVHPQDWESIELDQAPGSGVFRV-IA-NV 333 (419) T ss_pred ---ceecccccccccccccccccccchhHHHHHHHHHhhhhccCCCC--EEEEcHHHHHHHHHHhhcCCCceee-cC-Cc Confidence 00000001111111222334455678999999999988777543 5699999999986532211122111 11 12 Q ss_pred ccceEEEEeccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccchh---- Q lcl|Aclame:pro 228 INGFVLSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKK---- 303 (402) Q Consensus 228 ~~G~V~~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~---- 303 (402) .++...+++|+||+.|+++|... -+-+||+...- ++. -.+++.+..+... T Consensus 334 ~~~~~~~l~G~pV~~~~~~~~~~-----------------~~~gd~~~~~~-~~~--------~~~~~v~~~~~~~~~~~ 387 (419) T protein:vir:94 334 QGEATPRIWGLNVVSTVAIAQGT-----------------ALVGGFRQGAT-LWS--------RQGITVLMTDSHADFFT 387 (419) T ss_pred ccCCCccccceeeEEcCCCCCcc-----------------EEEeeccceEE-EEE--------ecceEEEEeccccchhh Confidence 24556789999999999998532 12245544222 222 1233444433221 Q ss_pred HHHHHHHHHHHhcCcccccceEEEEEEeeccC Q lcl|Aclame:pro 304 EKTYYIDTFMAEGAIPDRWEAVSVVTTKRDAT 335 (402) Q Consensus 304 ~~~d~i~~~~a~Ga~vlRPeaa~vv~~~~~~t 335 (402) +-...+++..-+|.++++|++.+.++++.-+| T Consensus 388 ~~~~~~r~~~r~d~~v~~~~a~~~~~~~aa~~ 419 (419) T protein:vir:94 388 ANTLVILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) T ss_pred cCcEEEEEEEeeccEEeccccEEEEEeccCCC Confidence 12234566777999999999999888876665 No 96 >protein:vir:4600 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058445;genbank:gi:9635171;genbank:GeneID:1262708 Probab=99.16 E-value=2e-12 Score=84.72 Aligned_cols=299 Identities=9% Similarity=0.031 Sum_probs=159.4 Q ss_pred CCCCccccccccccc-ccHHHHHH-HHHhHHHHHHHHHHhhhcccceeeeccccc-eEEee-eccceeeeeecCCCCCCC Q lcl|Aclame:pro 1 
MSTPNTLTNVAVSAS-GEVDSLLI-EKFNGKVNEQYLKGENILSYFDVQTVTGTN-TVSNK-YLGETELQVLAPGQSPNA 76 (402) Q Consensus 1 Ms~~n~~t~~~~~~~-~d~~alfl-e~f~geV~t~f~~~sv~~~~~~~rti~~Gk-sv~f~-~iG~~t~~~~~~G~~i~~ 76 (402) +.+......-..... .+.-...| +.|.+++++.....+.+++++++.++.++. ++.+. ..+...+..+..|..+.. T Consensus 110 ~~~~~~~~~~~~~~~~t~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~ 189 (415) T protein:vir:46 110 TEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPE 189 (415) T ss_pred HHHHhhhhhhhhccccccCCcccccHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEEecCCcceeeccccccccc Confidence 000000000001111 11122233 899999999999999999999988887653 23222 345556667777766654 Q ss_pred -CCccccceeEeecceeeccchhhhHHHh-hcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccccccc Q lcl|Aclame:pro 77 -TPTQADKNQLVIDTTVIARNTVAHIHDV-QGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKG 154 (402) Q Consensus 77 -~~~~~~e~~itID~~lya~~~IddlDe~-q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~g 154 (402) ..+..++.++..-. ++....-.-+=. ++.+| +.+.+.+++++++++..|+.++.-.-.+. +. + T Consensus 190 ~~~~~~~~v~~~~~k--~~~~~~iS~ell~ds~~~-l~~~i~~~l~~~i~~~~d~~il~g~g~g~---------~~---~ 254 (415) T protein:vir:46 190 LAVKPFFQLAYDINT--HRGYFRISREAIEDAKVN-VLQELKLWMARTIAATRNKAIIDVITKGS---------TG---S 254 (415) T ss_pred ccccceeeEEeeeee--eEeeehhhHHHHhhchHH-HHHHHHHHHHHHHHHHHHHHHhhccccCC---------cc---c Confidence 34555655555544 333322222112 23456 78899999999999999998864332110 00 0 Q ss_pred cccccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccccCcccccceEEE Q lcl|Aclame:pro 155 HGFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATINGFVLS 234 (402) Q Consensus 155 ~~~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~~~~G~V~~ 234 (402) .+..... .......+....|+.|.++...+...+... . .+|++|..|..|.+-.. .|..|-. 
.....+|.-.+ T Consensus 255 ~~~~~~~--~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~-~-~~v~n~~~~~~L~~lkd-~~G~~i~--~~~~~~~~~~~ 327 (415) T protein:vir:46 255 TSSGFEK--EGKKLEVKKAKSLDDIKDAINLNVKPNYEH-N-VAIVSQTMFAKLDKMKD-KLGNYLI--QPDVKEKTQQR 327 (415) T ss_pred ccccccc--ccceeccccccchHHHHHHHHhhhhhccCC-C-EEEEcHHHHHHHHHhhc-cCCCeee--ccCcCCCCCcc Confidence 1111000 011111122223677788888877766542 2 45899999999854110 1111110 11123455578 Q ss_pred EeccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccchhHHHHHHHHHHH Q lcl|Aclame:pro 235 SYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKEKTYYIDTFMA 314 (402) Q Consensus 235 iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~~~~d~i~~~~a 314 (402) ++|+||+.++++|.+..+. ..=+-+||++...+ + .-.+++.+..+. ......+.+.+- T Consensus 328 l~G~pV~~~~~~~~~~~~~------------~~~~~gd~~~~~~~-~--------~~~~~~v~~~~~-~~~~~~~~~~~r 385 (415) T protein:vir:46 328 LLGAKIEILPDEVLGQKGN------------NTLIIGNLKDAIVL-F--------DRSQYQASWTDY-MHFGECLMIAVR 385 (415) T ss_pred ccceeeEEeccccccCCCc------------cEEEEEehhccEEE-E--------eecceEEEeecc-ccCceEEEEEEE Confidence 9999999999998543211 11134555543221 1 222333333322 222234567777 Q ss_pred hcCcccccceEEEEEEee-ccCccccccch Q lcl|Aclame:pro 315 EGAIPDRWEAVSVVTTKR-DATTGDAGGPG 343 (402) Q Consensus 315 ~Ga~vlRPeaa~vv~~~~-~~t~~~a~~~~ 343 (402) ++.++++|++.+.++++. 
..-+|+-+-.+ T Consensus 386 ~d~~v~~~~a~~~~~~~~~~~~~~~~~~~~ 415 (415) T protein:vir:46 386 QDCRILDYKSAIVIEYDDSERGEGDLGLEA 415 (415) T ss_pred eccEEeccccEEEEEeeccCCCCCCccCCC Confidence 999999999998887764 33334443222 No 97 >protein:vir:4700 Length: 415 # NCBI annotation: phi PVL ORF 7 homologue # Family: family:all:21 # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061632;genbank:gi:9635719;genbank:GeneID:1262976 Probab=99.16 E-value=2e-12 Score=84.72 Aligned_cols=299 Identities=9% Similarity=0.031 Sum_probs=159.4 Q ss_pred CCCCccccccccccc-ccHHHHHH-HHHhHHHHHHHHHHhhhcccceeeeccccc-eEEee-eccceeeeeecCCCCCCC Q lcl|Aclame:pro 1 MSTPNTLTNVAVSAS-GEVDSLLI-EKFNGKVNEQYLKGENILSYFDVQTVTGTN-TVSNK-YLGETELQVLAPGQSPNA 76 (402) Q Consensus 1 Ms~~n~~t~~~~~~~-~d~~alfl-e~f~geV~t~f~~~sv~~~~~~~rti~~Gk-sv~f~-~iG~~t~~~~~~G~~i~~ 76 (402) +.+......-..... .+.-...| +.|.+++++.....+.+++++++.++.++. ++.+. ..+...+..+..|..+.. T Consensus 110 ~~~~~~~~~~~~~~~~t~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~ 189 (415) T protein:vir:47 110 TEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPE 189 (415) T ss_pred HHHHhhhhhhhhccccccCCcccccHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEEecCCcceeeccccccccc Confidence 000000000001111 11122233 899999999999999999999988887653 23222 345556667777766654 Q ss_pred -CCccccceeEeecceeeccchhhhHHHh-hcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccccccc Q lcl|Aclame:pro 77 -TPTQADKNQLVIDTTVIARNTVAHIHDV-QGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKG 154 (402) Q Consensus 77 -~~~~~~e~~itID~~lya~~~IddlDe~-q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~g 154 (402) ..+..++.++..-. ++....-.-+=. ++.+| +.+.+.+++++++++..|+.++.-.-.+. +. 
+ T Consensus 190 ~~~~~~~~v~~~~~k--~~~~~~iS~ell~ds~~~-l~~~i~~~l~~~i~~~~d~~il~g~g~g~---------~~---~ 254 (415) T protein:vir:47 190 LAVKPFFQLAYDINT--HRGYFRISREAIEDAKVN-VLQELKLWMARTIAATRNKAIIDVITKGS---------TG---S 254 (415) T ss_pred ccccceeeEEeeeee--eEeeehhhHHHHhhchHH-HHHHHHHHHHHHHHHHHHHHHhhccccCC---------cc---c Confidence 34555655555544 333322222112 23456 78899999999999999998864332110 00 0 Q ss_pred cccccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccccCcccccceEEE Q lcl|Aclame:pro 155 HGFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATINGFVLS 234 (402) Q Consensus 155 ~~~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~~~~G~V~~ 234 (402) .+..... .......+....|+.|.++...+...+... . .+|++|..|..|.+-.. .|..|-. .....+|.-.+ T Consensus 255 ~~~~~~~--~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~-~-~~v~n~~~~~~L~~lkd-~~G~~i~--~~~~~~~~~~~ 327 (415) T protein:vir:47 255 TSSGFEK--EGKKLEVKKAKSLDDIKDAINLNVKPNYEH-N-VAIVSQTMFAKLDKMKD-KLGNYLI--QPDVKEKTQQR 327 (415) T ss_pred ccccccc--ccceeccccccchHHHHHHHHhhhhhccCC-C-EEEEcHHHHHHHHHhhc-cCCCeee--ccCcCCCCCcc Confidence 1111000 011111122223677788888877766542 2 45899999999854110 1111110 11123455578 Q ss_pred EeccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccchhHHHHHHHHHHH Q lcl|Aclame:pro 235 SYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKEKTYYIDTFMA 314 (402) Q Consensus 235 iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~~~~d~i~~~~a 314 (402) ++|+||+.++++|.+..+. ..=+-+||++...+ + .-.+++.+..+. 
......+.+.+- T Consensus 328 l~G~pV~~~~~~~~~~~~~------------~~~~~gd~~~~~~~-~--------~~~~~~v~~~~~-~~~~~~~~~~~r 385 (415) T protein:vir:47 328 LLGAKIEILPDEVLGQKGN------------NTLIIGNLKDAIVL-F--------DRSQYQASWTDY-MHFGECLMIAVR 385 (415) T ss_pred ccceeeEEeccccccCCCc------------cEEEEEehhccEEE-E--------eecceEEEeecc-ccCceEEEEEEE Confidence 9999999999998543211 11134555543221 1 222333333322 222234567777 Q ss_pred hcCcccccceEEEEEEee-ccCccccccch Q lcl|Aclame:pro 315 EGAIPDRWEAVSVVTTKR-DATTGDAGGPG 343 (402) Q Consensus 315 ~Ga~vlRPeaa~vv~~~~-~~t~~~a~~~~ 343 (402) ++.++++|++.+.++++. ..-+|+-+-.+ T Consensus 386 ~d~~v~~~~a~~~~~~~~~~~~~~~~~~~~ 415 (415) T protein:vir:47 386 QDCRILDYKSAIVIEYDDSERGEGDLGLEA 415 (415) T ss_pred eccEEeccccEEEEEeeccCCCCCCccCCC Confidence 999999999998887764 33334443222 No 98 >protein:vir:8102 Length: 543 # NCBI annotation: gp6 # Family: family:all:21 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817683;genbank:gi:29566114;genbank:GeneID:1259308 Probab=99.14 E-value=2e-12 Score=84.76 Aligned_cols=287 Identities=11% Similarity=0.046 Sum_probs=150.4 Q ss_pred CCCCccccccccccc-ccHHHHHHHHHhHHHH-HHHHHHhhhcccceeeeccccceEEeee-ccceeeeeecCCCCCCCC Q lcl|Aclame:pro 1 MSTPNTLTNVAVSAS-GEVDSLLIEKFNGKVN-EQYLKGENILSYFDVQTVTGTNTVSNKY-LGETELQVLAPGQSPNAT 77 (402) Q Consensus 1 Ms~~n~~t~~~~~~~-~d~~alfle~f~geV~-t~f~~~sv~~~~~~~rti~~Gksv~f~~-iG~~t~~~~~~G~~i~~~ 77 (402) +...+. + ..+.+ ++--.+.-+.|..+++ ..+...++++.+.++... +|+ +.+|+ .|...+..+.-|..+... 
T Consensus 243 ~~~~~~--~-~~~~t~~~gg~lip~~~~~~ii~~~~~~~~~l~~~~~~~~~-~g~-~~~~~~~~~~~a~~v~Eg~~~~~~ 317 (543) T protein:vir:81 243 RAINEV--R-AMGLTKADGGYLVPFQLDPTVIITSNGSLNDIRRFARQVVA-TGD-VWHGVSSAAVQWSWDAEFEEVSDD 317 (543) T ss_pred hhhhhh--h-hcccccccCcccCchhhhhHHHHHHHhhhchhhhhcccccC-Ccc-eEEEEecCCcceeecccCcccccc Confidence 111100 0 00000 1111233367777765 556667888888776543 344 44554 577777788888888777 Q ss_pred CccccceeEeeccee-eccchhhhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccccccccc Q lcl|Aclame:pro 78 PTQADKNQLVIDTTV-IARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKGHG 156 (402) Q Consensus 78 ~~~~~e~~itID~~l-ya~~~IddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~g~~ 156 (402) .+...+.++.+..+- +..+.-.-++ ...| +.+.+.+.+++++++..|+.++ .|.... ..-.|+. T Consensus 318 ~~~~~~i~~~~~k~~~~~~is~ell~---d~~~-~~~~i~~~l~~~~~~~~d~ail----~G~Gt~-------~~p~Gi~ 382 (543) T protein:vir:81 318 SPEFGQPEIPVKKAQGFVPISIEALQ---DEAN-VTETVALLFAEGKDELEAVTLT----TGTGQG-------NQPTGIV 382 (543) T ss_pred ccccceeeeeeeeeEeeehhhHHHHh---ccHH-HHHHHHHHHHHHHHHHHHHHHh----ccCCCC-------cccccch Confidence 777777777766654 2222233333 3346 7889999999999999999885 111100 0000110 Q ss_pred cccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhc--cc--chhhcccccccCcccccceE Q lcl|Aclame:pro 157 FSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRD--AD--RIVDKTYTISQSGATINGFV 232 (402) Q Consensus 157 ~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~--~~--r~~n~d~~~~~~g~~~~G~V 232 (402) ..........+...+....++.+.++...|...+-+. -.+|++|..|..|.+ |. 
+++..+ ..+|.- T Consensus 383 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~--~~~v~n~~~~~~l~~lkd~~G~~l~~~--------~~~g~~ 452 (543) T protein:vir:81 383 TALAGTAAEIAPVTAETFALADVYAVYEQLAARHRRQ--GAWLANNLIYNKIRQFDTQGGAGLWTT--------IGNGEP 452 (543) T ss_pred hhcccccccccccccccccHHHHHHHHHhhhccccCC--cEEEEcHHHHHHHHHhhcCCCceeccC--------cCCCCC Confidence 0000000011111122234677777777776655442 245899999999964 21 222111 123444 Q ss_pred EEEeccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceee----c--cchhHHH Q lcl|Aclame:pro 233 LSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDI----F--YEKKEKT 306 (402) Q Consensus 233 ~~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~----~--~d~~~~~ 306 (402) .+++|.||+.++++|....... ..|...=+-+||++.. ++-+ .++.++. + ++..+.. T Consensus 453 ~~l~G~pv~~~~~~~~~~~~~~-------~~~~~~i~~gd~~~~~--i~~~--------~~~~i~~~~~~~~~~~~~~~~ 515 (543) T protein:vir:81 453 SQLLGRPVGEAEAMDANWNTSA-------SADNFVLLYGNFQNYV--IADR--------IGMTVEFIPHLFGTNRRPNGS 515 (543) T ss_pred ccccceeeEEeccccccccccc-------cCCcceEEEeecccee--EEee--------cccEEEEeccccccchhhcCc Confidence 6789999999999996442111 1121112346665421 1111 1222221 1 1111222 Q ss_pred HHHHHHHHhcCcccccceEEEEEEeecc Q lcl|Aclame:pro 307 YYIDTFMAEGAIPDRWEAVSVVTTKRDA 334 (402) Q Consensus 307 d~i~~~~a~Ga~vlRPeaa~vv~~~~~~ 334 (402) ..+.+++-+|.++++|++.+.++.+-.+ T Consensus 516 ~~~~~~~r~d~~v~~~~A~~~l~~~~~a 543 (543) T protein:vir:81 516 RGWFAYYRMGADVVNPNAFRLLNVETAS 543 (543) T ss_pred eEEEEEEeeccEeecccceEEEEecccC Confidence 2345555679999999998777665444 No 99 >protein:vir:99920 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655524;genbank:gi:109392294;genbank:GeneID:4157089 Probab=99.13 E-value=8.8e-12 Score=81.20 Aligned_cols=296 Identities=11% Similarity=-0.019 Sum_probs=149.2 Q ss_pred CCCCcccccccccccccHHHHHHHHHhHHHHHHHHHHhhhcccceeeeccccceEEeeec-cceeeeeecCCCCCCCCCc Q lcl|Aclame:pro 1 
MSTPNTLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYL-GETELQVLAPGQSPNATPT 79 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~alfle~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~~i-G~~t~~~~~~G~~i~~~~~ 79 (402) |++..+.... +.=++|+.++++.....++++.+.++.++.+ +..+||++ |..++..+.-|+++....+ T Consensus 1 Mat~tt~~g~----------~vP~~~~~~ii~~~~~~s~l~~~~~~i~~~~-~~~~~p~~~~~~~a~wv~Eg~~~~~~~~ 69 (311) T protein:vir:99 1 MATFGTGNLK----------NLPRNIADGMVKDVVQGSTVAVLSARKPQRF-GNEDIITFNGRPKAEFVGEGQQKSSTTG 69 (311) T ss_pred CceecCCCce----------eccHHHHHHHHHHHHhhchhhhhcceeeccC-CceEEEEEeCCceeEEeecCcccccccc Confidence 8865422211 1126888999999999999999988777665 44678876 7888888888888887777 Q ss_pred cccceeEeecceeeccc-hhhhHHHhh-----cCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccccccc Q lcl|Aclame:pro 80 QADKNQLVIDTTVIARN-TVAHIHDVQ-----GDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVK 153 (402) Q Consensus 80 ~~~e~~itID~~lya~~-~IddlDe~q-----~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~ 153 (402) ...+.++.. .+++.. .|. +|.+ +..| +.+.+.+++++++++++|+.++.-.- .. .+.... T Consensus 70 ~f~~v~l~~--~k~~~~~~iS--~ell~~~~d~~~~-l~~~i~~~la~ai~~~~d~~~l~G~g----~~-----~g~~~~ 135 (311) T protein:vir:99 70 EFDFVTSTP--KKAQVTMRFN--EEVQWADEDYQLG-VLQTLSEAGAEALARALDLGLYHRIN----PL-----TGTVIP 135 (311) T ss_pred eeeEEEEee--EEEEEeehhh--HHHhhcccccHHH-HHHHHHHHHHHHHHHHHHHHhhcccC----cc-----cCcccc Confidence 777766655 333332 222 2222 2344 67889999999999999998863211 00 000000 Q ss_pred ccccccccccCCccc-cccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccccCcccccceE Q lcl|Aclame:pro 154 GHGFSINVNVTESEA-LANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATINGFV 232 (402) Q Consensus 154 g~~~~~~v~~~~a~~-~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~~~~G~V 232 (402) +............+. ..+...+++-|..+...+...+....--..+++|..+..|.+-.. .|..|-- .+...++.. 
T Consensus 136 g~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd-~~G~~l~--~~~~~~~~~ 212 (311) T protein:vir:99 136 GWSNYLGAASKRVELTADTIANPDLAIEAAVGLLVANGHPTPVNGLALHPSIAWGLSTARY-TDGRKKF--PELGLGIGV 212 (311) T ss_pred ccccccccccceeeccccccchhHHHHHHHHHHHhhhccCCCccEEEEcHHHHHHHHhhhc-cCCCeee--cCcccCCCC Confidence 000000000000011 111222333344444444333332211125889999999864111 0111110 111123456 Q ss_pred EEEeccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccchhHH-----HH Q lcl|Aclame:pro 233 LSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKEK-----TY 307 (402) Q Consensus 233 ~~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~~~-----~d 307 (402) +++.|+||+.|+++|........... .......+-+-+||++.+-+...+ -+.+....+.+...+ .| T Consensus 213 ~~l~G~Pv~~s~~i~~~~~~~~~~~~-~~~~~~~~~~~Gdf~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~d 284 (311) T protein:vir:99 213 SSFEGIDASVSDTVNGGDEADPDDED-LDAARAVRGIVGDFANGIHWGVQR-------DIPVELIKYGDPDGQGDLKRHN 284 (311) T ss_pred ceecceeeEeecccccccccccccch-hhccCcceEEEeeccccEEEEEec-------CceEEEeecCCCCcchhhhhcC Confidence 78999999999999853322111110 111111223446665533221111 111111111111111 12 Q ss_pred H--HHHHHHhcCcccccceEEEEEEeeccCcccc Q lcl|Aclame:pro 308 Y--IDTFMAEGAIPDRWEAVSVVTTKRDATTGDA 339 (402) Q Consensus 308 ~--i~~~~a~Ga~vlRPeaa~vv~~~~~~t~~~a 339 (402) . +++..-+|..+++|+++.. 
+..+ | T Consensus 285 ~~~~r~~~r~d~~v~~~~~v~~---~~~~----A 311 (311) T protein:vir:99 285 QIALRLEIVYGWYVFTDRFVVI---ENAV----A 311 (311) T ss_pred cEEEEEEEeecceecChhHeee---eccc----C Confidence 1 3455667888888865432 2222 1 No 100 >protein:vir:102119 Length: 404 # NCBI annotation: phage major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699941;genbank:gi:110804052;genbank:GeneID:4206662 Probab=99.13 E-value=2.4e-12 Score=84.25 Aligned_cols=291 Identities=9% Similarity=0.009 Sum_probs=152.2 Q ss_pred CCCCcccccccccccccHHH-HHHHHHhHHHHHHHHHHhhhcccceeeecccc-ceEEeee-ccceeeeeecCCCCCCCC Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDS-LLIEKFNGKVNEQYLKGENILSYFDVQTVTGT-NTVSNKY-LGETELQVLAPGQSPNAT 77 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~a-lfle~f~geV~t~f~~~sv~~~~~~~rti~~G-ksv~f~~-iG~~t~~~~~~G~~i~~~ 77 (402) +.....-.+....+++..-. +.=+.|.++++......+.++++.++.++.++ .++.+++ .+...+..+..|+.+..+ T Consensus 100 ~~~~~~e~~a~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~g~~~~~~~~~~~~~~~v~e~~~~~~~ 179 (404) T protein:vir:10 100 LNLSEKEINAISENIDEDGGYAVPEDIQTKINTRLKDTTDLYNMVDYEPVFTRSGSRTYEKRSKQKPMKPLSENQQIPTN 179 (404) T ss_pred hcchhhHHhhhccccCCCCceeechhHHHHHHHHHhhhhhHhhhhceeeccCCccceEEEEecCCcceeecccccccccc Confidence 10000000110011111111 22278889999999999999999999988642 3555654 577788888888776554 Q ss_pred --CccccceeEeecceeeccch-h--hhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccccc Q lcl|Aclame:pro 78 --PTQADKNQLVIDTTVIARNT-V--AHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRV 152 (402) Q Consensus 78 --~~~~~e~~itID~~lya~~~-I--ddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~ 152 (402) .+..++.++...++ ..+. | .-+++ +.++ +.+.+.+++++++++..|+.|+. +..... ++. 
T Consensus 180 ~~~~~f~~i~~~~~k~--~~~~~iS~ell~d--s~~~-l~~~i~~~la~~~~~~~~~~il~----G~g~~~-----~~~- 244 (404) T protein:vir:10 180 GDNGKLERFNFKLKDL--ADFMSIPNDLLKF--ADKS-LEDWIINWFVDKVRITRNAEILY----GAGGDE-----HAT- 244 (404) T ss_pred ccccceeeeEeeheee--EeeehhhHHHHhh--cHHH-HHHHHHHHHHHHHHHHHHHHHhh----cCCCCC-----ccc- Confidence 34555555555543 3322 2 22222 3345 67889999999999999997752 111111 000 Q ss_pred cccccccccccCCccccccHHHHHHHHHHHHH-HHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccccCcccccce Q lcl|Aclame:pro 153 KGHGFSINVNVTESEALANPQYVMAAVEYALE-QQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATINGF 231 (402) Q Consensus 153 ~g~~~~~~v~~~~a~~~~~~~~l~dai~~a~~-~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~~~~G~ 231 (402) |.... .+..+...+....++.+..+.. .|....-+ .. .+|++|..|..|.+-.. .+..|-- .....+|. T Consensus 245 -gi~~~----~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~-~~-~~v~n~~~~~~L~~lkd-~~G~~l~--~~~~~~~~ 314 (404) T protein:vir:10 245 -GIMTA----NKFKKITLPKSPALKDFKKCKNVELLNVFKA-TS-SWIVNQDGFNYLDSLED-KTGRPYL--QPDPKDPT 314 (404) T ss_pred -ceeec----cccceeeccccccHHHHHHHHHhhhhccccC-CC-EEEEcHHHHHHHHHhhc-cCCceee--ccCcCCCC Confidence 11100 1111111222223555555443 34433322 33 45899999998865211 1122211 11123455 Q ss_pred EEEEeccEEEec-CccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccc----hhHHH Q lcl|Aclame:pro 232 VLSSYNCPVIPS-NRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYE----KKEKT 306 (402) Q Consensus 232 V~~iaG~~V~~S-NnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d----~~~~~ 306 (402) ..+++|.||+.+ +.+|.... +...=+-++|+...- ...-.+++.+..++ -.+-. 
T Consensus 315 ~~~l~G~PV~~~~~~~~~~~~------------~~~~~~~gd~s~~~~---------~~~~~~~~i~~~~~~~~~~~~~~ 373 (404) T protein:vir:10 315 QYRFLGLPVIELPNDLLLSTE------------SAIPVLLGDTKEAYK---------YVSDGAYELATTNIGAGAFETNT 373 (404) T ss_pred CccccceeeEEecccccCCCC------------CccEEEEEeccccEE---------EEEecceEEEEeccccchhhcCc Confidence 568999999854 44443211 100012344443211 11222333333222 12233 Q ss_pred HHHHHHHHhcCcccccceEEEEEEeeccCcc Q lcl|Aclame:pro 307 YYIDTFMAEGAIPDRWEAVSVVTTKRDATTG 337 (402) Q Consensus 307 d~i~~~~a~Ga~vlRPeaa~vv~~~~~~t~~ 337 (402) ..+.+.+-||.+++||++.+.++.+..+.|+ T Consensus 374 ~~~~~~~r~d~~v~~~~a~~~~~~~~aa~~~ 404 (404) T protein:vir:10 374 TKARIIMRIDGNVKDSEALLIAEIPVESVQA 404 (404) T ss_pred eEEEEEEeeccEEecccceEEEEeecccCCC Confidence 4577888899999999999998887776666 No 101 >protein:vir:4856 Length: 293 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049396;genbank:gi:9632424;genbank:GeneID:1258532 Probab=99.12 E-value=5.8e-12 Score=82.17 Aligned_cols=277 Identities=11% Similarity=-0.030 Sum_probs=156.2 Q ss_pred CCCCcccccccccccccHHHHHH-HHHhHHHHHHHHHHhhhcccceeeeccc-cceEEeeecc--ceeeeeecCCCCCCC Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDSLLI-EKFNGKVNEQYLKGENILSYFDVQTVTG-TNTVSNKYLG--ETELQVLAPGQSPNA 76 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~alfl-e~f~geV~t~f~~~sv~~~~~~~rti~~-Gksv~f~~iG--~~t~~~~~~G~~i~~ 76 (402) |+.-+ ++.-...| ++|..++++.....+.++++.++.++.+ ..+..|+... ...+..+.-|+++.. 
T Consensus 5 ~~~~t----------~~~gg~liP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~a~~v~Eg~~~~~ 74 (293) T protein:vir:48 5 KTDHS----------GSDAGLTIPQDIRTAINTLVRQYDSLQEYVNVENVTTLTGSRVYEKWTDITGLANIDDEAGKIAD 74 (293) T ss_pred ecccc----------cCcCceEechhHHHHHHHHHHhhhhhhhhceeeeccCCcceEEEEeecCCCcceeeecCCccccc Confidence 33222 12222333 8899999999999999999998888764 3456676543 345566766777754 Q ss_pred -CCccccceeEeecceeeccchhh--hHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccccccc Q lcl|Aclame:pro 77 -TPTQADKNQLVIDTTVIARNTVA--HIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVK 153 (402) Q Consensus 77 -~~~~~~e~~itID~~lya~~~Id--dlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~ 153 (402) ..++..+.++....+-. ...|. -+++ +.+| +.+.+.+++++++++..|+.|+..+-+. T Consensus 75 ~~~~~~~~i~l~~~k~~~-~~~iS~ell~d--s~~~-l~~~i~~~la~~~~~~~~~~i~~g~~~~--------------- 135 (293) T protein:vir:48 75 IDDPKLSLIKYTIKRYAG-ISTVTNSLLAD--SAEN-ILAWLSGWIAKKVVVTRNKAILGVVDKL--------------- 135 (293) T ss_pred ccccceeEEEEeeeEEEE-eehhhHHHHhh--hhHH-HHHHHHHHHHHHHHHHHHhHHhhccccc--------------- Confidence 45667777776655432 12222 2222 3455 6889999999999999999887432110 Q ss_pred ccccccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccccCcccccceEE Q lcl|Aclame:pro 154 GHGFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATINGFVL 233 (402) Q Consensus 154 g~~~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~~~~G~V~ 233 (402) . ......+ |+.|.++..+|+..+.+. . ..+++|..|..|.+-..- +..|- -.....+|.-. 
T Consensus 136 ---~-------~~~~~~~----~d~i~~~~~~l~~~~~~~-a-~~vmn~~~~~~L~~lkd~-~g~~l--~~~~~~~~~~~ 196 (293) T protein:vir:48 136 ---P-------TKPTLTK----WDDIIDLEAKVDPAIKQT-S-FFLTNTSGFTALKKVKNA-LGDYL--MERDVKSPTGY 196 (293) T ss_pred ---c-------ccccccC----HHHHHHHHHhhhhhhcCC-C-EEEEcHHHHHHHHHhhcc-CCceE--eecCcCCCCCc Confidence 0 0111112 566777777777665542 3 458899999998541111 11110 01112345567 Q ss_pred EEeccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccch-hHH---HHHH Q lcl|Aclame:pro 234 SSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEK-KEK---TYYI 309 (402) Q Consensus 234 ~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~-~~~---~d~i 309 (402) +++|.||+.+.+.+..... .+...=+-+||++...++ ...++..+..+.. +.| ...+ T Consensus 197 ~l~G~Pv~~~~~~~~~~~~----------~~~~~~~~gd~~~~~~~~---------~~~~~~i~~~~~~~~~~~~~~~~~ 257 (293) T protein:vir:48 197 SIAGFAVKEISDRWLPNAS----------SGVMPLYFGDLKQAVTLF---------DRQQMSLLSTNIGGGAFETDTTKV 257 (293) T ss_pred eecceeeEEecccccCCcc----------CCceEEEEEeccceEEEE---------EecceEEEEecccchhhhcCeEEE Confidence 8999999876554422110 011111234444432211 1223333333221 122 2334 Q ss_pred HHHHHhcCcccccceEEEEEEeeccCccccc-cchh Q lcl|Aclame:pro 310 DTFMAEGAIPDRWEAVSVVTTKRDATTGDAG-GPGD 344 (402) Q Consensus 310 ~~~~a~Ga~vlRPeaa~vv~~~~~~t~~~a~-~~~~ 344 (402) ++.+-+|.++++|++.+.++++..++++... +.|. 
T Consensus 258 r~~~r~d~~~~~~~a~~~l~~~~~~~~~~~~~~~~~ 293 (293) T protein:vir:48 258 RVIDRFDVVATDTEAFVPASFKAIADQKGNIGSTAV 293 (293) T ss_pred EEEEeeCcEEecccceEEEEeeccccCCccccccCC Confidence 5556689999999999999887655554444 4433 No 102 >protein:vir:95763 Length: 297 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950590;genbank:gi:119953785;genbank:GeneID:5076833 Probab=99.11 E-value=3.5e-12 Score=83.42 Aligned_cols=279 Identities=9% Similarity=0.002 Sum_probs=157.6 Q ss_pred CCCCcccccccccccccHHHHHHHHHhHHHHHHHHHHhhhcccceeeeccccceEEeeec-cceeeeeecCCCCCCCCCc Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYL-GETELQVLAPGQSPNATPT 79 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~alfle~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~~i-G~~t~~~~~~G~~i~~~~~ 79 (402) |....- ...+.-.+.+.-.+.-++|..++.+.....+.++.+.++..+.++....+++. +...+..+.-|+.+....+ T Consensus 1 m~~~~~-~~~~~~~t~~~~~lvP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~ 79 (297) T protein:vir:95 1 MTVQTF-NPENVLVSQKKDGTLHKEFTDIIMKEVAQNSLVMQLGQYQEMEGEQEKTVYVQTDGISAYWVNETEKIKTDKP 79 (297) T ss_pred CCcccc-ccccccccCCCcceechhHHHHHHHHHHhhchhhhhcceeecCCCccEEEEEEcCCceeEEeecCcccccccc Confidence 665521 00111111111224459999999999999999999998888876666667744 6678888888888887777 Q ss_pred cccceeEeecceeeccchhhhHHHhh-cCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccccccccccc Q lcl|Aclame:pro 80 QADKNQLVIDTTVIARNTVAHIHDVQ-GDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKGHGFS 158 (402) Q Consensus 80 ~~~e~~itID~~lya~~~IddlDe~q-~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~g~~~~ 158 (402) ...+.++..-.+ .....|.+ +-.+ +..| +.+.+.+++++++++..|+.++. +.....+ .+.... 
T Consensus 80 ~f~~v~l~~~k~-~~~~~is~-ell~ds~~~-l~~~i~~~la~ai~~~~d~a~l~----G~g~~~~--------~gi~~~ 144 (297) T protein:vir:95 80 EVVPVTLKAHKL-GIILVTSR-EALNYTWKK-FFEDMKPQIVEAFYKKIDEAGLL----GHDTPFA--------NSVAKA 144 (297) T ss_pred ceeEEEEeeEEE-EEeehhhH-HHHhcCHHH-HHHHHHHHHHHHHHHHHHHHHhc----ccCCccc--------cccccc Confidence 777777766542 22233332 2222 3455 78899999999999999998862 1111100 011110 Q ss_pred cccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccccCcccccceEEEEecc Q lcl|Aclame:pro 159 INVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATINGFVLSSYNC 238 (402) Q Consensus 159 ~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~~~~G~V~~iaG~ 238 (402) ... ........--|+.|.++..+|...+.+.. ..+++|..|..|.+ +.+.+ +.. +.++..+++.|+ T Consensus 145 --~~~--~~~~~~~~~t~~~i~~~~~~l~~~~~~~~--~~v~~~~~~~~L~~---l~d~~----G~~-i~~~~~~~l~G~ 210 (297) T protein:vir:95 145 --AKD--ANKVIGGPINYDNILKLQDALYDADVEPN--AFVSKIQNRSALRE---ARDGN----KVS-IYDKAANTIDGI 210 (297) T ss_pred --ccc--cceecccccCHHHHHHHHHHhhhccCCcC--EEEEcHHHHHHHHH---hhccC----Cce-eecCCCCcccce Confidence 000 00000011126778888888888776643 46889999999874 22111 111 123344578999 Q ss_pred EEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccchh--------------H Q lcl|Aclame:pro 239 PVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKK--------------E 304 (402) Q Consensus 239 ~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~--------------~ 304 (402) ||+.+++.+...+ . =+.+||++.. +. .-.++..+..++.. . 
T Consensus 211 Pv~~~~~~~~~~~----~-----------~~~gd~s~~~--~~--------~~~~~~i~~~~~~~~~~~~~~~~~~~~~~ 265 (297) T protein:vir:95 211 TTVDLKSARFEKG----D-----------LLAGDFDNLI--YG--------VPYNITYKISEEGQISTITNADGTPINLF 265 (297) T ss_pred eeEeecCCCCCCc----e-----------EEEEecccEE--EE--------EecCeEEEEeeccccccccccCccchhhh Confidence 9998876553221 1 1234554421 11 11222233322211 1 Q ss_pred H--HHHHHHHHHhcCcccccceEEEEEEeecc Q lcl|Aclame:pro 305 K--TYYIDTFMAEGAIPDRWEAVSVVTTKRDA 334 (402) Q Consensus 305 ~--~d~i~~~~a~Ga~vlRPeaa~vv~~~~~~ 334 (402) + .-.+++..-+|.++++|++.+.|+..-.+ T Consensus 266 ~~~~~~~r~~~~~d~~v~~~~a~~~l~~at~~ 297 (297) T protein:vir:95 266 EQEMIAIRATMDIAVMITKTDAFAKLTPAERV 297 (297) T ss_pred hcCcEEEEEEEEeccEeecccceEEEeecCCC Confidence 1 12234455789999999998877532222 No 103 >protein:vir:81227 Length: 413 # NCBI annotation: gp6, major capsid protein # Family: family:all:585 # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456736;genbank:gi:157168379;hssp:P49861;interpro:IPR006444;uniprot:Q9MBJ9;genbank:GeneID:5580350 Probab=99.11 E-value=4.9e-12 Score=82.58 Aligned_cols=283 Identities=13% Similarity=0.051 Sum_probs=154.7 Q ss_pred CCCCcccccccccccccHHHHHHHHHhHHHHHHHHHHhhhcccceeeeccccceEEeeeccc-----eeeeeecCCCCCC Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYLGE-----TELQVLAPGQSPN 75 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~alfle~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~~iG~-----~t~~~~~~G~~i~ 75 (402) |..... ..+.+++...+.-+.|+.+++......+.+++++++.++.+ .++.+++... ..+..+.-|+.+. 
T Consensus 113 ~~~~~~----~~~~~~~~~~~vp~~~~~~ii~~~~~~~~l~~~~~~~~~~~-~~~~~~~~~~~~~~~~~a~~v~Eg~~~~ 187 (413) T protein:vir:81 113 ASDPAS----TATLTDEFQGGYGTTWNRNIIYRRREKLVVADLMDNLTMTN-TTIKYLMEKANRVVEGGFKTVAEGGKKP 187 (413) T ss_pred hhhhhh----hcccccccccccchhhHHHHHHHHhhhhhHHhhcceeeccC-CceeEEEeccccccccccceecCccccc Confidence 111110 01111233344558899999999999999999999888765 4566665422 2345566666664 Q ss_pred CCC-ccccceeEeeccee----eccchhhhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccc Q lcl|Aclame:pro 76 ATP-TQADKNQLVIDTTV----IARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKP 150 (402) Q Consensus 76 ~~~-~~~~e~~itID~~l----ya~~~IddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~ 150 (402) ... ....+.++.+..+- +++.. |++.. + +.+.+.+++++++++..|+.++. +.-...+ + T Consensus 188 ~~~~~~f~~i~~~~~k~~~~~~iS~el---l~ds~---~-l~~~i~~~la~~~~~~~d~~~l~----G~G~~~~-----~ 251 (413) T protein:vir:81 188 YMRFADFDIVTESLSKIAGLTKITDEM---IEDYD---F-LVSYINARLLEELAIEEERQLLL----GDGTGNN-----L 251 (413) T ss_pred ccCcccceeeEeeeeeEEEeehhhHHH---HHHHH---H-HHHHHHHHHHHHHHHHHHHHHhc----cCCCCCc-----c Confidence 433 45666666666542 22222 22221 2 56788888899999999998752 1111100 0 Q ss_pred cccccccccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhc--c--cchhhcccccccCcc Q lcl|Aclame:pro 151 RVKGHGFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRD--A--DRIVDKTYTISQSGA 226 (402) Q Consensus 151 ~~~g~~~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~--~--~r~~n~d~~~~~~g~ 226 (402) .|.. ...........+...+++.+.++...+..++.-.... +|++|..|..|.+ | .|++..+......+. 
T Consensus 252 --~Gi~---~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~-~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~ 325 (413) T protein:vir:81 252 --TGLL---KRDGIQTLAVSNKDELADSIYKAMTNISLATPFQADA-LVINPLDYQELRLAKDANGQYYGGGVFQGQYGS 325 (413) T ss_pred --cccc---cccccccccccccchhHHHHHHHHHHhhhhccCCCcE-EEEcHHHHHHHHHhhccCCceeccccccccccc Confidence 0110 0000111112233456777777777766554433334 4889999998853 3 333322211111111 Q ss_pred cccceEEEEeccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccchh-HH Q lcl|Aclame:pro 227 TINGFVLSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKK-EK 305 (402) Q Consensus 227 ~~~G~V~~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~-~~ 305 (402) -..+...+++|.||+.|+++|.+. -+-+||++.. +++.+ .+++.+..+... .| T Consensus 326 ~~~~~~~~l~G~pv~~s~~~~~~~-----------------~~~gd~~~~~-~~~~~--------~~~~v~~~~~~~~~~ 379 (413) T protein:vir:81 326 GGIMLDPAPWGLRTVQSQVVPVGK-----------------PVVGAFRSAA-SVLRK--------GGVRIDSTNTNVDDF 379 (413) T ss_pred cccccCceecceeeEEcCCCCccc-----------------EEEEecccEE-EEEEe--------cceEEEEeccccchh Confidence 112234578999999999998532 1235555421 22222 223334433221 12 Q ss_pred -H--HHHHHHHHhcCcccccceEEEEEEeeccCc Q lcl|Aclame:pro 306 -T--YYIDTFMAEGAIPDRWEAVSVVTTKRDATT 336 (402) Q Consensus 306 -~--d~i~~~~a~Ga~vlRPeaa~vv~~~~~~t~ 336 (402) . 
-.+++.+-|+..+.+|++.+.++.+..++| T Consensus 380 ~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~~~~p 413 (413) T protein:vir:81 380 ENNLITVRAEERVGLMVTFPEAIVQLDVAEVVTP 413 (413) T ss_pred hcCcEEEEEEEeeccEEecccceEEEEecCCCCC Confidence 1 244455668999999999999887666666 No 104 >protein:vir:6212 Length: 434 # NCBI annotation: prohead protease # Family: family:all:21 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852592;genbank:gi:31415852;genbank:GeneID:1489210 Probab=99.08 E-value=7.1e-12 Score=81.70 Aligned_cols=294 Identities=11% Similarity=0.001 Sum_probs=147.3 Q ss_pred CCCCcccccccc-cccccHHHHHH-HHHhHHHHHHHHHHhhhcccceeeeccccceEEeeec-cceeeeee---cCCCCC Q lcl|Aclame:pro 1 MSTPNTLTNVAV-SASGEVDSLLI-EKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYL-GETELQVL---APGQSP 74 (402) Q Consensus 1 Ms~~n~~t~~~~-~~~~d~~alfl-e~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~~i-G~~t~~~~---~~G~~i 74 (402) |.......+... +...++-...| +.|+.+|+......++++.+.++.... | .++||+. +..++... ..|..+ T Consensus 131 l~~~~~~~e~~a~~~~t~~GG~lvP~~~~~~Ii~~l~~~~~i~~~~~~~~~~-~-~~~~p~~~~~~~a~~~~~~~e~~~~ 208 (434) T protein:vir:62 131 IVGNIDEKEARALGLVTGNGSVTIPDFLSKEIITYAQEENFLRRLGTGVKTK-E-NIKYPVLVKKAEAQGHKNERTNNEM 208 (434) T ss_pred hccccchhhhhhhcccccccceecchhhHHHHHHhhhhhhhhhhhcceeccC-C-ceEEEEEecCCcccceecccccccc Confidence 111100111100 10111112234 889999999999999999998876543 3 4677765 33333332 223344 Q ss_pred CCCCccccceeEeecceeeccchhhhHHHhh-cCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccccccc Q lcl|Aclame:pro 75 NATPTQADKNQLVIDTTVIARNTVAHIHDVQ-GDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVK 153 (402) Q Consensus 75 ~~~~~~~~e~~itID~~lya~~~IddlDe~q-~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~ 153 (402) +...+..++.++.+- +++....-.-+=.+ +.+| +.+.+.++++++|++..|+.++. +.....+ .. 
T Consensus 209 ~~~~~~f~~v~~~~~--k~~~~~~iS~ell~ds~~~-l~~~i~~~la~~~~~~~d~~~l~----G~G~~~~-------~~ 274 (434) T protein:vir:62 209 PETDIEFDEIELSPT--EFDALATVTKKLLARTGLP-IEQIVMDELKKAYVRKETQYMVN----GDEANNI-------ND 274 (434) T ss_pred cccccceeeEEeehe--eeEeehhhHHHHHhcchHH-HHHHHHHHHHHHHHHHHHHHHhc----cCCCCcc-------cc Confidence 444455555555544 34443322221111 3456 78899999999999999988762 2111111 00 Q ss_pred ccccccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhc--ccchhhcccccccCcccccce Q lcl|Aclame:pro 154 GHGFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRD--ADRIVDKTYTISQSGATINGF 231 (402) Q Consensus 154 g~~~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~--~~r~~n~d~~~~~~g~~~~G~ 231 (402) |.. ...+.+...+....++.|+++...|+..+.+ ...| |++|..|..|.+ |. |..|--.......+|. T Consensus 275 g~~-----~~~~~~~~~~~~~~~d~l~~l~~~l~~~~~~-~a~~-v~n~~~~~~L~~lkd~---~G~~l~~~~~~~~~g~ 344 (434) T protein:vir:62 275 GAL-----AKKAVEFKTDEKNLYDALVKMKNTPVKEVRK-KARW-VLNTAALTKIETMKTD---DGFPLLRPFNQAEGGI 344 (434) T ss_pred cee-----ecccccccccccchhhHHHHHHhhcchhhhc-CCEE-EEcHHHHHHHHHhhcc---CCCEeeccCCCccCCC Confidence 111 1111112223334688888888888776554 3355 789999998853 32 1122110111112344 Q ss_pred EEEEeccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccchhHHHHHHHH Q lcl|Aclame:pro 232 VLSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKEKTYYIDT 311 (402) Q Consensus 232 V~~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~~~~d~i~~ 311 (402) -.+++|.||+.++++|....+.... =+-+||+... 
++.+...+ + +....+.|+....- .+.+ T Consensus 345 ~~tl~G~pV~~~~~~~~~~~~~~~~-----------i~~Gdfs~~~-i~~~~g~~-~---i~~~~~~~~~~~~v--~~~~ 406 (434) T protein:vir:62 345 GYTLLGFPVEEEDAIDIPDSPDTPV-----------FYFGDFSKFY-IQDVIGSL-E---VQKLVELFSRTNRV--GFRI 406 (434) T ss_pred CceecceeeEEecCccCccCCCceE-----------EEEeeccceE-EEEeecee-E---EEeehhhhcccCce--EEEE Confidence 4679999999999998643221111 1235665431 12111111 0 11111222211110 1223 Q ss_pred HHHhcCccc-ccceEEEEEEeeccCccc Q lcl|Aclame:pro 312 FMAEGAIPD-RWEAVSVVTTKRDATTGD 338 (402) Q Consensus 312 ~~a~Ga~vl-RPeaa~vv~~~~~~t~~~ 338 (402) +.=+..+++ +|++..+++.+....++. T Consensus 407 ~~r~Dgk~i~~~~~~~~~~~~~~~~~~~ 434 (434) T protein:vir:62 407 WNLLDAQLIHSPFEVPVYKYVLKAPTGA 434 (434) T ss_pred EeeecceeecCcccceEEEEEeccCCCC Confidence 333445544 599999988887554444 No 105 >protein:vir:485 Length: 407 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543092;swissprot:trembl:q8w627;genbank:gi:18249904;uniprot:Q8W627;genbank:GeneID:929693 Probab=99.07 E-value=4.7e-12 Score=82.69 Aligned_cols=296 Identities=13% Similarity=0.054 Sum_probs=152.7 Q ss_pred CCCCc--ccc----cccccccccHHHHHH-HHHhHHHHHHHHHHhhhcccceeeeccccceEEee-eccceeeeeecCCC Q lcl|Aclame:pro 1 MSTPN--TLT----NVAVSASGEVDSLLI-EKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNK-YLGETELQVLAPGQ 72 (402) Q Consensus 1 Ms~~n--~~t----~~~~~~~~d~~alfl-e~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~-~iG~~t~~~~~~G~ 72 (402) |-... 
.++ +....++..+-...| +.|..++++.....++++++.++.++.++ +..++ ..+..++....-|+ T Consensus 90 l~~g~~~~~~~~e~~a~~~~t~~~gG~~iP~~~~~~I~~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~a~~v~E~~ 168 (407) T protein:vir:48 90 MRKGREDGLRELERKALQVGNDEDGGYAIPEELDRTILTLLKDEVVMRQEATVITLGGS-DYKKLVNLGGTTSGWVGETD 168 (407) T ss_pred HhccchhhhhHHHHHhhhcccCCCCcccccHhHHHHHHHHHHhhhhhhhhceeeecCCC-ceEEEEecCCcceeeecccc Confidence 11000 000 000001111111233 88999999999999999999988887766 44454 45667777666666 Q ss_pred CCCCC-CccccceeEeecceeeccchhhhHHHh-hcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccc Q lcl|Aclame:pro 73 SPNAT-PTQADKNQLVIDTTVIARNTVAHIHDV-QGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKP 150 (402) Q Consensus 73 ~i~~~-~~~~~e~~itID~~lya~~~IddlDe~-q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~ 150 (402) .+... .+...+.++.+-. ++.+.--.-+-. ++.+| +.+.+.+++++++++..|+.++. +.....| T Consensus 169 ~~~~~~~~~f~~i~~~~~k--~~~~~~iS~ell~ds~~~-l~~~i~~~l~~~i~~~~~~a~l~----G~G~~~p------ 235 (407) T protein:vir:48 169 ARPETATSKLGLIEPFMGE--IYGNPQATQKMLDDAFFN-VEDWINSELALEFAEQEEIAFTS----GDGSKKP------ 235 (407) T ss_pred cccccccccceeEEeeeee--eEeehhhHHHHHhcchHH-HHHHHHHHHHHHHHHHHHhhhhc----cCCCCcc------ Confidence 66543 3455666665543 333322222111 13455 78899999999999999987752 1111111 Q ss_pred cccccccccccccC----------CccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhc--ccchhhcc Q lcl|Aclame:pro 151 RVKGHGFSINVNVT----------ESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRD--ADRIVDKT 218 (402) Q Consensus 151 ~~~g~~~~~~v~~~----------~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~--~~r~~n~d 218 (402) .|+......... ..........-++.|.++...|...+.+. ..| |++|..|..|.+ |.. .. 
T Consensus 236 --~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~i~~l~~~l~~~~~~~-a~~-v~n~~~~~~L~~lkD~~---Gr 308 (407) T protein:vir:48 236 --KGFLAYESTDEDDKTRAFGKLQHIASGAASGVTADAIIKLIYTLRKAHRSG-AKF-MMNNSSLFAIRLLKDND---GN 308 (407) T ss_pred --ceeeecccccccccccccccccccccccccccChHHHHHHHHhhchhhhcC-CEE-EEcHHHHHHHHHhhccC---Cc Confidence 000000000000 00000111112667777788887776652 334 799999998854 321 11 Q ss_pred cccccCcccccceEEEEeccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceee Q lcl|Aclame:pro 219 YTISQSGATINGFVLSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDI 298 (402) Q Consensus 219 ~~~~~~g~~~~G~V~~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~ 298 (402) |-- . ....+|...+++|.||+.++++|..+.+ ...=+-+||+...-+ +.+.. +.+..+. T Consensus 309 ~l~-~-~~~~~g~~~~l~G~PV~~~~~~p~~~~~------------~~~i~~Gd~~~~~~i-~~~~~------~~i~~d~ 367 (407) T protein:vir:48 309 YLW-R-PGIELGQPSSLAGYGIVENEQMPDIAAD------------AKAIAFGNFKRGYTI-VDRIG------TRILRDP 367 (407) T ss_pred eee-c-cCcCCCCCceecceeeEEecCcCCccCC------------ccEEEEEeccccEEE-EEeec------eEEEeec Confidence 100 1 1123455578999999999999964321 111122566542221 11111 1111111 Q ss_pred ccchhHHHHHHHHHHHhcCcccccceEEEEEEeeccCccccc Q lcl|Aclame:pro 299 FYEKKEKTYYIDTFMAEGAIPDRWEAVSVVTTKRDATTGDAG 340 (402) Q Consensus 299 ~~d~~~~~d~i~~~~a~Ga~vlRPeaa~vv~~~~~~t~~~a~ 340 (402) +. .+....+.+.+-||.++++|++.+.++.+..++...+. 
T Consensus 368 ~~--~~~~~~~~~~~r~d~~v~~~~a~~~l~~~aa~~~~~~~ 407 (407) T protein:vir:48 368 YT--NKPFVGFYTTKRTGGMLVDSQAIKLMKIGAATRQKAAA 407 (407) T ss_pred cc--cCCcEEEEEEEEeccEEecccceEEEEeeccCCCCCCC Confidence 11 11122345556689999999998877665444333222 No 106 >protein:vir:100247 Length: 425 # NCBI annotation: gp76 # Family: family:all:21 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355412;genbank:gi:77864702;genbank:GeneID:3725969 Probab=99.07 E-value=6.5e-12 Score=81.90 Aligned_cols=291 Identities=12% Similarity=0.072 Sum_probs=149.9 Q ss_pred CCCCcccccccccccccHHH-HHHHHHhHHHHHHHHHHhhhcccceeeeccccceEEee-eccceeeeeecCCCCCCCC- Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDS-LLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNK-YLGETELQVLAPGQSPNAT- 77 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~a-lfle~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~-~iG~~t~~~~~~G~~i~~~- 77 (402) +.... ..+.-..++..+-. +.=+.|..+++...+..+.++++.++.++.+++ .++| ..|..++....-|+.+... T Consensus 121 l~~~e-~~~al~~~t~~~gG~lvP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~-~~~~~~~~~~~a~wv~E~~~~~~~~ 198 (425) T protein:vir:10 121 VKRGD-VQAALNKGEDSEGGYLTPIEWDRTITNKLVLISPMRQLCRVQPVSKAG-FSKLFNMGGTTSGWVGEASQRPQTN 198 (425) T ss_pred hhhhh-hHHHhhcCcCCCCceeccHhHHHHHHHHHHhhhhhhhhceeeeccCCc-eEEEEEcCCcceeeecccccccccc Confidence 00000 00000001111111 333899999999999999999999988886654 4455 4566666666555555433 Q ss_pred CccccceeEeecceeeccchhhhHHHh-hcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccccccccc Q lcl|Aclame:pro 78 PTQADKNQLVIDTTVIARNTVAHIHDV-QGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKGHG 156 (402) Q Consensus 78 ~~~~~e~~itID~~lya~~~IddlDe~-q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~g~~ 156 (402) .+...+.++.. .++.....-.-+=. ++.+| +.+.+.+++++++++..|+.++. |.....|. |.. 
T Consensus 199 ~~~f~~v~~~~--~k~~~~i~iS~ell~ds~~~-l~~~i~~~la~ai~~~~d~~~l~----G~G~~~p~--------Gil 263 (425) T protein:vir:10 199 AATFQPLSFAS--GEIYANPAATQQILDDAEID-LESWLATEVQTEFAKQEGKAFLA----GDGTNKPN--------GLL 263 (425) T ss_pred ccccceeeeeh--eeeEeehHhHHHHHhcchhH-HHHHHHHHHHHHHHHHHHhhhhc----ccCCCCcc--------eee Confidence 34455555544 33333322222212 23466 78899999999999999997752 11111100 000 Q ss_pred ccccccc----------CCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccccCcc Q lcl|Aclame:pro 157 FSINVNV----------TESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGA 226 (402) Q Consensus 157 ~~~~v~~----------~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~ 226 (402) ....... ............++.|+++...|+..+.. .. ..|++|..|..|.+-.. .+..|- -. .. T Consensus 264 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~l~~l~~~l~~~~~~-~a-~~vmn~~~~~~L~~lkD-~~G~~l-~~-~~ 338 (425) T protein:vir:10 264 TYIAGGANAAKHPFGAIEVVNSGAAADITSDGIIDLVYDLPSAFTG-NA-RFAMNRNTQRQVRKLKD-GQGNYL-WQ-PS 338 (425) T ss_pred eccccccccccccccccccccccccccccHHHHHHHHhhhhhhhcc-CC-EEEEchHHHHHHHHhhc-CCCcee-ec-cC Confidence 0000000 00001111223467777777777765543 22 34899999999864111 001110 00 11 Q ss_pred cccceEEEEeccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccchhHHH Q lcl|Aclame:pro 227 TINGFVLSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKEKT 306 (402) Q Consensus 227 ~~~G~V~~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~~~~ 306 (402) ..+|.-++++|.||+.++++|....+.. .=+-+||++...+ +.+.. +.+..+.+.. +.. 
T Consensus 339 ~~~g~~~~l~G~PV~~~~~~p~~~~~~~------------~i~~Gd~~~~~~i-~~~~~------~~v~~d~~~~--~~~ 397 (425) T protein:vir:10 339 YVAGQPATLAGYPVTEVPDMPDVAANST------------PILFGDFQQTYLI-IDRIG------VRVLRDPYTA--KPY 397 (425) T ss_pred ccCCCCceecceeeEEecCcCCccCCcc------------EEEEEehhccEEE-EEecc------eEEEeccccc--CCc Confidence 2345557899999999999996432211 0122566553222 22211 1112222221 112 Q ss_pred HHHHHHHHhcCcccccceEEEEEEeecc Q lcl|Aclame:pro 307 YYIDTFMAEGAIPDRWEAVSVVTTKRDA 334 (402) Q Consensus 307 d~i~~~~a~Ga~vlRPeaa~vv~~~~~~ 334 (402) ..+.+..-|+.++++|++...++.+... T Consensus 398 ~~~~~~~r~d~~v~~~~A~~~l~~~as~ 425 (425) T protein:vir:10 398 VLFYTTKRVGGGLLNPEPMRAMKVAASE 425 (425) T ss_pred EEEEEEEEeccEeecccceEEEEeeccC Confidence 2344555699999999988777665433 No 107 >protein:vir:2344 Length: 397 # NCBI annotation: gp14 # Family: family:all:507 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075281;genbank:gi:12657868;genbank:GeneID:920118 Probab=99.06 E-value=1e-11 Score=80.86 Aligned_cols=342 Identities=12% Similarity=0.001 Sum_probs=175.5 Q ss_pred CCCCcccccccccccccHHHHHHHHHhHHHHHHHHHHhhhcccceeeeccccceEEeeec-cceeeeeecCCCCCCCCCc Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYL-GETELQVLAPGQSPNATPT 79 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~alfle~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~~i-G~~t~~~~~~G~~i~~~~~ 79 (402) |.+.-..+.-....+.+.-.+..+++..++++...+.+.++++.++.++.+ .+.+||+. 
+...+..+.-|+.+....+ T Consensus 1 ~g~~~e~~~~~~~~t~~~~g~l~~~~~~~ii~~l~~~s~i~~l~~~~~~~~-~~~~ip~~~~~~~a~wv~Eg~~~~~s~~ 79 (397) T protein:vir:23 1 MGFSADHSQIAQTKDTMFTGYLDPVQAKDYFAEAEKTSIVQRVAQKIPMGA-TGIVIPHWTGDVSAQWIGEGDMKPITKG 79 (397) T ss_pred CCcCHHHHHHhhccCCCCccccchhHHHHHHHHHHhccchhhhcceeeccC-CceEEEEEcCCcceEEecCCcccccccc Confidence 887744333333322222234557778888999899999999988877664 55778865 6677777777888877777 Q ss_pred cccceeEeecceeeccchhhhHHHh-hcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccccccccccc Q lcl|Aclame:pro 80 QADKNQLVIDTTVIARNTVAHIHDV-QGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKGHGFS 158 (402) Q Consensus 80 ~~~e~~itID~~lya~~~IddlDe~-q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~g~~~~ 158 (402) ...+.++.+-. +.....-.-+-. ++.+| +.+++.+++++++++++|+.++.-- -. +....+ T Consensus 80 ~f~~v~l~~~k--~~~~v~iS~ell~ds~~~-l~~~i~~~l~~aia~~~d~a~l~G~----gt--------~~~~~~--- 141 (397) T protein:vir:23 80 NMTKRDVHPAK--IATIFVASAETVRANPAN-YLGTMRTKVATAIAMAFDNAALHGT----NA--------PSAFQG--- 141 (397) T ss_pred ceeEEEEeeEE--EEEeehhhHHHHhcchHH-HHHHHHHHHHHHHHHHHHHHHhhcc----cC--------Cccccc--- Confidence 77777766643 333322222112 24466 7899999999999999999886211 00 000000 Q ss_pred cccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcc----cchhhcccccccCcccccceEEE Q lcl|Aclame:pro 159 INVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDA----DRIVDKTYTISQSGATINGFVLS 234 (402) Q Consensus 159 ~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~----~r~~n~d~~~~~~g~~~~G~V~~ 234 (402) ................++.+.++..+|.+.+.+. -..+++|..|..|.+- .|.+-.. 
...++....+..++ T Consensus 142 -~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~--a~~vmn~~~~~~L~~lkd~~G~~i~~~--~~~~~~~~~~~~~t 216 (397) T protein:vir:23 142 -YLDQSNKTQSISPNAYQGLGVSGLTKLVTDGKKW--THTLLDDTVEPVLNGSVDANGRPLFVE--STYESLTTPFREGR 216 (397) T ss_pred -ccccccceeeecccchhHHHHHHHHhhhhcccCC--CEEEEcHHHHHHHHHhhccCCceeecc--cccccccccccCce Confidence 0111111112222334566777777777766542 3469999999999752 2222110 11222222334468 Q ss_pred EeccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccchh----------- Q lcl|Aclame:pro 235 SYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKK----------- 303 (402) Q Consensus 235 iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~----------- 303 (402) +.|+||+.|+++|..... -+.+||++.. +..+ .++..+..++.- T Consensus 217 l~G~Pv~~s~~~~~g~~~---------------~~~gDfs~~~--i~~~--------~~i~i~~~~e~~~~~~~~~~~~~ 271 (397) T protein:vir:23 217 ILGRPTILSDHVAEGDVV---------------GYAGDFSQII--WGQV--------GGLSFDVTDQATLNLGSQESPNF 271 (397) T ss_pred eeeeeEEEeCCCCCCceE---------------EEEeecceEE--EEEE--------eceEEEEeeeeeeeeccccccce Confidence 999999999999853210 1345555432 1111 111112111110 Q ss_pred ---HH--HHHHHHHHHhcCcccccceEEEEEEeeccCccccccchhhHHHhhhcccceEEEeecchhhhhhhhcccccch Q lcl|Aclame:pro 304 ---EK--TYYIDTFMAEGAIPDRWEAVSVVTTKRDATTGDAGGPGDDHATVLARAQRKAVYVKTEGAAAAFSAAPAGIQA 378 (402) Q Consensus 304 ---~~--~d~i~~~~a~Ga~vlRPeaa~vv~~~~~~t~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 378 (402) .+ .-.+++.+-++.+++||++.+.++.......-+...+... . -+-|.++ ++. +++. 
-+-...+ T Consensus 272 ~~lf~~d~v~~ra~~r~d~~v~~~~a~~~~~~~~~~~~~~~~~~~~~--~----~~~~~~~-~~~--~~~~--~~~~a~~ 340 (397) T protein:vir:23 272 VSLWQHNLVAVRVEAEYGLLINDVNAFVKLTFDPVLTTYALDLDGAS--A----GNFTLSL-DGK--TSAN--IAYNAST 340 (397) T ss_pred eeeeeccceeEEEEeeeccceecccceEEEeeccccceeeecccccC--c----ceEEEEe-cCc--cccC--cccccch Confidence 11 1234555668999999999888776433222221111000 0 1111111 110 0000 0011234 Q ss_pred hHHHHHHHHHH----hhcccccc-cCCCC Q lcl|Aclame:pro 379 EDLVAAVRAVM----ANDIKPTA-MKPTE 402 (402) Q Consensus 379 ~~~~~~~~~~~----~~~~~~~~-~~~~~ 402 (402) .++-+|+.++- +.++.-|. --|-. T Consensus 341 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 369 (397) T protein:vir:23 341 ATVKSAIVAIDDGVSADDVTVTGSAGDYT 369 (397) T ss_pred hhhHHHhhhcccccccceeeeecCCceeE Confidence 44555555442 12222222 00111 No 108 >protein:vir:80376 Length: 435 # NCBI annotation: gp6, major capsid head protein # Family: family:all:21 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111085;genbank:gi:134288639;genbank:GeneID:4960624 Probab=99.05 E-value=4e-11 Score=77.60 Aligned_cols=296 Identities=13% Similarity=0.094 Sum_probs=153.3 Q ss_pred CCC-----------------CcccccccccccccHHHHHH-HHHhHHHHHHHHHHhhhccc-ceeeeccccceEEeeec- Q lcl|Aclame:pro 1 MST-----------------PNTLTNVAVSASGEVDSLLI-EKFNGKVNEQYLKGENILSY-FDVQTVTGTNTVSNKYL- 60 (402) Q Consensus 1 Ms~-----------------~n~~t~~~~~~~~d~~alfl-e~f~geV~t~f~~~sv~~~~-~~~rti~~Gksv~f~~i- 60 (402) |.. ..........+++..-...+ +.+..++++..+..++++.+ .++.+...| .+.+|+. 
T Consensus 105 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~lvP~~~~~~ii~~l~~~~~i~~~~~~~v~~~~~-~~~~p~~~ 183 (435) T protein:vir:80 105 LAAARGDAQLASKLAIERGFGEEVAMSLNTLSPGAGGVLVPENLSSEVIELLRPKSVVRKLGARTLPLSNG-NITIPRLK 183 (435) T ss_pred HHhccchhHHHHHHHHhhhhhhhhhhhhcccCCCCCccccchhHHHHHHHHHhhhchhhhccceeeecCCC-ceEEEEEe Confidence 000 00000000000111111223 77888898888888888876 344444444 4778766 Q ss_pred cceeeeeecCCCCCCCCCccccceeEeeccee-eccchhhhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhh Q lcl|Aclame:pro 61 GETELQVLAPGQSPNATPTQADKNQLVIDTTV-IARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGG 139 (402) Q Consensus 61 G~~t~~~~~~G~~i~~~~~~~~e~~itID~~l-ya~~~IddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA 139 (402) |...+..+.-|+.+....+...+.++.+..+- +..+.-.-|++.....+ +.+.+.+++++++++..|++++. +. T Consensus 184 ~~~~a~~v~E~~~~~~~~~~f~~i~~~~~k~~~~~~is~ell~ds~~~~~-l~~~i~~~l~~a~~~~~d~a~l~----G~ 258 (435) T protein:vir:80 184 GGAIVGYIGADTDIPTTQQQFDDLKLTAKKMAALVPIANDLIKYAGVNPN-VDQIVVGDLTAAIGAREDKAFIR----DD 258 (435) T ss_pred CCcceeeeccCccccccccceeeEEEeeEEEEEeehhhHHHHHhhcccHH-HHHHHHHHHHHHHHHHHHHHhhc----cC Confidence 66677677777777666677777776665543 22222233445444556 68889999999999999998852 11 Q ss_pred hhcccccccccccccccccccc-ccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccchhhcc Q lcl|Aclame:pro 140 IANTKAERNKPRVKGHGFSINV-NVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKT 218 (402) Q Consensus 140 ~~~a~~~~~~~~~~g~~~~~~v-~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d 218 (402) .... .| .|....... .........+...++..+.++...|...+.....-..|++|..|..|.+-.. .|.. 
T Consensus 259 G~~~-----~p--~Gi~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd-~~G~ 330 (435) T protein:vir:80 259 GTAN-----TP--KGLRFWALPGNVITASDGSTLQKIETDLGKAILALENADANLTQPGWIMAPRTFRFLEGLRD-GNGN 330 (435) T ss_pred CCCC-----cc--cceeecccccceeecccccchhhHHHHHHHHHHHhhccccccccCEEEEcHHHHHHHHhhhc-cCCc Confidence 1000 00 011110000 0011112223334455566776777666655444445899999988854110 1111 Q ss_pred cccccCcccccceEEEEeccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceee Q lcl|Aclame:pro 219 YTISQSGATINGFVLSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDI 298 (402) Q Consensus 219 ~~~~~~g~~~~G~V~~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~ 298 (402) |--. ...+| +++|+||+.|+++|...+..+ ....=+-+||+..+ ++- -.++..+. T Consensus 331 ~l~~---~~~~~---~l~G~pv~~~~~~p~~~~~~~---------~~~~i~~gd~s~~~--i~~--------~~~~~i~~ 385 (435) T protein:vir:80 331 KVYP---ELANG---MLKGYPVGKTTQVPINLGEAG---------KESEIYFTDFGDVF--IGE--------EETLEIDY 385 (435) T ss_pred eecc---CCCCC---eEeeeeeEEeccccccccCCC---------CcceEEEEEcccEE--EEe--------ecceEEEE Confidence 1100 11122 689999999999996332110 11112346666532 221 12233333 Q ss_pred ccchh-------------HHHHHHHHHHHhcCcccccceEEEEEEeeccCccc Q lcl|Aclame:pro 299 FYEKK-------------EKTYYIDTFMAEGAIPDRWEAVSVVTTKRDATTGD 338 (402) Q Consensus 299 ~~d~~-------------~~~d~i~~~~a~Ga~vlRPeaa~vv~~~~~~t~~~ 338 (402) .++.. 
+-...+++.+-|+.++.||++.+.|+--.- |+ T Consensus 386 ~~~~~~~~~~~~~~~~f~~n~~~~r~~~r~d~~~~~~~a~~~l~~~~~---~~ 435 (435) T protein:vir:80 386 SKEATYKDADGHMVSAFQRDQTLIRVIAKNDFGPRHVESIAVLSGVAW---GA 435 (435) T ss_pred eccccccccccchhhhhhcCcceeeeeeeeCcEeecccceEEEeccCC---CC Confidence 32221 112456788889999999998877643221 11 No 109 >protein:vir:104256 Length: 458 # NCBI annotation: major head protein precursor # Family: family:all:27070 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006977;genbank:gi:46401878;genbank:GeneID:2777673 Probab=99.05 E-value=1.1e-11 Score=80.68 Aligned_cols=292 Identities=10% Similarity=-0.026 Sum_probs=144.1 Q ss_pred CCCCcccccccccccccHHHHHHHHHhHHHHHHHHHHhhhcccceeeeccccceEEee-eccceeeeeecCCCCCCCCC- Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNK-YLGETELQVLAPGQSPNATP- 78 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~alfle~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~-~iG~~t~~~~~~G~~i~~~~- 78 (402) +...............+...+.-..|..+++......++++++.++.++.++ ...++ ..+...+....-|.....+. T Consensus 153 ~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~a~~v~e~~~~~~~~~ 231 (458) T protein:vir:10 153 HGQRHLKAVNQSSSVEVSSESYETIFSQRIIRDLQKELVVGALFEELPMSSK-ILTMLVEPDAGKATWVAASTYGTDTTT 231 (458) T ss_pred hhhhhhhhhhhcccCccccceehhhHhHHHHHHHHhhhhHHhhcceeecCCc-ceEEEEecCCcceeecccccccccccc Confidence 0000000000000011222355688999999999999999999988887765 44455 44556666666565554332 Q ss_pred -----ccccceeEeecceeeccchhhhHHHhh-cCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccccc Q lcl|Aclame:pro 79 -----TQADKNQLVIDTTVIARNTVAHIHDVQ-GDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRV 152 (402) Q Consensus 79 -----~~~~e~~itID~~lya~~~IddlDe~q-~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~ 152 (402) +...++ ++...+++.+..-.-+=.. +.++ +.+.+.++++++|++..|+.++. +.....|...-. . 
T Consensus 232 ~~~~~~~~~~i--~~~~~k~~~~v~is~ell~ds~~~-~~~~i~~~l~~~i~~~~d~~~l~----G~G~~~p~Gi~~--~ 302 (458) T protein:vir:10 232 GEEVKGALKEI--HFSTYKLAAKSFITDETEEDAIFS-LLPLLRKRLIEAHAVSIEEAFMT----GDGSGKPKGLLT--L 302 (458) T ss_pred cccccccceee--EeeeeeEEeeehhhHHHHhcchHH-HHHHHHHHHHHHHHHHHHHHhhc----CCCCCccceeee--c Confidence 223333 4444444443222222122 2355 78899999999999999998852 111111100000 0 Q ss_pred cccccccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhc--c--cchhhcccccccCcccc Q lcl|Aclame:pro 153 KGHGFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRD--A--DRIVDKTYTISQSGATI 228 (402) Q Consensus 153 ~g~~~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~--~--~r~~n~d~~~~~~g~~~ 228 (402) .+.......... +.......-|+.|.++...|...+.. ... .|++|..|..|.+ | .+.+..... . .... T Consensus 303 ~~~~~~~~~~~~--~~~~~~~~~~~~i~~~~~~l~~~~~~-~~~-~v~~~~~~~~l~~lkd~~G~~i~~~~~--~-~~~~ 375 (458) T protein:vir:10 303 ASEDSAKVVTEA--KADGSVLVTAKTISKLRRKLGRHGLK-LSK-LVLIVSMDAYYDLLEDEEWQDVAQVGN--D-SVKL 375 (458) T ss_pred ccccccceeecc--cccccccccHHHHHHHHHhhhhhhcC-CCE-EEEcHHHHHHHHhhcccCCceeecccc--c-cccc Confidence 000000000000 00001111267788888888877654 233 4899999988753 2 233211110 1 1123 Q ss_pred cceEEEEeccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccce--eeccchhHHH Q lcl|Aclame:pro 229 NGFVLSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTG--DIFYEKKEKT 306 (402) Q Consensus 229 ~G~V~~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~--e~~~d~~~~~ 306 (402) +|...+++|.||+.++++|..++.. . -+-++|..-. +++.+ .++++ +.|.. ... 
T Consensus 376 ~~~~~~l~G~pv~~~~~~p~~~~~~-~------------~~~~~f~~~~-~~~~~--------~~~~v~~d~~~~--~~~ 431 (458) T protein:vir:10 376 QGQVGRIYGLPVVVSEYFPAKANSA-E------------FAVIVYKDNF-VMPRQ--------RAVTVERERQAG--KQR 431 (458) T ss_pred cCcCceecceeeEEccccccccCCc-c------------eEEEEecccE-EEEEe--------eceEEEeecccC--CCc Confidence 4556689999999999999643211 1 1223443211 11111 12222 22211 111 Q ss_pred HHHHHHHHhcCcccccceEEEEEEeeccCccc Q lcl|Aclame:pro 307 YYIDTFMAEGAIPDRWEAVSVVTTKRDATTGD 338 (402) Q Consensus 307 d~i~~~~a~Ga~vlRPeaa~vv~~~~~~t~~~ 338 (402) ..+.+..-+|-.+.||++.+..++- +| T Consensus 432 ~~~~~~~r~~~~v~~~~a~v~~~~a-----a~ 458 (458) T protein:vir:10 432 DAYYVTQRVNLQRYFANGVVSGTYA-----AS 458 (458) T ss_pred eEEEEEEEecceEecccceEEEeec-----cC Confidence 1233444578889999987654332 22 No 110 >protein:vir:2430 Length: 318 # NCBI annotation: major head subunit # Family: family:all:507 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046832;genbank:gi:9630400;genbank:GeneID:1261582 Probab=99.05 E-value=2.2e-11 Score=79.00 Aligned_cols=290 Identities=10% Similarity=-0.027 Sum_probs=150.5 Q ss_pred CCCCcccccccccccccHHHHHHHHHhHHHHHHHHHHhhhcccceeeeccccceEEeee-ccceeeeeecCCCCCCCCCc Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKY-LGETELQVLAPGQSPNATPT 79 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~alfle~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~~-iG~~t~~~~~~G~~i~~~~~ 79 (402) +..+....-. 
..++.+.-.+.-+++..+|++..++.++++++.++.++.+ .+.+||+ .+...++.+.-|+++....+ T Consensus 6 ~~~~e~~~~~-~~~~~~~~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~-~~~~ip~~~~~~~a~~v~Eg~~~~~~~~ 83 (318) T protein:vir:24 6 AFAVDHAQIA-QTGDTMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKVPMGT-TGQKIPHWVGDVSAQWIGEGDMKPITKG 83 (318) T ss_pred CCCHHHHHhh-cccCcccceeechhHHHHHHHHHHhhchhhhhcceeeccC-CceEEEEEeCCcceEEecCCcccccccc Confidence 1111100000 0001111123448899999999999999999998877764 5577875 46778888888888887777 Q ss_pred cccceeEeecceeeccchhhhHHHhh-cCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccccccccccc Q lcl|Aclame:pro 80 QADKNQLVIDTTVIARNTVAHIHDVQ-GDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKGHGFS 158 (402) Q Consensus 80 ~~~e~~itID~~lya~~~IddlDe~q-~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~g~~~~ 158 (402) ..++.++..-. +.....-.-+-.+ +.+| +.+.+.+++++++++++|+.++.- .-... +......... T Consensus 84 ~f~~i~~~~~k--~~~~~~iS~e~l~ds~~~-~~~~i~~~l~~~~~~~~d~a~l~G----~g~~~-----~~~~~~~~~~ 151 (318) T protein:vir:24 84 NMTSQTIAPHK--IATIFVASAETVRANPAN-YLGTMRTKVATAFAMAFDGAAMHG----TDSPF-----PTYIGQTTKA 151 (318) T ss_pred ceeEEEEeeEE--EEEeehhhHHHhhcChHH-HHHHHHHHHHHHHHHHHHHhhhcc----cCCCC-----Cccccccccc Confidence 77776655544 3332221112212 3456 788999999999999999988521 11100 0000000001 Q ss_pred cccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhc--cc--chh-hcccccccCcccccceEE Q lcl|Aclame:pro 159 INVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRD--AD--RIV-DKTYTISQSGATINGFVL 233 (402) Q Consensus 159 ~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~--~~--r~~-n~d~~~~~~g~~~~G~V~ 233 (402) ........ ......+.+.++...+...+.+ .-..+++|..|..|.+ |. +.+ ..+.. ++......-. 
T Consensus 152 ~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~--~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~---~~~~~~~~~~ 222 (318) T protein:vir:24 152 ISIADTTG----ATTVYDQVAVNGLSLLVNDGKK--WTHTLLDDITEPILNGAKDQNGRPLFIESTY---GEAASPFRSG 222 (318) T ss_pred cccccccc----ccchHHHHHHHHHHhhccccCC--CCEEEEcHHHHHHHHHhhccCCceeecCccc---cCccccccCc Confidence 11111111 1112223344455555444433 3356999999999964 21 211 11111 1111112235 Q ss_pred EEeccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccch----------- Q lcl|Aclame:pro 234 SSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEK----------- 302 (402) Q Consensus 234 ~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~----------- 302 (402) .+.|++|+.++++|..... -+-+||++.. +.. ..++..+..++. T Consensus 223 ~i~g~pv~~~~~~~~~~~~---------------~~~gdfs~~~--~~~--------~~~l~i~~~~~~~~~~~~~~~~~ 277 (318) T protein:vir:24 223 RIVARPTILSDHVVEGTTV---------------GFMGDFSQLI--WGQ--------IGGLSFDVTDQATLNLGTVESPN 277 (318) T ss_pred eEEEEeeEEeCCCCCCccE---------------EEEeecceEE--EEE--------ecCeEEEEeeccceecccccccc Confidence 7899999999999853211 1234554421 111 112222222211 Q ss_pred -----hHHHHHHHHHHHhcCcccccceEEEEEEeeccCccccc Q lcl|Aclame:pro 303 -----KEKTYYIDTFMAEGAIPDRWEAVSVVTTKRDATTGDAG 340 (402) Q Consensus 303 -----~~~~d~i~~~~a~Ga~vlRPeaa~vv~~~~~~t~~~a~ 340 (402) .+=...+++.+-||.+++||++.+.|+... 
..++.+ T Consensus 278 ~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~i~~~~--a~~~~~ 318 (318) T protein:vir:24 278 FVSLWQHNLVAVRVEAEYAFHCNDAEAFVALTNVV--SGGGEG 318 (318) T ss_pred chhhhhcCcEEEEEEEEEccEEecccceEEEEeec--cCCCCC Confidence 111133466677999999999987765432 222222 No 111 >protein:vir:101607 Length: 379 # NCBI annotation: major capsid protein precursor # Family: family:all:585 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112497;genbank:gi:53793597;uniprot:Q5ZGF6;genbank:GeneID:3101715 Probab=99.03 E-value=1.1e-11 Score=80.70 Aligned_cols=274 Identities=8% Similarity=-0.025 Sum_probs=146.8 Q ss_pred CCCCcccccccccccccHHHHHHHHHhHHHHHHHHHHhhhcccceeeeccccceEEeeec-c--ceeeeeecCCCCCCCC Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYL-G--ETELQVLAPGQSPNAT 77 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~alfle~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~~i-G--~~t~~~~~~G~~i~~~ 77 (402) |-......+....-+++.-.+..+.|..+++....+.+.++++.++.++.+ .++.|++. | ........-|+.++.. T Consensus 98 ~~~~~~~~~~~~~~~~~~~~~ip~~~~~~ii~~~~~~~~i~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~v~Eg~~~~~~ 176 (379) T protein:vir:10 98 GKSIQVKAVGDMTLPVNLTGAQPKDYNFDVVLNPSQMLNVSDIVGAVSISG-GTYTFVRENGAGEGAIGAQVEGATKGQK 176 (379) T ss_pred hhhhhhhhhcccccCCCCccccchhhhhHHHHhHHhhhhHHhhceeeeccC-CceEEEEeecCCCcccccccCCcccccc Confidence 110100000000000111123458899999999999999999998888754 56788864 3 2333445566666655 Q ss_pred CccccceeEeeccee-eccchhhhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccccccccc Q lcl|Aclame:pro 78 PTQADKNQLVIDTTV-IARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKGHG 156 (402) Q Consensus 78 ~~~~~e~~itID~~l-ya~~~IddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~g~~ 156 (402) .+...+.++.+.++- +..+.-.-|++.. 
.+-+.+..++.+++++..|+.++.-+- T Consensus 177 ~~~f~~i~~~~~k~~~~~~iS~ell~D~~----~l~~~i~~~la~~~~~~~~~~~~~g~~-------------------- 232 (379) T protein:vir:10 177 DYDISMIDVNTDFIAGFTRYSKKMANNLP----FLTSFIPNALRRDYAKAENAAFNAVLA-------------------- 232 (379) T ss_pred ccceeeeEeeeeeEEeeehhhHHHHhhHH----HHHHHHHHHHHHHHHHHHHHHHhcccc-------------------- Confidence 666666666665533 1112222233332 255677788889999999987752210 Q ss_pred cccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccccCcccccceEEEEe Q lcl|Aclame:pro 157 FSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATINGFVLSSY 236 (402) Q Consensus 157 ~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~~~~G~V~~ia 236 (402) ........+.+. ...++.|.++..++...+.+.. .+|++|..|..|.+-.. .|..|-...+....+|.-.+++ T Consensus 233 ~~~~~~~~~~~~----~~~~d~i~~~~~~~~~~~~~~~--~~vmn~~~~~~l~~lkd-~~G~~l~~~~~~~~~~~~~~l~ 305 (379) T protein:vir:10 233 ANATASTEIITN----KNKVEMLINEIAKQENLDFPVT--AIVLRPTDYYDILVTQK-SVGAGYGLPGVVTQDNGVLRIN 305 (379) T ss_pred cccccccccccC----cccHHHHHHHHHhhhhccCCCC--EEEEcHHHHHHHHHhhc-cCCceeccCCccCCCCCcceec Confidence 000000011111 1124667777777776665533 35789999999864211 1112211111111234456899 Q ss_pred ccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccchh-HH---HHHHHHH Q lcl|Aclame:pro 237 NCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKK-EK---TYYIDTF 312 (402) Q Consensus 237 G~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~-~~---~d~i~~~ 312 (402) |+||+.|+.+|.+. -+-+||++. .+++. .+++.+..++.. .| ...+++. 
T Consensus 306 G~pvv~s~~~~ag~-----------------~~~gdf~~~-~~~~~---------~~~~i~~~~~~~~~f~~~~~~~r~~ 358 (379) T protein:vir:10 306 GIPLFRATWLAANK-----------------YYVGDWTRV-TKVTT---------EGLSLEFSEVEGTNFVKNNITARIE 358 (379) T ss_pred ceeeEecCCCCCCc-----------------eEEeecccE-EEEEE---------eceEEEEeecccccccCCcEEEEEE Confidence 99999999998421 134566653 22221 123333333321 12 2234455 Q ss_pred HHhcCcccccceEEEEEEeec Q lcl|Aclame:pro 313 MAEGAIPDRWEAVSVVTTKRD 333 (402) Q Consensus 313 ~a~Ga~vlRPeaa~vv~~~~~ 333 (402) .=+|..++||++.+.+++..= T Consensus 359 ~R~~~~v~~p~a~v~~~~~~~ 379 (379) T protein:vir:10 359 AQVALAVEQPAALIFGDFTAV 379 (379) T ss_pred EEeccEEecCccEEEEEecCC Confidence 568999999999877766433 No 112 >protein:vir:4997 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049971;genbank:gi:9632943;genbank:GeneID:1262106 Probab=99.03 E-value=2.7e-11 Score=78.51 Aligned_cols=283 Identities=13% Similarity=0.049 Sum_probs=153.0 Q ss_pred CCCC-cccccccccccccHHHHHH-HHHhHHHHHHHHHHhhhcccceeeeccccc-eEEeeeccc--eeeeeecCCCCCC Q lcl|Aclame:pro 1 MSTP-NTLTNVAVSASGEVDSLLI-EKFNGKVNEQYLKGENILSYFDVQTVTGTN-TVSNKYLGE--TELQVLAPGQSPN 75 (402) Q Consensus 1 Ms~~-n~~t~~~~~~~~d~~alfl-e~f~geV~t~f~~~sv~~~~~~~rti~~Gk-sv~f~~iG~--~t~~~~~~G~~i~ 75 (402) |-.. ....+-...+++..-...| +.|..+++......+.++++++++.+.++. ++.+++... ..+....-|..+. 
T Consensus 98 l~~~~~~~~~~~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~ 177 (397) T protein:vir:49 98 VRGRYQNLLDSKTDGSGSDAGLTIPQDIRTAINTLVRQFDSLQEYVNVENVTTLTGSRVYEKWADITGLAKLDDEGGQIG 177 (397) T ss_pred hhcchhhHHHhhhccCCccCcceecHHHHHHHHHHHHhhhhHhhhcceeeccCCcceEEEEeeccCCcceeeeccccccc Confidence 1000 0000000011111112233 889999999999999999999998887532 344554432 2344455566654 Q ss_pred C-CCccccceeEeecceeeccchhh--hHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccccc Q lcl|Aclame:pro 76 A-TPTQADKNQLVIDTTVIARNTVA--HIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRV 152 (402) Q Consensus 76 ~-~~~~~~e~~itID~~lya~~~Id--dlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~ 152 (402) . ..+...++++....+- .-..|. -+++ +.+| +.+.+.+++++++++..|+.++.-. T Consensus 178 ~~~~~~~~~v~~~~~k~~-~~~~iS~ell~d--s~~~-l~~~i~~~l~~~~~~~~d~ail~G~----------------- 236 (397) T protein:vir:49 178 QNDDPKLSLIRYAIKRYA-GISTVTNSLLAD--SAEN-ILAWLSGWIAKKVVVTRNKAILEAI----------------- 236 (397) T ss_pred cccccceeeeEeeeeeeE-eehhhHHHHHhh--hhHH-HHHHHHHHHHHHHHHHHHHHHHhcc----------------- Confidence 3 3355677777666542 222232 2322 4456 7889999999999999999875211 Q ss_pred cccccccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhc--ccchhhcccccccCcccccc Q lcl|Aclame:pro 153 KGHGFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRD--ADRIVDKTYTISQSGATING 230 (402) Q Consensus 153 ~g~~~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~--~~r~~n~d~~~~~~g~~~~G 230 (402) |.+.. .....+ |+.|.++...|+..+.+.. ..|++|..|..|.+ |.. 
..|-- .....+| T Consensus 237 ---g~~~~-----~~~~~~----~d~i~~~~~~l~~~~~~~a--~~v~n~~~~~~l~~lkd~~---g~~l~--~~~~~~g 297 (397) T protein:vir:49 237 ---GTLPN-----KPTLAK----WDDIIDLQAKVDPAIKQTS--LFLTNTSGFTALKKVKNAM---GDYLM--ERDVKSP 297 (397) T ss_pred ---ccccc-----cccccC----HHHHHHHHHhhhhhhcCCC--EEEEcHHHHHHHHHhhccC---Cceee--cccccCC Confidence 11111 011112 5667778888887777643 56899999998854 211 11100 0112234 Q ss_pred eEEEEeccEEEecCc--cccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccch----hH Q lcl|Aclame:pro 231 FVLSSYNCPVIPSNR--FPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEK----KE 304 (402) Q Consensus 231 ~V~~iaG~~V~~SNn--lP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~----~~ 304 (402) .-.+++|+||+.+.+ +|....+ ...-+-+||++. +..+...+++.+..+.. .+ T Consensus 298 ~~~~l~G~pV~~~~~~~~~~~~~~------------~~~~~~gd~~~~---------~~~~~~~~~~i~~~~~~~~~~~~ 356 (397) T protein:vir:49 298 TGYSIDGFVVKEISDRFLPNGTGG------------AMPLYFGDLKQA---------VTLFDRQHLSLLSTNIGGGAFET 356 (397) T ss_pred CCceecceeeEEecccccccccCC------------ceeEEEeeccce---------EEEEeecccEEEEeccccchhhc Confidence 456899999987554 4432211 000123444432 22222233333333211 22 Q ss_pred HHHHHHHHHHhcCcccccceEEEEEEeeccCc-cccccchh Q lcl|Aclame:pro 305 KTYYIDTFMAEGAIPDRWEAVSVVTTKRDATT-GDAGGPGD 344 (402) Q Consensus 305 ~~d~i~~~~a~Ga~vlRPeaa~vv~~~~~~t~-~~a~~~~~ 344 (402) -...+++.+-+|.++++|++.+.++++..+++ +.++..++ T Consensus 357 ~~~~~~~~~r~d~~~~~~~a~~~~~~~~~~~~~~~~~~~~~ 397 (397) T protein:vir:49 357 DTTKVRVIDRFDVVSTDTEAFVPASFKAIADQKAKLSTAGA 397 (397) T ss_pred CeeeEEEEEeeccEEecccceEEEEecccccccCcccccCC Confidence 23446677779999999999999988764443 33333333 No 113 >protein:vir:4830 Length: 397 # NCBI annotation: MPL-7201 # Family: family:all:21 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038327;genbank:gi:9634653;genbank:GeneID:1262632 Probab=99.03 E-value=8.7e-11 Score=75.73 Aligned_cols=285 Identities=12% Similarity=0.022 Sum_probs=153.1 Q ss_pred 
CCCCcccccc--cccccccHH-HHHHHHHhHHHHHHHHHHhhhcccceeeecccc--ceEEeeec-cceeeeeecCCCCC Q lcl|Aclame:pro 1 MSTPNTLTNV--AVSASGEVD-SLLIEKFNGKVNEQYLKGENILSYFDVQTVTGT--NTVSNKYL-GETELQVLAPGQSP 74 (402) Q Consensus 1 Ms~~n~~t~~--~~~~~~d~~-alfle~f~geV~t~f~~~sv~~~~~~~rti~~G--ksv~f~~i-G~~t~~~~~~G~~i 74 (402) +-........ ...+++.+- .+.=+.|..+++......+.+++++++.++.++ +-..++.. +...+....-|+.+ T Consensus 97 ~~~~~~~~~~~~~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~ 176 (397) T protein:vir:48 97 LVRGRYQNLLDSKTDASGSDAGLTIPQDIQTAIHTLVRQYDSLQEYVNVENVTTLTGSRVYEKWADITGLAKLDDEAGSI 176 (397) T ss_pred HHhhhhhHHHHHhhccCCccccccccHHHHHHHHHHHHHHHHHHhhhceeeccCCcceEEEEeecCCCcceeeecccccc Confidence 0000000000 000111111 233388999999999999999999998887643 33323332 22334555556666 Q ss_pred CC-CCccccceeEeecceeeccchhhh--HHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccccc Q lcl|Aclame:pro 75 NA-TPTQADKNQLVIDTTVIARNTVAH--IHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPR 151 (402) Q Consensus 75 ~~-~~~~~~e~~itID~~lya~~~Idd--lDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~ 151 (402) .. ..+...+.++.+..+- ....|.+ |+ ++.+| +.+.+.+++++++++..|+.++.-. T Consensus 177 ~~~~~~~~~~v~~~~~k~~-~~~~iS~ell~--ds~~~-l~~~v~~~l~~~~~~~~d~~il~G~---------------- 236 (397) T protein:vir:48 177 GTNDDPKLYPIRYAIKRYA-GISTVTNSLLA--DSAEN-ILAWLSGWIAKKVVVTRNKAILEAI---------------- 236 (397) T ss_pred ccccccceeeEEeeheeee-eehhhHHHHHh--hchHH-HHHHHHHHHHHHHHHHHHHHHhhcc---------------- Confidence 54 4567777777776532 2223332 22 24566 7889999999999999999886211 Q ss_pred ccccccccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccccCcccccce Q lcl|Aclame:pro 152 VKGHGFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATINGF 231 (402) Q Consensus 152 ~~g~~~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~~~~G~ 231 (402) +.+... +... -++.|.++..+|+..+.+. . ..|++|..|..|.+-..- |..|-- .....+|. 
T Consensus 237 ----g~~~~~-----~~~~----~~d~i~~~~~~l~~~~~~~-a-~~v~n~~~~~~L~~lkd~-~G~~i~--~~~~~~~~ 298 (397) T protein:vir:48 237 ----ATLPTK-----PTLT----KWDDIIDLQAKVDPAIKQT-S-FFLTNTSGFTALKKVKNA-FGDYLM--ERDVKSPT 298 (397) T ss_pred ----cccccc-----cccc----cHHHHHHHHHHhhhhhcCC-C-EEEECHHHHHHHHHhhcC-CCceee--ccCcCCCC Confidence 111111 1111 1566778888888777653 3 458999999999652111 111110 11123455 Q ss_pred EEEEeccEEEecCc--cccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccch----hHH Q lcl|Aclame:pro 232 VLSSYNCPVIPSNR--FPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEK----KEK 305 (402) Q Consensus 232 V~~iaG~~V~~SNn--lP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~----~~~ 305 (402) -.+++|+||+.+.+ +|.... +..--+-+||++..-+ +.-.++..+..+.. .+- T Consensus 299 ~~~l~G~PV~~~~~~~~~~~~~------------~~~~~~~gd~~~~~~~---------~~~~~~~i~~~~~~~~~~~~~ 357 (397) T protein:vir:48 299 GYSIDGFAVKEVADRWLANASS------------GAMPLYFGDLKQAVTL---------FDRQQMSLLSTNIGGGAFETD 357 (397) T ss_pred CceeccceeEEecccccCCcCC------------CceEEEEEeccceEEE---------EeecceEEEEeccchhhhhcC Confidence 57899999987654 332211 1111122444432211 12222333332211 122 Q ss_pred HHHHHHHHHhcCcccccceEEEEEEeecc-Cccccccchh Q lcl|Aclame:pro 306 TYYIDTFMAEGAIPDRWEAVSVVTTKRDA-TTGDAGGPGD 344 (402) Q Consensus 306 ~d~i~~~~a~Ga~vlRPeaa~vv~~~~~~-t~~~a~~~~~ 344 (402) ...+.+.+-++..+++|++.+.++++..+ .++..++++. 
T Consensus 358 ~~~~r~~~r~d~~~~~~~a~~~~~~~~~~~~~~~~~~~~~ 397 (397) T protein:vir:48 358 TTKIRVIDRFDVVATDTESFVPASFKAIADQKGNLGSTAV 397 (397) T ss_pred ceeEEEEeeeccEEecccceEEEEecccccCCCCccccCC Confidence 23556677789999999999888877643 3333344433 No 114 >protein:vir:95451 Length: 313 # NCBI annotation: hypothetical protein ORF044 # Family: family:all:11728 # MgeID: mge:1570 # MgeName: PA11 # Cross-refs: genbank:acc:YP_001294637;genbank:gi:149408203;genbank:GeneID:5237018 Probab=99.02 E-value=2.3e-12 Score=84.44 Aligned_cols=296 Identities=14% Similarity=0.150 Sum_probs=165.1 Q ss_pred CCCCcccccccccccccHHHHHH--HHHhHHHHHHHHHHhhhcccce-eeeccccceEEeeeccceeeeeecCCCCCCCC Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDSLLI--EKFNGKVNEQYLKGENILSYFD-VQTVTGTNTVSNKYLGETELQVLAPGQSPNAT 77 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~alfl--e~f~geV~t~f~~~sv~~~~~~-~rti~~Gksv~f~~iG~~t~~~~~~G~~i~~~ 77 (402) |-. .++-.-|| |+|+-+++--...+-+--.+-+ +-..-.|.+.+|+.||.++++...-.+++..+ T Consensus 1 ~~~------------TSNT~A~I~SE~~s~~I~~~LH~~LL~~~~~R~V~DF~~G~~L~I~tiGs~~~~~~~E~~~~~~~ 68 (313) T protein:vir:95 1 MQL------------TSNTRAFIESEQYSKFILLNLHDGLLPETFYRNVSDFGSGETLHIKTIGSVTLQEAEEDTPLIYN 68 (313) T ss_pred Ccc------------cccchheehhhhHHHHHHHHhhccccchhhhhhhccCCCCCEEEecccCceeeeccccCCCeeec Confidence 322 22223344 8888888665554432222222 22345799999999999999988888888999 Q ss_pred CccccceeEeecceeeccchhh---hHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccccccc Q lcl|Aclame:pro 78 PTQADKNQLVIDTTVIARNTVA---HIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKG 154 (402) Q Consensus 78 ~~~~~e~~itID~~lya~~~Id---dlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~g 154 (402) ++.+.|.++-|-+ |....-+ ||-+--..+|.+-.+...|...|+-+.|...++..- ++.++.. 
..+..+.| T Consensus 69 ~i~TGEIt~~i~~--Y~G~A~~vt~~LR~D~~~I~~~~A~~~AE~~RAI~E~~~TD~L~~G--~~~FA~~--~~P~~vNG 142 (313) T protein:vir:95 69 PIETGEITFQITE--YKGDAWYVTDDLREDGTDIDRLMAERAAESTRAIQETFETDFLKTG--AEYFAAN--PGPHNVNG 142 (313) T ss_pred ccccceEEEEEEe--ecCChhhhhhhhhhcchhHHHHhhhcchhhHHHHHHHHhhHHHhhc--hhhhccC--CCCccccc Confidence 9999999999887 7766622 222222233333345556667777777766555432 1233221 11222222 Q ss_pred cccccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccchhh--ccc--ccccCccc-cc Q lcl|Aclame:pro 155 HGFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVD--KTY--TISQSGAT-IN 229 (402) Q Consensus 155 ~~~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n--~d~--~~~~~g~~-~~ 229 (402) . ..++-.++ ++..-.+.-|....-.+++.++|.+||+.+|+|-.---|---..+.| .|+ -....|.- .. T Consensus 143 ~--PH~~V~~~----T~~~~~~~~~~~~~~~~~~a~~P~~G~v~IvDP~~~~~L~~l~~It~~vt~~~k~I~ESG~A~~~ 216 (313) T protein:vir:95 143 F--PHVIVSAE----TNGVFALKHLIAMRLAFDKANVPAEGRVFIVDPVAEATLNGLVTITHDVTDFGKMILESGMARGQ 216 (313) T ss_pred c--cceEEecc----CCceehhhHHHHhhhhhhhccCCccceEEEEcchhhhhhhhhheeecccccccceeeeccCCchh Confidence 2 22221111 22222244466777789999999999999999988666643222222 011 01222321 13 Q ss_pred ceEEEEeccEEEecCccccccCccccccccccCCccc---c-ceeeeccceeEEeecHHHhhhhhhcccceeeccchhH- Q lcl|Aclame:pro 230 GFVLSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYR---Y-DPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKE- 304 (402) Q Consensus 230 G~V~~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~---~-~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~~- 304 (402) .+|.++.|++|+.||.|...+.. +. 
...++|.- | ++..+-.+-..+.+ +-| +++|.++++.+ T Consensus 217 ~Fi~~~YG~Di~~SN~L~~AN~~--D~--~tT~~G~~~NlFM~i~D~~~~P~~~AW--------r~M-P~s~~~~~~~~~ 283 (313) T protein:vir:95 217 RFIMNLYGWDILTSNRLHVANYN--DG--TTTGNGYVGNLFMCILDDQTKPIMGAW--------RRM-PKSEGERNKDRA 283 (313) T ss_pred HHHHHHhhhhhhhhhhhhhcccc--cc--ccccCceeeeeeeeeecccccceeeee--------ccc-cccccccccccc Confidence 56888999999999999754322 11 11122210 0 12122122222222 223 26666766654 Q ss_pred -HHHHHHHHHHhcCcccccceEEEEEEeeccC Q lcl|Aclame:pro 305 -KTYYIDTFMAEGAIPDRWEAVSVVTTKRDAT 335 (402) Q Consensus 305 -~~d~i~~~~a~Ga~vlRPeaa~vv~~~~~~t 335 (402) .-+++.++ ||-|+.|-|.++.|-+...+- T Consensus 284 ~~~~~~~~R--~G~Gi~R~~~L~~~~~~A~~~ 313 (313) T protein:vir:95 284 RDEHVVRCR--YGFGIQRLDTLGLLATSATAY 313 (313) T ss_pred cccceeeee--ecccceeecceeEEEeccccC Confidence 44455554 788899999988876544333 No 115 >protein:vir:108211 Length: 318 # NCBI annotation: gp9 # Family: family:all:6420 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552338;genbank:gi:160700658;genbank:GeneID:5758931 Probab=99.02 E-value=2.7e-11 Score=78.55 Aligned_cols=292 Identities=12% Similarity=0.072 Sum_probs=163.3 Q ss_pred CCCCcccccccccccccHHHHHH-HHHhHHHHHHHHHHhhhcccceee-eccccceEEe----eeccceeeeeecCCCCC Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDSLLI-EKFNGKVNEQYLKGENILSYFDVQ-TVTGTNTVSN----KYLGETELQVLAPGQSP 74 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~alfl-e~f~geV~t~f~~~sv~~~~~~~r-ti~~Gksv~f----~~iG~~t~~~~~~G~~i 74 (402) |+.|++++-.+-++.=...+|.= ..|-........+...+.++...+ .-+++-+++| |.......+.+.+|.++ T Consensus 1 ~~~~~~i~s~~~~~~itv~~ll~~P~~I~~~i~e~~~~~~iad~lf~~~~a~~~~~v~f~~~~p~~~~~d~e~VaEggEi 80 (318) T protein:vir:10 1 MTAPTGIVSVSDGPAITVRELVGNPLWIPTALKKMMVNQFISESLFRNGGANPNGVVAYNEGNPSFLEDDVADVAEFGEI 80 (318) T ss_pred CCCCCcceeeecCCceehHHhhCCchhHHHHHHHHHhccchhhhhhhcccccccceeEEEecccccccCcHhhccCcccc Confidence 99997766555554323223221 344444444444555555544433 2445668888 44555677888999998 Q ss_pred 
CCCCccccceeE-eec-ceeeccchhhhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccccc Q lcl|Aclame:pro 75 NATPTQADKNQL-VID-TTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRV 152 (402) Q Consensus 75 ~~~~~~~~e~~i-tID-~~lya~~~IddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~ 152 (402) +-....+.+..| .+. --+=+++.=..+++ +++|.|...+ ++++-+++++.|+.++..|..+..-.-+ .+.. T Consensus 81 P~~~~~~G~~~ia~~~K~G~~~~vS~Em~~~--n~~~~v~r~~-~~l~Nti~r~~d~~a~dal~sa~t~~~~----~s~~ 153 (318) T protein:vir:10 81 PVSAGARGLPRTAFAVKKALGVRVSKEMIDE--NRVGAVNDQM-LQLRNTFIRANDRSAKALLQSPIVPTLA----VPTA 153 (318) T ss_pred cccCCCCCchhhhhhehhccceeccHHHHhh--cChhHHHHHH-HHHHHHHHHHHHHHHHHHHhcccccccc----CCcC Confidence 766555544333 332 11222332223322 4445455544 7889999999999887766544321111 1111 Q ss_pred cccccccccccCCccccccHH-----HHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccccC--- Q lcl|Aclame:pro 153 KGHGFSINVNVTESEALANPQ-----YVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQS--- 224 (402) Q Consensus 153 ~g~~~~~~v~~~~a~~~~~~~-----~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~--- 224 (402) +. +.+.+.. +..++. ...+.+..-....+ .+-...--.+||.|..|..|++++++... |...++ T Consensus 154 w~-~~~~~~~-----d~~~A~e~v~~a~~~~~~a~~~~~~-~~~GY~pdtIVlhP~~~~~l~~n~~~~~~-y~~~a~~~~ 225 (318) T protein:vir:10 154 WD-NGGKVRT-----DIAIAIEQISTAAPTAYPAGVGSSD-EYFGFIPDTIVMHYALLPILMDNENFMKV-YERNANYVS 225 (318) T ss_pred CC-Ccccccc-----cchhhhhhhhhhhhhhhhhhhhhhh-hccCccceeeEECHHHHHHHhcchhhhhh-hhccchhhh Confidence 11 1111110 111110 01111111111122 23344446799999999999999887532 211111 Q ss_pred --cccccceEEEEeccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhh-hhcccceeeccc Q lcl|Aclame:pro 225 --GATINGFVLSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVG-RTIEVTGDIFYE 301 (402) Q Consensus 225 --g~~~~G~V~~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv-~~~dl~~e~~~d 301 (402) ..+.+..=++++|++|+.|+++|... 
++++.+..+|+- -..+|+.+.+|+ T Consensus 226 ~~~~~tg~~~g~~lGl~vi~s~~~p~~~---------------------------alvlq~g~vG~~~d~~pl~~t~~~~ 278 (318) T protein:vir:10 226 TAPDWTGNFPGSVMGLNVIRSRTFPIDR---------------------------VLIMERGTVGFYSDTRPLQFTALYP 278 (318) T ss_pred hcccccccccceeeceEEeecCccCCCe---------------------------eEEEecCCcceeeccccceeeeccc Confidence 11122223678999999999999522 355566666632 345677888887 Q ss_pred h-------hHHHHHHHHHHHhcCcccccceEEEEEEeeccCc Q lcl|Aclame:pro 302 K-------KEKTYYIDTFMAEGAIPDRWEAVSVVTTKRDATT 336 (402) Q Consensus 302 ~-------~~~~d~i~~~~a~Ga~vlRPeaa~vv~~~~~~t~ 336 (402) + ...+|.++.......+|.+|.+ ++.+++=.|| T Consensus 279 egg~~~g~~~~s~~~~~~~~~~~~V~~PkA--~~~itgi~~~ 318 (318) T protein:vir:10 279 EGNGPNGGPTESYRADASHKRALAVDQPKA--ALWLTGIVTP 318 (318) T ss_pred CCCCCCCCcchhhheehheeeeeeeeCcce--eEEEeeccCC Confidence 7 7788999999999999999975 5666765555 No 116 >protein:vir:1433 Length: 435 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536362;genbank:gi:17975167;genbank:GeneID:929171 Probab=99.01 E-value=7.8e-11 Score=75.99 Aligned_cols=291 Identities=14% Similarity=0.123 Sum_probs=149.4 Q ss_pred CCCCcccccccccccccHHHHHH-HHHhHHHHHHHHHHhhhccc-ceeeeccccceEEeeec-cceeeeeecCCCCCCCC Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDSLLI-EKFNGKVNEQYLKGENILSY-FDVQTVTGTNTVSNKYL-GETELQVLAPGQSPNAT 77 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~alfl-e~f~geV~t~f~~~sv~~~~-~~~rti~~Gksv~f~~i-G~~t~~~~~~G~~i~~~ 77 (402) +.-.+..+ .+++..-...+ +.+..++++..+..++++.+ .++.+..+| .+++|+. |...+..+.-|..+... 
T Consensus 126 ~~~~~~~~----~~t~~~gg~~vP~~~~~~ii~~l~~~~~i~~~~~~~~~~~~~-~~~~p~~~~~~~a~~v~E~~~~~~~ 200 (435) T protein:vir:14 126 EEVAMSLN----TLSPGAGGVLVPENLSSEVIELLRPKSVVRKLGARTLPLSNG-NITIPRLKGGAIVGYIGADTDIPTT 200 (435) T ss_pred hhhhhhcc----cCCcCCCccccchhHHHHHHHHHhhhchhhhhcceeeecCCC-ceEEEEEeCCcceeeeccCcccccc Confidence 00011000 01111111223 77788888888888888876 444444444 5778876 66677677667777666 Q ss_pred CccccceeEeecceeeccchhh--hHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccccccccc Q lcl|Aclame:pro 78 PTQADKNQLVIDTTVIARNTVA--HIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKGH 155 (402) Q Consensus 78 ~~~~~e~~itID~~lya~~~Id--dlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~g~ 155 (402) .+...+.++.+-++- .-+.|. -+++.....+ +.+.+..++++++++..|+.++ .+.... ..| .|. T Consensus 201 ~~~f~~i~~~~~k~~-~~~~iS~ell~ds~~~~~-l~~~i~~~l~~ai~~~~d~a~l----~G~G~~-----~~p--~Gi 267 (435) T protein:vir:14 201 QQQFDDLKLTAKKMA-ALVPIANDLIKYAGVNPN-VDQIVVGDLTAAIGAREDKAFI----RDDGTA-----NTP--KGL 267 (435) T ss_pred ccceeEEEeeeEEEE-EeehhhHHHHHhhccCHH-HHHHHHHHHHHHHHHHHHHHhh----ccCCCC-----ccc--cce Confidence 666666666664432 222332 2444433333 6788899999999999999885 111100 001 111 Q ss_pred cccc-ccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccccCcccccceEEE Q lcl|Aclame:pro 156 GFSI-NVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATINGFVLS 234 (402) Q Consensus 156 ~~~~-~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~~~~G~V~~ 234 (402) .... ...........+.+.+++.+.++...+...+.-......|++|..|..|.+-.. .|..|--. 
...+| + T Consensus 268 ~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd-~~G~~l~~---~~~~g---~ 340 (435) T protein:vir:14 268 RFWALPSNVITASDASTLQKIETDLGKVILALENADANLTQPGWIMAPRTFRFLEGLRD-GNGNKVYP---ELANG---M 340 (435) T ss_pred eecccccceeccccccchhhHHHHHHHHHHHhhhccccccCCEEEEcHHHHHHHHHhhc-cCCceecc---CCCCC---e Confidence 1000 000111122233444555566666666555443344556999999998854211 11111100 01122 6 Q ss_pred EeccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccch----------h- Q lcl|Aclame:pro 235 SYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEK----------K- 303 (402) Q Consensus 235 iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~----------~- 303 (402) ++|+||+.|+++|......+. ...=+-+||+... ++.+. ++..+..++. . T Consensus 341 l~G~Pv~~~~~~p~~~~~~~~---------~~~i~~gd~s~~~--i~~~~--------~~~~~~~~~~~~~~~~~~~~~~ 401 (435) T protein:vir:14 341 LKGYPVGKTTQVPINLGETGK---------ESEIYFTDFGDVF--IGEEE--------TLEIDYSKEATYKDADGHMVSA 401 (435) T ss_pred eecceeEeeccccccccCCCc---------cceEEEeecccEE--EEEec--------ccEEEEeccccccccccchhhh Confidence 899999999999964321110 0011335665432 22221 2222222111 0 Q ss_pred --HHHHHHHHHHHhcCcccccceEEEEEEeeccCccc Q lcl|Aclame:pro 304 --EKTYYIDTFMAEGAIPDRWEAVSVVTTKRDATTGD 338 (402) Q Consensus 304 --~~~d~i~~~~a~Ga~vlRPeaa~vv~~~~~~t~~~ 338 (402) +=.-.+++.+-++.++.||++.+.|+- .+-|+ T Consensus 402 f~~~~~~~r~~~r~d~~~~~~~a~~~l~~---~~~~~ 435 (435) T protein:vir:14 402 FQRDQTLIRVIAKNDFGPRHVESIAVLAG---VAWGA 435 (435) T ss_pred hhcChhheeeeeeeCceeecccceEEEec---CCCCC Confidence 112466788889999999998766532 22222 No 117 >protein:vir:7409 Length: 408 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839926;genbank:gi:30089896;genbank:GeneID:1260683 Probab=99.01 E-value=2.8e-11 Score=78.41 Aligned_cols=290 Identities=14% Similarity=0.048 Sum_probs=149.6 Q ss_pred 
CCCCcc-cccccccccccHHHHHH-HHHhHHHHHHHHHHhhhcccceeeecccc-ceEEeeeccce-eee-eecCCCCCC Q lcl|Aclame:pro 1 MSTPNT-LTNVAVSASGEVDSLLI-EKFNGKVNEQYLKGENILSYFDVQTVTGT-NTVSNKYLGET-ELQ-VLAPGQSPN 75 (402) Q Consensus 1 Ms~~n~-~t~~~~~~~~d~~alfl-e~f~geV~t~f~~~sv~~~~~~~rti~~G-ksv~f~~iG~~-t~~-~~~~G~~i~ 75 (402) +...+. ..+....++...-...| +.|+.+++......+.+++++++.++.++ .++.+++.... ... ...-|+.+. T Consensus 105 ~~~~~~~~~~a~~~~~~~~gg~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~ 184 (408) T protein:vir:74 105 MAFLNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSSGSRVYEKWTDVTPLKAMDEEDGKIP 184 (408) T ss_pred hhhhhhhhhhhhcccccCCCceeechhHhhHHHHHHhhhcchhhhcceeeccCCcceEEEEeecCCcccccccccccccc Confidence 111111 11111111111112223 88999999999999999999999888754 35556654332 222 333344554 Q ss_pred C-CCccccceeEeecceee-ccchhhhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccccccc Q lcl|Aclame:pro 76 A-TPTQADKNQLVIDTTVI-ARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVK 153 (402) Q Consensus 76 ~-~~~~~~e~~itID~~ly-a~~~IddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~ 153 (402) . ..+..++.++....+-. ..+.-.-+++ +.+| +.+.+.+++++++++..|+.++.- T Consensus 185 ~~~~~~~~~i~~~~~k~~~~~~iS~ell~d--s~~~-l~~~i~~~l~~~~~~~~d~~il~G------------------- 242 (408) T protein:vir:74 185 DLDNPRLTIIKYLIKRYAGIITATNTLLKD--TAEN-ILAWLSSWIAKKVVVTRNQAIIAA------------------- 242 (408) T ss_pred cccccceeeEEeeeeeEEeeehhHHHHHhh--chHH-HHHHHHHHHHHHHHHHHHHHHhhc------------------- Confidence 3 44666777776665321 1111122322 3455 688999999999999999977521 Q ss_pred ccccccccccCCccccccHHHHHHHHHHHH-HHHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccccCcccccceE Q lcl|Aclame:pro 154 GHGFSINVNVTESEALANPQYVMAAVEYAL-EQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATINGFV 232 (402) Q Consensus 154 g~~~~~~v~~~~a~~~~~~~~l~dai~~a~-~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~~~~G~V 232 (402) .|.+.. .....+ ++.|.++. ..|+..+.+. -..|++|..|..|.+-.. 
.+..|-- .+...+|.- T Consensus 243 -~G~~~~-----~~~~~~----~~~i~~~~~~~l~~~~~~~--a~~v~n~~~~~~l~~lkd-~~G~~l~--~~~~~~~~~ 307 (408) T protein:vir:74 243 -MGTVPK-----KPTIAN----FDDVITMINTSVDPAIIAT--SSLLTNQSGLNKLALVKT-AEGKYLL--EPDPTKPNS 307 (408) T ss_pred -cccccc-----cccccc----HHHHHHHHHHhhhhhhcCC--CEEEEcHHHHHHHHHhhc-CCCceEe--ccCcCCCCC Confidence 111100 011112 33444443 4566655542 245789999999965211 1111111 111223444 Q ss_pred EEEeccEEEecCc--cccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccch----hHHH Q lcl|Aclame:pro 233 LSSYNCPVIPSNR--FPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEK----KEKT 306 (402) Q Consensus 233 ~~iaG~~V~~SNn--lP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~----~~~~ 306 (402) .+++|+||+.+.+ +|..+. +...=+-+||++..- ++. -.+++.+..+.. .+.. T Consensus 308 ~~l~G~pV~~~~~~~~~~~~~------------~~~~i~~gd~~~~~~-~~~--------~~~~~i~~~~~~~~~f~~~~ 366 (408) T protein:vir:74 308 YLIKGKQVIVVADRWLPNSGS------------TVYPLYYGDMSQAIT-LFD--------RENMSLLPTNIGAGAFETDT 366 (408) T ss_pred ceecceeeEEecCcccccccC------------CcceEEEEehhccEE-EEE--------ecceEEEEeccccchhhcce Confidence 6899999998765 443221 111113345544222 111 123333333221 2233 Q ss_pred HHHHHHHHhcCcccccceEEEEEEeeccCccccccchhhHHHh Q lcl|Aclame:pro 307 YYIDTFMAEGAIPDRWEAVSVVTTKRDATTGDAGGPGDDHATV 349 (402) Q Consensus 307 d~i~~~~a~Ga~vlRPeaa~vv~~~~~~t~~~a~~~~~~~~~~ 349 (402) ..+.+.+-+|.++++|++.+.++++..++++.+. 
.+..-++| T Consensus 367 ~~~r~~~r~d~~~~~~~a~~~~~~~~~~~~~~~~-~~~~~~~~ 408 (408) T protein:vir:74 367 TKIRVIDRFDVKATDSEALVAGSFTAIADQVGNF-KTTTSTAV 408 (408) T ss_pred eeEEEEEeeCcEEecccceEEEEeecccCCCCCC-CCCccccC Confidence 4466777799999999999999886644333222 11112222 No 118 >protein:vir:4953 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049929;genbank:gi:9632900;genbank:GeneID:1262076 Probab=99.01 E-value=9.8e-11 Score=75.45 Aligned_cols=275 Identities=10% Similarity=0.008 Sum_probs=157.6 Q ss_pred CCCCcccccccccccccHHH-HHHHHHhHHHHHHHHHHhhhcccceeeeccc-cceEEeeec--cceeeeeecCCCCCCC Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDS-LLIEKFNGKVNEQYLKGENILSYFDVQTVTG-TNTVSNKYL--GETELQVLAPGQSPNA 76 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~a-lfle~f~geV~t~f~~~sv~~~~~~~rti~~-Gksv~f~~i--G~~t~~~~~~G~~i~~ 76 (402) |+..+ +..-. +.=+.|..++.......+.++++.++.++.+ ..+..+++. +...+..+.-|..+.. T Consensus 109 ~~~~t----------~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~ 178 (397) T protein:vir:49 109 KTDAS----------GSDAGLTIPQDIQTAIHTLVSQYDSLQEYVNVENVTTLTGSRVYEKWTDITGLANIDDEAGKIAD 178 (397) T ss_pred hhccc----------cccCcccccHhHHHHHHHHHHhhhhHHhhhceeecccCccceEEEeeccCCcceeeecCcccccc Confidence 22111 11112 2238889999999999999999999888864 223445543 3345667777777765 Q ss_pred -CCccccceeEeecceeeccchhh--hHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccccccc Q lcl|Aclame:pro 77 -TPTQADKNQLVIDTTVIARNTVA--HIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVK 153 (402) Q Consensus 77 -~~~~~~e~~itID~~lya~~~Id--dlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~ 153 (402) ..+...+.++.+..+- .-..|. -++ ++.+| +.+.+.+++++++++..|+.++.-. 
T Consensus 179 ~~~~~~~~i~~~~~k~~-~~~~iS~ell~--ds~~~-l~~~i~~~l~~~~~~~~d~ai~~G~------------------ 236 (397) T protein:vir:49 179 VDDPKLSLIKYTIKRYA-GISTVTNSLLA--DSAEN-ILAWLSGWIAKKVVVTRNKAILEAI------------------ 236 (397) T ss_pred ccccceeeEEeeeeeEE-eeehhHHHHHh--hhHHH-HHHHHHHHHHHHHHHHHHHHHHhhc------------------ Confidence 4577777777775532 222232 222 23466 7889999999999999999876321 Q ss_pred ccccccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccccCcccccceEE Q lcl|Aclame:pro 154 GHGFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATINGFVL 233 (402) Q Consensus 154 g~~~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~~~~G~V~ 233 (402) |.+.. .....+ ++.|.++...|...+.+. -.+|++|..|..|.+-.. .+..|-- .....+|.-. T Consensus 237 --g~~~~-----~~~~~~----~d~i~~~~~~l~~~~~~~--a~~vmn~~~~~~l~~lkd-~~G~~l~--~~~~~~~~~~ 300 (397) T protein:vir:49 237 --AALPT-----KPTLTK----WDDIIDLEAKVDPAIKQT--SFFLTNTSGFTALKKVKN-ALGDYLM--ERDVKSPTGY 300 (397) T ss_pred --ccccc-----cccccc----HHHHHHHHHhhhhhhcCC--CEEEEcHHHHHHHHHhhc-CCCceee--ccCcCCCCCc Confidence 11100 001111 566777888888777654 356899999999964211 1111110 1112345557 Q ss_pred EEeccEEEecCc--cccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeecc----chhHHHH Q lcl|Aclame:pro 234 SSYNCPVIPSNR--FPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFY----EKKEKTY 307 (402) Q Consensus 234 ~iaG~~V~~SNn--lP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~----d~~~~~d 307 (402) +++|+||+.+.+ +|.... +...=+-+||++...+ +.-.++..+..+ .-.+-.. 
T Consensus 301 ~l~G~PV~~~~~~~~~~~~~------------~~~~i~~gd~~~~~~~---------~~~~~~~i~~~~~~~~~~~~~~~ 359 (397) T protein:vir:49 301 SIDGFAVKEVADRWLANGTG------------GAMPLYFGDLKQAVTL---------FDRQHMSLLSTNIGGGAFETDTT 359 (397) T ss_pred eecceeeEEecccccccccC------------CceeEEEeeccceEEE---------EeecceEEEEeccccchhhcCce Confidence 899999987544 443221 1111122444432111 112223333322 1122234 Q ss_pred HHHHHHHhcCcccccceEEEEEEee-ccCccccccchh Q lcl|Aclame:pro 308 YIDTFMAEGAIPDRWEAVSVVTTKR-DATTGDAGGPGD 344 (402) Q Consensus 308 ~i~~~~a~Ga~vlRPeaa~vv~~~~-~~t~~~a~~~~~ 344 (402) .+++.+-+|.++++|++.+.++++. ...++..+++|. T Consensus 360 ~~r~~~r~d~~~~~~~a~~~~~~~~~~~~~~~~~~~~~ 397 (397) T protein:vir:49 360 KVRVIDRFDVVATDTEAFVPASFKAIADQKGNLGSTAV 397 (397) T ss_pred eEEEEeeeCcEEecccceEEEEeecccCCCCCcccccC Confidence 5677778999999999999888765 445555556655 No 119 >protein:vir:100172 Length: 394 # NCBI annotation: putative major head protein # Family: family:all:21 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025031;genbank:gi:48697264;genbank:GeneID:2948270 Probab=99.00 E-value=5e-11 Score=77.07 Aligned_cols=282 Identities=12% Similarity=0.038 Sum_probs=148.8 Q ss_pred CCCCcccccccccccccHHHHHH-HHHhHHHHHHHHHHhhhcccceeeeccccceEEeeec--cceeeeeecCCCCCCC- Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDSLLI-EKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYL--GETELQVLAPGQSPNA- 76 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~alfl-e~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~~i--G~~t~~~~~~G~~i~~- 76 (402) |.... .......++.+-...| +.|.++++......+.++++.++.++.++ +.++++. +...+....-|.+... 
T Consensus 103 ~~~~~--~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~E~~~~~~~ 179 (394) T protein:vir:10 103 HGKVI--DNAAGHVTSTEAGVLIPEEIIYDPTAEVNSVVDLSTLVTKTPVTTP-KGTYPILKRATDRFSSVAELAENPAL 179 (394) T ss_pred cchhh--hhhhcccccccCceeccHHHHHHHHHHHHhhhhhhhhceeeeccCC-ceEEEEEecCCCcccccccccccccc Confidence 11100 0000111112122334 88999999999999999999998887654 4556544 4445555555555543 Q ss_pred CCccccceeEeecceeeccch-hhhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccccccccc Q lcl|Aclame:pro 77 TPTQADKNQLVIDTTVIARNT-VAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKGH 155 (402) Q Consensus 77 ~~~~~~e~~itID~~lya~~~-IddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~g~ 155 (402) ..+..+++++.+-.+ +.+. |.+-=--++.+| +.+.+.+++++++++..|+.++...- T Consensus 180 ~~~~~~~v~l~~~k~--~~~~~iS~ell~ds~~~-l~~~i~~~la~~~~~~~~~~il~g~g------------------- 237 (394) T protein:vir:10 180 AEPEFEQVDWSVSTY--RGAIPLSEEAIADSAVD-LTSLVGQSINEKSVNTYNAMIAPVLQ------------------- 237 (394) T ss_pred ccccceeEEeeeeee--EeeehhHHHHHhhhhHH-HHHHHHHHHHHHHHHHHHHHHhhccc------------------- Confidence 456667777766543 3322 222101124466 78899999999999999988753221 Q ss_pred ccccccccCCccccccHHHHHHHHHHHHH-HHHhhcCCccCcEEEeChHHHHHHhc--c--cchhhcccccccCcccccc Q lcl|Aclame:pro 156 GFSINVNVTESEALANPQYVMAAVEYALE-QQLEQEVDISDVAIMMPWKFFNALRD--A--DRIVDKTYTISQSGATING 230 (402) Q Consensus 156 ~~~~~v~~~~a~~~~~~~~l~dai~~a~~-~LdekdVP~~gR~~VV~P~~y~~Ll~--~--~r~~n~d~~~~~~g~~~~G 230 (402) .+.... .... ..++.|.++.. .++... .-..|++|..|..|.+ | .|++-. 
..-......| T Consensus 238 -~~~~~~---~~~~----~~~d~l~~~~~~~~~~~~----~a~~vmn~~~~~~l~~lkd~~G~~i~~---~~~~~~~~~~ 302 (394) T protein:vir:10 238 -SFTAKA---TTTD----TLVDSLKHILNVDLDPAY----SRALVVTQSLFNTLDTLKDKNGRYLLH---DASDSITDGT 302 (394) T ss_pred -cccccc---cccc----ccHHHHHHHHHhhhhhhc----cCEEEecHHHHHHHHHhhccCCCeeee---ccccccccCC Confidence 111000 0111 12344544432 333332 2356899999999874 2 222211 0011112234 Q ss_pred eEEEEeccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccchhHHHHHHH Q lcl|Aclame:pro 231 FVLSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKEKTYYID 310 (402) Q Consensus 231 ~V~~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~~~~d~i~ 310 (402) .-.+++|+||+.+++..-.... |...=+-+||++..-++- -.+++.+.. +...+...+. T Consensus 303 ~~~~L~G~PV~~~~~~~~~~~~-----------~~~~i~~gd~s~~~~~~~---------~~~~~v~~~-~~~~~~~~~~ 361 (394) T protein:vir:10 303 AKGTVLGVPVYVVGDALLGSAA-----------GDQKAFVGDLKRGVLFAD---------RQQVTLAWE-DSKIYGRYLG 361 (394) T ss_pred cccccccceeEEecccccCCCC-----------CceEEEEeeccccEEEEe---------ecceEEEEe-cccccceeEE Confidence 4568999999987753221110 111114456665322221 122233322 2233444566 Q ss_pred HHHHhcCcccccceEEEEEEeeccCccccccch Q lcl|Aclame:pro 311 TFMAEGAIPDRWEAVSVVTTKRDATTGDAGGPG 343 (402) Q Consensus 311 ~~~a~Ga~vlRPeaa~vv~~~~~~t~~~a~~~~ 343 (402) +.+=++.++++|++.+.|+....+.++.+++.. 
T Consensus 362 ~~~r~d~~~~~~~ai~~~~~~~~~~~~~~~~~~ 394 (394) T protein:vir:10 362 AAFRFGVKQADSNAGYFVTNTDAASGSTSGTGK 394 (394) T ss_pred EEEEeccEEeccccEEEEEeecccCCCCCCCCC Confidence 677789999999999888776655444444443 No 120 >protein:vir:7855 Length: 497 # NCBI annotation: gp12 # Family: family:all:585 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817462;genbank:gi:29565891;genbank:GeneID:1259081 Probab=98.99 E-value=1.9e-11 Score=79.34 Aligned_cols=299 Identities=12% Similarity=0.051 Sum_probs=148.7 Q ss_pred CCCCc---cccccc-ccccccHHHHHHHHHhHHHHHHHHHHhhhcccceeeeccccceEEeeec--cceeeeeecCCCCC Q lcl|Aclame:pro 1 MSTPN---TLTNVA-VSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYL--GETELQVLAPGQSP 74 (402) Q Consensus 1 Ms~~n---~~t~~~-~~~~~d~~alfle~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~~i--G~~t~~~~~~G~~i 74 (402) +...- ...+-. .+.+++--.+..+.|..+++....+.+.++++.++.++.++ ++.||+. |..++....-|+.+ T Consensus 138 ~~~~~~~~~~~~~~~~~~~~~gg~~vp~~~~~~ii~~~~~~~~i~~l~~~~~~~~~-~~~~~~~~~~~~~a~wv~E~~~~ 216 (497) T protein:vir:78 138 FADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSP-NLSYLTESAAHNNAAAVAEAGTY 216 (497) T ss_pred HhhhhhhHHHHHhhhcccCcccccccchhhhHHHHHHHHhhhhHHhhccccccCCC-ceEEEEEcCCCCcceeeccCccc Confidence 00000 000000 01111112344588999999999999999999998887765 5888864 35577777778877 Q ss_pred CCCCccccceeEeecceeeccchh---hhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccc---- Q lcl|Aclame:pro 75 NATPTQADKNQLVIDTTVIARNTV---AHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAER---- 147 (402) Q Consensus 75 ~~~~~~~~e~~itID~~lya~~~I---ddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~---- 147 (402) +...+..++.++..-.+ +.+.. .-|++. -+ +.+.+.+++++++++..|+.++. +.....|.-. 
T Consensus 217 ~~s~~~f~~i~~~~~k~--a~~~~iS~ell~d~---~~-l~~~i~~~l~~~i~~~~d~~~l~----G~G~~~p~Gil~~~ 286 (497) T protein:vir:78 217 PFSSEEFARVYEQVGKV--ANALTITDEGLRDA---PE-LFNFVQGRLLEGIQRKEEVQLLA----GGGYPGVNGLLQRS 286 (497) T ss_pred ccccccceeeEeeeeee--EeecHhHHHHHHhH---HH-HHHHHHHHHHHHHHHHHHHHhhc----CCCccccccccccc Confidence 76667777766665543 33221 223333 23 57888899999999999988752 1100000000 Q ss_pred ---cccccccccccc--------ccccCCcccccc-----------------------------HHHHHHHHHHHHHHHH Q lcl|Aclame:pro 148 ---NKPRVKGHGFSI--------NVNVTESEALAN-----------------------------PQYVMAAVEYALEQQL 187 (402) Q Consensus 148 ---~~~~~~g~~~~~--------~v~~~~a~~~~~-----------------------------~~~l~dai~~a~~~Ld 187 (402) ............ ............ ...+.+.++.+...+. T Consensus 287 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 366 (497) T protein:vir:78 287 TGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQ 366 (497) T ss_pred ccccccccccchhhhhhhhhhhhhhcccccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhh Confidence 000000000000 000000000000 0011111222222222 Q ss_pred hhcCCccCcEEEeChHHHHHHhc--c--cchhhcccccccCcccccceEEEEeccEEEecCccccccCccccccccccCC Q lcl|Aclame:pro 188 EQEVDISDVAIMMPWKFFNALRD--A--DRIVDKTYTISQSGATINGFVLSSYNCPVIPSNRFPTFAQDQAHHLLSNEDN 263 (402) Q Consensus 188 ekdVP~~gR~~VV~P~~y~~Ll~--~--~r~~n~d~~~~~~g~~~~G~V~~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~ 263 (402) ..+.=..+ ..|++|..|..|.+ | .|++..+......+. ..+...+++|.||+.|+.+|.+. 
T Consensus 367 ~~~~~~~~-~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~~~~-~~~~~~~l~G~pV~~t~~~~~~~------------- 431 (497) T protein:vir:78 367 LTLFQTPN-AVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGN-PVNGGKNIWGVPVVTTPLIPLGT------------- 431 (497) T ss_pred hhcccCCC-eEEEchHHHHHHHHhhcCCCceeccCcccccccc-cccCCceeeceeeEecCCCCCCc------------- Confidence 11110111 46899999998853 3 334322211111111 12334588999999999998532 Q ss_pred ccccceeeeccceeEEeecHHHhhhhhhcccceeeccc-hh---HHHHHHHHHHHhcCcccccceEEEEEEeeccCcc Q lcl|Aclame:pro 264 GYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYE-KK---EKTYYIDTFMAEGAIPDRWEAVSVVTTKRDATTG 337 (402) Q Consensus 264 G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d-~~---~~~d~i~~~~a~Ga~vlRPeaa~vv~~~~~~t~~ 337 (402) -+-+||+...-+++.+. ++++++... .. +-...+++..-++..+++|++.+.|+++..+++. T Consensus 432 ----~~~Gd~~~~~~~i~~r~--------~~~v~~~~~~~~~f~~n~v~~r~~~r~~~~v~~p~A~~~l~~~~~~~~~ 497 (497) T protein:vir:78 432 ----ILVGHFAPSVIQTARRE--------GVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) T ss_pred ----eEEeecccceEEEEEec--------ccEEEeecccchhhhcCcEEEEEEEeecceeeccccEEEEEecCCccCC Confidence 02255543222233222 222222211 11 1122355556689999999999999987666544 No 121 >protein:vir:101650 Length: 497 # NCBI annotation: gp13 # Family: family:all:585 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654768;genbank:gi:109302766;genbank:GeneID:4156084 Probab=98.99 E-value=1.9e-11 Score=79.34 Aligned_cols=299 Identities=12% Similarity=0.051 Sum_probs=148.7 Q ss_pred CCCCc---cccccc-ccccccHHHHHHHHHhHHHHHHHHHHhhhcccceeeeccccceEEeeec--cceeeeeecCCCCC Q lcl|Aclame:pro 1 MSTPN---TLTNVA-VSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYL--GETELQVLAPGQSP 74 (402) Q Consensus 1 Ms~~n---~~t~~~-~~~~~d~~alfle~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~~i--G~~t~~~~~~G~~i 74 (402) +...- ...+-. .+.+++--.+..+.|..+++....+.+.++++.++.++.++ ++.||+. 
|..++....-|+.+ T Consensus 138 ~~~~~~~~~~~~~~~~~~~~~gg~~vp~~~~~~ii~~~~~~~~i~~l~~~~~~~~~-~~~~~~~~~~~~~a~wv~E~~~~ 216 (497) T protein:vir:10 138 FADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSP-NLSYLTESAAHNNAAAVAEAGTY 216 (497) T ss_pred HhhhhhhHHHHHhhhcccCcccccccchhhhHHHHHHHHhhhhHHhhccccccCCC-ceEEEEEcCCCCcceeeccCccc Confidence 00000 000000 01111112344588999999999999999999998887765 5888864 35577777778877 Q ss_pred CCCCccccceeEeecceeeccchh---hhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccc---- Q lcl|Aclame:pro 75 NATPTQADKNQLVIDTTVIARNTV---AHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAER---- 147 (402) Q Consensus 75 ~~~~~~~~e~~itID~~lya~~~I---ddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~---- 147 (402) +...+..++.++..-.+ +.+.. .-|++. -+ +.+.+.+++++++++..|+.++. +.....|.-. T Consensus 217 ~~s~~~f~~i~~~~~k~--a~~~~iS~ell~d~---~~-l~~~i~~~l~~~i~~~~d~~~l~----G~G~~~p~Gil~~~ 286 (497) T protein:vir:10 217 PFSSEEFARVYEQVGKV--ANALTITDEGLRDA---PE-LFNFVQGRLLEGIQRKEEVQLLA----GGGYPGVNGLLQRS 286 (497) T ss_pred ccccccceeeEeeeeee--EeecHhHHHHHHhH---HH-HHHHHHHHHHHHHHHHHHHHhhc----CCCccccccccccc Confidence 76667777766665543 33221 223333 23 57888899999999999988752 1100000000 Q ss_pred ---cccccccccccc--------ccccCCcccccc-----------------------------HHHHHHHHHHHHHHHH Q lcl|Aclame:pro 148 ---NKPRVKGHGFSI--------NVNVTESEALAN-----------------------------PQYVMAAVEYALEQQL 187 (402) Q Consensus 148 ---~~~~~~g~~~~~--------~v~~~~a~~~~~-----------------------------~~~l~dai~~a~~~Ld 187 (402) ............ ............ ...+.+.++.+...+. 
T Consensus 287 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 366 (497) T protein:vir:10 287 TGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQ 366 (497) T ss_pred ccccccccccchhhhhhhhhhhhhhcccccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhh Confidence 000000000000 000000000000 0011111222222222 Q ss_pred hhcCCccCcEEEeChHHHHHHhc--c--cchhhcccccccCcccccceEEEEeccEEEecCccccccCccccccccccCC Q lcl|Aclame:pro 188 EQEVDISDVAIMMPWKFFNALRD--A--DRIVDKTYTISQSGATINGFVLSSYNCPVIPSNRFPTFAQDQAHHLLSNEDN 263 (402) Q Consensus 188 ekdVP~~gR~~VV~P~~y~~Ll~--~--~r~~n~d~~~~~~g~~~~G~V~~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~ 263 (402) ..+.=..+ ..|++|..|..|.+ | .|++..+......+. ..+...+++|.||+.|+.+|.+. T Consensus 367 ~~~~~~~~-~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~~~~-~~~~~~~l~G~pV~~t~~~~~~~------------- 431 (497) T protein:vir:10 367 LTLFQTPN-AVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGN-PVNGGKNIWGVPVVTTPLIPLGT------------- 431 (497) T ss_pred hhcccCCC-eEEEchHHHHHHHHhhcCCCceeccCcccccccc-cccCCceeeceeeEecCCCCCCc------------- Confidence 11110111 46899999998853 3 334322211111111 12334588999999999998532 Q ss_pred ccccceeeeccceeEEeecHHHhhhhhhcccceeeccc-hh---HHHHHHHHHHHhcCcccccceEEEEEEeeccCcc Q lcl|Aclame:pro 264 GYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYE-KK---EKTYYIDTFMAEGAIPDRWEAVSVVTTKRDATTG 337 (402) Q Consensus 264 G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d-~~---~~~d~i~~~~a~Ga~vlRPeaa~vv~~~~~~t~~ 337 (402) -+-+||+...-+++.+. ++++++... .. +-...+++..-++..+++|++.+.|+++..+++. 
T Consensus 432 ----~~~Gd~~~~~~~i~~r~--------~~~v~~~~~~~~~f~~n~v~~r~~~r~~~~v~~p~A~~~l~~~~~~~~~ 497 (497) T protein:vir:10 432 ----ILVGHFAPSVIQTARRE--------GVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) T ss_pred ----eEEeecccceEEEEEec--------ccEEEeecccchhhhcCcEEEEEEEeecceeeccccEEEEEecCCccCC Confidence 02255543222233222 222222211 11 1122355556689999999999999987666544 No 122 >protein:vir:95376 Length: 425 # NCBI annotation: phage major capsid protein # Family: family:all:635 # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764476;genbank:gi:115334630;genbank:GeneID:5179263 Probab=98.98 E-value=5.1e-11 Score=77.02 Aligned_cols=286 Identities=12% Similarity=0.005 Sum_probs=143.2 Q ss_pred CCCCcccccc---------cccccccHHHHHH-HHHhHHHHHHHHHHhhhcccceeeeccccceEEeeec-cceeeeeec Q lcl|Aclame:pro 1 MSTPNTLTNV---------AVSASGEVDSLLI-EKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYL-GETELQVLA 69 (402) Q Consensus 1 Ms~~n~~t~~---------~~~~~~d~~alfl-e~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~~i-G~~t~~~~~ 69 (402) |.......+. ....+.+.-...+ ++|..++++..+..+.+++++++.++. |+ .++|+. +...+..+. T Consensus 119 ~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vP~~~~~~Ii~~l~~~~~i~~~~~~~~~~-g~-~~ip~~~~~~~a~~v~ 196 (425) T protein:vir:95 119 LKTGEYYKRSEVVEFYEKFRNLRAVAGGELTIPEVVVNRIMDIMGDYTTLYPLVDKIRVK-GT-TRILVDTDTSPATWIE 196 (425) T ss_pred HhhhhhhhhhHHHHHHHHHHhhcccccCceeccHHHHHHHHHHHHhhhhHHHhhceeecC-ce-eEEEEecCCccccccc Confidence 1000000000 0000011111222 788999999999999999999888764 44 467765 444555666 Q ss_pred CCCCCCCCC-ccccceeEeecceeeccc-hhhh--HHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccc Q lcl|Aclame:pro 70 PGQSPNATP-TQADKNQLVIDTTVIARN-TVAH--IHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKA 145 (402) Q Consensus 70 ~G~~i~~~~-~~~~e~~itID~~lya~~-~Idd--lDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~ 145 (402) -|.++.... +..++.++..- +++.. .|.+ |++. ..+ +.+.+.+++++++++..|+.++. +.... 
T Consensus 197 E~~~~~~~~~~~f~~i~l~~~--k~~~~~~iS~ell~ds--~~~-l~~~i~~~l~~~i~~~~d~~il~----G~G~~--- 264 (425) T protein:vir:95 197 QSGALPTGDVGTIASIDFDGF--KVGKVTFVDNYLLQDS--IIN-LDDYVTKKIARAIAKALDLAIVK----GTGAA--- 264 (425) T ss_pred cccccccccccccceeeeehe--eeeeeehhhHHHHhcc--HHH-HHHHHHHHHHHHHHHHHHHHhhc----cCCCC--- Confidence 677665443 34566555544 33332 2222 2222 234 67888999999999999997752 10000 Q ss_pred ccccccccccccccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHH-HHhcccchhhc--ccccc Q lcl|Aclame:pro 146 ERNKPRVKGHGFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFN-ALRDADRIVDK--TYTIS 222 (402) Q Consensus 146 ~~~~~~~~g~~~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~-~Ll~~~r~~n~--d~~~~ 222 (402) ...| .|.... ++.............++.|.++...+...+.+...-+.+++|..|+ .|.+-..+.+. .|-. T Consensus 265 -~~~p--~Gil~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~l~~l~~~kd~~g~~i~- 338 (425) T protein:vir:95 265 -NKQP--LGIIPS--LPPENQVTVEADNNLLKNLVKQIGLIDTGDDSVGEIVAVMKRSTYYNRLVEFSIQVDSNGNVVG- 338 (425) T ss_pred -cccc--ceeecc--cccccccccccccchHHHHHHHHHhhhhhccccCceEEEEeChHHHHHHHHHHhhcCCCCceee- Confidence 0000 011110 1111111111112245667777777766555444433455655444 34321111111 1110 Q ss_pred cCcccccceEEEEeccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccch Q lcl|Aclame:pro 223 QSGATINGFVLSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEK 302 (402) Q Consensus 223 ~~g~~~~G~V~~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~ 302 (402) . ..++...+++|.||+.|+++|... =+-+||++. .+.. 
-.++..+...+ T Consensus 339 ~---~~~~~~~~l~G~pvv~~~~~~~~~-----------------i~~Gd~~~~-~~~~---------~~~~~i~~~~~- 387 (425) T protein:vir:95 339 K---LPNLRTPDLLGLRVVFNNFLDDDT-----------------VLFGEFEQY-TLVE---------RENITIDSSTH- 387 (425) T ss_pred c---cCCCCCccccceeeEEcCcCCCcc-----------------EEEEecccE-EEEe---------ecceEEEeecc- Confidence 0 012344578999999999999532 022455441 1111 11222222222 Q ss_pred hHHH---HHHHHHHHhcCcccccceEEEEEEeeccCcc Q lcl|Aclame:pro 303 KEKT---YYIDTFMAEGAIPDRWEAVSVVTTKRDATTG 337 (402) Q Consensus 303 ~~~~---d~i~~~~a~Ga~vlRPeaa~vv~~~~~~t~~ 337 (402) .+|. ..+++.+=++.++++|++.+.++++-.+-++ T Consensus 388 ~~f~~~~~~~~~~~r~d~~~~~~~a~~~~~i~~~~~g~ 425 (425) T protein:vir:95 388 VKFTEDQTAFRGKGRFDGKPVKPEAFVLVTITDPVQGA 425 (425) T ss_pred cccccCceEEEEEEeeCcEeecccceEEEEecCcCCCC Confidence 2232 2344455579999999999988776555444 No 123 >protein:vir:81160 Length: 371 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285811;genbank:gi:148747732;genbank:GeneID:5247203 Probab=98.98 E-value=3.8e-11 Score=77.70 Aligned_cols=271 Identities=12% Similarity=0.020 Sum_probs=148.9 Q ss_pred CCCCcccccccccccccHHH-HHHHHHhHHHHHHHHHHhhhcccceeeeccccc-eEEeeec-cceeeeeecCCCCCCC- Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDS-LLIEKFNGKVNEQYLKGENILSYFDVQTVTGTN-TVSNKYL-GETELQVLAPGQSPNA- 76 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~a-lfle~f~geV~t~f~~~sv~~~~~~~rti~~Gk-sv~f~~i-G~~t~~~~~~G~~i~~- 76 (402) |+..+. ..-. +.=+.|..+++......+.+++++++..+.+++ +..++.. +...+..+..|+.+.. 
T Consensus 91 ~~~~t~----------~~gg~~vP~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~ 160 (371) T protein:vir:81 91 MSEGSN----------QDGGYTVPQDIQTRINELRESKDALQNLITVEPVTTLSGSRVFKKRSQQTGFVEVAEGAAIGEK 160 (371) T ss_pred hccCCC----------ccCceeecHhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCcceeeeccccccccc Confidence 332221 1112 223778999999999999999999988886532 3444544 4567778888887754 Q ss_pred CCccccceeEeecceee-ccchhhhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccccccccc Q lcl|Aclame:pro 77 TPTQADKNQLVIDTTVI-ARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKGH 155 (402) Q Consensus 77 ~~~~~~e~~itID~~ly-a~~~IddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~g~ 155 (402) ..+...+.++....+-. ..+.-.-+++ +.+| +.+.+.+++++++++..|+.++.-.- T Consensus 161 ~~~~f~~i~~~~~k~~~~~~iS~ell~d--s~~~-l~~~i~~~l~~a~~~~~~~~i~~g~g------------------- 218 (371) T protein:vir:81 161 ATPQFTLLQYQVKKYAGFFRVTNELLND--STEA-IVNTLVRWIGDESRVTRNGLIINVLN------------------- 218 (371) T ss_pred cccceeeEEeeeeEEEEeehhhHHHHhh--hhHH-HHHHHHHHHHHHHHHHHHHHHHhhcc------------------- Confidence 45667777776665431 1111122222 2345 68899999999999999987753210 Q ss_pred ccccccccCCccccccHHHHHHHHHHHH-HHHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccccCcccccceEEE Q lcl|Aclame:pro 156 GFSINVNVTESEALANPQYVMAAVEYAL-EQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATINGFVLS 234 (402) Q Consensus 156 ~~~~~v~~~~a~~~~~~~~l~dai~~a~-~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~~~~G~V~~ 234 (402) .+ .+....+ ++.+..+. ..|+...- ..-..|++|..|..|.+-.. 
.+..|- -.+....|.-++ T Consensus 219 -~~------~~~~~~~----~~~i~~~~~~~l~~~~~--~~a~~vmn~~~~~~L~~lkd-~~g~~l--~~~~~~~~~~~~ 282 (371) T protein:vir:81 219 -TK------AKTAIAD----LDGLKQIINVQLDPVFR--STSSVIVNQDAFNWLDTLKD-QNGQYL--LQPSISSPTGRQ 282 (371) T ss_pred -cc------ccccccc----HHHHHHHHHhhcchhhh--cCCEEEEcHHHHHHHHHhhc-cCCCee--eecccCCCCCce Confidence 00 0010111 23333333 23433332 22356899999999864211 111111 111123455578 Q ss_pred EeccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccch-h---HHHHHHH Q lcl|Aclame:pro 235 SYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEK-K---EKTYYID 310 (402) Q Consensus 235 iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~-~---~~~d~i~ 310 (402) ++|.||+.++++|.+.....+. +.+...=+-+||++..-+ +.-.+++.+..+.. . +-...++ T Consensus 283 l~G~pV~~~~~~~~~~~~~~~~-----~~~~~~i~~Gd~~~~~~~---------~~~~~~~i~~~~~~~~~f~~~~v~~~ 348 (371) T protein:vir:81 283 LLGLPVVIVSNKVLANRVDGGT-----GAQFAPIIVGDLKEAVVM---------FDRQRTEIMSSNVAMDAFETDATLWR 348 (371) T ss_pred ecceeEEEecccccCccccccc-----cCCcceEEEEehhceEEE---------EeecceEEEEeccccchhhcCceEEE Confidence 9999999999999654321110 111111133555432211 12222333332221 1 2234566 Q ss_pred HHHHhcCcccccceEEEEEEeec Q lcl|Aclame:pro 311 TFMAEGAIPDRWEAVSVVTTKRD 333 (402) Q Consensus 311 ~~~a~Ga~vlRPeaa~vv~~~~~ 333 (402) +.+-+|.++++|++.+.++.+.- T Consensus 349 ~~~r~d~~~~~~~a~~~~~~~~A 371 (371) T protein:vir:81 349 AIERMDVKMRDDEAFVFGEVQLA 371 (371) T ss_pred EEEeeccEEecccceEEEEEecC Confidence 77779999999999887765433 No 124 >protein:vir:3991 Length: 404 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116499;genbank:gi:14251132;genbank:GeneID:921252 Probab=98.98 E-value=1e-10 Score=75.38 Aligned_cols=287 Identities=12% Similarity=0.013 Sum_probs=151.4 Q ss_pred CCCCcc-cccccccccccHHH-HHHHHHhHHHHHHHHHHhhhcccceeeecccc-ceEEeeec--cceeeeeecCCCCCC Q lcl|Aclame:pro 1 
MSTPNT-LTNVAVSASGEVDS-LLIEKFNGKVNEQYLKGENILSYFDVQTVTGT-NTVSNKYL--GETELQVLAPGQSPN 75 (402) Q Consensus 1 Ms~~n~-~t~~~~~~~~d~~a-lfle~f~geV~t~f~~~sv~~~~~~~rti~~G-ksv~f~~i--G~~t~~~~~~G~~i~ 75 (402) |...+. ..+....+++..-. +.=+.|..++++.....+.+++++++.++.++ .+..++.. +...+..+.-|+.+. T Consensus 105 ~~~~~~~e~~a~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~ 184 (404) T protein:vir:39 105 MAFLNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIP 184 (404) T ss_pred hhhhhhhhhhhhhcccccCCceeccHHHHHHHHHHHHhhhhHHhhcceeeccCCcceEEEEeecCCccceeeecCccccc Confidence 211111 11111111111112 23389999999999999999999998888754 34444443 233455566666665 Q ss_pred C-CCccccceeEeecceeeccchhh--hHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccccc Q lcl|Aclame:pro 76 A-TPTQADKNQLVIDTTVIARNTVA--HIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRV 152 (402) Q Consensus 76 ~-~~~~~~e~~itID~~lya~~~Id--dlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~ 152 (402) . ..+...+.++.+..+- ....|. -+++ +.+| +.+.+.+++++++++..|+.++.-. T Consensus 185 ~~~~~~f~~i~~~~~k~~-~~~~iS~ell~d--s~~~-l~~~i~~~l~~~~~~~~d~~il~g~----------------- 243 (404) T protein:vir:39 185 DLDNPRLTIIKYLIKRYA-GIITATNTLLKD--TAEN-ILAWLSSWIAKKVVVTRNQAIIAAM----------------- 243 (404) T ss_pred cccccceeeEEeeeeeEE-eeehhHHHHHhh--chHH-HHHHHHHHHHHHHHHHHHHHHHhcc----------------- Confidence 4 4567777777777643 112232 2222 3456 7889999999999999999875211 Q ss_pred cccccccccccCCccccccHHHHHHHHHHHHH-HHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccccCcccccce Q lcl|Aclame:pro 153 KGHGFSINVNVTESEALANPQYVMAAVEYALE-QQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATINGF 231 (402) Q Consensus 153 ~g~~~~~~v~~~~a~~~~~~~~l~dai~~a~~-~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~~~~G~ 231 (402) |.+.. .....+ ++.+.++.. .++...-+ . -..|++|..|..|.+-.. .+..|-.. ....++. 
T Consensus 244 ---g~~~~-----~~~~~~----~~~i~~~~~~~~~~~~~~-~-a~~v~n~~~~~~L~~lkd-~~G~~l~~--~~~~~~~ 306 (404) T protein:vir:39 244 ---GTVPK-----KPTIAK----FDDVITMINTSVDPAIIA-T-SSLLTNQSGLNKLALVKT-AEGKYLLE--PDPTKPN 306 (404) T ss_pred ---ccccc-----cccccc----HHHHHHHHHHhhhhhhcc-C-CEEEEcHHHHHHHHHhhc-cCCceeec--cCcCCCC Confidence 11110 011112 233444332 33333322 2 246999999999975211 11111111 1123455 Q ss_pred EEEEeccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccch----hHHHH Q lcl|Aclame:pro 232 VLSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEK----KEKTY 307 (402) Q Consensus 232 V~~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~----~~~~d 307 (402) ..+++|+||+.+.+.+.... +.+...=+-+||+...-++ .-.++..+..+.. .+-.. T Consensus 307 ~~~l~G~pV~~~~~~~~~~~----------~~~~~~~~~gd~~~~~~~~---------~~~~~~i~~~~~~~~~~~~~~~ 367 (404) T protein:vir:39 307 SYLIKGKKVIVVADRWLPNS----------GSTVYPLYYGDMSQAITLF---------DRENMSLLPTNIGAGAFETDTT 367 (404) T ss_pred cceecceeEEEecccccCcc----------CCCccEEEEEeccccEEEE---------eecceEEEEeccchhhhhhcee Confidence 57899999998766432211 0111112345555422221 1233333333322 12234 Q ss_pred HHHHHHHhcCcccccceEEEEEEeeccCccccccchhhH Q lcl|Aclame:pro 308 YIDTFMAEGAIPDRWEAVSVVTTKRDATTGDAGGPGDDH 346 (402) Q Consensus 308 ~i~~~~a~Ga~vlRPeaa~vv~~~~~~t~~~a~~~~~~~ 346 (402) .+++.+-||..+++|++.+.++++.-+ +..++..++- T Consensus 368 ~~r~~~r~d~~~~~~~a~~~~~~~~~a--~~~~~~~~~~ 404 (404) T protein:vir:39 368 KIRVIDRFDVKTTDSEALVAGSFTAIA--DQVGNFTAGK 404 (404) T ss_pred eEEEEeeeccEEecccceEEEEeeccc--cCCCCCCCCC Confidence 566778899999999998887776543 2222222222 No 125 >protein:vir:8420 Length: 477 # NCBI annotation: gp15 # Family: family:all:21 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818316;genbank:gi:29566752;genbank:GeneID:1260033 Probab=98.98 E-value=8.5e-11 Score=75.80 Aligned_cols=304 Identities=13% Similarity=0.076 Sum_probs=147.3 Q ss_pred 
CCCCcccccccccccccHHHHHHH-HHhHHHHHHHHHHhhhcccceeeeccc-cceEEeeec--cceeeeeecCCCCCCC Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDSLLIE-KFNGKVNEQYLKGENILSYFDVQTVTG-TNTVSNKYL--GETELQVLAPGQSPNA 76 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~alfle-~f~geV~t~f~~~sv~~~~~~~rti~~-Gksv~f~~i--G~~t~~~~~~G~~i~~ 76 (402) +......+. .++..-.+.+. .+.+++.......++++++++++++.+ +.++.||++ |........-|..+.. T Consensus 151 ~~~~~~~~~----~~~~gg~lv~~~~~~~~ii~~l~~~~~i~~~~~~~~~~~~~~~~~ip~~~~~~~~a~~~~Eg~~~~~ 226 (477) T protein:vir:84 151 GEEYRDLDR----NGGTGGYAVPPLWMMNRFIELARAGRTYANLCPTEPLPGGTSSINIPKILTGTSTAIQAADNAALTA 226 (477) T ss_pred hhhhccccc----cCCCcceeeccchhHHHHHHHhhhcchHHHhhceeeecCCcceeEEEEEecCcceeeeeccCccccc Confidence 111111111 11111223443 346788888888899999999998875 567999986 2222333433444332 Q ss_pred CC-ccc--cceeEeecceeeccchhhhHHHhh-cCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccccc Q lcl|Aclame:pro 77 TP-TQA--DKNQLVIDTTVIARNTVAHIHDVQ-GDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRV 152 (402) Q Consensus 77 ~~-~~~--~e~~itID~~lya~~~IddlDe~q-~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~ 152 (402) +. +.+ .=..++++..+++....-.-+=.+ +.+| +.+.+.++++++++++.|+.++ .+..... . | T Consensus 227 ~~~~~s~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~-l~~~i~~~l~~~~~~~~d~~~l----~G~Gt~~----~-p-- 294 (477) T protein:vir:84 227 PSAHEVDLTDGFVQANVKTIAGQQGIAIQLLDQAAVS-VDEFVFRDLAADYANKLNVQVI----SGTGSNN----Q-V-- 294 (477) T ss_pred ccccccccceeeEEEeeeeEEeeeHHHHHHHhccchh-HHHHHHHHHHHHHHHHHHHHHh----ccCCCCC----c-c-- Confidence 21 111 112344444445444433322222 2456 7889999999999999998775 2221110 0 0 Q ss_pred ccccc---ccccccCC-ccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhc--cc--chhh-ccccc-- Q lcl|Aclame:pro 153 KGHGF---SINVNVTE-SEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRD--AD--RIVD-KTYTI-- 221 (402) Q Consensus 153 ~g~~~---~~~v~~~~-a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~--~~--r~~n-~d~~~-- 221 (402) .|... ...++.+. .....+...+++.|.++...++....-. ..+.|++|..|..|.+ |. |.+. .++.. 
T Consensus 295 ~Gi~~~~~~~~~~~~~~~~t~~~~~~~~~~i~~~~~~~~~~~~~~-~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~ 373 (477) T protein:vir:84 295 VGVRATAGITQVTATSAGSALEKHQIIYQKIADAIQRVHTSRFLE-PEVIVMHPRRWASFHAIFAGDDRPLIVPSGPGFN 373 (477) T ss_pred ceeeeccccccccccccccchhhHHHHHHHHHHHHhhccccccCC-ccEEEEcHHHHHHHHHhhccCCCeeeecCccccc Confidence 01110 01111111 1122233456777777776665443322 2345888998888854 32 2221 11110 Q ss_pred ---ccCcccccceEEEEeccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceee Q lcl|Aclame:pro 222 ---SQSGATINGFVLSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDI 298 (402) Q Consensus 222 ---~~~g~~~~G~V~~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~ 298 (402) ...+...+|..++++|+||+.|+.+|...+..+ ....-+-++|+.. +++. ..+ +. ....+. T Consensus 374 ~~~~~~~~~~~~~~~~l~G~pVv~s~~~p~~~~~~~---------d~~~i~~gd~~~~--~i~~-~~~---~~-~~~~~~ 437 (477) T protein:vir:84 374 NLGVLTEVASQRVVGQMHGLPVVTDPTLPTTLGTGT---------DQDVIHVLRASDL--ALFE-SSV---RM-RALQET 437 (477) T ss_pred ccccccccccccccchhcccceEecCcccccccccC---------CcceEEEEEeceE--EEEe-ece---eE-Eecccc Confidence 112223456667899999999999995321110 0001133556542 2221 111 00 111111 Q ss_pred ccchhHHHHHHHHHHHhcCcccc-cceEEEEEEeeccCcccc Q lcl|Aclame:pro 299 FYEKKEKTYYIDTFMAEGAIPDR-WEAVSVVTTKRDATTGDA 339 (402) Q Consensus 299 ~~d~~~~~d~i~~~~a~Ga~vlR-Peaa~vv~~~~~~t~~~a 339 (402) +.+.......+.+++.+ ..+| |++.+.|+....++|--+ T Consensus 438 ~~~~~~~~~~v~~~~~~--~~~r~~~afv~~t~~~~~~~~~~ 477 (477) T protein:vir:84 438 RAENLSVLLQVYGYLAF--TAARFPQSVVEIGGTALTAPTFA 477 (477) T ss_pred ccccceeeeeehhhhhh--hhhccccceEEeecccccccccC Confidence 21111111123344444 4555 999988877666655444 No 126 >protein:vir:9361 Length: 402 # NCBI annotation: SLT orf 37-like protein # Family: family:all:658 # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803339;genbank:gi:29028650;genbank:GeneID:1258088 Probab=98.97 E-value=5.1e-12 Score=82.50 Aligned_cols=273 Identities=12% Similarity=0.069 Sum_probs=145.4 Q ss_pred 
CCCCcccccccccccccHHHHHH-HHHhHHHHHHHHHHhhhcccceeeeccccceEEeeec--cceeeeeecCCCCCCCC Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDSLLI-EKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYL--GETELQVLAPGQSPNAT 77 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~alfl-e~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~~i--G~~t~~~~~~G~~i~~~ 77 (402) |.... ..+....+++..-...| +.|+.++++.....+.++++.+++++.+ .++|++ +..++..+.-|+.+... T Consensus 124 ~~~~~-~~~a~~~~t~~~GG~lIP~~~~~~Ii~~~~~~~~l~~~~~v~~~~~---~~~p~~~~~~~~a~~v~Eg~~~~~~ 199 (402) T protein:vir:93 124 MEAQR-LLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKG---LEIPRVSYTLDDDDFITDVETAKEL 199 (402) T ss_pred HhHHH-HHhhhccCCCcCCccccchhHHHHHHHhHHhhhhhhhhceeeecCC---ceeeeeeccCCcccccccccccccc Confidence 00000 00000001111112344 8889999999999999999999888754 334543 44556667777777666 Q ss_pred CccccceeEeecceeeccch-hh--hHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccccccc Q lcl|Aclame:pro 78 PTQADKNQLVIDTTVIARNT-VA--HIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKG 154 (402) Q Consensus 78 ~~~~~e~~itID~~lya~~~-Id--dlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~g 154 (402) .+..++.++.+.. ++... |. -|++ +.+| +.+.+.+++++++++..++.+|.... + .| T Consensus 200 ~~~f~~i~~~~~k--~~~~i~iS~ell~D--s~~~-l~~~i~~~la~~~~~~e~~~~~~~g~------------g---~g 259 (402) T protein:vir:93 200 KAKGDTVKFTTNK--FKVFAAISDTVIHG--SDVD-LVNWVENALQSGLAAKERKDALAVSP------------K---SG 259 (402) T ss_pred ccccceeeeccee--eeeechhhHHHHhh--hHHH-HHHHHHHHHHHHHHHHHHHhHhhcCC------------C---cc Confidence 6777776666544 33332 22 2332 2455 67888899999999877665542110 0 01 Q ss_pred cccccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccccCcccccceEEE Q lcl|Aclame:pro 155 HGFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATINGFVLS 234 (402) Q Consensus 155 ~~~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~~~~G~V~~ 234 (402) ...+..... .....+....+|.|.++...|+..+.. ...| |+++..|..|++-.+=-++. 
+..|.=.+ T Consensus 260 ~p~g~~~~~--~~~~~~~~~~~d~l~~~~~~l~~~y~~-na~~-imn~~t~~~~~~~~~d~~~~--------~~~~~~~~ 327 (402) T protein:vir:93 260 LEHMSFYNG--SVKEVEGADMYDAIINALADLHEDYRD-NATI-YMRYADYVKIISVLSNGTTN--------FFDTPAEK 327 (402) T ss_pred ccceeeecc--ccccccccchHHHHHHHHhccChhhhc-CCEE-EEechHHHHHHHHHhcCCCc--------ccccCCcc Confidence 111111100 111122344688888888888877654 5567 56666665554311101121 22233346 Q ss_pred EeccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccchhHHHHHHHHHHH Q lcl|Aclame:pro 235 SYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKEKTYYIDTFMA 314 (402) Q Consensus 235 iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~~~~d~i~~~~a 314 (402) +.|.||+.++..|.. +-+||+..... +. .+..+.+++.......+.+.+= T Consensus 328 llG~PV~~t~~~~~i-------------------~~GDf~~~~~~-~~----------~~~~~~~~~~~~~~~~~~~~~r 377 (402) T protein:vir:93 328 VFGKPVVFTDAAVKP-------------------IVGDFNYFGIN-YD----------GTTYDTDKDVKKGEYLFVLTAW 377 (402) T ss_pred ccccceEEecCCCce-------------------eeechhhhhhh-hh----------hhhhhhhhcccCCceEEEEEEE Confidence 899999998866521 22444432111 11 1112223222221122233444 Q ss_pred hcCcccccceEEEEEEeeccCcccccc Q lcl|Aclame:pro 315 EGAIPDRWEAVSVVTTKRDATTGDAGG 341 (402) Q Consensus 315 ~Ga~vlRPeaa~vv~~~~~~t~~~a~~ 341 (402) ++.++++|+|...++.+..+ +++|+ T Consensus 378 ~Dg~v~~~~A~~~l~ik~~~--~~~~~ 402 (402) T protein:vir:93 378 YDQQRTLDSAFRIAKAKENT--GPLPS 402 (402) T ss_pred eCcEEechhheEEEEeecCC--CCCCC Confidence 89999999999988887653 44444 No 127 >protein:vir:4226 Length: 326 # NCBI annotation: observed 35.2Kd protein # Family: family:all:507 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039681;swissprot:sw:q05223;genbank:gi:9625447;uniprot:Q05223;genbank:GeneID:2942929 Probab=98.95 E-value=1.3e-10 Score=74.80 Aligned_cols=302 Identities=9% Similarity=-0.059 Sum_probs=150.4 Q ss_pred CCCC--------cc-cccccccccccHHHHHHHHHhHHHHHHHHHHhhhcccceeeeccccceEEeeec-cceeeeeecC Q lcl|Aclame:pro 1 
MSTP--------NT-LTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYL-GETELQVLAP 70 (402) Q Consensus 1 Ms~~--------n~-~t~~~~~~~~d~~alfle~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~~i-G~~t~~~~~~ 70 (402) |..- .. -.+.-.-++++.-.+.-+.+..++++...+.+.++++.++.++. +++.+||+. +...+..+.- T Consensus 1 ~~~~~~r~~~~~~~~e~~a~~~~~~~~g~~ip~~~~~~ii~~~~~~s~i~~~~~~~~~~-~~~~~~p~~~~~~~a~~v~E 79 (326) T protein:vir:42 1 MAVNPDRTTPFLGVNDPKVAQTGDSMFEGYLEPEQAQDYFAEAEKISIVQQFAQKIPMG-TTGQKIPHWTGDVSASWIGE 79 (326) T ss_pred CCCCccchhhhcCcchhhheeccccCCcceechhhHHHHHHHHHhcchhhhhcceeecc-CCceEEEEEeCCcceEEecC Confidence 2211 10 11111111111123555889999999999999999988777665 456778764 6677888888 Q ss_pred CCCCCCCCccccceeEeecceeeccchhhhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccc Q lcl|Aclame:pro 71 GQSPNATPTQADKNQLVIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKP 150 (402) Q Consensus 71 G~~i~~~~~~~~e~~itID~~lya~~~IddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~ 150 (402) |+.+....+..++.++....+ ..-..|.+-=--++.+| +.+.+.+++++++++.+|+.++. +.....|.. T Consensus 80 g~~~~~~~~~f~~i~~~~~k~-~~~v~iS~ell~~s~~~-~~~~i~~~l~~a~~~~~d~a~l~----G~gs~~p~g---- 149 (326) T protein:vir:42 80 GDMKPITKGNMTSQTIAPHKI-ATIFVASAETVRANPAN-YLGTMRTKVATAFAMAFDNAAIN----GTDSPFPTF---- 149 (326) T ss_pred CccccccccceeEEEEeeEEE-EEeehhhHHHHhcCHHH-HHHHHHHHHHHHHHHHHHHHhhc----ccCCCcccc---- Confidence 888888778888877777653 22333322111124466 78899999999999999998852 211111100 Q ss_pred cccccccccccccCCccccccHHHHHHH-HHHHHHHHHhhcCCccCcEEEeChHHHHHHhc--c--cchhhcccccccCc Q lcl|Aclame:pro 151 RVKGHGFSINVNVTESEALANPQYVMAA-VEYALEQQLEQEVDISDVAIMMPWKFFNALRD--A--DRIVDKTYTISQSG 225 (402) Q Consensus 151 ~~~g~~~~~~v~~~~a~~~~~~~~l~da-i~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~--~--~r~~n~d~~~~~~g 225 (402) .........................+. +..+...+. 
+.....-..|++|..|..|.+ | .+.+..+ ...++ T Consensus 150 -i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~--~~~~~ 224 (326) T protein:vir:42 150 -LAQTTKEVSLVDPDGTGSNADLTVYDAVAVNALSLLV--NAGKKWTHTLLDDITEPILNGAKDKSGRPLFIE--STYTE 224 (326) T ss_pred -ccccccccceeecccccccccchhHHHHHHHHHhhhh--hhccCccEEEEeHHHHHHHHHhhccCCceeecc--ccccC Confidence 000000000000000000000011111 122222222 222233345799999999964 2 2222111 11122 Q ss_pred ccccceEEEEeccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcc---cceeeccch Q lcl|Aclame:pro 226 ATINGFVLSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIE---VTGDIFYEK 302 (402) Q Consensus 226 ~~~~G~V~~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~d---l~~e~~~d~ 302 (402) ......-+++.|+||+.++++|.... .-+.+||++. ++..+..+ .++..+ ++.....+. T Consensus 225 ~~~~~~~~~l~G~pv~~~~~~~~~~~---------------~~~~Gd~s~~--~~~~~~~~-~v~~~~e~~~~~~~~~~~ 286 (326) T protein:vir:42 225 ENSPFRLGRIVARPTILSDHVASGTV---------------VGYQGDFRQL--VWGQVGGL-SFDVTDQATLNLGTPQAP 286 (326) T ss_pred ccccccCceeeeeeEEEcCCCCCCce---------------EEEEeecceE--EEEEecce-EEEEeecceeeecccccc Confidence 22223346789999999999985321 0133455543 22222211 111100 000000001 Q ss_pred h----HHH--HHHHHHHHhcCcccccceEEEEEEeeccCccc Q lcl|Aclame:pro 303 K----EKT--YYIDTFMAEGAIPDRWEAVSVVTTKRDATTGD 338 (402) Q Consensus 303 ~----~~~--d~i~~~~a~Ga~vlRPeaa~vv~~~~~~t~~~ 338 (402) . .+. 
..+++.+-++.+++||+|.+.|+ ..+++++ T Consensus 287 ~~~~~~~~d~~~~r~~~~~d~~v~~~~a~~~l~--~~~~~~~ 326 (326) T protein:vir:42 287 NFVSLWQHNLVAVRVEAEYAFHCNDKDAFVKLT--NVDATEA 326 (326) T ss_pred cchhhhhcCcEEEEEEEEeccEEecccceEEEe--eccccCC Confidence 1 111 33466777899999999876653 3333333 No 128 >protein:vir:2504 Length: 305 # NCBI annotation: major capsid subunit gp9 # Family: family:all:507 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569745;genbank:gi:18496895;genbank:GeneID:932268 Probab=98.95 E-value=6.2e-11 Score=76.53 Aligned_cols=283 Identities=11% Similarity=0.004 Sum_probs=144.7 Q ss_pred CCCCcccccccccccccHHHHHHHHHhHHHHHHHHHHhhhcccceeeeccccceEEeeec-cceeeeeecCCCCCCCCC- Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYL-GETELQVLAPGQSPNATP- 78 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~alfle~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~~i-G~~t~~~~~~G~~i~~~~- 78 (402) |+.-++...+ .+.=+.++.++++..++.+.++++.++.++.+ ++.+||+. +...+..+.-|+...... T Consensus 1 ma~~t~~~gg---------~liP~~~~~~Ii~~~~~~s~l~~l~~~~~~~~-~~~~~p~~~~~~~a~wv~E~~~~~~~~~ 70 (305) T protein:vir:25 1 MADISRAEVA---------SLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGT-KTTHLPVLATLPEADWVGESATDPKGVK 70 (305) T ss_pred CCCccCCccc---------eecCHHHHHHHHHHHHhhchhhhhcceeeccC-CcEEEEEEeCCcceEEeecccccccccc Confidence 6665533321 13338889999999999999999999888765 46778865 566777777776654432 Q ss_pred ----ccccceeEeecceeeccchhhhHHHhh-cCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccccccc Q lcl|Aclame:pro 79 ----TQADKNQLVIDTTVIARNTVAHIHDVQ-GDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVK 153 (402) Q Consensus 79 ----~~~~e~~itID~~lya~~~IddlDe~q-~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~ 153 (402) +...+ +.+...++.....-.-+=.+ +.+| +.+.+.+++++++++.+|+.++.- -. .+ .+.... 
T Consensus 71 ~~s~~~f~~--i~~~~~k~~~~~~is~ell~ds~~~-~~~~i~~~l~~~~a~~~d~a~~~G----~g--~~---~~~~~~ 138 (305) T protein:vir:25 71 PTSKVTWAN--RTLVAEEIAVIIPVHENVIDDATVA-VLTEVAELGGQAIGKKLDQAVIFG----TD--KP---ASWVSP 138 (305) T ss_pred cccccceee--EEeeeEEEEEeehhhHHHHhcchHH-HHHHHHHHHHHHHHHHHhhhheec----cC--CC---CCcccc Confidence 22333 34444444443322222222 3455 688999999999999999988631 10 00 000000 Q ss_pred ccc-ccccc--ccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccccCcccccc Q lcl|Aclame:pro 154 GHG-FSINV--NVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATING 230 (402) Q Consensus 154 g~~-~~~~v--~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~~~~G 230 (402) +.. ..... .............+++.+..+...+....-... -++++|..|..|.+ +.+. .+...+.. T Consensus 139 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~v~~~~~~~~l~~---lkd~----~G~~i~~~- 208 (305) T protein:vir:25 139 ALIPAAVTAGQAVEVVGGVANESDIVGATNRAAKAVASAGWAPD--TLLSSLALRYEVAN---IRDA----NGNPVFRD- 208 (305) T ss_pred ccccccccccccccccccchhhhHHHHHHHHHHHhhhhcccccc--eeEecHHHHHHHHH---hhcc----CCceeecC- Confidence 000 00000 001111112223455555555555443221111 25889999999854 2111 11122222 Q ss_pred eEEEEeccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccch-----h-- Q lcl|Aclame:pro 231 FVLSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEK-----K-- 303 (402) Q Consensus 231 ~V~~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~-----~-- 303 (402) ..++|+||+.++++|...+. + .-+-+||++.. +.-+ .++..+..++. . 
T Consensus 209 --~~l~G~Pv~~~~~~~~~~~~--~-----------~~~~gd~s~~~--i~~~--------~~~~i~~~~~~~~~~~~~~ 263 (305) T protein:vir:25 209 --DSFAGFRTFFNRNGAWDADA--A-----------IEVIADSSRVK--IGVR--------QDITVKFLDQATLGTGENQ 263 (305) T ss_pred --CcccccceEEcCccCCCCCc--c-----------EEEEEecceEE--EEEe--------cCeEEEEeeeeeeecCCce Confidence 26899999999999853321 1 11335555421 1111 11111111110 0 Q ss_pred ---HHH--HHHHHHHHhcCcccccceEEEEEEeeccCccccccc Q lcl|Aclame:pro 304 ---EKT--YYIDTFMAEGAIPDRWEAVSVVTTKRDATTGDAGGP 342 (402) Q Consensus 304 ---~~~--d~i~~~~a~Ga~vlRPeaa~vv~~~~~~t~~~a~~~ 342 (402) .+. -.+++..-+|.+++||++++.++... .+.++|.. T Consensus 264 ~~~~~~~~~~~R~~~r~~~~v~~p~a~v~~~~~~--~~~~~pa~ 305 (305) T protein:vir:25 264 INLAERDMVALRLKARFAYVLGVSATAQGANKTP--VAVVAPAA 305 (305) T ss_pred eeeeecCcEEEEEEEeecceeeCcccEEEEcccc--ccccCCCC Confidence 011 12334445788999999876664432 22222222 No 129 >protein:vir:5974 Length: 324 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690674;genbank:geneid:6329212;genbank:gi:22855068;goa:Q38582;uniprot:Q38582;genbank:GeneID:955303 Probab=98.92 E-value=8e-11 Score=75.95 Aligned_cols=304 Identities=14% Similarity=0.087 Sum_probs=182.8 Q ss_pred CCCCcccccccccccccHHHHHH-HHHhHHHHHHHHHHhhhcc------cceeeec----cccceEEeeeccce--eeee Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDSLLI-EKFNGKVNEQYLKGENILS------YFDVQTV----TGTNTVSNKYLGET--ELQV 67 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~alfl-e~f~geV~t~f~~~sv~~~------~~~~rti----~~Gksv~f~~iG~~--t~~~ 67 (402) |+. .---++|+ |+|.-.|.....+.+.|.. .-.+... .+|+++.+|..+.+ ..+. 
T Consensus 1 MA~------------T~lsd~i~peVf~~yv~~~~~~~~~l~qSg~i~~~a~i~~~l~~~~~G~~i~~P~~~~l~Gd~~~ 68 (324) T protein:vir:59 1 MAY------------TKISDVIVPELFNPYVINTTTQLSAFFQSGIAATDDELNALAKKAGGGSTLNMPYWNDLDGDSQV 68 (324) T ss_pred CCc------------eeeeceechhHHHHHHHhhhHHHHHHhhcccccccHHHHHHhhccCCCCEEEecccccCCCcccc Confidence 762 11135666 9999999888888877732 2222221 37999999998876 4677 Q ss_pred ecCCCCCCCCCccccceeEeecceeeccchhhhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccc Q lcl|Aclame:pro 68 LAPGQSPNATPTQADKNQLVIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAER 147 (402) Q Consensus 68 ~~~G~~i~~~~~~~~e~~itID~~lya~~~IddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~ 147 (402) +.-|+.|..+.+...+..-+|= ...-.+.+.|+-...+--| .-.+++++.+.++++..|..++..| +++....... T Consensus 69 v~~~~~i~~~~l~t~~~~a~i~-~~~k~~~~tD~a~~~sg~d-p~~~i~~q~a~~~~~~~~~~lia~l-~g~~~~~~~~- 144 (324) T protein:vir:59 69 LNDTDDLVPQKINAGQDKAVLI-LRGNAWSSHDLAATLSGSD-PMQAIGSRVAAYWAREMQKIVFAEL-AGVFSNDDMK- 144 (324) T ss_pred cCCCcccchhhcccceeeEEEE-eecCceeehhhhhhhccch-HHHHHHHHHHHHHHHHHHHHHHHHH-HHhhhccccc- Confidence 8888888888888777666554 3455677888877766655 6778999999999999888887766 3443221110 Q ss_pred ccccccccccccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccccCccc Q lcl|Aclame:pro 148 NKPRVKGHGFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGAT 227 (402) Q Consensus 148 ~~~~~~g~~~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~~ 227 (402) ++. ..+.. .+....+ ++.|.+|..+|.++. ..-..++|.|..|..|.+.. +++. 
-...+ T Consensus 145 ------~~~--~dvsa-~~~~~~s----~~~l~~A~~~~GD~~--~~~~~ivmhS~v~~~L~~~~-li~~--~~~s~--- 203 (324) T protein:vir:59 145 ------DNK--LDISG-TADGIYS----AETFVDASYKLGDHE--SLLTAIGMHSATMASAVKQD-LIEF--VKDSQ--- 203 (324) T ss_pred ------cce--eeeec-cccceec----HHHHHHHHHHhCCcc--cCcEEEEEchHHHHHHHHhh-hhhh--ccccc--- Confidence 000 01111 1111122 356778888886542 23457899999999998764 4432 21122 Q ss_pred ccceEEEEeccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhc-ccceeeccchhHHH Q lcl|Aclame:pro 228 INGFVLSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTI-EVTGDIFYEKKEKT 306 (402) Q Consensus 228 ~~G~V~~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~-dl~~e~~~d~~~~~ 306 (402) .++.|+...|.+|+.+..+|..... |... +...++|-+-|++..... ++..|..|++.+.. T Consensus 204 ~~~~i~~~~G~~VivdD~~p~~~~~-----------~~~~-------~y~s~l~~~GAi~~~~~~~~v~vE~dRd~~~g~ 265 (324) T protein:vir:59 204 SGIRFPTYMNKRVIVDDSMPVETLE-----------DGTK-------VFTSYLFGAGALGYAEGQPEVPTETARNALGSQ 265 (324) T ss_pred cCceeeeecccEEEEeCCCCccccC-----------CCCc-------eEEEEEEecCeEEEeecCCCcceecccCccccc Confidence 2467899999999999999963211 1111 223466777788877655 46789999998877 Q ss_pred HHHHHHHHhcCcccccceEEEEEEeeccCccccccchhhHHH------hhhcccceEEEeecchhh Q lcl|Aclame:pro 307 YYIDTFMAEGAIPDRWEAVSVVTTKRDATTGDAGGPGDDHAT------VLARAQRKAVYVKTEGAA 366 (402) Q Consensus 307 d~i~~~~a~Ga~vlRPeaa~vv~~~~~~t~~~a~~~~~~~~~------~~~~~~~~~~~~~~~~~~ 366 (402) +.+...+-|...++ -++.+.....+..|++ ++.+. 
|.-.-+=..+..+.-..| T Consensus 266 ~~l~~r~~~~~~p~------G~s~~~~~~~~~sPt~-~~L~~~~NW~~v~~~k~i~i~~~~~~~~~ 324 (324) T protein:vir:59 266 DILINRKHFVLHPR------GVKFTENAMAGTTPTD-EELANGANWQRVYDPKKIRIVQFKHRLQA 324 (324) T ss_pred eEEEEeeEEEeEee------eEEecccccCCCCCCh-hhhcCCcccccccCccccceEEEEeeccC Confidence 77766666654444 1444333223334443 22221 111111122223332222 No 130 >protein:vir:93616 Length: 645 # NCBI annotation: putative major head protein/prohead protease # Family: family:all:21 # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449293;genbank:gi:157166041;goa:Q6H9U8;interpro:IPR006433;uniprot:Q6H9U8;genbank:GeneID:5580438 Probab=98.91 E-value=1.2e-10 Score=74.99 Aligned_cols=296 Identities=15% Similarity=0.024 Sum_probs=146.3 Q ss_pred CCCCc-ccccccccccccHHHHHH-HHHhHHHHHHHHHHhhhccccee--eeccc-cceEEeee-ccceeeeeecCCCCC Q lcl|Aclame:pro 1 MSTPN-TLTNVAVSASGEVDSLLI-EKFNGKVNEQYLKGENILSYFDV--QTVTG-TNTVSNKY-LGETELQVLAPGQSP 74 (402) Q Consensus 1 Ms~~n-~~t~~~~~~~~d~~alfl-e~f~geV~t~f~~~sv~~~~~~~--rti~~-Gksv~f~~-iG~~t~~~~~~G~~i 74 (402) ++... ..+.+.+. -.+.+ +.|.++++......++++.+-.. ...++ -..+++|+ .|..++..+.-|+.+ T Consensus 332 ~a~~~~~~~~~~~~-----Gg~~vp~~~~~~ii~~l~~~svv~~l~~~~~~~~~~~~~~~~ip~~t~~~~a~wv~Eg~~~ 406 (645) T protein:vir:93 332 SAVGAGTTTDPQWA-----GSLSEYQEYAQDFIDYLRPQTIIGRFGQGGIPALRQVPFNIRVHAQVSGGAAGWVGEGKTK 406 (645) T ss_pred hhhhcccccccccc-----CCccCchhhHHHHHHhhhhhhhHHhhccccccccccccCceeeeeeecCcceEEeccCccc Confidence 11110 01111111 12333 77888888888888888766432 12221 12456775 477788888888888 Q ss_pred CCCCccccceeEeecceeeccch-hhh--HHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccccc Q lcl|Aclame:pro 75 NATPTQADKNQLVIDTTVIARNT-VAH--IHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPR 151 (402) Q Consensus 75 ~~~~~~~~e~~itID~~lya~~~-Idd--lDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~ 151 (402) ....+..+++++..- +++... |.+ |+ ++..| +.+.+.+++++++++..|+.++.--- ++ ... 
T Consensus 407 ~~s~~~f~~v~l~~~--kla~~~~iS~ell~--ds~~~-~~~~i~~~l~~aia~~~d~a~l~g~g-~~---------~~~ 471 (645) T protein:vir:93 407 PLTKFDFESITFSHA--KVSAIAVLTEELIR--FSSPA-ADALVRNALAEAVVARLDTDFVDPKK-AA---------VAD 471 (645) T ss_pred cccccceeEEEEeeE--EEEEeehhHHHHHh--hchHH-HHHHHHHHHHHHHHHHHHHHhhcCCC-cc---------cCC Confidence 777777777666553 333322 222 22 33455 67889999999999999998862110 00 000 Q ss_pred ccccccccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccccCcccccce Q lcl|Aclame:pro 152 VKGHGFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATINGF 231 (402) Q Consensus 152 ~~g~~~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~~~~G~ 231 (402) . .+.+.... ..........++-+..+...|..+++...+-+.|++|..+..|.+-.. -|..+-- .+... .| T Consensus 472 ~--~p~gi~~~---~~~~~~~~~~~~d~~~~~~~~~~a~~~~~~a~~vmn~~~~~~L~~lkd-~~G~~~~-~~~~~-~~- 542 (645) T protein:vir:93 472 V--SPASITHD---VKGTASSGNPDADAEAAFGQFVAANLQPTGAVWLMSSTNALALSMRKN-ALGQKEY-PDMTL-LG- 542 (645) T ss_pred c--cccceecc---ccccccccchHHHHHHHHHHHHhcCCCccccEEEEcHHHHHHHHhccc-cCCceee-cCCCC-CC- Confidence 0 01111100 000001111233455666777777876666566899999999865321 1111110 01111 12 Q ss_pred EEEEeccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeec-c-c-----hhH Q lcl|Aclame:pro 232 VLSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIF-Y-E-----KKE 304 (402) Q Consensus 232 V~~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~-~-d-----~~~ 304 (402) ++++|.||+.|+++|..- .+. .-..+ +-+++... -+-+.+.|-. +..+-....+ . . ... 
T Consensus 543 -~tL~G~PV~~s~~vp~~~------~~g---d~s~~-~ig~~~~v-~i~~s~~a~~--~~~~~~~~~~~~~~~~~~v~lf 608 (645) T protein:vir:93 543 -GSFQGLPVIVSQYVGDQL------VLV---NAPDI-YLADDGGV-AVDMSREASL--EMQSEPTGDSTTPSPVELVSMF 608 (645) T ss_pred -ceeeceeeEEeccCCcce------eEe---ccccE-EEEEecce-EEEeecceeE--EEeecccccccccccccchhHh Confidence 378999999999998411 110 00011 11111111 1111111110 0000000000 0 0 001 Q ss_pred HH--HHHHHHHHhcCcccccceEEEEEEeeccCccccccc Q lcl|Aclame:pro 305 KT--YYIDTFMAEGAIPDRWEAVSVVTTKRDATTGDAGGP 342 (402) Q Consensus 305 ~~--d~i~~~~a~Ga~vlRPeaa~vv~~~~~~t~~~a~~~ 342 (402) +- -.|++.+-++-+++||+|.++|+ +++=|+++.. T Consensus 609 ~~d~vaira~~r~d~~~~~p~a~~~lt---~~~~g~~~~~ 645 (645) T protein:vir:93 609 QTGSVAIRAERWINWRRRRTAAVAVIT---GVNYGSASGG 645 (645) T ss_pred hcCceEEEEEEEEcceeeCccceEEEe---cccCCcccCC Confidence 11 23455566788999999988775 5666666655 No 131 >protein:vir:105004 Length: 392 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459969;genbank:gi:85701384;genbank:GeneID:3882145 Probab=98.91 E-value=1.9e-10 Score=73.90 Aligned_cols=284 Identities=11% Similarity=0.036 Sum_probs=148.1 Q ss_pred CC------------CCcccccccccccccHHHHHH-HHHhHHHHHHHHHHhhhcccceeeeccccc-eEEee-eccceee Q lcl|Aclame:pro 1 MS------------TPNTLTNVAVSASGEVDSLLI-EKFNGKVNEQYLKGENILSYFDVQTVTGTN-TVSNK-YLGETEL 65 (402) Q Consensus 1 Ms------------~~n~~t~~~~~~~~d~~alfl-e~f~geV~t~f~~~sv~~~~~~~rti~~Gk-sv~f~-~iG~~t~ 65 (402) |- ....-.+....+++..-...| +.|.+++...-...+.++++.+++.+.++. 
...++ ..+...+ T Consensus 84 l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~~a 163 (392) T protein:vir:10 84 LRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPF 163 (392) T ss_pred HhcccccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCccc Confidence 00 000011111112222223334 888999999999999999999999987532 33344 3455677 Q ss_pred eeecCCCCCCC-CCccccceeEeecceeeccchhhh--HHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhc Q lcl|Aclame:pro 66 QVLAPGQSPNA-TPTQADKNQLVIDTTVIARNTVAH--IHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIAN 142 (402) Q Consensus 66 ~~~~~G~~i~~-~~~~~~e~~itID~~lya~~~Idd--lDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~ 142 (402) ..+.-|..+.. ..+..+++++..-.+ +.-..|.+ |++ +.+| +.+.+.+.+++++++..|..++.-.- T Consensus 164 ~~v~E~~~~~~~~~~~~~~v~l~~~k~-~~~~~iS~ell~d--s~~~-l~~~i~~~l~~~i~~~~d~~~~~g~g------ 233 (392) T protein:vir:10 164 AEITEMGEIPETDNPKFSNVQYAVKDR-AGILPLSRSLLQD--SDQN-ILKYVTKWLGKKSKVTRNVLILGVIE------ 233 (392) T ss_pred eeecccccccccccccceeEEeeeeeE-EEeehhhHHHHhh--hHHH-HHHHHHHHHHHHHHHHHHHHHhhccc------ Confidence 77777777764 346777777777554 22233332 332 3466 78999999999999999988853210 Q ss_pred cccccccccccccccccccccCCccccccHHHHHHHHHHHH-HHHHhhcCCccCcEEEeChHHHHHHhcccchhhccccc Q lcl|Aclame:pro 143 TKAERNKPRVKGHGFSINVNVTESEALANPQYVMAAVEYAL-EQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTI 221 (402) Q Consensus 143 a~~~~~~~~~~g~~~~~~v~~~~a~~~~~~~~l~dai~~a~-~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~ 221 (402) .+. +....+ ++.|.++. ..|+....+ .-..|++|..|..|.+-.. 
.|..|-- T Consensus 234 --------------~~~------~~~~~~----~d~i~~~~~~~l~~~~~~--~a~~vm~~~~~~~L~~lkd-~~G~~l~ 286 (392) T protein:vir:10 234 --------------KLT------KQAIKS----LDDIKDVLNVKLDPAISP--NAILLTNQDGFNYLDKLKD-KDGKYIL 286 (392) T ss_pred --------------ccc------ccCccC----HHHHHHHHHHhhhhhhcc--CCEEEEcHHHHHHHHHhhc-cCCCeEe Confidence 000 011112 34444443 345554443 2335899999999964211 1111110 Q ss_pred ccCcccccceEEEEeccEEEe--cCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeec Q lcl|Aclame:pro 222 SQSGATINGFVLSSYNCPVIP--SNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIF 299 (402) Q Consensus 222 ~~~g~~~~G~V~~iaG~~V~~--SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~ 299 (402) .....+|.-.+++|++++. ++++|...+.. .+...=+-+||++.+-+ +.-.++..+.. T Consensus 287 --~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~~---------~~~~~~~~gdfs~~~~i---------~~~~~~~~~~~ 346 (392) T protein:vir:10 287 --QSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTT---------AKKAPLIIGDLKEAIVL---------FKREDMELAST 346 (392) T ss_pred --ecCccCCccccccCcccEEEecccccCCCccc---------CCceEEEEEehhceEEE---------EeecceEEEEe Confidence 0112234456789987654 34444321111 11111123455442221 11122222222 Q ss_pred --cchhHHH--HHHHHHHHhcCcccccceEEEEEEeeccCccccccc Q lcl|Aclame:pro 300 --YEKKEKT--YYIDTFMAEGAIPDRWEAVSVVTTKRDATTGDAGGP 342 (402) Q Consensus 300 --~d~~~~~--d~i~~~~a~Ga~vlRPeaa~vv~~~~~~t~~~a~~~ 342 (402) .+..... -.+++.+-+|.++++|++.+.++++..+ +.+.|.. 
T Consensus 347 ~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~a-~~~~~~~ 392 (392) T protein:vir:10 347 DVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSA-PVEQPQG 392 (392) T ss_pred ccccchhhcCceEEEEEEeeccEEecccceEEEEecccc-cccCCCC Confidence 2222222 2366777899999999998887665422 2222222 No 132 >protein:vir:102873 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338137;genbank:gi:77020198;genbank:GeneID:3703782 Probab=98.91 E-value=1.9e-10 Score=73.90 Aligned_cols=284 Identities=11% Similarity=0.036 Sum_probs=148.1 Q ss_pred CC------------CCcccccccccccccHHHHHH-HHHhHHHHHHHHHHhhhcccceeeeccccc-eEEee-eccceee Q lcl|Aclame:pro 1 MS------------TPNTLTNVAVSASGEVDSLLI-EKFNGKVNEQYLKGENILSYFDVQTVTGTN-TVSNK-YLGETEL 65 (402) Q Consensus 1 Ms------------~~n~~t~~~~~~~~d~~alfl-e~f~geV~t~f~~~sv~~~~~~~rti~~Gk-sv~f~-~iG~~t~ 65 (402) |- ....-.+....+++..-...| +.|.+++...-...+.++++.+++.+.++. ...++ ..+...+ T Consensus 84 l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~~a 163 (392) T protein:vir:10 84 LRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPF 163 (392) T ss_pred HhcccccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCccc Confidence 00 000011111112222223334 888999999999999999999999987532 33344 3455677 Q ss_pred eeecCCCCCCC-CCccccceeEeecceeeccchhhh--HHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhc Q lcl|Aclame:pro 66 QVLAPGQSPNA-TPTQADKNQLVIDTTVIARNTVAH--IHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIAN 142 (402) Q Consensus 66 ~~~~~G~~i~~-~~~~~~e~~itID~~lya~~~Idd--lDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~ 142 (402) ..+.-|..+.. 
..+..+++++..-.+ +.-..|.+ |++ +.+| +.+.+.+.+++++++..|..++.-.- T Consensus 164 ~~v~E~~~~~~~~~~~~~~v~l~~~k~-~~~~~iS~ell~d--s~~~-l~~~i~~~l~~~i~~~~d~~~~~g~g------ 233 (392) T protein:vir:10 164 AEITEMGEIPETDNPKFSNVQYAVKDR-AGILPLSRSLLQD--SDQN-ILKYVTKWLGKKSKVTRNVLILGVIE------ 233 (392) T ss_pred eeecccccccccccccceeEEeeeeeE-EEeehhhHHHHhh--hHHH-HHHHHHHHHHHHHHHHHHHHHhhccc------ Confidence 77777777764 346777777777554 22233332 332 3466 78999999999999999988853210 Q ss_pred cccccccccccccccccccccCCccccccHHHHHHHHHHHH-HHHHhhcCCccCcEEEeChHHHHHHhcccchhhccccc Q lcl|Aclame:pro 143 TKAERNKPRVKGHGFSINVNVTESEALANPQYVMAAVEYAL-EQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTI 221 (402) Q Consensus 143 a~~~~~~~~~~g~~~~~~v~~~~a~~~~~~~~l~dai~~a~-~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~ 221 (402) .+. +....+ ++.|.++. ..|+....+ .-..|++|..|..|.+-.. .|..|-- T Consensus 234 --------------~~~------~~~~~~----~d~i~~~~~~~l~~~~~~--~a~~vm~~~~~~~L~~lkd-~~G~~l~ 286 (392) T protein:vir:10 234 --------------KLT------KQAIKS----LDDIKDVLNVKLDPAISP--NAILLTNQDGFNYLDKLKD-KDGKYIL 286 (392) T ss_pred --------------ccc------ccCccC----HHHHHHHHHHhhhhhhcc--CCEEEEcHHHHHHHHHhhc-cCCCeEe Confidence 000 011112 34444443 345554443 2335899999999964211 1111110 Q ss_pred ccCcccccceEEEEeccEEEe--cCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeec Q lcl|Aclame:pro 222 SQSGATINGFVLSSYNCPVIP--SNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIF 299 (402) Q Consensus 222 ~~~g~~~~G~V~~iaG~~V~~--SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~ 299 (402) .....+|.-.+++|++++. ++++|...+.. .+...=+-+||++.+-+ +.-.++..+.. 
T Consensus 287 --~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~~---------~~~~~~~~gdfs~~~~i---------~~~~~~~~~~~ 346 (392) T protein:vir:10 287 --QSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTT---------AKKAPLIIGDLKEAIVL---------FKREDMELAST 346 (392) T ss_pred --ecCccCCccccccCcccEEEecccccCCCccc---------CCceEEEEEehhceEEE---------EeecceEEEEe Confidence 0112234456789987654 34444321111 11111123455442221 11122222222 Q ss_pred --cchhHHH--HHHHHHHHhcCcccccceEEEEEEeeccCccccccc Q lcl|Aclame:pro 300 --YEKKEKT--YYIDTFMAEGAIPDRWEAVSVVTTKRDATTGDAGGP 342 (402) Q Consensus 300 --~d~~~~~--d~i~~~~a~Ga~vlRPeaa~vv~~~~~~t~~~a~~~ 342 (402) .+..... -.+++.+-+|.++++|++.+.++++..+ +.+.|.. T Consensus 347 ~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~a-~~~~~~~ 392 (392) T protein:vir:10 347 DVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSA-PVEQPQG 392 (392) T ss_pred ccccchhhcCceEEEEEEeeccEEecccceEEEEecccc-cccCCCC Confidence 2222222 2366777899999999998887665422 2222222 No 133 >protein:vir:107593 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338188;genbank:gi:77020144;genbank:GeneID:3703724 Probab=98.91 E-value=1.9e-10 Score=73.90 Aligned_cols=284 Identities=11% Similarity=0.036 Sum_probs=148.1 Q ss_pred CC------------CCcccccccccccccHHHHHH-HHHhHHHHHHHHHHhhhcccceeeeccccc-eEEee-eccceee Q lcl|Aclame:pro 1 MS------------TPNTLTNVAVSASGEVDSLLI-EKFNGKVNEQYLKGENILSYFDVQTVTGTN-TVSNK-YLGETEL 65 (402) Q Consensus 1 Ms------------~~n~~t~~~~~~~~d~~alfl-e~f~geV~t~f~~~sv~~~~~~~rti~~Gk-sv~f~-~iG~~t~ 65 (402) |- ....-.+....+++..-...| +.|.+++...-...+.++++.+++.+.++. 
...++ ..+...+ T Consensus 84 l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~~a 163 (392) T protein:vir:10 84 LRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPF 163 (392) T ss_pred HhcccccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCccc Confidence 00 000011111112222223334 888999999999999999999999987532 33344 3455677 Q ss_pred eeecCCCCCCC-CCccccceeEeecceeeccchhhh--HHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhc Q lcl|Aclame:pro 66 QVLAPGQSPNA-TPTQADKNQLVIDTTVIARNTVAH--IHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIAN 142 (402) Q Consensus 66 ~~~~~G~~i~~-~~~~~~e~~itID~~lya~~~Idd--lDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~ 142 (402) ..+.-|..+.. ..+..+++++..-.+ +.-..|.+ |++ +.+| +.+.+.+.+++++++..|..++.-.- T Consensus 164 ~~v~E~~~~~~~~~~~~~~v~l~~~k~-~~~~~iS~ell~d--s~~~-l~~~i~~~l~~~i~~~~d~~~~~g~g------ 233 (392) T protein:vir:10 164 AEITEMGEIPETDNPKFSNVQYAVKDR-AGILPLSRSLLQD--SDQN-ILKYVTKWLGKKSKVTRNVLILGVIE------ 233 (392) T ss_pred eeecccccccccccccceeEEeeeeeE-EEeehhhHHHHhh--hHHH-HHHHHHHHHHHHHHHHHHHHHhhccc------ Confidence 77777777764 346777777777554 22233332 332 3466 78999999999999999988853210 Q ss_pred cccccccccccccccccccccCCccccccHHHHHHHHHHHH-HHHHhhcCCccCcEEEeChHHHHHHhcccchhhccccc Q lcl|Aclame:pro 143 TKAERNKPRVKGHGFSINVNVTESEALANPQYVMAAVEYAL-EQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTI 221 (402) Q Consensus 143 a~~~~~~~~~~g~~~~~~v~~~~a~~~~~~~~l~dai~~a~-~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~ 221 (402) .+. +....+ ++.|.++. ..|+....+ .-..|++|..|..|.+-.. 
.|..|-- T Consensus 234 --------------~~~------~~~~~~----~d~i~~~~~~~l~~~~~~--~a~~vm~~~~~~~L~~lkd-~~G~~l~ 286 (392) T protein:vir:10 234 --------------KLT------KQAIKS----LDDIKDVLNVKLDPAISP--NAILLTNQDGFNYLDKLKD-KDGKYIL 286 (392) T ss_pred --------------ccc------ccCccC----HHHHHHHHHHhhhhhhcc--CCEEEEcHHHHHHHHHhhc-cCCCeEe Confidence 000 011112 34444443 345554443 2335899999999964211 1111110 Q ss_pred ccCcccccceEEEEeccEEEe--cCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeec Q lcl|Aclame:pro 222 SQSGATINGFVLSSYNCPVIP--SNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIF 299 (402) Q Consensus 222 ~~~g~~~~G~V~~iaG~~V~~--SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~ 299 (402) .....+|.-.+++|++++. ++++|...+.. .+...=+-+||++.+-+ +.-.++..+.. T Consensus 287 --~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~~---------~~~~~~~~gdfs~~~~i---------~~~~~~~~~~~ 346 (392) T protein:vir:10 287 --QSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTT---------AKKAPLIIGDLKEAIVL---------FKREDMELAST 346 (392) T ss_pred --ecCccCCccccccCcccEEEecccccCCCccc---------CCceEEEEEehhceEEE---------EeecceEEEEe Confidence 0112234456789987654 34444321111 11111123455442221 11122222222 Q ss_pred --cchhHHH--HHHHHHHHhcCcccccceEEEEEEeeccCccccccc Q lcl|Aclame:pro 300 --YEKKEKT--YYIDTFMAEGAIPDRWEAVSVVTTKRDATTGDAGGP 342 (402) Q Consensus 300 --~d~~~~~--d~i~~~~a~Ga~vlRPeaa~vv~~~~~~t~~~a~~~ 342 (402) .+..... -.+++.+-+|.++++|++.+.++++..+ +.+.|.. 
T Consensus 347 ~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~a-~~~~~~~ 392 (392) T protein:vir:10 347 DVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSA-PVEQPQG 392 (392) T ss_pred ccccchhhcCceEEEEEEeeccEEecccceEEEEecccc-cccCCCC Confidence 2222222 2366777899999999998887665422 2222222 No 134 >protein:vir:102082 Length: 392 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512315;genbank:gi:89152484;genbank:GeneID:3953075 Probab=98.91 E-value=1.9e-10 Score=73.90 Aligned_cols=284 Identities=11% Similarity=0.036 Sum_probs=148.1 Q ss_pred CC------------CCcccccccccccccHHHHHH-HHHhHHHHHHHHHHhhhcccceeeeccccc-eEEee-eccceee Q lcl|Aclame:pro 1 MS------------TPNTLTNVAVSASGEVDSLLI-EKFNGKVNEQYLKGENILSYFDVQTVTGTN-TVSNK-YLGETEL 65 (402) Q Consensus 1 Ms------------~~n~~t~~~~~~~~d~~alfl-e~f~geV~t~f~~~sv~~~~~~~rti~~Gk-sv~f~-~iG~~t~ 65 (402) |- ....-.+....+++..-...| +.|.+++...-...+.++++.+++.+.++. ...++ ..+...+ T Consensus 84 l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~~a 163 (392) T protein:vir:10 84 LRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPF 163 (392) T ss_pred HhcccccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCccc Confidence 00 000011111112222223334 888999999999999999999999987532 33344 3455677 Q ss_pred eeecCCCCCCC-CCccccceeEeecceeeccchhhh--HHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhc Q lcl|Aclame:pro 66 QVLAPGQSPNA-TPTQADKNQLVIDTTVIARNTVAH--IHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIAN 142 (402) Q Consensus 66 ~~~~~G~~i~~-~~~~~~e~~itID~~lya~~~Idd--lDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~ 142 (402) ..+.-|..+.. 
..+..+++++..-.+ +.-..|.+ |++ +.+| +.+.+.+.+++++++..|..++.-.- T Consensus 164 ~~v~E~~~~~~~~~~~~~~v~l~~~k~-~~~~~iS~ell~d--s~~~-l~~~i~~~l~~~i~~~~d~~~~~g~g------ 233 (392) T protein:vir:10 164 AEITEMGEIPETDNPKFSNVQYAVKDR-AGILPLSRSLLQD--SDQN-ILKYVTKWLGKKSKVTRNVLILGVIE------ 233 (392) T ss_pred eeecccccccccccccceeEEeeeeeE-EEeehhhHHHHhh--hHHH-HHHHHHHHHHHHHHHHHHHHHhhccc------ Confidence 77777777764 346777777777554 22233332 332 3466 78999999999999999988853210 Q ss_pred cccccccccccccccccccccCCccccccHHHHHHHHHHHH-HHHHhhcCCccCcEEEeChHHHHHHhcccchhhccccc Q lcl|Aclame:pro 143 TKAERNKPRVKGHGFSINVNVTESEALANPQYVMAAVEYAL-EQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTI 221 (402) Q Consensus 143 a~~~~~~~~~~g~~~~~~v~~~~a~~~~~~~~l~dai~~a~-~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~ 221 (402) .+. +....+ ++.|.++. ..|+....+ .-..|++|..|..|.+-.. .|..|-- T Consensus 234 --------------~~~------~~~~~~----~d~i~~~~~~~l~~~~~~--~a~~vm~~~~~~~L~~lkd-~~G~~l~ 286 (392) T protein:vir:10 234 --------------KLT------KQAIKS----LDDIKDVLNVKLDPAISP--NAILLTNQDGFNYLDKLKD-KDGKYIL 286 (392) T ss_pred --------------ccc------ccCccC----HHHHHHHHHHhhhhhhcc--CCEEEEcHHHHHHHHHhhc-cCCCeEe Confidence 000 011112 34444443 345554443 2335899999999964211 1111110 Q ss_pred ccCcccccceEEEEeccEEEe--cCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeec Q lcl|Aclame:pro 222 SQSGATINGFVLSSYNCPVIP--SNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIF 299 (402) Q Consensus 222 ~~~g~~~~G~V~~iaG~~V~~--SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~ 299 (402) .....+|.-.+++|++++. ++++|...+.. .+...=+-+||++.+-+ +.-.++..+.. 
T Consensus 287 --~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~~---------~~~~~~~~gdfs~~~~i---------~~~~~~~~~~~ 346 (392) T protein:vir:10 287 --QSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTT---------AKKAPLIIGDLKEAIVL---------FKREDMELAST 346 (392) T ss_pred --ecCccCCccccccCcccEEEecccccCCCccc---------CCceEEEEEehhceEEE---------EeecceEEEEe Confidence 0112234456789987654 34444321111 11111123455442221 11122222222 Q ss_pred --cchhHHH--HHHHHHHHhcCcccccceEEEEEEeeccCccccccc Q lcl|Aclame:pro 300 --YEKKEKT--YYIDTFMAEGAIPDRWEAVSVVTTKRDATTGDAGGP 342 (402) Q Consensus 300 --~d~~~~~--d~i~~~~a~Ga~vlRPeaa~vv~~~~~~t~~~a~~~ 342 (402) .+..... -.+++.+-+|.++++|++.+.++++..+ +.+.|.. T Consensus 347 ~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~a-~~~~~~~ 392 (392) T protein:vir:10 347 DVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSA-PVEQPQG 392 (392) T ss_pred ccccchhhcCceEEEEEEeeccEEecccceEEEEecccc-cccCCCC Confidence 2222222 2366777899999999998887665422 2222222 No 135 >protein:vir:4456 Length: 401 # NCBI annotation: Major capsid protein precursor # Family: family:all:21 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700379;genbank:gi:23505451;genbank:GeneID:955658 Probab=98.90 E-value=2.5e-11 Score=78.72 Aligned_cols=293 Identities=10% Similarity=0.008 Sum_probs=145.2 Q ss_pred CCCC--cccccccccccccHHHHHH-HHHhHHHHHHHHHHhhhcccceeeeccccceEEee-eccceeeeeecCCCCCCC Q lcl|Aclame:pro 1 MSTP--NTLTNVAVSASGEVDSLLI-EKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNK-YLGETELQVLAPGQSPNA 76 (402) Q Consensus 1 Ms~~--n~~t~~~~~~~~d~~alfl-e~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~-~iG~~t~~~~~~G~~i~~ 76 (402) .... ..-.+....+++..-...| ++|..++++.....++++++.++.++.++. ..++ ..+...+....-|..... 
T Consensus 95 ~~~~~~~~e~~a~~~~~~~~GG~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~-~~~~~~~~~~~a~wv~E~~~~~~ 173 (401) T protein:vir:44 95 REDGLRDLERKALQVGTDEDGGYAVPEELDRSILSLLKDEVVMRQEATVITVGGSD-YKKLVNLGGTASGWVGETDTRSQ 173 (401) T ss_pred hhhhhHHHHHHHhhcCCCCCCceeccHhHHHHHHHHHHhhhhhhhhceeeecCCCc-eEEEEecCCccceeeccccccCc Confidence 0000 0000000111111112234 899999999999999999999888876544 4455 445555555544554443 Q ss_pred C-CccccceeEeecceeeccch---hhhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccc-ccc Q lcl|Aclame:pro 77 T-PTQADKNQLVIDTTVIARNT---VAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERN-KPR 151 (402) Q Consensus 77 ~-~~~~~e~~itID~~lya~~~---IddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~-~~~ 151 (402) . .+..++.++.+-++ +.+. -.-|++ +.+| +.+.+.+++++++++..|+.++. +.....|.-.- ... T Consensus 174 ~~~~~~~~v~~~~~k~--~~~~~iS~ell~d--s~~~-l~~~i~~~la~ai~~~~~~~~l~----G~G~~~p~Gil~~~~ 244 (401) T protein:vir:44 174 TATSRLGLIEPFMGEI--YGNPQATQKMLDD--AFFN-VEAWINSELATEFAEQEEIAFTT----GDGTKKPKGFLAYES 244 (401) T ss_pred cccccceeeeeehhhe--eeehhhhHHHHhc--chHH-HHHHHHHHHHHHHHHHHHhhhhc----cCCCCccceeecccc Confidence 2 34555565555543 3322 222222 2445 67889999999999999988762 11111000000 000 Q ss_pred ccccccccccccCCccccccHH-HHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhc--cc--chhhcccccccCcc Q lcl|Aclame:pro 152 VKGHGFSINVNVTESEALANPQ-YVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRD--AD--RIVDKTYTISQSGA 226 (402) Q Consensus 152 ~~g~~~~~~v~~~~a~~~~~~~-~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~--~~--r~~n~d~~~~~~g~ 226 (402) ..........+........... --|+.|+++...|...+.. .-..|++|..|..|.+ |. |++ | ... 
T Consensus 245 ~~~~~~~~~~~~~~~~~t~~~~~~~~d~i~~~~~~l~~~~~~--~a~~v~n~~~~~~L~~lkd~~G~~l---~----~~~ 315 (401) T protein:vir:44 245 TEESDKARAFGKLQHIVSGEATAVTADAIIKLIYTLRKAHRT--GAKFMMNNNSLFAIRLLKDTEGNYL---W----RPG 315 (401) T ss_pred ccccccccccccccccccccccccCHHHHHHHHHhcchhhhc--CCEEEEcHHHHHHHHHhhccCCcee---e----cCC Confidence 0000000000000000000111 1267777777777665443 2235799999998853 32 221 1 111 Q ss_pred cccceEEEEeccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccchhHHH Q lcl|Aclame:pro 227 TINGFVLSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKEKT 306 (402) Q Consensus 227 ~~~G~V~~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~~~~ 306 (402) +.+|.-.+++|.||+.|+++|..+.+. ..=+-+||+...- ++.+. +++.+....-.+-. T Consensus 316 ~~~g~~~~l~G~PVv~~~~~p~~~~~~------------~~i~~Gd~~~~~~-i~~~~--------~~~~~~~~~~~~~~ 374 (401) T protein:vir:44 316 LELGQPSSLAGYGIAENEQMPDIAADA------------KAIAFGNFKRGYT-IVDRI--------GTRILRDPYTNKPF 374 (401) T ss_pred cCCCCCceecceeeEEecCcCCccCCc------------cEEEEeehhccEE-EEEec--------ceEEeeeccccCCc Confidence 234555689999999999999643211 1112255544222 22222 12221111011111 Q ss_pred HHHHHHHHhcCcccccceEEEEEEeec Q lcl|Aclame:pro 307 YYIDTFMAEGAIPDRWEAVSVVTTKRD 333 (402) Q Consensus 307 d~i~~~~a~Ga~vlRPeaa~vv~~~~~ 333 (402) ..+.+.+=+|..+++|++.+.|+.+.- T Consensus 375 v~~~a~~r~d~~~~~~~a~~~l~~~aa 401 (401) T protein:vir:44 375 VGFYTTKRTGGMLVDSQAIKLLKIAAA 401 (401) T ss_pred EEEEEEEEeccEEecccceEEEEeecC Confidence 223455568999999999888776543 No 136 >protein:vir:9927 Length: 295 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795689;genbank:gi:28876459;genbank:GeneID:1258000 Probab=98.90 E-value=6.7e-10 Score=70.87 Aligned_cols=280 Identities=11% Similarity=0.007 Sum_probs=166.4 Q ss_pred CCCCcccccccccccccHHHHHHHHHhHHHHHHHHHHhhhcccceeeeccccceEEeeeccc-eeeeeecCCCCCCCCCc Q lcl|Aclame:pro 1 
MSTPNTLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYLGE-TELQVLAPGQSPNATPT 79 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~alfle~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~~iG~-~t~~~~~~G~~i~~~~~ 79 (402) |+..|..+...-.++ ..--|+..|+.-+.+ =..+++..|..++..|+++++|.-.- ..++++.-|+.|+.+.+ T Consensus 1 mAe~nlt~~~dL~~~--~sidfv~~f~~~i~~----L~~~Lgi~r~~p~a~G~tIt~pK~~~tgda~dVaEGe~Iplskv 74 (295) T protein:vir:99 1 MAEKNLNTMADLGDI--KSIDFVNKFSKNIND----LLKLLGVTRRETLTNDLKIQTYKWEVTLDQTDPGEGETIPLSKV 74 (295) T ss_pred CCCcccccHhhccCc--eeehhhHHhhhhHHH----HHHHhccccccccccCCeEEeeeeeeecccccccCCcccchhhh Confidence 999775443322211 122488999865533 33457778888899999999997432 24567888999988887 Q ss_pred ccc---ceeEeecceeeccchhhhHHHh-h-cCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccccccc Q lcl|Aclame:pro 80 QAD---KNQLVIDTTVIARNTVAHIHDV-Q-GDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKG 154 (402) Q Consensus 80 ~~~---e~~itID~~lya~~~IddlDe~-q-~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~g 154 (402) ... ..++.+. +|... + =||+ | .-|+.=..|-.+++..++++++|..++..+..+.... T Consensus 75 t~~~~~t~t~kik--K~rK~-t--TdEAIqlsGygdpvgead~qL~~~ia~kId~D~~~~lktat~t~------------ 137 (295) T protein:vir:99 75 TRTKDKDYTVKWF--KKRRA-T--TAEAIARHGAARAITEADKRIMRELQNGIKDAFFTFLKTKPTKV------------ 137 (295) T ss_pred eeeeeeeeEEEee--eeccc-c--cHHHHHhcCCCchhHHHHHHHHHHHHHhhhHHHHHHhccCceee------------ Confidence 654 3566664 44443 3 3666 3 6666667899999999999999999987774321110 Q ss_pred cccccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccccCcccccceEEE Q lcl|Aclame:pro 155 HGFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATINGFVLS 234 (402) Q Consensus 155 ~~~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~~~~G~V~~ 234 (402) ...+-+..|+.+..+...+.|.+= ...+++|+|..++.|+++-.+. ++. + ..+.---+.. 
T Consensus 138 -------------tg~~lq~a~a~~~~al~~f~Ee~~--~~~V~FVnP~D~a~yl~~A~~~---~~~-a-~~fG~~~L~n 197 (295) T protein:vir:99 138 -------------KGVGLQKALSASWAKLATFNEFEG--SPLVSFVSPLDVANYLGDTKVG---ADA-S-NVFGMTLLKN 197 (295) T ss_pred -------------ehhhHHHHHHHhhhhhhhcccccC--CceEEEEehHHHHHHHhccccc---cch-h-hhhhhhhhhh Confidence 001123445566666666554321 2469999999999999987653 211 1 0011122335 Q ss_pred EeccE-EEecCccccccCccccc--------cccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccchhHH Q lcl|Aclame:pro 235 SYNCP-VIPSNRFPTFAQDQAHH--------LLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKEK 305 (402) Q Consensus 235 iaG~~-V~~SNnlP~~~~~~t~~--------~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~~~ 305 (402) +.|++ |+.|+.+|.+.-..|.. ..++-.-+..|....|.+..+|+- |.. ....++. . T Consensus 198 fLG~q~II~S~kv~~G~~~aT~~~Ni~~ay~~~~~g~l~~~f~~~~D~tglIg~~-h~~-----~~~~~t~--------e 263 (295) T protein:vir:99 198 FLGMQNVIVMPSVPEGKIYSTAVENLVFASLNVKGGDLGGLFADFTDETGLIAAA-RNR-----QLSNLTY--------E 263 (295) T ss_pred hhccceEEEcccCCCceEEEeeccceEEEEecCCchhhhhhhhhccCcccceEEE-ecc-----ccceeee--------h Confidence 89997 99999999765433321 111122345566666666666643 211 0111111 1 Q ss_pred HHHHHHHHHhcCcccccceEEEEEEeeccCccccc Q lcl|Aclame:pro 306 TYYIDTFMAEGAIPDRWEAVSVVTTKRDATTGDAG 340 (402) Q Consensus 306 ~d~i~~~~a~Ga~vlRPeaa~vv~~~~~~t~~~a~ 340 (402) .-++-|...| +=|+|..+..+++...+|+.-+ T Consensus 264 t~~~~~~~lf---pE~~dgiv~~tI~~~~~~~~~~ 295 (295) T protein:vir:99 264 SVFFGANVLF---AEIPEGVVEATIEAAAVPGIGG 295 (295) T ss_pred hhhHhHHHhc---ccccceEEEEEEecCcCCCCCC Confidence 1111111111 3356688888888888887776 No 137 >protein:vir:5739 Length: 366 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892050;genbank:gi:33770513;interpro:IPR006444;uniprot:Q7Y410;genbank:GeneID:1732928 Probab=98.89 E-value=3.6e-10 Score=72.35 Aligned_cols=289 Identities=13% Similarity=0.069 Sum_probs=140.0 Q ss_pred 
CCCCc----ccccccccccccHHHHHH-HHHhHHHHHHHHHHhhhccc-ceeeeccccceEEeeec-cceeeeeecCCCC Q lcl|Aclame:pro 1 MSTPN----TLTNVAVSASGEVDSLLI-EKFNGKVNEQYLKGENILSY-FDVQTVTGTNTVSNKYL-GETELQVLAPGQS 73 (402) Q Consensus 1 Ms~~n----~~t~~~~~~~~d~~alfl-e~f~geV~t~f~~~sv~~~~-~~~rti~~Gksv~f~~i-G~~t~~~~~~G~~ 73 (402) |.... ...+ ..+.+.+.-...| +++.+++.+.....++++.+ .++-+...| .+++|+. +...+....-|+. T Consensus 52 ~a~~~~~~~~~~~-a~~~~~~~Gg~lvP~~~~~~ii~~l~~~s~l~~lg~~~v~~~~g-~~~~p~~t~~~~a~wv~E~~~ 129 (366) T protein:vir:57 52 FAATELGDTGLSM-AISTAAGSGGALIPQNMQNEVIELLRDRTVVRILGARSIPLPNG-NLSMPRLSGGATAGYVGEGKD 129 (366) T ss_pred HHHHhhcchhhhh-hccccccCCccccchhHHHHHHHHHhhhcchhhhceeeeecCCC-ceEEEEEeCCcceeeeccCcc Confidence 10000 0000 0000011111223 77889999888888888776 443334444 4778765 7777888888888 Q ss_pred CCCCCccccceeEeecceeeccchhh--hHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccccc Q lcl|Aclame:pro 74 PNATPTQADKNQLVIDTTVIARNTVA--HIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPR 151 (402) Q Consensus 74 i~~~~~~~~e~~itID~~lya~~~Id--dlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~ 151 (402) +....+..++.++..-.+- .-..|. -|+ ++.++ +.+.+.+++++++++.+|+.++. +..... .|. T Consensus 130 ~~~s~~~f~~i~~~~~k~~-~~~~iS~ell~--ds~~~-~~~~i~~~l~~a~~~~~d~a~l~----G~G~~~-----~p~ 196 (366) T protein:vir:57 130 VVATGATFDDVKLSAKTMI-ALVPVSNQLIG--RAGFN-VEQLLLGDILSAIATREDKAFLR----DDGTGD-----TPK 196 (366) T ss_pred ccccccceeEEEEeeEEEE-EeehhhHHHHh--hhhHH-HHHHHHHHHHHHHHHHHHHHhhc----cCCCCc-----ccc Confidence 8777777776666554322 222222 122 23455 67889999999999999987752 111100 000 Q ss_pred cccccc---ccccccCCccccccHHHHHHHHHHHH-HHHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccccCccc Q lcl|Aclame:pro 152 VKGHGF---SINVNVTESEALANPQYVMAAVEYAL-EQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGAT 227 (402) Q Consensus 152 ~~g~~~---~~~v~~~~a~~~~~~~~l~dai~~a~-~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~~ 227 (402) |... .............+... ++.+.+.. ......+.....-..+++|..|..|.+-.. .+..|--. .. 
T Consensus 197 --Gi~~~~~~~~~~~~~~~t~~~~~~-~~~~~~~~~~~~~~~~~~~~~a~~vmn~~~~~~L~~lkd-~~G~~l~~---~~ 269 (366) T protein:vir:57 197 --GMKAVATAANRLVAWTGTAINLTT-IDEYLDSLILKHMDSNSNMIRCGWGLSNRTYMTLFGLRD-GNGNKVYP---EM 269 (366) T ss_pred --ceeeccccccceeeccccccchhh-HHHHHHHHHHhhhccccccccCEEEecHHHHHHHHhhhc-cCCceecc---CC Confidence 1100 00000000000111111 22222222 112222222223334799999998865211 11111100 01 Q ss_pred ccceEEEEeccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccchh---- Q lcl|Aclame:pro 228 INGFVLSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKK---- 303 (402) Q Consensus 228 ~~G~V~~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~---- 303 (402) .+ ++++|+||+.|+++|...+..+ +...=+-+||+... +..+ .++..+..++.. T Consensus 270 ~~---g~l~G~Pvv~s~~ip~~~~~~~---------~~~~i~~gdfs~~~--i~~~--------~~i~i~~~~ea~~~~~ 327 (366) T protein:vir:57 270 SQ---GILKGYPIQRTSAIPANLGDDG---------NESEIYFCDFNDVV--IGED--------GMMKVDFSTEATYKDA 327 (366) T ss_pred CC---CeecceeeEEccccccccccCC---------CccEEEEEecceEE--EEEe--------cceEEEEeeccccccc Confidence 12 3689999999999996321110 11111346666532 2211 122223222211 Q ss_pred -------HHH--HHHHHHHHhcCcccccceEEEEEEeec Q lcl|Aclame:pro 304 -------EKT--YYIDTFMAEGAIPDRWEAVSVVTTKRD 333 (402) Q Consensus 304 -------~~~--d~i~~~~a~Ga~vlRPeaa~vv~~~~~ 333 (402) .+. 
-.|++.+-++-+++||++.+.++--.= T Consensus 328 ~g~~~~~f~~~~~~iR~~~~~d~~v~~~~a~~~lt~~~~ 366 (366) T protein:vir:57 328 DGQLVSAFARNQSLIRVVTEHDIGFRHPEGLVLGTGVIW 366 (366) T ss_pred cccchhhhhcCceeEEeeeeeCcEeeccccEEEEecccC Confidence 111 245667778999999999887754333 No 138 >protein:vir:93881 Length: 387 # NCBI annotation: ORF011 # Family: family:all:658 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239938;genbank:gi:66395599;genbank:GeneID:5130947 Probab=98.88 E-value=4.2e-11 Score=77.47 Aligned_cols=276 Identities=11% Similarity=0.036 Sum_probs=143.3 Q ss_pred CCCCc--------ccccccccccccHHHHHH-HHHhHHHHHHHHHHhhhcccceeeeccccceEEeee--ccceeeeeec Q lcl|Aclame:pro 1 MSTPN--------TLTNVAVSASGEVDSLLI-EKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKY--LGETELQVLA 69 (402) Q Consensus 1 Ms~~n--------~~t~~~~~~~~d~~alfl-e~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~~--iG~~t~~~~~ 69 (402) +.... ...+....+++..-...| +.|..+++......+.++++.++.++.+. ++|+ .+..++..+. T Consensus 100 ~~~~~~~~~~~~~~~~~al~~~t~s~gG~~IP~~~~~~Ii~~~~~~~~l~~~~~v~~~~~~---~~p~~~~~~~~a~~v~ 176 (387) T protein:vir:93 100 LPNEFEKPSMEAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKGL---EIPRVSYTLDDDDFIT 176 (387) T ss_pred hhhhhhhhhhhhHHHHHhhccCcCCCCceeechhHHHHHHHHHHhhchhhhheeeeecCCc---eEEEEeecCCcccccc Confidence 00000 000000000111112233 78889999999998999999998887543 3444 2445566677 Q ss_pred CCCCCCCCCccccceeEeecceeeccchhhhHHHhh-cCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccc Q lcl|Aclame:pro 70 PGQSPNATPTQADKNQLVIDTTVIARNTVAHIHDVQ-GDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERN 148 (402) Q Consensus 70 ~G~~i~~~~~~~~e~~itID~~lya~~~IddlDe~q-~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~ 148 (402) -|+......+..+++++. ..+++.+.--.-+=.+ +.+| +.+.+.+++++++++..++.+|... . 
T Consensus 177 E~~~~~~~~~~f~~v~~~--~~k~~~~~~iS~ell~Ds~~~-l~~~i~~~la~~~~~~e~~~~~~~g---~--------- 241 (387) T protein:vir:93 177 DVETAKELKLKGDTVKFT--TNKFKVFAAISDTVIHGSDVD-LVNWVENALQSGLAAKERKDALAVS---P--------- 241 (387) T ss_pred Ccccccccccccceeeee--heeeeeechhhHHHHhhhHHH-HHHHHHHHHHHHHHHHHHHhHhhcC---C--------- Confidence 777776666766665554 4445443322211122 3455 6788989999999987666554111 0 Q ss_pred cccccccccccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccccCcccc Q lcl|Aclame:pro 149 KPRVKGHGFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATI 228 (402) Q Consensus 149 ~~~~~g~~~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~~~ 228 (402) + .+...+.....+ ....+....||.|.++...|+..+.. ...| |+++..|..|++-.+=-++. +. T Consensus 242 g---~g~p~g~l~~~~--~~~v~~~~~~d~i~~~~~~l~~~~~~-~a~~-~mn~~t~~~~~~~~~d~~~~--------~~ 306 (387) T protein:vir:93 242 K---SGLDHMSFYNGS--VKEVEGADMYDAIINALADLHEDYRD-NATI-YMRYADYVKIISVLSNGTTN--------FF 306 (387) T ss_pred C---ccccceeeeccc--cccccccchHHHHHHHHhccChhhhc-CCEE-EEechHHHHHHHHHhcCCCc--------cc Confidence 0 011111111001 11123344688888888888887665 4566 66777666654311111122 22 Q ss_pred cceEEEEeccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccchhHHHHH Q lcl|Aclame:pro 229 NGFVLSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKEKTYY 308 (402) Q Consensus 229 ~G~V~~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~~~~d~ 308 (402) .|.=.++.|.||+.++..|.. +-+||+..... +. 
++..+.+.+......- T Consensus 307 ~~~~~~llG~PV~~~~~~~~~-------------------~~GDf~~~~~~-~~----------~~~~~~~~~~~~~~~~ 356 (387) T protein:vir:93 307 DTPAEKVFGKPVVFTDAAVKP-------------------IVGDFNYFGIN-YD----------GTTYDTDKDVKKGEYL 356 (387) T ss_pred ccCCccccccceEEecCCCce-------------------eeeehhhhhee-hh----------hheeeecccccCCcee Confidence 233347899999998865421 22444432111 11 0111222222222222 Q ss_pred HHHHHHhcCcccccceEEEEEEeeccCcccccc Q lcl|Aclame:pro 309 IDTFMAEGAIPDRWEAVSVVTTKRDATTGDAGG 341 (402) Q Consensus 309 i~~~~a~Ga~vlRPeaa~vv~~~~~~t~~~a~~ 341 (402) +.+..=||.++++|+|...++.+..+ +++|+ T Consensus 357 ~~~~~r~d~~v~~~eA~~~l~~k~~~--~~~~~ 387 (387) T protein:vir:93 357 FVLTAWYDQQRTLDSAFRIAKAKENT--GSLPS 387 (387) T ss_pred EEEEeeeCceeechhheEEEEeecCC--CCCCC Confidence 33444579999999998877766543 44444 No 139 >protein:vir:3870 Length: 400 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680487;swissprot:trembl:q8ltc0;genbank:gi:22296527;interpro:IPR006444;uniprot:Q8LTC0;genbank:GeneID:951713 Probab=98.88 E-value=2.6e-10 Score=73.13 Aligned_cols=266 Identities=11% Similarity=0.046 Sum_probs=145.5 Q ss_pred CCCCccccccccccc-ccHHHHHH-HHHhHHHHHHHHHHhhhcccceeeeccccceEEeeec--cceeeeeecCCCCCCC Q lcl|Aclame:pro 1 MSTPNTLTNVAVSAS-GEVDSLLI-EKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYL--GETELQVLAPGQSPNA 76 (402) Q Consensus 1 Ms~~n~~t~~~~~~~-~d~~alfl-e~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~~i--G~~t~~~~~~G~~i~~ 76 (402) +...-........+. ...-...+ +.|..+++......+.++++.++.++.++ +..+|+. +...+..+.-|..... 
T Consensus 123 ~~~~~~~~~~~~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~E~~~~~~ 201 (400) T protein:vir:38 123 RAVPTDASDAVNAGVKAADAASTIPETISNTPQRELQTVVDLKPFTNVFQASTQ-KGTYPTVANATTKMVTVAELEKNPA 201 (400) T ss_pred hhhhHHHHHHHhhcccccCCcccccHHHHHHHHHHHHhhhhhhhcceeEeccCc-ceEEEEEecCCCccccccccccccc Confidence 010000000000000 00011223 88999999999999999999998887654 4556654 4444556655555543 Q ss_pred -CCccccceeEeecceeeccchhhhHHHhh--cCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccccccc Q lcl|Aclame:pro 77 -TPTQADKNQLVIDTTVIARNTVAHIHDVQ--GDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVK 153 (402) Q Consensus 77 -~~~~~~e~~itID~~lya~~~IddlDe~q--~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~ 153 (402) ..+..++.++++-. ++....-.- +.. +.+| +.+.+.++++++|+...|+.|+...- T Consensus 202 ~~~~~f~~i~~~~~k--~~~~~~is~-ell~ds~~~-~~~~i~~~l~~~~~~~~~~~i~~~~~----------------- 260 (400) T protein:vir:38 202 MAKPEFKPVNWSVET--YRQALPVSQ-ESIDDSAID-LVGLIAQNGQQIKVNTTNGAVATLLK----------------- 260 (400) T ss_pred cccccceeeEeehhh--eeeehhhHH-HHHhhhHHH-HHHHHHHHHHHHHHHHHHHhhhhccc----------------- Confidence 45666666666543 333222111 222 2455 77889999999999999987752210 Q ss_pred ccccccccccCCccccccHHHHHHHHHHHHH-HHHhhcCCccCcEEEeChHHHHHHhc--c--cchhhcccccccCcccc Q lcl|Aclame:pro 154 GHGFSINVNVTESEALANPQYVMAAVEYALE-QQLEQEVDISDVAIMMPWKFFNALRD--A--DRIVDKTYTISQSGATI 228 (402) Q Consensus 154 g~~~~~~v~~~~a~~~~~~~~l~dai~~a~~-~LdekdVP~~gR~~VV~P~~y~~Ll~--~--~r~~n~d~~~~~~g~~~ 228 (402) .+.. ....+ ++.|.++.. .++.. ..-..|++|..|..|.+ | .+++- .+... 
T Consensus 261 -~~~~--------~~~~~----~~~~~~~~~~~~~~~----~~a~~v~~~~~~~~l~~lkd~~G~~i~-------~~~~~ 316 (400) T protein:vir:38 261 -GFTA--------KTISS----VDDLKHINNVDLDPA----YSRVIIASQSFYNFLDTVKDGNGRYLL-------QDSIL 316 (400) T ss_pred -cccc--------ccccc----HHHHHHHHHhhhhhh----hCcEEEEcHHHHHHHHHhhccCCCeee-------ecCcC Confidence 0000 01111 233333322 22211 23455889999999864 2 22221 11123 Q ss_pred cceEEEEeccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccchhHHHHH Q lcl|Aclame:pro 229 NGFVLSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKEKTYY 308 (402) Q Consensus 229 ~G~V~~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~~~~d~ 308 (402) +|.-++++|+||+.++++|.... |...-+-+||++..-++ .-.++..+..+ ...+... T Consensus 317 ~~~~~~l~G~pv~~~~~~~~~~~------------g~~~~~~gd~s~~~~~~---------~~~~~~~~~~~-~~~~~~~ 374 (400) T protein:vir:38 317 TPSGKSVLGMPIAVVSDDTLGAA------------GEAHAFLGDIKRAILFA---------NRADFMVRWVD-DQIYGQF 374 (400) T ss_pred CCCccccccceeEEecccccCCC------------CceEEEEEeccccEEEE---------eecceEEEEec-cccccee Confidence 45556899999999999986431 11112445655532221 11223333332 3345567 Q ss_pred HHHHHHhcCcccccceEEEEEEeecc Q lcl|Aclame:pro 309 IDTFMAEGAIPDRWEAVSVVTTKRDA 334 (402) Q Consensus 309 i~~~~a~Ga~vlRPeaa~vv~~~~~~ 334 (402) +++.+=+|.++++|++.+.|+++..+ T Consensus 375 ~~~~~r~d~~~~~~~a~~~l~~~~~a 400 (400) T protein:vir:38 375 LQAGMRFGVSVADEKAGYFLTYTPKA 400 (400) T ss_pred EEEEEEeccEEecccceEEEEeecCC Confidence 78888899999999999888876554 No 140 >protein:vir:1084 Length: 437 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076738;genbank:gi:13095848;genbank:GeneID:920418 Probab=98.86 E-value=2.5e-10 Score=73.26 Aligned_cols=284 Identities=14% Similarity=0.036 Sum_probs=139.1 Q ss_pred CCCCcc-cccccccccccHHH-HHHHHHhHHHHHHHHHHhhhcccceeeeccccceEEeeec--cceeeeeecCCCCCCC Q lcl|Aclame:pro 1 MSTPNT-LTNVAVSASGEVDS-LLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYL--GETELQVLAPGQSPNA 
76 (402) Q Consensus 1 Ms~~n~-~t~~~~~~~~d~~a-lfle~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~~i--G~~t~~~~~~G~~i~~ 76 (402) ..+... ..+..+.....+.. +.-+.+..++... ...+.++.+.++.++.++ +..+|.. +...+....-|..+.. T Consensus 145 ~~~~~~~e~~~~~~~~~~~~g~lvp~~~~~~i~~~-~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~e~~~~~e 222 (437) T protein:vir:10 145 ADYLKTGEVRDVTGIALKDGKVIIPETILTPEKEV-HQFPRLGSLVRTESVTTT-TGKLPIFNNSTDLLTAHTEYGQTTK 222 (437) T ss_pred HHHHHhhhhhhhhhcccccccccchHHHHHHHHHh-hhhhhhhhcceeEeeccC-ceeeEEeeccccccccccccccccc Confidence 000000 00111111111111 2226677777554 445567788877776654 3445544 3334555555555532 Q ss_pred -CCccccceeEeeccee-eccchhhhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccccccc Q lcl|Aclame:pro 77 -TPTQADKNQLVIDTTV-IARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKG 154 (402) Q Consensus 77 -~~~~~~e~~itID~~l-ya~~~IddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~g 154 (402) ..+..++.++.+-.+- +..+.-.-+++ +.+| +.+.+.+.++++|++..|..|+.-.- T Consensus 223 ~~~~~~~~v~~~~~k~~~~~~is~ell~d--s~~~-~~~~i~~~l~~~~~~~~~~~i~~g~g------------------ 281 (437) T protein:vir:10 223 NATPVITPILWDLKTYTGGYVFSQELISD--SSYD-WQAELQSRLIELRDNTDDSLIITALT------------------ 281 (437) T ss_pred cccccceeeeeehhheeeehhhhHHHHhh--hHHH-HHHHHHHHHHHHHHHHHHHHHhhhhc------------------ Confidence 3355566666554432 11222222232 2345 67889999999999999988763221 Q ss_pred cccccccccCCccccccHHHHHHHHHHHH-HHHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccccCcccccceEE Q lcl|Aclame:pro 155 HGFSINVNVTESEALANPQYVMAAVEYAL-EQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATINGFVL 233 (402) Q Consensus 155 ~~~~~~v~~~~a~~~~~~~~l~dai~~a~-~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~~~~G~V~ 233 (402) .+.... ... .. ++.|.++. ..|+..+.+ ... .|++|..|..|.+-.. .|..|-- .+.+.+|.-. 
T Consensus 282 --~~~~~~--~~~--~~----~~~~~~~~~~~l~~~~~~-~~~-~~~~~~~~~~l~~lkd-~~g~~~~--~~~~~~~~~~ 346 (437) T protein:vir:10 282 --DGIKKT--TST--YL----LGDLKKVLNVTLKPQDSA-AAS-IVMSQSAYNLFDMATD-AMGRPLL--QPNVTAATGY 346 (437) T ss_pred --cccccc--ccc--cc----hhhHHHHHHhhhhhhhhc-CCE-EEEcHHHHHHHHHhhc-cCCCeee--ccCccCCCCc Confidence 111000 001 11 22233332 245554443 234 4999999999865211 1111110 1112345456 Q ss_pred EEeccEEEecCcc--ccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccchhHHHHHHHH Q lcl|Aclame:pro 234 SSYNCPVIPSNRF--PTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKEKTYYIDT 311 (402) Q Consensus 234 ~iaG~~V~~SNnl--P~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~~~~d~i~~ 311 (402) +++|.||+.+++. |.... |...=+-+||++...+ +.+. +++++...+-..+...+.+ T Consensus 347 ~l~G~pv~~~~~~~~~~~~~------------~~~~~~~gd~~~~~~~-~~r~--------~~~~~~~~~~~~~~~~~~~ 405 (437) T protein:vir:10 347 TLLGKTVVIVDDKLFPSASA------------GDVNIVVAPLKKAVIN-FKLT--------EITGQFQDTYDIWYKQLGI 405 (437) T ss_pred ccccceeEEecccccCCcCC------------CceEEEEeeccccEEE-Eeee--------ceEEEEecccccccceeeE Confidence 8999999988764 43221 2111245666654332 2221 2233322222233344555 Q ss_pred HHHhcCcccccceEEEEEEeeccCccccccch Q lcl|Aclame:pro 312 FMAEGAIPDRWEAVSVVTTKRDATTGDAGGPG 343 (402) Q Consensus 312 ~~a~Ga~vlRPeaa~vv~~~~~~t~~~a~~~~ 343 (402) .+=|+.++++|++.+.|+.+..+.+-+.++++ T Consensus 406 ~~r~d~~~~~~~a~~~l~~~~~~~~~~~~~~~ 437 (437) T protein:vir:10 406 FLRQNVVQASKDLIVNLTGKLKAVTVVQSTAV 437 (437) T ss_pred EEEEccEEecccceEEEEeeccccccCCCCCC Confidence 55679999999999988876555444444443 No 141 >protein:vir:100884 Length: 389 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358764;genbank:gi:78000028;genbank:GeneID:3726155 Probab=98.86 E-value=2.5e-10 Score=73.27 Aligned_cols=281 Identities=12% Similarity=0.048 Sum_probs=145.9 Q ss_pred CCCCcccccccccccccHHHHHH-HHHhHHHHHHHHHHhhhcccceeeeccccceEEeeec--cceeeeeecCCCCCCC- Q lcl|Aclame:pro 1 
MSTPNTLTNVAVSASGEVDSLLI-EKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYL--GETELQVLAPGQSPNA- 76 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~alfl-e~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~~i--G~~t~~~~~~G~~i~~- 76 (402) |-......+....++..+-...| +.|..++++.....+.++++.++.++.++ +.++++. +......+.-|..... T Consensus 99 lr~~~~~~~~~~~~t~~~gg~~vP~~~~~~i~~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~E~~~~~~~ 177 (389) T protein:vir:10 99 IHSHGKVIDATSKVTSTEAGVLIPEEIIYDPTAEVNSVVDLSTLVTKTPVTTP-KGTYPILKRATDRFSSVAELAENPKL 177 (389) T ss_pred hhcchhhhhhhcccccCCcceeehHHHHHHHHHHHHhhhhHHhhcceeeccCC-eeEEEEEecCCCcccccccccccccc Confidence 11111001111111112122234 78889999999999999999988887654 3455544 3344455555555543 Q ss_pred CCccccceeEeecceeeccchhhhHHHh-hcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccccccccc Q lcl|Aclame:pro 77 TPTQADKNQLVIDTTVIARNTVAHIHDV-QGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKGH 155 (402) Q Consensus 77 ~~~~~~e~~itID~~lya~~~IddlDe~-q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~g~ 155 (402) ..+...+.++.+..+ +.+..-.-+-. .+.+| +.+.+.++++++|++..|..|+.-+- T Consensus 178 ~~~~~~~i~~~~~k~--~~~~~iS~ell~ds~~~-l~~~i~~~la~~~~~~~~~~i~~g~~------------------- 235 (389) T protein:vir:10 178 AEPEFNKVDWSVATY--RGAIPLSEEAIADSAVD-LTALVGQSIKEKSVNTYNAMIAPVLQ------------------- 235 (389) T ss_pred ccccceeeeeeheee--EeeehhhHHHHhhhhHH-HHHHHHHHHHHHHHHHHHHHHhhhhc------------------- Confidence 455666666666543 22222111111 13455 78899999999999999988753221 Q ss_pred ccccccccCCccccccHHHHHHHHHHHHH-HHHhhcCCccCcEEEeChHHHHHHhc--c--cchhhcccccccCcccccc Q lcl|Aclame:pro 156 GFSINVNVTESEALANPQYVMAAVEYALE-QQLEQEVDISDVAIMMPWKFFNALRD--A--DRIVDKTYTISQSGATING 230 (402) Q Consensus 156 ~~~~~v~~~~a~~~~~~~~l~dai~~a~~-~LdekdVP~~gR~~VV~P~~y~~Ll~--~--~r~~n~d~~~~~~g~~~~G 230 (402) .+... +..... -|+.|.++.. .++... .-.+|++|..|..|.+ | .|++-. 
..-.+....| T Consensus 236 -~~~~~---~~~~~~----~~d~l~~~~~~~~~~~~----~a~~~~n~~~~~~L~~lkd~~G~~i~~---~~~~~~~~~~ 300 (389) T protein:vir:10 236 -SFTAK---KTTTDT----LVDSLKHILNVDLDPAY----SRALVVTQSLFNTLDTLKDKNGRYLLH---DASDSITDGT 300 (389) T ss_pred -ccccc---cccccc----cHHHHHHHHHhhhhhhh----CcEEEecHHHHHHHHHhhccCCCeeee---cCcccccccc Confidence 00000 111111 2444555443 343322 3456899999999975 2 233211 1111112345 Q ss_pred eEEEEeccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccchhHHHHHHH Q lcl|Aclame:pro 231 FVLSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKEKTYYID 310 (402) Q Consensus 231 ~V~~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~~~~d~i~ 310 (402) ...+++|+||+.+++...... .|...-+-+||++...++-. .+++.++.+ ...+...+. T Consensus 301 ~~~~l~G~pV~~~~~~~~~~~-----------~~~~~~~~gd~~~~~~~~~~---------~~~~i~~~~-~~~~~~~~~ 359 (389) T protein:vir:10 301 AKGTILGVPVYVVGDTLLGSL-----------AGDQKAFVGDLKRGVLFTDR---------QQVTLAWED-SKIYGKYLG 359 (389) T ss_pred cccccccceeEEecccccCCC-----------CCceEEEEeeccccEEEEee---------cceEEEeec-cccccceEE Confidence 567899999987765321110 01111244666653332221 223333322 333445667 Q ss_pred HHHHhcCcccccceEEEEEEeeccCccccccc Q lcl|Aclame:pro 311 TFMAEGAIPDRWEAVSVVTTKRDATTGDAGGP 342 (402) Q Consensus 311 ~~~a~Ga~vlRPeaa~vv~~~~~~t~~~a~~~ 342 (402) +.+=+|..+++|++.+.+++.. 
+++++++- T Consensus 360 ~~~r~d~~~~~~~a~~~~~~~~--~~~~~~~~ 389 (389) T protein:vir:10 360 AAFRFGVQKADSKAGYFVTNTD--VPGSALGK 389 (389) T ss_pred EEEEeccEEecccceEEEEeec--cCCCCCCC Confidence 7777999999999877765433 33333322 No 142 >protein:vir:105038 Length: 428 # NCBI annotation: major capsid head protein precursor # Family: family:all:21 # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006586;genbank:gi:46402092;genbank:GeneID:2777903 Probab=98.85 E-value=7.6e-10 Score=70.58 Aligned_cols=288 Identities=13% Similarity=0.060 Sum_probs=138.3 Q ss_pred CCCCc----ccccccccccccHHHHHH-HHHhHHHHHHHHHHhhhccc-ceeeeccccceEEeeec-cceeeeeecCCCC Q lcl|Aclame:pro 1 MSTPN----TLTNVAVSASGEVDSLLI-EKFNGKVNEQYLKGENILSY-FDVQTVTGTNTVSNKYL-GETELQVLAPGQS 73 (402) Q Consensus 1 Ms~~n----~~t~~~~~~~~d~~alfl-e~f~geV~t~f~~~sv~~~~-~~~rti~~Gksv~f~~i-G~~t~~~~~~G~~ 73 (402) |.... ...+.....++ .-...| +.|..+++......++++.+ +++-+..+| .++||++ +..++..+.-|+. T Consensus 113 ~~~~~~~~~~~~~~~~~~~~-~gg~liP~~~~~~ii~~l~~~~~l~~~~~~~~~~~~g-~~~~p~~~~~~~a~~v~Eg~~ 190 (428) T protein:vir:10 113 FASDELNDQSVSMAISTAAG-SGGVLIPQNIHSEVIELLRDRTIVRKLGARSIPLPNG-NMSLPRLAGGATASYTGENQD 190 (428) T ss_pred HhhhhhhhhhHhhhhccccc-CCccccchhHHHHHHHHHhhhchhhhhcceeeecCCc-ceEEEEEeCCcceeeeccCcc Confidence 11000 00010000111 111233 67778888888888888887 332222233 4778875 5667777777888 Q ss_pred CCCCCccccceeEeecceeeccchhhh--HHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccccc Q lcl|Aclame:pro 74 PNATPTQADKNQLVIDTTVIARNTVAH--IHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPR 151 (402) Q Consensus 74 i~~~~~~~~e~~itID~~lya~~~Idd--lDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~ 151 (402) ++...+..++.++..-.+- .-..|.+ |+ ++..+ +.+.+.++++++|++..|+.++. +.... 
..| T Consensus 191 ~~~~~~~f~~i~~~~~k~~-~~v~is~ell~--ds~~~-l~~~i~~~l~~ai~~~~d~~~l~----G~G~~-----~~p- 256 (428) T protein:vir:10 191 AKVSEARFDDVKLTAKTMI-AMVPISNALIG--RAGFN-VEQLVLQDILTAISVREDKAFMR----DDGTG-----DTP- 256 (428) T ss_pred ccccccceeeEEeeeEEEE-EeehhhHHHHh--hhhHH-HHHHHHHHHHHHHHHHHHHHHhc----cCCCC-----ccc- Confidence 8777777777777664332 2222322 22 23455 67889999999999999998751 21110 000 Q ss_pred cccccc----ccccccCCccccccHHHHHHHHHHHHHHHHh-hcC-CccCcEEEeChHHHHHHhcccchhhcccccccCc Q lcl|Aclame:pro 152 VKGHGF----SINVNVTESEALANPQYVMAAVEYALEQQLE-QEV-DISDVAIMMPWKFFNALRDADRIVDKTYTISQSG 225 (402) Q Consensus 152 ~~g~~~----~~~v~~~~a~~~~~~~~l~dai~~a~~~Lde-kdV-P~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g 225 (402) .|... ...+.........+... .+...++...+.. .+. .....| |++|..|..|.+-.. .|..|-- . T Consensus 257 -~Gi~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~-v~n~~~~~~L~~lkd-~~G~~i~-~-- 329 (428) T protein:vir:10 257 -IGMKARATQWNRLLPWAADAAVNLDT-IDTYLDSIILMSMDGNSNMISSGW-GMSNRTYMKLFGLRD-GNGNKVY-P-- 329 (428) T ss_pred -cccccccccccccccccccccccHHH-HHHHHHHHHHhhhccccccccCEE-EEcHHHHHHHHHhhc-cCCceec-c-- Confidence 01110 00111111111111111 1222222222211 111 123344 779999988854111 1111110 0 Q ss_pred ccccceEEEEeccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccchh-- Q lcl|Aclame:pro 226 ATINGFVLSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKK-- 303 (402) Q Consensus 226 ~~~~G~V~~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~-- 303 (402) ...+| +++|+||+.|+++|...+... ....=+-+||+... +.. -.++..+..++.. 
T Consensus 330 ~~~~g---~l~G~pv~~~~~~p~~~~~~~---------~~~~i~~gd~s~~~-i~~---------~~~i~i~~~~~~~~~ 387 (428) T protein:vir:10 330 EMAQG---MLKGYPIQRTSAIPANLGEGG---------KESEIYFADFNDVV-IGE---------DGNMKVDFSKEASYI 387 (428) T ss_pred CCCCC---eeeceeeEEeccccccccCCC---------ccceEEEEecceEE-EEE---------ecceEEEeecccccc Confidence 11223 689999999999996432111 11111335555422 111 1122222222211 Q ss_pred -----------HHHHHHHHHHHhcCcccccceEEEEEEeec Q lcl|Aclame:pro 304 -----------EKTYYIDTFMAEGAIPDRWEAVSVVTTKRD 333 (402) Q Consensus 304 -----------~~~d~i~~~~a~Ga~vlRPeaa~vv~~~~~ 333 (402) +=.-.+++..-++..+.||++.+.++--.= T Consensus 388 ~~~~~~~~~f~~~~~~~R~~~r~d~~v~~p~a~~~~t~~~~ 428 (428) T protein:vir:10 388 DTDGKLVSAFSRNQSLIRVVTEHDIGFRHPEGLVLGTGVLF 428 (428) T ss_pred cccccccchhhcchhheeeeeeeCceeeccceEEEEeccCC Confidence 112345677789999999999877643222 No 143 >protein:vir:1025 Length: 408 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076679;genbank:gi:13095788;genbank:GeneID:920362 Probab=98.85 E-value=2.2e-10 Score=73.51 Aligned_cols=289 Identities=13% Similarity=0.047 Sum_probs=146.5 Q ss_pred CCCCcc-cccccccccccHHHHHH-HHHhHHHHHHHHHHhhhcccceeeeccc-cceEEeeeccc--eeeeeecCCCCCC Q lcl|Aclame:pro 1 MSTPNT-LTNVAVSASGEVDSLLI-EKFNGKVNEQYLKGENILSYFDVQTVTG-TNTVSNKYLGE--TELQVLAPGQSPN 75 (402) Q Consensus 1 Ms~~n~-~t~~~~~~~~d~~alfl-e~f~geV~t~f~~~sv~~~~~~~rti~~-Gksv~f~~iG~--~t~~~~~~G~~i~ 75 (402) |...+. ..+....+++.+-...| +.|+.+++......+.+++++++.++.+ ..++.++.... ..+..+.-|+.+. 
T Consensus 105 ~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~ 184 (408) T protein:vir:10 105 MAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIP 184 (408) T ss_pred hhhhhhhhhhhhhcccccCCceeccHhHHHHHHHHHHhhchhhhhcceeeccCCcceEEEeeccccccceeeecCccccc Confidence 211111 11111111111112223 8899999999999999999999888764 23344554432 3444555566665 Q ss_pred C-CCccccceeEeecceeeccchhhhHHHhh-cCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccccccc Q lcl|Aclame:pro 76 A-TPTQADKNQLVIDTTVIARNTVAHIHDVQ-GDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVK 153 (402) Q Consensus 76 ~-~~~~~~e~~itID~~lya~~~IddlDe~q-~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~ 153 (402) . ..+..++.++..-.+ +....-...=.+ +.+| +.+.+.+++++++++..|+.|+.-.- T Consensus 185 ~~~~~~~~~i~~~~~k~--~~~~~iS~ell~ds~~~-l~~~i~~~l~~~~~~~~~~~il~g~g----------------- 244 (408) T protein:vir:10 185 DLDNPQLTIIKYLIKRY--AGIITATNTSLKDTAEN-ILAWLSSWIAKKVVVTRNQAIIEVMK----------------- 244 (408) T ss_pred cccCcceeeEEeeeeeE--EeeehhHHHHHhhchHH-HHHHHHHHHHHHHHHHHHHHHhhccc----------------- Confidence 4 335556666655543 322221211111 3456 78899999999999999988753221 Q ss_pred ccccccccccCCccccccHHHHHHHHHHHH-HHHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccccCcccccceE Q lcl|Aclame:pro 154 GHGFSINVNVTESEALANPQYVMAAVEYAL-EQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATINGFV 232 (402) Q Consensus 154 g~~~~~~v~~~~a~~~~~~~~l~dai~~a~-~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~~~~G~V 232 (402) .+... .+ ..+ ++.|.++. ..|+...-+ .. ..|++|..|..|.+-..- |..|-- .. ...+|.. 
T Consensus 245 ---~~~~~--~~---~~~----~~~l~~~~~~~~~~~~~~-~a-~~v~n~~~~~~l~~lkd~-~G~~i~-~~-~~~~~~~ 307 (408) T protein:vir:10 245 ---AAPKK--PT---IAK----FDDVITMINTAVDPAIIA-TS-SLLTNQSGLNKLALVKTA-EGKYLL-EP-DPTKPNS 307 (408) T ss_pred ---ccccc--cc---ccc----HHHHHHHHHHhhhhhhcc-CC-EEEEcHHHHHHHHHhhcc-CCceEe-cc-CcCCCCC Confidence 11100 01 112 34454443 344443332 23 458999999998752211 122211 11 1234555 Q ss_pred EEEeccEEEecCc--cccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccch----hHHH Q lcl|Aclame:pro 233 LSSYNCPVIPSNR--FPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEK----KEKT 306 (402) Q Consensus 233 ~~iaG~~V~~SNn--lP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~----~~~~ 306 (402) .+++|+||+.+++ +|..+. +...=+-+||++...++ .-.+++.+..+.. .+-. T Consensus 308 ~~l~G~PV~~~~~~~~~~~~~------------~~~~i~~gd~~~~~~~~---------~~~~~~v~~~~~~~~~f~~~~ 366 (408) T protein:vir:10 308 YLIKGKQVIVVADRWLPNTGS------------TVYPLYYGDMSQAITLF---------DRENMSLLPTNIGAGAFETDT 366 (408) T ss_pred ceecceeeEEecccccCccCC------------CceEEEEEehhccEEEE---------EecceEEEEcccccchhhcCc Confidence 7899999998765 443221 11112345555432222 1122333332221 1222 Q ss_pred HHHHHHHHhcCcccccceEEEEEEeecc-CccccccchhhHHHh Q lcl|Aclame:pro 307 YYIDTFMAEGAIPDRWEAVSVVTTKRDA-TTGDAGGPGDDHATV 349 (402) Q Consensus 307 d~i~~~~a~Ga~vlRPeaa~vv~~~~~~-t~~~a~~~~~~~~~~ 349 (402) ..+++.+-|+.++++|++.+.+++...+ ..+..+++++ .+| T Consensus 367 ~~~r~~~r~d~~v~~~~a~~~~~~~~~~~~~~~~~~~~~--~~~ 408 (408) T protein:vir:10 367 TKIRVIDRFDVKATDSEALVAGSFSAIADQVGNFKTTTS--TAV 408 (408) T ss_pred eEEEEEEeeccEEeccccEEEEEeeccccCCCCCCCCCc--ccC Confidence 3456667799999999999888876421 1122221111 111 No 144 >protein:vir:1268 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690760;genbank:gi:22855000;genbank:GeneID:955203 Probab=98.84 E-value=9.2e-11 Score=75.59 Aligned_cols=274 Identities=12% Similarity=0.001 Sum_probs=149.7 Q ss_pred 
CCCCcc-cccccccccccHHHHHH-HHHhHHHHHHHHHHhhhcccceeeecccc-ceEEeee-ccceeeeeecCCCCCCC Q lcl|Aclame:pro 1 MSTPNT-LTNVAVSASGEVDSLLI-EKFNGKVNEQYLKGENILSYFDVQTVTGT-NTVSNKY-LGETELQVLAPGQSPNA 76 (402) Q Consensus 1 Ms~~n~-~t~~~~~~~~d~~alfl-e~f~geV~t~f~~~sv~~~~~~~rti~~G-ksv~f~~-iG~~t~~~~~~G~~i~~ 76 (402) +...+. ..+.....+...-...| +.|..++.......+.++++.++.++.++ ..+.+++ .+...+..+..|..+.. T Consensus 112 ~~~~~~~~~~a~~~~~~~~gg~lvP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~ 191 (397) T protein:vir:12 112 RDLLDSPEFRAMSGINDEDGGILIPEDIGRQIHEFKRQFEPLEQYVTVEPVTTRSGTRLLEKNADMVPFSPVEELGNLPE 191 (397) T ss_pred HHHHhhhhhhhccccccccCcccCchhHHHHHHHhhhhhhhHHhhcceeeccCCceeEEEEEecCCcceeeecccccccc Confidence 000000 00111111111122233 89999999999999999999988888753 3444554 56777888888887764 Q ss_pred -CCccccceeEeecceeeccchhhhHHHhh--cCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccccccc Q lcl|Aclame:pro 77 -TPTQADKNQLVIDTTVIARNTVAHIHDVQ--GDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVK 153 (402) Q Consensus 77 -~~~~~~e~~itID~~lya~~~IddlDe~q--~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~ 153 (402) ..+..++.++....+- ....-. +++. +.+| +.+.+.++++++|++..|..++.-. T Consensus 192 ~~~~~~~~v~~~~~k~~--~~~~is-~e~l~ds~~~-l~~~i~~~l~~~~~~~~d~~il~G~------------------ 249 (397) T protein:vir:12 192 IDQPRFTKVSYSIIDYG--GIMTLS-NSMLNDSDQA-IMTYVAKWFAKKSVVTRNNLILAAI------------------ 249 (397) T ss_pred cccccceeEEeeheeeE--eeehhh-HHHHhhchHH-HHHHHHHHHHHHHHHHHHHHHHhcc------------------ Confidence 4567777777775543 222111 1222 3455 7888999999999999999876321 Q ss_pred ccccccccccCCccccccHHHHHHHHHHHHH-HHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccccCcccccceE Q lcl|Aclame:pro 154 GHGFSINVNVTESEALANPQYVMAAVEYALE-QQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATINGFV 232 (402) Q Consensus 154 g~~~~~~v~~~~a~~~~~~~~l~dai~~a~~-~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~~~~G~V 232 (402) +.+. +....+ |+.|.++.. .|+..+- .+-..+++|..|..|.+-.. 
.+..|- -.....+|.- T Consensus 250 --g~~~------~~g~~~----~~~i~~~~~~~l~~~~~--~~a~~~~n~~~~~~L~~lkd-~~G~~l--~~~~~~~g~~ 312 (397) T protein:vir:12 250 --ASLK------KVDIDG----LDGIKKALNVTLDPMVA--PGSIVLTNQDGYDWLDTLKD-GTGRYL--LQPDPTNPTK 312 (397) T ss_pred --cccc------cccccc----HHHHHHHHhhccchhhh--CCCEEEEcHHHHHHHHHhhc-cCCcee--ecccccCCCC Confidence 0000 000111 344444442 4443332 23345899999999864211 011121 0111234555 Q ss_pred EEEeccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccch----hHHHHH Q lcl|Aclame:pro 233 LSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEK----KEKTYY 308 (402) Q Consensus 233 ~~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~----~~~~d~ 308 (402) .+++|+||+.+++....... |...=+-+||++..-+ +.-.++..+..+.. .+-... T Consensus 313 ~~l~G~pv~~~~~~~~~~~~-----------~~~~~~~gd~~~~~~~---------~~~~~~~i~~~~~~~~~f~~~~~~ 372 (397) T protein:vir:12 313 KLLDGRPVVPFTNRVLKTQK-----------GKAPLIIGNLKEAIVL---------FDREQQSIASTDTGAGAFETNSTK 372 (397) T ss_pred ccccceeeEEecccccccCC-----------CccEEEEEehhceEEE---------EeecceEEEEeccccchhhcCceE Confidence 78999999998874322111 1111133455432111 11122333332221 122346 Q ss_pred HHHHHHhcCcccccceEEEEEEeec Q lcl|Aclame:pro 309 IDTFMAEGAIPDRWEAVSVVTTKRD 333 (402) Q Consensus 309 i~~~~a~Ga~vlRPeaa~vv~~~~~ 333 (402) +++.+-++.++++|++.+.+++... 
T Consensus 373 ~r~~~r~d~~~~~~~a~~~~~~t~~ 397 (397) T protein:vir:12 373 VRGIEREDVRKWDEDAVVFGQITVE 397 (397) T ss_pred EEEEEeeccEEecccceEEEEEeeC Confidence 6777789999999999988888766 No 145 >protein:vir:95875 Length: 401 # NCBI annotation: major coat protein # Family: family:all:10944 # MgeID: mge:1586 # MgeName: N4 # Cross-refs: genbank:acc:YP_950534;genbank:gi:119952248;genbank:GeneID:5075702 Probab=98.82 E-value=1.3e-09 Score=69.34 Aligned_cols=316 Identities=12% Similarity=0.014 Sum_probs=164.0 Q ss_pred CCCCc-ccccccccccccH-HHHHHHHHhHHHHHHHHHHhhhcccceeeec--cccceEEeeeccceee--eeecCCCCC Q lcl|Aclame:pro 1 MSTPN-TLTNVAVSASGEV-DSLLIEKFNGKVNEQYLKGENILSYFDVQTV--TGTNTVSNKYLGETEL--QVLAPGQSP 74 (402) Q Consensus 1 Ms~~n-~~t~~~~~~~~d~-~alfle~f~geV~t~f~~~sv~~~~~~~rti--~~Gksv~f~~iG~~t~--~~~~~G~~i 74 (402) |-.-| +-.....+..|+. .+....-|.-.++..-.+--++..+-.+++| .+|||+.|.+--...- .-.+-|-+. T Consensus 1 ~~~~~a~~~~~~~s~~g~~~~~~~t~y~~~k~L~~Aa~~lv~~~fA~~~piPkn~GkTIk~r~y~pl~~~~~pl~eGv~a 80 (401) T protein:vir:95 1 MLNYNAPTDGQKSSIDGANSDQMQTFFWLKKAIITARKEQYFMPLASVTNMPKHYGKTIKVYEYVPLLDDRNINDQGIDA 80 (401) T ss_pred CCccCCCcccccccccccccceeeehhhHHHHHhhhhhhhhhhhcccccccccccCCeEEEEecccccccccchhcCCCc Confidence 44444 3333333333332 2333344555555555556777778788887 4799999986532211 112223322 Q ss_pred CCCCc------cc--cceeEe------------ecce-----------eeccchhhhHHHhh-cCcc-chhHHHHHHHH- Q lcl|Aclame:pro 75 NATPT------QA--DKNQLV------------IDTT-----------VIARNTVAHIHDVQ-GDID-SLKPKLAMNQA- 120 (402) Q Consensus 75 ~~~~~------~~--~e~~it------------ID~~-----------lya~~~IddlDe~q-~~~D-~vrse~s~~~G- 120 (402) .++.+ -+ +--+|| ++.. .=+.+|+.-=|+.+ .|.| .+-.++++++. 
T Consensus 81 ~G~~~~~g~~y~~~rdv~~it~~m~~~t~~~~rvn~v~~~~~d~~g~l~qyG~~~e~Td~~~dt~~D~~l~~h~s~ell~ 160 (401) T protein:vir:95 81 SGATIVNGNLYGSSKDIGNITSKLPLLTENGGRVNRVGFTRIAREGSIHKFGFFYEFTQESIDFDSDDGLMEHLSRELMN 160 (401) T ss_pred ccccccCccccccccccceeecccccccccccccccccceeeeeeeeeeeccCccchhhhhhhhhcchHHHHHHHHHHhh Confidence 33211 00 001111 2211 12334432222221 1111 12222333332 Q ss_pred HHHHHHHHHHHHHHHHhhhhhccccccccccccccccccccccCCccccccHHHHHHHHHHHHHHHHhhcCCc------- Q lcl|Aclame:pro 121 KQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKGHGFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDI------- 193 (402) Q Consensus 121 ~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~g~~~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~------- 193 (402) -+-.+..|. +-+.++.++... .+.. ........+.+...+...-++.|..+..+|+++..|. T Consensus 161 g~~~~t~d~-i~~dll~ag~~v--iyAg--------~ats~At~~~~~~~~t~vt~~~l~rl~~~L~~nRapk~t~~i~~ 229 (401) T protein:vir:95 161 GATQITEAV-LQKDLLAAAGTV--LYAG--------AATSDATITGEGSTPSVVSYKNLMRLDQILTENRTPTQTTIITG 229 (401) T ss_pred hhhhhHHHH-HHHHHHhhcCee--ecCC--------ccceeeeccccccccceechhHHHHHHHHHHhcccccchhhhhh Confidence 222333443 344554333211 0110 1011111111111222233788999999999877776 Q ss_pred ----------cCcEEEeCh------HHHHHHhcccchhhcccccccCcccccceEEEEeccEEEecCccccccCcccccc Q lcl|Aclame:pro 194 ----------SDVAIMMPW------KFFNALRDADRIVDKTYTISQSGATINGFVLSSYNCPVIPSNRFPTFAQDQAHHL 257 (402) Q Consensus 194 ----------~gR~~VV~P------~~y~~Ll~~~r~~n~d~~~~~~g~~~~G~V~~iaG~~V~~SNnlP~~~~~~t~~~ 257 (402) .-||.++.| +-++.|+.++.|+...--+ ..+...+|+|+++.+|+++.++.+--..+.+.... 
T Consensus 230 s~~~dTk~i~~s~va~~h~~L~~di~a~~D~~~~~~fi~v~kYa-~~~~i~~gEiG~i~~vR~i~~p~~~~w~~ag~~a~ 308 (401) T protein:vir:95 230 SRMIDTKVIGATRVMYVGSELVPELKAMKDLFGNKAFIETQHYA-DAGTIMNGEVGSIDKFRIIQVPEMLHWAGAGAQAT 308 (401) T ss_pred hhccCccccccceEEEEecCchhHHHHHHHhcCCCCceehhhcC-CccccccccccccCceeEEecccceeecCCccccc Confidence 227888877 4557788889999874323 34456799999999999999998643222211111 Q ss_pred c-------cccCCccccceeeeccceeEEeecHHHhhhhhhccccee--------e--c-----cchhHHHHHHHHHHHh Q lcl|Aclame:pro 258 L-------SNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGD--------I--F-----YEKKEKTYYIDTFMAE 315 (402) Q Consensus 258 l-------s~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e--------~--~-----~d~~~~~d~i~~~~a~ 315 (402) . +....|..+++.. +|++-++|.+++.+...-.- . + .|+--|-=.+-=|+.| T Consensus 309 ~~~~~y~~~~~~~gg~~dVyp------~lV~G~dAf~~~~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQ~g~vgwK~~~ 382 (401) T protein:vir:95 309 GANPGYRTSMVSGQEHYDVYP------MLVVGDDSFTSIGFQTDGKSLKFTVMTKMPGKETADRNDPYGETGFSSIKWYY 382 (401) T ss_pred ccccccccccccCCCcceeee------eeEEccccceecccccCCccccceeEeecCCcCCCCCCCcccceehhhhhhhh Confidence 1 1122344555432 68889999999887764311 0 0 3444455566678899 Q ss_pred cCcccccceEEEEEEeecc Q lcl|Aclame:pro 316 GAIPDRWEAVSVVTTKRDA 334 (402) Q Consensus 316 Ga~vlRPeaa~vv~~~~~~ 334 (402) ++.+||||..+.|++.... 
T Consensus 383 a~~vL~~e~m~~ies~a~~ 401 (401) T protein:vir:95 383 GILVKRPERLALIKTVAPL 401 (401) T ss_pred hhheeccceeEEEEeecCC Confidence 9999999999998887665 No 146 >protein:vir:94424 Length: 387 # NCBI annotation: ORF010 # Family: family:all:658 # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240005;genbank:gi:66395666;genbank:GeneID:5133084 Probab=98.80 E-value=4.9e-11 Score=77.12 Aligned_cols=276 Identities=10% Similarity=0.027 Sum_probs=143.5 Q ss_pred CCCC---------cccccccccccccHHHHHH-HHHhHHHHHHHHHHhhhcccceeeeccccceEEeeec--cceeeeee Q lcl|Aclame:pro 1 MSTP---------NTLTNVAVSASGEVDSLLI-EKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYL--GETELQVL 68 (402) Q Consensus 1 Ms~~---------n~~t~~~~~~~~d~~alfl-e~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~~i--G~~t~~~~ 68 (402) |... ....+.-..+++..-...| +.|..++++.....+.++++.+++++.+. ++|++ +..++..+ T Consensus 99 ~~~~~~~~~~~~~~~~~~a~~~~~~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~---~~p~~~~~~~~a~~v 175 (387) T protein:vir:94 99 ILPNEFEKPSMEAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKGL---EIPRVSYTLDDDDFI 175 (387) T ss_pred HhhhhHHHHHHHHHHHHhhhccCCCCCCceeechhHHHHHHHHHHhhchhhhhceeeecCCc---eeeeeeccCCccccc Confidence 0000 0000000001111112334 78899999999999999999998887543 34432 34556666 Q ss_pred cCCCCCCCCCccccceeEeecceeeccchhhhHHHhh-cCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccc Q lcl|Aclame:pro 69 APGQSPNATPTQADKNQLVIDTTVIARNTVAHIHDVQ-GDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAER 147 (402) Q Consensus 69 ~~G~~i~~~~~~~~e~~itID~~lya~~~IddlDe~q-~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~ 147 (402) .-|+..+...++.+++++.+..+ ..+..-.-+=.+ +.+| +.+.+.+++++++++..++.+|... . 
T Consensus 176 ~Eg~~~~~~~~~f~~v~l~~~k~--~~~i~iS~ell~ds~~~-l~~~i~~~la~~~~~~e~~~~~~~g---~-------- 241 (387) T protein:vir:94 176 TDVETAKELKAKGDTVKFTTNKF--KVFAAISDTVIHGSDVD-LVNWVENALQSGLAAKERKDALAVS---P-------- 241 (387) T ss_pred cccccccccccccceeeechhee--eeechhhHHHHhhhHHH-HHHHHHHHHHHHHHHHHHHhHhhcC---C-------- Confidence 77777777677777777666543 333222211111 2355 6788888899999887666554211 0 Q ss_pred ccccccccccccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccccCccc Q lcl|Aclame:pro 148 NKPRVKGHGFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGAT 227 (402) Q Consensus 148 ~~~~~~g~~~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~~ 227 (402) ..+...+.....+ ....+....+|.|.++...|+..+.+ ...| |+++..|..|++-.+=.++ .+ T Consensus 242 ----g~g~~~g~~~~~~--~~~~~~~~~~d~i~~~~~~l~~~y~~-na~~-imn~~t~~~~~~~~~~~~~--------~~ 305 (387) T protein:vir:94 242 ----KSGLEHMSFYNGS--VKEVEGADMYDAIINALADLHEDYRD-NATI-YMRYADYVKIISVLSNGTT--------NF 305 (387) T ss_pred ----Cccccceeeeccc--cccccccchHHHHHHHHhccChhhhc-CCEE-EEechHHHHHHHHHhcCCC--------cc Confidence 0011111111111 11122344688888888888877665 4566 5666666665531110112 12 Q ss_pred ccceEEEEeccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccchhHHHH Q lcl|Aclame:pro 228 INGFVLSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKEKTY 307 (402) Q Consensus 228 ~~G~V~~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~~~~d 307 (402) ..|.-.++.|.||+.++..|.. +-+||+.... .+. ++..+.+++...-.. 
T Consensus 306 ~~~~~~~llG~PV~~~~~~~~~-------------------~~GDf~~~~~-~~~----------~~~~~~~~~~~~~~~ 355 (387) T protein:vir:94 306 FDTPAEKVFGKPVVFTDAAVKP-------------------IVGDFNYFGI-NYD----------GTTYDTDKDVKKGEY 355 (387) T ss_pred cccCCccccccceEEecCCCce-------------------eeechhhhhh-hhh----------hhhheecccccCCce Confidence 2333357899999999876531 2244443211 110 111122222221111 Q ss_pred HHHHHHHhcCcccccceEEEEEEeeccCcccccc Q lcl|Aclame:pro 308 YIDTFMAEGAIPDRWEAVSVVTTKRDATTGDAGG 341 (402) Q Consensus 308 ~i~~~~a~Ga~vlRPeaa~vv~~~~~~t~~~a~~ 341 (402) .+.+.+=|+.++++|++...++.+..+ ++.|+ T Consensus 356 ~~~~~~r~Dg~v~~~~A~~~l~~ka~~--~~~~~ 387 (387) T protein:vir:94 356 LFVLTAWYDQQRTLDSAFRIAKAKENT--GPLPS 387 (387) T ss_pred EEEEEEEeCcEeechhheEEEEeecCC--CCCCC Confidence 222333489999999999887776543 44444 No 147 >protein:vir:2685 Length: 387 # NCBI annotation: hypothetical protein # Family: family:all:658 # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075504;genbank:gi:12719433;genbank:GeneID:920169 Probab=98.80 E-value=4.9e-11 Score=77.12 Aligned_cols=276 Identities=10% Similarity=0.027 Sum_probs=143.5 Q ss_pred CCCC---------cccccccccccccHHHHHH-HHHhHHHHHHHHHHhhhcccceeeeccccceEEeeec--cceeeeee Q lcl|Aclame:pro 1 MSTP---------NTLTNVAVSASGEVDSLLI-EKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYL--GETELQVL 68 (402) Q Consensus 1 Ms~~---------n~~t~~~~~~~~d~~alfl-e~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~~i--G~~t~~~~ 68 (402) |... ....+.-..+++..-...| +.|..++++.....+.++++.+++++.+. 
++|++ +..++..+ T Consensus 99 ~~~~~~~~~~~~~~~~~~a~~~~~~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~---~~p~~~~~~~~a~~v 175 (387) T protein:vir:26 99 ILPNEFEKPSMEAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKGL---EIPRVSYTLDDDDFI 175 (387) T ss_pred HhhhhHHHHHHHHHHHHhhhccCCCCCCceeechhHHHHHHHHHHhhchhhhhceeeecCCc---eeeeeeccCCccccc Confidence 0000 0000000001111112334 78899999999999999999998887543 34432 34556666 Q ss_pred cCCCCCCCCCccccceeEeecceeeccchhhhHHHhh-cCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccc Q lcl|Aclame:pro 69 APGQSPNATPTQADKNQLVIDTTVIARNTVAHIHDVQ-GDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAER 147 (402) Q Consensus 69 ~~G~~i~~~~~~~~e~~itID~~lya~~~IddlDe~q-~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~ 147 (402) .-|+..+...++.+++++.+..+ ..+..-.-+=.+ +.+| +.+.+.+++++++++..++.+|... . T Consensus 176 ~Eg~~~~~~~~~f~~v~l~~~k~--~~~i~iS~ell~ds~~~-l~~~i~~~la~~~~~~e~~~~~~~g---~-------- 241 (387) T protein:vir:26 176 TDVETAKELKAKGDTVKFTTNKF--KVFAAISDTVIHGSDVD-LVNWVENALQSGLAAKERKDALAVS---P-------- 241 (387) T ss_pred cccccccccccccceeeechhee--eeechhhHHHHhhhHHH-HHHHHHHHHHHHHHHHHHHhHhhcC---C-------- Confidence 77777777677777777666543 333222211111 2355 6788888899999887666554211 0 Q ss_pred ccccccccccccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccccCccc Q lcl|Aclame:pro 148 NKPRVKGHGFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGAT 227 (402) Q Consensus 148 ~~~~~~g~~~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~~ 227 (402) ..+...+.....+ ....+....+|.|.++...|+..+.+ ...| |+++..|..|++-.+=.++ .+ T Consensus 242 ----g~g~~~g~~~~~~--~~~~~~~~~~d~i~~~~~~l~~~y~~-na~~-imn~~t~~~~~~~~~~~~~--------~~ 305 (387) T protein:vir:26 242 ----KSGLEHMSFYNGS--VKEVEGADMYDAIINALADLHEDYRD-NATI-YMRYADYVKIISVLSNGTT--------NF 305 (387) T ss_pred ----Cccccceeeeccc--cccccccchHHHHHHHHhccChhhhc-CCEE-EEechHHHHHHHHHhcCCC--------cc Confidence 0011111111111 11122344688888888888877665 4566 5666666665531110112 12 Q ss_pred 
ccceEEEEeccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccchhHHHH Q lcl|Aclame:pro 228 INGFVLSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKEKTY 307 (402) Q Consensus 228 ~~G~V~~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~~~~d 307 (402) ..|.-.++.|.||+.++..|.. +-+||+.... .+. ++..+.+++...-.. T Consensus 306 ~~~~~~~llG~PV~~~~~~~~~-------------------~~GDf~~~~~-~~~----------~~~~~~~~~~~~~~~ 355 (387) T protein:vir:26 306 FDTPAEKVFGKPVVFTDAAVKP-------------------IVGDFNYFGI-NYD----------GTTYDTDKDVKKGEY 355 (387) T ss_pred cccCCccccccceEEecCCCce-------------------eeechhhhhh-hhh----------hhhheecccccCCce Confidence 2333357899999999876531 2244443211 110 111122222221111 Q ss_pred HHHHHHHhcCcccccceEEEEEEeeccCcccccc Q lcl|Aclame:pro 308 YIDTFMAEGAIPDRWEAVSVVTTKRDATTGDAGG 341 (402) Q Consensus 308 ~i~~~~a~Ga~vlRPeaa~vv~~~~~~t~~~a~~ 341 (402) .+.+.+=|+.++++|++...++.+..+ ++.|+ T Consensus 356 ~~~~~~r~Dg~v~~~~A~~~l~~ka~~--~~~~~ 387 (387) T protein:vir:26 356 LFVLTAWYDQQRTLDSAFRIAKAKENT--GPLPS 387 (387) T ss_pred EEEEEEEeCcEeechhheEEEEeecCC--CCCCC Confidence 222333489999999999887776543 44444 No 148 >protein:vir:96978 Length: 387 # NCBI annotation: ORF009 # Family: family:all:658 # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239859;genbank:gi:66395517;genbank:GeneID:5133011 Probab=98.80 E-value=4.9e-11 Score=77.12 Aligned_cols=276 Identities=10% Similarity=0.027 Sum_probs=143.5 Q ss_pred CCCC---------cccccccccccccHHHHHH-HHHhHHHHHHHHHHhhhcccceeeeccccceEEeeec--cceeeeee Q lcl|Aclame:pro 1 MSTP---------NTLTNVAVSASGEVDSLLI-EKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYL--GETELQVL 68 (402) Q Consensus 1 Ms~~---------n~~t~~~~~~~~d~~alfl-e~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~~i--G~~t~~~~ 68 (402) |... ....+.-..+++..-...| +.|..++++.....+.++++.+++++.+. 
++|++ +..++..+ T Consensus 99 ~~~~~~~~~~~~~~~~~~a~~~~~~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~---~~p~~~~~~~~a~~v 175 (387) T protein:vir:96 99 ILPNEFEKPSMEAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKGL---EIPRVSYTLDDDDFI 175 (387) T ss_pred HhhhhHHHHHHHHHHHHhhhccCCCCCCceeechhHHHHHHHHHHhhchhhhhceeeecCCc---eeeeeeccCCccccc Confidence 0000 0000000001111112334 78899999999999999999998887543 34432 34556666 Q ss_pred cCCCCCCCCCccccceeEeecceeeccchhhhHHHhh-cCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccc Q lcl|Aclame:pro 69 APGQSPNATPTQADKNQLVIDTTVIARNTVAHIHDVQ-GDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAER 147 (402) Q Consensus 69 ~~G~~i~~~~~~~~e~~itID~~lya~~~IddlDe~q-~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~ 147 (402) .-|+..+...++.+++++.+..+ ..+..-.-+=.+ +.+| +.+.+.+++++++++..++.+|... . T Consensus 176 ~Eg~~~~~~~~~f~~v~l~~~k~--~~~i~iS~ell~ds~~~-l~~~i~~~la~~~~~~e~~~~~~~g---~-------- 241 (387) T protein:vir:96 176 TDVETAKELKAKGDTVKFTTNKF--KVFAAISDTVIHGSDVD-LVNWVENALQSGLAAKERKDALAVS---P-------- 241 (387) T ss_pred cccccccccccccceeeechhee--eeechhhHHHHhhhHHH-HHHHHHHHHHHHHHHHHHHhHhhcC---C-------- Confidence 77777777677777777666543 333222211111 2355 6788888899999887666554211 0 Q ss_pred ccccccccccccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccccCccc Q lcl|Aclame:pro 148 NKPRVKGHGFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGAT 227 (402) Q Consensus 148 ~~~~~~g~~~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~~ 227 (402) ..+...+.....+ ....+....+|.|.++...|+..+.+ ...| |+++..|..|++-.+=.++ .+ T Consensus 242 ----g~g~~~g~~~~~~--~~~~~~~~~~d~i~~~~~~l~~~y~~-na~~-imn~~t~~~~~~~~~~~~~--------~~ 305 (387) T protein:vir:96 242 ----KSGLEHMSFYNGS--VKEVEGADMYDAIINALADLHEDYRD-NATI-YMRYADYVKIISVLSNGTT--------NF 305 (387) T ss_pred ----Cccccceeeeccc--cccccccchHHHHHHHHhccChhhhc-CCEE-EEechHHHHHHHHHhcCCC--------cc Confidence 0011111111111 11122344688888888888877665 4566 5666666665531110112 12 Q ss_pred 
ccceEEEEeccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccchhHHHH Q lcl|Aclame:pro 228 INGFVLSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKEKTY 307 (402) Q Consensus 228 ~~G~V~~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~~~~d 307 (402) ..|.-.++.|.||+.++..|.. +-+||+.... .+. ++..+.+++...-.. T Consensus 306 ~~~~~~~llG~PV~~~~~~~~~-------------------~~GDf~~~~~-~~~----------~~~~~~~~~~~~~~~ 355 (387) T protein:vir:96 306 FDTPAEKVFGKPVVFTDAAVKP-------------------IVGDFNYFGI-NYD----------GTTYDTDKDVKKGEY 355 (387) T ss_pred cccCCccccccceEEecCCCce-------------------eeechhhhhh-hhh----------hhhheecccccCCce Confidence 2333357899999999876531 2244443211 110 111122222221111 Q ss_pred HHHHHHHhcCcccccceEEEEEEeeccCcccccc Q lcl|Aclame:pro 308 YIDTFMAEGAIPDRWEAVSVVTTKRDATTGDAGG 341 (402) Q Consensus 308 ~i~~~~a~Ga~vlRPeaa~vv~~~~~~t~~~a~~ 341 (402) .+.+.+=|+.++++|++...++.+..+ ++.|+ T Consensus 356 ~~~~~~r~Dg~v~~~~A~~~l~~ka~~--~~~~~ 387 (387) T protein:vir:96 356 LFVLTAWYDQQRTLDSAFRIAKAKENT--GPLPS 387 (387) T ss_pred EEEEEEEeCcEeechhheEEEEeecCC--CCCCC Confidence 222333489999999999887776543 44444 No 149 >protein:vir:78640 Length: 352 # NCBI annotation: phage capsid # Family: family:all:658 # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429943;genbank:gi:156603997;genbank:GeneID:5525386 Probab=98.79 E-value=1.1e-10 Score=75.27 Aligned_cols=275 Identities=12% Similarity=0.039 Sum_probs=140.6 Q ss_pred CCCCcccccccccccccHHHHHH-HHHhHHHHHHHHHHhhhcccceeeeccccceEEeeec--cceeeeeecCCCCCCCC Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDSLLI-EKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYL--GETELQVLAPGQSPNAT 77 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~alfl-e~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~~i--G~~t~~~~~~G~~i~~~ 77 (402) +.......+....+++..-...| +.|..++++..+..+.++++.++.++.+ . ++|++ +..++....-|+.+... 
T Consensus 73 ~~~~~~~~~al~~~~~~~gG~lIP~~~~~~Ii~~l~~~s~l~~~~~v~~~~~-~--~~p~~~~~~~~a~~v~E~~~~~~~ 149 (352) T protein:vir:78 73 SMEAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKG-L--EIPRVSYTLDDDDFITDVETAKEL 149 (352) T ss_pred HhhHHHHHHHhccCCCCCCceeccHhHHHHHHHHHHhhcchhhheeeEecCC-c--eEEEEecCCCcccccccccccccc Confidence 00000000000011111112244 8899999999999999999999887654 2 34432 33455566666677666 Q ss_pred CccccceeEeecceeeccchhhhHHHhh--cCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccccccccc Q lcl|Aclame:pro 78 PTQADKNQLVIDTTVIARNTVAHIHDVQ--GDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKGH 155 (402) Q Consensus 78 ~~~~~e~~itID~~lya~~~IddlDe~q--~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~g~ 155 (402) .+..+++++.+.. ++.+.--.- +.. +.+| +.+.+.+++++++++..++.++.. +.- .+. T Consensus 150 ~~~f~~v~~~~~k--~~~~i~is~-ell~Ds~~~-l~~~i~~~la~~~~~~e~~~~~~~---g~g------------~~~ 210 (352) T protein:vir:78 150 KLKGDTVKFTTNK--FKVFAAISD-TVIHGSDVD-LVNWVENALQSGLAAKERKDALAV---SPK------------SGL 210 (352) T ss_pred cccceeeeeccee--EEeechhhH-HHHhhhhHH-HHHHHHHHHHHHHHHHHHHhhhhc---CCC------------Ccc Confidence 6777776666644 333322121 222 2355 678888888888887645434311 100 000 Q ss_pred ccccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccccCcccccceEEEE Q lcl|Aclame:pro 156 GFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATINGFVLSS 235 (402) Q Consensus 156 ~~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~~~~G~V~~i 235 (402) ..+..... .....+....||.|.++...|+..+.. ...| |++|..|..|++-.+=.++. 
+..|.=.++ T Consensus 211 ~~g~l~~~--~~~~~t~~~~~d~i~~~~~~l~~~~~~-~a~~-~mn~~t~~~l~~~~~~~~~~--------~~~~~~~~l 278 (352) T protein:vir:78 211 EHMSFYNG--SVKEVEGANMYDAIINALADLHEDYRD-NATI-YMRYADYVKIISVLSNGTTN--------FFDTPAEKV 278 (352) T ss_pred cccceecc--ccccccccchHHHHHHHHhccChhhhc-CCEE-EEehHHHHHHHHHHhccCCc--------ccccCCccc Confidence 01100000 011122333578888888888776544 3455 77888887776521111111 122333468 Q ss_pred eccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccchhHHHHHHHHHHHh Q lcl|Aclame:pro 236 YNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKEKTYYIDTFMAE 315 (402) Q Consensus 236 aG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~~~~d~i~~~~a~ 315 (402) +|.||+.++..|.. +-+||+..... + + .+..+.+.+.......+.+.+=| T Consensus 279 lG~PV~~~~~~~~~-------------------~~Gdf~~~~~~-~--~--------~~~~~~~~~~~~g~~~f~~~~r~ 328 (352) T protein:vir:78 279 FGKPVVFTDAAVKP-------------------IVGDFNYFGIN-Y--D--------GTTYDTDKDVKKGEYLFVLTAWY 328 (352) T ss_pred cccceEEecCCCce-------------------eEeehhhhhhh-h--h--------hheeeeeccccCCeeEEEEEeee Confidence 99999998865421 22444332110 0 0 11222333322222333445568 Q ss_pred cCcccccceEEEEEEeeccCcccccc Q lcl|Aclame:pro 316 GAIPDRWEAVSVVTTKRDATTGDAGG 341 (402) Q Consensus 316 Ga~vlRPeaa~vv~~~~~~t~~~a~~ 341 (402) +.+++|||+...++.+. 
.+++.|+ T Consensus 329 Dg~~~~~eA~~~l~~~a--~~~~~~~ 352 (352) T protein:vir:78 329 DQQRTLDSAFRIAKAKE--STGSLPS 352 (352) T ss_pred CceeechhheEEEEeec--ccCCCCC Confidence 99999999876665543 3344444 No 150 >protein:vir:3845 Length: 395 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050151;swissprot:trembl:q9t1f6;genbank:gi:9633043;uniprot:Q9T1F6;genbank:GeneID:1262163 Probab=98.78 E-value=6.1e-10 Score=71.10 Aligned_cols=279 Identities=11% Similarity=-0.016 Sum_probs=145.9 Q ss_pred CCCCcccccccccccccHHHHHH-HHHhHHHHHHHHHHhhhcccceeeeccc-cceEEeeeccce--eeeeecCCCCCCC Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDSLLI-EKFNGKVNEQYLKGENILSYFDVQTVTG-TNTVSNKYLGET--ELQVLAPGQSPNA 76 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~alfl-e~f~geV~t~f~~~sv~~~~~~~rti~~-Gksv~f~~iG~~--t~~~~~~G~~i~~ 76 (402) |+.... .+++ -...| +.|+.+++......+.++++.++.++.+ ..+..++..... .+....-|+.+.. T Consensus 105 ~~~~~~-------~~~~-gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~ 176 (395) T protein:vir:38 105 VTSGTT-------GTGN-AGLTIPEDIQLQIRTLTRSFTSLESLANVENVTTSHGSRVYEKLADITPLKDLDDESALIGD 176 (395) T ss_pred HhhccC-------ccCC-CceecchhHhhHHHHHHHhhcchhhhcceeeccCCcceEEEEeeccCCcccccccccccccc Confidence 222111 1111 11223 7889999999999999999998888754 334445554432 2334444555543 Q ss_pred C-CccccceeEeecceeeccchhhhHHHhh--cCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccccccc Q lcl|Aclame:pro 77 T-PTQADKNQLVIDTTVIARNTVAHIHDVQ--GDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVK 153 (402) Q Consensus 77 ~-~~~~~e~~itID~~lya~~~IddlDe~q--~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~ 153 (402) . .+..+++++.... ++.+..-.- ++. +.+| +.+.+.+++++++++..|+.|+.-. 
T Consensus 177 ~~~~~f~~v~~~~~k--~~~~~~iS~-ell~ds~~~-l~~~i~~~la~~~~~~~~~~il~g~------------------ 234 (395) T protein:vir:38 177 NDDPELTVVKYLIHR--YAGITTVTN-TLLKDTVDN-IIQWLVNWAAKKDVVTRNAKILEVM------------------ 234 (395) T ss_pred ccccceeeEEeeeee--eEeehhhHH-HHHhhhHHH-HHHHHHHHHHHHHHHHHHHHHhhcc------------------ Confidence 3 3555555555443 333322111 222 3455 6889999999999999998876311 Q ss_pred ccccccccccCCccccccHHHHHHHHHHHHH-HHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccccCcccccceE Q lcl|Aclame:pro 154 GHGFSINVNVTESEALANPQYVMAAVEYALE-QQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATINGFV 232 (402) Q Consensus 154 g~~~~~~v~~~~a~~~~~~~~l~dai~~a~~-~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~~~~G~V 232 (402) +.+...+ + ..+ |+.|.++.. .|+...-+ .-..|++|..|..|.+-.. .+..|- -.+...+|.- T Consensus 235 --g~~~~~~--~---~~~----~~~i~~~~~~~l~~~~~~--~a~~v~n~~~~~~L~~lkd-~~G~~l--~~~~~~~~~~ 298 (395) T protein:vir:38 235 --GKAPKKP--T---ISQ----FDNIKDLENNTLDPAIES--TSSFITNQSGYNILSKVKD-ADGRYL--MQPDVTSPDK 298 (395) T ss_pred --ccccccc--c---ccc----HHHHHHHHHHhhhhhhcC--CCEEEEcHHHHHHHHHhhc-cCCcee--eccCcCCCCc Confidence 1111110 1 112 233443332 34433332 3346899999999865211 011111 0112335556 Q ss_pred EEEeccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccch-hHH---HHH Q lcl|Aclame:pro 233 LSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEK-KEK---TYY 308 (402) Q Consensus 233 ~~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~-~~~---~d~ 308 (402) .+++|+||+.+.+.|..... +...-+-+||++..- .+...++..+..+.. ..| ... 
T Consensus 299 ~~l~G~pV~~~~~~~~~~~~-----------~~~~i~~gd~~~~~~---------i~~~~~~~i~~~~~~~~~~~~~~~~ 358 (395) T protein:vir:38 299 YLIDGKPVIRIADKWLPDVS-----------GSHPLYFGDLKQGIT---------LFDRQQMQIDTTNVGAGSFEHDTTK 358 (395) T ss_pred ceeccceeEEecccccCcCC-----------CcceEEEEeccccEE---------EEEecceEEEEeccccchhhcCceE Confidence 78999999999886543211 111113345544211 112233444443322 222 234 Q ss_pred HHHHHHhcCcccccceEEEEEEeeccCccccccchhhH Q lcl|Aclame:pro 309 IDTFMAEGAIPDRWEAVSVVTTKRDATTGDAGGPGDDH 346 (402) Q Consensus 309 i~~~~a~Ga~vlRPeaa~vv~~~~~~t~~~a~~~~~~~ 346 (402) +++..-||..+++|++.+.++.+..++.+.++.. ++- T Consensus 359 ~r~~~r~d~~~~~~~a~~~~~~~~~~~~~~~~~~-~~~ 395 (395) T protein:vir:38 359 LRFIDRFDVQLIDDGAFAAASFKTVANQAQGTAG-TGK 395 (395) T ss_pred EEEEEeeccEEecccceEEEEeecccCCCCCccC-CCC Confidence 5566668999999999999988765444333311 222 No 151 >protein:vir:9704 Length: 394 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795466;genbank:gi:28876225;genbank:GeneID:1257769 Probab=98.74 E-value=6.4e-10 Score=70.97 Aligned_cols=263 Identities=11% Similarity=0.021 Sum_probs=145.5 Q ss_pred CCCCcccccccccccccHHH-HHHHHHhHHHHHHHHHHhhhcccceeeeccccceEEeeec--cceeeeeecCCCCCCC- Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDS-LLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYL--GETELQVLAPGQSPNA- 76 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~a-lfle~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~~i--G~~t~~~~~~G~~i~~- 76 (402) +...+...... +...-. +.=+.|..++.......+.++++.++.++.+|+ .++|+. +..++..+.-|..... 
T Consensus 121 ~~~~~~~~~~~---t~~~gg~liP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~-~~~~~~~~~~~~~~~v~E~~~~~~~ 196 (394) T protein:vir:97 121 TTPVEPQKDGI---KKENAKPVSSEEILYTPAREVKTVVDLKPFTTVYQAKKAS-GKYPVLQRATTKMVTVAELEKNPAL 196 (394) T ss_pred hhhhhhhcccc---ccccccccChHHHHHHHHHHhhhhhhhhhhceeeeccCcc-eEEEEEecCCCccceeccccccccc Confidence 11111111110 111111 222788889998888899999999988877654 566654 4455666666666654 Q ss_pred CCccccceeEeecceeeccchhhhHHHhh--cCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccccccc Q lcl|Aclame:pro 77 TPTQADKNQLVIDTTVIARNTVAHIHDVQ--GDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKG 154 (402) Q Consensus 77 ~~~~~~e~~itID~~lya~~~IddlDe~q--~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~g 154 (402) ..+..++.++....+- .-..|.+ +.. +.+| +.+.+..+++++|++..|+.|+..+- T Consensus 197 ~~~~~~~v~l~~~k~~-~~i~is~--ell~ds~~~-~~~~i~~~la~~~~~~~~~~i~~g~~------------------ 254 (394) T protein:vir:97 197 AKPDFKDVAWNIDTYR-GAIPLSQ--ESIDDADVD-LVGIVSESISQIKVNTTNDAIAKVLK------------------ 254 (394) T ss_pred ccccceeEEeehhhee-eehhhHH--HHHhhhhHH-HHHHHHHHHHHHHHHHHHHHHhhccc------------------ Confidence 4566777777775432 1122221 222 3455 77889999999999999988753221 Q ss_pred cccccccccCCccccccHHHHHHHHHHHHHH-HHhhcCCccCcEEEeChHHHHHHhcc----cchhhcccccccCccccc Q lcl|Aclame:pro 155 HGFSINVNVTESEALANPQYVMAAVEYALEQ-QLEQEVDISDVAIMMPWKFFNALRDA----DRIVDKTYTISQSGATIN 229 (402) Q Consensus 155 ~~~~~~v~~~~a~~~~~~~~l~dai~~a~~~-LdekdVP~~gR~~VV~P~~y~~Ll~~----~r~~n~d~~~~~~g~~~~ 229 (402) .+ .+....+ ++.|.++... ++. ...-..|++|..|..|.+- .+++ |. 
....+ T Consensus 255 --~~------~~~~~~~----~~~~~~~~~~~~~~----~~~a~~v~n~~~~~~l~~lkd~~G~~i---~~----~~~~~ 311 (394) T protein:vir:97 255 --SF------TTKTVKN----LDEIKALLNGGFDP----AYNVSLIVSQSFYQTLDTLKDGNGRYL---LQ----DDITA 311 (394) T ss_pred --cc------ccccccc----HHHHHHHHHhhhhh----hhCCEEEEcHHHHHHHHHhhccCCCee---ee----cCcCC Confidence 00 0011112 2233333322 221 1223458999999998642 2222 11 11223 Q ss_pred ceEEEEeccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccchhHHHHHH Q lcl|Aclame:pro 230 GFVLSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKEKTYYI 309 (402) Q Consensus 230 G~V~~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~~~~d~i 309 (402) |.-++++|+||+.+++.+.+.+. =+-+||++...++- ..++..+..+ ...+...+ T Consensus 312 ~~~~~l~G~pv~~~~~~~~~~~~---------------~~~gd~~~~~~~~~---------~~~~~~~~~~-~~~~~~~~ 366 (394) T protein:vir:97 312 VSGKVLLGKPVFVLSDEVLGANK---------------AFIGDFKRGVLFAD---------RKDLGLRWAD-NEIYGQYL 366 (394) T ss_pred CCCceeccceeEEecccccCCcc---------------EEEeeccccEEEEE---------ecceEEEEec-ccccceeE Confidence 44468999999997765432210 13355554222111 1223333222 23334456 Q ss_pred HHHHHhcCcccccceEEEEEEeeccCcc Q lcl|Aclame:pro 310 DTFMAEGAIPDRWEAVSVVTTKRDATTG 337 (402) Q Consensus 310 ~~~~a~Ga~vlRPeaa~vv~~~~~~t~~ 337 (402) ++.+=||.++.+|++.+.|+++..++|- T Consensus 367 ~~~~r~d~~v~~~~a~~~~~~~~~~~p~ 394 (394) T protein:vir:97 367 QAVLRFGVSKVDDKAGYYVTFTPEPLPL 394 (394) T ss_pred EEEEEEccEEecccceEEEEecccccCC Confidence 7888899999999999999887666665 No 152 >protein:vir:102944 Length: 330 # NCBI annotation: major head protein # Family: family:all:1522 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945286;genbank:gi:39653721;uniprot:Q708M6;genbank:GeneID:2672858 Probab=98.73 E-value=3e-10 Score=72.76 Aligned_cols=307 Identities=12% Similarity=0.044 Sum_probs=169.1 Q ss_pred CCCCcccccccccccccHHHHHH-HHHhHHHHHHHHHHhhhcc---cce---eee-c-cccceEEeeeccce--eeeeec Q lcl|Aclame:pro 1 
MSTPNTLTNVAVSASGEVDSLLI-EKFNGKVNEQYLKGENILS---YFD---VQT-V-TGTNTVSNKYLGET--ELQVLA 69 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~alfl-e~f~geV~t~f~~~sv~~~---~~~---~rt-i-~~Gksv~f~~iG~~--t~~~~~ 69 (402) |++-. +---++|+ |+|...|.....+.+.|.. +++ +.. + .+|+++.||..+.+ ..+.+. T Consensus 1 Ma~~~----------T~l~d~i~pevf~~yv~~~~~~~~~l~qSG~i~~~~~i~~~~~~~G~~i~~P~~~~l~G~~~~~~ 70 (330) T protein:vir:10 1 MANEL----------TKILDTITPQQYNAYMQQYTAAKSAFVQSGIAVSDERVSKNITSGGLLVNMPFWNDLTGDSEVLG 70 (330) T ss_pred CCCCc----------eEeeeeechhHHHHHHHHHhHHhhhhhhcccccccHHHHHHhhcCCCEEEecccccCCCcccccC Confidence 66421 22235666 8899889888888776642 122 222 2 36999999998876 344554 Q ss_pred CCC-CCCCCCccccceeEeecceeeccchhhhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccc Q lcl|Aclame:pro 70 PGQ-SPNATPTQADKNQLVIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERN 148 (402) Q Consensus 70 ~G~-~i~~~~~~~~e~~itID~~lya~~~IddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~ 148 (402) -|. .|..+.+.+.+..-+|=.. --.+.+.|+-...+--| .-.++.++.+...++..+..++..+ ++.......... T Consensus 71 dg~~~i~~~ki~t~~~~a~i~~~-~k~~~~tD~a~~~~g~d-p~~~i~~q~a~~w~~~~q~~lla~l-~gvf~~~~~~~~ 147 (330) T protein:vir:10 71 NGDKALETGKITAGADIACVLYR-GRGWAANELTGVVAGSD-PVRAILNRIGAYWLREDQKALIATL-NGIFATGTAGEK 147 (330) T ss_pred CCccccchhhcccceeEEEEEee-cceeeehhhhhhhcchh-HHHHHHHHHHHHhhhhHHHHHHHHH-Hhhhhhhhcccc Confidence 453 6777777776665555432 23467888887776666 6778999999999997777666544 333221111100 Q ss_pred cccccccccccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccccCcccc Q lcl|Aclame:pro 149 KPRVKGHGFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATI 228 (402) Q Consensus 149 ~~~~~g~~~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~~~ 228 (402) ... ....... ..++....+ ++.|.+|..+|.++. ..-..++|.|..|..|.+. ++++.. ...+ . 
T Consensus 148 ~~~--~~~~~~~--~~~~~a~~s----~~~l~~A~~~~GD~~--~~~~~ivmhS~v~~~L~~~-~li~~~--~~s~---~ 211 (330) T protein:vir:10 148 GAL--EETHVSD--QSKASTGID----AGMVLDAKQLLGDSA--DQVTAIAMHSAVYTKLQKD-NLIQYI--QPTT---A 211 (330) T ss_pred hhh--hhhheec--ccccccccC----HHHHHHHHHHhcccc--ccceEEEEcHHHHHHHHHh-hhhhhh--cccc---c Confidence 000 0000001 111122223 456777777885543 3456899999999999874 455432 1121 2 Q ss_pred cceEEEEeccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcc---cceeeccchhHH Q lcl|Aclame:pro 229 NGFVLSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIE---VTGDIFYEKKEK 305 (402) Q Consensus 229 ~G~V~~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~d---l~~e~~~d~~~~ 305 (402) ++.|+.++|.+|+.|..+|...+ .| ..++|-+-|++..+..+ +..|..|++... T Consensus 212 ~~~i~~~~G~~VivdD~~p~~~~----------------~y-------t~yl~~~GAi~~~~~~~~~~v~~EtdRd~~~g 268 (330) T protein:vir:10 212 TINIPTYLGYRVIIDDGIAPTGD----------------IY-------TSYLFRTGSIGLNTGNPSGLTTFETSREAAKG 268 (330) T ss_pred CcccccccceEEEEeCCCCCCCC----------------ce-------eEEEEecCceeeecccCCccccccccCCcccc Confidence 46789999999999999985321 11 12455555666555332 567888888887 Q ss_pred HHHHHHHHHhcCcccccceEEEEEEeecc-C-ccccccchhhHH------HhhhcccceEEEeecchhh Q lcl|Aclame:pro 306 TYYIDTFMAEGAIPDRWEAVSVVTTKRDA-T-TGDAGGPGDDHA------TVLARAQRKAVYVKTEGAA 366 (402) Q Consensus 306 ~d~i~~~~a~Ga~vlRPeaa~vv~~~~~~-t-~~~a~~~~~~~~------~~~~~~~~~~~~~~~~~~~ 366 (402) .+.+...+-|...++ -++.+..+ + .+..|+. 
++.+ .|--+-+=..+..+.-+.- T Consensus 269 ~~~l~~r~~~~~hp~------G~s~~~~~~~~~~~sPt~-~~L~~~~NW~~v~~~k~i~iv~~~~~~~~ 330 (330) T protein:vir:10 269 NDMIYTRRALVMHPY------GVKWTGAEVDAGNITPSN-ADLAKFKNWKRVYEPKNIGIIALKHKIGK 330 (330) T ss_pred ceEEEEeeEEEeeee------eeeecccccccCcCCcCh-HHhcCCcCcccccChhhcceEEEEEecCC Confidence 777777765554432 23333221 1 1222332 2211 1111111112222221111 No 153 >protein:vir:4092 Length: 390 # NCBI annotation: major capsid protein a # Family: family:all:635 # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510986;swissprot:trembl:q8w604;genbank:gi:17488508;uniprot:Q8W604;genbank:GeneID:1260361 Probab=98.71 E-value=9.3e-10 Score=70.10 Aligned_cols=308 Identities=10% Similarity=-0.078 Sum_probs=147.6 Q ss_pred CCCC-ccccccc-----ccccccHHHHHH-HHHhHHHHHHHHHHhhhcccceeeeccccceEEeee-ccceeeeeecCCC Q lcl|Aclame:pro 1 MSTP-NTLTNVA-----VSASGEVDSLLI-EKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKY-LGETELQVLAPGQ 72 (402) Q Consensus 1 Ms~~-n~~t~~~-----~~~~~d~~alfl-e~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~~-iG~~t~~~~~~G~ 72 (402) +... ..--|-. ..++..+-...| +.|..++++...+.+.+++++++.++.+|. ..||+ .+...+....-|. T Consensus 68 ~~~~l~~~~r~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~~~~~s~i~~~~~~~~~~~~~-~~i~~~~~~~~a~~~~E~~ 146 (390) T protein:vir:40 68 GANALTSDESKYYNEVIAGNGFAGVTALLPPTVFERVFEDLTVEHPLLSKINFVNTTATT-EWIISVGDVATAWWGPLCA 146 (390) T ss_pred CchhccHHHHHHHHHHHhccCcccCcccccHHHHHHHHHHHHhhhhhhhhceeeecCCce-eEEEEEcCCcceeeecccc Confidence 0000 0000000 001111122344 899999999999999999999988876654 45665 4555666666555 Q ss_pred CCCC-CCccccceeEeecceeeccchhh--hHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccc Q lcl|Aclame:pro 73 SPNA-TPTQADKNQLVIDTTVIARNTVA--HIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNK 149 (402) Q Consensus 73 ~i~~-~~~~~~e~~itID~~lya~~~Id--dlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~ 149 (402) ++.. ..+..++.++..-.+ +.-..|. -++ ++.+| +-+.+.+++++++++..|+.++. +.....|.-.-. 
T Consensus 147 ~~~~~~~~~f~~i~l~~~k~-~~~i~iS~ell~--ds~~~-l~~~i~~~la~~i~~~~~~a~l~----G~G~~~P~Gil~ 218 (390) T protein:vir:40 147 EIKEVLDNGFDKIQTGMYKL-SAYIPVCNAMLD--LGPSW-LDQYVRTILGEAMALGLEAGIVN----GSGKDQPIGMMR 218 (390) T ss_pred ccCccccccceeeEeeeeeE-EEeehhhHHHHh--cchHH-HHHHHHHHHHHHHHHHHHhhhhc----ccCCCccceeee Confidence 5543 356666766666543 2222222 233 24445 67889999999999999998863 111111100000 Q ss_pred ccccccccccccccCCccccccHHHHHHHHHHHHHHHHhhcCCc-cCcEEEeChHHHHHHhcccchhhcccccccCcccc Q lcl|Aclame:pro 150 PRVKGHGFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDI-SDVAIMMPWKFFNALRDADRIVDKTYTISQSGATI 228 (402) Q Consensus 150 ~~~~g~~~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~-~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~~~ 228 (402) ...+...+. ..... ....+...+++.+..+...+....-+. ..-+.+++|..|..+++..+.+ .| .+|.+. T Consensus 219 -~~~~~~~~~-~~~~~-~~~~t~~~~~~~~~~l~~~~~~~~~~~~~~a~~i~n~~t~~~~l~~~~~~-~d----~~G~~v 290 (390) T protein:vir:40 219 -DLNNVTAGE-HPVKT-ATPLTDLTPATLATKVMLPLTDNGKKSVSDAILVINPADYWSKIYAATSY-MT----PQGVWV 290 (390) T ss_pred -ccccccccc-ccccc-ccccchhhHHHHHHHHHHHhhcchhhhhcCceEEEcchhHHHHHHHHhhc-cC----CCCccc Confidence 000000000 00000 111122223333333333333322221 2233578887766555533322 11 222222 Q ss_pred cceEEEEeccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccchhHH--- Q lcl|Aclame:pro 229 NGFVLSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKEK--- 305 (402) Q Consensus 229 ~G~V~~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~~~--- 305 (402) .+. ...|.+|+.|+++|... . +-+||+.. +++.+ .+++.+... 
+..| T Consensus 291 ~~~--~~~g~pvv~~~~~p~~~------i-----------~~Gd~s~~--~i~~~--------~~~~v~~~~-~~~f~~~ 340 (390) T protein:vir:40 291 TGI--LPVPLEIVQSVAVPVGK------A-----------VAGRAKDY--FMGIG--------SEQVIRTST-EYRLLDD 340 (390) T ss_pred ccc--CCCceeEEEcCCCCCCc------E-----------EEEeeceE--EEEee--------cceEEEecc-hhhhhcC Confidence 221 23699999999999532 1 22555542 22222 233333332 2222 Q ss_pred HHHHHHHHHhcCcccccceEEEEEEeeccCccccccchhhHHHhhhcccceEEEeecc Q lcl|Aclame:pro 306 TYYIDTFMAEGAIPDRWEAVSVVTTKRDATTGDAGGPGDDHATVLARAQRKAVYVKTE 363 (402) Q Consensus 306 ~d~i~~~~a~Ga~vlRPeaa~vv~~~~~~t~~~a~~~~~~~~~~~~~~~~~~~~~~~~ 363 (402) ...+++.+-++.++++|++.++++++.-......+...+-+.. -+. ++++ T Consensus 341 ~~~~r~~~r~dg~v~~~~A~~~l~~~~~~~~~~~~~~~~~~~~---~~~-----~~~~ 390 (390) T protein:vir:40 341 ETLYYAKQYANGRPKDNSSFLVFDITGLEGSPAIDVNVVNNAT---PSE-----TPAE 390 (390) T ss_pred cEEEEEEEEeCCEEecccceEEEEeeccCCCCCCCcceeeCCC---CCC-----CCCC Confidence 2345677789999999999999988754322222211110000 000 1111 No 154 >protein:vir:1383 Length: 421 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612835;genbank:gi:20065969;genbank:GeneID:935826 Probab=98.68 E-value=1.2e-09 Score=69.43 Aligned_cols=297 Identities=10% Similarity=0.000 Sum_probs=154.7 Q ss_pred CCCCc-ccccccccccccHHHHHH-HHHhHHHHHHHHHHhhhcccceeeeccccceEEeeeccce---eeeeecCCCCCC Q lcl|Aclame:pro 1 MSTPN-TLTNVAVSASGEVDSLLI-EKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYLGET---ELQVLAPGQSPN 75 (402) Q Consensus 1 Ms~~n-~~t~~~~~~~~d~~alfl-e~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~~iG~~---t~~~~~~G~~i~ 75 (402) +.... .-.|.+...+ .-...| +.|..++.......+.+++++++.++.++ +.++++.... .+..+.-|..+. 
T Consensus 105 ~~~~~~~~~ra~~t~~--~gg~liP~~~~~~Ii~~~~~~~~l~~l~~~~~~~~~-~~~~~~~~~~~~~~~~~~~E~~~~~ 181 (421) T protein:vir:13 105 RGIQLSEEERDIMSST--NNGAVIPQEFVNEFEKLKEGYPSLKEHCHVIPVNRN-AGKMPVRAGASVDKLANLAKDTELV 181 (421) T ss_pred hccchhHHHhhccccC--CcceecchhhHHHHHHHHHhhhhhhhhceeeeccCC-ceEEEEeecCCccceeecccccccc Confidence 11110 0111111111 111123 78888998888889999999988887654 5566644222 244566666766 Q ss_pred CCCccccceeEeecceeeccchhh--hHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccccccc Q lcl|Aclame:pro 76 ATPTQADKNQLVIDTTVIARNTVA--HIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVK 153 (402) Q Consensus 76 ~~~~~~~e~~itID~~lya~~~Id--dlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~ 153 (402) ...+...+.++.+..+- .-..|. -+++ +.+| +.+.+.++++++++++.|+.++.++. + T Consensus 182 ~s~~~f~~i~~~~~k~~-~~v~iS~ell~d--s~~~-l~~~i~~~la~~~~~~~~~~i~~~~~-g--------------- 241 (421) T protein:vir:13 182 KAMLKTQPMAYDIDDYG-LLAPIDNSLLED--SEIN-FLEFVNEEFAEFAVNTENAEIVKQAK-A--------------- 241 (421) T ss_pred ccccceeEEEeeeeeeE-eehhhhHHHHhh--hHHH-HHHHHHHHHHHHHHHHhhhhHhhhhh-h--------------- Confidence 66677777777766532 122222 2222 3455 78889999999999999988864331 0 Q ss_pred ccccccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhc--c--cchhhcccccccCccccc Q lcl|Aclame:pro 154 GHGFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRD--A--DRIVDKTYTISQSGATIN 229 (402) Q Consensus 154 g~~~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~--~--~r~~n~d~~~~~~g~~~~ 229 (402) .. ...+ ..+ |+.|.++...|...+.+. . .+|++|..|..|.. 
| .+++..+ ..+ T Consensus 242 -~~-----~~~~---~~~----~d~i~~~~~~l~~~~~~~-a-~~v~n~~~~~~l~~lkd~~G~~i~~~--------~~~ 298 (421) T protein:vir:13 242 -VL-----AEET---IND----YAGLVKTINSLVPNARKR-A-IIVTNSDGRAYLDGLMDKQGRPLLKE--------LSD 298 (421) T ss_pred -cc-----cccc---ccc----hHHHHHHHHHhhhhhcCC-C-EEEEcHHHHHHHHHhhcCCCceeecC--------cCC Confidence 00 0001 111 566777777887766553 3 45889999999864 2 2222111 123 Q ss_pred ceEEEEeccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccchhHH--HH Q lcl|Aclame:pro 230 GFVLSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKEK--TY 307 (402) Q Consensus 230 G~V~~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~~~--~d 307 (402) |.-.+++|.||+.++++|...++. ..=+-+||++..-+. .-.+++.+..++.... .. T Consensus 299 ~~~~tl~G~pV~~~~~~~~~~~~~------------~~~~~gd~~~~~~~~---------~~~~~~v~~~~~~~f~~~~~ 357 (421) T protein:vir:13 299 GGDLVFKGRPVIELEESIFDVGDE------------TKFIVSDFKTLIKFM---------DRKQYLIDQSKEAGYTKNET 357 (421) T ss_pred CCCceecceeeEEeccccccCCCc------------eEEEEEeccccEEEE---------EecceEEEeecccccccCee Confidence 445689999999999998543211 111345555532222 2234445554443221 13 Q ss_pred HHHHHHHhcCcccccceEEEEEEee-------ccCccccccchhhHHHhhhcccceEEEeecchhhhhhhh Q lcl|Aclame:pro 308 YIDTFMAEGAIPDRWEAVSVVTTKR-------DATTGDAGGPGDDHATVLARAQRKAVYVKTEGAAAAFSA 371 (402) Q Consensus 308 ~i~~~~a~Ga~vlRPeaa~vv~~~~-------~~t~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 371 (402) .+++.+=|+..+++|+++.++.... 
..+++.++..+..+.+ - ||-+-++++ ++++-- T Consensus 358 ~~r~~~r~d~~~~~~~a~~~~~~~~~~a~v~~~~~~~~~~~~~~~~~~----~-~~~~~~~~~--~~~~~~ 421 (421) T protein:vir:13 358 IARIIERFDVNSPLDKSSDAEKIRKFGVIVKLQEVLKSSPRSGKNKNE----S-KEEIKEEGE--ATQQNE 421 (421) T ss_pred EEEEEeeecceeecchhhheeeecccceeeccccccCCCCcCCCCccc----c-chheeeccc--cccCCC Confidence 4566677889999999865544321 1111111111111111 1 111111111 111100 No 155 >protein:vir:93696 Length: 364 # NCBI annotation: Bcep22gp55 # Family: family:all:974 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944284;genbank:gi:38640361;genbank:GeneID:2658350 Probab=98.64 E-value=6e-09 Score=65.66 Aligned_cols=303 Identities=12% Similarity=0.086 Sum_probs=163.1 Q ss_pred CCCCcccccccccccccHHHHHHHHHhHHHHHHHHHHhhhcccc----------eeeecc--ccceEEeeeccceeeeee Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYF----------DVQTVT--GTNTVSNKYLGETELQVL 68 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~alfle~f~geV~t~f~~~sv~~~~~----------~~rti~--~Gksv~f~~iG~~t~~~~ 68 (402) |+.-+ .+ .+|+.+ .++|+..+.+.-.+.+-|.... +...++ .|++|+|+.+...+-... T Consensus 1 Ma~T~----~~---~~~p~a--~~~ws~~l~~~~~~~s~f~~~l~G~~~~~~I~~~~dL~k~~Gd~v~f~L~~~L~g~gv 71 (364) T protein:vir:93 1 MSQTV----IP---FGDPKA--VKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDRITFDLSVHLRGKPT 71 (364) T ss_pred Cceec----cC---cCCHHH--HHHHHHHHHHHHHhhCccccccccCCCCCcEEEeeecCCCCCceEEeeeeeecccCCc Confidence 87433 22 244444 4999999999998887666521 122232 499999999999887777 Q ss_pred cCCCCCCC--CCccccceeEeecceeeccchhh---hHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcc Q lcl|Aclame:pro 69 APGQSPNA--TPTQADKNQLVIDTTVIARNTVA---HIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANT 143 (402) Q Consensus 69 ~~G~~i~~--~~~~~~e~~itID~~lya~~~Id---dlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a 143 (402) .-++.+.+ +.++.....|+||+ .++.|+ .+++=...+| +|++-...++.-+++..|+.+|..+.- ++-.. 
T Consensus 72 ~Gd~~leGnee~L~~~~~~i~idq---~r~~V~~~g~ms~qRt~~d-lr~~ar~~L~~w~~~~~d~~~f~~laG-arg~~ 146 (364) T protein:vir:93 72 YGDARVEGKEESLRFYQDEVRIDQ---VRHSVSAGGRMSRKRTVHN-IRRIARDRLGDYFYKFTDELLFIYLSG-ARGIN 146 (364) T ss_pred ccCceeeccccceeEEeeEEEEee---ccccccccCchhhhhhHHH-HHHHHHHHHHHHHHHHHHHHHHHHhhc-ccccc Confidence 76777876 45888899999998 455664 4777778898 899988899999999999999988852 22111 Q ss_pred -cccccc------------ccccccccccccccCCcccccc-HHHH-HHHHHHHHHHHHhhcCCc-------------cC Q lcl|Aclame:pro 144 -KAERNK------------PRVKGHGFSINVNVTESEALAN-PQYV-MAAVEYALEQQLEQEVDI-------------SD 195 (402) Q Consensus 144 -~~~~~~------------~~~~g~~~~~~v~~~~a~~~~~-~~~l-~dai~~a~~~LdekdVP~-------------~g 195 (402) +..... |..+-+..+. ..+.....+ .+.+ ++.|..+...++....+. .+ T Consensus 147 ~~~~~~~~~~~~~~N~v~aPt~~r~~~~~---~at~~~~l~stD~~sl~~id~a~~~a~~~~~~~~~~~~~~Pv~~~g~~ 223 (364) T protein:vir:93 147 LDFIETPDFTGYAGNPLDAPDVDHLLYGG---VATSKASLAATDIMAPLVIEKAVEKAAMMQAENPDVANMVPVSIDGDD 223 (364) T ss_pred cccccccCcccccccccCCCCCCcEEecc---ccCchhhccccccccHHHHHHHHHHHHHhCCCCCCCcccceeEecCcc Confidence 000000 0000000000 001111111 1111 556666666666543211 23 Q ss_pred c-EEEeChHHHHHHhc--ccchhhccc-----ccccCcccccceEEEEeccEEEecCccccccCccccccccccCCcccc Q lcl|Aclame:pro 196 V-AIMMPWKFFNALRD--ADRIVDKTY-----TISQSGATINGFVLSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRY 267 (402) Q Consensus 196 R-~~VV~P~~y~~Ll~--~~r~~n~d~-----~~~~~g~~~~G~V~~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~ 267 (402) . ++++.|.+++.|.. ++.+.+-.- .+..++ +..|.++.++|+-|++.++++......++ . 
T Consensus 224 ~yV~~l~p~q~~~Lr~~t~~~w~d~qk~A~~~~g~~nP-lF~G~~gm~ngvii~~~~~vi~~~~~~~~-----------~ 291 (364) T protein:vir:93 224 HYVCVMSEYQATDMRTAAGGTWIDFQKAAAAAEGRNNP-IFKGGLGMINNVVLHKHRNVIRFNDYGAG-----------A 291 (364) T ss_pred eeEEEEcchhhhhhhhcCCHHHHHHHHHhhhcccccCC-ceecCeeeEcCeEEeccCCcccccccccC-----------c Confidence 3 58899999999985 334332111 112233 45799999999999999999876533211 1 Q ss_pred ceeeeccceeEEeecHHHhhhh--hhcc----cceeeccchhHHHHHHHHHHHhcCcccccce--EEEEEEeeccCcccc Q lcl|Aclame:pro 268 DPIAEMNGAVAVLFTSDALLVG--RTIE----VTGDIFYEKKEKTYYIDTFMAEGAIPDRWEA--VSVVTTKRDATTGDA 339 (402) Q Consensus 268 ~~~ad~~~~~al~fh~~Av~tv--~~~d----l~~e~~~d~~~~~d~i~~~~a~Ga~vlRPea--a~vv~~~~~~t~~~a 339 (402) ++.. .++|++-..|++.+ +.-. +.-|.+.-.++++ |-....+|.+=.|-+- -|+|.+ T Consensus 292 ~v~~----~ralllGaQA~~~a~g~~~g~~~~w~Ee~~D~gn~~~--i~~~~i~G~kK~rF~~~DfGvi~i--------- 356 (364) T protein:vir:93 292 NVEA----ARALFMGRQAGVIAYGTANGLRFDWEETVKDYGNEPA--IAAGFIAGMKKARFNNKDFGVISI--------- 356 (364) T ss_pred cccc----hhhheecceeeEEEeecCCCCCceeeecccCCCCchh--hhhhhHhhhhhcccCCccceEEEe--------- Confidence 2211 12344444443322 2111 1112222222221 2223334444444320 011111 Q ss_pred ccchhhHH Q lcl|Aclame:pro 340 GGPGDDHA 347 (402) Q Consensus 340 ~~~~~~~~ 347 (402) .+.+.-|. 
T Consensus 357 dtaa~~~~ 364 (364) T protein:vir:93 357 DTAAKKHS 364 (364) T ss_pred cccccccC Confidence 01111111 No 156 >protein:vir:105610 Length: 430 # NCBI annotation: virion structural protein # Family: family:all:974 # MgeID: mge:1540 # MgeName: F116 # Cross-refs: genbank:acc:YP_164307;genbank:gi:56692923;genbank:GeneID:3197221 Probab=98.62 E-value=9.1e-09 Score=64.67 Aligned_cols=328 Identities=10% Similarity=0.032 Sum_probs=162.9 Q ss_pred CCCCcccccccccccccHHHHHHHHHhHHHHHHHHHHh----hhc----------------------ccceeeecc--cc Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGE----NIL----------------------SYFDVQTVT--GT 52 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~alfle~f~geV~t~f~~~s----v~~----------------------~~~~~rti~--~G 52 (402) |+..-+....+.. +-.++|+.-+.+.-.+.+ +|. ..+++..++ .| T Consensus 1 ~~~a~T~~~~~~p-------~a~~~ws~~l~~~~~k~~~~~~kl~G~~~~~~~~~~~~~~~~ts~~~pI~r~~dL~K~~G 73 (430) T protein:vir:10 1 MTASKTTMRYGDP-------NAMIQQAAGLFALCQGRNSTLNRLTGKMPSGTSDAEKKTKGQSSLELPIVQAQDLGRNKG 73 (430) T ss_pred CcceeeecccCCh-------hHHHHHHHHHHHHHhhhhhhHHHhhccccccccchhhhccCCCCCCccEEEeccCCCCCc Confidence 8776554443322 223445544444443321 112 245555563 59 Q ss_pred ceEEeeeccceeeeeecCCCCCCC--CCccccceeEeecceeeccchhh---hHHHhhcCccchhHHHHHHHHHHHHHHH Q lcl|Aclame:pro 53 NTVSNKYLGETELQVLAPGQSPNA--TPTQADKNQLVIDTTVIARNTVA---HIHDVQGDIDSLKPKLAMNQAKQLKRLE 127 (402) Q Consensus 53 ksv~f~~iG~~t~~~~~~G~~i~~--~~~~~~e~~itID~~lya~~~Id---dlDe~q~~~D~vrse~s~~~G~aLA~~~ 127 (402) ++|.|+.+-..+-....-++.+.+ +.++.....|+||++ ++.|+ .+++=...+| +|++--..++.-+++.. 
T Consensus 74 D~Vtf~L~~~L~g~gv~Gd~~lEGnee~L~~~~d~l~IDq~---R~~V~~gg~msqQRt~~d-lR~~ar~~L~~w~~~~~ 149 (430) T protein:vir:10 74 DEVRFHFVQPANAFPIMGSEYAEGKGTGLKIGSDQLRVNQA---RFPVDLGDVMSQIRNPYD-LRRLGRPKAKWFMDAYL 149 (430) T ss_pred cEEEEeEeeccccCceecCceeeccccceEEEeeEEEEeee---ccccccCCchhhhhhhhH-HHHHHHHHHHHHHHHHH Confidence 999999998888877777778876 458888999999995 44543 4455556788 89988888999999999 Q ss_pred HHHHHHHHHhhhhh---------------------cccccccccccc-----cccc-ccccccCCccccccHHHH-HHHH Q lcl|Aclame:pro 128 DQMAIQQMLLGGIA---------------------NTKAERNKPRVK-----GHGF-SINVNVTESEALANPQYV-MAAV 179 (402) Q Consensus 128 Dq~i~~~l~kaA~~---------------------~a~~~~~~~~~~-----g~~~-~~~v~~~~a~~~~~~~~l-~dai 179 (402) ||.+|..|. +||- ..+.. .|..+ ++.. ......++.....+.+.+ ++.| T Consensus 150 Dq~~~v~la-Garg~~~~~~~~~~~~~~~~~~~~~~N~v~--aPt~nrh~~~~G~at~~~~~~~~~~sl~stD~~s~~~i 226 (430) T protein:vir:10 150 DQSMLVHLA-GARGNHYNKEWCLPLETHPKLADMLVNRVK--APTKNRHFVASADAITGVAPNAGEYNITTADVLDVDVV 226 (430) T ss_pred HHHHHHHHh-hhhcccccccccccccCCcchhhhhccccC--CCCCceeEeecccccccccccccccchhhhcccCHHHH Confidence 999999885 2211 01100 01111 1100 000000000011111222 5566 Q ss_pred HHHHHHHHhhcCCc-------cC-------cEEEeChHHHHHHhcccchhh----cc-cc--cccCcccccceEEEEecc Q lcl|Aclame:pro 180 EYALEQQLEQEVDI-------SD-------VAIMMPWKFFNALRDADRIVD----KT-YT--ISQSGATINGFVLSSYNC 238 (402) Q Consensus 180 ~~a~~~LdekdVP~-------~g-------R~~VV~P~~y~~Ll~~~r~~n----~d-~~--~~~~g~~~~G~V~~iaG~ 238 (402) ..+...+++.+.|- +. ++++++|.||..|..++.+.+ +- .. +..++. 
..|.++.++|+ T Consensus 227 d~a~~~a~~~~~~i~Pv~v~gd~~~g~~~~yV~~~~p~q~~~Lr~dt~~~~wq~~~~a~a~~g~~nPl-F~G~~gm~ngv 305 (430) T protein:vir:10 227 DSIATYMDQIELPPPPVKFEGDEAAEDSPIRVLLCSPAQYNSFAKQEKFRSWQAAALARASNAKQHPI-FRVDAGLWSNT 305 (430) T ss_pred HHHHHHHHhhCCCCcceEeecccccCCccEEEEEechHHHHHHhhCcchHHHHHHHHHhhcccccCCc-eecceeeecCe Confidence 67777888776442 22 678999999999999988742 11 11 112444 57999999999 Q ss_pred EEEecCccc-cccCccccccccccCCcc--c---cceeeeccceeEEeecHHHhhhhhhc--------ccceeeccchhH Q lcl|Aclame:pro 239 PVIPSNRFP-TFAQDQAHHLLSNEDNGY--R---YDPIAEMNGAVAVLFTSDALLVGRTI--------EVTGDIFYEKKE 304 (402) Q Consensus 239 ~V~~SNnlP-~~~~~~t~~~ls~a~~G~--~---~~~~ad~~~~~al~fh~~Av~tv~~~--------dl~~e~~~d~~~ 304 (402) -|+|-.+.= +-.++...+..++..+.. . -.+....+-..+|++-..|++.+-.. .+.-|.+.-.++ T Consensus 306 ii~~~~~virf~~g~~~~~~a~~~~~~~~~~~~~a~~~~~~~v~RalllGaQA~~~A~g~~~~~g~~f~w~Ee~~D~g~~ 385 (430) T protein:vir:10 306 LIIKMPKPIRFYAGDTIKYCAAYNSEAESSAVVSDSFGNQYAVDRALLLGGQALAQAWAASEHSGMPFFWSEKDMDHGDK 385 (430) T ss_pred EEecCCceeeecCCCccccccCCcccccccccccccccccccchhhhhccchhheeeeeccCCCCcceeeeeeccccCch Confidence 999876431 111111111000000000 0 00111222224555555544432221 112222222222 Q ss_pred HHHHHHHHHHhcCcccccceEEEEEEeeccCccc------cccchhhHHHhhhcccce Q lcl|Aclame:pro 305 KTYYIDTFMAEGAIPDRWEAVSVVTTKRDATTGD------AGGPGDDHATVLARAQRK 356 (402) Q Consensus 305 ~~d~i~~~~a~Ga~vlRPeaa~vv~~~~~~t~~~------a~~~~~~~~~~~~~~~~~ 356 (402) ++ |-....+|.+=.|-. 
...++ =+.-+-++++-+--+ || T Consensus 386 ~~--i~~~~i~G~kK~rF~----------~~~~~~~~~~DfGvi~idtaa~~~~~-~~ 430 (430) T protein:vir:10 386 LE--LLIGAILGCSKIRFA----------VEATNGLEYTDHGVMAIDTAVKIIGP-RK 430 (430) T ss_pred hh--hhhhHHhccceeeec----------CCCCCCceeeeeEEEEhhhhhhhhcC-CC Confidence 22 222333454444331 11110 011111222211112 22 No 157 >protein:vir:1583 Length: 351 # NCBI annotation: minor capsid protein # Family: family:all:1522 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695165;swissprot:trembl:o03966;genbank:gi:23455804;uniprot:O03966;genbank:GeneID:955561 Probab=98.53 E-value=3.4e-09 Score=67.04 Aligned_cols=325 Identities=10% Similarity=0.072 Sum_probs=172.2 Q ss_pred CCCCcccccccccccccHHHHHH-HHHhHHHHHHHHHHhhhcc---cce---eee-c-cccceEEeeeccce--eeeeec Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDSLLI-EKFNGKVNEQYLKGENILS---YFD---VQT-V-TGTNTVSNKYLGET--ELQVLA 69 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~alfl-e~f~geV~t~f~~~sv~~~---~~~---~rt-i-~~Gksv~f~~iG~~--t~~~~~ 69 (402) |+. .---++|+ |+|...|.+.+.+.+.|.. +++ +.. + .+|+++.||..+.+ ..+.+. T Consensus 1 MA~------------T~lsd~i~PEvf~~yv~~~~~~~~~l~qSG~i~~~~~l~~~~~~~G~~it~P~~~~l~Gd~~~~~ 68 (351) T protein:vir:15 1 MAE------------THLSDLIVPEVFGNYVVNQIIKTNRFVQSGILTPDPDLGPHLLEAGTRITVPFLNDLTGDPDNWT 68 (351) T ss_pred CCc------------eeeeeeechhHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhcCCCEEEecccccCCCcccccC Confidence 662 11135666 8999999888888777643 122 222 1 36999999988776 577888 Q ss_pred CCCCCCCCCccccceeEeecceeeccchhhhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccc Q lcl|Aclame:pro 70 PGQSPNATPTQADKNQLVIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNK 149 (402) Q Consensus 70 ~G~~i~~~~~~~~e~~itID~~lya~~~IddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~ 149 (402) -|+.|..+.+...+..-+|=..- -.+.+.|+....+--| .-.+++++.+.+.++..|..++..| +++..... 
T Consensus 69 ~~~~i~~~kitt~~~~a~i~~~~-kg~~~tD~a~~~sg~d-p~~~i~~q~a~~w~~~~q~~lla~l-~gv~~~~~----- 140 (351) T protein:vir:15 69 DSDDIDVNNLTSGKQQGIKFYQT-KAYGYTDLGTMISGAP-VQETIGNRFAAFWQRADQKTLLSVL-KGVMGVTK----- 140 (351) T ss_pred CCcccchheecccceeEEEEeec-cceehhhhhHhhccch-HHHHHHHHHHHHHHHHHHHHHHHHH-HHHhhchh----- Confidence 88999999988888777774332 3377888877776656 6778999999999998888777665 44322211 Q ss_pred ccccccccccccccCCccccccHHHHHHHHHHHHHHHHh-hcCCccCcEEEeChHHHHHHhcccchhhcccccccCcccc Q lcl|Aclame:pro 150 PRVKGHGFSINVNVTESEALANPQYVMAAVEYALEQQLE-QEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATI 228 (402) Q Consensus 150 ~~~~g~~~~~~v~~~~a~~~~~~~~l~dai~~a~~~Lde-kdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~~~ 228 (402) ...++....+.. .+.....+ ++.|.+|..+|-+ ++ ..-..++|.|..|..|.+.. +++ |-...+ . T Consensus 141 -~~~~~~~d~t~~-~~~~~~is----~~~l~~A~~~~GD~~~--~~~~~ivmhS~v~~~L~~~~-li~--~~~~s~---~ 206 (351) T protein:vir:15 141 -IANSKVYDQTKV-SPSEPMFG----AKGFTGAIGLMGDLQD--TAFGAIAVNSATYSLMKVQG-LIE--TIQPQN---G 206 (351) T ss_pred -hcccceeccccc-cccccccC----HHHHHHHHHHhccccc--cceEEEEEChHHHHHHHhhh-hhh--hccccc---c Confidence 111111111110 11111222 4667888888844 33 12367889999999998763 442 222222 2 Q ss_pred cceEEEEeccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccchhHH--H Q lcl|Aclame:pro 229 NGFVLSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKEK--T 306 (402) Q Consensus 229 ~G~V~~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~~~--~ 306 (402) ++.|+.+.|.+|+.+..+|...... ++. . ...++|-+-|++..+.. +..|..|++... 
- T Consensus 207 ~~~i~t~~G~~VivdD~~p~~~~~~---------~~~--~-------ytsyl~~~GAi~~~~~~-~~ve~~rd~~~~~g~ 267 (351) T protein:vir:15 207 ATPFEAYNGLRIVLDDDIEIDLTDK---------TKP--V-------STSYIFAPGAVRYSTNM-RSTETKYDPLINGGQ 267 (351) T ss_pred CcccceecceEEEEcCCCccccCCC---------CCc--e-------eEEEEEecceeeeecCC-cCcceeecccCCCCc Confidence 4678999999999999999643211 111 1 12355556666654443 356777776542 2 Q ss_pred HHHHHHHHhcCcccccceEEEEEEeecc--CccccccchhhHHHhhhcccceEE--Eeecchhhhhh-hhcccccchhHH Q lcl|Aclame:pro 307 YYIDTFMAEGAIPDRWEAVSVVTTKRDA--TTGDAGGPGDDHATVLARAQRKAV--YVKTEGAAAAF-SAAPAGIQAEDL 381 (402) Q Consensus 307 d~i~~~~a~Ga~vlRPeaa~vv~~~~~~--t~~~a~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~-~~~~~~~~~~~~ 381 (402) |.+..++.|...+. -++.+..+ +.+..|+. ++++. -+|=+.+ +-+-.++-..+ +-.+. +| T Consensus 268 d~l~~r~~~~~hp~------G~s~~~~~~~~~~~sPt~-~~L~~---~~NW~~v~~~d~k~I~iv~~~~~~~~-----~~ 332 (351) T protein:vir:15 268 DVIVQKRVGTIHVA------GTSIKASFSPSKASFPTI-DELAK---SSTWEVVDGIDVRSIGVVAYTAQLDP-----AL 332 (351) T ss_pred eEEEEeeeeeeeee------eeeecccccccCcCCcCh-HHhcC---CcccccccCCCccccceEEEEEecCc-----cc Confidence 44333333332221 12222111 11112222 11111 1221111 11111111111 00000 11 Q ss_pred HHHHHHHHhhcccccccCCCC Q lcl|Aclame:pro 382 VAAVRAVMANDIKPTAMKPTE 402 (402) Q Consensus 382 ~~~~~~~~~~~~~~~~~~~~~ 402 (402) +- -++++--||. 
T Consensus 333 ~~---------~~~~~~~~~~ 344 (351) T protein:vir:15 333 TP---------GAQMPAADTS 344 (351) T ss_pred cc---------CCcCcCCCCc Confidence 10 0233333333 No 158 >protein:vir:3158 Length: 321 # NCBI annotation: capsid protein gpE # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665929;genbank:gi:22091115;genbank:GeneID:951342 Probab=98.53 E-value=2.8e-09 Score=67.50 Aligned_cols=303 Identities=10% Similarity=0.057 Sum_probs=144.6 Q ss_pred CCCCc---ccccccc-c--ccccHHHHHH--HHHhHHHHHHHHHHhhhcccceeeeccccceEEeeeccceeeeeec--C Q lcl|Aclame:pro 1 MSTPN---TLTNVAV-S--ASGEVDSLLI--EKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYLGETELQVLA--P 70 (402) Q Consensus 1 Ms~~n---~~t~~~~-~--~~~d~~alfl--e~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~~iG~~t~~~~~--~ 70 (402) ||.-. .+.+... + +.++...-|+ ..+..+++...++.+.++++.++.++++. +-+|+.+|-......+ . T Consensus 1 ~~~k~~~~~l~~~~~~~~~~~~~~~~g~~v~~~~~~~l~~~i~e~s~~l~~i~v~~v~~~-~~~i~~~~~~~~~~~~~~e 79 (321) T protein:vir:31 1 MASRTINNDLSRITEKNALTVDDLDAGGTLPDPLWDEFWTDMIEETPLLDAIRTETVGAK-KTRIPTLNIGERHRRPQDE 79 (321) T ss_pred CchHHHHHHHHHHHHhccccccccCCcceeCHHHHHHHHHHHHHhhhhhhhceeeeccCc-ceeeeeeccCCcccccccc Confidence 66542 2222221 1 1122222232 77888888999999999999998887653 3567766532211111 1 Q ss_pred CC-CCCCCCccccceeEeeccee-eccchhhhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccc Q lcl|Aclame:pro 71 GQ-SPNATPTQADKNQLVIDTTV-IARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERN 148 (402) Q Consensus 71 G~-~i~~~~~~~~e~~itID~~l-ya~~~IddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~ 148 (402) |+ +.....+..++..+..-++- .....-+-||+|....| +.+.+...+++++++..++.++.= -....++.. 
T Consensus 80 ~~~~~~~~~~~~~~~~~~~~k~~~~~~it~e~L~d~a~~~d-~e~~i~~~ia~~~a~~~~~~~~nG----d~~~~~~~~- 153 (321) T protein:vir:31 80 GEWNENESDVSTGTIDISTEKATVAWDLPREVVQENPEGEA-LADRILNLMTDAWSADVEDLAANG----DEDAEDSFE- 153 (321) T ss_pred cccccccccceeeeeeeeeEEEEeehhccHHHHHhhhcchh-HHHHHHHHHHHHHHHHHHhheeec----cccCCCccc- Confidence 21 12222334444444333222 12223356777765556 788999999999999888766521 111111100 Q ss_pred cccccccccccccccCCcc-ccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccccCccc Q lcl|Aclame:pro 149 KPRVKGHGFSINVNVTESE-ALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGAT 227 (402) Q Consensus 149 ~~~~~g~~~~~~v~~~~a~-~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~~ 227 (402) ...+|... ........ .......-++.|.++...|+++.--..+-+++|+++.+..+++ .+.+++- ....... T Consensus 154 -~~n~G~l~--~a~~~~~~~~~~~~~~~~d~l~~l~~~l~~~yr~~~~~v~im~~~~~~~~~~--~l~~~~~-~~~~~~l 227 (321) T protein:vir:31 154 -NQNDGFIT--VAEGDVETIDAADDILDNDLVIRTIAGLDSKYRARMNPALIVSEDQLLSYHY--TLTDRDT-PLGDNVI 227 (321) T ss_pred -ccchhhhh--hhccccccccccccccCHHHHHHHHHhccHhHhcCCCeEEEechHHHHHHHH--HHhcCCC-ccccchh Confidence 00011110 00000000 0001111145677777778776643334456899998766543 2222221 1222234 Q ss_pred ccceEEEEeccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccchhHHH- Q lcl|Aclame:pro 228 INGFVLSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKEKT- 306 (402) Q Consensus 228 ~~G~V~~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~~~~- 306 (402) .+|...++.|+||+.++++|...- .-.++.|.+-.+ ..++..+.+++..... 
T Consensus 228 ~~~~~~tl~G~pvv~~~~mP~~~i-----------------l~t~~~nl~~~~----------~~~~~~~~~~~~~~~~~ 280 (321) T protein:vir:31 228 MGEADVNPFSFPIIGSGLWPDDKA-----------------MFTDPQNLIYAL----------YRDLEIDVLTESDKVSE 280 (321) T ss_pred hccccccccceeEEEcCCCCCCcE-----------------EEeccccEEEEE----------eeccEEEEeecCccccc Confidence 466667899999999999996431 112333322111 1122333333322111 Q ss_pred --HHHHHHH--HhcCcccccceEEEEEEeeccCccccccch Q lcl|Aclame:pro 307 --YYIDTFM--AEGAIPDRWEAVSVVTTKRDATTGDAGGPG 343 (402) Q Consensus 307 --d~i~~~~--a~Ga~vlRPeaa~vv~~~~~~t~~~a~~~~ 343 (402) +.+..++ =++-.|-.+++++.|+=....-+....++. T Consensus 281 ~~~~~~~~~~~~~~~~ve~~~a~a~~~~i~~~~~~~~~~~~ 321 (321) T protein:vir:31 281 RDLHARYFMRGDDDFAIENTEAVVLAEGLGDPLEHLEEETS 321 (321) T ss_pred cceeeEeeeeeecceeEeccccEEEEecCCcchhcccCCCC Confidence 1111111 155567778888777621111111111111 No 159 >protein:vir:962 Length: 397 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076616;genbank:gi:13095724;genbank:GeneID:920264 Probab=98.50 E-value=6.5e-09 Score=65.46 Aligned_cols=272 Identities=9% Similarity=0.027 Sum_probs=136.5 Q ss_pred CCCCcc-ccccc-ccccccHHHHHHHHHhHHHHHHHHHHhhhcccceeeeccccc-eEEeeeccceeeeeecCCCCCC-C Q lcl|Aclame:pro 1 MSTPNT-LTNVA-VSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTN-TVSNKYLGETELQVLAPGQSPN-A 76 (402) Q Consensus 1 Ms~~n~-~t~~~-~~~~~d~~alfle~f~geV~t~f~~~sv~~~~~~~rti~~Gk-sv~f~~iG~~t~~~~~~G~~i~-~ 76 (402) +.+... ..... .....+...+-.+.+..++... .....++.+.++.++.+++ .+.++..+...+..+.-|.... . 
T Consensus 121 ~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~-~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~ 199 (397) T protein:vir:96 121 NAFVKSKGAEKRDGFTSVEGGALIPQELLQPQLEP-KDIVDLSKYVRSVPVNSASGKFPVISKSGSKMATVQQLEKNPQL 199 (397) T ss_pred HHHHHhhhhhhhhcccccccccchhHHHHHHHHHh-hhhhhHHHhhhhccccccceeEEEEeccCCcccccccccccccc Confidence 111000 00000 0011122233346777777764 3333446666666655432 2333344445555555555544 3 Q ss_pred CCccccceeEeeccee-eccchhhhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccccccccc Q lcl|Aclame:pro 77 TPTQADKNQLVIDTTV-IARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKGH 155 (402) Q Consensus 77 ~~~~~~e~~itID~~l-ya~~~IddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~g~ 155 (402) ..+...+.++.+..+- +..+.-..+++ +.+| +.+.+.+.+++++++..|..|+.-. T Consensus 200 ~~~~~~~i~~~~~~~~~~~~~s~ell~d--s~~~-l~~~i~~~l~~~~~~~~~~~i~~g~-------------------- 256 (397) T protein:vir:96 200 ANPKMVEIDYSVATRRGYIPISQEMIDD--ASYD-VTGLIADEIQDQSLNTKNADIAAVL-------------------- 256 (397) T ss_pred ccccccceeecHhHhhcchhhHHHHHhh--hHHH-HHHHHHHHHHHHHHHHHHHHHhhcc-------------------- Confidence 4567777777776531 22222222332 2345 6788889999999999998775211 Q ss_pred ccccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccccCcccccceEEEE Q lcl|Aclame:pro 156 GFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATINGFVLSS 235 (402) Q Consensus 156 ~~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~~~~G~V~~i 235 (402) +.+ .+....+ ||.|.++....... .. .-..|++|..|..|.+-.. 
.+..|-- .+...+|.-.++ T Consensus 257 g~~------~~~~~~~----~d~~~~~~~~~~~~-~~--~a~~v~n~~~~~~l~~lkd-~~G~~~~--~~~~~~~~~~~l 320 (397) T protein:vir:96 257 KTA------TAKSVVG----VDGLKDLINKEIKK-VY--DVKLFISASMYSELDKLKD-KNGRYLL--QDSITAASGKQL 320 (397) T ss_pred ccc------ccccccc----hHHHHHHHHHhhhh-hc--CcEEEEcHHHHHHHHHhhc-cCCCeEe--ccCccCCCcccc Confidence 000 0111112 33444443332221 12 2345999999999865211 1112211 112234555689 Q ss_pred eccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccchhHHHHHHHHHHHh Q lcl|Aclame:pro 236 YNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKEKTYYIDTFMAE 315 (402) Q Consensus 236 aG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~~~~d~i~~~~a~ 315 (402) +|.||+.+++.+.+... |...=+-+||++..-+ +.+ .++..+.. +...+...+++.+=+ T Consensus 321 ~G~pv~~~~~~~~~~~~-----------~~~~~~~gd~~~~~~~-~~~--------~~~~~~~~-~~~~~~~~~~~~~r~ 379 (397) T protein:vir:96 321 LGKEVVVLDDDVIGKSV-----------GNVVGFIGDAKAFASF-FDR--------KQVSVSWV-DNNIYGQLLAGIIRY 379 (397) T ss_pred cccceEEecccccCCCC-----------CceEEEEeehhcceEe-Eee--------cceEEEEe-cccccceeEEEEEEE Confidence 99999998876543211 1111134566553222 211 22222222 223344567788889 Q ss_pred cCcccccceEEEEEEeec Q lcl|Aclame:pro 316 GAIPDRWEAVSVVTTKRD 333 (402) Q Consensus 316 Ga~vlRPeaa~vv~~~~~ 333 (402) |.++++|++.+.|+++.. 
T Consensus 380 d~~~~~~~a~~~~~~~~a 397 (397) T protein:vir:96 380 DVKATDKKAGFYVTFTIG 397 (397) T ss_pred ccEEecccceEEEEeecC Confidence 999999999999887765 No 160 >protein:vir:2770 Length: 318 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:59 # MgeName: Stx2 converting bacteriophage I # Cross-refs: genbank:acc:NP_612887;genbank:gi:20065804;genbank:GeneID:935710 Probab=98.46 E-value=3.7e-08 Score=61.35 Aligned_cols=251 Identities=12% Similarity=0.071 Sum_probs=141.9 Q ss_pred CCCCccccccc--c-----cccccHHHHHHHHHhHHHHHHHHHHhhhcc---------cceeeecc--ccceEEeeeccc Q lcl|Aclame:pro 1 MSTPNTLTNVA--V-----SASGEVDSLLIEKFNGKVNEQYLKGENILS---------YFDVQTVT--GTNTVSNKYLGE 62 (402) Q Consensus 1 Ms~~n~~t~~~--~-----~~~~d~~alfle~f~geV~t~f~~~sv~~~---------~~~~rti~--~Gksv~f~~iG~ 62 (402) |++-.... |+ + ...+-.+. .++.|++.+...-.+.+-++. .+++..++ .|++|.|+.+-. T Consensus 1 mt~~~~~~-~~~~~~~~~ft~~~~~~~-~vk~ws~~l~~~~~~~~~~~~~~g~~~~~~I~r~~dL~K~~GD~Vtf~L~~~ 78 (318) T protein:vir:27 1 MTTVTSAQ-ANKLFQVALFTAANRNRS-MVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHK 78 (318) T ss_pred CCccCCCC-hHHHHHHHHHHHHhcCCh-HHHHHHHhhhhHHHhhhhhhcccCCCCCceEEEeccCCCCCccEEEEeEeec Confidence 77653222 32 1 11121122 578999988776665433332 12223342 599999999988 Q ss_pred eeeeeecCCCCCCC--CCccccceeEeecceeeccchh---hhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 63 TELQVLAPGQSPNA--TPTQADKNQLVIDTTVIARNTV---AHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLL 137 (402) Q Consensus 63 ~t~~~~~~G~~i~~--~~~~~~e~~itID~~lya~~~I---ddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~k 137 (402) .+-....-++.+.+ +.++.....|+||++ ++.| ..+++-...+| +|++--..++..+++..||.+|..|. 
T Consensus 79 L~g~gv~Gd~~lEGnee~L~~~~d~l~IDq~---r~~V~~gg~msqqRt~~d-lR~~ar~~L~~w~~~~~Dq~~~v~la- 153 (318) T protein:vir:27 79 LSKRPTMGDERVEGRGEDLSHADFSLKINQG---RHLVDAGGRMSQQRTKFN-LASSARTLLGTYFNDLQDQCAIVHLA- 153 (318) T ss_pred cccCccccCceeeccccceEEEeeEEEEeee---ccccccccchhhhhhhHH-HHHHHHHHHHHHHHHHHHHHHHHHHh- Confidence 88877777777876 457888899999985 4444 35555556788 89988888999999999999998885 Q ss_pred hhhhc------------cccc-------cccccccccccccccccCCcc-ccccHHHH-HHHHHHHHHHHHhhcCC---- Q lcl|Aclame:pro 138 GGIAN------------TKAE-------RNKPRVKGHGFSINVNVTESE-ALANPQYV-MAAVEYALEQQLEQEVD---- 192 (402) Q Consensus 138 aA~~~------------a~~~-------~~~~~~~g~~~~~~v~~~~a~-~~~~~~~l-~dai~~a~~~LdekdVP---- 192 (402) +++.. ++-. ...|..+-+..+. .++.. ...+.+.+ ++.|-.+...+++..-| T Consensus 154 Garg~~~n~~~~~p~~~~~~~~~~~~N~v~aPt~~r~~~~g---~at~~~~l~stD~~s~~lid~~~~~~~~~a~pi~PV 230 (318) T protein:vir:27 154 GARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGG---DATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPV 230 (318) T ss_pred hcccccccccceEecccCccchhhhhcccCCCCCCcEEecc---CccchhhhhhcccccHHHHHHHHHHHHHhCCCCcce Confidence 23310 0000 0111111111100 00001 01111111 34445566666663222 Q ss_pred -cc--C-------cEEEeChHHHHHHhcccc---hh----hcccc--cccCcccccceEEEEeccEEEecCccccccCcc Q lcl|Aclame:pro 193 -IS--D-------VAIMMPWKFFNALRDADR---IV----DKTYT--ISQSGATINGFVLSSYNCPVIPSNRFPTFAQDQ 253 (402) Q Consensus 193 -~~--g-------R~~VV~P~~y~~Ll~~~r---~~----n~d~~--~~~~g~~~~G~V~~iaG~~V~~SNnlP~~~~~~ 253 (402) -+ + ++++++|.||..|..++. +. |+... 
+..++ +..|.++.++|+=|+|-.++|--= T Consensus 231 ~v~g~~~~~~~~~yV~~~~p~q~~~Lrtdt~~~~w~d~q~~A~~r~~g~knP-LF~G~~gm~ngvil~~~~~vpIrf--- 306 (318) T protein:vir:27 231 RLSGDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHP-LFKGECAMWRNILVRKYAGMPIRF--- 306 (318) T ss_pred eeccccccCCcceEEEEechHHHHHHhhcCCCHHHHHHHHHHHhcccccCCC-ceecceeeecCEEEeecCCccEEE--- Confidence 12 2 678999999999998752 22 22222 12233 457999999999999999987210 Q ss_pred ccccccccCCccccceeeecc Q lcl|Aclame:pro 254 AHHLLSNEDNGYRYDPIAEMN 274 (402) Q Consensus 254 t~~~ls~a~~G~~~~~~ad~~ 274 (402) + .|....| +.++ T Consensus 307 ------~--~G~~v~~-~~~~ 318 (318) T protein:vir:27 307 ------Y--QGQRFWY-QRIT 318 (318) T ss_pred ------c--CCCeeee-eecC Confidence 0 1111111 1111 No 161 >protein:vir:4197 Length: 314 # NCBI annotation: putative structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071822;genbank:gi:11863105;genbank:GeneID:1257607 Probab=98.45 E-value=3.5e-08 Score=61.49 Aligned_cols=298 Identities=10% Similarity=0.031 Sum_probs=141.5 Q ss_pred CCCCcccc-------cccccccccHHHHHHHHHhHHHHHHHHHHhhhcccceeeeccccceEEeeeccce--eeeeecC- Q lcl|Aclame:pro 1 MSTPNTLT-------NVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYLGET--ELQVLAP- 70 (402) Q Consensus 1 Ms~~n~~t-------~~~~~~~~d~~alfle~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~~iG~~--t~~~~~~- 70 (402) |=+-+.+- -+-.+ .|- |-=++|+ +.+...+..+.++++.++.+-.+..+..|+++|.. 
......- T Consensus 1 ~~~~~~~~~~~k~it~~d~~-gG~---L~P~~~~-~~i~~l~e~s~i~~~a~vi~t~~s~~~~i~~i~~g~~~~~~~~~~ 75 (314) T protein:vir:41 1 MDFLNKPFQITPKIDVPDLG-KGI---LAVQRFG-EFVREVRENSAIIKDARVLNALKSYEVDISRISLGVELEPGRNTS 75 (314) T ss_pred CchhhhHHHhhcccccccCC-Cce---eChHHHH-HHHHHHHhccchhhheeeecccCccceeecccccCcccccccccc Confidence 44432111 11111 111 1126664 67788899999999998754323455788888642 1111111 Q ss_pred C--CCCCCCCccccceeEeecceee-ccchhhhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccc Q lcl|Aclame:pro 71 G--QSPNATPTQADKNQLVIDTTVI-ARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAER 147 (402) Q Consensus 71 G--~~i~~~~~~~~e~~itID~~ly-a~~~IddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~ 147 (402) | +......+......|.+-.+.. .+..-+-|+++.-.-| +.+.+...+++++++.....+++= ..+.....| .. T Consensus 76 ~~~~~~~~~~~tf~~~~l~~~kl~~~v~is~e~L~D~a~~~~-le~~i~~~~Ae~~g~~~~~~~~nG-dg~~~s~~~-~~ 152 (314) T protein:vir:41 76 GTKVAPTADEVTVSTNTLEMKELVTKVVLEDEALEDNIEQSA-FEQTITSLLASGVTYDLECFFLHA-DSSLTTGRE-LY 152 (314) T ss_pred cCCccCCcccccccceeeeeEEEEEeecccHHHHHhhhchhh-HHHHHHHHHHHHHHHHHHHHhhcc-ccCCcCccc-ch Confidence 1 2223334555555555544432 2222344555543335 788898889999998887765421 100001011 00 Q ss_pred cccccccccc--ccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccC-cEEEeChHHHHHHhcccchhhcccccccC Q lcl|Aclame:pro 148 NKPRVKGHGF--SINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISD-VAIMMPWKFFNALRDADRIVDKTYTISQS 224 (402) Q Consensus 148 ~~~~~~g~~~--~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~g-R~~VV~P~~y~~Ll~~~r~~n~d~~~~~~ 224 (402) +- .+|... ...+...+... .....+.|.++...|+.++--... -..++++..+.++.+ .+-++. 
....+ T Consensus 153 ~~--p~G~l~~a~~~~~~~~~~~---~~~~~~~~~~l~~sl~~~yr~~~~~~~~~m~~~t~~~~r~--~l~~~~-~~l~~ 224 (314) T protein:vir:41 153 RI--NDGWMKLAGNQYTDAEPED---ENWPLNLFDGMMDELDTRYLQLKPRMKFYVSNEIYNGYRK--QLLVRE-TGLGD 224 (314) T ss_pred hc--chhhhhhcccceeecCccc---cccHHHHHHHHHHhcCchhhcCCCceEEEecHHHHHHHHH--HHhccC-Ccccc Confidence 00 111111 11111111111 122345556666666654432222 234679999887764 111111 12334 Q ss_pred cccccceEEEEeccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccchhH Q lcl|Aclame:pro 225 GATINGFVLSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKE 304 (402) Q Consensus 225 g~~~~G~V~~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~~ 304 (402) ..+..|.-.++.|+||+.++.+|..+.+ ...=+-++|.+. .-+-..++..+.+|+.++ T Consensus 225 ~~~~~~~~~~l~G~PV~~~~~~~~~~~~------------~~~i~fgd~~nl----------v~~~~~~ir~~~~~~a~~ 282 (314) T protein:vir:41 225 SALIGATGLQYDGIPIQYVPALDALGDD------------KARALLTVPTNL----------VYGFWRNIRIEPKRDAAM 282 (314) T ss_pred hhhhCCCCceecceeeEecccccccCCC------------CceEEEechhhe----------EEEeeceeEEeecccCcC Confidence 4455677778999999999999853311 111122334332 223344455566665544 Q ss_pred HHHHHHHHHHhcCcccccceEEEEEEeeccCcc Q lcl|Aclame:pro 305 KTYYIDTFMAEGAIPDRWEAVSVVTTKRDATTG 337 (402) Q Consensus 305 ~~d~i~~~~a~Ga~vlRPeaa~vv~~~~~~t~~ 337 (402) ....+...+=++.++.-+++++.. 
+-+...+| T Consensus 283 ~~~~~~~~~r~d~~~~~~~aa~~~-~~~~~~~~ 314 (314) T protein:vir:41 283 RRTEYIASLRADCNYEDENAAVAA-VIDMSSGG 314 (314) T ss_pred CeEEEEEEEEeceEEEEcCcEEEE-EeeccCCC Confidence 332223333345555555555444 33333333 No 162 >protein:vir:9875 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795637;genbank:gi:28876404;genbank:GeneID:1257935 Probab=98.45 E-value=1.1e-07 Score=58.81 Aligned_cols=273 Identities=10% Similarity=0.004 Sum_probs=158.1 Q ss_pred CCCCcccccccccccccH----HHHHHHHHhHHHHHHHHHHhhhcccceeeeccccceEEeeeccc--eeeeeecCCCCC Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEV----DSLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYLGE--TELQVLAPGQSP 74 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~----~alfle~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~~iG~--~t~~~~~~G~~i 74 (402) |-..-+.--.+-.-+.|- ---|.++|+.-+.+- ..+++.+|..++..|++++++.-+. ..+++..-|+.| T Consensus 1 ~~~~~~~~e~nlt~~~dl~~~~siDf~~~f~~~i~~L----~~~LGv~r~~pla~GstIkt~k~~~y~gda~dVaEGe~I 76 (296) T protein:vir:98 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKL----LEMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGEVI 76 (296) T ss_pred CCCccccCcCCCcchhhhhhhhhhhhHHHHhhhHHHH----HHHhhhcccccccCCCEEeeccceeeeeccccccCCccc Confidence 533221111111111111 124889998776443 3457888888899999997653322 244677788899 Q ss_pred CCCCccccc---eeEeecceeeccchhhhHHHh-h-cCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccc Q lcl|Aclame:pro 75 NATPTQADK---NQLVIDTTVIARNTVAHIHDV-Q-GDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNK 149 (402) Q Consensus 75 ~~~~~~~~e---~~itID~~lya~~~IddlDe~-q-~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~ 149 (402) +.+.+...+ .+++|.. |... + =||+ | .=|+.=-.|-.+++..++++++|..++..|..+.... 
T Consensus 77 plskvt~~~~~t~t~~ikK--~rK~-t--TdEAIqlsGyg~aVgetd~qL~~~iq~kId~d~~t~LktaT~t~------- 144 (296) T protein:vir:98 77 PLSKVERKIHSEKKIELKK--YRKA-T--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGTQ------- 144 (296) T ss_pred chhhheeeecceEEEEeec--cccc-c--CHHHHHhhcCCchhHHHHHHHHHHHHHhhhHHHHHHHhccccee------- Confidence 888776543 6666654 4444 3 4777 4 5555455788899999999999999988775332110 Q ss_pred ccccccccccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccccCccccc Q lcl|Aclame:pro 150 PRVKGHGFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATIN 229 (402) Q Consensus 150 ~~~~g~~~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~~~~ 229 (402) . ++...=...|+..+.++..++.+.+ ....+++|+|...+.+|++.++... ...+ T Consensus 145 ------------~---~t~~~lQ~Ala~~~~~l~~~feded--~~~~V~FVnP~D~a~ylg~a~it~q--------t~fG 199 (296) T protein:vir:98 145 ------------D---ALGAGLQGALASAWGKLQVLFEDYG--SERAIVFANSLDVAEYIAKAGITTQ--------TAFG 199 (296) T ss_pred ------------e---echhhHHHHHHHHhhhhhhhccccC--CCceEEEEehHHHHHHhcCCccchh--------heec Confidence 0 0000011234556667777776653 3468999999999999998876421 1112 Q ss_pred ceEE-EEeccEEEecCccccccCcccc--------ccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeecc Q lcl|Aclame:pro 230 GFVL-SSYNCPVIPSNRFPTFAQDQAH--------HLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFY 300 (402) Q Consensus 230 G~V~-~iaG~~V~~SNnlP~~~~~~t~--------~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~ 300 (402) +..+ .+.|..|+.|+.+|.+.-..|. ....+-.-+..|++..|.+..+|+- |.. ....++. 
T Consensus 200 ~tyl~nfLG~~II~S~kV~~G~~~~T~~~Ni~~ay~~~~~~~l~~~f~~~~d~tglIGv~-h~~-----~~~~~t~---- 269 (296) T protein:vir:98 200 LTYLVDFTGTVIISTNDVTKGEIWATVPENIIFAYINPNNSELAKEFNLYGDPTGYIGMN-HFQ-----ENTTLTI---- 269 (296) T ss_pred hhhhhhccccEEEEcCcCCCceEEEeeecceEEEeecccccchhhhhccccccccceEEE-ecc-----ccceeee---- Confidence 2222 3788999999999976543321 1111223466777777877777744 311 0111111 Q ss_pred chhHHHHHHHHHHHhcCccc---ccceEEEEEEeecc Q lcl|Aclame:pro 301 EKKEKTYYIDTFMAEGAIPD---RWEAVSVVTTKRDA 334 (402) Q Consensus 301 d~~~~~d~i~~~~a~Ga~vl---RPeaa~vv~~~~~~ 334 (402) ..- ++++-.+ |+|..+..+++.++ T Consensus 270 ----eT~------~~~~~~lfpE~~dgiv~~tI~~~~ 296 (296) T protein:vir:98 270 ----QTL------LVSGMLMYPERIDGIVKVTLTPGV 296 (296) T ss_pred ----hhH------hHhHHHhcccccceEEEEEecCCC Confidence 111 2333333 45566666665444 No 163 >protein:vir:101291 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908831;genbank:gi:118725095;genbank:GeneID:4555862 Probab=98.41 E-value=1.8e-08 Score=63.09 Aligned_cols=292 Identities=8% Similarity=-0.009 Sum_probs=148.2 Q ss_pred CCCCc--cccc------ccc-cccccHH-HHHHHHHhHHHHHHHHHHhhhcccceeeeccccceEEeeec-cceeeeeec Q lcl|Aclame:pro 1 MSTPN--TLTN------VAV-SASGEVD-SLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYL-GETELQVLA 69 (402) Q Consensus 1 Ms~~n--~~t~------~~~-~~~~d~~-alfle~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~~i-G~~t~~~~~ 69 (402) +.... .++. ... .+++.+- .|.=+.|..+++......|.++.+.++.++. |+ .+|++- +...+.-.. 
T Consensus 57 ~~~~~~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~-~~-~~i~~~~~~~~a~w~~ 134 (381) T protein:vir:10 57 SLPKSAQSLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAG-LR-LKFLKSETSGVAVWGK 134 (381) T ss_pred HhccCcccccHHHHHHHHHHhcccCCCCceecCHHHHHHHHHHHHhhccceeheeeEecC-cc-eEEEEecCCcceeeec Confidence 00000 0000 000 0111111 2333899999999999999999999988874 44 456655 444444444 Q ss_pred CCCCCCCC-CccccceeEeecceeeccch-h--hhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccc Q lcl|Aclame:pro 70 PGQSPNAT-PTQADKNQLVIDTTVIARNT-V--AHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKA 145 (402) Q Consensus 70 ~G~~i~~~-~~~~~e~~itID~~lya~~~-I--ddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~ 145 (402) -+.++..+ .+...+. .+...+++.+. | .-||+ +.+| +.+.+.+++++++++..|++++. +.....|. T Consensus 135 e~~~~~~~~~~~f~~i--~l~~~kl~~~~~is~elL~D--s~~~-ie~~i~~~la~~~a~~~~~a~i~----G~G~~qP~ 205 (381) T protein:vir:10 135 IYGEIKGQLDAAFSEE--TAIQNKLTAFVVLPKDLNDF--GPAW-IERFVRVQIEEAFAVALETAFLK----GTGKDQPI 205 (381) T ss_pred ccccccccccccceee--eecceeEEeechhhHHHhhc--CHHH-HHHHHHHHHHHHHHHHhhheeEe----ccCCCCce Confidence 34444432 3444444 44444444333 2 22444 3345 67889999999999999987741 11111110 Q ss_pred cccccccccccccccccc--------CCccccccHHHHHHHHHHHHHHHHh----hcC-CccCcEEEeChHHHHHHhccc Q lcl|Aclame:pro 146 ERNKPRVKGHGFSINVNV--------TESEALANPQYVMAAVEYALEQQLE----QEV-DISDVAIMMPWKFFNALRDAD 212 (402) Q Consensus 146 ~~~~~~~~g~~~~~~v~~--------~~a~~~~~~~~l~dai~~a~~~Lde----kdV-P~~gR~~VV~P~~y~~Ll~~~ 212 (402) -.- ........... .+.....++..+++.|.++...|.. +.. +...-+.+++|..++.|+.-. 
T Consensus 206 Gil----~~~~~~~~~~~g~~~~~~~~~t~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~l~~~~ 281 (381) T protein:vir:10 206 GLN----RQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQY 281 (381) T ss_pred eee----eccCcccccccccccccccccccccccchhhHHHHHHHHHhhccccccccccccCceEEEEccccHHhhcccc Confidence 000 00000000000 0111123445566666666655542 122 334556789999988886432 Q ss_pred chhhcccccccCcccccceEEEE--eccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhh Q lcl|Aclame:pro 213 RIVDKTYTISQSGATINGFVLSS--YNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGR 290 (402) Q Consensus 213 r~~n~d~~~~~~g~~~~G~V~~i--aG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~ 290 (402) .+.+ ++|.+ ... .|.+|++|+++|... + +-+||++- +++-+. T Consensus 282 ~~~~------~~G~~-----v~~l~~g~~vv~s~~~p~~~--i---------------ifgDfs~Y--~i~~r~------ 325 (381) T protein:vir:10 282 THLN------ANGVY-----VTALPFNLNVIESTVQEAGK--V---------------LTYVKGLY--DGYLAG------ 325 (381) T ss_pred ccCC------CCCce-----eecCCCCceEEecCCCCcCc--E---------------EEEecccE--EEEEec------ Confidence 2211 22222 222 377799999998522 0 12444431 122222 Q ss_pred hcccceeeccchhHHH---HHHHHHHHhcCcccccceEEEEEEeeccCccccccchhhH Q lcl|Aclame:pro 291 TIEVTGDIFYEKKEKT---YYIDTFMAEGAIPDRWEAVSVVTTKRDATTGDAGGPGDDH 346 (402) Q Consensus 291 ~~dl~~e~~~d~~~~~---d~i~~~~a~Ga~vlRPeaa~vv~~~~~~t~~~a~~~~~~~ 346 (402) ++..+.. ++..|. 
..+++++=++.++++|++.++++++....+.+.....--+ T Consensus 326 --~~~i~~~-~~~~~~~d~~~f~a~~r~dg~~~~~~A~~v~~l~~~~~~~~~~~~~~~~ 381 (381) T protein:vir:10 326 --GINVQKF-KETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTEETL 381 (381) T ss_pred --ccEEEee-chhHhhcCCeEEEEEEEEcCEEecCceEEEEEEEecCCCcCcccccccC Confidence 2223332 223333 2466667789999999999998888755444443222111 No 164 >protein:vir:9509 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835556;genbank:gi:30043951;genbank:GeneID:1260537 Probab=98.41 E-value=1.8e-08 Score=63.09 Aligned_cols=292 Identities=8% Similarity=-0.009 Sum_probs=148.2 Q ss_pred CCCCc--cccc------ccc-cccccHH-HHHHHHHhHHHHHHHHHHhhhcccceeeeccccceEEeeec-cceeeeeec Q lcl|Aclame:pro 1 MSTPN--TLTN------VAV-SASGEVD-SLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYL-GETELQVLA 69 (402) Q Consensus 1 Ms~~n--~~t~------~~~-~~~~d~~-alfle~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~~i-G~~t~~~~~ 69 (402) +.... .++. ... .+++.+- .|.=+.|..+++......|.++.+.++.++. |+ .+|++- +...+.-.. T Consensus 57 ~~~~~~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~-~~-~~i~~~~~~~~a~w~~ 134 (381) T protein:vir:95 57 SLPKSAQSLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAG-LR-LKFLKSETSGVAVWGK 134 (381) T ss_pred HhccCcccccHHHHHHHHHHhcccCCCCceecCHHHHHHHHHHHHhhccceeheeeEecC-cc-eEEEEecCCcceeeec Confidence 00000 0000 000 0111111 2333899999999999999999999988874 44 456655 444444444 Q ss_pred CCCCCCCC-CccccceeEeecceeeccch-h--hhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccc Q lcl|Aclame:pro 70 PGQSPNAT-PTQADKNQLVIDTTVIARNT-V--AHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKA 145 (402) Q Consensus 70 ~G~~i~~~-~~~~~e~~itID~~lya~~~-I--ddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~ 145 (402) -+.++..+ .+...+. .+...+++.+. | .-||+ +.+| +.+.+.+++++++++..|++++. +.....|. 
T Consensus 135 e~~~~~~~~~~~f~~i--~l~~~kl~~~~~is~elL~D--s~~~-ie~~i~~~la~~~a~~~~~a~i~----G~G~~qP~ 205 (381) T protein:vir:95 135 IYGEIKGQLDAAFSEE--TAIQNKLTAFVVLPKDLNDF--GPAW-IERFVRVQIEEAFAVALETAFLK----GTGKDQPI 205 (381) T ss_pred ccccccccccccceee--eecceeEEeechhhHHHhhc--CHHH-HHHHHHHHHHHHHHHHhhheeEe----ccCCCCce Confidence 34444432 3444444 44444444333 2 22444 3345 67889999999999999987741 11111110 Q ss_pred cccccccccccccccccc--------CCccccccHHHHHHHHHHHHHHHHh----hcC-CccCcEEEeChHHHHHHhccc Q lcl|Aclame:pro 146 ERNKPRVKGHGFSINVNV--------TESEALANPQYVMAAVEYALEQQLE----QEV-DISDVAIMMPWKFFNALRDAD 212 (402) Q Consensus 146 ~~~~~~~~g~~~~~~v~~--------~~a~~~~~~~~l~dai~~a~~~Lde----kdV-P~~gR~~VV~P~~y~~Ll~~~ 212 (402) -.- ........... .+.....++..+++.|.++...|.. +.. +...-+.+++|..++.|+.-. T Consensus 206 Gil----~~~~~~~~~~~g~~~~~~~~~t~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~l~~~~ 281 (381) T protein:vir:95 206 GLN----RQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQY 281 (381) T ss_pred eee----eccCcccccccccccccccccccccccchhhHHHHHHHHHhhccccccccccccCceEEEEccccHHhhcccc Confidence 000 00000000000 0111123445566666666655542 122 334556789999988886432 Q ss_pred chhhcccccccCcccccceEEEE--eccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhh Q lcl|Aclame:pro 213 RIVDKTYTISQSGATINGFVLSS--YNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGR 290 (402) Q Consensus 213 r~~n~d~~~~~~g~~~~G~V~~i--aG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~ 290 (402) .+.+ ++|.+ ... .|.+|++|+++|... + +-+||++- +++-+. 
T Consensus 282 ~~~~------~~G~~-----v~~l~~g~~vv~s~~~p~~~--i---------------ifgDfs~Y--~i~~r~------ 325 (381) T protein:vir:95 282 THLN------ANGVY-----VTALPFNLNVIESTVQEAGK--V---------------LTYVKGLY--DGYLAG------ 325 (381) T ss_pred ccCC------CCCce-----eecCCCCceEEecCCCCcCc--E---------------EEEecccE--EEEEec------ Confidence 2211 22222 222 377799999998522 0 12444431 122222 Q ss_pred hcccceeeccchhHHH---HHHHHHHHhcCcccccceEEEEEEeeccCccccccchhhH Q lcl|Aclame:pro 291 TIEVTGDIFYEKKEKT---YYIDTFMAEGAIPDRWEAVSVVTTKRDATTGDAGGPGDDH 346 (402) Q Consensus 291 ~~dl~~e~~~d~~~~~---d~i~~~~a~Ga~vlRPeaa~vv~~~~~~t~~~a~~~~~~~ 346 (402) ++..+.. ++..|. ..+++++=++.++++|++.++++++....+.+.....--+ T Consensus 326 --~~~i~~~-~~~~~~~d~~~f~a~~r~dg~~~~~~A~~v~~l~~~~~~~~~~~~~~~~ 381 (381) T protein:vir:95 326 --GINVQKF-KETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTEETL 381 (381) T ss_pred --ccEEEee-chhHhhcCCeEEEEEEEEcCEEecCceEEEEEEEecCCCcCcccccccC Confidence 2223332 223333 2466667789999999999998888755444443222111 No 165 >protein:vir:10123 Length: 404 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:180 # MgeName: Stx2 converting bacteriophage II # Cross-refs: genbank:acc:NP_859253;genbank:gi:32171009;genbank:GeneID:2653345 Probab=98.37 E-value=1.2e-07 Score=58.49 Aligned_cols=328 Identities=12% Similarity=0.044 Sum_probs=166.4 Q ss_pred CCCCcccc-ccccccc----ccHHHHHHHHHhHHHHHHHHHHhhhc---------ccceeeecc--ccceEEeeecccee Q lcl|Aclame:pro 1 MSTPNTLT-NVAVSAS----GEVDSLLIEKFNGKVNEQYLKGENIL---------SYFDVQTVT--GTNTVSNKYLGETE 64 (402) Q Consensus 1 Ms~~n~~t-~~~~~~~----~d~~alfle~f~geV~t~f~~~sv~~---------~~~~~rti~--~Gksv~f~~iG~~t 64 (402) |+.-.... ...|..+ -..+.-+++.|.+.+...-+..+-+. ..+++..+. 
.|++|.|+.+-..+ T Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~I~~~~dL~K~aGd~vtf~L~~~L~ 80 (404) T protein:vir:10 1 MTTVTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (404) T ss_pred CCCcCCcchhhhHHHHHHHHHhcCChhHhhhhhhhhhhhhhccchhhccCCCCCccEEEeecCCCCCCcEEEEeEeeecc Confidence 77654221 2222211 11223368888887544433322222 222233342 59999999999888 Q ss_pred eeeecCCCCCCC--CCccccceeEeecceeeccchhhhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhc Q lcl|Aclame:pro 65 LQVLAPGQSPNA--TPTQADKNQLVIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIAN 142 (402) Q Consensus 65 ~~~~~~G~~i~~--~~~~~~e~~itID~~lya~~~IddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~ 142 (402) -....-++.+.+ +.++.....|+||++.-.=..=..+++=...+| +|++--..++..+++..||.+|..|. +++.. T Consensus 81 g~gv~Gd~~lEGnee~L~~~s~~i~Idq~r~~V~~~g~msqQRt~~d-lr~~ar~~L~~w~~~~~d~~~~~~la-G~rg~ 158 (404) T protein:vir:10 81 KRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFN-LASSARTLLGTYFNDLQDQCAIVHLA-GARGD 158 (404) T ss_pred cCCcccCceeeccccceeEEeeEEEEeeecccccccCchhhhhhHHH-HHHHHHHHHHHHHHHHHHHHHHHHHh-ccccc Confidence 777776777876 458888999999996433111245666667788 89999899999999999999998886 33321 Q ss_pred ------------cccc-------cccccccccccccccccCCcc-ccccHHHH-HHHHHHHHHHHHhhcCCc-------c Q lcl|Aclame:pro 143 ------------TKAE-------RNKPRVKGHGFSINVNVTESE-ALANPQYV-MAAVEYALEQQLEQEVDI-------S 194 (402) Q Consensus 143 ------------a~~~-------~~~~~~~g~~~~~~v~~~~a~-~~~~~~~l-~dai~~a~~~LdekdVP~-------~ 194 (402) .+-. ...|..+-+..+. .++.. 
...+.+.+ ++.|-.+...+++..-|- + T Consensus 159 ~~n~~~~vp~~~~~~~~~~~~N~v~APt~~r~~~~g---~at~~~~l~stD~~s~~~Id~~~~~~~~~~~pi~Pv~~~g~ 235 (404) T protein:vir:10 159 FVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGG---DATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGD 235 (404) T ss_pred cccccceeeccccccccceeecccCCCCCCcEEecc---CccchhhhhhcccccHHHHHHHHHHHHHhCCCCcceEeccc Confidence 0000 0011111010000 00000 01111112 445556667776643332 1 Q ss_pred C-------cEEEeChHHHHHHhcccc---hhhccc----c--cccCcccccceEEEEeccEEEecCccccccCccccccc Q lcl|Aclame:pro 195 D-------VAIMMPWKFFNALRDADR---IVDKTY----T--ISQSGATINGFVLSSYNCPVIPSNRFPTFAQDQAHHLL 258 (402) Q Consensus 195 g-------R~~VV~P~~y~~Ll~~~r---~~n~d~----~--~~~~g~~~~G~V~~iaG~~V~~SNnlP~~~~~~t~~~l 258 (402) . +++++.|.||..|..++. +.+-.- . +-.++ +..|.++.++|+-|+|-.+.|-.-..+..... T Consensus 236 ~~~~~~~~yV~~~~p~q~~~Lr~dt~~~~w~d~q~~A~a~~rg~~nP-lF~G~~gm~ngvii~~~~~~~Irf~~g~~~~~ 314 (404) T protein:vir:10 236 ELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHP-LFKGECAMWRNILVRKYAGMPIRFYQGSKVLV 314 (404) T ss_pred cccCccceEEEEechHHHHHHhhCCCcHHHHHHHHHHhhccccccCC-ceecCeeEEcCEEEEecCCceeeecccceeee Confidence 2 678999999999999863 222111 1 12344 45799999999999998887731111111011 Q ss_pred cccCCcccc-ceeeeccceeEEeecHHHhh--hhhh----cccceeeccchhHHHHHHHHHHHhcCcccc-cc------e Q lcl|Aclame:pro 259 SNEDNGYRY-DPIAEMNGAVAVLFTSDALL--VGRT----IEVTGDIFYEKKEKTYYIDTFMAEGAIPDR-WE------A 324 (402) Q Consensus 259 s~a~~G~~~-~~~ad~~~~~al~fh~~Av~--tv~~----~dl~~e~~~d~~~~~d~i~~~~a~Ga~vlR-Pe------a 324 (402) ++.+.+... ..++..+-..+|++-..|++ .++. -.+.-|.+.-.++++ |-....+|.+=.| |. 
- T Consensus 315 ~~n~~~a~~~~~aa~~~v~RallLGaQAl~~A~g~~~g~~~~w~Ee~~D~g~~~~--i~~~~i~G~kK~rF~~~~g~~~D 392 (404) T protein:vir:10 315 SENNLTATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTE--IAISWINGLKKIRFPEKSGKMQD 392 (404) T ss_pred cCCccccccccccccccchhheeecceeEEEEeeccCCCCceeEeeccccCchhh--hhhHHHhhhhhccccCCCCceee Confidence 111111111 12222233345666555543 2332 112333333333332 3334446666666 31 1 Q ss_pred EEEEEEeeccCccccccchhhH Q lcl|Aclame:pro 325 VSVVTTKRDATTGDAGGPGDDH 346 (402) Q Consensus 325 a~vv~~~~~~t~~~a~~~~~~~ 346 (402) -|+|.+.- ++-. T Consensus 393 fGvi~idt----------a~~~ 404 (404) T protein:vir:10 393 HGVIAVDT----------AVKL 404 (404) T ss_pred EEEEEecc----------cccC Confidence 22222210 0000 No 166 >protein:vir:104439 Length: 404 # NCBI annotation: putative virion structural protein # Family: family:all:974 # MgeID: mge:1471 # MgeName: 86 # Cross-refs: genbank:acc:YP_794063;genbank:gi:116222008;genbank:GeneID:4397504 Probab=98.37 E-value=1.2e-07 Score=58.49 Aligned_cols=328 Identities=12% Similarity=0.044 Sum_probs=166.4 Q ss_pred CCCCcccc-ccccccc----ccHHHHHHHHHhHHHHHHHHHHhhhc---------ccceeeecc--ccceEEeeecccee Q lcl|Aclame:pro 1 MSTPNTLT-NVAVSAS----GEVDSLLIEKFNGKVNEQYLKGENIL---------SYFDVQTVT--GTNTVSNKYLGETE 64 (402) Q Consensus 1 Ms~~n~~t-~~~~~~~----~d~~alfle~f~geV~t~f~~~sv~~---------~~~~~rti~--~Gksv~f~~iG~~t 64 (402) |+.-.... ...|..+ -..+.-+++.|.+.+...-+..+-+. ..+++..+. 
.|++|.|+.+-..+ T Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~I~~~~dL~K~aGd~vtf~L~~~L~ 80 (404) T protein:vir:10 1 MTTVTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (404) T ss_pred CCCcCCcchhhhHHHHHHHHHhcCChhHhhhhhhhhhhhhhccchhhccCCCCCccEEEeecCCCCCCcEEEEeEeeecc Confidence 77654221 2222211 11223368888887544433322222 222233342 59999999999888 Q ss_pred eeeecCCCCCCC--CCccccceeEeecceeeccchhhhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhc Q lcl|Aclame:pro 65 LQVLAPGQSPNA--TPTQADKNQLVIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIAN 142 (402) Q Consensus 65 ~~~~~~G~~i~~--~~~~~~e~~itID~~lya~~~IddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~ 142 (402) -....-++.+.+ +.++.....|+||++.-.=..=..+++=...+| +|++--..++..+++..||.+|..|. +++.. T Consensus 81 g~gv~Gd~~lEGnee~L~~~s~~i~Idq~r~~V~~~g~msqQRt~~d-lr~~ar~~L~~w~~~~~d~~~~~~la-G~rg~ 158 (404) T protein:vir:10 81 KRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFN-LASSARTLLGTYFNDLQDQCAIVHLA-GARGD 158 (404) T ss_pred cCCcccCceeeccccceeEEeeEEEEeeecccccccCchhhhhhHHH-HHHHHHHHHHHHHHHHHHHHHHHHHh-ccccc Confidence 777776777876 458888999999996433111245666667788 89999899999999999999998886 33321 Q ss_pred ------------cccc-------cccccccccccccccccCCcc-ccccHHHH-HHHHHHHHHHHHhhcCCc-------c Q lcl|Aclame:pro 143 ------------TKAE-------RNKPRVKGHGFSINVNVTESE-ALANPQYV-MAAVEYALEQQLEQEVDI-------S 194 (402) Q Consensus 143 ------------a~~~-------~~~~~~~g~~~~~~v~~~~a~-~~~~~~~l-~dai~~a~~~LdekdVP~-------~ 194 (402) .+-. ...|..+-+..+. .++.. 
...+.+.+ ++.|-.+...+++..-|- + T Consensus 159 ~~n~~~~vp~~~~~~~~~~~~N~v~APt~~r~~~~g---~at~~~~l~stD~~s~~~Id~~~~~~~~~~~pi~Pv~~~g~ 235 (404) T protein:vir:10 159 FVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGG---DATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGD 235 (404) T ss_pred cccccceeeccccccccceeecccCCCCCCcEEecc---CccchhhhhhcccccHHHHHHHHHHHHHhCCCCcceEeccc Confidence 0000 0011111010000 00000 01111112 445556667776643332 1 Q ss_pred C-------cEEEeChHHHHHHhcccc---hhhccc----c--cccCcccccceEEEEeccEEEecCccccccCccccccc Q lcl|Aclame:pro 195 D-------VAIMMPWKFFNALRDADR---IVDKTY----T--ISQSGATINGFVLSSYNCPVIPSNRFPTFAQDQAHHLL 258 (402) Q Consensus 195 g-------R~~VV~P~~y~~Ll~~~r---~~n~d~----~--~~~~g~~~~G~V~~iaG~~V~~SNnlP~~~~~~t~~~l 258 (402) . +++++.|.||..|..++. +.+-.- . +-.++ +..|.++.++|+-|+|-.+.|-.-..+..... T Consensus 236 ~~~~~~~~yV~~~~p~q~~~Lr~dt~~~~w~d~q~~A~a~~rg~~nP-lF~G~~gm~ngvii~~~~~~~Irf~~g~~~~~ 314 (404) T protein:vir:10 236 ELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHP-LFKGECAMWRNILVRKYAGMPIRFYQGSKVLV 314 (404) T ss_pred cccCccceEEEEechHHHHHHhhCCCcHHHHHHHHHHhhccccccCC-ceecCeeEEcCEEEEecCCceeeecccceeee Confidence 2 678999999999999863 222111 1 12344 45799999999999998887731111111011 Q ss_pred cccCCcccc-ceeeeccceeEEeecHHHhh--hhhh----cccceeeccchhHHHHHHHHHHHhcCcccc-cc------e Q lcl|Aclame:pro 259 SNEDNGYRY-DPIAEMNGAVAVLFTSDALL--VGRT----IEVTGDIFYEKKEKTYYIDTFMAEGAIPDR-WE------A 324 (402) Q Consensus 259 s~a~~G~~~-~~~ad~~~~~al~fh~~Av~--tv~~----~dl~~e~~~d~~~~~d~i~~~~a~Ga~vlR-Pe------a 324 (402) ++.+.+... ..++..+-..+|++-..|++ .++. -.+.-|.+.-.++++ |-....+|.+=.| |. 
- T Consensus 315 ~~n~~~a~~~~~aa~~~v~RallLGaQAl~~A~g~~~g~~~~w~Ee~~D~g~~~~--i~~~~i~G~kK~rF~~~~g~~~D 392 (404) T protein:vir:10 315 SENNLTATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTE--IAISWINGLKKIRFPEKSGKMQD 392 (404) T ss_pred cCCccccccccccccccchhheeecceeEEEEeeccCCCCceeEeeccccCchhh--hhhHHHhhhhhccccCCCCceee Confidence 111111111 12222233345666555543 2332 112333333333332 3334446666666 31 1 Q ss_pred EEEEEEeeccCccccccchhhH Q lcl|Aclame:pro 325 VSVVTTKRDATTGDAGGPGDDH 346 (402) Q Consensus 325 a~vv~~~~~~t~~~a~~~~~~~ 346 (402) -|+|.+.- ++-. T Consensus 393 fGvi~idt----------a~~~ 404 (404) T protein:vir:10 393 HGVIAVDT----------AVKL 404 (404) T ss_pred EEEEEecc----------cccC Confidence 22222210 0000 No 167 >protein:vir:3298 Length: 404 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:66 # MgeName: 933W # Cross-refs: genbank:acc:NP_049514;genbank:gi:9632520;genbank:GeneID:1262006 Probab=98.37 E-value=1.2e-07 Score=58.49 Aligned_cols=328 Identities=12% Similarity=0.044 Sum_probs=166.4 Q ss_pred CCCCcccc-ccccccc----ccHHHHHHHHHhHHHHHHHHHHhhhc---------ccceeeecc--ccceEEeeecccee Q lcl|Aclame:pro 1 MSTPNTLT-NVAVSAS----GEVDSLLIEKFNGKVNEQYLKGENIL---------SYFDVQTVT--GTNTVSNKYLGETE 64 (402) Q Consensus 1 Ms~~n~~t-~~~~~~~----~d~~alfle~f~geV~t~f~~~sv~~---------~~~~~rti~--~Gksv~f~~iG~~t 64 (402) |+.-.... ...|..+ -..+.-+++.|.+.+...-+..+-+. ..+++..+. 
.|++|.|+.+-..+ T Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~I~~~~dL~K~aGd~vtf~L~~~L~ 80 (404) T protein:vir:32 1 MTTVTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (404) T ss_pred CCCcCCcchhhhHHHHHHHHHhcCChhHhhhhhhhhhhhhhccchhhccCCCCCccEEEeecCCCCCCcEEEEeEeeecc Confidence 77654221 2222211 11223368888887544433322222 222233342 59999999999888 Q ss_pred eeeecCCCCCCC--CCccccceeEeecceeeccchhhhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhc Q lcl|Aclame:pro 65 LQVLAPGQSPNA--TPTQADKNQLVIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIAN 142 (402) Q Consensus 65 ~~~~~~G~~i~~--~~~~~~e~~itID~~lya~~~IddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~ 142 (402) -....-++.+.+ +.++.....|+||++.-.=..=..+++=...+| +|++--..++..+++..||.+|..|. +++.. T Consensus 81 g~gv~Gd~~lEGnee~L~~~s~~i~Idq~r~~V~~~g~msqQRt~~d-lr~~ar~~L~~w~~~~~d~~~~~~la-G~rg~ 158 (404) T protein:vir:32 81 KRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFN-LASSARTLLGTYFNDLQDQCAIVHLA-GARGD 158 (404) T ss_pred cCCcccCceeeccccceeEEeeEEEEeeecccccccCchhhhhhHHH-HHHHHHHHHHHHHHHHHHHHHHHHHh-ccccc Confidence 777776777876 458888999999996433111245666667788 89999899999999999999998886 33321 Q ss_pred ------------cccc-------cccccccccccccccccCCcc-ccccHHHH-HHHHHHHHHHHHhhcCCc-------c Q lcl|Aclame:pro 143 ------------TKAE-------RNKPRVKGHGFSINVNVTESE-ALANPQYV-MAAVEYALEQQLEQEVDI-------S 194 (402) Q Consensus 143 ------------a~~~-------~~~~~~~g~~~~~~v~~~~a~-~~~~~~~l-~dai~~a~~~LdekdVP~-------~ 194 (402) .+-. ...|..+-+..+. .++.. 
...+.+.+ ++.|-.+...+++..-|- + T Consensus 159 ~~n~~~~vp~~~~~~~~~~~~N~v~APt~~r~~~~g---~at~~~~l~stD~~s~~~Id~~~~~~~~~~~pi~Pv~~~g~ 235 (404) T protein:vir:32 159 FVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGG---DATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGD 235 (404) T ss_pred cccccceeeccccccccceeecccCCCCCCcEEecc---CccchhhhhhcccccHHHHHHHHHHHHHhCCCCcceEeccc Confidence 0000 0011111010000 00000 01111112 445556667776643332 1 Q ss_pred C-------cEEEeChHHHHHHhcccc---hhhccc----c--cccCcccccceEEEEeccEEEecCccccccCccccccc Q lcl|Aclame:pro 195 D-------VAIMMPWKFFNALRDADR---IVDKTY----T--ISQSGATINGFVLSSYNCPVIPSNRFPTFAQDQAHHLL 258 (402) Q Consensus 195 g-------R~~VV~P~~y~~Ll~~~r---~~n~d~----~--~~~~g~~~~G~V~~iaG~~V~~SNnlP~~~~~~t~~~l 258 (402) . +++++.|.||..|..++. +.+-.- . +-.++ +..|.++.++|+-|+|-.+.|-.-..+..... T Consensus 236 ~~~~~~~~yV~~~~p~q~~~Lr~dt~~~~w~d~q~~A~a~~rg~~nP-lF~G~~gm~ngvii~~~~~~~Irf~~g~~~~~ 314 (404) T protein:vir:32 236 ELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHP-LFKGECAMWRNILVRKYAGMPIRFYQGSKVLV 314 (404) T ss_pred cccCccceEEEEechHHHHHHhhCCCcHHHHHHHHHHhhccccccCC-ceecCeeEEcCEEEEecCCceeeecccceeee Confidence 2 678999999999999863 222111 1 12344 45799999999999998887731111111011 Q ss_pred cccCCcccc-ceeeeccceeEEeecHHHhh--hhhh----cccceeeccchhHHHHHHHHHHHhcCcccc-cc------e Q lcl|Aclame:pro 259 SNEDNGYRY-DPIAEMNGAVAVLFTSDALL--VGRT----IEVTGDIFYEKKEKTYYIDTFMAEGAIPDR-WE------A 324 (402) Q Consensus 259 s~a~~G~~~-~~~ad~~~~~al~fh~~Av~--tv~~----~dl~~e~~~d~~~~~d~i~~~~a~Ga~vlR-Pe------a 324 (402) ++.+.+... ..++..+-..+|++-..|++ .++. -.+.-|.+.-.++++ |-....+|.+=.| |. 
- T Consensus 315 ~~n~~~a~~~~~aa~~~v~RallLGaQAl~~A~g~~~g~~~~w~Ee~~D~g~~~~--i~~~~i~G~kK~rF~~~~g~~~D 392 (404) T protein:vir:32 315 SENNLTATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTE--IAISWINGLKKIRFPEKSGKMQD 392 (404) T ss_pred cCCccccccccccccccchhheeecceeEEEEeeccCCCCceeEeeccccCchhh--hhhHHHhhhhhccccCCCCceee Confidence 111111111 12222233345666555543 2332 112333333333332 3334446666666 31 1 Q ss_pred EEEEEEeeccCccccccchhhH Q lcl|Aclame:pro 325 VSVVTTKRDATTGDAGGPGDDH 346 (402) Q Consensus 325 a~vv~~~~~~t~~~a~~~~~~~ 346 (402) -|+|.+.- ++-. T Consensus 393 fGvi~idt----------a~~~ 404 (404) T protein:vir:32 393 HGVIAVDT----------AVKL 404 (404) T ss_pred EEEEEecc----------cccC Confidence 22222210 0000 No 168 >protein:vir:819 Length: 404 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:16 # MgeName: VT2-Sa # Cross-refs: genbank:acc:NP_050552;genbank:gi:9633449;genbank:GeneID:1262254 Probab=98.37 E-value=1.2e-07 Score=58.49 Aligned_cols=328 Identities=12% Similarity=0.044 Sum_probs=166.4 Q ss_pred CCCCcccc-ccccccc----ccHHHHHHHHHhHHHHHHHHHHhhhc---------ccceeeecc--ccceEEeeecccee Q lcl|Aclame:pro 1 MSTPNTLT-NVAVSAS----GEVDSLLIEKFNGKVNEQYLKGENIL---------SYFDVQTVT--GTNTVSNKYLGETE 64 (402) Q Consensus 1 Ms~~n~~t-~~~~~~~----~d~~alfle~f~geV~t~f~~~sv~~---------~~~~~rti~--~Gksv~f~~iG~~t 64 (402) |+.-.... ...|..+ -..+.-+++.|.+.+...-+..+-+. ..+++..+. 
.|++|.|+.+-..+ T Consensus 1 ~~~~~~~~a~~~~~~~lft~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~I~~~~dL~K~aGd~vtf~L~~~L~ 80 (404) T protein:vir:81 1 MTTVTSAQANKLYQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (404) T ss_pred CCCcCCcchhhhHHHHHHHHHhcCChhHhhhhhhhhhhhhhccchhhccCCCCCccEEEeecCCCCCCcEEEEeEeeecc Confidence 77654221 2222211 11223368888887544433322222 222233342 59999999999888 Q ss_pred eeeecCCCCCCC--CCccccceeEeecceeeccchhhhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhc Q lcl|Aclame:pro 65 LQVLAPGQSPNA--TPTQADKNQLVIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIAN 142 (402) Q Consensus 65 ~~~~~~G~~i~~--~~~~~~e~~itID~~lya~~~IddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~ 142 (402) -....-++.+.+ +.++.....|+||++.-.=..=..+++=...+| +|++--..++..+++..||.+|..|. +++.. T Consensus 81 g~gv~Gd~~lEGnee~L~~~s~~i~Idq~r~~V~~~g~msqQRt~~d-lr~~ar~~L~~w~~~~~d~~~~~~la-G~rg~ 158 (404) T protein:vir:81 81 KRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFN-LASSARTLLGTYFNDLQDQCAIVHLA-GARGD 158 (404) T ss_pred cCCcccCceeeccccceeEEeeEEEEeeecccccccCchhhhhhHHH-HHHHHHHHHHHHHHHHHHHHHHHHHh-ccccc Confidence 777776777876 458888999999996433111245666667788 89999899999999999999998886 33321 Q ss_pred ------------cccc-------cccccccccccccccccCCcc-ccccHHHH-HHHHHHHHHHHHhhcCCc-------c Q lcl|Aclame:pro 143 ------------TKAE-------RNKPRVKGHGFSINVNVTESE-ALANPQYV-MAAVEYALEQQLEQEVDI-------S 194 (402) Q Consensus 143 ------------a~~~-------~~~~~~~g~~~~~~v~~~~a~-~~~~~~~l-~dai~~a~~~LdekdVP~-------~ 194 (402) .+-. ...|..+-+..+. .++.. 
...+.+.+ ++.|-.+...+++..-|- + T Consensus 159 ~~n~~~~vp~~~~~~~~~~~~N~v~APt~~r~~~~g---~at~~~~l~stD~~s~~~Id~~~~~~~~~~~pi~Pv~~~g~ 235 (404) T protein:vir:81 159 FVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGG---DATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGD 235 (404) T ss_pred cccccceeeccccccccceeecccCCCCCCcEEecc---CccchhhhhhcccccHHHHHHHHHHHHHhCCCCcceEeccc Confidence 0000 0011111010000 00000 01111112 445556667776643332 1 Q ss_pred C-------cEEEeChHHHHHHhcccc---hhhccc----c--cccCcccccceEEEEeccEEEecCccccccCccccccc Q lcl|Aclame:pro 195 D-------VAIMMPWKFFNALRDADR---IVDKTY----T--ISQSGATINGFVLSSYNCPVIPSNRFPTFAQDQAHHLL 258 (402) Q Consensus 195 g-------R~~VV~P~~y~~Ll~~~r---~~n~d~----~--~~~~g~~~~G~V~~iaG~~V~~SNnlP~~~~~~t~~~l 258 (402) . +++++.|.||..|..++. +.+-.- . +-.++ +..|.++.++|+-|+|-.+.|-.-..+..... T Consensus 236 ~~~~~~~~yV~~~~p~q~~~Lr~dt~~~~w~d~q~~A~a~~rg~~nP-lF~G~~gm~ngvii~~~~~~~Irf~~g~~~~~ 314 (404) T protein:vir:81 236 ELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHP-LFKGECAMWRNILVRKYAGMPIRFYQGSKVLV 314 (404) T ss_pred cccCccceEEEEechHHHHHHhhCCCcHHHHHHHHHHhhccccccCC-ceecCeeEEcCEEEEecCCceeeecccceeee Confidence 2 678999999999999863 222111 1 12344 45799999999999998887731111111011 Q ss_pred cccCCcccc-ceeeeccceeEEeecHHHhh--hhhh----cccceeeccchhHHHHHHHHHHHhcCcccc-cc------e Q lcl|Aclame:pro 259 SNEDNGYRY-DPIAEMNGAVAVLFTSDALL--VGRT----IEVTGDIFYEKKEKTYYIDTFMAEGAIPDR-WE------A 324 (402) Q Consensus 259 s~a~~G~~~-~~~ad~~~~~al~fh~~Av~--tv~~----~dl~~e~~~d~~~~~d~i~~~~a~Ga~vlR-Pe------a 324 (402) ++.+.+... ..++..+-..+|++-..|++ .++. -.+.-|.+.-.++++ |-....+|.+=.| |. 
- T Consensus 315 ~~n~~~a~~~~~aa~~~v~RallLGaQAl~~A~g~~~g~~~~w~Ee~~D~g~~~~--i~~~~i~G~kK~rF~~~~g~~~D 392 (404) T protein:vir:81 315 SENNLTATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTE--IAISWINGLKKIRFPEKSGKMQD 392 (404) T ss_pred cCCccccccccccccccchhheeecceeEEEEeeccCCCCceeEeeccccCchhh--hhhHHHhhhhhccccCCCCceee Confidence 111111111 12222233345666555543 2332 112333333333332 3334446666666 31 1 Q ss_pred EEEEEEeeccCccccccchhhH Q lcl|Aclame:pro 325 VSVVTTKRDATTGDAGGPGDDH 346 (402) Q Consensus 325 a~vv~~~~~~t~~~a~~~~~~~ 346 (402) -|+|.+.- ++-. T Consensus 393 fGvi~idt----------a~~~ 404 (404) T protein:vir:81 393 HGVIAVDT----------AVKL 404 (404) T ss_pred EEEEEecc----------cccC Confidence 22222210 0000 No 169 >protein:vir:106647 Length: 303 # NCBI annotation: ORF011 # Family: family:all:1178 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239493;genbank:gi:66395226;genbank:GeneID:4555801 Probab=98.36 E-value=1.7e-07 Score=57.70 Aligned_cols=281 Identities=15% Similarity=0.084 Sum_probs=158.6 Q ss_pred CCCCcccccccccccccHHHHHHHHHhHHHHHHHHHHhhhcccceeeeccccceEEeeec----cceeeeeecCCCCCCC Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYL----GETELQVLAPGQSPNA 76 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~alfle~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~~i----G~~t~~~~~~G~~i~~ 76 (402) |+-.++++..---+.. .---|.++|+.-+.+=+ .+++.+|..++..|.+++.+.. -....++..-|+.|+. 
T Consensus 1 M~~e~nl~~~~dL~~a-~siDF~~~f~~~i~~L~----~~LGv~r~~pla~Gt~iktyK~~~~~y~gda~dVaEGe~Ipl 75 (303) T protein:vir:10 1 MSAENNLINVEALGKA-KSIDFANKLGVGLNKLF----EALAIQNKIPMNVGSALKQYRFKVEDSEKPNGDVAEGDVIPL 75 (303) T ss_pred CCCCcCCcchhhcccc-eeehhhhhhhhhHHHHH----HHhhhhccccccCCceeeeeeeeceeeccccccccCCcccch Confidence 9988877644422111 01248899987764433 3567777777777877765532 1123457778888887 Q ss_pred CCccc---cceeEeecceeeccchhhhHHHh-h-cCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccccc Q lcl|Aclame:pro 77 TPTQA---DKNQLVIDTTVIARNTVAHIHDV-Q-GDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPR 151 (402) Q Consensus 77 ~~~~~---~e~~itID~~lya~~~IddlDe~-q-~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~ 151 (402) ..+.. ...+++++. |.... =||+ | .=|+.=-.|-.++++.++++++|..++..+..+-.... T Consensus 76 skvt~~~~~t~~~~~kK--~rK~t---TdEAIqlsGyg~aVgetd~qL~~~Iq~kIdnd~~~~lktaT~t~~-------- 142 (303) T protein:vir:10 76 TKVTREQVDITELQFAK--YRKST---SAEAIQAHGYDLAINQTDNEMIKYVQKKFRAKFFETLKSAIENGK-------- 142 (303) T ss_pred hhheeeecceEEEEeec--ccccc---cHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhhcccccc-------- Confidence 77654 346666754 45533 4566 3 55554567888999999999999999988765321110 Q ss_pred ccccccccccccCCccccccHHHHHHHH---HHHHHHHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccccCcccc Q lcl|Aclame:pro 152 VKGHGFSINVNVTESEALANPQYVMAAV---EYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATI 228 (402) Q Consensus 152 ~~g~~~~~~v~~~~a~~~~~~~~l~dai---~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~~~ 228 (402) . +.+...+.+.|-.+| ..-...++|.++ --+++|+|...+.||.+..+... ++.=|. 
T Consensus 143 -----------~-t~~t~~s~~glq~Al~~~~~kl~~~~ed~~---~~V~FvNP~Daa~yl~~A~i~~~---~t~fG~-- 202 (303) T protein:vir:10 143 -----------R-TNKTKLSAENLQGALSKGRANLSVLLDDEI---TPIAFVNPNDTAEYLANGFINST---GAQFGV-- 202 (303) T ss_pred -----------c-ccceeecHHHHHHHHHhhhhhccccccccc---cEEEEEchHHHHHHhhcCCcchh---hhhhhh-- Confidence 0 001111222222222 222333455443 24899999999999998777532 111111 Q ss_pred cceEEEEeccEEEecCccccccCcccccc-------ccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccc Q lcl|Aclame:pro 229 NGFVLSSYNCPVIPSNRFPTFAQDQAHHL-------LSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYE 301 (402) Q Consensus 229 ~G~V~~iaG~~V~~SNnlP~~~~~~t~~~-------ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d 301 (402) --+..+.|++|+.|+.+|.+.-..|... ...-.-+..|+++.|.+..+|+- |.. ....++. T Consensus 203 -n~L~nfLG~~II~S~kv~~G~~~~T~~~Ni~~ay~~~~g~l~~~f~~t~D~tglIGv~-h~~-----~~~~~t~----- 270 (303) T protein:vir:10 203 -NLLTPYVGVKIVEFADVPQGEVWMTVAENLNVAYANPRGELSRAFAFATDATGFVGVL-HDI-----QPQRLTS----- 270 (303) T ss_pred -hhhhhhhcceEEEeccCCCceEEEeeccceEEEEecCchhhhhhhhhccccccceEEE-ecc-----ccceeee----- Confidence 2233589999999999998654333210 01112456778888877777744 311 0111111 Q ss_pred hhHHHHHHHHHHHhcCcc---cccceEEEEEEeeccCccccc Q lcl|Aclame:pro 302 KKEKTYYIDTFMAEGAIP---DRWEAVSVVTTKRDATTGDAG 340 (402) Q Consensus 302 ~~~~~d~i~~~~a~Ga~v---lRPeaa~vv~~~~~~t~~~a~ 340 (402) ..- ++++-. 
=|+|..+..+++.+..+...+ T Consensus 271 ---eT~------~~~~~~lfpE~~dgiv~~ti~~~e~~~~~~ 303 (303) T protein:vir:10 271 ---DTI------YASAISMFPENIDAVIKVTIKKDEAGELPS 303 (303) T ss_pred ---hhH------hHhHHHhcccccceEEEEEEeccccCCCCC Confidence 111 122233 355677777776555333322 No 170 >protein:vir:9643 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795405;genbank:gi:28876178;genbank:GeneID:1257724 Probab=98.26 E-value=6.1e-08 Score=60.15 Aligned_cols=283 Identities=8% Similarity=-0.054 Sum_probs=141.8 Q ss_pred CCCCc--cccc---c----cccccc-cHHHHHH-HHHhHHHHHHHHHHhhhcccceeeeccccceEEeeec-cceeeeee Q lcl|Aclame:pro 1 MSTPN--TLTN---V----AVSASG-EVDSLLI-EKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYL-GETELQVL 68 (402) Q Consensus 1 Ms~~n--~~t~---~----~~~~~~-d~~alfl-e~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~~i-G~~t~~~~ 68 (402) ...+. .++. - ...+.+ +.-..+| +.|..++++...+.|.++++.++.++.+ + .+|++- +..++... T Consensus 59 ~~~~~~~~lt~ee~~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~~-~-~~i~~~~~~~~a~wv 136 (377) T protein:vir:96 59 DLRDKNRELTAEEIKFFNDIDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSL-R-LKALTAETSGTAVWG 136 (377) T ss_pred HhccCCcccCHHHHHHHHHHHhcCCCCCCceecCHHHHHHHHHHHHhhhhhhhhceeEecCC-c-eEEEEecCCcceeEe Confidence 11110 0100 0 001111 1112234 7889999999999999999999988754 3 456654 44455554 Q ss_pred cCCCCCCC-CCccccceeEeecceeeccch-h--hhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccc Q lcl|Aclame:pro 69 APGQSPNA-TPTQADKNQLVIDTTVIARNT-V--AHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTK 144 (402) Q Consensus 69 ~~G~~i~~-~~~~~~e~~itID~~lya~~~-I--ddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~ 144 (402) .-+.++.. ..+...+.+|.. .+++.+. | .-|++ +.+| +-+.+.++++.++++..|++++. 
|.....| T Consensus 137 ~e~~~~~~~~~~~f~~i~l~~--~kl~~~~~is~~ll~d--s~~~-le~~i~~~l~~~~~~~~~~a~i~----G~G~~~P 207 (377) T protein:vir:96 137 DIFGEIKGQLKQAFKEQDFSQ--FKLTAFVVIPKDALKF--GPKW-LKQFITEQLKEAIAVALELAIVK----GNGLLQP 207 (377) T ss_pred ecccccccccCccceeEeeee--eeEEeechhhHHHhhc--chhh-HHHHHHHHHHHHHHHHHhhceEe----ccCCCcc Confidence 44445543 245555555544 4444332 2 22333 3444 67889999999999999988852 1110000 Q ss_pred cc-------ccccccccccccccccc---CCccccccHHHHHHHHHHHHHHHHhhc--CC---ccCcEEEeChHHHHHHh Q lcl|Aclame:pro 145 AE-------RNKPRVKGHGFSINVNV---TESEALANPQYVMAAVEYALEQQLEQE--VD---ISDVAIMMPWKFFNALR 209 (402) Q Consensus 145 ~~-------~~~~~~~g~~~~~~v~~---~~a~~~~~~~~l~dai~~a~~~Ldekd--VP---~~gR~~VV~P~~y~~Ll 209 (402) .- .......+......... .+.....++..+++.+..+...+.... -| ...-+.+++|..|..++ T Consensus 208 ~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~~~ 287 (377) T protein:vir:96 208 VGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLE 287 (377) T ss_pred eeeeeccccccccccccccccceeeccccccccccCChhHHHHHHHHHHHhhccccccccccccCceEEEEchhhHHhcc Confidence 00 00000000000000000 001112345566666666555554221 12 12234678998887765 Q ss_pred cccchhhcccccccCcccccceEEEEe--ccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhh Q lcl|Aclame:pro 210 DADRIVDKTYTISQSGATINGFVLSSY--NCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALL 287 (402) Q Consensus 210 ~~~r~~n~d~~~~~~g~~~~G~V~~ia--G~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~ 287 (402) ......+ ++ |.-..+. |++|++|+.+|.+. 
+ +-+||++ -+++ T Consensus 288 ~~~~~~~------~~-----G~~~~~l~~p~~v~~s~~~p~~~--i---------------~fgdf~~--Y~i~------ 331 (377) T protein:vir:96 288 AKFTSRN------QF-----GEYVTVLPHGITILESLAVETGK--A---------------IAFVANR--YDAF------ 331 (377) T ss_pred ccccccC------CC-----CCceeccCCCceEEecCCCCccc--E---------------EEEEcCc--EEEE------ Confidence 3222221 12 2223444 55689999999532 1 1133333 1111 Q ss_pred hhhhcccceeeccchhHHH---HHHHHHHHhcCcccccceEEEEEEeec Q lcl|Aclame:pro 288 VGRTIEVTGDIFYEKKEKT---YYIDTFMAEGAIPDRWEAVSVVTTKRD 333 (402) Q Consensus 288 tv~~~dl~~e~~~d~~~~~---d~i~~~~a~Ga~vlRPeaa~vv~~~~~ 333 (402) .-.++..+..+ +.+|. ..+++++=++.++++|++.+++.+..| T Consensus 332 --~r~~~~i~~~~-~~~~~~d~~~f~~~~r~dG~~~d~~a~~vl~l~~~ 377 (377) T protein:vir:96 332 --MATASTIEEYD-QTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) T ss_pred --EecccEEEeeh-hhhhhcCCeEEEEEEEEcCEEecCCcEEEEEEecC Confidence 11223333332 23322 345666678999999999999999998 No 171 >protein:vir:100632 Length: 381 # NCBI annotation: 77ORF006 # Family: family:all:635 # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958606;genbank:gi:41189521;genbank:GeneID:2743778 Probab=98.23 E-value=5e-08 Score=60.60 Aligned_cols=293 Identities=8% Similarity=-0.045 Sum_probs=145.7 Q ss_pred CCCC--cccccc------cc-cccccHH-HHHHHHHhHHHHHHHHHHhhhcccceeeeccccceEEeeec-cceeeeeec Q lcl|Aclame:pro 1 MSTP--NTLTNV------AV-SASGEVD-SLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYL-GETELQVLA 69 (402) Q Consensus 1 Ms~~--n~~t~~------~~-~~~~d~~-alfle~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~~i-G~~t~~~~~ 69 (402) +... +.++.- .. .+++..- -|.=++|..+++......|.++.+.++.++. |+ .++++- +..++.-.. 
T Consensus 57 ~~~~~~~~l~~~e~~~~~~~~~~t~~~Gg~lvP~~~~~~I~~~l~~~spir~~a~v~~~~-~~-~~i~~~~~~~~a~W~~ 134 (381) T protein:vir:10 57 SLPKSAQTLSANQRNFFMDINKSVGYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAG-LR-LKFLKSETSGVAVWGK 134 (381) T ss_pred HhcccccccCHHHHHHHHHHhhcCCCCCceecCHHHHHHHHHHHHhhcceeeeeeeEecC-cc-eEEEeecCCcceEEee Confidence 0000 000000 00 1111111 2333899999999999999999999988874 44 355544 333333222 Q ss_pred CCCCCCCC-CccccceeEeecceeeccch---hhhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccc Q lcl|Aclame:pro 70 PGQSPNAT-PTQADKNQLVIDTTVIARNT---VAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKA 145 (402) Q Consensus 70 ~G~~i~~~-~~~~~e~~itID~~lya~~~---IddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~ 145 (402) -+.++..+ .+.. ..+++...+++.+. -.-||+... | +.+.+..++++++++..|++++ .+.....|. T Consensus 135 e~~~~~~~~~~~f--~~i~l~~~kl~a~i~is~elL~Ds~~--~-le~~i~~~la~~~a~~~~~afi----~GdG~~qP~ 205 (381) T protein:vir:10 135 IYGEIKGQLDAAF--SEETAIQNKLTAFVVLPKDLNDFGPA--W-IERFVRVQIEEAFAVALETAFL----KGTGKDQPI 205 (381) T ss_pred cccccccccCccc--eeEeecceeEEeeccccHHHHhccHH--H-HHHHHHHHHHHHHHHHhhceeE----ecccCCCce Confidence 12233222 2333 34555555554433 233555544 3 5688999999999999998774 121111110 Q ss_pred ccccccccccccccccccC--------CccccccHHHHHHHHHHHHHHHH----hhcC-CccCcEEEeChHHHHHHhccc Q lcl|Aclame:pro 146 ERNKPRVKGHGFSINVNVT--------ESEALANPQYVMAAVEYALEQQL----EQEV-DISDVAIMMPWKFFNALRDAD 212 (402) Q Consensus 146 ~~~~~~~~g~~~~~~v~~~--------~a~~~~~~~~l~dai~~a~~~Ld----ekdV-P~~gR~~VV~P~~y~~Ll~~~ 212 (402) -. - .....+.....+ +.....++..+++.+......+. .+.. +..+.+.+++|..|+.|+... 
T Consensus 206 Gi-l---~~~~~~~~~~~g~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~vmn~~t~~~l~~~~ 281 (381) T protein:vir:10 206 GL-N---RQVQKGVSVTDGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQY 281 (381) T ss_pred ee-e---ecCCccccccccccccccccccccccchhhHHHHHHHHHHhhhhhhccccccccCceEEEEchhhHHhhcccc Confidence 00 0 000000000000 00011233444555444333332 1222 334567799999999887644 Q ss_pred chhhcccccccCcccccceEEEE-eccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhh Q lcl|Aclame:pro 213 RIVDKTYTISQSGATINGFVLSS-YNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRT 291 (402) Q Consensus 213 r~~n~d~~~~~~g~~~~G~V~~i-aG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~ 291 (402) .+.+ ++|.+.. .. .|.+|++|+++|... + .-+||++- +++-+ T Consensus 282 ~~~~------~~G~~v~----~lp~g~~vv~~~~~p~~~--i---------------~fGDfs~Y--~i~~r-------- 324 (381) T protein:vir:10 282 THLN------ANGVYVT----ALPFNLNVIESTVQEAGK--V---------------LTYVKGLY--DGYLA-------- 324 (381) T ss_pred ccCC------CCCceee----cCCCCceeEEcCCCCcCc--E---------------EEEEcccE--EEEEe-------- Confidence 3322 2222221 11 488899999999522 1 12455441 22212 Q ss_pred cccceeeccchhHHH---HHHHHHHHhcCcccccceEEEEEEeeccCccccccchhhH Q lcl|Aclame:pro 292 IEVTGDIFYEKKEKT---YYIDTFMAEGAIPDRWEAVSVVTTKRDATTGDAGGPGDDH 346 (402) Q Consensus 292 ~dl~~e~~~d~~~~~---d~i~~~~a~Ga~vlRPeaa~vv~~~~~~t~~~a~~~~~~~ 346 (402) .++..+.. ++.+|. 
..+++++=++.++++|++.++++++.-.+++....+..-+ T Consensus 325 ~~~~i~~~-~~~~~~~d~~~f~a~~r~dG~~~~~~A~~v~~l~~~~~~~~~~~~~~~~ 381 (381) T protein:vir:10 325 GGINVQKF-KETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEDTEETL 381 (381) T ss_pred cccEEEee-chhhhhcCceEEEEEEEEcCEEecCCcEEEEEEeecCCccccccccccC Confidence 22233333 233333 2456666789999999999998888655555544443332 No 172 >protein:vir:95963 Length: 395 # NCBI annotation: ORF009 # Family: family:all:635 # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239802;genbank:gi:66395459;genbank:GeneID:5132880 Probab=98.11 E-value=2e-07 Score=57.31 Aligned_cols=295 Identities=14% Similarity=0.040 Sum_probs=137.7 Q ss_pred CCC--------Ccccc---cc---c-ccccccHHHHHH-HHHhHHHHHHHHHHhhhcccceeeeccccceEEeeeccce- Q lcl|Aclame:pro 1 MST--------PNTLT---NV---A-VSASGEVDSLLI-EKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYLGET- 63 (402) Q Consensus 1 Ms~--------~n~~t---~~---~-~~~~~d~~alfl-e~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~~iG~~- 63 (402) +.. .+.++ +- . ..+++..--..| +.+..++++...+.+.+++++++.++. |+ .+|++.... T Consensus 61 ~~~~~~~~~r~~~~l~~ee~~~~~~~~~~t~~~gG~liP~~~~~~Ii~~l~~~s~i~~~~~v~~~~-~~-~~i~~~~~~~ 138 (395) T protein:vir:95 61 VVDNGILAKRSQDPLTSEERKFFNDINYDVGYTDEKILPETVVERVFDDLQKDHPLLSKINFQNAG-IK-TRVIKADPAG 138 (395) T ss_pred HHHHHHHhhcCccccchHHHHHHHHHhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecC-Cc-eEEEEecCCc Confidence 000 00000 00 0 001111111234 889999999999999999999988874 44 467765443 Q ss_pred eeeeecCCCCCCC-CCccccceeEeecceeeccch---hhhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhh Q lcl|Aclame:pro 64 ELQVLAPGQSPNA-TPTQADKNQLVIDTTVIARNT---VAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGG 139 (402) Q Consensus 64 t~~~~~~G~~i~~-~~~~~~e~~itID~~lya~~~---IddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA 139 (402) .+....-+.++.. ..++.++.++.. .+++... -.-|++.. +| +.+.+.+.+++++++..|++++. +. 
T Consensus 139 ~a~w~~e~~~~~~~~~~~f~~i~l~~--~kl~~~~~iS~ell~ds~--~~-ie~~i~~~la~~ia~~~~~a~i~----G~ 209 (395) T protein:vir:95 139 QAVWGKVFGEIKGQLDAAFREENFTQ--YKLTCFVVLPDDLSTFGP--AW-IERFVRTQIQEAISVALESAIIN----GG 209 (395) T ss_pred ceEEeecccccCccccccceeeeece--eeEEEeecccHHHHhcch--hH-HHHHHHHHHHHHHHHHHhhheee----cc Confidence 3333332234433 245555555544 4433322 22333333 44 56889999999999999987741 10 Q ss_pred hhcccccccccccccccccc-----ccccCCcc---ccccHHHHHHHHHHHHHHHHh----hc-CCccCcEEEeChHHHH Q lcl|Aclame:pro 140 IANTKAERNKPRVKGHGFSI-----NVNVTESE---ALANPQYVMAAVEYALEQQLE----QE-VDISDVAIMMPWKFFN 206 (402) Q Consensus 140 ~~~a~~~~~~~~~~g~~~~~-----~v~~~~a~---~~~~~~~l~dai~~a~~~Lde----kd-VP~~gR~~VV~P~~y~ 206 (402) ... ...|. |..... ........ ...+...+++.+.++...|.- +. ........+++|..|. T Consensus 210 G~~----~~qP~--Gil~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~mn~~t~~ 283 (395) T protein:vir:95 210 GAA----KTQPV--GLMKDVNTNSGAVTDKASSGTLTFADADTTILELNDVLKNLSVDEKGKELKIDGKVALVVNPRDSW 283 (395) T ss_pred CCC----CcCce--eeeecccccccccccccccchhhhhhhHhhHHHHHHHHHhhccccccchhhhcCceEEEEcchhhh Confidence 000 00000 000000 00000001 111122234444443333310 11 1112234578888776 Q ss_pred HHhcccchhhcccccccCcccccceEEEEe--ccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHH Q lcl|Aclame:pro 207 ALRDADRIVDKTYTISQSGATINGFVLSSY--NCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSD 284 (402) Q Consensus 207 ~Ll~~~r~~n~d~~~~~~g~~~~G~V~~ia--G~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~ 284 (402) .+.. +.+..+ .+|...++. |++|++|+++|... . 
+-+||+.- +++-+ T Consensus 284 ~~~g--~~~~~~---------~~G~~~~~lg~g~~v~~~~~~p~~~------i-----------~fgdfs~y--~i~~r- 332 (395) T protein:vir:95 284 DVQA--RYTYLT---------ANGGFVTVLPYNVTIITSEFVPEGK------L-----------VAFVTDRY--NAVRG- 332 (395) T ss_pred hcCC--cceecc---------CCCcceeccCCcceEEEcCCCCCCc------E-----------EEEecccE--EEEEe- Confidence 5432 222111 124444554 67799999999522 0 12455441 12211 Q ss_pred HhhhhhhcccceeeccchhHHH---HHHHHHHHhcCcccccceEEEEEEeeccCccccc-cchhhHHHhhh Q lcl|Aclame:pro 285 ALLVGRTIEVTGDIFYEKKEKT---YYIDTFMAEGAIPDRWEAVSVVTTKRDATTGDAG-GPGDDHATVLA 351 (402) Q Consensus 285 Av~tv~~~dl~~e~~~d~~~~~---d~i~~~~a~Ga~vlRPeaa~vv~~~~~~t~~~a~-~~~~~~~~~~~ 351 (402) .++..+... +.++. ..+++..=+|.++++|++..+++++....+...+ ..++.-.-.+| T Consensus 333 -------~~~~i~~~~-~~~~~~d~~~f~~~~r~dg~~~~~~A~~~l~i~~~~~~~~~~~~~~~~~~~~~~ 395 (395) T protein:vir:95 333 -------GGLTVKKFD-QTLALEDAVLFTAKTFAYGQPDDNKASAVYDLKVASAPRRQTSAGGTTDGIAEA 395 (395) T ss_pred -------cceEEEecc-chhhhCCcEEEEEEEEECCEEeccccEEEEEeeccCCCCCCCCCCCCCCccccC Confidence 222223222 22222 2345555679999999999999987544433333 22222222222 No 173 >protein:vir:4159 Length: 315 # NCBI annotation: structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046968;genbank:gi:9630538;genbank:GeneID:1261712 Probab=98.09 E-value=1.7e-07 Score=57.63 Aligned_cols=296 Identities=11% Similarity=0.045 Sum_probs=140.6 Q ss_pred CCCCccccccccc------ccccHHHHHH--HHHhHHHHHHHHHHhhhcccceeeeccccceEEeeeccce--eeeeecC Q lcl|Aclame:pro 1 MSTPNTLTNVAVS------ASGEVDSLLI--EKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYLGET--ELQVLAP 70 (402) Q Consensus 1 Ms~~n~~t~~~~~------~~~d~~alfl--e~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~~iG~~--t~~~~~~ 70 (402) |=+.+........ +..|...-+| +++ .+.+...++.|.++.+.++.+..++.+.+++.+|-. ....... 
T Consensus 1 ~~~~~~~~~~~~~~~~k~~t~~d~~Gg~l~P~~~-~~~i~~~~e~s~~l~~~~vi~~~~~~~~~i~~~g~~~~~~~g~~~ 79 (315) T protein:vir:41 1 MLTIEDIRGGKPFEIVPKIDVPDLGRGVLSVDRF-GEFVKAVRDSAVIIPEARIDNALKSYEKDISRLSLVLDVGPGRDE 79 (315) T ss_pred CcccchhhcCChhhhhhhcCCcCCCCceechHHH-HHHHHHHHhhhhhhhhceeeeccccccccccccccCccccccccc Confidence 5544432211110 1112222223 555 456778888999999998866555566667776532 2211111 Q ss_pred CC---CCCCCCccccceeEeecceee-ccchhhhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhc-ccc Q lcl|Aclame:pro 71 GQ---SPNATPTQADKNQLVIDTTVI-ARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIAN-TKA 145 (402) Q Consensus 71 G~---~i~~~~~~~~e~~itID~~ly-a~~~IddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~-a~~ 145 (402) +. +.....+...+..|.+-.+.. ....=+-||++.-..| +.+.+..+.++++++..+..++. |-... .|- T Consensus 80 ~~~~~~~~~~~~~f~~~~l~~~~l~~~~~it~elL~D~~~~~~-~e~~l~~~~a~~~a~~~~~~~~n----Gdg~s~~p~ 154 (315) T protein:vir:41 80 TGQKLAPPESTAEVKTNTLYMREMVTKVVIHEDAIEDNIEGKA-FEQKIVTLLGEGISYVLEKYYLH----GDTSSSDPL 154 (315) T ss_pred ccCcCCCCCCccccceeeeceeeeeeeccccHHHHHhhhcccc-HHHHHHHHHHHHHHHHHHHHhhc----cCCcCcCcc Confidence 11 112223444444444443221 2233356676665556 88999999999999988876642 21111 110 Q ss_pred cccccccccccc--ccccccCCccccccHHHHHHHHHHHHHHHHhhcCCc--cCcEEEeChHHHHHHhcccchhhccccc Q lcl|Aclame:pro 146 ERNKPRVKGHGF--SINVNVTESEALANPQYVMAAVEYALEQQLEQEVDI--SDVAIMMPWKFFNALRDADRIVDKTYTI 221 (402) Q Consensus 146 ~~~~~~~~g~~~--~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~--~gR~~VV~P~~y~~Ll~~~r~~n~d~~~ 221 (402) ...+ +|... ........ .........++.|.++...|..+.--. .-+| ++++..+..|.+ +.+.+-.. 
T Consensus 155 ~~~~---~G~l~~a~~~~~~~~-~~~~a~~~~~d~l~~l~~sl~~~yr~~~~~~~~-imn~~t~~~~rk---lk~~~g~~ 226 (315) T protein:vir:41 155 LRMS---DGWLKLASEKLTESD-VDPEAEDWPMNLFDTMIESLPTPYRNNLPNMKF-YVTWDIYRAYRD---ALKGRETG 226 (315) T ss_pred cccc---ccceecccccccccc-cccccccccHHHHHHHHHhcChHHhhcCCceEE-EEcHHHHHHHHH---HhccCCCc Confidence 0000 11111 00000000 000111112455666666666544322 2245 899999887754 22221122 Q ss_pred ccCcccccceEEEEeccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccc Q lcl|Aclame:pro 222 SQSGATINGFVLSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYE 301 (402) Q Consensus 222 ~~~g~~~~G~V~~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d 301 (402) .....+..|.-..+.|.||+.++++|........ =.-++|.+.+- +--.++..+.+++ T Consensus 227 lw~~~~~~g~~~tl~G~PV~~~~~m~~~~~~~~~------------ilf~d~~nl~~----------~~~~~i~i~~~~~ 284 (315) T protein:vir:41 227 LGDQALTGANSILYDGRPVQYVPALEALNDGKSR------------ALFVVPTQLVY----------GFWRNIKVVPDYD 284 (315) T ss_pred cccchhhcCCCceecccceEecccccccCCCCcc------------EEEecccceEE----------EeccccEEEeeec Confidence 2334455677778999999999999864322111 12234443211 1123345555555 Q ss_pred hhHHHHHHHHHHHhcCcccccceEEEEEEee Q lcl|Aclame:pro 302 KKEKTYYIDTFMAEGAIPDRWEAVSVVTTKR 332 (402) Q Consensus 302 ~~~~~d~i~~~~a~Ga~vlRPeaa~vv~~~~ 332 (402) ..+....+....-.|.++.-++++++-..+. 
T Consensus 285 a~~~~~~~~~~~r~d~~~~~~~~~a~~~~~v 315 (315) T protein:vir:41 285 AEMRLTKYVASLRTDNHYEDEEGAVSATITV 315 (315) T ss_pred CCCCceEEEEEEEeceeEEeccceeEeeeeC Confidence 4332222222222355555455544444444 No 174 >protein:vir:80128 Length: 466 # NCBI annotation: Phage capsid protein # Family: family:all:635 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425603;genbank:gi:155042936;genbank:GeneID:5469556 Probab=97.90 E-value=9.3e-08 Score=59.14 Aligned_cols=298 Identities=9% Similarity=-0.034 Sum_probs=132.2 Q ss_pred CCCC---------------c-ccccccccccccHHHHHH-HHHhHHHHHHHHHHhhhcccceeeeccccceEEeeeccce Q lcl|Aclame:pro 1 MSTP---------------N-TLTNVAVSASGEVDSLLI-EKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYLGET 63 (402) Q Consensus 1 Ms~~---------------n-~~t~~~~~~~~d~~alfl-e~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~~iG~~ 63 (402) |... . .........+...-.+.+ +.+..++.......+.+++++++.++.+ .++++.-+.. T Consensus 123 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~vP~~~~~~i~~~l~~~~~l~~~~~v~~~~g--~~~~~~~~~~ 200 (466) T protein:vir:80 123 MPYEQRAALIARSEVKEFLAQVRTLAQQKRAVSGAELTIPDVMLELLRDNMHRYSKLISKVRLRPLKG--TARQNIAGAI 200 (466) T ss_pred hhhhhHHHHHHHHHHHHHHHHHHHHhhhhhhhccccccccHHHHHHHHHhhhhhhhhhhheeeeecCc--eeEeeeecCC Confidence 1000 0 000000000000011223 6777888888888888899998888764 3455554443 Q ss_pred -eeeeecCCCCCCCCCccccceeEeecceeeccch---hhhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhh Q lcl|Aclame:pro 64 -ELQVLAPGQSPNATPTQADKNQLVIDTTVIARNT---VAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGG 139 (402) Q Consensus 64 -t~~~~~~G~~i~~~~~~~~e~~itID~~lya~~~---IddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA 139 (402) .+.-..-|..++...+...+.++.+.. |+.+. -.-|++.. +| +-+.+...++++++...|+.|+. |. 
T Consensus 201 ~~a~wv~E~~~~~~~~~~f~~i~~~~~k--~~~~~~iS~ell~ds~--~~-l~~~i~~~la~~~~~~~~~ail~----G~ 271 (466) T protein:vir:80 201 PEGVWTEAVANLNELSLSFSQIEVDGYK--VGGFIPIPNSTLEDSD--LN-LADEILDAIGQAIGFALDKAILY----GT 271 (466) T ss_pred cceeecccccccccccccccceeeccee--eeeehhhhHHHHhcch--HH-HHHHHHHHHHHHHHHHHhhheee----cc Confidence 333344555565545666666655554 33332 23333332 34 67889999999999999987752 11 Q ss_pred hhcccccccccccccccc---ccccccCC-----ccccccHHHH----------HHHHHHHHHHHHhhcC-CccCc-EEE Q lcl|Aclame:pro 140 IANTKAERNKPRVKGHGF---SINVNVTE-----SEALANPQYV----------MAAVEYALEQQLEQEV-DISDV-AIM 199 (402) Q Consensus 140 ~~~a~~~~~~~~~~g~~~---~~~v~~~~-----a~~~~~~~~l----------~dai~~a~~~LdekdV-P~~gR-~~V 199 (402) ....| . |... ..+..... .....+...+ +..+.++...+.-... ...++ +.+ T Consensus 272 G~~~P------~--Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~ 343 (466) T protein:vir:80 272 GTKMP------V--GIVTRLAQTTQPPNWGTKAPAWTNLSTTNLLKIDPTGKSAEEFFSELVLKLSKARANYSNGMKFWA 343 (466) T ss_pred CCCCc------c--eeeecccccccccccccccccccccchhhhhhhhhhccchhhHHHHHHHHHHhhhccccCCceeEE Confidence 11000 0 1000 00000000 0000111111 1111122212211111 12333 346 Q ss_pred eChHHHHHHhcccchhhcccccccCcccccceEEEEeccEEEecCccccccCccccccccccCCccccceeeeccceeEE Q lcl|Aclame:pro 200 MPWKFFNALRDADRIVDKTYTISQSGATINGFVLSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAV 279 (402) Q Consensus 200 V~P~~y~~Ll~~~r~~n~d~~~~~~g~~~~G~V~~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al 279 (402) +++..|..|+.-.-..+.+..-...+ .++ ..+.|.+|+.|+++|.+. . +-++|..- . 
T Consensus 344 ~~~~~~~~l~~~~~~~~~~g~~~~~~--~~~--~~i~G~pvv~s~~~~~~~------~-----------~~g~~~~y--~ 400 (466) T protein:vir:80 344 MSSNTHAVLMSKAITFNSAGALVASL--NNT--MPIVGGDIVILDFIPDND------I-----------IGGYGSLY--L 400 (466) T ss_pred ecchhHHHhhcccccccCCccccccC--CCc--ccccccceeecCccCccc------e-----------eeeccccE--E Confidence 78888887764322211110000000 112 348999999999999633 1 11222210 1 Q ss_pred eecHHHhhhhhhcccceeeccchh--HHHHHHHHHHHhcCcccccceEEEEEEee---ccCccccccchhhHHHh Q lcl|Aclame:pro 280 LFTSDALLVGRTIEVTGDIFYEKK--EKTYYIDTFMAEGAIPDRWEAVSVVTTKR---DATTGDAGGPGDDHATV 349 (402) Q Consensus 280 ~fh~~Av~tv~~~dl~~e~~~d~~--~~~d~i~~~~a~Ga~vlRPeaa~vv~~~~---~~t~~~a~~~~~~~~~~ 349 (402) ++-+ .++..+...+.. +-...+++.+=++.++++|++.+.+++.. .++....++. +++-.| T Consensus 401 i~~r--------~~~~i~~~~~~~f~~d~~~~r~~~r~dg~~~~~~afv~~~~~~~~~~~~~~~~~~~-~~~~~~ 466 (466) T protein:vir:80 401 LAER--------ADIKLAQSEHVRFIEDQTVFKGTARYDGKPVFGEGFVAVNIANANPTTSITFAPDE-ANVPEV 466 (466) T ss_pred EEee--------cceEEEechhhhhhcCcEEEEEEEEEccEEeccCceEEEEecCCCcccceeeecCc-CcCCCC Confidence 1111 112222221111 11123455566899999999988887643 2222233322 222222 No 175 >protein:vir:98635 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039923;genbank:gi:126011098;genbank:GeneID:4818471 Probab=97.83 E-value=9.7e-07 Score=53.55 Aligned_cols=279 Identities=8% Similarity=-0.077 Sum_probs=132.7 Q ss_pred CCCCcc--cc---ccc-----ccccccHHHHHH-HHHhHHHHHHHHHHhhhcccceeeeccccceEEeee-ccceeeeee Q lcl|Aclame:pro 1 MSTPNT--LT---NVA-----VSASGEVDSLLI-EKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKY-LGETELQVL 68 (402) Q Consensus 1 Ms~~n~--~t---~~~-----~~~~~d~~alfl-e~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~~-iG~~t~~~~ 68 (402) +..... ++ +-. ..++.++-...| +.|..++++...+.+.++.++++.++. |+ +++++ .+..++... 
T Consensus 59 ~~~~~~~~lt~ee~~~~~~~~~~~~~~~gg~~vP~~~~~~I~~~l~~~s~i~~~~~v~~~~-~~-~~~~~~~~~~~a~w~ 136 (377) T protein:vir:98 59 DLRDKNRELTAEEIKFFNDIDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTS-LR-LKALTAETSGTAVWG 136 (377) T ss_pred HhccCCcccCHHHHHHHHHHHhccCCCCCccccCHHHHHHHHHHHHHhhhhhhheeeEecC-cc-eEEEEecCCcceeEe Confidence 111000 00 000 001111111233 889999999999999999999988875 44 45664 455555554 Q ss_pred cCCCCCCC-CCccccceeEeecceeeccchhh---hHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccc Q lcl|Aclame:pro 69 APGQSPNA-TPTQADKNQLVIDTTVIARNTVA---HIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTK 144 (402) Q Consensus 69 ~~G~~i~~-~~~~~~e~~itID~~lya~~~Id---dlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~ 144 (402) .-+.++.. ..+.. ..|++...+++.+..- -||+.. +| +-+.+.++++.++++..|++++. |.....| T Consensus 137 ~e~~~~~~~~~~~f--~~i~l~~~kl~a~~~is~elL~ds~--~~-ie~~i~~~la~~~a~~~~~a~i~----G~G~~qP 207 (377) T protein:vir:98 137 DIFGEIKGQLKQAF--KEQDFSQFKLTAFVVIPKDALKFGP--KW-IKQFITEQLKEAIAVALELAIVK----GDGLLQP 207 (377) T ss_pred ecccccCcccCccc--eeEeecceeEEeeecccHHhhhccH--hH-HHHHHHHHHHHHHHHHHhhceEe----ccCCCcc Confidence 44444433 22333 4555666665444322 344433 34 56889999999999999987751 1111111 Q ss_pred cccccccccccccccccc---cCCccccccHHHHHHHHH-----------HHHHH--HHh-h-cCCccCcE-EEeChHHH Q lcl|Aclame:pro 145 AERNKPRVKGHGFSINVN---VTESEALANPQYVMAAVE-----------YALEQ--QLE-Q-EVDISDVA-IMMPWKFF 205 (402) Q Consensus 145 ~~~~~~~~~g~~~~~~v~---~~~a~~~~~~~~l~dai~-----------~a~~~--Lde-k-dVP~~gR~-~VV~P~~y 205 (402) .-.- .....+.+.. ....+...+.+.+.+..+ -++.. +.. 
+ --...||+ .+++|..| T Consensus 208 ~Gil----~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~a~~~m~~~t~~~~~klkd~~G~~i~~~n~~~~ 283 (377) T protein:vir:98 208 VGLL----KDLSQPTVDQSTGRDITTYKTDKEAIADLSDLTPDNAPKKLVPVMKHLSVNDKKRPLKIAGQVKLILNPEDR 283 (377) T ss_pred eeee----ecccccccccccccccccccchhhhHhhhhhhchhHHHHHHHHHHHHHHHHHHhhhhccCCceEEEecccch Confidence 0000 0000000000 000011111112211111 11111 111 1 11235655 44677666 Q ss_pred HHHhcccchhhcccccccCcccccceEEEEecc--EEEecCccccccCccccccccccCCccccceeeeccceeEEeecH Q lcl|Aclame:pro 206 NALRDADRIVDKTYTISQSGATINGFVLSSYNC--PVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTS 283 (402) Q Consensus 206 ~~Ll~~~r~~n~d~~~~~~g~~~~G~V~~iaG~--~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~ 283 (402) +.++--.... .+ +|.-..+.|+ +|++|+++|... + .-+||+.- +++ T Consensus 284 ~~~~p~~~~~------~~-----~G~~~t~lg~p~~vv~s~~~p~~~--i---------------~fgdf~~Y--~i~-- 331 (377) T protein:vir:98 284 WALEAQFTSR------NQ-----FGEYVTVLPHGITILESLAVETGK--A---------------IAFVANRY--DAF-- 331 (377) T ss_pred hhcccccccc------CC-----CCccccccCCCceEEecCCCCccc--E---------------EEEEecce--eEE-- Confidence 6554211111 11 2333355654 488999998532 1 11333331 111 Q ss_pred HHhhhhhhcccceeeccchhHHH---HHHHHHHHhcCcccccceEEEEEEeec Q lcl|Aclame:pro 284 DALLVGRTIEVTGDIFYEKKEKT---YYIDTFMAEGAIPDRWEAVSVVTTKRD 333 (402) Q Consensus 284 ~Av~tv~~~dl~~e~~~d~~~~~---d~i~~~~a~Ga~vlRPeaa~vv~~~~~ 333 (402) .-.++..+.. ++.+|. 
..+++++=+|.+++.|++.+++.+..| T Consensus 332 ------~r~~~~i~~~-~~~~~~~d~~~f~~~~r~dg~~~~~~a~~vl~i~~~ 377 (377) T protein:vir:98 332 ------MATASTIEEY-DQTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) T ss_pred ------eecceEEEee-chhhhhcCceEEEEEEEEcCEEeccCcEEEEEEecC Confidence 1123333333 233332 345666678999999999999999998 No 176 >protein:vir:78350 Length: 383 # NCBI annotation: Cps # Family: family:all:635 # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468644;genbank:gi:157325222;genbank:GeneID:5601696 Probab=97.82 E-value=1.9e-06 Score=51.97 Aligned_cols=291 Identities=10% Similarity=-0.009 Sum_probs=132.9 Q ss_pred CCCC--cccc----cc--c-ccccccHH-HHHHHHHhHHHHHHHHHHhhhcccceeeeccccceEEeeeccce-eeeeec Q lcl|Aclame:pro 1 MSTP--NTLT----NV--A-VSASGEVD-SLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYLGET-ELQVLA 69 (402) Q Consensus 1 Ms~~--n~~t----~~--~-~~~~~d~~-alfle~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~~iG~~-t~~~~~ 69 (402) +.-. ..++ +. . ..+++.+- -+.=+.|..++++...+.|.++.++++.++. |+ .+|++.... .+.... T Consensus 64 ~~~~g~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~l~~~~~v~~~~-~~-~~i~~~~~~~~a~w~~ 141 (383) T protein:vir:78 64 SASRTDKNITNEEIKFFNDINKEVGYKEETLLPQTVVDEIFEDLTTEHPFLASIGMRTTG-LR-TKFLKSETSGVAVWGK 141 (383) T ss_pred HhcCChhhhhHHHHHHHHHHhccCCCCCccccCHHHHHHHHHHHHhhccceeeeeeEecC-Cc-eEEEEEcCCcceEEee Confidence 0000 0000 00 0 00111111 2333889999999999999999999988874 45 467766444 343333 Q ss_pred CCCCCCC-CCccccceeEeecceeeccchhh--hHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccc Q lcl|Aclame:pro 70 PGQSPNA-TPTQADKNQLVIDTTVIARNTVA--HIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAE 146 (402) Q Consensus 70 ~G~~i~~-~~~~~~e~~itID~~lya~~~Id--dlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~ 146 (402) -+.++.. ..++..+.+|..-.+ +.-..|. -||+.. +| +.+.+.++++.++++..|++++. 
+.....|.- T Consensus 142 e~~~~~~~~~~~f~~i~l~~~kl-~~~i~is~ell~Ds~--~~-ie~~i~~~l~~~~a~~~~~a~i~----G~G~~qP~G 213 (383) T protein:vir:78 142 IFGEIKGQLDATFSDEESIQNKL-TAFVVVPKDLEKFGP--AW-VKRFVVTQIEEAFAVALESAYIV----GDGNDKPIG 213 (383) T ss_pred cccccccccCcceeeEeecceee-EeeccchHHHhhccH--HH-HHHHHHHHHHHHHHHHHhhheEe----ccCCCCcee Confidence 3334433 244555555555322 3333332 244333 44 56889999999999999988741 111111100 Q ss_pred cccccccccc--cccccccCCc---cccccHHHHHHHHHHHHHH--HHhhcCC--ccC-cEEEeChHHHHHHhcccchhh Q lcl|Aclame:pro 147 RNKPRVKGHG--FSINVNVTES---EALANPQYVMAAVEYALEQ--QLEQEVD--ISD-VAIMMPWKFFNALRDADRIVD 216 (402) Q Consensus 147 ~~~~~~~g~~--~~~~v~~~~a---~~~~~~~~l~dai~~a~~~--LdekdVP--~~g-R~~VV~P~~y~~Ll~~~r~~n 216 (402) .-. ...+.+ .........+ ....+...+++.+...... +..+.-+ ..+ ...+++|..|+.++.-.... T Consensus 214 il~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~- 291 (383) T protein:vir:78 214 LNR-KVGKGSTVVDGVYAEKAATGTLTFANPKTTVNELTDVYKYHSVKENGHPLNVAGKVTLLVNPTDAWDVKKQYTSL- 291 (383) T ss_pred eee-ccCCcccccccccccccccchhhhhhhHHHHHHHHHHHhccchhcccchhhhcCceEEEEcCcchhhhccchhcc- Confidence 000 000000 0000000000 0111222233332222111 1111111 111 23467776555443211111 Q ss_pred cccccccCcccccceEEEEe--ccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhccc Q lcl|Aclame:pro 217 KTYTISQSGATINGFVLSSY--NCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEV 294 (402) Q Consensus 217 ~d~~~~~~g~~~~G~V~~ia--G~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl 294 (402) .+ +|.-..+. |++|++|+++|... + .-+||+. . 
+++ .-.++ T Consensus 292 -----~~-----~G~~~t~l~~~~~iv~s~~~p~~~--i---------------ifgdfs~-Y-~i~--------~r~~~ 334 (383) T protein:vir:78 292 -----NA-----NGVYVTALPFNLNIIESLFVPEKK--A---------------ISYVAER-Y-DAL--------IGGPL 334 (383) T ss_pred -----CC-----CCceeeecCCCceEEecCCCCccc--E---------------EEeeccc-e-EEE--------ecccc Confidence 11 23333454 55689999998532 0 1133333 1 111 12233 Q ss_pred ceeeccchhHHH---HHHHHHHHhcCcccccceEEEEEEeeccCcccccc Q lcl|Aclame:pro 295 TGDIFYEKKEKT---YYIDTFMAEGAIPDRWEAVSVVTTKRDATTGDAGG 341 (402) Q Consensus 295 ~~e~~~d~~~~~---d~i~~~~a~Ga~vlRPeaa~vv~~~~~~t~~~a~~ 341 (402) ..+.. +..+|. ..+++++=++.++++|++.++++.+....+.+-.. T Consensus 335 ~i~~~-~~~~f~~d~~~f~~~~r~dG~~~~~~A~~vl~~~~~~~~~~~~~ 383 (383) T protein:vir:78 335 DIGTY-DQTLAIEDLNLYAAKQFAYGKAKDDKAAAVWTLNINPAEQTPEG 383 (383) T ss_pred eEEec-chhhhhcCceEEEEEEEEcCEEecCCeEEEEEEEecCCCCCCCC Confidence 34433 344444 35666777899999999999988875443333222 No 177 >protein:vir:79928 Length: 393 # NCBI annotation: major head protein # Family: family:all:30335 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429616;genbank:gi:156564106;genbank:GeneID:5525693 Probab=97.30 E-value=9.2e-06 Score=48.19 Aligned_cols=313 Identities=14% Similarity=0.097 Sum_probs=168.0 Q ss_pred CCCCccccccccc---ccccHHHHHH-HHHhHHHHHHHHHHhhhcccceeeeccccceEEeeeccceeeeeecCCCCCCC Q lcl|Aclame:pro 1 MSTPNTLTNVAVS---ASGEVDSLLI-EKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYLGETELQVLAPGQSPNA 76 (402) Q Consensus 1 Ms~~n~~t~~~~~---~~~d~~alfl-e~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~~iG~~t~~~~~~G~~i~~ 76 (402) |.-..+....+.. .+++ -++.| +.-++-|.++-.-=.+-..++..-+++.|.+-.|+.+|-+-+....-|+++.. 
T Consensus 59 m~G~~p~~eV~~~e~mtt~~-a~IliP~vis~v~~Eaaepl~~~~kl~qk~~L~~Grsm~F~~~g~~Ra~~IgEGgE~~~ 137 (393) T protein:vir:79 59 MEGETPTNEVNLREFMATPS-AQILIPRVIVGTMREAAEPLYIGTKMLQKIRLKSGQSMIFPSIGIMRAYDVAEGQEIPE 137 (393) T ss_pred hcCCCchhheehhhhhcCCC-cceechhhhhhhhhhcccchhHHHHHHHHHhhhcCcceeccchheeeeccccccccccc Confidence 4422211111111 1112 22333 66666666543332222233333346789999999999999999999998887 Q ss_pred CCcc-ccceeEeecceeeccchhhhHHHhh--cCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccccccc Q lcl|Aclame:pro 77 TPTQ-ADKNQLVIDTTVIARNTVAHIHDVQ--GDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVK 153 (402) Q Consensus 77 ~~~~-~~e~~itID~~lya~~~IddlDe~q--~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~ 153 (402) ..++ .+.-.+.+.+-++ ...|.-=||.- +.+| +=+-+-+.+|.+|+++-|+-++++.-+=+..+--.. ..... T Consensus 138 ~sld~~T~dsv~~~~gK~-G~~Ia~SqEmIsDSg~D-vin~~l~aA~RaMaRkKee~a~n~fk~~ghtvfDa~--st~t~ 213 (393) T protein:vir:79 138 DSIDWQTHESPEIRVGKS-GIRLRFTDEMISDSQWD-LMSMMIKQAGRAMGRHKEQKAYHQFRSHGHTVFDNY--STNKL 213 (393) T ss_pred cchhhhcCCceeEEechh-hhhhhhHHHHhhcchHH-HHHHHHHHHHHHHHhhhHHHHHhhhhcccceeeecc--ccCcc Confidence 7765 4444555555443 44455555554 5677 556667899999999999999999865333111111 11111 Q ss_pred ccccccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccchhhc------ccccccCcc- Q lcl|Aclame:pro 154 GHGFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDK------TYTISQSGA- 226 (402) Q Consensus 154 g~~~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~------d~~~~~~g~- 226 (402) ++..|-... +-.......+.+.|.++. .+.... .+-++++.|--|+..-+....-.. 
+|..-.-.+ T Consensus 214 ahptGr~~~-~~qNGTlSleDllDm~~a---v~~~hy---t~svi~MHPLAWnv~AKna~me~~~~na~gN~~~~~~~ts 286 (393) T protein:vir:79 214 AHTTGLDKN-GVQNDTFSAEDFLDLIIA---VMANEY---TPSDLMMHPLAWTVFAKNELMGSLQANPYGNYPAKGAPSS 286 (393) T ss_pred ceeecCCcc-ccccccccHHHHHHHHHH---HhcccC---CcceEEEcCchhhhhhhhhhhcceeeccccccCccccchh Confidence 111111100 111222333444444332 233333 446788888888887765332100 111000000 Q ss_pred ------cccceEEEEeccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeecc Q lcl|Aclame:pro 227 ------TINGFVLSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFY 300 (402) Q Consensus 227 ------~~~G~V~~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~ 300 (402) ..+|++ -..+.|+-|+-+|.... ..+|+|..=-.||++++.-++ ++++|.|. T Consensus 287 ~algp~~i~~~~--~~nlnv~~sPfvp~d~k------------~~rFd~~~Vd~NnvgvlLV~D--------~i~tdq~d 344 (393) T protein:vir:79 287 MALGPDSIQGRL--PFNFNVNLSPFIPLDKK------------SRRFDVYAVDRNNVGVLLVRD--------DLKTDQWD 344 (393) T ss_pred hhhchhhhcccc--ccceeEEEecccccccc------------cceeeEEEeecCCceEEEEec--------Ccceeccc Confidence 011111 13488888888885432 346677766678888776333 78899999 Q ss_pred chhHHHHHHHHHHHhcCcccccceEEEEEEeeccCccccccc--hhhHHH Q lcl|Aclame:pro 301 EKKEKTYYIDTFMAEGAIPDRWEAVSVVTTKRDATTGDAGGP--GDDHAT 348 (402) Q Consensus 301 d~~~~~d~i~~~~a~Ga~vlRPeaa~vv~~~~~~t~~~a~~~--~~~~~~ 348 (402) |+-|--.-|+-+--||.|||.-.-+.++ ++-=....+-+++ .-.++- T Consensus 345 dk~rdiq~iKl~ERYG~gvLn~gkaiav-akNI~~~k~y~~P~~~~~~~~ 393 (393) T protein:vir:79 345 EKARGLQNIKMIERYGIGILNEGKAIAV-AKNISMDKSYAEPMLIKNVGN 393 (393) T ss_pred cccccceeeeeeeeeceeeeeCCceEEE-EecceeecccccchhhhccCC Confidence 9988777788888899999988755443 3221111111111 111111 No 178 >protein:vir:4074 Length: 480 # NCBI annotation: major capsid (head) protein # Family: family:all:11745 # MgeID: mge:85 # MgeName: c2 # Cross-refs: genbank:acc:NP_043553;genbank:gi:9628687;genbank:GeneID:1261180 Probab=97.00 E-value=6.9e-05 Score=43.38 Aligned_cols=273 Identities=9% 
Similarity=0.066 Sum_probs=110.1 Q ss_pred CCCCcccccccccccccHHHHHHHHHh----HHHHHHHHH--------------Hhhhcccceeeecccc-ceE-----E Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDSLLIEKFN----GKVNEQYLK--------------GENILSYFDVQTVTGT-NTV-----S 56 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~alfle~f~----geV~t~f~~--------------~sv~~~~~~~rti~~G-ksv-----~ 56 (402) |........+. +.....++.++ +.-...|.+ .++++.+.++..+... .+. . T Consensus 171 ~~~~~~~~~~~-----~~~~~e~r~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 245 (480) T protein:vir:40 171 REASIPSEKPE-----DAERKFMRELGSKMAEMPEQGFLREFANGADLNVVNSLGSITSKYARKSGIYDGAMKARFQGLT 245 (480) T ss_pred hhhhccccchh-----hhhhHHHHHHHHHhccchhhhhhhhhhhhccccccccccccccchhhheeechhhhhhhhhcce Confidence 22111111111 11111222221 111111211 1122222222111110 000 0 Q ss_pred eeeccce----eeeeecCCCCCCCCCccccceeEeecce---ee--ccchhhhHHHhhcCccchhHHHHHHHHHHHHHHH Q lcl|Aclame:pro 57 NKYLGET----ELQVLAPGQSPNATPTQADKNQLVIDTT---VI--ARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLE 127 (402) Q Consensus 57 f~~iG~~----t~~~~~~G~~i~~~~~~~~e~~itID~~---ly--a~~~IddlDe~q~~~D~vrse~s~~~G~aLA~~~ 127 (402) +...|.. ....+..+........ .+.. ..++. +| .......+|++. + +.+.+..++++.|+++. T Consensus 246 ~~~~g~~~~~~~~e~~~~~~~~~~~~~--~~~~-~~~~~v~~l~~~~k~t~~lLDDa~---~-l~~~i~~~l~~~~~~~e 318 (480) T protein:vir:40 246 LAEDGVDDTFISGTFKAGTDKNKSQTA--TKRS-LRPQMAEAYLQMDKATVRGVNDSG---A-LSEYVMSEMVNRVIQKV 318 (480) T ss_pred eeeccccceeeeeeeeccccccccccc--ccch-hhHHHHHHHHHhHHHHHHHhhhhH---H-HHHHHHHHHHHHHHHHH Confidence 1111111 0111122211111100 0000 11111 11 123334445443 3 56788889999999998 Q ss_pred HHHHHHHHHhhhhhccccccccccccccccccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHH Q lcl|Aclame:pro 128 DQMAIQQMLLGGIANTKAERNKPRVKGHGFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNA 207 (402) Q Consensus 128 Dq~i~~~l~kaA~~~a~~~~~~~~~~g~~~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~ 207 (402) ++.++. +. +....+. .+.....++.+...+.+.+++.|+.+..+--.++.| ++||+|..|.. 
T Consensus 319 e~a~l~----G~---------g~g~~~~-~g~~~~~~~~~~~~~~~d~id~L~~al~~~y~~~a~----~~vmn~~t~~~ 380 (480) T protein:vir:40 319 EYNMIL----GS---------VDGSNGF-YGLKTATDGWTKQIEYTDLFEGITDAVAECSISDAI----TIVMSPQTFAE 380 (480) T ss_pred HHHhhc----cC---------CCCcccc-ccceeecccccccchhHHHHHHHHHhhhHHhhCCCC----EEEECHHHHHH Confidence 877642 10 0000011 111111112222233445555554443322222211 57899999998 Q ss_pred Hhc--ccchhhcccccccCcccccceEEEEeccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHH Q lcl|Aclame:pro 208 LRD--ADRIVDKTYTISQSGATINGFVLSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDA 285 (402) Q Consensus 208 Ll~--~~r~~n~d~~~~~~g~~~~G~V~~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~A 285 (402) |.+ |.. ..|- =.+....|...+++|.|||++........ +. .++++.| .+++.+ T Consensus 381 I~klKD~~---G~Yi--~q~~~~~~~~~~llG~pvv~~~~~~~~~~-----~~--~~~~~~~----------~~~~d~-- 436 (480) T protein:vir:40 381 LRKAKGTD---GHSR--FNELATKEQIAQSFGAVNLETRVWMPKDE-----VA--VYNHDEY----------VLIGDL-- 436 (480) T ss_pred HHHhhcCC---CCee--ccCcccccCcceecccceeeeeccccCCc-----ce--eeeCCcc----------EEEEec-- Confidence 854 322 2221 11223467778999999887754332111 01 1122222 222222 Q ss_pred hhhhhhcccceeec--cchhHHHHHHHHHHHhcCcccccceEEEEEEeeccCccc Q lcl|Aclame:pro 286 LLVGRTIEVTGDIF--YEKKEKTYYIDTFMAEGAIPDRWEAVSVVTTKRDATTGD 338 (402) Q Consensus 286 v~tv~~~dl~~e~~--~d~~~~~d~i~~~~a~Ga~vlRPeaa~vv~~~~~~t~~~ 338 (402) .++.+ ++.+.-...+......|-.+.||+++..++.+++- |+ T Consensus 437 ---------~~~~~~~~~~~~~~~~~~~e~~v~g~~~~~~~~~~~~~~~~~--~~ 480 (480) T protein:vir:40 437 ---------NVENYNDFDLRYNVEQWLSETLVGGSIRGKNRSAYLKKKGSL--GV 480 (480) T ss_pred ---------ccceecccccccchhhhhhhhhhceeeEccccEEEEEeccCc--CC Confidence 12222 23344555667777889999999999999998755 33 No 179 >protein:vir:80446 Length: 367 # NCBI annotation: BcepGomrgp07 # Family: family:all:1522 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210227;genbank:gi:146329919;genbank:GeneID:5123555 Probab=96.99 E-value=0.00012 Score=42.17 Aligned_cols=318 
Identities=12% Similarity=0.028 Sum_probs=149.4 Q ss_pred CCCCcccccccccccccHHHHHH-HHHhHHHHHHHHHHhhhc-c-cce----eee--ccccceEEeeeccceeeeeecCC Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDSLLI-EKFNGKVNEQYLKGENIL-S-YFD----VQT--VTGTNTVSNKYLGETELQVLAPG 71 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~alfl-e~f~geV~t~f~~~sv~~-~-~~~----~rt--i~~Gksv~f~~iG~~t~~~~~~G 71 (402) |...|.-|+ -.++|+ |+|.-.|.+...+.+-|. + .+. +.. -.+|+.+.+|..+.+.-..-..+ T Consensus 1 M~~~~~~T~--------l~Dii~pEvF~~Yv~~~~~e~~~l~qSGiv~~d~~l~~~~~~gG~~v~iPf~~~L~g~~~n~~ 72 (367) T protein:vir:80 1 MPDFNNQVR--------LVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYG 72 (367) T ss_pred Ccchhhhhh--------hhhccchhhhhHHHhhhhhhhhhhhhcceeecCHHHHHHhhcCCCEEEeeeeccCCCCccccC Confidence 665543222 234666 777777777666655443 1 111 111 15899999999988754222111 Q ss_pred CC-----CCCCCccccceeEeecceeeccchhhhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccc Q lcl|Aclame:pro 72 QS-----PNATPTQADKNQLVIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAE 146 (402) Q Consensus 72 ~~-----i~~~~~~~~e~~itID~~lya~~~IddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~ 146 (402) .. +....+.+.+..=+| ...--.+...||-...+--| .-.++..+.+..-.+ .||..+....++.....-.. T Consensus 73 ~d~~~~~~t~~kittg~~~a~v-~~r~kaw~~~Dla~~lsG~d-pm~~Ia~qva~yW~r-~~q~~Lla~L~Gvf~~~~a~ 149 (367) T protein:vir:80 73 SDNPNVEAPIDGLGSGEMKTTK-TWLNKAYGAMDLTAELAGSN-PMTRIRNRFGVYWTR-QWQRRIIAMAVGVYKSNLAG 149 (367) T ss_pred CCCCcccccccccccchheeee-ehhcccchhhhHHHHhhCch-HHHHHHHHHHHHhhh-hhHHHHHHHHHHhhcccccc Confidence 11 111222222211111 01123455677777776655 334455555554444 45666666666554321110 Q ss_pred cc---------ccccccccccccc----ccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccc Q lcl|Aclame:pro 147 RN---------KPRVKGHGFSINV----NVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADR 213 (402) Q Consensus 147 ~~---------~~~~~g~~~~~~v----~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r 213 (402) .. ++...+.....+. 
..+.+....+ .+.|.+|...|-+ -...=-.++|.+..|..|.+- + T Consensus 150 ~~~~~~~~~~~~a~~~~~~~~~~~Dis~~t~~~~~~~s----~~~~~~A~~~lGD--~~~~l~~i~mHS~V~~~L~~~-~ 222 (367) T protein:vir:80 150 NFATIKTRGRVPAEVLGTAGDMVIDISGQTNPADAVFN----REAFVDAAFTMGD--HVGSIAAIAVHSMVYKRMTNN-D 222 (367) T ss_pred chhhhhhhhccccccccccCceeeeeeccCCCccceec----HHHHHHHHHHhcc--ccccccEEEEchHHHHHHHhc-c Confidence 00 0000001111111 1111122233 3456667666644 233446789999999997765 4 Q ss_pred hhhcccccccCcccccceEEEEeccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcc Q lcl|Aclame:pro 214 IVDKTYTISQSGATINGFVLSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIE 293 (402) Q Consensus 214 ~~n~d~~~~~~g~~~~G~V~~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~d 293 (402) ++ +|--..++ +..+...+|.+|+.+..+|....+.. + .|+ -.+|-+=|++..+..+ T Consensus 223 li--~~i~~sd~---~~~i~ty~G~~VIvDD~~Pv~~~~a~---------~---~yt-------tYlfg~GAi~~~~~~~ 278 (367) T protein:vir:80 223 EI--EFIPDSKG---QLTIPTYMGKVVIVDDGMPVFGTGAD---------K---TYL-------SILFGGAAFGYADGAP 278 (367) T ss_pred cc--ccccCCCC---ccccceecceeEEEeCCCcccccCCC---------c---eEE-------EEEEecceeeecccCC Confidence 44 33222222 45789999999999999997542111 1 121 2455555666555443 Q ss_pred c-ceeeccchhHH----HHHHH-----HHHHhcCcccccceEEEEEE----eeccCccccccchhhHH------Hhhhcc Q lcl|Aclame:pro 294 V-TGDIFYEKKEK----TYYID-----TFMAEGAIPDRWEAVSVVTT----KRDATTGDAGGPGDDHA------TVLARA 353 (402) Q Consensus 294 l-~~e~~~d~~~~----~d~i~-----~~~a~Ga~vlRPeaa~vv~~----~~~~t~~~a~~~~~~~~------~~~~~~ 353 (402) . ..|..||+..+ .|++. .+|.+|...... 
.++.-+ ..+.++...+.+-++++ .|-.+- T Consensus 279 ~~~~E~~Rd~~~~~~gG~d~L~~Rr~~~~hP~G~s~~~~--~v~~~~~~~~~~~~~~~~~sPt~~eLa~~~NW~~v~d~K 356 (367) T protein:vir:80 279 QVPVAVGRRELRGNGSGLEYILERKEWIVHPGGFNWLDA--DVTIPDNTGSPSGITSGPPAITLANLANPDNWERVTYRK 356 (367) T ss_pred ccceecccchhhhcCCceEEEEeeeeEEeecceeeeccc--ccccccccccccccccccCCCChHHhcCCcccccccchh Confidence 2 25777887753 24444 345666655422 111101 01111112111112222 121111 Q ss_pred cceE--EEeec Q lcl|Aclame:pro 354 QRKA--VYVKT 362 (402) Q Consensus 354 ~~~~--~~~~~ 362 (402) +=.. .++|| T Consensus 357 ~I~iv~~it~g 367 (367) T protein:vir:80 357 NVPMAFLVTKG 367 (367) T ss_pred hcceEEEEecC Confidence 1122 34566 No 180 >protein:vir:97397 Length: 517 # NCBI annotation: major capsid protein # Family: family:all:11745 # MgeID: mge:1675 # MgeName: Q54 # Cross-refs: genbank:acc:YP_762590;genbank:gi:115304291;genbank:GeneID:5130600 Probab=95.92 E-value=0.0013 Score=36.49 Aligned_cols=274 Identities=9% Similarity=0.010 Sum_probs=108.8 Q ss_pred CCCC------cccccccccccccHHHHHHHHHhHHHHHHHHHHhhhcccceeeeccccceEEeee-ccceeeeeecCCCC Q lcl|Aclame:pro 1 MSTP------NTLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKY-LGETELQVLAPGQS 73 (402) Q Consensus 1 Ms~~------n~~t~~~~~~~~d~~alfle~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~~-iG~~t~~~~~~G~~ 73 (402) ++.. ......+..+-.. -..+...+.+.+...+.+++.+++.+|. +..++. .....+..+.-|+. 
T Consensus 229 ~~~~~~~~~~~~~~~~~~~~~~~-----p~~~~~~i~~~~~~~~~i~~~~~~~~i~---~~~~~~~~~~~~a~~~~eG~~ 300 (517) T protein:vir:97 229 LTKDPKAAWTAELKERGISGMPA-----PAGILKRIQDAVNDEGSLLPFIRHENLP---TLVVGGDNALTQGTGHTTGTD 300 (517) T ss_pred ccccccceeeeeccccccccccc-----chHHHHHHHHhhhhhccceeeeeecccc---ceeeecccccceeeeeecCCc Confidence 1100 0000111111000 1233344555566666666666655543 233332 22334555666666 Q ss_pred CCCCCccccceeEeecceeeccc-hhh--hHHHhhcC-ccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccc Q lcl|Aclame:pro 74 PNATPTQADKNQLVIDTTVIARN-TVA--HIHDVQGD-IDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNK 149 (402) Q Consensus 74 i~~~~~~~~e~~itID~~lya~~-~Id--dlDe~q~~-~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~ 149 (402) .+...+...++++.+-. +++. .+. .|++...+ ...+.+.+..++.+.|+++.|+.++ .+... + T Consensus 301 kp~s~~tf~~~~~~~~~--ia~~~~~S~qll~Ds~~dd~~~l~s~i~~~l~~~l~~~ee~a~l----~GdGt-------g 367 (517) T protein:vir:97 301 KTESNITLQTRVLTPQY--VYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAII----MGGVT-------G 367 (517) T ss_pred ccccccceeeEEeeHhh--hhhhhhhhHHHHHHhhhccHHHHHHHHHHHHHHHHHHHHHHHHh----cccCC-------C Confidence 55555666666665533 2221 111 23222221 1125677888899999999998774 11110 0 Q ss_pred ccccccccccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhc--c--cchhhcccccccCc Q lcl|Aclame:pro 150 PRVKGHGFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRD--A--DRIVDKTYTISQSG 225 (402) Q Consensus 150 ~~~~g~~~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~--~--~r~~n~d~~~~~~g 225 (402) ....++. .......+........+.|.+..+...+. +..+-.+||+|..|..|.+ | .|.+ |. . 
T Consensus 368 ~~~~gi~--~~a~~~~~~~~~~~~~~~d~i~~l~~a~~----~a~~a~~vmn~~t~~~I~klKD~~G~Yl---~~----~ 434 (517) T protein:vir:97 368 VSETQIY--PVVGDAWATNVTGTTNIQELLEKLSVATP----KAADSTLVIHRNDLAAIRFLKDKNGNYV---FP----V 434 (517) T ss_pred ccccccc--ccccccccccccccchHHHHHHHHHHHhh----hccCCEEEECHHHHHHHHHhhcCCCCee---cc----C Confidence 0100110 00001111111111222332222222222 2223345899999999854 3 2333 11 1 Q ss_pred ccccceEEEEeccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccchhHH Q lcl|Aclame:pro 226 ATINGFVLSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKEK 305 (402) Q Consensus 226 ~~~~G~V~~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~~~ 305 (402) ...++.+..++|+.-+.+ .++.+.. . ..+...|...+.+. .. + .+.| +..+- T Consensus 435 ~~~~~~~~~l~G~~~~~~-~~~~~~~------~--~~~~~~y~i~~~~g-~~--~---------------~~~f-d~~~n 486 (517) T protein:vir:97 435 GVSNQTIATHFGFNRLVQ-SVAVDEK------T--AVSLSGYVTNGSRG-ME--F---------------EQGT-ILVEN 486 (517) T ss_pred cCCcccccccCCcccccc-ccccCce------e--EeeccccEEEeecc-ee--e---------------eeee-ecccC Confidence 123455566667422211 1111110 0 01111222211110 00 0 0111 11111 Q ss_pred HHHHHHHHHhcCcccccceEEEEEEeeccCccccc Q lcl|Aclame:pro 306 TYYIDTFMAEGAIPDRWEAVSVVTTKRDATTGDAG 340 (402) Q Consensus 306 ~d~i~~~~a~Ga~vlRPeaa~vv~~~~~~t~~~a~ 340 (402) .+.+...+..|-.|+.||+++-.+.+..+ ++ T Consensus 487 ~~~f~~~~~~~g~i~~~~r~a~~~~~p~~----~~ 517 (517) T protein:vir:97 487 NKEYLFEMPISGSLEYKGTTAYGTYTPPV----AG 517 (517) T ss_pred ceeEeeeeeeccccccccceEEEEEcCCC----CC Confidence 12222334466688999988765444333 33 No 181 >protein:vir:80068 Length: 301 # NCBI annotation: gp8 # Family: family:all:463 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468712;genbank:gi:157325292;genbank:GeneID:5601759 Probab=95.58 E-value=0.0018 Score=35.62 Aligned_cols=287 Identities=8% Similarity=0.032 Sum_probs=130.4 Q ss_pred CCCCcccccccccccccHHHHHHHHHhHHHHHHHHHHhhhcccceeee-cc-ccceEEeeeccce-eeeeecCC-CCCCC Q lcl|Aclame:pro 1 
MSTPNTLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDVQT-VT-GTNTVSNKYLGET-ELQVLAPG-QSPNA 76 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~alfle~f~geV~t~f~~~sv~~~~~~~rt-i~-~Gksv~f~~iG~~-t~~~~~~G-~~i~~ 76 (402) |-+. ..|.-.+-+++...-+|.+.....-+.+.++.+++ +- +..++.|+..-.. .++-+.-+ .++.. T Consensus 1 ~~~~---------~~g~f~~~~l~~id~~v~e~~~~~l~~r~l~~v~~~~~~~~~~~~~~~~~~~G~~~~~~~~~~dip~ 71 (301) T protein:vir:80 1 MQGK---------ITATIEARDLQAIDNVIYEPKQEELTARSVFPQKFDVNEGAESYSFDVMTRSGAAKIIANGADDLPL 71 (301) T ss_pred CCcc---------ccchhhHHHHHHHHHHHHHhhhhhhhhhhhcccccCCCCceEEEEEeeeccceeEEEecCccccccc Confidence 1111 11222344455555666666666777788887775 32 4566666644222 22223222 22333 Q ss_pred CCccccceeEeeccee-eccchhhhHHHhh-cCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccc-cccc Q lcl|Aclame:pro 77 TPTQADKNQLVIDTTV-IARNTVAHIHDVQ-GDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNK-PRVK 153 (402) Q Consensus 77 ~~~~~~e~~itID~~l-ya~~~IddlDe~q-~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~-~~~~ 153 (402) .....++....|=..- =+...+.+|..++ ...+ +...-...+..+++++.|+.+|.=-.+ ....+ -+.+ T Consensus 72 ~~~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~-l~~~k~~aa~~~~~~~~n~~~f~G~~~-------~g~~GLlN~p 143 (301) T protein:vir:80 72 VDVDMVRKSVPIYSIGIGLSYTIQDLRAARMQGTT-VDAAKATTVRRAIAEKENSIAFRGEKK-------YAIKGAFEAT 143 (301) T ss_pred ccccceeEEEEEEEEEeeeeecHHHHHHHHHhCCC-hHHHHHHHHHHHHHHhhceEEeeeccc-------ccceeeecCC Confidence 3344445555444321 1344466777664 4555 556666777889999999877632111 00011 1111 Q ss_pred cccccccccc-C-Ccc--ccccHHHHHHHHHHHHHHHHhhcCCccC-cEEEeChHHHHHHhcccchhhcccccccCcccc Q lcl|Aclame:pro 154 GHGFSINVNV-T-ESE--ALANPQYVMAAVEYALEQQLEQEVDISD-VAIMMPWKFFNALRDADRIVDKTYTISQSGATI 228 (402) Q Consensus 154 g~~~~~~v~~-~-~a~--~~~~~~~l~dai~~a~~~LdekdVP~~g-R~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~~~ 228 (402) +.....+... . +.. ...+++.+++-|..+..+|.++.-=..+ -.++|||+.|..|.. +.++-.+ +-+.. 
T Consensus 144 ~~~~~~~~~~~~~~~~~w~~~t~~ei~~di~~~~~~l~~~s~g~~~p~~L~L~p~~~~~L~~--~~~~~~~----~~tvl 217 (301) T protein:vir:80 144 GIQIDVSPTTGVGNVSKWEKKTAEQIIDEIGEAHTKITVLPGYGTASLKLCLPPKQFELINK--KRYSNED----SRSVL 217 (301) T ss_pred CcccccccCcccccccccccCCHHHHHHHHHHHHHHHHHhcCceecccEEEecHHHHHhhhh--ccccCCC----CeeHH Confidence 1111111000 0 011 2346899999999999998764211112 358999999988863 1111000 00110 Q ss_pred cceEEEEeccEEEecCccccccCccccccccccCCccccceeeeccceeEEeec-HHHhhhhhhcccceeeccchhHHHH Q lcl|Aclame:pro 229 NGFVLSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFT-SDALLVGRTIEVTGDIFYEKKEKTY 307 (402) Q Consensus 229 ~G~V~~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh-~~Av~tv~~~dl~~e~~~d~~~~~d 307 (402) .-...+..+.+|+..+.|...+...+ ....-|..+..+. -+.++ +--...++..++ .+ T Consensus 218 ~~l~~~~~~~~I~~~p~L~~~g~~g~---------~~~v~~~~~~d~~-~~~v~~~~~~~~~e~~~~-----------~~ 276 (301) T protein:vir:80 218 KVLQDNAWFSAIVRVPDLAGMGTAGS---------DSFAVIHDSNETA-ELIIPMDITRHPEEYSFP-----------RT 276 (301) T ss_pred HHHHHHcCcceEEEcceeccCCCCcc---------cEEEEEecCCcEE-EEEecCceeeecceecCc-----------ee Confidence 10001123466777777753221111 1111122111111 11111 111112222222 11 Q ss_pred HHHHHHH-hcCcccccceEEEEEEeec Q lcl|Aclame:pro 308 YIDTFMA-EGAIPDRWEAVSVVTTKRD 333 (402) Q Consensus 308 ~i~~~~a-~Ga~vlRPeaa~vv~~~~~ 333 (402) .+.+... .|.-+.||+|++.+. 
+= T Consensus 277 ~~~~~~r~~Gv~i~~P~ai~~~~--GI 301 (301) T protein:vir:80 277 KVPFEERTAGVVVRFPAAIVRVD--GI 301 (301) T ss_pred EeeeeeeeEEEEEEccceEEEEe--cC Confidence 1222222 478899999876542 11 No 182 >protein:vir:97255 Length: 310 # NCBI annotation: hypothetical protein ORF017 # Family: family:all:1120 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294525;genbank:gi:149408246;genbank:GeneID:5237120 Probab=95.54 E-value=0.0019 Score=35.53 Aligned_cols=290 Identities=9% Similarity=0.032 Sum_probs=120.4 Q ss_pred CCCCccccccccc-ccccHHHHHHHHHhHHHHHHHHHHhhhcccceeeeccccceEEeeeccc---eeeeeecCCCCCC- Q lcl|Aclame:pro 1 MSTPNTLTNVAVS-ASGEVDSLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYLGE---TELQVLAPGQSPN- 75 (402) Q Consensus 1 Ms~~n~~t~~~~~-~~~d~~alfle~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~~iG~---~t~~~~~~G~~i~- 75 (402) |. .+|...+. ++.|..+ ..|++.|.+.|-++.+.+-..+. |++.++++.-. .... .-+.+.. T Consensus 1 mp---altLaea~k~~~d~l~-------~~ViE~~~~~s~lL~~LpF~~ve-g~~~~ynR~~~~~~~~~~--~v~~~~~~ 67 (310) T protein:vir:97 1 MA---SVTLAESAKLAQDELV-------AGVIENIITVNRMFDVLPFDSIE-GNSLAYNRENVLGDVIMA--GVGTTFSG 67 (310) T ss_pred Cc---ccchHHHhhcCcchHH-------HHHHHHHhccchHHHhCCccccc-CCcceeeEeeccCCcccc--cccccccC Confidence 64 35555553 3333332 34566676666556666666666 45777776622 2221 1111110 Q ss_pred ----CCCccccceeEeecceeeccchhhhHH----Hhh-c-CccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccc Q lcl|Aclame:pro 76 ----ATPTQADKNQLVIDTTVIARNTVAHIH----DVQ-G-DIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKA 145 (402) Q Consensus 76 ----~~~~~~~e~~itID~~lya~~~IddlD----e~q-~-~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~ 145 (402) ......++++ +.+.++ --+-++| +.. 
+ .+| .+.+..+.-.++|++.+...++ .+--++.+- T Consensus 68 ~g~~~~~~t~~~~~---~~L~i~-~g~~~Vd~~i~dl~~~~~~d-q~~~Ql~~~iea~~~~~e~~lI----NGD~a~n~F 138 (310) T protein:vir:97 68 AGAGKAAATFTKVN---SNLTTI-MGDAEVNGLIQATRSGDGND-QTAVQIASKAKSAGRKYQDQLI----NGNGAGNEF 138 (310) T ss_pred CCccccccccceee---eeeeee-eehhhhhhHHHhhhcCChHH-HHHHHHHHHHHHHHHHHHHHhh----ccccCCCcc Confidence 0011111111 111111 1122333 322 3 234 4555556666788887765543 221111110 Q ss_pred ccccccccccccccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccccCc Q lcl|Aclame:pro 146 ERNKPRVKGHGFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSG 225 (402) Q Consensus 146 ~~~~~~~~g~~~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g 225 (402) .+-....+ .+..+...+.....+++ ..|.+.++.-++ .-+..+++..|+.+.++..--|=.++..--.... T Consensus 139 ~GL~~~~~---~~q~i~~~~~gg~~t~d-~LDeLl~~v~~~-----~g~p~~~l~~~~~~r~i~A~~R~~~~~g~~~~~~ 209 (310) T protein:vir:97 139 AGLIQLCA---SGQKATTGATGSAISFA-ILDELMDLVVDK-----DGQVDYLTMHARTLRSYKALLRALGGASINEVVE 209 (310) T ss_pred cchhhcCC---ccceeecCCCCCCCCHH-HHHHHHHHHhcC-----CCCCCEEEecHHHHHHHHHHHHHhcCCCCCCccc Confidence 00000000 11112111111112333 222222221111 1244689999987665554333222111111122 Q ss_pred ccccceEEEEeccEEEecCccccccCccccccccccCCcccccee-eeccc---eeEEeecHHHhhhhhhcccceeeccc Q lcl|Aclame:pro 226 ATINGFVLSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPI-AEMNG---AVAVLFTSDALLVGRTIEVTGDIFYE 301 (402) Q Consensus 226 ~~~~G~V~~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~-ad~~~---~~al~fh~~Av~tv~~~dl~~e~~~d 301 (402) ...+..|....|+||+.++-+|......++ .+.-+=|-++ ++.+. 
++|+....+.-..|+.++- -+ T Consensus 210 ~~~G~~v~~~~GiPi~~~d~ip~~~~~~~~-----~gtTsIya~r~Ge~~~~~Gv~Gl~~~~~~glsVr~~G~-----~~ 279 (310) T protein:vir:97 210 LPSGAEVPAYSGTPIFRNDYIPTNQTKGGT-----TGCTTIFAGTLDDGSRTHGIAGLTATQAAGIQVVDVGE-----SE 279 (310) T ss_pred cCCCCEEeeeCCeEEEEeCccCCCcccccc-----CCceeEEEEeeCccccccceeccccCCccceeEEeCCc-----cc Confidence 234567889999999999999975422111 1111111122 11112 2332211122222333220 01 Q ss_pred hhH-HHHHHHHHHHhcCcccccceEEEEEEeec Q lcl|Aclame:pro 302 KKE-KTYYIDTFMAEGAIPDRWEAVSVVTTKRD 333 (402) Q Consensus 302 ~~~-~~d~i~~~~a~Ga~vlRPeaa~vv~~~~~ 333 (402) ++- +.|.| .+| +|..++.|+++++|+=--. T Consensus 280 ~~~v~~~~V-~~Y-~~~av~~~~A~a~L~~V~~ 310 (310) T protein:vir:97 280 DSDEHIWRV-KWY-CGLALFSEKGLACADGITN 310 (310) T ss_pred CCcceeEEE-EEe-eeEEEecccceeeeccccC Confidence 111 11222 122 8999999999998853222 No 183 >protein:vir:94933 Length: 330 # NCBI annotation: putative phage structural protein # Family: family:all:1120 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239278;genbank:gi:66392060;genbank:GeneID:5076578 Probab=93.19 E-value=0.0085 Score=31.95 Aligned_cols=293 Identities=12% Similarity=0.058 Sum_probs=123.8 Q ss_pred CCCCcccccccccccccHHHHHHHHHhHHHHHHHHHHhhhcccceeeeccccceEEeeeccce-eeeeecCCCCCCCCCc Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYLGET-ELQVLAPGQSPNATPT 79 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~alfle~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~~iG~~-t~~~~~~G~~i~~~~~ 79 (402) |+ .+|...+.- |--......|++.|.+.+-++++.+...+.+ ++.+.++.-.. .+.-+..++.+..... 
T Consensus 25 m~---alTLaea~~------l~~d~~~~~VIE~l~~~s~iL~~lpf~~ve~-~~~~~~r~~~lp~a~~r~~n~~~~~~~~ 94 (330) T protein:vir:94 25 MP---TVTLAESAK------LSQDHLVSGLIETIVEVNPLYEMMPFTEIEG-NALAYNRENVLGDVQFLAVGGTITAKNP 94 (330) T ss_pred hh---hhhhhHHhh------cCchhhHHHHHHhhhccchHHhhcccccccC-CcceeeeeecCCcceeeeccccccccCc Confidence 33 233333221 1114456677788877666666666666654 55666665332 2222333333322211 Q ss_pred -cccceeEeecceeeccchhhhHHHhhcC-----ccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccccccc Q lcl|Aclame:pro 80 -QADKNQLVIDTTVIARNTVAHIHDVQGD-----IDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVK 153 (402) Q Consensus 80 -~~~e~~itID~~lya~~~IddlDe~q~~-----~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~ 153 (402) ...+.+. + +..--.+-++|...++ +| .|.+..+..-++|++++...++ .+......-.+-.... T Consensus 95 ~Tf~q~t~--~--l~~l~~~~~Vd~~iadl~g~~~d-~~~~q~~~~ieal~~~~e~~li----nGDs~~~~F~GL~~~~- 164 (330) T protein:vir:94 95 ATFTKVTS--E--LTTLIGDAEVNGLIQATRSDFMD-QTSVQVASKAKSIGRQYQASMI----TGDGTGNSFQGMMGLV- 164 (330) T ss_pred ceeeeeee--c--hhhhhhhHHHHHHHHHhcCCHHH-HHHHHHHHHHHHHHHHHHHHhh----ccCCCCccccchhhcC- Confidence 1122222 2 2222223356655533 34 4655556666777776665443 3211100000000000 Q ss_pred ccccccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccchhhccccccc-CcccccceE Q lcl|Aclame:pro 154 GHGFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQ-SGATINGFV 232 (402) Q Consensus 154 g~~~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~-~g~~~~G~V 232 (402) .....+...+.+...+++. .|.+.++.-+ -|-..-+++++.+.+..+.+-.|=.. .|+... 
.....+..| T Consensus 165 --~~~q~i~tg~~gg~~T~d~-LDeLl~~v~~-----~~g~~~~~l~n~a~~r~I~a~~R~~~-~~~v~~~~~~~~G~~v 235 (330) T protein:vir:94 165 --AASQTISAGANGGTLTFEL-LDQLLDLVKD-----KDGQVDYLMSSFAMRRKYFSLLRALG-GAAIGEVMTLPSGRQI 235 (330) T ss_pred --CcccEEecCCCCCCCCHHH-HHHHHHHhcC-----CCCCCcEEEechhHHHHHHHHHHhcc-CCCCCCcccccCCCEE Confidence 1112221111122234332 2222221111 12234588888777777766444221 122211 122345678 Q ss_pred EEEeccEEEecCccccccCccccccccccCCcccccee--eec--cceeEEeecHHHhhhhhhcc-cceeeccchhHHHH Q lcl|Aclame:pro 233 LSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPI--AEM--NGAVAVLFTSDALLVGRTIE-VTGDIFYEKKEKTY 307 (402) Q Consensus 233 ~~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~--ad~--~~~~al~fh~~Av~tv~~~d-l~~e~~~d~~~~~d 307 (402) ....|+||+.++-+|...+..+. .+.-+=|-++ .+. ..++||-..-..-..|+.++ +..... -.-+-.| T Consensus 236 ~~~~GvPi~~~d~ip~~~~~~~~-----~~ttsIyav~~G~~~~~qgV~Gl~~~g~~glsVr~~G~~~~k~v-~~~~v~~ 309 (330) T protein:vir:94 236 PTYRGVPWFVNDFIPSNMTQGTA-----TNATAIFAGTFDDGSNKYGIAGLTARGSAGLRVQNVGAKENADE-TITRVKM 309 (330) T ss_pred eeeCCeEEEecccccCCCCcccC-----CCceeEEEEeecccccccceEeecCCCCCcceeeeCCCccccce-eeEEEEE Confidence 88999999999999975321111 0111111111 111 13344432222222222222 100000 0001123 Q ss_pred HHHHHHHhcCcccccceEEEEEEeeccCcc Q lcl|Aclame:pro 308 YIDTFMAEGAIPDRWEAVSVVTTKRDATTG 337 (402) Q Consensus 308 ~i~~~~a~Ga~vlRPeaa~vv~~~~~~t~~ 337 (402) + +|..++.|+++++|+= +..| T Consensus 310 y------~~~av~~~~a~~~L~~---V~~g 330 (330) T protein:vir:94 310 Y------CGFANFSQLGLAAIKG---LIPG 330 (330) T ss_pred e------eeeEEechhheeeecc---ccCC Confidence 3 8999999999998752 2222 No 184 >protein:vir:8324 Length: 410 # NCBI annotation: gp41 # Family: family:all:30827 # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817892;genbank:gi:29566325;genbank:GeneID:1259520 Probab=91.41 E-value=0.0078 Score=32.15 Aligned_cols=274 Identities=11% Similarity=0.029 Sum_probs=112.1 Q ss_pred CCCCc--cccccc-------------ccc-cccHHHHH-HHH--------------------HhHHHHHHHHHHhhhccc Q lcl|Aclame:pro 1 
MSTPN--TLTNVA-------------VSA-SGEVDSLL-IEK--------------------FNGKVNEQYLKGENILSY 43 (402) Q Consensus 1 Ms~~n--~~t~~~-------------~~~-~~d~~alf-le~--------------------f~geV~t~f~~~sv~~~~ 43 (402) |..+- ....|. |+. .||.-|.= ||. |-+.++.-.+.+...+++ T Consensus 85 ~~~~~r~~p~~~~veyRSaGE~lkal~~~~~Gd~~A~~~~e~~r~a~~~~~Tgd~~~~i~~~~v~d~i~li~q~r~i~sl 164 (410) T protein:vir:83 85 AISAMRGSPVGTEVEYRSAGEYMLDMWNSAQGNASAADRLEVYARAADHQKTGDLQGVIPDPIVGPVIDFIDSARPLVST 164 (410) T ss_pred hhccCcCCCCCCCcccccHHHHHHHHhccCCchHHHHHHHHHHHHhhccCcccccccccchhHhhhHHHHHhhccchhhh Confidence 33331 111111 111 12222211 222 222222222222222222 Q ss_pred ceeeeccccceEEeeec-cceeee-eec------CCCCCCCCCccccceeEeeccee----eccchhhhHHHhhcCccch Q lcl|Aclame:pro 44 FDVQTVTGTNTVSNKYL-GETELQ-VLA------PGQSPNATPTQADKNQLVIDTTV----IARNTVAHIHDVQGDIDSL 111 (402) Q Consensus 44 ~~~rti~~Gksv~f~~i-G~~t~~-~~~------~G~~i~~~~~~~~e~~itID~~l----ya~~~IddlDe~q~~~D~v 111 (402) ...=.. .|.|..-+.. .++++. +++ -|..++...+..+-.+-.|++.= .+|..|+--+--. + T Consensus 165 f~tLP~-~g~T~eY~v~t~~~tV~~q~~~~kqa~EGd~L~~gKl~~~t~tA~ikTyGGyt~LSRQ~IERs~v~~-----L 238 (410) T protein:vir:83 165 LGTLPL-NNATFYRPIVSQRPAVGLQGVAGGASDEKTELDSQKMVIDRLTVNAKTLGGYVNVSRQAIDFSSPSA-----L 238 (410) T ss_pred hhhCCC-CCCeeEEeeecccccccccccccccccccccccccceeeeeccceeehhcCcccccceeeecCChhh-----H Confidence 221111 2666666433 223332 222 23334444555555556666643 3444432111111 2 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccccccccccccccccCCccccccHHHHHHHHHHHHHHHHhhcC Q lcl|Aclame:pro 112 KPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKGHGFSINVNVTESEALANPQYVMAAVEYALEQQLEQEV 191 (402) Q Consensus 112 rse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~g~~~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdV 191 (402) .-++ +.++.+-|+..-...-..| .+.... 
..+....+++.+...|.++....+.+-- T Consensus 239 ~~~l-raL~~AYA~atea~vra~L-~~t~t~---------------------~~a~~~~Tad~~~~~i~da~~~v~da~~ 295 (410) T protein:vir:83 239 DLVV-NGLGQQYAIETEALVGAAL-ASTSTG---------------------AVGYGNATADNVASAIWQAAGAVYTAVK 295 (410) T ss_pred HHHH-HHHHHHHHHHHHHHHHHHH-HHhhhh---------------------hhhhhhccHHHHHHHHHHHHHHHhhhhc Confidence 2222 3334444444443332222 111110 1122334677888888888888876521 Q ss_pred CccCcEEEeChHHHHHHhcccchhhcccccccC-cc--cccceEEEEeccEEEecCccccccCccccccccccCCccccc Q lcl|Aclame:pro 192 DISDVAIMMPWKFFNALRDADRIVDKTYTISQS-GA--TINGFVLSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYD 268 (402) Q Consensus 192 P~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~-g~--~~~G~V~~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~ 268 (402) --.=+++.|+|+.|..+.+--+-+|.+.+-+.+ +. +..|.-+.+.|++|+.++.+|.+. . T Consensus 296 ~~~~~~i~vS~DVl~~~~~~f~~~~~~~~dt~Gfg~~~lg~gi~G~~~~ipVvm~~~a~AgT----A------------- 358 (410) T protein:vir:83 296 GMGRLVIAIAPDVLGDFGPLFAPVNPTNAHSTGFEAGRFGQGVMGSISGIPVVMSAALGSGD----A------------- 358 (410) T ss_pred cceeeeEEechhhhhhccceeeccCCCCcccccccccccccchhhhhcccceEEecCCCcCe----e------------- Confidence 123478899999986665533333333222211 11 224566788999999999987532 1 Q ss_pred eeeeccceeEEeecHHHhhhhhhcccceeeccchhHHHHHHHHHHHhcCcccccceEEEEEEeeccCccccc Q lcl|Aclame:pro 269 PIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKEKTYYIDTFMAEGAIPDRWEAVSVVTTKRDATTGDAG 340 (402) Q Consensus 269 ~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~~~~d~i~~~~a~Ga~vlRPeaa~vv~~~~~~t~~~a~ 340 (402) -|-...++.+--+.++.+++.+- .++.--+.++ ++++| .+.-|+++.= -+-+ T Consensus 359 ---~f~~~~Ai~~~eS~~gp~qL~d~--~i~nLt~~yS----gY~a~--a~~~~~gliP---------v~g~ 410 (410) T protein:vir:83 359 ---YLFSTAAIECFEQRVGTLQVVEP--SVFGLQVAYA----GYFST--LVVNEDAIVP---------LVGS 410 (410) T ss_pred ---eEeccceeeeeecCCceeEeeCC--chhhhhhhhe----eeeee--ccccccceee---------eccC Confidence 11122233332333444555441 1111112222 33322 3333433221 1111 No 185 >protein:vir:107687 Length: 319 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1518 # MgeName: 
T1 # Cross-refs: genbank:acc:YP_003898;genbank:gi:45686314;genbank:GeneID:2773027 Probab=84.24 E-value=0.062 Score=27.20 Aligned_cols=290 Identities=10% Similarity=0.009 Sum_probs=123.0 Q ss_pred CCCC----cccc-ccccccc----ccHHHHHHHHHhHHHHHHHH----HHhhhcccceeee-cc-ccceEEee---eccc Q lcl|Aclame:pro 1 MSTP----NTLT-NVAVSAS----GEVDSLLIEKFNGKVNEQYL----KGENILSYFDVQT-VT-GTNTVSNK---YLGE 62 (402) Q Consensus 1 Ms~~----n~~t-~~~~~~~----~d~~alfle~f~geV~t~f~----~~sv~~~~~~~rt-i~-~Gksv~f~---~iG~ 62 (402) |-|. +... +.+..+. .+....|+.+..-+++.... ..-+.+.++.+++ +- +-.++.+. ..|. T Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~da~~~~g~~~~~ql~~id~~v~e~~~~~l~~~~~i~v~~~~~~~~~~~~~~~~~~~G~ 83 (319) T protein:vir:10 4 KKFDEADKSNVEMYLIQAGVKQDAAATMGIWTAQELHRIKSQSYEEDYPVGSALRVFPVTTELSPTDKTFEYMTFDKVGT 83 (319) T ss_pred cchhHHhhHHHHHHHhhccchhhhhhhhhhHHHHHHHHHHHHHHhhhhcceechhhcccccCCCCceEEEEeeeeccccc Confidence 3333 1111 1111111 12224665333335554333 3345556666664 32 33455444 3455 Q ss_pred eeeeeecC-CCCCCCCCccccceeEeecce-eeccchhhhHHHhh-cCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhh Q lcl|Aclame:pro 63 TELQVLAP-GQSPNATPTQADKNQLVIDTT-VIARNTVAHIHDVQ-GDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGG 139 (402) Q Consensus 63 ~t~~~~~~-G~~i~~~~~~~~e~~itID~~-lya~~~IddlDe~q-~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA 139 (402) .+. +.. ...+..-....++....|=.. .-+..-+.+|..++ ...+ +...-......+++++.|+.+|.=-. 
T Consensus 84 a~~--~~d~~~dip~v~~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~-l~~~k~~aA~~~~~~~~n~i~f~G~~--- 157 (319) T protein:vir:10 84 AQI--IADYTDDLPLVDALGTSEFGKVFRLGNAYLISIDEIKAGQATGRP-LSTRKASACQLAHDQLVNRLVFKGSA--- 157 (319) T ss_pred eee--ecCccccccceeccceeeEEEEEEEEeeeeecHHHHHHHHHhCCC-hHHHHHHHHHHHHHHhhceEEEeecc--- Confidence 443 221 122322223333444333321 11333356666664 4454 45555667778888888887652211 Q ss_pred hhccccccccc-cccccccccccccCCccccccHHHHHHHHHHHHHHHHhh--cCCccCcEEEeChHHHHHHhcccchhh Q lcl|Aclame:pro 140 IANTKAERNKP-RVKGHGFSINVNVTESEALANPQYVMAAVEYALEQQLEQ--EVDISDVAIMMPWKFFNALRDADRIVD 216 (402) Q Consensus 140 ~~~a~~~~~~~-~~~g~~~~~~v~~~~a~~~~~~~~l~dai~~a~~~Ldek--dVP~~gR~~VV~P~~y~~Ll~~~r~~n 216 (402) .....+. +.++.. ..+.+......+.+++.+++-|..+..+|-++ .+= ..-.++|||+.|..|.. +..+ T Consensus 158 ----~~g~~GLlN~p~~~-~~~~~~~~~~~t~t~~~i~~di~~~~~~l~~~s~g~~-~p~~L~L~p~~~~~L~~--~~~~ 229 (319) T protein:vir:10 158 ----PHKIVSVFNHPNIT-KITSGKWIDVSTMKPETAEAELTQAIETIETITRGQH-RATNILIPPSMRKVLAI--RMPE 229 (319) T ss_pred ----cccceeEEeCCCce-eeecCCCCCccccCHHHHHHHHHHHHHHHHHhcCcee-eceEEEecHHHHHhhhc--ccCC Confidence 0000111 111111 11111111122347889999999999888754 331 12358899999988853 2111 Q ss_pred cccccccCcccccceEEE-EeccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccc Q lcl|Aclame:pro 217 KTYTISQSGATINGFVLS-SYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVT 295 (402) Q Consensus 217 ~d~~~~~~g~~~~G~V~~-iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~ 295 (402) ++ ...--.+.+ -.+++|...+.|...++..+. ...-|..+-.+. 
-+.+ .++++ T Consensus 230 --~~-----~t~l~~lk~~~~~l~I~~~pel~~ag~~g~~---------~~v~y~~~~~~~-~~~v---------~~~~~ 283 (319) T protein:vir:10 230 --TT-----MSYLDYFKSQNSGIEIDSIAELEDIDGAGTK---------GVLVYEKNPMNM-SIEI---------PEAFN 283 (319) T ss_pred --CC-----eeHHHHHHHhcCCceEEEeeeecccCCCcce---------EEEEEecCCceE-EEec---------Cccee Confidence 11 111111111 146677777777542211111 001111111111 1111 11211 Q ss_pred eeeccchhHHHHHHHHHH-HhcCcccccceEEEEEEeecc Q lcl|Aclame:pro 296 GDIFYEKKEKTYYIDTFM-AEGAIPDRWEAVSVVTTKRDA 334 (402) Q Consensus 296 ~e~~~d~~~~~d~i~~~~-a~Ga~vlRPeaa~vv~~~~~~ 334 (402) ... -..+...+.+.+.. ..|.-+.||++++.+. |- T Consensus 284 ~~~-~e~~~l~~~~~~~~r~~Gv~i~~P~ai~~~d---GI 319 (319) T protein:vir:10 284 MLP-AQPKDLHFKVPCTSKCTGLTIYRPMTIVLIT---GV 319 (319) T ss_pred eee-eeecCceEEEeeeeeeEEEEEEccceeEeee---cC Confidence 111 11122333333323 2468899999876542 11 No 186 >protein:vir:103285 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277465;genbank:gi:71834107;genbank:GeneID:3562396 Probab=82.52 E-value=0.076 Score=26.71 Aligned_cols=277 Identities=11% Similarity=0.072 Sum_probs=125.2 Q ss_pred CCCCcccccccccccccHHHHHH-HHHhHHHHHHHH----HHhhhcccceeee-cc-ccceEEeee---ccceeeeeecC Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDSLLI-EKFNGKVNEQYL----KGENILSYFDVQT-VT-GTNTVSNKY---LGETELQVLAP 70 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~alfl-e~f~geV~t~f~----~~sv~~~~~~~rt-i~-~Gksv~f~~---iG~~t~~~~~~ 70 (402) |+..- +|.-..|+ +++. .++.... ..-+.+.++.+++ +- +-.++.+.. .|..+. +.. 
T Consensus 1 ~~~~~----------a~~~~~f~~~ql~-~id~~v~e~~~~~l~~~~~i~v~~~~~~~~~~~~~~~~~~~G~a~~--~~~ 67 (296) T protein:vir:10 1 MGVDK----------ADAAGIWTVKQLT-ASLNKAYETEYDQNSVVNLFPVSNEIPGYAKYFEYPVFDGVGIAQI--VAD 67 (296) T ss_pred Ccccc----------hhhhHHHHHHHHH-HHHHHHHhhhhcccccceecccccCCCCceeEEEeeeeeccCceeE--eCC Confidence 55442 23333454 6655 4444433 3445566666665 32 345565444 354442 222 Q ss_pred C-CCCCCCCccccceeEeeccee-eccchhhhHHHhhc-CccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccc Q lcl|Aclame:pro 71 G-QSPNATPTQADKNQLVIDTTV-IARNTVAHIHDVQG-DIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAER 147 (402) Q Consensus 71 G-~~i~~~~~~~~e~~itID~~l-ya~~~IddlDe~q~-~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~ 147 (402) + ..++.-....++....|=..- =+...+.+|..++. ..+ +...-...+..++++..|+.+|.=-. T Consensus 68 ~~~dip~v~~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~-l~~~ka~aA~~~~~~~~n~~~f~G~~----------- 135 (296) T protein:vir:10 68 YTDDLPLVDALATERQGKVFRFGNAFLISIDEIKVGQATGQS-LSTRKQSLAFEAHDKLLDKLVWSGST----------- 135 (296) T ss_pred CccccceeeccceeEEEEEEEEEeeeeecHHHHHHHHHhCCC-hHHHHHHHHHHHHHHhhceEEEeecc----------- Confidence 1 223222233334444333211 12333567766653 455 55666667778888888876652210 Q ss_pred ccccccccccccccc-cCCccccccHHHHHHHHHHHHHHHHhh--cCCccCcEEEeChHHHHHHhcccchhhcccccccC Q lcl|Aclame:pro 148 NKPRVKGHGFSINVN-VTESEALANPQYVMAAVEYALEQQLEQ--EVDISDVAIMMPWKFFNALRDADRIVDKTYTISQS 224 (402) Q Consensus 148 ~~~~~~g~~~~~~v~-~~~a~~~~~~~~l~dai~~a~~~Ldek--dVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~ 224 (402) .....|......++ .....+..+++.+++-|..+..+|-++ .+=. .-.++|||.+|..|... .+ +++. 
T Consensus 136 -~~g~~GLlN~p~v~~~~~~~~W~~~t~i~~Di~~~~~~l~~~s~g~~~-p~~l~L~p~~~~~L~~~---~~-~~~~--- 206 (296) T protein:vir:10 136 -AHGIPSVFDYPNINNVVSGGSWSQPTTAVSDITSLLDIIETSTNGQHR-ATHLLLPTTARRIMQNL---VP-GTSV--- 206 (296) T ss_pred -cccceeEeecCCCccccccCCccCHHHHHHHHHHHHHHHHHhhCceec-ceeEEeCHHHHHHHhhc---cC-CCCc--- Confidence 00111111111111 112234556778899898888877654 3211 12478899999888532 21 1111 Q ss_pred cccccceEEEEeccEEEecCccccccCccccccccccCCccccceeeeccceeEEeec--HHHhhhhhhcccceeeccch Q lcl|Aclame:pro 225 GATINGFVLSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFT--SDALLVGRTIEVTGDIFYEK 302 (402) Q Consensus 225 g~~~~G~V~~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh--~~Av~tv~~~dl~~e~~~d~ 302 (402) +...-.-....+++|...+.|...++.. +..++++. ++-+...=.++++.- .-.. T Consensus 207 -t~l~~ik~~~~~l~i~~~~~l~~a~~~g---------------------~~~~v~~~~~~~~~~~~v~~~~~~~-~~e~ 263 (296) T protein:vir:10 207 -SYGEFFRQNNSGVTVEFVQYLNDYNGTG---------------------TSAAIAYEKDPNNMAIEIPEATNAL-PAQP 263 (296) T ss_pred -cHHHHHHHhcCCceEEEeeeeccCCCCc---------------------ceEEEEEEcCCceEEEEcCcceeee-cccc Confidence 1111111122466777777665322111 11122221 111111111222211 1122 Q ss_pred hHHHHHHHHHHHh-cCcccccceEEEEEEeeccCcc Q lcl|Aclame:pro 303 KEKTYYIDTFMAE-GAIPDRWEAVSVVTTKRDATTG 337 (402) Q Consensus 303 ~~~~d~i~~~~a~-Ga~vlRPeaa~vv~~~~~~t~~ 337 (402) +...+.+.+.... 
|.-+.||+|++.+ .+ -|=+ T Consensus 264 ~~l~~~~~~~~~~~Gv~i~~P~ai~~~--dG-I~~~ 296 (296) T protein:vir:10 264 KDLHFKIPVTSKATGLIVYRPLTMAVM--KG-ITFA 296 (296) T ss_pred cCceEEEeeEeeEEEEEEECCceeEEE--ee-eecC Confidence 3344444444544 6999999987654 21 1111 No 187 >protein:vir:96442 Length: 418 # NCBI annotation: hypothetical protein # Family: family:all:11266 # MgeID: mge:1616 # MgeName: 119X # Cross-refs: genbank:acc:YP_001218814;genbank:gi:147917331;genbank:GeneID:5142645 Probab=72.97 E-value=0.15 Score=25.07 Aligned_cols=301 Identities=12% Similarity=0.044 Sum_probs=118.2 Q ss_pred CCCCcccccccccccccHHHHHHHHHhHHHHHHHHHHhhh-----cccceeeeccccceEEeeec-cceeeeeecCCCCC Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENI-----LSYFDVQTVTGTNTVSNKYL-GETELQVLAPGQSP 74 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~alfle~f~geV~t~f~~~sv~-----~~~~~~rti~~Gksv~f~~i-G~~t~~~~~~G~~i 74 (402) |......+-..+. .++-+|.+=.+++ |.+++++ ...+++-+| +|+++...|- |++++..|+-|.++ T Consensus 61 l~~~~~~~ta~~~----a~~T~i~V~~~~~---f~~~~l~~~~~~~EvirVtsV-ng~~lTV~RG~~~t~aa~iaag~~~ 132 (418) T protein:vir:96 61 MVFASAVVTAEAL----ADATVLTVENSDG---LTKGMIFYNEATGENMRLELV-NGLNLTVKRQTGRIAAAIIAANTKL 132 (418) T ss_pred eeeeeEEEEEEEe----cCceEEEecCCcc---cccccEEEEecCCeEEEEEEE-eCCEEEEEEccCCeeeeeeecCceE Confidence 2222211111111 1234455555666 8889986 345677788 6898887765 88888889988853 Q ss_pred --------CCCCccccc--eeEeecc-eeeccchhhhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcc Q lcl|Aclame:pro 75 --------NATPTQADK--NQLVIDT-TVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANT 143 (402) Q Consensus 75 --------~~~~~~~~e--~~itID~-~lya~~~IddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a 143 (402) .|...++.. +...++- .-|++..+.==+-+|+.+- ..-++....+. ++.+|+. ..++=++..+.. 
T Consensus 133 ~~ig~~~eEGsd~~ta~~~k~~~vsN~tQIf~e~vsVSgTAqA~v~--qaGvsn~~~~e-~d~l~~~-kv~iE~ali~g~ 208 (418) T protein:vir:96 133 IVIGTAFEEGSQRPTARSIQPVYVPNFTQIFRNAWALTDTARASYA--EAGYSNITESR-RDCMDFH-ATEQETAIFFGQ 208 (418) T ss_pred EEeecCcccccccCCcceecceeccchhheehhhhhhhhhhhhhhh--hcCcchhHHHH-HHHHHHH-HHHHHHhhhccc Confidence 122222221 1111111 1133333322233333221 11111111111 3334433 223323222222 Q ss_pred ccccc---ccc------cccccc---ccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccC------cEEEeChHHH Q lcl|Aclame:pro 144 KAERN---KPR------VKGHGF---SINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISD------VAIMMPWKFF 205 (402) Q Consensus 144 ~~~~~---~~~------~~g~~~---~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~g------R~~VV~P~~y 205 (402) +.... .+. ..|+.. +.++. ++.+...+-+.+.+++.++. +.+++..+ ++++|+|.+- T Consensus 209 ~~~~~~ng~p~~~t~R~m~gI~~f~~~Nvi~-ag~~~~~t~d~L~~~~~~a~----~~g~n~G~~~~~~~y~~~V~a~~k 283 (418) T protein:vir:96 209 AFMGTYNGQPLHTTQGIVDAIRQYAPDNVNA-MPNPTAVTYDDVVDATIDAF----KWSVNVGDNTQRVMFCDTVGMRTM 283 (418) T ss_pred cccCCCCCcccccccchhHHHHhhccccccc-cCCCCcCCHHHHHHHHHHHH----hhcCCCCCcccceEEEEEeChHHH Confidence 11100 010 011111 11122 22222334444444444443 44444322 6689999987 Q ss_pred HHHhcccchhhcccccccCcccccceEEE-E---e-ccEEEecCccccccCccccccccccCCccccceeeeccceeEEe Q lcl|Aclame:pro 206 NALRDADRIVDKTYTISQSGATINGFVLS-S---Y-NCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVL 280 (402) Q Consensus 206 ~~Ll~~~r~~n~d~~~~~~g~~~~G~V~~-i---a-G~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~ 280 (402) ..+-+ |. .+........ .-|.+.. + + =++|+..+|||.-. .. 
.-.+++ T Consensus 284 ~~I~k---~~-~~I~~~~~en-~~G~vv~~~~Td~G~v~ii~n~~~pad~------I~----------------~g~mlV 336 (418) T protein:vir:96 284 QDIGR---FF-GEVTVTQRET-SYGMVFTEWKFFKGRLIIKEHPLFSAIG------IS----------------PGFAVV 336 (418) T ss_pred HHHhh---hh-ceeEeccccc-eeceEEEEEEeeccEEEEEecCCCCccc------cC----------------cceEEE Confidence 77654 22 1222112111 1232221 1 2 35777888888411 11 112566 Q ss_pred ecHHHhhhhhh--cccceeeccchhHHHHHHHHHHHhcCc---------------ccccceEEEEEEeeccCccccccch Q lcl|Aclame:pro 281 FTSDALLVGRT--IEVTGDIFYEKKEKTYYIDTFMAEGAI---------------PDRWEAVSVVTTKRDATTGDAGGPG 343 (402) Q Consensus 281 fh~~Av~tv~~--~dl~~e~~~d~~~~~d~i~~~~a~Ga~---------------vlRPeaa~vv~~~~~~t~~~a~~~~ 343 (402) |.+..+--.-+ .++..|..--.-.-...--..+.|||+ +++|.++++|+=-+.+-+.+-++.. T Consensus 337 vD~~~vkL~yL~~R~~~~E~l~k~G~~~~~~~~~~~~~~~~D~~~G~l~~Eltle~~N~~a~a~itgl~~~~~~~~~~~~ 416 (418) T protein:vir:96 337 VDVPAVKLAYMDGRNAKVENYGQGGGENKSGATDYSYGHGVDAQGGSLTSEWALELLNPQGCAVITGLQKAKERVYLTAP 416 (418) T ss_pred EecCceEEEEecCCCccchhcccCCCcccccccccccccccccccCEEEEEEEEEeecccccEEeecccccccccccCCC Confidence 65543322222 232222221000000011123344554 4566666655433333222222111 Q ss_pred hh Q lcl|Aclame:pro 344 DD 345 (402) Q Consensus 344 ~~ 345 (402) +. T Consensus 417 ~~ 418 (418) T protein:vir:96 417 AP 418 (418) T ss_pred CC Confidence 11 No 188 >protein:vir:4786 Length: 295 # NCBI annotation: hypothetical protein # Family: family:all:3269 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150166;swissprot:trembl:q94m45;genbank:gi:15088777;uniprot:Q94M45;genbank:GeneID:955980 Probab=72.07 E-value=0.19 Score=24.57 Aligned_cols=262 Identities=10% Similarity=0.013 Sum_probs=120.3 Q ss_pred cccccccccHHHHHHHHHhHHHHHHHHHHhhhcccce-eee---ccccceEEeeeccce--eeeeecCCCCCC--CCC-- Q lcl|Aclame:pro 9 NVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFD-VQT---VTGTNTVSNKYLGET--ELQVLAPGQSPN--ATP-- 78 (402) Q Consensus 9 ~~~~~~~~d~~alfle~f~geV~t~f~~~sv~~~~~~-~rt---i~~Gksv~f~~iG~~--t~~~~~~G~~i~--~~~-- 78 (402) -|. 
|-+...-.|-|+|.|...+-|+..+.|++..- ++. |++..++---...++ -++.|..++..- |+. T Consensus 1 mp~--N~n~avr~Y~Kqf~glL~~vf~~qa~F~~~FGglQalDGV~~N~tafsvKt~D~pVVig~Y~TdeNvagFGtGTg 78 (295) T protein:vir:47 1 MPS--NQNNAVRRYEKQYAGILETVFGVRAAFSNALAPIQILDGVQENSKAFSVKTNNTPVVIGEYKTGENDGGFGDNSG 78 (295) T ss_pred CCC--CCCccchhhhHHHHHHHHHHHhHHHHHhhhhcchhhhhCCCccceEEEEeecCcceEeecccCCCcccccccCCc Confidence 222 11233456779999999999999999997653 222 333333311122222 334455565553 222 Q ss_pred ----ccccceeEeeccee-eccch--hhhHHHhhcCccchhHHH---HHHHHHHHHHHHHHHHHHHHHhhhhhccccccc Q lcl|Aclame:pro 79 ----TQADKNQLVIDTTV-IARNT--VAHIHDVQGDIDSLKPKL---AMNQAKQLKRLEDQMAIQQMLLGGIANTKAERN 148 (402) Q Consensus 79 ----~~~~e~~itID~~l-ya~~~--IddlDe~q~~~D~vrse~---s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~ 148 (402) .---+-++-+|+.. |..-| -.-||...-+=| +.... -..++.|-++.+|..+=.-|...|-- T Consensus 79 ~SsRFG~rkEi~y~dtdV~Y~~~~~iHEGiD~~TVNnd-~~aaVAdRL~LQA~Akt~~~n~~~Gk~ls~~A~~------- 150 (295) T protein:vir:47 79 AQSRFGGVTEVKYENTDVNYDYTLTIHEGLDRYTVNND-LNAAVADRLKLQSEAQTRTVNKRIGKYLSDTATK------- 150 (295) T ss_pred cccccCceeeEEeecccccccccchhhhccccccccCC-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhh------- Confidence 11234556666654 43333 233555554433 33322 24567788888887664444322210 Q ss_pred cccccccccccccccCCccccccHHHHHHHHHHHHHHHHhhcCCc---cCcEEEeChHHHHHHhcccchhhcccccccCc Q lcl|Aclame:pro 149 KPRVKGHGFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDI---SDVAIMMPWKFFNALRDADRIVDKTYTISQSG 225 (402) Q Consensus 149 ~~~~~g~~~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~---~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g 225 (402) +.+....+. 
|.+..++.++.|+.|.- ...-++|.|+.|.+|+.++-.+..- +.+.+ T Consensus 151 ---------------te~~td~t~----d~V~~LF~~as~~yvn~ev~~~~~AyV~~evYnaiiD~~l~TsaK-~SsaN- 209 (295) T protein:vir:47 151 ---------------TEALADFTD----DKVKALFNKLSAFYTNNEVTAPITVYLRSEFYNAIVDMASVTSAK-GATIS- 209 (295) T ss_pred ---------------hhhhhcccc----hhHHHHHHHHHHHhhhhheeeeeEEEEchhHHHHHhccccccccc-cceee- Confidence 001111111 34556666666666542 2333899999999999988776442 22222 Q ss_pred ccccceEEEEeccEEEecCccccccCccccccccccCCcccc-------ce-eeeccceeEEeec--HHHhhhhhhcccc Q lcl|Aclame:pro 226 ATINGFVLSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRY-------DP-IAEMNGAVAVLFT--SDALLVGRTIEVT 295 (402) Q Consensus 226 ~~~~G~V~~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~-------~~-~ad~~~~~al~fh--~~Av~tv~~~dl~ 295 (402) +-.-.+.+.-||.|-|.+.--...+.. ...+..+.|-++ .. +.||..+ .++ -.-+.+ -+|. T Consensus 210 -iDengi~~FkGf~i~e~P~~~~q~G~~--aifs~dnig~aftGIn~aR~IesEdF~GV---alQ~~~~~~~~-~~~~-- 280 (295) T protein:vir:47 210 -LDENGLPKYKGFTLEETPAQYFETGVI--AIFSPNGIIIPFVGISTARVIEAENFDGV---NCKLLLRVVLT-LLMT-- 280 (295) T ss_pred -eccCCcceecceEEEeccHhhccCCcE--EEEccccceeecccceeeeeeecccccch---HHHHHHHHHHH-HHHH-- Confidence 223456788999998876532211110 111212222221 11 1223321 111 000000 0000 Q ss_pred eeeccchhHHHHHHHHHHHhcCcccc Q lcl|Aclame:pro 296 GDIFYEKKEKTYYIDTFMAEGAIPDR 321 (402) Q Consensus 296 ~e~~~d~~~~~d~i~~~~a~Ga~vlR 321 (402) . .+.|..+-. 
+.| +| T Consensus 281 ~-----~~~~~~~~~--~~~----~~ 295 (295) T protein:vir:47 281 I-----RKQFTKLQE--LLY----RR 295 (295) T ss_pred H-----HHHHHHHHH--Hhh----cC Confidence 0 111221111 111 11 No 189 >protein:vir:94989 Length: 349 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224029;genbank:gi:62327316;genbank:GeneID:5176817 Probab=70.08 E-value=0.21 Score=24.26 Aligned_cols=316 Identities=12% Similarity=0.029 Sum_probs=137.0 Q ss_pred CCCCcccccccccccccHHHHH-HHHHhHHHHHHHHHHhhhc--ccce----eee--ccccceEEeeeccceeee-eecC Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDSLL-IEKFNGKVNEQYLKGENIL--SYFD----VQT--VTGTNTVSNKYLGETELQ-VLAP 70 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~alf-le~f~geV~t~f~~~sv~~--~~~~----~rt--i~~Gksv~f~~iG~~t~~-~~~~ 70 (402) |+. |+- .+.=.| +|+|.-.|.+.-.+.+-|. +.+. .+. ..+|+.+.+|..+...-. .... T Consensus 1 Ma~----T~l------~D~iipe~~vf~~Yv~~~~~e~~~l~qSGii~~d~~l~~~~~~gG~~~~iPf~~~l~g~~e~n~ 70 (349) T protein:vir:94 1 MAI----TTI------GNIVTGNIPVLASYMTEDPVEKTAFFNSGILTPTPYAAEIARGPSNIANLPFWKAIDTSIEPNY 70 (349) T ss_pred CCc----eEE------eeeeccChHHHHHHHHHhHHHhhhhhhccceeccHHHHHHHhcCCCEEEeeeeecCCCCccccc Confidence 652 110 001111 2356656655555544443 1111 111 257999999998775432 1111 Q ss_pred -C-C---CCCCCCccccceeEeecceeeccchhhhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccc Q lcl|Aclame:pro 71 -G-Q---SPNATPTQADKNQLVIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKA 145 (402) Q Consensus 71 -G-~---~i~~~~~~~~e~~itID~~lya~~~IddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~ 145 (402) | + .+.+..+.+.+..=++ ...-..+...||=...+--| .-.+++++.+..-.+ .||..+....++....... 
T Consensus 71 ~~dt~~~~~t~~kit~~~~~a~~-~~r~kaw~~~Dla~~lsG~d-pm~~Ia~~va~yW~r-~~q~~Lia~L~Gvf~~~~~ 147 (349) T protein:vir:94 71 SNDVYQDIATPRAIQTGEMMARV-AYLNEGFGQADLTVELTSQN-PLQSVASRLDNFWQR-QAQRRLIATALGLYNDNVS 147 (349) T ss_pred CCCCcccccccccccccceeeee-eeeccccchhHHHHHhhCch-HHHHHHHHHHHHHhh-HHHHHHHHHHHhhhccccc Confidence 1 1 1222333333221111 11112445566655555445 344566666655555 4566666666665432211 Q ss_pred cccc-ccccccccccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccCcEEEeChHHHHHHhcccchhhcccccccC Q lcl|Aclame:pro 146 ERNK-PRVKGHGFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQS 224 (402) Q Consensus 146 ~~~~-~~~~g~~~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~ 224 (402) .... .+..++.. ...++...++..+.++...+...+. -+....=-.++|-+..|..|.+-.. + +|-... T Consensus 148 ~~~~~~~~~~~~~-----d~~~~a~~~~~~~~~A~~~~Gdaa~-Gd~~~~lt~i~mHS~v~~~L~~~~l-i--~~i~~s- 217 (349) T protein:vir:94 148 ATDAYHEQNDMVV-----DVSATSGFDAGAFIDATQTMGDALM-GNGGEVLGAIAMHSFVYAQARKAQL-I--DFIRDA- 217 (349) T ss_pred ccccccccCceeE-----EecccCCCChhhHHHHHHHHHHHhc-cccccceeEEEEchHHHHHHHhcch-h--hhccCc- Confidence 1111 00001111 1112223444444444433333211 0111122458899999999766444 3 232112 Q ss_pred cccccceEEEEeccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcc-cceeeccchh Q lcl|Aclame:pro 225 GATINGFVLSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIE-VTGDIFYEKK 303 (402) Q Consensus 225 g~~~~G~V~~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~d-l~~e~~~d~~ 303 (402) ..+..|...+|.+|+.+..+|....+... .|+ -.+|-+-|++..+..+ +..|..|++. 
T Consensus 218 --~~~~~i~ty~G~~VivDD~~Pv~~~g~~~------------~yt-------tylfg~GAi~~~~~~~~~~~E~~rd~~ 276 (349) T protein:vir:94 218 --ENNTMFATYQGYRVIVDDSMTVVGQDTSR------------KFI-------SIIFGQGAIGYGEGNPEMPLEYEREAS 276 (349) T ss_pred --ccCcccceecCcEEEEeCCCccccCCCCc------------eEE-------EEEeecceEEeecCCCCcceeeecccc Confidence 23456889999999999999975422111 121 2344455555555543 3456666665 Q ss_pred HH----HHHHHH-----HHHhcCcccccceEEEEEEeeccCccc-cccchhhHH------HhhhcccceEEEeecchhh Q lcl|Aclame:pro 304 EK----TYYIDT-----FMAEGAIPDRWEAVSVVTTKRDATTGD-AGGPGDDHA------TVLARAQRKAVYVKTEGAA 366 (402) Q Consensus 304 ~~----~d~i~~-----~~a~Ga~vlRPeaa~vv~~~~~~t~~~-a~~~~~~~~------~~~~~~~~~~~~~~~~~~~ 366 (402) .+ -|.+.. +|.+|....-+. .+..+.+..+ .|+. ++++ .|--+-+=..+..+.-.-| T Consensus 277 ~g~~~G~d~L~~R~~~~~hp~G~s~~~a~-----v~~~~~~~~~~sPt~-aeLa~~~NW~~v~~~K~I~iv~~~~~~~a 349 (349) T protein:vir:94 277 RANGGGVETLWTRKTWLLHPFGYSFTSAV-----ITGNGTETIARSASW-QDLANAANWNRVVDRKHVPIAFLVTGVGA 349 (349) T ss_pred cCCcceeEEEEEeeEEEeeeeeeeecccc-----cCCCccccccCCCCh-HHhcCCcCcccccChhhcceEEEEeccCC Confidence 43 355555 344444443210 1122222222 2222 2222 2222222223333333333 No 190 >protein:vir:79548 Length: 652 # NCBI annotation: putative protease/scaffold protein # Family: family:all:62 # ACLAME annotation(s): go:0008236 - serine-type peptidase activity; phi:0000017 - phage prohead/capsid assembly # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272518;genbank:gi:148609387;genbank:GeneID:5204384 Probab=67.70 E-value=0.25 Score=23.90 Aligned_cols=291 Identities=11% Similarity=0.094 Sum_probs=122.4 Q ss_pred CCCCccccccc--ccccccHHHHHH-HHHhHHHHHHHHH-HhhhcccceeeeccccceEEeeeccce-eeeeecCCCCCC Q lcl|Aclame:pro 1 MSTPNTLTNVA--VSASGEVDSLLI-EKFNGKVNEQYLK-GENILSYFDVQTVTGTNTVSNKYLGET-ELQVLAPGQSPN 75 (402) Q Consensus 1 Ms~~n~~t~~~--~~~~~d~~alfl-e~f~geV~t~f~~-~sv~~~~~~~rti~~Gksv~f~~iG~~-t~~~~~~G~~i~ 75 (402) ++..+.....+ -..++++-...| ..-.-.++..|+. ..-++.|.+.++++-=|..+..++|.. ++..+.-|.++. 
T Consensus 348 ~~~~~~~~~v~~A~~hsTsDFp~IL~~~~nk~l~~~y~~a~~t~~~~~~~~~~~DFk~~~~~~lg~~~~L~~V~E~gEyk 427 (652) T protein:vir:79 348 VSSYNPMQMVGAAFTHSTSDFGNILLDVANKAILQGWEDAPETYEQWTRKGQLSDFKIAHRVGMGGFSALRQVREGAEYK 427 (652) T ss_pred CCCCCHHHHHHHHhhcCcchHHHHHHHHHHHHHHHHHhhhHHHHHHHhccCCCccccccceeecCCCCCccccCCCCccc Confidence 11111000000 012344444333 4444555667766 556778888888765444555555443 344444444554 Q ss_pred CCCccccceeEeeccee----eccch-h-hhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccc-c Q lcl|Aclame:pro 76 ATPTQADKNQLVIDTTV----IARNT-V-AHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAER-N 148 (402) Q Consensus 76 ~~~~~~~e~~itID~~l----ya~~~-I-ddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~-~ 148 (402) +-.+...+-++.+.+.= +.|.. | |||+-+ ..+.+.+|.+-++.+++.+...|.. .|.-. . T Consensus 428 ~~t~~e~~e~~~l~tyG~~~~iTRqaiINDDL~a~--------~~ip~~~g~aA~~~~~~~vy~~l~~-----Np~~~~D 494 (652) T protein:vir:79 428 YVTTGDKQATIALATYGELFSITRQAIINDDLNML--------TDVPMKLGRAAKSTIADLVYAILTS-----NPKISTD 494 (652) T ss_pred eeeecCccceeeeecccCeeeeehheeeccchhHH--------HHHHHHHHHHHHHHHHHHHHHHHhc-----CcccccC Confidence 43343444455555411 11111 1 344333 3466778899999999999887752 22111 1 Q ss_pred cccccccccccccccCCccccccHHHHHHHHHHHHHHHH-hhcCCccCcEEEeChHHHHHHhcccchhhcccccccCccc Q lcl|Aclame:pro 149 KPRVKGHGFSINVNVTESEALANPQYVMAAVEYALEQQL-EQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGAT 227 (402) Q Consensus 149 ~~~~~g~~~~~~v~~~~a~~~~~~~~l~dai~~a~~~Ld-ekdVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~~ 227 (402) +-..++|..-.++..++ +.+-+.|-.+...++.+-+ +..+--..||++|||+..... .++++... ..+.. . 
T Consensus 495 Gk~LF~hA~H~Nl~~~a---a~~~~~l~~ar~aM~~Qk~g~~~l~i~P~~llvp~~le~~a---~~ll~s~~-v~~a~-~ 566 (652) T protein:vir:79 495 NVSLFDKAKHANVLESA---AMDVASLDKARQLMRVQKEGERHLNIRPAFVLVPTAMESVA---NQVIRSSS-VKGAD-I 566 (652) T ss_pred Cceeecccccccccccc---cCCHHHHHHHHHHHHHhccCCccccccccEEEecchhHHHH---HHHhccCC-Ccccc-c Confidence 11111111112221111 2232222222222222212 222334678999999865443 33443221 11110 1 Q ss_pred ccceEEEEecc-EEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccchhHHH Q lcl|Aclame:pro 228 INGFVLSSYNC-PVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKEKT 306 (402) Q Consensus 228 ~~G~V~~iaG~-~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~~~~ 306 (402) -.|.+--+.|+ +|+..++|.... .+...+... .+. -.+--.| .-| ...+.+|.-.+-.-.+ T Consensus 567 ~~~~~Np~~~~~~~i~eprL~~~s--~~~wylaa~-~~~-dtiev~y--L~G------------~~~P~ie~~~gf~~dG 628 (652) T protein:vir:79 567 NAGIINPVKDFATVIAEPRLDDNS--QTTFYLAAS-KGS-DTIEVAY--LNG------------VDTPYIDQMEGFSVDG 628 (652) T ss_pred ccccccccccccccccccccCCCC--cccEEEecC-CCC-CeEEEEE--ecC------------CCCCeeeecCCCCcce Confidence 12333334453 888898885321 222333211 111 0010000 000 0111222211111123 Q ss_pred HHHHHHHHhcCcccccceEEEEEEee Q lcl|Aclame:pro 307 YYIDTFMAEGAIPDRWEAVSVVTTKR 332 (402) Q Consensus 307 d~i~~~~a~Ga~vlRPeaa~vv~~~~ 332 (402) -.++.++=||++++..-. .++.+. 
T Consensus 629 ~~~kvrlD~G~~~iD~RG--~~k~t~ 652 (652) T protein:vir:79 629 VTTKVRIDAGVAPVDHRG--LVKCTA 652 (652) T ss_pred EEEEEEEeccCceeeccc--eeeecC Confidence 334456668888886654 333332 No 191 >protein:vir:99424 Length: 360 # NCBI annotation: hypothetical protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919080;genbank:gi:119757038;genbank:GeneID:4606077 Probab=67.58 E-value=0.25 Score=23.88 Aligned_cols=302 Identities=12% Similarity=0.061 Sum_probs=102.9 Q ss_pred CCCCccc--------ccccc-cccccHHHHHH---HHHhHHHHHHHHHHhhhcccceeeeccccceEEeeeccceeeeee Q lcl|Aclame:pro 1 MSTPNTL--------TNVAV-SASGEVDSLLI---EKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYLGETELQVL 68 (402) Q Consensus 1 Ms~~n~~--------t~~~~-~~~~d~~alfl---e~f~geV~t~f~~~sv~~~~~~~rti~~Gksv~f~~iG~~t~~~~ 68 (402) |+...+. ++-.- .-.-+++.-++ +++. +-+..-+.++-++++.++-+. ..++..|++||-....-+ T Consensus 1 ~~~~~~~~~~~n~~~~~i~k~~it~~~l~~g~L~p~~a~-~Fl~~v~~~t~iL~~~r~~~~-~s~~~ei~kig~G~r~~r 78 (360) T protein:vir:99 1 MSSNSTIDSVRNQNMNSLSQKDIGLAELDGFQLPVDVTE-EFLERMQKGVQILGMADTMTL-ARLEMEVPQFGVPRLSGH 78 (360) T ss_pred CcchhHHHHHhhhHHHHHHhhhccccccCceeecHHHHH-HHHHHHhhccchhhhcceeec-ccccccccccccceeecc Confidence 6554211 11100 00001111111 2222 223334556666677765543 347777787766332222 Q ss_pred cC---CCCCC-CCCccccceeEeeccee-eccchhhhHHH-----hhcCccchhHHHHHHHHHHHHHHHHHH-------- Q lcl|Aclame:pro 69 AP---GQSPN-ATPTQADKNQLVIDTTV-IARNTVAHIHD-----VQGDIDSLKPKLAMNQAKQLKRLEDQM-------- 130 (402) Q Consensus 69 ~~---G~~i~-~~~~~~~e~~itID~~l-ya~~~IddlDe-----~q~~~D~vrse~s~~~G~aLA~~~Dq~-------- 130 (402) .. |+... ++.....-....++... .++.+.+++-+ .|.--+.++..+++..|+-|....-+. 
T Consensus 79 ~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~~i~~~~~~~n~~~~~~~f~~~i~~~~ae~~~~Dle~l~~~g~~ds~d~~ 158 (360) T protein:vir:99 79 TRDEEGSRTENSEAESGSVKFNATDKSYYILVEPKRDALKNTHYGPDQFGDYIVDQFIERYGNDLGLMGIRAGASSGNLQ 158 (360) T ss_pred ccccCCCCCcCCcCccccCccccccceeeEeechHHHHHhhhhcccchhHHHHHHHHHHHHHHHHHHHHhhccchhcccc Confidence 22 11111 11111111122233222 33344444322 111012244555555555433222110 Q ss_pred -------HHHH---HHhhhhhccccc-ccc-c---cccccccccccc---cCCccccccHHHH-HHHHHHHHHHHHhhcC Q lcl|Aclame:pro 131 -------AIQQ---MLLGGIANTKAE-RNK-P---RVKGHGFSINVN---VTESEALANPQYV-MAAVEYALEQQLEQEV 191 (402) Q Consensus 131 -------i~~~---l~kaA~~~a~~~-~~~-~---~~~g~~~~~~v~---~~~a~~~~~~~~l-~dai~~a~~~LdekdV 191 (402) ++.+ +.|=|....... ..+ . ...+.......+ ...-....+++.+ .+.|.++...|..+.- T Consensus 159 ~~~~~d~fl~~~dGwlKka~~~~~~id~a~d~t~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~lf~~~~~~Lp~kyr 238 (360) T protein:vir:99 159 SIGGAAELDNTFKGWIARAEGDAQSVDDAGDSTRIGLEDTATADADSMPSIANTDGSGNPQPVDTSLFNETIQTLDSRYR 238 (360) T ss_pred cCcccchhhhhhHHHHHHhhcccchhhccccccccccccccccccccchhhhccccccccccchHHHHHHHHHhcchhhh Confidence 0000 111110000000 000 0 000000000000 0000000111121 2234455666666653 Q ss_pred --CccCcEEEeChHHHHHHhcccchhhcccccccCcccccceEEEEeccEEEecCccccccCccccccccccCCccccce Q lcl|Aclame:pro 192 --DISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATINGFVLSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDP 269 (402) Q Consensus 192 --P~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~~~~G~V~~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~ 269 (402) |..--+.+++|..+..... .+.+|+. 
..++..+.++......|++|+..++||...-..| T Consensus 239 ~~~~~~~~~~~s~~~~~~yr~--~L~~R~t-~LGd~~l~g~~~~~~~Gipi~~v~~~pd~~~mlT--------------- 300 (360) T protein:vir:99 239 ESDAYSPVLMTSPNQVQSYTM--SLTERED-PLGSAVIFGDSDITPFSYDLVGVNGFPDEYMMFT--------------- 300 (360) T ss_pred cCcccceEEEccCchHHHHHH--HHhccCc-ccchhheecccccccceeeeEEcCCCCCCceEEe--------------- Confidence 2112145778877666554 4555553 3444444455556789999999999996432211 Q ss_pred eeeccceeEEeecHHHhhhhhhcccceee-----ccchhHHH--HHHHHHHHhcCcccccceEEEEEEeeccCc Q lcl|Aclame:pro 270 IAEMNGAVAVLFTSDALLVGRTIEVTGDI-----FYEKKEKT--YYIDTFMAEGAIPDRWEAVSVVTTKRDATT 336 (402) Q Consensus 270 ~ad~~~~~al~fh~~Av~tv~~~dl~~e~-----~~d~~~~~--d~i~~~~a~Ga~vlRPeaa~vv~~~~~~t~ 336 (402) +|+=+..+-..++..+. +++++|+. +++.+. +=.-+--+||++.++=..+.++ T Consensus 301 ------------~p~NLi~g~~~~iri~~~~e~~~~~~~~~~~~~~~~~~--~D~~iee~~Av~~vt~~~~~~~ 360 (360) T protein:vir:99 301 ------------DPNNLAFGLYEEMELDQSTDTDKVHEQRLHSRNWLEGQ--FDFQIKEQQAGVLVTDLETPTA 360 (360) T ss_pred ------------ccCceeEEeeeeeEEeecccchhhhhhceeeeEEEEEE--eeEEEEecccEEEEecCCCCCC Confidence 11111111111111111 11222211 110000 0111222444443322222222 No 192 >protein:vir:95512 Length: 693 # NCBI annotation: Putative Clp protease # Family: family:all:62 # ACLAME annotation(s): go:0008236 - serine-type peptidase activity; phi:0000017 - phage prohead/capsid assembly # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293349;genbank:gi:148912770;genbank:GeneID:5228164 Probab=65.89 E-value=0.28 Score=23.65 Aligned_cols=290 Identities=11% Similarity=0.093 Sum_probs=110.2 Q ss_pred CCCC-----------------c---ccccccccccccHHHHHH-HHHhHHHHHHHHH-HhhhcccceeeeccccceEEee Q lcl|Aclame:pro 1 MSTP-----------------N---TLTNVAVSASGEVDSLLI-EKFNGKVNEQYLK-GENILSYFDVQTVTGTNTVSNK 58 (402) Q Consensus 1 Ms~~-----------------n---~~t~~~~~~~~d~~alfl-e~f~geV~t~f~~-~sv~~~~~~~rti~~Gksv~f~ 58 (402) |+.- | -..+. -..++++-.+.| ....-.++..|+. -.-++.|.+.++++-=|..+.. 
T Consensus 366 ~~L~elAr~~L~~rg~~~~~~~~~~~~~~a-~~htTSDFp~IL~~~~nk~l~~~y~~a~~t~~~~~~~~~~~DFk~~~~~ 444 (693) T protein:vir:95 366 MTLRELARASLVDRGIGVASLNAPQMVGLA-FTHTSSDFGLILLDVANKSVLAGWEEAEETFPLWTKSGILTDFKPARRV 444 (693) T ss_pred CcHHHHHHHHHHhcCCccCCCCHHHHHHHH-HhcCcchhHHHHHHHHHHHHHHHHHhhhhHHHHHhccCCCCccccccee Confidence 1111 1 00010 012344443333 5555666777777 5666777777777544444455 Q ss_pred eccce-eeeeecCCCCCCCCCccccceeEeecce----eeccchh--hhHHHhhcCccchhHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 59 YLGET-ELQVLAPGQSPNATPTQADKNQLVIDTT----VIARNTV--AHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMA 131 (402) Q Consensus 59 ~iG~~-t~~~~~~G~~i~~~~~~~~e~~itID~~----lya~~~I--ddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i 131 (402) ++|.. ++..+.-|.++.+-.+...+-++.+.+. -+.|..| |||+-+ +.+.+.+|.+-++.+++.+ T Consensus 445 ~lg~~~~L~~V~E~gEyk~~t~~e~~e~~~l~tyG~~~~iTRqaiINDDLga~--------~~ip~~~g~aA~~~~~~~v 516 (693) T protein:vir:95 445 GLGEFSSLRQVREGAEYKYVTLGERGEQIILATYGELFSITRQAIINDDLQML--------SDIPFKLGQAAKATIGDLV 516 (693) T ss_pred ecCCCCChhhcCCCCceeeeecCCccceeehhhcCCeeeecHHhhhccchHHH--------HHHHHHHHHHHHHHHHHHH Confidence 55543 3333333334333223223233333331 0112111 344322 3466778999999999999 Q ss_pred HHHHHhhhhhccccccccccccccccccccccCCccccccHHHHHHHHHHHHHHHHh----h--cCCccCcEEEeChHHH Q lcl|Aclame:pro 132 IQQMLLGGIANTKAERNKPRVKGHGFSINVNVTESEALANPQYVMAAVEYALEQQLE----Q--EVDISDVAIMMPWKFF 205 (402) Q Consensus 132 ~~~l~kaA~~~a~~~~~~~~~~g~~~~~~v~~~~a~~~~~~~~l~dai~~a~~~Lde----k--dVP~~gR~~VV~P~~y 205 (402) +..|..-..+ ..++..|..+|+- .++ ++....+-+.+-.+...++.+-++ . .+--..+|++|||... 
T Consensus 517 y~~L~~Np~m---~DGk~LFhadH~N--l~t--ga~sals~~sl~~a~~am~~qk~~~~~~~g~~L~i~P~~llvP~~le 589 (693) T protein:vir:95 517 YAVLTGNPAM---SDGKTLFHADHSN--LLT--GAASALSIDSLSKAKTQMATQKAQVEKGKGRTLNIRPGFVLTPVALE 589 (693) T ss_pred HHHHhcCccc---cCCcceeeccccc--ccc--ccccccChHHHHHHHHHHHHhhcchhccCCceeecccceEEecchHH Confidence 9887532111 1223333333322 111 111122333333333333332211 1 1223568889988876 Q ss_pred HHHhcccchhhcccccccCcccccceEEEEecc-EEEecCccccccCccccccccccCCccccceeeec-cceeE-Eeec Q lcl|Aclame:pro 206 NALRDADRIVDKTYTISQSGATINGFVLSSYNC-PVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEM-NGAVA-VLFT 282 (402) Q Consensus 206 ~~Ll~~~r~~n~d~~~~~~g~~~~G~V~~iaG~-~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~-~~~~a-l~fh 282 (402) ... .++++..+....+ .-.|.+--+.|+ +|+..++|.... .+.+.+... .+.. .+--.| ....+ .+.+ T Consensus 590 ~~a---~~l~~s~~~~~a~--~~~~~~NP~~~~~~vi~~prL~~~s--~~~Wyl~a~-~~~d-tie~~yL~G~~~P~ie~ 660 (693) T protein:vir:95 590 DKA---NQIINSESVPGAD--VNSGIVNPIRAFAQVIGEPRLDDAS--ATAWYMAAK-KGSD-TIEVAYLDGVDTPYLEQ 660 (693) T ss_pred HHH---HHHhccccccccc--cccccccchhccccccccceecCCC--CCceEEecC-CCCC-eEEEEEecCCCCCeEee Confidence 643 3455443321111 112333334453 788888885322 223333211 1110 000000 00000 0000 Q ss_pred HHHhhhhhhcccceeeccchhHHHHHHHHHHHhcCcccccce Q lcl|Aclame:pro 283 SDALLVGRTIEVTGDIFYEKKEKTYYIDTFMAEGAIPDRWEA 324 (402) Q Consensus 283 ~~Av~tv~~~dl~~e~~~d~~~~~d~i~~~~a~Ga~vlRPea 324 (402) .+-.. .-.++.++. 
.|+=.+..=|=.-+..|.| T Consensus 661 ~~gf~---~dG~~~kvr------~D~G~~~iD~Rg~~kn~GA 693 (693) T protein:vir:95 661 QEGFT---VDGVASKVR------IDAGVAPLDFRGLQKSNGA 693 (693) T ss_pred cCCCC---cceEEEEEE------EeccCceeeccccccCCCC Confidence 00000 000000000 0000000011111223333 No 193 >protein:vir:104342 Length: 314 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398971;genbank:gi:81343955;genbank:GeneID:3778874 Probab=64.97 E-value=0.29 Score=23.52 Aligned_cols=286 Identities=9% Similarity=0.008 Sum_probs=122.0 Q ss_pred CCCCcccc--cccc---cccc-cHHHHHH-HHHhHHHHHHHHH----Hhhhcccceeee-cc-ccceEEee---ecccee Q lcl|Aclame:pro 1 MSTPNTLT--NVAV---SASG-EVDSLLI-EKFNGKVNEQYLK----GENILSYFDVQT-VT-GTNTVSNK---YLGETE 64 (402) Q Consensus 1 Ms~~n~~t--~~~~---~~~~-d~~alfl-e~f~geV~t~f~~----~sv~~~~~~~rt-i~-~Gksv~f~---~iG~~t 64 (402) |-|.-... ..+. .+.. |.--.|+ +++. .|+....+ .-..+.++.+++ +- .-.++.+. ..|..+ T Consensus 3 ~~~~~~~~~~~~~~~~~~~~~~d~~~~fl~~ql~-~id~~v~e~~~~~~~~~~~i~v~~~~~~~~et~~~~~~e~~G~a~ 81 (314) T protein:vir:10 3 IKFDAEQAKITTHLEQMGVEKADAAGIWAVSQLT-AALNRAYEKEYAENSVVNIFPVTNEIPGHAKYFEYPEFDGVGIAQ 81 (314) T ss_pred cchHHHHHHHHHHHHhhcccchhhhHHHHHHHHH-HHHHHHhhhhccccccceeeccccCCCCceeEEEeeeecccccee Confidence 55553221 1111 1222 2222455 5444 45544443 344455666654 21 23355544 345544 Q ss_pred eeeecC-CCCCCCCCccccceeEeeccee-eccchhhhHHHhh-cCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhh Q lcl|Aclame:pro 65 LQVLAP-GQSPNATPTQADKNQLVIDTTV-IARNTVAHIHDVQ-GDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIA 141 (402) Q Consensus 65 ~~~~~~-G~~i~~~~~~~~e~~itID~~l-ya~~~IddlDe~q-~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~ 141 (402) . 
+.- +..++.-....++....|-..- -+...+.+|..++ ...+ +...-...+..++++..|+.+|.=- T Consensus 82 ~--~~d~~~dip~vd~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~-l~~~k~~aA~~~~~~~~n~i~f~G~------ 152 (314) T protein:vir:10 82 I--IADYSDDLPLVDAFMTEKQGKVFRFGNAFLISTDEIKAGAATGQS-LSARKQALAFEAHDNLLDKLVWSGS------ 152 (314) T ss_pred e--eCCcccccceeecccceeEEEEEEEEeeEEecHHHHHHHHHhCCC-hHHHHHHHHHHHHHHhhceEEEeec------ Confidence 2 221 2234333344445555443311 1222245555553 3444 4455555666777777776554210 Q ss_pred ccccccccccccccccccccccC-CccccccHHHHHHHHHHHHHHHHhh----cCCccCcEEEeChHHHHHHhcccchhh Q lcl|Aclame:pro 142 NTKAERNKPRVKGHGFSINVNVT-ESEALANPQYVMAAVEYALEQQLEQ----EVDISDVAIMMPWKFFNALRDADRIVD 216 (402) Q Consensus 142 ~a~~~~~~~~~~g~~~~~~v~~~-~a~~~~~~~~l~dai~~a~~~Ldek----dVP~~gR~~VV~P~~y~~Ll~~~r~~n 216 (402) + .....|......++.. ...+..+++.+++-|..+..+|.++ .-|. .++|||..|..|.. + .+ T Consensus 153 -~-----~~g~~GLlN~p~v~~~~~~~~WaT~~ei~~Di~~~~~~l~~~s~g~~~p~---~l~Lpp~~~~~L~~--~-~~ 220 (314) T protein:vir:10 153 -A-----PHGIVSVFDQPNINNVVATPNWSVPQNAIDDVTAMIDAVESSTQGLHHVT---DILLPASARRVMQG--L-VP 220 (314) T ss_pred -c-----cccceeEeecCCCccccCCCCcccHHHHHHHHHHHHHHHHHhcCccccce---eEEecHHHHHhhcc--c-cc Confidence 0 0011122211122211 2235568899999999999999875 2232 57899999877732 1 11 Q ss_pred cccccccCcccccceE-EEEeccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccc Q lcl|Aclame:pro 217 KTYTISQSGATINGFV-LSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVT 295 (402) Q Consensus 217 ~d~~~~~~g~~~~G~V-~~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~ 295 (402) +++. +.. -.+ .+-.+++|...+.|-..++..+. 
...-|..+-.+ +.+.++ ++++ T Consensus 221 --~~~~---tvl-~~l~~n~~~l~I~~~~el~~ag~~g~~---------~~v~y~~~~~~-~~~~vp---------~~~~ 275 (314) T protein:vir:10 221 --QTNL---SYG-ELFTRNNPGLTIRFLQFLDNYDGAGGK---------AALAFEKSPLN-MSIEIP---------EVTN 275 (314) T ss_pred --CCCc---cHH-HHHHHhCCCcEEEEcccccccCCCcce---------EEEEEecCCcE-EEEecC---------ccce Confidence 1110 000 000 01136777777776532211110 00111111111 111111 1111 Q ss_pred eeeccchhHHHHHHHHHHHh-cCcccccceEEEEEEeeccCcc Q lcl|Aclame:pro 296 GDIFYEKKEKTYYIDTFMAE-GAIPDRWEAVSVVTTKRDATTG 337 (402) Q Consensus 296 ~e~~~d~~~~~d~i~~~~a~-Ga~vlRPeaa~vv~~~~~~t~~ 337 (402) .-. ...+...+.+.+.... |.-+.||++++.+ . |-|=+ T Consensus 276 ~l~-~e~~~~~~~~~~~~r~~Gv~i~~P~ai~~~--d-GI~~~ 314 (314) T protein:vir:10 276 VLP-AQPKDLHFRYPVTSKATGLIVYRPLTMAVI--K-GITFA 314 (314) T ss_pred eec-ceecCceEEEcceeeeEEEEEECcceeEee--e-eeecC Confidence 110 1122233333333333 7889999987632 1 11111 No 194 >protein:vir:78387 Length: 349 # NCBI annotation: putative coat protein # Family: family:all:1522 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110837;genbank:gi:134288598;genbank:GeneID:5179650 Probab=62.78 E-value=0.33 Score=23.23 Aligned_cols=314 Identities=12% Similarity=0.020 Sum_probs=137.8 Q ss_pred CCCCcccccccccccccHHHHH-HHHHhHHHHHHHHHHhhhcc--cce----eee--ccccceEEeeeccceeee-ee-- Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDSLL-IEKFNGKVNEQYLKGENILS--YFD----VQT--VTGTNTVSNKYLGETELQ-VL-- 68 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~alf-le~f~geV~t~f~~~sv~~~--~~~----~rt--i~~Gksv~f~~iG~~t~~-~~-- 68 (402) |+. |+- .+.-.| +|+|.-.|.+.-.+.+-|.. .+. .+. ..+|+.+.+|..+.+.-. .. 
T Consensus 1 Ma~----T~l------~D~iipe~~vf~~Yv~~~~~e~~~l~qSGii~~d~~l~~~~~~gG~~~~iPf~~~L~g~~e~nv 70 (349) T protein:vir:78 1 MAI----TTI------GDIVTGNIPVLASYMTEDPVEKTAFFDSGILTSTPYAAEIANGPSNIANLPFWKAIDTSIEPNY 70 (349) T ss_pred CCc----eEE------eeeeccCHHHHHHHHHHhhHHhhhhhhccceeccHHHHHHhhcCCCEEEeeeeecCCCCccccc Confidence 651 110 001111 23566555555555444331 111 111 257999999999876531 11 Q ss_pred -cCC--CCCCCCCccccceeEeecceeeccchhhhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccc Q lcl|Aclame:pro 69 -APG--QSPNATPTQADKNQLVIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKA 145 (402) Q Consensus 69 -~~G--~~i~~~~~~~~e~~itID~~lya~~~IddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~ 145 (402) .-+ ..+.++.+.+.+..=++= ..-..+...||-...+--| .-.+++.+.+..-.+ .||..+....++....... T Consensus 71 ~~D~~~~~~t~~kitt~~~~a~~~-~r~kaw~~~Dla~~lsG~d-pm~~Ia~~va~yW~r-~~q~~Lia~L~Gvf~~~~~ 147 (349) T protein:vir:78 71 SNDVYQDIATPRAIQTGEMMARVA-YLNEGFGQADLTVELTSQN-PLQSVASRLDNFWQR-QAQRRLIATALGLYNDNVS 147 (349) T ss_pred CCCCcccccccccccccceeeeee-eeccccchhHHHHHhhCch-HHHHHHHHHHHHHhh-HHHHHHHHHHHHhhccccc Confidence 111 122333343333222211 1123455666655555544 344555555555544 4566666666665432211 Q ss_pred cccc-ccccccccccccccCCccccccHHHHHHHHHHHHHHHHhh---cCCccCcEEEeChHHHHHHhcccchhhccccc Q lcl|Aclame:pro 146 ERNK-PRVKGHGFSINVNVTESEALANPQYVMAAVEYALEQQLEQ---EVDISDVAIMMPWKFFNALRDADRIVDKTYTI 221 (402) Q Consensus 146 ~~~~-~~~~g~~~~~~v~~~~a~~~~~~~~l~dai~~a~~~Ldek---dVP~~gR~~VV~P~~y~~Ll~~~r~~n~d~~~ 221 (402) .... .+..++... ..+....++.. |.++.++|... +....=-.++|-+..|..|.+.. ++ +|-. 
T Consensus 148 a~~~~~~~~~~t~d-----~s~~a~~~~~~----~~dA~~~lgda~~Gd~~~~lt~i~mHS~v~~~L~~~~-li--~~i~ 215 (349) T protein:vir:78 148 ATDAYHEQNDMVVD-----VSATLGFDAGA----FIDATQTMGDALMGNGGEVLGAIAMHSFVYAQARKAQ-LI--DFIR 215 (349) T ss_pred ccchhhhcccceee-----eccccCCChhh----hhhhHHHHHHHhccccccceeEEEEchHHHHHHHhhh-hh--hhcc Confidence 1110 011111111 11122234444 44444454332 11222356899999999977544 44 3322 Q ss_pred ccCcccccceEEEEeccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcc-cceeecc Q lcl|Aclame:pro 222 SQSGATINGFVLSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIE-VTGDIFY 300 (402) Q Consensus 222 ~~~g~~~~G~V~~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~d-l~~e~~~ 300 (402) .. ..+..|...+|.+|+.+..+|....+. ...|. -++|-+-|++..+..+ +..|..| T Consensus 216 ~s---~~~~~i~ty~G~~VivDD~~Pv~~~g~------------~~~yt-------tylfg~GAi~~~~~~~~~~~et~r 273 (349) T protein:vir:78 216 DA---ENNTMFATYQGYRVIVDDSMTVVGQGA------------QRKFI-------SIIFGQGAIGYGEGNPVMPLEYER 273 (349) T ss_pred Cc---ccCcccceecCeEEEEeCCCccccCCC------------CceEE-------EEEeecceEEEccCCCccceeeec Confidence 22 234578899999999999999754211 11121 2455555555554443 2356666 Q ss_pred chhHH----HHHHHH-----HHHhcCcccccceEEEEEEeeccCccccccchhhHH------HhhhcccceEEEeecchh Q lcl|Aclame:pro 301 EKKEK----TYYIDT-----FMAEGAIPDRWEAVSVVTTKRDATTGDAGGPGDDHA------TVLARAQRKAVYVKTEGA 365 (402) Q Consensus 301 d~~~~----~d~i~~-----~~a~Ga~vlRPeaa~vv~~~~~~t~~~a~~~~~~~~------~~~~~~~~~~~~~~~~~~ 365 (402) |+... -|.+.. +|.+|....-. . 
.+..+....+.+.+-++++ .|--+-+=..+..+.-.- T Consensus 274 d~~~g~~~G~d~l~~R~~~~~hp~G~s~~~a---~--v~~~~~~~~~~sPt~aeLa~~~NW~~v~~~K~I~iv~~~~~~~ 348 (349) T protein:vir:78 274 EASRANGGGVETLWTRKTWLLHPFGYRFTSA---V--ITGNGTETIARSASWQDLANATNWNRVVDRKHVPIAFLVTGVG 348 (349) T ss_pred ccccCCcceeEEEEEeeEEEeeeeeeeeccc---c--ccCCccccccCCCChHHhcCCcCcccccChhhcceEEEEeccC Confidence 66442 355555 34444443321 1 1122222222221112222 222222222333333333 Q ss_pred h Q lcl|Aclame:pro 366 A 366 (402) Q Consensus 366 ~ 366 (402) | T Consensus 349 a 349 (349) T protein:vir:78 349 A 349 (349) T ss_pred C Confidence 3 No 195 >protein:vir:103370 Length: 418 # NCBI annotation: hypothetical protein # Family: family:all:11266 # MgeID: mge:1621 # MgeName: PaP2 # Cross-refs: genbank:acc:YP_024741;genbank:gi:48697083;genbank:GeneID:2846038 Probab=56.57 E-value=0.45 Score=22.47 Aligned_cols=297 Identities=10% Similarity=0.046 Sum_probs=113.5 Q ss_pred CCCCcccccccccccccHHHHHHHHHhHHHHHHHHHHhhhc-----ccceeeeccccceEEeeec-cceeeeeecCCCCC Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENIL-----SYFDVQTVTGTNTVSNKYL-GETELQVLAPGQSP 74 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~alfle~f~geV~t~f~~~sv~~-----~~~~~rti~~Gksv~f~~i-G~~t~~~~~~G~~i 74 (402) |-.....+-.... ..+.+|.+=.|++ |.+++++- ..+++-+| +|+++...|- |++++.-++-|+++ T Consensus 61 ~~~~~~~~ta~a~----a~~T~l~ve~~~~---f~~~~l~~~~~~~Evirv~sV-ng~~lTV~Rg~~~t~aaaia~n~~~ 132 (418) T protein:vir:10 61 MVFASAVVTAEAA----ADATVLTVENSDG---LTKGMIFYNEATGENMRLELV-NGLNLTVKRQTGRISAAIIAANTKL 132 (418) T ss_pred EeeeeEEEEEEEe----cCceEEEEcCcce---eccccEEEEccCCeEEEEEEE-eCCEEEEEEecCCeeEEEEecCceE Confidence 4444333322221 2344566666776 88899863 26677788 6898888765 78877777777742 Q ss_pred --------CCCCccccc--eeEeec-ceeeccchhhhHHHhhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhc- Q lcl|Aclame:pro 75 --------NATPTQADK--NQLVID-TTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIAN- 142 (402) Q Consensus 75 --------~~~~~~~~e--~~itID-~~lya~~~IddlDe~q~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~- 142 (402) .|...++.. 
+.-.|. =.-+++..+.==+-+|+.+ ...-++.+...+.-++.+..+ .+=|+...- T Consensus 133 ~~Ig~~~eEGsd~~ta~~~k~~~vsNvtQIF~~avsvSgTaqAs~--~q~Gvsn~~ese~drk~~~av--~iEkalI~G~ 208 (418) T protein:vir:10 133 IVIGTAFEEGSQRPTARSIQPVYVPNFTQIFRNAWALTDTARASY--AEAGYSNITESRRDCMDFHAT--EQETAIFFGQ 208 (418) T ss_pred EEeccccccccccCCcceecceeccchhhhhhhhhhhhhhhhhcc--ccccCchHHHHHHHHHHHHHH--HHHHHHhccc Confidence 122222211 111111 1114444443334444432 122222232223223333222 122222111 Q ss_pred --cccccccc--cccccc-------cccccccCCccccccHHHHHHHHHHHHHHHHhhcCCccC------cEEEeChHHH Q lcl|Aclame:pro 143 --TKAERNKP--RVKGHG-------FSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISD------VAIMMPWKFF 205 (402) Q Consensus 143 --a~~~~~~~--~~~g~~-------~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~~g------R~~VV~P~~y 205 (402) .+-+..++ ...|+. .+.++. .+.+...+-+.+.+++.+++ +.+.+..+ ++++|+|++- T Consensus 209 ~~~~~~~~g~~R~m~GIl~~vr~~~~gnVv~-a~~~t~~s~d~l~~a~~~af----~~g~~~G~~~q~~~f~~~V~~~~k 283 (418) T protein:vir:10 209 AFMGTYNGQPLHTTQGIVDAVRQYAPDNVNA-MPNPTAVTYDDVVDATIDAF----KWSVNVGDNTQRVMFCDTVGMRTM 283 (418) T ss_pred ccCCCcCCcchhhHHHHHHHHhhhcccceec-cCCCCccCHHHHHHHHHHHh----hccCCCcccccceeEEEEeChHHH Confidence 11111111 011111 112222 22222334445555555544 33333322 7789999886 Q ss_pred HHHhcccchhhcccccccCcccccceEEEE-e--c-c-----EEEecCccccccCccccccccccCCccccceeeeccce Q lcl|Aclame:pro 206 NALRDADRIVDKTYTISQSGATINGFVLSS-Y--N-C-----PVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGA 276 (402) Q Consensus 206 ~~Ll~~~r~~n~d~~~~~~g~~~~G~V~~i-a--G-~-----~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~ 276 (402) ..+- +|. .++....+....+-.|-.+ . | + +|+..-|||.. 
T Consensus 284 ~~I~---k~~-~~I~~~~~e~~~G~vv~~~~~~~G~I~L~~~p~~~~~~lp~g--------------------------- 332 (418) T protein:vir:10 284 QDIG---RFF-GEVTVTQRETSYGMVFTEWKFFKGRLILKEHPLFSAIGISPG--------------------------- 332 (418) T ss_pred HHhh---hhh-hheeecccceeeeEEEEEEEcceEEEEeecccccccccCCCc--------------------------- Confidence 5543 332 2332222211111111111 0 1 1 22222234321 Q ss_pred eEEeecHHHhhhhhh--cccceeeccchh---HHH----------HHHHHHHH--hcCcccccceEEEEEEeeccCcccc Q lcl|Aclame:pro 277 VAVLFTSDALLVGRT--IEVTGDIFYEKK---EKT----------YYIDTFMA--EGAIPDRWEAVSVVTTKRDATTGDA 339 (402) Q Consensus 277 ~al~fh~~Av~tv~~--~dl~~e~~~d~~---~~~----------d~i~~~~a--~Ga~vlRPeaa~vv~~~~~~t~~~a 339 (402) .+|++.+.++--.-+ .+++.|..--.- .++ |.+.|... |+-.+++|.++++|+=-+.+-|-+- T Consensus 333 ~mlVvD~~~vkL~~L~~R~~~~E~l~k~G~~~~~~~~~~~~~~~~D~~kG~iv~E~tLe~~N~~a~avitgl~~~~~~~~ 412 (418) T protein:vir:10 333 FAVVVDVPAVKLAYMDGRNAKVENYGQGGGENKSGATDYSYGHGVDAQGGSLTSEWALELLNPQGCAVITGLQKAKERVY 412 (418) T ss_pred eEEEEccccceEEEeccccccchhcccCCCcccccccccccccccccccceEEEEeeeeeecccceEEeeccceeccccc Confidence 244544333211111 222223221000 001 22111111 3344577887777654332222222 Q ss_pred ccchhh Q lcl|Aclame:pro 340 GGPGDD 345 (402) Q Consensus 340 ~~~~~~ 345 (402) ++..+. T Consensus 413 ~t~p~~ 418 (418) T protein:vir:10 413 LTAPAP 418 (418) T ss_pred CCCCCC Confidence 211111 No 196 >protein:vir:5942 Length: 523 # NCBI annotation: similar to major head protein # Family: family:all:364 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835728;genbank:gi:30044131 Probab=52.65 E-value=0.55 Score=22.01 Aligned_cols=283 Identities=10% Similarity=-0.038 Sum_probs=126.1 Q ss_pred CCCC-c--ccccccccccccHHHHHHHHHhHHHHHHHHHHhhhcccceeeeccccc-eEEeeeccceeeeeecCCCCCCC Q lcl|Aclame:pro 1 MSTP-N--TLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTN-TVSNKYLGETELQVLAPGQSPNA 76 (402) Q Consensus 1 Ms~~-n--~~t~~~~~~~~d~~alfle~f~geV~t~f~~~sv~~~~~~~rti~~Gk-sv~f~~iG~~t~~~~~~G~~i~~ 76 (402) |... + ..+.+.....++. . ..+.-..-++-.|. +......|.... 
.-..+..+.. T Consensus 219 l~gEA~t~~sTd~at~~~Gtt-----------------~---t~~~~~lyt~~~g~~t~~~~~~~~~~~-~~~~~~~~~e 277 (523) T protein:vir:59 219 LYARLFFVTGSDFATVAGGTP-----------------S---TQDLDLVYYIDARNDFEDQSTDPDYPD-PGFQSLDIPE 277 (523) T ss_pred ccccccccccccccccCCCcc-----------------c---ccccccccccccccchhhccccccccc-cccccccccc Confidence 2111 1 0111111111100 0 00000001111111 111111111100 0011222222 Q ss_pred CCccccceeEeecc-eeeccchhhhHHHhhc--C-ccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccccc Q lcl|Aclame:pro 77 TPTQADKNQLVIDT-TVIARNTVAHIHDVQG--D-IDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRV 152 (402) Q Consensus 77 ~~~~~~e~~itID~-~lya~~~IddlDe~q~--~-~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~ 152 (402) .....+|++++.=. .|-+.-.+.---|.++ + +| -.+|++.=+..++..++++-|++.+..-|.- ....... T Consensus 278 M~FsIeK~tVtAkSRaLKAeYT~ELAQDLKAiH~GLD-AE~ELanILStEImlEINR~ii~~~~~~a~~----~~~~~~~ 352 (523) T protein:vir:59 278 INLELRSRPVATKTRKLRAAWTPEAMQDLAAYHKGVD-LENEIVTLMSQYIAREIDLEILSTIMAHARR----TDNYGFW 352 (523) T ss_pred eeeEEEeEEEeeecccccccccHHHHHHHHHHhcCCC-hhHHHHHHHHHHHHHHhhHHHHHhHhhhhee----eeecccc Confidence 23333444443322 1234445666666666 3 88 6899999999999999999999999864421 1111111 Q ss_pred cccccccccccCCcccccc-HH----HHHHHHHHHHHHHH-hhc-CC----c-cCcEEEeChHHHHHHhcccchhhcccc Q lcl|Aclame:pro 153 KGHGFSINVNVTESEALAN-PQ----YVMAAVEYALEQQL-EQE-VD----I-SDVAIMMPWKFFNALRDADRIVDKTYT 220 (402) Q Consensus 153 ~g~~~~~~v~~~~a~~~~~-~~----~l~dai~~a~~~Ld-ekd-VP----~-~gR~~VV~P~~y~~Ll~~~r~~n~d~~ 220 (402) + ..+.......+... .. ..++.+..+..+++ |.| +- . .+-|+|++|+..++|-..+-+..++.. 
T Consensus 353 ~----~g~~~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~n~i~~~t~~~~~~~~~~s~~v~~~l~~~~~~~~~~~~ 428 (523) T protein:vir:59 353 S----EVVGEYYDETSGNFVAGNFYGSKQEWLATLMIELNKVSNRIQQKTAVAGANFLVTSPQVAALLESMPGFTPGNDN 428 (523) T ss_pred c----cceeeecccccchhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHhcccccccEEEEchhHHHHHHhccccccCCcc Confidence 1 11111111111111 11 12344444444444 222 21 1 467899999999998776666433221 Q ss_pred cccCcccccceEEEE-eccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHH-Hhhhhhhcccceee Q lcl|Aclame:pro 221 ISQSGATINGFVLSS-YNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSD-ALLVGRTIEVTGDI 298 (402) Q Consensus 221 ~~~~g~~~~G~V~~i-aG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~-Av~tv~~~dl~~e~ 298 (402) .......-.+|.+ .|++||.=++.|...- +.+.. |.. .+.. -+++|.|= .++..+++ T Consensus 429 --~~~~~~~~~~g~l~~~~~vy~d~~~~~dy~-~~g~k------~~~----~~~~--~~~~y~Py~~l~~~~~~------ 487 (523) T protein:vir:59 429 --RDGGTGIFYVGMVQGRYRLYKNIYQNQPVI-IMGNQ------DLN----TPWQ--TGAVYAPYVPLLFTPTI------ 487 (523) T ss_pred --ccccccceeEEEecCceEEEecCCCCcceE-EEEec------ccC----Cccc--ccceecccchhhccccc------ Confidence 1111112235555 6889999988775221 11110 000 0111 25677654 33333332 Q ss_pred ccchhHHHHHHHHHHHhcCcccccceEEEEEEeeccC Q lcl|Aclame:pro 299 FYEKKEKTYYIDTFMAEGAIPDRWEAVSVVTTKRDAT 335 (402) Q Consensus 299 ~~d~~~~~d~i~~~~a~Ga~vlRPeaa~vv~~~~~~t 335 (402) .|+.+|--.|--+.=||..|.+|.+-+-+-.+---. 
T Consensus 488 -~dp~s~qp~~~~~tRY~l~v~nP~~~~~~~~~~~~~ 523 (523) T protein:vir:59 488 -VDPVNFSYRRGLMTRYALEVVRPEFYGLLYVKLLQP 523 (523) T ss_pred -ccCCcccceeeeeeehhheecchhHhhhhhhhhcCC Confidence 355666555555566888888998777665543211 No 197 >protein:vir:94528 Length: 286 # NCBI annotation: major head protein # Family: family:all:3269 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223889;genbank:gi:62327101;genbank:GeneID:5075544 Probab=50.68 E-value=0.6 Score=21.79 Aligned_cols=264 Identities=16% Similarity=0.107 Sum_probs=128.9 Q ss_pred CCCCcccccccccccccHHHHHHHHHhHHHHHHHHHHhhhcccce-eee---ccccceEEeeeccce--eeeeecCCCCC Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFD-VQT---VTGTNTVSNKYLGET--ELQVLAPGQSP 74 (402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~alfle~f~geV~t~f~~~sv~~~~~~-~rt---i~~Gksv~f~~iG~~--t~~~~~~G~~i 74 (402) |...|+. -..-.|-|+|.|.+.+-|+..+.|++..- ++. |++..++---...++ -++.|..++.. T Consensus 1 m~t~N~n---------~avr~Y~Kqf~glL~~vf~~qa~F~~~fgglQalDGV~~N~tafsvKt~D~pVVig~Y~TdeNv 71 (286) T protein:vir:94 1 MATTNND---------LPVRVYSKEFLQLLSTVYQAQSVFTPTFGALQALDGVPNNATAFSVKTNDMAVVVGEYSTDANT 71 (286) T ss_pred CCCCccc---------cceeehhHHHHHHHHHHHhhHHHhhhhhcchhhhhCCCccceEEEEeecCcceEEecccCCCcc Confidence 7665531 12335779999999999999999997653 222 333333311122222 23345545443 Q ss_pred C-CCC------ccccceeEeeccee-eccch--hhhHHHhhcCccchhHHH---HHHHHHHHHHHHHHHHHHHHHhhhhh Q lcl|Aclame:pro 75 N-ATP------TQADKNQLVIDTTV-IARNT--VAHIHDVQGDIDSLKPKL---AMNQAKQLKRLEDQMAIQQMLLGGIA 141 (402) Q Consensus 75 ~-~~~------~~~~e~~itID~~l-ya~~~--IddlDe~q~~~D~vrse~---s~~~G~aLA~~~Dq~i~~~l~kaA~~ 141 (402) - ++. .---+-++-+|+.. |..-| -.-||...-+=| +.... 
-..++.|-++.+|..+=.-|..+| T Consensus 72 ~FGtgTg~SsRFG~rkEi~y~dtdV~Y~~~~~iHEGiD~~TVNnd-~~aaVAdRL~lQA~Akt~~~n~~~Gk~ls~~A-- 148 (286) T protein:vir:94 72 AFGTGTSNSSRFGEMKEVIYADTDVPYTAGWAIHEGLDQMTVNND-LDAAVADRLNLQAQAKTRLFNVAMGEALATAG-- 148 (286) T ss_pred ccccCCccccccCceeeEEeecccccccccchhhhccccccccCC-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh-- Confidence 1 121 22234556666554 43333 233555554433 33322 234567777888765533332211 Q ss_pred ccccccccccccccccccccccCCccccccHHHHHHHHHHHHHHHHhhcCCc---cCcEEEeChHHHHHHhcccchhhcc Q lcl|Aclame:pro 142 NTKAERNKPRVKGHGFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDI---SDVAIMMPWKFFNALRDADRIVDKT 218 (402) Q Consensus 142 ~a~~~~~~~~~~g~~~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdVP~---~gR~~VV~P~~y~~Ll~~~r~~n~d 218 (402) +. ...+|.+..++.+|.|+.|.- ...-++|.|+.|.+|+.++-.+..- T Consensus 149 -------------------------~~----t~~~D~V~~LF~~as~~yvn~ev~~~~~ayV~~evYnaiiD~~l~TsaK 199 (286) T protein:vir:94 149 -------------------------TD----LGAVDDVNALFESAVEKYTDLEVIAPVRAYVTASVYNAIIDLANVTTAK 199 (286) T ss_pred -------------------------hh----hhhhhhHHHHHHHHHHHhhhhheeeeeEEEEchhHHHHHhccccccccc Confidence 00 112366777888888877753 2344899999999999988776442 Q ss_pred cccccCcccccceEEEEeccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceee Q lcl|Aclame:pro 219 YTISQSGATINGFVLSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDI 298 (402) Q Consensus 219 ~~~~~~g~~~~G~V~~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~ 298 (402) +.+.+ +-.-.+.+.-||.|.|.+.==..+. ....+..+ +|..|. 
-+.++++| ++|- T Consensus 200 -~SsaN--iDengi~~FkGf~i~e~P~~~~~g~---~aifs~dn--------------ig~aft--GIn~aR~I--esEd 255 (286) T protein:vir:94 200 -NSAVN--IDTNGMLSFRGIAITKVPTQYMGGK---AVIFAPDN--------------VARVFT--GINIARTI--QAID 255 (286) T ss_pred -cceee--eccCCcceecceEEeecchhhccCc---eEEEcccc--------------ceeeec--cceeeeee--eccc Confidence 22222 2234567889999988774111110 01111111 111110 11223333 2344 Q ss_pred ccchhHHHHHHHHHHHhcCcccccceEEEEEEeecc Q lcl|Aclame:pro 299 FYEKKEKTYYIDTFMAEGAIPDRWEAVSVVTTKRDA 334 (402) Q Consensus 299 ~~d~~~~~d~i~~~~a~Ga~vlRPeaa~vv~~~~~~ 334 (402) |....-|+ -==||-=++.-...++++..... T Consensus 256 F~GValQg-----AGK~G~~I~edNk~Ai~~~~~k~ 286 (286) T protein:vir:94 256 FAGVELQG-----AGKYGTFILDDNKKAIFTATPKA 286 (286) T ss_pred cCceeeec-----cccccccccccCceeEEEeecCC Confidence 43332222 11133334433344444333222 No 198 >protein:vir:3969 Length: 287 # NCBI annotation: major capsid protein # Family: family:all:3269 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:NP_663677;genbank:gi:21716114;genbank:GeneID:951200 Probab=35.02 E-value=1.3 Score=20.04 Aligned_cols=262 Identities=10% Similarity=0.052 Sum_probs=123.2 Q ss_pred cHHHHHHHHHhHHHHHHHHHHhhhcccce--eee---ccccceEEeeeccce--eeeeecCCCCCC-CC------Ccccc Q lcl|Aclame:pro 17 EVDSLLIEKFNGKVNEQYLKGENILSYFD--VQT---VTGTNTVSNKYLGET--ELQVLAPGQSPN-AT------PTQAD 82 (402) Q Consensus 17 d~~alfle~f~geV~t~f~~~sv~~~~~~--~rt---i~~Gksv~f~~iG~~--t~~~~~~G~~i~-~~------~~~~~ 82 (402) -..-.|-|+|.|.+.+-|+.++.|++..- ++. 
|+.-.+.---.+.++ -++-|..++..- ++ +.--- T Consensus 1 ~avr~y~Kq~~glL~~vf~~qa~F~~~FGg~lQ~~DGV~~N~taf~vKtsD~pVVi~~Y~Td~Nv~FGtGTg~ssRFG~r 80 (287) T protein:vir:39 1 MAIKYFTKQYAGMLPDLFAKKSAFLRAFGGVLQVKDGVTENDTFMELKVSDTDVVIQAYSTDANVGFGSGTGNTSRFGQR 80 (287) T ss_pred CCcccccHHHHHHHHHHHHHHHhhhhhcccceeeecCCcccceEEEEEecCcceEEecccCCCCcccccCCCccccccce Confidence 22345779999999999999999997653 222 333333322222222 234444444431 11 11123 Q ss_pred ceeEeeccee-eccch--hhhHHHhhcCccchhHHH---HHHHHHHHHHHHHHHHHHHHHhhhhhccccccccccccccc Q lcl|Aclame:pro 83 KNQLVIDTTV-IARNT--VAHIHDVQGDIDSLKPKL---AMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKGHG 156 (402) Q Consensus 83 e~~itID~~l-ya~~~--IddlDe~q~~~D~vrse~---s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~g~~ 156 (402) +-++-+|+.. |..-| -.-||+..-+=| +.... -..++.|-++.+|..+=..|...|-. T Consensus 81 kEi~y~dt~V~Y~~~~~ihEGiD~~TVNnd-~~aaVAdRL~Lqa~A~t~~~n~~~Gk~ls~~A~~--------------- 144 (287) T protein:vir:39 81 KEVKSVNKQVSYDAPLAINEGIDDFTVNDI-KDQVVAERLALHGVAWAQHVDKLLGKLLSDSASE--------------- 144 (287) T ss_pred eEEEEecccccceeccccccccccccccCC-hhHHHHHHHHhHHHHHHHHHHHHHHHHHHhhcch--------------- Confidence 4456666554 33322 233555554433 33222 34568888899997665444332210 Q ss_pred cccccccCCccccccHHHHHHHHHHHHHHHHhhcC----CccC-cEEEeChHHHHHHhcccchhhcccccccCcccccce Q lcl|Aclame:pro 157 FSINVNVTESEALANPQYVMAAVEYALEQQLEQEV----DISD-VAIMMPWKFFNALRDADRIVDKTYTISQSGATINGF 231 (402) Q Consensus 157 ~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdV----P~~g-R~~VV~P~~y~~Ll~~~r~~n~d~~~~~~g~~~~G~ 231 (402) .. ..+.+. |.+..++.++.|+.| -... -.++|+|+.|.+|+.++-.+..- +.+.+ +-.-. 
T Consensus 145 ---t~-----~~~~t~----d~V~~LF~~a~~~yvNn~v~~~~~~~AyV~aevYnaiiD~~l~TsaK-~SsaN--iDen~ 209 (287) T protein:vir:39 145 ---TL-----TVKLDE----DSVTKLFSDAHKKFVNNNVSIAVPWVAYVNADIYDLLIDSKLATTAK-NSSAN--VDEQT 209 (287) T ss_pred ---he-----eeeecc----cchHHHHHHHHHHhhccceeeEEEEEEEEChhHHhHHhccccccccc-cceee--eccCC Confidence 00 001121 234455556666555 3334 45899999999999988776442 22222 22345 Q ss_pred EEEEeccEEEecCccccccCccccccccccCCccccceeeeccceeEEeecHHHhhhhhhcccceeeccchhHHHHHHHH Q lcl|Aclame:pro 232 VLSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKEKTYYIDT 311 (402) Q Consensus 232 V~~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~~~~d~i~~ 311 (402) +.+.-||-|-|.+.--...+- ....+..+. |..|. -+.++++| ++|-|...--|+ T Consensus 210 i~kFkGf~l~e~P~~~~q~g~--~a~fs~dni--------------g~af~--GI~vaR~i--~sEdF~GvalQg----- 264 (287) T protein:vir:39 210 LYKFKGFILSELPDEKFQLNE--GAYFAADNV--------------GVAGV--GIQVTRAM--DSEDFAGTALQA----- 264 (287) T ss_pred cceecceEEEecchHhhccCc--EEEEccccc--------------eeecc--cceeEEee--ecccccceeeec----- Confidence 678899999887732111100 001111111 11110 01122222 233333222222 Q ss_pred HHHhcCcccccceEEEEEEeecc Q lcl|Aclame:pro 312 FMAEGAIPDRWEAVSVVTTKRDA 334 (402) Q Consensus 312 ~~a~Ga~vlRPeaa~vv~~~~~~ 334 (402) --=||-=++.-...++++....- T Consensus 265 AgK~G~~i~e~Nk~Ai~k~t~~k 287 (287) T protein:vir:39 265 AAKYGKYLPEKNKKAILKATVTK 287 (287) T ss_pred ccccccccccccceEEEEEecCC Confidence 11133333333333444332222 No 199 >protein:vir:96079 Length: 382 # NCBI annotation: hypothetical protein ORF023 # Family: family:all:1653 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294440;genbank:gi:149408337;genbank:GeneID:5237198 Probab=32.01 E-value=1.5 Score=19.69 Aligned_cols=298 Identities=11% Similarity=0.045 Sum_probs=122.9 Q ss_pred CCCCcccccccccccccHHHHHHHHHhHHHHHHHHHHhhhcccceeeeccc---cceEEeee---ccceeeeeecCCCCC Q lcl|Aclame:pro 1 MSTPNTLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDVQTVTG---TNTVSNKY---LGETELQVLAPGQSP 74 
(402) Q Consensus 1 Ms~~n~~t~~~~~~~~d~~alfle~f~geV~t~f~~~sv~~~~~~~rti~~---Gksv~f~~---iG~~t~~~~~~G~~i 74 (402) ++.+...+.|...++.-..+.|+.-|.-.|......--+.+.++.+.+. + -+++.|+. .|+.++. .-++++ T Consensus 61 ~amDa~~~~~~t~~~~g~p~~~l~~~~p~~~~~~~~p~~~~~l~pv~t~-g~W~~~t~ty~~~e~~G~A~~y--gd~~D~ 137 (382) T protein:vir:96 61 SAMDSNFTAPVTTPSIPTPIQFLQTWLPGFVKVMTAARKIDEIIGIDTV-GSWEDQEIVQGIVEPAGTAVEY--GDHTNI 137 (382) T ss_pred cccccccCCccccCCccHHHHHHhhhhhhhhhhhhhhhhhhhhcccccc-CCccceEEEEeeeecccceEEe--ecccCC Confidence 2223334445433333347889988886555555555556777877763 3 35667654 5777753 334444 Q ss_pred CCCC--ccccceeEeecceeeccchhhhHHHhhc---CccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccc Q lcl|Aclame:pro 75 NATP--TQADKNQLVIDTTVIARNTVAHIHDVQG---DIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNK 149 (402) Q Consensus 75 ~~~~--~~~~e~~itID~~lya~~~IddlDe~q~---~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~a~~~~~~ 149 (402) +-.. .+..++++..=+ ..+.+.++++.++ .+| +-.+-......+|.++.|+..|.=...+ ..... .+- T Consensus 138 Pl~d~~~~~~~r~v~~~~---~g~~yg~lE~~rAa~~~~~-l~~~Ka~aA~~ale~~~N~i~f~G~~~g--~~~~~-yGl 210 (382) T protein:vir:96 138 PLTSWNANFERRTIVRGE---LGLLVGTLEEGRASAIRLN-SAETKRQQAAIGLEIFRNAIGFYGWQSG--LGNRT-YGF 210 (382) T ss_pred CccccccceeEEEEEEEE---EeeeecHHHHHHHHhhCCC-cHHHHHHHHHHHHHHhhceEEEEeeecC--cCcce-EEE Confidence 2222 222333332222 3455567777664 555 3333334444555555554332100000 00000 000 Q ss_pred ccccccccccccccCCccccccHHHHHHHHHHHHHHHHhhcC----Ccc-CcEEEeChHHHHHHhcccchhhcccccccC Q lcl|Aclame:pro 150 PRVKGHGFSINVNVTESEALANPQYVMAAVEYALEQQLEQEV----DIS-DVAIMMPWKFFNALRDADRIVDKTYTISQS 224 (402) Q Consensus 150 ~~~~g~~~~~~v~~~~a~~~~~~~~l~dai~~a~~~LdekdV----P~~-gR~~VV~P~~y~~Ll~~~r~~n~d~~~~~~ 224 (402) .+.+..+...+ ...+.-...+++.+++-|..+..+|-...- |.. .-.++|||..|..|-.. 
| +|+.+-- T Consensus 211 lNdP~l~a~~t-~a~~~Wa~kT~~eI~~Di~~l~~~i~~qt~G~~~~~~~~~~L~LP~~~~~~Ls~~----n-~~g~Tvl 284 (382) T protein:vir:96 211 LNDPNLPPFQT-PPSQGWATADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITMALATSKVDYLSVT----T-PYGISVS 284 (382) T ss_pred EeCCCcccccc-cCCCCcccccHHHHHHHHHHHHHHHHhccCCeeeecccceEEeechHHHhhcccc----C-ccCccHH Confidence 00011111110 011112345788999999999998865542 433 34588999999888432 1 2221111 Q ss_pred cccccceEEEEeccEEEecCccccccCccccccccccCCccccceeeec------cceeEEeecHH--Hhhhhhhcccce Q lcl|Aclame:pro 225 GATINGFVLSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEM------NGAVAVLFTSD--ALLVGRTIEVTG 296 (402) Q Consensus 225 g~~~~G~V~~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad~------~~~~al~fh~~--Av~tv~~~dl~~ 296 (402) ..+ + .+..+++|+..+.|-....+..+ +....+-+..+. ...+...|... +-..+-...... T Consensus 285 ~~l-k---~n~Pnl~i~t~peL~~a~~~g~g------~~~~~~~~~~e~~~~~~~s~~~p~~f~q~~p~~~~~l~ve~~~ 354 (382) T protein:vir:96 285 DWI-E---QTYPKMRIVSAPELSGVQMQGKT------PEDALVLFVEEVDASVDGSTDGGSVFSQLVQSKFITLGVEKRA 354 (382) T ss_pred HHH-H---HhcCCcEEEEccccccccCCCcc------ceeEEEEecchhhhhcccccccCcceeccccceeeeccceeec Confidence 011 0 11235566666555321110000 000011111110 00111111100 000000000111 Q ss_pred eeccchhHHHHHHHHHHHhcCcccccceEEEEEEee Q lcl|Aclame:pro 297 DIFYEKKEKTYYIDTFMAEGAIPDRWEAVSVVTTKR 332 (402) Q Consensus 297 e~~~d~~~~~d~i~~~~a~Ga~vlRPeaa~vv~~~~ 332 (402) ..|..+.... .-|.-+.||.+++-+. -. 
T Consensus 355 ~~~~~~~s~~-------t~Gv~i~~P~ai~~~~-GI 382 (382) T protein:vir:96 355 KSYVEDFSNG-------TAGALCKRPWAVVRYL-GI 382 (382) T ss_pred ceeEeccccc-------eeeeEEEcchhhhhcc-CC Confidence 2222222222 2466666666543321 11 No 200 >protein:vir:103181 Length: 457 # NCBI annotation: gp135 # Family: family:all:364 # MgeID: mge:1583 # MgeName: Syn9 # Cross-refs: genbank:acc:YP_717802;genbank:gi:113200639;genbank:GeneID:4239190 Probab=26.67 E-value=1.9 Score=19.03 Aligned_cols=307 Identities=13% Similarity=0.065 Sum_probs=134.6 Q ss_pred CCCCcccc---ccccccc------ccHHHHHHHHHhHHHHHHHHHHhhhcccc---eeeeccccceEEee---------- Q lcl|Aclame:pro 1 MSTPNTLT---NVAVSAS------GEVDSLLIEKFNGKVNEQYLKGENILSYF---DVQTVTGTNTVSNK---------- 58 (402) Q Consensus 1 Ms~~n~~t---~~~~~~~------~d~~alfle~f~geV~t~f~~~sv~~~~~---~~rti~~Gksv~f~---------- 58 (402) |+-|..+- |..+++. +..-++| .|..+.|.-..-..... ..-...+++..-.+ T Consensus 97 mTgPTGLIFAmRsrY~~q~~~~~a~~~EAl~-----nEadt~fSg~~~~~~~~~~~~~~~~~gt~~~~~~~~~~~~~~~~ 171 (457) T protein:vir:10 97 MTGPTGLIFAMRTNYGAERNPAAAGYDEAFF-----NEPNAGFSGGPGAYDPGATGVTNDAEGTNPALLNDSPAGTYEQA 171 (457) T ss_pred CCCcceeeeeeeeeecCccccccccccceee-----eccCcccCcccccccccccccccccccccccccCcccccccccc Confidence 77775322 3333321 2233444 33333332211000000 00011111111111 Q ss_pred --eccceeeeeecCCCCCC-CC-CccccceeEeeccee--------eccchhhhHHHhhc-C-ccchhHHHHHHHHHHHH Q lcl|Aclame:pro 59 --YLGETELQVLAPGQSPN-AT-PTQADKNQLVIDTTV--------IARNTVAHIHDVQG-D-IDSLKPKLAMNQAKQLK 124 (402) Q Consensus 59 --~iG~~t~~~~~~G~~i~-~~-~~~~~e~~itID~~l--------ya~~~IddlDe~q~-~-~D~vrse~s~~~G~aLA 124 (402) .-|-.++. ++.+. +. .....|.-+.||..- -+.-.+.-.-|.++ | +| -.+|++.=+..++. 
T Consensus 172 ~~~~gmsTA~----aE~lgd~~~n~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAiHGLD-AEtELaNILStEIm 246 (457) T protein:vir:10 172 DDATGMSTAT----VEALDDSTANTAFREMGFSIEKVTVTARARALKAEYSIEMAQDLKAIHGLD-AEQELANILSTEIL 246 (457) T ss_pred ccccchhhhh----hhccCCCCCccchhhheeEEEEEEEeeeccceeccccHHHHHHHHHhcCCC-hhHHHHHHHHHHHH Confidence 11111111 11121 11 123456666666544 34555666667777 6 66 68999999999999 Q ss_pred HHHHHHHHHHHHhhhhhccccccccccccccccccccccCCccccccHHHHHHHHHHHHHHH----Hhh---cCCccCcE Q lcl|Aclame:pro 125 RLEDQMAIQQMLLGGIANTKAERNKPRVKGHGFSINVNVTESEALANPQYVMAAVEYALEQQ----LEQ---EVDISDVA 197 (402) Q Consensus 125 ~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~g~~~~~~v~~~~a~~~~~~~~l~dai~~a~~~L----dek---dVP~~gR~ 197 (402) .++++-|++.++.-|.. ...+. .....+....... ++.-..+.+..+.-++ .+- ----.+.| T Consensus 247 lEINReii~~l~~~a~~----~~~~~----~~~~gv~dl~~~~---~g~~~~e~~k~L~~~i~~ean~i~~~T~rg~gn~ 315 (457) T protein:vir:10 247 AEINREVVRTIYTNAVA----GAQNN----TATAGVFDLDVDS---NGRWSVEKFKGLLFQIERDANAIGHQTRRGKGNI 315 (457) T ss_pred HHhhHHHHHhHhhhhee----eeccc----cccceeeeeeccc---cchhhHHHHHHHHHHHHHHHHHHHHhhccccceE Confidence 99999999999865421 11111 1111121111111 1222223333332222 211 11236789 Q ss_pred EEeChHHHHHHhcccch--h---hcccccccCcccccceEEEE-eccEEEec----CccccccCccccccccccCCcccc Q lcl|Aclame:pro 198 IMMPWKFFNALRDADRI--V---DKTYTISQSGATINGFVLSS-YNCPVIPS----NRFPTFAQDQAHHLLSNEDNGYRY 267 (402) Q Consensus 198 ~VV~P~~y~~Ll~~~r~--~---n~d~~~~~~g~~~~G~V~~i-aG~~V~~S----NnlP~~~~~~t~~~ls~a~~G~~~ 267 (402) +|.+|+..++|-...-+ . +.+-+.+.-.......+|.+ .|++||.= ||-|... ... 
T Consensus 316 ~i~S~~Va~~L~~sg~l~~~p~~~~~~~~~~~d~~~~~~~G~l~~r~~vy~D~Ya~~ns~~dy------~~v-------- 381 (457) T protein:vir:10 316 LICSADVVSALGMAGVLDYTPALNGNNGLAGVDDTSSTLVGTLNGRIKVYVDPYSANVADKHF------YVA-------- 381 (457) T ss_pred EEEchhHHHHHhhcccccccchhhccccccccccccceeEEEecCCeEEEEecccccCCccce------EEE-------- Confidence 99999999998764332 1 11101111112344557776 57888877 6655422 110 Q ss_pred ceeeeccceeEEeecHHH-hhhhhhcccceeeccchhHHHHHHHHHHHhcCcccccceEEEEEEeeccCccccccchhhH Q lcl|Aclame:pro 268 DPIAEMNGAVAVLFTSDA-LLVGRTIEVTGDIFYEKKEKTYYIDTFMAEGAIPDRWEAVSVVTTKRDATTGDAGGPGDDH 346 (402) Q Consensus 268 ~~~ad~~~~~al~fh~~A-v~tv~~~dl~~e~~~d~~~~~d~i~~~~a~Ga~vlRPeaa~vv~~~~~~t~~~a~~~~~~~ 346 (402) .|.++-.-.-+++|.|=- +--++. -|+.+|--.|--+.=||- +.+|.+.+. +-+.. ....+- T Consensus 382 G~KG~~~~~~glfy~PYv~l~~~~~--------~dp~sfqP~~g~~tRY~l-~~NP~~~~~-------~~~~~-~~~~~~ 444 (457) T protein:vir:10 382 GYKGTSPYDAGLFYCPYVPLQQVRA--------INPDTFQPKIGFKTRYGM-VSNPFAGGL-------TQGSG-ALTVNA 444 (457) T ss_pred EEeCCcceecceeecccccccccCc--------cCCccccceeeeeeeeee-eeccccccc-------ccccc-cccccc Confidence 011222223456776541 112222 255555554444444665 667764321 11111 122333 Q ss_pred HHhhhcccceEEE Q lcl|Aclame:pro 347 ATVLARAQRKAVY 359 (402) Q Consensus 347 ~~~~~~~~~~~~~ 359 (402) ....-|.+=+-.+ T Consensus 445 n~~~~rs~vs~ll 457 (457) T protein:vir:10 445 NKYYRRVQVANLM 457 (457) T ss_pred hhhcceeeeeecC Confidence 3444445544444 No 201 >protein:vir:107732 Length: 379 # NCBI annotation: gp23 # Family: family:all:1653 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024871;genbank:gi:48697513;genbank:GeneID:2948349 Probab=25.58 E-value=2 Score=18.89 Aligned_cols=292 Identities=11% Similarity=0.009 Sum_probs=116.7 Q ss_pred CCC--Ccccccccc-------cccccHHHHHHHHHhHHHHHHHHHHhhhcccceeeecc--ccceEEeee---ccceeee Q lcl|Aclame:pro 1 MST--PNTLTNVAV-------SASGEVDSLLIEKFNGKVNEQYLKGENILSYFDVQTVT--GTNTVSNKY---LGETELQ 66 (402) Q Consensus 1 
Ms~--~n~~t~~~~-------~~~~d~~alfle~f~geV~t~f~~~sv~~~~~~~rti~--~Gksv~f~~---iG~~t~~ 66 (402) |.+ +...+.|.+ ..+..-.=.||.-|-=.+....-.--+...++.+.+.= .-+++.|+. .|+.++. T Consensus 52 ~~~amd~~~~~~~~~~~~~l~~~~~~g~~~~l~~~~p~~i~~~tap~~a~~l~pv~t~g~W~~~~~~~~v~e~~G~A~~y 131 (379) T protein:vir:10 52 MQFAMDSNDIGPIPTPLSPLSPVSIPGLIQFLQNWLPGHVRILTAVREADEFLGLSTVGQWDDEQIVQRVLEGLGTAQPY 131 (379) T ss_pred hhhhhccccccccccccCccccccccchHHHHHhhcchHHHHHhhhhhhhhhcccccCCCceeeeEEEeeeeeeeeeEEe Confidence 222 222222211 11111222477777644444444555557777777731 135666554 4666642 Q ss_pred eecCCCCCCCCCccccceeEeecceee-ccchhhhHHHhh---cCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhc Q lcl|Aclame:pro 67 VLAPGQSPNATPTQADKNQLVIDTTVI-ARNTVAHIHDVQ---GDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIAN 142 (402) Q Consensus 67 ~~~~G~~i~~~~~~~~e~~itID~~ly-a~~~IddlDe~q---~~~D~vrse~s~~~G~aLA~~~Dq~i~~~l~kaA~~~ 142 (402) .-+.++.--....+...-.| ..+ ..+.+.+++... ..++ +..+-.+....+|.++.|+..| -+ .-. T Consensus 132 --gd~~d~pl~d~~~~~~~r~v--~~~~~g~~yg~~El~~Aa~~g~~-l~~~Ka~aA~~ale~~~N~i~f----~G-~~d 201 (379) T protein:vir:10 132 --TDGGNMALMSWTPTFETRTV--VRFEAGLQVAPLEEARSSRVQVS-SADEKRAMVGEALEVQRNRVAF----YG-YND 201 (379) T ss_pred --ccccCCCeeeeeeeeeeeee--EEEEEEEeecHHHHHHHHHhCCC-hHHHHHHHHHHHHHHhhceEEE----Ee-ecC Confidence 32333311111112111111 112 122233333322 2333 3333333334444444443222 11 000 Q ss_pred ccccc-ccccccccccccccccC--Ccc--ccccHHHHHHHHHHHHHHHHhh---c-CCccCc-EEEeChHHHHHHhccc Q lcl|Aclame:pro 143 TKAER-NKPRVKGHGFSINVNVT--ESE--ALANPQYVMAAVEYALEQQLEQ---E-VDISDV-AIMMPWKFFNALRDAD 212 (402) Q Consensus 143 a~~~~-~~~~~~g~~~~~~v~~~--~a~--~~~~~~~l~dai~~a~~~Ldek---d-VP~~gR-~~VV~P~~y~~Ll~~~ 212 (402) +.... +-.+.+......+...+ +.+ ...+++.+++-|..+..+|-.+ . .|..-+ .++|||.+|..|-.-. 
T Consensus 202 ~~~~~yGllNdP~l~a~~t~atg~~~~t~Wa~kT~~eI~~Di~~~~~~l~~qs~g~~~~~~~~~tL~LP~~~~~~L~~~n 281 (379) T protein:vir:10 202 GSGRTFGFLNDPNLPAYVAVPNGAGGSPLWAQKTTLEIIADLRNGLTALQVQSMGRIKSNKTPITIGIPNAYENYITTPT 281 (379) T ss_pred CCcceEEEEeCCCCcccccccCCcccccccccCCHHHHHHHHHHHHHHHHHhhCCeecccccceeEEecHHHHHhhcccc Confidence 00000 00111111111111111 111 2347888998888888876543 2 254444 6899999999996431 Q ss_pred chhhcccccccCcccccceEEEEeccEEEecCccccccCccccccccccCCccccceeee-------ccceeEEeecHH- Q lcl|Aclame:pro 213 RIVDKTYTISQSGATINGFVLSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAE-------MNGAVAVLFTSD- 284 (402) Q Consensus 213 r~~n~d~~~~~~g~~~~G~V~~iaG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~~~ad-------~~~~~al~fh~~- 284 (402) +|+.+--..+ + .+..+++|+..+.|-..+++.. ..+-|..+ ...++-..|+.+ T Consensus 282 -----~~g~Tvl~~l-k---~n~Pnl~i~t~pEL~~aggg~~----------~~~~~~~~~~~~~t~~~~~~~~~~p~k~ 342 (379) T protein:vir:10 282 -----ELGYSVAQYM-R---ESYPNVTFVSAPELNDANGGSS----------AIYYYADAVENNGTDDGRTWLQVVPTKM 342 (379) T ss_pred -----ccCccHHHHH-H---HhcCCcEEEEcccccccCCCcc----------EEEEEeeccCCCccCCcceEEEecchhh Confidence 2211111011 1 1134677888777743211110 11111111 011222233221 Q ss_pred HhhhhhhcccceeeccchhHHHHHHHHHHHhcCcccccceEEEEEEe Q lcl|Aclame:pro 285 ALLVGRTIEVTGDIFYEKKEKTYYIDTFMAEGAIPDRWEAVSVVTTK 331 (402) Q Consensus 285 Av~tv~~~dl~~e~~~d~~~~~d~i~~~~a~Ga~vlRPeaa~vv~~~ 331 (402) -..-++... ..|..+.... ..|.-+.||-+++-+.=. 
T Consensus 343 ~~l~ve~~~---~~~~~~~~~r-------t~Gv~ir~P~Ai~~~~G~ 379 (379) T protein:vir:10 343 FTLGVEKKI---KGYAEGYTNA-------TAGAMLKRPFATYRQTGA 379 (379) T ss_pred hhccceecC---ceeEeccccc-------eeeeeeecchhhheecCC Confidence 111222222 2233343333 358888888875444322 No 202 >protein:vir:5670 Length: 514 # NCBI annotation: gp23 # Family: family:all:364 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899609;genbank:gi:34419596;genbank:GeneID:2546039 Probab=24.97 E-value=2.1 Score=18.80 Aligned_cols=319 Identities=14% Similarity=0.050 Sum_probs=128.4 Q ss_pred CCCCccc--cccc-----------ccccccHHHHHHHHHhHHHHHHHHHHh--hh-------cccceeeeccccceEEee Q lcl|Aclame:pro 1 MSTPNTL--TNVA-----------VSASGEVDSLLIEKFNGKVNEQYLKGE--NI-------LSYFDVQTVTGTNTVSNK 58 (402) Q Consensus 1 Ms~~n~~--t~~~-----------~~~~~d~~alfle~f~geV~t~f~~~s--v~-------~~~~~~rti~~Gksv~f~ 58 (402) |-.+|+. .+.+ ....|+........-.|.+...+.... +. .+......+.+|...-+ T Consensus 142 ~nEadt~fSG~~~~~~~~~~~~~~~~~~G~~~~~~~t~~~gd~~~~~~~~~~~~~~~~~~~~~~t~~~~~~a~~~~y~~- 220 (514) T protein:vir:56 142 TRQADASFSGQAAASTIADFPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVTLAVAGQMTATEYTDGVAGGLLVEI- 220 (514) T ss_pred ccccCcCccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccchhhhh- Confidence 2222210 0000 000000000000011111100000000 00 00000011111111100 Q ss_pred eccceeeeeecCCCC---CCC-CCccccceeEeeccee--------eccchhhhHHHhhc-C-ccchhHHHHHHHHHHHH Q lcl|Aclame:pro 59 YLGETELQVLAPGQS---PNA-TPTQADKNQLVIDTTV--------IARNTVAHIHDVQG-D-IDSLKPKLAMNQAKQLK 124 (402) Q Consensus 59 ~iG~~t~~~~~~G~~---i~~-~~~~~~e~~itID~~l--------ya~~~IddlDe~q~-~-~D~vrse~s~~~G~aLA 124 (402) .-|..+.. ++. +.+ ......|.-+.||..- -+.-.+.---|.++ | +| -.+|++.=++.++. 
T Consensus 221 ~~Gm~Ta~----aEal~~lggs~~~~f~EMaFsIdK~tVtAKSRaLKAEYTiELAQDLKAVHGLD-AEtELsNILSTEIm 295 (514) T protein:vir:56 221 DAGMATSQ----AELQENFNGSSNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLD-ADAELSGILANEVM 295 (514) T ss_pred hhhhhhhh----hhhcccCCCCcccccceeeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCC-hHHHHHHHHHHHHH Confidence 01111111 111 111 1223456666666543 34555666667777 6 67 58999999999999 Q ss_pred HHHHHHHHHHHHhhhhhccccccccccccccccccccccCCccccccHHHHHHHHHHHHHHHH-hhc-----CC-ccCcE Q lcl|Aclame:pro 125 RLEDQMAIQQMLLGGIANTKAERNKPRVKGHGFSINVNVTESEALANPQYVMAAVEYALEQQL-EQE-----VD-ISDVA 197 (402) Q Consensus 125 ~~~Dq~i~~~l~kaA~~~a~~~~~~~~~~g~~~~~~v~~~~a~~~~~~~~l~dai~~a~~~Ld-ekd-----VP-~~gR~ 197 (402) .++++-|++.+..-+.-.. .....+.+...+.......+...+-.+++.+..+..+++ |.| -- -.+.| T Consensus 296 lEINReii~~l~~~atv~~-----~~~~~~~~~~G~~d~~~~~d~~~~~~~~e~~~~l~~~i~~~an~i~~~T~rg~gn~ 370 (514) T protein:vir:56 296 VELNREIVNLVNSQAQIGK-----SGWTQGAGAAGVFDFSDAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNF 370 (514) T ss_pred HHhhHHHHHHHHhheeehh-----cccccccccccccccccccccccchHHHHHHHHHHHHHHHHHHHHHhhcccccccE Confidence 9999999888865442211 111222222223222222222223335666666666666 333 12 25789 Q ss_pred EEeChHHHHHHhcccc--------hhhcccccccCcccccceEEEE-eccEEEecCccccccCccccccccccCCccccc Q lcl|Aclame:pro 198 IMMPWKFFNALRDADR--------IVDKTYTISQSGATINGFVLSS-YNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYD 268 (402) Q Consensus 198 ~VV~P~~y~~Ll~~~r--------~~n~d~~~~~~g~~~~G~V~~i-aG~~V~~SNnlP~~~~~~t~~~ls~a~~G~~~~ 268 (402) +|.+|+..++|-...- +.+..+.....+. -..+.+ .|++||.=++.|... .+. 
- T Consensus 371 ~i~S~~Va~~L~~sg~l~~~~~~g~~~~~~~~d~~~~---~~aG~l~~~~~vy~D~y~~~dy------~~v--------G 433 (514) T protein:vir:56 371 IIASRNVVSALSMTDTLVGPAAQGMQDGSMNTDTNQT---VFAGVLGGRFKVYIDQYAVNDY------FTV--------G 433 (514) T ss_pred EEEchhHHHHHHhhhhhccccccCccccccccccCcc---eEEEEecCceEEEecCCCCcce------EEE--------E Confidence 9999999999865332 2322222111111 123443 789999999887522 111 1 Q ss_pred eeeeccceeEEeecHHHhhhhhhcccceeeccchhHHHHHH--HHHHHhcCcccccceEEEEEEeeccCccccccchhhH Q lcl|Aclame:pro 269 PIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKEKTYYI--DTFMAEGAIPDRWEAVSVVTTKRDATTGDAGGPGDDH 346 (402) Q Consensus 269 ~~ad~~~~~al~fh~~Av~tv~~~dl~~e~~~d~~~~~d~i--~~~~a~Ga~vlRPeaa~vv~~~~~~t~~~a~~~~~~~ 346 (402) |.++-.-.-+++|.|= |++ +.-..+|+.+|--.| ..+|++...+.-.+-+..+. .++.-... T Consensus 434 ~KG~~~~~~glfyaPY----v~l---~~~~~~dp~sfqP~~g~~tRY~l~~NPy~~~~~~~~~---------~~~~~~~~ 497 (514) T protein:vir:56 434 FKGSTEMDAGVFYSPY----VPL---TPLRGSDSKNFQPVIGFKTRYGVQVNPFADPTASATK---------VGNGAPVA 497 (514) T ss_pred EecCcceecceeeccc----ccc---ccccccCCccccceeeeeeeeceeeCCCCCccccccc---------cCCcchhh Confidence 1122222346777665 222 222335666665443 33333333333222111111 11111111 Q ss_pred HHhhhcccceEEEeecc Q lcl|Aclame:pro 347 ATVLARAQRKAVYVKTE 363 (402) Q Consensus 347 ~~~~~~~~~~~~~~~~~ 363 (402) +...+-+==.-+.+++- T Consensus 498 a~~~~n~y~r~v~v~~l 514 (514) T protein:vir:56 498 ASMGKNAYFRRVFVKGL 514 (514) T ss_pred hcccccceeeeEEEecC Confidence 11011111122344444 Done!