Query lcl|Aclame:protein:vir:8885|NCBI_annot:major capsid protein A|genbank:acc:NP_813774;genbank:gi:29366729;genbank:GeneID:1258837 Match_columns 347 No_of_seqs 144 out of 168 Neff 7.6 Searched_HMMs 1612 Date Sat Nov 30 07:56:38 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_22 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_22_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:8885 Length: 347 # 100.0 1E-112 6E-116 634.6 29.8 347 1-347 1-347 (347) 2 protein:vir:94576 Length: 347 100.0 4E-112 2E-115 631.3 30.1 346 1-346 1-347 (347) 3 protein:vir:94711 Length: 347 100.0 6E-108 4E-111 608.4 30.5 345 1-347 1-346 (347) 4 protein:vir:10450 Length: 344 100.0 3E-106 2E-109 599.4 30.0 342 1-346 1-344 (344) 5 protein:vir:3364 Length: 347 # 100.0 4E-105 2E-108 593.1 28.8 344 1-347 1-346 (347) 6 protein:vir:100057 Length: 375 100.0 9E-105 6E-108 590.9 25.5 345 1-347 1-371 (375) 7 protein:vir:1541 Length: 347 # 100.0 2E-103 1E-106 583.2 27.3 344 1-347 1-346 (347) 8 protein:vir:2201 Length: 345 # 100.0 3E-102 2E-105 577.0 27.9 341 1-346 1-345 (345) 9 protein:vir:103323 Length: 364 100.0 1E-100 9E-104 567.9 25.1 334 1-347 1-340 (364) 10 protein:vir:80213 Length: 334 100.0 2.3E-99 1E-102 561.3 27.0 327 1-347 1-333 (334) 11 protein:vir:6324 Length: 335 # 100.0 3E-99 2E-102 560.7 25.1 322 1-347 1-329 (335) 12 protein:vir:78935 Length: 335 100.0 2.9E-98 2E-101 555.3 25.6 323 1-347 1-329 (335) 13 protein:vir:97031 Length: 402 100.0 2.7E-97 2E-100 550.0 26.1 328 1-347 1-340 (402) 14 protein:vir:78739 Length: 332 100.0 5.8E-97 4E-100 548.2 25.0 318 1-344 7-332 (332) 15 protein:vir:7019 Length: 401 # 100.0 2.9E-95 1.8E-98 538.9 26.6 327 1-347 1-334 (401) 16 protein:vir:105645 Length: 400 100.0 5E-95 3.1E-98 537.6 27.1 328 1-347 1-334 (400) 17 protein:vir:99675 Length: 324 100.0 5.7E-89 3.5E-92 504.4 23.8 296 50-347 1-297 (324) 18 protein:vir:94622 Length: 341 100.0 1.2E-72 7.4E-76 414.9 26.1 324 1-347 1-340 (341) 19 protein:vir:80180 Length: 381 100.0 1.6E-67 9.6E-71 386.9 22.9 331 1-346 1-381 (381) 20 protein:vir:105822 Length: 273 100.0 3.4E-58 2.1E-61 335.6 19.2 266 1-346 1-273 (273) 21 protein:vir:102605 Length: 273 100.0 3.4E-58 2.1E-61 335.6 19.2 266 1-346 1-273 (273) 22 protein:vir:7990 Length: 273 # 100.0 2.7E-56 1.7E-59 325.2 19.0 266 1-346 1-273 (273) 23 protein:vir:3136 Length: 322 # 100.0 5.9E-57 3.6E-60 328.9 15.1 305 1-347 1-319 (322) 24 protein:vir:102655 Length: 322 100.0 1.1E-54 7.1E-58 316.3 21.4 311 1-347 1-322 (322) 25 protein:vir:1781 Length: 221 # 100.0 7E-54 4.4E-57 312.0 14.1 217 95-338 1-221 (221) 26 protein:vir:80930 Length: 278 100.0 2.1E-43 1.3E-46 254.6 17.8 271 1-347 1-278 (278) 27 protein:vir:97331 Length: 319 100.0 4.8E-42 3E-45 247.1 18.2 285 1-347 5-295 (319) 28 protein:vir:94800 Length: 319 100.0 4.8E-42 3E-45 247.1 18.2 285 1-347 5-295 (319) 29 protein:vir:107120 Length: 329 100.0 4E-42 2.5E-45 247.5 17.7 285 1-347 16-306 (329) 30 protein:vir:108303 Length: 418 100.0 1.9E-41 1.2E-44 243.8 17.6 298 1-347 1-417 (418) 31 protein:vir:96123 Length: 274 100.0 2.9E-41 1.8E-44 242.8 17.9 265 1-347 1-271 (274) 32 protein:vir:93742 Length: 274 100.0 3.4E-40 2.1E-43 237.0 17.2 265 1-347 1-271 (274) 33 protein:vir:96833 Length: 275 100.0 1.4E-39 8.9E-43 233.5 17.6 266 1-347 1-272 (275) 34 protein:vir:1239 Length: 274 # 100.0 1.9E-39 1.2E-42 232.9 18.0 265 1-347 1-271 (274) 35 protein:vir:94494 Length: 274 100.0 2.2E-39 1.4E-42 232.5 17.3 265 1-347 1-271 (274) 36 protein:vir:97433 Length: 274 100.0 2.2E-39 1.4E-42 232.5 17.3 265 1-347 1-271 (274) 37 protein:vir:3525 Length: 423 # 100.0 5.6E-39 3.5E-42 230.3 19.1 301 1-345 1-423 (423) 38 protein:vir:95898 Length: 274 100.0 1.8E-39 1.1E-42 232.9 16.5 264 1-347 1-271 (274) 39 protein:vir:96262 Length: 274 100.0 1.8E-39 1.1E-42 232.9 16.5 264 1-347 1-271 (274) 40 protein:vir:99075 Length: 392 100.0 4.2E-39 2.6E-42 231.0 18.4 287 1-347 1-315 (392) 41 protein:vir:3613 Length: 272 # 100.0 3.9E-39 2.4E-42 231.2 17.3 267 1-346 1-272 (272) 42 protein:vir:174 Length: 423 # 100.0 1.4E-38 8.5E-42 228.2 19.6 301 1-345 1-423 (423) 43 protein:vir:105374 Length: 423 100.0 1.5E-38 9.2E-42 228.0 19.4 301 1-345 1-423 (423) 44 protein:vir:105522 Length: 423 100.0 2.9E-36 1.8E-39 215.4 18.1 301 1-345 1-423 (423) 45 protein:vir:79008 Length: 299 100.0 1.5E-35 9.3E-39 211.5 18.2 284 1-347 1-297 (299) 46 protein:vir:105334 Length: 276 100.0 1.4E-35 8.5E-39 211.7 17.4 265 1-347 1-271 (276) 47 protein:vir:9820 Length: 272 # 100.0 1.2E-35 7.2E-39 212.1 16.1 264 1-347 1-270 (272) 48 protein:vir:3033 Length: 272 # 100.0 1.2E-35 7.2E-39 212.1 16.1 264 1-347 1-270 (272) 49 protein:vir:78920 Length: 290 100.0 1.2E-31 7.6E-35 190.1 17.7 281 1-346 1-290 (290) 50 protein:vir:105464 Length: 346 99.9 3.2E-29 2E-32 176.8 17.4 284 1-347 1-301 (346) 51 protein:vir:102335 Length: 312 99.9 1.6E-28 9.8E-32 173.0 17.7 299 1-347 1-308 (312) 52 protein:vir:739 Length: 231 # 99.9 7.8E-29 4.8E-32 174.7 12.8 229 51-346 1-231 (231) 53 protein:vir:95107 Length: 270 99.9 9.9E-28 6.2E-31 168.6 16.3 262 1-347 1-266 (270) 54 protein:vir:99523 Length: 311 99.9 7.5E-24 4.7E-27 147.4 17.1 297 17-347 1-311 (311) 55 protein:vir:79712 Length: 285 99.9 8E-24 4.9E-27 147.2 15.5 266 1-347 1-284 (285) 56 protein:vir:78090 Length: 302 99.8 3.9E-22 2.4E-25 137.9 16.1 285 1-347 1-300 (302) 57 protein:vir:95451 Length: 313 99.7 5.6E-20 3.5E-23 126.1 11.0 298 17-347 1-312 (313) 58 protein:vir:100939 Length: 430 99.7 8.2E-19 5.1E-22 119.7 15.6 303 1-347 1-430 (430) 59 protein:vir:9265 Length: 430 # 99.7 8.2E-19 5.1E-22 119.7 15.6 303 1-347 1-430 (430) 60 protein:vir:2106 Length: 430 # 99.7 1.8E-18 1.1E-21 117.9 16.3 302 1-347 1-430 (430) 61 protein:vir:78523 Length: 338 99.6 8.1E-18 5E-21 114.3 15.1 308 1-347 1-336 (338) 62 protein:vir:41 Length: 299 # N 99.6 1E-16 6.5E-20 108.2 16.3 282 10-347 1-299 (299) 63 protein:vir:78223 Length: 333 99.6 1.6E-16 9.8E-20 107.2 16.0 308 1-347 1-333 (333) 64 protein:vir:6242 Length: 390 # 99.5 8.9E-17 5.5E-20 108.6 12.6 290 1-347 93-390 (390) 65 protein:vir:1328 Length: 392 # 99.5 1.5E-16 9.5E-20 107.3 13.5 293 1-347 97-392 (392) 66 protein:vir:4700 Length: 415 # 99.5 2.8E-15 1.7E-18 100.4 16.2 291 1-347 113-405 (415) 67 protein:vir:4600 Length: 415 # 99.5 2.8E-15 1.7E-18 100.4 16.2 291 1-347 113-405 (415) 68 protein:vir:4511 Length: 409 # 99.5 2.8E-15 1.7E-18 100.4 15.9 298 1-347 99-407 (409) 69 protein:vir:8102 Length: 543 # 99.5 1.9E-15 1.2E-18 101.3 14.7 288 1-347 245-543 (543) 70 protein:vir:81100 Length: 415 99.5 2.5E-15 1.5E-18 100.7 15.1 295 1-347 109-405 (415) 71 protein:vir:79987 Length: 415 99.5 2.5E-15 1.5E-18 100.7 15.1 295 1-347 109-405 (415) 72 protein:vir:98339 Length: 415 99.5 2.5E-15 1.5E-18 100.7 15.1 295 1-347 109-405 (415) 73 protein:vir:7771 Length: 330 # 99.5 4E-15 2.5E-18 99.5 16.1 298 1-347 1-324 (330) 74 protein:vir:9410 Length: 415 # 99.4 3.8E-15 2.4E-18 99.6 14.3 295 1-347 109-405 (415) 75 protein:vir:485 Length: 407 # 99.4 1.6E-14 1E-17 96.2 17.4 296 1-347 90-401 (407) 76 protein:vir:105905 Length: 304 99.4 2.1E-14 1.3E-17 95.6 15.0 285 1-345 1-304 (304) 77 protein:vir:94142 Length: 304 99.4 2.1E-14 1.3E-17 95.6 15.0 285 1-345 1-304 (304) 78 protein:vir:4339 Length: 395 # 99.4 3.9E-14 2.4E-17 94.1 16.1 291 1-346 98-395 (395) 79 protein:vir:104085 Length: 320 99.4 3.2E-14 2E-17 94.5 15.6 296 1-347 1-320 (320) 80 protein:vir:1638 Length: 298 # 99.4 3.7E-14 2.3E-17 94.2 15.7 283 1-345 1-298 (298) 81 protein:vir:9309 Length: 324 # 99.4 3.6E-14 2.2E-17 94.3 15.6 286 1-347 14-316 (324) 82 protein:vir:3870 Length: 400 # 99.4 8.7E-15 5.4E-18 97.7 12.1 278 1-347 120-400 (400) 83 protein:vir:8187 Length: 311 # 99.4 5.1E-14 3.2E-17 93.4 16.2 292 1-347 1-311 (311) 84 protein:vir:97053 Length: 390 99.4 2.5E-14 1.6E-17 95.2 14.5 287 1-344 92-390 (390) 85 protein:vir:1886 Length: 385 # 99.4 1.6E-14 9.7E-18 96.3 13.3 287 1-347 93-385 (385) 86 protein:vir:191 Length: 385 # 99.4 1.6E-14 9.7E-18 96.3 13.3 287 1-347 93-385 (385) 87 protein:vir:80684 Length: 315 99.4 4.9E-14 3E-17 93.6 15.4 287 1-347 1-307 (315) 88 protein:vir:10364 Length: 390 99.3 5.8E-14 3.6E-17 93.2 15.2 279 1-344 107-390 (390) 89 protein:vir:9574 Length: 300 # 99.3 1.8E-13 1.1E-16 90.5 17.7 284 1-346 1-300 (300) 90 protein:vir:94771 Length: 298 99.3 1.2E-13 7.5E-17 91.4 16.3 283 1-345 1-298 (298) 91 protein:vir:4456 Length: 401 # 99.3 1E-13 6.4E-17 91.8 15.9 281 1-346 107-401 (401) 92 protein:vir:96223 Length: 324 99.3 4.8E-14 3E-17 93.6 14.1 285 1-347 15-316 (324) 93 protein:vir:103955 Length: 324 99.3 9.5E-14 5.9E-17 92.0 15.7 282 1-347 18-316 (324) 94 protein:vir:96392 Length: 324 99.3 5.2E-14 3.2E-17 93.4 14.3 286 1-347 14-316 (324) 95 protein:vir:78830 Length: 324 99.3 5.2E-14 3.2E-17 93.4 14.3 286 1-347 14-316 (324) 96 protein:vir:9759 Length: 303 # 99.3 1.7E-13 1.1E-16 90.6 16.7 287 1-346 1-303 (303) 97 protein:vir:99749 Length: 324 99.3 1.1E-13 6.7E-17 91.7 15.3 282 1-347 18-316 (324) 98 protein:vir:100247 Length: 425 99.3 1E-13 6.5E-17 91.8 15.2 288 1-347 126-425 (425) 99 protein:vir:94673 Length: 419 99.3 6.5E-14 4E-17 92.9 14.0 295 1-347 110-418 (419) 100 protein:vir:4856 Length: 293 # 99.3 1.7E-13 1.1E-16 90.5 15.4 275 1-347 1-282 (293) 101 protein:vir:100135 Length: 418 99.3 1.4E-13 8.9E-17 91.0 14.7 288 1-347 121-416 (418) 102 protein:vir:2430 Length: 318 # 99.3 2.5E-13 1.6E-16 89.6 15.6 278 1-347 14-314 (318) 103 protein:vir:81070 Length: 390 99.3 1.5E-13 9.6E-17 90.8 14.4 285 1-344 101-390 (390) 104 protein:vir:4830 Length: 397 # 99.3 5.5E-13 3.4E-16 87.8 17.4 282 1-347 98-386 (397) 105 protein:vir:2344 Length: 397 # 99.3 1.3E-13 7.8E-17 91.3 13.7 285 1-347 1-307 (397) 106 protein:vir:99920 Length: 311 99.3 6.1E-13 3.8E-16 87.5 17.3 297 1-346 1-311 (311) 107 protein:vir:104256 Length: 458 99.3 1.5E-13 9.1E-17 90.9 13.5 294 1-346 143-458 (458) 108 protein:vir:97148 Length: 324 99.3 1.9E-13 1.2E-16 90.3 14.1 287 1-347 1-316 (324) 109 protein:vir:4997 Length: 397 # 99.3 8.4E-13 5.2E-16 86.8 16.9 282 1-347 98-386 (397) 110 protein:vir:95763 Length: 297 99.2 3.2E-13 2E-16 89.1 14.3 280 1-347 1-297 (297) 111 protein:vir:102119 Length: 404 99.2 7.7E-13 4.8E-16 87.0 16.2 298 1-347 92-401 (404) 112 protein:vir:3991 Length: 404 # 99.2 1.6E-12 1E-15 85.2 15.7 285 1-347 101-394 (404) 113 protein:vir:81160 Length: 371 99.2 2.9E-12 1.8E-15 83.9 16.9 274 1-346 91-371 (371) 114 protein:vir:4953 Length: 397 # 99.2 2E-12 1.3E-15 84.7 15.7 280 1-347 97-386 (397) 115 protein:vir:4226 Length: 326 # 99.1 4.5E-12 2.8E-15 82.8 16.0 295 1-347 1-324 (326) 116 protein:vir:9704 Length: 394 # 99.1 3.2E-12 2E-15 83.6 15.2 269 1-347 121-391 (394) 117 protein:vir:101607 Length: 379 99.1 4E-12 2.5E-15 83.1 15.6 273 1-346 98-379 (379) 118 protein:vir:7409 Length: 408 # 99.1 3.6E-12 2.2E-15 83.3 15.4 285 1-347 101-394 (408) 119 protein:vir:95376 Length: 425 99.1 2.5E-12 1.6E-15 84.1 14.5 293 1-347 119-422 (425) 120 protein:vir:1383 Length: 421 # 99.1 3.1E-12 1.9E-15 83.7 14.9 276 1-347 104-384 (421) 121 protein:vir:1025 Length: 408 # 99.1 4.1E-12 2.5E-15 83.0 15.2 285 1-347 101-394 (408) 122 protein:vir:2504 Length: 305 # 99.1 1.9E-12 1.2E-15 84.9 13.2 283 1-347 1-299 (305) 123 protein:vir:1268 Length: 397 # 99.1 3.3E-12 2.1E-15 83.5 14.0 282 1-346 102-397 (397) 124 protein:vir:5974 Length: 324 # 99.1 7.4E-12 4.6E-15 81.6 15.6 279 1-347 1-295 (324) 125 protein:vir:81227 Length: 413 99.1 5.3E-12 3.3E-15 82.4 14.8 291 1-347 105-411 (413) 126 protein:vir:100172 Length: 394 99.1 1.7E-11 1.1E-14 79.6 16.8 278 1-347 101-385 (394) 127 protein:vir:96762 Length: 632 99.1 3E-12 1.9E-15 83.8 11.4 286 1-345 316-632 (632) 128 protein:vir:100884 Length: 389 99.1 2.8E-11 1.7E-14 78.4 16.5 278 1-347 99-383 (389) 129 protein:vir:6212 Length: 434 # 99.1 1.1E-11 6.8E-15 80.7 14.2 290 1-347 131-433 (434) 130 protein:vir:102944 Length: 330 99.1 9.7E-12 6E-15 81.0 13.9 281 1-347 1-301 (330) 131 protein:vir:1433 Length: 435 # 99.0 7.2E-11 4.5E-14 76.2 18.6 299 1-347 105-434 (435) 132 protein:vir:1084 Length: 437 # 99.0 1.3E-11 8.3E-15 80.2 14.3 282 1-347 141-428 (437) 133 protein:vir:3845 Length: 395 # 99.0 2.5E-11 1.5E-14 78.8 14.8 272 1-347 105-384 (395) 134 protein:vir:107593 Length: 392 99.0 5.2E-11 3.2E-14 77.0 16.4 285 1-347 84-385 (392) 135 protein:vir:105004 Length: 392 99.0 5.2E-11 3.2E-14 77.0 16.4 285 1-347 84-385 (392) 136 protein:vir:102873 Length: 392 99.0 5.2E-11 3.2E-14 77.0 16.4 285 1-347 84-385 (392) 137 protein:vir:102082 Length: 392 99.0 5.2E-11 3.2E-14 77.0 16.4 285 1-347 84-385 (392) 138 protein:vir:80376 Length: 435 99.0 1.1E-10 6.9E-14 75.2 18.0 299 1-347 105-435 (435) 139 protein:vir:1583 Length: 351 # 99.0 2.4E-11 1.5E-14 78.8 13.7 281 1-347 1-299 (351) 140 protein:vir:4092 Length: 390 # 99.0 3.5E-11 2.2E-14 77.9 14.2 287 1-347 72-369 (390) 141 protein:vir:962 Length: 397 # 99.0 1.4E-11 8.6E-15 80.1 11.7 276 1-346 121-397 (397) 142 protein:vir:108211 Length: 318 99.0 1E-10 6.3E-14 75.4 16.0 292 1-345 1-318 (318) 143 protein:vir:5739 Length: 366 # 98.9 1.3E-10 8.1E-14 74.8 15.9 299 1-346 52-366 (366) 144 protein:vir:105038 Length: 428 98.9 2.4E-10 1.5E-13 73.3 17.3 295 1-346 113-428 (428) 145 protein:vir:9875 Length: 296 # 98.9 3.2E-10 2E-13 72.6 17.3 280 1-347 1-296 (296) 146 protein:vir:101650 Length: 497 98.9 1.4E-10 8.5E-14 74.7 14.8 297 1-347 138-494 (497) 147 protein:vir:7855 Length: 497 # 98.9 1.4E-10 8.5E-14 74.7 14.8 297 1-347 138-494 (497) 148 protein:vir:2770 Length: 318 # 98.9 6.5E-10 4E-13 71.0 18.4 258 1-308 1-318 (318) 149 protein:vir:9927 Length: 295 # 98.9 3.1E-10 1.9E-13 72.8 16.5 272 1-347 1-289 (295) 150 protein:vir:93881 Length: 387 98.9 1.2E-10 7.4E-14 75.0 14.0 274 1-347 100-382 (387) 151 protein:vir:105610 Length: 430 98.9 7.4E-10 4.6E-13 70.6 18.3 324 1-347 1-423 (430) 152 protein:vir:8420 Length: 477 # 98.9 3.4E-10 2.1E-13 72.5 16.4 302 1-347 137-472 (477) 153 protein:vir:78640 Length: 352 98.8 1.8E-10 1.1E-13 74.0 13.6 273 1-347 73-347 (352) 154 protein:vir:93696 Length: 364 98.8 9.9E-10 6.2E-13 69.9 17.5 302 1-347 1-362 (364) 155 protein:vir:9361 Length: 402 # 98.8 3.6E-10 2.3E-13 72.3 14.5 273 1-347 114-397 (402) 156 protein:vir:93616 Length: 645 98.8 1.1E-09 7E-13 69.6 16.7 301 1-347 321-642 (645) 157 protein:vir:94424 Length: 387 98.8 5.4E-10 3.3E-13 71.4 13.8 273 1-347 99-382 (387) 158 protein:vir:96978 Length: 387 98.8 5.4E-10 3.3E-13 71.4 13.8 273 1-347 99-382 (387) 159 protein:vir:2685 Length: 387 # 98.8 5.4E-10 3.3E-13 71.4 13.8 273 1-347 99-382 (387) 160 protein:vir:95875 Length: 401 98.7 4E-09 2.5E-12 66.6 17.2 324 1-347 1-401 (401) 161 protein:vir:10123 Length: 404 98.7 1.1E-08 6.6E-12 64.3 18.1 330 1-347 1-402 (404) 162 protein:vir:104439 Length: 404 98.7 1.1E-08 6.6E-12 64.3 18.1 330 1-347 1-402 (404) 163 protein:vir:3298 Length: 404 # 98.7 1.1E-08 6.6E-12 64.3 18.1 330 1-347 1-402 (404) 164 protein:vir:819 Length: 404 # 98.7 1.1E-08 6.6E-12 64.3 18.1 330 1-347 1-402 (404) 165 protein:vir:4197 Length: 314 # 98.5 1.2E-08 7.4E-12 64.0 13.7 296 1-347 1-313 (314) 166 protein:vir:9509 Length: 381 # 98.4 1.1E-08 7E-12 64.2 12.4 289 1-347 57-369 (381) 167 protein:vir:101291 Length: 381 98.4 1.1E-08 7E-12 64.2 12.4 289 1-347 57-369 (381) 168 protein:vir:80128 Length: 466 98.4 4.9E-09 3E-12 66.1 9.8 297 1-347 123-449 (466) 169 protein:vir:9643 Length: 377 # 98.4 2.2E-08 1.4E-11 62.6 13.3 295 1-346 57-377 (377) 170 protein:vir:106647 Length: 303 98.3 1.8E-07 1.1E-10 57.5 16.2 280 1-347 1-297 (303) 171 protein:vir:78350 Length: 383 98.3 4.9E-08 3E-11 60.7 12.9 291 1-347 64-376 (383) 172 protein:vir:4159 Length: 315 # 98.2 5.6E-08 3.5E-11 60.3 12.1 298 1-345 7-315 (315) 173 protein:vir:3158 Length: 321 # 98.2 3.2E-08 2E-11 61.7 10.2 294 1-347 1-312 (321) 174 protein:vir:100632 Length: 381 98.2 1.1E-07 6.7E-11 58.8 12.9 292 1-347 57-373 (381) 175 protein:vir:95963 Length: 395 98.2 7.9E-08 4.9E-11 59.5 11.5 284 1-347 61-377 (395) 176 protein:vir:98635 Length: 377 98.1 1.5E-07 9.5E-11 57.9 12.7 283 1-346 59-377 (377) 177 protein:vir:79928 Length: 393 97.8 7.5E-07 4.6E-10 54.2 11.2 302 1-347 59-378 (393) 178 protein:vir:80446 Length: 367 97.7 8.5E-06 5.3E-09 48.4 15.8 296 1-347 1-340 (367) 179 protein:vir:97397 Length: 517 97.0 5.3E-05 3.3E-08 44.0 12.6 282 1-347 226-515 (517) 180 protein:vir:78387 Length: 349 95.9 0.0013 8E-07 36.4 15.0 290 1-347 1-320 (349) 181 protein:vir:95512 Length: 693 95.6 0.0015 9.1E-07 36.1 12.6 294 1-347 366-693 (693) 182 protein:vir:94528 Length: 286 95.2 0.0026 1.6E-06 34.7 14.1 261 1-347 1-286 (286) 183 protein:vir:103285 Length: 296 95.1 0.002 1.3E-06 35.3 12.1 274 16-344 1-296 (296) 184 protein:vir:107687 Length: 319 94.7 0.003 1.9E-06 34.4 11.8 294 1-344 1-319 (319) 185 protein:vir:80068 Length: 301 94.6 0.0039 2.4E-06 33.8 15.7 284 19-344 1-301 (301) 186 protein:vir:94989 Length: 349 94.6 0.004 2.5E-06 33.7 15.5 290 1-347 1-320 (349) 187 protein:vir:4786 Length: 295 # 94.4 0.003 1.9E-06 34.4 11.1 273 10-328 1-295 (295) 188 protein:vir:79548 Length: 652 94.1 0.0055 3.4E-06 33.0 13.0 289 1-343 336-652 (652) 189 protein:vir:4074 Length: 480 # 91.9 0.014 8.6E-06 30.8 13.8 278 1-347 164-478 (480) 190 protein:vir:3969 Length: 287 # 91.7 0.015 9.1E-06 30.6 12.5 262 1-347 1-287 (287) 191 protein:vir:104342 Length: 314 90.6 0.02 1.2E-05 29.9 11.1 287 1-344 1-314 (314) 192 protein:vir:79078 Length: 307 90.2 0.02 1.2E-05 29.9 10.1 277 1-347 1-298 (307) 193 protein:vir:107882 Length: 307 90.0 0.021 1.3E-05 29.8 10.0 289 1-347 1-298 (307) 194 protein:vir:98871 Length: 314 88.2 0.034 2.1E-05 28.6 12.7 280 1-347 11-314 (314) 195 protein:vir:99888 Length: 309 87.6 0.037 2.3E-05 28.4 10.6 283 1-347 1-308 (309) 196 protein:vir:103181 Length: 457 86.8 0.043 2.7E-05 28.1 13.0 305 1-347 97-439 (457) 197 protein:vir:97255 Length: 310 84.6 0.059 3.7E-05 27.3 15.3 288 1-346 1-310 (310) 198 protein:vir:79642 Length: 329 80.8 0.091 5.7E-05 26.3 13.3 289 1-344 21-329 (329) 199 protein:vir:5942 Length: 523 # 80.6 0.093 5.8E-05 26.2 14.1 311 1-345 162-523 (523) 200 protein:vir:106286 Length: 534 73.8 0.17 0.0001 24.9 13.3 308 1-347 125-507 (534) 201 protein:vir:95131 Length: 325 73.7 0.17 0.0001 24.8 14.9 275 18-347 1-298 (325) 202 protein:vir:107732 Length: 379 72.2 0.19 0.00012 24.6 12.4 303 1-344 56-379 (379) 203 protein:vir:8324 Length: 410 # 69.1 0.23 0.00014 24.1 10.0 268 1-344 89-410 (410) 204 protein:vir:94933 Length: 330 67.4 0.25 0.00016 23.9 16.2 284 1-347 25-330 (330) 205 protein:vir:10324 Length: 320 59.2 0.4 0.00025 22.8 10.5 295 10-347 1-318 (320) 206 protein:vir:103886 Length: 302 58.7 0.41 0.00025 22.7 13.3 281 1-347 1-302 (302) 207 protein:vir:80986 Length: 528 58.6 0.41 0.00025 22.7 13.5 306 1-347 116-502 (528) 208 protein:vir:94070 Length: 339 56.6 0.45 0.00028 22.5 10.8 285 1-344 37-339 (339) 209 protein:vir:5670 Length: 514 # 50.2 0.62 0.00038 21.7 12.1 305 1-347 114-498 (514) 210 protein:vir:78148 Length: 123 45.8 0.22 0.00014 24.2 3.4 117 206-346 1-123 (123) 211 protein:vir:99424 Length: 360 40.9 0.95 0.00059 20.7 9.9 305 1-347 1-360 (360) 212 protein:vir:96079 Length: 382 33.4 1.4 0.00084 19.9 15.2 290 1-344 63-382 (382) 213 protein:vir:104549 Length: 462 27.8 1.8 0.0011 19.2 13.9 298 1-347 97-444 (462) 214 protein:vir:100603 Length: 529 21.1 2.7 0.0016 18.3 13.9 309 1-347 147-503 (529) No 1 >protein:vir:8885 Length: 347 # NCBI annotation: major capsid protein A # Family: family:all:975 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813774;genbank:gi:29366729;genbank:GeneID:1258837 Probab=100.00 E-value=1e-112 Score=634.61 Aligned_cols=347 Identities=100% Similarity=1.410 Sum_probs=340.5 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEeccccceeeeecCCCCCC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVMGRTKGYYLAPGENLD 80 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~iG~~t~~~~~~g~~~~ 80 (347) |||++++++++||+||+++++|+++||||+|+|||+++|+++|++++++++|++++|||+|||++|+.++.+|+||++++ T Consensus 1 ~a~~~~~~~~~~~~g~~~~~~d~~al~ie~~~geV~~~f~~~s~~~~~~~~r~i~~G~sv~~~~iG~~~~~~~~~g~~l~ 80 (347) T protein:vir:88 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVMGRTKGYYLAPGENLD 80 (347) T ss_pred CCCcccchhhhccCCCCccccchHHHHHHHHHHHHHHHHHHHhhhhhccccccccCcceEEEeeecceeeeeeccccCCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccCcccC Q lcl|Aclame:pro 81 DKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIAGLGQ 160 (347) Q Consensus 81 ~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~g~~~ 160 (347) ++++++++++++|+||+++|++++|||+|++|++||+|+++++++|++||+++|++|+++++++++.++.+...++|.++ T Consensus 81 ~~~~~~~~~~~~i~ID~~~y~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLA~~~D~~i~~~l~~~a~~~~~~~~~~~g~~~ 160 (347) T protein:vir:88 81 DKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIAGLGQ 160 (347) T ss_pred CCCCCCccceEEEEEechhhhhhhhhhHHHHhhcCCchHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccCCccc Confidence 98889999999999999999999999999999999999999999999999999999999999999998888899999999 Q ss_pred ceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhccccccccccceEEE Q lcl|Aclame:pro 161 AVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALIDPETGNIRNV 240 (347) Q Consensus 161 ~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~v~~i 240 (347) +..+..+++.+.++++.+++++++.|++|+++|||++||++|||+||+|++|++||+++++++.+|.+++++++|+|+++ T Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~~~~~~~~~~~~~G~vg~i 240 (347) T protein:vir:88 161 AVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALIDPETGNIRNV 240 (347) T ss_pred cccccccccccccchhhhHHHHHHHHHHHHHHHhhcCCCCCCCEEEeCHHHHHHHhcchhhhhhhhccccchhcceeeee Confidence 99999999999999999999999999999999999999999999999999999999999999999999899999999999 Q ss_pred eceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeeccccchhhHhh Q lcl|Aclame:pro 241 MGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARRPEFQAD 320 (347) Q Consensus 241 ~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~~~~~~~d 320 (347) +||+||+|||+|...++..+.+++.+.++..+.+..+..++|++|++++++|+|||+|+|+|+++++++|.+|+++||+| T Consensus 241 ~G~~V~~s~nlp~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~d~~~~~~l~~~~~a~g~v~~~d~~~e~~r~~~~~~d 320 (347) T protein:vir:88 241 MGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARRPEFQAD 320 (347) T ss_pred ccceEEEeecccccccccccccccccccccccccccccccccccccCcEEEEEechhhhhheecccceeeeeechhhHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 321 QIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 321 ~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) +|+++|+|||+++||||||+|.+++|| T Consensus 321 ~i~~~~~~G~~~~rPe~a~~~~~~~a~ 347 (347) T protein:vir:88 321 QIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) T ss_pred HhhhhhhhcCceeccceEEEEEeCCCC Confidence 999999999999999999999999999 No 2 >protein:vir:94576 Length: 347 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919012;genbank:gi:119637776;genbank:GeneID:5179336 Probab=100.00 E-value=4e-112 Score=631.29 Aligned_cols=346 Identities=78% Similarity=1.166 Sum_probs=330.9 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEeccccceeeeecCCCCCC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVMGRTKGYYLAPGENLD 80 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~iG~~t~~~~~~g~~~~ 80 (347) |||++.+++++||+||+++++|+++||||+|+|||+++|++.|++++++++|+|++|||+|||+||++++.+|+||++++ T Consensus 1 ma~~~~~~~~~t~~g~~~~~~d~~al~ie~~~geV~~~f~~~s~~~~~~~~rti~~G~sv~~~~iG~~~~~~~~~G~~l~ 80 (347) T protein:vir:94 1 MANMNGGQQMGKDQGKGMSAGDKLALFLKVFGGEVLTAFTRTSVTMNKHLVRSIQSGKSAQFPVLGRTKAAYLQPGENLD 80 (347) T ss_pred CCccccccccccccccCCcccchHHHHHHHHhHHHHHHHHHHHhhhhhhhheeccccceEEeeeccceeEeeeecCcCCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccCcccC Q lcl|Aclame:pro 81 DKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIAGLGQ 160 (347) Q Consensus 81 ~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~g~~~ 160 (347) ++.++++++|++|+||+++|++++|||+|++|++||+|+++++++|++||+++||+|+++++++++++.+....+.|.++ T Consensus 81 ~~~~~~~~~e~~ltID~~~y~~~~VddiD~~q~~~D~rs~~~~~~g~ALA~~~D~~i~~~l~~~a~~~~~~~~~~~g~~~ 160 (347) T protein:vir:94 81 DKRKDMKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTAQLGESLAMAADGAVLAEMAKLCNLPTANNENIAGLGK 160 (347) T ss_pred CCcCCccccceEEEEcchhhhhhhhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccCCc Confidence 98889999999999999999999999999999999999999999999999999999999999999998888888888888 Q ss_pred ceeeeecccc-cccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhccccccccccceEE Q lcl|Aclame:pro 161 AVVLNIGAAA-DLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALIDPETGNIRN 239 (347) Q Consensus 161 ~~~i~~~~~~-~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~v~~ 239 (347) +..+.++... ...+++.++.++|+.|++|+++|||++||++|||+||+|++|+.||+..++...++....++.+|+|++ T Consensus 161 ~~~v~i~~~~~~~~~~~~~~~~~~d~i~~a~~~Lde~dVP~~~R~~vv~P~~y~~LLk~~~~~~~~~~~~~~~~~G~V~~ 240 (347) T protein:vir:94 161 AHVLEVGDQATLQGDQVKLGQAIIAQLTLARAKLTGNYVPSSDRVFYTTPDNYSAILAALMPNAANYQALIDPSTGSIRN 240 (347) T ss_pred ceeEeeeccccccccccccHHHHHHHHHHHHHHhhhcCCCCCCCEEEeChHHHHHHHHhhcccccccccccccccceeEE Confidence 8888776543 445667788999999999999999999999999999999999999998888888888888899999999 Q ss_pred EeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeeccccchhhHh Q lcl|Aclame:pro 240 VMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARRPEFQA 319 (347) Q Consensus 240 i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~~~~~~~ 319 (347) ++||+||+|||+|....+.+....+.+.++.++.+..+.+++|++||+++|+|+|||+|+++|+++++++|.+||++||+ T Consensus 241 v~G~~V~~Sn~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~d~~~~~~l~~~~~A~~tv~~~~~~~e~~~~~~~~~ 320 (347) T protein:vir:94 241 VMGFEVIEVPHLTAGGAGDNRAEEGVAPTNQKHAFPDTASGDTRVALDNVVGLFNHRSAVGTVKLKDMALERARRANFQA 320 (347) T ss_pred eeceEEEEcCccccccCcccccccccccccccccccccccccccccccceEEEEechhhhhhhhhcccceeeeechhhhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hHHhhhhhhcCcccccceEEEEEecCC Q lcl|Aclame:pro 320 DQIIGKYAMGHGGLRPEAAGALVFTPA 346 (347) Q Consensus 320 d~i~~~~~~G~~~lRPe~~~~l~~~~a 346 (347) |+|+++++|||+++||||||+|.+..| T Consensus 321 ~~i~~~~a~G~g~~rPe~a~~i~~~~a 347 (347) T protein:vir:94 321 DQIIAKYAMGHGGLRPEACGALVFKKA 347 (347) T ss_pred hhhhhhhhhcCcccccceeEEEEecCC Confidence 999999999999999999999988777 No 3 >protein:vir:94711 Length: 347 # NCBI annotation: capsid # Family: family:all:975 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338120;genbank:gi:77118198;genbank:GeneID:3707734 Probab=100.00 E-value=6.1e-108 Score=608.37 Aligned_cols=345 Identities=74% Similarity=1.114 Sum_probs=328.1 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEeccccceeeeecCCCCCC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVMGRTKGYYLAPGENLD 80 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~iG~~t~~~~~~g~~~~ 80 (347) |||++ ++.++|||||+++++|+.+||||+|.|||+++|+++|++++++++|+|++|||+|||++|++++++|+||++++ T Consensus 1 m~~~~-~~~~~t~~g~~~~~~d~~al~ik~f~~eV~~~f~~~s~~~~~~~~r~i~~G~sv~i~~iG~~tv~~~t~G~~l~ 79 (347) T protein:vir:94 1 MANVP-GQKIGTDQGKGKSSSDALALFLKVFAGEVLTAFTRRSVTADKHIVRTIQNGKSAQFPVMGRTSGVYLAPGERLS 79 (347) T ss_pred CCCCC-ccccccccccCCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccccceEEEecccceeeeeecCCCCcC Confidence 99975 69999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccCcccC Q lcl|Aclame:pro 81 DKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIAGLGQ 160 (347) Q Consensus 81 ~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~g~~~ 160 (347) ++++++++++++|+||+++|++++|||+|++|++||+|+++++++|++||+++|++|++++++++....++...+.|++. T Consensus 80 ~~~~~~~~~e~~itID~~~~~~~~VddiD~~q~~~D~~~~~~~~~g~aLa~~~D~~i~~~~~~~aa~~~~~~~~~~g~~~ 159 (347) T protein:vir:94 80 DKRKGIKHTEKVITIDGLLTADVMIFDIEDAMNHYDVAGEYSNQLGEALAIAADGAVLAEMAILCNLPAASNENIAGLGT 159 (347) T ss_pred CCCCCCCcceEEEEecchhhhhHHhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccCCCcc Confidence 88888999999999999999999999999999999999999999999999999999999999988888888888889889 Q ss_pred ceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhccccccccccceEEE Q lcl|Aclame:pro 161 AVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALIDPETGNIRNV 240 (347) Q Consensus 161 ~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~v~~i 240 (347) ++++..+..++..++...+++++++|++|+++|+|++||++|||+||+|++|++||++.++.+.+|.+++++.+|+|+++ T Consensus 160 ~s~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~~R~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~Vg~i 239 (347) T protein:vir:94 160 ASVLEVGKKADLDTPAKLGEAIIGQLTIARAKLTSNYVPAGDRYFYTTPDNYSAILAALMPNAANYAALIDPETGNIRNV 239 (347) T ss_pred cceeeccccccccchhhhHHHHHHHHHHHHHHHhhcCCCCCCcEEEeCHHHHHHHhccchhhhhhccccccccccceEEE Confidence 99999888888888899999999999999999999999999999999999999999999999999999889999999999 Q ss_pred eceeEEEeccccccccccccccCccccccc-cccccccccccccccccceeEEeechhhhhhhhhhheeeccccchhhHh Q lcl|Aclame:pro 241 MGFEVIEVPHLTVGGAGDNNPADGVAPTNQ-KHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARRPEFQA 319 (347) Q Consensus 241 ~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~-~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~~~~~~~ 319 (347) +||+||+|||||..+.+.+....+.+..++ .+.++...+.+|+++|+++++|+|||+|+++|+++++++|.+|++++|+ T Consensus 240 ~G~~V~~Sn~lp~~~~t~~~~~~~~~~~aG~~~~~~~~~~~~~~~~~~~~~~l~~h~~A~~~v~~~~~~~e~~r~~~~~~ 319 (347) T protein:vir:94 240 MGFVVVEVPHLVQGGAGETRGDDGITIASGQKHAFPATASSDVKVTMDNVVGLFSHRSAVGTVKLRDLALERDRDVDAQG 319 (347) T ss_pred eceEEEecCcccccccccccccCcceecCcccccccccchhhhcccccceeEEEeehhhhhhhhcccccccchhchhhHH Confidence 999999999999998888888877776665 5677777888999999999999999999999999999999999999999 Q ss_pred hHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 320 DQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 320 d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) |+|+++|+|||+++||||+|+|.+. +| T Consensus 320 d~i~~~~~~G~~~~rP~~a~~~~~~-~A 346 (347) T protein:vir:94 320 DLIVGKYAMGHGGLRPEAAGALVFS-PA 346 (347) T ss_pred HHhhhhhhhcCcccccceeEEEEec-CC Confidence 9999999999999999999999766 55 No 4 >protein:vir:10450 Length: 344 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848297;genbank:gi:30387487;genbank:GeneID:1733971 Probab=100.00 E-value=2.6e-106 Score=599.39 Aligned_cols=342 Identities=73% Similarity=1.107 Sum_probs=314.9 Q ss_pred CCCCccCccccccCc-ccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEeccccceeeeecCCCCC Q lcl|Aclame:pro 1 MANATGGQQIGANQG-KGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVMGRTKGYYLAPGENL 79 (347) Q Consensus 1 m~~~~~~~~~~~~~~-~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~iG~~t~~~~~~g~~~ 79 (347) |+|+..+++.++..+ +.++++|+++||||+|+|||+++|+++|++++++++|+|++|||+|||++|++++++|+||+++ T Consensus 1 ma~~~~~~~~n~~~~~~~~~~~~~~al~ie~~~geV~~~f~~~s~~~~~~~~r~i~~g~s~~~~~iG~~~~~~~~~G~~l 80 (344) T protein:vir:10 1 MANMTGGQQLGTNQGKDVMAAGDKLALFLKVFGGEVLTAFARTSVTTSRHMVRSISSGKSAQFPVLGRTQAAYLAPGENL 80 (344) T ss_pred CccccccccCCcccCCccCCccchhHHHHHHHHHHHHHHHHHHhhhcccceeeeecccceEEEEeeceeEEEeeecCCCC Confidence 999987766655443 3367778899999999999999999999999999999999999999999999999999999999 Q ss_pred CCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccCccc Q lcl|Aclame:pro 80 DDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIAGLG 159 (347) Q Consensus 80 ~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~g~~ 159 (347) +++.++++++|++|+||+++|++|+|||+|++|++||+|+++++++|++||+++|++|+++++++++.+++....+.|.+ T Consensus 81 ~~t~~~~~~~e~~l~ID~~~y~~~~VdDiD~~q~~~D~r~~~~~~~G~aLA~~~D~~i~~~la~~a~~~~~~~~~~~g~~ 160 (344) T protein:vir:10 81 DDIRKDIKHTEKVITIDGLLTADVLIYDIEDAMNHYDVRSEYTSQLGESLAMAADGAVLAEIAGLCNVESQYNENITGLG 160 (344) T ss_pred CCCCCCcccceEEEEEcchhhhhhhhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccc Confidence 99888999999999999999999999999999999999999999999999999999999999999999988888888888 Q ss_pred Cceeeeecc-cccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhccccccccccceE Q lcl|Aclame:pro 160 QAVVLNIGA-AADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALIDPETGNIR 238 (347) Q Consensus 160 ~~~~i~~~~-~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~v~ 238 (347) .+.++.... +...+++...++++++.|++|+++|||++||++|||+||+|++|++||+++++++.+|.+++++.+|.|+ T Consensus 161 ~~~~~~~~~~~~~~t~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~~~~~~~~~~~~~G~V~ 240 (344) T protein:vir:10 161 TATVIETTQDKTTLTDQVALGKEIIAALTKARAALTKNYVPSSDRVFYCDPDSYSAILAALMPNAANYAALIDPEKGSIR 240 (344) T ss_pred ccceeecccccccccchhhhHHHHHHHHHHHHHHHhhcCCCccCCEEEeChHHHHHHhhcccccccccccccceeeeEEE Confidence 887776554 3344677788899999999999999999999999999999999999999999999999999999999999 Q ss_pred EEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeeccccchhhH Q lcl|Aclame:pro 239 NVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARRPEFQ 318 (347) Q Consensus 239 ~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~~~~~~ 318 (347) +++||+||+|||+|..+.+... ...++.++.++++.+..|+++|+++|||+|||+|+++++++++++|.+|+++|| T Consensus 241 ~v~G~~V~~Sn~lp~~~~~~~~----~~~tg~~~~~~~~~~~~~~~~~s~~~~l~~h~~A~~~v~~~~~~~e~~r~~~~~ 316 (344) T protein:vir:10 241 NVMGFEVVEVPHLTAGGAGTSR----EGTTGQKHAFPATKSGNDKVAKDNVIGLFMHRSAVGTVKLRDLALERARRANFQ 316 (344) T ss_pred EEeceEEEeccccccccCCccc----ccccCccccccCCcccceeeecceeEEEeechhhhhhhhhccceeecccchhHH Confidence 9999999999999976554432 235666788889999999999999999999999999999999999999999999 Q ss_pred hhHHhhhhhhcCcccccceEEEEEecCC Q lcl|Aclame:pro 319 ADQIIGKYAMGHGGLRPEAAGALVFTPA 346 (347) Q Consensus 319 ~d~i~~~~~~G~~~lRPe~~~~l~~~~a 346 (347) +|+|+++|+|||+++||||++++.++.. T Consensus 317 ~d~i~g~~~~G~~vlRPe~a~~v~~~~~ 344 (344) T protein:vir:10 317 ADQIIAKYAMGHGGLRPEAAGAVVFKTK 344 (344) T ss_pred HHHHHHHhhcccceecccceEEEEeecC Confidence 9999999999999999999999999888 No 5 >protein:vir:3364 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523335;genbank:gi:17570826;genbank:GeneID:927448 Probab=100.00 E-value=3.7e-105 Score=593.11 Aligned_cols=344 Identities=74% Similarity=1.097 Sum_probs=313.5 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEeccccceeeeecCCCCCC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVMGRTKGYYLAPGENLD 80 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~iG~~t~~~~~~g~~~~ 80 (347) |||+..+++++|||||+++++|+++||||+|+|||+++|+++|++++++++|++++|||+|||++|++++++|++|++++ T Consensus 1 ~~~~~~~~~~~t~~g~~~~~~~~~al~ie~~~g~V~~~f~~~s~~~~~v~~r~~~~G~sv~i~~iG~~t~~~~~~g~~l~ 80 (347) T protein:vir:33 1 MANIQGGQQIGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTMPRHMLRSIASGKSAQFPVIGRTKAAYLKPGENLD 80 (347) T ss_pred CCCCccCcccccccccCCcccchHHHHHHHHHHHHHHHHHHHHhhhhhhccccccccceeEeeeccceeeeeecCCCCCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccC--cc Q lcl|Aclame:pro 81 DKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIA--GL 158 (347) Q Consensus 81 ~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~--g~ 158 (347) ++++++++++++|+||+++|++++|||+|++|++||+|+++++++|++||+++|++|+++++++.+.+..+....+ +. T Consensus 81 ~~~~~~~~~e~~ltiD~~~y~~~~VddiD~~q~~~D~~~~~~~~~g~aLA~~~D~~i~~~l~~~~~~~~~~~~~~~~~~~ 160 (347) T protein:vir:33 81 DKRKDIKHTEKVIHIDGLLTADVLIYDIEDAMNHYDVRAEYTAQLGESLAMAADGAVLAELAGLVNLPDGSNENIEGLGK 160 (347) T ss_pred CCCCCCccceEEEEechhhhhhHHHhhHHHHhcCCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccccccc Confidence 8888899999999999999999999999999999999999999999999999999999999987766555443333 33 Q ss_pred cCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhccccccccccceE Q lcl|Aclame:pro 159 GQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALIDPETGNIR 238 (347) Q Consensus 159 ~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~v~ 238 (347) ..+..+..++++...++..+++++|++|++|+++|+|++||++|||+||+|++|++||++++|++++|.+++.+.+|.|+ T Consensus 161 ~~~~~~~~~~tg~~~d~~~~a~~i~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~~~d~~~~~~~~~G~V~ 240 (347) T protein:vir:33 161 PTVLTLVKPTTGSLTDPVELGKAIIAQLTIARASLTKNYVPAADRTFYTTPDNYSAILAALMPNAANYQALLDPERGTIR 240 (347) T ss_pred cccccccccccccccchhhhHHHHHHHHHHHHHHHhhcCCCccCcEEEeCHHHHHHHhccccccccccccccccccceeE Confidence 34445556666677778888999999999999999999999999999999999999999999999999988899999999 Q ss_pred EEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeeccccchhhH Q lcl|Aclame:pro 239 NVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARRPEFQ 318 (347) Q Consensus 239 ~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~~~~~~ 318 (347) +++||+||+|||||....+.+..+ +..+..+.+.++.+..|+++|+++++|+|||+|+|+++++++++|..|+++|| T Consensus 241 ~i~G~~V~~Sn~lp~~~~~~~~~~---~~ag~~~~~~~~~~~~~~~a~~~~~gl~~h~~A~g~v~~~~~~~e~~r~~~~~ 317 (347) T protein:vir:33 241 NVMGFEVVEVPHLTAGGAGDTRED---APADQKHAFPATSSTTVKVALDNVVGLFQHRSAVGTVKLKDLALERARRANYQ 317 (347) T ss_pred EEeceeEEEecccccCcccccccc---ccccccccccCCcccceeccccceeeeeecchhheeeeeeceeeeeccchhhh Confidence 999999999999998755443332 22445667778888889999999999999999999999999999999999999 Q ss_pred hhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 319 ADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 319 ~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) +|+|+++|+|||+++||||||+|.+-.-+ T Consensus 318 ~d~i~~~~~~G~~vlrP~~av~i~~~~~~ 346 (347) T protein:vir:33 318 ADQIIAKYAMGHGGLRPEAAGAIVLPKVS 346 (347) T ss_pred hHhhhhhhhcCCceecccceEEEecCCCC Confidence 99999999999999999999999766666 No 6 >protein:vir:100057 Length: 375 # NCBI annotation: T7-like capsid protein # Family: family:all:975 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214206;genbank:gi:61806429;genbank:GeneID:3294737 Probab=100.00 E-value=9.4e-105 Score=590.90 Aligned_cols=345 Identities=23% Similarity=0.326 Sum_probs=307.5 Q ss_pred CCCCcc----CccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEeccccceeeeecCC Q lcl|Aclame:pro 1 MANATG----GQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVMGRTKGYYLAPG 76 (347) Q Consensus 1 m~~~~~----~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~iG~~t~~~~~~g 76 (347) |||.|+ ..+.+|||||+++ +|+++||||+|+|||+++|+++|++++++++|+|++|||++|+++|++++++|+|| T Consensus 1 ~~~~~~~~~~~~n~~t~~~~~~~-~~~~al~le~f~geV~~~f~~~si~~~~~~~rti~~Gksv~f~~iG~~t~~~~t~G 79 (375) T protein:vir:10 1 MANANQVALGRSNLSTGTGYGGA-TDKYALYLKLFSGEMFKGFQHETIARDLVTKRTLKNGKSLQFIYTGRMTSSFHTPG 79 (375) T ss_pred CccccccccCccccCCccccccc-cchHHHHHHHHhHHHHHHHHHHHhhhccccccccccCceEEEEeeeeeEEeeecCC Confidence 999884 5678899999854 58888999999999999999999999999999999999999999999999999999 Q ss_pred CCCCCC-CCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccc Q lcl|Aclame:pro 77 ENLDDK-RKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENI 155 (347) Q Consensus 77 ~~~~~~-~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~ 155 (347) ++++++ ..++++++++|+||+++|++|+|||+|++|++||+|+++++|+|++||+++|++|+++++++++...+..... T Consensus 80 ~~i~~~~~~d~~~te~~l~ID~~~y~~~~VdDiD~aqa~~Dlr~e~s~~~G~aLA~~~D~~i~~~l~kaa~~~~p~~~~~ 159 (375) T protein:vir:10 80 TPILGNADKAPPVAEKTIVMDDLLISSAFVYDLDETLAHYELRGEISKKIGYALAEKYDRLIFRSITRGARSASPVSATN 159 (375) T ss_pred cCcCCccccCCCCCceEEEecchhhhhhhHhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccc Confidence 999875 4578899999999999999999999999999999999999999999999999999999999999888766666 Q ss_pred CcccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcc---hhhhhhhccccccc Q lcl|Aclame:pro 156 AGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSA---LMPNAANYAALIDP 232 (347) Q Consensus 156 ~g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~---~~~~~~~~~~~~~~ 232 (347) ...+++..+..++.+. .+++.+++++|++|++++++|+|++||++|||+||+|++|++||++ +++++.+|.+++.. T Consensus 160 ~~~~Gg~~i~~~sg~~-~~~~~ta~~~~~ai~~a~~~Lde~~VP~~~R~~vv~P~~y~~Ll~~~d~~~~~n~d~~~~~~~ 238 (375) T protein:vir:10 160 FVEPGGTQIRVGSGTN-ESDAFTASALVNAFYDAAAAMDEKGVSSQGRCAVLNPRQYYALIQDIGSNGLVNRDVQGSALQ 238 (375) T ss_pred ccccCcceeeeccccc-cccccCHHHHHHHHHHHHHHHhhcCCCCCCCEEEeChHHHHHHHhcCCccceeeeccccccee Confidence 6666777777664433 3445678899999999999999999999999999999999999987 67899999888888 Q ss_pred cccceEEEeceeEEEeccccccccccccccCccccccc------------cccccccccccccccc---cceeEEeechh Q lcl|Aclame:pro 233 ETGNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQ------------KHIFPATATGDDRVAQ---NNVVGLFNHRS 297 (347) Q Consensus 233 ~~G~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~------------~~~~~a~~~~~y~~d~---~~~~~l~~h~~ 297 (347) .+|+|++++||+||+|||+|......+.+....+.++. .....++..++|++|| +++|||+|||+ T Consensus 239 ~~g~v~~i~Gv~V~~Sn~lP~~~~~~~~~g~~~~~~a~~~~~~~~~~~~~~~~~~~g~~~~y~~d~~~~~~~~~~~~~~~ 318 (375) T protein:vir:10 239 SGNGVIEIAGIHIYKSMNIPFLGKYGVKYGGTTGETSPGNLGSHIGPTPENANATGGVNNDYGTNAELGAKSCGLIFQKE 318 (375) T ss_pred ccceEEEEeceEEEEeccccccccccccccccccccchhhhhccccccCCcceeeccccccccccccccCceEEEEEchh Confidence 89999999999999999999887766666544443321 1234566778999999 99999999999 Q ss_pred hhhhhhhhheeeccc---cchhhHhhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 298 AVGTVKLKDMALERA---RRPEFQADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 298 A~~tv~~~~~~~e~~---~~~~~~~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) |+|+||++++++|.+ |+++||+|+|+++|+|||+++||||||+|...++| T Consensus 319 A~g~v~~~~~~~~~~~~~~~~~~q~~~i~~~~a~G~~~lrp~~av~l~~~~~~ 371 (375) T protein:vir:10 319 AAGVVEAIGPQVQVTNGDVSVIYQGDVILGRMAMGADYLNPAAAVELYIGATA 371 (375) T ss_pred heeeeeeeccccccccchhhheeeeeeeeeeeeeccCccCceeEEEEecCcCc Confidence 999999999999988 79999999999999999999999999999988777 No 7 >protein:vir:1541 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052109;swissprot:trembl:q9t107;genbank:gi:9634035;uniprot:Q9T107;genbank:GeneID:1262383 Probab=100.00 E-value=2.3e-103 Score=583.25 Aligned_cols=344 Identities=74% Similarity=1.093 Sum_probs=312.9 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEeccccceeeeecCCCCCC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVMGRTKGYYLAPGENLD 80 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~iG~~t~~~~~~g~~~~ 80 (347) |+|+..+++++||+||+++++|+++||||+|+|||+++|+++|++++++++|++++|||+|||++|++++++|++|++++ T Consensus 1 ma~~~~~~~~~t~~~~~~~~~~~~a~~ie~f~g~V~~~f~~~s~~~~~~~~~~~~~G~sv~i~~ig~~t~~~~~~g~~l~ 80 (347) T protein:vir:15 1 MANIQGGQQIGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTMPRHMLRSIASGKSAQFPVIGRTKAAYLKPGENLD 80 (347) T ss_pred CCccccCCccccccccCCCcchHHHHHHHHHHHHHHHHHHHhhhhhhccccccccccceeEeeeccceeeeeeccCCCCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccC--cc Q lcl|Aclame:pro 81 DKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIA--GL 158 (347) Q Consensus 81 ~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~--g~ 158 (347) ++++++++++++|+||+++|++++|||+|++|++||+|+++++++|++||+++|++|++++++++.++..+..... |. T Consensus 81 ~~~~~~~~~e~~ltID~~~~~~~~VddlD~~q~~~D~~~~~~~~~g~aLA~~~D~~i~~~l~~~~~~~~~~~~~~~~~g~ 160 (347) T protein:vir:15 81 DKRKDIKHTEKVIHIDGLLTADVLIYDIEDAMNHYDVRAEYTAQLGESLAMAADGAVLAELAGLVNLPDASNENIEGLGK 160 (347) T ss_pred CCCCCCccceEEEEechhhhhhHHhhhHHHHhcCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccCc Confidence 8888899999999999999999999999999999999999999999999999999999999988766544433332 33 Q ss_pred cCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhccccccccccceE Q lcl|Aclame:pro 159 GQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALIDPETGNIR 238 (347) Q Consensus 159 ~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~v~ 238 (347) .........++++..++...+++|++.|++|+++|+|++||++|||+||+|++|+.||+++++++.+|.++..+.+|.|+ T Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~i~d~~~~a~~~Lde~~VP~~gR~~vv~P~~y~~LL~~~~~~~~d~~~~~~~~~G~Vg 240 (347) T protein:vir:15 161 PTVLTLVKPTTGDLTDPVELGKAIIAQLTIARASLTKNYVPAADRTFYTTPDNYSAILAALMPNAANYQALIDHERGTIR 240 (347) T ss_pred cccccccccccccchhhhhHHHHHHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHhcccccccccccccccccceEEE Confidence 33334445566677888889999999999999999999999999999999999999999999999999988889999999 Q ss_pred EEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeeccccchhhH Q lcl|Aclame:pro 239 NVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARRPEFQ 318 (347) Q Consensus 239 ~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~~~~~~ 318 (347) +++||+||+|||||....+.+.. .+.+..++.+.++.+...+++|+++++|+|||+|+++|++|++++|..|+++|| T Consensus 241 ~i~G~~V~~Sn~lp~~~~t~~~~---~~~~g~~~~~~~~~~~~~~~~f~~~~~l~~h~~A~g~v~~~~~~~e~~~~~~~~ 317 (347) T protein:vir:15 241 NVMGFEVVEVPHLTAGGAGDTRE---DAPADQKHAFPATSSTTVKVALDNVVGLFQHRSAVGTVKLKDLALERARRANYQ 317 (347) T ss_pred EEeceEEEecccccccccccccc---cccccccccccccccceeeeccccceeeeeccceeeeeEeeceeeeecccchhh Confidence 99999999999999875544432 234556677778888888999999999999999999999999999999999999 Q ss_pred hhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 319 ADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 319 ~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) +|+|+++|+|||+++||||||+|.+-.-+ T Consensus 318 ~d~i~~~~~~G~~vlrP~~av~~~~~~~~ 346 (347) T protein:vir:15 318 ADQIIAKYAMGHGGLRPEAAGAIVLPKVS 346 (347) T ss_pred hhhhehhhhcCCceeccccEEEEecCCCC Confidence 99999999999999999999999766666 No 8 >protein:vir:2201 Length: 345 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041998;swissprot:sw:p19726;genbank:gi:9627469;goa:P19726;uniprot:P19726;genbank:GeneID:1261026 Probab=100.00 E-value=3.2e-102 Score=577.02 Aligned_cols=341 Identities=72% Similarity=1.091 Sum_probs=303.4 Q ss_pred CCCCccCcccc--ccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEeccccceeeeecCCCC Q lcl|Aclame:pro 1 MANATGGQQIG--ANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVMGRTKGYYLAPGEN 78 (347) Q Consensus 1 m~~~~~~~~~~--~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~iG~~t~~~~~~g~~ 78 (347) |+++.++++.+ +|+||+ +++|+++||||+|+|||+++|+++|++++++++|+|++|||++||++|++++++|+||++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~-~~~~~~al~le~f~geV~~~f~~~s~~~~~~~~r~i~~gks~~~~~iG~~~~~~~~~G~~ 79 (345) T protein:vir:22 1 MASMTGGQQMGTNQGKGVV-AAGDKLALFLKVFGGEVLTAFARTSVTTSRHMVRSISSGKSAQFPVLGRTQAAYLAPGEN 79 (345) T ss_pred Ccccccchhcccccccccc-cCCchhHHHHHHHhHHHHHHHHHHhhhcccceeeeccccceEEEeeecceEEEeeecCCC Confidence 99988765544 566775 467888999999999999999999999999999999999999999999999999999999 Q ss_pred CCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccCcc Q lcl|Aclame:pro 79 LDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIAGL 158 (347) Q Consensus 79 ~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~g~ 158 (347) ++++.++++++|++|+||+++|++|+|||+|++|++||+|+++++|+|++||+++||+|+++++++++.+++....+.++ T Consensus 80 l~~~~~~~~~~e~~ltID~~~y~~~~VddiD~~q~~~D~r~~~s~~~G~aLA~~~D~~i~~~l~k~a~~~~~~~~~~~~~ 159 (345) T protein:vir:22 80 LDDKRKDIKHTEKVITIDGLLTADVLIYDIEDAMNHYDVRSEYTSQLGESLAMAADGAVLAEIAGLCNVESKYNENIEGL 159 (345) T ss_pred CCCCCCCcccceEEEEecchhhhhhhHhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccc Confidence 99988889999999999999999999999999999999999999999999999999999999999999888777778877 Q ss_pred cCceeeeecc-cccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhccccccccccce Q lcl|Aclame:pro 159 GQAVVLNIGA-AADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALIDPETGNI 237 (347) Q Consensus 159 ~~~~~i~~~~-~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~v 237 (347) +.+..+...+ +...+++...+.+++++|++|+++|||++||.+|||+||+|++|++||+++++++.+|.++++..+|.| T Consensus 160 ~~~~~~~~~~~g~~~t~~~~~~~~~~~ai~~a~~~Lde~~VP~~~R~~vv~P~~y~~Ll~~~~~~~~~~~~~~~~~~G~V 239 (345) T protein:vir:22 160 GTATVIETTQNKAALTDQVALGKEIIAALTKARAALTKNYVPAADRVFYCDPDSYSAILAALMPNAANYAALIDPEKGSI 239 (345) T ss_pred ccccccccccccccccccccCHHHHHHHHHHHHHHhhhcCCCccCCEEEeChHHHHHHhccccccccccccccccccceE Confidence 7777766544 334456677788999999999999999999999999999999999999999999999999889999999 Q ss_pred EEEeceeEEEeccccccccccccccCccccccccccccccccc-cccccccceeEEeechhhhhhhhhhheeeccccchh Q lcl|Aclame:pro 238 RNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATG-DDRVAQNNVVGLFNHRSAVGTVKLKDMALERARRPE 316 (347) Q Consensus 238 ~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~-~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~~~~ 316 (347) ++++||+||+|||+|....+.... +.....+.++.+.+. .|..+.+++|||+|||+|+++|+++++++|.+|+++ T Consensus 240 ~~i~G~~V~~sn~lp~~~~~~~~~----~~~~~~~~~~~~~g~~~~~~~~~~~~~l~~h~~A~~~v~~~~~~~e~~r~~~ 315 (345) T protein:vir:22 240 RNVMGFEVVEVPHLTAGGAGTARE----GTTGQKHVFPANKGEGNVKVAKDNVIGLFMHRSAVGTVKLRDLALERARRAN 315 (345) T ss_pred EEEeceEEEecccccccccCcccc----CcccccccccccccceeeeeccCceEEEEEehhheeeeeeecceeeeeechh Confidence 999999999999999754443322 233444555555433 355678999999999999999999999999999999 Q ss_pred hHhhHHhhhhhhcCcccccceEEEEEecCC Q lcl|Aclame:pro 317 FQADQIIGKYAMGHGGLRPEAAGALVFTPA 346 (347) Q Consensus 317 ~~~d~i~~~~~~G~~~lRPe~~~~l~~~~a 346 (347) ||+|+|+++|+|||+++||||+++|.+=-. T Consensus 316 ~~~d~I~~~~a~G~~vlRPeaa~~i~~~~~ 345 (345) T protein:vir:22 316 FQADQIIAKYAMGHGGLRPEAAGAVVFKVE 345 (345) T ss_pred HHHHHHHHHHhcCCcccccceeEEEEEeeC Confidence 999999999999999999999999986555 No 9 >protein:vir:103323 Length: 364 # NCBI annotation: major capsid-like protein # Family: family:all:2806 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039668;genbank:gi:125999997;genbank:GeneID:4818399 Probab=100.00 E-value=1.5e-100 Score=567.90 Aligned_cols=334 Identities=13% Similarity=0.097 Sum_probs=290.1 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEeccccceeeeecCCCCCC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVMGRTKGYYLAPGENLD 80 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~iG~~t~~~~~~g~~~~ 80 (347) ||++|+ .|||||++++ |.++||||+|+|||+|+|+++|++++++++|+|++|||++||++|+++++||+||++++ T Consensus 1 ms~~n~----~t~~~~~~~~-~~~al~le~f~geV~taf~~~s~~~~~~~~rti~~gkS~q~~~iG~~~~~~~~~G~~ld 75 (364) T protein:vir:10 1 MSNPNV----LTQPAVSASG-EVDSLLIEKFNNRVHEQYLKGENLLQWFDVQEVVGTNSVSNKYIGETELQVLSPGKSPD 75 (364) T ss_pred CCCccc----cccccccccc-chhhhhhhhhhhhHHHHHHHHHhhcCcceeeeecccceEEeeeeeeeEEeeeccCcccC Confidence 999876 4889988554 77889999999999999999999999999999999999999999999999999999998 Q ss_pred CCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcc-hHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcc-cccccccCcc Q lcl|Aclame:pro 81 DKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYD-VRAEYSAQLGEALAIAADGAVLAEMAKLCNLP-AASNENIAGL 158 (347) Q Consensus 81 ~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D-~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a-~~~~~~~~g~ 158 (347) ++ +++++|++|+||+++|++++|+|+|++|+||| +|+|+++|+||+||+++||+|++++.+++... .+......+. T Consensus 76 ~~--~~~~~k~~itID~ll~a~~~V~diDe~q~~~D~vR~e~s~e~G~ALA~~~Dq~i~~~v~~aa~a~~~~~~~~~~~~ 153 (364) T protein:vir:10 76 AS--PTEFDKNRLVVDTTVIARNTVAHFHDVQNDIDGLKSKLSVNQAKKLKKMEDSMVIQQLVLGGISNTEAIRKNPRVA 153 (364) T ss_pred CC--CcccCcEEEEecceeeechhhhhHHHHhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccCCccc Confidence 64 68999999999999999999999999999999 89999999999999999999998776654211 1112222333 Q ss_pred cCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhcc--ccccccccc Q lcl|Aclame:pro 159 GQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYA--ALIDPETGN 236 (347) Q Consensus 159 ~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~--~~~~~~~G~ 236 (347) +.|..++.+ +...+.+++++.++++|+++.++|||++||.+|||+||+|++|+.||++++|+|++|. +++++.+|+ T Consensus 154 ~~g~~i~~~--~~a~~~~~~~~~l~~ai~~a~~~LdEkdVP~~~R~~vv~P~~y~~Ll~~~~lvn~d~~~~~~~~~~~G~ 231 (364) T protein:vir:10 154 GHGFSIHIV--GLASSFLTSPQYMMAAIEMAMEQQTEQEVDTSELCGLMPWTAFNCLRDADRIVDKSYTIAASDNTVDGF 231 (364) T ss_pred CCcceeeec--ccCcchhhhHHHHHHHHHHHHHHHhhcCCCccccEEEeChHHHHHHhcCCccccccccccCCCccccce Confidence 445455443 2234567788899999999999999999999999999999999999999999999996 567799999 Q ss_pred eEEEeceeEEEeccccccccccccccCcccccccccccccccccccc--ccccceeEEeechhhhhhhhhhheeeccccc Q lcl|Aclame:pro 237 IRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDR--VAQNNVVGLFNHRSAVGTVKLKDMALERARR 314 (347) Q Consensus 237 v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~--~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~~ 314 (347) |++++||+|++|||||........ ......+...+++.+++|. +|++++++|+|||+|+++++++++++|.+|+ T Consensus 232 v~~v~Gv~Vv~Sn~lP~~~~~~~~----t~~~t~h~ls~~~~g~~y~v~~d~~~~~~~~f~~~Al~tv~~~~~t~e~~~~ 307 (364) T protein:vir:10 232 VLKSWNTPIVPSNRFPKLSDNTEG----TGNTKHHKLSNAGNGNRYDVTAGQTSAQAVLFTQDALLVGRTISITGDIFYE 307 (364) T ss_pred eEEEeceEEEeccccccccccccc----cccccccccccccCCcccccccccceeEEEEEecceEEEEEEecceeeeeec Confidence 999999999999999986543322 2223344556777788887 7899999999999999999999999999999 Q ss_pred hhhHhhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 315 PEFQADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 315 ~~~~~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) +++|+|+|+++|+|||+++|||||++|..++++ T Consensus 308 ~~~~~~~ida~~a~G~g~lRPeaa~~i~~~~~~ 340 (364) T protein:vir:10 308 KKEKTWYIDTFLAEGAIPDRWEAVAVVTAADTA 340 (364) T ss_pred cceeeeeeeeehcccCcccCccceEEEEecCCC Confidence 999999999999999999999999999988888 No 10 >protein:vir:80213 Length: 334 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522884;genbank:gi:158345177;genbank:GeneID:5687476 Probab=100.00 E-value=2.3e-99 Score=561.34 Aligned_cols=327 Identities=15% Similarity=0.110 Sum_probs=286.3 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEeccccceeeeecCCCCCC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVMGRTKGYYLAPGENLD 80 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~iG~~t~~~~~~g~~~~ 80 (347) |||++. +..|||+|+++++| ++||||+|+|||+++|+++|+|++++++|+|++|||+|||++|+++++||+||++++ T Consensus 1 m~~~~~--~~~t~~~~~~~~~~-~~l~le~~~geV~~af~~~s~~~~~~~~r~i~~G~s~~~~~iG~~~~~~~~~g~~l~ 77 (334) T protein:vir:80 1 MTYPAA--NTHTRPGWGGANSD-VSLHIEEHLGLVDASFMYSSKFASWMNVRSLRGTNQLRVDRVGASTIAGRKAGEELV 77 (334) T ss_pred CCCCcC--CCccccccccccch-heehhhhhhhHHHHHHHHhhhhhccceeeeccccceEEEeeecceeeeeecCCCCCC Confidence 999877 45699999998887 569999999999999999999999999999999999999999999999999999998 Q ss_pred CCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccCcccC Q lcl|Aclame:pro 81 DKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIAGLGQ 160 (347) Q Consensus 81 ~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~g~~~ 160 (347) ++ ++++++++|+||+++|++++|||+|++|++||+|+|+++|+|++||+++||+|+++|+++++.+++.........+ T Consensus 78 ~~--~~~~~~~~l~ID~~l~~~~~VddiD~~q~~~D~rse~~~~~G~aLA~~~D~~~~~~l~kaa~~~~~~~~~~~~~~G 155 (334) T protein:vir:80 78 VQ--KNVSDKLNLTVDTVLYARHFFDKFDEWTSNLDVRKETAREDGIALARQYDQACIIQLQKCGDFLAPAHLKPAFHDG 155 (334) T ss_pred CC--CcccCceEEEEeeeeehhhhHhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccccCC Confidence 75 5899999999999999999999999999999999999999999999999999999999999877664333222222 Q ss_pred ceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCC---CCCEEEEChHHHHHHhcchhhhhhhcccc---ccccc Q lcl|Aclame:pro 161 AVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPA---GDRRFYCAPEDYSAILSALMPNAANYAAL---IDPET 234 (347) Q Consensus 161 ~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~---~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~---~~~~~ 234 (347) +......++ ...+.+.+++.+++++++|++.|+|++||+ .|||+||+|++|++||++++|+|++|.+. .++.+ T Consensus 156 ~~~~~~~~g-~~~~~~~~~~~l~~a~~~a~~~L~e~dvp~~~~~~R~~vv~P~~y~~Ll~~~r~~n~d~~~s~~~~~~~~ 234 (334) T protein:vir:80 156 ILLPSTISG-LAADAAADADVLVAAHRQGVEAMVFRDLGDQLMSEGVTLLDPVIFSFLLEHDRLMNVEFGAKEGGNSFVG 234 (334) T ss_pred cceeecccc-cccchhhhHHHHHHHHHHHHHHHHhcCCCCCcCCceEEEeChHHHHHHhcccccccceeccccccccccc Confidence 222222222 223556778889999999999999999994 67999999999999999999999999653 45789 Q ss_pred cceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeeccccc Q lcl|Aclame:pro 235 GNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARR 314 (347) Q Consensus 235 G~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~~ 314 (347) |+|++++||+||+|||+|....+.+.. .+..+.|++||+++++|||||+|+++++++++++|.+|+ T Consensus 235 g~i~~v~G~~V~~Sn~~P~~~~t~~~~--------------g~~~~~~agd~t~~~~~~~~~~Al~t~~~~~~~~e~~~~ 300 (334) T protein:vir:80 235 GRIAMLNGVRVVETPRFPQSAITANAL--------------GADFNVTDAEVRRKMITFIPSMALISAQVHPVSAQFWEE 300 (334) T ss_pred eeEEEEeceEEEeecCCCCcccccccc--------------ccccccccccccceEEEEEeCceEEEEEEeecceeeeec Confidence 999999999999999999754332211 245678999999999999999999999999999999999 Q ss_pred hhhHhhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 315 PEFQADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 315 ~~~~~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) +++|+|+|+++++|||+++||||++++.++-+= T Consensus 301 ~~~~~d~i~~~~a~G~g~lRPeaa~vv~~~~~~ 333 (334) T protein:vir:80 301 KKDFGHYLDTFQSYNIGQRRPDAVAVHDITVTN 333 (334) T ss_pred hhhHHHHHHHHHHcCCceeccceEEEEEEeeec Confidence 999999999999999999999999999876655 No 11 >protein:vir:6324 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877471;genbank:gi:33300843;uniprot:Q7Y2D3;genbank:GeneID:1482613 Probab=100.00 E-value=3e-99 Score=560.69 Aligned_cols=322 Identities=16% Similarity=0.128 Sum_probs=281.3 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEeccccceeeeecCCCCCC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVMGRTKGYYLAPGENLD 80 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~iG~~t~~~~~~g~~~~ 80 (347) |||+|. +|||||+++++|. +||||+|+|||+++|+|.|++++++++|+|++|||+|||++|+.+++||+||++++ T Consensus 1 ms~~~~----~tr~~~~~s~~d~-al~le~f~geV~~af~~~s~~~~~~~~rti~~g~s~~~~~iG~~~~~~~~pG~~l~ 75 (335) T protein:vir:63 1 MSFLND----LTRPNYAGKNADV-DIHLEEHLGIVDKHFAYTSKFAPLMNIRDLRGSNVVRLDRLGNVEAKGRRAGEELE 75 (335) T ss_pred CCCccc----chhhhcccccchh-heehhhhhhhHHHHHHhhhhhccccceeeeccceeEEEeeeeeeeeecccCCcCcC Confidence 999865 4999999999997 69999999999999999999999999999999999999999999999999999999 Q ss_pred CCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccCcccC Q lcl|Aclame:pro 81 DKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIAGLGQ 160 (347) Q Consensus 81 ~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~g~~~ 160 (347) +++ +.++|++|+||+++|++++|||+|++|+|||+|+|+++|+|++||+++||+++++|+++++..++......-+++ T Consensus 76 ~~~--~~~~k~~itVD~ll~a~~~I~dlDe~~~~yDvRse~s~e~G~aLA~~~D~~~~~~i~~aa~~~a~~~~~~~~~~G 153 (335) T protein:vir:63 76 RSR--VVNDKWNLTVDTLLYLRHQFDHQDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVDLEDAFSPG 153 (335) T ss_pred CCC--ccccceEEEecceeechhhhhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccCCCcCCC Confidence 874 788999999999999999999999999999999999999999999999999999999999887664322221122 Q ss_pred -ceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCC---CEEEEChHHHHHHhcchhhhhhhccc---ccccc Q lcl|Aclame:pro 161 -AVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGD---RRFYCAPEDYSAILSALMPNAANYAA---LIDPE 233 (347) Q Consensus 161 -~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~g---R~~vv~P~~~~~Ll~~~~~~~~~~~~---~~~~~ 233 (347) +..+.....+ ...+++.++++|++|.++|+|++||+++ ||++|+|++|++||++++|+|++|.. .+++. T Consensus 154 ~~~~~~~tg~~----~~~~~~~l~~a~~~a~~~L~e~dVP~~~~~dr~~vv~P~~y~~Ll~~~~l~n~~~~~s~~~~~~~ 229 (335) T protein:vir:63 154 VLEKLDLTGLT----AKQAADKIVRMHRRVVETFIDRDLGDAVYSEGLTPMSPRVFSLLLEHDKLMNVEYQATGATNDYV 229 (335) T ss_pred cceeeeeccCc----ccccHHHHHHHHHHHHHHHHhccCCCcccCceEEEeChHHHHHHhcccccccccccccccccccc Confidence 2222222221 2235677899999999999999999755 99999999999999999999999963 45689 Q ss_pred ccceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeecccc Q lcl|Aclame:pro 234 TGNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERAR 313 (347) Q Consensus 234 ~G~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~ 313 (347) +|+|++++||+|++|||+|....+.+.++ ...+.|++|+++.++|+|||+|++++|++++++|.+| T Consensus 230 ~g~v~~v~Gv~V~~sn~lP~~~~t~~~lg--------------~a~n~~~~d~~~~~~~~~~~~Al~t~~~~~vt~e~~~ 295 (335) T protein:vir:63 230 KSRVAILNGVKVLETPRFATKAIAAHPLG--------------RHFNVSAEESERQIALFLPSKTLITAQVAPVQAKLWE 295 (335) T ss_pred CceeEEeeceEEEeeccCCCCCccccccc--------------ccCCccccccceeEEEEEecceEEEEEEeecccceee Confidence 99999999999999999997643333221 2346799999999999999999999999999999999 Q ss_pred chhhHhhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 314 RPEFQADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 314 ~~~~~~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) ++++|+|+|+++|+|||+++||||||+|.++-.- T Consensus 296 ~~~~~~~~i~~~~a~G~g~lRPe~a~~i~~tg~~ 329 (335) T protein:vir:63 296 DNEKFSWVLDTFQMYNIGARRPDTAGAIELKGIG 329 (335) T ss_pred ccchhhHHhHHHHHcCCcccccceEEEEEEcCCC Confidence 9999999999999999999999999999865443 No 12 >protein:vir:78935 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522824;genbank:gi:158345059;genbank:GeneID:5687425 Probab=100.00 E-value=2.9e-98 Score=555.31 Aligned_cols=323 Identities=15% Similarity=0.118 Sum_probs=282.9 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEeccccceeeeecCCCCCC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVMGRTKGYYLAPGENLD 80 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~iG~~t~~~~~~g~~~~ 80 (347) |||+|. +|||||+++++|. +||||+|+|||+++|+++|++++++++|+|++|||+|||++|+.+++||+||++++ T Consensus 1 ms~~~~----~t~~~~~~s~~d~-al~le~f~geV~~af~~~s~~~~~~~~rti~~g~s~~~~~iG~~~~~~~~pG~~l~ 75 (335) T protein:vir:78 1 MSFLND----LTRPNYAGKNADV-DIHLEEHLGIVDKHFAYTSKFAPLMNIRDLRGSNVVRLDRLGNVEAKGRRAGEELE 75 (335) T ss_pred CCcccc----ccccccccccchh-hhhhhhhhhHHHHHHHHhhhhccccceeeeccceeEEEeeeeeeeecccccCcccC Confidence 999865 4999999999997 79999999999999999999999999999999999999999999999999999999 Q ss_pred CCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccCcccC Q lcl|Aclame:pro 81 DKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIAGLGQ 160 (347) Q Consensus 81 ~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~g~~~ 160 (347) +++ ++++|++|+||+++|++++|||+|++|+|||+|+|+++|+|++||+++||+++++++++++..++......-+++ T Consensus 76 ~~~--~~~~k~~itID~ll~a~~~VddlDe~~~~yDvR~e~s~~~G~aLA~~~Dq~~~~~l~~aa~~~a~~~~~~~~~~G 153 (335) T protein:vir:78 76 RSR--VVNDKWNLTVDTLLYLRHQFDHQDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVDLEDAFSPG 153 (335) T ss_pred CCC--cccCCeEEEecceeechhhHhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccCCCcCCC Confidence 874 789999999999999999999999999999999999999999999999999999999999887665432222222 Q ss_pred ceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCC---CCEEEEChHHHHHHhcchhhhhhhccc---cccccc Q lcl|Aclame:pro 161 AVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAG---DRRFYCAPEDYSAILSALMPNAANYAA---LIDPET 234 (347) Q Consensus 161 ~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~---gR~~vv~P~~~~~Ll~~~~~~~~~~~~---~~~~~~ 234 (347) +......++. +...+++.++++++++.+.|+|++||+. |||++|+|++|++||++++|+|++|.. .+++.+ T Consensus 154 ~~~~~~~tg~---~~~~~~~~l~~a~~~a~~~l~ekdvP~~~~~~rv~vv~P~~y~~Ll~~~~l~n~~~~~s~~~~~~~~ 230 (335) T protein:vir:78 154 VLEKLDLTGL---TAKEAAEKIVRMHRRVVETFIERDLGDAVYSEGLTPMSPRVFSLLLEHDKLMSVEYQATGATNDYVK 230 (335) T ss_pred cceeeeeccc---cccccHHHHHHHHHHHHHHHHhccCCCCCCCccEEEeChHHHHHHhccccccccccccccccccccc Confidence 2222222221 2345678899999999999999999975 699999999999999999999999963 356899 Q ss_pred cceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeeccccc Q lcl|Aclame:pro 235 GNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARR 314 (347) Q Consensus 235 G~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~~ 314 (347) |+|++++||+|++|||||....+.+.++ ...+.|++|++++++|+||++|++++++++++.|.+|+ T Consensus 231 g~v~~v~Gv~V~~Sn~lP~~~~t~~~lg--------------~a~n~~~~d~~~~~~~~~~~~Al~t~~~~~~~~e~~~~ 296 (335) T protein:vir:78 231 SRVAILNGVKVLETPRFATKAISAHPLG--------------RHFNVSAEEAERQIALFLPSKTLITAQVAPVQAKLWED 296 (335) T ss_pred ceeEEeeceEEEeeccCCCCCCcccccc--------------ccCCcccccccceEEEEEecceEEEEEEEecccceeec Confidence 9999999999999999997643333221 23466888999999999999999999999999999999 Q ss_pred hhhHhhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 315 PEFQADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 315 ~~~~~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) +++|+|+|+++|+|||+++||||||+|.++-.- T Consensus 297 ~~~~~~~i~~~~a~G~g~lRPe~a~~i~~tg~~ 329 (335) T protein:vir:78 297 HDQFSWVLDTFQMYNIGARRPDTAGAIELKGIE 329 (335) T ss_pred cchhhHhhhHHHHcCCcccCcceEEEEEecCCC Confidence 999999999999999999999999999866543 No 13 >protein:vir:97031 Length: 402 # NCBI annotation: 31 # Family: family:all:2806 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654132;genbank:gi:108862016;genbank:GeneID:5075980 Probab=100.00 E-value=2.7e-97 Score=549.97 Aligned_cols=328 Identities=14% Similarity=0.102 Sum_probs=278.9 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEeccccceeeeecCCCCCC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVMGRTKGYYLAPGENLD 80 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~iG~~t~~~~~~g~~~~ 80 (347) ||++|+ .|||||++++ |.++||||+|+|||+++|+++|++++++++|+|++|||++||++|+++++||+||++++ T Consensus 1 Ms~~n~----~t~~~~~~s~-~~~al~le~f~geV~taF~~~si~~~~~~vrti~~GkS~qf~~iG~~~a~y~~~G~~ld 75 (402) T protein:vir:97 1 MSTPNT----LTNVAVSASG-EVDSLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYLGETELQVLAPGQSPN 75 (402) T ss_pred CCCccc----cccccccccc-chhhhhhhhhhhhHHHHHHHHHhhcCcceeeeecccceEEEEEEeeeEEeeeccccccC Confidence 999876 4889988554 77889999999999999999999999999999999999999999999999999999998 Q ss_pred CCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcc-hHHHHHHHHHHHHHHHHHHHHHHHHHHhhhc-ccccccccCcc Q lcl|Aclame:pro 81 DKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYD-VRAEYSAQLGEALAIAADGAVLAEMAKLCNL-PAASNENIAGL 158 (347) Q Consensus 81 ~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D-~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~-a~~~~~~~~g~ 158 (347) ++ ++.++|++|+||+++|++++|+|+||+|+||| +|+|+++|+|++||+++||+|++++..+++. +.+..+...+. T Consensus 76 g~--~~~~~k~~ItID~lL~a~~~V~diDeaq~~yD~vRse~s~e~G~ALA~~~Dq~ii~~i~~aa~a~t~~~~~~~~~~ 153 (402) T protein:vir:97 76 AT--PTQADKNQLVIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVK 153 (402) T ss_pred CC--CcccccEEEEeCceeechhhhhhHHHHHhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccCccc Confidence 74 68899999999999999999999999999999 8999999999999999999998877655542 22223333333 Q ss_pred cCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhcc--ccccccccc Q lcl|Aclame:pro 159 GQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYA--ALIDPETGN 236 (347) Q Consensus 159 ~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~--~~~~~~~G~ 236 (347) ..+..+... +...+...++++|+++|+++.++|||++||.+|||++|+|++|++||++++|+|++|. +.+++.+|+ T Consensus 154 ~~g~s~~~~--~t~~~a~~~~~~l~~ai~~a~~~LdEkdVP~~dRv~vv~P~~y~~Ll~~~rl~n~d~~~~~~g~~~~G~ 231 (402) T protein:vir:97 154 GHGFSINVN--VTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATINGF 231 (402) T ss_pred ccccccccc--cccchhhcCHHHHHHHHHHHHHHHHhcCCCccccEEEeChHHHHHHhhcccccchhhccccCCccccce Confidence 333333322 2223446688899999999999999999999999999999999999999999999994 567799999 Q ss_pred eEEEeceeEEEeccccccccccccccCcccccccccccccccccccc--ccccceeEEeechhhhhhhhhhheeeccccc Q lcl|Aclame:pro 237 IRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDR--VAQNNVVGLFNHRSAVGTVKLKDMALERARR 314 (347) Q Consensus 237 v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~--~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~~ 314 (347) |++++||+||+|||||.... ....+.+.+++.+..|. +|++++++|+|||+|++++|+++++.|.||| T Consensus 232 v~~v~Gv~Vv~SnnlP~~a~----------~it~~~ls~a~~G~~y~~t~d~t~~~~~~f~~~Av~tvk~~~vT~~~~~d 301 (402) T protein:vir:97 232 VLSSYNCPVIPSNRFPTFAQ----------DQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYE 301 (402) T ss_pred eEEEeceEEEecCccccccc----------cccccccccCCCCccCCcCcccceeEEEEEecceEEEEEeeccccchhhc Confidence 99999999999999997431 11223333445555554 8999999999999999999999999999999 Q ss_pred hhhHhhHHhhhhhhcCcccccceEEEEEecC------CC Q lcl|Aclame:pro 315 PEFQADQIIGKYAMGHGGLRPEAAGALVFTP------AA 347 (347) Q Consensus 315 ~~~~~d~i~~~~~~G~~~lRPe~~~~l~~~~------aa 347 (347) +++|+|+|+++++|||+++||||++++.+-- ++ T Consensus 302 ~r~~~~~id~~~a~G~g~~RPeaa~vv~~~~~~t~~~~~ 340 (402) T protein:vir:97 302 KKEKTYYIDTFMAEGAIPDRWEAVSVVTTKRDATTGDAG 340 (402) T ss_pred hhHHHHHHHHHHHhCCcccCccceEEEEEecccccccCC Confidence 9999999999999999999999999995322 12 No 14 >protein:vir:78739 Length: 332 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285448;genbank:gi:148724482;genbank:GeneID:5220210 Probab=100.00 E-value=5.8e-97 Score=548.18 Aligned_cols=318 Identities=27% Similarity=0.388 Sum_probs=280.8 Q ss_pred CCCCccCccccccCcccCccccHH-HHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEeccccceeeeecCCCCC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKL-ALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVMGRTKGYYLAPGENL 79 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~-al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~iG~~t~~~~~~g~~~ 79 (347) |+++|. .|+||+++++|++ |||||+|+|||+++|++.|+++++++.|++++|||+|||++|++++++|++|+++ T Consensus 7 ~~~~~~-----~~~~~~~~~~d~~~al~le~~~geV~~~f~~~s~~~~~~~~r~i~~G~tv~i~~ig~~~~~~~~~g~~l 81 (332) T protein:vir:78 7 FSLPNQ-----ANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFTGKLSAGYHTPGTPI 81 (332) T ss_pred ccCCcc-----ccCCccccccccchhhhhhhhhhhHHHHHHHHhhhhhccccccccccceEEEEeccceeEeeecCCCCC Confidence 777766 7889999999976 9999999999999999999999999999999999999999999999999999999 Q ss_pred CCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccCccc Q lcl|Aclame:pro 80 DDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIAGLG 159 (347) Q Consensus 80 ~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~g~~ 159 (347) +++ +++++++++|+||+.+|++|+|||+|++|+++|+|+++++++|++||+++|++|+++++++++..+ ...+.+ T Consensus 82 ~~~-~~~~~~~~~l~ID~~ky~~~~VddiD~~q~~~dl~~~~~~~~g~aLA~~~D~~i~~~l~~aa~~~~----~~~~~~ 156 (332) T protein:vir:78 82 VGD-AGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEAS----PVTGEP 156 (332) T ss_pred CCC-CCCCCceEEEEEehhhhhHHHHHhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccC----cccccc Confidence 764 569999999999999999999999999999999999999999999999999999999998876543 334455 Q ss_pred CceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhc--chhhhhhhccc-cccccccc Q lcl|Aclame:pro 160 QAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILS--ALMPNAANYAA-LIDPETGN 236 (347) Q Consensus 160 ~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~--~~~~~~~~~~~-~~~~~~G~ 236 (347) ++..+.++++. +.+++++|++|++|+++|+|++||.+|||+||+|++|+.||+ +++|++.++.+ ++.+.+|+ T Consensus 157 g~~~~~~~~~~-----~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~d~~~~n~~~~~~~~~~~~g~ 231 (332) T protein:vir:78 157 GGFHVNIGAGN-----TNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGK 231 (332) T ss_pred cccccccCCcc-----ccCHHHHHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHHhhcCceeeeeeccccccceecce Confidence 55555554432 234567899999999999999999999999999999999998 78999999976 46677875 Q ss_pred -eEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeecc---c Q lcl|Aclame:pro 237 -IRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALER---A 312 (347) Q Consensus 237 -v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~---~ 312 (347) |++++||+||+|||||..... .......++..++|+++|+++++|+|||+|+++++++++++|. . T Consensus 232 ~i~~i~G~~V~~Sn~lp~~~g~-----------~~~~~~~~~~~n~~~~~~~~~~~~~~h~~a~~~v~~~~~~~~~t~~~ 300 (332) T protein:vir:78 232 GLYSIAGIRILKSNNLAGLYGQ-----------DLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGD 300 (332) T ss_pred eeeEEeeeEEEecCccccCccc-----------ccccccccccccccccccccceEEeecccceeeeeeeccchhhhhcc Confidence 899999999999999964322 2233345677889999999999999999999999999987764 6 Q ss_pred cchhhHhhHHhhhhhhcCcccccceEEEEEec Q lcl|Aclame:pro 313 RRPEFQADQIIGKYAMGHGGLRPEAAGALVFT 344 (347) Q Consensus 313 ~~~~~~~d~i~~~~~~G~~~lRPe~~~~l~~~ 344 (347) |+++||+|+|+++|+||++++||||+++|+.+ T Consensus 301 ~~~~~~~d~i~~~~~~G~~v~rPe~~v~l~~a 332 (332) T protein:vir:78 301 FNVQYQGDLIVGKLAMGCGSLRTSVAGSFQAA 332 (332) T ss_pred cchhhhHhhhhhhhhhcCceecccceEEEeeC Confidence 79999999999999999999999999999988 No 15 >protein:vir:7019 Length: 401 # NCBI annotation: major capsid protein # Family: family:all:2806 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853592;genbank:gi:31711674;genbank:GeneID:1481800 Probab=100.00 E-value=2.9e-95 Score=538.88 Aligned_cols=327 Identities=14% Similarity=0.106 Sum_probs=284.5 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEeccccceeeeecCCCCCC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVMGRTKGYYLAPGENLD 80 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~iG~~t~~~~~~g~~~~ 80 (347) |+++|+ +|||||++++ |.++||||+|+|||+++|+++|++++++++|+|++|||+|||++|+.+++||+||++++ T Consensus 1 Ms~~n~----~t~~~~~~sg-~~~al~Le~f~GeV~taF~~~si~~~~~~vRti~~gkS~qf~~~G~s~~~~~~pG~~ld 75 (401) T protein:vir:70 1 MSTPNN----LTNVAVSASG-EVDSLLIEKFNGKVNEQYLKGENIMSYFDVQTVTGTNTVSNKYLGETELQVLAPGQSPA 75 (401) T ss_pred CCCCcc----cccccccccc-chhHhHHhHhcchHHHHHHHHhhhcccceeeeecccceEEEEEeeeeEeeeecCCCCcC Confidence 999876 4888988555 77889999999999999999999999999999999999999999999999999999998 Q ss_pred CCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcc-hHHHHHHHHHHHHHHHHHHHHHHHHHHhhh-cccccccccCcc Q lcl|Aclame:pro 81 DKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYD-VRAEYSAQLGEALAIAADGAVLAEMAKLCN-LPAASNENIAGL 158 (347) Q Consensus 81 ~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D-~r~~~~~~~g~aLa~~~D~~il~~l~~~a~-~a~~~~~~~~g~ 158 (347) ++ ++.++|++|+||+++|++++|+|+|++|+||| +|+||++|+|++||+++||+|++.+..++. .+++...++.+. T Consensus 76 ~~--~~~~dK~~ItID~lL~a~~~V~dlDe~q~~yD~vRse~s~e~G~ALA~~~Dq~iiq~i~~aa~ana~~~~~~p~~~ 153 (401) T protein:vir:70 76 AT--STQADKNQLVIDATVIARNTVAHLHDVQGDIDSLKPKLATNQAKQLKRMEDEMLIQQMMLGGIANTQAKRTNPRVK 153 (401) T ss_pred CC--CcccccEEEEeCceeehhhhhhhHHHHHhcccccchHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccCCCcC Confidence 64 68999999999999999999999999999999 999999999999999999999877755443 244555667777 Q ss_pred cCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEE-ChHHHHHHhcchhhhhhhcc--cccccccc Q lcl|Aclame:pro 159 GQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYC-APEDYSAILSALMPNAANYA--ALIDPETG 235 (347) Q Consensus 159 ~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv-~P~~~~~Ll~~~~~~~~~~~--~~~~~~~G 235 (347) ++|..++++.+.. +...+++.|+++|++|++.|+|++||.+ |+++| +|.+|+.|+++++++|++|+ +.+++.+| T Consensus 154 ~~G~~i~v~~~~~--~~~~~~~~l~~ai~dA~~~LdEkdVP~~-r~vvl~pp~~Ys~Ll~~d~L~nrd~~~s~~g~~~~G 230 (401) T protein:vir:70 154 GHGFSINVEVAEG--EALVNPQYVMAAVEFALEQQLEQEVDIS-DVAILMPWRYFNVLRDADRIVDKTYTISQSGATIQG 230 (401) T ss_pred CCceEEecccccc--ccccCHHHHHHHHHHHHHHHHhcCCCcc-ceEEEcCHHHHHHHHhcCcccchhhccccCCccccc Confidence 8888888865443 3456778899999999999999999965 66655 77888899999999999986 45779999 Q ss_pred ceEEEeceeEEEeccccccccccccccCcccccccccccccccccccc--ccccceeEEeechhhhhhhhhhheeecccc Q lcl|Aclame:pro 236 NIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDR--VAQNNVVGLFNHRSAVGTVKLKDMALERAR 313 (347) Q Consensus 236 ~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~--~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~ 313 (347) +|.+++||+||+|||+|.+..+ ..++.+.+++.+..|. +|++++++|+|||+|++++|+++++.|.|| T Consensus 231 ~v~~vaGv~Vv~SnnlP~~a~~----------it~~~ls~a~~G~~y~~~~d~s~~~~v~f~~~Av~tvk~~~lt~~~~~ 300 (401) T protein:vir:70 231 FTLSSYNCPVIPSNRFPKYSQG----------QTHHLLSNEDNGYRYDPLPAMNGAIAVLFTADALLVGRSIDVTGDIFY 300 (401) T ss_pred eEEEEeceEEEeeccccccccc----------cccccccccCCCccCCCCccccceeEEEEehhheEEEEeeccccchhh Confidence 9999999999999999975322 2234444555565555 899999999999999999999999999999 Q ss_pred chhhHhhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 314 RPEFQADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 314 ~~~~~~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) |+++|+|+|+++++|||+++||||++++.+.-.. T Consensus 301 d~r~~~~~id~~~a~g~g~~RPeaa~vv~~k~~~ 334 (401) T protein:vir:70 301 EKKEKTYYIDTFMAEGAIPDRWEAVSVVTTKRNT 334 (401) T ss_pred hhhhhHHHHHHHHHhCCcccchhheEEEeecCcc Confidence 9999999999999999999999999998765553 No 16 >protein:vir:105645 Length: 400 # NCBI annotation: putative major capsid protein # Family: family:all:2806 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425009;genbank:gi:83571757;uniprot:Q2WC43;genbank:GeneID:3837286 Probab=100.00 E-value=5e-95 Score=537.58 Aligned_cols=328 Identities=13% Similarity=0.102 Sum_probs=279.4 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEeccccceeeeecCCCCCC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVMGRTKGYYLAPGENLD 80 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~iG~~t~~~~~~g~~~~ 80 (347) ||++|+ +|||||++++ |.++||||+|+|||+++|+++|++++++++|+|++|||+|||++|+++++||+||++++ T Consensus 1 Ms~~n~----~t~p~~~gsg-~~~aL~Le~f~GeV~taF~~~si~~~~~~vRtI~~gkS~qf~~lG~s~a~y~~pG~~ld 75 (400) T protein:vir:10 1 MSTPNN----LTNVAVSASG-EVDSLLIEKFNGKVNEQYLKGENIMSYFDVQTVTGTNTVSNKYLGETELQVLAPGQSPA 75 (400) T ss_pred CCCCcc----cccccccccc-chhhhHHhHhcchHHHHHHHHhhhcccceeeeecccceEEEEEeeeeEEeeecCCCCcC Confidence 999876 4888988555 77789999999999999999999999999999999999999999999999999999998 Q ss_pred CCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcc-hHHHHHHHHHHHHHHHHHHHHHHHHHHhhhc-ccccccccCcc Q lcl|Aclame:pro 81 DKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYD-VRAEYSAQLGEALAIAADGAVLAEMAKLCNL-PAASNENIAGL 158 (347) Q Consensus 81 ~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D-~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~-a~~~~~~~~g~ 158 (347) ++ ++.++|++|+||+++|++++|+|+||+|+||| +|+||++|+|++||+++||++++++..++.. ...+.....|. T Consensus 76 g~--~~~~dk~~ItIDtLL~a~~~V~dlDd~q~~yD~vRse~s~e~G~ALA~~~Dq~iiq~i~~a~~a~t~~~~~~~~g~ 153 (400) T protein:vir:10 76 AT--STQADKNQLVIDATVIARNTVAHLHDVQGDIDSLKPKLATNQAKQLKKMEDEMLIQQMLLGGIANTQAKRTNPRVK 153 (400) T ss_pred CC--CcccCcEEEEeCceeeecchhhhHHHHhhccccccHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCCcc Confidence 75 58999999999999999999999999999999 9999999999999999999999887665422 12223333443 Q ss_pred cCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhcc--ccccccccc Q lcl|Aclame:pro 159 GQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYA--ALIDPETGN 236 (347) Q Consensus 159 ~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~--~~~~~~~G~ 236 (347) ..+..+.+.+.. .....+++.+.++|++|.+.|+|++||.++++++++|++|++|+++++++|++|+ +++++.+|+ T Consensus 154 ~~g~s~~v~~~~--~~~~~~~~~l~~A~~~A~~~LdEkdVP~~d~vvl~pp~~Ys~Ll~~dkLvnrdf~~s~~g~~~~g~ 231 (400) T protein:vir:10 154 GHGFSVNVEVNE--GEALVNPQYVMAAVEFALEQQLEQEVDISDVAILMPWRYFNVLRDADRIVDKSYTISQSGATIQGF 231 (400) T ss_pred ccccceeecccc--cccccCHHHHHHHHHHHHHHHHhcCCCccceEEEcCHHHHHHHHhCCcccchhccccCCCccccce Confidence 333334332222 2334577888999999999999999997766666688888899999999999996 457799999 Q ss_pred eEEEeceeEEEeccccccccccccccCcccccccccccccccccccc--ccccceeEEeechhhhhhhhhhheeeccccc Q lcl|Aclame:pro 237 IRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDR--VAQNNVVGLFNHRSAVGTVKLKDMALERARR 314 (347) Q Consensus 237 v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~--~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~~ 314 (347) |.+++|++||+|||+|.... ...++...+++.+..|. +|++++++|+|||+|++++|+++++.|.||| T Consensus 232 v~~v~Gv~Iv~Sn~lP~~a~----------~~~~~~lS~a~~G~~y~~t~d~s~~~av~F~~sAv~tvk~~~lt~~~~~d 301 (400) T protein:vir:10 232 VLSSYNCPVIPSNRFPKYSQ----------GQKHHLLSNEDNGYRYDPIAEMNGAIAVLFTADALLVGRSIDVIGDIFYE 301 (400) T ss_pred EEEEeceEEEeeCcCCcccC----------cccccccccCCCCccCCccccccceeEEEEehhheEEEEeeccccccccc Confidence 99999999999999996421 22234444556666665 8999999999999999999999999999999 Q ss_pred hhhHhhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 315 PEFQADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 315 ~~~~~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) +++|+|+|+++++||++++||||++++.+.-.+ T Consensus 302 ~r~~~~~id~~~a~G~g~~RPeaa~vv~~~~~~ 334 (400) T protein:vir:10 302 KKEKTYYIDTFMSEGAIPDRWEAVSVVTTKRQS 334 (400) T ss_pred hhhHHHHHHHHHHhCCcccchhheEEEEecCCc Confidence 999999999999999999999999999988777 No 17 >protein:vir:99675 Length: 324 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249589;genbank:gi:68299740;genbank:GeneID:3799990 Probab=100.00 E-value=5.7e-89 Score=504.37 Aligned_cols=296 Identities=63% Similarity=0.932 Sum_probs=255.9 Q ss_pred ccccccCCceEEEeccccceeeeecCCCCCCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHH Q lcl|Aclame:pro 50 MVRTIQNGKSASFPVMGRTKGYYLAPGENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEAL 129 (347) Q Consensus 50 ~~rti~~G~tv~i~~iG~~t~~~~~~g~~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aL 129 (347) ++|+|++|||++||++|++++++|+||+++++++++++++|++|+||+++|++|+|||+|++|++||+|+++++|+|++| T Consensus 1 ~vr~i~~g~s~~~~~iG~~~~~~~~~G~~l~~~~~~~~~~e~~itID~~l~~~~~VdDiD~~qa~~Dlr~e~s~~~G~aL 80 (324) T protein:vir:99 1 MTRTITSGKSAQFPVMGRTKARYLKQGQSLDDGREDIKHTEKVITIDGLLTTDVLIYDIEDAMNHYDVRSEYSTQMGEAL 80 (324) T ss_pred CeeeeecCceEEEeeeeeeEeccccCCCCcCCCcCCcCcccEEEEecchhhhhhhhhhHHHHhcCccchhHHHHHHHHHH Confidence 89999999999999999999999999999998888899999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHhhhcccccccc-cCcccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEC Q lcl|Aclame:pro 130 AIAADGAVLAEMAKLCNLPAASNEN-IAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCA 208 (347) Q Consensus 130 a~~~D~~il~~l~~~a~~a~~~~~~-~~g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~ 208 (347) |+.+||+|+++++++++.+++.... ..+.+++..+.. +++. .+++..++++++.|++|+++|||++||++|||+||+ T Consensus 81 A~~~Dq~i~~~~a~~~~~~a~~~~~~~~~~g~~~~~~~-~~~~-~~~~~~~~~~~dai~~a~~~Lde~~VP~~gR~~vv~ 158 (324) T protein:vir:99 81 AMAADVANYAEMAKLVNSRKETTNENIEGLGAASLVKI-TGKK-EDPAKYGTQVIQALTYARAAFAKKYIPAGDRTFYTD 158 (324) T ss_pred HHHHHHHHHHHHHHhhhcccccccCCcccCCccceecc-cccc-cccccCHHHHHHHHHHHHHHHhhcCCCCCCCEEEeC Confidence 9999999999999988776655443 334344444443 3322 355677889999999999999999999999999999 Q ss_pred hHHHHHHhcchhhhhhhccccccccccceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccc Q lcl|Aclame:pro 209 PEDYSAILSALMPNAANYAALIDPETGNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNN 288 (347) Q Consensus 209 P~~~~~Ll~~~~~~~~~~~~~~~~~~G~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~ 288 (347) |++|++||++.++++.+|.+++.+++|+|++++||+||+|||+|....+......+.....-......+...+|++|+++ T Consensus 159 P~~y~~Ll~~~~~~~~~~~~~~~~~~G~V~~i~Gf~V~~Sn~lp~~~~t~~~~a~~~~~~~~~~~~~~~~~~ky~~d~~~ 238 (324) T protein:vir:99 159 PDTYSAILAALMPNAANYAALIDPETGNIRNVMGFEVVETPHMTAQMVTNPTDAFDGTGHIFPATGDSTTTGKMTVGADN 238 (324) T ss_pred hHHHHHHhhcccccccccccccceecceEEEEeceEEEecCCccccccccccccccccccccccccccccccccccccCc Confidence 99999999888888999999999999999999999999999999865544333222222222222233445789999999 Q ss_pred eeEEeechhhhhhhhhhheeeccccchhhHhhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 289 VVGLFNHRSAVGTVKLKDMALERARRPEFQADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 289 ~~~l~~h~~A~~tv~~~~~~~e~~~~~~~~~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) ++||+||++|+++++++++++|.+|+++||+|+|+++|+|||+++||||++++.+.+.| T Consensus 239 ~~gl~~~~~a~~tv~~~~~~~e~~~~~~~~~d~i~~~~a~G~~~lRPe~a~~v~l~~~~ 297 (324) T protein:vir:99 239 VVGLFVHRSAVATLKLKDMALERARRPEYQADQIIAKYAMGHGGLRPEAVGAIIFEDGE 297 (324) T ss_pred eeEEEEehhheEEEeeecceecceechhhHHHhhhhhhhhcCcccccceEEEEEEccCc Confidence 99999999999999999999999999999999999999999999999999988866665 No 18 >protein:vir:94622 Length: 341 # NCBI annotation: PfWMP4_37 # Family: family:all:2203 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762667;genbank:gi:115304375;genbank:GeneID:5142322 Probab=100.00 E-value=1.2e-72 Score=414.89 Aligned_cols=324 Identities=16% Similarity=0.115 Sum_probs=260.0 Q ss_pred CCCCccCccccccCcccCccccHHHHHH-HHHhHHHHHHHHHHHhhhccccccc--ccCCceEEEeccccceeeeecCCC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFL-KVFGGEVLTAFVRRSVTMDKHMVRT--IQNGKSASFPVMGRTKGYYLAPGE 77 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~i-e~f~geV~~~f~~~s~~~~~~~~rt--i~~G~tv~i~~iG~~t~~~~~~g~ 77 (347) ||=.|.. |+ ..-+...+.. || |+|+++|++.|++++++.++++.++ +++|+|||||++|++++++|++|. T Consensus 1 ~~~~~~~----~~--~~~~t~~v~~-fipei~s~~i~~~l~~~~v~~~~~~d~~~~~~~Gdtv~ip~~g~~~~~d~~~~~ 73 (341) T protein:vir:94 1 MALGNTI----TG--PSINTQRGQQ-FIPEQWLSEVQMFRKAKMLDTSVVKTWGAQVKKGDTFHVPRISELGVEDKATDV 73 (341) T ss_pred Ccchhhh----cc--ccccchhHHH-HHHHHHHHHHHHHHHhhcchhhccccccccccCCceEEEeccCcceeeeecCCC Confidence 5543332 22 3335556654 55 9999999999999999999988664 567999999999999999999999 Q ss_pred CCCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccCc Q lcl|Aclame:pro 78 NLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIAG 157 (347) Q Consensus 78 ~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~g 157 (347) .++. +++++++++|+||+++|+++.|+|+|+.|+++|+|++++++++++||+++|+.|+..++.++..+... . T Consensus 74 ~i~~--~~~~~~~~~itiD~~~~~~~~i~d~d~~~~~~d~~~~~~~~~~~aLA~~~D~~i~~~~a~~~~~~~~~-----~ 146 (341) T protein:vir:94 74 PVGV--QPVNDTDFVITVDTDRTTAVALDDLLEIQASYDLRAPYLEAMGYALAKDMTGSILGLRAAVQNTASQN-----V 146 (341) T ss_pred cccc--ccccCceEEEEEeeeeecceeechHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCc-----c Confidence 8864 57899999999999999999999999999999999999999999999999999998775543222111 0 Q ss_pred ccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhccccccccccce Q lcl|Aclame:pro 158 LGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALIDPETGNI 237 (347) Q Consensus 158 ~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~v 237 (347) ...... ....++ ....++.|++++++|||++||.+|||+||+|++|+.|+++++|++.++.++..+++|.| T Consensus 147 ~~~~~~------~~t~~~---~~~~~~~i~~a~~~Lde~~VP~~gR~lvv~P~~~~~Ll~~~~~~~~~~~g~~~l~~G~i 217 (341) T protein:vir:94 147 FSSSNG------AITGNG---QAFSFAVFLAARRLLLEADVPEEKIVLLISPGQESALFTIPQFISKDFINNAPIAQGQI 217 (341) T ss_pred ccCccc------cccCch---hhhhHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHhhchhhhhhhccccchhheeee Confidence 000000 000011 11236889999999999999999999999999999999999999999998888999999 Q ss_pred EEEeceeEEEeccccccccccccccCcccc--ccccccccccccccccccccceeEEeechhhhhhhhhhh--------- Q lcl|Aclame:pro 238 RNVMGFEVIEVPHLTVGGAGDNNPADGVAP--TNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKD--------- 306 (347) Q Consensus 238 ~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~--t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~--------- 306 (347) ++++||+|++||++|....+......+... .............+|+++++.++||+||++|++++|.++ T Consensus 218 g~i~G~~V~~Sn~lp~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~gl~~~~~av~~~k~~~~~~~~~~~~ 297 (341) T protein:vir:94 218 GSLMGVRVIRTSLIGNNSATGWRNGAPTIAPAEATPGFTGSRYLPKQDSFTSLPATFTGNSRPVHTAVMCHMDWAAAVVS 297 (341) T ss_pred eeEeceEEEEeccccccccccccccccceecccccccccccccccccccccccEEEEEEecccccceeeecchhhhcccc Confidence 999999999999999876554433322211 112223344455779999999999999999999998544 Q ss_pred --eeeccccchhhHhhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 307 --MALERARRPEFQADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 307 --~~~e~~~~~~~~~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) +.+|..|+++||+|+|+++|+|||++|||||||.|.++++. T Consensus 298 ~~~~~~~~~~~~~~~~~i~~~~~~G~~~lrp~~~v~~~~~~~~ 340 (341) T protein:vir:94 298 KAPRVTQSFENREQVWLMVGRQAYGARLYRPLHAVNIHTTGDT 340 (341) T ss_pred ccccccccchhhhhhhhhhhhhhhcccccCcceeEEEecCcCC Confidence 67888899999999999999999999999999999888888 No 19 >protein:vir:80180 Length: 381 # NCBI annotation: capsid protein # Family: family:all:2203 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285797;genbank:gi:148747831;genbank:GeneID:5220456 Probab=100.00 E-value=1.6e-67 Score=386.85 Aligned_cols=331 Identities=18% Similarity=0.235 Sum_probs=263.7 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhccccccc--ccCCceEEEeccccceeeeecCCCC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRT--IQNGKSASFPVMGRTKGYYLAPGEN 78 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rt--i~~G~tv~i~~iG~~t~~~~~~g~~ 78 (347) ||++++ .+...|++.+..+..++..|+|+++|++.|++.+++..+++.+. .++|+|||||++|++++.+|++|++ T Consensus 1 ~~~~~~---~~~~~~~~~~~t~~~~fiPev~s~~v~~~l~~~lv~~~l~~~~~~~~~~GdTV~ip~~g~~~a~d~~~g~~ 77 (381) T protein:vir:80 1 MATIQG---TGGYKGSAVDLSNVQVFIPEVWSSEVRMFRDQKFAALEATKKIPFEGKKGDLIHIPNISRAAVYDKQPQTP 77 (381) T ss_pred Cceecc---cccccCcccchhhHHhhhhHHHHHHHHHHHHHhhhhhhccccccceeecCceEEeeccCcceeeeecCCCc Confidence 999985 26788888888888765559999999999999999998877664 4789999999999999999999998 Q ss_pred CCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccCcc Q lcl|Aclame:pro 79 LDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIAGL 158 (347) Q Consensus 79 ~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~g~ 158 (347) +.. +++++++++++||+++|++++|+|+|+.|+++|+|++++++++++||+++|+.|+..+.+.......... T Consensus 78 i~~--~~~~~~~~~itID~~~~~~~~Idd~D~~~~~~D~~~~~~~~~~~aLA~~~D~~i~~~~~~~~~~~~~~~~----- 150 (381) T protein:vir:80 78 VNL--QARTDSEFTFTVTKYKESSFMIEDIVNTQASYTLRQYYTKEAGYALARDMDNFALAHRAVINAFPSQRIY----- 150 (381) T ss_pred ccc--cccCCceEEEEEeeeeecceeechHHHHhhccChHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccc----- Confidence 864 5689999999999999999999999999999999999999999999999999999887655443322111 Q ss_pred cCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhccccccccccceE Q lcl|Aclame:pro 159 GQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALIDPETGNIR 238 (347) Q Consensus 159 ~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~v~ 238 (347) ..+..+..++... ..........++.|++|+++|||++||.+|||+||+|++|+.||++++|++.+|.++..+++|.|+ T Consensus 151 t~~~~i~~~~~~~-~~t~~~~~~t~~~i~~a~~~Lde~~VP~egR~lvv~P~~~~~Ll~~~~~~~ad~~~~~~l~~G~Ig 229 (381) T protein:vir:80 151 SYDTTLGDGTVNA-HLTGTPAPLTYAALLLAKQKLDEADVPQEGRIVMVSPAQYIDLLSINQFISVDFSQVKPVTSGVVG 229 (381) T ss_pred ccccccccccccc-ccccchhhHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhhchhhhhhhhccchhhhceeee Confidence 1111111111111 111223345689999999999999999999999999999999999999999999888889999999 Q ss_pred EEeceeEEEeccccccccccccccCccccccccccccccccccccccccc------------------------------ Q lcl|Aclame:pro 239 NVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNN------------------------------ 288 (347) Q Consensus 239 ~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~------------------------------ 288 (347) +++||+|++||++|....+......+. .....+...+..|.++++. T Consensus 230 ~i~G~~Vv~Sn~lp~~~~t~~~~~aga----p~~~~~~~~~~~~~g~~s~~a~av~~~k~yd~~~~~~~~~~~~~~g~~~ 305 (381) T protein:vir:80 230 TILGMEVIVTTQIGINSLTGYVNGQGA----PTQPTPGVLGSPYLPDQAGTANVVNTGSASDLAVSLSYFGLPVFSGAGA 305 (381) T ss_pred EEcceEEEeecccccccccceeeeccc----cccccccccccccccccccceeeeeeeeeeceeeeeeeccceeeeccee Confidence 999999999999998654433332211 0111122334455554432 Q ss_pred ------------------eeEEeechhhhhhhhhhheeeccccchhhHhhHHhhhhhhcCcccccceEEEEEecCC Q lcl|Aclame:pro 289 ------------------VVGLFNHRSAVGTVKLKDMALERARRPEFQADQIIGKYAMGHGGLRPEAAGALVFTPA 346 (347) Q Consensus 289 ------------------~~~l~~h~~A~~tv~~~~~~~e~~~~~~~~~d~i~~~~~~G~~~lRPe~~~~l~~~~a 346 (347) ..||++|+++.+.+..+.++.+..+...||+|.|+|+++||++++||++|+.|+.+-- T Consensus 306 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 381 (381) T protein:vir:80 306 TAADGGQTLGSFGGANRWATAVVCHPDWLAVGVQQNVKSESSRETMYLADAFVTSCVYGAKVFRPDHCVLLHTSGI 381 (381) T ss_pred eecCCCceeeeehhhhhhhhhcccccccccccceeEeecccchhheeehhhhhhhhhhccccccchhhhhhhhcCC Confidence 1267788887777888888888889999999999999999999999999999997777 No 20 >protein:vir:105822 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655767;genbank:gi:109522090;genbank:GeneID:4157630 Probab=100.00 E-value=3.4e-58 Score=335.65 Aligned_cols=266 Identities=19% Similarity=0.148 Sum_probs=222.6 Q ss_pred CCCCccCccccccCcccCccccHHHHHH-HHHhHHHHHHHHHHHhhhcccccc---cccCCceEEEeccccceeeeecC- Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFL-KVFGGEVLTAFVRRSVTMDKHMVR---TIQNGKSASFPVMGRTKGYYLAP- 75 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~i-e~f~geV~~~f~~~s~~~~~~~~r---ti~~G~tv~i~~iG~~t~~~~~~- 75 (347) ||+. .|+ |+|+++|++.|++.+++.++++.+ ++++|+|+|||++|++++.+|++ T Consensus 1 MA~~---------------------~~~pe~~~~~v~~~~~~~lv~~~l~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~ 59 (273) T protein:vir:10 1 MAFN---------------------NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAA 59 (273) T ss_pred Ccch---------------------hhhHHHHHHHHHHHHHhhhccchhhccccccccccCceEEEeecccccccccccC Confidence 7772 365 999999999999999999987653 57889999999999999998875 Q ss_pred CCCCCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccc Q lcl|Aclame:pro 76 GENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENI 155 (347) Q Consensus 76 g~~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~ 155 (347) |..+. .+++++++++++||+.+|+++.|+|+|+.|.++|+++ ++++++++||+++|+.++..++.++.. . T Consensus 60 ~~~~~--~~~~~~~~~~~tid~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~alA~~vD~~i~~~~~~a~~~-------~ 129 (273) T protein:vir:10 60 GRQTS--ADAISDTGVDLLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADMLVDNGTA-------L 129 (273) T ss_pred CCccC--ccccccceEEEEEeeeeecceEeecHHHhhhhccHHH-HHHHHHHHHHHHHHHHHHHHHhccccc-------c Confidence 44443 4678999999999999999999999999999999865 999999999999999999876532210 0 Q ss_pred CcccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhh-hhhccc-ccccc Q lcl|Aclame:pro 156 AGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPN-AANYAA-LIDPE 233 (347) Q Consensus 156 ~g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~-~~~~~~-~~~~~ 233 (347) ..+ .......+++.|++|+++||+++||.+|||+||+|++|+.|++++.++ +.++.+ +..++ T Consensus 130 ---------~~~-------~~~~~~~~~~~i~~a~~~ld~~~vP~~~R~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~ 193 (273) T protein:vir:10 130 ---------TGS-------APTDADDAFDLIAKALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLR 193 (273) T ss_pred ---------ccc-------cccchhHHHHHHHHHHHHhhhcCCCcCCCEEEECHHHHHHHhcchhhhhhhhcccccccee Confidence 000 011234468999999999999999999999999999999999988655 566654 45688 Q ss_pred ccceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeecccc Q lcl|Aclame:pro 234 TGNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERAR 313 (347) Q Consensus 234 ~G~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~ 313 (347) +|.|++++||+|++||++|.... .-+++|||+|++.++.+. ++|..| T Consensus 194 ~G~ig~i~G~~v~~s~~lp~~~~--------------------------------~~~~~~~~~A~~~a~q~~-~~e~~r 240 (273) T protein:vir:10 194 AGTIGNLLGARIVESNNLRDTDD--------------------------------EQFVAFHPSAAAYVSQID-TVEALR 240 (273) T ss_pred eeeeeEEeceEEEEecccccCCc--------------------------------cEEEEEeccceeeeeeee-hhhccc Confidence 99999999999999999994210 013789999999888654 899999 Q ss_pred chhhHhhHHhhhhhhcCcccccceEEEEEecCC Q lcl|Aclame:pro 314 RPEFQADQIIGKYAMGHGGLRPEAAGALVFTPA 346 (347) Q Consensus 314 ~~~~~~d~i~~~~~~G~~~lRPe~~~~l~~~~a 346 (347) ++++|+|+|+++++||++++|||++++|+.+.+ T Consensus 241 ~~~~~~~~v~~~~~yg~~v~~~~~~~~l~~~g~ 273 (273) T protein:vir:10 241 DQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred CCCcceeeeeeeeeeeeeEeccceEEEEeccCC Confidence 999999999999999999999999999998888 No 21 >protein:vir:102605 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_655002;genbank:gi:109392192;genbank:GeneID:4157227 Probab=100.00 E-value=3.4e-58 Score=335.65 Aligned_cols=266 Identities=19% Similarity=0.148 Sum_probs=222.6 Q ss_pred CCCCccCccccccCcccCccccHHHHHH-HHHhHHHHHHHHHHHhhhcccccc---cccCCceEEEeccccceeeeecC- Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFL-KVFGGEVLTAFVRRSVTMDKHMVR---TIQNGKSASFPVMGRTKGYYLAP- 75 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~i-e~f~geV~~~f~~~s~~~~~~~~r---ti~~G~tv~i~~iG~~t~~~~~~- 75 (347) ||+. .|+ |+|+++|++.|++.+++.++++.+ ++++|+|+|||++|++++.+|++ T Consensus 1 MA~~---------------------~~~pe~~~~~v~~~~~~~lv~~~l~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~ 59 (273) T protein:vir:10 1 MAFN---------------------NFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAA 59 (273) T ss_pred Ccch---------------------hhhHHHHHHHHHHHHHhhhccchhhccccccccccCceEEEeecccccccccccC Confidence 7772 365 999999999999999999987653 57889999999999999998875 Q ss_pred CCCCCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccc Q lcl|Aclame:pro 76 GENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENI 155 (347) Q Consensus 76 g~~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~ 155 (347) |..+. .+++++++++++||+.+|+++.|+|+|+.|.++|+++ ++++++++||+++|+.++..++.++.. . T Consensus 60 ~~~~~--~~~~~~~~~~~tid~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~alA~~vD~~i~~~~~~a~~~-------~ 129 (273) T protein:vir:10 60 GRQTS--ADAISDTGVDLLIDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADMLVDNGTA-------L 129 (273) T ss_pred CCccC--ccccccceEEEEEeeeeecceEeecHHHhhhhccHHH-HHHHHHHHHHHHHHHHHHHHHhccccc-------c Confidence 44443 4678999999999999999999999999999999865 999999999999999999876532210 0 Q ss_pred CcccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhh-hhhccc-ccccc Q lcl|Aclame:pro 156 AGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPN-AANYAA-LIDPE 233 (347) Q Consensus 156 ~g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~-~~~~~~-~~~~~ 233 (347) ..+ .......+++.|++|+++||+++||.+|||+||+|++|+.|++++.++ +.++.+ +..++ T Consensus 130 ---------~~~-------~~~~~~~~~~~i~~a~~~ld~~~vP~~~R~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~ 193 (273) T protein:vir:10 130 ---------TGS-------APTDADDAFDLIAKALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLR 193 (273) T ss_pred ---------ccc-------cccchhHHHHHHHHHHHHhhhcCCCcCCCEEEECHHHHHHHhcchhhhhhhhcccccccee Confidence 000 011234468999999999999999999999999999999999988655 566654 45688 Q ss_pred ccceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeecccc Q lcl|Aclame:pro 234 TGNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERAR 313 (347) Q Consensus 234 ~G~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~ 313 (347) +|.|++++||+|++||++|.... .-+++|||+|++.++.+. ++|..| T Consensus 194 ~G~ig~i~G~~v~~s~~lp~~~~--------------------------------~~~~~~~~~A~~~a~q~~-~~e~~r 240 (273) T protein:vir:10 194 AGTIGNLLGARIVESNNLRDTDD--------------------------------EQFVAFHPSAAAYVSQID-TVEALR 240 (273) T ss_pred eeeeeEEeceEEEEecccccCCc--------------------------------cEEEEEeccceeeeeeee-hhhccc Confidence 99999999999999999994210 013789999999888654 899999 Q ss_pred chhhHhhHHhhhhhhcCcccccceEEEEEecCC Q lcl|Aclame:pro 314 RPEFQADQIIGKYAMGHGGLRPEAAGALVFTPA 346 (347) Q Consensus 314 ~~~~~~d~i~~~~~~G~~~lRPe~~~~l~~~~a 346 (347) ++++|+|+|+++++||++++|||++++|+.+.+ T Consensus 241 ~~~~~~~~v~~~~~yg~~v~~~~~~~~l~~~g~ 273 (273) T protein:vir:10 241 DQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred CCCcceeeeeeeeeeeeeEeccceEEEEeccCC Confidence 999999999999999999999999999998888 No 22 >protein:vir:7990 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817344;genbank:gi:29565772;genbank:GeneID:1258978 Probab=100.00 E-value=2.7e-56 Score=325.24 Aligned_cols=266 Identities=18% Similarity=0.144 Sum_probs=220.8 Q ss_pred CCCCccCccccccCcccCccccHHHHHH-HHHhHHHHHHHHHHHhhhcccccc---cccCCceEEEeccccceeeeec-C Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFL-KVFGGEVLTAFVRRSVTMDKHMVR---TIQNGKSASFPVMGRTKGYYLA-P 75 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~i-e~f~geV~~~f~~~s~~~~~~~~r---ti~~G~tv~i~~iG~~t~~~~~-~ 75 (347) ||+. .|+ |+|+++|++.|++.+++.++++.. ....|+|||||++|.+++.+|+ + T Consensus 1 MA~~---------------------~~~pei~~~~v~~~~~~~lv~~~l~~~~~~~~~~~GdTv~ip~~~~~~~~d~~~~ 59 (273) T protein:vir:79 1 MAFN---------------------NFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAA 59 (273) T ss_pred Ccch---------------------hhhHHHHHHHHHHHHHhhccchhhhhccccccccCCcEEEEeecCcccccccccC Confidence 8772 365 999999999999999998887544 3457999999999999998776 4 Q ss_pred CCCCCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccc Q lcl|Aclame:pro 76 GENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENI 155 (347) Q Consensus 76 g~~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~ 155 (347) |..+. .+++++++++++||+.+++++.|+|+|+.|.++|++ +++++++++||+++|+.++..++.++.. . T Consensus 60 ~~~~~--~~~~~~~~~~~tid~~~~~~~~i~d~d~~~~~~~~~-~~~~~~~~ala~~vD~~i~~~~~~a~~~-------~ 129 (273) T protein:vir:79 60 GRQTS--ADAISDTGVDLLIDQEKSIDFLVDDIDRVQVAGSLE-AYTRAGATALATDTDKFIADMLVDNGTA-------L 129 (273) T ss_pred CCccC--ccccccceEEEEEeeecccceeeccHHHHhhcccHH-HHHHHHHHHHHHHHHHHHHHHHhhcccc-------c Confidence 55554 457899999999999999999999999999999997 5999999999999999998776442210 0 Q ss_pred CcccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchh-hhhhhcccc-cccc Q lcl|Aclame:pro 156 AGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALM-PNAANYAAL-IDPE 233 (347) Q Consensus 156 ~g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~-~~~~~~~~~-~~~~ 233 (347) ..++ ...+..+++.|++++++||+++||.+|||+||+|++|+.||+++. +.+.++.++ ..++ T Consensus 130 ---------~~~~-------~~~~~~~~~~i~~a~~~ld~~~vP~~~R~lvv~p~~~~~Ll~~~~~~~~~~~~~~~~~l~ 193 (273) T protein:vir:79 130 ---------TGSA-------PSDADDAFDLIASALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLR 193 (273) T ss_pred ---------cccc-------ccchhhHHHHHHHHHHHhhhccCCccCcEEEECHHHHHHHhhchhhhhhhhhccccccee Confidence 0000 111234578899999999999999999999999999999999875 556777654 5688 Q ss_pred ccceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeecccc Q lcl|Aclame:pro 234 TGNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERAR 313 (347) Q Consensus 234 ~G~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~ 313 (347) +|.|++++||+|++||++|.... ...+.+|++|++.++.+. ++|..| T Consensus 194 ~G~ig~~~G~~i~~s~~lp~~~~--------------------------------~~~~a~~~~A~~~a~~~~-~~e~~r 240 (273) T protein:vir:79 194 AGTIGNLLGARIVESNNLRDTDD--------------------------------EQFVAFHPSAAAYVSQID-TVEALR 240 (273) T ss_pred eeEeeEEeceEEEecccccccCc--------------------------------eEEEEEeccceeeeeehh-hhhccc Confidence 99999999999999999995210 012689999998887654 899999 Q ss_pred chhhHhhHHhhhhhhcCcccccceEEEEEecCC Q lcl|Aclame:pro 314 RPEFQADQIIGKYAMGHGGLRPEAAGALVFTPA 346 (347) Q Consensus 314 ~~~~~~d~i~~~~~~G~~~lRPe~~~~l~~~~a 346 (347) ++++|+|+|.++++||++++|||++++|..+.+ T Consensus 241 ~~~~~~~~v~~~~~yg~~v~~p~~vv~~~~~g~ 273 (273) T protein:vir:79 241 DQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred CcccceeeeeeeeeeeeEEecCceEEEEeccCC Confidence 999999999999999999999999999998888 No 23 >protein:vir:3136 Length: 322 # NCBI annotation: hypothetical protein # Family: family:all:11728 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640318;genbank:gi:21234405;genbank:GeneID:956058 Probab=100.00 E-value=5.9e-57 Score=328.86 Aligned_cols=305 Identities=12% Similarity=0.069 Sum_probs=223.2 Q ss_pred CCCCccCccccccCcccCccccHHHHHH-HHHhHHHHHHHHHHHhhhcccccccccCCceEEEeccccceeeeecCCCCC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFL-KVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVMGRTKGYYLAPGENL 79 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~i-e~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~iG~~t~~~~~~g~~~ 79 (347) ||--|+ .++..++|+ |+|+.+++..++++.+...+.+......|+|||||+||++++++|++++++ T Consensus 1 ~~~~n~-------------ts~~qafi~~EiWsa~il~~l~~~Lv~~~~~~~~d~g~GDtV~InsIg~~tV~dY~~~~~i 67 (322) T protein:vir:31 1 MSTGNN-------------TSNTQALIVSEIWADEIEDILHEKLLDVNIARVVDFPDGDKLTIPSVGTPVVRSRPEQGDF 67 (322) T ss_pred CCCCCC-------------cccceEEeehhhhHHHHHHHhhhhhhhhhhhcccccCCCCeEEeccccccccccccCCCCc Confidence 665442 333445663 999999999999999999988876777899999999999999999999988 Q ss_pred CCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccc--cccCc Q lcl|Aclame:pro 80 DDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASN--ENIAG 157 (347) Q Consensus 80 ~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~--~~~~g 157 (347) . .+++++++.+|+|||.|||+|.|+| |++|.++|++++++++++|+|++.+|+++...+..++....... ..+.+ T Consensus 68 ~--~d~ltt~~~~l~IDq~KYfaf~VdD-D~~Qa~~dl~~~~~~~aa~ala~~~D~fva~lL~~gA~~~~~~~~p~vin~ 144 (322) T protein:vir:31 68 T--FDNLDTGEISIILRDEVYAGNAISK-KLRQDSRWISNVGAMLPAEQARAIMERYQTDLLALGNAQFAGQNDPNVING 144 (322) T ss_pred c--cccCCCceEEEEEehhhhhccccch-hHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccCCcceecC Confidence 5 4679999999999999999999999 99999999999999999999999999999887776654322221 12222 Q ss_pred ccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHH---------Hhcchhhhhhhccc Q lcl|Aclame:pro 158 LGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSA---------ILSALMPNAANYAA 228 (347) Q Consensus 158 ~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~---------Ll~~~~~~~~~~~~ 228 (347) .+ ..+ ..++ +.....|+.|++++.+|||++||.+|||+||+|+++.. |++++||+..+-+| T Consensus 145 ~~--~~i-v~~g-------t~~~~ay~~lv~l~~kLdkanVP~~gR~vVV~P~~~~~L~~i~~~~~l~~D~rf~~i~~sG 214 (322) T protein:vir:31 145 VP--HRF-VGTG-------TDQTMDVTDFSRVNYVMTQSKMPMGGMIGIIDPSVAHHLETITNISNISNNPRWEGIVESG 214 (322) T ss_pred Cc--cce-eccC-------CCchhhHHHHHHHHHHhccccCCCCCeEEEeCchhhhhhhhhhhhhhhhcccccccccccc Confidence 22 121 1111 12223488999999999999999999999999999764 57788887644433 Q ss_pred ccccccc--ceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhh Q lcl|Aclame:pro 229 LIDPETG--NIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKD 306 (347) Q Consensus 229 ~~~~~~G--~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~ 306 (347) . ..| .|++++||+||+||++|..+ .+..+++...+. ...|.--|-+ +.=+.|+..+ +.-.+. T Consensus 215 ~---a~g~~~Vg~~~GF~V~~SN~l~~~~--~~i~aG~d~~~t---------~ag~~n~f~~-~~~~~~~~~~-~~~~~l 278 (322) T protein:vir:31 215 I---APDMQFVRSVYGIDLFVSNLLADAN--ETINAGGDARST---------TAGKCNMFMN-VSDMGLLPFV-VAWKEM 278 (322) T ss_pred c---hhhHHHHHHHhceeeeeeccccccc--cccccCcccccc---------cceeeccccc-ccchhhhhhh-hHhhhh Confidence 2 234 49999999999999998432 222222221111 0111110100 0011233344 344445 Q ss_pred eeeccccchhhHhhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 307 MALERARRPEFQADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 307 ~~~e~~~~~~~~~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) ++.|.+|++.+++|.++++++||+|++|||.++.|...++- T Consensus 279 ~~~e~~r~~~~~~d~~~~~~~~g~g~~r~e~l~~~~a~~~~ 319 (322) T protein:vir:31 279 PTTKSFIDDYNDDLNTATTARWGNGLVRDENLVCVLANADK 319 (322) T ss_pred hhhhcccCccccccceeeeeeecceeecccceEEEEecccc Confidence 58999999999999999999999999999999999655554 No 24 >protein:vir:102655 Length: 322 # NCBI annotation: Hypothetical protein # Family: family:all:6384 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_052979;genbank:gi:50282923;genbank:GeneID:2948122 Probab=100.00 E-value=1.1e-54 Score=316.31 Aligned_cols=311 Identities=13% Similarity=0.064 Sum_probs=227.7 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHH-HhhhcccccccccCC-c------eEEEeccccceeee Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRR-SVTMDKHMVRTIQNG-K------SASFPVMGRTKGYY 72 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~-s~~~~~~~~rti~~G-~------tv~i~~iG~~t~~~ 72 (347) |+=-+..+ |---=+.+..+.|+|+|..+|+..||.+ |+|++.++.++-.+| . ++.++.+|+..+.. T Consensus 1 ~~~~~~~~------~~~~Ms~~i~~~fv~qy~~~v~~~~qq~~s~L~~tV~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 74 (322) T protein:vir:10 1 MKLNAIMS------MLPLIAGDIDQAFVQTYETTLRILSQQKSAKLKQYCQHKNESSESHNWETLASMDPDAVKRKRSRQ 74 (322) T ss_pred Ccccceee------eeeeeechhhhHHHHHHHHHHHHHHHHhhhhhhcccccccccccccceeecccccccccccccccc Confidence 43211111 1001123567789999999999999986 899999998865433 3 34555556555554 Q ss_pred ecCCCCCCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccc Q lcl|Aclame:pro 73 LAPGENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASN 152 (347) Q Consensus 73 ~~~g~~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~ 152 (347) +.+.+.++.++++++++.+.+.++++ |+.++|||+|++|+++|++++|++++++||+|++|+.|+..+...+.. T Consensus 75 ~~~d~~~dtp~~~~~~~~r~~~~~d~-~~~~~VDd~D~~k~~~D~~~~~~~~~a~AL~R~~D~~I~~a~~g~a~~----- 148 (322) T protein:vir:10 75 QSADGTYPTPVNNKPFAKRRTNVDTY-DTGHVVEQEDISQMLLDPNSALITSQAYAMARKTDDLIIAGAWKPASI----- 148 (322) T ss_pred cccCcccCCCccccccceEEEeeccc-ccceecchHHHHHhhcCchHHHHHHHHHHhhhHHHHHHHhhhhccccc----- Confidence 44444445566778888888877776 788999999999999999999999999999999999988655433211 Q ss_pred cccCcccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCC-CEEEEChHHHHHHhcchhhhhhhcccccc Q lcl|Aclame:pro 153 ENIAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGD-RRFYCAPEDYSAILSALMPNAANYAALID 231 (347) Q Consensus 153 ~~~~g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~g-R~~vv~P~~~~~Ll~~~~~~~~~~~~~~~ 231 (347) +.++.. +...+... .+..+.-..+++|++|+++|+|++||+++ ||+||+|++|++||++++|++.||.+... T Consensus 149 ----~~~gt~-v~~~ss~~--i~~g~~g~t~~kl~~a~~~l~~~dvp~d~~R~~vv~p~~~~~LL~d~~~ts~D~~~~~~ 221 (322) T protein:vir:10 149 ----KGTGQP-VEFLATQE--IGDGTKPISFDYVTEITERFLENEIEPEVSKVIVIGPTQARKLLQITEATSADYTSAMD 221 (322) T ss_pred ----cccccc-cccCCCcc--cccCccchhHHHHHHHHHHHHhcCCCCCCCeEEEeCHHHHHHHhcchhhhhhhcccchh Confidence 111111 11111100 01111112367899999999999999775 99999999999999999999999998777 Q ss_pred c-cccceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeec Q lcl|Aclame:pro 232 P-ETGNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALE 310 (347) Q Consensus 232 ~-~~G~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e 310 (347) + .+|.|++++||+|++||+||..+......+ . ....+...+. +++||++|+++++.++++++ T Consensus 222 l~~~G~ig~~lGf~~i~s~~lp~~~~t~~~~~--~--------------~~~~~~~~~~-~~a~~k~Av~~a~~~dv~~~ 284 (322) T protein:vir:10 222 LQSKGIITNWMGYTWIVSTRLDKFDPTQWGMA--A--------------EDGPQGDEIW-CIAMTDMALGYHSCKDIWTK 284 (322) T ss_pred hhhcCeeeeeeeEEEEEeccCCcccccccccc--c--------------cCCCCcccee-EEEEecCceeEEEeeeeeEE Confidence 6 579999999999999999996543222111 1 1112223333 47999999999999999999 Q ss_pred cccchhh-HhhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 311 RARRPEF-QADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 311 ~~~~~~~-~~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) ..++|.+ ++|.|.++++||+++++|+++++|.+-.+= T Consensus 285 i~~~~~~~~a~~I~~~~~~Ga~ri~~~gVv~i~~~e~~ 322 (322) T protein:vir:10 285 VAEDPSASFAWRIYSAFTADCVRVEDEHIFKLRLKNSL 322 (322) T ss_pred eeccCCcchhhhhhhhhhhCceEeccCcEEEEEEeccC Confidence 9998865 699999999999999999999999986555 No 25 >protein:vir:1781 Length: 221 # NCBI annotation: minor capsid protein # Family: family:all:975 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570347;genbank:gi:18640506;genbank:GeneID:932719 Probab=100.00 E-value=7e-54 Score=311.99 Aligned_cols=217 Identities=24% Similarity=0.299 Sum_probs=166.5 Q ss_pred EeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccCcccCceeeeecccccccc Q lcl|Aclame:pro 95 IDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIAGLGQAVVLNIGAAADLVD 174 (347) Q Consensus 95 ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~g~~~~~~i~~~~~~~~~~ 174 (347) ||++++++|+|||+|++|+|||+|+|+++|+||+||+++|++|+++++++++...+....+ ++...++.++ T Consensus 1 iD~lL~a~~~VdDiD~aqa~~dvr~e~t~e~G~ALA~~~D~~i~~~~~~aA~~~~p~~~~~----~g~~~~~~a~----- 71 (221) T protein:vir:17 1 MDDLLVASQFVYDLDEILAQWNTRSEISKQIGEALAIHYDERIARVLASASIAAAPVTGQD----GGFSVNIGAG----- 71 (221) T ss_pred CCcchhHHHHHHhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCcccccc----cCcceecccc----- Confidence 9999999999999999999999999999999999999999999999999887554433222 2222222221 Q ss_pred hhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcc--hhhhhhhcccc-ccccccc-eEEEeceeEEEecc Q lcl|Aclame:pro 175 VEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSA--LMPNAANYAAL-IDPETGN-IRNVMGFEVIEVPH 250 (347) Q Consensus 175 ~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~--~~~~~~~~~~~-~~~~~G~-v~~i~G~~V~~sn~ 250 (347) .+.+++++++.|++|+++|||++||++|||+||+|++|+.||+. ++++|.++.++ +.+++|+ |++++||+||+||| T Consensus 72 ~t~~~~~l~dai~~a~~~LdekdVP~~gR~~vv~P~~y~~LL~~~d~~~~n~d~~~s~g~~~~g~~i~~v~G~~V~~Snn 151 (221) T protein:vir:17 72 NTNNAQAIVDGFFEAAAVLDERSAPMDGRVAVLSPRQYYSLISSVDTNILNREIGNTQGDMNTGKGLYVNAGIRIYKSNV 151 (221) T ss_pred ccCCHHHHHHHHHHHHHHHhhcCCCCCCCEEEeCcHHHHHHHHhcCcceeeeecccccccccccceeeeecCcEEEEecc Confidence 12456778999999999999999999999999999999999974 67889988764 5688884 99999999999999 Q ss_pred ccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeeccccchhhHhhHHhhhhhhcC Q lcl|Aclame:pro 251 LTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARRPEFQADQIIGKYAMGH 330 (347) Q Consensus 251 lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~~~~~~~d~i~~~~~~G~ 330 (347) +|....+.... .++.....++..++|++||+|++||+|||+|+||||++.+-.. .| ++..++ T Consensus 152 lP~~~gt~~~~------~ag~~~~~~~~~~~yr~~fs~~~glv~~~~Avgtvkl~~~~~~---~~-----~~~~~~---- 213 (221) T protein:vir:17 152 LASLYGTNLVT------DPGDATTSGENNGSYRPAITDRAGLVFHKEAADTVEVLLPPSR---PP-----LVISMF---- 213 (221) T ss_pred CCccccccccc------CCccccccccccccccccccceEEEEEcchheeeeeeecCCCC---Cc-----eeeeee---- Confidence 99754433221 2223334556678999999999999999999999999975322 11 111111 Q ss_pred cccccceE Q lcl|Aclame:pro 331 GGLRPEAA 338 (347) Q Consensus 331 ~~lRPe~~ 338 (347) .+.||+-- T Consensus 214 ~~~~~~~~ 221 (221) T protein:vir:17 214 SIRRPDRR 221 (221) T ss_pred eccCCCCC Confidence 23344433 No 26 >protein:vir:80930 Length: 278 # NCBI annotation: Cps # Family: family:all:522 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468392;genbank:gi:157324966;genbank:GeneID:5601363 Probab=100.00 E-value=2.1e-43 Score=254.61 Aligned_cols=271 Identities=15% Similarity=0.111 Sum_probs=219.8 Q ss_pred CCCCccCccccccCcccCccccHHHHHH-HHHhHHHHHHHHHHHhhhccccc-ccc--cCCceEEEeccccc-eeeeecC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFL-KVFGGEVLTAFVRRSVTMDKHMV-RTI--QNGKSASFPVMGRT-KGYYLAP 75 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~i-e~f~geV~~~f~~~s~~~~~~~~-rti--~~G~tv~i~~iG~~-t~~~~~~ 75 (347) |||+.. +.. | +|+ |+|+..|.+.|.+..++.++... +++ +.|++|+||+++.. .+++|.. T Consensus 1 Ma~~~T------~~~------~---~iiPev~s~~v~~~~~~~~v~~~~~~~~~~l~g~~G~tv~ip~~~~~g~a~~~~~ 65 (278) T protein:vir:80 1 MADLTT------KLA------N---LIDPEVMGPMISAKLPKAIKFGKIAPIDNSLEGQPGSEITVPKYKYIGDAQDVAE 65 (278) T ss_pred CCCcce------ehh------h---eecHHHHHHHHHHHHHHhhhhcccceecccccCCCCCEEEEeeeccCCcceeecC Confidence 998522 221 1 455 99999999999999898888654 344 46999999997754 4578888 Q ss_pred CCCCCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccc Q lcl|Aclame:pro 76 GENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENI 155 (347) Q Consensus 76 g~~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~ 155 (347) |+.++ .++++.++.+++|++.. ..|.|+|++..++..|++++++++++++|+++.|+.++..+..+... + T Consensus 66 g~~i~--~~~lt~~~~~~~i~~~~-~a~~v~D~~~~~~~~d~~~~~~~~~a~~~a~~~d~~l~~~l~~a~~~-------~ 135 (278) T protein:vir:80 66 GAAID--YSALETESVKHGIKKAG-KGVKLTDESVLSGYGDPVEEAQKQIRMAIASKVDNDILEEALTTTLE-------V 135 (278) T ss_pred CCcCc--ccccccceeeEeeehhh-ccccccHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHHhccccc-------c Confidence 99886 45799999999999975 58999999999999999999999999999999999998776432100 0 Q ss_pred CcccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcch--hhhhhhcccccccc Q lcl|Aclame:pro 156 AGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSAL--MPNAANYAALIDPE 233 (347) Q Consensus 156 ~g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~--~~~~~~~~~~~~~~ 233 (347) . .+... ......++.|.++..+|++.++|. .++++|+|++|+.|+++. +|+.....+++.++ T Consensus 136 ~-----------~~~t~----~~~~~~~~~~~da~~~l~~~~~~~-~~~ivv~p~~~~~L~k~~~~~~~~~~~~g~~~~~ 199 (278) T protein:vir:80 136 K-----------GAINI----GLIDKIENTFTDAPDAIEDESITT-TGVLFLNYKDTAKLREEAAGSWTKASQLGDDLLV 199 (278) T ss_pred c-----------ccccc----chhhhHHHHHHHHHHhhcccCCCc-ccEEEECHHHHHHHHhhhhhhcccccccccccee Confidence 0 00000 011223678899999999999996 667999999999999875 66666566667788 Q ss_pred ccceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeecccc Q lcl|Aclame:pro 234 TGNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERAR 313 (347) Q Consensus 234 ~G~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~ 313 (347) +|.|++++||+||+|+++|.+ .+.+||+.|++++..+++++|..| T Consensus 200 ~G~ig~~~G~~Vi~s~~~p~~-----------------------------------t~~l~~~gAi~~~~~~~~~vE~~R 244 (278) T protein:vir:80 200 KGAFGELLGWEIVRTKKLADG-----------------------------------NALAVKAGALKTFLKRNLLAESGR 244 (278) T ss_pred eccceeecceeEEEcCCCCcc-----------------------------------eEEEEeccceeeeecCCccccccc Confidence 999999999999999999832 136889999999999999999999 Q ss_pred chhhHhhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 314 RPEFQADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 314 ~~~~~~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) +++++.|.|.+++.||++++||++++.|++++.- T Consensus 245 d~~~~~d~i~~~~~yg~~v~~~~~~v~it~~a~~ 278 (278) T protein:vir:80 245 DMDHKLTKFNADQHYAVALVDETKAVKVVPVAGN 278 (278) T ss_pred chhhccceeeeeeEEEEEEEcCcceEEEeeccCC Confidence 9999999999999999999999999999877666 No 27 >protein:vir:97331 Length: 319 # NCBI annotation: ORF011 # Family: family:all:701 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240611;genbank:gi:66396278;genbank:GeneID:5133687 Probab=100.00 E-value=4.8e-42 Score=247.10 Aligned_cols=285 Identities=12% Similarity=0.024 Sum_probs=214.5 Q ss_pred CCCCccCccccccCcccCccccHHHH-HHHHHhHHHHHHHHHHHhhhcc-cc-cccccCCceEEEeccccceeeeecCCC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLAL-FLKVFGGEVLTAFVRRSVTMDK-HM-VRTIQNGKSASFPVMGRTKGYYLAPGE 77 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al-~ie~f~geV~~~f~~~s~~~~~-~~-~rti~~G~tv~i~~iG~~t~~~~~~g~ 77 (347) .-|+++--. ..-+|..+.+=++..+ +-|+|++.+++.|...+++... ++ .....+|++|+||+++.+.+++|+++. T Consensus 5 ~~~~~~~~~-~~~~~~~~~~~~~nt~~l~~k~~~~LD~~~~~~~~s~~~~~N~~~e~~gg~tVkIp~i~~~gl~DY~R~~ 83 (319) T protein:vir:97 5 IKNATGMLK-LNLQHFANKSVEPGQTLLKNKHVGILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTTELKDYKRNA 83 (319) T ss_pred cccccceeE-eehhhhhccCCCcchHHHHHHHHHHHHHHHHHhhhhhhcccCcceEeccCcEEEEeeecccccccccCCC Confidence 222221100 0112333333344434 4499999999988888777644 33 235568999999999999999999877 Q ss_pred CCCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcch--HHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccc Q lcl|Aclame:pro 78 NLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDV--RAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENI 155 (347) Q Consensus 78 ~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~--r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~ 155 (347) ... .+.++.+..+++||+.+|+.|.||++|..|++.++ ...+.+++.+.++..+|.+.+..++..+... T Consensus 84 g~~--~g~vt~~~~t~tidqdR~~~F~VD~~D~~Etn~~l~a~~i~~~~~~~~v~PEiDay~~skla~~a~~~------- 154 (319) T protein:vir:97 84 TNE--FDHPKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAKH------- 154 (319) T ss_pred Ccc--cCCcccceeEEEeecccccccccchhhHhhhhchhhHHHHHHHHHHHHhhhhhhHHHHHHHHhhcccc------- Confidence 553 45799999999999999999999999999999887 4456788999999999999887775432110 Q ss_pred CcccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhcccccccccc Q lcl|Aclame:pro 156 AGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALIDPETG 235 (347) Q Consensus 156 ~g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G 235 (347) . +.....+++|+.|+++.++|||++|| ++||++|+|++|.+|+++++|....-.++..+.+| T Consensus 155 --------~---------~~~~t~~n~y~~i~~a~~~Lde~~VP-~~Rvl~Vtp~~~~~L~~~~~f~~~~~~~~~~~~~g 216 (319) T protein:vir:97 155 --------L---------TVGTGSDAQYDAVLDVSVELDEIKAP-ENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKG 216 (319) T ss_pred --------c---------ccccCHHHHHHHHHHHHHHHHhcCCC-CCcEEEeCHHHHHHHHhhhhhhccccccccceeee Confidence 0 01122345799999999999999999 69999999999999999999987554555667899 Q ss_pred ceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeecccc-c Q lcl|Aclame:pro 236 NIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERAR-R 314 (347) Q Consensus 236 ~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~-~ 314 (347) .|+++.||+|+++|+.... +.-.+++||+|+..+...+ .++.+. . T Consensus 217 ~Vg~idG~~Vi~vps~~~k---------------------------------~in~i~~h~~A~~~~~k~~-~~~~~~p~ 262 (319) T protein:vir:97 217 VQGELDGFVIVKVPTKLLQ---------------------------------GLQAIAVVGEVLASPIQAD-LAKTNSNI 262 (319) T ss_pred eceeecCeEEEEecccccc---------------------------------cceEEEEcCCeeeeeeeee-eeeccCCC Confidence 9999999999997653210 0113899999997665554 567665 4 Q ss_pred hhhHhhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 315 PEFQADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 315 ~~~~~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) +.+++|+++++++||+.|+||+..+.+...+++ T Consensus 263 ~~~~a~~v~gr~y~d~~V~~~k~~~Iy~~~~~~ 295 (319) T protein:vir:97 263 PGMFGTLAEQLLYTGAFVPEHLQKYIFTIGGTE 295 (319) T ss_pred ccccceeeeeeeeeeeEEeccccceEEEeecCC Confidence 778999999999999999999988888755555 No 28 >protein:vir:94800 Length: 319 # NCBI annotation: ORF012 # Family: family:all:701 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240536;genbank:gi:66396203;genbank:GeneID:5133580 Probab=100.00 E-value=4.8e-42 Score=247.10 Aligned_cols=285 Identities=12% Similarity=0.024 Sum_probs=214.5 Q ss_pred CCCCccCccccccCcccCccccHHHH-HHHHHhHHHHHHHHHHHhhhcc-cc-cccccCCceEEEeccccceeeeecCCC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLAL-FLKVFGGEVLTAFVRRSVTMDK-HM-VRTIQNGKSASFPVMGRTKGYYLAPGE 77 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al-~ie~f~geV~~~f~~~s~~~~~-~~-~rti~~G~tv~i~~iG~~t~~~~~~g~ 77 (347) .-|+++--. ..-+|..+.+=++..+ +-|+|++.+++.|...+++... ++ .....+|++|+||+++.+.+++|+++. T Consensus 5 ~~~~~~~~~-~~~~~~~~~~~~~nt~~l~~k~~~~LD~~~~~~~~s~~~~~N~~~e~~gg~tVkIp~i~~~gl~DY~R~~ 83 (319) T protein:vir:94 5 IKNATGMLK-LNLQHFANKSVEPGQTLLKNKHVGILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTTELKDYKRNA 83 (319) T ss_pred cccccceeE-eehhhhhccCCCcchHHHHHHHHHHHHHHHHHhhhhhhcccCcceEeccCcEEEEeeecccccccccCCC Confidence 222221100 0112333333344434 4499999999988888777644 33 235568999999999999999999877 Q ss_pred CCCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcch--HHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccc Q lcl|Aclame:pro 78 NLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDV--RAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENI 155 (347) Q Consensus 78 ~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~--r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~ 155 (347) ... .+.++.+..+++||+.+|+.|.||++|..|++.++ ...+.+++.+.++..+|.+.+..++..+... T Consensus 84 g~~--~g~vt~~~~t~tidqdR~~~F~VD~~D~~Etn~~l~a~~i~~~~~~~~v~PEiDay~~skla~~a~~~------- 154 (319) T protein:vir:94 84 TNE--FDHPKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAKH------- 154 (319) T ss_pred Ccc--cCCcccceeEEEeecccccccccchhhHhhhhchhhHHHHHHHHHHHHhhhhhhHHHHHHHHhhcccc------- Confidence 553 45799999999999999999999999999999887 4456788999999999999887775432110 Q ss_pred CcccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhcccccccccc Q lcl|Aclame:pro 156 AGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALIDPETG 235 (347) Q Consensus 156 ~g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G 235 (347) . +.....+++|+.|+++.++|||++|| ++||++|+|++|.+|+++++|....-.++..+.+| T Consensus 155 --------~---------~~~~t~~n~y~~i~~a~~~Lde~~VP-~~Rvl~Vtp~~~~~L~~~~~f~~~~~~~~~~~~~g 216 (319) T protein:vir:94 155 --------L---------TVGTGSDAQYDAVLDVSVELDEIKAP-ENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKG 216 (319) T ss_pred --------c---------ccccCHHHHHHHHHHHHHHHHhcCCC-CCcEEEeCHHHHHHHHhhhhhhccccccccceeee Confidence 0 01122345799999999999999999 69999999999999999999987554555667899 Q ss_pred ceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeecccc-c Q lcl|Aclame:pro 236 NIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERAR-R 314 (347) Q Consensus 236 ~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~-~ 314 (347) .|+++.||+|+++|+.... +.-.+++||+|+..+...+ .++.+. . T Consensus 217 ~Vg~idG~~Vi~vps~~~k---------------------------------~in~i~~h~~A~~~~~k~~-~~~~~~p~ 262 (319) T protein:vir:94 217 VQGELDGFVIVKVPTKLLQ---------------------------------GLQAIAVVGEVLASPIQAD-LAKTNSNI 262 (319) T ss_pred eceeecCeEEEEecccccc---------------------------------cceEEEEcCCeeeeeeeee-eeeccCCC Confidence 9999999999997653210 0113899999997665554 567665 4 Q ss_pred hhhHhhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 315 PEFQADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 315 ~~~~~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) +.+++|+++++++||+.|+||+..+.+...+++ T Consensus 263 ~~~~a~~v~gr~y~d~~V~~~k~~~Iy~~~~~~ 295 (319) T protein:vir:94 263 PGMFGTLAEQLLYTGAFVPEHLQKYIFTIGGTE 295 (319) T ss_pred ccccceeeeeeeeeeeEEeccccceEEEeecCC Confidence 778999999999999999999988888755555 No 29 >protein:vir:107120 Length: 329 # NCBI annotation: conserved phage protein # Family: family:all:701 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950606;genbank:gi:119953686;genbank:GeneID:4643129 Probab=100.00 E-value=4e-42 Score=247.55 Aligned_cols=285 Identities=12% Similarity=0.012 Sum_probs=216.9 Q ss_pred CCCCccCccccccCcccCccccHHHH-HHHHHhHHHHHHHHHHHhhhcc-cc-cccccCCceEEEeccccceeeeecCCC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLAL-FLKVFGGEVLTAFVRRSVTMDK-HM-VRTIQNGKSASFPVMGRTKGYYLAPGE 77 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al-~ie~f~geV~~~f~~~s~~~~~-~~-~rti~~G~tv~i~~iG~~t~~~~~~g~ 77 (347) .-|+++ .--..-+|..+.+-.+..+ +-|+|.+.+++.|...++.... ++ .-...+|++|+||+++.+.+++|+++. T Consensus 16 ~~~~~~-~~~~~~~~~~~~~~~~nt~~l~~k~~~~LD~~~~~~~~s~~~~~N~~~e~~~g~tVkIp~i~~~gl~DY~R~~ 94 (329) T protein:vir:10 16 IKNATG-KLKLNLQHFANKSVEPGDTLLKNKHVGILEKVTAANSYSAPAVISNDAIFMQGRSFTVIKGDVTELKDYKRNA 94 (329) T ss_pred hhcccc-eeEEehhhhcCCccCCchhHHHHHHHHHHHHHHHhhceeeeeecccceeeccCcEEEEeeecccccccccCCC Confidence 233222 1111223444444444444 4499999999999998766544 33 224668999999999999999999877 Q ss_pred CCCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcch--HHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccc Q lcl|Aclame:pro 78 NLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDV--RAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENI 155 (347) Q Consensus 78 ~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~--r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~ 155 (347) ... .+.++.+..+++||+.+|+.|.||++|..|++.++ ...+.+.+.+.++.++|.+.+..++..+.. T Consensus 95 g~~--~g~vt~~~~t~tidqdR~~~F~VD~~D~dEtn~~l~a~~i~~~~~~~~v~pEiDay~~skla~~a~~-------- 164 (329) T protein:vir:10 95 TNE--FDHPQIQETTYFLDQEKYWGRFVDALDRRDTEGNIDINYVVAKQASEVVAPYLDNLRFATLARNKAK-------- 164 (329) T ss_pred Ccc--ccccccceeEEEeecccceeeecchhhHhhhhhhhhHHHHHHHHHHHHhhhHHHHHHHHHHHhhccc-------- Confidence 553 45789999999999999999999999999999876 455678899999999999998877543210 Q ss_pred CcccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhcccccccccc Q lcl|Aclame:pro 156 AGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALIDPETG 235 (347) Q Consensus 156 ~g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G 235 (347) .. ......+++|+.|+++.++|+|++|| ++||++|+|++|.+|+++++|+...........+| T Consensus 165 -------~~---------~~~~t~~nay~~i~~a~~~Lde~~vp-~~Rvl~VtP~~~~~Lk~~~~f~~~~~~~~~~~~~g 227 (329) T protein:vir:10 165 -------HL---------TVGSGADAQYDAVLDVSVELDEIGAG-ASRILFVTPKFYKGIKKFVIELPQGDNRQQVLGKG 227 (329) T ss_pred -------cc---------ccccCHHHHHHHHHHHHHHHHhcCCC-CCcEEEeCHHHHHHHHhhhhhhccccccccceeee Confidence 00 01122345799999999999999999 59999999999999999999886544445567899 Q ss_pred ceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeecccc-c Q lcl|Aclame:pro 236 NIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERAR-R 314 (347) Q Consensus 236 ~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~-~ 314 (347) .|+++.||+|+++|+.... +.-.+++||+|+......+ .+|.+. . T Consensus 228 ~Vg~idG~~Ii~vps~~~k---------------------------------~in~ii~~~~A~~~~~K~~-~~~~~~p~ 273 (329) T protein:vir:10 228 VQGELDGFTIVKVPSKMLQ---------------------------------GVEAMAVIGEVMASPIQAN-EAKLNSNV 273 (329) T ss_pred eeeeecCeEEEEecCCccc---------------------------------ceeEEEEcCCceeeeeeee-eeeeeCCC Confidence 9999999999998764310 0113899999997766665 677765 4 Q ss_pred hhhHhhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 315 PEFQADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 315 ~~~~~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) +.+++|+|.+++.||+.|+||++.+.+....+| T Consensus 274 ~~~~a~~v~gr~yyd~~V~~~k~~~I~~~~~~a 306 (329) T protein:vir:10 274 PGMFGTLAEQMLYTGAFVPEHLQKYIFTIGGKE 306 (329) T ss_pred CccchheeeeeeeeeeEEEccccCEEEEecccC Confidence 778999999999999999999988887765555 No 30 >protein:vir:108303 Length: 418 # NCBI annotation: hypothetical protein # Family: family:all:1412 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552282;genbank:gi:160700607;genbank:GeneID:5758819 Probab=100.00 E-value=1.9e-41 Score=243.84 Aligned_cols=298 Identities=16% Similarity=0.115 Sum_probs=207.1 Q ss_pred CCCCccCccccccCcccCccccHHHHH-HHHHhHHHHHHHHHHHhhhcccccc---cc-cCCceEEEeccccceeeeecC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALF-LKVFGGEVLTAFVRRSVTMDKHMVR---TI-QNGKSASFPVMGRTKGYYLAP 75 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~-ie~f~geV~~~f~~~s~~~~~~~~r---ti-~~G~tv~i~~iG~~t~~~~~~ 75 (347) |+-.++ + +. .|+|..++++.|+++.++.++++.. ++ +.|+|||||+.+..+++++. T Consensus 1 m~~~~N---------~---------~ltp~iia~~~l~~l~~~lV~~~lv~r~y~~e~~~~GDTV~I~vp~~~~v~dg~- 61 (418) T protein:vir:10 1 MAVQDN---------N---------LLTDDVIAKEALRLLKNNLVMAKCVYRNYEKTFGKVGDTIRLKLPYRVKSASGR- 61 (418) T ss_pred CCcccc---------c---------cccHHHHHHHHHHHHHHhccchhhhcCCCchHHhhCCCEEEEeeCCceeecccC- Confidence 554222 1 22 3799999999999999998887643 22 35999999999999988754 Q ss_pred CCCCCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccc Q lcl|Aclame:pro 76 GENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENI 155 (347) Q Consensus 76 g~~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~ 155 (347) .+. .+++...+++|+||+.+|++|.|+|.|++|...|+++++.++++++||+.+|+.++..+..+.+. T Consensus 62 --~~~--~~~~te~~v~l~id~~k~~~~~itD~e~a~~~~d~~~~~l~~A~~aLA~~vD~~ia~l~~~a~~~-------- 129 (418) T protein:vir:10 62 --TLV--KQPMVDQTIPFKIAYQEHVGLEYTVKDKTLDIMQFSERYLKSGMVQIANQIDRSLALTLKKAFHS-------- 129 (418) T ss_pred --Ccc--ccccccceEEEEEecccccceeechHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccc-------- Confidence 443 45789999999999999999999999999999999999999999999999999998765432211 Q ss_pred CcccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCC-CEEEEChHHHHHHhcchhhhhhhccccccccc Q lcl|Aclame:pro 156 AGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGD-RRFYCAPEDYSAILSALMPNAANYAALIDPET 234 (347) Q Consensus 156 ~g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~g-R~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~ 234 (347) .++.+.. +. -|+.|++++++|++++||.+| ||+||+|++|+.|+++.++..........+++ T Consensus 130 ----------~gt~gt~--~~-----~~~~i~~a~~~Ld~~~VP~~G~R~lVv~P~~~~~L~~~~~~~~~~~~~~~~lr~ 192 (418) T protein:vir:10 130 ----------SGTPGVR--PG-----AFIDFANAGAKQTTYAVPQDGMRHAVLDPFTCASLSDEVTKLFKESMVEQAYKM 192 (418) T ss_pred ----------cccCCcC--cc-----hHHHHHHHHHHHHhcCCCCCCceEEEeCHHHHHHHhhhccccccccccchhhhe Confidence 1111111 10 168899999999999999985 99999999999999988765444444556999 Q ss_pred cceEEEeceeEEEecccccccccccc---ccCcccccc-------------c----cc--ccc----------------- Q lcl|Aclame:pro 235 GNIRNVMGFEVIEVPHLTVGGAGDNN---PADGVAPTN-------------Q----KH--IFP----------------- 275 (347) Q Consensus 235 G~v~~i~G~~V~~sn~lp~~~~~~~~---~~~~~~~t~-------------~----~~--~~~----------------- 275 (347) |.|++++||+||+||++|....+... ...|...++ + +. .+. T Consensus 193 G~IG~i~GF~V~~S~nip~~tag~~~~t~~v~ga~~~~~~~~~~~~t~s~~g~l~~Gd~~ti~gv~~v~~~t~~~~~~~~ 272 (418) T protein:vir:10 193 GYRGNVAAYEVYESQNLPKHTVGDHGGTPLVNGTVVNGDTVGFDGGTASTTGFLKAGDVITFGGVFGVNPQNYETTGLLQ 272 (418) T ss_pred eeeeeeeceEEEEecCCCcccccccccceeeecccccceeEEEeecceeeccceeeccEEEECceeecccccccccccce Confidence 99999999999999999964433211 111110000 0 00 000 Q ss_pred ---c------ccccccccc---------------------------------------------ccceeEEeechhhhhh Q lcl|Aclame:pro 276 ---A------TATGDDRVA---------------------------------------------QNNVVGLFNHRSAVGT 301 (347) Q Consensus 276 ---a------~~~~~y~~d---------------------------------------------~~~~~~l~~h~~A~~t 301 (347) . ..++...+. -+-..-|+|||+|+.. T Consensus 273 ~f~V~~~~~~~~~~~~tv~i~p~~~~~~~~~~~~~~~~~~~~~~~~v~a~~a~~~~it~~~~a~~~~~~nl~f~~~a~~l 352 (418) T protein:vir:10 273 EFVVLEDVDTDAGGAGSIKISPSLNDGTATINNENGDPVSLTAYQNVTALPADNAPITVLGAANTTYEQNYLFHRDAIAL 352 (418) T ss_pred EEEEEeeccccccCcceeEeccccccccccccccccccccccCCCcccccccCcceeeeecccccceeeeeeeecceEEE Confidence 0 000000000 0112249999998743 Q ss_pred hhhh--------------------heeeccccchhhHhhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 302 VKLK--------------------DMALERARRPEFQADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 302 v~~~--------------------~~~~e~~~~~~~~~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) +-.. .+++-..||...+-+.++--..||.+.+|||.++.| ..++| T Consensus 353 ~~~~l~~p~g~~~~~~~~~~~~G~s~r~~~~~d~~~~~~~~r~d~l~g~~~~~p~~~~~~-~g~~~ 417 (418) T protein:vir:10 353 AMIDLELPQSAVIKSRAADPETGLSLTLTGAYDINEQSEIHRIDAVWGADMIYGELALRL-WGAAS 417 (418) T ss_pred EEeeccCCCCCCcceEEEeccCCeEEEEEEcccccccceEEEEEeecCceeecccceEEE-EeecC Confidence 3221 112223367777777777777999999999998666 45555 No 31 >protein:vir:96123 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240078;genbank:gi:66395742;genbank:GeneID:5133103 Probab=100.00 E-value=2.9e-41 Score=242.82 Aligned_cols=265 Identities=16% Similarity=0.180 Sum_probs=218.6 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccc-cc--cCCceEEEecccc-ceeeeecCC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVR-TI--QNGKSASFPVMGR-TKGYYLAPG 76 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~r-ti--~~G~tv~i~~iG~-~t~~~~~~g 76 (347) |||.+. +. +| -+.-|+|+..|.+.|.+..++.++.... ++ +.|++++||+.+. ..+++|..| T Consensus 1 ma~~~T------~~------~d--~i~Pev~s~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~ip~~~~~g~~~~~~~g 66 (274) T protein:vir:96 1 MAQGTT------KV------SN--LIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEG 66 (274) T ss_pred CCcccc------ch------hh--hhhhHHHHHHHHHHHHhhhhhcccccccccccCCCCCEEEEEeeccCCCccccCCC Confidence 998542 21 22 1344999999999999999999987664 33 4699999999874 367789999 Q ss_pred CCCCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccC Q lcl|Aclame:pro 77 ENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIA 156 (347) Q Consensus 77 ~~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~ 156 (347) +.++ .++++.++.+++|++. +..|.|+|++..++..|++++++++++++|++++|+.++..+..+.. T Consensus 67 ~~i~--~~~it~~~~~~~i~~~-~~~~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~~d~~i~~~l~~a~~---------- 133 (274) T protein:vir:96 67 EKIP--VDQIGTSKREAKVRKI-GKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATL---------- 133 (274) T ss_pred CcCc--hhhcccceeEEEEEee-eceeeecHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC---------- Confidence 9885 4579999999999985 88999999999999999999999999999999999999876532110 Q ss_pred cccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcch--hhhhhhccccccccc Q lcl|Aclame:pro 157 GLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSAL--MPNAANYAALIDPET 234 (347) Q Consensus 157 g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~--~~~~~~~~~~~~~~~ 234 (347) .. ..+. ..++.|++|..+|+++++ ++||++|+|++|..|+++. +|+.....+++.+++ T Consensus 134 --------~~--~~~~--------~~~d~i~dA~~~l~d~~~--~~~~ivv~p~~~~~L~k~~~~~f~~~~~~g~~~~~~ 193 (274) T protein:vir:96 134 --------TV--EADI--------TKLDGLQTAIDKFNDEDL--EPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVK 193 (274) T ss_pred --------Cc--Cccc--------ccHHHHHHHHHHhcccCC--CceEEEeCHHHHHHHHhcccccccccccccccceee Confidence 00 0000 026789999999999886 6899999999999999974 666655556677889 Q ss_pred cceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeeccccc Q lcl|Aclame:pro 235 GNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARR 314 (347) Q Consensus 235 G~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~~ 314 (347) |.|++++||+|++||++|.+. +.+||+.|++++..+++++|..|+ T Consensus 194 g~ig~~~G~~Vi~s~~~p~~t-----------------------------------~~l~~~gA~~~~~~~~~~vE~~Rd 238 (274) T protein:vir:96 194 GAFGEALGAVIVRSNKLNKGE-----------------------------------ALLAKKGAVKLITKRDFFLEKDRD 238 (274) T ss_pred cccceecCeeEEEcCCCCcce-----------------------------------EEEEeCcceeeeecCCcccccccc Confidence 999999999999999998421 368899999999999999999999 Q ss_pred hhhHhhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 315 PEFQADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 315 ~~~~~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) ++++.|.|.+++.||++++||++++.|..+.+= T Consensus 239 ~~~~~d~i~~~~~yg~~~~~~~~vv~~t~~~~~ 271 (274) T protein:vir:96 239 ASRKSTALYSDKHYVAYLYDESKVVKITKGAGD 271 (274) T ss_pred hhhcccEEEEeeEEEEEEEcCccEEEEEcCccc Confidence 999999999999999999999999999877766 No 32 >protein:vir:93742 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240459;genbank:gi:66396126;genbank:GeneID:5133511 Probab=100.00 E-value=3.4e-40 Score=236.99 Aligned_cols=265 Identities=17% Similarity=0.159 Sum_probs=218.5 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccc-cc--cCCceEEEeccccc-eeeeecCC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVR-TI--QNGKSASFPVMGRT-KGYYLAPG 76 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~r-ti--~~G~tv~i~~iG~~-t~~~~~~g 76 (347) |+|.. |+... -+.-|+|+..|.+.+.+..++.++.... ++ +.|++|+||+++.. .+++|..| T Consensus 1 ma~~~------T~~~~--------~iiPev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~ip~~~~~g~~~~~~eg 66 (274) T protein:vir:93 1 MPQGI------TKTSN--------QIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEG 66 (274) T ss_pred CCccc------eehhh--------eechHHHHHHHHHHHHhhhhhcccccccccccCCCCCEEEEEeeccCCCcccccCC Confidence 99832 32222 1344999999999999999999887654 34 35999999997653 66788889 Q ss_pred CCCCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccC Q lcl|Aclame:pro 77 ENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIA 156 (347) Q Consensus 77 ~~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~ 156 (347) +.++ .++++.++.+++|++. ++.|.|+|++..++..|++++++++++++|++++|+.++..+..+... T Consensus 67 ~~i~--~~~it~~~~~~~i~~~-~~~~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~~d~~~~~~~~~a~~~--------- 134 (274) T protein:vir:93 67 EKIP--TDILETKKREAKIRKI-AKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLT--------- 134 (274) T ss_pred Cccc--ccccccceeEEEeeee-cccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhccccc--------- Confidence 9886 4579999999999885 789999999999999999999999999999999999998765322100 Q ss_pred cccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcch--hhhhhhccccccccc Q lcl|Aclame:pro 157 GLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSAL--MPNAANYAALIDPET 234 (347) Q Consensus 157 g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~--~~~~~~~~~~~~~~~ 234 (347) .. +... .++.|++|..+|+++++ ++||++|+|++|+.|+++. +|+...-.+++.+.+ T Consensus 135 ---------~~--~~~~--------~~d~i~dA~~~l~d~~~--~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~ 193 (274) T protein:vir:93 135 ---------VN--ADIT--------KLNGLQSAIDKFNDEDL--EPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVK 193 (274) T ss_pred ---------cc--cccc--------CHHHHHHHHHHhhhccC--CccEEEeCHHHHHHHHhhhhhcccccccccccceee Confidence 00 0000 15778999999999876 6899999999999999985 566655555666889 Q ss_pred cceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeeccccc Q lcl|Aclame:pro 235 GNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARR 314 (347) Q Consensus 235 G~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~~ 314 (347) |.|+++.||+|++||++|.+ .+.+||+.|++++..+++.+|..|+ T Consensus 194 G~ig~~~G~~Vi~s~~~p~~-----------------------------------t~~l~~~gai~~~~~~~~~vE~~Rd 238 (274) T protein:vir:93 194 GAFGEALGAIIVRTNKLEAG-----------------------------------TAILAKKGAVKLILKRDFFLEVARD 238 (274) T ss_pred cccceecCeeEEEcCCCCcc-----------------------------------eEEEEeCCeEEEEecCCcccccccc Confidence 99999999999999999832 1368899999999999999999999 Q ss_pred hhhHhhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 315 PEFQADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 315 ~~~~~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) +.++.|.|++++.||++++||++++.+.++++. T Consensus 239 ~~~~~d~i~~~~~y~~~~~~~~~~v~~t~~~~s 271 (274) T protein:vir:93 239 ASTKTTALYSDKHYVAYLYDESKAVKITKGSGS 271 (274) T ss_pred hhhcccEEEEEEEEEEEEEcCCceEEEeeCccc Confidence 999999999999999999999999999988888 No 33 >protein:vir:96833 Length: 275 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240157;genbank:gi:66395822;genbank:GeneID:5133174 Probab=100.00 E-value=1.4e-39 Score=233.52 Aligned_cols=266 Identities=15% Similarity=0.160 Sum_probs=219.0 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccc-ccc--CCceEEEeccccc-eeeeecCC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVR-TIQ--NGKSASFPVMGRT-KGYYLAPG 76 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~r-ti~--~G~tv~i~~iG~~-t~~~~~~g 76 (347) ||++|. |+. +| -+.-|+|+..|.+.+.+..++.++..+. ++. .|++|+||..... .++++..| T Consensus 1 ~~~~~~-----T~l------~d--~i~PEv~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~g 67 (275) T protein:vir:96 1 MALENM-----TKL------AN--MVNPEVLAPMMQAELDKKLKFAQFADIDNTLVGQPGNTITFPAFVYSGDAKVVPEG 67 (275) T ss_pred CCCccc-----chh------hh--hhchHHHHHHHHHHHHHhhhhcccceecccccCCCCCEEEeeeeccCCccccccCC Confidence 777653 222 22 2445999999999999999999997654 343 5999999986653 55678889 Q ss_pred CCCCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccC Q lcl|Aclame:pro 77 ENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIA 156 (347) Q Consensus 77 ~~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~ 156 (347) +.++ .++++.++.+.+|.+ .++.|.|+|++..++..|++.+++++++++||+++|+.++..+..+... T Consensus 68 ~~i~--~~~lt~~~~~~~i~~-~~~~~~i~D~~~~~~~~d~~~~~~~~~a~~~a~~~d~~ll~~l~~a~~~--------- 135 (275) T protein:vir:96 68 EEIP--IDLIETKKRQATIRK-IGKGTVLTDEALLSGYGDPKGEAVRQHGLAIANKVDNDVLEALQGATLK--------- 135 (275) T ss_pred CCcc--hhhcccceeeEEeeh-hcccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhccccc--------- Confidence 9886 357999999999977 4999999999999999999999999999999999999988765321100 Q ss_pred cccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcch--hhhhhhccccccccc Q lcl|Aclame:pro 157 GLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSAL--MPNAANYAALIDPET 234 (347) Q Consensus 157 g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~--~~~~~~~~~~~~~~~ 234 (347) . .... . .++.|++|..+|.+.+. ++||++|+|++|..|+++. +|+..+..++..+.+ T Consensus 136 ---------~--~~~~----~----~~d~i~dA~~~lgd~~~--~~~~ivv~p~~~~~L~k~~~~~f~~~~~~g~~~~~~ 194 (275) T protein:vir:96 136 ---------V--EADI----T----KLAGLQTAIDKFNDEDL--EPMVLFVNPLDAGKLRASATDNFTRATLLGDNVIVK 194 (275) T ss_pred ---------c--cccc----c----CHHHHHHHHHHhccccC--CccEEEeCHHHHHHHHhcccccccccccccccceec Confidence 0 0000 0 16789999999988764 7899999999999998874 777777667777889 Q ss_pred cceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeeccccc Q lcl|Aclame:pro 235 GNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARR 314 (347) Q Consensus 235 G~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~~ 314 (347) |.|++++||+|++||++|.+ .+.+|++.|++++..+++++|..|+ T Consensus 195 G~ig~~~G~~Vi~s~~~p~~-----------------------------------t~~i~~~gA~~~~~~~~~~vE~~Rd 239 (275) T protein:vir:96 195 GAFGEALGAIIVRSNKIKEG-----------------------------------EAILAKRGAVKLITKRDFFLETERH 239 (275) T ss_pred cccceecCeeEEEeCCCCcc-----------------------------------eEEEEeccceeeeecCCcccccccc Confidence 99999999999999999842 1367899999999999999999999 Q ss_pred hhhHhhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 315 PEFQADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 315 ~~~~~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) +.++.|.|.+++.||++++||+.++.+.+.|+- T Consensus 240 ~~~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~ 272 (275) T protein:vir:96 240 ASHKSTALFSDKHYVAYLYDESKVVKITKSASG 272 (275) T ss_pred hhhcCcEEEEeEEEEEEEEcCccEEEEEecccc Confidence 999999999999999999999999999998888 No 34 >protein:vir:1239 Length: 274 # NCBI annotation: similar to phage B1 major head protein # Family: family:all:522 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510938;genbank:gi:17426272;genbank:GeneID:927376 Probab=100.00 E-value=1.9e-39 Score=232.87 Aligned_cols=265 Identities=17% Similarity=0.153 Sum_probs=218.1 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccc-cc--cCCceEEEeccccc-eeeeecCC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVR-TI--QNGKSASFPVMGRT-KGYYLAPG 76 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~r-ti--~~G~tv~i~~iG~~-t~~~~~~g 76 (347) |||... +. .+-+.-|+|+.+|.+.+.+..++.++.... ++ +.|++|+||..+.. .+++|..| T Consensus 1 ma~~~T------~l--------~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~ig~a~~~~~g 66 (274) T protein:vir:12 1 MAQGLT------KT--------SNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEG 66 (274) T ss_pred CCccee------eh--------hhhhchHHHHHHHHHHHHhhhhhcccceecccccCCCCCEEEEeeecCCCccccccCC Confidence 999532 22 122455999999999999888999887764 33 46999999986543 46688888 Q ss_pred CCCCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccC Q lcl|Aclame:pro 77 ENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIA 156 (347) Q Consensus 77 ~~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~ 156 (347) +.++ .++++.++.+++|++ .++.|.|+|++..++..|++++++++++++|++++|+.++..+..+... T Consensus 67 ~~i~--~~~lt~~~~~~~i~~-~~~~~~i~D~~~~~~~~d~~~~~~~q~~~~~a~~vd~~~l~~~~~a~~~--------- 134 (274) T protein:vir:12 67 EKIP--TDILETKKREAKIRK-IAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLT--------- 134 (274) T ss_pred Cccc--hhhcccceeeEEeee-ecceeeecHHHHHhcccchHHHHHHHHHHHHHHHHHHHHHHHHhccccc--------- Confidence 9885 457999999999999 5899999999999999999999999999999999999998765322100 Q ss_pred cccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcch--hhhhhhccccccccc Q lcl|Aclame:pro 157 GLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSAL--MPNAANYAALIDPET 234 (347) Q Consensus 157 g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~--~~~~~~~~~~~~~~~ 234 (347) ..+ .. . .++.|++|..+|++++. .+||++|+|++|+.|+++. +|+...-.+.+.+++ T Consensus 135 ---------~~~--~a----~----~~d~i~dA~~~lgd~~~--~~~~ivv~p~~~~~L~k~~~~~fv~~s~~g~~~~~~ 193 (274) T protein:vir:12 135 ---------VNA--DI----T----KLNGLQSAIDKFNDEDL--EPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVK 193 (274) T ss_pred ---------ccc--cc----c----CHHHHHHHHHHhccccc--cccEEEeCHHHHHHHHhhhhhhccccccccccceec Confidence 000 00 0 16788999999998874 7899999999999999985 677655455667889 Q ss_pred cceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeeccccc Q lcl|Aclame:pro 235 GNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARR 314 (347) Q Consensus 235 G~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~~ 314 (347) |.|+++.||+|++|+++|.+ .+.+|++-|++.+..+++++|..|| T Consensus 194 G~ig~~~G~~Vi~s~~~p~~-----------------------------------t~~l~~~gA~~~~~~~~~~vE~~Rd 238 (274) T protein:vir:12 194 GAFGEALGAIIVRSNKLEAG-----------------------------------TAILAKKGAVKLILKRDFFLEVARD 238 (274) T ss_pred ccceeecCeeEEEeCCCCcc-----------------------------------eEEEEeccceeeeecCCceeccccc Confidence 99999999999999999842 1368889999999999999999999 Q ss_pred hhhHhhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 315 PEFQADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 315 ~~~~~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) +.++.|.|.+++.||++++||+.++.++++.+. T Consensus 239 ~~~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~ 271 (274) T protein:vir:12 239 ASTKTTALYSDKHYVAYLYDESKAVKITKGSGS 271 (274) T ss_pred hhhcccEEEeeeEEEEEEEcCCceEEEEcCCcc Confidence 999999999999999999999999999988888 No 35 >protein:vir:94494 Length: 274 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240676;genbank:gi:66396348;genbank:GeneID:5133758 Probab=100.00 E-value=2.2e-39 Score=232.53 Aligned_cols=265 Identities=17% Similarity=0.165 Sum_probs=218.4 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccc-cc--cCCceEEEeccccc-eeeeecCC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVR-TI--QNGKSASFPVMGRT-KGYYLAPG 76 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~r-ti--~~G~tv~i~~iG~~-t~~~~~~g 76 (347) |+|.. |+.. | -+.-|+|+..|.+.+.+..++.++.... ++ +.|++|+||+++.. .+++|..| T Consensus 1 ma~~~------T~~~------d--~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~~g~a~~~~~g 66 (274) T protein:vir:94 1 MPQGL------TKTS------D--QIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEG 66 (274) T ss_pred CCccc------eehh------h--eechHHHHHHHHHhhhhhhhhcccceecccccCCCCCEEEEeeecCCCccccccCC Confidence 99932 2222 2 1444999999999999999999887664 34 45999999996643 56688889 Q ss_pred CCCCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccC Q lcl|Aclame:pro 77 ENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIA 156 (347) Q Consensus 77 ~~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~ 156 (347) +.++ .+.++.++.+++|++. .+.|.|+|++..++..|++++++++++++|++++|+.++..+..+... T Consensus 67 ~~i~--~~~lt~~~~~~~i~~~-~~~~~i~D~~~~~~~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a~~~--------- 134 (274) T protein:vir:94 67 EKIP--TDILETKKREAKIRKI-AKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLT--------- 134 (274) T ss_pred Cccc--ccccccceeEEEeeee-cceecccHHHHHhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccCcc--------- Confidence 9885 4579999999999995 689999999999999999999999999999999999998766332100 Q ss_pred cccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcch--hhhhhhccccccccc Q lcl|Aclame:pro 157 GLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSAL--MPNAANYAALIDPET 234 (347) Q Consensus 157 g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~--~~~~~~~~~~~~~~~ 234 (347) ..+ ... -++.|++|..+|++++. .+||++|+|++|..|+++. +|+...-.++..+.+ T Consensus 135 ---------~~~--~~~--------~~d~i~dA~~~l~d~~~--~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~ 193 (274) T protein:vir:94 135 ---------VNA--DIT--------KLNGLQSAIDKFNDEDL--EPMVLFVNPLDAGKLRGDASTNFTRATELGDDIIVK 193 (274) T ss_pred ---------ccc--ccc--------CHHHHHHHHHHhhccCC--CceEEEeCHHHHHHHHhhhhhhccccCcccccceec Confidence 000 000 16778999999999875 6799999999999999985 677665556666889 Q ss_pred cceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeeccccc Q lcl|Aclame:pro 235 GNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARR 314 (347) Q Consensus 235 G~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~~ 314 (347) |.|+++.||+|++||++|.+ .+.+|++.|++.++.+++.+|..|| T Consensus 194 G~ig~~~G~~Vi~s~~~p~~-----------------------------------t~~l~~~gA~~~~~~~~~~vE~~Rd 238 (274) T protein:vir:94 194 GAFGEALGAIIVRTNKLEAG-----------------------------------TAILAKKGAVKLILKRDFFLEVARD 238 (274) T ss_pred cccceecCeeEEEcCCCCcc-----------------------------------eEEEEeCcceEeeecCCceeccccc Confidence 99999999999999999832 1368899999999999999999999 Q ss_pred hhhHhhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 315 PEFQADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 315 ~~~~~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) +.++.|.|.+++.||+++++|+.++.+.++.+. T Consensus 239 ~~~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~ 271 (274) T protein:vir:94 239 ASTKTTALYSDKHYVAYLYDESKAVKITKGSGS 271 (274) T ss_pred hhhcccEEEEEEEEEEEEEcCCceEEEecCccc Confidence 999999999999999999999999999998888 No 36 >protein:vir:97433 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240749;genbank:gi:66396420;genbank:GeneID:5133789 Probab=100.00 E-value=2.2e-39 Score=232.53 Aligned_cols=265 Identities=17% Similarity=0.165 Sum_probs=218.4 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccc-cc--cCCceEEEeccccc-eeeeecCC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVR-TI--QNGKSASFPVMGRT-KGYYLAPG 76 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~r-ti--~~G~tv~i~~iG~~-t~~~~~~g 76 (347) |+|.. |+.. | -+.-|+|+..|.+.+.+..++.++.... ++ +.|++|+||+++.. .+++|..| T Consensus 1 ma~~~------T~~~------d--~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~~g~a~~~~~g 66 (274) T protein:vir:97 1 MPQGL------TKTS------D--QIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEG 66 (274) T ss_pred CCccc------eehh------h--eechHHHHHHHHHhhhhhhhhcccceecccccCCCCCEEEEeeecCCCccccccCC Confidence 99932 2222 2 1444999999999999999999887664 34 45999999996643 56688889 Q ss_pred CCCCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccC Q lcl|Aclame:pro 77 ENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIA 156 (347) Q Consensus 77 ~~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~ 156 (347) +.++ .+.++.++.+++|++. .+.|.|+|++..++..|++++++++++++|++++|+.++..+..+... T Consensus 67 ~~i~--~~~lt~~~~~~~i~~~-~~~~~i~D~~~~~~~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a~~~--------- 134 (274) T protein:vir:97 67 EKIP--TDILETKKREAKIRKI-AKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLT--------- 134 (274) T ss_pred Cccc--ccccccceeEEEeeee-cceecccHHHHHhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccCcc--------- Confidence 9885 4579999999999995 689999999999999999999999999999999999998766332100 Q ss_pred cccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcch--hhhhhhccccccccc Q lcl|Aclame:pro 157 GLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSAL--MPNAANYAALIDPET 234 (347) Q Consensus 157 g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~--~~~~~~~~~~~~~~~ 234 (347) ..+ ... -++.|++|..+|++++. .+||++|+|++|..|+++. +|+...-.++..+.+ T Consensus 135 ---------~~~--~~~--------~~d~i~dA~~~l~d~~~--~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~ 193 (274) T protein:vir:97 135 ---------VNA--DIT--------KLNGLQSAIDKFNDEDL--EPMVLFVNPLDAGKLRGDASTNFTRATELGDDIIVK 193 (274) T ss_pred ---------ccc--ccc--------CHHHHHHHHHHhhccCC--CceEEEeCHHHHHHHHhhhhhhccccCcccccceec Confidence 000 000 16778999999999875 6799999999999999985 677665556666889 Q ss_pred cceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeeccccc Q lcl|Aclame:pro 235 GNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARR 314 (347) Q Consensus 235 G~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~~ 314 (347) |.|+++.||+|++||++|.+ .+.+|++.|++.++.+++.+|..|| T Consensus 194 G~ig~~~G~~Vi~s~~~p~~-----------------------------------t~~l~~~gA~~~~~~~~~~vE~~Rd 238 (274) T protein:vir:97 194 GAFGEALGAIIVRTNKLEAG-----------------------------------TAILAKKGAVKLILKRDFFLEVARD 238 (274) T ss_pred cccceecCeeEEEcCCCCcc-----------------------------------eEEEEeCcceEeeecCCceeccccc Confidence 99999999999999999832 1368899999999999999999999 Q ss_pred hhhHhhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 315 PEFQADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 315 ~~~~~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) +.++.|.|.+++.||+++++|+.++.+.++.+. T Consensus 239 ~~~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~ 271 (274) T protein:vir:97 239 ASTKTTALYSDKHYVAYLYDESKAVKITKGSGS 271 (274) T ss_pred hhhcccEEEEEEEEEEEEEcCCceEEEecCccc Confidence 999999999999999999999999999998888 No 37 >protein:vir:3525 Length: 423 # NCBI annotation: major head protein # Family: family:all:1412 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050985;genbank:gi:9633571;genbank:GeneID:1262318 Probab=100.00 E-value=5.6e-39 Score=230.31 Aligned_cols=301 Identities=15% Similarity=0.111 Sum_probs=209.6 Q ss_pred CCCCccCccccccCcccCccccHHHHHH-HHHhHHHHHHHHHHHhhhcccccc---cc---cCCceEEEeccccceeeee Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFL-KVFGGEVLTAFVRRSVTMDKHMVR---TI---QNGKSASFPVMGRTKGYYL 73 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~i-e~f~geV~~~f~~~s~~~~~~~~r---ti---~~G~tv~i~~iG~~t~~~~ 73 (347) ||| +.. .|| |+|..+.++.|+++.++.++++.. ++ +.|+||+|++.+..++++| T Consensus 1 MAN-~ll------------------T~iP~iia~~al~~l~~~lV~~~lV~r~y~ge~~~a~~GDTV~I~~p~~~~v~d~ 61 (423) T protein:vir:35 1 MAN-NLE------------------SNISQIVLKKFLPGFMSDIVLCKTVDRQLLSGEINSNTGDSVSFKRPHQFKSERT 61 (423) T ss_pred Ccc-chh------------------hhhHHHHHHHHHHHHHhhcccchhcccCCCcccccccCCCEEEEeeCCcceeecc Confidence 887 321 364 999999999999999999987643 34 3599999999999999999 Q ss_pred cCCCCCCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccc Q lcl|Aclame:pro 74 APGENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNE 153 (347) Q Consensus 74 ~~g~~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~ 153 (347) .++.......+++...++.|+||+.+|++|.++|.|++|..-|+. .+.+.++++|++.+|+.++..+...+. . T Consensus 62 ~~~~~~~~~~~~~~e~~v~l~id~~k~~a~~v~d~e~~l~i~~~~-~~l~~a~~ala~~vd~~l~~~l~~~a~------~ 134 (423) T protein:vir:35 62 ETGDITGKDKNGLFSAKATGKVGKYITVAVEWTQIEEALKLNQLD-QILSPIHERMVTDLETELAHFMMNNGA------L 134 (423) T ss_pred cCcCCCCccccccccceeeEEeccceeccceeCHHHHHhhHHHHH-HHHHHHHHHHHHHHHHHHHHHHhhccc------c Confidence 765332223467888889999999999999999999999888884 577788899999999999876644321 0 Q ss_pred ccCcccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchh-hhhhhccccccc Q lcl|Aclame:pro 154 NIAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALM-PNAANYAALIDP 232 (347) Q Consensus 154 ~~~g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~-~~~~~~~~~~~~ 232 (347) .+++.+.. + ..|+.|++++++|++.+||+.|||+||+|++|..|+++++ |.+.+-.++..+ T Consensus 135 -----------~vgt~~t~--~-----~~~~~i~~a~~~Ld~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~~al 196 (423) T protein:vir:35 135 -----------SLGSPNTA--I-----KKWADVAQTASFIKDIGIKTGENYAIMDPWSAQRLADAQSGLHAADQLVRTAW 196 (423) T ss_pred -----------ccccccCC--c-----chHHHHHHHHHHHHHhcCCcCCCEEEeCHHHHHHHhccccceeccccchhHHH Confidence 01111111 0 0168899999999999999999999999999999998765 545554555668 Q ss_pred cccce-EEEeceeEEEecccccccccccccc----Ccc----------------------cccc-----cccc------- Q lcl|Aclame:pro 233 ETGNI-RNVMGFEVIEVPHLTVGGAGDNNPA----DGV----------------------APTN-----QKHI------- 273 (347) Q Consensus 233 ~~G~v-~~i~G~~V~~sn~lp~~~~~~~~~~----~~~----------------------~~t~-----~~~~------- 273 (347) ++|+| |+++||+||+||++|....+..... .+. ..++ .... T Consensus 197 r~g~i~G~i~GFdv~~Snnvp~~T~gt~~~~~~v~~a~~v~~~a~~~~~~~~~~~~~~~~~~~g~l~~GD~~t~aGv~~v 276 (423) T protein:vir:35 197 ENAQISGNFGGIRALMSNGLASRKQGDFDGAITVKTAPNVDYLSVKDSYQFTVALTGATPSKTGFLKAGDQLKFTSTHWL 276 (423) T ss_pred hhccceeeecceEEEEcCCCccccccccccceeeccccccccccccccccceeeeeeeeeccCCcEEecceEEeeeeeec Confidence 89876 9999999999999996433321100 000 0000 0000 Q ss_pred ------------------cccc------ccccccccc----------------------------------cceeEEeec Q lcl|Aclame:pro 274 ------------------FPAT------ATGDDRVAQ----------------------------------NNVVGLFNH 295 (347) Q Consensus 274 ------------------~~a~------~~~~y~~d~----------------------------------~~~~~l~~h 295 (347) |... .++.+.... .....|+|| T Consensus 277 ~~~t~~~~~~~~t~~~~~~~V~~~~~~~a~g~~~v~i~p~~~~~~~~~~~~~v~a~~a~~~~vt~~~~a~~~~~~nl~~~ 356 (423) T protein:vir:35 277 NQQSKQTLYNGSTAMSFTATVLEETNSTASGDVTVKLSGVPIYDEKNSQYNAVDAKVKAGDAVSIIGTAKQQMKPNLFYN 356 (423) T ss_pred cccccceeecccCCceeEEEEeccccccccCceeEEccccccccCCCcccccccccccCCceeeeeecCCCceeEEEeec Confidence 0000 001110000 011468999 Q ss_pred hhhhhhhh-----------------hhheeeccccchhhHhhHHhhhhhhcCcccccceEEEEEecC Q lcl|Aclame:pro 296 RSAVGTVK-----------------LKDMALERARRPEFQADQIIGKYAMGHGGLRPEAAGALVFTP 345 (347) Q Consensus 296 ~~A~~tv~-----------------~~~~~~e~~~~~~~~~d~i~~~~~~G~~~lRPe~~~~l~~~~ 345 (347) |+|+..+. ...+++-..||.+..-..++-=..||.+.+|||.++-+.-.| T Consensus 357 ~~a~~l~~~~l~~~~~~~~~~~~~~g~s~r~~~~~d~~~~~~~~r~d~l~g~~~~~p~~~~~~~g~~ 423 (423) T protein:vir:35 357 KFFCGLGTIPLPKLHSLDSAVATYEGFSIRVHKYADGDANKQMMRFDLLPAYVCFNPHMGGQFFGNP 423 (423) T ss_pred CceeEEEEEccccCCccceeeccccCceEEEEEeeccccCceEEEEEeecceeeecccceEEEEecC Confidence 99875432 233334445666655556666677999999999999998888 No 38 >protein:vir:95898 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240385;genbank:gi:66396054;genbank:GeneID:5133409 Probab=100.00 E-value=1.8e-39 Score=232.93 Aligned_cols=264 Identities=16% Similarity=0.166 Sum_probs=215.4 Q ss_pred CCCCccCccccccCcccCccccHHHHHH-HHHhHHHHHHHHHHHhhhccccc-cccc--CCceEEEeccccc-eeeeecC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFL-KVFGGEVLTAFVRRSVTMDKHMV-RTIQ--NGKSASFPVMGRT-KGYYLAP 75 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~i-e~f~geV~~~f~~~s~~~~~~~~-rti~--~G~tv~i~~iG~~-t~~~~~~ 75 (347) |+|.. |+.. | +++ |+|+.+|.+.+.+..++.++..+ +++. .|++|+||..... .+++|.. T Consensus 1 m~~~~------T~l~------d---~i~Pev~~~~v~~~~~~~l~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~ 65 (274) T protein:vir:95 1 MAQGM------TKLT------N---QIVPEVLAPMMQAELEKKLRFASFAEIDNTLVGQPGDTLTFPAFIYSGDAKVVAE 65 (274) T ss_pred CCcce------eehh------h---eechHHHHHHHHHHHHhhhhccccceecccccCCCCCEEEeeeecCCCccccccC Confidence 99843 2222 1 444 99999999999999999988654 3444 5999999986543 5567888 Q ss_pred CCCCCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccc Q lcl|Aclame:pro 76 GENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENI 155 (347) Q Consensus 76 g~~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~ 155 (347) |+.++ .+.++.++.+++|++. +..|.|+|++..++..|++++++++++++||+++|+.++..+.++... T Consensus 66 g~~i~--~~~lt~~~~~~~i~~~-~~a~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~~~-------- 134 (274) T protein:vir:95 66 GEKIP--TDILETKKREAKIRKI-AKGTSISDEALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSAKLT-------- 134 (274) T ss_pred CCccc--hhhcccceeEEEeeee-ecceeehHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccccc-------- Confidence 88885 4579999999999995 899999999999999999999999999999999999988765332100 Q ss_pred CcccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcch--hhhhhhcccccccc Q lcl|Aclame:pro 156 AGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSAL--MPNAANYAALIDPE 233 (347) Q Consensus 156 ~g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~--~~~~~~~~~~~~~~ 233 (347) + .+ .. ..++.|++|..+|++++. .+||++|+|++|+.|+++. +|+...-.+...+. T Consensus 135 --------~--~~--~~--------~~~d~i~~A~~~lgd~~~--~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~ 192 (274) T protein:vir:95 135 --------V--EA--DI--------TKLTGLQTAIDKFNDEDL--EPMVLFISPLDAGKLRGDATTNFTRATELGDDVIV 192 (274) T ss_pred --------c--cc--cc--------cCHHHHHHHHHHhccccc--cccEEEeCHHHHHHHHhhcccccccccccccccee Confidence 0 00 00 016778999999998874 7899999999999999985 66665444566788 Q ss_pred ccceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeecccc Q lcl|Aclame:pro 234 TGNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERAR 313 (347) Q Consensus 234 ~G~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~ 313 (347) +|.|++++||+||+||++|.+ .+.+|++.|++.+..+++++|..| T Consensus 193 ~G~ig~~~G~~Vi~s~~~~~~-----------------------------------t~~l~~~gA~~~~~~~~~~vE~~R 237 (274) T protein:vir:95 193 KGAFGEALGAVIVRSNKLEAG-----------------------------------TAILAKKGAVKLITKRDFFLETDR 237 (274) T ss_pred ccccceecCeEEEEeCCCCCc-----------------------------------eEEEEeccceeeeecCCccccccc Confidence 999999999999999999732 136888999999999999999999 Q ss_pred chhhHhhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 314 RPEFQADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 314 ~~~~~~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) |+.++.|.|.+++.||++++||++++.++++.=. T Consensus 238 d~~~~~d~i~~~~~y~~~~~~~~~~v~~tk~~~~ 271 (274) T protein:vir:95 238 DPSTKTTALYSDKHYVAYLYDESKAVKITKGSGS 271 (274) T ss_pred ccccccCEEEEeEEEEEEEEcCCcEEEEEcCCcc Confidence 9999999999999999999999999999866655 No 39 >protein:vir:96262 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240311;genbank:gi:66395978;genbank:GeneID:5133339 Probab=100.00 E-value=1.8e-39 Score=232.93 Aligned_cols=264 Identities=16% Similarity=0.166 Sum_probs=215.4 Q ss_pred CCCCccCccccccCcccCccccHHHHHH-HHHhHHHHHHHHHHHhhhccccc-cccc--CCceEEEeccccc-eeeeecC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFL-KVFGGEVLTAFVRRSVTMDKHMV-RTIQ--NGKSASFPVMGRT-KGYYLAP 75 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~i-e~f~geV~~~f~~~s~~~~~~~~-rti~--~G~tv~i~~iG~~-t~~~~~~ 75 (347) |+|.. |+.. | +++ |+|+.+|.+.+.+..++.++..+ +++. .|++|+||..... .+++|.. T Consensus 1 m~~~~------T~l~------d---~i~Pev~~~~v~~~~~~~l~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~ 65 (274) T protein:vir:96 1 MAQGM------TKLT------N---QIVPEVLAPMMQAELEKKLRFASFAEIDNTLVGQPGDTLTFPAFIYSGDAKVVAE 65 (274) T ss_pred CCcce------eehh------h---eechHHHHHHHHHHHHhhhhccccceecccccCCCCCEEEeeeecCCCccccccC Confidence 99843 2222 1 444 99999999999999999988654 3444 5999999986543 5567888 Q ss_pred CCCCCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccc Q lcl|Aclame:pro 76 GENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENI 155 (347) Q Consensus 76 g~~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~ 155 (347) |+.++ .+.++.++.+++|++. +..|.|+|++..++..|++++++++++++||+++|+.++..+.++... T Consensus 66 g~~i~--~~~lt~~~~~~~i~~~-~~a~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~~~-------- 134 (274) T protein:vir:96 66 GEKIP--TDILETKKREAKIRKI-AKGTSISDEALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSAKLT-------- 134 (274) T ss_pred CCccc--hhhcccceeEEEeeee-ecceeehHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccccc-------- Confidence 88885 4579999999999995 899999999999999999999999999999999999988765332100 Q ss_pred CcccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcch--hhhhhhcccccccc Q lcl|Aclame:pro 156 AGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSAL--MPNAANYAALIDPE 233 (347) Q Consensus 156 ~g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~--~~~~~~~~~~~~~~ 233 (347) + .+ .. ..++.|++|..+|++++. .+||++|+|++|+.|+++. +|+...-.+...+. T Consensus 135 --------~--~~--~~--------~~~d~i~~A~~~lgd~~~--~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~ 192 (274) T protein:vir:96 135 --------V--EA--DI--------TKLTGLQTAIDKFNDEDL--EPMVLFISPLDAGKLRGDATTNFTRATELGDDVIV 192 (274) T ss_pred --------c--cc--cc--------cCHHHHHHHHHHhccccc--cccEEEeCHHHHHHHHhhcccccccccccccccee Confidence 0 00 00 016778999999998874 7899999999999999985 66665444566788 Q ss_pred ccceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeecccc Q lcl|Aclame:pro 234 TGNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERAR 313 (347) Q Consensus 234 ~G~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~ 313 (347) +|.|++++||+||+||++|.+ .+.+|++.|++.+..+++++|..| T Consensus 193 ~G~ig~~~G~~Vi~s~~~~~~-----------------------------------t~~l~~~gA~~~~~~~~~~vE~~R 237 (274) T protein:vir:96 193 KGAFGEALGAVIVRSNKLEAG-----------------------------------TAILAKKGAVKLITKRDFFLETDR 237 (274) T ss_pred ccccceecCeEEEEeCCCCCc-----------------------------------eEEEEeccceeeeecCCccccccc Confidence 999999999999999999732 136888999999999999999999 Q ss_pred chhhHhhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 314 RPEFQADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 314 ~~~~~~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) |+.++.|.|.+++.||++++||++++.++++.=. T Consensus 238 d~~~~~d~i~~~~~y~~~~~~~~~~v~~tk~~~~ 271 (274) T protein:vir:96 238 DPSTKTTALYSDKHYVAYLYDESKAVKITKGSGS 271 (274) T ss_pred ccccccCEEEEeEEEEEEEEcCCcEEEEEcCCcc Confidence 9999999999999999999999999999866655 No 40 >protein:vir:99075 Length: 392 # NCBI annotation: gp30 # Family: family:all:10837 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655895;genbank:gi:109521467;genbank:GeneID:4158040 Probab=100.00 E-value=4.2e-39 Score=230.99 Aligned_cols=287 Identities=12% Similarity=0.073 Sum_probs=183.5 Q ss_pred CCCCccCccccccCcccCccccHHHHHH-HHHhHHHHHHHHHHHhhhcccccc---cc--cCCceEEEeccccceeeeec Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFL-KVFGGEVLTAFVRRSVTMDKHMVR---TI--QNGKSASFPVMGRTKGYYLA 74 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~i-e~f~geV~~~f~~~s~~~~~~~~r---ti--~~G~tv~i~~iG~~t~~~~~ 74 (347) |||. +|+ |+|+.+++..|++..++.++++.. ++ +.|++|||++.+..++.+|+ T Consensus 1 Ma~~---------------------~~~p~~~a~~~l~~l~~~lv~~~lv~~~~~~~~~~~~GdtV~i~~~~~~~~~~~~ 59 (392) T protein:vir:99 1 MANA---------------------FSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRK 59 (392) T ss_pred Cccc---------------------cccHHHHHHHHHHHHHhhccchhhhccccccccccCCCCeEEEeecccccceeee Confidence 8872 354 899999999999999998887643 45 45999999999999998886 Q ss_pred CCC---CCCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccc Q lcl|Aclame:pro 75 PGE---NLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAAS 151 (347) Q Consensus 75 ~g~---~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~ 151 (347) +.. ....+.+++.+++++++||+.+|++|.|+|.|+.|...|+++++.++++++||+.+|+.++..+..+.... T Consensus 60 ~~~~~~~~~~~~~~~~~~~~~~~id~~k~~~~~i~d~e~~~~~~~~~~~~~~~a~~ala~~vd~~i~~~~~~a~~~~--- 136 (392) T protein:vir:99 60 LRGAGAERNLTVSDFTEDSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEA--- 136 (392) T ss_pred ccccccCCcccccccccceEEEEEeeeeecceeechHHHhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccc--- Confidence 421 11223467899999999999999999999999999999999999999999999999999987664321100 Q ss_pred ccccCcccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhcccc-- Q lcl|Aclame:pro 152 NENIAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAAL-- 229 (347) Q Consensus 152 ~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~-- 229 (347) ...... ..+...|+.|++++++|+|++||. |||+||+|++|+.|+++++|.+.++.+. T Consensus 137 ---------------~~~~~~----~~~~~~~~~i~~a~~~L~~~~vP~-~R~~vv~p~~~~~l~~~~~~~~~~~~g~~~ 196 (392) T protein:vir:99 137 ---------------AGAVHE----VAPDEFFKGVNGARRALNELYIPQ-GRVLVVGTAVTEQILNDDRFIKYESQGQSA 196 (392) T ss_pred ---------------cccccc----cChhhhHHHHHHHHHHHhhcCCCC-CCEEEEcHHHHHHHhcccceeecccccchh Confidence 000111 112335788999999999999996 8999999999999999999998887654 Q ss_pred -ccccccceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhhee Q lcl|Aclame:pro 230 -IDPETGNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMA 308 (347) Q Consensus 230 -~~~~~G~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~ 308 (347) ..+++|.|++++||+||+|+++|...... .+..+............+.+..+.. +...++. .. T Consensus 197 ~~~l~~G~vg~i~G~~v~~s~~~~~~t~~a-~~~~a~~~at~a~v~~~~~~~~~s~--s~~~~v~-------------~~ 260 (392) T protein:vir:99 197 VSALQEARLGRIYGYEIVESTLIPHGDAYL-YHPTAFIMATRAPAPPMGAVRSTAI--SGDQRIA-------------MR 260 (392) T ss_pred hhhhhcceeeeeeeeEEEeeccccccccee-eeccccccccccccccccccceeEE--eccccee-------------cc Confidence 34789999999999999999999754321 1111111100000111111111100 0000000 00 Q ss_pred eccccchhhHhhHHhhhhhhcCcccccceEEEEEec-------------C---CC Q lcl|Aclame:pro 309 LERARRPEFQADQIIGKYAMGHGGLRPEAAGALVFT-------------P---AA 347 (347) Q Consensus 309 ~e~~~~~~~~~d~i~~~~~~G~~~lRPe~~~~l~~~-------------~---aa 347 (347) .-..|+.....+...-....|.+.+.-.+...+..+ + +. T Consensus 261 ~~~~~~~t~~s~~~~v~~~~g~~~v~~~~~~~~~~~~~~~~~~~~v~v~~v~~~~ 315 (392) T protein:vir:99 261 WLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKIHLIPGSIEVAPEAGAN 315 (392) T ss_pred eeecccceeeccccccceeEEEEEEeeccccceeeeeeeeeecceeeeeeeeccc Confidence 011122222222222222223322221111111000 0 00 No 41 >protein:vir:3613 Length: 272 # NCBI annotation: MHP # Family: family:all:522 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112699;genbank:gi:13786567;genbank:GeneID:921035 Probab=100.00 E-value=3.9e-39 Score=231.15 Aligned_cols=267 Identities=17% Similarity=0.144 Sum_probs=217.4 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccc-ccc--CCceEEEeccccc-eeeeecCC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVR-TIQ--NGKSASFPVMGRT-KGYYLAPG 76 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~r-ti~--~G~tv~i~~iG~~-t~~~~~~g 76 (347) |||.. |+. .+-+.-|+|+..|.+.|.+..++.++..+. ++. .|++|+||..+.. ..+++..| T Consensus 1 ma~~~------T~~--------~d~iiPev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~ti~iP~~~~~gda~~~~eg 66 (272) T protein:vir:36 1 MSKQK------TTL--------ADLVNPEVLAPIVSYELNKALRFAPLAQVDTTLQGQPGNTLKFPAFTYIGDAADVAEG 66 (272) T ss_pred CCCcc------eeh--------hhhhchHHHHHHHHHHHHhhhhhccccccccccccCCCCEEEEeeeccCccccccCCC Confidence 99832 221 222445999999999999999999887654 344 5999999997665 34568888 Q ss_pred CCCCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccC Q lcl|Aclame:pro 77 ENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIA 156 (347) Q Consensus 77 ~~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~ 156 (347) ..++ .+.++.++.+++|.+. ...|.|+|++..++..|++++++++++++|++++|+.++..+..+. T Consensus 67 ~~i~--~~~lt~~~~~~~i~~~-~k~~~vtD~~~~~~~~d~~~~~~~~~a~~~a~~~d~~i~~~l~~~~----------- 132 (272) T protein:vir:36 67 GEIS--LDKIGTTTKSVTIKKA-AKGTEITDEAALSGYGDPIGESNKQLGLSLANKVDDDLLSAAKTTS----------- 132 (272) T ss_pred CccC--hhhcCCcceeEeeehh-hccccccHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhcccc----------- Confidence 8886 4578999999999886 6789999999999999999999999999999999999886552211 Q ss_pred cccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhh-hcccccccccc Q lcl|Aclame:pro 157 GLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAA-NYAALIDPETG 235 (347) Q Consensus 157 g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~-~~~~~~~~~~G 235 (347) .+.. ....++.|++|..+|.+.++| .|+++|+|+.|+.|+++.++... ++.+...+.+| T Consensus 133 -------~~~~-----------~~~~~d~i~~A~~~lgd~~~~--~~~ivv~p~~~~~L~k~~~~~~~~~~~~~~~~~~G 192 (272) T protein:vir:36 133 -------QTVS-----------TKANVDGVQAALDIFNDEDAQ--AYVLIVNPKDAAKIRKDANAKNIGSEVGANALING 192 (272) T ss_pred -------cccc-----------ccccHHHHHHHHHHhhhcCCC--ceEEEEcHHHHHHHhcccccccccccccccceeee Confidence 0000 011267899999999999976 69999999999999999888765 45566678899 Q ss_pred ceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeeccccch Q lcl|Aclame:pro 236 NIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARRP 315 (347) Q Consensus 236 ~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~~~ 315 (347) .|++++|++|++|+++|.... + ...++|++.|++++..+++++|..|++ T Consensus 193 ~ig~~~G~~Vv~s~~~p~~~~-------------------------~------~~~~~~~~gA~~~~~~~~~~vE~~R~~ 241 (272) T protein:vir:36 193 TYADVLGAQIVRSKKLAEGSA-------------------------L------MFKIVSNSPALKLVLKRGVQVETDRDI 241 (272) T ss_pred ccceecCeeEEEeCCCCCCce-------------------------e------EEEEEecccceeeeecCCcccccccch Confidence 999999999999999994211 0 123678899999999999999999999 Q ss_pred hhHhhHHhhhhhhcCcccccceEEEEEecCC Q lcl|Aclame:pro 316 EFQADQIIGKYAMGHGGLRPEAAGALVFTPA 346 (347) Q Consensus 316 ~~~~d~i~~~~~~G~~~lRPe~~~~l~~~~a 346 (347) .++.|.|++++.||++++||++++.+.++-- T Consensus 242 ~~~~d~i~~~~~y~~~v~~~~~vv~~t~~g~ 272 (272) T protein:vir:36 242 VTKTTVITADEHYAAYLYDLTKVVNITFTGV 272 (272) T ss_pred hhcCcEEEEEEEEEEEEEcCccEEEEeecCC Confidence 9999999999999999999999999987777 No 42 >protein:vir:174 Length: 423 # NCBI annotation: capsid protein # Family: family:all:1412 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112079;genbank:gi:13559869;genbank:GeneID:920999 Probab=100.00 E-value=1.4e-38 Score=228.16 Aligned_cols=301 Identities=15% Similarity=0.115 Sum_probs=207.5 Q ss_pred CCCCccCccccccCcccCccccHHHHH-HHHHhHHHHHHHHHHHhhhcccccc---cc---cCCceEEEeccccceeeee Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALF-LKVFGGEVLTAFVRRSVTMDKHMVR---TI---QNGKSASFPVMGRTKGYYL 73 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~-ie~f~geV~~~f~~~s~~~~~~~~r---ti---~~G~tv~i~~iG~~t~~~~ 73 (347) |+| +.. .| .++|..++++.|+++.++.++++.+ .+ +.|+||+|++.+..++++| T Consensus 1 MaN-~ll------------------T~ip~iia~~al~~l~~~lV~~~lVnr~y~~e~~~~k~GDTV~I~~p~~~~~~~~ 61 (423) T protein:vir:17 1 MPN-NLD------------------SNVSQIVLKKFLPGFMSDLVLAKTVDRQLLAGEINSSTGDSVSFKRPHQFSSLRT 61 (423) T ss_pred Ccc-chh------------------hhhHHHHHHHHHHHHHhhcccchhhcccCCcchhhcccCCEEEEeeCCcceeecc Confidence 888 331 35 4999999999999999999887753 23 3699999999999999988 Q ss_pred cCCCCCCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccc Q lcl|Aclame:pro 74 APGENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNE 153 (347) Q Consensus 74 ~~g~~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~ 153 (347) +......-+.+++...++.|+||+.+|++|.++|.|+.+.--|+ +++.+.++++||+.+|+.++..+.+.+... T Consensus 62 ~~~~~~~~~~~~l~e~~v~l~id~~k~va~~v~d~E~~~~i~~~-~~~l~~A~~aLA~~vd~~ia~~~~~~a~~~----- 135 (423) T protein:vir:17 62 PTGDISGQNKNNLISGKATGRVGNYITVAVEYQQLEEAIKLNQL-EEILAPVRQRIVTDLETELAHFMMNNGALS----- 135 (423) T ss_pred cCcccCCcccCccccceeEEEeeceeeeeeeecHHHHhcChhHH-HHHHHHHHHHHHHHHHHHHHHHHhhccccc----- Confidence 64332222346788888999999999999999999999766666 889999999999999999987764432110 Q ss_pred ccCcccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhh-hhhccccccc Q lcl|Aclame:pro 154 NIAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPN-AANYAALIDP 232 (347) Q Consensus 154 ~~~g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~-~~~~~~~~~~ 232 (347) .++.+...+ -|+.+++++.+|++++||.+|||+||+|++|..||++++++ ..+-.++..+ T Consensus 136 ------------~gt~~t~~~-------a~~~i~~a~~~Ld~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~~al 196 (423) T protein:vir:17 136 ------------LGSPNTPIT-------KWSDVAQTASFLKDLGVNEGENYAVMDPWSAQRLADAQTGLHASDQLVRTAW 196 (423) T ss_pred ------------cccCCcccc-------cHHHHHHHHHHHHhccCCcCCCEEEeChHHHHHHhccccceecccccchHHH Confidence 011111111 16789999999999999999999999999999999877654 4444455668 Q ss_pred cccce-EEEeceeEEEecccccccccccccc---------Ccccc-----------------cc-----------c---- Q lcl|Aclame:pro 233 ETGNI-RNVMGFEVIEVPHLTVGGAGDNNPA---------DGVAP-----------------TN-----------Q---- 270 (347) Q Consensus 233 ~~G~v-~~i~G~~V~~sn~lp~~~~~~~~~~---------~~~~~-----------------t~-----------~---- 270 (347) ++|+| ++++||+||+||++|....+....+ .+.+. ++ + T Consensus 197 r~g~i~G~i~GFdvy~Snnip~~T~gt~~~t~~~~~~~~v~~~a~~~~~~~~~~~~~~~~~~~g~l~~GD~~t~aGv~~v 276 (423) T protein:vir:17 197 ENAQIPTNFGGIRALMSNGLASRTQGAFGGTLTVKTQPTVTYNAVKDSYQFTVTLTGATTSVTGFLKAGDQVKFTNTYWL 276 (423) T ss_pred hhccceeeecceEEEEeCCCccccccceeceeeecccccccccccccccceeeeeeeeeeeccCceeecceEEecceeee Confidence 99987 8999999999999996433332100 00000 00 0 Q ss_pred ----c-----------cccccc------cccccccc----------------------------------ccceeEEeec Q lcl|Aclame:pro 271 ----K-----------HIFPAT------ATGDDRVA----------------------------------QNNVVGLFNH 295 (347) Q Consensus 271 ----~-----------~~~~a~------~~~~y~~d----------------------------------~~~~~~l~~h 295 (347) + ..|... .++...+. -....-|+|| T Consensus 277 ~~~tk~v~~~~~t~~~~~~~v~~~~~~~a~~~~tv~i~p~~i~~~~~~~~~~v~a~~a~~~~vT~~~~a~~t~~~nl~~~ 356 (423) T protein:vir:17 277 QQQTKQALYNGATPISFTATVTADANSDSSGDVTVTLSGVPIYDTTNPQYNSVSRQVAAGDAVSVVGTASQTMKPNLFYN 356 (423) T ss_pred cccccccccccccccceEEEEEecccccccCceEEEecCccccccCCcccccceecccCCceeeccccccCCeeEEEEec Confidence 0 000000 00000000 0012348999 Q ss_pred hhhhhhhh-----------------hhheeeccccchhhHhhHHhhhhhhcCcccccceEEEEEecC Q lcl|Aclame:pro 296 RSAVGTVK-----------------LKDMALERARRPEFQADQIIGKYAMGHGGLRPEAAGALVFTP 345 (347) Q Consensus 296 ~~A~~tv~-----------------~~~~~~e~~~~~~~~~d~i~~~~~~G~~~lRPe~~~~l~~~~ 345 (347) |+|+..+. ...+++-..||.+..-..++-=..||.+.+|||.++-+.-.| T Consensus 357 ~~a~~l~~~pl~~~~~~~~~~~~~~g~s~r~~~~~d~~~~~~~~r~d~l~g~~~~~p~~~~~~~g~~ 423 (423) T protein:vir:17 357 KFFCGLGSIPLPKLHSIDSAVATYEGFSIRVHKYADGDANVQKMRFDLLPAYVCFNPHMGGQFFGNP 423 (423) T ss_pred CcceEEEEEcccCCCccceeecccCCcEEEEEEecccccceeEEEEEeecceeeeccceEEEEEecC Confidence 99885432 233333334555444445666667999999999999998888 No 43 >protein:vir:105374 Length: 423 # NCBI annotation: gene 5 protein # Family: family:all:1412 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958181;genbank:gi:41057283;genbank:GeneID:2716621 Probab=100.00 E-value=1.5e-38 Score=227.97 Aligned_cols=301 Identities=14% Similarity=0.106 Sum_probs=209.4 Q ss_pred CCCCccCccccccCcccCccccHHHHH-HHHHhHHHHHHHHHHHhhhcccccc---cc---cCCceEEEeccccceeeee Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALF-LKVFGGEVLTAFVRRSVTMDKHMVR---TI---QNGKSASFPVMGRTKGYYL 73 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~-ie~f~geV~~~f~~~s~~~~~~~~r---ti---~~G~tv~i~~iG~~t~~~~ 73 (347) |+| +.. .| .|+|..++++.|+++.++.++++.. .+ +.|+||+|++.+..++.+| T Consensus 1 MaN-~ll------------------T~~p~iia~~aL~~l~~~lV~~~lVnr~y~~ef~~~k~GDTV~I~~p~~~~~~d~ 61 (423) T protein:vir:10 1 MPN-NLD------------------SNVSQIVLKKFLPGFMSDLVLAKTVDRQLLAGEINSSTGDSVSFKRPHQFSSLRT 61 (423) T ss_pred Ccc-chh------------------hhhHHHHHHHHHHHHHhhcccchhhcccCCCcccccccCCEEEEeeCCceeeecc Confidence 887 321 34 4999999999999999998887753 24 3699999999999999999 Q ss_pred cCCCCCCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccc Q lcl|Aclame:pro 74 APGENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNE 153 (347) Q Consensus 74 ~~g~~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~ 153 (347) +++..-..+.+++...++.|+||+.+|++|.++|.|+.+.--|+ +.+.+++.++||+.+|+.++..+...+... T Consensus 62 ~~~~~~~~~~~dl~e~~v~l~id~~k~va~~v~d~E~~~~i~~~-~~~l~~A~~aLA~~vd~~ia~~~~~~~~~~----- 135 (423) T protein:vir:10 62 PTGDISGQNKNNLISGKATGRVGNYITVAVEYQQLEEAIKLNQL-EEILAPVRQRIVTDLETELAHFMMNNGALS----- 135 (423) T ss_pred CCccccccccCccccceeEEEeeceeeeeeeechHHHhcChhhH-HHHHHHHHHHHHHHHHHHHHHHHhhccccc----- Confidence 86432222346788999999999999999999999998665555 889999999999999999887654322100 Q ss_pred ccCcccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhh-hhhccccccc Q lcl|Aclame:pro 154 NIAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPN-AANYAALIDP 232 (347) Q Consensus 154 ~~~g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~-~~~~~~~~~~ 232 (347) .++.+...+ -|+.+++++.+|++++||.+|||+||+|++|..||++++++ ..+-.++..+ T Consensus 136 ------------~gt~~t~~~-------a~~~i~~a~~~Ld~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~~al 196 (423) T protein:vir:10 136 ------------LGSPNTPIT-------KWSDVAQTASFLKDLGVNEGENYAVMDPWSAQRLADAQTGLHASDQLVRTAW 196 (423) T ss_pred ------------cccCCcccc-------hHHHHHHHHHHHHhccCCcCCCEEEeChHHHHHHhccccceecccccchhhh Confidence 011111111 16789999999999999999999999999999999876644 4454556679 Q ss_pred cccce-EEEeceeEEEecccccccccccccc---------Cccc----------------c-cc---ccccc-ccc---- Q lcl|Aclame:pro 233 ETGNI-RNVMGFEVIEVPHLTVGGAGDNNPA---------DGVA----------------P-TN---QKHIF-PAT---- 277 (347) Q Consensus 233 ~~G~v-~~i~G~~V~~sn~lp~~~~~~~~~~---------~~~~----------------~-t~---~~~~~-~a~---- 277 (347) ++|+| ++++||+||+||++|....+....+ .+.+ . ++ -+..+ -+| T Consensus 197 r~g~i~G~i~GFdv~~Snnip~~T~gt~~~t~~~~~~~~v~~~a~~~a~~~~~~~~~~~~~~~~~l~~GD~~t~aGv~~v 276 (423) T protein:vir:10 197 ENAQIPTNFGGIRALMSNGLASRTQGAFGGTLTVKTQPTVTYNAVKDSYQFTVTLTGATASVTGFLKAGDQVKFTNTYWL 276 (423) T ss_pred hhccceeeecceEEEEeCCCccccccccccceeeeecceeccccccccceeeeeeeeccccccCceeecceEEecceeee Confidence 99987 8999999999999996433322110 0000 0 00 00000 000 Q ss_pred ----------------------------cccccccc----------------------------------ccceeEEeec Q lcl|Aclame:pro 278 ----------------------------ATGDDRVA----------------------------------QNNVVGLFNH 295 (347) Q Consensus 278 ----------------------------~~~~y~~d----------------------------------~~~~~~l~~h 295 (347) .++..... -...+-|+|| T Consensus 277 ~~~tk~~~~~~~t~~~~~~~v~a~~~~~~~g~~tv~i~p~~i~~~~~~~~~~v~a~~a~~~~vT~~~~a~~t~~~nl~~~ 356 (423) T protein:vir:10 277 QQQTKQALYNGATPISFTATVTADANSDSGGDVTVTLSGVPIYDTTNPQYNSVSRQVEAGDAVSVVGTASQTMKPNLFYN 356 (423) T ss_pred cccccccccccccCcceEEEEEeeeeeccCCceeeeccCccccccCCcccccccccccCCceeeccccccCCeeEEEEec Confidence 00000000 0112358999 Q ss_pred hhhhhhh-----------------hhhheeeccccchhhHhhHHhhhhhhcCcccccceEEEEEecC Q lcl|Aclame:pro 296 RSAVGTV-----------------KLKDMALERARRPEFQADQIIGKYAMGHGGLRPEAAGALVFTP 345 (347) Q Consensus 296 ~~A~~tv-----------------~~~~~~~e~~~~~~~~~d~i~~~~~~G~~~lRPe~~~~l~~~~ 345 (347) |+|+..+ +...+++-..||.+..-..++-=..||.+.+|||.++-+.-.| T Consensus 357 ~~a~~l~~~pl~~~~~~~~~~~~~~g~s~r~~~~~d~~~~~~~~r~d~l~g~~~~~p~~~~~~~g~~ 423 (423) T protein:vir:10 357 KFFCGLGSIPLPKLHSIDSAVATYEGFSIRVHKYADGDANVQKMRFDLLPAYVCFNPHMGGQFFGNP 423 (423) T ss_pred CcceEEEEEcccCCCccceeeccccCceEEEEEeeeccccceEEEEEeecceeeeccceEEEEEecC Confidence 9988543 3333444445666655556666677999999999999998888 No 44 >protein:vir:105522 Length: 423 # NCBI annotation: phage major head protein # Family: family:all:1412 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516191;genbank:gi:89885994;genbank:GeneID:3964382 Probab=100.00 E-value=2.9e-36 Score=215.39 Aligned_cols=301 Identities=13% Similarity=0.096 Sum_probs=204.8 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccc---cc---cCCceEEEeccccceeeeec Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVR---TI---QNGKSASFPVMGRTKGYYLA 74 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~r---ti---~~G~tv~i~~iG~~t~~~~~ 74 (347) ||| +. + ++-.++|..+.++.|+++.++.++++.. ++ +.|+||+|++.+..++.+.. T Consensus 1 MAN-sl-~----------------~l~p~iia~~al~~l~~~lV~~~lV~r~y~~ef~~ak~GDTV~I~~P~~~~~~d~~ 62 (423) T protein:vir:10 1 MAN-NL-D----------------ANVSQIVLKKFLPGFMSDLVLCKTVDRQLLAGEINSSTGDSVSFKRPHQFKSERTM 62 (423) T ss_pred Ccc-cc-c----------------cccHHHHHHHHHHHHHhhcccchhhccCCCccccccccCCEEEEeeCCceeeeccc Confidence 887 22 1 1345999999999999999998887753 23 25999999999999887644 Q ss_pred CCCCCCC-CCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccc Q lcl|Aclame:pro 75 PGENLDD-KRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNE 153 (347) Q Consensus 75 ~g~~~~~-~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~ 153 (347) . ..+.+ ..+++...++.++||+.+|++|.|+|.|+.+.--|+ +.+.+.+.++||+.+|+.++..+.+.+. . T Consensus 63 ~-~~~t~~~~~~l~e~~v~l~id~~k~~a~~v~d~E~~l~i~~~-~~~l~~A~~aLA~~vd~~ia~~~~~~~~------~ 134 (423) T protein:vir:10 63 D-GDITGKSKNSLISAKATGEVGNYITVAVEYRQIEEALKLNQL-DQILVPINERMVTDLETELALFMMKHGA------L 134 (423) T ss_pred C-cccCcccccccccceEEEEecceeeeeeeeChHHHhcChhHH-HHHHHHHHHHHHHHHHHHHHHHhhhccc------c Confidence 3 33322 335677778999999999999999999998655555 8899999999999999999766543221 0 Q ss_pred ccCcccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhh-hhhccccccc Q lcl|Aclame:pro 154 NIAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPN-AANYAALIDP 232 (347) Q Consensus 154 ~~~g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~-~~~~~~~~~~ 232 (347) . .++.+...+ -|+.+.+++++|++.+||..+||+||+|++|..|++++.+. ..+..++..+ T Consensus 135 ~-----------vgt~~t~~~-------a~~~~a~a~~~L~~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~~al 196 (423) T protein:vir:10 135 S-----------LGSPNTPIK-------KWSDVAQTASFLKDLGINSGENYAVMDPWAAQRLADAQSGLHVSEQLVRTAW 196 (423) T ss_pred c-----------ccccccccc-------cHHHHHHHHHHHhhccCCcCCCEEEeCHHHHHHHhhhhhhhccccccchHHH Confidence 0 111111111 16789999999999999999999999999999999876654 4455556678 Q ss_pred cccce-EEEeceeEEEecccccccccccc---cc------Ccccc---------------c-------------cc---- Q lcl|Aclame:pro 233 ETGNI-RNVMGFEVIEVPHLTVGGAGDNN---PA------DGVAP---------------T-------------NQ---- 270 (347) Q Consensus 233 ~~G~v-~~i~G~~V~~sn~lp~~~~~~~~---~~------~~~~~---------------t-------------~~---- 270 (347) ++|.| |+++||+||+||++|....+... +. .+.+. + ++ T Consensus 197 r~~~i~G~~~GFdi~~Sn~vp~~T~g~~~ga~~~~~~~~vt~a~~~~~~~~~~~~~~~T~s~~g~l~~GD~~t~aGv~~v 276 (423) T protein:vir:10 197 ENAQISGNFGGIRALMSNGLASRTQGAFGGKLTVKGTPEVNYDSVKDSYAFTATLTGATASKKGFLKVGDQLQFDDTHWL 276 (423) T ss_pred HhcccceeecceEEEEecCCcccccccccceeeeeeeeEEEecccccccccccceeeccceeceeEEecceEeecceeee Confidence 99976 99999999999999954322111 00 00000 0 00 Q ss_pred -------------c--cccccc------ccccccccc----------------------------------cceeEEeec Q lcl|Aclame:pro 271 -------------K--HIFPAT------ATGDDRVAQ----------------------------------NNVVGLFNH 295 (347) Q Consensus 271 -------------~--~~~~a~------~~~~y~~d~----------------------------------~~~~~l~~h 295 (347) + ..|... .++...+.. +...-|+|| T Consensus 277 ~~~tk~~l~~~~~~~~~~~~V~~~~~~~a~~~~tv~i~p~~~~~~~~~~~~~V~a~~a~~~~vT~~~~~~~t~~~nl~~~ 356 (423) T protein:vir:10 277 NQQSKQTLYNGASALSFTATVMEDANAHSSGDVTVKISGVPIFDAGYPQYNAVDRLLAEGDTVSVIGTSKQAMKPNLFYN 356 (423) T ss_pred cccccceeecccCCcceEEEEEecccccccCceEEEeccccccccCcccccceeccccCCceeEEeeccCCceeEEEEec Confidence 0 000000 011110000 012358999 Q ss_pred hhhhhhh-----------------hhhheeeccccchhhHhhHHhhhhhhcCcccccceEEEEEecC Q lcl|Aclame:pro 296 RSAVGTV-----------------KLKDMALERARRPEFQADQIIGKYAMGHGGLRPEAAGALVFTP 345 (347) Q Consensus 296 ~~A~~tv-----------------~~~~~~~e~~~~~~~~~d~i~~~~~~G~~~lRPe~~~~l~~~~ 345 (347) |+|+..+ +...+++-..||.+..-..++-=..||.+.+|||.++-+.-.| T Consensus 357 ~~a~~l~~~pl~~~~~~~~~~~~~~g~s~r~~~~~d~~~~~~~~r~d~l~g~~~~~p~~~~~~~g~~ 423 (423) T protein:vir:10 357 KLFCGLGTIPLPKLHSIDSAVATYEGFSIRVHKYADGDANKQMMRFDLLPAYVCYNPHMGGQFFGNP 423 (423) T ss_pred CcceEEEEEcccCCCccceeecccccceEEEEEeeeccccceEEEEEeecceeeeccceEEEEEecC Confidence 9987543 3333444445666655556666677999999999999998888 No 45 >protein:vir:79008 Length: 299 # NCBI annotation: putative main capsid protein # Family: family:all:701 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110725;genbank:gi:134287342;genbank:GeneID:4955182 Probab=100.00 E-value=1.5e-35 Score=211.50 Aligned_cols=284 Identities=13% Similarity=0.043 Sum_probs=188.9 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccc---cc--cCCceEEEeccccceeeeecC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVR---TI--QNGKSASFPVMGRTKGYYLAP 75 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~r---ti--~~G~tv~i~~iG~~t~~~~~~ 75 (347) ||.. -|.|+|+.++++.|...+++..+.+.. .+ .+||+|+||+++.+.+++|++ T Consensus 1 MA~~---------------------n~a~~~~~~Ld~~~~~~l~~~~L~~~~~~~~v~~~gg~tVkI~~i~~~gl~DY~R 59 (299) T protein:vir:79 1 MAAL---------------------NYAKEYSNVLAQAYPYTLNFGDLYATPNNGRYRWTGSKTIEIPTISTTGRVDSNR 59 (299) T ss_pred Cccc---------------------hhHHHHHHHHHHHHHhhceeeeeccCcccceeeecCCCEEEEecccccccccccc Confidence 6632 167999999999999999888765543 23 478999999999999999998 Q ss_pred CCCCCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcch--HHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccc Q lcl|Aclame:pro 76 GENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDV--RAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNE 153 (347) Q Consensus 76 g~~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~--r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~ 153 (347) ++....+ ..++.+..+++||+.+|+.|.||++|..|++..+ ...+.+.+.+.++.++|.+.+..|+..+... T Consensus 60 ~~~g~~~-g~~~~~~~t~~ldqdr~~~f~vD~~Dvdet~~~~~~a~v~~~~~~~~v~pEiDay~~skl~~~a~~~----- 133 (299) T protein:vir:79 60 DTIAVAQ-RNYDNAWEPKVLTNQRKWSTLVHPADINQTNYVASIGNITKVYNEEQKFPEMDAYCISKIYADWTAL----- 133 (299) T ss_pred CCCcccc-cccCcceeEEEeeccccceeccchhhHHHHhhhhHHHHHHHHHHHHHhhhHhhHHHHHHHHHhhhhc----- Confidence 6644322 3578899999999999999999977777766554 3445666778889999999888776443211 Q ss_pred ccCcccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhh-hccccccc Q lcl|Aclame:pro 154 NIAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAA-NYAALIDP 232 (347) Q Consensus 154 ~~~g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~-~~~~~~~~ 232 (347) +.... .....++++|+.|+++.++|+|++||.+|||++|+|++|.+|+++++|... +....... T Consensus 134 -------------g~~~~--~~~~T~~n~y~~i~~~~~~lde~~vP~~~rvl~vtp~~~~~L~~~~~f~k~~~~~~~~~~ 198 (299) T protein:vir:79 134 -------------GNTAD--TTVLTTTNVLEVFDKLMEKMTEARVPENGRILYVTPVVNTLIKNAKEIQRTVNIKDAGTS 198 (299) T ss_pred -------------CCccc--ccccCHHHHHHHHHHHHHHHHhcCCCCCCeEEEeCHHHHHHHhhchhhhcccccccccce Confidence 00000 111234567999999999999999999999999999999999999998754 34434467 Q ss_pred cccceEEEeceeEEE--eccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeec Q lcl|Aclame:pro 233 ETGNIRNVMGFEVIE--VPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALE 310 (347) Q Consensus 233 ~~G~v~~i~G~~V~~--sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e 310 (347) .+|.|+++.||+|++ |++++..-. ...|..+. .++ -+.=.++.||+|+..+.-.+ .+. T Consensus 199 ~~g~Vg~idG~~Ii~Vps~r~~t~~~----~~~G~~~~---------~~a------k~in~ii~~~~a~~~~~K~~-~~~ 258 (299) T protein:vir:79 199 LNRQTTDIDTVKIIKVPSNLMKTAYD----FTTGWKVG---------AGA------KQIFMSLVHPSAIITPVSYQ-FSK 258 (299) T ss_pred eeeeeeeecceEEEEechhhcCccce----eccCcccc---------Ccc------cccceEEEcCCeeeeeEeee-eEE Confidence 899999999999997 666763211 11111110 000 01114889999885433332 222 Q ss_pred cccchh--hHhhHH-hhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 311 RARRPE--FQADQI-IGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 311 ~~~~~~--~~~d~i-~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) .+.|. ..+|.. -.+..+..-++.....+......+| T Consensus 259 -~~~P~~~~~~~~~~~~r~y~d~~v~~nk~~~i~~~~~~a 297 (299) T protein:vir:79 259 -LDEPTAVTEGKYFYFEESFEDVFILNKKADAIQFVVEGA 297 (299) T ss_pred -eecCCCCCccceeeeeeeeeeeeeeccccCeEEEEeeec Confidence 34563 233333 2333333344433333433333333 No 46 >protein:vir:105334 Length: 276 # NCBI annotation: putative phage major capsid protein # Family: family:all:522 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950669;genbank:gi:119967839;genbank:GeneID:4643213 Probab=100.00 E-value=1.4e-35 Score=211.71 Aligned_cols=265 Identities=17% Similarity=0.174 Sum_probs=216.4 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccc-cc--cCCceEEEeccccc-eeeeecCC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVR-TI--QNGKSASFPVMGRT-KGYYLAPG 76 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~r-ti--~~G~tv~i~~iG~~-t~~~~~~g 76 (347) |||-+ |+ ..+-+.-|+|+..|.+.+.+..++.++..+. ++ ..|++++||..+.. .++++..| T Consensus 1 Ma~~~------T~--------l~d~i~Pev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~ti~iP~~~~igda~~~~eg 66 (276) T protein:vir:10 1 MAQGT------TT--------KSTQIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFVYSGDATVVPEG 66 (276) T ss_pred CCcce------ee--------hhhhhchHHHHHHHHHHHHhhhhhcccceecccccCCCCCEEEeeeecCCCccccccCC Confidence 98832 21 1222556999999999999999999998764 34 46999999987654 44568888 Q ss_pred CCCCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccC Q lcl|Aclame:pro 77 ENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIA 156 (347) Q Consensus 77 ~~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~ 156 (347) ..++ .+.++.++.+.+|.+ .+..|.++|++..++..|++++++++++++||+++|+.++..+..+.. T Consensus 67 ~~i~--~~~lt~~~~~a~i~~-~~k~~~~tD~a~~~~~~dp~~~~~~~~~~~~a~~~d~~~~~~l~~~~~---------- 133 (276) T protein:vir:10 67 QKIP--VDKIETNRREAKIHK-IGKGTDITDEALLSGYGDPQGEAVRQHGLAIANKVDNDVLEALRGTKL---------- 133 (276) T ss_pred CccC--ccccccceeeEEeeh-ccccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhcccc---------- Confidence 8885 457999999999976 599999999999999999999999999999999999998876532110 Q ss_pred cccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcc--hhhhhhhccccccccc Q lcl|Aclame:pro 157 GLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSA--LMPNAANYAALIDPET 234 (347) Q Consensus 157 g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~--~~~~~~~~~~~~~~~~ 234 (347) +..+ .. . .++.|++|..+|+++++ +.++++|.|++|..|+++ .+|+...-.+++.+.+ T Consensus 134 --------~~~~--~~----~----t~d~i~~A~~~lgd~~~--~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~ 193 (276) T protein:vir:10 134 --------TVSA--DI----G----TLAGLEAAIDTFDDEDL--EPMVLFINPKDAGKLRSSASDNFTRATELGDNIIVK 193 (276) T ss_pred --------cccc--cc----c----CHHHHHHHHHHhccccC--cccEEEEcHHHHHHHHHhccccccccccccccceec Confidence 0000 00 0 15778999999998875 779999999999999775 5777766566667889 Q ss_pred cceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeeccccc Q lcl|Aclame:pro 235 GNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARR 314 (347) Q Consensus 235 G~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~~ 314 (347) |.|+++.|++|++|+++|.+ .+++|++.|++.+..+++++|..|+ T Consensus 194 G~ig~~~G~~Vi~s~~~p~~-----------------------------------t~~l~~~gAi~~~~~~~~~vE~dRd 238 (276) T protein:vir:10 194 GAFGEALGAVIVRSKKLDEG-----------------------------------EAILAKRGAVKLITKRDFFLETDRD 238 (276) T ss_pred cccceecceeEEEcCCCCcc-----------------------------------eEEEEeccceeeeecCCceeecccc Confidence 99999999999999999832 1268899999999999999999999 Q ss_pred hhhHhhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 315 PEFQADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 315 ~~~~~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) +.++.|.|.+++.||+++++|+.++.+.++.-+ T Consensus 239 ~~~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~ 271 (276) T protein:vir:10 239 PSTKTTALYSDKHYVAYLYDESKAVKVTKGAGT 271 (276) T ss_pred hhhcccEEEEeeEEEEEEEcCcceEEEecCCcC Confidence 999999999999999999999999999987655 No 47 >protein:vir:9820 Length: 272 # NCBI annotation: putative major capsid/head protein # Family: family:all:522 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795582;genbank:gi:28876339;genbank:GeneID:1257858 Probab=100.00 E-value=1.2e-35 Score=212.11 Aligned_cols=264 Identities=18% Similarity=0.154 Sum_probs=213.8 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccc-cc--cCCceEEEecccc-ceeeeecCC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVR-TI--QNGKSASFPVMGR-TKGYYLAPG 76 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~r-ti--~~G~tv~i~~iG~-~t~~~~~~g 76 (347) ||+-+. +. +| -+.-|+|+..|.+.+.+.+++.++..+. ++ ..|++++||+.+. ..+..+..| T Consensus 1 MA~~~T------~~------~~--~~iPev~s~~v~~~~~~~~~~~~~~~~~~~~~g~~G~tv~iP~~~~~~~a~~v~eg 66 (272) T protein:vir:98 1 MAVGTT------KM------AQ--MLDPEVLADMIDAEVGKAIRFAPLAEVDTTLEGQPGTTLTVPKWDYIGDAEDVAEG 66 (272) T ss_pred CCCccc------cc------hh--eechHHHHHHHHHHHHHHhhhhccccccccccCCCCCEEEEEEecCCCCcccccCC Confidence 998432 21 22 2445999999999999999998887754 33 3699999999864 467778888 Q ss_pred CCCCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccC Q lcl|Aclame:pro 77 ENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIA 156 (347) Q Consensus 77 ~~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~ 156 (347) +.++. .+++.+++++++++. ...+.|+|.+..++..|+++++.+++++++++++|+.++..+.++.. T Consensus 67 ~~i~~--~~~~~~~~~~~~~~~-~~~~~itd~~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~a~~---------- 133 (272) T protein:vir:98 67 EAIPM--TQLGFKKTTMTIKKA-GKGVEITDEAILSGYGDPVGQAAKQIVEAIDHKVDADVLDALSKSTQ---------- 133 (272) T ss_pred Ccccc--cccccceEEEEeeee-eeeeeecHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHhccccc---------- Confidence 88864 568999999999985 67899999999999999999999999999999999998865422110 Q ss_pred cccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcch--hhhhhhccccccccc Q lcl|Aclame:pro 157 GLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSAL--MPNAANYAALIDPET 234 (347) Q Consensus 157 g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~--~~~~~~~~~~~~~~~ 234 (347) ..++. ..++.|++|..+|++.+ ...|+++|+|+.|..|+++. +++.....+.+.+.+ T Consensus 134 --------~~~~~-----------~t~d~i~da~~~l~~~~--~~~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~~ 192 (272) T protein:vir:98 134 --------TVEAT-----------ATVDGVSKALDIFNDED--DAETVIVMNPADASTLRLDAAKEWLGATEVGANRVVS 192 (272) T ss_pred --------ccccc-----------cCHHHHHHHHHHHhccC--CCccEEEEcHHHHHHHHHhcccccccccccccccccc Confidence 00000 01577889999998876 45799999999999999874 444444445566889 Q ss_pred cceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeeccccc Q lcl|Aclame:pro 235 GNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARR 314 (347) Q Consensus 235 G~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~~ 314 (347) |.+++++|++|++|+++|.+. .++|++.|++.+..+++++|..|+ T Consensus 193 g~ig~i~G~~Vi~s~~~p~~t-----------------------------------~~~~~~~a~~~~~~~~~~ve~~r~ 237 (272) T protein:vir:98 193 GVYGEVLGVQIVRSRKCPKGT-----------------------------------AYMVRKGALRIMLKRNTMVETDRD 237 (272) T ss_pred ccchhhcCeeEEEcCCCCcce-----------------------------------EEEEcCCeEEEEecCCceeeeccc Confidence 999999999999999998321 267899999999999999999999 Q ss_pred hhhHhhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 315 PEFQADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 315 ~~~~~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) +.++.|.|.+++.||.+++||++++.+.+++|+ T Consensus 238 ~~~~~~~i~~~~~~~~~v~~~~~vv~~t~~~a~ 270 (272) T protein:vir:98 238 ITKAINQIVANKHYGVYLYKAEKAVKITLKDAA 270 (272) T ss_pred cccceeEEEEEEEEEEEEEcCCceEEEEecccc Confidence 999999999999999999999999999999999 No 48 >protein:vir:3033 Length: 272 # NCBI annotation: major capsid protein # Family: family:all:522 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438146;genbank:gi:16271809;genbank:GeneID:929235 Probab=100.00 E-value=1.2e-35 Score=212.11 Aligned_cols=264 Identities=18% Similarity=0.154 Sum_probs=213.8 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccc-cc--cCCceEEEecccc-ceeeeecCC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVR-TI--QNGKSASFPVMGR-TKGYYLAPG 76 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~r-ti--~~G~tv~i~~iG~-~t~~~~~~g 76 (347) ||+-+. +. +| -+.-|+|+..|.+.+.+.+++.++..+. ++ ..|++++||+.+. ..+..+..| T Consensus 1 MA~~~T------~~------~~--~~iPev~s~~v~~~~~~~~~~~~~~~~~~~~~g~~G~tv~iP~~~~~~~a~~v~eg 66 (272) T protein:vir:30 1 MAVGTT------KM------AQ--MLDPEVLADMIDAEVGKAIRFAPLAEVDTTLEGQPGTTLTVPKWDYIGDAEDVAEG 66 (272) T ss_pred CCCccc------cc------hh--eechHHHHHHHHHHHHHHhhhhccccccccccCCCCCEEEEEEecCCCCcccccCC Confidence 998432 21 22 2445999999999999999998887754 33 3699999999864 467778888 Q ss_pred CCCCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccC Q lcl|Aclame:pro 77 ENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIA 156 (347) Q Consensus 77 ~~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~ 156 (347) +.++. .+++.+++++++++. ...+.|+|.+..++..|+++++.+++++++++++|+.++..+.++.. T Consensus 67 ~~i~~--~~~~~~~~~~~~~~~-~~~~~itd~~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~a~~---------- 133 (272) T protein:vir:30 67 EAIPM--TQLGFKKTTMTIKKA-GKGVEITDEAILSGYGDPVGQAAKQIVEAIDHKVDADVLDALSKSTQ---------- 133 (272) T ss_pred Ccccc--cccccceEEEEeeee-eeeeeecHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHhccccc---------- Confidence 88864 568999999999985 67899999999999999999999999999999999998865422110 Q ss_pred cccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcch--hhhhhhccccccccc Q lcl|Aclame:pro 157 GLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSAL--MPNAANYAALIDPET 234 (347) Q Consensus 157 g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~--~~~~~~~~~~~~~~~ 234 (347) ..++. ..++.|++|..+|++.+ ...|+++|+|+.|..|+++. +++.....+.+.+.+ T Consensus 134 --------~~~~~-----------~t~d~i~da~~~l~~~~--~~~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~~ 192 (272) T protein:vir:30 134 --------TVEAT-----------ATVDGVSKALDIFNDED--DAETVIVMNPADASTLRLDAAKEWLGATEVGANRVVS 192 (272) T ss_pred --------ccccc-----------cCHHHHHHHHHHHhccC--CCccEEEEcHHHHHHHHHhcccccccccccccccccc Confidence 00000 01577889999998876 45799999999999999874 444444445566889 Q ss_pred cceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeeccccc Q lcl|Aclame:pro 235 GNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARR 314 (347) Q Consensus 235 G~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~~ 314 (347) |.+++++|++|++|+++|.+. .++|++.|++.+..+++++|..|+ T Consensus 193 g~ig~i~G~~Vi~s~~~p~~t-----------------------------------~~~~~~~a~~~~~~~~~~ve~~r~ 237 (272) T protein:vir:30 193 GVYGEVLGVQIVRSRKCPKGT-----------------------------------AYMVRKGALRIMLKRNTMVETDRD 237 (272) T ss_pred ccchhhcCeeEEEcCCCCcce-----------------------------------EEEEcCCeEEEEecCCceeeeccc Confidence 999999999999999998321 267899999999999999999999 Q ss_pred hhhHhhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 315 PEFQADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 315 ~~~~~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) +.++.|.|.+++.||.+++||++++.+.+++|+ T Consensus 238 ~~~~~~~i~~~~~~~~~v~~~~~vv~~t~~~a~ 270 (272) T protein:vir:30 238 ITKAINQIVANKHYGVYLYKAEKAVKITLKDAA 270 (272) T ss_pred cccceeEEEEEEEEEEEEEcCCceEEEEecccc Confidence 999999999999999999999999999999999 No 49 >protein:vir:78920 Length: 290 # NCBI annotation: Cps # Family: family:all:701 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468846;genbank:gi:157325479;genbank:GeneID:5601917 Probab=99.96 E-value=1.2e-31 Score=190.05 Aligned_cols=281 Identities=12% Similarity=0.036 Sum_probs=198.1 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccc-cccCCceEEEeccccceeeeecCCCCC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVR-TIQNGKSASFPVMGRTKGYYLAPGENL 79 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~r-ti~~G~tv~i~~iG~~t~~~~~~g~~~ 79 (347) ||- + +.|+|++.+++.|...+++..+.+.. ...+||+|+||+|+.+.+++|++++.. T Consensus 1 Mai---------------------n-~a~~~~~~Ld~~~~~~~~t~~l~~~~~~~~ggktVkI~~i~~~gl~DY~R~~g~ 58 (290) T protein:vir:78 1 MAI---------------------N-YVDKYGKELDQKLVFGTYTNELETPNLLWLDAKTFKIQTITTTGLKAHTRNKGY 58 (290) T ss_pred Cch---------------------h-HHHHHHHHHHHHHHhhheeeeccccceeeccCCEEEEeeeccCcccccccCCCc Confidence 543 1 34899999999999998877775432 466899999999999999999987755 Q ss_pred CCCCCCCCCCceEEEEeeeeecchhhc--cHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccCc Q lcl|Aclame:pro 80 DDKRKDIKHSEKVIQIDGLLTSDVLIY--DIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIAG 157 (347) Q Consensus 80 ~~~~~~~~~~~~~l~ID~~~~~~~~Vd--d~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~g 157 (347) .. .+++.+..+++||+.+++.|.|| |+||.+....+.....+.+.+.++.++|.+.+..|+..+.... T Consensus 59 ~~--g~v~~~~et~tl~qdR~~~F~vD~~DvDEt~~~~~~~nv~~ef~~~~v~PEiDayr~skla~~a~~~~-------- 128 (290) T protein:vir:78 59 NE--GSASNTNKSYTIDFDRDVEFFVDVMDVDETGQALSAANVTKEFNSRHAGPEMDAYRFSKLATAAKTNS-------- 128 (290) T ss_pred cc--CccccceeeEEeeccccceeeccccchhHHhhhhhHHHHHHHHHHHHhhhhhhHHHHHHHHhhhhccC-------- Confidence 43 45788899999999999999999 9999999999999999999999999999998887765542110 Q ss_pred ccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhh-hccc-ccccccc Q lcl|Aclame:pro 158 LGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAA-NYAA-LIDPETG 235 (347) Q Consensus 158 ~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~-~~~~-~~~~~~G 235 (347) .. ++ .....+++++.|+++.++|+| ||.+|||++|+|+.|.+|+++++|... +-.. .....+| T Consensus 129 ----~~---~~------~t~t~~n~~~~i~~~~~~lde--vp~~~rvl~vtp~~~~lL~~~~~f~r~~~~~~~~~~~i~~ 193 (290) T protein:vir:78 129 ----NS---VA------EEITKDNVFTKLKAAIRKVKK--YGTQNLVMYVSPDVMAALELSDDFVRAINVQNIGPSSIET 193 (290) T ss_pred ----cc---cc------cccCHHHHHHHHHHHHHHHHh--cCCCCeEEEECHHHHHHHhhChhhhccccccccccccccc Confidence 00 00 012335679999999999987 899999999999999999999998752 2222 2345599 Q ss_pred ceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeeccccch Q lcl|Aclame:pro 236 NIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARRP 315 (347) Q Consensus 236 ~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~~~ 315 (347) +|+++.||+|++.+.--..........+....+..+ +.=.|+.||+|+....-.+ .+. .+.| T Consensus 194 ~V~~idG~~ii~vps~~r~~t~~~f~~G~~~~~~ak----------------~in~ii~~~~a~i~~~K~~-~~~-~~~P 255 (290) T protein:vir:78 194 RITAIDGTRIVEVEAEDRFYDTFDFTDGYKPAAGAK----------------KLNFLLVNKGSVVGGAKHA-SIY-LHAP 255 (290) T ss_pred eeeeecCcEEEEecccchhhhhhhhcccccccCCcc----------------ceeEEEEcCCceeeeeeee-EEE-eeCC Confidence 999999999999663211111111111111111111 1114899999874433332 222 2345 Q ss_pred hh----HhhHHhhhhhhcCcccccceEEEEEecCC Q lcl|Aclame:pro 316 EF----QADQIIGKYAMGHGGLRPEAAGALVFTPA 346 (347) Q Consensus 316 ~~----~~d~i~~~~~~G~~~lRPe~~~~l~~~~a 346 (347) .. -+|++..+..+..-++.....+.+..++- T Consensus 256 ~~~~~~d~~~~~~r~y~d~~v~~nk~~~i~~~~~~ 290 (290) T protein:vir:78 256 GSVGQGDGWLYQYRVYHDIFVLDQQKDGVIASTEV 290 (290) T ss_pred CCCcCcceeeeeeeeeeeeeeeccccCeeEEEeeC Confidence 43 35677777777777776666666655444 No 50 >protein:vir:105464 Length: 346 # NCBI annotation: putative phage major capsid protein # Family: family:all:701 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529874;genbank:gi:90592614;genbank:GeneID:3974528 Probab=99.94 E-value=3.2e-29 Score=176.80 Aligned_cols=284 Identities=9% Similarity=-0.001 Sum_probs=186.8 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhccc-c---cc--cccCCceEEEeccc-cceeeee Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKH-M---VR--TIQNGKSASFPVMG-RTKGYYL 73 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~-~---~r--ti~~G~tv~i~~iG-~~t~~~~ 73 (347) |+- -+.++|..++++.|...++..... + .. ...+|++|+||+|. .+.+++| T Consensus 1 Mai----------------------nya~~~~~~Ld~~~~~~~lts~~l~~~~~~~~v~~~ggktVkIp~is~tsGl~DY 58 (346) T protein:vir:10 1 MTI----------------------NYAEKYQAAVQQAFYDGHLYSAELWNSPSNSIIKFDGAKHIKVPRLEITSGRKDR 58 (346) T ss_pred Ccc----------------------hhHHHHHHHHHHHHHhhhccchhhcccccccceEecCCCEEEEEEeeeecccccc Confidence 443 145899999999998876653221 1 11 34689999999996 5568899 Q ss_pred cCCCCCCCCCCCCCCCceEEEEeeeeecchhhc--cHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccc Q lcl|Aclame:pro 74 APGENLDDKRKDIKHSEKVIQIDGLLTSDVLIY--DIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAAS 151 (347) Q Consensus 74 ~~g~~~~~~~~~~~~~~~~l~ID~~~~~~~~Vd--d~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~ 151 (347) +++.-. .....++.+..+++|++.+++.|.|| |+||.+........+.+.+....+.++|.+.+..|+..+...+. T Consensus 59 ~R~~g~-~~~g~v~~~~et~tl~qDR~~~F~vD~mDvDETn~~~~~anv~~ef~r~~vvPEiDayrfskLa~~a~~~~~- 136 (346) T protein:vir:10 59 QRRTIT-TPVANYSNDWDSYELKNERYWSTLVDPSDIDETNMVVSLANITKQFNLDSKMPEKDRYMFSHLYSGKEAAHD- 136 (346) T ss_pred cccCCc-ccccccccceeEEEeeccccceecccccchHHHHHHhHHHHHHHHHHHHhhcchhhHHHHHHHHHhhhhhcc- Confidence 764433 22246888999999999999999999 77787766666666667777778889999888777644321110 Q ss_pred ccccCcccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhcccccc Q lcl|Aclame:pro 152 NENIAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALID 231 (347) Q Consensus 152 ~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~ 231 (347) +... .......++|+.|+++.++|+|+.||.++||++|+|++|.+|.++++|...--.++.. T Consensus 137 ---------~~~~---------~~a~T~~ni~~~i~~~~~~lde~~vp~~~rvl~vTp~~~~lLk~s~~f~k~~~v~~~~ 198 (346) T protein:vir:10 137 ---------GGIT---------TNTLDEKNILPAFDNMMLDFDEARIPSTNRILYVTPKTNAILKRAEAMNRALTLKDPN 198 (346) T ss_pred ---------cccc---------ccccCHHHHHHHHHHHHHHHHHccCCCCCeEEEECHHHHHHHhhchhheecccccccc Confidence 0000 0112345679999999999999999999999999999999999888887433233444 Q ss_pred ccccceEEEeceeEEE--eccccccccccccccCcccc-ccccccccccccccccccccceeEEeechhhhhhhhhhhee Q lcl|Aclame:pro 232 PETGNIRNVMGFEVIE--VPHLTVGGAGDNNPADGVAP-TNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMA 308 (347) Q Consensus 232 ~~~G~v~~i~G~~V~~--sn~lp~~~~~~~~~~~~~~~-t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~ 308 (347) ..+|+|+++.||+|++ |++++..- . ...|... +.++ +.=.|+.||+|+....-.+ . T Consensus 199 ~i~~~V~siDGv~Ii~VPs~r~~t~~---~-f~~G~~~~t~ak----------------~INfiiv~~~A~ia~~K~~-~ 257 (346) T protein:vir:10 199 NIQRTVYSLDDVTIRVVPSDLMQTAY---D-FSDGSKIIDTAK----------------QIEMFLIYNGVQIAPEKYS-F 257 (346) T ss_pred ccceeeeeecCeEEEEcchhhcccch---h-hccCccccCCcc----------------ceeEEEECCceeeeeeeee-e Confidence 5699999999999987 56665211 1 1111111 1111 1113888999874333222 1 Q ss_pred ecccc-chhhHh-hHHhhhhhhcCcccccceEEE---EEecCCC Q lcl|Aclame:pro 309 LERAR-RPEFQA-DQIIGKYAMGHGGLRPEAAGA---LVFTPAA 347 (347) Q Consensus 309 ~e~~~-~~~~~~-d~i~~~~~~G~~~lRPe~~~~---l~~~~aa 347 (347) +..+- .+...+ |++..+..+..-|+.....+. +..+|+. T Consensus 258 ~~if~P~~~~~g~~l~~~R~Y~D~fv~~nk~~~Iyv~~~~a~~~ 301 (346) T protein:vir:10 258 VGFDQPSAATSGNYLYYEQSYDDVLLLNTKTKGIQFVVSDKPKK 301 (346) T ss_pred eEeeCCCCCcccceeeeeeeeeeeeeeccccceEEEeeeccccc Confidence 22221 223333 577777777777775555544 4444444 No 51 >protein:vir:102335 Length: 312 # NCBI annotation: putative capsid protein # Family: family:all:701 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529560;genbank:gi:90592716;genbank:GeneID:3974467 Probab=99.93 E-value=1.6e-28 Score=172.99 Aligned_cols=299 Identities=10% Similarity=-0.010 Sum_probs=194.3 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhccccc-c--cccCCceEEEeccccceeeeecCCC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMV-R--TIQNGKSASFPVMGRTKGYYLAPGE 77 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~-r--ti~~G~tv~i~~iG~~t~~~~~~g~ 77 (347) ||| +. -+.++|..++++.|...+++-.+... . .+.+||+|+||+|....+++|++++ T Consensus 1 Man-tl-------------------~ya~~~~~~LD~~~~~~~~s~~l~~~~~~v~~~ggktVkIp~i~~~gl~DY~R~~ 60 (312) T protein:vir:10 1 MAN-TL-------------------AYGQVLQQGLDKQATQELLTGWMDSNAKQIKYEGGKEVKIGKLSTDGLGDYSRGS 60 (312) T ss_pred CCc-ch-------------------hHHHHHHHHHHHHHHhhhccccccCCCceEEEecCcEEEEEeeeccccccccccc Confidence 776 21 26799999999999999877766422 2 3568999999999999999999765 Q ss_pred CCCCCCCCCCCCceEEEEeeeeecchhhc--cHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccc Q lcl|Aclame:pro 78 NLDDKRKDIKHSEKVIQIDGLLTSDVLIY--DIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENI 155 (347) Q Consensus 78 ~~~~~~~~~~~~~~~l~ID~~~~~~~~Vd--d~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~ 155 (347) ..-....+++.+..+++|++.+++.|.|| |+||.+....+...+.+.+.+....++|.+.+..|+..+...... T Consensus 61 g~~~~~g~v~~~~et~tl~qDR~~~F~vD~mDvDETn~~~s~anv~~ef~r~~vvPEiDayrfskla~~a~~~~~~---- 136 (312) T protein:vir:10 61 ANAYVGGDVKFEYETKTMTQDRGRKFTLDAMDVDETNFLVTATTVMGEFQRLKVIPEIDAYRLSRLATIAIGIKGD---- 136 (312) T ss_pred CCccccccccccceeEEeeecccceeeccccchhhHhhHHHHHHHHHHHHHhhhcchhhHHHHHHHHhhhhccccc---- Confidence 52212245888999999999999999999 999998888888888888999999999999888776544221100 Q ss_pred CcccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhcccccccccc Q lcl|Aclame:pro 156 AGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALIDPETG 235 (347) Q Consensus 156 ~g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G 235 (347) + .. ....+....++++.|.++.++|||+.|| ++|+++|+|+.+.+|.++..+.-..-.......+| T Consensus 137 -----~-~~-------~~~~~~T~~ni~~~i~~~~~~lde~~vp-~~rvl~vTp~~~~lLk~~~~~~~~~~~~~~~~i~~ 202 (312) T protein:vir:10 137 -----T-NV-------EYSYSVNSSTIINKIKTGIKIIRENGYN-GPLVCHLTYDSMFAIEEKVLEKLTAVTFAQGGIQT 202 (312) T ss_pred -----c-cc-------ccccccCHHHHHHHHHHHHHHHHHccCC-CceEEEeChHHHHHHhhhhhceecccccccceeee Confidence 0 00 0011124556899999999999999999 69999999999966665432221222223445699 Q ss_pred ceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeeccccch Q lcl|Aclame:pro 236 NIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARRP 315 (347) Q Consensus 236 ~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~~~ 315 (347) +|+++.|++|++.+.--.... .. ..+|.+.......+....++ -+.=.|+.||+|+....-.+ .+.. +.| T Consensus 203 ~V~~iDgv~Ii~VPs~r~~t~-~~-f~dG~t~~~~~gg~~~~~~a------k~INfiiv~~~a~i~~~K~~-~~~i-f~P 272 (312) T protein:vir:10 203 QVPSIDGCALIKTPQNRMYSS-IL-LNDGTTSNQTAGGYLKGTKA------LDTNFIIAPVDVPLAITKQD-KMRI-FDP 272 (312) T ss_pred eeeeecccEEEEchhhhccce-ee-eccCcccccccCceeecCcc------cccceEEeCCceeeceeeee-eeee-eCC Confidence 999999999998543211111 00 11111000000011111111 11124899999874333222 2222 233 Q ss_pred ----hhHhhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 316 ----EFQADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 316 ----~~~~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) ...+|++..+..+..-|+.....+....-..| T Consensus 273 ~~~~~~d~~~~~~R~Y~D~fv~~nk~~~Iyv~~k~a 308 (312) T protein:vir:10 273 ETNQTANAWSMDYRRYHDLWVTDNKANSVYANFKDA 308 (312) T ss_pred CCCCCcceeeeeeeeeeeeeeeccccCeEEEEeecc Confidence 22367888888878777766666654444444 No 52 >protein:vir:739 Length: 231 # NCBI annotation: major structural protein 4 # Family: family:all:522 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108716;genbank:gi:13487838;genbank:GeneID:920884 Probab=99.92 E-value=7.8e-29 Score=174.69 Aligned_cols=229 Identities=17% Similarity=0.164 Sum_probs=186.9 Q ss_pred cccccCCceEEEec-cccceeeeecCCCCCCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHH Q lcl|Aclame:pro 51 VRTIQNGKSASFPV-MGRTKGYYLAPGENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEAL 129 (347) Q Consensus 51 ~rti~~G~tv~i~~-iG~~t~~~~~~g~~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aL 129 (347) .--+..|+|++||. +| .++++..|+.++ .+.++.++.+.+|.+. ...|.|+|.+..+...|++.+.++|++.+| T Consensus 1 ~~~~~~Gdtit~P~~iG--da~~v~eG~~i~--~~~l~~t~~~atIk~~-gk~~~itD~a~l~~~gDp~~ea~~Q~~~~i 75 (231) T protein:vir:73 1 ENGINLANLCEYPNDIG--DAADVAEGGEIS--LDKIGTTTKSVTIKKA-AKGTEITDEAALSGYGDPIGESNKQLGLSL 75 (231) T ss_pred CccccCCceEEeccccc--chhhhcCCCcCC--hhhccccceeeeEeee-ccceeeeHHHHhhccCchHHHHHHHHHHHH Confidence 22366799999984 45 446888999987 3568999999999885 889999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHhhhcccccccccCcccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEECh Q lcl|Aclame:pro 130 AIAADGAVLAEMAKLCNLPAASNENIAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAP 209 (347) Q Consensus 130 a~~~D~~il~~l~~~a~~a~~~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P 209 (347) |+++|..++..+-++.. +. .+. ..++.|.+|..+|.+.+ ...++++|.| T Consensus 76 A~kvD~di~~~~~~a~l------------------~~--~~~---------~t~d~i~~A~~~fgde~--~~~~vivv~p 124 (231) T protein:vir:73 76 ANKVDDDLLKAAKTTSQ------------------TV--STK---------ANVDGVQAALDIFNDED--AQAYVLIVNP 124 (231) T ss_pred HHhhhHHHHHhhccccc------------------cc--ccc---------ccHHHHHHHHHHhcccc--ccceEEEEcc Confidence 99999998865532110 00 000 12678899999999887 4678999999 Q ss_pred HHHHHHhcchhhhhh-hccccccccccceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccc Q lcl|Aclame:pro 210 EDYSAILSALMPNAA-NYAALIDPETGNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNN 288 (347) Q Consensus 210 ~~~~~Ll~~~~~~~~-~~~~~~~~~~G~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~ 288 (347) ..|+.|.++.++... +..+++-+.+|.|+.+.|++|+.|+++|.... |. T Consensus 125 ~~~~~Lrk~~~~~~~~~~~g~~i~~~G~iG~i~G~~Vi~S~~~~~~~~-------------------------~~----- 174 (231) T protein:vir:73 125 KDAAKIRKDANAKNIGSEVGANALINGTYADVLGAQIVRSKKLAEGSA-------------------------LM----- 174 (231) T ss_pred hHHHhhhhccchhhhhhhhccceeeecccceEcceEEEEcCCCCCCce-------------------------ee----- Confidence 999999998877653 44566778899999999999999999984211 00 Q ss_pred eeEEeechhhhhhhhhhheeeccccchhhHhhHHhhhhhhcCcccccceEEEEEecCC Q lcl|Aclame:pro 289 VVGLFNHRSAVGTVKLKDMALERARRPEFQADQIIGKYAMGHGGLRPEAAGALVFTPA 346 (347) Q Consensus 289 ~~~l~~h~~A~~tv~~~~~~~e~~~~~~~~~d~i~~~~~~G~~~lRPe~~~~l~~~~a 346 (347) +-+++.|.|++.+..+++++|..||+.++.+.|.+.+.|++++.+|+.++.+.++-- T Consensus 175 -~~~i~~~gAl~~~~k~~~~vEtdRd~~~k~~~i~~~~~y~v~l~~~~~vv~~t~~g~ 231 (231) T protein:vir:73 175 -FKIVSNSPALKLVLKRGVQVETDRDIVTKTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) T ss_pred -eeEEeeccceeeeecccceeeccccccccccEEEEeEEEEEEEEcCccEEEEEeecC Confidence 113556889999999999999999999999999999999999999999999988777 No 53 >protein:vir:95107 Length: 270 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240822;genbank:gi:66394683;genbank:GeneID:5133901 Probab=99.92 E-value=9.9e-28 Score=168.63 Aligned_cols=262 Identities=16% Similarity=0.135 Sum_probs=205.0 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhccccccc-c--cCCceEEEeccccc-eeeeecCC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRT-I--QNGKSASFPVMGRT-KGYYLAPG 76 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rt-i--~~G~tv~i~~iG~~-t~~~~~~g 76 (347) ||-... + +-+.-|+|+..|.+.+.+..++.++..+.+ + +.|++|+||...-. .++++..| T Consensus 1 Ma~T~~--------------~--d~I~Pev~~~~V~e~~~~~~~~~~~~~~d~~L~g~~G~ti~~P~~~~igdae~~~eg 64 (270) T protein:vir:95 1 MTQTKK--------------A--NLINPEVLANVVSAQMQNAIRFTPYAVTDDTLVGQPGDTITRPKYAYIGAAEDLQEG 64 (270) T ss_pred CCceeh--------------h--hhcchHHHHHHHHHHHHhHHhhccccccccccCCCCCCEEEeeeecCCCccccccCC Confidence 665211 1 225669999999999999999999887753 3 57999999986543 44677888 Q ss_pred CCCCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccC Q lcl|Aclame:pro 77 ENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIA 156 (347) Q Consensus 77 ~~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~ 156 (347) +.++ .++++.++.+.+|-+. ...|.++|++...+..|++.+.+++++..|++++|+.++..+..+. .. T Consensus 65 ~~i~--~~~lt~~~~~a~i~~~-gk~~~itD~a~~~~~~dp~~~~~~q~a~~~a~~~d~~li~~l~~a~-~~-------- 132 (270) T protein:vir:95 65 VAMD--TTQMSMTTTKVTVKET-GKAVEVTQTAIITNVNGTLQEASRQLAMSLADKVEIDYIAELNKSK-QT-------- 132 (270) T ss_pred Cccc--hhhcccchheeeeehh-hCcceecHHHHhhhccchHHHHHHHHHHHHHHHHHHHHHHHhcccc-cc-------- Confidence 8885 4679999999999775 7899999999998888999999999999999999999886653211 00 Q ss_pred cccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhccccccccccc Q lcl|Aclame:pro 157 GLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALIDPETGN 236 (347) Q Consensus 157 g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~ 236 (347) .. .... ++.|.+|..+|.+.. ....+++|.|..|..|.++..+....+ +.+...+|. T Consensus 133 ---------~~-------~~~t----~~~~~dA~~~lgd~~--~~~~~i~vhs~~~~~Lrk~~~~~~~~~-~~~~~~~G~ 189 (270) T protein:vir:95 133 ---------AT-------VSAD----ATGILDAIEVFNSEN--DEDYVLYVNPKDYNKLVKSLFKVGGNV-QDRAISKGD 189 (270) T ss_pred ---------cc-------cccC----HHHHHHHHHHhcccc--CCCcEEEEcHHHHHHHHhhhccccccc-ccchhcccc Confidence 00 0011 456788888886543 345789999999999998764433333 344577899 Q ss_pred eEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeeccccchh Q lcl|Aclame:pro 237 IRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARRPE 316 (347) Q Consensus 237 v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~~~~ 316 (347) |+.+.|++|+.+.+.|.. ..+.+|++-|++.+..+++.+|..||+. T Consensus 190 ig~~~G~~Viv~s~~~~~----------------------------------~~~~l~~~gAi~~~~~~~~~vEtdRd~~ 235 (270) T protein:vir:95 190 LVEIVGVSDIVKSKRVSE----------------------------------NTAFLQRYGAMEIVNKKKPEAYTDFDIL 235 (270) T ss_pred cceecceeEEEeCCCCCc----------------------------------eeEEEEeccceeeeecCCceeeeccchh Confidence 999999999886665421 1136889999999999999999999999 Q ss_pred hHhhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 317 FQADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 317 ~~~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) ++.|.|.+.+.||.++++|+.++.+.+.||- T Consensus 236 ~~~d~i~~~~~y~v~~~~~skvv~~t~~~a~ 266 (270) T protein:vir:95 236 KRTHLLSTNYHYSVNLKDETGVVKVTFKPSG 266 (270) T ss_pred hcccEEEeeeEEEEEEEccceEEEEEecCCC Confidence 9999999999999999999999999998877 No 54 >protein:vir:99523 Length: 311 # NCBI annotation: putative protein # Family: family:all:701 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958538;genbank:gi:41179320;genbank:GeneID:2717161 Probab=99.86 E-value=7.5e-24 Score=147.35 Aligned_cols=297 Identities=12% Similarity=0.080 Sum_probs=189.7 Q ss_pred cCccccHHHH-HHHHHhHHHHHHHHHHHhhhcccccc-cc-cCCceEEEeccccceeeeecCCCCCCCCCCCCCCCceEE Q lcl|Aclame:pro 17 GQSAADKLAL-FLKVFGGEVLTAFVRRSVTMDKHMVR-TI-QNGKSASFPVMGRTKGYYLAPGENLDDKRKDIKHSEKVI 93 (347) Q Consensus 17 ~~~~~d~~al-~ie~f~geV~~~f~~~s~~~~~~~~r-ti-~~G~tv~i~~iG~~t~~~~~~g~~~~~~~~~~~~~~~~l 93 (347) -+..++.-|| |.++|..++++.|...+++..+.+.. .+ .+||+|+||+|....+++|++++-. ...+++.+..++ T Consensus 1 ~~~~an~mAlnya~~~~~~Ld~~~~~~~~t~~l~~~~~~~~~Gak~VkIp~i~~~gl~dY~R~~g~--~~g~v~~~~et~ 78 (311) T protein:vir:99 1 MPTDAETRGFNYVTKDGNLLDQKITAGLFTAALGTPEVDLVNGGRSFTLKTISTSGLKDHTRGKGF--NSGTISDEKTIY 78 (311) T ss_pred CCCcchhhHHHHHHHHHHHHHHHHHhhhcccceecCchheeecCCEEEEEeeeeccccccccccCc--cccceeeeeeEE Confidence 1223444455 68999999999999988666554432 23 5899999999999999999987643 346789999999 Q ss_pred EEeeeeecchhhc--cHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccCcccCceeeeeccccc Q lcl|Aclame:pro 94 QIDGLLTSDVLIY--DIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIAGLGQAVVLNIGAAAD 171 (347) Q Consensus 94 ~ID~~~~~~~~Vd--d~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~g~~~~~~i~~~~~~~ 171 (347) +|++.+++.|.|| |+||.......-.-..+.......-++|.+-+..|+..+...... .. .+.. ..++. T Consensus 79 tl~~DR~~~f~vD~mDvdETn~~~~~ani~~~f~r~~vvPEiDayrfskla~~a~~~~~~---~~---~~~~---~~~~~ 149 (311) T protein:vir:99 79 TMGQDRDVEFYLDRQDVDETDNELAMANISNVFITEHVQPELDSYRFSKIATSFDNLDGT---DT---EGTL---LAKTH 149 (311) T ss_pred EeeeccceeeecchhchhhhhhhhHHHHHHHHHHHhhhcchhhHHHHHHHHhhhhccccc---cc---chhh---hcccc Confidence 9999999999999 666665555544555556666677889998887776544221110 00 0000 01111 Q ss_pred ccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhh-hhccc-cccccccceEEEeceeEEEe- Q lcl|Aclame:pro 172 LVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNA-ANYAA-LIDPETGNIRNVMGFEVIEV- 248 (347) Q Consensus 172 ~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~-~~~~~-~~~~~~G~v~~i~G~~V~~s- 248 (347) ......+...+++.|..+..+|+| ||.++|+++|+|+.|.+|.+.++|.. .+-.. .....+++|+++.|++|++. T Consensus 150 ~~~~~lt~~nvl~~l~~~~~~~~~--v~~~~rvl~vTp~~~~lLk~~~~~~r~~~~~~~~~~~i~~~V~~lDgv~Ii~V~ 227 (311) T protein:vir:99 150 KTEETLDETNAYSQLKTGIGKVRK--YGTQNLVGYVSSEVMDALERSKEFTRNITNQNVGTTALESRITSIDGVQLIEVY 227 (311) T ss_pred ccccccCHHHHHHHHHHHHHHHHh--cCCCCeEEEEChHHHHHHhhchhhheeeecccccccccccccceecCeEEEEec Confidence 122234556789999999999987 79999999999999998877776653 11111 12345888999999999864 Q ss_pred --ccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeeccccchhh----HhhHH Q lcl|Aclame:pro 249 --PHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARRPEF----QADQI 322 (347) Q Consensus 249 --n~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~~~~~----~~d~i 322 (347) +++...- . ..+|.... .++ -..+ .++.||+|+....-.+ .+. .+.|.- .+|++ T Consensus 228 ps~r~~t~~---~-ft~G~~~~---------~~a-k~IN-----fiiv~~~a~i~~~K~~-~v~-~f~P~~~~~gd~~l~ 286 (311) T protein:vir:99 228 ESNRFMTKY---D-FTDGAKPT---------EDA-KAIN-----FLVVAKPAVISIVKEN-AVF-LFAPGQHTDGDGYLY 286 (311) T ss_pred Cchhhcchh---h-hcCCcccc---------Ccc-cccc-----eEEeCCCeeeeeeeee-eee-eeCCCCCCCcceeee Confidence 4554211 1 11111110 000 0111 4899999874333222 122 233432 37788 Q ss_pred hhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 323 IGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 323 ~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) ..+..+..-|+.....+....--.| T Consensus 287 ~~R~Y~D~fv~~nk~~~Iyv~~k~A 311 (311) T protein:vir:99 287 QNRLYHDLFIKKHKRDGIFVSVKKA 311 (311) T ss_pred eeeeeeeeeeeccccCeEEEeeecC Confidence 8887777777766666665544444 No 55 >protein:vir:79712 Length: 285 # NCBI annotation: major capsid protein gp34 # Family: family:all:701 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285883;genbank:gi:148750840;genbank:GeneID:5220414 Probab=99.85 E-value=8e-24 Score=147.22 Aligned_cols=266 Identities=12% Similarity=0.046 Sum_probs=180.4 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhccccc-----ccccCCceEEEeccc-cceeeeec Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMV-----RTIQNGKSASFPVMG-RTKGYYLA 74 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~-----rti~~G~tv~i~~iG-~~t~~~~~ 74 (347) ||. -+.++|...+++.|...+++..+... ....+||+|+||++. ...+++|+ T Consensus 1 Mai----------------------n~~~k~~~~ld~~~~~~~~~~~l~~~~n~~~~~~~gak~VkIp~ist~~gl~dY~ 58 (285) T protein:vir:79 1 MTV----------------------VLDSKDLARIDEEYKADSQVWSYLTGGNGVTQRFRGHNEVRINKLSGFVDATAYK 58 (285) T ss_pred Ccc----------------------hhhHHHHHHHHHHHHHhhhhhhhcccCCcceeEecCCCEEEEeeecccccccccc Confidence 554 14589999999999988777655433 235689999999996 46789998 Q ss_pred CCCCCCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHH-HHHHHHHHHHHHHHHHHHHhhhccccccc Q lcl|Aclame:pro 75 PGENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQ-LGEALAIAADGAVLAEMAKLCNLPAASNE 153 (347) Q Consensus 75 ~g~~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~-~g~aLa~~~D~~il~~l~~~a~~a~~~~~ 153 (347) ++... ...+++.+..+++|++.+++.|.||.+|..++..=....++.+ .......++|.+-+..|+..+.. T Consensus 59 R~~g~--~~g~v~~~~et~tl~~DR~~~f~iD~mDvdEn~~~~~~ni~~ef~~~~vvPEiDayrfskla~~a~~------ 130 (285) T protein:vir:79 59 RGQDN--ARKTISVGKETVKLTHEDWFGYDLDQFDMDENGAYTVENVVREHNKMITIPHRDKVAVQKLFDSAAK------ 130 (285) T ss_pred cccCc--cccccceeeeEEEeeccccceecccccchhhhhhhhHHHHHHHHHhhhhcchhhHHHHHHHHhhccc------ Confidence 76644 3467889999999999999999999555444221123333333 44455677888877666533210 Q ss_pred ccCcccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhcccc---c Q lcl|Aclame:pro 154 NIAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAAL---I 230 (347) Q Consensus 154 ~~~g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~---~ 230 (347) .. + ......+++++|.++.++|+|..|| ++||++|+|++|.+|.++++|...--... . T Consensus 131 ---------~~---~------~~~T~~nv~~~i~~~~~~lde~~vp-~~rvl~vTp~~~~~Lk~s~~~~r~~~~~~~~~~ 191 (285) T protein:vir:79 131 ---------KA---T------DSITKDNALDAYDTAEAYMFDNEVP-GGFVMFVSSAYYTALKQSAAVTRTFSTDGTMVI 191 (285) T ss_pred ---------cc---c------cccCHHHHHHHHHHHHHHHHHcCCC-CceEEEEChHHHHHHHhhhhhheecccccceec Confidence 00 0 0122455799999999999999999 69999999999999999888764321111 2 Q ss_pred cccccceEEEec-eeEEEe--ccccccccccccccCcccccccccccccccccccccccc-ceeEEeechhhhhhhhhhh Q lcl|Aclame:pro 231 DPETGNIRNVMG-FEVIEV--PHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQN-NVVGLFNHRSAVGTVKLKD 306 (347) Q Consensus 231 ~~~~G~v~~i~G-~~V~~s--n~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~-~~~~l~~h~~A~~tv~~~~ 306 (347) .-.+++|+++.| ++|++. +++.... ++ +.=.++.||+|+....-.+ T Consensus 192 ~~i~~~V~~lDg~v~ii~Vps~r~kt~~------------------------------~~k~Infiiv~~~a~i~~~K~~ 241 (285) T protein:vir:79 192 NGIDRRVAQLDGGVPIVRVSSDRLKGLG------------------------------ITNHVNFILTPLSAIAPIVKYD 241 (285) T ss_pred cceeeeeccccceeEEEEcchhhccCcC------------------------------cchhccEEEecCceeccceeee Confidence 235678999998 899984 4553210 00 1114899999874443332 Q ss_pred eeeccccchh--h--HhhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 307 MALERARRPE--F--QADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 307 ~~~e~~~~~~--~--~~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) .+. .++|. + -+|++..+..+..-|+.....+......|| T Consensus 242 -~~~-~f~P~~~~~~d~~~~~~R~Y~d~fv~~nk~~~Iy~~~~a~ 284 (285) T protein:vir:79 242 -SVS-VIDPSTDRSGNRWTIKGLSYYDAIVLDNAKKGIYVAATAG 284 (285) T ss_pred -eeE-eECCCCCCCcceeeeeeeeeeeeeehhhccceeeeeeccc Confidence 122 23332 2 367888888888888888888888777777 No 56 >protein:vir:78090 Length: 302 # NCBI annotation: Cps # Family: family:all:701 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468790;genbank:gi:157325371;genbank:GeneID:5601852 Probab=99.81 E-value=3.9e-22 Score=137.94 Aligned_cols=285 Identities=13% Similarity=0.088 Sum_probs=183.1 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccc---cccCCceEEEeccc-----cceeee Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVR---TIQNGKSASFPVMG-----RTKGYY 72 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~r---ti~~G~tv~i~~iG-----~~t~~~ 72 (347) ||| +. -+.++|.+++++.|...+++..+.... .+.+||+|+||+|- .+-.++ T Consensus 1 Man-tl-------------------~ya~~~~~~Ld~~~~~~~~t~~l~~~~~~v~~~Gak~vkIp~is~~~~~TsGl~d 60 (302) T protein:vir:78 1 MAN-SL-------------------ALAQIYQDNIDKAIAVNSKSAFLEANPNNVQYNGGNTIKIADISFGSGTTGDLKA 60 (302) T ss_pred CCc-hh-------------------HHHHHHHHHHHHHHHhhhceeecccCCceEEEecCcEEEEEEEEeeccccccccc Confidence 776 11 277999999999999998777663322 36789999999995 456778 Q ss_pred ecCCCCCCCCCCCCCCCceEEEEeeeeecchhhc--cHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccc Q lcl|Aclame:pro 73 LAPGENLDDKRKDIKHSEKVIQIDGLLTSDVLIY--DIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAA 150 (347) Q Consensus 73 ~~~g~~~~~~~~~~~~~~~~l~ID~~~~~~~~Vd--d~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~ 150 (347) |++++-.. ..+++.+..++++++.+++.|.|| |+||.......-..+.+.......-++|.+-+..|+..+.... T Consensus 61 y~R~~g~~--~g~v~~~~et~tlt~DR~~~f~vD~mDvdETn~~~~~ani~~ef~r~~vvPEiDayrfskla~~a~~~~- 137 (302) T protein:vir:78 61 YNRSTGFT--QGSVTLAWSDYTLDYDLAQSFQIDAMDVDETKNLATVGNVLSEYQRTKIVPAIDKYRFTKLANDGTGVG- 137 (302) T ss_pred cccccCcc--ccceeeeeeeEEeeeccceeeeccccchhhhhhhhHHHHHHHHHHHhhhcchhhHHHHHHHHHhhhccC- Confidence 88766432 355788899999999999999999 6666555554455555556777788999988877754331110 Q ss_pred cccccCcccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhh-hcc-c Q lcl|Aclame:pro 151 SNENIAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAA-NYA-A 228 (347) Q Consensus 151 ~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~-~~~-~ 228 (347) .. .....+....+++++.|..+.++|+|+ ++|+++|+|+.+.+|.+++.+... +.. . T Consensus 138 -----------~~------~~~~~~~~t~~nvl~~i~~~~~~~~e~----~~~vl~vtp~~~~~Lk~a~~~~~~~~~~~~ 196 (302) T protein:vir:78 138 -----------GV------IDLSKPDASAQALMGDIATAMELVDDS----NQLILVTSPTTLAGLLNTALIRESKNTQVL 196 (302) T ss_pred -----------cc------ccccccchhHHHHHHHHHHHHHHhhcc----CCeEEEEChHHHHHHhcchhhccceecccc Confidence 00 001112234567889999999999996 589999999999999887666421 111 1 Q ss_pred cccccccceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhhee Q lcl|Aclame:pro 229 LIDPETGNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMA 308 (347) Q Consensus 229 ~~~~~~G~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~ 308 (347) ..+..+++|+++.|++|++.+.--.... .. ..+|..... ++ -+.=.|+.||+|+....-.+ . T Consensus 197 ~~~~i~~~V~~lDgv~Ii~VPs~r~~t~-~~-f~~G~~~~~---------~a------k~INfiiv~~~a~ia~~K~~-~ 258 (302) T protein:vir:78 197 RRGEVDTKITFIQDVEVLQVPSEYLYDK-VA-PKVGVPDYT---------GA------KKIPYMIFKRDAPTGIVKTD-K 258 (302) T ss_pred ccccccceeeeecccEEEEchhhhcccc-ee-ccCCccccC---------Cc------cceeEEEECCCeeeeeeeee-e Confidence 2234588999999999998553211111 11 111111110 00 11124899999874333322 1 Q ss_pred eccc-cchhhHh--hHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 309 LERA-RRPEFQA--DQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 309 ~e~~-~~~~~~~--d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) +..+ .++.+.+ |++..+..+..-|+.....+.+....+| T Consensus 259 ~~if~P~~~~~gd~~l~~~R~Y~D~fV~~nk~~gI~~~~~~~ 300 (302) T protein:vir:78 259 VRVFEPDTNQSADAYKVDLRLYHDLIVPKNQRPGIIKASFGT 300 (302) T ss_pred eEeeCCCCCCCcceeeeeeeeEeeeeeeccccCeEEEeeccc Confidence 2222 2334555 5777777777777776666666555444 No 57 >protein:vir:95451 Length: 313 # NCBI annotation: hypothetical protein ORF044 # Family: family:all:11728 # MgeID: mge:1570 # MgeName: PA11 # Cross-refs: genbank:acc:YP_001294637;genbank:gi:149408203;genbank:GeneID:5237018 Probab=99.70 E-value=5.6e-20 Score=126.14 Aligned_cols=298 Identities=18% Similarity=0.163 Sum_probs=193.5 Q ss_pred cCccccHHHHHH-HHHhHHHHHHHHHHHhhhcccc-cccccCCceEEEeccccceeeeecCCCCCCCCCCCCCCCceEEE Q lcl|Aclame:pro 17 GQSAADKLALFL-KVFGGEVLTAFVRRSVTMDKHM-VRTIQNGKSASFPVMGRTKGYYLAPGENLDDKRKDIKHSEKVIQ 94 (347) Q Consensus 17 ~~~~~d~~al~i-e~f~geV~~~f~~~s~~~~~~~-~rti~~G~tv~i~~iG~~t~~~~~~g~~~~~~~~~~~~~~~~l~ 94 (347) -+..++-.|+.. |+|+-+++-..+..-+-..+.+ +-..-.|++.|||.+|.++++.....+ +-..+++++.|.++. T Consensus 1 ~~~TSNT~A~I~SE~~s~~I~~~LH~~LL~~~~~R~V~DF~~G~~L~I~tiGs~~~~~~~E~~--~~~~~~i~TGEIt~~ 78 (313) T protein:vir:95 1 MQLTSNTRAFIESEQYSKFILLNLHDGLLPETFYRNVSDFGSGETLHIKTIGSVTLQEAEEDT--PLIYNPIETGEITFQ 78 (313) T ss_pred CcccccchheehhhhHHHHHHHHhhccccchhhhhhhccCCCCCEEEecccCceeeeccccCC--CeeecccccceEEEE Confidence 112233344333 9999998877776644444444 445678999999999999998755333 445688999999999 Q ss_pred Eeeeeecchhh-ccHHHHHhCcc-hHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccc-cccccCcccCceeeeeccccc Q lcl|Aclame:pro 95 IDGLLTSDVLI-YDIEDAMNHYD-VRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAA-SNENIAGLGQAVVLNIGAAAD 171 (347) Q Consensus 95 ID~~~~~~~~V-dd~D~~q~~~D-~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~-~~~~~~g~~~~~~i~~~~~~~ 171 (347) |-+++-.+++| +|+-+.-..+| ++.+...|.++|+.+.+...+|..- .+.++.. .+..+. +.++++ +++.++ T Consensus 79 i~~Y~G~A~~vt~~LR~D~~~I~~~~A~~~AE~~RAI~E~~~TD~L~~G--~~~FA~~~~P~~vN--G~PH~~-V~~~T~ 153 (313) T protein:vir:95 79 ITEYKGDAWYVTDDLREDGTDIDRLMAERAAESTRAIQETFETDFLKTG--AEYFAANPGPHNVN--GFPHVI-VSAETN 153 (313) T ss_pred EEeecCChhhhhhhhhhcchhHHHHhhhcchhhHHHHHHHHhhHHHhhc--hhhhccCCCCcccc--cccceE-EeccCC Confidence 99988778778 56777766676 8899999999999999987766431 2233332 223333 344444 334444 Q ss_pred ccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhh-----hhcccccccccc--ceEEEecee Q lcl|Aclame:pro 172 LVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNA-----ANYAALIDPETG--NIRNVMGFE 244 (347) Q Consensus 172 ~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~-----~~~~~~~~~~~G--~v~~i~G~~ 244 (347) .+.. +..|..++-.|++.++|.+||+.+++|..-.-|-.-..+.+ ..+.-....+.| .|.+++|++ T Consensus 154 ~~~~-------~~~~~~~~~~~~~a~~P~~G~v~IvDP~~~~~L~~l~~It~~vt~~~k~I~ESG~A~~~~Fi~~~YG~D 226 (313) T protein:vir:95 154 GVFA-------LKHLIAMRLAFDKANVPAEGRVFIVDPVAEATLNGLVTITHDVTDFGKMILESGMARGQRFIMNLYGWD 226 (313) T ss_pred ceeh-------hhHHHHhhhhhhhccCCccceEEEEcchhhhhhhhhheeecccccccceeeeccCCchhHHHHHHhhhh Confidence 3332 34467788899999999999999999987776643322222 112112233333 478899999 Q ss_pred EEEeccccccccccccccCcccccccc--ccccccccccccccccceeEEeechhhhhhhhhhheeeccccchhhHhhHH Q lcl|Aclame:pro 245 VIEVPHLTVGGAGDNNPADGVAPTNQK--HIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARRPEFQADQI 322 (347) Q Consensus 245 V~~sn~lp~~~~~~~~~~~~~~~t~~~--~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~~~~~~~d~i 322 (347) ++.||.|.....+ ++.+++++. +.|-- .-.+..+-..++| + ..+++|.+++..+--+-- T Consensus 227 i~~SN~L~~AN~~-----D~~tT~~G~~~NlFM~-----i~D~~~~P~~~AW--------r-~MP~s~~~~~~~~~~~~~ 287 (313) T protein:vir:95 227 ILTSNRLHVANYN-----DGTTTGNGYVGNLFMC-----ILDDQTKPIMGAW--------R-RMPKSEGERNKDRARDEH 287 (313) T ss_pred hhhhhhhhhcccc-----ccccccCceeeeeeee-----eecccccceeeee--------c-cccccccccccccccccc Confidence 9999999754433 222222111 00000 0011112122222 2 334778888877777778 Q ss_pred hhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 323 IGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 323 ~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) .-...||.++.|-|.+|.+...++| T Consensus 288 ~~~~R~G~Gi~R~~~L~~~~~~A~~ 312 (313) T protein:vir:95 288 VVRCRYGFGIQRLDTLGLLATSATA 312 (313) T ss_pred eeeeeecccceeecceeEEEecccc Confidence 8889999999999999999999999 No 58 >protein:vir:100939 Length: 430 # NCBI annotation: Gp5 # Family: family:all:1412 # MgeID: mge:1509 # MgeName: ST104 # Cross-refs: genbank:acc:YP_006408;genbank:gi:46358700;genbank:GeneID:2777089 Probab=99.69 E-value=8.2e-19 Score=119.75 Aligned_cols=303 Identities=14% Similarity=0.100 Sum_probs=186.5 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhccccc-c--c---ccCCceEEEeccccceeeeec Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMV-R--T---IQNGKSASFPVMGRTKGYYLA 74 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~-r--t---i~~G~tv~i~~iG~~t~~~~~ 74 (347) ||| +. .-.+++=.-|++..|....++...+.+ | . -+.|+++.+|.--..... T Consensus 1 MAn-~l------------------~~~~~ii~~eal~~l~n~~v~a~~~~~~r~~d~~~~r~Gdti~~p~~~~~~~~--- 58 (430) T protein:vir:10 1 MAL-NE------------------GQIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQ--- 58 (430) T ss_pred Ccc-ch------------------hhHHHHHHHHHHHHHhhhhhhhhhhcccCCchhhhhcccceEEeccccccccc--- Confidence 888 22 124456667888899888877764332 2 1 367999988865444332 Q ss_pred CCCCCCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccc Q lcl|Aclame:pro 75 PGENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNEN 154 (347) Q Consensus 75 ~g~~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~ 154 (347) .|..+.+...++....+.++||+.+--.|.+.+-| +...+....+.+.+.++||..+|..++..++.-........ T Consensus 59 ~G~~~t~~~~~i~e~~v~~~v~~~k~V~~~~~~ke--l~~~~~~~~~i~~Am~~LA~~Vd~dl~~~~~~~~~~v~~~~-- 134 (430) T protein:vir:10 59 EGWDLTDKATGLLELNVAVNMGEPDNDFFQLRADD--LRDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSP-- 134 (430) T ss_pred cCcccCCCCCccccceEEEEEeeeccceEEechhH--hcChhHHHHHhHHHHHHHHHHHHHHHHHHhhhccccccccc-- Confidence 36555555455666789999999999999987655 46777778888999999999999999876543221111000 Q ss_pred cCcccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCC-CCEEEEChHHHHHHhcc-hhhhhhhccccccc Q lcl|Aclame:pro 155 IAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAG-DRRFYCAPEDYSAILSA-LMPNAANYAALIDP 232 (347) Q Consensus 155 ~~g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~-gR~~vv~P~~~~~Ll~~-~~~~~~~~~~~~~~ 232 (347) .++....++ .+..+-.+.+.|++..||.+ +|.++++|+.+..|... .++.+.+-.....+ T Consensus 135 -----------~~t~~~~~~-------~~~~~A~a~~~L~~~~vP~~~~R~~vldp~~~~~l~~~l~~l~~~~~~~~~A~ 196 (430) T protein:vir:10 135 -----------DAIGTNTAD-------AWNFVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAY 196 (430) T ss_pred -----------ccCCCcCCc-------chhhHHHHHHHHHHhcCCCCCCcEEEeChHHHHHHHhhhccccccccchhHHH Confidence 011111111 24567778999999999996 89999999999998753 23333333344568 Q ss_pred cccceEE-Eecee-EEEecccccccccccccc--C---------------cc-------------cccc---ccccc-cc Q lcl|Aclame:pro 233 ETGNIRN-VMGFE-VIEVPHLTVGGAGDNNPA--D---------------GV-------------APTN---QKHIF-PA 276 (347) Q Consensus 233 ~~G~v~~-i~G~~-V~~sn~lp~~~~~~~~~~--~---------------~~-------------~~t~---~~~~~-~a 276 (347) ++|.|++ +.||+ +++++++|....+..... . |. +.++ .+..| -+ T Consensus 197 r~g~i~~~~~Gfd~~~~~~~~~~~t~g~~t~~tv~gA~~~~~~~~~v~~~g~~~~~d~~~~tit~s~tg~l~~GD~ftia 276 (430) T protein:vir:10 197 RDGTIQRQVAGFDDVLRSPKLPVLTKSTATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFT 276 (430) T ss_pred hhccccccchhhhhhhhcCCcccccCccCcCceeccccccccccceecccccccccccccceeeeecccceecccEEEec Confidence 9999997 89995 799999997433221111 0 00 0000 00000 00 Q ss_pred c---------------------------------------------cccccc-----------ccc----cceeEEeech Q lcl|Aclame:pro 277 T---------------------------------------------ATGDDR-----------VAQ----NNVVGLFNHR 296 (347) Q Consensus 277 ~---------------------------------------------~~~~y~-----------~d~----~~~~~l~~h~ 296 (347) | .+..|. +.+ .-...|+||| T Consensus 277 GV~~v~~~tkq~~~~l~~F~Vt~~~~atsv~I~paii~~~~~~~~~~~~~y~nVsaspa~~aavTvv~~a~~~~Nl~fhr 356 (430) T protein:vir:10 277 GVKFLGQMAKNVLAQDATFSVVRVVDGTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWAD 356 (430) T ss_pred ceeeeccccccccCCccEEEEEEecCCceeEEeccccccccccccccccccceeccccccCceeEEeccCCcccceeEcc Confidence 1 000010 000 0123599999 Q ss_pred hhhhhhhhhh---------------------ee--eccccchhhHhhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 297 SAVGTVKLKD---------------------MA--LERARRPEFQADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 297 ~A~~tv~~~~---------------------~~--~e~~~~~~~~~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) +|+..+.... +. +-..||.......++--..||.+.+|||.++.+...-+| T Consensus 357 ~A~aLa~~pL~~~~~~~~~~~~~~~~~~~~Glsirv~~~yd~~~~~~~~r~DvLyG~~~v~Pe~a~v~l~g~~~ 430 (430) T protein:vir:10 357 DAIRIVSQPIPANHELFAGMKTTSFSIPDVGLNGIFATQGDISTLSGLCRIALWYGVNATRPEAIGVGLPGQTA 430 (430) T ss_pred cceEEEEecccCCCCHHHhhhhheeccccceEEEEEEEecccccCceEEEEeeeccceecCcceEEEEcCCCCC Confidence 9874332221 11 111345444445556667899999999999999888888 No 59 >protein:vir:9265 Length: 430 # NCBI annotation: 5 # Family: family:all:1412 # MgeID: mge:164 # MgeName: ST64T # Cross-refs: genbank:acc:NP_720329;genbank:gi:24371587;genbank:GeneID:955820 Probab=99.69 E-value=8.2e-19 Score=119.75 Aligned_cols=303 Identities=14% Similarity=0.100 Sum_probs=186.5 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhccccc-c--c---ccCCceEEEeccccceeeeec Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMV-R--T---IQNGKSASFPVMGRTKGYYLA 74 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~-r--t---i~~G~tv~i~~iG~~t~~~~~ 74 (347) ||| +. .-.+++=.-|++..|....++...+.+ | . -+.|+++.+|.--..... T Consensus 1 MAn-~l------------------~~~~~ii~~eal~~l~n~~v~a~~~~~~r~~d~~~~r~Gdti~~p~~~~~~~~--- 58 (430) T protein:vir:92 1 MAL-NE------------------GQIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPTQ--- 58 (430) T ss_pred Ccc-ch------------------hhHHHHHHHHHHHHHhhhhhhhhhhcccCCchhhhhcccceEEeccccccccc--- Confidence 888 22 124456667888899888877764332 2 1 367999988865444332 Q ss_pred CCCCCCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccc Q lcl|Aclame:pro 75 PGENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNEN 154 (347) Q Consensus 75 ~g~~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~ 154 (347) .|..+.+...++....+.++||+.+--.|.+.+-| +...+....+.+.+.++||..+|..++..++.-........ T Consensus 59 ~G~~~t~~~~~i~e~~v~~~v~~~k~V~~~~~~ke--l~~~~~~~~~i~~Am~~LA~~Vd~dl~~~~~~~~~~v~~~~-- 134 (430) T protein:vir:92 59 EGWDLTDKATGLLELNVAVNMGEPDNDFFQLRADD--LRDETAYRHRIQSAARKLANNVELKVANMAAEMGSLVITSP-- 134 (430) T ss_pred cCcccCCCCCccccceEEEEEeeeccceEEechhH--hcChhHHHHHhHHHHHHHHHHHHHHHHHHhhhccccccccc-- Confidence 36555555455666789999999999999987655 46777778888999999999999999876543221111000 Q ss_pred cCcccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCC-CCEEEEChHHHHHHhcc-hhhhhhhccccccc Q lcl|Aclame:pro 155 IAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAG-DRRFYCAPEDYSAILSA-LMPNAANYAALIDP 232 (347) Q Consensus 155 ~~g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~-gR~~vv~P~~~~~Ll~~-~~~~~~~~~~~~~~ 232 (347) .++....++ .+..+-.+.+.|++..||.+ +|.++++|+.+..|... .++.+.+-.....+ T Consensus 135 -----------~~t~~~~~~-------~~~~~A~a~~~L~~~~vP~~~~R~~vldp~~~~~l~~~l~~l~~~~~~~~~A~ 196 (430) T protein:vir:92 135 -----------DAIGTNTAD-------AWNFVADAEELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAY 196 (430) T ss_pred -----------ccCCCcCCc-------chhhHHHHHHHHHHhcCCCCCCcEEEeChHHHHHHHhhhccccccccchhHHH Confidence 011111111 24567778999999999996 89999999999998753 23333333344568 Q ss_pred cccceEE-Eecee-EEEecccccccccccccc--C---------------cc-------------cccc---ccccc-cc Q lcl|Aclame:pro 233 ETGNIRN-VMGFE-VIEVPHLTVGGAGDNNPA--D---------------GV-------------APTN---QKHIF-PA 276 (347) Q Consensus 233 ~~G~v~~-i~G~~-V~~sn~lp~~~~~~~~~~--~---------------~~-------------~~t~---~~~~~-~a 276 (347) ++|.|++ +.||+ +++++++|....+..... . |. +.++ .+..| -+ T Consensus 197 r~g~i~~~~~Gfd~~~~~~~~~~~t~g~~t~~tv~gA~~~~~~~~~v~~~g~~~~~d~~~~tit~s~tg~l~~GD~ftia 276 (430) T protein:vir:92 197 RDGTIQRQVAGFDDVLRSPKLPVLTKSTATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFT 276 (430) T ss_pred hhccccccchhhhhhhhcCCcccccCccCcCceeccccccccccceecccccccccccccceeeeecccceecccEEEec Confidence 9999997 89995 799999997433221111 0 00 0000 00000 00 Q ss_pred c---------------------------------------------cccccc-----------ccc----cceeEEeech Q lcl|Aclame:pro 277 T---------------------------------------------ATGDDR-----------VAQ----NNVVGLFNHR 296 (347) Q Consensus 277 ~---------------------------------------------~~~~y~-----------~d~----~~~~~l~~h~ 296 (347) | .+..|. +.+ .-...|+||| T Consensus 277 GV~~v~~~tkq~~~~l~~F~Vt~~~~atsv~I~paii~~~~~~~~~~~~~y~nVsaspa~~aavTvv~~a~~~~Nl~fhr 356 (430) T protein:vir:92 277 GVKFLGQMAKNVLAQDATFSVVRVVDGTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWAD 356 (430) T ss_pred ceeeeccccccccCCccEEEEEEecCCceeEEeccccccccccccccccccceeccccccCceeEEeccCCcccceeEcc Confidence 1 000010 000 0123599999 Q ss_pred hhhhhhhhhh---------------------ee--eccccchhhHhhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 297 SAVGTVKLKD---------------------MA--LERARRPEFQADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 297 ~A~~tv~~~~---------------------~~--~e~~~~~~~~~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) +|+..+.... +. +-..||.......++--..||.+.+|||.++.+...-+| T Consensus 357 ~A~aLa~~pL~~~~~~~~~~~~~~~~~~~~Glsirv~~~yd~~~~~~~~r~DvLyG~~~v~Pe~a~v~l~g~~~ 430 (430) T protein:vir:92 357 DAIRIVSQPIPANHELFAGMKTTSFSIPDVGLNGIFATQGDISTLSGLCRIALWYGVNATRPEAIGVGLPGQTA 430 (430) T ss_pred cceEEEEecccCCCCHHHhhhhheeccccceEEEEEEEecccccCceEEEEeeeccceecCcceEEEEcCCCCC Confidence 9874332221 11 111345444445556667899999999999999888888 No 60 >protein:vir:2106 Length: 430 # NCBI annotation: coat protein # Family: family:all:1412 # MgeID: mge:46 # MgeName: P22 # Cross-refs: genbank:acc:NP_059630;genbank:gi:9635538;genbank:GeneID:1262831 Probab=99.68 E-value=1.8e-18 Score=117.90 Aligned_cols=302 Identities=16% Similarity=0.112 Sum_probs=185.2 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhccccc-c--c---ccCCceEEEeccccceeeeec Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMV-R--T---IQNGKSASFPVMGRTKGYYLA 74 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~-r--t---i~~G~tv~i~~iG~~t~~~~~ 74 (347) |||. - + -++++=--|++..|....++...+.+ | . -+.|+++.+|.--.... . T Consensus 1 Ma~~-~--------------~----~~lti~~~eal~~~~n~lV~a~~~~~~r~~d~~~~r~Gdti~ip~p~~~~~---~ 58 (430) T protein:vir:21 1 MALN-E--------------G----QIVTLAVDEIIETISAITPMAQKAKKYTPPAASMQRSSNTIWMPVEQESPT---Q 58 (430) T ss_pred Cccc-c--------------c----hhhHHHHHHHHHHhhhhhhhhhhhhccCCchhhhhcccceEEeeccccccc---c Confidence 8872 1 0 12332228889999988888765332 2 2 36799999885433322 2 Q ss_pred CCCCCCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccc Q lcl|Aclame:pro 75 PGENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNEN 154 (347) Q Consensus 75 ~g~~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~ 154 (347) .|.++.+...++....+.++||+.+--.|.+.+ +| +...|....+.+.+.++||..+|..++..++.......... T Consensus 59 ~G~~~t~~~~~~~e~~v~~~~~~~~~V~~~~~~-kE-l~~~~~~er~l~pAm~~LA~~Vd~dl~~~~~~~~~~v~~~~-- 134 (430) T protein:vir:21 59 EGWDLTDKATGLLELNVAVNMGEPDNDFFQLRA-DD-LRDETAYRRRIQSAARKLANNVELKVANMAAEMGSLVITSP-- 134 (430) T ss_pred ccccccCCCccceeeeEeEEEeeeccceEEeeh-hH-hcChhhHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcccccc-- Confidence 355555555567778899999999887788763 33 45677778999999999999999999877643221111000 Q ss_pred cCcccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCC-CCEEEEChHHHHHHhcc-hhhhhhhccccccc Q lcl|Aclame:pro 155 IAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAG-DRRFYCAPEDYSAILSA-LMPNAANYAALIDP 232 (347) Q Consensus 155 ~~g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~-gR~~vv~P~~~~~Ll~~-~~~~~~~~~~~~~~ 232 (347) -++.+..++ .+..+-.+.+.|++..||.+ +|.++++|+.+..|... .++.+.+-.....+ T Consensus 135 -----------~~t~~~~~~-------~~~~~A~a~~~L~~~~vP~~~~R~~~~~p~~~~~l~~~l~~~~~~~~~~~~A~ 196 (430) T protein:vir:21 135 -----------DAIGTNTAD-------AWNFVADAEEIMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAY 196 (430) T ss_pred -----------CCCCCCCCc-------chhhHHHHHHHHHHhcCCCCCCcEEEeChHHHHHHhhhhccccccccchhHHH Confidence 011111111 14667778899999999995 79999999999998764 33333333345568 Q ss_pred cccceEE-Eecee-EEEecccccccccccccc-----------------Ccc-------------cccc-----cccccc Q lcl|Aclame:pro 233 ETGNIRN-VMGFE-VIEVPHLTVGGAGDNNPA-----------------DGV-------------APTN-----QKHIFP 275 (347) Q Consensus 233 ~~G~v~~-i~G~~-V~~sn~lp~~~~~~~~~~-----------------~~~-------------~~t~-----~~~~~~ 275 (347) ++|.|++ +.||+ +++++++|....+..... .|. +.++ .... - T Consensus 197 r~g~i~r~~~Gfd~~~~s~~~~~~t~gt~t~~tv~gA~~~~~~~~tv~~~g~~~~~d~~~~~it~s~tg~l~~GD~ft-i 275 (430) T protein:vir:21 197 RDGTIQRQVAGFDDVLRSPKLPVLTKSTATGITVSGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGMKRGDKIS-F 275 (430) T ss_pred hhcccccccchhhhhhhcCCcccccCccCcCceeccccccccccceeccccccccccccceeeeeecccceecccEEE-e Confidence 8999997 89996 799999997433221111 000 0000 0000 0 Q ss_pred cc-----------------------c----------------------ccccc-----------cc----ccceeEEeec Q lcl|Aclame:pro 276 AT-----------------------A----------------------TGDDR-----------VA----QNNVVGLFNH 295 (347) Q Consensus 276 a~-----------------------~----------------------~~~y~-----------~d----~~~~~~l~~h 295 (347) +| . +..|. +. ..-...|+|| T Consensus 276 aGV~~v~~itk~~~~~l~qf~V~a~~~~ttv~I~Pai~~~~~~~~~~~~~~y~nVsaspa~~aavT~v~~a~~~~Nl~fh 355 (430) T protein:vir:21 276 AGVKFLGQMAKNVLAQDATFSVVRVVDGTHVEITPKPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWA 355 (430) T ss_pred cceeeeccccccccCCcceEEEEEecCCceeEEeecccccccccccccccccceeccccccCceeEEeccCCcccceeEc Confidence 11 0 00010 00 0002349999 Q ss_pred hhhhhhhhhhh---------------------eeec--cccchhhHhhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 296 RSAVGTVKLKD---------------------MALE--RARRPEFQADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 296 ~~A~~tv~~~~---------------------~~~e--~~~~~~~~~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) |+|+..+.... +++. ..||.+.....++--..||.+.+|||.++.+...-+| T Consensus 356 ~~A~~La~~pl~~p~~~~~~~~~~~~~~~~~Glsirv~~~yd~~~~~~~~r~DilyG~~~l~Pe~a~v~l~g~~~ 430 (430) T protein:vir:21 356 DDAIRIVSQPIPANHELFAGMKTTSFSIPDVGLNGIFATQGDISTLSGLCRIALWYGVNATRPEAIGVGLPGQTA 430 (430) T ss_pred cceeEEEEecccCCCChhHhhheeeeeccccceEEEEEEccccccCceEEEEEeecCccccCcceEEEEcCCCCC Confidence 99874332221 1111 2245555555666677899999999999999888888 No 61 >protein:vir:78523 Length: 338 # NCBI annotation: Putative head structural protein # Family: family:all:507 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491585;genbank:gi:157786408;genbank:GeneID:5625675 Probab=99.63 E-value=8.1e-18 Score=114.30 Aligned_cols=308 Identities=12% Similarity=0.020 Sum_probs=176.4 Q ss_pred CCCCccCccccc-cCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEeccc---------ccee Q lcl|Aclame:pro 1 MANATGGQQIGA-NQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVMG---------RTKG 70 (347) Q Consensus 1 m~~~~~~~~~~~-~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~iG---------~~t~ 70 (347) ||+.|-.....+ ...++......-+|+.+.|..++.+.-++.|.++.+.++..+. +..+.+|+.- ..++ T Consensus 1 ~~~~~e~~~~~~~~~~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~l~~~~~~~-~~~~~ip~~~~~~~a~~v~~~~~ 79 (338) T protein:vir:78 1 MATLNELAPNTAGSNHQGRLAHVPSDLLPKEIVGPIFDKAQESSLVLRLGENIPIS-YGETIIPTTVKRPEVGQVGVGTS 79 (338) T ss_pred CcchHHhhhhhcccccccceecccccccchHHHHHHHHHHHhhchhhhhcceeecc-CCceEEEEEecCccceeeccccc Confidence 777764322211 1223333333444788999999999999999999998887765 4568887643 2333 Q ss_pred eeecCCCCCCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccc Q lcl|Aclame:pro 71 YYLAPGENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAA 150 (347) Q Consensus 71 ~~~~~g~~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~ 150 (347) .....|..++. .+++..++++..-+. +....|.+-=..++.+|+.+.+.++.++++++..|+.++.-- +.... T Consensus 80 ~~~~Eg~~~~~--~~~~f~~v~l~~~k~-~~~~~is~ell~ds~~~~~~~i~~~la~a~~~~~d~~~l~G~--g~~~~-- 152 (338) T protein:vir:78 80 NEQREGGTKPL--SGTAWDTRSVAPIKL-ATIVTVSEEFARMNPSGLYTKLQADLAYAIGRGIDLAVFHGK--SPLTG-- 152 (338) T ss_pred ccccccccccc--cccceeEEEEEEEEE-EEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHhhccc--CCCcc-- Confidence 33334444432 345666666655543 333344442223466899999999999999999999887311 00000 Q ss_pred cccccCcccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhh--hhccc Q lcl|Aclame:pro 151 SNENIAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNA--ANYAA 228 (347) Q Consensus 151 ~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~--~~~~~ 228 (347) ..+.|..........+..+ .........++.|+++...+.. +.......++++|..|..|.+...+.+ ..|.- T Consensus 153 --~~~~gi~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~m~~~~~~~L~~~~~l~d~~g~~l~ 227 (338) T protein:vir:78 153 --SALQGIDTNNVIVNTTNVD--YLQTGTTPLLDRFLDGYDLVSA-NTDVDFNGWAADPRYRARLLRSQAYRDANGNVDP 227 (338) T ss_pred --ccccccccccccccccccc--cccccchhhHHHHHHHHHHhhh-hccccceEEEEchHHHHHHHHHhhhccCCCceee Confidence 0011111100111111111 1112223457778777666543 333344568899999999977655443 33443 Q ss_pred cccccccceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhhee Q lcl|Aclame:pro 229 LIDPETGNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMA 308 (347) Q Consensus 229 ~~~~~~G~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~ 308 (347) ......|..+.++|++|+.++++|...... . +...--|-+||+... ++ ..++++ T Consensus 228 ~~~~~~~~~~~l~G~PV~~~~~ip~~~~~~---------~-------~~~~~~~~gdfs~~~--~~--------~~~~~~ 281 (338) T protein:vir:78 228 TRINLAASAGDLLGLPVQFGKAVGGDLGAA---------T-------DSKVRVVGGDFSQLK--YG--------FADEIR 281 (338) T ss_pred cccccCCCCceeeeeeEEEccccCcccccc---------C-------CcccEEEEEecceEE--EE--------eecccE Confidence 344556777899999999999998432110 0 011122455665532 22 222334 Q ss_pred eccccch--------h------hHh--hHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 309 LERARRP--------E------FQA--DQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 309 ~e~~~~~--------~------~~~--d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) ++...+. . ++- ..++....+|.+++||++.+.|..+.++ T Consensus 282 i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~ 336 (338) T protein:vir:78 282 VKMSDTATLTDNTSPTPQTVSMWQTNQIAILIEVTFGWLLGDKQAFVKFVDDEDP 336 (338) T ss_pred EEEeecccccccccccccchhhhhcCcEEEEEEEEeccEeecccceEEEecccCC Confidence 4333221 0 111 2357778899999999999999987777 No 62 >protein:vir:41 Length: 299 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463467;swissprot:trembl:q9t1b7;genbank:gi:16798789;uniprot:Q9T1B7;genbank:GeneID:922353 Probab=99.58 E-value=1e-16 Score=108.20 Aligned_cols=282 Identities=12% Similarity=0.105 Sum_probs=176.1 Q ss_pred ccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEeccccceeeeecCCCCCCCCCCCCCCC Q lcl|Aclame:pro 10 IGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVMGRTKGYYLAPGENLDDKRKDIKHS 89 (347) Q Consensus 10 ~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~iG~~t~~~~~~g~~~~~~~~~~~~~ 89 (347) +|+++-.+...++.-.+..+.+..++.+..++.++++.+.++.++. +.+.++|....+.+..+..|++++.+ +++.+ T Consensus 1 ~g~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~~~~~~~~~-~~~~~~~~~~~~~a~~v~E~~~~~~~--~~~f~ 77 (299) T protein:vir:41 1 MGFNPDTTTMQSAKTGSIPINISEQIITGVKNGSAAMKLAKAVPMT-KPEEEFTFMSGVGAFWVDEAERIQTS--KPTFT 77 (299) T ss_pred CCcCCCcccccCCCceecchhHHHHHHHHHHhcchhhhhceeeecC-CCcEEEEEEcCCceeeeecCcccccc--cccee Confidence 4444444433344334667999999999999999999998877765 56688898888888888888877643 47778 Q ss_pred ceEEEEeeeeecchhhccHHHHH-hCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccCcccCceeeeecc Q lcl|Aclame:pro 90 EKVIQIDGLLTSDVLIYDIEDAM-NHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIAGLGQAVVLNIGA 168 (347) Q Consensus 90 ~~~l~ID~~~~~~~~Vdd~D~~q-~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~g~~~~~~i~~~~ 168 (347) ++++...+. +....|.+ +-.+ +.+|+.+.+.++.+++++++.|+.++. +... +.+.+....... T Consensus 78 ~v~l~~~k~-~~~~~is~-ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l~----G~g~---------~~~~gil~~~~~ 142 (299) T protein:vir:41 78 KAKMRSKKM-GVIIPTTK-ENLNYSVTNFFSLMQAEIVEAFYKKFDQAVFT----GVES---------PYNWNILKSATD 142 (299) T ss_pred EEEEeeEEE-EEeehhhH-HHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhh----cccC---------cccccccccccc Confidence 877777664 44455644 3333 558899999999999999999998873 1110 001111100000 Q ss_pred cccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhccccccccccceEEEeceeEEEe Q lcl|Aclame:pro 169 AADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALIDPETGNIRNVMGFEVIEV 248 (347) Q Consensus 169 ~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~v~~i~G~~V~~s 248 (347) +. .........++.|.++..+|...+.+ .-.++++|..|..|.+-. -.+..|....... +..+++.|++|+.+ T Consensus 143 ~~---~~~~~~~~~~~~l~~~~~~l~~~~~~--~~~~v~n~~~~~~L~~lk-d~~G~~l~~~~~~-~~~~~l~G~PV~~~ 215 (299) T protein:vir:41 143 AS---NLVEETANKYDDLNEAIGLIEAEDLE--PNGIATIRKQRVKYRSTK-DGNGMPIFNTATS-NGVDDVLGLPIAYT 215 (299) T ss_pred cc---eeeccccccHHHHHHHHHhhhcccCC--cCEEEEcHHHHHHHHHhh-ccCCceeecCCcC-CCCceecceeeEEe Confidence 00 00011112367788888888888764 345799999999998633 2233443333333 33468999999999 Q ss_pred ccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeeccccchh------------ Q lcl|Aclame:pro 249 PHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARRPE------------ 316 (347) Q Consensus 249 n~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~~~~------------ 316 (347) +++|... +...-|.+||++.. + +..+++++|..++.- T Consensus 216 ~~~~~~~---------------------~~~~~~~gdfs~~~--i--------~~~~~~~i~~~~~~~~~~~~~~~~~~~ 264 (299) T protein:vir:41 216 PKYTFGD---------------------KDISELVGDWNQAY--Y--------GILRGVEYEILTEATLTTVADETGKPL 264 (299) T ss_pred cccCCCC---------------------CceEEEEEecccEE--E--------EEecCcEEEEeecccccccccccccch Confidence 9998421 11123455665532 1 223334444443221 Q ss_pred --hHhhH--HhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 317 --FQADQ--IIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 317 --~~~d~--i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) ++.+. ++....+|.++++|++.+.|...++- T Consensus 265 ~~~~~~~~~~r~~~~~d~~v~~~~A~~~l~~~aa~ 299 (299) T protein:vir:41 265 NLAERDMAAIKATFEVGFMVVKDEAFSAVQPKAGN 299 (299) T ss_pred hhhhcCcEEEEEEEEeccEEecccceEEEEeccCC Confidence 23333 46667889999999999888554444 No 63 >protein:vir:78223 Length: 333 # NCBI annotation: Putative major head protein # Family: family:all:966 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491666;genbank:gi:157786490;genbank:GeneID:5625701 Probab=99.56 E-value=1.6e-16 Score=107.22 Aligned_cols=308 Identities=14% Similarity=0.080 Sum_probs=169.3 Q ss_pred CCCCccCc--cccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEeccc-cceeeeecCCC Q lcl|Aclame:pro 1 MANATGGQ--QIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVMG-RTKGYYLAPGE 77 (347) Q Consensus 1 m~~~~~~~--~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~iG-~~t~~~~~~g~ 77 (347) ||-.|-.. ..++.+ ++.......++..+++..++.+..++.+.++.+.++.++.+| ..++|+.. .+++.....|. T Consensus 1 ~a~l~el~~~~~~~~~-~g~~~~~~~~liP~~~~~~ii~~l~~~s~l~~~~~~~~~~~~-~~~~p~~~~~~~a~~v~eg~ 78 (333) T protein:vir:78 1 MATLNELLPNSAGSNH-QGRLAHVPSDLLPKEIVGPIFDKAQESSLVLRMGEQIPISYG-ETIIPTTVKRPEVGQVGVGT 78 (333) T ss_pred CchhHHhhhhcccccc-cCceecCCccccchhHHHHHHHHHHhhchhhhhcceeeccCC-ceEEEEEeCCceeEeecCcc Confidence 55555321 112222 222222233377899999999999999999999888777654 45676654 33443333232 Q ss_pred CCCC------CCCCCCCCceEEEEeeeeecc-hhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccc Q lcl|Aclame:pro 78 NLDD------KRKDIKHSEKVIQIDGLLTSD-VLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAA 150 (347) Q Consensus 78 ~~~~------~~~~~~~~~~~l~ID~~~~~~-~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~ 150 (347) .... +...+...++++. ..++.. ..|.+-=..++.+|+.+.+.+++++++++..|+.++. +-....+ T Consensus 79 ~~~~~e~~~~~~~~~~f~~i~l~--~~kl~~~~~is~ell~~s~~~~~~~i~~~la~ai~~~~d~~~l~----G~g~~~~ 152 (333) T protein:vir:78 79 SNEQREGGLKPLSGTAWDTRSVS--PIKLATIVTVSEEFARMNPSGLYTKLQGDLAYAIGRGIDLAVFH----GKSPLTG 152 (333) T ss_pred cccccccccccccccceeEEEEe--eEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhc----ccCCCCC Confidence 2110 1123444544444 444343 3343322224677899999999999999999998874 1111000 Q ss_pred cccccCcccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhh--hhccc Q lcl|Aclame:pro 151 SNENIAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNA--ANYAA 228 (347) Q Consensus 151 ~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~--~~~~~ 228 (347) ..+.|......+. ..+........+...++.|+++...+..+. ......+++.|..|..|++...+.+ ..|.- T Consensus 153 --~~~~g~~~~~~~~--~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~-~~~~~~~vmn~~~~~~L~~~~~~~d~~G~~i~ 227 (333) T protein:vir:78 153 --SALQGIDTDNVIA--NTTNVDYLQETGDPLLDRLLDGYDLVSANT-DVEFNGWAVDPRFRAHLLRAQAYRDANGNVDP 227 (333) T ss_pred --ccccccccccccc--ccccccccccccchhHHHHHHHHHhhcccc-ccCceEEEEcchHHHHHHHHhhhcCCCCceee Confidence 0011111111110 000111111222334677888877766543 2233467889999999987655543 34444 Q ss_pred cccccccceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhhee Q lcl|Aclame:pro 229 LIDPETGNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMA 308 (347) Q Consensus 229 ~~~~~~G~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~ 308 (347) ......|..++++|++|+.|+++|...... . .+...-|.+||++.. + +..++++ T Consensus 228 ~~~~~~~~~~~l~G~Pv~~~~~i~~~~~~~---------~-------~~~~~~~~gD~~~~~--~--------g~~~~~~ 281 (333) T protein:vir:78 228 SRINLAAQTGDVLGLPAQFGRAVGGDLGAA---------V-------DSKTRIIGGDFSQLK--F--------GFADEIR 281 (333) T ss_pred cCccccCCCceeeceeeEEccccCCCcccc---------C-------CCccEEEEEecccEE--E--------EEeeccE Confidence 445566777899999999999998532110 0 011122445665532 1 2233344 Q ss_pred eccccch-----------hhHhh--HHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 309 LERARRP-----------EFQAD--QIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 309 ~e~~~~~-----------~~~~d--~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) ++...+. .++.| .+++.+.++.++++|++.+.|+.+.|- T Consensus 282 i~~~~~~~~~~~~~~~~~~~~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~a~ 333 (333) T protein:vir:78 282 IKMSDTATLTDSGSATVSMWQTNQIAILIEVTFGWLLGDKQAFVKFVDDEQP 333 (333) T ss_pred EEEeccccccccccceeehhhcCcEEEEEEEEEccEEecccceEEEeccCCC Confidence 4433221 11222 357778899999999999999755444 No 64 >protein:vir:6242 Length: 390 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813696;swissprot:trembl:q859c1;genbank:gi:29366756;interpro:IPR006444;uniprot:Q859C1;genbank:GeneID:1258897 Probab=99.54 E-value=8.9e-17 Score=108.57 Aligned_cols=290 Identities=10% Similarity=0.060 Sum_probs=167.4 Q ss_pred CCCCccCcc----ccccCcccCccccHHHHHH-HHHhHHHHHHHHHHHhhhcccccccccCCceEEEecc-ccceeeeec Q lcl|Aclame:pro 1 MANATGGQQ----IGANQGKGQSAADKLALFL-KVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVM-GRTKGYYLA 74 (347) Q Consensus 1 m~~~~~~~~----~~~~~~~~~~~~d~~al~i-e~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~i-G~~t~~~~~ 74 (347) |-....... .......+..+++.. +.+ +++...+....+..++++.+.++....++..+.||+. |...+.... T Consensus 93 ~r~~~~~~~r~~~~~~~~~~~t~~~~g~-~~~~~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~p~~~~~~~a~wv~ 171 (390) T protein:vir:62 93 LRAGNLGEARSFEFAPEKRDGTKAGNPN-VLSRTLYGQLIAQAVERSAIMRGGATTFTTSDANPLDFTVITGRSSASIVG 171 (390) T ss_pred HhhhhhhhhHHHHhhhhhhcccccCCCc-cccccchHHHHHHHHhhhhhhhhcceeeecCCCceeEEEEEcCCcceeeec Confidence 000000000 000000011111211 344 4455555556666678888877776667777889866 445666666 Q ss_pred CCCCCCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccc Q lcl|Aclame:pro 75 PGENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNEN 154 (347) Q Consensus 75 ~g~~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~ 154 (347) .|..++.+ +++..++++.+-++ +.-..|.+-=-.++.+|+.+.+.++.+++|++..|+.++. +. T Consensus 172 E~~~~~~~--~~~f~~i~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~l~----G~--------- 235 (390) T protein:vir:62 172 ETAEIPES--YPATAQRSMGGFKY-GFASVVSYEFATDQVLDLVGFLVSDAGPAIGDAMGRHFIT----GT--------- 235 (390) T ss_pred cccccccc--ccceeeeEeeeeeE-EeehHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHhhhhc----cC--------- Confidence 67776543 46778888877765 3444554433335677999999999999999999998763 11 Q ss_pred cCcccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhccccccccc Q lcl|Aclame:pro 155 IAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALIDPET 234 (347) Q Consensus 155 ~~g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~ 234 (347) |.+.|..-..+...............++.|+++...|+..... +-.+|++|..|..|.+-. -.+..|.-..++.. T Consensus 236 --G~p~Gi~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~l~~~~~~--~a~~vmn~~~~~~L~~lk-d~~g~~l~~~~~~~ 310 (390) T protein:vir:62 236 --GQPRGILTDASPATATFLATDTDSKVSDALIDLFHEVPSAYRA--NAKYVVNDLRAAQMRKLK-DANGQYLWQSGLTV 310 (390) T ss_pred --CccccccccccccccceecccccccchHHHHHHHHhhhhhhhc--CCEEEEchHHHHHHHHhh-ccCCCeeecCCcCC Confidence 1111110000000000000000111256777777778766542 345688999999985421 12344544445666 Q ss_pred cceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeeccccc Q lcl|Aclame:pro 235 GNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARR 314 (347) Q Consensus 235 G~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~~ 314 (347) |....+.|++|+.++++|... -+.+||+.. +++ ..++++++...+ T Consensus 311 g~~~~l~G~Pv~~~~~~p~~~-------------------------i~~gd~s~~--~i~--------~~~~~~v~~~~~ 355 (390) T protein:vir:62 311 GAPSLFNGKVVETDDGMPADK-------------------------ILFADLSKY--RVR--------FAGSLRVDRSVD 355 (390) T ss_pred CccceecccceEEecCCCCcc-------------------------EEEeeccce--eEE--------eecceEEEeecc Confidence 777789999999999998421 123455542 222 233445554444 Q ss_pred hhhHh--hHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 315 PEFQA--DQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 315 ~~~~~--d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) +...- ..+++.+.+|+++++|+++..|..+++| T Consensus 356 ~~~~~~~~~~~~~~r~d~~~~~~~A~~~l~~~~~a 390 (390) T protein:vir:62 356 AKFSTDQIVYRFLQRADGLLVDARGAKVLTVTPGA 390 (390) T ss_pred ccccCCcEEEEEEEEeCcEeechhheEEEEeecCC Confidence 43333 3458889999999999999999999999 No 65 >protein:vir:1328 Length: 392 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047927;swissprot:trembl:q9zwv6;genbank:gi:9631145;uniprot:Q9ZWV6;genbank:GeneID:2715889 Probab=99.53 E-value=1.5e-16 Score=107.28 Aligned_cols=293 Identities=11% Similarity=0.036 Sum_probs=167.5 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEecccc-ceeeeecCCCCC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVMGR-TKGYYLAPGENL 79 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~iG~-~t~~~~~~g~~~ 79 (347) |..-.-.-....+...+..+++.--+--+++...+.....+.++++...++....++..+.+|+... .++..+..|..+ T Consensus 97 ~~~~~~~~~~~~~~~~~t~~~~g~~~~~~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~ 176 (392) T protein:vir:13 97 NLGEARSFEFAPEKRDGTKAGNPNVLSRTLYGQLIAQAVERSAIMRGGASTFTTSDANPMDFTVITGRATAGIVGETAEI 176 (392) T ss_pred chhhhHHHHhhhhhhcccccCCCccccccchHHHHHHHHhhhhhhhhcceeeecCCCceeEEEEEcCCcceeeecccccc Confidence 0000000000000000111122111223567777788888888999888877777777788875544 556556667666 Q ss_pred CCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccCccc Q lcl|Aclame:pro 80 DDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIAGLG 159 (347) Q Consensus 80 ~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~g~~ 159 (347) +.+ +++.+++++..-++ ..-..|.+-=..++.+|+.+.+.++.++++++..|+.++. +.. .+.+ T Consensus 177 ~~~--~~~f~~v~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~l~----G~G---------t~~p 240 (392) T protein:vir:13 177 PES--YPATTQRSMGGFKY-GFASVVSYEFATDQVLDLVGFLVSDAGPAIGDAMGRHFLT----GTG---------TGQP 240 (392) T ss_pred ccc--ccceeeEEeeeeeE-EeeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhc----ccC---------Cccc Confidence 543 46777777776654 3444454433334677899999999999999999998873 110 0111 Q ss_pred CceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhccccccccccceEE Q lcl|Aclame:pro 160 QAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALIDPETGNIRN 239 (347) Q Consensus 160 ~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~v~~ 239 (347) .|..-..+...............++.|+++...|..... .+-.+|++|..|..|.+-. -.+..|.-..+...|.... T Consensus 241 ~Gil~~~~~~~~~~~~~~~~~~~~d~l~~~~~~l~~~~~--~~a~~v~n~~~~~~l~~lk-d~~G~~l~~~~~~~g~~~~ 317 (392) T protein:vir:13 241 RGILTDATGANAAFGEADADSKVSDALIDLFHEVPSAYR--KNAKFVVNDLRAAQMRKLK-DANGQYLWQSALTVGAPDT 317 (392) T ss_pred cccccccccccccccccccccccHHHHHHHHHhhhhhhh--cCCEEEEcHHHHHHHHHhh-ccCCceeecCCcCCCCCce Confidence 111100000000000111111236777777777765542 2234588999999886522 2233443334455676678 Q ss_pred EeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeeccccchhhHh Q lcl|Aclame:pro 240 VMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARRPEFQA 319 (347) Q Consensus 240 i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~~~~~~~ 319 (347) ++|.+|+.++++|... =+.+||+.. +++ ...+++++...++...- T Consensus 318 l~G~Pv~~~~~~~~~~-------------------------i~~Gdf~~~--~i~--------~~~~~~i~~~~~~~~~~ 362 (392) T protein:vir:13 318 FNGKVVETDDGMPADK-------------------------VLFADLSKY--RVR--------FAGSLRVDRSVDAKFST 362 (392) T ss_pred ecceeeEEcCCCCCCc-------------------------EEEeeccce--eEE--------eecceEEEeeccccccC Confidence 9999999999998421 123455542 222 23344555544443322 Q ss_pred --hHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 320 --DQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 320 --d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) ..+++.+.+|+++.+|++++.+..+++| T Consensus 363 ~~~~~r~~~r~d~~~~~~~A~~~~~~~~aa 392 (392) T protein:vir:13 363 DQIVYRFLQRADGLLVDARGAKVLTVTPAA 392 (392) T ss_pred CcEEEEEEEEeccEEecccceEEEEeeccC Confidence 3568889999999999999999999999 No 66 >protein:vir:4700 Length: 415 # NCBI annotation: phi PVL ORF 7 homologue # Family: family:all:21 # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061632;genbank:gi:9635719;genbank:GeneID:1262976 Probab=99.48 E-value=2.8e-15 Score=100.39 Aligned_cols=291 Identities=10% Similarity=-0.001 Sum_probs=166.4 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCc-eEEEe-ccccceeeeecCCCC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGK-SASFP-VMGRTKGYYLAPGEN 78 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~-tv~i~-~iG~~t~~~~~~g~~ 78 (347) +...+. .+.+.. ..++.-.+.-+.|.+++.+..+..+.+++++++.++.++. ++.++ ..+......+..|.. T Consensus 113 ~~~~~~-----~~~~~~-~t~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~Eg~~ 186 (415) T protein:vir:47 113 LETRND-----IQGGSL-KTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEE 186 (415) T ss_pred Hhhhhh-----hhhccc-cccCCcccccHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEEecCCcceeecccccc Confidence 000000 000000 0111112455899999999999999999998887776543 22232 234444555555655 Q ss_pred CCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccCcc Q lcl|Aclame:pro 79 LDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIAGL 158 (347) Q Consensus 79 ~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~g~ 158 (347) ++.. ..++.+++++..-+. +.-+.|.+-=..++.+|+.+.+.++.+++|++..|+.|+.-...+ ...+. T Consensus 187 ~~~~-~~~~~~~v~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~il~g~g~g---------~~~~~ 255 (415) T protein:vir:47 187 NPEL-AVKPFFQLAYDINTH-RGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKG---------STGST 255 (415) T ss_pred cccc-cccceeeEEeeeeee-EeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccC---------Ccccc Confidence 5422 124556656655554 233445433233456889999999999999999999887422110 00110 Q ss_pred cCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhccccccccccceE Q lcl|Aclame:pro 159 GQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALIDPETGNIR 238 (347) Q Consensus 159 ~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~v~ 238 (347) ........... ..+ ....++.|+++...+...... .-.+|++|..|..|.+- +-.+..|....++.+|..+ T Consensus 256 ~~~~~~~~~~~--~~~----~~~~~~~i~~~~~~~~~~~~~--~~~~v~n~~~~~~L~~l-kd~~G~~i~~~~~~~~~~~ 326 (415) T protein:vir:47 256 SSGFEKEGKKL--EVK----KAKSLDDIKDAINLNVKPNYE--HNVAIVSQTMFAKLDKM-KDKLGNYLIQPDVKEKTQQ 326 (415) T ss_pred cccccccccee--ccc----cccchHHHHHHHHhhhhhccC--CCEEEEcHHHHHHHHHh-hccCCCeeeccCcCCCCCc Confidence 01100000000 000 111256677777777766653 33568999999998652 2234455544456677778 Q ss_pred EEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeeccccchhhH Q lcl|Aclame:pro 239 NVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARRPEFQ 318 (347) Q Consensus 239 ~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~~~~~~ 318 (347) +++|++|+.++++|....+.. .-+.+||++.+.+ +..++++++.... ... T Consensus 327 ~l~G~pV~~~~~~~~~~~~~~--------------------~~~~gd~~~~~~~---------~~~~~~~v~~~~~-~~~ 376 (415) T protein:vir:47 327 RLLGAKIEILPDEVLGQKGNN--------------------TLIIGNLKDAIVL---------FDRSQYQASWTDY-MHF 376 (415) T ss_pred cccceeeEEeccccccCCCcc--------------------EEEEEehhccEEE---------EeecceEEEeecc-ccC Confidence 999999999999985322111 1134455543221 2334445554332 222 Q ss_pred hhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 319 ADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 319 ~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) ...+++.+.++.++++|++++.+..+++| T Consensus 377 ~~~~~~~~r~d~~v~~~~a~~~~~~~~~~ 405 (415) T protein:vir:47 377 GECLMIAVRQDCRILDYKSAIVIEYDDSE 405 (415) T ss_pred ceEEEEEEEeccEEeccccEEEEEeeccC Confidence 34578888999999999999999999888 No 67 >protein:vir:4600 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058445;genbank:gi:9635171;genbank:GeneID:1262708 Probab=99.48 E-value=2.8e-15 Score=100.39 Aligned_cols=291 Identities=10% Similarity=-0.001 Sum_probs=166.4 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCc-eEEEe-ccccceeeeecCCCC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGK-SASFP-VMGRTKGYYLAPGEN 78 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~-tv~i~-~iG~~t~~~~~~g~~ 78 (347) +...+. .+.+.. ..++.-.+.-+.|.+++.+..+..+.+++++++.++.++. ++.++ ..+......+..|.. T Consensus 113 ~~~~~~-----~~~~~~-~t~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~Eg~~ 186 (415) T protein:vir:46 113 LETRND-----IQGGSL-KTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEE 186 (415) T ss_pred Hhhhhh-----hhhccc-cccCCcccccHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEEecCCcceeecccccc Confidence 000000 000000 0111112455899999999999999999998887776543 22232 234444555555655 Q ss_pred CCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccCcc Q lcl|Aclame:pro 79 LDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIAGL 158 (347) Q Consensus 79 ~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~g~ 158 (347) ++.. ..++.+++++..-+. +.-+.|.+-=..++.+|+.+.+.++.+++|++..|+.|+.-...+ ...+. T Consensus 187 ~~~~-~~~~~~~v~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~il~g~g~g---------~~~~~ 255 (415) T protein:vir:46 187 NPEL-AVKPFFQLAYDINTH-RGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKG---------STGST 255 (415) T ss_pred cccc-cccceeeEEeeeeee-EeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccC---------Ccccc Confidence 5422 124556656655554 233445433233456889999999999999999999887422110 00110 Q ss_pred cCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhccccccccccceE Q lcl|Aclame:pro 159 GQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALIDPETGNIR 238 (347) Q Consensus 159 ~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~v~ 238 (347) ........... ..+ ....++.|+++...+...... .-.+|++|..|..|.+- +-.+..|....++.+|..+ T Consensus 256 ~~~~~~~~~~~--~~~----~~~~~~~i~~~~~~~~~~~~~--~~~~v~n~~~~~~L~~l-kd~~G~~i~~~~~~~~~~~ 326 (415) T protein:vir:46 256 SSGFEKEGKKL--EVK----KAKSLDDIKDAINLNVKPNYE--HNVAIVSQTMFAKLDKM-KDKLGNYLIQPDVKEKTQQ 326 (415) T ss_pred cccccccccee--ccc----cccchHHHHHHHHhhhhhccC--CCEEEEcHHHHHHHHHh-hccCCCeeeccCcCCCCCc Confidence 01100000000 000 111256677777777766653 33568999999998652 2234455544456677778 Q ss_pred EEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeeccccchhhH Q lcl|Aclame:pro 239 NVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARRPEFQ 318 (347) Q Consensus 239 ~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~~~~~~ 318 (347) +++|++|+.++++|....+.. .-+.+||++.+.+ +..++++++.... ... T Consensus 327 ~l~G~pV~~~~~~~~~~~~~~--------------------~~~~gd~~~~~~~---------~~~~~~~v~~~~~-~~~ 376 (415) T protein:vir:46 327 RLLGAKIEILPDEVLGQKGNN--------------------TLIIGNLKDAIVL---------FDRSQYQASWTDY-MHF 376 (415) T ss_pred cccceeeEEeccccccCCCcc--------------------EEEEEehhccEEE---------EeecceEEEeecc-ccC Confidence 999999999999985322111 1134455543221 2334445554332 222 Q ss_pred hhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 319 ADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 319 ~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) ...+++.+.++.++++|++++.+..+++| T Consensus 377 ~~~~~~~~r~d~~v~~~~a~~~~~~~~~~ 405 (415) T protein:vir:46 377 GECLMIAVRQDCRILDYKSAIVIEYDDSE 405 (415) T ss_pred ceEEEEEEEeccEEeccccEEEEEeeccC Confidence 34578888999999999999999999888 No 68 >protein:vir:4511 Length: 409 # NCBI annotation: capsid # Family: family:all:21 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599037;genbank:gi:19548995;genbank:GeneID:935211 Probab=99.47 E-value=2.8e-15 Score=100.40 Aligned_cols=298 Identities=11% Similarity=0.046 Sum_probs=162.9 Q ss_pred CCCCcc---Cccc-cccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEeccccce--eeeec Q lcl|Aclame:pro 1 MANATG---GQQI-GANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVMGRTK--GYYLA 74 (347) Q Consensus 1 m~~~~~---~~~~-~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~iG~~t--~~~~~ 74 (347) |...-. .... ..|.-.....++--.+-.++|.+++.+..+..+.++++.++.++.++..+.++..+... ..... T Consensus 99 ~~~~~~~~e~~~~~~~~a~~~~~~~~gg~liP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~ 178 (409) T protein:vir:45 99 GASELTSEERKALRELRAQGVAQDEKGGYTVPETFLAKVVEKMKSYGGIASVAQILTTSDGRTMEWATADGTSEVGVLLG 178 (409) T ss_pred hhhhccHHHHHHHHHHhhccCccCcCCceeccHhHHHHHHHHHHhhhhhhhhceeeecCCCceEEEEeeccCcccccccc Confidence 100000 0000 00000000001111244589999999999999999999888888888888888776432 22333 Q ss_pred CCCCCCCCCCCCCCCceEEEEeeeeec-c-hhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccc Q lcl|Aclame:pro 75 PGENLDDKRKDIKHSEKVIQIDGLLTS-D-VLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASN 152 (347) Q Consensus 75 ~g~~~~~~~~~~~~~~~~l~ID~~~~~-~-~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~ 152 (347) -|...+. .+++...+++ ...++. . ..|.+-=...+.+|+.+.+.++.++++++..|+.|+. +..... . T Consensus 179 E~~~~~~--~~~~f~~~~l--~~~k~~~~~i~is~ell~ds~~~l~~~i~~~la~a~~~~~~~a~l~----G~G~~~--~ 248 (409) T protein:vir:45 179 ENEEAGE--EDTDFGMGSL--GALKMTSKIIRVSNELLQDSAIDMEAYLARRIAERIGRGEARYLIQ----GTGAGT--P 248 (409) T ss_pred ccccccc--cccccceeee--eeeeeeeeehhhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHhhc----cCCCCC--c Confidence 3444433 3345555444 444433 2 2354433333568999999999999999999998763 111000 0 Q ss_pred cccCcccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCE-EEEChHHHHHHhcchhhhhhhcccccc Q lcl|Aclame:pro 153 ENIAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRR-FYCAPEDYSAILSALMPNAANYAALID 231 (347) Q Consensus 153 ~~~~g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~-~vv~P~~~~~Ll~~~~~~~~~~~~~~~ 231 (347) ..+.| ..-..+.. .........-++.|+++...|..... ....| ++++|..|..|.+-. -.+..|.-..+ T Consensus 249 ~~p~G----il~~~~~~---~~~~~~~~~~~d~i~~l~~~l~~~~~-~~a~~~~~~n~~~~~~l~~lk-d~~G~~i~~~~ 319 (409) T protein:vir:45 249 KQPKG----LAASVTGT---TQTAAANAVKWQEILALKHSIDPAYR-RGPKFRLAFNDNTLKLISEME-DGQGRPLWLPD 319 (409) T ss_pred cccce----eeeccccc---cccccccccchHHHHHHHHhhhhhhc-cCCeEEEEECHHHHHHHHHhh-cCCCceeeccC Confidence 00111 11000000 00000011125677888777776653 33456 467999988875421 22344544445 Q ss_pred ccccceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeecc Q lcl|Aclame:pro 232 PETGNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALER 311 (347) Q Consensus 232 ~~~G~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~ 311 (347) ...|...+++|.+|+.++++|....+.. .=+.+||++. ++ .....+.++. T Consensus 320 ~~~~~~~~l~G~PV~~~~~~p~~~~~~~--------------------~i~~Gd~~~~--~i--------~~~~~~~~~~ 369 (409) T protein:vir:45 320 IVGVAPASVLNVPYVIDQEIDDIGAGKK--------------------FMFCGDFDRF--II--------RRVRYMILKR 369 (409) T ss_pred cCCCCCceecceeeEEecCcCCccCCcc--------------------EEEEeehhhh--he--------eeccceEEEE Confidence 6667778899999999999985321111 0122445442 12 2223334444 Q ss_pred ccchhhHhh--HHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 312 ARRPEFQAD--QIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 312 ~~~~~~~~d--~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) ..|+-..-+ .+++.+.+|.++.+|++++.|...+++ T Consensus 370 ~~d~~~~~~~~~~~~~~r~d~~~~~~~A~~~l~~k~s~ 407 (409) T protein:vir:45 370 LVERYAEYDQTGFLAFHRFDCILEDTSAIKALVGKGSV 407 (409) T ss_pred eecccccCCcEEEEEEEEeccEeechhheEEEEeccCC Confidence 444322223 378889999999999999999887777 No 69 >protein:vir:8102 Length: 543 # NCBI annotation: gp6 # Family: family:all:21 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817683;genbank:gi:29566114;genbank:GeneID:1259308 Probab=99.47 E-value=1.9e-15 Score=101.26 Aligned_cols=288 Identities=14% Similarity=0.019 Sum_probs=161.0 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHH-HHHHHHHhhhcccccccccCCceEEEe-ccccceeeeecCCCC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVL-TAFVRRSVTMDKHMVRTIQNGKSASFP-VMGRTKGYYLAPGEN 78 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~-~~f~~~s~~~~~~~~rti~~G~tv~i~-~iG~~t~~~~~~g~~ 78 (347) +.+... .+...++.-.|..+.|..++. +.+...+.+..+.++... .|+ +.+| ..+...+..+.-|.. T Consensus 245 ~~~~~~---------~~~t~~~gg~lip~~~~~~ii~~~~~~~~~l~~~~~~~~~-~g~-~~~~~~~~~~~a~~v~Eg~~ 313 (543) T protein:vir:81 245 INEVRA---------MGLTKADGGYLVPFQLDPTVIITSNGSLNDIRRFARQVVA-TGD-VWHGVSSAAVQWSWDAEFEE 313 (543) T ss_pred hhhhhh---------cccccccCcccCchhhhhHHHHHHHhhhchhhhhcccccC-Ccc-eEEEEecCCcceeecccCcc Confidence 111100 000011111144577776664 666667888887776444 454 4454 445556666666766 Q ss_pred CCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccCcc Q lcl|Aclame:pro 79 LDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIAGL 158 (347) Q Consensus 79 ~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~g~ 158 (347) ++. ..++..++++...+.- .-+.|.+ +-.+.+.|+.+.+.++.++++++..|+.||. +.... +.+.|. T Consensus 314 ~~~--~~~~~~~i~~~~~k~~-~~~~is~-ell~d~~~~~~~i~~~l~~~~~~~~d~ail~----G~Gt~----~~p~Gi 381 (543) T protein:vir:81 314 VSD--DSPEFGQPEIPVKKAQ-GFVPISI-EALQDEANVTETVALLFAEGKDELEAVTLTT----GTGQG----NQPTGI 381 (543) T ss_pred ccc--cccccceeeeeeeeeE-eeehhhH-HHHhccHHHHHHHHHHHHHHHHHHHHHHHhc----cCCCC----cccccc Confidence 654 3467777777776653 3345544 4455668999999999999999999998863 11100 011111 Q ss_pred c---CceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhcccccccccc Q lcl|Aclame:pro 159 G---QAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALIDPETG 235 (347) Q Consensus 159 ~---~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G 235 (347) - .+......++ ......++.++++...|...+-+ .-.+|++|..|..|.+-. -.++.|.-. .+..| T Consensus 382 ~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~l~~~~~~--~~~~v~n~~~~~~l~~lk-d~~G~~l~~-~~~~g 450 (543) T protein:vir:81 382 VTALAGTAAEIAPV-------TAETFALADVYAVYEQLAARHRR--QGAWLANNLIYNKIRQFD-TQGGAGLWT-TIGNG 450 (543) T ss_pred hhhccccccccccc-------ccccccHHHHHHHHHhhhccccC--CcEEEEcHHHHHHHHHhh-cCCCceecc-CcCCC Confidence 0 0001111111 11112367777777777766543 235789999999997532 223333322 34456 Q ss_pred ceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeeccc--- Q lcl|Aclame:pro 236 NIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERA--- 312 (347) Q Consensus 236 ~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~--- 312 (347) ..++++|.+|+.++++|....... . ++...=|.+||++.. + +...+++++.. T Consensus 451 ~~~~l~G~pv~~~~~~~~~~~~~~--------~-------~~~~~i~~gd~~~~~--i--------~~~~~~~i~~~~~~ 505 (543) T protein:vir:81 451 EPSQLLGRPVGEAEAMDANWNTSA--------S-------ADNFVLLYGNFQNYV--I--------ADRIGMTVEFIPHL 505 (543) T ss_pred CCccccceeeEEeccccccccccc--------c-------CCcceEEEeecccee--E--------EeecccEEEEeccc Confidence 667899999999999996532211 0 011112345665422 2 22223333321 Q ss_pred ---cchhhHhhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 313 ---RRPEFQADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 313 ---~~~~~~~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) ++..+..-.+++...+|.++++|++.+.+..+++| T Consensus 506 ~~~~~~~~~~~~~~~~~r~d~~v~~~~A~~~l~~~~~a 543 (543) T protein:vir:81 506 FGTNRRPNGSRGWFAYYRMGADVVNPNAFRLLNVETAS 543 (543) T ss_pred cccchhhcCceEEEEEEeeccEeecccceEEEEecccC Confidence 11112223456677789999999999999999999 No 70 >protein:vir:81100 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429874;genbank:gi:156603927;genbank:GeneID:5525320 Probab=99.46 E-value=2.5e-15 Score=100.65 Aligned_cols=295 Identities=9% Similarity=0.010 Sum_probs=169.6 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCc-eEEEe-ccccceeeeecCCCC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGK-SASFP-VMGRTKGYYLAPGEN 78 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~-tv~i~-~iG~~t~~~~~~g~~ 78 (347) +.+..- .....+.+.. ..++--.+.-+.|..++++..+..+.++++.++..+.++. ++.++ ..+...+.....|.. T Consensus 109 ~~~~~~-~~~~~~~~~~-~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~ 186 (415) T protein:vir:81 109 FTEYLE-TRNDIQGGSL-KTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEE 186 (415) T ss_pred HHHHHh-hhhhhhhccc-cccccccccchHHHHHHHHHHHhhhhhhhheeeeeccCCceeEEEEeecCCccceeeccccc Confidence 000000 0000000000 0001112455899999999999999999998887776432 33333 345555555555666 Q ss_pred CCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccCcc Q lcl|Aclame:pro 79 LDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIAGL 158 (347) Q Consensus 79 ~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~g~ 158 (347) ++.. ..++.+++++.+.+. +.-..|.+-=..++.+|+.+.+.++.++++++..|+.++.-... ..+ .+. T Consensus 187 ~~~~-~~~~~~~v~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~----g~~-----~~~ 255 (415) T protein:vir:81 187 NPEL-AVKPFFQLAYDINTH-RGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITK----GST-----GST 255 (415) T ss_pred cCcc-cccceeeEEeeeeee-EeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhcccc----Ccc-----ccc Confidence 5432 124556666666665 23344543323346788999999999999999999988743211 000 000 Q ss_pred cCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhccccccccccceE Q lcl|Aclame:pro 159 GQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALIDPETGNIR 238 (347) Q Consensus 159 ~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~v~ 238 (347) ...... .+.. ........++.|+++...+...+.. .-.+|++|..|..|.+- +-.+..|....++..|..+ T Consensus 256 ~~~~~~-~~~~-----~~~~~~~~~~~i~~~~~~~~~~~~~--~~~~v~n~~~~~~l~~l-kd~~G~~l~~~~~~~~~~~ 326 (415) T protein:vir:81 256 SSGFEK-EGKK-----LEVKKAKSLDDIKDAINLNVKPNYE--HNVAIVSQTMFAKLDKM-KDKLGNYLIQPDVKEKTQQ 326 (415) T ss_pred cccccc-cccc-----cccccccchhHHHHHHHhhhhhccC--CCEEEEcHHHHHHHHHh-hccCCceeeccCcCCCCCc Confidence 000000 0000 0011111267788888888777764 23468899999998753 2233455444456677778 Q ss_pred EEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeeccccchhhH Q lcl|Aclame:pro 239 NVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARRPEFQ 318 (347) Q Consensus 239 ~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~~~~~~ 318 (347) +++|++|+.++++|....+.. .-+.+||++.+- .+...+++++...+ ..+ T Consensus 327 ~l~G~pV~~~~~~~~~~~~~~--------------------~~~~Gd~~~~~~---------~~~~~~~~v~~~~~-~~~ 376 (415) T protein:vir:81 327 RLLGAKIEILPDEVLGQKGNN--------------------TLIIGNLKDAIV---------LFDRSQYQASWTDY-MHF 376 (415) T ss_pred eecceeeEEecccccCCCCcc--------------------EEEEEehhccEE---------EEeecceEEEEecc-ccC Confidence 999999999999985432211 113445554321 23344455554432 233 Q ss_pred hhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 319 ADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 319 ~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) ...+++.+.++.++++|++++.+..+++| T Consensus 377 ~~~~~~~~r~d~~v~~~~a~~~~~~~~~~ 405 (415) T protein:vir:81 377 GECLMIAVRQDCRILDYKSAIVIEYDDSE 405 (415) T ss_pred ceEEEEEEEeccEEeccccEEEEEEeccC Confidence 44678889999999999999999999999 No 71 >protein:vir:79987 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430002;genbank:gi:156604057;genbank:GeneID:5525447 Probab=99.46 E-value=2.5e-15 Score=100.65 Aligned_cols=295 Identities=9% Similarity=0.010 Sum_probs=169.6 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCc-eEEEe-ccccceeeeecCCCC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGK-SASFP-VMGRTKGYYLAPGEN 78 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~-tv~i~-~iG~~t~~~~~~g~~ 78 (347) +.+..- .....+.+.. ..++--.+.-+.|..++++..+..+.++++.++..+.++. ++.++ ..+...+.....|.. T Consensus 109 ~~~~~~-~~~~~~~~~~-~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~ 186 (415) T protein:vir:79 109 FTEYLE-TRNDIQGGSL-KTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEE 186 (415) T ss_pred HHHHHh-hhhhhhhccc-cccccccccchHHHHHHHHHHHhhhhhhhheeeeeccCCceeEEEEeecCCccceeeccccc Confidence 000000 0000000000 0001112455899999999999999999998887776432 33333 345555555555666 Q ss_pred CCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccCcc Q lcl|Aclame:pro 79 LDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIAGL 158 (347) Q Consensus 79 ~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~g~ 158 (347) ++.. ..++.+++++.+.+. +.-..|.+-=..++.+|+.+.+.++.++++++..|+.++.-... ..+ .+. T Consensus 187 ~~~~-~~~~~~~v~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~----g~~-----~~~ 255 (415) T protein:vir:79 187 NPEL-AVKPFFQLAYDINTH-RGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITK----GST-----GST 255 (415) T ss_pred cCcc-cccceeeEEeeeeee-EeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhcccc----Ccc-----ccc Confidence 5432 124556666666665 23344543323346788999999999999999999988743211 000 000 Q ss_pred cCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhccccccccccceE Q lcl|Aclame:pro 159 GQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALIDPETGNIR 238 (347) Q Consensus 159 ~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~v~ 238 (347) ...... .+.. ........++.|+++...+...+.. .-.+|++|..|..|.+- +-.+..|....++..|..+ T Consensus 256 ~~~~~~-~~~~-----~~~~~~~~~~~i~~~~~~~~~~~~~--~~~~v~n~~~~~~l~~l-kd~~G~~l~~~~~~~~~~~ 326 (415) T protein:vir:79 256 SSGFEK-EGKK-----LEVKKAKSLDDIKDAINLNVKPNYE--HNVAIVSQTMFAKLDKM-KDKLGNYLIQPDVKEKTQQ 326 (415) T ss_pred cccccc-cccc-----cccccccchhHHHHHHHhhhhhccC--CCEEEEcHHHHHHHHHh-hccCCceeeccCcCCCCCc Confidence 000000 0000 0011111267788888888777764 23468899999998753 2233455444456677778 Q ss_pred EEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeeccccchhhH Q lcl|Aclame:pro 239 NVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARRPEFQ 318 (347) Q Consensus 239 ~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~~~~~~ 318 (347) +++|++|+.++++|....+.. .-+.+||++.+- .+...+++++...+ ..+ T Consensus 327 ~l~G~pV~~~~~~~~~~~~~~--------------------~~~~Gd~~~~~~---------~~~~~~~~v~~~~~-~~~ 376 (415) T protein:vir:79 327 RLLGAKIEILPDEVLGQKGNN--------------------TLIIGNLKDAIV---------LFDRSQYQASWTDY-MHF 376 (415) T ss_pred eecceeeEEecccccCCCCcc--------------------EEEEEehhccEE---------EEeecceEEEEecc-ccC Confidence 999999999999985432211 113445554321 23344455554432 233 Q ss_pred hhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 319 ADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 319 ~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) ...+++.+.++.++++|++++.+..+++| T Consensus 377 ~~~~~~~~r~d~~v~~~~a~~~~~~~~~~ 405 (415) T protein:vir:79 377 GECLMIAVRQDCRILDYKSAIVIEYDDSE 405 (415) T ss_pred ceEEEEEEEeccEEeccccEEEEEEeccC Confidence 44678889999999999999999999999 No 72 >protein:vir:98339 Length: 415 # NCBI annotation: putative capsid protein # Family: family:all:21 # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918931;genbank:gi:119443693;genbank:GeneID:4594501 Probab=99.46 E-value=2.5e-15 Score=100.65 Aligned_cols=295 Identities=9% Similarity=0.010 Sum_probs=169.6 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCc-eEEEe-ccccceeeeecCCCC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGK-SASFP-VMGRTKGYYLAPGEN 78 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~-tv~i~-~iG~~t~~~~~~g~~ 78 (347) +.+..- .....+.+.. ..++--.+.-+.|..++++..+..+.++++.++..+.++. ++.++ ..+...+.....|.. T Consensus 109 ~~~~~~-~~~~~~~~~~-~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~ 186 (415) T protein:vir:98 109 FTEYLE-TRNDIQGGSL-KTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEE 186 (415) T ss_pred HHHHHh-hhhhhhhccc-cccccccccchHHHHHHHHHHHhhhhhhhheeeeeccCCceeEEEEeecCCccceeeccccc Confidence 000000 0000000000 0001112455899999999999999999998887776432 33333 345555555555666 Q ss_pred CCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccCcc Q lcl|Aclame:pro 79 LDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIAGL 158 (347) Q Consensus 79 ~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~g~ 158 (347) ++.. ..++.+++++.+.+. +.-..|.+-=..++.+|+.+.+.++.++++++..|+.++.-... ..+ .+. T Consensus 187 ~~~~-~~~~~~~v~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~----g~~-----~~~ 255 (415) T protein:vir:98 187 NPEL-AVKPFFQLAYDINTH-RGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITK----GST-----GST 255 (415) T ss_pred cCcc-cccceeeEEeeeeee-EeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhcccc----Ccc-----ccc Confidence 5432 124556666666665 23344543323346788999999999999999999988743211 000 000 Q ss_pred cCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhccccccccccceE Q lcl|Aclame:pro 159 GQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALIDPETGNIR 238 (347) Q Consensus 159 ~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~v~ 238 (347) ...... .+.. ........++.|+++...+...+.. .-.+|++|..|..|.+- +-.+..|....++..|..+ T Consensus 256 ~~~~~~-~~~~-----~~~~~~~~~~~i~~~~~~~~~~~~~--~~~~v~n~~~~~~l~~l-kd~~G~~l~~~~~~~~~~~ 326 (415) T protein:vir:98 256 SSGFEK-EGKK-----LEVKKAKSLDDIKDAINLNVKPNYE--HNVAIVSQTMFAKLDKM-KDKLGNYLIQPDVKEKTQQ 326 (415) T ss_pred cccccc-cccc-----cccccccchhHHHHHHHhhhhhccC--CCEEEEcHHHHHHHHHh-hccCCceeeccCcCCCCCc Confidence 000000 0000 0011111267788888888777764 23468899999998753 2233455444456677778 Q ss_pred EEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeeccccchhhH Q lcl|Aclame:pro 239 NVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARRPEFQ 318 (347) Q Consensus 239 ~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~~~~~~ 318 (347) +++|++|+.++++|....+.. .-+.+||++.+- .+...+++++...+ ..+ T Consensus 327 ~l~G~pV~~~~~~~~~~~~~~--------------------~~~~Gd~~~~~~---------~~~~~~~~v~~~~~-~~~ 376 (415) T protein:vir:98 327 RLLGAKIEILPDEVLGQKGNN--------------------TLIIGNLKDAIV---------LFDRSQYQASWTDY-MHF 376 (415) T ss_pred eecceeeEEecccccCCCCcc--------------------EEEEEehhccEE---------EEeecceEEEEecc-ccC Confidence 999999999999985432211 113445554321 23344455554432 233 Q ss_pred hhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 319 ADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 319 ~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) ...+++.+.++.++++|++++.+..+++| T Consensus 377 ~~~~~~~~r~d~~v~~~~a~~~~~~~~~~ 405 (415) T protein:vir:98 377 GECLMIAVRQDCRILDYKSAIVIEYDDSE 405 (415) T ss_pred ceEEEEEEEeccEEeccccEEEEEEeccC Confidence 44678889999999999999999999999 No 73 >protein:vir:7771 Length: 330 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817605;genbank:gi:29566035;genbank:GeneID:1259229 Probab=99.46 E-value=4e-15 Score=99.52 Aligned_cols=298 Identities=12% Similarity=0.039 Sum_probs=167.8 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEecc-ccceeeeecCCCCC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVM-GRTKGYYLAPGENL 79 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~i-G~~t~~~~~~g~~~ 79 (347) |+.... |.......++.-.+..+++..++.+..++.+.++.+.++.+..+ ..+.+|+. +.+.+..+..|..+ T Consensus 1 m~~~~~------~a~~~~~t~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~-~~~~~p~~~~~~~a~~v~Eg~~~ 73 (330) T protein:vir:77 1 MAGSTV------PSTQVALTGDFSAFLTPEQSQDYFAEIEKTSIVQRIARKVPMGP-TGISIPHWTGAVSASWTGEAERK 73 (330) T ss_pred Cccccc------chhhccccCCCcceechhHHHHHHHHHHhccchhhhcceeeccC-CceEEEEEcCCcceeEecCCCcc Confidence 776322 11111112222224456777889999999999999988777654 44778876 55566666667777 Q ss_pred CCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccCccc Q lcl|Aclame:pro 80 DDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIAGLG 159 (347) Q Consensus 80 ~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~g~~ 159 (347) +.+ +++..++++..-+. +.-..|.+-=..++.+|+.+.+.++.+++++++.|+.+|. +........+...... T Consensus 74 ~~~--~~~f~~i~~~~~k~-~~~~~is~ell~ds~~~~~~~i~~~l~~ai~~~~~~~~l~----G~g~~~~~~g~~~~~~ 146 (330) T protein:vir:77 74 PIT--KGSFGKQELEPVKI-TTIFAESAEVVRLNPLNYLNTMRTKIAEAIALKFDAAAIH----GIDKPSAFKGYLAETT 146 (330) T ss_pred ccc--cceeeEEEEeEEEE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhc----ccCCCCcccccccccc Confidence 643 46677766666553 3333454422233568899999999999999999998873 1111000000000000 Q ss_pred CceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhcccccccc-----c Q lcl|Aclame:pro 160 QAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALIDPE-----T 234 (347) Q Consensus 160 ~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~-----~ 234 (347) ..... ......+.......+++.|.++...+...+.+. ..++++|..|..|.+-. -.+..|.-..... . T Consensus 147 ~~~~~---~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~--~~~vmn~~~~~~l~~lk-d~~G~~l~~~~~~~~~~~~ 220 (330) T protein:vir:77 147 KVVSL---ADTNLTTASGPQGNAYLAVNNALSLLVNSGKKW--TGTLLDNVTEPILNTAV-DGNGRPLFVESTYTEQVGA 220 (330) T ss_pred cccee---ecccccccccccchhHHHHHHHHHhhhhcCCCc--cEEEEcHHHHHHHHHHh-ccCCceeecCccccccccc Confidence 00011 011111112223345778888888888877643 35789999999887532 1222332222222 2 Q ss_pred cceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeeccccc Q lcl|Aclame:pro 235 GNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARR 314 (347) Q Consensus 235 G~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~~ 314 (347) ..-++++|++|+.++++|....+. ..--|.+||++.+ + +..++++++...+ T Consensus 221 ~~~~~l~G~PV~~~~~~p~~~~~~-------------------~~~~~~gd~s~~~--i--------~~~~~~~i~~~~e 271 (330) T protein:vir:77 221 IREGRILGRPTYVADNVVNGTVGN-------------------RVVGVMGDFSQVI--W--------GQIGGLSFDVTDQ 271 (330) T ss_pred cCCceecceeeEEeccccCCCCCC-------------------ccEEEEEecceEE--E--------EEecCcEEEEeec Confidence 233579999999999998532110 0111334555432 1 2233334433222 Q ss_pred h------------------hhH--hhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 315 P------------------EFQ--ADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 315 ~------------------~~~--~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) . .++ ...++...++|..++||++.+.|....+- T Consensus 272 ~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~i~~~~~~ 324 (330) T protein:vir:77 272 ATLDFGEEQGGVWVPKLISLWQHNMVAVRCEAEFAFMVNDKDAFVKLTDQVAG 324 (330) T ss_pred ceeeecccccccccccccchhhcCcEEEEEEEEeccEEecccceEEEEeccCC Confidence 1 011 24467888999999999999888655543 No 74 >protein:vir:9410 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803388;genbank:gi:29028700;genbank:GeneID:1258136 Probab=99.44 E-value=3.8e-15 Score=99.63 Aligned_cols=295 Identities=9% Similarity=0.006 Sum_probs=168.1 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCc-eEEEec-cccceeeeecCCCC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGK-SASFPV-MGRTKGYYLAPGEN 78 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~-tv~i~~-iG~~t~~~~~~g~~ 78 (347) +.+... .....+.+.. ..++--.+.-+.+.+++++..+..+.++.++++..+.+|. ++.++. .+...+.....|.. T Consensus 109 ~~~~~~-~~~~~~~~~~-~~~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~Eg~~ 186 (415) T protein:vir:94 109 FTEYLE-TRNDIQGGSL-KTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEKVEELEE 186 (415) T ss_pred HHHHhh-hhhhhhhhcc-ccccccccCcHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEeecCCccceecccccc Confidence 000000 0000000000 0111111344889999999999999999999888776443 344443 34445555555666 Q ss_pred CCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccCcc Q lcl|Aclame:pro 79 LDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIAGL 158 (347) Q Consensus 79 ~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~g~ 158 (347) ++.. ..++..++++.+-+. +..+.|.+-=..++.+|+.+.+.++.++++++..|+.|+.-...+ .+ .+. T Consensus 187 ~~~~-~~~~~~~i~~~~~k~-~~~~~is~ell~ds~~~~~~~i~~~l~~~~~~~~~~~il~g~g~g----~~-----~~~ 255 (415) T protein:vir:94 187 NPEL-AVKPFFQLAYDINTH-RGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKG----ST-----GST 255 (415) T ss_pred cccc-ccccceeeEeeheee-eeechhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccC----cc-----ccc Confidence 5432 123456656655554 233344432223456889999999999999999999887532110 00 000 Q ss_pred cCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhccccccccccceE Q lcl|Aclame:pro 159 GQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALIDPETGNIR 238 (347) Q Consensus 159 ~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~v~ 238 (347) ..+.... +.... .+. ...++.|+++...+...+.. .-.+|++|..|..|.+- +-.+..|.-..++.+|..+ T Consensus 256 ~~~~~~~-~~~~~-~~~----~~~~~~i~~~~~~~~~~~~~--~~~~vmn~~~~~~l~~l-kd~~G~~l~~~~~~~~~~~ 326 (415) T protein:vir:94 256 SSGFEKE-GKKLE-VKK----AKSLDDIKDAINLNVKPNYE--HNVAIVSQTMFAKLDKM-KDKLGNYLIQPDVKEKTQQ 326 (415) T ss_pred ccccccc-ccccc-ccc----ccchHHHHHHHHhhhhhccC--CCEEEEcHHHHHHHHHh-hccCCCeeeccCcCCCCCc Confidence 0000000 00000 011 11267788888787777664 33568999999999763 2233445444456677778 Q ss_pred EEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeeccccchhhH Q lcl|Aclame:pro 239 NVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARRPEFQ 318 (347) Q Consensus 239 ~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~~~~~~ 318 (347) +++|++|+.++++|....+.. .-+.+||++.+. .+...+++++.... .++ T Consensus 327 ~l~G~pV~~~~~~~~~~~~~~--------------------~i~~gd~~~~~~---------~~~~~~~~v~~~~~-~~~ 376 (415) T protein:vir:94 327 RLLGAKIEILPDEVLGQKGNN--------------------TLIIGNLKDAIV---------LFDRSQYQASWTDY-MHF 376 (415) T ss_pred eecceeeEEecccccCCCCcc--------------------EEEEEehhccEE---------EEeecceEEEEecc-ccC Confidence 999999999999985432111 113345554321 12334445554332 334 Q ss_pred hhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 319 ADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 319 ~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) ...+++.+.++.++++|++++.+..++++ T Consensus 377 ~~~~r~~~r~d~~~~~~~a~~~~~~~~~~ 405 (415) T protein:vir:94 377 GECLMIAVRQDCRILDYKSAIVIEYDDSE 405 (415) T ss_pred ceEEEEEEEeccEEeccccEEEEEEeccC Confidence 45678889999999999999999999999 No 75 >protein:vir:485 Length: 407 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543092;swissprot:trembl:q8w627;genbank:gi:18249904;uniprot:Q8W627;genbank:GeneID:929693 Probab=99.43 E-value=1.6e-14 Score=96.16 Aligned_cols=296 Identities=11% Similarity=-0.001 Sum_probs=161.5 Q ss_pred CCCCcc--CccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEe-ccccceeeeecCCC Q lcl|Aclame:pro 1 MANATG--GQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFP-VMGRTKGYYLAPGE 77 (347) Q Consensus 1 m~~~~~--~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~-~iG~~t~~~~~~g~ 77 (347) |-.... ....-.+.-.....++--.+.-+.|..++.+..+..+.++.+.++.+..+++ ..+| ..+.+++.-...|. T Consensus 90 l~~g~~~~~~~~e~~a~~~~t~~~gG~~iP~~~~~~I~~~~~~~~~l~~~~~~~~~~~~~-~~~~~~~~~~~a~~v~E~~ 168 (407) T protein:vir:48 90 MRKGREDGLRELERKALQVGNDEDGGYAIPEELDRTILTLLKDEVVMRQEATVITLGGSD-YKKLVNLGGTTSGWVGETD 168 (407) T ss_pred HhccchhhhhHHHHHhhhcccCCCCcccccHhHHHHHHHHHHhhhhhhhhceeeecCCCc-eEEEEecCCcceeeecccc Confidence 111000 0000000000000011111445899999999999999999988877776654 5554 44555555454555 Q ss_pred CCCCCCCCCCCCceEEEEeeeeecc-hhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccC Q lcl|Aclame:pro 78 NLDDKRKDIKHSEKVIQIDGLLTSD-VLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIA 156 (347) Q Consensus 78 ~~~~~~~~~~~~~~~l~ID~~~~~~-~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~ 156 (347) ..+.+ ...+..++++.+-++ .. ..|.+-=..++.+|+.+.+.++.++++++..|+.++. +. .. T Consensus 169 ~~~~~-~~~~f~~i~~~~~k~--~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~~~a~l~----G~---------G~ 232 (407) T protein:vir:48 169 ARPET-ATSKLGLIEPFMGEI--YGNPQATQKMLDDAFFNVEDWINSELALEFAEQEEIAFTS----GD---------GS 232 (407) T ss_pred ccccc-ccccceeEEeeeeee--EeehhhHHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhhc----cC---------CC Confidence 54422 123455555555443 33 3343332334667999999999999999999998763 10 00 Q ss_pred cccCceeeeeccc--------cc--ccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhc Q lcl|Aclame:pro 157 GLGQAVVLNIGAA--------AD--LVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANY 226 (347) Q Consensus 157 g~~~~~~i~~~~~--------~~--~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~ 226 (347) +.+.|........ +. ..........-++.|+++...|.....+ +-.+|++|..|..|.+-. -.++.| T Consensus 233 ~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~i~~l~~~l~~~~~~--~a~~v~n~~~~~~L~~lk-D~~Gr~ 309 (407) T protein:vir:48 233 KKPKGFLAYESTDEDDKTRAFGKLQHIASGAASGVTADAIIKLIYTLRKAHRS--GAKFMMNNSSLFAIRLLK-DNDGNY 309 (407) T ss_pred CccceeeecccccccccccccccccccccccccccChHHHHHHHHhhchhhhc--CCEEEEcHHHHHHHHHhh-ccCCce Confidence 1111111000000 00 0000001111267788888888777654 234589999999886522 223344 Q ss_pred cccccccccceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhh Q lcl|Aclame:pro 227 AALIDPETGNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKD 306 (347) Q Consensus 227 ~~~~~~~~G~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~ 306 (347) .-..++..|..++++|.+|+.++++|....+... =+.+||+... .++.+ +. T Consensus 310 l~~~~~~~g~~~~l~G~PV~~~~~~p~~~~~~~~--------------------i~~Gd~~~~~-~i~~~--------~~ 360 (407) T protein:vir:48 310 LWRPGIELGQPSSLAGYGIVENEQMPDIAADAKA--------------------IAFGNFKRGY-TIVDR--------IG 360 (407) T ss_pred eeccCcCCCCCceecceeeEEecCcCCccCCccE--------------------EEEEeccccE-EEEEe--------ec Confidence 3334456777789999999999999853221110 1234555422 22211 22 Q ss_pred eeeccccchh--hHhhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 307 MALERARRPE--FQADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 307 ~~~e~~~~~~--~~~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) ++++ +++- +-.-.+++.+.+++++++|++.+.|..+++| T Consensus 361 ~~i~--~d~~~~~~~~~~~~~~r~d~~v~~~~a~~~l~~~aa~ 401 (407) T protein:vir:48 361 TRIL--RDPYTNKPFVGFYTTKRTGGMLVDSQAIKLMKIGAAT 401 (407) T ss_pred eEEE--eeccccCCcEEEEEEEEeccEEecccceEEEEeeccC Confidence 2222 2221 2223467788899999999999999999988 No 76 >protein:vir:105905 Length: 304 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004375;genbank:gi:122891830;genbank:GeneID:4712376 Probab=99.38 E-value=2.1e-14 Score=95.56 Aligned_cols=285 Identities=13% Similarity=0.065 Sum_probs=167.2 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEecc-ccceeeeecCCCCC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVM-GRTKGYYLAPGENL 79 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~i-G~~t~~~~~~g~~~ 79 (347) ||-... +. .+- ...++.-.+.-+.+..++.+.-++.+.++.+.++..+.+ ...+||+. +...+..+..|..+ T Consensus 1 ma~~~~-~~----~~~-~~t~~gg~lip~~~~~~ii~~~~~~~~l~~~~~~~~~~~-~~~~ip~~~~~~~a~~v~E~~~~ 73 (304) T protein:vir:10 1 MATPTY-TP----GNV-ILSDFKNGVIPAEQGTLIMKDIMANSAIMKLAKNEPMTA-QKKKFTYLAKGVGAYWVSETERI 73 (304) T ss_pred Cccccc-cc----ccc-cccCCCceecchhHHHHHHHHHHhccchhhhcceeeccC-CceEEEEEeCCcceEEeecCccc Confidence 765322 11 111 111122236778999999999999998888887766654 55788876 45566666666666 Q ss_pred CCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccCccc Q lcl|Aclame:pro 80 DDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIAGLG 159 (347) Q Consensus 80 ~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~g~~ 159 (347) +. .+++.+++++...++ ..-..|.+-=..++.+|+.+.+.++.++++++..|+.++. +.....+. .... T Consensus 74 ~~--~~~~~~~i~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~ia~~~d~~~l~----G~g~~~~~----~~~~ 142 (304) T protein:vir:10 74 QT--SKPEYAQAEMEAKKI-GVIIPLSKEFLKWTAKDFFNEVKPLIAEAFYKAFDQAVIF----GTKSPYNT----STSG 142 (304) T ss_pred cc--ccceeeEEEEEEEEE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhee----ccCCCccc----cccc Confidence 54 346777777777664 3334454422334568999999999999999999998763 11110000 0000 Q ss_pred CceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhccccccccccceEE Q lcl|Aclame:pro 160 QAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALIDPETGNIRN 239 (347) Q Consensus 160 ~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~v~~ 239 (347) .+....... ...........++.|+++..++...+... ..++++|..|..|.+-.. .+..| +-....++ T Consensus 143 ~~~~~~~~~---~~~~~~~~~~~~~~i~~~~~~l~~~~~~~--~~~v~~~~~~~~L~~lkd-~~G~~-----l~~~~~~~ 211 (304) T protein:vir:10 143 KPLVEGAEE---KGNVVTDTNNLYVDLSALMATIEDEELDP--NGVLTTRSFRSKMRNALD-ANDRP-----LFDANGNE 211 (304) T ss_pred ccccccccc---cccccccccchHHHHHHHHHHhhhccCCc--CEEEEcHHHHHHHHHhhc-cCCcE-----eecCCCcc Confidence 111100000 01111122334788888888888877643 357899999999875321 11122 11233468 Q ss_pred EeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeeccccc----- Q lcl|Aclame:pro 240 VMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARR----- 314 (347) Q Consensus 240 i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~~----- 314 (347) ++|.+|+.++++|..... +.-+.+||++.. + +..++++++...+ T Consensus 212 l~G~PV~~~~~~~~~~~~---------------------~~~~~gd~~~~~--~--------~~~~~~~i~~~~e~~~~~ 260 (304) T protein:vir:10 212 IMGLPLSYTGADVYDKKK---------------------SLALMGDWDYAR--Y--------GILQGIEYAISEDATLTT 260 (304) T ss_pred ccceeeEEecccccCCCC---------------------cEEEEEehhhEE--E--------EEecceEEEEeecceeee Confidence 999999999999842111 112335565532 2 1222333332221 Q ss_pred -----hh------hH--hhHHhhhhhhcCcccccceEEEEEecC Q lcl|Aclame:pro 315 -----PE------FQ--ADQIIGKYAMGHGGLRPEAAGALVFTP 345 (347) Q Consensus 315 -----~~------~~--~d~i~~~~~~G~~~lRPe~~~~l~~~~ 345 (347) +. ++ ...++..+.+|..++||++.+.|..|. T Consensus 261 ~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v~~~~a~~~l~~a~ 304 (304) T protein:vir:10 261 LQASDASGQPVSLFERDMFALRATMHIAYMNVKPEAFATLKPTE 304 (304) T ss_pred ecccccCccchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 11 11 234567788999999999999999988 No 77 >protein:vir:94142 Length: 304 # NCBI annotation: ORF013 # Family: family:all:507 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240234;genbank:gi:66395898;genbank:GeneID:5133311 Probab=99.38 E-value=2.1e-14 Score=95.56 Aligned_cols=285 Identities=13% Similarity=0.065 Sum_probs=167.2 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEecc-ccceeeeecCCCCC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVM-GRTKGYYLAPGENL 79 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~i-G~~t~~~~~~g~~~ 79 (347) ||-... +. .+- ...++.-.+.-+.+..++.+.-++.+.++.+.++..+.+ ...+||+. +...+..+..|..+ T Consensus 1 ma~~~~-~~----~~~-~~t~~gg~lip~~~~~~ii~~~~~~~~l~~~~~~~~~~~-~~~~ip~~~~~~~a~~v~E~~~~ 73 (304) T protein:vir:94 1 MATPTY-TP----GNV-ILSDFKNGVIPAEQGTLIMKDIMANSAIMKLAKNEPMTA-QKKKFTYLAKGVGAYWVSETERI 73 (304) T ss_pred Cccccc-cc----ccc-cccCCCceecchhHHHHHHHHHHhccchhhhcceeeccC-CceEEEEEeCCcceEEeecCccc Confidence 765322 11 111 111122236778999999999999998888887766654 55788876 45566666666666 Q ss_pred CCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccCccc Q lcl|Aclame:pro 80 DDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIAGLG 159 (347) Q Consensus 80 ~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~g~~ 159 (347) +. .+++.+++++...++ ..-..|.+-=..++.+|+.+.+.++.++++++..|+.++. +.....+. .... T Consensus 74 ~~--~~~~~~~i~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~ia~~~d~~~l~----G~g~~~~~----~~~~ 142 (304) T protein:vir:94 74 QT--SKPEYAQAEMEAKKI-GVIIPLSKEFLKWTAKDFFNEVKPLIAEAFYKAFDQAVIF----GTKSPYNT----STSG 142 (304) T ss_pred cc--ccceeeEEEEEEEEE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhee----ccCCCccc----cccc Confidence 54 346777777777664 3334454422334568999999999999999999998763 11110000 0000 Q ss_pred CceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhccccccccccceEE Q lcl|Aclame:pro 160 QAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALIDPETGNIRN 239 (347) Q Consensus 160 ~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~v~~ 239 (347) .+....... ...........++.|+++..++...+... ..++++|..|..|.+-.. .+..| +-....++ T Consensus 143 ~~~~~~~~~---~~~~~~~~~~~~~~i~~~~~~l~~~~~~~--~~~v~~~~~~~~L~~lkd-~~G~~-----l~~~~~~~ 211 (304) T protein:vir:94 143 KPLVEGAEE---KGNVVTDTNNLYVDLSALMATIEDEELDP--NGVLTTRSFRSKMRNALD-ANDRP-----LFDANGNE 211 (304) T ss_pred ccccccccc---cccccccccchHHHHHHHHHHhhhccCCc--CEEEEcHHHHHHHHHhhc-cCCcE-----eecCCCcc Confidence 111100000 01111122334788888888888877643 357899999999875321 11122 11233468 Q ss_pred EeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeeccccc----- Q lcl|Aclame:pro 240 VMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARR----- 314 (347) Q Consensus 240 i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~~----- 314 (347) ++|.+|+.++++|..... +.-+.+||++.. + +..++++++...+ T Consensus 212 l~G~PV~~~~~~~~~~~~---------------------~~~~~gd~~~~~--~--------~~~~~~~i~~~~e~~~~~ 260 (304) T protein:vir:94 212 IMGLPLSYTGADVYDKKK---------------------SLALMGDWDYAR--Y--------GILQGIEYAISEDATLTT 260 (304) T ss_pred ccceeeEEecccccCCCC---------------------cEEEEEehhhEE--E--------EEecceEEEEeecceeee Confidence 999999999999842111 112335565532 2 1222333332221 Q ss_pred -----hh------hH--hhHHhhhhhhcCcccccceEEEEEecC Q lcl|Aclame:pro 315 -----PE------FQ--ADQIIGKYAMGHGGLRPEAAGALVFTP 345 (347) Q Consensus 315 -----~~------~~--~d~i~~~~~~G~~~lRPe~~~~l~~~~ 345 (347) +. ++ ...++..+.+|..++||++.+.|..|. T Consensus 261 ~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v~~~~a~~~l~~a~ 304 (304) T protein:vir:94 261 LQASDASGQPVSLFERDMFALRATMHIAYMNVKPEAFATLKPTE 304 (304) T ss_pred ecccccCccchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 11 11 234567788999999999999999988 No 78 >protein:vir:4339 Length: 395 # NCBI annotation: major head protein # Family: family:all:585 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061502;genbank:gi:9635591;genbank:GeneID:1262860 Probab=99.38 E-value=3.9e-14 Score=94.12 Aligned_cols=291 Identities=14% Similarity=0.084 Sum_probs=168.2 Q ss_pred CCCCccCcccc-ccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEecc-c-cceeeeecCCC Q lcl|Aclame:pro 1 MANATGGQQIG-ANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVM-G-RTKGYYLAPGE 77 (347) Q Consensus 1 m~~~~~~~~~~-~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~i-G-~~t~~~~~~g~ 77 (347) |.......... -+.......++.-.+..+.|..++.+..+..+.+++++++.++.+ .++.+|+. + ..++..+..|+ T Consensus 98 ~~~~~~~~~~~~~~~~~~~~~~~~g~~vp~~~~~~ii~~~~~~~~l~~l~~~~~~~~-~~~~~~~~~~~~~~a~~v~E~~ 176 (395) T protein:vir:43 98 TSSLRGSHRVSMPRSAITSIDGSGGALVAPDRRPGVVAAPQRRLTIRDLVAPGTTES-NSVEYVRETGFVNNAAPVSEGT 176 (395) T ss_pred HHHhhhhhhhhhhhhhhcccCCCCccccchhhHHHHHHHHHhhhhHHhhccceecCC-CceEEEEEecCCCceeeecCCc Confidence 10000000000 000000111111236778899999999999999999988877754 45788874 3 34555555666 Q ss_pred CCCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccCc Q lcl|Aclame:pro 78 NLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIAG 157 (347) Q Consensus 78 ~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~g 157 (347) .++. .+++..++++.+.++- .-..|.+ +-.+...++.+.+.++.++++++..|..++. +.. .+..+.| T Consensus 177 ~~~~--~~~~~~~i~~~~~k~~-~~~~is~-ell~d~~~l~~~v~~~la~a~~~~~d~~~l~----G~g----~~~~~~G 244 (395) T protein:vir:43 177 QKPY--SDLTFELENAPVRTIA-HLFKASR-QILDDASALQSYIDARARYGLMLVEECQLLY----GNG----TGANLHG 244 (395) T ss_pred cccc--cccceeEEEEeeeeEE-EeehhhH-HHHHhHHHHHHHHHHHHHHHHHHHHHHHHHh----ccC----CCCcccc Confidence 6553 3467777777777763 3344543 3344445688888999999999999998863 111 1111112 Q ss_pred ccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhccccccccccce Q lcl|Aclame:pro 158 LGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALIDPETGNI 237 (347) Q Consensus 158 ~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~v 237 (347) ............ .........++.|.++...+...+.+. -.+|++|..|..|.+-.. .+..|... +..+|.. T Consensus 245 i~~~~~~~~~~~----~~~~~~~~~~~~i~~~~~~~~~~~~~~--~~~vmn~~~~~~l~~lkd-~~G~~i~~-~~~~~~~ 316 (395) T protein:vir:43 245 IIPQAQAYAPPS----GVVVTAEQRIDRIRLAILQAQLAEFPA--SGIVLNPIDWALIELNKD-AENRYIIG-SPQNGTT 316 (395) T ss_pred cccccccccccc----ccccccchhHHHHHHHHHhhccccCCC--cEEEEcHHHHHHHHHhhc-cCCceecc-ccccCCC Confidence 111111100000 011112234778888888888777643 367899999998865332 23334332 3456667 Q ss_pred EEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeeccccch-- Q lcl|Aclame:pro 238 RNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARRP-- 315 (347) Q Consensus 238 ~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~~~-- 315 (347) ..++|++|+.++.+|... -+.+||+... +++ ...+++++..+.. T Consensus 317 ~~l~G~pVv~~~~~~~~~-------------------------~~~gd~~~~~-~~~--------~~~~~~i~~~~~~~~ 362 (395) T protein:vir:43 317 PTLWRLPVVETQAITQDE-------------------------FLTGAFSLGA-QIF--------DRMDIEVLVSTENDK 362 (395) T ss_pred ceecceeeEEcCCCCCCc-------------------------EEEEeccceE-EEE--------EecceEEEEeccccc Confidence 789999999999998421 1234555422 222 1223444444322 Q ss_pred hhH--hhHHhhhhhhcCcccccceEEEEEecCC Q lcl|Aclame:pro 316 EFQ--ADQIIGKYAMGHGGLRPEAAGALVFTPA 346 (347) Q Consensus 316 ~~~--~d~i~~~~~~G~~~lRPe~~~~l~~~~a 346 (347) ..+ ...++....+|.++++|++++.+..++| T Consensus 363 ~f~~~~~~~r~~~r~d~~v~~~~a~~~~~~taa 395 (395) T protein:vir:43 363 DFENNMVTIRAEERLAFAVYRPEAFVTGSLTAS 395 (395) T ss_pred hhhcCcEEEEEEEeeccEEecccceEEEEeccC Confidence 122 2356777889999999999999988888 No 79 >protein:vir:104085 Length: 320 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655596;genbank:gi:109392467;genbank:GeneID:4156953 Probab=99.37 E-value=3.2e-14 Score=94.54 Aligned_cols=296 Identities=12% Similarity=0.010 Sum_probs=162.6 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEecc-ccceeeeecCCCCC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVM-GRTKGYYLAPGENL 79 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~i-G~~t~~~~~~g~~~ 79 (347) |+--... +...|.......++.-.+..+.+..++.+...+.+.++.+.++.... +.+.+||+. +.+.+..+..|+++ T Consensus 1 ~~~~~~~-~~~~~~~~~t~~~~~~~~ip~~~~~~ii~~~~~~s~l~~~~~~~~~~-~~~~~~p~~~~~~~a~~v~E~~~~ 78 (320) T protein:vir:10 1 MAAGTAF-QVDHAQIAQTGDTMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKVPMG-TTGQKIPHWIGDVSAQWIGEGDMK 78 (320) T ss_pred CCCCccC-CHHHHHhhccccccccccccHHHHHHHHHHHHhccchhhhcceeecc-CCceEEEEEeCCcceEEecCCccc Confidence 5543321 11222222222222222556889999999999999999988776655 455788875 45566666667777 Q ss_pred CCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccCccc Q lcl|Aclame:pro 80 DDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIAGLG 159 (347) Q Consensus 80 ~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~g~~ 159 (347) +. .+++.+++++.+-+. ..-..|.+-=..++..|+.+.+.+++++++++..|+.+|. +.... ....+.+.. T Consensus 79 ~~--~~~~f~~v~~~~~k~-~~~~~is~ell~ds~~~l~~~i~~~l~~a~a~~~d~a~l~----G~g~~--~~~~~~~~~ 149 (320) T protein:vir:10 79 PI--TKGNMTSQNIAPHKI-ATIFVASAETVRANPANYLGTMRTKVATAFAMAFDSAALN----GTDSP--FPTYLAQTT 149 (320) T ss_pred cc--cccceeEEEEeeEEE-EEeehhhHHHHhcChHHHHHHHHHHHHHHHHHHHHHHhhc----ccCCC--CCccccccc Confidence 64 346777777766664 3334454422234678999999999999999999998863 11100 000011110 Q ss_pred CceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhccccc-----cccc Q lcl|Aclame:pro 160 QAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALI-----DPET 234 (347) Q Consensus 160 ~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~-----~~~~ 234 (347) .+......+... ......+-+.+.++...+...+. ..-+++++|..|..|.+-.+ .+..|.... .... T Consensus 150 ~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~--~~~~~v~n~~~~~~L~~lkd-~~G~~l~~~~~~~~~~~~ 222 (320) T protein:vir:10 150 KSVSLADPGGAT----ASDLTAYDAVAVNGLSLLVNAKK--KWTHTLLDDIVEPILNGAKD-KNGRPLFIESTYTDENSP 222 (320) T ss_pred ccccceeccccc----ccccccHHHHHHHHHhhhhcccC--CCcEEEEcHHHHHHHHHhhc-cCCceeeccccccCcccc Confidence 110100000000 00111112335555666665554 33577999999999975322 122222111 1111 Q ss_pred cceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeeccccc Q lcl|Aclame:pro 235 GNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARR 314 (347) Q Consensus 235 G~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~~ 314 (347) ..-++++|++|+.++++|.... --+.+||++.. .+...+++++..++ T Consensus 223 ~~~~~i~g~pv~~~~~~~~~~~-----------------------~~~~gd~~~~~----------~~~~~~~~i~~~~~ 269 (320) T protein:vir:10 223 FRAGRIVSRPTILSDHVADGTT-----------------------VGYMGDFRNVI----------WGQVGGLSFDVTDQ 269 (320) T ss_pred ccCceeeeeeeEecCCCCCCce-----------------------EEEEeecceEE----------EEEecCeEEEEeec Confidence 1123689999999999874210 01234555432 12333344443332 Q ss_pred hh--------------hH--hhHHhhhhhhcCcccccceEEEEE--ecCCC Q lcl|Aclame:pro 315 PE--------------FQ--ADQIIGKYAMGHGGLRPEAAGALV--FTPAA 347 (347) Q Consensus 315 ~~--------------~~--~d~i~~~~~~G~~~lRPe~~~~l~--~~~aa 347 (347) .- ++ .-.+++.+.+|.+++||++.+.|. .+|.| T Consensus 270 ~~~~~~~~~~~~~~~~f~~~~~~~r~~~~~d~~v~~~~a~~~l~~~~ap~~ 320 (320) T protein:vir:10 270 ATLNLGTPTEPNFVSLWQHNLVAVRVEAEYAFHNNDKDAFVKLTNVVTPDA 320 (320) T ss_pred ceeeeccccccccchhhhcCcEEEEEEEeeccEEecccceEEEEeccCCCC Confidence 11 11 234577788999999999998887 56666 No 80 >protein:vir:1638 Length: 298 # NCBI annotation: Structural protein # Family: family:all:966 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695059;genbank:gi:23455750;genbank:GeneID:955469 Probab=99.37 E-value=3.7e-14 Score=94.23 Aligned_cols=283 Identities=11% Similarity=-0.052 Sum_probs=165.3 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEec-cccceeeeecCCCCC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPV-MGRTKGYYLAPGENL 79 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~-iG~~t~~~~~~g~~~ 79 (347) |+-. . | .+..+.+..++.+..+..|+++.+.++.+..+|+ +.||+ .|.+.+..+..|+.+ T Consensus 1 ma~~-g--------------G---~lvp~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~-~~ip~~~~~~~a~~v~E~~~~ 61 (298) T protein:vir:16 1 MVLN-K--------------G---TLFDPTLVTDLISKVAGKSSIARLSAQKPIPFNG-EKVFTFTMDSEIDVVAESGKK 61 (298) T ss_pred Cccc-C--------------c---ceechhHHHHHHHHHHhhhhhhhhcceeeccCCc-eEEEEEecCcceEEecCCccc Confidence 6542 1 1 1455788899999998899999998877776554 67776 556677667667766 Q ss_pred CCCCCCCCCCceEEEEeeeeecch-hhccHHH---HHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccc Q lcl|Aclame:pro 80 DDKRKDIKHSEKVIQIDGLLTSDV-LIYDIED---AMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENI 155 (347) Q Consensus 80 ~~~~~~~~~~~~~l~ID~~~~~~~-~Vdd~D~---~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~ 155 (347) +.+ +++.+++++..-+ +... .|.+-=. .....++.+.+.++.++++++..|+.++.- .... .+.. T Consensus 62 ~~~--~~~f~~v~l~~~k--~a~~~~iS~ell~~s~d~~~~l~~~i~~~la~ai~~~~d~~~l~G----~~~~---~g~~ 130 (298) T protein:vir:16 62 THG--GVTLAPQTMVPIK--VEYGARISDEFMYASDEEKINILQEFNDGFAKKVARGIDLMAFHG----VNPR---LGTA 130 (298) T ss_pred ccc--ccceeEEEEeeee--EEEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHHhhcc----ccCC---CCcc Confidence 543 4566665555544 3333 3322111 123457888999999999999999988742 1100 0000 Q ss_pred CcccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhcccccccccc Q lcl|Aclame:pro 156 AGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALIDPETG 235 (347) Q Consensus 156 ~g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G 235 (347) .+. .+.....+..+...........+++.|+++..++...+.+.. .++++|..+..|.+-.. .+..|.-......| T Consensus 131 ~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~--~~vmn~~~~~~l~~lkd-~~G~~i~~~~~~~~ 206 (298) T protein:vir:16 131 SAV-IGTNHFDSKVTQKVEAPRGIADPNGAIENAVELLTGVDADVT--GIAINPSFRSALAKQKD-LQDNALFPELKWGA 206 (298) T ss_pred ccc-ccccccccccccccccccccccHHHHHHHHHHHhhhcCCCcc--EEEEcHHHHHHHHHhhc-cCCCeeecCcccCC Confidence 000 000000011111111111223346778888888888887543 47889999999876432 23444333445567 Q ss_pred ceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeecccc-- Q lcl|Aclame:pro 236 NIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERAR-- 313 (347) Q Consensus 236 ~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~-- 313 (347) ..++++|.+|+.++++|....+. ...-+.+||++.+.+. . ...++++... T Consensus 207 ~~~~l~G~PV~~~~~v~~~~~~~-------------------~~~~~~GDfs~~~~~~-~--------~~~~~~~~~~~~ 258 (298) T protein:vir:16 207 TPDTINGLPVDVNKTVSDMSLTQ-------------------RDRAIIGDFANGFKWG-Y--------AKEVPLEVIQYG 258 (298) T ss_pred CCceecceeeEEecccccccCCC-------------------ccEEEEeeccceEEEE-E--------ecCceEEEeecc Confidence 77899999999999998421110 0112446776543222 1 1222333222 Q ss_pred chh------hHh--hHHhhhhhhcCcccccceEEEEEecC Q lcl|Aclame:pro 314 RPE------FQA--DQIIGKYAMGHGGLRPEAAGALVFTP 345 (347) Q Consensus 314 ~~~------~~~--d~i~~~~~~G~~~lRPe~~~~l~~~~ 345 (347) ++. ++. -.+++...+|.+++||++.+.|..+. T Consensus 259 ~~~~~~~~~f~~~~v~~ra~~r~d~~v~~~~a~~~l~~at 298 (298) T protein:vir:16 259 DPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) T ss_pred CCcCcchhhhhcCcEEEEEEEEEccEeecccceEEEeecC Confidence 211 122 33677888999999999999998777 No 81 >protein:vir:9309 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803287;genbank:gi:29028597;genbank:GeneID:1258044 Probab=99.37 E-value=3.6e-14 Score=94.32 Aligned_cols=286 Identities=12% Similarity=0.075 Sum_probs=166.1 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEecc-ccceeeeecCCCCC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVM-GRTKGYYLAPGENL 79 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~i-G~~t~~~~~~g~~~ 79 (347) +++...-... -+..-.....+...+..+.+..++.+..+..|.++.+.++.++.+ ..++||+. +.+.+.-+..|..+ T Consensus 14 f~~~~~~~~~-~~a~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~l~~~~~~~~-~~~~ip~~~~~~~a~~v~Eg~~~ 91 (324) T protein:vir:93 14 FASNNVKPQV-FNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEG-TEKKFTFWADKPGAYWVGEGQKI 91 (324) T ss_pred HHHhhhhhhh-cccccccccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccC-CceEEEEEecCcceeeecCCccc Confidence 1111100000 001111111222236679999999999999999999887766654 45778765 66667667777777 Q ss_pred CCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccCccc Q lcl|Aclame:pro 80 DDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIAGLG 159 (347) Q Consensus 80 ~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~g~~ 159 (347) +.. +++.+++++..-+. ..-..|.+-=..++.+|+.+.+.++.++++++..|+.+|. +... ... + T Consensus 92 ~~~--~~~f~~i~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l~----G~g~----~~~----~ 156 (324) T protein:vir:93 92 ETS--KATWVNATMRAFKL-GVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGIL----NQGN----NPF----G 156 (324) T ss_pred ccc--ccceeEEEEEeEEE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhc----CCCC----CCc----C Confidence 643 46777777776664 3444565422334568999999999999999999998763 1110 000 0 Q ss_pred CceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhccccccccccceEE Q lcl|Aclame:pro 160 QAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALIDPETGNIRN 239 (347) Q Consensus 160 ~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~v~~ 239 (347) .+........... ..+...++.|.++...|...+... ..++++|..|..|.+-.. -.|...+..+..++ T Consensus 157 ~~~~~~~~~~~~~----~~~~~~~~~i~~~~~~l~~~~~~~--~~~v~n~~~~~~L~~l~d-----~~G~~~~~~~~~~~ 225 (324) T protein:vir:93 157 KSIAQSIEKTNKV----IKGDFTQDNIIDLEALLEDDELEA--NAFISKTQNRSLLRKIVD-----PETKERIYDRNSDS 225 (324) T ss_pred cccccccccccee----ccccccHHHHHHHHHhhhhccCCC--CEEEEcHHHHHHHHHhhC-----CCCCeeecCCCCCc Confidence 1111100000000 011123677888888888877533 367899999999875321 12222234455678 Q ss_pred EeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeeccccch---- Q lcl|Aclame:pro 240 VMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARRP---- 315 (347) Q Consensus 240 i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~~~---- 315 (347) ++|.+|+.++..+.. .+.-+.+||++.. + +..++++++..++. T Consensus 226 l~G~PVv~~~~~~~~-----------------------~~~i~~gdfs~~~--~--------~~~~~~~i~~~~~~~~~~ 272 (324) T protein:vir:93 226 LDGLPVVNLKSSNLK-----------------------RGELITGDFDKLI--Y--------GIPQLIEYKIDETAQLST 272 (324) T ss_pred ccceeeEeecCCCCC-----------------------cceEEEEecceEE--E--------EEecCcEEEEeecccccc Confidence 999999987765421 1112445665431 1 22334444444321 Q ss_pred ------------hhHhhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 316 ------------EFQADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 316 ------------~~~~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) ++-.-.+++.+.+|.++++|++.+.|..+.+- T Consensus 273 ~~~~~~~~~~~f~~n~~~~r~~~r~d~~v~~~~a~~~l~~a~~~ 316 (324) T protein:vir:93 273 VKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKR 316 (324) T ss_pred cccccccchhhhhcCcEEEEEEEEeccEEecccceEEEeccccc Confidence 11124567788899999999999988755444 No 82 >protein:vir:3870 Length: 400 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680487;swissprot:trembl:q8ltc0;genbank:gi:22296527;interpro:IPR006444;uniprot:Q8LTC0;genbank:GeneID:951713 Probab=99.37 E-value=8.7e-15 Score=97.67 Aligned_cols=278 Identities=13% Similarity=0.064 Sum_probs=158.0 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEecc--ccceeeeecCCCC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVM--GRTKGYYLAPGEN 78 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~i--G~~t~~~~~~g~~ 78 (347) +...............+...++--.+-.+.|..++++.....+.++++.++.++.++ +..+|.. +...+..+..|.. T Consensus 120 ~~~~~~~~~~~~~~~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~E~~~ 198 (400) T protein:vir:38 120 AVLRAVPTDASDAVNAGVKAADAASTIPETISNTPQRELQTVVDLKPFTNVFQASTQ-KGTYPTVANATTKMVTVAELEK 198 (400) T ss_pred hhhhhhhHHHHHHHhhcccccCCcccccHHHHHHHHHHHHhhhhhhhcceeEeccCc-ceEEEEEecCCCcccccccccc Confidence 000000000000000000111111244589999999999999999999888776544 3555543 4444445544444 Q ss_pred CCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccCcc Q lcl|Aclame:pro 79 LDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIAGL 158 (347) Q Consensus 79 ~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~g~ 158 (347) .+.. ..+..+++++.+.++ +.-..|.+-=-..+.+|+.+.+.++.+++|+...|+.|+.-. T Consensus 199 ~~~~-~~~~f~~i~~~~~k~-~~~~~is~ell~ds~~~~~~~i~~~l~~~~~~~~~~~i~~~~----------------- 259 (400) T protein:vir:38 199 NPAM-AKPEFKPVNWSVETY-RQALPVSQESIDDSAIDLVGLIAQNGQQIKVNTTNGAVATLL----------------- 259 (400) T ss_pred cccc-ccccceeeEeehhhe-eeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhhhhcc----------------- Confidence 4321 234556655555443 233334332223456789999999999999999998876321 Q ss_pred cCceeeeecccccccchhhHHHHHHHHHHHHHH-HHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhccccccccccce Q lcl|Aclame:pro 159 GQAVVLNIGAAADLVDVEARGKAILKGLTLARA-RLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALIDPETGNI 237 (347) Q Consensus 159 ~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~-~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~v 237 (347) +++ ++..... ++.|.++.. .++. ...-.+|++|..|..|.+-. -.++.|.-..++..|.. T Consensus 260 ~~~------~~~~~~~--------~~~~~~~~~~~~~~----~~~a~~v~~~~~~~~l~~lk-d~~G~~i~~~~~~~~~~ 320 (400) T protein:vir:38 260 KGF------TAKTISS--------VDDLKHINNVDLDP----AYSRVIIASQSFYNFLDTVK-DGNGRYLLQDSILTPSG 320 (400) T ss_pred ccc------ccccccc--------HHHHHHHHHhhhhh----hhCcEEEEcHHHHHHHHHhh-ccCCCeeeecCcCCCCc Confidence 000 0000000 233433322 2222 22456789999999987532 23445544445667777 Q ss_pred EEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeeccccchhh Q lcl|Aclame:pro 238 RNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARRPEF 317 (347) Q Consensus 238 ~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~~~~~ 317 (347) ++++|++|+.+++.|....+... -+.+||++.+- .+..++++++...+ .+ T Consensus 321 ~~l~G~pv~~~~~~~~~~~g~~~--------------------~~~gd~s~~~~---------~~~~~~~~~~~~~~-~~ 370 (400) T protein:vir:38 321 KSVLGMPIAVVSDDTLGAAGEAH--------------------AFLGDIKRAIL---------FANRADFMVRWVDD-QI 370 (400) T ss_pred cccccceeEEecccccCCCCceE--------------------EEEEeccccEE---------EEeecceEEEEecc-cc Confidence 88999999999999853222110 12345554321 22334455554433 44 Q ss_pred HhhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 318 QADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 318 ~~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) +...+++.+.+|.++++|++.+.|..+|+| T Consensus 371 ~~~~~~~~~r~d~~~~~~~a~~~l~~~~~a 400 (400) T protein:vir:38 371 YGQFLQAGMRFGVSVADEKAGYFLTYTPKA 400 (400) T ss_pred cceeEEEEEEeccEEecccceEEEEeecCC Confidence 566889999999999999999999999999 No 83 >protein:vir:8187 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817980;genbank:gi:29566414;genbank:GeneID:2700968 Probab=99.37 E-value=5.1e-14 Score=93.44 Aligned_cols=292 Identities=10% Similarity=-0.003 Sum_probs=167.5 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEecc-ccceeeeecCCCCC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVM-GRTKGYYLAPGENL 79 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~i-G~~t~~~~~~g~~~ 79 (347) |+-...+ . -+--++|..++.+..+..|+++.+.++.++.+| .+++|+. +.+.+.-+..|+.+ T Consensus 1 mat~~~g--------------g--~lvP~~~~~~ii~~~~~~s~i~~~~~~i~~~~~-~~~~p~~~~~~~a~wv~Eg~~~ 63 (311) T protein:vir:81 1 MVALATG--------------T--FQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFG-EQQYMTLTAPPRGEVVGEGAQK 63 (311) T ss_pred CceecCC--------------c--eEcchhHHHHHHHHHHhcchhhhhcceeecCCC-ceEEEEEeCCceeEEeecCccc Confidence 6664221 1 133488999999999999999999887776655 4888875 66677767777777 Q ss_pred CCCCCCCCCCceEEEEeeeeecchhhccHHHHH----hCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccc Q lcl|Aclame:pro 80 DDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAM----NHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENI 155 (347) Q Consensus 80 ~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q----~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~ 155 (347) +.+ +++.+++++..-++ ..-..|.+ +-.+ ...++.+.+.++.+++|++.+|+.++.-- . +...... T Consensus 64 ~~~--~~~f~~v~l~~~kl-~~~~~iS~-ell~~~~d~~~~l~~~i~~~la~ai~~~~d~a~l~G~----~--~~~~~~~ 133 (311) T protein:vir:81 64 SES--TATFAPVTAIPRKV-QVTQRFSQ-EVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGI----N--PLTGAAL 133 (311) T ss_pred ccc--cceeeEEEEeeEEE-EEeehhhH-HHhhcCcccHHHHHHHHHHHHHHHHHHHHHHhhhccc----c--CCCCccc Confidence 643 46777777766554 23333422 1222 33458899999999999999999887421 0 0000111 Q ss_pred CcccC-----ceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhccccc Q lcl|Aclame:pro 156 AGLGQ-----AVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALI 230 (347) Q Consensus 156 ~g~~~-----~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~ 230 (347) .|... ...++.+... ...++..|.++..++...+... ..++++|..+..|.+-. -.+..|.-.. T Consensus 134 ~gi~~~~~~~~~~~~~~~~~--------~~~~~~~i~~~~~~~~~~~~~~--~~~vmn~~~~~~l~~lk-d~~G~~l~~~ 202 (311) T protein:vir:81 134 SGSPAKILDTTNIVELTTGT--------SATPDLAVEAAVGLVLGDNLSP--DGVALDNTFSFMLATQR-DSQGRKLYPE 202 (311) T ss_pred ccccccccccceeeeecccc--------cchHHHHHHHHHHHhhhcCCCc--eEEEEcHHHHHHHHhhh-ccCCCeeecC Confidence 11111 1111111111 1112333445555555555422 34799999999996532 2233443233 Q ss_pred cccccceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeec Q lcl|Aclame:pro 231 DPETGNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALE 310 (347) Q Consensus 231 ~~~~G~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e 310 (347) ....+..++++|++|+.++++|............. ....+..--+.+||++.. .+..+.++++ T Consensus 203 ~~~~~~~~tl~G~Pv~~~~~i~~~~~~~~~~~~~~-------~~~~~~~~~~~gDfs~~~----------i~~~~~~~~~ 265 (311) T protein:vir:81 203 LGFGTDVASFAGLNAAVSDTVRGGPEAVTASTGVY-------RTTNPNVKAIAGDFSAFR----------WGVQVSIPLE 265 (311) T ss_pred ccccCCCceecceeEEecccccccccccccccchh-------cccCCccEEEEEecccEE----------EEEeccceEE Confidence 34456678999999999999985432211111000 000111112456776632 1223334444 Q ss_pred cccch-------hhHhh--HHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 311 RARRP-------EFQAD--QIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 311 ~~~~~-------~~~~d--~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) ..++. .++.+ .+++...+|.++++|++.+.|+.+..| T Consensus 266 ~~~~~~~~~~~~~~~~~~v~~r~~~r~d~~v~~~~a~~~l~~a~~~ 311 (311) T protein:vir:81 266 LIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) T ss_pred EeccCCCCcchhhhhcCcEEEEEEEEeccEeecccceEEEEeeccC Confidence 44321 12222 456678899999999999999999888 No 84 >protein:vir:97053 Length: 390 # NCBI annotation: putative head protein # Family: family:all:585 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453565;genbank:gi:84662600;genbank:GeneID:5142468 Probab=99.37 E-value=2.5e-14 Score=95.15 Aligned_cols=287 Identities=15% Similarity=0.082 Sum_probs=170.3 Q ss_pred CCC-------CccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEecccc--ceee Q lcl|Aclame:pro 1 MAN-------ATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVMGR--TKGY 71 (347) Q Consensus 1 m~~-------~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~iG~--~t~~ 71 (347) |.. ....-....+.+.....++.-.+..+.+...+.+..+..+.+++++++.++.+ .+..+|+... ..+. T Consensus 92 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~lip~~~~~~ii~~~~~~~~i~~~~~~~~~~~-~~~~~~~~~~~~~~a~ 170 (390) T protein:vir:97 92 TGRWNDRSARATMNIKAALNTASTDAAGSAGALTTPNRLPGFITPPDARLTVRDLIGSGRTDS-ALIEYVQETGFVNNAA 170 (390) T ss_pred HHHhhhhhhhhhhHHHHHHHhhhcccccccccccchhhhHHHHHHHhhhhhhHhhcceeeccC-CceEEEEEecCCccee Confidence 000 00000000001111122222236678889999999999998998888777664 4577777533 4555 Q ss_pred eecCCCCCCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccc Q lcl|Aclame:pro 72 YLAPGENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAAS 151 (347) Q Consensus 72 ~~~~g~~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~ 151 (347) .+..|..++.. +++..++++.+.++ ..-..|.+ +-.+.+.++.+.+.++.+++++++.|+.+|. +. .. T Consensus 171 ~v~Eg~~~~~~--~~~~~~i~~~~~k~-~~~~~is~-ell~ds~~l~~~i~~~la~a~~~~~d~a~l~----G~----g~ 238 (390) T protein:vir:97 171 IVAEGALKPES--SLKFAKKTDTTHVI-AHTMKATR-QILSDAPQLASYMNNRLIRGLKVKEDAEILR----GT----GA 238 (390) T ss_pred eecCCcccccc--ccceeEEEEeeeeE-EEeehhhH-HHHHhHHHHHHHHHHHHHHHHHHHHHHHHhh----cC----CC Confidence 66667776543 46788888888775 34445544 2334445788999999999999999998863 11 00 Q ss_pred ccccCcccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhcccccc Q lcl|Aclame:pro 152 NENIAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALID 231 (347) Q Consensus 152 ~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~ 231 (347) +..+. |..-..+.... .....+...++.|.++...+.....+.. .+|++|..|..|.+-.. .++.|.-.. T Consensus 239 ~~~p~----Gi~~~~~~~~~--~~~~~~~~~~d~~~~~~~~~~~~~~~~~--~~v~n~~~~~~L~~lkd-~~G~~l~~~- 308 (390) T protein:vir:97 239 NDGLL----GLIPQATTYAA--PTTIAGATRVDQLRLAMLQASLAEYPAS--GIVINPIDWAAIELAKD-ANNQYLIGN- 308 (390) T ss_pred Ccccc----ceeeccccccc--cccccccchHHHHHHHHHhhccccCCCC--EEEEcHHHHHHHHHhhc-CCCceeecC- Confidence 11111 11111110100 0011122346778888888888887643 56889999999875332 233333222 Q ss_pred ccccceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeecc Q lcl|Aclame:pro 232 PETGNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALER 311 (347) Q Consensus 232 ~~~G~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~ 311 (347) ...+...+++|.+|+.|+.+|... -+.+||++.+ ++ +..++++++. T Consensus 309 ~~~~~~~~l~G~pV~~~~~~~~~~-------------------------~~~gd~~~~~-~~--------~~~~~~~i~~ 354 (390) T protein:vir:97 309 ARGTLTPTLWGLPVVATQAMAPGE-------------------------FLVGAFDLAA-QI--------FDQWDARVEI 354 (390) T ss_pred ccCCCCceecceeeEEcCCCCCCc-------------------------EEEEeccceE-EE--------EEecceEEEE Confidence 234555789999999999998421 1334554422 12 3345556666 Q ss_pred ccch-hhHhhH--HhhhhhhcCcccccceEEEEEec Q lcl|Aclame:pro 312 ARRP-EFQADQ--IIGKYAMGHGGLRPEAAGALVFT 344 (347) Q Consensus 312 ~~~~-~~~~d~--i~~~~~~G~~~lRPe~~~~l~~~ 344 (347) .++. .++.+. ++....||..+++|++++.+.++ T Consensus 355 ~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~v~~~~a 390 (390) T protein:vir:97 355 GYVNDDFQRNMVTVLAEERLALVVYRPEALITGSFA 390 (390) T ss_pred eecccccccCcEEEEEEEeeccEEeccccEEEEEeC Confidence 6543 334443 66777899999999999999999 No 85 >protein:vir:1886 Length: 385 # NCBI annotation: major capsid subunit precursor # Family: family:all:585 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037666;genbank:gi:9634124;genbank:GeneID:1262513 Probab=99.37 E-value=1.6e-14 Score=96.28 Aligned_cols=287 Identities=15% Similarity=0.103 Sum_probs=166.2 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEeccc--cceeeeecCCCC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVMG--RTKGYYLAPGEN 78 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~iG--~~t~~~~~~g~~ 78 (347) +.+.... -.+..-....++.-.+..+.+..++.+.....+.++.++++.++. +.++++|+.. ...+.....|+. T Consensus 93 ~~~~~~~---~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~-~~~~~~~~~~~~~~~a~~v~E~~~ 168 (385) T protein:vir:18 93 QGTFGAK---TFNKSLGSDADSAGSLIQPMQIPGIIMPGLRRLTIRDLLAQGRTS-SNALEYVREEVFTNNADVVAEKAL 168 (385) T ss_pred hccchhh---HHHhhhccccccCCceecchhhhHHHHHhhhccchhhhcceeccc-CcceEEEEEecCCcceeeeccCcc Confidence 0000000 000000000111111455778888999988889888888877665 4578888763 334555556666 Q ss_pred CCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccCcc Q lcl|Aclame:pro 79 LDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIAGL 158 (347) Q Consensus 79 ~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~g~ 158 (347) ++. .+++..++++.+.++ +..+.|.+ +-.+...++.+.+.++.++++++..|+.+|. +.. .+....|. T Consensus 169 ~~~--~~~~~~~~~~~~~k~-~~~~~is~-ell~d~~~l~~~i~~~la~a~~~~~d~~~l~----G~g----~~~~~~Gi 236 (385) T protein:vir:18 169 KPE--SDITFSKQTANVKTI-AHWVQASR-QVMDDAPMLQSYINNRLMYGLALKEEGQLLN----GDG----TGDNLEGL 236 (385) T ss_pred ccc--cccceeEEEEeeeeE-EEeehhhH-HHHhhHHHHHHHHHHHHHHHHHHHHHHHHHh----ccC----CCCccccc Confidence 654 346777777777775 33344543 3334445688889999999999999998763 110 01111111 Q ss_pred cCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhccccccccccceE Q lcl|Aclame:pro 159 GQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALIDPETGNIR 238 (347) Q Consensus 159 ~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~v~ 238 (347) .........+. ...+...++.|.++..+|...+.+ .-.++++|..|..|.+-.. .+..|... +...|..+ T Consensus 237 ~~~~~~~~~~~------~~~~~~~~d~i~~~~~~l~~~~~~--~~~~~~~~~~~~~l~~lkd-~~G~~l~~-~~~~~~~~ 306 (385) T protein:vir:18 237 NKVATAYDTSL------NATGDTRADIIAHAIYQVTESEFS--ASGIVLNPRDWHNIALLKD-NEGRYIFG-GPQAFTSN 306 (385) T ss_pred ccccccccccc------cccccchHHHHHHHHHhhccccCC--CCEEEEcHHHHHHHHHhhc-CCCceecc-CcccCCCc Confidence 10000000000 111122467888888888777653 3367999999999876432 23344322 23466678 Q ss_pred EEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeeccccchh-- Q lcl|Aclame:pro 239 NVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARRPE-- 316 (347) Q Consensus 239 ~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~~~~-- 316 (347) .++|.+|++++.+|... -+.+||+... +++ ..++++++..+... T Consensus 307 ~l~G~pV~~~~~~p~~~-------------------------~~~gd~~~~~-~~~--------~~~~~~v~~~~~~~~~ 352 (385) T protein:vir:18 307 IMWGLPVVPTKAQAAGT-------------------------FTVGGFDMAS-QVW--------DRMDATVEVSREDRDN 352 (385) T ss_pred eecceeeEEcCcCCCCc-------------------------EEEeecccEE-EEE--------EecceEEEEeccccch Confidence 89999999999998421 1234444322 222 23344555443221 Q ss_pred hHh--hHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 317 FQA--DQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 317 ~~~--d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) +.- ..++..+.+|..+++|++.+.+..+++| T Consensus 353 ~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~aa~ 385 (385) T protein:vir:18 353 FVKNMLTILCEERLALAHYRPTAIIKGTFSSGS 385 (385) T ss_pred hhcCcEEEEEEEeeccEEecccceEEEEeccCC Confidence 222 2456777899999999999999999999 No 86 >protein:vir:191 Length: 385 # NCBI annotation: major head subunit precursor # Family: family:all:585 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037701;genbank:gi:9634158;genbank:GeneID:1262530 Probab=99.37 E-value=1.6e-14 Score=96.28 Aligned_cols=287 Identities=15% Similarity=0.103 Sum_probs=166.2 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEeccc--cceeeeecCCCC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVMG--RTKGYYLAPGEN 78 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~iG--~~t~~~~~~g~~ 78 (347) +.+.... -.+..-....++.-.+..+.+..++.+.....+.++.++++.++. +.++++|+.. ...+.....|+. T Consensus 93 ~~~~~~~---~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~-~~~~~~~~~~~~~~~a~~v~E~~~ 168 (385) T protein:vir:19 93 QGTFGAK---TFNKSLGSDADSAGSLIQPMQIPGIIMPGLRRLTIRDLLAQGRTS-SNALEYVREEVFTNNADVVAEKAL 168 (385) T ss_pred hccchhh---HHHhhhccccccCCceecchhhhHHHHHhhhccchhhhcceeccc-CcceEEEEEecCCcceeeeccCcc Confidence 0000000 000000000111111455778888999988889888888877665 4578888763 334555556666 Q ss_pred CCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccCcc Q lcl|Aclame:pro 79 LDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIAGL 158 (347) Q Consensus 79 ~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~g~ 158 (347) ++. .+++..++++.+.++ +..+.|.+ +-.+...++.+.+.++.++++++..|+.+|. +.. .+....|. T Consensus 169 ~~~--~~~~~~~~~~~~~k~-~~~~~is~-ell~d~~~l~~~i~~~la~a~~~~~d~~~l~----G~g----~~~~~~Gi 236 (385) T protein:vir:19 169 KPE--SDITFSKQTANVKTI-AHWVQASR-QVMDDAPMLQSYINNRLMYGLALKEEGQLLN----GDG----TGDNLEGL 236 (385) T ss_pred ccc--cccceeEEEEeeeeE-EEeehhhH-HHHhhHHHHHHHHHHHHHHHHHHHHHHHHHh----ccC----CCCccccc Confidence 654 346777777777775 33344543 3334445688889999999999999998763 110 01111111 Q ss_pred cCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhccccccccccceE Q lcl|Aclame:pro 159 GQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALIDPETGNIR 238 (347) Q Consensus 159 ~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~v~ 238 (347) .........+. ...+...++.|.++..+|...+.+ .-.++++|..|..|.+-.. .+..|... +...|..+ T Consensus 237 ~~~~~~~~~~~------~~~~~~~~d~i~~~~~~l~~~~~~--~~~~~~~~~~~~~l~~lkd-~~G~~l~~-~~~~~~~~ 306 (385) T protein:vir:19 237 NKVATAYDTSL------NATGDTRADIIAHAIYQVTESEFS--ASGIVLNPRDWHNIALLKD-NEGRYIFG-GPQAFTSN 306 (385) T ss_pred ccccccccccc------cccccchHHHHHHHHHhhccccCC--CCEEEEcHHHHHHHHHhhc-CCCceecc-CcccCCCc Confidence 10000000000 111122467888888888777653 3367999999999876432 23344322 23466678 Q ss_pred EEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeeccccchh-- Q lcl|Aclame:pro 239 NVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARRPE-- 316 (347) Q Consensus 239 ~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~~~~-- 316 (347) .++|.+|++++.+|... -+.+||+... +++ ..++++++..+... T Consensus 307 ~l~G~pV~~~~~~p~~~-------------------------~~~gd~~~~~-~~~--------~~~~~~v~~~~~~~~~ 352 (385) T protein:vir:19 307 IMWGLPVVPTKAQAAGT-------------------------FTVGGFDMAS-QVW--------DRMDATVEVSREDRDN 352 (385) T ss_pred eecceeeEEcCcCCCCc-------------------------EEEeecccEE-EEE--------EecceEEEEeccccch Confidence 89999999999998421 1234444322 222 23344555443221 Q ss_pred hHh--hHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 317 FQA--DQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 317 ~~~--d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) +.- ..++..+.+|..+++|++.+.+..+++| T Consensus 353 ~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~aa~ 385 (385) T protein:vir:19 353 FVKNMLTILCEERLALAHYRPTAIIKGTFSSGS 385 (385) T ss_pred hhcCcEEEEEEEeeccEEecccceEEEEeccCC Confidence 222 2456777899999999999999999999 No 87 >protein:vir:80684 Length: 315 # NCBI annotation: gp6 # Family: family:all:966 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285582;genbank:gi:148727088;genbank:GeneID:5247055 Probab=99.36 E-value=4.9e-14 Score=93.57 Aligned_cols=287 Identities=12% Similarity=-0.016 Sum_probs=159.0 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEec-cccceeeeecCCCCC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPV-MGRTKGYYLAPGENL 79 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~-iG~~t~~~~~~g~~~ 79 (347) ||.... +.|. .+.-+++.+++++..++.|+++.+.++..... ..++||+ .|.+.+.-+..|+.+ T Consensus 1 Ma~~~~------------~~gg--~~vP~~~~~~ii~~l~~~s~i~~l~~~i~~~~-~~~~ip~~~~~~~a~wv~Eg~~~ 65 (315) T protein:vir:80 1 MADDFL------------SAGK--LELPGSMIGAVRDRAIDSGVLAKLSPEQPTIF-GPVKGAVFSGVPRAKIVGEGEVK 65 (315) T ss_pred CCCCcC------------CcCc--eEcchHHHHHHHHHHHhhchhhhhcceeecCC-CceEEEEEeCCcceEEeeCCccc Confidence 887322 1111 14558999999999999999998877665543 4578887 455666666667776 Q ss_pred CCCCCCCCCCceEEEEeeeeecch-hhccHHHHHhCcc----hHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccc Q lcl|Aclame:pro 80 DDKRKDIKHSEKVIQIDGLLTSDV-LIYDIEDAMNHYD----VRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNEN 154 (347) Q Consensus 80 ~~~~~~~~~~~~~l~ID~~~~~~~-~Vdd~D~~q~~~D----~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~ 154 (347) +.+ +++.+++++..-+ +..+ .|.+-=..++..| +++.+.++.+++|++.+|+.++. +.... .... T Consensus 66 ~~s--~~~f~~v~l~~~k--l~~~~~iS~ell~~s~~~~~~~l~~~i~~~la~ai~~~~d~a~~~----G~~~~--~~~~ 135 (315) T protein:vir:80 66 PSA--SVDVSAFTAQPIK--VVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFH----GIDPA--TGKA 135 (315) T ss_pred ccc--ccceeeeEeeeee--EEeeehhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHhhheee----ccCCC--CCcc Confidence 543 4666666655444 3332 3332111122333 67888999999999999988763 11100 0011 Q ss_pred cCcccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhh----hhhccccc Q lcl|Aclame:pro 155 IAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPN----AANYAALI 230 (347) Q Consensus 155 ~~g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~----~~~~~~~~ 230 (347) ..+...... ........+ ...++.|+++..++...+.-.. ..++++|..+..|.+-.... +..+.- . T Consensus 136 ~~~~~~~~~-~~~~~~~~~------~~~~~d~~~~~~~~~~~~~~~~-~~~imn~~~~~~L~~l~~~~g~~~~g~~~~-~ 206 (315) T protein:vir:80 136 ASAVHTSLN-KTKNIVDAT------DSATADLVKAVGLIAGAGLQVP-NGVALDPAFSFALSTEVYPKGSPLAGQPMY-P 206 (315) T ss_pred ccccccccc-cccceeecc------ccchHHHHHHHHHHhhccCccc-eEEEEcHHHHHHHHHHhhccCCcccccccc-c Confidence 111111100 000000001 1124556677666655544322 34678999999997653222 222211 2 Q ss_pred cccccceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeec Q lcl|Aclame:pro 231 DPETGNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALE 310 (347) Q Consensus 231 ~~~~G~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e 310 (347) ....|..++++|.+|+.++++|....... +...--+.+||++.. +. ....++++ T Consensus 207 ~~~~g~~~tl~G~PV~~~~~~~~~~~~~~----------------~~~~~~~~GDfs~~~-~g---------~~~~~~i~ 260 (315) T protein:vir:80 207 AAGFAGLDNWRGLNVGASSTVSGAPEMSP----------------ASGVKAIVGDFSRVH-WG---------FQRNFPIE 260 (315) T ss_pred ccccCCCceecceeeEecCcCCccccccc----------------ccccEEEEeecccEE-EE---------EecCeeEE Confidence 34456668999999999999985422110 001112456777632 11 11222333 Q ss_pred cccc--h------hhHh--hHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 311 RARR--P------EFQA--DQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 311 ~~~~--~------~~~~--d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) ...+ + .++. -.+++...+|.++++|++.+.|..++|. T Consensus 261 i~~~~~~~~~~~~~~~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~a~ 307 (315) T protein:vir:80 261 LIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAP 307 (315) T ss_pred EeccccccCcccchhhcCcEEEEEEEEecceeecccceEEEeeccCC Confidence 2221 1 1222 2456678899999999999999744443 No 88 >protein:vir:10364 Length: 390 # NCBI annotation: head protein; major capsid subunit precursor # Family: family:all:585 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858956;genbank:gi:32128421;genbank:GeneID:2648357 Probab=99.34 E-value=5.8e-14 Score=93.15 Aligned_cols=279 Identities=16% Similarity=0.088 Sum_probs=163.4 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEeccc--cceeeeecCCCC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVMG--RTKGYYLAPGEN 78 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~iG--~~t~~~~~~g~~ 78 (347) ++..+. ......++.-.+....+...+.+.....+.++.++++.++.++ ++.+|+.. ..++.....|.. T Consensus 107 ~~~~~~--------~~~~~~~~~g~~~~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~~a~~v~Eg~~ 177 (390) T protein:vir:10 107 KAALNT--------ASTDAAGSAGALTTPNRLPGFITQPDARLTVRDLIGSGRTDSA-LIEYVQETGFVNNAAIVAEGAL 177 (390) T ss_pred HHHHHh--------hhcccccccccccchhHHHHHHHHHHhhchhhhhcceeeccCC-ceEEEEEecCCcceeeecCCcc Confidence 111000 0011111112356666777787777777878888887776544 67888653 345555566666 Q ss_pred CCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccCcc Q lcl|Aclame:pro 79 LDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIAGL 158 (347) Q Consensus 79 ~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~g~ 158 (347) ++. .+++..++++.+.++ .....|.+ +-.+...++.+.+.++.++++++..|+.++. +... +..+.|. T Consensus 178 ~~~--~~~~~~~i~~~~~k~-~~~~~is~-ell~d~~~l~~~i~~~l~~~~~~~~~~~il~----G~G~----~~~p~Gi 245 (390) T protein:vir:10 178 KPE--SSLKFAKKTDTTHVI-AHTMKATR-QILSDAPQLASYMNNRLIRGLKVKEDAEILR----GTGA----NDGLLGL 245 (390) T ss_pred ccc--cccceeEEEEeeEEE-EEeehhhH-HHHHhHHHHHHHHHHHHHHHHHHHHHHHHhh----cCCC----Ccccccc Confidence 654 346778888877775 33444544 2334446788999999999999999998763 1110 1111111 Q ss_pred cCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhccccccccccceE Q lcl|Aclame:pro 159 GQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALIDPETGNIR 238 (347) Q Consensus 159 ~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~v~ 238 (347) -........ .....+...++.+.++...|...+.+.. .+|++|..|..|.+-.. .+..|..... ..+... T Consensus 246 ~~~~~~~~~------~~~~~~~~~~~~~~~~~~~l~~~~~~~~--~~v~n~~~~~~L~~lkd-~~g~~l~~~~-~~~~~~ 315 (390) T protein:vir:10 246 IPQATTYAA------PTTIAGATRVDQLRLAMLQASLAEYPAS--GIVINPIDWAAIELAKD-ANNQYLIGNA-RGTLTP 315 (390) T ss_pred ccccccccc------cccccccchHHHHHHHHHhhccccCCCC--EEEEcHHHHHHHHHhhc-CCCceeecCC-cCcCCc Confidence 111000000 0001112236778888888888887643 56899999999875332 2334432222 234456 Q ss_pred EEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeeccccch-hh Q lcl|Aclame:pro 239 NVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARRP-EF 317 (347) Q Consensus 239 ~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~~~-~~ 317 (347) .++|.+|+.++.+|... -+.+||+..+.++ ..+.++++..+.. .+ T Consensus 316 ~l~G~pv~~~~~~p~~~-------------------------~~~gdf~~~~~~~---------~~~~~~i~~~~~~~~~ 361 (390) T protein:vir:10 316 TLWGLPVVATQAMAPGE-------------------------FLVGAFDLAAQIF---------DQWDARVEIGYVNDDF 361 (390) T ss_pred eecceeeEEcCCCCCCc-------------------------EEEEeccceEEEE---------EecceEEEEeeccccc Confidence 79999999999998421 1345666543222 2344455554432 23 Q ss_pred Hhh--HHhhhhhhcCcccccceEEEEEec Q lcl|Aclame:pro 318 QAD--QIIGKYAMGHGGLRPEAAGALVFT 344 (347) Q Consensus 318 ~~d--~i~~~~~~G~~~lRPe~~~~l~~~ 344 (347) ..+ .+++...++.++++|++.+.+.++ T Consensus 362 ~~~~~~~r~~~r~d~~v~~~~a~~~~~~a 390 (390) T protein:vir:10 362 QRNMVTVLAEERLALVVYRPEALISGSFA 390 (390) T ss_pred ccCcEEEEEEEeeccEEeccccEEEEEeC Confidence 334 455678899999999999999999 No 89 >protein:vir:9574 Length: 300 # NCBI annotation: gp40 # Family: family:all:966 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862879;genbank:gi:32469471;genbank:GeneID:1461316 Probab=99.34 E-value=1.8e-13 Score=90.46 Aligned_cols=284 Identities=12% Similarity=-0.022 Sum_probs=164.3 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEec-cccceeeeecCCCCC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPV-MGRTKGYYLAPGENL 79 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~-iG~~t~~~~~~g~~~ 79 (347) ||+.+.- +|. +..+++..++.+..++.|.++.+.+++.+..| .+.+|+ .+...+..+..|+.+ T Consensus 1 ma~~t~~------------~G~---lip~~~~~~ii~~l~~~s~i~~l~~~~~~~~~-~~~~p~~~~~~~a~wv~Eg~~~ 64 (300) T protein:vir:95 1 MSEAQLS------------KGN---LFNPELVTKVINKVKGHSSIAKLSPQKPIPFN-GQREFVFDFDSDIDIVAENGKK 64 (300) T ss_pred CcccccC------------Ccc---eechhhHHHHHHHHHhhhhhhhhcceeeccCC-ceEEEEEecCcceEEeeCCccc Confidence 9985431 111 55688999999999999988888777766654 466775 455566666667666 Q ss_pred CCCCCCCCCCceEEEEeeeeecchhhccHHHHH-----hCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccc Q lcl|Aclame:pro 80 DDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAM-----NHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNEN 154 (347) Q Consensus 80 ~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q-----~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~ 154 (347) +.+ +++.+++++..-+. +.-..|.+ |.. ...|+.+.+.++.+++++++.|+.++. +.......... T Consensus 65 ~~s--~~~f~~v~l~~~k~-~~~~~iS~--ell~~~~d~~~~l~~~i~~~l~~aia~~~d~~~l~----G~~~~~g~~~~ 135 (300) T protein:vir:95 65 THG--GVSLDPVTIVPLKV-EYGARVSD--EFLHASEEAKVDMLTDFVEGFSKKLARGLDIMSIH----GINPRTKQAST 135 (300) T ss_pred ccc--cccceeeEeeeEEE-EEeehhhH--HHhccCCCCHHHHHHHHHHHHHHHHHHHHHHhhhh----cccCCCCCCcc Confidence 543 46777777665543 33333432 222 236788999999999999999998873 11111111100 Q ss_pred cCcccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhccccccccc Q lcl|Aclame:pro 155 IAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALIDPET 234 (347) Q Consensus 155 ~~g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~ 234 (347) +.+. ....+... ......+...++.|.++..++...+... ..++++|..+..|.+-.. .+..|.-...... T Consensus 136 ~~~~----~~~~~~~~--~~~~~~~~~~~~~i~~~~~~~~~~~~~~--~~~vmn~~~~~~L~~lkd-~~G~~i~~~~~~~ 206 (300) T protein:vir:95 136 IIGD----NCFDKKVT--QTVPFKDTNPDESMEDAVGMIDGSERDI--TGAILDPIFTTALSKMKN-AEGGKLYPELAWG 206 (300) T ss_pred cccc----cccccccc--eeecccccchHHHHHHHHHHhhhcCCCc--cEEEECHHHHHHHHHhhc-cCCCeeccCcccc Confidence 1000 00000000 0001111223677888888887766532 357899999999875332 2233332233445 Q ss_pred cceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeeccc-- Q lcl|Aclame:pro 235 GNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERA-- 312 (347) Q Consensus 235 G~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~-- 312 (347) |..++++|++|+.|+.+|...... ...-+.+||++.+-+.. .++++++.. T Consensus 207 ~~~~~l~G~Pv~~s~~v~~~~~~~-------------------~~~~~~GDf~~~~~~~~---------~~~~~~~v~~~ 258 (300) T protein:vir:95 207 GVPDAINGLAVDKNRTVSYSQTDP-------------------KNTAIVGDFETMFKWGY---------AKEVPMEIIKY 258 (300) T ss_pred CCCceecceeeEEecCCCCCCCCC-------------------ccEEEEeeccceEEEEE---------ecccEEEEeec Confidence 677899999999999998432110 01123466765432221 122222222 Q ss_pred cchh------hHh--hHHhhhhhhcCcccccceEEEEEecCC Q lcl|Aclame:pro 313 RRPE------FQA--DQIIGKYAMGHGGLRPEAAGALVFTPA 346 (347) Q Consensus 313 ~~~~------~~~--d~i~~~~~~G~~~lRPe~~~~l~~~~a 346 (347) -++. ++. -.++....+|.+++||++.+.|...+= T Consensus 259 ~~~d~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~g 300 (300) T protein:vir:95 259 GDPDNSGRDLKGYNQIYIRCEAYIGWGIMDAASFARIVKTGG 300 (300) T ss_pred cCCCCcchhhhhcCcEEEEEEEeecceeecccceEEEecCCC Confidence 1211 222 345778889999999999999875555 No 90 >protein:vir:94771 Length: 298 # NCBI annotation: major head protein # Family: family:all:966 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996706;genbank:gi:45597421;genbank:GeneID:2769044 Probab=99.33 E-value=1.2e-13 Score=91.41 Aligned_cols=283 Identities=12% Similarity=-0.013 Sum_probs=167.2 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEecc-ccceeeeecCCCCC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVM-GRTKGYYLAPGENL 79 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~i-G~~t~~~~~~g~~~ 79 (347) |+-. . |. +..++|..++.+..+..|+++.+.++.++.+| .+.||++ +.+.+..+..|+++ T Consensus 1 ma~~-g--------------G~---lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~-~~~~p~~~~~~~a~~v~Eg~~~ 61 (298) T protein:vir:94 1 MVLN-K--------------GT---LFDPELVTDLISKVAGKSSIARLSAQKPIPFN-GEKVFTFTMDSEIDVVAESGKK 61 (298) T ss_pred Ceec-c--------------cc---ccChhHHHHHHHHHHhhchhhhhcceeeccCC-ceEEEEEecCcceEEeeCCccc Confidence 5541 1 11 45588999999999999999998887776654 4788876 56677777777777 Q ss_pred CCCCCCCCCCceEEEEeeeeecchhhccHHHHH----hCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccc Q lcl|Aclame:pro 80 DDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAM----NHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENI 155 (347) Q Consensus 80 ~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q----~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~ 155 (347) +.+ +++.+++++..-+. .....|.+ +-.+ ...++.+.+.++.+++|++.+|+.++.- ...... ... T Consensus 62 ~~~--~~~f~~v~l~~~k~-~~~~~iS~-ell~~~~~~~~~l~~~i~~~la~ai~~~~d~~~l~G----~~~~~g--~~~ 131 (298) T protein:vir:94 62 THG--GVTLAPQTMVPIKV-EYGARISD-EFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHG----VNPRLG--TAS 131 (298) T ss_pred ccc--ccceeEEEEeeeEE-EEeeehhH-HHhccCCccHHHHHHHHHHHHHHHHHHHHHHHhhcc----cccCCC--ccc Confidence 643 46667766665554 22333432 1122 2346888899999999999999988742 110000 000 Q ss_pred CcccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhcccccccccc Q lcl|Aclame:pro 156 AGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALIDPETG 235 (347) Q Consensus 156 ~g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G 235 (347) .+.+...... ..+...........+++.|.++..+|...+... ..++++|..|..|.+-.. .+..|.-......| T Consensus 132 ~~~~~~~~~~--~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~--~~~vmn~~~~~~l~~lkd-~~G~~l~~~~~~~~ 206 (298) T protein:vir:94 132 AVIGTNHFDS--KVTQKVEAPRGIADPNGAIENAVELLTGVDADV--TGIAINPSFRSALAKQKD-LQGNALFPELKWGA 206 (298) T ss_pred cccccccccc--ccccccccccccccHHHHHHHHHHhhhhcCCCc--cEEEEcHHHHHHHHHhhc-cCCCeeecCcccCC Confidence 1110000000 000001111112234677888888998888653 358999999999876322 23344333445567 Q ss_pred ceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeecccc-- Q lcl|Aclame:pro 236 NIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERAR-- 313 (347) Q Consensus 236 ~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~-- 313 (347) ..++++|++|+.++.+|....+. ...-+.+||++.+.+. ..++++++... T Consensus 207 ~~~tl~G~PV~~~~~v~~~~~~~-------------------~~~~~~Gdfs~~~~~~---------~~~~~~~~~~~~~ 258 (298) T protein:vir:94 207 TPDTINGLPVDVNKTVSDMSLTQ-------------------RDRAIIGDFANGFKWG---------YAKEVPLEVIQYG 258 (298) T ss_pred CCceecceeeEEecccccccCCC-------------------ccEEEEeeccceEEEE---------EecCceEEEeecC Confidence 77899999999999998431100 0112446666543211 12333333322 Q ss_pred chh------hHhh--HHhhhhhhcCcccccceEEEEEecC Q lcl|Aclame:pro 314 RPE------FQAD--QIIGKYAMGHGGLRPEAAGALVFTP 345 (347) Q Consensus 314 ~~~------~~~d--~i~~~~~~G~~~lRPe~~~~l~~~~ 345 (347) ++. ++.| .+++.+.+|.+++||++.+.|..+. T Consensus 259 ~~d~~~~~~f~~~~v~~r~~~r~~~~~~~~~a~~~l~~~t 298 (298) T protein:vir:94 259 DPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) T ss_pred CCcCcchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 111 2222 3577788999999999999997666 No 91 >protein:vir:4456 Length: 401 # NCBI annotation: Major capsid protein precursor # Family: family:all:21 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700379;genbank:gi:23505451;genbank:GeneID:955658 Probab=99.33 E-value=1e-13 Score=91.78 Aligned_cols=281 Identities=12% Similarity=0.029 Sum_probs=155.7 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEec-cccceeeeecCCCCC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPV-MGRTKGYYLAPGENL 79 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~-iG~~t~~~~~~g~~~ 79 (347) |+..+ +..+| .+.-++|..++.+..+..+.++.+.++.++.++ +..++. .+.+.+.-..-|... T Consensus 107 ~~~~~-----------~~~GG---~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~a~wv~E~~~~ 171 (401) T protein:vir:44 107 LQVGT-----------DEDGG---YAVPEELDRSILSLLKDEVVMRQEATVITVGGS-DYKKLVNLGGTASGWVGETDTR 171 (401) T ss_pred hhcCC-----------CCCCc---eeccHhHHHHHHHHHHhhhhhhhhceeeecCCC-ceEEEEecCCccceeecccccc Confidence 21100 00011 134489999999999999999998887776554 344553 444444433334433 Q ss_pred CCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccCccc Q lcl|Aclame:pro 80 DDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIAGLG 159 (347) Q Consensus 80 ~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~g~~ 159 (347) +.+ .....+++++.+-++ ..-..|.+-=...+.+|+.+.+.++.++++++..|+.++. +-. .+.+ T Consensus 172 ~~~-~~~~~~~v~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~la~ai~~~~~~~~l~----G~G---------~~~p 236 (401) T protein:vir:44 172 SQT-ATSRLGLIEPFMGEI-YGNPQATQKMLDDAFFNVEAWINSELATEFAEQEEIAFTT----GDG---------TKKP 236 (401) T ss_pred Ccc-ccccceeeeeehhhe-eeehhhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhhc----cCC---------CCcc Confidence 322 123455555555443 2223343332334567899999999999999999998863 110 0011 Q ss_pred Cceee-----e------ecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhccc Q lcl|Aclame:pro 160 QAVVL-----N------IGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAA 228 (347) Q Consensus 160 ~~~~i-----~------~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~ 228 (347) .|..- . .+..... .........++.|+++...|..... .+-.++++|..|..|.+-.. .+..|.- T Consensus 237 ~Gil~~~~~~~~~~~~~~~~~~~~-~t~~~~~~~~d~i~~~~~~l~~~~~--~~a~~v~n~~~~~~L~~lkd-~~G~~l~ 312 (401) T protein:vir:44 237 KGFLAYESTEESDKARAFGKLQHI-VSGEATAVTADAIIKLIYTLRKAHR--TGAKFMMNNNSLFAIRLLKD-TEGNYLW 312 (401) T ss_pred ceeecccccccccccccccccccc-ccccccccCHHHHHHHHHhcchhhh--cCCEEEEcHHHHHHHHHhhc-cCCceee Confidence 11000 0 0000000 0001111126778888877766543 23456899999998865322 2233433 Q ss_pred cccccccceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhhee Q lcl|Aclame:pro 229 LIDPETGNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMA 308 (347) Q Consensus 229 ~~~~~~G~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~ 308 (347) ..++..|..++++|.+|+.++++|....+.. .-+.+||+... .++.+ +.++ T Consensus 313 ~~~~~~g~~~~l~G~PVv~~~~~p~~~~~~~--------------------~i~~Gd~~~~~-~i~~~--------~~~~ 363 (401) T protein:vir:44 313 RPGLELGQPSSLAGYGIAENEQMPDIAADAK--------------------AIAFGNFKRGY-TIVDR--------IGTR 363 (401) T ss_pred cCCcCCCCCceecceeeEEecCcCCccCCcc--------------------EEEEeehhccE-EEEEe--------cceE Confidence 3455677778899999999999985322111 11234554322 22222 2233 Q ss_pred eccccchhhH--hhHHhhhhhhcCcccccceEEEEEecCC Q lcl|Aclame:pro 309 LERARRPEFQ--ADQIIGKYAMGHGGLRPEAAGALVFTPA 346 (347) Q Consensus 309 ~e~~~~~~~~--~d~i~~~~~~G~~~lRPe~~~~l~~~~a 346 (347) ++ +++-.. ...+++.+.+|.++++|++.+.|..++| T Consensus 364 ~~--~~~~~~~~~v~~~a~~r~d~~~~~~~a~~~l~~~aa 401 (401) T protein:vir:44 364 IL--RDPYTNKPFVGFYTTKRTGGMLVDSQAIKLLKIAAA 401 (401) T ss_pred Ee--eeccccCCcEEEEEEEEeccEEecccceEEEEeecC Confidence 32 222111 2336777889999999999999988888 No 92 >protein:vir:96223 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239571;genbank:gi:66395304;genbank:GeneID:5132771 Probab=99.33 E-value=4.8e-14 Score=93.61 Aligned_cols=285 Identities=12% Similarity=0.054 Sum_probs=163.1 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEecc-ccceeeeecCCCCC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVM-GRTKGYYLAPGENL 79 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~i-G~~t~~~~~~g~~~ 79 (347) +.+.--.+. .+........+...+.-+.+..++.+..+..|.++.+.++.++.+ .+++||+. +.+.+.-+..|..+ T Consensus 15 ~~~~~~~~~--~~a~~~~~~~~~~~lip~~~~~~ii~~~~~~s~l~~l~~~~~~~~-~~~~~p~~~~~~~a~~v~Eg~~~ 91 (324) T protein:vir:96 15 ASNNVKPQV--FNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEG-TEKKFTFWADKPGAYWVGEGQKI 91 (324) T ss_pred HHhhhhhhh--cccccccccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccC-CceEEEEEecCcceeeecCCccc Confidence 110000000 111111111222235668999999999999999999888777654 56888876 45566666667776 Q ss_pred CCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccCccc Q lcl|Aclame:pro 80 DDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIAGLG 159 (347) Q Consensus 80 ~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~g~~ 159 (347) +. .+++..++++..-+. ..-..|.+-=..++.+|+.+.+.++.++++++..|+.+|. +... .....+ T Consensus 92 ~~--~~~~f~~v~~~~~k~-~~~~~is~ell~ds~~~l~~~i~~~l~~aia~~~d~~~l~----G~g~----~~~~~~-- 158 (324) T protein:vir:96 92 ET--SKATWVNATMRAFKL-GVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGIL----NQGN----NPFGKS-- 158 (324) T ss_pred cc--cccceeEEEEEeEEE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhh----cCCC----CCcCcc-- Confidence 54 346777777776664 3334554422223568899999999999999999998873 1100 000111 Q ss_pred CceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhccccccccccceEE Q lcl|Aclame:pro 160 QAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALIDPETGNIRN 239 (347) Q Consensus 160 ~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~v~~ 239 (347) ..-....... ...+...++.|+++...+...+... -.++++|..|..|.+-.. .+ |...+..+..+. T Consensus 159 --~~~~~~~~~~----~~~~~~~~~~i~~~~~~i~~~~~~~--~~~i~n~~~~~~L~~lkd-~~----G~~~~~~~~~~~ 225 (324) T protein:vir:96 159 --IAQSIKKTNK----VIKGDFTQDNIIDLEALLEDDELEA--NAFISKTQNRSLLRKIVD-PE----TKERIYDRNSDS 225 (324) T ss_pred --ccccccccce----ecccccchHHHHHHHHhhhhccCCC--CEEEEcHHHHHHHHHhhC-CC----CCeeecCCCCCc Confidence 0000000000 0011112677888888887776533 357899999999875422 11 222234556678 Q ss_pred EeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeeccccchh--- Q lcl|Aclame:pro 240 VMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARRPE--- 316 (347) Q Consensus 240 i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~~~~--- 316 (347) ++|++|+.++..+... +.-+.+||++.. .+..++++++...+.. T Consensus 226 l~G~PV~~~~~~~~~~-----------------------~~~~~gd~s~~~----------~~~~~~~~i~~~~~~~~~~ 272 (324) T protein:vir:96 226 LDGLPVVNLKSSNLKR-----------------------GELITGDFDKLI----------YGIPQLIEYKIDETAQLST 272 (324) T ss_pred ccceeeEeecCCCCCc-----------------------ceEEEEecceEE----------EEEecCcEEEEeecccccc Confidence 9999999877654211 111234444321 1223344444433211 Q ss_pred -----------hH--hhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 317 -----------FQ--ADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 317 -----------~~--~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) ++ .-.++..+.+|.+++||++.+.|..+.+. T Consensus 273 ~~~~~~~~~~~~~~n~v~~r~~~r~d~~v~~~~a~~~l~~a~~~ 316 (324) T protein:vir:96 273 VKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKR 316 (324) T ss_pred cccccccchhhhhcCcEEEEEEEEeccEEecccceEEEeccccc Confidence 12 23457778899999999999988765555 No 93 >protein:vir:103955 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873992;genbank:gi:118430767;genbank:GeneID:4525449 Probab=99.33 E-value=9.5e-14 Score=91.98 Aligned_cols=282 Identities=11% Similarity=0.076 Sum_probs=161.3 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEecc-ccceeeeecCCCCC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVM-GRTKGYYLAPGENL 79 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~i-G~~t~~~~~~g~~~ 79 (347) |...+.. +........+...+.-+.+..++.+.....|.++.+.++.++.+ .+++||+. +...+.-+..|..+ T Consensus 18 ~~~~~~~-----~a~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~-~~~~~p~~~~~~~a~~v~Eg~~~ 91 (324) T protein:vir:10 18 NVKPQVF-----NPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEG-TEKKFTFWADKPGAYWVGEGQKI 91 (324) T ss_pred hhcccee-----cccceeccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccC-CceEEEEEeCCcceeEeccCccc Confidence 2111110 00111111222235668999999999999999999888777654 46888876 45566666677776 Q ss_pred CCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccCccc Q lcl|Aclame:pro 80 DDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIAGLG 159 (347) Q Consensus 80 ~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~g~~ 159 (347) +. .+++..++++..-++ ..-..|.+-=..++.+|+.+.+.++.++++++..|+.+|.- ... +. .+ T Consensus 92 ~~--~~~~~~~v~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~ai~~~~d~a~l~G----~g~----~~----~~ 156 (324) T protein:vir:10 92 ET--SKATWVNATMRAFKL-GVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILN----QGN----NP----FG 156 (324) T ss_pred cc--cccceeEEEEeeEEE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhhc----CCC----Cc----cC Confidence 54 346777777766554 33344544222234578999999999999999999988631 110 00 01 Q ss_pred CceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhccccccccccceEE Q lcl|Aclame:pro 160 QAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALIDPETGNIRN 239 (347) Q Consensus 160 ~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~v~~ 239 (347) .+..-......... .+...++.|+++...|...+... -.++++|..|..|.+-.+ -++...+..+.-++ T Consensus 157 ~~i~~~~~~~~~~~----~~~~t~~~i~~~~~~l~~~~~~~--~~~v~n~~~~~~L~~l~d-----~~g~~~~~~~~~~~ 225 (324) T protein:vir:10 157 KSIAQSIEKTNKVI----KGDFTQDNIIDLEALLEDDELEA--NAFISKTQNRSLLRKIVD-----PETKERIYDRNSDT 225 (324) T ss_pred ccccccccccceec----cccCCHHHHHHHHHhhhhccCCC--CEEEEcHHHHHHHHHhhc-----cCCceeecCCCCcc Confidence 11110000000000 01112677888888888876533 256899999999875322 12222233445567 Q ss_pred EeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeeccccchh--- Q lcl|Aclame:pro 240 VMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARRPE--- 316 (347) Q Consensus 240 i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~~~~--- 316 (347) ++|.+|+.++..+... ..-+.+||++.+ .+..++++++...+.. T Consensus 226 l~G~PV~~~~~~~~~~-----------------------~~~~~gd~~~~~----------~~~~~~~~i~~~~~~~~~~ 272 (324) T protein:vir:10 226 LDGLPVVNLKSSNLKR-----------------------GELITGDFDKLI----------YGIPQLIEYKIDETAQLST 272 (324) T ss_pred ccceeEEeecCCCCCc-----------------------ceEEEEecccEE----------EEEecCcEEEEeecccccc Confidence 9999999877654211 111334554432 1223334444332210 Q ss_pred -----------hH--hhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 317 -----------FQ--ADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 317 -----------~~--~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) ++ .-.++..+.+|.++++|++.+.|..+.+. T Consensus 273 ~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~A~~~l~~a~~~ 316 (324) T protein:vir:10 273 VKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKK 316 (324) T ss_pred cccccccchhhhhcCcEEEEEEEEEccEEecccceEEEEeccCC Confidence 11 23456678899999999999998765554 No 94 >protein:vir:96392 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239648;genbank:gi:66395381;genbank:GeneID:5132868 Probab=99.33 E-value=5.2e-14 Score=93.42 Aligned_cols=286 Identities=12% Similarity=0.096 Sum_probs=165.3 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEecc-ccceeeeecCCCCC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVM-GRTKGYYLAPGENL 79 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~i-G~~t~~~~~~g~~~ 79 (347) .++-...... .+.......++...+.-+.|..++.+..+..|.++.+.++.++. |.+++||+. +.+.+.-+..|..+ T Consensus 14 ~~~~~~~~~~-~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~l~~~~~~~-~~~~~~p~~~~~~~a~~v~Eg~~~ 91 (324) T protein:vir:96 14 FASNNVKPQV-FNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPME-GTEKKFTFWADKPGAYWVGEGQKI 91 (324) T ss_pred HHHHhhhhhh-hccccccccCcCccccchhHHHHHHHHHHhhchhhhhcceeecc-CCceEEEEEecCcceeEecCCccc Confidence 1110000000 00000111122223556889999999999999999988876665 456888876 55566666677777 Q ss_pred CCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccCccc Q lcl|Aclame:pro 80 DDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIAGLG 159 (347) Q Consensus 80 ~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~g~~ 159 (347) +. .+++.+++++..-+. ..-..|.+-=..++.+|+.+.+.++.++++++..|+.+|. +... ... + T Consensus 92 ~~--~~~~~~~v~~~~~k~-~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~l~----G~g~----~~~----~ 156 (324) T protein:vir:96 92 ET--SKATWVNATMRAFKL-GVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGIL----NQGN----NPF----G 156 (324) T ss_pred cc--cccceeEEEEeeEEE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhc----cCCC----CCc----C Confidence 64 346777777777664 3444454422234567999999999999999999998863 1110 000 1 Q ss_pred CceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhccccccccccceEE Q lcl|Aclame:pro 160 QAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALIDPETGNIRN 239 (347) Q Consensus 160 ~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~v~~ 239 (347) .+.....+..... ..+...++.|+++...|...+... ..++++|..|..|.+-... + |...+..+.... T Consensus 157 ~gi~~~~~~~~~~----~~~~~t~~~i~~~~~~l~~~~~~~--~~~vmn~~~~~~L~~l~d~-~----G~~~~~~~~~~~ 225 (324) T protein:vir:96 157 KSIAQSIEKTNKV----IKGDFTQDNIIDLEALLEDDELEA--NAFISKTQNRSLLRKIVDP-E----TKERIYDRNSDS 225 (324) T ss_pred cccccccccccee----ccccccHHHHHHHHHhhhhccCCC--CEEEEcHHHHHHHHHhhcc-C----CCeeecCCCCCc Confidence 1111000000000 011123677888888888877533 3679999999998753221 1 222244566678 Q ss_pred EeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeeccccchh--- Q lcl|Aclame:pro 240 VMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARRPE--- 316 (347) Q Consensus 240 i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~~~~--- 316 (347) ++|++|+.++..+.. ...-+.+||++.+ .+..+++++|...+.- T Consensus 226 l~G~PV~~~~~~~~~-----------------------~~~~~~gd~~~~~----------~g~~~~~~i~~~~~~~~~~ 272 (324) T protein:vir:96 226 LDGLPVVNLKSSNLK-----------------------RGELITGDFDKLI----------YGIPQLIEYKIDETAQLST 272 (324) T ss_pred ccceeeEeeCCCCCC-----------------------cceEEEEecceEE----------EEEecCcEEEEeecccccc Confidence 999999987765421 0111334555422 1233444554443221 Q ss_pred -----------hH--hhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 317 -----------FQ--ADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 317 -----------~~--~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) ++ .-.+++.+.+|.+++||++.+.|..+.+. T Consensus 273 ~~~~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~A~~~l~~a~~~ 316 (324) T protein:vir:96 273 VKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKR 316 (324) T ss_pred cccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeccccc Confidence 11 24456778899999999999998876655 No 95 >protein:vir:78830 Length: 324 # NCBI annotation: major head protein # Family: family:all:507 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285361;genbank:gi:148717889;genbank:GeneID:5246961 Probab=99.33 E-value=5.2e-14 Score=93.42 Aligned_cols=286 Identities=12% Similarity=0.096 Sum_probs=165.3 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEecc-ccceeeeecCCCCC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVM-GRTKGYYLAPGENL 79 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~i-G~~t~~~~~~g~~~ 79 (347) .++-...... .+.......++...+.-+.|..++.+..+..|.++.+.++.++. |.+++||+. +.+.+.-+..|..+ T Consensus 14 ~~~~~~~~~~-~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~l~~~~~~~-~~~~~~p~~~~~~~a~~v~Eg~~~ 91 (324) T protein:vir:78 14 FASNNVKPQV-FNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPME-GTEKKFTFWADKPGAYWVGEGQKI 91 (324) T ss_pred HHHHhhhhhh-hccccccccCcCccccchhHHHHHHHHHHhhchhhhhcceeecc-CCceEEEEEecCcceeEecCCccc Confidence 1110000000 00000111122223556889999999999999999988876665 456888876 55566666677777 Q ss_pred CCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccCccc Q lcl|Aclame:pro 80 DDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIAGLG 159 (347) Q Consensus 80 ~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~g~~ 159 (347) +. .+++.+++++..-+. ..-..|.+-=..++.+|+.+.+.++.++++++..|+.+|. +... ... + T Consensus 92 ~~--~~~~~~~v~~~~~k~-~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~l~----G~g~----~~~----~ 156 (324) T protein:vir:78 92 ET--SKATWVNATMRAFKL-GVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGIL----NQGN----NPF----G 156 (324) T ss_pred cc--cccceeEEEEeeEEE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhc----cCCC----CCc----C Confidence 64 346777777777664 3444454422234567999999999999999999998863 1110 000 1 Q ss_pred CceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhccccccccccceEE Q lcl|Aclame:pro 160 QAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALIDPETGNIRN 239 (347) Q Consensus 160 ~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~v~~ 239 (347) .+.....+..... ..+...++.|+++...|...+... ..++++|..|..|.+-... + |...+..+.... T Consensus 157 ~gi~~~~~~~~~~----~~~~~t~~~i~~~~~~l~~~~~~~--~~~vmn~~~~~~L~~l~d~-~----G~~~~~~~~~~~ 225 (324) T protein:vir:78 157 KSIAQSIEKTNKV----IKGDFTQDNIIDLEALLEDDELEA--NAFISKTQNRSLLRKIVDP-E----TKERIYDRNSDS 225 (324) T ss_pred cccccccccccee----ccccccHHHHHHHHHhhhhccCCC--CEEEEcHHHHHHHHHhhcc-C----CCeeecCCCCCc Confidence 1111000000000 011123677888888888877533 3679999999998753221 1 222244566678 Q ss_pred EeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeeccccchh--- Q lcl|Aclame:pro 240 VMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARRPE--- 316 (347) Q Consensus 240 i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~~~~--- 316 (347) ++|++|+.++..+.. ...-+.+||++.+ .+..+++++|...+.- T Consensus 226 l~G~PV~~~~~~~~~-----------------------~~~~~~gd~~~~~----------~g~~~~~~i~~~~~~~~~~ 272 (324) T protein:vir:78 226 LDGLPVVNLKSSNLK-----------------------RGELITGDFDKLI----------YGIPQLIEYKIDETAQLST 272 (324) T ss_pred ccceeeEeeCCCCCC-----------------------cceEEEEecceEE----------EEEecCcEEEEeecccccc Confidence 999999987765421 0111334555422 1233444554443221 Q ss_pred -----------hH--hhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 317 -----------FQ--ADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 317 -----------~~--~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) ++ .-.+++.+.+|.+++||++.+.|..+.+. T Consensus 273 ~~~~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~A~~~l~~a~~~ 316 (324) T protein:vir:78 273 VKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKR 316 (324) T ss_pred cccccccchhhhhcCcEEEEEEEEEccEEecccceEEEeccccc Confidence 11 24456778899999999999998876655 No 96 >protein:vir:9759 Length: 303 # NCBI annotation: putative structural protein # Family: family:all:966 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795521;genbank:gi:28876283;genbank:GeneID:1257824 Probab=99.33 E-value=1.7e-13 Score=90.57 Aligned_cols=287 Identities=10% Similarity=-0.009 Sum_probs=163.6 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEecc-ccceeeeecCCCCC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVM-GRTKGYYLAPGENL 79 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~i-G~~t~~~~~~g~~~ 79 (347) |+.... +| .+.-+++..++.+..+..|.++.+.++....+ .+++||+. +.+.+..+..|..+ T Consensus 1 m~t~t~-------------gg---~liP~~~~~~ii~~l~~~s~i~~l~~~~~~~~-~~~~ip~~~~~~~a~wv~E~~~~ 63 (303) T protein:vir:97 1 MGTETS-------------KA---SLFDKHLVSDLINKVKGHSSLAKLSSQKPIPF-NGSKEFTFTLDSDIDVVAENGKK 63 (303) T ss_pred CcccCC-------------CC---eEcchhHHHHHHHHHHhhchhhhhcceeecCC-CceEEEEEecCcceEEeecCccc Confidence 554211 01 14558899999999999999999988777664 45788774 55667666667766 Q ss_pred CCCCCCCCCCceEEEEeeeeecchhhccHHHH----HhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccc Q lcl|Aclame:pro 80 DDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDA----MNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENI 155 (347) Q Consensus 80 ~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~----q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~ 155 (347) +.+ +++.+++++..-+. .....|.+ +-. ....++.+.+.++.+++|++..|+.++.-. ...... T Consensus 64 ~~s--~~~f~~v~l~~~kl-~~~~~iS~-ell~~~~d~~~~l~~~i~~~la~a~~~~ld~a~l~G~----~~~~g~---- 131 (303) T protein:vir:97 64 THG--GLSLEPVTIVPIKV-EYGARLSD-EFLYATEEEKIDILKAFNEGFAKKLARGIDLMAMHGI----NPRTKK---- 131 (303) T ss_pred ccc--ccceeeEEeeeEEE-EEeehhhH-HHhhcCccchHHHHHHHHHHHHHHHHHHHHhhhhccc----ccCCcc---- Confidence 543 46666666655443 23333322 111 234568899999999999999999887421 100000 Q ss_pred CcccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhcccccccc-c Q lcl|Aclame:pro 156 AGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALIDPE-T 234 (347) Q Consensus 156 ~g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~-~ 234 (347) .+...+..+..+.++.. .........++.|.++..++...+... ..++++|..+..|.+-... +..|.-..+.. . T Consensus 132 ~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~i~~~~~~~~~~~~~~--~~~vmn~~~~~~L~~lkd~-~g~~~~~~~~~~~ 207 (303) T protein:vir:97 132 ASDVIGTNHFDSKVTQV-VKFTESEDADANIEAAVNLIQGAEGVV--TGLAMDTEFSTALAKVTNG-EMGPKMYPELAWG 207 (303) T ss_pred ccccccccccccccccc-cccccccchHHHHHHHHHHHhhcCCCc--cEEEEcHHHHHHHHHhhcc-CCCeEEecCccCC Confidence 01111111100000000 000111223677888887877766533 3478999999999753221 22222112222 3 Q ss_pred cceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeeccc-- Q lcl|Aclame:pro 235 GNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERA-- 312 (347) Q Consensus 235 G~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~-- 312 (347) +..++++|++|+.|+++|...... . ....-|.+||++.+.+.. .+.+++|.. T Consensus 208 ~~~~~l~G~Pv~~s~~v~~~~~~~---------~--------~~~~~~~Gdf~~~~~~~~---------~~~~~~~~~~~ 261 (303) T protein:vir:97 208 ANPDSINGLKSSVNTTVGAGADEA---------E--------SKDLVIIGDFESMFKWGY---------AKQIPMEIIKY 261 (303) T ss_pred CCCceecceeeEEecccCCccccC---------C--------CccEEEEeeccccEEEEE---------ecCcEEEEeec Confidence 455689999999999998532110 0 011225567766543332 222233322 Q ss_pred cchh------hHhh--HHhhhhhhcCcccccceEEEEEecCC Q lcl|Aclame:pro 313 RRPE------FQAD--QIIGKYAMGHGGLRPEAAGALVFTPA 346 (347) Q Consensus 313 ~~~~------~~~d--~i~~~~~~G~~~lRPe~~~~l~~~~a 346 (347) .++. ++.| .+++...++.++++|++.+.|+.++= T Consensus 262 ~~~d~~~~~~~~~n~~~~r~~~r~~~~v~~p~af~~l~~~~~ 303 (303) T protein:vir:97 262 GDPDNSGKDLKGYNQIYLRAEAYIGWGILDAKSFARVTKGEV 303 (303) T ss_pred cCCCCcchhhhhcCcEEEEEEEEeccEeecccceEEeeCCCC Confidence 1111 2222 46778889999999999999998888 No 97 >protein:vir:99749 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004307;genbank:gi:122891761;genbank:GeneID:4712304 Probab=99.32 E-value=1.1e-13 Score=91.69 Aligned_cols=282 Identities=11% Similarity=0.067 Sum_probs=163.8 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEecc-ccceeeeecCCCCC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVM-GRTKGYYLAPGENL 79 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~i-G~~t~~~~~~g~~~ 79 (347) |...+... ........+...+.-+.|..++.+.....+.++.+.++.++.+ .+++||+. +...+.-...|..+ T Consensus 18 ~~~~~~~~-----a~~~~~~~~~~~lip~~~~~~ii~~~~~~s~l~~~~~~~~~~~-~~~~~p~~~~~~~a~~v~Eg~~~ 91 (324) T protein:vir:99 18 NVKPQVFN-----PDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMRLGKYEPMEG-TEKKFTFWADKPGAYWVGEGQKI 91 (324) T ss_pred hhhhhhcc-----ccceeccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccC-CceEEEEEecCcceeEeccCccc Confidence 11111110 0111111222235668999999999999999999888777654 56888876 44566666667776 Q ss_pred CCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccCccc Q lcl|Aclame:pro 80 DDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIAGLG 159 (347) Q Consensus 80 ~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~g~~ 159 (347) +. .+++..++++..-++ ..-..|.+-=..++.+|+.+.+.++.++++++..|+.++. +... +. .+ T Consensus 92 ~~--~~~~~~~v~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~ai~~~~d~~~l~----G~g~----~~----~~ 156 (324) T protein:vir:99 92 ET--SKATWVNATMRAFKL-GVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGIL----NQGN----NP----FG 156 (324) T ss_pred cc--cccceeEEEEeeEEE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhh----cCCC----Cc----cC Confidence 54 346777777766664 3344454422233457899999999999999999998863 1110 00 11 Q ss_pred CceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhccccccccccceEE Q lcl|Aclame:pro 160 QAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALIDPETGNIRN 239 (347) Q Consensus 160 ~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~v~~ 239 (347) .+..-......... .+...++.|+++...|...+.... .++++|..|..|.+-.+ -++...+..+.-++ T Consensus 157 ~~~~~~~~~~~~~~----~~~~~~~~i~~~~~~l~~~~~~~~--~~v~n~~~~~~L~~l~d-----~~g~~~~~~~~~~~ 225 (324) T protein:vir:99 157 KSIAQSIEKTNKVI----KGDFTQDNIIDLEALLEDDELEAN--AFISKTQNRSLLRKIVD-----PETKERIYDRNSDT 225 (324) T ss_pred ccccccccccceec----cccCCHHHHHHHHHhhhhccCCCC--EEEEcHHHHHHHHHhhc-----CCCceeecCCCCcc Confidence 11110000010000 111126778888888888775332 57899999998875322 12222233445567 Q ss_pred EeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeeccccchh--- Q lcl|Aclame:pro 240 VMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARRPE--- 316 (347) Q Consensus 240 i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~~~~--- 316 (347) ++|.+|+.++..+... ..-+.+||++.+ .+..+++++|...+.. T Consensus 226 l~G~PVv~~~~~~~~~-----------------------~~~i~gd~~~~~----------~~~~~~~~i~~~~~~~~~~ 272 (324) T protein:vir:99 226 LDGLPVVNLKSSNLKR-----------------------GELITGDFDKLI----------YGIPQLIEYKIDETAQLST 272 (324) T ss_pred ccceeEEeecCCCCCc-----------------------ceEEEEecccEE----------EEEecCcEEEEeecccccc Confidence 9999999987765211 111334554422 1223344444433211 Q ss_pred -----------hH--hhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 317 -----------FQ--ADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 317 -----------~~--~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) ++ .-.++..+.+|.+++||++.+.|..+.+. T Consensus 273 ~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~lt~a~~~ 316 (324) T protein:vir:99 273 VKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKK 316 (324) T ss_pred cccccccchhhhhcCcEEEEEEEEEccEEecccceEEEEeccCC Confidence 11 23456678899999999999999876665 No 98 >protein:vir:100247 Length: 425 # NCBI annotation: gp76 # Family: family:all:21 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355412;genbank:gi:77864702;genbank:GeneID:3725969 Probab=99.32 E-value=1e-13 Score=91.75 Aligned_cols=288 Identities=14% Similarity=0.092 Sum_probs=158.2 Q ss_pred CC-CCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEec-cccceeeeecCCCC Q lcl|Aclame:pro 1 MA-NATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPV-MGRTKGYYLAPGEN 78 (347) Q Consensus 1 m~-~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~-iG~~t~~~~~~g~~ 78 (347) +. -++.+ ..++--.+.-+.|..++.+..+..+.++.+.++.++.+++ .++|+ .+.+++.....|.. T Consensus 126 ~~~al~~~-----------t~~~gG~lvP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~-~~~~~~~~~~~a~wv~E~~~ 193 (425) T protein:vir:10 126 VQAALNKG-----------EDSEGGYLTPIEWDRTITNKLVLISPMRQLCRVQPVSKAG-FSKLFNMGGTTSGWVGEASQ 193 (425) T ss_pred hHHHhhcC-----------cCCCCceeccHhHHHHHHHHHHhhhhhhhhceeeeccCCc-eEEEEEcCCcceeeeccccc Confidence 00 00000 0011111455999999999999999999998887776554 55543 45555554444554 Q ss_pred CCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccCcc Q lcl|Aclame:pro 79 LDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIAGL 158 (347) Q Consensus 79 ~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~g~ 158 (347) .+.+ ......++++..-++ ..-..|.+-=..++.+|+.+.+.++.++++++..|+.++. +-. .+. T Consensus 194 ~~~~-~~~~f~~v~~~~~k~-~~~i~iS~ell~ds~~~l~~~i~~~la~ai~~~~d~~~l~----G~G---------~~~ 258 (425) T protein:vir:10 194 RPQT-NAATFQPLSFASGEI-YANPAATQQILDDAEIDLESWLATEVQTEFAKQEGKAFLA----GDG---------TNK 258 (425) T ss_pred cccc-cccccceeeeeheee-EeehHhHHHHHhcchhHHHHHHHHHHHHHHHHHHHhhhhc----ccC---------CCC Confidence 4322 123455555554443 2233343332334568999999999999999999998763 110 001 Q ss_pred cCceeeeeccccc----------ccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhccc Q lcl|Aclame:pro 159 GQAVVLNIGAAAD----------LVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAA 228 (347) Q Consensus 159 ~~~~~i~~~~~~~----------~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~ 228 (347) +.|..-..+.... ...........++.|+++...|+.... ..-.+|++|..|..|.+-.+ .+..|.- T Consensus 259 p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~l~~l~~~l~~~~~--~~a~~vmn~~~~~~L~~lkD-~~G~~l~ 335 (425) T protein:vir:10 259 PNGLLTYIAGGANAAKHPFGAIEVVNSGAAADITSDGIIDLVYDLPSAFT--GNARFAMNRNTQRQVRKLKD-GQGNYLW 335 (425) T ss_pred cceeeeccccccccccccccccccccccccccccHHHHHHHHhhhhhhhc--cCCEEEEchHHHHHHHHhhc-CCCceee Confidence 1111100000000 000001111236778888777766554 23356899999999875332 2334433 Q ss_pred cccccccceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhhee Q lcl|Aclame:pro 229 LIDPETGNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMA 308 (347) Q Consensus 229 ~~~~~~G~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~ 308 (347) ..++..|.-++++|.+|+.++++|....+... =+.+||+... +++.+. .++ T Consensus 336 ~~~~~~g~~~~l~G~PV~~~~~~p~~~~~~~~--------------------i~~Gd~~~~~-~i~~~~--------~~~ 386 (425) T protein:vir:10 336 QPSYVAGQPATLAGYPVTEVPDMPDVAANSTP--------------------ILFGDFQQTY-LIIDRI--------GVR 386 (425) T ss_pred ccCccCCCCceecceeeEEecCcCCccCCccE--------------------EEEEehhccE-EEEEec--------ceE Confidence 34566777788999999999999853222111 1234555432 223222 222 Q ss_pred eccccchhhHhhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 309 LERARRPEFQADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 309 ~e~~~~~~~~~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) +.....-.+-...+++...++.++++|++...|..+++= T Consensus 387 v~~d~~~~~~~~~~~~~~r~d~~v~~~~A~~~l~~~as~ 425 (425) T protein:vir:10 387 VLRDPYTAKPYVLFYTTKRVGGGLLNPEPMRAMKVAASE 425 (425) T ss_pred EEecccccCCcEEEEEEEEeccEeecccceEEEEeeccC Confidence 221111112223566788899999999998887665555 No 99 >protein:vir:94673 Length: 419 # NCBI annotation: major capsid protein # Family: family:all:585 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579208;genbank:gi:93007444;genbank:GeneID:5076792 Probab=99.32 E-value=6.5e-14 Score=92.87 Aligned_cols=295 Identities=10% Similarity=0.075 Sum_probs=161.4 Q ss_pred CCCCccCccccccCcccCccccHHH-HHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEeccccce---------e Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLA-LFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVMGRTK---------G 70 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~a-l~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~iG~~t---------~ 70 (347) +......... .+...++...+.-. +--+.+.+++...-...+.++++.++.+.. +.++++++....+ + T Consensus 110 ~~~~~~~~~~-~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~i~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~a 187 (419) T protein:vir:94 110 MRDIDPNRLL-SRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNAD-YNVLEYIRDTSGTAGAGSTWNKA 187 (419) T ss_pred HHHHHHHHhh-ccccccccccCCcccccchhhhHHHHHHHhhhhhhhhcceeeecc-CCceeeeeeccccccccccCccc Confidence 0000000000 00001111111111 223667777777766677777777765554 4567777643322 2 Q ss_pred eeecCCCCCCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccc Q lcl|Aclame:pro 71 YYLAPGENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAA 150 (347) Q Consensus 71 ~~~~~g~~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~ 150 (347) ..+..|...+. .+++..++++.+.++ +.-..|.+ +-.+...++.+.+.++.++++++..|+.||. +-.. T Consensus 188 ~~v~Eg~~~~~--~~~~~~~i~~~~~k~-~~~~~is~-ell~d~~~l~~~i~~~la~a~~~~~d~aii~----G~G~--- 256 (419) T protein:vir:94 188 AVVPEGTAKPQ--STLSFDTITTTLKTV-AHWLPITR-QAADDNSQLMGYIQGRLTYGLRFLRDRQLLN----GNGS--- 256 (419) T ss_pred ceecCCccccc--cccceeeEEeeeeeE-EEeehhhH-HHHHhHHHHHHHHHHHHHHHHHHHHHHHHHh----ccCc--- Confidence 22233444432 245666777766665 33344542 2333345688889999999999999998873 1100 Q ss_pred cccccCcccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhccccc Q lcl|Aclame:pro 151 SNENIAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALI 230 (347) Q Consensus 151 ~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~ 230 (347) +.+.|......+....... ..........++.|+++...+...+.+. -.++++|..|..|++-..-....|.-.. T Consensus 257 --~~p~Gi~~~~~~~~~~~~~-~~~~~t~~~~~~~l~~~~~~~~~~~~~~--~~~v~n~~~~~~l~~~k~~~~~~~~~~~ 331 (419) T protein:vir:94 257 --TEMQGILTTPGIGTYQQPK-PTAPATDEPPLVDIRRAKTVAEIAGFPP--DGVVVHPQDWESIELDQAPGSGVFRVIA 331 (419) T ss_pred --ccccceecccccccccccc-cccccccchhHHHHHHHHHhhhhccCCC--CEEEEcHHHHHHHHHHhhcCCCceeecC Confidence 0111110000000000000 0111112234788899988888877643 3679999999999865544444444444 Q ss_pred cccccceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeec Q lcl|Aclame:pro 231 DPETGNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALE 310 (347) Q Consensus 231 ~~~~G~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e 310 (347) +...|..++++|++|+.++++|... -+.+||+... +++ ..++++++ T Consensus 332 ~~~~~~~~~l~G~pV~~~~~~~~~~-------------------------~~~gd~~~~~-~~~--------~~~~~~v~ 377 (419) T protein:vir:94 332 NVQGEATPRIWGLNVVSTVAIAQGT-------------------------ALVGGFRQGA-TLW--------SRQGITVL 377 (419) T ss_pred CcccCCCccccceeeEEcCCCCCcc-------------------------EEEeeccceE-EEE--------EecceEEE Confidence 5667777899999999999998421 1334555432 222 23344555 Q ss_pred cccchh----hHhhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 311 RARRPE----FQADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 311 ~~~~~~----~~~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) ...... +-...++....++.++++|++++.+.++++= T Consensus 378 ~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~~~~~aa~ 418 (419) T protein:vir:94 378 MTDSHADFFTANTLVILAEFRANLAVYQPKAFVRVTFAAAT 418 (419) T ss_pred EeccccchhhcCcEEEEEEEeeccEEeccccEEEEEeccCC Confidence 443321 1233567888999999999999998877666 No 100 >protein:vir:4856 Length: 293 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049396;genbank:gi:9632424;genbank:GeneID:1258532 Probab=99.30 E-value=1.7e-13 Score=90.54 Aligned_cols=275 Identities=13% Similarity=0.051 Sum_probs=165.5 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccC-CceEEEeccc--cceeeeecCCC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQN-GKSASFPVMG--RTKGYYLAPGE 77 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~-G~tv~i~~iG--~~t~~~~~~g~ 77 (347) |-+.-. .....+| -.+.-++|..++.+..+..+.++++.++..+.. ..+..|+... ...+.....|. T Consensus 1 ~l~~~~--------~~t~~~g--g~liP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~a~~v~Eg~ 70 (293) T protein:vir:48 1 MLDSKT--------DHSGSDA--GLTIPQDIRTAINTLVRQYDSLQEYVNVENVTTLTGSRVYEKWTDITGLANIDDEAG 70 (293) T ss_pred Cceeec--------ccccCcC--ceEechhHHHHHHHHHHhhhhhhhhceeeeccCCcceEEEEeecCCCcceeeecCCc Confidence 222111 1000111 124569999999999999999999888766553 2356666543 23445555566 Q ss_pred CCCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccCc Q lcl|Aclame:pro 78 NLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIAG 157 (347) Q Consensus 78 ~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~g 157 (347) .++.+ ..++..++++...+. +....|.+-=..++.+|+.+.+.++.++++++..|+.|+.-+... T Consensus 71 ~~~~~-~~~~~~~i~l~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~i~~g~~~~------------- 135 (293) T protein:vir:48 71 KIADI-DDPKLSLIKYTIKRY-AGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILGVVDKL------------- 135 (293) T ss_pred ccccc-cccceeEEEEeeeEE-EEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHhHHhhccccc------------- Confidence 66432 235667777777665 334456543334567899999999999999999999887422100 Q ss_pred ccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhccccccccccce Q lcl|Aclame:pro 158 LGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALIDPETGNI 237 (347) Q Consensus 158 ~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~v 237 (347) ++..... -++.|+++..+|.....+ .-.++++|..|..|.+-.+ .+..|.-..++.+|.. T Consensus 136 ---------~~~~~~~--------~~d~i~~~~~~l~~~~~~--~a~~vmn~~~~~~L~~lkd-~~g~~l~~~~~~~~~~ 195 (293) T protein:vir:48 136 ---------PTKPTLT--------KWDDIIDLEAKVDPAIKQ--TSFFLTNTSGFTALKKVKN-ALGDYLMERDVKSPTG 195 (293) T ss_pred ---------ccccccc--------CHHHHHHHHHhhhhhhcC--CCEEEEcHHHHHHHHHhhc-cCCceEeecCcCCCCC Confidence 0000000 157788888888766543 3456889999999865332 2344444445667777 Q ss_pred EEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeeccccch-- Q lcl|Aclame:pro 238 RNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARRP-- 315 (347) Q Consensus 238 ~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~~~-- 315 (347) ++++|.+|+.+.+.+....+ ++...-+-+||++.+. .+..++++++..... T Consensus 196 ~~l~G~Pv~~~~~~~~~~~~------------------~~~~~~~~gd~~~~~~---------~~~~~~~~i~~~~~~~~ 248 (293) T protein:vir:48 196 YSIAGFAVKEISDRWLPNAS------------------SGVMPLYFGDLKQAVT---------LFDRQQMSLLSTNIGGG 248 (293) T ss_pred ceecceeeEEecccccCCcc------------------CCceEEEEEeccceEE---------EEEecceEEEEecccch Confidence 89999999987655432110 0011113344444322 223344455544321 Q ss_pred hhH--hhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 316 EFQ--ADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 316 ~~~--~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) .++ .-.++....+|.++.+|++.+.+..++++ T Consensus 249 ~~~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~~~ 282 (293) T protein:vir:48 249 AFETDTTKVRVIDRFDVVATDTEAFVPASFKAIA 282 (293) T ss_pred hhhcCeEEEEEEEeeCcEEecccceEEEEeeccc Confidence 122 33577888899999999999999977777 No 101 >protein:vir:100135 Length: 418 # NCBI annotation: gp5 # Family: family:all:585 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945035;genbank:gi:38707895;genbank:GeneID:2744182 Probab=99.29 E-value=1.4e-13 Score=91.00 Aligned_cols=288 Identities=14% Similarity=0.086 Sum_probs=163.2 Q ss_pred CCCCcc--CccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEecccc--ceeeeecCC Q lcl|Aclame:pro 1 MANATG--GQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVMGR--TKGYYLAPG 76 (347) Q Consensus 1 m~~~~~--~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~iG~--~t~~~~~~g 76 (347) |.+... ......+.+.+. ++...+..+.|..++.+..+..+.+++++++..+. +.++.+|+... .++.....| T Consensus 121 ~~~~~~~~~~~~~~~~~~~~--~~~g~lvp~~~~~~ii~~~~~~~~l~~~~~~~~~~-~~~~~~~~~~~~~~~a~~v~E~ 197 (418) T protein:vir:10 121 RVRVDRKSIMNVPATVGSGV--SGSNSLVVADRQAGIIAPPQRKMTIRDLLMPGQTS-SSSIEYTVETGFTNNAAAVAEG 197 (418) T ss_pred hhhhHHHHHHHhhhhccCCC--CCCccccchhHHHHHHHHHhhhhhHHhhcceeecc-CCceeEEEEecCCCceeeeccC Confidence 111000 000000011111 11122567999999999999999999998877765 45677777433 344455556 Q ss_pred CCCCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccC Q lcl|Aclame:pro 77 ENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIA 156 (347) Q Consensus 77 ~~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~ 156 (347) ..++. .+++.+++++...++- .-..|.+ +-.+.+.|+.+.+.++.++++++..|+.++. +... +..+. T Consensus 198 ~~~~~--~~~~f~~v~~~~~k~~-~~~~is~-ell~ds~~l~~~i~~~l~~a~~~~~d~a~l~----G~g~----~~~p~ 265 (418) T protein:vir:10 198 AQKPT--SDLKFNLKNQPVRTIA-HLFKASR-QILDDAPALQSYIDGRARYGLQLTEEGQILK----GDGT----GANIL 265 (418) T ss_pred ccccc--cccceeeEEEeeeeEE-EeehhhH-HHHHhHHHHHHHHHHHHHHHHHHHHHHHHhc----cCCC----Ccccc Confidence 66543 3467777777666653 2334533 3444456899999999999999999998863 1110 11111 Q ss_pred cccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhccccccccccc Q lcl|Aclame:pro 157 GLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALIDPETGN 236 (347) Q Consensus 157 g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~ 236 (347) |.-........+. .......++.|+++...+...+.+.. .+|++|..|..|.+-.. .+..|... +...|. T Consensus 266 Gi~~~~~~~~~~~------~~~~~~~~~~i~~~~~~~~~~~~~~~--~~v~n~~~~~~L~~lkd-~~G~~i~~-~~~~~~ 335 (418) T protein:vir:10 266 GILPQASAFMPSI------TLANATPIDKIRLALLQAVLAEFPAT--GIVLNPIDWASIELTKD-SQGRYIVG-NPVNGT 335 (418) T ss_pred ccccccccccccc------cccccccHHHHHHHHHhhccccCCCC--EEEEcHHHHHHHHHhhc-CCCceecc-ccccCC Confidence 1100000000000 00111125667777777766665332 47889999998865322 23344332 344566 Q ss_pred eEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeeccccchh Q lcl|Aclame:pro 237 IRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARRPE 316 (347) Q Consensus 237 v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~~~~ 316 (347) .+.++|++|+.|+++|... -+.+||+..+ +++. ..+++++...+.. T Consensus 336 ~~~l~G~pV~~~~~~p~~~-------------------------~~~gd~s~~~-~~~~--------~~~~~i~~~~~~~ 381 (418) T protein:vir:10 336 TPRLWNLPVVETQAMTANE-------------------------FLVGAFSMAA-QIFD--------RMEIEVLLSTENV 381 (418) T ss_pred CceecceeeEEcCCCCCCc-------------------------EEEeeccceE-EEEE--------ecceEEEEecccc Confidence 7789999999999998421 1234554432 2222 2334444433221 Q ss_pred ----hHhhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 317 ----FQADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 317 ----~~~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) +-...+++.+.++.++++|++.+.+..+++| T Consensus 382 ~~f~~~~~~~r~~~~~d~~~~~~~a~~~~~~~~~~ 416 (418) T protein:vir:10 382 DDFEKNMVSIRAEERLALAVYRPESFVTGALVEQA 416 (418) T ss_pred hhhhcCceEEEEEEeeccEEecccceEEEEeccCC Confidence 1123455677889999999999999988888 No 102 >protein:vir:2430 Length: 318 # NCBI annotation: major head subunit # Family: family:all:507 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046832;genbank:gi:9630400;genbank:GeneID:1261582 Probab=99.29 E-value=2.5e-13 Score=89.65 Aligned_cols=278 Identities=10% Similarity=0.019 Sum_probs=156.0 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEecc-ccceeeeecCCCCC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVM-GRTKGYYLAPGENL 79 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~i-G~~t~~~~~~g~~~ 79 (347) |++... .+.-.+..+++..+|.+..+..+.++.+.++.++. +.+.+||+. +.+.+.-+..|..+ T Consensus 14 ~~~~~~--------------~~~~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~-~~~~~ip~~~~~~~a~~v~Eg~~~ 78 (318) T protein:vir:24 14 IAQTGD--------------TMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKVPMG-TTGQKIPHWVGDVSAQWIGEGDMK 78 (318) T ss_pred hhcccC--------------cccceeechhHHHHHHHHHHhhchhhhhcceeecc-CCceEEEEEeCCcceEEecCCccc Confidence 333211 11112456889999999999999999988776665 455778754 45566666667777 Q ss_pred CCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccCccc Q lcl|Aclame:pro 80 DDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIAGLG 159 (347) Q Consensus 80 ~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~g~~ 159 (347) +.+ +++.+++++..-+. ..-..|.+-=..++.+|+.+.+.++.+++++++.|+.++. +.... ...+.. T Consensus 79 ~~~--~~~f~~i~~~~~k~-~~~~~iS~e~l~ds~~~~~~~i~~~l~~~~~~~~d~a~l~----G~g~~-----~~~~~~ 146 (318) T protein:vir:24 79 PIT--KGNMTSQTIAPHKI-ATIFVASAETVRANPANYLGTMRTKVATAFAMAFDGAAMH----GTDSP-----FPTYIG 146 (318) T ss_pred ccc--ccceeEEEEeeEEE-EEeehhhHHHhhcChHHHHHHHHHHHHHHHHHHHHHhhhc----ccCCC-----CCcccc Confidence 543 46666666655553 2333443322223567899999999999999999998863 11100 001110 Q ss_pred Cc-eeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhcccccccccc--- Q lcl|Aclame:pro 160 QA-VVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALIDPETG--- 235 (347) Q Consensus 160 ~~-~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G--- 235 (347) .. ..++.+...... ....+.++++...+...+ ...-.++++|..|..|.+-.+ .+..|........+ T Consensus 147 ~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~--~~~~~~v~n~~~~~~L~~lkd-~~G~~l~~~~~~~~~~~ 217 (318) T protein:vir:24 147 QTTKAISIADTTGAT------TVYDQVAVNGLSLLVNDG--KKWTHTLLDDITEPILNGAKD-QNGRPLFIESTYGEAAS 217 (318) T ss_pred ccccccccccccccc------chHHHHHHHHHHhhcccc--CCCCEEEEcHHHHHHHHHhhc-cCCceeecCccccCccc Confidence 00 001111111110 011233444555554444 334567999999999975332 23333222222222 Q ss_pred --ceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeecccc Q lcl|Aclame:pro 236 --NIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERAR 313 (347) Q Consensus 236 --~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~ 313 (347) .-..+.|++++.++++|.... --+.+||+..+ + +..++++++..+ T Consensus 218 ~~~~~~i~g~pv~~~~~~~~~~~-----------------------~~~~gdfs~~~--~--------~~~~~l~i~~~~ 264 (318) T protein:vir:24 218 PFRSGRIVARPTILSDHVVEGTT-----------------------VGFMGDFSQLI--W--------GQIGGLSFDVTD 264 (318) T ss_pred cccCceEEEEeeEEeCCCCCCcc-----------------------EEEEeecceEE--E--------EEecCeEEEEee Confidence 224788999999998874210 01234554421 2 223333443332 Q ss_pred chh--------------hH--hhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 314 RPE--------------FQ--ADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 314 ~~~--------------~~--~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) +.- ++ .-.++....+|.+++||++.+.|....++ T Consensus 265 ~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~i~~~~a~ 314 (318) T protein:vir:24 265 QATLNLGTVESPNFVSLWQHNLVAVRVEAEYAFHCNDAEAFVALTNVVSG 314 (318) T ss_pred ccceeccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEEeeccC Confidence 211 12 23457888999999999999999887777 No 103 >protein:vir:81070 Length: 390 # NCBI annotation: p09 # Family: family:all:585 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285679;genbank:gi:148727187;genbank:GeneID:5247115 Probab=99.29 E-value=1.5e-13 Score=90.83 Aligned_cols=285 Identities=15% Similarity=0.057 Sum_probs=165.6 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEecccc--ceeeeecCCCC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVMGR--TKGYYLAPGEN 78 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~iG~--~t~~~~~~g~~ 78 (347) +.....- ...+.......++.-.+..++|..++.+.....+.+++++++.++.+ .++++++... .++..+..|.. T Consensus 101 ~~~~~~~--~~~~~~~~~~~~~~g~~~~~~~~~~ii~~~~~~~~l~~~~~~~~~~~-~~~~~~~~~~~~~~a~~v~Eg~~ 177 (390) T protein:vir:81 101 RATMNIK--AALNTASTDAAGSAGALTTPNRLPGFITPPDARLTVRDLIGSGRTDS-ALIEYVQETGFVNNAAIVAEGAL 177 (390) T ss_pred hhhhHHH--HHHHhhccccccCCcceechhhhHHHHHHHhhhhhhhhhcceeeccC-CceEEEEEecCCcceeeecCCcc Confidence 0000000 00000000111222235667788888888888898888887766554 4677777533 35555666776 Q ss_pred CCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccCcc Q lcl|Aclame:pro 79 LDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIAGL 158 (347) Q Consensus 79 ~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~g~ 158 (347) ++.. +++.+++++.+.++- .-..|.+ +-.+.+.++.+.+.++.++++++..|+.++. +.. .+..+.|. T Consensus 178 ~~~~--~~~~~~i~~~~~k~~-~~~~is~-ell~d~~~~~~~i~~~l~~~~~~~~d~a~l~----G~g----~~~~~~Gi 245 (390) T protein:vir:81 178 KPES--SLKFAKKTDTTHVIA-HTMKATR-QILSDAPQLASYMNNRLIRGLKVKEDAEILR----GTG----ANDGLLGL 245 (390) T ss_pred cccc--cceeeEEEEeeeEEE-EeehhhH-HHHHhHHHHHHHHHHHHHHHHHHHHHHHHHh----cCC----CCCcccce Confidence 6543 467777777777653 3344543 3334446788899999999999999998763 110 01111111 Q ss_pred cCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhccccccccccceE Q lcl|Aclame:pro 159 GQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALIDPETGNIR 238 (347) Q Consensus 159 ~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~v~ 238 (347) .-..+ . ............++.|.++...+...+.+.. .+|++|..|..|.+-.. .+..|.-. +...+... T Consensus 246 ----~~~~~-~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~v~~~~~~~~l~~lkd-~~G~~l~~-~~~~~~~~ 315 (390) T protein:vir:81 246 ----IPQAT-T-YAAPTTIAGATRVDQLRLAMLQASLAEYNPS--GIVINPIDWAAIELAKD-ANNQYLIG-NARGTLTP 315 (390) T ss_pred ----eeccc-c-cccccccccchhHHHHHHHHHhhccccCCCC--EEEEcHHHHHHHHHhhc-CCCceeec-CcccccCc Confidence 10000 0 0000011112236778888888888877543 56889999998875332 22334322 22345556 Q ss_pred EEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeeccccchhh- Q lcl|Aclame:pro 239 NVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARRPEF- 317 (347) Q Consensus 239 ~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~~~~~- 317 (347) .++|.+|+.++++|... -+.+||++.. +++ ...+++++..+...+ T Consensus 316 ~l~G~pv~~~~~~p~~~-------------------------~~~gd~~~~~-~~~--------~~~~~~v~~~~~~~~~ 361 (390) T protein:vir:81 316 TLWGLPVVATQAMAPGE-------------------------FLVGAFDLAA-QIF--------DQWDARVEIGYVGEDF 361 (390) T ss_pred eecceeeEEcCCCCCCc-------------------------EEEEehhceE-EEE--------EecceEEEEecccchh Confidence 89999999999998421 1345565432 222 234556665544332 Q ss_pred Hhh--HHhhhhhhcCcccccceEEEEEec Q lcl|Aclame:pro 318 QAD--QIIGKYAMGHGGLRPEAAGALVFT 344 (347) Q Consensus 318 ~~d--~i~~~~~~G~~~lRPe~~~~l~~~ 344 (347) +.+ .++....++.++++|++.+.+.++ T Consensus 362 ~~~~v~~r~~~r~d~~v~~~~a~v~~t~a 390 (390) T protein:vir:81 362 QRNMITVLAEERLALVVYRPEALISGSFA 390 (390) T ss_pred hcCcEEEEEEEeeccEEecccceEEEEeC Confidence 334 456888999999999999999999 No 104 >protein:vir:4830 Length: 397 # NCBI annotation: MPL-7201 # Family: family:all:21 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038327;genbank:gi:9634653;genbank:GeneID:1262632 Probab=99.29 E-value=5.5e-13 Score=87.81 Aligned_cols=282 Identities=12% Similarity=0.035 Sum_probs=161.6 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccC--CceEEEeccc-cceeeeecCCC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQN--GKSASFPVMG-RTKGYYLAPGE 77 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~--G~tv~i~~iG-~~t~~~~~~g~ 77 (347) +.. ........+. ....++.-.+.-+.|..++++..+..+.+++++++..+.+ |+....+... ...+.....|. T Consensus 98 ~~~-~~~~~~~~~~--~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~ 174 (397) T protein:vir:48 98 VRG-RYQNLLDSKT--DASGSDAGLTIPQDIQTAIHTLVRQYDSLQEYVNVENVTTLTGSRVYEKWADITGLAKLDDEAG 174 (397) T ss_pred Hhh-hhhHHHHHhh--ccCCccccccccHHHHHHHHHHHHHHHHHHhhhceeeccCCcceEEEEeecCCCcceeeecccc Confidence 000 0000000000 0011111124568999999999999999999888776653 3333333222 22334444455 Q ss_pred CCCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccCc Q lcl|Aclame:pro 78 NLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIAG 157 (347) Q Consensus 78 ~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~g 157 (347) .++.. ..++..++++.+.+. .....|.+-=..++.+|+.+.+.++.++++++..|+.|+.-. T Consensus 175 ~~~~~-~~~~~~~v~~~~~k~-~~~~~iS~ell~ds~~~l~~~v~~~l~~~~~~~~d~~il~G~---------------- 236 (397) T protein:vir:48 175 SIGTN-DDPKLYPIRYAIKRY-AGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILEAI---------------- 236 (397) T ss_pred ccccc-cccceeeEEeeheee-eeehhhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhcc---------------- Confidence 55432 235667777777665 344456543333467899999999999999999999887311 Q ss_pred ccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhccccccccccce Q lcl|Aclame:pro 158 LGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALIDPETGNI 237 (347) Q Consensus 158 ~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~v 237 (347) +.+... ++ . . -++.|+++...|+....+. =.+|++|..|..|.+-.. .+..|.-..++..|.- T Consensus 237 -g~~~~~--~~---~----~----~~d~i~~~~~~l~~~~~~~--a~~v~n~~~~~~L~~lkd-~~G~~i~~~~~~~~~~ 299 (397) T protein:vir:48 237 -ATLPTK--PT---L----T----KWDDIIDLQAKVDPAIKQT--SFFLTNTSGFTALKKVKN-AFGDYLMERDVKSPTG 299 (397) T ss_pred -cccccc--cc---c----c----cHHHHHHHHHHhhhhhcCC--CEEEECHHHHHHHHHhhc-CCCceeeccCcCCCCC Confidence 111110 00 0 0 1566788888888777543 466899999999876332 2344443445667777 Q ss_pred EEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeeccccchh- Q lcl|Aclame:pro 238 RNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARRPE- 316 (347) Q Consensus 238 ~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~~~~- 316 (347) +.++|++|+.+.+.+....+ .+...-+.+||++. +..+....++++..+... T Consensus 300 ~~l~G~PV~~~~~~~~~~~~------------------~~~~~~~~gd~~~~---------~~~~~~~~~~i~~~~~~~~ 352 (397) T protein:vir:48 300 YSIDGFAVKEVADRWLANAS------------------SGAMPLYFGDLKQA---------VTLFDRQQMSLLSTNIGGG 352 (397) T ss_pred ceeccceeEEecccccCCcC------------------CCceEEEEEeccce---------EEEEeecceEEEEeccchh Confidence 89999999987653321100 00011122344332 222333444555444221 Q ss_pred -hH--hhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 317 -FQ--ADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 317 -~~--~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) +. ...+++.+.++.++++|++.+.+..++++ T Consensus 353 ~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~ 386 (397) T protein:vir:48 353 AFETDTTKIRVIDRFDVVATDTESFVPASFKAIA 386 (397) T ss_pred hhhcCceeEEEEeeeccEEecccceEEEEecccc Confidence 12 23667888899999999999999988877 No 105 >protein:vir:2344 Length: 397 # NCBI annotation: gp14 # Family: family:all:507 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075281;genbank:gi:12657868;genbank:GeneID:920118 Probab=99.28 E-value=1.3e-13 Score=91.30 Aligned_cols=285 Identities=10% Similarity=0.035 Sum_probs=159.8 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEecc-ccceeeeecCCCCC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVM-GRTKGYYLAPGENL 79 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~i-G~~t~~~~~~g~~~ 79 (347) |.- |+- .+.......++.-.+..+++..++++..++.+.++.+.++.++. +.+.+||+. +.+.+.-+..|..+ T Consensus 1 ~g~-~~e----~~~~~~~~t~~~~g~l~~~~~~~ii~~l~~~s~i~~l~~~~~~~-~~~~~ip~~~~~~~a~wv~Eg~~~ 74 (397) T protein:vir:23 1 MGF-SAD----HSQIAQTKDTMFTGYLDPVQAKDYFAEAEKTSIVQRVAQKIPMG-ATGIVIPHWTGDVSAQWIGEGDMK 74 (397) T ss_pred CCc-CHH----HHHHhhccCCCCccccchhHHHHHHHHHHhccchhhhcceeecc-CCceEEEEEcCCcceEEecCCccc Confidence 322 111 11111111111112455667788888888888888888776655 455788865 44455555566666 Q ss_pred CCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccCccc Q lcl|Aclame:pro 80 DDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIAGLG 159 (347) Q Consensus 80 ~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~g~~ 159 (347) +. .+++..++++.+-++ ..-..|.+-=..++.+|+.+.+.++.+++|+++.|+.+|.- ...+. ...+.. T Consensus 75 ~~--s~~~f~~v~l~~~k~-~~~v~iS~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l~G----~gt~~----~~~~~~ 143 (397) T protein:vir:23 75 PI--TKGNMTKRDVHPAKI-ATIFVASAETVRANPANYLGTMRTKVATAIAMAFDNAALHG----TNAPS----AFQGYL 143 (397) T ss_pred cc--cccceeEEEEeeEEE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhc----ccCCc----cccccc Confidence 54 346777777666554 33344543223345689999999999999999999988731 11100 001100 Q ss_pred CceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhcccc-----ccccc Q lcl|Aclame:pro 160 QAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAAL-----IDPET 234 (347) Q Consensus 160 ~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~-----~~~~~ 234 (347) ......... .....++.++++...|.+...+ .-.++++|..|..|.+-.+ .+..|.-. +.... T Consensus 144 ~~~~~~~~~---------~~~~~~~~~~~~~~~l~~~~~~--~a~~vmn~~~~~~L~~lkd-~~G~~i~~~~~~~~~~~~ 211 (397) T protein:vir:23 144 DQSNKTQSI---------SPNAYQGLGVSGLTKLVTDGKK--WTHTLLDDTVEPVLNGSVD-ANGRPLFVESTYESLTTP 211 (397) T ss_pred ccccceeee---------cccchhHHHHHHHHhhhhcccC--CCEEEEcHHHHHHHHHhhc-cCCceeeccccccccccc Confidence 000000000 1111245566666777776643 3457999999999986432 22333211 11222 Q ss_pred cceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeeccccc Q lcl|Aclame:pro 235 GNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARR 314 (347) Q Consensus 235 G~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~~ 314 (347) +..+++.|++|+.++++|.... --+.+||++.. +. ..+++.++..++ T Consensus 212 ~~~~tl~G~Pv~~s~~~~~g~~-----------------------~~~~gDfs~~~--i~--------~~~~i~i~~~~e 258 (397) T protein:vir:23 212 FREGRILGRPTILSDHVAEGDV-----------------------VGYAGDFSQII--WG--------QVGGLSFDVTDQ 258 (397) T ss_pred ccCceeeeeeEEEeCCCCCCce-----------------------EEEEeecceEE--EE--------EEeceEEEEeee Confidence 3446899999999999984211 11344555432 21 122223332221 Q ss_pred hh--------------hHhh--HHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 315 PE--------------FQAD--QIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 315 ~~--------------~~~d--~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) .- ++-| .++....++.++++|++.+.+..++.. T Consensus 259 ~~~~~~~~~~~~~~~lf~~d~v~~ra~~r~d~~v~~~~a~~~~~~~~~~ 307 (397) T protein:vir:23 259 ATLNLGSQESPNFVSLWQHNLVAVRVEAEYGLLINDVNAFVKLTFDPVL 307 (397) T ss_pred eeeeeccccccceeeeeeccceeEEEEeeeccceecccceEEEeecccc Confidence 10 1222 456778899999999999999987776 No 106 >protein:vir:99920 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655524;genbank:gi:109392294;genbank:GeneID:4157089 Probab=99.28 E-value=6.1e-13 Score=87.54 Aligned_cols=297 Identities=13% Similarity=0.030 Sum_probs=155.5 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEecc-ccceeeeecCCCCC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVM-GRTKGYYLAPGENL 79 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~i-G~~t~~~~~~g~~~ 79 (347) ||..+.. | + .+.-++|..++.+..+..|+++.+.++....+| .++||+. |.+++.-+..|+.+ T Consensus 1 Mat~tt~-------~-----g---~~vP~~~~~~ii~~~~~~s~l~~~~~~i~~~~~-~~~~p~~~~~~~a~wv~Eg~~~ 64 (311) T protein:vir:99 1 MATFGTG-------N-----L---KNLPRNIADGMVKDVVQGSTVAVLSARKPQRFG-NEDIITFNGRPKAEFVGEGQQK 64 (311) T ss_pred CceecCC-------C-----c---eeccHHHHHHHHHHHHhhchhhhhcceeeccCC-ceEEEEEeCCceeEEeecCccc Confidence 8864321 1 1 134588899999999999999988877666544 4688876 66677666667777 Q ss_pred CCCCCCCCCCceEEEEeeeeecchhhccHHHH----HhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccc Q lcl|Aclame:pro 80 DDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDA----MNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENI 155 (347) Q Consensus 80 ~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~----q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~ 155 (347) +.. +++..++++..-+. ..-..|.+ +-. .+..|+.+.+.++++++|++++|+.+|.-. +... ...+ T Consensus 65 ~~~--~~~f~~v~l~~~k~-~~~~~iS~-ell~~~~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~--g~~~----g~~~ 134 (311) T protein:vir:99 65 SST--TGEFDFVTSTPKKA-QVTMRFNE-EVQWADEDYQLGVLQTLSEAGAEALARALDLGLYHRI--NPLT----GTVI 134 (311) T ss_pred ccc--cceeeEEEEeeEEE-EEeehhhH-HHhhcccccHHHHHHHHHHHHHHHHHHHHHHHhhccc--Cccc----Cccc Confidence 643 45666666655433 22233432 111 245778999999999999999999887421 0000 0001 Q ss_pred CcccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhcccccccccc Q lcl|Aclame:pro 156 AGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALIDPETG 235 (347) Q Consensus 156 ~g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G 235 (347) .+. ...+... ....+........+.+.+..+...+........--.++++|..+..|.+-.. .+..|.-......+ T Consensus 135 ~g~--~~~~~~~-~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd-~~G~~l~~~~~~~~ 210 (311) T protein:vir:99 135 PGW--SNYLGAA-SKRVELTADTIANPDLAIEAAVGLLVANGHPTPVNGLALHPSIAWGLSTARY-TDGRKKFPELGLGI 210 (311) T ss_pred ccc--ccccccc-cceeeccccccchhHHHHHHHHHHHhhhccCCCccEEEEcHHHHHHHHhhhc-cCCCeeecCcccCC Confidence 110 0001000 0000000011111223344444444433322111126899999999965322 22344433444456 Q ss_pred ceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeecccc-- Q lcl|Aclame:pro 236 NIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERAR-- 313 (347) Q Consensus 236 ~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~-- 313 (347) ..+++.|++|+.|+++|....... .......+....-+.+||+..+-+. ..++++++... T Consensus 211 ~~~~l~G~Pv~~s~~i~~~~~~~~---------~~~~~~~~~~~~~~~Gdf~~~~~~~---------~~~~~~~~~~~~~ 272 (311) T protein:vir:99 211 GVSSFEGIDASVSDTVNGGDEADP---------DDEDLDAARAVRGIVGDFANGIHWG---------VQRDIPVELIKYG 272 (311) T ss_pred CCceecceeeEeeccccccccccc---------ccchhhccCcceEEEeeccccEEEE---------EecCceEEEeecC Confidence 678899999999999984321111 0111111111222446666533221 12222333221 Q ss_pred chh-----hHhhH--HhhhhhhcCcccccceEEEEEecCC Q lcl|Aclame:pro 314 RPE-----FQADQ--IIGKYAMGHGGLRPEAAGALVFTPA 346 (347) Q Consensus 314 ~~~-----~~~d~--i~~~~~~G~~~lRPe~~~~l~~~~a 346 (347) ++. ++.|. +++...+|..+++|++ +.+..++| T Consensus 273 ~~~~~~~~~~~d~~~~r~~~r~d~~v~~~~~-v~~~~~~A 311 (311) T protein:vir:99 273 DPDGQGDLKRHNQIALRLEIVYGWYVFTDRF-VVIENAVA 311 (311) T ss_pred CCCcchhhhhcCcEEEEEEEeecceecChhH-eeeecccC Confidence 111 22333 4777889999988764 45544444 No 107 >protein:vir:104256 Length: 458 # NCBI annotation: major head protein precursor # Family: family:all:27070 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006977;genbank:gi:46401878;genbank:GeneID:2777673 Probab=99.27 E-value=1.5e-13 Score=90.94 Aligned_cols=294 Identities=12% Similarity=0.060 Sum_probs=154.4 Q ss_pred CCCCc------cCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEec-cccceeeee Q lcl|Aclame:pro 1 MANAT------GGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPV-MGRTKGYYL 73 (347) Q Consensus 1 m~~~~------~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~-iG~~t~~~~ 73 (347) |..-. ........ .......+...+..+.|..++.+.-+..+.++.+.++.++.++ ...+++ .+.+.+... T Consensus 143 ~~~~~~~~~~~~~~~~~a~-~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~a~~v 220 (458) T protein:vir:10 143 VMEKGVFETEHGQRHLKAV-NQSSSVEVSSESYETIFSQRIIRDLQKELVVGALFEELPMSSK-ILTMLVEPDAGKATWV 220 (458) T ss_pred HHhhccchhhhhhhhhhhh-hhcccCccccceehhhHhHHHHHHHHhhhhHHhhcceeecCCc-ceEEEEecCCcceeec Confidence 10000 00000000 0011112223357789999999999888988888887666654 455553 444444443 Q ss_pred cCCCCCCCCC----CCCCCCceEEEEeeeeecch-hhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcc Q lcl|Aclame:pro 74 APGENLDDKR----KDIKHSEKVIQIDGLLTSDV-LIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLP 148 (347) Q Consensus 74 ~~g~~~~~~~----~~~~~~~~~l~ID~~~~~~~-~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a 148 (347) ..|...+.+. .+++..++++ ...++..+ .|.+-=...+.+|+.+.+.++++++|++..|+.+|. +.. T Consensus 221 ~e~~~~~~~~~~~~~~~~~~~i~~--~~~k~~~~v~is~ell~ds~~~~~~~i~~~l~~~i~~~~d~~~l~----G~G-- 292 (458) T protein:vir:10 221 AASTYGTDTTTGEEVKGALKEIHF--STYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMT----GDG-- 292 (458) T ss_pred ccccccccccccccccccceeeEe--eeeeEEeeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhc----CCC-- Confidence 3343333211 1234444444 44444443 343322233558899999999999999999998863 110 Q ss_pred cccccccCccc------CceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhh Q lcl|Aclame:pro 149 AASNENIAGLG------QAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPN 222 (347) Q Consensus 149 ~~~~~~~~g~~------~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~ 222 (347) ++.+.|.- .+..+...++ .......++.|+++...|...+.. .-.+|++|..|..|.+-. -. T Consensus 293 ---~~~p~Gi~~~~~~~~~~~~~~~~~------~~~~~~~~~~i~~~~~~l~~~~~~--~~~~v~~~~~~~~l~~lk-d~ 360 (458) T protein:vir:10 293 ---SGKPKGLLTLASEDSAKVVTEAKA------DGSVLVTAKTISKLRRKLGRHGLK--LSKLVLIVSMDAYYDLLE-DE 360 (458) T ss_pred ---CCccceeeecccccccceeecccc------cccccccHHHHHHHHHhhhhhhcC--CCEEEEcHHHHHHHHhhc-cc Confidence 00111110 0011111111 111111267788888888877653 345689999998876422 22 Q ss_pred hhhccc----cccccccceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhh Q lcl|Aclame:pro 223 AANYAA----LIDPETGNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSA 298 (347) Q Consensus 223 ~~~~~~----~~~~~~G~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A 298 (347) +..|.. ......|...+++|.+|+.++.+|...... .-+.++|.... +++ T Consensus 361 ~G~~i~~~~~~~~~~~~~~~~l~G~pv~~~~~~p~~~~~~---------------------~~~~~~f~~~~-~~~---- 414 (458) T protein:vir:10 361 EWQDVAQVGNDSVKLQGQVGRIYGLPVVVSEYFPAKANSA---------------------EFAVIVYKDNF-VMP---- 414 (458) T ss_pred CCceeeccccccccccCcCceecceeeEEccccccccCCc---------------------ceEEEEecccE-EEE---- Confidence 333322 123445666789999999999998532110 11223443211 122 Q ss_pred hhhhhhhheeeccccchhhHhhHHhhhhhhcCcccccceEEEEEecCC Q lcl|Aclame:pro 299 VGTVKLKDMALERARRPEFQADQIIGKYAMGHGGLRPEAAGALVFTPA 346 (347) Q Consensus 299 ~~tv~~~~~~~e~~~~~~~~~d~i~~~~~~G~~~lRPe~~~~l~~~~a 346 (347) ...+++++...-.....-.++....+|-.+.+|++++...++++ T Consensus 415 ----~~~~~~v~~d~~~~~~~~~~~~~~r~~~~v~~~~a~v~~~~aa~ 458 (458) T protein:vir:10 415 ----RQRAVTVERERQAGKQRDAYYVTQRVNLQRYFANGVVSGTYAAS 458 (458) T ss_pred ----EeeceEEEeecccCCCceEEEEEEEecceEecccceEEEeeccC Confidence 22333333211111122346777889999999999999887777 No 108 >protein:vir:97148 Length: 324 # NCBI annotation: ORF010 # Family: family:all:507 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239726;genbank:gi:66394880;genbank:GeneID:5130881 Probab=99.27 E-value=1.9e-13 Score=90.27 Aligned_cols=287 Identities=11% Similarity=0.083 Sum_probs=163.2 Q ss_pred CCCCc------------cCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEecc-cc Q lcl|Aclame:pro 1 MANAT------------GGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVM-GR 67 (347) Q Consensus 1 m~~~~------------~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~i-G~ 67 (347) |=..+ ....--.+........+...+.-+.|..++.+.-+..+.++.+.++-+.. +.+++||+. +. T Consensus 1 ~~~~~~~~~~~~~f~~~~~~~~~~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~~~~~~~~~-~~~~~ip~~~~~ 79 (324) T protein:vir:97 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPME-GTEKKFTFWADK 79 (324) T ss_pred CccchhHHHHHHHHHHhhhhhhhhccccccccCCCcceechhHHHHHHHHHHhhcchhhhcceeecc-CCceEEEEEecC Confidence 00000 00000000100111112223556899999999999999999987766654 456888876 45 Q ss_pred ceeeeecCCCCCCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhc Q lcl|Aclame:pro 68 TKGYYLAPGENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNL 147 (347) Q Consensus 68 ~t~~~~~~g~~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~ 147 (347) +.+.-+..|..++. .+++.+++++..-++ ..-..|.+--..++.+++.+.+.++.+++++++.|+.++. +... T Consensus 80 ~~a~~v~Eg~~~~~--~~~~f~~v~~~~~k~-~~~~~is~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l~----G~g~ 152 (324) T protein:vir:97 80 PGAYWVGEGQKIET--SKATWVNATMRAFKL-GVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGIL----NQGN 152 (324) T ss_pred cceeEeccCccccc--cccceeEEEEeeEEE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhc----cCCC Confidence 56666666776654 346777777776664 3444555522233567899999999999999999998874 1100 Q ss_pred ccccccccCcccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhcc Q lcl|Aclame:pro 148 PAASNENIAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYA 227 (347) Q Consensus 148 a~~~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~ 227 (347) . +.+.+..-......... .+...++.|+++...|...+... ..++++|..|..|.+-.+ -+ T Consensus 153 ----~----~~~~gi~~~~~~~~~~~----~~~~~~~~i~~~~~~l~~~~~~~--~~~v~n~~~~~~L~~lkd-----~~ 213 (324) T protein:vir:97 153 ----N----PFGKSIAQSIEKTNKVI----KGDFTQDNIIDLEALLEDDELEA--NAFISKTQNRSLLRKIVD-----PE 213 (324) T ss_pred ----C----ccCccccccccccceec----cccCCHHHHHHHHHhhhhccCCC--CEEEEcHHHHHHHHHhhc-----CC Confidence 0 00111100000000000 11112677888888888877543 357899999998875322 12 Q ss_pred ccccccccceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhhe Q lcl|Aclame:pro 228 ALIDPETGNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDM 307 (347) Q Consensus 228 ~~~~~~~G~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~ 307 (347) |...+..+.-+.++|++|+.++..+... ..-+.+||++.+ .+..+++ T Consensus 214 g~~~~~~~~~~tl~G~PV~~~~~~~~~~-----------------------~~~~~gd~~~~~----------i~~~~~~ 260 (324) T protein:vir:97 214 TKERIYDRNSDTLDGLPVVNLKSSNLKR-----------------------GELITGDFDKLI----------YGIPQLI 260 (324) T ss_pred CceeecCCCCccccceeeEeecCCCCCc-----------------------ceEEEEecccEE----------EEEecCc Confidence 2222334555679999999987665311 111334555432 1233444 Q ss_pred eeccccchh--------------hH--hhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 308 ALERARRPE--------------FQ--ADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 308 ~~e~~~~~~--------------~~--~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) ++|...+.. ++ .-.++..+.++.++++|++.+.|..+.+. T Consensus 261 ~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~a~~~l~~~~~~ 316 (324) T protein:vir:97 261 EYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKK 316 (324) T ss_pred EEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeccCC Confidence 554443211 11 23456678899999999999999866554 No 109 >protein:vir:4997 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049971;genbank:gi:9632943;genbank:GeneID:1262106 Probab=99.25 E-value=8.4e-13 Score=86.80 Aligned_cols=282 Identities=12% Similarity=0.029 Sum_probs=161.1 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCc-eEEEecccc--ceeeeecCCC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGK-SASFPVMGR--TKGYYLAPGE 77 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~-tv~i~~iG~--~t~~~~~~g~ 77 (347) |-. ..... .+.......++--.+.-+.|..++.+..+..+.++.++++..+..+. ++.+++... ..+.....|. T Consensus 98 l~~-~~~~~--~~~~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~ 174 (397) T protein:vir:49 98 VRG-RYQNL--LDSKTDGSGSDAGLTIPQDIRTAINTLVRQFDSLQEYVNVENVTTLTGSRVYEKWADITGLAKLDDEGG 174 (397) T ss_pred hhc-chhhH--HHhhhccCCccCcceecHHHHHHHHHHHHhhhhHhhhcceeeccCCcceEEEEeeccCCcceeeecccc Confidence 110 00000 00000001111112455899999999999999999888887766432 344554332 2333333455 Q ss_pred CCCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccCc Q lcl|Aclame:pro 78 NLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIAG 157 (347) Q Consensus 78 ~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~g 157 (347) .++.+ ..++.+++++...+. +.-..|.+-=..++.+|+.+.+.++.+++|++..|+.|+.-. T Consensus 175 ~~~~~-~~~~~~~v~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~ail~G~---------------- 236 (397) T protein:vir:49 175 QIGQN-DDPKLSLIRYAIKRY-AGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILEAI---------------- 236 (397) T ss_pred ccccc-cccceeeeEeeeeee-EeehhhHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcc---------------- Confidence 54322 124556777777665 333455443233467899999999999999999999886311 Q ss_pred ccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhccccccccccce Q lcl|Aclame:pro 158 LGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALIDPETGNI 237 (347) Q Consensus 158 ~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~v 237 (347) +.+... +.. . -++.|.++...|+....+. -.+|++|..|..|.+-. -.+..|.-..++..|.- T Consensus 237 -g~~~~~-----~~~----~----~~d~i~~~~~~l~~~~~~~--a~~v~n~~~~~~l~~lk-d~~g~~l~~~~~~~g~~ 299 (397) T protein:vir:49 237 -GTLPNK-----PTL----A----KWDDIIDLQAKVDPAIKQT--SLFLTNTSGFTALKKVK-NAMGDYLMERDVKSPTG 299 (397) T ss_pred -cccccc-----ccc----c----CHHHHHHHHHhhhhhhcCC--CEEEEcHHHHHHHHHhh-ccCCceeecccccCCCC Confidence 111000 000 0 1567888888888877653 46789999999886532 22334433345566777 Q ss_pred EEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeeccccch-- Q lcl|Aclame:pro 238 RNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARRP-- 315 (347) Q Consensus 238 ~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~~~-- 315 (347) .+++|++|+.+.+.+....+ . +...-+.+||++.+ ..+..+.++++..... T Consensus 300 ~~l~G~pV~~~~~~~~~~~~-----------~-------~~~~~~~gd~~~~~---------~~~~~~~~~i~~~~~~~~ 352 (397) T protein:vir:49 300 YSIDGFVVKEISDRFLPNGT-----------G-------GAMPLYFGDLKQAV---------TLFDRQHLSLLSTNIGGG 352 (397) T ss_pred ceecceeeEEeccccccccc-----------C-------CceeEEEeeccceE---------EEEeecccEEEEeccccc Confidence 88999999986654321100 0 00111234444322 2233444555544321 Q ss_pred h--hHhhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 316 E--FQADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 316 ~--~~~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) . +-...+++...+|.++++|++.+.+..+++| T Consensus 353 ~~~~~~~~~~~~~r~d~~~~~~~a~~~~~~~~~~ 386 (397) T protein:vir:49 353 AFETDTTKVRVIDRFDVVSTDTEAFVPASFKAIA 386 (397) T ss_pred hhhcCeeeEEEEEeeccEEecccceEEEEecccc Confidence 1 2233577888999999999999999988887 No 110 >protein:vir:95763 Length: 297 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950590;genbank:gi:119953785;genbank:GeneID:5076833 Probab=99.25 E-value=3.2e-13 Score=89.07 Aligned_cols=280 Identities=11% Similarity=0.031 Sum_probs=162.1 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEe-ccccceeeeecCCCCC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFP-VMGRTKGYYLAPGENL 79 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~-~iG~~t~~~~~~g~~~ 79 (347) |.= ..+ ++.-....++...|..++|..++.+..+..+.++.+.++..+.++....+| ..+...+..+..|..+ T Consensus 1 m~~---~~~---~~~~~~~t~~~~~lvP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~ 74 (297) T protein:vir:95 1 MTV---QTF---NPENVLVSQKKDGTLHKEFTDIIMKEVAQNSLVMQLGQYQEMEGEQEKTVYVQTDGISAYWVNETEKI 74 (297) T ss_pred CCc---ccc---ccccccccCCCcceechhHHHHHHHHHHhhchhhhhcceeecCCCccEEEEEEcCCceeEEeecCccc Confidence 322 111 111111222333466799999999999999999988887766554445566 3445566677777777 Q ss_pred CCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccCccc Q lcl|Aclame:pro 80 DDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIAGLG 159 (347) Q Consensus 80 ~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~g~~ 159 (347) +.. +++.+++++...+. .....|.+-=..++..|+.+.+.++.+++++++.|+.++. +..... ..+.. T Consensus 75 ~~~--~~~f~~v~l~~~k~-~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~l~----G~g~~~-----~~gi~ 142 (297) T protein:vir:95 75 KTD--KPEVVPVTLKAHKL-GIILVTSREALNYTWKKFFEDMKPQIVEAFYKKIDEAGLL----GHDTPF-----ANSVA 142 (297) T ss_pred ccc--ccceeEEEEeeEEE-EEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhc----ccCCcc-----ccccc Confidence 643 46777777776664 3444554422223568899999999999999999998873 111100 01100 Q ss_pred CceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhccccccccccceEE Q lcl|Aclame:pro 160 QAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALIDPETGNIRN 239 (347) Q Consensus 160 ~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~v~~ 239 (347) ........... ....++.|+++..+|...+.+.. .++++|..|..|.+-.. .+..| +-.+..+. T Consensus 143 --~~~~~~~~~~~------~~~t~~~i~~~~~~l~~~~~~~~--~~v~~~~~~~~L~~l~d-~~G~~-----i~~~~~~~ 206 (297) T protein:vir:95 143 --KAAKDANKVIG------GPINYDNILKLQDALYDADVEPN--AFVSKIQNRSALREARD-GNKVS-----IYDKAANT 206 (297) T ss_pred --ccccccceecc------cccCHHHHHHHHHHhhhccCCcC--EEEEcHHHHHHHHHhhc-cCCce-----eecCCCCc Confidence 00000000000 01126778888888888776543 56889999999875221 11222 12334467 Q ss_pred EeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeeccccchh--- Q lcl|Aclame:pro 240 VMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARRPE--- 316 (347) Q Consensus 240 i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~~~~--- 316 (347) +.|++|+.+++.+... +.-+.+||++.. .+...+++++...+.. T Consensus 207 l~G~Pv~~~~~~~~~~-----------------------~~~~~gd~s~~~----------~~~~~~~~i~~~~~~~~~~ 253 (297) T protein:vir:95 207 IDGITTVDLKSARFEK-----------------------GDLLAGDFDNLI----------YGVPYNITYKISEEGQIST 253 (297) T ss_pred ccceeeEeecCCCCCC-----------------------ceEEEEecccEE----------EEEecCeEEEEeecccccc Confidence 8999999877654211 111335555432 1223334444332211 Q ss_pred -----------hH--hhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 317 -----------FQ--ADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 317 -----------~~--~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) ++ .-.++....+|.++++|++.+.|..+..= T Consensus 254 ~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~at~~ 297 (297) T protein:vir:95 254 ITNADGTPINLFEQEMIAIRATMDIAVMITKTDAFAKLTPAERV 297 (297) T ss_pred ccccCccchhhhhcCcEEEEEEEEeccEeecccceEEEeecCCC Confidence 22 22356667899999999999998765555 No 111 >protein:vir:102119 Length: 404 # NCBI annotation: phage major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699941;genbank:gi:110804052;genbank:GeneID:4206662 Probab=99.24 E-value=7.7e-13 Score=87.01 Aligned_cols=298 Identities=10% Similarity=0.015 Sum_probs=160.4 Q ss_pred CCCCccC----ccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCC-ceEEEec-cccceeeeec Q lcl|Aclame:pro 1 MANATGG----QQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNG-KSASFPV-MGRTKGYYLA 74 (347) Q Consensus 1 m~~~~~~----~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G-~tv~i~~-iG~~t~~~~~ 74 (347) +...+.. ...-.|.-..+..++--.+.-+.|.+++...-+..+.++.+.++.++.++ -.+.+++ .+...+.... T Consensus 92 ~~~~~~~~~~~~~~e~~a~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~g~~~~~~~~~~~~~~~v~ 171 (404) T protein:vir:10 92 LKQKNQRGLNLSEKEINAISENIDEDGGYAVPEDIQTKINTRLKDTTDLYNMVDYEPVFTRSGSRTYEKRSKQKPMKPLS 171 (404) T ss_pred HHHHHhhhhcchhhHHhhhccccCCCCceeechhHHHHHHHHHhhhhhHhhhhceeeccCCccceEEEEecCCcceeecc Confidence 1100000 00000000000111111134588899999998888999998888877632 2455554 5666666666 Q ss_pred CCCCCCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccc Q lcl|Aclame:pro 75 PGENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNEN 154 (347) Q Consensus 75 ~g~~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~ 154 (347) .|...+.+..+++.+++++...++ ..-..|.+-=..++.+++.+.+.++.++++++..|+.|+. +... ... T Consensus 172 e~~~~~~~~~~~~f~~i~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~il~----G~g~----~~~ 242 (404) T protein:vir:10 172 ENQQIPTNGDNGKLERFNFKLKDL-ADFMSIPNDLLKFADKSLEDWIINWFVDKVRITRNAEILY----GAGG----DEH 242 (404) T ss_pred ccccccccccccceeeeEeeheee-EeeehhhHHHHhhcHHHHHHHHHHHHHHHHHHHHHHHHhh----cCCC----CCc Confidence 676665443345566666666554 2333454422234567899999999999999999998863 2111 111 Q ss_pred cCcccCceeeeecccccccchhhHHHHHHHHHHHHHH-HHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhcccccccc Q lcl|Aclame:pro 155 IAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARA-RLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALIDPE 233 (347) Q Consensus 155 ~~g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~-~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~ 233 (347) ..|......+. +....... .++.+..+.. .|....-+ +-.+|++|..|..|.+-. -.+..|.-..++. T Consensus 243 ~~gi~~~~~~~----~~~~~~~~----~~~~~~~~~~~~l~~~~~~--~~~~v~n~~~~~~L~~lk-d~~G~~l~~~~~~ 311 (404) T protein:vir:10 243 ATGIMTANKFK----KITLPKSP----ALKDFKKCKNVELLNVFKA--TSSWIVNQDGFNYLDSLE-DKTGRPYLQPDPK 311 (404) T ss_pred ccceeeccccc----eeeccccc----cHHHHHHHHHhhhhccccC--CCEEEEcHHHHHHHHHhh-ccCCceeeccCcC Confidence 11111000000 00001111 1344544433 34433322 235689999999887632 2344554444566 Q ss_pred ccceEEEeceeEEEecc-ccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeeccc Q lcl|Aclame:pro 234 TGNIRNVMGFEVIEVPH-LTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERA 312 (347) Q Consensus 234 ~G~v~~i~G~~V~~sn~-lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~ 312 (347) .|...+++|.+|+.++. +|.... +...-+.+||+.. +..+....++++.. T Consensus 312 ~~~~~~l~G~PV~~~~~~~~~~~~--------------------~~~~~~~gd~s~~---------~~~~~~~~~~i~~~ 362 (404) T protein:vir:10 312 DPTQYRFLGLPVIELPNDLLLSTE--------------------SAIPVLLGDTKEA---------YKYVSDGAYELATT 362 (404) T ss_pred CCCCccccceeeEEecccccCCCC--------------------CccEEEEEecccc---------EEEEEecceEEEEe Confidence 67778899999986443 332110 0011123444432 22233344555544 Q ss_pred cch--h--hHhhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 313 RRP--E--FQADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 313 ~~~--~--~~~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) .++ . +-.-.+++.+.+|..+++|++++.+..+++| T Consensus 363 ~~~~~~~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~aa 401 (404) T protein:vir:10 363 NIGAGAFETNTTKARIIMRIDGNVKDSEALLIAEIPVES 401 (404) T ss_pred ccccchhhcCceEEEEEEeeccEEecccceEEEEeeccc Confidence 332 1 2234588999999999999999999999999 No 112 >protein:vir:3991 Length: 404 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116499;genbank:gi:14251132;genbank:GeneID:921252 Probab=99.20 E-value=1.6e-12 Score=85.20 Aligned_cols=285 Identities=12% Similarity=0.038 Sum_probs=159.0 Q ss_pred CCCCccC-ccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCC-ceEEEecccc--ceeeeecCC Q lcl|Aclame:pro 1 MANATGG-QQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNG-KSASFPVMGR--TKGYYLAPG 76 (347) Q Consensus 1 m~~~~~~-~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G-~tv~i~~iG~--~t~~~~~~g 76 (347) +-+.... .....|.-.....++--.+.-+.|..++++..+..+.+++++++.++.++ -+..+++... ..+..+..| T Consensus 101 ~~~~~~~~~~~e~~a~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg 180 (404) T protein:vir:39 101 VRNPMAFLNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAED 180 (404) T ss_pred HhcchhhhhhhhhhhhhcccccCCceeccHHHHHHHHHHHHhhhhHHhhcceeeccCCcceEEEEeecCCccceeeecCc Confidence 1000000 00001100011111111245689999999999999999999888777643 2344443322 333444445 Q ss_pred CCCCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccC Q lcl|Aclame:pro 77 ENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIA 156 (347) Q Consensus 77 ~~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~ 156 (347) ..++.+ ..+...++++.+.+. +....|.+-=...+.+|+.+.+.++.++++++..|+.|+.-. T Consensus 181 ~~~~~~-~~~~f~~i~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~il~g~--------------- 243 (404) T protein:vir:39 181 GKIPDL-DNPRLTIIKYLIKRY-AGIITATNTLLKDTAENILAWLSSWIAKKVVVTRNQAIIAAM--------------- 243 (404) T ss_pred cccccc-cccceeeEEeeeeeE-EeeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHhcc--------------- Confidence 555422 235667777777776 344456543233457889999999999999999999886311 Q ss_pred cccCceeeeecccccccchhhHHHHHHHHHHHHH-HHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhcccccccccc Q lcl|Aclame:pro 157 GLGQAVVLNIGAAADLVDVEARGKAILKGLTLAR-ARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALIDPETG 235 (347) Q Consensus 157 g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~-~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G 235 (347) +.+.. .++ .. . ++.|.++. ..++...-+ .-.+|++|..|..|.+-. -.+..|.-..++..| T Consensus 244 --g~~~~--~~~---~~----~----~~~i~~~~~~~~~~~~~~--~a~~v~n~~~~~~L~~lk-d~~G~~l~~~~~~~~ 305 (404) T protein:vir:39 244 --GTVPK--KPT---IA----K----FDDVITMINTSVDPAIIA--TSSLLTNQSGLNKLALVK-TAEGKYLLEPDPTKP 305 (404) T ss_pred --ccccc--ccc---cc----c----HHHHHHHHHHhhhhhhcc--CCEEEEcHHHHHHHHHhh-ccCCceeeccCcCCC Confidence 11100 000 00 1 23344443 233433322 236799999999998632 223445434455667 Q ss_pred ceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeeccccch Q lcl|Aclame:pro 236 NIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARRP 315 (347) Q Consensus 236 ~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~~~ 315 (347) ...+++|++|+.+.+.+....+. +...-|.+||+..+- .+..++++++..... T Consensus 306 ~~~~l~G~pV~~~~~~~~~~~~~------------------~~~~~~~gd~~~~~~---------~~~~~~~~i~~~~~~ 358 (404) T protein:vir:39 306 NSYLIKGKKVIVVADRWLPNSGS------------------TVYPLYYGDMSQAIT---------LFDRENMSLLPTNIG 358 (404) T ss_pred CcceecceeEEEecccccCccCC------------------CccEEEEEeccccEE---------EEeecceEEEEeccc Confidence 77899999999877643221100 001123445554321 223344555554433 Q ss_pred h----hHhhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 316 E----FQADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 316 ~----~~~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) . +-...+++.+.+|.++++|++.+.+..+++| T Consensus 359 ~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~a 394 (404) T protein:vir:39 359 AGAFETDTTKIRVIDRFDVKTTDSEALVAGSFTAIA 394 (404) T ss_pred hhhhhhceeeEEEEeeeccEEecccceEEEEeeccc Confidence 1 2234577889999999999999999987777 No 113 >protein:vir:81160 Length: 371 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285811;genbank:gi:148747732;genbank:GeneID:5247203 Probab=99.19 E-value=2.9e-12 Score=83.88 Aligned_cols=274 Identities=11% Similarity=0.026 Sum_probs=160.9 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCC-ceEEEeccc-cceeeeecCCCC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNG-KSASFPVMG-RTKGYYLAPGEN 78 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G-~tv~i~~iG-~~t~~~~~~g~~ 78 (347) |+-.+. ..| -.+.-+.|..++.+..+..+.+++++++..+.++ -+..++..+ .+.+..+..|+. T Consensus 91 ~~~~t~------------~~g--g~~vP~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~ 156 (371) T protein:vir:81 91 MSEGSN------------QDG--GYTVPQDIQTRINELRESKDALQNLITVEPVTTLSGSRVFKKRSQQTGFVEVAEGAA 156 (371) T ss_pred hccCCC------------ccC--ceeecHhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCcceeeeccccc Confidence 221100 011 1145588999999999999999999888777543 234455443 346666666776 Q ss_pred CCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccCcc Q lcl|Aclame:pro 79 LDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIAGL 158 (347) Q Consensus 79 ~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~g~ 158 (347) .+.. ..+..+++++...+. .....|.+-=...+.+|+.+.+.++.++++++..|+.|+.-. |. T Consensus 157 ~~~~-~~~~f~~i~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~a~~~~~~~~i~~g~---------------g~ 219 (371) T protein:vir:81 157 IGEK-ATPQFTLLQYQVKKY-AGFFRVTNELLNDSTEAIVNTLVRWIGDESRVTRNGLIINVL---------------NT 219 (371) T ss_pred cccc-cccceeeEEeeeeEE-EEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhc---------------cc Confidence 6432 235667766666665 233445442223355789999999999999999998876411 00 Q ss_pred cCceeeeecccccccchhhHHHHHHHHHHHHH-HHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhccccccccccce Q lcl|Aclame:pro 159 GQAVVLNIGAAADLVDVEARGKAILKGLTLAR-ARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALIDPETGNI 237 (347) Q Consensus 159 ~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~-~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~v 237 (347) .. +....+ ++.+..+. ..|+...- ..-.+|++|..|..|.+-. -.+..|.-..++..|.. T Consensus 220 ~~--------~~~~~~--------~~~i~~~~~~~l~~~~~--~~a~~vmn~~~~~~L~~lk-d~~g~~l~~~~~~~~~~ 280 (371) T protein:vir:81 220 KA--------KTAIAD--------LDGLKQIINVQLDPVFR--STSSVIVNQDAFNWLDTLK-DQNGQYLLQPSISSPTG 280 (371) T ss_pred cc--------cccccc--------HHHHHHHHHhhcchhhh--cCCEEEEcHHHHHHHHHhh-ccCCCeeeecccCCCCC Confidence 00 000011 23333332 23433332 2346789999999987632 22344544445666777 Q ss_pred EEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeeccccchh- Q lcl|Aclame:pro 238 RNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARRPE- 316 (347) Q Consensus 238 ~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~~~~- 316 (347) ++++|.+|+.++++|.......... ++...-+.+||+..+- .+....++++...... T Consensus 281 ~~l~G~pV~~~~~~~~~~~~~~~~~-------------~~~~~i~~Gd~~~~~~---------~~~~~~~~i~~~~~~~~ 338 (371) T protein:vir:81 281 RQLLGLPVVIVSNKVLANRVDGGTG-------------AQFAPIIVGDLKEAVV---------MFDRQRTEIMSSNVAMD 338 (371) T ss_pred ceecceeEEEecccccCcccccccc-------------CCcceEEEEehhceEE---------EEeecceEEEEeccccc Confidence 8999999999999996543221110 1111123445543221 2233344444443221 Q ss_pred ---hHhhHHhhhhhhcCcccccceEEEEEecCC Q lcl|Aclame:pro 317 ---FQADQIIGKYAMGHGGLRPEAAGALVFTPA 346 (347) Q Consensus 317 ---~~~d~i~~~~~~G~~~lRPe~~~~l~~~~a 346 (347) +-.-.+++.+.++.++++|++.+.+..++| T Consensus 339 ~f~~~~v~~~~~~r~d~~~~~~~a~~~~~~~~A 371 (371) T protein:vir:81 339 AFETDATLWRAIERMDVKMRDDEAFVFGEVQLA 371 (371) T ss_pred hhhcCceEEEEEEeeccEEecccceEEEEEecC Confidence 123467788889999999999999998888 No 114 >protein:vir:4953 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049929;genbank:gi:9632900;genbank:GeneID:1262076 Probab=99.19 E-value=2e-12 Score=84.66 Aligned_cols=280 Identities=14% Similarity=0.071 Sum_probs=162.3 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccC--CceEEEeccc--cceeeeecCC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQN--GKSASFPVMG--RTKGYYLAPG 76 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~--G~tv~i~~iG--~~t~~~~~~g 76 (347) |-.-....-...+. ....++--.+.-+.|..++.+.-+..+.++++.++..+.+ |+ ..+++.. ...+..+..| T Consensus 97 ~l~~~~~~~~~~~~--~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~-~~~~~~~~~~~~a~~v~E~ 173 (397) T protein:vir:49 97 LVRGRYQNLLDSKT--DASGSDAGLTIPQDIQTAIHTLVSQYDSLQEYVNVENVTTLTGS-RVYEKWTDITGLANIDDEA 173 (397) T ss_pred HHhcchhHHHHHhh--ccccccCcccccHhHHHHHHHHHHhhhhHHhhhceeecccCccc-eEEEeeccCCcceeeecCc Confidence 00000000000000 0111111224558999999999989999999888877653 33 4455433 3345555556 Q ss_pred CCCCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccC Q lcl|Aclame:pro 77 ENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIA 156 (347) Q Consensus 77 ~~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~ 156 (347) ..++.. ..++..++++.+.+. +....|.+-=..++.+|+.+.+.++.+++|++..|+.|+.-. T Consensus 174 ~~~~~~-~~~~~~~i~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~ai~~G~--------------- 236 (397) T protein:vir:49 174 GKIADV-DDPKLSLIKYTIKRY-AGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILEAI--------------- 236 (397) T ss_pred cccccc-cccceeeEEeeeeeE-EeeehhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhhc--------------- Confidence 665432 235667777777665 344455443233456899999999999999999999887421 Q ss_pred cccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhccccccccccc Q lcl|Aclame:pro 157 GLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALIDPETGN 236 (347) Q Consensus 157 g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~ 236 (347) +.+.. .++ .. -++.|.++...|.....+. -.+|++|..|..|.+-.. .+..|.-..++..|. T Consensus 237 --g~~~~--~~~---~~--------~~d~i~~~~~~l~~~~~~~--a~~vmn~~~~~~l~~lkd-~~G~~l~~~~~~~~~ 298 (397) T protein:vir:49 237 --AALPT--KPT---LT--------KWDDIIDLEAKVDPAIKQT--SFFLTNTSGFTALKKVKN-ALGDYLMERDVKSPT 298 (397) T ss_pred --ccccc--ccc---cc--------cHHHHHHHHHhhhhhhcCC--CEEEEcHHHHHHHHHhhc-CCCceeeccCcCCCC Confidence 00000 000 00 1566788888888777543 467899999999976332 234454444566677 Q ss_pred eEEEeceeEEEecc--ccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeeccccc Q lcl|Aclame:pro 237 IRNVMGFEVIEVPH--LTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARR 314 (347) Q Consensus 237 v~~i~G~~V~~sn~--lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~~ 314 (347) -+.++|++|+.+.+ +|....+ ...-+.+||++.+ ..+..++++++.... T Consensus 299 ~~~l~G~PV~~~~~~~~~~~~~~--------------------~~~i~~gd~~~~~---------~~~~~~~~~i~~~~~ 349 (397) T protein:vir:49 299 GYSIDGFAVKEVADRWLANGTGG--------------------AMPLYFGDLKQAV---------TLFDRQHMSLLSTNI 349 (397) T ss_pred CceecceeeEEecccccccccCC--------------------ceeEEEeeccceE---------EEEeecceEEEEecc Confidence 78999999998654 3321100 0011223444322 122334445544332 Q ss_pred h--hh--HhhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 315 P--EF--QADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 315 ~--~~--~~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) . .+ -...+++...++.++++|++.+.+..+++| T Consensus 350 ~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~ 386 (397) T protein:vir:49 350 GGGAFETDTTKVRVIDRFDVVATDTEAFVPASFKAIA 386 (397) T ss_pred ccchhhcCceeEEEEeeeCcEEecccceEEEEeeccc Confidence 1 12 234578888999999999999999988877 No 115 >protein:vir:4226 Length: 326 # NCBI annotation: observed 35.2Kd protein # Family: family:all:507 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039681;swissprot:sw:q05223;genbank:gi:9625447;uniprot:Q05223;genbank:GeneID:2942929 Probab=99.15 E-value=4.5e-12 Score=82.81 Aligned_cols=295 Identities=10% Similarity=0.011 Sum_probs=155.1 Q ss_pred CCCCccCc---cccc--cCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEecc-ccceeeeec Q lcl|Aclame:pro 1 MANATGGQ---QIGA--NQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVM-GRTKGYYLA 74 (347) Q Consensus 1 m~~~~~~~---~~~~--~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~i-G~~t~~~~~ 74 (347) |.= |.-- .... +....-+.++.-.+..+.+..++.+..++.+.++.+.++..+. ++..+||+. +.+.+..+. T Consensus 1 ~~~-~~~r~~~~~~~~e~~a~~~~~~~~g~~ip~~~~~~ii~~~~~~s~i~~~~~~~~~~-~~~~~~p~~~~~~~a~~v~ 78 (326) T protein:vir:42 1 MAV-NPDRTTPFLGVNDPKVAQTGDSMFEGYLEPEQAQDYFAEAEKISIVQQFAQKIPMG-TTGQKIPHWTGDVSASWIG 78 (326) T ss_pred CCC-CccchhhhcCcchhhheeccccCCcceechhhHHHHHHHHHhcchhhhhcceeecc-CCceEEEEEeCCcceEEec Confidence 221 1100 0000 0000000111112556888999999999999888877765554 456778764 445556666 Q ss_pred CCCCCCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccc Q lcl|Aclame:pro 75 PGENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNEN 154 (347) Q Consensus 75 ~g~~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~ 154 (347) .|..++.. +++..++++...+. ..-+.|.+-=..++.+|+.+.+.++.++++++..|+.++. +.....+ .. T Consensus 79 Eg~~~~~~--~~~f~~i~~~~~k~-~~~v~iS~ell~~s~~~~~~~i~~~l~~a~~~~~d~a~l~----G~gs~~p-~g- 149 (326) T protein:vir:42 79 EGDMKPIT--KGNMTSQTIAPHKI-ATIFVASAETVRANPANYLGTMRTKVATAFAMAFDNAAIN----GTDSPFP-TF- 149 (326) T ss_pred CCcccccc--ccceeEEEEeeEEE-EEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHhhc----ccCCCcc-cc- Confidence 67777643 47777777777664 4455565533344678999999999999999999998863 1111000 00 Q ss_pred cCcccCceeeeecccccccchhhHHHHHHHH--HHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhccccccc Q lcl|Aclame:pro 155 IAGLGQAVVLNIGAAADLVDVEARGKAILKG--LTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALIDP 232 (347) Q Consensus 155 ~~g~~~~~~i~~~~~~~~~~~~~~~~~i~~~--l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~ 232 (347) +.............++. .. ....++. +..+...+ .+.....-.+|++|..|..|.+-.+ .+..+.-.... T Consensus 150 i~~~~~~~~~~~~~~~~-~~----~~~~~~~~~~~~~~~~~--~~~~~~~a~~v~n~~~~~~L~~lkd-~~G~~l~~~~~ 221 (326) T protein:vir:42 150 LAQTTKEVSLVDPDGTG-SN----ADLTVYDAVAVNALSLL--VNAGKKWTHTLLDDITEPILNGAKD-KSGRPLFIEST 221 (326) T ss_pred ccccccccceeeccccc-cc----ccchhHHHHHHHHHhhh--hhhccCccEEEEeHHHHHHHHHhhc-cCCceeecccc Confidence 00000000000000000 00 0001111 12222222 2222344556899999999975222 22233211111 Q ss_pred -----cccceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhhe Q lcl|Aclame:pro 233 -----ETGNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDM 307 (347) Q Consensus 233 -----~~G~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~ 307 (347) .....+.+.|++|+.++.+|.... --+.+||++.. ++.+.. + T Consensus 222 ~~~~~~~~~~~~l~G~pv~~~~~~~~~~~-----------------------~~~~Gd~s~~~--~~~~~~--------~ 268 (326) T protein:vir:42 222 YTEENSPFRLGRIVARPTILSDHVASGTV-----------------------VGYQGDFRQLV--WGQVGG--------L 268 (326) T ss_pred ccCccccccCceeeeeeEEEcCCCCCCce-----------------------EEEEeecceEE--EEEecc--------e Confidence 222345799999999999984210 11334555432 222222 2 Q ss_pred eeccccch--------------hhHh--hHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 308 ALERARRP--------------EFQA--DQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 308 ~~e~~~~~--------------~~~~--d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) +++...+. .++. -.++..+.++.+++||++.+.|...++| T Consensus 269 ~v~~~~e~~~~~~~~~~~~~~~~~~~d~~~~r~~~~~d~~v~~~~a~~~l~~~~~~ 324 (326) T protein:vir:42 269 SFDVTDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYAFHCNDKDAFVKLTNVDAT 324 (326) T ss_pred EEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEecccceEEEeecccc Confidence 22222111 1222 3457888999999999999999887777 No 116 >protein:vir:9704 Length: 394 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795466;genbank:gi:28876225;genbank:GeneID:1257769 Probab=99.15 E-value=3.2e-12 Score=83.58 Aligned_cols=269 Identities=13% Similarity=0.067 Sum_probs=155.6 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEecc--ccceeeeecCCCC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVM--GRTKGYYLAPGEN 78 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~i--G~~t~~~~~~g~~ 78 (347) +...+. .+.+- ..++--.+.-+.|..+|.+..+..+.++++.++.++.+|+ .++|.. +..++..+..|.. T Consensus 121 ~~~~~~-----~~~~~--t~~~gg~liP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~-~~~~~~~~~~~~~~~v~E~~~ 192 (394) T protein:vir:97 121 TTPVEP-----QKDGI--KKENAKPVSSEEILYTPAREVKTVVDLKPFTTVYQAKKAS-GKYPVLQRATTKMVTVAELEK 192 (394) T ss_pred hhhhhh-----hcccc--ccccccccChHHHHHHHHHHhhhhhhhhhhceeeeccCcc-eEEEEEecCCCccceeccccc Confidence 111111 00011 1111112455889999998888889999998877766553 566654 3444555555555 Q ss_pred CCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccCcc Q lcl|Aclame:pro 79 LDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIAGL 158 (347) Q Consensus 79 ~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~g~ 158 (347) .+.. ..+..+++++...++ +.-..|.+-=..++.+|+.+.+.++.+++|++..|..|+.-+. T Consensus 193 ~~~~-~~~~~~~v~l~~~k~-~~~i~is~ell~ds~~~~~~~i~~~la~~~~~~~~~~i~~g~~---------------- 254 (394) T protein:vir:97 193 NPAL-AKPDFKDVAWNIDTY-RGAIPLSQESIDDADVDLVGIVSESISQIKVNTTNDAIAKVLK---------------- 254 (394) T ss_pred cccc-ccccceeEEeehhhe-eeehhhHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhhccc---------------- Confidence 5421 235666777766554 3334444422334567899999999999999999988764210 Q ss_pred cCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhccccccccccceE Q lcl|Aclame:pro 159 GQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALIDPETGNIR 238 (347) Q Consensus 159 ~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~v~ 238 (347) .+ ++..... ++.|.++...+-. |...-.+|++|..|..|.+-. -.+..|.-..++..|.-+ T Consensus 255 -~~------~~~~~~~--------~~~~~~~~~~~~~---~~~~a~~v~n~~~~~~l~~lk-d~~G~~i~~~~~~~~~~~ 315 (394) T protein:vir:97 255 -SF------TTKTVKN--------LDEIKALLNGGFD---PAYNVSLIVSQSFYQTLDTLK-DGNGRYLLQDDITAVSGK 315 (394) T ss_pred -cc------ccccccc--------HHHHHHHHHhhhh---hhhCCEEEEcHHHHHHHHHhh-ccCCCeeeecCcCCCCCc Confidence 00 0000011 2334333322111 122335689999999987532 223444433455667677 Q ss_pred EEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeeccccchhhH Q lcl|Aclame:pro 239 NVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARRPEFQ 318 (347) Q Consensus 239 ~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~~~~~~ 318 (347) .++|++|+.+++.+... ..-|.+||+..+. + +..++++++...+ .++ T Consensus 316 ~l~G~pv~~~~~~~~~~-----------------------~~~~~gd~~~~~~-~--------~~~~~~~~~~~~~-~~~ 362 (394) T protein:vir:97 316 VLLGKPVFVLSDEVLGA-----------------------NKAFIGDFKRGVL-F--------ADRKDLGLRWADN-EIY 362 (394) T ss_pred eeccceeEEecccccCC-----------------------ccEEEeeccccEE-E--------EEecceEEEEecc-ccc Confidence 99999999877543211 0113456654322 2 2234445554332 344 Q ss_pred hhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 319 ADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 319 ~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) ...+++.+.+|.++.+|++.+.|..++++ T Consensus 363 ~~~~~~~~r~d~~v~~~~a~~~~~~~~~~ 391 (394) T protein:vir:97 363 GQYLQAVLRFGVSKVDDKAGYYVTFTPEP 391 (394) T ss_pred ceeEEEEEEEccEEecccceEEEEecccc Confidence 55788999999999999999999999999 No 117 >protein:vir:101607 Length: 379 # NCBI annotation: major capsid protein precursor # Family: family:all:585 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112497;genbank:gi:53793597;uniprot:Q5ZGF6;genbank:GeneID:3101715 Probab=99.15 E-value=4e-12 Score=83.10 Aligned_cols=273 Identities=15% Similarity=0.007 Sum_probs=157.2 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEecc-c--cceeeeecCCC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVM-G--RTKGYYLAPGE 77 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~i-G--~~t~~~~~~g~ 77 (347) +-.... .-.+.....++.-.+..+.|..++++.-.+.+.++++.++.++.+ .++.||+. | .........|. T Consensus 98 ~~~~~~-----~~~~~~~~~~~~~~~ip~~~~~~ii~~~~~~~~i~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~v~Eg~ 171 (379) T protein:vir:10 98 GKSIQV-----KAVGDMTLPVNLTGAQPKDYNFDVVLNPSQMLNVSDIVGAVSISG-GTYTFVRENGAGEGAIGAQVEGA 171 (379) T ss_pred hhhhhh-----hhhcccccCCCCccccchhhhhHHHHhHHhhhhHHhhceeeeccC-CceEEEEeecCCCcccccccCCc Confidence 100000 000111112222224568899999999888888989888777654 45788753 2 22333344555 Q ss_pred CCCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccCc Q lcl|Aclame:pro 78 NLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIAG 157 (347) Q Consensus 78 ~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~g 157 (347) ..+. .+++.+++++.+.++- .-..|.+ +-.+...++.+.+.++.+++|++..|+.++.-+. + . T Consensus 172 ~~~~--~~~~f~~i~~~~~k~~-~~~~iS~-ell~D~~~l~~~i~~~la~~~~~~~~~~~~~g~~--~-------~---- 234 (379) T protein:vir:10 172 TKGQ--KDYDISMIDVNTDFIA-GFTRYSK-KMANNLPFLTSFIPNALRRDYAKAENAAFNAVLA--A-------N---- 234 (379) T ss_pred cccc--cccceeeeEeeeeeEE-eeehhhH-HHHhhHHHHHHHHHHHHHHHHHHHHHHHHhcccc--c-------c---- Confidence 5543 3467777776666652 2333432 2333334577888888999999999987753110 0 0 Q ss_pred ccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhccccc--ccccc Q lcl|Aclame:pro 158 LGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALI--DPETG 235 (347) Q Consensus 158 ~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~--~~~~G 235 (347) ...... +.+... .++.|.++...+...+.+. ..+|++|..|..|.+-.. .+..|.... ....| T Consensus 235 -~~~~~~---~~~~~~--------~~d~i~~~~~~~~~~~~~~--~~~vmn~~~~~~l~~lkd-~~G~~l~~~~~~~~~~ 299 (379) T protein:vir:10 235 -ATASTE---IITNKN--------KVEMLINEIAKQENLDFPV--TAIVLRPTDYYDILVTQK-SVGAGYGLPGVVTQDN 299 (379) T ss_pred -cccccc---cccCcc--------cHHHHHHHHHhhhhccCCC--CEEEEcHHHHHHHHHhhc-cCCceeccCCccCCCC Confidence 000000 011111 1456777777777666543 346789999999865332 334453332 23456 Q ss_pred ceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeeccccch Q lcl|Aclame:pro 236 NIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARRP 315 (347) Q Consensus 236 ~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~~~ 315 (347) ...+++|++|+.|+.+|.+ .-|-+||+... +++ .+.++++..+++ T Consensus 300 ~~~~l~G~pvv~s~~~~ag-------------------------~~~~gdf~~~~-~~~---------~~~~~i~~~~~~ 344 (379) T protein:vir:10 300 GVLRINGIPLFRATWLAAN-------------------------KYYVGDWTRVT-KVT---------TEGLSLEFSEVE 344 (379) T ss_pred CcceecceeeEecCCCCCC-------------------------ceEEeecccEE-EEE---------EeceEEEEeecc Confidence 6678999999999998732 11345666542 222 233455555443 Q ss_pred h--hH--hhHHhhhhhhcCcccccceEEEEEecCC Q lcl|Aclame:pro 316 E--FQ--ADQIIGKYAMGHGGLRPEAAGALVFTPA 346 (347) Q Consensus 316 ~--~~--~d~i~~~~~~G~~~lRPe~~~~l~~~~a 346 (347) . +. ...+++...+|..++||++.+.+.+++= T Consensus 345 ~~~f~~~~~~~r~~~R~~~~v~~p~a~v~~~~~~~ 379 (379) T protein:vir:10 345 GTNFVKNNITARIEAQVALAVEQPAALIFGDFTAV 379 (379) T ss_pred cccccCCcEEEEEEEEeccEEecCccEEEEEecCC Confidence 2 22 2356677789999999999999988877 No 118 >protein:vir:7409 Length: 408 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839926;genbank:gi:30089896;genbank:GeneID:1260683 Probab=99.15 E-value=3.6e-12 Score=83.35 Aligned_cols=285 Identities=12% Similarity=0.035 Sum_probs=157.9 Q ss_pred CCCCc-cCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCc-eEEEeccccc-eeeeec-CC Q lcl|Aclame:pro 1 MANAT-GGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGK-SASFPVMGRT-KGYYLA-PG 76 (347) Q Consensus 1 m~~~~-~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~-tv~i~~iG~~-t~~~~~-~g 76 (347) +-+.. .......|.-.....++--.+.-+.|..++++..+..+.+++++++..+.++. ++.+++.... ....+. .| T Consensus 101 ~~~~~~~~~~~~~~a~~~~~~~~gg~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~ 180 (408) T protein:vir:74 101 VRNPMAFLNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSSGSRVYEKWTDVTPLKAMDEED 180 (408) T ss_pred HhcchhhhhhhhhhhhcccccCCCceeechhHhhHHHHHHhhhcchhhhcceeeccCCcceEEEEeecCCcccccccccc Confidence 00000 00000011100111111112455899999999999999899998887776432 4556554332 222232 23 Q ss_pred CCCCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccC Q lcl|Aclame:pro 77 ENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIA 156 (347) Q Consensus 77 ~~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~ 156 (347) ..++.. ..++.+++++...+. +....|.+-=...+.+|+.+.+.++.+++|++..|+.|+. T Consensus 181 ~~~~~~-~~~~~~~i~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~il~----------------- 241 (408) T protein:vir:74 181 GKIPDL-DNPRLTIIKYLIKRY-AGIITATNTLLKDTAENILAWLSSWIAKKVVVTRNQAIIA----------------- 241 (408) T ss_pred cccccc-cccceeeEEeeeeeE-EeeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhh----------------- Confidence 444321 235667777777665 3344454433334677999999999999999999998763 Q ss_pred cccCceeeeecccccccchhhHHHHHHHHHHHHH-HHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhcccccccccc Q lcl|Aclame:pro 157 GLGQAVVLNIGAAADLVDVEARGKAILKGLTLAR-ARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALIDPETG 235 (347) Q Consensus 157 g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~-~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G 235 (347) |.+.+.. .++. .+ ++.|+++. ..|+....+ +-.+|++|..|..|.+-. -.+..|.-..++..| T Consensus 242 G~G~~~~--~~~~-------~~----~~~i~~~~~~~l~~~~~~--~a~~v~n~~~~~~l~~lk-d~~G~~l~~~~~~~~ 305 (408) T protein:vir:74 242 AMGTVPK--KPTI-------AN----FDDVITMINTSVDPAIIA--TSSLLTNQSGLNKLALVK-TAEGKYLLEPDPTKP 305 (408) T ss_pred ccccccc--cccc-------cc----HHHHHHHHHHhhhhhhcC--CCEEEEcHHHHHHHHHhh-cCCCceEeccCcCCC Confidence 1111100 0000 11 34455443 456655543 335688999999997632 234455444455667 Q ss_pred ceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeeccccch Q lcl|Aclame:pro 236 NIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARRP 315 (347) Q Consensus 236 ~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~~~ 315 (347) .-++++|++|+.+++.+....+ ++...-+.+||++.+ +++ ..+.++++..+.. T Consensus 306 ~~~~l~G~pV~~~~~~~~~~~~------------------~~~~~i~~gd~~~~~-~~~--------~~~~~~i~~~~~~ 358 (408) T protein:vir:74 306 NSYLIKGKQVIVVADRWLPNSG------------------STVYPLYYGDMSQAI-TLF--------DRENMSLLPTNIG 358 (408) T ss_pred CCceecceeeEEecCccccccc------------------CCcceEEEEehhccE-EEE--------EecceEEEEeccc Confidence 6689999999988753321100 000111334554432 122 2344455444321 Q ss_pred ----hhHhhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 316 ----EFQADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 316 ----~~~~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) .+....+++.+.++.++++|++.+.+..++.+ T Consensus 359 ~~~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~ 394 (408) T protein:vir:74 359 AGAFETDTTKIRVIDRFDVKATDSEALVAGSFTAIA 394 (408) T ss_pred cchhhcceeeEEEEEeeCcEEecccceEEEEeeccc Confidence 23345577888999999999999999987777 No 119 >protein:vir:95376 Length: 425 # NCBI annotation: phage major capsid protein # Family: family:all:635 # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764476;genbank:gi:115334630;genbank:GeneID:5179263 Probab=99.15 E-value=2.5e-12 Score=84.14 Aligned_cols=293 Identities=10% Similarity=0.038 Sum_probs=154.3 Q ss_pred CCCCccCcccc-----ccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEeccccce-eeeec Q lcl|Aclame:pro 1 MANATGGQQIG-----ANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVMGRTK-GYYLA 74 (347) Q Consensus 1 m~~~~~~~~~~-----~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~iG~~t-~~~~~ 74 (347) |.......... .+.......++--.+.-+.+..++.+..+..+.+++++++.+.. |+ +.||+.+... +..+. T Consensus 119 ~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vP~~~~~~Ii~~l~~~~~i~~~~~~~~~~-g~-~~ip~~~~~~~a~~v~ 196 (425) T protein:vir:95 119 LKTGEYYKRSEVVEFYEKFRNLRAVAGGELTIPEVVVNRIMDIMGDYTTLYPLVDKIRVK-GT-TRILVDTDTSPATWIE 196 (425) T ss_pred HhhhhhhhhhHHHHHHHHHHhhcccccCceeccHHHHHHHHHHHHhhhhHHHhhceeecC-ce-eEEEEecCCccccccc Confidence 11100000000 00000001111112455889999999999999999998877654 54 5777665543 33344 Q ss_pred CCCCCCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccc Q lcl|Aclame:pro 75 PGENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNEN 154 (347) Q Consensus 75 ~g~~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~ 154 (347) .|..++.+ .....+++++..-++ +.-+.|.+-=..++..|+.+.+.++.++++++..|+.||. +. +. T Consensus 197 E~~~~~~~-~~~~f~~i~l~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~il~----G~-------G~ 263 (425) T protein:vir:95 197 QSGALPTG-DVGTIASIDFDGFKV-GKVTFVDNYLLQDSIINLDDYVTKKIARAIAKALDLAIVK----GT-------GA 263 (425) T ss_pred cccccccc-cccccceeeeeheee-eeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHhhc----cC-------CC Confidence 45555432 112355555544443 2333444432334556899999999999999999998873 11 00 Q ss_pred cCcccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHH-HHHHhcchhh--hhhhcccccc Q lcl|Aclame:pro 155 IAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPED-YSAILSALMP--NAANYAALID 231 (347) Q Consensus 155 ~~g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~-~~~Ll~~~~~--~~~~~~~~~~ 231 (347) ..+.+.|..-..+.....+ .......++.|+++...+.....+...-+++++|.. |..|..-... .+..|... T Consensus 264 ~~~~p~Gil~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~l~~l~~~kd~~g~~i~~-- 339 (425) T protein:vir:95 264 ANKQPLGIIPSLPPENQVT--VEADNNLLKNLVKQIGLIDTGDDSVGEIVAVMKRSTYYNRLVEFSIQVDSNGNVVGK-- 339 (425) T ss_pred Cccccceeecccccccccc--cccccchHHHHHHHHHhhhhhccccCceEEEEeChHHHHHHHHHHhhcCCCCceeec-- Confidence 0011111111100000000 001112356677777777666654445445666654 4444322222 23344322 Q ss_pred ccccceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeecc Q lcl|Aclame:pro 232 PETGNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALER 311 (347) Q Consensus 232 ~~~G~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~ 311 (347) ...+....++|.+|+.++++|... -+.+||++. ++ +..++++++. T Consensus 340 ~~~~~~~~l~G~pvv~~~~~~~~~-------------------------i~~Gd~~~~--~~--------~~~~~~~i~~ 384 (425) T protein:vir:95 340 LPNLRTPDLLGLRVVFNNFLDDDT-------------------------VLFGEFEQY--TL--------VERENITIDS 384 (425) T ss_pred cCCCCCccccceeeEEcCcCCCcc-------------------------EEEEecccE--EE--------EeecceEEEe Confidence 235556789999999999998421 123455542 12 1233444444 Q ss_pred ccchhh--HhhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 312 ARRPEF--QADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 312 ~~~~~~--~~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) ..+... -...+++...++.++++|++.+.+..++.. T Consensus 385 ~~~~~f~~~~~~~~~~~r~d~~~~~~~a~~~~~i~~~~ 422 (425) T protein:vir:95 385 STHVKFTEDQTAFRGKGRFDGKPVKPEAFVLVTITDPV 422 (425) T ss_pred ecccccccCceEEEEEEeeCcEeecccceEEEEecCcC Confidence 433211 134577778899999999999999977755 No 120 >protein:vir:1383 Length: 421 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612835;genbank:gi:20065969;genbank:GeneID:935826 Probab=99.14 E-value=3.1e-12 Score=83.70 Aligned_cols=276 Identities=9% Similarity=0.000 Sum_probs=163.5 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEeccccce---eeeecCCC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVMGRTK---GYYLAPGE 77 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~iG~~t---~~~~~~g~ 77 (347) |-.... ..-.|.+....+|- .+.-+.|..++.+..+..+.+++++++.++.++ +.++|...... ......|. T Consensus 104 ~~~~~~--~~~~ra~~t~~~gg--~liP~~~~~~Ii~~~~~~~~l~~l~~~~~~~~~-~~~~~~~~~~~~~~~~~~~E~~ 178 (421) T protein:vir:13 104 IRGIQL--SEEERDIMSSTNNG--AVIPQEFVNEFEKLKEGYPSLKEHCHVIPVNRN-AGKMPVRAGASVDKLANLAKDT 178 (421) T ss_pred hhccch--hHHHhhccccCCcc--eecchhhHHHHHHHHHhhhhhhhhceeeeccCC-ceEEEEeecCCccceeeccccc Confidence 111000 00122222222221 134588999999888888989998887776644 45666433322 33344455 Q ss_pred CCCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccCc Q lcl|Aclame:pro 78 NLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIAG 157 (347) Q Consensus 78 ~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~g 157 (347) .++. .+++..++++.+.++ +.-..|.+-=...+.+|+.+.+.++++++++...|+.++..+... T Consensus 179 ~~~~--s~~~f~~i~~~~~k~-~~~v~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~i~~~~~g~------------- 242 (421) T protein:vir:13 179 ELVK--AMLKTQPMAYDIDDY-GLLAPIDNSLLEDSEINFLEFVNEEFAEFAVNTENAEIVKQAKAV------------- 242 (421) T ss_pred cccc--cccceeEEEeeeeee-EeehhhhHHHHhhhHHHHHHHHHHHHHHHHHHHhhhhHhhhhhhc------------- Confidence 5543 245667777777665 334445443334466889999999999999999998887533110 Q ss_pred ccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhccccccccccce Q lcl|Aclame:pro 158 LGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALIDPETGNI 237 (347) Q Consensus 158 ~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~v 237 (347) . . ..... -++.|+++...|..+..+. -.+|++|..|..|.+-. -.+..|.- .+...|.. T Consensus 243 -----~-~---~~~~~--------~~d~i~~~~~~l~~~~~~~--a~~v~n~~~~~~l~~lk-d~~G~~i~-~~~~~~~~ 301 (421) T protein:vir:13 243 -----L-A---EETIN--------DYAGLVKTINSLVPNARKR--AIIVTNSDGRAYLDGLM-DKQGRPLL-KELSDGGD 301 (421) T ss_pred -----c-c---ccccc--------chHHHHHHHHHhhhhhcCC--CEEEEcHHHHHHHHHhh-cCCCceee-cCcCCCCC Confidence 0 0 00000 1566777777777766543 35688999999987532 22344432 23456667 Q ss_pred EEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeeccccchhh Q lcl|Aclame:pro 238 RNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARRPEF 317 (347) Q Consensus 238 ~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~~~~~ 317 (347) ..++|++|+.++++|....+ ...-+.+||++.+. .+..++++++...+..+ T Consensus 302 ~tl~G~pV~~~~~~~~~~~~--------------------~~~~~~gd~~~~~~---------~~~~~~~~v~~~~~~~f 352 (421) T protein:vir:13 302 LVFKGRPVIELEESIFDVGD--------------------ETKFIVSDFKTLIK---------FMDRKQYLIDQSKEAGY 352 (421) T ss_pred ceecceeeEEeccccccCCC--------------------ceEEEEEeccccEE---------EEEecceEEEeeccccc Confidence 78999999999998842211 11123455554322 23445567776655543 Q ss_pred Hh--hHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 318 QA--DQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 318 ~~--d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) .- ..+++...++.++++|+++..+..+..+ T Consensus 353 ~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~ 384 (421) T protein:vir:13 353 TKNETIARIIERFDVNSPLDKSSDAEKIRKFG 384 (421) T ss_pred ccCeeEEEEEeeecceeecchhhheeeecccc Confidence 32 4678889999999999998776655433 No 121 >protein:vir:1025 Length: 408 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076679;genbank:gi:13095788;genbank:GeneID:920362 Probab=99.14 E-value=4.1e-12 Score=83.03 Aligned_cols=285 Identities=12% Similarity=0.034 Sum_probs=157.4 Q ss_pred CCCCcc-CccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCC-ceEEEeccccc--eeeeecCC Q lcl|Aclame:pro 1 MANATG-GQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNG-KSASFPVMGRT--KGYYLAPG 76 (347) Q Consensus 1 m~~~~~-~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G-~tv~i~~iG~~--t~~~~~~g 76 (347) +-+... ......|.-.....++--.+--+.|..++.+..+..+.+++++++.++.++ .++.+++.... .+.....| T Consensus 101 ~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~ 180 (408) T protein:vir:10 101 VRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAED 180 (408) T ss_pred hhcchhhhhhhhhhhhhcccccCCceeccHhHHHHHHHHHHhhchhhhhcceeeccCCcceEEEeeccccccceeeecCc Confidence 101000 000001111111112211244589999999999999999999887766532 23445544332 33334445 Q ss_pred CCCCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccC Q lcl|Aclame:pro 77 ENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIA 156 (347) Q Consensus 77 ~~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~ 156 (347) ..++.+ ..+...++++...++ +.-..|.+-=..++.+|+.+.+.++.++++++..|+.|+.-. T Consensus 181 ~~~~~~-~~~~~~~i~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~--------------- 243 (408) T protein:vir:10 181 GKIPDL-DNPQLTIIKYLIKRY-AGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVM--------------- 243 (408) T ss_pred cccccc-cCcceeeEEeeeeeE-EeeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhcc--------------- Confidence 555422 124556666666554 233345432233457899999999999999999999886421 Q ss_pred cccCceeeeecccccccchhhHHHHHHHHHHHHH-HHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhcccccccccc Q lcl|Aclame:pro 157 GLGQAVVLNIGAAADLVDVEARGKAILKGLTLAR-ARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALIDPETG 235 (347) Q Consensus 157 g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~-~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G 235 (347) +.+... .+. .. ++.|+++. ..|+...- .+-.+|++|..|..|.+-. -.+..|.-..++.+| T Consensus 244 --g~~~~~--~~~-------~~----~~~l~~~~~~~~~~~~~--~~a~~v~n~~~~~~l~~lk-d~~G~~i~~~~~~~~ 305 (408) T protein:vir:10 244 --KAAPKK--PTI-------AK----FDDVITMINTAVDPAII--ATSSLLTNQSGLNKLALVK-TAEGKYLLEPDPTKP 305 (408) T ss_pred --cccccc--ccc-------cc----HHHHHHHHHHhhhhhhc--cCCEEEEcHHHHHHHHHhh-ccCCceEeccCcCCC Confidence 111000 000 11 34455543 34544332 2345689999999988643 234455444456677 Q ss_pred ceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeeccccch Q lcl|Aclame:pro 236 NIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARRP 315 (347) Q Consensus 236 ~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~~~ 315 (347) ...+++|++|+.+.+.+....+. +...-|.+||++.+. .+...+++++..... T Consensus 306 ~~~~l~G~PV~~~~~~~~~~~~~------------------~~~~i~~gd~~~~~~---------~~~~~~~~v~~~~~~ 358 (408) T protein:vir:10 306 NSYLIKGKQVIVVADRWLPNTGS------------------TVYPLYYGDMSQAIT---------LFDRENMSLLPTNIG 358 (408) T ss_pred CCceecceeeEEecccccCccCC------------------CceEEEEEehhccEE---------EEEecceEEEEcccc Confidence 77899999999976533211100 001113445554322 222344455544322 Q ss_pred ----hhHhhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 316 ----EFQADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 316 ----~~~~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) ++-...+++.+.++.++++|++++.+..++++ T Consensus 359 ~~~f~~~~~~~r~~~r~d~~v~~~~a~~~~~~~~~~ 394 (408) T protein:vir:10 359 AGAFETDTTKIRVIDRFDVKATDSEALVAGSFSAIA 394 (408) T ss_pred cchhhcCceEEEEEEeeccEEeccccEEEEEeeccc Confidence 12234677888899999999999999999887 No 122 >protein:vir:2504 Length: 305 # NCBI annotation: major capsid subunit gp9 # Family: family:all:507 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569745;genbank:gi:18496895;genbank:GeneID:932268 Probab=99.13 E-value=1.9e-12 Score=84.89 Aligned_cols=283 Identities=13% Similarity=0.067 Sum_probs=149.2 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEeccc-cceeeeecCCCCC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVMG-RTKGYYLAPGENL 79 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~iG-~~t~~~~~~g~~~ 79 (347) ||+.+... | -.+.-+.+..++.+..++.+.++.+.++.++.+ .+.+||+.. .+.+.-+..|... T Consensus 1 ma~~t~~~------g--------g~liP~~~~~~Ii~~~~~~s~l~~l~~~~~~~~-~~~~~p~~~~~~~a~wv~E~~~~ 65 (305) T protein:vir:25 1 MADISRAE------V--------ASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGT-KTTHLPVLATLPEADWVGESATD 65 (305) T ss_pred CCCccCCc------c--------ceecCHHHHHHHHHHHHhhchhhhhcceeeccC-CcEEEEEEeCCcceEEeeccccc Confidence 88854311 1 124558899999999999999999988877754 467887654 3355545445443 Q ss_pred CCC---CCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccC Q lcl|Aclame:pro 80 DDK---RKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIA 156 (347) Q Consensus 80 ~~~---~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~ 156 (347) ... ..+++..++++..-+. +.-..|.+-=..++.+|+.+.+.++.+++|++..|+.++. +...... ..... T Consensus 66 ~~~~~~~s~~~f~~i~~~~~k~-~~~~~is~ell~ds~~~~~~~i~~~l~~~~a~~~d~a~~~----G~g~~~~-~~~~~ 139 (305) T protein:vir:25 66 PKGVKPTSKVTWANRTLVAEEI-AVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIF----GTDKPAS-WVSPA 139 (305) T ss_pred ccccccccccceeeEEeeeEEE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHhhhhee----ccCCCCC-ccccc Confidence 321 1123445444444443 2333444322223568899999999999999999998873 1110000 00000 Q ss_pred cccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhccccccccccc Q lcl|Aclame:pro 157 GLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALIDPETGN 236 (347) Q Consensus 157 g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~ 236 (347) ..+..... +..............+++.+..+...+....-.. .-++++|..|..|.+-.. .+..|. +.. T Consensus 140 ~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~v~~~~~~~~l~~lkd-~~G~~i----~~~-- 208 (305) T protein:vir:25 140 LIPAAVTA--GQAVEVVGGVANESDIVGATNRAAKAVASAGWAP--DTLLSSLALRYEVANIRD-ANGNPV----FRD-- 208 (305) T ss_pred cccccccc--cccccccccchhhhHHHHHHHHHHHhhhhccccc--ceeEecHHHHHHHHHhhc-cCCcee----ecC-- Confidence 00000000 0000111111112223444554444443332211 125789999999865321 122221 111 Q ss_pred eEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeeccccch- Q lcl|Aclame:pro 237 IRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARRP- 315 (347) Q Consensus 237 v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~~~- 315 (347) ..++|++|+.++++|.... ...-+.+||++.. + +..++++++...+. T Consensus 209 -~~l~G~Pv~~~~~~~~~~~---------------------~~~~~~gd~s~~~--i--------~~~~~~~i~~~~~~~ 256 (305) T protein:vir:25 209 -DSFAGFRTFFNRNGAWDAD---------------------AAIEVIADSSRVK--I--------GVRQDITVKFLDQAT 256 (305) T ss_pred -CcccccceEEcCccCCCCC---------------------ccEEEEEecceEE--E--------EEecCeEEEEeeeee Confidence 3689999999999874211 0112334555432 1 12222333322110 Q ss_pred ---------hhHh--hHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 316 ---------EFQA--DQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 316 ---------~~~~--d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) .++. -.++....+|..++||++++.+..++.| T Consensus 257 ~~~~~~~~~~~~~~~~~~R~~~r~~~~v~~p~a~v~~~~~~~~ 299 (305) T protein:vir:25 257 LGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVA 299 (305) T ss_pred eecCCceeeeeecCcEEEEEEEeecceeeCcccEEEEcccccc Confidence 1122 2456677889999999999999987666 No 123 >protein:vir:1268 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690760;genbank:gi:22855000;genbank:GeneID:955203 Probab=99.12 E-value=3.3e-12 Score=83.52 Aligned_cols=282 Identities=8% Similarity=-0.019 Sum_probs=160.3 Q ss_pred CCCCccC-------ccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCC-ceEEEec-cccceee Q lcl|Aclame:pro 1 MANATGG-------QQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNG-KSASFPV-MGRTKGY 71 (347) Q Consensus 1 m~~~~~~-------~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G-~tv~i~~-iG~~t~~ 71 (347) |.+.... .....|...+...++--.+.-+.|..++.+.....+.++.+.++..+.++ ..+.+++ .+.+.+. T Consensus 102 ~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~lvP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~ 181 (397) T protein:vir:12 102 LRGKRLTDEERDLLDSPEFRAMSGINDEDGGILIPEDIGRQIHEFKRQFEPLEQYVTVEPVTTRSGTRLLEKNADMVPFS 181 (397) T ss_pred HhccCCcHHHHHHHhhhhhhhccccccccCcccCchhHHHHHHHhhhhhhhHHhhcceeeccCCceeEEEEEecCCccee Confidence 0000000 00000111111111111245599999999999888988888887766532 2344443 5556667 Q ss_pred eecCCCCCCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccc Q lcl|Aclame:pro 72 YLAPGENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAAS 151 (347) Q Consensus 72 ~~~~g~~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~ 151 (347) .+..|..++.. ..++.+++++...+.- .-..|.+-=...+.+|+.+.+.++.+++|++..|..|+.-. T Consensus 182 ~v~Eg~~~~~~-~~~~~~~v~~~~~k~~-~~~~is~e~l~ds~~~l~~~i~~~l~~~~~~~~d~~il~G~---------- 249 (397) T protein:vir:12 182 PVEELGNLPEI-DQPRFTKVSYSIIDYG-GIMTLSNSMLNDSDQAIMTYVAKWFAKKSVVTRNNLILAAI---------- 249 (397) T ss_pred eeccccccccc-ccccceeEEeeheeeE-eeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHhcc---------- Confidence 77777766432 2356677777666552 33344433233456789999999999999999999887411 Q ss_pred ccccCcccCceeeeecccccccchhhHHHHHHHHHHHHH-HHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhccccc Q lcl|Aclame:pro 152 NENIAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLAR-ARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALI 230 (347) Q Consensus 152 ~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~-~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~ 230 (347) +.+.. .+ ... ++.|.++. ..|+...- .+-.++++|..|..|.+-. -.+..|.... T Consensus 250 -------g~~~~--~g----~~~--------~~~i~~~~~~~l~~~~~--~~a~~~~n~~~~~~L~~lk-d~~G~~l~~~ 305 (397) T protein:vir:12 250 -------ASLKK--VD----IDG--------LDGIKKALNVTLDPMVA--PGSIVLTNQDGYDWLDTLK-DGTGRYLLQP 305 (397) T ss_pred -------ccccc--cc----ccc--------HHHHHHHHhhccchhhh--CCCEEEEcHHHHHHHHHhh-ccCCceeecc Confidence 00000 00 000 34455443 34443332 3345789999999986532 2234454444 Q ss_pred cccccceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeec Q lcl|Aclame:pro 231 DPETGNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALE 310 (347) Q Consensus 231 ~~~~G~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e 310 (347) ++.+|..++++|.+|+.+++...... ++...-+-+||++.+ ..+..+.++++ T Consensus 306 ~~~~g~~~~l~G~pv~~~~~~~~~~~-------------------~~~~~~~~gd~~~~~---------~~~~~~~~~i~ 357 (397) T protein:vir:12 306 DPTNPTKKLLDGRPVVPFTNRVLKTQ-------------------KGKAPLIIGNLKEAI---------VLFDREQQSIA 357 (397) T ss_pred cccCCCCccccceeeEEecccccccC-------------------CCccEEEEEehhceE---------EEEeecceEEE Confidence 56677778999999999876432110 001111334554422 12233444555 Q ss_pred cccchh----hHhhHHhhhhhhcCcccccceEEEEEecCC Q lcl|Aclame:pro 311 RARRPE----FQADQIIGKYAMGHGGLRPEAAGALVFTPA 346 (347) Q Consensus 311 ~~~~~~----~~~d~i~~~~~~G~~~lRPe~~~~l~~~~a 346 (347) ..+.+. +-...+++.+.++.++++|++.+.+..++. T Consensus 358 ~~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~t~~ 397 (397) T protein:vir:12 358 STDTGAGAFETNSTKVRGIEREDVRKWDEDAVVFGQITVE 397 (397) T ss_pred EeccccchhhcCceEEEEEEeeccEEecccceEEEEEeeC Confidence 443322 223468888999999999999999999988 No 124 >protein:vir:5974 Length: 324 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690674;genbank:geneid:6329212;genbank:gi:22855068;goa:Q38582;uniprot:Q38582;genbank:GeneID:955303 Probab=99.11 E-value=7.4e-12 Score=81.61 Aligned_cols=279 Identities=14% Similarity=0.082 Sum_probs=173.3 Q ss_pred CCCCccCccccccCcccCccccHHHHHH-HHHhHHHHHHHHHHHhhhcc---cc---ccc-c---cCCceEEEeccccc- Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFL-KVFGGEVLTAFVRRSVTMDK---HM---VRT-I---QNGKSASFPVMGRT- 68 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~i-e~f~geV~~~f~~~s~~~~~---~~---~rt-i---~~G~tv~i~~iG~~- 68 (347) ||. |+. +| +|+ |+|...|.....+.+.|..- .+ ... + .+|+++.+|..+.. T Consensus 1 MA~--------T~l------sd---~i~peVf~~yv~~~~~~~~~l~qSg~i~~~a~i~~~l~~~~~G~~i~~P~~~~l~ 63 (324) T protein:vir:59 1 MAY--------TKI------SD---VIVPELFNPYVINTTTQLSAFFQSGIAATDDELNALAKKAGGGSTLNMPYWNDLD 63 (324) T ss_pred CCc--------eee------ec---eechhHHHHHHHhhhHHHHHHhhcccccccHHHHHHhhccCCCCEEEecccccCC Confidence 774 222 12 455 99999999888888766321 11 111 1 37999999998765 Q ss_pred -eeeeecCCCCCCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhc Q lcl|Aclame:pro 69 -KGYYLAPGENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNL 147 (347) Q Consensus 69 -t~~~~~~g~~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~ 147 (347) ..+.+..+..+. .+.+..++..-+|= .....+.+.|+-...+--|.+.+++++.+..++++.+..+|..|..+... T Consensus 64 Gd~~~v~~~~~i~--~~~l~t~~~~a~i~-~~~k~~~~tD~a~~~sg~dp~~~i~~q~a~~~~~~~~~~lia~l~g~~~~ 140 (324) T protein:vir:59 64 GDSQVLNDTDDLV--PQKINAGQDKAVLI-LRGNAWSSHDLAATLSGSDPMQAIGSRVAAYWAREMQKIVFAELAGVFSN 140 (324) T ss_pred CcccccCCCcccc--hhhcccceeeEEEE-eecCceeehhhhhhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhc Confidence 456777777765 35677777666665 46788889999888888899999999999999999999888766433211 Q ss_pred ccccccccCcccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhcc Q lcl|Aclame:pro 148 PAASNENIAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYA 227 (347) Q Consensus 148 a~~~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~ 227 (347) .... .+...+.++.+ +... .+.|.+|..+|.++. ..-..++|.|..|..|.+..-+....+. T Consensus 141 ~~~~---------~~~~dvsa~~~---~~~s----~~~l~~A~~~~GD~~--~~~~~ivmhS~v~~~L~~~~li~~~~~s 202 (324) T protein:vir:59 141 DDMK---------DNKLDISGTAD---GIYS----AETFVDASYKLGDHE--SLLTAIGMHSATMASAVKQDLIEFVKDS 202 (324) T ss_pred cccc---------cceeeeecccc---ceec----HHHHHHHHHHhCCcc--cCcEEEEEchHHHHHHHHhhhhhhcccc Confidence 1111 11122211111 1111 356788888887764 2345888999999999976422112221 Q ss_pred ccccccccceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhh-h Q lcl|Aclame:pro 228 ALIDPETGNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLK-D 306 (347) Q Consensus 228 ~~~~~~~G~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~-~ 306 (347) -..+.|+.+.|.+|+.+..+|..... +...+| ..+++-+-|++....+ + T Consensus 203 ----~~~~~i~~~~G~~VivdD~~p~~~~~-------------------~~~~~y-------~s~l~~~GAi~~~~~~~~ 252 (324) T protein:vir:59 203 ----QSGIRFPTYMNKRVIVDDSMPVETLE-------------------DGTKVF-------TSYLFGAGALGYAEGQPE 252 (324) T ss_pred ----ccCceeeeecccEEEEeCCCCccccC-------------------CCCceE-------EEEEEecCeEEEeecCCC Confidence 12457899999999999999853211 111112 2356667777776654 3 Q ss_pred eeeccccchhhHhhHHhhhhhhcCccc--ccceEEEEEecCCC Q lcl|Aclame:pro 307 MALERARRPEFQADQIIGKYAMGHGGL--RPEAAGALVFTPAA 347 (347) Q Consensus 307 ~~~e~~~~~~~~~d~i~~~~~~G~~~l--RPe~~~~l~~~~aa 347 (347) +.+|..|++..-.|.+...+.|...+. .......-...|+- T Consensus 253 v~vE~dRd~~~g~~~l~~r~~~~~~p~G~s~~~~~~~~~sPt~ 295 (324) T protein:vir:59 253 VPTETARNALGSQDILINRKHFVLHPRGVKFTENAMAGTTPTD 295 (324) T ss_pred cceecccCccccceEEEEeeEEEeEeeeEEecccccCCCCCCh Confidence 568989998877777666666553332 22111101122222 No 125 >protein:vir:81227 Length: 413 # NCBI annotation: gp6, major capsid protein # Family: family:all:585 # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456736;genbank:gi:157168379;hssp:P49861;interpro:IPR006444;uniprot:Q9MBJ9;genbank:GeneID:5580350 Probab=99.11 E-value=5.3e-12 Score=82.39 Aligned_cols=291 Identities=12% Similarity=0.035 Sum_probs=154.8 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEeccccc-----eeeeecC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVMGRT-----KGYYLAP 75 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~iG~~-----t~~~~~~ 75 (347) +... -......+-..++..++...+..+.|..++++.....+.+++++++.+..+ .++.+++.... .+..+.. T Consensus 105 ~~~~-~~~~~~~~~~~~~~~~~~~~~vp~~~~~~ii~~~~~~~~l~~~~~~~~~~~-~~~~~~~~~~~~~~~~~a~~v~E 182 (413) T protein:vir:81 105 YVAP-RVKAASDPASTATLTDEFQGGYGTTWNRNIIYRRREKLVVADLMDNLTMTN-TTIKYLMEKANRVVEGGFKTVAE 182 (413) T ss_pred hhhh-HHHhhhhhhhhcccccccccccchhhHHHHHHHHhhhhhHHhhcceeeccC-CceeEEEeccccccccccceecC Confidence 0000 000000011112223344446678999999999999999999988877754 45666654322 2333444 Q ss_pred CCCCCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccc Q lcl|Aclame:pro 76 GENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENI 155 (347) Q Consensus 76 g~~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~ 155 (347) |..++.. ......++++.+.++- ....|.+- -.+.+.++-+.+.++.++++++..|+.+|. +... ...+ T Consensus 183 g~~~~~~-~~~~f~~i~~~~~k~~-~~~~iS~e-ll~ds~~l~~~i~~~la~~~~~~~d~~~l~----G~G~----~~~~ 251 (413) T protein:vir:81 183 GGKKPYM-RFADFDIVTESLSKIA-GLTKITDE-MIEDYDFLVSYINARLLEELAIEEERQLLL----GDGT----GNNL 251 (413) T ss_pred ccccccc-CcccceeeEeeeeeEE-EeehhhHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhc----cCCC----CCcc Confidence 5554322 1124566666666652 23445432 222223477888888999999999998763 1100 0111 Q ss_pred CcccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhccc------- Q lcl|Aclame:pro 156 AGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAA------- 228 (347) Q Consensus 156 ~g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~------- 228 (347) .|. .-..+.. +........+++.+.++...+..+..-.... +|++|..|..|.+-.. .++.|.. T Consensus 252 ~Gi----~~~~~~~---~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~-~vmn~~~~~~l~~lkd-~~G~~l~~~~~~~~ 322 (413) T protein:vir:81 252 TGL----LKRDGIQ---TLAVSNKDELADSIYKAMTNISLATPFQADA-LVINPLDYQELRLAKD-ANGQYYGGGVFQGQ 322 (413) T ss_pred ccc----ccccccc---cccccccchhHHHHHHHHHHhhhhccCCCcE-EEEcHHHHHHHHHhhc-cCCceecccccccc Confidence 111 0000000 0001111223566666665555443322233 5889999998754321 1233321 Q ss_pred cccccccceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhhee Q lcl|Aclame:pro 229 LIDPETGNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMA 308 (347) Q Consensus 229 ~~~~~~G~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~ 308 (347) .+....+...+++|.+|+.|+.+|... -+.+||++.. +++. .+.++ T Consensus 323 ~~~~~~~~~~~l~G~pv~~s~~~~~~~-------------------------~~~gd~~~~~-~~~~--------~~~~~ 368 (413) T protein:vir:81 323 YGSGGIMLDPAPWGLRTVQSQVVPVGK-------------------------PVVGAFRSAA-SVLR--------KGGVR 368 (413) T ss_pred ccccccccCceecceeeEEcCCCCccc-------------------------EEEEecccEE-EEEE--------ecceE Confidence 111122234579999999999998421 1345665533 2332 23345 Q ss_pred eccccchh--hHhh--HHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 309 LERARRPE--FQAD--QIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 309 ~e~~~~~~--~~~d--~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) ++..+... +.-+ .+++...++..+.+|++.+.+..++++ T Consensus 369 v~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~~~ 411 (413) T protein:vir:81 369 IDSTNTNVDDFENNLITVRAEERVGLMVTFPEAIVQLDVAEVV 411 (413) T ss_pred EEEeccccchhhcCcEEEEEEEeeccEEecccceEEEEecCCC Confidence 55444321 2223 566777899999999999999988888 No 126 >protein:vir:100172 Length: 394 # NCBI annotation: putative major head protein # Family: family:all:21 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025031;genbank:gi:48697264;genbank:GeneID:2948270 Probab=99.09 E-value=1.7e-11 Score=79.61 Aligned_cols=278 Identities=10% Similarity=0.043 Sum_probs=153.4 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEecc--ccceeeeecCCCC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVM--GRTKGYYLAPGEN 78 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~i--G~~t~~~~~~g~~ 78 (347) ..+..... +.......++--.+.-+.|..++.+..+..+.++++.++.++.++ +.++|.. +...+.....|.. T Consensus 101 ~~~~~~~~----~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~E~~~ 175 (394) T protein:vir:10 101 HSHGKVID----NAAGHVTSTEAGVLIPEEIIYDPTAEVNSVVDLSTLVTKTPVTTP-KGTYPILKRATDRFSSVAELAE 175 (394) T ss_pred hccchhhh----hhhcccccccCceeccHHHHHHHHHHHHhhhhhhhhceeeeccCC-ceEEEEEecCCCcccccccccc Confidence 00000000 000001111111134589999999999999999999887776543 4555544 3344444444444 Q ss_pred CCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccCcc Q lcl|Aclame:pro 79 LDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIAGL 158 (347) Q Consensus 79 ~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~g~ 158 (347) .+.. ..+...++++.+-++ +.-..|.+-=..++.+|+.+.+.+++++++++..|+.|+.-.. T Consensus 176 ~~~~-~~~~~~~v~l~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~il~g~g---------------- 237 (394) T protein:vir:10 176 NPAL-AEPEFEQVDWSVSTY-RGAIPLSEEAIADSAVDLTSLVGQSINEKSVNTYNAMIAPVLQ---------------- 237 (394) T ss_pred cccc-ccccceeEEeeeeee-EeeehhHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhhccc---------------- Confidence 4321 235666766666554 3334454433344678999999999999999999998864210 Q ss_pred cCceeeeecccccccchhhHHHHHHHHHHHHHH-HHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhccccc----ccc Q lcl|Aclame:pro 159 GQAVVLNIGAAADLVDVEARGKAILKGLTLARA-RLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALI----DPE 233 (347) Q Consensus 159 ~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~-~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~----~~~ 233 (347) .+.... ..+. . .++.|.++.. .++... .-.+|++|..|..|.+-. -.+..|.-.. ... T Consensus 238 -~~~~~~--~~~~-----~----~~d~l~~~~~~~~~~~~----~a~~vmn~~~~~~l~~lk-d~~G~~i~~~~~~~~~~ 300 (394) T protein:vir:10 238 -SFTAKA--TTTD-----T----LVDSLKHILNVDLDPAY----SRALVVTQSLFNTLDTLK-DKNGRYLLHDASDSITD 300 (394) T ss_pred -cccccc--cccc-----c----cHHHHHHHHHhhhhhhc----cCEEEecHHHHHHHHHhh-ccCCCeeeecccccccc Confidence 010000 0000 0 1344544432 333322 246789999999987632 2233332111 122 Q ss_pred ccceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeecccc Q lcl|Aclame:pro 234 TGNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERAR 313 (347) Q Consensus 234 ~G~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~ 313 (347) .|.-++++|++|+.+++...... ++...-+.+||++..- ++ ..++++++... T Consensus 301 ~~~~~~L~G~PV~~~~~~~~~~~-------------------~~~~~i~~gd~s~~~~-~~--------~~~~~~v~~~~ 352 (394) T protein:vir:10 301 GTAKGTVLGVPVYVVGDALLGSA-------------------AGDQKAFVGDLKRGVL-FA--------DRQQVTLAWED 352 (394) T ss_pred CCcccccccceeEEecccccCCC-------------------CCceEEEEeeccccEE-EE--------eecceEEEEec Confidence 34456899999998765422110 0001114456665332 22 23444555433 Q ss_pred chhhHhhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 314 RPEFQADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 314 ~~~~~~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) + ..+...+++.+.++.++++|++++.+..+++| T Consensus 353 ~-~~~~~~~~~~~r~d~~~~~~~ai~~~~~~~~~ 385 (394) T protein:vir:10 353 S-KIYGRYLGAAFRFGVKQADSNAGYFVTNTDAA 385 (394) T ss_pred c-cccceeEEEEEEeccEEeccccEEEEEeeccc Confidence 2 33445678889999999999999999988888 No 127 >protein:vir:96762 Length: 632 # NCBI annotation: putative phage-related protein # Family: family:all:21 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039818;genbank:gi:126010917;genbank:GeneID:5076272 Probab=99.06 E-value=3e-12 Score=83.76 Aligned_cols=286 Identities=13% Similarity=0.074 Sum_probs=150.5 Q ss_pred CCCCc------------------c---------CccccccCcccCccccHHHHHHHH-HhHHHHHHHHHHHhhhcc-ccc Q lcl|Aclame:pro 1 MANAT------------------G---------GQQIGANQGKGQSAADKLALFLKV-FGGEVLTAFVRRSVTMDK-HMV 51 (347) Q Consensus 1 m~~~~------------------~---------~~~~~~~~~~~~~~~d~~al~ie~-f~geV~~~f~~~s~~~~~-~~~ 51 (347) +++.. + ..-...|....+..++--.|-... +..++.+..+..++++.+ .++ T Consensus 316 ~a~~~~~~a~~~~e~a~~~a~~~G~~arg~~~~~~~l~~ra~~~~t~~~gg~lvp~~~~~~~iie~lr~~s~i~~l~~~~ 395 (632) T protein:vir:96 316 AATGDWSKAGFEREVSLAIADASGKEARGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARM 395 (632) T ss_pred hhccchhhhhhhhHHHHHHHHhhhhhhhhhhhhHHHHHHhhhhcccccccccccccccchHHHHHHHhhcchhhhhcceE Confidence 00000 0 000001111111111111133434 467777777677777766 333 Q ss_pred ccccCCceEEEecc-ccceeeeecCCCCCCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHH Q lcl|Aclame:pro 52 RTIQNGKSASFPVM-GRTKGYYLAPGENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALA 130 (347) Q Consensus 52 rti~~G~tv~i~~i-G~~t~~~~~~g~~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa 130 (347) -+...| .+.||+. |.+++..+.-|..++.+ +++.+++++..-++ +.-+.|.+-=..++.+|+.+.+.++++++|+ T Consensus 396 ~~~~~g-~~~ip~~~~~~~a~wv~E~~~~~~s--~~~f~~i~l~~~k~-~~~v~iS~ell~ds~~~~~~~i~~~l~~a~~ 471 (632) T protein:vir:96 396 LPGLVG-DVDIPKKTSGANFYWIGEDEDVQDS--DFDFTTLSFSPKTI-AGAVPVTRKLRKQSSIHVENLIREDLIEGIG 471 (632) T ss_pred eecCCc-ceEEEEEeCCceeEeecCCcccccc--ccceeeEEeeeeEE-EEehhhHHHHHhccchHHHHHHHHHHHHHHH Confidence 344444 4778865 55566555566666543 46666666655443 2333343322335678999999999999999 Q ss_pred HHHHHHHHHHHHHhhhcccccccccCcccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChH Q lcl|Aclame:pro 131 IAADGAVLAEMAKLCNLPAASNENIAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPE 210 (347) Q Consensus 131 ~~~D~~il~~l~~~a~~a~~~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~ 210 (347) +..|+.+|. +... .+.+.|..-..+......... ..-++.|+++..++...++....-..+++|. T Consensus 472 ~~~d~a~l~----G~G~--------~~~p~Gi~~~~~~~~~~~~~~---~~~~~~i~~~~~~i~~~~~~~~~~~~~~~~~ 536 (632) T protein:vir:96 472 VALDLAMLT----GTGL--------ANDPVGLLNMTGVPALTYPAG---GVDWASVVDMETKISTFNADAGRLAYLTSVT 536 (632) T ss_pred HHHHHHhhc----ccCC--------CCccceeeecccccceecccc---cCCHHHHHHHHHHHhhcccccCccEEEEchh Confidence 999998863 1110 111222111111100000000 0125678888888888887665666788998 Q ss_pred HHHHHhcch-hhhhhhccccccccccceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccce Q lcl|Aclame:pro 211 DYSAILSAL-MPNAANYAALIDPETGNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNV 289 (347) Q Consensus 211 ~~~~Ll~~~-~~~~~~~~~~~~~~~G~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~ 289 (347) .+..|.... +-.++.|. +.. +.+.|++|+.|+.+|... -..+||+.. T Consensus 537 ~~~~l~~~~l~d~~G~~i----~~~---~~l~G~pv~~s~~ip~~~-------------------------~~~gd~s~~ 584 (632) T protein:vir:96 537 QRGAAKKAQVFDNTGERI----WQN---NEVNGYRAEASNQIPADT-------------------------WIFGDWSQI 584 (632) T ss_pred HHHHHHHHhccCCCCcee----ecC---CeecccceEeccccccCc-------------------------EEEeecceE Confidence 887776432 11123332 122 368999999999998421 112344442 Q ss_pred eEEeechhhhhhhhhhheeeccccchhhHhhHHhhhhhhcCcccccceEEEEEecC Q lcl|Aclame:pro 290 VGLFNHRSAVGTVKLKDMALERARRPEFQADQIIGKYAMGHGGLRPEAAGALVFTP 345 (347) Q Consensus 290 ~~l~~h~~A~~tv~~~~~~~e~~~~~~~~~d~i~~~~~~G~~~lRPe~~~~l~~~~ 345 (347) . +. ......+.+..+-....-.-.++.++.++.++++|++.+.++.++ T Consensus 585 ~--i~------~~~~~~i~~~~~~~~~~~~v~~~~~~~~d~~v~~~~af~~~k~~A 632 (632) T protein:vir:96 585 V--IA------MWGVLDLKVDPYTKAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) T ss_pred E--EE------EecceEEEEccccccccCceEEEEEeecCceeechhhhhheeecC Confidence 1 11 111111221111111222336778899999999999999998887 No 128 >protein:vir:100884 Length: 389 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358764;genbank:gi:78000028;genbank:GeneID:3726155 Probab=99.05 E-value=2.8e-11 Score=78.43 Aligned_cols=278 Identities=8% Similarity=0.007 Sum_probs=152.6 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEecc--ccceeeeecCCCC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVM--GRTKGYYLAPGEN 78 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~i--G~~t~~~~~~g~~ 78 (347) |-...-. .+.-.+...++--.+--+.|..++.+..+..+.++.+.++.++.++ +.+++.. +......+..|.. T Consensus 99 lr~~~~~----~~~~~~~t~~~gg~~vP~~~~~~i~~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~E~~~ 173 (389) T protein:vir:10 99 IHSHGKV----IDATSKVTSTEAGVLIPEEIIYDPTAEVNSVVDLSTLVTKTPVTTP-KGTYPILKRATDRFSSVAELAE 173 (389) T ss_pred hhcchhh----hhhhcccccCCcceeehHHHHHHHHHHHHhhhhHHhhcceeeccCC-eeEEEEEecCCCcccccccccc Confidence 1000000 0000011111111134488999999999889989888887776543 3455543 3333344444444 Q ss_pred CCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccCcc Q lcl|Aclame:pro 79 LDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIAGL 158 (347) Q Consensus 79 ~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~g~ 158 (347) .+.. ..++..++++.+.++ +.-+.|.+-=...+.+|+.+.+.++.+++|++..|..|+.-+.. T Consensus 174 ~~~~-~~~~~~~i~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~i~~g~~~--------------- 236 (389) T protein:vir:10 174 NPKL-AEPEFNKVDWSVATY-RGAIPLSEEAIADSAVDLTALVGQSIKEKSVNTYNAMIAPVLQS--------------- 236 (389) T ss_pred cccc-ccccceeeeeeheee-EeeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhhhhcc--------------- Confidence 4321 235666666666554 33334443323345678999999999999999999988643210 Q ss_pred cCceeeeecccccccchhhHHHHHHHHHHHHHH-HHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhccccc----ccc Q lcl|Aclame:pro 159 GQAVVLNIGAAADLVDVEARGKAILKGLTLARA-RLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALI----DPE 233 (347) Q Consensus 159 ~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~-~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~----~~~ 233 (347) +. ..+.++. . -++.|.++.. .++.. .+-.++++|..|..|.+-.. .++.|.-.. ... T Consensus 237 --~~--~~~~~~~-----~----~~d~l~~~~~~~~~~~----~~a~~~~n~~~~~~L~~lkd-~~G~~i~~~~~~~~~~ 298 (389) T protein:vir:10 237 --FT--AKKTTTD-----T----LVDSLKHILNVDLDPA----YSRALVVTQSLFNTLDTLKD-KNGRYLLHDASDSITD 298 (389) T ss_pred --cc--ccccccc-----c----cHHHHHHHHHhhhhhh----hCcEEEecHHHHHHHHHhhc-cCCCeeeecCcccccc Confidence 00 0000110 1 1444554432 33322 23467899999999986432 234443221 222 Q ss_pred ccceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeecccc Q lcl|Aclame:pro 234 TGNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERAR 313 (347) Q Consensus 234 ~G~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~ 313 (347) .|...+++|.+|+.+++......+ +...-+.+||++.+.+ +..++++++... T Consensus 299 ~~~~~~l~G~pV~~~~~~~~~~~~-------------------~~~~~~~gd~~~~~~~---------~~~~~~~i~~~~ 350 (389) T protein:vir:10 299 GTAKGTILGVPVYVVGDTLLGSLA-------------------GDQKAFVGDLKRGVLF---------TDRQQVTLAWED 350 (389) T ss_pred cccccccccceeEEecccccCCCC-------------------CceEEEEeeccccEEE---------EeecceEEEeec Confidence 355578999999987653211100 0001144566553322 223445555443 Q ss_pred chhhHhhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 314 RPEFQADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 314 ~~~~~~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) + ..+...+++.+.+|..+++|++++.+..++++ T Consensus 351 ~-~~~~~~~~~~~r~d~~~~~~~a~~~~~~~~~~ 383 (389) T protein:vir:10 351 S-KIYGKYLGAAFRFGVQKADSKAGYFVTNTDVP 383 (389) T ss_pred c-ccccceEEEEEEeccEEecccceEEEEeeccC Confidence 3 44556788899999999999999988877666 No 129 >protein:vir:6212 Length: 434 # NCBI annotation: prohead protease # Family: family:all:21 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852592;genbank:gi:31415852;genbank:GeneID:1489210 Probab=99.05 E-value=1.1e-11 Score=80.68 Aligned_cols=290 Identities=9% Similarity=0.032 Sum_probs=150.3 Q ss_pred CCCCccCccccccCcc-cCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEecc-ccceeeeec---C Q lcl|Aclame:pro 1 MANATGGQQIGANQGK-GQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVM-GRTKGYYLA---P 75 (347) Q Consensus 1 m~~~~~~~~~~~~~~~-~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~i-G~~t~~~~~---~ 75 (347) |.. +. ...+.-. +...++--.|.-+.|..+|.+..+..+.++.+.++.... | .+.+|+. +..++.... . T Consensus 131 l~~-~~---~~~e~~a~~~~t~~GG~lvP~~~~~~Ii~~l~~~~~i~~~~~~~~~~-~-~~~~p~~~~~~~a~~~~~~~e 204 (434) T protein:vir:62 131 IVG-NI---DEKEARALGLVTGNGSVTIPDFLSKEIITYAQEENFLRRLGTGVKTK-E-NIKYPVLVKKAEAQGHKNERT 204 (434) T ss_pred hcc-cc---chhhhhhhcccccccceecchhhHHHHHHhhhhhhhhhhhcceeccC-C-ceEEEEEecCCcccceecccc Confidence 111 00 0000000 001111111344899999999999999998888765443 3 4677764 223322221 1 Q ss_pred CCCCCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccc Q lcl|Aclame:pro 76 GENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENI 155 (347) Q Consensus 76 g~~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~ 155 (347) |...+ ..+++..++++.+-++ +.-..|.+-=...+.+|+.+.+.++.+++|++..|+.|+. +.. ..... T Consensus 205 ~~~~~--~~~~~f~~v~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~d~~~l~----G~G----~~~~~ 273 (434) T protein:vir:62 205 NNEMP--ETDIEFDEIELSPTEF-DALATVTKKLLARTGLPIEQIVMDELKKAYVRKETQYMVN----GDE----ANNIN 273 (434) T ss_pred ccccc--ccccceeeEEeeheee-EeehhhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhc----cCC----CCccc Confidence 22322 2234555555555443 2233343322234567999999999999999999998873 110 00111 Q ss_pred CcccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhcccc--cccc Q lcl|Aclame:pro 156 AGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAAL--IDPE 233 (347) Q Consensus 156 ~g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~--~~~~ 233 (347) .|......++. .......++.|+++...|+....+ .. .+|++|..|..|.+- +-.+..|.-. .... T Consensus 274 ~g~~~~~~~~~---------~~~~~~~~d~l~~l~~~l~~~~~~-~a-~~v~n~~~~~~L~~l-kd~~G~~l~~~~~~~~ 341 (434) T protein:vir:62 274 DGALAKKAVEF---------KTDEKNLYDALVKMKNTPVKEVRK-KA-RWVLNTAALTKIETM-KTDDGFPLLRPFNQAE 341 (434) T ss_pred cceeecccccc---------cccccchhhHHHHHHhhcchhhhc-CC-EEEEcHHHHHHHHHh-hccCCCEeeccCCCcc Confidence 11111111111 111223478888888888776543 22 447899999988542 2224445322 2334 Q ss_pred ccceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeecccc Q lcl|Aclame:pro 234 TGNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERAR 313 (347) Q Consensus 234 ~G~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~ 313 (347) .|....++|++|+.++.+|....+.. ..=|.+||+... ++.+.. .++++... T Consensus 342 ~g~~~tl~G~pV~~~~~~~~~~~~~~-------------------~~i~~Gdfs~~~--i~~~~g-------~~~i~~~~ 393 (434) T protein:vir:62 342 GGIGYTLLGFPVEEEDAIDIPDSPDT-------------------PVFYFGDFSKFY--IQDVIG-------SLEVQKLV 393 (434) T ss_pred CCCCceecceeeEEecCccCccCCCc-------------------eEEEEeeccceE--EEEeec-------eeEEEeeh Confidence 56666899999999999985322111 011345666432 222111 12233222 Q ss_pred chhhHhh--HHhhhhhhcCccc-ccceEEEEEe---cCCC Q lcl|Aclame:pro 314 RPEFQAD--QIIGKYAMGHGGL-RPEAAGALVF---TPAA 347 (347) Q Consensus 314 ~~~~~~d--~i~~~~~~G~~~l-RPe~~~~l~~---~~aa 347 (347) +.-+.-+ .+++...+.++++ +|++..++.. +|++ T Consensus 394 ~~~~~~~~v~~~~~~r~Dgk~i~~~~~~~~~~~~~~~~~~ 433 (434) T protein:vir:62 394 ELFSRTNRVGFRIWNLLDAQLIHSPFEVPVYKYVLKAPTG 433 (434) T ss_pred hhhcccCceEEEEEeeecceeecCcccceEEEEEeccCCC Confidence 2111112 3678888888876 4888877742 3333 No 130 >protein:vir:102944 Length: 330 # NCBI annotation: major head protein # Family: family:all:1522 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945286;genbank:gi:39653721;uniprot:Q708M6;genbank:GeneID:2672858 Probab=99.05 E-value=9.7e-12 Score=80.95 Aligned_cols=281 Identities=14% Similarity=0.096 Sum_probs=164.9 Q ss_pred CCCCccCccccccCcccCccccHHHHHH-HHHhHHHHHHHHHHHhhhc---ccccccc-----cCCceEEEeccccc--e Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFL-KVFGGEVLTAFVRRSVTMD---KHMVRTI-----QNGKSASFPVMGRT--K 69 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~i-e~f~geV~~~f~~~s~~~~---~~~~rti-----~~G~tv~i~~iG~~--t 69 (347) |||- . |+. +| +|+ |+|...|.+...+.+.|.. +++...+ .+|+++.+|..+.. . T Consensus 1 Ma~~-~-----T~l------~d---~i~pevf~~yv~~~~~~~~~l~qSG~i~~~~~i~~~~~~~G~~i~~P~~~~l~G~ 65 (330) T protein:vir:10 1 MANE-L-----TKI------LD---TITPQQYNAYMQQYTAAKSAFVQSGIAVSDERVSKNITSGGLLVNMPFWNDLTGD 65 (330) T ss_pred CCCC-c-----eEe------ee---eechhHHHHHHHHHhHHhhhhhhcccccccHHHHHHhhcCCCEEEecccccCCCc Confidence 9982 1 222 22 455 9999999988888775532 1222122 36999999988755 3 Q ss_pred eeeecCCC-CCCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcc Q lcl|Aclame:pro 70 GYYLAPGE-NLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLP 148 (347) Q Consensus 70 ~~~~~~g~-~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a 148 (347) .+.+..|. .+. ++.+...+..-+|=. .-..+.+.|+-....--|.+.++.+|.+...++..+..++..+.+..... T Consensus 66 ~~~~~dg~~~i~--~~ki~t~~~~a~i~~-~~k~~~~tD~a~~~~g~dp~~~i~~q~a~~w~~~~q~~lla~l~gvf~~~ 142 (330) T protein:vir:10 66 SEVLGNGDKALE--TGKITAGADIACVLY-RGRGWAANELTGVVAGSDPVRAILNRIGAYWLREDQKALIATLNGIFATG 142 (330) T ss_pred ccccCCCccccc--hhhcccceeEEEEEe-ecceeeehhhhhhhcchhHHHHHHHHHHHHhhhhHHHHHHHHHHhhhhhh Confidence 34454453 443 355776666655544 35568889999888888999999999999999998888887665443321 Q ss_pred cccccccCcccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhh-hhcc Q lcl|Aclame:pro 149 AASNENIAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNA-ANYA 227 (347) Q Consensus 149 ~~~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~-~~~~ 227 (347) ..... .....+.....+.. ..... ++.|.+|..+|.++. ..-..+++.|..|..|.+.. +++ ..+. T Consensus 143 ~~~~~---~~~~~~~~~~~~~~---~a~~s----~~~l~~A~~~~GD~~--~~~~~ivmhS~v~~~L~~~~-li~~~~~s 209 (330) T protein:vir:10 143 TAGEK---GALEETHVSDQSKA---STGID----AGMVLDAKQLLGDSA--DQVTAIAMHSAVYTKLQKDN-LIQYIQPT 209 (330) T ss_pred hcccc---hhhhhhheeccccc---ccccC----HHHHHHHHHHhcccc--ccceEEEEcHHHHHHHHHhh-hhhhhccc Confidence 11110 00000000000000 00011 356788888887765 34568899999999999743 433 2221 Q ss_pred ccccccccceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhh- Q lcl|Aclame:pro 228 ALIDPETGNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKD- 306 (347) Q Consensus 228 ~~~~~~~G~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~- 306 (347) ...+.|+.+.|.+|+.+..+|.... +| ..++|-+-|++..+..+ T Consensus 210 ----~~~~~i~~~~G~~VivdD~~p~~~~------------------------~y-------t~yl~~~GAi~~~~~~~~ 254 (330) T protein:vir:10 210 ----TATINIPTYLGYRVIIDDGIAPTGD------------------------IY-------TSYLFRTGSIGLNTGNPS 254 (330) T ss_pred ----ccCcccccccceEEEEeCCCCCCCC------------------------ce-------eEEEEecCceeeecccCC Confidence 1246789999999999999984211 11 01344455555554332 Q ss_pred --eeeccccchhhHhhHHhhhhhhcCcccccceEEEEE----ecCCC Q lcl|Aclame:pro 307 --MALERARRPEFQADQIIGKYAMGHGGLRPEAAGALV----FTPAA 347 (347) Q Consensus 307 --~~~e~~~~~~~~~d~i~~~~~~G~~~lRPe~~~~l~----~~~aa 347 (347) +..|..|++..-.+.+...+.|...+.=........ ..|+- T Consensus 255 ~~v~~EtdRd~~~g~~~l~~r~~~~~hp~G~s~~~~~~~~~~~sPt~ 301 (330) T protein:vir:10 255 GLTTFETSREAAKGNDMIYTRRALVMHPYGVKWTGAEVDAGNITPSN 301 (330) T ss_pred ccccccccCCccccceEEEEeeEEEeeeeeeeecccccccCcCCcCh Confidence 467888888776677766666554332111111100 11221 No 131 >protein:vir:1433 Length: 435 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536362;genbank:gi:17975167;genbank:GeneID:929171 Probab=99.05 E-value=7.2e-11 Score=76.18 Aligned_cols=299 Identities=14% Similarity=0.094 Sum_probs=149.6 Q ss_pred CCCCccC-c----------cccccCcccC--ccccHHHHHHHHHhHHHHHHHHHHHhhhcc-cccccccCCceEEEecc- Q lcl|Aclame:pro 1 MANATGG-Q----------QIGANQGKGQ--SAADKLALFLKVFGGEVLTAFVRRSVTMDK-HMVRTIQNGKSASFPVM- 65 (347) Q Consensus 1 m~~~~~~-~----------~~~~~~~~~~--~~~d~~al~ie~f~geV~~~f~~~s~~~~~-~~~rti~~G~tv~i~~i- 65 (347) +...... . ....+....+ ..++--.+.-+.+..++.+..+..+.++.+ .+..+..+| .+.+|+. T Consensus 105 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~l~~~~~i~~~~~~~~~~~~~-~~~~p~~~ 183 (435) T protein:vir:14 105 LAAARGDAQLASKLAIERGFGEEVAMSLNTLSPGAGGVLVPENLSSEVIELLRPKSVVRKLGARTLPLSNG-NITIPRLK 183 (435) T ss_pred HHhhcchhhHHHHHHHhhhhhhhhhhhcccCCcCCCccccchhHHHHHHHHHhhhchhhhhcceeeecCCC-ceEEEEEe Confidence 0000000 0 0000000000 001100133478888888888777777765 444444444 4788876 Q ss_pred ccceeeeecCCCCCCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhC--cchHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 66 GRTKGYYLAPGENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNH--YDVRAEYSAQLGEALAIAADGAVLAEMAK 143 (347) Q Consensus 66 G~~t~~~~~~g~~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~--~D~r~~~~~~~g~aLa~~~D~~il~~l~~ 143 (347) +.+.+.....|..++. .+++..++++..-++ +.-+.|.+-=..++. .++.+.+..+.+++|+++.|+.|+. T Consensus 184 ~~~~a~~v~E~~~~~~--~~~~f~~i~~~~~k~-~~~~~iS~ell~ds~~~~~l~~~i~~~l~~ai~~~~d~a~l~---- 256 (435) T protein:vir:14 184 GGAIVGYIGADTDIPT--TQQQFDDLKLTAKKM-AALVPIANDLIKYAGVNPNVDQIVVGDLTAAIGAREDKAFIR---- 256 (435) T ss_pred CCcceeeeccCccccc--cccceeEEEeeeEEE-EEeehhhHHHHHhhccCHHHHHHHHHHHHHHHHHHHHHHhhc---- Confidence 5455544445555543 345666666666554 333445431122233 3488889999999999999998863 Q ss_pred hhhcccccccccCcccCceee-eecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhh Q lcl|Aclame:pro 144 LCNLPAASNENIAGLGQAVVL-NIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPN 222 (347) Q Consensus 144 ~a~~a~~~~~~~~g~~~~~~i-~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~ 222 (347) +.. .+..+.|.-..... ...+... ......+++.+.++...+...+.-.....+|++|..|..|.+-.. . T Consensus 257 G~G----~~~~p~Gi~~~~~~~~~~~~~~----~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd-~ 327 (435) T protein:vir:14 257 DDG----TANTPKGLRFWALPSNVITASD----ASTLQKIETDLGKVILALENADANLTQPGWIMAPRTFRFLEGLRD-G 327 (435) T ss_pred cCC----CCccccceeecccccceecccc----ccchhhHHHHHHHHHHHhhhccccccCCEEEEcHHHHHHHHHhhc-c Confidence 110 00111111100000 0001111 112222344555555555555443334567999999998865332 3 Q ss_pred hhhccccccccccceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhh Q lcl|Aclame:pro 223 AANYAALIDPETGNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTV 302 (347) Q Consensus 223 ~~~~~~~~~~~~G~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv 302 (347) +..|.-. .... +.++|++|+.++.+|....... ....=+.+||+..+ ++.+ T Consensus 328 ~G~~l~~-~~~~---g~l~G~Pv~~~~~~p~~~~~~~-----------------~~~~i~~gd~s~~~--i~~~------ 378 (435) T protein:vir:14 328 NGNKVYP-ELAN---GMLKGYPVGKTTQVPINLGETG-----------------KESEIYFTDFGDVF--IGEE------ 378 (435) T ss_pred CCceecc-CCCC---CeeecceeEeeccccccccCCC-----------------ccceEEEeecccEE--EEEe------ Confidence 3333211 1222 3689999999999996421100 00112345665532 3322 Q ss_pred hhhheeeccccchh-----------hH--hhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 303 KLKDMALERARRPE-----------FQ--ADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 303 ~~~~~~~e~~~~~~-----------~~--~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) .+++++...+.. ++ .-.++..+.++.++.||++.+.|.-++-- T Consensus 379 --~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~~~ 434 (435) T protein:vir:14 379 --ETLEIDYSKEATYKDADGHMVSAFQRDQTLIRVIAKNDFGPRHVESIAVLAGVAWG 434 (435) T ss_pred --cccEEEEeccccccccccchhhhhhcChhheeeeeeeCceeecccceEEEecCCCC Confidence 233333332110 11 35678899999999999998888643332 No 132 >protein:vir:1084 Length: 437 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076738;genbank:gi:13095848;genbank:GeneID:920418 Probab=99.04 E-value=1.3e-11 Score=80.19 Aligned_cols=282 Identities=10% Similarity=-0.024 Sum_probs=145.0 Q ss_pred CCCCccC-ccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEecc--ccceeeeecCCC Q lcl|Aclame:pro 1 MANATGG-QQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVM--GRTKGYYLAPGE 77 (347) Q Consensus 1 m~~~~~~-~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~i--G~~t~~~~~~g~ 77 (347) +...... .....|..+....++.-.+--+.+...+... ...+.++.+.++.....+ +..+|.. +........-+. T Consensus 141 ~~~~~~~~~~~e~~~~~~~~~~~~g~lvp~~~~~~i~~~-~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~e~~ 218 (437) T protein:vir:10 141 VTAFADYLKTGEVRDVTGIALKDGKVIIPETILTPEKEV-HQFPRLGSLVRTESVTTT-TGKLPIFNNSTDLLTAHTEYG 218 (437) T ss_pred hhhhHHHHHhhhhhhhhhcccccccccchHHHHHHHHHh-hhhhhhhhcceeEeeccC-ceeeEEeeccccccccccccc Confidence 0000000 0000000111111121123346777777654 344556666666655543 3455543 333344444444 Q ss_pred CCCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccCc Q lcl|Aclame:pro 78 NLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIAG 157 (347) Q Consensus 78 ~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~g 157 (347) ..+.. ..+..+++++.+.+. +.-..|.+-=...+.+|+.+.+.++.+++|++..|..|+.-. T Consensus 219 ~~~e~-~~~~~~~v~~~~~k~-~~~~~is~ell~ds~~~~~~~i~~~l~~~~~~~~~~~i~~g~---------------- 280 (437) T protein:vir:10 219 QTTKN-ATPVITPILWDLKTY-TGGYVFSQELISDSSYDWQAELQSRLIELRDNTDDSLIITAL---------------- 280 (437) T ss_pred ccccc-ccccceeeeeehhhe-eeehhhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhh---------------- Confidence 44321 224455555555443 233344332223456789999999999999999998876421 Q ss_pred ccCceeeeecccccccchhhHHHHHHHHHHHHH-HHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhccccccccccc Q lcl|Aclame:pro 158 LGQAVVLNIGAAADLVDVEARGKAILKGLTLAR-ARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALIDPETGN 236 (347) Q Consensus 158 ~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~-~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~ 236 (347) +.+.. .++++.. ++.|.++. ..|+....+ +-.+|++|..|..|.+-. -.++.|.-..++..|. T Consensus 281 -g~~~~--~~~~~~~----------~~~~~~~~~~~l~~~~~~--~~~~~~~~~~~~~l~~lk-d~~g~~~~~~~~~~~~ 344 (437) T protein:vir:10 281 -TDGIK--KTTSTYL----------LGDLKKVLNVTLKPQDSA--AASIVMSQSAYNLFDMAT-DAMGRPLLQPNVTAAT 344 (437) T ss_pred -ccccc--ccccccc----------hhhHHHHHHhhhhhhhhc--CCEEEEcHHHHHHHHHhh-ccCCCeeeccCccCCC Confidence 01100 0111110 22333332 245554432 335699999999986532 2344554444566677 Q ss_pred eEEEeceeEEEeccc--cccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeeccccc Q lcl|Aclame:pro 237 IRNVMGFEVIEVPHL--TVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARR 314 (347) Q Consensus 237 v~~i~G~~V~~sn~l--p~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~~ 314 (347) ..+++|.+|+.+++. |....+ ...-+.+||++.+. +|- .++++++...+ T Consensus 345 ~~~l~G~pv~~~~~~~~~~~~~~--------------------~~~~~~gd~~~~~~-~~~--------r~~~~~~~~~~ 395 (437) T protein:vir:10 345 GYTLLGKTVVIVDDKLFPSASAG--------------------DVNIVVAPLKKAVI-NFK--------LTEITGQFQDT 395 (437) T ss_pred CcccccceeEEecccccCCcCCC--------------------ceEEEEeeccccEE-EEe--------eeceEEEEecc Confidence 788999999998764 321111 11124566665432 222 23344443333 Q ss_pred hhhHhhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 315 PEFQADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 315 ~~~~~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) -..+...+++.+.|+.++++|++.+.|....+| T Consensus 396 ~~~~~~~~~~~~r~d~~~~~~~a~~~l~~~~~~ 428 (437) T protein:vir:10 396 YDIWYKQLGIFLRQNVVQASKDLIVNLTGKLKA 428 (437) T ss_pred cccccceeeEEEEEccEEecccceEEEEeeccc Confidence 334455677778899999999999988754444 No 133 >protein:vir:3845 Length: 395 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050151;swissprot:trembl:q9t1f6;genbank:gi:9633043;uniprot:Q9T1F6;genbank:GeneID:1262163 Probab=99.02 E-value=2.5e-11 Score=78.76 Aligned_cols=272 Identities=9% Similarity=-0.005 Sum_probs=154.7 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccC-CceEEEeccccc--eeeeecCCC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQN-GKSASFPVMGRT--KGYYLAPGE 77 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~-G~tv~i~~iG~~--t~~~~~~g~ 77 (347) |+.... ..++--.+.-+.|..++++..+..+.++.+.++.++.+ ...+.++..... .+.....|. T Consensus 105 ~~~~~~------------~~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~ 172 (395) T protein:vir:38 105 VTSGTT------------GTGNAGLTIPEDIQLQIRTLTRSFTSLESLANVENVTTSHGSRVYEKLADITPLKDLDDESA 172 (395) T ss_pred HhhccC------------ccCCCceecchhHhhHHHHHHHhhcchhhhcceeeccCCcceEEEEeeccCCcccccccccc Confidence 222110 01111124458899999999999999999888776642 223444444332 222333455 Q ss_pred CCCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccCc Q lcl|Aclame:pro 78 NLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIAG 157 (347) Q Consensus 78 ~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~g 157 (347) .++.. ..++..++++...+.- .-..|.+-=...+.+|+.+.+.++.+++|++..|+.|+.-. T Consensus 173 ~~~~~-~~~~f~~v~~~~~k~~-~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~il~g~---------------- 234 (395) T protein:vir:38 173 LIGDN-DDPELTVVKYLIHRYA-GITTVTNTLLKDTVDNIIQWLVNWAAKKDVVTRNAKILEVM---------------- 234 (395) T ss_pred ccccc-cccceeeEEeeeeeeE-eehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcc---------------- Confidence 54322 1245566666555542 22344432223356889999999999999999999887411 Q ss_pred ccCceeeeecccccccchhhHHHHHHHHHHHHH-HHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhccccccccccc Q lcl|Aclame:pro 158 LGQAVVLNIGAAADLVDVEARGKAILKGLTLAR-ARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALIDPETGN 236 (347) Q Consensus 158 ~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~-~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~ 236 (347) +.+.... +. .. ++.|.++. ..|+...- ..-.++++|..|..|.+-. -.+..|.-..++..|. T Consensus 235 -g~~~~~~--~~-------~~----~~~i~~~~~~~l~~~~~--~~a~~v~n~~~~~~L~~lk-d~~G~~l~~~~~~~~~ 297 (395) T protein:vir:38 235 -GKAPKKP--TI-------SQ----FDNIKDLENNTLDPAIE--STSSFITNQSGYNILSKVK-DADGRYLMQPDVTSPD 297 (395) T ss_pred -ccccccc--cc-------cc----HHHHHHHHHHhhhhhhc--CCCEEEEcHHHHHHHHHhh-ccCCceeeccCcCCCC Confidence 1111000 00 01 23344332 23433332 2346789999999987632 2244454444566777 Q ss_pred eEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeeccccchh Q lcl|Aclame:pro 237 IRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARRPE 316 (347) Q Consensus 237 v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~~~~ 316 (347) ...++|++|+.+.+.+...... ...-|.+||++.. ..+..++++++..+... T Consensus 298 ~~~l~G~pV~~~~~~~~~~~~~-------------------~~~i~~gd~~~~~---------~i~~~~~~~i~~~~~~~ 349 (395) T protein:vir:38 298 KYLIDGKPVIRIADKWLPDVSG-------------------SHPLYFGDLKQGI---------TLFDRQQMQIDTTNVGA 349 (395) T ss_pred cceeccceeEEecccccCcCCC-------------------cceEEEEeccccE---------EEEEecceEEEEecccc Confidence 7899999999998765422110 0111344554422 12333445555554322 Q ss_pred ----hHhhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 317 ----FQADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 317 ----~~~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) +-...++....++.++++|++.+.+..++++ T Consensus 350 ~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~ 384 (395) T protein:vir:38 350 GSFEHDTTKLRFIDRFDVQLIDDGAFAAASFKTVA 384 (395) T ss_pred chhhcCceEEEEEEeeccEEecccceEEEEeeccc Confidence 2234577788899999999999999988888 No 134 >protein:vir:107593 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338188;genbank:gi:77020144;genbank:GeneID:3703724 Probab=99.01 E-value=5.2e-11 Score=76.95 Aligned_cols=285 Identities=11% Similarity=0.032 Sum_probs=156.0 Q ss_pred CCCCcc--------CccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCc-eEEEec-ccccee Q lcl|Aclame:pro 1 MANATG--------GQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGK-SASFPV-MGRTKG 70 (347) Q Consensus 1 m~~~~~--------~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~-tv~i~~-iG~~t~ 70 (347) |.+... ....-.+.......++--.+.-+.|.+++.+.-+..+.++++.++..+.++. ...++. .+.+.+ T Consensus 84 l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~~a 163 (392) T protein:vir:10 84 LRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPF 163 (392) T ss_pred HhcccccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCccc Confidence 110000 0000000000000011111345889999999999999999998888776432 334443 444455 Q ss_pred eeecCCCCCCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccc Q lcl|Aclame:pro 71 YYLAPGENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAA 150 (347) Q Consensus 71 ~~~~~g~~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~ 150 (347) ..+..|..++.+ ..++.+++++..-+. +.-..|.+-=-.++.+|+.+.+.++.++++++..|..|+.-.. T Consensus 164 ~~v~E~~~~~~~-~~~~~~~v~l~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g-------- 233 (392) T protein:vir:10 164 AEITEMGEIPET-DNPKFSNVQYAVKDR-AGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIE-------- 233 (392) T ss_pred eeeccccccccc-ccccceeEEeeeeeE-EEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccc-------- Confidence 556556665422 235667777766665 4444555432334568999999999999999999998864210 Q ss_pred cccccCcccCceeeeecccccccchhhHHHHHHHHHHHHH-HHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhcccc Q lcl|Aclame:pro 151 SNENIAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLAR-ARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAAL 229 (347) Q Consensus 151 ~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~-~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~ 229 (347) .+. ..... . ++.|.++. ..|+....+ +-.+|++|..|..|.+-. -.+..|.-. T Consensus 234 ---------~~~------~~~~~----~----~d~i~~~~~~~l~~~~~~--~a~~vm~~~~~~~L~~lk-d~~G~~l~~ 287 (392) T protein:vir:10 234 ---------KLT------KQAIK----S----LDDIKDVLNVKLDPAISP--NAILLTNQDGFNYLDKLK-DKDGKYILQ 287 (392) T ss_pred ---------ccc------ccCcc----C----HHHHHHHHHHhhhhhhcc--CCEEEEcHHHHHHHHHhh-ccCCCeEee Confidence 000 00000 1 34455543 355555433 345689999999996532 234445433 Q ss_pred ccccccceEEEeceeEEEe--ccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhhe Q lcl|Aclame:pro 230 IDPETGNIRNVMGFEVIEV--PHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDM 307 (347) Q Consensus 230 ~~~~~G~v~~i~G~~V~~s--n~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~ 307 (347) .++..|..++++|.+++.. +.+|..... . ++...-+.+||++.+- .+....+ T Consensus 288 ~~~~~~~~~tllG~~~v~~~~~~~~~~~~~----------~-------~~~~~~~~gdfs~~~~---------i~~~~~~ 341 (392) T protein:vir:10 288 SDPTQKNKKLFAGTNPVVVVSNRFLKSKGT----------T-------AKKAPLIIGDLKEAIV---------LFKREDM 341 (392) T ss_pred cCccCCccccccCcccEEEecccccCCCcc----------c-------CCceEEEEEehhceEE---------EEeecce Confidence 4556677778999876652 333321100 0 0011112344443221 2333444 Q ss_pred eecccc--chhhHhh--HHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 308 ALERAR--RPEFQAD--QIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 308 ~~e~~~--~~~~~~d--~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) +++... +..+.-+ .+++...+|.++++|++.+.+..+++| T Consensus 342 ~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~a 385 (392) T protein:vir:10 342 ELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSA 385 (392) T ss_pred EEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEecccc Confidence 444332 2223333 377888899999999999999988877 No 135 >protein:vir:105004 Length: 392 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459969;genbank:gi:85701384;genbank:GeneID:3882145 Probab=99.01 E-value=5.2e-11 Score=76.95 Aligned_cols=285 Identities=11% Similarity=0.032 Sum_probs=156.0 Q ss_pred CCCCcc--------CccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCc-eEEEec-ccccee Q lcl|Aclame:pro 1 MANATG--------GQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGK-SASFPV-MGRTKG 70 (347) Q Consensus 1 m~~~~~--------~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~-tv~i~~-iG~~t~ 70 (347) |.+... ....-.+.......++--.+.-+.|.+++.+.-+..+.++++.++..+.++. ...++. .+.+.+ T Consensus 84 l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~~a 163 (392) T protein:vir:10 84 LRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPF 163 (392) T ss_pred HhcccccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCccc Confidence 110000 0000000000000011111345889999999999999999998888776432 334443 444455 Q ss_pred eeecCCCCCCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccc Q lcl|Aclame:pro 71 YYLAPGENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAA 150 (347) Q Consensus 71 ~~~~~g~~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~ 150 (347) ..+..|..++.+ ..++.+++++..-+. +.-..|.+-=-.++.+|+.+.+.++.++++++..|..|+.-.. T Consensus 164 ~~v~E~~~~~~~-~~~~~~~v~l~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g-------- 233 (392) T protein:vir:10 164 AEITEMGEIPET-DNPKFSNVQYAVKDR-AGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIE-------- 233 (392) T ss_pred eeeccccccccc-ccccceeEEeeeeeE-EEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccc-------- Confidence 556556665422 235667777766665 4444555432334568999999999999999999998864210 Q ss_pred cccccCcccCceeeeecccccccchhhHHHHHHHHHHHHH-HHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhcccc Q lcl|Aclame:pro 151 SNENIAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLAR-ARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAAL 229 (347) Q Consensus 151 ~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~-~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~ 229 (347) .+. ..... . ++.|.++. ..|+....+ +-.+|++|..|..|.+-. -.+..|.-. T Consensus 234 ---------~~~------~~~~~----~----~d~i~~~~~~~l~~~~~~--~a~~vm~~~~~~~L~~lk-d~~G~~l~~ 287 (392) T protein:vir:10 234 ---------KLT------KQAIK----S----LDDIKDVLNVKLDPAISP--NAILLTNQDGFNYLDKLK-DKDGKYILQ 287 (392) T ss_pred ---------ccc------ccCcc----C----HHHHHHHHHHhhhhhhcc--CCEEEEcHHHHHHHHHhh-ccCCCeEee Confidence 000 00000 1 34455543 355555433 345689999999996532 234445433 Q ss_pred ccccccceEEEeceeEEEe--ccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhhe Q lcl|Aclame:pro 230 IDPETGNIRNVMGFEVIEV--PHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDM 307 (347) Q Consensus 230 ~~~~~G~v~~i~G~~V~~s--n~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~ 307 (347) .++..|..++++|.+++.. +.+|..... . ++...-+.+||++.+- .+....+ T Consensus 288 ~~~~~~~~~tllG~~~v~~~~~~~~~~~~~----------~-------~~~~~~~~gdfs~~~~---------i~~~~~~ 341 (392) T protein:vir:10 288 SDPTQKNKKLFAGTNPVVVVSNRFLKSKGT----------T-------AKKAPLIIGDLKEAIV---------LFKREDM 341 (392) T ss_pred cCccCCccccccCcccEEEecccccCCCcc----------c-------CCceEEEEEehhceEE---------EEeecce Confidence 4556677778999876652 333321100 0 0011112344443221 2333444 Q ss_pred eecccc--chhhHhh--HHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 308 ALERAR--RPEFQAD--QIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 308 ~~e~~~--~~~~~~d--~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) +++... +..+.-+ .+++...+|.++++|++.+.+..+++| T Consensus 342 ~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~a 385 (392) T protein:vir:10 342 ELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSA 385 (392) T ss_pred EEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEecccc Confidence 444332 2223333 377888899999999999999988877 No 136 >protein:vir:102873 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338137;genbank:gi:77020198;genbank:GeneID:3703782 Probab=99.01 E-value=5.2e-11 Score=76.95 Aligned_cols=285 Identities=11% Similarity=0.032 Sum_probs=156.0 Q ss_pred CCCCcc--------CccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCc-eEEEec-ccccee Q lcl|Aclame:pro 1 MANATG--------GQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGK-SASFPV-MGRTKG 70 (347) Q Consensus 1 m~~~~~--------~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~-tv~i~~-iG~~t~ 70 (347) |.+... ....-.+.......++--.+.-+.|.+++.+.-+..+.++++.++..+.++. ...++. .+.+.+ T Consensus 84 l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~~a 163 (392) T protein:vir:10 84 LRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPF 163 (392) T ss_pred HhcccccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCccc Confidence 110000 0000000000000011111345889999999999999999998888776432 334443 444455 Q ss_pred eeecCCCCCCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccc Q lcl|Aclame:pro 71 YYLAPGENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAA 150 (347) Q Consensus 71 ~~~~~g~~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~ 150 (347) ..+..|..++.+ ..++.+++++..-+. +.-..|.+-=-.++.+|+.+.+.++.++++++..|..|+.-.. T Consensus 164 ~~v~E~~~~~~~-~~~~~~~v~l~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g-------- 233 (392) T protein:vir:10 164 AEITEMGEIPET-DNPKFSNVQYAVKDR-AGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIE-------- 233 (392) T ss_pred eeeccccccccc-ccccceeEEeeeeeE-EEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccc-------- Confidence 556556665422 235667777766665 4444555432334568999999999999999999998864210 Q ss_pred cccccCcccCceeeeecccccccchhhHHHHHHHHHHHHH-HHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhcccc Q lcl|Aclame:pro 151 SNENIAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLAR-ARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAAL 229 (347) Q Consensus 151 ~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~-~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~ 229 (347) .+. ..... . ++.|.++. ..|+....+ +-.+|++|..|..|.+-. -.+..|.-. T Consensus 234 ---------~~~------~~~~~----~----~d~i~~~~~~~l~~~~~~--~a~~vm~~~~~~~L~~lk-d~~G~~l~~ 287 (392) T protein:vir:10 234 ---------KLT------KQAIK----S----LDDIKDVLNVKLDPAISP--NAILLTNQDGFNYLDKLK-DKDGKYILQ 287 (392) T ss_pred ---------ccc------ccCcc----C----HHHHHHHHHHhhhhhhcc--CCEEEEcHHHHHHHHHhh-ccCCCeEee Confidence 000 00000 1 34455543 355555433 345689999999996532 234445433 Q ss_pred ccccccceEEEeceeEEEe--ccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhhe Q lcl|Aclame:pro 230 IDPETGNIRNVMGFEVIEV--PHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDM 307 (347) Q Consensus 230 ~~~~~G~v~~i~G~~V~~s--n~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~ 307 (347) .++..|..++++|.+++.. +.+|..... . ++...-+.+||++.+- .+....+ T Consensus 288 ~~~~~~~~~tllG~~~v~~~~~~~~~~~~~----------~-------~~~~~~~~gdfs~~~~---------i~~~~~~ 341 (392) T protein:vir:10 288 SDPTQKNKKLFAGTNPVVVVSNRFLKSKGT----------T-------AKKAPLIIGDLKEAIV---------LFKREDM 341 (392) T ss_pred cCccCCccccccCcccEEEecccccCCCcc----------c-------CCceEEEEEehhceEE---------EEeecce Confidence 4556677778999876652 333321100 0 0011112344443221 2333444 Q ss_pred eecccc--chhhHhh--HHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 308 ALERAR--RPEFQAD--QIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 308 ~~e~~~--~~~~~~d--~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) +++... +..+.-+ .+++...+|.++++|++.+.+..+++| T Consensus 342 ~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~a 385 (392) T protein:vir:10 342 ELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSA 385 (392) T ss_pred EEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEecccc Confidence 444332 2223333 377888899999999999999988877 No 137 >protein:vir:102082 Length: 392 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512315;genbank:gi:89152484;genbank:GeneID:3953075 Probab=99.01 E-value=5.2e-11 Score=76.95 Aligned_cols=285 Identities=11% Similarity=0.032 Sum_probs=156.0 Q ss_pred CCCCcc--------CccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCc-eEEEec-ccccee Q lcl|Aclame:pro 1 MANATG--------GQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGK-SASFPV-MGRTKG 70 (347) Q Consensus 1 m~~~~~--------~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~-tv~i~~-iG~~t~ 70 (347) |.+... ....-.+.......++--.+.-+.|.+++.+.-+..+.++++.++..+.++. ...++. .+.+.+ T Consensus 84 l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~~a 163 (392) T protein:vir:10 84 LRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPF 163 (392) T ss_pred HhcccccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCccc Confidence 110000 0000000000000011111345889999999999999999998888776432 334443 444455 Q ss_pred eeecCCCCCCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccc Q lcl|Aclame:pro 71 YYLAPGENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAA 150 (347) Q Consensus 71 ~~~~~g~~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~ 150 (347) ..+..|..++.+ ..++.+++++..-+. +.-..|.+-=-.++.+|+.+.+.++.++++++..|..|+.-.. T Consensus 164 ~~v~E~~~~~~~-~~~~~~~v~l~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g-------- 233 (392) T protein:vir:10 164 AEITEMGEIPET-DNPKFSNVQYAVKDR-AGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIE-------- 233 (392) T ss_pred eeeccccccccc-ccccceeEEeeeeeE-EEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccc-------- Confidence 556556665422 235667777766665 4444555432334568999999999999999999998864210 Q ss_pred cccccCcccCceeeeecccccccchhhHHHHHHHHHHHHH-HHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhcccc Q lcl|Aclame:pro 151 SNENIAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLAR-ARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAAL 229 (347) Q Consensus 151 ~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~-~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~ 229 (347) .+. ..... . ++.|.++. ..|+....+ +-.+|++|..|..|.+-. -.+..|.-. T Consensus 234 ---------~~~------~~~~~----~----~d~i~~~~~~~l~~~~~~--~a~~vm~~~~~~~L~~lk-d~~G~~l~~ 287 (392) T protein:vir:10 234 ---------KLT------KQAIK----S----LDDIKDVLNVKLDPAISP--NAILLTNQDGFNYLDKLK-DKDGKYILQ 287 (392) T ss_pred ---------ccc------ccCcc----C----HHHHHHHHHHhhhhhhcc--CCEEEEcHHHHHHHHHhh-ccCCCeEee Confidence 000 00000 1 34455543 355555433 345689999999996532 234445433 Q ss_pred ccccccceEEEeceeEEEe--ccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhhe Q lcl|Aclame:pro 230 IDPETGNIRNVMGFEVIEV--PHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDM 307 (347) Q Consensus 230 ~~~~~G~v~~i~G~~V~~s--n~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~ 307 (347) .++..|..++++|.+++.. +.+|..... . ++...-+.+||++.+- .+....+ T Consensus 288 ~~~~~~~~~tllG~~~v~~~~~~~~~~~~~----------~-------~~~~~~~~gdfs~~~~---------i~~~~~~ 341 (392) T protein:vir:10 288 SDPTQKNKKLFAGTNPVVVVSNRFLKSKGT----------T-------AKKAPLIIGDLKEAIV---------LFKREDM 341 (392) T ss_pred cCccCCccccccCcccEEEecccccCCCcc----------c-------CCceEEEEEehhceEE---------EEeecce Confidence 4556677778999876652 333321100 0 0011112344443221 2333444 Q ss_pred eecccc--chhhHhh--HHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 308 ALERAR--RPEFQAD--QIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 308 ~~e~~~--~~~~~~d--~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) +++... +..+.-+ .+++...+|.++++|++.+.+..+++| T Consensus 342 ~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~a 385 (392) T protein:vir:10 342 ELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSA 385 (392) T ss_pred EEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEecccc Confidence 444332 2223333 377888899999999999999988877 No 138 >protein:vir:80376 Length: 435 # NCBI annotation: gp6, major capsid head protein # Family: family:all:21 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111085;genbank:gi:134288639;genbank:GeneID:4960624 Probab=99.01 E-value=1.1e-10 Score=75.15 Aligned_cols=299 Identities=15% Similarity=0.105 Sum_probs=154.5 Q ss_pred CCCCccC-cc---------c-cccCc--ccCccccHHHHHHHHHhHHHHHHHHHHHhhhcc-cccccccCCceEEEecc- Q lcl|Aclame:pro 1 MANATGG-QQ---------I-GANQG--KGQSAADKLALFLKVFGGEVLTAFVRRSVTMDK-HMVRTIQNGKSASFPVM- 65 (347) Q Consensus 1 m~~~~~~-~~---------~-~~~~~--~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~-~~~rti~~G~tv~i~~i- 65 (347) |.....- .. . ..+.. .....++--.+.-+.+..+|.+..+..+.++.+ .++.+...| .+.+|+. T Consensus 105 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~lvP~~~~~~ii~~l~~~~~i~~~~~~~v~~~~~-~~~~p~~~ 183 (435) T protein:vir:80 105 LAAARGDAQLASKLAIERGFGEEVAMSLNTLSPGAGGVLVPENLSSEVIELLRPKSVVRKLGARTLPLSNG-NITIPRLK 183 (435) T ss_pred HHhccchhHHHHHHHHhhhhhhhhhhhhcccCCCCCccccchhHHHHHHHHHhhhchhhhccceeeecCCC-ceEEEEEe Confidence 1100000 00 0 00000 000011111134478888898888887877765 333344444 4777766 Q ss_pred ccceeeeecCCCCCCCCCCCCCCCceEEEEeeeeecchhhcc--HHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 66 GRTKGYYLAPGENLDDKRKDIKHSEKVIQIDGLLTSDVLIYD--IEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAK 143 (347) Q Consensus 66 G~~t~~~~~~g~~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd--~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~ 143 (347) |.+.+.-..-|..++. .+++.+++++...++ +..+.|.+ ++.....+++.+.+.++.+++|+++.|+.++. T Consensus 184 ~~~~a~~v~E~~~~~~--~~~~f~~i~~~~~k~-~~~~~is~ell~ds~~~~~l~~~i~~~l~~a~~~~~d~a~l~---- 256 (435) T protein:vir:80 184 GGAIVGYIGADTDIPT--TQQQFDDLKLTAKKM-AALVPIANDLIKYAGVNPNVDQIVVGDLTAAIGAREDKAFIR---- 256 (435) T ss_pred CCcceeeeccCccccc--cccceeeEEEeeEEE-EEeehhhHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHHhhc---- Confidence 5555554555665543 346677777766665 33444532 22223356788999999999999999998863 Q ss_pred hhhcccccccccCcccCceee-eecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhh Q lcl|Aclame:pro 144 LCNLPAASNENIAGLGQAVVL-NIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPN 222 (347) Q Consensus 144 ~a~~a~~~~~~~~g~~~~~~i-~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~ 222 (347) +... +..+.|....... +.... ........+...+.++...|...+.....-.+|++|..|..|.+-. -. T Consensus 257 G~G~----~~~p~Gi~~~~~~~~~~~~----~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lk-d~ 327 (435) T protein:vir:80 257 DDGT----ANTPKGLRFWALPGNVITA----SDGSTLQKIETDLGKAILALENADANLTQPGWIMAPRTFRFLEGLR-DG 327 (435) T ss_pred cCCC----CCcccceeecccccceeec----ccccchhhHHHHHHHHHHHhhccccccccCEEEEcHHHHHHHHhhh-cc Confidence 1100 0111111100000 00001 1111222234456666666666665444455689999998885532 22 Q ss_pred hhhccccccccccceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhh Q lcl|Aclame:pro 223 AANYAALIDPETGNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTV 302 (347) Q Consensus 223 ~~~~~~~~~~~~G~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv 302 (347) ++.|.-. .... ++++|++|+.++++|...... . ..+.-|.+||+..+ ++ T Consensus 328 ~G~~l~~-~~~~---~~l~G~pv~~~~~~p~~~~~~--------~---------~~~~i~~gd~s~~~--i~-------- 376 (435) T protein:vir:80 328 NGNKVYP-ELAN---GMLKGYPVGKTTQVPINLGEA--------G---------KESEIYFTDFGDVF--IG-------- 376 (435) T ss_pred CCceecc-CCCC---CeEeeeeeEEeccccccccCC--------C---------CcceEEEEEcccEE--EE-------- Confidence 3333211 1122 368999999999999532110 0 01112456666532 22 Q ss_pred hhhheeeccccchh-----------hH--hhHHhhhhhhcCcccccceEEEEEecCC-C Q lcl|Aclame:pro 303 KLKDMALERARRPE-----------FQ--ADQIIGKYAMGHGGLRPEAAGALVFTPA-A 347 (347) Q Consensus 303 ~~~~~~~e~~~~~~-----------~~--~d~i~~~~~~G~~~lRPe~~~~l~~~~a-a 347 (347) ...+++++...+.. ++ .-.++....|+.++.||++.+.|.-..= | T Consensus 377 ~~~~~~i~~~~~~~~~~~~~~~~~~f~~n~~~~r~~~r~d~~~~~~~a~~~l~~~~~~~ 435 (435) T protein:vir:80 377 EEETLEIDYSKEATYKDADGHMVSAFQRDQTLIRVIAKNDFGPRHVESIAVLSGVAWGA 435 (435) T ss_pred eecceEEEEeccccccccccchhhhhhcCcceeeeeeeeCcEeecccceEEEeccCCCC Confidence 22344444433221 11 3467888999999999999999974332 3 No 139 >protein:vir:1583 Length: 351 # NCBI annotation: minor capsid protein # Family: family:all:1522 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695165;swissprot:trembl:o03966;genbank:gi:23455804;uniprot:O03966;genbank:GeneID:955561 Probab=98.99 E-value=2.4e-11 Score=78.84 Aligned_cols=281 Identities=12% Similarity=0.083 Sum_probs=160.9 Q ss_pred CCCCccCccccccCcccCccccHHHHHH-HHHhHHHHHHHHHHHhhhc---ccccccc-----cCCceEEEeccccc--e Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFL-KVFGGEVLTAFVRRSVTMD---KHMVRTI-----QNGKSASFPVMGRT--K 69 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~i-e~f~geV~~~f~~~s~~~~---~~~~rti-----~~G~tv~i~~iG~~--t 69 (347) ||. |+. +| +++ |+|...|.+.+.+.+.|.. +++...+ .+|+++.||..+.. . T Consensus 1 MA~--------T~l------sd---~i~PEvf~~yv~~~~~~~~~l~qSG~i~~~~~l~~~~~~~G~~it~P~~~~l~Gd 63 (351) T protein:vir:15 1 MAE--------THL------SD---LIVPEVFGNYVVNQIIKTNRFVQSGILTPDPDLGPHLLEAGTRITVPFLNDLTGD 63 (351) T ss_pred CCc--------eee------ee---eechhHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhcCCCEEEecccccCCCc Confidence 774 222 22 455 9999999988888775533 2222122 36999999988764 5 Q ss_pred eeeecCCCCCCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccc Q lcl|Aclame:pro 70 GYYLAPGENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPA 149 (347) Q Consensus 70 ~~~~~~g~~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~ 149 (347) .+++..+..+. .+.+...+..-+|=. .-..+.+.|+...-+--|.+.++.+|.+...++..+..+|..+..+..... T Consensus 64 ~~~~~~~~~i~--~~kitt~~~~a~i~~-~~kg~~~tD~a~~~sg~dp~~~i~~q~a~~w~~~~q~~lla~l~gv~~~~~ 140 (351) T protein:vir:15 64 PDNWTDSDDID--VNNLTSGKQQGIKFY-QTKAYGYTDLGTMISGAPVQETIGNRFAAFWQRADQKTLLSVLKGVMGVTK 140 (351) T ss_pred ccccCCCcccc--hheecccceeEEEEe-eccceehhhhhHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhchh Confidence 67777777775 456887777777744 455688899988888889999999999999999999988876643322111 Q ss_pred ccccccCcccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhcccc Q lcl|Aclame:pro 150 ASNENIAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAAL 229 (347) Q Consensus 150 ~~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~ 229 (347) .. . ......... ...+.... ++.|.+|..+|.+..- ..-..+++.|..|..|.+..-+....+. . T Consensus 141 ~~----~----~~~~d~t~~-~~~~~~is----~~~l~~A~~~~GD~~~-~~~~~ivmhS~v~~~L~~~~li~~~~~s-~ 205 (351) T protein:vir:15 141 IA----N----SKVYDQTKV-SPSEPMFG----AKGFTGAIGLMGDLQD-TAFGAIAVNSATYSLMKVQGLIETIQPQ-N 205 (351) T ss_pred hc----c----cceeccccc-cccccccC----HHHHHHHHHHhccccc-cceEEEEEChHHHHHHHhhhhhhhcccc-c Confidence 10 0 111111111 11111111 3568888888866421 1136778899999999976422122221 1 Q ss_pred ccccccceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheee Q lcl|Aclame:pro 230 IDPETGNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMAL 309 (347) Q Consensus 230 ~~~~~G~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~ 309 (347) ..+.|+.+.|.+|+.+..+|....+.. ..+| ...+|-+-|++..+.. ..+ T Consensus 206 ---~~~~i~t~~G~~VivdD~~p~~~~~~~-------------------~~~y-------tsyl~~~GAi~~~~~~-~~v 255 (351) T protein:vir:15 206 ---GATPFEAYNGLRIVLDDDIEIDLTDKT-------------------KPVS-------TSYIFAPGAVRYSTNM-RST 255 (351) T ss_pred ---cCcccceecceEEEEcCCCccccCCCC-------------------Ccee-------EEEEEecceeeeecCC-cCc Confidence 245789999999999999996432110 0111 1244455555544433 345 Q ss_pred ccccchhhH--hhHHhhhh-----hhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 310 ERARRPEFQ--ADQIIGKY-----AMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 310 e~~~~~~~~--~d~i~~~~-----~~G~~~lRPe~~~~l~~~~aa 347 (347) |..||+... .|.+..+. .+|.+...+...... ..|+= T Consensus 256 e~~rd~~~~~g~d~l~~r~~~~~hp~G~s~~~~~~~~~~-~sPt~ 299 (351) T protein:vir:15 256 ETKYDPLINGGQDVIVQKRVGTIHVAGTSIKASFSPSKA-SFPTI 299 (351) T ss_pred ceeecccCCCCceEEEEeeeeeeeeeeeeecccccccCc-CCcCh Confidence 666666542 12222232 223322222111100 11111 No 140 >protein:vir:4092 Length: 390 # NCBI annotation: major capsid protein a # Family: family:all:635 # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510986;swissprot:trembl:q8w604;genbank:gi:17488508;uniprot:Q8W604;genbank:GeneID:1260361 Probab=98.98 E-value=3.5e-11 Score=77.89 Aligned_cols=287 Identities=10% Similarity=0.010 Sum_probs=151.0 Q ss_pred CCCCcc--CccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEecc-ccceeeeecCCC Q lcl|Aclame:pro 1 MANATG--GQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVM-GRTKGYYLAPGE 77 (347) Q Consensus 1 m~~~~~--~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~i-G~~t~~~~~~g~ 77 (347) +++-.- ... .+.. ++.++.-.+.-+.|..++.+..++.+.+++++++.++.+| ...||+. +...+.....+. T Consensus 72 l~~~~r~~~~~--~~~~--~~~~~gg~lvP~~~~~~I~~~~~~~s~i~~~~~~~~~~~~-~~~i~~~~~~~~a~~~~E~~ 146 (390) T protein:vir:40 72 LTSDESKYYNE--VIAG--NGFAGVTALLPPTVFERVFEDLTVEHPLLSKINFVNTTAT-TEWIISVGDVATAWWGPLCA 146 (390) T ss_pred ccHHHHHHHHH--HHhc--cCcccCcccccHHHHHHHHHHHHhhhhhhhhceeeecCCc-eeEEEEEcCCcceeeecccc Confidence 000000 000 0000 1111212245599999999999999999999888776654 4666654 444544444444 Q ss_pred CCCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccCc Q lcl|Aclame:pro 78 NLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIAG 157 (347) Q Consensus 78 ~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~g 157 (347) .+..+ .+++.+++++..-++ +.-..|.+-=...+.+|+-+.+.++.++++++..|+.|+. +.. .+ T Consensus 147 ~~~~~-~~~~f~~i~l~~~k~-~~~i~iS~ell~ds~~~l~~~i~~~la~~i~~~~~~a~l~----G~G---------~~ 211 (390) T protein:vir:40 147 EIKEV-LDNGFDKIQTGMYKL-SAYIPVCNAMLDLGPSWLDQYVRTILGEAMALGLEAGIVN----GSG---------KD 211 (390) T ss_pred ccCcc-ccccceeeEeeeeeE-EEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhhc----ccC---------CC Confidence 44322 245667777777665 3445555444445777899999999999999999998873 110 01 Q ss_pred ccCceeee-----ecccccccchhhHHHHHHHHHHHHHHHHhhccCCC-CCCEEEEChHHHHHHhcchhhhhhhcccccc Q lcl|Aclame:pro 158 LGQAVVLN-----IGAAADLVDVEARGKAILKGLTLARARLTKNYVPA-GDRRFYCAPEDYSAILSALMPNAANYAALID 231 (347) Q Consensus 158 ~~~~~~i~-----~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~-~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~ 231 (347) .+.|..-. .+.....+........+.+.+..+...+....-+. ..-+++++|..+..+|+..+.. .+ .++. T Consensus 212 ~P~Gil~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~~~~~a~~i~n~~t~~~~l~~~~~~-~d--~~G~ 288 (390) T protein:vir:40 212 QPIGMMRDLNNVTAGEHPVKTATPLTDLTPATLATKVMLPLTDNGKKSVSDAILVINPADYWSKIYAATSY-MT--PQGV 288 (390) T ss_pred ccceeeeccccccccccccccccccchhhHHHHHHHHHHHhhcchhhhhcCceEEEcchhHHHHHHHHhhc-cC--CCCc Confidence 11111100 00000000000111111222222333333322111 2345678887766555432221 11 1222 Q ss_pred ccccceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeecc Q lcl|Aclame:pro 232 PETGNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALER 311 (347) Q Consensus 232 ~~~G~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~ 311 (347) +.++. ...|.+|+.++++|... -..+||+.. +++ ..++++++. T Consensus 289 ~v~~~--~~~g~pvv~~~~~p~~~-------------------------i~~Gd~s~~--~i~--------~~~~~~v~~ 331 (390) T protein:vir:40 289 WVTGI--LPVPLEIVQSVAVPVGK-------------------------AVAGRAKDY--FMG--------IGSEQVIRT 331 (390) T ss_pred ccccc--CCCceeEEEcCCCCCCc-------------------------EEEEeeceE--EEE--------eecceEEEe Confidence 22222 24699999999998421 012455542 222 233445554 Q ss_pred ccch--hhHhhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 312 ARRP--EFQADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 312 ~~~~--~~~~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) ..+. .+-...+++.+.++.++++|++.+.|..++++ T Consensus 332 ~~~~~f~~~~~~~r~~~r~dg~v~~~~A~~~l~~~~~~ 369 (390) T protein:vir:40 332 STEYRLLDDETLYYAKQYANGRPKDNSSFLVFDITGLE 369 (390) T ss_pred cchhhhhcCcEEEEEEEEeCCEEecccceEEEEeeccC Confidence 3222 12234578999999999999999999999887 No 141 >protein:vir:962 Length: 397 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076616;genbank:gi:13095724;genbank:GeneID:920264 Probab=98.97 E-value=1.4e-11 Score=80.12 Aligned_cols=276 Identities=13% Similarity=0.052 Sum_probs=147.5 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCc-eEEEeccccceeeeecCCCCC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGK-SASFPVMGRTKGYYLAPGENL 79 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~-tv~i~~iG~~t~~~~~~g~~~ 79 (347) +..... ...+.-.+....+...+-.+.+..++...- ....+....++.++..++ .+.++..+...+.....+... T Consensus 121 ~~~~~~---~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~-~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~ 196 (397) T protein:vir:96 121 NAFVKS---KGAEKRDGFTSVEGGALIPQELLQPQLEPK-DIVDLSKYVRSVPVNSASGKFPVISKSGSKMATVQQLEKN 196 (397) T ss_pred HHHHHh---hhhhhhhcccccccccchhHHHHHHHHHhh-hhhhHHHhhhhccccccceeEEEEeccCCccccccccccc Confidence 000000 000000111122222345577888887643 333445555555544322 234444444444444444444 Q ss_pred CCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccCccc Q lcl|Aclame:pro 80 DDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIAGLG 159 (347) Q Consensus 80 ~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~g~~ 159 (347) +.. .++...++++.+.+. +.-..|.+--..++.+|+.+.+.++.++++++..|..|+.-. | T Consensus 197 ~~~-~~~~~~~i~~~~~~~-~~~~~~s~ell~ds~~~l~~~i~~~l~~~~~~~~~~~i~~g~---------------g-- 257 (397) T protein:vir:96 197 PQL-ANPKMVEIDYSVATR-RGYIPISQEMIDDASYDVTGLIADEIQDQSLNTKNADIAAVL---------------K-- 257 (397) T ss_pred ccc-ccccccceeecHhHh-hcchhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcc---------------c-- Confidence 321 235667777776554 333344332233456789999999999999999998776311 0 Q ss_pred CceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhccccccccccceEE Q lcl|Aclame:pro 160 QAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALIDPETGNIRN 239 (347) Q Consensus 160 ~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~v~~ 239 (347) .+ .+.... . ++.|.++....... .-+-.+|++|..|..|.+-. -.++.|.-..++..|..++ T Consensus 258 ~~------~~~~~~----~----~d~~~~~~~~~~~~---~~~a~~v~n~~~~~~l~~lk-d~~G~~~~~~~~~~~~~~~ 319 (397) T protein:vir:96 258 TA------TAKSVV----G----VDGLKDLINKEIKK---VYDVKLFISASMYSELDKLK-DKNGRYLLQDSITAASGKQ 319 (397) T ss_pred cc------cccccc----c----hHHHHHHHHHhhhh---hcCcEEEEcHHHHHHHHHhh-ccCCCeEeccCccCCCccc Confidence 00 000000 1 34444443322221 12345799999999987632 2344555444566777789 Q ss_pred EeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeeccccchhhHh Q lcl|Aclame:pro 240 VMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARRPEFQA 319 (347) Q Consensus 240 i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~~~~~~~ 319 (347) ++|.+|+.+++.+..... +...-+.+||++.+. ++ ..++++++... ..++. T Consensus 320 l~G~pv~~~~~~~~~~~~-------------------~~~~~~~gd~~~~~~-~~--------~~~~~~~~~~~-~~~~~ 370 (397) T protein:vir:96 320 LLGKEVVVLDDDVIGKSV-------------------GNVVGFIGDAKAFAS-FF--------DRKQVSVSWVD-NNIYG 370 (397) T ss_pred ccccceEEecccccCCCC-------------------CceEEEEeehhcceE-eE--------eecceEEEEec-ccccc Confidence 999999998875432110 001113456665432 22 23334444332 23445 Q ss_pred hHHhhhhhhcCcccccceEEEEEecCC Q lcl|Aclame:pro 320 DQIIGKYAMGHGGLRPEAAGALVFTPA 346 (347) Q Consensus 320 d~i~~~~~~G~~~lRPe~~~~l~~~~a 346 (347) ..+++.+.+|.++++|++.+.+..++| T Consensus 371 ~~~~~~~r~d~~~~~~~a~~~~~~~~a 397 (397) T protein:vir:96 371 QLLAGIIRYDVKATDKKAGFYVTFTIG 397 (397) T ss_pred eeEEEEEEEccEEecccceEEEEeecC Confidence 678999999999999999999998888 No 142 >protein:vir:108211 Length: 318 # NCBI annotation: gp9 # Family: family:all:6420 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552338;genbank:gi:160700658;genbank:GeneID:5758931 Probab=98.96 E-value=1e-10 Score=75.35 Aligned_cols=292 Identities=13% Similarity=0.017 Sum_probs=164.0 Q ss_pred CCCCccCccccccCcccCccccHHHHHH--HHHhHHHHHHHHHHHhhhcccccc-cccCCceEEE----eccccceeeee Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFL--KVFGGEVLTAFVRRSVTMDKHMVR-TIQNGKSASF----PVMGRTKGYYL 73 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~i--e~f~geV~~~f~~~s~~~~~~~~r-ti~~G~tv~i----~~iG~~t~~~~ 73 (347) |+|+++..++...+. =.++. +| ..|-......-.+.....+....+ ..+++-++.| |.......+.. T Consensus 1 ~~~~~~i~s~~~~~~-----itv~~-ll~~P~~I~~~i~e~~~~~~iad~lf~~~~a~~~~~v~f~~~~p~~~~~d~e~V 74 (318) T protein:vir:10 1 MTAPTGIVSVSDGPA-----ITVRE-LVGNPLWIPTALKKMMVNQFISESLFRNGGANPNGVVAYNEGNPSFLEDDVADV 74 (318) T ss_pred CCCCCcceeeecCCc-----eehHH-hhCCchhHHHHHHHHHhccchhhhhhhcccccccceeEEEecccccccCcHhhc Confidence 999887766543321 11111 11 122222222222333333322222 3444557777 44555566778 Q ss_pred cCCCCCCCCCCCCCCCceEE-EEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccc Q lcl|Aclame:pro 74 APGENLDDKRKDIKHSEKVI-QIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASN 152 (347) Q Consensus 74 ~~g~~~~~~~~~~~~~~~~l-~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~ 152 (347) .+|.+++.. ...+.+..+ .+.+ .--.+.|.|--......|..+...++++.+++++.|+..+..|..+.-...+.+ T Consensus 75 aEggEiP~~--~~~~G~~~ia~~~K-~G~~~~vS~Em~~~n~~~~v~r~~~~l~Nti~r~~d~~a~dal~sa~t~~~~~s 151 (318) T protein:vir:10 75 AEFGEIPVS--AGARGLPRTAFAVK-KALGVRVSKEMIDENRVGAVNDQMLQLRNTFIRANDRSAKALLQSPIVPTLAVP 151 (318) T ss_pred cCccccccc--CCCCCchhhhhheh-hccceeccHHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccCC Confidence 889998754 355555555 3333 356778888888889999999999999999999999998876533221100001 Q ss_pred cccCcccCceeeeecccccccchhhHHHHHHHHHHHHH-HHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhcccccc Q lcl|Aclame:pro 153 ENIAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLAR-ARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALID 231 (347) Q Consensus 153 ~~~~g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~-~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~ 231 (347) . ...+.+.... +..++....+.....+..+. ...++ +-.-.--.+||.|..|..|++++.+... |.+..+ T Consensus 152 ~--~w~~~~~~~~-----d~~~A~e~v~~a~~~~~~a~~~~~~~-~~GY~pdtIVlhP~~~~~l~~n~~~~~~-y~~~a~ 222 (318) T protein:vir:10 152 T--AWDNGGKVRT-----DIAIAIEQISTAAPTAYPAGVGSSDE-YFGFIPDTIVMHYALLPILMDNENFMKV-YERNAN 222 (318) T ss_pred c--CCCCcccccc-----cchhhhhhhhhhhhhhhhhhhhhhhh-ccCccceeeEECHHHHHHHhcchhhhhh-hhccch Confidence 0 0001111111 11111100000011111111 11111 1122224789999999999999776432 322111 Q ss_pred ------ccccce-EEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhh-h Q lcl|Aclame:pro 232 ------PETGNI-RNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTV-K 303 (347) Q Consensus 232 ------~~~G~v-~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv-~ 303 (347) --.|.+ ++++|++|+.|+++|... ++++++-.+|+. - T Consensus 223 ~~~~~~~~tg~~~g~~lGl~vi~s~~~p~~~-----------------------------------alvlq~g~vG~~~d 267 (318) T protein:vir:10 223 YVSTAPDWTGNFPGSVMGLNVIRSRTFPIDR-----------------------------------VLIMERGTVGFYSD 267 (318) T ss_pred hhhhcccccccccceeeceEEeecCccCCCe-----------------------------------eEEEecCCcceeec Confidence 112433 578999999999999532 245555555533 3 Q ss_pred hhheeeccccch-------hhHhhHHhhhhhhcCcccccceEEEEE--ecC Q lcl|Aclame:pro 304 LKDMALERARRP-------EFQADQIIGKYAMGHGGLRPEAAGALV--FTP 345 (347) Q Consensus 304 ~~~~~~e~~~~~-------~~~~d~i~~~~~~G~~~lRPe~~~~l~--~~~ 345 (347) ..+++.+..|.. ...+|.++.+......|.+|-++.-|. .+| T Consensus 268 ~~pl~~t~~~~egg~~~g~~~~s~~~~~~~~~~~~V~~PkA~~~itgi~~~ 318 (318) T protein:vir:10 268 TRPLQFTALYPEGNGPNGGPTESYRADASHKRALAVDQPKAALWLTGIVTP 318 (318) T ss_pred cccceeeecccCCCCCCCCcchhhheehheeeeeeeeCcceeEEEeeccCC Confidence 445666777743 567899999999999999999998887 555 No 143 >protein:vir:5739 Length: 366 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892050;genbank:gi:33770513;interpro:IPR006444;uniprot:Q7Y410;genbank:GeneID:1732928 Probab=98.94 E-value=1.3e-10 Score=74.76 Aligned_cols=299 Identities=13% Similarity=0.041 Sum_probs=147.2 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcc-cccccccCCceEEEecc-ccceeeeecCCCC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDK-HMVRTIQNGKSASFPVM-GRTKGYYLAPGEN 78 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~-~~~rti~~G~tv~i~~i-G~~t~~~~~~g~~ 78 (347) |+....+.....+ ..+...++--.|.-+++.+++.+..+..++++.+ .++-+...|+ +.+|+. +.+.+.....|.. T Consensus 52 ~a~~~~~~~~~~~-a~~~~~~~Gg~lvP~~~~~~ii~~l~~~s~l~~lg~~~v~~~~g~-~~~p~~t~~~~a~wv~E~~~ 129 (366) T protein:vir:57 52 FAATELGDTGLSM-AISTAAGSGGALIPQNMQNEVIELLRDRTVVRILGARSIPLPNGN-LSMPRLSGGATAGYVGEGKD 129 (366) T ss_pred HHHHhhcchhhhh-hccccccCCccccchhHHHHHHHHHhhhcchhhhceeeeecCCCc-eEEEEEeCCcceeeeccCcc Confidence 1110000000000 0000001101134578899999888888887765 4433444454 777765 5555655666776 Q ss_pred CCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccCcc Q lcl|Aclame:pro 79 LDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIAGL 158 (347) Q Consensus 79 ~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~g~ 158 (347) ++.+ +++.+++++..-+. +.-..|.+-=..++.+|+.+.+.+++++++++..|+.+|. +......+.+..... T Consensus 130 ~~~s--~~~f~~i~~~~~k~-~~~~~iS~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l~----G~G~~~~p~Gi~~~~ 202 (366) T protein:vir:57 130 VVAT--GATFDDVKLSAKTM-IALVPVSNQLIGRAGFNVEQLLLGDILSAIATREDKAFLR----DDGTGDTPKGMKAVA 202 (366) T ss_pred cccc--ccceeEEEEeeEEE-EEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHhhc----cCCCCccccceeecc Confidence 6543 46667766665554 3334454322235678999999999999999999998873 111000000000000 Q ss_pred cCceeeeecccccccchhhHHHHHHHHHHHH-HHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhccccccccccce Q lcl|Aclame:pro 159 GQAVVLNIGAAADLVDVEARGKAILKGLTLA-RARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALIDPETGNI 237 (347) Q Consensus 159 ~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a-~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~v 237 (347) ...+.....+++. . +... ++.++++ ...+...+.-...-.++++|..|..|.+-.. .+..|.-. +... T Consensus 203 ~~~~~~~~~~~t~-~----~~~~-~~~~~~~~~~~~~~~~~~~~~a~~vmn~~~~~~L~~lkd-~~G~~l~~-~~~~--- 271 (366) T protein:vir:57 203 TAANRLVAWTGTA-I----NLTT-IDEYLDSLILKHMDSNSNMIRCGWGLSNRTYMTLFGLRD-GNGNKVYP-EMSQ--- 271 (366) T ss_pred ccccceeeccccc-c----chhh-HHHHHHHHHHhhhccccccccCEEEecHHHHHHHHhhhc-cCCceecc-CCCC--- Confidence 0000000000000 0 1111 1212221 1222222222223345899999999875321 22233211 1122 Q ss_pred EEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeeccccch-- Q lcl|Aclame:pro 238 RNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARRP-- 315 (347) Q Consensus 238 ~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~~~-- 315 (347) +.++|++|+.|+.+|...... .+...=|.+||+... +.. ..+++++..++. T Consensus 272 g~l~G~Pvv~s~~ip~~~~~~-----------------~~~~~i~~gdfs~~~--i~~--------~~~i~i~~~~ea~~ 324 (366) T protein:vir:57 272 GILKGYPIQRTSAIPANLGDD-----------------GNESEIYFCDFNDVV--IGE--------DGMMKVDFSTEATY 324 (366) T ss_pred CeecceeeEEccccccccccC-----------------CCccEEEEEecceEE--EEE--------ecceEEEEeecccc Confidence 468999999999999532110 011112456776542 222 223333333221 Q ss_pred ---------hhHh--hHHhhhhhhcCcccccceEEEEEecCC Q lcl|Aclame:pro 316 ---------EFQA--DQIIGKYAMGHGGLRPEAAGALVFTPA 346 (347) Q Consensus 316 ---------~~~~--d~i~~~~~~G~~~lRPe~~~~l~~~~a 346 (347) .++. -.++..+.++.+++||++.+.|.-.-= T Consensus 325 ~~~~g~~~~~f~~~~~~iR~~~~~d~~v~~~~a~~~lt~~~~ 366 (366) T protein:vir:57 325 KDADGQLVSAFARNQSLIRVVTEHDIGFRHPEGLVLGTGVIW 366 (366) T ss_pred ccccccchhhhhcCceeEEeeeeeCcEeeccccEEEEecccC Confidence 1222 257788889999999999998864333 No 144 >protein:vir:105038 Length: 428 # NCBI annotation: major capsid head protein precursor # Family: family:all:21 # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006586;genbank:gi:46402092;genbank:GeneID:2777903 Probab=98.94 E-value=2.4e-10 Score=73.28 Aligned_cols=295 Identities=12% Similarity=0.068 Sum_probs=145.5 Q ss_pred CCCCccCccccccCccc-CccccHHHHHHHHHhHHHHHHHHHHHhhhcc-cccccccCCceEEEecc-ccceeeeecCCC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKG-QSAADKLALFLKVFGGEVLTAFVRRSVTMDK-HMVRTIQNGKSASFPVM-GRTKGYYLAPGE 77 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~-~~~~d~~al~ie~f~geV~~~f~~~s~~~~~-~~~rti~~G~tv~i~~i-G~~t~~~~~~g~ 77 (347) |+.-........+.-.. ..+|- .+--+.|..++.+..+..++++.+ .++-+..+|+ +.||++ +.+.+.....|. T Consensus 113 ~~~~~~~~~~~~~~~~~~~~~gg--~liP~~~~~~ii~~l~~~~~l~~~~~~~~~~~~g~-~~~p~~~~~~~a~~v~Eg~ 189 (428) T protein:vir:10 113 FASDELNDQSVSMAISTAAGSGG--VLIPQNIHSEVIELLRDRTIVRKLGARSIPLPNGN-MSLPRLAGGATASYTGENQ 189 (428) T ss_pred HhhhhhhhhhHhhhhcccccCCc--cccchhHHHHHHHHHhhhchhhhhcceeeecCCcc-eEEEEEeCCcceeeeccCc Confidence 11111000000000000 00111 122377788888887778888776 3322223344 788875 344555555566 Q ss_pred CCCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccCc Q lcl|Aclame:pro 78 NLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIAG 157 (347) Q Consensus 78 ~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~g 157 (347) .++. .++..+++++...++ +.-+.|.+-=..++.+++.+.+.++++++|++..|+.+|. +.. .+..+.| T Consensus 190 ~~~~--~~~~f~~i~~~~~k~-~~~v~is~ell~ds~~~l~~~i~~~l~~ai~~~~d~~~l~----G~G----~~~~p~G 258 (428) T protein:vir:10 190 DAKV--SEARFDDVKLTAKTM-IAMVPISNALIGRAGFNVEQLVLQDILTAISVREDKAFMR----DDG----TGDTPIG 258 (428) T ss_pred cccc--cccceeeEEeeeEEE-EEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhc----cCC----CCccccc Confidence 6654 346677777766554 3344554432334678899999999999999999998863 111 0111111 Q ss_pred ccC----ceeeeecccccccchhhHHHHHHHHHHHHHHHHhh-ccCCCCCCEEEEChHHHHHHhcchhhhhhhccccccc Q lcl|Aclame:pro 158 LGQ----AVVLNIGAAADLVDVEARGKAILKGLTLARARLTK-NYVPAGDRRFYCAPEDYSAILSALMPNAANYAALIDP 232 (347) Q Consensus 158 ~~~----~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde-~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~ 232 (347) .-. .+.+...+.... ..... .+...++...+.. .+.....-..+++|..|..|.+-. -.++.|.-. +. T Consensus 259 i~~~~~~~~~~~~~~~~~~----~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lk-d~~G~~i~~-~~ 331 (428) T protein:vir:10 259 MKARATQWNRLLPWAADAA----VNLDT-IDTYLDSIILMSMDGNSNMISSGWGMSNRTYMKLFGLR-DGNGNKVYP-EM 331 (428) T ss_pred ccccccccccccccccccc----ccHHH-HHHHHHHHHHhhhccccccccCEEEEcHHHHHHHHHhh-ccCCceecc-CC Confidence 111 111111111111 11111 1112222111111 111122234577999999886532 233333211 12 Q ss_pred cccceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeeccc Q lcl|Aclame:pro 233 ETGNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERA 312 (347) Q Consensus 233 ~~G~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~ 312 (347) ..| +++|++|+.++++|....... ..+.=|.+||+..+ + +...+++++.. T Consensus 332 ~~g---~l~G~pv~~~~~~p~~~~~~~-----------------~~~~i~~gd~s~~~--i--------~~~~~i~i~~~ 381 (428) T protein:vir:10 332 AQG---MLKGYPIQRTSAIPANLGEGG-----------------KESEIYFADFNDVV--I--------GEDGNMKVDFS 381 (428) T ss_pred CCC---eeeceeeEEeccccccccCCC-----------------ccceEEEEecceEE--E--------EEecceEEEee Confidence 223 699999999999985321100 00111445565432 2 12233444433 Q ss_pred cchh-----------hH--hhHHhhhhhhcCcccccceEEEEEecCC Q lcl|Aclame:pro 313 RRPE-----------FQ--ADQIIGKYAMGHGGLRPEAAGALVFTPA 346 (347) Q Consensus 313 ~~~~-----------~~--~d~i~~~~~~G~~~lRPe~~~~l~~~~a 346 (347) ++.. ++ .-.++....++..+.||++.+.++-..= T Consensus 382 ~~~~~~~~~~~~~~~f~~~~~~~R~~~r~d~~v~~p~a~~~~t~~~~ 428 (428) T protein:vir:10 382 KEASYIDTDGKLVSAFSRNQSLIRVVTEHDIGFRHPEGLVLGTGVLF 428 (428) T ss_pred cccccccccccccchhhcchhheeeeeeeCceeeccceEEEEeccCC Confidence 3221 11 2456888899999999999998864433 No 145 >protein:vir:9875 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795637;genbank:gi:28876404;genbank:GeneID:1257935 Probab=98.92 E-value=3.2e-10 Score=72.65 Aligned_cols=280 Identities=10% Similarity=0.024 Sum_probs=154.0 Q ss_pred CCCCccCccccccCccc---CccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEecccc--ceeeeecC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKG---QSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVMGR--TKGYYLAP 75 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~---~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~iG~--~t~~~~~~ 75 (347) |-.--.+.- ++.-.. +.+-++| |+++|+.-+.+.++ +++..|..++..|++++++.-.. ...++... T Consensus 1 ~~~~~~~~e--~nlt~~~dl~~~~siD--f~~~f~~~i~~L~~----~LGv~r~~pla~GstIkt~k~~~y~gda~dVaE 72 (296) T protein:vir:98 1 MVTSRTYPE--ENLIKSTDLKYPITID--VTNKFQENISKLLE----MLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPE 72 (296) T ss_pred CCCccccCc--CCCcchhhhhhhhhhh--hHHHHhhhHHHHHH----HhhhcccccccCCCEEeeccceeeeeccccccC Confidence 322111111 011000 0122344 99999988876654 56777777888899997654222 23356778 Q ss_pred CCCCCCCCCCCCCC---ceEEEEeeeeecchhhccHHHH-H-hCc-chHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccc Q lcl|Aclame:pro 76 GENLDDKRKDIKHS---EKVIQIDGLLTSDVLIYDIEDA-M-NHY-DVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPA 149 (347) Q Consensus 76 g~~~~~~~~~~~~~---~~~l~ID~~~~~~~~Vdd~D~~-q-~~~-D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~ 149 (347) |..|+.+ .+... ..+++|.++.-. + =||+ | .-| |...+..+++..+|++++|..++..+-.+. T Consensus 73 Ge~Ipls--kvt~~~~~t~t~~ikK~rK~---t--TdEAIqlsGyg~aVgetd~qL~~~iq~kId~d~~t~LktaT---- 141 (296) T protein:vir:98 73 GEVIPLS--KVERKIHSEKKIELKKYRKA---T--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGT---- 141 (296) T ss_pred Ccccchh--hheeeecceEEEEeeccccc---c--CHHHHHhhcCCchhHHHHHHHHHHHHHhhhHHHHHHHhccc---- Confidence 8888754 34433 367777664322 4 3555 4 434 589999999999999999999987652111 Q ss_pred ccccccCcccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhcccc Q lcl|Aclame:pro 150 ASNENIAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAAL 229 (347) Q Consensus 150 ~~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~ 229 (347) .+...+.+....++...+.++..+|.+.+ ....+++|+|...+.+|++.++.....-|. T Consensus 142 -------------------~t~~~t~~~lQ~Ala~~~~~l~~~feded--~~~~V~FVnP~D~a~ylg~a~it~qt~fG~ 200 (296) T protein:vir:98 142 -------------------GTQDALGAGLQGALASAWGKLQVLFEDYG--SERAIVFANSLDVAEYIAKAGITTQTAFGL 200 (296) T ss_pred -------------------ceeeechhhHHHHHHHHhhhhhhhccccC--CCceEEEEehHHHHHHhcCCccchhheech Confidence 00001223445566677788888888765 245789999999999999887643222111 Q ss_pred ccccccceEEEeceeEEEeccccccccccccccC---ccccccccccccccccccccccccceeEEeechhhhhhhhhhh Q lcl|Aclame:pro 230 IDPETGNIRNVMGFEVIEVPHLTVGGAGDNNPAD---GVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKD 306 (347) Q Consensus 230 ~~~~~G~v~~i~G~~V~~sn~lp~~~~~~~~~~~---~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~ 306 (347) ++ +.++.|..|+.|+.+|.+....+..-+ ...+..+..+ +..-....|.++.+|+.- T Consensus 201 -ty----l~nfLG~~II~S~kV~~G~~~~T~~~Ni~~ay~~~~~~~l---~~~f~~~~d~tglIGv~h------------ 260 (296) T protein:vir:98 201 -TY----LVDFTGTVIISTNDVTKGEIWATVPENIIFAYINPNNSEL---AKEFNLYGDPTGYIGMNH------------ 260 (296) T ss_pred -hh----hhhccccEEEEcCcCCCceEEEeeecceEEEeecccccch---hhhhccccccccceEEEe------------ Confidence 11 124788999999999976543322110 0111100101 111112234444444221 Q ss_pred eeeccccchhhHhhHHhhhhhhcCc--ccccceEEEEEecCCC Q lcl|Aclame:pro 307 MALERARRPEFQADQIIGKYAMGHG--GLRPEAAGALVFTPAA 347 (347) Q Consensus 307 ~~~e~~~~~~~~~d~i~~~~~~G~~--~lRPe~~~~l~~~~aa 347 (347) ...++.- -+..+..+|.. +=|+|+++...+.|+- T Consensus 261 ---~~~~~~~----t~eT~~~~~~~lfpE~~dgiv~~tI~~~~ 296 (296) T protein:vir:98 261 ---FQENTTL----TIQTLLVSGMLMYPERIDGIVKVTLTPGV 296 (296) T ss_pred ---cccccee----eehhHhHhHHHhcccccceEEEEEecCCC Confidence 1111111 11112222222 2377888888776666 No 146 >protein:vir:101650 Length: 497 # NCBI annotation: gp13 # Family: family:all:585 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654768;genbank:gi:109302766;genbank:GeneID:4156084 Probab=98.90 E-value=1.4e-10 Score=74.67 Aligned_cols=297 Identities=13% Similarity=0.099 Sum_probs=151.0 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEecc--ccceeeeecCCCC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVM--GRTKGYYLAPGEN 78 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~i--G~~t~~~~~~g~~ 78 (347) +.+.-.... ..+....+..++--.+..+.|..++.+..++.+.++++.++.++.++ ++.||+. +..++.....|+. T Consensus 138 ~~~~~~~~~-~~~~~~~~~~~~gg~~vp~~~~~~ii~~~~~~~~i~~l~~~~~~~~~-~~~~~~~~~~~~~a~wv~E~~~ 215 (497) T protein:vir:10 138 FADGETAPA-AIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSP-NLSYLTESAAHNNAAAVAEAGT 215 (497) T ss_pred HhhhhhhHH-HHHhhhcccCcccccccchhhhHHHHHHHHhhhhHHhhccccccCCC-ceEEEEEcCCCCcceeeccCcc Confidence 000000000 00000011112222256689999999999888999999888777665 5888864 3445666666666 Q ss_pred CCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccCcc Q lcl|Aclame:pro 79 LDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIAGL 158 (347) Q Consensus 79 ~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~g~ 158 (347) ++.+ +++.+++++...++- .-..|.+ +-.+.+.++.+.+.++.++++++..|+.+|. +.... .+.|. T Consensus 216 ~~~s--~~~f~~i~~~~~k~a-~~~~iS~-ell~d~~~l~~~i~~~l~~~i~~~~d~~~l~----G~G~~-----~p~Gi 282 (497) T protein:vir:10 216 YPFS--SEEFARVYEQVGKVA-NALTITD-EGLRDAPELFNFVQGRLLEGIQRKEEVQLLA----GGGYP-----GVNGL 282 (497) T ss_pred cccc--cccceeeEeeeeeeE-eecHhHH-HHHHhHHHHHHHHHHHHHHHHHHHHHHHhhc----CCCcc-----ccccc Confidence 6543 466777666666542 2233432 2233345688888999999999999998863 11000 00000 Q ss_pred c---Cceeeeecccc--------------------cccchhh-------------------------HHHHHHHHHHHHH Q lcl|Aclame:pro 159 G---QAVVLNIGAAA--------------------DLVDVEA-------------------------RGKAILKGLTLAR 190 (347) Q Consensus 159 ~---~~~~i~~~~~~--------------------~~~~~~~-------------------------~~~~i~~~l~~a~ 190 (347) - .+..+..+... ....... ....+...++.+. T Consensus 283 l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 362 (497) T protein:vir:10 283 LQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAF 362 (497) T ss_pred ccccccccccccccchhhhhhhhhhhhhhcccccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHH Confidence 0 00000000000 0000000 0000111222222 Q ss_pred HHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhccc------cccccccceEEEeceeEEEeccccccccccccccCc Q lcl|Aclame:pro 191 ARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAA------LIDPETGNIRNVMGFEVIEVPHLTVGGAGDNNPADG 264 (347) Q Consensus 191 ~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~------~~~~~~G~v~~i~G~~V~~sn~lp~~~~~~~~~~~~ 264 (347) ..+..... ...-.+|++|..|..|.+-.+ .+..|.. ......+....++|.+|+.++.+|... T Consensus 363 ~~~~~~~~-~~~~~~vmn~~~~~~l~~lkd-~~G~~i~~~~~~~~~~~~~~~~~~l~G~pV~~t~~~~~~~--------- 431 (497) T protein:vir:10 363 VDIQLTLF-QTPNAVVMNPRDWELLRLTKD-ANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGT--------- 431 (497) T ss_pred hhhhhhcc-cCCCeEEEchHHHHHHHHhhc-CCCceeccCcccccccccccCCceeeceeeEecCCCCCCc--------- Confidence 22222111 011146899999998754322 1223321 111122334589999999999998421 Q ss_pred cccccccccccccccccccccccceeEEeechhhhhhhhhhheeeccccc--hhhH--hhHHhhhhhhcCcccccceEEE Q lcl|Aclame:pro 265 VAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARR--PEFQ--ADQIIGKYAMGHGGLRPEAAGA 340 (347) Q Consensus 265 ~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~~--~~~~--~d~i~~~~~~G~~~lRPe~~~~ 340 (347) -+-+||+...-+++. .++++++.... +.++ .-.|++...++..+++|++++. T Consensus 432 ----------------~~~Gd~~~~~~~i~~--------r~~~~v~~~~~~~~~f~~n~v~~r~~~r~~~~v~~p~A~~~ 487 (497) T protein:vir:10 432 ----------------ILVGHFAPSVIQTAR--------REGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQL 487 (497) T ss_pred ----------------eEEeecccceEEEEE--------ecccEEEeecccchhhhcCcEEEEEEEeecceeeccccEEE Confidence 123455443223332 33334443321 1122 3347778889999999999999 Q ss_pred EEecCCC Q lcl|Aclame:pro 341 LVFTPAA 347 (347) Q Consensus 341 l~~~~aa 347 (347) +...++| T Consensus 488 l~~~~~~ 494 (497) T protein:vir:10 488 IQLKKGA 494 (497) T ss_pred EEecCCc Confidence 9988888 No 147 >protein:vir:7855 Length: 497 # NCBI annotation: gp12 # Family: family:all:585 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817462;genbank:gi:29565891;genbank:GeneID:1259081 Probab=98.90 E-value=1.4e-10 Score=74.67 Aligned_cols=297 Identities=13% Similarity=0.099 Sum_probs=151.0 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEecc--ccceeeeecCCCC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVM--GRTKGYYLAPGEN 78 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~i--G~~t~~~~~~g~~ 78 (347) +.+.-.... ..+....+..++--.+..+.|..++.+..++.+.++++.++.++.++ ++.||+. +..++.....|+. T Consensus 138 ~~~~~~~~~-~~~~~~~~~~~~gg~~vp~~~~~~ii~~~~~~~~i~~l~~~~~~~~~-~~~~~~~~~~~~~a~wv~E~~~ 215 (497) T protein:vir:78 138 FADGETAPA-AIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSP-NLSYLTESAAHNNAAAVAEAGT 215 (497) T ss_pred HhhhhhhHH-HHHhhhcccCcccccccchhhhHHHHHHHHhhhhHHhhccccccCCC-ceEEEEEcCCCCcceeeccCcc Confidence 000000000 00000011112222256689999999999888999999888777665 5888864 3445666666666 Q ss_pred CCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccCcc Q lcl|Aclame:pro 79 LDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIAGL 158 (347) Q Consensus 79 ~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~g~ 158 (347) ++.+ +++.+++++...++- .-..|.+ +-.+.+.++.+.+.++.++++++..|+.+|. +.... .+.|. T Consensus 216 ~~~s--~~~f~~i~~~~~k~a-~~~~iS~-ell~d~~~l~~~i~~~l~~~i~~~~d~~~l~----G~G~~-----~p~Gi 282 (497) T protein:vir:78 216 YPFS--SEEFARVYEQVGKVA-NALTITD-EGLRDAPELFNFVQGRLLEGIQRKEEVQLLA----GGGYP-----GVNGL 282 (497) T ss_pred cccc--cccceeeEeeeeeeE-eecHhHH-HHHHhHHHHHHHHHHHHHHHHHHHHHHHhhc----CCCcc-----ccccc Confidence 6543 466777666666542 2233432 2233345688888999999999999998863 11000 00000 Q ss_pred c---Cceeeeecccc--------------------cccchhh-------------------------HHHHHHHHHHHHH Q lcl|Aclame:pro 159 G---QAVVLNIGAAA--------------------DLVDVEA-------------------------RGKAILKGLTLAR 190 (347) Q Consensus 159 ~---~~~~i~~~~~~--------------------~~~~~~~-------------------------~~~~i~~~l~~a~ 190 (347) - .+..+..+... ....... ....+...++.+. T Consensus 283 l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 362 (497) T protein:vir:78 283 LQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAF 362 (497) T ss_pred ccccccccccccccchhhhhhhhhhhhhhcccccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHH Confidence 0 00000000000 0000000 0000111222222 Q ss_pred HHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhccc------cccccccceEEEeceeEEEeccccccccccccccCc Q lcl|Aclame:pro 191 ARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAA------LIDPETGNIRNVMGFEVIEVPHLTVGGAGDNNPADG 264 (347) Q Consensus 191 ~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~------~~~~~~G~v~~i~G~~V~~sn~lp~~~~~~~~~~~~ 264 (347) ..+..... ...-.+|++|..|..|.+-.+ .+..|.. ......+....++|.+|+.++.+|... T Consensus 363 ~~~~~~~~-~~~~~~vmn~~~~~~l~~lkd-~~G~~i~~~~~~~~~~~~~~~~~~l~G~pV~~t~~~~~~~--------- 431 (497) T protein:vir:78 363 VDIQLTLF-QTPNAVVMNPRDWELLRLTKD-ANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGT--------- 431 (497) T ss_pred hhhhhhcc-cCCCeEEEchHHHHHHHHhhc-CCCceeccCcccccccccccCCceeeceeeEecCCCCCCc--------- Confidence 22222111 011146899999998754322 1223321 111122334589999999999998421 Q ss_pred cccccccccccccccccccccccceeEEeechhhhhhhhhhheeeccccc--hhhH--hhHHhhhhhhcCcccccceEEE Q lcl|Aclame:pro 265 VAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARR--PEFQ--ADQIIGKYAMGHGGLRPEAAGA 340 (347) Q Consensus 265 ~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~~--~~~~--~d~i~~~~~~G~~~lRPe~~~~ 340 (347) -+-+||+...-+++. .++++++.... +.++ .-.|++...++..+++|++++. T Consensus 432 ----------------~~~Gd~~~~~~~i~~--------r~~~~v~~~~~~~~~f~~n~v~~r~~~r~~~~v~~p~A~~~ 487 (497) T protein:vir:78 432 ----------------ILVGHFAPSVIQTAR--------REGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQL 487 (497) T ss_pred ----------------eEEeecccceEEEEE--------ecccEEEeecccchhhhcCcEEEEEEEeecceeeccccEEE Confidence 123455443223332 33334443321 1122 3347778889999999999999 Q ss_pred EEecCCC Q lcl|Aclame:pro 341 LVFTPAA 347 (347) Q Consensus 341 l~~~~aa 347 (347) +...++| T Consensus 488 l~~~~~~ 494 (497) T protein:vir:78 488 IQLKKGA 494 (497) T ss_pred EEecCCc Confidence 9988888 No 148 >protein:vir:2770 Length: 318 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:59 # MgeName: Stx2 converting bacteriophage I # Cross-refs: genbank:acc:NP_612887;genbank:gi:20065804;genbank:GeneID:935710 Probab=98.90 E-value=6.5e-10 Score=70.95 Aligned_cols=258 Identities=10% Similarity=0.064 Sum_probs=148.3 Q ss_pred CCCCccCccccccCccc------CccccHHHHHHHHHhHHHHHHHHHHHhhhcc---------ccccccc--CCceEEEe Q lcl|Aclame:pro 1 MANATGGQQIGANQGKG------QSAADKLALFLKVFGGEVLTAFVRRSVTMDK---------HMVRTIQ--NGKSASFP 63 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~------~~~~d~~al~ie~f~geV~~~f~~~s~~~~~---------~~~rti~--~G~tv~i~ 63 (347) |+|+..+... -.+. -...|+ .++.|++.+...-++.+-++.+ ++...+. .|++|.|. T Consensus 1 mt~~~~~~~~---~~~~~~~ft~~~~~~~---~vk~ws~~l~~~~~~~~~~~~~~g~~~~~~I~r~~dL~K~~GD~Vtf~ 74 (318) T protein:vir:27 1 MTTVTSAQAN---KLFQVALFTAANRNRS---MVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFS 74 (318) T ss_pred CCccCCCChH---HHHHHHHHHHHhcCCh---HHHHHHHhhhhHHHhhhhhhcccCCCCCceEEEeccCCCCCccEEEEe Confidence 9987664321 0010 011122 5788999887666655433322 2222332 59999999 Q ss_pred ccccceeeeecCCCCCCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 64 VMGRTKGYYLAPGENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAK 143 (347) Q Consensus 64 ~iG~~t~~~~~~g~~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~ 143 (347) -+...+..-..-++.+.+..+.++...-.|.||+..-.=..=..+++-.+-+|+|++.-..++.-+++..||.+|.+|+. T Consensus 75 L~~~L~g~gv~Gd~~lEGnee~L~~~~d~l~IDq~r~~V~~gg~msqqRt~~dlR~~ar~~L~~w~~~~~Dq~~~v~laG 154 (318) T protein:vir:27 75 IMHKLSKRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAG 154 (318) T ss_pred EeeccccCccccCceeeccccceEEEeeEEEEeeeccccccccchhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 88887766666667777777788888889999997432111146777778899999999999999999999999998864 Q ss_pred hhhc--------cc---ccc-----cccCcccCceeeeecccccc---cchhhHHHHHHHHHHHHHHHHhhccCC----- Q lcl|Aclame:pro 144 LCNL--------PA---ASN-----ENIAGLGQAVVLNIGAAADL---VDVEARGKAILKGLTLARARLTKNYVP----- 199 (347) Q Consensus 144 ~a~~--------a~---~~~-----~~~~g~~~~~~i~~~~~~~~---~~~~~~~~~i~~~l~~a~~~Lde~~VP----- 199 (347) +-.. +. +.. +.+..-.....+..+.++.. +..+... ++.|-.+.+++++..-| T Consensus 155 arg~~~n~~~~~p~~~~~~~~~~~~N~v~aPt~~r~~~~g~at~~~~l~stD~~s---~~lid~~~~~~~~~a~pi~PV~ 231 (318) T protein:vir:27 155 ARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFS---IGLVDNLSLFIDEMAHPLQPVR 231 (318) T ss_pred cccccccccceEecccCccchhhhhcccCCCCCCcEEeccCccchhhhhhccccc---HHHHHHHHHHHHHhCCCCccee Confidence 3310 00 000 00000000112222222111 1111212 23344556666663322 Q ss_pred CC--C-------CEEEEChHHHHHHhcchh---hhh----hhccc---cccccccceEEEeceeEEEecccccccccccc Q lcl|Aclame:pro 200 AG--D-------RRFYCAPEDYSAILSALM---PNA----ANYAA---LIDPETGNIRNVMGFEVIEVPHLTVGGAGDNN 260 (347) Q Consensus 200 ~~--g-------R~~vv~P~~~~~Ll~~~~---~~~----~~~~~---~~~~~~G~v~~i~G~~V~~sn~lp~~~~~~~~ 260 (347) -+ . ++++++|.+|..|..+.. +.+ +...+ ...+-.|.++.+.|+=|.+.+++|.. T Consensus 232 v~g~~~~~~~~~yV~~~~p~q~~~Lrtdt~~~~w~d~q~~A~~r~~g~knPLF~G~~gm~ngvil~~~~~vpIr------ 305 (318) T protein:vir:27 232 LSGDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIR------ 305 (318) T ss_pred eccccccCCcceEEEEechHHHHHHhhcCCCHHHHHHHHHHHhcccccCCCceecceeeecCEEEeecCCccEE------ Confidence 11 2 677899999999998752 222 22222 23477899999999999999998852 Q ss_pred ccCccccccccccccccccccccccccceeEEeechhhhhhhhhhhee Q lcl|Aclame:pro 261 PADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMA 308 (347) Q Consensus 261 ~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~ 308 (347) |.+|.+-+|. + ++ T Consensus 306 -------------f~~G~~v~~~----~------------------~~ 318 (318) T protein:vir:27 306 -------------FYQGQRFWYQ----R------------------IT 318 (318) T ss_pred -------------EcCCCeeeee----e------------------cC Confidence 1122221111 0 00 No 149 >protein:vir:9927 Length: 295 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795689;genbank:gi:28876459;genbank:GeneID:1258000 Probab=98.90 E-value=3.1e-10 Score=72.75 Aligned_cols=272 Identities=12% Similarity=0.051 Sum_probs=147.9 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEecccc-ceeeeecCCCCC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVMGR-TKGYYLAPGENL 79 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~iG~-~t~~~~~~g~~~ 79 (347) ||..|.-.. +..+ ..-..| |++.|+.-+.+-+ .+++..|..++..|+++++|.-.- ...+++..|..| T Consensus 1 mAe~nlt~~--~dL~---~~~sid--fv~~f~~~i~~L~----~~Lgi~r~~p~a~G~tIt~pK~~~tgda~dVaEGe~I 69 (295) T protein:vir:99 1 MAEKNLNTM--ADLG---DIKSID--FVNKFSKNINDLL----KLLGVTRRETLTNDLKIQTYKWEVTLDQTDPGEGETI 69 (295) T ss_pred CCCcccccH--hhcc---Cceeeh--hhHHhhhhHHHHH----HHhccccccccccCCeEEeeeeeeecccccccCCccc Confidence 999655322 2222 122344 9999997776554 356777777888999999997552 244678889998 Q ss_pred CCCCCCCCCC---ceEEEEeeeeecchhhccHHHH-H-hCc-chHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccc Q lcl|Aclame:pro 80 DDKRKDIKHS---EKVIQIDGLLTSDVLIYDIEDA-M-NHY-DVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNE 153 (347) Q Consensus 80 ~~~~~~~~~~---~~~l~ID~~~~~~~~Vdd~D~~-q-~~~-D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~ 153 (347) +.+ .++.. ..++++.++. . .+. ||+ | .-| |...|..+|+..+|++++|..++..+.. +.. T Consensus 70 pls--kvt~~~~~t~t~kikK~r--K-~tT--dEAIqlsGygdpvgead~qL~~~ia~kId~D~~~~lkt-at~------ 135 (295) T protein:vir:99 70 PLS--KVTRTKDKDYTVKWFKKR--R-ATT--AEAIARHGAARAITEADKRIMRELQNGIKDAFFTFLKT-KPT------ 135 (295) T ss_pred chh--hheeeeeeeeEEEeeeec--c-ccc--HHHHHhcCCCchhHHHHHHHHHHHHHhhhHHHHHHhcc-Cce------ Confidence 764 34433 4566676543 3 243 444 4 334 5999999999999999999999876521 100 Q ss_pred ccCcccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhc-cCCCCCCEEEEChHHHHHHhcchhhh--hh-hcccc Q lcl|Aclame:pro 154 NIAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKN-YVPAGDRRFYCAPEDYSAILSALMPN--AA-NYAAL 229 (347) Q Consensus 154 ~~~g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~-~VP~~gR~~vv~P~~~~~Ll~~~~~~--~~-~~~~~ 229 (347) +. +.... +.-++.+..+-..+.|. ++ ..+++|+|...+.||++.... .+ .| |. T Consensus 136 -----------t~-------tg~~l-q~a~a~~~~al~~f~Ee~~~---~~V~FVnP~D~a~yl~~A~~~~~~a~~f-G~ 192 (295) T protein:vir:99 136 -----------KV-------KGVGL-QKALSASWAKLATFNEFEGS---PLVSFVSPLDVANYLGDTKVGADASNVF-GM 192 (295) T ss_pred -----------ee-------ehhhH-HHHHHHhhhhhhhcccccCC---ceEEEEehHHHHHHHhccccccchhhhh-hh Confidence 00 00011 11133344444444443 33 368999999999999986543 11 13 11 Q ss_pred ccccccceEEEecee-EEEecccccccccccccc---CccccccccccccccccccccccccceeEEeechhhhhhhhhh Q lcl|Aclame:pro 230 IDPETGNIRNVMGFE-VIEVPHLTVGGAGDNNPA---DGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLK 305 (347) Q Consensus 230 ~~~~~G~v~~i~G~~-V~~sn~lp~~~~~~~~~~---~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~ 305 (347) . + +.++.|++ |+.|+.+|.+....+..- ....+..+..+..+ .....|.++.+|+.- T Consensus 193 ~-~----L~nfLG~q~II~S~kv~~G~~~aT~~~Ni~~ay~~~~~g~l~~~---f~~~~D~tglIg~~h----------- 253 (295) T protein:vir:99 193 T-L----LKNFLGMQNVIVMPSVPEGKIYSTAVENLVFASLNVKGGDLGGL---FADFTDETGLIAAAR----------- 253 (295) T ss_pred h-h----hhhhhccceEEEcccCCCceEEEeeccceEEEEecCCchhhhhh---hhhccCcccceEEEe----------- Confidence 1 1 22489997 999999998764432211 00111111101000 011123333333221 Q ss_pred heeeccccchhhHhhHHhhhhhhcCc--ccccceEEEEEecCCC Q lcl|Aclame:pro 306 DMALERARRPEFQADQIIGKYAMGHG--GLRPEAAGALVFTPAA 347 (347) Q Consensus 306 ~~~~e~~~~~~~~~d~i~~~~~~G~~--~lRPe~~~~l~~~~aa 347 (347) ...++.- -+..+..+|.. +=|+|+++...+...+ T Consensus 254 ----~~~~~~~----t~et~~~~~~~lfpE~~dgiv~~tI~~~~ 289 (295) T protein:vir:99 254 ----NRQLSNL----TYESVFFGANVLFAEIPEGVVEATIEAAA 289 (295) T ss_pred ----cccccee----eehhhhHhHHHhcccccceEEEEEEecCc Confidence 1111111 11122222322 2377888888875555 No 150 >protein:vir:93881 Length: 387 # NCBI annotation: ORF011 # Family: family:all:658 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239938;genbank:gi:66395599;genbank:GeneID:5130947 Probab=98.89 E-value=1.2e-10 Score=74.98 Aligned_cols=274 Identities=15% Similarity=0.093 Sum_probs=144.3 Q ss_pred CCCCccCcccc----ccC-ccc-CccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEecc--ccceeee Q lcl|Aclame:pro 1 MANATGGQQIG----ANQ-GKG-QSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVM--GRTKGYY 72 (347) Q Consensus 1 m~~~~~~~~~~----~~~-~~~-~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~i--G~~t~~~ 72 (347) +.......... .+. +.+ .++|- .+.-+.|..++++.-...+.++++.++.++.+ ..+|++ +..++.- T Consensus 100 ~~~~~~~~~~~~~~~~~al~~~t~s~gG--~~IP~~~~~~Ii~~~~~~~~l~~~~~v~~~~~---~~~p~~~~~~~~a~~ 174 (387) T protein:vir:93 100 LPNEFEKPSMEAQRLLHALPTGNDSGGD--KLLPKTLSKEIVSEPFAKNQLREKARLTNIKG---LEIPRVSYTLDDDDF 174 (387) T ss_pred hhhhhhhhhhhhHHHHHhhccCcCCCCc--eeechhHHHHHHHHHHhhchhhhheeeeecCC---ceEEEEeecCCcccc Confidence 00000000000 000 000 01111 13458889999998888888888888877654 334432 3344555 Q ss_pred ecCCCCCCCCCCCCCCCceEEEEeeeeecc-hhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccc Q lcl|Aclame:pro 73 LAPGENLDDKRKDIKHSEKVIQIDGLLTSD-VLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAAS 151 (347) Q Consensus 73 ~~~g~~~~~~~~~~~~~~~~l~ID~~~~~~-~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~ 151 (347) ...|+..+. .+++.+++++..-++ .. ..|.+-=...+.+|+.+.+.++.++++++..++.++. T Consensus 175 v~E~~~~~~--~~~~f~~v~~~~~k~--~~~~~iS~ell~Ds~~~l~~~i~~~la~~~~~~e~~~~~~------------ 238 (387) T protein:vir:93 175 ITDVETAKE--LKLKGDTVKFTTNKF--KVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALA------------ 238 (387) T ss_pred ccCcccccc--cccccceeeeeheee--eeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhh------------ Confidence 555555543 346667666555444 44 4454322334568899999999999999886665542 Q ss_pred ccccCcccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhcccccc Q lcl|Aclame:pro 152 NENIAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALID 231 (347) Q Consensus 152 ~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~ 231 (347) .+...|.+.+.....+.. ...+...++.|+++...|+..... ...| ++++..|..|++-.+-.++ . T Consensus 239 ~g~g~g~p~g~l~~~~~~------~v~~~~~~d~i~~~~~~l~~~~~~-~a~~-~mn~~t~~~~~~~~~d~~~------~ 304 (387) T protein:vir:93 239 VSPKSGLDHMSFYNGSVK------EVEGADMYDAIINALADLHEDYRD-NATI-YMRYADYVKIISVLSNGTT------N 304 (387) T ss_pred cCCCccccceeeeccccc------cccccchHHHHHHHHhccChhhhc-CCEE-EEechHHHHHHHHHhcCCC------c Confidence 111122222222211111 011222467788887788777653 3345 6777766665542221122 2 Q ss_pred ccccceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeecc Q lcl|Aclame:pro 232 PETGNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALER 311 (347) Q Consensus 232 ~~~G~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~ 311 (347) +..|.-.++.|.+|+.++..|. -+.|||+..... + . .+..+. T Consensus 305 ~~~~~~~~llG~PV~~~~~~~~---------------------------~~~GDf~~~~~~-~--------~--~~~~~~ 346 (387) T protein:vir:93 305 FFDTPAEKVFGKPVVFTDAAVK---------------------------PIVGDFNYFGIN-Y--------D--GTTYDT 346 (387) T ss_pred ccccCCccccccceEEecCCCc---------------------------eeeeehhhhhee-h--------h--hheeee Confidence 3334445799999999775431 022344432111 1 0 111222 Q ss_pred ccchhhHhhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 312 ARRPEFQADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 312 ~~~~~~~~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) ..+...-...+++...||+++++|++.+.+...++| T Consensus 347 ~~~~~~~~~~~~~~~r~d~~v~~~eA~~~l~~k~~~ 382 (387) T protein:vir:93 347 DKDVKKGEYLFVLTAWYDQQRTLDSAFRIAKAKENT 382 (387) T ss_pred cccccCCceeEEEEeeeCceeechhheEEEEeecCC Confidence 222222233456677899999999999988876666 No 151 >protein:vir:105610 Length: 430 # NCBI annotation: virion structural protein # Family: family:all:974 # MgeID: mge:1540 # MgeName: F116 # Cross-refs: genbank:acc:YP_164307;genbank:gi:56692923;genbank:GeneID:3197221 Probab=98.89 E-value=7.4e-10 Score=70.63 Aligned_cols=324 Identities=13% Similarity=0.091 Sum_probs=175.4 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHH----hhhc----------------------ccccccc Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRS----VTMD----------------------KHMVRTI 54 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s----~~~~----------------------~~~~rti 54 (347) |+-+. |..+ .+|+. -+++|+.-+.+.-.+++ .|.+ .+++..+ T Consensus 1 ~~~a~------T~~~----~~~p~--a~~~ws~~l~~~~~k~~~~~~kl~G~~~~~~~~~~~~~~~~ts~~~pI~r~~dL 68 (430) T protein:vir:10 1 MTASK------TTMR----YGDPN--AMIQQAAGLFALCQGRNSTLNRLTGKMPSGTSDAEKKTKGQSSLELPIVQAQDL 68 (430) T ss_pred Cccee------eecc----cCChh--HHHHHHHHHHHHHhhhhhhHHHhhccccccccchhhhccCCCCCCccEEEeccC Confidence 65432 3333 24443 56777777766654432 2222 4555555 Q ss_pred c--CCceEEEeccccceeeeecCCCCCCCCCCCCCCCceEEEEeeeeecchhh-ccHHHHHhCcchHHHHHHHHHHHHHH Q lcl|Aclame:pro 55 Q--NGKSASFPVMGRTKGYYLAPGENLDDKRKDIKHSEKVIQIDGLLTSDVLI-YDIEDAMNHYDVRAEYSAQLGEALAI 131 (347) Q Consensus 55 ~--~G~tv~i~~iG~~t~~~~~~g~~~~~~~~~~~~~~~~l~ID~~~~~~~~V-dd~D~~q~~~D~r~~~~~~~g~aLa~ 131 (347) . .|++|.|+-+...+..-..-++.+.+.-+.++...-.|+|||.. .++.+ ..+++-.+-+|+|++.-..++.=+++ T Consensus 69 ~K~~GD~Vtf~L~~~L~g~gv~Gd~~lEGnee~L~~~~d~l~IDq~R-~~V~~gg~msqQRt~~dlR~~ar~~L~~w~~~ 147 (430) T protein:vir:10 69 GRNKGDEVRFHFVQPANAFPIMGSEYAEGKGTGLKIGSDQLRVNQAR-FPVDLGDVMSQIRNPYDLRRLGRPKAKWFMDA 147 (430) T ss_pred CCCCccEEEEeEeeccccCceecCceeeccccceEEEeeEEEEeeec-cccccCCchhhhhhhhHHHHHHHHHHHHHHHH Confidence 3 59999999888877666666677777667888888999999974 33333 35677788899999999999999999 Q ss_pred HHHHHHHHHHHHhhh----------------cccc-cccccCcccCce--eeeeccccc----------ccchhhHHHHH Q lcl|Aclame:pro 132 AADGAVLAEMAKLCN----------------LPAA-SNENIAGLGQAV--VLNIGAAAD----------LVDVEARGKAI 182 (347) Q Consensus 132 ~~D~~il~~l~~~a~----------------~a~~-~~~~~~g~~~~~--~i~~~~~~~----------~~~~~~~~~~i 182 (347) ..||.+|.+|+.+-. ++.. .+.... +... ....+.+++ .+..+... T Consensus 148 ~~Dq~~~v~laGarg~~~~~~~~~~~~~~~~~~~~~~N~v~a--Pt~nrh~~~~G~at~~~~~~~~~~sl~stD~~s--- 222 (430) T protein:vir:10 148 YLDQSMLVHLAGARGNHYNKEWCLPLETHPKLADMLVNRVKA--PTKNRHFVASADAITGVAPNAGEYNITTADVLD--- 222 (430) T ss_pred HHHHHHHHHHhhhhcccccccccccccCCcchhhhhccccCC--CCCceeEeecccccccccccccccchhhhcccC--- Confidence 999999999874311 0000 000000 1111 111111111 11111111 Q ss_pred HHHHHHHHHHHhhccCC-------CCC-------CEEEEChHHHHHHhcchhhhh----h-hccc---cccccccceEEE Q lcl|Aclame:pro 183 LKGLTLARARLTKNYVP-------AGD-------RRFYCAPEDYSAILSALMPNA----A-NYAA---LIDPETGNIRNV 240 (347) Q Consensus 183 ~~~l~~a~~~Lde~~VP-------~~g-------R~~vv~P~~~~~Ll~~~~~~~----~-~~~~---~~~~~~G~v~~i 240 (347) ++.|.++...++..+.| .+. +++++.|.+|..|..++.+.. + .+.+ ...+-.|.++.+ T Consensus 223 ~~~id~a~~~a~~~~~~i~Pv~v~gd~~~g~~~~yV~~~~p~q~~~Lr~dt~~~~wq~~~~a~a~~g~~nPlF~G~~gm~ 302 (430) T protein:vir:10 223 VDVVDSIATYMDQIELPPPPVKFEGDEAAEDSPIRVLLCSPAQYNSFAKQEKFRSWQAAALARASNAKQHPIFRVDAGLW 302 (430) T ss_pred HHHHHHHHHHHHhhCCCCcceEeecccccCCccEEEEEechHHHHHHhhCcchHHHHHHHHHhhcccccCCceecceeee Confidence 44556677777776543 222 677899999999999987742 1 1122 234668999999 Q ss_pred eceeEEEecccccccccc--ccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhhe-------eecc Q lcl|Aclame:pro 241 MGFEVIEVPHLTVGGAGD--NNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDM-------ALER 311 (347) Q Consensus 241 ~G~~V~~sn~lp~~~~~~--~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~-------~~e~ 311 (347) .|+-|++..+.-....+. ...+.....+......++.....+.+ ..+|++-..|++.+..+.. =.|. T Consensus 303 ngvii~~~~~virf~~g~~~~~~a~~~~~~~~~~~~~a~~~~~~~v----~RalllGaQA~~~A~g~~~~~g~~f~w~Ee 378 (430) T protein:vir:10 303 SNTLIIKMPKPIRFYAGDTIKYCAAYNSEAESSAVVSDSFGNQYAV----DRALLLGGQALAQAWAASEHSGMPFFWSEK 378 (430) T ss_pred cCeEEecCCceeeecCCCccccccCCcccccccccccccccccccc----hhhhhccchhheeeeeccCCCCcceeeeee Confidence 999999987542211110 00000000111111122222222222 2334444444433333221 1344 Q ss_pred ccchhhHhhHHhhhhhhcCccccc----------ceEEEEEecCCC Q lcl|Aclame:pro 312 ARRPEFQADQIIGKYAMGHGGLRP----------EAAGALVFTPAA 347 (347) Q Consensus 312 ~~~~~~~~d~i~~~~~~G~~~lRP----------e~~~~l~~~~aa 347 (347) .+|-.++- -|.....+|.+=.|- +=-|+|..-.+| T Consensus 379 ~~D~g~~~-~i~~~~i~G~kK~rF~~~~~~~~~~~DfGvi~idtaa 423 (430) T protein:vir:10 379 DMDHGDKL-ELLIGAILGCSKIRFAVEATNGLEYTDHGVMAIDTAV 423 (430) T ss_pred ccccCchh-hhhhhHHhccceeeecCCCCCCceeeeeEEEEhhhhh Confidence 44444332 344455556543333 234555554444 No 152 >protein:vir:8420 Length: 477 # NCBI annotation: gp15 # Family: family:all:21 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818316;genbank:gi:29566752;genbank:GeneID:1260033 Probab=98.88 E-value=3.4e-10 Score=72.46 Aligned_cols=302 Identities=10% Similarity=0.024 Sum_probs=144.8 Q ss_pred CCCCccC-----ccc-cccCcccCccccHHHHHHHH-HhHHHHHHHHHHHhhhcccccccccC-CceEEEecccccee-e Q lcl|Aclame:pro 1 MANATGG-----QQI-GANQGKGQSAADKLALFLKV-FGGEVLTAFVRRSVTMDKHMVRTIQN-GKSASFPVMGRTKG-Y 71 (347) Q Consensus 1 m~~~~~~-----~~~-~~~~~~~~~~~d~~al~ie~-f~geV~~~f~~~s~~~~~~~~rti~~-G~tv~i~~iG~~t~-~ 71 (347) ....... ... ..|.......+.- .+.+.. ..+++.+..+..+.++++++.+++.+ +.++.||++..... . T Consensus 137 ~~~~~~~~~~~~~~~~~~~~~~~~~~~gg-~lv~~~~~~~~ii~~l~~~~~i~~~~~~~~~~~~~~~~~ip~~~~~~~~a 215 (477) T protein:vir:84 137 DVESDKEIRKIAKVGEEYRDLDRNGGTGG-YAVPPLWMMNRFIELARAGRTYANLCPTEPLPGGTSSINIPKILTGTSTA 215 (477) T ss_pred hhhhhhhHHHHHHhhhhhccccccCCCcc-eeeccchhHHHHHHHhhhcchHHHhhceeeecCCcceeEEEEEecCccee Confidence 0000000 000 0000000000001 134444 46888888888888888888877764 45789998633322 2 Q ss_pred e-ecCCCCCCCC---CCCCCCCceEEEEeeeeecch-hhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhh Q lcl|Aclame:pro 72 Y-LAPGENLDDK---RKDIKHSEKVIQIDGLLTSDV-LIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCN 146 (347) Q Consensus 72 ~-~~~g~~~~~~---~~~~~~~~~~l~ID~~~~~~~-~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~ 146 (347) + ...|..+... ..++....+++...+ +..+ .|.+-=-.++.+|+.+.+.++.+++|+++.|+.+|. +.. T Consensus 216 ~~~~Eg~~~~~~~~~~s~~~f~~i~~~~~k--~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~~l~----G~G 289 (477) T protein:vir:84 216 IQAADNAALTAPSAHEVDLTDGFVQANVKT--IAGQQGIAIQLLDQAAVSVDEFVFRDLAADYANKLNVQVIS----GTG 289 (477) T ss_pred eeeccCcccccccccccccceeeEEEeeee--EEeeeHHHHHHHhccchhHHHHHHHHHHHHHHHHHHHHHhc----cCC Confidence 2 2223322211 112334444444444 3333 343322344578999999999999999999998763 211 Q ss_pred cccccccccCcccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhc Q lcl|Aclame:pro 147 LPAASNENIAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANY 226 (347) Q Consensus 147 ~a~~~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~ 226 (347) . ++.+.|.-....+...+.+...........+++.|+++...++.... .....+++.|..|..|.+-.+- +..| T Consensus 290 t----~~~p~Gi~~~~~~~~~~~~~~~~t~~~~~~~~~~i~~~~~~~~~~~~-~~~~~~v~~~~~~~~l~~lkd~-~G~~ 363 (477) T protein:vir:84 290 S----NNQVVGVRATAGITQVTATSAGSALEKHQIIYQKIADAIQRVHTSRF-LEPEVIVMHPRRWASFHAIFAG-DDRP 363 (477) T ss_pred C----CCccceeeeccccccccccccccchhhHHHHHHHHHHHHhhcccccc-CCccEEEEcHHHHHHHHHhhcc-CCCe Confidence 1 01111111100000000111111111223356667776665555433 2234668899999888653321 2222 Q ss_pred cc-------------cccccccceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEe Q lcl|Aclame:pro 227 AA-------------LIDPETGNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLF 293 (347) Q Consensus 227 ~~-------------~~~~~~G~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~ 293 (347) .. .+.+..|..++++|++|+.|+.+|...... .....-+.+||+.. ++ T Consensus 364 l~~~~~~~~~~~~~~~~~~~~~~~~~l~G~pVv~s~~~p~~~~~~-----------------~d~~~i~~gd~~~~--~i 424 (477) T protein:vir:84 364 LIVPSGPGFNNLGVLTEVASQRVVGQMHGLPVVTDPTLPTTLGTG-----------------TDQDVIHVLRASDL--AL 424 (477) T ss_pred eeecCcccccccccccccccccccchhcccceEecCccccccccc-----------------CCcceEEEEEeceE--EE Confidence 11 122445666789999999999999521000 00111234566543 22 Q ss_pred echhhhhhhhhhheeeccccchhhHhhH------HhhhhhhcCcccc-cceEEEEEecCCC Q lcl|Aclame:pro 294 NHRSAVGTVKLKDMALERARRPEFQADQ------IIGKYAMGHGGLR-PEAAGALVFTPAA 347 (347) Q Consensus 294 ~h~~A~~tv~~~~~~~e~~~~~~~~~d~------i~~~~~~G~~~lR-Pe~~~~l~~~~aa 347 (347) +. + .+.++. ++..+.+. +.+++.+ +.+| |++.+.+..++++ T Consensus 425 ~~-~--------~~~~~~--~~~~~~~~~~~~~~v~~~~~~--~~~r~~~afv~~t~~~~~ 472 (477) T protein:vir:84 425 FE-S--------SVRMRA--LQETRAENLSVLLQVYGYLAF--TAARFPQSVVEIGGTALT 472 (477) T ss_pred Ee-e--------ceeEEe--ccccccccceeeeeehhhhhh--hhhccccceEEeeccccc Confidence 21 1 122222 22222222 2233333 5666 9999999866665 No 153 >protein:vir:78640 Length: 352 # NCBI annotation: phage capsid # Family: family:all:658 # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429943;genbank:gi:156603997;genbank:GeneID:5525386 Probab=98.84 E-value=1.8e-10 Score=73.95 Aligned_cols=273 Identities=15% Similarity=0.077 Sum_probs=146.3 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEecc--ccceeeeecCCCC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVM--GRTKGYYLAPGEN 78 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~i--G~~t~~~~~~g~~ 78 (347) +..........+ .| ....|. .|--+.+..++++.-+..+.++.+.++.++.+ .++|++ +..++.....|.. T Consensus 73 ~~~~~~~~~al~-~~-~~~~gG--~lIP~~~~~~Ii~~l~~~s~l~~~~~v~~~~~---~~~p~~~~~~~~a~~v~E~~~ 145 (352) T protein:vir:78 73 SMEAQRLLHALP-TG-NDSGGD--KLLPKTLSKEIVSEPFAKNQLREKARLTNIKG---LEIPRVSYTLDDDDFITDVET 145 (352) T ss_pred HhhHHHHHHHhc-cC-CCCCCc--eeccHhHHHHHHHHHHhhcchhhheeeEecCC---ceEEEEecCCCcccccccccc Confidence 000000000000 00 001111 14448899999999989999999988876543 234432 2234444444555 Q ss_pred CCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccCcc Q lcl|Aclame:pro 79 LDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIAGL 158 (347) Q Consensus 79 ~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~g~ 158 (347) ++.. +++.+++++.+.++ ..-+.|.+-=-..+.+|+.+.+.++.++++++..++.++.. +...|. T Consensus 146 ~~~~--~~~f~~v~~~~~k~-~~~i~is~ell~Ds~~~l~~~i~~~la~~~~~~e~~~~~~~------------g~g~~~ 210 (352) T protein:vir:78 146 AKEL--KLKGDTVKFTTNKF-KVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAV------------SPKSGL 210 (352) T ss_pred cccc--cccceeeeecceeE-EeechhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhhhhc------------CCCCcc Confidence 5443 46777777776655 23345544333345789999999999999987645544321 111111 Q ss_pred cCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhccccccccccceE Q lcl|Aclame:pro 159 GQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALIDPETGNIR 238 (347) Q Consensus 159 ~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~v~ 238 (347) +.+.....+.. . ..+...+|.|+++...|+..... +-.+++++..|..|++-.+-.++. +-.|.-. T Consensus 211 ~~g~l~~~~~~--~----~t~~~~~d~i~~~~~~l~~~~~~--~a~~~mn~~t~~~l~~~~~~~~~~------~~~~~~~ 276 (352) T protein:vir:78 211 EHMSFYNGSVK--E----VEGANMYDAIINALADLHEDYRD--NATIYMRYADYVKIISVLSNGTTN------FFDTPAE 276 (352) T ss_pred cccceeccccc--c----ccccchHHHHHHHHhccChhhhc--CCEEEEehHHHHHHHHHHhccCCc------ccccCCc Confidence 11211111110 0 11122367788887777766542 334577888888877643222222 2234445 Q ss_pred EEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeeccccchhhH Q lcl|Aclame:pro 239 NVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARRPEFQ 318 (347) Q Consensus 239 ~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~~~~~~ 318 (347) +++|.+|+.++..+.. +-|||+..... ...+..+...+...- T Consensus 277 ~llG~PV~~~~~~~~~---------------------------~~Gdf~~~~~~-----------~~~~~~~~~~~~~~g 318 (352) T protein:vir:78 277 KVFGKPVVFTDAAVKP---------------------------IVGDFNYFGIN-----------YDGTTYDTDKDVKKG 318 (352) T ss_pred cccccceEEecCCCce---------------------------eEeehhhhhhh-----------hhhheeeeeccccCC Confidence 7899999988754310 22344332100 011223333332222 Q ss_pred hhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 319 ADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 319 ~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) .-.+++.+.++++++|||+.+.+..+++| T Consensus 319 ~~~f~~~~r~Dg~~~~~eA~~~l~~~a~~ 347 (352) T protein:vir:78 319 EYLFVLTAWYDQQRTLDSAFRIAKAKEST 347 (352) T ss_pred eeEEEEEeeeCceeechhheEEEEeeccc Confidence 23455678899999999999999888888 No 154 >protein:vir:93696 Length: 364 # NCBI annotation: Bcep22gp55 # Family: family:all:974 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944284;genbank:gi:38640361;genbank:GeneID:2658350 Probab=98.84 E-value=9.9e-10 Score=69.95 Aligned_cols=302 Identities=11% Similarity=0.043 Sum_probs=167.2 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhc-cc---------cccccc--CCceEEEeccccc Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMD-KH---------MVRTIQ--NGKSASFPVMGRT 68 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~-~~---------~~rti~--~G~tv~i~~iG~~ 68 (347) ||-.+ .+ .+|+. -.++|+..+...-.+.|-|.. ++ +...+. .|++|.|.-+... T Consensus 1 Ma~T~--------~~----~~~p~--a~~~ws~~l~~~~~~~s~f~~~l~G~~~~~~I~~~~dL~k~~Gd~v~f~L~~~L 66 (364) T protein:vir:93 1 MSQTV--------IP----FGDPK--AVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDRITFDLSVHL 66 (364) T ss_pred Cceec--------cC----cCCHH--HHHHHHHHHHHHHHhhCccccccccCCCCCcEEEeeecCCCCCceEEeeeeeec Confidence 88622 22 35665 469999999888878764444 21 121232 4999999988888 Q ss_pred eeeeecCCCCCCCCCCCCCCCceEEEEeeeeecchhh---ccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|Aclame:pro 69 KGYYLAPGENLDDKRKDIKHSEKVIQIDGLLTSDVLI---YDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLC 145 (347) Q Consensus 69 t~~~~~~g~~~~~~~~~~~~~~~~l~ID~~~~~~~~V---dd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a 145 (347) +.....-++.+.+.-+.++....+|+||+.. +.| ..+++-.+-+|+|.+.-..++.-+++..|+.++..++.+ T Consensus 67 ~g~gv~Gd~~leGnee~L~~~~~~i~idq~r---~~V~~~g~ms~qRt~~dlr~~ar~~L~~w~~~~~d~~~f~~laGa- 142 (364) T protein:vir:93 67 RGKPTYGDARVEGKEESLRFYQDEVRIDQVR---HSVSAGGRMSRKRTVHNIRRIARDRLGDYFYKFTDELLFIYLSGA- 142 (364) T ss_pred ccCCcccCceeeccccceeEEeeEEEEeecc---ccccccCchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc- Confidence 7666666677777777888888999999974 344 468888899999999999999999999999998888642 Q ss_pred hccccc-----------ccccCcccCceeeeeccccccc---chhhHHHHHHHHHHHHHHHHhhccCC------------ Q lcl|Aclame:pro 146 NLPAAS-----------NENIAGLGQAVVLNIGAAADLV---DVEARGKAILKGLTLARARLTKNYVP------------ 199 (347) Q Consensus 146 ~~a~~~-----------~~~~~g~~~~~~i~~~~~~~~~---~~~~~~~~i~~~l~~a~~~Lde~~VP------------ 199 (347) +-.... .+.+..-.....+-.+.++... ..+.. -++.|..+..+++....+ T Consensus 143 rg~~~~~~~~~~~~~~~~N~v~aPt~~r~~~~~~at~~~~l~stD~~---sl~~id~a~~~a~~~~~~~~~~~~~~Pv~~ 219 (364) T protein:vir:93 143 RGINLDFIETPDFTGYAGNPLDAPDVDHLLYGGVATSKASLAATDIM---APLVIEKAVEKAAMMQAENPDVANMVPVSI 219 (364) T ss_pred cccccccccccCcccccccccCCCCCCcEEeccccCchhhccccccc---cHHHHHHHHHHHHHhCCCCCCCcccceeEe Confidence 111000 0000000011111111111111 11111 145566666666654321 Q ss_pred -CCCC-EEEEChHHHHHHhcch--hhhhhh---cc--c-cccccccceEEEeceeEEEeccccccccccccccCcccccc Q lcl|Aclame:pro 200 -AGDR-RFYCAPEDYSAILSAL--MPNAAN---YA--A-LIDPETGNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTN 269 (347) Q Consensus 200 -~~gR-~~vv~P~~~~~Ll~~~--~~~~~~---~~--~-~~~~~~G~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~ 269 (347) .++. ++++.|.++..|..+. ++.+-. .. + ...+-.|.++.+.|+-|++.++++....... + ++.+ T Consensus 220 ~g~~~yV~~l~p~q~~~Lr~~t~~~w~d~qk~A~~~~g~~nPlF~G~~gm~ngvii~~~~~vi~~~~~~~----~-~~v~ 294 (364) T protein:vir:93 220 DGDDHYVCVMSEYQATDMRTAAGGTWIDFQKAAAAAEGRNNPIFKGGLGMINNVVLHKHRNVIRFNDYGA----G-ANVE 294 (364) T ss_pred cCcceeEEEEcchhhhhhhhcCCHHHHHHHHHhhhcccccCCceecCeeeEcCeEEeccCCccccccccc----C-cccc Confidence 1123 6678999999998543 332211 11 1 2336689999999999999999986532211 0 0000 Q ss_pred ccccccccccccccccccceeEEeechhhhhhh--hh---hheeeccccchhhHhhHHhhhhhhcCcccc--cceEEEEE Q lcl|Aclame:pro 270 QKHIFPATATGDDRVAQNNVVGLFNHRSAVGTV--KL---KDMALERARRPEFQADQIIGKYAMGHGGLR--PEAAGALV 342 (347) Q Consensus 270 ~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv--~~---~~~~~e~~~~~~~~~d~i~~~~~~G~~~lR--Pe~~~~l~ 342 (347) + ..+|++-..|++.+ +. ...-.|..+|..++- -|.....+|-+=.| .+=-|+|. T Consensus 295 --------------~----~ralllGaQA~~~a~g~~~g~~~~w~Ee~~D~gn~~-~i~~~~i~G~kK~rF~~~DfGvi~ 355 (364) T protein:vir:93 295 --------------A----ARALFMGRQAGVIAYGTANGLRFDWEETVKDYGNEP-AIAAGFIAGMKKARFNNKDFGVIS 355 (364) T ss_pred --------------c----hhhheecceeeEEEeecCCCCCceeeecccCCCCch-hhhhhhHhhhhhcccCCccceEEE Confidence 0 01122222222211 11 111244444443332 23444444543332 22334444 Q ss_pred e--cCCC Q lcl|Aclame:pro 343 F--TPAA 347 (347) Q Consensus 343 ~--~~aa 347 (347) . ++++ T Consensus 356 idtaa~~ 362 (364) T protein:vir:93 356 IDTAAKK 362 (364) T ss_pred ecccccc Confidence 3 3333 No 155 >protein:vir:9361 Length: 402 # NCBI annotation: SLT orf 37-like protein # Family: family:all:658 # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803339;genbank:gi:29028650;genbank:GeneID:1258088 Probab=98.82 E-value=3.6e-10 Score=72.33 Aligned_cols=273 Identities=14% Similarity=0.052 Sum_probs=146.3 Q ss_pred CC---------CCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEecc--ccce Q lcl|Aclame:pro 1 MA---------NATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVM--GRTK 69 (347) Q Consensus 1 m~---------~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~i--G~~t 69 (347) |. +........ ..+.. ++| -.+..+.|..++++..+..+.++++.+++++.+ .++|++ +..+ T Consensus 114 ~~~~~~~~~~~~~~~~~~a~-~~~t~-~~G--G~lIP~~~~~~Ii~~~~~~~~l~~~~~v~~~~~---~~~p~~~~~~~~ 186 (402) T protein:vir:93 114 ILPNEFEKPSMEAQRLLHAL-PTGND-SGG--DKLLPKTLSKEIVSEPFAKNQLREKARLTNIKG---LEIPRVSYTLDD 186 (402) T ss_pred HhhhhHHHHHHhHHHHHhhh-ccCCC-cCC--ccccchhHHHHHHHhHHhhhhhhhhceeeecCC---ceeeeeeccCCc Confidence 00 000000000 00000 111 124458899999999999899999988877653 345543 3334 Q ss_pred eeeecCCCCCCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccc Q lcl|Aclame:pro 70 GYYLAPGENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPA 149 (347) Q Consensus 70 ~~~~~~g~~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~ 149 (347) +.-+..|...+.+ +++.+++++.+.++ +.-..|.+-=...+.+|+.+.+.++.++++++..++.++.. T Consensus 187 a~~v~Eg~~~~~~--~~~f~~i~~~~~k~-~~~i~iS~ell~Ds~~~l~~~i~~~la~~~~~~e~~~~~~~--------- 254 (402) T protein:vir:93 187 DDFITDVETAKEL--KAKGDTVKFTTNKF-KVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAV--------- 254 (402) T ss_pred ccccccccccccc--ccccceeeecceee-eeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhc--------- Confidence 4445555555432 46667766666554 33344543222335788999999999999998766655421 Q ss_pred ccccccCcccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhcccc Q lcl|Aclame:pro 150 ASNENIAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAAL 229 (347) Q Consensus 150 ~~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~ 229 (347) +...|.+.+.....+.. . ..+...++.|+++...|+..... ...| ++.+..|..|+.-.+-.+ T Consensus 255 ---g~g~g~p~g~~~~~~~~--~----~~~~~~~d~l~~~~~~l~~~y~~-na~~-imn~~t~~~~~~~~~d~~------ 317 (402) T protein:vir:93 255 ---SPKSGLEHMSFYNGSVK--E----VEGADMYDAIINALADLHEDYRD-NATI-YMRYADYVKIISVLSNGT------ 317 (402) T ss_pred ---CCCccccceeeeccccc--c----ccccchHHHHHHHHhccChhhhc-CCEE-EEechHHHHHHHHHhcCC------ Confidence 11122222222211111 1 11222467788888788776653 3455 666666655554222112 Q ss_pred ccccccceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheee Q lcl|Aclame:pro 230 IDPETGNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMAL 309 (347) Q Consensus 230 ~~~~~G~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~ 309 (347) ..+..|.-.++.|.+|+.++..|.- +.|||+..... + ..+.. T Consensus 318 ~~~~~~~~~~llG~PV~~t~~~~~i---------------------------~~GDf~~~~~~-~----------~~~~~ 359 (402) T protein:vir:93 318 TNFFDTPAEKVFGKPVVFTDAAVKP---------------------------IVGDFNYFGIN-Y----------DGTTY 359 (402) T ss_pred CcccccCCccccccceEEecCCCce---------------------------eeechhhhhhh-h----------hhhhh Confidence 2233344457899999998755410 22344432111 1 01122 Q ss_pred ccccchhhHhhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 310 ERARRPEFQADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 310 e~~~~~~~~~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) +...+...-.-.+++...+++++++|++...|...+++ T Consensus 360 ~~~~~~~~~~~~~~~~~r~Dg~v~~~~A~~~l~ik~~~ 397 (402) T protein:vir:93 360 DTDKDVKKGEYLFVLTAWYDQQRTLDSAFRIAKAKENT 397 (402) T ss_pred hhhhcccCCceEEEEEEEeCcEEechhheEEEEeecCC Confidence 22333222233456777899999999999999986666 No 156 >protein:vir:93616 Length: 645 # NCBI annotation: putative major head protein/prohead protease # Family: family:all:21 # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449293;genbank:gi:157166041;goa:Q6H9U8;interpro:IPR006433;uniprot:Q6H9U8;genbank:GeneID:5580438 Probab=98.81 E-value=1.1e-09 Score=69.65 Aligned_cols=301 Identities=8% Similarity=-0.044 Sum_probs=146.0 Q ss_pred CCCCcc--------CccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhccccc--ccccC-CceEEEec-cccc Q lcl|Aclame:pro 1 MANATG--------GQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMV--RTIQN-GKSASFPV-MGRT 68 (347) Q Consensus 1 m~~~~~--------~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~--rti~~-G~tv~i~~-iG~~ 68 (347) ..+... .....+.++ .+|.. +.-+.|.+++.+..+..++++.+... ....+ --.+.||+ .+.+ T Consensus 321 ~~~~~~~~~~~~a~~~~~~~~~~---~~Gg~--~vp~~~~~~ii~~l~~~svv~~l~~~~~~~~~~~~~~~~ip~~t~~~ 395 (645) T protein:vir:93 321 PDDSRLHHVLKSAVGAGTTTDPQ---WAGSL--SEYQEYAQDFIDYLRPQTIIGRFGQGGIPALRQVPFNIRVHAQVSGG 395 (645) T ss_pred ccchhhhhhhhhhhhcccccccc---ccCCc--cCchhhHHHHHHhhhhhhhHHhhccccccccccccCceeeeeeecCc Confidence 000000 000001111 11111 33478889998888888887766432 22221 12467775 4556 Q ss_pred eeeeecCCCCCCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcc Q lcl|Aclame:pro 69 KGYYLAPGENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLP 148 (347) Q Consensus 69 t~~~~~~g~~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a 148 (347) ++.....|+.++.+ +++.+++++..-++ +.-..|.+-=-.++.+|+.+.+.++.+++|+++.|+.+|.-- . T Consensus 396 ~a~wv~Eg~~~~~s--~~~f~~v~l~~~kl-a~~~~iS~ell~ds~~~~~~~i~~~l~~aia~~~d~a~l~g~--g---- 466 (645) T protein:vir:93 396 AAGWVGEGKTKPLT--KFDFESITFSHAKV-SAIAVLTEELIRFSSPAADALVRNALAEAVVARLDTDFVDPK--K---- 466 (645) T ss_pred ceEEeccCcccccc--ccceeEEEEeeEEE-EEeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHhhcCC--C---- Confidence 66666667766543 45677666655432 333334332223567889999999999999999999887310 0 Q ss_pred cccccccCcccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhccc Q lcl|Aclame:pro 149 AASNENIAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAA 228 (347) Q Consensus 149 ~~~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~ 228 (347) .+.....+.+. ..+..+..... ...+.+..+...|..+++...+-++|++|..+..|.+-.. .+..+.- T Consensus 467 ---~~~~~~~p~gi--~~~~~~~~~~~-----~~~~d~~~~~~~~~~a~~~~~~a~~vmn~~~~~~L~~lkd-~~G~~~~ 535 (645) T protein:vir:93 467 ---AAVADVSPASI--THDVKGTASSG-----NPDADAEAAFGQFVAANLQPTGAVWLMSSTNALALSMRKN-ALGQKEY 535 (645) T ss_pred ---cccCCccccce--ecccccccccc-----chHHHHHHHHHHHHhcCCCccccEEEEcHHHHHHHHhccc-cCCceee Confidence 00000111111 11111110000 1134466677778888876666677899999999876432 2222211 Q ss_pred cccccccceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhh-- Q lcl|Aclame:pro 229 LIDPETGNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKD-- 306 (347) Q Consensus 229 ~~~~~~G~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~-- 306 (347) ......| ++++|++|+.|+++|... ..+ ..+.-|-+++.. +-+.+.+.|-......+ T Consensus 536 ~~~~~~~--~tL~G~PV~~s~~vp~~~----~~g--------------d~s~~~ig~~~~-v~i~~s~~a~~~~~~~~~~ 594 (645) T protein:vir:93 536 PDMTLLG--GSFQGLPVIVSQYVGDQL----VLV--------------NAPDIYLADDGG-VAVDMSREASLEMQSEPTG 594 (645) T ss_pred cCCCCCC--ceeeceeeEEeccCCcce----eEe--------------ccccEEEEEecc-eEEEeecceeEEEeecccc Confidence 1111122 479999999999998310 000 001111111111 11111111110000000 Q ss_pred -eeecc--ccchhhHh--hHHhhhhhhcCcccccceEEEEEecC--CC Q lcl|Aclame:pro 307 -MALER--ARRPEFQA--DQIIGKYAMGHGGLRPEAAGALVFTP--AA 347 (347) Q Consensus 307 -~~~e~--~~~~~~~~--d~i~~~~~~G~~~lRPe~~~~l~~~~--aa 347 (347) ..... .--..++- -.|+....++-+++||++++.|.-.. +| T Consensus 595 ~~~~~~~~~~v~lf~~d~vaira~~r~d~~~~~p~a~~~lt~~~~g~~ 642 (645) T protein:vir:93 595 DSTTPSPVELVSMFQTGSVAIRAERWINWRRRRTAAVAVITGVNYGSA 642 (645) T ss_pred cccccccccchhHhhcCceEEEEEEEEcceeeCccceEEEecccCCcc Confidence 00000 00001222 34677788899999999998886211 12 No 157 >protein:vir:94424 Length: 387 # NCBI annotation: ORF010 # Family: family:all:658 # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240005;genbank:gi:66395666;genbank:GeneID:5133084 Probab=98.77 E-value=5.4e-10 Score=71.40 Aligned_cols=273 Identities=14% Similarity=0.055 Sum_probs=144.7 Q ss_pred CCCCcc---------CccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEecc--ccce Q lcl|Aclame:pro 1 MANATG---------GQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVM--GRTK 69 (347) Q Consensus 1 m~~~~~---------~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~i--G~~t 69 (347) |..... ..... ..|.. ++|. .+..+.|..++++..+..+.++++.+++++.+ .++|++ +..+ T Consensus 99 ~~~~~~~~~~~~~~~~~~a~-~~~~~-~~gG--~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~---~~~p~~~~~~~~ 171 (387) T protein:vir:94 99 ILPNEFEKPSMEAQRLLHAL-PTGND-SGGD--KLLPKTLSKEIVSEPFAKNQLREKARLTNIKG---LEIPRVSYTLDD 171 (387) T ss_pred HhhhhHHHHHHHHHHHHhhh-ccCCC-CCCc--eeechhHHHHHHHHHHhhchhhhhceeeecCC---ceeeeeeccCCc Confidence 100000 00000 00000 1111 24458899999999999898999888877654 334432 2234 Q ss_pred eeeecCCCCCCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccc Q lcl|Aclame:pro 70 GYYLAPGENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPA 149 (347) Q Consensus 70 ~~~~~~g~~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~ 149 (347) +.-...|...+.+ +++.+++++...++ ..-..|.+-=...+.+|+.+.+.++.++++++..++.++.. T Consensus 172 a~~v~Eg~~~~~~--~~~f~~v~l~~~k~-~~~i~iS~ell~ds~~~l~~~i~~~la~~~~~~e~~~~~~~--------- 239 (387) T protein:vir:94 172 DDFITDVETAKEL--KAKGDTVKFTTNKF-KVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAV--------- 239 (387) T ss_pred ccccccccccccc--ccccceeeechhee-eeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhc--------- Confidence 4444555555432 46677766666554 23334443222235688999999999999998766655421 Q ss_pred ccccccCcccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhcccc Q lcl|Aclame:pro 150 ASNENIAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAAL 229 (347) Q Consensus 150 ~~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~ 229 (347) +...|.+.+.....+... ..+...++.|+++...|+....+ ...| ++.+..|..|+.-.+-.+ T Consensus 240 ---g~g~g~~~g~~~~~~~~~------~~~~~~~d~i~~~~~~l~~~y~~-na~~-imn~~t~~~~~~~~~~~~------ 302 (387) T protein:vir:94 240 ---SPKSGLEHMSFYNGSVKE------VEGADMYDAIINALADLHEDYRD-NATI-YMRYADYVKIISVLSNGT------ 302 (387) T ss_pred ---CCCccccceeeecccccc------ccccchHHHHHHHHhccChhhhc-CCEE-EEechHHHHHHHHHhcCC------ Confidence 111122222222111111 11122367788887778776553 3455 566766666654322112 Q ss_pred ccccccceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheee Q lcl|Aclame:pro 230 IDPETGNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMAL 309 (347) Q Consensus 230 ~~~~~G~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~ 309 (347) ..+..|.-.++.|.+|+.++..|.. +-|||+.... . . ..+.. T Consensus 303 ~~~~~~~~~~llG~PV~~~~~~~~~---------------------------~~GDf~~~~~-~--------~--~~~~~ 344 (387) T protein:vir:94 303 TNFFDTPAEKVFGKPVVFTDAAVKP---------------------------IVGDFNYFGI-N--------Y--DGTTY 344 (387) T ss_pred CcccccCCccccccceEEecCCCce---------------------------eeechhhhhh-h--------h--hhhhh Confidence 2233444457899999998754321 1234433211 0 0 11112 Q ss_pred ccccchhhHhhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 310 ERARRPEFQADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 310 e~~~~~~~~~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) +.+.+...-.-.+++.+.|++++++|++.+.+...+++ T Consensus 345 ~~~~~~~~~~~~~~~~~r~Dg~v~~~~A~~~l~~ka~~ 382 (387) T protein:vir:94 345 DTDKDVKKGEYLFVLTAWYDQQRTLDSAFRIAKAKENT 382 (387) T ss_pred eecccccCCceEEEEEEEeCcEeechhheEEEEeecCC Confidence 22222221123455667899999999999999987777 No 158 >protein:vir:96978 Length: 387 # NCBI annotation: ORF009 # Family: family:all:658 # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239859;genbank:gi:66395517;genbank:GeneID:5133011 Probab=98.77 E-value=5.4e-10 Score=71.40 Aligned_cols=273 Identities=14% Similarity=0.055 Sum_probs=144.7 Q ss_pred CCCCcc---------CccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEecc--ccce Q lcl|Aclame:pro 1 MANATG---------GQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVM--GRTK 69 (347) Q Consensus 1 m~~~~~---------~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~i--G~~t 69 (347) |..... ..... ..|.. ++|. .+..+.|..++++..+..+.++++.+++++.+ .++|++ +..+ T Consensus 99 ~~~~~~~~~~~~~~~~~~a~-~~~~~-~~gG--~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~---~~~p~~~~~~~~ 171 (387) T protein:vir:96 99 ILPNEFEKPSMEAQRLLHAL-PTGND-SGGD--KLLPKTLSKEIVSEPFAKNQLREKARLTNIKG---LEIPRVSYTLDD 171 (387) T ss_pred HhhhhHHHHHHHHHHHHhhh-ccCCC-CCCc--eeechhHHHHHHHHHHhhchhhhhceeeecCC---ceeeeeeccCCc Confidence 100000 00000 00000 1111 24458899999999999898999888877654 334432 2234 Q ss_pred eeeecCCCCCCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccc Q lcl|Aclame:pro 70 GYYLAPGENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPA 149 (347) Q Consensus 70 ~~~~~~g~~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~ 149 (347) +.-...|...+.+ +++.+++++...++ ..-..|.+-=...+.+|+.+.+.++.++++++..++.++.. T Consensus 172 a~~v~Eg~~~~~~--~~~f~~v~l~~~k~-~~~i~iS~ell~ds~~~l~~~i~~~la~~~~~~e~~~~~~~--------- 239 (387) T protein:vir:96 172 DDFITDVETAKEL--KAKGDTVKFTTNKF-KVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAV--------- 239 (387) T ss_pred ccccccccccccc--ccccceeeechhee-eeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhc--------- Confidence 4444555555432 46677766666554 23334443222235688999999999999998766655421 Q ss_pred ccccccCcccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhcccc Q lcl|Aclame:pro 150 ASNENIAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAAL 229 (347) Q Consensus 150 ~~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~ 229 (347) +...|.+.+.....+... ..+...++.|+++...|+....+ ...| ++.+..|..|+.-.+-.+ T Consensus 240 ---g~g~g~~~g~~~~~~~~~------~~~~~~~d~i~~~~~~l~~~y~~-na~~-imn~~t~~~~~~~~~~~~------ 302 (387) T protein:vir:96 240 ---SPKSGLEHMSFYNGSVKE------VEGADMYDAIINALADLHEDYRD-NATI-YMRYADYVKIISVLSNGT------ 302 (387) T ss_pred ---CCCccccceeeecccccc------ccccchHHHHHHHHhccChhhhc-CCEE-EEechHHHHHHHHHhcCC------ Confidence 111122222222111111 11122367788887778776553 3455 566766666654322112 Q ss_pred ccccccceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheee Q lcl|Aclame:pro 230 IDPETGNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMAL 309 (347) Q Consensus 230 ~~~~~G~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~ 309 (347) ..+..|.-.++.|.+|+.++..|.. +-|||+.... . . ..+.. T Consensus 303 ~~~~~~~~~~llG~PV~~~~~~~~~---------------------------~~GDf~~~~~-~--------~--~~~~~ 344 (387) T protein:vir:96 303 TNFFDTPAEKVFGKPVVFTDAAVKP---------------------------IVGDFNYFGI-N--------Y--DGTTY 344 (387) T ss_pred CcccccCCccccccceEEecCCCce---------------------------eeechhhhhh-h--------h--hhhhh Confidence 2233444457899999998754321 1234433211 0 0 11112 Q ss_pred ccccchhhHhhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 310 ERARRPEFQADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 310 e~~~~~~~~~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) +.+.+...-.-.+++.+.|++++++|++.+.+...+++ T Consensus 345 ~~~~~~~~~~~~~~~~~r~Dg~v~~~~A~~~l~~ka~~ 382 (387) T protein:vir:96 345 DTDKDVKKGEYLFVLTAWYDQQRTLDSAFRIAKAKENT 382 (387) T ss_pred eecccccCCceEEEEEEEeCcEeechhheEEEEeecCC Confidence 22222221123455667899999999999999987777 No 159 >protein:vir:2685 Length: 387 # NCBI annotation: hypothetical protein # Family: family:all:658 # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075504;genbank:gi:12719433;genbank:GeneID:920169 Probab=98.77 E-value=5.4e-10 Score=71.40 Aligned_cols=273 Identities=14% Similarity=0.055 Sum_probs=144.7 Q ss_pred CCCCcc---------CccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEecc--ccce Q lcl|Aclame:pro 1 MANATG---------GQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVM--GRTK 69 (347) Q Consensus 1 m~~~~~---------~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~i--G~~t 69 (347) |..... ..... ..|.. ++|. .+..+.|..++++..+..+.++++.+++++.+ .++|++ +..+ T Consensus 99 ~~~~~~~~~~~~~~~~~~a~-~~~~~-~~gG--~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~---~~~p~~~~~~~~ 171 (387) T protein:vir:26 99 ILPNEFEKPSMEAQRLLHAL-PTGND-SGGD--KLLPKTLSKEIVSEPFAKNQLREKARLTNIKG---LEIPRVSYTLDD 171 (387) T ss_pred HhhhhHHHHHHHHHHHHhhh-ccCCC-CCCc--eeechhHHHHHHHHHHhhchhhhhceeeecCC---ceeeeeeccCCc Confidence 100000 00000 00000 1111 24458899999999999898999888877654 334432 2234 Q ss_pred eeeecCCCCCCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccc Q lcl|Aclame:pro 70 GYYLAPGENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPA 149 (347) Q Consensus 70 ~~~~~~g~~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~ 149 (347) +.-...|...+.+ +++.+++++...++ ..-..|.+-=...+.+|+.+.+.++.++++++..++.++.. T Consensus 172 a~~v~Eg~~~~~~--~~~f~~v~l~~~k~-~~~i~iS~ell~ds~~~l~~~i~~~la~~~~~~e~~~~~~~--------- 239 (387) T protein:vir:26 172 DDFITDVETAKEL--KAKGDTVKFTTNKF-KVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAV--------- 239 (387) T ss_pred ccccccccccccc--ccccceeeechhee-eeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhc--------- Confidence 4444555555432 46677766666554 23334443222235688999999999999998766655421 Q ss_pred ccccccCcccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhcccc Q lcl|Aclame:pro 150 ASNENIAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAAL 229 (347) Q Consensus 150 ~~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~ 229 (347) +...|.+.+.....+... ..+...++.|+++...|+....+ ...| ++.+..|..|+.-.+-.+ T Consensus 240 ---g~g~g~~~g~~~~~~~~~------~~~~~~~d~i~~~~~~l~~~y~~-na~~-imn~~t~~~~~~~~~~~~------ 302 (387) T protein:vir:26 240 ---SPKSGLEHMSFYNGSVKE------VEGADMYDAIINALADLHEDYRD-NATI-YMRYADYVKIISVLSNGT------ 302 (387) T ss_pred ---CCCccccceeeecccccc------ccccchHHHHHHHHhccChhhhc-CCEE-EEechHHHHHHHHHhcCC------ Confidence 111122222222111111 11122367788887778776553 3455 566766666654322112 Q ss_pred ccccccceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheee Q lcl|Aclame:pro 230 IDPETGNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMAL 309 (347) Q Consensus 230 ~~~~~G~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~ 309 (347) ..+..|.-.++.|.+|+.++..|.. +-|||+.... . . ..+.. T Consensus 303 ~~~~~~~~~~llG~PV~~~~~~~~~---------------------------~~GDf~~~~~-~--------~--~~~~~ 344 (387) T protein:vir:26 303 TNFFDTPAEKVFGKPVVFTDAAVKP---------------------------IVGDFNYFGI-N--------Y--DGTTY 344 (387) T ss_pred CcccccCCccccccceEEecCCCce---------------------------eeechhhhhh-h--------h--hhhhh Confidence 2233444457899999998754321 1234433211 0 0 11112 Q ss_pred ccccchhhHhhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 310 ERARRPEFQADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 310 e~~~~~~~~~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) +.+.+...-.-.+++.+.|++++++|++.+.+...+++ T Consensus 345 ~~~~~~~~~~~~~~~~~r~Dg~v~~~~A~~~l~~ka~~ 382 (387) T protein:vir:26 345 DTDKDVKKGEYLFVLTAWYDQQRTLDSAFRIAKAKENT 382 (387) T ss_pred eecccccCCceEEEEEEEeCcEeechhheEEEEeecCC Confidence 22222221123455667899999999999999987777 No 160 >protein:vir:95875 Length: 401 # NCBI annotation: major coat protein # Family: family:all:10944 # MgeID: mge:1586 # MgeName: N4 # Cross-refs: genbank:acc:YP_950534;genbank:gi:119952248;genbank:GeneID:5075702 Probab=98.72 E-value=4e-09 Score=66.63 Aligned_cols=324 Identities=12% Similarity=0.046 Sum_probs=165.3 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccc--cCCceEEEeccccceeeeecC--- Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTI--QNGKSASFPVMGRTKGYYLAP--- 75 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti--~~G~tv~i~~iG~~t~~~~~~--- 75 (347) |-|-|-.++..-...++ +.++ .+...=|.-.++.--++.-++..+-.++++ .+|+|+.|.+--.-. ...+| T Consensus 1 ~~~~~a~~~~~~~s~~g-~~~~--~~~t~y~~~k~L~~Aa~~lv~~~fA~~~piPkn~GkTIk~r~y~pl~-~~~~pl~e 76 (401) T protein:vir:95 1 MLNYNAPTDGQKSSIDG-ANSD--QMQTFFWLKKAIITARKEQYFMPLASVTNMPKHYGKTIKVYEYVPLL-DDRNINDQ 76 (401) T ss_pred CCccCCCcccccccccc-cccc--eeeehhhHHHHHhhhhhhhhhhhcccccccccccCCeEEEEeccccc-ccccchhc Confidence 76655433321111111 1122 233333444444433344566666666655 479999987654322 11222 Q ss_pred CCCCCCC------------------------------CC--CCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHH Q lcl|Aclame:pro 76 GENLDDK------------------------------RK--DIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSA 123 (347) Q Consensus 76 g~~~~~~------------------------------~~--~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~ 123 (347) |-+..+. ++ .++-.++...|-|+=.|.-+=|.++..-....+...++. T Consensus 77 Gv~a~G~~~~~g~~y~~~rdv~~it~~m~~~t~~~~rvn~v~~~~~d~~g~l~qyG~~~e~Td~~~dt~~D~~l~~h~s~ 156 (401) T protein:vir:95 77 GIDASGATIVNGNLYGSSKDIGNITSKLPLLTENGGRVNRVGFTRIAREGSIHKFGFFYEFTQESIDFDSDDGLMEHLSR 156 (401) T ss_pred CCCcccccccCccccccccccceeecccccccccccccccccceeeeeeeeeeeccCccchhhhhhhhhcchHHHHHHHH Confidence 2211111 00 011122333444544444333444444444445555544 Q ss_pred HHHHHHH-HHHHHHHHHHHHHhhhcccccccccCcccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCC-- Q lcl|Aclame:pro 124 QLGEALA-IAADGAVLAEMAKLCNLPAASNENIAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPA-- 200 (347) Q Consensus 124 ~~g~aLa-~~~D~~il~~l~~~a~~a~~~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~-- 200 (347) ++-..=+ +..|.. -+++..++. ....++ +.+..-.+++...+ +....++.|..+...|+++..|+ T Consensus 157 ell~g~~~~t~d~i-~~dll~ag~-~viyAg------~ats~At~~~~~~~----~t~vt~~~l~rl~~~L~~nRapk~t 224 (401) T protein:vir:95 157 ELMNGATQITEAVL-QKDLLAAAG-TVLYAG------AATSDATITGEGST----PSVVSYKNLMRLDQILTENRTPTQT 224 (401) T ss_pred HHhhhhhhhHHHHH-HHHHHhhcC-eeecCC------ccceeeeccccccc----cceechhHHHHHHHHHHhcccccch Confidence 4433332 223433 333332210 000000 00011111111111 11123688999999999988877 Q ss_pred --------C-------CCEEEECh------HHHHHHhcchhhhhh-hccccccccccceEEEeceeEEEecccccccccc Q lcl|Aclame:pro 201 --------G-------DRRFYCAP------EDYSAILSALMPNAA-NYAALIDPETGNIRNVMGFEVIEVPHLTVGGAGD 258 (347) Q Consensus 201 --------~-------gR~~vv~P------~~~~~Ll~~~~~~~~-~~~~~~~~~~G~v~~i~G~~V~~sn~lp~~~~~~ 258 (347) - -|+.++.| +.+++|+.++.|+.. .|..-+..-+|.||++.+|++++++.+-...... T Consensus 225 ~~i~~s~~~dTk~i~~s~va~~h~~L~~di~a~~D~~~~~~fi~v~kYa~~~~i~~gEiG~i~~vR~i~~p~~~~w~~ag 304 (401) T protein:vir:95 225 TIITGSRMIDTKVIGATRVMYVGSELVPELKAMKDLFGNKAFIETQHYADAGTIMNGEVGSIDKFRIIQVPEMLHWAGAG 304 (401) T ss_pred hhhhhhhccCccccccceEEEEecCchhHHHHHHHhcCCCCceehhhcCCccccccccccccCceeEEecccceeecCCc Confidence 1 26778777 555888888999874 6777778889999999999999988864332111 Q ss_pred ccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeec---------------cccchhhHhhHHh Q lcl|Aclame:pro 259 NNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALE---------------RARRPEFQADQII 323 (347) Q Consensus 259 ~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e---------------~~~~~~~~~d~i~ 323 (347) ... .+..+ +-.. .....++.|.+ .-.|++-++|.+++..+..... ..-||.-|-=.+- T Consensus 305 ~~a-~~~~~-~y~~-~~~~~gg~~dV----yp~lV~G~dAf~~~~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQ~g~vg 377 (401) T protein:vir:95 305 AQA-TGANP-GYRT-SMVSGQEHYDV----YPMLVVGDDSFTSIGFQTDGKSLKFTVMTKMPGKETADRNDPYGETGFSS 377 (401) T ss_pred ccc-ccccc-cccc-ccccCCCccee----eeeeEEccccceecccccCCccccceeEeecCCcCCCCCCCcccceehhh Confidence 100 00000 0000 01112233332 2258888888888777654321 0136666766777 Q ss_pred hhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 324 GKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 324 ~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) =++.|++.+||||..+-|...+.= T Consensus 378 wK~~~a~~vL~~e~m~~ies~a~~ 401 (401) T protein:vir:95 378 IKWYYGILVKRPERLALIKTVAPL 401 (401) T ss_pred hhhhhhhheeccceeEEEEeecCC Confidence 788999999999999999766555 No 161 >protein:vir:10123 Length: 404 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:180 # MgeName: Stx2 converting bacteriophage II # Cross-refs: genbank:acc:NP_859253;genbank:gi:32171009;genbank:GeneID:2653345 Probab=98.67 E-value=1.1e-08 Score=64.29 Aligned_cols=330 Identities=10% Similarity=0.064 Sum_probs=172.9 Q ss_pred CCCCccCccccccCcccC----ccccHHHHHHHHHhHHHHHHHHHHHhhh---------ccccccccc--CCceEEEecc Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQ----SAADKLALFLKVFGGEVLTAFVRRSVTM---------DKHMVRTIQ--NGKSASFPVM 65 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~----~~~d~~al~ie~f~geV~~~f~~~s~~~---------~~~~~rti~--~G~tv~i~~i 65 (347) |+-... ....-.|.- ..+-.+ -+++.|.+.+...=+..+-+. ..++...+. .|++|.|.-+ T Consensus 1 ~~~~~~---~~a~~~~~~~lft~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~I~~~~dL~K~aGd~vtf~L~ 76 (404) T protein:vir:10 1 MTTVTS---AQANKLYQVALFTAANRNR-SMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIM 76 (404) T ss_pred CCCcCC---cchhhhHHHHHHHHHhcCC-hhHhhhhhhhhhhhhhccchhhccCCCCCccEEEeecCCCCCCcEEEEeEe Confidence 544322 212222211 011111 357888776544333222111 222222332 5999999988 Q ss_pred ccceeeeecCCCCCCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|Aclame:pro 66 GRTKGYYLAPGENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLC 145 (347) Q Consensus 66 G~~t~~~~~~g~~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a 145 (347) ...+-....-++.+.+..+.++....+|+||+..-.=..=..+++-.+-+|+|++.-..++.-+++..||.+|.+|+.+ T Consensus 77 ~~L~g~gv~Gd~~lEGnee~L~~~s~~i~Idq~r~~V~~~g~msqQRt~~dlr~~ar~~L~~w~~~~~d~~~~~~laG~- 155 (404) T protein:vir:10 77 HKLSKRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGA- 155 (404) T ss_pred eecccCCcccCceeeccccceeEEeeEEEEeeecccccccCchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc- Confidence 8887666666677777778888899999999975331111577788889999999999999999999999999988743 Q ss_pred hccccc---------cccc-----Cc--ccCce-eeeecccccc---cchhhHHHHHHHHHHHHHHHHhhccCCC----- Q lcl|Aclame:pro 146 NLPAAS---------NENI-----AG--LGQAV-VLNIGAAADL---VDVEARGKAILKGLTLARARLTKNYVPA----- 200 (347) Q Consensus 146 ~~a~~~---------~~~~-----~g--~~~~~-~i~~~~~~~~---~~~~~~~~~i~~~l~~a~~~Lde~~VP~----- 200 (347) +.-..+ .... .. .+... .+-.+.+++. +..+... ++.|-++.+++++..-|- T Consensus 156 rg~~~n~~~~vp~~~~~~~~~~~~N~v~APt~~r~~~~g~at~~~~l~stD~~s---~~~Id~~~~~~~~~~~pi~Pv~~ 232 (404) T protein:vir:10 156 RGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFS---IGLVDNLSLFIDEMAHPLQPVRL 232 (404) T ss_pred ccccccccceeeccccccccceeecccCCCCCCcEEeccCccchhhhhhccccc---HHHHHHHHHHHHHhCCCCcceEe Confidence 210000 0000 00 00001 1111111111 1111111 344555667776643331 Q ss_pred --CC-------CEEEEChHHHHHHhcchh---hhh----hhc---cccccccccceEEEeceeEEEeccccccccccccc Q lcl|Aclame:pro 201 --GD-------RRFYCAPEDYSAILSALM---PNA----ANY---AALIDPETGNIRNVMGFEVIEVPHLTVGGAGDNNP 261 (347) Q Consensus 201 --~g-------R~~vv~P~~~~~Ll~~~~---~~~----~~~---~~~~~~~~G~v~~i~G~~V~~sn~lp~~~~~~~~~ 261 (347) +. ++++++|.+|..|..++. +.+ +.. .....+-.|.++.+.|+-|++.++.|..-...... T Consensus 233 ~g~~~~~~~~~yV~~~~p~q~~~Lr~dt~~~~w~d~q~~A~a~~rg~~nPlF~G~~gm~ngvii~~~~~~~Irf~~g~~~ 312 (404) T protein:vir:10 233 SGDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKV 312 (404) T ss_pred ccccccCccceEEEEechHHHHHHhhCCCcHHHHHHHHHHhhccccccCCceecCeeEEcCEEEEecCCceeeeccccee Confidence 12 677899999999999852 222 111 11344668999999999999999887532111111 Q ss_pred c-CccccccccccccccccccccccccceeEEeechhhhhhhhhh-----heeeccccchhhHhhHHhhhhhhcCcccc- Q lcl|Aclame:pro 262 A-DGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLK-----DMALERARRPEFQADQIIGKYAMGHGGLR- 334 (347) Q Consensus 262 ~-~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~-----~~~~e~~~~~~~~~d~i~~~~~~G~~~lR- 334 (347) . ......+.... ....+.++ .+|++-..|++.+..+ .--.|..+|-.++- -|.....+|.+=.| T Consensus 313 ~~~~n~~~a~~~~----~aa~~~v~----RallLGaQAl~~A~g~~~g~~~~w~Ee~~D~g~~~-~i~~~~i~G~kK~rF 383 (404) T protein:vir:10 313 LVSENNLTATTKE----VAAATNID----RAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRT-EIAISWINGLKKIRF 383 (404) T ss_pred eecCCcccccccc----ccccccch----hheeecceeEEEEeeccCCCCceeEeeccccCchh-hhhhHHHhhhhhccc Confidence 0 00000000000 01111121 1233333333222111 11244444544433 45556667765555 Q ss_pred c------ceEEEEEecCCC Q lcl|Aclame:pro 335 P------EAAGALVFTPAA 347 (347) Q Consensus 335 P------e~~~~l~~~~aa 347 (347) | +--|+|..-.+| T Consensus 384 ~~~~g~~~DfGvi~idta~ 402 (404) T protein:vir:10 384 PEKSGKMQDHGVIAVDTAV 402 (404) T ss_pred cCCCCceeeEEEEEecccc Confidence 4 356777777777 No 162 >protein:vir:104439 Length: 404 # NCBI annotation: putative virion structural protein # Family: family:all:974 # MgeID: mge:1471 # MgeName: 86 # Cross-refs: genbank:acc:YP_794063;genbank:gi:116222008;genbank:GeneID:4397504 Probab=98.67 E-value=1.1e-08 Score=64.29 Aligned_cols=330 Identities=10% Similarity=0.064 Sum_probs=172.9 Q ss_pred CCCCccCccccccCcccC----ccccHHHHHHHHHhHHHHHHHHHHHhhh---------ccccccccc--CCceEEEecc Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQ----SAADKLALFLKVFGGEVLTAFVRRSVTM---------DKHMVRTIQ--NGKSASFPVM 65 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~----~~~d~~al~ie~f~geV~~~f~~~s~~~---------~~~~~rti~--~G~tv~i~~i 65 (347) |+-... ....-.|.- ..+-.+ -+++.|.+.+...=+..+-+. ..++...+. .|++|.|.-+ T Consensus 1 ~~~~~~---~~a~~~~~~~lft~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~I~~~~dL~K~aGd~vtf~L~ 76 (404) T protein:vir:10 1 MTTVTS---AQANKLYQVALFTAANRNR-SMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIM 76 (404) T ss_pred CCCcCC---cchhhhHHHHHHHHHhcCC-hhHhhhhhhhhhhhhhccchhhccCCCCCccEEEeecCCCCCCcEEEEeEe Confidence 544322 212222211 011111 357888776544333222111 222222332 5999999988 Q ss_pred ccceeeeecCCCCCCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|Aclame:pro 66 GRTKGYYLAPGENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLC 145 (347) Q Consensus 66 G~~t~~~~~~g~~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a 145 (347) ...+-....-++.+.+..+.++....+|+||+..-.=..=..+++-.+-+|+|++.-..++.-+++..||.+|.+|+.+ T Consensus 77 ~~L~g~gv~Gd~~lEGnee~L~~~s~~i~Idq~r~~V~~~g~msqQRt~~dlr~~ar~~L~~w~~~~~d~~~~~~laG~- 155 (404) T protein:vir:10 77 HKLSKRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGA- 155 (404) T ss_pred eecccCCcccCceeeccccceeEEeeEEEEeeecccccccCchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc- Confidence 8887666666677777778888899999999975331111577788889999999999999999999999999988743 Q ss_pred hccccc---------cccc-----Cc--ccCce-eeeecccccc---cchhhHHHHHHHHHHHHHHHHhhccCCC----- Q lcl|Aclame:pro 146 NLPAAS---------NENI-----AG--LGQAV-VLNIGAAADL---VDVEARGKAILKGLTLARARLTKNYVPA----- 200 (347) Q Consensus 146 ~~a~~~---------~~~~-----~g--~~~~~-~i~~~~~~~~---~~~~~~~~~i~~~l~~a~~~Lde~~VP~----- 200 (347) +.-..+ .... .. .+... .+-.+.+++. +..+... ++.|-++.+++++..-|- T Consensus 156 rg~~~n~~~~vp~~~~~~~~~~~~N~v~APt~~r~~~~g~at~~~~l~stD~~s---~~~Id~~~~~~~~~~~pi~Pv~~ 232 (404) T protein:vir:10 156 RGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFS---IGLVDNLSLFIDEMAHPLQPVRL 232 (404) T ss_pred ccccccccceeeccccccccceeecccCCCCCCcEEeccCccchhhhhhccccc---HHHHHHHHHHHHHhCCCCcceEe Confidence 210000 0000 00 00001 1111111111 1111111 344555667776643331 Q ss_pred --CC-------CEEEEChHHHHHHhcchh---hhh----hhc---cccccccccceEEEeceeEEEeccccccccccccc Q lcl|Aclame:pro 201 --GD-------RRFYCAPEDYSAILSALM---PNA----ANY---AALIDPETGNIRNVMGFEVIEVPHLTVGGAGDNNP 261 (347) Q Consensus 201 --~g-------R~~vv~P~~~~~Ll~~~~---~~~----~~~---~~~~~~~~G~v~~i~G~~V~~sn~lp~~~~~~~~~ 261 (347) +. ++++++|.+|..|..++. +.+ +.. .....+-.|.++.+.|+-|++.++.|..-...... T Consensus 233 ~g~~~~~~~~~yV~~~~p~q~~~Lr~dt~~~~w~d~q~~A~a~~rg~~nPlF~G~~gm~ngvii~~~~~~~Irf~~g~~~ 312 (404) T protein:vir:10 233 SGDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKV 312 (404) T ss_pred ccccccCccceEEEEechHHHHHHhhCCCcHHHHHHHHHHhhccccccCCceecCeeEEcCEEEEecCCceeeeccccee Confidence 12 677899999999999852 222 111 11344668999999999999999887532111111 Q ss_pred c-CccccccccccccccccccccccccceeEEeechhhhhhhhhh-----heeeccccchhhHhhHHhhhhhhcCcccc- Q lcl|Aclame:pro 262 A-DGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLK-----DMALERARRPEFQADQIIGKYAMGHGGLR- 334 (347) Q Consensus 262 ~-~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~-----~~~~e~~~~~~~~~d~i~~~~~~G~~~lR- 334 (347) . ......+.... ....+.++ .+|++-..|++.+..+ .--.|..+|-.++- -|.....+|.+=.| T Consensus 313 ~~~~n~~~a~~~~----~aa~~~v~----RallLGaQAl~~A~g~~~g~~~~w~Ee~~D~g~~~-~i~~~~i~G~kK~rF 383 (404) T protein:vir:10 313 LVSENNLTATTKE----VAAATNID----RAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRT-EIAISWINGLKKIRF 383 (404) T ss_pred eecCCcccccccc----ccccccch----hheeecceeEEEEeeccCCCCceeEeeccccCchh-hhhhHHHhhhhhccc Confidence 0 00000000000 01111121 1233333333222111 11244444544433 45556667765555 Q ss_pred c------ceEEEEEecCCC Q lcl|Aclame:pro 335 P------EAAGALVFTPAA 347 (347) Q Consensus 335 P------e~~~~l~~~~aa 347 (347) | +--|+|..-.+| T Consensus 384 ~~~~g~~~DfGvi~idta~ 402 (404) T protein:vir:10 384 PEKSGKMQDHGVIAVDTAV 402 (404) T ss_pred cCCCCceeeEEEEEecccc Confidence 4 356777777777 No 163 >protein:vir:3298 Length: 404 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:66 # MgeName: 933W # Cross-refs: genbank:acc:NP_049514;genbank:gi:9632520;genbank:GeneID:1262006 Probab=98.67 E-value=1.1e-08 Score=64.29 Aligned_cols=330 Identities=10% Similarity=0.064 Sum_probs=172.9 Q ss_pred CCCCccCccccccCcccC----ccccHHHHHHHHHhHHHHHHHHHHHhhh---------ccccccccc--CCceEEEecc Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQ----SAADKLALFLKVFGGEVLTAFVRRSVTM---------DKHMVRTIQ--NGKSASFPVM 65 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~----~~~d~~al~ie~f~geV~~~f~~~s~~~---------~~~~~rti~--~G~tv~i~~i 65 (347) |+-... ....-.|.- ..+-.+ -+++.|.+.+...=+..+-+. ..++...+. .|++|.|.-+ T Consensus 1 ~~~~~~---~~a~~~~~~~lft~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~I~~~~dL~K~aGd~vtf~L~ 76 (404) T protein:vir:32 1 MTTVTS---AQANKLYQVALFTAANRNR-SMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIM 76 (404) T ss_pred CCCcCC---cchhhhHHHHHHHHHhcCC-hhHhhhhhhhhhhhhhccchhhccCCCCCccEEEeecCCCCCCcEEEEeEe Confidence 544322 212222211 011111 357888776544333222111 222222332 5999999988 Q ss_pred ccceeeeecCCCCCCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|Aclame:pro 66 GRTKGYYLAPGENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLC 145 (347) Q Consensus 66 G~~t~~~~~~g~~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a 145 (347) ...+-....-++.+.+..+.++....+|+||+..-.=..=..+++-.+-+|+|++.-..++.-+++..||.+|.+|+.+ T Consensus 77 ~~L~g~gv~Gd~~lEGnee~L~~~s~~i~Idq~r~~V~~~g~msqQRt~~dlr~~ar~~L~~w~~~~~d~~~~~~laG~- 155 (404) T protein:vir:32 77 HKLSKRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGA- 155 (404) T ss_pred eecccCCcccCceeeccccceeEEeeEEEEeeecccccccCchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc- Confidence 8887666666677777778888899999999975331111577788889999999999999999999999999988743 Q ss_pred hccccc---------cccc-----Cc--ccCce-eeeecccccc---cchhhHHHHHHHHHHHHHHHHhhccCCC----- Q lcl|Aclame:pro 146 NLPAAS---------NENI-----AG--LGQAV-VLNIGAAADL---VDVEARGKAILKGLTLARARLTKNYVPA----- 200 (347) Q Consensus 146 ~~a~~~---------~~~~-----~g--~~~~~-~i~~~~~~~~---~~~~~~~~~i~~~l~~a~~~Lde~~VP~----- 200 (347) +.-..+ .... .. .+... .+-.+.+++. +..+... ++.|-++.+++++..-|- T Consensus 156 rg~~~n~~~~vp~~~~~~~~~~~~N~v~APt~~r~~~~g~at~~~~l~stD~~s---~~~Id~~~~~~~~~~~pi~Pv~~ 232 (404) T protein:vir:32 156 RGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFS---IGLVDNLSLFIDEMAHPLQPVRL 232 (404) T ss_pred ccccccccceeeccccccccceeecccCCCCCCcEEeccCccchhhhhhccccc---HHHHHHHHHHHHHhCCCCcceEe Confidence 210000 0000 00 00001 1111111111 1111111 344555667776643331 Q ss_pred --CC-------CEEEEChHHHHHHhcchh---hhh----hhc---cccccccccceEEEeceeEEEeccccccccccccc Q lcl|Aclame:pro 201 --GD-------RRFYCAPEDYSAILSALM---PNA----ANY---AALIDPETGNIRNVMGFEVIEVPHLTVGGAGDNNP 261 (347) Q Consensus 201 --~g-------R~~vv~P~~~~~Ll~~~~---~~~----~~~---~~~~~~~~G~v~~i~G~~V~~sn~lp~~~~~~~~~ 261 (347) +. ++++++|.+|..|..++. +.+ +.. .....+-.|.++.+.|+-|++.++.|..-...... T Consensus 233 ~g~~~~~~~~~yV~~~~p~q~~~Lr~dt~~~~w~d~q~~A~a~~rg~~nPlF~G~~gm~ngvii~~~~~~~Irf~~g~~~ 312 (404) T protein:vir:32 233 SGDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKV 312 (404) T ss_pred ccccccCccceEEEEechHHHHHHhhCCCcHHHHHHHHHHhhccccccCCceecCeeEEcCEEEEecCCceeeeccccee Confidence 12 677899999999999852 222 111 11344668999999999999999887532111111 Q ss_pred c-CccccccccccccccccccccccccceeEEeechhhhhhhhhh-----heeeccccchhhHhhHHhhhhhhcCcccc- Q lcl|Aclame:pro 262 A-DGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLK-----DMALERARRPEFQADQIIGKYAMGHGGLR- 334 (347) Q Consensus 262 ~-~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~-----~~~~e~~~~~~~~~d~i~~~~~~G~~~lR- 334 (347) . ......+.... ....+.++ .+|++-..|++.+..+ .--.|..+|-.++- -|.....+|.+=.| T Consensus 313 ~~~~n~~~a~~~~----~aa~~~v~----RallLGaQAl~~A~g~~~g~~~~w~Ee~~D~g~~~-~i~~~~i~G~kK~rF 383 (404) T protein:vir:32 313 LVSENNLTATTKE----VAAATNID----RAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRT-EIAISWINGLKKIRF 383 (404) T ss_pred eecCCcccccccc----ccccccch----hheeecceeEEEEeeccCCCCceeEeeccccCchh-hhhhHHHhhhhhccc Confidence 0 00000000000 01111121 1233333333222111 11244444544433 45556667765555 Q ss_pred c------ceEEEEEecCCC Q lcl|Aclame:pro 335 P------EAAGALVFTPAA 347 (347) Q Consensus 335 P------e~~~~l~~~~aa 347 (347) | +--|+|..-.+| T Consensus 384 ~~~~g~~~DfGvi~idta~ 402 (404) T protein:vir:32 384 PEKSGKMQDHGVIAVDTAV 402 (404) T ss_pred cCCCCceeeEEEEEecccc Confidence 4 356777777777 No 164 >protein:vir:819 Length: 404 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:16 # MgeName: VT2-Sa # Cross-refs: genbank:acc:NP_050552;genbank:gi:9633449;genbank:GeneID:1262254 Probab=98.67 E-value=1.1e-08 Score=64.29 Aligned_cols=330 Identities=10% Similarity=0.064 Sum_probs=172.9 Q ss_pred CCCCccCccccccCcccC----ccccHHHHHHHHHhHHHHHHHHHHHhhh---------ccccccccc--CCceEEEecc Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQ----SAADKLALFLKVFGGEVLTAFVRRSVTM---------DKHMVRTIQ--NGKSASFPVM 65 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~----~~~d~~al~ie~f~geV~~~f~~~s~~~---------~~~~~rti~--~G~tv~i~~i 65 (347) |+-... ....-.|.- ..+-.+ -+++.|.+.+...=+..+-+. ..++...+. .|++|.|.-+ T Consensus 1 ~~~~~~---~~a~~~~~~~lft~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~I~~~~dL~K~aGd~vtf~L~ 76 (404) T protein:vir:81 1 MTTVTS---AQANKLYQVALFTAANRNR-SMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIM 76 (404) T ss_pred CCCcCC---cchhhhHHHHHHHHHhcCC-hhHhhhhhhhhhhhhhccchhhccCCCCCccEEEeecCCCCCCcEEEEeEe Confidence 544322 212222211 011111 357888776544333222111 222222332 5999999988 Q ss_pred ccceeeeecCCCCCCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|Aclame:pro 66 GRTKGYYLAPGENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLC 145 (347) Q Consensus 66 G~~t~~~~~~g~~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a 145 (347) ...+-....-++.+.+..+.++....+|+||+..-.=..=..+++-.+-+|+|++.-..++.-+++..||.+|.+|+.+ T Consensus 77 ~~L~g~gv~Gd~~lEGnee~L~~~s~~i~Idq~r~~V~~~g~msqQRt~~dlr~~ar~~L~~w~~~~~d~~~~~~laG~- 155 (404) T protein:vir:81 77 HKLSKRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGA- 155 (404) T ss_pred eecccCCcccCceeeccccceeEEeeEEEEeeecccccccCchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc- Confidence 8887666666677777778888899999999975331111577788889999999999999999999999999988743 Q ss_pred hccccc---------cccc-----Cc--ccCce-eeeecccccc---cchhhHHHHHHHHHHHHHHHHhhccCCC----- Q lcl|Aclame:pro 146 NLPAAS---------NENI-----AG--LGQAV-VLNIGAAADL---VDVEARGKAILKGLTLARARLTKNYVPA----- 200 (347) Q Consensus 146 ~~a~~~---------~~~~-----~g--~~~~~-~i~~~~~~~~---~~~~~~~~~i~~~l~~a~~~Lde~~VP~----- 200 (347) +.-..+ .... .. .+... .+-.+.+++. +..+... ++.|-++.+++++..-|- T Consensus 156 rg~~~n~~~~vp~~~~~~~~~~~~N~v~APt~~r~~~~g~at~~~~l~stD~~s---~~~Id~~~~~~~~~~~pi~Pv~~ 232 (404) T protein:vir:81 156 RGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFS---IGLVDNLSLFIDEMAHPLQPVRL 232 (404) T ss_pred ccccccccceeeccccccccceeecccCCCCCCcEEeccCccchhhhhhccccc---HHHHHHHHHHHHHhCCCCcceEe Confidence 210000 0000 00 00001 1111111111 1111111 344555667776643331 Q ss_pred --CC-------CEEEEChHHHHHHhcchh---hhh----hhc---cccccccccceEEEeceeEEEeccccccccccccc Q lcl|Aclame:pro 201 --GD-------RRFYCAPEDYSAILSALM---PNA----ANY---AALIDPETGNIRNVMGFEVIEVPHLTVGGAGDNNP 261 (347) Q Consensus 201 --~g-------R~~vv~P~~~~~Ll~~~~---~~~----~~~---~~~~~~~~G~v~~i~G~~V~~sn~lp~~~~~~~~~ 261 (347) +. ++++++|.+|..|..++. +.+ +.. .....+-.|.++.+.|+-|++.++.|..-...... T Consensus 233 ~g~~~~~~~~~yV~~~~p~q~~~Lr~dt~~~~w~d~q~~A~a~~rg~~nPlF~G~~gm~ngvii~~~~~~~Irf~~g~~~ 312 (404) T protein:vir:81 233 SGDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSKV 312 (404) T ss_pred ccccccCccceEEEEechHHHHHHhhCCCcHHHHHHHHHHhhccccccCCceecCeeEEcCEEEEecCCceeeeccccee Confidence 12 677899999999999852 222 111 11344668999999999999999887532111111 Q ss_pred c-CccccccccccccccccccccccccceeEEeechhhhhhhhhh-----heeeccccchhhHhhHHhhhhhhcCcccc- Q lcl|Aclame:pro 262 A-DGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLK-----DMALERARRPEFQADQIIGKYAMGHGGLR- 334 (347) Q Consensus 262 ~-~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~-----~~~~e~~~~~~~~~d~i~~~~~~G~~~lR- 334 (347) . ......+.... ....+.++ .+|++-..|++.+..+ .--.|..+|-.++- -|.....+|.+=.| T Consensus 313 ~~~~n~~~a~~~~----~aa~~~v~----RallLGaQAl~~A~g~~~g~~~~w~Ee~~D~g~~~-~i~~~~i~G~kK~rF 383 (404) T protein:vir:81 313 LVSENNLTATTKE----VAAATNID----RAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRT-EIAISWINGLKKIRF 383 (404) T ss_pred eecCCcccccccc----ccccccch----hheeecceeEEEEeeccCCCCceeEeeccccCchh-hhhhHHHhhhhhccc Confidence 0 00000000000 01111121 1233333333222111 11244444544433 45556667765555 Q ss_pred c------ceEEEEEecCCC Q lcl|Aclame:pro 335 P------EAAGALVFTPAA 347 (347) Q Consensus 335 P------e~~~~l~~~~aa 347 (347) | +--|+|..-.+| T Consensus 384 ~~~~g~~~DfGvi~idta~ 402 (404) T protein:vir:81 384 PEKSGKMQDHGVIAVDTAV 402 (404) T ss_pred cCCCCceeeEEEEEecccc Confidence 4 356777777777 No 165 >protein:vir:4197 Length: 314 # NCBI annotation: putative structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071822;genbank:gi:11863105;genbank:GeneID:1257607 Probab=98.48 E-value=1.2e-08 Score=64.02 Aligned_cols=296 Identities=14% Similarity=0.123 Sum_probs=154.1 Q ss_pred CC------CCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccc-cccCCceEEEeccccce--ee Q lcl|Aclame:pro 1 MA------NATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVR-TIQNGKSASFPVMGRTK--GY 71 (347) Q Consensus 1 m~------~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~r-ti~~G~tv~i~~iG~~t--~~ 71 (347) |= ++...-.+ +..+. | -|--++|. ++.+.-+..|.++.+.++. +..+ .+..|+++|... .. T Consensus 1 ~~~~~~~~~~~k~it~-~d~~g----G---~L~P~~~~-~~i~~l~e~s~i~~~a~vi~t~~s-~~~~i~~i~~g~~~~~ 70 (314) T protein:vir:41 1 MDFLNKPFQITPKIDV-PDLGK----G---ILAVQRFG-EFVREVRENSAIIKDARVLNALKS-YEVDISRISLGVELEP 70 (314) T ss_pred CchhhhHHHhhccccc-ccCCC----c---eeChHHHH-HHHHHHHhccchhhheeeecccCc-cceeecccccCccccc Confidence 21 11111000 11111 1 13347775 6778888999999888864 4444 457888887431 11 Q ss_pred eec-CCCCCCCCCCCCCCCceEEEEeeeeecchhhcc-HHHHHh-CcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcc Q lcl|Aclame:pro 72 YLA-PGENLDDKRKDIKHSEKVIQIDGLLTSDVLIYD-IEDAMN-HYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLP 148 (347) Q Consensus 72 ~~~-~g~~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd-~D~~q~-~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a 148 (347) -.. .|+....+..+++.+.++|..-++.. .+.|.+ +=+..+ ..|+.+.+..+.++++++....+++.= .++..+ T Consensus 71 ~~~~~~~~~~~~~~~~tf~~~~l~~~kl~~-~v~is~e~L~D~a~~~~le~~i~~~~Ae~~g~~~~~~~~nG--dg~~~s 147 (314) T protein:vir:41 71 GRNTSGTKVAPTADEVTVSTNTLEMKELVT-KVVLEDEALEDNIEQSAFEQTITSLLASGVTYDLECFFLHA--DSSLTT 147 (314) T ss_pred ccccccCCccCCcccccccceeeeeEEEEE-eecccHHHHHhhhchhhHHHHHHHHHHHHHHHHHHHHhhcc--ccCCcC Confidence 111 11111112234667777777777643 455632 222222 248999999999999999887766531 111111 Q ss_pred ccc-ccccCcccC--ceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCC-EEEEChHHHHHHhcchhhhhh Q lcl|Aclame:pro 149 AAS-NENIAGLGQ--AVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDR-RFYCAPEDYSAILSALMPNAA 224 (347) Q Consensus 149 ~~~-~~~~~g~~~--~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR-~~vv~P~~~~~Ll~~~~~~~~ 224 (347) +.+ ...+.|+-. +..+...++.+ +....+.|.++...|....--...+ +++++++.+..+.+-.. -+. T Consensus 148 ~~~~~~~p~G~l~~a~~~~~~~~~~~-------~~~~~~~~~~l~~sl~~~yr~~~~~~~~~m~~~t~~~~r~~l~-~~~ 219 (314) T protein:vir:41 148 GRELYRINDGWMKLAGNQYTDAEPED-------ENWPLNLFDGMMDELDTRYLQLKPRMKFYVSNEIYNGYRKQLL-VRE 219 (314) T ss_pred cccchhcchhhhhhcccceeecCccc-------cccHHHHHHHHHHhcCchhhcCCCceEEEecHHHHHHHHHHHh-ccC Confidence 100 012223211 11111111111 1112344556666665543211222 45679988877654110 112 Q ss_pred hccccccccccceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhh Q lcl|Aclame:pro 225 NYAALIDPETGNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKL 304 (347) Q Consensus 225 ~~~~~~~~~~G~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~ 304 (347) .+.++..+..|.-..+.|++|+.++.+|..... . .+.++.+++-+..+-. T Consensus 220 ~~l~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~--------------------~----------~~i~fgd~~nlv~~~~ 269 (314) T protein:vir:41 220 TGLGDSALIGATGLQYDGIPIQYVPALDALGDD--------------------K----------ARALLTVPTNLVYGFW 269 (314) T ss_pred CcccchhhhCCCCceecceeeEecccccccCCC--------------------C----------ceEEEechhheEEEee Confidence 334455566777788999999999999742111 1 1123444443333556 Q ss_pred hheeeccccchhhHhhHHhhhhhhcCcccccceEEEEEecC-CC Q lcl|Aclame:pro 305 KDMALERARRPEFQADQIIGKYAMGHGGLRPEAAGALVFTP-AA 347 (347) Q Consensus 305 ~~~~~e~~~~~~~~~d~i~~~~~~G~~~lRPe~~~~l~~~~-aa 347 (347) .+++++..|+.+.....+...+.+++.+.-+++|+....-. .| T Consensus 270 ~~ir~~~~~~a~~~~~~~~~~~r~d~~~~~~~aa~~~~~~~~~~ 313 (314) T protein:vir:41 270 RNIRIEPKRDAAMRRTEYIASLRADCNYEDENAAVAAVIDMSSG 313 (314) T ss_pred ceeEEeecccCcCCeEEEEEEEEeceEEEEcCcEEEEEeeccCC Confidence 67788888877765666677777888887665555544444 44 No 166 >protein:vir:9509 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835556;genbank:gi:30043951;genbank:GeneID:1260537 Probab=98.44 E-value=1.1e-08 Score=64.17 Aligned_cols=289 Identities=12% Similarity=0.022 Sum_probs=146.0 Q ss_pred CCCCccCcccccc--------Cc-ccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEeccccc-ee Q lcl|Aclame:pro 1 MANATGGQQIGAN--------QG-KGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVMGRT-KG 70 (347) Q Consensus 1 m~~~~~~~~~~~~--------~~-~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~iG~~-t~ 70 (347) +..... +..+. .. ...+.|. .|.-+.+..++.+.....|.++.+.++.++. |+ .+|++.... .+ T Consensus 57 ~~~~~~--~~lt~~e~~~~~~~~~~~~~~gg--~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~-~~-~~i~~~~~~~~a 130 (381) T protein:vir:95 57 SLPKSA--QSLSANQRSFFMDINKNVNYKEE--KLLPEETIDRIFEDLTTNHPLLADLGIKNAG-LR-LKFLKSETSGVA 130 (381) T ss_pred HhccCc--ccccHHHHHHHHHHhcccCCCCc--eecCHHHHHHHHHHHHhhccceeheeeEecC-cc-eEEEEecCCcce Confidence 111000 00000 00 0111122 2556999999999999999999998887764 44 466655433 33 Q ss_pred eeecCCCCCCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccc Q lcl|Aclame:pro 71 YYLAPGENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAA 150 (347) Q Consensus 71 ~~~~~g~~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~ 150 (347) .-..-+..+..+ .+.+..+++|..-++ +.-..|..-=...+.+|+.+.+.++.++++++..|++++. +-....| T Consensus 131 ~w~~e~~~~~~~-~~~~f~~i~l~~~kl-~~~~~is~elL~Ds~~~ie~~i~~~la~~~a~~~~~a~i~----G~G~~qP 204 (381) T protein:vir:95 131 VWGKIYGEIKGQ-LDAAFSEETAIQNKL-TAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLK----GTGKDQP 204 (381) T ss_pred eeeccccccccc-ccccceeeeecceeE-EeechhhHHHhhcCHHHHHHHHHHHHHHHHHHHhhheeEe----ccCCCCc Confidence 332222333322 234556666655554 4444454322333667899999999999999999987763 1100000 Q ss_pred cccccCcccCceeeeecccc----cccchhhHHHHHHHHHHHHHHHHhhc----cC-CCCCCEEEEChHHHHHHhcchhh Q lcl|Aclame:pro 151 SNENIAGLGQAVVLNIGAAA----DLVDVEARGKAILKGLTLARARLTKN----YV-PAGDRRFYCAPEDYSAILSALMP 221 (347) Q Consensus 151 ~~~~~~g~~~~~~i~~~~~~----~~~~~~~~~~~i~~~l~~a~~~Lde~----~V-P~~gR~~vv~P~~~~~Ll~~~~~ 221 (347) -+.............+... ..+....+...+++.|.++...|... .. +..+-++++.|..++.|+.-... T Consensus 205 -~Gil~~~~~~~~~~~g~~~~~~~~~t~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~l~~~~~~ 283 (381) T protein:vir:95 205 -IGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTH 283 (381) T ss_pred -eeeeeccCcccccccccccccccccccccccchhhHHHHHHHHHhhccccccccccccCceEEEEccccHHhhcccccc Confidence 0000000000011111000 00000111222344454444444322 22 34445678999988887643322 Q ss_pred hhhhccccccccccceEEE--eceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhh Q lcl|Aclame:pro 222 NAANYAALIDPETGNIRNV--MGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAV 299 (347) Q Consensus 222 ~~~~~~~~~~~~~G~v~~i--~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~ 299 (347) .+ .+|..... .|.+|++|+.+|... -..+||+.. +++ T Consensus 284 ~~---------~~G~~v~~l~~g~~vv~s~~~p~~~-------------------------iifgDfs~Y--~i~----- 322 (381) T protein:vir:95 284 LN---------ANGVYVTALPFNLNVIESTVQEAGK-------------------------VLTYVKGLY--DGY----- 322 (381) T ss_pred CC---------CCCceeecCCCCceEEecCCCCcCc-------------------------EEEEecccE--EEE----- Confidence 11 13332222 366789999888421 122455442 233 Q ss_pred hhhhhhheeeccccchhhH---hhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 300 GTVKLKDMALERARRPEFQ---ADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 300 ~tv~~~~~~~e~~~~~~~~---~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) ....++++... ..++ ...+++.+.++.++++|++++++..+-.. T Consensus 323 ---~r~~~~i~~~~-~~~~~~d~~~f~a~~r~dg~~~~~~A~~v~~l~~~~ 369 (381) T protein:vir:95 323 ---LAGGINVQKFK-ETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKG 369 (381) T ss_pred ---EecccEEEeec-hhHhhcCCeEEEEEEEEcCEEecCceEEEEEEEecC Confidence 23334444332 2222 23688999999999999999987765544 No 167 >protein:vir:101291 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908831;genbank:gi:118725095;genbank:GeneID:4555862 Probab=98.44 E-value=1.1e-08 Score=64.17 Aligned_cols=289 Identities=12% Similarity=0.022 Sum_probs=146.0 Q ss_pred CCCCccCcccccc--------Cc-ccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEeccccc-ee Q lcl|Aclame:pro 1 MANATGGQQIGAN--------QG-KGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVMGRT-KG 70 (347) Q Consensus 1 m~~~~~~~~~~~~--------~~-~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~iG~~-t~ 70 (347) +..... +..+. .. ...+.|. .|.-+.+..++.+.....|.++.+.++.++. |+ .+|++.... .+ T Consensus 57 ~~~~~~--~~lt~~e~~~~~~~~~~~~~~gg--~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~-~~-~~i~~~~~~~~a 130 (381) T protein:vir:10 57 SLPKSA--QSLSANQRSFFMDINKNVNYKEE--KLLPEETIDRIFEDLTTNHPLLADLGIKNAG-LR-LKFLKSETSGVA 130 (381) T ss_pred HhccCc--ccccHHHHHHHHHHhcccCCCCc--eecCHHHHHHHHHHHHhhccceeheeeEecC-cc-eEEEEecCCcce Confidence 111000 00000 00 0111122 2556999999999999999999998887764 44 466655433 33 Q ss_pred eeecCCCCCCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccc Q lcl|Aclame:pro 71 YYLAPGENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAA 150 (347) Q Consensus 71 ~~~~~g~~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~ 150 (347) .-..-+..+..+ .+.+..+++|..-++ +.-..|..-=...+.+|+.+.+.++.++++++..|++++. +-....| T Consensus 131 ~w~~e~~~~~~~-~~~~f~~i~l~~~kl-~~~~~is~elL~Ds~~~ie~~i~~~la~~~a~~~~~a~i~----G~G~~qP 204 (381) T protein:vir:10 131 VWGKIYGEIKGQ-LDAAFSEETAIQNKL-TAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLK----GTGKDQP 204 (381) T ss_pred eeeccccccccc-ccccceeeeecceeE-EeechhhHHHhhcCHHHHHHHHHHHHHHHHHHHhhheeEe----ccCCCCc Confidence 332222333322 234556666655554 4444454322333667899999999999999999987763 1100000 Q ss_pred cccccCcccCceeeeecccc----cccchhhHHHHHHHHHHHHHHHHhhc----cC-CCCCCEEEEChHHHHHHhcchhh Q lcl|Aclame:pro 151 SNENIAGLGQAVVLNIGAAA----DLVDVEARGKAILKGLTLARARLTKN----YV-PAGDRRFYCAPEDYSAILSALMP 221 (347) Q Consensus 151 ~~~~~~g~~~~~~i~~~~~~----~~~~~~~~~~~i~~~l~~a~~~Lde~----~V-P~~gR~~vv~P~~~~~Ll~~~~~ 221 (347) -+.............+... ..+....+...+++.|.++...|... .. +..+-++++.|..++.|+.-... T Consensus 205 -~Gil~~~~~~~~~~~g~~~~~~~~~t~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~l~~~~~~ 283 (381) T protein:vir:10 205 -IGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTH 283 (381) T ss_pred -eeeeeccCcccccccccccccccccccccccchhhHHHHHHHHHhhccccccccccccCceEEEEccccHHhhcccccc Confidence 0000000000011111000 00000111222344454444444322 22 34445678999988887643322 Q ss_pred hhhhccccccccccceEEE--eceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhh Q lcl|Aclame:pro 222 NAANYAALIDPETGNIRNV--MGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAV 299 (347) Q Consensus 222 ~~~~~~~~~~~~~G~v~~i--~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~ 299 (347) .+ .+|..... .|.+|++|+.+|... -..+||+.. +++ T Consensus 284 ~~---------~~G~~v~~l~~g~~vv~s~~~p~~~-------------------------iifgDfs~Y--~i~----- 322 (381) T protein:vir:10 284 LN---------ANGVYVTALPFNLNVIESTVQEAGK-------------------------VLTYVKGLY--DGY----- 322 (381) T ss_pred CC---------CCCceeecCCCCceEEecCCCCcCc-------------------------EEEEecccE--EEE----- Confidence 11 13332222 366789999888421 122455442 233 Q ss_pred hhhhhhheeeccccchhhH---hhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 300 GTVKLKDMALERARRPEFQ---ADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 300 ~tv~~~~~~~e~~~~~~~~---~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) ....++++... ..++ ...+++.+.++.++++|++++++..+-.. T Consensus 323 ---~r~~~~i~~~~-~~~~~~d~~~f~a~~r~dg~~~~~~A~~v~~l~~~~ 369 (381) T protein:vir:10 323 ---LAGGINVQKFK-ETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKG 369 (381) T ss_pred ---EecccEEEeec-hhHhhcCCeEEEEEEEEcCEEecCceEEEEEEEecC Confidence 23334444332 2222 23688999999999999999987765544 No 168 >protein:vir:80128 Length: 466 # NCBI annotation: Phage capsid protein # Family: family:all:635 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425603;genbank:gi:155042936;genbank:GeneID:5469556 Probab=98.41 E-value=4.9e-09 Score=66.14 Aligned_cols=297 Identities=12% Similarity=0.071 Sum_probs=140.0 Q ss_pred CCCC------------ccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEeccccc Q lcl|Aclame:pro 1 MANA------------TGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVMGRT 68 (347) Q Consensus 1 m~~~------------~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~iG~~ 68 (347) |... ..............+.++-..+.-+.+...+.+.....+.+++.+++..+.+ .++++.-+.. T Consensus 123 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~vP~~~~~~i~~~l~~~~~l~~~~~v~~~~g--~~~~~~~~~~ 200 (466) T protein:vir:80 123 MPYEQRAALIARSEVKEFLAQVRTLAQQKRAVSGAELTIPDVMLELLRDNMHRYSKLISKVRLRPLKG--TARQNIAGAI 200 (466) T ss_pred hhhhhHHHHHHHHHHHHHHHHHHHHhhhhhhhccccccccHHHHHHHHHhhhhhhhhhhheeeeecCc--eeEeeeecCC Confidence 0000 0000000000000011111124457788888888888888888888777653 3566655543 Q ss_pred eeee-ecCCCCCCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhc Q lcl|Aclame:pro 69 KGYY-LAPGENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNL 147 (347) Q Consensus 69 t~~~-~~~g~~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~ 147 (347) .... ...|..++. .++...++++.+.++ +.-..|.+-=...+.+|+-+.+..+.+++|+...|+.|+. +... T Consensus 201 ~~a~wv~E~~~~~~--~~~~f~~i~~~~~k~-~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~ail~----G~G~ 273 (466) T protein:vir:80 201 PEGVWTEAVANLNE--LSLSFSQIEVDGYKV-GGFIPIPNSTLEDSDLNLADEILDAIGQAIGFALDKAILY----GTGT 273 (466) T ss_pred cceeeccccccccc--ccccccceeecceee-eeehhhhHHHHhcchHHHHHHHHHHHHHHHHHHHhhheee----ccCC Confidence 3222 233444432 245667766666665 3334454333334557899999999999999999998763 1100 Q ss_pred ccccccccCcccCceeeeec--ccccc--cchh---------hHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHH Q lcl|Aclame:pro 148 PAASNENIAGLGQAVVLNIG--AAADL--VDVE---------ARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSA 214 (347) Q Consensus 148 a~~~~~~~~g~~~~~~i~~~--~~~~~--~~~~---------~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~ 214 (347) ..+ -+........+..... .+... .++. ..+...+..++.+. .+.+.......-++++++..+.. T Consensus 274 ~~P-~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~w~~~~~~~~~ 351 (466) T protein:vir:80 274 KMP-VGIVTRLAQTTQPPNWGTKAPAWTNLSTTNLLKIDPTGKSAEEFFSELVLKL-SKARANYSNGMKFWAMSSNTHAV 351 (466) T ss_pred CCc-ceeeecccccccccccccccccccccchhhhhhhhhhccchhhHHHHHHHHH-HhhhccccCCceeEEecchhHHH Confidence 000 0000000000000000 00000 0000 00011111111111 11112221222345788888888 Q ss_pred Hhcchhhhhh--hccccccccccceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEE Q lcl|Aclame:pro 215 ILSALMPNAA--NYAALIDPETGNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGL 292 (347) Q Consensus 215 Ll~~~~~~~~--~~~~~~~~~~G~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l 292 (347) |+.-.-..+. .+.. ...++ ..++|.+|+.|+++|.... +.++|... . T Consensus 352 l~~~~~~~~~~g~~~~--~~~~~--~~i~G~pvv~s~~~~~~~~-------------------------~~g~~~~y--~ 400 (466) T protein:vir:80 352 LMSKAITFNSAGALVA--SLNNT--MPIVGGDIVILDFIPDNDI-------------------------IGGYGSLY--L 400 (466) T ss_pred hhcccccccCCccccc--cCCCc--ccccccceeecCccCccce-------------------------eeeccccE--E Confidence 7654322111 1211 11122 2588999999999985320 11222221 1 Q ss_pred eechhhhhhhhhhheeeccccchhhH--hhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 293 FNHRSAVGTVKLKDMALERARRPEFQ--ADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 293 ~~h~~A~~tv~~~~~~~e~~~~~~~~--~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) ++ ..++++++...+.... ...+++.+.+++++++|++.+.+..+..+ T Consensus 401 i~--------~r~~~~i~~~~~~~f~~d~~~~r~~~r~dg~~~~~~afv~~~~~~~~ 449 (466) T protein:vir:80 401 LA--------ERADIKLAQSEHVRFIEDQTVFKGTARYDGKPVFGEGFVAVNIANAN 449 (466) T ss_pred EE--------eecceEEEechhhhhhcCcEEEEEEEEEccEEeccCceEEEEecCCC Confidence 22 2234444443322211 23578889999999999999999766655 No 169 >protein:vir:9643 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795405;genbank:gi:28876178;genbank:GeneID:1257724 Probab=98.40 E-value=2.2e-08 Score=62.57 Aligned_cols=295 Identities=14% Similarity=0.063 Sum_probs=146.1 Q ss_pred CCCCccCccccccC--------cccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEecccc-ceee Q lcl|Aclame:pro 1 MANATGGQQIGANQ--------GKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVMGR-TKGY 71 (347) Q Consensus 1 m~~~~~~~~~~~~~--------~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~iG~-~t~~ 71 (347) +.+.+...+..+.- -+.+..++--.|.-+.+..++.+...+.|.++.+.++.++. |. ++|++-.. .++. T Consensus 57 ~~~~~~~~~~lt~ee~~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~-~~-~~i~~~~~~~~a~ 134 (377) T protein:vir:96 57 MFDLRDKNRELTAEEIKFFNDIDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTS-LR-LKALTAETSGTAV 134 (377) T ss_pred HHHhccCCcccCHHHHHHHHHHHhcCCCCCCceecCHHHHHHHHHHHHhhhhhhhhceeEecC-Cc-eEEEEecCCccee Confidence 00000000000000 00111122122445889999999999999999999887764 33 56665433 3433 Q ss_pred eecCCCCCCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccc-- Q lcl|Aclame:pro 72 YLAPGENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPA-- 149 (347) Q Consensus 72 ~~~~g~~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~-- 149 (347) -..-+.++..+ .+++..+++|..-++ +.-..|..-=...+.+|+-+.+.++.++++++..|+.++.= .+...+. T Consensus 135 wv~e~~~~~~~-~~~~f~~i~l~~~kl-~~~~~is~~ll~ds~~~le~~i~~~l~~~~~~~~~~a~i~G--~G~~~P~Gi 210 (377) T protein:vir:96 135 WGDIFGEIKGQ-LKQAFKEQDFSQFKL-TAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKG--NGLLQPVGL 210 (377) T ss_pred Eeecccccccc-cCccceeEeeeeeeE-EeechhhHHHhhcchhhHHHHHHHHHHHHHHHHHhhceEec--cCCCcceee Confidence 33333334322 235566666655444 33344543333346778999999999999999999987630 0100000 Q ss_pred ------ccccccCcccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccC--C---CCCCEEEEChHHHHHHhcc Q lcl|Aclame:pro 150 ------ASNENIAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYV--P---AGDRRFYCAPEDYSAILSA 218 (347) Q Consensus 150 ------~~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~V--P---~~gR~~vv~P~~~~~Ll~~ 218 (347) .......+...+......... ......+...+++.+..+...+....- | ...-+++++|..|..++.. T Consensus 211 l~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~ 289 (377) T protein:vir:96 211 LKDLSQPTVDQSTGRDITTYKTDKEAI-ADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAK 289 (377) T ss_pred eeccccccccccccccccceeeccccc-cccccCChhHHHHHHHHHHHhhccccccccccccCceEEEEchhhHHhcccc Confidence 000000000000000000000 000112233344444444444443321 1 1223467888888776432 Q ss_pred hhhhhhhccccccccccceEEEe--ceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeech Q lcl|Aclame:pro 219 LMPNAANYAALIDPETGNIRNVM--GFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHR 296 (347) Q Consensus 219 ~~~~~~~~~~~~~~~~G~v~~i~--G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~ 296 (347) ....+ .+|.-..+. |++|++|+.+|... -..+||++. ++ T Consensus 290 ~~~~~---------~~G~~~~~l~~p~~v~~s~~~p~~~-------------------------i~fgdf~~Y--~i--- 330 (377) T protein:vir:96 290 FTSRN---------QFGEYVTVLPHGITILESLAVETGK-------------------------AIAFVANRY--DA--- 330 (377) T ss_pred ccccC---------CCCCceeccCCCceEEecCCCCccc-------------------------EEEEEcCcE--EE--- Confidence 11111 234444554 45678888888421 012344331 22 Q ss_pred hhhhhhhhhheeeccccchh--hHhhHHhhhhhhcCcccccceEEEEEecCC Q lcl|Aclame:pro 297 SAVGTVKLKDMALERARRPE--FQADQIIGKYAMGHGGLRPEAAGALVFTPA 346 (347) Q Consensus 297 ~A~~tv~~~~~~~e~~~~~~--~~~d~i~~~~~~G~~~lRPe~~~~l~~~~a 346 (347) +....++++...+.. +-...+++.+.++.++++|+++++|..+-- T Consensus 331 -----~~r~~~~i~~~~~~~~~~d~~~f~~~~r~dG~~~d~~a~~vl~l~~~ 377 (377) T protein:vir:96 331 -----FMATASTIEEYDQTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) T ss_pred -----EEecccEEEeehhhhhhcCCeEEEEEEEEcCEEecCCcEEEEEEecC Confidence 233444555443221 223458899999999999999999988777 No 170 >protein:vir:106647 Length: 303 # NCBI annotation: ORF011 # Family: family:all:1178 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239493;genbank:gi:66395226;genbank:GeneID:4555801 Probab=98.31 E-value=1.8e-07 Score=57.54 Aligned_cols=280 Identities=11% Similarity=0.048 Sum_probs=142.6 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEecc----ccceeeeecCC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVM----GRTKGYYLAPG 76 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~i----G~~t~~~~~~g 76 (347) |+--++.+. .+..+ .+-+.| |.++|+.-+.+-++ +++.+|..++..|.+++.+.. -....++...| T Consensus 1 M~~e~nl~~-~~dL~---~a~siD--F~~~f~~~i~~L~~----~LGv~r~~pla~Gt~iktyK~~~~~y~gda~dVaEG 70 (303) T protein:vir:10 1 MSAENNLIN-VEALG---KAKSID--FANKLGVGLNKLFE----ALAIQNKIPMNVGSALKQYRFKVEDSEKPNGDVAEG 70 (303) T ss_pred CCCCcCCcc-hhhcc---cceeeh--hhhhhhhhHHHHHH----HhhhhccccccCCceeeeeeeeceeeccccccccCC Confidence 665444322 13333 223455 99999988887664 555666666777877766543 11223567778 Q ss_pred CCCCCCCCCCCC---CceEEEEeeeeecchhhccHHHH-H-hCc-chHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccc Q lcl|Aclame:pro 77 ENLDDKRKDIKH---SEKVIQIDGLLTSDVLIYDIEDA-M-NHY-DVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAA 150 (347) Q Consensus 77 ~~~~~~~~~~~~---~~~~l~ID~~~~~~~~Vdd~D~~-q-~~~-D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~ 150 (347) ..|+.+ .+.. ...++++.++. . .+ -||+ | .-| |...+.-+++..++++++|..++..+-.+..... T Consensus 71 e~Ipls--kvt~~~~~t~~~~~kK~r--K-~t--TdEAIqlsGyg~aVgetd~qL~~~Iq~kIdnd~~~~lktaT~t~~- 142 (303) T protein:vir:10 71 DVIPLT--KVTREQVDITELQFAKYR--K-ST--SAEAIQAHGYDLAINQTDNEMIKYVQKKFRAKFFETLKSAIENGK- 142 (303) T ss_pred cccchh--hheeeecceEEEEeeccc--c-cc--cHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhhcccccc- Confidence 888754 3443 34677776643 3 33 3444 4 444 4899999999999999999999876532210000 Q ss_pred cccccCcccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhh--hhccc Q lcl|Aclame:pro 151 SNENIAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNA--ANYAA 228 (347) Q Consensus 151 ~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~--~~~~~ 228 (347) . +.......+....+ ++.+..-...++|.++ .-+++|+|...+.||.+..... ..| | T Consensus 143 ---------------~-t~~t~~s~~glq~A-l~~~~~kl~~~~ed~~---~~V~FvNP~Daa~yl~~A~i~~~~t~f-G 201 (303) T protein:vir:10 143 ---------------R-TNKTKLSAENLQGA-LSKGRANLSVLLDDEI---TPIAFVNPNDTAEYLANGFINSTGAQF-G 201 (303) T ss_pred ---------------c-ccceeecHHHHHHH-HHhhhhhccccccccc---cEEEEEchHHHHHHhhcCCcchhhhhh-h Confidence 0 00000111111111 1111111122344443 2488899999999998776642 233 2 Q ss_pred cccccccceEEEeceeEEEecccccccccccccc---CccccccccccccccccccccccccceeEEeechhhhhhhhhh Q lcl|Aclame:pro 229 LIDPETGNIRNVMGFEVIEVPHLTVGGAGDNNPA---DGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLK 305 (347) Q Consensus 229 ~~~~~~G~v~~i~G~~V~~sn~lp~~~~~~~~~~---~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~ 305 (347) ..-+ .++.|+.|+.|+.+|.+....+..- ..+.+..+ .-...-.|..|.++.+|+.-.+ ... T Consensus 202 ~n~L-----~nfLG~~II~S~kv~~G~~~~T~~~Ni~~ay~~~~g----~l~~~f~~t~D~tglIGv~h~~------~~~ 266 (303) T protein:vir:10 202 VNLL-----TPYVGVKIVEFADVPQGEVWMTVAENLNVAYANPRG----ELSRAFAFATDATGFVGVLHDI------QPQ 266 (303) T ss_pred hhhh-----hhhhcceEEEeccCCCceEEEeeccceEEEEecCch----hhhhhhhhccccccceEEEecc------ccc Confidence 2112 2489999999999998754332211 00111111 0111223455555554433110 111 Q ss_pred heeeccccchhhHhhHHhhhhhhcCc--ccccceEEEEEecCCC Q lcl|Aclame:pro 306 DMALERARRPEFQADQIIGKYAMGHG--GLRPEAAGALVFTPAA 347 (347) Q Consensus 306 ~~~~e~~~~~~~~~d~i~~~~~~G~~--~lRPe~~~~l~~~~aa 347 (347) .++. ..+..+|.. +=|+|+++...+.+.= T Consensus 267 ~~t~-------------eT~~~~~~~lfpE~~dgiv~~ti~~~e 297 (303) T protein:vir:10 267 RLTS-------------DTIYASAISMFPENIDAVIKVTIKKDE 297 (303) T ss_pred eeee-------------hhHhHhHHHhcccccceEEEEEEeccc Confidence 1111 112222222 2266777666553322 No 171 >protein:vir:78350 Length: 383 # NCBI annotation: Cps # Family: family:all:635 # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468644;genbank:gi:157325222;genbank:GeneID:5601696 Probab=98.30 E-value=4.9e-08 Score=60.68 Aligned_cols=291 Identities=11% Similarity=0.004 Sum_probs=138.1 Q ss_pred CCCC--ccCccccccC-----cccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEeccccceeeee Q lcl|Aclame:pro 1 MANA--TGGQQIGANQ-----GKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVMGRTKGYYL 73 (347) Q Consensus 1 m~~~--~~~~~~~~~~-----~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~iG~~t~~~~ 73 (347) +... ...++--.+. -.+.++|. .|.-+.|..++.+...+.|.++.++++.++ +|+ .+||+......... T Consensus 64 ~~~~g~~~lt~~e~~~~~~~~~~~~~~gg--~lvP~~~~~~I~~~l~~~s~l~~~~~v~~~-~~~-~~i~~~~~~~~a~w 139 (383) T protein:vir:78 64 SASRTDKNITNEEIKFFNDINKEVGYKEE--TLLPQTVVDEIFEDLTTEHPFLASIGMRTT-GLR-TKFLKSETSGVAVW 139 (383) T ss_pred HhcCChhhhhHHHHHHHHHHhccCCCCCc--cccCHHHHHHHHHHHHhhccceeeeeeEec-CCc-eEEEEEcCCcceEE Confidence 0000 0000000000 00111222 245699999999999999999999888776 455 57777655433332 Q ss_pred -cCCCCCCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccc Q lcl|Aclame:pro 74 -APGENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASN 152 (347) Q Consensus 74 -~~g~~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~ 152 (347) .-+.++..+ .+.+..+++|..-++ +.-..|..-=...+.+|+.+.+.++.++++++..|++++. +.... .+- T Consensus 140 ~~e~~~~~~~-~~~~f~~i~l~~~kl-~~~i~is~ell~Ds~~~ie~~i~~~l~~~~a~~~~~a~i~----G~G~~-qP~ 212 (383) T protein:vir:78 140 GKIFGEIKGQ-LDATFSDEESIQNKL-TAFVVVPKDLEKFGPAWVKRFVVTQIEEAFAVALESAYIV----GDGND-KPI 212 (383) T ss_pred eecccccccc-cCcceeeEeecceee-EeeccchHHHhhccHHHHHHHHHHHHHHHHHHHHhhheEe----ccCCC-Cce Confidence 222233322 235667766666554 4445554322333567899999999999999999998763 11000 000 Q ss_pred cccCcccCceeeeecccccccc----hhhHHHHHHHHH---HHHHHHHhhccC-CCCC-CEEEEChHHHHHHhcchhhhh Q lcl|Aclame:pro 153 ENIAGLGQAVVLNIGAAADLVD----VEARGKAILKGL---TLARARLTKNYV-PAGD-RRFYCAPEDYSAILSALMPNA 223 (347) Q Consensus 153 ~~~~g~~~~~~i~~~~~~~~~~----~~~~~~~i~~~l---~~a~~~Lde~~V-P~~g-R~~vv~P~~~~~Ll~~~~~~~ 223 (347) +.............+.....+. ...+...+++.+ ......+....- ...+ -..+++|..|+.++...... T Consensus 213 Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~- 291 (383) T protein:vir:78 213 GLNRKVGKGSTVVDGVYAEKAATGTLTFANPKTTVNELTDVYKYHSVKENGHPLNVAGKVTLLVNPTDAWDVKKQYTSL- 291 (383) T ss_pred eeeeccCCcccccccccccccccchhhhhhhHHHHHHHHHHHhccchhcccchhhhcCceEEEEcCcchhhhccchhcc- Confidence 0000000000011110000000 001111111111 111111111110 1111 23466776555443211110 Q ss_pred hhccccccccccceEEEe--ceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhh Q lcl|Aclame:pro 224 ANYAALIDPETGNIRNVM--GFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGT 301 (347) Q Consensus 224 ~~~~~~~~~~~G~v~~i~--G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~t 301 (347) ..+|....+. |..|++|+++|.... ..+||+.. ++ T Consensus 292 --------~~~G~~~t~l~~~~~iv~s~~~p~~~i-------------------------ifgdfs~Y--~i-------- 328 (383) T protein:vir:78 292 --------NANGVYVTALPFNLNIIESLFVPEKKA-------------------------ISYVAERY--DA-------- 328 (383) T ss_pred --------CCCCceeeecCCCceEEecCCCCcccE-------------------------EEeeccce--EE-------- Confidence 1234444554 455888988884210 11344432 22 Q ss_pred hhhhheeeccccchhhH---hhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 302 VKLKDMALERARRPEFQ---ADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 302 v~~~~~~~e~~~~~~~~---~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) +..++++++.. +..+| ...+++.+.++.++++|++.++|..+-+. T Consensus 329 ~~r~~~~i~~~-~~~~f~~d~~~f~~~~r~dG~~~~~~A~~vl~~~~~~ 376 (383) T protein:vir:78 329 LIGGPLDIGTY-DQTLAIEDLNLYAAKQFAYGKAKDDKAAAVWTLNINP 376 (383) T ss_pred EecccceEEec-chhhhhcCceEEEEEEEEcCEEecCCeEEEEEEEecC Confidence 23445555543 33333 24689999999999999999887765444 No 172 >protein:vir:4159 Length: 315 # NCBI annotation: structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046968;genbank:gi:9630538;genbank:GeneID:1261712 Probab=98.24 E-value=5.6e-08 Score=60.34 Aligned_cols=298 Identities=12% Similarity=0.058 Sum_probs=145.6 Q ss_pred CCCCccCccccccCcccCccccHHHHH--HHHHhHHHHHHHHHHHhhhcccccccccCCceEEEeccccce--eeeecCC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALF--LKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVMGRTK--GYYLAPG 76 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~--ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~iG~~t--~~~~~~g 76 (347) |-+-.+ .+.-+.-...|...-+ -++++ ++++.-++.|.++...++.+..++.+..|+.+|-.. ......+ T Consensus 7 ~~~~~~-----~~~~k~~t~~d~~Gg~l~P~~~~-~~i~~~~e~s~~l~~~~vi~~~~~~~~~i~~~g~~~~~~~g~~~~ 80 (315) T protein:vir:41 7 IRGGKP-----FEIVPKIDVPDLGRGVLSVDRFG-EFVKAVRDSAVIIPEARIDNALKSYEKDISRLSLVLDVGPGRDET 80 (315) T ss_pred hhcCCh-----hhhhhhcCCcCCCCceechHHHH-HHHHHHHhhhhhhhhceeeeccccccccccccccCcccccccccc Confidence 222111 1111111112222222 35654 566677778889988887544445556677765331 1111111 Q ss_pred CCC-CCCCCCCCCCceEEEEeeeeecchhh--ccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccc Q lcl|Aclame:pro 77 ENL-DDKRKDIKHSEKVIQIDGLLTSDVLI--YDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNE 153 (347) Q Consensus 77 ~~~-~~~~~~~~~~~~~l~ID~~~~~~~~V--dd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~ 153 (347) .+. ..+...++..++++..-++. +...| +-+|+..-..|+.+.+..+.++++++..+.+++.= .++ ...+... T Consensus 81 ~~~~~~~~~~~~f~~~~l~~~~l~-~~~~it~elL~D~~~~~~~e~~l~~~~a~~~a~~~~~~~~nG--dg~-s~~p~~~ 156 (315) T protein:vir:41 81 GQKLAPPESTAEVKTNTLYMREMV-TKVVIHEDAIEDNIEGKAFEQKIVTLLGEGISYVLEKYYLHG--DTS-SSDPLLR 156 (315) T ss_pred cCcCCCCCCccccceeeeceeeee-eeccccHHHHHhhhccccHHHHHHHHHHHHHHHHHHHHhhcc--CCc-CcCcccc Confidence 111 11112355566666666553 33345 22333322469999999999999999888766531 000 0000001 Q ss_pred ccCcccC---ceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCC-CCCEEEEChHHHHHHhcchhhhhhhcccc Q lcl|Aclame:pro 154 NIAGLGQ---AVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPA-GDRRFYCAPEDYSAILSALMPNAANYAAL 229 (347) Q Consensus 154 ~~~g~~~---~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~-~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~ 229 (347) ...|+-. ............ .....+.|.++...|..+.--. ..-.+++++..+..|.+- +-.+..|.++ T Consensus 157 ~~~G~l~~a~~~~~~~~~~~~a------~~~~~d~l~~l~~sl~~~yr~~~~~~~~imn~~t~~~~rkl-k~~~g~~lw~ 229 (315) T protein:vir:41 157 MSDGWLKLASEKLTESDVDPEA------EDWPMNLFDTMIESLPTPYRNNLPNMKFYVTWDIYRAYRDA-LKGRETGLGD 229 (315) T ss_pred ccccceeccccccccccccccc------ccccHHHHHHHHHhcChHHhhcCCceEEEEcHHHHHHHHHH-hccCCCcccc Confidence 1223211 011100000000 0111344555555555543211 123558899988776542 1223566667 Q ss_pred ccccccceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheee Q lcl|Aclame:pro 230 IDPETGNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMAL 309 (347) Q Consensus 230 ~~~~~G~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~ 309 (347) ..+..|....+.|.+|+.++++|........ -+-+||.+. ..+-..++++ T Consensus 230 ~~~~~g~~~tl~G~PV~~~~~m~~~~~~~~~--------------------ilf~d~~nl----------~~~~~~~i~i 279 (315) T protein:vir:41 230 QALTGANSILYDGRPVQYVPALEALNDGKSR--------------------ALFVVPTQL----------VYGFWRNIKV 279 (315) T ss_pred chhhcCCCceecccceEecccccccCCCCcc--------------------EEEecccce----------EEEeccccEE Confidence 7778888889999999999999853221110 122334332 1233455677 Q ss_pred ccccchhhHhhHHhhhhhhcCcccccceEEEEEecC Q lcl|Aclame:pro 310 ERARRPEFQADQIIGKYAMGHGGLRPEAAGALVFTP 345 (347) Q Consensus 310 e~~~~~~~~~d~i~~~~~~G~~~lRPe~~~~l~~~~ 345 (347) +..|+.......+......|+++.-++++++-...- T Consensus 280 ~~~~~a~~~~~~~~~~~r~d~~~~~~~~~a~~~~~v 315 (315) T protein:vir:41 280 VPDYDAEMRLTKYVASLRTDNHYEDEEGAVSATITV 315 (315) T ss_pred EeeecCCCCceEEEEEEEeceeEEeccceeEeeeeC Confidence 777776554445555566677655444433322222 No 173 >protein:vir:3158 Length: 321 # NCBI annotation: capsid protein gpE # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665929;genbank:gi:22091115;genbank:GeneID:951342 Probab=98.22 E-value=3.2e-08 Score=61.70 Aligned_cols=294 Identities=11% Similarity=-0.022 Sum_probs=144.2 Q ss_pred CCCCccCccccccCcc--cCccccHHH--HHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEeccccceeeeec-- Q lcl|Aclame:pro 1 MANATGGQQIGANQGK--GQSAADKLA--LFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVMGRTKGYYLA-- 74 (347) Q Consensus 1 m~~~~~~~~~~~~~~~--~~~~~d~~a--l~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~iG~~t~~~~~-- 74 (347) ||.-.- .+-..+.-+ .-..+|.+. +....+..++...-++.|.++.+.++.++.+. .-+|+.+|-......+ T Consensus 1 ~~~k~~-~~~l~~~~~~~~~~~~~~~~g~~v~~~~~~~l~~~i~e~s~~l~~i~v~~v~~~-~~~i~~~~~~~~~~~~~~ 78 (321) T protein:vir:31 1 MASRTI-NNDLSRITEKNALTVDDLDAGGTLPDPLWDEFWTDMIEETPLLDAIRTETVGAK-KTRIPTLNIGERHRRPQD 78 (321) T ss_pred CchHHH-HHHHHHHHHhccccccccCCcceeCHHHHHHHHHHHHHhhhhhhhceeeeccCc-ceeeeeeccCCccccccc Confidence 665221 111111111 111233332 22377788888888888889998888776543 3667766533221221 Q ss_pred CCCCCCCCCCCCCCCceEEEEeeeeecchhh--ccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccc Q lcl|Aclame:pro 75 PGENLDDKRKDIKHSEKVIQIDGLLTSDVLI--YDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASN 152 (347) Q Consensus 75 ~g~~~~~~~~~~~~~~~~l~ID~~~~~~~~V--dd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~ 152 (347) .|.... +..++..+++++..-+.. +-..| +-+|+.....|+.+.+....++++++..++.++. +-..+.++. T Consensus 79 e~~~~~-~~~~~~~~~~~~~~~k~~-~~~~it~e~L~d~a~~~d~e~~i~~~ia~~~a~~~~~~~~n----Gd~~~~~~~ 152 (321) T protein:vir:31 79 EGEWNE-NESDVSTGTIDISTEKAT-VAWDLPREVVQENPEGEALADRILNLMTDAWSADVEDLAAN----GDEDAEDSF 152 (321) T ss_pred cccccc-ccccceeeeeeeeeEEEE-eehhccHHHHHhhhcchhHHHHHHHHHHHHHHHHHHhheee----ccccCCCcc Confidence 111111 112244555666665553 33344 2344433356899999999999999998886652 111111100 Q ss_pred -cccCcccC---ceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhh-hcc Q lcl|Aclame:pro 153 -ENIAGLGQ---AVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAA-NYA 227 (347) Q Consensus 153 -~~~~g~~~---~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~-~~~ 227 (347) ....|+-. ....+.+.++.. . -++.|.++...|++..--..+-+++++++.+..+++- +.+. ... T Consensus 153 ~~~n~G~l~~a~~~~~~~~~~~~~----~----~~d~l~~l~~~l~~~yr~~~~~v~im~~~~~~~~~~~--l~~~~~~~ 222 (321) T protein:vir:31 153 ENQNDGFITVAEGDVETIDAADDI----L----DNDLVIRTIAGLDSKYRARMNPALIVSEDQLLSYHYT--LTDRDTPL 222 (321) T ss_pred cccchhhhhhhccccccccccccc----c----CHHHHHHHHHhccHhHhcCCCeEEEechHHHHHHHHH--HhcCCCcc Confidence 01122210 011111111111 1 1355667777777765322234668899887665431 1111 122 Q ss_pred ccccccccceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhhe Q lcl|Aclame:pro 228 ALIDPETGNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDM 307 (347) Q Consensus 228 ~~~~~~~G~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~ 307 (347) +...+..|...++.|++|+.++++|....- -.++.|.+ .+...++ T Consensus 223 ~~~~l~~~~~~tl~G~pvv~~~~mP~~~il-------------------------~t~~~nl~----------~~~~~~~ 267 (321) T protein:vir:31 223 GDNVIMGEADVNPFSFPIIGSGLWPDDKAM-------------------------FTDPQNLI----------YALYRDL 267 (321) T ss_pred ccchhhccccccccceeEEEcCCCCCCcEE-------------------------EeccccEE----------EEEeecc Confidence 334455667778999999999999953211 11233321 1223344 Q ss_pred eeccccchhhH---hhHHhhhhh--hcCcccccceEEEEEecCCC Q lcl|Aclame:pro 308 ALERARRPEFQ---ADQIIGKYA--MGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 308 ~~e~~~~~~~~---~d~i~~~~~--~G~~~lRPe~~~~l~~~~aa 347 (347) ..+..++.... .+.+...+. ++..+-++++++.+.==+-+ T Consensus 268 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ve~~~a~a~~~~i~~~ 312 (321) T protein:vir:31 268 EIDVLTESDKVSERDLHARYFMRGDDDFAIENTEAVVLAEGLGDP 312 (321) T ss_pred EEEEeecCccccccceeeEeeeeeecceeEeccccEEEEecCCcc Confidence 55555443221 122333232 45556667766666522222 No 174 >protein:vir:100632 Length: 381 # NCBI annotation: 77ORF006 # Family: family:all:635 # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958606;genbank:gi:41189521;genbank:GeneID:2743778 Probab=98.20 E-value=1.1e-07 Score=58.77 Aligned_cols=292 Identities=13% Similarity=-0.005 Sum_probs=141.0 Q ss_pred CCCC--ccCccccc------cCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEeccccceeee Q lcl|Aclame:pro 1 MANA--TGGQQIGA------NQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVMGRTKGYY 72 (347) Q Consensus 1 m~~~--~~~~~~~~------~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~iG~~t~~~ 72 (347) +... +..+.--. +.+ ..++|. .|..+.|..++.+.....|.++.+.++.+.. |+ .++++........ T Consensus 57 ~~~~~~~~l~~~e~~~~~~~~~~-t~~~Gg--~lvP~~~~~~I~~~l~~~spir~~a~v~~~~-~~-~~i~~~~~~~~a~ 131 (381) T protein:vir:10 57 SLPKSAQTLSANQRNFFMDINKS-VGYKEE--KLLPEETIDRIFEDLTTNHPLLADLGIKNAG-LR-LKFLKSETSGVAV 131 (381) T ss_pred HhcccccccCHHHHHHHHHHhhc-CCCCCc--eecCHHHHHHHHHHHHhhcceeeeeeeEecC-cc-eEEEeecCCcceE Confidence 0000 00000000 000 111121 2556999999999999999999999887763 44 4555444332222 Q ss_pred ecC-CCCCCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccc Q lcl|Aclame:pro 73 LAP-GENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAAS 151 (347) Q Consensus 73 ~~~-g~~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~ 151 (347) ... +..+..+ .+++.++++|..-++ |....|..-=...+.+|+-+.+.++.++++++..|++++. +-.... + T Consensus 132 W~~e~~~~~~~-~~~~f~~i~l~~~kl-~a~i~is~elL~Ds~~~le~~i~~~la~~~a~~~~~afi~----GdG~~q-P 204 (381) T protein:vir:10 132 WGKIYGEIKGQ-LDAAFSEETAIQNKL-TAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLK----GTGKDQ-P 204 (381) T ss_pred Eeecccccccc-cCccceeEeecceeE-EeeccccHHHHhccHHHHHHHHHHHHHHHHHHHhhceeEe----cccCCC-c Confidence 211 1222222 134555555555444 3334443222223567889999999999999999987762 111000 0 Q ss_pred ccccCcccCceeeeecccccc----cchhhHHHHHHHHHHHHHHHH----hhccC-CCCCCEEEEChHHHHHHhcchhhh Q lcl|Aclame:pro 152 NENIAGLGQAVVLNIGAAADL----VDVEARGKAILKGLTLARARL----TKNYV-PAGDRRFYCAPEDYSAILSALMPN 222 (347) Q Consensus 152 ~~~~~g~~~~~~i~~~~~~~~----~~~~~~~~~i~~~l~~a~~~L----de~~V-P~~gR~~vv~P~~~~~Ll~~~~~~ 222 (347) -+.......+.....+...+. +....+....++.+......+ ..+.. +..+.+++++|..|+.|....... T Consensus 205 ~Gil~~~~~~~~~~~g~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~vmn~~t~~~l~~~~~~~ 284 (381) T protein:vir:10 205 IGLNRQVQKGVSVTDGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHL 284 (381) T ss_pred eeeeecCCccccccccccccccccccccccchhhHHHHHHHHHHhhhhhhccccccccCceEEEEchhhHHhhccccccC Confidence 011110011111111111000 000011112222222222222 11122 344567889999998887543222 Q ss_pred hhhccccccccccceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhh Q lcl|Aclame:pro 223 AANYAALIDPETGNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTV 302 (347) Q Consensus 223 ~~~~~~~~~~~~G~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv 302 (347) + +.+.+.+. --.|.+|++++++|... -..+||++. +++ T Consensus 285 ~----~~G~~v~~---lp~g~~vv~~~~~p~~~-------------------------i~fGDfs~Y--~i~-------- 322 (381) T protein:vir:10 285 N----ANGVYVTA---LPFNLNVIESTVQEAGK-------------------------VLTYVKGLY--DGY-------- 322 (381) T ss_pred C----CCCceeec---CCCCceeEEcCCCCcCc-------------------------EEEEEcccE--EEE-------- Confidence 1 11112111 01478899999988421 022455542 333 Q ss_pred hhhheeeccccchhhH---hhHHhhhhhhcCcccccceEEEEEec----CCC Q lcl|Aclame:pro 303 KLKDMALERARRPEFQ---ADQIIGKYAMGHGGLRPEAAGALVFT----PAA 347 (347) Q Consensus 303 ~~~~~~~e~~~~~~~~---~d~i~~~~~~G~~~lRPe~~~~l~~~----~aa 347 (347) ..+.++++... ..+| ...+++.+.+++++++|++++++..+ ++| T Consensus 323 ~r~~~~i~~~~-~~~~~~d~~~f~a~~r~dG~~~~~~A~~v~~l~~~~~~~~ 373 (381) T protein:vir:10 323 LAGGINVQKFK-ETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPA 373 (381) T ss_pred EecccEEEeec-hhhhhcCceEEEEEEEEcCEEecCCcEEEEEEeecCCccc Confidence 23334444332 2222 23688999999999999999997765 333 No 175 >protein:vir:95963 Length: 395 # NCBI annotation: ORF009 # Family: family:all:635 # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239802;genbank:gi:66395459;genbank:GeneID:5132880 Probab=98.17 E-value=7.9e-08 Score=59.51 Aligned_cols=284 Identities=12% Similarity=0.043 Sum_probs=140.4 Q ss_pred CCCC--------ccCcccc------ccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEeccc Q lcl|Aclame:pro 1 MANA--------TGGQQIG------ANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVMG 66 (347) Q Consensus 1 m~~~--------~~~~~~~------~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~iG 66 (347) +.+. +..+.-- -+.+.+ ++|- .|.-+.+..++.+..++.|.++.++++.++. |+ +.|++.. T Consensus 61 ~~~~~~~~~r~~~~l~~ee~~~~~~~~~~t~-~~gG--~liP~~~~~~Ii~~l~~~s~i~~~~~v~~~~-~~-~~i~~~~ 135 (395) T protein:vir:95 61 VVDNGILAKRSQDPLTSEERKFFNDINYDVG-YTDE--KILPETVVERVFDDLQKDHPLLSKINFQNAG-IK-TRVIKAD 135 (395) T ss_pred HHHHHHHhhcCccccchHHHHHHHHHhhccC-CCCc--eeccHHHHHHHHHHHHhhhhhhhhceeEecC-Cc-eEEEEec Confidence 1000 0000000 000111 1111 1445889999999999999999998877764 44 5777655 Q ss_pred cceeeeec-CCCCCCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|Aclame:pro 67 RTKGYYLA-PGENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLC 145 (347) Q Consensus 67 ~~t~~~~~-~g~~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a 145 (347) ........ -+.++..+ .+++.+++++..-++ +.-..|.+-=...+.+|+-+.+.++.++++++..|+.++. +. T Consensus 136 ~~~~a~w~~e~~~~~~~-~~~~f~~i~l~~~kl-~~~~~iS~ell~ds~~~ie~~i~~~la~~ia~~~~~a~i~----G~ 209 (395) T protein:vir:95 136 PAGQAVWGKVFGEIKGQ-LDAAFREENFTQYKL-TCFVVLPDDLSTFGPAWIERFVRTQIQEAISVALESAIIN----GG 209 (395) T ss_pred CCcceEEeecccccCcc-ccccceeeeeceeeE-EEeecccHHHHhcchhHHHHHHHHHHHHHHHHHHhhheee----cc Confidence 44333222 12233222 235666666655443 3444454333334668899999999999999999998763 11 Q ss_pred hcccccccccCcccCceeeee---------cccccccchhhHHHHHHHHHHHHHHHHhh----cc-CCCCCCEEEEChHH Q lcl|Aclame:pro 146 NLPAASNENIAGLGQAVVLNI---------GAAADLVDVEARGKAILKGLTLARARLTK----NY-VPAGDRRFYCAPED 211 (347) Q Consensus 146 ~~a~~~~~~~~g~~~~~~i~~---------~~~~~~~~~~~~~~~i~~~l~~a~~~Lde----~~-VP~~gR~~vv~P~~ 211 (347) . .....+.|..-.. +..... ....+....++.+.++...+.- .. .......++++|.. T Consensus 210 G-------~~~~qP~Gil~~~~~~~~~~~~~~~~~~-~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~mn~~t 281 (395) T protein:vir:95 210 G-------AAKTQPVGLMKDVNTNSGAVTDKASSGT-LTFADADTTILELNDVLKNLSVDEKGKELKIDGKVALVVNPRD 281 (395) T ss_pred C-------CCCcCceeeeecccccccccccccccch-hhhhhhHhhHHHHHHHHHhhccccccchhhhcCceEEEEcchh Confidence 0 0000011111000 000000 0111111223333333322211 11 11122345788877 Q ss_pred HHHHhcchhhhhhhccccccccccceEEEe--ceeEEEeccccccccccccccCccccccccccccccccccccccccce Q lcl|Aclame:pro 212 YSAILSALMPNAANYAALIDPETGNIRNVM--GFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNV 289 (347) Q Consensus 212 ~~~Ll~~~~~~~~~~~~~~~~~~G~v~~i~--G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~ 289 (347) +..+.. .|... ...|...++. |.+|++++.+|... -..+||+.. T Consensus 282 ~~~~~g-------~~~~~--~~~G~~~~~lg~g~~v~~~~~~p~~~-------------------------i~fgdfs~y 327 (395) T protein:vir:95 282 SWDVQA-------RYTYL--TANGGFVTVLPYNVTIITSEFVPEGK-------------------------LVAFVTDRY 327 (395) T ss_pred hhhcCC-------cceec--cCCCcceeccCCcceEEEcCCCCCCc-------------------------EEEEecccE Confidence 665432 22211 1245555664 66689999998421 012455542 Q ss_pred eEEeechhhhhhhhhhheeeccccchh--hHhhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 290 VGLFNHRSAVGTVKLKDMALERARRPE--FQADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 290 ~~l~~h~~A~~tv~~~~~~~e~~~~~~--~~~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) +++ ...+++++...+.. +-...+++...+|+++++|++++.|..+.+- T Consensus 328 --~i~--------~r~~~~i~~~~~~~~~~d~~~f~~~~r~dg~~~~~~A~~~l~i~~~~ 377 (395) T protein:vir:95 328 --NAV--------RGGGLTVKKFDQTLALEDAVLFTAKTFAYGQPDDNKASAVYDLKVAS 377 (395) T ss_pred --EEE--------EecceEEEeccchhhhCCcEEEEEEEEECCEEeccccEEEEEeeccC Confidence 222 23334444332211 1234578889999999999999988765222 No 176 >protein:vir:98635 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039923;genbank:gi:126011098;genbank:GeneID:4818471 Probab=98.15 E-value=1.5e-07 Score=57.93 Aligned_cols=283 Identities=13% Similarity=0.012 Sum_probs=135.7 Q ss_pred CCCCcc--Ccccc------ccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEec-cccceee Q lcl|Aclame:pro 1 MANATG--GQQIG------ANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPV-MGRTKGY 71 (347) Q Consensus 1 m~~~~~--~~~~~------~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~-iG~~t~~ 71 (347) ++.... .+.-- .....+.+++. .+.-+.+..++.+...+.|.++..+++.++. |+ +++|+ -+..++. T Consensus 59 ~~~~~~~~lt~ee~~~~~~~~~~~~~~~gg--~~vP~~~~~~I~~~l~~~s~i~~~~~v~~~~-~~-~~~~~~~~~~~a~ 134 (377) T protein:vir:98 59 DLRDKNRELTAEEIKFFNDIDKNVGGKDKF--KLLPEETMVQVFDDLVAEHPLLKVINFKNTS-LR-LKALTAETSGTAV 134 (377) T ss_pred HhccCCcccCHHHHHHHHHHHhccCCCCCc--cccCHHHHHHHHHHHHHhhhhhhheeeEecC-cc-eEEEEecCCccee Confidence 111000 00000 00001112222 2455889999999999999999998887764 44 46664 3444443 Q ss_pred eecCCCCCCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccc Q lcl|Aclame:pro 72 YLAPGENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAAS 151 (347) Q Consensus 72 ~~~~g~~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~ 151 (347) -..-+..+..+ .+.+..+++|..-++ |.-..|..-=...+.+|+-+.+.++.++++++..|++++. +- T Consensus 135 w~~e~~~~~~~-~~~~f~~i~l~~~kl-~a~~~is~elL~ds~~~ie~~i~~~la~~~a~~~~~a~i~----G~------ 202 (377) T protein:vir:98 135 WGDIFGEIKGQ-LKQAFKEQDFSQFKL-TAFVVIPKDALKFGPKWIKQFITEQLKEAIAVALELAIVK----GD------ 202 (377) T ss_pred EeecccccCcc-cCccceeEeecceeE-EeeecccHHhhhccHhHHHHHHHHHHHHHHHHHHhhceEe----cc------ Confidence 33332333322 223445544444443 3333443222233667899999999999999999998763 11 Q ss_pred ccccCcccCceeeee-------cccccccchhhHHHHHHHH-----------HHHHHHHHhhccC----CCCCCEE-EEC Q lcl|Aclame:pro 152 NENIAGLGQAVVLNI-------GAAADLVDVEARGKAILKG-----------LTLARARLTKNYV----PAGDRRF-YCA 208 (347) Q Consensus 152 ~~~~~g~~~~~~i~~-------~~~~~~~~~~~~~~~i~~~-----------l~~a~~~Lde~~V----P~~gR~~-vv~ 208 (347) ..+.+.|..-.. .+.............+.+. ..-++..+....+ -..||++ +++ T Consensus 203 ---G~~qP~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~a~~~m~~~t~~~~~klkd~~G~~i~~~n 279 (377) T protein:vir:98 203 ---GLLQPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLTPDNAPKKLVPVMKHLSVNDKKRPLKIAGQVKLILN 279 (377) T ss_pred ---CCCcceeeeecccccccccccccccccccchhhhHhhhhhhchhHHHHHHHHHHHHHHHHHHhhhhccCCceEEEec Confidence 111111111000 0000000000000011100 0001111111111 1346644 467 Q ss_pred hHHHHHHhcchhhhhhhccccccccccceEEEece--eEEEeccccccccccccccCccccccccccccccccccccccc Q lcl|Aclame:pro 209 PEDYSAILSALMPNAANYAALIDPETGNIRNVMGF--EVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQ 286 (347) Q Consensus 209 P~~~~~Ll~~~~~~~~~~~~~~~~~~G~v~~i~G~--~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~ 286 (347) |.-|+.++.... .....|.-..+.|+ .|++|+.+|.... ..+|| T Consensus 280 ~~~~~~~~p~~~---------~~~~~G~~~t~lg~p~~vv~s~~~p~~~i-------------------------~fgdf 325 (377) T protein:vir:98 280 PEDRWALEAQFT---------SRNQFGEYVTVLPHGITILESLAVETGKA-------------------------IAFVA 325 (377) T ss_pred ccchhhcccccc---------ccCCCCccccccCCCceEEecCCCCcccE-------------------------EEEEe Confidence 766665543211 01123444456654 4788888884210 11344 Q ss_pred cceeEEeechhhhhhhhhhheeeccccchh--hHhhHHhhhhhhcCcccccceEEEEEecCC Q lcl|Aclame:pro 287 NNVVGLFNHRSAVGTVKLKDMALERARRPE--FQADQIIGKYAMGHGGLRPEAAGALVFTPA 346 (347) Q Consensus 287 ~~~~~l~~h~~A~~tv~~~~~~~e~~~~~~--~~~d~i~~~~~~G~~~lRPe~~~~l~~~~a 346 (347) +.. ++ +....++++...+.. +-...+++.+.++++++.|++.+.|..+.- T Consensus 326 ~~Y--~i--------~~r~~~~i~~~~~~~~~~d~~~f~~~~r~dg~~~~~~a~~vl~i~~~ 377 (377) T protein:vir:98 326 NRY--DA--------FMATASTIEEYDQTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) T ss_pred cce--eE--------EeecceEEEeechhhhhcCceEEEEEEEEcCEEeccCcEEEEEEecC Confidence 332 22 233445555432221 123458899999999999999999988777 No 177 >protein:vir:79928 Length: 393 # NCBI annotation: major head protein # Family: family:all:30335 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429616;genbank:gi:156564106;genbank:GeneID:5525693 Probab=97.83 E-value=7.5e-07 Score=54.18 Aligned_cols=302 Identities=14% Similarity=0.119 Sum_probs=167.4 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEeccccceeeeecCCCCCC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVMGRTKGYYLAPGENLD 80 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~iG~~t~~~~~~g~~~~ 80 (347) |+.-.++..+.+|---. +++..=|.-+.-++-|.++-.--.+-..++..-+++.|.+-.|+.+|-.-+.+...|.+++ T Consensus 59 m~G~~p~~eV~~~e~mt--t~~a~IliP~vis~v~~Eaaepl~~~~kl~qk~~L~~Grsm~F~~~g~~Ra~~IgEGgE~~ 136 (393) T protein:vir:79 59 MEGETPTNEVNLREFMA--TPSAQILIPRVIVGTMREAAEPLYIGTKMLQKIRLKSGQSMIFPSIGIMRAYDVAEGQEIP 136 (393) T ss_pred hcCCCchhheehhhhhc--CCCcceechhhhhhhhhhcccchhHHHHHHHHHhhhcCcceeccchheeeecccccccccc Confidence 77666655554444332 3333323447777777765333233333333346778999999999988888887888876 Q ss_pred CCCCC-CCCCceEEEEeeeeecchhh--ccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcc-cccccccC Q lcl|Aclame:pro 81 DKRKD-IKHSEKVIQIDGLLTSDVLI--YDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLP-AASNENIA 156 (347) Q Consensus 81 ~~~~~-~~~~~~~l~ID~~~~~~~~V--dd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a-~~~~~~~~ 156 (347) ....+ -+++ .+++-+-++ ...| .|-----+.+|++.-..++++.+|+|+.|+-++++.-.-...+ ..-..... T Consensus 137 ~~sld~~T~d--sv~~~~gK~-G~~Ia~SqEmIsDSg~Dvin~~l~aA~RaMaRkKee~a~n~fk~~ghtvfDa~st~t~ 213 (393) T protein:vir:79 137 EDSIDWQTHE--SPEIRVGKS-GIRLRFTDEMISDSQWDLMSMMIKQAGRAMGRHKEQKAYHQFRSHGHTVFDNYSTNKL 213 (393) T ss_pred ccchhhhcCC--ceeEEechh-hhhhhhHHHHhhcchHHHHHHHHHHHHHHHHhhhHHHHHhhhhcccceeeeccccCcc Confidence 54333 2333 455555543 2233 2222223779999999999999999999999988764322100 00111111 Q ss_pred cccCceeeeecccccccchhhHHHHHHHHHHHHHHH-HhhccCCCCCCEEEEChHHHHHHhcchhhhhhhcccccccc-c Q lcl|Aclame:pro 157 GLGQAVVLNIGAAADLVDVEARGKAILKGLTLARAR-LTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALIDPE-T 234 (347) Q Consensus 157 g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~-Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~-~ 234 (347) +++.|- ..+..-++.-..+.|.++.-. +..-. .+-++++.|=.|+..-+....-....+.-+++- . T Consensus 214 ahptGr---------~~~~~qNGTlSleDllDm~~av~~~hy---t~svi~MHPLAWnv~AKna~me~~~~na~gN~~~~ 281 (393) T protein:vir:79 214 AHTTGL---------DKNGVQNDTFSAEDFLDLIIAVMANEY---TPSDLMMHPLAWTVFAKNELMGSLQANPYGNYPAK 281 (393) T ss_pred ceeecC---------CccccccccccHHHHHHHHHHHhcccC---CcceEEEcCchhhhhhhhhhhcceeeccccccCcc Confidence 222110 011122222234445544322 22222 346889999888888776433221111111110 0 Q ss_pred c-ceEEE-----------eceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhh Q lcl|Aclame:pro 235 G-NIRNV-----------MGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTV 302 (347) Q Consensus 235 G-~v~~i-----------~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv 302 (347) + .-.+. ..|+|+.||-+|..... ..| +|.---.|+++++.-++ T Consensus 282 ~~~ts~algp~~i~~~~~~nlnv~~sPfvp~d~k~--------------~rF------d~~~Vd~NnvgvlLV~D----- 336 (393) T protein:vir:79 282 GAPSSMALGPDSIQGRLPFNFNVNLSPFIPLDKKS--------------RRF------DVYAVDRNNVGVLLVRD----- 336 (393) T ss_pred ccchhhhhchhhhccccccceeEEEeccccccccc--------------cee------eEEEeecCCceEEEEec----- Confidence 0 11112 25999999999865321 011 22222234556665222 Q ss_pred hhhheeeccccchhhHhhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 303 KLKDMALERARRPEFQADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 303 ~~~~~~~e~~~~~~~~~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) +++++.+-|+-+--.-|+-+-+||.+||.--.+++....-+= T Consensus 337 ---~i~tdq~ddk~rdiq~iKl~ERYG~gvLn~gkaiavakNI~~ 378 (393) T protein:vir:79 337 ---DLKTDQWDEKARGLQNIKMIERYGIGILNEGKAIAVAKNISM 378 (393) T ss_pred ---CcceeccccccccceeeeeeeeeceeeeeCCceEEEEeccee Confidence 678888888878777888899999999988777665433222 No 178 >protein:vir:80446 Length: 367 # NCBI annotation: BcepGomrgp07 # Family: family:all:1522 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210227;genbank:gi:146329919;genbank:GeneID:5123555 Probab=97.75 E-value=8.5e-06 Score=48.39 Aligned_cols=296 Identities=11% Similarity=0.043 Sum_probs=152.4 Q ss_pred CCCCccCccccccCcccCccccHHHHHH-HHHhHHHHHHHHHHHhhh-cc-cc-cccc-----cCCceEEEeccccceee Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFL-KVFGGEVLTAFVRRSVTM-DK-HM-VRTI-----QNGKSASFPVMGRTKGY 71 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~i-e~f~geV~~~f~~~s~~~-~~-~~-~rti-----~~G~tv~i~~iG~~t~~ 71 (347) |+.++.-|.. +| +|+ |+|.-.|.+...+.+-|. += +. ...+ .+|+.+.+|..+...-. T Consensus 1 M~~~~~~T~l----------~D---ii~pEvF~~Yv~~~~~e~~~l~qSGiv~~d~~l~~~~~~gG~~v~iPf~~~L~g~ 67 (367) T protein:vir:80 1 MPDFNNQVRL----------VD---AVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSL 67 (367) T ss_pred Ccchhhhhhh----------hh---ccchhhhhHHHhhhhhhhhhhhhcceeecCHHHHHHhhcCCCEEEeeeeccCCCC Confidence 8876543221 22 455 888888887777665333 21 21 1122 58999999999877532 Q ss_pred e--ecCCCCC-CCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcc Q lcl|Aclame:pro 72 Y--LAPGENL-DDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLP 148 (347) Q Consensus 72 ~--~~~g~~~-~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a 148 (347) . |-..+.. ..++..++..+..-.+ ...-.++...|+-..-+--|.|..+..+.+.--.|..-+.+|..|....... T Consensus 68 ~~n~~~d~~~~~~t~~kittg~~~a~v-~~r~kaw~~~Dla~~lsG~dpm~~Ia~qva~yW~r~~q~~Lla~L~Gvf~~~ 146 (367) T protein:vir:80 68 EPNYGSDNPNVEAPIDGLGSGEMKTTK-TWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSN 146 (367) T ss_pred ccccCCCCCcccccccccccchheeee-ehhcccchhhhHHHHhhCchHHHHHHHHHHHHhhhhhHHHHHHHHHHhhccc Confidence 2 1111110 1122334433322111 2334567778999888888999999999888777765555555444333211 Q ss_pred ccc-----------ccccCcccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhc Q lcl|Aclame:pro 149 AAS-----------NENIAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILS 217 (347) Q Consensus 149 ~~~-----------~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~ 217 (347) ... .+...+....+++.+.+.+...+.... .+.|.+|...|.++. +.=-.++|-+..|..|.+ T Consensus 147 ~a~~~~~~~~~~~~~a~~~~~~~~~~~Dis~~t~~~~~~~s----~~~~~~A~~~lGD~~--~~l~~i~mHS~V~~~L~~ 220 (367) T protein:vir:80 147 LAGNFATIKTRGRVPAEVLGTAGDMVIDISGQTNPADAVFN----REAFVDAAFTMGDHV--GSIAAIAVHSMVYKRMTN 220 (367) T ss_pred cccchhhhhhhhccccccccccCceeeeeeccCCCccceec----HHHHHHHHHHhcccc--ccccEEEEchHHHHHHHh Confidence 100 011122233344443333322222222 345777878886643 334678899999999887 Q ss_pred chhhhhhhccccccccccceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechh Q lcl|Aclame:pro 218 ALMPNAANYAALIDPETGNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRS 297 (347) Q Consensus 218 ~~~~~~~~~~~~~~~~~G~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~ 297 (347) ..-+.-..++ + ....|+..+|.+|+....+|....+. ...|. ..+|-+= T Consensus 221 ~~li~~i~~s-d---~~~~i~ty~G~~VIvDD~~Pv~~~~a--------------------~~~yt-------tYlfg~G 269 (367) T protein:vir:80 221 NDEIEFIPDS-K---GQLTIPTYMGKVVIVDDGMPVFGTGA--------------------DKTYL-------SILFGGA 269 (367) T ss_pred ccccccccCC-C---CccccceecceeEEEeCCCcccccCC--------------------CceEE-------EEEEecc Confidence 6422112221 1 13568999999999999999754221 11111 1234444 Q ss_pred hhhhhhhhhee-eccccchhhH-h---hHHhh-----hhhhcCcccccceEE-EEEe----------cCCC Q lcl|Aclame:pro 298 AVGTVKLKDMA-LERARRPEFQ-A---DQIIG-----KYAMGHGGLRPEAAG-ALVF----------TPAA 347 (347) Q Consensus 298 A~~tv~~~~~~-~e~~~~~~~~-~---d~i~~-----~~~~G~~~lRPe~~~-~l~~----------~~aa 347 (347) |++..+..+.. .|..||+... + |.+.. +|.+|.+-.....+. .-.. .|+= T Consensus 270 Ai~~~~~~~~~~~E~~Rd~~~~~~gG~d~L~~Rr~~~~hP~G~s~~~~~v~~~~~~~~~~~~~~~~~sPt~ 340 (367) T protein:vir:80 270 AFGYADGAPQVPVAVGRRELRGNGSGLEYILERKEWIVHPGGFNWLDADVTIPDNTGSPSGITSGPPAITL 340 (367) T ss_pred eeeecccCCccceecccchhhhcCCceEEEEeeeeEEeecceeeecccccccccccccccccccccCCCCh Confidence 44444443322 5777887653 1 33332 233343332211110 0000 0110 No 179 >protein:vir:97397 Length: 517 # NCBI annotation: major capsid protein # Family: family:all:11745 # MgeID: mge:1675 # MgeName: Q54 # Cross-refs: genbank:acc:YP_762590;genbank:gi:115304291;genbank:GeneID:5130600 Probab=97.05 E-value=5.3e-05 Score=44.04 Aligned_cols=282 Identities=13% Similarity=0.040 Sum_probs=114.2 Q ss_pred CCCCccCcc--ccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEecc-ccceeeeecCCC Q lcl|Aclame:pro 1 MANATGGQQ--IGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVM-GRTKGYYLAPGE 77 (347) Q Consensus 1 m~~~~~~~~--~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~i-G~~t~~~~~~g~ 77 (347) +........ .+.+.......+. +-...+...+.+.+...+.+...+++.++. +..++.- ....+..+..|. T Consensus 226 ~~~~~~~~~~~~~~~~~~~~~~~~---~~p~~~~~~i~~~~~~~~~i~~~~~~~~i~---~~~~~~~~~~~~a~~~~eG~ 299 (517) T protein:vir:97 226 SASLTKDPKAAWTAELKERGISGM---PAPAGILKRIQDAVNDEGSLLPFIRHENLP---TLVVGGDNALTQGTGHTTGT 299 (517) T ss_pred Hhcccccccceeeeeccccccccc---ccchHHHHHHHHhhhhhccceeeeeecccc---ceeeecccccceeeeeecCC Confidence 000000000 0000000000000 011223334444454445455554444332 2333321 112233444455 Q ss_pred CCCCCCCCCCCCceEEEEeeeeecch-hhccHHHHHhCcc----hHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccc Q lcl|Aclame:pro 78 NLDDKRKDIKHSEKVIQIDGLLTSDV-LIYDIEDAMNHYD----VRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASN 152 (347) Q Consensus 78 ~~~~~~~~~~~~~~~l~ID~~~~~~~-~Vdd~D~~q~~~D----~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~ 152 (347) ..+. .+++..++++.+-+ +.++ .+..---..+.+| +.+-+..++.+.|+++.++.++. +-- T Consensus 300 ~kp~--s~~tf~~~~~~~~~--ia~~~~~S~qll~Ds~~dd~~~l~s~i~~~l~~~l~~~ee~a~l~----GdG------ 365 (517) T protein:vir:97 300 DKTE--SNITLQTRVLTPQY--VYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIM----GGV------ 365 (517) T ss_pred cccc--cccceeeEEeeHhh--hhhhhhhhHHHHHHhhhccHHHHHHHHHHHHHHHHHHHHHHHHhc----ccC------ Confidence 4432 23555555554433 2222 2222111122344 77788999999999999987763 100 Q ss_pred cccCcccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhccccccc Q lcl|Aclame:pro 153 ENIAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALIDP 232 (347) Q Consensus 153 ~~~~g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~ 232 (347) ... ...+.......... ........+.+.+ ..|.....+..+-.+|++|..|..|.+-.+ .+..|.-.... T Consensus 366 -tg~-~~~gi~~~a~~~~~--~~~~~~~~~~d~i----~~l~~a~~~a~~a~~vmn~~t~~~I~klKD-~~G~Yl~~~~~ 436 (517) T protein:vir:97 366 -TGV-SETQIYPVVGDAWA--TNVTGTTNIQELL----EKLSVATPKAADSTLVIHRNDLAAIRFLKD-KNGNYVFPVGV 436 (517) T ss_pred -CCc-cccccccccccccc--ccccccchHHHHH----HHHHHHhhhccCCEEEECHHHHHHHHHhhc-CCCCeeccCcC Confidence 000 00011000000000 0000011112222 222222222224456899999999876432 34566544445 Q ss_pred cccceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeeccc Q lcl|Aclame:pro 233 ETGNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERA 312 (347) Q Consensus 233 ~~G~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~ 312 (347) ..+.+..++|+.-+. +.++... .. ...-.+|.. + -.+.+..-.. T Consensus 437 ~~~~~~~l~G~~~~~-~~~~~~~---~~---------------~~~~~~y~i--------~---------~~~g~~~~~~ 480 (517) T protein:vir:97 437 SNQTIATHFGFNRLV-QSVAVDE---KT---------------AVSLSGYVT--------N---------GSRGMEFEQG 480 (517) T ss_pred CcccccccCCccccc-cccccCc---ee---------------EeeccccEE--------E---------eecceeeeee Confidence 556666677742221 1121100 00 000011210 0 0111111112 Q ss_pred cchhhHhhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 313 RRPEFQADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 313 ~~~~~~~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) ||-++-.+.+...++.|..++.||.++-.+..|++ T Consensus 481 fd~~~n~~~f~~~~~~~g~i~~~~r~a~~~~~p~~ 515 (517) T protein:vir:97 481 TILVENNKEYLFEMPISGSLEYKGTTAYGTYTPPV 515 (517) T ss_pred eecccCceeEeeeeeeccccccccceEEEEEcCCC Confidence 33333344455667788899999999888877777 No 180 >protein:vir:78387 Length: 349 # NCBI annotation: putative coat protein # Family: family:all:1522 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110837;genbank:gi:134288598;genbank:GeneID:5179650 Probab=95.89 E-value=0.0013 Score=36.42 Aligned_cols=290 Identities=13% Similarity=0.047 Sum_probs=139.7 Q ss_pred CCCCccCccccccCcccCccccHHHHH-HHHHhHHHHHHHHHHHhhhc--ccc-cccc-----cCCceEEEeccccceee Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALF-LKVFGGEVLTAFVRRSVTMD--KHM-VRTI-----QNGKSASFPVMGRTKGY 71 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~-ie~f~geV~~~f~~~s~~~~--~~~-~rti-----~~G~tv~i~~iG~~t~~ 71 (347) ||- ||. .|.- .| +|+|.-.|.+...+.+.|.. .+. ...+ .+|+.+.+|..+...-. T Consensus 1 Ma~--------T~l------~D~i-ipe~~vf~~Yv~~~~~e~~~l~qSGii~~d~~l~~~~~~gG~~~~iPf~~~L~g~ 65 (349) T protein:vir:78 1 MAI--------TTI------GDIV-TGNIPVLASYMTEDPVEKTAFFDSGILTSTPYAAEIANGPSNIANLPFWKAIDTS 65 (349) T ss_pred CCc--------eEE------eeee-ccCHHHHHHHHHHhhHHhhhhhhccceeccHHHHHHhhcCCCEEEeeeeecCCCC Confidence 764 221 2221 11 24677777776666653332 222 1122 46999999999876431 Q ss_pred -e--e-cCCCCCCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhc Q lcl|Aclame:pro 72 -Y--L-APGENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNL 147 (347) Q Consensus 72 -~--~-~~g~~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~ 147 (347) . + .-+..-+.+++.+...+..- +=...-.++...|+-..-+--|.|..+.++.+.--.|...+.++..|...... T Consensus 66 ~e~nv~~D~~~~~~t~~kitt~~~~a-~~~~r~kaw~~~Dla~~lsG~dpm~~Ia~~va~yW~r~~q~~Lia~L~Gvf~~ 144 (349) T protein:vir:78 66 IEPNYSNDVYQDIATPRAIQTGEMMA-RVAYLNEGFGQADLTVELTSQNPLQSVASRLDNFWQRQAQRRLIATALGLYND 144 (349) T ss_pred cccccCCCCcccccccccccccceee-eeeeeccccchhHHHHHhhCchHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcc Confidence 1 1 00111011223344443322 22334556777888888777899999999998888877666666555433221 Q ss_pred ccccccccCcccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhcc---CCCCCCEEEEChHHHHHHhcchhhhhh Q lcl|Aclame:pro 148 PAASNENIAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNY---VPAGDRRFYCAPEDYSAILSALMPNAA 224 (347) Q Consensus 148 a~~~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~---VP~~gR~~vv~P~~~~~Ll~~~~~~~~ 224 (347) ... +.....+....+....+... ..+ ..+.+|..+|...- ....=..+++-+..|..|.+...+ T Consensus 145 ~~~-a~~~~~~~~~~t~d~s~~a~-----~~~----~~~~dA~~~lgda~~Gd~~~~lt~i~mHS~v~~~L~~~~li--- 211 (349) T protein:vir:78 145 NVS-ATDAYHEQNDMVVDVSATLG-----FDA----GAFIDATQTMGDALMGNGGEVLGAIAMHSFVYAQARKAQLI--- 211 (349) T ss_pred ccc-ccchhhhcccceeeeccccC-----CCh----hhhhhhHHHHHHHhccccccceeEEEEchHHHHHHHhhhhh--- Confidence 111 11011111122222211111 112 23555555555541 111125778999999998865433 Q ss_pred hccccccccccceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhh Q lcl|Aclame:pro 225 NYAALIDPETGNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKL 304 (347) Q Consensus 225 ~~~~~~~~~~G~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~ 304 (347) +|.- ..-....|..++|.+|+.+..+|....++ ..+| ...+|-+-|++.... T Consensus 212 ~~i~-~s~~~~~i~ty~G~~VivDD~~Pv~~~g~--------------------~~~y-------ttylfg~GAi~~~~~ 263 (349) T protein:vir:78 212 DFIR-DAENNTMFATYQGYRVIVDDSMTVVGQGA--------------------QRKF-------ISIIFGQGAIGYGEG 263 (349) T ss_pred hhcc-CcccCcccceecCeEEEEeCCCccccCCC--------------------CceE-------EEEEeecceEEEccC Confidence 2321 11124468899999999999999754321 1112 113344444444443 Q ss_pred hhe-eeccccchhhH----hhHHhhhhhhcC--cccccceEEEE-------EecCCC Q lcl|Aclame:pro 305 KDM-ALERARRPEFQ----ADQIIGKYAMGH--GGLRPEAAGAL-------VFTPAA 347 (347) Q Consensus 305 ~~~-~~e~~~~~~~~----~d~i~~~~~~G~--~~lRPe~~~~l-------~~~~aa 347 (347) .+. .+|..||+... .|.+..++.|.- ..+....+.+. ...|+= T Consensus 264 ~~~~~~et~rd~~~g~~~G~d~l~~R~~~~~hp~G~s~~~a~v~~~~~~~~~~sPt~ 320 (349) T protein:vir:78 264 NPVMPLEYEREASRANGGGVETLWTRKTWLLHPFGYRFTSAVITGNGTETIARSASW 320 (349) T ss_pred CCccceeeecccccCCcceeEEEEEeeEEEeeeeeeeeccccccCCccccccCCCCh Confidence 332 25666666432 244444333321 11222222111 111221 No 181 >protein:vir:95512 Length: 693 # NCBI annotation: Putative Clp protease # Family: family:all:62 # ACLAME annotation(s): go:0008236 - serine-type peptidase activity; phi:0000017 - phage prohead/capsid assembly # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293349;genbank:gi:148912770;genbank:GeneID:5228164 Probab=95.56 E-value=0.0015 Score=36.11 Aligned_cols=294 Identities=13% Similarity=0.058 Sum_probs=124.9 Q ss_pred CCCCccCccccccCccc---------------CccccHHHHHHHHHhHHHHHHHHHH-HhhhcccccccccCCceEEEec Q lcl|Aclame:pro 1 MANATGGQQIGANQGKG---------------QSAADKLALFLKVFGGEVLTAFVRR-SVTMDKHMVRTIQNGKSASFPV 64 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~---------------~~~~d~~al~ie~f~geV~~~f~~~-s~~~~~~~~rti~~G~tv~i~~ 64 (347) |+-.-..--...+.|.+ ++++|=-.|+-..-.-.++..|+.. .-++.|.+.++++.=|..+..+ T Consensus 366 ~~L~elAr~~L~~rg~~~~~~~~~~~~~~a~~htTSDFp~IL~~~~nk~l~~~y~~a~~t~~~~~~~~~~~DFk~~~~~~ 445 (693) T protein:vir:95 366 MTLRELARASLVDRGIGVASLNAPQMVGLAFTHTSSDFGLILLDVANKSVLAGWEEAEETFPLWTKSGILTDFKPARRVG 445 (693) T ss_pred CcHHHHHHHHHHhcCCccCCCCHHHHHHHHHhcCcchhHHHHHHHHHHHHHHHHHhhhhHHHHHhccCCCCcccccceee Confidence 11000000000111211 2333432233344445566667665 4667777766665444445555 Q ss_pred cccc-eeeeecCCCCCCCCCCCCCCCceEEEEeee----eecchh-h-ccHHHHHhCcchHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 65 MGRT-KGYYLAPGENLDDKRKDIKHSEKVIQIDGL----LTSDVL-I-YDIEDAMNHYDVRAEYSAQLGEALAIAADGAV 137 (347) Q Consensus 65 iG~~-t~~~~~~g~~~~~~~~~~~~~~~~l~ID~~----~~~~~~-V-dd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~i 137 (347) +|.. ++..+..|.++. ...+....-++.+.++ .+.+-. | ||++ .-..+....|++-++..++.+ T Consensus 446 lg~~~~L~~V~E~gEyk--~~t~~e~~e~~~l~tyG~~~~iTRqaiINDDLg-------a~~~ip~~~g~aA~~~~~~~v 516 (693) T protein:vir:95 446 LGEFSSLRQVREGAEYK--YVTLGERGEQIILATYGELFSITRQAIINDDLQ-------MLSDIPFKLGQAAKATIGDLV 516 (693) T ss_pred cCCCCChhhcCCCCcee--eeecCCccceeehhhcCCeeeecHHhhhccchH-------HHHHHHHHHHHHHHHHHHHHH Confidence 5543 333333333332 1223444445655553 111111 2 3333 445677889999999999998 Q ss_pred HHHHHHhhhcccccccccCcccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhcc----------CCCCCCEEEE Q lcl|Aclame:pro 138 LAEMAKLCNLPAASNENIAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNY----------VPAGDRRFYC 207 (347) Q Consensus 138 l~~l~~~a~~a~~~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~----------VP~~gR~~vv 207 (347) +..|..-......++=+..+| ++..+.+ ...-+ ++.|-.++..|.++. +--..+|++| T Consensus 517 y~~L~~Np~m~DGk~LFhadH--~Nl~tga--~sals--------~~sl~~a~~am~~qk~~~~~~~g~~L~i~P~~llv 584 (693) T protein:vir:95 517 YAVLTGNPAMSDGKTLFHADH--SNLLTGA--ASALS--------IDSLSKAKTQMATQKAQVEKGKGRTLNIRPGFVLT 584 (693) T ss_pred HHHHhcCccccCCcceeeccc--ccccccc--ccccC--------hHHHHHHHHHHHHhhcchhccCCceeecccceEEe Confidence 877653222222222222222 1112111 11111 122333333332222 1123478888 Q ss_pred ChHHHHHHhcchhhhhhhccccccccccceEEEece-eEEEeccccccccccccccCccccccccccccccccccccccc Q lcl|Aclame:pro 208 APEDYSAILSALMPNAANYAALIDPETGNIRNVMGF-EVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQ 286 (347) Q Consensus 208 ~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~v~~i~G~-~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~ 286 (347) +|+...... ++++..+.-..+...|.+.-+.|| +|+..++|...+.+.+-+...-.....+..|-.|.. T Consensus 585 P~~le~~a~---~l~~s~~~~~a~~~~~~~NP~~~~~~vi~~prL~~~s~~~Wyl~a~~~~dtie~~yL~G~~------- 654 (693) T protein:vir:95 585 PVALEDKAN---QIINSESVPGADVNSGIVNPIRAFAQVIGEPRLDDASATAWYMAAKKGSDTIEVAYLDGVD------- 654 (693) T ss_pred cchHHHHHH---HHhccccccccccccccccchhccccccccceecCCCCCceEEecCCCCCeEEEEEecCCC------- Confidence 887665433 344444432233445555556664 788888886333222222111000001111111111 Q ss_pred cceeEEeechhhhhhhhhhheeeccccchhhHhhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 287 NNVVGLFNHRSAVGTVKLKDMALERARRPEFQADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 287 ~~~~~l~~h~~A~~tv~~~~~~~e~~~~~~~~~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) .+.+|..-.-...+=.++-.+=||++++..-++ ++.|=| T Consensus 655 -------------------~P~ie~~~gf~~dG~~~kvr~D~G~~~iD~Rg~---~kn~GA 693 (693) T protein:vir:95 655 -------------------TPYLEQQEGFTVDGVASKVRIDAGVAPLDFRGL---QKSNGA 693 (693) T ss_pred -------------------CCeEeecCCCCcceEEEEEEEeccCceeecccc---ccCCCC Confidence 112222211111122334567788888877664 244444 No 182 >protein:vir:94528 Length: 286 # NCBI annotation: major head protein # Family: family:all:3269 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223889;genbank:gi:62327101;genbank:GeneID:5075544 Probab=95.17 E-value=0.0026 Score=34.74 Aligned_cols=261 Identities=16% Similarity=0.139 Sum_probs=128.2 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccc-cc---cccCCceEEEeccccc--eeeeec Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHM-VR---TIQNGKSASFPVMGRT--KGYYLA 74 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~-~r---ti~~G~tv~i~~iG~~--t~~~~~ 74 (347) |+..|. + --++ .|-|+|.|.+.+-|+.++.|++..- .+ -+.+.++.---....+ .++.|. T Consensus 1 m~t~N~--------n-----~avr-~Y~Kqf~glL~~vf~~qa~F~~~fgglQalDGV~~N~tafsvKt~D~pVVig~Y~ 66 (286) T protein:vir:94 1 MATTNN--------D-----LPVR-VYSKEFLQLLSTVYQAQSVFTPTFGALQALDGVPNNATAFSVKTNDMAVVVGEYS 66 (286) T ss_pred CCCCcc--------c-----ccee-ehhHHHHHHHHHHHhhHHHhhhhhcchhhhhCCCccceEEEEeecCcceEEeccc Confidence 554322 1 1122 4789999999999999999886643 22 2232222211111111 223344 Q ss_pred CCCCC---CCCCCCCCCCce--EEEEeee-eecchh-h-ccHHHHHhCcchHHHHH---HHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 75 PGENL---DDKRKDIKHSEK--VIQIDGL-LTSDVL-I-YDIEDAMNHYDVRAEYS---AQLGEALAIAADGAVLAEMAK 143 (347) Q Consensus 75 ~g~~~---~~~~~~~~~~~~--~l~ID~~-~~~~~~-V-dd~D~~q~~~D~r~~~~---~~~g~aLa~~~D~~il~~l~~ 143 (347) .+.+. .++-+.-...++ .+..|+. .|..-+ | .-+|..-.+-|+....+ +.+++|-.+.+|..+=..|.. T Consensus 67 TdeNv~FGtgTg~SsRFG~rkEi~y~dtdV~Y~~~~~iHEGiD~~TVNnd~~aaVAdRL~lQA~Akt~~~n~~~Gk~ls~ 146 (286) T protein:vir:94 67 TDANTAFGTGTSNSSRFGEMKEVIYADTDVPYTAGWAIHEGLDQMTVNNDLDAAVADRLNLQAQAKTRLFNVAMGEALAT 146 (286) T ss_pred CCCccccccCCccccccCceeeEEeecccccccccchhhhccccccccCChhHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 33332 111111111111 2233322 222222 2 45666666666655554 456777788888755433322 Q ss_pred hhhcccccccccCcccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCC---CCCEEEEChHHHHHHhcchh Q lcl|Aclame:pro 144 LCNLPAASNENIAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPA---GDRRFYCAPEDYSAILSALM 220 (347) Q Consensus 144 ~a~~a~~~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~---~gR~~vv~P~~~~~Ll~~~~ 220 (347) .+. +. .+ +|.+.++..++.+..|.- ...-+.|.|+.|.+|+.+.- T Consensus 147 ~A~---------------------------~t----~~-~D~V~~LF~~as~~yvn~ev~~~~~ayV~~evYnaiiD~~l 194 (286) T protein:vir:94 147 AGT---------------------------DL----GA-VDDVNALFESAVEKYTDLEVIAPVRAYVTASVYNAIIDLAN 194 (286) T ss_pred hhh---------------------------hh----hh-hhhHHHHHHHHHHHhhhhheeeeeEEEEchhHHHHHhcccc Confidence 110 00 00 244444455555544421 12338899999999998875 Q ss_pred hhhhhccccccccccceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhh Q lcl|Aclame:pro 221 PNAANYAALIDPETGNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVG 300 (347) Q Consensus 221 ~~~~~~~~~~~~~~G~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~ 300 (347) .+... +++.++-.-.|.+.-||.|-|.|.--..+. ..+|.++-++ T Consensus 195 ~TsaK-~SsaNiDengi~~FkGf~i~e~P~~~~~g~----------------------------------~aifs~dnig 239 (286) T protein:vir:94 195 VTTAK-NSAVNIDTNGMLSFRGIAITKVPTQYMGGK----------------------------------AVIFAPDNVA 239 (286) T ss_pred ccccc-cceeeeccCCcceecceEEeecchhhccCc----------------------------------eEEEccccce Confidence 55432 233445556677899999998773111100 1223333222 Q ss_pred hhh-----hhheeeccccchhhHhhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 301 TVK-----LKDMALERARRPEFQADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 301 tv~-----~~~~~~e~~~~~~~~~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) ... +.-++.|.+ -|-.+.|-=-||--++.-...+.+..+|.| T Consensus 240 ~aftGIn~aR~IesEdF-----~GValQgAGK~G~~I~edNk~Ai~~~~~k~ 286 (286) T protein:vir:94 240 RVFTGINIARTIQAIDF-----AGVELQGAGKYGTFILDDNKKAIFTATPKA 286 (286) T ss_pred eeeccceeeeeeecccc-----CceeeeccccccccccccCceeEEEeecCC Confidence 111 111122222 233344444456566666677778888888 No 183 >protein:vir:103285 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277465;genbank:gi:71834107;genbank:GeneID:3562396 Probab=95.15 E-value=0.002 Score=35.34 Aligned_cols=274 Identities=13% Similarity=0.031 Sum_probs=127.3 Q ss_pred ccCccccHHHHHH-HHHhHHHHHHHHH----HHhhhccccccc-cc-CCceEEEec---cccceeeeecCC-CCCCCCCC Q lcl|Aclame:pro 16 KGQSAADKLALFL-KVFGGEVLTAFVR----RSVTMDKHMVRT-IQ-NGKSASFPV---MGRTKGYYLAPG-ENLDDKRK 84 (347) Q Consensus 16 ~~~~~~d~~al~i-e~f~geV~~~f~~----~s~~~~~~~~rt-i~-~G~tv~i~~---iG~~t~~~~~~g-~~~~~~~~ 84 (347) -.-..+|....|+ ++++ .|+....+ .-..+.++.+++ +- .-.++.+.. .|..+ -|..+ .+++. . T Consensus 1 ~~~~~a~~~~~f~~~ql~-~id~~v~e~~~~~l~~~~~i~v~~~~~~~~~~~~~~~~~~~G~a~--~~~~~~~dip~--v 75 (296) T protein:vir:10 1 MGVDKADAAGIWTVKQLT-ASLNKAYETEYDQNSVVNLFPVSNEIPGYAKYFEYPVFDGVGIAQ--IVADYTDDLPL--V 75 (296) T ss_pred CcccchhhhHHHHHHHHH-HHHHHHHhhhhcccccceecccccCCCCceeEEEeeeeeccCcee--EeCCCccccce--e Confidence 1112234433455 6666 44443332 235556666654 22 234555443 34443 23222 22322 2 Q ss_pred CCCCCceEEEEeee-eecchhhccHHHHH-hCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccCcccCce Q lcl|Aclame:pro 85 DIKHSEKVIQIDGL-LTSDVLIYDIEDAM-NHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIAGLGQAV 162 (347) Q Consensus 85 ~~~~~~~~l~ID~~-~~~~~~Vdd~D~~q-~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~g~~~~~ 162 (347) ++.-.+....|-.. .-+...+.++..++ ...++-..-...++.++++..|+.++.=. ... .+.|.-... T Consensus 76 ~~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~ka~aA~~~~~~~~n~~~f~G~-----~~~----g~~GLlN~p 146 (296) T protein:vir:10 76 DALATERQGKVFRFGNAFLISIDEIKVGQATGQSLSTRKQSLAFEAHDKLLDKLVWSGS-----TAH----GIPSVFDYP 146 (296) T ss_pred eccceeEEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEeec-----ccc----cceeEeecC Confidence 23344444544442 12344456777664 46888888888999999999998776311 000 111111100 Q ss_pred eeeecccccccchhhHHHHHHHHHHHHHHHHhhc--cCCCCC-CEEEEChHHHHHHhcchhhhhhhccccccccccceEE Q lcl|Aclame:pro 163 VLNIGAAADLVDVEARGKAILKGLTLARARLTKN--YVPAGD-RRFYCAPEDYSAILSALMPNAANYAALIDPETGNIRN 239 (347) Q Consensus 163 ~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~--~VP~~g-R~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~v~~ 239 (347) .++..++.... .....+++.|.++...|.++ .+ .+ -.++|+|+.|..|...- .+.+.+-..-+. .+ T Consensus 147 ~v~~~~~~~~W---~~~t~i~~Di~~~~~~l~~~s~g~--~~p~~l~L~p~~~~~L~~~~--~~~~~t~l~~ik----~~ 215 (296) T protein:vir:10 147 NINNVVSGGSW---SQPTTAVSDITSLLDIIETSTNGQ--HRATHLLLPTTARRIMQNLV--PGTSVSYGEFFR----QN 215 (296) T ss_pred CCccccccCCc---cCHHHHHHHHHHHHHHHHHhhCce--ecceeEEeCHHHHHHHhhcc--CCCCccHHHHHH----Hh Confidence 11111111111 11235688888888766554 22 11 25788999999886421 111111001011 12 Q ss_pred EeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeec--hhhhhhhhhhheeeccccchhh Q lcl|Aclame:pro 240 VMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNH--RSAVGTVKLKDMALERARRPEF 317 (347) Q Consensus 240 i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h--~~A~~tv~~~~~~~e~~~~~~~ 317 (347) ..+++|...+.|...+ ..+ +..++++. ++-+.....++++.-. -.++. T Consensus 216 ~~~l~i~~~~~l~~a~------------~~g-----------------~~~~v~~~~~~~~~~~~v~~~~~~~~-~e~~~ 265 (296) T protein:vir:10 216 NSGVTVEFVQYLNDYN------------GTG-----------------TSAAIAYEKDPNNMAIEIPEATNALP-AQPKD 265 (296) T ss_pred cCCceEEEeeeeccCC------------CCc-----------------ceEEEEEEcCCceEEEEcCcceeeec-ccccC Confidence 2466666666654211 000 11122322 2222112223332221 13333 Q ss_pred HhhHHhhhhhh-cCcccccceEEEE---Eec Q lcl|Aclame:pro 318 QADQIIGKYAM-GHGGLRPEAAGAL---VFT 344 (347) Q Consensus 318 ~~d~i~~~~~~-G~~~lRPe~~~~l---~~~ 344 (347) ..+.+...... |..+.||++++.+ .++ T Consensus 266 l~~~~~~~~~~~Gv~i~~P~ai~~~dGI~~~ 296 (296) T protein:vir:10 266 LHFKIPVTSKATGLIVYRPLTMAVMKGITFA 296 (296) T ss_pred ceEEEeeEeeEEEEEEECCceeEEEeeeecC Confidence 45556666666 5889999999887 555 No 184 >protein:vir:107687 Length: 319 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003898;genbank:gi:45686314;genbank:GeneID:2773027 Probab=94.68 E-value=0.003 Score=34.42 Aligned_cols=294 Identities=12% Similarity=0.038 Sum_probs=130.6 Q ss_pred CCCCccC----ccccc---cCcccCccccHHHHHH-HHHhHHHHHHHHH----HHhhhccccccc-cc-CCceEEEe--- Q lcl|Aclame:pro 1 MANATGG----QQIGA---NQGKGQSAADKLALFL-KVFGGEVLTAFVR----RSVTMDKHMVRT-IQ-NGKSASFP--- 63 (347) Q Consensus 1 m~~~~~~----~~~~~---~~~~~~~~~d~~al~i-e~f~geV~~~f~~----~s~~~~~~~~rt-i~-~G~tv~i~--- 63 (347) |.+++.- ..+-+ +.|.-..+.+..+.|+ ++|. .++....+ .-..+.++.+++ +- .-.++.+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~da~~~~g~~~~~ql~-~id~~v~e~~~~~l~~~~~i~v~~~~~~~~~~~~~~~~~ 79 (319) T protein:vir:10 1 MTTKKFDEADKSNVEMYLIQAGVKQDAAATMGIWTAQELH-RIKSQSYEEDYPVGSALRVFPVTTELSPTDKTFEYMTFD 79 (319) T ss_pred CCCcchhHHhhHHHHHHHhhccchhhhhhhhhhHHHHHHH-HHHHHHHhhhhcceechhhcccccCCCCceEEEEeeeec Confidence 6665521 00111 1122222223334665 4444 55443322 234555666653 22 23344433 Q ss_pred ccccceeeeecCC-CCCCCCCCCCCCCceEEEEeee-eecchhhccHHHH-HhCcchHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 64 VMGRTKGYYLAPG-ENLDDKRKDIKHSEKVIQIDGL-LTSDVLIYDIEDA-MNHYDVRAEYSAQLGEALAIAADGAVLAE 140 (347) Q Consensus 64 ~iG~~t~~~~~~g-~~~~~~~~~~~~~~~~l~ID~~-~~~~~~Vdd~D~~-q~~~D~r~~~~~~~g~aLa~~~D~~il~~ 140 (347) .+|..+. |..+ .+++. .++.-.+....|-.. .-+.+.+.+++.+ +...++-..-...+..+++++.|+.++.= T Consensus 80 ~~G~a~~--~~d~~~dip~--v~~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i~f~G 155 (319) T protein:vir:10 80 KVGTAQI--IADYTDDLPL--VDALGTSEFGKVFRLGNAYLISIDEIKAGQATGRPLSTRKASACQLAHDQLVNRLVFKG 155 (319) T ss_pred cccceee--ecCccccccc--eeccceeeEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEee Confidence 3454432 2221 22322 223344444444442 2233445666666 46788888888899999999999877631 Q ss_pred HHHhhhcccccccccCcccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhc--cCCCCCCEEEEChHHHHHHhcc Q lcl|Aclame:pro 141 MAKLCNLPAASNENIAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKN--YVPAGDRRFYCAPEDYSAILSA 218 (347) Q Consensus 141 l~~~a~~a~~~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~--~VP~~gR~~vv~P~~~~~Ll~~ 218 (347) .. .....+..+ +++-...+.+..... .....+.+++.|..+..+|.++ .+ ...-.++|+|+.|..|.. T Consensus 156 ~~-----~~g~~GLlN-~p~~~~~~~~~~~~~--~t~t~~~i~~di~~~~~~l~~~s~g~-~~p~~L~L~p~~~~~L~~- 225 (319) T protein:vir:10 156 SA-----PHKIVSVFN-HPNITKITSGKWIDV--STMKPETAEAELTQAIETIETITRGQ-HRATNILIPPSMRKVLAI- 225 (319) T ss_pred cc-----cccceeEEe-CCCceeeecCCCCCc--cccCHHHHHHHHHHHHHHHHHhcCce-eeceEEEecHHHHHhhhc- Confidence 10 000111111 111111111111111 1234567888898888888754 22 111367899999999853 Q ss_pred hhhhhhhccccccccccceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeech-- Q lcl|Aclame:pro 219 LMPNAANYAALIDPETGNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHR-- 296 (347) Q Consensus 219 ~~~~~~~~~~~~~~~~G~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~-- 296 (347) +..+.+.+-..-+.+ +..+++|...+.|...+ ..++ .+++++.+ T Consensus 226 -~~~~~~~t~l~~lk~----~~~~l~I~~~pel~~ag------------~~g~-----------------~~~v~y~~~~ 271 (319) T protein:vir:10 226 -RMPETTMSYLDYFKS----QNSGIEIDSIAELEDID------------GAGT-----------------KGVLVYEKNP 271 (319) T ss_pred -ccCCCCeeHHHHHHH----hcCCceEEEeeeecccC------------CCcc-----------------eEEEEEecCC Confidence 111111111111111 12356677766664211 0000 11122221 Q ss_pred hhhhhhhhhheeeccccchhhHhhHHhhhhhh-cCcccccceEEEEEec Q lcl|Aclame:pro 297 SAVGTVKLKDMALERARRPEFQADQIIGKYAM-GHGGLRPEAAGALVFT 344 (347) Q Consensus 297 ~A~~tv~~~~~~~e~~~~~~~~~d~i~~~~~~-G~~~lRPe~~~~l~~~ 344 (347) +-+.....++++.... .++...+.+....+. |.-+.||++++.+.== T Consensus 272 ~~~~~~v~~~~~~~~~-e~~~l~~~~~~~~r~~Gv~i~~P~ai~~~dGI 319 (319) T protein:vir:10 272 MNMSIEIPEAFNMLPA-QPKDLHFKVPCTSKCTGLTIYRPMTIVLITGV 319 (319) T ss_pred ceEEEecCcceeeeee-eecCceEEEeeeeeeEEEEEEccceeEeeecC Confidence 1111111223222111 222334445455544 5678899998776522 No 185 >protein:vir:80068 Length: 301 # NCBI annotation: gp8 # Family: family:all:463 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468712;genbank:gi:157325292;genbank:GeneID:5601759 Probab=94.62 E-value=0.0039 Score=33.77 Aligned_cols=284 Identities=15% Similarity=0.084 Sum_probs=129.0 Q ss_pred ccccHHHHHH----HHHhHHHHHHHHHHHhhhccccccc-cc-CCceEEEeccccc-eeeeecCCC-CCCCCCCCCCCCc Q lcl|Aclame:pro 19 SAADKLALFL----KVFGGEVLTAFVRRSVTMDKHMVRT-IQ-NGKSASFPVMGRT-KGYYLAPGE-NLDDKRKDIKHSE 90 (347) Q Consensus 19 ~~~d~~al~i----e~f~geV~~~f~~~s~~~~~~~~rt-i~-~G~tv~i~~iG~~-t~~~~~~g~-~~~~~~~~~~~~~ 90 (347) --+|..+.|+ +...-+|.+.....-..+.++.+++ +- ...++.++..-.. .++-+..+. +++. .++.-.+ T Consensus 1 ~~~~~~g~f~~~~l~~id~~v~e~~~~~l~~r~l~~v~~~~~~~~~~~~~~~~~~~G~~~~~~~~~~dip~--~~~~~~~ 78 (301) T protein:vir:80 1 MQGKITATIEARDLQAIDNVIYEPKQEELTARSVFPQKFDVNEGAESYSFDVMTRSGAAKIIANGADDLPL--VDVDMVR 78 (301) T ss_pred CCccccchhhHHHHHHHHHHHHHhhhhhhhhhhhcccccCCCCceEEEEEeeeccceeEEEecCccccccc--cccccee Confidence 1122223344 4445556555555567777777764 22 3455565543322 222333222 2322 2233345 Q ss_pred eEEEEeee-eecchhhccHHHH-HhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccCcccCceeee--e Q lcl|Aclame:pro 91 KVIQIDGL-LTSDVLIYDIEDA-MNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIAGLGQAVVLN--I 166 (347) Q Consensus 91 ~~l~ID~~-~~~~~~Vdd~D~~-q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~g~~~~~~i~--~ 166 (347) ....|-.. .-+.+.+.+++.+ +...++-..-...+..+++++.|+.++.=..+ ....+..+.-....... . T Consensus 79 ~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~f~G~~~-----~g~~GLlN~p~~~~~~~~~~ 153 (301) T protein:vir:80 79 KSVPIYSIGIGLSYTIQDLRAARMQGTTVDAAKATTVRRAIAEKENSIAFRGEKK-----YAIKGAFEATGIQIDVSPTT 153 (301) T ss_pred EEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEeeeccc-----ccceeeecCCCcccccccCc Confidence 55555553 1234445566665 46788888889999999999999977632111 00111111100001110 1 Q ss_pred cccccccchhhHHHHHHHHHHHHHHHHhhccCCCCC-CEEEEChHHHHHHhcchhhh-hhhccccccccccceEEEecee Q lcl|Aclame:pro 167 GAAADLVDVEARGKAILKGLTLARARLTKNYVPAGD-RRFYCAPEDYSAILSALMPN-AANYAALIDPETGNIRNVMGFE 244 (347) Q Consensus 167 ~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~g-R~~vv~P~~~~~Ll~~~~~~-~~~~~~~~~~~~G~v~~i~G~~ 244 (347) ++.+...=..+.++.|++.|.++..+|.++.-=-.+ -.++|+|+.|..|..- +.. +.+.+-..-+. .+..+.+ T Consensus 154 ~~~~~~~w~~~t~~ei~~di~~~~~~l~~~s~g~~~p~~L~L~p~~~~~L~~~-~~~~~~~~tvl~~l~----~~~~~~~ 228 (301) T protein:vir:80 154 GVGNVSKWEKKTAEQIIDEIGEAHTKITVLPGYGTASLKLCLPPKQFELINKK-RYSNEDSRSVLKVLQ----DNAWFSA 228 (301) T ss_pred ccccccccccCCHHHHHHHHHHHHHHHHHhcCceecccEEEecHHHHHhhhhc-cccCCCCeeHHHHHH----HHcCcce Confidence 111111113446677899999999888765211112 3678999999999631 100 00110000011 1122356 Q ss_pred EEEeccccccccccccccCccccccccccccccccccccccccceeEEeech--hhhhhhhhhheeeccccchhhHhhHH Q lcl|Aclame:pro 245 VIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHR--SAVGTVKLKDMALERARRPEFQADQI 322 (347) Q Consensus 245 V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~--~A~~tv~~~~~~~e~~~~~~~~~d~i 322 (347) |...+.|... +..++ .+++++.. +-+-....++++.-.. .++.....+ T Consensus 229 I~~~p~L~~~------------g~~g~-----------------~~~v~~~~~~d~~~~~v~~~~~~~~~-e~~~~~~~~ 278 (301) T protein:vir:80 229 IVRVPDLAGM------------GTAGS-----------------DSFAVIHDSNETAELIIPMDITRHPE-EYSFPRTKV 278 (301) T ss_pred EEEcceeccC------------CCCcc-----------------cEEEEEecCCcEEEEEecCceeeecc-eecCceeEe Confidence 6666655311 00111 11222221 1111111222211100 111112233 Q ss_pred hhhhhh-cCcccccceEEEEEec Q lcl|Aclame:pro 323 IGKYAM-GHGGLRPEAAGALVFT 344 (347) Q Consensus 323 ~~~~~~-G~~~lRPe~~~~l~~~ 344 (347) ....+. |..+.||++++-+.== T Consensus 279 ~~~~r~~Gv~i~~P~ai~~~~GI 301 (301) T protein:vir:80 279 PFEERTAGVVVRFPAAIVRVDGI 301 (301) T ss_pred eeeeeeEEEEEEccceEEEEecC Confidence 334444 5688899998776522 No 186 >protein:vir:94989 Length: 349 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224029;genbank:gi:62327316;genbank:GeneID:5176817 Probab=94.60 E-value=0.004 Score=33.75 Aligned_cols=290 Identities=12% Similarity=0.070 Sum_probs=138.8 Q ss_pred CCCCccCccccccCcccCccccHHHHH-HHHHhHHHHHHHHHHHhhh--cccccc-cc-----cCCceEEEeccccceee Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALF-LKVFGGEVLTAFVRRSVTM--DKHMVR-TI-----QNGKSASFPVMGRTKGY 71 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~-ie~f~geV~~~f~~~s~~~--~~~~~r-ti-----~~G~tv~i~~iG~~t~~ 71 (347) ||- ||. .|.- .| +|+|.-.|.+...+.+.|. +.+..+ .+ .+|+.+.+|..+...-. T Consensus 1 Ma~--------T~l------~D~i-ipe~~vf~~Yv~~~~~e~~~l~qSGii~~d~~l~~~~~~gG~~~~iPf~~~l~g~ 65 (349) T protein:vir:94 1 MAI--------TTI------GNIV-TGNIPVLASYMTEDPVEKTAFFNSGILTPTPYAAEIARGPSNIANLPFWKAIDTS 65 (349) T ss_pred CCc--------eEE------eeee-ccChHHHHHHHHHhHHHhhhhhhccceeccHHHHHHHhcCCCEEEeeeeecCCCC Confidence 764 221 2221 11 3467777777766665333 222211 22 46999999988765422 Q ss_pred -e--ecCCCCC-CCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhc Q lcl|Aclame:pro 72 -Y--LAPGENL-DDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNL 147 (347) Q Consensus 72 -~--~~~g~~~-~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~ 147 (347) . |..-++. ..++..+...+.. -+=-..-.++...|+=..-+--|.|+.++++.+.--.|...+.+|..|...... T Consensus 66 ~e~n~~~dt~~~~~t~~kit~~~~~-a~~~~r~kaw~~~Dla~~lsG~dpm~~Ia~~va~yW~r~~q~~Lia~L~Gvf~~ 144 (349) T protein:vir:94 66 IEPNYSNDVYQDIATPRAIQTGEMM-ARVAYLNEGFGQADLTVELTSQNPLQSVASRLDNFWQRQAQRRLIATALGLYND 144 (349) T ss_pred cccccCCCCccccccccccccccee-eeeeeeccccchhHHHHHhhCchHHHHHHHHHHHHHhhHHHHHHHHHHHhhhcc Confidence 1 1111111 1122334433322 222334456777888888777899999999999888887666666555333321 Q ss_pred ccccccccCcccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccC--CCCC-CEEEEChHHHHHHhcchhhhhh Q lcl|Aclame:pro 148 PAASNENIAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYV--PAGD-RRFYCAPEDYSAILSALMPNAA 224 (347) Q Consensus 148 a~~~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~V--P~~g-R~~vv~P~~~~~Ll~~~~~~~~ 224 (347) ....+ .........+....+... ..++. +.+|..+|...-- ..+. -.+++-+..|..|.+...+. T Consensus 145 ~~~~~-~~~~~~~~~~~d~~~~a~-----~~~~~----~~~A~~~~Gdaa~Gd~~~~lt~i~mHS~v~~~L~~~~li~-- 212 (349) T protein:vir:94 145 NVSAT-DAYHEQNDMVVDVSATSG-----FDAGA----FIDATQTMGDALMGNGGEVLGAIAMHSFVYAQARKAQLID-- 212 (349) T ss_pred ccccc-ccccccCceeEEecccCC-----CChhh----HHHHHHHHHHHhccccccceeEEEEchHHHHHHHhcchhh-- Confidence 11111 111111222222221111 12222 4445555444311 1122 56789999999988754332 Q ss_pred hccccccccccceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhh Q lcl|Aclame:pro 225 NYAALIDPETGNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKL 304 (347) Q Consensus 225 ~~~~~~~~~~G~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~ 304 (347) |.- ..-..-.|..++|.+|+....+|....+. +++|. ..+|-+=|++.... T Consensus 213 -~i~-~s~~~~~i~ty~G~~VivDD~~Pv~~~g~--------------------~~~yt-------tylfg~GAi~~~~~ 263 (349) T protein:vir:94 213 -FIR-DAENNTMFATYQGYRVIVDDSMTVVGQDT--------------------SRKFI-------SIIFGQGAIGYGEG 263 (349) T ss_pred -hcc-CcccCcccceecCcEEEEeCCCccccCCC--------------------CceEE-------EEEeecceEEeecC Confidence 211 11123467899999999999999753221 11121 12333444444444 Q ss_pred hh-eeeccccchhhH----hhHHhhh-----hhhcCcccccceEE----EEEecCCC Q lcl|Aclame:pro 305 KD-MALERARRPEFQ----ADQIIGK-----YAMGHGGLRPEAAG----ALVFTPAA 347 (347) Q Consensus 305 ~~-~~~e~~~~~~~~----~d~i~~~-----~~~G~~~lRPe~~~----~l~~~~aa 347 (347) .+ +.+|..||+... .|.+..+ +.+|.+-..+.... .+...|+= T Consensus 264 ~~~~~~E~~rd~~~g~~~G~d~L~~R~~~~~hp~G~s~~~a~v~~~~~~~~~~sPt~ 320 (349) T protein:vir:94 264 NPEMPLEYEREASRANGGGVETLWTRKTWLLHPFGYSFTSAVITGNGTETIARSASW 320 (349) T ss_pred CCCcceeeecccccCCcceeEEEEEeeEEEeeeeeeeecccccCCCccccccCCCCh Confidence 32 235666666432 2444443 33333332211110 01111211 No 187 >protein:vir:4786 Length: 295 # NCBI annotation: hypothetical protein # Family: family:all:3269 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150166;swissprot:trembl:q94m45;genbank:gi:15088777;uniprot:Q94M45;genbank:GeneID:955980 Probab=94.35 E-value=0.003 Score=34.39 Aligned_cols=273 Identities=17% Similarity=0.097 Sum_probs=117.5 Q ss_pred ccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccc-cc---cccCCceEEEecccc--ceeeeecCCCCCC--- Q lcl|Aclame:pro 10 IGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHM-VR---TIQNGKSASFPVMGR--TKGYYLAPGENLD--- 80 (347) Q Consensus 10 ~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~-~r---ti~~G~tv~i~~iG~--~t~~~~~~g~~~~--- 80 (347) +++++++ -++ .|-|+|.|.+.+-|++++.|++..- .+ -+.+.++.---.... +.++.|..+.+.- T Consensus 1 mp~N~n~-----avr-~Y~Kqf~glL~~vf~~qa~F~~~FGglQalDGV~~N~tafsvKt~D~pVVig~Y~TdeNvagFG 74 (295) T protein:vir:47 1 MPSNQNN-----AVR-RYEKQYAGILETVFGVRAAFSNALAPIQILDGVQENSKAFSVKTNNTPVVIGEYKTGENDGGFG 74 (295) T ss_pred CCCCCCc-----cch-hhhHHHHHHHHHHHhHHHHHhhhhcchhhhhCCCccceEEEEeecCcceEeecccCCCcccccc Confidence 2222221 122 5889999999999999999886643 22 233322221111111 1233455444432 Q ss_pred -CCCCCCCCCce--EEEEeee-eecchh-h-ccHHHHHhCcchHHHHH---HHHHHHHHHHHHHHHHHHHHHhhhccccc Q lcl|Aclame:pro 81 -DKRKDIKHSEK--VIQIDGL-LTSDVL-I-YDIEDAMNHYDVRAEYS---AQLGEALAIAADGAVLAEMAKLCNLPAAS 151 (347) Q Consensus 81 -~~~~~~~~~~~--~l~ID~~-~~~~~~-V-dd~D~~q~~~D~r~~~~---~~~g~aLa~~~D~~il~~l~~~a~~a~~~ 151 (347) ++-+.-...++ .+..|+. .|..-+ | .-+|..-.+-|+....+ +.+++|-.+.+|..+=..|...|... T Consensus 75 tGTg~SsRFG~rkEi~y~dtdV~Y~~~~~iHEGiD~~TVNnd~~aaVAdRL~LQA~Akt~~~n~~~Gk~ls~~A~~t--- 151 (295) T protein:vir:47 75 DNSGAQSRFGGVTEVKYENTDVNYDYTLTIHEGLDRYTVNNDLNAAVADRLKLQSEAQTRTVNKRIGKYLSDTATKT--- 151 (295) T ss_pred cCCccccccCceeeEEeecccccccccchhhhccccccccCChhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhh--- Confidence 11111111111 2233322 222222 2 45666666666665554 45677778888876544343222110 Q ss_pred ccccCcccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhcccccc Q lcl|Aclame:pro 152 NENIAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALID 231 (347) Q Consensus 152 ~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~ 231 (347) ..-++.+ ...+...+-.+.++.-...|-..-| +.|.|+.|.+|+.+.-.+... +++.+ T Consensus 152 ---------------e~~td~t-----~d~V~~LF~~as~~yvn~ev~~~~~-AyV~~evYnaiiD~~l~TsaK-~SsaN 209 (295) T protein:vir:47 152 ---------------EALADFT-----DDKVKALFNKLSAFYTNNEVTAPIT-VYLRSEFYNAIVDMASVTSAK-GATIS 209 (295) T ss_pred ---------------hhhhccc-----chhHHHHHHHHHHHhhhhheeeeeE-EEEchhHHHHHhccccccccc-cceee Confidence 0001111 1112233344455555555533223 899999999999887555432 23344 Q ss_pred ccccceEEEeceeEEEecccccc-ccccccccCcccccccccccccccccc---ccccccceeEEeechhhhhhhhhhhe Q lcl|Aclame:pro 232 PETGNIRNVMGFEVIEVPHLTVG-GAGDNNPADGVAPTNQKHIFPATATGD---DRVAQNNVVGLFNHRSAVGTVKLKDM 307 (347) Q Consensus 232 ~~~G~v~~i~G~~V~~sn~lp~~-~~~~~~~~~~~~~t~~~~~~~a~~~~~---y~~d~~~~~~l~~h~~A~~tv~~~~~ 307 (347) +-.-.|.+.-||.|-|.|.--.. +......+++. ...| .|.+.. ...||+.+ .++-- +-+|-+.-+ T Consensus 210 iDengi~~FkGf~i~e~P~~~~q~G~~aifs~dni-----g~af-tGIn~aR~IesEdF~GV---alQ~~-~~~~~~~~~ 279 (295) T protein:vir:47 210 LDENGLPKYKGFTLEETPAQYFETGVIAIFSPNGI-----IIPF-VGISTARVIEAENFDGV---NCKLL-LRVVLTLLM 279 (295) T ss_pred eccCCcceecceEEEeccHhhccCCcEEEEccccc-----eeec-ccceeeeeeecccccch---HHHHH-HHHHHHHHH Confidence 55566778899999887654321 11111111111 1111 111111 12344432 22110 001111111 Q ss_pred eeccccchhhHhhHHhhhhhh Q lcl|Aclame:pro 308 ALERARRPEFQADQIIGKYAM 328 (347) Q Consensus 308 ~~e~~~~~~~~~d~i~~~~~~ 328 (347) +....| .-+---+|.- T Consensus 280 ~~~~~~-----~~~~~~~~~~ 295 (295) T protein:vir:47 280 TIRKQF-----TKLQELLYRR 295 (295) T ss_pred HHHHHH-----HHHHHHhhcC Confidence 111111 1111111111 No 188 >protein:vir:79548 Length: 652 # NCBI annotation: putative protease/scaffold protein # Family: family:all:62 # ACLAME annotation(s): go:0008236 - serine-type peptidase activity; phi:0000017 - phage prohead/capsid assembly # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272518;genbank:gi:148609387;genbank:GeneID:5204384 Probab=94.06 E-value=0.0055 Score=32.96 Aligned_cols=289 Identities=13% Similarity=0.054 Sum_probs=125.0 Q ss_pred CCCCccCccccccCccc---------------CccccHHHHHHHHHhHHHHHHHHHH-HhhhcccccccccCCceEEEec Q lcl|Aclame:pro 1 MANATGGQQIGANQGKG---------------QSAADKLALFLKVFGGEVLTAFVRR-SVTMDKHMVRTIQNGKSASFPV 64 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~---------------~~~~d~~al~ie~f~geV~~~f~~~-s~~~~~~~~rti~~G~tv~i~~ 64 (347) ||-.- ..+.|.+ ++++|=-.|+...-.-.++..|+.. .-++.|.+.++++.=|..+..+ T Consensus 336 lAr~~-----L~~~G~~~~~~~~~~~v~~A~~hsTsDFp~IL~~~~nk~l~~~y~~a~~t~~~~~~~~~~~DFk~~~~~~ 410 (652) T protein:vir:79 336 YARMS-----LTERGIGVSSYNPMQMVGAAFTHSTSDFGNILLDVANKAILQGWEDAPETYEQWTRKGQLSDFKIAHRVG 410 (652) T ss_pred HHHHH-----HHhhccCCCCCCHHHHHHHHhhcCcchHHHHHHHHHHHHHHHHHhhhHHHHHHHhccCCCccccccceee Confidence 11100 0111111 2344433233333344456667665 4677777777665444445555 Q ss_pred cccc-eeeeecCCCCCCCCCCCCCCCceEEEEeeee----ecch-hh-ccHHHHHhCcchHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 65 MGRT-KGYYLAPGENLDDKRKDIKHSEKVIQIDGLL----TSDV-LI-YDIEDAMNHYDVRAEYSAQLGEALAIAADGAV 137 (347) Q Consensus 65 iG~~-t~~~~~~g~~~~~~~~~~~~~~~~l~ID~~~----~~~~-~V-dd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~i 137 (347) +|.. ++..+..|.++. ...+...+-++.+.++= +.+- .| ||++ .-..+.+..|++-++..++.+ T Consensus 411 lg~~~~L~~V~E~gEyk--~~t~~e~~e~~~l~tyG~~~~iTRqaiINDDL~-------a~~~ip~~~g~aA~~~~~~~v 481 (652) T protein:vir:79 411 MGGFSALRQVREGAEYK--YVTTGDKQATIALATYGELFSITRQAIINDDLN-------MLTDVPMKLGRAAKSTIADLV 481 (652) T ss_pred cCCCCCccccCCCCccc--eeeecCccceeeeecccCeeeeehheeeccchh-------HHHHHHHHHHHHHHHHHHHHH Confidence 5543 334443344442 22355566677776641 1111 13 4444 345677888888899999888 Q ss_pred HHHHHHhhhcc-cccccc-cCcccCceeeeecccccccchhhHHHHHHHHHHHH--HHHHhhccCCCCCCEEEEChHHHH Q lcl|Aclame:pro 138 LAEMAKLCNLP-AASNEN-IAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLA--RARLTKNYVPAGDRRFYCAPEDYS 213 (347) Q Consensus 138 l~~l~~~a~~a-~~~~~~-~~g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a--~~~Lde~~VP~~gR~~vv~P~~~~ 213 (347) +..+..-.... ..++=+ ... -+ ++.+++. -+.+.. ++-+.+ .++-.+..+--..||++|+|+... T Consensus 482 y~~l~~Np~~~~DGk~LF~hA~--H~---Nl~~~aa-~~~~~l-----~~ar~aM~~Qk~g~~~l~i~P~~llvp~~le~ 550 (652) T protein:vir:79 482 YAILTSNPKISTDNVSLFDKAK--HA---NVLESAA-MDVASL-----DKARQLMRVQKEGERHLNIRPAFVLVPTAMES 550 (652) T ss_pred HHHHhcCcccccCCceeecccc--cc---ccccccc-CCHHHH-----HHHHHHHHHhccCCccccccccEEEecchhHH Confidence 87664221111 000001 011 11 1111111 121111 111111 122111112233589999998654 Q ss_pred HHhcchhhhhhhccccccccccceEEEece-eEEEeccccccccccccccCccccccccccccccccccccccccceeEE Q lcl|Aclame:pro 214 AILSALMPNAANYAALIDPETGNIRNVMGF-EVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGL 292 (347) Q Consensus 214 ~Ll~~~~~~~~~~~~~~~~~~G~v~~i~G~-~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l 292 (347) ... ++++.......+...|.+.-+.|+ +|+..++|...+....-++........+-.|-.| T Consensus 551 ~a~---~ll~s~~v~~a~~~~~~~Np~~~~~~~i~eprL~~~s~~~wylaa~~~~dtiev~yL~G--------------- 612 (652) T protein:vir:79 551 VAN---QVIRSSSVKGADINAGIINPVKDFATVIAEPRLDDNSQTTFYLAASKGSDTIEVAYLNG--------------- 612 (652) T ss_pred HHH---HHhccCCCcccccccccccccccccccccccccCCCCcccEEEecCCCCCeEEEEEecC--------------- Confidence 332 333222221122334555555664 8888999864322211111110000001111111 Q ss_pred eechhhhhhhhhhheeeccccchhhHhhHHhhhhhhcCcccccceEEEEEe Q lcl|Aclame:pro 293 FNHRSAVGTVKLKDMALERARRPEFQADQIIGKYAMGHGGLRPEAAGALVF 343 (347) Q Consensus 293 ~~h~~A~~tv~~~~~~~e~~~~~~~~~d~i~~~~~~G~~~lRPe~~~~l~~ 343 (347) .+.+.+|..-.-...+=.++-.+=||++++..-+++-... T Consensus 613 -----------~~~P~ie~~~gf~~dG~~~kvrlD~G~~~iD~RG~~k~t~ 652 (652) T protein:vir:79 613 -----------VDTPYIDQMEGFSVDGVTTKVRIDAGVAPVDHRGLVKCTA 652 (652) T ss_pred -----------CCCCeeeecCCCCcceEEEEEEEeccCceeeccceeeecC Confidence 1112233221111223344556788999998887754433 No 189 >protein:vir:4074 Length: 480 # NCBI annotation: major capsid (head) protein # Family: family:all:11745 # MgeID: mge:85 # MgeName: c2 # Cross-refs: genbank:acc:NP_043553;genbank:gi:9628687;genbank:GeneID:1261180 Probab=91.87 E-value=0.014 Score=30.77 Aligned_cols=278 Identities=10% Similarity=0.083 Sum_probs=108.0 Q ss_pred CCCCccCccccccCccc--CccccHHHHHHHHHh----HHHHHHHHHH--------------HhhhcccccccccCC-ce Q lcl|Aclame:pro 1 MANATGGQQIGANQGKG--QSAADKLALFLKVFG----GEVLTAFVRR--------------SVTMDKHMVRTIQNG-KS 59 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~--~~~~d~~al~ie~f~----geV~~~f~~~--------------s~~~~~~~~rti~~G-~t 59 (347) +.... .+|++.. +...+.....++.++ +.-...|.+. ++...+.++..+... .+ T Consensus 164 ~ee~k-----~~~~~~~~~~~~~~~~~~e~r~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 238 (480) T protein:vir:40 164 REELK-----KEREASIPSEKPEDAERKFMRELGSKMAEMPEQGFLREFANGADLNVVNSLGSITSKYARKSGIYDGAMK 238 (480) T ss_pred HHHHh-----hhhhhhccccchhhhhhHHHHHHHHHhccchhhhhhhhhhhhccccccccccccccchhhheeechhhhh Confidence 10000 0111110 001111111222221 1111122111 000111111011100 00 Q ss_pred E-----EEeccccce---ee-eecCCCCCCCCCCCCCCCceEEEEeee--eec---chhhccHHHHHhCcchHHHHHHHH Q lcl|Aclame:pro 60 A-----SFPVMGRTK---GY-YLAPGENLDDKRKDIKHSEKVIQIDGL--LTS---DVLIYDIEDAMNHYDVRAEYSAQL 125 (347) Q Consensus 60 v-----~i~~iG~~t---~~-~~~~g~~~~~~~~~~~~~~~~l~ID~~--~~~---~~~Vdd~D~~q~~~D~r~~~~~~~ 125 (347) + .+...|... +. ....+.... . -.....++ .++. ++. ......+|++ .++.+-+..+. T Consensus 239 ~~~~~~~~~~~g~~~~~~~~e~~~~~~~~~--~--~~~~~~~~-~~~~v~~l~~~~k~t~~lLDDa---~~l~~~i~~~l 310 (480) T protein:vir:40 239 ARFQGLTLAEDGVDDTFISGTFKAGTDKNK--S--QTATKRSL-RPQMAEAYLQMDKATVRGVNDS---GALSEYVMSEM 310 (480) T ss_pred hhhhcceeeeccccceeeeeeeeccccccc--c--cccccchh-hHHHHHHHHHhHHHHHHHhhhh---HHHHHHHHHHH Confidence 0 011111110 00 011111100 0 00000010 1110 011 1111222222 24778889999 Q ss_pred HHHHHHHHHHHHHHHHHHhhhcccccccccCcccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCC-E Q lcl|Aclame:pro 126 GEALAIAADGAVLAEMAKLCNLPAASNENIAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDR-R 204 (347) Q Consensus 126 g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR-~ 204 (347) ++.|+++.++.++. + ..+|......+.....+ .+ ....+...++.|+.+. .+..- .+. . T Consensus 311 ~~~~~~~ee~a~l~----G---------~g~g~~~~~g~~~~~~~-~~-~~~~~~d~id~L~~al---~~~y~--~~a~~ 370 (480) T protein:vir:40 311 VNRVIQKVEYNMIL----G---------SVDGSNGFYGLKTATDG-WT-KQIEYTDLFEGITDAV---AECSI--SDAIT 370 (480) T ss_pred HHHHHHHHHHHhhc----c---------CCCCccccccceeeccc-cc-ccchhHHHHHHHHHhh---hHHhh--CCCCE Confidence 99999998887753 1 00111001111111111 11 1112233344444433 22221 133 5 Q ss_pred EEEChHHHHHHhcchhhhhhhccccccccccceEEEeceeEEEec-cccccccccccccCcccccccccccccccccccc Q lcl|Aclame:pro 205 FYCAPEDYSAILSALMPNAANYAALIDPETGNIRNVMGFEVIEVP-HLTVGGAGDNNPADGVAPTNQKHIFPATATGDDR 283 (347) Q Consensus 205 ~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~v~~i~G~~V~~sn-~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~ 283 (347) +|++|..|..|.+-. -.+..|.-+..+..|....++|++|+++. .+|.... ..+..+.| T Consensus 371 ~vmn~~t~~~I~klK-D~~G~Yi~q~~~~~~~~~~llG~pvv~~~~~~~~~~~------------------~~~~~~~~- 430 (480) T protein:vir:40 371 IVMSPQTFAELRKAK-GTDGHSRFNELATKEQIAQSFGAVNLETRVWMPKDEV------------------AVYNHDEY- 430 (480) T ss_pred EEECHHHHHHHHHhh-cCCCCeeccCcccccCcceecccceeeeeccccCCcc------------------eeeeCCcc- Confidence 789999999886543 33466876667788899999999988754 3332110 01111112 Q ss_pred ccccceeEEeechhhhhhhhhhheeeccccchhhHhhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 284 VAQNNVVGLFNHRSAVGTVKLKDMALERARRPEFQADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 284 ~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~~~~~~~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) .+++.++ .+....++-++-...+......|..+.+|+++.-+.+=..= T Consensus 431 -------~~~~d~~---------~~~~~~~~~~~~~~~~~~e~~v~g~~~~~~~~~~~~~~~~~ 478 (480) T protein:vir:40 431 -------VLIGDLN---------VENYNDFDLRYNVEQWLSETLVGGSIRGKNRSAYLKKKGSL 478 (480) T ss_pred -------EEEEecc---------cceecccccccchhhhhhhhhhceeeEccccEEEEEeccCc Confidence 2344332 12222234345556777788889999999888776544333 No 190 >protein:vir:3969 Length: 287 # NCBI annotation: major capsid protein # Family: family:all:3269 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:NP_663677;genbank:gi:21716114;genbank:GeneID:951200 Probab=91.70 E-value=0.015 Score=30.64 Aligned_cols=262 Identities=15% Similarity=0.127 Sum_probs=120.2 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhccccc-----ccccCCceEEEeccccc--eeeee Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMV-----RTIQNGKSASFPVMGRT--KGYYL 73 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~-----rti~~G~tv~i~~iG~~--t~~~~ 73 (347) |+ ++ .|-|+|.|.+.+-|+++|.|++..-- .-+.+.++.-=-....+ .++.| T Consensus 1 ~a--------------------vr-~y~Kq~~glL~~vf~~qa~F~~~FGg~lQ~~DGV~~N~taf~vKtsD~pVVi~~Y 59 (287) T protein:vir:39 1 MA--------------------IK-YFTKQYAGMLPDLFAKKSAFLRAFGGVLQVKDGVTENDTFMELKVSDTDVVIQAY 59 (287) T ss_pred CC--------------------cc-cccHHHHHHHHHHHHHHHhhhhhcccceeeecCCcccceEEEEEecCcceEEecc Confidence 11 11 47799999999999999998876431 12333332221111111 22344 Q ss_pred cCCCCC---CCCCCCCCCCce--EEEEeee-eecchh-h-ccHHHHHhCcchHHHH---HHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 74 APGENL---DDKRKDIKHSEK--VIQIDGL-LTSDVL-I-YDIEDAMNHYDVRAEY---SAQLGEALAIAADGAVLAEMA 142 (347) Q Consensus 74 ~~g~~~---~~~~~~~~~~~~--~l~ID~~-~~~~~~-V-dd~D~~q~~~D~r~~~---~~~~g~aLa~~~D~~il~~l~ 142 (347) ..+.+. .++-+.-...++ .+.+|+. .|..-+ | .-+|+.-.+-|+.... .+.++.|-++.+|..+=..|. T Consensus 60 ~Td~Nv~FGtGTg~ssRFG~rkEi~y~dt~V~Y~~~~~ihEGiD~~TVNnd~~aaVAdRL~Lqa~A~t~~~n~~~Gk~ls 139 (287) T protein:vir:39 60 STDANVGFGSGTGNTSRFGQRKEVKSVNKQVSYDAPLAINEGIDDFTVNDIKDQVVAERLALHGVAWAQHVDKLLGKLLS 139 (287) T ss_pred cCCCCcccccCCCccccccceeEEEEecccccceeccccccccccccccCChhHHHHHHHHhHHHHHHHHHHHHHHHHHH Confidence 333322 111111111111 2223322 222111 1 3455555565655544 556788888999986544443 Q ss_pred HhhhcccccccccCcccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCC-CEEEEChHHHHHHhcchhh Q lcl|Aclame:pro 143 KLCNLPAASNENIAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGD-RRFYCAPEDYSAILSALMP 221 (347) Q Consensus 143 ~~a~~a~~~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~g-R~~vv~P~~~~~Ll~~~~~ 221 (347) ..|.. ...+ . .....+...+-+|.+++-.++|-... ..+.|.|+.|.+|+.+.-. T Consensus 140 ~~A~~---------------t~~~--~-------~t~d~V~~LF~~a~~~yvNn~v~~~~~~~AyV~aevYnaiiD~~l~ 195 (287) T protein:vir:39 140 DSASE---------------TLTV--K-------LDEDSVTKLFSDAHKKFVNNNVSIAVPWVAYVNADIYDLLIDSKLA 195 (287) T ss_pred hhcch---------------heee--e-------ecccchHHHHHHHHHHhhccceeeEEEEEEEEChhHHhHHhccccc Confidence 22210 0000 0 00011223344455666655664444 4567999999999987755 Q ss_pred hhhhccccccccccceEEEeceeEEEeccccc-cccccccccCccccccccccccccccccccccccceeEEeechhhhh Q lcl|Aclame:pro 222 NAANYAALIDPETGNIRNVMGFEVIEVPHLTV-GGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVG 300 (347) Q Consensus 222 ~~~~~~~~~~~~~G~v~~i~G~~V~~sn~lp~-~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~ 300 (347) +... +++.++-.-.|.+.-||.+-|.|.--. .+.. .+|.++-++ T Consensus 196 TsaK-~SsaNiDen~i~kFkGf~l~e~P~~~~q~g~~----------------------------------a~fs~dnig 240 (287) T protein:vir:39 196 TTAK-NSSANVDEQTLYKFKGFILSELPDEKFQLNEG----------------------------------AYFAADNVG 240 (287) T ss_pred cccc-cceeeeccCCcceecceEEEecchHhhccCcE----------------------------------EEEccccce Confidence 5432 233445556677899999988763211 1110 112222111 Q ss_pred hhh-----hhheeeccccchhhHhhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 301 TVK-----LKDMALERARRPEFQADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 301 tv~-----~~~~~~e~~~~~~~~~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) ... +.-++.|.+ -|-.+-|-=-||--++.-...+.+..++.- T Consensus 241 ~af~GI~vaR~i~sEdF-----~GvalQgAgK~G~~i~e~Nk~Ai~k~t~~k 287 (287) T protein:vir:39 241 VAGVGIQVTRAMDSEDF-----AGTALQAAAKYGKYLPEKNKKAILKATVTK 287 (287) T ss_pred eecccceeEEeeecccc-----cceeeecccccccccccccceEEEEEecCC Confidence 111 111122222 132333333344444444444444444333 No 191 >protein:vir:104342 Length: 314 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398971;genbank:gi:81343955;genbank:GeneID:3778874 Probab=90.59 E-value=0.02 Score=29.89 Aligned_cols=287 Identities=14% Similarity=0.034 Sum_probs=125.3 Q ss_pred CCCCcc---CccccccCc-ccCccccHHHHHH-HHHhHHHHHHHHH----HHhhhccccccc-cc-CCceEEEe---ccc Q lcl|Aclame:pro 1 MANATG---GQQIGANQG-KGQSAADKLALFL-KVFGGEVLTAFVR----RSVTMDKHMVRT-IQ-NGKSASFP---VMG 66 (347) Q Consensus 1 m~~~~~---~~~~~~~~~-~~~~~~d~~al~i-e~f~geV~~~f~~----~s~~~~~~~~rt-i~-~G~tv~i~---~iG 66 (347) |+= +. ...+.++.. .+....|....|+ ++++ .|+....+ .-..+.++.+++ +- .-.++.+. .+| T Consensus 1 ~~~-~~~~~~~~~~~~~~~~~~~~~d~~~~fl~~ql~-~id~~v~e~~~~~~~~~~~i~v~~~~~~~~et~~~~~~e~~G 78 (314) T protein:vir:10 1 MAI-KFDAEQAKITTHLEQMGVEKADAAGIWAVSQLT-AALNRAYEKEYAENSVVNIFPVTNEIPGHAKYFEYPEFDGVG 78 (314) T ss_pred Ccc-chHHHHHHHHHHHHhhcccchhhhHHHHHHHHH-HHHHHHhhhhccccccceeeccccCCCCceeEEEeeeecccc Confidence 220 00 000001111 1223344433455 5554 45444433 234445555553 11 12244433 444 Q ss_pred cceeeeecCC-CCCCCCCCCCCCCceEEEEeeee-ecchhhccHHHH-HhCcchHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 67 RTKGYYLAPG-ENLDDKRKDIKHSEKVIQIDGLL-TSDVLIYDIEDA-MNHYDVRAEYSAQLGEALAIAADGAVLAEMAK 143 (347) Q Consensus 67 ~~t~~~~~~g-~~~~~~~~~~~~~~~~l~ID~~~-~~~~~Vdd~D~~-q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~ 143 (347) ..+ -|..+ .+++. .++.-.+....|-.+. -+.+.+.++..+ +...++-..-...+..++++..|+.++.= T Consensus 79 ~a~--~~~d~~~dip~--vd~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i~f~G--- 151 (314) T protein:vir:10 79 IAQ--IIADYSDDLPL--VDAFMTEKQGKVFRFGNAFLISTDEIKAGAATGQSLSARKQALAFEAHDNLLDKLVWSG--- 151 (314) T ss_pred cee--eeCCcccccce--eecccceeEEEEEEEEeeEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEee--- Confidence 443 22222 22332 2333444455444431 223334555554 45677777778888888888888866521 Q ss_pred hhhcccccccccCcccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhc----cCCCCCCEEEEChHHHHHHhcch Q lcl|Aclame:pro 144 LCNLPAASNENIAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKN----YVPAGDRRFYCAPEDYSAILSAL 219 (347) Q Consensus 144 ~a~~a~~~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~----~VP~~gR~~vv~P~~~~~Ll~~~ 219 (347) .+. -.+.|.-....++..+++... +..+.+++.|..+..+|.++ .-| -.++|+|+.|..|.. T Consensus 152 -----~~~-~g~~GLlN~p~v~~~~~~~~W---aT~~ei~~Di~~~~~~l~~~s~g~~~p---~~l~Lpp~~~~~L~~-- 217 (314) T protein:vir:10 152 -----SAP-HGIVSVFDQPNINNVVATPNW---SVPQNAIDDVTAMIDAVESSTQGLHHV---TDILLPASARRVMQG-- 217 (314) T ss_pred -----ccc-ccceeEeecCCCccccCCCCc---ccHHHHHHHHHHHHHHHHHhcCccccc---eeEEecHHHHHhhcc-- Confidence 000 011122111112222221111 24467889999999999875 223 267899999977642 Q ss_pred hhhhhhccccccccccceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhh- Q lcl|Aclame:pro 220 MPNAANYAALIDPETGNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSA- 298 (347) Q Consensus 220 ~~~~~~~~~~~~~~~G~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A- 298 (347) ..-+.+.+-..-+.+ +-.+++|...+.|-..+ .++ +.+.+++.++. T Consensus 218 ~~~~~~~tvl~~l~~----n~~~l~I~~~~el~~ag------------~~g-----------------~~~~v~y~~~~~ 264 (314) T protein:vir:10 218 LVPQTNLSYGELFTR----NNPGLTIRFLQFLDNYD------------GAG-----------------GKAALAFEKSPL 264 (314) T ss_pred cccCCCccHHHHHHH----hCCCcEEEEcccccccC------------CCc-----------------ceEEEEEecCCc Confidence 111111111111111 12366677666653111 001 11112222111 Q ss_pred -hhhhhhhheeeccccchhhHhhHHhhhhhh-cCcccccceEE---EEEec Q lcl|Aclame:pro 299 -VGTVKLKDMALERARRPEFQADQIIGKYAM-GHGGLRPEAAG---ALVFT 344 (347) Q Consensus 299 -~~tv~~~~~~~e~~~~~~~~~d~i~~~~~~-G~~~lRPe~~~---~l~~~ 344 (347) +.....++++.-. ..++...+.+...... |..+.||++++ -|.++ T Consensus 265 ~~~~~vp~~~~~l~-~e~~~~~~~~~~~~r~~Gv~i~~P~ai~~~dGI~~~ 314 (314) T protein:vir:10 265 NMSIEIPEVTNVLP-AQPKDLHFRYPVTSKATGLIVYRPLTMAVIKGITFA 314 (314) T ss_pred EEEEecCccceeec-ceecCceEEEcceeeeEEEEEECcceeEeeeeeecC Confidence 1111112222110 1223344555555566 57888999998 55666 No 192 >protein:vir:79078 Length: 307 # NCBI annotation: gp8 # Family: family:all:908 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111208;genbank:gi:134288798;genbank:GeneID:4960752 Probab=90.20 E-value=0.02 Score=29.88 Aligned_cols=277 Identities=10% Similarity=0.064 Sum_probs=102.2 Q ss_pred CCCCccCccccccCcccCccccHH--HHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEeccccceee--ee--c Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKL--ALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVMGRTKGY--YL--A 74 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~--al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~iG~~t~~--~~--~ 74 (347) |+.++- .|+ -|+. ++.+.-+..+ |-... +.+.+.+ ...+.+++..|+-... +. . T Consensus 1 m~~~~~-----~~~------~dp~LT~~A~gy~n~~----~Iad~-lfP~vpV----~~~~~k~~~f~~e~f~~~~t~ra 60 (307) T protein:vir:79 1 MGRLSK-----LRI------VDPVLTNLAIGYTNAE----FIGQT-LMPVVEV----EKEGGKIPKFGKESFRLYQTERA 60 (307) T ss_pred CCCCCC-----Ccc------cCHHHHHHHhhccchh----hhhhh-cCCcccc----cccccceeeeccccccccccccc Confidence 666542 111 1321 1122111111 11111 1222222 2223444444432211 11 1 Q ss_pred CCCCCCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccc Q lcl|Aclame:pro 75 PGENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNEN 154 (347) Q Consensus 75 ~g~~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~ 154 (347) ++.... ..+.-..+..++.+++.-. ...||+.+...+.||++....+.....+.+..+- .++...-.... T Consensus 61 ~~~~~~-~v~~~~~~~~~~~~~~~~l-~~~id~r~~~~~~~~~~~~Av~~l~d~I~l~~E~-------~~A~l~~~~~~- 130 (307) T protein:vir:79 61 LRAKSN-RMNPEDIDSVDVNLDEHDL-EYPIDYREDQESAFPLEQAAVQTATDAIQLRREK-------MIADLSQNPSS- 130 (307) T ss_pred cCCCcc-eeeeeccccccccccccch-hhcccchhcCCCCCCHHHHHHHHHHHHHHhHHHH-------HHHHHhccccc- Confidence 111111 1111122334556666533 3468888888889998776655544333333332 22222111111 Q ss_pred cCcccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhh-hcccccccc Q lcl|Aclame:pro 155 IAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAA-NYAALIDPE 233 (347) Q Consensus 155 ~~g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~-~~~~~~~~~ 233 (347) .+.++.+++.++.-=.++..+ .+..|.++++.+.+..- ..-..+|+++..|..|+.|+++.+. ++...+.+. T Consensus 131 ---y~~~~k~tLsgt~~Wsd~~sD---Pi~di~~~~~ai~~~~g-~~Pn~~vlg~~a~~~l~~h~~i~~~lk~~~~g~it 203 (307) T protein:vir:79 131 ---YAAGNKKQLSATEKFTAANSD---PVGVIEDGKEAIRTKIG-RRPNTMVIGASAYKTLKAHPQLIEKIKYSMKGIVT 203 (307) T ss_pred ---cCCCceEEEccCcccCCCCCC---cHHHHHHHHHHHHHhhC-CccceEEeCHHHHHHHhcCHHHHHHhcCccccccC Confidence 123334444322211122222 24556666666665432 2235889999999999999998864 333322222 Q ss_pred ccceEEEeceeEEEeccccc-cccccccccCccccccccccccccccccccccccceeEEeechhhhhhhh--------- Q lcl|Aclame:pro 234 TGNIRNVMGFEVIEVPHLTV-GGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVK--------- 303 (347) Q Consensus 234 ~G~v~~i~G~~V~~sn~lp~-~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~--------- 303 (347) .-.+..+.|++.+..-.--. ...+. ...-..+.+.|.+.+.+.++.. T Consensus 204 ~~~la~l~~v~~V~vg~a~y~~~~~~-----------------------~~~iw~~~~~l~y~~~~~~~~~~~~~~ps~G 260 (307) T protein:vir:79 204 VDLLKEIFEVENIAVGEAIYADDKDR-----------------------FTDIWGANIVLAYVPLQRGGQQRTPYEPSYG 260 (307) T ss_pred HHHHHHHhCceeEEEeeeeeeccccc-----------------------chhcCCCceEEEecccccCCCCCcccccccc Confidence 22334455655332111100 00000 0000112222333322221111 Q ss_pred ----hhheeeccccchhhHhhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 304 ----LKDMALERARRPEFQADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 304 ----~~~~~~e~~~~~~~~~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) .+.......|.....+|.|+.. |..-.+..++.| T Consensus 261 yt~~~~g~~~~d~~~~~~~~~~vrv~----------~~~~~~i~~~~~ 298 (307) T protein:vir:79 261 YTLRKKGNPVVDTRIEDGKLELVRAT----------DIFRPYLLGADA 298 (307) T ss_pred eeEEecCceEEecccCCCceeEEeec----------ccccceeecccc Confidence 0000000011111122222111 111222333333 No 193 >protein:vir:107882 Length: 307 # NCBI annotation: gp34 # Family: family:all:908 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024707;genbank:gi:48696944;genbank:GeneID:2845970 Probab=89.98 E-value=0.021 Score=29.77 Aligned_cols=289 Identities=11% Similarity=0.036 Sum_probs=106.0 Q ss_pred CCCCccCccccccCcccCccccHH--HHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEeccccceeeeec-CCC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKL--ALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVMGRTKGYYLA-PGE 77 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~--al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~iG~~t~~~~~-~g~ 77 (347) |+.++- .|+ -|+. ++-+--+..+ |-..+ +.+.+.+ ..++|+-.+|+.-+-....+.. ++. T Consensus 1 m~~~~~-----~~~------~dp~LT~~A~gy~n~~----~ia~~-l~P~vpv-~~~~~k~~~f~~eaF~~~~t~r~~~~ 63 (307) T protein:vir:10 1 MGRLSK-----LRI------VDPVLTNLAIGYTNAE----FIGQS-LMPVVEV-EKEGGKIPKFGKESFRLYKTERALRA 63 (307) T ss_pred CCCCCC-----Ccc------cChhHHHHHHhhcchh----hhhhh-cCCcccc-cccccceeeECcccccchhhhcccCC Confidence 666442 111 1211 1122111111 11111 1222222 2234555555432211111111 111 Q ss_pred CCCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccCc Q lcl|Aclame:pro 78 NLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIAG 157 (347) Q Consensus 78 ~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~g 157 (347) .. ...+.-..+.....+-+.- -...||+-+...+.||++....+.....|.+..+-.+. ...-... . T Consensus 64 ~~-~~v~~~~~~~~~~~~~~~~-L~~~id~r~~~~~~~~~~~~av~~l~d~I~l~~E~~~A-------~l~~~~~----~ 130 (307) T protein:vir:10 64 RS-NRMNPEDLGSIDIVLDEHD-LEYPIDYREDQESAFPLEQAAVQTATEAIQLRREKMVA-------DLAQNPN----S 130 (307) T ss_pred Cc-ceeeccccccccccccccc-ccccCChhhcCCCCCCHHHHHHHHHHHHHHHHHHHHHH-------HHhcCcc----c Confidence 11 0111001111222222221 22456777778889998887766665555444443332 2111111 1 Q ss_pred ccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhh-hccccccccccc Q lcl|Aclame:pro 158 LGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAA-NYAALIDPETGN 236 (347) Q Consensus 158 ~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~-~~~~~~~~~~G~ 236 (347) .+.++.+++.++..=.++..+ .+..|.++++.+.+..- ..-..+++++..|..|+.|+++... ++...+.+..-. T Consensus 131 y~~~~k~tLsGt~~Wsd~~sD---Pi~di~~~~~ai~~~~g-~~Pn~~vlg~~a~~al~~hp~i~e~lk~~~~g~it~~~ 206 (307) T protein:vir:10 131 YAGGNKKQLSATEKFTAAGSD---PVGVIEDGKEAIRTKIG-RRPNTMVIGASAYKTLKAHPQLIEKIKYSMKGIVTVDL 206 (307) T ss_pred cCCCceEEeccccccCCCCCC---cHHHHHHHHHHHHhhhC-CccceEEeCHHHHHHHhcCHHHHHHhCCccccccCHHH Confidence 122334444333211122222 24556666666665432 2235889999999999999998864 444333222223 Q ss_pred eEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhh--eeeccccc Q lcl|Aclame:pro 237 IRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKD--MALERARR 314 (347) Q Consensus 237 v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~--~~~e~~~~ 314 (347) +..+.|++.+....--... .. .++..-..+.+.|.+.+...+..+... ++.=-++ T Consensus 207 la~ll~v~~i~vg~a~~~~-------------~~---------~~~~~iw~~~~vl~yv~~~~~~~~~~~~epsfGyT~- 263 (307) T protein:vir:10 207 LKEIFEVENIAVGEAIYAD-------------DK---------DRFTDIWGANIVLAYVPLQRGGQQRTPYEPSYGYTL- 263 (307) T ss_pred HHHHhCceeEEEeeeeeec-------------cC---------CccceeCCCceEEEecccccCCCCCcccccccceeE- Confidence 4456676665533211100 00 001001112222333322221111000 0000000 Q ss_pred hhhHhhHHhhhhhhcCcc--cc-cceEEEEEecCCC Q lcl|Aclame:pro 315 PEFQADQIIGKYAMGHGG--LR-PEAAGALVFTPAA 347 (347) Q Consensus 315 ~~~~~d~i~~~~~~G~~~--lR-Pe~~~~l~~~~aa 347 (347) ++++..+...+--+.|+ +| =|-.-.+..++.| T Consensus 264 -~~~g~~~~d~~~~~~~~~~~r~~~~~~~~i~~~~~ 298 (307) T protein:vir:10 264 -RKKGNPVVDTRIEDGKLELVRSTDIFRPYLLGADA 298 (307) T ss_pred -EEcCCeEeeceecCCceeEEeccccccceeecccc Confidence 01111111111112221 21 1223334444445 No 194 >protein:vir:98871 Length: 314 # NCBI annotation: major capsid protein # Family: family:all:3269 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164418;genbank:gi:56694908;genbank:GeneID:3197261 Probab=88.19 E-value=0.034 Score=28.64 Aligned_cols=280 Identities=16% Similarity=0.121 Sum_probs=122.5 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhccccc--c---cccCCceEEEeccccc--ee-ee Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMV--R---TIQNGKSASFPVMGRT--KG-YY 72 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~--r---ti~~G~tv~i~~iG~~--t~-~~ 72 (347) +-|++-..+. ..|.+--++ .|-|+|.|.+.+-|+.++.|++..-- + -+.+.++.---....+ .+ .. T Consensus 11 ~~~~~~~~~~-----t~N~n~avr-~Y~Kqf~glL~~vf~~qa~F~~~FGg~lQalDGV~~N~tafsvKtsD~pVVig~~ 84 (314) T protein:vir:98 11 LNNIQFFASG-----TANQNKAAR-SYQKEFRQLLQAVFRSQAYFRDFFGGGIEALDGVQHNDTAFYVKTSDIPVVVGNE 84 (314) T ss_pred ccceeeeeec-----cccCcccee-eecHHHHHHHHHHHhhHhhhhhhcccceeeccCCCccceEEEEeecccceeecCc Confidence 3333322221 111112233 57799999999999999999876432 1 2332222211111111 11 12 Q ss_pred ecCCCCC---CCCCCCCCCCce--EEEEeee-eecchh-h-ccHHHHHhCcchHHHH---HHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 73 LAPGENL---DDKRKDIKHSEK--VIQIDGL-LTSDVL-I-YDIEDAMNHYDVRAEY---SAQLGEALAIAADGAVLAEM 141 (347) Q Consensus 73 ~~~g~~~---~~~~~~~~~~~~--~l~ID~~-~~~~~~-V-dd~D~~q~~~D~r~~~---~~~~g~aLa~~~D~~il~~l 141 (347) |..+.+. .++-+.-...++ .+..|+. .|..-+ | .-+|..-.+-|+.... .+.+++|-.+.+|..+=..| T Consensus 85 Y~TdeNvaFGtGTg~SsRFGprkEi~y~dtdVpY~~~~~iHEGiD~~TVNnd~~aaVAdRL~LQA~Akt~~~n~~~Gk~l 164 (314) T protein:vir:98 85 YNKDENVGFGEGTSRSTRFGPRREIIYQDTPVPYTWEWVYHEGIDKHTVNNDFQAAVADRLDLQANAKIKQFNAQHSKFI 164 (314) T ss_pred ccCCCCcccccCCccccccCceeEEEeecccccccccchhhhccccccccCChhHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 3322221 111111111111 2233322 222222 2 4566666666665555 44567777888887654433 Q ss_pred HHhhhcccccccccCcccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCC---CCCEEEEChHHHHHHhcc Q lcl|Aclame:pro 142 AKLCNLPAASNENIAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPA---GDRRFYCAPEDYSAILSA 218 (347) Q Consensus 142 ~~~a~~a~~~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~---~gR~~vv~P~~~~~Ll~~ 218 (347) ...|-.+. . -++ .. .|.+.++..++.+..|-- ....+.|.|+.|.+|+.+ T Consensus 165 S~~As~te----~--------------ltd-----~~----~d~V~~LF~~as~~yvn~ev~~~~~AyV~~evYnaiiD~ 217 (314) T protein:vir:98 165 SSIAEKTE----T--------------LTD-----YS----ADNVLRLFNELSKYYVNIEAIGTKAAKVSPELYNAIVDH 217 (314) T ss_pred Hhhhhhhh----h--------------hhh-----cc----hhhHHHHHHHHHhhhhcceeeEEEEEEEchhHHhHhhcc Confidence 32221100 0 000 01 133444445555554422 236678999999999988 Q ss_pred hhhhhhhccccccccccceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhh Q lcl|Aclame:pro 219 LMPNAANYAALIDPETGNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSA 298 (347) Q Consensus 219 ~~~~~~~~~~~~~~~~G~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A 298 (347) .-.+... +++.++-.-.|.+.-||.|-|.|.--.... ..+-.+ .+.++..| T Consensus 218 ~l~TsaK-~SsaNIDengi~~FkGf~i~e~P~~~~q~g-------~ia~~s-----------------~dnig~af---- 268 (314) T protein:vir:98 218 PLTTSAK-SSSANIDQNGIVNFKGFAIQEIPESMLQSG-------DVAYTY-----------------ITNIGKAF---- 268 (314) T ss_pred ccccccc-cceeeeccCCcceecceEEEecchhhcCCC-------cEEEEc-----------------cccceeec---- Confidence 7555432 233445556677899999988664321100 000000 01111111 Q ss_pred hhhhhhhheeeccccchhhHhhHHhhhhhhcCcccccceEEE--EEecCCC Q lcl|Aclame:pro 299 VGTVKLKDMALERARRPEFQADQIIGKYAMGHGGLRPEAAGA--LVFTPAA 347 (347) Q Consensus 299 ~~tv~~~~~~~e~~~~~~~~~d~i~~~~~~G~~~lRPe~~~~--l~~~~aa 347 (347) +|-=.+.-++.|.+ -|-.+-|-=-||--++.-...+. +..+|.+ T Consensus 269 tGIn~aR~IesEdF-----~GValQgAGK~G~~I~edNk~Ai~k~t~tp~~ 314 (314) T protein:vir:98 269 TGINTSRIIESEDF-----DGVALQGAGKAGEFILDDNKKAVAKVTSTPEG 314 (314) T ss_pred ccceeeeeeecccc-----cceeeecccccccccccccceeeEEEecCCCC Confidence 00000111122222 12222222234444444434444 4455666 No 195 >protein:vir:99888 Length: 309 # NCBI annotation: capsid protein # Family: family:all:908 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164075;genbank:gi:56692607;genbank:GeneID:3192616 Probab=87.63 E-value=0.037 Score=28.40 Aligned_cols=283 Identities=11% Similarity=0.035 Sum_probs=104.8 Q ss_pred CCCCccC-ccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEeccccceee----ee-c Q lcl|Aclame:pro 1 MANATGG-QQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVMGRTKGY----YL-A 74 (347) Q Consensus 1 m~~~~~~-~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~iG~~t~~----~~-~ 74 (347) |||..=. ...+|..-.|.. .++ |-...+| +.+++ ...+.+++..|+..+. +. . T Consensus 1 ~~~~~~~~dp~LT~~A~gy~------------n~~----~Ia~~l~-P~vpV----~~~~~~~~~f~~~e~F~~~~t~r~ 59 (309) T protein:vir:99 1 MSNAPFPIDPELTAIAIAYR------------NGR----MISDEVL-PRVPV----GKQEFKFWKYDLAQGFTVPETLVG 59 (309) T ss_pred CCCCCcCcCHhHHHHHhhcc------------Chh----hhhhhcC-Ccccc----Cccccceeeechhhcccccchhhc Confidence 8874211 111222211111 111 1112222 33333 2233444444543211 11 1 Q ss_pred CCCCCCCCCCCCCCCceEEEEeeee-ecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccc Q lcl|Aclame:pro 75 PGENLDDKRKDIKHSEKVIQIDGLL-TSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNE 153 (347) Q Consensus 75 ~g~~~~~~~~~~~~~~~~l~ID~~~-~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~ 153 (347) ++.... . -+...++.++.+.+.- .......++.++...||++....+.....|....+..+ +......+. T Consensus 60 ~~~~~~-~-v~~~~~~~~~~~~~~~L~~~i~~~~~~~a~~~~d~~~~Av~~l~~~i~l~rE~~~-------A~lv~~~a~ 130 (309) T protein:vir:99 60 RKSKPN-E-VEFSATDETGSTEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREART-------SKLVFSPNS 130 (309) T ss_pred cCCCcc-e-EeecccCceeeecccceeecCCchhhhhccCCCCHHHHHHHHHHHHHHHHHHHHH-------HHHhcChhh Confidence 121111 1 1123334444444332 22333345557777899988887776665555444322 222211111 Q ss_pred ccCcccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhh-hcccc--c Q lcl|Aclame:pro 154 NIAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAA-NYAAL--I 230 (347) Q Consensus 154 ~~~g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~-~~~~~--~ 230 (347) .+.++.++++.+.--.++..+ .+..|.++++++. --| -.++++...|..|+.|+++... ++++. + T Consensus 131 ----y~~~~k~~Lsgt~~wsd~~SD---Pi~~i~~~~~~~g--~~P---N~~vlg~~~~~~l~~hp~i~~~ik~~~~~~g 198 (309) T protein:vir:99 131 ----YAAGNKTTLSGADQWSDPTSN---PLPVITDALDSVI--LRP---NIGVLGRRTATILRRHPKIVKAYNGSLGDEG 198 (309) T ss_pred ----cCCCceEEecCccccCCCCCC---cHHHHHHHHHhhC--CCc---ceEEechHHHHHHhhCHHHHHHhcCCCcccc Confidence 122334444333222223222 2344544544431 112 4889999999999999999876 45432 1 Q ss_pred cccccceEEEecee-EEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheee Q lcl|Aclame:pro 231 DPETGNIRNVMGFE-VIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMAL 309 (347) Q Consensus 231 ~~~~G~v~~i~G~~-V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~ 309 (347) -+..-.+..+.|++ |+.-...-.. + ..+...+...-..+.+.|++......+.+. ++. T Consensus 199 ~it~~~la~l~~ve~V~vg~a~~n~--------------a-----~~g~~~~~~~iwg~~~~L~y~~~~~~~~~~--ps~ 257 (309) T protein:vir:99 199 MVPMAFLQELLELDAIYIGEARLNI--------------A-----RPGQNPNLIRAWGPHASFIYRDRLADTRNG--TTF 257 (309) T ss_pred ccCHHHHHHHhCcceEEeecceeec--------------c-----ccccccccccccCCcEEEEEcCCCCCCccc--ccc Confidence 12233445566774 4432111000 0 000001111111223334443332222211 111 Q ss_pred cc--ccchhhHhhHHhhhhhh-cCcccc-----------cceEEEEEecCCC Q lcl|Aclame:pro 310 ER--ARRPEFQADQIIGKYAM-GHGGLR-----------PEAAGALVFTPAA 347 (347) Q Consensus 310 e~--~~~~~~~~d~i~~~~~~-G~~~lR-----------Pe~~~~l~~~~aa 347 (347) .- .|..+..+..++-.+-. |...+| |++ |-|.+.+.| T Consensus 258 G~t~~~~~r~~g~~~d~~~~~~g~~~vr~~~~~k~~i~~~d~-G~li~~~va 308 (309) T protein:vir:99 258 GLTAQWGDRVSGSIADPNIGLRGGQRVRVGESVKELVTAPDL-GFFFENAVA 308 (309) T ss_pred cceeecccccCCceeeeeeccCCceEEEEeccccchhcchhc-chhhhhccc Confidence 11 12222233222222211 112222 111 111111111 No 196 >protein:vir:103181 Length: 457 # NCBI annotation: gp135 # Family: family:all:364 # MgeID: mge:1583 # MgeName: Syn9 # Cross-refs: genbank:acc:YP_717802;genbank:gi:113200639;genbank:GeneID:4239190 Probab=86.80 E-value=0.043 Score=28.07 Aligned_cols=305 Identities=17% Similarity=0.088 Sum_probs=144.5 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccc---cccccCCceEEE------------ecc Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHM---VRTIQNGKSASF------------PVM 65 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~---~rti~~G~tv~i------------~~i 65 (347) |+.+++.=.. -|..+++..+...+-.-|.|-.|..+.|.-..-...... .....+....-. ..- T Consensus 97 mTgPTGLIFA-mRsrY~~q~~~~~a~~~EAl~nEadt~fSg~~~~~~~~~~~~~~~~~gt~~~~~~~~~~~~~~~~~~~~ 175 (457) T protein:vir:10 97 MTGPTGLIFA-MRTNYGAERNPAAAGYDEAFFNEPNAGFSGGPGAYDPGATGVTNDAEGTNPALLNDSPAGTYEQADDAT 175 (457) T ss_pred CCCcceeeee-eeeeecCccccccccccceeeeccCcccCcccccccccccccccccccccccccCcccccccccccccc Confidence 7666542221 233344333321111223333445555432110000000 000011100000 011 Q ss_pred ccceeeeecCCCCCCCCCCCCCCCceEEEEeeeee--------cchhhccHHHHHh-C-cchHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 66 GRTKGYYLAPGENLDDKRKDIKHSEKVIQIDGLLT--------SDVLIYDIEDAMN-H-YDVRAEYSAQLGEALAIAADG 135 (347) Q Consensus 66 G~~t~~~~~~g~~~~~~~~~~~~~~~~l~ID~~~~--------~~~~Vdd~D~~q~-~-~D~r~~~~~~~g~aLa~~~D~ 135 (347) |..++ .++.+-....+....|.-+.||+... +...+.-..+.++ | .|.-.|++.=.+.++..++.+ T Consensus 176 gmsTA----~aE~lgd~~~n~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAiHGLDAEtELaNILStEImlEINR 251 (457) T protein:vir:10 176 GMSTA----TVEALDDSTANTAFREMGFSIEKVTVTARARALKAEYSIEMAQDLKAIHGLDAEQELANILSTEILAEINR 251 (457) T ss_pred chhhh----hhhccCCCCCccchhhheeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhH Confidence 11111 11111100111234677888887643 3445665666666 4 899999999999999999999 Q ss_pred HHHHHHHHhhhcccccccccCcccCceeeeecccccccchhhHHHHH-HHHHHHHHHHHhhccCCCCCCEEEEChHHHHH Q lcl|Aclame:pro 136 AVLAEMAKLCNLPAASNENIAGLGQAVVLNIGAAADLVDVEARGKAI-LKGLTLARARLTKNYVPAGDRRFYCAPEDYSA 214 (347) Q Consensus 136 ~il~~l~~~a~~a~~~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~i-~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~ 214 (347) -|++.|...+... .+.+.....+..+....+........+.+ +.--+++.....+.- --.+.|+|.+|+..+. T Consensus 252 eii~~l~~~a~~~-----~~~~~~~~gv~dl~~~~~g~~~~e~~k~L~~~i~~ean~i~~~T~-rg~gn~~i~S~~Va~~ 325 (457) T protein:vir:10 252 EVVRTIYTNAVAG-----AQNNTATAGVFDLDVDSNGRWSVEKFKGLLFQIERDANAIGHQTR-RGKGNILICSADVVSA 325 (457) T ss_pred HHHHhHhhhheee-----eccccccceeeeeeccccchhhHHHHHHHHHHHHHHHHHHHHhhc-cccceEEEEchhHHHH Confidence 9999887655321 11221112222222222211122223333 222244444333322 2357899999999999 Q ss_pred Hhcchh--hhhh--hccc---cccccccceEEEe-ceeEEEe----ccccccccccccccCccccccccccccccccccc Q lcl|Aclame:pro 215 ILSALM--PNAA--NYAA---LIDPETGNIRNVM-GFEVIEV----PHLTVGGAGDNNPADGVAPTNQKHIFPATATGDD 282 (347) Q Consensus 215 Ll~~~~--~~~~--~~~~---~~~~~~G~v~~i~-G~~V~~s----n~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y 282 (347) |-...- +... .+.+ .++.....+|.+. |++||.- +|-|.--.. -.| T Consensus 326 L~~sg~l~~~p~~~~~~~~~~~d~~~~~~~G~l~~r~~vy~D~Ya~~ns~~dy~~----------------------vG~ 383 (457) T protein:vir:10 326 LGMAGVLDYTPALNGNNGLAGVDDTSSTLVGTLNGRIKVYVDPYSANVADKHFYV----------------------AGY 383 (457) T ss_pred HhhcccccccchhhccccccccccccceeEEEecCCeEEEEecccccCCccceEE----------------------EEE Confidence 877532 2211 1111 1234444577764 8888886 443421100 012 Q ss_pred cccccceeEEeechhhhhhhhhhheeeccccchhhHhhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 283 RVAQNNVVGLFNHRSAVGTVKLKDMALERARRPEFQADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 283 ~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~~~~~~~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) +++..-..+|||.|=- ++.++ ...||+.|.-.|-.+.+||- +.+|.... +..+.+. T Consensus 384 KG~~~~~~glfy~PYv----~l~~~---~~~dp~sfqP~~g~~tRY~l-~~NP~~~~-~~~~~~~ 439 (457) T protein:vir:10 384 KGTSPYDAGLFYCPYV----PLQQV---RAINPDTFQPKIGFKTRYGM-VSNPFAGG-LTQGSGA 439 (457) T ss_pred eCCcceecceeecccc----ccccc---CccCCccccceeeeeeeeee-eecccccc-ccccccc Confidence 2333333468887753 33332 23389988888888888887 77887553 3333334 No 197 >protein:vir:97255 Length: 310 # NCBI annotation: hypothetical protein ORF017 # Family: family:all:1120 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294525;genbank:gi:149408246;genbank:GeneID:5237120 Probab=84.60 E-value=0.059 Score=27.31 Aligned_cols=288 Identities=12% Similarity=0.043 Sum_probs=121.4 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEeccccc---eeeeec-CC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVMGRT---KGYYLA-PG 76 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~iG~~---t~~~~~-~g 76 (347) |.-.+. .-++-...|.. ...|.+.|.+.|.+....+-..+. |++.+.++.-.. ...... +- T Consensus 1 mpaltL-------aea~k~~~d~l-------~~~ViE~~~~~s~lL~~LpF~~ve-g~~~~ynR~~~~~~~~~~~v~~~~ 65 (310) T protein:vir:97 1 MASVTL-------AESAKLAQDEL-------VAGVIENIITVNRMFDVLPFDSIE-GNSLAYNRENVLGDVIMAGVGTTF 65 (310) T ss_pred Ccccch-------HHHhhcCcchH-------HHHHHHHHhccchHHHhCCccccc-CCcceeeEeeccCCcccccccccc Confidence 553322 11222223322 345677888777666665555555 556777765322 111100 00 Q ss_pred CCCCCCCCCCCCCceEEEEeeeeecchhhccHHHH--H---h-CcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccc Q lcl|Aclame:pro 77 ENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDA--M---N-HYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAA 150 (347) Q Consensus 77 ~~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~--q---~-~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~ 150 (347) .+...+....+.++++.. +..+.- +-++|.. + . -+|.+.+-.+...++|++++...++. +- +. T Consensus 66 ~~~g~~~~~~t~~~~~~~---L~i~~g-~~~Vd~~i~dl~~~~~~dq~~~Ql~~~iea~~~~~e~~lIN----GD---~a 134 (310) T protein:vir:97 66 SGAGAGKAAATFTKVNSN---LTTIMG-DAEVNGLIQATRSGDGNDQTAVQIASKAKSAGRKYQDQLIN----GN---GA 134 (310) T ss_pred cCCCccccccccceeeee---eeeeee-hhhhhhHHHhhhcCChHHHHHHHHHHHHHHHHHHHHHHhhc----cc---cC Confidence 000000011122333322 222222 2244432 1 2 33566666788888998888664442 10 00 Q ss_pred cccccCcc----cCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhh--h Q lcl|Aclame:pro 151 SNENIAGL----GQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNA--A 224 (347) Q Consensus 151 ~~~~~~g~----~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~--~ 224 (347) ..+ ..|. ..+..|..++.+...+++ ..|.|+++-- .+ .-+..+++..|+.+..+-.--|-.. . T Consensus 135 ~n~-F~GL~~~~~~~q~i~~~~~gg~~t~d-----~LDeLl~~v~--~~---~g~p~~~l~~~~~~r~i~A~~R~~~~~g 203 (310) T protein:vir:97 135 GNE-FAGLIQLCASGQKATTGATGSAISFA-----ILDELMDLVV--DK---DGQVDYLTMHARTLRSYKALLRALGGAS 203 (310) T ss_pred CCc-ccchhhcCCccceeecCCCCCCCCHH-----HHHHHHHHHh--cC---CCCCCEEEecHHHHHHHHHHHHHhcCCC Confidence 111 1111 222334333332332322 1343332211 11 1234689999987655554333222 1 Q ss_pred hccccccccccceEEEeceeEEEeccccccccccccccCcccccccccccccccccccccccc---ceeEEeechhhhhh Q lcl|Aclame:pro 225 NYAALIDPETGNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQN---NVVGLFNHRSAVGT 301 (347) Q Consensus 225 ~~~~~~~~~~G~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~---~~~~l~~h~~A~~t 301 (347) -|....+.--..|-.+.|++|+.++.+|...... + .++.+.-|..-+- ..+||+.-.- T Consensus 204 ~~~~~~~~~G~~v~~~~GiPi~~~d~ip~~~~~~---------~------~~gtTsIya~r~Ge~~~~~Gv~Gl~~---- 264 (310) T protein:vir:97 204 INEVVELPSGAEVPAYSGTPIFRNDYIPTNQTKG---------G------TTGCTTIFAGTLDDGSRTHGIAGLTA---- 264 (310) T ss_pred CCCccccCCCCEEeeeCCeEEEEeCccCCCcccc---------c------cCCceeEEEEeeCccccccceecccc---- Confidence 2222233333367889999999999999653210 0 0112222322221 1123322000 Q ss_pred hhhhheeecccc---chhhHhhHHhhhhhhcCcccccceEEEEEecCC Q lcl|Aclame:pro 302 VKLKDMALERAR---RPEFQADQIIGKYAMGHGGLRPEAAGALVFTPA 346 (347) Q Consensus 302 v~~~~~~~e~~~---~~~~~~d~i~~~~~~G~~~lRPe~~~~l~~~~a 346 (347) ...-.++++-.- +.--..+.| ...+|..++.|++++.|.-=-- T Consensus 265 ~~~~glsVr~~G~~~~~~v~~~~V--~~Y~~~av~~~~A~a~L~~V~~ 310 (310) T protein:vir:97 265 TQAAGIQVVDVGESEDSDEHIWRV--KWYCGLALFSEKGLACADGITN 310 (310) T ss_pred CCccceeEEeCCcccCCcceeEEE--EEeeeEEEecccceeeeccccC Confidence 000012222211 101112222 1236778888888888762222 No 198 >protein:vir:79642 Length: 329 # NCBI annotation: HsbB # Family: family:all:463 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285525;genbank:gi:148734508;genbank:GeneID:5220000 Probab=80.84 E-value=0.091 Score=26.29 Aligned_cols=289 Identities=9% Similarity=0.055 Sum_probs=122.4 Q ss_pred CCCCccCccccccCcccCccccHHHHHH-HHHhHHHHHHHHH----HHhhhccccccc-cc-CCceEEEecc---cccee Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFL-KVFGGEVLTAFVR----RSVTMDKHMVRT-IQ-NGKSASFPVM---GRTKG 70 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~i-e~f~geV~~~f~~----~s~~~~~~~~rt-i~-~G~tv~i~~i---G~~t~ 70 (347) |..++. ++.- ..+|.-..|+ ++++ .|+....+ .-..+.++.+++ +. .-.++.+... |..+ T Consensus 21 a~~~~~-------~~~~-~~~~~~~~f~~~ql~-~id~~v~e~~~~~l~~~~~i~i~~~~~~~~~~~t~~~~~~~G~a~- 90 (329) T protein:vir:79 21 ANHMQL-------RGAK-NDASDMGIWTSQELH-KIKAQAYEKEYPAGSALRVFPVTSELSDTDKTFEYQTFDKVGHAK- 90 (329) T ss_pred hhhccc-------ccce-eccchhhHHHHHHHH-HHHHHHHhhhhcccchhhhcccccCCCCceeEEEeeeeecceeee- Confidence 111111 1111 1122223465 4432 44443332 234455555553 22 2335554443 4443 Q ss_pred eeecC-CCCCCCCCCCCCCCceEEEEeee-eecchhhccHHHH-HhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhc Q lcl|Aclame:pro 71 YYLAP-GENLDDKRKDIKHSEKVIQIDGL-LTSDVLIYDIEDA-MNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNL 147 (347) Q Consensus 71 ~~~~~-g~~~~~~~~~~~~~~~~l~ID~~-~~~~~~Vdd~D~~-q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~ 147 (347) -|.. +++++. .++.-.+....|-.. .-+.+.+.++..+ +...++-.+-...+..+++++.|+.++.=- . T Consensus 91 -~~~d~~~dip~--vd~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i~f~G~-----~ 162 (329) T protein:vir:79 91 -IIADYTDDLST--VDALMTSEFGKVFRLGNAFLISIDEIKAGQRTGKSLSTRKANAAQNAHDQLVNHLVFKGS-----K 162 (329) T ss_pred -eecCcccccce--eecccceeEEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEeec-----c Confidence 2222 123322 123333333333332 1223445666666 467888888888889999999998665211 0 Q ss_pred ccccccccCcccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCC-CCCCEEEEChHHHHHHhcchhhhhhhc Q lcl|Aclame:pro 148 PAASNENIAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVP-AGDRRFYCAPEDYSAILSALMPNAANY 226 (347) Q Consensus 148 a~~~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP-~~gR~~vv~P~~~~~Ll~~~~~~~~~~ 226 (347) .....+..+ +++-..+..++.+...-..+..+.+++.|.++..++.++.-= ...-.++|+|+.|..|..- ..+.+. T Consensus 163 ~~g~~GLlN-~p~v~~~~~~~~~~~~w~~kt~~ei~~di~~~~~~l~~~s~g~~~p~~L~Lpp~~~~~L~~~--~~~~~~ 239 (329) T protein:vir:79 163 PHKIISVFE-HPNLTTINSAGWNNAAGTGKKPETAQDELEQAIEKIETLTNGQHRANMILIPPSMRKVLMVR--MPETTM 239 (329) T ss_pred cccceeeec-CCCccccccCCCCCccccccCHHHHHHHHHHHHHHHHHhcCceecccEEEecHHHHHHhhcc--cCCCCc Confidence 001111111 111111112222222223346677899999998888775210 0113688999999888531 111111 Q ss_pred cccccccccceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechh--hhhhhhh Q lcl|Aclame:pro 227 AALIDPETGNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRS--AVGTVKL 304 (347) Q Consensus 227 ~~~~~~~~G~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~--A~~tv~~ 304 (347) +-..-+.. +...++|...+.|-. ++..+ +.+++++..+ -+..... T Consensus 240 tvl~~lk~----~~~~l~I~~~~el~~------------ag~~g-----------------~~~~v~y~~~~~~~~~~vp 286 (329) T protein:vir:79 240 SYLDYFKQ----QNGGITIESISELED------------IDGAG-----------------TKAALVYEKDPMNMSIEIP 286 (329) T ss_pred cHHHHHHH----hCCCcEEEEcccccc------------cCCCC-----------------ceEEEEEecCCceEEEecC Confidence 10011111 112345555444421 00000 1112222211 1111122 Q ss_pred hheeeccccchhhHhhHHhhhhhh-cCcccccceEEEEE---ec Q lcl|Aclame:pro 305 KDMALERARRPEFQADQIIGKYAM-GHGGLRPEAAGALV---FT 344 (347) Q Consensus 305 ~~~~~e~~~~~~~~~d~i~~~~~~-G~~~lRPe~~~~l~---~~ 344 (347) ++++... -.++...+.+...... |.-+.||++++-+. .. T Consensus 287 ~~~~~l~-~q~~~~~~~v~~~~r~~Gv~i~~P~ai~~~dGI~~~ 329 (329) T protein:vir:79 287 EAFNMLT-AQPKDLHFKVPCTSKCTGLTIYRPLTLVLIKGLVVG 329 (329) T ss_pred cceeeee-ceecCceEEEceeeeEEEEEEECcceeeeeeeeeeC Confidence 2322211 1222233445555555 46888999984432 23 No 199 >protein:vir:5942 Length: 523 # NCBI annotation: similar to major head protein # Family: family:all:364 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835728;genbank:gi:30044131 Probab=80.62 E-value=0.093 Score=26.23 Aligned_cols=311 Identities=14% Similarity=0.013 Sum_probs=135.5 Q ss_pred CCCCccCc--cccccCccc---------C--cc---ccHHHHHHHHHhHHH---HHHHHHHHhhhcccccccccCCce-- Q lcl|Aclame:pro 1 MANATGGQ--QIGANQGKG---------Q--SA---ADKLALFLKVFGGEV---LTAFVRRSVTMDKHMVRTIQNGKS-- 59 (347) Q Consensus 1 m~~~~~~~--~~~~~~~~~---------~--~~---~d~~al~ie~f~geV---~~~f~~~s~~~~~~~~rti~~G~t-- 59 (347) |++...-. ...++.+++ . .. +..-+..++.|.+.+ -.+|........-.......+|.. T Consensus 162 ~s~si~k~~vTa~s~agta~~~li~A~~~q~itg~tga~fa~s~~~an~astAss~Al~gEA~t~~sTd~at~~~Gtt~t 241 (523) T protein:vir:59 162 SSGAVYYVDVPVASLPGVADVNTVRFWQYDDASGDPENTVAYPLPRYNRIVGAVGSALYARLFFVTGSDFATVAGGTPST 241 (523) T ss_pred cccceeeeeccccccccccccccccccccccccccccccccchhhccccccccccccccccccccccccccccCCCcccc Confidence 33321000 000111111 0 01 111112223332211 111111110000011111111110 Q ss_pred -----EEEeccccceeeeecCCCCCCCCCCCCCCCceEEEEeeeee--------cchhhccHHHHHh--C-cchHHHHHH Q lcl|Aclame:pro 60 -----ASFPVMGRTKGYYLAPGENLDDKRKDIKHSEKVIQIDGLLT--------SDVLIYDIEDAMN--H-YDVRAEYSA 123 (347) Q Consensus 60 -----v~i~~iG~~t~~~~~~g~~~~~~~~~~~~~~~~l~ID~~~~--------~~~~Vdd~D~~q~--~-~D~r~~~~~ 123 (347) +.-...|..+..--..+.............|.-+.||+... +...+.-..+.++ + .|.-.|++. T Consensus 242 ~~~~~lyt~~~g~~t~~~~~~~~~~~~~~~~~~~~eM~FsIeK~tVtAkSRaLKAeYT~ELAQDLKAiH~GLDAE~ELan 321 (523) T protein:vir:59 242 QDLDLVYYIDARNDFEDQSTDPDYPDPGFQSLDIPEINLELRSRPVATKTRKLRAAWTPEAMQDLAAYHKGVDLENEIVT 321 (523) T ss_pred cccccccccccccchhhccccccccccccccccccceeeEEEeEEEeeecccccccccHHHHHHHHHHhcCCChhHHHHH Confidence 00001111110000001100000122345677888887643 3344555556666 3 899999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhhcccccccccCcccCceeeeecccccccchh-hHHHHHHHHHHHHHHHHhhc--cCC- Q lcl|Aclame:pro 124 QLGEALAIAADGAVLAEMAKLCNLPAASNENIAGLGQAVVLNIGAAADLVDVE-ARGKAILKGLTLARARLTKN--YVP- 199 (347) Q Consensus 124 ~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~g~~~~~~i~~~~~~~~~~~~-~~~~~i~~~l~~a~~~Lde~--~VP- 199 (347) =++.++..++.+-|++.|...+.... ..+.....+..+...++..... ......++.+..+..+++|. .+- T Consensus 322 ILStEImlEINR~ii~~~~~~a~~~~-----~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~n~i~~ 396 (523) T protein:vir:59 322 LMSQYIAREIDLEILSTIMAHARRTD-----NYGFWSEVVGEYYDETSGNFVAGNFYGSKQEWLATLMIELNKVSNRIQQ 396 (523) T ss_pred HHHHHHHHHhhHHHHHhHhhhheeee-----eccccccceeeecccccchhhhhhhhhhhHHHHHHHHHHHHHHHHHHHH Confidence 99999999999999998876553221 1111111111111111111110 00001122222222222221 121 Q ss_pred ----CCCCEEEEChHHHHHHhcchhhhhhhcccccccccc--ceEEE-eceeEEEeccccccccccccccCccccccccc Q lcl|Aclame:pro 200 ----AGDRRFYCAPEDYSAILSALMPNAANYAALIDPETG--NIRNV-MGFEVIEVPHLTVGGAGDNNPADGVAPTNQKH 272 (347) Q Consensus 200 ----~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G--~v~~i-~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~ 272 (347) -.+-|+|++|+....|-...-+...+... ....| .+|.+ .|++||.-++.|..-. ..+-+ T Consensus 397 ~t~~~~~~~~~~s~~v~~~l~~~~~~~~~~~~~--~~~~~~~~~g~l~~~~~vy~d~~~~~dy~----------~~g~k- 463 (523) T protein:vir:59 397 KTAVAGANFLVTSPQVAALLESMPGFTPGNDNR--DGGTGIFYVGMVQGRYRLYKNIYQNQPVI----------IMGNQ- 463 (523) T ss_pred hcccccccEEEEchhHHHHHHhccccccCCccc--cccccceeEEEecCceEEEecCCCCcceE----------EEEec- Confidence 14578999999999986655543322211 11122 24555 4889998887764211 11111 Q ss_pred cccccccccccccccceeEEeechhhhhhhhhhheeeccccchhhHhhHHhhhhhhcCcccccceEEEEE---ecC Q lcl|Aclame:pro 273 IFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARRPEFQADQIIGKYAMGHGGLRPEAAGALV---FTP 345 (347) Q Consensus 273 ~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~~~~~~~d~i~~~~~~G~~~lRPe~~~~l~---~~~ 345 (347) +.++.|. .+|||.|=- ++.. .....||+.|.-.|-.+.+||-.|.+|...+-|+ .-| T Consensus 464 ----~~~~~~~------~~~~y~Py~----~l~~--~~~~~dp~s~qp~~~~~tRY~l~v~nP~~~~~~~~~~~~~ 523 (523) T protein:vir:59 464 ----DLNTPWQ------TGAVYAPYV----PLLF--TPTIVDPVNFSYRRGLMTRYALEVVRPEFYGLLYVKLLQP 523 (523) T ss_pred ----ccCCccc------ccceecccc----hhhc--ccccccCCcccceeeeeeehhheecchhHhhhhhhhhcCC Confidence 0111121 368888753 2221 2233588999889999999999999999887665 234 No 200 >protein:vir:106286 Length: 534 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944113;genbank:gi:38640157;genbank:GeneID:2658034 Probab=73.80 E-value=0.17 Score=24.86 Aligned_cols=308 Identities=16% Similarity=0.090 Sum_probs=133.2 Q ss_pred CCCCccCccccccCcccCccccH---HHHHHH-----HHhHHHHH----HHHH------HHhhhccccccccc--CCceE Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADK---LALFLK-----VFGGEVLT----AFVR------RSVTMDKHMVRTIQ--NGKSA 60 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~---~al~ie-----~f~geV~~----~f~~------~s~~~~~~~~rti~--~G~tv 60 (347) |+.+++.=.. -|.-+++.++|. -|+|-| -|+|.--. .|.. ...+..-....+.. +...+ T Consensus 125 MTgPTGLIFA-MRsrY~n~~~~~s~~EAf~ne~~adt~fSG~~~a~~~~~~~~~~a~~~g~~~~~~~~~~t~~~~Gt~~~ 203 (534) T protein:vir:10 125 MTSSTGQVFT-LRAIYGGNSQDANAREAFHPTYGPDADFSGRGAAQDIAVFVRGTAVASGAFAKLHIEAATGVQAGTKTV 203 (534) T ss_pred CCchhhhhee-eeeeecCCCCCcccccccccccccccccccccccccccccccccccccccccccccccccccccccccc Confidence 6555432111 122222222221 122222 12221000 0000 00000000000000 00000 Q ss_pred EE-----------------------------eccccceeeeecCCCCCC--CCCCCCCCCceEEEEeeeee--------c Q lcl|Aclame:pro 61 SF-----------------------------PVMGRTKGYYLAPGENLD--DKRKDIKHSEKVIQIDGLLT--------S 101 (347) Q Consensus 61 ~i-----------------------------~~iG~~t~~~~~~g~~~~--~~~~~~~~~~~~l~ID~~~~--------~ 101 (347) .+ ..+|.... ...++.+- +...+..-.|..+.||+... + T Consensus 204 ~~~~~~~v~~~~~~~~~ag~~~~~~~~~~~~y~~~~gm~--Ta~AE~lg~~ggs~~~~f~EMsFsIdKvtVtAKSRaLKA 281 (534) T protein:vir:10 204 QFIKDYAVDALPADQTEAGLAYKWLLANGYAVETSSAMA--TAFAELQQGFNGSADNEWNEMSFRIDKQVVEAKSRQLKA 281 (534) T ss_pred ccccccccccccCCccccccccccccccccceecccccc--hhhHhhhccCCCCcccchhhcceEEEEEEEeeeccceec Confidence 00 00000000 00001000 00011234567788887643 3 Q ss_pred chhhccHHHHHh-C-cchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccCcccCceeeeecccccccchhhHH Q lcl|Aclame:pro 102 DVLIYDIEDAMN-H-YDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIAGLGQAVVLNIGAAADLVDVEARG 179 (347) Q Consensus 102 ~~~Vdd~D~~q~-~-~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~g~~~~~~i~~~~~~~~~~~~~~~ 179 (347) ...|.-..+.++ | .|.-.|++.=++.++..++.+-|++.|...+.......-...+...| +.......+.......+ T Consensus 282 EYTiELAQDLKAIHGLDAEtELsNILSTEImlEINReii~~l~~~a~~~k~~~~~~~~~~~G-~~d~~~~~~~~~~~~~~ 360 (534) T protein:vir:10 282 QYSIEMAQDLRAVHGLDADSELSSILANEIMHEINREMVLWINATAKVGKTGWTNMHGGKAG-VFDFQDTKDIRGARWAG 360 (534) T ss_pred cccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHHHHhhhhheeecccccccccccc-eeeeeccccccchhHHH Confidence 455666666666 4 89999999999999999999999998876553322111000011111 11222222211111222 Q ss_pred HHHHHHHHHHHHHHhhc--cCC-----CCCCEEEEChHHHHHHhcchhhhhhhccc-----ccccccc-ceEEE-eceeE Q lcl|Aclame:pro 180 KAILKGLTLARARLTKN--YVP-----AGDRRFYCAPEDYSAILSALMPNAANYAA-----LIDPETG-NIRNV-MGFEV 245 (347) Q Consensus 180 ~~i~~~l~~a~~~Lde~--~VP-----~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~-----~~~~~~G-~v~~i-~G~~V 245 (347) +.+..+..++++. .+- -.+-|+|.+|+..+.|-....+.-..+.+ ..+.... ..|.+ .|++| T Consensus 361 ----e~~~~L~~~i~~~an~i~~~T~rg~~n~~v~S~~Va~~L~~~g~l~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~v 436 (534) T protein:vir:10 361 ----ESYKALVVQIDKEANEIARQTGRGQGNFIICSRNVAAALGHTDMLMTPAVMGANTTMNTDTTSSLFAGVLAGKYRV 436 (534) T ss_pred ----HHHHHHHHHHHHHHHHHHHhhccccccEEEEchhHHHHHhhccchhccccccccccccccCCCceEEEEecCceEE Confidence 2333333333332 111 13569999999999997665443221111 1111111 35565 48999 Q ss_pred EEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeeccccchhhHhhHHhhh Q lcl|Aclame:pro 246 IEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARRPEFQADQIIGK 325 (347) Q Consensus 246 ~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~~~~~~~d~i~~~ 325 (347) |.-++.|..-.. -.|+++..-..+|||.|=. ++. +...+||+.|.-.|-.+ T Consensus 437 y~D~y~~~dy~~----------------------vG~KG~~~~~~glfyaPYv----~l~---~~~~~dp~sfqP~~g~~ 487 (534) T protein:vir:10 437 YIDQYAVEDYFT----------------------VGYKGASEMDAGLYYCPYV----ALT---PLRGTDPKNFQPVLGFK 487 (534) T ss_pred EecCCCCcceEE----------------------EEEeCCcccccceeecccc----ccc---cccccCCccccceeeee Confidence 998877642110 0122333333468888863 333 33467999988888777 Q ss_pred hhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 326 YAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 326 ~~~G~~~lRPe~~~~l~~~~aa 347 (347) .+||-.+ .| .+......+.+ T Consensus 488 tRY~l~~-NP-~~~~~~~~~~~ 507 (534) T protein:vir:10 488 TRYGVKL-HP-MADATQNKGFA 507 (534) T ss_pred eeeceee-cC-cccccCCcccc Confidence 7777543 33 22222222211 No 201 >protein:vir:95131 Length: 325 # NCBI annotation: hypothetical protein ORF010 # Family: family:all:47 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293417;genbank:gi:148912838;genbank:GeneID:5228206 Probab=73.70 E-value=0.17 Score=24.84 Aligned_cols=275 Identities=12% Similarity=0.036 Sum_probs=120.0 Q ss_pred CccccHHHHHHHHHhHHHHHHHHHH-----Hhhh----c-ccccccccCCceEEEecccccee-----eeecCCCCCCCC Q lcl|Aclame:pro 18 QSAADKLALFLKVFGGEVLTAFVRR-----SVTM----D-KHMVRTIQNGKSASFPVMGRTKG-----YYLAPGENLDDK 82 (347) Q Consensus 18 ~~~~d~~al~ie~f~geV~~~f~~~-----s~~~----~-~~~~rti~~G~tv~i~~iG~~t~-----~~~~~g~~~~~~ 82 (347) .+-+|. ++|...+.+++-+. .+|- + .+.......|+-+..|..-...- ..+.....+ + T Consensus 1 m~lsD~-----~vfN~~~~~a~~e~~~q~~~~fn~as~gai~l~~~~~~Gd~~~~pf~~~l~g~~~~~~~~~~~~~v--t 73 (325) T protein:vir:95 1 MALSDL-----AVYSEYAYSAFSETLRQQVDLFNTATGGAIMLQSAAHQGDFSDVAFFAKVTGGLVRRRNAYGSGTV--A 73 (325) T ss_pred Cchhhh-----hhhhhhhhhhhhhhhhhhHhhhhhcccceeEeccccccCceeeccccccccccccccccCCCCcee--c Confidence 222231 23544444444332 1111 1 11111222477776665443211 222211111 2 Q ss_pred CCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccCcccCce Q lcl|Aclame:pro 83 RKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIAGLGQAV 162 (347) Q Consensus 83 ~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~g~~~~~ 162 (347) +..+...+..-++ -..-..+...|+.....-.|-+++++++.|..+++...+.++..+.++...+-... ... T Consensus 74 ~~kitt~~~~av~-~~r~~g~~~~d~~~~~~g~~~~~~~~~~Ig~~~a~~~~~~~l~~~~~~l~~a~~~~-------~~~ 145 (325) T protein:vir:95 74 EKVLKHLVDTSVK-VAAGTPPVRLDPGQFRWIQQNPEVAGAAMGQQLAVDTMADMLNVGLGSVYSALSQV-------SDV 145 (325) T ss_pred cceeccccceeeE-EecccCcccccHHHHhhcCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccc-------ccc Confidence 2334433322222 11122233355555555677899999999999999887777766655432211111 111 Q ss_pred eeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhc-cccccccccceEEEe Q lcl|Aclame:pro 163 VLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANY-AALIDPETGNIRNVM 241 (347) Q Consensus 163 ~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~-~~~~~~~~G~v~~i~ 241 (347) +....+..+..+.-.. ...|.+|..+|.++. ..=..+++.+..|..|.+. .+++... ...+.. ..|.... T Consensus 146 v~dis~~~~~~~~~~s----~~~l~~A~~klGD~~--~~l~~~~MHS~v~~~L~~~-~L~~~~~~~~~~g~--~~i~t~~ 216 (325) T protein:vir:95 146 VYDATANTDAADKLPT----WNNLNNGQAKFGDQS--SQIAAWIMHSTPMHKLYGS-NLTNGERLFTYGTV--NVVRDPF 216 (325) T ss_pred eeeeecccCccccccc----HHHHHHHHHHhcccc--cceeEEEEchHHHHHHHHh-hccccccccccCCc--ccccccC Confidence 1222222221111111 345778888887754 1114668899999999874 3443211 111111 1345678 Q ss_pred ceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeecccc--chhhHh Q lcl|Aclame:pro 242 GFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERAR--RPEFQA 319 (347) Q Consensus 242 G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~--~~~~~~ 319 (347) |-+|+.+-.+|....+. +++|. -+.|-+-|++..+..++...... ..++.+ T Consensus 217 G~~VIVdD~~p~~~~g~--------------------~~~yt-------ty~lg~GAi~~~~~~~~~~~~~~~~~~~~~~ 269 (325) T protein:vir:95 217 GKLLVMTDSPNLFAAGT--------------------PNVYH-------ILGLVPGGVLIGQNNDFDANEETKNGDENII 269 (325) T ss_pred CcEEEEeCCCCCCCccC--------------------ceeEE-------EEEEecCeEEecCCCCccccccccCccccee Confidence 99999999998754321 11221 13333444444444333222211 111222 Q ss_pred hHHhhh-----hhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 320 DQIIGK-----YAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 320 d~i~~~-----~~~G~~~lRPe~~~~l~~~~aa 347 (347) .-++.. +.+|.+- +.... -..|+- T Consensus 270 ~~~~~~~tf~lhp~G~sw---~~s~~-g~sPt~ 298 (325) T protein:vir:95 270 RTYQAEWSYNIGVKGFAW---DKANG-GKSPTD 298 (325) T ss_pred eeeeeeeeEEeecceeee---ecccc-cCCcCh Confidence 222222 2333322 21110 012222 No 202 >protein:vir:107732 Length: 379 # NCBI annotation: gp23 # Family: family:all:1653 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024871;genbank:gi:48697513;genbank:GeneID:2948349 Probab=72.20 E-value=0.19 Score=24.59 Aligned_cols=303 Identities=11% Similarity=-0.014 Sum_probs=121.5 Q ss_pred CC--CCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccC--CceEEEec---cccceeeee Q lcl|Aclame:pro 1 MA--NATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQN--GKSASFPV---MGRTKGYYL 73 (347) Q Consensus 1 m~--~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~--G~tv~i~~---iG~~t~~~~ 73 (347) |- |..+....++-......+|=+. |+.-|.-.+.+..-.--+...++.+.+.=. -+++.|+. .|+.+ -| T Consensus 56 md~~~~~~~~~~~~~l~~~~~~g~~~--~l~~~~p~~i~~~tap~~a~~l~pv~t~g~W~~~~~~~~v~e~~G~A~--~y 131 (379) T protein:vir:10 56 MDSNDIGPIPTPLSPLSPVSIPGLIQ--FLQNWLPGHVRILTAVREADEFLGLSTVGQWDDEQIVQRVLEGLGTAQ--PY 131 (379) T ss_pred hccccccccccccCccccccccchHH--HHHhhcchHHHHHhhhhhhhhhcccccCCCceeeeEEEeeeeeeeeeE--Ee Confidence 44 2222222111111111222233 888886444433333344556666555211 24555554 45554 34 Q ss_pred cCCCCCCCCCCCCCCCceEEEEeeeeecchhhccHHH---HHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcc-c Q lcl|Aclame:pro 74 APGENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIED---AMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLP-A 149 (347) Q Consensus 74 ~~g~~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~---~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a-~ 149 (347) ..+++.+...-+.+-..+.+..=+ ..+.+.+++. .++..|+-.+-.+.+..+|.+..|+..+- +..-+ . T Consensus 132 gd~~d~pl~d~~~~~~~r~v~~~~---~g~~yg~~El~~Aa~~g~~l~~~Ka~aA~~ale~~~N~i~f~----G~~d~~~ 204 (379) T protein:vir:10 132 TDGGNMALMSWTPTFETRTVVRFE---AGLQVAPLEEARSSRVQVSSADEKRAMVGEALEVQRNRVAFY----GYNDGSG 204 (379) T ss_pred ccccCCCeeeeeeeeeeeeeEEEE---EEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEE----eecCCCc Confidence 444443221111222222222211 1222334332 34567777777777777777777764432 10000 0 Q ss_pred ccccccC--cccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhc---c-CCCCCC-EEEEChHHHHHHhcchhhh Q lcl|Aclame:pro 150 ASNENIA--GLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKN---Y-VPAGDR-RFYCAPEDYSAILSALMPN 222 (347) Q Consensus 150 ~~~~~~~--g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~---~-VP~~gR-~~vv~P~~~~~Ll~~~~~~ 222 (347) ...+..+ +.+.......++.+..+=..+..+.|++.|..+...|-.+ . .|.+-+ .++|+|.++..|-.-.++ T Consensus 205 ~~yGllNdP~l~a~~t~atg~~~~t~Wa~kT~~eI~~Di~~~~~~l~~qs~g~~~~~~~~~tL~LP~~~~~~L~~~n~~- 283 (379) T protein:vir:10 205 RTFGFLNDPNLPAYVAVPNGAGGSPLWAQKTTLEIIADLRNGLTALQVQSMGRIKSNKTPITIGIPNAYENYITTPTEL- 283 (379) T ss_pred ceEEEEeCCCCcccccccCCcccccccccCCHHHHHHHHHHHHHHHHHhhCCeecccccceeEEecHHHHHhhcccccc- Confidence 0001111 1111111222222222223456677888888877665544 2 254444 688999999999753211 Q ss_pred hhhccccccccccceEEEeceeEEEeccccccccccccccCccccccccccccccc--cccccccccceeEEeechhhhh Q lcl|Aclame:pro 223 AANYAALIDPETGNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATA--TGDDRVAQNNVVGLFNHRSAVG 300 (347) Q Consensus 223 ~~~~~~~~~~~~G~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~--~~~y~~d~~~~~~l~~h~~A~~ 300 (347) +.+-..-+. .+..+++|...+.|-.. +. ++.....-. ...-..+-..++-..+ |+-. T Consensus 284 --g~Tvl~~lk----~n~Pnl~i~t~pEL~~a------------gg-g~~~~~~~~~~~~~~~t~~~~~~~~~~-p~k~- 342 (379) T protein:vir:10 284 --GYSVAQYMR----ESYPNVTFVSAPELNDA------------NG-GSSAIYYYADAVENNGTDDGRTWLQVV-PTKM- 342 (379) T ss_pred --CccHHHHHH----HhcCCcEEEEccccccc------------CC-CccEEEEEeeccCCCccCCcceEEEec-chhh- Confidence 110000011 12345677776666210 00 000000000 0000000000111111 1110 Q ss_pred hhhhhheeeccccchhhHhhHHhhhhhh-cCcccccceEEEEEec Q lcl|Aclame:pro 301 TVKLKDMALERARRPEFQADQIIGKYAM-GHGGLRPEAAGALVFT 344 (347) Q Consensus 301 tv~~~~~~~e~~~~~~~~~d~i~~~~~~-G~~~lRPe~~~~l~~~ 344 (347) ..+ .. .++...+.+....+. |.-+.||-+++-+.=+ T Consensus 343 ----~~l--~v--e~~~~~~~~~~~~rt~Gv~ir~P~Ai~~~~G~ 379 (379) T protein:vir:10 343 ----FTL--GV--EKKIKGYAEGYTNATAGAMLKRPFATYRQTGA 379 (379) T ss_pred ----hhc--cc--eecCceeEeccccceeeeeeecchhhheecCC Confidence 000 00 111223334444444 5677799887766544 No 203 >protein:vir:8324 Length: 410 # NCBI annotation: gp41 # Family: family:all:30827 # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817892;genbank:gi:29566325;genbank:GeneID:1259520 Probab=69.11 E-value=0.23 Score=24.11 Aligned_cols=268 Identities=15% Similarity=0.121 Sum_probs=109.9 Q ss_pred CCCCccCccccccCc-------cc---------------------CccccHHHHHHHHHhHHHHHHHHHHHhhhcccccc Q lcl|Aclame:pro 1 MANATGGQQIGANQG-------KG---------------------QSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVR 52 (347) Q Consensus 1 m~~~~~~~~~~~~~~-------~~---------------------~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~r 52 (347) |...-..+.+-.|.- |+ ...+|...-.-..|-+.+.+-....-...++...= T Consensus 89 ~r~~p~~~~veyRSaGE~lkal~~~~~Gd~~A~~~~e~~r~a~~~~~Tgd~~~~i~~~~v~d~i~li~q~r~i~slf~tL 168 (410) T protein:vir:83 89 MRGSPVGTEVEYRSAGEYMLDMWNSAQGNASAADRLEVYARAADHQKTGDLQGVIPDPIVGPVIDFIDSARPLVSTLGTL 168 (410) T ss_pred CcCCCCCCCcccccHHHHHHHHhccCCchHHHHHHHHHHHHhhccCcccccccccchhHhhhHHHHHhhccchhhhhhhC Confidence 222110000001110 00 11122210011224444444333332222222111 Q ss_pred cccCCceEEEecccc-ceeeee-------cCCCCCCCCCCCCCCCceEEEEeeee----ecchhhccHHHHHhCcchHHH Q lcl|Aclame:pro 53 TIQNGKSASFPVMGR-TKGYYL-------APGENLDDKRKDIKHSEKVIQIDGLL----TSDVLIYDIEDAMNHYDVRAE 120 (347) Q Consensus 53 ti~~G~tv~i~~iG~-~t~~~~-------~~g~~~~~~~~~~~~~~~~l~ID~~~----~~~~~Vdd~D~~q~~~D~r~~ 120 (347) .. .|.|..-+..-. +++..+ +.|..++ ...+..+..+-.|+++= .++..|+. ++....+- T Consensus 169 P~-~g~T~eY~v~t~~~tV~~q~~~~kqa~EGd~L~--~gKl~~~t~tA~ikTyGGyt~LSRQ~IER-----s~v~~L~~ 240 (410) T protein:vir:83 169 PL-NNATFYRPIVSQRPAVGLQGVAGGASDEKTELD--SQKMVIDRLTVNAKTLGGYVNVSRQAIDF-----SSPSALDL 240 (410) T ss_pred CC-CCCeeEEeeeccccccccccccccccccccccc--ccceeeeeccceeehhcCcccccceeeec-----CChhhHHH Confidence 11 255555543311 122111 1233333 23344445555666542 12222332 34444444 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccCcccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhc--cC Q lcl|Aclame:pro 121 YSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKN--YV 198 (347) Q Consensus 121 ~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~--~V 198 (347) ..+.++.+-|+......=..|. ..+ ++ ..+.. ...++.|...|.++....+.+ ++ T Consensus 241 ~lraL~~AYA~atea~vra~L~------~t~----t~---------~~a~~----~~Tad~~~~~i~da~~~v~da~~~~ 297 (410) T protein:vir:83 241 VVNGLGQQYAIETEALVGAALA------STS----TG---------AVGYG----NATADNVASAIWQAAGAVYTAVKGM 297 (410) T ss_pred HHHHHHHHHHHHHHHHHHHHHH------Hhh----hh---------hhhhh----hccHHHHHHHHHHHHHHHhhhhccc Confidence 4555555555554433322221 000 10 00111 113345666777788888876 54 Q ss_pred CCCCCEEEEChHHHHHHhcchhhhhhhc---cc--cccccccceEEEeceeEEEeccccccccccccccCcccccccccc Q lcl|Aclame:pro 199 PAGDRRFYCAPEDYSAILSALMPNAANY---AA--LIDPETGNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHI 273 (347) Q Consensus 199 P~~gR~~vv~P~~~~~Ll~~~~~~~~~~---~~--~~~~~~G~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~ 273 (347) .=+++.|+|+.+..+.+--...+.+. .| .+.+-.|.-|.+.|++|...+.+|.+. T Consensus 298 --~~~~i~vS~DVl~~~~~~f~~~~~~~~dt~Gfg~~~lg~gi~G~~~~ipVvm~~~a~AgT------------------ 357 (410) T protein:vir:83 298 --GRLVIAIAPDVLGDFGPLFAPVNPTNAHSTGFEAGRFGQGVMGSISGIPVVMSAALGSGD------------------ 357 (410) T ss_pred --eeeeEEechhhhhhccceeeccCCCCcccccccccccccchhhhhcccceEEecCCCcCe------------------ Confidence 33788999999876665333233332 11 111224556789999999999887421 Q ss_pred ccccccccccccccceeEEeechhhh-------hhhhhhheeeccccchhhHhhHHhhhhhhcCcccccceEEEEEec Q lcl|Aclame:pro 274 FPATATGDDRVAQNNVVGLFNHRSAV-------GTVKLKDMALERARRPEFQADQIIGKYAMGHGGLRPEAAGALVFT 344 (347) Q Consensus 274 ~~a~~~~~y~~d~~~~~~l~~h~~A~-------~tv~~~~~~~e~~~~~~~~~d~i~~~~~~G~~~lRPe~~~~l~~~ 344 (347) +.||.|.|+ |.+.+++-. .+--.+-|+ |.+ +..+.-|++++=+.-. T Consensus 358 -----------------A~f~~~~Ai~~~eS~~gp~qL~d~~--i~nLt~~yS----gY~--a~a~~~~~gliPv~g~ 410 (410) T protein:vir:83 358 -----------------AYLFSTAAIECFEQRVGTLQVVEPS--VFGLQVAYA----GYF--STLVVNEDAIVPLVGS 410 (410) T ss_pred -----------------eeEeccceeeeeecCCceeEeeCCc--hhhhhhhhe----eee--eeccccccceeeeccC Confidence 133444443 334333321 111111111 222 3345566666666544 No 204 >protein:vir:94933 Length: 330 # NCBI annotation: putative phage structural protein # Family: family:all:1120 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239278;genbank:gi:66392060;genbank:GeneID:5076578 Probab=67.45 E-value=0.25 Score=23.87 Aligned_cols=284 Identities=13% Similarity=0.089 Sum_probs=124.6 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEeccccce-eeeecCCCCC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVMGRTK-GYYLAPGENL 79 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~iG~~t-~~~~~~g~~~ 79 (347) |+..+. .+.-.|.-......|.+.|.+.|-++...+...+. |++.+.++.-... +.-+.-++.+ T Consensus 25 m~alTL--------------aea~~l~~d~~~~~VIE~l~~~s~iL~~lpf~~ve-~~~~~~~r~~~lp~a~~r~~n~~~ 89 (330) T protein:vir:94 25 MPTVTL--------------AESAKLSQDHLVSGLIETIVEVNPLYEMMPFTEIE-GNALAYNRENVLGDVQFLAVGGTI 89 (330) T ss_pred hhhhhh--------------hHHhhcCchhhHHHHHHhhhccchHHhhccccccc-CCcceeeeeecCCcceeeeccccc Confidence 433222 11111223455678888888877565555544444 4456666544321 1112223332 Q ss_pred CCCCCCCCCCceEEEEeeeeecchhhccHHHHHh-----CcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccc Q lcl|Aclame:pro 80 DDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMN-----HYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNEN 154 (347) Q Consensus 80 ~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~-----~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~ 154 (347) +.. .+.+..+++. + .....- +-++|+.-+ -+|.|.+-.+...++|++++...++. +- +.+.. T Consensus 90 ~~~-~~~Tf~q~t~--~-l~~l~~-~~~Vd~~iadl~g~~~d~~~~q~~~~ieal~~~~e~~lin----GD---s~~~~- 156 (330) T protein:vir:94 90 TAK-NPATFTKVTS--E-LTTLIG-DAEVNGLIQATRSDFMDQTSVQVASKAKSIGRQYQASMIT----GD---GTGNS- 156 (330) T ss_pred ccc-Ccceeeeeee--c-hhhhhh-hHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHHHHhhc----cC---CCCcc- Confidence 211 0111122222 2 222222 235555442 24677788888888998887664442 10 01111 Q ss_pred cCcc----cCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhccc-- Q lcl|Aclame:pro 155 IAGL----GQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAA-- 228 (347) Q Consensus 155 ~~g~----~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~-- 228 (347) ..|. .....+..++.+..-+++. .|.|+++.-+ -|-+.-+++++..+...+-+-.|-.. .|.- T Consensus 157 F~GL~~~~~~~q~i~tg~~gg~~T~d~-----LDeLl~~v~~-----~~g~~~~~l~n~a~~r~I~a~~R~~~-~~~v~~ 225 (330) T protein:vir:94 157 FQGMMGLVAASQTISAGANGGTLTFEL-----LDQLLDLVKD-----KDGQVDYLMSSFAMRRKYFSLLRALG-GAAIGE 225 (330) T ss_pred ccchhhcCCcccEEecCCCCCCCCHHH-----HHHHHHHhcC-----CCCCCcEEEechhHHHHHHHHHHhcc-CCCCCC Confidence 1111 2233444443333333321 3333322211 12233588888888777766444221 1111 Q ss_pred ccccccc-ceEEEeceeEEEeccccccccccccccCcccccccccccccccccccccccc------ceeEEeechhhhhh Q lcl|Aclame:pro 229 LIDPETG-NIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQN------NVVGLFNHRSAVGT 301 (347) Q Consensus 229 ~~~~~~G-~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~------~~~~l~~h~~A~~t 301 (347) ......| .|-.+.|++|+.++-+|..... .++ ++.++=|...|- ..+||-..-. T Consensus 226 ~~~~~~G~~v~~~~GvPi~~~d~ip~~~~~---------~~~------~~ttsIyav~~G~~~~~qgV~Gl~~~g~---- 286 (330) T protein:vir:94 226 VMTLPSGRQIPTYRGVPWFVNDFIPSNMTQ---------GTA------TNATAIFAGTFDDGSNKYGIAGLTARGS---- 286 (330) T ss_pred cccccCCCEEeeeCCeEEEecccccCCCCc---------ccC------CCceeEEEEeecccccccceEeecCCCC---- Confidence 1112245 5678889999999999864210 000 011111222211 2334322111 Q ss_pred hhhhheeecccc--chhh-HhhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 302 VKLKDMALERAR--RPEF-QADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 302 v~~~~~~~e~~~--~~~~-~~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) -.++++-.. +.+- ..+.| ...+|..++.|++++.|.-=.-- T Consensus 287 ---~glsVr~~G~~~~k~v~~~~v--~~y~~~av~~~~a~~~L~~V~~g 330 (330) T protein:vir:94 287 ---AGLRVQNVGAKENADETITRV--KMYCGFANFSQLGLAAIKGLIPG 330 (330) T ss_pred ---CcceeeeCCCccccceeeEEE--EEeeeeEEechhheeeeccccCC Confidence 112222211 0000 11222 23468889999999888732222 No 205 >protein:vir:10324 Length: 320 # NCBI annotation: ORF26 # Family: family:all:570 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758919;genbank:gi:27311193;genbank:GeneID:956155 Probab=59.15 E-value=0.4 Score=22.78 Aligned_cols=295 Identities=13% Similarity=0.044 Sum_probs=89.7 Q ss_pred ccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEeccccceeeeecCCCCCCCCCCCCCCC Q lcl|Aclame:pro 10 IGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVMGRTKGYYLAPGENLDDKRKDIKHS 89 (347) Q Consensus 10 ~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~iG~~t~~~~~~g~~~~~~~~~~~~~ 89 (347) ++..|++- +....||- ....+....+.+. .++|+---+|.+-+... |+. .... T Consensus 1 i~~~P~~~---g~~~glff-----------~~~~v~T~~V~ie-~~~~~l~lip~v~rg~~-----g~~-------~~~~ 53 (320) T protein:vir:10 1 MNLLPVNY---GDSRALFA-----------REKKVRTRTILVE-EKNGVLTLIQSREPGST-----ENV-------AKRG 53 (320) T ss_pred CCcCCchh---hhhhhhcc-----------CCCCcccceEEEE-EecCceeeeeccCCCCC-----cee-------ecCC Confidence 44445431 11111211 1112222222222 12233222332222111 110 1111 Q ss_pred ceEEEEeeeeecchh-hccHHHHH-----------hCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccc-- Q lcl|Aclame:pro 90 EKVIQIDGLLTSDVL-IYDIEDAM-----------NHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENI-- 155 (347) Q Consensus 90 ~~~l~ID~~~~~~~~-Vdd~D~~q-----------~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~-- 155 (347) +..+..-.-.|+... .=.-|+.| .--+++.....++...+.....-+.+..| ++.-..+. ...+ T Consensus 54 ~~~~~~f~~p~~~~~d~i~a~eiq~~Ra~G~~~~~~~~~~v~~~l~~lr~~~~~T~E~m~~~AL-~G~ildad-Gtv~~d 131 (320) T protein:vir:10 54 KRKVRSFVIPHLPLEDVILPDEYEGLRGFGTTALAAKSELVKERXETMKSSHDITHEHLRMGAK-KGQILDAD-GTVLYD 131 (320) T ss_pred cceEEEEecceeccCCccCHHHHcCcccCCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh-cCeEEcCC-CcEEEe Confidence 111111111111000 00111111 11122222222222222222211111111 11100000 0000 Q ss_pred ----CcccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhh--hcccc Q lcl|Aclame:pro 156 ----AGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAA--NYAAL 229 (347) Q Consensus 156 ----~g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~--~~~~~ 229 (347) .|... ..+...-.+..+ +. .+.+.+.+..+...|. ..|..+-+++++|++|..|+.|+++-.. .+... T Consensus 132 ~y~~fGi~~-~~i~~~l~~a~~--dv-~~~~~~~~~~i~~~l~--g~~~t~v~al~g~~f~~al~~h~~Vke~y~~~~~~ 205 (320) T protein:vir:10 132 LYAEFGITK-KTIYFGLDNKDA--NV-AESCRQVLRHVEDNLR--GDVMKDVSVDVSEEFFDKFIKHASVKEVFLNHEAA 205 (320) T ss_pred chhhhCCcc-ceeEEecCCCCc--cH-HHHHHHHHHHHHHHhc--cCCCCceEEEEChHHHHHHhcCHHHHHHHHhhhhh Confidence 01110 111111111111 11 1222333444444443 4566677889999999999999876432 11111 Q ss_pred cc-ccc--cceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhh Q lcl|Aclame:pro 230 ID-PET--GNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKD 306 (347) Q Consensus 230 ~~-~~~--G~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~ 306 (347) .. +.. .+-..+.|+.+++...--....+.... .-+.+..+.++.+..+.++- ..|.+=.-+++.+. .++ T Consensus 206 ~~~l~~~~~~~f~~gGi~~~~Y~g~~~d~~g~~~~---~I~~~~~~~~p~g~~~~f~~----~~apad~~e~vnt~-g~p 277 (320) T protein:vir:10 206 VNRLGGDTRKGFKFGGLIFNENRARHVDEEGKETR---FIKAGKGHAFPTGTTNTFFT----ALAPADFNETAGTL-GKR 277 (320) T ss_pred hhhccccccceEEecCEEEEEcccEEEcCCCCeeE---eecCCeeEEEEecCchhhee----eecccCcHhhcCCc-ccc Confidence 11 111 122367788888854311001111000 01111222333333222110 00000000111111 111 Q ss_pred eeeccccchhhHhhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 307 MALERARRPEFQADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 307 ~~~e~~~~~~~~~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) +=...+.++...+..+..-...=.-+.||++++-++.+++- T Consensus 278 ~y~k~~~~~~~~g~~l~~qS~PLpi~~rP~~lv~~~~~a~~ 318 (320) T protein:vir:10 278 YYAKMEPRRMGRGFDLHSQSNVLPMCCRPGVLVELDAAAQP 318 (320) T ss_pred cccccccccCCCeEEEEeeecccccccCcceEEEEEecCCC Confidence 10111111111111122211112456699999988765554 No 206 >protein:vir:103886 Length: 302 # NCBI annotation: putative major head subunit protein # Family: family:all:776 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938242;genbank:gi:38229147;genbank:GeneID:2648201 Probab=58.73 E-value=0.41 Score=22.72 Aligned_cols=281 Identities=14% Similarity=0.029 Sum_probs=109.6 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHH-HhhhcccccccccCCceEEEeccccce-eeeecCCCC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRR-SVTMDKHMVRTIQNGKSASFPVMGRTK-GYYLAPGEN 78 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~-s~~~~~~~~rti~~G~tv~i~~iG~~t-~~~~~~g~~ 78 (347) |- . | .....+|+. -|...+.++|+.. +-.+.+.+ +.-+..++-+-..+|... +... .|+- T Consensus 1 m~---i-----t-------~~~l~~l~~-~~~~~~~~~y~~a~~~~~~~a~-~~~sdf~~~~~~~lg~~p~l~e~-~Ge~ 62 (302) T protein:vir:10 1 ML---I-----N-------KQSLNAAFV-AIKTIFNNAFAAAPTTWQKIAM-EVPSNTSSNDYKWLSTFPKMRRW-IGAK 62 (302) T ss_pred Cc---c-----c-------HHHHHHHHH-HHHHHHHHHHHhhhhhhhceee-ecCCCcceeeceecCCCCCcccc-ccce Confidence 22 1 1 011223333 4455556666654 23333332 222344555555555432 1111 1222 Q ss_pred CCCCCCCCCCCceEEEEeeeeecchhhccHHHHHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcc--cccccccC Q lcl|Aclame:pro 79 LDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLP--AASNENIA 156 (347) Q Consensus 79 ~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a--~~~~~~~~ 156 (347) . ...+...+-+|.+.++- --+.|..-+-.==++..-..+.+++|++-++..|+.++..|..+.... .-.+=+.+ T Consensus 63 ~---~~~l~~~~~~i~~~~~g-~~v~i~R~~i~nDdlg~~~~~~~~~G~aaa~~~~~lv~~~L~~g~~~~~~DG~~fF~~ 138 (302) T protein:vir:10 63 V---VKNLKAYKYVVENEDFE-ATVEVDRNDIEDDQIGIYSPQAKMAGYSAAQLPDELVYEAVNGAFTKPCFDGQYFIDT 138 (302) T ss_pred e---eccccccceeEEeeccc-ceecccHHhhcccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCCcccCCcceecc Confidence 1 22355555566666541 222232211111124677888999999999999999998875432210 00111111 Q ss_pred cccCcee--eeecccccccchhhHHHHHHHHHHHHHHHH-hhccCC--CCCCEEEEChHHHH---HHhcchhhhhhhccc Q lcl|Aclame:pro 157 GLGQAVV--LNIGAAADLVDVEARGKAILKGLTLARARL-TKNYVP--AGDRRFYCAPEDYS---AILSALMPNAANYAA 228 (347) Q Consensus 157 g~~~~~~--i~~~~~~~~~~~~~~~~~i~~~l~~a~~~L-de~~VP--~~gR~~vv~P~~~~---~Ll~~~~~~~~~~~~ 228 (347) .|..+.. -+++.+.-...........+++.+.++.++ +...-| -..+++||+|.... .|+.+.+.. .+ T Consensus 139 dH~~g~~~~~N~g~~~~~~~~~~l~~~~~~aa~~am~~~k~~~G~~L~i~P~~LiVp~~le~~A~~ll~~~~~~----~g 214 (302) T protein:vir:10 139 DHPVGDASVSNKGTAPLSNASQAAAKAGYGAARTAMKKFKDEEGRSLNVSPNVLLVGPALEDVAKMLLTNPKLA----DN 214 (302) T ss_pred cccccccccccccchhhhhcccccchHHHHHHHHHHHHHhhhcccccccCCCEEEecchhHHHHHHHhhccccC----CC Confidence 1211110 011110000000011111233333333222 222222 23478899886544 344443321 12 Q ss_pred cccccccceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhh---hhh Q lcl|Aclame:pro 229 LIDPETGNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTV---KLK 305 (347) Q Consensus 229 ~~~~~~G~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv---~~~ 305 (347) ..+...|. ++++.++.|... .... |+..|+.+=++ ..+ T Consensus 215 ~~Np~~g~------~~~vv~p~L~s~---~aWy------------------------------L~a~~~~i~~~~l~g~~ 255 (302) T protein:vir:10 215 TPNPYVGT------AELVVDGRIESD---TAWF------------------------------LLDTTKPVKPFIFQPRK 255 (302) T ss_pred Ccceeccc------eEEEEeeccCCC---CceE------------------------------EEecCCccceEEEcCcc Confidence 22333333 678888887421 1111 11111111000 111 Q ss_pred heeeccccchhhHhhHHhhhhhhcC------cccccceEEEEEecCCC Q lcl|Aclame:pro 306 DMALERARRPEFQADQIIGKYAMGH------GGLRPEAAGALVFTPAA 347 (347) Q Consensus 306 ~~~~e~~~~~~~~~d~i~~~~~~G~------~~lRPe~~~~l~~~~aa 347 (347) .+.++..-++..-+=.++-.+.||+ +..-|..+-.=. .++| T Consensus 256 ~P~~~~~~~~~~dgv~~k~~~d~Gvd~R~~~G~~~wq~a~~s~-g~~~ 302 (302) T protein:vir:10 256 QPEFVSQVNLDSDDVFNLRKLKFGAEARAAAGYGFWQLAYGST-GTGA 302 (302) T ss_pred ccEEEeccCCCCCceEEEEEEEEeeeeeeecchhhhhhhhccC-ccCC Confidence 2223332233323333444555553 233333332222 2222 No 207 >protein:vir:80986 Length: 528 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469506;genbank:gi:157311463;genbank:GeneID:5602119 Probab=58.65 E-value=0.41 Score=22.71 Aligned_cols=306 Identities=14% Similarity=0.080 Sum_probs=132.5 Q ss_pred CCCCccCccccccCcccCcc------------ccHHHHHHHHHh------HHHHHHHHHHHhhhcccccccccCCceEEE Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSA------------ADKLALFLKVFG------GEVLTAFVRRSVTMDKHMVRTIQNGKSASF 62 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~------------~d~~al~ie~f~------geV~~~f~~~s~~~~~~~~rti~~G~tv~i 62 (347) |+.+++.=.. -|.-+++.. .++++.|-+.-. ++-.+.|.. ...-.+...|+.++. T Consensus 116 MTgPTGLIFA-MRsrY~~~~~~~~~~ea~~~~~~~da~fS~~~t~~~a~~~ea~t~fs~------~~~~~~~~~G~~~~~ 188 (528) T protein:vir:80 116 MSTPTSQIFA-IRSVYGPNPLASQAKEAFHPMYAPDAFHSSLAAKGAAVGSPTGTPFAK------LAIGTQIEAGDIVHH 188 (528) T ss_pred CCchhhhhee-eeeeecCCcccccccccccccccccccccccccccccccccccccccc------ccccccccccceecc Confidence 5554321110 111111110 011222211110 011111111 000001111111110 Q ss_pred e---------------------------------ccccceeeeecC------CC---CCCCCCCCCCCCceEEEEeeeee Q lcl|Aclame:pro 63 P---------------------------------VMGRTKGYYLAP------GE---NLDDKRKDIKHSEKVIQIDGLLT 100 (347) Q Consensus 63 ~---------------------------------~iG~~t~~~~~~------g~---~~~~~~~~~~~~~~~l~ID~~~~ 100 (347) . .+....+..+.. ++ .+.++ ......|..+.||+... T Consensus 189 t~~~tg~~~~~~~~~~~~~~~~~gt~~~~~~~~~~~~~~~~~~~~~Gm~Ta~AE~le~lg~s-s~~~f~EMaFsIEKvTV 267 (528) T protein:vir:80 189 TFAETGIAYLQNVTAEQVTPTKAGSESEDEVVMKLMEEGKLAEIAFGMATSIAEIQEGFNGS-SNNPWAEMSMRIDKQVV 267 (528) T ss_pred ccccccccccccccccccCccccCCcccccccccccccccccccccccchhhhhhhcccCCC-ccccccceeeEEEEEEE Confidence 0 000000000000 01 01011 11235678888888643 Q ss_pred --------cchhhccHHHHHh-C-cchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccCcc-cCceeeeeccc Q lcl|Aclame:pro 101 --------SDVLIYDIEDAMN-H-YDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIAGL-GQAVVLNIGAA 169 (347) Q Consensus 101 --------~~~~Vdd~D~~q~-~-~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~g~-~~~~~i~~~~~ 169 (347) +...+.-..+.++ | .|.-.|++.=++.++..++.+-|++.|--.++.-. . ....+. ....+..+... T Consensus 268 tAKSRaLKAEYTiELAQDLKAIHGLDAEtELaNILStEImlEINReii~~i~~~a~~~~-~-~~t~~~~~~~G~~dl~~~ 345 (528) T protein:vir:80 268 EAKSRQLKARYSIEVAQDLRAVHGMDADAELNAILANEVLLEINREIVDVINFTAQVGK-T-GMTQTVGSKAGVFDLQDP 345 (528) T ss_pred eeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhheeeeee-e-eeeeccccccceeecccc Confidence 3445665666666 4 88899999999999999999999876532221111 0 000000 00001111111 Q ss_pred ccccc---hhhHHHHHHHHHHHHHHHHhhccCCCCCCEEEEChHHHHHHhcchhhhhhhccc-----cccccc-cceEEE Q lcl|Aclame:pro 170 ADLVD---VEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAA-----LIDPET-GNIRNV 240 (347) Q Consensus 170 ~~~~~---~~~~~~~i~~~l~~a~~~Lde~~VP~~gR~~vv~P~~~~~Ll~~~~~~~~~~~~-----~~~~~~-G~v~~i 240 (347) .+... .....+.++-.|.+.....-.+---..+-|+|.+|+..+.|-......-....+ ..+... =.+|.+ T Consensus 346 ~d~~g~r~~~e~~k~L~~~i~~~an~I~~~T~~~~gn~vi~S~~Va~~L~~~g~~~~~~~~~~~~~~~~d~~~~~~~G~l 425 (528) T protein:vir:80 346 IDTRGARWAGESFKSLIYQIDKEAAEIARQTGRGAGNFVIASRNVVNILASADQGISLAMQGAAKGLNTDTTKAVFAGVL 425 (528) T ss_pred ccccccchhHHHHHHHHHHHHHHHHHHHHhhccccccEEEEchHHHHHHhhccccccccccccccccccCCCCceEEEEe Confidence 11000 011122233233332222222222234579999999999997654322222211 111111 135666 Q ss_pred e-ceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeeccccchhhHh Q lcl|Aclame:pro 241 M-GFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARRPEFQA 319 (347) Q Consensus 241 ~-G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~~~~~~~ 319 (347) . |++||.-++.|..-.. -.|+++..-..+|+|.|=.- +++.+.+||+.|. T Consensus 426 ~~~~~vy~D~y~~~dy~~----------------------vG~KG~~~~~~glfy~PYv~-------l~~~~~~dp~sfq 476 (528) T protein:vir:80 426 AGKYKVFIDQYARQDYFT----------------------VGYKGDNEMDAGIYYAPYVA-------LTPLRATDPQSFH 476 (528) T ss_pred cCceEEEecCCCCcceEE----------------------EEEeCCcccccceeeccccc-------ceeeEeeCCcccc Confidence 4 8999988877642111 01223333335788888743 3444678999998 Q ss_pred hHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 320 DQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 320 d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) -.|-.+.+||-.+ .| .+.....++.| T Consensus 477 P~~g~~tRY~l~~-NP-~~~~~~~~~~~ 502 (528) T protein:vir:80 477 PVLGFKTRYGIGI-NP-FADSKSQAPSA 502 (528) T ss_pred ceeeeeeeeceee-cC-cccccCCcccc Confidence 8888888887644 55 55555555544 No 208 >protein:vir:94070 Length: 339 # NCBI annotation: putative structural protein # Family: family:all:1653 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453625;genbank:gi:84662661;genbank:GeneID:5142580 Probab=56.58 E-value=0.45 Score=22.47 Aligned_cols=285 Identities=12% Similarity=-0.020 Sum_probs=118.5 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHH-hHHHHH----HHHHHHhhhcccccccccC--CceEEEe---cccccee Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVF-GGEVLT----AFVRRSVTMDKHMVRTIQN--GKSASFP---VMGRTKG 70 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f-~geV~~----~f~~~s~~~~~~~~rti~~--G~tv~i~---~iG~~t~ 70 (347) |=.........|- .. .+|..| ...|+. .-...-..+.++.+.+.-. -+++.++ ..|..+ T Consensus 37 ~d~~~~~~~~~~~--------~~--~~i~a~~~~~i~~~vy~~~~~~~~~~~l~pv~t~g~w~~~t~~y~~~e~~G~a~- 105 (339) T protein:vir:94 37 MDAVNLTPTLQTT--------AN--AGIPAWMTTFVDRRVIDIQLAPMAAAKIFPEVKKGDWTTTYGVFIIAEPVGQVA- 105 (339) T ss_pred ccccccccccccc--------cc--cchhhhhhhhhchhheeecccccchhhhcccccCCCCcccEEEEeeeecccceE- Confidence 1111111110000 11 233322 222221 1112223445555544321 3567665 455554 Q ss_pred eeecCCCCCCCCCCCCCCCceEEEEeeeeecchhhccHHH---HHhCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhc Q lcl|Aclame:pro 71 YYLAPGENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIED---AMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNL 147 (347) Q Consensus 71 ~~~~~g~~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~---~q~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~ 147 (347) -|..+.+.+...-+.+-.++++.+=+. .+.+..++. .++..|+-.+-.+.+..+|.++.|+..+. + T Consensus 106 -~ygd~ad~Pl~~~~v~~~~~~v~~~~~---g~~y~~~E~~~A~~~g~~l~~~Ka~aA~~al~~~~N~i~~~----G--- 174 (339) T protein:vir:94 106 -TYSDWSANGMSKANVNFESRQNYRYQT---WTEYGDLEMATYGEAGIDYVARQEISASLVMAKFANSSYLL----G--- 174 (339) T ss_pred -EcccccCCCcccccceeeEEeEEEEEE---EEeecHHHHHHHHhhCCChHHHHHHHHHHHHHHhhceEEee----e--- Confidence 344455443222234444445444442 233444443 33567777777778888888887774432 1 Q ss_pred ccccccccCcccCceeeeecccccccchhhHHHHHHHHHHHHHHHHhhcc----CCCCCCEEEEChHHHHHHhcchhhhh Q lcl|Aclame:pro 148 PAASNENIAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNY----VPAGDRRFYCAPEDYSAILSALMPNA 223 (347) Q Consensus 148 a~~~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~----VP~~gR~~vv~P~~~~~Ll~~~~~~~ 223 (347) .+. -.+.|.-....+....+....=..+..+.|++.|..+...|.... -|..-..++|+|.+|..|-.-..+ + T Consensus 175 -d~~-~~~~GLlN~P~l~~~v~~s~~Wa~kT~~eI~~Di~~~~~~l~~~s~g~~~~~~~~~L~LP~~~~~~L~~~n~~-~ 251 (339) T protein:vir:94 175 -VAG-IANYGLMNDPSLPAPVAATVNWATAAPEDIANDVVAMVGRLISQSGGLITGQERMVMALAPSALNNVNRTNNF-G 251 (339) T ss_pred -ecc-cceEEEEeCCCccccccCCCCcccCCHHHHHHHHHHHHHHHHHhcCCeeeeccCcEEEecHHHHHhcccCCcC-C Confidence 110 011111111011111111111123456778888888887776653 234445789999999988653211 0 Q ss_pred hhccccccccccceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhh Q lcl|Aclame:pro 224 ANYAALIDPETGNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVK 303 (347) Q Consensus 224 ~~~~~~~~~~~G~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~ 303 (347) .+-..-+.. +..+++|...+.|-. ++......+. .|..+ ..++-+.+. + + T Consensus 252 --~Tvl~~lk~----n~pnl~i~~~~el~~------------a~g~~~~~~~-----~~~~~-~~~~~~~~p-~-----~ 301 (339) T protein:vir:94 252 --LSAGAKIAQ----TYPNIQFVAVPEFDT------------ASGRLVQLWV-----PEVNG-QPTGEVAFA-E-----K 301 (339) T ss_pred --ccHHHHHHH----hcCCcEEEEcccccc------------CCCceEEEEE-----EeccC-CcceEEEcc-h-----h Confidence 100011111 133566666555421 0000000000 00000 011111111 1 1 Q ss_pred hhheeeccccchhhHhhHHhhhhhh-cCcccccceEEEEEec Q lcl|Aclame:pro 304 LKDMALERARRPEFQADQIIGKYAM-GHGGLRPEAAGALVFT 344 (347) Q Consensus 304 ~~~~~~e~~~~~~~~~d~i~~~~~~-G~~~lRPe~~~~l~~~ 344 (347) ...+.+ .++...+.+....+. |.-+.||.+++-+.== T Consensus 302 ~~~lpv----q~~~~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 339 (339) T protein:vir:94 302 LRSHSI----ERYSTTTRQKHSGATFGAVIYQPWAVTQELGV 339 (339) T ss_pred hhcccc----EEcCceEEecceeeeeeEEEEccceeeeeecC Confidence 111111 112234555555664 6678899988665422 No 209 >protein:vir:5670 Length: 514 # NCBI annotation: gp23 # Family: family:all:364 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899609;genbank:gi:34419596;genbank:GeneID:2546039 Probab=50.20 E-value=0.62 Score=21.74 Aligned_cols=305 Identities=15% Similarity=0.066 Sum_probs=129.8 Q ss_pred CCCCccCccccccCcccC--ccccHHHHHHHHHhHHHHHHHHHHHhhh---cc-----------ccc-ccccCCceEEEe Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQ--SAADKLALFLKVFGGEVLTAFVRRSVTM---DK-----------HMV-RTIQNGKSASFP 63 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~--~~~d~~al~ie~f~geV~~~f~~~s~~~---~~-----------~~~-rti~~G~tv~i~ 63 (347) |+.+++.=.. -|.-+++ .+++ -|+|. ..|..+.|--..-.. +. ... -+..+|+..... T Consensus 114 MTgPTGLIFA-MRsrY~~~~~tg~-EAf~~---~nEadt~fSG~~~~~~~~~~~~~~~~~~G~~~~~~~t~~~gd~~~~~ 188 (514) T protein:vir:56 114 MTGPTSQVFT-LRSVYGKDPLTGA-EAFHP---TRQADASFSGQAAASTIADFPTTGAATDGTPYKAEVTTSGGDVSMRY 188 (514) T ss_pred CCchhhhhee-eeeeecCCCcccc-ccccc---ccccCcCcccccccccccccccccccccccccccccccccccccccc Confidence 6655432110 1111111 1111 12221 022222221100000 00 000 011222222111 Q ss_pred c----------cccceeeeec------------------CCCC---CCCCCCCCCCCceEEEEeeeee--------cchh Q lcl|Aclame:pro 64 V----------MGRTKGYYLA------------------PGEN---LDDKRKDIKHSEKVIQIDGLLT--------SDVL 104 (347) Q Consensus 64 ~----------iG~~t~~~~~------------------~g~~---~~~~~~~~~~~~~~l~ID~~~~--------~~~~ 104 (347) . .|..+...+. .++. +.++ ....-.|..+.||+... +... T Consensus 189 ~~~~~~~~~~~~~~~~~t~~~~~~a~~~~y~~~~Gm~Ta~aEal~~lggs-~~~~f~EMaFsIdK~tVtAKSRaLKAEYT 267 (514) T protein:vir:56 189 FLALGAVTLAVAGQMTATEYTDGVAGGLLVEIDAGMATSQAELQENFNGS-SNNEWNEMSFRIDKQVVEAKSRQLKAQYS 267 (514) T ss_pred ccccccccccccccccccccccccccchhhhhhhhhhhhhhhhcccCCCC-cccccceeeeEEEEEEEeeeccceecccc Confidence 1 0000000000 0010 1111 12234577888888643 3455 Q ss_pred hccHHHHHh-C-cchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccCcccCceeeeecccccccchhhHHHHH Q lcl|Aclame:pro 105 IYDIEDAMN-H-YDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIAGLGQAVVLNIGAAADLVDVEARGKAI 182 (347) Q Consensus 105 Vdd~D~~q~-~-~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~i 182 (347) |.-..+.++ | .|.-.|++.=++.++..++.+-|++.|...+...... .+.|.....+....+..+....-. . T Consensus 268 iELAQDLKAVHGLDAEtELsNILSTEImlEINReii~~l~~~atv~~~~--~~~~~~~~G~~d~~~~~d~~~~~~----~ 341 (514) T protein:vir:56 268 IELAQDLRAVHGLDADAELSGILANEVMVELNREIVNLVNSQAQIGKSG--WTQGAGAAGVFDFSDAVDVKGARW----A 341 (514) T ss_pred HHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHHHHHhheeehhcc--cccccccccccccccccccccchH----H Confidence 666666666 4 8999999999999999999999988776554332221 122221111222222222111111 1 Q ss_pred HHHHHHHHHHHhhc-c-C----C-CCCCEEEEChHHHHHHhcchhh--------hhhhccccccccccceEEE-eceeEE Q lcl|Aclame:pro 183 LKGLTLARARLTKN-Y-V----P-AGDRRFYCAPEDYSAILSALMP--------NAANYAALIDPETGNIRNV-MGFEVI 246 (347) Q Consensus 183 ~~~l~~a~~~Lde~-~-V----P-~~gR~~vv~P~~~~~Ll~~~~~--------~~~~~~~~~~~~~G~v~~i-~G~~V~ 246 (347) ++.+..+..++++. + + - -.+.|+|.+|+..+.|-...-+ .+..+..+. ...=..|.+ .|++|| T Consensus 342 ~e~~~~l~~~i~~~an~i~~~T~rg~gn~~i~S~~Va~~L~~sg~l~~~~~~g~~~~~~~~d~-~~~~~aG~l~~~~~vy 420 (514) T protein:vir:56 342 GEAYKALLIQIEKEANEIGRQTGRGNGNFIIASRNVVSALSMTDTLVGPAAQGMQDGSMNTDT-NQTVFAGVLGGRFKVY 420 (514) T ss_pred HHHHHHHHHHHHHHHHHHHhhcccccccEEEEchhHHHHHHhhhhhccccccCcccccccccc-CcceEEEEecCceEEE Confidence 23333333333322 1 1 1 2468999999999998653322 111111111 000012443 589999 Q ss_pred EeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhhhheeeccccchhhHhhHHhhhh Q lcl|Aclame:pro 247 EVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARRPEFQADQIIGKY 326 (347) Q Consensus 247 ~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~~~~~~~d~i~~~~ 326 (347) .-++.|..-.. . .|+++..-..+|||.|=.- +.+ -..+||+.|.-.|-.+. T Consensus 421 ~D~y~~~dy~~----------v------------G~KG~~~~~~glfyaPYv~----l~~---~~~~dp~sfqP~~g~~t 471 (514) T protein:vir:56 421 IDQYAVNDYFT----------V------------GFKGSTEMDAGVFYSPYVP----LTP---LRGSDSKNFQPVIGFKT 471 (514) T ss_pred ecCCCCcceEE----------E------------EEecCcceecceeeccccc----ccc---ccccCCccccceeeeee Confidence 88887642110 0 1222223334688888633 332 24579998888877777 Q ss_pred hhcCcc--cccceEEEEEec---C-CC Q lcl|Aclame:pro 327 AMGHGG--LRPEAAGALVFT---P-AA 347 (347) Q Consensus 327 ~~G~~~--lRPe~~~~l~~~---~-aa 347 (347) +||-.+ .-++.+..+... + +| T Consensus 472 RY~l~~NPy~~~~~~~~~~~~~~~~~a 498 (514) T protein:vir:56 472 RYGVQVNPFADPTASATKVGNGAPVAA 498 (514) T ss_pred eeceeeCCCCCccccccccCCcchhhh Confidence 777543 322222222110 0 01 No 210 >protein:vir:78148 Length: 123 # NCBI annotation: hypothetical protein # Family: family:all:4955 # MgeID: mge:1847 # MgeName: Min1 # Cross-refs: genbank:acc:YP_001294802;genbank:gi:149882823;genbank:GeneID:5309176 Probab=45.85 E-value=0.22 Score=24.19 Aligned_cols=117 Identities=11% Similarity=0.011 Sum_probs=57.7 Q ss_pred EEChHHHHHHhcchhhhhhhccc--c-cccccc-ceEEEeceeEEEeccccccccccccccCcccccccccccccccccc Q lcl|Aclame:pro 206 YCAPEDYSAILSALMPNAANYAA--L-IDPETG-NIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGD 281 (347) Q Consensus 206 vv~P~~~~~Ll~~~~~~~~~~~~--~-~~~~~G-~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~ 281 (347) +|+--+|..++.+.-. ..|.. + ...-.| .--+++|.+++.|+|+|-.. .......-......+++. +-. T Consensus 1 vvsdlqfA~~~g~~v~--~~aLpRE~aNp~ltG~lpV~~~GltWl~tpnlpg~~-a~vlDst~lGgmaDE~l~----~Pg 73 (123) T protein:vir:78 1 MLSGAQFAKLIGILVD--DKALPREQANIVLTGSLPVSAYGLTWVTSRHITGTD-PWLFDVEQLGGMADEKLL----SPE 73 (123) T ss_pred CcchhhHHHHhcchhc--ccccccccCCceEecCcceeeeceeeeecCCCCCCc-cceeehhhhccccccccC----CCc Confidence 6666778888775321 11211 1 122234 44569999999999999321 111111111111111111 111 Q ss_pred ccccccceeEEeechhhhhhhhhhheeeccccchh--hHhhHHhhhhhhcCcccccceEEEEEecCC Q lcl|Aclame:pro 282 DRVAQNNVVGLFNHRSAVGTVKLKDMALERARRPE--FQADQIIGKYAMGHGGLRPEAAGALVFTPA 346 (347) Q Consensus 282 y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~~~~--~~~d~i~~~~~~G~~~lRPe~~~~l~~~~a 346 (347) |.+.- ...+++...|..+ .-.|.|+++-+.=.-++.|.+-+-|.-.-- T Consensus 74 ya~~~-----------------~~Gvevkt~Red~~~nD~yriRaRRvTvpiv~EP~Agv~ltg~g~ 123 (123) T protein:vir:78 74 FAPAG-----------------NTGVEASTERAHQGVKDGYLVRGRRNTVAVVTEPMAGVRLTGTGL 123 (123) T ss_pred ccCCC-----------------CcceeEEeeccccCCCCceEEeeeecceeEEecCccceEEeeecC Confidence 22111 1113444455555 567788888777667777766665553333 No 211 >protein:vir:99424 Length: 360 # NCBI annotation: hypothetical protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919080;genbank:gi:119757038;genbank:GeneID:4606077 Probab=40.88 E-value=0.95 Score=20.70 Aligned_cols=305 Identities=9% Similarity=-0.023 Sum_probs=111.2 Q ss_pred CCCCccCccc----cccCcccC-ccccHH--HHHHHHHhHHHHHHHHHHHhhhcccccccccCCceEEEeccccceeeee Q lcl|Aclame:pro 1 MANATGGQQI----GANQGKGQ-SAADKL--ALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVMGRTKGYYL 73 (347) Q Consensus 1 m~~~~~~~~~----~~~~~~~~-~~~d~~--al~ie~f~geV~~~f~~~s~~~~~~~~rti~~G~tv~i~~iG~~t~~~~ 73 (347) |++.....++ .++.-... ..+|.. -|=.+++.-.|...+.... ++...++-+. .-++..|+++|-....-+ T Consensus 1 ~~~~~~~~~~~n~~~~~i~k~~it~~~l~~g~L~p~~a~~Fl~~v~~~t~-iL~~~r~~~~-~s~~~ei~kig~G~r~~r 78 (360) T protein:vir:99 1 MSSNSTIDSVRNQNMNSLSQKDIGLAELDGFQLPVDVTEEFLERMQKGVQ-ILGMADTMTL-ARLEMEVPQFGVPRLSGH 78 (360) T ss_pred CcchhHHHHHhhhHHHHHHhhhccccccCceeecHHHHHHHHHHHhhccc-hhhhcceeec-ccccccccccccceeecc Confidence 5552211110 01111111 001111 1222455555554444444 4455554333 346777888876543222 Q ss_pred cCCCCCCC-CCCCCCCCceEE-EEeeeeecchhhccHHHHHhC-------cchHHHHHHHHHHHHHHHH---H------- Q lcl|Aclame:pro 74 APGENLDD-KRKDIKHSEKVI-QIDGLLTSDVLIYDIEDAMNH-------YDVRAEYSAQLGEALAIAA---D------- 134 (347) Q Consensus 74 ~~g~~~~~-~~~~~~~~~~~l-~ID~~~~~~~~Vdd~D~~q~~-------~D~r~~~~~~~g~aLa~~~---D------- 134 (347) ...+.... ..-.++...+.+ ..+...++...+.+..+...+ -.+++.++++.|+-|.... | T Consensus 79 ~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~~i~~~~~~~n~~~~~~~f~~~i~~~~ae~~~~Dle~l~~~g~~ds~d~~ 158 (360) T protein:vir:99 79 TRDEEGSRTENSEAESGSVKFNATDKSYYILVEPKRDALKNTHYGPDQFGDYIVDQFIERYGNDLGLMGIRAGASSGNLQ 158 (360) T ss_pred ccccCCCCCcCCcCccccCccccccceeeEeechHHHHHhhhhcccchhHHHHHHHHHHHHHHHHHHHHhhccchhcccc Confidence 11111110 001122222222 345444444433222222112 1255666666665443322 1 Q ss_pred ---------HHHHHHHHHhhhccccc-----ccccCcccCceeeeecccc-----cccchhhHHHHHHHHHHHHHHHHhh Q lcl|Aclame:pro 135 ---------GAVLAEMAKLCNLPAAS-----NENIAGLGQAVVLNIGAAA-----DLVDVEARGKAILKGLTLARARLTK 195 (347) Q Consensus 135 ---------~~il~~l~~~a~~a~~~-----~~~~~g~~~~~~i~~~~~~-----~~~~~~~~~~~i~~~l~~a~~~Lde 195 (347) ...-+.|.++..-.... ...++-......-+..... ...++..... +.|.++.+.|.. T Consensus 159 ~~~~~d~fl~~~dGwlKka~~~~~~id~a~d~t~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~---~lf~~~~~~Lp~ 235 (360) T protein:vir:99 159 SIGGAAELDNTFKGWIARAEGDAQSVDDAGDSTRIGLEDTATADADSMPSIANTDGSGNPQPVDT---SLFNETIQTLDS 235 (360) T ss_pred cCcccchhhhhhHHHHHHhhcccchhhccccccccccccccccccccchhhhccccccccccchH---HHHHHHHHhcch Confidence 11111111110000000 0000000000000000000 0000001111 223455566666 Q ss_pred ccC--CCCCCEEEEChHHHHHHhcchhhhhhhc-cccccccccceEEEeceeEEEeccccccccccccccCccccccccc Q lcl|Aclame:pro 196 NYV--PAGDRRFYCAPEDYSAILSALMPNAANY-AALIDPETGNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKH 272 (347) Q Consensus 196 ~~V--P~~gR~~vv~P~~~~~Ll~~~~~~~~~~-~~~~~~~~G~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~ 272 (347) +.- |...-+.+++|..+..... .+.+++- .|+.-+..+..-...|++|+..+.+|... T Consensus 236 kyr~~~~~~~~~~~s~~~~~~yr~--~L~~R~t~LGd~~l~g~~~~~~~Gipi~~v~~~pd~~----------------- 296 (360) T protein:vir:99 236 RYRESDAYSPVLMTSPNQVQSYTM--SLTEREDPLGSAVIFGDSDITPFSYDLVGVNGFPDEY----------------- 296 (360) T ss_pred hhhcCcccceEEEccCchHHHHHH--HHhccCcccchhheecccccccceeeeEEcCCCCCCc----------------- Confidence 642 1112155777765444433 2222222 22233444444467899999999998432 Q ss_pred cccccccccccccccceeEEeechhhhhhhhhhheeeccccchhhHhh---HHhhh-hhhcCcc-cccceEEEEE--ecC Q lcl|Aclame:pro 273 IFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARRPEFQAD---QIIGK-YAMGHGG-LRPEAAGALV--FTP 345 (347) Q Consensus 273 ~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~~~~~~~d---~i~~~-~~~G~~~-lRPe~~~~l~--~~~ 345 (347) .++-+|+=+..+-..+++++..-++.+..+ .++-. .+.---+ -.+|+++.+. ..| T Consensus 297 ------------------~mlT~p~NLi~g~~~~iri~~~~e~~~~~~~~~~~~~~~~~~~D~~iee~~Av~~vt~~~~~ 358 (360) T protein:vir:99 297 ------------------MMFTDPNNLAFGLYEEMELDQSTDTDKVHEQRLHSRNWLEGQFDFQIKEQQAGVLVTDLETP 358 (360) T ss_pred ------------------eEEeccCceeEEeeeeeEEeecccchhhhhhceeeeEEEEEEeeEEEEecccEEEEecCCCC Confidence 133344444444555555544434444332 11111 1111122 3455555554 455 Q ss_pred CC Q lcl|Aclame:pro 346 AA 347 (347) Q Consensus 346 aa 347 (347) .| T Consensus 359 ~~ 360 (360) T protein:vir:99 359 TA 360 (360) T ss_pred CC Confidence 56 No 212 >protein:vir:96079 Length: 382 # NCBI annotation: hypothetical protein ORF023 # Family: family:all:1653 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294440;genbank:gi:149408337;genbank:GeneID:5237198 Probab=33.44 E-value=1.4 Score=19.86 Aligned_cols=290 Identities=10% Similarity=-0.013 Sum_probs=125.7 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhccccccccc--CCceEEEec---cccceeeeecC Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQ--NGKSASFPV---MGRTKGYYLAP 75 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~~rti~--~G~tv~i~~---iG~~t~~~~~~ 75 (347) | |++.... .|.+ +.-..+-|++-|.-.+.+.--.--+...++.+.+.= .-+++.|+. +|+.++ |.. T Consensus 63 m-Da~~~~~-~t~~-----~~g~p~~~l~~~~p~~~~~~~~p~~~~~l~pv~t~g~W~~~t~ty~~~e~~G~A~~--ygd 133 (382) T protein:vir:96 63 M-DSNFTAP-VTTP-----SIPTPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVEPAGTAVE--YGD 133 (382) T ss_pred c-ccccCCc-cccC-----CccHHHHHHhhhhhhhhhhhhhhhhhhhhccccccCCccceEEEEeeeecccceEE--eec Confidence 2 1111000 0222 122355788888854443333333455565554421 125666654 576653 444 Q ss_pred CCCCCCCCCCCCCCceEEEEeeeeecchhhccHHHHH---hCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccc Q lcl|Aclame:pro 76 GENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAM---NHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASN 152 (347) Q Consensus 76 g~~~~~~~~~~~~~~~~l~ID~~~~~~~~Vdd~D~~q---~~~D~r~~~~~~~g~aLa~~~D~~il~~l~~~a~~a~~~~ 152 (347) +++.+...-+.+..++++.+=+ ..+.+.++++.+ +.+|+-++-...+..+|.++.|+..+.=. . +.... T Consensus 134 ~~D~Pl~d~~~~~~~r~v~~~~---~g~~yg~lE~~rAa~~~~~l~~~Ka~aA~~ale~~~N~i~f~G~----~-~g~~~ 205 (382) T protein:vir:96 134 HTNIPLTSWNANFERRTIVRGE---LGLLVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGW----Q-SGLGN 205 (382) T ss_pred ccCCCccccccceeEEEEEEEE---EeeeecHHHHHHHHhhCCCcHHHHHHHHHHHHHHhhceEEEEee----e-cCcCc Confidence 5555332233444444443333 334455666655 57888888888888888888776543100 0 00000 Q ss_pred cccCcccCceeeee-cccccccchhhHHHHHHHHHHHHHHHHhhccC----CCC-CCEEEEChHHHHHHhcchhhhhhhc Q lcl|Aclame:pro 153 ENIAGLGQAVVLNI-GAAADLVDVEARGKAILKGLTLARARLTKNYV----PAG-DRRFYCAPEDYSAILSALMPNAANY 226 (347) Q Consensus 153 ~~~~g~~~~~~i~~-~~~~~~~~~~~~~~~i~~~l~~a~~~Lde~~V----P~~-gR~~vv~P~~~~~Ll~~~~~~~~~~ 226 (347) .+-|.-....+.. .++....-..+..+.|++.|..+...|....- |.. ...++|+|..|..|-... +| T Consensus 206 -~~yGllNdP~l~a~~t~a~~~Wa~kT~~eI~~Di~~l~~~i~~qt~G~~~~~~~~~~L~LP~~~~~~Ls~~n-----~~ 279 (382) T protein:vir:96 206 -RTYGFLNDPNLPPFQTPPSQGWATADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITMALATSKVDYLSVTT-----PY 279 (382) T ss_pred -ceEEEEeCCCcccccccCCCCcccccHHHHHHHHHHHHHHHHhccCCeeeecccceEEeechHHHhhccccC-----cc Confidence 0011111001111 11111112345677889999988888866652 433 346889999998885421 22 Q ss_pred cc--cccccccceEEEeceeEEEeccccccccccccccCccccccccccccccccccccccccceeEEeechhhhhhhhh Q lcl|Aclame:pro 227 AA--LIDPETGNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKL 304 (347) Q Consensus 227 ~~--~~~~~~G~v~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~ 304 (347) +- ..-+.. +..+++|...+.|-... .. .. -...+++.|.++....+.. T Consensus 280 g~Tvl~~lk~----n~Pnl~i~t~peL~~a~------------~~--------g~------g~~~~~~~~~~e~~~~~~~ 329 (382) T protein:vir:96 280 GISVSDWIEQ----TYPKMRIVSAPELSGVQ------------MQ--------GK------TPEDALVLFVEEVDASVDG 329 (382) T ss_pred CccHHHHHHH----hcCCcEEEEcccccccc------------CC--------Cc------cceeEEEEecchhhhhccc Confidence 10 010111 13355666655552100 00 00 0111223333332211111 Q ss_pred hheeecccc-----------ch--hhHhhHHhhhhh-hcCcccccceEEEEEec Q lcl|Aclame:pro 305 KDMALERAR-----------RP--EFQADQIIGKYA-MGHGGLRPEAAGALVFT 344 (347) Q Consensus 305 ~~~~~e~~~-----------~~--~~~~d~i~~~~~-~G~~~lRPe~~~~l~~~ 344 (347) +. +....| .. +.-++.+....+ .|+-+.||.+++-+.== T Consensus 330 s~-~~p~~f~q~~p~~~~~l~ve~~~~~~~~~~s~~t~Gv~i~~P~ai~~~~GI 382 (382) T protein:vir:96 330 ST-DGGSVFSQLVQSKFITLGVEKRAKSYVEDFSNGTAGALCKRPWAVVRYLGI 382 (382) T ss_pred cc-ccCcceeccccceeeeccceeecceeEeccccceeeeEEEcchhhhhccCC Confidence 10 000000 00 011122222222 35566677776544311 No 213 >protein:vir:104549 Length: 462 # NCBI annotation: gp23 # Family: family:all:364 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214669;genbank:gi:61806310;genbank:GeneID:3294604 Probab=27.79 E-value=1.8 Score=19.17 Aligned_cols=298 Identities=15% Similarity=0.081 Sum_probs=129.5 Q ss_pred CCCCccCccccccCcccCccccHHHHHHHHHhHHHHHHHHHHHhhhcccc--------cccccCCceEEEe--------- Q lcl|Aclame:pro 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHM--------VRTIQNGKSASFP--------- 63 (347) Q Consensus 1 m~~~~~~~~~~~~~~~~~~~~d~~al~ie~f~geV~~~f~~~s~~~~~~~--------~rti~~G~tv~i~--------- 63 (347) |+.+++.=.. -|.-+++......+---|-|-.|..+.|....-...... .-...+......+ T Consensus 97 MTgPTGLIFA-mRsrY~~~~~~~nq~gtEAlfnEadt~fSg~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~g~~~~ 175 (462) T protein:vir:10 97 MTGPTGLIFA-MRSFYGSERRPANSDFREALFNEPNAGFSGGAGTGLSNYDPTASSSAVNDAEGANPGLLNDSPAGTYEV 175 (462) T ss_pred CCcchhhhhe-eeeeccCCccccccccchhhhccCCcCccccccccccccccccccccccccccccceeecCCCccceec Confidence 6665532110 111111100000000113333555555532210000000 0000000000000 Q ss_pred -c--cccceeeeecCCCCCCCC-CCCCCCCceEEEEeeeee--------cchhhccHHHHHh-C-cchHHHHHHHHHHHH Q lcl|Aclame:pro 64 -V--MGRTKGYYLAPGENLDDK-RKDIKHSEKVIQIDGLLT--------SDVLIYDIEDAMN-H-YDVRAEYSAQLGEAL 129 (347) Q Consensus 64 -~--iG~~t~~~~~~g~~~~~~-~~~~~~~~~~l~ID~~~~--------~~~~Vdd~D~~q~-~-~D~r~~~~~~~g~aL 129 (347) . .|..++ .++ ..++ ..+....|..+.||+... +...+.-..+.++ | .|.-.|++.=++.++ T Consensus 176 ~~~~~GM~Ta----~aE-~lg~~s~n~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAIHGLDAEtELaNILSTEI 250 (462) T protein:vir:10 176 TGDATGMATA----TAE-ALDDSSASTAFREMGFSIEKVTVTAKSRALKAEYSIEMAQDLKAIHGLDAESELANILSTEI 250 (462) T ss_pred ccccccccch----hcc-ccCCccCCcchhhceeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChhHHHHHHHHHHH Confidence 0 011111 011 1111 011244677888888643 3445665666666 4 889999999999999 Q ss_pred HHHHHHHHHHHHHHhhhcccccccccCcccCceeeeecccccccchhhHHHHHHHHHHHHHHHHh-------hccCCCCC Q lcl|Aclame:pro 130 AIAADGAVLAEMAKLCNLPAASNENIAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLT-------KNYVPAGD 202 (347) Q Consensus 130 a~~~D~~il~~l~~~a~~a~~~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~i~~~l~~a~~~Ld-------e~~VP~~g 202 (347) ..++.+-|++.|...+.. ..+.+.....++..... ..+.-.++....+.-+++ .+----.+ T Consensus 251 mlEINReii~~l~~~a~~-----~k~~~~~~~Gv~dl~~~-------~~gr~~~e~~k~l~~qi~~ean~i~~~t~r~~~ 318 (462) T protein:vir:10 251 LAEINREVVRTIYVNAVK-----GAIANTATDGIFDLDVD-------SNGRWSVEKFKGLLFQIERDSNAIGQETRRGKG 318 (462) T ss_pred HHHhhHHHHhhhhhhhee-----eecccccccceeeeccc-------cchHHHHHHHHHHHHHHHHHHHHHHHHhccccc Confidence 999999999988765422 11222222222222111 122222333333322222 11112346 Q ss_pred CEEEEChHHHHHHhcchhh--h---h-hhcc-ccccccccceEEEe-ceeEEEe----ccccccccccccccCccccccc Q lcl|Aclame:pro 203 RRFYCAPEDYSAILSALMP--N---A-ANYA-ALIDPETGNIRNVM-GFEVIEV----PHLTVGGAGDNNPADGVAPTNQ 270 (347) Q Consensus 203 R~~vv~P~~~~~Ll~~~~~--~---~-~~~~-~~~~~~~G~v~~i~-G~~V~~s----n~lp~~~~~~~~~~~~~~~t~~ 270 (347) -|+|.+|+..+.|-...-+ . + .... ..++.-...+|.+. |++||.- ||-|.--. .. T Consensus 319 n~~i~S~~Va~~La~sG~l~~~p~~~~~~~~~~~d~~~~~~~G~l~~r~~vy~D~Y~~~ns~~dy~----------~v-- 386 (462) T protein:vir:10 319 NILICSADVASALGMAGVLDYAPGLQGNSALTGVDDTSSTLVGTLNGRIKVYVDPYSSNVADKHFY----------VA-- 386 (462) T ss_pred eEEEEchhHHHHhhhccchhccccccccccccccccccceeEEEecCceEEEEecccCCCcccceE----------EE-- Confidence 7999999999999544321 1 1 1111 11233344566664 7888875 34332110 00 Q ss_pred cccccccccccccccccceeEEeechhhhhhhhhhheeeccccchhhHhhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 271 KHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARRPEFQADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 271 ~~~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~~~~~~~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) .|+++-.-..+|||.|=. ++.++ ..-||+.|.-.|-.+.+||-.+ .|= ...+..+.++ T Consensus 387 ----------G~KG~~~~~~glfy~PYv----~l~~~---~~~dp~sfqP~~g~~tRY~l~~-NP~-t~~~~~~~~~ 444 (462) T protein:vir:10 387 ----------GYKGTSPYDAGLFYCPYV----PLQQV---RAINPNTFQPKIGFKTRYGMVS-NPF-SGGLTQGSGA 444 (462) T ss_pred ----------EEeCCcccccceeecccc----ccccc---cccCCccccceeeeeeeeeeee-cCC-CCCcCCcccc Confidence 112222223468888863 33332 2348888877777777776433 221 2222222222 No 214 >protein:vir:100603 Length: 529 # NCBI annotation: gp23 precursor of major head subunit # Family: family:all:364 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656387;genbank:gi:109290138;genbank:GeneID:4156581 Probab=21.10 E-value=2.7 Score=18.26 Aligned_cols=309 Identities=14% Similarity=0.071 Sum_probs=133.7 Q ss_pred CCCCccCcc---------ccccCcccC-ccccH--HHHHHHHHhHHHHHHHHHHHhhhccc----c---------ccccc Q lcl|Aclame:pro 1 MANATGGQQ---------IGANQGKGQ-SAADK--LALFLKVFGGEVLTAFVRRSVTMDKH----M---------VRTIQ 55 (347) Q Consensus 1 m~~~~~~~~---------~~~~~~~~~-~~~d~--~al~ie~f~geV~~~f~~~s~~~~~~----~---------~rti~ 55 (347) |-.++.... ...+.++.. ..++. ..-.-+.|-.|.-+.|-......... . ...+. T Consensus 147 ~~e~dt~~SG~~~~~~~~~~~~~~~~~~t~~~a~~~~~~~~~~~nea~t~~s~~~tg~~~~~g~~~tg~~~~~~~~~~~a 226 (529) T protein:vir:10 147 MYAPDAWHSGLAAKGATTSSDGTPFAALTAGQAVATGDIVYHFFYESGSAYLQNVTGGNVTVGTNETGAALDALVSAKIA 226 (529) T ss_pred cccccccccccccccccccccccccccccccceeeccccceeeecccccccccccccccccccccccCCccccccccccc Confidence 111111000 000111100 00000 00011333334444443221111100 0 00111 Q ss_pred CCceEEEeccccceeeeecCCCCC---CCCCCCCCCCceEEEEeeeee--------cchhhccHHHHHh-C-cchHHHHH Q lcl|Aclame:pro 56 NGKSASFPVMGRTKGYYLAPGENL---DDKRKDIKHSEKVIQIDGLLT--------SDVLIYDIEDAMN-H-YDVRAEYS 122 (347) Q Consensus 56 ~G~tv~i~~iG~~t~~~~~~g~~~---~~~~~~~~~~~~~l~ID~~~~--------~~~~Vdd~D~~q~-~-~D~r~~~~ 122 (347) .|. +....-|..+.. ++.+ .++ ..-...|..+.||+... +...+.-..+.++ | .|.-.|++ T Consensus 227 ~~~-~~~~~~gmsTa~----aEal~~~g~s-s~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAvHGLDAEtELs 300 (529) T protein:vir:10 227 AGE-LAEIAEGMATSI----AELRQGFNGT-TDNPWNEMSFRIDKQTVEAKSRQLKAQYSIELAQDLRAVHGMDADSELN 300 (529) T ss_pred ccc-ccccccccchhh----hhccccCCCC-ccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHH Confidence 111 111112222221 1111 111 11235677888887643 3445666666666 4 89999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhhcccccccccCcccCceeeeecccccccchhhHH---HHHHHHHHHHHHHHhhccCC Q lcl|Aclame:pro 123 AQLGEALAIAADGAVLAEMAKLCNLPAASNENIAGLGQAVVLNIGAAADLVDVEARG---KAILKGLTLARARLTKNYVP 199 (347) Q Consensus 123 ~~~g~aLa~~~D~~il~~l~~~a~~a~~~~~~~~g~~~~~~i~~~~~~~~~~~~~~~---~~i~~~l~~a~~~Lde~~VP 199 (347) .=++.++..++.+-|++.+...++.....-....+...| ++......+....-..+ +.++-.+.+.....-.+--- T Consensus 301 NILStEImlEINReii~~i~~~a~~~~~g~~~~~~~~~g-v~d~~~~~d~~~~~~~~e~~~~L~~~i~~~an~I~~~T~r 379 (529) T protein:vir:10 301 GILANEVMLEINREVIDWINYTAQVGKSGWTQTVGSAAG-VFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGR 379 (529) T ss_pred HHHHHHHHHHhhHHHHHHhhhhceeeeeeeecccccccc-ceeccccccccccchhHHHHHHHHHHHHHHHHHHHHhhcc Confidence 999999999999999987765554322110000000011 11111111111111122 22222222222222221111 Q ss_pred CCCCEEEEChHHHHHHhcchhhhhhhc----cc-ccccccc-ceEEE-eceeEEEeccccccccccccccCccccccccc Q lcl|Aclame:pro 200 AGDRRFYCAPEDYSAILSALMPNAANY----AA-LIDPETG-NIRNV-MGFEVIEVPHLTVGGAGDNNPADGVAPTNQKH 272 (347) Q Consensus 200 ~~gR~~vv~P~~~~~Ll~~~~~~~~~~----~~-~~~~~~G-~v~~i-~G~~V~~sn~lp~~~~~~~~~~~~~~~t~~~~ 272 (347) -.+-|+|.+|+....|-....+.-... .+ ..+...+ ..|.+ .|++||.-++.|..-.. . T Consensus 380 g~~n~vi~S~~Va~~L~~~~~~~~~~~~~~~sg~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~----------v---- 445 (529) T protein:vir:10 380 GAGNFIIASRNVVSALALVDAGITPAAQGMASGLNADTTKGVFAGVLGGRYKVYIDQYARQDYFT----------M---- 445 (529) T ss_pred ccceEEEEchHHHHHHhhhccccccccccccccceeecCCceEEEEecCceEEEecCCCCcceEE----------E---- Confidence 235688899999888854322211111 01 0111111 24554 48899988876642111 0 Q ss_pred cccccccccccccccceeEEeechhhhhhhhhhheeeccccchhhHhhHHhhhhhhcCcccccceEEEEEecCCC Q lcl|Aclame:pro 273 IFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARRPEFQADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) Q Consensus 273 ~~~a~~~~~y~~d~~~~~~l~~h~~A~~tv~~~~~~~e~~~~~~~~~d~i~~~~~~G~~~lRPe~~~~l~~~~aa 347 (347) .|+++..-..+|||.|=- + +++-..+||+.|.-.|-.+.+||-.+ .| .+..+..++.+ T Consensus 446 --------G~KG~~~~~~glfy~PYv----~---l~~~~~~dp~sfqP~~g~~tRY~l~~-NP-~~~~~~~~~~~ 503 (529) T protein:vir:10 446 --------GYRGANNLDAGIYYCPYV----A---LTPLRGSDPKNFQPVMGFKTRYAIGV-NP-FAESRTQAPTS 503 (529) T ss_pred --------EEeCCcccccceeecccc----c---cccccccCCCcccceeeeeeeeceee-cC-ccccccccccc Confidence 122222333478888862 2 33445689999988888888887543 55 55555555544 Done!